Query lcl|NC_012784.1_cdsid_YP_002939713.1 [gene=CUR041] [protein=putative capsid protein] [protein_id=YP_002939713.1] [location=23962..25209] Match_columns 415 No_of_seqs 133 out of 970 Neff 10.0 Searched_HMMs 1612 Date Thu Nov 7 13:19:54 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_45 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_45_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:4600 Length: 415 # 100.0 3.7E-86 2.3E-89 488.9 44.9 415 1-415 1-415 (415) 2 protein:vir:4700 Length: 415 # 100.0 3.7E-86 2.3E-89 488.9 44.9 415 1-415 1-415 (415) 3 protein:vir:98339 Length: 415 100.0 7.5E-86 4.7E-89 487.3 45.2 415 1-415 1-415 (415) 4 protein:vir:81100 Length: 415 100.0 7.5E-86 4.7E-89 487.3 45.2 415 1-415 1-415 (415) 5 protein:vir:79987 Length: 415 100.0 7.5E-86 4.7E-89 487.3 45.2 415 1-415 1-415 (415) 6 protein:vir:9410 Length: 415 # 100.0 2.6E-85 1.6E-88 484.3 44.9 415 1-415 1-415 (415) 7 protein:vir:4953 Length: 397 # 100.0 1.4E-73 8.7E-77 420.0 42.0 386 1-415 1-396 (397) 8 protein:vir:485 Length: 407 # 100.0 2.8E-72 1.7E-75 412.8 41.0 394 1-411 1-407 (407) 9 protein:vir:1025 Length: 408 # 100.0 9.6E-72 6E-75 409.9 41.8 390 1-415 4-404 (408) 10 protein:vir:9704 Length: 394 # 100.0 2.2E-71 1.4E-74 408.0 41.2 386 1-408 3-394 (394) 11 protein:vir:4830 Length: 397 # 100.0 8.1E-71 5E-74 404.8 42.0 386 1-415 1-396 (397) 12 protein:vir:4997 Length: 397 # 100.0 6.5E-71 4.1E-74 405.4 41.4 386 1-415 1-396 (397) 13 protein:vir:7409 Length: 408 # 100.0 1.2E-70 7.2E-74 404.0 42.1 390 1-415 4-404 (408) 14 protein:vir:3870 Length: 400 # 100.0 1.1E-69 6.7E-73 398.7 42.2 386 1-405 1-400 (400) 15 protein:vir:4456 Length: 401 # 100.0 6E-70 3.7E-73 400.1 40.5 387 1-404 1-401 (401) 16 protein:vir:81160 Length: 371 100.0 3E-69 1.9E-72 396.2 39.4 358 1-404 1-371 (371) 17 protein:vir:3991 Length: 404 # 100.0 9.6E-69 6E-72 393.5 41.9 390 1-415 4-404 (404) 18 protein:vir:100247 Length: 425 100.0 5.3E-69 3.3E-72 394.9 40.1 381 1-405 21-425 (425) 19 protein:vir:1268 Length: 397 # 100.0 5.1E-69 3.1E-72 395.0 40.0 382 1-404 1-397 (397) 20 protein:vir:4511 Length: 409 # 100.0 4.6E-68 2.8E-71 389.8 41.6 397 1-407 1-409 (409) 21 protein:vir:100172 Length: 394 100.0 5.2E-68 3.2E-71 389.4 40.8 385 1-415 1-390 (394) 22 protein:vir:102119 Length: 404 100.0 9.2E-68 5.7E-71 388.1 40.8 395 1-408 1-404 (404) 23 protein:vir:3845 Length: 395 # 100.0 2E-67 1.3E-70 386.2 42.2 385 1-415 1-394 (395) 24 protein:vir:1084 Length: 437 # 100.0 1.1E-67 6.6E-71 387.8 39.1 395 1-412 1-437 (437) 25 protein:vir:102082 Length: 392 100.0 1.3E-67 8.3E-71 387.2 39.4 378 1-409 1-392 (392) 26 protein:vir:105004 Length: 392 100.0 1.3E-67 8.3E-71 387.2 39.4 378 1-409 1-392 (392) 27 protein:vir:107593 Length: 392 100.0 1.3E-67 8.3E-71 387.2 39.4 378 1-409 1-392 (392) 28 protein:vir:102873 Length: 392 100.0 1.3E-67 8.3E-71 387.2 39.4 378 1-409 1-392 (392) 29 protein:vir:100884 Length: 389 100.0 2.9E-67 1.8E-70 385.4 40.7 382 1-411 1-389 (389) 30 protein:vir:962 Length: 397 # 100.0 4E-67 2.5E-70 384.6 37.6 384 1-404 1-397 (397) 31 protein:vir:1383 Length: 421 # 100.0 1.3E-66 8.1E-70 381.8 38.3 386 1-415 1-394 (421) 32 protein:vir:6242 Length: 390 # 100.0 1.4E-65 8.5E-69 376.2 39.5 382 1-405 3-390 (390) 33 protein:vir:1328 Length: 392 # 100.0 2.1E-65 1.3E-68 375.2 39.8 383 1-405 1-392 (392) 34 protein:vir:100135 Length: 418 100.0 4E-65 2.5E-68 373.6 40.5 392 1-407 21-418 (418) 35 protein:vir:81070 Length: 390 100.0 1.9E-64 1.2E-67 370.0 40.0 381 1-402 1-390 (390) 36 protein:vir:95376 Length: 425 100.0 3.1E-64 1.9E-67 368.8 39.6 396 1-408 1-425 (425) 37 protein:vir:10364 Length: 390 100.0 4.1E-64 2.5E-67 368.1 40.0 382 1-402 1-390 (390) 38 protein:vir:105038 Length: 428 100.0 4.3E-64 2.6E-67 368.0 39.2 395 1-404 1-428 (428) 39 protein:vir:6212 Length: 434 # 100.0 8.2E-64 5.1E-67 366.4 40.0 400 1-409 1-434 (434) 40 protein:vir:1886 Length: 385 # 100.0 1.3E-63 8.2E-67 365.3 40.3 379 1-405 1-385 (385) 41 protein:vir:191 Length: 385 # 100.0 1.3E-63 8.2E-67 365.3 40.3 379 1-405 1-385 (385) 42 protein:vir:97053 Length: 390 100.0 1.3E-63 7.9E-67 365.4 39.7 382 1-402 1-390 (390) 43 protein:vir:4339 Length: 395 # 100.0 9.2E-63 5.7E-66 360.7 40.0 380 1-404 1-395 (395) 44 protein:vir:81227 Length: 413 100.0 1.8E-62 1.1E-65 359.1 40.6 395 1-407 2-413 (413) 45 protein:vir:104256 Length: 458 100.0 2.4E-62 1.5E-65 358.4 40.2 400 1-404 5-458 (458) 46 protein:vir:7855 Length: 497 # 100.0 8E-62 5E-65 355.5 39.3 398 1-408 1-497 (497) 47 protein:vir:101650 Length: 497 100.0 8E-62 5E-65 355.5 39.3 398 1-408 1-497 (497) 48 protein:vir:80376 Length: 435 100.0 1.5E-61 9.3E-65 354.0 39.1 397 1-410 1-435 (435) 49 protein:vir:2685 Length: 387 # 100.0 5.4E-62 3.4E-65 356.5 36.5 383 1-410 1-387 (387) 50 protein:vir:94424 Length: 387 100.0 5.4E-62 3.4E-65 356.5 36.5 383 1-410 1-387 (387) 51 protein:vir:96978 Length: 387 100.0 5.4E-62 3.4E-65 356.5 36.5 383 1-410 1-387 (387) 52 protein:vir:8102 Length: 543 # 100.0 1.6E-61 1E-64 353.9 39.0 383 1-405 142-543 (543) 53 protein:vir:94673 Length: 419 100.0 4.3E-61 2.7E-64 351.5 41.2 397 1-406 4-419 (419) 54 protein:vir:9361 Length: 402 # 100.0 1.3E-61 7.9E-65 354.4 36.1 383 1-410 16-402 (402) 55 protein:vir:93881 Length: 387 100.0 2.7E-61 1.7E-64 352.6 37.3 382 1-410 1-387 (387) 56 protein:vir:1433 Length: 435 # 100.0 4.7E-61 2.9E-64 351.3 38.1 397 1-410 1-435 (435) 57 protein:vir:101607 Length: 379 100.0 3E-60 1.9E-63 346.9 39.6 368 1-404 1-379 (379) 58 protein:vir:4856 Length: 293 # 100.0 4.6E-62 2.9E-65 356.8 29.0 284 117-415 1-292 (293) 59 protein:vir:8420 Length: 477 # 100.0 1.9E-58 1.2E-61 337.0 36.8 407 1-410 1-477 (477) 60 protein:vir:93616 Length: 645 100.0 2.7E-58 1.7E-61 336.2 36.1 398 1-409 193-645 (645) 61 protein:vir:4092 Length: 390 # 100.0 1.4E-55 8.5E-59 321.4 37.0 362 1-415 1-381 (390) 62 protein:vir:78640 Length: 352 100.0 3.3E-56 2E-59 324.8 33.3 350 1-410 1-352 (352) 63 protein:vir:80128 Length: 466 100.0 5E-55 3.1E-58 318.3 37.7 393 1-415 16-455 (466) 64 protein:vir:41 Length: 299 # N 100.0 2.3E-56 1.4E-59 325.6 28.5 282 116-405 1-299 (299) 65 protein:vir:98635 Length: 377 100.0 6.4E-56 4E-59 323.2 29.7 353 1-404 1-377 (377) 66 protein:vir:9574 Length: 300 # 100.0 1.3E-55 8.3E-59 321.4 28.3 280 122-404 1-300 (300) 67 protein:vir:1638 Length: 298 # 100.0 2.4E-55 1.5E-58 320.0 28.7 276 125-403 1-298 (298) 68 protein:vir:7771 Length: 330 # 100.0 3E-55 1.9E-58 319.5 29.0 294 113-410 1-330 (330) 69 protein:vir:105905 Length: 304 100.0 7.6E-55 4.7E-58 317.3 28.1 281 113-403 1-304 (304) 70 protein:vir:94142 Length: 304 100.0 7.6E-55 4.7E-58 317.3 28.1 281 113-403 1-304 (304) 71 protein:vir:9759 Length: 303 # 100.0 1.7E-54 1E-57 315.4 28.4 279 123-404 1-303 (303) 72 protein:vir:94771 Length: 298 100.0 2.4E-54 1.5E-57 314.5 28.0 276 125-403 1-298 (298) 73 protein:vir:95963 Length: 395 100.0 2.3E-52 1.4E-55 303.7 36.3 362 1-415 1-387 (395) 74 protein:vir:8187 Length: 311 # 100.0 1.1E-53 7.1E-57 310.8 28.6 279 123-405 1-311 (311) 75 protein:vir:97148 Length: 324 100.0 3.9E-53 2.4E-56 307.9 30.1 304 75-415 1-322 (324) 76 protein:vir:2430 Length: 318 # 100.0 1.6E-53 1E-56 310.0 27.9 295 95-409 1-318 (318) 77 protein:vir:2344 Length: 397 # 100.0 2.3E-53 1.4E-56 309.2 28.0 295 112-415 1-326 (397) 78 protein:vir:96392 Length: 324 100.0 6.6E-53 4.1E-56 306.7 29.7 304 75-411 1-324 (324) 79 protein:vir:78830 Length: 324 100.0 6.6E-53 4.1E-56 306.7 29.7 304 75-411 1-324 (324) 80 protein:vir:9309 Length: 324 # 100.0 1E-52 6.4E-56 305.6 30.2 302 96-409 1-324 (324) 81 protein:vir:9643 Length: 377 # 100.0 2.8E-52 1.8E-55 303.2 32.3 345 1-404 1-377 (377) 82 protein:vir:95763 Length: 297 100.0 9.8E-53 6.1E-56 305.7 28.0 280 113-405 1-297 (297) 83 protein:vir:80684 Length: 315 100.0 1.3E-52 7.9E-56 305.1 27.8 286 121-410 1-315 (315) 84 protein:vir:5739 Length: 366 # 100.0 1.8E-52 1.1E-55 304.3 28.5 336 58-404 1-366 (366) 85 protein:vir:99749 Length: 324 100.0 3.4E-52 2.1E-55 302.7 30.0 304 75-411 1-324 (324) 86 protein:vir:4226 Length: 326 # 100.0 2.6E-52 1.6E-55 303.4 27.7 298 97-407 1-326 (326) 87 protein:vir:103955 Length: 324 100.0 5.6E-52 3.4E-55 301.6 29.5 304 75-411 1-324 (324) 88 protein:vir:9509 Length: 381 # 100.0 1.7E-51 1E-54 299.0 31.8 349 1-414 3-381 (381) 89 protein:vir:101291 Length: 381 100.0 1.7E-51 1E-54 299.0 31.8 349 1-414 3-381 (381) 90 protein:vir:100632 Length: 381 100.0 1.6E-51 9.9E-55 299.1 31.6 349 1-414 3-381 (381) 91 protein:vir:104085 Length: 320 100.0 2.4E-52 1.5E-55 303.6 27.0 292 100-406 1-320 (320) 92 protein:vir:96223 Length: 324 100.0 6.5E-52 4E-55 301.2 29.3 304 75-411 1-324 (324) 93 protein:vir:96762 Length: 632 100.0 2.9E-51 1.8E-54 297.7 32.0 386 1-403 201-632 (632) 94 protein:vir:99920 Length: 311 100.0 6E-52 3.7E-55 301.4 27.6 279 122-403 1-311 (311) 95 protein:vir:78350 Length: 383 100.0 3.1E-51 1.9E-54 297.5 31.1 361 1-409 1-383 (383) 96 protein:vir:2504 Length: 305 # 100.0 9E-52 5.6E-55 300.4 27.6 280 121-415 1-305 (305) 97 protein:vir:78523 Length: 338 100.0 1.8E-51 1.1E-54 298.8 28.8 299 100-409 1-338 (338) 98 protein:vir:78223 Length: 333 100.0 2.8E-51 1.7E-54 297.7 28.4 295 107-405 1-333 (333) 99 protein:vir:97397 Length: 517 100.0 2E-44 1.2E-47 260.1 27.5 379 1-407 127-517 (517) 100 protein:vir:4197 Length: 314 # 100.0 1.8E-41 1.1E-44 244.0 25.9 296 102-407 1-314 (314) 101 protein:vir:4159 Length: 315 # 100.0 1.1E-41 7.1E-45 245.1 23.4 295 94-401 1-315 (315) 102 protein:vir:4074 Length: 480 # 100.0 8.8E-38 5.4E-41 223.7 21.6 358 1-413 111-480 (480) 103 protein:vir:3158 Length: 321 # 100.0 1.9E-35 1.2E-38 211.0 25.9 301 104-415 1-321 (321) 104 protein:vir:3033 Length: 272 # 100.0 2.4E-31 1.5E-34 188.5 24.8 266 121-407 1-272 (272) 105 protein:vir:9820 Length: 272 # 100.0 2.4E-31 1.5E-34 188.5 24.8 266 121-407 1-272 (272) 106 protein:vir:3613 Length: 272 # 99.9 3.8E-24 2.4E-27 149.0 20.5 267 121-404 1-272 (272) 107 protein:vir:93742 Length: 274 99.9 4.7E-23 2.9E-26 143.0 22.8 266 121-408 1-274 (274) 108 protein:vir:96833 Length: 275 99.8 2.9E-22 1.8E-25 138.7 21.4 267 120-408 1-275 (275) 109 protein:vir:105334 Length: 276 99.8 2.9E-21 1.8E-24 133.2 21.4 268 121-410 1-276 (276) 110 protein:vir:96123 Length: 274 99.8 5.4E-21 3.3E-24 131.7 22.3 266 121-413 1-274 (274) 111 protein:vir:94494 Length: 274 99.8 9.4E-21 5.8E-24 130.4 23.2 266 121-408 1-274 (274) 112 protein:vir:97433 Length: 274 99.8 9.4E-21 5.8E-24 130.4 23.2 266 121-408 1-274 (274) 113 protein:vir:80930 Length: 278 99.8 1.4E-20 8.4E-24 129.5 21.8 271 121-405 1-278 (278) 114 protein:vir:79928 Length: 393 99.8 1.9E-20 1.2E-23 128.8 20.5 360 7-413 1-393 (393) 115 protein:vir:94933 Length: 330 99.8 1E-20 6.2E-24 130.2 18.8 307 92-407 1-330 (330) 116 protein:vir:1239 Length: 274 # 99.8 1.4E-19 9E-23 123.9 22.1 266 121-408 1-274 (274) 117 protein:vir:95898 Length: 274 99.7 6.8E-19 4.2E-22 120.2 22.9 266 121-413 1-274 (274) 118 protein:vir:96262 Length: 274 99.7 6.8E-19 4.2E-22 120.2 22.9 266 121-413 1-274 (274) 119 protein:vir:95107 Length: 270 99.7 1.1E-18 6.8E-22 119.0 21.2 266 123-409 1-270 (270) 120 protein:vir:93858 Length: 400 99.7 8.7E-19 5.4E-22 119.6 19.3 376 1-402 8-400 (400) 121 protein:vir:739 Length: 231 # 99.7 5.1E-18 3.2E-21 115.4 17.0 228 158-404 1-231 (231) 122 protein:vir:97255 Length: 310 99.6 3.7E-16 2.3E-19 105.2 23.1 280 121-403 1-310 (310) 123 protein:vir:8324 Length: 410 # 99.5 6.5E-15 4E-18 98.4 19.1 369 1-402 1-410 (410) 124 protein:vir:108211 Length: 318 99.5 4.5E-15 2.8E-18 99.2 17.5 282 117-405 1-318 (318) 125 protein:vir:7990 Length: 273 # 99.5 1.1E-14 6.7E-18 97.1 19.2 264 127-404 1-273 (273) 126 protein:vir:105822 Length: 273 99.4 3.1E-14 1.9E-17 94.7 20.1 264 127-404 1-273 (273) 127 protein:vir:102605 Length: 273 99.4 3.1E-14 1.9E-17 94.7 20.1 264 127-404 1-273 (273) 128 protein:vir:94622 Length: 341 99.4 6.8E-14 4.2E-17 92.8 15.9 292 113-406 1-341 (341) 129 protein:vir:99424 Length: 360 99.3 1.2E-12 7.6E-16 85.9 22.4 299 87-409 1-360 (360) 130 protein:vir:8885 Length: 347 # 99.3 1.6E-13 9.7E-17 90.8 17.5 294 110-405 1-347 (347) 131 protein:vir:94576 Length: 347 99.3 5.7E-13 3.5E-16 87.7 16.8 293 110-404 1-347 (347) 132 protein:vir:6324 Length: 335 # 99.3 1.6E-12 9.6E-16 85.3 18.7 297 110-415 1-335 (335) 133 protein:vir:2201 Length: 345 # 99.2 1.2E-12 7.6E-16 85.9 17.5 291 110-404 1-345 (345) 134 protein:vir:80213 Length: 334 99.2 9.9E-13 6.2E-16 86.4 16.6 288 116-406 1-334 (334) 135 protein:vir:103323 Length: 364 99.2 1.9E-11 1.2E-14 79.4 22.3 295 116-415 1-351 (364) 136 protein:vir:10450 Length: 344 99.2 2E-12 1.2E-15 84.7 16.1 293 110-404 1-344 (344) 137 protein:vir:78935 Length: 335 99.2 4.8E-12 3E-15 82.6 17.9 297 110-415 1-335 (335) 138 protein:vir:3364 Length: 347 # 99.2 6.1E-12 3.8E-15 82.1 17.4 293 110-406 1-347 (347) 139 protein:vir:94711 Length: 347 99.2 1.7E-12 1.1E-15 85.1 14.3 291 113-405 1-347 (347) 140 protein:vir:80180 Length: 381 99.1 9.5E-12 5.9E-15 81.0 17.6 299 110-415 1-352 (381) 141 protein:vir:100057 Length: 375 99.1 6.8E-11 4.2E-14 76.3 21.4 295 113-411 1-375 (375) 142 protein:vir:5974 Length: 324 # 99.1 4.6E-11 2.9E-14 77.2 20.0 282 121-415 1-302 (324) 143 protein:vir:1541 Length: 347 # 99.1 5.3E-11 3.3E-14 76.9 19.0 293 112-406 1-347 (347) 144 protein:vir:78739 Length: 332 99.1 1.4E-11 8.8E-15 80.1 15.8 294 107-402 1-332 (332) 145 protein:vir:99675 Length: 324 99.0 4E-11 2.5E-14 77.6 16.9 257 154-415 1-316 (324) 146 protein:vir:102944 Length: 330 99.0 2E-10 1.2E-13 73.8 19.9 284 121-415 1-308 (330) 147 protein:vir:95318 Length: 328 99.0 8.9E-11 5.5E-14 75.7 16.4 230 116-351 1-328 (328) 148 protein:vir:1583 Length: 351 # 99.0 3.4E-10 2.1E-13 72.5 19.3 283 121-415 1-306 (351) 149 protein:vir:102655 Length: 322 98.9 4E-10 2.5E-13 72.1 18.8 285 115-405 1-322 (322) 150 protein:vir:3136 Length: 322 # 98.9 1.6E-10 1E-13 74.2 16.3 281 120-408 1-322 (322) 151 protein:vir:9927 Length: 295 # 98.9 1.5E-10 9.2E-14 74.5 14.1 268 119-409 1-295 (295) 152 protein:vir:97031 Length: 402 98.8 4.4E-10 2.7E-13 71.9 15.0 295 116-415 1-346 (402) 153 protein:vir:105645 Length: 400 98.8 6.3E-10 3.9E-13 71.0 15.7 295 116-415 1-344 (400) 154 protein:vir:7019 Length: 401 # 98.7 6.1E-10 3.8E-13 71.1 13.5 293 116-415 1-344 (401) 155 protein:vir:9875 Length: 296 # 98.7 1.2E-09 7.3E-13 69.5 13.2 269 112-405 1-296 (296) 156 protein:vir:106647 Length: 303 98.7 2.2E-09 1.4E-12 68.1 14.2 268 117-411 1-303 (303) 157 protein:vir:103285 Length: 296 98.7 6.8E-09 4.2E-12 65.4 16.6 278 121-404 1-296 (296) 158 protein:vir:103759 Length: 330 98.7 3.7E-09 2.3E-12 66.8 15.1 230 116-351 1-330 (330) 159 protein:vir:107826 Length: 331 98.5 1.8E-08 1.1E-11 63.1 16.0 230 116-351 1-331 (331) 160 protein:vir:107388 Length: 331 98.5 1.8E-08 1.1E-11 63.1 16.0 230 116-351 1-331 (331) 161 protein:vir:98525 Length: 331 98.5 1.8E-08 1.1E-11 63.1 16.0 230 116-351 1-331 (331) 162 protein:vir:7324 Length: 335 # 98.5 1.7E-08 1.1E-11 63.1 14.1 231 116-352 1-335 (335) 163 protein:vir:107687 Length: 319 98.4 1.4E-07 8.7E-11 58.1 18.5 298 96-402 1-319 (319) 164 protein:vir:95131 Length: 325 98.4 2.4E-07 1.5E-10 56.9 19.2 286 124-415 1-305 (325) 165 protein:vir:80068 Length: 301 98.3 3.3E-07 2E-10 56.1 18.1 274 123-402 1-301 (301) 166 protein:vir:104342 Length: 314 98.3 3E-07 1.8E-10 56.4 17.0 296 97-404 1-314 (314) 167 protein:vir:8843 Length: 317 # 98.2 1.1E-06 6.7E-10 53.3 17.5 280 117-406 1-317 (317) 168 protein:vir:93966 Length: 400 98.2 9.1E-08 5.7E-11 59.2 11.6 371 1-402 8-400 (400) 169 protein:vir:108303 Length: 418 98.1 3.3E-06 2E-09 50.6 19.7 271 124-415 1-323 (418) 170 protein:vir:96792 Length: 315 98.1 3.6E-06 2.2E-09 50.4 18.8 275 120-415 1-292 (315) 171 protein:vir:79642 Length: 329 98.1 1.5E-06 9.5E-10 52.5 16.5 303 96-407 1-329 (329) 172 protein:vir:1663 Length: 393 # 98.0 1.6E-07 9.9E-11 57.9 10.2 371 1-402 1-393 (393) 173 protein:vir:99075 Length: 392 98.0 2.8E-06 1.7E-09 51.1 16.7 272 127-415 1-320 (392) 174 protein:vir:80128 Length: 466 97.8 1.8E-05 1.1E-08 46.6 17.9 396 1-415 1-444 (466) 175 protein:vir:80446 Length: 367 97.8 2.2E-05 1.4E-08 46.1 19.3 279 119-415 1-347 (367) 176 protein:vir:95875 Length: 401 97.6 1.8E-05 1.1E-08 46.6 15.4 293 113-405 1-401 (401) 177 protein:vir:1781 Length: 221 # 97.5 1.7E-05 1.1E-08 46.7 14.0 194 205-415 1-212 (221) 178 protein:vir:94800 Length: 319 97.4 6.6E-05 4.1E-08 43.5 19.9 292 102-415 1-307 (319) 179 protein:vir:97331 Length: 319 97.4 6.6E-05 4.1E-08 43.5 19.9 292 102-415 1-307 (319) 180 protein:vir:79548 Length: 652 97.4 7.8E-05 4.8E-08 43.1 25.4 382 1-401 185-652 (652) 181 protein:vir:5255 Length: 304 # 97.1 6.5E-05 4E-08 43.5 13.6 270 126-401 1-304 (304) 182 protein:vir:95512 Length: 693 97.0 0.0002 1.3E-07 40.8 23.9 386 1-409 258-693 (693) 183 protein:vir:3525 Length: 423 # 97.0 0.00022 1.3E-07 40.7 19.1 273 120-415 1-378 (423) 184 protein:vir:94989 Length: 349 96.9 0.00028 1.7E-07 40.1 23.5 280 123-415 1-327 (349) 185 protein:vir:78387 Length: 349 96.9 0.0003 1.8E-07 39.9 22.0 281 123-415 1-327 (349) 186 protein:vir:107120 Length: 329 96.8 0.00035 2.1E-07 39.6 19.2 306 75-415 1-318 (329) 187 protein:vir:105374 Length: 423 96.5 0.00054 3.4E-07 38.5 20.3 274 127-415 1-343 (423) 188 protein:vir:2016 Length: 357 # 96.2 0.00085 5.3E-07 37.4 17.5 303 101-413 1-357 (357) 189 protein:vir:105522 Length: 423 96.2 0.00087 5.4E-07 37.4 21.0 271 127-415 1-333 (423) 190 protein:vir:174 Length: 423 # 96.2 0.00087 5.4E-07 37.4 20.3 274 120-415 1-333 (423) 191 protein:vir:6061 Length: 357 # 96.0 0.0011 7.1E-07 36.7 17.7 303 101-413 1-357 (357) 192 protein:vir:98566 Length: 355 95.9 0.0013 8.1E-07 36.4 19.0 298 101-409 1-355 (355) 193 protein:vir:80986 Length: 528 95.9 0.0014 8.4E-07 36.3 16.5 358 1-415 1-514 (528) 194 protein:vir:1829 Length: 355 # 95.8 0.0014 8.8E-07 36.2 19.1 300 101-411 1-355 (355) 195 protein:vir:5694 Length: 357 # 95.7 0.0016 1E-06 35.9 17.3 303 101-413 1-357 (357) 196 protein:vir:79171 Length: 337 95.6 0.0017 1.1E-06 35.8 19.4 294 101-404 1-337 (337) 197 protein:vir:104011 Length: 337 95.5 0.0019 1.2E-06 35.5 19.2 294 101-404 1-337 (337) 198 protein:vir:78777 Length: 358 95.3 0.0023 1.4E-06 35.1 18.9 308 97-415 1-355 (358) 199 protein:vir:79157 Length: 339 95.1 0.0028 1.7E-06 34.6 18.0 295 101-405 1-339 (339) 200 protein:vir:95451 Length: 313 94.8 0.0034 2.1E-06 34.1 15.6 281 122-406 1-313 (313) 201 protein:vir:79008 Length: 299 94.8 0.0034 2.1E-06 34.1 20.9 269 127-406 1-299 (299) 202 protein:vir:78186 Length: 337 94.7 0.0038 2.3E-06 33.9 17.7 294 101-404 1-337 (337) 203 protein:vir:1153 Length: 338 # 94.5 0.0043 2.6E-06 33.6 18.6 296 101-406 1-338 (338) 204 protein:vir:3643 Length: 336 # 94.5 0.0044 2.7E-06 33.5 13.7 303 83-402 1-336 (336) 205 protein:vir:270 Length: 341 # 94.2 0.0052 3.2E-06 33.1 16.4 299 97-415 1-338 (341) 206 protein:vir:103463 Length: 521 94.0 0.0056 3.5E-06 32.9 15.9 357 1-415 3-505 (521) 207 protein:vir:861 Length: 318 # 93.9 0.0031 1.9E-06 34.3 10.2 301 79-402 1-318 (318) 208 protein:vir:94070 Length: 339 93.4 0.0075 4.7E-06 32.2 15.3 308 45-402 1-339 (339) 209 protein:vir:100331 Length: 342 93.4 0.0078 4.9E-06 32.1 18.0 298 101-408 1-342 (342) 210 protein:vir:7855 Length: 497 # 93.1 0.0088 5.4E-06 31.9 26.0 395 8-415 1-494 (497) 211 protein:vir:101650 Length: 497 93.1 0.0088 5.4E-06 31.9 26.0 395 8-415 1-494 (497) 212 protein:vir:96490 Length: 348 92.9 0.0094 5.8E-06 31.7 18.6 279 121-405 1-348 (348) 213 protein:vir:95603 Length: 463 92.8 0.0097 6E-06 31.6 13.4 293 73-415 1-308 (463) 214 protein:vir:99311 Length: 463 92.8 0.0097 6E-06 31.6 13.4 293 73-415 1-308 (463) 215 protein:vir:2736 Length: 348 # 92.7 0.01 6.4E-06 31.5 19.8 279 125-405 1-348 (348) 216 protein:vir:98856 Length: 343 92.4 0.011 7.1E-06 31.2 18.1 301 101-412 1-343 (343) 217 protein:vir:78558 Length: 336 91.7 0.014 8.9E-06 30.7 14.3 304 83-402 1-336 (336) 218 protein:vir:101557 Length: 336 90.4 0.021 1.3E-05 29.8 14.1 303 83-402 1-336 (336) 219 protein:vir:98143 Length: 524 90.4 0.021 1.3E-05 29.8 13.0 359 1-415 1-510 (524) 220 protein:vir:7214 Length: 521 # 89.5 0.026 1.6E-05 29.3 16.0 359 1-415 3-505 (521) 221 protein:vir:1268 Length: 397 # 84.8 0.057 3.6E-05 27.4 19.7 366 1-389 6-397 (397) 222 protein:vir:3746 Length: 336 # 83.4 0.069 4.3E-05 27.0 17.5 292 101-411 1-336 (336) 223 protein:vir:106998 Length: 468 82.8 0.074 4.6E-05 26.8 19.5 348 28-415 1-455 (468) 224 protein:vir:98480 Length: 348 82.5 0.076 4.7E-05 26.7 18.5 275 122-403 1-348 (348) 225 protein:vir:6601 Length: 528 # 82.0 0.081 5E-05 26.6 17.6 358 1-415 1-514 (528) 226 protein:vir:106734 Length: 336 81.6 0.085 5.2E-05 26.5 13.5 304 83-402 1-336 (336) 227 protein:vir:4902 Length: 348 # 81.5 0.085 5.3E-05 26.4 18.8 279 121-405 1-348 (348) 228 protein:vir:3783 Length: 336 # 79.8 0.1 6.3E-05 26.0 17.4 292 101-411 1-336 (336) 229 protein:vir:102823 Length: 470 79.2 0.11 6.6E-05 25.9 9.9 281 75-415 1-338 (470) 230 protein:vir:103886 Length: 302 77.8 0.12 7.5E-05 25.6 18.6 266 122-410 1-302 (302) 231 protein:vir:97053 Length: 390 76.5 0.13 8.4E-05 25.4 23.6 358 4-415 1-390 (390) 232 protein:vir:81070 Length: 390 76.2 0.14 8.6E-05 25.3 23.3 359 4-415 1-390 (390) 233 protein:vir:6901 Length: 522 # 75.6 0.15 9E-05 25.2 18.6 358 1-415 4-512 (522) 234 protein:vir:96666 Length: 462 74.2 0.16 0.0001 24.9 14.1 309 73-415 1-336 (462) 235 protein:vir:78920 Length: 290 73.5 0.17 0.00011 24.8 20.0 261 122-401 1-290 (290) 236 protein:vir:102335 Length: 312 72.1 0.19 0.00012 24.6 20.4 270 127-408 1-312 (312) 237 protein:vir:99888 Length: 309 71.5 0.2 0.00012 24.5 14.1 267 127-405 1-309 (309) 238 protein:vir:100603 Length: 529 71.3 0.2 0.00012 24.4 18.8 359 18-415 1-515 (529) 239 protein:vir:80835 Length: 464 66.7 0.26 0.00016 23.8 11.5 313 73-415 1-343 (464) 240 protein:vir:96079 Length: 382 66.6 0.26 0.00016 23.7 12.0 328 65-402 1-382 (382) 241 protein:vir:79987 Length: 415 65.7 0.28 0.00017 23.6 22.1 380 8-415 1-405 (415) 242 protein:vir:81100 Length: 415 65.7 0.28 0.00017 23.6 22.1 380 8-415 1-405 (415) 243 protein:vir:98339 Length: 415 65.7 0.28 0.00017 23.6 22.1 380 8-415 1-405 (415) 244 protein:vir:10364 Length: 390 65.2 0.29 0.00018 23.6 25.8 358 4-415 1-390 (390) 245 protein:vir:94673 Length: 419 57.7 0.43 0.00027 22.6 28.0 373 1-415 1-416 (419) 246 protein:vir:9410 Length: 415 # 54.2 0.51 0.00032 22.2 18.6 374 8-415 1-405 (415) 247 protein:vir:101811 Length: 529 53.0 0.54 0.00033 22.1 19.1 361 18-415 1-515 (529) 248 protein:vir:101039 Length: 529 52.8 0.55 0.00034 22.0 18.7 361 18-415 1-515 (529) 249 protein:vir:106286 Length: 534 52.7 0.55 0.00034 22.0 15.4 365 1-415 1-520 (534) 250 protein:vir:104915 Length: 470 48.3 0.67 0.00042 21.5 18.3 351 14-415 1-457 (470) 251 protein:vir:5670 Length: 514 # 46.6 0.73 0.00045 21.3 15.2 355 19-415 1-499 (514) 252 protein:vir:348 Length: 321 # 46.3 0.74 0.00046 21.3 16.4 278 99-404 1-321 (321) 253 protein:vir:104549 Length: 462 40.8 0.96 0.00059 20.7 19.5 339 29-415 1-449 (462) 254 protein:vir:4339 Length: 395 # 40.0 0.99 0.00061 20.6 21.5 349 1-395 5-395 (395) 255 protein:vir:103181 Length: 457 38.5 1.1 0.00066 20.4 18.9 341 29-415 1-448 (457) 256 protein:vir:962 Length: 397 # 37.9 1.1 0.00068 20.4 20.8 357 1-395 19-397 (397) 257 protein:vir:105038 Length: 428 36.6 1.2 0.00072 20.2 18.7 380 8-415 1-425 (428) 258 protein:vir:4600 Length: 415 # 36.6 1.2 0.00072 20.2 20.8 380 8-415 1-405 (415) 259 protein:vir:4700 Length: 415 # 36.6 1.2 0.00072 20.2 20.8 380 8-415 1-405 (415) 260 protein:vir:79078 Length: 307 36.3 1.2 0.00073 20.2 14.0 275 121-412 1-307 (307) 261 protein:vir:100135 Length: 418 34.8 1.3 0.00079 20.0 21.5 361 1-415 4-416 (418) 262 protein:vir:99576 Length: 388 33.8 1.3 0.00083 19.9 9.5 339 44-402 1-388 (388) 263 protein:vir:1383 Length: 421 # 33.4 1.4 0.00084 19.8 22.1 379 1-415 7-406 (421) 264 protein:vir:100851 Length: 514 32.2 1.4 0.0009 19.7 9.3 329 51-415 1-368 (514) 265 protein:vir:105464 Length: 346 30.6 1.6 0.00097 19.5 20.1 277 127-415 1-314 (346) 266 protein:vir:5942 Length: 523 # 28.0 1.8 0.0011 19.2 17.0 321 40-404 1-523 (523) 267 protein:vir:107732 Length: 379 27.2 1.9 0.0012 19.1 13.0 335 41-402 1-379 (379) 268 protein:vir:9704 Length: 394 # 26.1 2 0.0012 19.0 19.5 349 6-415 1-391 (394) 269 protein:vir:94870 Length: 318 20.8 2.7 0.0017 18.2 11.8 303 80-402 1-318 (318) 270 protein:vir:1433 Length: 435 # 20.3 2.8 0.0017 18.1 17.9 374 7-415 1-430 (435) No 1 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=100.00 E-value=3.7e-86 Score=488.92 Aligned_cols=415 Identities=100% Similarity=1.341 Sum_probs=375.4 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccch Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) ||+++||++++.++++++.++.+++++.+++++.++++++++++++|+++|+++++..++.++................. T Consensus 1 mk~~~em~~~l~el~~~~~~~~~e~~~~~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:46 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVNEA 80 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccchh Confidence 99999999999999999999999999999999999999999999999999999998888777766655555555555555 Q ss_pred hhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEc Q lcl|NC_012784. 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~ 160 (415) ........................+.+.+....+..........+..+++++||+++.+.|++.+++.++++++++++++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~ 160 (415) T protein:vir:46 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) T ss_pred hhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeec Confidence 55555555555555555556666667777777777666667777778899999999999999999999999999999999 Q ss_pred cCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) Q Consensus 161 ~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d 240 (415) +++++.+++++..+.+.+.|++||+++|+++.++|+.|++.+++++++++||+|+++|+.++|++||.++|++++++++| T Consensus 161 ~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d 240 (415) T protein:vir:46 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) T ss_pred cCCceeEEEEEecCCcceeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999998788899999999999999999999999999999999999999999999999 Q ss_pred HHHhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCcc Q lcl|NC_012784. 241 KAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDV 320 (415) Q Consensus 241 ~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~ 320 (415) .+|++|+|++.+.+...............+..+++++++++.++..+++.+++|||||++|.+|++++|++|+|||.+++ T Consensus 241 ~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~ 320 (415) T protein:vir:46 241 KAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDV 320 (415) T ss_pred HHHhhccccCCccccccccccccceeccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCc Confidence 99999999998887777666666666777888899999999999999999999999999999999999999999999999 Q ss_pred cCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecccCceEEEEEEEeccEEeccccEEEEE Q lcl|NC_012784. 321 KEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIE 400 (415) Q Consensus 321 ~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~d~~v~~p~a~~~~~ 400 (415) .++.+++|+|+||++++++|.+++++..++||||+++|++++|+++++++++|.++++.+|+++|+|+++++|+||++++ T Consensus 321 ~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~ 400 (415) T protein:vir:46 321 KEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIE 400 (415) T ss_pred CCCCCccccceeeEEeccccccCCCccEEEEEehhccEEEEeecceEEEeeccccCceEEEEEEEeccEEeccccEEEEE Confidence 99999999999999999999999899999999999989999999999999999999999999999999999999999999 Q ss_pred eecCCCCcccccccC Q lcl|NC_012784. 401 YDDSERGEGDLGLEA 415 (415) Q Consensus 401 ~t~~~~~~~~~~~~~ 415 (415) +++++.|+||++++| T Consensus 401 ~~~~~~~~~~~~~~~ 415 (415) T protein:vir:46 401 YDDSERGEGDLGLEA 415 (415) T ss_pred eeccCCCCCCccCCC Confidence 999999999999999 No 2 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=100.00 E-value=3.7e-86 Score=488.92 Aligned_cols=415 Identities=100% Similarity=1.341 Sum_probs=375.4 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccch Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) ||+++||++++.++++++.++.+++++.+++++.++++++++++++|+++|+++++..++.++................. T Consensus 1 mk~~~em~~~l~el~~~~~~~~~e~~~~~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:47 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVNEA 80 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccchh Confidence 99999999999999999999999999999999999999999999999999999998888777766655555555555555 Q ss_pred hhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEc Q lcl|NC_012784. 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~ 160 (415) ........................+.+.+....+..........+..+++++||+++.+.|++.+++.++++++++++++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~ 160 (415) T protein:vir:47 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) T ss_pred hhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeec Confidence 55555555555555555556666667777777777666667777778899999999999999999999999999999999 Q ss_pred cCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) Q Consensus 161 ~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d 240 (415) +++++.+++++..+.+.+.|++||+++|+++.++|+.|++.+++++++++||+|+++|+.++|++||.++|++++++++| T Consensus 161 ~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d 240 (415) T protein:vir:47 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) T ss_pred cCCceeEEEEEecCCcceeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999998788899999999999999999999999999999999999999999999999 Q ss_pred HHHhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCcc Q lcl|NC_012784. 241 KAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDV 320 (415) Q Consensus 241 ~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~ 320 (415) .+|++|+|++.+.+...............+..+++++++++.++..+++.+++|||||++|.+|++++|++|+|||.+++ T Consensus 241 ~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~ 320 (415) T protein:vir:47 241 KAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDV 320 (415) T ss_pred HHHhhccccCCccccccccccccceeccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCc Confidence 99999999998887777666666666777888899999999999999999999999999999999999999999999999 Q ss_pred cCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecccCceEEEEEEEeccEEeccccEEEEE Q lcl|NC_012784. 321 KEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIE 400 (415) Q Consensus 321 ~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~d~~v~~p~a~~~~~ 400 (415) .++.+++|+|+||++++++|.+++++..++||||+++|++++|+++++++++|.++++.+|+++|+|+++++|+||++++ T Consensus 321 ~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~ 400 (415) T protein:vir:47 321 KEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIE 400 (415) T ss_pred CCCCCccccceeeEEeccccccCCCccEEEEEehhccEEEEeecceEEEeeccccCceEEEEEEEeccEEeccccEEEEE Confidence 99999999999999999999999899999999999989999999999999999999999999999999999999999999 Q ss_pred eecCCCCcccccccC Q lcl|NC_012784. 401 YDDSERGEGDLGLEA 415 (415) Q Consensus 401 ~t~~~~~~~~~~~~~ 415 (415) +++++.|+||++++| T Consensus 401 ~~~~~~~~~~~~~~~ 415 (415) T protein:vir:47 401 YDDSERGEGDLGLEA 415 (415) T ss_pred eeccCCCCCCccCCC Confidence 999999999999999 No 3 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=100.00 E-value=7.5e-86 Score=487.26 Aligned_cols=415 Identities=100% Similarity=1.347 Sum_probs=373.1 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccch Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) ||+++||++++.++++++.++++++++.+++++.++++++++++++|+++|+++++.++++................... T Consensus 1 mk~~~el~~~l~el~~~~~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:98 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccchh Confidence 99999999999999999999999999999999999999999999999999999998888877776666655555555555 Q ss_pred hhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEc Q lcl|NC_012784. 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~ 160 (415) ........................+.+.+....+..........+..+||++||+++.+.|++.+++.++|+++++++++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~ 160 (415) T protein:vir:98 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) T ss_pred hhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeec Confidence 44444444444444444444555566666666666666666667778899999999999999999999999999999999 Q ss_pred cCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) Q Consensus 161 ~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d 240 (415) +++++++++++.++...+.|++|++++|+++.++|+.+++.+++++++++||+|+++|+.++|++||.++|++++++++| T Consensus 161 ~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~ 240 (415) T protein:vir:98 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) T ss_pred cCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999998778899999999999999999999999999999999999999999999999 Q ss_pred HHHhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCcc Q lcl|NC_012784. 241 KAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDV 320 (415) Q Consensus 241 ~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~ 320 (415) .+|++|+|++.+.+...............+..+|+++++++.++..+++.+++|+|||++|..|+++||++|+|||.+++ T Consensus 241 ~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~ 320 (415) T protein:vir:98 241 KAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDV 320 (415) T ss_pred HHHhhccccCccccccccccccccccccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCc Confidence 99999999998877777666666677777888999999999999999999999999999999999999999999999999 Q ss_pred cCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecccCceEEEEEEEeccEEeccccEEEEE Q lcl|NC_012784. 321 KEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIE 400 (415) Q Consensus 321 ~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~d~~v~~p~a~~~~~ 400 (415) .++.+++|+|+||++++++|.+++++.+++||||+++|++++|.++++++++|.++++.+|+++|+|+++++|+||++++ T Consensus 321 ~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~ 400 (415) T protein:vir:98 321 KEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIE 400 (415) T ss_pred CCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEeccccCceEEEEEEEeccEEeccccEEEEE Confidence 99999999999999999999998899999999999989999999999999999999999999999999999999999999 Q ss_pred eecCCCCcccccccC Q lcl|NC_012784. 401 YDDSERGEGDLGLEA 415 (415) Q Consensus 401 ~t~~~~~~~~~~~~~ 415 (415) +++++.|+||++++| T Consensus 401 ~~~~~~~~~~~~~~~ 415 (415) T protein:vir:98 401 YDDSERGEGDLGLEA 415 (415) T ss_pred EeccCCCCCccccCC Confidence 999999999999999 No 4 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=100.00 E-value=7.5e-86 Score=487.26 Aligned_cols=415 Identities=100% Similarity=1.347 Sum_probs=373.1 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccch Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) ||+++||++++.++++++.++++++++.+++++.++++++++++++|+++|+++++.++++................... T Consensus 1 mk~~~el~~~l~el~~~~~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:81 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccchh Confidence 99999999999999999999999999999999999999999999999999999998888877776666655555555555 Q ss_pred hhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEc Q lcl|NC_012784. 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~ 160 (415) ........................+.+.+....+..........+..+||++||+++.+.|++.+++.++|+++++++++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~ 160 (415) T protein:vir:81 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) T ss_pred hhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeec Confidence 44444444444444444444555566666666666666666667778899999999999999999999999999999999 Q ss_pred cCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) Q Consensus 161 ~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d 240 (415) +++++++++++.++...+.|++|++++|+++.++|+.+++.+++++++++||+|+++|+.++|++||.++|++++++++| T Consensus 161 ~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~ 240 (415) T protein:vir:81 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) T ss_pred cCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999998778899999999999999999999999999999999999999999999999 Q ss_pred HHHhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCcc Q lcl|NC_012784. 241 KAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDV 320 (415) Q Consensus 241 ~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~ 320 (415) .+|++|+|++.+.+...............+..+|+++++++.++..+++.+++|+|||++|..|+++||++|+|||.+++ T Consensus 241 ~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~ 320 (415) T protein:vir:81 241 KAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDV 320 (415) T ss_pred HHHhhccccCccccccccccccccccccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCc Confidence 99999999998877777666666677777888999999999999999999999999999999999999999999999999 Q ss_pred cCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecccCceEEEEEEEeccEEeccccEEEEE Q lcl|NC_012784. 321 KEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIE 400 (415) Q Consensus 321 ~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~d~~v~~p~a~~~~~ 400 (415) .++.+++|+|+||++++++|.+++++.+++||||+++|++++|.++++++++|.++++.+|+++|+|+++++|+||++++ T Consensus 321 ~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~ 400 (415) T protein:vir:81 321 KEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIE 400 (415) T ss_pred CCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEeccccCceEEEEEEEeccEEeccccEEEEE Confidence 99999999999999999999998899999999999989999999999999999999999999999999999999999999 Q ss_pred eecCCCCcccccccC Q lcl|NC_012784. 401 YDDSERGEGDLGLEA 415 (415) Q Consensus 401 ~t~~~~~~~~~~~~~ 415 (415) +++++.|+||++++| T Consensus 401 ~~~~~~~~~~~~~~~ 415 (415) T protein:vir:81 401 YDDSERGEGDLGLEA 415 (415) T ss_pred EeccCCCCCccccCC Confidence 999999999999999 No 5 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=100.00 E-value=7.5e-86 Score=487.26 Aligned_cols=415 Identities=100% Similarity=1.347 Sum_probs=373.1 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccch Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) ||+++||++++.++++++.++++++++.+++++.++++++++++++|+++|+++++.++++................... T Consensus 1 mk~~~el~~~l~el~~~~~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:79 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccchh Confidence 99999999999999999999999999999999999999999999999999999998888877776666655555555555 Q ss_pred hhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEc Q lcl|NC_012784. 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~ 160 (415) ........................+.+.+....+..........+..+||++||+++.+.|++.+++.++|+++++++++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~ 160 (415) T protein:vir:79 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) T ss_pred hhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeec Confidence 44444444444444444444555566666666666666666667778899999999999999999999999999999999 Q ss_pred cCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) Q Consensus 161 ~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d 240 (415) +++++++++++.++...+.|++|++++|+++.++|+.+++.+++++++++||+|+++|+.++|++||.++|++++++++| T Consensus 161 ~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~ 240 (415) T protein:vir:79 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) T ss_pred cCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999998778899999999999999999999999999999999999999999999999 Q ss_pred HHHhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCcc Q lcl|NC_012784. 241 KAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDV 320 (415) Q Consensus 241 ~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~ 320 (415) .+|++|+|++.+.+...............+..+|+++++++.++..+++.+++|+|||++|..|+++||++|+|||.+++ T Consensus 241 ~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~ 320 (415) T protein:vir:79 241 KAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDV 320 (415) T ss_pred HHHhhccccCccccccccccccccccccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCc Confidence 99999999998877777666666677777888999999999999999999999999999999999999999999999999 Q ss_pred cCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecccCceEEEEEEEeccEEeccccEEEEE Q lcl|NC_012784. 321 KEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIE 400 (415) Q Consensus 321 ~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~d~~v~~p~a~~~~~ 400 (415) .++.+++|+|+||++++++|.+++++.+++||||+++|++++|.++++++++|.++++.+|+++|+|+++++|+||++++ T Consensus 321 ~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~ 400 (415) T protein:vir:79 321 KEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIE 400 (415) T ss_pred CCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEeccccCceEEEEEEEeccEEeccccEEEEE Confidence 99999999999999999999998899999999999989999999999999999999999999999999999999999999 Q ss_pred eecCCCCcccccccC Q lcl|NC_012784. 401 YDDSERGEGDLGLEA 415 (415) Q Consensus 401 ~t~~~~~~~~~~~~~ 415 (415) +++++.|+||++++| T Consensus 401 ~~~~~~~~~~~~~~~ 415 (415) T protein:vir:79 401 YDDSERGEGDLGLEA 415 (415) T ss_pred EeccCCCCCccccCC Confidence 999999999999999 No 6 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=100.00 E-value=2.6e-85 Score=484.33 Aligned_cols=415 Identities=100% Similarity=1.343 Sum_probs=374.9 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccch Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) ||++++|++++.++++++.++.+++++.+++++.++++++.+++++|+++|+++++..++..+................. T Consensus 1 mk~~~el~~~l~el~~~~~~~~~~~~~~~~~~~~e~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:94 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccch Confidence 99999999999999999999999999999999999999999999999999999988888777776666655555555555 Q ss_pred hhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEc Q lcl|NC_012784. 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~ 160 (415) ........................+.+.+....+.......+..+..+||+++|+++.+.|++.+++.++++++++++++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~ 160 (415) T protein:vir:94 81 STYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) T ss_pred hhHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHhhhhhhhhhhccccccccccCcHHHHHHHHHHHHhhhhhhhhcceeec Confidence 55555555555555555555556666667767766666666777778899999999999999999999999999999999 Q ss_pred cCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) Q Consensus 161 ~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d 240 (415) +++++++++++.++.+.+.|++|++.+|+++.++|+.|++.+++++++++||+|+++|+.++|++||.++|+++++++++ T Consensus 161 ~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~ 240 (415) T protein:vir:94 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) T ss_pred cCCceeEEEEeecCCccceeccccccccccccccceeeEeeheeeeeechhhHHHHhhchHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999998778899999999999999999999999999999999999999999999999 Q ss_pred HHHhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCcc Q lcl|NC_012784. 241 KAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDV 320 (415) Q Consensus 241 ~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~ 320 (415) .+|++|+|++.+.+..............++..++++++++++++..+++.+++|+|||++|.+|+++||++|+|||.+++ T Consensus 241 ~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~ 320 (415) T protein:vir:94 241 KAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDV 320 (415) T ss_pred HHHhhccccCccccccccccccccccccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCc Confidence 99999999998888777766666667777788899999999999999999999999999999999999999999999999 Q ss_pred cCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecccCceEEEEEEEeccEEeccccEEEEE Q lcl|NC_012784. 321 KEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIE 400 (415) Q Consensus 321 ~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~d~~v~~p~a~~~~~ 400 (415) .++.+++|+|+||++++++|.++.++..++||||+++|++++|+++++++++|.++++.+|+++|+|+++++|+||++++ T Consensus 321 ~~~~~~~l~G~pV~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~ 400 (415) T protein:vir:94 321 KEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIE 400 (415) T ss_pred CCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEeccccCceEEEEEEEeccEEeccccEEEEE Confidence 99999999999999999999999889999999999989999999999999999999999999999999999999999999 Q ss_pred eecCCCCcccccccC Q lcl|NC_012784. 401 YDDSERGEGDLGLEA 415 (415) Q Consensus 401 ~t~~~~~~~~~~~~~ 415 (415) +++++.|+||++++| T Consensus 401 ~~~~~~~~~~~~~~~ 415 (415) T protein:vir:94 401 YDDSERGEGDLGLEA 415 (415) T ss_pred EeccCCCCCccccCC Confidence 999999999999999 No 7 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=100.00 E-value=1.4e-73 Score=419.99 Aligned_cols=386 Identities=20% Similarity=0.252 Sum_probs=308.5 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDE--LEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVN 78 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~--~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (415) ||+++||++.+.++++++....++......++. .+++++++++++.++++++.+++..+................... T Consensus 1 Mk~~~el~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ee~~~~~~~i~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (397) T protein:vir:49 1 MKTSNELHDLWVAQGDKVENLNEKLNVAMLDDSVSAEELQAIKNERDTAKMKRDMFKEQYTEARANEVANMSEEEKKPLT 80 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc Confidence 999999999999999999888777666554433 356777788888888777776666554433332211111111111 Q ss_pred chhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeE Q lcl|NC_012784. 79 EARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~ 158 (415) .............+. .+...............+.++||++||+++.+.|++.+++.++++++|+++ T Consensus 81 ~~~~~~~~~~~~~~~--------------~~l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~ 146 (397) T protein:vir:49 81 KSEEEVKAGFVKDFK--------------NLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVE 146 (397) T ss_pred cchhHHHHHHHHHHH--------------HHHhcchhHHHHHhhccccccCcccccHhHHHHHHHHHHhhhhHHhhhcee Confidence 000000000000000 000011111112233456678899999999999999999999999999999 Q ss_pred EccCCceeEEEEeecCC-cccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 159 RVTNGSGKYPVVRQSEV-AALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAA 237 (415) Q Consensus 159 ~~~~~~~~~~~~~~~~~-~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~ 237 (415) ++++.++++++++..+. +.+.|++||+++|+++.++|+++++++++++++++||+|+++|+.++|++||.++|++++++ T Consensus 147 ~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~ 226 (397) T protein:vir:49 147 NVTTLTGSRVYEKWTDITGLANIDDEAGKIADVDDPKLSLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVV 226 (397) T ss_pred ecccCccceEEEeeccCCcceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHhhhHHHHHHHHHHHHHHHHHH Confidence 99999999998876654 67899999999998778999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCccccc Q lcl|NC_012784. 238 TRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQ 317 (415) Q Consensus 238 ~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~ 317 (415) ++|.+|++|+|++.+.+ +..+++++++++.++..++..+++|+|||++|..|++|||++|||+|+ T Consensus 227 ~~d~ai~~G~g~~~~~~---------------~~~~~d~i~~~~~~l~~~~~~~a~~vmn~~~~~~l~~lkd~~G~~l~~ 291 (397) T protein:vir:49 227 TRNKAILEAIAALPTKP---------------TLTKWDDIIDLEAKVDPAIKQTSFFLTNTSGFTALKKVKNALGDYLME 291 (397) T ss_pred HHHHHHHhhcccccccc---------------ccccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceeec Confidence 99999999999876533 234589999999999999999999999999999999999999999999 Q ss_pred CcccCCCCceecceeeEEecc--ccccccCCceEEEechhhcEEEEeecceEEEEeec-----ccCceEEEEEEEeccEE Q lcl|NC_012784. 318 PDVKEKTQQRLLGAKIEILPD--EVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDY-----MHFGECLMIAVRQDCRI 390 (415) Q Consensus 318 ~~~~~~~~~~l~G~pV~~~~~--~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~-----~~~~~~~~~~~r~d~~v 390 (415) +++.++.+++|+|+||+++++ +|.++.++..++||||+++|+++++++++++++++ .++.+.+|++.|+|+++ T Consensus 292 ~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~ 371 (397) T protein:vir:49 292 RDVKSPTGYSIDGFAVKEVADRWLANGTGGAMPLYFGDLKQAVTLFDRQHMSLLSTNIGGGAFETDTTKVRVIDRFDVVA 371 (397) T ss_pred cCcCCCCCceecceeeEEecccccccccCCceeEEEeeccceEEEEeecceEEEEeccccchhhcCceeEEEEeeeCcEE Confidence 999999999999999998765 56667778889999999999999999999998764 34567799999999999 Q ss_pred eccccEEEEEeecCCCCcccccccC Q lcl|NC_012784. 391 LDYKSAIVIEYDDSERGEGDLGLEA 415 (415) Q Consensus 391 ~~p~a~~~~~~t~~~~~~~~~~~~~ 415 (415) .+|+||+.+++++++.+.|++++|| T Consensus 372 ~~~~a~~~~~~~~~~~~~~~~~~~~ 396 (397) T protein:vir:49 372 TDTEAFVPASFKAIADQKGNLGSTA 396 (397) T ss_pred ecccceEEEEeecccCCCCCccccc Confidence 9999999999999999999999999 No 8 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=100.00 E-value=2.8e-72 Score=412.85 Aligned_cols=394 Identities=15% Similarity=0.112 Sum_probs=304.4 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccch Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) |+.++++++.+.++++.+++..++....+++ ..++..++..+++.++++++++++..++.................. T Consensus 1 l~~~k~l~~~i~e~~~~~~~~k~~~~~~~~~-~e~~~~~l~~~~e~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~-- 77 (407) T protein:vir:48 1 MADVKDVEQVAQELQRKFDDFKEKNDKRIDA-IEQEKGKLAGEVETLNGKLAELENLKSDLEAELAEVKRPAGGTQNK-- 77 (407) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc-- Confidence 9999999999999888887665544433332 3345566777777777777776666554433322221111111110 Q ss_pred hhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEc Q lcl|NC_012784. 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~ 160 (415) .. ......+....+. .....+...+.......+...||++||+++.++|++.+++.++++++|+++++ T Consensus 78 ~~---~e~~~a~~~~l~~---------g~~~~~~~~e~~a~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~l~~~~~~~~~ 145 (407) T protein:vir:48 78 VA---SEHKEAFIGFMRK---------GREDGLRELERKALQVGNDEDGGYAIPEELDRTILTLLKDEVVMRQEATVITL 145 (407) T ss_pred hh---hHHHHHHHHHHhc---------cchhhhhHHHHHhhhcccCCCCcccccHhHHHHHHHHHHhhhhhhhhceeeec Confidence 00 0000011111000 00011122233444555667889999999999999999999999999999888 Q ss_pred cCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) Q Consensus 161 ~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d 240 (415) +++. +.++...+++.+.|++|++.+|+++.++|+.+++.++|++++++||+|+++|+.++|++||.++|++++++++| T Consensus 146 ~~~~--~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~~ 223 (407) T protein:vir:48 146 GGSD--YKKLVNLGGTTSGWVGETDARPETATSKLGLIEPFMGEIYGNPQATQKMLDDAFFNVEDWINSELALEFAEQEE 223 (407) T ss_pred CCCc--eEEEEecCCcceeeecccccccccccccceeEEeeeeeeEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHH Confidence 7665 44556677788999999999998777899999999999999999999999999999999999999999999999 Q ss_pred HHHhhcccccccccccccccccc------------ccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhh Q lcl|NC_012784. 241 KAIIDVITKGSTGSTSSGFEKEG------------KKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMK 308 (415) Q Consensus 241 ~~il~g~g~~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lk 308 (415) .+|++|+|++.|.++........ ..+...+..++++++++++.+..+|+.+++|+||+++|..|++|| T Consensus 224 ~a~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~~~a~~v~n~~~~~~L~~lk 303 (407) T protein:vir:48 224 IAFTSGDGSKKPKGFLAYESTDEDDKTRAFGKLQHIASGAASGVTADAIIKLIYTLRKAHRSGAKFMMNNSSLFAIRLLK 303 (407) T ss_pred hhhhccCCCCccceeeecccccccccccccccccccccccccccChHHHHHHHHhhchhhhcCCEEEEcHHHHHHHHHhh Confidence 99999999988776654332211 122334556799999999999999999999999999999999999 Q ss_pred ccCCcccccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecc-cCceEEEEEEEec Q lcl|NC_012784. 309 DKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM-HFGECLMIAVRQD 387 (415) Q Consensus 309 d~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-~~~~~~~~~~r~d 387 (415) |++|||||+++++.+.+++|+|+||+++++||..+++.++++||||+.+|++++|.++++..++|. .+.+.+|++.|+| T Consensus 304 D~~Gr~l~~~~~~~g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~i~~d~~~~~~~~~~~~~~r~d 383 (407) T protein:vir:48 304 DNDGNYLWRPGIELGQPSSLAGYGIVENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILRDPYTNKPFVGFYTTKRTG 383 (407) T ss_pred ccCCceeeccCcCCCCCceecceeeEEecCcCCccCCccEEEEEeccccEEEEEeeceEEEeeccccCCcEEEEEEEEec Confidence 999999999999999999999999999999998888888999999999899999999999888775 5567799999999 Q ss_pred cEEeccccEEEEEeecCCCCcccc Q lcl|NC_012784. 388 CRILDYKSAIVIEYDDSERGEGDL 411 (415) Q Consensus 388 ~~v~~p~a~~~~~~t~~~~~~~~~ 411 (415) +++++|+||+.+++++++.+-|.- T Consensus 384 ~~v~~~~a~~~l~~~aa~~~~~~~ 407 (407) T protein:vir:48 384 GMLVDSQAIKLMKIGAATRQKAAA 407 (407) T ss_pred cEEecccceEEEEeeccCCCCCCC Confidence 999999999999999877766655 No 9 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=100.00 E-value=9.6e-72 Score=409.92 Aligned_cols=390 Identities=19% Similarity=0.225 Sum_probs=306.3 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDE--LEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVN 78 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~--~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (415) |++++||++++.++++++++..+++...+.++. .++++++.++++.++++++.+++++++.................. T Consensus 4 ~m~l~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (408) T protein:vir:10 4 KLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLN 83 (408) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc Confidence 667999999999999999988887766654433 456677777888887777777766655443322111111111000 Q ss_pred chhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeE Q lcl|NC_012784. 79 EARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~ 158 (415) .............+... ... ..........+.....+..+||++||+++++.|++.+++.++++++++++ T Consensus 84 ~~~~~~~~~~~~~~~~~----~~~------~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~ 153 (408) T protein:vir:10 84 KSENELKDKFVKDFVNM----VRN------PMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVE 153 (408) T ss_pred cchhhhHHHHHHHHHHH----hhc------chhhhhhhhhhhhhcccccCCceeccHhHHHHHHHHHHhhchhhhhccee Confidence 00000000000000000 000 00111122334455567778899999999999999999999999999999 Q ss_pred EccCCceeEEEEeecCC-cccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 159 RVTNGSGKYPVVRQSEV-AALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAA 237 (415) Q Consensus 159 ~~~~~~~~~~~~~~~~~-~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~ 237 (415) ++++.++++++++..+. +.+.|++|++++|+++.++|++|++.+++++++++||+|+++|+.++|++||.++|++++++ T Consensus 154 ~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~ 233 (408) T protein:vir:10 154 SVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVV 233 (408) T ss_pred eccCCcceEEEeeccccccceeeecCccccccccCcceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHH Confidence 99999999999877654 56889999999998888999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhhccccccccccccccccccccccccchhhHHHHHHHHH-HhhhhccCCCEEEEcHHHHHHHHHhhccCCcccc Q lcl|NC_012784. 238 TRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAIN-LNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLI 316 (415) Q Consensus 238 ~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~ 316 (415) +++.+|++|+|++.+.. +..++++++++++ .+...|+.+++|+|||++|.+|+++||++|+|+| T Consensus 234 ~~~~~il~g~g~~~~~~---------------~~~~~~~l~~~~~~~~~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~ 298 (408) T protein:vir:10 234 TRNQAIIEVMKAAPKKP---------------TIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLL 298 (408) T ss_pred HHHHHHhhccccccccc---------------ccccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCceEe Confidence 99999999999875432 2345888888775 5667788889999999999999999999999999 Q ss_pred cCcccCCCCceecceeeEEecc--ccccccCCceEEEechhhcEEEEeecceEEEEeecc-----cCceEEEEEEEeccE Q lcl|NC_012784. 317 QPDVKEKTQQRLLGAKIEILPD--EVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM-----HFGECLMIAVRQDCR 389 (415) Q Consensus 317 ~~~~~~~~~~~l~G~pV~~~~~--~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-----~~~~~~~~~~r~d~~ 389 (415) ++++.++.+++|+|+||+++++ +|..+++...++||||+++|.+++|++++++++++. ++.+.+|++.|+|++ T Consensus 299 ~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~~~~~~f~~~~~~~r~~~r~d~~ 378 (408) T protein:vir:10 299 EPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVK 378 (408) T ss_pred ccCcCCCCCceecceeeEEecccccCccCCCceEEEEEehhccEEEEEecceEEEEcccccchhhcCceEEEEEEeeccE Confidence 9999999999999999999775 466566777899999999999999999999988753 456789999999999 Q ss_pred EeccccEEEEEeecCCCCcccccccC Q lcl|NC_012784. 390 ILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) Q Consensus 390 v~~p~a~~~~~~t~~~~~~~~~~~~~ 415 (415) +.+|+||+.+++++++...|++++++ T Consensus 379 v~~~~a~~~~~~~~~~~~~~~~~~~~ 404 (408) T protein:vir:10 379 ATDSEALVAGSFSAIADQVGNFKTTT 404 (408) T ss_pred EeccccEEEEEeeccccCCCCCCCCC Confidence 99999999999999999999999999 No 10 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=100.00 E-value=2.2e-71 Score=407.97 Aligned_cols=386 Identities=27% Similarity=0.381 Sum_probs=304.7 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccch Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) .++++||++++.+++++++++.+++++.+++++.+++++++++++++++++++++++++..+.................. T Consensus 3 ~~~l~el~~~l~e~~~~i~~~~~e~~~~~~~~~~~~~~~l~~eie~l~~ei~~l~~~~~~~e~~~e~~~~~~~~~~~~~~ 82 (394) T protein:vir:97 3 EEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEVTQ 82 (394) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccch Confidence 34699999999999999999999999999999999999999999999999999988877665544322221111111111 Q ss_pred hhhhhHHHHHHHHHHHHHhhh------hhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhc Q lcl|NC_012784. 81 RTYRNQANINDLGISIQNTKV------TSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKY 154 (415) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~ 154 (415) ..................... ...+..... .............+..+||+++|+++.+.|++.+++.++++++ T Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~t~~~gg~liP~~~~~~ii~~~~~~~~l~~~ 161 (394) T protein:vir:97 83 EEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPI-NETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPF 161 (394) T ss_pred hhHHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHHH-HhhhhhhhhccccccccccccChHHHHHHHHHHhhhhhhhhhh Confidence 111111111111111100000 000000000 1111122223345677789999999999999999999999999 Q ss_pred ceeEEccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHH Q lcl|NC_012784. 155 VTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMART 234 (415) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~ 234 (415) +++++++++++.+++.+. .++.+.|++|++++|+++.++|+.|++.++|++++++||+|+++|+.++|++||.++|+++ T Consensus 162 ~~~~~~~~~~~~~~~~~~-~~~~~~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~i~is~ell~ds~~~~~~~i~~~la~~ 240 (394) T protein:vir:97 162 TTVYQAKKASGKYPVLQR-ATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQI 240 (394) T ss_pred ceeeeccCcceEEEEEec-CCCccceecccccccccccccceeEEeehhheeeehhhHHHHHhhhhHHHHHHHHHHHHHH Confidence 999999999888888764 4457789999999998778999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcc Q lcl|NC_012784. 235 IAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNY 314 (415) Q Consensus 235 ~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~ 314 (415) ++++++.+|++|.+++.+. +..+++++++++...... ..+++|+|||++|..|++|+|++||| T Consensus 241 ~~~~~~~~i~~g~~~~~~~----------------~~~~~~~~~~~~~~~~~~-~~~a~~v~n~~~~~~l~~lkd~~G~~ 303 (394) T protein:vir:97 241 KVNTTNDAIAKVLKSFTTK----------------TVKNLDEIKALLNGGFDP-AYNVSLIVSQSFYQTLDTLKDGNGRY 303 (394) T ss_pred HHHHHHHHHhhcccccccc----------------ccccHHHHHHHHHhhhhh-hhCCEEEEcHHHHHHHHHhhccCCCe Confidence 9999999999987765331 234578888888766544 44689999999999999999999999 Q ss_pred cccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecccCceEEEEEEEeccEEeccc Q lcl|NC_012784. 315 LIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYK 394 (415) Q Consensus 315 l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~d~~v~~p~ 394 (415) ||.+++.++.+++|+|+||+++++++. +++.++||||+++|++++|++++++++++.++.+.+|+++|+|++|.+|+ T Consensus 304 i~~~~~~~~~~~~l~G~pv~~~~~~~~---~~~~~~~gd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~ 380 (394) T protein:vir:97 304 LLQDDITAVSGKVLLGKPVFVLSDEVL---GANKAFIGDFKRGVLFADRKDLGLRWADNEIYGQYLQAVLRFGVSKVDDK 380 (394) T ss_pred eeecCcCCCCCceeccceeEEeccccc---CCccEEEeeccccEEEEEecceEEEEecccccceeEEEEEEEccEEeccc Confidence 999999999999999999999876543 44568999999989999999999999999999999999999999999999 Q ss_pred cEEEEEeecCCCCc Q lcl|NC_012784. 395 SAIVIEYDDSERGE 408 (415) Q Consensus 395 a~~~~~~t~~~~~~ 408 (415) ||+.+++++++.+. T Consensus 381 a~~~~~~~~~~~p~ 394 (394) T protein:vir:97 381 AGYYVTFTPEPLPL 394 (394) T ss_pred ceEEEEecccccCC Confidence 99999999999999 No 11 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=100.00 E-value=8.1e-71 Score=404.84 Aligned_cols=386 Identities=20% Similarity=0.233 Sum_probs=302.3 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhch--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNN--DELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVN 78 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e--~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (415) ||+++||++.+.++++++....+++.....+ ...+++++++.+++++.++++.+++..+................... T Consensus 1 Mk~~~el~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (397) T protein:vir:48 1 MKTSNELHDLWVAQGDKVENLNEKLNVAMLDDSVTAEELQAIKNERDTAKMKRDMFKEQYTEARANEVVNMSEEEKKPLT 80 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhcccccc Confidence 9999999999999988888776666544332 23456677777777777777666655543333222111111111000 Q ss_pred chhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeE Q lcl|NC_012784. 79 EARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~ 158 (415) ......... ..+... .+...............+.++||++||+++++.|++.+++.++++++++++ T Consensus 81 ~~~~~~~~~----~~~~~~----------~~~~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~ 146 (397) T protein:vir:48 81 KSEEEVKAG----FVKDFK----------NLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVE 146 (397) T ss_pred chhhHHHHH----HHHHHH----------HHHhhhhhHHHHHhhccCCccccccccHHHHHHHHHHHHHHHHHHhhhcee Confidence 000000000 000000 000000111111223345667899999999999999999999999999999 Q ss_pred EccCCceeEEEEeecC-CcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 159 RVTNGSGKYPVVRQSE-VAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAA 237 (415) Q Consensus 159 ~~~~~~~~~~~~~~~~-~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~ 237 (415) ++++.++.+++++..+ .+.++|++|++.+|+++.++|++|++++++++++++||+|+++|+.++|++||.++|++++++ T Consensus 147 ~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~v~~~l~~~~~~ 226 (397) T protein:vir:48 147 NVTTLTGSRVYEKWADITGLAKLDDEAGSIGTNDDPKLYPIRYAIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVV 226 (397) T ss_pred eccCCcceEEEEeecCCCcceeeeccccccccccccceeeEEeeheeeeeehhhHHHHHhhchHHHHHHHHHHHHHHHHH Confidence 9999999998877654 456899999999998777899999999999999999999999999999999999999999999 Q ss_pred HHHHHHhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCccccc Q lcl|NC_012784. 238 TRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQ 317 (415) Q Consensus 238 ~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~ 317 (415) ++|.+|++|+|++.+.+ +..+++++++++.++...+..+++|+|||++|..|++|||++|+|+|+ T Consensus 227 ~~d~~il~G~g~~~~~~---------------~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~i~~ 291 (397) T protein:vir:48 227 TRNKAILEAIATLPTKP---------------TLTKWDDIIDLQAKVDPAIKQTSFFLTNTSGFTALKKVKNAFGDYLME 291 (397) T ss_pred HHHHHHhhccccccccc---------------ccccHHHHHHHHHHhhhhhcCCCEEEECHHHHHHHHHhhcCCCceeec Confidence 99999999998875432 334689999999999999999999999999999999999999999999 Q ss_pred CcccCCCCceecceeeEEecc--ccccccCCceEEEechhhcEEEEeecceEEEEeec-----ccCceEEEEEEEeccEE Q lcl|NC_012784. 318 PDVKEKTQQRLLGAKIEILPD--EVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDY-----MHFGECLMIAVRQDCRI 390 (415) Q Consensus 318 ~~~~~~~~~~l~G~pV~~~~~--~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~-----~~~~~~~~~~~r~d~~v 390 (415) +++.++.+++|+|+||+++++ +|.+..++..++||||++++.++++++++++.+++ .++.+.+|+++|+|+++ T Consensus 292 ~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~ 371 (397) T protein:vir:48 292 RDVKSPTGYSIDGFAVKEVADRWLANASSGAMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKIRVIDRFDVVA 371 (397) T ss_pred cCcCCCCCceeccceeEEecccccCCcCCCceEEEEEeccceEEEEeecceEEEEeccchhhhhcCceeEEEEeeeccEE Confidence 999999999999999998765 45566678889999999999999999999988763 45567899999999999 Q ss_pred eccccEEEEEeecCCCCcccccccC Q lcl|NC_012784. 391 LDYKSAIVIEYDDSERGEGDLGLEA 415 (415) Q Consensus 391 ~~p~a~~~~~~t~~~~~~~~~~~~~ 415 (415) ++|+||+.+++++++.+.++++++| T Consensus 372 ~~~~a~~~~~~~~~~~~~~~~~~~~ 396 (397) T protein:vir:48 372 TDTESFVPASFKAIADQKGNLGSTA 396 (397) T ss_pred ecccceEEEEecccccCCCCccccC Confidence 9999999999999999999999999 No 12 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=100.00 E-value=6.5e-71 Score=405.36 Aligned_cols=386 Identities=19% Similarity=0.202 Sum_probs=302.3 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhch--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNN--DELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVN 78 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e--~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (415) ||+++||++++.+++++++...+++.....+ ...+++++++.+++.++++++.+++..+..+................ T Consensus 1 Mk~~~eL~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (397) T protein:vir:49 1 MKTSNELHDLWIAQGDKVENLNEKLNVAMLDDSVSAEELQAIKNERDTAKMKRDLFKEQYTEARANEVANMSEEEKKPLT 80 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccc Confidence 9999999999999999888776665544332 33456777777777777777766665544433322211111111110 Q ss_pred chhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeE Q lcl|NC_012784. 79 EARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~ 158 (415) .............+... ...............+.+.||++||+++.+.|++.+++.++++++++++ T Consensus 81 ~~~~~~~~~~~~~~~~~--------------l~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~ 146 (397) T protein:vir:49 81 KNEEEVKANFVKDFKNL--------------VRGRYQNLLDSKTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVE 146 (397) T ss_pred chhhHHHHHHHHHHHHH--------------hhcchhhHHHhhhccCCccCcceecHHHHHHHHHHHHhhhhHhhhccee Confidence 00000000000000000 0000111122333456677899999999999999999999999999999 Q ss_pred EccCCceeEEEEeecCC-cccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 159 RVTNGSGKYPVVRQSEV-AALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAA 237 (415) Q Consensus 159 ~~~~~~~~~~~~~~~~~-~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~ 237 (415) +++++++++++++.... +.+.|++|++.+|+++.++|+.|++++++++++++||+|+++|+.++|++||.++|++++++ T Consensus 147 ~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~ 226 (397) T protein:vir:49 147 NVTTLTGSRVYEKWADITGLAKLDDEGGQIGQNDDPKLSLIRYAIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVV 226 (397) T ss_pred eccCCcceEEEEeeccCCcceeeeccccccccccccceeeeEeeeeeeEeehhhHHHHHhhhhHHHHHHHHHHHHHHHHH Confidence 99999999999877654 67889999999998777899999999999999999999999999999999999999999999 Q ss_pred HHHHHHhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCccccc Q lcl|NC_012784. 238 TRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQ 317 (415) Q Consensus 238 ~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~ 317 (415) ++|.+|++|+|++.+.. +..+++++++++.++..+++.+++|+|||++|..|++|||++|+|||. T Consensus 227 ~~d~ail~G~g~~~~~~---------------~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~g~~l~~ 291 (397) T protein:vir:49 227 TRNKAILEAIGTLPNKP---------------TLAKWDDIIDLQAKVDPAIKQTSLFLTNTSGFTALKKVKNAMGDYLME 291 (397) T ss_pred HHHHHHHhccccccccc---------------cccCHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceeec Confidence 99999999999876532 234689999999999999999999999999999999999999999999 Q ss_pred CcccCCCCceecceeeEEecc--ccccccCCceEEEechhhcEEEEeecceEEEEeec-----ccCceEEEEEEEeccEE Q lcl|NC_012784. 318 PDVKEKTQQRLLGAKIEILPD--EVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDY-----MHFGECLMIAVRQDCRI 390 (415) Q Consensus 318 ~~~~~~~~~~l~G~pV~~~~~--~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~-----~~~~~~~~~~~r~d~~v 390 (415) +++.++.+++|+|+||+++++ +|...+++..++||||+++|+++++++++++++++ .++.+.+|+++|+|+++ T Consensus 292 ~~~~~g~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~~ 371 (397) T protein:vir:49 292 RDVKSPTGYSIDGFVVKEISDRFLPNGTGGAMPLYFGDLKQAVTLFDRQHLSLLSTNIGGGAFETDTTKVRVIDRFDVVS 371 (397) T ss_pred ccccCCCCceecceeeEEecccccccccCCceeEEEeeccceEEEEeecccEEEEeccccchhhcCeeeEEEEEeeccEE Confidence 999999999999999998764 56667778889999999999999999999998764 35567799999999999 Q ss_pred eccccEEEEEeecCCCCcccccccC Q lcl|NC_012784. 391 LDYKSAIVIEYDDSERGEGDLGLEA 415 (415) Q Consensus 391 ~~p~a~~~~~~t~~~~~~~~~~~~~ 415 (415) ++|+||+++++++++..-+.+++++ T Consensus 372 ~~~~a~~~~~~~~~~~~~~~~~~~~ 396 (397) T protein:vir:49 372 TDTEAFVPASFKAIADQKAKLSTAG 396 (397) T ss_pred ecccceEEEEecccccccCcccccC Confidence 9999999999999888777777666 No 13 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=100.00 E-value=1.2e-70 Score=404.00 Aligned_cols=390 Identities=20% Similarity=0.231 Sum_probs=305.4 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDE--LEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVN 78 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~--~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (415) |++++||++++.++.+++++..++....+.+.. .++++++..+++.++++++.++++++................... T Consensus 4 ~m~i~el~~~~~~~~~~~~~~~~e~~~~~~~~~~~~e~i~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (408) T protein:vir:74 4 KLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLN 83 (408) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Confidence 678999999999999999888777666554333 345667777777777777776666554333222211111111100 Q ss_pred chhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeE Q lcl|NC_012784. 79 EARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~ 158 (415) .. ............ ....+.................+...||++||+++++.|++.+++.++++++++++ T Consensus 84 ~~----~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~ 153 (408) T protein:vir:74 84 KS----ENELKDKFVKDF------VNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVE 153 (408) T ss_pred ch----hhhhHHHHHHHH------HHHHhcchhhhhhhhhhhhcccccCCCceeechhHhhHHHHHHhhhcchhhhccee Confidence 00 000000000000 00111111222233444455667778899999999999999999999999999999 Q ss_pred EccCCceeEEEEeecCC-cccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 159 RVTNGSGKYPVVRQSEV-AALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAA 237 (415) Q Consensus 159 ~~~~~~~~~~~~~~~~~-~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~ 237 (415) +++++++.+++++..+. +.+.|++|++.+|+++.++|++|+++++|++++++||+|+++|+.++|++||.++|++++++ T Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~ 233 (408) T protein:vir:74 154 SVSTSSGSRVYEKWTDVTPLKAMDEEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVVV 233 (408) T ss_pred eccCCcceEEEEeecCCcccccccccccccccccccceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHH Confidence 99999999999887664 56679999999998888999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhhccccccccccccccccccccccccchhhHHHHHHHHH-HhhhhccCCCEEEEcHHHHHHHHHhhccCCcccc Q lcl|NC_012784. 238 TRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAIN-LNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLI 316 (415) Q Consensus 238 ~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~ 316 (415) ++|.+|++|+|++.+.+ +..++++++++++ .+...+..+++|+|||.+|.+|+++||++|+|+| T Consensus 234 ~~d~~il~G~G~~~~~~---------------~~~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~l~ 298 (408) T protein:vir:74 234 TRNQAIIAAMGTVPKKP---------------TIANFDDVITMINTSVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLL 298 (408) T ss_pred HHHHHHhhccccccccc---------------ccccHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceEe Confidence 99999999999876532 2234788888774 6667788899999999999999999999999999 Q ss_pred cCcccCCCCceecceeeEEecc--ccccccCCceEEEechhhcEEEEeecceEEEEeec-----ccCceEEEEEEEeccE Q lcl|NC_012784. 317 QPDVKEKTQQRLLGAKIEILPD--EVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDY-----MHFGECLMIAVRQDCR 389 (415) Q Consensus 317 ~~~~~~~~~~~l~G~pV~~~~~--~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~-----~~~~~~~~~~~r~d~~ 389 (415) .+++.++.+++|+|+||+++++ +|..+++...++||||+++|++++|++++++++++ .++.+.+|+++|+|++ T Consensus 299 ~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~ 378 (408) T protein:vir:74 299 EPDPTKPNSYLIKGKQVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVK 378 (408) T ss_pred ccCcCCCCCceecceeeEEecCcccccccCCcceEEEEehhccEEEEEecceEEEEeccccchhhcceeeEEEEEeeCcE Confidence 9999999999999999999875 56666777889999999999999999999998864 4566789999999999 Q ss_pred EeccccEEEEEeecCCCCcccccccC Q lcl|NC_012784. 390 ILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) Q Consensus 390 v~~p~a~~~~~~t~~~~~~~~~~~~~ 415 (415) +++|+||+.+++++.++..|++++++ T Consensus 379 ~~~~~a~~~~~~~~~~~~~~~~~~~~ 404 (408) T protein:vir:74 379 ATDSEALVAGSFTAIADQVGNFKTTT 404 (408) T ss_pred EecccceEEEEeecccCCCCCCCCCc Confidence 99999999999999999999999888 No 14 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=100.00 E-value=1.1e-69 Score=398.68 Aligned_cols=386 Identities=21% Similarity=0.357 Sum_probs=291.7 Q ss_pred CChHHH---HHHHHHHHHHHHHHHHHHHHHhhchH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccc Q lcl|NC_012784. 1 MKTKEE---LQSEISDIKRQIDLKVKYATRALNND----ELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQ 73 (415) Q Consensus 1 Mk~~~e---l~~~l~~l~~~~~~~~~~~~~~~~e~----~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~ 73 (415) |+..++ |++++.+++++++++.++.+..+.+. ...+.+++.++++.+++++++++++++.............. T Consensus 1 ~~l~e~i~e~~~~l~el~~~~~~~~~e~r~~~e~~~~~~~~~~~~e~~~~~~~l~~ei~~l~e~~~~~~~~~~~~~~~~~ 80 (400) T protein:vir:38 1 MTLDEKLAAVKKQLDEKRSALPAMKTELRSLLEGEDSEENLKKAEGVRAKYDKAGKEIKDLEEKRDLYEAALKGNEQSSG 80 (400) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 876554 45777778788877777766554322 23345666677777888887777776655444333222211 Q ss_pred cccccchhhhhhHHHHHHHH----HHHHHhhhhhHHHHHH---HHHHhhhhhhhhcccccccceeecchhHHhHHHHHHh Q lcl|NC_012784. 74 SVEVNEARTYRNQANINDLG----ISIQNTKVTSQEVRDF---TEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKE 146 (415) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~ 146 (415) ......... .......... ................ ...............+..+||++||+++.+.|++.++ T Consensus 81 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~ 159 (400) T protein:vir:38 81 KKPDHPEEH-SYRDALNAYLHTRGRNTDGVNFEKTDVGTFAVLRAVPTDASDAVNAGVKAADAASTIPETISNTPQRELQ 159 (400) T ss_pred ccccchhhh-hHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhhcccccCCcccccHHHHHHHHHHHH Confidence 111111100 0000000000 0000000000000000 0011111122233346677899999999999999999 Q ss_pred hhhhhhhcceeEEccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHH Q lcl|NC_012784. 147 VEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQE 226 (415) Q Consensus 147 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~ 226 (415) +.+++++++++++++++++.+|+++..+ +.+.|++|++.+|+.+.++|++|++.+++++++++||+|+++|+.+++++| T Consensus 160 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~E~~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~ 238 (400) T protein:vir:38 160 TVVDLKPFTNVFQASTQKGTYPTVANAT-TKMVTVAELEKNPAMAKPEFKPVNWSVETYRQALPVSQESIDDSAIDLVGL 238 (400) T ss_pred hhhhhhhcceeEeccCcceEEEEEecCC-CccccccccccccccccccceeeEeehhheeeehhhHHHHHhhhHHHHHHH Confidence 9999999999999999999999887544 567899999999987789999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHH Q lcl|NC_012784. 227 LKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDK 306 (415) Q Consensus 227 l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~ 306 (415) |.++|++++..+++.+|++|+|++.+. +..+++++.+++....+.+ .+++|+|||++|..|++ T Consensus 239 i~~~l~~~~~~~~~~~i~~~~~~~~~~----------------~~~~~~~~~~~~~~~~~~~-~~a~~v~~~~~~~~l~~ 301 (400) T protein:vir:38 239 IAQNGQQIKVNTTNGAVATLLKGFTAK----------------TISSVDDLKHINNVDLDPA-YSRVIIASQSFYNFLDT 301 (400) T ss_pred HHHHHHHHHHHHHHHhhhhcccccccc----------------ccccHHHHHHHHHhhhhhh-hCcEEEEcHHHHHHHHH Confidence 999999999999999999998865432 2334778888777655544 47899999999999999 Q ss_pred hhccCCcccccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecccCceEEEEEEEe Q lcl|NC_012784. 307 MKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQ 386 (415) Q Consensus 307 lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~ 386 (415) |||++|+|||.+++.++.+++|+|+||++++++|.+..+++.++||||+++|++++|+++++.++++++|.+.+|+++|+ T Consensus 302 lkd~~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~~~g~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~ 381 (400) T protein:vir:38 302 VKDGNGRYLLQDSILTPSGKSVLGMPIAVVSDDTLGAAGEAHAFLGDIKRAILFANRADFMVRWVDDQIYGQFLQAGMRF 381 (400) T ss_pred hhccCCCeeeecCcCCCCccccccceeEEecccccCCCCceEEEEEeccccEEEEeecceEEEEecccccceeEEEEEEe Confidence 99999999999999999999999999999999999988999999999999999999999999999999999999999999 Q ss_pred ccEEeccccEEEEEeecCC Q lcl|NC_012784. 387 DCRILDYKSAIVIEYDDSE 405 (415) Q Consensus 387 d~~v~~p~a~~~~~~t~~~ 405 (415) |++|++|+||+.+++++++ T Consensus 382 d~~~~~~~a~~~l~~~~~a 400 (400) T protein:vir:38 382 GVSVADEKAGYFLTYTPKA 400 (400) T ss_pred ccEEecccceEEEEeecCC Confidence 9999999999999999888 No 15 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=100.00 E-value=6e-70 Score=400.08 Aligned_cols=387 Identities=14% Similarity=0.089 Sum_probs=285.7 Q ss_pred CCh-HHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccc Q lcl|NC_012784. 1 MKT-KEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNE 79 (415) Q Consensus 1 Mk~-~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (415) |+. ++++++.+.++++++++......+.+. +-..+..++..+++.+++++++++...+..........+......... T Consensus 1 m~~~lk~l~~~~~el~~~~~~~k~~~~~~~~-~~e~~~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (401) T protein:vir:44 1 MAVDIKDVEQVAQELQQKFDDFKAKNDKRVE-AIEQEKGKLAGQVETLNGKLSELENLKSDLEKELLELKRPARGAQNKV 79 (401) T ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccch Confidence 444 355555555555554443322222221 112233445566666666666655554443332222111111111110 Q ss_pred hhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEE Q lcl|NC_012784. 80 ARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR 159 (415) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~ 159 (415) . ......+....+.. ..........+.....+.+.||++||+++.++|++.+++.++++++|++++ T Consensus 80 --~---~e~~~a~~~~lr~~---------~~~~~~~~e~~a~~~~~~~~GG~~iP~~~~~~ii~~~~~~~~l~~~~~~~~ 145 (401) T protein:vir:44 80 --A---AEHKDAFVGFLRKG---------REDGLRDLERKALQVGTDEDGGYAVPEELDRSILSLLKDEVVMRQEATVIT 145 (401) T ss_pred --h---HHHHHHHHHHHhhh---------hhhhhHHHHHHHhhcCCCCCCceeccHhHHHHHHHHHHhhhhhhhhceeee Confidence 0 00111111111000 001111122334455566788999999999999999999999999999999 Q ss_pred ccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 160 VTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATR 239 (415) Q Consensus 160 ~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~ 239 (415) ++++... ++...+++.+.|++|++.+|+++.++|++|++.++|++++++||+|+++|+.++|++||.++|++++++++ T Consensus 146 ~~~~~~~--~~~~~~~~~a~wv~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~ai~~~~ 223 (401) T protein:vir:44 146 VGGSDYK--KLVNLGGTASGWVGETDTRSQTATSRLGLIEPFMGEIYGNPQATQKMLDDAFFNVEAWINSELATEFAEQE 223 (401) T ss_pred cCCCceE--EEEecCCccceeeccccccCccccccceeeeeehhheeeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHH Confidence 8766544 55567778889999999999877789999999999999999999999999999999999999999999999 Q ss_pred HHHHhhcccccccccccccccccc------------ccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHh Q lcl|NC_012784. 240 NKAIIDVITKGSTGSTSSGFEKEG------------KKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKM 307 (415) Q Consensus 240 d~~il~g~g~~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l 307 (415) |.+||+|+|++.|.++........ ..+...+..++++++++++.+...|..+++|+|||++|..|++| T Consensus 224 ~~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~l 303 (401) T protein:vir:44 224 EIAFTTGDGTKKPKGFLAYESTEESDKARAFGKLQHIVSGEATAVTADAIIKLIYTLRKAHRTGAKFMMNNNSLFAIRLL 303 (401) T ss_pred HhhhhccCCCCccceeeccccccccccccccccccccccccccccCHHHHHHHHHhcchhhhcCCEEEEcHHHHHHHHHh Confidence 999999999988776553222111 12223445569999999999999999999999999999999999 Q ss_pred hccCCcccccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecc-cCceEEEEEEEe Q lcl|NC_012784. 308 KDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM-HFGECLMIAVRQ 386 (415) Q Consensus 308 kd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-~~~~~~~~~~r~ 386 (415) +|++|||||+++++.+.+++|+|+||++++++|...+++++++||||+++|++++|.++++..+++. ++...+|+++|+ T Consensus 304 kd~~G~~l~~~~~~~g~~~~l~G~PVv~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~~~~~~~~~~~~v~~~a~~r~ 383 (401) T protein:vir:44 304 KDTEGNYLWRPGLELGQPSSLAGYGIAENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILRDPYTNKPFVGFYTTKRT 383 (401) T ss_pred hccCCceeecCCcCCCCCceecceeeEEecCcCCccCCccEEEEeehhccEEEEEecceEEeeeccccCCcEEEEEEEEe Confidence 9999999999999999999999999999999998888888899999999899999999999888775 456679999999 Q ss_pred ccEEeccccEEEEEeecC Q lcl|NC_012784. 387 DCRILDYKSAIVIEYDDS 404 (415) Q Consensus 387 d~~v~~p~a~~~~~~t~~ 404 (415) |+++++|+||+++++.++ T Consensus 384 d~~~~~~~a~~~l~~~aa 401 (401) T protein:vir:44 384 GGMLVDSQAIKLLKIAAA 401 (401) T ss_pred ccEEecccceEEEEeecC Confidence 999999999999999999 No 16 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=100.00 E-value=3e-69 Score=396.23 Aligned_cols=358 Identities=27% Similarity=0.379 Sum_probs=289.8 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccch Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) |. +++.++++++..+.++.+..+++++.+++++++++++.|+++|+++++..++..+........... T Consensus 1 M~------k~l~~l~e~~~~~~~e~~~~~~~~~~e~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~------ 68 (371) T protein:vir:81 1 MP------KELRELLEQINNKKEEARKLLAENKIEEAKKLKEEIVALQEKFDVAKELYEEQKQTIEDKEPLKPT------ 68 (371) T ss_pred Cc------HHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc------ Confidence 55 345555555566666666677788888899999999999999988777655443322111111000 Q ss_pred hhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEc Q lcl|NC_012784. 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~ 160 (415) .... ......+...++..........+...||++||+++++.|++.+++.++++++++++++ T Consensus 69 -~~~~-----------------~~~~~~~~~~l~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~ 130 (371) T protein:vir:81 69 -VQVK-----------------ENEVEAFVNHIRTRFRNAMSEGSNQDGGYTVPQDIQTRINELRESKDALQNLITVEPV 130 (371) T ss_pred -hhhH-----------------HHHHHHHHHHHHHHHHHhhccCCCccCceeecHhHHHHHHHHHHhhhhhhhhceeeec Confidence 0000 0111122222233333444556677889999999999999999999999999999999 Q ss_pred cCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) Q Consensus 161 ~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d 240 (415) +++++.+++++..+.+.+.|++||+.+|+++.++|++++++++|++++++||+|+++|+.++|++||.++|++++++++| T Consensus 131 ~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~a~~~~~~ 210 (371) T protein:vir:81 131 TTLSGSRVFKKRSQQTGFVEVAEGAAIGEKATPQFTLLQYQVKKYAGFFRVTNELLNDSTEAIVNTLVRWIGDESRVTRN 210 (371) T ss_pred cCCceeEEEEeecCCcceeeeccccccccccccceeeEEeeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHH Confidence 99989999999988899999999999998778999999999999999999999999999999999999999999999999 Q ss_pred HHHhhccccccccccccccccccccccccchhhHHHHHHHHH-HhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCc Q lcl|NC_012784. 241 KAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAIN-LNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPD 319 (415) Q Consensus 241 ~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~ 319 (415) .+|++|+|++.+.+ ..+++++..++. .+...+..+++|+|||++|..|+++||++|+|+|.++ T Consensus 211 ~~i~~g~g~~~~~~----------------~~~~~~i~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~ 274 (371) T protein:vir:81 211 GLIINVLNTKAKTA----------------IADLDGLKQIINVQLDPVFRSTSSVIVNQDAFNWLDTLKDQNGQYLLQPS 274 (371) T ss_pred HHHHhhcccccccc----------------cccHHHHHHHHHhhcchhhhcCCEEEEcHHHHHHHHHhhccCCCeeeecc Confidence 99999998765422 234677777664 5667788899999999999999999999999999999 Q ss_pred ccCCCCceecceeeEEeccccccc-------cCCceEEEechhhcEEEEeecceEEEEeecc-----cCceEEEEEEEec Q lcl|NC_012784. 320 VKEKTQQRLLGAKIEILPDEVLGQ-------KGNNTLIIGNLKDAIVLFDRSQYQASWTDYM-----HFGECLMIAVRQD 387 (415) Q Consensus 320 ~~~~~~~~l~G~pV~~~~~~~~~~-------~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-----~~~~~~~~~~r~d 387 (415) +.++.+++|+|+||+++++||.+. .+...++||||+++|++++|.+++++++++. ++.+.+|++.|+| T Consensus 275 ~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~~~i~~Gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~v~~~~~~r~d 354 (371) T protein:vir:81 275 ISSPTGRQLLGLPVVIVSNKVLANRVDGGTGAQFAPIIVGDLKEAVVMFDRQRTEIMSSNVAMDAFETDATLWRAIERMD 354 (371) T ss_pred cCCCCCceecceeEEEecccccCccccccccCCcceEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeec Confidence 999999999999999999998543 3456789999999999999999999987754 4567899999999 Q ss_pred cEEeccccEEEEEeecC Q lcl|NC_012784. 388 CRILDYKSAIVIEYDDS 404 (415) Q Consensus 388 ~~v~~p~a~~~~~~t~~ 404 (415) +++.+|+||++++++++ T Consensus 355 ~~~~~~~a~~~~~~~~A 371 (371) T protein:vir:81 355 VKMRDDEAFVFGEVQLA 371 (371) T ss_pred cEEecccceEEEEEecC Confidence 99999999999999999 No 17 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=100.00 E-value=9.6e-69 Score=393.48 Aligned_cols=390 Identities=19% Similarity=0.224 Sum_probs=300.9 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDE--LEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVN 78 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~--~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (415) +.+++||++++.++++++++..+++...+.+.+ .++++++.+++++++.+++.++++++................... T Consensus 4 ~m~l~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (404) T protein:vir:39 4 KLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLN 83 (404) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc Confidence 336799999999999999988888776665433 445666777777777777776666555433222211111111110 Q ss_pred chhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeE Q lcl|NC_012784. 79 EARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~ 158 (415) .............+....+. ..........+.....+.++||++||+++++.|++.+++.++++++++++ T Consensus 84 ~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~e~~a~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~ 153 (404) T protein:vir:39 84 KSEYELKDKFVKEFVNMVRN----------PMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVE 153 (404) T ss_pred cchhhhHHHHHHHHHHHHhc----------chhhhhhhhhhhhhcccccCCceeccHHHHHHHHHHHHhhhhHHhhccee Confidence 00000001111111111000 01111222334445567788899999999999999999999999999999 Q ss_pred EccCCceeEEEEeecCC-cccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 159 RVTNGSGKYPVVRQSEV-AALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAA 237 (415) Q Consensus 159 ~~~~~~~~~~~~~~~~~-~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~ 237 (415) +++++.+++++++..+. +.+.|++|++++|+++.++|+.+++++++++++++||+|+++|+.++|++||.++|++++++ T Consensus 154 ~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~ 233 (404) T protein:vir:39 154 SVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVVV 233 (404) T ss_pred eccCCcceEEEEeecCCccceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHH Confidence 99999999998877554 67899999999998788999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhhccccccccccccccccccccccccchhhHHHHHHHHH-HhhhhccCCCEEEEcHHHHHHHHHhhccCCcccc Q lcl|NC_012784. 238 TRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAIN-LNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLI 316 (415) Q Consensus 238 ~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~ 316 (415) ++|.+|++|+|++.+.+ ...+++++.++++ .+...+..+++|+|||++|..|+++||++|||+| T Consensus 234 ~~d~~il~g~g~~~~~~---------------~~~~~~~i~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~ 298 (404) T protein:vir:39 234 TRNQAIIAAMGTVPKKP---------------TIAKFDDVITMINTSVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLL 298 (404) T ss_pred HHHHHHHhccccccccc---------------ccccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCceee Confidence 99999999998875432 2234788888876 4556677889999999999999999999999999 Q ss_pred cCcccCCCCceecceeeEEeccc--cccccCCceEEEechhhcEEEEeecceEEEEeec-----ccCceEEEEEEEeccE Q lcl|NC_012784. 317 QPDVKEKTQQRLLGAKIEILPDE--VLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDY-----MHFGECLMIAVRQDCR 389 (415) Q Consensus 317 ~~~~~~~~~~~l~G~pV~~~~~~--~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~-----~~~~~~~~~~~r~d~~ 389 (415) ++++.++.+++|+|+||++++++ |..+.++..++||||+++|.+++++++++.++++ .++.+.+|++.|+|++ T Consensus 299 ~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~ 378 (404) T protein:vir:39 299 EPDPTKPNSYLIKGKKVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVK 378 (404) T ss_pred ccCcCCCCcceecceeEEEecccccCccCCCccEEEEEeccccEEEEeecceEEEEeccchhhhhhceeeEEEEeeeccE Confidence 99999999999999999998764 4555566789999999999999999999998775 3556789999999999 Q ss_pred EeccccEEEEEeecCCCCcccccccC Q lcl|NC_012784. 390 ILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) Q Consensus 390 v~~p~a~~~~~~t~~~~~~~~~~~~~ 415 (415) +.+|+||+.+++++++..+|.+++-= T Consensus 379 ~~~~~a~~~~~~~~~a~~~~~~~~~~ 404 (404) T protein:vir:39 379 TTDSEALVAGSFTAIADQVGNFTAGK 404 (404) T ss_pred EecccceEEEEeeccccCCCCCCCCC Confidence 99999999999999999888876555 No 18 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=100.00 E-value=5.3e-69 Score=394.88 Aligned_cols=381 Identities=13% Similarity=0.123 Sum_probs=275.8 Q ss_pred CCh-HHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHHHHHhhhhh Q lcl|NC_012784. 1 MKT-KEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQ---------EITDLRSQIQEKQEELDKLKEKDGTSEN 70 (415) Q Consensus 1 Mk~-~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~---------e~~~l~~~i~~~~~~~~~~~~~~~~~~~ 70 (415) |++ +.|+|++..+..+++.++..+..+.+.+++.+++++++. +++.++.+++.++..+++.......... T Consensus 21 ~~~~l~e~ra~~~~e~~~l~~~~~~~~~~~k~~~~~~~~~~~~~~~~~e~~~~~~~~~~ei~~~~~~~~~~~~~~~~~~~ 100 (425) T protein:vir:10 21 VPRGIISVRAEGPTEVKALIENLQKAFHDFKAEHTKQLDAVKAGLPTSDALAKVDKVSADLEALQAAVDEANIKIAAAQM 100 (425) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Confidence 332 333443332111111122222222233333333333332 2333334444333333322221111111 Q ss_pred ccccccccchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhh-hhhhhhcccccccceeecchhHHhHHHHHHhhhh Q lcl|NC_012784. 71 NQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLET-RNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEF 149 (415) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~ 149 (415) ..... ......... ..+...++. .........+.+.||++||+++++.|++.+++.+ T Consensus 101 ~~~~~-----~~~~~~~~~-----------------~af~~~l~~~e~~~al~~~t~~~gG~lvP~~~~~~ii~~~~~~s 158 (425) T protein:vir:10 101 GANGV-----KPLRDPEYT-----------------EAFKAHVKRGDVQAALNKGEDSEGGYLTPIEWDRTITNKLVLIS 158 (425) T ss_pred ccccc-----cccccHHHH-----------------HHHHHHhhhhhhHHHhhcCcCCCCceeccHhHHHHHHHHHHhhh Confidence 00000 000000000 011111111 1122334456778899999999999999999999 Q ss_pred hhhhcceeEEccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHH Q lcl|NC_012784. 150 NLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKL 229 (415) Q Consensus 150 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~ 229 (415) +++++|++++++++...++ ...+.+.+.|++|++.+|+++.++|+++++.++|++++++||+|+++|+.++|++||.+ T Consensus 159 ~l~~l~~~~~~~~~~~~~~--~~~~~~~a~wv~E~~~~~~~~~~~f~~v~~~~~k~~~~i~iS~ell~ds~~~l~~~i~~ 236 (425) T protein:vir:10 159 PMRQLCRVQPVSKAGFSKL--FNMGGTTSGWVGEASQRPQTNAATFQPLSFASGEIYANPAATQQILDDAEIDLESWLAT 236 (425) T ss_pred hhhhhceeeeccCCceEEE--EEcCCcceeeeccccccccccccccceeeeeheeeEeehHhHHHHHhcchhHHHHHHHH Confidence 9999999999887766655 45677789999999999987778999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHhhcccccccccccccccccc------------ccccccchhhHHHHHHHHHHhhhhccCCCEEEEc Q lcl|NC_012784. 230 WMARTIAATRNKAIIDVITKGSTGSTSSGFEKEG------------KKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVS 297 (415) Q Consensus 230 ~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~ 297 (415) +|++++++++|.+||+|+|++.|.++........ ..+...+..+++++++++..+...|+.+++|+|| T Consensus 237 ~la~ai~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~~~~a~~vmn 316 (425) T protein:vir:10 237 EVQTEFAKQEGKAFLAGDGTNKPNGLLTYIAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVYDLPSAFTGNARFAMN 316 (425) T ss_pred HHHHHHHHHHHhhhhcccCCCCcceeeeccccccccccccccccccccccccccccHHHHHHHHhhhhhhhccCCEEEEc Confidence 9999999999999999999988776654332211 1223445668899999999999999999999999 Q ss_pred HHHHHHHHHhhccCCcccccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecc-cC Q lcl|NC_012784. 298 QTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM-HF 376 (415) Q Consensus 298 ~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-~~ 376 (415) |++|.+|++|||++|||||++++..+.+++|+|+||+++++||....+..+++||||+++|++++|.++++..++|. .+ T Consensus 317 ~~~~~~L~~lkD~~G~~l~~~~~~~g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~v~~d~~~~~~ 396 (425) T protein:vir:10 317 RNTQRQVRKLKDGQGNYLWQPSYVAGQPATLAGYPVTEVPDMPDVAANSTPILFGDFQQTYLIIDRIGVRVLRDPYTAKP 396 (425) T ss_pred hHHHHHHHHhhcCCCceeeccCccCCCCceecceeeEEecCcCCccCCccEEEEEehhccEEEEEecceEEEecccccCC Confidence 99999999999999999999999999999999999999999998888888999999999999999999999887765 45 Q ss_pred ceEEEEEEEeccEEeccccEEEEEeecCC Q lcl|NC_012784. 377 GECLMIAVRQDCRILDYKSAIVIEYDDSE 405 (415) Q Consensus 377 ~~~~~~~~r~d~~v~~p~a~~~~~~t~~~ 405 (415) .+.++++.|+|++|++|+||+.+++.++. T Consensus 397 ~~~~~~~~r~d~~v~~~~A~~~l~~~as~ 425 (425) T protein:vir:10 397 YVLFYTTKRVGGGLLNPEPMRAMKVAASE 425 (425) T ss_pred cEEEEEEEEeccEeecccceEEEEeeccC Confidence 66789999999999999999999999988 No 19 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=100.00 E-value=5.1e-69 Score=395.01 Aligned_cols=382 Identities=25% Similarity=0.326 Sum_probs=290.0 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccc----c- Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQS----V- 75 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~----~- 75 (415) |. -+|++++.+++++++++..+.+..+.+++.++++++.+++++|+++|+.+++..+...+........... . T Consensus 1 ~~--~~m~k~l~el~~~~~~~~~~~~~~~~~~~~ee~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (397) T protein:vir:12 1 MP--MQMSKKEIALRQQFTEKKQQADKALQEGNTDEARALLDEVKQLKNQIELMTEGRSLDVPDLPGGVNFVPEQERNPE 78 (397) T ss_pred CC--CcHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhc Confidence 21 1223334444444455555566667778888899999999999999988776554433222211111110 0 Q ss_pred --cccchhhh-hhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhh Q lcl|NC_012784. 76 --EVNEARTY-RNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLD 152 (415) Q Consensus 76 --~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~ 152 (415) ........ ........+........ ...+. ...............+.++||++||+++.+.|++.+++.++++ T Consensus 79 ~~~~~~~~~~~~~~~~~~a~~~~~~~~~-~~~~~---~~~~~~~~~~a~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~ 154 (397) T protein:vir:12 79 GQRSQGQGNEERQQQYSKAFLKGLRGKR-LTDEE---RDLLDSPEFRAMSGINDEDGGILIPEDIGRQIHEFKRQFEPLE 154 (397) T ss_pred ccccccchhhHHHHHHHHHHHHHHhccC-CcHHH---HHHHhhhhhhhccccccccCcccCchhHHHHHHHhhhhhhhHH Confidence 00000000 11111111111111111 11111 1111222334445667778899999999999999999999999 Q ss_pred hcceeEEccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHH Q lcl|NC_012784. 153 KYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMA 232 (415) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la 232 (415) +++++++++++++.+++++.++.+.+.|++||+++|+++.++|+.|++.++|++++++||+|+++|+.++|++||.++|+ T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~is~e~l~ds~~~l~~~i~~~l~ 234 (397) T protein:vir:12 155 QYVTVEPVTTRSGTRLLEKNADMVPFSPVEELGNLPEIDQPRFTKVSYSIIDYGGIMTLSNSMLNDSDQAIMTYVAKWFA 234 (397) T ss_pred hhcceeeccCCceeEEEEEecCCcceeeecccccccccccccceeEEeeheeeEeeehhhHHHHhhchHHHHHHHHHHHH Confidence 99999999999999999999999999999999999987789999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhccccccccccccccccccccccccchhhHHHHHHHHH-HhhhhccCCCEEEEcHHHHHHHHHhhccC Q lcl|NC_012784. 233 RTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAIN-LNVKPNYEHNVAIVSQTMFAKLDKMKDKL 311 (415) Q Consensus 233 ~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~v~~~~~~~~l~~lkd~~ 311 (415) +++++++|.+|++|+|++.+.+ ..+++++++++. .+..++..+++|+|||++|.+|+++||++ T Consensus 235 ~~~~~~~d~~il~G~g~~~~~g----------------~~~~~~i~~~~~~~l~~~~~~~a~~~~n~~~~~~L~~lkd~~ 298 (397) T protein:vir:12 235 KKSVVTRNNLILAAIASLKKVD----------------IDGLDGIKKALNVTLDPMVAPGSIVLTNQDGYDWLDTLKDGT 298 (397) T ss_pred HHHHHHHHHHHHhccccccccc----------------cccHHHHHHHHhhccchhhhCCCEEEEcHHHHHHHHHhhccC Confidence 9999999999999998875432 234788888775 67788888999999999999999999999 Q ss_pred CcccccCcccCCCCceecceeeEEecc-ccccccCCceEEEechhhcEEEEeecceEEEEeec-----ccCceEEEEEEE Q lcl|NC_012784. 312 GNYLIQPDVKEKTQQRLLGAKIEILPD-EVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDY-----MHFGECLMIAVR 385 (415) Q Consensus 312 G~~l~~~~~~~~~~~~l~G~pV~~~~~-~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~-----~~~~~~~~~~~r 385 (415) |+|+|++++.++.+++|+|+||+++++ +|..+.++..++||||+++|.++++++++++++++ ..+.+.+|+++| T Consensus 299 G~~l~~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r 378 (397) T protein:vir:12 299 GRYLLQPDPTNPTKKLLDGRPVVPFTNRVLKTQKGKAPLIIGNLKEAIVLFDREQQSIASTDTGAGAFETNSTKVRGIER 378 (397) T ss_pred CceeecccccCCCCccccceeeEEecccccccCCCccEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEe Confidence 999999999999999999999988776 46566778889999999999999999999988764 345678999999 Q ss_pred eccEEeccccEEEEEeecC Q lcl|NC_012784. 386 QDCRILDYKSAIVIEYDDS 404 (415) Q Consensus 386 ~d~~v~~p~a~~~~~~t~~ 404 (415) +|+++++|+||+++++|+- T Consensus 379 ~d~~~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 379 EDVRKWDEDAVVFGQITVE 397 (397) T ss_pred eccEEecccceEEEEEeeC Confidence 9999999999999999998 No 20 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=100.00 E-value=4.6e-68 Score=389.77 Aligned_cols=397 Identities=14% Similarity=0.107 Sum_probs=298.7 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHH-HhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccc Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYAT-RALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNE 79 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~-~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (415) |+ ++||+++++++.+++++..++.. +.+++++.+++++++++++.|+++|++.++..+.................... T Consensus 1 M~-l~eL~e~r~~l~~e~~~l~~k~~~~~~t~e~~~~~~~~~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~ 79 (409) T protein:vir:45 1 MK-LHELKQKRNTIATDMRALNEKIGDNAWTEEQRTEWNKAKSELEALDERIAREEELRRQDQAYIESNEEEQRQNLDPE 79 (409) T ss_pred CC-HHHHHHHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcccCCCC Confidence 99 68999999999988888777654 46889999999999999999999998877665443332222222111111111 Q ss_pred hhhhhhHHHHHHHHHHHH--HhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhccee Q lcl|NC_012784. 80 ARTYRNQANINDLGISIQ--NTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTV 157 (415) Q Consensus 80 ~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~ 157 (415) ............+..... .......+...+ ......+..+...||++||+++.++|++.+++.++++++|++ T Consensus 80 ~~~~~~~~~~~a~~~~l~~~~~~~~~~e~~~~------~~~~a~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~ 153 (409) T protein:vir:45 80 NNSQQDEKRAQVFDKWMRHGASELTSEERKAL------RELRAQGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQI 153 (409) T ss_pred CcchhhHHHHHHHHHHHHhhhhhccHHHHHHH------HHHhhccCccCcCCceeccHhHHHHHHHHHHhhhhhhhhcee Confidence 111111111111111111 111111111111 122344455667789999999999999999999999999999 Q ss_pred EEccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEE-EeehhhHHHHhcchHHHHHHHHHHHHHHHH Q lcl|NC_012784. 158 KRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHR-GYFRISREAIEDAKVNVLQELKLWMARTIA 236 (415) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a-~~~~iS~e~l~ds~~~l~~~l~~~la~~~~ 236 (415) ++++++...++.........+.|++|++.+|++ .+.|+.+++.++|++ ++++||+|+++|+.++|++||.++|+++++ T Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~-~~~f~~~~l~~~k~~~~~i~is~ell~ds~~~l~~~i~~~la~a~~ 232 (409) T protein:vir:45 154 LTTSDGRTMEWATADGTSEVGVLLGENEEAGEE-DTDFGMGSLGALKMTSKIIRVSNELLQDSAIDMEAYLARRIAERIG 232 (409) T ss_pred eecCCCceEEEEeeccCcccccccccccccccc-ccccceeeeeeeeeeeeehhhhHHHHhccHHHHHHHHHHHHHHHHH Confidence 998766533333333334567899999999975 579999999999985 688999999999999999999999999999 Q ss_pred HHHHHHHhhccccccc---cccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEE--EEcHHHHHHHHHhhccC Q lcl|NC_012784. 237 ATRNKAIIDVITKGST---GSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVA--IVSQTMFAKLDKMKDKL 311 (415) Q Consensus 237 ~~~d~~il~g~g~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--v~~~~~~~~l~~lkd~~ 311 (415) .+++.+||+|+|++.+ .++.... .........+..+++++++++..+...|+.++.| +||+.+|..|++|||++ T Consensus 233 ~~~~~a~l~G~G~~~~~~p~Gil~~~-~~~~~~~~~~~~~~d~i~~l~~~l~~~~~~~a~~~~~~n~~~~~~l~~lkd~~ 311 (409) T protein:vir:45 233 RGEARYLIQGTGAGTPKQPKGLAASV-TGTTQTAAANAVKWQEILALKHSIDPAYRRGPKFRLAFNDNTLKLISEMEDGQ 311 (409) T ss_pred HHHHHHhhccCCCCCccccceeeecc-ccccccccccccchHHHHHHHHhhhhhhccCCeEEEEECHHHHHHHHHhhcCC Confidence 9999999999998743 2332221 2223334445667899999999999999888865 67999999999999999 Q ss_pred CcccccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecc---cCceEEEEEEEecc Q lcl|NC_012784. 312 GNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM---HFGECLMIAVRQDC 388 (415) Q Consensus 312 G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~---~~~~~~~~~~r~d~ 388 (415) |||||++++..+.+.+|+|+||+++++||....++.+++||||+++ .+.++.+++++++++. .+.+.+|++.|+|+ T Consensus 312 G~~i~~~~~~~~~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~-~i~~~~~~~~~~~~d~~~~~~~~~~~~~~r~d~ 390 (409) T protein:vir:45 312 GRPLWLPDIVGVAPASVLNVPYVIDQEIDDIGAGKKFMFCGDFDRF-IIRRVRYMILKRLVERYAEYDQTGFLAFHRFDC 390 (409) T ss_pred CceeeccCcCCCCCceecceeeEEecCcCCccCCccEEEEeehhhh-heeeccceEEEEeecccccCCcEEEEEEEEecc Confidence 9999999999999999999999999999987778888999999985 5677889998876543 34567999999999 Q ss_pred EEeccccEEEEEeecCCCC Q lcl|NC_012784. 389 RILDYKSAIVIEYDDSERG 407 (415) Q Consensus 389 ~v~~p~a~~~~~~t~~~~~ 407 (415) ++++|+||+.+++.+++.+ T Consensus 391 ~~~~~~A~~~l~~k~s~~~ 409 (409) T protein:vir:45 391 ILEDTSAIKALVGKGSVGG 409 (409) T ss_pred EeechhheEEEEeccCCCC Confidence 9999999999999888777 No 21 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=100.00 E-value=5.2e-68 Score=389.44 Aligned_cols=385 Identities=23% Similarity=0.384 Sum_probs=284.8 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccch Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) |++++++++++.+..+++.++.++..... ....++.++++++++++..+++.++++++.++.................. T Consensus 1 M~~l~~l~~~~~~~~~e~~~~~~~~~~~~-~~~~ee~~~~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~ 79 (394) T protein:vir:10 1 MDKLQTLFNEVSAKCADLNAQLNAKLQDE-NASVDDFQKIKDDLTAAKARRDAINDQIKDLEAENKANSDPDKPVDNAQP 79 (394) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHhhh-hccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhhhhcc Confidence 99999999888887777666654432221 12234555666666666666666665555444333222111111110000 Q ss_pred hhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEc Q lcl|NC_012784. 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~ 160 (415) ............. ......+...............+.+.||++||+++++.|++.+++.++++++|+++++ T Consensus 80 ~~~~~~~~~~~~~---------~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~ 150 (394) T protein:vir:10 80 NGTDLKKKPIDAK---------KKAINDFIHSHGKVIDNAAGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPV 150 (394) T ss_pred cccchhhhHHHHH---------HHHHHHHHhccchhhhhhhcccccccCceeccHHHHHHHHHHHHhhhhhhhhceeeec Confidence 0000000000000 0011111111111222334456777889999999999999999999999999999999 Q ss_pred cCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) Q Consensus 161 ~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d 240 (415) +++++.+++++..+ ..+.|++|++++|+++.++|++|++.+++++++++||+|+|+|+.++|++||.++|+++++.++| T Consensus 151 ~~~~~~~~~~~~~~-~~~~~~~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~ 229 (394) T protein:vir:10 151 TTPKGTYPILKRAT-DRFSSVAELAENPALAEPEFEQVDWSVSTYRGAIPLSEEAIADSAVDLTSLVGQSINEKSVNTYN 229 (394) T ss_pred cCCceEEEEEecCC-CccccccccccccccccccceeEEeeeeeeEeeehhHHHHHhhhhHHHHHHHHHHHHHHHHHHHH Confidence 99999999887644 56789999999998788999999999999999999999999999999999999999999999999 Q ss_pred HHHhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCcc Q lcl|NC_012784. 241 KAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDV 320 (415) Q Consensus 241 ~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~ 320 (415) .+|++|+|++.+... .+..+++++.+++......++ +++|||||++|..|++|+|++|||||++++ T Consensus 230 ~~il~g~g~~~~~~~-------------~~~~~~d~l~~~~~~~~~~~~-~a~~vmn~~~~~~l~~lkd~~G~~i~~~~~ 295 (394) T protein:vir:10 230 AMIAPVLQSFTAKAT-------------TTDTLVDSLKHILNVDLDPAY-SRALVVTQSLFNTLDTLKDKNGRYLLHDAS 295 (394) T ss_pred HHHhhcccccccccc-------------cccccHHHHHHHHHhhhhhhc-cCEEEecHHHHHHHHHhhccCCCeeeeccc Confidence 999999987654322 234567888888876666555 589999999999999999999999998776 Q ss_pred cC----CCCceecceeeEEecccc-ccccCCceEEEechhhcEEEEeecceEEEEeecccCceEEEEEEEeccEEecccc Q lcl|NC_012784. 321 KE----KTQQRLLGAKIEILPDEV-LGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKS 395 (415) Q Consensus 321 ~~----~~~~~l~G~pV~~~~~~~-~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~d~~v~~p~a 395 (415) .. +.+++|+|+||++++++. ....++..++||||+++|++++++++++.++++..|.+.+++++|+|+++++|+| T Consensus 296 ~~~~~~~~~~~L~G~PV~~~~~~~~~~~~~~~~i~~gd~s~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~~~~~~a 375 (394) T protein:vir:10 296 DSITDGTAKGTVLGVPVYVVGDALLGSAAGDQKAFVGDLKRGVLFADRQQVTLAWEDSKIYGRYLGAAFRFGVKQADSNA 375 (394) T ss_pred cccccCCcccccccceeEEecccccCCCCCceEEEEeeccccEEEEeecceEEEEecccccceeEEEEEEeccEEecccc Confidence 44 455799999999887653 3445677899999999999999999999999999999999999999999999999 Q ss_pred EEEEEeecCCCCcccccccC Q lcl|NC_012784. 396 AIVIEYDDSERGEGDLGLEA 415 (415) Q Consensus 396 ~~~~~~t~~~~~~~~~~~~~ 415 (415) |+++++++++.+ ++| T Consensus 376 i~~~~~~~~~~~-----~~~ 390 (394) T protein:vir:10 376 GYFVTNTDAASG-----STS 390 (394) T ss_pred EEEEEeecccCC-----CCC Confidence 999999976543 222 No 22 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=100.00 E-value=9.2e-68 Score=388.09 Aligned_cols=395 Identities=19% Similarity=0.213 Sum_probs=296.1 Q ss_pred CCh-HHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccc Q lcl|NC_012784. 1 MKT-KEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNE 79 (415) Q Consensus 1 Mk~-~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (415) |++ +++++++++++.++++...++. +...++++++++++++|+++|++.++..+................. . T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~-----~~~~ee~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~--~ 73 (404) T protein:vir:10 1 MSKELRELLNQLDSKNKELNSLLNKD-----GVTAEELNKTSNEIDILQAKIEAQKRKENIENNFNEDNVKSLNTGK--E 73 (404) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHhhc-----CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc--c Confidence 887 6677777777766655544321 1223467788889999999988766654433322221111111111 0 Q ss_pred hhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEE Q lcl|NC_012784. 80 ARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR 159 (415) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~ 159 (415) ........ .................. ..............+.+.||++||+++.+.|++.+++.+++++++++.+ T Consensus 74 ~~~~~~~~---~~~~~~~~~~~~~~~~~~--~~~~~~e~~a~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~ 148 (404) T protein:vir:10 74 ENVIYNGA---LFVRAIADNLLKQKNQRG--LNLSEKEINAISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEP 148 (404) T ss_pred hhhHHHHH---HHHHHHHHHHHHHHHhhh--hcchhhHHhhhccccCCCCceeechhHHHHHHHHHhhhhhHhhhhceee Confidence 01111111 111111111111111111 1111222333445566788999999999999999999999999999999 Q ss_pred ccCCceeEEEEeecCCccccccccccccccc-ccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 160 VTNGSGKYPVVRQSEVAALEKVEELEENPEL-AVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAAT 238 (415) Q Consensus 160 ~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~-~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~ 238 (415) +++.++.+++++..+.+.+.|++|++.+|.+ ..++|+.++++++|++++++||+|+++|+.++|++||.++|+++++++ T Consensus 149 ~~~~~g~~~~~~~~~~~~~~~v~e~~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~ 228 (404) T protein:vir:10 149 VFTRSGSRTYEKRSKQKPMKPLSENQQIPTNGDNGKLERFNFKLKDLADFMSIPNDLLKFADKSLEDWIINWFVDKVRIT 228 (404) T ss_pred ccCCccceEEEEecCCcceeeccccccccccccccceeeeEeeheeeEeeehhhHHHHhhcHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999998865 358899999999999999999999999999999999999999999999 Q ss_pred HHHHHhhccccccccccccccccccccccccchhhHHHHHHHHH-HhhhhccCCCEEEEcHHHHHHHHHhhccCCccccc Q lcl|NC_012784. 239 RNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAIN-LNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQ 317 (415) Q Consensus 239 ~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~ 317 (415) +|.+||+|+|++.+..+.... .........+...++++..++. .+...+..+++|+|||++|.+|+++||++|||+|. T Consensus 229 ~~~~il~G~g~~~~~~gi~~~-~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~ 307 (404) T protein:vir:10 229 RNAEILYGAGGDEHATGIMTA-NKFKKITLPKSPALKDFKKCKNVELLNVFKATSSWIVNQDGFNYLDSLEDKTGRPYLQ 307 (404) T ss_pred HHHHHhhcCCCCCcccceeec-cccceeeccccccHHHHHHHHHhhhhccccCCCEEEEcHHHHHHHHHhhccCCceeec Confidence 999999999988765554432 2223334445567889888876 55566777889999999999999999999999999 Q ss_pred CcccCCCCceecceeeEEe-ccccccccCCceEEEechhhcEEEEeecceEEEEeec-----ccCceEEEEEEEeccEEe Q lcl|NC_012784. 318 PDVKEKTQQRLLGAKIEIL-PDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDY-----MHFGECLMIAVRQDCRIL 391 (415) Q Consensus 318 ~~~~~~~~~~l~G~pV~~~-~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~-----~~~~~~~~~~~r~d~~v~ 391 (415) +++.++.+++|||+||+++ +.++..+.++.+++||||++++++++|.+++++++++ .++.+.+|+++|+|+++. T Consensus 308 ~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~v~ 387 (404) T protein:vir:10 308 PDPKDPTQYRFLGLPVIELPNDLLLSTESAIPVLLGDTKEAYKYVSDGAYELATTNIGAGAFETNTTKARIIMRIDGNVK 387 (404) T ss_pred cCcCCCCCccccceeeEEecccccCCCCCccEEEEEeccccEEEEEecceEEEEeccccchhhcCceEEEEEEeeccEEe Confidence 9999999999999999865 4567677788889999999999999999999988654 345677999999999999 Q ss_pred ccccEEEEEeecCCCCc Q lcl|NC_012784. 392 DYKSAIVIEYDDSERGE 408 (415) Q Consensus 392 ~p~a~~~~~~t~~~~~~ 408 (415) +|+||+.+++++++.+. T Consensus 388 ~~~a~~~~~~~~aa~~~ 404 (404) T protein:vir:10 388 DSEALLIAEIPVESVQA 404 (404) T ss_pred cccceEEEEeecccCCC Confidence 99999999999988887 No 23 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=100.00 E-value=2e-67 Score=386.22 Aligned_cols=385 Identities=21% Similarity=0.238 Sum_probs=293.3 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccch Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) |+. +||++++.++.+++++..++.+....+...++.+...+++++++++++.+++..+..++................. T Consensus 1 M~~-~eL~~~~~~~~~~~~~l~e~~~~~~~~~~~~~~~~~~ee~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (395) T protein:vir:38 1 MNI-NQLKDAFDMAGQKVQDLEDKRAQFAIDLGNDASSHSVDDINKLNASLKNAKMAQELAKSAYEDARANLNAEPVNKK 79 (395) T ss_pred CCH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccc Confidence 987 6699999998888877776666655555555555666777788888877776665544443332222111111111 Q ss_pred hhh-hhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEE Q lcl|NC_012784. 81 RTY-RNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR 159 (415) Q Consensus 81 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~ 159 (415) ... ...... .......+....+ ........+.+.||++||+++++.|++.+++.++++.+|++++ T Consensus 80 ~~~~~~~~~~------------~~~~~~~~~~~~~--~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~ 145 (395) T protein:vir:38 80 PLPVKDGKPD------------AQAMKNQFVKDFK--NLVTSGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVEN 145 (395) T ss_pred ccchhhhhHH------------HHHHHHHHHHHHH--HHHhhccCccCCCceecchhHhhHHHHHHHhhcchhhhcceee Confidence 000 000000 0000111111111 1112234455678999999999999999999999999999999 Q ss_pred ccCCceeEEEEeecC-CcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 160 VTNGSGKYPVVRQSE-VAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAAT 238 (415) Q Consensus 160 ~~~~~~~~~~~~~~~-~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~ 238 (415) ++++.+.++++...+ .+.+.|++|++.+|+++.++|+.|+++++|++++++||+|+++|+.++|++||.++|+++++++ T Consensus 146 ~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~ 225 (395) T protein:vir:38 146 VTTSHGSRVYEKLADITPLKDLDDESALIGDNDDPELTVVKYLIHRYAGITTVTNTLLKDTVDNIIQWLVNWAAKKDVVT 225 (395) T ss_pred ccCCcceEEEEeeccCCccccccccccccccccccceeeEEeeeeeeEeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHH Confidence 999999998877655 4567899999999987779999999999999999999999999999999999999999999999 Q ss_pred HHHHHhhccccccccccccccccccccccccchhhHHHHHHHHH-HhhhhccCCCEEEEcHHHHHHHHHhhccCCccccc Q lcl|NC_012784. 239 RNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAIN-LNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQ 317 (415) Q Consensus 239 ~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~ 317 (415) +|.+|++|+|++.+... ...++++.+++. .+...++.+++|+|||.+|..|++++|++|+|+|+ T Consensus 226 ~~~~il~g~g~~~~~~~---------------~~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~ 290 (395) T protein:vir:38 226 RNAKILEVMGKAPKKPT---------------ISQFDNIKDLENNTLDPAIESTSSFITNQSGYNILSKVKDADGRYLMQ 290 (395) T ss_pred HHHHHhhcccccccccc---------------cccHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceeec Confidence 99999999998765322 234778888776 56677888999999999999999999999999999 Q ss_pred CcccCCCCceecceeeEEecccccc-ccCCceEEEechhhcEEEEeecceEEEEeec-----ccCceEEEEEEEeccEEe Q lcl|NC_012784. 318 PDVKEKTQQRLLGAKIEILPDEVLG-QKGNNTLIIGNLKDAIVLFDRSQYQASWTDY-----MHFGECLMIAVRQDCRIL 391 (415) Q Consensus 318 ~~~~~~~~~~l~G~pV~~~~~~~~~-~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~-----~~~~~~~~~~~r~d~~v~ 391 (415) +++.++.+++|+|+||+++++++.+ ..++..++||||+++|+++++++++++++++ .++.+.+|++.|+|+++. T Consensus 291 ~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~i~~gd~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~ 370 (395) T protein:vir:38 291 PDVTSPDKYLIDGKPVIRIADKWLPDVSGSHPLYFGDLKQGITLFDRQQMQIDTTNVGAGSFEHDTTKLRFIDRFDVQLI 370 (395) T ss_pred cCcCCCCcceeccceeEEecccccCcCCCcceEEEEeccccEEEEEecceEEEEeccccchhhcCceEEEEEEeeccEEe Confidence 9999999999999999999887655 3466789999999999999999999998764 355678999999999999 Q ss_pred ccccEEEEEeecCCCCcccccccC Q lcl|NC_012784. 392 DYKSAIVIEYDDSERGEGDLGLEA 415 (415) Q Consensus 392 ~p~a~~~~~~t~~~~~~~~~~~~~ 415 (415) +|+||+.+++++++++-..-..+- T Consensus 371 ~~~a~~~~~~~~~~~~~~~~~~~~ 394 (395) T protein:vir:38 371 DDGAFAAASFKTVANQAQGTAGTG 394 (395) T ss_pred cccceEEEEeecccCCCCCccCCC Confidence 999999999997654433322222 No 24 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=100.00 E-value=1.1e-67 Score=387.76 Aligned_cols=395 Identities=20% Similarity=0.283 Sum_probs=278.8 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhc------- Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDE--LEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENN------- 71 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~--~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~------- 71 (415) || ++||++++.++++++..+.++++....+.+ .++.++..++++++.++++.+++++++........... T Consensus 1 Mk-i~elk~el~~~~~el~~~~~elr~~~~~~~~~~~el~~~~~e~~~~~~ei~el~~~l~~~~~~~~~~~e~~~~~~~~ 79 (437) T protein:vir:10 1 MK-IEKLKKDLATKTAELNTKKAEIRSFTESEDKTIDEVKAGMTEIKEKEDEIKEIRSNIEVLEQASALKVEEKRDDSDL 79 (437) T ss_pred CC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99 889999999999998888777765544222 22333334444444444444333333222111100000 Q ss_pred --------cccccccchhhhhhHHH---------HHHHH----------HHHHHhhhhhHHHHHHHHHHhhhhhhhhccc Q lcl|NC_012784. 72 --------QQSVEVNEARTYRNQAN---------INDLG----------ISIQNTKVTSQEVRDFTEYLETRNDIQGGSL 124 (415) Q Consensus 72 --------~~~~~~~~~~~~~~~~~---------~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 124 (415) ................. ..... ...............+............... T Consensus 80 ~~~e~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~ 159 (437) T protein:vir:10 80 VAPELEENSADNEEDDPEKLKTETKSEAEKDKKTVKDEEKRDAGGLQDMKLKVGGEIADKKVTAFADYLKTGEVRDVTGI 159 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHhHHHHHHHHHHHHhhhhhhHHHHHhhhhhhhhhc Confidence 00000000000000000 00000 0000000001111112222222233344455 Q ss_pred ccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeee Q lcl|NC_012784. 125 KTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINT 204 (415) Q Consensus 125 ~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k 204 (415) +...+|++||.++.+.|. .++..++++.++++++++++++.++++... .+.+.|++|++.+|+++.++|+.|++.+++ T Consensus 160 ~~~~~g~lvp~~~~~~i~-~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~e~~~~~e~~~~~~~~v~~~~~k 237 (437) T protein:vir:10 160 ALKDGKVIIPETILTPEK-EVHQFPRLGSLVRTESVTTTTGKLPIFNNS-TDLLTAHTEYGQTTKNATPVITPILWDLKT 237 (437) T ss_pred ccccccccchHHHHHHHH-HhhhhhhhhhcceeEeeccCceeeEEeecc-ccccccccccccccccccccceeeeeehhh Confidence 677889999999988665 567888999999999999998888887654 457899999999998888999999999999 Q ss_pred EEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhhHHHHHHHHH-H Q lcl|NC_012784. 205 HRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAIN-L 283 (415) Q Consensus 205 ~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 283 (415) ++++++||+|+|+|+.++|++||.++|+++++.+++.+|++|+|++.+.+ +++..++++.+++. . T Consensus 238 ~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~g~g~~~~~~--------------~~~~~~~~~~~~~~~~ 303 (437) T protein:vir:10 238 YTGGYVFSQELISDSSYDWQAELQSRLIELRDNTDDSLIITALTDGIKKT--------------TSTYLLGDLKKVLNVT 303 (437) T ss_pred eeeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc--------------ccccchhhHHHHHHhh Confidence 99999999999999999999999999999999999999999998765432 22334667777665 6 Q ss_pred hhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCcccCCCCceecceeeEEeccc--cccccCCceEEEechhhcEEEE Q lcl|NC_012784. 284 NVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDE--VLGQKGNNTLIIGNLKDAIVLF 361 (415) Q Consensus 284 ~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~--~~~~~~~~~~~~gd~~~~~~~~ 361 (415) +..+|..+++|+|||++|..|++|||++|+|||.++++++.+++|+|+||++++++ |.+++++.+++||||+++|.++ T Consensus 304 l~~~~~~~~~~~~~~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~ 383 (437) T protein:vir:10 304 LKPQDSAAASIVMSQSAYNLFDMATDAMGRPLLQPNVTAATGYTLLGKTVVIVDDKLFPSASAGDVNIVVAPLKKAVINF 383 (437) T ss_pred hhhhhhcCCEEEEcHHHHHHHHHhhccCCCeeeccCccCCCCcccccceeEEecccccCCcCCCceEEEEeeccccEEEE Confidence 66778889999999999999999999999999999999999999999999999876 6667788889999999999999 Q ss_pred eecceEEEEee-cccCceEEEEEEEeccEEeccccEEEEEeec--CCCCccccc Q lcl|NC_012784. 362 DRSQYQASWTD-YMHFGECLMIAVRQDCRILDYKSAIVIEYDD--SERGEGDLG 412 (415) Q Consensus 362 ~~~~~~i~~~~-~~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~--~~~~~~~~~ 412 (415) +|+++++.+++ ++.+.+.+++++|+|+++++|+||+.|+.+. .+...+..+ T Consensus 384 ~r~~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~~~~~~~~~~~~ 437 (437) T protein:vir:10 384 KLTEITGQFQDTYDIWYKQLGIFLRQNVVQASKDLIVNLTGKLKAVTVVQSTAV 437 (437) T ss_pred eeeceEEEEecccccccceeeEEEEEccEEecccceEEEEeeccccccCCCCCC Confidence 99999998875 6677889999999999999999999988553 222222222 No 25 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=100.00 E-value=1.3e-67 Score=387.21 Aligned_cols=378 Identities=27% Similarity=0.386 Sum_probs=284.7 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccch Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) |+ ++|.+++++++++.++++..+++++.++++++.+++++|+++|++.++..+...+... .......... T Consensus 1 M~------k~l~el~~~~~~~~~e~~~~~~~~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~----~~~~~~~~~~ 70 (392) T protein:vir:10 1 MS------KELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERN----NGREVETRNV 70 (392) T ss_pred Cc------HHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh----ccccccccCc Confidence 55 4444555555555556666677788889999999999999999876654433222111 1111111111 Q ss_pred hhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEc Q lcl|NC_012784. 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~ 160 (415) ... ......+........ ...+.+.+. ............+.++||++||+++.+.|++.+++.++|+++++++++ T Consensus 71 ~~~--~~~~~~~~~~l~~~~-~~~~~~~~~--~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~ 145 (392) T protein:vir:10 71 DGE--MEYRDVFMKALRNKP-LNAEEREFL--EDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPV 145 (392) T ss_pred cch--HHHHHHHHHHHhccc-ccHHHHHHH--hhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeec Confidence 110 011111111111111 111111111 112223334445667889999999999999999999999999999999 Q ss_pred cCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) Q Consensus 161 ~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d 240 (415) +++++.+++++..+++.+.|++|++++|+++.++|++|++.++|++++++||+|+++|+.++|++||.++|+++++++++ T Consensus 146 ~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d 225 (392) T protein:vir:10 146 RTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRN 225 (392) T ss_pred cCCceeEEEEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999998777999999999999999999999999999999999999999999999999 Q ss_pred HHHhhccccccccccccccccccccccccchhhHHHHHHHHH-HhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCc Q lcl|NC_012784. 241 KAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAIN-LNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPD 319 (415) Q Consensus 241 ~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~ 319 (415) .+|++|+|++.+. +..+++++++++. .+...++.+++|+|||++|.+|+++||++|||||.++ T Consensus 226 ~~~~~g~g~~~~~----------------~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~ 289 (392) T protein:vir:10 226 VLILGVIEKLTKQ----------------AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSD 289 (392) T ss_pred HHHhhcccccccc----------------CccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecC Confidence 9999998876432 2345788888774 6777888899999999999999999999999999999 Q ss_pred ccCCCCceecceeeEEe-ccc----cccccCCceEEEechhhcEEEEeecceEEEEeec-----ccCceEEEEEEEeccE Q lcl|NC_012784. 320 VKEKTQQRLLGAKIEIL-PDE----VLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDY-----MHFGECLMIAVRQDCR 389 (415) Q Consensus 320 ~~~~~~~~l~G~pV~~~-~~~----~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~-----~~~~~~~~~~~r~d~~ 389 (415) +.++.+++|+|+|++++ +++ +....++..++||||+++|.+++|.+++++++++ .++.+.+|+++|+|++ T Consensus 290 ~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~ 369 (392) T protein:vir:10 290 PTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQ 369 (392) T ss_pred ccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccE Confidence 99999999999876654 333 2334567789999999999999999999998764 3456779999999999 Q ss_pred EeccccEEEEEeecC---CCCcc Q lcl|NC_012784. 390 ILDYKSAIVIEYDDS---ERGEG 409 (415) Q Consensus 390 v~~p~a~~~~~~t~~---~~~~~ 409 (415) +.+|+||+.++++++ .++-| T Consensus 370 v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 370 MWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred EecccceEEEEecccccccCCCC Confidence 999999999998642 22223 No 26 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=100.00 E-value=1.3e-67 Score=387.21 Aligned_cols=378 Identities=27% Similarity=0.386 Sum_probs=284.7 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccch Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) |+ ++|.+++++++++.++++..+++++.++++++.+++++|+++|++.++..+...+... .......... T Consensus 1 M~------k~l~el~~~~~~~~~e~~~~~~~~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~----~~~~~~~~~~ 70 (392) T protein:vir:10 1 MS------KELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERN----NGREVETRNV 70 (392) T ss_pred Cc------HHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh----ccccccccCc Confidence 55 4444555555555556666677788889999999999999999876654433222111 1111111111 Q ss_pred hhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEc Q lcl|NC_012784. 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~ 160 (415) ... ......+........ ...+.+.+. ............+.++||++||+++.+.|++.+++.++|+++++++++ T Consensus 71 ~~~--~~~~~~~~~~l~~~~-~~~~~~~~~--~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~ 145 (392) T protein:vir:10 71 DGE--MEYRDVFMKALRNKP-LNAEEREFL--EDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPV 145 (392) T ss_pred cch--HHHHHHHHHHHhccc-ccHHHHHHH--hhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeec Confidence 110 011111111111111 111111111 112223334445667889999999999999999999999999999999 Q ss_pred cCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) Q Consensus 161 ~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d 240 (415) +++++.+++++..+++.+.|++|++++|+++.++|++|++.++|++++++||+|+++|+.++|++||.++|+++++++++ T Consensus 146 ~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d 225 (392) T protein:vir:10 146 RTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRN 225 (392) T ss_pred cCCceeEEEEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999998777999999999999999999999999999999999999999999999999 Q ss_pred HHHhhccccccccccccccccccccccccchhhHHHHHHHHH-HhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCc Q lcl|NC_012784. 241 KAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAIN-LNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPD 319 (415) Q Consensus 241 ~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~ 319 (415) .+|++|+|++.+. +..+++++++++. .+...++.+++|+|||++|.+|+++||++|||||.++ T Consensus 226 ~~~~~g~g~~~~~----------------~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~ 289 (392) T protein:vir:10 226 VLILGVIEKLTKQ----------------AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSD 289 (392) T ss_pred HHHhhcccccccc----------------CccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecC Confidence 9999998876432 2345788888774 6777888899999999999999999999999999999 Q ss_pred ccCCCCceecceeeEEe-ccc----cccccCCceEEEechhhcEEEEeecceEEEEeec-----ccCceEEEEEEEeccE Q lcl|NC_012784. 320 VKEKTQQRLLGAKIEIL-PDE----VLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDY-----MHFGECLMIAVRQDCR 389 (415) Q Consensus 320 ~~~~~~~~l~G~pV~~~-~~~----~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~-----~~~~~~~~~~~r~d~~ 389 (415) +.++.+++|+|+|++++ +++ +....++..++||||+++|.+++|.+++++++++ .++.+.+|+++|+|++ T Consensus 290 ~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~ 369 (392) T protein:vir:10 290 PTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQ 369 (392) T ss_pred ccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccE Confidence 99999999999876654 333 2334567789999999999999999999998764 3456779999999999 Q ss_pred EeccccEEEEEeecC---CCCcc Q lcl|NC_012784. 390 ILDYKSAIVIEYDDS---ERGEG 409 (415) Q Consensus 390 v~~p~a~~~~~~t~~---~~~~~ 409 (415) +.+|+||+.++++++ .++-| T Consensus 370 v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 370 MWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred EecccceEEEEecccccccCCCC Confidence 999999999998642 22223 No 27 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=100.00 E-value=1.3e-67 Score=387.21 Aligned_cols=378 Identities=27% Similarity=0.386 Sum_probs=284.7 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccch Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) |+ ++|.+++++++++.++++..+++++.++++++.+++++|+++|++.++..+...+... .......... T Consensus 1 M~------k~l~el~~~~~~~~~e~~~~~~~~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~----~~~~~~~~~~ 70 (392) T protein:vir:10 1 MS------KELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERN----NGREVETRNV 70 (392) T ss_pred Cc------HHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh----ccccccccCc Confidence 55 4444555555555556666677788889999999999999999876654433222111 1111111111 Q ss_pred hhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEc Q lcl|NC_012784. 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~ 160 (415) ... ......+........ ...+.+.+. ............+.++||++||+++.+.|++.+++.++|+++++++++ T Consensus 71 ~~~--~~~~~~~~~~l~~~~-~~~~~~~~~--~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~ 145 (392) T protein:vir:10 71 DGE--MEYRDVFMKALRNKP-LNAEEREFL--EDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPV 145 (392) T ss_pred cch--HHHHHHHHHHHhccc-ccHHHHHHH--hhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeec Confidence 110 011111111111111 111111111 112223334445667889999999999999999999999999999999 Q ss_pred cCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) Q Consensus 161 ~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d 240 (415) +++++.+++++..+++.+.|++|++++|+++.++|++|++.++|++++++||+|+++|+.++|++||.++|+++++++++ T Consensus 146 ~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d 225 (392) T protein:vir:10 146 RTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRN 225 (392) T ss_pred cCCceeEEEEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999998777999999999999999999999999999999999999999999999999 Q ss_pred HHHhhccccccccccccccccccccccccchhhHHHHHHHHH-HhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCc Q lcl|NC_012784. 241 KAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAIN-LNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPD 319 (415) Q Consensus 241 ~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~ 319 (415) .+|++|+|++.+. +..+++++++++. .+...++.+++|+|||++|.+|+++||++|||||.++ T Consensus 226 ~~~~~g~g~~~~~----------------~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~ 289 (392) T protein:vir:10 226 VLILGVIEKLTKQ----------------AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSD 289 (392) T ss_pred HHHhhcccccccc----------------CccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecC Confidence 9999998876432 2345788888774 6777888899999999999999999999999999999 Q ss_pred ccCCCCceecceeeEEe-ccc----cccccCCceEEEechhhcEEEEeecceEEEEeec-----ccCceEEEEEEEeccE Q lcl|NC_012784. 320 VKEKTQQRLLGAKIEIL-PDE----VLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDY-----MHFGECLMIAVRQDCR 389 (415) Q Consensus 320 ~~~~~~~~l~G~pV~~~-~~~----~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~-----~~~~~~~~~~~r~d~~ 389 (415) +.++.+++|+|+|++++ +++ +....++..++||||+++|.+++|.+++++++++ .++.+.+|+++|+|++ T Consensus 290 ~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~ 369 (392) T protein:vir:10 290 PTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQ 369 (392) T ss_pred ccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccE Confidence 99999999999876654 333 2334567789999999999999999999998764 3456779999999999 Q ss_pred EeccccEEEEEeecC---CCCcc Q lcl|NC_012784. 390 ILDYKSAIVIEYDDS---ERGEG 409 (415) Q Consensus 390 v~~p~a~~~~~~t~~---~~~~~ 409 (415) +.+|+||+.++++++ .++-| T Consensus 370 v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 370 MWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred EecccceEEEEecccccccCCCC Confidence 999999999998642 22223 No 28 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=100.00 E-value=1.3e-67 Score=387.21 Aligned_cols=378 Identities=27% Similarity=0.386 Sum_probs=284.7 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccch Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) |+ ++|.+++++++++.++++..+++++.++++++.+++++|+++|++.++..+...+... .......... T Consensus 1 M~------k~l~el~~~~~~~~~e~~~~~~~~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~----~~~~~~~~~~ 70 (392) T protein:vir:10 1 MS------KELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERN----NGREVETRNV 70 (392) T ss_pred Cc------HHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh----ccccccccCc Confidence 55 4444555555555556666677788889999999999999999876654433222111 1111111111 Q ss_pred hhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEc Q lcl|NC_012784. 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~ 160 (415) ... ......+........ ...+.+.+. ............+.++||++||+++.+.|++.+++.++|+++++++++ T Consensus 71 ~~~--~~~~~~~~~~l~~~~-~~~~~~~~~--~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~ 145 (392) T protein:vir:10 71 DGE--MEYRDVFMKALRNKP-LNAEEREFL--EDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPV 145 (392) T ss_pred cch--HHHHHHHHHHHhccc-ccHHHHHHH--hhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeec Confidence 110 011111111111111 111111111 112223334445667889999999999999999999999999999999 Q ss_pred cCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) Q Consensus 161 ~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d 240 (415) +++++.+++++..+++.+.|++|++++|+++.++|++|++.++|++++++||+|+++|+.++|++||.++|+++++++++ T Consensus 146 ~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d 225 (392) T protein:vir:10 146 RTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRN 225 (392) T ss_pred cCCceeEEEEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999998777999999999999999999999999999999999999999999999999 Q ss_pred HHHhhccccccccccccccccccccccccchhhHHHHHHHHH-HhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCc Q lcl|NC_012784. 241 KAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAIN-LNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPD 319 (415) Q Consensus 241 ~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~ 319 (415) .+|++|+|++.+. +..+++++++++. .+...++.+++|+|||++|.+|+++||++|||||.++ T Consensus 226 ~~~~~g~g~~~~~----------------~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~ 289 (392) T protein:vir:10 226 VLILGVIEKLTKQ----------------AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSD 289 (392) T ss_pred HHHhhcccccccc----------------CccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecC Confidence 9999998876432 2345788888774 6777888899999999999999999999999999999 Q ss_pred ccCCCCceecceeeEEe-ccc----cccccCCceEEEechhhcEEEEeecceEEEEeec-----ccCceEEEEEEEeccE Q lcl|NC_012784. 320 VKEKTQQRLLGAKIEIL-PDE----VLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDY-----MHFGECLMIAVRQDCR 389 (415) Q Consensus 320 ~~~~~~~~l~G~pV~~~-~~~----~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~-----~~~~~~~~~~~r~d~~ 389 (415) +.++.+++|+|+|++++ +++ +....++..++||||+++|.+++|.+++++++++ .++.+.+|+++|+|++ T Consensus 290 ~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~ 369 (392) T protein:vir:10 290 PTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQ 369 (392) T ss_pred ccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccE Confidence 99999999999876654 333 2334567789999999999999999999998764 3456779999999999 Q ss_pred EeccccEEEEEeecC---CCCcc Q lcl|NC_012784. 390 ILDYKSAIVIEYDDS---ERGEG 409 (415) Q Consensus 390 v~~p~a~~~~~~t~~---~~~~~ 409 (415) +.+|+||+.++++++ .++-| T Consensus 370 v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 370 MWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred EecccceEEEEecccccccCCCC Confidence 999999999998642 22223 No 29 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=100.00 E-value=2.9e-67 Score=385.38 Aligned_cols=382 Identities=21% Similarity=0.362 Sum_probs=280.9 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccch Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) |++++++.+++++..+++.++.++.... .+...++++++.+++++++++++.++++++.+................... T Consensus 1 meeL~~~~~~~~~~~~e~~~~l~~~~~~-~~~~~e~~~~l~~ei~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (389) T protein:vir:10 1 MDKLQTLFNDVSAKCADLNAQLNAKLQD-ENASVDDFQKIKDDLTAAKARRDAINDQIKALEAEKPAEPKTEPKDDGSKK 79 (389) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHh-HhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Confidence 8888777777766666555444432221 122344566667777777777777666665544333222111111111110 Q ss_pred hhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHh--hhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeE Q lcl|NC_012784. 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLE--TRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~ 158 (415) .......... .+...+...++ ..........+.++||++||+++.+.|++.++++++++++|+++ T Consensus 80 ~~~~~~~~~~-------------~~~~~~~~~lr~~~~~~~~~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~ 146 (389) T protein:vir:10 80 GTDLSKKPID-------------AKKKAINDFIHSHGKVIDATSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKT 146 (389) T ss_pred ccccchhHHH-------------HHHHHHHHHhhcchhhhhhhcccccCCcceeehHHHHHHHHHHHHhhhhHHhhccee Confidence 0000000000 00001111111 11122334456678899999999999999999999999999999 Q ss_pred EccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 159 RVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAAT 238 (415) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~ 238 (415) +++++++.+++.+..+ ..+.|++|++++|+.+.++|+.|++.+++++++++||+|+++|+.++|++||.++|+++++++ T Consensus 147 ~~~~~~~~~~~~~~~~-~~~~~~~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~ 225 (389) T protein:vir:10 147 PVTTPKGTYPILKRAT-DRFSSVAELAENPKLAEPEFNKVDWSVATYRGAIPLSEEAIADSAVDLTALVGQSIKEKSVNT 225 (389) T ss_pred eccCCeeEEEEEecCC-CccccccccccccccccccceeeeeeheeeEeeehhhHHHHhhhhHHHHHHHHHHHHHHHHHH Confidence 9999999999888755 566799999999987889999999999999999999999999999999999999999999999 Q ss_pred HHHHHhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccC Q lcl|NC_012784. 239 RNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQP 318 (415) Q Consensus 239 ~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~ 318 (415) ++.+|++|++++.+.+ ..+..+++++.++++...+..+ +++|+|||++|..|++|||++|||||++ T Consensus 226 ~~~~i~~g~~~~~~~~-------------~~~~~~~d~l~~~~~~~~~~~~-~a~~~~n~~~~~~L~~lkd~~G~~i~~~ 291 (389) T protein:vir:10 226 YNAMIAPVLQSFTAKK-------------TTTDTLVDSLKHILNVDLDPAY-SRALVVTQSLFNTLDTLKDKNGRYLLHD 291 (389) T ss_pred HHHHHhhhhccccccc-------------ccccccHHHHHHHHHhhhhhhh-CcEEEecHHHHHHHHHhhccCCCeeeec Confidence 9999999988764321 2334567888888775555444 6899999999999999999999999987 Q ss_pred cccC----CCCceecceeeEEeccc-cccccCCceEEEechhhcEEEEeecceEEEEeecccCceEEEEEEEeccEEecc Q lcl|NC_012784. 319 DVKE----KTQQRLLGAKIEILPDE-VLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDY 393 (415) Q Consensus 319 ~~~~----~~~~~l~G~pV~~~~~~-~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~d~~v~~p 393 (415) ++.+ +.+++|||+||+++++. +...+++.+++||||+++|++++|+++++.++++.+|.+.+|+++|+|+++++| T Consensus 292 ~~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~d~~~~~~ 371 (389) T protein:vir:10 292 ASDSITDGTAKGTILGVPVYVVGDTLLGSLAGDQKAFVGDLKRGVLFTDRQQVTLAWEDSKIYGKYLGAAFRFGVQKADS 371 (389) T ss_pred CcccccccccccccccceeEEecccccCCCCCceEEEEeeccccEEEEeecceEEEeeccccccceEEEEEEeccEEecc Confidence 7644 44579999999887654 444567778999999998999999999999999999999999999999999999 Q ss_pred ccEEEEEeecCCCCcccc Q lcl|NC_012784. 394 KSAIVIEYDDSERGEGDL 411 (415) Q Consensus 394 ~a~~~~~~t~~~~~~~~~ 411 (415) +||+++++++++.+.+.= T Consensus 372 ~a~~~~~~~~~~~~~~~~ 389 (389) T protein:vir:10 372 KAGYFVTNTDVPGSALGK 389 (389) T ss_pred cceEEEEeeccCCCCCCC Confidence 999999999643332221 No 30 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=100.00 E-value=4e-67 Score=384.58 Aligned_cols=384 Identities=25% Similarity=0.413 Sum_probs=277.4 Q ss_pred CCh--------HHHHHHHHHHHH---HHHHHHHHHHHHhhchHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_012784. 1 MKT--------KEELQSEISDIK---RQIDLKVKYATRALNNDE-LEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTS 68 (415) Q Consensus 1 Mk~--------~~el~~~l~~l~---~~~~~~~~~~~~~~~e~~-~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~ 68 (415) |.. ++++++++.+++ +++.++.+++.+.+.+.. .++..++++++++|+.+++.++++++++.+..... T Consensus 1 m~~k~~~l~~~~~el~~~l~eL~e~~~~l~~~~~el~~~~ee~~~~e~~~~~~~~~~~l~~~i~~l~~~i~~~~~~~~~l 80 (397) T protein:vir:96 1 MALKQLILNKQIKERSSEIDKLLSQRSDLEKQENDLERALEEAKTDEEISTVSDSADDLEKQVKDLDEKIAELQKEKQDL 80 (397) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 433 233333333332 334444444444444332 23344555555666666655555554443332222 Q ss_pred hhccccccccchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhh Q lcl|NC_012784. 69 ENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVE 148 (415) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~ 148 (415) ................. ......................+....+..........+...+++.+|+++.+.|++ ++.. T Consensus 81 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~-~~~~ 158 (397) T protein:vir:96 81 EDELAKAADPTDQKPKD-GEKRKMKKFKVTEEELAEKRSAINAFVKSKGAEKRDGFTSVEGGALIPQELLQPQLE-PKDI 158 (397) T ss_pred HHHHHhhhhhhhhhhHH-HHHHHHHHHhhhhHHHHHHHHHHHHHHHhhhhhhhhcccccccccchhHHHHHHHHH-hhhh Confidence 11111111000000000 000111111111111122222233333333334444567778899999999999987 5788 Q ss_pred hhhhhcceeEEccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHH Q lcl|NC_012784. 149 FNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELK 228 (415) Q Consensus 149 ~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~ 228 (415) .+++.++++++++++++.+++++..+ ..+.|++|++..|+.+.++|+.|++.++++++++++|+|+++|+.+++++||. T Consensus 159 ~~l~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~E~~~~~~~~~~~~~~i~~~~~~~~~~~~~s~ell~ds~~~l~~~i~ 237 (397) T protein:vir:96 159 VDLSKYVRSVPVNSASGKFPVISKSG-SKMATVQQLEKNPQLANPKMVEIDYSVATRRGYIPISQEMIDDASYDVTGLIA 237 (397) T ss_pred hhHHHhhhhccccccceeEEEEeccC-CccccccccccccccccccccceeecHhHhhcchhhHHHHHhhhHHHHHHHHH Confidence 89999999999999999999887644 56789999999998788999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhh Q lcl|NC_012784. 229 LWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMK 308 (415) Q Consensus 229 ~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lk 308 (415) ++|+++++.+++.+|++|+|.+.+. +..+++++.++++.....++ +++|+|||++|..|++|| T Consensus 238 ~~l~~~~~~~~~~~i~~g~g~~~~~----------------~~~~~d~~~~~~~~~~~~~~-~a~~v~n~~~~~~l~~lk 300 (397) T protein:vir:96 238 DEIQDQSLNTKNADIAAVLKTATAK----------------SVVGVDGLKDLINKEIKKVY-DVKLFISASMYSELDKLK 300 (397) T ss_pred HHHHHHHHHHHHHHHhhcccccccc----------------cccchHHHHHHHHHhhhhhc-CcEEEEcHHHHHHHHHhh Confidence 9999999999999999998876432 33468888888887666554 789999999999999999 Q ss_pred ccCCcccccCcccCCCCceecceeeEEeccc-cccccCCceEEEechhhcEEEEeecceEEEEeecccCceEEEEEEEec Q lcl|NC_012784. 309 DKLGNYLIQPDVKEKTQQRLLGAKIEILPDE-VLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQD 387 (415) Q Consensus 309 d~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~-~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~d 387 (415) |++|||+|.+++.++.+++|+|+||++++++ +....++.+++||||+++|++++|+++++.++++.+|.+.+|+++|+| T Consensus 301 d~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~d 380 (397) T protein:vir:96 301 DKNGRYLLQDSITAASGKQLLGKEVVVLDDDVIGKSVGNVVGFIGDAKAFASFFDRKQVSVSWVDNNIYGQLLAGIIRYD 380 (397) T ss_pred ccCCCeEeccCccCCCcccccccceEEecccccCCCCCceEEEEeehhcceEeEeecceEEEEecccccceeEEEEEEEc Confidence 9999999999999999999999999987765 445567788999999999999999999999999999999999999999 Q ss_pred cEEeccccEEEEEeecC Q lcl|NC_012784. 388 CRILDYKSAIVIEYDDS 404 (415) Q Consensus 388 ~~v~~p~a~~~~~~t~~ 404 (415) ++|++|+||+.++++++ T Consensus 381 ~~~~~~~a~~~~~~~~a 397 (397) T protein:vir:96 381 VKATDKKAGFYVTFTIG 397 (397) T ss_pred cEEecccceEEEEeecC Confidence 99999999999999998 No 31 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=100.00 E-value=1.3e-66 Score=381.77 Aligned_cols=386 Identities=18% Similarity=0.173 Sum_probs=294.2 Q ss_pred CCh---HHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccc Q lcl|NC_012784. 1 MKT---KEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEV 77 (415) Q Consensus 1 Mk~---~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (415) |+. +++|++++.++.++++...++++...++.+.++++++.+++++|+++++.++++.+.................. T Consensus 1 Mn~~e~lkel~~~~~el~~~~~~~~~~~~~~~~e~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (421) T protein:vir:13 1 MNLFERLKELRAKKKELEEKRCGIVEEIRSLAKEKKEEEARSKALEREKIEARMEIIEEEIESVMTAIDEERKNTNFTGG 80 (421) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Confidence 553 66677888888888888888888888887788899999999999999998887776655444333222222111 Q ss_pred cchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhh--hhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcc Q lcl|NC_012784. 78 NEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLET--RNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYV 155 (415) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~ 155 (415) ................. ..+...++. .........+.++||++||+++.+.|++.+++.++++++| T Consensus 81 ~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~ra~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~l~ 148 (421) T protein:vir:13 81 RVIINGDSKEEKRSLQL------------SAMSKTIRGIQLSEEERDIMSSTNNGAVIPQEFVNEFEKLKEGYPSLKEHC 148 (421) T ss_pred ccccccchhHHHHHHHH------------HHHHHhhhccchhHHHhhccccCCcceecchhhHHHHHHHHHhhhhhhhhc Confidence 11111111111111000 011111100 0011112345667899999999999999999999999999 Q ss_pred eeEEccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHH Q lcl|NC_012784. 156 TVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTI 235 (415) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~ 235 (415) ++++++++++.++++.......+.|++|++.+|+ +.++|+.|++.+++++++++||+|+++|+.++|++||.++|++++ T Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~-s~~~f~~i~~~~~k~~~~v~iS~ell~ds~~~l~~~i~~~la~~~ 227 (421) T protein:vir:13 149 HVIPVNRNAGKMPVRAGASVDKLANLAKDTELVK-AMLKTQPMAYDIDDYGLLAPIDNSLLEDSEINFLEFVNEEFAEFA 227 (421) T ss_pred eeeeccCCceEEEEeecCCccceeeccccccccc-cccceeEEEeeeeeeEeehhhhHHHHhhhHHHHHHHHHHHHHHHH Confidence 9999999999999988888788889999999986 579999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCccc Q lcl|NC_012784. 236 AATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYL 315 (415) Q Consensus 236 ~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l 315 (415) ..+++..+++.. .+.. ..++..++++++++++++..+++.+++|+|||++|..|++|||++|||| T Consensus 228 ~~~~~~~i~~~~-----~g~~----------~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i 292 (421) T protein:vir:13 228 VNTENAEIVKQA-----KAVL----------AEETINDYAGLVKTINSLVPNARKRAIIVTNSDGRAYLDGLMDKQGRPL 292 (421) T ss_pred HHHhhhhHhhhh-----hhcc----------ccccccchHHHHHHHHHhhhhhcCCCEEEEcHHHHHHHHHhhcCCCcee Confidence 999998877421 1111 1223456899999999999999999999999999999999999999999 Q ss_pred ccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecccC---ceEEEEEEEeccEEec Q lcl|NC_012784. 316 IQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHF---GECLMIAVRQDCRILD 392 (415) Q Consensus 316 ~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~---~~~~~~~~r~d~~v~~ 392 (415) |+ ++..+.+++|||+||++++++|.++.+...++||||+++|++++|++++++++++..| .+.+|++.|+|+++++ T Consensus 293 ~~-~~~~~~~~tl~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~f~~~~~~~r~~~r~d~~~~~ 371 (421) T protein:vir:13 293 LK-ELSDGGDLVFKGRPVIELEESIFDVGDETKFIVSDFKTLIKFMDRKQYLIDQSKEAGYTKNETIARIIERFDVNSPL 371 (421) T ss_pred ec-CcCCCCCceecceeeEEeccccccCCCceEEEEEeccccEEEEEecceEEEeecccccccCeeEEEEEeeecceeec Confidence 96 4777888999999999999999888888899999999999999999999999887665 4578999999999999 Q ss_pred cccEEEEEeecCCCCcccccccC Q lcl|NC_012784. 393 YKSAIVIEYDDSERGEGDLGLEA 415 (415) Q Consensus 393 p~a~~~~~~t~~~~~~~~~~~~~ 415 (415) |+||+.+.......-.....+.+ T Consensus 372 ~~a~~~~~~~~~~a~v~~~~~~~ 394 (421) T protein:vir:13 372 DKSSDAEKIRKFGVIVKLQEVLK 394 (421) T ss_pred chhhheeeecccceeeccccccC Confidence 99987665443221111111111 No 32 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=100.00 E-value=1.4e-65 Score=376.18 Aligned_cols=382 Identities=13% Similarity=0.030 Sum_probs=280.8 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHH-HHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccc Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYA-TRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNE 79 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~-~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (415) +.+++++++++.++.+++.+..++. .+.+++++.+++++++.++++|+++|++..+...................... T Consensus 3 ~~~l~~l~e~r~~~~~e~~~L~~~~~~~~lt~e~~~~~~~l~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~- 81 (390) T protein:vir:62 3 ATTLSANFEARERATAELRTLTDEFAGKEMTDEAREKEERLITAVSDYDARIKRGIEAIKAIDPVTSLLSGLQGSGSGA- 81 (390) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc- Confidence 3446788888877777766665543 34677888889999999999999999877666544433222221111111100 Q ss_pred hhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEE Q lcl|NC_012784. 80 ARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR 159 (415) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~ 159 (415) .... ........+.. . ..+.+.... ........+..++++++|+.+...|++.++..++++.++++.+ T Consensus 82 -~~~~-~~~~~~~~r~~---~--~~~~r~~~~-----~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~ 149 (390) T protein:vir:62 82 -QRSA-DVDDDATLRAG---N--LGEARSFEF-----APEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGATTFT 149 (390) T ss_pred -hhhc-chHHHHHHhhh---h--hhhhHHHHh-----hhhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeee Confidence 0000 00000010000 0 000111000 0111122333445555555555666677788888888999988 Q ss_pred ccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 160 VTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATR 239 (415) Q Consensus 160 ~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~ 239 (415) ++++ ..+.+++.++.+.+.|++|++.+|++ +++|++++++++|++++++||+|+++|+.+++++||.++|+++++.++ T Consensus 150 ~~~~-~~~~~p~~~~~~~a~wv~E~~~~~~~-~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~ 227 (390) T protein:vir:62 150 TSDA-NPLDFTVITGRSSASIVGETAEIPES-YPATAQRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAM 227 (390) T ss_pred cCCC-ceeEEEEEcCCcceeeeccccccccc-ccceeeeEeeeeeEEeehHHHHHHHhhhhHHHHHHHHHHHHHHHHHHH Confidence 7644 45778888888899999999999975 689999999999999999999999999999999999999999999999 Q ss_pred HHHHhhccccccccccccccccc--cccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCccccc Q lcl|NC_012784. 240 NKAIIDVITKGSTGSTSSGFEKE--GKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQ 317 (415) Q Consensus 240 d~~il~g~g~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~ 317 (415) |.+|++|+|. |.++....... .......+..++++++++++.+...|..+++|+|||+++..|++|||++|||||+ T Consensus 228 d~~~l~G~G~--p~Gi~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~ 305 (390) T protein:vir:62 228 GRHFITGTGQ--PRGILTDASPATATFLATDTDSKVSDALIDLFHEVPSAYRANAKYVVNDLRAAQMRKLKDANGQYLWQ 305 (390) T ss_pred HhhhhccCCc--cccccccccccccceecccccccchHHHHHHHHhhhhhhhcCCEEEEchHHHHHHHHhhccCCCeeec Confidence 9999999875 34443332221 2222333456789999999999999999999999999999999999999999999 Q ss_pred CcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeeccc---CceEEEEEEEeccEEeccc Q lcl|NC_012784. 318 PDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMH---FGECLMIAVRQDCRILDYK 394 (415) Q Consensus 318 ~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~---~~~~~~~~~r~d~~v~~p~ 394 (415) +++..+.+++|+|+||++++++|.. .++||||++ |.+++++++++..+.+.. +.+.+|++.|+|+++++|+ T Consensus 306 ~~~~~g~~~~l~G~Pv~~~~~~p~~-----~i~~gd~s~-~~i~~~~~~~v~~~~~~~~~~~~~~~~~~~r~d~~~~~~~ 379 (390) T protein:vir:62 306 SGLTVGAPSLFNGKVVETDDGMPAD-----KILFADLSK-YRVRFAGSLRVDRSVDAKFSTDQIVYRFLQRADGLLVDAR 379 (390) T ss_pred CCcCCCccceecccceEEecCCCCc-----cEEEeeccc-eeEEeecceEEEeeccccccCCcEEEEEEEEeCcEeechh Confidence 9999999999999999999999853 488999997 678889999999877554 4567899999999999999 Q ss_pred cEEEEEeecCC Q lcl|NC_012784. 395 SAIVIEYDDSE 405 (415) Q Consensus 395 a~~~~~~t~~~ 405 (415) ||+.+++++++ T Consensus 380 A~~~l~~~~~a 390 (390) T protein:vir:62 380 GAKVLTVTPGA 390 (390) T ss_pred heEEEEeecCC Confidence 99999999988 No 33 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=100.00 E-value=2.1e-65 Score=375.20 Aligned_cols=383 Identities=13% Similarity=0.053 Sum_probs=283.1 Q ss_pred CCh--HHHHHHHHHHHHHHHHHHHHHHH-HhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccc Q lcl|NC_012784. 1 MKT--KEELQSEISDIKRQIDLKVKYAT-RALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEV 77 (415) Q Consensus 1 Mk~--~~el~~~l~~l~~~~~~~~~~~~-~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (415) |.+ +++|+++++++.+++.+..++.. +.+++++.+++++++.++++|+++|++..+..+.................. T Consensus 1 m~~~~l~~l~e~r~~~~~e~~~l~~~~~~~~~~~e~~~~~~~l~~e~~~l~~~i~~~~e~~~~~~~~~~~~~~~~~~~~~ 80 (392) T protein:vir:13 1 MDATTLSANFEARERATAELRSLTDEFAGKEMTAEAREKEERLLTAVADFDGRIKRGIDAIKATDAVTSLLSGLQGSGSG 80 (392) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCcccc Confidence 665 57888888888777777666543 566777888899999999999999876555443322222211111111110 Q ss_pred cchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHH-Hhhhhhhhhcce Q lcl|NC_012784. 78 NEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKL-KEVEFNLDKYVT 156 (415) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~-~~~~~~l~~~~~ 156 (415) ... .. ........+... ..+.+..... ......+.+++|.++|+++...++.. +...++++.+++ T Consensus 81 --~~~--~~--~~~~~~~~r~g~--~~~~~~~~~~------~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~ 146 (392) T protein:vir:13 81 --AQR--SA--DHDDDAVLRAGN--LGEARSFEFA------PEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGAS 146 (392) T ss_pred --hhh--hh--hHHHHHHHhccc--hhhhHHHHhh------hhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcce Confidence 000 00 000000000000 0111111110 11112234444556666666666654 555566777888 Q ss_pred eEEccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHH Q lcl|NC_012784. 157 VKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIA 236 (415) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~ 236 (415) ++++.+ ...+.++..++.+.+.|++|++++|++ .++|+.+++.++|++++++||+|+|+|+.++|++||.++|+++++ T Consensus 147 ~~~~~~-~~~~~~~~~~~~~~a~~v~E~~~~~~~-~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~ 224 (392) T protein:vir:13 147 TFTTSD-ANPMDFTVITGRATAGIVGETAEIPES-YPATTQRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIG 224 (392) T ss_pred eeecCC-CceeEEEEEcCCcceeeeccccccccc-ccceeeEEeeeeeEEeeehhHHHHHhcchHHHHHHHHHHHHHHHH Confidence 877654 346778888888999999999999976 689999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhhccccccccccccccccc--cccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcc Q lcl|NC_012784. 237 ATRNKAIIDVITKGSTGSTSSGFEKE--GKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNY 314 (415) Q Consensus 237 ~~~d~~il~g~g~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~ 314 (415) +++|.+||+|+|++.|.++....... .......+..++++++++++.+...++.+++|+|||+++..|++|+|++|+| T Consensus 225 ~~~d~~~l~G~Gt~~p~Gil~~~~~~~~~~~~~~~~~~~~d~l~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~ 304 (392) T protein:vir:13 225 DAMGRHFLTGTGTGQPRGILTDATGANAAFGEADADSKVSDALIDLFHEVPSAYRKNAKFVVNDLRAAQMRKLKDANGQY 304 (392) T ss_pred HHHHHHHhcccCCccccccccccccccccccccccccccHHHHHHHHHhhhhhhhcCCEEEEcHHHHHHHHHhhccCCce Confidence 99999999999998877665443222 2223334566799999999999999999999999999999999999999999 Q ss_pred cccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeeccc---CceEEEEEEEeccEEe Q lcl|NC_012784. 315 LIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMH---FGECLMIAVRQDCRIL 391 (415) Q Consensus 315 l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~---~~~~~~~~~r~d~~v~ 391 (415) ||+++++.+.+++|+|+||++++++|.+ .++||||++ |.++++++++++.+.+.+ +.+.+|++.|+|+++. T Consensus 305 l~~~~~~~g~~~~l~G~Pv~~~~~~~~~-----~i~~Gdf~~-~~i~~~~~~~i~~~~~~~~~~~~~~~r~~~r~d~~~~ 378 (392) T protein:vir:13 305 LWQSALTVGAPDTFNGKVVETDDGMPAD-----KVLFADLSK-YRVRFAGSLRVDRSVDAKFSTDQIVYRFLQRADGLLV 378 (392) T ss_pred eecCCcCCCCCceecceeeEEcCCCCCC-----cEEEeeccc-eeEEeecceEEEeeccccccCCcEEEEEEEEeccEEe Confidence 9999999999999999999999999854 489999997 678899999998876554 4567999999999999 Q ss_pred ccccEEEEEeecCC Q lcl|NC_012784. 392 DYKSAIVIEYDDSE 405 (415) Q Consensus 392 ~p~a~~~~~~t~~~ 405 (415) +|+||+.+++++++ T Consensus 379 ~~~A~~~~~~~~aa 392 (392) T protein:vir:13 379 DARGAKVLTVTPAA 392 (392) T ss_pred cccceEEEEeeccC Confidence 99999999999988 No 34 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=100.00 E-value=4e-65 Score=373.63 Aligned_cols=392 Identities=13% Similarity=0.102 Sum_probs=266.5 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccch Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) |+++++++++++++.+++++..++......+. .+..++.++.++++.+++++++++++.+.+............. . T Consensus 21 ~~~~~e~~~~l~~~~~e~~~~~e~~~~e~~~~-~~~~~e~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~---~ 96 (418) T protein:vir:10 21 EQVLETVTKELKRIGDEVKSAGEKALAEAKRA-GDLGVETKATVDELLIKQGELQARLLEAEQKLARGGGSAELET---P 96 (418) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccch---h Confidence 33344444443333333322222111110000 0011223333444444444444444333332222111111000 0 Q ss_pred hhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEc Q lcl|NC_012784. 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~ 160 (415) ........................ ..................+..++|++||+++++.|++.+++.++++++++++++ T Consensus 97 ~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~~~ 174 (418) T protein:vir:10 97 KTLGQLVTESEEMKGMDGSARKSV--RVRVDRKSIMNVPATVGSGVSGSNSLVVADRQAGIIAPPQRKMTIRDLLMPGQT 174 (418) T ss_pred hhhhHHhhhHHHHHHHHHHHhhhh--hhhhHHHHHHHhhhhccCCCCCCccccchhHHHHHHHHHhhhhhHHhhcceeec Confidence 000000000000000000000000 000011111122233345666788899999999999999999999999999999 Q ss_pred cCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) Q Consensus 161 ~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d 240 (415) ++++..+++.. ..++.+.|++|++++|++ +++|++|++.+++++++++||+|+++|++ ++++||.++|++++++++| T Consensus 175 ~~~~~~~~~~~-~~~~~a~~v~E~~~~~~~-~~~f~~v~~~~~k~~~~~~is~ell~ds~-~l~~~i~~~l~~a~~~~~d 251 (418) T protein:vir:10 175 SSSSIEYTVET-GFTNNAAAVAEGAQKPTS-DLKFNLKNQPVRTIAHLFKASRQILDDAP-ALQSYIDGRARYGLQLTEE 251 (418) T ss_pred cCCceeEEEEe-cCCCceeeeccCcccccc-ccceeeEEEeeeeEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHH Confidence 87766665543 235688999999999875 68999999999999999999999999986 7999999999999999999 Q ss_pred HHHhhccccccc-cccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCc Q lcl|NC_012784. 241 KAIIDVITKGST-GSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPD 319 (415) Q Consensus 241 ~~il~g~g~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~ 319 (415) .+||+|+|++.. .++.............++...++++++++..+...++.+++|+|||.+|..|++++|++|+|||. + T Consensus 252 ~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~-~ 330 (418) T protein:vir:10 252 GQILKGDGTGANILGILPQASAFMPSITLANATPIDKIRLALLQAVLAEFPATGIVLNPIDWASIELTKDSQGRYIVG-N 330 (418) T ss_pred HHHhccCCCCccccccccccccccccccccccccHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceecc-c Confidence 999999998763 33333333333444445566789999999999999999999999999999999999999999995 6 Q ss_pred ccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecc-----cCceEEEEEEEeccEEeccc Q lcl|NC_012784. 320 VKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM-----HFGECLMIAVRQDCRILDYK 394 (415) Q Consensus 320 ~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-----~~~~~~~~~~r~d~~v~~p~ 394 (415) +.++.+++|+|+||+++++||.+ .++||||+++|+++++.++++.++++. ++.+.+|++.|+|+++++|+ T Consensus 331 ~~~~~~~~l~G~pV~~~~~~p~~-----~~~~gd~s~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~~~d~~~~~~~ 405 (418) T protein:vir:10 331 PVNGTTPRLWNLPVVETQAMTAN-----EFLVGAFSMAAQIFDRMEIEVLLSTENVDDFEKNMVSIRAEERLALAVYRPE 405 (418) T ss_pred cccCCCceecceeeEEcCCCCCC-----cEEEeeccceEEEEEecceEEEEecccchhhhcCceEEEEEEeeccEEeccc Confidence 77788899999999999999865 379999998899999999999987654 45667999999999999999 Q ss_pred cEEEEEeecCCCC Q lcl|NC_012784. 395 SAIVIEYDDSERG 407 (415) Q Consensus 395 a~~~~~~t~~~~~ 407 (415) ||+++++++++.| T Consensus 406 a~~~~~~~~~~~g 418 (418) T protein:vir:10 406 SFVTGALVEQAGG 418 (418) T ss_pred ceEEEEeccCCCC Confidence 9999999999888 No 35 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=100.00 E-value=1.9e-64 Score=369.97 Aligned_cols=381 Identities=12% Similarity=0.142 Sum_probs=283.5 Q ss_pred CChHHH-HHHHHHHHHHHHHHHHHHHHH--hhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccc Q lcl|NC_012784. 1 MKTKEE-LQSEISDIKRQIDLKVKYATR--ALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEV 77 (415) Q Consensus 1 Mk~~~e-l~~~l~~l~~~~~~~~~~~~~--~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (415) |+.+.+ |++++.++.+++++..++..+ .+.++..+++++++++++.|+++|+++++.+.+............... T Consensus 1 m~~l~~~l~~~~~~~~~~~~~~~e~~~~~~~~~~e~~~~~~~l~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~-- 78 (390) T protein:vir:81 1 MTDITSKLEATLANVTDSLRAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVSV-- 78 (390) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccc-- Confidence 998866 778888888887776655443 355677788899999999999999887766554333222111111110 Q ss_pred cchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhccee Q lcl|NC_012784. 78 NEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTV 157 (415) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~ 157 (415) ................... . ........+.. .......+...+|+++|+++...|++.+++.+++++++++ T Consensus 79 --~~~~~~~~~~~~~~~~~~~-----~-~~~~~~~~~~~-~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~ 149 (390) T protein:vir:81 79 --GDMFVASEQFQASAGRWND-----R-SARATMNIKAA-LNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGS 149 (390) T ss_pred --hhhhhhhHHHHHHHHHHhh-----h-hhhhhhHHHHH-HHhhccccccCCcceechhhhHHHHHHHhhhhhhhhhcce Confidence 0000000000000000000 0 00000111111 1122334556777888889999999999999999999999 Q ss_pred EEccCCceeEEEEeecCC-cccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHH Q lcl|NC_012784. 158 KRVTNGSGKYPVVRQSEV-AALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIA 236 (415) Q Consensus 158 ~~~~~~~~~~~~~~~~~~-~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~ 236 (415) .+++++...++ +..+. +.+.|++||+++|++ .++|+++++.+++++++++||+|+++|++ ++++||.++|+++++ T Consensus 150 ~~~~~~~~~~~--~~~~~~~~a~~v~Eg~~~~~~-~~~~~~i~~~~~k~~~~~~is~ell~d~~-~~~~~i~~~l~~~~~ 225 (390) T protein:vir:81 150 GRTDSALIEYV--QETGFVNNAAIVAEGALKPES-SLKFAKKTDTTHVIAHTMKATRQILSDAP-QLASYMNNRLIRGLK 225 (390) T ss_pred eeccCCceEEE--EEecCCcceeeecCCcccccc-cceeeEEEEeeeEEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHH Confidence 99887665554 44443 578999999999975 58999999999999999999999999986 799999999999999 Q ss_pred HHHHHHHhhcccccccc-ccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCccc Q lcl|NC_012784. 237 ATRNKAIIDVITKGSTG-STSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYL 315 (415) Q Consensus 237 ~~~d~~il~g~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l 315 (415) +++|.+||+|+|++.+. ++..............+...++++++++..+...++.+++|+|||++|..|+++||++|+|| T Consensus 226 ~~~d~a~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l 305 (390) T protein:vir:81 226 VKEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYNPSGIVINPIDWAAIELAKDANNQYL 305 (390) T ss_pred HHHHHHHHhcCCCCCcccceeecccccccccccccchhHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCcee Confidence 99999999999988743 33333333333444556677899999999999999999999999999999999999999999 Q ss_pred ccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeec----ccCceEEEEEEEeccEEe Q lcl|NC_012784. 316 IQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDY----MHFGECLMIAVRQDCRIL 391 (415) Q Consensus 316 ~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~----~~~~~~~~~~~r~d~~v~ 391 (415) |.+ +..+.+++|+|+||++++++|.+ .++||||+++|.++++++++++++++ .++.+.+|++.|+|+++. T Consensus 306 ~~~-~~~~~~~~l~G~pv~~~~~~p~~-----~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~v~~r~~~r~d~~v~ 379 (390) T protein:vir:81 306 IGN-ARGTLTPTLWGLPVVATQAMAPG-----EFLVGAFDLAAQIFDQWDARVEIGYVGEDFQRNMITVLAEERLALVVY 379 (390) T ss_pred ecC-cccccCceecceeeEEcCCCCCC-----cEEEEehhceEEEEEecceEEEEecccchhhcCcEEEEEEEeeccEEe Confidence 975 45666779999999999999865 37999999988999999999998764 235567999999999999 Q ss_pred ccccEEEEEee Q lcl|NC_012784. 392 DYKSAIVIEYD 402 (415) Q Consensus 392 ~p~a~~~~~~t 402 (415) +|+||++++|. T Consensus 380 ~~~a~v~~t~a 390 (390) T protein:vir:81 380 RPEALISGSFA 390 (390) T ss_pred cccceEEEEeC Confidence 99999999999 No 36 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=100.00 E-value=3.1e-64 Score=368.78 Aligned_cols=396 Identities=14% Similarity=0.156 Sum_probs=264.3 Q ss_pred CChHHHH--------HHHHHHHHH---HHHHHHHHHHHhhchHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_012784. 1 MKTKEEL--------QSEISDIKR---QIDLKVKYATRALNNDE-LEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTS 68 (415) Q Consensus 1 Mk~~~el--------~~~l~~l~~---~~~~~~~~~~~~~~e~~-~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~ 68 (415) |+-.+.+ ++++.++.+ ++.++..++.+.+++.. .++...+.++++.++++++.+++....++...... T Consensus 1 ~~~~~~~~~~el~~~~~~l~el~~~~~el~~~~~el~~~~e~ak~eee~~~l~~ei~~le~e~~~l~~~~~~le~~~~~~ 80 (425) T protein:vir:95 1 MALRQLMLTKKIEQRKAALDELVKREQELQAKAAELEQAIEEAQTEEEVSAVEEEVAKLEDERNELNEKKSKLEGEIAQL 80 (425) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5444322 222333322 23333334444433322 23445555566555555555444443333222211 Q ss_pred hhcccc------ccccchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHH Q lcl|NC_012784. 69 ENNQQS------VEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDIL 142 (415) Q Consensus 69 ~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii 142 (415) ...... ........................... ......................+.+++|++||+++.+.|+ T Consensus 81 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~Ii 159 (425) T protein:vir:95 81 EDELEQINSKQPSNQSRQKMQGSKGDVVEMNRLQVREML-KTGEYYKRSEVVEFYEKFRNLRAVAGGELTIPEVVVNRIM 159 (425) T ss_pred HHHHHHhhhhccchhhhhhhhhhhhhHHHHHHHHHHHHH-hhhhhhhhhHHHHHHHHHHhhcccccCceeccHHHHHHHH Confidence 111000 000000000000000000000000000 0000000111111112222334556789999999999999 Q ss_pred HHHhhhhhhhhcceeEEccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHH Q lcl|NC_012784. 143 KLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVN 222 (415) Q Consensus 143 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~ 222 (415) +.+++.++++++++++++++ .+.+++..+.+.+.|++|++++|+.+.++|++|++++++++++++||+|+++|+.++ T Consensus 160 ~~l~~~~~i~~~~~~~~~~g---~~~ip~~~~~~~a~~v~E~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~ 236 (425) T protein:vir:95 160 DIMGDYTTLYPLVDKIRVKG---TTRILVDTDTSPATWIEQSGALPTGDVGTIASIDFDGFKVGKVTFVDNYLLQDSIIN 236 (425) T ss_pred HHHHhhhhHHHhhceeecCc---eeEEEEecCCccccccccccccccccccccceeeeeheeeeeeehhhHHHHhccHHH Confidence 99999999999999988753 455667788899999999999998777899999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhccccccc--cccccccccccccccccchhhHHHHHHHHHHhhhhcc--CCCEEEEcH Q lcl|NC_012784. 223 VLQELKLWMARTIAATRNKAIIDVITKGST--GSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNY--EHNVAIVSQ 298 (415) Q Consensus 223 l~~~l~~~la~~~~~~~d~~il~g~g~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~v~~~ 298 (415) |++||.++|+++++.++|.+||+|+|++.+ .++..........+......+++++.+++..+..++. .+++|+||+ T Consensus 237 l~~~i~~~l~~~i~~~~d~~il~G~G~~~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ 316 (425) T protein:vir:95 237 LDDYVTKKIARAIAKALDLAIVKGTGAANKQPLGIIPSLPPENQVTVEADNNLLKNLVKQIGLIDTGDDSVGEIVAVMKR 316 (425) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccccccccccccchHHHHHHHHHhhhhhccccCceEEEEeC Confidence 999999999999999999999999997744 4444333333333444567788999999888776654 567899999 Q ss_pred HHH-H---HHHHhhccCCcccccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecc Q lcl|NC_012784. 299 TMF-A---KLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM 374 (415) Q Consensus 299 ~~~-~---~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~ 374 (415) .++ . .|+.++|++|||+|+.. .+..++|+|+||+++++||.+ .++||||++ |.+++|+++++.++++. T Consensus 317 ~~~~~~l~~l~~~kd~~g~~i~~~~--~~~~~~l~G~pvv~~~~~~~~-----~i~~Gd~~~-~~~~~~~~~~i~~~~~~ 388 (425) T protein:vir:95 317 STYYNRLVEFSIQVDSNGNVVGKLP--NLRTPDLLGLRVVFNNFLDDD-----TVLFGEFEQ-YTLVERENITIDSSTHV 388 (425) T ss_pred hHHHHHHHHHHhhcCCCCceeeccC--CCCCccccceeeEEcCcCCCc-----cEEEEeccc-EEEEeecceEEEeeccc Confidence 874 3 46678999999999743 344568999999999999854 489999997 67788999999998875 Q ss_pred cC---ceEEEEEEEeccEEeccccEEEEEeecCCCCc Q lcl|NC_012784. 375 HF---GECLMIAVRQDCRILDYKSAIVIEYDDSERGE 408 (415) Q Consensus 375 ~~---~~~~~~~~r~d~~v~~p~a~~~~~~t~~~~~~ 408 (415) +| .+.+|++.|+|+++++|+||+++++++|..|. T Consensus 389 ~f~~~~~~~~~~~r~d~~~~~~~a~~~~~i~~~~~g~ 425 (425) T protein:vir:95 389 KFTEDQTAFRGKGRFDGKPVKPEAFVLVTITDPVQGA 425 (425) T ss_pred ccccCceEEEEEEeeCcEeecccceEEEEecCcCCCC Confidence 54 55789999999999999999999999999888 No 37 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=100.00 E-value=4.1e-64 Score=368.10 Aligned_cols=382 Identities=12% Similarity=0.126 Sum_probs=280.0 Q ss_pred CChHHH-HHHHHHHHHHHHHHHHHHHHH--hhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccc Q lcl|NC_012784. 1 MKTKEE-LQSEISDIKRQIDLKVKYATR--ALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEV 77 (415) Q Consensus 1 Mk~~~e-l~~~l~~l~~~~~~~~~~~~~--~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (415) |+.+.+ |++++.++.++++...++..+ .++++...+++++++++++|+++|++++++.++............... T Consensus 1 m~e~~~~l~~~~~~~~~~~~~~~e~~~~~~~~~~e~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~-- 78 (390) T protein:vir:10 1 MTDITSKLEATLANVTDSLRAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVSV-- 78 (390) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccch-- Confidence 888666 677788887777776655443 355677778899999999999999887776655433222111111110 Q ss_pred cchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhccee Q lcl|NC_012784. 78 NEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTV 157 (415) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~ 157 (415) .....+.. ............ . ............ .....+...+|.++|+++...|++.+++.+++++++++ T Consensus 79 -~~~~~~~~-~~~~~~~~~~~~-----~-~~~~~~~~~~~~-~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~ 149 (390) T protein:vir:10 79 -GDLFVASE-QFQASAGRWNDR-----S-ARATMNIKAALN-TASTDAAGSAGALTTPNRLPGFITQPDARLTVRDLIGS 149 (390) T ss_pred -hhhhhhhH-HHHHHHHhhhhh-----h-hhhhhHHHHHHH-hhhcccccccccccchhHHHHHHHHHHhhchhhhhcce Confidence 00000100 000000000000 0 000011111111 22233444556677888889999999999999999999 Q ss_pred EEccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 158 KRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAA 237 (415) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~ 237 (415) +++++++..+++... ..+.+.|++|++++|+. .++|+.+++.+++++++++||+|+++|++ ++++||.++|++++++ T Consensus 150 ~~~~~~~~~~~~~~~-~~~~a~~v~Eg~~~~~~-~~~~~~i~~~~~k~~~~~~is~ell~d~~-~l~~~i~~~l~~~~~~ 226 (390) T protein:vir:10 150 GRTDSALIEYVQETG-FVNNAAIVAEGALKPES-SLKFAKKTDTTHVIAHTMKATRQILSDAP-QLASYMNNRLIRGLKV 226 (390) T ss_pred eeccCCceEEEEEec-CCcceeeecCCcccccc-ccceeEEEEeeEEEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHH Confidence 998877656554332 23578999999999975 68999999999999999999999999986 7999999999999999 Q ss_pred HHHHHHhhccccccc-cccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccc Q lcl|NC_012784. 238 TRNKAIIDVITKGST-GSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLI 316 (415) Q Consensus 238 ~~d~~il~g~g~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~ 316 (415) ++|.+||+|+|++.. .++..............+...++++++++..+..+++.+++|+|||++|..|++++|++|+||| T Consensus 227 ~~~~~il~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~g~~l~ 306 (390) T protein:vir:10 227 KEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYPASGIVINPIDWAAIELAKDANNQYLI 306 (390) T ss_pred HHHHHHhhcCCCCccccccccccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceee Confidence 999999999998764 3333322223333444556678999999999999999999999999999999999999999999 Q ss_pred cCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeec-c---cCceEEEEEEEeccEEec Q lcl|NC_012784. 317 QPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDY-M---HFGECLMIAVRQDCRILD 392 (415) Q Consensus 317 ~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~-~---~~~~~~~~~~r~d~~v~~ 392 (415) ++.. .+.+++|+|+||++++.+|.+ .++||||+++|.+++++++++++++. . ++.+.+|++.|+|+++++ T Consensus 307 ~~~~-~~~~~~l~G~pv~~~~~~p~~-----~~~~gdf~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~~~r~d~~v~~ 380 (390) T protein:vir:10 307 GNAR-GTLTPTLWGLPVVATQAMAPG-----EFLVGAFDLAAQIFDQWDARVEIGYVNDDFQRNMVTVLAEERLALVVYR 380 (390) T ss_pred cCCc-CcCCceecceeeEEcCCCCCC-----cEEEEeccceEEEEEecceEEEEeecccccccCcEEEEEEEeeccEEec Confidence 8654 455679999999999999864 47999999989999999999988764 2 355678999999999999 Q ss_pred cccEEEEEee Q lcl|NC_012784. 393 YKSAIVIEYD 402 (415) Q Consensus 393 p~a~~~~~~t 402 (415) |+||+++++. T Consensus 381 ~~a~~~~~~a 390 (390) T protein:vir:10 381 PEALISGSFA 390 (390) T ss_pred cccEEEEEeC Confidence 9999999999 No 38 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=100.00 E-value=4.3e-64 Score=368.01 Aligned_cols=395 Identities=17% Similarity=0.164 Sum_probs=271.8 Q ss_pred CChHHHHHHHHHHHHHHHHHHHH--HHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccc--cc- Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVK--YATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQ--SV- 75 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~--~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~--~~- 75 (415) |+++++|++++.++.+++++..+ ...+.+++++.++++++++++++|+++|+++++..+.............. .. T Consensus 1 M~kl~~L~e~r~~l~~~~~~l~~~~~e~~~lt~ee~~~~~~l~~e~~~l~~~i~~~e~~e~~~~~~~~~~~~~~~~~~~~ 80 (428) T protein:vir:10 1 MPQIEELRRQRAGINEQIQALATIEATNGTLTAEQLTEFAGLQQQFTDISAKMDRMEATERAAALVAKPVKATQHGPAVI 80 (428) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhchhhccccc Confidence 99999999999988777665443 33456888999999999999999999999877654433322221111110 10 Q ss_pred cccchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHH-HHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhc Q lcl|NC_012784. 76 EVNEARTYRNQANINDLGISIQNTKVTSQEVRDFT-EYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKY 154 (415) Q Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~ 154 (415) ...+........ ...................... ..............+.+.||++||+++.++|++.+++.++++++ T Consensus 81 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~liP~~~~~~ii~~l~~~~~l~~~ 159 (428) T protein:vir:10 81 VKAEPKQYTGAG-MTRMVMSIAAAQGNLQDAAKFASDELNDQSVSMAISTAAGSGGVLIPQNIHSEVIELLRDRTIVRKL 159 (428) T ss_pred cccccchhhhHH-HHHHHHHHHHhhhhHHHHHHHhhhhhhhhhHhhhhcccccCCccccchhHHHHHHHHHhhhchhhhh Confidence 111111111111 1111111100000000111111 11111112222233445678999999999999999999999998 Q ss_pred -ceeEEccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHH Q lcl|NC_012784. 155 -VTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMAR 233 (415) Q Consensus 155 -~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~ 233 (415) ++++++++ +.+.+|+.++++.+.|++||+.+|++ +++|++|++.+++++++++||+|+++|+.++|++||.++|++ T Consensus 160 ~~~~~~~~~--g~~~~p~~~~~~~a~~v~Eg~~~~~~-~~~f~~i~~~~~k~~~~v~is~ell~ds~~~l~~~i~~~l~~ 236 (428) T protein:vir:10 160 GARSIPLPN--GNMSLPRLAGGATASYTGENQDAKVS-EARFDDVKLTAKTMIAMVPISNALIGRAGFNVEQLVLQDILT 236 (428) T ss_pred cceeeecCC--cceEEEEEeCCcceeeeccCcccccc-ccceeeEEeeeEEEEEeehhhHHHHhhhhHHHHHHHHHHHHH Confidence 56666544 45666677788899999999999975 699999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhhcccccc-cccccccccccc--ccccccchhhHH---HHHH---HHHHhhhhccCCCEEEEcHHHHHHH Q lcl|NC_012784. 234 TIAATRNKAIIDVITKGS-TGSTSSGFEKEG--KKLEVKKAKSLD---DIKD---AINLNVKPNYEHNVAIVSQTMFAKL 304 (415) Q Consensus 234 ~~~~~~d~~il~g~g~~~-~~~~~~~~~~~~--~~~~~~~~~~~~---~~~~---~~~~~~~~~~~~~~~v~~~~~~~~l 304 (415) ++++++|++||+|+|++. |.++........ .........+++ .+.+ ........+..+++|+|||.+|..| T Consensus 237 ai~~~~d~~~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L 316 (428) T protein:vir:10 237 AISVREDKAFMRDDGTGDTPIGMKARATQWNRLLPWAADAAVNLDTIDTYLDSIILMSMDGNSNMISSGWGMSNRTYMKL 316 (428) T ss_pred HHHHHHHHHHhccCCCCccccccccccccccccccccccccccHHHHHHHHHHHHHhhhccccccccCEEEEcHHHHHHH Confidence 999999999999999863 433332211111 111111222222 2222 3334445667789999999999999 Q ss_pred HHhhccCCcccccCcccCCCCceecceeeEEeccccccc---cCCceEEEechhhcEEEEeecceEEEEeecc------- Q lcl|NC_012784. 305 DKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQ---KGNNTLIIGNLKDAIVLFDRSQYQASWTDYM------- 374 (415) Q Consensus 305 ~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~---~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~------- 374 (415) ++++|++|+|+|++ . .+++|+|+||++++++|... .+...++||||++ ++++++++++++++++. T Consensus 317 ~~lkd~~G~~i~~~-~---~~g~l~G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~-~~i~~~~~i~i~~~~~~~~~~~~~ 391 (428) T protein:vir:10 317 FGLRDGNGNKVYPE-M---AQGMLKGYPIQRTSAIPANLGEGGKESEIYFADFND-VVIGEDGNMKVDFSKEASYIDTDG 391 (428) T ss_pred HHhhccCCceeccC-C---CCCeeeceeeEEeccccccccCCCccceEEEEecce-EEEEEecceEEEeecccccccccc Confidence 99999999999963 2 23589999999999998642 2345789999997 56788999999988753 Q ss_pred -------cCceEEEEEEEeccEEeccccEEEEEeecC Q lcl|NC_012784. 375 -------HFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) Q Consensus 375 -------~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~ 404 (415) .+...+|+++|+|+++.+|+||+.++-..= T Consensus 392 ~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 392 KLVSAFSRNQSLIRVVTEHDIGFRHPEGLVLGTGVLF 428 (428) T ss_pred cccchhhcchhheeeeeeeCceeeccceEEEEeccCC Confidence 334568999999999999999999883333 No 39 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=100.00 E-value=8.2e-64 Score=366.43 Aligned_cols=400 Identities=16% Similarity=0.165 Sum_probs=258.7 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccc----- Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSV----- 75 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~----- 75 (415) |+.. |+++++....++..+..+...+. .+...++.+++++++++|+++++.++++++++++............ T Consensus 1 M~l~-el~~~~~~~~~~~~a~l~~~~~~-~~~~~ee~~~~~~e~~~l~~~~~~l~~~i~~le~~~~~~~~~~~~~~~~~~ 78 (434) T protein:vir:62 1 MNLK-EILNASLTRTKSRLAELQGKVEK-NEVRSEELAAVKAEVEQLTKEIQTISEELAKLEEKEKEEDPAKKKDDDPEK 78 (434) T ss_pred CCHH-HHHHHHHHHHHHHHHHHHHHHhc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcchhhh Confidence 9844 44333333222222221111111 1111233344444555555555554444433322211111100000 Q ss_pred ---ccc-chhhhhh---HHHHHHHHHHHHHh--------hhhhHHHHHHHHHHhh---hhhhhhcccccccceeecchhH Q lcl|NC_012784. 76 ---EVN-EARTYRN---QANINDLGISIQNT--------KVTSQEVRDFTEYLET---RNDIQGGSLKTDSGFVVIPEEI 137 (415) Q Consensus 76 ---~~~-~~~~~~~---~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~vP~~~ 137 (415) ... ....... .............. .........+...+.. .......+.++++||++||+++ T Consensus 79 ~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~r~a~~~~l~~~~~~~e~~a~~~~t~~GG~lvP~~~ 158 (434) T protein:vir:62 79 KEDPTAKENPNEKTELSEEQRSAISASIAAALSTKGHRTNKETEIRSVFANYIVGNIDEKEARALGLVTGNGSVTIPDFL 158 (434) T ss_pred hcchhhhcchhhhHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHHHHHHhccccchhhhhhhcccccccceecchhh Confidence 000 0000000 00000000000000 0001111112111111 1112222344567899999999 Q ss_pred HhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCccccc---ccccccccccccccceeeEeeeeeEEEeehhhHH Q lcl|NC_012784. 138 VTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEK---VEELEENPELAVKPFFQLAYDINTHRGYFRISRE 214 (415) Q Consensus 138 ~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~---v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e 214 (415) .+.|++.+++++++++++++++++ ++.++|+.. ..+.+.| .+|++.+|+ ++++|++|++.+|+++++++||+| T Consensus 159 ~~~Ii~~l~~~~~i~~~~~~~~~~-~~~~~p~~~--~~~~a~~~~~~~e~~~~~~-~~~~f~~v~~~~~k~~~~~~iS~e 234 (434) T protein:vir:62 159 SKEIITYAQEENFLRRLGTGVKTK-ENIKYPVLV--KKAEAQGHKNERTNNEMPE-TDIEFDEIELSPTEFDALATVTKK 234 (434) T ss_pred HHHHHHhhhhhhhhhhhcceeccC-CceEEEEEe--cCCcccceecccccccccc-cccceeeEEeeheeeEeehhhHHH Confidence 999999999999999999988765 345666554 3444444 466777775 569999999999999999999999 Q ss_pred HHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEE Q lcl|NC_012784. 215 AIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVA 294 (415) Q Consensus 215 ~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 294 (415) +|+|+.++|++||.++|++++++++|.+||+|+|++.+..+... .........++..++++++++.++..+|+.+++| T Consensus 235 ll~ds~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~g~~~--~~~~~~~~~~~~~~d~l~~l~~~l~~~~~~~a~~ 312 (434) T protein:vir:62 235 LLARTGLPIEQIVMDELKKAYVRKETQYMVNGDEANNINDGALA--KKAVEFKTDEKNLYDALVKMKNTPVKEVRKKARW 312 (434) T ss_pred HHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccceee--cccccccccccchhhHHHHHHhhcchhhhcCCEE Confidence 99999999999999999999999999999999998876554332 2233444556678999999999999999999999 Q ss_pred EEcHHHHHHHHHhhccCCcccccCc--ccCCCCceecceeeEEeccccccccCCce-EEEechhhcEEEEeec-ceEEEE Q lcl|NC_012784. 295 IVSQTMFAKLDKMKDKLGNYLIQPD--VKEKTQQRLLGAKIEILPDEVLGQKGNNT-LIIGNLKDAIVLFDRS-QYQASW 370 (415) Q Consensus 295 v~~~~~~~~l~~lkd~~G~~l~~~~--~~~~~~~~l~G~pV~~~~~~~~~~~~~~~-~~~gd~~~~~~~~~~~-~~~i~~ 370 (415) +|||.+|.+|++|||++|||||++. ..++.+.+|+|+||+++++||.+.+++.. ++||||++| ++++|. .++++. T Consensus 313 v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~g~~~tl~G~pV~~~~~~~~~~~~~~~~i~~Gdfs~~-~i~~~~g~~~i~~ 391 (434) T protein:vir:62 313 VLNTAALTKIETMKTDDGFPLLRPFNQAEGGIGYTLLGFPVEEEDAIDIPDSPDTPVFYFGDFSKF-YIQDVIGSLEVQK 391 (434) T ss_pred EEcHHHHHHHHHhhccCCCEeeccCCCccCCCCceecceeeEEecCccCccCCCceEEEEeeccce-EEEEeeceeEEEe Confidence 9999999999999999999999864 45577889999999999999988776654 778999975 467765 477877 Q ss_pred eecc---cCceEEEEEEEeccEEec-cccEEEEEeecCCCCcc Q lcl|NC_012784. 371 TDYM---HFGECLMIAVRQDCRILD-YKSAIVIEYDDSERGEG 409 (415) Q Consensus 371 ~~~~---~~~~~~~~~~r~d~~v~~-p~a~~~~~~t~~~~~~~ 409 (415) +++. ++++.+|++.|+|+++++ |.++..+++.-.++.+| T Consensus 392 ~~~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:62 392 LVELFSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYVLKAPTGA 434 (434) T ss_pred ehhhhcccCceEEEEEeeecceeecCcccceEEEEEeccCCCC Confidence 6654 345679999999999876 98888888874444444 No 40 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=100.00 E-value=1.3e-63 Score=365.31 Aligned_cols=379 Identities=12% Similarity=0.126 Sum_probs=273.5 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccch Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) |.++++|+++++++.+++++..++.+..+ ++..++.++++++++.+.++++..+++++...+...... ..... T Consensus 1 M~~l~el~~~~~~~~~e~~~l~~~~~~e~-~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~ 73 (385) T protein:vir:18 1 MSELALIQKAIEESQQKMTQLFDAQKAEI-ESTGQVSKQLQSDLMKVQEELTKSGTRLFDLEQKLASGA------ENPGE 73 (385) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc------cccch Confidence 99999999999998887776654433221 122233344444444444444444433333222111110 00000 Q ss_pred hhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEc Q lcl|NC_012784. 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~ 160 (415) ................. .................+.+.+|.++|++++..|++.+++.+++++++++.++ T Consensus 74 ~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~ 143 (385) T protein:vir:18 74 KKSFSERAAEELIKSWD----------GKQGTFGAKTFNKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRT 143 (385) T ss_pred hhhhHHHHHHHHHHHHH----------HhhccchhhHHHhhhccccccCCceecchhhhHHHHHhhhccchhhhcceecc Confidence 00000000000000000 00011111111222334555567788889999999999999999999999998 Q ss_pred cCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) Q Consensus 161 ~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d 240 (415) +++...+++.. ...+.+.|++|++.+|+. +++|+++++.+++++++++||+|+++|++ ++++||.++|++++++++| T Consensus 144 ~~~~~~~~~~~-~~~~~a~~v~E~~~~~~~-~~~~~~~~~~~~k~~~~~~is~ell~d~~-~l~~~i~~~la~a~~~~~d 220 (385) T protein:vir:18 144 SSNALEYVREE-VFTNNADVVAEKALKPES-DITFSKQTANVKTIAHWVQASRQVMDDAP-MLQSYINNRLMYGLALKEE 220 (385) T ss_pred cCcceEEEEEe-cCCcceeeeccCcccccc-ccceeEEEEeeeeEEEeehhhHHHHhhHH-HHHHHHHHHHHHHHHHHHH Confidence 87765555443 235678899999999875 68999999999999999999999999986 6999999999999999999 Q ss_pred HHHhhccccccccccccccc-cccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCc Q lcl|NC_012784. 241 KAIIDVITKGSTGSTSSGFE-KEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPD 319 (415) Q Consensus 241 ~~il~g~g~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~ 319 (415) .+||+|+|++.+..+..... .........+...++++.+++.++...++.+++|+|||.+|.+|+++||++|||+|. + T Consensus 221 ~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~l~~-~ 299 (385) T protein:vir:18 221 GQLLNGDGTGDNLEGLNKVATAYDTSLNATGDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIALLKDNEGRYIFG-G 299 (385) T ss_pred HHHHhccCCCCcccccccccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceecc-C Confidence 99999999988754433222 222233344556789999999999999999999999999999999999999999996 5 Q ss_pred ccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecc-----cCceEEEEEEEeccEEeccc Q lcl|NC_012784. 320 VKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM-----HFGECLMIAVRQDCRILDYK 394 (415) Q Consensus 320 ~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-----~~~~~~~~~~r~d~~v~~p~ 394 (415) +..+.+++|+|+||++++++|.+ .++||||+++|+++++++++++++++. ++...+|++.|+|+++.+|+ T Consensus 300 ~~~~~~~~l~G~pV~~~~~~p~~-----~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~ 374 (385) T protein:vir:18 300 PQAFTSNIMWGLPVVPTKAQAAG-----TFTVGGFDMASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRPT 374 (385) T ss_pred cccCCCceecceeeEEcCcCCCC-----cEEEeecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEeccc Confidence 67788899999999999999865 489999999899999999999886543 44567899999999999999 Q ss_pred cEEEEEeecCC Q lcl|NC_012784. 395 SAIVIEYDDSE 405 (415) Q Consensus 395 a~~~~~~t~~~ 405 (415) ||+++++++++ T Consensus 375 a~~~~~~~aa~ 385 (385) T protein:vir:18 375 AIIKGTFSSGS 385 (385) T ss_pred ceEEEEeccCC Confidence 99999999988 No 41 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=100.00 E-value=1.3e-63 Score=365.31 Aligned_cols=379 Identities=12% Similarity=0.126 Sum_probs=273.5 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccch Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) |.++++|+++++++.+++++..++.+..+ ++..++.++++++++.+.++++..+++++...+...... ..... T Consensus 1 M~~l~el~~~~~~~~~e~~~l~~~~~~e~-~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~ 73 (385) T protein:vir:19 1 MSELALIQKAIEESQQKMTQLFDAQKAEI-ESTGQVSKQLQSDLMKVQEELTKSGTRLFDLEQKLASGA------ENPGE 73 (385) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc------cccch Confidence 99999999999998887776654433221 122233344444444444444444433333222111110 00000 Q ss_pred hhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEc Q lcl|NC_012784. 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~ 160 (415) ................. .................+.+.+|.++|++++..|++.+++.+++++++++.++ T Consensus 74 ~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~ 143 (385) T protein:vir:19 74 KKSFSERAAEELIKSWD----------GKQGTFGAKTFNKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRT 143 (385) T ss_pred hhhhHHHHHHHHHHHHH----------HhhccchhhHHHhhhccccccCCceecchhhhHHHHHhhhccchhhhcceecc Confidence 00000000000000000 00011111111222334555567788889999999999999999999999998 Q ss_pred cCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) Q Consensus 161 ~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d 240 (415) +++...+++.. ...+.+.|++|++.+|+. +++|+++++.+++++++++||+|+++|++ ++++||.++|++++++++| T Consensus 144 ~~~~~~~~~~~-~~~~~a~~v~E~~~~~~~-~~~~~~~~~~~~k~~~~~~is~ell~d~~-~l~~~i~~~la~a~~~~~d 220 (385) T protein:vir:19 144 SSNALEYVREE-VFTNNADVVAEKALKPES-DITFSKQTANVKTIAHWVQASRQVMDDAP-MLQSYINNRLMYGLALKEE 220 (385) T ss_pred cCcceEEEEEe-cCCcceeeeccCcccccc-ccceeEEEEeeeeEEEeehhhHHHHhhHH-HHHHHHHHHHHHHHHHHHH Confidence 87765555443 235678899999999875 68999999999999999999999999986 6999999999999999999 Q ss_pred HHHhhccccccccccccccc-cccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCc Q lcl|NC_012784. 241 KAIIDVITKGSTGSTSSGFE-KEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPD 319 (415) Q Consensus 241 ~~il~g~g~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~ 319 (415) .+||+|+|++.+..+..... .........+...++++.+++.++...++.+++|+|||.+|.+|+++||++|||+|. + T Consensus 221 ~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~l~~-~ 299 (385) T protein:vir:19 221 GQLLNGDGTGDNLEGLNKVATAYDTSLNATGDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIALLKDNEGRYIFG-G 299 (385) T ss_pred HHHHhccCCCCcccccccccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceecc-C Confidence 99999999988754433222 222233344556789999999999999999999999999999999999999999996 5 Q ss_pred ccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecc-----cCceEEEEEEEeccEEeccc Q lcl|NC_012784. 320 VKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM-----HFGECLMIAVRQDCRILDYK 394 (415) Q Consensus 320 ~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-----~~~~~~~~~~r~d~~v~~p~ 394 (415) +..+.+++|+|+||++++++|.+ .++||||+++|+++++++++++++++. ++...+|++.|+|+++.+|+ T Consensus 300 ~~~~~~~~l~G~pV~~~~~~p~~-----~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~ 374 (385) T protein:vir:19 300 PQAFTSNIMWGLPVVPTKAQAAG-----TFTVGGFDMASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRPT 374 (385) T ss_pred cccCCCceecceeeEEcCcCCCC-----cEEEeecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEeccc Confidence 67788899999999999999865 489999999899999999999886543 44567899999999999999 Q ss_pred cEEEEEeecCC Q lcl|NC_012784. 395 SAIVIEYDDSE 405 (415) Q Consensus 395 a~~~~~~t~~~ 405 (415) ||+++++++++ T Consensus 375 a~~~~~~~aa~ 385 (385) T protein:vir:19 375 AIIKGTFSSGS 385 (385) T ss_pred ceEEEEeccCC Confidence 99999999988 No 42 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=100.00 E-value=1.3e-63 Score=365.41 Aligned_cols=382 Identities=12% Similarity=0.134 Sum_probs=280.8 Q ss_pred CChHH-HHHHHHHHHHHHHHHHHHHHHH--hhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccc Q lcl|NC_012784. 1 MKTKE-ELQSEISDIKRQIDLKVKYATR--ALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEV 77 (415) Q Consensus 1 Mk~~~-el~~~l~~l~~~~~~~~~~~~~--~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (415) |..+. +|++++.++.+++++..++..+ .++++..+++++++.+++.+++++++++++.................. T Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~e~~~~~~~~~~e~~~~~~~~~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~-- 78 (390) T protein:vir:97 1 MTDITAKLEATLANVTDSLKAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVSV-- 78 (390) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccc-- Confidence 76664 4667777777766665544332 356677788888889999999888887766554333222211111110 Q ss_pred cchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhccee Q lcl|NC_012784. 78 NEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTV 157 (415) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~ 157 (415) ............+........ .......+. ........+..++|+++|+++++.|++.+++.+++++++++ T Consensus 79 --~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~-~~~~~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~ 149 (390) T protein:vir:97 79 --GDMFVASEQFQASTGRWNDRS------ARATMNIKA-ALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGS 149 (390) T ss_pred --hhhhhhhHHHHHHHHHhhhhh------hhhhhHHHH-HHHhhhcccccccccccchhhhHHHHHHHhhhhhhHhhcce Confidence 000000111011100000000 000001111 11223344667778899999999999999999999999999 Q ss_pred EEccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 158 KRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAA 237 (415) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~ 237 (415) .+++++...+++... ..+.+.|++||+++|++ .++|+++++++++++++++||+|+++|++ ++++||.++|++++++ T Consensus 150 ~~~~~~~~~~~~~~~-~~~~a~~v~Eg~~~~~~-~~~~~~i~~~~~k~~~~~~is~ell~ds~-~l~~~i~~~la~a~~~ 226 (390) T protein:vir:97 150 GRTDSALIEYVQETG-FVNNAAIVAEGALKPES-SLKFAKKTDTTHVIAHTMKATRQILSDAP-QLASYMNNRLIRGLKV 226 (390) T ss_pred eeccCCceEEEEEec-CCcceeeecCCcccccc-ccceeEEEEeeeeEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHH Confidence 998877655555432 23578999999999865 68999999999999999999999999986 7999999999999999 Q ss_pred HHHHHHhhccccccc-cccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccc Q lcl|NC_012784. 238 TRNKAIIDVITKGST-GSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLI 316 (415) Q Consensus 238 ~~d~~il~g~g~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~ 316 (415) ++|.+||+|+|++.. .++.............++...++++.+++..+...++.+++|+|||++|..|+++||++|+||| T Consensus 227 ~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~ 306 (390) T protein:vir:97 227 KEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYPASGIVINPIDWAAIELAKDANNQYLI 306 (390) T ss_pred HHHHHHhhcCCCCccccceeeccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceee Confidence 999999999998763 3333322233333344556678899999999999999999999999999999999999999999 Q ss_pred cCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeec----ccCceEEEEEEEeccEEec Q lcl|NC_012784. 317 QPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDY----MHFGECLMIAVRQDCRILD 392 (415) Q Consensus 317 ~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~----~~~~~~~~~~~r~d~~v~~ 392 (415) .+ +..+.+++|+|+||++++.+|.+ .++||||+++|.+++++++++.++++ .++.+.+|++.|+|+++++ T Consensus 307 ~~-~~~~~~~~l~G~pV~~~~~~~~~-----~~~~gd~~~~~~~~~~~~~~i~~~~~~~~f~~~~~~~r~~~r~d~~v~~ 380 (390) T protein:vir:97 307 GN-ARGTLTPTLWGLPVVATQAMAPG-----EFLVGAFDLAAQIFDQWDARVEIGYVNDDFQRNMVTVLAEERLALVVYR 380 (390) T ss_pred cC-ccCCCCceecceeeEEcCCCCCC-----cEEEEeccceEEEEEecceEEEEeecccccccCcEEEEEEEeeccEEec Confidence 76 44566789999999999999864 47999999888899999999998753 3455678999999999999 Q ss_pred cccEEEEEee Q lcl|NC_012784. 393 YKSAIVIEYD 402 (415) Q Consensus 393 p~a~~~~~~t 402 (415) |+||+++++. T Consensus 381 ~~a~v~~~~a 390 (390) T protein:vir:97 381 PEALITGSFA 390 (390) T ss_pred cccEEEEEeC Confidence 9999999999 No 43 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=100.00 E-value=9.2e-63 Score=360.70 Aligned_cols=380 Identities=12% Similarity=0.080 Sum_probs=260.3 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccc Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEK-------AEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQ 73 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~-------~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~ 73 (415) |. ++.+++++++++++++.++.+... +...++ .+++.++++.+..+++.++.++++............. T Consensus 1 m~---~~~k~l~el~~~~~~~~~~~~~~~-e~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (395) T protein:vir:43 1 MS---DFEKQIGELNASLKQVGDQIKSQA-EQVNTQIANFGEMNKETRAKVDELLTAQGELQARLSAAEQAMLANEKRDG 76 (395) T ss_pred Ch---hHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Confidence 43 333334444444443333332221 111112 2333344444444444444444333222221111100 Q ss_pred cccccchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhh Q lcl|NC_012784. 74 SVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDK 153 (415) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~ 153 (415) .... .................. ..................+...+|+++|+++++.|++.+++.+++++ T Consensus 77 ~~~~--~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~ 145 (395) T protein:vir:43 77 GEEA--PKTAGQMVAESLKEQGVT---------SSLRGSHRVSMPRSAITSIDGSGGALVAPDRRPGVVAAPQRRLTIRD 145 (395) T ss_pred ccch--hhhHHHHHHHHHHHHHHH---------HHhhhhhhhhhhhhhhcccCCCCccccchhhHHHHHHHHHhhhhHHh Confidence 0000 000000000000000000 00000011111222334456677889999999999999999999999 Q ss_pred cceeEEccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHH Q lcl|NC_012784. 154 YVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMAR 233 (415) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~ 233 (415) ++++.+++++++++++... ..+.+.|++|++.+|++ .++|+++++++++++++++||+|+++|++ ++++||.++|++ T Consensus 146 l~~~~~~~~~~~~~~~~~~-~~~~a~~v~E~~~~~~~-~~~~~~i~~~~~k~~~~~~is~ell~d~~-~l~~~v~~~la~ 222 (395) T protein:vir:43 146 LVAPGTTESNSVEYVRETG-FVNNAAPVSEGTQKPYS-DLTFELENAPVRTIAHLFKASRQILDDAS-ALQSYIDARARY 222 (395) T ss_pred hccceecCCCceEEEEEec-CCCceeeecCCcccccc-ccceeEEEEeeeeEEEeehhhHHHHHhHH-HHHHHHHHHHHH Confidence 9999999877666555432 34678999999999875 68999999999999999999999999986 699999999999 Q ss_pred HHHHHHHHHHhhcccccccccccccccc---ccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhcc Q lcl|NC_012784. 234 TIAATRNKAIIDVITKGSTGSTSSGFEK---EGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDK 310 (415) Q Consensus 234 ~~~~~~d~~il~g~g~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~ 310 (415) ++++++|.+||+|+|++.+..+...... ........+...++++.+++..+...++.+++|+|||++|..|++++|+ T Consensus 223 a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~ 302 (395) T protein:vir:43 223 GLMLVEECQLLYGNGTGANLHGIIPQAQAYAPPSGVVVTAEQRIDRIRLAILQAQLAEFPASGIVLNPIDWALIELNKDA 302 (395) T ss_pred HHHHHHHHHHHhccCCCCccccccccccccccccccccccchhHHHHHHHHHhhccccCCCcEEEEcHHHHHHHHHhhcc Confidence 9999999999999998877433222111 1122233345568999999999999999999999999999999999999 Q ss_pred CCcccccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecc-----cCceEEEEEEE Q lcl|NC_012784. 311 LGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM-----HFGECLMIAVR 385 (415) Q Consensus 311 ~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-----~~~~~~~~~~r 385 (415) +|+|||. ++.++.+++|+|+||++++++|.+. ++||||++++++++|.+++++++++. ++.+.+|++.| T Consensus 303 ~G~~i~~-~~~~~~~~~l~G~pVv~~~~~~~~~-----~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r 376 (395) T protein:vir:43 303 ENRYIIG-SPQNGTTPTLWRLPVVETQAITQDE-----FLTGAFSLGAQIFDRMDIEVLVSTENDKDFENNMVTIRAEER 376 (395) T ss_pred CCceecc-ccccCCCceecceeeEEcCCCCCCc-----EEEEeccceEEEEEecceEEEEeccccchhhcCcEEEEEEEe Confidence 9999996 5677778899999999999998653 79999999899999999999987643 45667999999 Q ss_pred eccEEeccccEEEEEeecC Q lcl|NC_012784. 386 QDCRILDYKSAIVIEYDDS 404 (415) Q Consensus 386 ~d~~v~~p~a~~~~~~t~~ 404 (415) +|+++++|+||++++++++ T Consensus 377 ~d~~v~~~~a~~~~~~taa 395 (395) T protein:vir:43 377 LAFAVYRPEAFVTGSLTAS 395 (395) T ss_pred eccEEecccceEEEEeccC Confidence 9999999999999999999 No 44 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=100.00 E-value=1.8e-62 Score=359.13 Aligned_cols=395 Identities=10% Similarity=0.077 Sum_probs=255.0 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhch--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNN--DELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVN 78 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e--~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (415) ||...+ . ..++..++.++++..+++ ...+..+++.++++.+.+.++..++...................... T Consensus 2 ~ke~~~---~---~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 75 (413) T protein:vir:81 2 VKEAGD---A---PTNAQVAEIAEVKSMVEQFKADEDAKRERAKSVKANQDFLRELQEATAGSVDSEKSGELTRKGEGYK 75 (413) T ss_pred hhhHHH---H---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhHHHhhhHhhhhhhhh Confidence 222222 1 111122222222222111 11122222233333333333322222111111110000000000000 Q ss_pred chhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeE Q lcl|NC_012784. 79 EARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~ 158 (415) .......+...........................+..........+.+.+++++|+++++.|++.+++.+++++++++. T Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~~ 155 (413) T protein:vir:81 76 SIGEFFAKRAGDQIKQQAGGAQLNYSVGEYVAPRVKAASDPASTATLTDEFQGGYGTTWNRNIIYRRREKLVVADLMDNL 155 (413) T ss_pred hhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhHHHhhhhhhhhcccccccccccchhhHHHHHHHHhhhhhHHhhccee Confidence 00000000000000000000000000000011111222223334456678889999999999999999999999999999 Q ss_pred EccCCceeEEEEeecC--CcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHH Q lcl|NC_012784. 159 RVTNGSGKYPVVRQSE--VAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIA 236 (415) Q Consensus 159 ~~~~~~~~~~~~~~~~--~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~ 236 (415) ++++++..+++.+... ...+.|++||+.+|+++.++|+.+++.+++++++++||+|+|+|++. |++||.++|+++++ T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~-l~~~i~~~la~~~~ 234 (413) T protein:vir:81 156 TMTNTTIKYLMEKANRVVEGGFKTVAEGGKKPYMRFADFDIVTESLSKIAGLTKITDEMIEDYDF-LVSYINARLLEELA 234 (413) T ss_pred eccCCceeEEEeccccccccccceecCcccccccCcccceeeEeeeeeEEEeehhhHHHHHHHHH-HHHHHHHHHHHHHH Confidence 9998888887765443 34678999999999876678999999999999999999999999975 99999999999999 Q ss_pred HHHHHHHhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhh-ccCCCEEEEcHHHHHHHHHhhccCCccc Q lcl|NC_012784. 237 ATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKP-NYEHNVAIVSQTMFAKLDKMKDKLGNYL 315 (415) Q Consensus 237 ~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~v~~~~~~~~l~~lkd~~G~~l 315 (415) +++|.+||+|+|++.+..+.............++...++++..++..+..+ ++.+++|+|||++|.+|++|||++|||| T Consensus 235 ~~~d~~~l~G~G~~~~~~Gi~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l 314 (413) T protein:vir:81 235 IEEERQLLLGDGTGNNLTGLLKRDGIQTLAVSNKDELADSIYKAMTNISLATPFQADALVINPLDYQELRLAKDANGQYY 314 (413) T ss_pred HHHHHHHhccCCCCCcccccccccccccccccccchhHHHHHHHHHHhhhhccCCCcEEEEcHHHHHHHHHhhccCCcee Confidence 999999999999987744443333333333334455577777777766544 4566789999999999999999999999 Q ss_pred ccCcccC-------CCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecc-----cCceEEEEE Q lcl|NC_012784. 316 IQPDVKE-------KTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM-----HFGECLMIA 383 (415) Q Consensus 316 ~~~~~~~-------~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-----~~~~~~~~~ 383 (415) |.+.... ..+.+|||+||++++++|.+ .++||||+++|++++|++++++++++. ++.+.+|++ T Consensus 315 ~~~~~~~~~~~~~~~~~~~l~G~pv~~s~~~~~~-----~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~~ 389 (413) T protein:vir:81 315 GGGVFQGQYGSGGIMLDPAPWGLRTVQSQVVPVG-----KPVVGAFRSAASVLRKGGVRIDSTNTNVDDFENNLITVRAE 389 (413) T ss_pred ccccccccccccccccCceecceeeEEcCCCCcc-----cEEEEecccEEEEEEecceEEEEeccccchhhcCcEEEEEE Confidence 9765433 23458999999999999854 479999999899999999999987754 456689999 Q ss_pred EEeccEEeccccEEEEEeecCCCC Q lcl|NC_012784. 384 VRQDCRILDYKSAIVIEYDDSERG 407 (415) Q Consensus 384 ~r~d~~v~~p~a~~~~~~t~~~~~ 407 (415) +|+|+++.+|+||++++++++++| T Consensus 390 ~r~d~~~~~~~a~~~l~~~~~~~p 413 (413) T protein:vir:81 390 ERVGLMVTFPEAIVQLDVAEVVTP 413 (413) T ss_pred EeeccEEecccceEEEEecCCCCC Confidence 999999999999999999998888 No 45 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=100.00 E-value=2.4e-62 Score=358.43 Aligned_cols=400 Identities=11% Similarity=0.037 Sum_probs=254.2 Q ss_pred CChHHHHHH--HHHHHHHHH--HHHHHHHHHhhch------------------HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 1 MKTKEELQS--EISDIKRQI--DLKVKYATRALNN------------------DELEKAEKLEQEITDLRSQIQEKQEEL 58 (415) Q Consensus 1 Mk~~~el~~--~l~~l~~~~--~~~~~~~~~~~~e------------------~~~~~~~~~~~e~~~l~~~i~~~~~~~ 58 (415) |.++++-+. ++...++.+ ..+..+..+...+ +..+++++..++++++.+++++..+.. T Consensus 5 ~~~~~~e~~~~e~a~~~~~~~~~~k~~e~~~~~ke~~~~~l~~~~e~~~k~~~E~~~~le~~~ee~k~l~ee~~~~~~~~ 84 (458) T protein:vir:10 5 INKLKEELGLGDLAKSLEGLTAAQKAQEAERMRKEQEEKELARMNDLVSKAVGEDRKRLEEALELVKSLDEKSKKSNELF 84 (458) T ss_pred hhhhhhhhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 333332110 000000100 0000001110000 111111222222333332222222111 Q ss_pred HHHHHHHh--------hhhhcccccc--------ccchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhc Q lcl|NC_012784. 59 DKLKEKDG--------TSENNQQSVE--------VNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGG 122 (415) Q Consensus 59 ~~~~~~~~--------~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 122 (415) ....+... .........+ ...................................... ....... T Consensus 85 a~~~e~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~-~~~a~~~ 163 (458) T protein:vir:10 85 AQTVEKQQETIVGLQDEIKSLLTAREGRSFVGDSVAKALYGTQENFEDEVEKLVLLSYVMEKGVFETEHGQR-HLKAVNQ 163 (458) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccchhhhhhHHHHHHHHHHHHHHHhhccchhhhhhh-hhhhhhh Confidence 11000000 0000000000 00000000000000000000000000000000000000 0111122 Q ss_pred ccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCccccccccccccccc-----cccccee Q lcl|NC_012784. 123 SLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPEL-----AVKPFFQ 197 (415) Q Consensus 123 ~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~-----~~~~f~~ 197 (415) ..+...+++++|+++.+.|++.+++.++++.++++++++++...++ +....+.+.|++|++.++++ +.++|++ T Consensus 164 ~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~--~~~~~~~a~~v~e~~~~~~~~~~~~~~~~~~~ 241 (458) T protein:vir:10 164 SSSVEVSSESYETIFSQRIIRDLQKELVVGALFEELPMSSKILTML--VEPDAGKATWVAASTYGTDTTTGEEVKGALKE 241 (458) T ss_pred cccCccccceehhhHhHHHHHHHHhhhhHHhhcceeecCCcceEEE--EecCCcceeeccccccccccccccccccccee Confidence 3345578899999999999999999999999999999887655544 55677889999999988754 3568999 Q ss_pred eEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccccc------ccccccch Q lcl|NC_012784. 198 LAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEG------KKLEVKKA 271 (415) Q Consensus 198 v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~------~~~~~~~~ 271 (415) +++.++|++++++||+|+++|+.++|++||.++|++++++++|.+||+|+|++.|.++........ ........ T Consensus 242 i~~~~~k~~~~v~is~ell~ds~~~~~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~ 321 (458) T protein:vir:10 242 IHFSTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMTGDGSGKPKGLLTLASEDSAKVVTEAKADGSVL 321 (458) T ss_pred eEeeeeeEEeeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceeeecccccccceeeccccccccc Confidence 999999999999999999999999999999999999999999999999999988776654332221 12223345 Q ss_pred hhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCc----ccCCCCceecceeeEEeccccccccCCc Q lcl|NC_012784. 272 KSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPD----VKEKTQQRLLGAKIEILPDEVLGQKGNN 347 (415) Q Consensus 272 ~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~----~~~~~~~~l~G~pV~~~~~~~~~~~~~~ 347 (415) .++++++++++.+...|+.+++|+|||++|.+|++++|++|+|+|.+. +..+.+.+|||+||+++++||.++ ++. T Consensus 322 ~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~-~~~ 400 (458) T protein:vir:10 322 VTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDSVKLQGQVGRIYGLPVVVSEYFPAKA-NSA 400 (458) T ss_pred ccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHhhcccCCceeeccccccccccCcCceecceeeEEcccccccc-CCc Confidence 578999999999999999999999999999999999999999998654 344667799999999999999764 455 Q ss_pred eEEEechhhcEEEEeecceEEEEeecc-cCceEEEEEEEeccEEeccccEEEEEeecC Q lcl|NC_012784. 348 TLIIGNLKDAIVLFDRSQYQASWTDYM-HFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) Q Consensus 348 ~~~~gd~~~~~~~~~~~~~~i~~~~~~-~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~ 404 (415) .++||||+++|.++++.++++..++|. ++...+|+..|+|+.+++|+||++++++++ T Consensus 401 ~~~~~~f~~~~~~~~~~~~~v~~d~~~~~~~~~~~~~~r~~~~v~~~~a~v~~~~aa~ 458 (458) T protein:vir:10 401 EFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFANGVVSGTYAAS 458 (458) T ss_pred ceEEEEecccEEEEEeeceEEEeecccCCCceEEEEEEEecceEecccceEEEeeccC Confidence 689999998899999999999988875 446679999999999999999999999998 No 46 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=100.00 E-value=8e-62 Score=355.53 Aligned_cols=398 Identities=13% Similarity=0.092 Sum_probs=253.3 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHH-------------HHHHHHHHHHHhh Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEK-------------QEELDKLKEKDGT 67 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~-------------~~~~~~~~~~~~~ 67 (415) ||+...|+++..++.+++.....+.++...|-+ +..+++.+++++++++++.. .+.++.+...... T Consensus 1 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~aE~~-~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~ 79 (497) T protein:vir:78 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKK-EALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPE 79 (497) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 999999888888887776665444333322211 11112222222222221111 1111111110000 Q ss_pred hhhccccccc---cchhh----hhh---HHHHHHHHHHHHHh-------hhhhHHHHHHHHHHh-hhhhhhhcccccccc Q lcl|NC_012784. 68 SENNQQSVEV---NEART----YRN---QANINDLGISIQNT-------KVTSQEVRDFTEYLE-TRNDIQGGSLKTDSG 129 (415) Q Consensus 68 ~~~~~~~~~~---~~~~~----~~~---~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 129 (415) .+........ ..... ... .............. .........+..... ..........+.+.+ T Consensus 80 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 159 (497) T protein:vir:78 80 VEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTF 159 (497) T ss_pred HHhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCccc Confidence 0000000000 00000 000 00000000000000 000000000111111 112223334566778 Q ss_pred eeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecC-CcccccccccccccccccccceeeEeeeeeEEEe Q lcl|NC_012784. 130 FVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSE-VAALEKVEELEENPELAVKPFFQLAYDINTHRGY 208 (415) Q Consensus 130 ~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~-~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~ 208 (415) |++||+++.+.|++.+++.++++++++++++++++.. +++..+ .+.+.|++|++.+|++ +++|++|++.+||++++ T Consensus 160 g~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~--~~~~~~~~~~a~wv~E~~~~~~s-~~~f~~i~~~~~k~a~~ 236 (497) T protein:vir:78 160 APGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLS--YLTESAAHNNAAAVAEAGTYPFS-SEEFARVYEQVGKVANA 236 (497) T ss_pred ccccchhhhHHHHHHHHhhhhHHhhccccccCCCceE--EEEEcCCCCcceeeccCcccccc-cccceeeEeeeeeeEee Confidence 8999999999999999999999999999998876544 555444 4678999999999975 69999999999999999 Q ss_pred ehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccccccccc--------------------- Q lcl|NC_012784. 209 FRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLE--------------------- 267 (415) Q Consensus 209 ~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~--------------------- 267 (415) ++||+|||+|++ ++++||.++|++++++++|.+||+|+|++.+.++............ T Consensus 237 ~~iS~ell~d~~-~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 315 (497) T protein:vir:78 237 LTITDEGLRDAP-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGT 315 (497) T ss_pred cHhHHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhcccc Confidence 999999999987 5999999999999999999999999999877665432211110000 Q ss_pred ---------------------------------ccchhhHHHHHHHHHHhhhh-ccCCCEEEEcHHHHHHHHHhhccCCc Q lcl|NC_012784. 268 ---------------------------------VKKAKSLDDIKDAINLNVKP-NYEHNVAIVSQTMFAKLDKMKDKLGN 313 (415) Q Consensus 268 ---------------------------------~~~~~~~~~~~~~~~~~~~~-~~~~~~~v~~~~~~~~l~~lkd~~G~ 313 (415) .+.....+++..++..+... ++.+++|+|||.+|..|+++||++|| T Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~ 395 (497) T protein:vir:78 316 NGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQ 395 (497) T ss_pred cchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHHhhcCCCc Confidence 01111223334444444333 44567999999999999999999999 Q ss_pred ccccCccc------CCCCceecceeeEEeccccccccCCceEEEechhh-cEEEEeecceEEEEeec-----ccCceEEE Q lcl|NC_012784. 314 YLIQPDVK------EKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKD-AIVLFDRSQYQASWTDY-----MHFGECLM 381 (415) Q Consensus 314 ~l~~~~~~------~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~~i~~~~~-----~~~~~~~~ 381 (415) |||++.+. ...+.+|||+||+++++||.+. ++||||++ ++.+++|.+++|+++++ .++.+.+| T Consensus 396 ~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~-----~~~Gd~~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r 470 (497) T protein:vir:78 396 YMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT-----ILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVR 470 (497) T ss_pred eeccCcccccccccccCCceeeceeeEecCCCCCCc-----eEEeecccceEEEEEecccEEEeecccchhhhcCcEEEE Confidence 99986542 2344589999999999998653 68999997 46688999999999764 34567899 Q ss_pred EEEEeccEEeccccEEEEEeecCCCCc Q lcl|NC_012784. 382 IAVRQDCRILDYKSAIVIEYDDSERGE 408 (415) Q Consensus 382 ~~~r~d~~v~~p~a~~~~~~t~~~~~~ 408 (415) ++.|+|+.|++|+||+++++++++++- T Consensus 471 ~~~r~~~~v~~p~A~~~l~~~~~~~~~ 497 (497) T protein:vir:78 471 AEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) T ss_pred EEEeecceeeccccEEEEEecCCccCC Confidence 999999999999999999999887776 No 47 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=100.00 E-value=8e-62 Score=355.53 Aligned_cols=398 Identities=13% Similarity=0.092 Sum_probs=253.3 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHH-------------HHHHHHHHHHHhh Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEK-------------QEELDKLKEKDGT 67 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~-------------~~~~~~~~~~~~~ 67 (415) ||+...|+++..++.+++.....+.++...|-+ +..+++.+++++++++++.. .+.++.+...... T Consensus 1 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~aE~~-~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~ 79 (497) T protein:vir:10 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKK-EALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPE 79 (497) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 999999888888887776665444333322211 11112222222222221111 1111111110000 Q ss_pred hhhccccccc---cchhh----hhh---HHHHHHHHHHHHHh-------hhhhHHHHHHHHHHh-hhhhhhhcccccccc Q lcl|NC_012784. 68 SENNQQSVEV---NEART----YRN---QANINDLGISIQNT-------KVTSQEVRDFTEYLE-TRNDIQGGSLKTDSG 129 (415) Q Consensus 68 ~~~~~~~~~~---~~~~~----~~~---~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 129 (415) .+........ ..... ... .............. .........+..... ..........+.+.+ T Consensus 80 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 159 (497) T protein:vir:10 80 VEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTF 159 (497) T ss_pred HHhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCccc Confidence 0000000000 00000 000 00000000000000 000000000111111 112223334566778 Q ss_pred eeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecC-CcccccccccccccccccccceeeEeeeeeEEEe Q lcl|NC_012784. 130 FVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSE-VAALEKVEELEENPELAVKPFFQLAYDINTHRGY 208 (415) Q Consensus 130 ~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~-~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~ 208 (415) |++||+++.+.|++.+++.++++++++++++++++.. +++..+ .+.+.|++|++.+|++ +++|++|++.+||++++ T Consensus 160 g~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~--~~~~~~~~~~a~wv~E~~~~~~s-~~~f~~i~~~~~k~a~~ 236 (497) T protein:vir:10 160 APGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLS--YLTESAAHNNAAAVAEAGTYPFS-SEEFARVYEQVGKVANA 236 (497) T ss_pred ccccchhhhHHHHHHHHhhhhHHhhccccccCCCceE--EEEEcCCCCcceeeccCcccccc-cccceeeEeeeeeeEee Confidence 8999999999999999999999999999998876544 555444 4678999999999975 69999999999999999 Q ss_pred ehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccccccccc--------------------- Q lcl|NC_012784. 209 FRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLE--------------------- 267 (415) Q Consensus 209 ~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~--------------------- 267 (415) ++||+|||+|++ ++++||.++|++++++++|.+||+|+|++.+.++............ T Consensus 237 ~~iS~ell~d~~-~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 315 (497) T protein:vir:10 237 LTITDEGLRDAP-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGT 315 (497) T ss_pred cHhHHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhcccc Confidence 999999999987 5999999999999999999999999999877665432211110000 Q ss_pred ---------------------------------ccchhhHHHHHHHHHHhhhh-ccCCCEEEEcHHHHHHHHHhhccCCc Q lcl|NC_012784. 268 ---------------------------------VKKAKSLDDIKDAINLNVKP-NYEHNVAIVSQTMFAKLDKMKDKLGN 313 (415) Q Consensus 268 ---------------------------------~~~~~~~~~~~~~~~~~~~~-~~~~~~~v~~~~~~~~l~~lkd~~G~ 313 (415) .+.....+++..++..+... ++.+++|+|||.+|..|+++||++|| T Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~ 395 (497) T protein:vir:10 316 NGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQ 395 (497) T ss_pred cchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHHhhcCCCc Confidence 01111223334444444333 44567999999999999999999999 Q ss_pred ccccCccc------CCCCceecceeeEEeccccccccCCceEEEechhh-cEEEEeecceEEEEeec-----ccCceEEE Q lcl|NC_012784. 314 YLIQPDVK------EKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKD-AIVLFDRSQYQASWTDY-----MHFGECLM 381 (415) Q Consensus 314 ~l~~~~~~------~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~~i~~~~~-----~~~~~~~~ 381 (415) |||++.+. ...+.+|||+||+++++||.+. ++||||++ ++.+++|.+++|+++++ .++.+.+| T Consensus 396 ~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~-----~~~Gd~~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r 470 (497) T protein:vir:10 396 YMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT-----ILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVR 470 (497) T ss_pred eeccCcccccccccccCCceeeceeeEecCCCCCCc-----eEEeecccceEEEEEecccEEEeecccchhhhcCcEEEE Confidence 99986542 2344589999999999998653 68999997 46688999999999764 34567899 Q ss_pred EEEEeccEEeccccEEEEEeecCCCCc Q lcl|NC_012784. 382 IAVRQDCRILDYKSAIVIEYDDSERGE 408 (415) Q Consensus 382 ~~~r~d~~v~~p~a~~~~~~t~~~~~~ 408 (415) ++.|+|+.|++|+||+++++++++++- T Consensus 471 ~~~r~~~~v~~p~A~~~l~~~~~~~~~ 497 (497) T protein:vir:10 471 AEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) T ss_pred EEEeecceeeccccEEEEEecCCccCC Confidence 999999999999999999999887776 No 48 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=100.00 E-value=1.5e-61 Score=354.05 Aligned_cols=397 Identities=14% Similarity=0.149 Sum_probs=271.6 Q ss_pred CChHHHHHHHHHHHHHHHHHHHH--HHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccc--ccc Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVK--YATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQ--SVE 76 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~--~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~--~~~ 76 (415) ||. +||++++.++.+++++..+ +..+.+++++.+++++++.++++|+++|+++++..+...+.......... ... T Consensus 1 M~l-~eL~~~r~~~~~~~~~l~~~~~e~~~l~~ee~~~~~~l~~ei~~l~~~i~~~e~~e~~~~~~~~~~~~~~~~~~~~ 79 (435) T protein:vir:80 1 MNV-NELRRERAAVNQRVQALAQIEVGGTALSVEQQAEFDQLSSKFNELTAQIERAEAAERMAAAAAVPVDPNPAAVTAS 79 (435) T ss_pred CCH-HHHHHHHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccchhhhhccc Confidence 886 7788888777776665433 33456888999999999999999999999887543322221111110000 000 Q ss_pred cc-----chhhhhhHHH-HHHHHHHHHHhhh--hhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhh Q lcl|NC_012784. 77 VN-----EARTYRNQAN-INDLGISIQNTKV--TSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVE 148 (415) Q Consensus 77 ~~-----~~~~~~~~~~-~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~ 148 (415) .. .......+.. ............. .......................+...||++||.++.++|++.+++. T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lvP~~~~~~ii~~l~~~ 159 (435) T protein:vir:80 80 AAAPVYAQPKAPEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRPK 159 (435) T ss_pred cccccccccchhhhhHHHHHHHHHHHHhccchhHHHHHHHHhhhhhhhhhhhhcccCCCCCccccchhHHHHHHHHHhhh Confidence 00 0000000000 0011111100000 00000111111111222223445666789999999999999999999 Q ss_pred hhhhhc-ceeEEccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchH--HHHH Q lcl|NC_012784. 149 FNLDKY-VTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKV--NVLQ 225 (415) Q Consensus 149 ~~l~~~-~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~--~l~~ 225 (415) ++++++ ++++++.++ .+.+++.++++.+.|++|++.+|++ .++|++|++.++|++++++||+|+|+|+.+ ++++ T Consensus 160 ~~i~~~~~~~v~~~~~--~~~~p~~~~~~~a~~v~E~~~~~~~-~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~l~~ 236 (435) T protein:vir:80 160 SVVRKLGARTLPLSNG--NITIPRLKGGAIVGYIGADTDIPTT-QQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVDQ 236 (435) T ss_pred chhhhccceeeecCCC--ceEEEEEeCCcceeeeccCcccccc-ccceeeEEEeeEEEEEeehhhHHHHHhhcccHHHHH Confidence 999997 677766655 4556666788889999999999974 689999999999999999999999999854 7999 Q ss_pred HHHHHHHHHHHHHHHHHHhhcccccc-cccccccccccccccc---ccchhhHHHHHHHHHHhhhh--ccCCCEEEEcHH Q lcl|NC_012784. 226 ELKLWMARTIAATRNKAIIDVITKGS-TGSTSSGFEKEGKKLE---VKKAKSLDDIKDAINLNVKP--NYEHNVAIVSQT 299 (415) Q Consensus 226 ~l~~~la~~~~~~~d~~il~g~g~~~-~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~--~~~~~~~v~~~~ 299 (415) ||.++|++++++++|.+|++|+|++. |.++............ .+......++.+++..+... ++.+++|+|||. T Consensus 237 ~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~vmn~~ 316 (435) T protein:vir:80 237 IVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPGNVITASDGSTLQKIETDLGKAILALENADANLTQPGWIMAPR 316 (435) T ss_pred HHHHHHHHHHHHHHHHHhhccCCCCCcccceeecccccceeecccccchhhHHHHHHHHHHHhhccccccccCEEEEcHH Confidence 99999999999999999999999764 4433332222111111 11122244666666666543 567899999999 Q ss_pred HHHHHHHhhccCCcccccCcccCCCCceecceeeEEecccccc--cc-CCceEEEechhhcEEEEeecceEEEEeecc-- Q lcl|NC_012784. 300 MFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLG--QK-GNNTLIIGNLKDAIVLFDRSQYQASWTDYM-- 374 (415) Q Consensus 300 ~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~--~~-~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-- 374 (415) +|..|++++|++|+|+|. .. .+++|+|+||++++.+|.. .+ +...++||||++ +++++|++++++++++. T Consensus 317 ~~~~L~~lkd~~G~~l~~-~~---~~~~l~G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~-~~i~~~~~~~i~~~~~~~~ 391 (435) T protein:vir:80 317 TFRFLEGLRDGNGNKVYP-EL---ANGMLKGYPVGKTTQVPINLGEAGKESEIYFTDFGD-VFIGEEETLEIDYSKEATY 391 (435) T ss_pred HHHHHHhhhccCCceecc-CC---CCCeEeeeeeEEeccccccccCCCCcceEEEEEccc-EEEEeecceEEEEeccccc Confidence 999999999999999994 33 2458999999999999863 22 344689999998 55789999999988764 Q ss_pred ------------cCceEEEEEEEeccEEeccccEEEEEeecCCCCccc Q lcl|NC_012784. 375 ------------HFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGD 410 (415) Q Consensus 375 ------------~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~~~~~~~ 410 (415) ++.+.+|++.|+|+++++|+||+.++-. +.|. T Consensus 392 ~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~~----~~~~ 435 (435) T protein:vir:80 392 KDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSGV----AWGA 435 (435) T ss_pred cccccchhhhhhcCcceeeeeeeeCcEeecccceEEEecc----CCCC Confidence 3456789999999999999999999833 3333 No 49 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=100.00 E-value=5.4e-62 Score=356.46 Aligned_cols=383 Identities=12% Similarity=0.105 Sum_probs=274.2 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDE--LEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVN 78 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~--~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (415) ||+++||++++.++.+++....+++.....++. .+++.++++++++|+++++++++++++++................ T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:26 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCC Confidence 999999999999999998888877665554332 356788888888999888888887776654443322222221111 Q ss_pred chhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeE Q lcl|NC_012784. 79 EARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~ 158 (415) .....+.........+ ............. ............+.++||++||+++.++|++.+++++++++++++. T Consensus 81 ~~~~~~~~~~~~~~~r---~~~~~~~~~~~~~--~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~ 155 (387) T protein:vir:26 81 LSDNEKMVKAKAEFYR---HAILPNEFEKPSM--EAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLT 155 (387) T ss_pred CchhHHHHHHHHHHHH---HHHhhhhHHHHHH--HHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceee Confidence 1111111111111111 1111111111111 1111222334456677899999999999999999999999999998 Q ss_pred EccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 159 RVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAAT 238 (415) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~ 238 (415) ++++. .+|... .+...+.|++||+.++++ .++|+++++.+++++++++||+|+|+||.++|++||.++|+++++++ T Consensus 156 ~~~~~--~~p~~~-~~~~~a~~v~Eg~~~~~~-~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~ 231 (387) T protein:vir:26 156 NIKGL--EIPRVS-YTLDDDDFITDVETAKEL-KAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAK 231 (387) T ss_pred ecCCc--eeeeee-ccCCcccccccccccccc-ccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHH Confidence 87643 455433 345678999999999875 59999999999999999999999999999999999999999999999 Q ss_pred HHHH-HhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCccccc Q lcl|NC_012784. 239 RNKA-IIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQ 317 (415) Q Consensus 239 ~d~~-il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~ 317 (415) ++.. |.+|+|++.+.+... .......++...+++++++++.+..+|+.+++|+||+.+|..+..+++.+|+|+|. T Consensus 232 e~~~~~~~g~g~g~~~g~~~----~~~~~~~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~ 307 (387) T protein:vir:26 232 ERKDALAVSPKSGLEHMSFY----NGSVKEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD 307 (387) T ss_pred HHHhHhhcCCCccccceeee----ccccccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc Confidence 7654 445666655544332 22233344566799999999999999999999999999998887777777888874 Q ss_pred CcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecc-cCceEEEEEEEeccEEeccccE Q lcl|NC_012784. 318 PDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM-HFGECLMIAVRQDCRILDYKSA 396 (415) Q Consensus 318 ~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-~~~~~~~~~~r~d~~v~~p~a~ 396 (415) +.|.+|+|+||+++++++ .++||||+.+|.++ .++.+....+. .+.+.++++.|+|+++++|+|| T Consensus 308 -----~~~~~llG~PV~~~~~~~-------~~~~GDf~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~ 373 (387) T protein:vir:26 308 -----TPAEKVFGKPVVFTDAAV-------KPIVGDFNYFGINY--DGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAF 373 (387) T ss_pred -----cCCccccccceEEecCCC-------ceeeechhhhhhhh--hhhhheecccccCCceEEEEEEEeCcEeechhhe Confidence 345789999999998764 47999999877654 34555444443 4577899999999999999999 Q ss_pred EEEEeecCCCCccc Q lcl|NC_012784. 397 IVIEYDDSERGEGD 410 (415) Q Consensus 397 ~~~~~t~~~~~~~~ 410 (415) +++++++++.+.-- T Consensus 374 ~~l~~ka~~~~~~~ 387 (387) T protein:vir:26 374 RIAKAKENTGPLPS 387 (387) T ss_pred EEEEeecCCCCCCC Confidence 99999875544333 No 50 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=100.00 E-value=5.4e-62 Score=356.46 Aligned_cols=383 Identities=12% Similarity=0.105 Sum_probs=274.2 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDE--LEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVN 78 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~--~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (415) ||+++||++++.++.+++....+++.....++. .+++.++++++++|+++++++++++++++................ T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:94 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCC Confidence 999999999999999998888877665554332 356788888888999888888887776654443322222221111 Q ss_pred chhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeE Q lcl|NC_012784. 79 EARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~ 158 (415) .....+.........+ ............. ............+.++||++||+++.++|++.+++++++++++++. T Consensus 81 ~~~~~~~~~~~~~~~r---~~~~~~~~~~~~~--~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~ 155 (387) T protein:vir:94 81 LSDNEKMVKAKAEFYR---HAILPNEFEKPSM--EAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLT 155 (387) T ss_pred CchhHHHHHHHHHHHH---HHHhhhhHHHHHH--HHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceee Confidence 1111111111111111 1111111111111 1111222334456677899999999999999999999999999998 Q ss_pred EccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 159 RVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAAT 238 (415) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~ 238 (415) ++++. .+|... .+...+.|++||+.++++ .++|+++++.+++++++++||+|+|+||.++|++||.++|+++++++ T Consensus 156 ~~~~~--~~p~~~-~~~~~a~~v~Eg~~~~~~-~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~ 231 (387) T protein:vir:94 156 NIKGL--EIPRVS-YTLDDDDFITDVETAKEL-KAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAK 231 (387) T ss_pred ecCCc--eeeeee-ccCCcccccccccccccc-ccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHH Confidence 87643 455433 345678999999999875 59999999999999999999999999999999999999999999999 Q ss_pred HHHH-HhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCccccc Q lcl|NC_012784. 239 RNKA-IIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQ 317 (415) Q Consensus 239 ~d~~-il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~ 317 (415) ++.. |.+|+|++.+.+... .......++...+++++++++.+..+|+.+++|+||+.+|..+..+++.+|+|+|. T Consensus 232 e~~~~~~~g~g~g~~~g~~~----~~~~~~~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~ 307 (387) T protein:vir:94 232 ERKDALAVSPKSGLEHMSFY----NGSVKEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD 307 (387) T ss_pred HHHhHhhcCCCccccceeee----ccccccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc Confidence 7654 445666655544332 22233344566799999999999999999999999999998887777777888874 Q ss_pred CcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecc-cCceEEEEEEEeccEEeccccE Q lcl|NC_012784. 318 PDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM-HFGECLMIAVRQDCRILDYKSA 396 (415) Q Consensus 318 ~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-~~~~~~~~~~r~d~~v~~p~a~ 396 (415) +.|.+|+|+||+++++++ .++||||+.+|.++ .++.+....+. .+.+.++++.|+|+++++|+|| T Consensus 308 -----~~~~~llG~PV~~~~~~~-------~~~~GDf~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~ 373 (387) T protein:vir:94 308 -----TPAEKVFGKPVVFTDAAV-------KPIVGDFNYFGINY--DGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAF 373 (387) T ss_pred -----cCCccccccceEEecCCC-------ceeeechhhhhhhh--hhhhheecccccCCceEEEEEEEeCcEeechhhe Confidence 345789999999998764 47999999877654 34555444443 4577899999999999999999 Q ss_pred EEEEeecCCCCccc Q lcl|NC_012784. 397 IVIEYDDSERGEGD 410 (415) Q Consensus 397 ~~~~~t~~~~~~~~ 410 (415) +++++++++.+.-- T Consensus 374 ~~l~~ka~~~~~~~ 387 (387) T protein:vir:94 374 RIAKAKENTGPLPS 387 (387) T ss_pred EEEEeecCCCCCCC Confidence 99999875544333 No 51 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=100.00 E-value=5.4e-62 Score=356.46 Aligned_cols=383 Identities=12% Similarity=0.105 Sum_probs=274.2 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDE--LEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVN 78 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~--~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (415) ||+++||++++.++.+++....+++.....++. .+++.++++++++|+++++++++++++++................ T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:96 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCC Confidence 999999999999999998888877665554332 356788888888999888888887776654443322222221111 Q ss_pred chhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeE Q lcl|NC_012784. 79 EARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~ 158 (415) .....+.........+ ............. ............+.++||++||+++.++|++.+++++++++++++. T Consensus 81 ~~~~~~~~~~~~~~~r---~~~~~~~~~~~~~--~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~ 155 (387) T protein:vir:96 81 LSDNEKMVKAKAEFYR---HAILPNEFEKPSM--EAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLT 155 (387) T ss_pred CchhHHHHHHHHHHHH---HHHhhhhHHHHHH--HHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceee Confidence 1111111111111111 1111111111111 1111222334456677899999999999999999999999999998 Q ss_pred EccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 159 RVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAAT 238 (415) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~ 238 (415) ++++. .+|... .+...+.|++||+.++++ .++|+++++.+++++++++||+|+|+||.++|++||.++|+++++++ T Consensus 156 ~~~~~--~~p~~~-~~~~~a~~v~Eg~~~~~~-~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~ 231 (387) T protein:vir:96 156 NIKGL--EIPRVS-YTLDDDDFITDVETAKEL-KAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAK 231 (387) T ss_pred ecCCc--eeeeee-ccCCcccccccccccccc-ccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHH Confidence 87643 455433 345678999999999875 59999999999999999999999999999999999999999999999 Q ss_pred HHHH-HhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCccccc Q lcl|NC_012784. 239 RNKA-IIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQ 317 (415) Q Consensus 239 ~d~~-il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~ 317 (415) ++.. |.+|+|++.+.+... .......++...+++++++++.+..+|+.+++|+||+.+|..+..+++.+|+|+|. T Consensus 232 e~~~~~~~g~g~g~~~g~~~----~~~~~~~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~ 307 (387) T protein:vir:96 232 ERKDALAVSPKSGLEHMSFY----NGSVKEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD 307 (387) T ss_pred HHHhHhhcCCCccccceeee----ccccccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc Confidence 7654 445666655544332 22233344566799999999999999999999999999998887777777888874 Q ss_pred CcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecc-cCceEEEEEEEeccEEeccccE Q lcl|NC_012784. 318 PDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM-HFGECLMIAVRQDCRILDYKSA 396 (415) Q Consensus 318 ~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-~~~~~~~~~~r~d~~v~~p~a~ 396 (415) +.|.+|+|+||+++++++ .++||||+.+|.++ .++.+....+. .+.+.++++.|+|+++++|+|| T Consensus 308 -----~~~~~llG~PV~~~~~~~-------~~~~GDf~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~ 373 (387) T protein:vir:96 308 -----TPAEKVFGKPVVFTDAAV-------KPIVGDFNYFGINY--DGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAF 373 (387) T ss_pred -----cCCccccccceEEecCCC-------ceeeechhhhhhhh--hhhhheecccccCCceEEEEEEEeCcEeechhhe Confidence 345789999999998764 47999999877654 34555444443 4577899999999999999999 Q ss_pred EEEEeecCCCCccc Q lcl|NC_012784. 397 IVIEYDDSERGEGD 410 (415) Q Consensus 397 ~~~~~t~~~~~~~~ 410 (415) +++++++++.+.-- T Consensus 374 ~~l~~ka~~~~~~~ 387 (387) T protein:vir:96 374 RIAKAKENTGPLPS 387 (387) T ss_pred EEEEeecCCCCCCC Confidence 99999875544333 No 52 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=100.00 E-value=1.6e-61 Score=353.88 Aligned_cols=383 Identities=13% Similarity=0.133 Sum_probs=264.8 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHH---HHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccc Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKY---ATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEV 77 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~---~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (415) .+.+++++.++.....+++...++ ....+.+...++++++..+++.++..+...++..+................. T Consensus 142 ~~~l~e~~~~~~~~~~e~k~~~e~~~~e~~e~~~~~~~~~e~l~~~~e~~~~~~~~~~~~~d~~e~~~~~~~~~~~~~~- 220 (543) T protein:vir:81 142 PDSIEDCRFRDPWNLSEMRTFGRDAEEVKGELRARALSAIEKMQGASDNVRAAATKIIERFDDEDSTLARQCLATSSPA- 220 (543) T ss_pred CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhh- Confidence 223444444444333333322222 2222233334445555555555555555544444333222211111100000 Q ss_pred cchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHH-HHHhhhhhhhhcce Q lcl|NC_012784. 78 NEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDIL-KLKEVEFNLDKYVT 156 (415) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii-~~~~~~~~l~~~~~ 156 (415) . ...................+ .+..........+.+.||++||.++++.|+ ..++..++++.+++ T Consensus 221 ----~---~~a~~~~~~~~~~~~l~~~e-------~~~~~~~~~~~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~ 286 (543) T protein:vir:81 221 ----Y---LRAWSKMARNPHAAILTEEE-------KRAINEVRAMGLTKADGGYLVPFQLDPTVIITSNGSLNDIRRFAR 286 (543) T ss_pred ----h---hhHHHHHHHhhHHHHhhhhh-------hhhhhhhhhcccccccCcccCchhhhhHHHHHHHhhhchhhhhcc Confidence 0 00000000000000000000 111122333445677889999999998876 66788899999988 Q ss_pred eEEccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHH Q lcl|NC_012784. 157 VKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIA 236 (415) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~ 236 (415) +.++ ++.+.+++.++.+.+.|++||+.+|+ +.++|+.|++++++++++++||+++++|+ ++|.+||.++|+++++ T Consensus 287 ~~~~---~g~~~~~~~~~~~~a~~v~Eg~~~~~-~~~~~~~i~~~~~k~~~~~~is~ell~d~-~~~~~~i~~~l~~~~~ 361 (543) T protein:vir:81 287 QVVA---TGDVWHGVSSAAVQWSWDAEFEEVSD-DSPEFGQPEIPVKKAQGFVPISIEALQDE-ANVTETVALLFAEGKD 361 (543) T ss_pred cccC---CcceEEEEecCCcceeecccCccccc-cccccceeeeeeeeeEeeehhhHHHHhcc-HHHHHHHHHHHHHHHH Confidence 7654 34566777788899999999999986 57999999999999999999999999998 5899999999999999 Q ss_pred HHHHHHHhhccccc-ccccccccccc--ccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCc Q lcl|NC_012784. 237 ATRNKAIIDVITKG-STGSTSSGFEK--EGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGN 313 (415) Q Consensus 237 ~~~d~~il~g~g~~-~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~ 313 (415) +++|.+||+|+|++ .+.++...... ....+......+++++++++..+...|..+++|+|||++|..|++++|++|+ T Consensus 362 ~~~d~ail~G~Gt~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~l~~lkd~~G~ 441 (543) T protein:vir:81 362 ELEAVTLTTGTGQGNQPTGIVTALAGTAAEIAPVTAETFALADVYAVYEQLAARHRRQGAWLANNLIYNKIRQFDTQGGA 441 (543) T ss_pred HHHHHHHhccCCCCcccccchhhcccccccccccccccccHHHHHHHHHhhhccccCCcEEEEcHHHHHHHHHhhcCCCc Confidence 99999999999986 34444332221 1223334456678999999999999999999999999999999999999999 Q ss_pred ccccCcccCCCCceecceeeEEecccccc-----ccCCceEEEechhhcEEEEeecceEEEEeecc-------cCceEEE Q lcl|NC_012784. 314 YLIQPDVKEKTQQRLLGAKIEILPDEVLG-----QKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM-------HFGECLM 381 (415) Q Consensus 314 ~l~~~~~~~~~~~~l~G~pV~~~~~~~~~-----~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-------~~~~~~~ 381 (415) |||.+ +..+.+++|+|+||+++++||.+ +.++..++||||+. |.+++++++++.++++. ++...++ T Consensus 442 ~l~~~-~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~i~~gd~~~-~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~ 519 (543) T protein:vir:81 442 GLWTT-IGNGEPSQLLGRPVGEAEAMDANWNTSASADNFVLLYGNFQN-YVIADRIGMTVEFIPHLFGTNRRPNGSRGWF 519 (543) T ss_pred eeccC-cCCCCCccccceeeEEeccccccccccccCCcceEEEeeccc-eeEEeecccEEEEeccccccchhhcCceEEE Confidence 99974 55667889999999999999864 34666799999985 77888999999887653 3456789 Q ss_pred EEEEeccEEeccccEEEEEeecCC Q lcl|NC_012784. 382 IAVRQDCRILDYKSAIVIEYDDSE 405 (415) Q Consensus 382 ~~~r~d~~v~~p~a~~~~~~t~~~ 405 (415) ++.|+|+++.+|+||+.+++++++ T Consensus 520 ~~~r~d~~v~~~~A~~~l~~~~~a 543 (543) T protein:vir:81 520 AYYRMGADVVNPNAFRLLNVETAS 543 (543) T ss_pred EEEeeccEeecccceEEEEecccC Confidence 999999999999999999999988 No 53 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=100.00 E-value=4.3e-61 Score=351.53 Aligned_cols=397 Identities=11% Similarity=0.089 Sum_probs=271.2 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccch Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) ||+++|+++++.+..+.+....++.++. .++.....++++++++.+..+++.+++..++................... T Consensus 4 ~~~lee~~a~l~~~~~~~~~~~~~~~~~-~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 81 (419) T protein:vir:94 4 TPTLEEQRAALLARLDDTSLTTEQVQEI-VAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAGTF- 81 (419) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccc- Confidence 5555666666555544444443333332 22222334555566666666665555444333332221111111111000 Q ss_pred hhhhhHHHHHHHHHHHHHhhhhhHHHHHH--HHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeE Q lcl|NC_012784. 81 RTYRNQANINDLGISIQNTKVTSQEVRDF--TEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~ 158 (415) ............................. ..................+++.++|..+...|+..++....+++++++. T Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~ 161 (419) T protein:vir:94 82 RSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQ 161 (419) T ss_pred cchhhhhhhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhccccccccccCCcccccchhhhHHHHHHHhhhhhhhhcceee Confidence 00000000001111111111000000000 0001111111222334455667788888888888889999999999999 Q ss_pred EccCCceeEEEEe------ecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHH Q lcl|NC_012784. 159 RVTNGSGKYPVVR------QSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMA 232 (415) Q Consensus 159 ~~~~~~~~~~~~~------~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la 232 (415) +++++...|+... ....+.+.|++||+.+|+ ++++|+++++++++++++++||+|+++|++ +|++||.++|+ T Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~-~~~~~~~i~~~~~k~~~~~~is~ell~d~~-~l~~~i~~~la 239 (419) T protein:vir:94 162 NADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQ-STLSFDTITTTLKTVAHWLPITRQAADDNS-QLMGYIQGRLT 239 (419) T ss_pred eccCCceeeeeeccccccccccCcccceecCCccccc-cccceeeEEeeeeeEEEeehhhHHHHHhHH-HHHHHHHHHHH Confidence 9887766665432 233456889999999986 569999999999999999999999999986 79999999999 Q ss_pred HHHHHHHHHHHhhcccccccccccccccc-----ccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHh Q lcl|NC_012784. 233 RTIAATRNKAIIDVITKGSTGSTSSGFEK-----EGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKM 307 (415) Q Consensus 233 ~~~~~~~d~~il~g~g~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l 307 (415) +++++++|.+||+|+|++.+.++...... .......+....++++.++++.+..+++.+++|+|||++|..|+++ T Consensus 240 ~a~~~~~d~aii~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~t~~~~~~~l~~~~~~~~~~~~~~~~~v~n~~~~~~l~~~ 319 (419) T protein:vir:94 240 YGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELD 319 (419) T ss_pred HHHHHHHHHHHHhccCcccccceecccccccccccccccccccchhHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHH Confidence 99999999999999999887766533221 1122234455678999999999999999999999999999999999 Q ss_pred hccCCc-ccccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecc-----cCceEEE Q lcl|NC_012784. 308 KDKLGN-YLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM-----HFGECLM 381 (415) Q Consensus 308 kd~~G~-~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-----~~~~~~~ 381 (415) +|++|+ |++++++.++.+++|+|+||++++++|.+ .++||||+++|+++++++++++++++. ++.+.+| T Consensus 320 k~~~~~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~-----~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r 394 (419) T protein:vir:94 320 QAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQG-----TALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVIL 394 (419) T ss_pred hhcCCCceeecCCcccCCCccccceeeEEcCCCCCc-----cEEEeeccceEEEEEecceEEEEeccccchhhcCcEEEE Confidence 998766 46778888899999999999999999854 379999999889999999999987653 4567799 Q ss_pred EEEEeccEEeccccEEEEEeecCCC Q lcl|NC_012784. 382 IAVRQDCRILDYKSAIVIEYDDSER 406 (415) Q Consensus 382 ~~~r~d~~v~~p~a~~~~~~t~~~~ 406 (415) ++.|+|+++++|+||++++++++++ T Consensus 395 ~~~r~d~~v~~~~a~~~~~~~aa~~ 419 (419) T protein:vir:94 395 AEFRANLAVYQPKAFVRVTFAAATT 419 (419) T ss_pred EEEeeccEEeccccEEEEEeccCCC Confidence 9999999999999999999999988 No 54 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=100.00 E-value=1.3e-61 Score=354.45 Aligned_cols=383 Identities=13% Similarity=0.105 Sum_probs=273.0 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDE--LEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVN 78 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~--~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (415) ||++.||++++.++.+++++..+++.+...++. .++++++++++++|+++++++++++++++................ T Consensus 16 mk~l~el~~~~~e~~~~~~~~~~el~~~~~~~~~~~ee~~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 95 (402) T protein:vir:93 16 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQS 95 (402) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCC Confidence 999999999999999998888877766555433 356788888888999888888888776655443222222211111 Q ss_pred chhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeE Q lcl|NC_012784. 79 EARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~ 158 (415) .....+.........+...... ....... ............+.++||++||+++.+.|++.++++++++++|+++ T Consensus 96 ~~~~~~~~~~~~~~~r~~~~~~----~~~~~~~-~~~~~~~a~~~~t~~~GG~lIP~~~~~~Ii~~~~~~~~l~~~~~v~ 170 (402) T protein:vir:93 96 LSDNEKMVKAKAEFYRHAILPN----EFEKPSM-EAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLT 170 (402) T ss_pred CchhHHHHHHHHHHHHHHHhhh----hHHHHHH-hHHHHHhhhccCCCcCCccccchhHHHHHHHhHHhhhhhhhhceee Confidence 1111111111111111111000 0000000 0111223344556677899999999999999999999999999998 Q ss_pred EccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 159 RVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAAT 238 (415) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~ 238 (415) ++++. .+|... .+...+.|++|++.++++ .++|++|++.+++++++++||+|+|+||.++|++||.++|+++++.+ T Consensus 171 ~~~~~--~~p~~~-~~~~~a~~v~Eg~~~~~~-~~~f~~i~~~~~k~~~~i~iS~ell~Ds~~~l~~~i~~~la~~~~~~ 246 (402) T protein:vir:93 171 NIKGL--EIPRVS-YTLDDDDFITDVETAKEL-KAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAK 246 (402) T ss_pred ecCCc--eeeeee-ccCCcccccccccccccc-ccccceeeecceeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHH Confidence 87643 444432 345678999999999875 59999999999999999999999999999999999999999999999 Q ss_pred HHHH-HhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCccccc Q lcl|NC_012784. 239 RNKA-IIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQ 317 (415) Q Consensus 239 ~d~~-il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~ 317 (415) ++.. |.+|+|++.+.+.... ......++...+|+++++++++..+|+.+++|+||+.++..+..+++.+|+|+|. T Consensus 247 e~~~~~~~g~g~g~p~g~~~~----~~~~~~~~~~~~d~l~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~d~~~~~~~ 322 (402) T protein:vir:93 247 ERKDALAVSPKSGLEHMSFYN----GSVKEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD 322 (402) T ss_pred HHHhHhhcCCCccccceeeec----cccccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc Confidence 7654 5566666655443322 2223344556789999999999999999999999999988877666666777774 Q ss_pred CcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeec-ccCceEEEEEEEeccEEeccccE Q lcl|NC_012784. 318 PDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDY-MHFGECLMIAVRQDCRILDYKSA 396 (415) Q Consensus 318 ~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~-~~~~~~~~~~~r~d~~v~~p~a~ 396 (415) +.|.+|+|+||+++++++ .++||||+++|.++++ +.+....+ ..+.+.++++.|+|++|++|+|| T Consensus 323 -----~~~~~llG~PV~~t~~~~-------~i~~GDf~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~ 388 (402) T protein:vir:93 323 -----TPAEKVFGKPVVFTDAAV-------KPIVGDFNYFGINYDG--TTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAF 388 (402) T ss_pred -----cCCccccccceEEecCCC-------ceeeechhhhhhhhhh--hhhhhhhcccCCceEEEEEEEeCcEEechhhe Confidence 345799999999998764 4799999998776543 44443333 34577899999999999999999 Q ss_pred EEEEeecCCCCccc Q lcl|NC_012784. 397 IVIEYDDSERGEGD 410 (415) Q Consensus 397 ~~~~~t~~~~~~~~ 410 (415) +++++++++.+--- T Consensus 389 ~~l~ik~~~~~~~~ 402 (402) T protein:vir:93 389 RIAKAKENTGPLPS 402 (402) T ss_pred EEEEeecCCCCCCC Confidence 99999865333222 No 55 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=100.00 E-value=2.7e-61 Score=352.60 Aligned_cols=382 Identities=12% Similarity=0.109 Sum_probs=271.3 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDE--LEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVN 78 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~--~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (415) ||+++||++++.++++++....+++.....+++ .++++++++++++|+++++.++++++++................. T Consensus 1 Mk~l~el~~~~~e~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:93 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVKDIEEKEKAKVKDTGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccCCC Confidence 999999999999999999888887776665433 456788888999999999988887776554332222222111111 Q ss_pred chhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeE Q lcl|NC_012784. 79 EARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~ 158 (415) .....+.........+ .... ....... ..............+.++||++||+++.+.|++.++++++++++|+++ T Consensus 81 ~~~~~~~~~~~~~~~r---~~~~-~~~~~~~-~~~~~~~~~al~~~t~s~gG~~IP~~~~~~Ii~~~~~~~~l~~~~~v~ 155 (387) T protein:vir:93 81 LNDHEKMVKAKAEFYR---HAIL-PNEFEKP-SMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLT 155 (387) T ss_pred cchhhHHHHHHHHHHH---HHhh-hhhhhhh-hhhhHHHHHhhccCcCCCCceeechhHHHHHHHHHHhhchhhhheeee Confidence 1111111111111111 1110 0010100 011112233445567778899999999999999999999999999998 Q ss_pred EccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 159 RVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAAT 238 (415) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~ 238 (415) ++++. .+|.. ..+.+.+.|++|++..+++ .++|++|++.+++++++++||+|+|+||.++|++||.++|+++++++ T Consensus 156 ~~~~~--~~p~~-~~~~~~a~~v~E~~~~~~~-~~~f~~v~~~~~k~~~~~~iS~ell~Ds~~~l~~~i~~~la~~~~~~ 231 (387) T protein:vir:93 156 NIKGL--EIPRV-SYTLDDDDFITDVETAKEL-KLKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAK 231 (387) T ss_pred ecCCc--eEEEE-eecCCccccccCccccccc-ccccceeeeeheeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHH Confidence 87643 44443 2345678999999999875 69999999999999999999999999999999999999999999999 Q ss_pred HHHH-HhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHH-HhhccCCcccc Q lcl|NC_012784. 239 RNKA-IIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLD-KMKDKLGNYLI 316 (415) Q Consensus 239 ~d~~-il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~-~lkd~~G~~l~ 316 (415) ++.. |.+|+|++.+.+.... ......++...+|+++++++.+..+|+.+++|+||+.+|..+. +++|.+| |+| T Consensus 232 e~~~~~~~g~g~g~p~g~l~~----~~~~~v~~~~~~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~~~~~~~d~~~-~~~ 306 (387) T protein:vir:93 232 ERKDALAVSPKSGLDHMSFYN----GSVKEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTT-NFF 306 (387) T ss_pred HHHhHhhcCCCccccceeeec----cccccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCC-ccc Confidence 8764 5567766665444322 2223344556789999999999999999999999999987755 5566555 444 Q ss_pred cCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeec-ccCceEEEEEEEeccEEecccc Q lcl|NC_012784. 317 QPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDY-MHFGECLMIAVRQDCRILDYKS 395 (415) Q Consensus 317 ~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~-~~~~~~~~~~~r~d~~v~~p~a 395 (415) . +.|.+|+|+||+++++++ .++||||+.+|.++ .++.+....+ ......+++..|+|+++++|+| T Consensus 307 ~-----~~~~~llG~PV~~~~~~~-------~~~~GDf~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~eA 372 (387) T protein:vir:93 307 D-----TPAEKVFGKPVVFTDAAV-------KPIVGDFNYFGINY--DGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSA 372 (387) T ss_pred c-----cCCccccccceEEecCCC-------ceeeeehhhhheeh--hhheeeecccccCCceeEEEEeeeCceeechhh Confidence 3 345799999999998764 37999999877654 4455544433 3446678899999999999999 Q ss_pred EEEEEeecCCCCccc Q lcl|NC_012784. 396 AIVIEYDDSERGEGD 410 (415) Q Consensus 396 ~~~~~~t~~~~~~~~ 410 (415) |+++++++++.+.-. T Consensus 373 ~~~l~~k~~~~~~~~ 387 (387) T protein:vir:93 373 FRIAKAKENTGSLPS 387 (387) T ss_pred eEEEEeecCCCCCCC Confidence 999999865444333 No 56 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=100.00 E-value=4.7e-61 Score=351.32 Aligned_cols=397 Identities=14% Similarity=0.143 Sum_probs=270.0 Q ss_pred CChHHHHHHHHHHHHHHHHHHH--HHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccc--- Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKV--KYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSV--- 75 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~--~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~--- 75 (415) || ++||++++.++.+++++.. .+..+.+++++.++++++++++++|+++|+++++..+.........+...... T Consensus 1 M~-i~eL~e~r~~~~~~~~~l~~~~~e~~~lt~ee~~~~~~l~~ei~~l~~~I~~~e~~~~~~~~~~~~~~~~~~~~~~~ 79 (435) T protein:vir:14 1 MN-VNELRRERAAVNQRVQALAQIEVGGTALSVEQQAEFDQLSSKFSELTAQIERAEAAERMAAAAAVPVDPNPTAVAAP 79 (435) T ss_pred CC-HHHHHHHHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccchhhhhhhc Confidence 76 4567777777766665543 33456788889999999999999999999987765433222111111000000 Q ss_pred -----cccchhhhhhHHHHHHHHHHHHHhhhhhHH--HHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhh Q lcl|NC_012784. 76 -----EVNEARTYRNQANINDLGISIQNTKVTSQE--VRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVE 148 (415) Q Consensus 76 -----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~ 148 (415) ................+............. ...................+...||++||+++.++|++.+++. T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~l~~~ 159 (435) T protein:vir:14 80 AAAPVHAQPKALEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRPK 159 (435) T ss_pred cccccccccchhhhhHHHHHHHHHHHHhhcchhhHHHHHHHhhhhhhhhhhhcccCCcCCCccccchhHHHHHHHHHhhh Confidence 000000000000000111111111000000 0111111112222334455667789999999999999999999 Q ss_pred hhhhhc-ceeEEccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcch--HHHHH Q lcl|NC_012784. 149 FNLDKY-VTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK--VNVLQ 225 (415) Q Consensus 149 ~~l~~~-~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~--~~l~~ 225 (415) ++++.+ +++++++++ .+.+++.++++.+.|++|++.+|+ ++++|+.|++.++|++++++||+|+++|+. .+|++ T Consensus 160 ~~i~~~~~~~~~~~~~--~~~~p~~~~~~~a~~v~E~~~~~~-~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~~~l~~ 236 (435) T protein:vir:14 160 SVVRKLGARTLPLSNG--NITIPRLKGGAIVGYIGADTDIPT-TQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVDQ 236 (435) T ss_pred chhhhhcceeeecCCC--ceEEEEEeCCcceeeeccCccccc-cccceeEEEeeeEEEEEeehhhHHHHHhhccCHHHHH Confidence 999987 666666554 455666778888999999999986 468999999999999999999999999985 46999 Q ss_pred HHHHHHHHHHHHHHHHHHhhcccccc-ccccccccccccc---cccccchhhHHHHHHHHHHhhhh--ccCCCEEEEcHH Q lcl|NC_012784. 226 ELKLWMARTIAATRNKAIIDVITKGS-TGSTSSGFEKEGK---KLEVKKAKSLDDIKDAINLNVKP--NYEHNVAIVSQT 299 (415) Q Consensus 226 ~l~~~la~~~~~~~d~~il~g~g~~~-~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~v~~~~ 299 (415) ||.++|++++++++|.+|++|+|++. +.++......... ....+......++.+++..+... ++.+++|+|||. T Consensus 237 ~i~~~l~~ai~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~v~n~~ 316 (435) T protein:vir:14 237 IVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPSNVITASDASTLQKIETDLGKVILALENADANLTQPGWIMAPR 316 (435) T ss_pred HHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccceeccccccchhhHHHHHHHHHHHhhhccccccCCEEEEcHH Confidence 99999999999999999999999864 4333222111111 11122222345666666666543 667899999999 Q ss_pred HHHHHHHhhccCCcccccCcccCCCCceecceeeEEecccccc--cc-CCceEEEechhhcEEEEeecceEEEEeeccc- Q lcl|NC_012784. 300 MFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLG--QK-GNNTLIIGNLKDAIVLFDRSQYQASWTDYMH- 375 (415) Q Consensus 300 ~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~--~~-~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~- 375 (415) +|..|+++||++|+|+|. +. .+++|+|+||++++.+|.. .. ....++||||++ +++++|+++++.++++.. T Consensus 317 ~~~~L~~lkd~~G~~l~~-~~---~~g~l~G~Pv~~~~~~p~~~~~~~~~~~i~~gd~s~-~~i~~~~~~~~~~~~~~~~ 391 (435) T protein:vir:14 317 TFRFLEGLRDGNGNKVYP-EL---ANGMLKGYPVGKTTQVPINLGETGKESEIYFTDFGD-VFIGEEETLEIDYSKEATY 391 (435) T ss_pred HHHHHHHhhccCCceecc-CC---CCCeeecceeEeeccccccccCCCccceEEEeeccc-EEEEEecccEEEEeccccc Confidence 999999999999999994 33 2458999999999999863 22 334689999998 557899999999987643 Q ss_pred -------------CceEEEEEEEeccEEeccccEEEEEeecCCCCccc Q lcl|NC_012784. 376 -------------FGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGD 410 (415) Q Consensus 376 -------------~~~~~~~~~r~d~~v~~p~a~~~~~~t~~~~~~~~ 410 (415) +.+.+|+++|+|+++++|+||+.++-.+ .|. T Consensus 392 ~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~----~~~ 435 (435) T protein:vir:14 392 KDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLAGVA----WGA 435 (435) T ss_pred cccccchhhhhhcChhheeeeeeeCceeecccceEEEecCC----CCC Confidence 4567899999999999999999988332 222 No 57 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=100.00 E-value=3e-60 Score=346.87 Aligned_cols=368 Identities=13% Similarity=0.113 Sum_probs=258.1 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHH---hhchHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccc Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATR---ALNNDELEKAEK-LEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVE 76 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~---~~~e~~~~~~~~-~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (415) |. +.|++++++++.++++++.++..+ ...+....+... -..++++++.++..++++++++.++....... ... T Consensus 1 m~-~~e~~~~~~~~~~~l~~~~~~~~~e~~~~~e~~~~~~~~~~~~~~~e~~~~~~~l~~~~~~~e~~~~~~~~~--~~~ 77 (379) T protein:vir:10 1 ME-ALEIKVALEAIKGQVDSKSSAQALEVKGLIEALEAKMTSEKDLAVNELKSDMAALQAHADKLDVKLKEKAKS--EDK 77 (379) T ss_pred CC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc--ccc Confidence 87 455777776766666554432221 111111111111 12233444444444444444333322211110 000 Q ss_pred ccchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcce Q lcl|NC_012784. 77 VNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVT 156 (415) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~ 156 (415) .. ... ......... ... ................+.+.++.++|+++...|++.++..++++++|+ T Consensus 78 ~~---~~~-----~~~~~~~~~----~~~---~~~~~~~~~~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~ 142 (379) T protein:vir:10 78 SD---SLV-----KSITENFND----IKE---VRNGKSIQVKAVGDMTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVG 142 (379) T ss_pred ch---hHH-----HHHHHHHHh----HHH---HHhhhhhhhhhhcccccCCCCccccchhhhhHHHHhHHhhhhHHhhce Confidence 00 000 000000000 000 000000111112223444555567999999999999999999999999 Q ss_pred eEEccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHH Q lcl|NC_012784. 157 VKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIA 236 (415) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~ 236 (415) ++++++++..|+.....+.+.+.|++||+.+|++ .++|++|++.++|++++++||+|+|+|++ ++.+||.++|+++++ T Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~-~~~f~~i~~~~~k~~~~~~iS~ell~D~~-~l~~~i~~~la~~~~ 220 (379) T protein:vir:10 143 AVSISGGTYTFVRENGAGEGAIGAQVEGATKGQK-DYDISMIDVNTDFIAGFTRYSKKMANNLP-FLTSFIPNALRRDYA 220 (379) T ss_pred eeeccCCceEEEEeecCCCcccccccCCcccccc-ccceeeeEeeeeeEEeeehhhHHHHhhHH-HHHHHHHHHHHHHHH Confidence 9999888878777766667788999999999975 68999999999999999999999999997 599999999999999 Q ss_pred HHHHHHHhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccc Q lcl|NC_012784. 237 ATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLI 316 (415) Q Consensus 237 ~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~ 316 (415) ++++.+|+.|+|+..+.+. ...++..++++++++++++..+++.+++|+|||++|..|+++||++|+|+| T Consensus 221 ~~~~~~~~~g~~~~~~~~~----------~~~~~~~~~d~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~ 290 (379) T protein:vir:10 221 KAENAAFNAVLAANATAST----------EIITNKNKVEMLINEIAKQENLDFPVTAIVLRPTDYYDILVTQKSVGAGYG 290 (379) T ss_pred HHHHHHHhccccccccccc----------ccccCcccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCceec Confidence 9999999998876533221 123345567899999999999999999999999999999999999999999 Q ss_pred cCccc--CCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEee-----cccCceEEEEEEEeccE Q lcl|NC_012784. 317 QPDVK--EKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----YMHFGECLMIAVRQDCR 389 (415) Q Consensus 317 ~~~~~--~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~-----~~~~~~~~~~~~r~d~~ 389 (415) ++++. .+.+.+|||+||++++.||.+ .++||||++++ ++.|+++++++++ |.++.+.+|++.|+|++ T Consensus 291 ~~~~~~~~~~~~~l~G~pvv~s~~~~ag-----~~~~gdf~~~~-~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~R~~~~ 364 (379) T protein:vir:10 291 LPGVVTQDNGVLRINGIPLFRATWLAAN-----KYYVGDWTRVT-KVTTEGLSLEFSEVEGTNFVKNNITARIEAQVALA 364 (379) T ss_pred cCCccCCCCCcceecceeeEecCCCCCC-----ceEEeecccEE-EEEEeceEEEEeecccccccCCcEEEEEEEEeccE Confidence 88764 466679999999999999864 37999999865 4567888887654 44566789999999999 Q ss_pred EeccccEEEEEeecC Q lcl|NC_012784. 390 ILDYKSAIVIEYDDS 404 (415) Q Consensus 390 v~~p~a~~~~~~t~~ 404 (415) |++|+|||+++|++- T Consensus 365 v~~p~a~v~~~~~~~ 379 (379) T protein:vir:10 365 VEQPAALIFGDFTAV 379 (379) T ss_pred EecCccEEEEEecCC Confidence 999999999999998 No 58 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=100.00 E-value=4.6e-62 Score=356.83 Aligned_cols=284 Identities=24% Similarity=0.295 Sum_probs=255.9 Q ss_pred hhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecC-Ccccccccccccccccccccc Q lcl|NC_012784. 117 NDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSE-VAALEKVEELEENPELAVKPF 195 (415) Q Consensus 117 ~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~-~~~a~~v~Eg~~~~~~~~~~f 195 (415) ........+.++||++||++++++|++.+++.++++++++++++++.++++++++... .+.+.|++|++++|+++.++| T Consensus 1 ~l~~~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~ 80 (293) T protein:vir:48 1 MLDSKTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADIDDPKL 80 (293) T ss_pred CceeecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCcceeeecCCcccccccccce Confidence 3334445577788999999999999999999999999999999999999999988764 567899999999998778999 Q ss_pred eeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhhHH Q lcl|NC_012784. 196 FQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLD 275 (415) Q Consensus 196 ~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 275 (415) ++++++++|++++++||+|+++|+.+++++||.++|++++++++|++|++|+++..+. .+..+++ T Consensus 81 ~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~~---------------~~~~~~d 145 (293) T protein:vir:48 81 SLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGVVDKLPTK---------------PTLTKWD 145 (293) T ss_pred eEEEEeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhcccccccc---------------ccccCHH Confidence 9999999999999999999999999999999999999999999999999988765431 2445699 Q ss_pred HHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCcccCCCCceecceeeEEeccc--cccccCCceEEEec Q lcl|NC_012784. 276 DIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDE--VLGQKGNNTLIIGN 353 (415) Q Consensus 276 ~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~--~~~~~~~~~~~~gd 353 (415) ++++++.++..+++.+++|+|||++|..|++|||++|||+|++++.++.+++|+|+||+++++. |....++..++||| T Consensus 146 ~i~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~l~G~Pv~~~~~~~~~~~~~~~~~~~~gd 225 (293) T protein:vir:48 146 DIIDLEAKVDPAIKQTSFFLTNTSGFTALKKVKNALGDYLMERDVKSPTGYSIAGFAVKEISDRWLPNASSGVMPLYFGD 225 (293) T ss_pred HHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceEeecCcCCCCCceecceeeEEecccccCCccCCceEEEEEe Confidence 9999999999999999999999999999999999999999999999999999999999987654 44566777899999 Q ss_pred hhhcEEEEeecceEEEEeec-----ccCceEEEEEEEeccEEeccccEEEEEeecCCCCcccccccC Q lcl|NC_012784. 354 LKDAIVLFDRSQYQASWTDY-----MHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) Q Consensus 354 ~~~~~~~~~~~~~~i~~~~~-----~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~~~~~~~~~~~~ 415 (415) |+++|++++|++++++.+++ .++.+.+|++.|+|+++.+|+||+.+++++++.+.|+++++| T Consensus 226 ~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~~~~~~~~~~ 292 (293) T protein:vir:48 226 LKQAVTLFDRQQMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIADQKGNIGSTA 292 (293) T ss_pred ccceEEEEEecceEEEEecccchhhhcCeEEEEEEEeeCcEEecccceEEEEeeccccCCccccccC Confidence 99999999999999988753 455678999999999999999999999999999999999999 No 59 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=100.00 E-value=1.9e-58 Score=337.02 Aligned_cols=407 Identities=10% Similarity=0.065 Sum_probs=254.6 Q ss_pred CCh-HHHHHHHHHHHHHHHHHHHHHHHHhhchHH--------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhc Q lcl|NC_012784. 1 MKT-KEELQSEISDIKRQIDLKVKYATRALNNDE--------LEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENN 71 (415) Q Consensus 1 Mk~-~~el~~~l~~l~~~~~~~~~~~~~~~~e~~--------~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~ 71 (415) |.+ ++||++++.+|+++..+..++++..+++.+ .++.+++.+++++++++++.+++..+++++........ T Consensus 1 ~~k~~eem~~~i~eL~e~r~~l~~e~~~l~d~ak~e~~~~~~~~e~~e~~a~~~el~~ei~~le~~~~~~~~~~~~~~~~ 80 (477) T protein:vir:84 1 MEKHLEELRALRAAAVEAVATLKAERQAIADGAKAEERAALSADETAEFRAKSASIKAELDKVEDLDEQIRELESEIERS 80 (477) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 654 667888888887777666655554443221 11222333444555555555444333332221111100 Q ss_pred ------ccc----ccccchhhhhhHHHHHHHHHHHHHhhh----------------hhHHHHHHHHHHhhhhhhhhcccc Q lcl|NC_012784. 72 ------QQS----VEVNEARTYRNQANINDLGISIQNTKV----------------TSQEVRDFTEYLETRNDIQGGSLK 125 (415) Q Consensus 72 ------~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~----------------~~~~~~~~~~~~~~~~~~~~~~~~ 125 (415) ... ........................... .......................+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (477) T protein:vir:84 81 GKLEAETKTVRKATVEVNEALTYEKGNGQSYFRDLAMQTVGMADEPAKERLRRHMVDVESDKEIRKIAKVGEEYRDLDRN 160 (477) T ss_pred hcchhhhhhhcccccccccchhhhhhHHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHhhhhhcccccc Confidence 000 000000000000000000000000000 000000011111122223333445 Q ss_pred cccceeecchh-HHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCc-ccccccccccc-----cccccccceee Q lcl|NC_012784. 126 TDSGFVVIPEE-IVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVA-ALEKVEELEEN-----PELAVKPFFQL 198 (415) Q Consensus 126 ~~~~~~~vP~~-~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~-~a~~v~Eg~~~-----~~~~~~~f~~v 198 (415) ...||++||++ +.+.|++.+++.++++++++++++++.++++.+|+..+++ .+.|++||+.. |+ ++++|+.+ T Consensus 161 ~~~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~-s~~~f~~i 239 (477) T protein:vir:84 161 GGTGGYAVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTGTSTAIQAADNAALTAPSAHE-VDLTDGFV 239 (477) T ss_pred CCCcceeeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecCcceeeeeccCcccccccccc-cccceeeE Confidence 56677788776 5678999999999999999999999888888888866554 46799998643 43 46789999 Q ss_pred EeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccc-ccccccccccccccccc------ccch Q lcl|NC_012784. 199 AYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKG-STGSTSSGFEKEGKKLE------VKKA 271 (415) Q Consensus 199 ~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~-~~~~~~~~~~~~~~~~~------~~~~ 271 (415) +++++|++++++||+|||+|+.+++++||.++|+++++.++|.+||+|+|++ .|.++............ .... T Consensus 240 ~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~~p~Gi~~~~~~~~~~~~~~~~t~~~~~ 319 (477) T protein:vir:84 240 QANVKTIAGQQGIAIQLLDQAAVSVDEFVFRDLAADYANKLNVQVISGTGSNNQVVGVRATAGITQVTATSAGSALEKHQ 319 (477) T ss_pred EEeeeeEEeeeHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHhccCCCCCccceeeeccccccccccccccchhhHH Confidence 9999999999999999999999999999999999999999999999999975 44444332211111111 1112 Q ss_pred hhHHHHHHHHHHhhhhccCC-CEEEEcHHHHHHHHHhhccCCcccccCc-------------ccCCCCceecceeeEEec Q lcl|NC_012784. 272 KSLDDIKDAINLNVKPNYEH-NVAIVSQTMFAKLDKMKDKLGNYLIQPD-------------VKEKTQQRLLGAKIEILP 337 (415) Q Consensus 272 ~~~~~~~~~~~~~~~~~~~~-~~~v~~~~~~~~l~~lkd~~G~~l~~~~-------------~~~~~~~~l~G~pV~~~~ 337 (415) ..++++++++..+...+..+ ++|+|||.+|..|+++||++|||||+++ +..+.+++|+|+||++++ T Consensus 320 ~~~~~i~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G~pVv~s~ 399 (477) T protein:vir:84 320 IIYQKIADAIQRVHTSRFLEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHGLPVVTDP 399 (477) T ss_pred HHHHHHHHHHhhccccccCCccEEEEcHHHHHHHHHhhccCCCeeeecCcccccccccccccccccccchhcccceEecC Confidence 23556677777766666654 5799999999999999999999999865 344556799999999999 Q ss_pred ccccc--cc-CCceEEEechhhcEEEEeecceEEEEeecccC---ceEEEEEEEeccEE-eccccEEEEEeecCCCCccc Q lcl|NC_012784. 338 DEVLG--QK-GNNTLIIGNLKDAIVLFDRSQYQASWTDYMHF---GECLMIAVRQDCRI-LDYKSAIVIEYDDSERGEGD 410 (415) Q Consensus 338 ~~~~~--~~-~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~---~~~~~~~~r~d~~v-~~p~a~~~~~~t~~~~~~~~ 410 (415) .||.. .. +...++||||++ ++++. .++++..+++.+. ...++++.++++.. .+|+||+.++.++.+++-=. T Consensus 400 ~~p~~~~~~~d~~~i~~gd~~~-~~i~~-~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~r~~~afv~~t~~~~~~~~~~ 477 (477) T protein:vir:84 400 TLPTTLGTGTDQDVIHVLRASD-LALFE-SSVRMRALQETRAENLSVLLQVYGYLAFTAARFPQSVVEIGGTALTAPTFA 477 (477) T ss_pred cccccccccCCcceEEEEEece-EEEEe-eceeEEeccccccccceeeeeehhhhhhhhhccccceEEeecccccccccC Confidence 99964 22 234689999987 45554 4677776665433 33466666677655 45999999999975444222 No 60 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=100.00 E-value=2.7e-58 Score=336.18 Aligned_cols=398 Identities=13% Similarity=0.076 Sum_probs=261.5 Q ss_pred CCh---HHHHHHHHHHH---HHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccc Q lcl|NC_012784. 1 MKT---KEELQSEISDI---KRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQS 74 (415) Q Consensus 1 Mk~---~~el~~~l~~l---~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~ 74 (415) |+. ++++++++.++ .+++.++..+..+.+++++.++++++..+++.|+.+|++.++...+.............. T Consensus 193 ~~~~e~i~~l~~~ra~~~~~~~~l~~~a~~~g~~l~aee~~~~d~l~aei~~l~~~i~r~e~~e~~~a~~a~pv~~~~~~ 272 (645) T protein:vir:93 193 MNIGEQIKSFENKRAALAASLEEVMTKAAEEGRTLDVEEEEHYDNTAAEIRQVDAHLKRLRELEAGKAATAQPVKQAGNG 272 (645) T ss_pred cchhhhhhhhhHHHHHHHHHhhhhhhhHhhhccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc Confidence 332 23334444433 333444444455678888889999999999999999988765432221111100000000 Q ss_pred c----cccc----hhhhhhHHHHHHHHHHHHHhhhhhHHHH-----------HHHHHHhhhhhhhhcccccccceeecch Q lcl|NC_012784. 75 V----EVNE----ARTYRNQANINDLGISIQNTKVTSQEVR-----------DFTEYLETRNDIQGGSLKTDSGFVVIPE 135 (415) Q Consensus 75 ~----~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~vP~ 135 (415) . .... ............................ .................+...||+++|+ T Consensus 273 ~~~~~~~~~~~~~~~~~~kg~~f~~~~~al~~~~g~~~~a~e~a~~~~~~~~~~~~~~~~a~~~~~~~~~~~~Gg~~vp~ 352 (645) T protein:vir:93 273 NVAAVASAPVIRVEQKLDKGIGFARFAKSLAAAKGVRSEALEVARRQYPDDSRLHHVLKSAVGAGTTTDPQWAGSLSEYQ 352 (645) T ss_pred ccccccccccccchhhhhhhhhHHHHHHHHHhcccchhHHHHHHHhhcccchhhhhhhhhhhhccccccccccCCccCch Confidence 0 0000 0000000000000000000000000000 0001111111122223344568899999 Q ss_pred hHHhHHHHHHhhhhhhhhcceeEEcc--CCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhH Q lcl|NC_012784. 136 EIVTDILKLKEVEFNLDKYVTVKRVT--NGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISR 213 (415) Q Consensus 136 ~~~~~Ii~~~~~~~~l~~~~~~~~~~--~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~ 213 (415) ++..+|++.+++.++++.++.....+ ...+.+.+++.++++.++|++||+.+|++ .++|++++++++|++++++||+ T Consensus 353 ~~~~~ii~~l~~~svv~~l~~~~~~~~~~~~~~~~ip~~t~~~~a~wv~Eg~~~~~s-~~~f~~v~l~~~kla~~~~iS~ 431 (645) T protein:vir:93 353 EYAQDFIDYLRPQTIIGRFGQGGIPALRQVPFNIRVHAQVSGGAAGWVGEGKTKPLT-KFDFESITFSHAKVSAIAVLTE 431 (645) T ss_pred hhHHHHHHhhhhhhhHHhhccccccccccccCceeeeeeecCcceEEeccCcccccc-ccceeEEEEeeEEEEEeehhHH Confidence 99999999999999999887543222 12346788888999999999999999965 6899999999999999999999 Q ss_pred HHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhcc--CC Q lcl|NC_012784. 214 EAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNY--EH 291 (415) Q Consensus 214 e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~ 291 (415) |||+|+.+++++||.++|++++++++|.+||+|+|++.....+.+....... ..+......++..++..+..++. .+ T Consensus 432 ell~ds~~~~~~~i~~~l~~aia~~~d~a~l~g~g~~~~~~~p~gi~~~~~~-~~~~~~~~~d~~~~~~~~~~a~~~~~~ 510 (645) T protein:vir:93 432 ELIRFSSPAADALVRNALAEAVVARLDTDFVDPKKAAVADVSPASITHDVKG-TASSGNPDADAEAAFGQFVAANLQPTG 510 (645) T ss_pred HHHhhchHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccCCccccceeccccc-cccccchHHHHHHHHHHHHhcCCCccc Confidence 9999999999999999999999999999999998876543344443332222 22233455678888877766543 46 Q ss_pred CEEEEcHHHHHHHHHhhccCCcccccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEe Q lcl|NC_012784. 292 NVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWT 371 (415) Q Consensus 292 ~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~ 371 (415) ++|+|||.++.+|+++||++|+|+| +++ ...+++|+|+||++++++|.. +++|||++ +.++++.++.+.++ T Consensus 511 a~~vmn~~~~~~L~~lkd~~G~~~~-~~~-~~~~~tL~G~PV~~s~~vp~~------~~~gd~s~-~~ig~~~~v~i~~s 581 (645) T protein:vir:93 511 AVWLMSSTNALALSMRKNALGQKEY-PDM-TLLGGSFQGLPVIVSQYVGDQ------LVLVNAPD-IYLADDGGVAVDMS 581 (645) T ss_pred cEEEEcHHHHHHHHhccccCCceee-cCC-CCCCceeeceeeEEeccCCcc------eeEecccc-EEEEEecceEEEee Confidence 7999999999999999999999998 344 334569999999999999742 68899987 55667788887765 Q ss_pred ecc-------------------------cCceEEEEEEEeccEEeccccEEEEEeec-CCCCcc Q lcl|NC_012784. 372 DYM-------------------------HFGECLMIAVRQDCRILDYKSAIVIEYDD-SERGEG 409 (415) Q Consensus 372 ~~~-------------------------~~~~~~~~~~r~d~~v~~p~a~~~~~~t~-~~~~~~ 409 (415) ++. .+...+|+++|+|+++.+|+||++++=.. -+..|| T Consensus 582 ~~a~~~~~~~~~~~~~~~~~~~~v~lf~~d~vaira~~r~d~~~~~p~a~~~lt~~~~g~~~~~ 645 (645) T protein:vir:93 582 REASLEMQSEPTGDSTTPSPVELVSMFQTGSVAIRAERWINWRRRRTAAVAVITGVNYGSASGG 645 (645) T ss_pred cceeEEEeecccccccccccccchhHhhcCceEEEEEEEEcceeeCccceEEEecccCCcccCC Confidence 432 34556899999999999999999988221 223333 No 61 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=100.00 E-value=1.4e-55 Score=321.36 Aligned_cols=362 Identities=9% Similarity=0.049 Sum_probs=245.8 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccch Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) ||.++|+++++.++++++...++...+. ++.. +++.+....+..++....+.. ....... T Consensus 1 ik~L~e~~~e~~e~~~~~~~~~~~~~~~--~e~~---~~~~~~~~~~~~~~~~~~~~~---~~~~~~~------------ 60 (390) T protein:vir:40 1 MNNLDKKDSETLNISTAFLNAIKEGATE--AEQV---TAFTNMAEQIQNNIIAQARKE---VNREMND------------ 60 (390) T ss_pred CchHHHHHHHHHHHHHHHHHHHhhhhhH--HHHH---HHHHHHHHHHHHHHHHHHHHH---HHHHHHH------------ Confidence 9999999998888777655443322211 1111 111111111111111100000 0000000 Q ss_pred hhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEc Q lcl|NC_012784. 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~ 160 (415) ......+.......+.+.+. .......+.+++|++||+++.++|++.++..++++++|+++++ T Consensus 61 ----------~~~~~~~~~~~l~~~~r~~~-------~~~~~~~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~ 123 (390) T protein:vir:40 61 ----------NNVLASRGANALTSDESKYY-------NEVIAGNGFAGVTALLPPTVFERVFEDLTVEHPLLSKINFVNT 123 (390) T ss_pred ----------HHHHHhcCchhccHHHHHHH-------HHHHhccCcccCcccccHHHHHHHHHHHHhhhhhhhhceeeec Confidence 00000000000000111110 1112234567889999999999999999999999999999998 Q ss_pred cCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) Q Consensus 161 ~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d 240 (415) +++... +++.++.+.+.|++|++.+++.++++|+++++.+|+++++++||+|+++|+.+++++||.++|+++++.+++ T Consensus 124 ~~~~~~--i~~~~~~~~a~~~~E~~~~~~~~~~~f~~i~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~i~~~~~ 201 (390) T protein:vir:40 124 TATTEW--IISVGDVATAWWGPLCAEIKEVLDNGFDKIQTGMYKLSAYIPVCNAMLDLGPSWLDQYVRTILGEAMALGLE 201 (390) T ss_pred CCceeE--EEEEcCCcceeeeccccccCccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHH Confidence 776555 445667788999999998887778999999999999999999999999999999999999999999999999 Q ss_pred HHHhhcccccccccccccccccc---ccccccchhhHHHHHHHHHHhhhh-------ccCCCEEEEcHHHH-H---HHHH Q lcl|NC_012784. 241 KAIIDVITKGSTGSTSSGFEKEG---KKLEVKKAKSLDDIKDAINLNVKP-------NYEHNVAIVSQTMF-A---KLDK 306 (415) Q Consensus 241 ~~il~g~g~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~v~~~~~~-~---~l~~ 306 (415) .+|++|+|++.|.++........ .........++.+..++...+... +..+++|+|||.++ . .++. T Consensus 202 ~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~~~~~ 281 (390) T protein:vir:40 202 AGIVNGSGKDQPIGMMRDLNNVTAGEHPVKTATPLTDLTPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIYAATS 281 (390) T ss_pred hhhhcccCCCccceeeeccccccccccccccccccchhhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHHHHhh Confidence 99999999988776654322111 111122233444555555444433 45688999999874 3 4457 Q ss_pred hhccCCcccccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeeccc---CceEEEEE Q lcl|NC_012784. 307 MKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMH---FGECLMIA 383 (415) Q Consensus 307 lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~---~~~~~~~~ 383 (415) ++|++|+|+|.. .++|+||+++++||.+ .++||||++ |++++|++++++++++.. +.+.+|++ T Consensus 282 ~~d~~G~~v~~~--------~~~g~pvv~~~~~p~~-----~i~~Gd~s~-~~i~~~~~~~v~~~~~~~f~~~~~~~r~~ 347 (390) T protein:vir:40 282 YMTPQGVWVTGI--------LPVPLEIVQSVAVPVG-----KAVAGRAKD-YFMGIGSEQVIRTSTEYRLLDDETLYYAK 347 (390) T ss_pred ccCCCCcccccc--------CCCceeEEEcCCCCCC-----cEEEEeece-EEEEeecceEEEecchhhhhcCcEEEEEE Confidence 999999999743 3479999999999864 389999997 678899999999988654 45679999 Q ss_pred EEeccEEeccccEEEEEeecCCC--CcccccccC Q lcl|NC_012784. 384 VRQDCRILDYKSAIVIEYDDSER--GEGDLGLEA 415 (415) Q Consensus 384 ~r~d~~v~~p~a~~~~~~t~~~~--~~~~~~~~~ 415 (415) .|+|+++++|+||+.+++++... .-..++++- T Consensus 348 ~r~dg~v~~~~A~~~l~~~~~~~~~~~~~~~~~~ 381 (390) T protein:vir:40 348 QYANGRPKDNSSFLVFDITGLEGSPAIDVNVVNN 381 (390) T ss_pred EEeCCEEecccceEEEEeeccCCCCCCCcceeeC Confidence 99999999999999999998642 333444433 No 62 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=100.00 E-value=3.3e-56 Score=324.76 Aligned_cols=350 Identities=12% Similarity=0.100 Sum_probs=233.8 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccch Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) |+.+++++++++++ +++++.+++++++++.................. T Consensus 1 ~eei~~l~~~~~~l---------------------------------~~~~~~l~~~~d~~e~e~~~~~~~~~~~~~~~~ 47 (352) T protein:vir:78 1 MEDIKQLETEKAGL---------------------------------QQRFNIVERQVQDIEEKEKAKVKDKGEAYQSLN 47 (352) T ss_pred ChhHHHHHHHHHHH---------------------------------HHHHHHHHHHHHHHHHHHHHHhhhccccccccc Confidence 44444444444433 222222222222221111111111111111000 Q ss_pred hhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEc Q lcl|NC_012784. 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~ 160 (415) ...+.. ..+....+............... .......+..+.++||++||+++.++|++.++.++++++++++.++ T Consensus 48 ~~~~~~---~~~~~~~r~~~~~~~~~~~~~~~--~~~~~al~~~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~v~~~ 122 (352) T protein:vir:78 48 DNEKLV---KAKAEFYRHAILPNEFEKPSMEA--QRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNI 122 (352) T ss_pred hhhhHH---HHHHHHHHHHhhhhHHHHHHhhH--HHHHHHhccCCCCCCceeccHhHHHHHHHHHHhhcchhhheeeEec Confidence 011100 01111111111111111111111 1112233445677889999999999999999999999999999877 Q ss_pred cCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) Q Consensus 161 ~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d 240 (415) ++. .++.. ..+.+.+.|++|++.+|++ .++|++|++.+++++++++||+|+|+|+.++|++||.++|+++++++++ T Consensus 123 ~~~--~~p~~-~~~~~~a~~v~E~~~~~~~-~~~f~~v~~~~~k~~~~i~is~ell~Ds~~~l~~~i~~~la~~~~~~e~ 198 (352) T protein:vir:78 123 KGL--EIPRV-SYTLDDDDFITDVETAKEL-KLKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKER 198 (352) T ss_pred CCc--eEEEE-ecCCCcccccccccccccc-cccceeeeecceeEEeechhhHHHHhhhhHHHHHHHHHHHHHHHHHHHH Confidence 543 34433 2344678999999999875 6999999999999999999999999999999999999999999999865 Q ss_pred H-HHhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCc Q lcl|NC_012784. 241 K-AIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPD 319 (415) Q Consensus 241 ~-~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~ 319 (415) . .|.+|+|++.+.++... ......++...+|+++++++.+..+|+.+++|+||+.++..|.+++|.+|+|+|. T Consensus 199 ~~~~~~g~g~~~~~g~l~~----~~~~~~t~~~~~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~~~~-- 272 (352) T protein:vir:78 199 KDALAVSPKSGLEHMSFYN----GSVKEVEGANMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD-- 272 (352) T ss_pred HhhhhcCCCCcccccceec----cccccccccchHHHHHHHHhccChhhhcCCEEEEehHHHHHHHHHHhccCCcccc-- Confidence 5 45566666655443322 2222334555689999999999999999999999999999999999999999985 Q ss_pred ccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEe-ecccCceEEEEEEEeccEEeccccEEE Q lcl|NC_012784. 320 VKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWT-DYMHFGECLMIAVRQDCRILDYKSAIV 398 (415) Q Consensus 320 ~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~-~~~~~~~~~~~~~r~d~~v~~p~a~~~ 398 (415) +.|.+|+|+||++++.++ .++||||+.+|..+ .++.++.. +...+.+.+++..|+|+++++|+||+. T Consensus 273 ---~~~~~llG~PV~~~~~~~-------~~~~Gdf~~~~~~~--~~~~~~~~~~~~~g~~~f~~~~r~Dg~~~~~eA~~~ 340 (352) T protein:vir:78 273 ---TPAEKVFGKPVVFTDAAV-------KPIVGDFNYFGINY--DGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRI 340 (352) T ss_pred ---cCCccccccceEEecCCC-------ceeEeehhhhhhhh--hhheeeeeccccCCeeEEEEEeeeCceeechhheEE Confidence 335689999999998654 37899999876654 34555433 334556789999999999999999999 Q ss_pred EEeecCCCCccc Q lcl|NC_012784. 399 IEYDDSERGEGD 410 (415) Q Consensus 399 ~~~t~~~~~~~~ 410 (415) +++++++...-. T Consensus 341 l~~~a~~~~~~~ 352 (352) T protein:vir:78 341 AKAKESTGSLPS 352 (352) T ss_pred EEeecccCCCCC Confidence 999987766655 No 63 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=100.00 E-value=5e-55 Score=318.26 Aligned_cols=393 Identities=13% Similarity=0.114 Sum_probs=240.5 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhch----HHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcc Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNN----DEL----EKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQ 72 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e----~~~----~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~ 72 (415) ...+++|++++.+++++ .+++.+.+.+ ++. ++++++++++.+++++++.+++++++++.......... T Consensus 16 ~~~l~el~e~~~~l~k~----~~el~~~l~ea~~~ee~~~~ee~i~~l~~~~~el~e~~~~l~~ei~~le~el~e~~~~~ 91 (466) T protein:vir:80 16 KAALAELLEQEKALQKR----SEELEAAIDEANTDEEIAVVEDEINKLEGEKTELEEKKSKLEGEIKELENELEQLNNKE 91 (466) T ss_pred HHHHHHHHHHHHHHHHH----HHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 22222233333222222 2222222221 111 23344444455555555555554444333322221111 Q ss_pred ccccccchhhhhhHHHH--------HHHHHHHHHhhhhhH-HHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHH Q lcl|NC_012784. 73 QSVEVNEARTYRNQANI--------NDLGISIQNTKVTSQ-EVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILK 143 (415) Q Consensus 73 ~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~ 143 (415) ................. ............... ....................+.++++++||+++.+.|++ T Consensus 92 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~vP~~~~~~i~~ 171 (466) T protein:vir:80 92 PKNNSEPAQVSGARTQQFVGGETRMKGFFRNMPYEQRAALIARSEVKEFLAQVRTLAQQKRAVSGAELTIPDVMLELLRD 171 (466) T ss_pred hccCchhHHHHhhhhhHHhhHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhccccccccHHHHHHHHH Confidence 11100000000000000 000000000000000 000000011111111222334566778999999999999 Q ss_pred HHhhhhhhhhcceeEEccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHH Q lcl|NC_012784. 144 LKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNV 223 (415) Q Consensus 144 ~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l 223 (415) .+++++++++++++.++++ ...++.....+.+.|++|++.+|++ +++|++|++.+|+++++++||+|||+|+.+++ T Consensus 172 ~l~~~~~l~~~~~v~~~~g---~~~~~~~~~~~~a~wv~E~~~~~~~-~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l 247 (466) T protein:vir:80 172 NMHRYSKLISKVRLRPLKG---TARQNIAGAIPEGVWTEAVANLNEL-SLSFSQIEVDGYKVGGFIPIPNSTLEDSDLNL 247 (466) T ss_pred hhhhhhhhhhheeeeecCc---eeEeeeecCCcceeecccccccccc-cccccceeecceeeeeehhhhHHHHhcchHHH Confidence 9999999999999988764 3456666777889999999999975 59999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccccccccccccccccccccc------ccc-----------------hhhHHHHHHH Q lcl|NC_012784. 224 LQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLE------VKK-----------------AKSLDDIKDA 280 (415) Q Consensus 224 ~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~------~~~-----------------~~~~~~~~~~ 280 (415) ++||..+|+++++.+++.+||+|+|++.|.|+............ ... ...+.++... T Consensus 248 ~~~i~~~la~~~~~~~~~ail~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 327 (466) T protein:vir:80 248 ADEILDAIGQAIGFALDKAILYGTGTKMPVGIVTRLAQTTQPPNWGTKAPAWTNLSTTNLLKIDPTGKSAEEFFSELVLK 327 (466) T ss_pred HHHHHHHHHHHHHHHHhhheeeccCCCCcceeeecccccccccccccccccccccchhhhhhhhhhccchhhHHHHHHHH Confidence 99999999999999999999999999988765432211110000 000 0112222222 Q ss_pred HHHhhhhccC-CCEEEEcHHHHHHHHHhh---ccCCcccccCcccCCCCceecceeeEEeccccccccCCceEEEechhh Q lcl|NC_012784. 281 INLNVKPNYE-HNVAIVSQTMFAKLDKMK---DKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKD 356 (415) Q Consensus 281 ~~~~~~~~~~-~~~~v~~~~~~~~l~~lk---d~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~ 356 (415) +......+.. +..|+||+.++..|..++ +.+|.+++.+. + ...|+|+||+++++||.+. +++|||+. T Consensus 328 ~~~~~~~~~~~~~~w~~~~~~~~~l~~~~~~~~~~g~~~~~~~--~--~~~i~G~pvv~s~~~~~~~-----~~~g~~~~ 398 (466) T protein:vir:80 328 LSKARANYSNGMKFWAMSSNTHAVLMSKAITFNSAGALVASLN--N--TMPIVGGDIVILDFIPDND-----IIGGYGSL 398 (466) T ss_pred HHhhhccccCCceeEEecchhHHHhhcccccccCCccccccCC--C--cccccccceeecCccCccc-----eeeecccc Confidence 3333344444 456999999999999887 67777776432 2 2359999999999998754 79999986 Q ss_pred cEEEEeecceEEEEeeccc---CceEEEEEEEeccEEeccccEEEEEeecCCCCcccccccC Q lcl|NC_012784. 357 AIVLFDRSQYQASWTDYMH---FGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) Q Consensus 357 ~~~~~~~~~~~i~~~~~~~---~~~~~~~~~r~d~~v~~p~a~~~~~~t~~~~~~~~~~~~~ 415 (415) |.+++|+++++..+++.. +.+.+|++.|+||+|++|+||+.++++.. +++++++ T Consensus 399 -y~i~~r~~~~i~~~~~~~f~~d~~~~r~~~r~dg~~~~~~afv~~~~~~~----~~~~~~~ 455 (466) T protein:vir:80 399 -YLLAERADIKLAQSEHVRFIEDQTVFKGTARYDGKPVFGEGFVAVNIANA----NPTTSIT 455 (466) T ss_pred -EEEEeecceEEEechhhhhhcCcEEEEEEEEEccEEeccCceEEEEecCC----Cccccee Confidence 678999999999988765 45679999999999999999999998763 2233322 No 64 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=100.00 E-value=2.3e-56 Score=325.58 Aligned_cols=282 Identities=11% Similarity=0.064 Sum_probs=240.6 Q ss_pred hhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCcccccccccccccccccccc Q lcl|NC_012784. 116 RNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPF 195 (415) Q Consensus 116 ~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f 195 (415) ......+..+.+.++++||++++++|++.+++.+++++++++++++++...+++. +.+.+.|++|++.+|++ .++| T Consensus 1 ~g~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~---~~~~a~~v~E~~~~~~~-~~~f 76 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKAVPMTKPEEEFTFM---SGVGAFWVDEAERIQTS-KPTF 76 (299) T ss_pred CCcCCCcccccCCCceecchhHHHHHHHHHHhcchhhhhceeeecCCCcEEEEEE---cCCceeeeecCcccccc-ccce Confidence 2222333445567788999999999999999999999999999998777665543 35778999999999975 5999 Q ss_pred eeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhhHH Q lcl|NC_012784. 196 FQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLD 275 (415) Q Consensus 196 ~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 275 (415) +++++.++|++++++||+|+++|+.+++++||.++|++++++++|.+|++|+|++.+.++.... .........+..+++ T Consensus 77 ~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~~~gil~~~-~~~~~~~~~~~~~~~ 155 (299) T protein:vir:41 77 TKAKMRSKKMGVIIPTTKENLNYSVTNFFSLMQAEIVEAFYKKFDQAVFTGVESPYNWNILKSA-TDASNLVEETANKYD 155 (299) T ss_pred eEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccccccc-cccceeeccccccHH Confidence 9999999999999999999999999999999999999999999999999999998877665432 223333445567799 Q ss_pred HHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCcccCCCCceecceeeEEeccccccccCCceEEEechh Q lcl|NC_012784. 276 DIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLK 355 (415) Q Consensus 276 ~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~ 355 (415) ++++++.++...++.+++|+|||.+|.+|++++|++|+|+|.+++.++. .+|+|+||++++++|.+. ++..++||||+ T Consensus 156 ~l~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~-~~l~G~PV~~~~~~~~~~-~~~~~~~gdfs 233 (299) T protein:vir:41 156 DLNEAIGLIEAEDLEPNGIATIRKQRVKYRSTKDGNGMPIFNTATSNGV-DDVLGLPIAYTPKYTFGD-KDISELVGDWN 233 (299) T ss_pred HHHHHHHhhhcccCCcCEEEEcHHHHHHHHHhhccCCceeecCCcCCCC-ceecceeeEEecccCCCC-CceEEEEEecc Confidence 9999999999999999999999999999999999999999998877654 589999999999999654 56679999998 Q ss_pred hcEEEEeecceEEEEeeccc-----------------CceEEEEEEEeccEEeccccEEEEEeecCC Q lcl|NC_012784. 356 DAIVLFDRSQYQASWTDYMH-----------------FGECLMIAVRQDCRILDYKSAIVIEYDDSE 405 (415) Q Consensus 356 ~~~~~~~~~~~~i~~~~~~~-----------------~~~~~~~~~r~d~~v~~p~a~~~~~~t~~~ 405 (415) + +.++++++++++.+++.+ +...+|++.|+|+++.+|+||++++..++- T Consensus 234 ~-~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~~l~~~aa~ 299 (299) T protein:vir:41 234 Q-AYYGILRGVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFEVGFMVVKDEAFSAVQPKAGN 299 (299) T ss_pred c-EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccCC Confidence 7 567889999999877532 345689999999999999999999988877 No 65 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=100.00 E-value=6.4e-56 Score=323.16 Aligned_cols=353 Identities=12% Similarity=0.050 Sum_probs=248.0 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccch Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) |.-..+.++++.+.++++.+..++.. ..+++. +.+.+.++.+.+++.+..+... +. ...... ..... T Consensus 1 M~i~~k~~~~~~~~~~~l~~~~~~~~--~~ee~~---~~~~~~~~~~~~~~~~~~~~e~--~~--~~~~~~-~~~~l--- 67 (377) T protein:vir:98 1 MAINLKELPKYREAVAELSAKISAGA--TSEEQE---KLFEAAFTTMGDEILAKNEEEM--ER--MFDLRD-KNREL--- 67 (377) T ss_pred CCCcHHHHHHHHHHHHHHHHHHHhhh--hhHHHH---HHHHHHHHhHHHHHHHHHHHHH--HH--HHHhcc-CCccc--- Confidence 44433333333333333322222111 112222 2222334444444433221110 00 000000 00000 Q ss_pred hhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEc Q lcl|NC_012784. 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~ 160 (415) ...+.+.+. ......+.+++|++||+++.+.|++.+...++++++|++.++ T Consensus 68 ---------------------t~ee~~~~~--------~~~~~~~~~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~v~~~ 118 (377) T protein:vir:98 68 ---------------------TAEEIKFFN--------DIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNT 118 (377) T ss_pred ---------------------CHHHHHHHH--------HHHhccCCCCCccccCHHHHHHHHHHHHHhhhhhhheeeEec Confidence 001111111 111234667889999999999999999999999999999887 Q ss_pred cCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) Q Consensus 161 ~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d 240 (415) ++ .+.+++..+.+.+.|++|+++.++.++++|+++++.+||++++++||+|||+|+.+++++||.++|+++++++++ T Consensus 119 ~~---~~~~~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~a~~~is~elL~ds~~~ie~~i~~~la~~~a~~~~ 195 (377) T protein:vir:98 119 SL---RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWIKQFITEQLKEAIAVALE 195 (377) T ss_pred Cc---ceEEEEecCCcceeEeecccccCcccCccceeEeecceeEEeeecccHHhhhccHhHHHHHHHHHHHHHHHHHHh Confidence 53 345667778889999999988776678999999999999999999999999999999999999999999999999 Q ss_pred HHHhhccccccccccccccccccc-----cccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCccc Q lcl|NC_012784. 241 KAIIDVITKGSTGSTSSGFEKEGK-----KLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYL 315 (415) Q Consensus 241 ~~il~g~g~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l 315 (415) .+|++|+|++.|.++......... ....+.....+.+.++...+...++.+++|+||+.++..++++||.+|+|+ T Consensus 196 ~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~a~~~m~~~t~~~~~klkd~~G~~i 275 (377) T protein:vir:98 196 LAIVKGDGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLTPDNAPKKLVPVMKHLSVNDKKRPLKIAGQVK 275 (377) T ss_pred hceEeccCCCcceeeeecccccccccccccccccccchhhhHhhhhhhchhHHHHHHHHHHHHHHHHHHhhhhccCCceE Confidence 999999999988887653322111 111112233467788888888889999999999999999999999999999 Q ss_pred ccCcc--------------cCCCCceeccee--eEEeccccccccCCceEEEechhhcEEEEeecceEEEEeeccc---C Q lcl|NC_012784. 316 IQPDV--------------KEKTQQRLLGAK--IEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMH---F 376 (415) Q Consensus 316 ~~~~~--------------~~~~~~~l~G~p--V~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~---~ 376 (415) |..++ .+|.+.+++|+| |+.+++||.+ .++||||++ |.+++|++++++.+++.. + T Consensus 276 ~~~n~~~~~~~~p~~~~~~~~G~~~t~lg~p~~vv~s~~~p~~-----~i~fgdf~~-Y~i~~r~~~~i~~~~~~~~~~d 349 (377) T protein:vir:98 276 LILNPEDRWALEAQFTSRNQFGEYVTVLPHGITILESLAVETG-----KAIAFVANR-YDAFMATASTIEEYDQTFAMED 349 (377) T ss_pred EEecccchhhccccccccCCCCccccccCCCceEEecCCCCcc-----cEEEEEecc-eeEEeecceEEEeechhhhhcC Confidence 95333 235566899998 4566677754 379999998 788999999999988764 4 Q ss_pred ceEEEEEEEeccEEeccccEEEEEeecC Q lcl|NC_012784. 377 GECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) Q Consensus 377 ~~~~~~~~r~d~~v~~p~a~~~~~~t~~ 404 (415) ++.+++..|+||++++|+||++++++.- T Consensus 350 ~~~f~~~~r~dg~~~~~~a~~vl~i~~~ 377 (377) T protein:vir:98 350 LQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred ceEEEEEEEEcCEEeccCcEEEEEEecC Confidence 6779999999999999999999998866 No 66 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=100.00 E-value=1.3e-55 Score=321.43 Aligned_cols=280 Identities=15% Similarity=0.082 Sum_probs=232.0 Q ss_pred cccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCcccccccccccccccccccceeeEee Q lcl|NC_012784. 122 GSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYD 201 (415) Q Consensus 122 ~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~ 201 (415) ...+.+.+|+++|++++.+|++.+++.+++++++++++++++...+| +.++++.+.|++|++.+|++ .++|++++++ T Consensus 1 ma~~t~~~G~lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~~p--~~~~~~~a~wv~Eg~~~~~s-~~~f~~v~l~ 77 (300) T protein:vir:95 1 MSEAQLSKGNLFNPELVTKVINKVKGHSSIAKLSPQKPIPFNGQREF--VFDFDSDIDIVAENGKKTHG-GVSLDPVTIV 77 (300) T ss_pred CcccccCCcceechhhHHHHHHHHHhhhhhhhhcceeeccCCceEEE--EEecCcceEEeeCCcccccc-cccceeeEee Confidence 34455566789999999999999999999999999999887765555 55677889999999999964 6999999999 Q ss_pred eeeEEEeehhhHHHHh---cchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc----ccccc-cccccccccchhh Q lcl|NC_012784. 202 INTHRGYFRISREAIE---DAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGST----SSGFE-KEGKKLEVKKAKS 273 (415) Q Consensus 202 ~~k~a~~~~iS~e~l~---ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~----~~~~~-~~~~~~~~~~~~~ 273 (415) +||++++++||+|+++ |+.++++++|.++|++++++++|.++++|++.+.+.+. ..... .........+... T Consensus 78 ~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (300) T protein:vir:95 78 PLKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKKVTQTVPFKDTNP 157 (300) T ss_pred eEEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccccceeecccccch Confidence 9999999999999994 67789999999999999999999999999654333221 11111 1111223345667 Q ss_pred HHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCcccCCCCceecceeeEEeccccccccCC-ceEEEe Q lcl|NC_012784. 274 LDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGN-NTLIIG 352 (415) Q Consensus 274 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~-~~~~~g 352 (415) ++++.+++..+..+++.+++|+|||.++.+|++|||++|||||.+.+.++.+++|+|+||++++.+|.+..+. ..+++| T Consensus 158 ~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~G 237 (300) T protein:vir:95 158 DESMEDAVGMIDGSERDITGAILDPIFTTALSKMKNAEGGKLYPELAWGGVPDAINGLAVDKNRTVSYSQTDPKNTAIVG 237 (300) T ss_pred HHHHHHHHHHhhhcCCCccEEEECHHHHHHHHHhhccCCCeeccCccccCCCceecceeeEEecCCCCCCCCCccEEEEe Confidence 8999999999999999999999999999999999999999999888888889999999999999998765443 456789 Q ss_pred chhhcEEEEeecceEEEEeec-----------ccCceEEEEEEEeccEEeccccEEEEEeecC Q lcl|NC_012784. 353 NLKDAIVLFDRSQYQASWTDY-----------MHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) Q Consensus 353 d~~~~~~~~~~~~~~i~~~~~-----------~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~ 404 (415) ||++++.++.|++++++++++ .+++..+|+++|+|+++.+|+||++++-.+- T Consensus 238 Df~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~g 300 (300) T protein:vir:95 238 DFETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYIGWGIMDAASFARIVKTGG 300 (300) T ss_pred eccceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEeecceeecccceEEEecCCC Confidence 999888788899999988764 3445778999999999999999999974433 No 67 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=100.00 E-value=2.4e-55 Score=320.05 Aligned_cols=276 Identities=14% Similarity=0.061 Sum_probs=229.0 Q ss_pred ccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeee Q lcl|NC_012784. 125 KTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINT 204 (415) Q Consensus 125 ~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k 204 (415) -...+|+++|++++++|++.+++.+++++++++++++++...+ ++.++.+.+.|++|++++|++ .++|++++++++| T Consensus 1 ma~~gG~lvp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~i--p~~~~~~~a~~v~E~~~~~~~-~~~f~~v~l~~~k 77 (298) T protein:vir:16 1 MVLNKGTLFDPTLVTDLISKVAGKSSIARLSAQKPIPFNGEKV--FTFTMDSEIDVVAESGKKTHG-GVTLAPQTMVPIK 77 (298) T ss_pred CcccCcceechhHHHHHHHHHHhhhhhhhhcceeeccCCceEE--EEEecCcceEEecCCcccccc-ccceeEEEEeeee Confidence 3456688999999999999999999999999999988766554 456677889999999999965 5899999999999 Q ss_pred EEEeehhhHHHHh---cchHHHHHHHHHHHHHHHHHHHHHHHhhccccccc--ccccc--cccc---ccccccccchhhH Q lcl|NC_012784. 205 HRGYFRISREAIE---DAKVNVLQELKLWMARTIAATRNKAIIDVITKGST--GSTSS--GFEK---EGKKLEVKKAKSL 274 (415) Q Consensus 205 ~a~~~~iS~e~l~---ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~--~~~~~--~~~~---~~~~~~~~~~~~~ 274 (415) ++++++||+|+++ |+..+|++||.++|++++++++|.++++|++.+.. ..... .... ............+ T Consensus 78 ~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (298) T protein:vir:16 78 VEYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPN 157 (298) T ss_pred EEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcccccccccccccccccccccccccccHH Confidence 9999999999995 56679999999999999999999999999654332 21111 1111 1111222234457 Q ss_pred HHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCcccCCCCceecceeeEEecccccccc-CCceEEEec Q lcl|NC_012784. 275 DDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQK-GNNTLIIGN 353 (415) Q Consensus 275 ~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~-~~~~~~~gd 353 (415) +++.+++..+..+++.+++|+|||++|..|+++||++|||+|++.+..+.+++|+|+||++++.+|.... +...+++|| T Consensus 158 ~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~v~~~~~~~~~~~~~GD 237 (298) T protein:vir:16 158 GAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQDNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGD 237 (298) T ss_pred HHHHHHHHHhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCCceecceeeEEecccccccCCCccEEEEee Confidence 7899999999999999999999999999999999999999999989999999999999999999986543 345688899 Q ss_pred hhhcEEEEeecceEEEEeec-----------ccCceEEEEEEEeccEEeccccEEEEEeec Q lcl|NC_012784. 354 LKDAIVLFDRSQYQASWTDY-----------MHFGECLMIAVRQDCRILDYKSAIVIEYDD 403 (415) Q Consensus 354 ~~~~~~~~~~~~~~i~~~~~-----------~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~ 403 (415) |++++.++.+++++++++++ .++...+|++.|+|+++++|+||++++-.+ T Consensus 238 fs~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 238 FANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred ccceEEEEEecCceEEEeeccCCcCcchhhhhcCcEEEEEEEEEccEeecccceEEEeecC Confidence 99988888899999988764 234567899999999999999999998655 No 68 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=100.00 E-value=3e-55 Score=319.46 Aligned_cols=294 Identities=14% Similarity=0.066 Sum_probs=237.6 Q ss_pred HhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCccccccccccccccccc Q lcl|NC_012784. 113 LETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAV 192 (415) Q Consensus 113 ~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~ 192 (415) +..............++|.++|+++.++|++.+++.+++++++++++++++... +++....+.+.|++|++.+|++ . T Consensus 1 m~~~~~~a~~~~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~--~p~~~~~~~a~~v~Eg~~~~~~-~ 77 (330) T protein:vir:77 1 MAGSTVPSTQVALTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARKVPMGPTGIS--IPHWTGAVSASWTGEAERKPIT-K 77 (330) T ss_pred CcccccchhhccccCCCcceechhHHHHHHHHHHhccchhhhcceeeccCCceE--EEEEcCCcceeEecCCCccccc-c Confidence 223333333344455566678888999999999999999999999988766555 5555677889999999999975 6 Q ss_pred ccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccc--------cccc Q lcl|NC_012784. 193 KPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFE--------KEGK 264 (415) Q Consensus 193 ~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~--------~~~~ 264 (415) ++|++++++++|++++++||+|+++|+.+++++||.++|++++++++|.+||+|+|++.+..+..... .... T Consensus 78 ~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~~~~~~~~~~~~ 157 (330) T protein:vir:77 78 GSFGKQELEPVKITTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETTKVVSLADTNLT 157 (330) T ss_pred ceeeEEEEeEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccccccccccccceeeccccc Confidence 99999999999999999999999999999999999999999999999999999999887654332111 1112 Q ss_pred cccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCcccCC-----CCceecceeeEEeccc Q lcl|NC_012784. 265 KLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEK-----TQQRLLGAKIEILPDE 339 (415) Q Consensus 265 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~-----~~~~l~G~pV~~~~~~ 339 (415) .........++++.+++.++...+..+++|+|||++|..|+++||++|||+|+++...+ .+++|+|+||+++++| T Consensus 158 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~PV~~~~~~ 237 (330) T protein:vir:77 158 TASGPQGNAYLAVNNALSLLVNSGKKWTGTLLDNVTEPILNTAVDGNGRPLFVESTYTEQVGAIREGRILGRPTYVADNV 237 (330) T ss_pred ccccccchhHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHHhccCCceeecCccccccccccCCceecceeeEEeccc Confidence 22233455688999999999999999999999999999999999999999998765544 4569999999999999 Q ss_pred cccccCC-ceEEEechhhcEEEEeecceEEEEeecc---------------------cCceEEEEEEEeccEEeccccEE Q lcl|NC_012784. 340 VLGQKGN-NTLIIGNLKDAIVLFDRSQYQASWTDYM---------------------HFGECLMIAVRQDCRILDYKSAI 397 (415) Q Consensus 340 ~~~~~~~-~~~~~gd~~~~~~~~~~~~~~i~~~~~~---------------------~~~~~~~~~~r~d~~v~~p~a~~ 397 (415) |.++.++ ..++||||+++ .++++++++++.+++. ++.+.+|++.|+|+++.+|+||+ T Consensus 238 p~~~~~~~~~~~~gd~s~~-~i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~ 316 (330) T protein:vir:77 238 VNGTVGNRVVGVMGDFSQV-IWGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLWQHNMVAVRCEAEFAFMVNDKDAFV 316 (330) T ss_pred cCCCCCCccEEEEEecceE-EEEEecCcEEEEeecceeeecccccccccccccchhhcCcEEEEEEEEeccEEecccceE Confidence 9876544 45789999984 5788999999876642 34567899999999999999999 Q ss_pred EEEeec-CCCCccc Q lcl|NC_012784. 398 VIEYDD-SERGEGD 410 (415) Q Consensus 398 ~~~~t~-~~~~~~~ 410 (415) +++..+ .++|+-. T Consensus 317 ~i~~~~~~~~~~~~ 330 (330) T protein:vir:77 317 KLTDQVAGTDPEEE 330 (330) T ss_pred EEEeccCCcCCCCC Confidence 998764 4444444 No 69 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=100.00 E-value=7.6e-55 Score=317.29 Aligned_cols=281 Identities=13% Similarity=0.073 Sum_probs=235.1 Q ss_pred HhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCccccccccccccccccc Q lcl|NC_012784. 113 LETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAV 192 (415) Q Consensus 113 ~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~ 192 (415) +...........+++.+|++||+++.++|++.+++.+++++++++++++++..+ +++.++.+.+.|++|++.+|+. . T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~--ip~~~~~~~a~~v~E~~~~~~~-~ 77 (304) T protein:vir:10 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKK--FTYLAKGVGAYWVSETERIQTS-K 77 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceE--EEEEeCCcceEEeecCcccccc-c Confidence 333333344455667788999999999999999999999999999998776555 5555677889999999999975 6 Q ss_pred ccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc-cc---ccccccccc Q lcl|NC_012784. 193 KPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSS-GF---EKEGKKLEV 268 (415) Q Consensus 193 ~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~-~~---~~~~~~~~~ 268 (415) ++|++++++++|++++++||+|+++|+.++|++||.++|++++++++|.++++|+|++.+.+... +. ......... T Consensus 78 ~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~ 157 (304) T protein:vir:10 78 PEYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKGNVVT 157 (304) T ss_pred ceeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999877654322 11 112222233 Q ss_pred cchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCcccCCCCceecceeeEEeccccccccCCce Q lcl|NC_012784. 269 KKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNT 348 (415) Q Consensus 269 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~ 348 (415) .+...++++.+++.++..+++.+++|+|||++|..|++++|++|||+|.++ +++|+|+||++++++|... ++.. T Consensus 158 ~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~~-----~~~l~G~PV~~~~~~~~~~-~~~~ 231 (304) T protein:vir:10 158 DTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDAN-----GNEIMGLPLSYTGADVYDK-KKSL 231 (304) T ss_pred cccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecCC-----CccccceeeEEecccccCC-CCcE Confidence 455679999999999999999999999999999999999999999999753 4689999999999998643 4557 Q ss_pred EEEechhhcEEEEeecceEEEEeec-------------------ccCceEEEEEEEeccEEeccccEEEEEeec Q lcl|NC_012784. 349 LIIGNLKDAIVLFDRSQYQASWTDY-------------------MHFGECLMIAVRQDCRILDYKSAIVIEYDD 403 (415) Q Consensus 349 ~~~gd~~~~~~~~~~~~~~i~~~~~-------------------~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~ 403 (415) ++||||++ +.++++++++++.+++ .+++..+|+++|+|+++++|+||++++.+. T Consensus 232 ~~~gd~~~-~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 232 ALMGDWDY-ARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred EEEEehhh-EEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 89999997 5678899999987664 334567899999999999999999999777 No 70 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=100.00 E-value=7.6e-55 Score=317.29 Aligned_cols=281 Identities=13% Similarity=0.073 Sum_probs=235.1 Q ss_pred HhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCccccccccccccccccc Q lcl|NC_012784. 113 LETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAV 192 (415) Q Consensus 113 ~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~ 192 (415) +...........+++.+|++||+++.++|++.+++.+++++++++++++++..+ +++.++.+.+.|++|++.+|+. . T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~--ip~~~~~~~a~~v~E~~~~~~~-~ 77 (304) T protein:vir:94 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKK--FTYLAKGVGAYWVSETERIQTS-K 77 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceE--EEEEeCCcceEEeecCcccccc-c Confidence 333333344455667788999999999999999999999999999998776555 5555677889999999999975 6 Q ss_pred ccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc-cc---ccccccccc Q lcl|NC_012784. 193 KPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSS-GF---EKEGKKLEV 268 (415) Q Consensus 193 ~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~-~~---~~~~~~~~~ 268 (415) ++|++++++++|++++++||+|+++|+.++|++||.++|++++++++|.++++|+|++.+.+... +. ......... T Consensus 78 ~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~ 157 (304) T protein:vir:94 78 PEYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKGNVVT 157 (304) T ss_pred ceeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999877654322 11 112222233 Q ss_pred cchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCcccCCCCceecceeeEEeccccccccCCce Q lcl|NC_012784. 269 KKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNT 348 (415) Q Consensus 269 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~ 348 (415) .+...++++.+++.++..+++.+++|+|||++|..|++++|++|||+|.++ +++|+|+||++++++|... ++.. T Consensus 158 ~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~~-----~~~l~G~PV~~~~~~~~~~-~~~~ 231 (304) T protein:vir:94 158 DTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDAN-----GNEIMGLPLSYTGADVYDK-KKSL 231 (304) T ss_pred cccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecCC-----CccccceeeEEecccccCC-CCcE Confidence 455679999999999999999999999999999999999999999999753 4689999999999998643 4557 Q ss_pred EEEechhhcEEEEeecceEEEEeec-------------------ccCceEEEEEEEeccEEeccccEEEEEeec Q lcl|NC_012784. 349 LIIGNLKDAIVLFDRSQYQASWTDY-------------------MHFGECLMIAVRQDCRILDYKSAIVIEYDD 403 (415) Q Consensus 349 ~~~gd~~~~~~~~~~~~~~i~~~~~-------------------~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~ 403 (415) ++||||++ +.++++++++++.+++ .+++..+|+++|+|+++++|+||++++.+. T Consensus 232 ~~~gd~~~-~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 232 ALMGDWDY-ARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred EEEEehhh-EEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 89999997 5678899999987664 334567899999999999999999999777 No 71 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=100.00 E-value=1.7e-54 Score=315.41 Aligned_cols=279 Identities=14% Similarity=0.077 Sum_probs=230.8 Q ss_pred ccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCcccccccccccccccccccceeeEeee Q lcl|NC_012784. 123 SLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDI 202 (415) Q Consensus 123 ~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~ 202 (415) ..+.+.+|++||++++++|++.+++.++++++|+++++++++..+| +.++++.+.|++|++++|++ .++|+++++++ T Consensus 1 m~t~t~gg~liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip--~~~~~~~a~wv~E~~~~~~s-~~~f~~v~l~~ 77 (303) T protein:vir:97 1 MGTETSKASLFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNGSKEF--TFTLDSDIDVVAENGKKTHG-GLSLEPVTIVP 77 (303) T ss_pred CcccCCCCeEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEE--EEecCcceEEeecCcccccc-ccceeeEEeee Confidence 2355677899999999999999999999999999999987766655 55677889999999999964 68999999999 Q ss_pred eeEEEeehhhHHHH---hcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc-cc-----cccccccccccchhh Q lcl|NC_012784. 203 NTHRGYFRISREAI---EDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTS-SG-----FEKEGKKLEVKKAKS 273 (415) Q Consensus 203 ~k~a~~~~iS~e~l---~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~-~~-----~~~~~~~~~~~~~~~ 273 (415) ||+++++++|+|++ .|+.++|.+||.++|++++++++|.++++|++.+.+.+.. .+ ...........+... T Consensus 78 ~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (303) T protein:vir:97 78 IKVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNHFDSKVTQVVKFTESEDA 157 (303) T ss_pred EEEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCccccccccccccccccccccccccccch Confidence 99999999999999 4777899999999999999999999999997654332211 11 111222223345567 Q ss_pred HHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCccc-CCCCceecceeeEEeccccccc---cCCceE Q lcl|NC_012784. 274 LDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVK-EKTQQRLLGAKIEILPDEVLGQ---KGNNTL 349 (415) Q Consensus 274 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~-~~~~~~l~G~pV~~~~~~~~~~---~~~~~~ 349 (415) ++++.+++..+..+++.+++|+|||.++.+|+++||++|+|+|.++.. ++.+++|+|+||++++++|... .+...+ T Consensus 158 ~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~g~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~ 237 (303) T protein:vir:97 158 DANIEAAVNLIQGAEGVVTGLAMDTEFSTALAKVTNGEMGPKMYPELAWGANPDSINGLKSSVNTTVGAGADEAESKDLV 237 (303) T ss_pred HHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCCeEEecCccCCCCCceecceeeEEecccCCccccCCCccEE Confidence 899999999998999999999999999999999999999999988754 4456799999999999998643 234568 Q ss_pred EEechhhcEEEEeecceEEEEeec-----------ccCceEEEEEEEeccEEeccccEEEEEeecC Q lcl|NC_012784. 350 IIGNLKDAIVLFDRSQYQASWTDY-----------MHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) Q Consensus 350 ~~gd~~~~~~~~~~~~~~i~~~~~-----------~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~ 404 (415) +||||+..+.++.|++++++++++ .++...+|++.|+|++|++|+||++++-..- T Consensus 238 ~~Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~~~~ 303 (303) T protein:vir:97 238 IIGDFESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTKGEV 303 (303) T ss_pred EEeeccccEEEEEecCcEEEEeeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEeeCCCC Confidence 999998888888999999988764 2345678999999999999999999986655 No 72 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=100.00 E-value=2.4e-54 Score=314.52 Aligned_cols=276 Identities=14% Similarity=0.085 Sum_probs=228.3 Q ss_pred ccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeee Q lcl|NC_012784. 125 KTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINT 204 (415) Q Consensus 125 ~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k 204 (415) -...+|+++|+++.++|++.+++.++++++++++++++++.+ +++.++.+.+.|++||+++|+ +.++|+++++.++| T Consensus 1 ma~~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~--~p~~~~~~~a~~v~Eg~~~~~-~~~~f~~v~l~~~k 77 (298) T protein:vir:94 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEK--VFTFTMDSEIDVVAESGKKTH-GGVTLAPQTMVPIK 77 (298) T ss_pred CeeccccccChhHHHHHHHHHHhhchhhhhcceeeccCCceE--EEEEecCcceEEeeCCccccc-cccceeEEEEeeeE Confidence 344668899999999999999999999999999998876655 455567788999999999996 46899999999999 Q ss_pred EEEeehhhHHHHh---cchHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--ccc--cccccccc---cccccccchhhH Q lcl|NC_012784. 205 HRGYFRISREAIE---DAKVNVLQELKLWMARTIAATRNKAIIDVITKGS--TGS--TSSGFEKE---GKKLEVKKAKSL 274 (415) Q Consensus 205 ~a~~~~iS~e~l~---ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~--~~~--~~~~~~~~---~~~~~~~~~~~~ 274 (415) +++++++|+|+++ |+..+|+++|.++|++++++++|.++++|.+.+. +.. ........ ...........+ T Consensus 78 ~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (298) T protein:vir:94 78 VEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPN 157 (298) T ss_pred EEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccccccccccccccccccccccccccHH Confidence 9999999999996 5567899999999999999999999999854332 111 11111111 111222334457 Q ss_pred HHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCcccCCCCceecceeeEEecccccccc-CCceEEEec Q lcl|NC_012784. 275 DDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQK-GNNTLIIGN 353 (415) Q Consensus 275 ~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~-~~~~~~~gd 353 (415) +++.+++.++..++..+++|+|||++|.+|+++||++|||+|++.+.++.+++|+|+||++++.+|.+.. +...+++|| T Consensus 158 ~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~G~PV~~~~~v~~~~~~~~~~~~~Gd 237 (298) T protein:vir:94 158 GAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGD 237 (298) T ss_pred HHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCCceecceeeEEecccccccCCCccEEEEee Confidence 8999999999999999999999999999999999999999999989999999999999999999986543 345688999 Q ss_pred hhhcEEEEeecceEEEEeec-----------ccCceEEEEEEEeccEEeccccEEEEEeec Q lcl|NC_012784. 354 LKDAIVLFDRSQYQASWTDY-----------MHFGECLMIAVRQDCRILDYKSAIVIEYDD 403 (415) Q Consensus 354 ~~~~~~~~~~~~~~i~~~~~-----------~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~ 403 (415) |++++.++.++++++++.++ .++...+|++.|+|+++.+|+||++++-.+ T Consensus 238 fs~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 238 FANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred ccceEEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 99988788899999988764 234556899999999999999999998555 No 73 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=100.00 E-value=2.3e-52 Score=303.66 Aligned_cols=362 Identities=10% Similarity=0.008 Sum_probs=238.6 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccch Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) |+.+.++++++..+.+......+.+...... .++.+.+.+.++++..++....... .+.... . T Consensus 1 mt~~~~~~e~~~~~~e~~~~~~~~~~~~~~~--e~~~~~~~~~~~~~~~~~~~~~~~e--~~~~~~---~---------- 63 (395) T protein:vir:95 1 MADMKQNNVKLKNYHEHKKQFANLVQNGASD--EEQSKAFGAMFDALSNDLQEEITAE--INNRVV---D---------- 63 (395) T ss_pred ChhHHHHHHHHHHHHHHHHHHHHHHhhhhhH--HHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHH---H---------- Confidence 7776666666655533222211111111100 1112222333333333222211100 000000 0 Q ss_pred hhhhhHHHHHHHHHHHHH-hhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEE Q lcl|NC_012784. 81 RTYRNQANINDLGISIQN-TKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR 159 (415) Q Consensus 81 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~ 159 (415) ......+. ......+ +.+. ......+..+||++||+++.+.|++.++..++++++|++++ T Consensus 64 ----------~~~~~~r~~~~l~~ee-~~~~--------~~~~~~t~~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v~~ 124 (395) T protein:vir:95 64 ----------NGILAKRSQDPLTSEE-RKFF--------NDINYDVGYTDEKILPETVVERVFDDLQKDHPLLSKINFQN 124 (395) T ss_pred ----------HHHHhhcCccccchHH-HHHH--------HHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEe Confidence 00000000 0001111 1110 11223466788999999999999999999999999999998 Q ss_pred ccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 160 VTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATR 239 (415) Q Consensus 160 ~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~ 239 (415) +++ ...+++..+.+.+.|+.|+++.++.++++|+++++.+|+++++++||+|||+|+.+++++||.++|++++++++ T Consensus 125 ~~~---~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~iS~ell~ds~~~ie~~i~~~la~~ia~~~ 201 (395) T protein:vir:95 125 AGI---KTRVIKADPAGQAVWGKVFGEIKGQLDAAFREENFTQYKLTCFVVLPDDLSTFGPAWIERFVRTQIQEAISVAL 201 (395) T ss_pred cCC---ceEEEEecCCcceEEeecccccCccccccceeeeeceeeEEEeecccHHHHhcchhHHHHHHHHHHHHHHHHHH Confidence 753 35677778888999999887776667899999999999999999999999999999999999999999999999 Q ss_pred HHHHhhcccccc--ccccccccccccc---cccccchhhHHHHHHHHHHhhh--------------hccCCCEEEEcHHH Q lcl|NC_012784. 240 NKAIIDVITKGS--TGSTSSGFEKEGK---KLEVKKAKSLDDIKDAINLNVK--------------PNYEHNVAIVSQTM 300 (415) Q Consensus 240 d~~il~g~g~~~--~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~v~~~~~ 300 (415) |.+|++|+|++. |.++......... ........+++++......+.. .+..+..|+|||++ T Consensus 202 ~~a~i~G~G~~~~qP~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t 281 (395) T protein:vir:95 202 ESAIINGGGAAKTQPVGLMKDVNTNSGAVTDKASSGTLTFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRD 281 (395) T ss_pred hhheeeccCCCCcCceeeeecccccccccccccccchhhhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchh Confidence 999999999875 4444432221111 1111222233333333322221 34456789999998 Q ss_pred HHHHHHhhccCCcccccCcccCCCCceec--ceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeeccc--- Q lcl|NC_012784. 301 FAKLDKMKDKLGNYLIQPDVKEKTQQRLL--GAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMH--- 375 (415) Q Consensus 301 ~~~l~~lkd~~G~~l~~~~~~~~~~~~l~--G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~--- 375 (415) +. |.+|+|+|++ .+|.+.+++ |+||++++.||.+. ++||||++ |++++|++++++.+++.. T Consensus 282 ~~------~~~g~~~~~~--~~G~~~~~lg~g~~v~~~~~~p~~~-----i~fgdfs~-y~i~~r~~~~i~~~~~~~~~~ 347 (395) T protein:vir:95 282 SW------DVQARYTYLT--ANGGFVTVLPYNVTIITSEFVPEGK-----LVAFVTDR-YNAVRGGGLTVKKFDQTLALE 347 (395) T ss_pred hh------hcCCcceecc--CCCcceeccCCcceEEEcCCCCCCc-----EEEEeccc-EEEEEecceEEEeccchhhhC Confidence 64 5679999986 456677886 55588899998543 89999998 788999999999888654 Q ss_pred CceEEEEEEEeccEEeccccEEEEEeecCCCCcccccccC Q lcl|NC_012784. 376 FGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) Q Consensus 376 ~~~~~~~~~r~d~~v~~p~a~~~~~~t~~~~~~~~~~~~~ 415 (415) +.+.+|+..|+||++++|+||++++++.+..+..-.++-| T Consensus 348 d~~~f~~~~r~dg~~~~~~A~~~l~i~~~~~~~~~~~~~~ 387 (395) T protein:vir:95 348 DAVLFTAKTFAYGQPDDNKASAVYDLKVASAPRRQTSAGG 387 (395) T ss_pred CcEEEEEEEEECCEEeccccEEEEEeeccCCCCCCCCCCC Confidence 4567999999999999999999999997777776666666 No 74 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=100.00 E-value=1.1e-53 Score=310.82 Aligned_cols=279 Identities=13% Similarity=0.070 Sum_probs=225.8 Q ss_pred ccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCcccccccccccccccccccceeeEeee Q lcl|NC_012784. 123 SLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDI 202 (415) Q Consensus 123 ~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~ 202 (415) ..+.+.||+++|+++.+.|++.+++.+++++++++++++++..+ +++.++.+.+.|++||+.+|+ ++++|+++++.+ T Consensus 1 mat~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~~~--~p~~~~~~~a~wv~Eg~~~~~-~~~~f~~v~l~~ 77 (311) T protein:vir:81 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQ--YMTLTAPPRGEVVGEGAQKSE-STATFAPVTAIP 77 (311) T ss_pred CceecCCceEcchhHHHHHHHHHHhcchhhhhcceeecCCCceE--EEEEeCCceeEEeecCccccc-ccceeeEEEEee Confidence 34566679999999999999999999999999999998776555 555677889999999999996 569999999999 Q ss_pred eeEEEeehhhHHHHh---cchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc---ccccccccccc---cccccchhh Q lcl|NC_012784. 203 NTHRGYFRISREAIE---DAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTG---STSSGFEKEGK---KLEVKKAKS 273 (415) Q Consensus 203 ~k~a~~~~iS~e~l~---ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~---~~~~~~~~~~~---~~~~~~~~~ 273 (415) +|++++++||+|+++ |+..+|+++|.++|++++++++|.++++|++.+.+. +.......... ......... T Consensus 78 ~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~~~~~~~~~~ 157 (311) T protein:vir:81 78 RKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATP 157 (311) T ss_pred EEEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccccceeeeecccccchH Confidence 999999999999995 666789999999999999999999999997654432 22222212211 112222233 Q ss_pred HHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCcccCCCCceecceeeEEecccccc----------- Q lcl|NC_012784. 274 LDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLG----------- 342 (415) Q Consensus 274 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~----------- 342 (415) +.++.+++..+...++.+++|+|||.++.+|++|||++|+|+|.+.+..+.+++|+|+||++++.+|.. T Consensus 158 ~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~G~Pv~~~~~i~~~~~~~~~~~~~~ 237 (311) T protein:vir:81 158 DLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVY 237 (311) T ss_pred HHHHHHHHHHhhhcCCCceEEEEcHHHHHHHHhhhccCCCeeecCccccCCCceecceeEEecccccccccccccccchh Confidence 455666777776777788889999999999999999999999998888889999999999999988743 Q ss_pred --ccCCceEEEechhhcEEEEeecceEEEEeec----------ccCceEEEEEEEeccEEeccccEEEEEeecCC Q lcl|NC_012784. 343 --QKGNNTLIIGNLKDAIVLFDRSQYQASWTDY----------MHFGECLMIAVRQDCRILDYKSAIVIEYDDSE 405 (415) Q Consensus 343 --~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~----------~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~~ 405 (415) ..++..++||||++ +.+..+.+++++.+++ .++...+|++.|+|++|.+|+||++++-...+ T Consensus 238 ~~~~~~~~~~~gDfs~-~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~a~~~ 311 (311) T protein:vir:81 238 RTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) T ss_pred cccCCccEEEEEeccc-EEEEEeccceEEEeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 23345679999998 5566788999987754 23456789999999999999999999876655 No 75 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=100.00 E-value=3.9e-53 Score=307.88 Aligned_cols=304 Identities=13% Similarity=0.056 Sum_probs=234.7 Q ss_pred ccccchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHH-HhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhh Q lcl|NC_012784. 75 VEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEY-LETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDK 153 (415) Q Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~ 153 (415) ++.. .....+.+.|... .+..............+++++|+++.++|++.+++.+++++ T Consensus 1 ~~~~---------------------~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~ 59 (324) T protein:vir:97 1 MEQT---------------------QKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQ 59 (324) T ss_pred Cccc---------------------hhHHHHHHHHHHhhhhhhhhccccccccCCCcceechhHHHHHHHHHHhhcchhh Confidence 0000 0001111111111 11122223334455668889999999999999999999999 Q ss_pred cceeEEccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHH Q lcl|NC_012784. 154 YVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMAR 233 (415) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~ 233 (415) +++++++++++.. +++.++.+.+.|++|++.+|++ .++|+.++++++|++++++||+|+++|+.+++++||.++|++ T Consensus 60 ~~~~~~~~~~~~~--ip~~~~~~~a~~v~Eg~~~~~~-~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~ 136 (324) T protein:vir:97 60 LGKYEPMEGTEKK--FTFWADKPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAE 136 (324) T ss_pred hcceeeccCCceE--EEEEecCcceeEeccCcccccc-ccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHH Confidence 9999998866555 5555677889999999999875 689999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCc Q lcl|NC_012784. 234 TIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGN 313 (415) Q Consensus 234 ~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~ 313 (415) ++++++|++||+|+|++....+..... ........+..+++++.+++.++..+++.+++|+|||.+|..|++++|++|| T Consensus 137 aia~~~d~a~l~G~g~~~~~~gi~~~~-~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~g~ 215 (324) T protein:vir:97 137 AFYKKFDEAGILNQGNNPFGKSIAQSI-EKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETK 215 (324) T ss_pred HHHHHHHHHhhccCCCCccCccccccc-cccceeccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhcCCCc Confidence 999999999999999875444333222 2223344466789999999999999999999999999999999999999999 Q ss_pred ccccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecc-----------------cC Q lcl|NC_012784. 314 YLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM-----------------HF 376 (415) Q Consensus 314 ~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-----------------~~ 376 (415) |+|.+ +.+++|+|+||++++..+. ++..++||||++ ++++++++++++.+++. ++ T Consensus 216 ~~~~~----~~~~tl~G~PV~~~~~~~~---~~~~~~~gd~~~-~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d 287 (324) T protein:vir:97 216 ERIYD----RNSDTLDGLPVVNLKSSNL---KRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQD 287 (324) T ss_pred eeecC----CCCccccceeeEeecCCCC---CcceEEEEeccc-EEEEEecCcEEEEeecccccccccccccchhhhhcC Confidence 99864 3456899999998876543 344689999997 55778999999887652 34 Q ss_pred ceEEEEEEEeccEEeccccEEEEEeecCCCCcccccccC Q lcl|NC_012784. 377 GECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) Q Consensus 377 ~~~~~~~~r~d~~v~~p~a~~~~~~t~~~~~~~~~~~~~ 415 (415) .+.+|+++|+|+++.+|+||++++.+.+.+.. +-| T Consensus 288 ~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~----~~~ 322 (324) T protein:vir:97 288 MVALRATMHVALHIADDKAFAKLVPADKKTDS----VPG 322 (324) T ss_pred cEEEEEEEEeccEEecccceEEEEeccCCCCC----CCC Confidence 56789999999999999999999976543211 111 No 76 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=100.00 E-value=1.6e-53 Score=309.99 Aligned_cols=295 Identities=12% Similarity=0.031 Sum_probs=238.5 Q ss_pred HHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecC Q lcl|NC_012784. 95 SIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSE 174 (415) Q Consensus 95 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 174 (415) ..+.... ..+.+.....+.++++.+||+++.++|++.+++.++++++++++++++++..+ ++.++ T Consensus 1 ~~~~~~~-------------~~e~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~i--p~~~~ 65 (318) T protein:vir:24 1 MAAGTAF-------------AVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKI--PHWVG 65 (318) T ss_pred CCCCCCC-------------CHHHHHhhcccCcccceeechhHHHHHHHHHHhhchhhhhcceeeccCCceEE--EEEeC Confidence 0000000 11223334455666777899999999999999999999999999988766554 45567 Q ss_pred CcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Q lcl|NC_012784. 175 VAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGS 254 (415) Q Consensus 175 ~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~ 254 (415) .+.+.|++|++.+|+. +++|++++++++|+++++++|+|+++|+.++++++|.++|++++++++|.++++|+|++.+.+ T Consensus 66 ~~~a~~v~Eg~~~~~~-~~~f~~i~~~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~ 144 (318) T protein:vir:24 66 DVSAQWIGEGDMKPIT-KGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAMHGTDSPFPTY 144 (318) T ss_pred CcceEEecCCcccccc-ccceeEEEEeeEEEEEeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhhcccCCCCCcc Confidence 8889999999999875 689999999999999999999999999999999999999999999999999999999887766 Q ss_pred cccccccccccc-cccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCcccCCCC-----cee Q lcl|NC_012784. 255 TSSGFEKEGKKL-EVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQ-----QRL 328 (415) Q Consensus 255 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~-----~~l 328 (415) +........... ........+++.+++..+...++.+++|+|||++|..|+++||++|+|||.+++.++.+ .++ T Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i 224 (318) T protein:vir:24 145 IGQTTKAISIADTTGATTVYDQVAVNGLSLLVNDGKKWTHTLLDDITEPILNGAKDQNGRPLFIESTYGEAASPFRSGRI 224 (318) T ss_pred cccccccccccccccccchHHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhccCCceeecCccccCccccccCceE Confidence 554433222222 22233344667788888889999999999999999999999999999999988766654 468 Q ss_pred cceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecc-----------------cCceEEEEEEEeccEEe Q lcl|NC_012784. 329 LGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM-----------------HFGECLMIAVRQDCRIL 391 (415) Q Consensus 329 ~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-----------------~~~~~~~~~~r~d~~v~ 391 (415) +|+||++++.+|. ++..+++|||++ +.++++++++++.+++. ++.+.+|+.+|+|+++. T Consensus 225 ~g~pv~~~~~~~~---~~~~~~~gdfs~-~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~ 300 (318) T protein:vir:24 225 VARPTILSDHVVE---GTTVGFMGDFSQ-LIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCN 300 (318) T ss_pred EEEeeEEeCCCCC---CccEEEEeecce-EEEEEecCeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEEe Confidence 8999999988865 345679999997 55778999999877643 34567899999999999 Q ss_pred ccccEEEEEeecCCCCcc Q lcl|NC_012784. 392 DYKSAIVIEYDDSERGEG 409 (415) Q Consensus 392 ~p~a~~~~~~t~~~~~~~ 409 (415) +|+||++|+..++..++| T Consensus 301 ~~~a~~~i~~~~a~~~~~ 318 (318) T protein:vir:24 301 DAEAFVALTNVVSGGGEG 318 (318) T ss_pred cccceEEEEeeccCCCCC Confidence 999999999998888888 No 77 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=100.00 E-value=2.3e-53 Score=309.21 Aligned_cols=295 Identities=13% Similarity=0.029 Sum_probs=234.8 Q ss_pred HHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCcccccccccccccccc Q lcl|NC_012784. 112 YLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELA 191 (415) Q Consensus 112 ~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~ 191 (415) .....+.......+.+.++.++|+++..+|++.+++.++++++++++++++++.+ +++....+.+.|++|++.+|++ T Consensus 1 ~g~~~e~~~~~~~~t~~~~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~--ip~~~~~~~a~wv~Eg~~~~~s- 77 (397) T protein:vir:23 1 MGFSADHSQIAQTKDTMFTGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMGATGIV--IPHWTGDVSAQWIGEGDMKPIT- 77 (397) T ss_pred CCcCHHHHHHhhccCCCCccccchhHHHHHHHHHHhccchhhhcceeeccCCceE--EEEEcCCcceEEecCCcccccc- Confidence 1111112222233444444466777899999999999999999999998766555 5556778889999999999874 Q ss_pred cccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccch Q lcl|NC_012784. 192 VKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKA 271 (415) Q Consensus 192 ~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~ 271 (415) +++|+++++++||++++++||+|+++|+.+++++||+++|++++++++|+++|+|+|++.+....... .......... T Consensus 78 ~~~f~~v~l~~~k~~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~~~~~~~~~~--~~~~~~~~~~ 155 (397) T protein:vir:23 78 KGNMTKRDVHPAKIATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAALHGTNAPSAFQGYLDQ--SNKTQSISPN 155 (397) T ss_pred ccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccCCccccccccc--ccceeeeccc Confidence 69999999999999999999999999999999999999999999999999999999987664443322 2223333455 Q ss_pred hhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCcccCCCC-----ceecceeeEEeccccccccCC Q lcl|NC_012784. 272 KSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQ-----QRLLGAKIEILPDEVLGQKGN 346 (415) Q Consensus 272 ~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~-----~~l~G~pV~~~~~~~~~~~~~ 346 (415) ..+++++++...+..+++.+++|+|||+++..|+++||++|||+|+++...+.+ ++|+|+||++++++|.+ + T Consensus 156 ~~~~~~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~tl~G~Pv~~s~~~~~g---~ 232 (397) T protein:vir:23 156 AYQGLGVSGLTKLVTDGKKWTHTLLDDTVEPVLNGSVDANGRPLFVESTYESLTTPFREGRILGRPTILSDHVAEG---D 232 (397) T ss_pred chhHHHHHHHHhhhhcccCCCEEEEcHHHHHHHHHhhccCCceeecccccccccccccCceeeeeeEEEeCCCCCC---c Confidence 667888999999999999999999999999999999999999999987666544 58999999999999854 4 Q ss_pred ceEEEechhhcEEEEeecceEEEEeecc-----------------cCceEEEEEEEeccEEeccccEEEEEeecC----- Q lcl|NC_012784. 347 NTLIIGNLKDAIVLFDRSQYQASWTDYM-----------------HFGECLMIAVRQDCRILDYKSAIVIEYDDS----- 404 (415) Q Consensus 347 ~~~~~gd~~~~~~~~~~~~~~i~~~~~~-----------------~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~----- 404 (415) ..+++|||+++ .+.+++++.++.+++. ++...+|++.|+|+++++|+||+.++.++. T Consensus 233 ~~~~~gDfs~~-~i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~~~~~~~ 311 (397) T protein:vir:23 233 VVGYAGDFSQI-IWGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDPVLTTYA 311 (397) T ss_pred eEEEEeecceE-EEEEEeceEEEEeeeeeeeeccccccceeeeeeccceeEEEEeeeccceecccceEEEeeccccceee Confidence 56789999985 4678899999877643 345678999999999999999999997543 Q ss_pred ----CCCcccccccC Q lcl|NC_012784. 405 ----ERGEGDLGLEA 415 (415) Q Consensus 405 ----~~~~~~~~~~~ 415 (415) +..+|.|+++- T Consensus 312 ~~~~~~~~~~~~~~~ 326 (397) T protein:vir:23 312 LDLDGASAGNFTLSL 326 (397) T ss_pred ecccccCcceEEEEe Confidence 33455555443 No 78 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=100.00 E-value=6.6e-53 Score=306.67 Aligned_cols=304 Identities=14% Similarity=0.070 Sum_probs=235.7 Q ss_pred ccccchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHH-HhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhh Q lcl|NC_012784. 75 VEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEY-LETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDK 153 (415) Q Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~ 153 (415) ++. ..........+... .+..............++++||+++.+.|++.+++.+++++ T Consensus 1 ~~~---------------------~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~ 59 (324) T protein:vir:96 1 MEQ---------------------TQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQ 59 (324) T ss_pred CCc---------------------chhhhHHHHHHHHHhhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhh Confidence 000 00001111111111 11112223334456677889999999999999999999999 Q ss_pred cceeEEccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHH Q lcl|NC_012784. 154 YVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMAR 233 (415) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~ 233 (415) +++++++++++.++ ++..+.+.+.|++|++.+|++ .++|+++++.++|++++++||+|+++|+.+++++||.++|++ T Consensus 60 l~~~~~~~~~~~~~--p~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ 136 (324) T protein:vir:96 60 LGKYEPMEGTEKKF--TFWADKPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAE 136 (324) T ss_pred hcceeeccCCceEE--EEEecCcceeEecCCcccccc-ccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHH Confidence 99999988766554 455677889999999999975 689999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCc Q lcl|NC_012784. 234 TIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGN 313 (415) Q Consensus 234 ~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~ 313 (415) ++++++|.++|+|+|++....+...... .......+..+++++.+++.++...++.+++|+|||++|..|++++|++|+ T Consensus 137 ai~~~~d~a~l~G~g~~~~~~gi~~~~~-~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~d~~G~ 215 (324) T protein:vir:96 137 AFYKKFDEAGILNQGNNPFGKSIAQSIE-KTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETK 215 (324) T ss_pred HHHHHHHHHHhccCCCCCcCcccccccc-ccceeccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCC Confidence 9999999999999987755443332222 223334466779999999999999999999999999999999999999999 Q ss_pred ccccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecc-----------------cC Q lcl|NC_012784. 314 YLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM-----------------HF 376 (415) Q Consensus 314 ~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-----------------~~ 376 (415) |++.+ +.+++|+|+||++++.++. ++..+++|||++ +.++++++++++.+++. ++ T Consensus 216 ~~~~~----~~~~~l~G~PV~~~~~~~~---~~~~~~~gd~~~-~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d 287 (324) T protein:vir:96 216 ERIYD----RNSDSLDGLPVVNLKSSNL---KRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQD 287 (324) T ss_pred eeecC----CCCCcccceeeEeeCCCCC---CcceEEEEecce-EEEEEecCcEEEEeecccccccccccccchhhhhcC Confidence 99853 3467899999998876543 445689999997 55788999999887652 34 Q ss_pred ceEEEEEEEeccEEeccccEEEEEeec--CCCCcccc Q lcl|NC_012784. 377 GECLMIAVRQDCRILDYKSAIVIEYDD--SERGEGDL 411 (415) Q Consensus 377 ~~~~~~~~r~d~~v~~p~a~~~~~~t~--~~~~~~~~ 411 (415) .+.+|+++|+|+++.+|+||++++-.. +....|++ T Consensus 288 ~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~~~~~ 324 (324) T protein:vir:96 288 MVALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) T ss_pred cEEEEEEEEEccEEecccceEEEecccccCCCCCCCC Confidence 577899999999999999999998542 33455566 No 79 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=100.00 E-value=6.6e-53 Score=306.67 Aligned_cols=304 Identities=14% Similarity=0.070 Sum_probs=235.7 Q ss_pred ccccchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHH-HhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhh Q lcl|NC_012784. 75 VEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEY-LETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDK 153 (415) Q Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~ 153 (415) ++. ..........+... .+..............++++||+++.+.|++.+++.+++++ T Consensus 1 ~~~---------------------~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~ 59 (324) T protein:vir:78 1 MEQ---------------------TQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQ 59 (324) T ss_pred CCc---------------------chhhhHHHHHHHHHhhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhh Confidence 000 00001111111111 11112223334456677889999999999999999999999 Q ss_pred cceeEEccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHH Q lcl|NC_012784. 154 YVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMAR 233 (415) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~ 233 (415) +++++++++++.++ ++..+.+.+.|++|++.+|++ .++|+++++.++|++++++||+|+++|+.+++++||.++|++ T Consensus 60 l~~~~~~~~~~~~~--p~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ 136 (324) T protein:vir:78 60 LGKYEPMEGTEKKF--TFWADKPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAE 136 (324) T ss_pred hcceeeccCCceEE--EEEecCcceeEecCCcccccc-ccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHH Confidence 99999988766554 455677889999999999975 689999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCc Q lcl|NC_012784. 234 TIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGN 313 (415) Q Consensus 234 ~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~ 313 (415) ++++++|.++|+|+|++....+...... .......+..+++++.+++.++...++.+++|+|||++|..|++++|++|+ T Consensus 137 ai~~~~d~a~l~G~g~~~~~~gi~~~~~-~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~d~~G~ 215 (324) T protein:vir:78 137 AFYKKFDEAGILNQGNNPFGKSIAQSIE-KTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETK 215 (324) T ss_pred HHHHHHHHHHhccCCCCCcCcccccccc-ccceeccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCC Confidence 9999999999999987755443332222 223334466779999999999999999999999999999999999999999 Q ss_pred ccccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecc-----------------cC Q lcl|NC_012784. 314 YLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM-----------------HF 376 (415) Q Consensus 314 ~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-----------------~~ 376 (415) |++.+ +.+++|+|+||++++.++. ++..+++|||++ +.++++++++++.+++. ++ T Consensus 216 ~~~~~----~~~~~l~G~PV~~~~~~~~---~~~~~~~gd~~~-~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d 287 (324) T protein:vir:78 216 ERIYD----RNSDSLDGLPVVNLKSSNL---KRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQD 287 (324) T ss_pred eeecC----CCCCcccceeeEeeCCCCC---CcceEEEEecce-EEEEEecCcEEEEeecccccccccccccchhhhhcC Confidence 99853 3467899999998876543 445689999997 55788999999887652 34 Q ss_pred ceEEEEEEEeccEEeccccEEEEEeec--CCCCcccc Q lcl|NC_012784. 377 GECLMIAVRQDCRILDYKSAIVIEYDD--SERGEGDL 411 (415) Q Consensus 377 ~~~~~~~~r~d~~v~~p~a~~~~~~t~--~~~~~~~~ 411 (415) .+.+|+++|+|+++.+|+||++++-.. +....|++ T Consensus 288 ~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~~~~~ 324 (324) T protein:vir:78 288 MVALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) T ss_pred cEEEEEEEEEccEEecccceEEEecccccCCCCCCCC Confidence 577899999999999999999998542 33455566 No 80 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=100.00 E-value=1e-52 Score=305.59 Aligned_cols=302 Identities=14% Similarity=0.053 Sum_probs=232.2 Q ss_pred HHHhhhhhHHHHHHHHHHhh-hhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecC Q lcl|NC_012784. 96 IQNTKVTSQEVRDFTEYLET-RNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSE 174 (415) Q Consensus 96 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 174 (415) ...........+.|...... .............+++++|+++.++|++.+++.++++++++++++++++..+| +.++ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip--~~~~ 78 (324) T protein:vir:93 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFT--FWAD 78 (324) T ss_pred CchhHHHHHHHHHHHHhhhhhhhcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEE--EEec Confidence 00000011111112221111 12222333445556779999999999999999999999999999887765555 4567 Q ss_pred CcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Q lcl|NC_012784. 175 VAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGS 254 (415) Q Consensus 175 ~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~ 254 (415) .+.+.|++||+.+|+. .++|++++++++|++++++||+|+++|+.+++++||.++|++++++++|+++|+|+|++.... T Consensus 79 ~~~a~~v~Eg~~~~~~-~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~~~~ 157 (324) T protein:vir:93 79 KPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGK 157 (324) T ss_pred CcceeeecCCcccccc-ccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCc Confidence 7889999999999975 589999999999999999999999999999999999999999999999999999988765433 Q ss_pred cccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCcccCCCCceecceeeE Q lcl|NC_012784. 255 TSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIE 334 (415) Q Consensus 255 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~ 334 (415) +..... ........+..+++++++++..+..+++.+++|+|||++|..|++++|++|+|++.+ +.+++|+|+||+ T Consensus 158 ~~~~~~-~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~G~~~~~~----~~~~~l~G~PVv 232 (324) T protein:vir:93 158 SIAQSI-EKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLPVV 232 (324) T ss_pred cccccc-cccceeccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhCCCCCeeecC----CCCCcccceeeE Confidence 332222 222333445677999999999999999999999999999999999999999999863 346789999999 Q ss_pred EeccccccccCCceEEEechhhcEEEEeecceEEEEeecc-----------------cCceEEEEEEEeccEEeccccEE Q lcl|NC_012784. 335 ILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM-----------------HFGECLMIAVRQDCRILDYKSAI 397 (415) Q Consensus 335 ~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-----------------~~~~~~~~~~r~d~~v~~p~a~~ 397 (415) +++..+ .++..+++|||++ +.++.+++++++.+++. ++.+.+|++.|+|+++.+|+||+ T Consensus 233 ~~~~~~---~~~~~i~~gdfs~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~v~~~~a~~ 308 (324) T protein:vir:93 233 NLKSSN---LKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFA 308 (324) T ss_pred eecCCC---CCcceEEEEecce-EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceE Confidence 877554 3455689999997 55788999999987753 34577999999999999999999 Q ss_pred EEEeec----CCCCcc Q lcl|NC_012784. 398 VIEYDD----SERGEG 409 (415) Q Consensus 398 ~~~~t~----~~~~~~ 409 (415) +|+-.. +++|+- T Consensus 309 ~l~~a~~~~~~~~~~~ 324 (324) T protein:vir:93 309 KLVPADKRTDSVPGEV 324 (324) T ss_pred EEecccccCCCCCCCC Confidence 998332 222222 No 81 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=100.00 E-value=2.8e-52 Score=303.18 Aligned_cols=345 Identities=12% Similarity=0.031 Sum_probs=236.4 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccch Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) |....+-++++.+.++++.++.++..+ + .++.+++.+.++++.+++.+..+... +... ...... ... T Consensus 1 M~i~~~~~~~~~e~~~~l~~~~~~~~~---~--e~~~~~~~~~~~~~~~~~~~~~~~e~--~~~~--~~~~~~-~~l--- 67 (377) T protein:vir:96 1 MAINLKELPKYREAVAELSAKISAGAT---P--EEQEKLFEAAFTTMGDEILAKNEEEM--ERMF--DLRDKN-REL--- 67 (377) T ss_pred CCccHHHHHHHHHHHHHHHHHHhhccc---H--HHHHHHHHHHHHHHHHHHHHHHHHHH--HHHH--HhccCC-ccc--- Confidence 666655555555544444443332111 1 11222333344445544443222110 0000 000000 000 Q ss_pred hhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEc Q lcl|NC_012784. 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~ 160 (415) ..++.+.+. ......+..+||++||+++.+.|++.+...++++++|++.++ T Consensus 68 ---------------------t~ee~~~~~--------~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~ 118 (377) T protein:vir:96 68 ---------------------TAEEIKFFN--------DIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNT 118 (377) T ss_pred ---------------------CHHHHHHHH--------HHHhcCCCCCCceecCHHHHHHHHHHHHhhhhhhhhceeEec Confidence 000111010 111234677889999999999999999999999999999887 Q ss_pred cCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) Q Consensus 161 ~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d 240 (415) ++ .+.+++..+.+.+.|++|++++++.++++|+++++.+||++++++||++||+|+.+++++||.++|+++++++++ T Consensus 119 ~~---~~~i~~~~~~~~a~wv~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~~ll~ds~~~le~~i~~~l~~~~~~~~~ 195 (377) T protein:vir:96 119 SL---RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALE 195 (377) T ss_pred CC---ceEEEEecCCcceeEeecccccccccCccceeEeeeeeeEEeechhhHHHhhcchhhHHHHHHHHHHHHHHHHHh Confidence 53 345667778889999999998876678999999999999999999999999999999999999999999999999 Q ss_pred HHHhhccccccccccccccccccccc----------------cccchhhHHHHHHHHHHhhhhcc-----------CCCE Q lcl|NC_012784. 241 KAIIDVITKGSTGSTSSGFEKEGKKL----------------EVKKAKSLDDIKDAINLNVKPNY-----------EHNV 293 (415) Q Consensus 241 ~~il~g~g~~~~~~~~~~~~~~~~~~----------------~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~ 293 (415) .+|++|+|++.|.++........... ......+.+.+.+.+..+...+. .+++ T Consensus 196 ~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~ 275 (377) T protein:vir:96 196 LAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVK 275 (377) T ss_pred hceEeccCCCcceeeeeccccccccccccccccceeeccccccccccCChhHHHHHHHHHHHhhccccccccccccCceE Confidence 99999999998887764322111100 00111234556666555554433 3567 Q ss_pred EEEcHHHHHHHHHhhccCCcccccCcccCCCCceeccee--eEEeccccccccCCceEEEechhhcEEEEeecceEEEEe Q lcl|NC_012784. 294 AIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAK--IEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWT 371 (415) Q Consensus 294 ~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~p--V~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~ 371 (415) |+|||.++..+ .|+|.|++ .+|.+.+++|+| |++++.||.+ .++||||++ |++++|++++++.+ T Consensus 276 ~~mn~~t~~~~------~~~~~~~~--~~G~~~~~l~~p~~v~~s~~~p~~-----~i~fgdf~~-Y~i~~r~~~~i~~~ 341 (377) T protein:vir:96 276 LLLNPEDRWTL------EAKFTSRN--QFGEYVTVLPHGITILESLAVETG-----KAIAFVANR-YDAFMATASTIEEY 341 (377) T ss_pred EEEchhhHHhc------cccccccC--CCCCceeccCCCceEEecCCCCcc-----cEEEEEcCc-EEEEEecccEEEee Confidence 99999997654 46777764 355667888887 4556677754 389999998 88899999999998 Q ss_pred eccc---CceEEEEEEEeccEEeccccEEEEEeecC Q lcl|NC_012784. 372 DYMH---FGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) Q Consensus 372 ~~~~---~~~~~~~~~r~d~~v~~p~a~~~~~~t~~ 404 (415) ++.. +++.+|+..|+||++++|+||++++++-- T Consensus 342 ~~~~~~~d~~~f~~~~r~dG~~~d~~a~~vl~l~~~ 377 (377) T protein:vir:96 342 DQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred hhhhhhcCCeEEEEEEEEcCEEecCCcEEEEEEecC Confidence 8654 46789999999999999999999998866 No 82 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=100.00 E-value=9.8e-53 Score=305.71 Aligned_cols=280 Identities=10% Similarity=0.005 Sum_probs=233.4 Q ss_pred HhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCccccccccccccccccc Q lcl|NC_012784. 113 LETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAV 192 (415) Q Consensus 113 ~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~ 192 (415) +...........+.++++.+||++++++|++.+++.++++++++++++++.. ...+++..+.+.+.|++||+.+|+. . T Consensus 1 m~~~~~~~~~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~~~~~~~~~~~a~~v~Eg~~~~~~-~ 78 (297) T protein:vir:95 1 MTVQTFNPENVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQ-EKTVYVQTDGISAYWVNETEKIKTD-K 78 (297) T ss_pred CCccccccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeecCCCc-cEEEEEEcCCceeEEeecCcccccc-c Confidence 2222233334455667788999999999999999999999999999987654 4455677788899999999999975 5 Q ss_pred ccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchh Q lcl|NC_012784. 193 KPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAK 272 (415) Q Consensus 193 ~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~ 272 (415) ++|+.++++++|++++++||+|+++|+.+++++||.++|++++++++|.++|+|+|++.+.+...... .......+.. T Consensus 79 ~~f~~v~l~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~gi~~~~~--~~~~~~~~~~ 156 (297) T protein:vir:95 79 PEVVPVTLKAHKLGIILVTSREALNYTWKKFFEDMKPQIVEAFYKKIDEAGLLGHDTPFANSVAKAAK--DANKVIGGPI 156 (297) T ss_pred cceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCccccccccccc--ccceeccccc Confidence 89999999999999999999999999999999999999999999999999999999887765544322 2223334566 Q ss_pred hHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCcccCCCCceecceeeEEeccccccccCCceEEEe Q lcl|NC_012784. 273 SLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIG 352 (415) Q Consensus 273 ~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~g 352 (415) +++++++++.++..+++.+++|+|||.++..|++|+|++|+|+|.+. +++|+|+||+++.+.+. ++..+++| T Consensus 157 t~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~G~~i~~~~-----~~~l~G~Pv~~~~~~~~---~~~~~~~g 228 (297) T protein:vir:95 157 NYDNILKLQDALYDADVEPNAFVSKIQNRSALREARDGNKVSIYDKA-----ANTIDGITTVDLKSARF---EKGDLLAG 228 (297) T ss_pred CHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCceeecCC-----CCcccceeeEeecCCCC---CCceEEEE Confidence 89999999999999999999999999999999999999999999643 46899999998765442 34468999 Q ss_pred chhhcEEEEeecceEEEEeecc-----------------cCceEEEEEEEeccEEeccccEEEEEeecCC Q lcl|NC_012784. 353 NLKDAIVLFDRSQYQASWTDYM-----------------HFGECLMIAVRQDCRILDYKSAIVIEYDDSE 405 (415) Q Consensus 353 d~~~~~~~~~~~~~~i~~~~~~-----------------~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~~ 405 (415) ||++ +.++++++++++.+++. .+.+.+|+++|+|+++.+|+||++|+.+++. T Consensus 229 d~s~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~at~~ 297 (297) T protein:vir:95 229 DFDN-LIYGVPYNITYKISEEGQISTITNADGTPINLFEQEMIAIRATMDIAVMITKTDAFAKLTPAERV 297 (297) T ss_pred eccc-EEEEEecCeEEEEeeccccccccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEeecCCC Confidence 9997 55788999999887653 2445689999999999999999999877776 No 83 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=100.00 E-value=1.3e-52 Score=305.10 Aligned_cols=286 Identities=12% Similarity=0.016 Sum_probs=224.3 Q ss_pred hcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCcccccccccccccccccccceeeEe Q lcl|NC_012784. 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAY 200 (415) Q Consensus 121 ~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~ 200 (415) +...+.+.||+++|++++.+|++.+++.+++++++++++++++.. .+++..+.+.++|++||+.+|+ ++++|+++++ T Consensus 1 Ma~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~~~~--~ip~~~~~~~a~wv~Eg~~~~~-s~~~f~~v~l 77 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPV--KGAVFSGVPRAKIVGEGEVKPS-ASVDVSAFTA 77 (315) T ss_pred CCCCcCCcCceEcchHHHHHHHHHHHhhchhhhhcceeecCCCce--EEEEEeCCcceEEeeCCccccc-cccceeeeEe Confidence 445567788999999999999999999999999999999876654 4556678889999999999986 5699999999 Q ss_pred eeeeEEEeehhhHHHHhcchHH----HHHHHHHHHHHHHHHHHHHHHhhccccccccccc--cccccccccccccchhhH Q lcl|NC_012784. 201 DINTHRGYFRISREAIEDAKVN----VLQELKLWMARTIAATRNKAIIDVITKGSTGSTS--SGFEKEGKKLEVKKAKSL 274 (415) Q Consensus 201 ~~~k~a~~~~iS~e~l~ds~~~----l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~--~~~~~~~~~~~~~~~~~~ 274 (415) .++|++++++||+|+++|+..+ |+++|.++|++++++++|.++++|++.+.+.+.. .............+...+ T Consensus 78 ~~~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (315) T protein:vir:80 78 QPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSLNKTKNIVDATDSAT 157 (315) T ss_pred eeeeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCccccccccccccccceeeccccch Confidence 9999999999999999888765 7899999999999999999999998754432221 111112222223344557 Q ss_pred HHHHHHHHHhhhh-ccCCCEEEEcHHHHHHHHHhhccCCccccc----CcccCCCCceecceeeEEecccccccc----C Q lcl|NC_012784. 275 DDIKDAINLNVKP-NYEHNVAIVSQTMFAKLDKMKDKLGNYLIQ----PDVKEKTQQRLLGAKIEILPDEVLGQK----G 345 (415) Q Consensus 275 ~~~~~~~~~~~~~-~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~----~~~~~~~~~~l~G~pV~~~~~~~~~~~----~ 345 (415) +++.+++..+..+ +..+++|+|||.++..|++++|.+|++++. ++...+.+++|+|+||+++++||.+.. . T Consensus 158 ~d~~~~~~~~~~~~~~~~~~~imn~~~~~~L~~l~~~~g~~~~g~~~~~~~~~g~~~tl~G~PV~~~~~~~~~~~~~~~~ 237 (315) T protein:vir:80 158 ADLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPAS 237 (315) T ss_pred HHHHHHHHHHhhccCccceEEEEcHHHHHHHHHHhhccCCcccccccccccccCCCceecceeeEecCcCCccccccccc Confidence 8888888887655 456778999999999999998877664432 355666778999999999999986532 2 Q ss_pred CceEEEechhhcEEEEeecceEEEEeecc-----------cCceEEEEEEEeccEEeccccEEEEEeec---CCCCccc Q lcl|NC_012784. 346 NNTLIIGNLKDAIVLFDRSQYQASWTDYM-----------HFGECLMIAVRQDCRILDYKSAIVIEYDD---SERGEGD 410 (415) Q Consensus 346 ~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-----------~~~~~~~~~~r~d~~v~~p~a~~~~~~t~---~~~~~~~ 410 (415) ...++||||++ +.+..+++++++++++. +++..+|+++|+|++|.+|+||++|+..+ +..+.++ T Consensus 238 ~~~~~~GDfs~-~~~g~~~~~~i~i~~~~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~a~~~~~~~~~ 315 (315) T protein:vir:80 238 GVKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAEN 315 (315) T ss_pred ccEEEEeeccc-EEEEEecCeeEEEeccccccCcccchhhcCcEEEEEEEEecceeecccceEEEeeccCCCCCCCCCC Confidence 34578999998 55667889999877642 34567899999999999999999999654 2333444 No 84 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=100.00 E-value=1.8e-52 Score=304.26 Aligned_cols=336 Identities=14% Similarity=0.094 Sum_probs=221.4 Q ss_pred HHHHHHHHhhhhhcc--ccccccchhh-hhhHHHHHHHHHHHHHhhhhhHHHHHHH-HHHhhhhhhhhcccccccceeec Q lcl|NC_012784. 58 LDKLKEKDGTSENNQ--QSVEVNEART-YRNQANINDLGISIQNTKVTSQEVRDFT-EYLETRNDIQGGSLKTDSGFVVI 133 (415) Q Consensus 58 ~~~~~~~~~~~~~~~--~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~v 133 (415) +. .......+.. .......... ............................ ..............+...||++| T Consensus 1 ~a---~~~a~~~~~~~~~~~~~~~~~~~~~kg~~~~~~~~a~a~~~g~~~~a~~~a~~~~~~~~~~~a~~~~~~~Gg~lv 77 (366) T protein:vir:57 1 MA---AAVAVPVKAHSVAPGIIIKEELQQYKGAGMTRMVMSIAAGKGNLADAAKFAATELGDTGLSMAISTAAGSGGALI 77 (366) T ss_pred Cc---ccccccccccccccccccccccccccchhHHHHHHHHHhcccchhHHHHHHHHhhcchhhhhhccccccCCcccc Confidence 00 0000000000 0000000000 0000000000000000000000000000 00011111122233455789999 Q ss_pred chhHHhHHHHHHhhhhhhhhc-ceeEEccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhh Q lcl|NC_012784. 134 PEEIVTDILKLKEVEFNLDKY-VTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRIS 212 (415) Q Consensus 134 P~~~~~~Ii~~~~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS 212 (415) |+++.++|++.+++.++++.+ ++++++.+ +.+.+++.++++.+.|++|++.+|++ .++|++|++.++|++++++|| T Consensus 78 P~~~~~~ii~~l~~~s~l~~lg~~~v~~~~--g~~~~p~~t~~~~a~wv~E~~~~~~s-~~~f~~i~~~~~k~~~~~~iS 154 (366) T protein:vir:57 78 PQNMQNEVIELLRDRTVVRILGARSIPLPN--GNLSMPRLSGGATAGYVGEGKDVVAT-GATFDDVKLSAKTMIALVPVS 154 (366) T ss_pred chhHHHHHHHHHhhhcchhhhceeeeecCC--CceEEEEEeCCcceeeeccCcccccc-ccceeEEEEeeEEEEEeehhh Confidence 999999999999999999987 77766654 45667777888999999999999975 699999999999999999999 Q ss_pred HHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-ccccccccccccccc----cccchhhHH---HHHHHHHHh Q lcl|NC_012784. 213 REAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGS-TGSTSSGFEKEGKKL----EVKKAKSLD---DIKDAINLN 284 (415) Q Consensus 213 ~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~-~~~~~~~~~~~~~~~----~~~~~~~~~---~~~~~~~~~ 284 (415) +|+|+|+.+++++||+++|++++++++|.+||+|+|++. +.++........... ........+ +.+...... T Consensus 155 ~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~ 234 (366) T protein:vir:57 155 NQLIGRAGFNVEQLLLGDILSAIATREDKAFLRDDGTGDTPKGMKAVATAANRLVAWTGTAINLTTIDEYLDSLILKHMD 234 (366) T ss_pred HHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeeccccccceeeccccccchhhHHHHHHHHHHhhhc Confidence 999999999999999999999999999999999999763 333322211111111 111112222 333334444 Q ss_pred hhhccCCCEEEEcHHHHHHHHHhhccCCcccccCcccCCCCceecceeeEEeccccccc---cCCceEEEechhhcEEEE Q lcl|NC_012784. 285 VKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQ---KGNNTLIIGNLKDAIVLF 361 (415) Q Consensus 285 ~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~---~~~~~~~~gd~~~~~~~~ 361 (415) ...+..+++|+|||.++..|++++|++|+|+|. +. .+++|+|+||++++++|... .+...++||||++ +++. T Consensus 235 ~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~l~~-~~---~~g~l~G~Pvv~s~~ip~~~~~~~~~~~i~~gdfs~-~~i~ 309 (366) T protein:vir:57 235 SNSNMIRCGWGLSNRTYMTLFGLRDGNGNKVYP-EM---SQGILKGYPIQRTSAIPANLGDDGNESEIYFCDFND-VVIG 309 (366) T ss_pred cccccccCEEEecHHHHHHHHhhhccCCceecc-CC---CCCeecceeeEEccccccccccCCCccEEEEEecce-EEEE Confidence 456778999999999999999999999999995 33 34589999999999998632 2345689999997 5588 Q ss_pred eecceEEEEeecc--------------cCceEEEEEEEeccEEeccccEEEEEeecC Q lcl|NC_012784. 362 DRSQYQASWTDYM--------------HFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) Q Consensus 362 ~~~~~~i~~~~~~--------------~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~ 404 (415) ++.+++++++++. .+...+|+++|+|+++.||+||++++=..= T Consensus 310 ~~~~i~i~~~~ea~~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 310 EDGMMKVDFSTEATYKDADGQLVSAFARNQSLIRVVTEHDIGFRHPEGLVLGTGVIW 366 (366) T ss_pred EecceEEEEeeccccccccccchhhhhcCceeEEeeeeeCcEeeccccEEEEecccC Confidence 8999999987653 234678999999999999999999882222 No 85 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=100.00 E-value=3.4e-52 Score=302.73 Aligned_cols=304 Identities=13% Similarity=0.058 Sum_probs=233.3 Q ss_pred ccccchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHH-hhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhh Q lcl|NC_012784. 75 VEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYL-ETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDK 153 (415) Q Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~ 153 (415) ++.. .......+.+.... +..............++.++|+++++.|++.+++.+++++ T Consensus 1 ~~k~---------------------~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~ 59 (324) T protein:vir:99 1 MEQT---------------------QKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMR 59 (324) T ss_pred CCCc---------------------hHhhHHHHHHHHHhhhhhhccccceeccCCCcceechhHHHHHHHHHHhhchhhh Confidence 0000 00001111111111 1111222333344556679999999999999999999999 Q ss_pred cceeEEccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHH Q lcl|NC_012784. 154 YVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMAR 233 (415) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~ 233 (415) +++++++++++.. +++..+.+.+.|++|++.+|+. .++|++++++++|++++++||+|+++|+.+++++||.++|++ T Consensus 60 ~~~~~~~~~~~~~--~p~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ 136 (324) T protein:vir:99 60 LGKYEPMEGTEKK--FTFWADKPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAE 136 (324) T ss_pred hcceeeccCCceE--EEEEecCcceeEeccCcccccc-ccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHH Confidence 9999998876655 4455677889999999999975 589999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCc Q lcl|NC_012784. 234 TIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGN 313 (415) Q Consensus 234 ~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~ 313 (415) ++++++|.++|+|+|++....+.... .........++.+++++.+++..+...++.+++|+|||++|..|++++|++|+ T Consensus 137 ai~~~~d~~~l~G~g~~~~~~~~~~~-~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~g~ 215 (324) T protein:vir:99 137 AFYKKFDEAGILNQGNNPFGKSIAQS-IEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETK 215 (324) T ss_pred HHHHHHHHHhhhcCCCCccCcccccc-ccccceeccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhcCCCc Confidence 99999999999999887543333222 22233344567789999999999999999999999999999999999999999 Q ss_pred ccccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecc-----------------cC Q lcl|NC_012784. 314 YLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM-----------------HF 376 (415) Q Consensus 314 ~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-----------------~~ 376 (415) |+|.+ +.+++|+|+||++++.++. ++..+++|||++ +.++++++++++++++. ++ T Consensus 216 ~~~~~----~~~~~l~G~PVv~~~~~~~---~~~~~i~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~ 287 (324) T protein:vir:99 216 ERIYD----RNSDTLDGLPVVNLKSSNL---KRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQD 287 (324) T ss_pred eeecC----CCCccccceeEEeecCCCC---CcceEEEEeccc-EEEEEecCcEEEEeecccccccccccccchhhhhcC Confidence 99853 3457899999999876653 344689999998 55788999999987653 34 Q ss_pred ceEEEEEEEeccEEeccccEEEEEeecCCC--Ccccc Q lcl|NC_012784. 377 GECLMIAVRQDCRILDYKSAIVIEYDDSER--GEGDL 411 (415) Q Consensus 377 ~~~~~~~~r~d~~v~~p~a~~~~~~t~~~~--~~~~~ 411 (415) .+.+|++.|+|+++.+|+||++++.+.+.. +-|.+ T Consensus 288 ~~~~r~~~r~d~~v~~~~a~~~lt~a~~~~~~~~~~~ 324 (324) T protein:vir:99 288 MVALRATMHVALHIADDKAFAKLVPADKKTDSVPGEV 324 (324) T ss_pred cEEEEEEEEEccEEecccceEEEEeccCCCCCCCCCC Confidence 567899999999999999999998664222 22233 No 86 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=100.00 E-value=2.6e-52 Score=303.43 Aligned_cols=298 Identities=12% Similarity=0.047 Sum_probs=225.6 Q ss_pred HHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCc Q lcl|NC_012784. 97 QNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVA 176 (415) Q Consensus 97 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 176 (415) ... .......+ +.....+.... +.+.+|.++|++++++|++.+++.+++++++++++++++..++ ++.++.+ T Consensus 1 ~~~--~~~r~~~~---~~~~e~~a~~~-~~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~--p~~~~~~ 72 (326) T protein:vir:42 1 MAV--NPDRTTPF---LGVNDPKVAQT-GDSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKIPMGTTGQKI--PHWTGDV 72 (326) T ss_pred CCC--Cccchhhh---cCcchhhheec-cccCCcceechhhHHHHHHHHHhcchhhhhcceeeccCCceEE--EEEeCCc Confidence 000 00000000 11112222223 3334455789999999999999999999999999988766554 4566788 Q ss_pred ccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Q lcl|NC_012784. 177 ALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTS 256 (415) Q Consensus 177 ~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~ 256 (415) .+.|++||+.+|+. +++|++++++++|++++++||+|+++|+.+++++||.++|++++++++|+++++|+|++.+.+.. T Consensus 73 ~a~~v~Eg~~~~~~-~~~f~~i~~~~~k~~~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~~p~gi~ 151 (326) T protein:vir:42 73 SASWIGEGDMKPIT-KGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAINGTDSPFPTFLA 151 (326) T ss_pred ceEEecCCcccccc-ccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccc Confidence 89999999999975 69999999999999999999999999999999999999999999999999999999988776654 Q ss_pred ccccccccc----ccccchhhHHH--HHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCcccCCCC----- Q lcl|NC_012784. 257 SGFEKEGKK----LEVKKAKSLDD--IKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQ----- 325 (415) Q Consensus 257 ~~~~~~~~~----~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~----- 325 (415) ......... ........+.+ +...+..+...++.+++|+|||++|..|++|||++|+|||.+....+.+ T Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~ 231 (326) T protein:vir:42 152 QTTKEVSLVDPDGTGSNADLTVYDAVAVNALSLLVNAGKKWTHTLLDDITEPILNGAKDKSGRPLFIESTYTEENSPFRL 231 (326) T ss_pred ccccccceeecccccccccchhHHHHHHHHHhhhhhhccCccEEEEeHHHHHHHHHhhccCCceeeccccccCccccccC Confidence 333221111 11112222222 3455666667788899999999999999999999999999887665544 Q ss_pred ceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecc-----------------cCceEEEEEEEecc Q lcl|NC_012784. 326 QRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM-----------------HFGECLMIAVRQDC 388 (415) Q Consensus 326 ~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-----------------~~~~~~~~~~r~d~ 388 (415) ++|+|+||++++++|. ++..+++|||+++ .++++++++++.+++. ++.+.+|+.+|+|+ T Consensus 232 ~~l~G~pv~~~~~~~~---~~~~~~~Gd~s~~-~~~~~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~~d~ 307 (326) T protein:vir:42 232 GRIVARPTILSDHVAS---GTVVGYQGDFRQL-VWGQVGGLSFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYAF 307 (326) T ss_pred ceeeeeeEEEcCCCCC---CceEEEEeecceE-EEEEecceEEEEeecceeeecccccccchhhhhcCcEEEEEEEEecc Confidence 4799999999999875 3456789999985 4778899999876543 34577899999999 Q ss_pred EEeccccEEEEEeecCCCC Q lcl|NC_012784. 389 RILDYKSAIVIEYDDSERG 407 (415) Q Consensus 389 ~v~~p~a~~~~~~t~~~~~ 407 (415) ++.+|+||++|+-.+++.. T Consensus 308 ~v~~~~a~~~l~~~~~~~~ 326 (326) T protein:vir:42 308 HCNDKDAFVKLTNVDATEA 326 (326) T ss_pred EEecccceEEEeeccccCC Confidence 9999999999775554433 No 87 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=100.00 E-value=5.6e-52 Score=301.58 Aligned_cols=304 Identities=13% Similarity=0.058 Sum_probs=233.3 Q ss_pred ccccchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHH-hhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhh Q lcl|NC_012784. 75 VEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYL-ETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDK 153 (415) Q Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~ 153 (415) .+.. .......+.|.... +..............++.++|+++++.|++.+++.+++++ T Consensus 1 ~~~~---------------------~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~ 59 (324) T protein:vir:10 1 MEQT---------------------QKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQ 59 (324) T ss_pred CCCc---------------------hHHHHHHHHHHHHhhccceecccceeccCCCcceechhHHHHHHHHHHhhchhhh Confidence 0000 00001111122211 1112222333444556679999999999999999999999 Q ss_pred cceeEEccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHH Q lcl|NC_012784. 154 YVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMAR 233 (415) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~ 233 (415) +++++++++++..+| +..+.+.+.|++|++.+|+. .++|++++++++|++++++||+|+++|+.+++++||.++|++ T Consensus 60 ~~~~~~~~~~~~~~p--~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ 136 (324) T protein:vir:10 60 LGKYEPMEGTEKKFT--FWADKPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAE 136 (324) T ss_pred hcceeeccCCceEEE--EEeCCcceeEeccCcccccc-ccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHH Confidence 999999887765544 55677889999999999975 589999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCc Q lcl|NC_012784. 234 TIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGN 313 (415) Q Consensus 234 ~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~ 313 (415) ++++++|.++|+|+|++......... .........++.+++++.+++..+..+++.+++|+|||++|..|++++|++|+ T Consensus 137 ai~~~~d~a~l~G~g~~~~~~~i~~~-~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~g~ 215 (324) T protein:vir:10 137 AFYKKFDEAGILNQGNNPFGKSIAQS-IEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETK 215 (324) T ss_pred HHHHHHHHHhhhcCCCCccCcccccc-ccccceeccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCc Confidence 99999999999999887543332222 22223334456789999999999999999999999999999999999999999 Q ss_pred ccccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecc-----------------cC Q lcl|NC_012784. 314 YLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM-----------------HF 376 (415) Q Consensus 314 ~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-----------------~~ 376 (415) |+|.+ +.+++|+|+||++++.++. ++..+++|||+++ .++++++++++++++. ++ T Consensus 216 ~~~~~----~~~~~l~G~PV~~~~~~~~---~~~~~~~gd~~~~-~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 287 (324) T protein:vir:10 216 ERIYD----RNSDTLDGLPVVNLKSSNL---KRGELITGDFDKL-IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQD 287 (324) T ss_pred eeecC----CCCccccceeEEeecCCCC---CcceEEEEecccE-EEEEecCcEEEEeecccccccccccccchhhhhcC Confidence 99864 3457899999998876543 3456899999984 5778999999887653 34 Q ss_pred ceEEEEEEEeccEEeccccEEEEEeecCCC--Ccccc Q lcl|NC_012784. 377 GECLMIAVRQDCRILDYKSAIVIEYDDSER--GEGDL 411 (415) Q Consensus 377 ~~~~~~~~r~d~~v~~p~a~~~~~~t~~~~--~~~~~ 411 (415) .+.+|+++|+|+++.+|+||++++...+.. +-|.+ T Consensus 288 ~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~~~~~ 324 (324) T protein:vir:10 288 MVALRATMHVALHIADDKAFAKLVPADKKTDSVPGEV 324 (324) T ss_pred cEEEEEEEEEccEEecccceEEEEeccCCCCCCCCCC Confidence 567899999999999999999998654322 23333 No 88 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=100.00 E-value=1.7e-51 Score=298.99 Aligned_cols=349 Identities=8% Similarity=0.002 Sum_probs=236.5 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccch Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) ||..+++.+.+.++.+.+++.... +...+.+++ .++++.+++....+ ++.+..... .+. . T Consensus 3 ik~~~~~~~~~~e~~~~~~~~~~~------~~~~~~~~~---~~~~~~~~~~~~~~--~e~~~~~~~-~~~--~------ 62 (381) T protein:vir:95 3 INLSETFANAKNEFINAVNNGEPQ------ERQNELYGD---MINQLFEETKLQAK--AEAERVSSL-PKS--A------ 62 (381) T ss_pred hhhHHHHHHHHHHHHHHHhhhhhh------HHHHHHHHH---HHHhhhhhHHHHHH--HHHHHHHHh-ccC--c------ Confidence 666666666655554444322111 111111111 11121111111100 000000000 000 0 Q ss_pred hhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEc Q lcl|NC_012784. 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~ 160 (415) ......+.+.+ ......+.++||++||+++.+.|++.+++.++++++|++.++ T Consensus 63 ------------------~~lt~~e~~~~---------~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~ 115 (381) T protein:vir:95 63 ------------------QSLSANQRSFF---------MDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA 115 (381) T ss_pred ------------------ccccHHHHHHH---------HHHhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEec Confidence 00000111111 011234566789999999999999999999999999999887 Q ss_pred cCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) Q Consensus 161 ~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d 240 (415) ++ ...+++.++.+.+.|++|+++.++.++++|+++++.+||++++++||++||+|+.+++++||.++|+++++.+++ T Consensus 116 ~~---~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~ 192 (381) T protein:vir:95 116 GL---RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALE 192 (381) T ss_pred Cc---ceEEEEecCCcceeeecccccccccccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhh Confidence 53 355677788889999999988876668999999999999999999999999999999999999999999999999 Q ss_pred HHHhhcccccccccccccccccccc--------------ccccchhhHHHHHHHHHHhhh-------hccCCCEEEEcHH Q lcl|NC_012784. 241 KAIIDVITKGSTGSTSSGFEKEGKK--------------LEVKKAKSLDDIKDAINLNVK-------PNYEHNVAIVSQT 299 (415) Q Consensus 241 ~~il~g~g~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~v~~~~ 299 (415) .+|++|+|++.|.++.......... +.......++.+...+..+.. .+..++.|+|||. T Consensus 193 ~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~ 272 (381) T protein:vir:95 193 TAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPS 272 (381) T ss_pred heeEeccCCCCceeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccc Confidence 9999999999887775432211100 011112223444444444432 4566789999999 Q ss_pred HHHHHHHhh---ccCCcccccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecccC Q lcl|NC_012784. 300 MFAKLDKMK---DKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHF 376 (415) Q Consensus 300 ~~~~l~~lk---d~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~ 376 (415) ++..|+.++ +++|+|+|..+ .|++|++++.||.+ .++||||++ |++++|++++++.+++..| T Consensus 273 t~~~l~~~~~~~~~~G~~v~~l~---------~g~~vv~s~~~p~~-----~iifgDfs~-Y~i~~r~~~~i~~~~~~~~ 337 (381) T protein:vir:95 273 DAFEVQAQYTHLNANGVYVTALP---------FNLNVIESTVQEAG-----KVLTYVKGL-YDGYLAGGINVQKFKETLA 337 (381) T ss_pred cHHhhccccccCCCCCceeecCC---------CCceEEecCCCCcC-----cEEEEeccc-EEEEEecccEEEeechhHh Confidence 999888654 77899887421 47789999999854 389999997 8889999999999987554 Q ss_pred ---ceEEEEEEEeccEEeccccEEEEEeec---CCCCccccccc Q lcl|NC_012784. 377 ---GECLMIAVRQDCRILDYKSAIVIEYDD---SERGEGDLGLE 414 (415) Q Consensus 377 ---~~~~~~~~r~d~~v~~p~a~~~~~~t~---~~~~~~~~~~~ 414 (415) ++.+|+..|+||++++|+||++++++. ++...+.--|+ T Consensus 338 ~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~~~~~~~~~~~~ 381 (381) T protein:vir:95 338 LDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) T ss_pred hcCCeEEEEEEEEcCEEecCceEEEEEEEecCCCcCcccccccC Confidence 568999999999999999999999885 44555555555 No 89 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=100.00 E-value=1.7e-51 Score=298.99 Aligned_cols=349 Identities=8% Similarity=0.002 Sum_probs=236.5 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccch Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) ||..+++.+.+.++.+.+++.... +...+.+++ .++++.+++....+ ++.+..... .+. . T Consensus 3 ik~~~~~~~~~~e~~~~~~~~~~~------~~~~~~~~~---~~~~~~~~~~~~~~--~e~~~~~~~-~~~--~------ 62 (381) T protein:vir:10 3 INLSETFANAKNEFINAVNNGEPQ------ERQNELYGD---MINQLFEETKLQAK--AEAERVSSL-PKS--A------ 62 (381) T ss_pred hhhHHHHHHHHHHHHHHHhhhhhh------HHHHHHHHH---HHHhhhhhHHHHHH--HHHHHHHHh-ccC--c------ Confidence 666666666655554444322111 111111111 11121111111100 000000000 000 0 Q ss_pred hhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEc Q lcl|NC_012784. 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~ 160 (415) ......+.+.+ ......+.++||++||+++.+.|++.+++.++++++|++.++ T Consensus 63 ------------------~~lt~~e~~~~---------~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~ 115 (381) T protein:vir:10 63 ------------------QSLSANQRSFF---------MDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA 115 (381) T ss_pred ------------------ccccHHHHHHH---------HHHhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEec Confidence 00000111111 011234566789999999999999999999999999999887 Q ss_pred cCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) Q Consensus 161 ~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d 240 (415) ++ ...+++.++.+.+.|++|+++.++.++++|+++++.+||++++++||++||+|+.+++++||.++|+++++.+++ T Consensus 116 ~~---~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~ 192 (381) T protein:vir:10 116 GL---RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALE 192 (381) T ss_pred Cc---ceEEEEecCCcceeeecccccccccccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhh Confidence 53 355677788889999999988876668999999999999999999999999999999999999999999999999 Q ss_pred HHHhhcccccccccccccccccccc--------------ccccchhhHHHHHHHHHHhhh-------hccCCCEEEEcHH Q lcl|NC_012784. 241 KAIIDVITKGSTGSTSSGFEKEGKK--------------LEVKKAKSLDDIKDAINLNVK-------PNYEHNVAIVSQT 299 (415) Q Consensus 241 ~~il~g~g~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~v~~~~ 299 (415) .+|++|+|++.|.++.......... +.......++.+...+..+.. .+..++.|+|||. T Consensus 193 ~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~ 272 (381) T protein:vir:10 193 TAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPS 272 (381) T ss_pred heeEeccCCCCceeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccc Confidence 9999999999887775432211100 011112223444444444432 4566789999999 Q ss_pred HHHHHHHhh---ccCCcccccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecccC Q lcl|NC_012784. 300 MFAKLDKMK---DKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHF 376 (415) Q Consensus 300 ~~~~l~~lk---d~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~ 376 (415) ++..|+.++ +++|+|+|..+ .|++|++++.||.+ .++||||++ |++++|++++++.+++..| T Consensus 273 t~~~l~~~~~~~~~~G~~v~~l~---------~g~~vv~s~~~p~~-----~iifgDfs~-Y~i~~r~~~~i~~~~~~~~ 337 (381) T protein:vir:10 273 DAFEVQAQYTHLNANGVYVTALP---------FNLNVIESTVQEAG-----KVLTYVKGL-YDGYLAGGINVQKFKETLA 337 (381) T ss_pred cHHhhccccccCCCCCceeecCC---------CCceEEecCCCCcC-----cEEEEeccc-EEEEEecccEEEeechhHh Confidence 999888654 77899887421 47789999999854 389999997 8889999999999987554 Q ss_pred ---ceEEEEEEEeccEEeccccEEEEEeec---CCCCccccccc Q lcl|NC_012784. 377 ---GECLMIAVRQDCRILDYKSAIVIEYDD---SERGEGDLGLE 414 (415) Q Consensus 377 ---~~~~~~~~r~d~~v~~p~a~~~~~~t~---~~~~~~~~~~~ 414 (415) ++.+|+..|+||++++|+||++++++. ++...+.--|+ T Consensus 338 ~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~~~~~~~~~~~~ 381 (381) T protein:vir:10 338 LDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) T ss_pred hcCCeEEEEEEEEcCEEecCceEEEEEEEecCCCcCcccccccC Confidence 568999999999999999999999885 44555555555 No 90 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=100.00 E-value=1.6e-51 Score=299.07 Aligned_cols=349 Identities=8% Similarity=-0.004 Sum_probs=234.5 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccch Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) ||..+++++++.++.+.+++...... +.+.++.+ ..++..+.+..... +..+... ..+. T Consensus 3 ~kl~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~---~~~~~~~~~~~~~~--e~~~~~~-~~~~--------- 61 (381) T protein:vir:10 3 INLSETFANAKNEFINAVNNGEPQER------QNELYGDM---INQLFEETKLQAKA--EAERVSS-LPKS--------- 61 (381) T ss_pred hhHHHHHHHHHHHHHHHHHhhhHHHH------HHHHHHHH---HHhhhhhHHHHHHH--HHHHHHH-hccc--------- Confidence 78777777777666665543321111 11111111 11111111111100 0000000 0000 Q ss_pred hhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEc Q lcl|NC_012784. 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~ 160 (415) .......+.+.+ ......+..+||++||+++.+.|++.++..++++++|+++++ T Consensus 62 -----------------~~~l~~~e~~~~---------~~~~~~t~~~Gg~lvP~~~~~~I~~~l~~~spir~~a~v~~~ 115 (381) T protein:vir:10 62 -----------------AQTLSANQRNFF---------MDINKSVGYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA 115 (381) T ss_pred -----------------ccccCHHHHHHH---------HHHhhcCCCCCceecCHHHHHHHHHHHHhhcceeeeeeeEec Confidence 000011111110 012234667789999999999999999999999999999887 Q ss_pred cCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) Q Consensus 161 ~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d 240 (415) ++ ...+++.++.+.+.|++|.++.++.++++|+++++.+||++++++||++||+|+.+++++||.++|+++++++++ T Consensus 116 ~~---~~~i~~~~~~~~a~W~~e~~~~~~~~~~~f~~i~l~~~kl~a~i~is~elL~Ds~~~le~~i~~~la~~~a~~~~ 192 (381) T protein:vir:10 116 GL---RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALE 192 (381) T ss_pred Cc---ceEEEeecCCcceEEeecccccccccCccceeEeecceeEEeeccccHHHHhccHHHHHHHHHHHHHHHHHHHhh Confidence 53 345667778888999999888776778999999999999999999999999999999999999999999999999 Q ss_pred HHHhhccccccccccccccccccccc-------cccchhhHHHHH-------HHHHHhh-------hhccCCCEEEEcHH Q lcl|NC_012784. 241 KAIIDVITKGSTGSTSSGFEKEGKKL-------EVKKAKSLDDIK-------DAINLNV-------KPNYEHNVAIVSQT 299 (415) Q Consensus 241 ~~il~g~g~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~-------~~~~~~~-------~~~~~~~~~v~~~~ 299 (415) .+|++|+|++.|.|+........... ......++.++. ..+..+. ..+..++.|+|||. T Consensus 193 ~afi~GdG~~qP~Gil~~~~~~~~~~~g~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~ 272 (381) T protein:vir:10 193 TAFLKGTGKDQPIGLNRQVQKGVSVTDGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPS 272 (381) T ss_pred ceeEecccCCCceeeeecCCccccccccccccccccccccccchhhHHHHHHHHHHhhhhhhccccccccCceEEEEchh Confidence 99999999999887754322111100 011111222222 2211111 13556789999999 Q ss_pred HHHHHHHh---hccCCcccccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecccC Q lcl|NC_012784. 300 MFAKLDKM---KDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHF 376 (415) Q Consensus 300 ~~~~l~~l---kd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~ 376 (415) ++..|+.+ .|++|+|+|..+ .|+||+++++||.+ .++||||++ |++++|.+++++.+++..| T Consensus 273 t~~~l~~~~~~~~~~G~~v~~lp---------~g~~vv~~~~~p~~-----~i~fGDfs~-Y~i~~r~~~~i~~~~~~~~ 337 (381) T protein:vir:10 273 DAFEVQAQYTHLNANGVYVTALP---------FNLNVIESTVQEAG-----KVLTYVKGL-YDGYLAGGINVQKFKETLA 337 (381) T ss_pred hHHhhccccccCCCCCceeecCC---------CCceeEEcCCCCcC-----cEEEEEccc-EEEEEecccEEEeechhhh Confidence 99888764 488999987522 47899999999864 389999997 7889999999999887554 Q ss_pred ---ceEEEEEEEeccEEeccccEEEEEeec---CCCCccccccc Q lcl|NC_012784. 377 ---GECLMIAVRQDCRILDYKSAIVIEYDD---SERGEGDLGLE 414 (415) Q Consensus 377 ---~~~~~~~~r~d~~v~~p~a~~~~~~t~---~~~~~~~~~~~ 414 (415) ++.|++..|+||++++|+||++++++. +|+=++-..++ T Consensus 338 ~~d~~~f~a~~r~dG~~~~~~A~~v~~l~~~~~~~~~~~~~~~~ 381 (381) T protein:vir:10 338 LDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEDTEETL 381 (381) T ss_pred hcCceEEEEEEEEcCEEecCCcEEEEEEeecCCccccccccccC Confidence 568999999999999999999999873 23333333333 No 91 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=100.00 E-value=2.4e-52 Score=303.59 Aligned_cols=292 Identities=10% Similarity=0.020 Sum_probs=226.3 Q ss_pred hhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCcccc Q lcl|NC_012784. 100 KVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALE 179 (415) Q Consensus 100 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~ 179 (415) +.. ......+.+.....+.+.++.+||++++++|++.+++.++++++++++++.+++.++ ++..+.+.+. T Consensus 1 ~~~--------~~~~~~~~~~~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~--p~~~~~~~a~ 70 (320) T protein:vir:10 1 MAA--------GTAFQVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKI--PHWIGDVSAQ 70 (320) T ss_pred CCC--------CccCCHHHHHhhccccccccccccHHHHHHHHHHHHhccchhhhcceeeccCCceEE--EEEeCCcceE Confidence 000 000011222233344555666899999999999999999999999999988766554 4556778899 Q ss_pred cccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccc Q lcl|NC_012784. 180 KVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGF 259 (415) Q Consensus 180 ~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~ 259 (415) |++|++.+|++ .++|++++++++|++++++||+|+++|+.+++++||.++|++++++++|++||+|+|++.+....... T Consensus 71 ~v~E~~~~~~~-~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~~ 149 (320) T protein:vir:10 71 WIGEGDMKPIT-KGNMTSQNIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALNGTDSPFPTYLAQTT 149 (320) T ss_pred EecCCcccccc-ccceeEEEEeeEEEEEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCccccccc Confidence 99999999975 68999999999999999999999999999999999999999999999999999999987765443222 Q ss_pred ccccccc----cccchhhH-HHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCcccCCC-----Cceec Q lcl|NC_012784. 260 EKEGKKL----EVKKAKSL-DDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKT-----QQRLL 329 (415) Q Consensus 260 ~~~~~~~----~~~~~~~~-~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~-----~~~l~ 329 (415) ....... .......+ +.+.+++..+...+..+++|+|||++|.+|+++||++|+|+|.+....+. ..+++ T Consensus 150 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~ 229 (320) T protein:vir:10 150 KSVSLADPGGATASDLTAYDAVAVNGLSLLVNAKKKWTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDENSPFRAGRIV 229 (320) T ss_pred ccccceecccccccccccHHHHHHHHHhhhhcccCCCcEEEEcHHHHHHHHHhhccCCceeeccccccCccccccCceee Confidence 2111111 11111222 34667788888889999999999999999999999999999987655443 35799 Q ss_pred ceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecc-----------------cCceEEEEEEEeccEEec Q lcl|NC_012784. 330 GAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM-----------------HFGECLMIAVRQDCRILD 392 (415) Q Consensus 330 G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-----------------~~~~~~~~~~r~d~~v~~ 392 (415) |+||++++.+|.+ +..++||||++ ++++.+++++++.+++. ++...+|+++|+|+++++ T Consensus 230 g~pv~~~~~~~~~---~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~~ 305 (320) T protein:vir:10 230 SRPTILSDHVADG---TTVGYMGDFRN-VIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFHNND 305 (320) T ss_pred eeeeEecCCCCCC---ceEEEEeecce-EEEEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEEEeeccEEec Confidence 9999999988754 44578999997 45788999999877643 345678999999999999 Q ss_pred cccEEEEEe-ecCCC Q lcl|NC_012784. 393 YKSAIVIEY-DDSER 406 (415) Q Consensus 393 p~a~~~~~~-t~~~~ 406 (415) |+||++++- ++|++ T Consensus 306 ~~a~~~l~~~~ap~~ 320 (320) T protein:vir:10 306 KDAFVKLTNVVTPDA 320 (320) T ss_pred ccceEEEEeccCCCC Confidence 999999984 44444 No 92 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=100.00 E-value=6.5e-52 Score=301.23 Aligned_cols=304 Identities=13% Similarity=0.063 Sum_probs=230.6 Q ss_pred ccccchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHh-hhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhh Q lcl|NC_012784. 75 VEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLE-TRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDK 153 (415) Q Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~ 153 (415) .+.. .......+.+..... ..............++.++|+++.++|++.+++.+++++ T Consensus 1 ~~~~---------------------~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~ 59 (324) T protein:vir:96 1 MEQT---------------------QKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQ 59 (324) T ss_pred CCcc---------------------hhhhHHHHHHHHhhhhhhhcccccccccCCCcceechhHHHHHHHHHHhhchhhh Confidence 0000 000011111111111 111122223334556778999999999999999999999 Q ss_pred cceeEEccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHH Q lcl|NC_012784. 154 YVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMAR 233 (415) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~ 233 (415) +++++++++++.+|| +..+.+.+.|++|++.+|+. .++|+++++.++|++++++||+|+++|+.+++++||.++|++ T Consensus 60 l~~~~~~~~~~~~~p--~~~~~~~a~~v~Eg~~~~~~-~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~ 136 (324) T protein:vir:96 60 LGKYEPMEGTEKKFT--FWADKPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAE 136 (324) T ss_pred hcceeeccCCceEEE--EEecCcceeeecCCcccccc-ccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHH Confidence 999999887765555 45667788999999999975 689999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCc Q lcl|NC_012784. 234 TIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGN 313 (415) Q Consensus 234 ~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~ 313 (415) ++++++|.++|+|+|++....+....... ......+..+++++++++.++...++.+++|+|||++|..|++++|++|+ T Consensus 137 aia~~~d~~~l~G~g~~~~~~~~~~~~~~-~~~~~~~~~~~~~i~~~~~~i~~~~~~~~~~i~n~~~~~~L~~lkd~~G~ 215 (324) T protein:vir:96 137 AFYKKFDEAGILNQGNNPFGKSIAQSIKK-TNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETK 215 (324) T ss_pred HHHHHHHHHhhhcCCCCCcCccccccccc-cceecccccchHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhCCCCC Confidence 99999999999999887654444332222 23334456679999999999999999999999999999999999999999 Q ss_pred ccccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecc-----------------cC Q lcl|NC_012784. 314 YLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM-----------------HF 376 (415) Q Consensus 314 ~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-----------------~~ 376 (415) |++.+ +.+++|+|+||++++..+. +...+++|||++ +.++++++++++.+++. ++ T Consensus 216 ~~~~~----~~~~~l~G~PV~~~~~~~~---~~~~~~~gd~s~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~n 287 (324) T protein:vir:96 216 ERIYD----RNSDSLDGLPVVNLKSSNL---KRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQD 287 (324) T ss_pred eeecC----CCCCcccceeeEeecCCCC---CcceEEEEecce-EEEEEecCcEEEEeecccccccccccccchhhhhcC Confidence 99853 3467899999998765543 344689999997 56778999999887653 34 Q ss_pred ceEEEEEEEeccEEeccccEEEEEeec--CCCCcccc Q lcl|NC_012784. 377 GECLMIAVRQDCRILDYKSAIVIEYDD--SERGEGDL 411 (415) Q Consensus 377 ~~~~~~~~r~d~~v~~p~a~~~~~~t~--~~~~~~~~ 411 (415) .+.+|+++|+|+++.+|+||++|+-.. +...-|-+ T Consensus 288 ~v~~r~~~r~d~~v~~~~a~~~l~~a~~~~~~~~~~~ 324 (324) T protein:vir:96 288 MVALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) T ss_pred cEEEEEEEEeccEEecccceEEEecccccCCCCCCCC Confidence 567899999999999999999998332 11111112 No 93 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=100.00 E-value=2.9e-51 Score=297.66 Aligned_cols=386 Identities=12% Similarity=0.075 Sum_probs=232.2 Q ss_pred CChHH----------HHHHHHHHHHHHHHHHHHHHH---HhhchHH-HHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHH Q lcl|NC_012784. 1 MKTKE----------ELQSEISDIKRQIDLKVKYAT---RALNNDE-LEKAEKLEQEITDLRSQIQEKQEE-LDKLKEKD 65 (415) Q Consensus 1 Mk~~~----------el~~~l~~l~~~~~~~~~~~~---~~~~e~~-~~~~~~~~~e~~~l~~~i~~~~~~-~~~~~~~~ 65 (415) ..+.. .-.....+...+-.++..++. +.+..++ ..++-.-...+++.++++...... ........ T Consensus 201 ~~r~~~~~a~~~~~~~~~a~~~~~~~~E~~r~~eI~~l~~~~~~~~~~~~ai~~g~sld~~ra~~ld~l~~~~~a~~~~~ 280 (632) T protein:vir:96 201 ETRGAETGAKNPAPAASGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKP 280 (632) T ss_pred cccchhhcccccchhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhHHHHHhccccHHHHHHHHHHHHhhhhhhhhhhh Confidence 00000 000000000000001111111 0111000 000000011112222111110000 00000000 Q ss_pred hh-hhhcccccccc-chhhhhhHHHHHHHHHH-------------------HHHhhhhhHHHHHHHHHHhhhhhhhhccc Q lcl|NC_012784. 66 GT-SENNQQSVEVN-EARTYRNQANINDLGIS-------------------IQNTKVTSQEVRDFTEYLETRNDIQGGSL 124 (415) Q Consensus 66 ~~-~~~~~~~~~~~-~~~~~~~~~~~~~~~~~-------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 124 (415) .. ........... ................. .........+.+.+.........+..... T Consensus 281 ~a~~~~~~~~~~~~~~i~~~~re~~~~~l~rai~a~a~~~~~~a~~~~e~a~~~a~~~G~~arg~~~~~~~l~~ra~~~~ 360 (632) T protein:vir:96 281 GAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKK 360 (632) T ss_pred hhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHHhhhccchhhhhhhhHHHHHHHHhhhhhhhhhhhhHHHHHHhhhhcc Confidence 00 00000000000 00000000000000000 00000000000111111111223344555 Q ss_pred ccccceeecchhH-HhHHHHHHhhhhhhhhc-ceeEEccCCceeEEEEeecCCcccccccccccccccccccceeeEeee Q lcl|NC_012784. 125 KTDSGFVVIPEEI-VTDILKLKEVEFNLDKY-VTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDI 202 (415) Q Consensus 125 ~~~~~~~~vP~~~-~~~Ii~~~~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~ 202 (415) +.+.||++||+++ ...|++.+++.++++++ ++++++ .++.+.+++.++++.++|++|++.+|++ +++|+++++.+ T Consensus 361 t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l~~~~~~~--~~g~~~ip~~~~~~~a~wv~E~~~~~~s-~~~f~~i~l~~ 437 (632) T protein:vir:96 361 TAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPG--LVGDVDIPKKTSGANFYWIGEDEDVQDS-DFDFTTLSFSP 437 (632) T ss_pred cccccccccccccchHHHHHHHhhcchhhhhcceEeec--CCcceEEEEEeCCceeEeecCCcccccc-ccceeeEEeee Confidence 6778899999876 68899999999998887 455544 4556778888899999999999999874 69999999999 Q ss_pred eeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-ccccccccccccccccccchhhHHHHHHHH Q lcl|NC_012784. 203 NTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGS-TGSTSSGFEKEGKKLEVKKAKSLDDIKDAI 281 (415) Q Consensus 203 ~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 281 (415) +|++++++||+|||+|+.++++++|.++|+++++.++|.++|+|+|++. +.++... ..........+..+++++.++. T Consensus 438 ~k~~~~v~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~-~~~~~~~~~~~~~~~~~i~~~~ 516 (632) T protein:vir:96 438 KTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNM-TGVPALTYPAGGVDWASVVDME 516 (632) T ss_pred eEEEEehhhHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccceeeec-ccccceecccccCCHHHHHHHH Confidence 9999999999999999999999999999999999999999999999654 4333322 2222333344566789999999 Q ss_pred HHhhhhcc--CCCEEEEcHHHHHHHHH--hhccCCcccccCcccCCCCceecceeeEEeccccccccCCceEEEechhhc Q lcl|NC_012784. 282 NLNVKPNY--EHNVAIVSQTMFAKLDK--MKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDA 357 (415) Q Consensus 282 ~~~~~~~~--~~~~~v~~~~~~~~l~~--lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~ 357 (415) .++..++. .+++|+|||.++..|.+ ++|++|+|||.+ ++|+|+||++++++|.+. ++||||+. T Consensus 517 ~~i~~~~~~~~~~~~~~~~~~~~~l~~~~l~d~~G~~i~~~-------~~l~G~pv~~s~~ip~~~-----~~~gd~s~- 583 (632) T protein:vir:96 517 TKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYRAEASNQIPADT-----WIFGDWSQ- 583 (632) T ss_pred HHHhhcccccCccEEEEchhHHHHHHHHhccCCCCceeecC-------CeecccceEeccccccCc-----EEEeecce- Confidence 88887764 46799999998877765 789999999963 489999999999998653 79999997 Q ss_pred EEEEeecceEEEEeeccc---CceEEEEEEEeccEEeccccEEEEEeec Q lcl|NC_012784. 358 IVLFDRSQYQASWTDYMH---FGECLMIAVRQDCRILDYKSAIVIEYDD 403 (415) Q Consensus 358 ~~~~~~~~~~i~~~~~~~---~~~~~~~~~r~d~~v~~p~a~~~~~~t~ 403 (415) +.+++++++++.++++.. +.+.++++.|+|++|.+|++|++++..| T Consensus 584 ~~i~~~~~~~i~~~~~~~~~~~~v~~~~~~~~d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 584 IVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) T ss_pred EEEEEecceEEEEccccccccCceEEEEEeecCceeechhhhhheeecC Confidence 557789999999988765 4567999999999999999999999888 No 94 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=100.00 E-value=6e-52 Score=301.41 Aligned_cols=279 Identities=13% Similarity=0.045 Sum_probs=218.4 Q ss_pred cccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCcccccccccccccccccccceeeEee Q lcl|NC_012784. 122 GSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYD 201 (415) Q Consensus 122 ~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~ 201 (415) ....++++|++||++++++|++.+++.+++++++++++++++..++ |+.++.+.+.|++|++++|++ +++|+++++. T Consensus 1 Mat~tt~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~~~~~--p~~~~~~~a~wv~Eg~~~~~~-~~~f~~v~l~ 77 (311) T protein:vir:99 1 MATFGTGNLKNLPRNIADGMVKDVVQGSTVAVLSARKPQRFGNEDI--ITFNGRPKAEFVGEGQQKSST-TGEFDFVTST 77 (311) T ss_pred CceecCCCceeccHHHHHHHHHHHHhhchhhhhcceeeccCCceEE--EEEeCCceeEEeecCcccccc-cceeeEEEEe Confidence 2234467788999999999999999999999999999988766554 455677889999999999964 6999999999 Q ss_pred eeeEEEeehhhHHHH---hcchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc--cc----ccccccccccchh Q lcl|NC_012784. 202 INTHRGYFRISREAI---EDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSS--GF----EKEGKKLEVKKAK 272 (415) Q Consensus 202 ~~k~a~~~~iS~e~l---~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~--~~----~~~~~~~~~~~~~ 272 (415) ++|++++++||+|++ .|+.++|.+||.++|++++++++|.++|+|+|++.+.+... .. ............. T Consensus 78 ~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~ 157 (311) T protein:vir:99 78 PKKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASKRVELTADTIAN 157 (311) T ss_pred eEEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccceeeccccccch Confidence 999999999999999 47788999999999999999999999999988765433221 11 1111111122223 Q ss_pred hHHHHHHHHHHhhhhc--cCCCEEEEcHHHHHHHHHhhccCCcccccCcccCCCCceecceeeEEecccccc-------- Q lcl|NC_012784. 273 SLDDIKDAINLNVKPN--YEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLG-------- 342 (415) Q Consensus 273 ~~~~~~~~~~~~~~~~--~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~-------- 342 (415) .++++.+++..+...+ +..++|+|||.+|..|++|||++|||+|++.+.++.+++|+|+||++++.+|.. T Consensus 158 ~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~Pv~~s~~i~~~~~~~~~~~ 237 (311) T protein:vir:99 158 PDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTARYTDGRKKFPELGLGIGVSSFEGIDASVSDTVNGGDEADPDDE 237 (311) T ss_pred hHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhhccCCCeeecCcccCCCCceecceeeEeecccccccccccccc Confidence 3455555555554443 445679999999999999999999999999999999999999999999987632 Q ss_pred ---ccCCceEEEechhhcEEEEeecceEEEEeec----------ccCceEEEEEEEeccEEeccccEEEEEeec Q lcl|NC_012784. 343 ---QKGNNTLIIGNLKDAIVLFDRSQYQASWTDY----------MHFGECLMIAVRQDCRILDYKSAIVIEYDD 403 (415) Q Consensus 343 ---~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~----------~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~ 403 (415) ..+...+++|||++++.+..+.+++++.+++ .++...+|++.|+|++|.+|++++..+.+| T Consensus 238 ~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~r~~~r~d~~v~~~~~v~~~~~~A 311 (311) T protein:vir:99 238 DLDAARAVRGIVGDFANGIHWGVQRDIPVELIKYGDPDGQGDLKRHNQIALRLEIVYGWYVFTDRFVVIENAVA 311 (311) T ss_pred hhhccCcceEEEeeccccEEEEEecCceEEEeecCCCCcchhhhhcCcEEEEEEEeecceecChhHeeeecccC Confidence 1234457889999988888899999987654 346678999999999999975444433333 No 95 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=100.00 E-value=3.1e-51 Score=297.52 Aligned_cols=361 Identities=9% Similarity=0.009 Sum_probs=224.6 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccch Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) |.. +|+++++++.+++.+..+..+.. +...++.+.+.+.++.+..++.+... ++..+........ ... T Consensus 1 M~~--kl~~~~~~~~e~~~~l~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~---~~g--- 68 (383) T protein:vir:78 1 MTI--KLKNNLANYEEKRTAFVNAVKNE--DTQEIQNKAYVEMVDAMAADIMEQAK--KEARQEADAYISA---SRT--- 68 (383) T ss_pred Cch--hHHHHHHHHHHHHHHHHHHHhcc--ChHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHh---cCC--- Confidence 552 23333333322222222211111 11111222222223333332221100 0000000000000 000 Q ss_pred hhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEc Q lcl|NC_012784. 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~ 160 (415) .......+.+.+ ......+.++||++||+++.+.|++.++..++++++|++.++ T Consensus 69 -----------------~~~lt~~e~~~~---------~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~l~~~~~v~~~ 122 (383) T protein:vir:78 69 -----------------DKNITNEEIKFF---------NDINKEVGYKEETLLPQTVVDEIFEDLTTEHPFLASIGMRTT 122 (383) T ss_pred -----------------hhhhhHHHHHHH---------HHHhccCCCCCccccCHHHHHHHHHHHHhhccceeeeeeEec Confidence 000001111110 112234677889999999999999999999999999999887 Q ss_pred cCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) Q Consensus 161 ~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d 240 (415) ++. ..+++..+.+.+.|++|+++.++.++++|+++++.+||++++++||+|||+|+.+++++||.++|+++++++++ T Consensus 123 ~~~---~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~i~is~ell~Ds~~~ie~~i~~~l~~~~a~~~~ 199 (383) T protein:vir:78 123 GLR---TKFLKSETSGVAVWGKIFGEIKGQLDATFSDEESIQNKLTAFVVVPKDLEKFGPAWVKRFVVTQIEEAFAVALE 199 (383) T ss_pred CCc---eEEEEEcCCcceEEeecccccccccCcceeeEeecceeeEeeccchHHHhhccHHHHHHHHHHHHHHHHHHHHh Confidence 543 45667778888999999888776678999999999999999999999999999999999999999999999999 Q ss_pred HHHhhccccccccccccccccccccc-------cccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhh---cc Q lcl|NC_012784. 241 KAIIDVITKGSTGSTSSGFEKEGKKL-------EVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMK---DK 310 (415) Q Consensus 241 ~~il~g~g~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lk---d~ 310 (415) .+|++|+|++.|.++........... ...+..+++++......+ .+++.+..|+||..++..+.+++ +. T Consensus 200 ~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l-~~~~~~~~~~~~~~~~~~~~~~~~~~n~ 278 (383) T protein:vir:78 200 SAYIVGDGNDKPIGLNRKVGKGSTVVDGVYAEKAATGTLTFANPKTTVNEL-TDVYKYHSVKENGHPLNVAGKVTLLVNP 278 (383) T ss_pred hheEeccCCCCceeeeeccCCcccccccccccccccchhhhhhhHHHHHHH-HHHHhccchhcccchhhhcCceEEEEcC Confidence 99999999998887764332211111 122233344444444333 34555555555555555555444 11 Q ss_pred CCcccccCcc----cCCCCceeccee--eEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecccC---ceEEE Q lcl|NC_012784. 311 LGNYLIQPDV----KEKTQQRLLGAK--IEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHF---GECLM 381 (415) Q Consensus 311 ~G~~l~~~~~----~~~~~~~l~G~p--V~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~---~~~~~ 381 (415) .+.+.|.+.. .+|.+.+++|+| |+++++||.+ .++||||++ |.+++|++++++.+++.+| ++.+| T Consensus 279 ~~~~~~~~~~~~~~~~G~~~t~l~~~~~iv~s~~~p~~-----~iifgdfs~-Y~i~~r~~~~i~~~~~~~f~~d~~~f~ 352 (383) T protein:vir:78 279 TDAWDVKKQYTSLNANGVYVTALPFNLNIIESLFVPEK-----KAISYVAER-YDALIGGPLDIGTYDQTLAIEDLNLYA 352 (383) T ss_pred cchhhhccchhccCCCCceeeecCCCceEEecCCCCcc-----cEEEeeccc-eEEEecccceEEecchhhhhcCceEEE Confidence 1112222221 234445677776 5667778754 379999998 7889999999999887654 56899 Q ss_pred EEEEeccEEeccccEEEEEeec---CCCCcc Q lcl|NC_012784. 382 IAVRQDCRILDYKSAIVIEYDD---SERGEG 409 (415) Q Consensus 382 ~~~r~d~~v~~p~a~~~~~~t~---~~~~~~ 409 (415) +..|+||++++|+||++++++- .++++| T Consensus 353 ~~~r~dG~~~~~~A~~vl~~~~~~~~~~~~~ 383 (383) T protein:vir:78 353 AKQFAYGKAKDDKAAAVWTLNINPAEQTPEG 383 (383) T ss_pred EEEEEcCEEecCCeEEEEEEEecCCCCCCCC Confidence 9999999999999999988873 445555 No 96 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=100.00 E-value=9e-52 Score=300.43 Aligned_cols=280 Identities=12% Similarity=0.062 Sum_probs=220.0 Q ss_pred hcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCcccccccccccccc----cccccce Q lcl|NC_012784. 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPE----LAVKPFF 196 (415) Q Consensus 121 ~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~----~~~~~f~ 196 (415) +...+.+++|++||++++++|++.+++.++++++++++++.+++.. +++....+.+.|++|++..++ .+.++|+ T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~~~--~p~~~~~~~a~wv~E~~~~~~~~~~~s~~~f~ 78 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTH--LPVLATLPEADWVGESATDPKGVKPTSKVTWA 78 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccCCcEE--EEEEeCCcceEEeeccccccccccccccccee Confidence 6667888889999999999999999999999999999998766544 555667889999999986543 3578999 Q ss_pred eeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccc----ccccccccchh Q lcl|NC_012784. 197 QLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEK----EGKKLEVKKAK 272 (415) Q Consensus 197 ~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~----~~~~~~~~~~~ 272 (415) ++++++||++++++||+|+++|+.+++++||.++|++++++++|.+|++|+|++.+......... ........... T Consensus 79 ~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (305) T protein:vir:25 79 NRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVA 158 (305) T ss_pred eEEeeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhheeccCCCCCccccccccccccccccccccccch Confidence 99999999999999999999999999999999999999999999999999987654332211111 11112222233 Q ss_pred hHHHH----HHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCcccCCCCceecceeeEEeccccccccCCce Q lcl|NC_012784. 273 SLDDI----KDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNT 348 (415) Q Consensus 273 ~~~~~----~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~ 348 (415) .++++ ..+.......++..+.|+|||.++..|+++||++|||+|++ ++|+|+||++++.+|.. .++.. T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~-------~~l~G~Pv~~~~~~~~~-~~~~~ 230 (305) T protein:vir:25 159 NESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFNRNGAWD-ADAAI 230 (305) T ss_pred hhhHHHHHHHHHHHhhhhcccccceeEecHHHHHHHHHhhccCCceeecC-------CcccccceEEcCccCCC-CCccE Confidence 33333 34444445556777789999999999999999999999964 48999999999998753 34567 Q ss_pred EEEechhhcEEEEeecceEEEEeecc-------------cCceEEEEEEEeccEEeccccEEEEEeecCCCCcccccccC Q lcl|NC_012784. 349 LIIGNLKDAIVLFDRSQYQASWTDYM-------------HFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) Q Consensus 349 ~~~gd~~~~~~~~~~~~~~i~~~~~~-------------~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~~~~~~~~~~~~ 415 (415) ++||||++ +.++++++++++.+++. ++...+|++.|+|+.+.+|+||++++.++.+. +.-.| T Consensus 231 ~~~gd~s~-~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~~~----~~pa~ 305 (305) T protein:vir:25 231 EVIADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAV----VAPAA 305 (305) T ss_pred EEEEecce-EEEEEecCeEEEEeeeeeeecCCceeeeeecCcEEEEEEEeecceeeCcccEEEEccccccc----cCCCC Confidence 89999998 56788999999887653 33456899999999999999999999764221 11111 No 97 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=100.00 E-value=1.8e-51 Score=298.76 Aligned_cols=299 Identities=13% Similarity=0.040 Sum_probs=225.0 Q ss_pred hhhhHHHHHHHHHHhhhhhh-hhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeec----- Q lcl|NC_012784. 100 KVTSQEVRDFTEYLETRNDI-QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQS----- 173 (415) Q Consensus 100 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~----- 173 (415) +.. ...++..... .......+.++.+||++++++|++.+++.++++++|+++++++++..+|+.... T Consensus 1 ~~~-------~~e~~~~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~ 73 (338) T protein:vir:78 1 MAT-------LNELAPNTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENIPISYGETIIPTTVKRPEVGQ 73 (338) T ss_pred Ccc-------hHHhhhhhcccccccceecccccccchHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcccee Confidence 000 0011111110 111122334566999999999999999999999999999998887777765432 Q ss_pred -CCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Q lcl|NC_012784. 174 -EVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGST 252 (415) Q Consensus 174 -~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~ 252 (415) +...+.|++|++.+|+. .++|++++++++|++++++||+|+++|+.+++++||.++|++++++++|.+||+|+|++.+ T Consensus 74 v~~~~~~~~~Eg~~~~~~-~~~f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~ 152 (338) T protein:vir:78 74 VGVGTSNEQREGGTKPLS-GTAWDTRSVAPIKLATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTG 152 (338) T ss_pred eccccccccccccccccc-ccceeEEEEEEEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcc Confidence 23456788999999864 6899999999999999999999999999999999999999999999999999999997654 Q ss_pred cccc---ccccc----ccccccccchhhHHHHHHHHHHhhh-hccCCCEEEEcHHHHHHHH---HhhccCCcccccCccc Q lcl|NC_012784. 253 GSTS---SGFEK----EGKKLEVKKAKSLDDIKDAINLNVK-PNYEHNVAIVSQTMFAKLD---KMKDKLGNYLIQPDVK 321 (415) Q Consensus 253 ~~~~---~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~v~~~~~~~~l~---~lkd~~G~~l~~~~~~ 321 (415) .... ..... ............++++.++...+.. ..+..++|+|||.++..|+ +++|++|+|+|.+.+. T Consensus 153 ~~~~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~~~~l~d~~g~~l~~~~~~ 232 (338) T protein:vir:78 153 SALQGIDTNNVIVNTTNVDYLQTGTTPLLDRFLDGYDLVSANTDVDFNGWAADPRYRARLLRSQAYRDANGNVDPTRINL 232 (338) T ss_pred ccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccccceEEEEchHHHHHHHHHhhhccCCCceeeccccc Confidence 3221 11011 1111122234457788888777654 4556778999999988774 5789999999998889 Q ss_pred CCCCceecceeeEEecccccc----ccCCceEEEechhhcEEEEeecceEEEEeecc-----------------cCceEE Q lcl|NC_012784. 322 EKTQQRLLGAKIEILPDEVLG----QKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM-----------------HFGECL 380 (415) Q Consensus 322 ~~~~~~l~G~pV~~~~~~~~~----~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-----------------~~~~~~ 380 (415) ++.+++|+|+||++++++|.. ......++||||+. +.++++++++++++++. ++.+.+ T Consensus 233 ~~~~~~l~G~PV~~~~~ip~~~~~~~~~~~~~~~gdfs~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 311 (338) T protein:vir:78 233 AASAGDLLGLPVQFGKAVGGDLGAATDSKVRVVGGDFSQ-LKYGFADEIRVKMSDTATLTDNTSPTPQTVSMWQTNQIAI 311 (338) T ss_pred CCCCceeeeeeEEEccccCccccccCCcccEEEEEecce-EEEEeecccEEEEeecccccccccccccchhhhhcCcEEE Confidence 999999999999999999853 22345689999987 66788999999988753 345678 Q ss_pred EEEEEeccEEeccccEEEEEeecCCCCcc Q lcl|NC_012784. 381 MIAVRQDCRILDYKSAIVIEYDDSERGEG 409 (415) Q Consensus 381 ~~~~r~d~~v~~p~a~~~~~~t~~~~~~~ 409 (415) |++.|+|++|++|+||++++-.++ +.+ T Consensus 312 r~~~r~d~~v~~~~a~~~l~~~~~--~~~ 338 (338) T protein:vir:78 312 LIEVTFGWLLGDKQAFVKFVDDED--PDA 338 (338) T ss_pred EEEEEeccEeecccceEEEecccC--CCC Confidence 999999999999999999775433 333 No 98 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=100.00 E-value=2.8e-51 Score=297.72 Aligned_cols=295 Identities=13% Similarity=0.018 Sum_probs=225.5 Q ss_pred HHHHHHHhhhhh-hhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCccccccccc- Q lcl|NC_012784. 107 RDFTEYLETRND-IQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEEL- 184 (415) Q Consensus 107 ~~~~~~~~~~~~-~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg- 184 (415) -.....++.... ..........++.++|+++.++|++.+++.+++++++++++++++... +++..+.+.+.|++|| T Consensus 1 ~a~l~el~~~~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~~~~~--~p~~~~~~~a~~v~eg~ 78 (333) T protein:vir:78 1 MATLNELLPNSAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIPISYGETI--IPTTVKRPEVGQVGVGT 78 (333) T ss_pred CchhHHhhhhcccccccCceecCCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceE--EEEEeCCceeEeecCcc Confidence 000000100000 011112223344589999999999999999999999999998766544 5556777777777766 Q ss_pred -------ccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Q lcl|NC_012784. 185 -------EENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSS 257 (415) Q Consensus 185 -------~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~ 257 (415) +.+|+ +.++|+++++.++|++++++||+|+++|+.+++++||+++|++++++++|.+||+|+|++.+..... T Consensus 79 ~~~~~e~~~~~~-~~~~f~~i~l~~~kl~~~~~is~ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~~~~~g 157 (333) T protein:vir:78 79 SNEQREGGLKPL-SGTAWDTRSVSPIKLATIVTVSEEFARMNPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTGSALQG 157 (333) T ss_pred cccccccccccc-cccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCCccccc Confidence 44554 5789999999999999999999999999999999999999999999999999999999876533221 Q ss_pred cc-------ccccccccccchhhHHHHHHHHHHhhhh-ccCCCEEEEcHHHHHHHHH---hhccCCcccccCcccCCCCc Q lcl|NC_012784. 258 GF-------EKEGKKLEVKKAKSLDDIKDAINLNVKP-NYEHNVAIVSQTMFAKLDK---MKDKLGNYLIQPDVKEKTQQ 326 (415) Q Consensus 258 ~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~v~~~~~~~~l~~---lkd~~G~~l~~~~~~~~~~~ 326 (415) .. ..........+...++++++++..+... ++.+++|+|||.+|..|++ ++|++|+|+|.+.+..+.++ T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i~~~~~~~~~~~ 237 (333) T protein:vir:78 158 IDTDNVIANTTNVDYLQETGDPLLDRLLDGYDLVSANTDVEFNGWAVDPRFRAHLLRAQAYRDANGNVDPSRINLAAQTG 237 (333) T ss_pred ccccccccccccccccccccchhHHHHHHHHHhhccccccCceEEEEcchHHHHHHHHhhhcCCCCceeecCccccCCCc Confidence 11 1111222333455688999998887655 4556789999999987765 78999999999888888999 Q ss_pred eecceeeEEecccccc----ccCCceEEEechhhcEEEEeecceEEEEeecc--------------cCceEEEEEEEecc Q lcl|NC_012784. 327 RLLGAKIEILPDEVLG----QKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM--------------HFGECLMIAVRQDC 388 (415) Q Consensus 327 ~l~G~pV~~~~~~~~~----~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~--------------~~~~~~~~~~r~d~ 388 (415) +|+|+||++++++|.. ..+...+++|||++ +.++++++++++.+++. ++...+|+++|+|+ T Consensus 238 ~l~G~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~~-~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~ 316 (333) T protein:vir:78 238 DVLGLPAQFGRAVGGDLGAAVDSKTRIIGGDFSQ-LKFGFADEIRIKMSDTATLTDSGSATVSMWQTNQIAILIEVTFGW 316 (333) T ss_pred eeeceeeEEccccCCCccccCCCccEEEEEeccc-EEEEEeeccEEEEeccccccccccceeehhhcCcEEEEEEEEEcc Confidence 9999999999999854 23345789999998 55778999999987753 34566899999999 Q ss_pred EEeccccEEEEEeecCC Q lcl|NC_012784. 389 RILDYKSAIVIEYDDSE 405 (415) Q Consensus 389 ~v~~p~a~~~~~~t~~~ 405 (415) ++.+|+||++++..++| T Consensus 317 ~v~~~~a~~~l~~~~a~ 333 (333) T protein:vir:78 317 LLGDKQAFVKFVDDEQP 333 (333) T ss_pred EEecccceEEEeccCCC Confidence 99999999999888777 No 99 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=100.00 E-value=2e-44 Score=260.15 Aligned_cols=379 Identities=10% Similarity=0.049 Sum_probs=215.7 Q ss_pred CChHHHHH----HHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhhhhh-cccc Q lcl|NC_012784. 1 MKTKEELQ----SEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQE-ELDKLKEKDGTSEN-NQQS 74 (415) Q Consensus 1 Mk~~~el~----~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~-~~~~~~~~~~~~~~-~~~~ 74 (415) .-++..++ .+..++++... ..++.. +..++.+++..+++++.+++..... ..+.+......... .... T Consensus 127 ~a~I~~vke~~~~e~~~~~~~~a-~~ee~~-----e~~~k~~el~a~l~~~~~~~~~~~~e~~~~l~a~~~~~~~~~~~~ 200 (517) T protein:vir:97 127 NAVVTYFREEKKKEENKMTFDQN-LMQELL-----DAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKIL 200 (517) T ss_pred hhhhhhhhhhhhhhhhhhhhhhh-hhhhhh-----hhhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHhhhhc Confidence 11111111 11111111100 000000 0011122222222222222211110 11111000000000 0000 Q ss_pred ccccchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHH-HhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhh Q lcl|NC_012784. 75 VEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEY-LETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDK 153 (415) Q Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~ 153 (415) ...................... ....... ................+++.+|..+...|...+...++++. T Consensus 201 ~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~ 271 (517) T protein:vir:97 201 GVEALKVTPEATEFLKTREAEV---------AYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLP 271 (517) T ss_pred ccccccccchhhHHHHHHHHHH---------HHHHhcccccccceeeeecccccccccccchHHHHHHHHhhhhhcccee Confidence 0000000000000000000000 0000000 00000111112233456888999999999999999998888 Q ss_pred cceeEEccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHH----HHHHHHH Q lcl|NC_012784. 154 YVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVN----VLQELKL 229 (415) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~----l~~~l~~ 229 (415) ++++.+++. ..++.......+.|+.||+.+|+ ++++|+.+++.++++++++++|+++++|+.+| |++||.+ T Consensus 272 ~~~~~~i~~----~~~~~~~~~~~a~~~~eG~~kp~-s~~tf~~~~~~~~~ia~~~~~S~qll~Ds~~dd~~~l~s~i~~ 346 (517) T protein:vir:97 272 FIRHENLPT----LVVGGDNALTQGTGHTTGTDKTE-SNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMN 346 (517) T ss_pred eeeeccccc----eeeecccccceeeeeecCCcccc-cccceeeEEeeHhhhhhhhhhhHHHHHHhhhccHHHHHHHHHH Confidence 877654432 22344455567789999999986 46899999999999999999999999998887 9999999 Q ss_pred HHHHHHHHHHHHHHhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhc Q lcl|NC_012784. 230 WMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKD 309 (415) Q Consensus 230 ~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd 309 (415) +|+++++++++.+||+|+|++.+..................+...++++..+...... ..+++|+|||.+|.+|++||| T Consensus 347 ~l~~~l~~~ee~a~l~GdGtg~~~~gi~~~a~~~~~~~~~~~~~~~d~i~~l~~a~~~-a~~a~~vmn~~t~~~I~klKD 425 (517) T protein:vir:97 347 RLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVATPK-AADSTLVIHRNDLAAIRFLKD 425 (517) T ss_pred HHHHHHHHHHHHHHhcccCCCcccccccccccccccccccccchHHHHHHHHHHHhhh-ccCCEEEECHHHHHHHHHhhc Confidence 9999999999999999999876544333332222222333344444444443332221 247899999999999999999 Q ss_pred cCCcccccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEE-EEeecccCceEEEEEEEecc Q lcl|NC_012784. 310 KLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQA-SWTDYMHFGECLMIAVRQDC 388 (415) Q Consensus 310 ~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i-~~~~~~~~~~~~~~~~r~d~ 388 (415) ++|||||++...++.+.+++|.. ..+|....+...++ +++. |.++++.++++ +..+..+++..|+..+|.+| T Consensus 426 ~~G~Yl~~~~~~~~~~~~l~G~~----~~~~~~~~~~~~~~--~~~~-y~i~~~~g~~~~~~fd~~~n~~~f~~~~~~~g 498 (517) T protein:vir:97 426 KNGNYVFPVGVSNQTIATHFGFN----RLVQSVAVDEKTAV--SLSG-YVTNGSRGMEFEQGTILVENNKEYLFEMPISG 498 (517) T ss_pred CCCCeeccCcCCcccccccCCcc----ccccccccCceeEe--eccc-cEEEeecceeeeeeeecccCceeEeeeeeecc Confidence 99999999888888889999942 22333333333333 3444 55666677654 33344567888999999999 Q ss_pred EEeccccEEEEEeecCCCC Q lcl|NC_012784. 389 RILDYKSAIVIEYDDSERG 407 (415) Q Consensus 389 ~v~~p~a~~~~~~t~~~~~ 407 (415) .|..|++|+++.|+|+.+| T Consensus 499 ~i~~~~r~a~~~~~p~~~~ 517 (517) T protein:vir:97 499 SLEYKGTTAYGTYTPPVAG 517 (517) T ss_pred ccccccceEEEEEcCCCCC Confidence 9999999999999999999 No 100 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=100.00 E-value=1.8e-41 Score=244.02 Aligned_cols=296 Identities=9% Similarity=-0.048 Sum_probs=224.4 Q ss_pred hhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEc-cCCceeEEEEeecC--Cccc Q lcl|NC_012784. 102 TSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV-TNGSGKYPVVRQSE--VAAL 178 (415) Q Consensus 102 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~-~~~~~~~~~~~~~~--~~~a 178 (415) ... .+.... .....+.+..+||+++|.++. ++++.+++.++++++++++++ .+.+..++...... .+.. T Consensus 1 ~~~-----~~~~~~--~~k~it~~d~~gG~L~P~~~~-~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g~~~~~~~ 72 (314) T protein:vir:41 1 MDF-----LNKPFQ--ITPKIDVPDLGKGILAVQRFG-EFVREVRENSAIIKDARVLNALKSYEVDISRISLGVELEPGR 72 (314) T ss_pred Cch-----hhhHHH--hhcccccccCCCceeChHHHH-HHHHHHHhccchhhheeeecccCccceeecccccCccccccc Confidence 000 111111 111123456678999998875 699999999999999998754 44444444332111 1344 Q ss_pred ccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchH--HHHHHHHHHHHHHHHHHHHHHHhhccccccc---- Q lcl|NC_012784. 179 EKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKV--NVLQELKLWMARTIAATRNKAIIDVITKGST---- 252 (415) Q Consensus 179 ~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~--~l~~~l~~~la~~~~~~~d~~il~g~g~~~~---- 252 (415) .|.+|..+.++ +.++|+++++.+||+...++||+|+|+|+.. ||+++|...+++++++.++..+++|+|+..+ T Consensus 73 ~~~~~~~~~~~-~~~tf~~~~l~~~kl~~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~~~~ 151 (314) T protein:vir:41 73 NTSGTKVAPTA-DEVTVSTNTLEMKELVTKVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSLTTGREL 151 (314) T ss_pred ccccCCccCCc-ccccccceeeeeEEEEEeecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCCcCcccc Confidence 56677776664 5799999999999999999999999999975 9999999999999999999999999986432 Q ss_pred ----cccccccccccc-cccccchhhHHHHHHHHHHhhhhccC---CCEEEEcHHHHHHHHHhhccCCcccccCcccCCC Q lcl|NC_012784. 253 ----GSTSSGFEKEGK-KLEVKKAKSLDDIKDAINLNVKPNYE---HNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKT 324 (415) Q Consensus 253 ----~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~ 324 (415) .|.......... ....+...+.+.+.+++..+.+.|++ +.+|+||+.++.+++++++.+|+|+|.+....+. T Consensus 152 ~~~p~G~l~~a~~~~~~~~~~~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~l~~~~~~l~~~~~~~~~ 231 (314) T protein:vir:41 152 YRINDGWMKLAGNQYTDAEPEDENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQLLVRETGLGDSALIGAT 231 (314) T ss_pred hhcchhhhhhcccceeecCccccccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHHHhccCCcccchhhhCCC Confidence 222211111111 11222334456677899999998875 4589999999999999999999999999889999 Q ss_pred CceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecccC-ceEEEEEEEeccEEeccccEEEEEeec Q lcl|NC_012784. 325 QQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHF-GECLMIAVRQDCRILDYKSAIVIEYDD 403 (415) Q Consensus 325 ~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~-~~~~~~~~r~d~~v~~p~a~~~~~~t~ 403 (415) +.+|+|+||+.++.||...+++.+++||||+++ +...+..+++..+.+... +..+.+..|+|+.+..++|.|+..+.. T Consensus 232 ~~~l~G~PV~~~~~~~~~~~~~~~i~fgd~~nl-v~~~~~~ir~~~~~~a~~~~~~~~~~~r~d~~~~~~~aa~~~~~~~ 310 (314) T protein:vir:41 232 GLQYDGIPIQYVPALDALGDDKARALLTVPTNL-VYGFWRNIRIEPKRDAAMRRTEYIASLRADCNYEDENAAVAAVIDM 310 (314) T ss_pred CceecceeeEecccccccCCCCceEEEechhhe-EEEeeceeEEeecccCcCCeEEEEEEEEeceEEEEcCcEEEEEeec Confidence 999999999999999998888999999999985 456677788877766655 456788899999999999999888888 Q ss_pred CCCC Q lcl|NC_012784. 404 SERG 407 (415) Q Consensus 404 ~~~~ 407 (415) +..| T Consensus 311 ~~~~ 314 (314) T protein:vir:41 311 SSGG 314 (314) T ss_pred cCCC Confidence 7777 No 101 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=100.00 E-value=1.1e-41 Score=245.06 Aligned_cols=295 Identities=9% Similarity=-0.051 Sum_probs=214.2 Q ss_pred HHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEc-cCCceeEEEEe- Q lcl|NC_012784. 94 ISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV-TNGSGKYPVVR- 171 (415) Q Consensus 94 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~-~~~~~~~~~~~- 171 (415) ....... +..+........+.+..+||+++|.... .+++.+.+.+++++.+++++. .+..+.++... T Consensus 1 ~~~~~~~----------~~~~~~~~~k~~t~~d~~Gg~l~P~~~~-~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~ 69 (315) T protein:vir:41 1 MLTIEDI----------RGGKPFEIVPKIDVPDLGRGVLSVDRFG-EFVKAVRDSAVIIPEARIDNALKSYEKDISRLSL 69 (315) T ss_pred Ccccchh----------hcCChhhhhhhcCCcCCCCceechHHHH-HHHHHHHhhhhhhhhceeeecccccccccccccc Confidence 0000000 0111111111223456688999998765 688999999999999998653 33333322211 Q ss_pred -ecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchH--HHHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_012784. 172 -QSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKV--NVLQELKLWMARTIAATRNKAIIDVIT 248 (415) Q Consensus 172 -~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~--~l~~~l~~~la~~~~~~~d~~il~g~g 248 (415) ....+...|.+|....++ +.++|+++.+.++++...+.||+++|+|+.. ||++||..++++++++.++.++++|+| T Consensus 70 ~~~~~~g~~~~~~~~~~~~-~~~~f~~~~l~~~~l~~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nGdg 148 (315) T protein:vir:41 70 VLDVGPGRDETGQKLAPPE-STAEVKTNTLYMREMVTKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYLHGDT 148 (315) T ss_pred CcccccccccccCcCCCCC-CccccceeeeceeeeeeeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhhccCC Confidence 111123457788777664 5799999999999999999999999999864 999999999999999999999999988 Q ss_pred cccc------cccccccccccc---cccccchhhHHHHHHHHHHhhhhccC---CCEEEEcHHHHHHHHHhhccCCcccc Q lcl|NC_012784. 249 KGST------GSTSSGFEKEGK---KLEVKKAKSLDDIKDAINLNVKPNYE---HNVAIVSQTMFAKLDKMKDKLGNYLI 316 (415) Q Consensus 249 ~~~~------~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~v~~~~~~~~l~~lkd~~G~~l~ 316 (415) +... .+.+........ ........+.+.+.++++.+...|++ +++|+||+.++.++++++|++|+|+| T Consensus 149 ~s~~p~~~~~~G~l~~a~~~~~~~~~~~~a~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~rklk~~~g~~lw 228 (315) T protein:vir:41 149 SSSDPLLRMSDGWLKLASEKLTESDVDPEAEDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYRDALKGRETGLG 228 (315) T ss_pred cCcCccccccccceecccccccccccccccccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHHHHhccCCCccc Confidence 6422 222221111111 11112233457788899999998874 56899999999999999999999999 Q ss_pred cCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecccC-ceEEEEEEEeccEEecccc Q lcl|NC_012784. 317 QPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHF-GECLMIAVRQDCRILDYKS 395 (415) Q Consensus 317 ~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~-~~~~~~~~r~d~~v~~p~a 395 (415) ++....+.+.+|+|+||+.++.||....++..++||||+++ .+..+.+++++.+.+... ...+....|+|+...++++ T Consensus 229 ~~~~~~g~~~tl~G~PV~~~~~m~~~~~~~~~ilf~d~~nl-~~~~~~~i~i~~~~~a~~~~~~~~~~~r~d~~~~~~~~ 307 (315) T protein:vir:41 229 DQALTGANSILYDGRPVQYVPALEALNDGKSRALFVVPTQL-VYGFWRNIKVVPDYDAEMRLTKYVASLRTDNHYEDEEG 307 (315) T ss_pred cchhhcCCCceecccceEecccccccCCCCccEEEecccce-EEEeccccEEEeeecCCCCceEEEEEEEeceeEEeccc Confidence 99999999999999999999999988888889999999984 556788899987776544 3446777899998887765 Q ss_pred --EEEEEe Q lcl|NC_012784. 396 --AIVIEY 401 (415) Q Consensus 396 --~~~~~~ 401 (415) +..+++ T Consensus 308 ~a~~~~~v 315 (315) T protein:vir:41 308 AVSATITV 315 (315) T ss_pred eeEeeeeC Confidence 444454 No 102 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=100.00 E-value=8.8e-38 Score=223.74 Aligned_cols=358 Identities=11% Similarity=0.053 Sum_probs=179.2 Q ss_pred CChHHH---HHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccc Q lcl|NC_012784. 1 MKTKEE---LQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEV 77 (415) Q Consensus 1 Mk~~~e---l~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (415) +..--. +++......+... ..+..+. ..+..+......++.+++++++..+++.+........ ... T Consensus 111 a~~~a~v~~vks~~~~~e~~~~--~~e~~e~-----~~e~~e~~~~~~el~akl~el~k~~ee~k~~~~~~~~---~~~- 179 (480) T protein:vir:40 111 SNKGAKVTKVREENKGEQEQMG--ANETQEI-----MKQAIEAGVKVRELEAKVEELNKEREELKKEREASIP---SEK- 179 (480) T ss_pred cchhhhhhhhhhhhhhhhhhhh--hHHHHHH-----HHhhhhhhhhhhhHHHHHHHHHhHHHHHhhhhhhhcc---ccc- Confidence 111111 1110000000000 0000000 0000011111222222222222222211111111000 000 Q ss_pred cchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhccee Q lcl|NC_012784. 78 NEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTV 157 (415) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~ 157 (415) ....... ......... .......+ ++....... ......++ .+|..+...+........++...+.. T Consensus 180 --~~~~~~~-----e~r~~~~~~-~~~~e~~~---~~~~~~~~~-~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 246 (480) T protein:vir:40 180 --PEDAERK-----FMRELGSKM-AEMPEQGF---LREFANGAD-LNVVNSLG-SITSKYARKSGIYDGAMKARFQGLTL 246 (480) T ss_pred --hhhhhhH-----HHHHHHHHh-ccchhhhh---hhhhhhhcc-cccccccc-ccccchhhheeechhhhhhhhhccee Confidence 0000000 000000000 00000001 111111111 11122233 34444444443333333333332221 Q ss_pred EEccCCceeEEEEeecCCccccccccccccccccc-ccceeeEee---eeeEEEeehhhHHHHhcchHHHHHHHHHHHHH Q lcl|NC_012784. 158 KRVTNGSGKYPVVRQSEVAALEKVEELEENPELAV-KPFFQLAYD---INTHRGYFRISREAIEDAKVNVLQELKLWMAR 233 (415) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~-~~f~~v~~~---~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~ 233 (415) . ..+.....|++|+...+.+.. .++....+. .++++++..+|.++|+|+. +|++||.++|++ T Consensus 247 ~-------------~~g~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~~~k~t~~lLDDa~-~l~~~i~~~l~~ 312 (480) T protein:vir:40 247 A-------------EDGVDDTFISGTFKAGTDKNKSQTATKRSLRPQMAEAYLQMDKATVRGVNDSG-ALSEYVMSEMVN 312 (480) T ss_pred e-------------eccccceeeeeeeecccccccccccccchhhHHHHHHHHHhHHHHHHHhhhhH-HHHHHHHHHHHH Confidence 1 112233456666654433222 223344444 4788888999999999997 799999999999 Q ss_pred HHHHHHHHHHhhccccccccccccccccccccccccchhhHHHHH-HHHHHhhhhccCCC-EEEEcHHHHHHHHHhhccC Q lcl|NC_012784. 234 TIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIK-DAINLNVKPNYEHN-VAIVSQTMFAKLDKMKDKL 311 (415) Q Consensus 234 ~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~-~~v~~~~~~~~l~~lkd~~ 311 (415) +++++++.+||+|+|+|.+............ +...+.++++ .+++.+..+|+.++ .|||||.+|++|++|||++ T Consensus 313 ~~~~~ee~a~l~G~g~g~~~~~g~~~~~~~~----~~~~~~~d~id~L~~al~~~y~~~a~~~vmn~~t~~~I~klKD~~ 388 (480) T protein:vir:40 313 RVIQKVEYNMILGSVDGSNGFYGLKTATDGW----TKQIEYTDLFEGITDAVAECSISDAITIVMSPQTFAELRKAKGTD 388 (480) T ss_pred HHHHHHHHHhhccCCCCccccccceeecccc----cccchhHHHHHHHHHhhhHHhhCCCCEEEECHHHHHHHHHhhcCC Confidence 9999999999999776654333222111111 1122233444 57778888888888 6999999999999999999 Q ss_pred CcccccCcccCCCCceecceeeEEeccc-cccccCCceEEEechhhcEEEEeecceEEEEeecc--cCceEEEEEEEecc Q lcl|NC_012784. 312 GNYLIQPDVKEKTQQRLLGAKIEILPDE-VLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM--HFGECLMIAVRQDC 388 (415) Q Consensus 312 G~~l~~~~~~~~~~~~l~G~pV~~~~~~-~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~--~~~~~~~~~~r~d~ 388 (415) |||||+|+++.+.+.+|+|+||++++.+ |.. ...+|.++.++.+++++ ++. +.++. .+...+....|++| T Consensus 389 G~Yi~q~~~~~~~~~~llG~pvv~~~~~~~~~-----~~~~~~~~~~~~~~d~~-~~~-~~~~~~~~~~~~~~~e~~v~g 461 (480) T protein:vir:40 389 GHSRFNELATKEQIAQSFGAVNLETRVWMPKD-----EVAVYNHDEYVLIGDLN-VEN-YNDFDLRYNVEQWLSETLVGG 461 (480) T ss_pred CCeeccCcccccCcceecccceeeeeccccCC-----cceeeeCCccEEEEecc-cce-ecccccccchhhhhhhhhhce Confidence 9999999999999999999998877533 321 13455666677777764 332 22222 33445677789999 Q ss_pred EEeccccEEEEEeecCCCCcccccc Q lcl|NC_012784. 389 RILDYKSAIVIEYDDSERGEGDLGL 413 (415) Q Consensus 389 ~v~~p~a~~~~~~t~~~~~~~~~~~ 413 (415) .+.+|+++.+++...+. +. T Consensus 462 ~~~~~~~~~~~~~~~~~------~~ 480 (480) T protein:vir:40 462 SIRGKNRSAYLKKKGSL------GV 480 (480) T ss_pred eeEccccEEEEEeccCc------CC Confidence 99999999999865433 33 No 103 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=100.00 E-value=1.9e-35 Score=210.99 Aligned_cols=301 Identities=10% Similarity=-0.020 Sum_probs=212.9 Q ss_pred HHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCccccccc- Q lcl|NC_012784. 104 QEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVE- 182 (415) Q Consensus 104 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~- 182 (415) ...+.+...++........+.+..++|+.||+++...|++.+.+.+++++.++++++...++.++. ...++...|++ T Consensus 1 ~~~k~~~~~l~~~~~~~~~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~~~~i~~--~~~~~~~~~~~~ 78 (321) T protein:vir:31 1 MASRTINNDLSRITEKNALTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGAKKTRIPT--LNIGERHRRPQD 78 (321) T ss_pred CchHHHHHHHHHHHHhccccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccCcceeeee--eccCCccccccc Confidence 112222222222333334444666778899999999999999999999999999999887776554 33444556665 Q ss_pred ccccccccccccceeeEeeeeeEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc--cc Q lcl|NC_012784. 183 ELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK--VNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTS--SG 258 (415) Q Consensus 183 Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~--~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~--~~ 258 (415) |+......+.++|+++++.++++...++||+|+|+|+. ++|+++|.+.++++++..++..+++|+|.+.+.... .| T Consensus 79 e~~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~~~~~~n~G 158 (321) T protein:vir:31 79 EGEWNENESDVSTGTIDISTEKATVAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAANGDEDAEDSFENQNDG 158 (321) T ss_pred ccccccccccceeeeeeeeeEEEEeehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheeeccccCCCcccccchh Confidence 44332335678999999999999999999999999975 589999999999999999999999999887664211 11 Q ss_pred cc----c-ccccccccchhhHHHHHHHHHHhhhhccC--CCEEEEcHHHHHHHHH-hhccCCcccccCcccCCCCceecc Q lcl|NC_012784. 259 FE----K-EGKKLEVKKAKSLDDIKDAINLNVKPNYE--HNVAIVSQTMFAKLDK-MKDKLGNYLIQPDVKEKTQQRLLG 330 (415) Q Consensus 259 ~~----~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~v~~~~~~~~l~~-lkd~~G~~l~~~~~~~~~~~~l~G 330 (415) .. . ...........+++.+.+++..+...|+. +.+|+||+.++..++. ++|. +.|+|.+.+.++.+.+|+| T Consensus 159 ~l~~a~~~~~~~~~~~~~~~~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~l~~~-~~~~~~~~l~~~~~~tl~G 237 (321) T protein:vir:31 159 FITVAEGDVETIDAADDILDNDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYHYTLTDR-DTPLGDNVIMGEADVNPFS 237 (321) T ss_pred hhhhhccccccccccccccCHHHHHHHHHhccHhHhcCCCeEEEechHHHHHHHHHHhcC-CCccccchhhccccccccc Confidence 11 1 11112223445678888999988888874 4589999999988775 5665 5588888888888889999 Q ss_pred eeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecccC----ceEEEEE--EEeccEEeccccEEEEE-eec Q lcl|NC_012784. 331 AKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHF----GECLMIA--VRQDCRILDYKSAIVIE-YDD 403 (415) Q Consensus 331 ~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~----~~~~~~~--~r~d~~v~~p~a~~~~~-~t~ 403 (415) +||+.+++||.. .++++||++++. +.+.++++........ ...++.+ .++|+.|.++++++.++ +.. T Consensus 238 ~pvv~~~~mP~~-----~il~t~~~nl~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ve~~~a~a~~~~i~~ 311 (321) T protein:vir:31 238 FPIIGSGLWPDD-----KAMFTDPQNLIY-ALYRDLEIDVLTESDKVSERDLHARYFMRGDDDFAIENTEAVVLAEGLGD 311 (321) T ss_pred eeEEEcCCCCCC-----cEEEeccccEEE-EEeeccEEEEeecCccccccceeeEeeeeeecceeEeccccEEEEecCCc Confidence 999999999965 389999998644 4456777765543322 1223433 46888999999999988 222 Q ss_pred CCCCcccccccC Q lcl|NC_012784. 404 SERGEGDLGLEA 415 (415) Q Consensus 404 ~~~~~~~~~~~~ 415 (415) +....-+- ++ T Consensus 312 ~~~~~~~~--~~ 321 (321) T protein:vir:31 312 PLEHLEEE--TS 321 (321) T ss_pred chhcccCC--CC Confidence 11111111 11 No 104 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=99.96 E-value=2.4e-31 Score=188.46 Aligned_cols=266 Identities=13% Similarity=0.094 Sum_probs=205.2 Q ss_pred hcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEc-cC-CceeEEEEeecCCcccccccccccccccccccceee Q lcl|NC_012784. 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV-TN-GSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQL 198 (415) Q Consensus 121 ~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~-~~-~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v 198 (415) +...++..+..++|+.++..|++.+.....+.+++.+... .+ +...+.+|+....+.+.|++||+.+|. +.++++.+ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~-~~~~~~~~ 79 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPM-TQLGFKKT 79 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCcccc-cccccceE Confidence 3333455667899999999999999998888777765322 11 223466777777778999999999985 57999999 Q ss_pred EeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhhHHHHH Q lcl|NC_012784. 199 AYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIK 278 (415) Q Consensus 199 ~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 278 (415) ++.+++++..+.+|+++..++..++.+++.+++++.+++++|..++....+.. ...++..+++++. T Consensus 80 ~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~--------------~~~~~~~t~d~i~ 145 (272) T protein:vir:30 80 TMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKST--------------QTVEATATVDGVS 145 (272) T ss_pred EEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------------cccccccCHHHHH Confidence 99999999999999999999999999999999999999999999986543221 1122344688999 Q ss_pred HHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCC---cccccCcccCCCCceecceeeEEeccccccccCCceEEEechh Q lcl|NC_012784. 279 DAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLG---NYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLK 355 (415) Q Consensus 279 ~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G---~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~ 355 (415) +++..+.+.+.....|+|||.++..|++.+..+. .....+.+..+..++|+|+||++++++|.+. .++++ .. T Consensus 146 da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s~~~p~~t----~~~~~-~~ 220 (272) T protein:vir:30 146 KALDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRSRKCPKGT----AYMVR-KG 220 (272) T ss_pred HHHHHHhccCCCccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEcCCCCcce----EEEEc-CC Confidence 9999998888889999999999999987642221 1112223455666799999999999998654 24444 33 Q ss_pred hcEEEEeecceEEEEeecc-cCceEEEEEEEeccEEeccccEEEEEeecCCCC Q lcl|NC_012784. 356 DAIVLFDRSQYQASWTDYM-HFGECLMIAVRQDCRILDYKSAIVIEYDDSERG 407 (415) Q Consensus 356 ~~~~~~~~~~~~i~~~~~~-~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~~~~ 407 (415) ++.++.+.+++++..++. .+.+.+++..|+++++.+|++++.+++.++..- T Consensus 221 -a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:30 221 -ALRIMLKRNTMVETDRDITKAINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred -eEEEEecCCceeeeccccccceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 466677888888877654 345668888999999999999999999977666 No 105 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=99.96 E-value=2.4e-31 Score=188.46 Aligned_cols=266 Identities=13% Similarity=0.094 Sum_probs=205.2 Q ss_pred hcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEc-cC-CceeEEEEeecCCcccccccccccccccccccceee Q lcl|NC_012784. 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV-TN-GSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQL 198 (415) Q Consensus 121 ~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~-~~-~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v 198 (415) +...++..+..++|+.++..|++.+.....+.+++.+... .+ +...+.+|+....+.+.|++||+.+|. +.++++.+ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~-~~~~~~~~ 79 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPM-TQLGFKKT 79 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCcccc-cccccceE Confidence 3333455667899999999999999998888777765322 11 223466777777778999999999985 57999999 Q ss_pred EeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhhHHHHH Q lcl|NC_012784. 199 AYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIK 278 (415) Q Consensus 199 ~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 278 (415) ++.+++++..+.+|+++..++..++.+++.+++++.+++++|..++....+.. ...++..+++++. T Consensus 80 ~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~--------------~~~~~~~t~d~i~ 145 (272) T protein:vir:98 80 TMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKST--------------QTVEATATVDGVS 145 (272) T ss_pred EEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------------cccccccCHHHHH Confidence 99999999999999999999999999999999999999999999986543221 1122344688999 Q ss_pred HHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCC---cccccCcccCCCCceecceeeEEeccccccccCCceEEEechh Q lcl|NC_012784. 279 DAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLG---NYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLK 355 (415) Q Consensus 279 ~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G---~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~ 355 (415) +++..+.+.+.....|+|||.++..|++.+..+. .....+.+..+..++|+|+||++++++|.+. .++++ .. T Consensus 146 da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s~~~p~~t----~~~~~-~~ 220 (272) T protein:vir:98 146 KALDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRSRKCPKGT----AYMVR-KG 220 (272) T ss_pred HHHHHHhccCCCccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEcCCCCcce----EEEEc-CC Confidence 9999998888889999999999999987642221 1112223455666799999999999998654 24444 33 Q ss_pred hcEEEEeecceEEEEeecc-cCceEEEEEEEeccEEeccccEEEEEeecCCCC Q lcl|NC_012784. 356 DAIVLFDRSQYQASWTDYM-HFGECLMIAVRQDCRILDYKSAIVIEYDDSERG 407 (415) Q Consensus 356 ~~~~~~~~~~~~i~~~~~~-~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~~~~ 407 (415) ++.++.+.+++++..++. .+.+.+++..|+++++.+|++++.+++.++..- T Consensus 221 -a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:98 221 -ALRIMLKRNTMVETDRDITKAINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred -eEEEEecCCceeeeccccccceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 466677888888877654 345668888999999999999999999977666 No 106 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=99.88 E-value=3.8e-24 Score=148.95 Aligned_cols=267 Identities=14% Similarity=0.066 Sum_probs=198.0 Q ss_pred hcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCC--ceeEEEEeecCCcccccccccccccccccccceee Q lcl|NC_012784. 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNG--SGKYPVVRQSEVAALEKVEELEENPELAVKPFFQL 198 (415) Q Consensus 121 ~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~--~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v 198 (415) +....+.-...++|+.+.+-+.+.+.....+.+++.+.....+ ...+.+|+......+.++.||++++. +..+.+.. T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~~~~eg~~i~~-~~lt~~~~ 79 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAEGGEISL-DKIGTTTK 79 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeeccCccccccCCCCccCh-hhcCCcce Confidence 3333455567788999999988888888777777765443222 22455666555566778999999874 56889999 Q ss_pred EeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhhHHHHH Q lcl|NC_012784. 199 AYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIK 278 (415) Q Consensus 199 ~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 278 (415) ++..++.+..+.++++....+..++.+.+.++++..+++.+|+.++....+.. ...+...+++.+. T Consensus 80 ~~~i~~~~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~~--------------~~~~~~~~~d~i~ 145 (272) T protein:vir:36 80 SVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTTS--------------QTVSTKANVDGVQ 145 (272) T ss_pred eEeeehhhccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------------ccccccccHHHHH Confidence 99999999999999999888888999999999999999999999886543211 1123445688999 Q ss_pred HHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcc--cccCcccCCCCceecceeeEEeccccccccCCceEEEechhh Q lcl|NC_012784. 279 DAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNY--LIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKD 356 (415) Q Consensus 279 ~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~--l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~ 356 (415) ++...+.+.......++|||.++..|++.......+ ...+....+.-++++|+||++++.+|.++.-...++++. . T Consensus 146 ~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~p~~~~~~~~~~~~~--g 223 (272) T protein:vir:36 146 AALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNS--P 223 (272) T ss_pred HHHHHhhhcCCCceEEEEcHHHHHHHhcccccccccccccccceeeeccceecCeeEEEeCCCCCCceeEEEEEecc--c Confidence 999999888888889999999999998643322211 111112334457899999999999997765444455553 3 Q ss_pred cEEEEeecceEEEEeec-ccCceEEEEEEEeccEEeccccEEEEEeecC Q lcl|NC_012784. 357 AIVLFDRSQYQASWTDY-MHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) Q Consensus 357 ~~~~~~~~~~~i~~~~~-~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~ 404 (415) ++..+..++++++..++ ..+...+++..+|++++++|+++|.++++.. T Consensus 224 A~~~~~~~~~~vE~~R~~~~~~d~i~~~~~y~~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 224 ALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 272 (272) T ss_pred ceeeeecCCcccccccchhhcCcEEEEEEEEEEEEEcCccEEEEeecCC Confidence 45455567788877665 3456678999999999999999999998877 No 107 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=99.86 E-value=4.7e-23 Score=142.97 Aligned_cols=266 Identities=14% Similarity=0.057 Sum_probs=198.1 Q ss_pred hcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCC--ceeEEEEeecCCcccccccccccccccccccceee Q lcl|NC_012784. 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNG--SGKYPVVRQSEVAALEKVEELEENPELAVKPFFQL 198 (415) Q Consensus 121 ~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~--~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v 198 (415) +....+.-+..++|+.+++.+.+.+.....+.+++.+.....+ ...+.+|+....+.+.++.||+.++. +..+++.. T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~eg~~i~~-~~it~~~~ 79 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT-DILETKKR 79 (274) T ss_pred CCccceehhheechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCcccccCCCcccc-ccccccee Confidence 3334455667889999999999988888777777765332112 12456666655567789999999874 57899999 Q ss_pred EeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhhHHHHH Q lcl|NC_012784. 199 AYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIK 278 (415) Q Consensus 199 ~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 278 (415) ++..++.+..+.++++...++..++.+.+.++++.++++++|+.++....+.... ..+...+++.+. T Consensus 80 ~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~~-------------~~~~~~~~d~i~ 146 (274) T protein:vir:93 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT-------------VNADITKLNGLQ 146 (274) T ss_pred EEEeeeecccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc-------------ccccccCHHHHH Confidence 9999999999999999999888899999999999999999999998765443211 112344688999 Q ss_pred HHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccc-----cCcccCCCCceecceeeEEeccccccccCCceEEEec Q lcl|NC_012784. 279 DAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLI-----QPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGN 353 (415) Q Consensus 279 ~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~-----~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd 353 (415) ++..++.+.......++|||.++..|++.. .-+++- .+....+..++++|+||++++.+|.+. .++++. T Consensus 147 dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~--~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t----~~l~~~ 220 (274) T protein:vir:93 147 SAIDKFNDEDLEPMVLFINPLDAGKLRGDA--STNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGT----AILAKK 220 (274) T ss_pred HHHHHhhhccCCccEEEeCHHHHHHHHhhh--hhcccccccccccceeecccceecCeeEEEcCCCCcce----EEEEeC Confidence 999998888888889999999999997531 111110 111234456789999999999998643 355553 Q ss_pred hhhcEEEEeecceEEEEeec-ccCceEEEEEEEeccEEeccccEEEEEeecCCCCc Q lcl|NC_012784. 354 LKDAIVLFDRSQYQASWTDY-MHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGE 408 (415) Q Consensus 354 ~~~~~~~~~~~~~~i~~~~~-~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~~~~~ 408 (415) .++..+...++.++..++ ..+...+++..++++++++|++++.+++.+++..- T Consensus 221 --gai~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~t~~~~s~~~ 274 (274) T protein:vir:93 221 --GAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred --CeEEEEecCCcccccccchhhcccEEEEEEEEEEEEEcCCceEEEeeCccccCC Confidence 345556667777776654 34566789999999999999999999987766555 No 108 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=99.84 E-value=2.9e-22 Score=138.67 Aligned_cols=267 Identities=14% Similarity=0.076 Sum_probs=199.0 Q ss_pred hhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCC--ceeEEEEeecCCccccccccccccccccccccee Q lcl|NC_012784. 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNG--SGKYPVVRQSEVAALEKVEELEENPELAVKPFFQ 197 (415) Q Consensus 120 ~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~--~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~ 197 (415) +.....+.-...++|+.+++-+.+.+.....+.+++.+-+...+ ...+.+|.....+.+.++.||+.++. +..+++. T Consensus 1 ~~~~~~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~-~~lt~~~ 79 (275) T protein:vir:96 1 MALENMTKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAKVVPEGEEIPI-DLIETKK 79 (275) T ss_pred CCCcccchhhhhhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeeccCCccccccCCCCcch-hhcccce Confidence 22222344556788999999999999988888888765443211 22455555555556778999999874 5688999 Q ss_pred eEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhhHHHH Q lcl|NC_012784. 198 LAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDI 277 (415) Q Consensus 198 v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 277 (415) .+...++.+..+.++++....+..|+.+...++++..+++++|+.++.-.++.... ......+++.+ T Consensus 80 ~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~~~-------------~~~~~~~~d~i 146 (275) T protein:vir:96 80 RQATIRKIGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGATLK-------------VEADITKLAGL 146 (275) T ss_pred eeEEeehhcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc-------------ccccccCHHHH Confidence 99999999999999999988877788999999999999999999988655443211 11234568999 Q ss_pred HHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccc-----cCcccCCCCceecceeeEEeccccccccCCceEEEe Q lcl|NC_012784. 278 KDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLI-----QPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIG 352 (415) Q Consensus 278 ~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~-----~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~g 352 (415) .++...+.+.......++|||..+..|+++.+. +++- .+...++.-++++|++|++++.+|.+. .+++| T Consensus 147 ~dA~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~--~f~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t----~~i~~ 220 (275) T protein:vir:96 147 QTAIDKFNDEDLEPMVLFVNPLDAGKLRASATD--NFTRATLLGDNVIVKGAFGEALGAIIVRSNKIKEGE----AILAK 220 (275) T ss_pred HHHHHHhccccCCccEEEeCHHHHHHHHhcccc--cccccccccccceeccccceecCeeEEEeCCCCcce----EEEEe Confidence 999999988777888999999999999876321 1110 112334556789999999999998654 36776 Q ss_pred chhhcEEEEeecceEEEEeec-ccCceEEEEEEEeccEEeccccEEEEEeecCCCCc Q lcl|NC_012784. 353 NLKDAIVLFDRSQYQASWTDY-MHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGE 408 (415) Q Consensus 353 d~~~~~~~~~~~~~~i~~~~~-~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~~~~~ 408 (415) . .++..+...++.++..++ ..+...+++..+++.++++|++++.++++++.-|. T Consensus 221 ~--gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 275 (275) T protein:vir:96 221 R--GAVKLITKRDFFLETERHASHKSTALFSDKHYVAYLYDESKVVKITKSASGLGV 275 (275) T ss_pred c--cceeeeecCCcccccccchhhcCcEEEEeEEEEEEEEcCccEEEEEecccccCC Confidence 4 345556667778877654 44566788889999999999999999888777776 No 109 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=99.81 E-value=2.9e-21 Score=133.22 Aligned_cols=268 Identities=12% Similarity=0.050 Sum_probs=197.9 Q ss_pred hcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccC--CceeEEEEeecCCcccccccccccccccccccceee Q lcl|NC_012784. 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTN--GSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQL 198 (415) Q Consensus 121 ~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~--~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v 198 (415) +....+.-+..++|+.+++-+.+.+.....+.+++.+...-. +...+.+|.......+.++.||++++. +..+++.. T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg~~i~~-~~lt~~~~ 79 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEGQKIPV-DKIETNRR 79 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCCCccccccCCCccCc-ccccccee Confidence 332345566778999999999999988888888876543211 222455555555567778999999874 56899999 Q ss_pred EeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhhHHHHH Q lcl|NC_012784. 199 AYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIK 278 (415) Q Consensus 199 ~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 278 (415) ....++.+..+.++++....+..|+.+.+.++++..+++++|+.++.-..++.. .......+++.+. T Consensus 80 ~a~i~~~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~~-------------~~~~~~~t~d~i~ 146 (276) T protein:vir:10 80 EAKIHKIGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTKL-------------TVSADIGTLAGLE 146 (276) T ss_pred eEEeehccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc-------------cccccccCHHHHH Confidence 999999999999999999988889999999999999999999998864322111 1122345688899 Q ss_pred HHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccc-----cCcccCCCCceecceeeEEeccccccccCCceEEEec Q lcl|NC_012784. 279 DAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLI-----QPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGN 353 (415) Q Consensus 279 ~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~-----~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd 353 (415) ++...+.+......+++|||..+..|+++.+-+ ++- .+....|.-++++|++|++++.+|.+. .++++. T Consensus 147 ~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~--f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t----~~l~~~ 220 (276) T protein:vir:10 147 AAIDTFDDEDLEPMVLFINPKDAGKLRSSASDN--FTRATELGDNIIVKGAFGEALGAVIVRSKKLDEGE----AILAKR 220 (276) T ss_pred HHHHHhccccCcccEEEEcHHHHHHHHHhcccc--ccccccccccceeccccceecceeEEEcCCCCcce----EEEEec Confidence 999998887778889999999999998754322 111 111234455789999999999998654 366764 Q ss_pred hhhcEEEEeecceEEEEeecc-cCceEEEEEEEeccEEeccccEEEEEeecCCCCccc Q lcl|NC_012784. 354 LKDAIVLFDRSQYQASWTDYM-HFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGD 410 (415) Q Consensus 354 ~~~~~~~~~~~~~~i~~~~~~-~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~~~~~~~ 410 (415) .++..+...++.++..+.. .+...+++..+|+.++.+|..++.+++..-+.+-|. T Consensus 221 --gAi~~~~~~~~~vE~dRd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~~~ 276 (276) T protein:vir:10 221 --GAVKLITKRDFFLETDRDPSTKTTALYSDKHYVAYLYDESKAVKVTKGAGTTDSGA 276 (276) T ss_pred --cceeeeecCCceeecccchhhcccEEEEeeEEEEEEEcCcceEEEecCCcCCcCCC Confidence 3455666778888877653 455668888999999999999999997763333333 No 110 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=99.81 E-value=5.4e-21 Score=131.71 Aligned_cols=266 Identities=12% Similarity=0.068 Sum_probs=192.6 Q ss_pred hcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccC--CceeEEEEeecCCcccccccccccccccccccceee Q lcl|NC_012784. 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTN--GSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQL 198 (415) Q Consensus 121 ~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~--~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v 198 (415) +....+.-+...+|+.+++-+.+.+.....+.+++.+..... +...+.+|.....+.+..+.||+.++. +..+++.. T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~~g~~i~~-~~it~~~~ 79 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPV-DQIGTSKR 79 (274) T ss_pred CCccccchhhhhhhHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCccccCCCCcCch-hhccccee Confidence 333334556788999999999998888877777765533211 122455555554556678899998874 56899999 Q ss_pred EeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhhHHHHH Q lcl|NC_012784. 199 AYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIK 278 (415) Q Consensus 199 ~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 278 (415) ++..++.+..+.++++....+..++.+.+.++++..+++.+|..+++...+... .......+++.+. T Consensus 80 ~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~~-------------~~~~~~~~~d~i~ 146 (274) T protein:vir:96 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL-------------TVEADITKLDGLQ 146 (274) T ss_pred EEEEEeeeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-------------CcCcccccHHHHH Confidence 999999998999999998888889999999999999999999998875433211 1112344688999 Q ss_pred HHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccc-----cCcccCCCCceecceeeEEeccccccccCCceEEEec Q lcl|NC_012784. 279 DAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLI-----QPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGN 353 (415) Q Consensus 279 ~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~-----~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd 353 (415) ++...+.+.......++|||..+..|+++... +++- ......+.-++++|++|++++.+|.+. .+++|. T Consensus 147 dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~--~f~~~~~~g~~~~~~g~ig~~~G~~Vi~s~~~p~~t----~~l~~~ 220 (274) T protein:vir:96 147 TAIDKFNDEDLEPMVLFVNPLDAGGLRTSASD--NFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGE----ALLAKK 220 (274) T ss_pred HHHHHhcccCCCceEEEeCHHHHHHHHhcccc--cccccccccccceeecccceecCeeEEEcCCCCcce----EEEEeC Confidence 99999888877888999999999999885311 1111 111233456789999999999999654 366663 Q ss_pred hhhcEEEEeecceEEEEeec-ccCceEEEEEEEeccEEeccccEEEEEeecCCCCcccccc Q lcl|NC_012784. 354 LKDAIVLFDRSQYQASWTDY-MHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDLGL 413 (415) Q Consensus 354 ~~~~~~~~~~~~~~i~~~~~-~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~~~~~~~~~~ 413 (415) .++..+...++.++..+. ..+...+++..+++.++++|+++|.++..++.. +. T Consensus 221 --gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~yg~~~~~~~~vv~~t~~~~~~-----~~ 274 (274) T protein:vir:96 221 --GAVKLITKRDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAGDE-----VM 274 (274) T ss_pred --cceeeeecCCcccccccchhhcccEEEEeeEEEEEEEcCccEEEEEcCcccc-----cC Confidence 345556667777776554 445667888899999999999999987443322 22 No 111 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=99.81 E-value=9.4e-21 Score=130.38 Aligned_cols=266 Identities=14% Similarity=0.044 Sum_probs=195.6 Q ss_pred hcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccC--CceeEEEEeecCCcccccccccccccccccccceee Q lcl|NC_012784. 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTN--GSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQL 198 (415) Q Consensus 121 ~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~--~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v 198 (415) +....+.-+..++|+.+.+-+.+.+.....+.+++.+-.... +...+.+|.......+..+.||+.++ .+..+.+.. T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~-~~~lt~~~~ 79 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIP-TDILETKKR 79 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccc-cccccccee Confidence 333345566788999999999988887777767766533211 12245555555445677889999887 457889999 Q ss_pred EeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhhHHHHH Q lcl|NC_012784. 199 AYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIK 278 (415) Q Consensus 199 ~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 278 (415) ++..++.+..+.++++....+..|+.+.+.++++.++++.+|..++....+.... ......+++.+. T Consensus 80 ~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~-------------~~~~~~~~d~i~ 146 (274) T protein:vir:94 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT-------------VNADITKLNGLQ 146 (274) T ss_pred EEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCcc-------------ccccccCHHHHH Confidence 9999999989999999988888889999999999999999999988764432110 112344688999 Q ss_pred HHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccc-----cCcccCCCCceecceeeEEeccccccccCCceEEEec Q lcl|NC_012784. 279 DAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLI-----QPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGN 353 (415) Q Consensus 279 ~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~-----~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd 353 (415) ++...+.+.......++|||..+..|++. ..-+++- .+....+..++++|++|++++.+|.+. .+++|. T Consensus 147 dA~~~l~d~~~~~~~ivv~p~~~~~L~k~--~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t----~~l~~~ 220 (274) T protein:vir:94 147 SAIDKFNDEDLEPMVLFVNPLDAGKLRGD--ASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGT----AILAKK 220 (274) T ss_pred HHHHHhhccCCCceEEEeCHHHHHHHHhh--hhhhccccCcccccceeccccceecCeeEEEcCCCCcce----EEEEeC Confidence 99999988888888999999999999753 1111111 111234556789999999999998643 366664 Q ss_pred hhhcEEEEeecceEEEEeec-ccCceEEEEEEEeccEEeccccEEEEEeecCCCCc Q lcl|NC_012784. 354 LKDAIVLFDRSQYQASWTDY-MHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGE 408 (415) Q Consensus 354 ~~~~~~~~~~~~~~i~~~~~-~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~~~~~ 408 (415) .++..+...++.++..++ ..+...+++..+|++++++|..++.++++.++..- T Consensus 221 --gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:94 221 --GAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred --cceEeeecCCceeccccchhhcccEEEEEEEEEEEEEcCCceEEEecCcccccC Confidence 345566677788877654 34566788899999999999999999988766555 No 112 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=99.81 E-value=9.4e-21 Score=130.38 Aligned_cols=266 Identities=14% Similarity=0.044 Sum_probs=195.6 Q ss_pred hcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccC--CceeEEEEeecCCcccccccccccccccccccceee Q lcl|NC_012784. 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTN--GSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQL 198 (415) Q Consensus 121 ~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~--~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v 198 (415) +....+.-+..++|+.+.+-+.+.+.....+.+++.+-.... +...+.+|.......+..+.||+.++ .+..+.+.. T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~-~~~lt~~~~ 79 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIP-TDILETKKR 79 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccc-cccccccee Confidence 333345566788999999999988887777767766533211 12245555555445677889999887 457889999 Q ss_pred EeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhhHHHHH Q lcl|NC_012784. 199 AYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIK 278 (415) Q Consensus 199 ~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 278 (415) ++..++.+..+.++++....+..|+.+.+.++++.++++.+|..++....+.... ......+++.+. T Consensus 80 ~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~-------------~~~~~~~~d~i~ 146 (274) T protein:vir:97 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT-------------VNADITKLNGLQ 146 (274) T ss_pred EEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCcc-------------ccccccCHHHHH Confidence 9999999989999999988888889999999999999999999988764432110 112344688999 Q ss_pred HHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccc-----cCcccCCCCceecceeeEEeccccccccCCceEEEec Q lcl|NC_012784. 279 DAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLI-----QPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGN 353 (415) Q Consensus 279 ~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~-----~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd 353 (415) ++...+.+.......++|||..+..|++. ..-+++- .+....+..++++|++|++++.+|.+. .+++|. T Consensus 147 dA~~~l~d~~~~~~~ivv~p~~~~~L~k~--~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t----~~l~~~ 220 (274) T protein:vir:97 147 SAIDKFNDEDLEPMVLFVNPLDAGKLRGD--ASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGT----AILAKK 220 (274) T ss_pred HHHHHhhccCCCceEEEeCHHHHHHHHhh--hhhhccccCcccccceeccccceecCeeEEEcCCCCcce----EEEEeC Confidence 99999988888888999999999999753 1111111 111234556789999999999998643 366664 Q ss_pred hhhcEEEEeecceEEEEeec-ccCceEEEEEEEeccEEeccccEEEEEeecCCCCc Q lcl|NC_012784. 354 LKDAIVLFDRSQYQASWTDY-MHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGE 408 (415) Q Consensus 354 ~~~~~~~~~~~~~~i~~~~~-~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~~~~~ 408 (415) .++..+...++.++..++ ..+...+++..+|++++++|..++.++++.++..- T Consensus 221 --gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:97 221 --GAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred --cceEeeecCCceeccccchhhcccEEEEEEEEEEEEEcCCceEEEecCcccccC Confidence 345566677788877654 34566788899999999999999999988766555 No 113 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=99.80 E-value=1.4e-20 Score=129.51 Aligned_cols=271 Identities=13% Similarity=0.031 Sum_probs=188.0 Q ss_pred hcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEc-cC-CceeEEEEeecCCcccccccccccccccccccceee Q lcl|NC_012784. 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV-TN-GSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQL 198 (415) Q Consensus 121 ~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~-~~-~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v 198 (415) +...++..+..++|+.+++.+.+.+.....+.+++..... .+ +...+.+|+......+.++.||+.++. +..+++.. T Consensus 1 Ma~~~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~~~~~g~~i~~-~~lt~~~~ 79 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQDVAEGAAIDY-SALETESV 79 (278) T ss_pred CCCcceehhheecHHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeeccCCcceeecCCCcCcc-ccccccee Confidence 2223445567899999999999988887777777654322 11 122355555554456678999998874 57899999 Q ss_pred EeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhhHHHHH Q lcl|NC_012784. 199 AYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIK 278 (415) Q Consensus 199 ~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 278 (415) ++..++.+..+.++++....+..++.+.+.++++..+++.+|+.+++...+.... .....+.......++.+. T Consensus 80 ~~~i~~~~~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~~-------~~~~~t~~~~~~~~~~~~ 152 (278) T protein:vir:80 80 KHGIKKAGKGVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTLE-------VKGAINIGLIDKIENTFT 152 (278) T ss_pred eEeeehhhccccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhccccc-------cccccccchhhhHHHHHH Confidence 9999999989999999999988899999999999999999999988764322111 011111112233466777 Q ss_pred HHHHHhhhhccC-CCEEEEcHHHHHHHHHhhccCCc---ccccCcccCCCCceecceeeEEeccccccccCCceEEEech Q lcl|NC_012784. 279 DAINLNVKPNYE-HNVAIVSQTMFAKLDKMKDKLGN---YLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNL 354 (415) Q Consensus 279 ~~~~~~~~~~~~-~~~~v~~~~~~~~l~~lkd~~G~---~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~ 354 (415) ++..++..+... ...++|||..+..|++....+.. .+-.+....+.-++++|++|++++++|.+. .++|+. T Consensus 153 da~~~l~~~~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t----~~l~~~- 227 (278) T protein:vir:80 153 DAPDAIEDESITTTGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVKGAFGELLGWEIVRTKKLADGN----ALAVKA- 227 (278) T ss_pred HHHHhhcccCCCcccEEEECHHHHHHHHhhhhhhccccccccccceeeccceeecceeEEEcCCCCcce----EEEEec- Confidence 777766555443 34688999999999865322110 111112334556789999999999998653 356653 Q ss_pred hhcEEEEeecceEEEEeec-ccCceEEEEEEEeccEEeccccEEEEEeecCC Q lcl|NC_012784. 355 KDAIVLFDRSQYQASWTDY-MHFGECLMIAVRQDCRILDYKSAIVIEYDDSE 405 (415) Q Consensus 355 ~~~~~~~~~~~~~i~~~~~-~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~~ 405 (415) .++..+...+++++..++ ..+...+++..+++.++++|+++|.++..+.. T Consensus 228 -gAi~~~~~~~~~vE~~Rd~~~~~d~i~~~~~yg~~v~~~~~~v~it~~a~~ 278 (278) T protein:vir:80 228 -GALKTFLKRNLLAESGRDMDHKLTKFNADQHYAVALVDETKAVKVVPVAGN 278 (278) T ss_pred -cceeeeecCCcccccccchhhccceeeeeeEEEEEEEcCcceEEEeeccCC Confidence 345556667777776654 34566788889999999999999999876655 No 114 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=99.79 E-value=1.9e-20 Score=128.76 Aligned_cols=360 Identities=16% Similarity=0.147 Sum_probs=203.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccchhhhhhH Q lcl|NC_012784. 7 LQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQ 86 (415) Q Consensus 7 l~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 86 (415) |..=+.++++. -+++.+. +|.+.|+.++++.+... +.+ ....+.++... T Consensus 1 ~~~~~~~~~~~----------~~~~~~~-------~e~k~lr~~me~~et~~----e~~------~~~~~~~~~e~---- 49 (393) T protein:vir:79 1 MENWLKQLKES----------GFTETQV-------QEQKSLRTRMERGETLA----EAD------ANKLALNEEET---- 49 (393) T ss_pred CchHHHHHHhc----------cCchhHH-------HHHHHHHHHhhhhhhhh----hhh------hhhhhcchhHH---- Confidence 11112222111 1222222 22333333333221111 000 00000000000 Q ss_pred HHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEc-cCCce Q lcl|NC_012784. 87 ANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV-TNGSG 165 (415) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~-~~~~~ 165 (415) .....+.+.........+ .......++.++..+||..+++.|.+...+....-+++..+.. .+.+. T Consensus 50 -el~E~f~Kmm~G~~p~~e------------V~~~e~mtt~~a~IliP~vis~v~~Eaaepl~~~~kl~qk~~L~~Grsm 116 (393) T protein:vir:79 50 -QILESFAKMMEGETPTNE------------VNLREFMATPSAQILIPRVIVGTMREAAEPLYIGTKMLQKIRLKSGQSM 116 (393) T ss_pred -HHHHHHHHHhcCCCchhh------------eehhhhhcCCCcceechhhhhhhhhhcccchhHHHHHHHHHhhhcCcce Confidence 000111111111111111 0111123666778999999999999977776666566554444 22222 Q ss_pred eEEEEeecCCcccccccccccccccc--cccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 166 KYPVVRQSEVAALEKVEELEENPELA--VKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAI 243 (415) Q Consensus 166 ~~~~~~~~~~~~a~~v~Eg~~~~~~~--~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~i 243 (415) . ++ ..+.-.+..|+||++.|+.+ ..+|+.|++..+|++..+.+|+|+++||..++.+++.+...++++++.+... T Consensus 117 ~--F~-~~g~~Ra~~IgEGgE~~~~sld~~T~dsv~~~~gK~G~~Ia~SqEmIsDSg~Dvin~~l~aA~RaMaRkKee~a 193 (393) T protein:vir:79 117 I--FP-SIGIMRAYDVAEGQEIPEDSIDWQTHESPEIRVGKSGIRLRFTDEMISDSQWDLMSMMIKQAGRAMGRHKEQKA 193 (393) T ss_pred e--cc-chheeeeccccccccccccchhhhcCCceeEEechhhhhhhhHHHHhhcchHHHHHHHHHHHHHHHHhhhHHHH Confidence 2 22 23455778999999998644 3578999999999999999999999999999999999999999999999999 Q ss_pred hhcccccccc---cccccc----ccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHh---hccCCc Q lcl|NC_012784. 244 IDVITKGSTG---STSSGF----EKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKM---KDKLGN 313 (415) Q Consensus 244 l~g~g~~~~~---~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l---kd~~G~ 313 (415) +++..+.... ..+++. ..-.......++.+.+|+++++.+..+..+.+++|+|||-.|+.+++- ....-+ T Consensus 194 ~n~fk~~ghtvfDa~st~t~ahptGr~~~~~qNGTlSleDllDm~~av~~~hyt~svi~MHPLAWnv~AKna~me~~~~n 273 (393) T protein:vir:79 194 YHQFRSHGHTVFDNYSTNKLAHTTGLDKNGVQNDTFSAEDFLDLIIAVMANEYTPSDLMMHPLAWTVFAKNELMGSLQAN 273 (393) T ss_pred HhhhhcccceeeeccccCccceeecCCccccccccccHHHHHHHHHHHhcccCCcceEEEcCchhhhhhhhhhhcceeec Confidence 9987665441 111111 111122345678899999999999999999999999999999999863 222222 Q ss_pred ccccCccc------CCCCceecc-----eeeEEeccccccccCCceEEEechhhcE-EEEeecceEEE-EeecccCceEE Q lcl|NC_012784. 314 YLIQPDVK------EKTQQRLLG-----AKIEILPDEVLGQKGNNTLIIGNLKDAI-VLFDRSQYQAS-WTDYMHFGECL 380 (415) Q Consensus 314 ~l~~~~~~------~~~~~~l~G-----~pV~~~~~~~~~~~~~~~~~~gd~~~~~-~~~~~~~~~i~-~~~~~~~~~~~ 380 (415) ++-.-++. ...|..|.| +.|++++.+|.......+-++..-++++ .+..+-+++.+ +++-..+.+.+ T Consensus 274 a~gN~~~~~~~ts~algp~~i~~~~~~nlnv~~sPfvp~d~k~~rFd~~~Vd~NnvgvlLV~D~i~tdq~ddk~rdiq~i 353 (393) T protein:vir:79 274 PYGNYPAKGAPSSMALGPDSIQGRLPFNFNVNLSPFIPLDKKSRRFDVYAVDRNNVGVLLVRDDLKTDQWDEKARGLQNI 353 (393) T ss_pred cccccCccccchhhhhchhhhccccccceeEEEecccccccccceeeEEEeecCCceEEEEecCcceeccccccccceee Confidence 22110111 112223333 5799999998766544443333222222 12234455553 44445677889 Q ss_pred EEEEEeccEEeccc-cE---EEEEeecC---CCCcccccc Q lcl|NC_012784. 381 MIAVRQDCRILDYK-SA---IVIEYDDS---ERGEGDLGL 413 (415) Q Consensus 381 ~~~~r~d~~v~~p~-a~---~~~~~t~~---~~~~~~~~~ 413 (415) ....|+|..|++.. ++ .-++++.+ |---.+++- T Consensus 354 Kl~ERYG~gvLn~gkaiavakNI~~~k~y~~P~~~~~~~~ 393 (393) T protein:vir:79 354 KMIERYGIGILNEGKAIAVAKNISMDKSYAEPMLIKNVGN 393 (393) T ss_pred eeeeeeceeeeeCCceEEEEecceeecccccchhhhccCC Confidence 99999999999973 33 33444432 111111111 No 115 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=99.78 E-value=1e-20 Score=130.21 Aligned_cols=307 Identities=8% Similarity=-0.006 Sum_probs=203.7 Q ss_pred HHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEe Q lcl|NC_012784. 92 LGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVR 171 (415) Q Consensus 92 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 171 (415) ..+.+ ......+|....+......+...+....+.+.|......|++.+.+.+.++..+....+.++...|+. T Consensus 1 ~~~~~-----~~~~~~~~~~~~~~~p~l~m~alTLaea~~l~~d~~~~~VIE~l~~~s~iL~~lpf~~ve~~~~~~~r-- 73 (330) T protein:vir:94 1 MVRIC-----TPPLRGRWRTLTHQFPELKMPTVTLAESAKLSQDHLVSGLIETIVEVNPLYEMMPFTEIEGNALAYNR-- 73 (330) T ss_pred Cceec-----CCccccceeehhccccccchhhhhhhHHhhcCchhhHHHHHHhhhccchHHhhcccccccCCcceeee-- Confidence 00000 00001111111111112223344555677888999999999999999999999887777766655554 Q ss_pred ecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHH--hcchHHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_012784. 172 QSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAI--EDAKVNVLQELKLWMARTIAATRNKAIIDVITK 249 (415) Q Consensus 172 ~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l--~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~ 249 (415) ...-+.+.|...++..++....+|.+++...+.+.+++.|.+.+. ...+.++..+-.....+++.++.+.++|+|+.+ T Consensus 74 ~~~lp~a~~r~~n~~~~~~~~~Tf~q~t~~l~~l~~~~~Vd~~iadl~g~~~d~~~~q~~~~ieal~~~~e~~linGDs~ 153 (330) T protein:vir:94 74 ENVLGDVQFLAVGGTITAKNPATFTKVTSELTTLIGDAEVNGLIQATRSDFMDQTSVQVASKAKSIGRQYQASMITGDGT 153 (330) T ss_pred eecCCcceeeeccccccccCcceeeeeeechhhhhhhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhhccCCC Confidence 455678889888877765444589999999999999999999995 456778899999999999999999999999866 Q ss_pred cccccccccccccccc---ccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCcc---cCC Q lcl|NC_012784. 250 GSTGSTSSGFEKEGKK---LEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDV---KEK 323 (415) Q Consensus 250 ~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~---~~~ 323 (415) +....++......... ....+..+.|++-.++.........++.|+||+....+|+.+....|++-..+.. .+. T Consensus 154 ~~~F~GL~~~~~~~q~i~tg~~gg~~T~d~LDeLl~~v~~~~g~~~~~l~n~a~~r~I~a~~R~~~~~~v~~~~~~~~G~ 233 (330) T protein:vir:94 154 GNSFQGMMGLVAASQTISAGANGGTLTFELLDQLLDLVKDKDGQVDYLMSSFAMRRKYFSLLRALGGAAIGEVMTLPSGR 233 (330) T ss_pred CccccchhhcCCcccEEecCCCCCCCCHHHHHHHHHHhcCCCCCCcEEEechhHHHHHHHHHHhccCCCCCCcccccCCC Confidence 4332122111111111 1123555677776666666555667899999999999999998888776543322 222 Q ss_pred CCceecceeeEEecccccc-----ccCCceEEEechh-----hcEEEEe---ecceEEEEee--cccCceEEEEEEEecc Q lcl|NC_012784. 324 TQQRLLGAKIEILPDEVLG-----QKGNNTLIIGNLK-----DAIVLFD---RSQYQASWTD--YMHFGECLMIAVRQDC 388 (415) Q Consensus 324 ~~~~l~G~pV~~~~~~~~~-----~~~~~~~~~gd~~-----~~~~~~~---~~~~~i~~~~--~~~~~~~~~~~~r~d~ 388 (415) ...++.|+||+.++.+|.+ +.+...|++..|- +++.... ..++++..-. +.......++.++++. T Consensus 234 ~v~~~~GvPi~~~d~ip~~~~~~~~~~ttsIyav~~G~~~~~qgV~Gl~~~g~~glsVr~~G~~~~k~v~~~~v~~y~~~ 313 (330) T protein:vir:94 234 QIPTYRGVPWFVNDFIPSNMTQGTATNATAIFAGTFDDGSNKYGIAGLTARGSAGLRVQNVGAKENADETITRVKMYCGF 313 (330) T ss_pred EEeeeCCeEEEecccccCCCCcccCCCceeEEEEeecccccccceEeecCCCCCcceeeeCCCccccceeeEEEEEeeee Confidence 2346789999999988864 2344566665543 2333332 2356665532 3444556788999999 Q ss_pred EEeccccEEEEEeecCCCC Q lcl|NC_012784. 389 RILDYKSAIVIEYDDSERG 407 (415) Q Consensus 389 ~v~~p~a~~~~~~t~~~~~ 407 (415) ++..|+|+.+++=- .-| T Consensus 314 av~~~~a~~~L~~V--~~g 330 (330) T protein:vir:94 314 ANFSQLGLAAIKGL--IPG 330 (330) T ss_pred EEechhheeeeccc--cCC Confidence 99999999987721 112 No 116 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=99.77 E-value=1.4e-19 Score=123.87 Aligned_cols=266 Identities=13% Similarity=0.049 Sum_probs=192.2 Q ss_pred hcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEc-cC-CceeEEEEeecCCcccccccccccccccccccceee Q lcl|NC_012784. 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV-TN-GSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQL 198 (415) Q Consensus 121 ~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~-~~-~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v 198 (415) +....+.-+..++|+.+++-+.+.+.....+.+++.+-.- .+ +...+.+|.....+.+..+.||+.++ .+..+.+.. T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~-~~~lt~~~~ 79 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIP-TDILETKKR 79 (274) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccc-hhhccccee Confidence 3333445567789999999998888777666666665322 11 12245555554445677889998886 457888999 Q ss_pred EeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhhHHHHH Q lcl|NC_012784. 199 AYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIK 278 (415) Q Consensus 199 ~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 278 (415) +...++.+..+.++++....+..|+.+.+.++++..+++++|+.++....+... .......+++.+. T Consensus 80 ~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~-------------~~~~~a~~~d~i~ 146 (274) T protein:vir:12 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL-------------TVNADITKLNGLQ 146 (274) T ss_pred eEEeeeecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc-------------cccccccCHHHHH Confidence 999999999999999888777778899999999999999999998876543221 1112345689999 Q ss_pred HHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccc-----cCcccCCCCceecceeeEEeccccccccCCceEEEec Q lcl|NC_012784. 279 DAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLI-----QPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGN 353 (415) Q Consensus 279 ~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~-----~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd 353 (415) ++...+.+.......++|||..+..|++.. .-+++- .+....+.-++++|++|++++.+|.+. .+++|. T Consensus 147 dA~~~lgd~~~~~~~ivv~p~~~~~L~k~~--~~~fv~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t----~~l~~~ 220 (274) T protein:vir:12 147 SAIDKFNDEDLEPMVLFINPLDAGKLRGDA--STNFTRATELGDDIIVKGAFGEALGAIIVRSNKLEAGT----AILAKK 220 (274) T ss_pred HHHHHhccccccccEEEeCHHHHHHHHhhh--hhhccccccccccceecccceeecCeeEEEeCCCCcce----EEEEec Confidence 999998887778889999999999987631 111110 112334556789999999999999654 367774 Q ss_pred hhhcEEEEeecceEEEEeecc-cCceEEEEEEEeccEEeccccEEEEEeecCCCCc Q lcl|NC_012784. 354 LKDAIVLFDRSQYQASWTDYM-HFGECLMIAVRQDCRILDYKSAIVIEYDDSERGE 408 (415) Q Consensus 354 ~~~~~~~~~~~~~~i~~~~~~-~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~~~~~ 408 (415) .++..+...+++++..+.. .+...+++..+|++++++|..+|.++...++..- T Consensus 221 --gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:12 221 --GAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred --cceeeeecCCceeccccchhhcccEEEeeeEEEEEEEcCCceEEEEcCCccccC Confidence 3455566677888777653 4566788889999999999999999866544443 No 117 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=99.75 E-value=6.8e-19 Score=120.18 Aligned_cols=266 Identities=14% Similarity=0.073 Sum_probs=191.9 Q ss_pred hcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccC--CceeEEEEeecCCcccccccccccccccccccceee Q lcl|NC_012784. 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTN--GSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQL 198 (415) Q Consensus 121 ~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~--~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v 198 (415) +....+.-+..++|+.+++-+.+.+.....+.+++.+-+... +...+.+|.......+..+.+|+.++. ...+.+.. T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~-~~lt~~~~ 79 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPT-DILETKKR 79 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccch-hhccccee Confidence 223344556778999999999988888777777765433211 122445555444456678899988864 57888999 Q ss_pred EeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhhHHHHH Q lcl|NC_012784. 199 AYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIK 278 (415) Q Consensus 199 ~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 278 (415) ++..++.+..+.++++....+..|+.+.+.++++..+++.+|..++.-..+.... ....+.+++.+. T Consensus 80 ~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~-------------~~~~~~~~d~i~ 146 (274) T protein:vir:95 80 EAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLT-------------VEADITKLTGLQ 146 (274) T ss_pred EEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccccc-------------ccccccCHHHHH Confidence 9999999989999999888877789999999999999999999988655443211 112345688999 Q ss_pred HHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccc-----cCcccCCCCceecceeeEEeccccccccCCceEEEec Q lcl|NC_012784. 279 DAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLI-----QPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGN 353 (415) Q Consensus 279 ~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~-----~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd 353 (415) ++...+.+.......++|||..+..|++.. .-+++- .+....|.-++++|++|++++.+|.+. .+++|. T Consensus 147 ~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~--~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~~~~t----~~l~~~ 220 (274) T protein:vir:95 147 TAIDKFNDEDLEPMVLFISPLDAGKLRGDA--TTNFTRATELGDDVIVKGAFGEALGAVIVRSNKLEAGT----AILAKK 220 (274) T ss_pred HHHHHhccccccccEEEeCHHHHHHHHhhc--cccccccccccccceeccccceecCeEEEEeCCCCCce----EEEEec Confidence 999998877778889999999999987631 111111 112334556789999999999998654 377775 Q ss_pred hhhcEEEEeecceEEEEeec-ccCceEEEEEEEeccEEeccccEEEEEeecCCCCcccccc Q lcl|NC_012784. 354 LKDAIVLFDRSQYQASWTDY-MHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDLGL 413 (415) Q Consensus 354 ~~~~~~~~~~~~~~i~~~~~-~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~~~~~~~~~~ 413 (415) . ++..+...++.++..+. ..+...+++..++++++++|+++|+++ .+.|.+.. T Consensus 221 g--A~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~t-----k~~~~~~~ 274 (274) T protein:vir:95 221 G--AVKLITKRDFFLETDRDPSTKTTALYSDKHYVAYLYDESKAVKIT-----KGSGSLEM 274 (274) T ss_pred c--ceeeeecCCcccccccccccccCEEEEeEEEEEEEEcCCcEEEEE-----cCCccccC Confidence 3 45556667788877654 445667888999999999999999987 33444444 No 118 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=99.75 E-value=6.8e-19 Score=120.18 Aligned_cols=266 Identities=14% Similarity=0.073 Sum_probs=191.9 Q ss_pred hcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccC--CceeEEEEeecCCcccccccccccccccccccceee Q lcl|NC_012784. 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTN--GSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQL 198 (415) Q Consensus 121 ~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~--~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v 198 (415) +....+.-+..++|+.+++-+.+.+.....+.+++.+-+... +...+.+|.......+..+.+|+.++. ...+.+.. T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~-~~lt~~~~ 79 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPT-DILETKKR 79 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccch-hhccccee Confidence 223344556778999999999988888777777765433211 122445555444456678899988864 57888999 Q ss_pred EeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhhHHHHH Q lcl|NC_012784. 199 AYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIK 278 (415) Q Consensus 199 ~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 278 (415) ++..++.+..+.++++....+..|+.+.+.++++..+++.+|..++.-..+.... ....+.+++.+. T Consensus 80 ~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~-------------~~~~~~~~d~i~ 146 (274) T protein:vir:96 80 EAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLT-------------VEADITKLTGLQ 146 (274) T ss_pred EEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccccc-------------ccccccCHHHHH Confidence 9999999989999999888877789999999999999999999988655443211 112345688999 Q ss_pred HHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccc-----cCcccCCCCceecceeeEEeccccccccCCceEEEec Q lcl|NC_012784. 279 DAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLI-----QPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGN 353 (415) Q Consensus 279 ~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~-----~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd 353 (415) ++...+.+.......++|||..+..|++.. .-+++- .+....|.-++++|++|++++.+|.+. .+++|. T Consensus 147 ~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~--~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~~~~t----~~l~~~ 220 (274) T protein:vir:96 147 TAIDKFNDEDLEPMVLFISPLDAGKLRGDA--TTNFTRATELGDDVIVKGAFGEALGAVIVRSNKLEAGT----AILAKK 220 (274) T ss_pred HHHHHhccccccccEEEeCHHHHHHHHhhc--cccccccccccccceeccccceecCeEEEEeCCCCCce----EEEEec Confidence 999998877778889999999999987631 111111 112334556789999999999998654 377775 Q ss_pred hhhcEEEEeecceEEEEeec-ccCceEEEEEEEeccEEeccccEEEEEeecCCCCcccccc Q lcl|NC_012784. 354 LKDAIVLFDRSQYQASWTDY-MHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDLGL 413 (415) Q Consensus 354 ~~~~~~~~~~~~~~i~~~~~-~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~~~~~~~~~~ 413 (415) . ++..+...++.++..+. ..+...+++..++++++++|+++|+++ .+.|.+.. T Consensus 221 g--A~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~t-----k~~~~~~~ 274 (274) T protein:vir:96 221 G--AVKLITKRDFFLETDRDPSTKTTALYSDKHYVAYLYDESKAVKIT-----KGSGSLEM 274 (274) T ss_pred c--ceeeeecCCcccccccccccccCEEEEeEEEEEEEEcCCcEEEEE-----cCCccccC Confidence 3 45556667788877654 445667888999999999999999987 33444444 No 119 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=99.73 E-value=1.1e-18 Score=119.03 Aligned_cols=266 Identities=15% Similarity=0.109 Sum_probs=191.3 Q ss_pred ccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccC--CceeEEEEeecCCcccccccccccccccccccceeeEe Q lcl|NC_012784. 123 SLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTN--GSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAY 200 (415) Q Consensus 123 ~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~--~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~ 200 (415) ..-+.-...++|+.+.+-+.+.......+.+++.+.+.-. +...+.+|.....+.+..+.||++++ ....++++... T Consensus 1 Ma~T~~~d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~igdae~~~eg~~i~-~~~lt~~~~~a 79 (270) T protein:vir:95 1 MTQTKKANLINPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYAYIGAAEDLQEGVAMD-TTQMSMTTTKV 79 (270) T ss_pred CCceehhhhcchHHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeecCCCccccccCCCccc-hhhcccchhee Confidence 1112334567899999999888888877777776533211 22244555555556777899999987 45788999999 Q ss_pred eeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhhHHHHHHH Q lcl|NC_012784. 201 DINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDA 280 (415) Q Consensus 201 ~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 280 (415) ..++.+..+.++++....+.-|....+.++++..+++++|+.++.-..... ...+...+++++.++ T Consensus 80 ~i~~~gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~~a~--------------~~~~~~~t~~~~~dA 145 (270) T protein:vir:95 80 TVKETGKAVEVTQTAIITNVNGTLQEASRQLAMSLADKVEIDYIAELNKSK--------------QTATVSADATGILDA 145 (270) T ss_pred eeehhhCcceecHHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHhcccc--------------cccccccCHHHHHHH Confidence 999999999999998876655778999999999999999998875432211 111234567889999 Q ss_pred HHHhhhhccCCCEEEEcHHHHHHHHHhhccCC-cccccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEE Q lcl|NC_012784. 281 INLNVKPNYEHNVAIVSQTMFAKLDKMKDKLG-NYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIV 359 (415) Q Consensus 281 ~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G-~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~ 359 (415) +..+.+......+++|||.++..|++...-.+ ++ -......+.-++++|++|++++.+|.. ....+|+ +.++. T Consensus 146 ~~~lgd~~~~~~~i~vhs~~~~~Lrk~~~~~~~~~-~~~~~~~G~ig~~~G~~Viv~s~~~~~---~~~~l~~--~gAi~ 219 (270) T protein:vir:95 146 IEVFNSENDEDYVLYVNPKDYNKLVKSLFKVGGNV-QDRAISKGDLVEIVGVSDIVKSKRVSE---NTAFLQR--YGAME 219 (270) T ss_pred HHHhccccCCCcEEEEcHHHHHHHHhhhccccccc-ccchhcccccceecceeEEEeCCCCCc---eeEEEEe--cccee Confidence 99998888888999999999999986431111 11 111233455678999999998877632 1245666 34566 Q ss_pred EEeecceEEEEeecc-cCceEEEEEEEeccEEeccccEEEEEeecCCCCcc Q lcl|NC_012784. 360 LFDRSQYQASWTDYM-HFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEG 409 (415) Q Consensus 360 ~~~~~~~~i~~~~~~-~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~~~~~~ 409 (415) .+...++.++..++. .....+.+..+|.+++.+|..+|.+++.++.+.+- T Consensus 220 ~~~~~~~~vEtdRd~~~~~d~i~~~~~y~v~~~~~skvv~~t~~~a~~~~~ 270 (270) T protein:vir:95 220 IVNKKKPEAYTDFDILKRTHLLSTNYHYSVNLKDETGVVKVTFKPSGSLEM 270 (270) T ss_pred eeecCCceeeeccchhhcccEEEeeeEEEEEEEccceEEEEEecCCCCcCC Confidence 666777888877654 34556788889999999999999999987666655 No 120 >protein:vir:93858 Length: 400 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764266;genbank:gi:115315579;genbank:GeneID:5141552 Probab=99.72 E-value=8.7e-19 Score=119.59 Aligned_cols=376 Identities=11% Similarity=0.105 Sum_probs=225.3 Q ss_pred CCh--HHHHHHHHHHHHHHHHHHHHHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccc Q lcl|NC_012784. 1 MKT--KEELQSEISDIKRQIDLKVKYATRA---LNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSV 75 (415) Q Consensus 1 Mk~--~~el~~~l~~l~~~~~~~~~~~~~~---~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 75 (415) ||+ ++|....+.++++.+......+... -.=+++.+++++++-+.++..+|...+.++....+..... . T Consensus 8 ~~k~~~~ek~~~~~~~~e~~~~lks~~~g~~~~~~~~~~~k~~el~kT~Sel~~ei~k~e~eln~~~E~~Kgk------~ 81 (400) T protein:vir:93 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGK------D 81 (400) T ss_pred cccchHHHHHHHHhhhhhhhhhhhhhhhccchhhhhhhchhHHHHHHHHHHhHHHHHHHhhhhhhhhhhcccc------h Confidence 776 5666666777766665555444322 1224466788888999999888877666655333322111 1 Q ss_pred cccchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcc Q lcl|NC_012784. 76 EVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYV 155 (415) Q Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~ 155 (415) ...+ --+.+.....++...............|.. .....+.+.++....+|..+...|...++.+.++.++. T Consensus 82 ~mte--fLkT~~A~~~fa~~l~~nsg~sd~knaW~A------~l~E~gvt~td~n~iLP~~il~aIq~al~~~~~~~~f~ 153 (400) T protein:vir:93 82 KMTN--FIESQNAVTEFFDVLKKNSGKSEIKNAWSA------KLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVF 153 (400) T ss_pred hHHH--hhhhHHHHHHHHHHHHhhcCCcchhhhhhh------hhhhcccccCCchhhcchHHHHHHHHhhhccCCcccce Confidence 1101 111222223333333333333333332221 11112333344445789999999999999999999987 Q ss_pred eeEEccCCceeEEEEeecCCccccc-ccccccccccccccceeeEeeeeeEEEeehhhHHHHhc--chHHHHHHHHHHHH Q lcl|NC_012784. 156 TVKRVTNGSGKYPVVRQSEVAALEK-VEELEENPELAVKPFFQLAYDINTHRGYFRISREAIED--AKVNVLQELKLWMA 232 (415) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~a~~-v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~d--s~~~l~~~l~~~la 232 (415) .+.+++ .+.+........-+| +.-|.++.+ +..+|...++.|+-++.+..+.+-..++ +.-.|..|++++|. T Consensus 154 ~v~n~p----~l~V~~~~dt~~qa~gHk~G~~K~e-q~~tl~~rtL~P~~VYk~~~la~~~~~~~~tygaL~nYVm~EL~ 228 (400) T protein:vir:93 154 HVTNVG----ALLVSRSFDSANEAQVHKDGQTKTE-QAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELT 228 (400) T ss_pred eeecCC----ceeeecchhhhcccceeccCCcccc-eeeeeeeeccCHHHHHHHhhhhhhhhhccccHHHHHHHHHHHHH Confidence 776663 233332233333445 666777764 4578999999998888877775444432 23358999999999 Q ss_pred HHHHHH-HHHHHhhcccccccccc-----ccccccccccccccchhhHHHHHHH-HHHhhhhccCCCEEEEcHHHHHHHH Q lcl|NC_012784. 233 RTIAAT-RNKAIIDVITKGSTGST-----SSGFEKEGKKLEVKKAKSLDDIKDA-INLNVKPNYEHNVAIVSQTMFAKLD 305 (415) Q Consensus 233 ~~~~~~-~d~~il~g~g~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~v~~~~~~~~l~ 305 (415) ..+.++ .+++++-|+|+.+-.+. ..........+..++...+.+++.- +....+-...+..++|+|..|+.|+ T Consensus 229 q~vI~k~Ve~Aii~GdG~Ngf~~~dk~t~Ik~I~~dt~kt~~a~~~~~qdl~E~~~d~~~~~aad~~~Iv~s~d~~A~L~ 308 (400) T protein:vir:93 229 QAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKAEDRKALLD 308 (400) T ss_pred HHHHHHHhhhheeecccccccCCCcchhhhhhhhhhhhhhhhcCCccHHHHHHHHHhhhhhccCCceeEEeccchHHHHH Confidence 999964 79999999887642111 1111222344445556666666654 3333344556678999999999999 Q ss_pred HhhccCCcccccCcccCCCCceeccee-eEEeccccccccCCceEEEechhhcEEEEeecceEE-EEeecccCceEEEEE Q lcl|NC_012784. 306 KMKDKLGNYLIQPDVKEKTQQRLLGAK-IEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQA-SWTDYMHFGECLMIA 383 (415) Q Consensus 306 ~lkd~~G~~l~~~~~~~~~~~~l~G~p-V~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i-~~~~~~~~~~~~~~~ 383 (415) .++|++|.+.|.....+....+-+|.- +++....+.. +..+++ | .++.+ +-++++- ....-.+++..+.++ T Consensus 309 ~lk~a~~~a~f~~~n~d~~IA~~fGv~~Lv~~Tr~~~~---kp~V~V-D--ek~~i-~~~~~~t~~sf~~~tNs~~ilve 381 (400) T protein:vir:93 309 ELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSKAL---KPTVLV-D--QKYHI-DMQDLTKVDAFEWKTNSNMILVE 381 (400) T ss_pred HhcCCcceeeeeeccccchhhhhcccceeeeeccCCCC---Cceeee-e--hhhhc-cccCceeccceeeeeccceEEee Confidence 999999999996555555545556653 2223333221 112222 3 33333 2233332 122224456667888 Q ss_pred EEeccEEeccccEEEEEee Q lcl|NC_012784. 384 VRQDCRILDYKSAIVIEYD 402 (415) Q Consensus 384 ~r~d~~v~~p~a~~~~~~t 402 (415) ..++|-+.-|++-++++++ T Consensus 382 tlv~Gsi~~~N~~ay~~v~ 400 (400) T protein:vir:93 382 TLTSGHVETYNAGAVITVS 400 (400) T ss_pred eeeccceecccceeeEeeC Confidence 8999999999999998887 No 121 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=99.66 E-value=5.1e-18 Score=115.37 Aligned_cols=228 Identities=14% Similarity=0.070 Sum_probs=169.9 Q ss_pred EEccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 158 KRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAA 237 (415) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~ 237 (415) .+.....-.+.+|+. ...+..++||.+++ ....+++..+.+.++++..+.|+++....+.-|......++++.++++ T Consensus 1 ~~~~~~Gdtit~P~~--iGda~~v~eG~~i~-~~~l~~t~~~atIk~~gk~~~itD~a~l~~~gDp~~ea~~Q~~~~iA~ 77 (231) T protein:vir:73 1 ENGINLANLCEYPND--IGDAADVAEGGEIS-LDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) T ss_pred CccccCCceEEeccc--ccchhhhcCCCcCC-hhhccccceeeeEeeeccceeeeHHHHhhccCchHHHHHHHHHHHHHH Confidence 111111224555555 34667899999998 456899999999999999999999988877778899999999999999 Q ss_pred HHHHHHhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCc--cc Q lcl|NC_012784. 238 TRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGN--YL 315 (415) Q Consensus 238 ~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~--~l 315 (415) ++|..++.-..+.. ...+...+++.+.++...+.+....+.+++|||.++..|++..+.+.. .. T Consensus 78 kvD~di~~~~~~a~--------------l~~~~~~t~d~i~~A~~~fgde~~~~~vivv~p~~~~~Lrk~~~~~~~~~~~ 143 (231) T protein:vir:73 78 KVDDDLLKAAKTTS--------------QTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEV 143 (231) T ss_pred hhhHHHHHhhcccc--------------ccccccccHHHHHHHHHHhccccccceEEEEcchHHHhhhhccchhhhhhhh Confidence 99999886433211 112234678999999999988888888999999999999985543221 11 Q ss_pred ccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecc-cCceEEEEEEEeccEEeccc Q lcl|NC_012784. 316 IQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM-HFGECLMIAVRQDCRILDYK 394 (415) Q Consensus 316 ~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-~~~~~~~~~~r~d~~v~~p~ 394 (415) -.+-+..|.-+++.|+||++++.+|.++.-...++.. +.++.++...++.++..++. ...+.+++..++..++.+|. T Consensus 144 g~~i~~~G~iG~i~G~~Vi~S~~~~~~~~~~~~~i~~--~gAl~~~~k~~~~vEtdRd~~~k~~~i~~~~~y~v~l~~~~ 221 (231) T protein:vir:73 144 GANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSN--SPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLT 221 (231) T ss_pred ccceeeecccceEcceEEEEcCCCCCCceeeeeEEee--ccceeeeecccceeeccccccccccEEEEeEEEEEEEEcCc Confidence 1122345667799999999999999765433333332 34566777788888877654 45667888899999999999 Q ss_pred cEEEEEeecC Q lcl|NC_012784. 395 SAIVIEYDDS 404 (415) Q Consensus 395 a~~~~~~t~~ 404 (415) .+|.++++.. T Consensus 222 ~vv~~t~~g~ 231 (231) T protein:vir:73 222 KVVNITFTGV 231 (231) T ss_pred cEEEEEeecC Confidence 9999999877 No 122 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=99.62 E-value=3.7e-16 Score=105.17 Aligned_cols=280 Identities=10% Similarity=0.019 Sum_probs=186.2 Q ss_pred hcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCcccc---c--ccccccccccccccc Q lcl|NC_012784. 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALE---K--VEELEENPELAVKPF 195 (415) Q Consensus 121 ~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~---~--v~Eg~~~~~~~~~~f 195 (415) +...+....+.+.+..+...||+.+.+.+.++..+...++.++...|.....-.+..+. | ..++. ++ +..+| T Consensus 1 mpaltLaea~k~~~d~l~~~ViE~~~~~s~lL~~LpF~~veg~~~~ynR~~~~~~~~~~~v~~~~~~~g~--~~-~~~t~ 77 (310) T protein:vir:97 1 MASVTLAESAKLAQDELVAGVIENIITVNRMFDVLPFDSIEGNSLAYNRENVLGDVIMAGVGTTFSGAGA--GK-AAATF 77 (310) T ss_pred CcccchHHHhhcCcchHHHHHHHHHhccchHHHhCCcccccCCcceeeEeeccCCcccccccccccCCCc--cc-ccccc Confidence 22244444456778889999999999999999999888888777777665544443332 2 22332 23 45789 Q ss_pred eeeEeeeeeEEEeehhhHHHHhc--c-hHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccc---ccc Q lcl|NC_012784. 196 FQLAYDINTHRGYFRISREAIED--A-KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKL---EVK 269 (415) Q Consensus 196 ~~v~~~~~k~a~~~~iS~e~l~d--s-~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~---~~~ 269 (415) .+++...+-+.+.+.|.+.+.+- + +.+...+=.+...+++.++.+.++|+|+.+..+..++.......... ... T Consensus 78 ~~~~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a~n~F~GL~~~~~~~q~i~~~~~g 157 (310) T protein:vir:97 78 TKVNSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGAGNEFAGLIQLCASGQKATTGATG 157 (310) T ss_pred ceeeeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhccccCCCcccchhhcCCccceeecCCCC Confidence 99999999999999999876542 2 44555555666779999999999999998765432333222221211 122 Q ss_pred chhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHH-hhccCCcccccCc--ccCCCCceecceeeEEecccccc---- Q lcl|NC_012784. 270 KAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDK-MKDKLGNYLIQPD--VKEKTQQRLLGAKIEILPDEVLG---- 342 (415) Q Consensus 270 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~-lkd~~G~~l~~~~--~~~~~~~~l~G~pV~~~~~~~~~---- 342 (415) +..+++++-.++.......+.+..|+|||.+..+|+. .+..+++.++... ..+....++.|+|++.++.+|.. T Consensus 158 g~~t~d~LDeLl~~v~~~~g~p~~~l~~~~~~r~i~A~~R~~~~~g~~~~~~~~~G~~v~~~~GiPi~~~d~ip~~~~~~ 237 (310) T protein:vir:97 158 SAISFAILDELMDLVVDKDGQVDYLTMHARTLRSYKALLRALGGASINEVVELPSGAEVPAYSGTPIFRNDYIPTNQTKG 237 (310) T ss_pred CCCCHHHHHHHHHHHhcCCCCCCEEEecHHHHHHHHHHHHHhcCCCCCCccccCCCCEEeeeCCeEEEEeCccCCCcccc Confidence 4556777777776665566788999999999888775 3566666665432 23333358899999999999864 Q ss_pred -ccCCceEEEechh-----hcEEEE---eecceEEEEee--cccCceEEEEEEEeccEEeccccEEEEE-eec Q lcl|NC_012784. 343 -QKGNNTLIIGNLK-----DAIVLF---DRSQYQASWTD--YMHFGECLMIAVRQDCRILDYKSAIVIE-YDD 403 (415) Q Consensus 343 -~~~~~~~~~gd~~-----~~~~~~---~~~~~~i~~~~--~~~~~~~~~~~~r~d~~v~~p~a~~~~~-~t~ 403 (415) +.+...|+..-|- +++... ...++++..-- +.......++.++++.++..|+|+.+++ ++- T Consensus 238 ~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~~~~glsVr~~G~~~~~~v~~~~V~~Y~~~av~~~~A~a~L~~V~~ 310 (310) T protein:vir:97 238 GTTGCTTIFAGTLDDGSRTHGIAGLTATQAAGIQVVDVGESEDSDEHIWRVKWYCGLALFSEKGLACADGITN 310 (310) T ss_pred ccCCceeEEEEeeCccccccceeccccCCccceeEEeCCcccCCcceeEEEEEeeeEEEecccceeeeccccC Confidence 2345556554433 233321 12346665533 3445566788899999999999999987 333 No 123 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=99.49 E-value=6.5e-15 Score=98.38 Aligned_cols=369 Identities=13% Similarity=0.090 Sum_probs=185.3 Q ss_pred CC----hHHHHHHHHHHHHHHHHH-----------HHHHHHHh---hchHH------HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 1 MK----TKEELQSEISDIKRQIDL-----------KVKYATRA---LNNDE------LEKAEKLEQEITDLRSQIQEKQE 56 (415) Q Consensus 1 Mk----~~~el~~~l~~l~~~~~~-----------~~~~~~~~---~~e~~------~~~~~~~~~e~~~l~~~i~~~~~ 56 (415) |- ..++.--++..+....+. +++..++. .+.+. .+...+...++..+..+++.++. T Consensus 1 ~~n~t~a~d~~~RR~~~~L~~~EvSvv~~PAY~nA~vt~vRe~e~~~~~e~~~~~e~~en~~e~~~~~~~~~~E~Rs~~~ 80 (410) T protein:vir:83 1 MGNATTASDEYIRRLENELREKESLVRGIYDRANASNRDVNEEEGQMVAECRGRMEQIKNQMEQAQEVNRIAFETRSKGQ 80 (410) T ss_pred CCCcccchhhHHHHHHHHhhhhheeeeccccccccccccchhhhccccccccCcccchhhhhHHHHHHHHHHHHHHHHHH Confidence 32 222222111111100000 01111111 11110 11111112222222222222222 Q ss_pred HHHHHHHHHhhhhhccccccccchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchh Q lcl|NC_012784. 57 ELDKLKEKDGTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEE 136 (415) Q Consensus 57 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~ 136 (415) . ....-.+............+.... ..+............. ..++..........+... ...+|.+ T Consensus 81 ~-------i~~~~~~~r~~p~~~~veyRSaGE---~lkal~~~~~Gd~~A~---~~~e~~r~a~~~~~Tgd~-~~~i~~~ 146 (410) T protein:vir:83 81 A-------VDAAISAMRGSPVGTEVEYRSAGE---YMLDMWNSAQGNASAA---DRLEVYARAADHQKTGDL-QGVIPDP 146 (410) T ss_pred H-------HHhhhccCcCCCCCCCcccccHHH---HHHHHhccCCchHHHH---HHHHHHHHhhccCccccc-ccccchh Confidence 2 211111111111111111111111 1111111111111111 111111112222222322 3356667 Q ss_pred HHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCcc-----cccccccccccccccccceeeEeeeeeEEEeehh Q lcl|NC_012784. 137 IVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAA-----LEKVEELEENPELAVKPFFQLAYDINTHRGYFRI 211 (415) Q Consensus 137 ~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~-----a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~i 211 (415) +....++++.+..++..++..-|.++.+..|++..+..... -+.-.||...+ ..+.+|+..+...++++++..+ T Consensus 147 ~v~d~i~li~q~r~i~slf~tLP~~g~T~eY~v~t~~~tV~~q~~~~kqa~EGd~L~-~gKl~~~t~tA~ikTyGGyt~L 225 (410) T protein:vir:83 147 IVGPVIDFIDSARPLVSTLGTLPLNNATFYRPIVSQRPAVGLQGVAGGASDEKTELD-SQKMVIDRLTVNAKTLGGYVNV 225 (410) T ss_pred HhhhHHHHHhhccchhhhhhhCCCCCCeeEEeeeccccccccccccccccccccccc-ccceeeeeccceeehhcCcccc Confidence 88889999999999999998888888888887764443211 12345787776 5678899999999999999999 Q ss_pred hHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhhHHHHH-HHHHHhhhh--c Q lcl|NC_012784. 212 SREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIK-DAINLNVKP--N 288 (415) Q Consensus 212 S~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~--~ 288 (415) |++.++-|+++..+...+.|..+++.+-+...-......... .. .....+...|..++ ++...+.++ + T Consensus 226 SRQ~IERs~v~~L~~~lraL~~AYA~atea~vra~L~~t~t~-----~~----a~~~~Tad~~~~~i~da~~~v~da~~~ 296 (410) T protein:vir:83 226 SRQAIDFSSPSALDLVVNGLGQQYAIETEALVGAALASTSTG-----AV----GYGNATADNVASAIWQAAGAVYTAVKG 296 (410) T ss_pred cceeeecCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh-----hh----hhhhccHHHHHHHHHHHHHHHhhhhcc Confidence 999999999999999999999888888876533322111110 00 01111222332222 333333343 4 Q ss_pred cCCCEEEEcHHHHHHHHHhhccCCcccccC------cc-cCCCCceecceeeEEeccccccccCCceEEEechhhcEEEE Q lcl|NC_012784. 289 YEHNVAIVSQTMFAKLDKMKDKLGNYLIQP------DV-KEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLF 361 (415) Q Consensus 289 ~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~------~~-~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~ 361 (415) ..-..+.++|+.+..+..+- .++++.+.+ ++ ..+--+.|+|+||+..+..+++++ +|-|. .++..| T Consensus 297 ~~~~~i~vS~DVl~~~~~~f-~~~~~~~~dt~Gfg~~~lg~gi~G~~~~ipVvm~~~a~AgTA-----~f~~~-~Ai~~~ 369 (410) T protein:vir:83 297 MGRLVIAIAPDVLGDFGPLF-APVNPTNAHSTGFEAGRFGQGVMGSISGIPVVMSAALGSGDA-----YLFST-AAIECF 369 (410) T ss_pred ceeeeEEechhhhhhcccee-eccCCCCcccccccccccccchhhhhcccceEEecCCCcCee-----eEecc-ceeeee Confidence 44567899999987766542 223333221 11 123456899999999988877654 45564 468888 Q ss_pred eecceEEEEeecc-cC-ceEEEEEEEeccEEeccccEEEEEee Q lcl|NC_012784. 362 DRSQYQASWTDYM-HF-GECLMIAVRQDCRILDYKSAIVIEYD 402 (415) Q Consensus 362 ~~~~~~i~~~~~~-~~-~~~~~~~~r~d~~v~~p~a~~~~~~t 402 (415) ....-.++.++.+ .+ +..+- .||...+..|.+++-+.-+ T Consensus 370 eS~~gp~qL~d~~i~nLt~~yS--gY~a~a~~~~~gliPv~g~ 410 (410) T protein:vir:83 370 EQRVGTLQVVEPSVFGLQVAYA--GYFSTLVVNEDAIVPLVGS 410 (410) T ss_pred ecCCceeEeeCCchhhhhhhhe--eeeeeccccccceeeeccC Confidence 7665444444332 22 22333 6667888888988876533 No 124 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=99.48 E-value=4.5e-15 Score=99.25 Aligned_cols=282 Identities=11% Similarity=0.057 Sum_probs=163.5 Q ss_pred hhhhhccccccc-ceeec------chhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCC---ccccccccccc Q lcl|NC_012784. 117 NDIQGGSLKTDS-GFVVI------PEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEV---AALEKVEELEE 186 (415) Q Consensus 117 ~~~~~~~~~~~~-~~~~v------P~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~---~~a~~v~Eg~~ 186 (415) .....+..+... +...+ |+.+.+.|.+.+......-.+.+... ...++.+.+.+.... ..+..|+|+++ T Consensus 1 ~~~~~~i~s~~~~~~itv~~ll~~P~~I~~~i~e~~~~~~iad~lf~~~~-a~~~~~v~f~~~~p~~~~~d~e~VaEggE 79 (318) T protein:vir:10 1 MTAPTGIVSVSDGPAITVRELVGNPLWIPTALKKMMVNQFISESLFRNGG-ANPNGVVAYNEGNPSFLEDDVADVAEFGE 79 (318) T ss_pred CCCCCcceeeecCCceehHHhhCCchhHHHHHHHHHhccchhhhhhhccc-ccccceeEEEecccccccCcHhhccCccc Confidence 111111112222 22222 77677777777665554444444322 223445555443322 46678999999 Q ss_pred ccccccccceeeEe-eeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccccccc Q lcl|NC_012784. 187 NPELAVKPFFQLAY-DINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKK 265 (415) Q Consensus 187 ~~~~~~~~f~~v~~-~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~ 265 (415) +|.. ..+++.-.+ ..+|++.-+.||+|+......+..+....+++..+.+..|+..+...-................. T Consensus 80 iP~~-~~~~G~~~ia~~~K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~~~~s~~w~~~~ 158 (318) T protein:vir:10 80 IPVS-AGARGLPRTAFAVKKALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPTLAVPTAWDNGG 158 (318) T ss_pred cccc-CCCCCchhhhhhehhccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCCcCCCCcc Confidence 9954 578877666 55799999999999999999999999999999999999999877655333211111111111100 Q ss_pred -ccccchhhHHHHHH---------HHHHhhhhccCCCEEEEcHHHHHHHHHhhc------cCCccccc-CcccCCCCcee Q lcl|NC_012784. 266 -LEVKKAKSLDDIKD---------AINLNVKPNYEHNVAIVSQTMFAKLDKMKD------KLGNYLIQ-PDVKEKTQQRL 328 (415) Q Consensus 266 -~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd------~~G~~l~~-~~~~~~~~~~l 328 (415) .........+.+.. ....-...++.++.++|||.+|..|.+-.+ .++.+++. ..+++.-++.+ T Consensus 159 ~~~~d~~~A~e~v~~a~~~~~~a~~~~~~~~~GY~pdtIVlhP~~~~~l~~n~~~~~~y~~~a~~~~~~~~~tg~~~g~~ 238 (318) T protein:vir:10 159 KVRTDIAIAIEQISTAAPTAYPAGVGSSDEYFGFIPDTIVMHYALLPILMDNENFMKVYERNANYVSTAPDWTGNFPGSV 238 (318) T ss_pred cccccchhhhhhhhhhhhhhhhhhhhhhhhccCccceeeEECHHHHHHHhcchhhhhhhhccchhhhhccccccccccee Confidence 00000011111111 111113456788999999999999965433 34555543 24455667889 Q ss_pred cceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecc--------cCceEEEEEEEeccEEeccccEEEEE Q lcl|NC_012784. 329 LGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM--------HFGECLMIAVRQDCRILDYKSAIVIE 400 (415) Q Consensus 329 ~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~--------~~~~~~~~~~r~d~~v~~p~a~~~~~ 400 (415) +|+.|+.+..+|.+. .++.+-...-.+.|..+++...-... +.....++..+....|.+|.|+|+|+ T Consensus 239 lGl~vi~s~~~p~~~-----alvlq~g~vG~~~d~~pl~~t~~~~egg~~~g~~~~s~~~~~~~~~~~~V~~PkA~~~it 313 (318) T protein:vir:10 239 MGLNVIRSRTFPIDR-----VLIMERGTVGFYSDTRPLQFTALYPEGNGPNGGPTESYRADASHKRALAVDQPKAALWLT 313 (318) T ss_pred eceEEeecCccCCCe-----eEEEecCCcceeeccccceeeecccCCCCCCCCcchhhheehheeeeeeeeCcceeEEEe Confidence 999999999999654 23333221122334445554332211 11122345556678899999999998 Q ss_pred eecCC Q lcl|NC_012784. 401 YDDSE 405 (415) Q Consensus 401 ~t~~~ 405 (415) =--+| T Consensus 314 gi~~~ 318 (318) T protein:vir:10 314 GIVTP 318 (318) T ss_pred eccCC Confidence 44444 No 125 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=99.47 E-value=1.1e-14 Score=97.14 Aligned_cols=264 Identities=13% Similarity=0.030 Sum_probs=162.4 Q ss_pred ccceeecchhHHhHHHHHHhhhhhhhhcceeEE--ccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeee Q lcl|NC_012784. 127 DSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR--VTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINT 204 (415) Q Consensus 127 ~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~--~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k 204 (415) .....++|+.++..+++.++....+..++.... ......++.+++...........++..++ ........++++..+ T Consensus 1 MA~~~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~~~~~d~~~~~~~~~-~~~~~~~~~~~tid~ 79 (273) T protein:vir:79 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTS-ADAISDTGVDLLIDQ 79 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecCcccccccccCCCccC-ccccccceEEEEEee Confidence 222336799999999999999888777664321 11111245666554444444567777654 345667788888866 Q ss_pred E-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhhHHHHHHHHHH Q lcl|NC_012784. 205 H-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINL 283 (415) Q Consensus 205 ~-a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 283 (415) . +.-+.|++.-...+..++.+ +.+++..++++++|..++.-....... .......+....++.+.++... T Consensus 80 ~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~vD~~i~~~~~~a~~~--------~~~~~~~~~~~~~~~i~~a~~~ 150 (273) T protein:vir:79 80 EKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTA--------LTGSAPSDADDAFDLIASALKE 150 (273) T ss_pred ecccceeeccHHHHhhcccHHH-HHHHHHHHHHHHHHHHHHHHHhhcccc--------cccccccchhhHHHHHHHHHHH Confidence 4 44556666444445567887 456788999999998776432221110 0011112233456778888877 Q ss_pred hhhhccC--CCEEEEcHHHHHHHHHhhccC-Cccccc--CcccCCCCceecceeeEEeccccccccCCceEEEechhhcE Q lcl|NC_012784. 284 NVKPNYE--HNVAIVSQTMFAKLDKMKDKL-GNYLIQ--PDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAI 358 (415) Q Consensus 284 ~~~~~~~--~~~~v~~~~~~~~l~~lkd~~-G~~l~~--~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~ 358 (415) +-..... +=.++++|..+..|.+..+-- ...... ..+..|..++|+|++|+.++.+|.+.. ..++.+-- .++ T Consensus 151 ld~~~vP~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~i~~s~~lp~~~~--~~~~a~~~-~A~ 227 (273) T protein:vir:79 151 LTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD--EQFVAFHP-SAA 227 (273) T ss_pred hhhccCCccCcEEEECHHHHHHHhhchhhhhhhhhcccccceeeeEeeEEeceEEEecccccccCc--eEEEEEec-cce Confidence 7666652 347999999999987643211 111111 124456678999999999999997543 22333332 233 Q ss_pred EEEeecceEEEEee-cccCceEEEEEEEeccEEeccccEEEEEeecC Q lcl|NC_012784. 359 VLFDRSQYQASWTD-YMHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) Q Consensus 359 ~~~~~~~~~i~~~~-~~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~ 404 (415) .... +...++..+ ...+.+.+++-+.+|.++++|++++.++.+.+ T Consensus 228 ~~a~-~~~~~e~~r~~~~~~~~v~~~~~yg~~v~~p~~vv~~~~~g~ 273 (273) T protein:vir:79 228 AYVS-QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred eeee-ehhhhhcccCcccceeeeeeeeeeeeEEecCceEEEEeccCC Confidence 3332 223343332 34566678888999999999999999988777 No 126 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=99.45 E-value=3.1e-14 Score=94.66 Aligned_cols=264 Identities=13% Similarity=0.028 Sum_probs=159.9 Q ss_pred ccceeecchhHHhHHHHHHhhhhhhhhcceeEE--ccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeee Q lcl|NC_012784. 127 DSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR--VTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINT 204 (415) Q Consensus 127 ~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~--~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k 204 (415) ..-..++|+.++..+++.++..+.+..++..-. ......++.+++...........++..++ ....+...++++..+ T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~-~~~~~~~~~~~tid~ 79 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTS-ADAISDTGVDLLIDQ 79 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccC-ccccccceEEEEEee Confidence 222345799999999999998888777664311 01111245555544433344455665543 234455667777655 Q ss_pred E-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhhHHHHHHHHHH Q lcl|NC_012784. 205 H-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINL 283 (415) Q Consensus 205 ~-a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 283 (415) . +.-+.|++.-...+..++.+ +.++...+++.++|..++.-....... .......+....++.+.++... T Consensus 80 ~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~~--------~~~~~~~~~~~~~~~i~~a~~~ 150 (273) T protein:vir:10 80 EKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTA--------LTGSAPTDADDAFDLIAKALKE 150 (273) T ss_pred eeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhccccc--------cccccccchhHHHHHHHHHHHH Confidence 3 34445665434444567887 556788999999998877532221110 0111122334557888888888 Q ss_pred hhhhccC--CCEEEEcHHHHHHHHHhhccCCc-ccc--cCcccCCCCceecceeeEEeccccccccCCceEEEechhhcE Q lcl|NC_012784. 284 NVKPNYE--HNVAIVSQTMFAKLDKMKDKLGN-YLI--QPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAI 358 (415) Q Consensus 284 ~~~~~~~--~~~~v~~~~~~~~l~~lkd~~G~-~l~--~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~ 358 (415) +-..... +-.++++|..+..|.+..+--.+ ... ...+..|..++|.|++|+.++.+|.+.. ...+.+.-+ ++ T Consensus 151 ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~--~~~~~~~~~-A~ 227 (273) T protein:vir:10 151 LTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD--EQFVAFHPS-AA 227 (273) T ss_pred hhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEecccccCCc--cEEEEEecc-ce Confidence 8766653 44799999999998764321111 111 1123456678999999999999997543 234444422 33 Q ss_pred EEEeecceEEEEee-cccCceEEEEEEEeccEEeccccEEEEEeecC Q lcl|NC_012784. 359 VLFDRSQYQASWTD-YMHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) Q Consensus 359 ~~~~~~~~~i~~~~-~~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~ 404 (415) .... +...++..+ ..++...+++-+.+|.++++|++++.++.+.+ T Consensus 228 ~~a~-q~~~~e~~r~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 228 AYVS-QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred eeee-eeehhhcccCCCcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 3332 223343332 34556667888899999999999999987777 No 127 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=99.45 E-value=3.1e-14 Score=94.66 Aligned_cols=264 Identities=13% Similarity=0.028 Sum_probs=159.9 Q ss_pred ccceeecchhHHhHHHHHHhhhhhhhhcceeEE--ccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeee Q lcl|NC_012784. 127 DSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR--VTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINT 204 (415) Q Consensus 127 ~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~--~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k 204 (415) ..-..++|+.++..+++.++..+.+..++..-. ......++.+++...........++..++ ....+...++++..+ T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~-~~~~~~~~~~~tid~ 79 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTS-ADAISDTGVDLLIDQ 79 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccC-ccccccceEEEEEee Confidence 222345799999999999998888777664311 01111245555544433344455665543 234455667777655 Q ss_pred E-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhhHHHHHHHHHH Q lcl|NC_012784. 205 H-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINL 283 (415) Q Consensus 205 ~-a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 283 (415) . +.-+.|++.-...+..++.+ +.++...+++.++|..++.-....... .......+....++.+.++... T Consensus 80 ~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~~--------~~~~~~~~~~~~~~~i~~a~~~ 150 (273) T protein:vir:10 80 EKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTA--------LTGSAPTDADDAFDLIAKALKE 150 (273) T ss_pred eeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhccccc--------cccccccchhHHHHHHHHHHHH Confidence 3 34445665434444567887 556788999999998877532221110 0111122334557888888888 Q ss_pred hhhhccC--CCEEEEcHHHHHHHHHhhccCCc-ccc--cCcccCCCCceecceeeEEeccccccccCCceEEEechhhcE Q lcl|NC_012784. 284 NVKPNYE--HNVAIVSQTMFAKLDKMKDKLGN-YLI--QPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAI 358 (415) Q Consensus 284 ~~~~~~~--~~~~v~~~~~~~~l~~lkd~~G~-~l~--~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~ 358 (415) +-..... +-.++++|..+..|.+..+--.+ ... ...+..|..++|.|++|+.++.+|.+.. ...+.+.-+ ++ T Consensus 151 ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~--~~~~~~~~~-A~ 227 (273) T protein:vir:10 151 LTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD--EQFVAFHPS-AA 227 (273) T ss_pred hhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEecccccCCc--cEEEEEecc-ce Confidence 8766653 44799999999998764321111 111 1123456678999999999999997543 234444422 33 Q ss_pred EEEeecceEEEEee-cccCceEEEEEEEeccEEeccccEEEEEeecC Q lcl|NC_012784. 359 VLFDRSQYQASWTD-YMHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) Q Consensus 359 ~~~~~~~~~i~~~~-~~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~ 404 (415) .... +...++..+ ..++...+++-+.+|.++++|++++.++.+.+ T Consensus 228 ~~a~-q~~~~e~~r~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 228 AYVS-QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred eeee-eeehhhcccCCCcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 3332 223343332 34556667888899999999999999987777 No 128 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=99.35 E-value=6.8e-14 Score=92.76 Aligned_cols=292 Identities=11% Similarity=0.024 Sum_probs=162.3 Q ss_pred HhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCC-ceeEEEEeecCCcccccccccccccccc Q lcl|NC_012784. 113 LETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNG-SGKYPVVRQSEVAALEKVEELEENPELA 191 (415) Q Consensus 113 ~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~a~~v~Eg~~~~~~~ 191 (415) +...+.......+.......+|+.++..|++.+.....+..+++....... ..++.+++. +.+.+....++..++ .. T Consensus 1 ~~~~~~~~~~~~~t~~v~~fipei~s~~i~~~l~~~~v~~~~~~d~~~~~~~Gdtv~ip~~-g~~~~~d~~~~~~i~-~~ 78 (341) T protein:vir:94 1 MALGNTITGPSINTQRGQQFIPEQWLSEVQMFRKAKMLDTSVVKTWGAQVKKGDTFHVPRI-SELGVEDKATDVPVG-VQ 78 (341) T ss_pred CcchhhhccccccchhHHHHHHHHHHHHHHHHHHhhcchhhccccccccccCCceEEEecc-CcceeeeecCCCccc-cc Confidence 000011111112333344468999999999999988888777654332211 124555543 456677777777765 34 Q ss_pred cccceeeEeeeeeE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc-cccccccccccc Q lcl|NC_012784. 192 VKPFFQLAYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSS-GFEKEGKKLEVK 269 (415) Q Consensus 192 ~~~f~~v~~~~~k~-a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~-~~~~~~~~~~~~ 269 (415) ..+-.+++++..+. +.-+.|++.-...+..|+.+.+.++..+++++++|+.|+.-........... ........+... T Consensus 79 ~~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~~~~~~~~~t~~~ 158 (341) T protein:vir:94 79 PVNDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNVFSSSNGAITGNG 158 (341) T ss_pred cccCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCccccCccccccCch Confidence 45667777777443 5556777766656677999999999999999999998875432221111111 111111112222 Q ss_pred chhhHHHHHHHHHHhhhhccCC--CEEEEcHHHHHHHHHhhccCCc-ccccCcccCCCCceecceeeEEeccccccccCC Q lcl|NC_012784. 270 KAKSLDDIKDAINLNVKPNYEH--NVAIVSQTMFAKLDKMKDKLGN-YLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGN 346 (415) Q Consensus 270 ~~~~~~~~~~~~~~~~~~~~~~--~~~v~~~~~~~~l~~lkd~~G~-~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~ 346 (415) ....++.++++...+-...... =.+|++|..+..|.+...-..+ +.-...+..|..++|+|++|+.++++|...... T Consensus 159 ~~~~~~~i~~a~~~Lde~~VP~~gR~lvv~P~~~~~Ll~~~~~~~~~~~g~~~l~~G~ig~i~G~~V~~Sn~lp~~~~~~ 238 (341) T protein:vir:94 159 QAFSFAVFLAARRLLLEADVPEEKIVLLISPGQESALFTIPQFISKDFINNAPIAQGQIGSLMGVRVIRTSLIGNNSATG 238 (341) T ss_pred hhhhHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhhchhhhhhhccccchhheeeeeeEeceEEEEecccccccccc Confidence 3345777888877776665543 3688899999999753211111 111123555667799999999999998654321 Q ss_pred ceEEE----------------------echhhc-EEEEeecce-EEEEee-----------------cc--cCceEEEEE Q lcl|NC_012784. 347 NTLII----------------------GNLKDA-IVLFDRSQY-QASWTD-----------------YM--HFGECLMIA 383 (415) Q Consensus 347 ~~~~~----------------------gd~~~~-~~~~~~~~~-~i~~~~-----------------~~--~~~~~~~~~ 383 (415) ...-. +++... .+.+-+..+ .++.-+ +. ++...+++- T Consensus 239 ~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~ 318 (341) T protein:vir:94 239 WRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVVSKAPRVTQSFENREQVWLMVGR 318 (341) T ss_pred ccccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhhccccccccccccchhhhhhhhhhhh Confidence 11100 011100 000111110 010000 00 111223444 Q ss_pred EEeccEEeccccEEEEEeecCCC Q lcl|NC_012784. 384 VRQDCRILDYKSAIVIEYDDSER 406 (415) Q Consensus 384 ~r~d~~v~~p~a~~~~~~t~~~~ 406 (415) .=+|.+++||++.+-+..++++- T Consensus 319 ~~~G~~~lrp~~~v~~~~~~~~~ 341 (341) T protein:vir:94 319 QAYGARLYRPLHAVNIHTTGDTV 341 (341) T ss_pred hhhcccccCcceeEEEecCcCCC Confidence 45799999999988887666555 No 129 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=99.34 E-value=1.2e-12 Score=85.90 Aligned_cols=299 Identities=6% Similarity=-0.063 Sum_probs=164.2 Q ss_pred HHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCcee Q lcl|NC_012784. 87 ANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGK 166 (415) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~ 166 (415) ...++..-..++. .+....... .+...-+ +..++++....+++.+.+.+++++.++++++.+.++. T Consensus 1 ~~~~~~~~~~~n~------------~~~~i~k~~-it~~~l~-~g~L~p~~a~~Fl~~v~~~t~iL~~~r~~~~~s~~~e 66 (360) T protein:vir:99 1 MSSNSTIDSVRNQ------------NMNSLSQKD-IGLAELD-GFQLPVDVTEEFLERMQKGVQILGMADTMTLARLEME 66 (360) T ss_pred CcchhHHHHHhhh------------HHHHHHhhh-ccccccC-ceeecHHHHHHHHHHHhhccchhhhcceeeccccccc Confidence 0000000001110 111111111 1222233 4566777899999999999999999999998888777 Q ss_pred EEEEeecCCcccccccccccccccccccceeeEee-eeeEEEeehhhHHHHhcc----hHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 167 YPVVRQSEVAALEKVEELEENPELAVKPFFQLAYD-INTHRGYFRISREAIEDA----KVNVLQELKLWMARTIAATRNK 241 (415) Q Consensus 167 ~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~-~~k~a~~~~iS~e~l~ds----~~~l~~~l~~~la~~~~~~~d~ 241 (415) ++-+.. +.-..-...|+++.++...++...+.+. .+++.....+..+-+++. ...+++.|++.++++++.-++. T Consensus 67 i~kig~-G~r~~r~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~ 145 (360) T protein:vir:99 67 VPQFGV-PRLSGHTRDEEGSRTENSEAESGSVKFNATDKSYYILVEPKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGL 145 (360) T ss_pred cccccc-ceeeccccccCCCCCcCCcCccccCccccccceeeEeechHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHH Confidence 553322 1111111223333333233444455553 345555556666665543 3457899999999999999999 Q ss_pred HHhhcccccccc-----ccccccccccccccc-----------------------------------c-----chhhHHH Q lcl|NC_012784. 242 AIIDVITKGSTG-----STSSGFEKEGKKLEV-----------------------------------K-----KAKSLDD 276 (415) Q Consensus 242 ~il~g~g~~~~~-----~~~~~~~~~~~~~~~-----------------------------------~-----~~~~~~~ 276 (415) ..++|+...... .........++-..+ . ...+..- T Consensus 146 l~~~g~~ds~d~~~~~~~d~fl~~~dGwlKka~~~~~~id~a~d~t~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~l 225 (360) T protein:vir:99 146 MGIRAGASSGNLQSIGGAAELDNTFKGWIARAEGDAQSVDDAGDSTRIGLEDTATADADSMPSIANTDGSGNPQPVDTSL 225 (360) T ss_pred HHhhccchhcccccCcccchhhhhhHHHHHHhhcccchhhccccccccccccccccccccchhhhccccccccccchHHH Confidence 999887653210 000000000000000 0 0012223 Q ss_pred HHHHHHHhhhhccCC----CEEEEcHHHHHHHH-HhhccCCcccccCcccCCCCceecceeeEEeccccccccCCceEEE Q lcl|NC_012784. 277 IKDAINLNVKPNYEH----NVAIVSQTMFAKLD-KMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLII 351 (415) Q Consensus 277 ~~~~~~~~~~~~~~~----~~~v~~~~~~~~l~-~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~ 351 (415) +.+++..++..|+.+ -+|+|||.+....+ .|.+-+. ++.-.-+.++..-+..|+||+.++.+|.+ .++| T Consensus 226 f~~~~~~Lp~kyr~~~~~~~~~~~s~~~~~~yr~~L~~R~t-~LGd~~l~g~~~~~~~Gipi~~v~~~pd~-----~~ml 299 (360) T protein:vir:99 226 FNETIQTLDSRYRESDAYSPVLMTSPNQVQSYTMSLTERED-PLGSAVIFGDSDITPFSYDLVGVNGFPDE-----YMMF 299 (360) T ss_pred HHHHHHhcchhhhcCcccceEEEccCchHHHHHHHHhccCc-ccchhheecccccccceeeeEEcCCCCCC-----ceEE Confidence 457777888887653 38999999866655 3443332 33211233333346789999999999864 3788 Q ss_pred echhhcEEEEeecceEEEEeec-cc---CceEEEE--EEEeccEEeccccEEEEEeecCCCCcc Q lcl|NC_012784. 352 GNLKDAIVLFDRSQYQASWTDY-MH---FGECLMI--AVRQDCRILDYKSAIVIEYDDSERGEG 409 (415) Q Consensus 352 gd~~~~~~~~~~~~~~i~~~~~-~~---~~~~~~~--~~r~d~~v~~p~a~~~~~~t~~~~~~~ 409 (415) -++++.+. +-...+++..+.+ +. .....+. ...+|..+..++|+|+++=-+.| .+ T Consensus 300 T~p~NLi~-g~~~~iri~~~~e~~~~~~~~~~~~~~~~~~~D~~iee~~Av~~vt~~~~~--~~ 360 (360) T protein:vir:99 300 TDPNNLAF-GLYEEMELDQSTDTDKVHEQRLHSRNWLEGQFDFQIKEQQAGVLVTDLETP--TA 360 (360) T ss_pred eccCceeE-EeeeeeEEeecccchhhhhhceeeeEEEEEEeeEEEEecccEEEEecCCCC--CC Confidence 89988543 4456677754322 21 1112333 35688899999999997733222 22 No 130 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=99.34 E-value=1.6e-13 Score=90.79 Aligned_cols=294 Identities=10% Similarity=0.003 Sum_probs=167.1 Q ss_pred HHHHhhhhhhhhc-ccccccce--eecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCccccccccccc Q lcl|NC_012784. 110 TEYLETRNDIQGG-SLKTDSGF--VVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEE 186 (415) Q Consensus 110 ~~~~~~~~~~~~~-~~~~~~~~--~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~ 186 (415) ............. +....++. .+.-+.++.+++......+.++.+.++..+.+++ ++.+++ .+...+.....|.. T Consensus 1 ~a~~~~~~~~~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~G~-sv~~~~-iG~~~~~~~~~g~~ 78 (347) T protein:vir:88 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGK-SASFPV-MGRTKGYYLAPGEN 78 (347) T ss_pred CCCcccchhhhccCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhhccccccccCcc-eEEEee-ecceeeeeeccccC Confidence 0000000000000 00111111 2233788999998888888889998887766443 444443 35556666677766 Q ss_pred cccc-ccccceeeEeeeeeE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccccc---------cccc Q lcl|NC_012784. 187 NPEL-AVKPFFQLAYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGS---------TGST 255 (415) Q Consensus 187 ~~~~-~~~~f~~v~~~~~k~-a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~---------~~~~ 255 (415) ...+ ..+...++++...++ +.-..|.+-=.-++.+|+.+.+.++..+++++..|+.|+.-..... ..+. T Consensus 79 l~~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~~~~~~~g~ 158 (347) T protein:vir:88 79 LDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGL 158 (347) T ss_pred CCCCCCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCCc Confidence 4322 235667777777664 2233444333334556889999999999999999998763221110 0010 Q ss_pred cccccccc------ccccccchhhHHHHHHHHHHhhhhccC--CCEEEEcHHHHHHHHHhh-ccCCcccccCcccCCCCc Q lcl|NC_012784. 256 SSGFEKEG------KKLEVKKAKSLDDIKDAINLNVKPNYE--HNVAIVSQTMFAKLDKMK-DKLGNYLIQPDVKEKTQQ 326 (415) Q Consensus 256 ~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~v~~~~~~~~l~~lk-d~~G~~l~~~~~~~~~~~ 326 (415) ..+..... ..........++.++++...+...... +=.+|++|..|..|.+-. .....+.-...+..+..+ T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~vg 238 (347) T protein:vir:88 159 GQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIR 238 (347) T ss_pred cccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCHHHHHHHhcchhhhhhhhccccchhcceee Confidence 00000000 011111223367788888777666654 337899999998876532 333344333345556678 Q ss_pred eecceeeEEeccccccccCCceE--------------------EEechhhcEEEE-e--------ecceEEEEee-cccC Q lcl|NC_012784. 327 RLLGAKIEILPDEVLGQKGNNTL--------------------IIGNLKDAIVLF-D--------RSQYQASWTD-YMHF 376 (415) Q Consensus 327 ~l~G~pV~~~~~~~~~~~~~~~~--------------------~~gd~~~~~~~~-~--------~~~~~i~~~~-~~~~ 376 (415) ++.|++|+.++++|.+..+.... +.+||...+.++ - -.++.++..+ ...+ T Consensus 239 ~i~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~d~~~e~~r~~~~~ 318 (347) T protein:vir:88 239 NVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEFQ 318 (347) T ss_pred eeccceEEEeecccccccccccccccccccccccccccccccccccccCcEEEEEechhhhhheecccceeeeeechhhH Confidence 89999999999999654432211 223444322221 1 1223343332 2344 Q ss_pred ceEEEEEEEeccEEeccccEEEEEeecCC Q lcl|NC_012784. 377 GECLMIAVRQDCRILDYKSAIVIEYDDSE 405 (415) Q Consensus 377 ~~~~~~~~r~d~~v~~p~a~~~~~~t~~~ 405 (415) ...+++..-+|.+++||++.+.+++++++ T Consensus 319 ~d~i~~~~~~G~~~~rPe~a~~~~~~~a~ 347 (347) T protein:vir:88 319 ADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) T ss_pred HHHhhhhhhhcCceeccceEEEEEeCCCC Confidence 45577888899999999999999999888 No 131 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=99.27 E-value=5.7e-13 Score=87.71 Aligned_cols=293 Identities=9% Similarity=0.012 Sum_probs=166.5 Q ss_pred HHHHhhhhhhhhc-ccccccce--eecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCccccccccccc Q lcl|NC_012784. 110 TEYLETRNDIQGG-SLKTDSGF--VVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEE 186 (415) Q Consensus 110 ~~~~~~~~~~~~~-~~~~~~~~--~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~ 186 (415) ............. .....++. .+.-+.++.+|.+.....+.++.+..+..+.+++ ++.+++ .+...+..+..|.. T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~rti~~G~-sv~~~~-iG~~~~~~~~~G~~ 78 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGDKLALFLKVFGGEVLTAFTRTSVTMNKHLVRSIQSGK-SAQFPV-LGRTKAAYLQPGEN 78 (347) T ss_pred CCccccccccccccccCCcccchHHHHHHHHhHHHHHHHHHHHhhhhhhhheeccccc-eEEeee-ccceeEeeeecCcC Confidence 0000000000000 00101111 1233789999999999999999999988776543 444443 46667777888877 Q ss_pred cccc-ccccceeeEeeeeeE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhc----ccccc-----cccc Q lcl|NC_012784. 187 NPEL-AVKPFFQLAYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDV----ITKGS-----TGST 255 (415) Q Consensus 187 ~~~~-~~~~f~~v~~~~~k~-a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g----~g~~~-----~~~~ 255 (415) ...+ ..+...+.++...++ ..-+.|.+-=--++.+|+.+.+.++...++++..|+.|+.- ..... +.+. T Consensus 79 l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~~~~~~g~ 158 (347) T protein:vir:94 79 LDDKRKDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTANNENIAGL 158 (347) T ss_pred CCCCcCCccccceEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccC Confidence 6432 245677777766654 22223333222345568999999999999999999987621 11100 0011 Q ss_pred cccccc-------ccccccccchhhHHHHHHHHHHhhhhccC--CCEEEEcHHHHHHHHHh-hccCCcccccCcccCCCC Q lcl|NC_012784. 256 SSGFEK-------EGKKLEVKKAKSLDDIKDAINLNVKPNYE--HNVAIVSQTMFAKLDKM-KDKLGNYLIQPDVKEKTQ 325 (415) Q Consensus 256 ~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~v~~~~~~~~l~~l-kd~~G~~l~~~~~~~~~~ 325 (415) +.+... ............++.++++...+...+.. +=.++++|..|..|.+. ....+.+....++..+.. T Consensus 159 ~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~R~~vv~P~~y~~LLk~~~~~~~~~~~~~~~~~G~V 238 (347) T protein:vir:94 159 GKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSDRVFYTTPDNYSAILAALMPNAANYQALIDPSTGSI 238 (347) T ss_pred CcceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCCCCCEEEeChHHHHHHHHhhccccccccccccccccee Confidence 100000 00111122344577888888887776664 23566689998887753 333344433345566777 Q ss_pred ceecceeeEEeccccccccCCce--------------------EEEechhhcEEEE---------eecceEEEEeeccc- Q lcl|NC_012784. 326 QRLLGAKIEILPDEVLGQKGNNT--------------------LIIGNLKDAIVLF---------DRSQYQASWTDYMH- 375 (415) Q Consensus 326 ~~l~G~pV~~~~~~~~~~~~~~~--------------------~~~gd~~~~~~~~---------~~~~~~i~~~~~~~- 375 (415) .++.|+||+.++++|....+... -+=+||.+.+-++ .-.+++++..+... T Consensus 239 ~~v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~~~~~e~~~~~~~ 318 (347) T protein:vir:94 239 RNVMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLKDMALERARRANF 318 (347) T ss_pred EEeeceEEEEcCccccccCcccccccccccccccccccccccccccccccceEEEEechhhhhhhhhcccceeeeechhh Confidence 89999999999999864321111 1223444322211 12334444443333 Q ss_pred CceEEEEEEEeccEEeccccEEEEEeecC Q lcl|NC_012784. 376 FGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) Q Consensus 376 ~~~~~~~~~r~d~~v~~p~a~~~~~~t~~ 404 (415) +...+.+..=+|.+++||++.+.+.++.+ T Consensus 319 ~~~~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 319 QADQIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred hhhhhhhhhhhcCcccccceeEEEEecCC Confidence 33346667778999999999999998887 No 132 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=99.26 E-value=1.6e-12 Score=85.32 Aligned_cols=297 Identities=11% Similarity=0.045 Sum_probs=167.0 Q ss_pred HHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCcccccccccccccc Q lcl|NC_012784. 110 TEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPE 189 (415) Q Consensus 110 ~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~ 189 (415) ....-....... ..+...-...+ +.++.+|.+.....+.++++..+.++.+++ ++.++.. +...+.....|++.- T Consensus 1 ms~~~~~tr~~~-~~s~~d~al~l-e~f~geV~~af~~~s~~~~~~~~rti~~g~-s~~~~~i-G~~~~~~~~pG~~l~- 75 (335) T protein:vir:63 1 MSFLNDLTRPNY-AGKNADVDIHL-EEHLGIVDKHFAYTSKFAPLMNIRDLRGSN-VVRLDRL-GNVEAKGRRAGEELE- 75 (335) T ss_pred CCCcccchhhhc-ccccchhheeh-hhhhhhHHHHHHhhhhhccccceeeeccce-eEEEeee-eeeeeecccCCcCcC- Confidence 000000000000 11222223333 789999999999999999999999887653 5666544 666777777777653 Q ss_pred cccccceeeEeeeeeEE-EeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHh----hcccccccc--------ccc Q lcl|NC_012784. 190 LAVKPFFQLAYDINTHR-GYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAII----DVITKGSTG--------STS 256 (415) Q Consensus 190 ~~~~~f~~v~~~~~k~a-~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il----~g~g~~~~~--------~~~ 256 (415) ...+..++..+....+- ....|.+-=--++.+|+.+.+.+++.+++++..|+.++ .+-....+. ++. T Consensus 76 ~~~~~~~k~~itVD~ll~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~~~~G~~ 155 (335) T protein:vir:63 76 RSRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGVL 155 (335) T ss_pred CCCccccceEEEecceeechhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCCCcCCCcc Confidence 33455677677766542 11222222222355689999999999999999999765 222211111 111 Q ss_pred cccccccccccccchhhHHHHHHHHHHhhhhccC-----CCEEEEcHHHHHHHHHhhccCCc-ccc---cCcccCCCCce Q lcl|NC_012784. 257 SGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYE-----HNVAIVSQTMFAKLDKMKDKLGN-YLI---QPDVKEKTQQR 327 (415) Q Consensus 257 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~v~~~~~~~~l~~lkd~~G~-~l~---~~~~~~~~~~~ 327 (415) ......+...........+.+..+...+...... .-+.+++|..|..|..-+.--.+ |.. ..++..+.... T Consensus 156 ~~~~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g~v~~ 235 (335) T protein:vir:63 156 EKLDLTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQATGATNDYVKSRVAI 235 (335) T ss_pred eeeeeccCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhccccccccccccccccccccCceeEE Confidence 1111111111111222234555666666655544 24799999999998874322122 111 11244455678 Q ss_pred ecceeeEEeccccccccC------CceEEEechhhcE-EEEeec--------ceEEEEeeccc-CceEEEEEEEeccEEe Q lcl|NC_012784. 328 LLGAKIEILPDEVLGQKG------NNTLIIGNLKDAI-VLFDRS--------QYQASWTDYMH-FGECLMIAVRQDCRIL 391 (415) Q Consensus 328 l~G~pV~~~~~~~~~~~~------~~~~~~gd~~~~~-~~~~~~--------~~~i~~~~~~~-~~~~~~~~~r~d~~v~ 391 (415) ++|+||+.++++|.++.. ....+=|||.... +++.+. +++.+..+... +...+.+..=+|.++. T Consensus 236 v~Gv~V~~sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~~~~~~i~~~~a~G~g~l 315 (335) T protein:vir:63 236 LNGVKVLETPRFATKAIAAHPLGRHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNEKFSWVLDTFQMYNIGAR 315 (335) T ss_pred eeceEEEeeccCCCCCcccccccccCCccccccceeEEEEEecceEEEEEEeecccceeeccchhhHHhHHHHHcCCccc Confidence 999999999999965321 1223345554322 222222 22222222222 2223444456899999 Q ss_pred ccccEEEEEeecCCCCcccccccC Q lcl|NC_012784. 392 DYKSAIVIEYDDSERGEGDLGLEA 415 (415) Q Consensus 392 ~p~a~~~~~~t~~~~~~~~~~~~~ 415 (415) ||++.+.+++ +|-|.|..|| T Consensus 316 RPe~a~~i~~----tg~~~~~~~~ 335 (335) T protein:vir:63 316 RPDTAGAIEL----KGIGAFDITA 335 (335) T ss_pred ccceEEEEEE----cCCCceeecC Confidence 9999999994 7888899999 No 133 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=99.25 E-value=1.2e-12 Score=85.88 Aligned_cols=291 Identities=9% Similarity=0.002 Sum_probs=162.9 Q ss_pred HHHHhhhhhhhhcccccccc--e--eecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCcccccccccc Q lcl|NC_012784. 110 TEYLETRNDIQGGSLKTDSG--F--VVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELE 185 (415) Q Consensus 110 ~~~~~~~~~~~~~~~~~~~~--~--~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~ 185 (415) ....-.............++ . .+.-+.++.++.......+.++++.++..+.+++ ++.++. .+...+.....|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~r~i~~gk-s~~~~~-iG~~~~~~~~~G~ 78 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVVAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGK-SAQFPV-LGRTQAAYLAPGE 78 (345) T ss_pred CcccccchhcccccccccccCCchhHHHHHHHhHHHHHHHHHHhhhcccceeeeccccc-eEEEee-ecceEEEeeecCC Confidence 00000000001111111111 1 2233778999999999999999999998888653 555553 3666778888887 Q ss_pred ccccc-ccccceeeEeeee--eEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccc---------cccc Q lcl|NC_012784. 186 ENPEL-AVKPFFQLAYDIN--THRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITK---------GSTG 253 (415) Q Consensus 186 ~~~~~-~~~~f~~v~~~~~--k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~---------~~~~ 253 (415) +...+ ..+...+.++... +++.. .|.+-=--++.+|+.+.+.+++.+++++..|+.|+.-... +.+. T Consensus 79 ~l~~~~~~~~~~e~~ltID~~~y~~~-~VddiD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~~~~~ 157 (345) T protein:vir:22 79 NLDDKRKDIKHTEKVITIDGLLTADV-LIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESKYNENIE 157 (345) T ss_pred CCCCCCCCcccceEEEEecchhhhhh-hHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Confidence 75432 1345677444443 33332 2222212235568999999999999999999987731111 1111 Q ss_pred cccccc----ccccc---cccccchhhHHHHHHHHHHhhhhccCCC--EEEEcHHHHHHHHHhhcc-CCcccccCcccCC Q lcl|NC_012784. 254 STSSGF----EKEGK---KLEVKKAKSLDDIKDAINLNVKPNYEHN--VAIVSQTMFAKLDKMKDK-LGNYLIQPDVKEK 323 (415) Q Consensus 254 ~~~~~~----~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~v~~~~~~~~l~~lkd~-~G~~l~~~~~~~~ 323 (415) +...+. ...+. .....+...++.++++...+........ .++++|..|..|..-+.- +..+.-..++..| T Consensus 158 ~~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G 237 (345) T protein:vir:22 158 GLGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAADRVFYCDPDSYSAILAALMPNAANYAALIDPEKG 237 (345) T ss_pred ccccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCccCCEEEeChHHHHHHhccccccccccccccccccc Confidence 111111 11111 1111233457888888887777666543 789999999988653322 2233322234445 Q ss_pred CCceecceeeEEeccccccccCC------------------c---------eEEEechhhcEEEEeecceEEEEeec-cc Q lcl|NC_012784. 324 TQQRLLGAKIEILPDEVLGQKGN------------------N---------TLIIGNLKDAIVLFDRSQYQASWTDY-MH 375 (415) Q Consensus 324 ~~~~l~G~pV~~~~~~~~~~~~~------------------~---------~~~~gd~~~~~~~~~~~~~~i~~~~~-~~ 375 (415) ...+++|.+|+.++++|.+..+. . +.++...+ ++....-.+++++..+. .+ T Consensus 238 ~V~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~l~~h~~-A~~~v~~~~~~~e~~r~~~~ 316 (345) T protein:vir:22 238 SIRNVMGFEVVEVPHLTAGGAGTAREGTTGQKHVFPANKGEGNVKVAKDNVIGLFMHRS-AVGTVKLRDLALERARRANF 316 (345) T ss_pred eEEEEeceEEEecccccccccCccccCcccccccccccccceeeeeccCceEEEEEehh-heeeeeeecceeeeeechhH Confidence 56789999999999988542211 0 11111111 12222223344444432 23 Q ss_pred CceEEEEEEEeccEEeccccEEEEEeecC Q lcl|NC_012784. 376 FGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) Q Consensus 376 ~~~~~~~~~r~d~~v~~p~a~~~~~~t~~ 404 (415) +...+++..=+|.+++||++.+.+++.-. T Consensus 317 ~~d~I~~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 317 QADQIIAKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred HHHHHHHHHhcCCcccccceeEEEEEeeC Confidence 33446667778999999999999998866 No 134 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=99.24 E-value=9.9e-13 Score=86.38 Aligned_cols=288 Identities=10% Similarity=0.012 Sum_probs=161.6 Q ss_pred hhhhhhcc-----cccccceeecc-hhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCcccccccccccccc Q lcl|NC_012784. 116 RNDIQGGS-----LKTDSGFVVIP-EEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPE 189 (415) Q Consensus 116 ~~~~~~~~-----~~~~~~~~~vP-~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~ 189 (415) +.....+. ...+++.+.++ +.++.+|.......+.++++..+..+.+++ ++.++. .+...+....-|+.+.. T Consensus 1 m~~~~~~~~t~~~~~~~~~~~~l~le~~~geV~~af~~~s~~~~~~~~r~i~~G~-s~~~~~-iG~~~~~~~~~g~~l~~ 78 (334) T protein:vir:80 1 MTYPAANTHTRPGWGGANSDVSLHIEEHLGLVDASFMYSSKFASWMNVRSLRGTN-QLRVDR-VGASTIAGRKAGEELVV 78 (334) T ss_pred CCCCcCCCccccccccccchheehhhhhhhHHHHHHHHhhhhhccceeeeccccc-eEEEee-ecceeeeeecCCCCCCC Confidence 11110010 11222223344 789999999999999999999998887653 555553 36667777777877643 Q ss_pred cccccceeeEeeeeeE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHh----hcccccccc--------ccc Q lcl|NC_012784. 190 LAVKPFFQLAYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAII----DVITKGSTG--------STS 256 (415) Q Consensus 190 ~~~~~f~~v~~~~~k~-a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il----~g~g~~~~~--------~~~ 256 (415) ....-++.++....+ ..-..|.+-=--++.+|+.+.+.+++.++++++.|+.++ .+.....+. +.. T Consensus 79 -~~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~~~~~~G~~ 157 (334) T protein:vir:80 79 -QKNVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLKPAFHDGIL 157 (334) T ss_pred -CCcccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccCCcc Confidence 345567777776653 233333332223455689999999999999999999765 222111110 111 Q ss_pred cccccccccc--cccchhhHHHHHHHHHHhhhhccC-----CCEEEEcHHHHHHHHHhhccCCc-cccc---CcccCCCC Q lcl|NC_012784. 257 SGFEKEGKKL--EVKKAKSLDDIKDAINLNVKPNYE-----HNVAIVSQTMFAKLDKMKDKLGN-YLIQ---PDVKEKTQ 325 (415) Q Consensus 257 ~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~v~~~~~~~~l~~lkd~~G~-~l~~---~~~~~~~~ 325 (415) ......+... .......++.+..+...+...... .=.++++|..|..|..-+.--.+ |.-. .+...+.. T Consensus 158 ~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d~~~s~~~~~~~~g~i 237 (334) T protein:vir:80 158 LPSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNVEFGAKEGGNSFVGGRI 237 (334) T ss_pred eeecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccccceeccccccccccceeE Confidence 1111111110 111111234445555555555544 24789999999998764211111 1111 12334446 Q ss_pred ceecceeeEEecccccccc------CCceEEEechhhcEEE-Eeecc--------eEEEEeec-ccCceEEEEEEEeccE Q lcl|NC_012784. 326 QRLLGAKIEILPDEVLGQK------GNNTLIIGNLKDAIVL-FDRSQ--------YQASWTDY-MHFGECLMIAVRQDCR 389 (415) Q Consensus 326 ~~l~G~pV~~~~~~~~~~~------~~~~~~~gd~~~~~~~-~~~~~--------~~i~~~~~-~~~~~~~~~~~r~d~~ 389 (415) .+++|+||+.++++|..+. +....+-|||+..... +.++. ++.+..++ ..+...+.+..-+|.+ T Consensus 238 ~~v~G~~V~~Sn~~P~~~~t~~~~g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~~d~i~~~~a~G~g 317 (334) T protein:vir:80 238 AMLNGVRVVETPRFPQSAITANALGADFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFWEEKKDFGHYLDTFQSYNIG 317 (334) T ss_pred EEEeceEEEeecCCCCccccccccccccccccccccceEEEEEeCceEEEEEEeecceeeeechhhHHHHHHHHHHcCCc Confidence 7899999999999996531 2223455676654322 22222 22222221 1222223444567999 Q ss_pred EeccccEEEEEeecCCC Q lcl|NC_012784. 390 ILDYKSAIVIEYDDSER 406 (415) Q Consensus 390 v~~p~a~~~~~~t~~~~ 406 (415) ++||+|.+.++|+-+-+ T Consensus 318 ~lRPeaa~vv~~~~~~~ 334 (334) T protein:vir:80 318 QRRPDAVAVHDITVTNP 334 (334) T ss_pred eeccceEEEEEEeeecC Confidence 99999999999886555 No 135 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=99.21 E-value=1.9e-11 Score=79.38 Aligned_cols=295 Identities=11% Similarity=0.045 Sum_probs=159.9 Q ss_pred hhhhhhcc---cccccceeecc-hhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCcccccccccccccccc Q lcl|NC_012784. 116 RNDIQGGS---LKTDSGFVVIP-EEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELA 191 (415) Q Consensus 116 ~~~~~~~~---~~~~~~~~~vP-~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~ 191 (415) +....... ...++....+. +.+..++.+.....+.++++..+..+.+++ ++.++.. +...+.....|+.. ... T Consensus 1 ms~~n~~t~~~~~~~~~~~al~le~f~geV~taf~~~s~~~~~~~~rti~~gk-S~q~~~i-G~~~~~~~~~G~~l-d~~ 77 (364) T protein:vir:10 1 MSNPNVLTQPAVSASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDVQEVVGTN-SVSNKYI-GETELQVLSPGKSP-DAS 77 (364) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccc-eEEeeee-eeeEEeeeccCccc-CCC Confidence 11111111 11111222233 678899999998899999999998887654 5555543 55555666666553 344 Q ss_pred cccceeeEeeeeeEEE-eehhh--HHHHhcchHH-HHHHHHHHHHHHHHHHHHHHHhhcc--c---c----cc-cccccc Q lcl|NC_012784. 192 VKPFFQLAYDINTHRG-YFRIS--REAIEDAKVN-VLQELKLWMARTIAATRNKAIIDVI--T---K----GS-TGSTSS 257 (415) Q Consensus 192 ~~~f~~v~~~~~k~a~-~~~iS--~e~l~ds~~~-l~~~l~~~la~~~~~~~d~~il~g~--g---~----~~-~~~~~~ 257 (415) .+.-++.++....+-- ...|- +|.. +.++ +.+.+.+++.+++++..|+.++.-. . . .. +...+. T Consensus 78 ~~~~~k~~itID~ll~a~~~V~diDe~q--~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~~~~~~ 155 (364) T protein:vir:10 78 PTEFDKNRLVVDTTVIARNTVAHFHDVQ--NDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNPRVAGH 155 (364) T ss_pred CcccCcEEEEecceeeechhhhhHHHHh--cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCCcccCC Confidence 5566676666655421 11221 2333 3355 6889999999999999999875211 0 0 00 000011 Q ss_pred cccccccccc----ccchhhHHHHHHHHHHhhhhccC--CCEEEEcHHHHHHHHHhhcc-CCcccc--cCcccCCCCcee Q lcl|NC_012784. 258 GFEKEGKKLE----VKKAKSLDDIKDAINLNVKPNYE--HNVAIVSQTMFAKLDKMKDK-LGNYLI--QPDVKEKTQQRL 328 (415) Q Consensus 258 ~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~--~~~~v~~~~~~~~l~~lkd~-~G~~l~--~~~~~~~~~~~l 328 (415) +.......+. ......++.+.++...+-..+.. .=+++++|..|..|.+-.+= +-.|.. ..++..+....+ T Consensus 156 g~~i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~~R~~vv~P~~y~~Ll~~~~lvn~d~~~~~~~~~~~G~v~~v 235 (364) T protein:vir:10 156 GFSIHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDTSELCGLMPWTAFNCLRDADRIVDKSYTIAASDNTVDGFVLKS 235 (364) T ss_pred cceeeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCccccEEEeChHHHHHHhcCCccccccccccCCCccccceeEEE Confidence 1101111111 11122345555666666665554 34789999999888763110 001110 122344555689 Q ss_pred cceeeEEecccccccc------------------CCceEEEechhhc-EEEEee--------cceEEEEeecccC-ceEE Q lcl|NC_012784. 329 LGAKIEILPDEVLGQK------------------GNNTLIIGNLKDA-IVLFDR--------SQYQASWTDYMHF-GECL 380 (415) Q Consensus 329 ~G~pV~~~~~~~~~~~------------------~~~~~~~gd~~~~-~~~~~~--------~~~~i~~~~~~~~-~~~~ 380 (415) .|+||+.++++|..+. ++..-+.|||... ..+|.+ .+++.+..+.... ...+ T Consensus 236 ~Gv~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e~~~~~~~~~~~i 315 (364) T protein:vir:10 236 WNTPIVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISITGDIFYEKKEKTWYI 315 (364) T ss_pred eceEEEeccccccccccccccccccccccccccCCcccccccccceeEEEEEecceEEEEEEecceeeeeeccceeeeee Confidence 9999999999985311 1111233454432 222322 3444444433332 3334 Q ss_pred EEEEEeccEEeccccEEEEEeecCCCCccccccc-C Q lcl|NC_012784. 381 MIAVRQDCRILDYKSAIVIEYDDSERGEGDLGLE-A 415 (415) Q Consensus 381 ~~~~r~d~~v~~p~a~~~~~~t~~~~~~~~~~~~-~ 415 (415) .+..=+|.+++||++.+.++..++..+.-|--+. | T Consensus 316 da~~a~G~g~lRPeaa~~i~~~~~~~~~~~~~~~~~ 351 (364) T protein:vir:10 316 DTFLAEGAIPDRWEAVAVVTAADTAELATDHNAILA 351 (364) T ss_pred eeehcccCcccCccceEEEEecCCCCCccchhhhhh Confidence 5556689999999999999877766666654332 2 No 136 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=99.19 E-value=2e-12 Score=84.71 Aligned_cols=293 Identities=9% Similarity=0.026 Sum_probs=159.4 Q ss_pred HHHHhhhhhhhhccccc----ccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCcccccccccc Q lcl|NC_012784. 110 TEYLETRNDIQGGSLKT----DSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELE 185 (415) Q Consensus 110 ~~~~~~~~~~~~~~~~~----~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~ 185 (415) ................. .+--.+.-+.++.++.......+.++++.++..+.+++ ++.++.. +...+..+..|+ T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~g~-s~~~~~i-G~~~~~~~~~G~ 78 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGK-SAQFPVL-GRTQAAYLAPGE 78 (344) T ss_pred CccccccccCCcccCCccCCccchhHHHHHHHHHHHHHHHHHHhhhcccceeeeecccc-eEEEEee-ceeEEEeeecCC Confidence 00000000000000000 00001122678999999999999999999998888653 5555543 556677778887 Q ss_pred cccccc-cccceeeEeeeeeE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccc---------ccccc Q lcl|NC_012784. 186 ENPELA-VKPFFQLAYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITK---------GSTGS 254 (415) Q Consensus 186 ~~~~~~-~~~f~~v~~~~~k~-a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~---------~~~~~ 254 (415) +...+. .+.-.++++...++ ..-..|.+-=--++.+++.+.+.+++.+++++..|+.++.-... ..+.+ T Consensus 79 ~l~~t~~~~~~~e~~l~ID~~~y~~~~VdDiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~~~~~~~~g 158 (344) T protein:vir:10 79 NLDDIRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESQYNENITG 158 (344) T ss_pred CCCCCCCCcccceEEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccc Confidence 765331 24456666665542 12222222222235568999999999999999999887532110 00111 Q ss_pred ccccc----ccccccc---cccchhhHHHHHHHHHHhhhhccCCC--EEEEcHHHHHHHHHhhccC-CcccccCcccCCC Q lcl|NC_012784. 255 TSSGF----EKEGKKL---EVKKAKSLDDIKDAINLNVKPNYEHN--VAIVSQTMFAKLDKMKDKL-GNYLIQPDVKEKT 324 (415) Q Consensus 255 ~~~~~----~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~--~~v~~~~~~~~l~~lkd~~-G~~l~~~~~~~~~ 324 (415) ...+. ....... ...+...++.++++...+...+.... .+|++|..|..|..-+.-+ +.+.-...+..|. T Consensus 159 ~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~ 238 (344) T protein:vir:10 159 LGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPSSDRVFYCDPDSYSAILAALMPNAANYAALIDPEKGS 238 (344) T ss_pred ccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCCEEEeChHHHHHHhhcccccccccccccceeeeE Confidence 11110 0111111 11112346777778777777666432 5777999999886532211 2222122344455 Q ss_pred CceecceeeEEeccccccccCCc-e---------------EEEechhhcE-E--------EEeecceEEEEee-cccCce Q lcl|NC_012784. 325 QQRLLGAKIEILPDEVLGQKGNN-T---------------LIIGNLKDAI-V--------LFDRSQYQASWTD-YMHFGE 378 (415) Q Consensus 325 ~~~l~G~pV~~~~~~~~~~~~~~-~---------------~~~gd~~~~~-~--------~~~~~~~~i~~~~-~~~~~~ 378 (415) ..+++|+||+.++++|.+..... . .+..+|++.+ . .+.-.+++++..+ ..++.. T Consensus 239 V~~v~G~~V~~Sn~lp~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~~~e~~r~~~~~~d 318 (344) T protein:vir:10 239 IRNVMGFEVVEVPHLTAGGAGTSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLALERARRANFQAD 318 (344) T ss_pred EEEEeceEEEeccccccccCCcccccccCccccccCCcccceeeecceeEEEeechhhhhhhhhccceeecccchhHHHH Confidence 67899999999999986432211 1 1122333211 1 1111233444333 233444 Q ss_pred EEEEEEEeccEEeccccEEEEEeecC Q lcl|NC_012784. 379 CLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) Q Consensus 379 ~~~~~~r~d~~v~~p~a~~~~~~t~~ 404 (415) .+++..=+|.+++||++.+.+++++. T Consensus 319 ~i~g~~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 319 QIIAKYAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred HHHHHhhcccceecccceEEEEeecC Confidence 56677788999999999999999887 No 137 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=99.19 E-value=4.8e-12 Score=82.64 Aligned_cols=297 Identities=11% Similarity=0.033 Sum_probs=163.6 Q ss_pred HHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCcccccccccccccc Q lcl|NC_012784. 110 TEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPE 189 (415) Q Consensus 110 ~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~ 189 (415) ....-....... ..+...-...+ +.++.+|.+.....+.++++..+..+.+++ ++.++.. +...+.....|++. . T Consensus 1 ms~~~~~t~~~~-~~s~~d~al~l-e~f~geV~~af~~~s~~~~~~~~rti~~g~-s~~~~~i-G~~~~~~~~pG~~l-~ 75 (335) T protein:vir:78 1 MSFLNDLTRPNY-AGKNADVDIHL-EEHLGIVDKHFAYTSKFAPLMNIRDLRGSN-VVRLDRL-GNVEAKGRRAGEEL-E 75 (335) T ss_pred CCcccccccccc-ccccchhhhhh-hhhhhHHHHHHHHhhhhccccceeeeccce-eEEEeee-eeeeecccccCccc-C Confidence 000000000000 11122222333 789999999999999999999998887653 5666543 66677777777765 3 Q ss_pred cccccceeeEeeeeeEE-EeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHh----hcccccccc--------ccc Q lcl|NC_012784. 190 LAVKPFFQLAYDINTHR-GYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAII----DVITKGSTG--------STS 256 (415) Q Consensus 190 ~~~~~f~~v~~~~~k~a-~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il----~g~g~~~~~--------~~~ 256 (415) ...+..++..+....+- ....|.+-=--++.+|+.+.+.+++.+++++..|+.++ .+.....+. +.. T Consensus 76 ~~~~~~~k~~itID~ll~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~~~~G~~ 155 (335) T protein:vir:78 76 RSRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGVL 155 (335) T ss_pred CCCcccCCeEEEecceeechhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCcCCCcc Confidence 44456677777766542 11222222122356689999999999999999999765 222211110 110 Q ss_pred cccccccccccccchhhHHHHHHHHHHhhhhccC-----CCEEEEcHHHHHHHHHhhccCCc-ccc---cCcccCCCCce Q lcl|NC_012784. 257 SGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYE-----HNVAIVSQTMFAKLDKMKDKLGN-YLI---QPDVKEKTQQR 327 (415) Q Consensus 257 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~v~~~~~~~~l~~lkd~~G~-~l~---~~~~~~~~~~~ 327 (415) ......+...........+.+.++...+...... .=+.+++|..|..|..-+.--.+ |.. ..++..+.... T Consensus 156 ~~~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g~v~~ 235 (335) T protein:vir:78 156 EKLDLTGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMSVEYQATGATNDYVKSRVAI 235 (335) T ss_pred eeeeeccccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhcccccccccccccccccccccceeEE Confidence 0011111111122223344555555555544443 23689999999998874221111 111 12344556678 Q ss_pred ecceeeEEeccccccccCC------ceEEEechhh-cEEEEeec--------ceEEEEeeccc-CceEEEEEEEeccEEe Q lcl|NC_012784. 328 LLGAKIEILPDEVLGQKGN------NTLIIGNLKD-AIVLFDRS--------QYQASWTDYMH-FGECLMIAVRQDCRIL 391 (415) Q Consensus 328 l~G~pV~~~~~~~~~~~~~------~~~~~gd~~~-~~~~~~~~--------~~~i~~~~~~~-~~~~~~~~~r~d~~v~ 391 (415) ++|+||+.++++|.++... ....=+||.. ..+++.+. ++..+..+... +...+.+..=+|.++. T Consensus 236 v~Gv~V~~Sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~~~~i~~~~a~G~g~l 315 (335) T protein:vir:78 236 LNGVKVLETPRFATKAISAHPLGRHFNVSAEEAERQIALFLPSKTLITAQVAPVQAKLWEDHDQFSWVLDTFQMYNIGAR 315 (335) T ss_pred eeceEEEeeccCCCCCCccccccccCCcccccccceEEEEEecceEEEEEEEecccceeeccchhhHhhhHHHHcCCccc Confidence 9999999999999653211 1122234433 12222222 22222222222 2223444556899999 Q ss_pred ccccEEEEEeecCCCCcccccccC Q lcl|NC_012784. 392 DYKSAIVIEYDDSERGEGDLGLEA 415 (415) Q Consensus 392 ~p~a~~~~~~t~~~~~~~~~~~~~ 415 (415) ||++.+.++ .+|-+.|..|| T Consensus 316 RPe~a~~i~----~tg~~~~~~~~ 335 (335) T protein:vir:78 316 RPDTAGAIE----LKGIEAFDITA 335 (335) T ss_pred CcceEEEEE----ecCCCcccccC Confidence 999999999 55677788888 No 138 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=99.16 E-value=6.1e-12 Score=82.05 Aligned_cols=293 Identities=9% Similarity=0.023 Sum_probs=158.6 Q ss_pred HHHHhhhhhhhhcccccc---cce-e-ecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCccccccccc Q lcl|NC_012784. 110 TEYLETRNDIQGGSLKTD---SGF-V-VIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEEL 184 (415) Q Consensus 110 ~~~~~~~~~~~~~~~~~~---~~~-~-~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg 184 (415) ........ ..++.... .+. . +.-+.++.++.......+.++.+.++..+.+++ ++.+++ .+...+.....| T Consensus 1 ~~~~~~~~--~~~t~~g~~~~~~~~~al~ie~~~g~V~~~f~~~s~~~~~v~~r~~~~G~-sv~i~~-iG~~t~~~~~~g 76 (347) T protein:vir:33 1 MANIQGGQ--QIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGK-SAQFPV-IGRTKAAYLKPG 76 (347) T ss_pred CCCCccCc--ccccccccCCcccchHHHHHHHHHHHHHHHHHHHHhhhhhhccccccccc-eeEeee-ccceeeeeecCC Confidence 00000000 00000000 111 1 223788999998888889889998887766442 444443 355566677777 Q ss_pred cccccc-ccccceeeEeeeeeEE-EeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhc-----cccccc----- Q lcl|NC_012784. 185 EENPEL-AVKPFFQLAYDINTHR-GYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDV-----ITKGST----- 252 (415) Q Consensus 185 ~~~~~~-~~~~f~~v~~~~~k~a-~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g-----~g~~~~----- 252 (415) ..++.. ......+.++...+.- .-..|.+-=--++..++.+.+.++...++++..|+.|+.- .....+ T Consensus 77 ~~l~~~~~~~~~~e~~ltiD~~~y~~~~VddiD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~ 156 (347) T protein:vir:33 77 ENLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGSNENIE 156 (347) T ss_pred CCCCCCCCCCccceEEEEechhhhhhHHHhhHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc Confidence 765321 1245566666554332 1122322222235568899999999999999999988621 111000 Q ss_pred c-ccccccc----cccccc--cccchhhHHHHHHHHHHhhhhccC--CCEEEEcHHHHHHHHHhh-ccCCcccccCcccC Q lcl|NC_012784. 253 G-STSSGFE----KEGKKL--EVKKAKSLDDIKDAINLNVKPNYE--HNVAIVSQTMFAKLDKMK-DKLGNYLIQPDVKE 322 (415) Q Consensus 253 ~-~~~~~~~----~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~--~~~~v~~~~~~~~l~~lk-d~~G~~l~~~~~~~ 322 (415) . +...+.. ..+... ...+...++.++++...+...... +=.++++|..|..|.+-. -.+..|.-...+.. T Consensus 157 ~~~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~d~~~~~~~~~ 236 (347) T protein:vir:33 157 GLGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQALLDPER 236 (347) T ss_pred cccccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCcEEEeCHHHHHHHhcccccccccccccccccc Confidence 0 0000000 000000 011223467778888888777664 337899999999987532 22233322234555 Q ss_pred CCCceecceeeEEeccccccccCCc---------eE--------EEechhhcE-EE--------EeecceEEEEeec-cc Q lcl|NC_012784. 323 KTQQRLLGAKIEILPDEVLGQKGNN---------TL--------IIGNLKDAI-VL--------FDRSQYQASWTDY-MH 375 (415) Q Consensus 323 ~~~~~l~G~pV~~~~~~~~~~~~~~---------~~--------~~gd~~~~~-~~--------~~~~~~~i~~~~~-~~ 375 (415) |..++++|++|+.++++|..+.... .. +-++|+... .+ +.-.+++++..+. .+ T Consensus 237 G~V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e~~r~~~~ 316 (347) T protein:vir:33 237 GTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRANY 316 (347) T ss_pred ceeEEEeceeEEEecccccCccccccccccccccccccCCcccceeccccceeeeeecchhheeeeeeceeeeeccchhh Confidence 6667899999999999986532211 11 112222110 11 1122334444332 33 Q ss_pred CceEEEEEEEeccEEeccccEEEEEeecCCC Q lcl|NC_012784. 376 FGECLMIAVRQDCRILDYKSAIVIEYDDSER 406 (415) Q Consensus 376 ~~~~~~~~~r~d~~v~~p~a~~~~~~t~~~~ 406 (415) +...+++-..+|.+++||++.+.+++..-.. T Consensus 317 ~~d~i~~~~~~G~~vlrP~~av~i~~~~~~~ 347 (347) T protein:vir:33 317 QADQIIAKYAMGHGGLRPEAAGAIVLPKVSE 347 (347) T ss_pred hhHhhhhhhhcCCceecccceEEEecCCCCC Confidence 3445677777899999999999998876555 No 139 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=99.16 E-value=1.7e-12 Score=85.08 Aligned_cols=291 Identities=10% Similarity=0.015 Sum_probs=153.8 Q ss_pred Hhhhhhhhhcccccc---cce--eecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCcccccccccccc Q lcl|NC_012784. 113 LETRNDIQGGSLKTD---SGF--VVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEEN 187 (415) Q Consensus 113 ~~~~~~~~~~~~~~~---~~~--~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~ 187 (415) +.-......++.... ++. .+.-+.+..+++......+.++++.++.++.+++ ++.+++. +...+.....|+.. T Consensus 1 m~~~~~~~~~t~~g~~~~~~d~~al~ik~f~~eV~~~f~~~s~~~~~~~~r~i~~G~-sv~i~~i-G~~tv~~~t~G~~l 78 (347) T protein:vir:94 1 MANVPGQKIGTDQGKGKSSSDALALFLKVFAGEVLTAFTRRSVTADKHIVRTIQNGK-SAQFPVM-GRTSGVYLAPGERL 78 (347) T ss_pred CCCCCccccccccccCCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccccc-eEEEecc-cceeeeeecCCCCc Confidence 000000000000111 111 1223678888888888888888898888876543 4555443 56677777777765 Q ss_pred ccc-ccccceeeEeeeeeEE-EeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccc-----cccc----cccc Q lcl|NC_012784. 188 PEL-AVKPFFQLAYDINTHR-GYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVIT-----KGST----GSTS 256 (415) Q Consensus 188 ~~~-~~~~f~~v~~~~~k~a-~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g-----~~~~----~~~~ 256 (415) +.. ....-.++++...++- ....|.+-=--++.+++.+.+.++...++++..|+.|+.-.. .+.. .+.. T Consensus 79 ~~~~~~~~~~e~~itID~~~~~~~~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~~~~~g~~ 158 (347) T protein:vir:94 79 SDKRKGIKHTEKVITIDGLLTADVMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAASNENIAGLG 158 (347) T ss_pred CCCCCCCCcceEEEEecchhhhhHHhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCc Confidence 321 1123345445544331 111222211123456789999999999999999998763111 1111 1100 Q ss_pred ccccccccccc------ccchhhHHHHHHHHHHhhhhccCC--CEEEEcHHHHHHHHHhhccCC-cccccCcccCCCCce Q lcl|NC_012784. 257 SGFEKEGKKLE------VKKAKSLDDIKDAINLNVKPNYEH--NVAIVSQTMFAKLDKMKDKLG-NYLIQPDVKEKTQQR 327 (415) Q Consensus 257 ~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~--~~~v~~~~~~~~l~~lkd~~G-~~l~~~~~~~~~~~~ 327 (415) .+......... ......++.++++...+....... =.+|++|..|..|..-++-+. .+.-...+..|..++ T Consensus 159 ~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg~ 238 (347) T protein:vir:94 159 TASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILAALMPNAANYAALIDPETGNIRN 238 (347) T ss_pred ccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhccchhhhhhccccccccccceEE Confidence 01000000000 111223566667777776655532 378899999988754332222 222222345566689 Q ss_pred ecceeeEEeccccccccCC-------------ceE--------EEechhhcEEE-Eee--------cceEEEEee-cccC Q lcl|NC_012784. 328 LLGAKIEILPDEVLGQKGN-------------NTL--------IIGNLKDAIVL-FDR--------SQYQASWTD-YMHF 376 (415) Q Consensus 328 l~G~pV~~~~~~~~~~~~~-------------~~~--------~~gd~~~~~~~-~~~--------~~~~i~~~~-~~~~ 376 (415) ++|++|+.++++|....+. ... +-+||+..+.+ |-+ .+++++..+ ..++ T Consensus 239 i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~~~~~ 318 (347) T protein:vir:94 239 VMGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDLALERDRDVDAQ 318 (347) T ss_pred EeceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhcccccccchhchhhH Confidence 9999999999998532211 111 22223221111 111 122333322 2344 Q ss_pred ceEEEEEEEeccEEeccccEEEEEeecCC Q lcl|NC_012784. 377 GECLMIAVRQDCRILDYKSAIVIEYDDSE 405 (415) Q Consensus 377 ~~~~~~~~r~d~~v~~p~a~~~~~~t~~~ 405 (415) ...+++..-+|.+++||++.+.++++++- T Consensus 319 ~d~i~~~~~~G~~~~rP~~a~~~~~~~A~ 347 (347) T protein:vir:94 319 GDLIVGKYAMGHGGLRPEAAGALVFSPAE 347 (347) T ss_pred HHHhhhhhhhcCcccccceeEEEEecCCC Confidence 55678888899999999999999888555 No 140 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=99.14 E-value=9.5e-12 Score=81.00 Aligned_cols=299 Identities=9% Similarity=0.026 Sum_probs=153.6 Q ss_pred HHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCC-ceeEEEEeecCCccccccccccccc Q lcl|NC_012784. 110 TEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNG-SGKYPVVRQSEVAALEKVEELEENP 188 (415) Q Consensus 110 ~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~a~~v~Eg~~~~ 188 (415) .................+.....+|+.+...+++.+.+...+..++........ .-++.+++. +.+.+..+.++..++ T Consensus 1 ~~~~~~~~~~~~~~~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~~~~~~~~GdTV~ip~~-g~~~a~d~~~g~~i~ 79 (381) T protein:vir:80 1 MATIQGTGGYKGSAVDLSNVQVFIPEVWSSEVRMFRDQKFAALEATKKIPFEGKKGDLIHIPNI-SRAAVYDKQPQTPVN 79 (381) T ss_pred CceecccccccCcccchhhHHhhhhHHHHHHHHHHHHHhhhhhhccccccceeecCceEEeecc-CcceeeeecCCCccc Confidence 000000001111122333335568999999999999888888777654332211 124555543 456777888887765 Q ss_pred ccccccceeeEeeeeeE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccccc----ccccc-----cc Q lcl|NC_012784. 189 ELAVKPFFQLAYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGS----TGSTS-----SG 258 (415) Q Consensus 189 ~~~~~~f~~v~~~~~k~-a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~----~~~~~-----~~ 258 (415) ....+..+++++..+. ..-+.|++.-...+..++.+.+.+++..+++++.|+.++.-..... +.... .. T Consensus 80 -~~~~~~~~~~itID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~~t~~~~i~~ 158 (381) T protein:vir:80 80 -LQARTDSEFTFTVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRIYSYDTTLGD 158 (381) T ss_pred -ccccCCceEEEEEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccccc Confidence 3456667777777554 3346777766666667899999999999999999998875322111 11111 01 Q ss_pred cccccccccccchhhHHHHHHHHHHhhhhccCC--CEEEEcHHHHHHHHHhhc-cCCcccccCcccCCCCceecceeeEE Q lcl|NC_012784. 259 FEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEH--NVAIVSQTMFAKLDKMKD-KLGNYLIQPDVKEKTQQRLLGAKIEI 335 (415) Q Consensus 259 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~v~~~~~~~~l~~lkd-~~G~~l~~~~~~~~~~~~l~G~pV~~ 335 (415) .......+.......++.++++...+-...... =.++++|..+..|.+... .+-.+.-...+..+..++|+|++|+. T Consensus 159 ~~~~~~~t~~~~~~t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~~~~~~ad~~~~~~l~~G~Ig~i~G~~Vv~ 238 (381) T protein:vir:80 159 GTVNAHLTGTPAPLTYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLSINQFISVDFSQVKPVTSGVVGTILGMEVIV 238 (381) T ss_pred cccccccccchhhHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhhchhhhhhhhccchhhhceeeeEEcceEEEe Confidence 111111122233456788888888877666542 379999999999876421 11123223345667778999999999 Q ss_pred eccccccccCCceEEEechhhcEEEEeecceEE-EE-eecccCceEEEEEEEeccEEeccccEEEE-E--eecCCCCccc Q lcl|NC_012784. 336 LPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQA-SW-TDYMHFGECLMIAVRQDCRILDYKSAIVI-E--YDDSERGEGD 410 (415) Q Consensus 336 ~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i-~~-~~~~~~~~~~~~~~r~d~~v~~p~a~~~~-~--~t~~~~~~~~ 410 (415) ++.+|........+..|-... .. ..+.- .+ -++......++..-.+|.++......+.. . ......+.+- T Consensus 239 Sn~lp~~~~t~~~~~agap~~-~~----~~~~~~~~~g~~s~~a~av~~~k~yd~~~~~~~~~~~~~~g~~~~~~~~~~~ 313 (381) T protein:vir:80 239 TTQIGINSLTGYVNGQGAPTQ-PT----PGVLGSPYLPDQAGTANVVNTGSASDLAVSLSYFGLPVFSGAGATAADGGQT 313 (381) T ss_pred ecccccccccceeeecccccc-cc----ccccccccccccccceeeeeeeeeeceeeeeeeccceeeecceeeecCCCce Confidence 999997554333333322111 00 00000 00 01111111122222333333221111110 0 0011111111 Q ss_pred cccc----------------------------------C Q lcl|NC_012784. 411 LGLE----------------------------------A 415 (415) Q Consensus 411 ~~~~----------------------------------~ 415 (415) .++- = T Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 352 (381) T protein:vir:80 314 LGSFGGANRWATAVVCHPDWLAVGVQQNVKSESSRETMY 352 (381) T ss_pred eeeehhhhhhhhhcccccccccccceeEeecccchhhee Confidence 1111 0 No 141 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=99.12 E-value=6.8e-11 Score=76.32 Aligned_cols=295 Identities=11% Similarity=0.014 Sum_probs=157.5 Q ss_pred Hhhhhhhhhcc---cc-cccce-----eecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCcccccccc Q lcl|NC_012784. 113 LETRNDIQGGS---LK-TDSGF-----VVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEE 183 (415) Q Consensus 113 ~~~~~~~~~~~---~~-~~~~~-----~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E 183 (415) +.-......+. .+ ...++ .+.-+.++.++.......+.++.+.++..+.+++ ++.+++. +...+....- T Consensus 1 ~~~~~~~~~~~~n~~t~~~~~~~~~~~al~le~f~geV~~~f~~~si~~~~~~~rti~~Gk-sv~f~~i-G~~t~~~~t~ 78 (375) T protein:vir:10 1 MANANQVALGRSNLSTGTGYGGATDKYALYLKLFSGEMFKGFQHETIARDLVTKRTLKNGK-SLQFIYT-GRMTSSFHTP 78 (375) T ss_pred CccccccccCccccCCccccccccchHHHHHHHHhHHHHHHHHHHHhhhccccccccccCc-eEEEEee-eeeEEeeecC Confidence 00000000110 00 00111 2223678899999999999999999988887543 4555443 5566666766 Q ss_pred cccccc--cccccceeeEeeeeeE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcc----cccc----- Q lcl|NC_012784. 184 LEENPE--LAVKPFFQLAYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVI----TKGS----- 251 (415) Q Consensus 184 g~~~~~--~~~~~f~~v~~~~~k~-a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~----g~~~----- 251 (415) |+++-. ..+....+.++...++ +.-..|.+-=--++..++.+.+.++..+++++..|+.|+.-. .... T Consensus 79 G~~i~~~~~~d~~~te~~l~ID~~~y~~~~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~~p~~~~ 158 (375) T protein:vir:10 79 GTPILGNADKAPPVAEKTIVMDDLLISSAFVYDLDETLAHYELRGEISKKIGYALAEKYDRLIFRSITRGARSASPVSAT 158 (375) T ss_pred CcCcCCccccCCCCCceEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc Confidence 665421 1122234434444332 112222222222456689999999999999999999876311 1111 Q ss_pred cccccccccc-----ccccccccchhhHHHHHHHHHHhhhhccCC--CEEEEcHHHHHHHHHhhccCC----cccccCcc Q lcl|NC_012784. 252 TGSTSSGFEK-----EGKKLEVKKAKSLDDIKDAINLNVKPNYEH--NVAIVSQTMFAKLDKMKDKLG----NYLIQPDV 320 (415) Q Consensus 252 ~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~v~~~~~~~~l~~lkd~~G----~~l~~~~~ 320 (415) +.....+... .......+....++.++++...+....... =.++++|..|..|.+-+|.+. .+.-.... T Consensus 159 ~~~~~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~d~~~~~n~d~~~~~~~ 238 (375) T protein:vir:10 159 NFVEPGGTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEKGVSSQGRCAVLNPRQYYALIQDIGSNGLVNRDVQGSALQ 238 (375) T ss_pred cccccCcceeeeccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeChHHHHHHHhcCCccceeeeccccccee Confidence 1000111110 111111234456788888888887776653 378899999999887655431 11111112 Q ss_pred cCCCCceecceeeEEeccccccccCC--------------------------------ceEEEech---hh-cEEEEee- Q lcl|NC_012784. 321 KEKTQQRLLGAKIEILPDEVLGQKGN--------------------------------NTLIIGNL---KD-AIVLFDR- 363 (415) Q Consensus 321 ~~~~~~~l~G~pV~~~~~~~~~~~~~--------------------------------~~~~~gd~---~~-~~~~~~~- 363 (415) .++....++|++|+.++.+|..+... ...+-+|| .+ ...++.+ T Consensus 239 ~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~~~~~~~~~~~ 318 (375) T protein:vir:10 239 SGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAKSCGLIFQKE 318 (375) T ss_pred ccceEEEEeceEEEEeccccccccccccccccccccchhhhhccccccCCcceeeccccccccccccccCceEEEEEchh Confidence 23334589999999999998543211 11122233 11 1112222 Q ss_pred -------cceEEEEee--cccCce--EEEEEEEeccEEeccccEEEEEeecCCCCcccc Q lcl|NC_012784. 364 -------SQYQASWTD--YMHFGE--CLMIAVRQDCRILDYKSAIVIEYDDSERGEGDL 411 (415) Q Consensus 364 -------~~~~i~~~~--~~~~~~--~~~~~~r~d~~v~~p~a~~~~~~t~~~~~~~~~ 411 (415) .+++++.+. +....+ .+++..=+|..++||++.+.++.. .++...| T Consensus 319 A~g~v~~~~~~~~~~~~~~~~~~q~~~i~~~~a~G~~~lrp~~av~l~~~--~~~~~~~ 375 (375) T protein:vir:10 319 AAGVVEAIGPQVQVTNGDVSVIYQGDVILGRMAMGADYLNPAAAVELYIG--ATAPSAF 375 (375) T ss_pred heeeeeeeccccccccchhhheeeeeeeeeeeeeccCccCceeEEEEecC--cCccccC Confidence 334444432 322222 245556689999999998877655 4566666 No 142 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=99.11 E-value=4.6e-11 Score=77.23 Aligned_cols=282 Identities=8% Similarity=-0.022 Sum_probs=156.3 Q ss_pred hcccccccceeecchhHHhHHHHHHhhhhhhhh---------cceeEE--ccCCceeEEEEeecCCcccccccccccccc Q lcl|NC_012784. 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDK---------YVTVKR--VTNGSGKYPVVRQSEVAALEKVEELEENPE 189 (415) Q Consensus 121 ~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~---------~~~~~~--~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~ 189 (415) +. ++.-+...+|+.+.+-+.+...+.+.+.+ +..... .++....+|+....+ ..+..+.|+..++. T Consensus 1 MA--~T~lsd~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~-Gd~~~v~~~~~i~~ 77 (324) T protein:vir:59 1 MA--YTKISDVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLD-GDSQVLNDTDDLVP 77 (324) T ss_pred CC--ceeeeceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCC-CcccccCCCcccch Confidence 11 23335677888777766665555554422 111111 123333344332222 35667788888764 Q ss_pred cccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccccccccccc Q lcl|NC_012784. 190 LAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVK 269 (415) Q Consensus 190 ~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~ 269 (415) +..+.++-....++.+.-..++++...-+.-+....+.++++..+.+..++.+|.....-............ ...... T Consensus 78 -~~l~t~~~~a~i~~~~k~~~~tD~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l~g~~~~~~~~~~~~d-vsa~~~ 155 (324) T protein:vir:59 78 -QKINAGQDKAVLILRGNAWSSHDLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAELAGVFSNDDMKDNKLD-ISGTAD 155 (324) T ss_pred -hhcccceeeEEEEeecCceeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccceee-eecccc Confidence 456666666666677766777776554455577888999999999999998877543210000000000000 111222 Q ss_pred chhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCcccCCCCceecceeeEEeccccccc-cCC-- Q lcl|NC_012784. 270 KAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQ-KGN-- 346 (415) Q Consensus 270 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~-~~~-- 346 (415) ...+.+.+.+++.++.+....-.+|+||+.++..|+++.--+ ++. ..-....-++++|++|++++.+|... .+. T Consensus 156 ~~~s~~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~li~--~~~-~s~~~~~i~~~~G~~VivdD~~p~~~~~~~~~ 232 (324) T protein:vir:59 156 GIYSAETFVDASYKLGDHESLLTAIGMHSATMASAVKQDLIE--FVK-DSQSGIRFPTYMNKRVIVDDSMPVETLEDGTK 232 (324) T ss_pred ceecHHHHHHHHHHhCCcccCcEEEEEchHHHHHHHHhhhhh--hcc-ccccCceeeeecccEEEEeCCCCccccCCCCc Confidence 345678899999998888888889999999999999753221 111 11122334678999999999998531 111 Q ss_pred --ceEEEechhhcEEEEe-ecceEEEEeecccC-ceEEEEEEEeccEEeccccEEEEE--eecCCCCcccccccC Q lcl|NC_012784. 347 --NTLIIGNLKDAIVLFD-RSQYQASWTDYMHF-GECLMIAVRQDCRILDYKSAIVIE--YDDSERGEGDLGLEA 415 (415) Q Consensus 347 --~~~~~gd~~~~~~~~~-~~~~~i~~~~~~~~-~~~~~~~~r~d~~v~~p~a~~~~~--~t~~~~~~~~~~~~~ 415 (415) ...+||. .++.... +..+.+++.+.... ...+....++ +++|..+..-+ .+...+.-.+|.+.+ T Consensus 233 ~y~s~l~~~--GAi~~~~~~~~v~vE~dRd~~~g~~~l~~r~~~---~~~p~G~s~~~~~~~~~sPt~~~L~~~~ 302 (324) T protein:vir:59 233 VFTSYLFGA--GALGYAEGQPEVPTETARNALGSQDILINRKHF---VLHPRGVKFTENAMAGTTPTDEELANGA 302 (324) T ss_pred eEEEEEEec--CeEEEeecCCCcceecccCccccceEEEEeeEE---EeEeeeEEecccccCCCCCChhhhcCCc Confidence 1345553 2333333 23455666554332 2334443343 35555544432 112333445555555 No 143 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=99.08 E-value=5.3e-11 Score=76.91 Aligned_cols=293 Identities=9% Similarity=-0.017 Sum_probs=154.1 Q ss_pred HHhhhhhhhhcccccccce-----eecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCccccccccccc Q lcl|NC_012784. 112 YLETRNDIQGGSLKTDSGF-----VVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEE 186 (415) Q Consensus 112 ~~~~~~~~~~~~~~~~~~~-----~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~ 186 (415) ..........++....+++ .+.-+.++.+++......+.++.+.++..+.+++ ++.+++. +...+.....|.. T Consensus 1 ma~~~~~~~~~t~~~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~~~~~G~-sv~i~~i-g~~t~~~~~~g~~ 78 (347) T protein:vir:15 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGK-SAQFPVI-GRTKAAYLKPGEN 78 (347) T ss_pred CCccccCCccccccccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhccccccccccc-eeEeeec-cceeeeeeccCCC Confidence 0000000000000001100 1223567888988888888889988877766443 4444443 4456667777766 Q ss_pred cccc-ccccceeeEeeeeeEE-EeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccc--------cccccc Q lcl|NC_012784. 187 NPEL-AVKPFFQLAYDINTHR-GYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKG--------STGSTS 256 (415) Q Consensus 187 ~~~~-~~~~f~~v~~~~~k~a-~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~--------~~~~~~ 256 (415) .+.. ...+..+.++...+.- .-..|.+-=--+++.++.+.+.++...++++..|+.|+.-.... .+...+ T Consensus 79 l~~~~~~~~~~e~~ltID~~~~~~~~VddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~~ 158 (347) T protein:vir:15 79 LDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDASNENIEGL 158 (347) T ss_pred CCCCCCCCccceEEEEechhhhhhHHhhhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Confidence 5321 1245567666554332 12233222223466689999999999999999999887321110 000000 Q ss_pred --ccccc--cccccccc-----chhhHHHHHHHHHHhhhhccC--CCEEEEcHHHHHHHHHhhccC-CcccccCcccCCC Q lcl|NC_012784. 257 --SGFEK--EGKKLEVK-----KAKSLDDIKDAINLNVKPNYE--HNVAIVSQTMFAKLDKMKDKL-GNYLIQPDVKEKT 324 (415) Q Consensus 257 --~~~~~--~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~--~~~~v~~~~~~~~l~~lkd~~-G~~l~~~~~~~~~ 324 (415) ..... ........ ....++.++++...+...... +=.++++|..|..|.+-.+-. ..|.-...+..|. T Consensus 159 g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL~~~~~~~~d~~~~~~~~~G~ 238 (347) T protein:vir:15 159 GKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQALIDHERGT 238 (347) T ss_pred CccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhcccccccccccccccccceE Confidence 00000 00000001 112255566666666666553 235777999999987643322 2222122345566 Q ss_pred CceecceeeEEeccccccccCC---------ceEEE--------echhhc---------EEEEeecceEEEEeec-ccCc Q lcl|NC_012784. 325 QQRLLGAKIEILPDEVLGQKGN---------NTLII--------GNLKDA---------IVLFDRSQYQASWTDY-MHFG 377 (415) Q Consensus 325 ~~~l~G~pV~~~~~~~~~~~~~---------~~~~~--------gd~~~~---------~~~~~~~~~~i~~~~~-~~~~ 377 (415) .++++|++|+.++++|..+..+ ...+- ++|... +-.+.-++++++..+. .++. T Consensus 239 Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e~~~~~~~~~ 318 (347) T protein:vir:15 239 IRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRANYQA 318 (347) T ss_pred EEEEeceEEEecccccccccccccccccccccccccccccceeeeccccceeeeeccceeeeeEeeceeeeecccchhhh Confidence 6789999999999998543211 11111 111111 1112223334444433 3334 Q ss_pred eEEEEEEEeccEEeccccEEEEEeecCCC Q lcl|NC_012784. 378 ECLMIAVRQDCRILDYKSAIVIEYDDSER 406 (415) Q Consensus 378 ~~~~~~~r~d~~v~~p~a~~~~~~t~~~~ 406 (415) ..+++-..+|.+++||++.+.+++..-.. T Consensus 319 d~i~~~~~~G~~vlrP~~av~~~~~~~~~ 347 (347) T protein:vir:15 319 DQIIAKYAMGHGGLRPEAAGAIVLPKVSE 347 (347) T ss_pred hhhehhhhcCCceeccccEEEEecCCCCC Confidence 45666677899999999999988776555 No 144 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=99.08 E-value=1.4e-11 Score=80.06 Aligned_cols=294 Identities=11% Similarity=0.001 Sum_probs=155.7 Q ss_pred HHHHHHHhhhhhhhhccccccc-ce-eecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCccccccccc Q lcl|NC_012784. 107 RDFTEYLETRNDIQGGSLKTDS-GF-VVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEEL 184 (415) Q Consensus 107 ~~~~~~~~~~~~~~~~~~~~~~-~~-~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg 184 (415) --+...+........+....++ .. .+.-+.++.++++.....+.++.+.+..++.++. ++.+++. +...+.....| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~~~r~i~~G~-tv~i~~i-g~~~~~~~~~g 78 (332) T protein:vir:78 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGK-SKQFMFT-GKLSAGYHTPG 78 (332) T ss_pred CcccccccCCccccCCccccccccchhhhhhhhhhhHHHHHHHHhhhhhccccccccccc-eEEEEec-cceeEeeecCC Confidence 0000000000000001111111 11 1333788999999999999999999887776443 4555543 55566666666 Q ss_pred ccccccccccceeeEeeeeeE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcc----ccccc-cccccc Q lcl|NC_012784. 185 EENPELAVKPFFQLAYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVI----TKGST-GSTSSG 258 (415) Q Consensus 185 ~~~~~~~~~~f~~v~~~~~k~-a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~----g~~~~-~~~~~~ 258 (415) ..+.....+.-.++++...+. +.-..|.+-=--++..++.+.+.++..+++++..|+.|+.-. ....+ ...+.+ T Consensus 79 ~~l~~~~~~~~~~~~l~ID~~ky~~~~VddiD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~~~~~~~g~ 158 (332) T protein:vir:78 79 TPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGG 158 (332) T ss_pred CCCCCCCCCCCceEEEEEehhhhhHHHHHhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCcccccccc Confidence 654322223445555555542 222233221112455689999999999999999998776321 11111 111111 Q ss_pred cccc-cccccccchhhHHHHHHHHHHhhhhccCCC--EEEEcHHHHHHHHHhhccC--Cccccc-C-cccCC-CCceecc Q lcl|NC_012784. 259 FEKE-GKKLEVKKAKSLDDIKDAINLNVKPNYEHN--VAIVSQTMFAKLDKMKDKL--GNYLIQ-P-DVKEK-TQQRLLG 330 (415) Q Consensus 259 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~v~~~~~~~~l~~lkd~~--G~~l~~-~-~~~~~-~~~~l~G 330 (415) .... ......+....++.++++...+-....... .+|++|..|..|.+.+|.. .++... . ....+ ...+++| T Consensus 159 ~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~d~~~~n~~~~~~~~~~~~g~~i~~i~G 238 (332) T protein:vir:78 159 FHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIAG 238 (332) T ss_pred cccccCCccccCHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHHhhcCceeeeeeccccccceecceeeeEEee Confidence 1111 111122233456888888888887777543 4667999999987744321 111101 1 12222 2468999 Q ss_pred eeeEEeccccccccCCc---------eEEEechhhcEE-EEee--------cceEEEEee----cccCceEEEEEEEecc Q lcl|NC_012784. 331 AKIEILPDEVLGQKGNN---------TLIIGNLKDAIV-LFDR--------SQYQASWTD----YMHFGECLMIAVRQDC 388 (415) Q Consensus 331 ~pV~~~~~~~~~~~~~~---------~~~~gd~~~~~~-~~~~--------~~~~i~~~~----~~~~~~~~~~~~r~d~ 388 (415) ++|+.++++|..+..+. ..+-|+|+...- ++-+ .++.++.+. ..++...+++-..+|. T Consensus 239 ~~V~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~~~~~~~~~d~i~~~~~~G~ 318 (332) T protein:vir:78 239 IRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAMGC 318 (332) T ss_pred eEEEecCccccCcccccccccccccccccccccccceEEeecccceeeeeeeccchhhhhcccchhhhHhhhhhhhhhcC Confidence 99999999986542111 123344443211 1212 222332221 2333445666678999 Q ss_pred EEeccccEEEEEee Q lcl|NC_012784. 389 RILDYKSAIVIEYD 402 (415) Q Consensus 389 ~v~~p~a~~~~~~t 402 (415) +++||++++.++-. T Consensus 319 ~v~rPe~~v~l~~a 332 (332) T protein:vir:78 319 GSLRTSVAGSFQAA 332 (332) T ss_pred ceecccceEEEeeC Confidence 99999998887533 No 145 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=99.04 E-value=4e-11 Score=77.59 Aligned_cols=257 Identities=11% Similarity=0.035 Sum_probs=136.3 Q ss_pred cceeEEccCCceeEEEEeecCCccccccccccccccc-cccccee--eEeeeeeEEEeehhhHHHHhcchHHHHHHHHHH Q lcl|NC_012784. 154 YVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPEL-AVKPFFQ--LAYDINTHRGYFRISREAIEDAKVNVLQELKLW 230 (415) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~-~~~~f~~--v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~ 230 (415) ++ +++++++ ++.+++. +...+....-|.++... ....-.+ +++.-.+++.+ .|.+-=--++.+|+.+...++ T Consensus 1 ~v--r~i~~g~-s~~~~~i-G~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~~-~VdDiD~~qa~~Dlr~e~s~~ 75 (324) T protein:vir:99 1 MT--RTITSGK-SAQFPVM-GRTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDV-LIYDIEDAMNHYDVRSEYSTQ 75 (324) T ss_pred Ce--eeeecCc-eEEEeee-eeeEeccccCCCCcCCCcCCcCcccEEEEecchhhhhh-hhhhHHHHhcCccchhHHHHH Confidence 33 3344432 4444433 55666777767665211 1123344 33333333332 222211223556899999999 Q ss_pred HHHHHHHHHHHHHhhc----c--c---cccccccccc---ccccccccc--ccchhhHHHHHHHHHHhhhhccCC--CEE Q lcl|NC_012784. 231 MARTIAATRNKAIIDV----I--T---KGSTGSTSSG---FEKEGKKLE--VKKAKSLDDIKDAINLNVKPNYEH--NVA 294 (415) Q Consensus 231 la~~~~~~~d~~il~g----~--g---~~~~~~~~~~---~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~--~~~ 294 (415) +.+++++..|+.++.- . . ...+.....+ ....+.... ......++.++++...+...+... =.+ T Consensus 76 ~G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~Lde~~VP~~gR~~ 155 (324) T protein:vir:99 76 MGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAFAKKYIPAGDRTF 155 (324) T ss_pred HHHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEE Confidence 9999999999877521 0 0 0111111111 111111111 112234677777877877666643 368 Q ss_pred EEcHHHHHHHHHhh-ccCCcccccCcccCCCCceecceeeEEeccccccccCCce--------------------EEEec Q lcl|NC_012784. 295 IVSQTMFAKLDKMK-DKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNT--------------------LIIGN 353 (415) Q Consensus 295 v~~~~~~~~l~~lk-d~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~--------------------~~~gd 353 (415) +++|..+..|..-+ -..+.+.-...+..+..++++|++|+.++++|.....+.. -+-+| T Consensus 156 vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ky~~d 235 (324) T protein:vir:99 156 YTDPDTYSAILAALMPNAANYAALIDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTGKMTVG 235 (324) T ss_pred EeChHHHHHHhhcccccccccccccceecceEEEEeceEEEecCCccccccccccccccccccccccccccccccccccc Confidence 99999998775432 2223443334466677788999999999999864221100 01233 Q ss_pred hhhc---------EEEEeecceEEEEeec-ccCceEEEEEEEeccEEeccccEEEEEeecCCC---------Cccccccc Q lcl|NC_012784. 354 LKDA---------IVLFDRSQYQASWTDY-MHFGECLMIAVRQDCRILDYKSAIVIEYDDSER---------GEGDLGLE 414 (415) Q Consensus 354 ~~~~---------~~~~~~~~~~i~~~~~-~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~~~---------~~~~~~~~ 414 (415) |... +..+.-.+++.+..++ .++...+++..-+|.+++||++.+.+++.+.++ |-+.+-.. T Consensus 236 ~~~~~gl~~~~~a~~tv~~~~~~~e~~~~~~~~~d~i~~~~a~G~~~lRPe~a~~v~l~~~~~~~~~~~~~~~~~~~~~~ 315 (324) T protein:vir:99 236 ADNVVGLFVHRSAVATLKLKDMALERARRPEYQADQIIAKYAMGHGGLRPEAVGAIIFEDGETPAVAPDVITGVASFAAP 315 (324) T ss_pred cCceeEEEEehhheEEEeeecceecceechhhHHHhhhhhhhhcCcccccceEEEEEEccCccccccchhhhhhccccCc Confidence 3221 1111222233433332 334445677777899999999999999876553 23333333 Q ss_pred C Q lcl|NC_012784. 415 A 415 (415) Q Consensus 415 ~ 415 (415) | T Consensus 316 ~ 316 (324) T protein:vir:99 316 A 316 (324) T ss_pred c Confidence 3 No 146 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=99.02 E-value=2e-10 Score=73.77 Aligned_cols=284 Identities=8% Similarity=-0.047 Sum_probs=156.1 Q ss_pred hcccccccceeecchhHHhHHHHHHhhhhhhhh---------cceeEEccCCceeEEEEeecCCcccccccccc-ccccc Q lcl|NC_012784. 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDK---------YVTVKRVTNGSGKYPVVRQSEVAALEKVEELE-ENPEL 190 (415) Q Consensus 121 ~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~---------~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~-~~~~~ 190 (415) +....+.-....+|+.+.+-+.+.+...+.+++ +......++....+|+....+ ..+..+.||. .++ . T Consensus 1 Ma~~~T~l~d~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~~~~l~-G~~~~~~dg~~~i~-~ 78 (330) T protein:vir:10 1 MANELTKILDTITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGLLVNMPFWNDLT-GDSEVLGNGDKALE-T 78 (330) T ss_pred CCCCceEeeeeechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhcCCCEEEecccccCC-CcccccCCCccccc-h Confidence 222234455678888877766666555444422 111111233444444433222 3455566775 454 3 Q ss_pred ccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccc------ccccccccccccccc Q lcl|NC_012784. 191 AVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITK------GSTGSTSSGFEKEGK 264 (415) Q Consensus 191 ~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~------~~~~~~~~~~~~~~~ 264 (415) +..+-+.-....++.+....++++...-+.-|....+.++++..+.+..+..+|.-... ........ ...... T Consensus 79 ~ki~t~~~~a~i~~~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~l~gvf~~~~~~~~~~~~-~~~~~~ 157 (330) T protein:vir:10 79 GKITAGADIACVLYRGRGWAANELTGVVAGSDPVRAILNRIGAYWLREDQKALIATLNGIFATGTAGEKGALE-ETHVSD 157 (330) T ss_pred hhcccceeEEEEEeecceeeehhhhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHHHHhhhhhhhcccchhhh-hhheec Confidence 44555666666666666677766655445557788899999999999888876643221 11111000 001111 Q ss_pred cccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCcccCCCCceecceeeEEecccccccc Q lcl|NC_012784. 265 KLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQK 344 (415) Q Consensus 265 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~ 344 (415) ........+.+.+.++..++.+....-.+|+|||.++..|++..--+ ++ .+.-.+..-++++|++|++++.+|.... T Consensus 158 ~~~~~a~~s~~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~li~--~~-~~s~~~~~i~~~~G~~VivdD~~p~~~~ 234 (330) T protein:vir:10 158 QSKASTGIDAGMVLDAKQLLGDSADQVTAIAMHSAVYTKLQKDNLIQ--YI-QPTTATINIPTYLGYRVIIDDGIAPTGD 234 (330) T ss_pred ccccccccCHHHHHHHHHHhccccccceEEEEcHHHHHHHHHhhhhh--hh-cccccCcccccccceEEEEeCCCCCCCC Confidence 12233445678899999998888888889999999999998743111 11 1122223446899999999999985543 Q ss_pred CCceEEEechhhcEEEEee---cceEEEEeecccC-ceEEEEEEEeccEEeccccEEEEEe----ecCCCCcccccccC Q lcl|NC_012784. 345 GNNTLIIGNLKDAIVLFDR---SQYQASWTDYMHF-GECLMIAVRQDCRILDYKSAIVIEY----DDSERGEGDLGLEA 415 (415) Q Consensus 345 ~~~~~~~gd~~~~~~~~~~---~~~~i~~~~~~~~-~~~~~~~~r~d~~v~~p~a~~~~~~----t~~~~~~~~~~~~~ 415 (415) .-...+||. .++...+. ..+.+++.++... .+.+....+ .+++|..+..-.- ....+...+|.+.+ T Consensus 235 ~yt~yl~~~--GAi~~~~~~~~~~v~~EtdRd~~~g~~~l~~r~~---~~~hp~G~s~~~~~~~~~~~sPt~~~L~~~~ 308 (330) T protein:vir:10 235 IYTSYLFRT--GSIGLNTGNPSGLTTFETSREAAKGNDMIYTRRA---LVMHPYGVKWTGAEVDAGNITPSNADLAKFK 308 (330) T ss_pred ceeEEEEec--CceeeecccCCccccccccCCccccceEEEEeeE---EEeeeeeeeecccccccCcCCcChHHhcCCc Confidence 333456653 22333321 1244555544332 233333333 3456766555431 12235555666666 No 147 >protein:vir:95318 Length: 328 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512264;genbank:gi:89152431;genbank:GeneID:3952987 Probab=98.98 E-value=8.9e-11 Score=75.68 Aligned_cols=230 Identities=9% Similarity=-0.026 Sum_probs=152.6 Q ss_pred hhhhhhccccccc-ceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCccccccccccccccccccc Q lcl|NC_012784. 116 RNDIQGGSLKTDS-GFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKP 194 (415) Q Consensus 116 ~~~~~~~~~~~~~-~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~ 194 (415) +........+... ...+-|......|++.+.+.++|+..+.......+++ +.+...++-|.+.|..-++..++ +..+ T Consensus 1 m~~~~~~~~TL~e~Akr~~~d~~~~~VIE~l~~~n~IL~~lpf~e~n~gt~-~~~~v~~~LP~~~fR~lN~g~~~-s~~t 78 (328) T protein:vir:95 1 MAVKGLTALTLADWGKRVDPNGKVDKIIELLGQTNPILQDMPFVEGNLPTG-HRTTIRSGLPSATWRLLNYGVQP-SKST 78 (328) T ss_pred CCccccccccHHHHHhhhCcchhHHHHHHHHhccchhHhhcceeecccCCc-ceeeEeeccCCceeeecCCccCc-ccce Confidence 1111111112222 2234466677889999999999999999888765542 44445567788888888888875 4679 Q ss_pred ceeeEeeeeeEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc---------------- Q lcl|NC_012784. 195 FFQLAYDINTHRGYFRISREAIEDAK--VNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTS---------------- 256 (415) Q Consensus 195 f~~v~~~~~k~a~~~~iS~e~l~ds~--~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~---------------- 256 (415) +.+++...+-+.+.+.|.+.+.+... .++...-.....+++.+++...|++|+.+..+.... T Consensus 79 t~q~t~~l~ilgg~~eVDr~la~~~Gn~~~~ra~q~~~~~ka~~~~~~~~~iyGdsa~~p~~F~GL~~R~~~~s~~~a~q 158 (328) T protein:vir:95 79 TVQVTDSVGMLETYAEVDKSLADLNGNTAEFRLSEDRAFIEAMNQQMAQTLFYGDSSVNPQQFMGLSSRYSSLSAGNAQN 158 (328) T ss_pred eEEEEEEEEEEecceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCccCChhhhcchhhhcCccccccccc Confidence 99999999999999999999887653 233444455578999999999999985543221100 Q ss_pred ------cccc-c------------------------------------------------------------------cc Q lcl|NC_012784. 257 ------SGFE-K------------------------------------------------------------------EG 263 (415) Q Consensus 257 ------~~~~-~------------------------------------------------------------------~~ 263 (415) ++.. . .+ T Consensus 159 iidaGgtg~~~TSi~~v~~g~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~g~~y~~y~~~~~w~~Gl~i~d~r~vvrI~N 238 (328) T protein:vir:95 159 IIDAGGTGTDNTSIWLVVWGENTVHGIFPKGKKAGIQMEDKGQVTLEDANGGKYEGYRTHYKWDNGLALRDWRYVVRIAN 238 (328) T ss_pred eeecccCCCCceEEEEEEEcCCeEEEecccccccCceeeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEec Confidence 0000 0 00 Q ss_pred c-----cccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHh-hccCCcccccCcccCCCCceecceeeEEec Q lcl|NC_012784. 264 K-----KLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKM-KDKLGNYLIQPDVKEKTQQRLLGAKIEILP 337 (415) Q Consensus 264 ~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l-kd~~G~~l~~~~~~~~~~~~l~G~pV~~~~ 337 (415) . ...+......+.+++++..++.....+.+|+||.+....|++. .+.....+-...+.+..+-.++|+||..++ T Consensus 239 Id~~~l~~~~~~~~l~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~~~~~~~~g~~~t~~~gipir~~d 318 (328) T protein:vir:95 239 IDVSNLSEPSSAANIAKLMVKALHRIPNRGMGRPVFYMNRTVGQALDLQSLEKTSLAISVKETEGEWWTSFRGVPIRETD 318 (328) T ss_pred CcccccccccChhhHHHHHHHHHHHhccCCCCcceeehhHHHHHHHHHHHhcCcceeeeeeccCCcceeEECCeEEEEEe Confidence 0 0112233344566777888777788889999999999999874 566555555556666667789999999999 Q ss_pred cccccccCCceEEE Q lcl|NC_012784. 338 DEVLGQKGNNTLII 351 (415) Q Consensus 338 ~~~~~~~~~~~~~~ 351 (415) .+...-+ .++ T Consensus 319 ai~~tE~----~vv 328 (328) T protein:vir:95 319 ALLETEA----RVV 328 (328) T ss_pred eeecCcc----ccC Confidence 7653221 122 No 148 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=98.97 E-value=3.4e-10 Score=72.48 Aligned_cols=283 Identities=10% Similarity=-0.005 Sum_probs=148.1 Q ss_pred hcccccccceeecchhHHhHHHHHHhhhhhhhh---------cceeEEccCCceeEEEEeecCCcccccccccccccccc Q lcl|NC_012784. 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDK---------YVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELA 191 (415) Q Consensus 121 ~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~---------~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~ 191 (415) +. .+.-+...+|+.+..-+.+...+.+.+++ +......++....+|+....+ ..+..+.|+..++. + T Consensus 1 MA--~T~lsd~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~-Gd~~~~~~~~~i~~-~ 76 (351) T protein:vir:15 1 MA--ETHLSDLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLT-GDPDNWTDSDDIDV-N 76 (351) T ss_pred CC--ceeeeeeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCC-CcccccCCCcccch-h Confidence 11 23335677888776666555544444422 111111233344445433222 35567788888764 4 Q ss_pred cccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccc--cccccccccccccccccccc Q lcl|NC_012784. 192 VKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITK--GSTGSTSSGFEKEGKKLEVK 269 (415) Q Consensus 192 ~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~--~~~~~~~~~~~~~~~~~~~~ 269 (415) ..+-+.-....+..+.-+.++++...-+.-|....+.++++..+.+..+..+|.-... +................... T Consensus 77 kitt~~~~a~i~~~~kg~~~tD~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~l~gv~~~~~~~~~~~~d~t~~~~~~ 156 (351) T protein:vir:15 77 NLTSGKQQGIKFYQTKAYGYTDLGTMISGAPVQETIGNRFAAFWQRADQKTLLSVLKGVMGVTKIANSKVYDQTKVSPSE 156 (351) T ss_pred eecccceeEEEEeeccceehhhhhHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhchhhcccceeccccccccc Confidence 4555555556666666677777654444457788899999999999999987753311 00000000011111122334 Q ss_pred chhhHHHHHHHHHHhhhhccC-CCEEEEcHHHHHHHHHhhccCCcccccCcccCCCCceecceeeEEeccccccccC--- Q lcl|NC_012784. 270 KAKSLDDIKDAINLNVKPNYE-HNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKG--- 345 (415) Q Consensus 270 ~~~~~~~~~~~~~~~~~~~~~-~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~--- 345 (415) ...+++.+.+++.++.+.... -.+|+||+.++..|++..--+ ++ +..-.+..-++++|++|++++.+|....+ T Consensus 157 ~~is~~~l~~A~~~~GD~~~~~~~~ivmhS~v~~~L~~~~li~--~~-~~s~~~~~i~t~~G~~VivdD~~p~~~~~~~~ 233 (351) T protein:vir:15 157 PMFGAKGFTGAIGLMGDLQDTAFGAIAVNSATYSLMKVQGLIE--TI-QPQNGATPFEAYNGLRIVLDDDIEIDLTDKTK 233 (351) T ss_pred cccCHHHHHHHHHHhccccccceEEEEEChHHHHHHHhhhhhh--hc-cccccCcccceecceEEEEcCCCccccCCCCC Confidence 456778999999998876544 589999999999998653111 10 11111223468999999999999853221 Q ss_pred C--ceEEEechhhcEEEEeecceEEEEeec--ccCceEEEEEEEeccEEeccccEEEEEee----cCCCCcccccccC Q lcl|NC_012784. 346 N--NTLIIGNLKDAIVLFDRSQYQASWTDY--MHFGECLMIAVRQDCRILDYKSAIVIEYD----DSERGEGDLGLEA 415 (415) Q Consensus 346 ~--~~~~~gd~~~~~~~~~~~~~~i~~~~~--~~~~~~~~~~~r~d~~v~~p~a~~~~~~t----~~~~~~~~~~~~~ 415 (415) . ..++||. .++...+ ....+++.++ ....+......| -.+++|..+..-.-+ ...+.-.+|.+.+ T Consensus 234 ~~ytsyl~~~--GAi~~~~-~~~~ve~~rd~~~~~g~d~l~~r~--~~~~hp~G~s~~~~~~~~~~~sPt~~~L~~~~ 306 (351) T protein:vir:15 234 PVSTSYIFAP--GAVRYST-NMRSTETKYDPLINGGQDVIVQKR--VGTIHVAGTSIKASFSPSKASFPTIDELAKSS 306 (351) T ss_pred ceeEEEEEec--ceeeeec-CCcCcceeecccCCCCceEEEEee--eeeeeeeeeeecccccccCcCCcChHHhcCCc Confidence 1 1345553 2222222 2333333332 222222222211 134666665543211 1123333444444 No 149 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=98.94 E-value=4e-10 Score=72.10 Aligned_cols=285 Identities=14% Similarity=0.105 Sum_probs=155.0 Q ss_pred hhhhhhhcc---cccccceeecchhHHhHHHHHHhhhh-hhhhcceeEEccCCceeEEEEeecCCcccccccccc----- Q lcl|NC_012784. 115 TRNDIQGGS---LKTDSGFVVIPEEIVTDILKLKEVEF-NLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELE----- 185 (415) Q Consensus 115 ~~~~~~~~~---~~~~~~~~~vP~~~~~~Ii~~~~~~~-~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~----- 185 (415) -......+. .+..-....+ .++...+.-...+.. .++..++...-..++..+..+.. ..+..++.+. T Consensus 1 ~~~~~~~~~~~~Ms~~i~~~fv-~qy~~~v~~~~qq~~s~L~~tV~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~ 76 (322) T protein:vir:10 1 MKLNAIMSMLPLIAGDIDQAFV-QTYETTLRILSQQKSAKLKQYCQHKNESSESHNWETLAS---MDPDAVKRKRSRQQS 76 (322) T ss_pred CcccceeeeeeeeechhhhHHH-HHHHHHHHHHHHHhhhhhhcccccccccccccceeeccc---ccccccccccccccc Confidence 000000011 0111111122 455556654444433 44444443332223222222211 1111121111 Q ss_pred -----cccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc---cccc Q lcl|NC_012784. 186 -----ENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTG---STSS 257 (415) Q Consensus 186 -----~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~---~~~~ 257 (415) ..|. ....++..............|.+.-.-....|..+...+..+.+++++.|..|+++.-..... +... T Consensus 77 ~d~~~dtp~-~~~~~~~r~~~~~d~~~~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~~~~~gt~v 155 (322) T protein:vir:10 77 ADGTYPTPV-NNKPFAKRRTNVDTYDTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPASIKGTGQPV 155 (322) T ss_pred cCcccCCCc-cccccceEEEeecccccceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhcccccccccccc Confidence 1121 122344444444444555677777666677789999999999999999999888743221111 1111 Q ss_pred ccccccccccccchhhHHHHHHHHHHhhhhccCCC---EEEEcHHHHHHHHHhhccC-CcccccCcc-cCCCCceeccee Q lcl|NC_012784. 258 GFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHN---VAIVSQTMFAKLDKMKDKL-GNYLIQPDV-KEKTQQRLLGAK 332 (415) Q Consensus 258 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~v~~~~~~~~l~~lkd~~-G~~l~~~~~-~~~~~~~l~G~p 332 (415) ...............+++.++.+...+..+..... .++++|..|..|.....-. -.|.-...+ ..|..++++|+. T Consensus 156 ~~~ss~~i~~g~~g~t~~kl~~a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~d~~~ts~D~~~~~~l~~~G~ig~~lGf~ 235 (322) T protein:vir:10 156 EFLATQEIGDGTKPISFDYVTEITERFLENEIEPEVSKVIVIGPTQARKLLQITEATSADYTSAMDLQSKGIITNWMGYT 235 (322) T ss_pred ccCCCcccccCccchhHHHHHHHHHHHHhcCCCCCCCeEEEeCHHHHHHHhcchhhhhhhcccchhhhhcCeeeeeeeEE Confidence 11112222333456678889988888877776643 5888999999987653322 233322333 346678999999 Q ss_pred eEEeccccccc-------------cCCceEEEechhhcEEEEeecceEEEEeec--ccCceEEEEEEEeccEEeccccEE Q lcl|NC_012784. 333 IEILPDEVLGQ-------------KGNNTLIIGNLKDAIVLFDRSQYQASWTDY--MHFGECLMIAVRQDCRILDYKSAI 397 (415) Q Consensus 333 V~~~~~~~~~~-------------~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~--~~~~~~~~~~~r~d~~v~~p~a~~ 397 (415) ++.++.+|..+ .....++++. ++++......++..+.... ..+...+...+-+|..+++|+.++ T Consensus 236 ~i~s~~lp~~~~t~~~~~~~~~~~~~~~~~~a~~-k~Av~~a~~~dv~~~i~~~~~~~~a~~I~~~~~~Ga~ri~~~gVv 314 (322) T protein:vir:10 236 WIVSTRLDKFDPTQWGMAAEDGPQGDEIWCIAMT-DMALGYHSCKDIWTKVAEDPSASFAWRIYSAFTADCVRVEDEHIF 314 (322) T ss_pred EEEeccCCccccccccccccCCCCccceeEEEEe-cCceeEEEeeeeeEEeeccCCcchhhhhhhhhhhCceEeccCcEE Confidence 99999887321 1122344444 3456666555666655432 223334555678899999999999 Q ss_pred EEEeecCC Q lcl|NC_012784. 398 VIEYDDSE 405 (415) Q Consensus 398 ~~~~t~~~ 405 (415) .+++..+. T Consensus 315 ~i~~~e~~ 322 (322) T protein:vir:10 315 KLRLKNSL 322 (322) T ss_pred EEEEeccC Confidence 99998887 No 150 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=98.93 E-value=1.6e-10 Score=74.24 Aligned_cols=281 Identities=9% Similarity=0.048 Sum_probs=159.9 Q ss_pred hhccccccccee-ecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCcccccccccccccccccccceee Q lcl|NC_012784. 120 QGGSLKTDSGFV-VIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQL 198 (415) Q Consensus 120 ~~~~~~~~~~~~-~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v 198 (415) +..+..++.+-. .+|+.++..|+..+.+......+.+......+ -++.|+ ..+.+......+++.+. ....+-.++ T Consensus 1 ~~~~n~ts~~qafi~~EiWsa~il~~l~~~Lv~~~~~~~~d~g~G-DtV~In-sIg~~tV~dY~~~~~i~-~d~ltt~~~ 77 (322) T protein:vir:31 1 MSTGNNTSNTQALIVSEIWADEIEDILHEKLLDVNIARVVDFPDG-DKLTIP-SVGTPVVRSRPEQGDFT-FDNLDTGEI 77 (322) T ss_pred CCCCCCcccceEEeehhhhHHHHHHHhhhhhhhhhhhcccccCCC-CeEEec-cccccccccccCCCCcc-cccCCCceE Confidence 222233333334 45999999999887776665555553332221 233343 33556666665555543 122333333 Q ss_pred --EeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccccc------ccc-ccccccccccccccc Q lcl|NC_012784. 199 --AYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGS------TGS-TSSGFEKEGKKLEVK 269 (415) Q Consensus 199 --~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~------~~~-~~~~~~~~~~~~~~~ 269 (415) .+.-.|+.++ .|++... +...+|.+...++.+++++...|..+..-..++. +.+ ...+.......+... T Consensus 78 ~l~IDq~KYfaf-~VdDD~~-Qa~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~~p~vin~~~~~iv~~gt~ 155 (322) T protein:vir:31 78 SIILRDEVYAGN-AISKKLR-QDSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQNDPNVINGVPHRFVGTGTD 155 (322) T ss_pred EEEEehhhhhcc-ccchhHH-HhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCcceecCCccceeccCCC Confidence 4444455554 4888554 4557899999999999999999987643122111 000 001111112223334 Q ss_pred chhhHHHHHHHHHHhhhhccCC-C-EEEEcHHHHHHHHHh-----hccCCcccc--cCcccCCC--CceecceeeEEecc Q lcl|NC_012784. 270 KAKSLDDIKDAINLNVKPNYEH-N-VAIVSQTMFAKLDKM-----KDKLGNYLI--QPDVKEKT--QQRLLGAKIEILPD 338 (415) Q Consensus 270 ~~~~~~~~~~~~~~~~~~~~~~-~-~~v~~~~~~~~l~~l-----kd~~G~~l~--~~~~~~~~--~~~l~G~pV~~~~~ 338 (415) ....|+.++++..++-.+.... . .+|++|..+..|..+ --.++|+.. ..+...+. .+++.|+.|++|++ T Consensus 156 ~~~ay~~lv~l~~kLdkanVP~~gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG~a~g~~~Vg~~~GF~V~~SN~ 235 (322) T protein:vir:31 156 QTMDVTDFSRVNYVMTQSKMPMGGMIGIIDPSVAHHLETITNISNISNNPRWEGIVESGIAPDMQFVRSVYGIDLFVSNL 235 (322) T ss_pred chhhHHHHHHHHHHhccccCCCCCeEEEeCchhhhhhhhhhhhhhhhccccccccccccchhhHHHHHHHhceeeeeecc Confidence 5667899999998888777764 3 466678887777432 122345432 22222211 47899999999998 Q ss_pred ccccccCCceEEE---------echhhcEEEEeecce-------EE---E-EeecccCceEEEEEEEeccEEeccccEEE Q lcl|NC_012784. 339 EVLGQKGNNTLII---------GNLKDAIVLFDRSQY-------QA---S-WTDYMHFGECLMIAVRQDCRILDYKSAIV 398 (415) Q Consensus 339 ~~~~~~~~~~~~~---------gd~~~~~~~~~~~~~-------~i---~-~~~~~~~~~~~~~~~r~d~~v~~p~a~~~ 398 (415) ++.. +.+++. |-++.+..+.+.+.. ++ + +-++..|.-.+|+..|+|.++.+|+.++. T Consensus 236 l~~~---~~~i~aG~d~~~t~ag~~n~f~~~~~~~~~~~~~~~~~l~~~e~~r~~~~~~d~~~~~~~~g~g~~r~e~l~~ 312 (322) T protein:vir:31 236 LADA---NETINAGGDARSTTAGKCNMFMNVSDMGLLPFVVAWKEMPTTKSFIDDYNDDLNTATTARWGNGLVRDENLVC 312 (322) T ss_pred cccc---ccccccCcccccccceeecccccccchhhhhhhhHhhhhhhhhcccCccccccceeeeeeecceeecccceEE Confidence 8632 112222 222222222211110 11 1 12235566678999999999999999999 Q ss_pred EEeecCCCCc Q lcl|NC_012784. 399 IEYDDSERGE 408 (415) Q Consensus 399 ~~~t~~~~~~ 408 (415) +.-++.+... T Consensus 313 ~~a~~~~~~~ 322 (322) T protein:vir:31 313 VLANADKVTF 322 (322) T ss_pred EEeccccccC Confidence 9888877777 No 151 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=98.88 E-value=1.5e-10 Score=74.46 Aligned_cols=268 Identities=14% Similarity=0.029 Sum_probs=145.7 Q ss_pred hhhcccccccceeec-c--hhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCcccccccccccccccccccc Q lcl|NC_012784. 119 IQGGSLKTDSGFVVI-P--EEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPF 195 (415) Q Consensus 119 ~~~~~~~~~~~~~~v-P--~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f 195 (415) +.. ...+...-+. | .++.+.+-..+.....++...+..|++.++ .+.+|+..-...+..|+||+.+| .+..+. T Consensus 1 mAe--~nlt~~~dL~~~~sidfv~~f~~~i~~L~~~Lgi~r~~p~a~G~-tIt~pK~~~tgda~dVaEGe~Ip-lskvt~ 76 (295) T protein:vir:99 1 MAE--KNLNTMADLGDIKSIDFVNKFSKNINDLLKLLGVTRRETLTNDL-KIQTYKWEVTLDQTDPGEGETIP-LSKVTR 76 (295) T ss_pred CCC--cccccHhhccCceeehhhHHhhhhHHHHHHHhccccccccccCC-eEEeeeeeeecccccccCCcccc-hhhhee Confidence 111 1111111122 2 233344433334444444555677887765 77888888778889999999998 456665 Q ss_pred e---eeEeeeeeEEEeehhhHHHHhcchH-HHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccch Q lcl|NC_012784. 196 F---QLAYDINTHRGYFRISREAIEDAKV-NVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKA 271 (415) Q Consensus 196 ~---~v~~~~~k~a~~~~iS~e~l~ds~~-~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~ 271 (415) . ..+++.+|++.-+ |.|.++.+.+ +-...-.++|..+++.++++.++.-..++.... ....-. T Consensus 77 ~~~~t~t~kikK~rK~t--TdEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lktat~t~-----------tg~~lq 143 (295) T protein:vir:99 77 TKDKDYTVKWFKKRRAT--TAEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKTKPTKV-----------KGVGLQ 143 (295) T ss_pred eeeeeeEEEeeeecccc--cHHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhccCceee-----------ehhhHH Confidence 4 4777788888754 9999754333 457788999999999999999998765542211 011112 Q ss_pred hhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccC--CcccccCcccCCCCceeccee-eEEeccccccccC--- Q lcl|NC_012784. 272 KSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKL--GNYLIQPDVKEKTQQRLLGAK-IEILPDEVLGQKG--- 345 (415) Q Consensus 272 ~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~--G~~l~~~~~~~~~~~~l~G~p-V~~~~~~~~~~~~--- 345 (415) ..+..+...+......+..+.+.++||.+...|++-..-+ ..-.|..++. -.++|.. |+.+..+|.|..- T Consensus 144 ~a~a~~~~al~~f~Ee~~~~~V~FVnP~D~a~yl~~A~~~~~~a~~fG~~~L----~nfLG~q~II~S~kv~~G~~~aT~ 219 (295) T protein:vir:99 144 KALSASWAKLATFNEFEGSPLVSFVSPLDVANYLGDTKVGADASNVFGMTLL----KNFLGMQNVIVMPSVPEGKIYSTA 219 (295) T ss_pred HHHHHhhhhhhhcccccCCceEEEEehHHHHHHHhccccccchhhhhhhhhh----hhhhccceEEEcccCCCceEEEee Confidence 2444555555555566667789999999999887643222 1111221111 1389997 8899888876421 Q ss_pred CceEEE-------echhhcEEEEee-cceEEEEeecccCceEEEEE-EEeccE---EeccccEEEEEeecCC--CCcc Q lcl|NC_012784. 346 NNTLII-------GNLKDAIVLFDR-SQYQASWTDYMHFGECLMIA-VRQDCR---ILDYKSAIVIEYDDSE--RGEG 409 (415) Q Consensus 346 ~~~~~~-------gd~~~~~~~~~~-~~~~i~~~~~~~~~~~~~~~-~r~d~~---v~~p~a~~~~~~t~~~--~~~~ 409 (415) ...+.+ ||+.+.+.+... .++.--. |......+-.+ +-+.+. +-++++++..+++++. .-|| T Consensus 220 ~~Ni~~ay~~~~~g~l~~~f~~~~D~tglIg~~--h~~~~~~~t~et~~~~~~~lfpE~~dgiv~~tI~~~~~~~~~~ 295 (295) T protein:vir:99 220 VENLVFASLNVKGGDLGGLFADFTDETGLIAAA--RNRQLSNLTYESVFFGANVLFAEIPEGVVEATIEAAAVPGIGG 295 (295) T ss_pred ccceEEEEecCCchhhhhhhhhccCcccceEEE--eccccceeeehhhhHhHHHhcccccceEEEEEEecCcCCCCCC Confidence 111222 222222111111 1111000 00001111111 122232 3446799999997533 2233 No 152 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=98.82 E-value=4.4e-10 Score=71.87 Aligned_cols=295 Identities=9% Similarity=0.031 Sum_probs=153.7 Q ss_pred hhhhhhcc---cccccceeecc-hhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCcccccccccccccccc Q lcl|NC_012784. 116 RNDIQGGS---LKTDSGFVVIP-EEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELA 191 (415) Q Consensus 116 ~~~~~~~~---~~~~~~~~~vP-~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~ 191 (415) +....... ...++....+. +.+..++.+.....+.++++..+..+.+++ ++.++.. +...+.....|+.. ... T Consensus 1 Ms~~n~~t~~~~~~s~~~~al~le~f~geV~taF~~~si~~~~~~vrti~~Gk-S~qf~~i-G~~~a~y~~~G~~l-dg~ 77 (402) T protein:vir:97 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTN-TVSNKYL-GETELQVLAPGQSP-NAT 77 (402) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccc-eEEEEEE-eeeEEeeecccccc-CCC Confidence 11111111 11111222233 678899999998899999999998887654 5555543 55556666666553 344 Q ss_pred cccceeeEeeeeeEE---EeehhhHHHHhcchHH-HHHHHHHHHHHHHHHHHHHHHhhcc---c---ccc---cccc-cc Q lcl|NC_012784. 192 VKPFFQLAYDINTHR---GYFRISREAIEDAKVN-VLQELKLWMARTIAATRNKAIIDVI---T---KGS---TGST-SS 257 (415) Q Consensus 192 ~~~f~~v~~~~~k~a---~~~~iS~e~l~ds~~~-l~~~l~~~la~~~~~~~d~~il~g~---g---~~~---~~~~-~~ 257 (415) .+..++..+....+- .++.==+|.. +.++ +.+.+.+++.+++++..|+.++.-. + +.. +.+. .. T Consensus 78 ~~~~~k~~ItID~lL~a~~~V~diDeaq--~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~~~~~~ 155 (402) T protein:vir:97 78 PTQADKNQLVIDTTVIARNTVAHIHDVQ--GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGH 155 (402) T ss_pred CcccccEEEEeCceeechhhhhhHHHHH--hcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCccccc Confidence 555666666665432 1111112333 3355 6889999999999999999775210 0 000 0000 00 Q ss_pred ccccccccc----cccchhhHHHHHHHHHHhhhhccCC--CEEEEcHHHHHHHHHhhcc-CCccccc--CcccCCCCcee Q lcl|NC_012784. 258 GFEKEGKKL----EVKKAKSLDDIKDAINLNVKPNYEH--NVAIVSQTMFAKLDKMKDK-LGNYLIQ--PDVKEKTQQRL 328 (415) Q Consensus 258 ~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~--~~~v~~~~~~~~l~~lkd~-~G~~l~~--~~~~~~~~~~l 328 (415) +.......+ .......++.+.++...+-..+... =+++++|..|..|.+-.+= +-.|... ..+..+....+ T Consensus 156 g~s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~dRv~vv~P~~y~~Ll~~~rl~n~d~~~~~~g~~~~G~v~~v 235 (402) T protein:vir:97 156 GFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSS 235 (402) T ss_pred ccccccccccchhhcCHHHHHHHHHHHHHHHHhcCCCccccEEEeChHHHHHHhhcccccchhhccccCCccccceeEEE Confidence 000000111 1111222355556666665544433 3899999999998863211 1111111 12445556789 Q ss_pred cceeeEEeccccccc--c----------CCceEEEechhhc-EEEEeecc-eEEEEee--------cccCceEEEEEEEe Q lcl|NC_012784. 329 LGAKIEILPDEVLGQ--K----------GNNTLIIGNLKDA-IVLFDRSQ-YQASWTD--------YMHFGECLMIAVRQ 386 (415) Q Consensus 329 ~G~pV~~~~~~~~~~--~----------~~~~~~~gd~~~~-~~~~~~~~-~~i~~~~--------~~~~~~~~~~~~r~ 386 (415) +|+||+.++++|..+ . |...-+-|||... ..+|.+.. .+++..+ ...+...+..+.-+ T Consensus 236 ~Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~vT~~~~~d~r~~~~~id~~~a~ 315 (402) T protein:vir:97 236 YNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMAE 315 (402) T ss_pred eceEEEecCccccccccccccccccCCCCccCCcCcccceeEEEEEecceEEEEEeeccccchhhchhHHHHHHHHHHHh Confidence 999999999998632 1 1222244666643 22333332 2222211 11111223444567 Q ss_pred ccEEeccccEEEEEeec--CCCCcccccccC Q lcl|NC_012784. 387 DCRILDYKSAIVIEYDD--SERGEGDLGLEA 415 (415) Q Consensus 387 d~~v~~p~a~~~~~~t~--~~~~~~~~~~~~ 415 (415) |..+.||++...+.+.- ++.--|++.+.- T Consensus 316 G~g~~RPeaa~vv~~~~~~t~~~~~~~~~~~ 346 (402) T protein:vir:97 316 GAIPDRWEAVSVVTTKRDATTGDAGGPGDDH 346 (402) T ss_pred CCcccCccceEEEEEecccccccCCccccch Confidence 88999999988887653 333333333322 No 153 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=98.82 E-value=6.3e-10 Score=71.02 Aligned_cols=295 Identities=11% Similarity=0.034 Sum_probs=158.3 Q ss_pred hhhhhhcccccc----cceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCcccccccccccccccc Q lcl|NC_012784. 116 RNDIQGGSLKTD----SGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELA 191 (415) Q Consensus 116 ~~~~~~~~~~~~----~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~ 191 (415) +........... .--.+.-+.+..++.......+.++++..++++.+++ ++.+++. +...+.....|++. ... T Consensus 1 Ms~~n~~t~p~~~gsg~~~aL~Le~f~GeV~taF~~~si~~~~~~vRtI~~gk-S~qf~~l-G~s~a~y~~pG~~l-dg~ 77 (400) T protein:vir:10 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTN-TVSNKYL-GETELQVLAPGQSP-AAT 77 (400) T ss_pred CCCCccccccccccccchhhhHHhHhcchHHHHHHHHhhhcccceeeeecccc-eEEEEEe-eeeEEeeecCCCCc-CCC Confidence 111111111111 1112234678888999998999999999999888764 5555543 66677778777764 444 Q ss_pred cccceeeEeeeeeEE-Eeehhh--HHHHhcchHH-HHHHHHHHHHHHHHHHHHHHHhh----c-cc-ccc----cccccc Q lcl|NC_012784. 192 VKPFFQLAYDINTHR-GYFRIS--REAIEDAKVN-VLQELKLWMARTIAATRNKAIID----V-IT-KGS----TGSTSS 257 (415) Q Consensus 192 ~~~f~~v~~~~~k~a-~~~~iS--~e~l~ds~~~-l~~~l~~~la~~~~~~~d~~il~----g-~g-~~~----~~~~~~ 257 (415) .+..++..+....+- ....|. +|.+.+ +| +.+.+.+++.+++++..|+.++. + .. +.. +.+... T Consensus 78 ~~~~dk~~ItIDtLL~a~~~V~dlDd~q~~--yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~~~g~~~ 155 (400) T protein:vir:10 78 STQADKNQLVIDATVIARNTVAHLHDVQGD--IDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTNPRVKGH 155 (400) T ss_pred CcccCcEEEEeCceeeecchhhhHHHHhhc--cccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCcccc Confidence 556667666665542 222222 344443 55 78999999999999999987652 1 00 111 111111 Q ss_pred ccccccccccc----cchhhHHHHHHHHHHhhhhccC--CCEEEEcHHHHHHHHHhh-ccCCccccc--CcccCCCCcee Q lcl|NC_012784. 258 GFEKEGKKLEV----KKAKSLDDIKDAINLNVKPNYE--HNVAIVSQTMFAKLDKMK-DKLGNYLIQ--PDVKEKTQQRL 328 (415) Q Consensus 258 ~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~--~~~~v~~~~~~~~l~~lk-d~~G~~l~~--~~~~~~~~~~l 328 (415) +.......... ........+.++...+...+.. .-++++.|..|..|.... -=|-.|... .++..+....+ T Consensus 156 g~s~~v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP~~d~vvl~pp~~Ys~Ll~~dkLvnrdf~~s~~g~~~~g~v~~v 235 (400) T protein:vir:10 156 GFSVNVEVNEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKSYTISQSGATIQGFVLSS 235 (400) T ss_pred ccceeecccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhCCcccchhccccCCCccccceEEEE Confidence 10110101111 1111224445555555443332 236667777777775321 001111111 12333444579 Q ss_pred cceeeEEeccccccc--c----------CCceEEEechhhcEE-EEeecc-eEEEEee--------cccCceEEEEEEEe Q lcl|NC_012784. 329 LGAKIEILPDEVLGQ--K----------GNNTLIIGNLKDAIV-LFDRSQ-YQASWTD--------YMHFGECLMIAVRQ 386 (415) Q Consensus 329 ~G~pV~~~~~~~~~~--~----------~~~~~~~gd~~~~~~-~~~~~~-~~i~~~~--------~~~~~~~~~~~~r~ 386 (415) .|+||+.++++|.++ . |..+-+-|||...+- +|.++. +.++..+ ...+...+..+.-+ T Consensus 236 ~Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~tvk~~~lt~~~~~d~r~~~~~id~~~a~ 315 (400) T protein:vir:10 236 YNCPVIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALLVGRSIDVIGDIFYEKKEKTYYIDTFMSE 315 (400) T ss_pred eceEEEeeCcCCcccCcccccccccCCCCccCCccccccceeEEEEehhheEEEEeeccccccccchhhHHHHHHHHHHh Confidence 999999999998643 1 222224477775432 333332 2233211 12222334555678 Q ss_pred ccEEeccccEEEEEeecCCCCcccccccC Q lcl|NC_012784. 387 DCRILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) Q Consensus 387 d~~v~~p~a~~~~~~t~~~~~~~~~~~~~ 415 (415) +..+.||+|...++.....++.-|.+--+ T Consensus 316 G~g~~RPeaa~vv~~~~~~~~~~~~~~~~ 344 (400) T protein:vir:10 316 GAIPDRWEAVSVVTTKRQSTGAVDSGNAA 344 (400) T ss_pred CCcccchhheEEEEecCCcccccccCcch Confidence 99999999999999887666665544333 No 154 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=98.75 E-value=6.1e-10 Score=71.10 Aligned_cols=293 Identities=11% Similarity=0.039 Sum_probs=152.7 Q ss_pred hhhhhhccccccc----ceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCcccccccccccccccc Q lcl|NC_012784. 116 RNDIQGGSLKTDS----GFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELA 191 (415) Q Consensus 116 ~~~~~~~~~~~~~----~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~ 191 (415) +...........+ --.+.-+.+..++.......+.++++..++++.+++ ++.+++. +...+.....|+.. ... T Consensus 1 Ms~~n~~t~~~~~~sg~~~al~Le~f~GeV~taF~~~si~~~~~~vRti~~gk-S~qf~~~-G~s~~~~~~pG~~l-d~~ 77 (401) T protein:vir:70 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTN-TVSNKYL-GETELQVLAPGQSP-AAT 77 (401) T ss_pred CCCCccccccccccccchhHhHHhHhcchHHHHHHHHhhhcccceeeeecccc-eEEEEEe-eeeEeeeecCCCCc-CCC Confidence 1111111111111 112234678888999988999999999999888764 5555543 56677777777664 344 Q ss_pred cccceeeEeeeeeEE-Eeehhh--HHHHhcchHH-HHHHHHHHHHHHHHHHHHHHHhh-----cccc-----cccccccc Q lcl|NC_012784. 192 VKPFFQLAYDINTHR-GYFRIS--REAIEDAKVN-VLQELKLWMARTIAATRNKAIID-----VITK-----GSTGSTSS 257 (415) Q Consensus 192 ~~~f~~v~~~~~k~a-~~~~iS--~e~l~ds~~~-l~~~l~~~la~~~~~~~d~~il~-----g~g~-----~~~~~~~~ 257 (415) .+..++..+....+- ....|. ++.+ +.++ +.+.+.+++.+++++..|+.++. |... ..+.+... T Consensus 78 ~~~~dK~~ItID~lL~a~~~V~dlDe~q--~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~~p~~~~~ 155 (401) T protein:vir:70 78 STQADKNQLVIDATVIARNTVAHLHDVQ--GDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRTNPRVKGH 155 (401) T ss_pred CcccccEEEEeCceeehhhhhhhHHHHH--hcccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCcCCC Confidence 556666666665442 112221 2333 3345 68899999999999999986632 2110 11111111 Q ss_pred ccccccccc----cccchhhHHHHHHHHHHhhhhccCC--CEEEEcHHHHHHHHH---hhccCCccccc--CcccCCCCc Q lcl|NC_012784. 258 GFEKEGKKL----EVKKAKSLDDIKDAINLNVKPNYEH--NVAIVSQTMFAKLDK---MKDKLGNYLIQ--PDVKEKTQQ 326 (415) Q Consensus 258 ~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~--~~~v~~~~~~~~l~~---lkd~~G~~l~~--~~~~~~~~~ 326 (415) +........ ........+.+.++...+...+... -++++.|..|..|.. |-|. .|-.. ..+..+... T Consensus 156 G~~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~~r~vvl~pp~~Ys~Ll~~d~L~nr--d~~~s~~g~~~~G~v~ 233 (401) T protein:vir:70 156 GFSINVEVAEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDK--TYTISQSGATIQGFTL 233 (401) T ss_pred ceEEeccccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhcCcccch--hhccccCCccccceEE Confidence 111111111 1122223456667766665544433 255556666666643 2111 11111 123444456 Q ss_pred eecceeeEEecccccccc------------CCceEEEechhhcEE-EEeecc-eEEEEee--------cccCceEEEEEE Q lcl|NC_012784. 327 RLLGAKIEILPDEVLGQK------------GNNTLIIGNLKDAIV-LFDRSQ-YQASWTD--------YMHFGECLMIAV 384 (415) Q Consensus 327 ~l~G~pV~~~~~~~~~~~------------~~~~~~~gd~~~~~~-~~~~~~-~~i~~~~--------~~~~~~~~~~~~ 384 (415) .+.|+||+.++++|.++. |...-+-|||....- +|.++. +.++..+ ...+...+..+. T Consensus 234 ~vaGv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~~lt~~~~~d~r~~~~~id~~~ 313 (401) T protein:vir:70 234 SSYNCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSIDVTGDIFYEKKEKTYYIDTFM 313 (401) T ss_pred EEeceEEEeeccccccccccccccccccCCCccCCCCccccceeEEEEehhheEEEEeeccccchhhhhhhhHHHHHHHH Confidence 799999999999986431 222234467765422 333332 2233221 111222244455 Q ss_pred EeccEEeccccEEEEEeecCCCCcccccccC Q lcl|NC_012784. 385 RQDCRILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) Q Consensus 385 r~d~~v~~p~a~~~~~~t~~~~~~~~~~~~~ 415 (415) -+|..+.||+|...++..-....+.-++++- T Consensus 314 a~g~g~~RPeaa~vv~~k~~~~~~~~~~~~~ 344 (401) T protein:vir:70 314 AEGAIPDRWEAVSVVTTKRNTTTGAVEGTDG 344 (401) T ss_pred HhCCcccchhheEEEeecCcccccccccCCc Confidence 6899999999998886544333222222221 No 155 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=98.68 E-value=1.2e-09 Score=69.54 Aligned_cols=269 Identities=10% Similarity=-0.023 Sum_probs=142.9 Q ss_pred HHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeE-EEEeecCCccccccccccccccc Q lcl|NC_012784. 112 YLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKY-PVVRQSEVAALEKVEELEENPEL 190 (415) Q Consensus 112 ~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~-~~~~~~~~~~a~~v~Eg~~~~~~ 190 (415) .+........+.....+-+...-.++.+.+-..+.....++...+..|+..++ .+ .++...-...+..|+||+.+| . T Consensus 1 ~~~~~~~~e~nlt~~~dl~~~~siDf~~~f~~~i~~L~~~LGv~r~~pla~Gs-tIkt~k~~~y~gda~dVaEGe~Ip-l 78 (296) T protein:vir:98 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGM-TLKTYAGYDVTLAEGNVPEGEVIP-L 78 (296) T ss_pred CCCccccCcCCCcchhhhhhhhhhhhHHHHhhhHHHHHHHhhhcccccccCCC-EEeeccceeeeeccccccCCcccc-h Confidence 00000000111112222233334566666665555555556666778888876 44 334456667788999999998 4 Q ss_pred ccccce---eeEeeeeeEEEeehhhHHHHhcchH-HHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccc Q lcl|NC_012784. 191 AVKPFF---QLAYDINTHRGYFRISREAIEDAKV-NVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKL 266 (415) Q Consensus 191 ~~~~f~---~v~~~~~k~a~~~~iS~e~l~ds~~-~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~ 266 (415) +..+.. ..+++.+|++.-+ |.|.++.+.. +-...-.++|..+++.++++.++....++..... . .. T Consensus 79 skvt~~~~~t~t~~ikK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~LktaT~t~~-----~---t~ 148 (296) T protein:vir:98 79 SKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQD-----A---LG 148 (296) T ss_pred hhheeeecceEEEEeecccccc--CHHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHHhcccceee-----e---ch Confidence 566654 3777788888775 9999754332 4577889999999999999999987655421000 0 00 Q ss_pred cccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCcccCCCCc-eecceeeEEecccccccc- Q lcl|NC_012784. 267 EVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQ-RLLGAKIEILPDEVLGQK- 344 (415) Q Consensus 267 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~-~l~G~pV~~~~~~~~~~~- 344 (415) ..-.......+.++.....+....+.+.++||.+...+++-..=. .....+..-- .++|..|+.+..+|.|.. T Consensus 149 ~~lQ~Ala~~~~~l~~~feded~~~~V~FVnP~D~a~ylg~a~it-----~qt~fG~tyl~nfLG~~II~S~kV~~G~~~ 223 (296) T protein:vir:98 149 AGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGIT-----TQTAFGLTYLVDFTGTVIISTNDVTKGEIW 223 (296) T ss_pred hhHHHHHHHHhhhhhhhccccCCCceEEEEehHHHHHHhcCCccc-----hhheechhhhhhccccEEEEcCcCCCceEE Confidence 000000111222333344444445789999999988866422111 1111111111 278999999998887642 Q ss_pred ----CCceEEEechhhcEEEEeecceEEEEeecccCceEEEEE-------------EEeccE---EeccccEEEEEeecC Q lcl|NC_012784. 345 ----GNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIA-------------VRQDCR---ILDYKSAIVIEYDDS 404 (415) Q Consensus 345 ----~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~-------------~r~d~~---v~~p~a~~~~~~t~~ 404 (415) .+-.+.+-|++ .+++.-.+.. ..+.+++.+. +-+.+. +-++++++..+++++ T Consensus 224 ~T~~~Ni~~ay~~~~-------~~~l~~~f~~-~~d~tglIGv~h~~~~~~~t~eT~~~~~~~lfpE~~dgiv~~tI~~~ 295 (296) T protein:vir:98 224 ATVPENIIFAYINPN-------NSELAKEFNL-YGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPG 295 (296) T ss_pred EeeecceEEEeeccc-------ccchhhhhcc-ccccccceEEEeccccceeeehhHhHhHHHhcccccceEEEEEecCC Confidence 11111222211 1112211111 1122222221 122332 344679999999887 Q ss_pred C Q lcl|NC_012784. 405 E 405 (415) Q Consensus 405 ~ 405 (415) . T Consensus 296 ~ 296 (296) T protein:vir:98 296 V 296 (296) T ss_pred C Confidence 7 No 156 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=98.66 E-value=2.2e-09 Score=68.06 Aligned_cols=268 Identities=12% Similarity=0.029 Sum_probs=144.3 Q ss_pred hhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEe---ecCCcccccccccccccccccc Q lcl|NC_012784. 117 NDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVR---QSEVAALEKVEELEENPELAVK 193 (415) Q Consensus 117 ~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~---~~~~~~a~~v~Eg~~~~~~~~~ 193 (415) .....+......-+..+-.++.+.+-..+.....++...+..|+..++ .+..++ ..-...+..|+||+.+| .+.. T Consensus 1 M~~e~nl~~~~dL~~a~siDF~~~f~~~i~~L~~~LGv~r~~pla~Gt-~iktyK~~~~~y~gda~dVaEGe~Ip-lskv 78 (303) T protein:vir:10 1 MSAENNLINVEALGKAKSIDFANKLGVGLNKLFEALAIQNKIPMNVGS-ALKQYRFKVEDSEKPNGDVAEGDVIP-LTKV 78 (303) T ss_pred CCCCcCCcchhhcccceeehhhhhhhhhHHHHHHHhhhhccccccCCc-eeeeeeeeceeeccccccccCCcccc-hhhh Confidence 111111222222233444566666655555555555556777887665 343333 34446678999999998 5556 Q ss_pred cce---eeEeeeeeEEEeehhhHHHHhcchH-HHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccccccccccc Q lcl|NC_012784. 194 PFF---QLAYDINTHRGYFRISREAIEDAKV-NVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVK 269 (415) Q Consensus 194 ~f~---~v~~~~~k~a~~~~iS~e~l~ds~~-~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~ 269 (415) +.. ..+++.+|++.-+ |.|.++.+.. +-...-.++|...+..++++.++.-..++...... +.. T Consensus 79 t~~~~~t~~~~~kK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~lktaT~t~~~----------t~~ 146 (303) T protein:vir:10 79 TREQVDITELQFAKYRKST--SAEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFETLKSAIENGKR----------TNK 146 (303) T ss_pred eeeecceEEEEeecccccc--cHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhhccccccc----------ccc Confidence 643 5788889998865 9999854332 45777889999999999999998765543211000 011 Q ss_pred chhhHHHHHHHHHHhh------hhccCCCEEEEcHHHHHHHHHhhccCCc-ccccCcccCCCCceecceeeEEecccccc Q lcl|NC_012784. 270 KAKSLDDIKDAINLNV------KPNYEHNVAIVSQTMFAKLDKMKDKLGN-YLIQPDVKEKTQQRLLGAKIEILPDEVLG 342 (415) Q Consensus 270 ~~~~~~~~~~~~~~~~------~~~~~~~~~v~~~~~~~~l~~lkd~~G~-~l~~~~~~~~~~~~l~G~pV~~~~~~~~~ 342 (415) ...+.+.+.+++.... .....+.++++||.+...++.-..-+-+ --|..++. -.++|..|+.+..+|.| T Consensus 147 t~~s~~glq~Al~~~~~kl~~~~ed~~~~V~FvNP~Daa~yl~~A~i~~~~t~fG~n~L----~nfLG~~II~S~kv~~G 222 (303) T protein:vir:10 147 TKLSAENLQGALSKGRANLSVLLDDEITPIAFVNPNDTAEYLANGFINSTGAQFGVNLL----TPYVGVKIVEFADVPQG 222 (303) T ss_pred eeecHHHHHHHHHhhhhhccccccccccEEEEEchHHHHHHhhcCCcchhhhhhhhhhh----hhhhcceEEEeccCCCc Confidence 1223444555544321 1223456999999999998752211111 11211111 13889999999988876 Q ss_pred cc-----CCceEEEechhhcEEEEeecceEEEEeecccCceEEEEE-------------EEeccE---EeccccEEEEEe Q lcl|NC_012784. 343 QK-----GNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIA-------------VRQDCR---ILDYKSAIVIEY 401 (415) Q Consensus 343 ~~-----~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~-------------~r~d~~---v~~p~a~~~~~~ 401 (415) .. .+-.+.|.+.+ +++.-. ..+..+.+++.+. +-+.+. +-++++++..++ T Consensus 223 ~~~~T~~~Ni~~ay~~~~--------g~l~~~-f~~t~D~tglIGv~h~~~~~~~t~eT~~~~~~~lfpE~~dgiv~~ti 293 (303) T protein:vir:10 223 EVWMTVAENLNVAYANPR--------GELSRA-FAFATDATGFVGVLHDIQPQRLTSDTIYASAISMFPENIDAVIKVTI 293 (303) T ss_pred eEEEeeccceEEEEecCc--------hhhhhh-hhhccccccceEEEeccccceeeehhHhHhHHHhcccccceEEEEEE Confidence 42 12122233221 111111 1111112222221 122332 344678999998 Q ss_pred ecCCCCcccc Q lcl|NC_012784. 402 DDSERGEGDL 411 (415) Q Consensus 402 t~~~~~~~~~ 411 (415) ++...++=-. T Consensus 294 ~~~e~~~~~~ 303 (303) T protein:vir:10 294 KKDEAGELPS 303 (303) T ss_pred eccccCCCCC Confidence 7655443333 No 157 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=98.65 E-value=6.8e-09 Score=65.37 Aligned_cols=278 Identities=8% Similarity=-0.047 Sum_probs=151.2 Q ss_pred hcccccccceeecc---hhHHhHHHHHHhhhhhhhhcceeEE-ccCCceeEEEEeecCCccccccccccc-ccccccccc Q lcl|NC_012784. 121 GGSLKTDSGFVVIP---EEIVTDILKLKEVEFNLDKYVTVKR-VTNGSGKYPVVRQSEVAALEKVEELEE-NPELAVKPF 195 (415) Q Consensus 121 ~~~~~~~~~~~~vP---~~~~~~Ii~~~~~~~~l~~~~~~~~-~~~~~~~~~~~~~~~~~~a~~v~Eg~~-~~~~~~~~f 195 (415) ++......+|...- +.+.+.|++........+.++.+.. .+-...++.+........+.|++.++. +|. .+..+ T Consensus 1 ~~~~~a~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~-v~~~~ 79 (296) T protein:vir:10 1 MGVDKADAAGIWTVKQLTASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDGVGIAQIVADYTDDLPL-VDALA 79 (296) T ss_pred CcccchhhhHHHHHHHHHHHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeeccCceeEeCCCccccce-eeccc Confidence 11111122222322 2456666666655555555554332 122222444444455566778777654 553 34667 Q ss_pred eeeEeeeeeEEEeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccccccccc---cc Q lcl|NC_012784. 196 FQLAYDINTHRGYFRISREAIEDA---KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLE---VK 269 (415) Q Consensus 196 ~~v~~~~~k~a~~~~iS~e~l~ds---~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~---~~ 269 (415) .......+.++..+.++.+=++.+ ..++..--....+.++.+.+|+.++.|+..-...+............. .. T Consensus 80 ~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~~g~~GLlN~p~v~~~~~~~~W~~ 159 (296) T protein:vir:10 80 TERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTAHGIPSVFDYPNINNVVSGGSWSQ 159 (296) T ss_pred eeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccceeEeecCCCccccccCCccC Confidence 888888889898888886655433 446788788888899999999999999765432222222111111111 11 Q ss_pred chhhHHHHHHHHHHhhhh---ccCCCEEEEcHHHHHHHHHhhccCCcccccCcccC-CCCceecceeeEEeccccccccC Q lcl|NC_012784. 270 KAKSLDDIKDAINLNVKP---NYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKE-KTQQRLLGAKIEILPDEVLGQKG 345 (415) Q Consensus 270 ~~~~~~~~~~~~~~~~~~---~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~-~~~~~l~G~pV~~~~~~~~~~~~ 345 (415) .+..++|+.+++..+... ...+..++++|..+..|.......|.-++. -+.. ..+.+|.+.|.... .+..+ T Consensus 160 ~t~i~~Di~~~~~~l~~~s~g~~~p~~l~L~p~~~~~L~~~~~~~~~t~l~-~ik~~~~~l~i~~~~~l~~----a~~~g 234 (296) T protein:vir:10 160 PTTAVSDITSLLDIIETSTNGQHRATHLLLPTTARRIMQNLVPGTSVSYGE-FFRQNNSGVTVEFVQYLND----YNGTG 234 (296) T ss_pred HHHHHHHHHHHHHHHHHhhCceecceeEEeCHHHHHHHhhccCCCCccHHH-HHHHhcCCceEEEeeeecc----CCCCc Confidence 234578888888776643 345779999999999987665555533321 1111 11123333332221 12223 Q ss_pred CceEEEech-hhcEEEEeecceEEEEeecccCceEEEEEEEec-cEEeccccEEEEE-eecC Q lcl|NC_012784. 346 NNTLIIGNL-KDAIVLFDRSQYQASWTDYMHFGECLMIAVRQD-CRILDYKSAIVIE-YDDS 404 (415) Q Consensus 346 ~~~~~~gd~-~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~d-~~v~~p~a~~~~~-~t~~ 404 (415) ...+++-+. .+++.+...+.++........-......+.|++ +.+.+|.|+++++ +|=+ T Consensus 235 ~~~~v~~~~~~~~~~~~v~~~~~~~~~e~~~l~~~~~~~~~~~Gv~i~~P~ai~~~dGI~~~ 296 (296) T protein:vir:10 235 TSAAIAYEKDPNNMAIEIPEATNALPAQPKDLHFKIPVTSKATGLIVYRPLTMAVMKGITFA 296 (296) T ss_pred ceEEEEEEcCCceEEEEcCcceeeecccccCceEEEeeEeeEEEEEEECCceeEEEeeeecC Confidence 333333222 223333333444443333333344567778885 6889999999995 2222 No 158 >protein:vir:103759 Length: 330 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024928;genbank:gi:48697198;genbank:GeneID:2846083 Probab=98.65 E-value=3.7e-09 Score=66.80 Aligned_cols=230 Identities=8% Similarity=-0.010 Sum_probs=144.8 Q ss_pred hhhhhhccccccc-ceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCccccccccccccccccccc Q lcl|NC_012784. 116 RNDIQGGSLKTDS-GFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKP 194 (415) Q Consensus 116 ~~~~~~~~~~~~~-~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~ 194 (415) +........+... ...+-|......|++.+.+.++|+..+.......+++... ...++-|.+.|..-++..++ +..+ T Consensus 1 m~~~~~~a~TL~e~AKr~~~d~~~~~IIE~l~~tn~IL~~lpf~e~N~~tg~~t-~vrt~LP~~~fR~lN~g~~~-s~~t 78 (330) T protein:vir:10 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRT-SVRTGLPTPTWRKLYGGVLP-NKSS 78 (330) T ss_pred CCcCCCCcccHHHHHhhcCcchhHHHHHHHHhcCchHHhhcchhhccCCcccce-eEEeecCCchhhhcCCcccc-ccce Confidence 1111111122222 2234466667789999999999999888765544443322 22345577788887777765 4689 Q ss_pred ceeeEeeeeeEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc---------------- Q lcl|NC_012784. 195 FFQLAYDINTHRGYFRISREAIEDAK--VNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTS---------------- 256 (415) Q Consensus 195 f~~v~~~~~k~a~~~~iS~e~l~ds~--~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~---------------- 256 (415) +.+++...+-+.+.+.|-+.+.+... .++...-.....+++.+++...+++|+.+..+.... T Consensus 79 t~qvt~~l~ilgg~~eVDr~la~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGD~a~~p~~F~GL~kR~~~~ta~~~~q 158 (330) T protein:vir:10 79 TAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDN 158 (330) T ss_pred EEEEEEEeEEecchhhhhhHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCChhhccchhhhcCCCCCCchhh Confidence 99999999999999999999887543 234444566688999999999999986543221100 Q ss_pred ------cccc--------------------------------------ccc----------------------------- Q lcl|NC_012784. 257 ------SGFE--------------------------------------KEG----------------------------- 263 (415) Q Consensus 257 ------~~~~--------------------------------------~~~----------------------------- 263 (415) ++.. ..+ T Consensus 159 vIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~~~dg~gg~y~~~~~~~~w~~Gl~i~d~r~vvRI 238 (330) T protein:vir:10 159 VIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARV 238 (330) T ss_pred eeeccccccCceEEEEEEEcCCeEEEEcccCccccceeeeccceeeecccCCCCceeEEeeeeeeeeeeEEeCcccEEEE Confidence 0000 000 Q ss_pred -------ccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHh-hccCCcccccCcccCCCCceecceeeEE Q lcl|NC_012784. 264 -------KKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKM-KDKLGNYLIQPDVKEKTQQRLLGAKIEI 335 (415) Q Consensus 264 -------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l-kd~~G~~l~~~~~~~~~~~~l~G~pV~~ 335 (415) ......+...++-++.+...++.......+|+||.+....|++. .+++...+-...+.+...-.++|+||.. T Consensus 239 ~NIdvs~l~~~~~~~~li~lm~~A~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~k~n~~l~~~~~~g~~~t~~~gipir~ 318 (330) T protein:vir:10 239 CNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQR 318 (330) T ss_pred eecccccCCCCccHHHHHHHHHHHHHhccCCCCCcceeeechHHHHHHHHHHhhcccceeeeeecCCeeeEEECCeEEEE Confidence 00011111223444556677777788889999999999999974 5665544444455555556799999999 Q ss_pred eccccccccCCceEEE Q lcl|NC_012784. 336 LPDEVLGQKGNNTLII 351 (415) Q Consensus 336 ~~~~~~~~~~~~~~~~ 351 (415) ++.+...-+ .++ T Consensus 319 ~Dail~tE~----~vv 330 (330) T protein:vir:10 319 TDALLNTES----RVV 330 (330) T ss_pred EeeeecCcc----ccC Confidence 997654322 122 No 159 >protein:vir:107826 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996627;genbank:gi:45580761;genbank:GeneID:2767902 Probab=98.54 E-value=1.8e-08 Score=63.10 Aligned_cols=230 Identities=8% Similarity=0.001 Sum_probs=142.1 Q ss_pred hhhhhhcccccccce-eecchh-HHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCcccccccccccccccccc Q lcl|NC_012784. 116 RNDIQGGSLKTDSGF-VVIPEE-IVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVK 193 (415) Q Consensus 116 ~~~~~~~~~~~~~~~-~~vP~~-~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~ 193 (415) +.....+..+..... .+-|.. +...|++.+.+.++|+..+.......+++. .....++-|.+.|..-++..++ +.. T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~-~~~vrt~LP~~~fR~lN~g~~~-s~~ 78 (331) T protein:vir:10 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEH-KTTVRSGLPTGTWRKLNYGVQP-EKS 78 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccc-eeeEEeccCCchhhccCCccCc-ccc Confidence 000000111111111 111332 455799999999999999988776555532 2344466778888888888775 568 Q ss_pred cceeeEeeeeeEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc---------------- Q lcl|NC_012784. 194 PFFQLAYDINTHRGYFRISREAIEDAK--VNVLQELKLWMARTIAATRNKAIIDVITKGSTGST---------------- 255 (415) Q Consensus 194 ~f~~v~~~~~k~a~~~~iS~e~l~ds~--~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~---------------- 255 (415) ++.+++...+-+.+.+.|.+.+.+... .++...-.....+++.+.+...|++|+.+..+... T Consensus 79 tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~~~ 158 (331) T protein:vir:10 79 RTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQ 158 (331) T ss_pred eeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhcccccccccc Confidence 999999999999999999999887643 23344455668899999999999998644211100 Q ss_pred -------ccccccc----------------------------------------------------------------cc Q lcl|NC_012784. 256 -------SSGFEKE----------------------------------------------------------------GK 264 (415) Q Consensus 256 -------~~~~~~~----------------------------------------------------------------~~ 264 (415) ..+.... .. T Consensus 159 q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~ 238 (331) T protein:vir:10 159 NIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIA 238 (331) T ss_pred ceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEe Confidence 0000000 00 Q ss_pred cc--------cccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHh-hccCC-cccccCcccCCCCceecceeeE Q lcl|NC_012784. 265 KL--------EVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKM-KDKLG-NYLIQPDVKEKTQQRLLGAKIE 334 (415) Q Consensus 265 ~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l-kd~~G-~~l~~~~~~~~~~~~l~G~pV~ 334 (415) .. ..+..+..+-++.+...++.....+.+|+||.+....|++. .+... +.+-...+.+...-.++|+||. T Consensus 239 NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipir 318 (331) T protein:vir:10 239 NVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCR 318 (331) T ss_pred ccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCeeEE Confidence 00 11112233445566666666777789999999999999875 44433 3343345556666789999999 Q ss_pred EeccccccccCCceEEE Q lcl|NC_012784. 335 ILPDEVLGQKGNNTLII 351 (415) Q Consensus 335 ~~~~~~~~~~~~~~~~~ 351 (415) .++.+...-+ .++ T Consensus 319 ~~dai~~tE~----~Vv 331 (331) T protein:vir:10 319 RTDALLLTEA----RVV 331 (331) T ss_pred EeeeeecCcc----ccC Confidence 9997754322 122 No 160 >protein:vir:107388 Length: 331 # NCBI annotation: Bbp17 # Family: family:all:1903 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958686;genbank:gi:41179378;genbank:GeneID:2717182 Probab=98.54 E-value=1.8e-08 Score=63.10 Aligned_cols=230 Identities=8% Similarity=0.001 Sum_probs=142.1 Q ss_pred hhhhhhcccccccce-eecchh-HHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCcccccccccccccccccc Q lcl|NC_012784. 116 RNDIQGGSLKTDSGF-VVIPEE-IVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVK 193 (415) Q Consensus 116 ~~~~~~~~~~~~~~~-~~vP~~-~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~ 193 (415) +.....+..+..... .+-|.. +...|++.+.+.++|+..+.......+++. .....++-|.+.|..-++..++ +.. T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~-~~~vrt~LP~~~fR~lN~g~~~-s~~ 78 (331) T protein:vir:10 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEH-KTTVRSGLPTGTWRKLNYGVQP-EKS 78 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccc-eeeEEeccCCchhhccCCccCc-ccc Confidence 000000111111111 111332 455799999999999999988776555532 2344466778888888888775 568 Q ss_pred cceeeEeeeeeEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc---------------- Q lcl|NC_012784. 194 PFFQLAYDINTHRGYFRISREAIEDAK--VNVLQELKLWMARTIAATRNKAIIDVITKGSTGST---------------- 255 (415) Q Consensus 194 ~f~~v~~~~~k~a~~~~iS~e~l~ds~--~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~---------------- 255 (415) ++.+++...+-+.+.+.|.+.+.+... .++...-.....+++.+.+...|++|+.+..+... T Consensus 79 tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~~~ 158 (331) T protein:vir:10 79 RTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQ 158 (331) T ss_pred eeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhcccccccccc Confidence 999999999999999999999887643 23344455668899999999999998644211100 Q ss_pred -------ccccccc----------------------------------------------------------------cc Q lcl|NC_012784. 256 -------SSGFEKE----------------------------------------------------------------GK 264 (415) Q Consensus 256 -------~~~~~~~----------------------------------------------------------------~~ 264 (415) ..+.... .. T Consensus 159 q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~ 238 (331) T protein:vir:10 159 NIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIA 238 (331) T ss_pred ceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEe Confidence 0000000 00 Q ss_pred cc--------cccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHh-hccCC-cccccCcccCCCCceecceeeE Q lcl|NC_012784. 265 KL--------EVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKM-KDKLG-NYLIQPDVKEKTQQRLLGAKIE 334 (415) Q Consensus 265 ~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l-kd~~G-~~l~~~~~~~~~~~~l~G~pV~ 334 (415) .. ..+..+..+-++.+...++.....+.+|+||.+....|++. .+... +.+-...+.+...-.++|+||. T Consensus 239 NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipir 318 (331) T protein:vir:10 239 NVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCR 318 (331) T ss_pred ccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCeeEE Confidence 00 11112233445566666666777789999999999999875 44433 3343345556666789999999 Q ss_pred EeccccccccCCceEEE Q lcl|NC_012784. 335 ILPDEVLGQKGNNTLII 351 (415) Q Consensus 335 ~~~~~~~~~~~~~~~~~ 351 (415) .++.+...-+ .++ T Consensus 319 ~~dai~~tE~----~Vv 331 (331) T protein:vir:10 319 RTDALLLTEA----RVV 331 (331) T ss_pred EeeeeecCcc----ccC Confidence 9997754322 122 No 161 >protein:vir:98525 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996579;genbank:gi:45569510;genbank:GeneID:2767853 Probab=98.54 E-value=1.8e-08 Score=63.10 Aligned_cols=230 Identities=8% Similarity=0.001 Sum_probs=142.1 Q ss_pred hhhhhhcccccccce-eecchh-HHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCcccccccccccccccccc Q lcl|NC_012784. 116 RNDIQGGSLKTDSGF-VVIPEE-IVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVK 193 (415) Q Consensus 116 ~~~~~~~~~~~~~~~-~~vP~~-~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~ 193 (415) +.....+..+..... .+-|.. +...|++.+.+.++|+..+.......+++. .....++-|.+.|..-++..++ +.. T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~-~~~vrt~LP~~~fR~lN~g~~~-s~~ 78 (331) T protein:vir:98 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEH-KTTVRSGLPTGTWRKLNYGVQP-EKS 78 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccc-eeeEEeccCCchhhccCCccCc-ccc Confidence 000000111111111 111332 455799999999999999988776555532 2344466778888888888775 568 Q ss_pred cceeeEeeeeeEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc---------------- Q lcl|NC_012784. 194 PFFQLAYDINTHRGYFRISREAIEDAK--VNVLQELKLWMARTIAATRNKAIIDVITKGSTGST---------------- 255 (415) Q Consensus 194 ~f~~v~~~~~k~a~~~~iS~e~l~ds~--~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~---------------- 255 (415) ++.+++...+-+.+.+.|.+.+.+... .++...-.....+++.+.+...|++|+.+..+... T Consensus 79 tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~~~ 158 (331) T protein:vir:98 79 RTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQ 158 (331) T ss_pred eeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhcccccccccc Confidence 999999999999999999999887643 23344455668899999999999998644211100 Q ss_pred -------ccccccc----------------------------------------------------------------cc Q lcl|NC_012784. 256 -------SSGFEKE----------------------------------------------------------------GK 264 (415) Q Consensus 256 -------~~~~~~~----------------------------------------------------------------~~ 264 (415) ..+.... .. T Consensus 159 q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~ 238 (331) T protein:vir:98 159 NIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIA 238 (331) T ss_pred ceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEe Confidence 0000000 00 Q ss_pred cc--------cccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHh-hccCC-cccccCcccCCCCceecceeeE Q lcl|NC_012784. 265 KL--------EVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKM-KDKLG-NYLIQPDVKEKTQQRLLGAKIE 334 (415) Q Consensus 265 ~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l-kd~~G-~~l~~~~~~~~~~~~l~G~pV~ 334 (415) .. ..+..+..+-++.+...++.....+.+|+||.+....|++. .+... +.+-...+.+...-.++|+||. T Consensus 239 NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipir 318 (331) T protein:vir:98 239 NVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCR 318 (331) T ss_pred ccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCeeEE Confidence 00 11112233445566666666777789999999999999875 44433 3343345556666789999999 Q ss_pred EeccccccccCCceEEE Q lcl|NC_012784. 335 ILPDEVLGQKGNNTLII 351 (415) Q Consensus 335 ~~~~~~~~~~~~~~~~~ 351 (415) .++.+...-+ .++ T Consensus 319 ~~dai~~tE~----~Vv 331 (331) T protein:vir:98 319 RTDALLLTEA----RVV 331 (331) T ss_pred EeeeeecCcc----ccC Confidence 9997754322 122 No 162 >protein:vir:7324 Length: 335 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848215;genbank:gi:30387386;genbank:GeneID:2641870 Probab=98.46 E-value=1.7e-08 Score=63.12 Aligned_cols=231 Identities=9% Similarity=-0.054 Sum_probs=138.2 Q ss_pred hhhhhhccccccc-ceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCccccccccccccccccccc Q lcl|NC_012784. 116 RNDIQGGSLKTDS-GFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKP 194 (415) Q Consensus 116 ~~~~~~~~~~~~~-~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~ 194 (415) +........+... ...+-|......|++.+.+.++|+..+.......+++... ...++-|.+.|..-++..++ +..+ T Consensus 1 m~~~~~~a~TL~E~Akr~~~d~~~~~IIE~l~~tneIL~~lpf~e~N~~tg~~~-~vrt~LP~~~fR~lN~g~~~-s~~t 78 (335) T protein:vir:73 1 MALIGQTLPSLLDIYNRTDKNGRIARIVEQLAKTNDILTDAIYVPCNDGSKHKT-TIRAGIPEPVWRRYNQGVQP-TKTQ 78 (335) T ss_pred CCcCCCCchhHHHHHhhcCcchhHHHHHHHHhcCchHHhhcchhcccCCcccce-eEEEecCCchhhhcCCcccc-ccce Confidence 1111111112221 2223355566679999999999999888765544443322 22345577788887777765 4689 Q ss_pred ceeeEeeeeeEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc--------------- Q lcl|NC_012784. 195 FFQLAYDINTHRGYFRISREAIEDAK--VNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSS--------------- 257 (415) Q Consensus 195 f~~v~~~~~k~a~~~~iS~e~l~ds~--~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~--------------- 257 (415) +.+++...+-+.+.+.|-+.+.+... .++...-.....+++.+++...+++|+.+..+..... T Consensus 79 t~qvt~~l~ilgg~~eVDr~La~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGDsa~~p~~FdGL~kR~~~~st~~a~~ 158 (335) T protein:vir:73 79 TVPVTDTTGMLYDLGFVDKALADRSNNAAAFRVSENMGKLQGFNNKVARYSIYGNTDAEPEAFMGLAPRFNTLSTSKAAS 158 (335) T ss_pred EEEEEEEEEEecchhhhhHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCcCCChhhccchhhhhcCccccccCc Confidence 99999999999999999998776543 2345555556889999999999999855433211100 Q ss_pred -------cccc----c-------------------------------------c-------------------------- Q lcl|NC_012784. 258 -------GFEK----E-------------------------------------G-------------------------- 263 (415) Q Consensus 258 -------~~~~----~-------------------------------------~-------------------------- 263 (415) +... . + T Consensus 159 a~~iIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~d~~G~~y~~~~~~~~w~~Gl~i~d~r~vvR 238 (335) T protein:vir:73 159 AENVFSAGGSGSTNTSIWFMSWGENTAHMIYPEGMVAGFQHEDLGDDLVSDGNGGQFRAYRDEFKWDIGLSVRDWRSISR 238 (335) T ss_pred ccceeeccccccCceEEEEEEEcCCeeEEEcccCccccceeeeccceeeecCCCCEEeEEEeeeeeeeeeEEeCcccEEE Confidence 0000 0 0 Q ss_pred -ccc--------cccchhhHHHHHHHHH--HhhhhccCCCEEEEcHHHHHHHHHh-hccCCcccccCcccCCCCceecce Q lcl|NC_012784. 264 -KKL--------EVKKAKSLDDIKDAIN--LNVKPNYEHNVAIVSQTMFAKLDKM-KDKLGNYLIQPDVKEKTQQRLLGA 331 (415) Q Consensus 264 -~~~--------~~~~~~~~~~~~~~~~--~~~~~~~~~~~~v~~~~~~~~l~~l-kd~~G~~l~~~~~~~~~~~~l~G~ 331 (415) ... ...+....+.++.++. .++.......+|+||.+....|++. .++....+-...+.+...-.++|+ T Consensus 239 I~NIdvs~l~~d~~~~~~l~~lmi~a~~~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~l~~~~~~g~~~t~~~gi 318 (335) T protein:vir:73 239 ICNIDVTTLTKDASTGADLISMMVDAYYARDVAMLGDGKEVIYANKTIHAWLHKQAMNAKNVNLTIEEYGGKKIVSFLGI 318 (335) T ss_pred EeecccccccccccchhhHHhhHHHHHHHHhccCCCCCceEEEechHHHHHHHHHHhccCceeeeeeccCCceeEEECCe Confidence 000 0111222233344442 2333344558999999999999974 555555444445555555678999 Q ss_pred eeEEeccccccccCCceEEEe Q lcl|NC_012784. 332 KIEILPDEVLGQKGNNTLIIG 352 (415) Q Consensus 332 pV~~~~~~~~~~~~~~~~~~g 352 (415) ||..++.+...-+ .++. T Consensus 319 pir~~Dail~tE~----~v~~ 335 (335) T protein:vir:73 319 PIRRVDAILNTES----AVTA 335 (335) T ss_pred EEEEEeeeecCcc----cccC Confidence 9999997653322 1222 No 163 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=98.44 E-value=1.4e-07 Score=58.14 Aligned_cols=298 Identities=8% Similarity=-0.027 Sum_probs=149.4 Q ss_pred HHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecc---hhHHhHHHHHHhhhhhhhhcceeEE-ccCCceeEEEEe Q lcl|NC_012784. 96 IQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIP---EEIVTDILKLKEVEFNLDKYVTVKR-VTNGSGKYPVVR 171 (415) Q Consensus 96 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP---~~~~~~Ii~~~~~~~~l~~~~~~~~-~~~~~~~~~~~~ 171 (415) .........+........ ...........+.|+..- +.+.+.|++........+.++.+.. .+-...++.+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~---~~~~~~~da~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~ 77 (319) T protein:vir:10 1 MTTKKFDEADKSNVEMYL---IQAGVKQDAAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVTTELSPTDKTFEYMT 77 (319) T ss_pred CCCcchhHHhhHHHHHHH---hhccchhhhhhhhhhHHHHHHHHHHHHHHhhhhcceechhhcccccCCCCceEEEEeee Confidence 000000001111000000 000011111122232322 3455567777666666666655432 222223444444 Q ss_pred ecCCccccccccccc-ccccccccceeeEeeeeeEEEeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_012784. 172 QSEVAALEKVEELEE-NPELAVKPFFQLAYDINTHRGYFRISREAIEDA---KVNVLQELKLWMARTIAATRNKAIIDVI 247 (415) Q Consensus 172 ~~~~~~a~~v~Eg~~-~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds---~~~l~~~l~~~la~~~~~~~d~~il~g~ 247 (415) ......+.|++.++. +|.. +..+.......+.++..+.++..=++.+ ..++..--....+.++.+.+|+.++.|+ T Consensus 78 ~~~~G~a~~~~d~~~dip~v-~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~ 156 (319) T protein:vir:10 78 FDKVGTAQIIADYTDDLPLV-DALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQLVNRLVFKGS 156 (319) T ss_pred eccccceeeecCccccccce-eccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeec Confidence 555567778877654 5543 4667788888888888888876544433 4567777888889999999999999997 Q ss_pred ccccccccccccccccccc-------cccchhhHHHHHHHHHHhhhh---ccCCCEEEEcHHHHHHHHHhhccCCccccc Q lcl|NC_012784. 248 TKGSTGSTSSGFEKEGKKL-------EVKKAKSLDDIKDAINLNVKP---NYEHNVAIVSQTMFAKLDKMKDKLGNYLIQ 317 (415) Q Consensus 248 g~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~---~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~ 317 (415) ..-...+............ +.+....++|+..++.++... ...+..++|+|+.+..|.......|.-++. T Consensus 157 ~~~g~~GLlN~p~~~~~~~~~~~~~~t~t~~~i~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~t~l~ 236 (319) T protein:vir:10 157 APHKIVSVFNHPNITKITSGKWIDVSTMKPETAEAELTQAIETIETITRGQHRATNILIPPSMRKVLAIRMPETTMSYLD 236 (319) T ss_pred ccccceeEEeCCCceeeecCCCCCccccCHHHHHHHHHHHHHHHHHhcCceeeceEEEecHHHHHhhhcccCCCCeeHHH Confidence 6543323222211111100 011123456777777766532 345789999999999997655555543331 Q ss_pred CcccCCCCceecceeeEEecccc-ccccCCceEEEech-hhcEEEEeecceEEEEeecccCceEEEEEEEec-cEEeccc Q lcl|NC_012784. 318 PDVKEKTQQRLLGAKIEILPDEV-LGQKGNNTLIIGNL-KDAIVLFDRSQYQASWTDYMHFGECLMIAVRQD-CRILDYK 394 (415) Q Consensus 318 ~~~~~~~~~~l~G~pV~~~~~~~-~~~~~~~~~~~gd~-~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~d-~~v~~p~ 394 (415) -+... ..+..|+.+..+. .+..+...+++-.. .+++.+.....++...............+.|++ +.+.+|. T Consensus 237 -~lk~~----~~~l~I~~~pel~~ag~~g~~~~v~y~~~~~~~~~~v~~~~~~~~~e~~~l~~~~~~~~r~~Gv~i~~P~ 311 (319) T protein:vir:10 237 -YFKSQ----NSGIEIDSIAELEDIDGAGTKGVLVYEKNPMNMSIEIPEAFNMLPAQPKDLHFKVPCTSKCTGLTIYRPM 311 (319) T ss_pred -HHHHh----cCCceEEEeeeecccCCCcceEEEEEecCCceEEEecCcceeeeeeeecCceEEEeeeeeeEEEEEEccc Confidence 11111 1123333333332 12223333333222 222333222344433222222222334456665 4678899 Q ss_pred cEEEEEee Q lcl|NC_012784. 395 SAIVIEYD 402 (415) Q Consensus 395 a~~~~~~t 402 (415) ||++++=- T Consensus 312 ai~~~dGI 319 (319) T protein:vir:10 312 TIVLITGV 319 (319) T ss_pred eeEeeecC Confidence 99998833 No 164 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=98.41 E-value=2.4e-07 Score=56.88 Aligned_cols=286 Identities=9% Similarity=-0.008 Sum_probs=119.8 Q ss_pred cccccceeecchhHHhHHHHHHhhhhhhhhcce-------eEEccCCceeEEEEeec-CCc-cccccccccccccccccc Q lcl|NC_012784. 124 LKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVT-------VKRVTNGSGKYPVVRQS-EVA-ALEKVEELEENPELAVKP 194 (415) Q Consensus 124 ~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~-------~~~~~~~~~~~~~~~~~-~~~-~a~~v~Eg~~~~~~~~~~ 194 (415) .+.++--+ ..+.+....++.+.+.......++ ..++.+.-..+|+.... ++. ....+.+.+..+....-+ T Consensus 1 m~lsD~~v-fN~~~~~a~~e~~~q~~~~fn~as~gai~l~~~~~~Gd~~~~pf~~~l~g~~~~~~~~~~~~~vt~~kitt 79 (325) T protein:vir:95 1 MALSDLAV-YSEYAYSAFSETLRQQVDLFNTATGGAIMLQSAAHQGDFSDVAFFAKVTGGLVRRRNAYGSGTVAEKVLKH 79 (325) T ss_pred Cchhhhhh-hhhhhhhhhhhhhhhhHhhhhhcccceeEeccccccCceeeccccccccccccccccCCCCceeccceecc Confidence 00000001 112222222232222222222111 01111221223332211 111 111222333332222234 Q ss_pred ceeeEeeeeeEEEeehhhHHHHh---cchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccc-cccccccc Q lcl|NC_012784. 195 FFQLAYDINTHRGYFRISREAIE---DAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKE-GKKLEVKK 270 (415) Q Consensus 195 f~~v~~~~~k~a~~~~iS~e~l~---ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~-~~~~~~~~ 270 (415) +..+......-.+++....+.+. +....+...|.+.+++...+.+-+.++.+...... ......... ........ T Consensus 80 ~~~~av~~~r~~g~~~~d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~~a~~-~~~~~v~dis~~~~~~~~ 158 (325) T protein:vir:95 80 LVDTSVKVAAGTPPVRLDPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLNVGLGSVYSALS-QVSDVVYDATANTDAADK 158 (325) T ss_pred ccceeeEEecccCcccccHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-ccccceeeeecccCcccc Confidence 45555555554444444433322 23333444455555555444444444433321100 000000110 11111222 Q ss_pred hhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCcccCCCCceecceeeEEeccccccccCCce-- Q lcl|NC_012784. 271 AKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNT-- 348 (415) Q Consensus 271 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~-- 348 (415) ..+...+.++..++-+....=+.|+||..++..|.+..-.+...++..+-.. ...+++|++|+++|.+|....+... T Consensus 159 ~~s~~~l~~A~~klGD~~~~l~~~~MHS~v~~~L~~~~L~~~~~~~~~~g~~-~i~t~~G~~VIVdD~~p~~~~g~~~~y 237 (325) T protein:vir:95 159 LPTWNNLNNGQAKFGDQSSQIAAWIMHSTPMHKLYGSNLTNGERLFTYGTVN-VVRDPFGKLLVMTDSPNLFAAGTPNVY 237 (325) T ss_pred cccHHHHHHHHHHhcccccceeEEEEchHHHHHHHHhhccccccccccCCcc-cccccCCcEEEEeCCCCCCCccCceeE Confidence 3456888999999888877889999999999999986655544443322111 1247899999999999976544321 Q ss_pred --EEEechhhcEEEEeecceEEEEeecccCceEEEEEEEecc-EEeccccEEEEEee-cCCCCcccccccC Q lcl|NC_012784. 349 --LIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDC-RILDYKSAIVIEYD-DSERGEGDLGLEA 415 (415) Q Consensus 349 --~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~d~-~v~~p~a~~~~~~t-~~~~~~~~~~~~~ 415 (415) .+||. .++...+..+......+ ......+....|... -+++|..+..-+-. ...+.-++|.+.+ T Consensus 238 tty~lg~--GAi~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~tf~lhp~G~sw~~s~~g~sPt~aeL~~~~ 305 (325) T protein:vir:95 238 HILGLVP--GGVLIGQNNDFDANEET-KNGDENIIRTYQAEWSYNIGVKGFAWDKANGGKSPTDAALFTST 305 (325) T ss_pred EEEEEec--CeEEecCCCCccccccc-cCcccceeeeeeeeeeEEeecceeeeecccccCCcChHhhcCCc Confidence 23332 12222222222221111 111111111112222 46788887773211 1334445555555 No 165 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=98.33 E-value=3.3e-07 Score=56.12 Aligned_cols=274 Identities=12% Similarity=0.048 Sum_probs=145.8 Q ss_pred ccccccceeecc--hhHHhHHHHHHhhhhhhhhcceeEE-ccCCceeEEEEeecCCccccccccccc-ccccccccceee Q lcl|NC_012784. 123 SLKTDSGFVVIP--EEIVTDILKLKEVEFNLDKYVTVKR-VTNGSGKYPVVRQSEVAALEKVEELEE-NPELAVKPFFQL 198 (415) Q Consensus 123 ~~~~~~~~~~vP--~~~~~~Ii~~~~~~~~l~~~~~~~~-~~~~~~~~~~~~~~~~~~a~~v~Eg~~-~~~~~~~~f~~v 198 (415) ..+.+.|..+.- +.+.+.|++.+......+.++.+.. ++-....+.+........+.|++.++. +|. ....+... T Consensus 1 ~~~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~dip~-~~~~~~~~ 79 (301) T protein:vir:80 1 MQGKITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTRSGAAKIIANGADDLPL-VDVDMVRK 79 (301) T ss_pred CCccccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeeccceeEEEecCccccccc-ccccceeE Confidence 223333322221 2456677777777777777665432 222223344444455567778877664 453 34667888 Q ss_pred EeeeeeEEEeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccccccc---c------ Q lcl|NC_012784. 199 AYDINTHRGYFRISREAIEDA---KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKK---L------ 266 (415) Q Consensus 199 ~~~~~k~a~~~~iS~e~l~ds---~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~---~------ 266 (415) ....+.++.-+.++..=++.+ ..++..--....+.++.+.+|+.+++|+..-...+........... . T Consensus 80 ~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~~g~~GLlN~p~~~~~~~~~~~~~~~~ 159 (301) T protein:vir:80 80 SVPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKKYAIKGAFEATGIQIDVSPTTGVGNVS 159 (301) T ss_pred EEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeecccccceeeecCCCcccccccCccccccc Confidence 888888888888886544433 4567888888899999999999999997654322222221110000 0 Q ss_pred ---cccchhhHHHHHHHHHHhhhh---ccCCCEEEEcHHHHHHHHHh--hccCCcccccCcccCCCCceecceeeEEecc Q lcl|NC_012784. 267 ---EVKKAKSLDDIKDAINLNVKP---NYEHNVAIVSQTMFAKLDKM--KDKLGNYLIQPDVKEKTQQRLLGAKIEILPD 338 (415) Q Consensus 267 ---~~~~~~~~~~~~~~~~~~~~~---~~~~~~~v~~~~~~~~l~~l--kd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~ 338 (415) +.+....++|+.+++.++... ...+..++|+|+.+..|..- .+..|.-++. -+... ..+..|+.++. T Consensus 160 ~w~~~t~~ei~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~~~tvl~-~l~~~----~~~~~I~~~p~ 234 (301) T protein:vir:80 160 KWEKKTAEQIIDEIGEAHTKITVLPGYGTASLKLCLPPKQFELINKKRYSNEDSRSVLK-VLQDN----AWFSAIVRVPD 234 (301) T ss_pred ccccCCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHhhhhccccCCCCeeHHH-HHHHH----cCcceEEEcce Confidence 011222367777777776543 23568999999999999753 3454543331 11111 11122333333 Q ss_pred ccc-cccCCceE-EEechhhcEEEEeecceEEEEeecccCceEEEEEEEe-ccEEeccccEEEEEee Q lcl|NC_012784. 339 EVL-GQKGNNTL-IIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQ-DCRILDYKSAIVIEYD 402 (415) Q Consensus 339 ~~~-~~~~~~~~-~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~-d~~v~~p~a~~~~~~t 402 (415) +.. +..+...+ ++-+=.+.+.+...+.++........-....-.+.|+ ++.+.+|.||++++=- T Consensus 235 L~~~g~~g~~~~v~~~~~~d~~~~~v~~~~~~~~~e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 235 LAGMGTAGSDSFAVIHDSNETAELIIPMDITRHPEEYSFPRTKVPFEERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred eccCCCCcccEEEEEecCCcEEEEEecCceeeecceecCceeEeeeeeeeEEEEEEccceEEEEecC Confidence 221 22233222 2222122233322233332211111111112234666 4578899999999833 No 166 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=98.29 E-value=3e-07 Score=56.37 Aligned_cols=296 Identities=8% Similarity=-0.038 Sum_probs=143.9 Q ss_pred HHhhhhhHHHHHHHHHHhhhhhhhhccccccc-ceeecc--hhHHhHHHHHHhhhhhhhhcceeEEc-cCCceeEEEEee Q lcl|NC_012784. 97 QNTKVTSQEVRDFTEYLETRNDIQGGSLKTDS-GFVVIP--EEIVTDILKLKEVEFNLDKYVTVKRV-TNGSGKYPVVRQ 172 (415) Q Consensus 97 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~vP--~~~~~~Ii~~~~~~~~l~~~~~~~~~-~~~~~~~~~~~~ 172 (415) ...+.. .+... ............... +.+++. +.+.+.|++.......-+.++.+... +-..-++.+... T Consensus 1 ~~~~~~-~~~~~-----~~~~~~~~~~~~~d~~~~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~~~~~~~~et~~~~~~ 74 (314) T protein:vir:10 1 MAIKFD-AEQAK-----ITTHLEQMGVEKADAAGIWAVSQLTAALNRAYEKEYAENSVVNIFPVTNEIPGHAKYFEYPEF 74 (314) T ss_pred CccchH-HHHHH-----HHHHHHhhcccchhhhHHHHHHHHHHHHHHHhhhhccccccceeeccccCCCCceeEEEeeee Confidence 000111 00000 000111111122222 233332 34555677665555544444443211 111123444445 Q ss_pred cCCccccccccccc-ccccccccceeeEeeeeeEEEeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_012784. 173 SEVAALEKVEELEE-NPELAVKPFFQLAYDINTHRGYFRISREAIEDA---KVNVLQELKLWMARTIAATRNKAIIDVIT 248 (415) Q Consensus 173 ~~~~~a~~v~Eg~~-~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds---~~~l~~~l~~~la~~~~~~~d~~il~g~g 248 (415) .....+.|++.++. +|.. +..+.......+.++..+.++..=++-+ ..++..--....+.++.+.+|+.++.|+. T Consensus 75 e~~G~a~~~~d~~~dip~v-d~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~ 153 (314) T protein:vir:10 75 DGVGIAQIIADYSDDLPLV-DAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDKLVWSGSA 153 (314) T ss_pred ccccceeeeCCccccccee-ecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecc Confidence 55567778887654 5543 4677888888888998888875544333 44677888888889999999999999865 Q ss_pred ccccccccccccccccccc---ccchhhHHHHHHHHHHhhhh---ccCCCEEEEcHHHHHHHHHhhccCCcccccCcccC Q lcl|NC_012784. 249 KGSTGSTSSGFEKEGKKLE---VKKAKSLDDIKDAINLNVKP---NYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKE 322 (415) Q Consensus 249 ~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~---~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~ 322 (415) .-...|............. .+....++|+..++.++... ...+..++|+|..+..|...-+.+|.-++. -+.. T Consensus 154 ~~g~~GLlN~p~v~~~~~~~~WaT~~ei~~Di~~~~~~l~~~s~g~~~p~~l~Lpp~~~~~L~~~~~~~~~tvl~-~l~~ 232 (314) T protein:vir:10 154 PHGIVSVFDQPNINNVVATPNWSVPQNAIDDVTAMIDAVESSTQGLHHVTDILLPASARRVMQGLVPQTNLSYGE-LFTR 232 (314) T ss_pred cccceeEeecCCCccccCCCCcccHHHHHHHHHHHHHHHHHhcCccccceeEEecHHHHHhhcccccCCCccHHH-HHHH Confidence 5432222222111111110 12223477888888777653 245678999999998886544444433321 1111 Q ss_pred -CCCceecceeeEEeccccccccCCceEEEe-chhhcEEEEeecceEEEEeecccCceEEEEEEEec-cEEeccccEEEE Q lcl|NC_012784. 323 -KTQQRLLGAKIEILPDEVLGQKGNNTLIIG-NLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQD-CRILDYKSAIVI 399 (415) Q Consensus 323 -~~~~~l~G~pV~~~~~~~~~~~~~~~~~~g-d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~d-~~v~~p~a~~~~ 399 (415) ..+-+|.+.|-... .+..+...+++- +=.+.+.+.....++...............+.|++ +.+.+|.||+++ T Consensus 233 n~~~l~I~~~~el~~----ag~~g~~~~v~y~~~~~~~~~~vp~~~~~l~~e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~ 308 (314) T protein:vir:10 233 NNPGLTIRFLQFLDN----YDGAGGKAALAFEKSPLNMSIEIPEVTNVLPAQPKDLHFRYPVTSKATGLIVYRPLTMAVI 308 (314) T ss_pred hCCCcEEEEcccccc----cCCCcceEEEEEecCCcEEEEecCccceeecceecCceEEEcceeeeEEEEEECcceeEee Confidence 11123333332221 222233333222 21222222212233322111111122233455764 568889999987 Q ss_pred E-eecC Q lcl|NC_012784. 400 E-YDDS 404 (415) Q Consensus 400 ~-~t~~ 404 (415) + +|=+ T Consensus 309 dGI~~~ 314 (314) T protein:vir:10 309 KGITFA 314 (314) T ss_pred eeeecC Confidence 7 3322 No 167 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=98.16 E-value=1.1e-06 Score=53.28 Aligned_cols=280 Identities=11% Similarity=-0.032 Sum_probs=145.9 Q ss_pred hhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCC-cccccccccccccccccccc Q lcl|NC_012784. 117 NDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEV-AALEKVEELEENPELAVKPF 195 (415) Q Consensus 117 ~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~-~~a~~v~Eg~~~~~~~~~~f 195 (415) .......-+ +...+..-.++.+.|...-....|+..++.....++. .+.|....-. +......||...+... ..- T Consensus 1 ma~~~~~~~-t~~~~g~~~dl~~~I~~isp~dTPf~S~i~~~~a~~~--~~~W~~d~l~~~~~~~~~EG~da~~~~-~~~ 76 (317) T protein:vir:88 1 MATPTNAVS-TVEINGKREDLIDIIYNIAPYDTPFMSAIGKGVATAI--THEWQTDELRQPGKNTRVEGEDATIKA-GSF 76 (317) T ss_pred CCccccceE-eeeeeeeeechhhhheecCCccCcceeeecCceeccc--EEEEEeeecCCccccccccCccccccc-ccC Confidence 111111111 1222334567888888888888999988876555433 3444433322 3334556887655322 111 Q ss_pred eeeEeee-eeEEEeehhhHHHHhcchH---HHHHHHHHHHHHHHHHHHHHHHhhcccc-----ccccccccccccc---- Q lcl|NC_012784. 196 FQLAYDI-NTHRGYFRISREAIEDAKV---NVLQELKLWMARTIAATRNKAIIDVITK-----GSTGSTSSGFEKE---- 262 (415) Q Consensus 196 ~~v~~~~-~k~a~~~~iS~e~l~ds~~---~l~~~l~~~la~~~~~~~d~~il~g~g~-----~~~~~~~~~~~~~---- 262 (415) ....-+. +-+...+.||.-+..-... +...|-...-...+.+-+|..+|+|.-. ........+.... T Consensus 77 r~~~~N~tQIf~k~v~VSgTa~av~~~G~~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~~Gl~~~i~t~ 156 (317) T protein:vir:88 77 TTMLNNYCQISDETLQVTGTADRVKKAGRKNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQMANIFAYYKTN 156 (317) T ss_pred CEEeccEEEEEEeEEEEeehhhhhhhcCccchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhhhhHHHHhccC Confidence 2222121 1223344444433322222 3333433444566788899999998632 1111111111000 Q ss_pred -------------c---ccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCcc---cCC Q lcl|NC_012784. 263 -------------G---KKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDV---KEK 323 (415) Q Consensus 263 -------------~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~---~~~ 323 (415) . .+.......+-+++.+++.++-..+..+..++++|.....|.++-..++.++..+.- .+. T Consensus 157 ~~~~~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~~Gg~~~~i~v~a~~k~~i~~~~~~~~~~i~~~~~~~~~g~ 236 (317) T protein:vir:88 157 GSLGANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIWRNGGQANSIQTSSSIKKAISKNMKGRATEITLDASDNRIAQ 236 (317) T ss_pred ceeccCccccccCCCccccccccccccHHHHHHHHHHHHhcCCCCCEEEeChHHHHHHHHHhcCCceeEEEcccCeEEEE Confidence 0 011111234567888899898889988899999999999999885434444422110 000 Q ss_pred CCcee---cc-eeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecccCceEEEEEEEeccEEeccccEEEE Q lcl|NC_012784. 324 TQQRL---LG-AKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVI 399 (415) Q Consensus 324 ~~~~l---~G-~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~d~~v~~p~a~~~~ 399 (415) ....+ +| ++++.+.+||.+ .+++.|+.. +.+.--.++..+.---..+.....++..+++.+.+|.|...+ T Consensus 237 ~v~~~~tdfG~v~ii~~r~lp~~-----~~~~~D~~~-~~l~~Lr~~~~e~laKtGd~~k~~i~~E~tLe~~N~~a~a~i 310 (317) T protein:vir:88 237 TVDVYESDFGKYTIRANRWFHEN-----TLFVFDPKM-HSLCYLRPFFQHELAKTGDSEKRQLLVEYTFRVNNEKSGALI 310 (317) T ss_pred EEEEEEeCCeEEEEEeCCCCCCC-----eEEEEcccc-cceeecccceeeccCCCcccceeEEEEEEEEEEcCccceeEE Confidence 00000 12 355666666643 467788764 333221223222211233445567888999999999999888 Q ss_pred EeecCCC Q lcl|NC_012784. 400 EYDDSER 406 (415) Q Consensus 400 ~~t~~~~ 406 (415) .-.+++- T Consensus 311 ~~l~~~~ 317 (317) T protein:vir:88 311 RDVVAQL 317 (317) T ss_pred EEecccC Confidence 8555444 No 168 >protein:vir:93966 Length: 400 # NCBI annotation: structural protein # Family: family:all:2417 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764320;genbank:gi:115315634;genbank:GeneID:5176553 Probab=98.15 E-value=9.1e-08 Score=59.18 Aligned_cols=371 Identities=14% Similarity=0.134 Sum_probs=172.3 Q ss_pred CCh--HHHHHHHHHHHHHHHHHHHHHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccc Q lcl|NC_012784. 1 MKT--KEELQSEISDIKRQIDLKVKYATRA---LNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSV 75 (415) Q Consensus 1 Mk~--~~el~~~l~~l~~~~~~~~~~~~~~---~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 75 (415) ||+ ++|....+.++++.+......+... -.=+++.++++++.-+.++.-+|..++..++.+++......... T Consensus 8 ~~K~~l~EK~~~~a~~~E~~~~LKS~~~G~evknaiedl~K~~EL~~TlS~~~iEI~~~en~LNa~~E~~KGK~kMt--- 84 (400) T protein:vir:93 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKMT--- 84 (400) T ss_pred cccchHHHHHHHHhhhhhhhhhhhhhhhcchhhhhhhhchhHHHHHHhHhhcchhhhhhhhhhhhhhhhhhhhHHHH--- Confidence 776 5666666777766665555444322 12244667788888888888888888877765544332221110 Q ss_pred cccchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcc Q lcl|NC_012784. 76 EVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYV 155 (415) Q Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~ 155 (415) .--+.......++.-...+...+.....|... ....+.+.++....+|..+.-.|-..+.++.+++... T Consensus 85 -----~~i~sq~A~~eF~~vL~~N~G~S~~k~AW~A~------L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vf 153 (400) T protein:vir:93 85 -----NFIESQNAVTEFFDVLKKNSGKSEIKNAWSAK------LAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVF 153 (400) T ss_pred -----HHHhhHHHHHHHHHHHhccCCchhhhhhhhhh------HhhcCcceeccchhccHHHHHHHHHhhhccCcceeee Confidence 11122223333444333333333222222211 1222344556667889999999999999999998754 Q ss_pred eeEEccCCceeEEEEeecCC-cccccccccccccccccccceeeEeeeeeEEEeehhhH-HHHhc---chHHHHHHHHHH Q lcl|NC_012784. 156 TVKRVTNGSGKYPVVRQSEV-AALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISR-EAIED---AKVNVLQELKLW 230 (415) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~-~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~-e~l~d---s~~~l~~~l~~~ 230 (415) .+.+.+ .|.+.+.... ..+.-+..|.++.+. .|..+..+..-+++|+..|- ++..+ +-..+..|+..+ T Consensus 154 HVT~~~----~~~V~~s~~s~~~Aq~HkdGqTK~eq---a~~~~~~Tl~~~~VY~~~S~Ae~~K~~~~sYsel~N~i~~E 226 (400) T protein:vir:93 154 HVTNVG----ALLVSRSFDSANEAQVHKDGQTKTEQ---AATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAE 226 (400) T ss_pred eeccch----hhhHHhhhhhhhhhhhhccCCccccc---eeeeeeechhHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHH Confidence 443332 3333222222 245555666666543 34444444444555555553 23333 333468999999 Q ss_pred HHHHHH-HHHHHHHhhcccccccccccc--cc---ccccccccccchhhH-HHHHHHHHHhhhhccCCCEEEEcHHH-HH Q lcl|NC_012784. 231 MARTIA-ATRNKAIIDVITKGSTGSTSS--GF---EKEGKKLEVKKAKSL-DDIKDAINLNVKPNYEHNVAIVSQTM-FA 302 (415) Q Consensus 231 la~~~~-~~~d~~il~g~g~~~~~~~~~--~~---~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~v~~~~~-~~ 302 (415) |+.++. +..|.+++-|+|+++-...-. .. ........+.+...+ |.+..+..-..+...+ -.+++...+ .+ T Consensus 227 LtQ~~vnk~Vd~AlV~GDG~N~f~~~DK~advK~I~~~Ttkaksagktpfadaieeavdfvrptagr-rylivktedrka 305 (400) T protein:vir:93 227 LTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGR-RYLIVKTEDRKA 305 (400) T ss_pred HHHHHHHHHHHhhhheecCCCCccchhhHHHHHHHHHHhhhhhhcCCCchhHHHHHHHhhhccCCCc-eEEEEeccchHH Confidence 999999 888999999999876322211 00 000111112222333 3333333332222222 234444444 44 Q ss_pred HHHHhhccCCccc--ccCcccCCCCceeccee-eEEeccccccccCCceEEEechhhcEEEEeecceE-EEEeecccCce Q lcl|NC_012784. 303 KLDKMKDKLGNYL--IQPDVKEKTQQRLLGAK-IEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQ-ASWTDYMHFGE 378 (415) Q Consensus 303 ~l~~lkd~~G~~l--~~~~~~~~~~~~l~G~p-V~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~-i~~~~~~~~~~ 378 (415) .|..|+-++.+-- ...+-.. ...--|.. +++.. |...-++-++-|-+ |.+ +-++++ +......++.. T Consensus 306 lldelrqatanahvriknddae--iasevgvdeiivyt----gskalkptvlvdqk--yhi-dmqdltkvdafewktnsn 376 (400) T protein:vir:93 306 LLDELRQATANAHVRIKNDDAE--IASEVGVDEIIVYT----GSKALKPTVLVDQK--YHI-DMQDLTKVDAFEWKTNSN 376 (400) T ss_pred HHHHHHhhccccceEeecchhh--hhhhcCcceeeeee----ccccccceeeeccc--ccc-chhhhhhhhhheeccCCc Confidence 4555654443221 1111000 00001111 11111 11111111222211 111 112222 11112222322 Q ss_pred EEEEEEEeccEEeccccEEEEEee Q lcl|NC_012784. 379 CLMIAVRQDCRILDYKSAIVIEYD 402 (415) Q Consensus 379 ~~~~~~r~d~~v~~p~a~~~~~~t 402 (415) -+.++..-.|-|-.-+|-+.++++ T Consensus 377 milvetltsghvetynagavitvs 400 (400) T protein:vir:93 377 MILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred eEEEeecccCcceeeccceeEeeC Confidence 233333444444444455555555 No 169 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=98.13 E-value=3.3e-06 Score=50.63 Aligned_cols=271 Identities=10% Similarity=0.008 Sum_probs=128.5 Q ss_pred cccccceeecchhHHhHHHHHHhhhhhhhhcceeEEcc--CCc-eeEEEEeecCCcccccccccccccccccccceeeEe Q lcl|NC_012784. 124 LKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVT--NGS-GKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAY 200 (415) Q Consensus 124 ~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~--~~~-~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~ 200 (415) ........+-|+.++.++++.+++..++..++....-. ... ..+.++.. .... +.++..+.. .+.+=+.+.+ T Consensus 1 m~~~~N~~ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp-~~~~---v~dg~~~~~-~~~te~~v~l 75 (418) T protein:vir:10 1 MAVQDNNLLTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKLP-YRVK---SASGRTLVK-QPMVDQTIPF 75 (418) T ss_pred CCccccccccHHHHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEeeC-Ccee---ecccCCccc-cccccceEEE Confidence 22223344559999999999999999887776541111 111 13444332 1212 223333321 2233344445 Q ss_pred ee--eeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhhHHHHH Q lcl|NC_012784. 201 DI--NTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIK 278 (415) Q Consensus 201 ~~--~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 278 (415) +. +|+ .-+.|+.+=...+..++.+.+.+....+++..+|..++.-. .+.+. ... +.......|++++ T Consensus 76 ~id~~k~-~~~~itD~e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l~-~~a~~--------~~g-t~gt~~~~~~~i~ 144 (418) T protein:vir:10 76 KIAYQEH-VGLEYTVKDKTLDIMQFSERYLKSGMVQIANQIDRSLALTL-KKAFH--------SSG-TPGVRPGAFIDFA 144 (418) T ss_pred EEecccc-cceeechHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH-hhccc--------ccc-cCCcCcchHHHHH Confidence 44 333 34455555444555688888889999999999999877432 22110 000 0112234589999 Q ss_pred HHHHHhhhhccCC-C-E-EEEcHHHHHHHHHhhccCCccccc-----CcccCCCCceecceeeEEeccccccccCC---c Q lcl|NC_012784. 279 DAINLNVKPNYEH-N-V-AIVSQTMFAKLDKMKDKLGNYLIQ-----PDVKEKTQQRLLGAKIEILPDEVLGQKGN---N 347 (415) Q Consensus 279 ~~~~~~~~~~~~~-~-~-~v~~~~~~~~l~~lkd~~G~~l~~-----~~~~~~~~~~l~G~pV~~~~~~~~~~~~~---~ 347 (415) ++...+....... . . .+++|..+..|.+ +. +..+. ..+..+..++|.|+.|+.++++|..+.+. . T Consensus 145 ~a~~~Ld~~~VP~~G~R~lVv~P~~~~~L~~--~~--~~~~~~~~~~~~lr~G~IG~i~GF~V~~S~nip~~tag~~~~t 220 (418) T protein:vir:10 145 NAGAKQTTYAVPQDGMRHAVLDPFTCASLSD--EV--TKLFKESMVEQAYKMGYRGNVAAYEVYESQNLPKHTVGDHGGT 220 (418) T ss_pred HHHHHHHhcCCCCCCceEEEeCHHHHHHHhh--hc--cccccccccchhhheeeeeeeeceEEEEecCCCcccccccccc Confidence 9888887777653 2 4 5799998877653 22 22221 12445667889999999999999654443 1 Q ss_pred eEEEechhhcEEEEeecce-----EEEEeecccCceEEEEEEEeccE-EeccccEEE-------------EEeecCC--- Q lcl|NC_012784. 348 TLIIGNLKDAIVLFDRSQY-----QASWTDYMHFGECLMIAVRQDCR-ILDYKSAIV-------------IEYDDSE--- 405 (415) Q Consensus 348 ~~~~gd~~~~~~~~~~~~~-----~i~~~~~~~~~~~~~~~~r~d~~-v~~p~a~~~-------------~~~t~~~--- 405 (415) ..+.|-...+..+....+. .+..-|...+.-.+ ..-++... ..++.-|+. +++.++- T Consensus 221 ~~v~ga~~~~~~~~~~~~t~s~~g~l~~Gd~~ti~gv~-~v~~~t~~~~~~~~~f~V~~~~~~~~~~~~tv~i~p~~~~~ 299 (418) T protein:vir:10 221 PLVNGTVVNGDTVGFDGGTASTTGFLKAGDVITFGGVF-GVNPQNYETTGLLQEFVVLEDVDTDAGGAGSIKISPSLNDG 299 (418) T ss_pred eeeecccccceeEEEeecceeeccceeeccEEEECcee-ecccccccccccceEEEEEeeccccccCcceeEeccccccc Confidence 2333432222222110000 00000100000000 00000000 001222221 1111110 Q ss_pred -----C---------CcccccccC Q lcl|NC_012784. 406 -----R---------GEGDLGLEA 415 (415) Q Consensus 406 -----~---------~~~~~~~~~ 415 (415) . .+.++.+.. T Consensus 300 ~~~~~~~~~~~~~~~~~~~v~a~~ 323 (418) T protein:vir:10 300 TATINNENGDPVSLTAYQNVTALP 323 (418) T ss_pred cccccccccccccccCCCcccccc Confidence 0 111111111 No 170 >protein:vir:96792 Length: 315 # NCBI annotation: major capsid protein # Family: family:all:47 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224246;genbank:gi:62362381;genbank:GeneID:3345731 Probab=98.07 E-value=3.6e-06 Score=50.43 Aligned_cols=275 Identities=11% Similarity=0.023 Sum_probs=111.7 Q ss_pred hhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEcc---C-CceeEEEEee--cCCc-cccccccccccccccc Q lcl|NC_012784. 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVT---N-GSGKYPVVRQ--SEVA-ALEKVEELEENPELAV 192 (415) Q Consensus 120 ~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~---~-~~~~~~~~~~--~~~~-~a~~v~Eg~~~~~~~~ 192 (415) +.. +..+-=.+.-+.+....++.+.+.....+.+.--++. . -.+.+..... ..+. ..-.+.-.+......- T Consensus 1 ~~~--t~~sdl~vfn~~~~~a~~e~~~~~~~~Fnaas~Gai~l~~~~~~GDf~~~~ff~i~~~~~~rnv~~~~~~t~~ki 78 (315) T protein:vir:96 1 MAT--TVNSDLVIYNDTAQTAYLERNMDNLAVFNENSRAAIGLNSELIEGDLKLRSFYKVGGAIADRDVNSTATVAGTKI 78 (315) T ss_pred Cce--eeecceeeehhhhhhhHHhhhHHHHHHhhhhcCCcccccccccccccccccccccccchhhcccCCCccccceec Confidence 111 1111111233344455555555544333322211110 0 0122211110 0110 0001111111111111 Q ss_pred ccceeeEeeeeeEEEeehh--hHHHHh---cchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccccccccc Q lcl|NC_012784. 193 KPFFQLAYDINTHRGYFRI--SREAIE---DAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLE 267 (415) Q Consensus 193 ~~f~~v~~~~~k~a~~~~i--S~e~l~---ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~ 267 (415) -....+.++. ..+.-++ +.+.+. +.+.....-|...+..++.+..=...+.+..... .. ........ T Consensus 79 t~~~dvaVk~--~~~~~~~~~~~~~~a~~g~dp~~~~~~i~~~~~~~~l~~~l~~~l~~~~aai----~~--~t~~~~~~ 150 (315) T protein:vir:96 79 AADEMVSVKV--PWKYGPYETTEEAFKRRARSPEEFSMLIGQDMADATMAGWIGYALNALQGAI----GS--NAGMNVSG 150 (315) T ss_pred ccccceeEEE--eecCCchhccHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhh----cc--cccccccc Confidence 1222333322 2222333 333333 3333333434444444444433333333322110 00 00111123 Q ss_pred ccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCc---ccCCCCceecceeeEEecccccccc Q lcl|NC_012784. 268 VKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPD---VKEKTQQRLLGAKIEILPDEVLGQK 344 (415) Q Consensus 268 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~---~~~~~~~~l~G~pV~~~~~~~~~~~ 344 (415) .....+...+.++..++-+....=+.|+||..++..|.+ +.=. ..++... ..+..+. .+|+||+++|.||... T Consensus 151 ~~a~~~~~~l~dA~~klGD~~~~l~~~vMHS~v~~~L~~-q~L~-~~~~~~~~~~~~~~~~~-~lGkrViVdD~~P~~~- 226 (315) T protein:vir:96 151 ELATEGKKVLTKGLRTMGDKASSIAIWVMDSTSYFDIVD-EAID-NKLYEEAGVVVYGGTPG-TLGKPVLVTDQCPATK- 226 (315) T ss_pred cccccCHHHHHHHHHHhcccccCeeEEEEchHHHHHHHH-hhhh-hhcccccceeEecCcCc-ccccEEEEECCCCcce- Confidence 345567788899999988888888999999999999986 2111 1222111 1122233 4599999999999632 Q ss_pred CCceEEEechhhcEEEEeecceEEEEeecc-cCceEEEEEEEecc-EEeccccEEEEEeecCCCCcccccccC Q lcl|NC_012784. 345 GNNTLIIGNLKDAIVLFDRSQYQASWTDYM-HFGECLMIAVRQDC-RILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) Q Consensus 345 ~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-~~~~~~~~~~r~d~-~v~~p~a~~~~~~t~~~~~~~~~~~~~ 415 (415) ...+|. .++....... ..+..+. .....+....|..+ -+++|..|..-+.+...+.-.+|.+.+ T Consensus 227 ---~~gl~~--GAi~~~~~~~--~~~~~~~~~g~e~l~~~~r~e~tf~l~p~G~sw~~~~~~sPt~aeLat~~ 292 (315) T protein:vir:96 227 ---IFGLVA--GAVMITESQA--PGMRSYQIDDQENLAIGFRAEGTANVEVLGYKWKTKTNVNPASATLATTT 292 (315) T ss_pred ---eeeeec--ceeeecCCCc--cccccccCCCcceeEEEEeeeeEeeeeeeeEEeecCCCcCCChHHhcCCc Confidence 112222 1122221111 1111111 12233444444444 356777766644333444555555555 No 171 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=98.06 E-value=1.5e-06 Score=52.46 Aligned_cols=303 Identities=8% Similarity=-0.023 Sum_probs=145.8 Q ss_pred HHHhhhhhHHHHHHHHHHhh-hhhhh-hcccc-cccceeecc---hhHHhHHHHHHhhhhhhhhcceeEE-ccCCceeEE Q lcl|NC_012784. 96 IQNTKVTSQEVRDFTEYLET-RNDIQ-GGSLK-TDSGFVVIP---EEIVTDILKLKEVEFNLDKYVTVKR-VTNGSGKYP 168 (415) Q Consensus 96 ~~~~~~~~~~~~~~~~~~~~-~~~~~-~~~~~-~~~~~~~vP---~~~~~~Ii~~~~~~~~l~~~~~~~~-~~~~~~~~~ 168 (415) .++.......... ...+.. ..... ..... ..+.+...- +.+.+.|++........+.++.+.. .+-...++. T Consensus 1 ~~~~~~~~~~~~d-~~~~~~~a~~~~~~~~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~~~~~~~~~~t 79 (329) T protein:vir:79 1 MRGNIMSKEMKYD-EFEANVIANHMQLRGAKNDASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPVTSELSDTDKTFE 79 (329) T ss_pred Cccchhhhhhccc-hhhhhhHhhhcccccceeccchhhHHHHHHHHHHHHHHHhhhhcccchhhhcccccCCCCceeEEE Confidence 1111111111110 000000 01111 11111 111222222 3356778877777776666665432 222223445 Q ss_pred EEeecCCcccccccccc-cccccccccceeeEeeeeeEEEeehhhHHHHhc---chHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_012784. 169 VVRQSEVAALEKVEELE-ENPELAVKPFFQLAYDINTHRGYFRISREAIED---AKVNVLQELKLWMARTIAATRNKAII 244 (415) Q Consensus 169 ~~~~~~~~~a~~v~Eg~-~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~d---s~~~l~~~l~~~la~~~~~~~d~~il 244 (415) +........+.|++.++ .+|. .+..+..-....+.++..+.++..=++- ...++..--....+.++.+.+|+-++ T Consensus 80 ~~~~~~~G~a~~~~d~~~dip~-vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f 158 (329) T protein:vir:79 80 YQTFDKVGHAKIIADYTDDLST-VDALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQLVNHLVF 158 (329) T ss_pred eeeeecceeeeeecCcccccce-eecccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEE Confidence 55555556777887654 4553 3456677777888888888887554433 34567888888888999999999999 Q ss_pred hcccccccccccccccccccccc---------ccchhhHHHHHHHHHHhhhh--c-cCCCEEEEcHHHHHHHHHhhccCC Q lcl|NC_012784. 245 DVITKGSTGSTSSGFEKEGKKLE---------VKKAKSLDDIKDAINLNVKP--N-YEHNVAIVSQTMFAKLDKMKDKLG 312 (415) Q Consensus 245 ~g~g~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~--~-~~~~~~v~~~~~~~~l~~lkd~~G 312 (415) +|+..-...+............. .+....++|+.+++.++... + ..+..++|+|+.+..|..-....| T Consensus 159 ~G~~~~g~~GLlN~p~v~~~~~~~~~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~~~p~~L~Lpp~~~~~L~~~~~~~~ 238 (329) T protein:vir:79 159 KGSKPHKIISVFEHPNLTTINSAGWNNAAGTGKKPETAQDELEQAIEKIETLTNGQHRANMILIPPSMRKVLMVRMPETT 238 (329) T ss_pred eecccccceeeecCCCccccccCCCCCccccccCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHHhhcccCCCC Confidence 99765433232222111111110 11122456777777777643 2 346789999999998875555555 Q ss_pred cccccCcccCCCC-ceecceeeEEeccccccccCCceEEEechh-hcEEEEeecceEEEEeecccCceEEEEEEEec-cE Q lcl|NC_012784. 313 NYLIQPDVKEKTQ-QRLLGAKIEILPDEVLGQKGNNTLIIGNLK-DAIVLFDRSQYQASWTDYMHFGECLMIAVRQD-CR 389 (415) Q Consensus 313 ~~l~~~~~~~~~~-~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~-~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~d-~~ 389 (415) .-++. -+....+ -+|-+.|-.. ..+..+...+++-+.+ +.+.+.....++........-......+.|++ +. T Consensus 239 ~tvl~-~lk~~~~~l~I~~~~el~----~ag~~g~~~~v~y~~~~~~~~~~vp~~~~~l~~q~~~~~~~v~~~~r~~Gv~ 313 (329) T protein:vir:79 239 MSYLD-YFKQQNGGITIESISELE----DIDGAGTKAALVYEKDPMNMSIEIPEAFNMLTAQPKDLHFKVPCTSKCTGLT 313 (329) T ss_pred ccHHH-HHHHhCCCcEEEEccccc----ccCCCCceEEEEEecCCceEEEecCcceeeeeceecCceEEEceeeeEEEEE Confidence 43321 1111111 1233333211 1222333333333222 22222222333332221111112223345665 46 Q ss_pred EeccccEEEEEeecCCCC Q lcl|NC_012784. 390 ILDYKSAIVIEYDDSERG 407 (415) Q Consensus 390 v~~p~a~~~~~~t~~~~~ 407 (415) +.+|.||++++=--. | T Consensus 314 i~~P~ai~~~dGI~~--~ 329 (329) T protein:vir:79 314 IYRPLTLVLIKGLVV--G 329 (329) T ss_pred EECcceeeeeeeeee--C Confidence 788999999872111 1 No 172 >protein:vir:1663 Length: 393 # NCBI annotation: unknown # Family: family:all:2417 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044952;genbank:gi:9629659;genbank:GeneID:1261309 Probab=98.00 E-value=1.6e-07 Score=57.85 Aligned_cols=371 Identities=14% Similarity=0.140 Sum_probs=167.9 Q ss_pred CChHH--HHHHHHHHHHHHHHHHHHHH---HHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccc Q lcl|NC_012784. 1 MKTKE--ELQSEISDIKRQIDLKVKYA---TRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSV 75 (415) Q Consensus 1 Mk~~~--el~~~l~~l~~~~~~~~~~~---~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 75 (415) |++.+ |.+.+++++++.-.....++ +-.-.=+++.++++++.-+.++.-+|..++..++..++........ T Consensus 1 mnkpdliekqnrlaelkennvslksqisgfevknaiedl~K~~ELe~TlSe~~iEI~k~en~LN~~eE~~KGK~kM---- 76 (393) T protein:vir:16 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKM---- 76 (393) T ss_pred CCCcchhhhhhhhhhhhhcccchhhhccchhhhhhhhhchhHHHHHHhHhhcchhhhhhhhhhhhhhhcchhhHHH---- Confidence 88744 55556777755422222211 1111224456778888888888888888777776653332211110 Q ss_pred cccchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcc Q lcl|NC_012784. 76 EVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYV 155 (415) Q Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~ 155 (415) ..--+.......++.-...+...+.....|... ....+.+.++....+|..+.-.|-..+.++.+++... T Consensus 77 ----t~~iesq~A~~eF~~vL~~N~G~S~~k~AW~A~------L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vf 146 (393) T protein:vir:16 77 ----TNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAK------LAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVF 146 (393) T ss_pred ----HHHHhhHHHHHHHHHHHhccCCchhhhhhhhhh------HhhcCcceeccchhccHHHHHHHHHhhhccCcceeee Confidence 011122223333444444433333222222221 1222344556667889999999999999999998754 Q ss_pred eeEEccCCceeEEEEeecCC-cccccccccccccccccccceeeEeeeeeEEEeehhhH-HHHhc---chHHHHHHHHHH Q lcl|NC_012784. 156 TVKRVTNGSGKYPVVRQSEV-AALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISR-EAIED---AKVNVLQELKLW 230 (415) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~-~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~-e~l~d---s~~~l~~~l~~~ 230 (415) .+.+.+ .|.+.+.... ..+.-+..|.++.+. .|..+..+..-.++|+..|- ++..+ +-..+..|++.+ T Consensus 147 HVT~~~----~~~V~~s~~s~~eAq~HkdGqTK~eq---a~~~~~~Tl~~~~VY~~~S~Ae~~K~~~~sYsel~N~i~~E 219 (393) T protein:vir:16 147 HVTNVG----ALLVSRSFDSANEAQVHKDGQTKTEQ---AATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAE 219 (393) T ss_pred eeccch----hhhHHhhhhhhhhhhhhccCCccccc---eeeeeeechhHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHH Confidence 433332 3333222222 245555666666543 34455555555555665553 23333 333468999999 Q ss_pred HHHHHH-HHHHHHHhhcccccccccccc--cc---ccccccccccchhhH-HHHHHHHHHhhhhccCCCEEEEcHHH-HH Q lcl|NC_012784. 231 MARTIA-ATRNKAIIDVITKGSTGSTSS--GF---EKEGKKLEVKKAKSL-DDIKDAINLNVKPNYEHNVAIVSQTM-FA 302 (415) Q Consensus 231 la~~~~-~~~d~~il~g~g~~~~~~~~~--~~---~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~v~~~~~-~~ 302 (415) |+.++. +..|.+++-|+|+++-...-. .. ........+.+...+ |.+..+..-..+...+ -.+++...+ .+ T Consensus 220 LtQ~~vnk~Vd~AlV~GDG~N~f~~~DK~advK~I~k~Ttkaksagktpfadaieeavdfvrptagr-rylivktedrka 298 (393) T protein:vir:16 220 LTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGR-RYLIVKTEDRKA 298 (393) T ss_pred HHHHHHHHHHHhhhheecCCCCccchhhHHHHHHHHHHhhhhhhcCCCchhHHHHHHHhhhccCCCc-eEEEEeccchHH Confidence 999999 888999999999876322111 00 000111112222333 3333333332222222 234444444 34 Q ss_pred HHHHhhccCCccc--ccCcccCCCCceeccee-eEEeccccccccCCceEEEechhhcEEEEeecceE-EEEeecccCce Q lcl|NC_012784. 303 KLDKMKDKLGNYL--IQPDVKEKTQQRLLGAK-IEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQ-ASWTDYMHFGE 378 (415) Q Consensus 303 ~l~~lkd~~G~~l--~~~~~~~~~~~~l~G~p-V~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~-i~~~~~~~~~~ 378 (415) .|..|+-++.+-- +..+-+.- ..--|.. +++.. |+..-++-++-|-+ |.+ +-++++ +......++.. T Consensus 299 lldelrqatananvriknddtei--asevgvdeiivyt----gskalkptvlvdqk--yhi-dmqdltkvdafewktnsn 369 (393) T protein:vir:16 299 LLDELRQATANANVRIKNDDTEI--ASEVGVDEIIVYT----GSKALKPTVLVDQK--YHI-DMQDLTKVDAFEWKTNSN 369 (393) T ss_pred HHHHHHhhhccCceeeeccchhh--hhhcCcceeeeee----ccccccceeeeccc--ccc-chhhhhhhhhheeccCCc Confidence 4555554332211 11111100 0001111 11111 11111111222211 111 112222 11112222322 Q ss_pred EEEEEEEeccEEeccccEEEEEee Q lcl|NC_012784. 379 CLMIAVRQDCRILDYKSAIVIEYD 402 (415) Q Consensus 379 ~~~~~~r~d~~v~~p~a~~~~~~t 402 (415) -+.++..-.|-|-.-.|-+.++++ T Consensus 370 milvetltsghvetynagavitvs 393 (393) T protein:vir:16 370 MILVETLTSGHVETYNAGAVITVS 393 (393) T ss_pred eEEEeecccCcceeeccceeEeeC Confidence 233333444444444455555555 No 173 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=97.98 E-value=2.8e-06 Score=51.06 Aligned_cols=272 Identities=11% Similarity=0.067 Sum_probs=123.9 Q ss_pred ccceeecchhHHhHHHHHHhhhhhhhhcceeE---Ec---cCCceeEEEEeecCCcccccc-----cccccccccccccc Q lcl|NC_012784. 127 DSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK---RV---TNGSGKYPVVRQSEVAALEKV-----EELEENPELAVKPF 195 (415) Q Consensus 127 ~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~---~~---~~~~~~~~~~~~~~~~~a~~v-----~Eg~~~~~~~~~~f 195 (415) .....++|+.++..+++.+++..++..++..- .. .+.+.+++.+ .. ....+. +++..+. ..+.+- T Consensus 1 Ma~~~~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~--~~-~~~~~~~~~~~~~~~~~~-~~~~~~ 76 (392) T protein:vir:99 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVP--AP-SRGHTRKLRGAGAERNLT-VSDFTE 76 (392) T ss_pred CccccccHHHHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeec--cc-ccceeeeccccccCCccc-cccccc Confidence 22344789999999999999998887776421 11 1223334333 22 222222 2233222 223333 Q ss_pred eeeEeee-eeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhhH Q lcl|NC_012784. 196 FQLAYDI-NTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSL 274 (415) Q Consensus 196 ~~v~~~~-~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 274 (415) +.+.+.. +..+.-+.|+++-......++...+.++..++++.++|..++.-... .+... ............+ T Consensus 77 ~~~~~~id~~k~~~~~i~d~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~~~-a~~~~------~~~~~~~~~~~~~ 149 (392) T protein:vir:99 77 DSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVG-APYEA------AGAVHEVAPDEFF 149 (392) T ss_pred ceEEEEEeeeeecceeechHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhc-ccccc------cccccccChhhhH Confidence 4555555 22334455666655555567888888999999999999987753221 11111 0111122234567 Q ss_pred HHHHHHHHHhhhhccCC-CEEEEcHHHHHHHHHhhc-cCCccccc---CcccCCCCceecceeeEEeccccccccCCceE Q lcl|NC_012784. 275 DDIKDAINLNVKPNYEH-NVAIVSQTMFAKLDKMKD-KLGNYLIQ---PDVKEKTQQRLLGAKIEILPDEVLGQKGNNTL 349 (415) Q Consensus 275 ~~~~~~~~~~~~~~~~~-~~~v~~~~~~~~l~~lkd-~~G~~l~~---~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~ 349 (415) +.++++...|....... -.++++|..+..|.+... .+-.+.-. .....+..+++.|++|+.++++|.... T Consensus 150 ~~i~~a~~~L~~~~vP~~R~~vv~p~~~~~l~~~~~~~~~~~~g~~~~~~l~~G~vg~i~G~~v~~s~~~~~~t~----- 224 (392) T protein:vir:99 150 KGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDA----- 224 (392) T ss_pred HHHHHHHHHHhhcCCCCCCEEEEcHHHHHHHhcccceeecccccchhhhhhhcceeeeeeeeEEEeecccccccc----- Confidence 88888887776655443 368889998888774310 00011100 113355667899999999998876532 Q ss_pred EEechhhcEEEEee-----------------cceEEEEe-ec-ccCce------EEEEEE----EeccEEeccccEEEE- Q lcl|NC_012784. 350 IIGNLKDAIVLFDR-----------------SQYQASWT-DY-MHFGE------CLMIAV----RQDCRILDYKSAIVI- 399 (415) Q Consensus 350 ~~gd~~~~~~~~~~-----------------~~~~i~~~-~~-~~~~~------~~~~~~----r~d~~v~~p~a~~~~- 399 (415) +.+..+. .....+ ..+...+. ++ ..+.. .+.+.. ..+........+... T Consensus 225 ~a~~~~a-~~~at~a~v~~~~~~~~~s~s~~~~v~~~~~~~~~~t~~s~~~~v~~~~g~~~v~~~~~~~~~~~~~~~~~~ 303 (392) T protein:vir:99 225 YLYHPTA-FIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIP 303 (392) T ss_pred eeeeccc-cccccccccccccccceeEEecccceecceeecccceeeccccccceeEEEEEEeeccccceeeeeeeeeec Confidence 1111000 000000 00111110 00 00000 000000 000001110000000 Q ss_pred -EeecCCCCcccccccC Q lcl|NC_012784. 400 -EYDDSERGEGDLGLEA 415 (415) Q Consensus 400 -~~t~~~~~~~~~~~~~ 415 (415) .++..+........+- T Consensus 304 ~~v~v~~v~~~~~~~~~ 320 (392) T protein:vir:99 304 GSIEVAPEAGANATITA 320 (392) T ss_pred ceeeeeeeecccceeEe Confidence 0000010111110000 No 174 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=97.77 E-value=1.8e-05 Score=46.59 Aligned_cols=396 Identities=12% Similarity=0.071 Sum_probs=108.7 Q ss_pred CChHHHHH-HHHHHHHHHHHHHHH---HHHHhhch--HHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_012784. 1 MKTKEELQ-SEISDIKRQIDLKVK---YATRALNN--DELEKAE------KLEQEITDLRSQIQEKQEELDKLKEKDGTS 68 (415) Q Consensus 1 Mk~~~el~-~~l~~l~~~~~~~~~---~~~~~~~e--~~~~~~~------~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~ 68 (415) |-..+.+. ++++++..++....+ ++++...+ ...++.. .+.++++++++++++++++.+.+.+..... T Consensus 1 ~~~~~~~l~~~~~~~~~~l~el~e~~~~l~k~~~el~~~l~ea~~~ee~~~~ee~i~~l~~~~~el~e~~~~l~~ei~~l 80 (466) T protein:vir:80 1 MALRQLMLAKKIEQRKAALAELLEQEKALQKRSEELEAAIDEANTDEEIAVVEDEINKLEGEKTELEEKKSKLEGEIKEL 80 (466) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 54443322 233333333332222 22221111 1111111 111233444444444444433333322222 Q ss_pred hhccccccccchhhh-hhHHHHHHHHHHHHHhh-hhhHHHHHHHHHHhhh-----hhhh----hcccccccceee-cchh Q lcl|NC_012784. 69 ENNQQSVEVNEARTY-RNQANINDLGISIQNTK-VTSQEVRDFTEYLETR-----NDIQ----GGSLKTDSGFVV-IPEE 136 (415) Q Consensus 69 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~-----~~~~----~~~~~~~~~~~~-vP~~ 136 (415) +.............. ................. ......+......+.. .... .........+.. -... T Consensus 81 e~el~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 160 (466) T protein:vir:80 81 ENELEQLNNKEPKNNSEPAQVSGARTQQFVGGETRMKGFFRNMPYEQRAALIARSEVKEFLAQVRTLAQQKRAVSGAELT 160 (466) T ss_pred HHHHHHHHHhhhccCchhHHHHhhhhhHHhhHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhcccccc Confidence 221111111111000 00000100000000000 0000000000000000 0000 000000000000 0000 Q ss_pred HHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCcccccccc-----cccccccccccceeeEeeeeeEEEeehh Q lcl|NC_012784. 137 IVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEE-----LEENPELAVKPFFQLAYDINTHRGYFRI 211 (415) Q Consensus 137 ~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E-----g~~~~~~~~~~f~~v~~~~~k~a~~~~i 211 (415) +=..+...+.+. ++. ..++... ..+....+ ...+... ..-..+.....-...++..-.+..+-.- T Consensus 161 vP~~~~~~i~~~--l~~---~~~l~~~---~~v~~~~g--~~~~~~~~~~~~a~wv~E~~~~~~~~~~f~~i~~~~~k~~ 230 (466) T protein:vir:80 161 IPDVMLELLRDN--MHR---YSKLISK---VRLRPLKG--TARQNIAGAIPEGVWTEAVANLNELSLSFSQIEVDGYKVG 230 (466) T ss_pred ccHHHHHHHHHh--hhh---hhhhhhh---eeeeecCc--eeEeeeecCCcceeecccccccccccccccceeecceeee Confidence 111122222211 111 1112111 11111111 1122211 1111122111111222222222221111 Q ss_pred hHHHHhcchHHHH-HHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhh-----HHHHHHHHHHhh Q lcl|NC_012784. 212 SREAIEDAKVNVL-QELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKS-----LDDIKDAINLNV 285 (415) Q Consensus 212 S~e~l~ds~~~l~-~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~ 285 (415) ..=-+.+.-.+.. ..|...|...++.++...+=...=.|.+.+.+.|..+............ .+++........ T Consensus 231 ~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~ail~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (466) T protein:vir:80 231 GFIPIPNSTLEDSDLNLADEILDAIGQAIGFALDKAILYGTGTKMPVGIVTRLAQTTQPPNWGTKAPAWTNLSTTNLLKI 310 (466) T ss_pred eehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhheeeccCCCCcceeeecccccccccccccccccccccchhhhhhh Confidence 1111222222222 2366667777777777665555445555555555444333222222211 112221111122 Q ss_pred hhccCCCEEEEcHHHHH-HHHHhhccCCcccccCcccCCCCceecceeeEEecc-ccccccCCceEEEechhhcEEEEee Q lcl|NC_012784. 286 KPNYEHNVAIVSQTMFA-KLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPD-EVLGQKGNNTLIIGNLKDAIVLFDR 363 (415) Q Consensus 286 ~~~~~~~~~v~~~~~~~-~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~-~~~~~~~~~~~~~gd~~~~~~~~~~ 363 (415) ..+..++.+.+++..+. .+.+.++.+|.++|..+... ...+.|..+..... .+....+....++|-. +...+. T Consensus 311 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~--~~~l~~~~~~~~~~g~~~~~~~~~~~i~G~p---vv~s~~ 385 (466) T protein:vir:80 311 DPTGKSAEEFFSELVLKLSKARANYSNGMKFWAMSSNT--HAVLMSKAITFNSAGALVASLNNTMPIVGGD---IVILDF 385 (466) T ss_pred hhhccchhhHHHHHHHHHHhhhccccCCceeEEecchh--HHHhhcccccccCCccccccCCCcccccccc---eeecCc Confidence 34455566666665532 34566788888888654322 22344444332211 1111111111122310 110000 Q ss_pred c-ceEEEEeecccCceEEEEEEEeccEEecc---------ccEEEEE-eecCCCCcccccccC Q lcl|NC_012784. 364 S-QYQASWTDYMHFGECLMIAVRQDCRILDY---------KSAIVIE-YDDSERGEGDLGLEA 415 (415) Q Consensus 364 ~-~~~i~~~~~~~~~~~~~~~~r~d~~v~~p---------~a~~~~~-~t~~~~~~~~~~~~~ 415 (415) . .-.+-+.++..+ ....|-+..+... -.|.... +...+.-...|+..- T Consensus 386 ~~~~~~~~g~~~~y----~i~~r~~~~i~~~~~~~f~~d~~~~r~~~r~dg~~~~~~afv~~~ 444 (466) T protein:vir:80 386 IPDNDIIGGYGSLY----LLAERADIKLAQSEHVRFIEDQTVFKGTARYDGKPVFGEGFVAVN 444 (466) T ss_pred cCccceeeeccccE----EEEeecceEEEechhhhhhcCcEEEEEEEEEccEEeccCceEEEE Confidence 0 001112222221 1222322222211 1111111 111111112222211 No 175 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=97.75 E-value=2.2e-05 Score=46.08 Aligned_cols=279 Identities=10% Similarity=-0.006 Sum_probs=125.4 Q ss_pred hhhcccccccceeecchhHHhHHHHHHhhhhhhhhccee---------EEccCCceeEEEEeecCCccccccccccc--- Q lcl|NC_012784. 119 IQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTV---------KRVTNGSGKYPVVRQSEVAALEKVEELEE--- 186 (415) Q Consensus 119 ~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~---------~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~--- 186 (415) +......+.-..+.+|+.+..-+.+...+.+.+++=.-+ ....+....+|+....++.. ..+.+... T Consensus 1 M~~~~~~T~l~Dii~pEvF~~Yv~~~~~e~~~l~qSGiv~~d~~l~~~~~~gG~~v~iPf~~~L~g~~-~n~~~d~~~~~ 79 (367) T protein:vir:80 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLE-PNYGSDNPNVE 79 (367) T ss_pred CcchhhhhhhhhccchhhhhHHHhhhhhhhhhhhhcceeecCHHHHHHhhcCCCEEEeeeeccCCCCc-cccCCCCCccc Confidence 000000111122345554444444433333332221000 01233334455543333322 22222211 Q ss_pred ccccccc-cceeeEeeeee--EEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhh---ccccc---ccc---- Q lcl|NC_012784. 187 NPELAVK-PFFQLAYDINT--HRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIID---VITKG---STG---- 253 (415) Q Consensus 187 ~~~~~~~-~f~~v~~~~~k--~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~---g~g~~---~~~---- 253 (415) .+ .++. +..++-...+. --....++..+-- + |....|.++++.-..+...+.+|. |.-.. ... T Consensus 80 ~t-~~kittg~~~a~v~~r~kaw~~~Dla~~lsG-~--dpm~~Ia~qva~yW~r~~q~~Lla~L~Gvf~~~~a~~~~~~~ 155 (367) T protein:vir:80 80 AP-IDGLGSGEMKTTKTWLNKAYGAMDLTAELAG-S--NPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIK 155 (367) T ss_pred cc-ccccccchheeeeehhcccchhhhHHHHhhC-c--hHHHHHHHHHHHHhhhhhHHHHHHHHHHhhccccccchhhhh Confidence 11 1112 22222222222 2223355555543 2 568888899887777777665554 21110 000 Q ss_pred ----------c-cccccccc-cccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhh------ccCCccc Q lcl|NC_012784. 254 ----------S-TSSGFEKE-GKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMK------DKLGNYL 315 (415) Q Consensus 254 ----------~-~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lk------d~~G~~l 315 (415) . ........ ..+.......+.+.+.++...+.+....=++++||+..+..|++++ +++| T Consensus 156 ~~~~~~a~~~~~~~~~~~Dis~~t~~~~~~~s~~~~~~A~~~lGD~~~~l~~i~mHS~V~~~L~~~~li~~i~~sd~--- 232 (367) T protein:vir:80 156 TRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKG--- 232 (367) T ss_pred hhhccccccccccCceeeeeeccCCCccceecHHHHHHHHHHhccccccccEEEEchHHHHHHHhccccccccCCCC--- Confidence 0 00000011 1111123445678888998888887777899999999999998763 3333 Q ss_pred ccCcccCCCCceecceeeEEecccccccc--CCc--eEEEechhhcEEEEeecc-eEEEEeecc--c--CceEEEEEEEe Q lcl|NC_012784. 316 IQPDVKEKTQQRLLGAKIEILPDEVLGQK--GNN--TLIIGNLKDAIVLFDRSQ-YQASWTDYM--H--FGECLMIAVRQ 386 (415) Q Consensus 316 ~~~~~~~~~~~~l~G~pV~~~~~~~~~~~--~~~--~~~~gd~~~~~~~~~~~~-~~i~~~~~~--~--~~~~~~~~~r~ 386 (415) ...-.+++|++|++++.||.... ..+ +.+||. .++-..+... .-+++.++. . ..+.....-|. T Consensus 233 ------~~~i~ty~G~~VIvDD~~Pv~~~~a~~~yttYlfg~--GAi~~~~~~~~~~~E~~Rd~~~~~~gG~d~L~~Rr~ 304 (367) T protein:vir:80 233 ------QLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG--AAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKE 304 (367) T ss_pred ------ccccceecceeEEEeCCCcccccCCCceEEEEEEec--ceeeecccCCccceecccchhhhcCCceEEEEeeee Confidence 22346899999999999996432 222 346663 2222222111 223443332 1 12222222222 Q ss_pred ccEEeccccEEEEEee--cC--------------CCCcccccccC Q lcl|NC_012784. 387 DCRILDYKSAIVIEYD--DS--------------ERGEGDLGLEA 415 (415) Q Consensus 387 d~~v~~p~a~~~~~~t--~~--------------~~~~~~~~~~~ 415 (415) .+++|..|...+-+ ++ .+.-.+|-.-+ T Consensus 305 --~~~hP~G~s~~~~~v~~~~~~~~~~~~~~~~~sPt~~eLa~~~ 347 (367) T protein:vir:80 305 --WIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLANPD 347 (367) T ss_pred --EEeecceeeecccccccccccccccccccccCCCChHHhcCCc Confidence 67788877765432 11 11222222222 No 176 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=97.58 E-value=1.8e-05 Score=46.58 Aligned_cols=293 Identities=10% Similarity=0.015 Sum_probs=153.2 Q ss_pred Hhhhhhhhhcc--cccccceeecch-hHHhHHHHHHhhhhhhhhcceeEEccCCcee-EEEEeecCCcccc-cccccc-- Q lcl|NC_012784. 113 LETRNDIQGGS--LKTDSGFVVIPE-EIVTDILKLKEVEFNLDKYVTVKRVTNGSGK-YPVVRQSEVAALE-KVEELE-- 185 (415) Q Consensus 113 ~~~~~~~~~~~--~~~~~~~~~vP~-~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~-~~~~~~~~~~~a~-~v~Eg~-- 185 (415) +-.-.....+. .+..+.+.-+-+ .+....+...++...+.+++...+++..++. +.+.+...-+.+- ...||- T Consensus 1 ~~~~~a~~~~~~~s~~g~~~~~~~t~y~~~k~L~~Aa~~lv~~~fA~~~piPkn~GkTIk~r~y~pl~~~~~pl~eGv~a 80 (401) T protein:vir:95 1 MLNYNAPTDGQKSSIDGANSDQMQTFFWLKKAIITARKEQYFMPLASVTNMPKHYGKTIKVYEYVPLLDDRNINDQGIDA 80 (401) T ss_pred CCccCCCcccccccccccccceeeehhhHHHHHhhhhhhhhhhhcccccccccccCCeEEEEecccccccccchhcCCCc Confidence 00000011111 111122222223 2334444455556788888988888876653 3333333322211 122332 Q ss_pred ---cc----------------------------cccccccceeeEeeeeeEEEeehhhHHHHh-cchHHHHHHHHHHHHH Q lcl|NC_012784. 186 ---EN----------------------------PELAVKPFFQLAYDINTHRGYFRISREAIE-DAKVNVLQELKLWMAR 233 (415) Q Consensus 186 ---~~----------------------------~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~-ds~~~l~~~l~~~la~ 233 (415) +. +.--..+-.++....++++.|+.+|+++.. +++.++...|..+|.. T Consensus 81 ~G~~~~~g~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~~D~~l~~h~s~ell~ 160 (401) T protein:vir:95 81 SGATIVNGNLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFDSDDGLMEHLSRELMN 160 (401) T ss_pred ccccccCccccccccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhhcchHHHHHHHHHHhh Confidence 11 100012234577789999999999998775 4556677665444433 Q ss_pred HH----HHHHHHHHhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhc------------------cCC Q lcl|NC_012784. 234 TI----AATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPN------------------YEH 291 (415) Q Consensus 234 ~~----~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------------~~~ 291 (415) .- ...+-+.+|++-++--..+..+...+......+.+..+++++..+...|..+- ..+ T Consensus 161 g~~~~t~d~i~~dll~ag~~viyAg~ats~At~~~~~~~~t~vt~~~l~rl~~~L~~nRapk~t~~i~~s~~~dTk~i~~ 240 (401) T protein:vir:95 161 GATQITEAVLQKDLLAAAGTVLYAGAATSDATITGEGSTPSVVSYKNLMRLDQILTENRTPTQTTIITGSRMIDTKVIGA 240 (401) T ss_pred hhhhhHHHHHHHHHHhhcCeeecCCccceeeeccccccccceechhHHHHHHHHHHhcccccchhhhhhhhccCcccccc Confidence 32 44445567755433222233333333334445556667788777666655311 111 Q ss_pred C-EEEEcHHHHHHHHHhhccCCcccccC--------cccCCCCceecceeeEEecccc--------cccc---------- Q lcl|NC_012784. 292 N-VAIVSQTMFAKLDKMKDKLGNYLIQP--------DVKEKTQQRLLGAKIEILPDEV--------LGQK---------- 344 (415) Q Consensus 292 ~-~~v~~~~~~~~l~~lkd~~G~~l~~~--------~~~~~~~~~l~G~pV~~~~~~~--------~~~~---------- 344 (415) + +-++|+..-..|+.++|-.|.|-|.+ .+..+..+.|.++.+++++.|. .+.. T Consensus 241 s~va~~h~~L~~di~a~~D~~~~~~fi~v~kYa~~~~i~~gEiG~i~~vR~i~~p~~~~w~~ag~~a~~~~~~y~~~~~~ 320 (401) T protein:vir:95 241 TRVMYVGSELVPELKAMKDLFGNKAFIETQHYADAGTIMNGEVGSIDKFRIIQVPEMLHWAGAGAQATGANPGYRTSMVS 320 (401) T ss_pred ceEEEEecCchhHHHHHHHhcCCCCceehhhcCCccccccccccccCceeEEecccceeecCCccccccccccccccccc Confidence 2 36779999999999999888877753 2344556778888888887742 1111 Q ss_pred -C---Cc--eEEEechhhcEEEEeecce----EEEEe---------ecccCceEEEEE-EEeccEEeccccEEEEEeecC Q lcl|NC_012784. 345 -G---NN--TLIIGNLKDAIVLFDRSQY----QASWT---------DYMHFGECLMIA-VRQDCRILDYKSAIVIEYDDS 404 (415) Q Consensus 345 -~---~~--~~~~gd~~~~~~~~~~~~~----~i~~~---------~~~~~~~~~~~~-~r~d~~v~~p~a~~~~~~t~~ 404 (415) + +. .+++|+-.-+...+...+. .+-+. ++..+++++..+ +.+++.+++++-+++++-.++ T Consensus 321 ~gg~~dVyp~lV~G~dAf~~~~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQ~g~vgwK~~~a~~vL~~e~m~~ies~a~ 400 (401) T protein:vir:95 321 GQEHYDVYPMLVVGDDSFTSIGFQTDGKSLKFTVMTKMPGKETADRNDPYGETGFSSIKWYYGILVKRPERLALIKTVAP 400 (401) T ss_pred CCCcceeeeeeEEccccceecccccCCccccceeEeecCCcCCCCCCCcccceehhhhhhhhhhheeccceeEEEEeecC Confidence 0 01 1355642211222222221 22111 122345555555 678889999999999986665 Q ss_pred C Q lcl|NC_012784. 405 E 405 (415) Q Consensus 405 ~ 405 (415) . T Consensus 401 ~ 401 (401) T protein:vir:95 401 L 401 (401) T ss_pred C Confidence 5 No 177 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=97.46 E-value=1.7e-05 Score=46.68 Aligned_cols=194 Identities=14% Similarity=0.068 Sum_probs=97.6 Q ss_pred EEEeehhhHHHHh-----cchHHHHHHHHHHHHHHHHHHHHHHHhhccccc----cccc-cccccc-cccccccccchhh Q lcl|NC_012784. 205 HRGYFRISREAIE-----DAKVNVLQELKLWMARTIAATRNKAIIDVITKG----STGS-TSSGFE-KEGKKLEVKKAKS 273 (415) Q Consensus 205 ~a~~~~iS~e~l~-----ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~----~~~~-~~~~~~-~~~~~~~~~~~~~ 273 (415) +- -.-+|.-++. ++..++.+...+++.+++++..|+.++.-.-.+ .+.. ...+.. ......+...... T Consensus 1 iD-~lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~~g~~~~~~a~~t~~~~~l 79 (221) T protein:vir:17 1 MD-DLLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQDGGFSVNIGAGNTNNAQAI 79 (221) T ss_pred CC-cchhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccccccCcceeccccccCCHHHH Confidence 11 1233444433 366789999999999999999999876422111 1100 011111 1111111223344 Q ss_pred HHHHHHHHHHhhhhccCCC--EEEEcHHHHHHHHHhhcc-CCcccccC---cccCC-CCceecceeeEEeccccccccCC Q lcl|NC_012784. 274 LDDIKDAINLNVKPNYEHN--VAIVSQTMFAKLDKMKDK-LGNYLIQP---DVKEK-TQQRLLGAKIEILPDEVLGQKGN 346 (415) Q Consensus 274 ~~~~~~~~~~~~~~~~~~~--~~v~~~~~~~~l~~lkd~-~G~~l~~~---~~~~~-~~~~l~G~pV~~~~~~~~~~~~~ 346 (415) ++.++++..++-..+.... .++++|..+..|.+-.|. -.+..+.. ....+ ....+.|++|+.++++|.....+ T Consensus 80 ~dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~~~~g~~i~~v~G~~V~~SnnlP~~~gt~ 159 (221) T protein:vir:17 80 VDGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNREIGNTQGDMNTGKGLYVNAGIRIYKSNVLASLYGTN 159 (221) T ss_pred HHHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeeeecccccccccccceeeeecCcEEEEeccCCcccccc Confidence 6778888888777766633 466699877776542211 11111211 12222 34579999999999999765544 Q ss_pred ceEEEechhhcEEEEeecceEEEEeecccCceEEEEEEEeccEEeccccEEEEEeecCCCCcccccccC Q lcl|NC_012784. 347 NTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) Q Consensus 347 ~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~~~~~~~~~~~~ 415 (415) ....-|+|.. .......+...++ -.=+.+++|+|+--+++=.+|.-- -++..- T Consensus 160 ~~~~ag~~~~--~~~~~~~yr~~fs-------------~~~glv~~~~Avgtvkl~~~~~~~-~~~~~~ 212 (221) T protein:vir:17 160 LVTDPGDATT--SGENNGSYRPAIT-------------DRAGLVFHKEAADTVEVLLPPSRP-PLVISM 212 (221) T ss_pred cccCCccccc--ccccccccccccc-------------ceEEEEEcchheeeeeeecCCCCC-ceeeee Confidence 3333343321 1111111111111 111556677777777765443221 122111 No 178 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=97.44 E-value=6.6e-05 Score=43.51 Aligned_cols=292 Identities=13% Similarity=0.048 Sum_probs=127.7 Q ss_pred hhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhh--cce--eEEccCCceeEEEEeecCCcc Q lcl|NC_012784. 102 TSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDK--YVT--VKRVTNGSGKYPVVRQSEVAA 177 (415) Q Consensus 102 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~--~~~--~~~~~~~~~~~~~~~~~~~~~ 177 (415) ...+.......+ ..........+....-...-+.++. +++.......+-. .++ ..-. +..++.|++... .. T Consensus 1 ~~~~~~~~~~~~-~~~~~~~~~~~~~~nt~~l~~k~~~-~LD~~~~~~~~s~~~~~N~~~e~~--gg~tVkIp~i~~-~g 75 (319) T protein:vir:94 1 MNKTIKNATGML-KLNLQHFANKSVEPGQTLLKNKHVG-ILERVTAVNAYSTPALISNDAIFM--EGRSFTVMKGDT-TE 75 (319) T ss_pred CCccccccccee-EeehhhhhccCCCcchHHHHHHHHH-HHHHHHHHhhhhhhcccCcceEec--cCcEEEEeeecc-cc Confidence 000000000000 0000000111111111112222333 2222222222111 111 2222 233566666554 33 Q ss_pred ccccccccccccccccc--ceeeEeeeeeEEEeehhhHHHH-hcchHHH--HHHHHHHHHHHHHHHHHHHHhhccccccc Q lcl|NC_012784. 178 LEKVEELEENPELAVKP--FFQLAYDINTHRGYFRISREAI-EDAKVNV--LQELKLWMARTIAATRNKAIIDVITKGST 252 (415) Q Consensus 178 a~~v~Eg~~~~~~~~~~--f~~v~~~~~k~a~~~~iS~e~l-~ds~~~l--~~~l~~~la~~~~~~~d~~il~g~g~~~~ 252 (415) .....-++... .+.++ ....++.-.+...+. | +.+- ..+...+ ...+.+.....+.-.+|...+.....+.. T Consensus 76 l~DY~R~~g~~-~g~vt~~~~t~tidqdR~~~F~-V-D~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~ 152 (319) T protein:vir:94 76 LKDYKRNATNE-FDHPKIEETTYFLDQEKYWGRF-V-DALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKA 152 (319) T ss_pred cccccCCCCcc-cCCcccceeEEEeecccccccc-c-chhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcc Confidence 33333222111 11233 334444444443332 1 1111 1222222 22344445555555667654443322211 Q ss_pred cccccccccccccccccchhhHHHHHHHHHHhhhhccC-CCEEEEcHHHHHHHHHhhccCCcc-cccCcccCCCCceecc Q lcl|NC_012784. 253 GSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYE-HNVAIVSQTMFAKLDKMKDKLGNY-LIQPDVKEKTQQRLLG 330 (415) Q Consensus 253 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~v~~~~~~~~l~~lkd~~G~~-l~~~~~~~~~~~~l~G 330 (415) . ......+....|+.+.+++..+-.+... +=.++++|..+..|.+-..-.... +.......+..+.|.| T Consensus 153 ~---------~~~~~~t~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG 223 (319) T protein:vir:94 153 K---------HLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDG 223 (319) T ss_pred c---------ccccccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceeeeeceeecC Confidence 0 1111233455688889988888776654 346889999988886533211111 1122334566678999 Q ss_pred eeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEee--cccCceEEEEEEEeccEEeccc--cEEEEEeecCCC Q lcl|NC_012784. 331 AKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD--YMHFGECLMIAVRQDCRILDYK--SAIVIEYDDSER 406 (415) Q Consensus 331 ~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~--~~~~~~~~~~~~r~d~~v~~p~--a~~~~~~t~~~~ 406 (415) .||+.+++. ...+..+++|... ++ ....+--.++... ...+...++.-.++|..|++|+ ++....=++++. T Consensus 224 ~~Vi~vps~---~~k~in~i~~h~~-A~-~~~~k~~~~~~~~p~~~~~a~~v~gr~y~d~~V~~~k~~~Iy~~~~~~~~~ 298 (319) T protein:vir:94 224 FVIVKVPTK---LLQGLQAIAVVGE-VL-ASPIQADLAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVAT 298 (319) T ss_pred eEEEEeccc---ccccceEEEEcCC-ee-eeeeeeeeeeccCCCccccceeeeeeeeeeeEEeccccceEEEeecCCccc Confidence 999987542 2233456777654 23 2222222233222 3344555677778999999997 444433445666 Q ss_pred CcccccccC Q lcl|NC_012784. 407 GEGDLGLEA 415 (415) Q Consensus 407 ~~~~~~~~~ 415 (415) +..++...| T Consensus 299 ~~~~~~~~~ 307 (319) T protein:vir:94 299 KRDGVDAHA 307 (319) T ss_pred CCCcccccc Confidence 666666666 No 179 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=97.44 E-value=6.6e-05 Score=43.51 Aligned_cols=292 Identities=13% Similarity=0.048 Sum_probs=127.7 Q ss_pred hhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhh--cce--eEEccCCceeEEEEeecCCcc Q lcl|NC_012784. 102 TSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDK--YVT--VKRVTNGSGKYPVVRQSEVAA 177 (415) Q Consensus 102 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~--~~~--~~~~~~~~~~~~~~~~~~~~~ 177 (415) ...+.......+ ..........+....-...-+.++. +++.......+-. .++ ..-. +..++.|++... .. T Consensus 1 ~~~~~~~~~~~~-~~~~~~~~~~~~~~nt~~l~~k~~~-~LD~~~~~~~~s~~~~~N~~~e~~--gg~tVkIp~i~~-~g 75 (319) T protein:vir:97 1 MNKTIKNATGML-KLNLQHFANKSVEPGQTLLKNKHVG-ILERVTAVNAYSTPALISNDAIFM--EGRSFTVMKGDT-TE 75 (319) T ss_pred CCccccccccee-EeehhhhhccCCCcchHHHHHHHHH-HHHHHHHHhhhhhhcccCcceEec--cCcEEEEeeecc-cc Confidence 000000000000 0000000111111111112222333 2222222222111 111 2222 233566666554 33 Q ss_pred ccccccccccccccccc--ceeeEeeeeeEEEeehhhHHHH-hcchHHH--HHHHHHHHHHHHHHHHHHHHhhccccccc Q lcl|NC_012784. 178 LEKVEELEENPELAVKP--FFQLAYDINTHRGYFRISREAI-EDAKVNV--LQELKLWMARTIAATRNKAIIDVITKGST 252 (415) Q Consensus 178 a~~v~Eg~~~~~~~~~~--f~~v~~~~~k~a~~~~iS~e~l-~ds~~~l--~~~l~~~la~~~~~~~d~~il~g~g~~~~ 252 (415) .....-++... .+.++ ....++.-.+...+. | +.+- ..+...+ ...+.+.....+.-.+|...+.....+.. T Consensus 76 l~DY~R~~g~~-~g~vt~~~~t~tidqdR~~~F~-V-D~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~ 152 (319) T protein:vir:97 76 LKDYKRNATNE-FDHPKIEETTYFLDQEKYWGRF-V-DALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKA 152 (319) T ss_pred cccccCCCCcc-cCCcccceeEEEeecccccccc-c-chhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcc Confidence 33333222111 11233 334444444443332 1 1111 1222222 22344445555555667654443322211 Q ss_pred cccccccccccccccccchhhHHHHHHHHHHhhhhccC-CCEEEEcHHHHHHHHHhhccCCcc-cccCcccCCCCceecc Q lcl|NC_012784. 253 GSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYE-HNVAIVSQTMFAKLDKMKDKLGNY-LIQPDVKEKTQQRLLG 330 (415) Q Consensus 253 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~v~~~~~~~~l~~lkd~~G~~-l~~~~~~~~~~~~l~G 330 (415) . ......+....|+.+.+++..+-.+... +=.++++|..+..|.+-..-.... +.......+..+.|.| T Consensus 153 ~---------~~~~~~t~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG 223 (319) T protein:vir:97 153 K---------HLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDG 223 (319) T ss_pred c---------ccccccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceeeeeceeecC Confidence 0 1111233455688889988888776654 346889999988886533211111 1122334566678999 Q ss_pred eeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEee--cccCceEEEEEEEeccEEeccc--cEEEEEeecCCC Q lcl|NC_012784. 331 AKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD--YMHFGECLMIAVRQDCRILDYK--SAIVIEYDDSER 406 (415) Q Consensus 331 ~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~--~~~~~~~~~~~~r~d~~v~~p~--a~~~~~~t~~~~ 406 (415) .||+.+++. ...+..+++|... ++ ....+--.++... ...+...++.-.++|..|++|+ ++....=++++. T Consensus 224 ~~Vi~vps~---~~k~in~i~~h~~-A~-~~~~k~~~~~~~~p~~~~~a~~v~gr~y~d~~V~~~k~~~Iy~~~~~~~~~ 298 (319) T protein:vir:97 224 FVIVKVPTK---LLQGLQAIAVVGE-VL-ASPIQADLAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVAT 298 (319) T ss_pred eEEEEeccc---ccccceEEEEcCC-ee-eeeeeeeeeeccCCCccccceeeeeeeeeeeEEeccccceEEEeecCCccc Confidence 999987542 2233456777654 23 2222222233222 3344555677778999999997 444433445666 Q ss_pred CcccccccC Q lcl|NC_012784. 407 GEGDLGLEA 415 (415) Q Consensus 407 ~~~~~~~~~ 415 (415) +..++...| T Consensus 299 ~~~~~~~~~ 307 (319) T protein:vir:97 299 KRDGVDAHA 307 (319) T ss_pred CCCcccccc Confidence 666666666 No 180 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=97.39 E-value=7.8e-05 Score=43.11 Aligned_cols=382 Identities=12% Similarity=0.066 Sum_probs=152.1 Q ss_pred CCh-HHHHHHHHH-----------------------------------HHHHHHHHHHHHHHHhhchHHHHHHHHHHHHH Q lcl|NC_012784. 1 MKT-KEELQSEIS-----------------------------------DIKRQIDLKVKYATRALNNDELEKAEKLEQEI 44 (415) Q Consensus 1 Mk~-~~el~~~l~-----------------------------------~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~ 44 (415) +|. .+.++..+. +...+..++...++..+..- ..++ T Consensus 185 ~~~~p~~~~~~~~~~~~~~~~v~d~EPa~~~~pvqAaAP~~De~airAq~~aeeraRi~~I~~l~a~F--------ggr~ 256 (652) T protein:vir:79 185 FKKMPDSIRNMITPPRNSAPRVQDDEPAASRTPVQAAAPVVDENSIRAQVLAEQKARVNGINDLFAMF--------GGRY 256 (652) T ss_pred hhhhHHHHHHHhcccccccccccccccccccccccccCCcCchhHHHHHHHHHHHHHHHHHHHHHHhh--------cccc Confidence 111 111221111 11111112222222211100 0001 Q ss_pred HHHHHHH--H---HHHHHHHHHHHHHhhhhhcc--cccc--ccchhhhhhHHHHHHHHHHHHH------hhhh---hHHH Q lcl|NC_012784. 45 TDLRSQI--Q---EKQEELDKLKEKDGTSENNQ--QSVE--VNEARTYRNQANINDLGISIQN------TKVT---SQEV 106 (415) Q Consensus 45 ~~l~~~i--~---~~~~~~~~~~~~~~~~~~~~--~~~~--~~~~~~~~~~~~~~~~~~~~~~------~~~~---~~~~ 106 (415) ..+..+. + ..++-.+.+-+......... ..+. ...................... ..+. ..+. T Consensus 257 ~~l~~~~l~d~~~s~e~ar~~il~~l~~~~~p~~~~~~~~~~~~~g~~~~d~~~~aL~~R~g~~~~~~~~~~~g~~L~el 336 (652) T protein:vir:79 257 QTLQAQCLADPECSLEQAREKLLNEMGRESTPSNKNTPAHIYAGNGNFVGDGIRQALMARAGFEKTERDNVYNGMTLREY 336 (652) T ss_pred chHHHHHhhccCCCHHHHHHHHHHHHHhhcCCCCCCcceeEeeccchhhHHHHHHHHHhhcCCcccccCccccCccHHHH Confidence 1111110 0 01111111111111111000 0000 0000011101111111111000 0000 0000 Q ss_pred HHHHHHHhhh-----hh----hhhcccccccceeecchhHHhHHHHHHhhh-----hhhhhcceeEEccCCceeEEEEee Q lcl|NC_012784. 107 RDFTEYLETR-----ND----IQGGSLKTDSGFVVIPEEIVTDILKLKEVE-----FNLDKYVTVKRVTNGSGKYPVVRQ 172 (415) Q Consensus 107 ~~~~~~~~~~-----~~----~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~-----~~l~~~~~~~~~~~~~~~~~~~~~ 172 (415) .+.+...+.. .. ......++++ .|.-+.+-+-+.+++. ...+.+++...++--. ....... T Consensus 337 Ar~~L~~~G~~~~~~~~~~~v~~A~~hsTsD----Fp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk-~~~~~~l 411 (652) T protein:vir:79 337 ARMSLTERGIGVSSYNPMQMVGAAFTHSTSD----FGNILLDVANKAILQGWEDAPETYEQWTRKGQLSDFK-IAHRVGM 411 (652) T ss_pred HHHHHHhhccCCCCCCHHHHHHHHhhcCcch----HHHHHHHHHHHHHHHHHhhhHHHHHHHhccCCCcccc-ccceeec Confidence 0001000000 00 0000111222 2333333222222211 1334445443332111 1123334 Q ss_pred cCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccccc- Q lcl|NC_012784. 173 SEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGS- 251 (415) Q Consensus 173 ~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~- 251 (415) .+.+....|.|+++.+.. ...=+..++...+++.++.||++++-.-+.++.+-|-..+.++.++.+++.+..-...+. T Consensus 412 g~~~~L~~V~E~gEyk~~-t~~e~~e~~~l~tyG~~~~iTRqaiINDDL~a~~~ip~~~g~aA~~~~~~~vy~~l~~Np~ 490 (652) T protein:vir:79 412 GGFSALRQVREGAEYKYV-TTGDKQATIALATYGELFSITRQAIINDDLNMLTDVPMKLGRAAKSTIADLVYAILTSNPK 490 (652) T ss_pred CCCCCccccCCCCcccee-eecCccceeeeecccCeeeeehheeeccchhHHHHHHHHHHHHHHHHHHHHHHHHHhcCcc Confidence 566788899999998753 333356789999999999999999877778899999999999999999986543322211 Q ss_pred ---cccccc-cccccccc-ccccchhhHHHHHHHHHHhhh----hccCCCEEEEcHHHHHHHHHhhccCCcccccCcccC Q lcl|NC_012784. 252 ---TGSTSS-GFEKEGKK-LEVKKAKSLDDIKDAINLNVK----PNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKE 322 (415) Q Consensus 252 ---~~~~~~-~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~ 322 (415) .+..+. +....+.. ..+.+..+++..+.++..-.. -+..|..|++.|.......++-.+.- +-..+... T Consensus 491 ~~~DGk~LF~hA~H~Nl~~~aa~~~~~l~~ar~aM~~Qk~g~~~l~i~P~~llvp~~le~~a~~ll~s~~--v~~a~~~~ 568 (652) T protein:vir:79 491 ISTDNVSLFDKAKHANVLESAAMDVASLDKARQLMRVQKEGERHLNIRPAFVLVPTAMESVANQVIRSSS--VKGADINA 568 (652) T ss_pred cccCCceeecccccccccccccCCHHHHHHHHHHHHHhccCCccccccccEEEecchhHHHHHHHhccCC--Cccccccc Confidence 111111 11111111 112223334444433333222 12346678888877666555432221 11111222 Q ss_pred CCCceecce-eeEEeccccccccCCceEEEechhh------cEEEEeecceEEEE-eecccCceEEEEEEEeccEEeccc Q lcl|NC_012784. 323 KTQQRLLGA-KIEILPDEVLGQKGNNTLIIGNLKD------AIVLFDRSQYQASW-TDYMHFGECLMIAVRQDCRILDYK 394 (415) Q Consensus 323 ~~~~~l~G~-pV~~~~~~~~~~~~~~~~~~gd~~~------~~~~~~~~~~~i~~-~~~~~~~~~~~~~~r~d~~v~~p~ 394 (415) +..+.+.|+ .|++.+.+...+ ...+++++-.. +|+.+. ++..++. ..|..+...+++..-|+.++++-. T Consensus 569 ~~~Np~~~~~~~i~eprL~~~s--~~~wylaa~~~~dtiev~yL~G~-~~P~ie~~~gf~~dG~~~kvrlD~G~~~iD~R 645 (652) T protein:vir:79 569 GIINPVKDFATVIAEPRLDDNS--QTTFYLAASKGSDTIEVAYLNGV-DTPYIDQMEGFSVDGVTTKVRIDAGVAPVDHR 645 (652) T ss_pred ccccccccccccccccccCCCC--cccEEEecCCCCCeEEEEEecCC-CCCeeeecCCCCcceEEEEEEEeccCceeecc Confidence 333445554 445444443211 12233433221 122222 2333332 335566667888888999999999 Q ss_pred cEEEEEe Q lcl|NC_012784. 395 SAIVIEY 401 (415) Q Consensus 395 a~~~~~~ 401 (415) ++++.+- T Consensus 646 G~~k~t~ 652 (652) T protein:vir:79 646 GLVKCTA 652 (652) T ss_pred ceeeecC Confidence 9887653 No 181 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=97.11 E-value=6.5e-05 Score=43.54 Aligned_cols=270 Identities=13% Similarity=0.060 Sum_probs=125.7 Q ss_pred cccceeecc--hhHHhHHHHHHhhhhhhhhcceeE---EccCCceeEEEEeecCCcccc--ccccc-cccccccccccee Q lcl|NC_012784. 126 TDSGFVVIP--EEIVTDILKLKEVEFNLDKYVTVK---RVTNGSGKYPVVRQSEVAALE--KVEEL-EENPELAVKPFFQ 197 (415) Q Consensus 126 ~~~~~~~vP--~~~~~~Ii~~~~~~~~l~~~~~~~---~~~~~~~~~~~~~~~~~~~a~--~v~Eg-~~~~~~~~~~f~~ 197 (415) -++.++++. +.+.+.|.+........+.++.+. +....+ +.+...+....+. |++-+ ..+|- -+..+++ T Consensus 1 ~~~lafl~~qL~~id~~vye~~~~~~~~~~lipv~t~~~~~~~~--~~~~~~d~~G~a~~~~i~~~a~dip~-vd~~~~~ 77 (304) T protein:vir:52 1 MSLLAYVKNGLTAVSKDIAETKYPEIVFPQFVYVDQQTAVGITE--KLHYGADEHGSLDDGLITVGTSTLDQ-VEVGFTP 77 (304) T ss_pred CchHHHHHHHHHHHhhhhhccccccchhhhhccccCCCCcccce--EEEeeeeccCcccccccCCcCCccce-eecccce Confidence 111122222 123334444333333333333321 111122 3333333334444 77666 45564 3467888 Q ss_pred eEeeeeeEEEeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-cccccccccc------ccccc Q lcl|NC_012784. 198 LAYDINTHRGYFRISREAIEDA---KVNVLQELKLWMARTIAATRNKAIIDVITKGST-GSTSSGFEKE------GKKLE 267 (415) Q Consensus 198 v~~~~~k~a~~~~iS~e~l~ds---~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~-~~~~~~~~~~------~~~~~ 267 (415) -....+.++.-+.+|.+=++.+ ..++.+--.....+++.+.+++..+.|+....+ .|...+.... ..... T Consensus 78 ~~~~i~~~~~~~~y~~~El~~a~~~g~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~~~g~~GllN~p~v~~~~~~~~~a~~ 157 (304) T protein:vir:52 78 TRSYIVPWAKSVTWTKPELEQGKLLGLALNTAKIMALNKNAQQTLQKVAFLGHAKDSRLTGLLNNKSVEVYAIKGAAQNT 157 (304) T ss_pred eEEEEEEEeeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHhhhceEEEEeeccccceEEEEeCCCcceeeecCCccCC Confidence 8888888888887775544332 235666666666778888888888888643211 1211111111 00111 Q ss_pred ccchhhH----HHHHHHHHHhhhhc---cCCCEEEEcHHHHHHHHHh-hccCCcccccCcccCCCCceecceee--EEec Q lcl|NC_012784. 268 VKKAKSL----DDIKDAINLNVKPN---YEHNVAIVSQTMFAKLDKM-KDKLGNYLIQPDVKEKTQQRLLGAKI--EILP 337 (415) Q Consensus 268 ~~~~~~~----~~~~~~~~~~~~~~---~~~~~~v~~~~~~~~l~~l-kd~~G~~l~~~~~~~~~~~~l~G~pV--~~~~ 337 (415) ...+.+. +++..++.++.... ..+..++|.|+.+..|... ....|.-++. -+....+. ..|.|+ +... T Consensus 158 ~w~~~T~~eI~~di~~~~~~i~~~s~~~~~p~tl~Lpp~~~~~l~~~~~~~~~~Tvl~-~l~~n~~~-~~g~~l~I~~v~ 235 (304) T protein:vir:52 158 KVQAMDFDKAVAFFKEIFLKGMEKTKRIEAPNTFAIDSLDLAHLALVQRANTDTTALE-FLTKHLSA-AAGRQVAIKALP 235 (304) T ss_pred ccccCCHHHHHHHHHHHHHHHHhccCceecCceEEeCHHHHHHHhhccCCCCCchHHH-HHHHhccc-ccCCcceEEEec Confidence 1122233 44445555544322 3467899999999998653 2323322221 01111111 234442 2221 Q ss_pred --cccccccCCceEEEechhhcEEEEeecceEEEEee-cccCceEE--EEEEEecc-EEeccccEEEEEe Q lcl|NC_012784. 338 --DEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-YMHFGECL--MIAVRQDC-RILDYKSAIVIEY 401 (415) Q Consensus 338 --~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~-~~~~~~~~--~~~~r~d~-~v~~p~a~~~~~~ 401 (415) ....+.+|...+++-+.+.-+..+. -.+.+.+.. +..+...+ -.+.|++| .+.+|.+++++++ T Consensus 236 ~~~~~~g~~g~~r~vvY~~d~~~~~~~-vP~p~~~l~~q~~~~~~~~vp~~~r~gGv~v~~P~a~~y~D~ 304 (304) T protein:vir:52 236 SNYGTRVTDGKTRAMVYVNSKEHVIFD-VPMSPTVLDAQPKGLLAFESGLRMAFGGVTFMEPDSALYVDY 304 (304) T ss_pred ccccccCCCCceEEEEEecChhheEEe-cCccccccchhhcCCceEEecceeeeeeEEEEccceeeeecC Confidence 1223444555555544443233221 122222211 11122222 24567766 5678999999999 No 182 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=97.02 E-value=0.0002 Score=40.81 Aligned_cols=386 Identities=13% Similarity=0.055 Sum_probs=145.8 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccc---- Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVE---- 76 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~---- 76 (415) ..+....+++..+ +...+...++..+..-.....+-+.+.+.+..-.+++..+ ++-+............. T Consensus 258 ap~~adirA~~~a---ae~~r~aaI~a~fa~f~~~~a~l~a~~l~d~~~s~d~ar~---~lL~~l~~~~~p~~~~~~~~~ 331 (693) T protein:vir:95 258 APTEADIRARILA---EESGRRSAITAAFGAFSTGHAELLATCLNDMNITVDQARE---KLLAAIGADTQPAAALSAGAH 331 (693) T ss_pred CCCcchhhHHHHH---HHHHHHHHHHHHHHhccCChHHHHHHHHhhcCCCHHHHHH---HHHHHHhhccCCCCCcCcCcc Confidence 2222222222111 1111111111111110000000000111111111111111 11111111000000000 Q ss_pred -ccchhhhhhHHHHHHHHHHHHHhhh---------hhHHHHHHHHHHhhh-----h----hhhhcccccccceeecchhH Q lcl|NC_012784. 77 -VNEARTYRNQANINDLGISIQNTKV---------TSQEVRDFTEYLETR-----N----DIQGGSLKTDSGFVVIPEEI 137 (415) Q Consensus 77 -~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~-----~----~~~~~~~~~~~~~~~vP~~~ 137 (415) ......................... ...+..+.+...+.. . .......++++ .|.-+ T Consensus 332 ~~~~~g~~~~d~~~~al~~R~g~~~~~~~n~~~g~~L~elAr~~L~~rg~~~~~~~~~~~~~~a~~htTSD----Fp~IL 407 (693) T protein:vir:95 332 IHAGNGNLVGDSVRASVLARIGRGERQADNAYNGMTLRELARASLVDRGIGVASLNAPQMVGLAFTHTSSD----FGLIL 407 (693) T ss_pred ccCCchhHHHHHHHHHHHHhcCcccccCCccccCCcHHHHHHHHHHhcCCccCCCCHHHHHHHHHhcCcch----hHHHH Confidence 0000000000000111100000000 000000000000000 0 00000112222 23322 Q ss_pred HhHHHHHHhh-----hhhhhhcceeEEccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhh Q lcl|NC_012784. 138 VTDILKLKEV-----EFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRIS 212 (415) Q Consensus 138 ~~~Ii~~~~~-----~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS 212 (415) .+-+-+.+++ ......++....++--. ..........+....|.|+++.+.. ...=..-++...+++.++.|| T Consensus 408 ~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk-~~~~~~lg~~~~L~~V~E~gEyk~~-t~~e~~e~~~l~tyG~~~~iT 485 (693) T protein:vir:95 408 LDVANKSVLAGWEEAEETFPLWTKSGILTDFK-PARRVGLGEFSSLRQVREGAEYKYV-TLGERGEQIILATYGELFSIT 485 (693) T ss_pred HHHHHHHHHHHHHhhhhHHHHHhccCCCCccc-ccceeecCCCCChhhcCCCCceeee-ecCCccceeehhhcCCeeeec Confidence 2222222222 11233333322222111 1122333455677889999988532 222233578889999999999 Q ss_pred HHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccc---cccccccccccccccccccchhhHHHHHHH---HHHhh- Q lcl|NC_012784. 213 REAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKG---STGSTSSGFEKEGKKLEVKKAKSLDDIKDA---INLNV- 285 (415) Q Consensus 213 ~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~- 285 (415) ++++-+-+.++.+-|...+.++.++.+++.+..-...+ ..+..+.+....+..+.+.+..+.+.+-.+ +.... T Consensus 486 RqaiINDDLga~~~ip~~~g~aA~~~~~~~vy~~L~~Np~m~DGk~LFhadH~Nl~tga~sals~~sl~~a~~am~~qk~ 565 (693) T protein:vir:95 486 RQAIINDDLQMLSDIPFKLGQAAKATIGDLVYAVLTGNPAMSDGKTLFHADHSNLLTGAASALSIDSLSKAKTQMATQKA 565 (693) T ss_pred HHhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccCCcceeeccccccccccccccChHHHHHHHHHHHHhhc Confidence 99998777889999999999999999998654332221 111222222222222222233334443333 22222 Q ss_pred --------hhccCCCEEEEcHHHHHHHHHhhccCCcccccCcccCCCCceecce-eeEEeccccccccCCceEEEechhh Q lcl|NC_012784. 286 --------KPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGA-KIEILPDEVLGQKGNNTLIIGNLKD 356 (415) Q Consensus 286 --------~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~-pV~~~~~~~~~~~~~~~~~~gd~~~ 356 (415) .-+..+..|+..|.......++-.+.-.| ..+...+..+-+.|+ +|+..+.+...+. ..-.++.|... T Consensus 566 ~~~~~~g~~L~i~P~~llvP~~le~~a~~l~~s~~~~--~a~~~~~~~NP~~~~~~vi~~prL~~~s~-~~Wyl~a~~~~ 642 (693) T protein:vir:95 566 QVEKGKGRTLNIRPGFVLTPVALEDKANQIINSESVP--GADVNSGIVNPIRAFAQVIGEPRLDDASA-TAWYMAAKKGS 642 (693) T ss_pred chhccCCceeecccceEEecchHHHHHHHHhcccccc--ccccccccccchhccccccccceecCCCC-CceEEecCCCC Confidence 12335677888877766666654432211 111222223335554 4555555432221 11122333211 Q ss_pred -----cEEEEeecceEEEE-eecccCceEEEEEEEeccEEeccccEEEEEeecCCCCcc Q lcl|NC_012784. 357 -----AIVLFDRSQYQASW-TDYMHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEG 409 (415) Q Consensus 357 -----~~~~~~~~~~~i~~-~~~~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~~~~~~ 409 (415) +|+.+. ++..++. ..|..+...+++..-|++++++-.++++- +|+ T Consensus 643 dtie~~yL~G~-~~P~ie~~~gf~~dG~~~kvr~D~G~~~iD~Rg~~kn-------~GA 693 (693) T protein:vir:95 643 DTIEVAYLDGV-DTPYLEQQEGFTVDGVASKVRIDAGVAPLDFRGLQKS-------NGA 693 (693) T ss_pred CeEEEEEecCC-CCCeEeecCCCCcceEEEEEEEeccCceeeccccccC-------CCC Confidence 122221 2333332 23555666678888888888888876652 122 No 183 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=97.00 E-value=0.00022 Score=40.69 Aligned_cols=273 Identities=9% Similarity=-0.017 Sum_probs=119.4 Q ss_pred hhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEc-------cCCceeEEEEeecCCccccccc--cccccccc Q lcl|NC_012784. 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV-------TNGSGKYPVVRQSEVAALEKVE--ELEENPEL 190 (415) Q Consensus 120 ~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~-------~~~~~~~~~~~~~~~~~a~~v~--Eg~~~~~~ 190 (415) +.. +. --.+|+.++.+.++.+++..++..++...-- .+.+.+++++ ......... .+..+ .. T Consensus 1 MAN--~l---lT~iP~iia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p---~~~~v~d~~~~~~~~~-~~ 71 (423) T protein:vir:35 1 MAN--NL---ESNISQIVLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRP---HQFKSERTETGDITGK-DK 71 (423) T ss_pred Ccc--ch---hhhhHHHHHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeC---CcceeecccCcCCCCc-cc Confidence 111 11 1236999999999999999988887654211 1233334433 222222221 11111 11 Q ss_pred cccccee--eEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccc Q lcl|NC_012784. 191 AVKPFFQ--LAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEV 268 (415) Q Consensus 191 ~~~~f~~--v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~ 268 (415) ++..-+. +++.-+|+..+ .++.+=+..+..+|+.++...+ .+++..+|..++...-.+.+... +.. . T Consensus 72 ~~~~e~~v~l~id~~k~~a~-~v~d~e~~l~i~~~~~~l~~a~-~ala~~vd~~l~~~l~~~a~~~v-------gt~--~ 140 (423) T protein:vir:35 72 NGLFSAKATGKVGKYITVAV-EWTQIEEALKLNQLDQILSPIH-ERMVTDLETELAHFMMNNGALSL-------GSP--N 140 (423) T ss_pred cccccceeeEEeccceeccc-eeCHHHHHhhHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhcccccc-------ccc--c Confidence 2222233 44454554443 4444433334457887777664 77888899988763322221110 011 1 Q ss_pred cchhhHHHHHHHHHHhhhhccCC-C-EEEEcHHHHHHHHH----hhccCCcccccCcccCCC-CceecceeeEEeccccc Q lcl|NC_012784. 269 KKAKSLDDIKDAINLNVKPNYEH-N-VAIVSQTMFAKLDK----MKDKLGNYLIQPDVKEKT-QQRLLGAKIEILPDEVL 341 (415) Q Consensus 269 ~~~~~~~~~~~~~~~~~~~~~~~-~-~~v~~~~~~~~l~~----lkd~~G~~l~~~~~~~~~-~~~l~G~pV~~~~~~~~ 341 (415) +....|++++++...+...+... . ..+++|..+..|.+ +...++ .-...+..+. .+++.|+.|+.++++|. T Consensus 141 t~~~~~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~--~~~~alr~g~i~G~i~GFdv~~Snnvp~ 218 (423) T protein:vir:35 141 TAIKKWADVAQTASFIKDIGIKTGENYAIMDPWSAQRLADAQSGLHAADQ--LVRTAWENAQISGNFGGIRALMSNGLAS 218 (423) T ss_pred CCcchHHHHHHHHHHHHHhcCCcCCCEEEeCHHHHHHHhccccceecccc--chhHHHhhccceeeecceEEEEcCCCcc Confidence 11245788888888887666653 2 55899999888753 111110 1111233333 37899999999999996 Q ss_pred cccCCce--EEEe-----------chhhcEEE-----------Eeecc-eEEEE----eeccc---------CceEEEEE Q lcl|NC_012784. 342 GQKGNNT--LIIG-----------NLKDAIVL-----------FDRSQ-YQASW----TDYMH---------FGECLMIA 383 (415) Q Consensus 342 ~~~~~~~--~~~g-----------d~~~~~~~-----------~~~~~-~~i~~----~~~~~---------~~~~~~~~ 383 (415) .+++... ++++ +.+..... ...++ +++.- .+..+ ..+.+++. T Consensus 219 ~T~gt~~~~~~v~~a~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~t~aGv~~v~~~t~~~~~~~~t~~~~~~~V~ 298 (423) T protein:vir:35 219 RKQGDFDGAITVKTAPNVDYLSVKDSYQFTVALTGATPSKTGFLKAGDQLKFTSTHWLNQQSKQTLYNGSTAMSFTATVL 298 (423) T ss_pred ccccccccceeeccccccccccccccccceeeeeeeeeccCCcEEecceEEeeeeeeccccccceeecccCCceeEEEEe Confidence 5443321 1110 00000000 00000 00000 00000 00111111 Q ss_pred ------------EEeccE-------------------------------------EeccccEEEEEeecCCCCccccccc Q lcl|NC_012784. 384 ------------VRQDCR-------------------------------------ILDYKSAIVIEYDDSERGEGDLGLE 414 (415) Q Consensus 384 ------------~r~d~~-------------------------------------v~~p~a~~~~~~t~~~~~~~~~~~~ 414 (415) +.++.+ +++++||+..+...+..++.|. .+ T Consensus 299 ~~~~~~a~g~~~v~i~p~~~~~~~~~~~~~v~a~~a~~~~vt~~~~a~~~~~~nl~~~~~a~~l~~~~l~~~~~~~~-~~ 377 (423) T protein:vir:35 299 EETNSTASGDVTVKLSGVPIYDEKNSQYNAVDAKVKAGDAVSIIGTAKQQMKPNLFYNKFFCGLGTIPLPKLHSLDS-AV 377 (423) T ss_pred ccccccccCceeEEccccccccCCCcccccccccccCCceeeeeecCCCceeEEEeecCceeEEEEEccccCCccce-ee Confidence 000111 1222223222222222211111 11 Q ss_pred C Q lcl|NC_012784. 415 A 415 (415) Q Consensus 415 ~ 415 (415) + T Consensus 378 ~ 378 (423) T protein:vir:35 378 A 378 (423) T ss_pred c Confidence 1 No 184 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=96.89 E-value=0.00028 Score=40.08 Aligned_cols=280 Identities=10% Similarity=0.012 Sum_probs=124.4 Q ss_pred ccccccceeecch--hHHhHHHHHHhhhhhhhhcceeEE----------ccCCceeEEEEeecC-Cccccccccc--ccc Q lcl|NC_012784. 123 SLKTDSGFVVIPE--EIVTDILKLKEVEFNLDKYVTVKR----------VTNGSGKYPVVRQSE-VAALEKVEEL--EEN 187 (415) Q Consensus 123 ~~~~~~~~~~vP~--~~~~~Ii~~~~~~~~l~~~~~~~~----------~~~~~~~~~~~~~~~-~~~a~~v~Eg--~~~ 187 (415) ...+.-....+|. .+..-+.+...+.+.+.+ +.++. ..+....+|+..... +.....-+.. ... T Consensus 1 Ma~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~q-SGii~~d~~l~~~~~~gG~~~~iPf~~~l~g~~e~n~~~dt~~~~~ 79 (349) T protein:vir:94 1 MAITTIGNIVTGNIPVLASYMTEDPVEKTAFFN-SGILTPTPYAAEIARGPSNIANLPFWKAIDTSIEPNYSNDVYQDIA 79 (349) T ss_pred CCceEEeeeeccChHHHHHHHHHhHHHhhhhhh-ccceeccHHHHHHHhcCCCEEEeeeeecCCCCcccccCCCCccccc Confidence 1122223456665 355555555544444443 22211 122333444433222 2121111111 111 Q ss_pred cccccccceeeEeeeeeEEEe--ehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhh---cccccccccc--ccccc Q lcl|NC_012784. 188 PELAVKPFFQLAYDINTHRGY--FRISREAIEDAKVNVLQELKLWMARTIAATRNKAIID---VITKGSTGST--SSGFE 260 (415) Q Consensus 188 ~~~~~~~f~~v~~~~~k~a~~--~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~---g~g~~~~~~~--~~~~~ 260 (415) +....-++.++-.....-.+| -.++.++-- + |..+.|.++++.-..+...+.+|. |.-....... ..... T Consensus 80 t~~kit~~~~~a~~~~r~kaw~~~Dla~~lsG-~--dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~~~~~~~~~~ 156 (349) T protein:vir:94 80 TPRAIQTGEMMARVAYLNEGFGQADLTVELTS-Q--NPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSATDAYHEQN 156 (349) T ss_pred ccccccccceeeeeeeeccccchhHHHHHhhC-c--hHHHHHHHHHHHHHhhHHHHHHHHHHHhhhcccccccccccccC Confidence 111122233444333333333 255666543 2 567888999988888887776554 2211100000 00000 Q ss_pred cccccccccchhhHHHHHHHHHHhhhhcc-----CCCEEEEcHHHHHHHHHhhccCCcccccCcccCCCCceecceeeEE Q lcl|NC_012784. 261 KEGKKLEVKKAKSLDDIKDAINLNVKPNY-----EHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEI 335 (415) Q Consensus 261 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~ 335 (415) .........+..+...++++..++.+... .=+.++||+.++..|++++-=+ | +++.-....-.+++|++|++ T Consensus 157 ~~~~d~~~~a~~~~~~~~~A~~~~Gdaa~Gd~~~~lt~i~mHS~v~~~L~~~~li~--~-i~~s~~~~~i~ty~G~~Viv 233 (349) T protein:vir:94 157 DMVVDVSATSGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLID--F-IRDAENNTMFATYQGYRVIV 233 (349) T ss_pred ceeEEecccCCCChhhHHHHHHHHHHHhccccccceeEEEEchHHHHHHHhcchhh--h-ccCcccCcccceecCcEEEE Confidence 00111122233456677777777766532 2368999999999998753211 0 11111122236899999999 Q ss_pred eccccccccCC----ceEEEechhhcEEEEeec-ceEEEEeecccC-----ceEEEEEEEeccEEeccccEEEEEeecCC Q lcl|NC_012784. 336 LPDEVLGQKGN----NTLIIGNLKDAIVLFDRS-QYQASWTDYMHF-----GECLMIAVRQDCRILDYKSAIVIEYDDSE 405 (415) Q Consensus 336 ~~~~~~~~~~~----~~~~~gd~~~~~~~~~~~-~~~i~~~~~~~~-----~~~~~~~~r~d~~v~~p~a~~~~~~t~~~ 405 (415) +|.||....+. .+.+||. .++...+.. ...+++.++... ...+...-| .+++|..+..-.- ..+ T Consensus 234 DD~~Pv~~~g~~~~yttylfg~--GAi~~~~~~~~~~~E~~rd~~~g~~~G~d~L~~R~~---~~~hp~G~s~~~a-~v~ 307 (349) T protein:vir:94 234 DDSMTVVGQDTSRKFISIIFGQ--GAIGYGEGNPEMPLEYEREASRANGGGVETLWTRKT---WLLHPFGYSFTSA-VIT 307 (349) T ss_pred eCCCccccCCCCceEEEEEeec--ceEEeecCCCCcceeeecccccCCcceeEEEEEeeE---EEeeeeeeeeccc-ccC Confidence 99999643221 2346663 223322221 123444443321 122222222 3567877666541 111 Q ss_pred CCcc----cccc------cC Q lcl|NC_012784. 406 RGEG----DLGL------EA 415 (415) Q Consensus 406 ~~~~----~~~~------~~ 415 (415) .++. .+-+ -+ T Consensus 308 ~~~~~~~~~sPt~aeLa~~~ 327 (349) T protein:vir:94 308 GNGTETIARSASWQDLANAA 327 (349) T ss_pred CCccccccCCCChHHhcCCc Confidence 1111 1222 22 No 185 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=96.85 E-value=0.0003 Score=39.92 Aligned_cols=281 Identities=10% Similarity=-0.001 Sum_probs=124.6 Q ss_pred ccccccceeecch--hHHhHHHHHHhhhhhhhhcceeEE----------ccCCceeEEEEeecCC-ccccccccc--ccc Q lcl|NC_012784. 123 SLKTDSGFVVIPE--EIVTDILKLKEVEFNLDKYVTVKR----------VTNGSGKYPVVRQSEV-AALEKVEEL--EEN 187 (415) Q Consensus 123 ~~~~~~~~~~vP~--~~~~~Ii~~~~~~~~l~~~~~~~~----------~~~~~~~~~~~~~~~~-~~a~~v~Eg--~~~ 187 (415) ...+.-....+|. .+.+-+.+...+.+.+.+ +.++. .++....+|+....++ .....-..+ +.. T Consensus 1 Ma~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~q-SGii~~d~~l~~~~~~gG~~~~iPf~~~L~g~~e~nv~~D~~~~~~ 79 (349) T protein:vir:78 1 MAITTIGDIVTGNIPVLASYMTEDPVEKTAFFD-SGILTSTPYAAEIANGPSNIANLPFWKAIDTSIEPNYSNDVYQDIA 79 (349) T ss_pred CCceEEeeeeccCHHHHHHHHHHhhHHhhhhhh-ccceeccHHHHHHhhcCCCEEEeeeeecCCCCcccccCCCCccccc Confidence 1122223456666 355555555544444333 22211 2233344554433222 222111111 111 Q ss_pred cccccccceeeEeeeeeEEEee--hhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhh---cccccccccc-cc-ccc Q lcl|NC_012784. 188 PELAVKPFFQLAYDINTHRGYF--RISREAIEDAKVNVLQELKLWMARTIAATRNKAIID---VITKGSTGST-SS-GFE 260 (415) Q Consensus 188 ~~~~~~~f~~v~~~~~k~a~~~--~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~---g~g~~~~~~~-~~-~~~ 260 (415) +....-++.++-...+.-.+|. .++.++-- + |..+.|.++++.-..+...+.+|. |.-....... .. ... T Consensus 80 t~~kitt~~~~a~~~~r~kaw~~~Dla~~lsG-~--dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~a~~~~~~~~ 156 (349) T protein:vir:78 80 TPRAIQTGEMMARVAYLNEGFGQADLTVELTS-Q--NPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSATDAYHEQN 156 (349) T ss_pred ccccccccceeeeeeeeccccchhHHHHHhhC-c--hHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcccccccchhhhcc Confidence 1112223344444444333333 45555543 2 567888999988887777765554 3211100000 00 000 Q ss_pred cccccccccchhhHHHHHHHHHHhhhhc-----cCCCEEEEcHHHHHHHHHhhccCCcccccCcccCCCCceecceeeEE Q lcl|NC_012784. 261 KEGKKLEVKKAKSLDDIKDAINLNVKPN-----YEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEI 335 (415) Q Consensus 261 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~ 335 (415) .-.....+.+..+...++++...+.++. ..=+.++||+.++..|++++-=+ | +++.-....-.+++|++|++ T Consensus 157 ~~t~d~s~~a~~~~~~~~dA~~~lgda~~Gd~~~~lt~i~mHS~v~~~L~~~~li~--~-i~~s~~~~~i~ty~G~~Viv 233 (349) T protein:vir:78 157 DMVVDVSATLGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLID--F-IRDAENNTMFATYQGYRVIV 233 (349) T ss_pred cceeeeccccCCChhhhhhhHHHHHHHhccccccceeEEEEchHHHHHHHhhhhhh--h-ccCcccCcccceecCeEEEE Confidence 0001111222345667777777766653 23368999999999998653210 1 11111122236899999999 Q ss_pred eccccccccCC----ceEEEechhhcEEEEeecc-eEEEEeecccC-----ceEEEEEEEeccEEeccccEEEEEeecC- Q lcl|NC_012784. 336 LPDEVLGQKGN----NTLIIGNLKDAIVLFDRSQ-YQASWTDYMHF-----GECLMIAVRQDCRILDYKSAIVIEYDDS- 404 (415) Q Consensus 336 ~~~~~~~~~~~----~~~~~gd~~~~~~~~~~~~-~~i~~~~~~~~-----~~~~~~~~r~d~~v~~p~a~~~~~~t~~- 404 (415) ++.||....+. .+.+||. .++...+... ..+++.++... ...+...-| .+++|..+..-.-..+ T Consensus 234 DD~~Pv~~~g~~~~yttylfg~--GAi~~~~~~~~~~~et~rd~~~g~~~G~d~l~~R~~---~~~hp~G~s~~~a~v~~ 308 (349) T protein:vir:78 234 DDSMTVVGQGAQRKFISIIFGQ--GAIGYGEGNPVMPLEYEREASRANGGGVETLWTRKT---WLLHPFGYRFTSAVITG 308 (349) T ss_pred eCCCccccCCCCceEEEEEeec--ceEEEccCCCccceeeecccccCCcceeEEEEEeeE---EEeeeeeeeeccccccC Confidence 99999654332 2346663 2232222121 23444443321 122322222 3567777666542211 Q ss_pred --CCCccccccc------C Q lcl|NC_012784. 405 --ERGEGDLGLE------A 415 (415) Q Consensus 405 --~~~~~~~~~~------~ 415 (415) ..++..+-+- + T Consensus 309 ~~~~~~~~sPt~aeLa~~~ 327 (349) T protein:vir:78 309 NGTETIARSASWQDLANAT 327 (349) T ss_pred CccccccCCCChHHhcCCc Confidence 0111122222 2 No 186 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=96.78 E-value=0.00035 Score=39.56 Aligned_cols=306 Identities=10% Similarity=0.031 Sum_probs=122.2 Q ss_pred ccccchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhh-hhh Q lcl|NC_012784. 75 VEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFN-LDK 153 (415) Q Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~-l~~ 153 (415) ...-... +.+....+.......++. ........+..-+....-..+...+-+.....+- ... T Consensus 1 ~~~~~~~----------------~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~ 63 (329) T protein:vir:10 1 MDGIFIT----------------GVKTMNKEIKNATGKLKL-NLQHFANKSVEPGDTLLKNKHVGILEKVTAANSYSAPA 63 (329) T ss_pred CCceEEe----------------chhhhhhhhhcccceeEE-ehhhhcCCccCCchhHHHHHHHHHHHHHHHhhceeeee Confidence 0000000 000000000000000000 0000001111111111112223323222221110 000 Q ss_pred cce--eEEccCCceeEEEEeecCCccccccccccccc-ccccccceeeEeeeeeEEEeehhhHHHH-hcchHH--HHHHH Q lcl|NC_012784. 154 YVT--VKRVTNGSGKYPVVRQSEVAALEKVEELEENP-ELAVKPFFQLAYDINTHRGYFRISREAI-EDAKVN--VLQEL 227 (415) Q Consensus 154 ~~~--~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~-~~~~~~f~~v~~~~~k~a~~~~iS~e~l-~ds~~~--l~~~l 227 (415) +++ ..... ..++.|++... .......-++... ..-+.++...+++-.+...+.. +.+- ..+... +...+ T Consensus 64 ~~N~~~e~~~--g~tVkIp~i~~-~gl~DY~R~~g~~~g~vt~~~~t~tidqdR~~~F~V--D~~D~dEtn~~l~a~~i~ 138 (329) T protein:vir:10 64 VISNDAIFMQ--GRSFTVIKGDV-TELKDYKRNATNEFDHPQIQETTYFLDQEKYWGRFV--DALDRRDTEGNIDINYVV 138 (329) T ss_pred ecccceeecc--CcEEEEeeecc-cccccccCCCCccccccccceeEEEeecccceeeec--chhhHhhhhhhhhHHHHH Confidence 111 22222 33555655544 3333333222111 1112234445555544444321 1111 112222 22334 Q ss_pred HHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccC-CCEEEEcHHHHHHHHH Q lcl|NC_012784. 228 KLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYE-HNVAIVSQTMFAKLDK 306 (415) Q Consensus 228 ~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~v~~~~~~~~l~~ 306 (415) .+.....+.-.+|...+.-.-.+... ......+....|+.+.++...+..+... +=.++++|..+..|.+ T Consensus 139 ~~~~~~~v~pEiDay~~skla~~a~~---------~~~~~~t~~nay~~i~~a~~~Lde~~vp~~Rvl~VtP~~~~~Lk~ 209 (329) T protein:vir:10 139 AKQASEVVAPYLDNLRFATLARNKAK---------HLTVGSGADAQYDAVLDVSVELDEIGAGASRILFVTPKFYKGIKK 209 (329) T ss_pred HHHHHHHhhhHHHHHHHHHHHhhccc---------ccccccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHh Confidence 44455666666676554432221100 1111233455688888888888766543 3368899999888875 Q ss_pred hhccCCcc-cccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEe-ecceEEEEeecccCceEEEEEE Q lcl|NC_012784. 307 MKDKLGNY-LIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFD-RSQYQASWTDYMHFGECLMIAV 384 (415) Q Consensus 307 lkd~~G~~-l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~-~~~~~i~~~~~~~~~~~~~~~~ 384 (415) ..--.... ........+..++|.|.||+.+++.. ..+..+++|.... +.... ...+++-......+...++.-. T Consensus 210 ~~~f~~~~~~~~~~~~~g~Vg~idG~~Ii~vps~~---~k~in~ii~~~~A-~~~~~K~~~~~~~~p~~~~~a~~v~gr~ 285 (329) T protein:vir:10 210 FVIELPQGDNRQQVLGKGVQGELDGFTIVKVPSKM---LQGVEAMAVIGEV-MASPIQANEAKLNSNVPGMFGTLAEQML 285 (329) T ss_pred hhhhhccccccccceeeeeeeeecCeEEEEecCCc---ccceeEEEEcCCc-eeeeeeeeeeeeeCCCCccchheeeeee Confidence 22111111 11123345566789999999875432 2233466776542 22221 2223332222344555667777 Q ss_pred EeccEEeccccEEEE--EeecCCCCcccccccC Q lcl|NC_012784. 385 RQDCRILDYKSAIVI--EYDDSERGEGDLGLEA 415 (415) Q Consensus 385 r~d~~v~~p~a~~~~--~~t~~~~~~~~~~~~~ 415 (415) ++|+.|++|++.... .-++++.+++.....+ T Consensus 286 yyd~~V~~~k~~~I~~~~~~a~~~~~~~~~~~~ 318 (329) T protein:vir:10 286 YTGAFVPEHLQKYIFTIGGKEVETNRDGVDAHA 318 (329) T ss_pred eeeeEEEccccCEEEEecccCcccCCCCCCccc Confidence 899999999843332 2333444444332222 No 187 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=96.53 E-value=0.00054 Score=38.50 Aligned_cols=274 Identities=9% Similarity=-0.023 Sum_probs=119.8 Q ss_pred ccce--eecchhHHhHHHHHHhhhhhhhhcceeEE---c----cCCceeEEEEeecCCcccccc--ccccc--ccccccc Q lcl|NC_012784. 127 DSGF--VVIPEEIVTDILKLKEVEFNLDKYVTVKR---V----TNGSGKYPVVRQSEVAALEKV--EELEE--NPELAVK 193 (415) Q Consensus 127 ~~~~--~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~---~----~~~~~~~~~~~~~~~~~a~~v--~Eg~~--~~~~~~~ 193 (415) .... -.+|+.++...++.+++..++..++...- . .+.+.+++++. ....... ..+.. .++.. . T Consensus 1 MaN~llT~~p~iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~---~~~~~d~~~~~~~~~~~~dl~-e 76 (423) T protein:vir:10 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPH---QFSSLRTPTGDISGQNKNNLI-S 76 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCC---ceeeeccCCccccccccCccc-c Confidence 1111 13689999999999999998877765411 1 23333444332 1222111 12211 11111 1 Q ss_pred cceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhh Q lcl|NC_012784. 194 PFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKS 273 (415) Q Consensus 194 ~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~ 273 (415) .-..+++.-+|+..+-.=..|+.. ..-+++++|... .++++..+|..+++-.....+.. .+. ....... T Consensus 77 ~~v~l~id~~k~va~~v~d~E~~~-~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~~~~~-------~gt--~~t~~~a 145 (423) T protein:vir:10 77 GKATGRVGNYITVAVEYQQLEEAI-KLNQLEEILAPV-RQRIVTDLETELAHFMMNNGALS-------LGS--PNTPITK 145 (423) T ss_pred ceeEEEeeceeeeeeeechHHHhc-ChhhHHHHHHHH-HHHHHHHHHHHHHHHHhhccccc-------ccc--CCcccch Confidence 112466666666655544555553 444687766555 68899999998875321111110 001 1111235 Q ss_pred HHHHHHHHHHhhhhccCC--CEEEEcHHHHHHHHHhh--ccCCcccccCcccCCC-CceecceeeEEeccccccccCCce Q lcl|NC_012784. 274 LDDIKDAINLNVKPNYEH--NVAIVSQTMFAKLDKMK--DKLGNYLIQPDVKEKT-QQRLLGAKIEILPDEVLGQKGNNT 348 (415) Q Consensus 274 ~~~~~~~~~~~~~~~~~~--~~~v~~~~~~~~l~~lk--d~~G~~l~~~~~~~~~-~~~l~G~pV~~~~~~~~~~~~~~~ 348 (415) |+++.++...|...+... =..+++|..+..|.+-. -....-.-...+..+. ++++.|+.|+.++++|..+.+... T Consensus 146 ~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnip~~T~gt~~ 225 (423) T protein:vir:10 146 WSDVAQTASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIRALMSNGLASRTQGAFG 225 (423) T ss_pred HHHHHHHHHHHHhccCCcCCCEEEeChHHHHHHhccccceecccccchhhhhhccceeeecceEEEEeCCCccccccccc Confidence 888888877776666553 36789999888876421 0111111122233333 368999999999999965443221 Q ss_pred ----EEEechhhcEEEEeecceEE--E--E-eeccc------CceE-EEEEEEeccEEe------ccccEE--------- Q lcl|NC_012784. 349 ----LIIGNLKDAIVLFDRSQYQA--S--W-TDYMH------FGEC-LMIAVRQDCRIL------DYKSAI--------- 397 (415) Q Consensus 349 ----~~~gd~~~~~~~~~~~~~~i--~--~-~~~~~------~~~~-~~~~~r~d~~v~------~p~a~~--------- 397 (415) ...|-.-.+....+.....+ . + +.+.. +... ....-+....++ ++.-|+ T Consensus 226 ~t~~~~~~~~v~~~a~~~a~~~~~~~~~~~~~~~~~l~~GD~~t~aGv~~v~~~tk~~~~~~~t~~~~~~~v~a~~~~~~ 305 (423) T protein:vir:10 226 GTLTVKTQPTVTYNAVKDSYQFTVTLTGATASVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTADANSDS 305 (423) T ss_pred cceeeeecceeccccccccceeeeeeeeccccccCceeecceEEecceeeecccccccccccccCcceEEEEEeeeeecc Confidence 11111000000000011111 1 0 00000 0000 000001111111 111111 Q ss_pred ----EEEeec----------------CCCCcccccccC Q lcl|NC_012784. 398 ----VIEYDD----------------SERGEGDLGLEA 415 (415) Q Consensus 398 ----~~~~t~----------------~~~~~~~~~~~~ 415 (415) -+++.+ ++.-++.++... T Consensus 306 ~g~~tv~i~p~~i~~~~~~~~~~v~a~~a~~~~vT~~~ 343 (423) T protein:vir:10 306 GGDVTVTLSGVPIYDTTNPQYNSVSRQVEAGDAVSVVG 343 (423) T ss_pred CCceeeeccCccccccCCcccccccccccCCceeeccc Confidence 122221 111111111111 No 188 >protein:vir:2016 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046760;genbank:gi:9630331;genbank:GeneID:1261541 Probab=96.22 E-value=0.00085 Score=37.41 Aligned_cols=303 Identities=12% Similarity=0.109 Sum_probs=139.9 Q ss_pred hhhHHHHHHHHHHhhhhhhhhcccc--cccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCccc Q lcl|NC_012784. 101 VTSQEVRDFTEYLETRNDIQGGSLK--TDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAAL 178 (415) Q Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a 178 (415) ....-...+....... +...+.. .......|-+.+...+...+.+.+.+++.++++++.-..|...- ...+++-+ T Consensus 1 M~~~tr~~~~~y~~~~--A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~-lg~~g~ia 77 (357) T protein:vir:20 1 MRQETRFKFNAYLSRV--AELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIG-IGVTGSIA 77 (357) T ss_pred CChHHHHHHHHHHHHH--HHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEe-cccCcccc Confidence 0011111111111111 1111111 12334556667888899999999999999999999887765432 23333333 Q ss_pred ccccc-c-ccccccccccceeeEeeeeeEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhhcccccccc- Q lcl|NC_012784. 179 EKVEE-L-EENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK--VNVLQELKLWMARTIAATRNKAIIDVITKGSTG- 253 (415) Q Consensus 179 ~~v~E-g-~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~--~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~- 253 (415) +-+.- + ......+...++.-.+..++.=--+.|+.+.|...+ .+|+..+++.+.+.++.=.-.--++|+....+. T Consensus 78 grtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td 157 (357) T protein:vir:20 78 STTDTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRIRNAIIKRQSLDFIMAGFNGVKRAETSD 157 (357) T ss_pred ccccCCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCC Confidence 32211 1 111111223455556666666556677878777532 356776666666665433333333443322111 Q ss_pred ----cccccc--------------------c-cc-------cccccccchhhHHHHH-HHHHHhhhhccCC---CEEEEc Q lcl|NC_012784. 254 ----STSSGF--------------------E-KE-------GKKLEVKKAKSLDDIK-DAINLNVKPNYEH---NVAIVS 297 (415) Q Consensus 254 ----~~~~~~--------------------~-~~-------~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~---~~~v~~ 297 (415) +.+-.. . .. ...........+|.++ ++++.+.++.++. -++++. T Consensus 158 ~~~nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG 237 (357) T protein:vir:20 158 RSSNPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGRTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVG 237 (357) T ss_pred hhhCcCccccchhHHHHHHhhchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEc Confidence 000000 0 00 0011122344566666 4666666665554 367777 Q ss_pred HHHHHH-HHHhhccCCcccccCcccC---CCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeec Q lcl|NC_012784. 298 QTMFAK-LDKMKDKLGNYLIQPDVKE---KTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDY 373 (415) Q Consensus 298 ~~~~~~-l~~lkd~~G~~l~~~~~~~---~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~ 373 (415) ...+.. -..|-++.+.|- +-... ....+|-|+|.+..+++|..+ +++=-|++--.-+.+...+-...+. T Consensus 238 ~dLla~k~~~l~n~~~~pt--E~~Aa~~i~s~k~iGGl~a~~~PfFP~~~-----ilVT~L~NLsIY~Q~gs~RR~~~d~ 310 (357) T protein:vir:20 238 RQLLADKYFPIVNKEQDNS--EMLAADVIISQKRIGNLPAVRVPYFPADA-----MLITKLENLSIYYMDDSHRRVIEEN 310 (357) T ss_pred hhhhhhhhhhHhhccCChH--HHHHHHHHHHhhhhCCceeEEccccCCCc-----eEEeeccccEEEEecCcEEEEEEec Confidence 666443 333433333331 11111 113578999999999999764 5665555432223333333322222 Q ss_pred ccCceEEEEEEE-eccEEeccccEEEEE------eecCCCCcccccc Q lcl|NC_012784. 374 MHFGECLMIAVR-QDCRILDYKSAIVIE------YDDSERGEGDLGL 413 (415) Q Consensus 374 ~~~~~~~~~~~r-~d~~v~~p~a~~~~~------~t~~~~~~~~~~~ 413 (415) ....+.--.+.| -+..|-++.+++.++ ..+++.+.++..+ T Consensus 311 p~r~riE~y~s~Ne~YvVEd~~~~a~iE~i~~~~~~~p~~~~~~~~a 357 (357) T protein:vir:20 311 PKLDRVENYESMNIDYVVEDYAAGCLVEKIKVGDFSTPAKATAEPGA 357 (357) T ss_pred cccccccchhhhcceeeeeccccEEEeeeeeeccccCCccCCCCCCC Confidence 111111000111 122333444444443 3345555555555 No 189 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=96.21 E-value=0.00087 Score=37.37 Aligned_cols=271 Identities=8% Similarity=-0.071 Sum_probs=120.8 Q ss_pred cccee--ecchhHHhHHHHHHhhhhhhhhcceeEE-----c--cCCceeEEEEeecCCcccccccccccccc-cccccc- Q lcl|NC_012784. 127 DSGFV--VIPEEIVTDILKLKEVEFNLDKYVTVKR-----V--TNGSGKYPVVRQSEVAALEKVEELEENPE-LAVKPF- 195 (415) Q Consensus 127 ~~~~~--~vP~~~~~~Ii~~~~~~~~l~~~~~~~~-----~--~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~-~~~~~f- 195 (415) ..... ++|+-++.++++.+++..++..++...- . .+.+.++++|. ...+.........+. .....= T Consensus 1 MANsl~~l~p~iia~~al~~l~~~lV~~~lV~r~y~~ef~~ak~GDTV~I~~P~---~~~~~d~~~~~~t~~~~~~l~e~ 77 (423) T protein:vir:10 1 MANNLDANVSQIVLKKFLPGFMSDLVLCKTVDRQLLAGEINSSTGDSVSFKRPH---QFKSERTMDGDITGKSKNSLISA 77 (423) T ss_pred CccccccccHHHHHHHHHHHHHhhcccchhhccCCCccccccccCCEEEEeeCC---ceeeecccCcccCcccccccccc Confidence 22223 7899999999999999999888776421 1 22344444432 111111111111010 011111 Q ss_pred -eeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhhH Q lcl|NC_012784. 196 -FQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSL 274 (415) Q Consensus 196 -~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 274 (415) ..+++.-+|+..+-.=+.|+. ....++++++... .++++..+|..|............ +.. ......| T Consensus 78 ~v~l~id~~k~~a~~v~d~E~~-l~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~~~~~v-------gt~--~t~~~a~ 146 (423) T protein:vir:10 78 KATGEVGNYITVAVEYRQIEEA-LKLNQLDQILVPI-NERMVTDLETELALFMMKHGALSL-------GSP--NTPIKKW 146 (423) T ss_pred eEEEEecceeeeeeeeChHHHh-cChhHHHHHHHHH-HHHHHHHHHHHHHHHhhhcccccc-------ccc--ccccccH Confidence 245555555555443345554 4555787766554 789999999988643322211111 000 1112347 Q ss_pred HHHHHHHHHhhhhccCC--CEEEEcHHHHHHHHH----hhccCCcccccCcccCC-CCceecceeeEEeccccccccCCc Q lcl|NC_012784. 275 DDIKDAINLNVKPNYEH--NVAIVSQTMFAKLDK----MKDKLGNYLIQPDVKEK-TQQRLLGAKIEILPDEVLGQKGNN 347 (415) Q Consensus 275 ~~~~~~~~~~~~~~~~~--~~~v~~~~~~~~l~~----lkd~~G~~l~~~~~~~~-~~~~l~G~pV~~~~~~~~~~~~~~ 347 (415) +++.++...|...+... =..+++|..+..|.+ +...++-. ...+..+ ..+++.|+.|+.++++|..+.++. T Consensus 147 ~~~a~a~~~L~~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~--~~alr~~~i~G~~~GFdi~~Sn~vp~~T~g~~ 224 (423) T protein:vir:10 147 SDVAQTASFLKDLGINSGENYAVMDPWAAQRLADAQSGLHVSEQLV--RTAWENAQISGNFGGIRALMSNGLASRTQGAF 224 (423) T ss_pred HHHHHHHHHHhhccCCcCCCEEEeCHHHHHHHhhhhhhhccccccc--hHHHHhcccceeecceEEEEecCCcccccccc Confidence 88888877776665543 367999998888753 22222110 1122233 347899999999999986544432 Q ss_pred eE---------EEechhh-------cEEEEee-cceEEEEeecccCceEEEEEEEeccEE-------------------- Q lcl|NC_012784. 348 TL---------IIGNLKD-------AIVLFDR-SQYQASWTDYMHFGECLMIAVRQDCRI-------------------- 390 (415) Q Consensus 348 ~~---------~~gd~~~-------~~~~~~~-~~~~i~~~~~~~~~~~~~~~~r~d~~v-------------------- 390 (415) .. +-|+-.. ....... ....+..-|..++. +....-++...+ T Consensus 225 ~ga~~~~~~~~vt~a~~~~~~~~~~~~~~~T~s~~g~l~~GD~~t~a-Gv~~v~~~tk~~l~~~~~~~~~~~~V~~~~~~ 303 (423) T protein:vir:10 225 GGKLTVKGTPEVNYDSVKDSYAFTATLTGATASKKGFLKVGDQLQFD-DTHWLNQQSKQTLYNGASALSFTATVMEDANA 303 (423) T ss_pred cceeeeeeeeEEEecccccccccccceeeccceeceeEEecceEeec-ceeeecccccceeecccCCcceEEEEEecccc Confidence 21 1111100 0000000 00001111110000 000001111111 Q ss_pred eccccEEEEEeecCC------CCcccccccC Q lcl|NC_012784. 391 LDYKSAIVIEYDDSE------RGEGDLGLEA 415 (415) Q Consensus 391 ~~p~a~~~~~~t~~~------~~~~~~~~~~ 415 (415) .-+.++. +++++++ .++.++++.- T Consensus 304 ~a~~~~t-v~i~p~~~~~~~~~~~~~V~a~~ 333 (423) T protein:vir:10 304 HSSGDVT-VKISGVPIFDAGYPQYNAVDRLL 333 (423) T ss_pred cccCceE-EEeccccccccCcccccceeccc Confidence 1111211 2222211 1122222211 No 190 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=96.21 E-value=0.00087 Score=37.36 Aligned_cols=274 Identities=9% Similarity=-0.051 Sum_probs=119.5 Q ss_pred hhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEc-------cCCceeEEEEeecCCcccccc--ccccccccc Q lcl|NC_012784. 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV-------TNGSGKYPVVRQSEVAALEKV--EELEENPEL 190 (415) Q Consensus 120 ~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~-------~~~~~~~~~~~~~~~~~a~~v--~Eg~~~~~~ 190 (415) +. .+. --.+|+.++.+.++.+++..++..++...-- .+.+.++++|. ...+... ..+..+. . T Consensus 1 Ma--N~l---lT~ip~iia~~al~~l~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~---~~~~~~~~~~~~~~~~-~ 71 (423) T protein:vir:17 1 MP--NNL---DSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPH---QFSSLRTPTGDISGQN-K 71 (423) T ss_pred Cc--cch---hhhhHHHHHHHHHHHHHhhcccchhhcccCCcchhhcccCCEEEEeeCC---cceeecccCcccCCcc-c Confidence 11 110 1136999999999999999888777654211 12333334322 1121111 1111111 1 Q ss_pred cccc--ceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccc Q lcl|NC_012784. 191 AVKP--FFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEV 268 (415) Q Consensus 191 ~~~~--f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~ 268 (415) ++.. -..+++.-+|+..+-.=..|+. ....++++++... .++++..+|..+++-.....+... +. .. T Consensus 72 ~~l~e~~v~l~id~~k~va~~v~d~E~~-~~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~a~~~~-------gt--~~ 140 (423) T protein:vir:17 72 NNLISGKATGRVGNYITVAVEYQQLEEA-IKLNQLEEILAPV-RQRIVTDLETELAHFMMNNGALSL-------GS--PN 140 (423) T ss_pred CccccceeEEEeeceeeeeeeecHHHHh-cChhHHHHHHHHH-HHHHHHHHHHHHHHHHhhcccccc-------cc--CC Confidence 1111 1246666666666554445554 3444687766555 688999999987754211111110 00 11 Q ss_pred cchhhHHHHHHHHHHhhhhccCC--CEEEEcHHHHHHHHHh----hccCCcccccCcccCCC-CceecceeeEEeccccc Q lcl|NC_012784. 269 KKAKSLDDIKDAINLNVKPNYEH--NVAIVSQTMFAKLDKM----KDKLGNYLIQPDVKEKT-QQRLLGAKIEILPDEVL 341 (415) Q Consensus 269 ~~~~~~~~~~~~~~~~~~~~~~~--~~~v~~~~~~~~l~~l----kd~~G~~l~~~~~~~~~-~~~l~G~pV~~~~~~~~ 341 (415) +....|++++++...|...+... =..+++|..+..|.+- ...+ ..-...+..+. .+++.|+.|+.++++|. T Consensus 141 t~~~a~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~--~~~~~alr~g~i~G~i~GFdvy~Snnip~ 218 (423) T protein:vir:17 141 TPITKWSDVAQTASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASD--QLVRTAWENAQIPTNFGGIRALMSNGLAS 218 (423) T ss_pred cccccHHHHHHHHHHHHhccCCcCCCEEEeChHHHHHHhccccceeccc--ccchHHHhhccceeeecceEEEEeCCCcc Confidence 11235888888888877666553 3678999988887642 1111 11112233333 36899999999999996 Q ss_pred cccCCce--EEE--echhhcEEEEe----ecceEEEEe-eccc------CceE-EEEEEEeccE------EeccccEEE- Q lcl|NC_012784. 342 GQKGNNT--LII--GNLKDAIVLFD----RSQYQASWT-DYMH------FGEC-LMIAVRQDCR------ILDYKSAIV- 398 (415) Q Consensus 342 ~~~~~~~--~~~--gd~~~~~~~~~----~~~~~i~~~-~~~~------~~~~-~~~~~r~d~~------v~~p~a~~~- 398 (415) .+.+... +.. +..-.+....+ ..++...+. .+.. +... .....+.... ..++.-|+. T Consensus 219 ~T~gt~~~t~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~t~aGv~~v~~~tk~v~~~~~t~~~~~~~v~ 298 (423) T protein:vir:17 219 RTQGAFGGTLTVKTQPTVTYNAVKDSYQFTVTLTGATTSVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVT 298 (423) T ss_pred ccccceeceeeecccccccccccccccceeeeeeeeeeeccCceeecceEEecceeeecccccccccccccccceEEEEE Confidence 5544321 111 10000000000 000111110 0000 0000 0000111111 111111211 Q ss_pred ------------EEeecCCC------CcccccccC Q lcl|NC_012784. 399 ------------IEYDDSER------GEGDLGLEA 415 (415) Q Consensus 399 ------------~~~t~~~~------~~~~~~~~~ 415 (415) |++.+++- .+.+++... T Consensus 299 ~~~~~~a~~~~tv~i~p~~i~~~~~~~~~~v~a~~ 333 (423) T protein:vir:17 299 ADANSDSSGDVTVTLSGVPIYDTTNPQYNSVSRQV 333 (423) T ss_pred ecccccccCceEEEecCccccccCCcccccceecc Confidence 22222111 111122110 No 191 >protein:vir:6061 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878202;genbank:gi:33438901;genbank:GeneID:1457736 Probab=96.00 E-value=0.0011 Score=36.71 Aligned_cols=303 Identities=11% Similarity=0.103 Sum_probs=139.9 Q ss_pred hhhHHHHHHHHHHhhhhhhhhcccc--cccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCccc Q lcl|NC_012784. 101 VTSQEVRDFTEYLETRNDIQGGSLK--TDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAAL 178 (415) Q Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a 178 (415) ....-...+....... +...+.. .....+.|-+.+...+...+.+.+.+++.++++++.-..|...- ...+++-+ T Consensus 1 M~~~tr~~~~~y~~~~--A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~-lg~~g~ia 77 (357) T protein:vir:60 1 MRQETRFKFNAYLSRV--AELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIG-IGVTGSIA 77 (357) T ss_pred CChHHHHHHHHHHHHH--HHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEe-cccCcccc Confidence 0011111111111111 1111111 12334556667888899999999999999999999887765432 23333333 Q ss_pred ccccc-c-ccccccccccceeeEeeeeeEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhhcccccccc- Q lcl|NC_012784. 179 EKVEE-L-EENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK--VNVLQELKLWMARTIAATRNKAIIDVITKGSTG- 253 (415) Q Consensus 179 ~~v~E-g-~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~--~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~- 253 (415) +-+.- + ......+...++.-.+..++.=--+.|+.+.|...+ .+|+..+++.+.+.++.=.-.--++|+....+. T Consensus 78 grtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td 157 (357) T protein:vir:60 78 STTDTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDLIMAGFNGVRRAETSD 157 (357) T ss_pred cccccCCCCCcccccccccCCCccEEEEeeeeccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCC Confidence 32211 1 111111223455556666666556677888777532 356776666666665433333333443322111 Q ss_pred ----cccccc-------------------------ccc---cccccccchhhHHHHH-HHHHHhhhhccCC---CEEEEc Q lcl|NC_012784. 254 ----STSSGF-------------------------EKE---GKKLEVKKAKSLDDIK-DAINLNVKPNYEH---NVAIVS 297 (415) Q Consensus 254 ----~~~~~~-------------------------~~~---~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~---~~~v~~ 297 (415) +.+-.. ... ...........+|.++ ++++.+.++.++. -++++. T Consensus 158 ~~~nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG 237 (357) T protein:vir:60 158 RSSNQMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVG 237 (357) T ss_pred hhhCcCccccchhHHHHHHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEc Confidence 000000 000 0011122344566666 4666666665554 367777 Q ss_pred HHHHHH-HHHhhccCCcccccCcccC---CCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeec Q lcl|NC_012784. 298 QTMFAK-LDKMKDKLGNYLIQPDVKE---KTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDY 373 (415) Q Consensus 298 ~~~~~~-l~~lkd~~G~~l~~~~~~~---~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~ 373 (415) ...+.. -..|-+..+.|- +-... ....+|-|+|.+..+++|..+ +++=-|++--.-+.+...+-...+. T Consensus 238 ~dLla~k~~~l~n~~~~pT--E~~Aa~~i~s~k~iGGl~a~~~PfFP~~~-----llVT~L~NLsIY~Q~gs~RR~~~d~ 310 (357) T protein:vir:60 238 RQLLADKYFPIVNREQDNS--EMLAADVIISQKRIGNLPAVRVPYFPADA-----MLITKLENLSIYYMDDSHRRVIEEN 310 (357) T ss_pred hhhhhHHhhhHhhcCCChH--HHHHHHHHHHhhhhcCcceEEccccCCCc-----eEEeeccccEEEEecCcEEEEEEec Confidence 666443 223333333331 11111 113578999999999999764 5665555432223333333322222 Q ss_pred ccCceEEEEEEE-eccEEeccccEEEEE------eecCCCCcccccc Q lcl|NC_012784. 374 MHFGECLMIAVR-QDCRILDYKSAIVIE------YDDSERGEGDLGL 413 (415) Q Consensus 374 ~~~~~~~~~~~r-~d~~v~~p~a~~~~~------~t~~~~~~~~~~~ 413 (415) ......--.+.| -+..|-++.+++.++ ...++.++++..+ T Consensus 311 p~r~riE~y~s~Ne~YvVEd~~~~a~iE~i~~~~~~~pa~~~~~~~a 357 (357) T protein:vir:60 311 PKLDRVENYESMNIDYVVEDYAAGCLVEKIKVGDFSTPAKATAEPGA 357 (357) T ss_pred cccccccchhhhcceeeeeccccEEEeeeeeeccCcccccCCCCCCC Confidence 111110000111 122333344444433 3445555556555 No 192 >protein:vir:98566 Length: 355 # NCBI annotation: gp5 # Family: family:all:201 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958060;genbank:gi:41057357;genbank:GeneID:2744237 Probab=95.88 E-value=0.0013 Score=36.39 Aligned_cols=298 Identities=11% Similarity=0.081 Sum_probs=136.0 Q ss_pred hhhHHHHHHHHHHhhhhhhhhcccc--cccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCccc Q lcl|NC_012784. 101 VTSQEVRDFTEYLETRNDIQGGSLK--TDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAAL 178 (415) Q Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a 178 (415) ....-...+...... .+...+.. .......|-+.+...+...+.+.+.+++.++++++.-..|...- ...+++-+ T Consensus 1 M~~~tr~~~~~y~~~--~A~~ngv~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~-lgv~g~ia 77 (355) T protein:vir:98 1 MRPETRFKFNAYLTR--VAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIG-VGVTGTIA 77 (355) T ss_pred CChHHHHHHHHHHHH--HHHHhCCChhHccceeecCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEee-eccCcccc Confidence 000001111111111 11111111 12334556666888899999999999999999999887766433 23333333 Q ss_pred cccccc--ccccccccccceeeEeeeeeEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Q lcl|NC_012784. 179 EKVEEL--EENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK--VNVLQELKLWMARTIAATRNKAIIDVITKGSTGS 254 (415) Q Consensus 179 ~~v~Eg--~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~--~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~ 254 (415) .-+.-+ ......+...++.-.+..++.=--..|+.+.|+..+ .+|+..+++.+.++++.=.-.--++|+....+.. T Consensus 78 grtdT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s~A~~Td 157 (355) T protein:vir:98 78 STTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSD 157 (355) T ss_pred ccccCCCCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeeccCC Confidence 332111 111112223455556666666556677777777532 3677777777777665433333344433211100 Q ss_pred ---cccc--c--------------------c-c-------ccccccccchhhHHHHH-HHHHHhhhhccCCC---EEEEc Q lcl|NC_012784. 255 ---TSSG--F--------------------E-K-------EGKKLEVKKAKSLDDIK-DAINLNVKPNYEHN---VAIVS 297 (415) Q Consensus 255 ---~~~~--~--------------------~-~-------~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~---~~v~~ 297 (415) .+.+ . . . ............+|.++ +++..+.++.++.. ++++. T Consensus 158 ~~~nPllqDVNkGWlQ~~Re~ap~~v~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~D~~~~lI~~~~~~d~dLVvivG 237 (355) T protein:vir:98 158 RTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVG 237 (355) T ss_pred hhhCcCccccchhHHHHHHhcchhhhhhhhcccCccccccceeeCCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEc Confidence 0000 0 0 0 00011122344566666 45666766665543 77777 Q ss_pred HHHHHH-HHHhhccCCccc---ccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeec Q lcl|NC_012784. 298 QTMFAK-LDKMKDKLGNYL---IQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDY 373 (415) Q Consensus 298 ~~~~~~-l~~lkd~~G~~l---~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~ 373 (415) ...+.. -..|-.+...|- -.... ....+|-|+|.+..+++|... +++=-|++--.-+.....+-...+. T Consensus 238 ~dLla~k~~~l~n~~~~ptE~~Aa~~i--~s~k~iGGlpa~~~PffP~~~-----~lVT~L~NLsIY~Q~gs~RR~~~d~ 310 (355) T protein:vir:98 238 RKLLADKYFPLVNKQQENSESLAADII--ISQKRIGNLPAVRVPYFPANA-----VLVTTLENLSIYFMDESHRRSIDEN 310 (355) T ss_pred hhhhHHHhhhHhhccCCcHHHHHHHHH--HHhhhhCCceeEEccccCCCc-----eEEeeccccEEEEecCcEEEEEEec Confidence 765442 223333333331 00001 123589999999999999764 5665555532223333333322221 Q ss_pred c------cCce-E-EEEEEEeccEEeccccEEEEEee--cCCCCcc Q lcl|NC_012784. 374 M------HFGE-C-LMIAVRQDCRILDYKSAIVIEYD--DSERGEG 409 (415) Q Consensus 374 ~------~~~~-~-~~~~~r~d~~v~~p~a~~~~~~t--~~~~~~~ 409 (415) . .+.. + ..+...++...... .+...+.. +++.+|+ T Consensus 311 p~r~rie~y~s~Ne~YvVEd~~~~a~ie-nI~~~~~~~~~~~~~~a 355 (355) T protein:vir:98 311 PKKDRVENYESMNIDYVVEVYAAGCLLE-NITLGDFTAPAAPESGA 355 (355) T ss_pred cccccccchhhhcceeeeeccccEEEee-ceeeeCCCCCcccccCC Confidence 1 1111 1 12222222222211 23332222 2233333 No 193 >protein:vir:80986 Length: 528 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469506;genbank:gi:157311463;genbank:GeneID:5602119 Probab=95.85 E-value=0.0014 Score=36.31 Aligned_cols=358 Identities=14% Similarity=0.106 Sum_probs=140.1 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhhhhhccccccccc Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQE-KQEELDKLKEKDGTSENNQQSVEVNE 79 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (415) ||..++|+++..-+.+. +...++... -.+++-++|=+ +++.++. + T Consensus 1 ~~~~~~l~~kw~p~l~~--------------~~~~~i~~~--~~~~~~a~llenq~~~~~~--------~---------- 46 (528) T protein:vir:80 1 MKTTKELMEKWSPLLEN--------------EKLPEIATA--SKQKLVAKILESQEADFAV--------D---------- 46 (528) T ss_pred CcchHHHHHhhhHhhcC--------------Cccchhcch--hhhhhhhhhhhhhhHHhhc--------c---------- Confidence 99999999999887542 111111000 00011111110 1111000 0 Q ss_pred hhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHH---hhhhhhhhcce Q lcl|NC_012784. 80 ARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLK---EVEFNLDKYVT 156 (415) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~---~~~~~l~~~~~ 156 (415) ...+.....+.+.............. -....... +.+.+.+ ..+.+.++.++ .+...-.+++. T Consensus 47 -~~~~~~~~~~~~~~~l~ea~~~~~~~---------~~~~~i~e-s~~t~~v---~~~~P~Li~lvRra~p~LIa~DIwG 112 (528) T protein:vir:80 47 -PIYKDEKVVEAFGGFIAEAEVAGDHG---------YDASQIAA-GQTTGAI---TNVGPAVIGMVRRAIPNLIAFDICG 112 (528) T ss_pred -ccccchHHHHhhhhhccccccccccC---------Cccccccc-ccccccc---ccCCchhhhHHHHHHhhhhhhhhhe Confidence 00011111111111111111000000 00000001 1111111 12222333333 33444556888 Q ss_pred eEEccCCceeEEEEeec--CCc---------------------------------------------------------- Q lcl|NC_012784. 157 VKRVTNGSGKYPVVRQS--EVA---------------------------------------------------------- 176 (415) Q Consensus 157 ~~~~~~~~~~~~~~~~~--~~~---------------------------------------------------------- 176 (415) ++||+++++-+=-++.. ..+ T Consensus 113 VQPMTgPTGLIFAMRsrY~~~~~~~~~~ea~~~~~~~da~fS~~~t~~~a~~~ea~t~fs~~~~~~~~~~G~~~~~t~~~ 192 (528) T protein:vir:80 113 VQPMSTPTSQIFAIRSVYGPNPLASQAKEAFHPMYAPDAFHSSLAAKGAAVGSPTGTPFAKLAIGTQIEAGDIVHHTFAE 192 (528) T ss_pred eccCCchhhhheeeeeeecCCccccccccccccccccccccccccccccccccccccccccccccccccccceecccccc Confidence 88888875532111100 000 Q ss_pred -----------------------------------cccccccccc---------ccccccccceeeEeeeeeEEE----- Q lcl|NC_012784. 177 -----------------------------------ALEKVEELEE---------NPELAVKPFFQLAYDINTHRG----- 207 (415) Q Consensus 177 -----------------------------------~a~~v~Eg~~---------~~~~~~~~f~~v~~~~~k~a~----- 207 (415) ....++.|-. ....+...|.+..+...|..+ T Consensus 193 tg~~~~~~~~~~~~~~~~~gt~~~~~~~~~~~~~~~~~~~~~Gm~Ta~AE~le~lg~ss~~~f~EMaFsIEKvTVtAKSR 272 (528) T protein:vir:80 193 TGIAYLQNVTAEQVTPTKAGSESEDEVVMKLMEEGKLAEIAFGMATSIAEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSR 272 (528) T ss_pred ccccccccccccccCccccCCcccccccccccccccccccccccchhhhhhhcccCCCccccccceeeEEEEEEEeeecc Confidence 0000111100 000122335566666665554 Q ss_pred --eehhhHHHHhcc----hHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccc---cccccccccccc-------h Q lcl|NC_012784. 208 --YFRISREAIEDA----KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSG---FEKEGKKLEVKK-------A 271 (415) Q Consensus 208 --~~~iS~e~l~ds----~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~---~~~~~~~~~~~~-------~ 271 (415) ...+|-||.+|- .+|.++.|.+-|+..|...+++.||.-......-+.... .....+...... - T Consensus 273 aLKAEYTiELAQDLKAIHGLDAEtELaNILStEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~~dl~~~~d~~g~r 352 (528) T protein:vir:80 273 QLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGAR 352 (528) T ss_pred ceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhheeeeeeeeeeeccccccceeeccccccccccc Confidence 447999999983 468899999999999999999999754322221111000 000001111100 1 Q ss_pred hhHHHHHHH-------HHHhhh--hccCCCEEEEcHHHHHHHHHh-----hccCC-cccccCcccCC-CCceec-ceeeE Q lcl|NC_012784. 272 KSLDDIKDA-------INLNVK--PNYEHNVAIVSQTMFAKLDKM-----KDKLG-NYLIQPDVKEK-TQQRLL-GAKIE 334 (415) Q Consensus 272 ~~~~~~~~~-------~~~~~~--~~~~~~~~v~~~~~~~~l~~l-----kd~~G-~~l~~~~~~~~-~~~~l~-G~pV~ 334 (415) ...+-++.+ .+.+.. .+...+.+++++.....|... ....| ...+..+.+.. ..++|. |++|. T Consensus 353 ~~~e~~k~L~~~i~~~an~I~~~T~~~~gn~vi~S~~Va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy 432 (528) T protein:vir:80 353 WAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAKGLNTDTTKAVFAGVLAGKYKVF 432 (528) T ss_pred hhHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhccccccccccccccccccCCCCceEEEEecCceEEE Confidence 112222222 222222 223457899999998888652 11111 22222222221 123444 57787 Q ss_pred EeccccccccCCceEEEech---h--hcEEEEeec-ceEEEEeecccCceEEEEEEEeccEEeccccEEEEEeecC---- Q lcl|NC_012784. 335 ILPDEVLGQKGNNTLIIGNL---K--DAIVLFDRS-QYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIEYDDS---- 404 (415) Q Consensus 335 ~~~~~~~~~~~~~~~~~gd~---~--~~~~~~~~~-~~~i~~~~~~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~---- 404 (415) +..+++. ..+++|.= . ........- ..-....|...|+-.+-...|+++.+ +| |.. ..+.. T Consensus 433 ~D~y~~~-----dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP--~~~-~~~~~~~~r 503 (528) T protein:vir:80 433 IDQYARQ-----DYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVLGFKTRYGIGI-NP--FAD-SKSQAPSAR 503 (528) T ss_pred ecCCCCc-----ceEEEEEeCCcccccceeecccccceeeEeeCCccccceeeeeeeeceee-cC--ccc-ccCCccccc Confidence 7776542 23444320 0 000000000 11112344555655566666776543 34 111 11110 Q ss_pred CCCcccccccC Q lcl|NC_012784. 405 ERGEGDLGLEA 415 (415) Q Consensus 405 ~~~~~~~~~~~ 415 (415) ...++|..-.| T Consensus 504 ~~~g~~~~~~a 514 (528) T protein:vir:80 504 ITSGMLSKDSV 514 (528) T ss_pred ccccchhhhhc Confidence 11233333333 No 194 >protein:vir:1829 Length: 355 # NCBI annotation: major capsid protein # Family: family:all:201 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052253;genbank:gi:9634060;genbank:GeneID:1262428 Probab=95.81 E-value=0.0014 Score=36.20 Aligned_cols=300 Identities=11% Similarity=0.085 Sum_probs=136.4 Q ss_pred hhhHHHHHHHHHHhhhhhhhhccc--ccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCccc Q lcl|NC_012784. 101 VTSQEVRDFTEYLETRNDIQGGSL--KTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAAL 178 (415) Q Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a 178 (415) ....-...+....... +...+. ......+.|-+.+...+...+.+.+.+++.++++++.-..|...- ...+++-+ T Consensus 1 M~~~tr~~~~~y~~~~--A~~ngv~~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~-lgv~g~ia 77 (355) T protein:vir:18 1 MRQETRFKFNAYLTQL--AKLNGISVDDVSKKFTVEPSVTQTLMNTVQASSAFLQMINILPVAEMKGEKIG-VGVTGTIA 77 (355) T ss_pred CChHHHHHHHHHHHHH--HHHhCCChhHccceeccCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEEe-eccCccee Confidence 0000011111111111 111111 122334556667888899999999999999999999887765433 23333333 Q ss_pred cccccc--ccccccccccceeeEeeeeeEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Q lcl|NC_012784. 179 EKVEEL--EENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK--VNVLQELKLWMARTIAATRNKAIIDVITKGSTGS 254 (415) Q Consensus 179 ~~v~Eg--~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~--~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~ 254 (415) .-+.-. ......+...++.-.+..++.=--..|+.+.|+..+ .+|+..+++.+.++++.=.-.--++|+....+.. T Consensus 78 grtdT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s~A~~Td 157 (355) T protein:vir:18 78 STTDTSGDKERQTADFTALESNKYECNQINFDFHLTYKRLDLWARFQDFQRRIRDAIVQRQALDFIMAGFNGTTRADTSD 157 (355) T ss_pred eccccCCCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeeccCC Confidence 332111 111112223455566666666666677878777532 3677777777777665433333344433211100 Q ss_pred ---ccccc-cc-----------------------------ccccccccchhhHHHHH-HHHHHhhhhccCCC---EEEEc Q lcl|NC_012784. 255 ---TSSGF-EK-----------------------------EGKKLEVKKAKSLDDIK-DAINLNVKPNYEHN---VAIVS 297 (415) Q Consensus 255 ---~~~~~-~~-----------------------------~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~---~~v~~ 297 (415) .+.+. .+ ............+|.++ ++++.+.++.++.. ++++. T Consensus 158 ~~~nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~d~~~~lI~~~~~~d~dLVvivG 237 (355) T protein:vir:18 158 RVKNPMLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENLDALVMDGTNTLIDEIYQDDPKLVAIVG 237 (355) T ss_pred hhhCcCccccchhHHHHHHhcchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEc Confidence 00000 00 00001122334566666 45666666665543 77777 Q ss_pred HHHHHH-HHHhhccCCcccccCcccCC---CCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeec Q lcl|NC_012784. 298 QTMFAK-LDKMKDKLGNYLIQPDVKEK---TQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDY 373 (415) Q Consensus 298 ~~~~~~-l~~lkd~~G~~l~~~~~~~~---~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~ 373 (415) ...+.. -..|-...+.|- +-.... ...+|-|+|.+..+++|... +++=-|++--.-+.....+-...+. T Consensus 238 ~dLla~k~~~l~n~~~~pt--E~~Aa~~i~s~k~iGGlpa~~~PffP~~~-----~lVT~L~NLsIY~Q~gs~RR~~~d~ 310 (355) T protein:vir:18 238 RKLLADKYFPLVNKQQENT--ESLAADIIISQKRIGNLPAVRVPYFPANA-----VFVTTLENLSIYFMDESHRRSIDEN 310 (355) T ss_pred hhhhHHHHhHHhhccCChH--HHHHHHHHHHHHhhCCceeEEccccCCCc-----eEEeeccccEEEEecCcEEEEEEec Confidence 765442 223333333332 111111 13589999999999999764 5665555532223333333322222 Q ss_pred c------cCce-E-EEEEEEeccEEeccccEEEEEeecCCCCcccc Q lcl|NC_012784. 374 M------HFGE-C-LMIAVRQDCRILDYKSAIVIEYDDSERGEGDL 411 (415) Q Consensus 374 ~------~~~~-~-~~~~~r~d~~v~~p~a~~~~~~t~~~~~~~~~ 411 (415) . .+.. + ..+...++...... .+...+..++++++|-= T Consensus 311 p~r~rie~y~s~Ne~YvVEd~~~~a~ie-ni~~~~~~~~~~~~~g~ 355 (355) T protein:vir:18 311 PKKDRVENYESMNIDYVVEAYAAGCLLE-NITLGDFTAPAAPEGGE 355 (355) T ss_pred cccccccchhhhcceeeeeccccEEEEe-eeeecCCCCcccccCCC Confidence 1 1111 1 12222222222211 23333322222221111 No 195 >protein:vir:5694 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839853;genbank:gi:30065708;genbank:GeneID:1260602 Probab=95.68 E-value=0.0016 Score=35.86 Aligned_cols=303 Identities=12% Similarity=0.102 Sum_probs=138.1 Q ss_pred hhhHHHHHHHHHHhhhhhhhhcccc--cccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCccc Q lcl|NC_012784. 101 VTSQEVRDFTEYLETRNDIQGGSLK--TDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAAL 178 (415) Q Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a 178 (415) ....-...+....... +...+.. .......|-+.+...+...+.+.+.+++.++++++.-..|...- ...+++-+ T Consensus 1 M~~~tr~~~~~y~~~~--A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~-lg~~g~ia 77 (357) T protein:vir:56 1 MRQETRFKFNAYLSRV--AELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIG-IGVTGSIA 77 (357) T ss_pred CChHHHHHHHHHHHHH--HHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEe-cccCcccc Confidence 0011111111111111 1111111 12334556667888899999999999999999999887765432 23333333 Q ss_pred ccccc-c-ccccccccccceeeEeeeeeEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhhcccccccc- Q lcl|NC_012784. 179 EKVEE-L-EENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK--VNVLQELKLWMARTIAATRNKAIIDVITKGSTG- 253 (415) Q Consensus 179 ~~v~E-g-~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~--~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~- 253 (415) +-+.- + ......+...++.-.+..++.=--+.|+.+.|...+ .+|+..+++.+.+.++.=.-.--++|+....+. T Consensus 78 grtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td 157 (357) T protein:vir:56 78 STTDTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDFIMAGFNGVKRAETSD 157 (357) T ss_pred ccccCCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCC Confidence 32211 1 111111223455556666666556677878777532 356666666666655433333333443322111 Q ss_pred ----cccccc-------------------------ccc---cccccccchhhHHHHH-HHHHHhhhhccCC---CEEEEc Q lcl|NC_012784. 254 ----STSSGF-------------------------EKE---GKKLEVKKAKSLDDIK-DAINLNVKPNYEH---NVAIVS 297 (415) Q Consensus 254 ----~~~~~~-------------------------~~~---~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~---~~~v~~ 297 (415) +.+-.. ... ...........+|.++ ++++.+.++.++. -++++. T Consensus 158 ~~~nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG 237 (357) T protein:vir:56 158 RSSNPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVG 237 (357) T ss_pred hhhCcCccccchhHHHHHHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEc Confidence 000000 000 0011122344566666 4666666665554 367777 Q ss_pred HHHHHH-HHHhhccCCcccccCcccC---CCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeec Q lcl|NC_012784. 298 QTMFAK-LDKMKDKLGNYLIQPDVKE---KTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDY 373 (415) Q Consensus 298 ~~~~~~-l~~lkd~~G~~l~~~~~~~---~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~ 373 (415) ...+.. -..|-++.+.|- +-... ....+|-|+|.+..+++|..+ +++=-|++--.-+.+...+-...+. T Consensus 238 ~dLla~k~~~l~n~~~~pT--E~~Aa~~i~s~k~iGGl~a~~~PfFP~~~-----llVT~L~NLsIY~Q~gs~RR~~~d~ 310 (357) T protein:vir:56 238 RQLLADKYFPIVNKEQDNS--EMLAADVIISQKRIGNLPAVRVPYFPADA-----MLITKLENLSIYYMDDSHRRVIEEN 310 (357) T ss_pred hhhhhhhhhhHhhccCChH--HHHHHHHHHHhhhhCCceeEEccccCCCc-----eEEeeccccEEEEecCcEEEEEEec Confidence 666443 333433333331 11111 113578999999999999764 5665555432223333333322222 Q ss_pred ccCceEEEEEEE-eccEEeccccEEEEE------eecCCCCcccccc Q lcl|NC_012784. 374 MHFGECLMIAVR-QDCRILDYKSAIVIE------YDDSERGEGDLGL 413 (415) Q Consensus 374 ~~~~~~~~~~~r-~d~~v~~p~a~~~~~------~t~~~~~~~~~~~ 413 (415) ......--.+.| -+..|-++.+++.++ .++++.+..+..+ T Consensus 311 p~r~riE~y~s~Ne~YvVEd~~~~a~iE~i~i~~~~~~~~~~~~~~a 357 (357) T protein:vir:56 311 PKLDRVENYESMNIDYVVEDYAAGCLVEKIKVGDFSTPAKATEEPGA 357 (357) T ss_pred cccccccchhhhcceeeeeccccEEEeeeeeeccCCCCcccCCCCCC Confidence 111110000111 122233333333333 3344444444443 No 196 >protein:vir:79171 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111033;genbank:gi:134288740;genbank:GeneID:4960690 Probab=95.63 E-value=0.0017 Score=35.75 Aligned_cols=294 Identities=10% Similarity=0.043 Sum_probs=137.8 Q ss_pred hhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCccccc Q lcl|NC_012784. 101 VTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEK 180 (415) Q Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~ 180 (415) ........+..... ..+...+.........|-+.+...+...+.+.+.+++.++++++.-..|...-. ..+++-++- T Consensus 1 M~~~tr~~~~~y~~--~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~l-g~~g~iagr 77 (337) T protein:vir:79 1 MRKETRQAYEKYAA--QIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGL-SVSGPIASR 77 (337) T ss_pred CChHHHHHHHHHHH--HHHHhcChhhhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEee-ccCcceeee Confidence 00000001111100 001111112233344555678888889999999999999999998777654332 233333322 Q ss_pred ccccc-cccccccccceeeEeeeeeEEEeehhhHHHHhcc--hHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc---- Q lcl|NC_012784. 181 VEELE-ENPELAVKPFFQLAYDINTHRGYFRISREAIEDA--KVNVLQELKLWMARTIAATRNKAIIDVITKGSTG---- 253 (415) Q Consensus 181 v~Eg~-~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds--~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~---- 253 (415) ..-+. .....+...++.-.+..++.=--..|+.+.|+.. ..+|+..+++.+.++++.=.-.--++|+....+. T Consensus 78 t~t~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A~~Td~~~ 157 (337) T protein:vir:79 78 TDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDAWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQA 157 (337) T ss_pred ecCCCCccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhh Confidence 22111 1111222345555666666655667788877753 2367777777777766543333334453321110 Q ss_pred -ccc--------------------cccc-c--ccccccccchhhHHHHH-HHHHHhhhhccCC---CEEEEcHHHHHH-H Q lcl|NC_012784. 254 -STS--------------------SGFE-K--EGKKLEVKKAKSLDDIK-DAINLNVKPNYEH---NVAIVSQTMFAK-L 304 (415) Q Consensus 254 -~~~--------------------~~~~-~--~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~---~~~v~~~~~~~~-l 304 (415) +.+ .... . ............+|.++ ++++.+.++.++. -+.++....+.. - T Consensus 158 nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~i~iG~~gdy~nLDalV~D~~~~lI~~~~~~d~~LVvivG~dLladk~ 237 (337) T protein:vir:79 158 NPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVAICGRELLHDKY 237 (337) T ss_pred CcCccccchhHHHHHHhcchhhhhccccccCcceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHh Confidence 000 0000 0 01111222344566655 5566666666554 367777666542 2 Q ss_pred HHhhccCCcccccCcccC---CCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecccCceEEE Q lcl|NC_012784. 305 DKMKDKLGNYLIQPDVKE---KTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLM 381 (415) Q Consensus 305 ~~lkd~~G~~l~~~~~~~---~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~ 381 (415) ..|-...+.|- +-... ....+|-|+|.+..+++|..+ +++=-|++--.-+.....+-...+.......-- T Consensus 238 ~~l~n~~~~pt--E~~Aa~~i~s~k~iGGlpa~~~PffP~~~-----~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~ 310 (337) T protein:vir:79 238 FPIVNATQAPT--ERLAADLIVSQKRIGNLPAVRVPFFPKRA-----LMVTKLSNLSIYYQEGARRRTLKEVPERDRIEN 310 (337) T ss_pred hHHhccCCCcH--HHHHHHHHHHhhhhCCceeEEccccCCCc-----eEEeechhcEEEEecCcEEEEEEEccccccccc Confidence 22322322331 00111 112579999999999999764 566555553333333333333332221111111 Q ss_pred EEEE-eccEEeccccEEEEE---eecC Q lcl|NC_012784. 382 IAVR-QDCRILDYKSAIVIE---YDDS 404 (415) Q Consensus 382 ~~~r-~d~~v~~p~a~~~~~---~t~~ 404 (415) .+.| -+..|-++.+++.++ +..+ T Consensus 311 y~s~Ne~YvVEd~~~~a~ienI~~~~a 337 (337) T protein:vir:79 311 YESSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred hhhccceeeeeccccEEEEeceeecCC Confidence 1112 233445555555554 2222 No 197 >protein:vir:104011 Length: 337 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293748;genbank:gi:72537718;genbank:GeneID:3608142 Probab=95.52 E-value=0.0019 Score=35.49 Aligned_cols=294 Identities=9% Similarity=0.037 Sum_probs=138.2 Q ss_pred hhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCccccc Q lcl|NC_012784. 101 VTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEK 180 (415) Q Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~ 180 (415) ....-...+..... ..+...+.........|-+.+...+...+.+.+.+++.++++++.-..|...-. ..+++-++- T Consensus 1 M~~~tr~~~~~y~~--~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~l-g~~g~iagr 77 (337) T protein:vir:10 1 MRKETRQAYEKYAA--QIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGL-SVSGPIASR 77 (337) T ss_pred CChHHHHHHHHHHH--HHHHhcChhhhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEee-ccCcceeee Confidence 00000011111100 011111122333345565678888889999999999999999998777654332 233333322 Q ss_pred ccccc-cccccccccceeeEeeeeeEEEeehhhHHHHhcc--hHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc---- Q lcl|NC_012784. 181 VEELE-ENPELAVKPFFQLAYDINTHRGYFRISREAIEDA--KVNVLQELKLWMARTIAATRNKAIIDVITKGSTG---- 253 (415) Q Consensus 181 v~Eg~-~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds--~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~---- 253 (415) ..-+. .....+...++.-.+..++.=--..|+.+.|+.. ..+|+..+++.+.++++.=.-.--++|+....+. T Consensus 78 t~t~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A~~Td~~~ 157 (337) T protein:vir:10 78 TDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQA 157 (337) T ss_pred ecCCCCccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhh Confidence 22111 1111222345555666666655667788877753 2367777777777766543333334453322110 Q ss_pred -ccc--------------------cccc-c--ccccccccchhhHHHHH-HHHHHhhhhccCC---CEEEEcHHHHHH-H Q lcl|NC_012784. 254 -STS--------------------SGFE-K--EGKKLEVKKAKSLDDIK-DAINLNVKPNYEH---NVAIVSQTMFAK-L 304 (415) Q Consensus 254 -~~~--------------------~~~~-~--~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~---~~~v~~~~~~~~-l 304 (415) +.+ .... . ............+|.++ ++++.+.++.++. -+.++....+.. - T Consensus 158 nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~i~iG~~gdy~nLDalV~D~~~~lI~~~~~~d~~LVvivG~dLladk~ 237 (337) T protein:vir:10 158 NPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKY 237 (337) T ss_pred CcCccccchhHHHHHHhcchhhhhccccccCcceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHh Confidence 000 0000 0 01111222344566655 5566666666554 367777666542 2 Q ss_pred HHhhccCCcccccCcccC---CCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecccCceEEE Q lcl|NC_012784. 305 DKMKDKLGNYLIQPDVKE---KTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLM 381 (415) Q Consensus 305 ~~lkd~~G~~l~~~~~~~---~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~ 381 (415) ..|-...+.|- +-... ....+|-|+|.+..+++|... +++=-|++--.-+.....+-...+.......-- T Consensus 238 ~~l~n~~~~pt--E~~Aa~~i~s~k~iGGlpa~~~PffP~~~-----~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~ 310 (337) T protein:vir:10 238 FPIVNATQAPT--ERLAADLIVSQKRIGNLPAVRVPFFPKRA-----LMVTKLSNLSIYYQEGARRRTLKEVPERDRIEN 310 (337) T ss_pred hHHhccCCCcH--HHHHHHHHHHhhhhCCceeEEccccCCCc-----eEEeechhcEEEEecCcEEEEEEEccccccccc Confidence 22322322331 00111 112579999999999999764 566555553333333333333332221111111 Q ss_pred EEEE-eccEEeccccEEEEE---eecC Q lcl|NC_012784. 382 IAVR-QDCRILDYKSAIVIE---YDDS 404 (415) Q Consensus 382 ~~~r-~d~~v~~p~a~~~~~---~t~~ 404 (415) .+.| -+..|-++.+++.++ +..+ T Consensus 311 y~s~Ne~YvVEd~~~~a~ienI~~~~a 337 (337) T protein:vir:10 311 YESSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred hhhccceeeeeccccEEEEeceeecCC Confidence 1112 233445555666554 2222 No 198 >protein:vir:78777 Length: 358 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285647;genbank:gi:148727153;genbank:GeneID:5220125 Probab=95.34 E-value=0.0023 Score=35.10 Aligned_cols=308 Identities=11% Similarity=0.049 Sum_probs=143.5 Q ss_pred HHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCc Q lcl|NC_012784. 97 QNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVA 176 (415) Q Consensus 97 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 176 (415) +...........+..................+..+.|.+.+...+...+.+.+.+++.++++++.-..|.. +....+++ T Consensus 1 m~~~M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~Fsv~p~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~-v~lg~~g~ 79 (358) T protein:vir:78 1 MSQTLTVQAEQRLNKYCDALAKAYGIDISKLDKQFSVTGPVETTLRSALLASVEFLGLITCLDVDQIKGQV-VQVGVGQL 79 (358) T ss_pred CcccccHHHHHHHHHHHHHHHHHhCCChhHccceeeeChHHHHHHHHHHHHHHHHhhcCcccccccceeeE-EeecCCcc Confidence 11111111111111111111111111112223456777788888999999999999999999998877764 33334444 Q ss_pred ccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcch-----HHHHHHHHHHHHHHHHHHHHHHHhhcccccc Q lcl|NC_012784. 177 ALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK-----VNVLQELKLWMARTIAATRNKAIIDVITKGS 251 (415) Q Consensus 177 ~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~-----~~l~~~l~~~la~~~~~~~d~~il~g~g~~~ 251 (415) -++-...+. | .+...++...+..++.=--..|+.+.|...+ .+|+..+++.+.+.++.=.-.--++|+.... T Consensus 80 iagrt~tr~--~-~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~f~~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~ 156 (358) T protein:vir:78 80 YTGRKKGGR--F-KGKVGVDGNTYELTETDSCASLDWATLCTWANAGSEGEFIKLVGEFVNKAFALDMLRVGWNGVSAAD 156 (358) T ss_pred cceecCCCc--c-ccccccCCCccEEEEeceeeeccHHHHHHHHhCCChhHHHHHHHHHHHHHHhhccceecccceeecc Confidence 444333222 1 2334455666666666666778888887654 2577777777777665433333344433222 Q ss_pred cc-----cc--------------------ccccc-cccccc---cccchhhHHHHHH-HHHHhhhhccCCC---EEEEcH Q lcl|NC_012784. 252 TG-----ST--------------------SSGFE-KEGKKL---EVKKAKSLDDIKD-AINLNVKPNYEHN---VAIVSQ 298 (415) Q Consensus 252 ~~-----~~--------------------~~~~~-~~~~~~---~~~~~~~~~~~~~-~~~~~~~~~~~~~---~~v~~~ 298 (415) +. +. ..... ...... .......+|.++- ++..+.++.++.. ++++.. T Consensus 157 ~Td~~~nPllqDVN~GWlQ~~Re~a~~~v~~~~~~~~~i~ig~g~~Gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~ 236 (358) T protein:vir:78 157 DTDPTANPLGQDVNKGWHQLAREWKGGSQIIKAAAGEKIYFDPDGKGEYKTLDEMASDLINTTIDPLFQQDPRLVVLVGT 236 (358) T ss_pred CCChhhCcCccccchHHHHHHHhhchhhhhccccccCceeecCCCCCccccHHHHHHHHHhccCChHHhcCCCEEEEEch Confidence 11 00 00000 000001 1123445677764 5667777666543 677777 Q ss_pred HHHHH-HHHhhccCCcccccCcc-cCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecccC Q lcl|NC_012784. 299 TMFAK-LDKMKDKLGNYLIQPDV-KEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHF 376 (415) Q Consensus 299 ~~~~~-l~~lkd~~G~~l~~~~~-~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~ 376 (415) ..+.. -..|-...+.|- +.. ......+|-|+|.+..+++|..+ +++=-|++--.-+.+...+-...+.... T Consensus 237 dLla~k~~~l~n~~~~pT--E~~Aa~~i~k~iGGlpa~~~PfFP~~~-----ilVT~L~NLsIY~Q~gs~RR~~~d~p~r 309 (358) T protein:vir:78 237 DLVAAAQAKLYSEATKPS--EQIAAQQLAKSIAGRKAYIPPFFPGKR-----MVVTTLDNLHCYTQRGTRKRKADDNQDS 309 (358) T ss_pred hhhhHHhhhHhhcCCCcH--HHHHHHHHHHHhCCCeEEEccccCCCc-----eEEeeccccEEEEecCcEEEEEEecccc Confidence 66543 223333333331 000 01112578999999999999754 5665555532223333333333222111 Q ss_pred ceEEEEEEE-eccEEeccccEEEEEee-----cCCC-CcccccccC Q lcl|NC_012784. 377 GECLMIAVR-QDCRILDYKSAIVIEYD-----DSER-GEGDLGLEA 415 (415) Q Consensus 377 ~~~~~~~~r-~d~~v~~p~a~~~~~~t-----~~~~-~~~~~~~~~ 415 (415) ...--.+.| -+..|-++.+++.++-. +.++ ..+.=...+ T Consensus 310 ~riE~y~s~Ne~YvVEd~~~~a~iE~i~v~~~~~pa~~~~~~~~~~ 355 (358) T protein:vir:78 310 KSFDNQYWRMEGYALGEHKAYGGFEEADIEIGADPAVLAVEAAAQA 355 (358) T ss_pred ccccchhhhcceeeeeccccEEEEeeeeeeeCCCCCccccCCcccc Confidence 111001111 12234444555444422 1111 111111111 No 199 >protein:vir:79157 Length: 339 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165257;genbank:gi:145708082;genbank:GeneID:5247168 Probab=95.08 E-value=0.0028 Score=34.57 Aligned_cols=295 Identities=11% Similarity=0.078 Sum_probs=138.2 Q ss_pred hhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCccccc Q lcl|NC_012784. 101 VTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEK 180 (415) Q Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~ 180 (415) ....-...+...... .+...+....+..+.|-+.+...+...+.+.+.+++.++++++.-..|...- ...+++-++- T Consensus 1 M~~~tr~~~~~y~~~--~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~-lg~~g~iagr 77 (339) T protein:vir:79 1 MRNDTRRLFAAYKAA--IAKLNGVERVDEKFSVAPSVQQKLETKVQESSDFLKSINFYGVPEQEGEKIG-LGVSGPVAST 77 (339) T ss_pred CChHHHHHHHHHHHH--HHHHhCcccccceeeecHHHHHHHHHHHHHHHHHhccCcccccccceeeEEe-eccCcceeec Confidence 000001111111111 1111122334455667777888899999999999999999999877765432 2233333322 Q ss_pred ccc-cccccccccccceeeEeeeeeEEEeehhhHHHHhcc--hHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc--- Q lcl|NC_012784. 181 VEE-LEENPELAVKPFFQLAYDINTHRGYFRISREAIEDA--KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGS--- 254 (415) Q Consensus 181 v~E-g~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds--~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~--- 254 (415) ..- +......+...++.-.+..++.=--..|+.+.|... ..+|+..+++.+.+.++.=.-.--++|+....+.. T Consensus 78 tdt~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~ 157 (339) T protein:vir:79 78 TDTTQQDRETSDISTMDGRRYRCEQTNSDTHITYQKLDAWAKFADFQTRIRDAIIKRQALDRIMIGFNGVSRAATSDRVA 157 (339) T ss_pred ccCCCCCcccccccccCCCccEEEEeeeeceecHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeecCCChhh Confidence 111 111111122345555666666655667777777753 23677777777766654333333334433221100 Q ss_pred --cc--------------------ccc-ccccc--c-ccccchhhHHHHH-HHHHHhhhhccCC---CEEEEcHHHHHH- Q lcl|NC_012784. 255 --TS--------------------SGF-EKEGK--K-LEVKKAKSLDDIK-DAINLNVKPNYEH---NVAIVSQTMFAK- 303 (415) Q Consensus 255 --~~--------------------~~~-~~~~~--~-~~~~~~~~~~~~~-~~~~~~~~~~~~~---~~~v~~~~~~~~- 303 (415) .+ ... ...+. . ........+|.++ ++++.+.++.++. -++++....+.. T Consensus 158 nPllqDVN~GWlQ~~Re~ap~rV~~~g~~~s~~i~~~G~ggdy~NLDalV~d~~~~lId~~~~~d~dLVvivG~dLla~k 237 (339) T protein:vir:79 158 NPMLQDVNKGWLQNLREQAPQRVMKEGKAAAGKITVGGAGADYGNLDALVYDITNHLVEPWYAEDPDLVVVCGRNLLSDK 237 (339) T ss_pred CcCccccchhHHHHHHhhhhhhhhccceeccceeEeccCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhhH Confidence 00 000 00000 0 1112244556665 5565666665554 366777666442 Q ss_pred HHHhhccCCcccccCcccC---CCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecccCceEE Q lcl|NC_012784. 304 LDKMKDKLGNYLIQPDVKE---KTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECL 380 (415) Q Consensus 304 l~~lkd~~G~~l~~~~~~~---~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~ 380 (415) -..|-.....|- +-... ....+|-|+|.+..+++|..+ +++=-|++--.-+.+...+-...+.......- T Consensus 238 ~~~l~n~~~~pt--E~~Aa~~i~s~k~iGGl~a~~~PfFP~~~-----llVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie 310 (339) T protein:vir:79 238 YFPLVNRDRDPV--QQIAADLIISQKRIGNLPAIRVPYFPANG-----LLVTRLDNLSIYYQEGGRRRTILDNAKRDRIE 310 (339) T ss_pred hhhHhhcCCChH--HHHHHHHHHHhhhhCCceeEEccccCCCc-----eEEeechhcEEEEecCcEEEEEEecccccccc Confidence 223323333331 01111 113578999999999999764 56655555322233333333332222111111 Q ss_pred EEEEE-eccEEeccccEEEEE---eecCC Q lcl|NC_012784. 381 MIAVR-QDCRILDYKSAIVIE---YDDSE 405 (415) Q Consensus 381 ~~~~r-~d~~v~~p~a~~~~~---~t~~~ 405 (415) -.+.| -+..|-++.+++.++ +..++ T Consensus 311 ~y~s~Ne~YvVEd~~~~a~iEni~~~~aa 339 (339) T protein:vir:79 311 NYESSNDAYVIEDLACAAMAENIALAAAA 339 (339) T ss_pred chhhccceeeeeccccEEEeeeeecccCC Confidence 11112 233445555555554 22222 No 200 >protein:vir:95451 Length: 313 # NCBI annotation: hypothetical protein ORF044 # Family: family:all:11728 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294637;genbank:gi:149408203;genbank:GeneID:5237018 Probab=94.84 E-value=0.0034 Score=34.14 Aligned_cols=281 Identities=14% Similarity=0.116 Sum_probs=142.5 Q ss_pred cccccccceeecchhHHhHHHHHHhhhhhhhhcce-eEEccCCceeEEEEeecCCcccccccccccccccccccceeeEe Q lcl|NC_012784. 122 GSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVT-VKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAY 200 (415) Q Consensus 122 ~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~-~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~ 200 (415) ...+...-...+...++..|...+.+...-....+ +.-.+.+. ++.++. .+.+......|.+... -....-++|++ T Consensus 1 ~~~TSNT~A~I~SE~~s~~I~~~LH~~LL~~~~~R~V~DF~~G~-~L~I~t-iGs~~~~~~~E~~~~~-~~~i~TGEIt~ 77 (313) T protein:vir:95 1 MQLTSNTRAFIESEQYSKFILLNLHDGLLPETFYRNVSDFGSGE-TLHIKT-IGSVTLQEAEEDTPLI-YNPIETGEITF 77 (313) T ss_pred CcccccchheehhhhHHHHHHHHhhccccchhhhhhhccCCCCC-EEEecc-cCceeeeccccCCCee-ecccccceEEE Confidence 23344444455556667766655555433333333 22233232 444442 3444444444444443 23455678999 Q ss_pred eeeeEEEee-hhhHHHHhcchH--HHHHHHHHHHHHHHHHHHHHHHhhccccc----cccc-cccccccccccccccchh Q lcl|NC_012784. 201 DINTHRGYF-RISREAIEDAKV--NVLQELKLWMARTIAATRNKAIIDVITKG----STGS-TSSGFEKEGKKLEVKKAK 272 (415) Q Consensus 201 ~~~k~a~~~-~iS~e~l~ds~~--~l~~~l~~~la~~~~~~~d~~il~g~g~~----~~~~-~~~~~~~~~~~~~~~~~~ 272 (415) -...+++-. +||+.+-+|+-. ++...+..+-+++|....+..+|. +|.. .+.+ ...+..-....+...+.- T Consensus 78 ~i~~Y~G~A~~vt~~LR~D~~~I~~~~A~~~AE~~RAI~E~~~TD~L~-~G~~~FA~~~~P~~vNG~PH~~V~~~T~~~~ 156 (313) T protein:vir:95 78 QITEYKGDAWYVTDDLREDGTDIDRLMAERAAESTRAIQETFETDFLK-TGAEYFAANPGPHNVNGFPHVIVSAETNGVF 156 (313) T ss_pred EEEeecCChhhhhhhhhhcchhHHHHhhhcchhhHHHHHHHHhhHHHh-hchhhhccCCCCcccccccceEEeccCCcee Confidence 888887644 899999999752 233333334445555555666663 2321 1111 122333333444445555 Q ss_pred hHHHHHHHHHHhhhhccC--CCEEEEcHHHHHHHHHhh------ccCCcccccCcccCCC--CceecceeeEEecccccc Q lcl|NC_012784. 273 SLDDIKDAINLNVKPNYE--HNVAIVSQTMFAKLDKMK------DKLGNYLIQPDVKEKT--QQRLLGAKIEILPDEVLG 342 (415) Q Consensus 273 ~~~~~~~~~~~~~~~~~~--~~~~v~~~~~~~~l~~lk------d~~G~~l~~~~~~~~~--~~~l~G~pV~~~~~~~~~ 342 (415) ...++..+-...-.+... +-++++.|.....|..+. ..+|+.|+..+..-+. ...+.|.-+++++-+... T Consensus 157 ~~~~~~~~~~~~~~a~~P~~G~v~IvDP~~~~~L~~l~~It~~vt~~~k~I~ESG~A~~~~Fi~~~YG~Di~~SN~L~~A 236 (313) T protein:vir:95 157 ALKHLIAMRLAFDKANVPAEGRVFIVDPVAEATLNGLVTITHDVTDFGKMILESGMARGQRFIMNLYGWDILTSNRLHVA 236 (313) T ss_pred hhhHHHHhhhhhhhccCCccceEEEEcchhhhhhhhhheeecccccccceeeeccCCchhHHHHHHhhhhhhhhhhhhhc Confidence 667777666555444433 457999999988888763 2446666544443322 245788888888755432 Q ss_pred ccC----CceEEEechhhcEEEEeecceEEEEee--------ccc-CceEEEEEEEeccEEeccccEEEEEeecCCC Q lcl|NC_012784. 343 QKG----NNTLIIGNLKDAIVLFDRSQYQASWTD--------YMH-FGECLMIAVRQDCRILDYKSAIVIEYDDSER 406 (415) Q Consensus 343 ~~~----~~~~~~gd~~~~~~~~~~~~~~i~~~~--------~~~-~~~~~~~~~r~d~~v~~p~a~~~~~~t~~~~ 406 (415) .-. ......|++-.++.-..-..+-..|-+ .++ ....-.+..|+|..+.+-+..+.+--.+++- T Consensus 237 N~~D~~tT~~G~~~NlFM~i~D~~~~P~~~AWr~MP~s~~~~~~~~~~~~~~~~~R~G~Gi~R~~~L~~~~~~A~~~ 313 (313) T protein:vir:95 237 NYNDGTTTGNGYVGNLFMCILDDQTKPIMGAWRRMPKSEGERNKDRARDEHVVRCRYGFGIQRLDTLGLLATSATAY 313 (313) T ss_pred cccccccccCceeeeeeeeeecccccceeeeeccccccccccccccccccceeeeeecccceeecceeEEEeccccC Confidence 111 112344443222211111112222211 111 1122355578999888887766653333322 No 201 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=94.82 E-value=0.0034 Score=34.11 Aligned_cols=269 Identities=10% Similarity=0.021 Sum_probs=119.6 Q ss_pred ccceeecchhHHhHHHHHHhhhhhhhhccee-----EEccCCceeEEEEeecCCccccccccc--ccccccccccceeeE Q lcl|NC_012784. 127 DSGFVVIPEEIVTDILKLKEVEFNLDKYVTV-----KRVTNGSGKYPVVRQSEVAALEKVEEL--EENPELAVKPFFQLA 199 (415) Q Consensus 127 ~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~-----~~~~~~~~~~~~~~~~~~~~a~~v~Eg--~~~~~~~~~~f~~v~ 199 (415) .. -...++.++..+.+.....+....++.. +... +..++.|++.... ......-+ +..+..-+.++...+ T Consensus 1 MA-~~n~a~~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~~~-gg~tVkI~~i~~~-gl~DY~R~~~g~~~g~~~~~~~t~~ 77 (299) T protein:vir:79 1 MA-ALNYAKEYSNVLAQAYPYTLNFGDLYATPNNGRYRWT-GSKTIEIPTISTT-GRVDSNRDTIAVAQRNYDNAWEPKV 77 (299) T ss_pred Cc-cchhHHHHHHHHHHHHHhhceeeeeccCcccceeeec-CCCEEEEeccccc-cccccccCCCcccccccCcceeEEE Confidence 11 1123467777777777666554443321 1111 2224556655443 33333211 222211123556666 Q ss_pred eeeeeEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhhHHHH Q lcl|NC_012784. 200 YDINTHRGYFRISREAIEDAK--VNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDI 277 (415) Q Consensus 200 ~~~~k~a~~~~iS~e~l~ds~--~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 277 (415) +.-.+.-.+..=.-.. +.+. ..+...+.+...+.++-.+|...++..-++.... .........+....++.+ T Consensus 78 ldqdr~~~f~vD~~Dv-det~~~~~~a~v~~~~~~~~v~pEiDay~~skl~~~a~~~-----g~~~~~~~~T~~n~y~~i 151 (299) T protein:vir:79 78 LTNQRKWSTLVHPADI-NQTNYVASIGNITKVYNEEQKFPEMDAYCISKIYADWTAL-----GNTADTTVLTTTNVLEVF 151 (299) T ss_pred eeccccceeccchhhH-HHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHHHhhhhc-----CCcccccccCHHHHHHHH Confidence 6666665543110000 1111 1122333333444455555665554332221110 011122233455678999 Q ss_pred HHHHHHhhhhccC--CCEEEEcHHHHHHHHHhhc--cCCcccccCcccCCCCceecceeeEEecc--ccc------c--- Q lcl|NC_012784. 278 KDAINLNVKPNYE--HNVAIVSQTMFAKLDKMKD--KLGNYLIQPDVKEKTQQRLLGAKIEILPD--EVL------G--- 342 (415) Q Consensus 278 ~~~~~~~~~~~~~--~~~~v~~~~~~~~l~~lkd--~~G~~l~~~~~~~~~~~~l~G~pV~~~~~--~~~------~--- 342 (415) .+++..+..+... +-.++++|..+..|.+... ............++..+.|.|.||+.+++ |+. | T Consensus 152 ~~~~~~lde~~vP~~~rvl~vtp~~~~~L~~~~~f~k~~~~~~~~~~~~g~Vg~idG~~Ii~Vps~r~~t~~~~~~G~~~ 231 (299) T protein:vir:79 152 DKLMEKMTEARVPENGRILYVTPVVNTLIKNAKEIQRTVNIKDAGTSLNRQTTDIDTVKIIKVPSNLMKTAYDFTTGWKV 231 (299) T ss_pred HHHHHHHHhcCCCCCCeEEEeCHHHHHHHhhchhhhcccccccccceeeeeeeeecceEEEEechhhcCccceeccCccc Confidence 9999998887765 3468899999888875421 11121222234455667899999987533 331 1 Q ss_pred --ccCCceEEEechhhcEEEEeecceEEEEeecccCce--EEEEE-EEeccEEecc-ccEEEEEeecCCC Q lcl|NC_012784. 343 --QKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGE--CLMIA-VRQDCRILDY-KSAIVIEYDDSER 406 (415) Q Consensus 343 --~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~--~~~~~-~r~d~~v~~p-~a~~~~~~t~~~~ 406 (415) ++.+-.+++.....-+.+.....+.+ ..-..++. .+.-+ .+.|.-|.+. ..-+++.+.++.. T Consensus 232 ~~~ak~in~ii~~~~a~~~~~K~~~~~~--~~P~~~~~~~~~~~~r~y~d~~v~~nk~~~i~~~~~~a~~ 299 (299) T protein:vir:79 232 GAGAKQIFMSLVHPSAIITPVSYQFSKL--DEPTAVTEGKYFYFEESFEDVFILNKKADAIQFVVEGAGA 299 (299) T ss_pred cCcccccceEEEcCCeeeeeEeeeeEEe--ecCCCCCccceeeeeeeeeeeeeeccccCeEEEEeeecCC Confidence 11122344444332111111112222 22222332 22222 2445555543 2333444444433 No 202 >protein:vir:78186 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111152;genbank:gi:134288735;genbank:GeneID:4960646 Probab=94.67 E-value=0.0038 Score=33.86 Aligned_cols=294 Identities=10% Similarity=0.039 Sum_probs=136.7 Q ss_pred hhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCccccc Q lcl|NC_012784. 101 VTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEK 180 (415) Q Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~ 180 (415) ....-...+..... ..+...+....+....|-+.+...+...+.+.+.+++.++++++.-..|...-. ..+++-++- T Consensus 1 M~~~tr~~~~~y~~--~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~l-g~~g~iagr 77 (337) T protein:vir:78 1 MRKETRQAYEKYAA--QIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGL-SVSGPIASR 77 (337) T ss_pred CChHHHHHHHHHHH--HHHHhcChhhhcceeecChHHHHHHHHHHHHHHHHhccCCccccccceeeEEec-ccCcceeee Confidence 00000011111100 011111222334455676778888999999999999999999998777654322 233333322 Q ss_pred ccccc-cccccccccceeeEeeeeeEEEeehhhHHHHhcc--hHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc---- Q lcl|NC_012784. 181 VEELE-ENPELAVKPFFQLAYDINTHRGYFRISREAIEDA--KVNVLQELKLWMARTIAATRNKAIIDVITKGSTG---- 253 (415) Q Consensus 181 v~Eg~-~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds--~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~---- 253 (415) ..-+. .....+...++.-.+..++.=--+.|+.+.|... ..+|+..+++.+.+.++.=.-.--++|+....+. T Consensus 78 tdt~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~ 157 (337) T protein:vir:78 78 TDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQA 157 (337) T ss_pred ecCCCcccccccccccCCCccEEEEeceecccCHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeccCCChhh Confidence 21111 1111222334555555555555567777777753 2367777777776665433333333443322111 Q ss_pred -cc--------------------ccccccc---cccccccchhhHHHHH-HHHHHhhhhccCC---CEEEEcHHHHHH-H Q lcl|NC_012784. 254 -ST--------------------SSGFEKE---GKKLEVKKAKSLDDIK-DAINLNVKPNYEH---NVAIVSQTMFAK-L 304 (415) Q Consensus 254 -~~--------------------~~~~~~~---~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~---~~~v~~~~~~~~-l 304 (415) +. ....... ...........+|.++ ++++.+.++.++. -++++....+.. - T Consensus 158 nPllqDVN~GWlQ~~Re~ap~rVl~~~~~~~~~i~iG~~gdy~NLDalV~d~~~~lI~~~~~~d~dLVvivG~dLladk~ 237 (337) T protein:vir:78 158 NPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLIGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKY 237 (337) T ss_pred CcCccccchHHHHHHHhcchhhhhccccccCCceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHH Confidence 00 0000000 1111222344566666 5566666665554 367777666543 2 Q ss_pred HHhhccCCcccccCcccC---CCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecccCceEEE Q lcl|NC_012784. 305 DKMKDKLGNYLIQPDVKE---KTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLM 381 (415) Q Consensus 305 ~~lkd~~G~~l~~~~~~~---~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~ 381 (415) ..|-...+.|- +-... ....+|-|+|.+..+++|..+ +++=-|++--.-+.+...+-...+.......-- T Consensus 238 ~~l~n~~~~pt--E~~Aa~~i~s~k~iGGl~a~~~PfFP~~~-----ilVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~ 310 (337) T protein:vir:78 238 FPIVNATQAPT--ERLAADLIVSQKRIGNLPAVRVPFFPKRA-----LMVTKLSNLSIYYQEGARRRTLKEVPERDRIEN 310 (337) T ss_pred HHHHhcCCCcH--HHHHHHHHHHhhhhcCcceEEccccCCCc-----eEEeechhcEEEEecCcEEEEEEeccccccccc Confidence 22322333331 00111 113578999999999999764 566555553222333333333322221111111 Q ss_pred EEEE-eccEEeccccEEEEE---eecC Q lcl|NC_012784. 382 IAVR-QDCRILDYKSAIVIE---YDDS 404 (415) Q Consensus 382 ~~~r-~d~~v~~p~a~~~~~---~t~~ 404 (415) .+.| -+..|-++.+++.++ +..+ T Consensus 311 y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) T protein:vir:78 311 YESSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred hhhccceeeeeccccEEEEeceeecCC Confidence 1112 233445555565554 2222 No 203 >protein:vir:1153 Length: 338 # NCBI annotation: predicted major capsid protein # Family: family:all:201 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490602;genbank:gi:17313222;genbank:GeneID:927319 Probab=94.49 E-value=0.0043 Score=33.58 Aligned_cols=296 Identities=12% Similarity=0.056 Sum_probs=140.8 Q ss_pred hhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCccccc Q lcl|NC_012784. 101 VTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEK 180 (415) Q Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~ 180 (415) ....-...+..... ..+...+....+..+.|.+.+...+...+.+.+.+++.++++++.-.+|...- ...+++-+.- T Consensus 1 M~~~tr~~~~~y~~--~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~-lg~~g~iagr 77 (338) T protein:vir:11 1 MRNETRKQFDAYLA--QLAKLNGVNSAVQTFAVEPSVQQKLEQRIQESSEFLKQINVYGVDELQGEKIG-IGVSGTIASR 77 (338) T ss_pred CCHHHHHHHHHHHH--HHHHHhCCCcccceeeeCHHHHHHHHHHHHHHHHhhccCceecccceeeeEee-eccCcccccc Confidence 00011111111111 11112223344556677778899999999999999999999999877765432 2333333332 Q ss_pred cc--ccccccccccccceeeEeeeeeEEEeehhhHHHHhcc--hHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc--- Q lcl|NC_012784. 181 VE--ELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDA--KVNVLQELKLWMARTIAATRNKAIIDVITKGSTG--- 253 (415) Q Consensus 181 v~--Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds--~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~--- 253 (415) +. .+......+...++.-.+..++.=--..|+.+.|+.. ..+|+..+++.+.++++.=.-.--++|+....+. T Consensus 78 tdT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfnG~s~A~~Td~~ 157 (338) T protein:vir:11 78 TDTTGDGVRKPRDVSALDNQRYECKHTDFDTAITYAMLDAWAKFPEFQALLRDAILKRQALDRLMIGFNGTSAAATTNRA 157 (338) T ss_pred ccCCCCCccccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChh Confidence 21 1111111111234555566666555667777777753 2367777777777766543333334453321110 Q ss_pred --ccc--------------------cccccccc--c--ccccchhhHHHHH-HHHHHhhhhccCCC---EEEEcHHHHHH Q lcl|NC_012784. 254 --STS--------------------SGFEKEGK--K--LEVKKAKSLDDIK-DAINLNVKPNYEHN---VAIVSQTMFAK 303 (415) Q Consensus 254 --~~~--------------------~~~~~~~~--~--~~~~~~~~~~~~~-~~~~~~~~~~~~~~---~~v~~~~~~~~ 303 (415) +.+ ......+. . ........+|.++ +++..+.++.++.. ++++....+.. T Consensus 158 ~nPllqDVNkGWlQ~~Re~ap~rv~~~~~~~~~i~i~~g~~gdy~nLDalV~d~~~~lI~~~~~~d~dLVvivG~dLlad 237 (338) T protein:vir:11 158 ANPLLQDVNIGWFQQYRNNAPARVLKEGKTTGKVVVGNGADADYKNLDALVFDVVSSLIDPWHRRDPGLVVILGRELVHD 237 (338) T ss_pred hCcCccccchhHHHHHHhhhhhhhhhcccccceeeecCCCCCccccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHH Confidence 000 00000000 0 1112244566666 45666666665543 77777765442 Q ss_pred -HHHhhccCCcccccCcccC---CCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecccCceE Q lcl|NC_012784. 304 -LDKMKDKLGNYLIQPDVKE---KTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGEC 379 (415) Q Consensus 304 -l~~lkd~~G~~l~~~~~~~---~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~ 379 (415) -..+-.....|- +-... ....+|-|+|.+..+++|... +++=-|++--.-+.....+-...+....... T Consensus 238 k~~~l~n~~~~pt--E~~Aa~~~~s~k~iGGlpa~~~PffP~~~-----~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~ri 310 (338) T protein:vir:11 238 KYFPMVNKDQPAT--EKIATDLILSQKRMGGLPPVEVPYVPEKG-----LMVTTLKNLSLYWQIGGRRRYLKEVPEKNRI 310 (338) T ss_pred HHhHHHhcCCChH--HHHHHHHHHHhhhhCCceeEEccccCCCc-----eEEeeccccEEEEecCcEEEEEEeccccccc Confidence 222333322221 00111 113579999999999999764 5665555532223333333333222211111 Q ss_pred EEEEEE-eccEEeccccEEEEEeecCCC Q lcl|NC_012784. 380 LMIAVR-QDCRILDYKSAIVIEYDDSER 406 (415) Q Consensus 380 ~~~~~r-~d~~v~~p~a~~~~~~t~~~~ 406 (415) --.+.| -+..|-++.+++.++=..-.. T Consensus 311 e~y~s~Ne~YvVEd~~~~a~ieni~~~~ 338 (338) T protein:vir:11 311 ENYESSNDAYVVEDYGLGCLVENIEVAE 338 (338) T ss_pred cchhhhccceeeeccccEEEeecceecC Confidence 111112 233445556666655221111 No 204 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=94.46 E-value=0.0044 Score=33.53 Aligned_cols=303 Identities=9% Similarity=-0.001 Sum_probs=130.7 Q ss_pred hhhHHHHHHHHHHHHHhhhhhHHHHHHH--HHHhhhhhhhhcccccccceeecchhHHh----HHHHHHhhhhhhhhcce Q lcl|NC_012784. 83 YRNQANINDLGISIQNTKVTSQEVRDFT--EYLETRNDIQGGSLKTDSGFVVIPEEIVT----DILKLKEVEFNLDKYVT 156 (415) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~vP~~~~~----~Ii~~~~~~~~l~~~~~ 156 (415) ...........+ .+ ........... ...-..........-.+.+...+|..+.+ .+++.+........++. T Consensus 1 ~~~~~~~~~l~~--~g-i~~~~~~~~~~~~~~~~~~da~d~~~~~~~~~~~~~~~~l~~~i~p~~~~~~~~~~~~~~l~p 77 (336) T protein:vir:36 1 MRDAQRIQNLAR--AG-VILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVG 77 (336) T ss_pred CchHHHHHHHhh--cC-eeecchhhhhhhHHHHhhhhhhhccCccccCCCcchHHHHHHhhccceEeeecchhhhhhhcc Confidence 000000000000 00 00000000000 00000111111111111223345654433 44444444444444544 Q ss_pred eEEccCCc-eeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhh-HHHHhc--chHHHHHHHHHHHH Q lcl|NC_012784. 157 VKRVTNGS-GKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRIS-REAIED--AKVNVLQELKLWMA 232 (415) Q Consensus 157 ~~~~~~~~-~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS-~e~l~d--s~~~l~~~l~~~la 232 (415) +...+.-. ..+.+........+.+.+-+...|-. +......+...+.++..+.++ .|+..- ..+++.+--....+ T Consensus 78 v~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~~-d~~~~~~~~~v~~~~~g~~yg~~E~~~Aa~~~~~l~~~Ka~aA~ 156 (336) T protein:vir:36 78 ESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDS-GANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSA 156 (336) T ss_pred ccccCCccceeEEEeeeeceeeEEEeeccCCCcee-ecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHHHHHHH Confidence 43321111 12233333445566778888888744 455666777788888888887 555443 33456677777777 Q ss_pred HHHHHHHHHHHhhccccccccccccccccc---c----ccccccchhhHHHHHHHHHHhhhhcc------CCCEEEEcHH Q lcl|NC_012784. 233 RTIAATRNKAIIDVITKGSTGSTSSGFEKE---G----KKLEVKKAKSLDDIKDAINLNVKPNY------EHNVAIVSQT 299 (415) Q Consensus 233 ~~~~~~~d~~il~g~g~~~~~~~~~~~~~~---~----~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~v~~~~ 299 (415) +++.+.+++-.+.|+.....-+...+.... . .....+....++|+..++.++..... .+..++|.|+ T Consensus 157 ~ale~~~N~i~~~Gd~~~~~yGllNdP~l~a~~t~~t~~~~~~t~~ei~~Di~~~~~~l~~qt~G~i~~~~~~tL~LP~~ 236 (336) T protein:vir:36 157 LGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPT 236 (336) T ss_pred HHHHHhhCcEEEEeccccceEEEEecCCCccccccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccEEEechH Confidence 778888887666676544332322211111 1 01111223346788888777765432 3678999999 Q ss_pred HHHHHHHhhccCCcccccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEee-cceEEEEe----ecc Q lcl|NC_012784. 300 MFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDR-SQYQASWT----DYM 374 (415) Q Consensus 300 ~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~-~~~~i~~~----~~~ 374 (415) .+..|.. ++..|.-++. -+... +-++.++..+.. .++.+....++-+- .+. ....+.+. .+. T Consensus 237 ~~~~Ls~-~n~~g~Tvl~-~lk~n----~Pnl~i~t~pEl-~~a~g~~~~l~~~~------~~~~~t~~~~~p~~~~~l~ 303 (336) T protein:vir:36 237 AMSDLSK-TNQYGLAAAA-KLKDI----FPKLEFVTIPEY-DTASGRLVQLWAPR------VEGKDTATCGFTEKMRAHS 303 (336) T ss_pred HHHhccC-CCccCccHHH-HHHHh----cCccEEEEcccc-ccCCCceEEEEEEe------cCCCcceeeecchhhhccc Confidence 8888853 3333432221 11111 112233333332 22223322222110 000 01111111 100 Q ss_pred ---cC-ceEEEEEEEecc-EEeccccEEEEEee Q lcl|NC_012784. 375 ---HF-GECLMIAVRQDC-RILDYKSAIVIEYD 402 (415) Q Consensus 375 ---~~-~~~~~~~~r~d~-~v~~p~a~~~~~~t 402 (415) .. ....-...|.+| .+.+|.||++++=- T Consensus 304 vq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:36 304 IERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred eeecCceeEeccccceeeeeeeccchheeeecC Confidence 00 001122345544 56789999998733 No 205 >protein:vir:270 Length: 341 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536650;genbank:gi:17975128;genbank:GeneID:929084 Probab=94.16 E-value=0.0052 Score=33.11 Aligned_cols=299 Identities=10% Similarity=0.057 Sum_probs=138.8 Q ss_pred HHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCc Q lcl|NC_012784. 97 QNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVA 176 (415) Q Consensus 97 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 176 (415) +...........+....... +...+.........|-+.+...+...+.+.+.+++.++++++.-.+|...- ...+++ T Consensus 1 m~~~m~~~tr~~~~~y~~~~--A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~-lg~~g~ 77 (341) T protein:vir:27 1 MSQILTQSAREYMDNFAQQL--AKSYGVSNVAELFNVSPQLETKLRAAITESAEFLKMITVTTVDQIEGQVVD-VGVSGL 77 (341) T ss_pred CcccccHHHHHHHHHHHHHH--HHHcCcccccceEeecHHHHHHHHHHHHhhHHhhhcCccccccceeeeEee-cccccc Confidence 11111111111111111111 111122233344556667888999999999999999999998877765432 233333 Q ss_pred ccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcch-----HHHHHHHHHHHHHHHHHHHHHHHhhcccccc Q lcl|NC_012784. 177 ALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK-----VNVLQELKLWMARTIAATRNKAIIDVITKGS 251 (415) Q Consensus 177 ~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~-----~~l~~~l~~~la~~~~~~~d~~il~g~g~~~ 251 (415) -+.-..- +-.| ..+.++...+..++.=--+.|+.+.|+... .+|+..+++.+.++++.=.-.--++|+.... T Consensus 78 iagrtdt-~R~~--r~~~l~~~~Y~c~qtn~dt~i~y~~lDaWA~~g~~~dF~~r~~~~i~~~~ALD~i~IGfnGts~A~ 154 (341) T protein:vir:27 78 YTGRKAG-GRFT--KQVGVGGHKYKLAETDSCAAITWAMLCQWANQGGRDQFMKHLTEFSNQMFALDIMRIGWNGVSAEA 154 (341) T ss_pred eeeccCC-Ccee--cccccCCcceEEEEeeeeeeecHHHHHHHHhcCCChHHHHHHHHHHHHHHhhhhhhhcccceeecc Confidence 3333322 2222 223556666666666556677777776543 6788888888887776444444445543211 Q ss_pred cc---ccccc------------------ccccccc--ccccchhhHHHHH-HHHHHhhhhccCCC---EEEEcHHHHHH- Q lcl|NC_012784. 252 TG---STSSG------------------FEKEGKK--LEVKKAKSLDDIK-DAINLNVKPNYEHN---VAIVSQTMFAK- 303 (415) Q Consensus 252 ~~---~~~~~------------------~~~~~~~--~~~~~~~~~~~~~-~~~~~~~~~~~~~~---~~v~~~~~~~~- 303 (415) +. ..+.+ ....... ........+|.++ +++..+.++.++.. ++++....+.. T Consensus 155 ~Td~~anPllqDVNkGWlQ~~Re~a~~rVl~~~~~~~g~~gdy~nLDAlV~D~~~~lI~~~~~~d~dLVvivG~dLla~k 234 (341) T protein:vir:27 155 DTDPSANPLGQDVNEGWIAFVKNRKASQVVDVDVYFDETNGDYRTLDAMASDIINNQIHPMFRNDPRLTVFVGSGLIGAA 234 (341) T ss_pred CCChhhcccccccchhHHHHHHhhcccceeccceeeccCCCccccHHHHHHHHHhcccChHHhcCCCEEEEEchhhhhhh Confidence 10 00000 0001111 1122233456655 55666666665543 77777666542 Q ss_pred HHHhhccCCcccccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeec------ccCc Q lcl|NC_012784. 304 LDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDY------MHFG 377 (415) Q Consensus 304 l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~------~~~~ 377 (415) -..|-+....|-=. -....-..+|-|+|.+..+++|... +++=-|++-..-+.....+-.+.+. ..|. T Consensus 235 ~~~l~n~~~~ptE~-~Aa~~i~k~iGGlpa~~~PffP~~~-----~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~ye 308 (341) T protein:vir:27 235 QAKLYDKADKPSEQ-IAAQKLDKTIAGRPAYVPPFLPDNA-----MVVTIPENLQVLTQHGTAQRKAKHESDRKRSKTHT 308 (341) T ss_pred hhhhhccCCCCHHH-HHHHHHHHhhCCCeEEEccccCCCc-----eEEeeccceEEEEecCcEEEEEEeccccccccchh Confidence 22332222222100 0001113589999999999999764 5555555532223322232222221 1222 Q ss_pred eEEEEEEEeccEEeccccEEEEEeecCCCCcccccccC Q lcl|NC_012784. 378 ECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) Q Consensus 378 ~~~~~~~r~d~~v~~p~a~~~~~~t~~~~~~~~~~~~~ 415 (415) .. ..|-+-.+|..+.++.-.-+-+.+---. T Consensus 309 s~--------YvVEdyg~~~~~~~~~vkl~~~~~~~~~ 338 (341) T protein:vir:27 309 GA--------WKVTQWVCWKRSPLTTQKKSTSALNHRS 338 (341) T ss_pred hh--------heeehhhhhhhccccccccCcccccccc Confidence 22 2222333333333332221111111111 No 206 >protein:vir:103463 Length: 521 # NCBI annotation: major head subunit precursor # Family: family:all:364 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803115;genbank:gi:116326395;genbank:GeneID:4405492 Probab=94.04 E-value=0.0056 Score=32.94 Aligned_cols=357 Identities=14% Similarity=0.109 Sum_probs=140.8 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhhhhccccccccc Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQI-QEKQEELDKLKEKDGTSENNQQSVEVNE 79 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (415) ||..++|.++..-+.+. +...++...+.. +-++| +.+++.+.. . + T Consensus 3 ~~~~~~l~~kw~p~l~~--------------~~~~~i~~~~~~---~~a~~~enq~~~~~~--------~-----~---- 48 (521) T protein:vir:10 3 IKTKAELLNKWKPLLEG--------------EGLPEIANSKQA---IIAKIFENQEKDFQT--------A-----P---- 48 (521) T ss_pred cchhHHHHHhhhhhhcc--------------CCCCccccchhh---hhhhhhhhhhhhhhh--------c-----c---- Confidence 99999999998887542 111111110111 11111 011101000 0 0 Q ss_pred hhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHh---hhhhhhhcce Q lcl|NC_012784. 80 ARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKE---VEFNLDKYVT 156 (415) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~---~~~~l~~~~~ 156 (415) ..+.......+............. ......... +.+.+++ ..+.+.++.++| +.....+++. T Consensus 49 --~~~~~~~~~~~~~~l~e~~~~~~~---------~~~~~~i~e-s~~t~~v---~~~~P~Li~lvRra~p~LIa~DIwG 113 (521) T protein:vir:10 49 --EYKDEKIAQAFGSFLTEAEIGGDH---------GYNATNIAA-GQTSGAV---TQIGPAVMGMVRRAIPNLIAFDICG 113 (521) T ss_pred --ccchhHHHHHHhhhhhhhcccCcc---------ccccccccc-ccccccc---ccCCchhhhHHHHHHhhhhhhhcee Confidence 001111111111111111000000 000000000 1111111 123333333333 3444556788 Q ss_pred eEEccCCceeEEEEeec--CC---------------cccccc-------------------------------------- Q lcl|NC_012784. 157 VKRVTNGSGKYPVVRQS--EV---------------AALEKV-------------------------------------- 181 (415) Q Consensus 157 ~~~~~~~~~~~~~~~~~--~~---------------~~a~~v-------------------------------------- 181 (415) ++||+++++-+=-.+.. .. +.+.+- T Consensus 114 VQPMTgPTGLIFAMRsrY~~q~~~~~g~eaf~~~~~ada~fSG~~~at~~s~~~~~~~~~~Gd~~~~~~~~~g~~~~~~~ 193 (521) T protein:vir:10 114 VQPMNSPTGQVFALRAVYGKDPIAAGAKEAFHPMYGPDAMFSGQGAAKKFAALAASTQTTVGDIYTHFFQDTGTVYLQAS 193 (521) T ss_pred eccCCchhhhheeeeeeccCCccccccccccchhccccccccccccccccccccccccccccccccccccccccceeccc Confidence 88888877643111110 00 000000 Q ss_pred -------------------------------ccccc---------ccccccccceeeEeeeeeEEEe-------ehhhHH Q lcl|NC_012784. 182 -------------------------------EELEE---------NPELAVKPFFQLAYDINTHRGY-------FRISRE 214 (415) Q Consensus 182 -------------------------------~Eg~~---------~~~~~~~~f~~v~~~~~k~a~~-------~~iS~e 214 (415) ++|-. ....+...|.+..+...|..+- ..+|-| T Consensus 194 ~~~t~~~t~~d~~~~~~~~~~~~~~~~~y~~~~GmsTa~aEal~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiE 273 (521) T protein:vir:10 194 AQVTISSTADDAAKLDAEIKKQMEAGALVEIAEGMATSIAELQESFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIE 273 (521) T ss_pred ccccCCCcccccccccccccccccccceeecccccchhhHhhhccCCCCccccccceeeEEEEEEEeeeccceeccccHH Confidence 00000 0001122355666666665544 479999 Q ss_pred HHhcc----hHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccc---ccccccccc-------chhhHHHHHHH Q lcl|NC_012784. 215 AIEDA----KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFE---KEGKKLEVK-------KAKSLDDIKDA 280 (415) Q Consensus 215 ~l~ds----~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~---~~~~~~~~~-------~~~~~~~~~~~ 280 (415) |.+|- .+|.+++|.+-|+..|...++++||.-......-+...... ...+..... .-...+-++.+ T Consensus 274 LAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~~g~t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L 353 (521) T protein:vir:10 274 LAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKSGMTLTPGSKAGVFDFQDPIDIRGARWAGESFKAL 353 (521) T ss_pred HHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhheeeeeeeeeeeccCccccceecccccccccchHHHHHHHHH Confidence 99983 45889999999999999999999996432221111100000 001111111 01111222222 Q ss_pred HHHh-------hh--hccCCCEEEEcHHHHHHHHHhh-----ccCC-cccccCcccCCC-Cceec-ceeeEEeccccccc Q lcl|NC_012784. 281 INLN-------VK--PNYEHNVAIVSQTMFAKLDKMK-----DKLG-NYLIQPDVKEKT-QQRLL-GAKIEILPDEVLGQ 343 (415) Q Consensus 281 ~~~~-------~~--~~~~~~~~v~~~~~~~~l~~lk-----d~~G-~~l~~~~~~~~~-~~~l~-G~pV~~~~~~~~~~ 343 (415) +.++ .. .-...+.+++++.....|...- .+.| ..=|..+.+... .+.|. |++|.+..+++. T Consensus 354 ~~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~-- 431 (521) T protein:vir:10 354 LFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQGLATGFNTDTTKSVFAGVLGGKYRVYIDQYAKQ-- 431 (521) T ss_pred HHHHHHHHHHHHHhcccccceEEEEchHHHHHHhhcccccccccccccccccccCCCceEEEEecCceEEEecCCCCc-- Confidence 2222 22 1244678999999988888531 1111 000111221111 13443 577877776542 Q ss_pred cCCceEEEechh-----hcEEEEee-cceEEEEeecccCceEEEEEEEeccEEeccccEEEEEeecCC---CCccccccc Q lcl|NC_012784. 344 KGNNTLIIGNLK-----DAIVLFDR-SQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIEYDDSE---RGEGDLGLE 414 (415) Q Consensus 344 ~~~~~~~~gd~~-----~~~~~~~~-~~~~i~~~~~~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~~---~~~~~~~~~ 414 (415) ..+++|.=- ........ ...-+...|...|+-.+-...|+++.+ +|= +. ..+..+ -.++|..-. T Consensus 432 ---dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP~--~~-~~~~~~~~~i~~~~~~~~ 504 (521) T protein:vir:10 432 ---DYFTVGYKGPNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPF--AE-SAAQAPASRIQSGMPSIL 504 (521) T ss_pred ---ceEEEEEeCCcccccceeeccccccccccccCCccccceeeeeeeeceee-cCc--cc-ccCCccceeecccchhhh Confidence 234444200 00000000 001112234455666666667777654 452 22 111111 134444444 Q ss_pred C Q lcl|NC_012784. 415 A 415 (415) Q Consensus 415 ~ 415 (415) | T Consensus 505 a 505 (521) T protein:vir:10 505 N 505 (521) T ss_pred c Confidence 4 No 207 >protein:vir:861 Length: 318 # NCBI annotation: putative minor structural protein # Family: family:all:2417 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047120;genbank:gi:9630573;genbank:GeneID:1261764 Probab=93.87 E-value=0.0031 Score=34.31 Aligned_cols=301 Identities=13% Similarity=0.111 Sum_probs=127.0 Q ss_pred chhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeE Q lcl|NC_012784. 79 EARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~ 158 (415) -..--+.......++.-...+...+.....|... ....+.+.++....+|..+...|...+.++.++.....+. T Consensus 1 mtn~iesq~A~~eF~~vL~~N~G~S~~k~AW~A~------L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT 74 (318) T protein:vir:86 1 MTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAK------LAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVT 74 (318) T ss_pred CcchhhhhHHHHHHHHHHhccCCchhhhhhhhhh------hhhcCceeeccchhccHHHHHHHHHhhhccCcceeeeeec Confidence 0001112222223333333333333222222211 1122344456667789999999999999999998754443 Q ss_pred EccCCceeEEEEeec-CCcccccccccccccccccccceeeEeeeeeEEEeehhhH-HHHhc---chHHHHHHHHHHHHH Q lcl|NC_012784. 159 RVTNGSGKYPVVRQS-EVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISR-EAIED---AKVNVLQELKLWMAR 233 (415) Q Consensus 159 ~~~~~~~~~~~~~~~-~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~-e~l~d---s~~~l~~~l~~~la~ 233 (415) +.+ .|.+.+.. +.+.+.-+..|.++.+. .|..+..+..-.++++..|- ++..+ +-..+..|+.++|+. T Consensus 75 ~~~----~~~V~~s~~s~AeAq~HkdGqTK~eq---a~~~~~~Tl~~~~VY~~~S~Ae~~K~~~~sYsel~N~i~~ELtQ 147 (318) T protein:vir:86 75 NVG----ALLVSRSFDSSAEAQVHKDGQTKTEQ---AATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQ 147 (318) T ss_pred cch----hhhhhhhhhhhhhhhhhccCCccccc---eeeeeeechhHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHH Confidence 332 33332222 22455556666666543 34445555555555555553 33333 333468999999999 Q ss_pred HHH-HHHHHHHhhcccccccccccc--cc---ccccccccccchhhHHHH-HHHHHHhhhhccCCCEEEEcHHH-HHHHH Q lcl|NC_012784. 234 TIA-ATRNKAIIDVITKGSTGSTSS--GF---EKEGKKLEVKKAKSLDDI-KDAINLNVKPNYEHNVAIVSQTM-FAKLD 305 (415) Q Consensus 234 ~~~-~~~d~~il~g~g~~~~~~~~~--~~---~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~v~~~~~-~~~l~ 305 (415) ++. +..|.+++-|+|.++-...-. .. ........+.++..+... ..+..-..+.. ..-..++...+ .+.|. T Consensus 148 ~~vnk~Vd~AlV~GDG~N~f~~~DK~advK~I~k~Ttkaksagttpfanaieeavdfvrpta-grrylivkaedrkalld 226 (318) T protein:vir:86 148 AIVNKIVDLALVEGDGSNGFKSIDKEADVKKIKKITTKAKSAGTTPFANAIEEAVDFVRPTA-GRRYLIVKAEDRKALLD 226 (318) T ss_pred HHHHHHHHhhheeecCCCCccchhhHHHHHHHHHHhhhhhccCCCchhhHHHHHHhhhccCC-CceEEEEeecchHHHHH Confidence 999 888999999998875322110 00 001111112223333222 22222221111 11234444444 34455 Q ss_pred HhhccCCccc--ccCcccCCCCceeccee-eEEeccccccccCCceEEEechhhcEEEEeecceE-EEEeecccCceEEE Q lcl|NC_012784. 306 KMKDKLGNYL--IQPDVKEKTQQRLLGAK-IEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQ-ASWTDYMHFGECLM 381 (415) Q Consensus 306 ~lkd~~G~~l--~~~~~~~~~~~~l~G~p-V~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~-i~~~~~~~~~~~~~ 381 (415) .|+-++.+-- ..++-+.- ..--|.. +++.. ++..-++-++-|-+ |.+ +-++++ +......++..-+. T Consensus 227 elrqatanahvriknddtei--asevgvdeiivyt----gskalkptvlvdqk--yhi-dmqdltkvdafewktnsnmil 297 (318) T protein:vir:86 227 ELRQATANAHVRIKNDDTEI--ASEVGVDEIIVYT----GSKALKPTVLVDQK--YHI-DMQDLTKVDAFEWKTNSNMIL 297 (318) T ss_pred HHHhhcccceeEEeccchhh--hhhcCcceeeeee----ccccccceeeeccc--eec-chhhhhhhhcceeccCCceEE Confidence 5554433211 11111000 0001111 11111 11111111222211 111 212222 11122223322233 Q ss_pred EEEEeccEEeccccEEEEEee Q lcl|NC_012784. 382 IAVRQDCRILDYKSAIVIEYD 402 (415) Q Consensus 382 ~~~r~d~~v~~p~a~~~~~~t 402 (415) ++..-.|-|..-+|-+.++++ T Consensus 298 vetltsghvetynagavitvs 318 (318) T protein:vir:86 298 VETLTSGHVETYNAGAVITVS 318 (318) T ss_pred EeecccCcceeecCceeEEeC Confidence 443444444444455555555 No 208 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=93.44 E-value=0.0075 Score=32.22 Aligned_cols=308 Identities=6% Similarity=-0.036 Sum_probs=134.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhccccccccchhhhhhHHHHHHHHHHHHHhhhhhHHHHHH--HHHHhhhhhhhhc Q lcl|NC_012784. 45 TDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDF--TEYLETRNDIQGG 122 (415) Q Consensus 45 ~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~ 122 (415) -+|+-+.+. ++ ..++ .+........... ....-.......+ T Consensus 1 ~~~~~~~~~----~~-------~l~~--------------------------~g~~~~~~~~~~~~~~~~~~a~d~~~~~ 43 (339) T protein:vir:94 1 MSINNDRTD----IK-------QLEK--------------------------VGIIFDGYSPKSISSEVSAYAMDAVNLT 43 (339) T ss_pred CceechHHH----HH-------HHHh--------------------------hceeeccchhhhcchhhHhhhccccccc Confidence 000000000 00 0000 0000000000000 0000001111111 Q ss_pred ccccccceeecch----hHHhHHHHHHhhhhhhhhcceeEEccCC-ceeEEEEeecCCcccccccccccccccc-cccce Q lcl|NC_012784. 123 SLKTDSGFVVIPE----EIVTDILKLKEVEFNLDKYVTVKRVTNG-SGKYPVVRQSEVAALEKVEELEENPELA-VKPFF 196 (415) Q Consensus 123 ~~~~~~~~~~vP~----~~~~~Ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~a~~v~Eg~~~~~~~-~~~f~ 196 (415) ....+.....||. .+.+.|++........+.++.+.+...- ...+.+........+.+++.++..|-.+ +..|. T Consensus 44 ~~~~~~~~~~i~a~~~~~i~~~vy~~~~~~~~~~~l~pv~t~g~w~~~t~~y~~~e~~G~a~~ygd~ad~Pl~~~~v~~~ 123 (339) T protein:vir:94 44 PTLQTTANAGIPAWMTTFVDRRVIDIQLAPMAAAKIFPEVKKGDWTTTYGVFIIAEPVGQVATYSDWSANGMSKANVNFE 123 (339) T ss_pred cccccccccchhhhhhhhhchhheeecccccchhhhcccccCCCCcccEEEEeeeecccceEEcccccCCCcccccceee Confidence 1112222233443 3456667777777777777776554432 2345666666777888888888886433 24455 Q ss_pred eeEeeeeeEEEeehhhHHHHhc--chHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccc--ccccc----c Q lcl|NC_012784. 197 QLAYDINTHRGYFRISREAIED--AKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKE--GKKLE----V 268 (415) Q Consensus 197 ~v~~~~~k~a~~~~iS~e~l~d--s~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~--~~~~~----~ 268 (415) ..++....++-.+. ..|+..- ...++.+--....++++.+.+++-.+.|+......+...+.... ..... . T Consensus 124 ~~~v~~~~~g~~y~-~~E~~~A~~~g~~l~~~Ka~aA~~al~~~~N~i~~~Gd~~~~~~GLlN~P~l~~~v~~s~~Wa~k 202 (339) T protein:vir:94 124 SRQNYRYQTWTEYG-DLEMATYGEAGIDYVARQEISASLVMAKFANSSYLLGVAGIANYGLMNDPSLPAPVAATVNWATA 202 (339) T ss_pred EEeEEEEEEEEeec-HHHHHHHHhhCCChHHHHHHHHHHHHHHhhceEEeeeecccceEEEEeCCCccccccCCCCcccC Confidence 55555544443333 2333322 34567777788888888888888777776543322222211110 11111 1 Q ss_pred cchhhHHHHHHHHHHhhhhcc------CCCEEEEcHHHHHHHHHhhccCCcccccCcccCCCCceecceeeEEecccccc Q lcl|NC_012784. 269 KKAKSLDDIKDAINLNVKPNY------EHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLG 342 (415) Q Consensus 269 ~~~~~~~~~~~~~~~~~~~~~------~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~ 342 (415) +...-++|+..++.++..... .+..++|.|+.+..|.. ++..|.-++. -+... +.++.++....+- + T Consensus 203 T~~eI~~Di~~~~~~l~~~s~g~~~~~~~~~L~LP~~~~~~L~~-~n~~~~Tvl~-~lk~n----~pnl~i~~~~el~-~ 275 (339) T protein:vir:94 203 APEDIANDVVAMVGRLISQSGGLITGQERMVMALAPSALNNVNR-TNNFGLSAGA-KIAQT----YPNIQFVAVPEFD-T 275 (339) T ss_pred CHHHHHHHHHHHHHHHHHhcCCeeeeccCcEEEecHHHHHhccc-CCcCCccHHH-HHHHh----cCCcEEEEccccc-c Confidence 122234666666666654322 24579999999998864 3444433321 11111 2233455444432 2 Q ss_pred ccCCceEEEechhhcEEEEeecceEEEEe-ec------ccC-ceEEEEEEEe-ccEEeccccEEEEEee Q lcl|NC_012784. 343 QKGNNTLIIGNLKDAIVLFDRSQYQASWT-DY------MHF-GECLMIAVRQ-DCRILDYKSAIVIEYD 402 (415) Q Consensus 343 ~~~~~~~~~gd~~~~~~~~~~~~~~i~~~-~~------~~~-~~~~~~~~r~-d~~v~~p~a~~~~~~t 402 (415) +.++...++-+- ..+.....+.+. ++ ... ....-...|. |+.+.+|.||++++=- T Consensus 276 a~g~~~~~~~~~-----~~~~~~~~~~~p~~~~~lpvq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 339 (339) T protein:vir:94 276 ASGRLVQLWVPE-----VNGQPTGEVAFAEKLRSHSIERYSTTTRQKHSGATFGAVIYQPWAVTQELGV 339 (339) T ss_pred CCCceEEEEEEe-----ccCCcceEEEcchhhhccccEEcCceEEecceeeeeeEEEEccceeeeeecC Confidence 222322222110 000111222111 00 001 1112233564 5567889999998733 No 209 >protein:vir:100331 Length: 342 # NCBI annotation: major capsid protein N # Family: family:all:201 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655472;genbank:gi:109289940;genbank:GeneID:4157374 Probab=93.36 E-value=0.0078 Score=32.13 Aligned_cols=298 Identities=12% Similarity=0.082 Sum_probs=139.1 Q ss_pred hhhHHHHHHHHHHhhhhhhhhcccc----cccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCc Q lcl|NC_012784. 101 VTSQEVRDFTEYLETRNDIQGGSLK----TDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVA 176 (415) Q Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 176 (415) ....-...+..... ..+...+.. ..+-...|-+.+...+...+.+.+.+++.++++++.-..|...- ...+++ T Consensus 1 M~~~tr~~~~~y~~--~~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~-lg~~g~ 77 (342) T protein:vir:10 1 MKDLTLEKYNAYLA--RQAELNNLPFNALATGIKFTVQPSVQQKLYEKVRESSDFLKSISFVFVDEQTGETLG-LDSAHT 77 (342) T ss_pred CChHHHHHHHHHHH--HHHHHhCCChhHccccceeecChHHHHHHHHHHHHHHHHhccCcccccccceeeEEe-cccCcc Confidence 00000111111111 011111111 22224556667888899999999999999999999877765432 233333 Q ss_pred ccccccccc--cccccccccceeeEeeeeeEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhhccccccc Q lcl|NC_012784. 177 ALEKVEELE--ENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK--VNVLQELKLWMARTIAATRNKAIIDVITKGST 252 (415) Q Consensus 177 ~a~~v~Eg~--~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~--~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~ 252 (415) -++-+.-.+ .....+...++.-.+..++.=--+.|+.+.|...+ .+|+..+++.+.+.++.=.-.--++|+....+ T Consensus 78 iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~ 157 (342) T protein:vir:10 78 VASTTDTSGDGERKTTSIAKLVKQTYHCQQINFDTHINYKQLDMWAKFPDFQQKVANVAAKQRKRDLIMIGFNGTSRAAT 157 (342) T ss_pred cccccccCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeccC Confidence 333321111 11112223455556666666556677888777532 36777777777666543333333344332211 Q ss_pred c-----cc--------------------ccccccc--cccccccchhhHHHHH-HHHHHhhhhccCC---CEEEEcHHHH Q lcl|NC_012784. 253 G-----ST--------------------SSGFEKE--GKKLEVKKAKSLDDIK-DAINLNVKPNYEH---NVAIVSQTMF 301 (415) Q Consensus 253 ~-----~~--------------------~~~~~~~--~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~---~~~v~~~~~~ 301 (415) . +. ....... ...........+|.++ ++++.+.++.++. -++++....+ T Consensus 158 Td~~~nPllqDVN~GWlQ~~Re~ap~rv~~~~~~~~~i~iG~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLl 237 (342) T protein:vir:10 158 SDRNSNPLLQDVAKGWLQKMREDAKERVMNGESTDNQVLVGKGQEYANLDALVMDATEELIDEWHRDDTDLVVITGRKLL 237 (342) T ss_pred CChhhCcCccccchHHHHHHHhhhhhhhcccceeccceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhh Confidence 1 00 0000000 1111222344566666 4666666665554 3677777665 Q ss_pred HH-HHHhhccCCcccccCcccC---CCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecccCc Q lcl|NC_012784. 302 AK-LDKMKDKLGNYLIQPDVKE---KTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFG 377 (415) Q Consensus 302 ~~-l~~lkd~~G~~l~~~~~~~---~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~ 377 (415) .. -..|-...+.|- +-... ....+|-|+|.+..+++|..+ +++=-|++--.-+.+...+-...+..... T Consensus 238 adk~~~l~n~~~~pt--E~~Aa~~i~s~k~iGGl~a~~~PfFP~~~-----ilVT~L~NLsIY~Q~gs~RR~~~d~p~r~ 310 (342) T protein:vir:10 238 ADKYFPIVNQQNAPT--EELAADIVISQKRIGGLKAVRVPFFPANA-----ILITKLENLAIYVQEGTTRKHIENVPKKD 310 (342) T ss_pred HHHHHHHHhcCCChH--HHHHHHHHHhhhhhcCceeEEccccCCCc-----eEEeeccccEEEEecCcEEEEEEeccccc Confidence 42 222322222321 00111 113578999999999999764 56655555322233333333332222111 Q ss_pred eEEEEEEE-eccEEeccccEEEEEeecCCCCc Q lcl|NC_012784. 378 ECLMIAVR-QDCRILDYKSAIVIEYDDSERGE 408 (415) Q Consensus 378 ~~~~~~~r-~d~~v~~p~a~~~~~~t~~~~~~ 408 (415) ..--.+.| -+..|-++.+++.++-..-..++ T Consensus 311 rie~y~s~Ne~YvVEd~~~~a~iE~i~i~~~~ 342 (342) T protein:vir:10 311 RIETYESENIDYVVEDYGCAALIENITLKDKE 342 (342) T ss_pred cccchhhhccceeeeccccEEEeecceecCCC Confidence 11111112 23345556666666533222222 No 210 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=93.10 E-value=0.0088 Score=31.86 Aligned_cols=395 Identities=11% Similarity=0.024 Sum_probs=128.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh---ccccccccchhhhh Q lcl|NC_012784. 8 QSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSEN---NQQSVEVNEARTYR 84 (415) Q Consensus 8 ~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~ 84 (415) +.+-..+.++..+..+++... .+++.+...++++.++++..+++.+.++++...+....... .............. T Consensus 1 ~~~~~~l~~~~~~~~~~~~~~-~~~~~~~~aE~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~ 79 (497) T protein:vir:78 1 MPSTAQLEAQGRQLAKSIKDI-NADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPE 79 (497) T ss_pred CCcchHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 555556666665555554443 33334445566666777766777766655543332211110 00000000000000 Q ss_pred hHHHHHHHHHHHHHhhhh-hHHHHH---HHHHHhhhhhhhhccc---------------------------ccccceeec Q lcl|NC_012784. 85 NQANINDLGISIQNTKVT-SQEVRD---FTEYLETRNDIQGGSL---------------------------KTDSGFVVI 133 (415) Q Consensus 85 ~~~~~~~~~~~~~~~~~~-~~~~~~---~~~~~~~~~~~~~~~~---------------------------~~~~~~~~v 133 (415) .................. ...... .....+.......... ...++...- T Consensus 80 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 159 (497) T protein:vir:78 80 VEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTF 159 (497) T ss_pred HHhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCccc Confidence 000000000000000000 000000 0000000000000000 000000000 Q ss_pred chhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCcccccccc------cccccccccccceeeEeeeeeEEE Q lcl|NC_012784. 134 PEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEE------LEENPELAVKPFFQLAYDINTHRG 207 (415) Q Consensus 134 P~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E------g~~~~~~~~~~f~~v~~~~~k~a~ 207 (415) +..+-.++...+-+ .++.... +-..-.. ++. +.+...+..+ ..-+.|.........++..-++.. T Consensus 160 g~~vp~~~~~~ii~--~~~~~~~---i~~l~~~--~~~--~~~~~~~~~~~~~~~~a~wv~E~~~~~~s~~~f~~i~~~~ 230 (497) T protein:vir:78 160 APGILPTFLPGIVE--QLFYELS---LADLISS--RPV--TSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQV 230 (497) T ss_pred ccccchhhhHHHHH--HHHhhhh---HHhhccc--ccc--CCCceEEEEEcCCCCcceeeccCcccccccccceeeEeee Confidence 00111111111100 0011000 0000001 111 1111122111 111122222222333444333332 Q ss_pred eehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccccccccc-ccchhhHHHHHHHHHHhhh Q lcl|NC_012784. 208 YFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLE-VKKAKSLDDIKDAINLNVK 286 (415) Q Consensus 208 ~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~ 286 (415) .-.-..--+.+.-..-..+|...+...+++++...+=...=.|...+.+.+......... ........+.......+.. T Consensus 231 ~k~a~~~~iS~ell~d~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (497) T protein:vir:78 231 GKVANALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKF 310 (497) T ss_pred eeeEeecHhHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhh Confidence 211111112222222234566667777777777666555445555455555544433333 3344455666777777777 Q ss_pred hccCCCEEEEcHHHHHHHHHhhccCCcccccCcccCCCCcee-------ccee-----------eEEecccc-----ccc Q lcl|NC_012784. 287 PNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRL-------LGAK-----------IEILPDEV-----LGQ 343 (415) Q Consensus 287 ~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l-------~G~p-----------V~~~~~~~-----~~~ 343 (415) .+.....|+++...+..+....+.+|.+.+........+... ..++ +++.+... ..- T Consensus 311 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lk 390 (497) T protein:vir:78 311 PADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTK 390 (497) T ss_pred hcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHHhh Confidence 788888999999999999988888888765433222111100 0000 11100000 000 Q ss_pred cCCceEEEechhh-----c----EEEEeecceEEEEeecccCceE---------EEEEEEeccEEecc---------c-- Q lcl|NC_012784. 344 KGNNTLIIGNLKD-----A----IVLFDRSQYQASWTDYMHFGEC---------LMIAVRQDCRILDY---------K-- 394 (415) Q Consensus 344 ~~~~~~~~gd~~~-----~----~~~~~~~~~~i~~~~~~~~~~~---------~~~~~r~d~~v~~p---------~-- 394 (415) .++...++++... . ..++ |+.+..++....... +.+..|-+..+... + T Consensus 391 d~~G~~i~~~~~~~~~~~~~~~~~~l~---G~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v 467 (497) T protein:vir:78 391 DANGQYMGGNFFGNAYGNPVNGGKNIW---GVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKV 467 (497) T ss_pred cCCCceeccCcccccccccccCCceee---ceeeEecCCCCCCceEEeecccceEEEEEecccEEEeecccchhhhcCcE Confidence 0011111111000 0 0111 222222222211111 11112222222110 0 Q ss_pred cEEE-EEee-----cCCCCcccccccC Q lcl|NC_012784. 395 SAIV-IEYD-----DSERGEGDLGLEA 415 (415) Q Consensus 395 a~~~-~~~t-----~~~~~~~~~~~~~ 415 (415) +|+. ..+. +.+=-...+++.| T Consensus 468 ~~r~~~r~~~~v~~p~A~~~l~~~~~~ 494 (497) T protein:vir:78 468 TVRAEERLGLLVYRPSAFQLIQLKKGA 494 (497) T ss_pred EEEEEEeecceeeccccEEEEEecCCc Confidence 1110 0111 1111112222222 No 211 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=93.10 E-value=0.0088 Score=31.86 Aligned_cols=395 Identities=11% Similarity=0.024 Sum_probs=128.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh---ccccccccchhhhh Q lcl|NC_012784. 8 QSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSEN---NQQSVEVNEARTYR 84 (415) Q Consensus 8 ~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~ 84 (415) +.+-..+.++..+..+++... .+++.+...++++.++++..+++.+.++++...+....... .............. T Consensus 1 ~~~~~~l~~~~~~~~~~~~~~-~~~~~~~~aE~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~ 79 (497) T protein:vir:10 1 MPSTAQLEAQGRQLAKSIKDI-NADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPE 79 (497) T ss_pred CCcchHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 555556666665555554443 33334445566666777766777766655543332211110 00000000000000 Q ss_pred hHHHHHHHHHHHHHhhhh-hHHHHH---HHHHHhhhhhhhhccc---------------------------ccccceeec Q lcl|NC_012784. 85 NQANINDLGISIQNTKVT-SQEVRD---FTEYLETRNDIQGGSL---------------------------KTDSGFVVI 133 (415) Q Consensus 85 ~~~~~~~~~~~~~~~~~~-~~~~~~---~~~~~~~~~~~~~~~~---------------------------~~~~~~~~v 133 (415) .................. ...... .....+.......... ...++...- T Consensus 80 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 159 (497) T protein:vir:10 80 VEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTF 159 (497) T ss_pred HHhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCccc Confidence 000000000000000000 000000 0000000000000000 000000000 Q ss_pred chhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCcccccccc------cccccccccccceeeEeeeeeEEE Q lcl|NC_012784. 134 PEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEE------LEENPELAVKPFFQLAYDINTHRG 207 (415) Q Consensus 134 P~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E------g~~~~~~~~~~f~~v~~~~~k~a~ 207 (415) +..+-.++...+-+ .++.... +-..-.. ++. +.+...+..+ ..-+.|.........++..-++.. T Consensus 160 g~~vp~~~~~~ii~--~~~~~~~---i~~l~~~--~~~--~~~~~~~~~~~~~~~~a~wv~E~~~~~~s~~~f~~i~~~~ 230 (497) T protein:vir:10 160 APGILPTFLPGIVE--QLFYELS---LADLISS--RPV--TSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQV 230 (497) T ss_pred ccccchhhhHHHHH--HHHhhhh---HHhhccc--ccc--CCCceEEEEEcCCCCcceeeccCcccccccccceeeEeee Confidence 00111111111100 0011000 0000001 111 1111122111 111122222222333444333332 Q ss_pred eehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccccccccc-ccchhhHHHHHHHHHHhhh Q lcl|NC_012784. 208 YFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLE-VKKAKSLDDIKDAINLNVK 286 (415) Q Consensus 208 ~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~ 286 (415) .-.-..--+.+.-..-..+|...+...+++++...+=...=.|...+.+.+......... ........+.......+.. T Consensus 231 ~k~a~~~~iS~ell~d~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (497) T protein:vir:10 231 GKVANALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKF 310 (497) T ss_pred eeeEeecHhHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhh Confidence 211111112222222234566667777777777666555445555455555544433333 3344455666777777777 Q ss_pred hccCCCEEEEcHHHHHHHHHhhccCCcccccCcccCCCCcee-------ccee-----------eEEecccc-----ccc Q lcl|NC_012784. 287 PNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRL-------LGAK-----------IEILPDEV-----LGQ 343 (415) Q Consensus 287 ~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l-------~G~p-----------V~~~~~~~-----~~~ 343 (415) .+.....|+++...+..+....+.+|.+.+........+... ..++ +++.+... ..- T Consensus 311 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lk 390 (497) T protein:vir:10 311 PADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTK 390 (497) T ss_pred hcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHHhh Confidence 788888999999999999988888888765433222111100 0000 11100000 000 Q ss_pred cCCceEEEechhh-----c----EEEEeecceEEEEeecccCceE---------EEEEEEeccEEecc---------c-- Q lcl|NC_012784. 344 KGNNTLIIGNLKD-----A----IVLFDRSQYQASWTDYMHFGEC---------LMIAVRQDCRILDY---------K-- 394 (415) Q Consensus 344 ~~~~~~~~gd~~~-----~----~~~~~~~~~~i~~~~~~~~~~~---------~~~~~r~d~~v~~p---------~-- 394 (415) .++...++++... . ..++ |+.+..++....... +.+..|-+..+... + T Consensus 391 d~~G~~i~~~~~~~~~~~~~~~~~~l~---G~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v 467 (497) T protein:vir:10 391 DANGQYMGGNFFGNAYGNPVNGGKNIW---GVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKV 467 (497) T ss_pred cCCCceeccCcccccccccccCCceee---ceeeEecCCCCCCceEEeecccceEEEEEecccEEEeecccchhhhcCcE Confidence 0011111111000 0 0111 222222222211111 11112222222110 0 Q ss_pred cEEE-EEee-----cCCCCcccccccC Q lcl|NC_012784. 395 SAIV-IEYD-----DSERGEGDLGLEA 415 (415) Q Consensus 395 a~~~-~~~t-----~~~~~~~~~~~~~ 415 (415) +|+. ..+. +.+=-...+++.| T Consensus 468 ~~r~~~r~~~~v~~p~A~~~l~~~~~~ 494 (497) T protein:vir:10 468 TVRAEERLGLLVYRPSAFQLIQLKKGA 494 (497) T ss_pred EEEEEEeecceeeccccEEEEEecCCc Confidence 1110 0111 1111112222222 No 212 >protein:vir:96490 Length: 348 # NCBI annotation: head protein # Family: family:all:1083 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238492;genbank:gi:66391768;genbank:GeneID:5176912 Probab=92.94 E-value=0.0094 Score=31.70 Aligned_cols=279 Identities=11% Similarity=0.029 Sum_probs=116.2 Q ss_pred hcccccccceeecchhHHhHHHHHHhhhhhhh--hcceeEEccCCceeEEEEeecCC-c-ccccccccccccccccccce Q lcl|NC_012784. 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLD--KYVTVKRVTNGSGKYPVVRQSEV-A-ALEKVEELEENPELAVKPFF 196 (415) Q Consensus 121 ~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~--~~~~~~~~~~~~~~~~~~~~~~~-~-~a~~v~Eg~~~~~~~~~~f~ 196 (415) +.. --.+.-+..+..-|-+......+++ .++...++. ...+.+.+...+ . .+..+..+...+-.....+. T Consensus 1 M~~----i~d~f~~~~l~~~i~~~~~~~~~~l~~~~Fp~~~~~--~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~r~~~~ 74 (348) T protein:vir:96 1 MGL----IYDKVTASNIAGYFNTLQENVDSTLGESIFPARKQL--GTKLSYIKGASGQSVALKAAAFDTNVTIRDRVSAE 74 (348) T ss_pred Ccc----hhhccCHHHHHHHHHhcccchhhhhhhhcCCCcccc--ceeEEEEeecCCceeEeeeecCCCCcceeccccee Confidence 000 0111223333332322222222222 233322222 223333332222 2 24566666544434445677 Q ss_pred eeEeeeeeEEEeehhhHHHHh------cch-----HHHHHHHH---HHHHHHHHHHHHHHHhhcccccc----cccc--- Q lcl|NC_012784. 197 QLAYDINTHRGYFRISREAIE------DAK-----VNVLQELK---LWMARTIAATRNKAIIDVITKGS----TGST--- 255 (415) Q Consensus 197 ~v~~~~~k~a~~~~iS~e~l~------ds~-----~~l~~~l~---~~la~~~~~~~d~~il~g~g~~~----~~~~--- 255 (415) ..++.+-.++....++..-+. ++. ..+...|. ..+.+++...+|........+|. ..+. T Consensus 75 ~~~~~~p~i~~~~~i~~~d~~~l~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~~~~~~~~~ 154 (348) T protein:vir:96 75 IHDEQMPFFKEALLVKENDRQQLNLVKDTGNEALINTIVAGIFNDDVTLINGARARLEAMRMQVLATGKIAFTSDGVNKD 154 (348) T ss_pred eeeeecCccccccccCHHHHHHHHhhhccCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEeecCCeeEE Confidence 778887777776666543221 110 01222222 23345566566643332222111 0000 Q ss_pred -----c-cccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHH---hhcc----CCcc-cccCccc Q lcl|NC_012784. 256 -----S-SGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDK---MKDK----LGNY-LIQPDVK 321 (415) Q Consensus 256 -----~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~---lkd~----~G~~-l~~~~~~ 321 (415) . ............++.+.+.|+.+....+...+..+..++|++..|.+|+. +++. ++.. ...+... T Consensus 155 vdfg~~~~~~~t~~~~W~~~~adp~~di~~~~~~~~~~G~~~~~~i~~~~~~~~l~~~~~v~~~~~~~~~~~~~~~~~~~ 234 (348) T protein:vir:96 155 IDYGVKADHKKQVSKSWAEPGATPLADLEDAIETARELGLNPERAIMNAKTFGLIRKAASTVKAIKPLAGDGSSVTKAEL 234 (348) T ss_pred EeccCCcccceeeccccCCCCCCHHHHHHHHHHHHHhcCCcccEEEeCHHHHHHHhcCHHHHHHHhccCCccccccHHHH Confidence 0 00001111223345566788887777776667788899999999999863 3332 1111 0111111 Q ss_pred CCCCceecceeeEEeccccccccCCc-------eEEE-echhhcEEEEee--c-------------------ceEE-EEe Q lcl|NC_012784. 322 EKTQQRLLGAKIEILPDEVLGQKGNN-------TLII-GNLKDAIVLFDR--S-------------------QYQA-SWT 371 (415) Q Consensus 322 ~~~~~~l~G~pV~~~~~~~~~~~~~~-------~~~~-gd~~~~~~~~~~--~-------------------~~~i-~~~ 371 (415) ...-.++.|+++++.+..-....|.. .+++ .+-..+...+-. + ++-+ .+. T Consensus 235 ~~~~~~~~g~~i~~y~~~y~d~~G~~~~~~p~~~v~l~~~~~~G~~~yg~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (348) T protein:vir:96 235 QNYVADNYGVEIVLENGTYRNEKGEVSKFFPDGHLTLIPNGPLGNTVFGTTPEESDLFADNTVNADVEIVDSGIAVTTTK 314 (348) T ss_pred HHHHhhhcCceEEEEccEEEecCCcEeccccCCeEEEEcCCCceeEEeccChhhhhhhhcccccccceecCCeeEEEeee Confidence 11112456788877654321111211 1222 110001111100 0 0000 011 Q ss_pred ecccCceEEEEEEEeccEEeccccEEEEEeecCC Q lcl|NC_012784. 372 DYMHFGECLMIAVRQDCRILDYKSAIVIEYDDSE 405 (415) Q Consensus 372 ~~~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~~ 405 (415) +.+-...-+.+..+.=-.+.+|+++..+++.++. T Consensus 315 ~~dP~~~~~~~~s~plPv~~~~~~~~~a~Vl~~~ 348 (348) T protein:vir:96 315 TTDPVNVQTKVSMVALPSFERLGDVYMLTVIPGV 348 (348) T ss_pred cCCCceEEEEEeeeeeccccCCCcEEEEEEecCC Confidence 1111122233333333345679999999988877 No 213 >protein:vir:95603 Length: 463 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240903;genbank:gi:66394965;genbank:GeneID:5132544 Probab=92.85 E-value=0.0097 Score=31.61 Aligned_cols=293 Identities=11% Similarity=-0.042 Sum_probs=129.4 Q ss_pred ccccccchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhh--hh Q lcl|NC_012784. 73 QSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVE--FN 150 (415) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~--~~ 150 (415) ...+.+... ......... ...+.+.+. .....+..+..+++.+--+.+.++|..+.... .. T Consensus 1 ~~~~~~~~~-----------~~~~~~~~~----~e~~~KS~~--tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~ 63 (463) T protein:vir:95 1 MTIEKNLSD-----------VQQKYADQF----QEDVVKSFQ--TGYGITPDTQIDAGALRREILDDQITMLTWTNEDLI 63 (463) T ss_pred CCcccccch-----------HHHHHHhhh----hHHHHHHhh--cCCccCCccccCcchhhhhhhhhhhheeeecccchh Confidence 000000000 000000000 000000000 00112223444455554444555543322221 12 Q ss_pred hhhcceeEEccCCceeEEEEeecCC-cccccccccccccccccccceeeEeeeeeEEEeehhhHHH-HhcchHHHHHHHH Q lcl|NC_012784. 151 LDKYVTVKRVTNGSGKYPVVRQSEV-AALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREA-IEDAKVNVLQELK 228 (415) Q Consensus 151 l~~~~~~~~~~~~~~~~~~~~~~~~-~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~-l~ds~~~l~~~l~ 228 (415) +..-....++.+--..|......+. .-..+++|++.. +.+++.+......++-++....+|.-+ +.++..+....+. T Consensus 64 ~~~~i~k~~a~STV~~y~~~~~~G~~g~~~f~~E~g~~-~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~ 142 (463) T protein:vir:95 64 FYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVA-PVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILT 142 (463) T ss_pred hhhhcCCchhhhhhhhheeeeccCcccccccccccccc-ccCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHH Confidence 3333334444444444544433343 566789999975 578899999999999999888777543 3556668889999 Q ss_pred HHHHHHHHHHHHHHHhhcccccccccccccc-----c---ccccccccc-chhhHHHHHHHHHHhhhhccCCCEEEEcHH Q lcl|NC_012784. 229 LWMARTIAATRNKAIIDVITKGSTGSTSSGF-----E---KEGKKLEVK-KAKSLDDIKDAINLNVKPNYEHNVAIVSQT 299 (415) Q Consensus 229 ~~la~~~~~~~d~~il~g~g~~~~~~~~~~~-----~---~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~ 299 (415) +.-.-.++..+|.+.+.|+..=.+.+.+-+. . .......+- ...+.+.+..+-..+...+..++-++|+.- T Consensus 143 ~dai~~ia~tiE~a~FyGds~l~~~~~~~gleFDGl~~lId~enviDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~ 222 (463) T protein:vir:95 143 EDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIG 222 (463) T ss_pred HHHHHHHHHHHHHHHhhhhhccCCCcCccccchhhhhhhcCCCCeeecCCCcccHHHHhhhhhhhhcccCChhheecchH Confidence 9999999999999999997664443221111 0 111111111 122334444454455566677788999999 Q ss_pred HHHHHHHhhccCCcccccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecccCceE Q lcl|NC_012784. 300 MFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGEC 379 (415) Q Consensus 300 ~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~ 379 (415) +.+.|..---..-|.+..++.. ....|+||--.- ..+..+.+.-+.+..+... T Consensus 223 vka~f~~~~l~~qrv~~~~N~~----~~~~G~~v~~f~-----------------------s~~G~I~L~~s~~m~~~~i 275 (463) T protein:vir:95 223 VHADFVNSILGRQMQLMQDNSG----NVNTGYSVNGFY-----------------------SSRGFIKLHGSTVMENELI 275 (463) T ss_pred HHHHHHHHhcCceEEEEcCCCC----ceeeeeecccee-----------------------eeeeeeeeCCceecCCccc Confidence 9888875322222333332221 124455543110 0011111111111110000 Q ss_pred EEEEEEeccEEeccccEEEEEeec--CCCCcccccccC Q lcl|NC_012784. 380 LMIAVRQDCRILDYKSAIVIEYDD--SERGEGDLGLEA 415 (415) Q Consensus 380 ~~~~~r~d~~v~~p~a~~~~~~t~--~~~~~~~~~~~~ 415 (415) + ...+ ...|.|++..++++ .+...|-+...+ T Consensus 276 l-~~~~----~~~p~ap~~~~~tatv~~~~~~~~~~~~ 308 (463) T protein:vir:95 276 L-DESL----QPLPNAPQPAKVTATVETKQKGAFENEE 308 (463) T ss_pred c-cchh----hcCCCCccCceeEEEEeeccCCCCCCcc Confidence 0 0000 01222222222221 111111111111 No 214 >protein:vir:99311 Length: 463 # NCBI annotation: putative capsid protein # Family: family:all:2450 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024474;genbank:gi:48696433;genbank:GeneID:2948039 Probab=92.85 E-value=0.0097 Score=31.61 Aligned_cols=293 Identities=11% Similarity=-0.042 Sum_probs=129.4 Q ss_pred ccccccchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhh--hh Q lcl|NC_012784. 73 QSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVE--FN 150 (415) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~--~~ 150 (415) ...+.+... ......... ...+.+.+. .....+..+..+++.+--+.+.++|..+.... .. T Consensus 1 ~~~~~~~~~-----------~~~~~~~~~----~e~~~KS~~--tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~ 63 (463) T protein:vir:99 1 MTIEKNLSD-----------VQQKYADQF----QEDVVKSFQ--TGYGITPDTQIDAGALRREILDDQITMLTWTNEDLI 63 (463) T ss_pred CCcccccch-----------HHHHHHhhh----hHHHHHHhh--cCCccCCccccCcchhhhhhhhhhhheeeecccchh Confidence 000000000 000000000 000000000 00112223444455554444555543322221 12 Q ss_pred hhhcceeEEccCCceeEEEEeecCC-cccccccccccccccccccceeeEeeeeeEEEeehhhHHH-HhcchHHHHHHHH Q lcl|NC_012784. 151 LDKYVTVKRVTNGSGKYPVVRQSEV-AALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREA-IEDAKVNVLQELK 228 (415) Q Consensus 151 l~~~~~~~~~~~~~~~~~~~~~~~~-~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~-l~ds~~~l~~~l~ 228 (415) +..-....++.+--..|......+. .-..+++|++.. +.+++.+......++-++....+|.-+ +.++..+....+. T Consensus 64 ~~~~i~k~~a~STV~~y~~~~~~G~~g~~~f~~E~g~~-~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~ 142 (463) T protein:vir:99 64 FYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVA-PVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILT 142 (463) T ss_pred hhhhcCCchhhhhhhhheeeeccCcccccccccccccc-ccCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHH Confidence 3333334444444444544433343 566789999975 578899999999999999888777543 3556668889999 Q ss_pred HHHHHHHHHHHHHHHhhcccccccccccccc-----c---ccccccccc-chhhHHHHHHHHHHhhhhccCCCEEEEcHH Q lcl|NC_012784. 229 LWMARTIAATRNKAIIDVITKGSTGSTSSGF-----E---KEGKKLEVK-KAKSLDDIKDAINLNVKPNYEHNVAIVSQT 299 (415) Q Consensus 229 ~~la~~~~~~~d~~il~g~g~~~~~~~~~~~-----~---~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~ 299 (415) +.-.-.++..+|.+.+.|+..=.+.+.+-+. . .......+- ...+.+.+..+-..+...+..++-++|+.- T Consensus 143 ~dai~~ia~tiE~a~FyGds~l~~~~~~~gleFDGl~~lId~enviDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~ 222 (463) T protein:vir:99 143 EDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIG 222 (463) T ss_pred HHHHHHHHHHHHHHHhhhhhccCCCcCccccchhhhhhhcCCCCeeecCCCcccHHHHhhhhhhhhcccCChhheecchH Confidence 9999999999999999997664443221111 0 111111111 122334444454455566677788999999 Q ss_pred HHHHHHHhhccCCcccccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecccCceE Q lcl|NC_012784. 300 MFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGEC 379 (415) Q Consensus 300 ~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~ 379 (415) +.+.|..---..-|.+..++.. ....|+||--.- ..+..+.+.-+.+..+... T Consensus 223 vka~f~~~~l~~qrv~~~~N~~----~~~~G~~v~~f~-----------------------s~~G~I~L~~s~~m~~~~i 275 (463) T protein:vir:99 223 VHADFVNSILGRQMQLMQDNSG----NVNTGYSVNGFY-----------------------SSRGFIKLHGSTVMENELI 275 (463) T ss_pred HHHHHHHHhcCceEEEEcCCCC----ceeeeeecccee-----------------------eeeeeeeeCCceecCCccc Confidence 9888875322222333332221 124455543110 0011111111111110000 Q ss_pred EEEEEEeccEEeccccEEEEEeec--CCCCcccccccC Q lcl|NC_012784. 380 LMIAVRQDCRILDYKSAIVIEYDD--SERGEGDLGLEA 415 (415) Q Consensus 380 ~~~~~r~d~~v~~p~a~~~~~~t~--~~~~~~~~~~~~ 415 (415) + ...+ ...|.|++..++++ .+...|-+...+ T Consensus 276 l-~~~~----~~~p~ap~~~~~tatv~~~~~~~~~~~~ 308 (463) T protein:vir:99 276 L-DESL----QPLPNAPQPAKVTATVETKQKGAFENEE 308 (463) T ss_pred c-cchh----hcCCCCccCceeEEEEeeccCCCCCCcc Confidence 0 0000 01222222222221 111111111111 No 215 >protein:vir:2736 Length: 348 # NCBI annotation: putative structural protein # Family: family:all:1083 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695109;genbank:gi:23455878;genbank:GeneID:955608 Probab=92.69 E-value=0.01 Score=31.46 Aligned_cols=279 Identities=11% Similarity=0.025 Sum_probs=116.0 Q ss_pred ccccceeecchhHHhHHHHHHhhhhhhh--hcceeEEccCCceeEEEEeecCC-c-ccccccccccccccccccceeeEe Q lcl|NC_012784. 125 KTDSGFVVIPEEIVTDILKLKEVEFNLD--KYVTVKRVTNGSGKYPVVRQSEV-A-ALEKVEELEENPELAVKPFFQLAY 200 (415) Q Consensus 125 ~~~~~~~~vP~~~~~~Ii~~~~~~~~l~--~~~~~~~~~~~~~~~~~~~~~~~-~-~a~~v~Eg~~~~~~~~~~f~~v~~ 200 (415) -..-....-|.++..-|.++.....+++ .++...++ ...++.+.....+ . .+..+..+...+-.....+...++ T Consensus 1 M~~i~d~f~~~~l~~~v~~~~~~~~~~l~~~~Fp~~~~--~~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~r~~~~~~~~ 78 (348) T protein:vir:27 1 MGLIYDKVTASNIAGYFNALQENVSSTLGESIFPARKQ--LGTKLSYIKGASGQSVALKAAAFDTNVTIRDRVSAEMHDE 78 (348) T ss_pred CcchhhhcCHHHHHHHHHhccchhhhhhHhhcCCCccc--cceeEEEEeeccCceeEeeeecCCCCcceecccceeeeee Confidence 0000112223334433333333333222 12222222 2223333332222 1 234555554433333445677777 Q ss_pred eeeeEEEeehhhHHHHh------cch-HHHH----HHH---HHHHHHHHHHHHHHHHhhccccccc----ccc------- Q lcl|NC_012784. 201 DINTHRGYFRISREAIE------DAK-VNVL----QEL---KLWMARTIAATRNKAIIDVITKGST----GST------- 255 (415) Q Consensus 201 ~~~k~a~~~~iS~e~l~------ds~-~~l~----~~l---~~~la~~~~~~~d~~il~g~g~~~~----~~~------- 255 (415) .+-.+.....++..-++ ++. .+.. ..| ..++..++.+.+|........+|.- .+. T Consensus 79 ~~p~i~~~~~i~~~d~~~~~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~~al~~Gki~i~~~~~~~~vdfg 158 (348) T protein:vir:27 79 QMPFFKEAMLVKENDRQQLNLVKDSGNAVLVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFTSDGVNKDIDYG 158 (348) T ss_pred ecCccccccccCHHHHHHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEEecCCeeEEEeec Confidence 77777766666643321 110 1111 111 2334455666666544333222210 000 Q ss_pred --ccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHH---hhccCCcc-----cccCcccCCCC Q lcl|NC_012784. 256 --SSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDK---MKDKLGNY-----LIQPDVKEKTQ 325 (415) Q Consensus 256 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~---lkd~~G~~-----l~~~~~~~~~~ 325 (415) .............++.+.+.|+.+....+...+..+..++|++..|..|+. +++.-... ...+......- T Consensus 159 ~~~~~~~t~~~~W~~~~adp~~di~~~~~~~~~~G~~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~~~i~~~~~~~~~ 238 (348) T protein:vir:27 159 VKPDHKKQVSKSWAEPGATPLADLEDAIETARELGLNPERAVMNAKTFGLIRKAASTVKVIKPLAGDGSAVTKAELENYI 238 (348) T ss_pred CCcccceeeeeccCCCCCCHHHHHHHHHHHHHhcCCcccEEEECHHHHHHHhcCHHHHHHhcccCccccccCHHHHHHHH Confidence 000011111223335566788888877776677888999999999999864 33322110 01111111111 Q ss_pred ceecceeeEEeccccccccCC-------ceEEE-echhhcEEEEe--ec-------------------ceEEE-Eeeccc Q lcl|NC_012784. 326 QRLLGAKIEILPDEVLGQKGN-------NTLII-GNLKDAIVLFD--RS-------------------QYQAS-WTDYMH 375 (415) Q Consensus 326 ~~l~G~pV~~~~~~~~~~~~~-------~~~~~-gd~~~~~~~~~--~~-------------------~~~i~-~~~~~~ 375 (415) .++.|.+|++.+..-....|. ..+++ .+-..+...+- -+ ++-+. +.+.+- T Consensus 239 ~~~~g~~i~~yd~~y~d~~G~~~~~~p~~~vvl~~~~~~G~~~yG~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~dP 318 (348) T protein:vir:27 239 ADNFGVSIVLENGTYRNDKGEVSKFYPDGHLTLIPNGPLGNTVFGTTPEESDLFADNTVNAEVEIVDNGIAVTTTKTTDP 318 (348) T ss_pred HhhcCceEEEEeeEEEcCCCcCcccccCCeEEEEcCCcceeEEeccCcchhhhhhccccccceeeeCCeeEEEeeecCCC Confidence 245677777765432111111 12222 21111111110 00 00000 011111 Q ss_pred CceEEEEEEEeccEEeccccEEEEEeecCC Q lcl|NC_012784. 376 FGECLMIAVRQDCRILDYKSAIVIEYDDSE 405 (415) Q Consensus 376 ~~~~~~~~~r~d~~v~~p~a~~~~~~t~~~ 405 (415) ...-..+..+.=-.+.+|+++..+++.++. T Consensus 319 ~~~~~~~~s~~lPv~~~~~~~~~a~Vl~~~ 348 (348) T protein:vir:27 319 VNVQTKVSMVALPSFERLDDVYMLTVIPAV 348 (348) T ss_pred ceEEEEEeeeeeccccCCCcEEEEEEecCC Confidence 112222333333345678999999988887 No 216 >protein:vir:98856 Length: 343 # NCBI annotation: hypothetical protein # Family: family:all:201 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654732;genbank:gi:109302917;genbank:GeneID:4156061 Probab=92.43 E-value=0.011 Score=31.23 Aligned_cols=301 Identities=10% Similarity=0.030 Sum_probs=135.4 Q ss_pred hhhHHHHHHHHHHhhhhhhhhccc----ccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCc Q lcl|NC_012784. 101 VTSQEVRDFTEYLETRNDIQGGSL----KTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVA 176 (415) Q Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 176 (415) ....-...+....... +...+. ...+.-+.|.+.+...+...+.+.+.+++.++++++....+..-. ...++. T Consensus 1 M~~~tr~~~~~y~~~~--A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~q~~g~v~~-~~~sg~ 77 (343) T protein:vir:98 1 MNKTAQELFYSLIGDA--AEYYGANPALALAGKQFSIEAPKESVLLGAIQQRSNFLEKINCVFSERYQRAIDL-RSNRKR 77 (343) T ss_pred CChHHHHHHHHHHHHH--HHHhCCccchhccCceeeecHHHHHHHHHHHHHHHHHhhcCceecchhhcceEEE-eecCcc Confidence 0000011111111110 111111 122334667777888899999999999999999998765544332 222332 Q ss_pred ccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcch--HH-HHHHHHHHHHHHHHHHHHHHHhhcccccccc Q lcl|NC_012784. 177 ALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK--VN-VLQELKLWMARTIAATRNKAIIDVITKGSTG 253 (415) Q Consensus 177 ~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~--~~-l~~~l~~~la~~~~~~~d~~il~g~g~~~~~ 253 (415) .+.-....+...+. ...+.-.+..++.=--..|+.+.|...+ .| |+..+++.+.+.++.=.-.--++|+....+. T Consensus 78 ~t~r~~t~~~~~~~--~~~~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~deF~~r~~~~i~~~~ALD~i~IGfNGts~A~~T 155 (343) T protein:vir:98 78 HYGAHDRRTPIQQR--WTRQVMSMNVSRQIQACLIPWAKLDQWGHLKDKFASLYAEFVQNQIALDMIKIGFYGTSVGTDT 155 (343) T ss_pred ccCccccCCCcccc--ccCCCCccEEEEeeeeeeccHHHHHHhhcChhHHHHHHHHHHHHHHhhccceecccceeeccCC Confidence 22211111100000 0111123444444444567777777643 34 6666666666555433322333443322211 Q ss_pred cccc----------------------ccccccc---cccccchhhHHHHHHHHHHhhhhccCC---CEEEEcHHHHHH-H Q lcl|NC_012784. 254 STSS----------------------GFEKEGK---KLEVKKAKSLDDIKDAINLNVKPNYEH---NVAIVSQTMFAK-L 304 (415) Q Consensus 254 ~~~~----------------------~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~v~~~~~~~~-l 304 (415) ..+. ....... .........+|.++-.+..+.++.++. -++++....+.. - T Consensus 156 ~nPllqDVN~GWLQ~~Re~ap~rVm~~~~~~~~~~~~G~ggdy~NLDalV~D~~~~I~~~~~~d~dLVvivG~dLla~~~ 235 (343) T protein:vir:98 156 SDPNLADVNKGWIQFVRENKATQILTQGATSGEIRLFGEGADYVNLDELAYDLKQGLDARHRDAGDLVFLVGADLVAKEA 235 (343) T ss_pred CCcchhhcchHHHHHHHhcchhhhhccceeccceeEecCCCCcccHHHHHHHHHhcCchHHhcCCCEEEEEchhhhhhhh Confidence 1110 0000000 011222445666664444455665544 266677665443 2 Q ss_pred HHhhccCCcccccCccc---CCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecccCceEEE Q lcl|NC_012784. 305 DKMKDKLGNYLIQPDVK---EKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLM 381 (415) Q Consensus 305 ~~lkd~~G~~l~~~~~~---~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~ 381 (415) ..|-.+.+++-- .... -....+|-|+|.+..+++|..+ +++=-|++--.-+.+...+-...+.......-- T Consensus 236 ~~l~n~~~~~pt-Ek~Aa~~~~~~k~iGGl~a~~~PfFP~~~-----llVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~ 309 (343) T protein:vir:98 236 SLVYKGNGLIAT-EKAALNTHDLMKSFGGMPAMIVPNMPPRA-----AIVTSLSNLSIYTQEGSMRRGMKDDDDKKAVRD 309 (343) T ss_pred hhhhhhcCCChH-HHHHHHHHHHHHhhCCCeeEEccccCCCc-----eEEeeccccEEEEecCcEEEEEEeccccccccc Confidence 233333343211 0111 0123578999999999999764 566555553222333333333333222111111 Q ss_pred EEEE-eccEEeccccEEEEEeec--CCCCccccc Q lcl|NC_012784. 382 IAVR-QDCRILDYKSAIVIEYDD--SERGEGDLG 412 (415) Q Consensus 382 ~~~r-~d~~v~~p~a~~~~~~t~--~~~~~~~~~ 412 (415) .+.| -+..|-++.+++.++-.. -+.+.|... T Consensus 310 y~s~Ne~YvVEd~~~~a~iE~i~v~~~~~~g~w~ 343 (343) T protein:vir:98 310 SYYRNEAYAVEDCGKFMAVDFTKVKLSSGKGTWK 343 (343) T ss_pred hhhhcceeeeeccccEEEeeeeeeeecCCCCCCC Confidence 1122 233566677777776443 223444455 No 217 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=91.74 E-value=0.014 Score=30.68 Aligned_cols=304 Identities=9% Similarity=0.026 Sum_probs=131.3 Q ss_pred hhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHh--hhhhhhhcccccccceeecchhHH----hHHHHHHhhhhhhhhcce Q lcl|NC_012784. 83 YRNQANINDLGISIQNTKVTSQEVRDFTEYLE--TRNDIQGGSLKTDSGFVVIPEEIV----TDILKLKEVEFNLDKYVT 156 (415) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~vP~~~~----~~Ii~~~~~~~~l~~~~~ 156 (415) ...........+ ...............+. ..........-.+.+...+|..+. +.+++.+........++. T Consensus 1 ~~~~~~~~~l~~---~gi~~~~~~~~~~~~~~~~a~da~d~~~~~~t~~~~g~~~~l~~~i~p~~~~~~~~~~~~~~l~~ 77 (336) T protein:vir:78 1 MRDAQRIQNLAR---AGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVG 77 (336) T ss_pred CchHHHHHHHhc---cCeecchhhhhhhHHHHHHHHhhhhhccccccCCCcchHHHHHHhcccceeeehhhhhhhhhhcc Confidence 000000000000 00000000000001000 111111111112222233554332 344455444444445554 Q ss_pred eEEccCCc-eeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhc---chHHHHHHHHHHHH Q lcl|NC_012784. 157 VKRVTNGS-GKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIED---AKVNVLQELKLWMA 232 (415) Q Consensus 157 ~~~~~~~~-~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~d---s~~~l~~~l~~~la 232 (415) +...+.-. ..+.+........+.+.+-+...|- .+......+...+.++..+.++.+=+.- ...++.+--....+ T Consensus 78 v~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~-vd~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~ 156 (336) T protein:vir:78 78 ESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGD-SGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSA 156 (336) T ss_pred cccCCCccccEEEEeeeecceeeEEeecccCCCe-eecceeeEEEEEEEEEeeeeecHHHHHHHHHhCCCcHHHHHHHHH Confidence 43321111 1334444455566778888888875 4466778888888999889898443332 33467777777777 Q ss_pred HHHHHHHHHHHhhccccccccccccccccc---ccccc----ccchhhHHHHHHHHHHhhhhcc------CCCEEEEcHH Q lcl|NC_012784. 233 RTIAATRNKAIIDVITKGSTGSTSSGFEKE---GKKLE----VKKAKSLDDIKDAINLNVKPNY------EHNVAIVSQT 299 (415) Q Consensus 233 ~~~~~~~d~~il~g~g~~~~~~~~~~~~~~---~~~~~----~~~~~~~~~~~~~~~~~~~~~~------~~~~~v~~~~ 299 (415) +++.+.++.-.+.|+......+...+.... ...+. .+...-++|+..++.++..... .+..++|.|. T Consensus 157 ~ale~~~N~~~~~Gd~~~~~~GllN~P~l~a~~t~~~~~w~~~T~~~I~~Di~~~~~~l~~qt~g~~~~~~~~tL~Lp~~ 236 (336) T protein:vir:78 157 LGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPT 236 (336) T ss_pred HHHHHhhCeEEEEeccccceEEEEeCCCCCcccccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechH Confidence 777788887667776554333322221111 11111 1122345677777766654432 2458999999 Q ss_pred HHHHHHHhhccCCcccccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEe-ec----- Q lcl|NC_012784. 300 MFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWT-DY----- 373 (415) Q Consensus 300 ~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~-~~----- 373 (415) .+..|.. ++..|.-++. -+... +-++.++..+.+- ++.++...+|-.- +. ......+.+. ++ T Consensus 237 ~~~~L~~-~n~~g~tv~~-~lk~n----~Pnl~i~t~pel~-~Agg~~~~~~~~~---~~--~~~t~~~~~p~~f~~lpv 304 (336) T protein:vir:78 237 AMSDLSK-TNQYGLSAAA-KLKEI----FPKLEFVTIPEYD-TASGRLVQLWAPR---VE--GKDTATCGFTEKMRAHSI 304 (336) T ss_pred HHHhccC-CCccCccHHH-HHHHh----cCccEEEEccccc-ccCcceEEEEEee---cc--CCcceeeecchhhhccce Confidence 9998864 3333332211 11111 1122344333332 1222222222110 00 0001222111 01 Q ss_pred -ccC-ceEEEEEEEecc-EEeccccEEEEEee Q lcl|NC_012784. 374 -MHF-GECLMIAVRQDC-RILDYKSAIVIEYD 402 (415) Q Consensus 374 -~~~-~~~~~~~~r~d~-~v~~p~a~~~~~~t 402 (415) ... ....-...|.+| .+.+|.||++++=- T Consensus 305 q~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:78 305 ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred eecCceeEeccccceeeeeeeccchheeeccC Confidence 001 011122345544 56789999998733 No 218 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=90.43 E-value=0.021 Score=29.80 Aligned_cols=303 Identities=9% Similarity=-0.002 Sum_probs=131.1 Q ss_pred hhhHHHHHHHHHHHHHhhhhhHHHHHHHH--HHhhhhhhhhcccccccceeecchhHH----hHHHHHHhhhhhhhhcce Q lcl|NC_012784. 83 YRNQANINDLGISIQNTKVTSQEVRDFTE--YLETRNDIQGGSLKTDSGFVVIPEEIV----TDILKLKEVEFNLDKYVT 156 (415) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~vP~~~~----~~Ii~~~~~~~~l~~~~~ 156 (415) ...........+ .+ ............ ..-..........-.+.+...+|..+. +.+++.+........++. T Consensus 1 ~~~~~~~~~l~~--~g-i~~~~~~~~~~~~~~~~~~da~d~~~~~~~~~~~~i~~~l~~~i~p~~~~~~~~p~~a~~l~p 77 (336) T protein:vir:10 1 MRDAQRIQNLAR--AG-VILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVG 77 (336) T ss_pred CchHHHHHHHhh--cC-eeecchhhhhhhhHHHhhhhhhhccCccccCCCchhHHHHHhhcccceeeehhhhhhhhhhcc Confidence 000000000000 00 000000000000 000011111111111222334554332 444554444444444544 Q ss_pred eEEccCCc-eeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhh-HHHHhc--chHHHHHHHHHHHH Q lcl|NC_012784. 157 VKRVTNGS-GKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRIS-REAIED--AKVNVLQELKLWMA 232 (415) Q Consensus 157 ~~~~~~~~-~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS-~e~l~d--s~~~l~~~l~~~la 232 (415) +...+.-. ..+.+........+.+.+-+...|-. +..-...+...+.++..+.++ .|+..- ..+++.+--....+ T Consensus 78 v~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~~-d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~ 156 (336) T protein:vir:10 78 ESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDS-GANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSA 156 (336) T ss_pred ccccCCccceeEEEeeeeceeeEEEeeccCCCcee-ecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHHHHHHH Confidence 43321111 12233333445566778888888744 455666777788888888888 444432 33467777777777 Q ss_pred HHHHHHHHHHHhhccccccccccccccccc---c----ccccccchhhHHHHHHHHHHhhhhcc------CCCEEEEcHH Q lcl|NC_012784. 233 RTIAATRNKAIIDVITKGSTGSTSSGFEKE---G----KKLEVKKAKSLDDIKDAINLNVKPNY------EHNVAIVSQT 299 (415) Q Consensus 233 ~~~~~~~d~~il~g~g~~~~~~~~~~~~~~---~----~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~v~~~~ 299 (415) +++.+.+++-.+.|+.....-+...+.... . .....+....++|+..++..+..... .+..++|.|+ T Consensus 157 ~ale~~~N~i~~~Gd~~~~~yGllN~P~l~a~~t~~t~~~~~~t~eei~~Di~~~~~~l~~qs~G~i~~~~~~tL~LP~~ 236 (336) T protein:vir:10 157 LGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPT 236 (336) T ss_pred HHHHHhhCcEEEEeccccceEEEEeCCCCccccccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeecccCcceEEecHH Confidence 888888887666776554332222221111 1 01111223356788888887776432 3678999999 Q ss_pred HHHHHHHhhccCCcccccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEee-cceEEEEe----ecc Q lcl|NC_012784. 300 MFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDR-SQYQASWT----DYM 374 (415) Q Consensus 300 ~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~-~~~~i~~~----~~~ 374 (415) .+..|.. ++..|.-++. -+... +-++.++..+.. .++.+....++-+- .+. ....+.+. .+. T Consensus 237 ~~~~Ls~-~n~~g~Tvl~-~lk~n----~Pnl~i~t~pEl-~~a~G~~~~l~~~~------~~~~~t~~~~~p~~~~~l~ 303 (336) T protein:vir:10 237 AMSDLSK-TNQYGLAAAA-KLKDI----FPKLEFVTIPEY-DTASGRLVQLWAPR------VEGKDTATCGFTEKMRAHS 303 (336) T ss_pred HHHhccC-CCccCccHHH-HHHHh----cCccEEEEcccc-ccCCCceEEEEEEe------cCCCcceeeecchhhhccc Confidence 8888853 3333432221 11111 112233333333 22223322222110 000 01111111 100 Q ss_pred ---cC-ceEEEEEEEecc-EEeccccEEEEEee Q lcl|NC_012784. 375 ---HF-GECLMIAVRQDC-RILDYKSAIVIEYD 402 (415) Q Consensus 375 ---~~-~~~~~~~~r~d~-~v~~p~a~~~~~~t 402 (415) .. ....-...|.+| .+.+|.||++++=- T Consensus 304 vq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:10 304 IERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred eeecCceeEeccccceeeeeeeccchheeeecC Confidence 00 001122345544 56789999998733 No 219 >protein:vir:98143 Length: 524 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239203;genbank:gi:66391678;genbank:GeneID:3416245 Probab=90.40 E-value=0.021 Score=29.77 Aligned_cols=359 Identities=14% Similarity=0.113 Sum_probs=137.6 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhhhhhccccccccc Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQE-KQEELDKLKEKDGTSENNQQSVEVNE 79 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (415) |.+.++|+++..-+.+.-+ -..++... -.+++-++|=+ +++..+. .+ T Consensus 1 ~~~~~~l~~kw~p~l~~~~-------------~~~~i~~~--~~~~~~a~llenq~~~~~~--------~~--------- 48 (524) T protein:vir:98 1 MSKKNELMEKWNDLLESQE-------------GLPDIATK--SKKQLVAAILEAQEKDAET--------DP--------- 48 (524) T ss_pred CcchHHHHHHhHHHhcCCc-------------Ccchhcch--hhHHHHHHHHhhHHHHHhc--------Cc--------- Confidence 9999999999988754211 00111000 00111111111 1111100 00 Q ss_pred hhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHh---hhhhhhhcce Q lcl|NC_012784. 80 ARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKE---VEFNLDKYVT 156 (415) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~---~~~~l~~~~~ 156 (415) ..+.....+.+.............. ........ +.+.+++ ..+.+.++.++| +.....+++. T Consensus 49 --~~~~~~~~~~~~~~l~ea~~~~~~~---------~~~~~i~~-s~~t~~v---~~~~P~Li~lvRra~p~LIa~DIwG 113 (524) T protein:vir:98 49 --VYRDEKIVESFGGFLAEAEIAGDHN---------YDQTNIAS-GKSSGAI---TNIGPAVIGMVRRAIPNLIAFDICG 113 (524) T ss_pred --cccchHHHHhhhccccccccccccc---------cccccccc-ccccccc---ccccchhhhHHHHHHHhhhhhhhhe Confidence 0000011111111111100000000 00000000 1111111 122223333333 3344456777 Q ss_pred eEEccCCceeE-----EEEeecCCc-------c---------ccc----------------------------------- Q lcl|NC_012784. 157 VKRVTNGSGKY-----PVVRQSEVA-------A---------LEK----------------------------------- 180 (415) Q Consensus 157 ~~~~~~~~~~~-----~~~~~~~~~-------~---------a~~----------------------------------- 180 (415) ++||+++++-+ .+....... . +.+ T Consensus 114 VQPMTgPTGLIFAmRsrY~n~~~~~gteA~~nEAf~~~ye~dt~fSG~g~~t~~s~~~~g~~~~~g~~~~~~~~~~g~~~ 193 (524) T protein:vir:98 114 VQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAY 193 (524) T ss_pred eccCCchhhhhhhhheeecCCCCCcccccccccccccccccccccCCcccccccccccccccccccccccccccccccee Confidence 88887776542 221111000 0 000 Q ss_pred ----------------------------------ccccccc---------cccccccceeeEeeeeeEEEe-------eh Q lcl|NC_012784. 181 ----------------------------------VEELEEN---------PELAVKPFFQLAYDINTHRGY-------FR 210 (415) Q Consensus 181 ----------------------------------v~Eg~~~---------~~~~~~~f~~v~~~~~k~a~~-------~~ 210 (415) ++.|-.. ...+...|.+..+...|..+- .. T Consensus 194 ~~~~~~g~~~~tgt~p~~~~~a~~~~~~~g~~~~~~~GmsTA~aEaL~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAE 273 (524) T protein:vir:98 194 FQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQ 273 (524) T ss_pred ccccccCcccccccccccccccccccccccceeecccccchhhhhhhccCCCCccccccceeeEEEEEEEeeeccccccc Confidence 0000000 001123355666666665544 47 Q ss_pred hhHHHHhcc----hHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc---cccccccccccc-------chhhHHH Q lcl|NC_012784. 211 ISREAIEDA----KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSS---GFEKEGKKLEVK-------KAKSLDD 276 (415) Q Consensus 211 iS~e~l~ds----~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~---~~~~~~~~~~~~-------~~~~~~~ 276 (415) +|-||.+|- .+|.+++|.+-|+..|...+++.||.-.......+... ......+..... +-...+- T Consensus 274 YTiELAQDLKAVHGLDAEtELsNILSTEImlEINReii~~i~~~a~~~~~g~t~~~~~~~G~~dl~~~~d~~~~r~~~e~ 353 (524) T protein:vir:98 274 YSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGES 353 (524) T ss_pred ccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhhheeceeecccccccccceeeccccccccccchhHHH Confidence 999999983 45789999999999999999999996432221111000 000000100000 1011122 Q ss_pred HH-------HHHHHhhh--hccCCCEEEEcHHHHHHHHHh----hccCC---cccccCcccCCCCcee-cceeeEEeccc Q lcl|NC_012784. 277 IK-------DAINLNVK--PNYEHNVAIVSQTMFAKLDKM----KDKLG---NYLIQPDVKEKTQQRL-LGAKIEILPDE 339 (415) Q Consensus 277 ~~-------~~~~~~~~--~~~~~~~~v~~~~~~~~l~~l----kd~~G---~~l~~~~~~~~~~~~l-~G~pV~~~~~~ 339 (415) ++ ++.+.+.. .+...+.+++++.....|..+ -+..+ ..+-.+....-..+.| .|++|.+..++ T Consensus 354 ~~~L~~~i~~~an~I~~~T~rg~~n~~i~S~~Va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~ 433 (524) T protein:vir:98 354 YKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYA 433 (524) T ss_pred HHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhhhcccccccchhhcccccCCccceEEEEecCceEEEecCCC Confidence 22 22222222 223477899999999888863 11111 0000000000011233 35778877765 Q ss_pred cccccCCceEEEechh-----hcEEEEee-cceEEEEeecccCceEEEEEEEeccEEeccccEEEEEeecCC----CCcc Q lcl|NC_012784. 340 VLGQKGNNTLIIGNLK-----DAIVLFDR-SQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIEYDDSE----RGEG 409 (415) Q Consensus 340 ~~~~~~~~~~~~gd~~-----~~~~~~~~-~~~~i~~~~~~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~~----~~~~ 409 (415) +. ..+++|.=- ........ ....+...|...|+-.+-...|+++.+ +|= .. ..+.++ ..+. T Consensus 434 ~~-----dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP~--~~-~~~~~~~~ri~~g~ 504 (524) T protein:vir:98 434 RQ-----DYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPF--AN-SRSQAPADRITSGM 504 (524) T ss_pred Cc-----ceEEEEeeCCcccccceeeccccccccccccCCccccceeeeeeeeceee-cCc--cc-ccCCccccccccCc Confidence 42 234444200 00000000 001112234455555566666776653 442 11 122211 1344 Q ss_pred cccccC Q lcl|NC_012784. 410 DLGLEA 415 (415) Q Consensus 410 ~~~~~~ 415 (415) |..-.| T Consensus 505 ~~~~~a 510 (524) T protein:vir:98 505 ISKEMC 510 (524) T ss_pred chHhhc Confidence 443333 No 220 >protein:vir:7214 Length: 521 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049787;genbank:gi:9632597;genbank:GeneID:1258751 Probab=89.54 E-value=0.026 Score=29.30 Aligned_cols=359 Identities=14% Similarity=0.108 Sum_probs=137.2 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhhhhccccccccc Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQ-EKQEELDKLKEKDGTSENNQQSVEVNE 79 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (415) ||..++|.++..-+.+. +...++...+.. +-++|= .+++.+.. . + T Consensus 3 ~~~~~~l~~kw~p~l~~--------------~~~~~i~~~~~~---~~a~~~enq~~~~~~--------~-----~---- 48 (521) T protein:vir:72 3 IKTKAELLNKWKPLLEG--------------EGLPEIANSKQA---IIAKIFENQEKDFQT--------A-----P---- 48 (521) T ss_pred cchhHHHHHhhhhhhcc--------------CCCCccccchhh---hhhhhhhhhhhhhhh--------c-----c---- Confidence 99999999998887542 111111110011 111110 11111000 0 0 Q ss_pred hhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHh---hhhhhhhcce Q lcl|NC_012784. 80 ARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKE---VEFNLDKYVT 156 (415) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~---~~~~l~~~~~ 156 (415) ..+.....+.+............. ......... +.+.+++ ..+.+.++.++| +.....+++. T Consensus 49 --~~~~~~~~~~~~~~l~e~~~~~~~---------~~~~~~iae-s~~t~~v---~~~~P~Li~lvRra~p~LIa~DIwG 113 (521) T protein:vir:72 49 --EYKDEKIAQAFGSFLTEAEIGGDH---------GYNATNIAA-GQTSGAV---TQIGPAVMGMVRRAIPNLIAFDICG 113 (521) T ss_pred --cccchHHHHHHhhhhhhhcccCcc---------ccCcccccc-ccccccc---ccCCchhhhHHHHHHhhhhhhhcee Confidence 000000011111111110000000 000000000 1111111 123333333333 3344456778 Q ss_pred eEEccCCceeE-----EEEeecCC------------ccccc--------------------------------------- Q lcl|NC_012784. 157 VKRVTNGSGKY-----PVVRQSEV------------AALEK--------------------------------------- 180 (415) Q Consensus 157 ~~~~~~~~~~~-----~~~~~~~~------------~~a~~--------------------------------------- 180 (415) ++||+++++-+ .+...... +.+.+ T Consensus 114 VQPMTgPTGLIFAMRsrY~~q~~~~~g~ea~~~e~~~da~fSG~~~~~~~~~~~~~~~~a~Gd~~~~~~~~~gt~~~~~~ 193 (521) T protein:vir:72 114 VQPMNSPTGQVFALRAVYGKDPVAAGAKEAFHPMYGPDAMFSGQGAAKKFPALAASTQTTVGDIYTHFFQETGTVYLQAS 193 (521) T ss_pred eccCCchhhhheeeeeeecCCCCCcccccccchhcccccccccccccccccccccccccccccccccccccccccccccc Confidence 88888776542 11111000 00000 Q ss_pred ------------------------------ccccc--ccc-------ccccccceeeEeeeeeEEE-------eehhhHH Q lcl|NC_012784. 181 ------------------------------VEELE--ENP-------ELAVKPFFQLAYDINTHRG-------YFRISRE 214 (415) Q Consensus 181 ------------------------------v~Eg~--~~~-------~~~~~~f~~v~~~~~k~a~-------~~~iS~e 214 (415) +++|- ... ..+...|.+..+...|..+ ...+|-| T Consensus 194 ~~~~~~~g~t~~~~t~~~v~~~~~a~~~y~~g~gm~Ta~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiE 273 (521) T protein:vir:72 194 VQVTIDAGATDAAKLDAEIKKQMEAGALVEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIE 273 (521) T ss_pred cccccCCCCCCccccccccccccccCceeeeecccchhhhhhhcccCCcccccccceeeEEEEEEEeeeccceeccccHH Confidence 00000 000 0112235566665555554 4479999 Q ss_pred HHhcc----hHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccc---ccccccccc-------chhhHHHHHHH Q lcl|NC_012784. 215 AIEDA----KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFE---KEGKKLEVK-------KAKSLDDIKDA 280 (415) Q Consensus 215 ~l~ds----~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~---~~~~~~~~~-------~~~~~~~~~~~ 280 (415) |.+|- .+|.+++|.+-|+..|...++++||.-......-+...... ...+..... .-...+-++.+ T Consensus 274 LAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~g~~g~t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L 353 (521) T protein:vir:72 274 LAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKSGMTLTPGSKAGVFDFQDPIDIRGARWAGESFKAL 353 (521) T ss_pred HHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhheeeeeeeeeeeccCccccceecccccccccchHHHHHHHHH Confidence 99983 45789999999999999999999995432221111100000 001111111 01111222222 Q ss_pred HHHh-------hh--hccCCCEEEEcHHHHHHHHHhh-----ccCC--cccccCcccC-CCCcee-cceeeEEecccccc Q lcl|NC_012784. 281 INLN-------VK--PNYEHNVAIVSQTMFAKLDKMK-----DKLG--NYLIQPDVKE-KTQQRL-LGAKIEILPDEVLG 342 (415) Q Consensus 281 ~~~~-------~~--~~~~~~~~v~~~~~~~~l~~lk-----d~~G--~~l~~~~~~~-~~~~~l-~G~pV~~~~~~~~~ 342 (415) +.++ .. .-...+.+++++.....|...- .++| .. |..+.+. -..+.| .|++|.+..+++. T Consensus 354 ~~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~g-~~~d~~~~~~~G~l~~~~~vy~D~y~~~- 431 (521) T protein:vir:72 354 LFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQGLATG-FSTDTTKSVFAGVLGGKYRVYIDQYAKQ- 431 (521) T ss_pred HHHHHHHHHHHHHhcccccceEEEEchHHHHHHhhccccccccccccccc-ccccCCCceEEEEccCceEEEecCCCCc- Confidence 2222 22 1244678999999988888531 0000 11 1111111 011233 3578887776542 Q ss_pred ccCCceEEEechh-----hcEEEEee-cceEEEEeecccCceEEEEEEEeccEEeccccEEEEEeecCCCCcccccccC Q lcl|NC_012784. 343 QKGNNTLIIGNLK-----DAIVLFDR-SQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) Q Consensus 343 ~~~~~~~~~gd~~-----~~~~~~~~-~~~~i~~~~~~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~~~~~~~~~~~~ 415 (415) ..+++|.=- ........ ...-+...|...|+-.+-...|+++.+ +|=+-..-+-.+..-.++|..-.| T Consensus 432 ----dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP~~~~~~~~~a~~i~~~~~~~~a 505 (521) T protein:vir:72 432 ----DYFTVGYKGPNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFAESAAQAPASRIQSGMPSILN 505 (521) T ss_pred ----ceEEEEEeCCcccccceeeccccccccccccCCccccceeeeeeeeceee-cCcccccCcccceeecCcChhhhc Confidence 234444200 00000000 001112234455655566666776654 332110000001112233443333 No 221 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=84.83 E-value=0.057 Score=27.39 Aligned_cols=366 Identities=12% Similarity=0.036 Sum_probs=99.8 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHhhhhhcccccc Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDK----LKEKDGTSENNQQSVE 76 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~----~~~~~~~~~~~~~~~~ 76 (415) +|+++||+++++++.++++....+.+..--+...++++++.++++.+.+..+...+.... ................ T Consensus 6 ~k~l~el~~~~~~~~~~~~~~~~~~~~ee~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 85 (397) T protein:vir:12 6 SKKEIALRQQFTEKKQQADKALQEGNTDEARALLDEVKQLKNQIELMTEGRSLDVPDLPGGVNFVPEQERNPEGQRSQGQ 85 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhcccccccc Confidence 446889999999998888776554433323344556677777776665444433322221 1111111111111111 Q ss_pred ccchhhhhhHHHHHHHHHHHHHhhhhhHHHHHH-HHHHhhhhhhhhc-ccccccceeecchhHHhHHHHHHhhhhhhhhc Q lcl|NC_012784. 77 VNEARTYRNQANINDLGISIQNTKVTSQEVRDF-TEYLETRNDIQGG-SLKTDSGFVVIPEEIVTDILKLKEVEFNLDKY 154 (415) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~-~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~ 154 (415) ................................. ...+........+ .....-...++.......++..+-...++... T Consensus 86 ~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~ 165 (397) T protein:vir:12 86 GNEERQQQYSKAFLKGLRGKRLTDEERDLLDSPEFRAMSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQYVTVEPVTTR 165 (397) T ss_pred hhhHHHHHHHHHHHHHHhccCCcHHHHHHHhhhhhhhccccccccCcccCchhHHHHHHHhhhhhhhHHhhcceeeccCC Confidence 111111111111111111111100001100000 0001000000000 11111111122221222222222222222211 Q ss_pred cee--EEccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeee-EEE-eehhhHHHHhcchHHHHHHHHHH Q lcl|NC_012784. 155 VTV--KRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINT-HRG-YFRISREAIEDAKVNVLQELKLW 230 (415) Q Consensus 155 ~~~--~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k-~a~-~~~iS~e~l~ds~~~l~~~l~~~ 230 (415) ... ++.........+.. .+ ....+.. .+..+..+|..-++...- +.. .+.-|.--+. .+ +...|.+. T Consensus 166 ~~~~~~~~~~~~~~a~~v~--Eg---~~~~~~~-~~~~~~v~~~~~k~~~~~~is~e~l~ds~~~l~--~~-i~~~l~~~ 236 (397) T protein:vir:12 166 SGTRLLEKNADMVPFSPVE--EL---GNLPEID-QPRFTKVSYSIIDYGGIMTLSNSMLNDSDQAIM--TY-VAKWFAKK 236 (397) T ss_pred ceeEEEEEecCCcceeeec--cc---ccccccc-cccceeEEeeheeeEeeehhhHHHHhhchHHHH--HH-HHHHHHHH Confidence 111 12111222222221 11 1122221 122333444443333211 111 1121221111 12 55566666 Q ss_pred HHHHHHHHHHHHHhhccccccccc--cc--cccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHH Q lcl|NC_012784. 231 MARTIAATRNKAIIDVITKGSTGS--TS--SGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDK 306 (415) Q Consensus 231 la~~~~~~~d~~il~g~g~~~~~~--~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~ 306 (415) ++..+...+=...=++.-.+.... .. ..............-.........+..+.+.. ..+++.|.....-- T Consensus 237 ~~~~~d~~il~G~g~~~~~g~~~~~~i~~~~~~~l~~~~~~~a~~~~n~~~~~~L~~lkd~~---G~~l~~~~~~~g~~- 312 (397) T protein:vir:12 237 SVVTRNNLILAAIASLKKVDIDGLDGIKKALNVTLDPMVAPGSIVLTNQDGYDWLDTLKDGT---GRYLLQPDPTNPTK- 312 (397) T ss_pred HHHHHHHHHHhccccccccccccHHHHHHHHhhccchhhhCCCEEEEcHHHHHHHHHhhccC---CceeecccccCCCC- Confidence 666666665332222222222111 00 00000000000000000111122333333332 23444443210000 Q ss_pred hhccCCccccc-CcccCCCCceecceeeEEeccccccccCCceEEEechhhcEE--------EEeecceEEEEeecccCc Q lcl|NC_012784. 307 MKDKLGNYLIQ-PDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIV--------LFDRSQYQASWTDYMHFG 377 (415) Q Consensus 307 lkd~~G~~l~~-~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~--------~~~~~~~~i~~~~~~~~~ 377 (415) .-=.|+|++. ++...+. ...-.++++-+. ...+++++....-. .+....+.+...-+..+. T Consensus 313 -~~l~G~pv~~~~~~~~~~--~~~~~~~~~gd~-------~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~ 382 (397) T protein:vir:12 313 -KLLDGRPVVPFTNRVLKT--QKGKAPLIIGNL-------KEAIVLFDREQQSIASTDTGAGAFETNSTKVRGIEREDVR 382 (397) T ss_pred -ccccceeeEEeccccccc--CCCccEEEEEeh-------hceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccE Confidence 0003566532 1100000 000111222111 00112222111000 011111111111111000 Q ss_pred e---EEEEEEEeccE Q lcl|NC_012784. 378 E---CLMIAVRQDCR 389 (415) Q Consensus 378 ~---~~~~~~r~d~~ 389 (415) . ...+..-+-.+ T Consensus 383 ~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 383 KWDEDAVVFGQITVE 397 (397) T ss_pred EecccceEEEEEeeC Confidence 0 00111111111 No 222 >protein:vir:3746 Length: 336 # NCBI annotation: orf15 # Family: family:all:201 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043487;genbank:gi:9628622;genbank:GeneID:1261135 Probab=83.40 E-value=0.069 Score=26.96 Aligned_cols=292 Identities=11% Similarity=0.033 Sum_probs=136.0 Q ss_pred hhhHHHHHHHHHHhhhhhhhhcccc----cccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCc Q lcl|NC_012784. 101 VTSQEVRDFTEYLETRNDIQGGSLK----TDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVA 176 (415) Q Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 176 (415) ..+.....+... .+...+.. ..+..+.|.+.+...+...+.+.+.+++.++++++.-..|...- ...+++ T Consensus 1 mtr~~~~~y~~~-----~A~~ngv~~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~-lg~~g~ 74 (336) T protein:vir:37 1 MNKQAYYALAAA-----LAKHFNQPLDSVLRGESFALKAPEAALLGENIQQRSDFLKQINMIQVAHTKGQKLF-GATEKG 74 (336) T ss_pred CcHHHHHHHHHH-----HHHHhCCChhhhccCceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEee-eccCcc Confidence 111111111111 01111111 12234667778889999999999999999999999887775433 233333 Q ss_pred ccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHH--HHHHHHhhccccc--cc Q lcl|NC_012784. 177 ALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAA--TRNKAIIDVITKG--ST 252 (415) Q Consensus 177 ~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~--~~d~~il~g~g~~--~~ 252 (415) -++-..-+ - ......++.-.+..++.=--..|+.+.|+..+ .+..+..+.+...+.+ ++|.-.+.-.|+. .+ T Consensus 75 iagrtdt~-R--~~~~~~l~~~~Y~c~qTn~dt~i~y~~LD~WA-~~~df~~~~~~~~~~r~iALD~i~IGfnG~s~A~~ 150 (336) T protein:vir:37 75 VTGRKQTG-R--NLANLDHTQNGFELAETDSGIIVPWALFDSFA-IFKDRLVELYSEYFQNQVALDILQIGWNGQSVADN 150 (336) T ss_pred cccccCCC-c--cccccCcCCcccEEEEeeeeeeecHHHHHHHh-cChhHHHHHHHHHHHHHHhhchhhhcccceeeccC Confidence 33322221 1 12223455566666666666788888888764 3455555555554444 3555444333322 11 Q ss_pred cccccc----------------------c-ccccc---cccccchhhHHHHH-HHHHHhhhhccCC---CEEEEcHHHHH Q lcl|NC_012784. 253 GSTSSG----------------------F-EKEGK---KLEVKKAKSLDDIK-DAINLNVKPNYEH---NVAIVSQTMFA 302 (415) Q Consensus 253 ~~~~~~----------------------~-~~~~~---~~~~~~~~~~~~~~-~~~~~~~~~~~~~---~~~v~~~~~~~ 302 (415) ...+.+ . ...+. .........+|.++ ++++. .++.++. -+.++....++ T Consensus 151 TdnPllqDVNkGWlQ~~Re~a~~~v~~~~~~~~g~i~~~G~~gdy~NLDalV~D~~~~-I~~~~~~d~dLVvivG~dLla 229 (336) T protein:vir:37 151 TTKADLSDVNKGWLKLLQEQRAANFMTESTKSSGKITIFGDNADYANLDDLAFDLKQG-LDFRHQNRNDLVFLVGADLVS 229 (336) T ss_pred CCCCcccccchhHHHHHHhccchhhcccccccCCceEEecCCCCcccHHHHHHHHHhc-CchHHhcCCCeEEEEchhhhh Confidence 111110 0 00000 01122244566655 55554 4554443 26666665543 Q ss_pred H-HHHhhccCC-cccccCcccC---CCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecccCc Q lcl|NC_012784. 303 K-LDKMKDKLG-NYLIQPDVKE---KTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFG 377 (415) Q Consensus 303 ~-l~~lkd~~G-~~l~~~~~~~---~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~ 377 (415) . ...|-..+| +|- +.... ....+|-|+|.+..+++|..+ +++=-|++--.-+.....+-...+..... T Consensus 230 ~~~~~l~~~~~~~Pt--E~~Aa~~~~~~k~iGGlpa~~~PffP~~~-----~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~ 302 (336) T protein:vir:37 230 KETKLIQQKHGLTPT--EKAALGSHNLMGSFGGMNAITPPNFPARA-----AAVTTLKNLSVYTEAESVRRSLRNDEDKK 302 (336) T ss_pred hhhhhhhhhcCCCHH--HHHHHHHHHHHHhhCCceeEEccccCCCc-----eEEeechhcEEEEecCcEEEEEEEccccc Confidence 3 223433333 331 11111 123589999999999999764 56655555332233333333332222111 Q ss_pred eEEEEEEE-eccEEeccccEEEEEeecCCCCcccc Q lcl|NC_012784. 378 ECLMIAVR-QDCRILDYKSAIVIEYDDSERGEGDL 411 (415) Q Consensus 378 ~~~~~~~r-~d~~v~~p~a~~~~~~t~~~~~~~~~ 411 (415) ..--.+.| -+..|-++.+++.++-..-.- .+++ T Consensus 303 rie~y~s~Ne~YvVEd~~~~a~iE~i~v~~-~~e~ 336 (336) T protein:vir:37 303 GLVTSYYRQEGYVVEDLGLMTAIDHTKVKL-NGEV 336 (336) T ss_pred cccchhhhcceeeeeccccEEEeeeeeeee-cCcC Confidence 11111112 233445566666655222111 1111 No 223 >protein:vir:106998 Length: 468 # NCBI annotation: major capsid protein gp23 # Family: family:all:364 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195142;genbank:gi:58532919;uniprot:Q5GQN0;genbank:GeneID:3260495 Probab=82.76 E-value=0.074 Score=26.78 Aligned_cols=348 Identities=15% Similarity=0.089 Sum_probs=132.5 Q ss_pred hhchHH-HHHHHHHHH-----HHHHH-HHHHHHHHHHHHHHHHHHhhhhhccccccccchhhhhhHHHHHHHHHHHHHhh Q lcl|NC_012784. 28 ALNNDE-LEKAEKLEQ-----EITDL-RSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQANINDLGISIQNTK 100 (415) Q Consensus 28 ~~~e~~-~~~~~~~~~-----e~~~l-~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 100 (415) ..+.++ .+++.-+.+ ++.+. +..| -.++-+-.++.-+..+ ........+.+.. T Consensus 1 ~~~~e~l~~kW~plLe~~~~~~i~~~~k~~i---~a~llENQe~~~~~~~-----------~~~~~~~~~~~~~------ 60 (468) T protein:vir:10 1 MFNAEHLQEKWSPVLNHGEAPAIGDRYKRAV---TSVLLENQERFLREER-----------GMLNEVAVNSLGA------ 60 (468) T ss_pred CcchHHHHHhhhHhhcCCccchhccchhhhh---hhhhhhhHHHHHhccc-----------cccchhhHhhcCC------ Confidence 111111 112211111 11111 0000 0000000000000000 0000000000000 Q ss_pred hhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHh---hhhhhhhcceeEEccCCceeEEEEe-----e Q lcl|NC_012784. 101 VTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKE---VEFNLDKYVTVKRVTNGSGKYPVVR-----Q 172 (415) Q Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~---~~~~l~~~~~~~~~~~~~~~~~~~~-----~ 172 (415) ...........+..+... ..+.+.++.++| ......+++.++||+++++-+==.+ . T Consensus 61 ----------~~~~~~n~~~~~~~t~~v------~~~~P~Li~l~RRa~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~ 124 (468) T protein:vir:10 61 ----------GTIAPAGSALGSANTGGL------AGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQ 124 (468) T ss_pred ----------cccchhhhhhhhcccccc------cccCchhhhhHHHHHhhhhhhhceeeecCCccceeeeEEEEEecCC Confidence 000000011111111111 112233333333 3445567889999999887542222 1 Q ss_pred cCCcc------cccc----------------------------------------------cccccccccccccceeeEe Q lcl|NC_012784. 173 SEVAA------LEKV----------------------------------------------EELEENPELAVKPFFQLAY 200 (415) Q Consensus 173 ~~~~~------a~~v----------------------------------------------~Eg~~~~~~~~~~f~~v~~ 200 (415) .+... ..|- +.++...+. ...|.+..+ T Consensus 125 ~g~EAf~nEadt~fSg~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~~~~~~g~gMsTa~aE~lG~~-~~~f~EMaF 203 (468) T protein:vir:10 125 AGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEA-NRLFREMSF 203 (468) T ss_pred CCccceeccccccccccccccccccccccccccccCCCCCcccccccccccccccccccchHHHhhcCCC-Ccccceeee Confidence 11000 0000 000000011 123555666 Q ss_pred eeeeEEE-------eehhhHHHHhcc----hHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccc--- Q lcl|NC_012784. 201 DINTHRG-------YFRISREAIEDA----KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKL--- 266 (415) Q Consensus 201 ~~~k~a~-------~~~iS~e~l~ds----~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~--- 266 (415) ...|..+ ...+|-||.+|- .+|.+++|.+-|+..|...+++.||.-.-+-...+...+....+... T Consensus 204 sIeK~tVtAKSRaLKAeYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~va~~~k~~g~~~~Gv~d~~~ 283 (468) T protein:vir:10 204 SIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLDV 283 (468) T ss_pred EEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhcHHHHHhHhhhhhheecccccccccccccc Confidence 5555554 447999999983 45789999999999999999999986543322211111111111110 Q ss_pred cccchhhHHHHHHHHHHh---------hhhccCCCEEEEcHHHHHHHHH---hhccC---Ccccc---cCccc-CCCCce Q lcl|NC_012784. 267 EVKKAKSLDDIKDAINLN---------VKPNYEHNVAIVSQTMFAKLDK---MKDKL---GNYLI---QPDVK-EKTQQR 327 (415) Q Consensus 267 ~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~v~~~~~~~~l~~---lkd~~---G~~l~---~~~~~-~~~~~~ 327 (415) ...+-...+..+.++.++ .+-.+..+.+++++.....|.. +...- ++.-+ ..+.+ .-..+. T Consensus 284 ~~~~rw~~e~~k~L~~~i~~ean~i~~~T~rg~gn~ii~S~~Va~~L~~sG~l~~~~~~~~~~~~~~~~~D~tg~~~~G~ 363 (468) T protein:vir:10 284 DSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGT 363 (468) T ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHhhccccccEEEechhHHHHHhhcCcceecccccccccccccccccCcceEEEE Confidence 111122222233232222 1234556789999999999986 33110 11100 00111 111233 Q ss_pred ec-ceeeEEeccccccccCCceEEEechh-----hcEEEEeecceEE-EEeecccCceEEEEEEEeccEEeccccEEEEE Q lcl|NC_012784. 328 LL-GAKIEILPDEVLGQKGNNTLIIGNLK-----DAIVLFDRSQYQA-SWTDYMHFGECLMIAVRQDCRILDYKSAIVIE 400 (415) Q Consensus 328 l~-G~pV~~~~~~~~~~~~~~~~~~gd~~-----~~~~~~~~~~~~i-~~~~~~~~~~~~~~~~r~d~~v~~p~a~~~~~ 400 (415) |. |++|.+..++... .....+++|.=- ........-.+.. ...|...|+-.+-...|+++.+ +|=+... . T Consensus 364 l~~r~~vy~D~Ya~~~-s~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP~~~~~-~ 440 (468) T protein:vir:10 364 INGRIKVFVDPYAANL-SDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFKTRYGMVS-NPFVTTN-G 440 (468) T ss_pred ecCceEEEEccccccC-CccceEEEEEecCcceeceeeeccccccccccccCCCcccceeeeeeeeceee-cccceec-c Confidence 33 5677776554321 112234444200 0000000001111 1234455666666667877654 4533211 2 Q ss_pred eecCCCCcccccccC Q lcl|NC_012784. 401 YDDSERGEGDLGLEA 415 (415) Q Consensus 401 ~t~~~~~~~~~~~~~ 415 (415) ++...+.+.|...-+ T Consensus 441 ~~~g~~~~~~~~~~~ 455 (468) T protein:vir:10 441 LYNGTPDGEALTPNA 455 (468) T ss_pred ccCCCcccccccccc Confidence 444444444443333 No 224 >protein:vir:98480 Length: 348 # NCBI annotation: ORFp38 # Family: family:all:1083 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958280;genbank:gi:41057254;uniprot:Q38595;genbank:GeneID:2732864 Probab=82.50 E-value=0.076 Score=26.71 Aligned_cols=275 Identities=11% Similarity=0.011 Sum_probs=110.3 Q ss_pred cccccccceeecchhHHhHHHHHHhhhh---hhh-hcceeEEccCCceeEEEEeecCC-c-ccccccccccccccccccc Q lcl|NC_012784. 122 GSLKTDSGFVVIPEEIVTDILKLKEVEF---NLD-KYVTVKRVTNGSGKYPVVRQSEV-A-ALEKVEELEENPELAVKPF 195 (415) Q Consensus 122 ~~~~~~~~~~~vP~~~~~~Ii~~~~~~~---~l~-~~~~~~~~~~~~~~~~~~~~~~~-~-~a~~v~Eg~~~~~~~~~~f 195 (415) ...+. .....-|.++..-|-..+.... -++ .+....+ ..+..+.+.+.... + .+..++.+...+-.....+ T Consensus 1 M~~~~-~~d~~~~~~l~~~i~~~~~~~~~~~~l~~~~fp~~~--~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~r~g~ 77 (348) T protein:vir:98 1 MSWTL-DTEFIEPTQLTGLIREALRDLQVNRFRLARWLPNVD--VDDITFEFLRGGGGLAETASYRSWDTESKIGRREGL 77 (348) T ss_pred Ccchh-hhhccCHHHHHHHHHHHhhccCcchhhHHhcCCCcc--ccceEEEEEeccCCceeeeeeecCCCccceeecccc Confidence 00011 1112234444443333221111 111 2222222 22334444333222 1 3445665655443333457 Q ss_pred eeeEeeeeeEEEeehhhHHHHhc---ch-HHHHHHHH---HHHHHHHHHHHHHHHhhccccccc------c----ccc-c Q lcl|NC_012784. 196 FQLAYDINTHRGYFRISREAIED---AK-VNVLQELK---LWMARTIAATRNKAIIDVITKGST------G----STS-S 257 (415) Q Consensus 196 ~~v~~~~~k~a~~~~iS~e~l~d---s~-~~l~~~l~---~~la~~~~~~~d~~il~g~g~~~~------~----~~~-~ 257 (415) ...++.+-.++....++.+=+.. +. ..+..+|. .++..++.+.+|........+|.- . +.. . T Consensus 78 ~~~~~~~~~i~~~~~i~~~d~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~~g~~~~vDyg~~~~ 157 (348) T protein:vir:98 78 AKVMGELPPISEKIPLNEYDRLRLRKLSRDEALPFIARDAQRLARNIGARFEVARGSALVNATVPVTELQQTVDFGRIGS 157 (348) T ss_pred eeeeeeccccccccccCHHHHHHhcCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEecCceEEccccCcc Confidence 77888887777766666542211 11 12233333 334556666665432222111110 0 000 0 Q ss_pred ccccccccc-cccchhhHHHHHHHHHHhhhh-ccCCCEEEEcHHHHHHHHH---hhcc-CC------cccccCcccCCCC Q lcl|NC_012784. 258 GFEKEGKKL-EVKKAKSLDDIKDAINLNVKP-NYEHNVAIVSQTMFAKLDK---MKDK-LG------NYLIQPDVKEKTQ 325 (415) Q Consensus 258 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~-~~~~~~~v~~~~~~~~l~~---lkd~-~G------~~l~~~~~~~~~~ 325 (415) +........ ...+.+.+.|+.+....+... +..+..++|++..|..|+. +++. .+ .++..+...... T Consensus 158 ~~~t~~~~Ws~~~~adp~~di~~~~~~~~~~~G~~p~~~vm~~~~~~~l~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~- 236 (348) T protein:vir:98 158 HSVVAAVLWSVHATATPISDLESWVATYEDTNGQSPGVILMPKAAVSHMRQCEEVIRQVFPLAPSGTAPMVSVEQLNTV- 236 (348) T ss_pred cccccccccCCCCCCCHHHHHHHHHHHHHHccCCcceEEEeCHHHHHHHhcCHHHHHHHhccCccccccccCHHHHHHH- Confidence 000001111 123345678888887777654 6778999999999999863 3332 11 112211110000 Q ss_pred ceeccee-eEEeccccccccCCceEEEechhhcEEEEeec-----------------------------------ceEE- Q lcl|NC_012784. 326 QRLLGAK-IEILPDEVLGQKGNNTLIIGNLKDAIVLFDRS-----------------------------------QYQA- 368 (415) Q Consensus 326 ~~l~G~p-V~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~-----------------------------------~~~i- 368 (415) -.-+|.| |.+.+..-. ..|...-++.+ +.+.+.... ++-+ T Consensus 237 ~~~~g~~~i~~~d~~~~-~~g~~~~~~p~--~~i~l~p~~~~~~~~~~~~~G~t~~G~~~e~~~~~~~~~~~~~~~i~~~ 313 (348) T protein:vir:98 237 LSSMGLPPIEVYDAKVA-VDGVSTRITPA--NAIALLPEPGATDAAQPTELGATLLGTTAESLEDDYALAPGEQPGIVAA 313 (348) T ss_pred HHhhCCeEEEEeeeEEE-cCCceeceecC--CeEEEEecCCcccccccccccceecccchhhhccccccceeccCceeee Confidence 1123554 444332211 11111112211 111111000 0000 Q ss_pred EEeecccCceEEEEEEEeccEEeccccEEEEEeec Q lcl|NC_012784. 369 SWTDYMHFGECLMIAVRQDCRILDYKSAIVIEYDD 403 (415) Q Consensus 369 ~~~~~~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~ 403 (415) .+.+.+--..-+.+..+.=-.+.+|++++.+++-+ T Consensus 314 ~~~~~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl~ 348 (348) T protein:vir:98 314 TWKTKDPVRLWTHAAAVGIPVLREPNLTFKAQVLA 348 (348) T ss_pred eeeecCCcEEEEEEeeeeeccccCCCcEEEEEEeC Confidence 01111111222333334334456789999998877 No 225 >protein:vir:6601 Length: 528 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891732;genbank:gi:33620668;genbank:GeneID:1725275 Probab=81.99 E-value=0.081 Score=26.57 Aligned_cols=358 Identities=14% Similarity=0.097 Sum_probs=129.9 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhhhhccccccccc Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQ-EKQEELDKLKEKDGTSENNQQSVEVNE 79 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (415) ||..++|+++..-+.+. +...++... -.+++-++|= .+++.++... T Consensus 1 ~~~~~~l~~kw~p~l~~--------------~~~~~i~~~--~~~~~~a~l~enq~~~~~~~~----------------- 47 (528) T protein:vir:66 1 MKTTKELMEKWSPLLEN--------------EKLPEIATA--SKQKLVAKILESQEADFAVDP----------------- 47 (528) T ss_pred CcchHHHHHHhHHhhcC--------------CCcchhcch--hhhhhhhhhhhhhHHHhhccc----------------- Confidence 99999999999887542 111111000 0000011110 0000000000 Q ss_pred hhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHH---hhhhhhhhcce Q lcl|NC_012784. 80 ARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLK---EVEFNLDKYVT 156 (415) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~---~~~~~l~~~~~ 156 (415) ..+.....+.+............... ....... +.+.+.+ ..+.+.++.++ .+...-.+++. T Consensus 48 --~~~~~~~~~~~~~~l~ea~~~~~~~~---------~~~~i~e-s~~t~~v---~~~~P~Li~lvRRa~p~LIa~DIwG 112 (528) T protein:vir:66 48 --IYKDEKVVEAFGGFIAEAEVAGDHGY---------DASQIAA-GQTTGAI---TNVGPAVIGMVRRAIPNLIAFDICG 112 (528) T ss_pred --chhhHHHHHhhhhhhhhhcccccccc---------cchhccc-ccccccc---ccCchhHHHHHHHHHHhhhhhhhhe Confidence 00000000000000000000000000 0000000 0000000 11222222222 22233344556 Q ss_pred eEEccCCceeE-----EEEee----------------------------------------------------------- Q lcl|NC_012784. 157 VKRVTNGSGKY-----PVVRQ----------------------------------------------------------- 172 (415) Q Consensus 157 ~~~~~~~~~~~-----~~~~~----------------------------------------------------------- 172 (415) ++||+++++-+ .+... T Consensus 113 VQPMTgPTGlIFAmRs~Y~~~~~~~~~~eAfh~~~g~ea~fsea~t~~a~~gGpTGliFAm~s~y~s~~~g~ea~~nea~ 192 (528) T protein:vir:66 113 VQPMSTPTSQIFAIRSVYGGDPLKSGAREAFHPMYAPDAFHSSLAAKEATVGSPTGTAFAKLTLSQAITAGDIVYHTFAE 192 (528) T ss_pred eecCCchhhhheeeeeeecCCcccccccccccccccccccccccccccccccCCccceeecccccccccccceeeecccc Confidence 66665532100 00000 Q ss_pred -------------------------------cCCcccc--------cccccc-cccccccccceeeEeeeeeEE------ Q lcl|NC_012784. 173 -------------------------------SEVAALE--------KVEELE-ENPELAVKPFFQLAYDINTHR------ 206 (415) Q Consensus 173 -------------------------------~~~~~a~--------~v~Eg~-~~~~~~~~~f~~v~~~~~k~a------ 206 (415) ....... -.+|.. .....+...|.+..+...|.. T Consensus 193 t~fs~~~~~~~~~~~~~~~g~~~g~~~~~~~~a~~~~~~~~~Gm~Ta~aEale~lg~~s~~~f~EMaFsIeK~tVtAKSR 272 (528) T protein:vir:66 193 TGIAYLQNVTGDSVTPQKVGSESEDEVVMKLIEEGKLAEIAFGMATSIAEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSR 272 (528) T ss_pred cceeeeccccccccccCcccccccccccccccccccceecccccchhhhhhhcccCCCcccchhhcceEEEeEEEEeecc Confidence 0000000 011100 000011122445555554444 Q ss_pred -EeehhhHHHHhcc----hHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccc---cccccccccc------c-h Q lcl|NC_012784. 207 -GYFRISREAIEDA----KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGF---EKEGKKLEVK------K-A 271 (415) Q Consensus 207 -~~~~iS~e~l~ds----~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~---~~~~~~~~~~------~-~ 271 (415) -...+|-||.+|- .+|.++.|.+-|+..|...++++||.-......-+..... ....+..... + - T Consensus 273 aLKAEYTiELAQDLKAIHGLDAEtELsNILStEImlEINREii~~i~~~a~~~~~~~t~~~~~~aG~~dl~~~~d~~g~r 352 (528) T protein:vir:66 273 QLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGAR 352 (528) T ss_pred ceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhheeeeeeeeeeeccccccceeecccccccccch Confidence 4457999999983 4588999999999999999999997543222211110000 0000111110 0 1 Q ss_pred hhHHHHHHH-------HHHhhh--hccCCCEEEEcHHHHHHHHHh-----hcc-CCcccccCcccCCC-Cceec-ceeeE Q lcl|NC_012784. 272 KSLDDIKDA-------INLNVK--PNYEHNVAIVSQTMFAKLDKM-----KDK-LGNYLIQPDVKEKT-QQRLL-GAKIE 334 (415) Q Consensus 272 ~~~~~~~~~-------~~~~~~--~~~~~~~~v~~~~~~~~l~~l-----kd~-~G~~l~~~~~~~~~-~~~l~-G~pV~ 334 (415) ...+-++.+ .+.+.. .+...+.+++++.....|... .+. .....+..+.+... .++|. |++|. T Consensus 353 w~~e~~k~L~~~i~~~an~I~~~T~r~~gn~vi~S~~Va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy 432 (528) T protein:vir:66 353 WAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAKGLNTDTTKAVFAGVLAGKYKVF 432 (528) T ss_pred hHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhccccccccccccccccccCCCCceeEEEecCceEEE Confidence 112222222 222222 223457899999998888652 111 11222222222111 23444 57888 Q ss_pred EeccccccccCCceEEEech---h--hcEEEEee-cceEEEEeecccCceEEEEEEEeccEEeccccEEEEEeecC---- Q lcl|NC_012784. 335 ILPDEVLGQKGNNTLIIGNL---K--DAIVLFDR-SQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIEYDDS---- 404 (415) Q Consensus 335 ~~~~~~~~~~~~~~~~~gd~---~--~~~~~~~~-~~~~i~~~~~~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~---- 404 (415) +..+++. ..+++|.= . ........ ...-....|...|+-.+-...|+++.+ +|=+ . ..+.. T Consensus 433 ~D~y~~~-----dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~v-NP~~--~-~~~~~~~~r 503 (528) T protein:vir:66 433 IDQYARQ-----DYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVLGFKTRYGIGI-NPFA--D-SKSQEPSAR 503 (528) T ss_pred ecCCCCc-----ceEEEEEeCCcccccceeecccccceeeEeeCCccccceeeeeeeeceee-cCcc--c-ccCcccccc Confidence 7776542 23444320 0 00000000 011112344455655555566766543 3411 1 11111 Q ss_pred CCCcccccccC Q lcl|NC_012784. 405 ERGEGDLGLEA 415 (415) Q Consensus 405 ~~~~~~~~~~~ 415 (415) ...++|..-.| T Consensus 504 i~~g~~~~~~a 514 (528) T protein:vir:66 504 ITSGMLSKDSV 514 (528) T ss_pred ccccchhhhhc Confidence 11233333333 No 226 >protein:vir:106734 Length: 336 # NCBI annotation: gp13 # Family: family:all:1653 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944321;genbank:gi:38638620;genbank:GeneID:2657363 Probab=81.57 E-value=0.085 Score=26.47 Aligned_cols=304 Identities=9% Similarity=0.022 Sum_probs=124.6 Q ss_pred hhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHh--hhhhhhhcccccccceeecchhHH----hHHHHHHhhhhhhhhcce Q lcl|NC_012784. 83 YRNQANINDLGISIQNTKVTSQEVRDFTEYLE--TRNDIQGGSLKTDSGFVVIPEEIV----TDILKLKEVEFNLDKYVT 156 (415) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~vP~~~~----~~Ii~~~~~~~~l~~~~~ 156 (415) ...........+ ...............+. ..........-.+.+...+|..+. +.+++.+........++. T Consensus 1 ~~~~~~~~~l~~---~gi~~~~~~~~~~~~~~~~a~da~d~~~~~~t~~~~g~~~~l~~~i~p~~~~~~~~~~~~~~l~~ 77 (336) T protein:vir:10 1 MRDAQRIQNLAR---AGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVG 77 (336) T ss_pred CchHHHHHHHhc---cCeecchhhhhhhHHHHHHHHhhhhhccccccCCCcchHHHHHhhcCcceeeeeechhchhhhcc Confidence 000000000000 00000000000001000 111111111112222233554333 233333333333333333 Q ss_pred eEEccCCc-eeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhc---chHHHHHHHHHHHH Q lcl|NC_012784. 157 VKRVTNGS-GKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIED---AKVNVLQELKLWMA 232 (415) Q Consensus 157 ~~~~~~~~-~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~d---s~~~l~~~l~~~la 232 (415) +.....-. ....+........+.+.+.+...|-. +.....-..+.+.++..+.++.+=+.- ...++.+--....+ T Consensus 78 v~t~g~w~~~~~~~~~~e~~G~a~~ygd~~d~P~~-d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~ 156 (336) T protein:vir:10 78 ESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDS-GTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSA 156 (336) T ss_pred cccCCCcceeeEEEEeeeeeeeEEEccccCCCcce-eeeeeeeeeeEEEEEEEEeeCHHHHHHHHHhCCCcHHHHHHHHH Confidence 32211110 01122233333455566777777754 455666777788888888888443332 33467777777777 Q ss_pred HHHHHHHHHHHhhccccccccccccccccc---cccc----cccchhhHHHHHHHHHHhhhhcc------CCCEEEEcHH Q lcl|NC_012784. 233 RTIAATRNKAIIDVITKGSTGSTSSGFEKE---GKKL----EVKKAKSLDDIKDAINLNVKPNY------EHNVAIVSQT 299 (415) Q Consensus 233 ~~~~~~~d~~il~g~g~~~~~~~~~~~~~~---~~~~----~~~~~~~~~~~~~~~~~~~~~~~------~~~~~v~~~~ 299 (415) +++.+.++.-.+.|+.....-+...+.... ...+ ..+...-++|+..++.++..... .+..++|.|. T Consensus 157 ~ale~~~N~~~~~Gd~~~~~~GllN~P~l~a~~t~~~~~w~~~T~~eI~~Di~~~~~~l~~qt~g~i~~~~~~tL~Lp~~ 236 (336) T protein:vir:10 157 LGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPT 236 (336) T ss_pred HHHHHhhCeEEEEeecccceEEEeecCCCCcccccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechH Confidence 777777777666676554333322221111 0011 11223345777777777654432 2458999999 Q ss_pred HHHHHHHhhccCCcccccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEe-ec----- Q lcl|NC_012784. 300 MFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWT-DY----- 373 (415) Q Consensus 300 ~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~-~~----- 373 (415) .+..|.. ++..|.-++. -+... +-++.++..+.+- ++.++...+|-.-.. ....+.+.+. ++ T Consensus 237 ~~~~L~~-~n~~g~tv~~-~lk~n----~Pnl~i~t~pel~-~Agg~~~~~~~~~~~-----~~~t~~~~~P~~f~~lpv 304 (336) T protein:vir:10 237 AMSDLSK-TNQYGLSAAA-KLKEI----FPKLEFVTIPEYD-TASGRLVQLWAPRVE-----GKDTATCGFTEKMRAHSI 304 (336) T ss_pred HHHhccC-CCccCccHHH-HHHHh----CCccEEEEccccc-ccCCceEEEEEeccc-----CCcceeeecChhhhccce Confidence 9999864 3333432221 11111 1122344433332 222222222211100 0001121111 01 Q ss_pred -ccC-ceEEEEEEEecc-EEeccccEEEEEee Q lcl|NC_012784. 374 -MHF-GECLMIAVRQDC-RILDYKSAIVIEYD 402 (415) Q Consensus 374 -~~~-~~~~~~~~r~d~-~v~~p~a~~~~~~t 402 (415) ... ....-...|.+| .+.+|-||++++=- T Consensus 305 q~~~~~~~v~~~~rt~Gv~i~rP~ai~~~~GI 336 (336) T protein:vir:10 305 ERYSSYFRQKKSAGTWGAVIFRPFAVAQMLGV 336 (336) T ss_pred eecCceeEeccccceeeeeeeccchheeeccC Confidence 001 001122345544 56789999998733 No 227 >protein:vir:4902 Length: 348 # NCBI annotation: gp348 # Family: family:all:1083 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056680;genbank:gi:9635015;genbank:GeneID:1262657 Probab=81.47 E-value=0.085 Score=26.44 Aligned_cols=279 Identities=10% Similarity=0.034 Sum_probs=111.1 Q ss_pred hcccccccceeecchhHHhHHHHHHhhhhhhh-h-cceeEEccCCceeEEEEeecCC-c-ccccccccccccccccccce Q lcl|NC_012784. 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLD-K-YVTVKRVTNGSGKYPVVRQSEV-A-ALEKVEELEENPELAVKPFF 196 (415) Q Consensus 121 ~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~-~-~~~~~~~~~~~~~~~~~~~~~~-~-~a~~v~Eg~~~~~~~~~~f~ 196 (415) +... -...-|..+..-|-.......+++ + ++...++. ..++.......+ . .+..+..+...+-.....+. T Consensus 1 M~~l----~d~f~~~~l~~~v~~~~~~~~~~l~~~~Fp~~~~~--~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~r~~~~ 74 (348) T protein:vir:49 1 MGLI----YDKVTASNIAGYFNALQENVDSTLGESIFPARKQL--GTKLSYITGASGQSVALKAAAFDTNVTVRDRVSAE 74 (348) T ss_pred Ccch----hhhcCHHHHHHHHHhccccchhhhHhhcCCCcccc--CceeEEEEeecCceeeeeeecCCCCcceeccccee Confidence 0000 011112233232222221111211 1 22222222 122222222221 2 34455555443333344567 Q ss_pred eeEeeeeeEEEeehhhHHHH------hcc-hHHHHHHHH-------HHHHHHHHHHHHHHHhhccccccc----ccc--- Q lcl|NC_012784. 197 QLAYDINTHRGYFRISREAI------EDA-KVNVLQELK-------LWMARTIAATRNKAIIDVITKGST----GST--- 255 (415) Q Consensus 197 ~v~~~~~k~a~~~~iS~e~l------~ds-~~~l~~~l~-------~~la~~~~~~~d~~il~g~g~~~~----~~~--- 255 (415) ..++.+-.++....++..-+ .++ ..+....+. ..+..++.+.+|........+|.- .+. T Consensus 75 ~~~~~~p~i~~~~~i~~~d~~~l~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~i~~~g~~~~ 154 (348) T protein:vir:49 75 MHDEQMPFFKEAMLVKENDRQQLNLVKDSGNAALVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFTSDGVNKD 154 (348) T ss_pred eeeeecCccccccccCHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEecCCceEE Confidence 77777777776666653321 111 111222222 223355666666544333222210 000 Q ss_pred -----c-cccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHH---hhcc----CCcc-cccCccc Q lcl|NC_012784. 256 -----S-SGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDK---MKDK----LGNY-LIQPDVK 321 (415) Q Consensus 256 -----~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~---lkd~----~G~~-l~~~~~~ 321 (415) . .+..........++.+.+.|+.+....+...+..+..++|++..|..|.. +++. ++.. ...+... T Consensus 155 vdyg~~~~~~~t~~~~W~~~~adp~~di~~~~~~~~~~G~~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~~~i~~~~~ 234 (348) T protein:vir:49 155 IDYGVKPDHKKQVSKSWAEPGATPLADLEDAIETARELGLNPERAVMNAKTFGLIRKAASTVKVIKPLAGDGSSVTKAEL 234 (348) T ss_pred EeecCCcccceeeeeccCCCCCCHHHHHHHHHHHHHhcCCcccEEEeCHHHHHHHhcCHHHHHHhhccCcccccccHHHH Confidence 0 00011111223345566788887777776677888999999999999854 2221 1111 0111111 Q ss_pred CCCCceecceeeEEeccccccccCCc-------eEEEechh-hcEEEEe--e-------------------cceEEE-Ee Q lcl|NC_012784. 322 EKTQQRLLGAKIEILPDEVLGQKGNN-------TLIIGNLK-DAIVLFD--R-------------------SQYQAS-WT 371 (415) Q Consensus 322 ~~~~~~l~G~pV~~~~~~~~~~~~~~-------~~~~gd~~-~~~~~~~--~-------------------~~~~i~-~~ 371 (415) ...-.++.|.+|++.+..-....|.. .++++... .+...+- - .++-+. +. T Consensus 235 ~~~~~~~~g~~i~~y~~~y~d~dG~~~~~~p~~~v~l~~~~~~G~~~yg~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (348) T protein:vir:49 235 DNYIADNFGVTVVLENGTYRNEKGEVSKFFPDGHLTLIPNGPLGNTVFGTTPEESDLFADNTVNADVEIVDNGIAVTTTK 314 (348) T ss_pred HHHHHhhcCceEEEEeeEEEecCCcEeeeecCCeEEEecCCCcceeEEecChhhhhhccccccccceeecCCeEEEeeee Confidence 11112456777776654321111211 12221110 0111110 0 000000 00 Q ss_pred ecccCceEEEEEEEeccEEeccccEEEEEeecCC Q lcl|NC_012784. 372 DYMHFGECLMIAVRQDCRILDYKSAIVIEYDDSE 405 (415) Q Consensus 372 ~~~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~~ 405 (415) ..+-...-..+..+.=-.+.+|+++..+++.++. T Consensus 315 ~~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl~~~ 348 (348) T protein:vir:49 315 TTDPVNVQTKVSMVALPSFERLDDVYMLTVIPAV 348 (348) T ss_pred cCCCceEEEEEeeeccccccCCCcEEEEEEecCC Confidence 0111111122222322345678999999998887 No 228 >protein:vir:3783 Length: 336 # NCBI annotation: capsid # Family: family:all:201 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536823;genbank:gi:17981832;genbank:GeneID:929211 Probab=79.78 E-value=0.1 Score=26.04 Aligned_cols=292 Identities=11% Similarity=0.030 Sum_probs=135.6 Q ss_pred hhhHHHHHHHHHHhhhhhhhhcccc----cccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCc Q lcl|NC_012784. 101 VTSQEVRDFTEYLETRNDIQGGSLK----TDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVA 176 (415) Q Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 176 (415) ..+.....+... .+...+.. ..+..+.|.+.+...+...+.+.+.+++.++++++.-..|...- ...+++ T Consensus 1 mtr~~~~~y~~~-----~A~~ngv~~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~-lg~~g~ 74 (336) T protein:vir:37 1 MNKQAYYALAAA-----LAKHFNQPLDSVLRGESFALKAPEAALLGENIQQRSDFLKGINMVQVAHTKGTKLF-GATEKG 74 (336) T ss_pred CcHHHHHHHHHH-----HHHHhCCChhhhcccceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEEe-eccCcc Confidence 111111111111 01111111 12235677778888999999999999999999999887775433 233333 Q ss_pred ccccccccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHH--HHHHHHhhccccc--cc Q lcl|NC_012784. 177 ALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAA--TRNKAIIDVITKG--ST 252 (415) Q Consensus 177 ~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~--~~d~~il~g~g~~--~~ 252 (415) -++-..-+... ....++.-.+..++.=--..|+.+.|+..+ .+..+..+.+...+.+ ++|.-.+.-.|+. .+ T Consensus 75 iagrtdt~r~r---~~~~l~~~~Y~c~qTn~dt~i~y~~LD~WA-~~~d~~~~~~~~~~~r~iALD~i~IGfnG~s~A~~ 150 (336) T protein:vir:37 75 VTGRKQTGRNL---ATLDHSQNGYELSETDSGILVNWSLFDSFA-IFKDRLVELYSEYFQNQVALDILQIGWNGQSVATN 150 (336) T ss_pred cccccCCCCCc---cccCCCCCccEEEEeeeeeeccHHHHHHHh-cChhHHHHHHHHHHHHHHhcchhhhcccceeeccC Confidence 33332222211 122344455555555556688888888764 3555555555555544 4565544333321 11 Q ss_pred cccccc----------------------c-ccccc---cccccchhhHHHHH-HHHHHhhhhccCC---CEEEEcHHHHH Q lcl|NC_012784. 253 GSTSSG----------------------F-EKEGK---KLEVKKAKSLDDIK-DAINLNVKPNYEH---NVAIVSQTMFA 302 (415) Q Consensus 253 ~~~~~~----------------------~-~~~~~---~~~~~~~~~~~~~~-~~~~~~~~~~~~~---~~~v~~~~~~~ 302 (415) ...+.+ . ...+. .........+|.++ ++++. .++.++. -+.++....++ T Consensus 151 TdnPllqDVNkGWlQ~~Re~a~~~v~~~~~~~~g~i~~~G~~gdy~NLDalV~D~~~~-I~~~~~~d~dLVvivG~dLla 229 (336) T protein:vir:37 151 TTKTDLSDVNKGWLKLLQEQRAANFMTESTKSSGKITIFGDNADYANLDDLAFDLKQG-LDFRHQNRNDLVFLVGADLVS 229 (336) T ss_pred CCCccccccchhHHHHHHhccchhhcccccccCCceEEecCCCCcccHHHHHHHHHhc-cchHHhcCCCeEEEEchhhhh Confidence 111110 0 00000 01122244566655 55554 4554443 36666665543 Q ss_pred H-HHHhhccCC-cccccCcccC---CCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecccCc Q lcl|NC_012784. 303 K-LDKMKDKLG-NYLIQPDVKE---KTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFG 377 (415) Q Consensus 303 ~-l~~lkd~~G-~~l~~~~~~~---~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~ 377 (415) . ...|-..+| +|- +.... -...+|-|+|.+..+++|..+ +++=-|++--.-+.....+-...+..... T Consensus 230 ~~~~~l~~~~~~~Pt--E~~Aa~~~~~~k~iGGlpa~~~PffP~~~-----~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~ 302 (336) T protein:vir:37 230 KETKLIQQKHGLTPT--EKAALGSHNLMGSFGGMNAITPPNFPARA-----AAVTTLKNLSVYTEAESVRRSLRNDEDKK 302 (336) T ss_pred hhhhhhhhhcCCCHH--HHHHHHHHHHHHhhCCceEEEccccCCCc-----eEEeeccccEEEEecCcEEEEEEEccccc Confidence 3 223333333 231 11110 123578999999999999764 56655555322233333333332222111 Q ss_pred eEEEEEEE-eccEEeccccEEEEEeecCCCCcccc Q lcl|NC_012784. 378 ECLMIAVR-QDCRILDYKSAIVIEYDDSERGEGDL 411 (415) Q Consensus 378 ~~~~~~~r-~d~~v~~p~a~~~~~~t~~~~~~~~~ 411 (415) ..--.+.| -+..|-++.+++.++-..-.- .+++ T Consensus 303 rie~y~s~Ne~YvVEd~~~~a~iE~i~v~~-~~e~ 336 (336) T protein:vir:37 303 GLVTSYYRQEGYVVEDLGLMTAIDHTKVKL-NGEV 336 (336) T ss_pred cccchhhhcceeeeeccccEEEeeeeeeec-cccC Confidence 11111112 233455566666665332111 1111 No 229 >protein:vir:102823 Length: 470 # NCBI annotation: major structural protein # Family: family:all:2450 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874086;genbank:gi:118197693;genbank:GeneID:4496015 Probab=79.19 E-value=0.11 Score=25.91 Aligned_cols=281 Identities=10% Similarity=0.032 Sum_probs=111.2 Q ss_pred ccccchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHh--hhhhhh Q lcl|NC_012784. 75 VEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKE--VEFNLD 152 (415) Q Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~--~~~~l~ 152 (415) .. ..+.+.-.. . ....... ....|+.+--+.+.+++..+.. +...++ T Consensus 1 ~~---~~~~~~~~~-------------------a--------~~~al~~-a~~~g~AlR~EsLd~~l~~lt~~~~~ftf~ 49 (470) T protein:vir:10 1 MP---YEHLKHLDE-------------------A--------TLKALNA-AGQVAESLEREDLEPEVTQLNVLDTPLTDL 49 (470) T ss_pred CC---hhHhhhhhH-------------------H--------HHHHHHH-hhhcchhhhhhhhccceeEeeecCccchhh Confidence 00 000000000 0 0000000 0111111111111111110000 001111 Q ss_pred hcceeEEccCCceeEEE-EeecCCcccccccccccccccccccceeeEeeeeeEEEeehhhHHH---HhcchHHHHHHHH Q lcl|NC_012784. 153 KYVTVKRVTNGSGKYPV-VRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREA---IEDAKVNVLQELK 228 (415) Q Consensus 153 ~~~~~~~~~~~~~~~~~-~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~---l~ds~~~l~~~l~ 228 (415) .-....+..+--..|.+ ....+........|++- ++.+++.+......++-++.-..||.-. ++....++...+. T Consensus 50 ~~i~k~~a~STV~ey~~~~~rhG~~g~s~~~E~~l-~~~~d~~~~Rr~v~~K~l~~~~~VT~~a~~~~~n~v~d~~~~~~ 128 (470) T protein:vir:10 50 LSKNAVKAKAYEHEYNVVTARHDKIGYAAFREGGL-PRTVEVNVVRRRIRPMLVGHRITVTELATRTTQNGVMQIDELVK 128 (470) T ss_pred hhcCCchhhhHhhhhhhhccccccccceeeccccc-CccCCCceEEEEEEEEEEeecchhhhhhhhhhhccccchHHHHH Confidence 11122233333333432 11123333334578875 5677899999999999999999999764 3445558888898 Q ss_pred HHHHHHHHHHHHHHHhhcccccc---cccc-c---ccccc------ccccccccc-hhhHHHHHHHHHHhh--hhccCCC Q lcl|NC_012784. 229 LWMARTIAATRNKAIIDVITKGS---TGST-S---SGFEK------EGKKLEVKK-AKSLDDIKDAINLNV--KPNYEHN 292 (415) Q Consensus 229 ~~la~~~~~~~d~~il~g~g~~~---~~~~-~---~~~~~------~~~~~~~~~-~~~~~~~~~~~~~~~--~~~~~~~ 292 (415) +...-.+++++|.+.+.|+..=. ++.+ + -|..+ .......-+ ..+.+.+..+...+. ..+..++ T Consensus 129 ~dai~~ia~tiE~a~FyGDs~l~s~~~g~~~gleFDGl~~lId~~~~~NViDarG~~Ls~~~L~~aa~~I~~~~~fGt~T 208 (470) T protein:vir:10 129 REKMIAVANEFEYLAFYGDNLLGDDVPGSPNNLQQDGIINIIKRGAPQNVLDAGGRPLSIDLLWEAESRVVSTQAFANPT 208 (470) T ss_pred HHHHHHHHHHHHhhhhhhccccccccCcccCceeccchhhhccCCCCccccccCCCCccHHHHHHHHhhhcccccccChh Confidence 89999999999999999965221 2111 0 11111 011111111 122333334443332 3566677 Q ss_pred EEEEcHHHHHHHHHhhccCCcccccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEee Q lcl|NC_012784. 293 VAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD 372 (415) Q Consensus 293 ~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~ 372 (415) -++|+.-+.+.|..--...-|.+..++.. ....|+||--.-. ..| .+.+--+. T Consensus 209 D~~lp~~vka~f~~~~~~~qRv~~~~N~~----~~~~G~~v~~f~s-----------a~G------------~I~L~~s~ 261 (470) T protein:vir:10 209 AVFISYVDKLNLQASFYQISRVMTTADRR----AGLLGADAQSYIG-----------VRG------------EHSLYPSQ 261 (470) T ss_pred hhccchhHHHHHHHhhcCceEEEEecCCC----ceeeeeeccceee-----------eee------------eeeecccc Confidence 88999999888877555544544433322 2235666432110 011 11110000 Q ss_pred cccCceEEEEEEEeccEE---eccccEEEEEee------cCCCCccc--------------------------ccccC Q lcl|NC_012784. 373 YMHFGECLMIAVRQDCRI---LDYKSAIVIEYD------DSERGEGD--------------------------LGLEA 415 (415) Q Consensus 373 ~~~~~~~~~~~~r~d~~v---~~p~a~~~~~~t------~~~~~~~~--------------------------~~~~~ 415 (415) +...... .--.|++..+ .-|..++-++-+ +...+.|+ ++.++ T Consensus 262 ~m~~~~k-~~p~~l~~~v~~~aAP~~~~tv~~t~~~~a~~~~sk~g~~~~~~v~sy~y~v~~~~gds~s~~v~vt~t~ 338 (470) T protein:vir:10 262 FLGDFHK-FNPARFGAEVGDFAAPSNSWTVSTTDNFVTLPYNSGLGDPANTTVYSYAFKAANFYGESAAKYIDVYIDS 338 (470) T ss_pred cccchhh-cCcccCCcccCCcccCceeEEeecCCCceeecccCCCCcccCcceeEEEEEEEEecCCCCcceEEEEEee Confidence 0000000 0001111111 112211111100 00011111 11111 No 230 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=77.81 E-value=0.12 Score=25.61 Aligned_cols=266 Identities=10% Similarity=0.050 Sum_probs=118.6 Q ss_pred cccccccceeecchhHHhHHHHHHhhhh-hhhhcceeEEccCCceeEEEEeecCCccc-ccccccccccccccccceeeE Q lcl|NC_012784. 122 GSLKTDSGFVVIPEEIVTDILKLKEVEF-NLDKYVTVKRVTNGSGKYPVVRQSEVAAL-EKVEELEENPELAVKPFFQLA 199 (415) Q Consensus 122 ~~~~~~~~~~~vP~~~~~~Ii~~~~~~~-~l~~~~~~~~~~~~~~~~~~~~~~~~~~a-~~v~Eg~~~~~~~~~~f~~v~ 199 (415) ...+...-. .+-..+...+........ ....+++..+.+..+.+|.++ ...+.. .|++|.. .....=...+ T Consensus 1 m~it~~~l~-~l~~~~~~~~~~~y~~a~~~~~~~a~~~~sdf~~~~~~~l--g~~p~l~e~~Ge~~----~~~l~~~~~~ 73 (302) T protein:vir:10 1 MLINKQSLN-AAFVAIKTIFNNAFAAAPTTWQKIAMEVPSNTSSNDYKWL--STFPKMRRWIGAKV----VKNLKAYKYV 73 (302) T ss_pred CcccHHHHH-HHHHHHHHHHHHHHHhhhhhhhceeeecCCCcceeeceec--CCCCCcccccccee----ecccccccee Confidence 000000000 000011111111111111 223344444433333444443 333443 4555543 2234445678 Q ss_pred eeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc----cc-cccccc------------- Q lcl|NC_012784. 200 YDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTG----ST-SSGFEK------------- 261 (415) Q Consensus 200 ~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~----~~-~~~~~~------------- 261 (415) +..+++...+.||++.+.|-.+++..-+.+.+.++.++.+++.+..-...+.+. +. ..+... T Consensus 74 i~~~~~g~~v~i~R~~i~nDdlg~~~~~~~~~G~aaa~~~~~lv~~~L~~g~~~~~~DG~~fF~~dH~~g~~~~~N~g~~ 153 (302) T protein:vir:10 74 VENEDFEATVEVDRNDIEDDQIGIYSPQAKMAGYSAAQLPDELVYEAVNGAFTKPCFDGQYFIDTDHPVGDASVSNKGTA 153 (302) T ss_pred EEeecccceecccHHhhcccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCCcccCCcceecccccccccccccccch Confidence 999999999999999999888899999999999999999998766533322110 00 000000 Q ss_pred -ccccccccchhhHHHHHHHHHHhhhh-----ccCCCEEEEcHHHHHHHHHhhccCCcccccCcccCCCCceecc-eeeE Q lcl|NC_012784. 262 -EGKKLEVKKAKSLDDIKDAINLNVKP-----NYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLG-AKIE 334 (415) Q Consensus 262 -~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G-~pV~ 334 (415) ............+...+.++.+.... ...+..+++.|.....-+++-.+ +++- .+..+.+.| ..++ T Consensus 154 ~~~~~~~~l~~~~~~aa~~am~~~k~~~G~~L~i~P~~LiVp~~le~~A~~ll~~-~~~~------~g~~Np~~g~~~~v 226 (302) T protein:vir:10 154 PLSNASQAAAKAGYGAARTAMKKFKDEEGRSLNVSPNVLLVGPALEDVAKMLLTN-PKLA------DNTPNPYVGTAELV 226 (302) T ss_pred hhhhcccccchHHHHHHHHHHHHHhhhcccccccCCCEEEecchhHHHHHHHhhc-cccC------CCCcceeccceEEE Confidence 00001112233455555555555433 33457788888776665554221 2211 111222333 3456 Q ss_pred EeccccccccCCceEEEechhhc--EEEEeecceEEEEe-ecccCceEEEEEEEecc------EEeccccEEEEEeecCC Q lcl|NC_012784. 335 ILPDEVLGQKGNNTLIIGNLKDA--IVLFDRSQYQASWT-DYMHFGECLMIAVRQDC------RILDYKSAIVIEYDDSE 405 (415) Q Consensus 335 ~~~~~~~~~~~~~~~~~gd~~~~--~~~~~~~~~~i~~~-~~~~~~~~~~~~~r~d~------~v~~p~a~~~~~~t~~~ 405 (415) +++.+..+. .-.++.|.+.. +.+-.+++..+... ++.....-++....|++ .-..|.... .+ T Consensus 227 v~p~L~s~~---aWyL~a~~~~i~~~~l~g~~~P~~~~~~~~~~dgv~~k~~~d~Gvd~R~~~G~~~wq~a~------~s 297 (302) T protein:vir:10 227 VDGRIESDT---AWFLLDTTKPVKPFIFQPRKQPEFVSQVNLDSDDVFNLRKLKFGAEARAAAGYGFWQLAY------GS 297 (302) T ss_pred EeeccCCCC---ceEEEecCCccceEEEcCccccEEEeccCCCCCceEEEEEEEEeeeeeeecchhhhhhhh------cc Confidence 655554322 22344454431 11112344555432 23333333444444442 222232211 12 Q ss_pred CCccc Q lcl|NC_012784. 406 RGEGD 410 (415) Q Consensus 406 ~~~~~ 410 (415) .|.+. T Consensus 298 ~g~~~ 302 (302) T protein:vir:10 298 TGTGA 302 (302) T ss_pred CccCC Confidence 22222 No 231 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=76.50 E-value=0.13 Score=25.36 Aligned_cols=358 Identities=11% Similarity=0.003 Sum_probs=114.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccchhhh Q lcl|NC_012784. 4 KEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTY 83 (415) Q Consensus 4 ~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (415) +.+|++++.+.++++.++.+........+. +..++..+++++++++++.++++++++++................... T Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~e~~~~~~-~~~~e~~~~~~~~~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~- 78 (390) T protein:vir:97 1 MTDITAKLEATLANVTDSLKAFGERAVRDG-ELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVSV- 78 (390) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhhc-CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccc- Confidence 777777776666666665555433322221 122455577888888898888888776665544333322222111110 Q ss_pred hhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhh-hhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccC Q lcl|NC_012784. 84 RNQANINDLGISIQNTKVTSQEVRDFTEYLETRND-IQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTN 162 (415) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~ 162 (415) .. .... ..................... ........+.+...-...+-.+++..+-. .++.... +.. T Consensus 79 ---~~---~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lip~~~~~~ii~--~~~~~~~---i~~ 145 (390) T protein:vir:97 79 ---GD---MFVA--SEQFQASTGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFIT--PPDARLT---VRD 145 (390) T ss_pred ---hh---hhhh--hHHHHHHHHHhhhhhhhhhhHHHHHHHhhhcccccccccccchhhhHHHHH--HHhhhhh---hHh Confidence 00 0000 000000000000111111111 11111111111111111222333332211 1122111 111 Q ss_pred CceeEEEEeecCCccccccccc------ccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHH Q lcl|NC_012784. 163 GSGKYPVVRQSEVAALEKVEEL------EENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIA 236 (415) Q Consensus 163 ~~~~~~~~~~~~~~~a~~v~Eg------~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~ 236 (415) .-..+++ ......+..+. .-..+.....-...++....+...-.-..--+.+....-...+.+.+...++ T Consensus 146 ~~~~~~~----~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds~~l~~~i~~~la 221 (390) T protein:vir:97 146 LIGSGRT----DSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQILSDAPQLASYMNNRLI 221 (390) T ss_pred hcceeec----cCCceEEEEEecCCcceeeecCCccccccccceeEEEEeeeeEEEeehhhHHHHHhHHHHHHHHHHHHH Confidence 1112222 11122222111 1122333333344555555554322111112333333334567777888888 Q ss_pred HHHHHHHhhcccccccccc-ccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCc-- Q lcl|NC_012784. 237 ATRNKAIIDVITKGSTGST-SSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGN-- 313 (415) Q Consensus 237 ~~~d~~il~g~g~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~-- 313 (415) +++...+-...=.|.+... +.+..+...........+.+...+ .....+..+...... T Consensus 222 ~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~~~~~~d-------------------~~~~~~~~~~~~~~~~~ 282 (390) T protein:vir:97 222 RGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVD-------------------QLRLAMLQASLAEYPAS 282 (390) T ss_pred HHHHHHHHHHHhhcCCCCccccceeeccccccccccccccchHH-------------------HHHHHHHhhccccCCCC Confidence 8888777654433322111 222222211111111111111111 112233344333221 Q ss_pred -ccccCc-------ccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeec-ceEEEEeecccCceEEEEEE Q lcl|NC_012784. 314 -YLIQPD-------VKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRS-QYQASWTDYMHFGECLMIAV 384 (415) Q Consensus 314 -~l~~~~-------~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~-~~~i~~~~~~~~~~~~~~~~ 384 (415) +++.+. +.+. .|.|++....-+ ....++|-. +...+.. .-.+-+-++. ..+.... T Consensus 283 ~~v~n~~~~~~L~~lkd~-----~G~~l~~~~~~~-----~~~~l~G~p---V~~~~~~~~~~~~~gd~~---~~~~~~~ 346 (390) T protein:vir:97 283 GIVINPIDWAAIELAKDA-----NNQYLIGNARGT-----LTPTLWGLP---VVATQAMAPGEFLVGAFD---LAAQIFD 346 (390) T ss_pred EEEEcHHHHHHHHHhhcC-----CCceeecCccCC-----CCceeccee---eEEcCCCCCCcEEEEecc---ceEEEEE Confidence 121111 1111 355554321111 001122211 0000000 0011111111 1111111 Q ss_pred EeccEEec--------cc-----cEEEEEeecCCCCcccccccC Q lcl|NC_012784. 385 RQDCRILD--------YK-----SAIVIEYDDSERGEGDLGLEA 415 (415) Q Consensus 385 r~d~~v~~--------p~-----a~~~~~~t~~~~~~~~~~~~~ 415 (415) |-+..+.. .. +..++.+...-+..=-..+.| T Consensus 347 ~~~~~i~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 347 QWDARVEIGYVNDDFQRNMVTVLAEERLALVVYRPEALITGSFA 390 (390) T ss_pred ecceEEEEeecccccccCcEEEEEEEeeccEEeccccEEEEEeC Confidence 22221110 11 111111111111111122222 No 232 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=76.21 E-value=0.14 Score=25.30 Aligned_cols=359 Identities=11% Similarity=0.010 Sum_probs=109.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccchhhh Q lcl|NC_012784. 4 KEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTY 83 (415) Q Consensus 4 ~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (415) +.+|++++.+.++++.++.++..+....+. +..++..+.++++.++++.++++++++++................... T Consensus 1 m~~l~~~l~~~~~~~~~~~~~~~e~~~~~~-~~~~e~~~~~~~l~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~- 78 (390) T protein:vir:81 1 MTDITSKLEATLANVTDSLRAFGERAVRDG-ELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVSV- 78 (390) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhhc-CcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccc- Confidence 666666555555555444443322221111 122445678888899999988888877765554443322222111110 Q ss_pred hhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCC Q lcl|NC_012784. 84 RNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNG 163 (415) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~ 163 (415) .. .. ............... ...................+...-...+-++++..+-. .++... ++... T Consensus 79 ---~~--~~-~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~--~~~~~~---~l~~~ 146 (390) T protein:vir:81 79 ---GD--MF-VASEQFQASAGRWND-RSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFIT--PPDARL---TVRDL 146 (390) T ss_pred ---hh--hh-hhhHHHHHHHHHHhh-hhhhhhhHHHHHHHhhccccccCCcceechhhhHHHHH--HHhhhh---hhhhh Confidence 00 00 000000000000000 01111111111111111111100111222233322211 111111 11111 Q ss_pred ceeEEEEeecCCccccccccc------ccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHH Q lcl|NC_012784. 164 SGKYPVVRQSEVAALEKVEEL------EENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAA 237 (415) Q Consensus 164 ~~~~~~~~~~~~~~a~~v~Eg------~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~ 237 (415) -..+++ +.....+.... .-..+.........++...++...-.-..--+.+........+.+.+...++. T Consensus 147 ~~~~~~----~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~~~~~~i~~~l~~ 222 (390) T protein:vir:81 147 IGSGRT----DSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQILSDAPQLASYMNNRLIR 222 (390) T ss_pred cceeec----cCCceEEEEEecCCcceeeecCCcccccccceeeEEEEeeeEEEEeehhhHHHHHhHHHHHHHHHHHHHH Confidence 111121 12222222111 11223333333444555544443222112223443444445666666777777 Q ss_pred HHHHHHhhcccccccccc-ccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCC---c Q lcl|NC_012784. 238 TRNKAIIDVITKGSTGST-SSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLG---N 313 (415) Q Consensus 238 ~~d~~il~g~g~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G---~ 313 (415) ++...+-...=.|...+. +.+....................+ .....+..+..... . T Consensus 223 ~~~~~~d~a~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~-------------------~~~~~~~~~~~~~~~~~~ 283 (390) T protein:vir:81 223 GLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVD-------------------QLRLAMLQASLAEYNPSG 283 (390) T ss_pred HHHHHHHHHHHhcCCCCCcccceeecccccccccccccchhHH-------------------HHHHHHHhhccccCCCCE Confidence 777665443333322111 222221111111111111111111 11222333333221 1 Q ss_pred ccccCc-------ccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEee-cceEEEEeecccCceEEEEEEE Q lcl|NC_012784. 314 YLIQPD-------VKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDR-SQYQASWTDYMHFGECLMIAVR 385 (415) Q Consensus 314 ~l~~~~-------~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~-~~~~i~~~~~~~~~~~~~~~~r 385 (415) +++.+. ..+. .|.|++...... ....++|-. +...+. ..-.+-+-++.. .+....| T Consensus 284 ~v~~~~~~~~l~~lkd~-----~G~~l~~~~~~~-----~~~~l~G~p---v~~~~~~p~~~~~~gd~~~---~~~~~~~ 347 (390) T protein:vir:81 284 IVINPIDWAAIELAKDA-----NNQYLIGNARGT-----LTPTLWGLP---VVATQAMAPGEFLVGAFDL---AAQIFDQ 347 (390) T ss_pred EEEcHHHHHHHHHhhcC-----CCceeecCcccc-----cCceeccee---eEEcCCCCCCcEEEEehhc---eEEEEEe Confidence 222111 1111 244544321111 111233321 111000 001111222211 1111111 Q ss_pred eccEEec-------------cccEEEEEeecCCCCcccccccC Q lcl|NC_012784. 386 QDCRILD-------------YKSAIVIEYDDSERGEGDLGLEA 415 (415) Q Consensus 386 ~d~~v~~-------------p~a~~~~~~t~~~~~~~~~~~~~ 415 (415) -+..+.. -.++.++.+...-+..--+.+.| T Consensus 348 ~~~~v~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 348 WDARVEIGYVGEDFQRNMITVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred cceEEEEecccchhhcCcEEEEEEEeeccEEecccceEEEEeC Confidence 1111110 01112222221111111222233 No 233 >protein:vir:6901 Length: 522 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861877;genbank:gi:32453668;genbank:GeneID:1494303 Probab=75.56 E-value=0.15 Score=25.18 Aligned_cols=358 Identities=14% Similarity=0.130 Sum_probs=138.4 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhhhhccccccccc Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQ-EKQEELDKLKEKDGTSENNQQSVEVNE 79 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (415) ||..++|+++...+.+. +...++...+.. +-++|= .+++.++. T Consensus 4 ~~~~e~l~~kw~p~l~~--------------~~~~~~~~~~~~---~~a~l~enq~~~~~~------------------- 47 (522) T protein:vir:69 4 IKTKAQLVDKWKELLEG--------------EGLPEIANSKQA---IIAKIFENQEKDFEV------------------- 47 (522) T ss_pred cchHHHHHHhhHHHhcC--------------CCCCccccchhh---hhhhhhhhhhHHhhc------------------- Confidence 77788888888776432 001111000000 111110 01111000 Q ss_pred hhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHH---HHhhhhhhhhcce Q lcl|NC_012784. 80 ARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILK---LKEVEFNLDKYVT 156 (415) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~---~~~~~~~l~~~~~ 156 (415) ....+.....+.+.............. -........+.+ +.+ ..+.+.++. ...+.....+++. T Consensus 48 ~~~~~~~~~~~~~~~~l~ea~~~~~~~---------~~~~~i~es~~t-~~v---~~~~P~li~lvrRa~p~LIa~DIwG 114 (522) T protein:vir:69 48 SPEYKDEKIAQAFGSFLTEAEIGGDHG---------YNAQNIAAGQTS-GAV---TQIGPAVMGMVRRAIPNLIAFDICG 114 (522) T ss_pred ccccchhHHHHhhhhhhhhhccccccC---------CCcccccccccc-ccc---ccccchHHHHHHHHHhhhhhhhcee Confidence 000111111111111111111110000 000011111111 111 122222332 3333444556788 Q ss_pred eEEccCCceeE-----EEEeecCC------------ccccc--------------------------------------- Q lcl|NC_012784. 157 VKRVTNGSGKY-----PVVRQSEV------------AALEK--------------------------------------- 180 (415) Q Consensus 157 ~~~~~~~~~~~-----~~~~~~~~------------~~a~~--------------------------------------- 180 (415) ++||+++++-+ .+...... +.+.+ T Consensus 115 VQPMTgPTGLIFAMRsrY~~q~~~~~~~eaf~~~neadt~fSG~~~~t~~~~~~~~~~t~~G~~~~~~~~~~gt~~~~~~ 194 (522) T protein:vir:69 115 VQPMNSPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGQGAAKKFPALAASTQTKVGDIYTHFFQETGTVYLQAS 194 (522) T ss_pred eccCCchhhhheeeeeeccCCcccCccccccccccccccccccccccccccccccccccccccccccccccccceeeecc Confidence 88888877632 11111000 00000 Q ss_pred --------------------------------------ccccc-cccccccccceeeEeeeeeEEE-------eehhhHH Q lcl|NC_012784. 181 --------------------------------------VEELE-ENPELAVKPFFQLAYDINTHRG-------YFRISRE 214 (415) Q Consensus 181 --------------------------------------v~Eg~-~~~~~~~~~f~~v~~~~~k~a~-------~~~iS~e 214 (415) .+|.. .....+...|.+..+...|..+ ...+|-| T Consensus 195 a~~t~~~t~~~~~~~~~ai~s~~~~~~~y~~g~GmsTa~aEal~~lggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiE 274 (522) T protein:vir:69 195 AQVTISSSADDAAKLDAEIIKQMEAGALVEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIE 274 (522) T ss_pred cCCcCCCCCcccccccchhccccccccceeeccccchhhhhhcccCCCCcccchhhhcceEeeEEEeeecccccccccHH Confidence 00100 0000111234455555555544 4579999 Q ss_pred HHhcc----hHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccc----ccccccccc-------chhhHHHHH- Q lcl|NC_012784. 215 AIEDA----KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFE----KEGKKLEVK-------KAKSLDDIK- 278 (415) Q Consensus 215 ~l~ds----~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~----~~~~~~~~~-------~~~~~~~~~- 278 (415) |.+|- .+|.+++|.+-|+..|...++++||.-......-+.. +.. ...+..... +-...+-++ T Consensus 275 LAQDLKAIHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~~-g~t~~~~~~~Gv~Dl~~~~~~~~~rw~~e~~k~ 353 (522) T protein:vir:69 275 LAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKS-GMTNIVGSKAGVFDFQDPIDIRGARWAGESFKA 353 (522) T ss_pred HHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhheeecc-ccccccccccceeecccccccccchhHHHHHHH Confidence 99983 4578999999999999999999999643322111110 100 001111110 011112222 Q ss_pred ------HHHHHhhh--hccCCCEEEEcHHHHHHHHHhh-----ccCC-cccccCcccCC-CCceec-ceeeEEecccccc Q lcl|NC_012784. 279 ------DAINLNVK--PNYEHNVAIVSQTMFAKLDKMK-----DKLG-NYLIQPDVKEK-TQQRLL-GAKIEILPDEVLG 342 (415) Q Consensus 279 ------~~~~~~~~--~~~~~~~~v~~~~~~~~l~~lk-----d~~G-~~l~~~~~~~~-~~~~l~-G~pV~~~~~~~~~ 342 (415) ...+.+.. .....+.+++++.....|...- .+.| ..=|..+.+.. ..+.|. |++|.+..+++. T Consensus 354 L~~~i~~~an~i~~~T~rg~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~- 432 (522) T protein:vir:69 354 LLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQGLASGFNTDTTKSVFAGVLGGKYRVYIDQYAKQ- 432 (522) T ss_pred HHHHHHHHHHHHHHhcccccccEEEEchhHHHHHhhcccccccccccccccccccCCCceEEEEecCceEEEecCCCCc- Confidence 22222222 2234678999999988887531 1111 01011122111 113443 577777776542 Q ss_pred ccCCceEEEechh-----hcEEEEeecc-eEEEEeecccCceEEEEEEEeccEEecccc-------EEEEEeecCCCCcc Q lcl|NC_012784. 343 QKGNNTLIIGNLK-----DAIVLFDRSQ-YQASWTDYMHFGECLMIAVRQDCRILDYKS-------AIVIEYDDSERGEG 409 (415) Q Consensus 343 ~~~~~~~~~gd~~-----~~~~~~~~~~-~~i~~~~~~~~~~~~~~~~r~d~~v~~p~a-------~~~~~~t~~~~~~~ 409 (415) ..+++|.=- ........-. .-+...|...|+-.+-...|+++.+ +|=+ .+++. ...|+..+ T Consensus 433 ----dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~v-NP~~~~~~~~~~~ri~-~g~p~~~~ 506 (522) T protein:vir:69 433 ----DYFTVGYKGANEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGV-NPFAESSLQAPGARIQ-SGMPSILN 506 (522) T ss_pred ----ceEEEEEeCCcccccceeeccccccccccccCCccccceeeeeeeeceee-cCcccccCCcccceee-cccchhhc Confidence 234444200 0000000001 1112234455666666667777654 3311 11221 23333333 Q ss_pred cccccC Q lcl|NC_012784. 410 DLGLEA 415 (415) Q Consensus 410 ~~~~~~ 415 (415) -.+.-+ T Consensus 507 ~~~~n~ 512 (522) T protein:vir:69 507 SLGKNA 512 (522) T ss_pred ccCCcc Confidence 333333 No 234 >protein:vir:96666 Length: 462 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238545;genbank:gi:66391271;genbank:GeneID:5130448 Probab=74.19 E-value=0.16 Score=24.93 Aligned_cols=309 Identities=11% Similarity=-0.021 Sum_probs=127.2 Q ss_pred ccccccchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhh--h Q lcl|NC_012784. 73 QSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEF--N 150 (415) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~--~ 150 (415) ...+.+... ..... .... ...+.+.+. ........+..++|.+--+.+.++|..+..... . T Consensus 1 ~~~~~~~~~---~~~~~--------~~~~----~e~~~KS~~--tg~g~~p~~q~~~gAlR~esL~~~i~~Lt~~~~~~~ 63 (462) T protein:vir:96 1 MHKDTNLTA---EQNKY--------ADKF----QEEVMKSYQ--TGYGITPDTQVDAGALRREILDDQITMLTWTQDDLI 63 (462) T ss_pred Cccccccch---hhhhh--------hchh----hHHHHHHHh--cCCCcCCccccccchhhhhhhhhhhheeeecccchh Confidence 000000000 00000 0000 000000000 001122233444454443444444432222211 2 Q ss_pred hhhcceeEEccCCceeEEEEeecCC-cccccccccccccccccccceeeEeeeeeEEEeehhhHHH-HhcchHHHHHHHH Q lcl|NC_012784. 151 LDKYVTVKRVTNGSGKYPVVRQSEV-AALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREA-IEDAKVNVLQELK 228 (415) Q Consensus 151 l~~~~~~~~~~~~~~~~~~~~~~~~-~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~-l~ds~~~l~~~l~ 228 (415) +..-....+..+--..|......+. .-..++.|++.. +.+++.+......++-++.-..+|... +..+..+....+. T Consensus 64 ~~~~i~k~~a~sTv~~y~~~~~~G~~g~~~f~~E~g~~-~~~d~~~~R~~~~~k~l~~t~~vsi~~tl~n~~~d~~~~~~ 142 (462) T protein:vir:96 64 FYREISRRPAQSTVQKYDVYLRHGNVGHSRFVREVGVA-PVSDPNIRQKTVEMKYVSDTKNLSIASTLVNNIQDPMQILT 142 (462) T ss_pred hhhhcCCchhhhhhhhheeeeccCcccccccccccccc-ccCCCceEEEEEEEEEEeeeeeechhhhhccchhhHHHHHH Confidence 2233333444444444444433343 556789999975 578899999999999999877777553 2445667788999 Q ss_pred HHHHHHHHHHHHHHHhhcccccccccccccccc--------ccccccccch-hhHHHHHHHHHHhhhhccCCCEEEEcHH Q lcl|NC_012784. 229 LWMARTIAATRNKAIIDVITKGSTGSTSSGFEK--------EGKKLEVKKA-KSLDDIKDAINLNVKPNYEHNVAIVSQT 299 (415) Q Consensus 229 ~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~--------~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~v~~~~ 299 (415) +.-...++..+|.+.+.|+..=.+.+..-+.-- .......-+. .+.+.+..+...+...+..++-++|+.- T Consensus 143 ~dai~~~a~tiE~a~Fygds~l~~~~~~~gleFDGl~~lI~~~NViDarG~~Ls~~~ln~aa~~i~~~fGt~TD~~~p~~ 222 (462) T protein:vir:96 143 EDAIAVVAKTIEWASFYGDASLTADPTGQGLEFDGLAKLIDKDNVIDAKGESLTETLLNRSAVLIGKSFGTATDAYMPIG 222 (462) T ss_pred HHHHHHHHHHHHHHHhhhhcccCCCccccccchhhhhhhcCCCceeecCCCCccHHHHhhhhhhcccccCChhheecchH Confidence 999999999999999999776544332111111 1111112222 2223333334444455666778999999 Q ss_pred HHHHHHHhhccCCcccccCcccCCCCceecceeeEEeccccccccCCceEEEechhh-cEEEEeecceEEEEee----cc Q lcl|NC_012784. 300 MFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKD-AIVLFDRSQYQASWTD----YM 374 (415) Q Consensus 300 ~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~~i~~~~----~~ 374 (415) +.+.|..---..-|.+..++.. ....|+||--.-. ..|+.+= ...+.+. ...+..+. .. T Consensus 223 v~a~f~~~~l~~qrv~~~~n~g----~~~~G~~v~~f~s-----------~~G~I~L~~s~~m~~-~~i~~~~~~~~p~a 286 (462) T protein:vir:96 223 VHADFVNSVLGRQMQLMQDNSG----NVNAGYNVQGFYS-----------SRGFIKLHGSTVMEN-ELILDESLQPLPNA 286 (462) T ss_pred HHHHHHHhhcCceEEEEcCCCC----ceeeeeeccceee-----------eeeeeeeCCceecCc-ccccccccccCCCC Confidence 9888875332222333333221 2245555432100 0000000 0000000 00000000 00 Q ss_pred cCceEEEEEEEec--cEEeccc--cEEEEEeecCCCC-----cccccccC Q lcl|NC_012784. 375 HFGECLMIAVRQD--CRILDYK--SAIVIEYDDSERG-----EGDLGLEA 415 (415) Q Consensus 375 ~~~~~~~~~~r~d--~~v~~p~--a~~~~~~t~~~~~-----~~~~~~~~ 415 (415) -....+.+....+ +...++. +-...++++.... --.+..|+ T Consensus 287 p~~~~vsaTv~t~~~g~f~~~~d~~~y~Y~V~avs~dgeS~PS~~VtaTv 336 (462) T protein:vir:96 287 PQPATVKATVETGKKGLFTDEHDRAELTYKVVVNSDDAQSAPSEAVTATV 336 (462) T ss_pred CCCCceeEEEEeCCCCCCCCccCceeEEEEEEEECCCCccccceeeEeee Confidence 0011112221111 0111110 1111111111111 01112222 No 235 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=73.49 E-value=0.17 Score=24.81 Aligned_cols=261 Identities=13% Similarity=0.067 Sum_probs=111.6 Q ss_pred cccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEeecCCcccccc--cccccccccccccceeeE Q lcl|NC_012784. 122 GSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKV--EELEENPELAVKPFFQLA 199 (415) Q Consensus 122 ~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v--~Eg~~~~~~~~~~f~~v~ 199 (415) ... . .-+.++..+.+.+...+....+.....-..+..++.+++.+.. ..... ..|-..++. ..++...+ T Consensus 1 Mai-----n--~a~~~~~~Ld~~~~~~~~t~~l~~~~~~~~ggktVkI~~i~~~-gl~DY~R~~g~~~g~v-~~~~et~t 71 (290) T protein:vir:78 1 MAI-----N--YVDKYGKELDQKLVFGTYTNELETPNLLWLDAKTFKIQTITTT-GLKAHTRNKGYNEGSA-SNTNKSYT 71 (290) T ss_pred Cch-----h--HHHHHHHHHHHHHHhhheeeeccccceeeccCCEEEEeeeccC-cccccccCCCcccCcc-ccceeeEE Confidence 000 0 0123444444444433332222221111122234556555432 22222 222222222 12344555 Q ss_pred eeeeeEEEeehhhHHHHhc---chHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhhHHH Q lcl|NC_012784. 200 YDINTHRGYFRISREAIED---AKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDD 276 (415) Q Consensus 200 ~~~~k~a~~~~iS~e~l~d---s~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 276 (415) ++-.+.-.+.. +.|-.| ...++...+.+...+.+.-.+|...+.-.-++.... ........+....++. T Consensus 72 l~qdR~~~F~v--D~~DvDEt~~~~~~~nv~~ef~~~~v~PEiDayr~skla~~a~~~------~~~~~~t~t~~n~~~~ 143 (290) T protein:vir:78 72 IDFDRDVEFFV--DVMDVDETGQALSAANVTKEFNSRHAGPEMDAYRFSKLATAAKTN------SNSVAEEITKDNVFTK 143 (290) T ss_pred eeccccceeec--cccchhHHhhhhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhhhcc------CcccccccCHHHHHHH Confidence 55554443321 001111 123455566666667777777776554222211100 0111122344566788 Q ss_pred HHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCccc---ccCcccCCCCceecceeeEEeccc-------------- Q lcl|NC_012784. 277 IKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYL---IQPDVKEKTQQRLLGAKIEILPDE-------------- 339 (415) Q Consensus 277 ~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l---~~~~~~~~~~~~l~G~pV~~~~~~-------------- 339 (415) +.+++.++......+-.++++|..+..|.+.+.=....- +..+..++..+.|.|.+|+.+++- T Consensus 144 i~~~~~~ldevp~~~rvl~vtp~~~~lL~~~~~f~r~~~~~~~~~~~i~~~V~~idG~~ii~vps~~r~~t~~~f~~G~~ 223 (290) T protein:vir:78 144 LKAAIRKVKKYGTQNLVMYVSPDVMAALELSDDFVRAINVQNIGPSSIETRITAIDGTRIVEVEAEDRFYDTFDFTDGYK 223 (290) T ss_pred HHHHHHHHHhcCCCCeEEEECHHHHHHHhhChhhhccccccccccccccceeeeecCcEEEEecccchhhhhhhhccccc Confidence 888887776655556688999999988865322111110 112223455678999999876631 Q ss_pred cccccCCceEEEechhhcEEEEeecceEEEEeecccCce--EEEEE--EEeccEEeccc---cEEEEEe Q lcl|NC_012784. 340 VLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGE--CLMIA--VRQDCRILDYK---SAIVIEY 401 (415) Q Consensus 340 ~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~--~~~~~--~r~d~~v~~p~---a~~~~~~ 401 (415) +...+.+-.+++..... ..-.. .--.+....-..++. +.... .+.|.-|.+.+ .++.+.+ T Consensus 224 ~~~~ak~in~ii~~~~a-~i~~~-K~~~~~~~~P~~~~~~d~~~~~~r~y~d~~v~~nk~~~i~~~~~~ 290 (290) T protein:vir:78 224 PAAGAKKLNFLLVNKGS-VVGGA-KHASIYLHAPGSVGQGDGWLYQYRVYHDIFVLDQQKDGVIASTEV 290 (290) T ss_pred ccCCccceeEEEEcCCc-eeeee-eeeEEEeeCCCCCcCcceeeeeeeeeeeeeeeccccCeeEEEeeC Confidence 22222333344444332 22211 111233332233333 22222 34455555432 2222222 No 236 >protein:vir:102335 Length: 312 # NCBI annotation: putative capsid protein # Family: family:all:701 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529560;genbank:gi:90592716;genbank:GeneID:3974467 Probab=72.06 E-value=0.19 Score=24.57 Aligned_cols=270 Identities=10% Similarity=0.083 Sum_probs=111.3 Q ss_pred ccceeecchhHHhHHHHHHhhhhhhhhcc----eeEEccCCceeEEEEeecCCcccccccc--ccccc--ccccccceee Q lcl|NC_012784. 127 DSGFVVIPEEIVTDILKLKEVEFNLDKYV----TVKRVTNGSGKYPVVRQSEVAALEKVEE--LEENP--ELAVKPFFQL 198 (415) Q Consensus 127 ~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~----~~~~~~~~~~~~~~~~~~~~~~a~~v~E--g~~~~--~~~~~~f~~v 198 (415) ...-....+.+...+-+.....+ +.... ..+... +..++.+++.... ......- +.... +. ..++... T Consensus 1 Mantl~ya~~~~~~LD~~~~~~~-~s~~l~~~~~~v~~~-ggktVkIp~i~~~-gl~DY~R~~g~~~~~g~v-~~~~et~ 76 (312) T protein:vir:10 1 MANTLAYGQVLQQGLDKQATQEL-LTGWMDSNAKQIKYE-GGKEVKIGKLSTD-GLGDYSRGSANAYVGGDV-KFEYETK 76 (312) T ss_pred CCcchhHHHHHHHHHHHHHHhhh-ccccccCCCceEEEe-cCcEEEEEeeecc-cccccccccCCccccccc-cccceeE Confidence 00001223444444443333222 11111 112222 2235556554433 2222221 11111 11 1233444 Q ss_pred EeeeeeEEEeehhhHHHHhc-ch--HHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhhHH Q lcl|NC_012784. 199 AYDINTHRGYFRISREAIED-AK--VNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLD 275 (415) Q Consensus 199 ~~~~~k~a~~~~iS~e~l~d-s~--~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 275 (415) +++-.+.-.+.. +.|-.| +. .++...+.+...+.+.=.+|...++-.-........ ..........+....++ T Consensus 77 tl~qDR~~~F~v--D~mDvDETn~~~s~anv~~ef~r~~vvPEiDayrfskla~~a~~~~~--~~~~~~~~~~T~~ni~~ 152 (312) T protein:vir:10 77 TMTQDRGRKFTL--DAMDVDETNFLVTATTVMGEFQRLKVIPEIDAYRLSRLATIAIGIKG--DTNVEYSYSVNSSTIIN 152 (312) T ss_pred Eeeecccceeec--cccchhhHhhHHHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhcccc--ccccccccccCHHHHHH Confidence 444444333221 111111 11 123333444445555556666655432222111100 00111122345566788 Q ss_pred HHHHHHHHhhhhccC-CCEEEEcHHHHHHHHHhhccCCcc---cccCcccCCCCceecceeeEEecc--cc------cc- Q lcl|NC_012784. 276 DIKDAINLNVKPNYE-HNVAIVSQTMFAKLDKMKDKLGNY---LIQPDVKEKTQQRLLGAKIEILPD--EV------LG- 342 (415) Q Consensus 276 ~~~~~~~~~~~~~~~-~~~~v~~~~~~~~l~~lkd~~G~~---l~~~~~~~~~~~~l~G~pV~~~~~--~~------~~- 342 (415) .+.+++..+-.+... +-.++|.|..+..|.+. -..+. .+..+...+....|.|.||+.+++ |. +| T Consensus 153 ~i~~~~~~lde~~vp~~rvl~vTp~~~~lLk~~--~~~~~~~~~~~~~~i~~~V~~iDgv~Ii~VPs~r~~t~~~f~dG~ 230 (312) T protein:vir:10 153 KIKTGIKIIRENGYNGPLVCHLTYDSMFAIEEK--VLEKLTAVTFAQGGIQTQVPSIDGCALIKTPQNRMYSSILLNDGT 230 (312) T ss_pred HHHHHHHHHHHccCCCceEEEeChHHHHHHhhh--hhceecccccccceeeeeeeeecccEEEEchhhhccceeeeccCc Confidence 888888888876654 55788999888666642 11111 112223344456899999997753 21 01 Q ss_pred -------------ccCCceEEEechhhcEEEEeecceEEEEeecccCceE--EEEE--EEeccEEecc-ccEEEEEeecC Q lcl|NC_012784. 343 -------------QKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGEC--LMIA--VRQDCRILDY-KSAIVIEYDDS 404 (415) Q Consensus 343 -------------~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~--~~~~--~r~d~~v~~p-~a~~~~~~t~~ 404 (415) .+.+-.+++-... +..-.. .--.+....-..++.. .... .+.|.-|.+. ..-+++.+..+ T Consensus 231 t~~~~~gg~~~~~~ak~INfiiv~~~-a~i~~~-K~~~~~if~P~~~~~~d~~~~~~R~Y~D~fv~~nk~~~Iyv~~k~a 308 (312) T protein:vir:10 231 TSNQTAGGYLKGTKALDTNFIIAPVD-VPLAIT-KQDKMRIFDPETNQTANAWSMDYRRYHDLWVTDNKANSVYANFKDA 308 (312) T ss_pred ccccccCceeecCcccccceEEeCCc-eeecee-eeeeeeeeCCCCCCCcceeeeeeeeeeeeeeeccccCeEEEEeecc Confidence 1111122222221 111111 1112222222233332 2222 3456666654 45557788888 Q ss_pred CCCc Q lcl|NC_012784. 405 ERGE 408 (415) Q Consensus 405 ~~~~ 408 (415) .++| T Consensus 309 ~~~~ 312 (312) T protein:vir:10 309 KPVG 312 (312) T ss_pred cCCC Confidence 8777 No 237 >protein:vir:99888 Length: 309 # NCBI annotation: capsid protein # Family: family:all:908 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164075;genbank:gi:56692607;genbank:GeneID:3192616 Probab=71.48 E-value=0.2 Score=24.48 Aligned_cols=267 Identities=15% Similarity=0.043 Sum_probs=100.9 Q ss_pred ccceeecchhHHhHHHHHHhhhhhh-hhcceeEEccCCceeEEEEeecCCcc--cccccccccccccccccceeeEeeee Q lcl|NC_012784. 127 DSGFVVIPEEIVTDILKLKEVEFNL-DKYVTVKRVTNGSGKYPVVRQSEVAA--LEKVEELEENPELAVKPFFQLAYDIN 203 (415) Q Consensus 127 ~~~~~~vP~~~~~~Ii~~~~~~~~l-~~~~~~~~~~~~~~~~~~~~~~~~~~--a~~v~Eg~~~~~~~~~~f~~v~~~~~ 203 (415) ...+..++..+.+.+-.--++..-+ ..++..+++...+++|+......... -.-++.++..... .+.....++..+ T Consensus 1 ~~~~~~~~dp~LT~~A~gy~n~~~Ia~~l~P~vpV~~~~~~~~~f~~~e~F~~~~t~r~~~~~~~~v-~~~~~~~~~~~~ 79 (309) T protein:vir:99 1 MSNAPFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEV-EFSATDETGSTE 79 (309) T ss_pred CCCCCcCcCHhHHHHHhhccChhhhhhhcCCccccCccccceeeechhhcccccchhhccCCCcceE-eecccCceeeec Confidence 1112222233333332111111111 12345566666666666543211111 1223344332211 233334444444 Q ss_pred eEEEeehhhHHHHhcc--hHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-----cccccccccccccccccchhhHHH Q lcl|NC_012784. 204 THRGYFRISREAIEDA--KVNVLQELKLWMARTIAATRNKAIIDVITKGST-----GSTSSGFEKEGKKLEVKKAKSLDD 276 (415) Q Consensus 204 k~a~~~~iS~e~l~ds--~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~ 276 (415) ..+-..+|..+-..++ .+|.++.-.+.+.+.+....|..+-.-.-...+ ...+++. ........+...+ T Consensus 80 ~~~L~~~i~~~~~~~a~~~~d~~~~Av~~l~~~i~l~rE~~~A~lv~~~a~y~~~~k~~Lsgt----~~wsd~~SDPi~~ 155 (309) T protein:vir:99 80 DHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGA----DQWSDPTSNPLPV 155 (309) T ss_pred ccceeecCCchhhhhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcChhhcCCCceEEecCc----cccCCCCCCcHHH Confidence 4445556666555543 467788888888887776666543322111111 1111111 1112234445556 Q ss_pred HHHHHHHhhhhccCCCEEEEcHHHHHHHHH---h----hccCCcc-cccCcccCCCCceecce-eeEEeccccc----cc Q lcl|NC_012784. 277 IKDAINLNVKPNYEHNVAIVSQTMFAKLDK---M----KDKLGNY-LIQPDVKEKTQQRLLGA-KIEILPDEVL----GQ 343 (415) Q Consensus 277 ~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~---l----kd~~G~~-l~~~~~~~~~~~~l~G~-pV~~~~~~~~----~~ 343 (415) +...+..+ ++.++.++|....|.+|+. + +-..+.. ++.+. .-..++|. .|++-...-. +. T Consensus 156 i~~~~~~~---g~~PN~~vlg~~~~~~l~~hp~i~~~ik~~~~~~g~it~~----~la~l~~ve~V~vg~a~~n~a~~g~ 228 (309) T protein:vir:99 156 ITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMA----FLQELLELDAIYIGEARLNIARPGQ 228 (309) T ss_pred HHHHHHhh---CCCcceEEechHHHHHHhhCHHHHHHhcCCCccccccCHH----HHHHHhCcceEEeecceeecccccc Confidence 66665543 7899999999999998864 2 2222111 11111 11224444 2333221110 00 Q ss_pred cCCceEEEech----------------hhcEEE--EeecceEEEEeecc-cCceEEEEEEEeccEEeccccEEEEEeecC Q lcl|NC_012784. 344 KGNNTLIIGNL----------------KDAIVL--FDRSQYQASWTDYM-HFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) Q Consensus 344 ~~~~~~~~gd~----------------~~~~~~--~~~~~~~i~~~~~~-~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~ 404 (415) .++..-+-|+. +-+++. ..|..-.+....+. .....+|+..++.-.+.-+++-.+++=..+ T Consensus 229 ~~~~~~iwg~~~~L~y~~~~~~~~~~ps~G~t~~~~~r~~g~~~d~~~~~~g~~~vr~~~~~k~~i~~~d~G~li~~~va 308 (309) T protein:vir:99 229 NPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAVA 308 (309) T ss_pred ccccccccCCcEEEEEcCCCCCCcccccccceeecccccCCceeeeeeccCCceEEEEeccccchhcchhcchhhhhccc Confidence 00000011110 000000 00111111000000 111223333333333444444333331111 Q ss_pred C Q lcl|NC_012784. 405 E 405 (415) Q Consensus 405 ~ 405 (415) . T Consensus 309 ~ 309 (309) T protein:vir:99 309 A 309 (309) T ss_pred C Confidence 1 No 238 >protein:vir:100603 Length: 529 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656387;genbank:gi:109290138;genbank:GeneID:4156581 Probab=71.27 E-value=0.2 Score=24.44 Aligned_cols=359 Identities=14% Similarity=0.113 Sum_probs=123.7 Q ss_pred HHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccchhhhhhHHHHHHHHHHHH Q lcl|NC_012784. 18 IDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQANINDLGISIQ 97 (415) Q Consensus 18 ~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 97 (415) +.-. +|+-.+++.-+.+- +. .-+|+..-++ ......-+++.+ +..+....+...-...+..... T Consensus 1 ~~~~--------~~~l~~kw~p~l~~-~~-~~~i~~~~~~-----~~~a~l~enq~~-~~~~~~~~~~~~~~e~~~~~l~ 64 (529) T protein:vir:10 1 MSLK--------TKEILNKWTPLLEG-EG-LPEIAGKNKQ-----ALVAQILEAQEK-DSKTDPVYRDDKLIEAFGQSLM 64 (529) T ss_pred Cccc--------hHHHHHHhhHhhcC-Cc-cchhcchhhh-----hhhhhhhhhHHH-Hhhcccccchhhhhhhhhhccc Confidence 0000 01001111111000 00 0001100000 000000000000 0000000000000000000000 Q ss_pred HhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHh---hhhhhhhcceeEEccCCceeE-----EE Q lcl|NC_012784. 98 NTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKE---VEFNLDKYVTVKRVTNGSGKY-----PV 169 (415) Q Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~---~~~~l~~~~~~~~~~~~~~~~-----~~ 169 (415) ....... +......... +.+.+++ ..+.+.++.++| +.....+++.++||+++++-+ .+ T Consensus 65 e~~~~~~---------~~~~~~~ia~-s~~t~~v---~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY 131 (529) T protein:vir:10 65 EAEVAGD---------HGYDPTNIAA-GQSSGAI---TNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVY 131 (529) T ss_pred hhhcccc---------cccccccccc-ccccccc---ccccchhhhhHHHHHHhHHhhhhheeccCCchhhhhhhheeee Confidence 0000000 0000000011 1111111 123333333333 344555688889988877643 11 Q ss_pred EeecCC--------------------------------------------------------------------ccc--- Q lcl|NC_012784. 170 VRQSEV--------------------------------------------------------------------AAL--- 178 (415) Q Consensus 170 ~~~~~~--------------------------------------------------------------------~~a--- 178 (415) ...... ..+ T Consensus 132 ~~~~~~~~g~eaf~~~~e~dt~~SG~~~~~~~~~~~~~~~~~~t~~~a~~~~~~~~~~~nea~t~~s~~~tg~~~~~g~~ 211 (529) T protein:vir:10 132 GKDPLAAGAKEAFHPMYAPDAWHSGLAAKGATTSSDGTPFAALTAGQAVATGDIVYHFFYESGSAYLQNVTGGNVTVGTN 211 (529) T ss_pred cCCcCCCcccccccccccccccccccccccccccccccccccccccceeeccccceeeeccccccccccccccccccccc Confidence 110000 000 Q ss_pred -------------------cccccccc------c---cccccccceeeEeeeeeEEE-------eehhhHHHHhcc---- Q lcl|NC_012784. 179 -------------------EKVEELEE------N---PELAVKPFFQLAYDINTHRG-------YFRISREAIEDA---- 219 (415) Q Consensus 179 -------------------~~v~Eg~~------~---~~~~~~~f~~v~~~~~k~a~-------~~~iS~e~l~ds---- 219 (415) ..+++|-. . ...+...|.+..+...|..+ ...+|-||.+|- T Consensus 212 ~tg~~~~~~~~~~~a~~~~~~~~~gmsTa~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAvH 291 (529) T protein:vir:10 212 ETGAALDALVSAKIAAGELAEIAEGMATSIAELRQGFNGTTDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVH 291 (529) T ss_pred ccCCccccccccccccccccccccccchhhhhccccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhc Confidence 00000000 0 00112235555565555554 447999999983 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccc----ccccccccc-------chhhHHHHH-------HHH Q lcl|NC_012784. 220 KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFE----KEGKKLEVK-------KAKSLDDIK-------DAI 281 (415) Q Consensus 220 ~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~----~~~~~~~~~-------~~~~~~~~~-------~~~ 281 (415) .+|.++.|.+-|+..|...++++||.-......-+. .+.. ......... .-...+-++ ++. T Consensus 292 GLDAEtELsNILStEImlEINReii~~i~~~a~~~~-~g~~~~~~~~~gv~d~~~~~d~~~~~~~~e~~~~L~~~i~~~a 370 (529) T protein:vir:10 292 GMDADSELNGILANEVMLEINREVIDWINYTAQVGK-SGWTQTVGSAAGVFDFQDPIDVRGARWAGESYKALLIQIDKEA 370 (529) T ss_pred CCChHHHHHHHHHHHHHHHhhHHHHHHhhhhceeee-eeeeccccccccceeccccccccccchhHHHHHHHHHHHHHHH Confidence 457899999999999999999999973222211100 0100 000000000 011111122 222 Q ss_pred HHhhh--hccCCCEEEEcHHHHHHHHHh--hccCCccc----ccCcccCC-CCceec-ceeeEEeccccccccCCceEEE Q lcl|NC_012784. 282 NLNVK--PNYEHNVAIVSQTMFAKLDKM--KDKLGNYL----IQPDVKEK-TQQRLL-GAKIEILPDEVLGQKGNNTLII 351 (415) Q Consensus 282 ~~~~~--~~~~~~~~v~~~~~~~~l~~l--kd~~G~~l----~~~~~~~~-~~~~l~-G~pV~~~~~~~~~~~~~~~~~~ 351 (415) +.+.. .+...+.+++++.....|..+ .+.-+-.- |..+.+.+ ..+.|. |++|.+..+++. ..+++ T Consensus 371 n~I~~~T~rg~~n~vi~S~~Va~~L~~~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~-----dy~~v 445 (529) T protein:vir:10 371 NEIARQTGRGAGNFIIASRNVVSALALVDAGITPAAQGMASGLNADTTKGVFAGVLGGRYKVYIDQYARQ-----DYFTM 445 (529) T ss_pred HHHHHhhccccceEEEEchHHHHHHhhhccccccccccccccceeecCCceEEEEecCceEEEecCCCCc-----ceEEE Confidence 22222 223567899999999998742 21111100 11111111 123443 577777776542 23444 Q ss_pred echh-----hcEEEEeecceE-EEEeecccCceEEEEEEEeccEEeccccEEEEEeecC----CCCcccccccC Q lcl|NC_012784. 352 GNLK-----DAIVLFDRSQYQ-ASWTDYMHFGECLMIAVRQDCRILDYKSAIVIEYDDS----ERGEGDLGLEA 415 (415) Q Consensus 352 gd~~-----~~~~~~~~~~~~-i~~~~~~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~----~~~~~~~~~~~ 415 (415) |.=- ........-.+. ....|...|+-.+-...|+++.+ +| |... .+.. -..+.|....| T Consensus 446 G~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP--~~~~-~~~~~~~r~~~g~~~~~~a 515 (529) T protein:vir:10 446 GYRGANNLDAGIYYCPYVALTPLRGSDPKNFQPVMGFKTRYAIGV-NP--FAES-RTQAPTSRISNGMPGAHSV 515 (529) T ss_pred EEeCCcccccceeeccccccccccccCCCcccceeeeeeeeceee-cC--cccc-ccccccccccCCcchhhhc Confidence 4200 000000001111 11234455555555666776643 34 1111 1111 12222333333 No 239 >protein:vir:80835 Length: 464 # NCBI annotation: putative major capsid protein # Family: family:all:2450 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504125;genbank:gi:158079312;genbank:GeneID:5666484 Probab=66.68 E-value=0.26 Score=23.76 Aligned_cols=313 Identities=12% Similarity=0.031 Sum_probs=118.9 Q ss_pred ccccccchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhh--hh Q lcl|NC_012784. 73 QSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVE--FN 150 (415) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~--~~ 150 (415) ...+.+.+...... .....+.+. ........+..+++.+=-+.+.++|..+.-.. .. T Consensus 1 ~~~~~n~~~~~~~~---------------~e~~~Ks~t------tgy~~~p~~q~~~~AlRrEsL~~~i~~Lt~~~~~f~ 59 (464) T protein:vir:80 1 MTEKKNTERQLTSV---------------QEEVIKGFT------TGYGITPESQTDAAALRREFLDDQITMLTWADGDLS 59 (464) T ss_pred CCcchhhHhhcCcc---------------cHHHHHHHH------hCCccCcccccCcchhhhhhhhhhhheeeecccchh Confidence 00000000000000 000001111 00112223344455554444555443322221 12 Q ss_pred hhhcceeEEccCCceeEEEEeecCC-cccccccccccccccccccceeeEeeeeeEEEeehhhHHH-HhcchHHHHHHHH Q lcl|NC_012784. 151 LDKYVTVKRVTNGSGKYPVVRQSEV-AALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREA-IEDAKVNVLQELK 228 (415) Q Consensus 151 l~~~~~~~~~~~~~~~~~~~~~~~~-~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~-l~ds~~~l~~~l~ 228 (415) +..-....+..+--..|......+. .-..++.|++.. +.+++.+.......+-+...--+|.-+ |.++..+-...+. T Consensus 60 f~~di~k~~a~STV~~y~~~~~~G~~g~~~f~~E~g~~-~~~d~~~~Rr~~~~Kfl~~~r~vsia~~lvn~~~d~~~~~~ 138 (464) T protein:vir:80 60 FYRDITKRPATSTVAKYDVYLAHGRVGHTRFTREIGVA-PISDPNLRQKTVNMKYVSDTKNMSIATGLVNNIEDPMRILT 138 (464) T ss_pred hhhhcCCchhhhhhhhhheeeccCcccccccccccccc-ccCCCceEEEEEEeeeeecceeeeeehhhhcchhhHHHHHH Confidence 3333344445444445544433343 556788999975 478899999998887665433333322 2334556677888 Q ss_pred HHHHHHHHHHHHHHHhhccccccccccc---c---ccc---cccccccccc-hhhHHHHHHHHHHhhhhccCCCEEEEcH Q lcl|NC_012784. 229 LWMARTIAATRNKAIIDVITKGSTGSTS---S---GFE---KEGKKLEVKK-AKSLDDIKDAINLNVKPNYEHNVAIVSQ 298 (415) Q Consensus 229 ~~la~~~~~~~d~~il~g~g~~~~~~~~---~---~~~---~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~v~~~ 298 (415) +.-.-.++..+|.+.+.|+..=.+.++. + |.. ........-+ ..+.+.+..+-..+...+..++-++|+. T Consensus 139 ~dai~~va~tiE~a~FyGds~l~~~~~~~~gleFDGl~~lI~~~NViDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~ 218 (464) T protein:vir:80 139 DDAISVVAKTIEWASFYGDSDLSENPDAGSGLEFDGLAKLIDKHNVLDAKGASLTEALLNQASVLVGKGYGTPTDAYMPI 218 (464) T ss_pred HHHHHHHHHHHHHHHhhhccccCCCCCCccccchhhhHhhcCCCceeecCCCCcCHHHHhhhhhhhhcccCChhhcccch Confidence 8888899999999999997654432111 0 000 0111111111 2223333344444445566677788888 Q ss_pred HHHHHH-HHhhccCCcccccCcccCCCCceecceeeEEeccccccccCCc-eEEEechhhcEEEEee-------cceEEE Q lcl|NC_012784. 299 TMFAKL-DKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNN-TLIIGNLKDAIVLFDR-------SQYQAS 369 (415) Q Consensus 299 ~~~~~l-~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~-~~~~gd~~~~~~~~~~-------~~~~i~ 369 (415) -+.+.+ ...=+.+ +.+. .+..++...|+||--.-.. .+...-. .-++.++. +....+ ...++. T Consensus 219 ~v~a~f~n~~l~~q--~~~~---~~n~~~~~~G~~v~~f~sa-~G~i~L~~s~~m~~~~--~ld~~~~~~~~apaapsvt 290 (464) T protein:vir:80 219 GVQADFVNQQLDRQ--VQVI---SDNGQNATMGFNVKGFNSA-RGFIRLHGSTVMELEQ--ILDENRMQLPNAPQKATVK 290 (464) T ss_pred hHHHHHHhhhcCce--eEEE---cCCCCcceeeeeccccccc-ccceeccCccccCccc--ccccccccCCCCcCCceeE Confidence 887665 3322221 1111 1111222445444321100 0000000 00111100 000000 001111 Q ss_pred E--ee--cccCceE-EEEEEEeccEEecc--ccEEEEEeecCCCCcccccccC Q lcl|NC_012784. 370 W--TD--YMHFGEC-LMIAVRQDCRILDY--KSAIVIEYDDSERGEGDLGLEA 415 (415) Q Consensus 370 ~--~~--~~~~~~~-~~~~~r~d~~v~~p--~a~~~~~~t~~~~~~~~~~~~~ 415 (415) + +. ...|... ..+...+-+++.+. ++...--.+++...+++-+..- T Consensus 291 ~tv~~~~~g~f~~~~~~~~~~Ykv~~vn~~GeS~ps~~~~~ti~~~~~~V~l~ 343 (464) T protein:vir:80 291 ATLEAGTKGKFRDEDLTIDTEYKVVVVSDDAESAPSDVASVVIDDKKKQVKLE 343 (464) T ss_pred EEecCCcccCCccccccceeEEEEEEECCCCccccceeeeeeecCcccEEEEE Confidence 1 10 0111111 11111111222221 1111111222222222222222 No 240 >protein:vir:96079 Length: 382 # NCBI annotation: hypothetical protein ORF023 # Family: family:all:1653 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294440;genbank:gi:149408337;genbank:GeneID:5237198 Probab=66.61 E-value=0.26 Score=23.75 Aligned_cols=328 Identities=8% Similarity=0.000 Sum_probs=120.3 Q ss_pred HhhhhhccccccccchhhhhhHHHHHHHHHHHHH--hhhh----hHHHHHHHH-----HHhhhhhhhhcccccccceeec Q lcl|NC_012784. 65 DGTSENNQQSVEVNEARTYRNQANINDLGISIQN--TKVT----SQEVRDFTE-----YLETRNDIQGGSLKTDSGFVVI 133 (415) Q Consensus 65 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~----~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~v 133 (415) +................................. -... ..+...+.+ ..........+.. +.++..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~--t~~~~g~ 78 (382) T protein:vir:96 1 MSHISKTHSRLAGRHAKPFDLKNVTHEAVAALGRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPV--TTPSIPT 78 (382) T ss_pred CCCcceeeeecCCccccchhhhcccHHHHHHHhccccccCcccchhHhhhhhhhhhhhhhcccccccCCcc--ccCCccH Confidence 0000000000000000000000000000000000 0000 000011100 0001111111122 2233345 Q ss_pred chhHHh----HHHHHHhhhhhhhhcceeEEccCCc-eeEEEEeecCCcccccccccccccccccccceeeEeeeeeEEEe Q lcl|NC_012784. 134 PEEIVT----DILKLKEVEFNLDKYVTVKRVTNGS-GKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGY 208 (415) Q Consensus 134 P~~~~~----~Ii~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~ 208 (415) |..+.+ .+++.+........++.+...+.-. ..+.+........+.+.+-+...|-.+ ......+...+.+... T Consensus 79 p~~~l~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~t~ty~~~e~~G~A~~ygd~~D~Pl~d-~~~~~~~r~v~~~~~g 157 (382) T protein:vir:96 79 PIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTS-WNANFERRTIVRGELG 157 (382) T ss_pred HHHHHhhhhhhhhhhhhhhhhhhhhccccccCCccceEEEEeeeecccceEEeecccCCCccc-cccceeEEEEEEEEEe Confidence 766554 4444444444444555443321111 123344444556677788888777433 3333444444555555 Q ss_pred ehh-hHHHHhc--chHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc---ccccccccc---ccccc----ccchhhHH Q lcl|NC_012784. 209 FRI-SREAIED--AKVNVLQELKLWMARTIAATRNKAIIDVITKGSTG---STSSGFEKE---GKKLE----VKKAKSLD 275 (415) Q Consensus 209 ~~i-S~e~l~d--s~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~---~~~~~~~~~---~~~~~----~~~~~~~~ 275 (415) +.+ ..|+.+- ...++.+--....++++.+.+|+-.+.|+..+... +...+.... ..... .+...-++ T Consensus 158 ~~yg~lE~~rAa~~~~~l~~~Ka~aA~~ale~~~N~i~f~G~~~g~~~~~yGllNdP~l~a~~t~a~~~Wa~kT~~eI~~ 237 (382) T protein:vir:96 158 LLVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPPFQTPPSQGWATADWAGIIG 237 (382) T ss_pred eeecHHHHHHHHhhCCCcHHHHHHHHHHHHHHhhceEEEEeeecCcCcceEEEEeCCCcccccccCCCCcccccHHHHHH Confidence 555 4555553 23466666677777888888888788886433221 111111100 01111 12222356 Q ss_pred HHHHHHHHhhhhcc-------CCCEEEEcHHHHHHHHHhhccCCcccccCcccCCCCceecceeeEEecccc-ccccC-- Q lcl|NC_012784. 276 DIKDAINLNVKPNY-------EHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEV-LGQKG-- 345 (415) Q Consensus 276 ~~~~~~~~~~~~~~-------~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~-~~~~~-- 345 (415) |+..++.++..... .+..++|.|..+..|.. .+..|.-++. -+.. .+.++.++..+.+. .+..+ T Consensus 238 Di~~l~~~i~~qt~G~~~~~~~~~~L~LP~~~~~~Ls~-~n~~g~Tvl~-~lk~----n~Pnl~i~t~peL~~a~~~g~g 311 (382) T protein:vir:96 238 DIREAVRQLRIQSQDQIDPKAEKITMALATSKVDYLSV-TTPYGISVSD-WIEQ----TYPKMRIVSAPELSGVQMQGKT 311 (382) T ss_pred HHHHHHHHHHhccCCeeeecccceEEeechHHHhhccc-cCccCccHHH-HHHH----hcCCcEEEEccccccccCCCcc Confidence 66666666654332 12368889988877753 2333322211 0111 11123333333321 11111 Q ss_pred Cce--EEEechhhcEEEEeecceEEEEee--cccC------ce--EEE--EEE-EeccEEeccccEEEEEee Q lcl|NC_012784. 346 NNT--LIIGNLKDAIVLFDRSQYQASWTD--YMHF------GE--CLM--IAV-RQDCRILDYKSAIVIEYD 402 (415) Q Consensus 346 ~~~--~~~gd~~~~~~~~~~~~~~i~~~~--~~~~------~~--~~~--~~~-r~d~~v~~p~a~~~~~~t 402 (415) ..- +++.+--.....+.. .....+.. -..+ .. .++ ... ..|+.+.+|.||++++=- T Consensus 312 ~~~~~~~~~~e~~~~~~~s~-~~p~~f~q~~p~~~~~l~ve~~~~~~~~~~s~~t~Gv~i~~P~ai~~~~GI 382 (382) T protein:vir:96 312 PEDALVLFVEEVDASVDGST-DGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) T ss_pred ceeEEEEecchhhhhccccc-ccCcceeccccceeeeccceeecceeEeccccceeeeEEEcchhhhhccCC Confidence 111 111110000000000 00000000 0000 00 000 011 256677889999998732 No 241 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=65.66 E-value=0.28 Score=23.62 Aligned_cols=380 Identities=10% Similarity=-0.033 Sum_probs=105.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccchhhhhhHH Q lcl|NC_012784. 8 QSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQA 87 (415) Q Consensus 8 ~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 87 (415) |+.+++|++++.+..+.+.....+.+..-.++-.++.+.+.++++.+++++++..+........................ T Consensus 1 mk~~~el~~~l~el~~~~~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:79 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccchh Confidence 77777887777777666665555444333333345667777888887777766555544333222221111111111111 Q ss_pred HHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeE Q lcl|NC_012784. 88 NINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKY 167 (415) Q Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~ 167 (415) ..........................+..............+ .+...-...++........+..+-...++......+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~ 158 (415) T protein:vir:79 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGG--SLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) T ss_pred hhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhc--cccccccccccchHHHHHHHHHHHhhhhhhhheeee Confidence 110011110001000111101111111111110011000001 111111111111111111111111111222111122 Q ss_pred EEEeecCC-ccccc--ccccccccccccc-cceeeEeeeeeEEEeehhhHHHHhcchHHHHHH-HHHHHHHHHHHHHHHH Q lcl|NC_012784. 168 PVVRQSEV-AALEK--VEELEENPELAVK-PFFQLAYDINTHRGYFRISREAIEDAKVNVLQE-LKLWMARTIAATRNKA 242 (415) Q Consensus 168 ~~~~~~~~-~~a~~--v~Eg~~~~~~~~~-~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~-l~~~la~~~~~~~d~~ 242 (415) ++...... +-..+ .....-..+.... ..+..++...++...-.-..--+.+...+...+ +...+...++.++... T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~ 238 (415) T protein:vir:79 159 RVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAAT 238 (415) T ss_pred eccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHH Confidence 22111111 00111 1111222222221 122334444444332211111222222222222 5666666777777665 Q ss_pred HhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccC---CcccccCc Q lcl|NC_012784. 243 IIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKL---GNYLIQPD 319 (415) Q Consensus 243 il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~---G~~l~~~~ 319 (415) +-...-.+...+................ .......-......+..+.+.. ..+++.+. T Consensus 239 ~~~~il~g~g~g~~~~~~~~~~~~~~~~-------------------~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~ 299 (415) T protein:vir:79 239 RNKAIIDVITKGSTGSTSSGFEKEGKKL-------------------EVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQT 299 (415) T ss_pred HHHHHhhccccCcccccccccccccccc-------------------ccccccchhHHHHHHHhhhhhccCCCEEEEcHH Confidence 5443333322222211111100000000 0000011111122222333322 12222111 Q ss_pred ccCC--CCceecceeeEEeccccccccCCceEEEechh---hcEEEEeecceEEEEeecccCceEEEEEEEeccEEe--- Q lcl|NC_012784. 320 VKEK--TQQRLLGAKIEILPDEVLGQKGNNTLIIGNLK---DAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRIL--- 391 (415) Q Consensus 320 ~~~~--~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~---~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~d~~v~--- 391 (415) .... .-..=.|.|++..+.. .+....++|-.= ............+-+-++..+.. ...|-+..+. T Consensus 300 ~~~~l~~lkd~~G~~l~~~~~~----~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~---~~~~~~~~v~~~~ 372 (415) T protein:vir:79 300 MFAKLDKMKDKLGNYLIQPDVK----EKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIV---LFDRSQYQASWTD 372 (415) T ss_pred HHHHHHHhhccCCceeeccCcC----CCCCceecceeeEEecccccCCCCccEEEEEehhccEE---EEeecceEEEEec Confidence 0000 0001135565533211 111112222110 00000011111222222221110 0111111110 Q ss_pred ---cc---ccEEEEEeecC---CCCcccccccC Q lcl|NC_012784. 392 ---DY---KSAIVIEYDDS---ERGEGDLGLEA 415 (415) Q Consensus 392 ---~p---~a~~~~~~t~~---~~~~~~~~~~~ 415 (415) +. -++.++..... +-=-..+++.+ T Consensus 373 ~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~ 405 (415) T protein:vir:79 373 YMHFGECLMIAVRQDCRILDYKSAIVIEYDDSE 405 (415) T ss_pred cccCceEEEEEEEeccEEeccccEEEEEEeccC Confidence 00 11122211111 11111222222 No 242 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=65.66 E-value=0.28 Score=23.62 Aligned_cols=380 Identities=10% Similarity=-0.033 Sum_probs=105.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccchhhhhhHH Q lcl|NC_012784. 8 QSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQA 87 (415) Q Consensus 8 ~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 87 (415) |+.+++|++++.+..+.+.....+.+..-.++-.++.+.+.++++.+++++++..+........................ T Consensus 1 mk~~~el~~~l~el~~~~~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:81 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccchh Confidence 77777887777777666665555444333333345667777888887777766555544333222221111111111111 Q ss_pred HHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeE Q lcl|NC_012784. 88 NINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKY 167 (415) Q Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~ 167 (415) ..........................+..............+ .+...-...++........+..+-...++......+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~ 158 (415) T protein:vir:81 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGG--SLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) T ss_pred hhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhc--cccccccccccchHHHHHHHHHHHhhhhhhhheeee Confidence 110011110001000111101111111111110011000001 111111111111111111111111111222111122 Q ss_pred EEEeecCC-ccccc--ccccccccccccc-cceeeEeeeeeEEEeehhhHHHHhcchHHHHHH-HHHHHHHHHHHHHHHH Q lcl|NC_012784. 168 PVVRQSEV-AALEK--VEELEENPELAVK-PFFQLAYDINTHRGYFRISREAIEDAKVNVLQE-LKLWMARTIAATRNKA 242 (415) Q Consensus 168 ~~~~~~~~-~~a~~--v~Eg~~~~~~~~~-~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~-l~~~la~~~~~~~d~~ 242 (415) ++...... +-..+ .....-..+.... ..+..++...++...-.-..--+.+...+...+ +...+...++.++... T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~ 238 (415) T protein:vir:81 159 RVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAAT 238 (415) T ss_pred eccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHH Confidence 22111111 00111 1111222222221 122334444444332211111222222222222 5666666777777665 Q ss_pred HhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccC---CcccccCc Q lcl|NC_012784. 243 IIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKL---GNYLIQPD 319 (415) Q Consensus 243 il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~---G~~l~~~~ 319 (415) +-...-.+...+................ .......-......+..+.+.. ..+++.+. T Consensus 239 ~~~~il~g~g~g~~~~~~~~~~~~~~~~-------------------~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~ 299 (415) T protein:vir:81 239 RNKAIIDVITKGSTGSTSSGFEKEGKKL-------------------EVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQT 299 (415) T ss_pred HHHHHhhccccCcccccccccccccccc-------------------ccccccchhHHHHHHHhhhhhccCCCEEEEcHH Confidence 5443333322222211111100000000 0000011111122222333322 12222111 Q ss_pred ccCC--CCceecceeeEEeccccccccCCceEEEechh---hcEEEEeecceEEEEeecccCceEEEEEEEeccEEe--- Q lcl|NC_012784. 320 VKEK--TQQRLLGAKIEILPDEVLGQKGNNTLIIGNLK---DAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRIL--- 391 (415) Q Consensus 320 ~~~~--~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~---~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~d~~v~--- 391 (415) .... .-..=.|.|++..+.. .+....++|-.= ............+-+-++..+.. ...|-+..+. T Consensus 300 ~~~~l~~lkd~~G~~l~~~~~~----~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~---~~~~~~~~v~~~~ 372 (415) T protein:vir:81 300 MFAKLDKMKDKLGNYLIQPDVK----EKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIV---LFDRSQYQASWTD 372 (415) T ss_pred HHHHHHHhhccCCceeeccCcC----CCCCceecceeeEEecccccCCCCccEEEEEehhccEE---EEeecceEEEEec Confidence 0000 0001135565533211 111112222110 00000011111222222221110 0111111110 Q ss_pred ---cc---ccEEEEEeecC---CCCcccccccC Q lcl|NC_012784. 392 ---DY---KSAIVIEYDDS---ERGEGDLGLEA 415 (415) Q Consensus 392 ---~p---~a~~~~~~t~~---~~~~~~~~~~~ 415 (415) +. -++.++..... +-=-..+++.+ T Consensus 373 ~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~ 405 (415) T protein:vir:81 373 YMHFGECLMIAVRQDCRILDYKSAIVIEYDDSE 405 (415) T ss_pred cccCceEEEEEEEeccEEeccccEEEEEEeccC Confidence 00 11122211111 11111222222 No 243 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=65.66 E-value=0.28 Score=23.62 Aligned_cols=380 Identities=10% Similarity=-0.033 Sum_probs=105.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccchhhhhhHH Q lcl|NC_012784. 8 QSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQA 87 (415) Q Consensus 8 ~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 87 (415) |+.+++|++++.+..+.+.....+.+..-.++-.++.+.+.++++.+++++++..+........................ T Consensus 1 mk~~~el~~~l~el~~~~~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:98 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccchh Confidence 77777887777777666665555444333333345667777888887777766555544333222221111111111111 Q ss_pred HHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeE Q lcl|NC_012784. 88 NINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKY 167 (415) Q Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~ 167 (415) ..........................+..............+ .+...-...++........+..+-...++......+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~ 158 (415) T protein:vir:98 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGG--SLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) T ss_pred hhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhc--cccccccccccchHHHHHHHHHHHhhhhhhhheeee Confidence 110011110001000111101111111111110011000001 111111111111111111111111111222111122 Q ss_pred EEEeecCC-ccccc--ccccccccccccc-cceeeEeeeeeEEEeehhhHHHHhcchHHHHHH-HHHHHHHHHHHHHHHH Q lcl|NC_012784. 168 PVVRQSEV-AALEK--VEELEENPELAVK-PFFQLAYDINTHRGYFRISREAIEDAKVNVLQE-LKLWMARTIAATRNKA 242 (415) Q Consensus 168 ~~~~~~~~-~~a~~--v~Eg~~~~~~~~~-~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~-l~~~la~~~~~~~d~~ 242 (415) ++...... +-..+ .....-..+.... ..+..++...++...-.-..--+.+...+...+ +...+...++.++... T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~ 238 (415) T protein:vir:98 159 RVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAAT 238 (415) T ss_pred eccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHH Confidence 22111111 00111 1111222222221 122334444444332211111222222222222 5666666777777665 Q ss_pred HhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccC---CcccccCc Q lcl|NC_012784. 243 IIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKL---GNYLIQPD 319 (415) Q Consensus 243 il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~---G~~l~~~~ 319 (415) +-...-.+...+................ .......-......+..+.+.. ..+++.+. T Consensus 239 ~~~~il~g~g~g~~~~~~~~~~~~~~~~-------------------~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~ 299 (415) T protein:vir:98 239 RNKAIIDVITKGSTGSTSSGFEKEGKKL-------------------EVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQT 299 (415) T ss_pred HHHHHhhccccCcccccccccccccccc-------------------ccccccchhHHHHHHHhhhhhccCCCEEEEcHH Confidence 5443333322222211111100000000 0000011111122222333322 12222111 Q ss_pred ccCC--CCceecceeeEEeccccccccCCceEEEechh---hcEEEEeecceEEEEeecccCceEEEEEEEeccEEe--- Q lcl|NC_012784. 320 VKEK--TQQRLLGAKIEILPDEVLGQKGNNTLIIGNLK---DAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRIL--- 391 (415) Q Consensus 320 ~~~~--~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~---~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~d~~v~--- 391 (415) .... .-..=.|.|++..+.. .+....++|-.= ............+-+-++..+.. ...|-+..+. T Consensus 300 ~~~~l~~lkd~~G~~l~~~~~~----~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~---~~~~~~~~v~~~~ 372 (415) T protein:vir:98 300 MFAKLDKMKDKLGNYLIQPDVK----EKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIV---LFDRSQYQASWTD 372 (415) T ss_pred HHHHHHHhhccCCceeeccCcC----CCCCceecceeeEEecccccCCCCccEEEEEehhccEE---EEeecceEEEEec Confidence 0000 0001135565533211 111112222110 00000011111222222221110 0111111110 Q ss_pred ---cc---ccEEEEEeecC---CCCcccccccC Q lcl|NC_012784. 392 ---DY---KSAIVIEYDDS---ERGEGDLGLEA 415 (415) Q Consensus 392 ---~p---~a~~~~~~t~~---~~~~~~~~~~~ 415 (415) +. -++.++..... +-=-..+++.+ T Consensus 373 ~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~ 405 (415) T protein:vir:98 373 YMHFGECLMIAVRQDCRILDYKSAIVIEYDDSE 405 (415) T ss_pred cccCceEEEEEEEeccEEeccccEEEEEEeccC Confidence 00 11122211111 11111222222 No 244 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=65.19 E-value=0.29 Score=23.55 Aligned_cols=358 Identities=11% Similarity=-0.004 Sum_probs=104.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccchhhh Q lcl|NC_012784. 4 KEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTY 83 (415) Q Consensus 4 ~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (415) +.||++++.+.++++.+..++..+....+. +..++..+.++++.++++.++++++++.++.................. T Consensus 1 m~e~~~~l~~~~~~~~~~~~~~~e~~~~~~-~~~~e~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~- 78 (390) T protein:vir:10 1 MTDITSKLEATLANVTDSLRAFGERAVRDG-ELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVSV- 78 (390) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhhc-ccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccch- Confidence 555555555544544444443322222111 122455677888888998888888777666554433322222111111 Q ss_pred hhHHHHHHHHHHHHHhhhhhHHHHHHHHHHh-hhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccC Q lcl|NC_012784. 84 RNQANINDLGISIQNTKVTSQEVRDFTEYLE-TRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTN 162 (415) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~ 162 (415) .. .... ................. ............+.+...-...+-++++..+-. .++.. -++.. T Consensus 79 ---~~---~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~--~~~~~---~~l~~ 145 (390) T protein:vir:10 79 ---GD---LFVA--SEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFIT--QPDAR---LTVRD 145 (390) T ss_pred ---hh---hhhh--hHHHHHHHHhhhhhhhhhhhHHHHHHHhhhcccccccccccchhHHHHHHH--HHHhh---chhhh Confidence 00 0000 00000000000011111 111111111111111111112233333322211 11111 11211 Q ss_pred CceeEEEEeecCCcccccccc------cccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHH Q lcl|NC_012784. 163 GSGKYPVVRQSEVAALEKVEE------LEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIA 236 (415) Q Consensus 163 ~~~~~~~~~~~~~~~a~~v~E------g~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~ 236 (415) .-..+++ +.....+..+ ..-..+.....-...++....+...-.-..--+.+...+-...|.+.+...++ T Consensus 146 ~~~~~~~----~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~~l~~~i~~~l~ 221 (390) T protein:vir:10 146 LIGSGRT----DSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQILSDAPQLASYMNNRLI 221 (390) T ss_pred hcceeec----cCCceEEEEEecCCcceeeecCCccccccccceeEEEEeeEEEEEeehhhHHHHHhHHHHHHHHHHHHH Confidence 1111121 1112222211 11122322222233444444443322211222333333333456666666666 Q ss_pred HHHHHHHhhccccccccc-cccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCc-- Q lcl|NC_012784. 237 ATRNKAIIDVITKGSTGS-TSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGN-- 313 (415) Q Consensus 237 ~~~d~~il~g~g~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~-- 313 (415) +++...+-...=.|.+.+ .+.+.................+.. ......+..+...... T Consensus 222 ~~~~~~~~~~il~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~-------------------~~~~~~~~~l~~~~~~~~ 282 (390) T protein:vir:10 222 RGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRV-------------------DQLRLAMLQASLAEYPAS 282 (390) T ss_pred HHHHHHHHHHHhhcCCCCccccccccccccccccccccccchH-------------------HHHHHHHHhhccccCCCC Confidence 666665543322222111 122222111111111111111111 1122223334333221 Q ss_pred -ccccCc-------ccCCCCceecceeeEEecccccc-ccCCceEEEechhhcEEEEeecceEEEEeecccCceEEEEEE Q lcl|NC_012784. 314 -YLIQPD-------VKEKTQQRLLGAKIEILPDEVLG-QKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAV 384 (415) Q Consensus 314 -~l~~~~-------~~~~~~~~l~G~pV~~~~~~~~~-~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~ 384 (415) .++.+. ..+. .|.|++..+....+ .....++++-++- . .-.+-+-++.. .+..+. T Consensus 283 ~~v~n~~~~~~L~~lkd~-----~g~~l~~~~~~~~~~~l~G~pv~~~~~~------p--~~~~~~gdf~~---~~~~~~ 346 (390) T protein:vir:10 283 GIVINPIDWAAIELAKDA-----NNQYLIGNARGTLTPTLWGLPVVATQAM------A--PGEFLVGAFDL---AAQIFD 346 (390) T ss_pred EEEEcHHHHHHHHHhhcC-----CCceeecCCcCcCCceecceeeEEcCCC------C--CCcEEEEeccc---eEEEEE Confidence 111111 1111 24444432111100 0001111211110 0 00111111111 111111 Q ss_pred EeccEEe--------ccc-----cEEEEEeecCCCCcccccccC Q lcl|NC_012784. 385 RQDCRIL--------DYK-----SAIVIEYDDSERGEGDLGLEA 415 (415) Q Consensus 385 r~d~~v~--------~p~-----a~~~~~~t~~~~~~~~~~~~~ 415 (415) |-+..+. ... +..++.+...-+..=-..+.| T Consensus 347 ~~~~~i~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 347 QWDARVEIGYVNDDFQRNMVTVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred ecceEEEEeecccccccCcEEEEEEEeeccEEeccccEEEEEeC Confidence 1111110 001 111111111111111112222 No 245 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=57.73 E-value=0.43 Score=22.60 Aligned_cols=373 Identities=9% Similarity=-0.026 Sum_probs=109.0 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccch Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) |+..++|++..+++++. .+..... .++..+..+++++..+++++++++++.+.+..+......+........... T Consensus 1 m~~~~~lee~~a~l~~~----~~~~~~~-~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 75 (419) T protein:vir:94 1 MPPTPTLEEQRAALLAR----LDDTSLT-TEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTP 75 (419) T ss_pred CCHHHHHHHHHHHHHHH----HHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccc Confidence 99998888776666443 2222222 122222334444455566666666666665555444332222111111111 Q ss_pred hhhhhHHHHHHHHHH-HHHhhhh-hHHHHHHHHHHhhhhhhhhcccccccceeecch-hHHhHHHHHHhhhhhhhhccee Q lcl|NC_012784. 81 RTYRNQANINDLGIS-IQNTKVT-SQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPE-EIVTDILKLKEVEFNLDKYVTV 157 (415) Q Consensus 81 ~~~~~~~~~~~~~~~-~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~-~~~~~Ii~~~~~~~~l~~~~~~ 157 (415) ............... ....... ..........................++...|. .+.+.++.......+.... .+ T Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~-~i 154 (419) T protein:vir:94 76 AEAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPL-LV 154 (419) T ss_pred cccccccchhhhhhhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhccccccccccCCcccccchhhhHHHHHHHhhhh-hh Confidence 111111111011111 0000111 111111111122222222222333334444443 2334444332222211111 11 Q ss_pred EEccCCceeEEEEeecCCcccccc-----------ccc--ccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHH Q lcl|NC_012784. 158 KRVTNGSGKYPVVRQSEVAALEKV-----------EEL--EENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVL 224 (415) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~a~~v-----------~Eg--~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~ 224 (415) .. .-.. ++. ......+. ..+ .-.++.....-...++....+...-.-..--+...-.+-. T Consensus 155 ~~---~~~~--~~~--~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~ 227 (419) T protein:vir:94 155 AD---LLDQ--QNA--DYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN 227 (419) T ss_pred hh---ccee--eec--cCCceeeeeeccccccccccCcccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHhH Confidence 11 0011 111 11111111 111 1122322222233444444443322111112333223333 Q ss_pred HHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHH Q lcl|NC_012784. 225 QELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKL 304 (415) Q Consensus 225 ~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l 304 (415) ..+...|...++.++-..+-...=.|.+.+.+.+............. ...........+..| T Consensus 228 ~~l~~~i~~~la~a~~~~~d~aii~G~G~~~p~Gi~~~~~~~~~~~~------------------~~~~~~t~~~~~~~l 289 (419) T protein:vir:94 228 SQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQP------------------KPTAPATDEPPLVDI 289 (419) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccc------------------ccccccccchhHHHH Confidence 45666677777777776665544444444444333322221111100 011111122223333 Q ss_pred H----HhhccCCcc-cccCcccCCCCcee------cceeeEEeccccccccCCceEEEechhhcEEEEee-cceEEEEee Q lcl|NC_012784. 305 D----KMKDKLGNY-LIQPDVKEKTQQRL------LGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDR-SQYQASWTD 372 (415) Q Consensus 305 ~----~lkd~~G~~-l~~~~~~~~~~~~l------~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~-~~~~i~~~~ 372 (415) . .+..+..++ .|.-++.. -..| .|.|..+.++...+. ...++|-. +...+. ..-.+-+-| T Consensus 290 ~~~~~~~~~~~~~~~~~v~n~~~--~~~l~~~k~~~~~~~~~~~~~~~~~---~~~l~G~p---V~~~~~~~~~~~~~gd 361 (419) T protein:vir:94 290 RRAKTVAEIAGFPPDGVVVHPQD--WESIELDQAPGSGVFRVIANVQGEA---TPRIWGLN---VVSTVAIAQGTALVGG 361 (419) T ss_pred HHHHHhhhhccCCCCEEEEcHHH--HHHHHHHhhcCCCceeecCCcccCC---Ccccccee---eEEcCCCCCccEEEee Confidence 3 233222211 11111100 0000 122222221111110 01122210 100000 001111111 Q ss_pred cccCceEEEEEEEeccEEe---------ccc-----cEEEEEeec-CCCCcccccccC Q lcl|NC_012784. 373 YMHFGECLMIAVRQDCRIL---------DYK-----SAIVIEYDD-SERGEGDLGLEA 415 (415) Q Consensus 373 ~~~~~~~~~~~~r~d~~v~---------~p~-----a~~~~~~t~-~~~~~~~~~~~~ 415 (415) +.. .+....|-+..+. ... +..++.+.. -+....=++.+| T Consensus 362 ~~~---~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~~a 416 (419) T protein:vir:94 362 FRQ---GATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAA 416 (419) T ss_pred ccc---eEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEeccccEEEEEecc Confidence 111 1111111111110 000 111111111 011111222222 No 246 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=54.16 E-value=0.51 Score=22.19 Aligned_cols=374 Identities=9% Similarity=-0.026 Sum_probs=88.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccchhhhhh-- Q lcl|NC_012784. 8 QSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRN-- 85 (415) Q Consensus 8 ~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 85 (415) ++.++++++++.++.+++.....+.+..--++-.++++.+.++++.++++++...+...................... T Consensus 1 mk~~~el~~~l~el~~~~~~~~~~~~~~~~~~~~e~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:94 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccch Confidence 666667777666666555555444333333333456667777777777776655444333332222111111111111 Q ss_pred HHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecch---hHHh-HHHHHHhhhhhhhhcceeEEcc Q lcl|NC_012784. 86 QANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPE---EIVT-DILKLKEVEFNLDKYVTVKRVT 161 (415) Q Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~---~~~~-~Ii~~~~~~~~l~~~~~~~~~~ 161 (415) ........................ ..+.............. +..-+. ...+ .+...+ +..+-...++. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~--e~~~~~~~~~~~~~~~~-~~~~~~~g~~~iP~~~~~~i-----i~~~~~~~~l~ 152 (415) T protein:vir:94 81 STYRNQANINDLGISIQNTKVTSQ--EVRDFTEYLETRNDIQG-GSLKTDSGFVVIPEEIVTDI-----LKLKEVEFNLD 152 (415) T ss_pred hhHHHHHHHHHHHhhhhhhhhhHH--HHHHHHHHhhhhhhhhh-hccccccccccCcHHHHHHH-----HHHHHhhhhhh Confidence 111111111100000000000000 00000000000000000 000000 0111 111111 11111111121 Q ss_pred CCceeEEEEeecCC-cccc--ccccccccccccccc-ceeeEeeeeeEEEeehhhHHHHhcchHHHHHH-HHHHHHHHHH Q lcl|NC_012784. 162 NGSGKYPVVRQSEV-AALE--KVEELEENPELAVKP-FFQLAYDINTHRGYFRISREAIEDAKVNVLQE-LKLWMARTIA 236 (415) Q Consensus 162 ~~~~~~~~~~~~~~-~~a~--~v~Eg~~~~~~~~~~-f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~-l~~~la~~~~ 236 (415) .....+++...... +... ...+..-..+....+ .+..++..-.+...-.-..--+.+...+...+ +.+.+...++ T Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~ 232 (415) T protein:vir:94 153 KYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMA 232 (415) T ss_pred hhcceeeccCCceeEEEEeecCCccceeccccccccccccccceeeEeeheeeeeechhhHHHHhhchHHHHHHHHHHHH Confidence 11112222111100 0000 111111122221111 11222333222221111111112221111112 4445555555 Q ss_pred HHHHHHHhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCC---c Q lcl|NC_012784. 237 ATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLG---N 313 (415) Q Consensus 237 ~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G---~ 313 (415) ..+...+-...-.+...+.............. ... .+....-......+..+.+... . T Consensus 233 ~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~--~~~-----------------~~~~~~~~~i~~~~~~~~~~~~~~~~ 293 (415) T protein:vir:94 233 RTIAATRNKAIIDVITKGSTGSTSSGFEKEGK--KLE-----------------VKKAKSLDDIKDAINLNVKPNYEHNV 293 (415) T ss_pred HHHHHHHHHHHhhccccCcccccccccccccc--ccc-----------------cccccchHHHHHHHHhhhhhccCCCE Confidence 55555433222222111111111000000000 000 0000001111222223332221 1 Q ss_pred ccccCcccCC--CCceecceeeEEeccccccccCCceEEEech---hhcEEEEeecceEEEEeecccCceEEEEEEEecc Q lcl|NC_012784. 314 YLIQPDVKEK--TQQRLLGAKIEILPDEVLGQKGNNTLIIGNL---KDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDC 388 (415) Q Consensus 314 ~l~~~~~~~~--~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~---~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~d~ 388 (415) +++.+..... .-..-.|.|++..+.. . +....+.|-. .............+-+-++..+ +....|-+. T Consensus 294 ~vmn~~~~~~l~~lkd~~G~~l~~~~~~-~---~~~~~l~G~pV~~~~~~~~~~~~~~~i~~gd~~~~---~~~~~~~~~ 366 (415) T protein:vir:94 294 AIVSQTMFAKLDKMKDKLGNYLIQPDVK-E---KTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDA---IVLFDRSQY 366 (415) T ss_pred EEEcHHHHHHHHHhhccCCCeeeccCcC-C---CCCceecceeeEEecccccCCCCccEEEEEehhcc---EEEEeecce Confidence 1211110000 0001135555432110 0 1101122210 0000000000111111111111 000011111 Q ss_pred EEe------cc---ccEEEEEeec---CCCCcccccccC Q lcl|NC_012784. 389 RIL------DY---KSAIVIEYDD---SERGEGDLGLEA 415 (415) Q Consensus 389 ~v~------~p---~a~~~~~~t~---~~~~~~~~~~~~ 415 (415) .+. +. .++.++.+.. .+--...+++++ T Consensus 367 ~v~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~ 405 (415) T protein:vir:94 367 QASWTDYMHFGECLMIAVRQDCRILDYKSAIVIEYDDSE 405 (415) T ss_pred EEEEeccccCceEEEEEEEeccEEeccccEEEEEEeccC Confidence 100 00 0111211111 111111122222 No 247 >protein:vir:101811 Length: 529 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238888;genbank:gi:66391963;genbank:GeneID:3416638 Probab=53.01 E-value=0.54 Score=22.05 Aligned_cols=361 Identities=15% Similarity=0.090 Sum_probs=123.9 Q ss_pred HHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccchhhhhhHHHHHHHHHHHH Q lcl|NC_012784. 18 IDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQANINDLGISIQ 97 (415) Q Consensus 18 ~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 97 (415) +.-. +|.-.+++.-+.+- +. .-+|+..-++ ......-+++.+... +....+...-...+..... T Consensus 1 ~~~~--------~~~l~~kw~p~l~~-~~-~~~i~~~~~~-----~~~a~l~enq~~~~~-~~~~~~~~~~~e~~~~~l~ 64 (529) T protein:vir:10 1 MSLK--------NKEILNKWTPLLEG-EG-LPEIAGKNKQ-----ALVAQILEAQEKDSK-SDPVYRDDKLIEAFGQSLM 64 (529) T ss_pred Cccc--------hHHHHHHhhHhhcC-Cc-cchhccchhh-----hhhhhhhhhhHHHHh-cccccchhhhhhhhhccch Confidence 0000 00001111111000 00 0001000000 000000000000000 0000000000000000000 Q ss_pred HhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHH---hhhhhhhhcceeEEccCCceeEEEEee-- Q lcl|NC_012784. 98 NTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLK---EVEFNLDKYVTVKRVTNGSGKYPVVRQ-- 172 (415) Q Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~---~~~~~l~~~~~~~~~~~~~~~~~~~~~-- 172 (415) ...... .+.-....... +...+.+ ..+.+.++.++ .+.....+++.++||+++++-+=-.+. T Consensus 65 e~~~~~---------~~~~~~~~i~~-st~t~~v---~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY 131 (529) T protein:vir:10 65 EAEVAG---------DHGYDPTNIAA-GQSSGAI---TNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVY 131 (529) T ss_pred hhcccc---------ccccccccccc-ccccccc---cccCchhhhhHHHHHHhhhhhhhheeccCCchhhhhheeeeee Confidence 000000 00000000001 1111111 12233333333 344455678889999887654311111 Q ss_pred ---cCCc--------------------------------------------------------------------c---- Q lcl|NC_012784. 173 ---SEVA--------------------------------------------------------------------A---- 177 (415) Q Consensus 173 ---~~~~--------------------------------------------------------------------~---- 177 (415) .... . T Consensus 132 ~~~~~~~~~~eaf~~~~~pda~~sga~~~ga~t~~~~t~~~~~ta~~~~a~g~g~ea~f~ea~t~fs~~~~g~~~~~g~~ 211 (529) T protein:vir:10 132 GKDPLAAGAKEAFHPMYAPDAWHSSLATKGATTTTDGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGASVTVGTN 211 (529) T ss_pred cCCcccccccccccccccccccccccccccccccccccccccccccccccccccceeeecccCceeeccccccccccCcc Confidence 0000 0 Q ss_pred ------------------cccccccccc---------cccccccceeeEeeeeeEEE-------eehhhHHHHhcc---- Q lcl|NC_012784. 178 ------------------LEKVEELEEN---------PELAVKPFFQLAYDINTHRG-------YFRISREAIEDA---- 219 (415) Q Consensus 178 ------------------a~~v~Eg~~~---------~~~~~~~f~~v~~~~~k~a~-------~~~iS~e~l~ds---- 219 (415) ...+++|-.. ...+...|.+..+...|..+ ...+|-||.+|- T Consensus 212 ~t~~~~~~~~~~~~a~~~~~~~~~GmsTa~aEaL~~~ggss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVH 291 (529) T protein:vir:10 212 ETGEALDKLINAAIGEGKLAEIAEGMATSIAELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVH 291 (529) T ss_pred ccCcccccccccccccccccccccchhhhhhhccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhc Confidence 0000000000 00112235555666555554 447999999983 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccc---ccccccc-------chhhHHHHH-------HHHH Q lcl|NC_012784. 220 KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKE---GKKLEVK-------KAKSLDDIK-------DAIN 282 (415) Q Consensus 220 ~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~---~~~~~~~-------~~~~~~~~~-------~~~~ 282 (415) .+|.++.|.+-|+..|...++++||.-.-+....+...+.... ....... .-...+-++ ++.+ T Consensus 292 GLDAEtELsNILStEImlEINReii~~l~~~a~~~~~~~~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~~~L~~~i~~~an 371 (529) T protein:vir:10 292 GMDADSELNGILANEVMLEINREVIDWINYTAQVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQIDKEAN 371 (529) T ss_pred CCChHHHHHHHHHHHHHHHhhHHHHHHHhhhhhhhccccccccccccceeecccCccccccchHHHHHHHHHHHHHHHHH Confidence 4578999999999999999999998654332211111111000 0001100 111111122 2222 Q ss_pred Hhhh--hccCCCEEEEcHHHHHHHHHh---------hccCCcccccCcccCCCCceec-ceeeEEeccccccccCCceEE Q lcl|NC_012784. 283 LNVK--PNYEHNVAIVSQTMFAKLDKM---------KDKLGNYLIQPDVKEKTQQRLL-GAKIEILPDEVLGQKGNNTLI 350 (415) Q Consensus 283 ~~~~--~~~~~~~~v~~~~~~~~l~~l---------kd~~G~~l~~~~~~~~~~~~l~-G~pV~~~~~~~~~~~~~~~~~ 350 (415) .+.. .+...+.+++++.....|... ....|. ..++......+.|. |++|.+..+++. ..++ T Consensus 372 ~I~~~T~rg~~n~vi~S~~Va~~L~~~~~~~~~~~~~~~sg~--~~d~~~~~~~G~l~~~~~vy~D~y~~~-----dy~~ 444 (529) T protein:vir:10 372 EIARQTGRGAGNFIIASRNVVSALALIDTNISPAAQGMASGL--NADTTKGVFAGILGGRYKVYIDQYARQ-----DYFT 444 (529) T ss_pred HHHHhhccccceEEEEchHHHHHHHhhccccccccccccccc--ccccCCceEEEEecCceEEEecCCCCc-----ceEE Confidence 2222 223567899999998888742 111111 01111111123443 578887776542 2344 Q ss_pred Eechh-----hcEEEEeecce-EEEEeecccCceEEEEEEEeccEEeccccEEEEEe-ecCCCCcccccccC Q lcl|NC_012784. 351 IGNLK-----DAIVLFDRSQY-QASWTDYMHFGECLMIAVRQDCRILDYKSAIVIEY-DDSERGEGDLGLEA 415 (415) Q Consensus 351 ~gd~~-----~~~~~~~~~~~-~i~~~~~~~~~~~~~~~~r~d~~v~~p~a~~~~~~-t~~~~~~~~~~~~~ 415 (415) +|.=- ........-.+ -+...|...|+-.+-...|+++.+ +|=+.-.-.. .+--..+.|....| T Consensus 445 vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP~~~~~~~~~~~r~~~g~~~~~~a 515 (529) T protein:vir:10 445 MGYRGANNLDAGIYYCPYVALTPLRGFDPKNFQPVMGFKTRYAIGV-NPFAESRTQAPQGRITSGMPGVNSV 515 (529) T ss_pred EEEeCCcccccceeeccccccccccccCCCcccceeeeeeeeceee-cCccccccccccccccCCcchhhhc Confidence 44200 00000000000 111233445555555556766543 3322111100 00112344444455 No 248 >protein:vir:101039 Length: 529 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932516;genbank:gi:37651642;genbank:GeneID:2610532 Probab=52.79 E-value=0.55 Score=22.03 Aligned_cols=361 Identities=15% Similarity=0.084 Sum_probs=124.0 Q ss_pred HHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccchhhhhhHHHHHHHHHHHH Q lcl|NC_012784. 18 IDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQANINDLGISIQ 97 (415) Q Consensus 18 ~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 97 (415) +.-. ++.-.+++.-+.+- +. .-+|+..-++ ......-+++.+... +....+...-...+..... T Consensus 1 ~~~~--------~~~l~~kw~p~l~~-~~-~~~i~~~~~~-----~~~a~l~enq~~~~~-~~~~~~~~~~~e~~~~~l~ 64 (529) T protein:vir:10 1 MSLK--------NKEILNKWTPLLEG-EG-LPEIAGKNKQ-----ALVAQILEAQEKDSK-SDPVYRDDKLIEAFGQSLM 64 (529) T ss_pred Cccc--------HHHHHHHhHHHhcC-Cc-cchhccchhh-----hhhhhhhhhhHHHHh-hccccchhhhhhhhhcccc Confidence 0000 00000111111000 00 0000000000 000000000000000 0000000000000000000 Q ss_pred HhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHH---hhhhhhhhcceeEEccCCceeE-----EE Q lcl|NC_012784. 98 NTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLK---EVEFNLDKYVTVKRVTNGSGKY-----PV 169 (415) Q Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~---~~~~~l~~~~~~~~~~~~~~~~-----~~ 169 (415) ...... .+.-....... +...+.+ ..+.+.++.++ .+.....+++.++||+++++-+ .+ T Consensus 65 ~~~~~~---------~~~~~~~~i~e-st~t~~v---~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY 131 (529) T protein:vir:10 65 EAEVAG---------DHGYDPTNIAA-GQSSGAI---TNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVY 131 (529) T ss_pred hhhccc---------ccccccccccc-ccccccc---cccCchhhhhHHHHHhhhhhheeeeeecCCchhhhhhhhheee Confidence 000000 00000000000 1111111 12333333333 3344555688889988775432 11 Q ss_pred EeecCC--------------------------------------------------------------------c----- Q lcl|NC_012784. 170 VRQSEV--------------------------------------------------------------------A----- 176 (415) Q Consensus 170 ~~~~~~--------------------------------------------------------------------~----- 176 (415) ...... . T Consensus 132 ~~~~~~~~~~eaf~~~y~Pda~~sga~~~ga~~~~~~~~~~~~t~~~~~a~~~g~ea~f~ea~t~fs~~~~g~~~~~g~~ 211 (529) T protein:vir:10 132 GKDPLAAGAKEAFHPMYAPDAWHSSLATKGATTTTDGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGASVTVGTN 211 (529) T ss_pred cCCccccccccccccccccccccccccccccccccCccccccccccccccccCcceeeeecccceecccccccccccCcc Confidence 000000 0 Q ss_pred -----------------ccccccccccc---------cccccccceeeEeeeeeEEE-------eehhhHHHHhcc---- Q lcl|NC_012784. 177 -----------------ALEKVEELEEN---------PELAVKPFFQLAYDINTHRG-------YFRISREAIEDA---- 219 (415) Q Consensus 177 -----------------~a~~v~Eg~~~---------~~~~~~~f~~v~~~~~k~a~-------~~~iS~e~l~ds---- 219 (415) ....+++|-.. ...+...|.+..+...|..+ ...+|-||.+|- T Consensus 212 ~~~~~~~~~~~~~~a~~~~~~~~~Gm~Ta~aEaL~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVH 291 (529) T protein:vir:10 212 ETGEALDKLINAAIGEGKLAEIAEGMATSIAELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVH 291 (529) T ss_pred ccCcccccccccccccccccccccccchhhhhccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhc Confidence 00000111000 00112235555555555554 447999999983 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccc----cccc-cccccc-----chhhHHHHH-------HHHH Q lcl|NC_012784. 220 KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGF----EKEG-KKLEVK-----KAKSLDDIK-------DAIN 282 (415) Q Consensus 220 ~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~----~~~~-~~~~~~-----~~~~~~~~~-------~~~~ 282 (415) .+|.++.|.+-|+..|...++++||.-.-+-...+...+. ...+ .....+ .-...+-++ ++.+ T Consensus 292 GLDAEtELsNILStEImlEINReii~~l~~~a~~~k~~g~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~k~L~~~i~~~an 371 (529) T protein:vir:10 292 GMDADSELNGILANEVMLEINREVIDWINYTAQVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQIDKEAN 371 (529) T ss_pred CCChHHHHHHHHHHHHHHHhhHHHHHhHhhhhhhhhcccccccccccceeecccCccccccchHHHHHHHHHHHHHHHHH Confidence 4578999999999999999999998653322211111100 0000 000000 111111122 2222 Q ss_pred Hhhh--hccCCCEEEEcHHHHHHHHHh--h-------ccCCcccccCcccCCCCceec-ceeeEEeccccccccCCceEE Q lcl|NC_012784. 283 LNVK--PNYEHNVAIVSQTMFAKLDKM--K-------DKLGNYLIQPDVKEKTQQRLL-GAKIEILPDEVLGQKGNNTLI 350 (415) Q Consensus 283 ~~~~--~~~~~~~~v~~~~~~~~l~~l--k-------d~~G~~l~~~~~~~~~~~~l~-G~pV~~~~~~~~~~~~~~~~~ 350 (415) .+.. .+...+.+++++.....|... . ...|. ..++...-..+.|. |++|.+..+++. ..++ T Consensus 372 ~I~~~T~rg~~n~vi~S~~Va~~L~~~~~~~~~~~~~~~sg~--~~d~~~~~~~G~l~~~~~vy~D~y~~~-----dy~~ 444 (529) T protein:vir:10 372 EIARQTGRGAGNFIIASRNVVSALALIDTNISPAAQGMASGL--NADTTKGVFAGILGGRYKVYIDQYARQ-----DYFT 444 (529) T ss_pred HHHHhhccccceEEEEchHHHHHHHhhhhhcccccccccccc--ccccCCceEEEEecCceEEEecCCCCc-----ceEE Confidence 2222 223567899999999888742 1 11111 01111111123443 578887776542 2344 Q ss_pred Eechh-----hcEEEEeecceE-EEEeecccCceEEEEEEEeccEEeccccEEEEEe-ecCCCCcccccccC Q lcl|NC_012784. 351 IGNLK-----DAIVLFDRSQYQ-ASWTDYMHFGECLMIAVRQDCRILDYKSAIVIEY-DDSERGEGDLGLEA 415 (415) Q Consensus 351 ~gd~~-----~~~~~~~~~~~~-i~~~~~~~~~~~~~~~~r~d~~v~~p~a~~~~~~-t~~~~~~~~~~~~~ 415 (415) +|.=- ........-.+. +...|...|+-.+-...|+++.+ +|=+.-.-.. .+--..+.|....| T Consensus 445 vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP~~~~~~~~~~~r~~~g~~~~~~a 515 (529) T protein:vir:10 445 MGYRGANNLDAGIYYCPYVALTPLRGSDPKNFQPVMGFKTRYAIGV-NPFAESRTQAPQGRITSGMPGVNSV 515 (529) T ss_pred EEEeCCcccccceeeccccccccccccCCCcccceeeeeeeeceee-cCccccccccccccccCCcchhhhc Confidence 44200 000000001111 12234455555566666776653 3422111000 00112344444555 No 249 >protein:vir:106286 Length: 534 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944113;genbank:gi:38640157;genbank:GeneID:2658034 Probab=52.68 E-value=0.55 Score=22.02 Aligned_cols=365 Identities=13% Similarity=0.098 Sum_probs=130.1 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhhhhccccccccc Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQI-QEKQEELDKLKEKDGTSENNQQSVEVNE 79 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (415) |+ .++|+++..-+.+. +...++....++ ++-++| +.+++.+ ++..+ ... T Consensus 1 ~~-~~~l~~kw~p~l~~--------------~~~~~i~~~~~~--~~~a~l~enq~~~~---~~~~~--------~~~-- 50 (534) T protein:vir:10 1 MS-KKSLLKKWQPLVES--------------EGMPAIASMKRK--DIVARIFENQDEDI---AHNEG--------GVY-- 50 (534) T ss_pred Cc-hhHHHHHhHHhhcC--------------Cccccccchhhh--hhhhhhhhhHHHHH---hhhcc--------ccc-- Confidence 43 34455555444221 111111111110 011111 0111111 00000 000 Q ss_pred hhhhhhHHHHHHHHHHHHH--hhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHh---hhhhhhhc Q lcl|NC_012784. 80 ARTYRNQANINDLGISIQN--TKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKE---VEFNLDKY 154 (415) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~---~~~~l~~~ 154 (415) +.......+...... .....+... ..-+.-....... +.+.+.+ ..+.+.++.++| +.....++ T Consensus 51 ----~~~~~~~~~~~~~~~~~~~~l~ea~~---~~~~g~~~~~ia~-s~~s~~v---~~~~P~Li~lvRra~p~LIa~DI 119 (534) T protein:vir:10 51 ----TDQVVVNSMVDVKGRIEEARLAEANI---GGDHGYDATKIAS-GETSGSI---TNVGPAVMGLVRRAIPQLIAFDI 119 (534) T ss_pred ----chhhhhhhhhccccchhhcccccccc---ccccccccccccc-ccccccc---ccccchhhhHHHHHHHhhhhhhh Confidence 000000000000000 000000000 0000000000000 1111111 123333333333 34455578 Q ss_pred ceeEEccCCceeEEEEe-----ecCC------------ccccccccc--------------------------------- Q lcl|NC_012784. 155 VTVKRVTNGSGKYPVVR-----QSEV------------AALEKVEEL--------------------------------- 184 (415) Q Consensus 155 ~~~~~~~~~~~~~~~~~-----~~~~------------~~a~~v~Eg--------------------------------- 184 (415) +.++||+++++-+=-.+ .... +.+.+-+.+ T Consensus 120 wGVQPMTgPTGLIFAMRsrY~n~~~~~s~~EAf~ne~~adt~fSG~~~a~~~~~~~~~~a~~~g~~~~~~~~~~t~~~~G 199 (534) T protein:vir:10 120 CGVQPMTSSTGQVFTLRAIYGGNSQDANAREAFHPTYGPDADFSGRGAAQDIAVFVRGTAVASGAFAKLHIEAATGVQAG 199 (534) T ss_pred heeccCCchhhhheeeeeeecCCCCCcccccccccccccccccccccccccccccccccccccccccccccccccccccc Confidence 88999988876532122 1100 000000000 Q ss_pred -----------------------------------------------ccc---cccccccceeeEeeeeeEEE------- Q lcl|NC_012784. 185 -----------------------------------------------EEN---PELAVKPFFQLAYDINTHRG------- 207 (415) Q Consensus 185 -----------------------------------------------~~~---~~~~~~~f~~v~~~~~k~a~------- 207 (415) +.. ...++..|.+..+...|..+ T Consensus 200 t~~~~~~~~~~v~~~~~~~~~ag~~~~~~~~~~~~y~~~~gm~Ta~AE~lg~~ggs~~~~f~EMsFsIdKvtVtAKSRaL 279 (534) T protein:vir:10 200 TKTVQFIKDYAVDALPADQTEAGLAYKWLLANGYAVETSSAMATAFAELQQGFNGSADNEWNEMSFRIDKQVVEAKSRQL 279 (534) T ss_pred ccccccccccccccccCCccccccccccccccccceecccccchhhHhhhccCCCCcccchhhcceEEEEEEEeeeccce Confidence 000 00111234555555555544 Q ss_pred eehhhHHHHhcc----hHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccc---ccccccccc-------chhh Q lcl|NC_012784. 208 YFRISREAIEDA----KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFE---KEGKKLEVK-------KAKS 273 (415) Q Consensus 208 ~~~iS~e~l~ds----~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~---~~~~~~~~~-------~~~~ 273 (415) ...+|-||.+|- .+|.+++|.+-|+..|...++++||.-.-+....+...+.. ......... +-.. T Consensus 280 KAEYTiELAQDLKAIHGLDAEtELsNILSTEImlEINReii~~l~~~a~~~k~~~~~~~~~~~G~~d~~~~~~~~~~~~~ 359 (534) T protein:vir:10 280 KAQYSIEMAQDLRAVHGLDADSELSSILANEIMHEINREMVLWINATAKVGKTGWTNMHGGKAGVFDFQDTKDIRGARWA 359 (534) T ss_pred eccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhhhheeecccccccccccceeeeeccccccchhHH Confidence 447999999983 45789999999999999999999986533322111111100 000111100 1111 Q ss_pred HHHHHHHHHHhh---------hhccCCCEEEEcHHHHHHHHHh---h--ccCCccc-ccCcccCC-CCceec-ceeeEEe Q lcl|NC_012784. 274 LDDIKDAINLNV---------KPNYEHNVAIVSQTMFAKLDKM---K--DKLGNYL-IQPDVKEK-TQQRLL-GAKIEIL 336 (415) Q Consensus 274 ~~~~~~~~~~~~---------~~~~~~~~~v~~~~~~~~l~~l---k--d~~G~~l-~~~~~~~~-~~~~l~-G~pV~~~ 336 (415) .+-+..++.++. +.....+.+++++.....|+.. . -..|-.. ...+.+.. ..++|. |++|.+. T Consensus 360 ~e~~~~L~~~i~~~an~i~~~T~rg~~n~~v~S~~Va~~L~~~g~l~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D 439 (534) T protein:vir:10 360 GESYKALVVQIDKEANEIARQTGRGQGNFIICSRNVAAALGHTDMLMTPAVMGANTTMNTDTTSSLFAGVLAGKYRVYID 439 (534) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccccccEEEEchhHHHHHhhccchhccccccccccccccCCCceEEEEecCceEEEec Confidence 222222322222 1223577899999999988651 1 0011111 01111111 123444 5788877 Q ss_pred ccccccccCCceEEEechh-----hcEEEEee-cceEEEEeecccCceEEEEEEEeccEEeccccEEEEEeecCC----- Q lcl|NC_012784. 337 PDEVLGQKGNNTLIIGNLK-----DAIVLFDR-SQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIEYDDSE----- 405 (415) Q Consensus 337 ~~~~~~~~~~~~~~~gd~~-----~~~~~~~~-~~~~i~~~~~~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~~----- 405 (415) .+++. ..+++|.=- ........ ....+...|...|+-.+-...|+++.+ +|=+ ... +... T Consensus 440 ~y~~~-----dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP~~--~~~-~~~~~~~i~ 510 (534) T protein:vir:10 440 QYAVE-----DYFTVGYKGASEMDAGLYYCPYVALTPLRGTDPKNFQPVLGFKTRYGVKL-HPMA--DAT-QNKGFAKIS 510 (534) T ss_pred CCCCc-----ceEEEEEeCCcccccceeeccccccccccccCCccccceeeeeeeeceee-cCcc--ccc-CCccccccc Confidence 77653 234444200 00000000 001111234455555566667777654 3411 100 0000 Q ss_pred CCcccccccC Q lcl|NC_012784. 406 RGEGDLGLEA 415 (415) Q Consensus 406 ~~~~~~~~~~ 415 (415) .|-++-.-.| T Consensus 511 ~g~~~~~~~a 520 (534) T protein:vir:10 511 NGMPQHTNMF 520 (534) T ss_pred cCCcchhhhc Confidence 0001111222 No 250 >protein:vir:104915 Length: 470 # NCBI annotation: T4-like major capsid protein # Family: family:all:364 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214367;genbank:gi:61806007;genbank:GeneID:3294435 Probab=48.31 E-value=0.67 Score=21.52 Aligned_cols=351 Identities=15% Similarity=0.082 Sum_probs=126.1 Q ss_pred HHHHH-HHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccchhhhhhHHHHHHH Q lcl|NC_012784. 14 IKRQI-DLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQANINDL 92 (415) Q Consensus 14 l~~~~-~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 92 (415) ++-.. +...++=.-.++.+...++... -.+++.++|-+-++ T Consensus 1 ~~~~~~e~l~~kw~p~l~~~~~~~i~~~--~~~~v~a~l~enq~------------------------------------ 42 (470) T protein:vir:10 1 MQMFNSEYLQEKWAPILDYDGLDPIKDS--HRRSVTAVLLENQE------------------------------------ 42 (470) T ss_pred CCcchhHHHHHhhhhhhcCCccchhcch--hhhhhhhhhhhhhH------------------------------------ Confidence 00000 0000000001110000000000 00000111100000 Q ss_pred HHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHH---HHHHhhhhhhhhcceeEEccCCceeEEE Q lcl|NC_012784. 93 GISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDI---LKLKEVEFNLDKYVTVKRVTNGSGKYPV 169 (415) Q Consensus 93 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~I---i~~~~~~~~l~~~~~~~~~~~~~~~~~~ 169 (415) +.......-..+.......... .......+.+.+.+ ..+.+.+ ++.........+++.++||+++++-+=- T Consensus 43 -~~~~~~~~~l~e~~~~~~~~~~--~~~~i~~st~t~~v---~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFA 116 (470) T protein:vir:10 43 -KELREERNFLSEAPNVNTNSGA--TAGFSADATAAGPV---AGFDPVLISLIRRSMPNLVAYDLAGVQPMNGPTGLIFA 116 (470) T ss_pred -HHHhhccchhhhhhhccccccc--cccccccccccccc---cccCchhhhhHHHHHhhhhhhhhheeecCCccceeeeE Confidence 0000000000000000000000 00000001111111 1222233 3333344455678999999999876532 Q ss_pred Ee-----ecCCc------cccccccc-------------------------cc--------------------------- Q lcl|NC_012784. 170 VR-----QSEVA------ALEKVEEL-------------------------EE--------------------------- 186 (415) Q Consensus 170 ~~-----~~~~~------~a~~v~Eg-------------------------~~--------------------------- 186 (415) .+ ..+.. ...+-+.+ +. T Consensus 117 mRsrY~n~sG~EaffnEA~T~fSG~~~~~~~~~~~~~~~a~~~g~~~~~~~gt~~~~~~~~~~~a~~~~y~~~~GMsTa~ 196 (470) T protein:vir:10 117 MRSRYKTQSGTEALFNEADTAFSGQPDGLDDTSGFTATGANNVGLGTTAQQGSNPGLLNSTAAQTNATDYNVGQGMRTDS 196 (470) T ss_pred EEEEecCCCccceeeecCCcccCcccccccccccccccccccccccccccccccccccccccccccccccccccccchHH Confidence 22 11000 00010000 00 Q ss_pred ---ccccccccceeeEeeeeeEEEe-------ehhhHHHHhcc----hHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Q lcl|NC_012784. 187 ---NPELAVKPFFQLAYDINTHRGY-------FRISREAIEDA----KVNVLQELKLWMARTIAATRNKAIIDVITKGST 252 (415) Q Consensus 187 ---~~~~~~~~f~~v~~~~~k~a~~-------~~iS~e~l~ds----~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~ 252 (415) ....+...|.+..+...|..+- ..+|-||.+|- .+|.+++|.+-|+..|...+++.||.-.-+... T Consensus 197 aE~lg~s~~~~f~EMaFsIeK~tVtAKSRaLKAeYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~~a~ 276 (470) T protein:vir:10 197 AEDLGDGTGDQFNQMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSTEILAEINREVIRTIYNVAE 276 (470) T ss_pred hhhcCCCCCcccceeeeEEEEEEEEeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhhh Confidence 0001123366666666666554 47899999983 457899999999999999999999976443332 Q ss_pred ccccccccccccc---ccccchhhHHHHHHHHHHhh---------hhccCCCEEEEcHHHHHHHHHh--hccCC---ccc Q lcl|NC_012784. 253 GSTSSGFEKEGKK---LEVKKAKSLDDIKDAINLNV---------KPNYEHNVAIVSQTMFAKLDKM--KDKLG---NYL 315 (415) Q Consensus 253 ~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~v~~~~~~~~l~~l--kd~~G---~~l 315 (415) .+...+....+.- ....+-.....+..++.++. +.-...+.+++++.....|... -+... ..+ T Consensus 277 ~~k~~~~~~~Gv~Dl~~~~~gr~~~e~~~~l~~~i~~ean~i~~~t~r~~~n~~i~S~~Va~~La~sG~l~~~~~~~~~~ 356 (470) T protein:vir:10 277 PGAQANVAAAGTFDLDTDSNGRWSVEKFKGLIFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANL 356 (470) T ss_pred hceeccccccceEEeecccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchhHHhHhhhcccccccccccccc Confidence 2222222111110 11111122333333333332 2234467899999998888531 01000 000 Q ss_pred ccCcccCCCCceec-ceeeEEecccccc-ccCCceEEEechhh-----cEEEEeecceE-EEEeecccCceEEEEEEEec Q lcl|NC_012784. 316 IQPDVKEKTQQRLL-GAKIEILPDEVLG-QKGNNTLIIGNLKD-----AIVLFDRSQYQ-ASWTDYMHFGECLMIAVRQD 387 (415) Q Consensus 316 ~~~~~~~~~~~~l~-G~pV~~~~~~~~~-~~~~~~~~~gd~~~-----~~~~~~~~~~~-i~~~~~~~~~~~~~~~~r~d 387 (415) -.++...-..++|. |++|.+..++..+ .+....+++|.=-. .......-.+. ....|...|+-.+-...|++ T Consensus 357 ~~D~t~~~~~G~l~~~~~vy~d~y~~~~~~a~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~ 436 (470) T protein:vir:10 357 NVDDTGNTFAGILQGKYRVYIDPFSASGGAAATQYYVVGYKGSSPYDAGLFYCPYVPLQMVRAVGQDTFQPKIGFKTRYG 436 (470) T ss_pred ccCCCCceEEEEecCceEEEeeccccccCcccccEEEEEEecCcceecceeeccccccccCCCCCCccccceeeeeeeec Confidence 00110000123443 5777777654322 22223444442000 00000000000 01123345555555556766 Q ss_pred cEEeccccEEEEEeecCCCCcccccccC Q lcl|NC_012784. 388 CRILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) Q Consensus 388 ~~v~~p~a~~~~~~t~~~~~~~~~~~~~ 415 (415) +.+ +|-+ ...+.+.+-+.--. T Consensus 437 l~~-NP~~------~~~~~~~~~i~~~~ 457 (470) T protein:vir:10 437 LVE-NPFS------QGTTQGLGTLTRNS 457 (470) T ss_pred eee-cCcc------cCCCcccccccCCC Confidence 543 3321 01112222111111 No 251 >protein:vir:5670 Length: 514 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899609;genbank:gi:34419596;genbank:GeneID:2546039 Probab=46.56 E-value=0.73 Score=21.33 Aligned_cols=355 Identities=13% Similarity=0.099 Sum_probs=122.0 Q ss_pred HHHHHHHHHhhchH--HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhhhhhccccccccchhhhhhHHHHHHHHHH Q lcl|NC_012784. 19 DLKVKYATRALNND--ELEKAEKLEQEITDLRSQIQE-KQEELDKLKEKDGTSENNQQSVEVNEARTYRNQANINDLGIS 95 (415) Q Consensus 19 ~~~~~~~~~~~~e~--~~~~~~~~~~e~~~l~~~i~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 95 (415) -..+++=.-.++.+ +..++.... .+++-++|=+ +++.+ ... ...+.......+... T Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~--~~~~~~~l~enq~~~~--------~~~-----------~~~~~~~~~~~~~~~ 59 (514) T protein:vir:56 1 MNLTEKWKDLLEAEGADMPEIATAT--KQKIMSKIFENQDRDI--------NND-----------PMYRDPQLVEAFNAG 59 (514) T ss_pred CchhhhhhHHhcccccccccccchh--hhhhhhhhhhhHHHHH--------hcC-----------Ccccchhhhhhhhcc Confidence 00000000111100 000110000 0000111100 00000 000 000000000000000 Q ss_pred HHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHh---hhhhhhhcceeEEccCCceeE----- Q lcl|NC_012784. 96 IQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKE---VEFNLDKYVTVKRVTNGSGKY----- 167 (415) Q Consensus 96 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~---~~~~l~~~~~~~~~~~~~~~~----- 167 (415) ......... +..........+. .+++ ..+.+.++.++| ......+++.++||+++++-+ T Consensus 60 l~e~~~~~~---------~~~~~~~ia~s~~-t~~v---~~~~P~ll~lvRRa~~~LIa~DIwGVQPMTgPTGLIFAMRs 126 (514) T protein:vir:56 60 LNEAVVNGD---------HGYDPANIAQGVT-TGAV---TNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRS 126 (514) T ss_pred ccccccccc---------ccccccccccccc-cccc---cccchhHHHHHHHHHHhhhhhhhheeccCCchhhhheeeee Confidence 000000000 0000000001111 1111 123333333433 444556788899998887643 Q ss_pred EEEeec-CCcc---------ccccc------------------------------------------------------- Q lcl|NC_012784. 168 PVVRQS-EVAA---------LEKVE------------------------------------------------------- 182 (415) Q Consensus 168 ~~~~~~-~~~~---------a~~v~------------------------------------------------------- 182 (415) .+.... ++.. +.+-+ T Consensus 127 rY~~~~~tg~EAf~~~nEadt~fSG~~~~~~~~~~~~~~~~~~G~~~~~~~t~~~gd~~~~~~~~~~~~~~~~~~~~~~t 206 (514) T protein:vir:56 127 VYGKDPLTGAEAFHPTRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTAT 206 (514) T ss_pred eecCCCcccccccccccccCcCcccccccccccccccccccccccccccccccccccccccccccccccccccccccccc Confidence 111110 0000 00000 Q ss_pred ---------------cc------cc---ccccccccceeeEeeeeeEEEe-------ehhhHHHHhcc----hHHHHHHH Q lcl|NC_012784. 183 ---------------EL------EE---NPELAVKPFFQLAYDINTHRGY-------FRISREAIEDA----KVNVLQEL 227 (415) Q Consensus 183 ---------------Eg------~~---~~~~~~~~f~~v~~~~~k~a~~-------~~iS~e~l~ds----~~~l~~~l 227 (415) +| +. ....+...|.+..+...|..+- ..+|-||.+|- .+|.+++| T Consensus 207 ~~~~~~a~~~~y~~~~Gm~Ta~aEal~~lggs~~~~f~EMaFsIdK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtEL 286 (514) T protein:vir:56 207 EYTDGVAGGLLVEIDAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAEL 286 (514) T ss_pred ccccccccchhhhhhhhhhhhhhhhcccCCCCcccccceeeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHH Confidence 00 00 0001122355666666666544 47999999983 45789999 Q ss_pred HHHHHHHHHHHHHHHHhhcc---ccccccccccccccccc-c----cccc-chhhHHHHHHHHHHhh---------hhcc Q lcl|NC_012784. 228 KLWMARTIAATRNKAIIDVI---TKGSTGSTSSGFEKEGK-K----LEVK-KAKSLDDIKDAINLNV---------KPNY 289 (415) Q Consensus 228 ~~~la~~~~~~~d~~il~g~---g~~~~~~~~~~~~~~~~-~----~~~~-~~~~~~~~~~~~~~~~---------~~~~ 289 (415) .+-|+..|...++++||.-. -+-.......+....+. . ..+. +-....-+..++.++. +... T Consensus 287 sNILSTEImlEINReii~~l~~~atv~~~~~~~~~~~~G~~d~~~~~d~~~~~~~~e~~~~l~~~i~~~an~i~~~T~rg 366 (514) T protein:vir:56 287 SGILANEVMVELNREIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRG 366 (514) T ss_pred HHHHHHHHHHHhhHHHHHHHHhheeehhcccccccccccccccccccccccchHHHHHHHHHHHHHHHHHHHHHhhcccc Confidence 99999999999999996322 11110000000000000 0 0000 1112222222222222 1234 Q ss_pred CCCEEEEcHHHHHHHHH---hhc--cCC--cccccCccc-CCCCcee-cceeeEEeccccccccCCceEEEechhh---- Q lcl|NC_012784. 290 EHNVAIVSQTMFAKLDK---MKD--KLG--NYLIQPDVK-EKTQQRL-LGAKIEILPDEVLGQKGNNTLIIGNLKD---- 356 (415) Q Consensus 290 ~~~~~v~~~~~~~~l~~---lkd--~~G--~~l~~~~~~-~~~~~~l-~G~pV~~~~~~~~~~~~~~~~~~gd~~~---- 356 (415) ..+.+++++.....|.. |.- +.| +--+..+.. .-..+.| .|++|.+..+++. ..+++|.=-. T Consensus 367 ~gn~~i~S~~Va~~L~~sg~l~~~~~~g~~~~~~~~d~~~~~~aG~l~~~~~vy~D~y~~~-----dy~~vG~KG~~~~~ 441 (514) T protein:vir:56 367 NGNFIIASRNVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVN-----DYFTVGFKGSTEMD 441 (514) T ss_pred cccEEEEchhHHHHHHhhhhhccccccCccccccccccCcceEEEEecCceEEEecCCCCc-----ceEEEEEecCccee Confidence 57889999999998874 200 111 000000111 0001333 4678887777653 2344442000 Q ss_pred -cEEEEeecce-EEEEeecccCceEEEEEEEeccEEeccccEEEEEeecCCCCcccccccC Q lcl|NC_012784. 357 -AIVLFDRSQY-QASWTDYMHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) Q Consensus 357 -~~~~~~~~~~-~i~~~~~~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~~~~~~~~~~~~ 415 (415) .......-.+ .+...|...|+-.+-...|+++.+ +| |.-.+-.....+.+.-+... T Consensus 442 ~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP--y~~~~~~~~~~~~~~~~~a~ 499 (514) T protein:vir:56 442 AGVFYSPYVPLTPLRGSDSKNFQPVIGFKTRYGVQV-NP--FADPTASATKVGNGAPVAAS 499 (514) T ss_pred cceeeccccccccccccCCccccceeeeeeeeceee-CC--CCCccccccccCCcchhhhc Confidence 0000000000 011133445555556666776654 34 11000000011111111111 No 252 >protein:vir:348 Length: 321 # NCBI annotation: major virion structural protein # Family: family:all:3198 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203462;genbank:gi:15320618;genbank:GeneID:921734 Probab=46.32 E-value=0.74 Score=21.30 Aligned_cols=278 Identities=9% Similarity=-0.011 Sum_probs=117.3 Q ss_pred hhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhc----ceeEEccCCceeEEEEee-c Q lcl|NC_012784. 99 TKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKY----VTVKRVTNGSGKYPVVRQ-S 173 (415) Q Consensus 99 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~----~~~~~~~~~~~~~~~~~~-~ 173 (415) ... ..+.. +.... . ...+..+...+-..++++.. ..+.+.. +..++..+.. . T Consensus 1 mp~-----~~lse-l~t~t------l----------~~rs~~~~D~v~~~n~LL~~L~~kG~~~~~~-gg~~I~~~l~y~ 57 (321) T protein:vir:34 1 MPF-----PNISD-IITTT------I----------ESRSGVIADNVTKNNAILARLAKRGKPRLVS-GGYTILEELSFS 57 (321) T ss_pred CCC-----chHHH-HHHHH------H----------HhhcchhhhhhhcccHHHHHHHhcCcccccC-CCeeEEEEEeec Confidence 000 00000 00000 0 01111122222222222221 1222332 3234433333 3 Q ss_pred CCcccccccccccccccccccceeeEeeeeeEEEeehhhH-HHHhcc----hHHHHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_012784. 174 EVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISR-EAIEDA----KVNVLQELKLWMARTIAATRNKAIIDVIT 248 (415) Q Consensus 174 ~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~-e~l~ds----~~~l~~~l~~~la~~~~~~~d~~il~g~g 248 (415) .+..+.|..-.+..+-.....|...++..+.+++-+.||- |+++.+ .++|...=.+...+.+...++..+. .+| T Consensus 58 ~~s~~~wy~Gyd~l~~~p~d~~~~Aef~wk~aa~~~~isg~e~l~n~g~~~~idll~~~~~~ae~t~~n~l~~~l~-sdG 136 (321) T protein:vir:34 58 GNSNGGWYSGYDVLPTAPQDVISSAEYALKQYAVPVVISGLEMLQNSGKEAQLDLLEARMNVAEATMANDISAALY-GDG 136 (321) T ss_pred cCcceeEEEeeeeeccchhhhccccccchhheeEeeEEehhHHhhccchHHHHHHHHHHHHHHHHHHHhhhhHhhh-ccc Confidence 3566777644444443344678999999999999888883 444432 2344444445555677777777754 444 Q ss_pred cc--cccccccccc-----cccc---------------cccccchhhHHHHHHHHHH----hhhhccCCCEEEEcHHHHH Q lcl|NC_012784. 249 KG--STGSTSSGFE-----KEGK---------------KLEVKKAKSLDDIKDAINL----NVKPNYEHNVAIVSQTMFA 302 (415) Q Consensus 249 ~~--~~~~~~~~~~-----~~~~---------------~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~v~~~~~~~ 302 (415) ++ .......... +++. .+...++.+...+...+.. +......|+.|++....|. T Consensus 137 Ta~g~~~i~GL~~lv~~~p~tGtvGGIdra~~~~WRn~~~d~~~~~t~~tl~~~m~~~w~~~~Rg~~~PDlii~~~~~y~ 216 (321) T protein:vir:34 137 TAFGGRAINGLDGAVPVDPTVGTYGGINRALWPFWRSQVEDMAAVATINTIQPAMTKLWSRCVRGADMPDLIMSGNDAWT 216 (321) T ss_pred cccccchhhhhhhhcccCCCCceeccccccchhhhhhhhhhhhhcccHHHHHHHHHHHHHhhccCCCCccEEEechHHHH Confidence 43 3222111111 1111 1112222233333333332 2334456789999999998 Q ss_pred HHHHhhccCCcccccCcccCCC-CceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEe---ec-ccCc Q lcl|NC_012784. 303 KLDKMKDKLGNYLIQPDVKEKT-QQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWT---DY-MHFG 377 (415) Q Consensus 303 ~l~~lkd~~G~~l~~~~~~~~~-~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~---~~-~~~~ 377 (415) ..+.---..-|+--..+...|. .-.+.|.-|+..+..-.. +....++|-|-+ |..+....+-.+... +. ..++ T Consensus 217 ~y~~s~q~~qR~~~~~~a~~Gf~~Lky~~~div~D~~~g~~-~pan~~yfiNT~-yl~~r~h~~~~~~pi~p~r~~~~Nq 294 (321) T protein:vir:34 217 TYSNSLQVLQRFTSAEEANLGFRSLKFLSTDVVLDGGIGGF-AGANTMYFLNTK-YLHFRPHKDRNMVPLSPSRRAAFNQ 294 (321) T ss_pred HHHHhhheeeeecccccccccceeeeeeeEEEEEeCCCCCC-ccccceeeeecc-eEEEEEcCCCceeecCcccccccch Confidence 8776443433443222211111 124677777777643211 122246777654 455554333222211 11 1233 Q ss_pred eEEEEEE--EeccEEeccccEEEEEeecC Q lcl|NC_012784. 378 ECLMIAV--RQDCRILDYKSAIVIEYDDS 404 (415) Q Consensus 378 ~~~~~~~--r~d~~v~~p~a~~~~~~t~~ 404 (415) ..+.-.. +-.+.+-+|.+=.+ +.+. T Consensus 295 dA~~q~I~~~GnL~~sn~~~~~v--L~~~ 321 (321) T protein:vir:34 295 DAEAQILAWAGNLTCSGAQFQGR--LIAE 321 (321) T ss_pred hHHhhhhhhhheeeeecccceeE--EeeC Confidence 2222222 22222223332222 2222 No 253 >protein:vir:104549 Length: 462 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214669;genbank:gi:61806310;genbank:GeneID:3294604 Probab=40.76 E-value=0.96 Score=20.69 Aligned_cols=339 Identities=14% Similarity=0.084 Sum_probs=124.2 Q ss_pred hchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccchhhhhh-HHHHHHHHHHHHHhhhhhHHHH Q lcl|NC_012784. 29 LNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRN-QANINDLGISIQNTKVTSQEVR 107 (415) Q Consensus 29 ~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 107 (415) .+- +.|.+...- ..+....++........- ..-........ .+.. T Consensus 1 ms~------~~l~~~w~~---------------------~l~~~~~~~i~~~~~~~~~~~~~enq~~~~-------~~~~ 46 (462) T protein:vir:10 1 MSI------QQLQEKWAP---------------------VLNHESVPEIKDSYKKGVVAQLLENQENAI-------REEG 46 (462) T ss_pred Cch------HHHHHHhhh---------------------hhcccccchhhhhhHHHHHHHHhhhHHHHH-------Hhcc Confidence 000 011111100 000000000000000000 00000000000 0000 Q ss_pred HHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHh---hhhhhhhcceeEEccCCceeE-----EEEeec-----C Q lcl|NC_012784. 108 DFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKE---VEFNLDKYVTVKRVTNGSGKY-----PVVRQS-----E 174 (415) Q Consensus 108 ~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~---~~~~l~~~~~~~~~~~~~~~~-----~~~~~~-----~ 174 (415) .+.... .........+...+.+ ..+.+.++.++| +.....+++.++||+++++-+ .+.... + T Consensus 47 ~~l~ea--~~~~g~~~~~~~t~~~---~~~~P~Li~l~Rra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~~~~~~~nq~ 121 (462) T protein:vir:10 47 QVLNET--LQTTGYTTGDTATGPV---AGFDPVLISLIRRSMPQLIAYDVAGVQPMTGPTGLIFAMRSFYGSERRPANSD 121 (462) T ss_pred cchhcc--ccccCCCcCccccccc---ccccchhhhHHHHHHhhhhhhcceeeecCCcchhhhheeeeeccCCccccccc Confidence 000000 0000000000001110 113333343433 444556788899998887643 211100 0 Q ss_pred Cccc-------cc-------------------------------------------c------ccccccc-cccccccee Q lcl|NC_012784. 175 VAAL-------EK-------------------------------------------V------EELEENP-ELAVKPFFQ 197 (415) Q Consensus 175 ~~~a-------~~-------------------------------------------v------~Eg~~~~-~~~~~~f~~ 197 (415) +..+ .+ . +.++... ..++..|.+ T Consensus 122 gtEAlfnEadt~fSg~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~g~~~~~~~~~GM~Ta~aE~lg~~s~n~~f~E 201 (462) T protein:vir:10 122 FREALFNEPNAGFSGGAGTGLSNYDPTASSSAVNDAEGANPGLLNDSPAGTYEVTGDATGMATATAEALDDSSASTAFRE 201 (462) T ss_pred cchhhhccCCcCccccccccccccccccccccccccccccceeecCCCccceecccccccccchhccccCCccCCcchhh Confidence 0000 00 0 0000000 011234556 Q ss_pred eEeeeeeEEE-------eehhhHHHHhcc----hHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccc-- Q lcl|NC_012784. 198 LAYDINTHRG-------YFRISREAIEDA----KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGK-- 264 (415) Q Consensus 198 v~~~~~k~a~-------~~~iS~e~l~ds----~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~-- 264 (415) ..+...|..+ ...+|-||.+|- .+|.++.|.+-|+..|...+++.||.-.-+....+...++...+. T Consensus 202 MaFsIeK~tVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILSTEImlEINReii~~l~~~a~~~k~~~~~~~Gv~d 281 (462) T protein:vir:10 202 MGFSIEKVTVTAKSRALKAEYSIEMAQDLKAIHGLDAESELANILSTEILAEINREVVRTIYVNAVKGAIANTATDGIFD 281 (462) T ss_pred ceeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHhhhhhhheeeecccccccceee Confidence 6666665554 447999999983 457899999999999999999999965433322211111111110 Q ss_pred -cccccchhhHHHHHHHHHHhh---------hhccCCCEEEEcHHHHHHHHHh---hcc---CCcc-cc-cCcccCCCCc Q lcl|NC_012784. 265 -KLEVKKAKSLDDIKDAINLNV---------KPNYEHNVAIVSQTMFAKLDKM---KDK---LGNY-LI-QPDVKEKTQQ 326 (415) Q Consensus 265 -~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~v~~~~~~~~l~~l---kd~---~G~~-l~-~~~~~~~~~~ 326 (415) .....+-..+.....++.++. .--...+.+++++.....|... +-. +++. +. .++......+ T Consensus 282 l~~~~~gr~~~e~~k~l~~qi~~ean~i~~~t~r~~~n~~i~S~~Va~~La~sG~l~~~p~~~~~~~~~~~d~~~~~~~G 361 (462) T protein:vir:10 282 LDVDSNGRWSVEKFKGLLFQIERDSNAIGQETRRGKGNILICSADVASALGMAGVLDYAPGLQGNSALTGVDDTSSTLVG 361 (462) T ss_pred eccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccceEEEEchhHHHHhhhccchhccccccccccccccccccceeEE Confidence 111222334444444544442 2234567899999998888431 100 1111 10 1111112233 Q ss_pred eec-ceeeEEeccccccccCCceEEEechhh-----cEEEEee-cceEEEEeecccCceEEEEEEEeccEEeccccEEEE Q lcl|NC_012784. 327 RLL-GAKIEILPDEVLGQKGNNTLIIGNLKD-----AIVLFDR-SQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVI 399 (415) Q Consensus 327 ~l~-G~pV~~~~~~~~~~~~~~~~~~gd~~~-----~~~~~~~-~~~~i~~~~~~~~~~~~~~~~r~d~~v~~p~a~~~~ 399 (415) .|+ |++|.+..++.. .+....+++|.=-. ....... ........|...|+-.+-...|+++.+ +|= T Consensus 362 ~l~~r~~vy~D~Y~~~-ns~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP~----- 434 (462) T protein:vir:10 362 TLNGRIKVYVDPYSSN-VADKHFYVAGYKGTSPYDAGLFYCPYVPLQQVRAINPNTFQPKIGFKTRYGMVS-NPF----- 434 (462) T ss_pred EecCceEEEEecccCC-CcccceEEEEEeCCcccccceeeccccccccccccCCccccceeeeeeeeeeee-cCC----- Confidence 454 567777655421 12223444442000 0000000 000111223444555555555665543 221 Q ss_pred EeecCCC-CcccccccC Q lcl|NC_012784. 400 EYDDSER-GEGDLGLEA 415 (415) Q Consensus 400 ~~t~~~~-~~~~~~~~~ 415 (415) +.... .++-+..-. T Consensus 435 --t~~~~~~~~~~~~~~ 449 (462) T protein:vir:10 435 --SGGLTQGSGALTANA 449 (462) T ss_pred --CCCcCCccccccccC Confidence 11111 111111111 No 254 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=40.01 E-value=0.99 Score=20.61 Aligned_cols=349 Identities=12% Similarity=0.021 Sum_probs=109.4 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHH------hhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccc Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATR------ALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQS 74 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~------~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~ 74 (415) +|.++||++++.++.+++++..++... ...++..++++++..+++.+++++++.+................. . T Consensus 5 ~k~l~el~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 83 (395) T protein:vir:43 5 EKQIGELNASLKQVGDQIKSQAEQVNTQIANFGEMNKETRAKVDELLTAQGELQARLSAAEQAMLANEKRDGGEEAPK-T 83 (395) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccchhh-h Confidence 999999999999988887766554332 234455667788888998998888776665544332222111111 1 Q ss_pred ccccchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhc-ccccccceeecchhHHhHHHHHHhhhhhhhh Q lcl|NC_012784. 75 VEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGG-SLKTDSGFVVIPEEIVTDILKLKEVEFNLDK 153 (415) Q Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~ 153 (415) .. ...... .....+................. .......+ .....-..-++.......++..+-...++.. T Consensus 84 ~~--~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~~ 154 (395) T protein:vir:43 84 AG--QMVAES--LKEQGVTSSLRGSHRVSMPRSAI-----TSIDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGTTES 154 (395) T ss_pred HH--HHHHHH--HHHHHHHHHhhhhhhhhhhhhhh-----cccCCCCccccchhhHHHHHHHHHhhhhHHhhccceecCC Confidence 11 100000 01111111111111111111100 00000000 0111111111111111112211111112211 Q ss_pred cceeEEc-cCCceeEEEEeecCCcccccccccccccccccccceeeEeeeeeE-EE-eehhhHHHHhcchHHHHHHHHHH Q lcl|NC_012784. 154 YVTVKRV-TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTH-RG-YFRISREAIEDAKVNVLQELKLW 230 (415) Q Consensus 154 ~~~~~~~-~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~-a~-~~~iS~e~l~ds~~~l~~~l~~~ 230 (415) ....++. +.......+.. . .....|.. +..+..+|..-++...-. .. ++.-+.. +. . -+...|... T Consensus 155 ~~~~~~~~~~~~~~a~~v~--E---~~~~~~~~--~~~~~i~~~~~k~~~~~~is~ell~d~~~-l~--~-~v~~~la~a 223 (395) T protein:vir:43 155 NSVEYVRETGFVNNAAPVS--E---GTQKPYSD--LTFELENAPVRTIAHLFKASRQILDDASA-LQ--S-YIDARARYG 223 (395) T ss_pred CceEEEEEecCCCceeeec--C---Cccccccc--cceeEEEEeeeeEEEeehhhHHHHHhHHH-HH--H-HHHHHHHHH Confidence 1111222 22222222221 1 11223332 234445555444443211 11 1222222 22 2 267788888 Q ss_pred HHHHHHHHHHH---------HHhhcccccccccccccccc-----------ccccc--ccc-chhhHHHHHHHHHHhhhh Q lcl|NC_012784. 231 MARTIAATRNK---------AIIDVITKGSTGSTSSGFEK-----------EGKKL--EVK-KAKSLDDIKDAINLNVKP 287 (415) Q Consensus 231 la~~~~~~~d~---------~il~g~g~~~~~~~~~~~~~-----------~~~~~--~~~-~~~~~~~~~~~~~~~~~~ 287 (415) ++..+..++=. -|+++.+............. ..... ... -.....+ ...+..+.+. T Consensus 224 ~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~-~~~l~~lkd~ 302 (395) T protein:vir:43 224 LMLVEECQLLYGNGTGANLHGIIPQAQAYAPPSGVVVTAEQRIDRIRLAILQAQLAEFPASGIVLNPID-WALIELNKDA 302 (395) T ss_pred HHHHHHHHHHhccCCCCccccccccccccccccccccccchhHHHHHHHHHhhccccCCCcEEEEcHHH-HHHHHHhhcc Confidence 88888777642 34544332221111000000 00000 000 0001111 1222233322 Q ss_pred ccCCCEEEEcHHHHHHHHHhhccCCcccccCcccCCCCceecceeeEEeccccccccCCceEEEechhhcEE-------- Q lcl|NC_012784. 288 NYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIV-------- 359 (415) Q Consensus 288 ~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~-------- 359 (415) . ..++..+.....- ..=.|.|++..+.. +...-|.|-. . ..+++++....-. T Consensus 303 ~---G~~i~~~~~~~~~---~~l~G~pVv~~~~~-~~~~~~~gd~------------~-~~~~~~~~~~~~i~~~~~~~~ 362 (395) T protein:vir:43 303 E---NRYIIGSPQNGTT---PTLWRLPVVETQAI-TQDEFLTGAF------------S-LGAQIFDRMDIEVLVSTENDK 362 (395) T ss_pred C---CceeccccccCCC---ceecceeeEEcCCC-CCCcEEEEec------------c-ceEEEEEecceEEEEeccccc Confidence 2 1222211100000 00125555432211 1111121210 0 0011111111000 Q ss_pred EEeecceEEEEeecccCc-eEEEEEEEeccEEecccc Q lcl|NC_012784. 360 LFDRSQYQASWTDYMHFG-ECLMIAVRQDCRILDYKS 395 (415) Q Consensus 360 ~~~~~~~~i~~~~~~~~~-~~~~~~~r~d~~v~~p~a 395 (415) .+.+..+.+....+..+. ..--++.++.+ +.| T Consensus 363 ~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~----taa 395 (395) T protein:vir:43 363 DFENNMVTIRAEERLAFAVYRPEAFVTGSL----TAS 395 (395) T ss_pred hhhcCcEEEEEEEeeccEEecccceEEEEe----ccC Confidence 011111111111110000 00001111111 111 No 255 >protein:vir:103181 Length: 457 # NCBI annotation: gp135 # Family: family:all:364 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717802;genbank:gi:113200639;genbank:GeneID:4239190 Probab=38.54 E-value=1.1 Score=20.44 Aligned_cols=341 Identities=13% Similarity=0.078 Sum_probs=128.1 Q ss_pred hchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccchhhhhhHHHHHHHHHHHHHhhhhhHHHHH Q lcl|NC_012784. 29 LNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRD 108 (415) Q Consensus 29 ~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 108 (415) .+- +.+.+. =....+....++...... +..... ...+......+... T Consensus 1 m~~------~~l~~~---------------------w~~~l~~~~~~~i~~~~~---~~~~~~---~lenq~~~~~~~~~ 47 (457) T protein:vir:10 1 MSF------QNLQEK---------------------WAPVLEHDSLPEIGDSYK---KGVVAQ---LLENQEKAIAEEGK 47 (457) T ss_pred Cch------HHHHHH---------------------hhHhhccCccchhhhhHH---HHHHHH---HhhhHHHHHHhccc Confidence 000 001111 111111111001000000 000000 00000000000000 Q ss_pred HHHHHhhhhhhhhccc--ccccceeecchhHHhHHHHHHh---hhhhhhhcceeEEccCCceeEEEEe--e---cCC--- Q lcl|NC_012784. 109 FTEYLETRNDIQGGSL--KTDSGFVVIPEEIVTDILKLKE---VEFNLDKYVTVKRVTNGSGKYPVVR--Q---SEV--- 175 (415) Q Consensus 109 ~~~~~~~~~~~~~~~~--~~~~~~~~vP~~~~~~Ii~~~~---~~~~l~~~~~~~~~~~~~~~~~~~~--~---~~~--- 175 (415) +... . ....+.. +...+.+ ..+.+.++.+.| ......+++.++||+++++-+==.+ . .+. T Consensus 48 ~l~e---a-~~~~g~~~~s~~t~~v---~~~~P~Li~l~Rra~p~LIa~DIwGVQPmTgPTGLIFAmRsrY~~q~~~~~a 120 (457) T protein:vir:10 48 ILTE---T-LQTTGYTGGDTVTGPV---AGFDPVLISLIRRSMPQLIAYDIAGVQPMTGPTGLIFAMRTNYGAERNPAAA 120 (457) T ss_pred cccc---c-ccccCCCccccccccc---ccccchhhhhhHHHHhhhhhhhcceeecCCCcceeeeeeeeeecCccccccc Confidence 0000 0 0000000 1111111 223334444444 4445567888999998877542111 0 000 Q ss_pred --ccc-------cc--------------------------------------c------cccccccc-cccccceeeEee Q lcl|NC_012784. 176 --AAL-------EK--------------------------------------V------EELEENPE-LAVKPFFQLAYD 201 (415) Q Consensus 176 --~~a-------~~--------------------------------------v------~Eg~~~~~-~~~~~f~~v~~~ 201 (415) ..+ .+ . ++++...+ .++..|.+..+. T Consensus 121 ~~~EAl~nEadt~fSg~~~~~~~~~~~~~~~~~gt~~~~~~~~~~~~~~~~~~~~gmsTA~aE~lgd~~~n~~f~EMaFs 200 (457) T protein:vir:10 121 GYDEAFFNEPNAGFSGGPGAYDPGATGVTNDAEGTNPALLNDSPAGTYEQADDATGMSTATVEALDDSTANTAFREMGFS 200 (457) T ss_pred cccceeeeccCcccCcccccccccccccccccccccccccCccccccccccccccchhhhhhhccCCCCCccchhhheeE Confidence 000 00 0 00000000 111235555555 Q ss_pred eeeEEE-------eehhhHHHHhcc----hHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccc---ccc Q lcl|NC_012784. 202 INTHRG-------YFRISREAIEDA----KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGK---KLE 267 (415) Q Consensus 202 ~~k~a~-------~~~iS~e~l~ds----~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~---~~~ 267 (415) ..|..+ ...+|-||.+|- .+|.++.|.+-|+..|...+++.||.-.-+....+...++...+. ... T Consensus 201 IeK~tVtAKSRaLKAEYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~~a~~~~~~~~~~~gv~dl~~~ 280 (457) T protein:vir:10 201 IEKVTVTARARALKAEYSIEMAQDLKAIHGLDAEQELANILSTEILAEINREVVRTIYTNAVAGAQNNTATAGVFDLDVD 280 (457) T ss_pred EEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhhhheeeeccccccceeeeeecc Confidence 555544 447999999983 457899999999999999999999965333222111111111111 011 Q ss_pred ccchhhHHHHHHHHHHh---------hhhccCCCEEEEcHHHHHHHHH--h-hcc---CCc--ccccCcccCCCCceec- Q lcl|NC_012784. 268 VKKAKSLDDIKDAINLN---------VKPNYEHNVAIVSQTMFAKLDK--M-KDK---LGN--YLIQPDVKEKTQQRLL- 329 (415) Q Consensus 268 ~~~~~~~~~~~~~~~~~---------~~~~~~~~~~v~~~~~~~~l~~--l-kd~---~G~--~l~~~~~~~~~~~~l~- 329 (415) ..+-...+..+.++.++ .+-.+..+.+++++.....|.. . .-+ +|+ ..-.++......++|+ T Consensus 281 ~~g~~~~e~~k~L~~~i~~ean~i~~~T~rg~gn~~i~S~~Va~~L~~sg~l~~~p~~~~~~~~~~~d~~~~~~~G~l~~ 360 (457) T protein:vir:10 281 SNGRWSVEKFKGLLFQIERDANAIGHQTRRGKGNILICSADVVSALGMAGVLDYTPALNGNNGLAGVDDTSSTLVGTLNG 360 (457) T ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchhHHHHHhhcccccccchhhccccccccccccceeEEEecC Confidence 11122223333332222 1234456789999999888876 1 111 111 0000112222234554 Q ss_pred ceeeEEeccccccccCCceEEEechhh------cEEEEeec-ceEEEEeecccCceEEEEEEEeccEEeccccEEEEEee Q lcl|NC_012784. 330 GAKIEILPDEVLGQKGNNTLIIGNLKD------AIVLFDRS-QYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIEYD 402 (415) Q Consensus 330 G~pV~~~~~~~~~~~~~~~~~~gd~~~------~~~~~~~~-~~~i~~~~~~~~~~~~~~~~r~d~~v~~p~a~~~~~~t 402 (415) |++|.+..++... .....+++|. +. .......- .......|...|+-.+-...|+++ .++|-..- ++ T Consensus 361 r~~vy~D~Ya~~n-s~~dy~~vG~-KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l-~~NP~~~~---~~ 434 (457) T protein:vir:10 361 RIKVYVDPYSANV-ADKHFYVAGY-KGTSPYDAGLFYCPYVPLQQVRAINPDTFQPKIGFKTRYGM-VSNPFAGG---LT 434 (457) T ss_pred CeEEEEecccccC-CccceEEEEE-eCCcceecceeecccccccccCccCCccccceeeeeeeeee-eecccccc---cc Confidence 5677776554311 1123444442 10 00000000 011112344566666667778888 66775321 11 Q ss_pred cCCC-CcccccccC Q lcl|NC_012784. 403 DSER-GEGDLGLEA 415 (415) Q Consensus 403 ~~~~-~~~~~~~~~ 415 (415) .... -.-+--.-| T Consensus 435 ~~~~~~~~~~n~~~ 448 (457) T protein:vir:10 435 QGSGALTVNANKYY 448 (457) T ss_pred cccccccccchhhc Confidence 1100 000000111 No 256 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=37.86 E-value=1.1 Score=20.36 Aligned_cols=357 Identities=11% Similarity=0.065 Sum_probs=92.1 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHH-hhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccc Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATR-ALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNE 79 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~-~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (415) |+.+.++++++.+..+++....++.+. .......+++++++.++++|++++++++++.+.+.+................ T Consensus 19 l~eL~e~~~~l~~~~~el~~~~ee~~~~e~~~~~~~~~~~l~~~i~~l~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 98 (397) T protein:vir:96 19 IDKLLSQRSDLEKQENDLERALEEAKTDEEISTVSDSADDLEKQVKDLDEKIAELQKEKQDLEDELAKAADPTDQKPKDG 98 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHH Confidence 666666666666666666555554332 1122334567888889999999988888776665544332222211111111 Q ss_pred hhh-hhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhc-ccccccceeecchhHHhHHHHHHhhhhhhhhccee Q lcl|NC_012784. 80 ART-YRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGG-SLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTV 157 (415) Q Consensus 80 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~ 157 (415) ... ................. .....................+ .....-....+...-...+...... .++...... T Consensus 99 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~~l~~~~~~-~~~~~~~~~ 176 (397) T protein:vir:96 99 EKRKMKKFKVTEEELAEKRSA-INAFVKSKGAEKRDGFTSVEGGALIPQELLQPQLEPKDIVDLSKYVRS-VPVNSASGK 176 (397) T ss_pred HHHHHHHHhhhhHHHHHHHHH-HHHHHHhhhhhhhhcccccccccchhHHHHHHHHHhhhhhhHHHhhhh-cccccccee Confidence 100 00000000000000000 0000000000000000000000 0000000001100001111111111 111111111 Q ss_pred EEccC-CceeEEEEeecCCcccccccccccccccccccceeeEeeee-eEEEeehhhHHHHhcchHHHHHHHHHHHHHHH Q lcl|NC_012784. 158 KRVTN-GSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDIN-THRGYFRISREAIEDAKVNVLQELKLWMARTI 235 (415) Q Consensus 158 ~~~~~-~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~-k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~ 235 (415) .++.. .+....+.. .+ ....+.. .+.....++..-++... ++.. --+.+...+-..+ +...|...++... T Consensus 177 ~~~~~~~~~~~~~~~--E~---~~~~~~~-~~~~~~i~~~~~~~~~~~~~s~-ell~ds~~~l~~~-i~~~l~~~~~~~~ 248 (397) T protein:vir:96 177 FPVISKSGSKMATVQ--QL---EKNPQLA-NPKMVEIDYSVATRRGYIPISQ-EMIDDASYDVTGL-IADEIQDQSLNTK 248 (397) T ss_pred EEEEeccCCcccccc--cc---ccccccc-cccccceeecHhHhhcchhhHH-HHHhhhHHHHHHH-HHHHHHHHHHHHH Confidence 12111 111111111 11 1111111 11122222221111100 0000 0011111111111 4555555565555 Q ss_pred HHHHHHHHhhccccccccccc--c--cccccc--ccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhc Q lcl|NC_012784. 236 AATRNKAIIDVITKGSTGSTS--S--GFEKEG--KKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKD 309 (415) Q Consensus 236 ~~~~d~~il~g~g~~~~~~~~--~--~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd 309 (415) ...+-...=.+..++...... . ...... ...-.....++ ..+..+.+.. ..+++.|.. .+ T Consensus 249 ~~~i~~g~g~~~~~~~~~~d~~~~~~~~~~~~~~~a~~v~n~~~~----~~l~~lkd~~---G~~~~~~~~-------~~ 314 (397) T protein:vir:96 249 NADIAAVLKTATAKSVVGVDGLKDLINKEIKKVYDVKLFISASMY----SELDKLKDKN---GRYLLQDSI-------TA 314 (397) T ss_pred HHHHhhcccccccccccchHHHHHHHHHhhhhhcCcEEEEcHHHH----HHHHHhhccC---CCeEeccCc-------cC Confidence 555544332333332221100 0 000000 00001111222 2333333221 223333321 22 Q ss_pred cC-----Cccccc-CcccCCCCceecce-eeEEeccccccccCCceEEEechhhcEEEEeecc---eEE-EEeecccCce Q lcl|NC_012784. 310 KL-----GNYLIQ-PDVKEKTQQRLLGA-KIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQ---YQA-SWTDYMHFGE 378 (415) Q Consensus 310 ~~-----G~~l~~-~~~~~~~~~~l~G~-pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~---~~i-~~~~~~~~~~ 378 (415) .. |+|+.. ++.. ++.-.|- ++++-+. ...+++++......-+.... ..+ .+.+++.... T Consensus 315 ~~~~~l~G~pv~~~~~~~---~~~~~~~~~~~~gd~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~d~~~~ 384 (397) T protein:vir:96 315 ASGKQLLGKEVVVLDDDV---IGKSVGNVVGFIGDA-------KAFASFFDRKQVSVSWVDNNIYGQLLAGIIRYDVKAT 384 (397) T ss_pred CCcccccccceEEecccc---cCCCCCceEEEEeeh-------hcceEeEeecceEEEEecccccceeEEEEEEEccEEe Confidence 22 444431 1100 0000111 1111110 00122233222110010000 000 0011111000 Q ss_pred EEEEEEEeccEEecccc Q lcl|NC_012784. 379 CLMIAVRQDCRILDYKS 395 (415) Q Consensus 379 ~~~~~~r~d~~v~~p~a 395 (415) .-.++...-+ +-| T Consensus 385 ~~~a~~~~~~----~~a 397 (397) T protein:vir:96 385 DKKAGFYVTF----TIG 397 (397) T ss_pred cccceEEEEe----ecC Confidence 0000111111 111 No 257 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=36.64 E-value=1.2 Score=20.23 Aligned_cols=380 Identities=9% Similarity=-0.001 Sum_probs=99.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhchHHHH-H-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccc---hhh Q lcl|NC_012784. 8 QSEISDIKRQIDLKVKYATRALNNDELE-K-AEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNE---ART 82 (415) Q Consensus 8 ~~~l~~l~~~~~~~~~~~~~~~~e~~~~-~-~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~ 82 (415) |.++.+|++++.++.++++...+..... . -++-.+++++|+.+++.+++++++++................. ... T Consensus 1 M~kl~~L~e~r~~l~~~~~~l~~~~~e~~~lt~ee~~~~~~l~~e~~~l~~~i~~~e~~e~~~~~~~~~~~~~~~~~~~~ 80 (428) T protein:vir:10 1 MPQIEELRRQRAGINEQIQALATIEATNGTLTAEQLTEFAGLQQQFTDISAKMDRMEATERAAALVAKPVKATQHGPAVI 80 (428) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhchhhccccc Confidence 7778888877777766665544321100 0 1122467888999999998888766544332221111111000 000 Q ss_pred hhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhh-hhccccc-ccceeecc-hhHHhHHHHHHhhhhhhhhcceeEE Q lcl|NC_012784. 83 YRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDI-QGGSLKT-DSGFVVIP-EEIVTDILKLKEVEFNLDKYVTVKR 159 (415) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~-~~~~~~vP-~~~~~~Ii~~~~~~~~l~~~~~~~~ 159 (415) ..........................+.......... ....... ...+..-. ..+-.++... +..++.... T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~liP~~~~~~------ii~~l~~~~ 154 (428) T protein:vir:10 81 VKAEPKQYTGAGMTRMVMSIAAAQGNLQDAAKFASDELNDQSVSMAISTAAGSGGVLIPQNIHSE------VIELLRDRT 154 (428) T ss_pred cccccchhhhHHHHHHHHHHHHhhhhHHHHHHHhhhhhhhhhHhhhhcccccCCccccchhHHHH------HHHHHhhhc Confidence 0000000000000000000000000000000000000 0000000 00000000 0111122111 222221111 Q ss_pred ccCCceeEEEEeecCCccccccccc---ccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHH-HHHHHHHHHHH Q lcl|NC_012784. 160 VTNGSGKYPVVRQSEVAALEKVEEL---EENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVL-QELKLWMARTI 235 (415) Q Consensus 160 ~~~~~~~~~~~~~~~~~~a~~v~Eg---~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~-~~l~~~la~~~ 235 (415) .-..-+...++..++....-....+ .-..+....+-...++..-.+...-.-..--+.+.-.+.. --+.+.|...+ T Consensus 155 ~l~~~~~~~~~~~~g~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~v~is~ell~ds~~~l~~~i~~~l 234 (428) T protein:vir:10 155 IVRKLGARSIPLPNGNMSLPRLAGGATASYTGENQDAKVSEARFDDVKLTAKTMIAMVPISNALIGRAGFNVEQLVLQDI 234 (428) T ss_pred hhhhhcceeeecCCcceEEEEEeCCcceeeeccCccccccccceeeEEeeeEEEEEeehhhHHHHhhhhHHHHHHHHHHH Confidence 1100011122222222111111111 1122222333334444444443322222222233322221 22566667777 Q ss_pred HHHHHHHHhhcccccccc-ccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHH-HHHHHH-hhccC- Q lcl|NC_012784. 236 AATRNKAIIDVITKGSTG-STSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTM-FAKLDK-MKDKL- 311 (415) Q Consensus 236 ~~~~d~~il~g~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~-~~~l~~-lkd~~- 311 (415) +.++...+-...=.|... ..+.+..+........... ....+.-+--... ...+.. ..+.+ T Consensus 235 ~~ai~~~~d~~~l~G~G~~~~p~Gi~~~~~~~~~~~~~---------------~~~~~~~~~~~~~~~~~~~~~~~~~~~ 299 (428) T protein:vir:10 235 LTAISVREDKAFMRDDGTGDTPIGMKARATQWNRLLPW---------------AADAAVNLDTIDTYLDSIILMSMDGNS 299 (428) T ss_pred HHHHHHHHHHHHhccCCCCccccccccccccccccccc---------------cccccccHHHHHHHHHHHHHhhhcccc Confidence 777776654433222211 1222221111110000000 0000000000011 111111 11111 Q ss_pred ----CcccccCcccCC--CCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeecccCceEEEEEEE Q lcl|NC_012784. 312 ----GNYLIQPDVKEK--TQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVR 385 (415) Q Consensus 312 ----G~~l~~~~~~~~--~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r 385 (415) ..+++.+..... .-.-=.|.|++... ..+.....++++-+.=............+-+-++..+... .| T Consensus 300 ~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~--~~g~l~G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~----~~ 373 (428) T protein:vir:10 300 NMISSGWGMSNRTYMKLFGLRDGNGNKVYPEM--AQGMLKGYPIQRTSAIPANLGEGGKESEIYFADFNDVVIG----ED 373 (428) T ss_pred ccccCEEEEcHHHHHHHHHhhccCCceeccCC--CCCeeeceeeEEeccccccccCCCccceEEEEecceEEEE----Ee Confidence 112221110000 00001355554211 1111001111111100000000000011112222211111 11 Q ss_pred eccEEe-ccc----------------------cEEEEEeecCCCCcccccccC Q lcl|NC_012784. 386 QDCRIL-DYK----------------------SAIVIEYDDSERGEGDLGLEA 415 (415) Q Consensus 386 ~d~~v~-~p~----------------------a~~~~~~t~~~~~~~~~~~~~ 415 (415) -+..+. .++ ++.++.+. ...+++=..+++ T Consensus 374 ~~i~i~~~~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~-v~~p~a~~~~t~ 425 (428) T protein:vir:10 374 GNMKVDFSKEASYIDTDGKLVSAFSRNQSLIRVVTEHDIG-FRHPEGLVLGTG 425 (428) T ss_pred cceEEEeecccccccccccccchhhcchhheeeeeeeCce-eeccceEEEEec Confidence 111110 011 11111111 111222222222 No 258 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=36.60 E-value=1.2 Score=20.22 Aligned_cols=380 Identities=9% Similarity=-0.046 Sum_probs=76.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccchhhhhhHH Q lcl|NC_012784. 8 QSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQA 87 (415) Q Consensus 8 ~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 87 (415) ++.++++++++.+..+++.....+.+...-++-.++.+.+.++++.++.+++++.+.....+.................. T Consensus 1 mk~~~em~~~l~el~~~~~~~~~e~~~~~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:46 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVNEA 80 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccchh Confidence 55555555555555444444433322222122233445555566655555554443333222211111111111100000 Q ss_pred HHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeE Q lcl|NC_012784. 88 NINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKY 167 (415) Q Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~ 167 (415) ..........................+.............. + .+.+.-...++........+..+-...++......+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~-~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~ 158 (415) T protein:vir:46 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQG-G-SLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) T ss_pred hhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhhhh-c-cccccCCcccccHHHHHHHHHHHHhhhhhhhhccee Confidence 00000000000000000000000000110000000000000 0 011110110111100011111111111111111111 Q ss_pred EEEeecCC-ccc--ccccccccccccccc-cceeeEeeeeeEEEeehhhHHHHhcchHHHHHH-HHHHHHHHHHHHHHHH Q lcl|NC_012784. 168 PVVRQSEV-AAL--EKVEELEENPELAVK-PFFQLAYDINTHRGYFRISREAIEDAKVNVLQE-LKLWMARTIAATRNKA 242 (415) Q Consensus 168 ~~~~~~~~-~~a--~~v~Eg~~~~~~~~~-~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~-l~~~la~~~~~~~d~~ 242 (415) ++...... +.. ....+..-..+.... .....++..-.+...-.-..--+.+.-.+...+ |.+.|...++.++... T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~ 238 (415) T protein:vir:46 159 RVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAAT 238 (415) T ss_pred eccCCceeEEEEEecCCcceeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHH Confidence 11111000 000 000111111111111 111223333333222111111122222222122 4444555555555544 Q ss_pred HhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCC---cccccCc Q lcl|NC_012784. 243 IIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLG---NYLIQPD 319 (415) Q Consensus 243 il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G---~~l~~~~ 319 (415) +-...=.|...+................ .......-......+..+.+... .+++.+. T Consensus 239 ~d~~il~g~g~g~~~~~~~~~~~~~~~~-------------------~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~ 299 (415) T protein:vir:46 239 RNKAIIDVITKGSTGSTSSGFEKEGKKL-------------------EVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQT 299 (415) T ss_pred HHHHHhhccccCCcccccccccccccee-------------------ccccccchHHHHHHHHhhhhhccCCCEEEEcHH Confidence 3333222221111111100000000000 00000000011111222222211 1111110 Q ss_pred ccCC--CCceecceeeEEeccccccccCCceEEEechh---hcEEEEeecceEEEEeecccCceEEEEEEEeccEEe--- Q lcl|NC_012784. 320 VKEK--TQQRLLGAKIEILPDEVLGQKGNNTLIIGNLK---DAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRIL--- 391 (415) Q Consensus 320 ~~~~--~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~---~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~d~~v~--- 391 (415) .... .-.-=.|.|++..+. ..+ ....+.|-.= ............+-+-++..+. ....|-+..+. T Consensus 300 ~~~~L~~lkd~~G~~i~~~~~-~~~---~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~---~~~~~~~~~v~~~~ 372 (415) T protein:vir:46 300 MFAKLDKMKDKLGNYLIQPDV-KEK---TQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAI---VLFDRSQYQASWTD 372 (415) T ss_pred HHHHHHHhhccCCCeeeccCc-CCC---CCccccceeeEEeccccccCCCccEEEEEehhccE---EEEeecceEEEeec Confidence 0000 000012444432210 000 0001111100 0000000000111111111100 00000000000 Q ss_pred ---cc---ccEEEEEeec---CCCCcccccccC Q lcl|NC_012784. 392 ---DY---KSAIVIEYDD---SERGEGDLGLEA 415 (415) Q Consensus 392 ---~p---~a~~~~~~t~---~~~~~~~~~~~~ 415 (415) +. .++.++.+.. .+--...+++.| T Consensus 373 ~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~ 405 (415) T protein:vir:46 373 YMHFGECLMIAVRQDCRILDYKSAIVIEYDDSE 405 (415) T ss_pred cccCceEEEEEEEeccEEeccccEEEEEeeccC Confidence 00 0111111111 000011222222 No 259 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=36.60 E-value=1.2 Score=20.22 Aligned_cols=380 Identities=9% Similarity=-0.046 Sum_probs=76.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccchhhhhhHH Q lcl|NC_012784. 8 QSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQA 87 (415) Q Consensus 8 ~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 87 (415) ++.++++++++.+..+++.....+.+...-++-.++.+.+.++++.++.+++++.+.....+.................. T Consensus 1 mk~~~em~~~l~el~~~~~~~~~e~~~~~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:47 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVNEA 80 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccchh Confidence 55555555555555444444433322222122233445555566655555554443333222211111111111100000 Q ss_pred HHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeE Q lcl|NC_012784. 88 NINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKY 167 (415) Q Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~ 167 (415) ..........................+.............. + .+.+.-...++........+..+-...++......+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~-~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~ 158 (415) T protein:vir:47 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQG-G-SLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) T ss_pred hhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhhhh-c-cccccCCcccccHHHHHHHHHHHHhhhhhhhhccee Confidence 00000000000000000000000000110000000000000 0 011110110111100011111111111111111111 Q ss_pred EEEeecCC-ccc--ccccccccccccccc-cceeeEeeeeeEEEeehhhHHHHhcchHHHHHH-HHHHHHHHHHHHHHHH Q lcl|NC_012784. 168 PVVRQSEV-AAL--EKVEELEENPELAVK-PFFQLAYDINTHRGYFRISREAIEDAKVNVLQE-LKLWMARTIAATRNKA 242 (415) Q Consensus 168 ~~~~~~~~-~~a--~~v~Eg~~~~~~~~~-~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~~-l~~~la~~~~~~~d~~ 242 (415) ++...... +.. ....+..-..+.... .....++..-.+...-.-..--+.+.-.+...+ |.+.|...++.++... T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~ 238 (415) T protein:vir:47 159 RVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAAT 238 (415) T ss_pred eccCCceeEEEEEecCCcceeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHH Confidence 11111000 000 000111111111111 111223333333222111111122222222122 4444555555555544 Q ss_pred HhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCC---cccccCc Q lcl|NC_012784. 243 IIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLG---NYLIQPD 319 (415) Q Consensus 243 il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G---~~l~~~~ 319 (415) +-...=.|...+................ .......-......+..+.+... .+++.+. T Consensus 239 ~d~~il~g~g~g~~~~~~~~~~~~~~~~-------------------~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~ 299 (415) T protein:vir:47 239 RNKAIIDVITKGSTGSTSSGFEKEGKKL-------------------EVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQT 299 (415) T ss_pred HHHHHhhccccCCcccccccccccccee-------------------ccccccchHHHHHHHHhhhhhccCCCEEEEcHH Confidence 3333222221111111100000000000 00000000011111222222211 1111110 Q ss_pred ccCC--CCceecceeeEEeccccccccCCceEEEechh---hcEEEEeecceEEEEeecccCceEEEEEEEeccEEe--- Q lcl|NC_012784. 320 VKEK--TQQRLLGAKIEILPDEVLGQKGNNTLIIGNLK---DAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRIL--- 391 (415) Q Consensus 320 ~~~~--~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~---~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~d~~v~--- 391 (415) .... .-.-=.|.|++..+. ..+ ....+.|-.= ............+-+-++..+. ....|-+..+. T Consensus 300 ~~~~L~~lkd~~G~~i~~~~~-~~~---~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~---~~~~~~~~~v~~~~ 372 (415) T protein:vir:47 300 MFAKLDKMKDKLGNYLIQPDV-KEK---TQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAI---VLFDRSQYQASWTD 372 (415) T ss_pred HHHHHHHhhccCCCeeeccCc-CCC---CCccccceeeEEeccccccCCCccEEEEEehhccE---EEEeecceEEEeec Confidence 0000 000012444432210 000 0001111100 0000000000111111111100 00000000000 Q ss_pred ---cc---ccEEEEEeec---CCCCcccccccC Q lcl|NC_012784. 392 ---DY---KSAIVIEYDD---SERGEGDLGLEA 415 (415) Q Consensus 392 ---~p---~a~~~~~~t~---~~~~~~~~~~~~ 415 (415) +. .++.++.+.. .+--...+++.| T Consensus 373 ~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~ 405 (415) T protein:vir:47 373 YMHFGECLMIAVRQDCRILDYKSAIVIEYDDSE 405 (415) T ss_pred cccCceEEEEEEEeccEEeccccEEEEEeeccC Confidence 00 0111111111 000011222222 No 260 >protein:vir:79078 Length: 307 # NCBI annotation: gp8 # Family: family:all:908 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111208;genbank:gi:134288798;genbank:GeneID:4960752 Probab=36.28 E-value=1.2 Score=20.19 Aligned_cols=275 Identities=13% Similarity=0.086 Sum_probs=96.9 Q ss_pred hcccccccceeecchhHHhHHHHHHhhhhhhh-hcceeEEccCCceeEEEEeecCCccc-ccccccccccccccccceee Q lcl|NC_012784. 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLD-KYVTVKRVTNGSGKYPVVRQSEVAAL-EKVEELEENPELAVKPFFQL 198 (415) Q Consensus 121 ~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~-~~~~~~~~~~~~~~~~~~~~~~~~~a-~~v~Eg~~~~~~~~~~f~~v 198 (415) ++.. . ...+...+.+.+..--++..-+. .++..+++...+++|+.......... ...+-++......-..++.. T Consensus 1 m~~~---~-~~~~~dp~LT~~A~gy~n~~~Iad~lfP~vpV~~~~~k~~~f~~e~f~~~~t~ra~~~~~~~v~~~~~~~~ 76 (307) T protein:vir:79 1 MGRL---S-KLRIVDPVLTNLAIGYTNAEFIGQTLMPVVEVEKEGGKIPKFGKESFRLYQTERALRAKSNRMNPEDIDSV 76 (307) T ss_pred CCCC---C-CCcccCHHHHHHHhhccchhhhhhhcCCcccccccccceeeeccccccccccccccCCCcceeeeeccccc Confidence 1110 0 00111112222221111111111 12334455555555544321111000 01122222111110112222 Q ss_pred EeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-cccccccccccccccccchhhHHHH Q lcl|NC_012784. 199 AYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGST-GSTSSGFEKEGKKLEVKKAKSLDDI 277 (415) Q Consensus 199 ~~~~~k~a~~~~iS~e~l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 277 (415) ++.....+-..++...--..+.+++++.-.+.+.+.|....|..+-.-.-+... .................+.+.+.++ T Consensus 77 ~~~~~~~~l~~~id~r~~~~~~~~~~~~Av~~l~d~I~l~~E~~~A~l~~~~~~y~~~~k~tLsgt~~Wsd~~sDPi~di 156 (307) T protein:vir:79 77 DVNLDEHDLEYPIDYREDQESAFPLEQAAVQTATDAIQLRREKMIADLSQNPSSYAAGNKKQLSATEKFTAANSDPVGVI 156 (307) T ss_pred cccccccchhhcccchhcCCCCCCHHHHHHHHHHHHHHhHHHHHHHHHhccccccCCCceEEEccCcccCCCCCCcHHHH Confidence 333333332334433333344556677766777776666666543321111111 0111111111112233455667777 Q ss_pred HHHHHHhh-hhccCCCEEEEcHHHHHHHHH----hhccCC--cccccCcccCCCCceeccee-eEEeccccccccCCceE Q lcl|NC_012784. 278 KDAINLNV-KPNYEHNVAIVSQTMFAKLDK----MKDKLG--NYLIQPDVKEKTQQRLLGAK-IEILPDEVLGQKGNNTL 349 (415) Q Consensus 278 ~~~~~~~~-~~~~~~~~~v~~~~~~~~l~~----lkd~~G--~~l~~~~~~~~~~~~l~G~p-V~~~~~~~~~~~~~~~~ 349 (415) .+.+..+. .-++.++.++|.+..|.+|+. ++.-.+ ..++.+. .-..++|.. |.+....-....++..- T Consensus 157 ~~~~~ai~~~~g~~Pn~~vlg~~a~~~l~~h~~i~~~lk~~~~g~it~~----~la~l~~v~~V~vg~a~y~~~~~~~~~ 232 (307) T protein:vir:79 157 EDGKEAIRTKIGRRPNTMVIGASAYKTLKAHPQLIEKIKYSMKGIVTVD----LLKEIFEVENIAVGEAIYADDKDRFTD 232 (307) T ss_pred HHHHHHHHHhhCCccceEEeCHHHHHHHhcCHHHHHHhcCccccccCHH----HHHHHhCceeEEEeeeeeecccccchh Confidence 77777665 457889999999999999864 222222 2222111 112345544 33322221111111111 Q ss_pred EEec--------------------hhhcEEEEeecceEE-EEeecccCceEEEEEEEeccEEeccccEEEEEeecCCCCc Q lcl|NC_012784. 350 IIGN--------------------LKDAIVLFDRSQYQA-SWTDYMHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGE 408 (415) Q Consensus 350 ~~gd--------------------~~~~~~~~~~~~~~i-~~~~~~~~~~~~~~~~r~d~~v~~p~a~~~~~~t~~~~~~ 408 (415) +.|+ ++-+|+. .+++..+ +...........|+...+.-.+.-|++-.+++ T Consensus 233 iw~~~~~l~y~~~~~~~~~~~~~~ps~Gyt~-~~~g~~~~d~~~~~~~~~~vrv~~~~~~~i~~~~~G~li~-------- 303 (307) T protein:vir:79 233 IWGANIVLAYVPLQRGGQQRTPYEPSYGYTL-RKKGNPVVDTRIEDGKLELVRATDIFRPYLLGADAGYLIS-------- 303 (307) T ss_pred cCCCceEEEecccccCCCCCcccccccceeE-EecCceEEecccCCCceeEEeecccccceeeccccchhhc-------- Confidence 1111 1111211 1222211 11111111122333334444444444433332 Q ss_pred cccc Q lcl|NC_012784. 409 GDLG 412 (415) Q Consensus 409 ~~~~ 412 (415) +.++ T Consensus 304 ~~v~ 307 (307) T protein:vir:79 304 GING 307 (307) T ss_pred cCCC Confidence 0011 No 261 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=34.81 E-value=1.3 Score=20.02 Aligned_cols=361 Identities=11% Similarity=-0.003 Sum_probs=104.4 Q ss_pred CCh----------HHHHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh Q lcl|NC_012784. 1 MKT----------KEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSEN 70 (415) Q Consensus 1 Mk~----------~~el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~ 70 (415) |.. .+|+++++++++++++....+.+.. .++..++.+++.+...+++++++.+..+.+.+.+.....+. T Consensus 4 ~~~~~~~~~~~~~~~el~~~~~e~~~~l~~~~~e~~~~-~e~~~~e~~~~~~~~~e~~~~~~~l~~~~~~l~~~~~~~e~ 82 (418) T protein:vir:10 4 MNEPRQFGRKSGGDSHPEQVLETVTKELKRIGDEVKSA-GEKALAEAKRAGDLGVETKATVDELLIKQGELQARLLEAEQ 82 (418) T ss_pred chhHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 322 2334444455555444444443332 33333444555555666677777777666665555444333 Q ss_pred ccccccccchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhh-----hhhcc--cccccceeecchhHHhHHHH Q lcl|NC_012784. 71 NQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRND-----IQGGS--LKTDSGFVVIPEEIVTDILK 143 (415) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~--~~~~~~~~~vP~~~~~~Ii~ 143 (415) .............. ....... ........+......... ..... .....+...-...+-..+.. T Consensus 83 ~~~~~~~~~~~~~~-~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lvp~~~~~ 153 (418) T protein:vir:10 83 KLARGGGSAELETP-KTLGQLV--------TESEEMKGMDGSARKSVRVRVDRKSIMNVPATVGSGVSGSNSLVVADRQA 153 (418) T ss_pred HHhhcccccccchh-hhhhHHh--------hhHHHHHHHHHHHhhhhhhhhHHHHHHHhhhhccCCCCCCccccchhHHH Confidence 32222211111111 1100000 011111111111111111 01111 11111110111112222222 Q ss_pred HHhhhhhhhhcceeEEccCCceeEEEEeecCCcccccccccc------cccccccccceeeEeeeeeEEEeehhhHHHHh Q lcl|NC_012784. 144 LKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELE------ENPELAVKPFFQLAYDINTHRGYFRISREAIE 217 (415) Q Consensus 144 ~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~------~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ 217 (415) . .+..+....++......+++ ++....+.-+.. -..+.....-...++....+...-....--+. T Consensus 154 ~-----ii~~~~~~~~l~~~~~~~~~----~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~is 224 (418) T protein:vir:10 154 G-----IIAPPQRKMTIRDLLMPGQT----SSSSIEYTVETGFTNNAAAVAEGAQKPTSDLKFNLKNQPVRTIAHLFKAS 224 (418) T ss_pred H-----HHHHHhhhhhHHhhcceeec----cCCceeEEEEecCCCceeeeccCccccccccceeeEEEeeeeEEEeehhh Confidence 1 11111111122211112222 112222222111 11222222233445555444433222111233 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc-cccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEE Q lcl|NC_012784. 218 DAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGS-TSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIV 296 (415) Q Consensus 218 ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 296 (415) +.-.+-...+...+...++.++...+-...=.|.+.+ .+.+............ ..+.+.. T Consensus 225 ~ell~ds~~l~~~i~~~l~~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~-------------------~~~~~~~ 285 (418) T protein:vir:10 225 RQILDDAPALQSYIDGRARYGLQLTEEGQILKGDGTGANILGILPQASAFMPSI-------------------TLANATP 285 (418) T ss_pred HHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccc-------------------ccccccc Confidence 3333333456666666666666666554333332211 1222222111111100 0011111 Q ss_pred cHHHHHHHHHhhccCCc---ccccCc-------ccCCCCceecceeeEEeccccccccCCceEEEechhhcEEEEeec-c Q lcl|NC_012784. 297 SQTMFAKLDKMKDKLGN---YLIQPD-------VKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRS-Q 365 (415) Q Consensus 297 ~~~~~~~l~~lkd~~G~---~l~~~~-------~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~-~ 365 (415) -......+..+...... +++.+. ..+. .|.|++.... .+. ...++|-. +...+.. . T Consensus 286 ~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~-----~G~~i~~~~~--~~~---~~~l~G~p---V~~~~~~p~ 352 (418) T protein:vir:10 286 IDKIRLALLQAVLAEFPATGIVLNPIDWASIELTKDS-----QGRYIVGNPV--NGT---TPRLWNLP---VVETQAMTA 352 (418) T ss_pred HHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcC-----CCceeccccc--cCC---Cceeccee---eEEcCCCCC Confidence 11122233344433322 222211 1111 2455442110 111 11233311 1110000 0 Q ss_pred eEEEEeecccCceEEEEEEEeccEEe-cc--------c-----cEEEEEee---cCCCCcccccccC Q lcl|NC_012784. 366 YQASWTDYMHFGECLMIAVRQDCRIL-DY--------K-----SAIVIEYD---DSERGEGDLGLEA 415 (415) Q Consensus 366 ~~i~~~~~~~~~~~~~~~~r~d~~v~-~p--------~-----a~~~~~~t---~~~~~~~~~~~~~ 415 (415) -.+.+-++.. .+....|-+..+. .+ . +..++.+. +.+--.+.+++.| T Consensus 353 ~~~~~gd~s~---~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~~~d~~~~~~~a~~~~~~~~~~ 416 (418) T protein:vir:10 353 NEFLVGAFSM---AAQIFDRMEIEVLLSTENVDDFEKNMVSIRAEERLALAVYRPESFVTGALVEQA 416 (418) T ss_pred CcEEEeeccc---eEEEEEecceEEEEecccchhhhcCceEEEEEEeeccEEecccceEEEEeccCC Confidence 1111122211 1111111111110 00 0 11111111 1111112223333 No 262 >protein:vir:99576 Length: 388 # NCBI annotation: hypothetical protein # Family: family:all:1653 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039801;genbank:gi:126011051;genbank:GeneID:4818271 Probab=33.77 E-value=1.3 Score=19.90 Aligned_cols=339 Identities=8% Similarity=-0.042 Sum_probs=123.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccchhhhhhHHHHHHHHHHHHHhhhhhHH--HHHHHH---HHhhhhh Q lcl|NC_012784. 44 ITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQE--VRDFTE---YLETRND 118 (415) Q Consensus 44 ~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~---~~~~~~~ 118 (415) +.++.+....+..+.-+..+ +............. .....+........... ...+.. ....... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~-------~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~a~da 69 (388) T protein:vir:99 1 MKQLSKVHQSLAGRSVRAFD----MANGKADYRLTDMA-------VRELKKFGLVFDHATVKRQIELLHEGGVATQAFDS 69 (388) T ss_pred CCCccceeeecCCcccchhh----hhcCCcceeeechh-------hHhhhhcceeccCccchhhhhhhhhhhhhhcccCc Confidence 00000000000000000000 00000000000000 00000000000000000 000000 0001111 Q ss_pred hhhcccccccceeecchhHHh----HHHHHHhhhhhhhhcceeEEccCC-ceeEEEEeecCCcccccccccccccccccc Q lcl|NC_012784. 119 IQGGSLKTDSGFVVIPEEIVT----DILKLKEVEFNLDKYVTVKRVTNG-SGKYPVVRQSEVAALEKVEELEENPELAVK 193 (415) Q Consensus 119 ~~~~~~~~~~~~~~vP~~~~~----~Ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~ 193 (415) ...+.. +.++.-+|..+.+ .|++.+........++.+...+.- ...+.+........+.+.+-+...|-.+ . T Consensus 70 ~~~~~~--t~~~~gip~~~~~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~~~~f~v~e~~G~A~~ygd~~D~Pl~d-~ 146 (388) T protein:vir:99 70 AYVAPT--TQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSS-W 146 (388) T ss_pred cccccc--ccCcccHHHHHhhhhccceeeeeechhhhhhhccccccCCccceeEEEeeeecceeEEEeecccCCCcee-c Confidence 111222 3334446766665 333333333333334443332111 0123333334445666778887777433 4 Q ss_pred cceeeEeeeeeEEEeehhhHHHHh---cchHHHHHHHHHHHHHHHHHHHHHHHhhccccccc---cccccccc------c Q lcl|NC_012784. 194 PFFQLAYDINTHRGYFRISREAIE---DAKVNVLQELKLWMARTIAATRNKAIIDVITKGST---GSTSSGFE------K 261 (415) Q Consensus 194 ~f~~v~~~~~k~a~~~~iS~e~l~---ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~---~~~~~~~~------~ 261 (415) .....+-..+.+...+.++.+=+. ...+++.+.-.....+++.+.+++-.+.|...... -+...+.. . T Consensus 147 ~~~~~~r~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~AA~~ale~~~N~i~f~G~~g~~~~~~yGllNdP~l~a~v~a 226 (388) T protein:vir:99 147 NVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIAS 226 (388) T ss_pred cceeeeeeEEEEEeeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHhhhceEEEEeecCCCccceEEEeeCCCccccccc Confidence 444555555666666667644332 23456777778888888888888888878543221 11111111 0 Q ss_pred cc-cccc----ccchhhHHHHHHHHHHhhhhccC-------CCEEEEcHHHHHHHHHhhccCCcccccCcccCCCCceec Q lcl|NC_012784. 262 EG-KKLE----VKKAKSLDDIKDAINLNVKPNYE-------HNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLL 329 (415) Q Consensus 262 ~~-~~~~----~~~~~~~~~~~~~~~~~~~~~~~-------~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~ 329 (415) .+ .... .+...-++|+..++.++...... +..++|.|..+..|.. .+..|.-++. -+.. .+. T Consensus 227 t~~~~~~~Wa~kT~~eI~~Di~~~~~~i~~qs~g~~~~~~~~~tL~LP~~~~~~Ls~-~n~~g~Tvl~-~lk~----n~P 300 (388) T protein:vir:99 227 TTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSV-VTDLGISVRD-WLKQ----TYP 300 (388) T ss_pred ccCCcCcccccCCHHHHHHHHHHHHHHHHHhcCCeeeecccceEEEechHHHHhccc-cCcCCccHHH-HHHH----hcC Confidence 00 0000 11222356667777666543321 2268889988888853 2333322211 0111 122 Q ss_pred ceeeEEecccccc---ccCCceEEEechhhc-EEEE-eecc-----eEEEEeeccc--CceEEE--EEEEe-ccEEeccc Q lcl|NC_012784. 330 GAKIEILPDEVLG---QKGNNTLIIGNLKDA-IVLF-DRSQ-----YQASWTDYMH--FGECLM--IAVRQ-DCRILDYK 394 (415) Q Consensus 330 G~pV~~~~~~~~~---~~~~~~~~~gd~~~~-~~~~-~~~~-----~~i~~~~~~~--~~~~~~--~~~r~-d~~v~~p~ 394 (415) ++.++........ ..++...++.+--.. +... ++.. +...+.-..- ....++ ...|. |+.+.+|. T Consensus 301 nl~i~t~pEl~~a~~tgg~~~~~~~~~~~~~~~~~~~~~~~t~~~~~p~~~~~l~vq~~~~~~~~~~~~rt~Gv~ir~P~ 380 (388) T protein:vir:99 301 RVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPW 380 (388) T ss_pred CcEEEEecccccccccCCceeEEEEecccccccccCccCcceeEEecccccccccceecCceeEeccccceeeeEEeccc Confidence 3334443332211 112222333221100 0000 0000 0001100000 001111 12344 55677899 Q ss_pred cEEEEEee Q lcl|NC_012784. 395 SAIVIEYD 402 (415) Q Consensus 395 a~~~~~~t 402 (415) ||++++=- T Consensus 381 Ai~~~~GI 388 (388) T protein:vir:99 381 AVVRLIGL 388 (388) T ss_pred hhheeccC Confidence 99998732 No 263 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=33.37 E-value=1.4 Score=19.85 Aligned_cols=379 Identities=8% Similarity=0.002 Sum_probs=124.3 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhch----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccc Q lcl|NC_012784. 1 MKTKEELQSEISDIKRQIDLKVKYATRALNN----DELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVE 76 (415) Q Consensus 1 Mk~~~el~~~l~~l~~~~~~~~~~~~~~~~e----~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (415) |+.+.+.++++.+..+.+.++.++..+.... ...+++++++++++.+++++++.++.................... T Consensus 7 lkel~~~~~el~~~~~~~~~~~~~~~~e~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 86 (421) T protein:vir:13 7 LKELRAKKKELEEKRCGIVEEIRSLAKEKKEEEARSKALEREKIEARMEIIEEEIESVMTAIDEERKNTNFTGGRVIING 86 (421) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc Confidence 8888888887777776666666555443332 335678889999999998888877766554333222211111111 Q ss_pred ccchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhc--ccccccceeecchhHHhHHHHHHhhhhhhhhc Q lcl|NC_012784. 77 VNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGG--SLKTDSGFVVIPEEIVTDILKLKEVEFNLDKY 154 (415) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~ 154 (415) . ............+....+........... .....+ .....-..-+++......++..+-...++... T Consensus 87 ~--~~~~~~~~~~~~~~~~~~~~~~~~~~ra~--------~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~l~~~~~~~~~ 156 (421) T protein:vir:13 87 D--SKEEKRSLQLSAMSKTIRGIQLSEEERDI--------MSSTNNGAVIPQEFVNEFEKLKEGYPSLKEHCHVIPVNRN 156 (421) T ss_pred c--hhHHHHHHHHHHHHHhhhccchhHHHhhc--------cccCCcceecchhhHHHHHHHHHhhhhhhhhceeeeccCC Confidence 1 11111122222222222221111111110 000111 11111111122222222222222222223222 Q ss_pred ceeEEccCCceeEEEEeecCCcccccccccccccccccccceeeEeeeee-EEE-eehhhHHHHhcchHHHHHHHHHHHH Q lcl|NC_012784. 155 VTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINT-HRG-YFRISREAIEDAKVNVLQELKLWMA 232 (415) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k-~a~-~~~iS~e~l~ds~~~l~~~l~~~la 232 (415) ...+++......-.+.... ....+.+. .+..+..+|..-++...- +.. .+.-|..-+. .+ +...|...+. T Consensus 157 ~~~~~~~~~~~~~~~~~~~---E~~~~~~s--~~~f~~i~~~~~k~~~~v~iS~ell~ds~~~l~--~~-i~~~la~~~~ 228 (421) T protein:vir:13 157 AGKMPVRAGASVDKLANLA---KDTELVKA--MLKTQPMAYDIDDYGLLAPIDNSLLEDSEINFL--EF-VNEEFAEFAV 228 (421) T ss_pred ceEEEEeecCCccceeecc---cccccccc--ccceeEEEeeeeeeEeehhhhHHHHhhhHHHHH--HH-HHHHHHHHHH Confidence 2223322221111111111 11223332 233333444433333211 111 1111211111 11 4455555555 Q ss_pred HHHHHHHHHH---Hhhccccccccccccccccccc-cccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhh Q lcl|NC_012784. 233 RTIAATRNKA---IIDVITKGSTGSTSSGFEKEGK-KLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMK 308 (415) Q Consensus 233 ~~~~~~~d~~---il~g~g~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lk 308 (415) ..+-..+-.. +++..+................ ......-.........+..+.+. +..+++.+.....- . T Consensus 229 ~~~~~~i~~~~~g~~~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~---~G~~i~~~~~~~~~---~ 302 (421) T protein:vir:13 229 NTENAEIVKQAKAVLAEETINDYAGLVKTINSLVPNARKRAIIVTNSDGRAYLDGLMDK---QGRPLLKELSDGGD---L 302 (421) T ss_pred HHhhhhHhhhhhhccccccccchHHHHHHHHHhhhhhcCCCEEEEcHHHHHHHHHhhcC---CCceeecCcCCCCC---c Confidence 5554444332 2222222111110000000000 00000000011112223333332 22233332110000 0 Q ss_pred ccCCcccccCc-ccCCCCceecceeeEEeccccccccCCceEEEechhhcEEE-E------eecceEEEEeecccCceEE Q lcl|NC_012784. 309 DKLGNYLIQPD-VKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVL-F------DRSQYQASWTDYMHFGECL 380 (415) Q Consensus 309 d~~G~~l~~~~-~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~-~------~~~~~~i~~~~~~~~~~~~ 380 (415) -=.|.|+...+ ...+..+ -.++++-+. ...+++++... +.+ + .+..+.+...... +.... T Consensus 303 tl~G~pV~~~~~~~~~~~~---~~~~~~gd~-------~~~~~~~~~~~-~~v~~~~~~~f~~~~~~~r~~~r~-d~~~~ 370 (421) T protein:vir:13 303 VFKGRPVIELEESIFDVGD---ETKFIVSDF-------KTLIKFMDRKQ-YLIDQSKEAGYTKNETIARIIERF-DVNSP 370 (421) T ss_pred eecceeeEEeccccccCCC---ceEEEEEec-------cccEEEEEecc-eEEEeecccccccCeeEEEEEeee-cceee Confidence 01234443211 0001000 012222211 11234455433 221 1 1111222221110 00111 Q ss_pred EEEEEeccEEeccccEEEEEeecCCC-CcccccccC Q lcl|NC_012784. 381 MIAVRQDCRILDYKSAIVIEYDDSER-GEGDLGLEA 415 (415) Q Consensus 381 ~~~~r~d~~v~~p~a~~~~~~t~~~~-~~~~~~~~~ 415 (415) .-..-..+.+..+.+|+..+-+++++ ..|...-+- T Consensus 371 ~~~a~~~~~~~~~~a~v~~~~~~~~~~~~~~~~~~~ 406 (421) T protein:vir:13 371 LDKSSDAEKIRKFGVIVKLQEVLKSSPRSGKNKNES 406 (421) T ss_pred cchhhheeeecccceeeccccccCCCCcCCCCcccc Confidence 11112245667788888875444333 333332222 No 264 >protein:vir:100851 Length: 514 # NCBI annotation: hypothetical protein # Family: family:all:2450 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164744;genbank:gi:56693157;genbank:GeneID:3197484 Probab=32.17 E-value=1.4 Score=19.71 Aligned_cols=329 Identities=12% Similarity=0.007 Sum_probs=115.6 Q ss_pred HHHHHHHHHHHHHHHhhhhhccccccccchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccce Q lcl|NC_012784. 51 IQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGF 130 (415) Q Consensus 51 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 130 (415) +-.. +..+..+ +..........+.....+.... ..... .+.... ..........+.++++ T Consensus 1 ~~~~----~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~---------~~~~~---~~~k~a-~t~gy~~~~~~~t~ga 60 (514) T protein:vir:10 1 MYTQ----DKTKDIM---KKSFFGGDRAVAFDTNKEDILN---------ENLPE---NVKKSA-FTAGHSITPDTQTDGA 60 (514) T ss_pred CCcc----chhhHHH---hhhhcccceeeeecCcHHHHHH---------Hhcch---hhhhhh-hccccccCCccccCcc Confidence 0000 0000000 0000000000000000000000 00000 000000 0000111122333344 Q ss_pred eecchhHHhHHHHHHhh--hhhhhhcceeEEccCCceeEEEEeecCC-cccccccccccccccccccceeeEeeeeeEEE Q lcl|NC_012784. 131 VVIPEEIVTDILKLKEV--EFNLDKYVTVKRVTNGSGKYPVVRQSEV-AALEKVEELEENPELAVKPFFQLAYDINTHRG 207 (415) Q Consensus 131 ~~vP~~~~~~Ii~~~~~--~~~l~~~~~~~~~~~~~~~~~~~~~~~~-~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~ 207 (415) .+--+.+.+++..+... ...++.-....++.+--..|......+. .-..++.|++- ++.+++.+....+.++-+.. T Consensus 61 AlR~EsLd~~l~~Lt~~~~~ftf~~~i~k~~a~STV~ey~~~~~~G~~G~~~f~~E~gi-~~~~d~~~~rk~~~~k~l~~ 139 (514) T protein:vir:10 61 ANRIESLNRDLKVTTWGERDFTLYNDIAKQPVDNTVLKYTQYYSHGRTGHSLFQPEIGI-GDVNNPNERQRTINIKYIVD 139 (514) T ss_pred chhhhhhccceeEeeecCcchhhhhhcCCchhhHHHhhhhhhcccCccccccccccccc-CcCCCcceEEEEEeeeeeee Confidence 33323333333211111 1122222223344433334433333333 35668899984 56788999999999998887 Q ss_pred eehhhHHH-HhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc-----cccc---ccccccccchhhHHHHH Q lcl|NC_012784. 208 YFRISREA-IEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSS-----GFEK---EGKKLEVKKAKSLDDIK 278 (415) Q Consensus 208 ~~~iS~e~-l~ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~-----~~~~---~~~~~~~~~~~~~~~~~ 278 (415) -..+|..+ +.++..+......+.-...++..+|.+.+.|+..=.+..... +..+ .......-+.....+++ T Consensus 140 ~~~vS~~~~l~n~i~d~~~~~~~dai~~ia~tiE~a~FyGDs~L~s~~~~~gleFDGl~~lI~~~NvIDarG~~Ls~~~l 219 (514) T protein:vir:10 140 THVTSIALQRANTIVDSLKVQEYAAISTVIKTDEWAMFYGDADLTSGQKGEGLQFDGLFKLIAPENHIDLRGGRLSPAAL 219 (514) T ss_pred eeeeeehhhhccchhhHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccCcchhhhHHHhhcCCCeEecCCCCccHHHH Confidence 66666443 245777888889999999999999999998875433211100 0000 01111111222223333 Q ss_pred -HHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcccccCcccCCCCceecceeeEEeccccccccCCceEEEechhhc Q lcl|NC_012784. 279 -DAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDA 357 (415) Q Consensus 279 -~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~ 357 (415) .+...+...+..++-++|+.-+.+.|..--...-|-++..++ .+...|.||--.-. .-|... +-|+ T Consensus 220 n~aA~~i~~gfGt~TD~ylp~~vka~f~~~~~~~qRV~~~~n~----~~~~~G~~v~~f~s----~~G~I~-L~gs---- 286 (514) T protein:vir:10 220 NMAARKIGEGFGTPTDAYMPIGIKADFVNQHLNGQRVMLPGQT----GGMTTGLDIDKFLS----AHGSIR-IQGS---- 286 (514) T ss_pred hhhhhhhhcccCChhheeCchHHHHHHhhcccCcceEEeecCc----cceeeeeeccceeE----ecccee-ecCC---- Confidence 333333345556677888888877665433322222211111 11233443321100 000000 0000 Q ss_pred EEEEeecceEE----EEeecccCceEE--------------------EE--EEEeccEEeccccEEEEEeecCCCCcccc Q lcl|NC_012784. 358 IVLFDRSQYQA----SWTDYMHFGECL--------------------MI--AVRQDCRILDYKSAIVIEYDDSERGEGDL 411 (415) Q Consensus 358 ~~~~~~~~~~i----~~~~~~~~~~~~--------------------~~--~~r~d~~v~~p~a~~~~~~t~~~~~~~~~ 411 (415) .+.+ ....+ ...+.+...-.+ .+ +.--+..+...-+++......-+.+--.. T Consensus 287 -~im~-~~n~L~~~~~~~~~Ap~~~~va~svT~~~~g~~~~ad~t~~~g~~~~~~~~g~~~sYaVv~~n~~GeS~ps~~v 364 (514) T protein:vir:10 287 -TIMD-SDNKLDFDRPVSPTAPTAPQLSATVTPDGGGLWHEADKTDSKGEVILNKEVGVEQSYVAVMVSRHGDSRPSLVQ 364 (514) T ss_pred -eeec-ccccCccCCccCCcCCCCCcceEEEecCcccccCcccccccccccccccccceeEEEEEEEECCCCccccccee Confidence 0000 00000 000110000000 00 00011111112222222222111111122 Q ss_pred cccC Q lcl|NC_012784. 412 GLEA 415 (415) Q Consensus 412 ~~~~ 415 (415) +.|+ T Consensus 365 taT~ 368 (514) T protein:vir:10 365 TATP 368 (514) T ss_pred eeee Confidence 2222 No 265 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=30.60 E-value=1.6 Score=19.52 Aligned_cols=277 Identities=9% Similarity=0.023 Sum_probs=105.3 Q ss_pred ccceeecchhHHhHHHHHHhhhhh----hhhc--ceeEEccCCceeEEEEeecC--Ccccccccccccccccccccceee Q lcl|NC_012784. 127 DSGFVVIPEEIVTDILKLKEVEFN----LDKY--VTVKRVTNGSGKYPVVRQSE--VAALEKVEELEENPELAVKPFFQL 198 (415) Q Consensus 127 ~~~~~~vP~~~~~~Ii~~~~~~~~----l~~~--~~~~~~~~~~~~~~~~~~~~--~~~a~~v~Eg~~~~~~~~~~f~~v 198 (415) .. ...-+.+...+.+....... +..- ...+... +..++.|++.+. +-..+.-.-|-..+..-+.++... T Consensus 1 Ma--inya~~~~~~Ld~~~~~~~lts~~l~~~~~~~~v~~~-ggktVkIp~is~tsGl~DY~R~~g~~~~g~v~~~~et~ 77 (346) T protein:vir:10 1 MT--INYAEKYQAAVQQAFYDGHLYSAELWNSPSNSIIKFD-GAKHIKVPRLEITSGRKDRQRRTITTPVANYSNDWDSY 77 (346) T ss_pred Cc--chhHHHHHHHHHHHHHhhhccchhhcccccccceEec-CCCEEEEEEeeeecccccccccCCcccccccccceeEE Confidence 00 00113344445444433211 1100 1111111 223555666542 222221111111111112234455 Q ss_pred EeeeeeEEEeehhhHHHHhc-c--hHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhhHH Q lcl|NC_012784. 199 AYDINTHRGYFRISREAIED-A--KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLD 275 (415) Q Consensus 199 ~~~~~k~a~~~~iS~e~l~d-s--~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 275 (415) +++-.+.-.+.- +.|-.| + ...+...+.+...+.+.=.+|...++-.-++...... ........+....++ T Consensus 78 tl~qDR~~~F~v--D~mDvDETn~~~~~anv~~ef~r~~vvPEiDayrfskLa~~a~~~~~----~~~~~~a~T~~ni~~ 151 (346) T protein:vir:10 78 ELKNERYWSTLV--DPSDIDETNMVVSLANITKQFNLDSKMPEKDRYMFSHLYSGKEAAHD----GGITTNTLDEKNILP 151 (346) T ss_pred Eeeccccceecc--cccchHHHHHHhHHHHHHHHHHHHhhcchhhHHHHHHHHHhhhhhcc----ccccccccCHHHHHH Confidence 555544433320 101011 1 1122222222233333334555433322111000000 001112234556678 Q ss_pred HHHHHHHHhhhhccC--CCEEEEcHHHHHHHHHhhccCCcc-cccCcccCCCCceecceeeEEec--ccc---------- Q lcl|NC_012784. 276 DIKDAINLNVKPNYE--HNVAIVSQTMFAKLDKMKDKLGNY-LIQPDVKEKTQQRLLGAKIEILP--DEV---------- 340 (415) Q Consensus 276 ~~~~~~~~~~~~~~~--~~~~v~~~~~~~~l~~lkd~~G~~-l~~~~~~~~~~~~l~G~pV~~~~--~~~---------- 340 (415) .+.+++..+...... +-.++|+|..+..|.+...=+... +...+...+..+.|.|+||+.++ .|. T Consensus 152 ~i~~~~~~lde~~vp~~~rvl~vTp~~~~lLk~s~~f~k~~~v~~~~~i~~~V~siDGv~Ii~VPs~r~~t~~~f~~G~~ 231 (346) T protein:vir:10 152 AFDNMMLDFDEARIPSTNRILYVTPKTNAILKRAEAMNRALTLKDPNNIQRTVYSLDDVTIRVVPSDLMQTAYDFSDGSK 231 (346) T ss_pred HHHHHHHHHHHccCCCCCeEEEECHHHHHHHhhchhheeccccccccccceeeeeecCeEEEEcchhhcccchhhccCcc Confidence 889999888777654 457889999988776543211111 11222335556789999998754 232 Q ss_pred -ccccCCceEEEechhhcEEEEeecceEEEEeecccCce--EEEEE--EEeccEEecc-ccEEEEEeec-CCCCc----c Q lcl|NC_012784. 341 -LGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGE--CLMIA--VRQDCRILDY-KSAIVIEYDD-SERGE----G 409 (415) Q Consensus 341 -~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~--~~~~~--~r~d~~v~~p-~a~~~~~~t~-~~~~~----~ 409 (415) ..++.+-.+++.... +..-.. .--.+....-. ++. .+... .+.|.-|.+. ..-+++.++. ++.++ + T Consensus 232 ~~t~ak~INfiiv~~~-A~ia~~-K~~~~~if~P~-~~~~g~~l~~~R~Y~D~fv~~nk~~~Iyv~~~~a~~~~~~~~~~ 308 (346) T protein:vir:10 232 IIDTAKQIEMFLIYNG-VQIAPE-KYSFVGFDQPS-AATSGNYLYYEQSYDDVLLLNTKTKGIQFVVSDKPKKDQEQSGQ 308 (346) T ss_pred ccCCccceeEEEECCc-eeeeee-eeeeeEeeCCC-CCcccceeeeeeeeeeeeeeccccceEEEeeecccccCccCccc Confidence 111112233333322 121111 11111111111 111 12222 3456666553 3334444443 33333 3 Q ss_pred cccccC Q lcl|NC_012784. 410 DLGLEA 415 (415) Q Consensus 410 ~~~~~~ 415 (415) |.+=|+ T Consensus 309 ~~kpt~ 314 (346) T protein:vir:10 309 DAKPTA 314 (346) T ss_pred ccCccc Confidence 443333 No 266 >protein:vir:5942 Length: 523 # NCBI annotation: similar to major head protein # Family: family:all:364 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835728;genbank:gi:30044131 Probab=28.00 E-value=1.8 Score=19.20 Aligned_cols=321 Identities=13% Similarity=0.087 Sum_probs=125.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhh Q lcl|NC_012784. 40 LEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDI 119 (415) Q Consensus 40 ~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 119 (415) +.+ +. ..+++ .++=....+..... . .+... .....+. .. +.+ ..+ .+.. T Consensus 1 ~~~-----~~----~~e~l---~~kw~p~l~~~~~~------~--~~~~~----a~llenq-~~-~~~---~~l--~e~~ 49 (523) T protein:vir:59 1 MSQ-----PK----INEQL---IEKWQPLLEGCRND------W--ERHTL----ATLLENQ-YR-EAK---KHL--METT 49 (523) T ss_pred CCc-----ch----hhHHH---HHhhhhhhcccCCh------h--HHHHH----HHHhhhh-hH-HHH---Hhh--hhhh Confidence 100 00 01111 11111111100000 0 00000 0000000 00 000 000 0000 Q ss_pred hhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEEccCCceeEEEEee-----cCC------------------- Q lcl|NC_012784. 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQ-----SEV------------------- 175 (415) Q Consensus 120 ~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~-----~~~------------------- 175 (415) ..+.. .+-++++| +++..-+...-.+...|+||++++|-+=-.+. .+. T Consensus 50 ~~~~~--~~~~~~~~------~v~r~~p~l~a~DIWGVQPMTGPTGLIFAMRSRY~~q~gteA~yg~~~~~~~~a~~~~~ 121 (523) T protein:vir:59 50 QTTEV--DGWNLALP------IVRRVFANLRATDLVSVQPLSLPTGLVFYLDFKSPELPGNGSVYGGTGLTTDTATGGLY 121 (523) T ss_pred hcccc--ccccchhh------hhhhHhhhhhhhhccccccCCCCcceeEEEEeeccCCCCcccccCccccCccccccccc Confidence 01111 11112222 33444444445566677777766654211110 000 Q ss_pred -cccc---------------------------------------------------cccc-------------------- Q lcl|NC_012784. 176 -AALE---------------------------------------------------KVEE-------------------- 183 (415) Q Consensus 176 -~~a~---------------------------------------------------~v~E-------------------- 183 (415) +.+. -+++ T Consensus 122 ean~~~s~~~~~~~~~~d~~~sg~~~~~~~a~stg~A~a~~s~si~k~~vTa~s~agta~~~li~A~~~q~itg~tga~f 201 (523) T protein:vir:59 122 DENARLSRREYETTITVDLATAQQATMRDVGFDTGIASLVSSGAVYYVDVPVASLPGVADVNTVRFWQYDDASGDPENTV 201 (523) T ss_pred ccccccccccccCccCCCcccccccccccccccccchhhccccceeeeeccccccccccccccccccccccccccccccc Confidence 0000 0000 Q ss_pred --------------------------ccc------------------------------------ccccccccceeeEee Q lcl|NC_012784. 184 --------------------------LEE------------------------------------NPELAVKPFFQLAYD 201 (415) Q Consensus 184 --------------------------g~~------------------------------------~~~~~~~~f~~v~~~ 201 (415) +.. ....+...|.+..+. T Consensus 202 a~s~~~an~astAss~Al~gEA~t~~sTd~at~~~Gtt~t~~~~~lyt~~~g~~t~~~~~~~~~~~~~~~~~~~~eM~Fs 281 (523) T protein:vir:59 202 AYPLPRYNRIVGAVGSALYARLFFVTGSDFATVAGGTPSTQDLDLVYYIDARNDFEDQSTDPDYPDPGFQSLDIPEINLE 281 (523) T ss_pred cchhhccccccccccccccccccccccccccccCCCcccccccccccccccccchhhccccccccccccccccccceeeE Confidence 000 000111234455555 Q ss_pred eeeEEEe-------ehhhHHHHhcc-----hHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccccccc-cc- Q lcl|NC_012784. 202 INTHRGY-------FRISREAIEDA-----KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKK-LE- 267 (415) Q Consensus 202 ~~k~a~~-------~~iS~e~l~ds-----~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~-~~- 267 (415) ..|..+- ..+|-||.+|- .+|.++.|.+-|+..|...++++||.-.-+....+...++...+.. .. T Consensus 282 IeK~tVtAkSRaLKAeYT~ELAQDLKAiH~GLDAE~ELanILStEImlEINR~ii~~~~~~a~~~~~~~~~~~g~~~~~~ 361 (523) T protein:vir:59 282 LRSRPVATKTRKLRAAWTPEAMQDLAAYHKGVDLENEIVTLMSQYIAREIDLEILSTIMAHARRTDNYGFWSEVVGEYYD 361 (523) T ss_pred EEeEEEeeecccccccccHHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhhhheeeeeccccccceeeecc Confidence 5555543 47899999982 4679999999999999999999999654332221111111111110 00 Q ss_pred ccc-----h-------hhHHHH----HHHHHHhhh--hccCCCEEEEcHHHHHHHHHhhccCCcccccCcccCC-CCcee Q lcl|NC_012784. 268 VKK-----A-------KSLDDI----KDAINLNVK--PNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEK-TQQRL 328 (415) Q Consensus 268 ~~~-----~-------~~~~~~----~~~~~~~~~--~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~-~~~~l 328 (415) ... . -.+..+ -+..+.+.. .....+.+++++.....|...---+++.......++. ..++| T Consensus 362 ~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~n~i~~~t~~~~~~~~~~s~~v~~~l~~~~~~~~~~~~~~~~~~~~~~g~l 441 (523) T protein:vir:59 362 ETSGNFVAGNFYGSKQEWLATLMIELNKVSNRIQQKTAVAGANFLVTSPQVAALLESMPGFTPGNDNRDGGTGIFYVGMV 441 (523) T ss_pred cccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEchhHHHHHHhccccccCCccccccccceeEEEe Confidence 000 0 001111 122222222 2235788999999999986421111111111111111 12344 Q ss_pred c-ceeeEEeccccccccCCceEEEech------hhcEEEEeecceEE--EEeecccCceEEEEEEEeccEEecccc--EE Q lcl|NC_012784. 329 L-GAKIEILPDEVLGQKGNNTLIIGNL------KDAIVLFDRSQYQA--SWTDYMHFGECLMIAVRQDCRILDYKS--AI 397 (415) Q Consensus 329 ~-G~pV~~~~~~~~~~~~~~~~~~gd~------~~~~~~~~~~~~~i--~~~~~~~~~~~~~~~~r~d~~v~~p~a--~~ 397 (415) . ||+|.+..+.+. ..+++|.= .........-.+.. ...|-..|+-.+-...|+++.|.+|-+ .. T Consensus 442 ~~~~~vy~d~~~~~-----dy~~~g~k~~~~~~~~~~~y~Py~~l~~~~~~~dp~s~qp~~~~~tRY~l~v~nP~~~~~~ 516 (523) T protein:vir:59 442 QGRYRLYKNIYQNQ-----PVIIMGNQDLNTPWQTGAVYAPYVPLLFTPTIVDPVNFSYRRGLMTRYALEVVRPEFYGLL 516 (523) T ss_pred cCceEEEecCCCCc-----ceEEEEecccCCcccccceecccchhhcccccccCCcccceeeeeeehhheecchhHhhhh Confidence 4 567777776543 24444421 10011011111100 112335566667777899999999964 44 Q ss_pred EEEeecC Q lcl|NC_012784. 398 VIEYDDS 404 (415) Q Consensus 398 ~~~~t~~ 404 (415) ++++-.+ T Consensus 517 ~~~~~~~ 523 (523) T protein:vir:59 517 YVKLLQP 523 (523) T ss_pred hhhhcCC Confidence 4454444 No 267 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=27.25 E-value=1.9 Score=19.10 Aligned_cols=335 Identities=7% Similarity=0.001 Sum_probs=125.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccchhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhh Q lcl|NC_012784. 41 EQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQ 120 (415) Q Consensus 41 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 120 (415) ++........| .. +.......... ... .......... +-................+.... T Consensus 1 ~~~~~~~~~~~---~~-------------~~~~~~~~~~~-~~~-~~~~~~l~~~--gi~~~~~~~~~~~~~~~amd~~~ 60 (379) T protein:vir:10 1 MPQISKIHSSL---NA-------------RQMTQMVMDSA-DVT-LDNLKHLESY--GIHLNGRKNKLFELMQFAMDSND 60 (379) T ss_pred CCCcceeeeec---Cc-------------cccchhhhccc-ccc-HHHHHHHHhc--Cccccchhhhhhhhhhhhhcccc Confidence 00000000000 00 00000000000 000 0000000000 00000000000000000111110 Q ss_pred hcc------cccccceeecch---hHHhHHHHHHhhhhhhhhcceeEEccCCc-eeEEEEeecCCccccccccccccccc Q lcl|NC_012784. 121 GGS------LKTDSGFVVIPE---EIVTDILKLKEVEFNLDKYVTVKRVTNGS-GKYPVVRQSEVAALEKVEELEENPEL 190 (415) Q Consensus 121 ~~~------~~~~~~~~~vP~---~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~a~~v~Eg~~~~~~ 190 (415) .+. .....+...+|. .+.+.+++.+..-.....++.+.....-. ..+.+........+.+++-+...|-. T Consensus 61 ~~~~~~~~~~l~~~~~~g~~~~l~~~~p~~i~~~tap~~a~~l~pv~t~g~W~~~~~~~~v~e~~G~A~~ygd~~d~pl~ 140 (379) T protein:vir:10 61 IGPIPTPLSPLSPVSIPGLIQFLQNWLPGHVRILTAVREADEFLGLSTVGQWDDEQIVQRVLEGLGTAQPYTDGGNMALM 140 (379) T ss_pred ccccccccCccccccccchHHHHHhhcchHHHHHhhhhhhhhhcccccCCCceeeeEEEeeeeeeeeeEEeccccCCCee Confidence 010 000111112333 23356666665555555555554321111 12233333444566677777777643 Q ss_pred ccccceeeEeeeeeEEEeehhhHH-HHh--cchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc--cccccccc---- Q lcl|NC_012784. 191 AVKPFFQLAYDINTHRGYFRISRE-AIE--DAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTG--STSSGFEK---- 261 (415) Q Consensus 191 ~~~~f~~v~~~~~k~a~~~~iS~e-~l~--ds~~~l~~~l~~~la~~~~~~~d~~il~g~g~~~~~--~~~~~~~~---- 261 (415) ........-..+.+...+.++.. +.. -...++.+--....++++.+.+|+-.+.|.+..... +...+... T Consensus 141 -d~~~~~~~r~v~~~~~g~~yg~~El~~Aa~~g~~l~~~Ka~aA~~ale~~~N~i~f~G~~d~~~~~yGllNdP~l~a~~ 219 (379) T protein:vir:10 141 -SWTPTFETRTVVRFEAGLQVAPLEEARSSRVQVSSADEKRAMVGEALEVQRNRVAFYGYNDGSGRTFGFLNDPNLPAYV 219 (379) T ss_pred -eeeeeeeeeeeEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCCCcceEEEEeCCCCcccc Confidence 33444445555666666777644 322 234467888888888888888888888885332211 11111110 Q ss_pred ---cc-cccc----ccchhhHHHHHHHHHHhhhhcc-------CCCEEEEcHHHHHHHHHhhccCCcccccCcccCCCCc Q lcl|NC_012784. 262 ---EG-KKLE----VKKAKSLDDIKDAINLNVKPNY-------EHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQ 326 (415) Q Consensus 262 ---~~-~~~~----~~~~~~~~~~~~~~~~~~~~~~-------~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~ 326 (415) .+ .... .+...-++|+..++..+..... .+..++|.|..+..|..- +..|.-++. -+.. T Consensus 220 t~atg~~~~t~Wa~kT~~eI~~Di~~~~~~l~~qs~g~~~~~~~~~tL~LP~~~~~~L~~~-n~~g~Tvl~-~lk~---- 293 (379) T protein:vir:10 220 AVPNGAGGSPLWAQKTTLEIIADLRNGLTALQVQSMGRIKSNKTPITIGIPNAYENYITTP-TELGYSVAQ-YMRE---- 293 (379) T ss_pred cccCCcccccccccCCHHHHHHHHHHHHHHHHHhhCCeecccccceeEEecHHHHHhhccc-cccCccHHH-HHHH---- Confidence 00 0001 1122235666666666553322 233789999998888642 332322221 1111 Q ss_pred eecceeeEEeccccc-cccCCceEEEechhhcEEEEeecceEEEE-eeccc-----CceE--EEEEEEe-ccEEeccccE Q lcl|NC_012784. 327 RLLGAKIEILPDEVL-GQKGNNTLIIGNLKDAIVLFDRSQYQASW-TDYMH-----FGEC--LMIAVRQ-DCRILDYKSA 396 (415) Q Consensus 327 ~l~G~pV~~~~~~~~-~~~~~~~~~~gd~~~~~~~~~~~~~~i~~-~~~~~-----~~~~--~~~~~r~-d~~v~~p~a~ 396 (415) .+.++.++..+.+.. ++.++...+|.+-...-...+.+.+...+ .++.. .... .-...|. |+.+.+|.|| T Consensus 294 n~Pnl~i~t~pEL~~aggg~~~~~~~~~~~~~~~t~~~~~~~~~~p~k~~~l~ve~~~~~~~~~~~~rt~Gv~ir~P~Ai 373 (379) T protein:vir:10 294 SYPNVTFVSAPELNDANGGSSAIYYYADAVENNGTDDGRTWLQVVPTKMFTLGVEKKIKGYAEGYTNATAGAMLKRPFAT 373 (379) T ss_pred hcCCcEEEEcccccccCCCccEEEEEeeccCCCccCCcceEEEecchhhhhccceecCceeEeccccceeeeeeecchhh Confidence 122334444444322 22222333443211100000000011111 01100 0001 1222454 4467789999 Q ss_pred EEEEee Q lcl|NC_012784. 397 IVIEYD 402 (415) Q Consensus 397 ~~~~~t 402 (415) ++++=. T Consensus 374 ~~~~G~ 379 (379) T protein:vir:10 374 YRQTGA 379 (379) T ss_pred heecCC Confidence 997633 No 268 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=26.07 E-value=2 Score=18.95 Aligned_cols=349 Identities=9% Similarity=0.044 Sum_probs=97.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccc----ccchh Q lcl|NC_012784. 6 ELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVE----VNEAR 81 (415) Q Consensus 6 el~~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~ 81 (415) -|+++|.++++++.+..+++....++.+..--++-.+++++++++++.++++++++.+.....+....... ..... T Consensus 1 M~~~~l~el~~~l~e~~~~i~~~~~e~~~~~~~~~~~~~~~l~~eie~l~~ei~~l~~~~~~~e~~~e~~~~~~~~~~~~ 80 (394) T protein:vir:97 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEV 80 (394) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHhhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Confidence 55678999999998888887776655544333445677888999999999888887765544433222111 11111 Q ss_pred hhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhh-hhhhc----ccccccceeecc---hhHHhHHHHHHhhhhhhhh Q lcl|NC_012784. 82 TYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRN-DIQGG----SLKTDSGFVVIP---EEIVTDILKLKEVEFNLDK 153 (415) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~----~~~~~~~~~~vP---~~~~~~Ii~~~~~~~~l~~ 153 (415) ............ .................... ..... .......+.... ..+-.++... +.. T Consensus 81 ~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~liP~~~~~~------ii~ 150 (394) T protein:vir:97 81 TQEEKTYRESVN----DFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYT------PAR 150 (394) T ss_pred chhhHHHHHHHH----HHHHHHHHHhhhhhhhhhHHHHHHHHHhhhhhhhhccccccccccccChHHHHHH------HHH Confidence 111111111111 11111111111000000000 00000 000000011100 0111122211 122 Q ss_pred cce-eEEccCCceeEEEEeecCCcccccccc--cc----cccccccc-cceeeEeeeeeEEEeehhhH-HHHhcchHHHH Q lcl|NC_012784. 154 YVT-VKRVTNGSGKYPVVRQSEVAALEKVEE--LE----ENPELAVK-PFFQLAYDINTHRGYFRISR-EAIEDAKVNVL 224 (415) Q Consensus 154 ~~~-~~~~~~~~~~~~~~~~~~~~~a~~v~E--g~----~~~~~~~~-~f~~v~~~~~k~a~~~~iS~-e~l~ds~~~l~ 224 (415) ++. ..++...-..+++ .++ ...+... +. -..+.... .....++..-.+... .+.- --+.+.-.+.. T Consensus 151 ~~~~~~~l~~~~~~~~~---~~~-~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~l~~~-k~~~~i~is~ell~ds 225 (394) T protein:vir:97 151 EVKTVVDLKPFTTVYQA---KKA-SGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNID-TYRGAIPLSQESIDDA 225 (394) T ss_pred Hhhhhhhhhhhceeeec---cCc-ceEEEEEecCCCccceecccccccccccccceeEEeehh-heeeehhhHHHHHhhh Confidence 111 1111111111221 111 1112111 11 11111111 011122222222111 1110 01111111111 Q ss_pred HH-HHHHHHHHHHHHHHHHHhhccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccCCCEEEEcHHHHHH Q lcl|NC_012784. 225 QE-LKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAK 303 (415) Q Consensus 225 ~~-l~~~la~~~~~~~d~~il~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~ 303 (415) .+ +...+...++..+...+-.... .+ .+.....+....-+. ... T Consensus 226 ~~~~~~~i~~~la~~~~~~~~~~i~--------~g-------~~~~~~~~~~~~~~~--------------------~~~ 270 (394) T protein:vir:97 226 DVDLVGIVSESISQIKVNTTNDAIA--------KV-------LKSFTTKTVKNLDEI--------------------KAL 270 (394) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHh--------hc-------cccccccccccHHHH--------------------HHH Confidence 11 3344444444444433211110 00 111122221111111 111 Q ss_pred HHHhhcc--CCcccccCcccCCCCce---ecceeeEEeccccccccCCceEEEechhhcEEEEee---cceEEEEeeccc Q lcl|NC_012784. 304 LDKMKDK--LGNYLIQPDVKEKTQQR---LLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDR---SQYQASWTDYMH 375 (415) Q Consensus 304 l~~lkd~--~G~~l~~~~~~~~~~~~---l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~---~~~~i~~~~~~~ 375 (415) +....+. +..+++.+.... .-.. -.|.|++..+ . ..+....++|-. +...+. ....+-+-|+.. T Consensus 271 ~~~~~~~~~~a~~v~n~~~~~-~l~~lkd~~G~~i~~~~-~---~~~~~~~l~G~p---v~~~~~~~~~~~~~~~gd~~~ 342 (394) T protein:vir:97 271 LNGGFDPAYNVSLIVSQSFYQ-TLDTLKDGNGRYLLQDD-I---TAVSGKVLLGKP---VFVLSDEVLGANKAFIGDFKR 342 (394) T ss_pred HHhhhhhhhCCEEEEcHHHHH-HHHHhhccCCCeeeecC-c---CCCCCceeccce---eEEecccccCCccEEEeeccc Confidence 1111111 112332211100 0000 1377766432 1 111111222211 111000 001111122111 Q ss_pred CceEEEEEEEeccEEe------cc---ccEEEEEeec-CCCCc--ccccccC Q lcl|NC_012784. 376 FGECLMIAVRQDCRIL------DY---KSAIVIEYDD-SERGE--GDLGLEA 415 (415) Q Consensus 376 ~~~~~~~~~r~d~~v~------~p---~a~~~~~~t~-~~~~~--~~~~~~~ 415 (415) + +....|-+..+. +. .++.++.... -+... ..++.++ T Consensus 343 ~---~~~~~~~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~ 391 (394) T protein:vir:97 343 G---VLFADRKDLGLRWADNEIYGQYLQAVLRFGVSKVDDKAGYYVTFTPEP 391 (394) T ss_pred c---EEEEEecceEEEEecccccceeEEEEEEEccEEecccceEEEEecccc Confidence 1 011111111110 01 1122222111 11111 1222222 No 269 >protein:vir:94870 Length: 318 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762518;genbank:gi:115304217;genbank:GeneID:5141183 Probab=20.79 E-value=2.7 Score=18.21 Aligned_cols=303 Identities=12% Similarity=0.111 Sum_probs=121.6 Q ss_pred hhhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHhhhhhhhhcccccccceeecchhHHhHHHHHHhhhhhhhhcceeEE Q lcl|NC_012784. 80 ARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR 159 (415) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~ 159 (415) ..+.-+....-..+...........+....+.. .....+.+.++.-..+|..+...|-..+...+|+.....+.+ T Consensus 1 mtnfiesqnavteffdvlkknsgkseiknawna-----klaengvtitdttfqlprklvesintallntnpvfkvfhvtn 75 (318) T protein:vir:94 1 MTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNA-----KLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTN 75 (318) T ss_pred CccchhhhhhHHHHHHHHhcccChhhhhhhhhh-----hhhhCCceeecchhhhHHHHHHhhhhhhccCCcceeeeeehh Confidence 001111111111111111111122222221110 011223344445566788888888888888888877665544 Q ss_pred ccCCceeEEEEee-cCCcccccccccccccccccccceeeEeeeeeEEEeehhhHH--HHhcchHHHHHHHHHHHHHHHH Q lcl|NC_012784. 160 VTNGSGKYPVVRQ-SEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISRE--AIEDAKVNVLQELKLWMARTIA 236 (415) Q Consensus 160 ~~~~~~~~~~~~~-~~~~~a~~v~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e--~l~ds~~~l~~~l~~~la~~~~ 236 (415) +. -+-+.+. .+...+.....|+.+.+. ..++..-++.|--++.+..+... -|++|-..+...|..+|..++. T Consensus 76 vg----allvsrsfdssneaqvhkdgqtkteq-aatltidtlepvmvyklqslaervkrlqmsyselynlivaeltqaiv 150 (318) T protein:vir:94 76 VG----ALLVSRSFDSSNEAQVHKDGQTKTEQ-AATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIV 150 (318) T ss_pred hh----heeeeccccccchhhhhccccccccc-ceeeeecccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHH Confidence 43 2223222 333455556677776653 35566666666544444433333 2445544577777777877777 Q ss_pred HHH-HHHHhhccccccccccccccccc-----cccccccchhhH-HHHHHHHHHhhhhccCCCEEEEcHHH-HHHHHHhh Q lcl|NC_012784. 237 ATR-NKAIIDVITKGSTGSTSSGFEKE-----GKKLEVKKAKSL-DDIKDAINLNVKPNYEHNVAIVSQTM-FAKLDKMK 308 (415) Q Consensus 237 ~~~-d~~il~g~g~~~~~~~~~~~~~~-----~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~v~~~~~-~~~l~~lk 308 (415) .++ |-+++-|+|+.+-..+....... .....+.+...+ |.+..+..-..+...+ -..++...+ .+.|..|+ T Consensus 151 nkivdlalvegdgtngfksidkeadvkkikkittkaksagktpfadaieeavdfvrptagr-rylivktedrkalldelr 229 (318) T protein:vir:94 151 NKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGR-RYLIVKTEDRKALLDELR 229 (318) T ss_pred hhhhheeeeecCCcchhhhhchhhhHHHHHHhhhhhhhcCCCchhHHHHHHHhhhccCCCc-eEEEEeccchHHHHHHHH Confidence 776 55677788876543332111100 011112222233 3333333332222222 234444444 34455555 Q ss_pred ccCCcc--cccCcccCCCCceeccee-eEEeccccccccCCceEEEechhhcEEEEeecceE-EEEeecccCceEEEEEE Q lcl|NC_012784. 309 DKLGNY--LIQPDVKEKTQQRLLGAK-IEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQ-ASWTDYMHFGECLMIAV 384 (415) Q Consensus 309 d~~G~~--l~~~~~~~~~~~~l~G~p-V~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~-i~~~~~~~~~~~~~~~~ 384 (415) -+..+- -+.++-+.- ..--|.. +++.. ++..-++-++-|-+ |.+ +-++++ ++.....++..-+.++. T Consensus 230 qatananvriknddtei--asevgvdeiivyt----gskavkptvlvdqk--yhi-dmqdltkvdafewktnsnmilvet 300 (318) T protein:vir:94 230 QATANANVRIKNDDTEI--ASEVGVDEIIVYT----GSKAVKPTVLVDQK--YHI-DMQDLTKVDAFEWKTNSNMILVET 300 (318) T ss_pred hhhcccceEEeccchhh--hhhcCcceeEEee----ccccccceeEeccc--eec-chhhhhhhhceeeccCCceEEEEe Confidence 433221 111111100 0001111 11111 11111112222211 221 222222 12222223322233443 Q ss_pred EeccEEeccccEEEEEee Q lcl|NC_012784. 385 RQDCRILDYKSAIVIEYD 402 (415) Q Consensus 385 r~d~~v~~p~a~~~~~~t 402 (415) .-.|-|..-+|-+.++++ T Consensus 301 ltsghvetynagavitvs 318 (318) T protein:vir:94 301 LTSGHVETYNAGAVITVS 318 (318) T ss_pred cccCcceeecCceeEEeC Confidence 444444444555555555 No 270 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=20.26 E-value=2.8 Score=18.13 Aligned_cols=374 Identities=11% Similarity=0.044 Sum_probs=102.2 Q ss_pred HH-HHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccchhhhhh Q lcl|NC_012784. 7 LQ-SEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRN 85 (415) Q Consensus 7 l~-~~l~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 85 (415) |. ++|.+.++++.++.+++.+...+.+.- -++-.++++.|+++++.++.+++++++............+......... T Consensus 1 M~i~eL~e~r~~~~~~~~~l~~~~~e~~~l-t~ee~~~~~~l~~ei~~l~~~I~~~e~~~~~~~~~~~~~~~~~~~~~~~ 79 (435) T protein:vir:14 1 MNVNELRRERAAVNQRVQALAQIEVGGTAL-SVEQQAEFDQLSSKFSELTAQIERAEAAERMAAAAAVPVDPNPTAVAAP 79 (435) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHhccCCC-CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccchhhhhhhc Confidence 32 444444444544444444433222211 1223578899999999999888776654433322222222111111110 Q ss_pred HH-HHHH--HHHHHHHhhhhhHHHHHH-------HHHHhhhhhh-----hhcccccccceeecchhHHhHHHHHHhhhhh Q lcl|NC_012784. 86 QA-NIND--LGISIQNTKVTSQEVRDF-------TEYLETRNDI-----QGGSLKTDSGFVVIPEEIVTDILKLKEVEFN 150 (415) Q Consensus 86 ~~-~~~~--~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~-----~~~~~~~~~~~~~vP~~~~~~Ii~~~~~~~~ 150 (415) .. .... ........... ...... ....+..... .....+....+. =...+-.++.. . T Consensus 80 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~-gg~~vP~~~~~------~ 151 (435) T protein:vir:14 80 AAAPVHAQPKALEVKGAKMA-RMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGA-GGVLVPENLSS------E 151 (435) T ss_pred cccccccccchhhhhHHHHH-HHHHHHHhhcchhhHHHHHHHhhhhhhhhhhhcccCCcCC-CccccchhHHH------H Confidence 00 0000 00000000000 000000 0000000000 000001111000 00011111111 1 Q ss_pred hhhcceeEEccCCceeEEEEeecCCcccccc---cccccccccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHH-- Q lcl|NC_012784. 151 LDKYVTVKRVTNGSGKYPVVRQSEVAALEKV---EELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQ-- 225 (415) Q Consensus 151 l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v---~Eg~~~~~~~~~~f~~v~~~~~k~a~~~~iS~e~l~ds~~~l~~-- 225 (415) +..++.....-..-+...++...+....... ++..-..+....+-...++....+...-.-..--+.+.-.+... T Consensus 152 ii~~l~~~~~i~~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~ 231 (435) T protein:vir:14 152 VIELLRPKSVVRKLGARTLPLSNGNITIPRLKGGAIVGYIGADTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAGVN 231 (435) T ss_pred HHHHHhhhchhhhhcceeeecCCCceEEEEEeCCcceeeeccCccccccccceeEEEeeeEEEEEeehhhHHHHHhhccC Confidence 2222211111100011122222221111111 12122334433444455555555443322222122222221111 Q ss_pred -HHHHHHHHHHHHHHHHHHhhcccccccc-ccccccccccccccccch---hhHHHHHHHHHHhhhhccCCCEEEEcHHH Q lcl|NC_012784. 226 -ELKLWMARTIAATRNKAIIDVITKGSTG-STSSGFEKEGKKLEVKKA---KSLDDIKDAINLNVKPNYEHNVAIVSQTM 300 (415) Q Consensus 226 -~l~~~la~~~~~~~d~~il~g~g~~~~~-~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~v~~~~~ 300 (415) -|.+.|...++.++...+-...-.|.+. ..+.+............. .+.+.+...+..+ T Consensus 232 ~~l~~~i~~~l~~ai~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~l---------------- 295 (435) T protein:vir:14 232 PNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPSNVITASDASTLQKIETDLGKV---------------- 295 (435) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccceeccccccchhhHHHHHHHH---------------- Confidence 2556666677777766654443333221 123332222111111111 1111212221111 Q ss_pred HHHHHHhhccC-----CcccccCcccC--CCCceecceeeEEeccccccccCCceEEEechhhcEEEEeecceEEEEeec Q lcl|NC_012784. 301 FAKLDKMKDKL-----GNYLIQPDVKE--KTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDY 373 (415) Q Consensus 301 ~~~l~~lkd~~-----G~~l~~~~~~~--~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~ 373 (415) +..++... ..+++.+.... .....=.|.|++.. ...+.....++++-++-..-.........+-+-++ T Consensus 296 ---~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~--~~~g~l~G~Pv~~~~~~p~~~~~~~~~~~i~~gd~ 370 (435) T protein:vir:14 296 ---ILALENADANLTQPGWIMAPRTFRFLEGLRDGNGNKVYPE--LANGMLKGYPVGKTTQVPINLGETGKESEIYFTDF 370 (435) T ss_pred ---HHHhhhccccccCCEEEEcHHHHHHHHHhhccCCceeccC--CCCCeeecceeEeeccccccccCCCccceEEEeec Confidence 22233222 12222211100 00001135555421 11111101111111100000000000011222222 Q ss_pred ccCceEEEEEEEeccEEec-c----------------------ccEEEEEeecCCCCcccccccC Q lcl|NC_012784. 374 MHFGECLMIAVRQDCRILD-Y----------------------KSAIVIEYDDSERGEGDLGLEA 415 (415) Q Consensus 374 ~~~~~~~~~~~r~d~~v~~-p----------------------~a~~~~~~t~~~~~~~~~~~~~ 415 (415) ..+. +..|-+..+.. + .++.++.+ ....+.+=..++. T Consensus 371 s~~~----i~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~-~~~~~~a~~~l~~ 430 (435) T protein:vir:14 371 GDVF----IGEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDF-GPRHVESIAVLAG 430 (435) T ss_pred ccEE----EEEecccEEEEeccccccccccchhhhhhcChhheeeeeeeCc-eeecccceEEEec Confidence 2211 11122221110 1 11112111 1111222111111 Done!