Query lcl|NC_019933.2_cdsid_YP_007238078.1 [gene=G176_gp08] [protein=putative head protein] [protein_id=YP_007238078.1] [location=5760..6944] Match_columns 394 No_of_seqs 134 out of 1152 Neff 10.0 Searched_HMMs 1612 Date Thu Nov 7 17:34:56 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_8 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_8_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:81070 Length: 390 100.0 3.7E-75 2.3E-78 428.6 42.3 388 1-389 1-390 (390) 2 protein:vir:97053 Length: 390 100.0 8.2E-75 5.1E-78 426.8 41.9 388 1-389 1-390 (390) 3 protein:vir:10364 Length: 390 100.0 1.4E-74 8.7E-78 425.5 42.3 388 1-389 1-390 (390) 4 protein:vir:100135 Length: 418 100.0 3.6E-73 2.3E-76 417.7 42.3 392 1-394 21-418 (418) 5 protein:vir:191 Length: 385 # 100.0 8.6E-73 5.4E-76 415.7 41.9 384 1-392 1-385 (385) 6 protein:vir:1886 Length: 385 # 100.0 8.6E-73 5.4E-76 415.7 41.9 384 1-392 1-385 (385) 7 protein:vir:4339 Length: 395 # 100.0 3.9E-71 2.4E-74 406.6 41.8 385 1-391 1-395 (395) 8 protein:vir:485 Length: 407 # 100.0 6.5E-67 4.1E-70 383.4 38.4 381 1-394 1-403 (407) 9 protein:vir:4456 Length: 401 # 100.0 3.5E-66 2.2E-69 379.4 38.3 376 1-391 5-401 (401) 10 protein:vir:4953 Length: 397 # 100.0 1.4E-65 8.4E-69 376.2 39.4 372 1-394 1-388 (397) 11 protein:vir:101650 Length: 497 100.0 2.6E-65 1.6E-68 374.6 40.6 393 1-394 1-496 (497) 12 protein:vir:7855 Length: 497 # 100.0 2.6E-65 1.6E-68 374.6 40.6 393 1-394 1-496 (497) 13 protein:vir:105038 Length: 428 100.0 2.1E-65 1.3E-68 375.1 38.8 384 1-391 1-428 (428) 14 protein:vir:4997 Length: 397 # 100.0 1.7E-64 1E-67 370.2 39.1 372 1-394 1-388 (397) 15 protein:vir:1328 Length: 392 # 100.0 1.9E-64 1.2E-67 369.9 39.2 383 1-392 1-392 (392) 16 protein:vir:8102 Length: 543 # 100.0 2.4E-64 1.5E-67 369.3 39.6 385 1-392 142-543 (543) 17 protein:vir:1433 Length: 435 # 100.0 2.5E-64 1.5E-67 369.3 39.3 386 1-393 1-435 (435) 18 protein:vir:102119 Length: 404 100.0 3.7E-64 2.3E-67 368.3 39.7 382 1-394 1-403 (404) 19 protein:vir:4830 Length: 397 # 100.0 3.1E-64 1.9E-67 368.8 39.1 372 1-394 1-388 (397) 20 protein:vir:6242 Length: 390 # 100.0 2.4E-64 1.5E-67 369.4 38.0 382 1-392 1-390 (390) 21 protein:vir:4511 Length: 409 # 100.0 3.6E-64 2.2E-67 368.4 38.7 387 1-394 1-409 (409) 22 protein:vir:80376 Length: 435 100.0 5.3E-64 3.3E-67 367.5 39.6 386 1-393 1-435 (435) 23 protein:vir:81227 Length: 413 100.0 2E-63 1.2E-66 364.3 42.1 391 1-394 2-413 (413) 24 protein:vir:1025 Length: 408 # 100.0 5.7E-64 3.5E-67 367.3 37.8 373 1-394 4-396 (408) 25 protein:vir:7409 Length: 408 # 100.0 9.9E-64 6.1E-67 366.0 38.2 376 1-394 4-396 (408) 26 protein:vir:94673 Length: 419 100.0 5.6E-63 3.5E-66 361.9 41.4 388 1-393 4-419 (419) 27 protein:vir:79987 Length: 415 100.0 6.4E-63 4E-66 361.6 40.1 386 1-394 1-407 (415) 28 protein:vir:98339 Length: 415 100.0 6.4E-63 4E-66 361.6 40.1 386 1-394 1-407 (415) 29 protein:vir:81100 Length: 415 100.0 6.4E-63 4E-66 361.6 40.1 386 1-394 1-407 (415) 30 protein:vir:4600 Length: 415 # 100.0 1.7E-62 1.1E-65 359.2 41.2 386 1-394 1-407 (415) 31 protein:vir:4700 Length: 415 # 100.0 1.7E-62 1.1E-65 359.2 41.2 386 1-394 1-407 (415) 32 protein:vir:100247 Length: 425 100.0 9.9E-64 6.1E-67 366.0 34.2 372 1-392 21-425 (425) 33 protein:vir:9410 Length: 415 # 100.0 2.7E-62 1.7E-65 358.1 40.3 386 1-394 1-407 (415) 34 protein:vir:3991 Length: 404 # 100.0 3.2E-62 2E-65 357.7 38.4 373 1-394 4-396 (404) 35 protein:vir:101607 Length: 379 100.0 5.4E-62 3.4E-65 356.5 39.6 366 1-391 1-379 (379) 36 protein:vir:81160 Length: 371 100.0 5.2E-62 3.2E-65 356.6 39.3 353 1-391 1-371 (371) 37 protein:vir:95376 Length: 425 100.0 7.7E-62 4.8E-65 355.6 38.7 385 1-394 3-424 (425) 38 protein:vir:105004 Length: 392 100.0 7.4E-61 4.6E-64 350.2 39.1 367 1-394 1-387 (392) 39 protein:vir:107593 Length: 392 100.0 7.4E-61 4.6E-64 350.2 39.1 367 1-394 1-387 (392) 40 protein:vir:102873 Length: 392 100.0 7.4E-61 4.6E-64 350.2 39.1 367 1-394 1-387 (392) 41 protein:vir:102082 Length: 392 100.0 7.4E-61 4.6E-64 350.2 39.1 367 1-394 1-387 (392) 42 protein:vir:6212 Length: 434 # 100.0 6.2E-61 3.9E-64 350.7 38.3 386 1-394 1-434 (434) 43 protein:vir:3845 Length: 395 # 100.0 1.7E-60 1E-63 348.3 38.1 369 1-394 1-386 (395) 44 protein:vir:1268 Length: 397 # 100.0 7E-60 4.3E-63 344.9 38.5 368 1-391 5-397 (397) 45 protein:vir:2685 Length: 387 # 100.0 1.7E-60 1.1E-63 348.3 33.8 372 1-394 1-384 (387) 46 protein:vir:96978 Length: 387 100.0 1.7E-60 1.1E-63 348.3 33.8 372 1-394 1-384 (387) 47 protein:vir:94424 Length: 387 100.0 1.7E-60 1.1E-63 348.3 33.8 372 1-394 1-384 (387) 48 protein:vir:104256 Length: 458 100.0 1.6E-59 1E-62 342.9 39.0 379 1-391 14-458 (458) 49 protein:vir:1383 Length: 421 # 100.0 1.1E-59 6.5E-63 343.9 37.6 369 1-394 3-386 (421) 50 protein:vir:9361 Length: 402 # 100.0 4.3E-60 2.6E-63 346.1 34.4 372 1-394 16-399 (402) 51 protein:vir:93881 Length: 387 100.0 1.9E-59 1.2E-62 342.5 35.5 371 1-394 1-384 (387) 52 protein:vir:100172 Length: 394 100.0 7.8E-59 4.8E-62 339.1 38.7 365 1-394 1-387 (394) 53 protein:vir:3870 Length: 400 # 100.0 1.1E-58 6.8E-62 338.3 37.9 367 1-392 11-400 (400) 54 protein:vir:100884 Length: 389 100.0 1.4E-58 9E-62 337.7 38.2 367 1-394 1-385 (389) 55 protein:vir:9704 Length: 394 # 100.0 6.2E-58 3.9E-61 334.2 37.8 369 1-394 2-393 (394) 56 protein:vir:8420 Length: 477 # 100.0 7.3E-58 4.5E-61 333.8 35.7 390 1-394 8-475 (477) 57 protein:vir:4092 Length: 390 # 100.0 2.2E-57 1.4E-60 331.2 36.5 354 1-394 1-371 (390) 58 protein:vir:93616 Length: 645 100.0 6.9E-57 4.3E-60 328.5 38.3 385 1-394 193-645 (645) 59 protein:vir:1084 Length: 437 # 100.0 4.5E-57 2.8E-60 329.5 36.1 373 1-394 1-430 (437) 60 protein:vir:962 Length: 397 # 100.0 1E-55 6.5E-59 322.0 35.5 367 1-391 1-397 (397) 61 protein:vir:41 Length: 299 # N 100.0 5E-57 3.1E-60 329.2 27.5 282 106-392 1-299 (299) 62 protein:vir:7771 Length: 330 # 100.0 5.4E-57 3.4E-60 329.1 27.6 290 100-394 1-326 (330) 63 protein:vir:97148 Length: 324 100.0 1.3E-56 8.1E-60 327.0 29.2 303 77-394 1-318 (324) 64 protein:vir:9574 Length: 300 # 100.0 1.4E-56 8.5E-60 326.8 28.1 279 111-391 1-300 (300) 65 protein:vir:80128 Length: 466 100.0 4.7E-55 2.9E-58 318.4 35.8 385 1-394 16-451 (466) 66 protein:vir:9309 Length: 324 # 100.0 3.6E-56 2.2E-59 324.5 29.6 303 77-394 1-318 (324) 67 protein:vir:96392 Length: 324 100.0 4.1E-56 2.5E-59 324.2 28.8 303 77-394 1-318 (324) 68 protein:vir:78830 Length: 324 100.0 4.1E-56 2.5E-59 324.2 28.8 303 77-394 1-318 (324) 69 protein:vir:98635 Length: 377 100.0 4.3E-55 2.7E-58 318.6 30.0 349 1-391 5-377 (377) 70 protein:vir:99749 Length: 324 100.0 2.9E-55 1.8E-58 319.6 28.6 303 77-394 1-318 (324) 71 protein:vir:96223 Length: 324 100.0 2.7E-55 1.7E-58 319.7 28.5 303 77-394 1-318 (324) 72 protein:vir:78640 Length: 352 100.0 1.2E-54 7.3E-58 316.2 31.7 344 1-394 1-349 (352) 73 protein:vir:103955 Length: 324 100.0 3.5E-55 2.2E-58 319.1 28.2 303 77-394 1-318 (324) 74 protein:vir:4856 Length: 293 # 100.0 3.7E-55 2.3E-58 319.0 27.5 272 107-394 1-284 (293) 75 protein:vir:9759 Length: 303 # 100.0 6.1E-55 3.8E-58 317.8 28.1 278 113-391 1-303 (303) 76 protein:vir:1638 Length: 298 # 100.0 1.3E-54 7.9E-58 316.0 28.2 275 115-390 1-298 (298) 77 protein:vir:96762 Length: 632 100.0 1.1E-53 7.1E-57 310.8 32.8 378 1-390 199-632 (632) 78 protein:vir:2344 Length: 397 # 100.0 1E-54 6.5E-58 316.5 26.7 288 98-394 1-309 (397) 79 protein:vir:4226 Length: 326 # 100.0 1.1E-54 7.1E-58 316.3 26.7 299 89-394 1-326 (326) 80 protein:vir:94142 Length: 304 100.0 1.2E-54 7.7E-58 316.1 26.8 281 102-390 1-304 (304) 81 protein:vir:105905 Length: 304 100.0 1.2E-54 7.7E-58 316.1 26.8 281 102-390 1-304 (304) 82 protein:vir:2430 Length: 318 # 100.0 2.2E-54 1.4E-57 314.7 27.8 294 94-394 1-316 (318) 83 protein:vir:5739 Length: 366 # 100.0 2.5E-54 1.6E-57 314.4 27.9 331 53-391 1-366 (366) 84 protein:vir:95963 Length: 395 100.0 6.6E-53 4.1E-56 306.7 35.1 357 1-394 1-379 (395) 85 protein:vir:80684 Length: 315 100.0 2.2E-54 1.4E-57 314.7 26.9 281 111-394 1-309 (315) 86 protein:vir:94771 Length: 298 100.0 3.3E-54 2.1E-57 313.8 27.8 275 115-390 1-298 (298) 87 protein:vir:8187 Length: 311 # 100.0 6E-54 3.7E-57 312.4 27.8 278 113-392 1-311 (311) 88 protein:vir:9509 Length: 381 # 100.0 8.3E-53 5.1E-56 306.1 29.7 342 1-394 1-371 (381) 89 protein:vir:101291 Length: 381 100.0 8.3E-53 5.1E-56 306.1 29.7 342 1-394 1-371 (381) 90 protein:vir:104085 Length: 320 100.0 4.2E-53 2.6E-56 307.7 27.8 295 94-394 1-320 (320) 91 protein:vir:100632 Length: 381 100.0 1.5E-52 9.4E-56 304.7 29.8 342 1-394 3-371 (381) 92 protein:vir:95763 Length: 297 100.0 8.3E-53 5.2E-56 306.1 26.9 281 100-392 1-297 (297) 93 protein:vir:9643 Length: 377 # 100.0 1.4E-51 8.5E-55 299.4 33.0 342 1-391 5-377 (377) 94 protein:vir:78523 Length: 338 100.0 9.8E-52 6.1E-55 300.2 27.5 296 98-394 1-338 (338) 95 protein:vir:78223 Length: 333 100.0 1.2E-51 7.7E-55 299.7 27.9 293 98-392 1-333 (333) 96 protein:vir:99920 Length: 311 100.0 1.8E-51 1.1E-54 298.8 27.1 278 111-391 1-311 (311) 97 protein:vir:2504 Length: 305 # 100.0 2.3E-51 1.5E-54 298.2 25.7 276 111-394 1-301 (305) 98 protein:vir:78350 Length: 383 100.0 3E-50 1.9E-53 292.1 30.6 358 1-394 1-380 (383) 99 protein:vir:4197 Length: 314 # 100.0 2.6E-40 1.6E-43 237.6 25.8 288 100-394 1-314 (314) 100 protein:vir:97397 Length: 517 100.0 2.4E-39 1.5E-42 232.3 29.6 378 1-394 127-517 (517) 101 protein:vir:4159 Length: 315 # 100.0 8.2E-40 5.1E-43 234.9 23.9 291 89-390 1-315 (315) 102 protein:vir:3158 Length: 321 # 100.0 2E-36 1.2E-39 216.3 26.0 297 95-394 1-315 (321) 103 protein:vir:4074 Length: 480 # 100.0 1.8E-34 1.1E-37 205.6 21.1 358 1-394 114-480 (480) 104 protein:vir:9820 Length: 272 # 100.0 4.1E-33 2.5E-36 198.1 24.9 263 111-394 1-272 (272) 105 protein:vir:3033 Length: 272 # 100.0 4.1E-33 2.5E-36 198.1 24.9 263 111-394 1-272 (272) 106 protein:vir:93742 Length: 274 99.9 4E-25 2.5E-28 154.4 22.8 263 111-394 1-272 (274) 107 protein:vir:3613 Length: 272 # 99.9 3.7E-25 2.3E-28 154.5 20.6 261 111-391 1-272 (272) 108 protein:vir:94933 Length: 330 99.9 3E-24 1.9E-27 149.6 18.5 299 90-392 1-330 (330) 109 protein:vir:96123 Length: 274 99.9 4.9E-23 3.1E-26 142.9 22.8 262 111-394 1-273 (274) 110 protein:vir:105334 Length: 276 99.9 3.1E-23 1.9E-26 144.0 21.4 262 111-394 1-273 (276) 111 protein:vir:94494 Length: 274 99.9 7.3E-23 4.5E-26 141.9 22.8 260 111-394 1-272 (274) 112 protein:vir:97433 Length: 274 99.9 7.3E-23 4.5E-26 141.9 22.8 260 111-394 1-272 (274) 113 protein:vir:96833 Length: 275 99.9 7E-23 4.4E-26 142.0 21.3 262 110-394 1-274 (275) 114 protein:vir:80930 Length: 278 99.8 2E-22 1.2E-25 139.5 21.3 268 111-392 1-278 (278) 115 protein:vir:1239 Length: 274 # 99.8 7.3E-22 4.5E-25 136.5 21.7 260 111-394 1-272 (274) 116 protein:vir:95107 Length: 270 99.8 6.7E-22 4.2E-25 136.7 20.9 261 111-394 1-268 (270) 117 protein:vir:95898 Length: 274 99.8 2.3E-21 1.5E-24 133.7 21.7 259 111-394 1-271 (274) 118 protein:vir:96262 Length: 274 99.8 2.3E-21 1.5E-24 133.7 21.7 259 111-394 1-271 (274) 119 protein:vir:79928 Length: 393 99.8 2.9E-21 1.8E-24 133.2 21.0 347 1-394 1-381 (393) 120 protein:vir:93858 Length: 400 99.8 1E-20 6.3E-24 130.2 20.9 376 1-389 8-400 (400) 121 protein:vir:97255 Length: 310 99.8 3E-20 1.9E-23 127.6 22.8 278 111-391 1-310 (310) 122 protein:vir:8324 Length: 410 # 99.8 4.4E-20 2.7E-23 126.7 20.9 362 1-389 1-410 (410) 123 protein:vir:739 Length: 231 # 99.7 1.8E-19 1.1E-22 123.3 17.7 221 145-391 1-231 (231) 124 protein:vir:99424 Length: 360 99.7 9E-18 5.6E-21 114.0 21.6 301 77-393 1-360 (360) 125 protein:vir:102605 Length: 273 99.6 4.1E-16 2.5E-19 105.0 19.8 258 111-391 1-273 (273) 126 protein:vir:105822 Length: 273 99.6 4.1E-16 2.5E-19 105.0 19.8 258 111-391 1-273 (273) 127 protein:vir:7990 Length: 273 # 99.6 4.3E-16 2.7E-19 104.8 19.6 259 111-391 1-273 (273) 128 protein:vir:108211 Length: 318 99.5 1.7E-15 1.1E-18 101.5 17.7 283 107-391 1-318 (318) 129 protein:vir:94622 Length: 341 99.4 1.9E-14 1.2E-17 95.8 16.7 288 102-394 1-340 (341) 130 protein:vir:5974 Length: 324 # 99.4 2.6E-13 1.6E-16 89.6 20.3 265 111-394 1-294 (324) 131 protein:vir:8885 Length: 347 # 99.4 8.6E-14 5.3E-17 92.2 17.2 288 98-392 1-347 (347) 132 protein:vir:2201 Length: 345 # 99.3 6.6E-14 4.1E-17 92.8 15.5 281 100-391 1-345 (345) 133 protein:vir:80213 Length: 334 99.3 6E-14 3.7E-17 93.1 15.2 283 105-393 1-334 (334) 134 protein:vir:102944 Length: 330 99.3 7.7E-13 4.8E-16 87.0 19.4 267 111-394 1-300 (330) 135 protein:vir:94576 Length: 347 99.3 1.9E-13 1.2E-16 90.3 15.9 281 105-391 1-347 (347) 136 protein:vir:6324 Length: 335 # 99.3 6.8E-13 4.2E-16 87.3 18.0 283 100-394 1-331 (335) 137 protein:vir:1583 Length: 351 # 99.3 1.3E-12 8.2E-16 85.7 19.1 266 111-394 1-298 (351) 138 protein:vir:78935 Length: 335 99.3 9.5E-13 5.9E-16 86.5 17.7 283 100-394 1-331 (335) 139 protein:vir:103323 Length: 364 99.2 2.9E-12 1.8E-15 83.9 19.4 283 102-394 1-342 (364) 140 protein:vir:3364 Length: 347 # 99.2 9.7E-13 6E-16 86.4 16.8 283 105-393 1-347 (347) 141 protein:vir:95318 Length: 328 99.2 3.9E-13 2.4E-16 88.6 13.7 226 108-334 1-328 (328) 142 protein:vir:94711 Length: 347 99.2 3.9E-13 2.4E-16 88.6 13.1 278 105-392 1-347 (347) 143 protein:vir:10450 Length: 344 99.2 1.2E-12 7.2E-16 86.0 15.7 277 105-391 1-344 (344) 144 protein:vir:78739 Length: 332 99.2 1E-12 6.5E-16 86.3 15.4 284 102-389 1-332 (332) 145 protein:vir:1541 Length: 347 # 99.1 1.7E-11 1.1E-14 79.6 19.2 283 105-393 1-347 (347) 146 protein:vir:100057 Length: 375 99.1 3E-11 1.9E-14 78.3 20.4 288 100-394 1-373 (375) 147 protein:vir:9927 Length: 295 # 99.1 1E-11 6.4E-15 80.8 17.2 261 111-394 1-291 (295) 148 protein:vir:80180 Length: 381 99.1 3E-11 1.8E-14 78.3 17.8 285 105-394 1-313 (381) 149 protein:vir:103759 Length: 330 99.1 5.5E-12 3.4E-15 82.3 12.7 226 108-334 1-330 (330) 150 protein:vir:106647 Length: 303 99.0 7.9E-11 4.9E-14 76.0 17.1 265 107-394 1-299 (303) 151 protein:vir:99675 Length: 324 99.0 2.8E-11 1.8E-14 78.4 14.2 240 144-394 1-299 (324) 152 protein:vir:3136 Length: 322 # 99.0 3.8E-11 2.3E-14 77.7 14.6 280 111-394 1-321 (322) 153 protein:vir:97031 Length: 402 99.0 3.5E-11 2.1E-14 77.9 13.3 287 102-394 1-336 (402) 154 protein:vir:98525 Length: 331 98.9 6.7E-11 4.2E-14 76.4 13.7 226 108-334 1-331 (331) 155 protein:vir:107826 Length: 331 98.9 6.7E-11 4.2E-14 76.4 13.7 226 108-334 1-331 (331) 156 protein:vir:107388 Length: 331 98.9 6.7E-11 4.2E-14 76.4 13.7 226 108-334 1-331 (331) 157 protein:vir:9875 Length: 296 # 98.9 2.6E-10 1.6E-13 73.2 15.6 265 102-392 1-296 (296) 158 protein:vir:7324 Length: 335 # 98.9 1.3E-10 7.9E-14 74.8 13.1 227 108-335 1-335 (335) 159 protein:vir:7019 Length: 401 # 98.8 1.7E-10 1E-13 74.2 13.5 279 108-394 1-336 (401) 160 protein:vir:93966 Length: 400 98.8 2.1E-10 1.3E-13 73.7 14.0 373 1-389 8-400 (400) 161 protein:vir:103285 Length: 296 98.8 2.4E-09 1.5E-12 67.8 18.3 272 111-389 1-296 (296) 162 protein:vir:102655 Length: 322 98.8 1.7E-09 1E-12 68.7 16.6 287 102-392 1-322 (322) 163 protein:vir:105645 Length: 400 98.8 8.8E-10 5.5E-13 70.2 14.6 284 108-394 1-336 (400) 164 protein:vir:99075 Length: 392 98.7 5.1E-09 3.2E-12 66.0 17.2 266 111-394 1-305 (392) 165 protein:vir:104342 Length: 314 98.7 7.5E-09 4.7E-12 65.1 17.8 292 92-391 1-314 (314) 166 protein:vir:80068 Length: 301 98.7 1.7E-08 1E-11 63.2 19.6 268 113-388 1-301 (301) 167 protein:vir:107687 Length: 319 98.7 1.3E-08 7.8E-12 63.9 18.6 291 74-388 1-319 (319) 168 protein:vir:1663 Length: 393 # 98.7 1.1E-09 7E-13 69.6 12.5 373 1-389 1-393 (393) 169 protein:vir:8843 Length: 317 # 98.6 3.1E-08 1.9E-11 61.8 19.0 280 108-393 1-317 (317) 170 protein:vir:80446 Length: 367 98.6 2.9E-08 1.8E-11 61.9 18.5 276 111-394 1-339 (367) 171 protein:vir:79642 Length: 329 98.4 1.3E-07 7.8E-11 58.4 17.9 300 86-392 1-329 (329) 172 protein:vir:95131 Length: 325 98.3 2.1E-07 1.3E-10 57.2 16.6 272 111-394 1-297 (325) 173 protein:vir:108303 Length: 418 98.3 1.2E-06 7.4E-10 53.1 20.6 263 111-394 1-288 (418) 174 protein:vir:78387 Length: 349 98.2 1.9E-06 1.2E-09 51.9 19.9 269 111-394 1-319 (349) 175 protein:vir:94989 Length: 349 98.2 2.6E-06 1.6E-09 51.2 20.1 268 111-394 1-319 (349) 176 protein:vir:79548 Length: 652 98.1 4.9E-06 3E-09 49.7 23.3 378 1-388 224-652 (652) 177 protein:vir:97331 Length: 319 98.0 6.6E-06 4.1E-09 49.0 19.6 276 100-394 1-297 (319) 178 protein:vir:94800 Length: 319 98.0 6.6E-06 4.1E-09 49.0 19.6 276 100-394 1-297 (319) 179 protein:vir:3525 Length: 423 # 98.0 2.2E-06 1.3E-09 51.6 16.7 261 111-394 1-309 (423) 180 protein:vir:96792 Length: 315 98.0 5.7E-06 3.5E-09 49.3 18.9 265 111-394 1-284 (315) 181 protein:vir:107120 Length: 329 97.9 7.8E-06 4.8E-09 48.6 18.3 297 77-394 1-308 (329) 182 protein:vir:100331 Length: 342 97.8 9.2E-06 5.7E-09 48.2 17.0 299 90-392 1-342 (342) 183 protein:vir:1829 Length: 355 # 97.8 1.3E-05 7.9E-09 47.4 17.6 300 74-394 1-354 (355) 184 protein:vir:98566 Length: 355 97.8 1.4E-05 8.8E-09 47.2 17.6 302 82-394 1-353 (355) 185 protein:vir:104011 Length: 337 97.8 1.9E-05 1.2E-08 46.5 18.6 294 94-391 1-337 (337) 186 protein:vir:105374 Length: 423 97.8 1.9E-05 1.2E-08 46.4 18.1 263 111-394 1-337 (423) 187 protein:vir:174 Length: 423 # 97.8 2E-05 1.3E-08 46.3 18.7 260 111-394 1-337 (423) 188 protein:vir:79171 Length: 337 97.8 2.1E-05 1.3E-08 46.2 18.5 294 94-391 1-337 (337) 189 protein:vir:1153 Length: 338 # 97.8 2E-05 1.2E-08 46.4 18.0 296 94-393 1-338 (338) 190 protein:vir:5255 Length: 304 # 97.7 6.4E-06 4E-09 49.1 14.6 266 116-388 1-304 (304) 191 protein:vir:79157 Length: 339 97.7 2.4E-05 1.5E-08 45.9 17.7 295 94-392 1-339 (339) 192 protein:vir:78186 Length: 337 97.6 3.6E-05 2.2E-08 45.0 17.5 294 94-391 1-337 (337) 193 protein:vir:78777 Length: 358 97.6 2.9E-05 1.8E-08 45.5 16.8 298 90-394 1-347 (358) 194 protein:vir:98856 Length: 343 97.6 4.4E-05 2.7E-08 44.5 17.6 299 90-394 1-341 (343) 195 protein:vir:105522 Length: 423 97.5 4.8E-05 3E-08 44.3 19.4 258 111-394 1-337 (423) 196 protein:vir:6061 Length: 357 # 97.5 4.1E-05 2.6E-08 44.6 16.7 302 74-394 1-351 (357) 197 protein:vir:270 Length: 341 # 97.5 2.7E-05 1.7E-08 45.6 15.4 295 77-394 1-333 (341) 198 protein:vir:5694 Length: 357 # 97.4 6.4E-05 4E-08 43.6 16.4 300 74-394 1-351 (357) 199 protein:vir:95875 Length: 401 97.4 4.6E-05 2.8E-08 44.4 15.5 291 101-392 1-401 (401) 200 protein:vir:1781 Length: 221 # 97.4 2.4E-05 1.5E-08 45.9 13.9 183 194-394 1-205 (221) 201 protein:vir:2016 Length: 357 # 97.3 8.9E-05 5.5E-08 42.8 17.0 300 74-394 1-351 (357) 202 protein:vir:94070 Length: 339 97.3 3.5E-05 2.2E-08 45.0 14.2 306 51-388 1-339 (339) 203 protein:vir:95512 Length: 693 97.3 9.8E-05 6.1E-08 42.5 23.1 375 1-389 220-693 (693) 204 protein:vir:861 Length: 318 # 97.3 7.7E-06 4.8E-09 48.6 10.3 300 77-389 1-318 (318) 205 protein:vir:3643 Length: 336 # 97.2 6.5E-05 4E-08 43.5 14.0 302 46-388 1-336 (336) 206 protein:vir:3746 Length: 336 # 97.1 0.00017 1.1E-07 41.3 17.3 295 77-394 1-333 (336) 207 protein:vir:348 Length: 321 # 97.0 8.4E-05 5.2E-08 42.9 13.3 277 102-389 1-321 (321) 208 protein:vir:3783 Length: 336 # 97.0 0.00023 1.4E-07 40.5 17.2 294 90-394 1-333 (336) 209 protein:vir:78558 Length: 336 96.8 0.00025 1.6E-07 40.3 14.2 302 46-388 1-336 (336) 210 protein:vir:101557 Length: 336 96.6 0.00033 2E-07 39.7 13.9 302 46-388 1-336 (336) 211 protein:vir:95603 Length: 463 96.3 0.00033 2.1E-07 39.6 12.2 291 77-394 1-329 (463) 212 protein:vir:99311 Length: 463 96.3 0.00033 2.1E-07 39.6 12.2 291 77-394 1-329 (463) 213 protein:vir:79008 Length: 299 96.1 0.00094 5.9E-07 37.2 19.4 265 111-393 1-299 (299) 214 protein:vir:95451 Length: 313 96.1 0.0011 6.5E-07 36.9 14.9 271 113-393 1-313 (313) 215 protein:vir:106734 Length: 336 96.1 0.00054 3.3E-07 38.5 12.1 302 46-388 1-336 (336) 216 protein:vir:103886 Length: 302 95.8 0.0014 8.8E-07 36.2 16.8 268 111-392 1-302 (302) 217 protein:vir:96079 Length: 382 95.7 0.001 6.5E-07 36.9 12.3 322 60-388 1-382 (382) 218 protein:vir:107732 Length: 379 95.5 0.0012 7.6E-07 36.6 11.8 324 18-389 1-379 (379) 219 protein:vir:93696 Length: 364 95.4 0.0022 1.4E-06 35.1 17.0 276 111-394 1-364 (364) 220 protein:vir:96666 Length: 462 94.8 0.0034 2.1E-06 34.1 16.7 309 67-394 1-371 (462) 221 protein:vir:80835 Length: 464 94.4 0.0046 2.9E-06 33.4 13.1 308 77-394 1-368 (464) 222 protein:vir:94870 Length: 318 94.0 0.0044 2.7E-06 33.5 11.3 301 67-389 1-318 (318) 223 protein:vir:99576 Length: 388 93.9 0.00076 4.7E-07 37.7 6.9 326 18-388 1-388 (388) 224 protein:vir:63741 Length: 468 92.6 0.011 6.6E-06 31.4 13.0 294 68-394 1-321 (468) 225 protein:vir:80491 Length: 467 92.5 0.011 7E-06 31.3 13.4 293 70-394 1-320 (467) 226 protein:vir:78920 Length: 290 90.7 0.019 1.2E-05 30.0 19.7 261 111-390 1-290 (290) 227 protein:vir:7214 Length: 521 # 90.3 0.022 1.3E-05 29.7 19.0 356 1-394 3-507 (521) 228 protein:vir:1025 Length: 408 # 89.0 0.029 1.8E-05 29.0 14.4 365 1-394 11-407 (408) 229 protein:vir:103463 Length: 521 88.4 0.032 2E-05 28.8 19.3 351 1-394 3-509 (521) 230 protein:vir:105464 Length: 346 85.0 0.056 3.5E-05 27.5 18.7 266 111-394 1-303 (346) 231 protein:vir:98143 Length: 524 84.8 0.057 3.6E-05 27.4 19.7 355 1-394 1-504 (524) 232 protein:vir:80986 Length: 528 84.2 0.062 3.8E-05 27.2 20.6 346 1-391 1-528 (528) 233 protein:vir:6901 Length: 522 # 83.9 0.064 4E-05 27.1 19.4 350 1-394 4-510 (522) 234 protein:vir:79712 Length: 285 81.0 0.089 5.5E-05 26.3 18.8 259 111-392 1-285 (285) 235 protein:vir:99888 Length: 309 78.6 0.11 7E-05 25.8 15.2 264 116-392 1-309 (309) 236 protein:vir:7409 Length: 408 # 76.7 0.13 8.2E-05 25.4 14.3 370 1-394 11-407 (408) 237 protein:vir:2736 Length: 348 # 76.6 0.13 8.3E-05 25.4 20.3 276 111-392 1-348 (348) 238 protein:vir:102335 Length: 312 75.6 0.15 9E-05 25.2 20.5 265 111-393 1-312 (312) 239 protein:vir:100851 Length: 514 73.8 0.17 0.0001 24.9 14.1 324 54-394 1-386 (514) 240 protein:vir:99523 Length: 311 68.5 0.24 0.00015 24.0 20.6 272 115-391 1-311 (311) 241 protein:vir:96490 Length: 348 66.6 0.27 0.00016 23.7 20.1 276 111-392 1-348 (348) 242 protein:vir:3991 Length: 404 # 62.7 0.33 0.0002 23.2 14.2 353 1-394 11-399 (404) 243 protein:vir:4997 Length: 397 # 60.3 0.37 0.00023 22.9 15.0 360 1-394 4-394 (397) 244 protein:vir:3424 Length: 341 # 57.4 0.43 0.00027 22.6 21.8 271 119-389 1-341 (341) 245 protein:vir:100603 Length: 529 55.2 0.48 0.0003 22.3 18.4 345 29-394 1-517 (529) 246 protein:vir:6601 Length: 528 # 47.9 0.69 0.00043 21.5 21.7 347 1-391 1-528 (528) 247 protein:vir:393 Length: 341 # 47.5 0.7 0.00043 21.4 22.0 271 119-389 1-341 (341) 248 protein:vir:104915 Length: 470 46.3 0.74 0.00046 21.3 18.8 342 1-392 3-470 (470) 249 protein:vir:106998 Length: 468 45.9 0.75 0.00047 21.3 20.3 345 1-394 1-449 (468) 250 protein:vir:10123 Length: 404 44.1 0.82 0.00051 21.1 14.6 299 74-394 1-404 (404) 251 protein:vir:819 Length: 404 # 44.1 0.82 0.00051 21.1 14.6 299 74-394 1-404 (404) 252 protein:vir:3298 Length: 404 # 44.1 0.82 0.00051 21.1 14.6 299 74-394 1-404 (404) 253 protein:vir:104439 Length: 404 44.1 0.82 0.00051 21.1 14.6 299 74-394 1-404 (404) 254 protein:vir:4902 Length: 348 # 38.1 1.1 0.00067 20.4 19.8 278 111-392 1-348 (348) 255 protein:vir:104549 Length: 462 37.4 1.1 0.00069 20.3 19.0 323 30-394 1-449 (462) 256 protein:vir:106286 Length: 534 36.2 1.2 0.00074 20.2 19.1 355 1-394 1-522 (534) 257 protein:vir:5670 Length: 514 # 35.8 1.2 0.00075 20.1 19.8 341 33-391 1-514 (514) 258 protein:vir:102823 Length: 470 35.2 1.2 0.00077 20.1 16.0 294 77-394 1-369 (470) 259 protein:vir:4830 Length: 397 # 28.5 1.7 0.0011 19.3 13.9 349 1-388 4-397 (397) 260 protein:vir:4953 Length: 397 # 24.6 2.2 0.0013 18.8 15.8 347 1-394 4-391 (397) 261 protein:vir:101039 Length: 529 22.2 2.5 0.0015 18.4 17.8 343 29-391 1-529 (529) 262 protein:vir:101811 Length: 529 21.1 2.7 0.0017 18.3 19.5 338 29-391 1-529 (529) No 1 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=100.00 E-value=3.7e-75 Score=428.62 Aligned_cols=388 Identities=59% Similarity=0.935 Sum_probs=346.1 Q ss_pred CchHHH-HHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhhhh Q lcl|NC_019933. 1 MSDINA-INSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQHISIGQ 79 (394) Q Consensus 1 Mk~i~e-l~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~ 79 (394) |++|.+ |+++++++.+++++..++...+.+++++.+++++++.+++++++++|+++++++...+..........+..++ T Consensus 1 m~~l~~~l~~~~~~~~~~~~~~~e~~~~~~~~~~e~~~~~~~l~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~ 80 (390) T protein:vir:81 1 MTDITSKLEATLANVTDSLRAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVSVGD 80 (390) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccchh Confidence 999755 8999999999999999998888889999999999999999999999999998888888777776666666655 Q ss_pred hhhhHHHHHHHHHHhh-hhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEE Q lcl|NC_019933. 80 QFVNSDSFKAMAESGG-QRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYV 158 (394) Q Consensus 80 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~ 158 (394) ..........+..... .......+.++......+++++.+|.++|+++...|++.+++.++|++++++++++++.+++| T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~ 160 (390) T protein:vir:81 81 MFVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEYV 160 (390) T ss_pred hhhhhHHHHHHHHHHhhhhhhhhhHHHHHHHhhccccccCCcceechhhhHHHHHHHhhhhhhhhhcceeeccCCceEEE Confidence 5544444443333222 233344556666666777777888999999999999999999999999999999999999999 Q ss_pred EEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCc Q lcl|NC_019933. 159 RETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSAQLQSFINARLLRGLEVVEENQLLNGNGTGQ 238 (394) Q Consensus 159 ~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~ 238 (394) +.++..+.++|++||+.+|+++++|+++++.+++++++++||+|+++++++++++|.++|++++++++|.+||+|+|+++ T Consensus 161 ~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~ 240 (390) T protein:vir:81 161 QETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQILSDAPQLASYMNNRLIRGLKVKEDAEILRGTGAND 240 (390) T ss_pred EEecCCcceeeecCCcccccccceeeEEEEeeeEEEEeehhhHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC Confidence 99877678899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcccCCCceeecc Q lcl|NC_019933. 239 NLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPTLWGL 318 (394) Q Consensus 239 ~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~l~G~ 318 (394) +|.||++.++........++...++++.+++..+...++.+++|+|||.+|..|++++|++|+|+|+++..+++++|+|+ T Consensus 241 ~~~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~l~G~ 320 (390) T protein:vir:81 241 GLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYNPSGIVINPIDWAAIELAKDANNQYLIGNARGTLTPTLWGL 320 (390) T ss_pred cccceeecccccccccccccchhHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecCcccccCceecce Confidence 99999998887777777777788999999999999999999999999999999999999999999998888888899999 Q ss_pred eEEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEec Q lcl|NC_019933. 319 PVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLA 389 (394) Q Consensus 319 pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~ 389 (394) ||++++++|+++++||||+.+|.++++.+++++++++. .+|.+|++.||++.|+||++.+|+||++++++ T Consensus 321 pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~-~~~~~~~v~~r~~~r~d~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 321 PVVATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVG-EDFQRNMITVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred eeEEcCCCCCCcEEEEehhceEEEEEecceEEEEeccc-chhhcCcEEEEEEEeeccEEecccceEEEEeC Confidence 99999999999999999999899999999999887653 46899999999999999999999999999999 No 2 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=100.00 E-value=8.2e-75 Score=426.75 Aligned_cols=388 Identities=60% Similarity=0.946 Sum_probs=345.9 Q ss_pred CchHH-HHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhhhh Q lcl|NC_019933. 1 MSDIN-AINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQHISIGQ 79 (394) Q Consensus 1 Mk~i~-el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~ 79 (394) |++++ +|+++++++.++++++.++..++.+++++.+++++++.++++++++++++++++.+.............++..+ T Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~e~~~~~~~~~~e~~~~~~~~~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~ 80 (390) T protein:vir:97 1 MTDITAKLEATLANVTDSLKAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVSVGD 80 (390) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccchh Confidence 99875 59999999999999999999888889999999999999999999999999998888887777766666666555 Q ss_pred hhhhHHHHHHHHHHh-hhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEE Q lcl|NC_019933. 80 QFVNSDSFKAMAESG-GQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYV 158 (394) Q Consensus 80 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~ 158 (394) ..........+.... ........+.+.......++++.++|.++|+++...|++.+++.++|+++++++|++++.+++| T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~ 160 (390) T protein:vir:97 81 MFVASEQFQASTGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEYV 160 (390) T ss_pred hhhhhHHHHHHHHHhhhhhhhhhhHHHHHHHhhhcccccccccccchhhhHHHHHHHhhhhhhHhhcceeeccCCceEEE Confidence 544444444433332 2223334455666677777788889999999999999999999999999999999999999999 Q ss_pred EEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCc Q lcl|NC_019933. 159 RETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSAQLQSFINARLLRGLEVVEENQLLNGNGTGQ 238 (394) Q Consensus 159 ~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~ 238 (394) +.++..+.+.|++||+.+|+++++|+++++.+++++++++||+|+++++++++++|.++|++++++++|.+||+|+|+++ T Consensus 161 ~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds~~l~~~i~~~la~a~~~~~d~a~l~G~g~~~ 240 (390) T protein:vir:97 161 QETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQILSDAPQLASYMNNRLIRGLKVKEDAEILRGTGAND 240 (390) T ss_pred EEecCCcceeeecCCccccccccceeEEEEeeeeEEEeehhhHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCc Confidence 99887778999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcccCCCceeecc Q lcl|NC_019933. 239 NLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPTLWGL 318 (394) Q Consensus 239 ~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~l~G~ 318 (394) +|.||++.++..+.....++...++++.+++..+...+..+++|+|||.+|.+|+++||++|+|+|+++..+++++|+|+ T Consensus 241 ~p~Gi~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~l~G~ 320 (390) T protein:vir:97 241 GLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYPASGIVINPIDWAAIELAKDANNQYLIGNARGTLTPTLWGL 320 (390) T ss_pred cccceeeccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecCccCCCCceecce Confidence 99999998887777777778888999999999999999999999999999999999999999999998888888899999 Q ss_pred eEEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEec Q lcl|NC_019933. 319 PVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLA 389 (394) Q Consensus 319 pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~ 389 (394) ||++++.+|+++++||||+.+|.++++.++++.+.++. .+|.+|++.||+..|+||++++|+||++++++ T Consensus 321 pV~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-~~f~~~~~~~r~~~r~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 321 PVVATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVN-DDFQRNMVTVLAEERLALVVYRPEALITGSFA 390 (390) T ss_pred eeEEcCCCCCCcEEEEeccceEEEEEecceEEEEeecc-cccccCcEEEEEEEeeccEEeccccEEEEEeC Confidence 99999999999999999999899999999999987643 46899999999999999999999999999999 No 3 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=100.00 E-value=1.4e-74 Score=425.47 Aligned_cols=388 Identities=60% Similarity=0.949 Sum_probs=343.2 Q ss_pred CchHHH-HHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhhhh Q lcl|NC_019933. 1 MSDINA-INSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQHISIGQ 79 (394) Q Consensus 1 Mk~i~e-l~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~ 79 (394) |+++.+ |+++++++.++++++.++...+.++++|.+++++++.+++++++++++++++++...+..........+..++ T Consensus 1 m~e~~~~l~~~~~~~~~~~~~~~e~~~~~~~~~~e~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (390) T protein:vir:10 1 MTDITSKLEATLANVTDSLRAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVSVGD 80 (390) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccchhh Confidence 998555 8999999999999999998888889999999999999999999999999998888877776666666555555 Q ss_pred hhhhHHHHHHHHHHh-hhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEE Q lcl|NC_019933. 80 QFVNSDSFKAMAESG-GQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYV 158 (394) Q Consensus 80 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~ 158 (394) .......+..+.... ........+.+.......+.+++.+|.++|+++...|++.+++.++|+++|+++|++++.+++| T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~ 160 (390) T protein:vir:10 81 LFVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITQPDARLTVRDLIGSGRTDSALIEYV 160 (390) T ss_pred hhhhhHHHHHHHHhhhhhhhhhhhHHHHHHHhhhcccccccccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEE Confidence 444444333332222 2223334456666666666777777888999999999999999999999999999999999999 Q ss_pred EEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCc Q lcl|NC_019933. 159 RETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSAQLQSFINARLLRGLEVVEENQLLNGNGTGQ 238 (394) Q Consensus 159 ~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~ 238 (394) +.++.++.+.|++||+.+|+++++|+++++.+++++++++||+++++++++++++|.++|++++++++|.++|+|+|+++ T Consensus 161 ~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~~l~~~i~~~l~~~~~~~~~~~il~G~G~~~ 240 (390) T protein:vir:10 161 QETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQILSDAPQLASYMNNRLIRGLKVKEDAEILRGTGAND 240 (390) T ss_pred EEecCCcceeeecCCccccccccceeEEEEeeEEEEEeehhhHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCc Confidence 99877778999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcccCCCceeecc Q lcl|NC_019933. 239 NLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPTLWGL 318 (394) Q Consensus 239 ~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~l~G~ 318 (394) .|.||++.++....+...++...++++.+++..+...++.+++|+|||.+|..|++++|++|+|+|+++...++++|+|+ T Consensus 241 ~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~g~~l~~~~~~~~~~~l~G~ 320 (390) T protein:vir:10 241 GLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYPASGIVINPIDWAAIELAKDANNQYLIGNARGTLTPTLWGL 320 (390) T ss_pred cccccccccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecCCcCcCCceecce Confidence 99999999888777777777888999999999999999999999999999999999999999999998888888899999 Q ss_pred eEEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEec Q lcl|NC_019933. 319 PVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLA 389 (394) Q Consensus 319 pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~ 389 (394) ||++++.||+++++||||+.+|.++++.+++++++++. .+|.+|++.||++.|+||++++|+||++++++ T Consensus 321 pv~~~~~~p~~~~~~gdf~~~~~~~~~~~~~i~~~~~~-~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 321 PVVATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVN-DDFQRNMVTVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred eeEEcCCCCCCcEEEEeccceEEEEEecceEEEEeecc-cccccCcEEEEEEEeeccEEeccccEEEEEeC Confidence 99999999999999999999899999999999987653 46899999999999999999999999999999 No 4 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=100.00 E-value=3.6e-73 Score=417.72 Aligned_cols=392 Identities=50% Similarity=0.772 Sum_probs=327.6 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhh----hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchh Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQ----ELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQHIS 76 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~----~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~ 76 (394) |+.+++++++++++.++++++.++...+. +..+|.+++++++.++.+++++++++++++........ .....+. T Consensus 21 ~~~~~e~~~~l~~~~~e~~~~~e~~~~e~~~~~~~~~e~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~--~~~~~~~ 98 (418) T protein:vir:10 21 EQVLETVTKELKRIGDEVKSAGEKALAEAKRAGDLGVETKATVDELLIKQGELQARLLEAEQKLARGGGSA--ELETPKT 98 (418) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc--ccchhhh Confidence 77788888888888888888877655443 34666777788888888888888777666554332221 1222222 Q ss_pred hhhhhhhHHHHHHHHHHhhhhhhhhHHHHH--HHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCc Q lcl|NC_019933. 77 IGQQFVNSDSFKAMAESGGQRGRAEINIKA--AITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNT 154 (394) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 154 (394) ..+..........+............+.+. ........+++++|.+||+++...|++.+++.++|+++++++|++++. T Consensus 99 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 178 (418) T protein:vir:10 99 LGQLVTESEEMKGMDGSARKSVRVRVDRKSIMNVPATVGSGVSGSNSLVVADRQAGIIAPPQRKMTIRDLLMPGQTSSSS 178 (418) T ss_pred hhHHhhhHHHHHHHHHHHhhhhhhhhHHHHHHHhhhhccCCCCCCccccchhHHHHHHHHHhhhhhHHhhcceeeccCCc Confidence 333333333333333322222222222222 123334455667899999999999999999999999999999999988 Q ss_pred eeEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_019933. 155 LEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSAQLQSFINARLLRGLEVVEENQLLNGN 234 (394) Q Consensus 155 ~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~~~~~~i~~~la~a~~~~~d~a~l~g~ 234 (394) +++|+..+.++.+.|++||+.+|+++++|++|++.+++++++++||+++++++++++++|++.|++++++++|.+||+|+ T Consensus 179 ~~~~~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~l~~~i~~~l~~a~~~~~d~a~l~G~ 258 (418) T protein:vir:10 179 IEYTVETGFTNNAAAVAEGAQKPTSDLKFNLKNQPVRTIAHLFKASRQILDDAPALQSYIDGRARYGLQLTEEGQILKGD 258 (418) T ss_pred eeEEEEecCCCceeeeccCccccccccceeeEEEeeeeEEEeehhhHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 99999887778889999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcccCCCce Q lcl|NC_019933. 235 GTGQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPT 314 (394) Q Consensus 235 g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~ 314 (394) |++.+|.||++.++......+.++..+++++.+++..+...+..+++|+|||.+|..|++++|++|+|+|+.+..+++++ T Consensus 259 g~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~ 338 (418) T protein:vir:10 259 GTGANILGILPQASAFMPSITLANATPIDKIRLALLQAVLAEFPATGIVLNPIDWASIELTKDSQGRYIVGNPVNGTTPR 338 (418) T ss_pred CCCccccccccccccccccccccccccHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccccccCCCce Confidence 99999999999998888888788888999999999999999999999999999999999999999999998877778889 Q ss_pred eecceEEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 315 LWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 315 l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) |+|+||++++.||+++++||||+.+++++++.++++.++++...+|.+|++.||++.|+||++++|+||++++++++++= T Consensus 339 l~G~pV~~~~~~p~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~~~d~~~~~~~a~~~~~~~~~~~g 418 (418) T protein:vir:10 339 LWNLPVVETQAMTANEFLVGAFSMAAQIFDRMEIEVLLSTENVDDFEKNMVSIRAEERLALAVYRPESFVTGALVEQAGG 418 (418) T ss_pred ecceeeEEcCCCCCCcEEEeeccceEEEEEecceEEEEecccchhhhcCceEEEEEEeeccEEecccceEEEEeccCCCC Confidence 99999999999999999999999989999999999999999888999999999999999999999999999998877777 No 5 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=100.00 E-value=8.6e-73 Score=415.66 Aligned_cols=384 Identities=41% Similarity=0.625 Sum_probs=314.5 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhhhhh Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQHISIGQQ 80 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~ 80 (394) |++|++|+++++++.++++++.++...+.+ +..++.+++.+++.++.++++++++++...+................ T Consensus 1 M~~l~el~~~~~~~~~e~~~l~~~~~~e~~---~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (385) T protein:vir:19 1 MSELALIQKAIEESQQKMTQLFDAQKAEIE---STGQVSKQLQSDLMKVQEELTKSGTRLFDLEQKLASGAENPGEKKSF 77 (385) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhh Confidence 999999999999999999998876654432 23344455555666666666655555555444332222111111000 Q ss_pred -hhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEE Q lcl|NC_019933. 81 -FVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVR 159 (394) Q Consensus 81 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~ 159 (394) ......+....... ............ ...+++.+|.++|+++...|++.+++.++|+++|+++|++++.+++|+ T Consensus 78 ~~~~~~~~~~~~~~~----~~~~~~~~~~~~-~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~ 152 (385) T protein:vir:19 78 SERAAEELIKSWDGK----QGTFGAKTFNKS-LGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYVR 152 (385) T ss_pred HHHHHHHHHHHHHHh----hccchhhHHHhh-hccccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceEEEE Confidence 00011111111111 111111111122 233345567788999999999999999999999999999998999999 Q ss_pred EcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCcc Q lcl|NC_019933. 160 ETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSAQLQSFINARLLRGLEVVEENQLLNGNGTGQN 239 (394) Q Consensus 160 ~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~ 239 (394) ..+..+.++|++||+.+|+++++|+++++.+++++++++||+|+++++++++++|.++|++++++++|.+||+|+|++++ T Consensus 153 ~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~ 232 (385) T protein:vir:19 153 EEVFTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQVMDDAPMLQSYINNRLMYGLALKEEGQLLNGDGTGDN 232 (385) T ss_pred EecCCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCc Confidence 98777889999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcccCCCceeecce Q lcl|NC_019933. 240 LLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPTLWGLP 319 (394) Q Consensus 240 ~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~l~G~p 319 (394) |.||++.++..+.+...++..++++|.+++..+...+..+++|+|||.+|.+|++++|++|+|+|+.+..+++++|+|+| T Consensus 233 ~~Gi~~~~~~~~~~~~~~~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~l~G~p 312 (385) T protein:vir:19 233 LEGLNKVATAYDTSLNATGDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIALLKDNEGRYIFGGPQAFTSNIMWGLP 312 (385) T ss_pred ccccccccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccCcccCCCceeccee Confidence 99999998888777777778899999999999999999999999999999999999999999999988888889999999 Q ss_pred EEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecCCC Q lcl|NC_019933. 320 VVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAAA 392 (394) Q Consensus 320 v~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~ 392 (394) |++++.+|+++++||||+.+|.++++.++++++.++.+.+|.+|++.||+++|+||++++|+||+++++++++ T Consensus 313 V~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:19 313 VVPTKAQAAGTFTVGGFDMASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRPTAIIKGTFSSGS 385 (385) T ss_pred eEEcCcCCCCcEEEeecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEeccCC Confidence 9999999999999999999999999999999999988889999999999999999999999999999999999 No 6 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=100.00 E-value=8.6e-73 Score=415.66 Aligned_cols=384 Identities=41% Similarity=0.625 Sum_probs=314.5 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhhhhh Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQHISIGQQ 80 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~ 80 (394) |++|++|+++++++.++++++.++...+.+ +..++.+++.+++.++.++++++++++...+................ T Consensus 1 M~~l~el~~~~~~~~~e~~~l~~~~~~e~~---~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (385) T protein:vir:18 1 MSELALIQKAIEESQQKMTQLFDAQKAEIE---STGQVSKQLQSDLMKVQEELTKSGTRLFDLEQKLASGAENPGEKKSF 77 (385) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhh Confidence 999999999999999999998876654432 23344455555666666666655555555444332222111111000 Q ss_pred -hhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEE Q lcl|NC_019933. 81 -FVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVR 159 (394) Q Consensus 81 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~ 159 (394) ......+....... ............ ...+++.+|.++|+++...|++.+++.++|+++|+++|++++.+++|+ T Consensus 78 ~~~~~~~~~~~~~~~----~~~~~~~~~~~~-~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~ 152 (385) T protein:vir:18 78 SERAAEELIKSWDGK----QGTFGAKTFNKS-LGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYVR 152 (385) T ss_pred HHHHHHHHHHHHHHh----hccchhhHHHhh-hccccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceEEEE Confidence 00011111111111 111111111122 233345567788999999999999999999999999999998999999 Q ss_pred EcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCcc Q lcl|NC_019933. 160 ETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSAQLQSFINARLLRGLEVVEENQLLNGNGTGQN 239 (394) Q Consensus 160 ~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~ 239 (394) ..+..+.++|++||+.+|+++++|+++++.+++++++++||+|+++++++++++|.++|++++++++|.+||+|+|++++ T Consensus 153 ~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~ 232 (385) T protein:vir:18 153 EEVFTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQVMDDAPMLQSYINNRLMYGLALKEEGQLLNGDGTGDN 232 (385) T ss_pred EecCCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCc Confidence 98777889999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcccCCCceeecce Q lcl|NC_019933. 240 LLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPTLWGLP 319 (394) Q Consensus 240 ~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~l~G~p 319 (394) |.||++.++..+.+...++..++++|.+++..+...+..+++|+|||.+|.+|++++|++|+|+|+.+..+++++|+|+| T Consensus 233 ~~Gi~~~~~~~~~~~~~~~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~l~G~p 312 (385) T protein:vir:18 233 LEGLNKVATAYDTSLNATGDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIALLKDNEGRYIFGGPQAFTSNIMWGLP 312 (385) T ss_pred ccccccccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccCcccCCCceeccee Confidence 99999998888777777778899999999999999999999999999999999999999999999988888889999999 Q ss_pred EEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecCCC Q lcl|NC_019933. 320 VVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAAA 392 (394) Q Consensus 320 v~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~ 392 (394) |++++.+|+++++||||+.+|.++++.++++++.++.+.+|.+|++.||+++|+||++++|+||+++++++++ T Consensus 313 V~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:18 313 VVPTKAQAAGTFTVGGFDMASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRPTAIIKGTFSSGS 385 (385) T ss_pred eEEcCcCCCCcEEEeecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEeccCC Confidence 9999999999999999999999999999999999988889999999999999999999999999999999999 No 7 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=100.00 E-value=3.9e-71 Score=406.60 Aligned_cols=385 Identities=54% Similarity=0.793 Sum_probs=314.7 Q ss_pred Cc----hHHHHHHHHHHHHHHHHHHHHHHHhhh----hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Q lcl|NC_019933. 1 MS----DINAINSTLANISDSLKAHADRAVKDQ----ELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDV 72 (394) Q Consensus 1 Mk----~i~el~~~~~~~~~~~k~~~e~~~~~~----~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~ 72 (394) |+ +|+||+++++++.+++++..++...+. +..++.+++++++..++.+++.++++.+............. . T Consensus 1 m~~~~k~l~el~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 79 (395) T protein:vir:43 1 MSDFEKQIGELNASLKQVGDQIKSQAEQVNTQIANFGEMNKETRAKVDELLTAQGELQARLSAAEQAMLANEKRDGGE-E 79 (395) T ss_pred ChhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc-c Confidence 55 477888888888888877776654432 34566777777777777777777776655544443332222 1 Q ss_pred cchhhhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhcccccccc Q lcl|NC_019933. 73 QHISIGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEG 152 (394) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~ 152 (394) ..+..............+........... ......++++..+|.++|+++..+|++.+++.++|+++|+++++++ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~~ 154 (395) T protein:vir:43 80 APKTAGQMVAESLKEQGVTSSLRGSHRVS-----MPRSAITSIDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGTTES 154 (395) T ss_pred hhhhHHHHHHHHHHHHHHHHHhhhhhhhh-----hhhhhhcccCCCCccccchhhHHHHHHHHHhhhhHHhhccceecCC Confidence 12222222222222222222111111111 1223344556678899999999999999999999999999999999 Q ss_pred CceeEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019933. 153 NTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSAQLQSFINARLLRGLEVVEENQLLN 232 (394) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~~~~~~i~~~la~a~~~~~d~a~l~ 232 (394) +.+++|+.++..+.+.|++||+.+|+++++|+++++.+++++++++||+++++++++++++|.+.|++++++++|.+||+ T Consensus 155 ~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~~l~~~v~~~la~a~~~~~d~~~l~ 234 (395) T protein:vir:43 155 NSVEYVRETGFVNNAAPVSEGTQKPYSDLTFELENAPVRTIAHLFKASRQILDDASALQSYIDARARYGLMLVEECQLLY 234 (395) T ss_pred CceEEEEEecCCCceeeecCCccccccccceeEEEEeeeeEEEeehhhHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 88999998877778999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccCCCccccccccccccccccc--cccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcccC Q lcl|NC_019933. 233 GNGTGQNLLGLLPQATAFAAPI--TVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQGT 310 (394) Q Consensus 233 g~g~~~~~~Gi~~~~~~~~~~~--~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~ 310 (394) |+|++++|.||++.++...... ..++...++++.+++..+...+..+++|+|||.+|..|++++|++|+|+|+++..+ T Consensus 235 G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~ 314 (395) T protein:vir:43 235 GNGTGANLHGIIPQAQAYAPPSGVVVTAEQRIDRIRLAILQAQLAEFPASGIVLNPIDWALIELNKDAENRYIIGSPQNG 314 (395) T ss_pred ccCCCCccccccccccccccccccccccchhHHHHHHHHHhhccccCCCcEEEEcHHHHHHHHHhhccCCceeccccccC Confidence 9999999999999877655443 34455679999999999999999999999999999999999999999999888888 Q ss_pred CCceeecceEEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecC Q lcl|NC_019933. 311 LAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAA 390 (394) Q Consensus 311 ~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~ 390 (394) ++++|+|+||+++++||+++++||||+.++.++++.+++++++++.+.+|++|++.||++.|+||++++|+||+++++++ T Consensus 315 ~~~~l~G~pVv~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~ta 394 (395) T protein:vir:43 315 TTPTLWRLPVVETQAITQDEFLTGAFSLGAQIFDRMDIEVLVSTENDKDFENNMVTIRAEERLAFAVYRPEAFVTGSLTA 394 (395) T ss_pred CCceecceeeEEcCCCCCCcEEEEeccceEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEecc Confidence 88899999999999999999999999998999999999999999988899999999999999999999999999999999 Q ss_pred C Q lcl|NC_019933. 391 A 391 (394) Q Consensus 391 a 391 (394) | T Consensus 395 a 395 (395) T protein:vir:43 395 S 395 (395) T ss_pred C Confidence 9 No 8 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=100.00 E-value=6.5e-67 Score=383.43 Aligned_cols=381 Identities=17% Similarity=0.223 Sum_probs=288.0 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc--cccchhhh Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGG--DVQHISIG 78 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~--~~~~~~~~ 78 (394) |+++++|++.++++.++++++.++.. ...++.+++.+++..+++.++++++++++.....+...... ........ T Consensus 1 l~~~k~l~~~i~e~~~~~~~~k~~~~---~~~~~~e~~~~~l~~~~e~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~ 77 (407) T protein:vir:48 1 MADVKDVEQVAQELQRKFDDFKEKND---KRIDAIEQEKGKLAGEVETLNGKLAELENLKSDLEAELAEVKRPAGGTQNK 77 (407) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Confidence 88888888877766666554433221 12333445555566666666666655554444332221110 00000000 Q ss_pred hhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEE Q lcl|NC_019933. 79 QQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYV 158 (394) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~ 158 (394) ........+..+.+.+........+ ......+++++||++||+++.++|++.+++.++|+++|+++|++++.+++| T Consensus 78 ~~~e~~~a~~~~l~~g~~~~~~~~e----~~a~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~ 153 (407) T protein:vir:48 78 VASEHKEAFIGFMRKGREDGLRELE----RKALQVGNDEDGGYAIPEELDRTILTLLKDEVVMRQEATVITLGGSDYKKL 153 (407) T ss_pred hhhHHHHHHHHHHhccchhhhhHHH----HHhhhcccCCCCcccccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEE Confidence 1111122222232222211111222 334556667789999999999999999999999999999999999899999 Q ss_pred EEcCcccccceecCCccccccc-cceeeEEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhccCC Q lcl|NC_019933. 159 RETGFTNAAAPVAEGAQKPESS-LRFDLVQTSAKVIAHWMKASRQILSDSA-QLQSFINARLLRGLEVVEENQLLNGNGT 236 (394) Q Consensus 159 ~~~~~~~~~~~~~eg~~~~~~~-~~~~~i~~~~~k~~~~~~is~e~l~~s~-~~~~~i~~~la~a~~~~~d~a~l~g~g~ 236 (394) +..+. ..+.|++|++.+|+++ ++|+++++.+++++++++||+|+++|+. +++++|.++|+++++.++|.+|++|+|+ T Consensus 154 ~~~~~-~~a~~v~E~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~~~a~l~G~G~ 232 (407) T protein:vir:48 154 VNLGG-TTSGWVGETDARPETATSKLGLIEPFMGEIYGNPQATQKMLDDAFFNVEDWINSELALEFAEQEEIAFTSGDGS 232 (407) T ss_pred EecCC-cceeeecccccccccccccceeEEeeeeeeEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCC Confidence 98765 5788999999999865 7999999999999999999999999985 8999999999999999999999999998 Q ss_pred Cccccccccccccccc------------cccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccc Q lcl|NC_019933. 237 GQNLLGLLPQATAFAA------------PITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYIL 304 (394) Q Consensus 237 ~~~~~Gi~~~~~~~~~------------~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~ 304 (394) + .|.||++.+..... +...++..++++|++++..++..+..+++|+||+.+|..|++++|++|+|+| T Consensus 233 ~-~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~~~a~~v~n~~~~~~L~~lkD~~Gr~l~ 311 (407) T protein:vir:48 233 K-KPKGFLAYESTDEDDKTRAFGKLQHIASGAASGVTADAIIKLIYTLRKAHRSGAKFMMNNSSLFAIRLLKDNDGNYLW 311 (407) T ss_pred C-ccceeeecccccccccccccccccccccccccccChHHHHHHHHhhchhhhcCCEEEEcHHHHHHHHHhhccCCceee Confidence 5 68899876553321 2233455679999999999999999999999999999999999999999999 Q ss_pred cCc-ccCCCceeecceEEEcCCCCc-----CceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEe Q lcl|NC_019933. 305 GNP-QGTLAPTLWGLPVVATQAMAV-----GQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVY 378 (394) Q Consensus 305 ~~~-~~~~~~~l~G~pv~~~~~~p~-----~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~ 378 (394) ++. ..+.+++|+|+||++++.||. ..++||||+.+|.++++.++++..++ ++.+|++.||++.|+|++++ T Consensus 312 ~~~~~~g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~i~~d~----~~~~~~~~~~~~~r~d~~v~ 387 (407) T protein:vir:48 312 RPGIELGQPSSLAGYGIVENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILRDP----YTNKPFVGFYTTKRTGGMLV 387 (407) T ss_pred ccCcCCCCCceecceeeEEecCcCCccCCccEEEEEeccccEEEEEeeceEEEeec----cccCCcEEEEEEEEeccEEe Confidence 654 445556899999999999985 23788999998999999888876543 46789999999999999999 Q ss_pred cccceEEEEecCCCCC Q lcl|NC_019933. 379 RPESFIKGSLAAAAGT 394 (394) Q Consensus 379 ~~~a~~~l~~~~a~~~ 394 (394) +|+||+++++++++++ T Consensus 388 ~~~a~~~l~~~aa~~~ 403 (407) T protein:vir:48 388 DSQAIKLMKIGAATRQ 403 (407) T ss_pred cccceEEEEeeccCCC Confidence 9999999999999999 No 9 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=100.00 E-value=3.5e-66 Score=379.42 Aligned_cols=376 Identities=20% Similarity=0.253 Sum_probs=282.0 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhh-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhhhh Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQ-ELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQHISIGQ 79 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~-~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~ 79 (394) ||+++++.+++.+..+++++..++...+. ....++.++.+.+.+++.+++..+++++.+............... . T Consensus 5 lk~l~~~~~el~~~~~~~k~~~~~~~~~~e~~~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~--- 80 (401) T protein:vir:44 5 IKDVEQVAQELQQKFDDFKAKNDKRVEAIEQEKGKLAGQVETLNGKLSELENLKSDLEKELLELKRPARGAQNKV-A--- 80 (401) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccch-h--- Confidence 55666666666666666665554433322 122333444444444444444444443333332221111111111 1 Q ss_pred hhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEE Q lcl|NC_019933. 80 QFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVR 159 (394) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~ 159 (394) ......+..+.+..........+. ..+.++++++||++||+++.++|++.+++.++|+++++++|++++.+++|+ T Consensus 81 -~e~~~a~~~~lr~~~~~~~~~~e~----~a~~~~~~~~GG~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~ 155 (401) T protein:vir:44 81 -AEHKDAFVGFLRKGREDGLRDLER----KALQVGTDEDGGYAVPEELDRSILSLLKDEVVMRQEATVITVGGSDYKKLV 155 (401) T ss_pred -HHHHHHHHHHHhhhhhhhhHHHHH----HHhhcCCCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEE Confidence 111223333333333222222333 345566677899999999999999999999999999999999998899999 Q ss_pred EcCcccccceecCCcccccc-ccceeeEEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhccCCC Q lcl|NC_019933. 160 ETGFTNAAAPVAEGAQKPES-SLRFDLVQTSAKVIAHWMKASRQILSDSA-QLQSFINARLLRGLEVVEENQLLNGNGTG 237 (394) Q Consensus 160 ~~~~~~~~~~~~eg~~~~~~-~~~~~~i~~~~~k~~~~~~is~e~l~~s~-~~~~~i~~~la~a~~~~~d~a~l~g~g~~ 237 (394) ..+. ..+.|++|++.+|.+ .++|++|++.++|++++++||+|+++|++ +++++|.++|++++++++|.++|+|+|+ T Consensus 156 ~~~~-~~a~wv~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~ai~~~~~~~~l~G~G~- 233 (401) T protein:vir:44 156 NLGG-TASGWVGETDTRSQTATSRLGLIEPFMGEIYGNPQATQKMLDDAFFNVEAWINSELATEFAEQEEIAFTTGDGT- 233 (401) T ss_pred ecCC-ccceeeccccccCccccccceeeeeehhheeeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCC- Confidence 8764 577899999999875 48999999999999999999999999985 8999999999999999999999999997 Q ss_pred ccccccccccccccc------------cccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCccccc Q lcl|NC_019933. 238 QNLLGLLPQATAFAA------------PITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILG 305 (394) Q Consensus 238 ~~~~Gi~~~~~~~~~------------~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~ 305 (394) +.|.||++.....+. .....+..+++++.+++..++..+..+++|+||+.+|..|++++|++|+|+|. T Consensus 234 ~~p~Gil~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~ 313 (401) T protein:vir:44 234 KKPKGFLAYESTEESDKARAFGKLQHIVSGEATAVTADAIIKLIYTLRKAHRTGAKFMMNNNSLFAIRLLKDTEGNYLWR 313 (401) T ss_pred CccceeeccccccccccccccccccccccccccccCHHHHHHHHHhcchhhhcCCEEEEcHHHHHHHHHhhccCCceeec Confidence 568999876554332 12234456799999999999999999999999999999999999999999996 Q ss_pred Cc-ccCCCceeecceEEEcCCCCcC-----ceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEec Q lcl|NC_019933. 306 NP-QGTLAPTLWGLPVVATQAMAVG-----QFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYR 379 (394) Q Consensus 306 ~~-~~~~~~~l~G~pv~~~~~~p~~-----~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~ 379 (394) +. ..+.+++|+|+||++++.+|.. .++||||+.+|.++++.++++..++ ++.+|++.||++.|+|+++.+ T Consensus 314 ~~~~~g~~~~l~G~PVv~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~~~~~~----~~~~~~v~~~a~~r~d~~~~~ 389 (401) T protein:vir:44 314 PGLELGQPSSLAGYGIAENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILRDP----YTNKPFVGFYTTKRTGGMLVD 389 (401) T ss_pred CCcCCCCCceecceeeEEecCcCCccCCccEEEEeehhccEEEEEecceEEeeec----cccCCcEEEEEEEEeccEEec Confidence 54 4455668999999999999852 3688999998999999998876543 467999999999999999999 Q ss_pred ccceEEEEecCC Q lcl|NC_019933. 380 PESFIKGSLAAA 391 (394) Q Consensus 380 ~~a~~~l~~~~a 391 (394) |+||+++++++| T Consensus 390 ~~a~~~l~~~aa 401 (401) T protein:vir:44 390 SQAIKLLKIAAA 401 (401) T ss_pred ccceEEEEeecC Confidence 999999999999 No 10 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=100.00 E-value=1.4e-65 Score=376.21 Aligned_cols=372 Identities=13% Similarity=0.155 Sum_probs=287.6 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--cchhh- Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDV--QHISI- 77 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~--~~~~~- 77 (394) ||+++||+++++++.++++++.++............++++++.++++.++.+++.++++............. ..+.. T Consensus 1 Mk~~~el~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ee~~~~~~~i~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (397) T protein:vir:49 1 MKTSNELHDLWVAQGDKVENLNEKLNVAMLDDSVSAEELQAIKNERDTAKMKRDMFKEQYTEARANEVANMSEEEKKPLT 80 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc Confidence 999999999999999999887766554322222223456666666666666666555444443322221111 11111 Q ss_pred -hhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCc-- Q lcl|NC_019933. 78 -GQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNT-- 154 (394) Q Consensus 78 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-- 154 (394) ............+.. . ..............+++++||++||+++.+.|++.+++.++|+++|+++|+++.+ T Consensus 81 ~~~~~~~~~~~~~~~~-~-----l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 154 (397) T protein:vir:49 81 KSEEEVKAGFVKDFKN-L-----VRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVENVTTLTGS 154 (397) T ss_pred cchhHHHHHHHHHHHH-H-----HhcchhHHHHHhhccccccCcccccHhHHHHHHHHHHhhhhHHhhhceeecccCccc Confidence 111111111111111 1 1111222334455666778999999999999999999999999999999987654 Q ss_pred eeEEEEcCcccccceecCCccccc-cccceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019933. 155 LEYVRETGFTNAAAPVAEGAQKPE-SSLRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLN 232 (394) Q Consensus 155 ~~~~~~~~~~~~~~~~~eg~~~~~-~~~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~ 232 (394) +.+|+.....+.+.|++||+.+|+ ++++|+++++++++++++++||+|+++|+ .++++||.++|++++++++|.++++ T Consensus 155 ~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ai~~ 234 (397) T protein:vir:49 155 RVYEKWTDITGLANIDDEAGKIADVDDPKLSLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILE 234 (397) T ss_pred eEEEeeccCCcceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 556666655567899999999996 67999999999999999999999999998 5899999999999999999999999 Q ss_pred ccCCCccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccC-cccCC Q lcl|NC_019933. 233 GNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGN-PQGTL 311 (394) Q Consensus 233 g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~-~~~~~ 311 (394) |+|++.+..| ..+++++.+++.+++..+..+++|+|||.+|..|+++||++|+|+|++ ..+++ T Consensus 235 G~g~~~~~~~----------------~~~~d~i~~~~~~l~~~~~~~a~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~ 298 (397) T protein:vir:49 235 AIAALPTKPT----------------LTKWDDIIDLEAKVDPAIKQTSFFLTNTSGFTALKKVKNALGDYLMERDVKSPT 298 (397) T ss_pred hccccccccc----------------cccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceeeccCcCCCC Confidence 9988766433 246889999999999999999999999999999999999999999975 45566 Q ss_pred CceeecceEEEcC--CCCc-----CceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceE Q lcl|NC_019933. 312 APTLWGLPVVATQ--AMAV-----GQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFI 384 (394) Q Consensus 312 ~~~l~G~pv~~~~--~~p~-----~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~ 384 (394) +++|+|+||++++ .+|. ..++||||+.+|.++++.+++++++++...+|.+|++.||++.|+|+++++|+||+ T Consensus 299 ~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~ 378 (397) T protein:vir:49 299 GYSIDGFAVKEVADRWLANGTGGAMPLYFGDLKQAVTLFDRQHMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFV 378 (397) T ss_pred CceecceeeEEecccccccccCCceeEEEeeccceEEEEeecceEEEEeccccchhhcCceeEEEEeeeCcEEecccceE Confidence 6799999998754 3443 34899999999999999999999999888899999999999999999999999999 Q ss_pred EEEecCCCCC Q lcl|NC_019933. 385 KGSLAAAAGT 394 (394) Q Consensus 385 ~l~~~~a~~~ 394 (394) ++++++++++ T Consensus 379 ~~~~~~~~~~ 388 (397) T protein:vir:49 379 PASFKAIADQ 388 (397) T ss_pred EEEeecccCC Confidence 9999998887 No 11 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=100.00 E-value=2.6e-65 Score=374.64 Aligned_cols=393 Identities=28% Similarity=0.399 Sum_probs=271.5 Q ss_pred CchHHHHHHHHHHHHHHHHHHHH--------------HHHhhhh-hhHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHAD--------------RAVKDQE-LNASVRA--KVDELLMAQGALQADLKAAQQRIAEV 63 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e--------------~~~~~~~-~~~e~~~--~~~~~~~~~~~l~~~i~~~e~~~~~~ 63 (394) ||.-..|+++..++.++++++.. +...+.+ +.++... +.++..++..++++++++++..+.+. T Consensus 1 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~aE~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 80 (497) T protein:vir:10 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 66544444444444444333322 2111111 0011110 11122233333444444444333333 Q ss_pred Hhhcccccccchh----hhhhhhh-------------------HHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCc Q lcl|NC_019933. 64 EGNGAGGDVQHIS----IGQQFVN-------------------SDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAG 120 (394) Q Consensus 64 ~~~~~~~~~~~~~----~~~~~~~-------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 120 (394) +............ ..+.... .....................+........++++.|| T Consensus 81 e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg 160 (497) T protein:vir:10 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) T ss_pred HhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcccc Confidence 2221111110000 0000000 0000000001111111112223344555667778899 Q ss_pred cccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhh Q lcl|NC_019933. 121 ATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKAS 200 (394) Q Consensus 121 ~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is 200 (394) ++||+++..+|++.+++.++|++++++++++++.++||+..+..+.+.|++||+.+|+++++|++|++.+++++++++|| T Consensus 161 ~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~a~~~~iS 240 (497) T protein:vir:10 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTIT 240 (497) T ss_pred cccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCcceeeccCcccccccccceeeEeeeeeeEeecHhH Confidence 99999999999999999999999999999999999999988777788999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCcccccccccccccccccccc----------------------- Q lcl|NC_019933. 201 RQILSDSAQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVA----------------------- 257 (394) Q Consensus 201 ~e~l~~s~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~----------------------- 257 (394) +|+++|++++++||.++|++++++++|.+||+|+|++ +|.||++.+++.+.+.... T Consensus 241 ~ell~d~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~-~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (497) T protein:vir:10 241 DEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYP-GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAF 319 (497) T ss_pred HHHHHhHHHHHHHHHHHHHHHHHHHHHHHhhcCCCcc-cccccccccccccccccccchhhhhhhhhhhhhhcccccchh Confidence 9999999999999999999999999999999999976 5899988765443221110 Q ss_pred -------------------------------ccchHHHHHHHHHHhhhhc-CCCCeeEeCHHHHHHHHHhhccCCccccc Q lcl|NC_019933. 258 -------------------------------NATAVDRLRLALLQAQLAE-FPATGIVLNPADWAGIELLKDTQGRYILG 305 (394) Q Consensus 258 -------------------------------~~~~~~~i~~~~~~~~~~~-~~~~~~~~~~~~~~~l~~lkd~~G~~~~~ 305 (394) .......+..++..+...+ ..+++|+|||.+|..|+++||++|+|+|+ T Consensus 320 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~ 399 (497) T protein:vir:10 320 VGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGG 399 (497) T ss_pred hhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHHhhcCCCceecc Confidence 1112233344444444333 45668999999999999999999999997 Q ss_pred CcccC-------CCceeecceEEEcCCCCcCceEEeeccce-EEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEE Q lcl|NC_019933. 306 NPQGT-------LAPTLWGLPVVATQAMAVGQFLTGAFDAG-AQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAV 377 (394) Q Consensus 306 ~~~~~-------~~~~l~G~pv~~~~~~p~~~~~~gd~~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v 377 (394) +.... .+++|+|+||++++.||+++++||||+.+ +.++++.+++|.++++...+|++|++.+|++.|+||.| T Consensus 400 ~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v 479 (497) T protein:vir:10 400 NFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLV 479 (497) T ss_pred CcccccccccccCCceeeceeeEecCCCCCCceEEeecccceEEEEEecccEEEeecccchhhhcCcEEEEEEEeeccee Confidence 65422 34589999999999999999999999985 55889999999999988889999999999999999999 Q ss_pred ecccceEEEEecCCCCC Q lcl|NC_019933. 378 YRPESFIKGSLAAAAGT 394 (394) Q Consensus 378 ~~~~a~~~l~~~~a~~~ 394 (394) ++|+||+++++++++.. T Consensus 480 ~~p~A~~~l~~~~~~~~ 496 (497) T protein:vir:10 480 YRPSAFQLIQLKKGATG 496 (497) T ss_pred eccccEEEEEecCCccC Confidence 99999999998776655 No 12 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=100.00 E-value=2.6e-65 Score=374.64 Aligned_cols=393 Identities=28% Similarity=0.399 Sum_probs=271.5 Q ss_pred CchHHHHHHHHHHHHHHHHHHHH--------------HHHhhhh-hhHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHAD--------------RAVKDQE-LNASVRA--KVDELLMAQGALQADLKAAQQRIAEV 63 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e--------------~~~~~~~-~~~e~~~--~~~~~~~~~~~l~~~i~~~e~~~~~~ 63 (394) ||.-..|+++..++.++++++.. +...+.+ +.++... +.++..++..++++++++++..+.+. T Consensus 1 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~aE~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 80 (497) T protein:vir:78 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 66544444444444444333322 2111111 0011110 11122233333444444444333333 Q ss_pred Hhhcccccccchh----hhhhhhh-------------------HHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCc Q lcl|NC_019933. 64 EGNGAGGDVQHIS----IGQQFVN-------------------SDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAG 120 (394) Q Consensus 64 ~~~~~~~~~~~~~----~~~~~~~-------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 120 (394) +............ ..+.... .....................+........++++.|| T Consensus 81 e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg 160 (497) T protein:vir:78 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) T ss_pred HhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcccc Confidence 2221111110000 0000000 0000000001111111112223344555667778899 Q ss_pred cccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhh Q lcl|NC_019933. 121 ATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKAS 200 (394) Q Consensus 121 ~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is 200 (394) ++||+++..+|++.+++.++|++++++++++++.++||+..+..+.+.|++||+.+|+++++|++|++.+++++++++|| T Consensus 161 ~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~a~~~~iS 240 (497) T protein:vir:78 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTIT 240 (497) T ss_pred cccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCcceeeccCcccccccccceeeEeeeeeeEeecHhH Confidence 99999999999999999999999999999999999999988777788999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCcccccccccccccccccccc----------------------- Q lcl|NC_019933. 201 RQILSDSAQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVA----------------------- 257 (394) Q Consensus 201 ~e~l~~s~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~----------------------- 257 (394) +|+++|++++++||.++|++++++++|.+||+|+|++ +|.||++.+++.+.+.... T Consensus 241 ~ell~d~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~-~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (497) T protein:vir:78 241 DEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYP-GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAF 319 (497) T ss_pred HHHHHhHHHHHHHHHHHHHHHHHHHHHHHhhcCCCcc-cccccccccccccccccccchhhhhhhhhhhhhhcccccchh Confidence 9999999999999999999999999999999999976 5899988765443221110 Q ss_pred -------------------------------ccchHHHHHHHHHHhhhhc-CCCCeeEeCHHHHHHHHHhhccCCccccc Q lcl|NC_019933. 258 -------------------------------NATAVDRLRLALLQAQLAE-FPATGIVLNPADWAGIELLKDTQGRYILG 305 (394) Q Consensus 258 -------------------------------~~~~~~~i~~~~~~~~~~~-~~~~~~~~~~~~~~~l~~lkd~~G~~~~~ 305 (394) .......+..++..+...+ ..+++|+|||.+|..|+++||++|+|+|+ T Consensus 320 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~ 399 (497) T protein:vir:78 320 VGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGG 399 (497) T ss_pred hhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHHhhcCCCceecc Confidence 1112233344444444333 45668999999999999999999999997 Q ss_pred CcccC-------CCceeecceEEEcCCCCcCceEEeeccce-EEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEE Q lcl|NC_019933. 306 NPQGT-------LAPTLWGLPVVATQAMAVGQFLTGAFDAG-AQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAV 377 (394) Q Consensus 306 ~~~~~-------~~~~l~G~pv~~~~~~p~~~~~~gd~~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v 377 (394) +.... .+++|+|+||++++.||+++++||||+.+ +.++++.+++|.++++...+|++|++.+|++.|+||.| T Consensus 400 ~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v 479 (497) T protein:vir:78 400 NFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLV 479 (497) T ss_pred CcccccccccccCCceeeceeeEecCCCCCCceEEeecccceEEEEEecccEEEeecccchhhhcCcEEEEEEEeeccee Confidence 65422 34589999999999999999999999985 55889999999999988889999999999999999999 Q ss_pred ecccceEEEEecCCCCC Q lcl|NC_019933. 378 YRPESFIKGSLAAAAGT 394 (394) Q Consensus 378 ~~~~a~~~l~~~~a~~~ 394 (394) ++|+||+++++++++.. T Consensus 480 ~~p~A~~~l~~~~~~~~ 496 (497) T protein:vir:78 480 YRPSAFQLIQLKKGATG 496 (497) T ss_pred eccccEEEEEecCCccC Confidence 99999999998776655 No 13 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=100.00 E-value=2.1e-65 Score=375.14 Aligned_cols=384 Identities=17% Similarity=0.235 Sum_probs=285.9 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc---cc----cccc Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNG---AG----GDVQ 73 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~---~~----~~~~ 73 (394) |++|++|+++++++.++++++.+.......++++..++++++.+++++|+.+|++++.......... .. .... T Consensus 1 M~kl~~L~e~r~~l~~~~~~l~~~~~e~~~lt~ee~~~~~~l~~e~~~l~~~i~~~e~~e~~~~~~~~~~~~~~~~~~~~ 80 (428) T protein:vir:10 1 MPQIEELRRQRAGINEQIQALATIEATNGTLTAEQLTEFAGLQQQFTDISAKMDRMEATERAAALVAKPVKATQHGPAVI 80 (428) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhchhhccccc Confidence 9999999999999999999988765555567888889999999999999999986554332221110 00 0000 Q ss_pred chhhhhhhhhHHHHHHHHHH----h-hhh----h-hhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHH Q lcl|NC_019933. 74 HISIGQQFVNSDSFKAMAES----G-GQR----G-RAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRS 143 (394) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~----~-~~~----~-~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~ 143 (394) ........ ....+...... . ... . ....... .........++.||++||+++.++||+.+++.++|++ T Consensus 81 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~gg~liP~~~~~~ii~~l~~~~~l~~ 158 (428) T protein:vir:10 81 VKAEPKQY-TGAGMTRMVMSIAAAQGNLQDAAKFASDELNDQ-SVSMAISTAAGSGGVLIPQNIHSEVIELLRDRTIVRK 158 (428) T ss_pred cccccchh-hhHHHHHHHHHHHHhhhhHHHHHHHhhhhhhhh-hHhhhhcccccCCccccchhHHHHHHHHHhhhchhhh Confidence 00000000 00000000000 0 000 0 0000011 1112223344578999999999999999999999999 Q ss_pred h-ccccccccCceeEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHH Q lcl|NC_019933. 144 L-LAQGTMEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRG 221 (394) Q Consensus 144 ~-~~~~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a 221 (394) + ++++|++++.+++|+.++. +.+.|++||+.+|+++++|++|++.+++++++++||+|+++|+ +++++||.++|+++ T Consensus 159 ~~~~~~~~~~g~~~~p~~~~~-~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~v~is~ell~ds~~~l~~~i~~~l~~a 237 (428) T protein:vir:10 159 LGARSIPLPNGNMSLPRLAGG-ATASYTGENQDAKVSEARFDDVKLTAKTMIAMVPISNALIGRAGFNVEQLVLQDILTA 237 (428) T ss_pred hcceeeecCCcceEEEEEeCC-cceeeeccCccccccccceeeEEeeeEEEEEeehhhHHHHhhhhHHHHHHHHHHHHHH Confidence 9 6788888888999998764 6789999999999999999999999999999999999999987 69999999999999 Q ss_pred HHHHHHHHHhhccCCCcccccccccccccccc--ccccccchHH---HH---HHHHHHhhhhcCCCCeeEeCHHHHHHHH Q lcl|NC_019933. 222 LEVVEENQLLNGNGTGQNLLGLLPQATAFAAP--ITVANATAVD---RL---RLALLQAQLAEFPATGIVLNPADWAGIE 293 (394) Q Consensus 222 ~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~--~~~~~~~~~~---~i---~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 293 (394) +++++|.+||+|+|+++.|.||++.++..... .......+++ .. +.+.......+..+++|+||+.++..|+ T Consensus 238 i~~~~d~~~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~ 317 (428) T protein:vir:10 238 ISVREDKAFMRDDGTGDTPIGMKARATQWNRLLPWAADAAVNLDTIDTYLDSIILMSMDGNSNMISSGWGMSNRTYMKLF 317 (428) T ss_pred HHHHHHHHHhccCCCCccccccccccccccccccccccccccHHHHHHHHHHHHHhhhccccccccCEEEEcHHHHHHHH Confidence 99999999999999999999999876543221 1111222222 22 2223344555667889999999999999 Q ss_pred HhhccCCcccccCcccCCCceeecceEEEcCCCCcC--------ceEEeeccceEEEEeecceEEEEecccc-------- Q lcl|NC_019933. 294 LLKDTQGRYILGNPQGTLAPTLWGLPVVATQAMAVG--------QFLTGAFDAGAQVFDRWAARVEVATENQ-------- 357 (394) Q Consensus 294 ~lkd~~G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~--------~~~~gd~~~~~~~~~~~~~~i~~~~~~~-------- 357 (394) +++|++|+|+|+.. .+++|+|+||++++.+|++ .++||||+. ++++.+.++.+.++++.. T Consensus 318 ~lkd~~G~~i~~~~---~~g~l~G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~-~~i~~~~~i~i~~~~~~~~~~~~~~~ 393 (428) T protein:vir:10 318 GLRDGNGNKVYPEM---AQGMLKGYPIQRTSAIPANLGEGGKESEIYFADFND-VVIGEDGNMKVDFSKEASYIDTDGKL 393 (428) T ss_pred HhhccCCceeccCC---CCCeeeceeeEEeccccccccCCCccceEEEEecce-EEEEEecceEEEeecccccccccccc Confidence 99999999999643 3458999999999999864 479999986 668889999999888753 Q ss_pred -hhhhcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|NC_019933. 358 -DDFIKNMVTILAEERLALAVYRPESFIKGSLAAA 391 (394) Q Consensus 358 -~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a 391 (394) .+|++|++.+|++.|+||++.+|+||++++--.= T Consensus 394 ~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 394 VSAFSRNQSLIRVVTEHDIGFRHPEGLVLGTGVLF 428 (428) T ss_pred cchhhcchhheeeeeeeCceeeccceEEEEeccCC Confidence 4699999999999999999999999999984433 No 14 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=100.00 E-value=1.7e-64 Score=370.25 Aligned_cols=372 Identities=14% Similarity=0.161 Sum_probs=286.7 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--cchhhh Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDV--QHISIG 78 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~--~~~~~~ 78 (394) ||+++||+++++++.++++++.++............++++++..+++.++++++.+++.............. ...... T Consensus 1 Mk~~~eL~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (397) T protein:vir:49 1 MKTSNELHDLWIAQGDKVENLNEKLNVAMLDDSVSAEELQAIKNERDTAKMKRDLFKEQYTEARANEVANMSEEEKKPLT 80 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccc Confidence 999999999999999888876655444332223334556666667766666666555444433322211111 111110 Q ss_pred --hhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCce- Q lcl|NC_019933. 79 --QQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTL- 155 (394) Q Consensus 79 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~- 155 (394) ...........+.. .. .............+++++||.+||+++...|++.+++.++|++++++++++++.. T Consensus 81 ~~~~~~~~~~~~~~~~-~l-----~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 154 (397) T protein:vir:49 81 KNEEEVKANFVKDFKN-LV-----RGRYQNLLDSKTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVENVTTLTGS 154 (397) T ss_pred chhhHHHHHHHHHHHH-Hh-----hcchhhHHHhhhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcceeeccCCcce Confidence 00111111111111 11 1111223344556677789999999999999999999999999999999987654 Q ss_pred -eEEEEcCcccccceecCCccccccc-cceeeEEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019933. 156 -EYVRETGFTNAAAPVAEGAQKPESS-LRFDLVQTSAKVIAHWMKASRQILSDSA-QLQSFINARLLRGLEVVEENQLLN 232 (394) Q Consensus 156 -~~~~~~~~~~~~~~~~eg~~~~~~~-~~~~~i~~~~~k~~~~~~is~e~l~~s~-~~~~~i~~~la~a~~~~~d~a~l~ 232 (394) .+|+..+..+.++|++||+.+|+++ ++|++|++.+++++++++||+++++++. +++++|.+.|++++++++|.+||+ T Consensus 155 ~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ail~ 234 (397) T protein:vir:49 155 RVYEKWADITGLAKLDDEGGQIGQNDDPKLSLIRYAIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILE 234 (397) T ss_pred EEEEeeccCCcceeeeccccccccccccceeeeEeeeeeeEeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 4555555556789999999999875 7999999999999999999999999985 899999999999999999999999 Q ss_pred ccCCCccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccC-cccCC Q lcl|NC_019933. 233 GNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGN-PQGTL 311 (394) Q Consensus 233 g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~-~~~~~ 311 (394) |+|++.+.. +..+++++.+++.+++..+..+++|+|||.+|.+|++|||++|+|+|.+ ...++ T Consensus 235 G~g~~~~~~----------------~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~g~~l~~~~~~~g~ 298 (397) T protein:vir:49 235 AIGTLPNKP----------------TLAKWDDIIDLQAKVDPAIKQTSLFLTNTSGFTALKKVKNAMGDYLMERDVKSPT 298 (397) T ss_pred ccccccccc----------------cccCHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceeecccccCCC Confidence 998866532 3346889999999999999999999999999999999999999999965 44566 Q ss_pred CceeecceEEEcC--CCCc-----CceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceE Q lcl|NC_019933. 312 APTLWGLPVVATQ--AMAV-----GQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFI 384 (394) Q Consensus 312 ~~~l~G~pv~~~~--~~p~-----~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~ 384 (394) +++|+|+||++++ .+|. ..++||||+.+|.++++.+++++++++...+|.+|++.||++.|+|+++++|+||+ T Consensus 299 ~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~ 378 (397) T protein:vir:49 299 GYSIDGFVVKEISDRFLPNGTGGAMPLYFGDLKQAVTLFDRQHLSLLSTNIGGGAFETDTTKVRVIDRFDVVSTDTEAFV 378 (397) T ss_pred CceecceeeEEecccccccccCCceeEEEeeccceEEEEeecccEEEEeccccchhhcCeeeEEEEEeeccEEecccceE Confidence 6799999998754 3443 35799999999999999999999999888889999999999999999999999999 Q ss_pred EEEecCCCCC Q lcl|NC_019933. 385 KGSLAAAAGT 394 (394) Q Consensus 385 ~l~~~~a~~~ 394 (394) ++++++++.+ T Consensus 379 ~~~~~~~~~~ 388 (397) T protein:vir:49 379 PASFKAIADQ 388 (397) T ss_pred EEEecccccc Confidence 9999988887 No 15 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=100.00 E-value=1.9e-64 Score=369.92 Aligned_cols=383 Identities=18% Similarity=0.162 Sum_probs=291.3 Q ss_pred Cc--hHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc-ccccchhh Q lcl|NC_019933. 1 MS--DINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAG-GDVQHISI 77 (394) Q Consensus 1 Mk--~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~-~~~~~~~~ 77 (394) |. +|++|+++++++.++++++.++.. ..++++|.+++++++..+++.++++|++..+........... ........ T Consensus 1 m~~~~l~~l~e~r~~~~~e~~~l~~~~~-~~~~~~e~~~~~~~l~~e~~~l~~~i~~~~e~~~~~~~~~~~~~~~~~~~~ 79 (392) T protein:vir:13 1 MDATTLSANFEARERATAELRSLTDEFA-GKEMTAEAREKEERLLTAVADFDGRIKRGIDAIKATDAVTSLLSGLQGSGS 79 (392) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhh-cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCccc Confidence 77 589999999999999999888664 356778888899999999999998887544433322111100 00000000 Q ss_pred hhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhh-hhhHHHhcccccccc-Cce Q lcl|NC_019933. 78 GQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQR-RMTIRSLLAQGTMEG-NTL 155 (394) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~-~~~l~~~~~~~~~~~-~~~ 155 (394) ............+.+.+........+.. ......+++.+|.++|+++...+|..+.. .++++.+++++++.+ +.+ T Consensus 80 ~~~~~~~~~~~~~~r~g~~~~~~~~~~~---~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~ 156 (392) T protein:vir:13 80 GAQRSADHDDDAVLRAGNLGEARSFEFA---PEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGASTFTTSDANPM 156 (392) T ss_pred chhhhhhHHHHHHHhccchhhhHHHHhh---hhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCcee Confidence 0001111111111111111111111111 11222334445667777777777765555 556777888888754 458 Q ss_pred eEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_019933. 156 EYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSA-QLQSFINARLLRGLEVVEENQLLNGN 234 (394) Q Consensus 156 ~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~-~~~~~i~~~la~a~~~~~d~a~l~g~ 234 (394) .+|+..+. +.+.|++||+.+|+++++|+++++.++|++++++||+|+++|+. +++++|.++|++++++++|.+||+|+ T Consensus 157 ~~~~~~~~-~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~ 235 (392) T protein:vir:13 157 DFTVITGR-ATAGIVGETAEIPESYPATTQRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFLTGT 235 (392) T ss_pred EEEEEcCC-cceeeecccccccccccceeeEEeeeeeEEeeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccc Confidence 89988764 67889999999999999999999999999999999999999985 89999999999999999999999999 Q ss_pred CCCcccccccccccccccc--ccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCc-ccCC Q lcl|NC_019933. 235 GTGQNLLGLLPQATAFAAP--ITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNP-QGTL 311 (394) Q Consensus 235 g~~~~~~Gi~~~~~~~~~~--~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~-~~~~ 311 (394) |++ .|.||++.++..... ...++..+++++++++..++..+..+++|+||+.++..|++++|++|+|+|.+. ..+. T Consensus 236 Gt~-~p~Gil~~~~~~~~~~~~~~~~~~~~d~l~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~g~ 314 (392) T protein:vir:13 236 GTG-QPRGILTDATGANAAFGEADADSKVSDALIDLFHEVPSAYRKNAKFVVNDLRAAQMRKLKDANGQYLWQSALTVGA 314 (392) T ss_pred CCc-cccccccccccccccccccccccccHHHHHHHHHhhhhhhhcCCEEEEcHHHHHHHHHhhccCCceeecCCcCCCC Confidence 965 588999876544332 334566789999999999999999999999999999999999999999999755 4455 Q ss_pred CceeecceEEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|NC_019933. 312 APTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAA 391 (394) Q Consensus 312 ~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a 391 (394) +++|+|+||++++.+|+++++||||+. |.++.+.++++..+.+ .+|.+|++.||++.|+|+++.+|+||++++++++ T Consensus 315 ~~~l~G~Pv~~~~~~~~~~i~~Gdf~~-~~i~~~~~~~i~~~~~--~~~~~~~~~~r~~~r~d~~~~~~~A~~~~~~~~a 391 (392) T protein:vir:13 315 PDTFNGKVVETDDGMPADKVLFADLSK-YRVRFAGSLRVDRSVD--AKFSTDQIVYRFLQRADGLLVDARGAKVLTVTPA 391 (392) T ss_pred CceecceeeEEcCCCCCCcEEEeeccc-eeEEeecceEEEeecc--ccccCCcEEEEEEEEeccEEecccceEEEEeecc Confidence 568999999999999999999999986 7788889988887665 4589999999999999999999999999999999 Q ss_pred C Q lcl|NC_019933. 392 A 392 (394) Q Consensus 392 ~ 392 (394) | T Consensus 392 a 392 (392) T protein:vir:13 392 A 392 (392) T ss_pred C Confidence 9 No 16 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=100.00 E-value=2.4e-64 Score=369.32 Aligned_cols=385 Identities=16% Similarity=0.110 Sum_probs=290.9 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhh-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhhhh Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQ-ELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQHISIGQ 79 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~-~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~ 79 (394) ...+.+++.++....++++...+....+. ++.++..+.++++..+...++..+.+.++++...+............. T Consensus 142 ~~~l~e~~~~~~~~~~e~k~~~e~~~~e~~e~~~~~~~~~e~l~~~~e~~~~~~~~~~~~~d~~e~~~~~~~~~~~~~-- 219 (543) T protein:vir:81 142 PDSIEDCRFRDPWNLSEMRTFGRDAEEVKGELRARALSAIEKMQGASDNVRAAATKIIERFDDEDSTLARQCLATSSP-- 219 (543) T ss_pred CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhh-- Confidence 33466777777777777777666554433 334444555666666666666655554444443333221111111100 Q ss_pred hhhhHHHHHHHHHHhhhhhhhhHHHHHHH-hhcccccCCcCccccchhhhhHHH-hhhhhhhhHHHhccccccccCceeE Q lcl|NC_019933. 80 QFVNSDSFKAMAESGGQRGRAEINIKAAI-TSLSTNADGSAGATVQTTRLPGIL-ELPQRRMTIRSLLAQGTMEGNTLEY 157 (394) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~g~~ip~~~~~~ii-~~~~~~~~l~~~~~~~~~~~~~~~~ 157 (394) . ....+..+.+..........+.++.. ......++++||++||+++...+| ..++..++|.+++++.+++| .+.+ T Consensus 220 -~-~~~a~~~~~~~~~~~~l~~~e~~~~~~~~~~~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~~~~~~g-~~~~ 296 (543) T protein:vir:81 220 -A-YLRAWSKMARNPHAAILTEEEKRAINEVRAMGLTKADGGYLVPFQLDPTVIITSNGSLNDIRRFARQVVATG-DVWH 296 (543) T ss_pred -h-hhhHHHHHHHhhHHHHhhhhhhhhhhhhhhcccccccCcccCchhhhhHHHHHHHhhhchhhhhcccccCCc-ceEE Confidence 0 00111111111111122222222222 122334566789999999998876 56778899999998877654 5889 Q ss_pred EEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCC Q lcl|NC_019933. 158 VRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSAQLQSFINARLLRGLEVVEENQLLNGNGTG 237 (394) Q Consensus 158 ~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~~~~~~i~~~la~a~~~~~d~a~l~g~g~~ 237 (394) |+..+ .+.+.|++||+.+|+++++|++|++.+++++++++||+++++|++++.++|.+.|++++++++|.+||+|+|++ T Consensus 297 ~~~~~-~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~~~~~~i~~~l~~~~~~~~d~ail~G~Gt~ 375 (543) T protein:vir:81 297 GVSSA-AVQWSWDAEFEEVSDDSPEFGQPEIPVKKAQGFVPISIEALQDEANVTETVALLFAEGKDELEAVTLTTGTGQG 375 (543) T ss_pred EEecC-CcceeecccCccccccccccceeeeeeeeeEeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhccCCCC Confidence 98776 47889999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccccc--ccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcccCCCcee Q lcl|NC_019933. 238 QNLLGLLPQATAFA--APITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPTL 315 (394) Q Consensus 238 ~~~~Gi~~~~~~~~--~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~l 315 (394) +.|.||++...... .+....+..+++++.+++..++..+..+++|+|||.+|..|++++|++|+|+|++...+++++| T Consensus 376 ~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~g~~~~l 455 (543) T protein:vir:81 376 NQPTGIVTALAGTAAEIAPVTAETFALADVYAVYEQLAARHRRQGAWLANNLIYNKIRQFDTQGGAGLWTTIGNGEPSQL 455 (543) T ss_pred cccccchhhcccccccccccccccccHHHHHHHHHhhhccccCCcEEEEcHHHHHHHHHhhcCCCceeccCcCCCCCccc Confidence 99999988655433 2334556678999999999999999999999999999999999999999999998777778899 Q ss_pred ecceEEEcCCCCcCc----------eEEeeccceEEEEeecceEEEEecccc--hhhhcCcEEEEEEEEeccEEecccce Q lcl|NC_019933. 316 WGLPVVATQAMAVGQ----------FLTGAFDAGAQVFDRWAARVEVATENQ--DDFIKNMVTILAEERLALAVYRPESF 383 (394) Q Consensus 316 ~G~pv~~~~~~p~~~----------~~~gd~~~~~~~~~~~~~~i~~~~~~~--~~~~~~~~~~~~~~~~d~~v~~~~a~ 383 (394) +|+||+++++||.+. ++||||+ .|.++.+.++++.++++.. .+|.+|++.|+++.|+||++.+|+|| T Consensus 456 ~G~pv~~~~~~~~~~~~~~~~~~~~i~~gd~~-~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~A~ 534 (543) T protein:vir:81 456 LGRPVGEAEAMDANWNTSASADNFVLLYGNFQ-NYVIADRIGMTVEFIPHLFGTNRRPNGSRGWFAYYRMGADVVNPNAF 534 (543) T ss_pred cceeeEEeccccccccccccCCcceEEEeecc-ceeEEeecccEEEEeccccccchhhcCceEEEEEEeeccEeecccce Confidence 999999999998653 7899998 4778899999999988754 35778999999999999999999999 Q ss_pred EEEEecCCC Q lcl|NC_019933. 384 IKGSLAAAA 392 (394) Q Consensus 384 ~~l~~~~a~ 392 (394) ++++++++| T Consensus 535 ~~l~~~~~a 543 (543) T protein:vir:81 535 RLLNVETAS 543 (543) T ss_pred EEEEecccC Confidence 999999999 No 17 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=100.00 E-value=2.5e-64 Score=369.30 Aligned_cols=386 Identities=16% Similarity=0.183 Sum_probs=294.3 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc---------- Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGG---------- 70 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~---------- 70 (394) |+ |+||+++++++.++++++.+.......++++..++++++++++++|+.+|+++++............ T Consensus 1 M~-i~eL~e~r~~~~~~~~~l~~~~~e~~~lt~ee~~~~~~l~~ei~~l~~~I~~~e~~~~~~~~~~~~~~~~~~~~~~~ 79 (435) T protein:vir:14 1 MN-VNELRRERAAVNQRVQALAQIEVGGTALSVEQQAEFDQLSSKFSELTAQIERAEAAERMAAAAAVPVDPNPTAVAAP 79 (435) T ss_pred CC-HHHHHHHHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccchhhhhhhc Confidence 86 9999999999999999988765555567888899999999999999999987765433222111000 Q ss_pred ---cccchhhhhhhhhHHHHHHHHHHhhhh----------hhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhh Q lcl|NC_019933. 71 ---DVQHISIGQQFVNSDSFKAMAESGGQR----------GRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQR 137 (394) Q Consensus 71 ---~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~ 137 (394) ....+..... .....+..+.+..... ..............+.+++..||++||+++.++|++.+++ T Consensus 80 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~l~~ 158 (435) T protein:vir:14 80 AAAPVHAQPKALE-VKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRP 158 (435) T ss_pred cccccccccchhh-hhHHHHHHHHHHHHhhcchhhHHHHHHHhhhhhhhhhhhcccCCcCCCccccchhHHHHHHHHHhh Confidence 0000000000 0001111111110000 0000000112234556666778999999999999999999 Q ss_pred hhhHHHh-ccccccccCceeEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHH---HHHHHH Q lcl|NC_019933. 138 RMTIRSL-LAQGTMEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDS---AQLQSF 213 (394) Q Consensus 138 ~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s---~~~~~~ 213 (394) .++++++ ++.+|+.++.+++|+.++. +.+.|++|++.+|+++++|++|++.+++++++++||+|+++|+ ++++++ T Consensus 159 ~~~i~~~~~~~~~~~~~~~~~p~~~~~-~~a~~v~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~~~l~~~ 237 (435) T protein:vir:14 159 KSVVRKLGARTLPLSNGNITIPRLKGG-AIVGYIGADTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVDQI 237 (435) T ss_pred hchhhhhcceeeecCCCceEEEEEeCC-cceeeeccCccccccccceeEEEeeeEEEEEeehhhHHHHHhhccCHHHHHH Confidence 9999997 7788888888999999764 6788999999999999999999999999999999999999998 369999 Q ss_pred HHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccc---cccccchHHHHHHHHHHhhhh--cCCCCeeEeCHHH Q lcl|NC_019933. 214 INARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPI---TVANATAVDRLRLALLQAQLA--EFPATGIVLNPAD 288 (394) Q Consensus 214 i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~---~~~~~~~~~~i~~~~~~~~~~--~~~~~~~~~~~~~ 288 (394) |.++|++++++++|.+|++|+|+++.|.||++.+....+.. ..+......++.+++..+... ++.+++|+|||.+ T Consensus 238 i~~~l~~ai~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~v~n~~~ 317 (435) T protein:vir:14 238 VVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPSNVITASDASTLQKIETDLGKVILALENADANLTQPGWIMAPRT 317 (435) T ss_pred HHHHHHHHHHHHHHHHhhccCCCCccccceeecccccceeccccccchhhHHHHHHHHHHHhhhccccccCCEEEEcHHH Confidence 99999999999999999999999999999987655433222 222233455666666666544 5568899999999 Q ss_pred HHHHHHhhccCCcccccCcccCCCceeecceEEEcCCCCcC--------ceEEeeccceEEEEeecceEEEEecccc--- Q lcl|NC_019933. 289 WAGIELLKDTQGRYILGNPQGTLAPTLWGLPVVATQAMAVG--------QFLTGAFDAGAQVFDRWAARVEVATENQ--- 357 (394) Q Consensus 289 ~~~l~~lkd~~G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~--------~~~~gd~~~~~~~~~~~~~~i~~~~~~~--- 357 (394) |..|++++|++|+|+|+.. ..++|+|+||++++.||.+ .++||||+. ++++++.+++++++++.. T Consensus 318 ~~~L~~lkd~~G~~l~~~~---~~g~l~G~Pv~~~~~~p~~~~~~~~~~~i~~gd~s~-~~i~~~~~~~~~~~~~~~~~~ 393 (435) T protein:vir:14 318 FRFLEGLRDGNGNKVYPEL---ANGMLKGYPVGKTTQVPINLGETGKESEIYFTDFGD-VFIGEEETLEIDYSKEATYKD 393 (435) T ss_pred HHHHHHhhccCCceeccCC---CCCeeecceeEeeccccccccCCCccceEEEeeccc-EEEEEecccEEEEeccccccc Confidence 9999999999999999643 3458999999999999863 589999997 568899999999988754 Q ss_pred ------hhhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|NC_019933. 358 ------DDFIKNMVTILAEERLALAVYRPESFIKGSLAAAAG 393 (394) Q Consensus 358 ------~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~ 393 (394) .+|++|++.||++.|+||++.+|+||++++=.+-+. T Consensus 394 ~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:14 394 ADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLAGVAWGA 435 (435) T ss_pred cccchhhhhhcChhheeeeeeeCceeecccceEEEecCCCCC Confidence 569999999999999999999999999999666555 No 18 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=100.00 E-value=3.7e-64 Score=368.35 Aligned_cols=382 Identities=15% Similarity=0.194 Sum_probs=287.8 Q ss_pred Cch-HHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHhhccccccc Q lcl|NC_019933. 1 MSD-INAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQR------IAEVEGNGAGGDVQ 73 (394) Q Consensus 1 Mk~-i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~------~~~~~~~~~~~~~~ 73 (394) |++ |++|+++++++.++++++.++.....+ +++.+.++++.|++++++.++. .............. T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~~~~~e-------e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 (404) T protein:vir:10 1 MSKELRELLNQLDSKNKELNSLLNKDGVTAE-------ELNKTSNEIDILQAKIEAQKRKENIENNFNEDNVKSLNTGKE 73 (404) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHhhcCCCHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccc Confidence 996 999999999999998888765443332 3444555555555555433322 22222111111111 Q ss_pred chhhhhhhhhH-HHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhcccccccc Q lcl|NC_019933. 74 HISIGQQFVNS-DSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEG 152 (394) Q Consensus 74 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~ 152 (394) ........... ............... .........+..+++++||++||+++.++|++.+++.++|++++++.|+++ T Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~e~~a~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~~~~ 151 (404) T protein:vir:10 74 ENVIYNGALFVRAIADNLLKQKNQRGL--NLSEKEINAISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEPVFT 151 (404) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHhhhh--cchhhHHhhhccccCCCCceeechhHHHHHHHHHhhhhhHhhhhceeeccC Confidence 11110000000 000001111100000 011122334556677889999999999999999999999999999999875 Q ss_pred C--ceeEEEEcCcccccceecCCcccccc--ccceeeEEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 153 N--TLEYVRETGFTNAAAPVAEGAQKPES--SLRFDLVQTSAKVIAHWMKASRQILSDSA-QLQSFINARLLRGLEVVEE 227 (394) Q Consensus 153 ~--~~~~~~~~~~~~~~~~~~eg~~~~~~--~~~~~~i~~~~~k~~~~~~is~e~l~~s~-~~~~~i~~~la~a~~~~~d 227 (394) . .+.+|+..+ ...+.|++||+.+|.+ +++|+++++++++++++++||+|++++++ +++++|++.|++++++++| T Consensus 152 ~~g~~~~~~~~~-~~~~~~v~e~~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~ 230 (404) T protein:vir:10 152 RSGSRTYEKRSK-QKPMKPLSENQQIPTNGDNGKLERFNFKLKDLADFMSIPNDLLKFADKSLEDWIINWFVDKVRITRN 230 (404) T ss_pred CccceEEEEecC-CcceeeccccccccccccccceeeeEeeheeeEeeehhhHHHHhhcHHHHHHHHHHHHHHHHHHHHH Confidence 4 566777655 4678899999999875 58999999999999999999999999985 8999999999999999999 Q ss_pred HHHhhccCCCccccccccccccccccccccccchHHHHHHHHH-HhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccC Q lcl|NC_019933. 228 NQLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLALL-QAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGN 306 (394) Q Consensus 228 ~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~-~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~ 306 (394) .+||+|+|++++|.|+++..+..+.+. ++..+++++..++. .++..+..+++|+|||.+|.+|+++||++|+|+|.+ T Consensus 231 ~~il~G~g~~~~~~gi~~~~~~~~~~~--~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~ 308 (404) T protein:vir:10 231 AEILYGAGGDEHATGIMTANKFKKITL--PKSPALKDFKKCKNVELLNVFKATSSWIVNQDGFNYLDSLEDKTGRPYLQP 308 (404) T ss_pred HHHhhcCCCCCcccceeeccccceeec--cccccHHHHHHHHHhhhhccccCCCEEEEcHHHHHHHHHhhccCCceeecc Confidence 999999999999999998776655443 44557888887765 677778888899999999999999999999999975 Q ss_pred -cccCCCceeecceEEEc-CCCCcC-----ceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEec Q lcl|NC_019933. 307 -PQGTLAPTLWGLPVVAT-QAMAVG-----QFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYR 379 (394) Q Consensus 307 -~~~~~~~~l~G~pv~~~-~~~p~~-----~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~ 379 (394) ..++.+++|+|+||++. +.+|.+ .++||||++++.++.+.+++++++++.+.+|.+|++.||++.|+|+++.+ T Consensus 309 ~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~v~~ 388 (404) T protein:vir:10 309 DPKDPTQYRFLGLPVIELPNDLLLSTESAIPVLLGDTKEAYKYVSDGAYELATTNIGAGAFETNTTKARIIMRIDGNVKD 388 (404) T ss_pred CcCCCCCccccceeeEEecccccCCCCCccEEEEEeccccEEEEEecceEEEEeccccchhhcCceEEEEEEeeccEEec Confidence 44556679999999854 445433 37899999999999999999999999888999999999999999999999 Q ss_pred ccceEEEEecCCCCC Q lcl|NC_019933. 380 PESFIKGSLAAAAGT 394 (394) Q Consensus 380 ~~a~~~l~~~~a~~~ 394 (394) |+||+++++++++.. T Consensus 389 ~~a~~~~~~~~aa~~ 403 (404) T protein:vir:10 389 SEALLIAEIPVESVQ 403 (404) T ss_pred ccceEEEEeecccCC Confidence 999999999999988 No 19 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=100.00 E-value=3.1e-64 Score=368.77 Aligned_cols=372 Identities=14% Similarity=0.154 Sum_probs=286.8 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc--cccchhh- Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGG--DVQHISI- 77 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~--~~~~~~~- 77 (394) ||+++||++++.++.++++++.++........+...++++++.+++..++++++.+++............ ....... T Consensus 1 Mk~~~el~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (397) T protein:vir:48 1 MKTSNELHDLWVAQGDKVENLNEKLNVAMLDDSVTAEELQAIKNERDTAKMKRDMFKEQYTEARANEVVNMSEEEKKPLT 80 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhcccccc Confidence 9999999999999999888876665444333333445566666666666666665554444332221111 1111111 Q ss_pred -hhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCcee Q lcl|NC_019933. 78 -GQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLE 156 (394) Q Consensus 78 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 156 (394) ..........+.+..... ............+++++||++||+++.++|++.+++.++|+++++++|++++..+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 154 (397) T protein:vir:48 81 KSEEEVKAGFVKDFKNLVR------GRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVENVTTLTGS 154 (397) T ss_pred chhhHHHHHHHHHHHHHHh------hhhhHHHHHhhccCCccccccccHHHHHHHHHHHHHHHHHHhhhceeeccCCcce Confidence 111111111111111111 1111223344555667899999999999999999999999999999999887666 Q ss_pred EEEE--cCcccccceecCCcccccc-ccceeeEEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019933. 157 YVRE--TGFTNAAAPVAEGAQKPES-SLRFDLVQTSAKVIAHWMKASRQILSDSA-QLQSFINARLLRGLEVVEENQLLN 232 (394) Q Consensus 157 ~~~~--~~~~~~~~~~~eg~~~~~~-~~~~~~i~~~~~k~~~~~~is~e~l~~s~-~~~~~i~~~la~a~~~~~d~a~l~ 232 (394) +|+. .+..+.++|++||+.+|++ +++|++|++++++++++++||+|+++++. ++++||.+.|++++++++|.+||+ T Consensus 155 ~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~v~~~l~~~~~~~~d~~il~ 234 (397) T protein:vir:48 155 RVYEKWADITGLAKLDDEAGSIGTNDDPKLYPIRYAIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILE 234 (397) T ss_pred EEEEeecCCCcceeeeccccccccccccceeeEEeeheeeeeehhhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 6544 4444568999999999987 58999999999999999999999999985 899999999999999999999999 Q ss_pred ccCCCccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCc-ccCC Q lcl|NC_019933. 233 GNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNP-QGTL 311 (394) Q Consensus 233 g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~-~~~~ 311 (394) |+|++.+.. +..++++|.+++..++..+..+++|+|||.+|..|+++||++|+|+|++. ..+. T Consensus 235 G~g~~~~~~----------------~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~ 298 (397) T protein:vir:48 235 AIATLPTKP----------------TLTKWDDIIDLQAKVDPAIKQTSFFLTNTSGFTALKKVKNAFGDYLMERDVKSPT 298 (397) T ss_pred ccccccccc----------------ccccHHHHHHHHHHhhhhhcCCCEEEECHHHHHHHHHhhcCCCceeeccCcCCCC Confidence 998876543 33468899999999999999999999999999999999999999999754 4556 Q ss_pred CceeecceEEEcCC--CC-----cCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceE Q lcl|NC_019933. 312 APTLWGLPVVATQA--MA-----VGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFI 384 (394) Q Consensus 312 ~~~l~G~pv~~~~~--~p-----~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~ 384 (394) +++|+|+||++++. +| ...++||||+.++.++++.+++++++++...+|.+|++.||+++|+|+++++|+||+ T Consensus 299 ~~~l~G~PV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~ 378 (397) T protein:vir:48 299 GYSIDGFAVKEVADRWLANASSGAMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKIRVIDRFDVVATDTESFV 378 (397) T ss_pred CceeccceeEEecccccCCcCCCceEEEEEeccceEEEEeecceEEEEeccchhhhhcCceeEEEEeeeccEEecccceE Confidence 67999999987543 33 345789999998999999999999999888889999999999999999999999999 Q ss_pred EEEecCCCCC Q lcl|NC_019933. 385 KGSLAAAAGT 394 (394) Q Consensus 385 ~l~~~~a~~~ 394 (394) +++++++++. T Consensus 379 ~~~~~~~~~~ 388 (397) T protein:vir:48 379 PASFKAIADQ 388 (397) T ss_pred EEEecccccC Confidence 9999999877 No 20 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=100.00 E-value=2.4e-64 Score=369.37 Aligned_cols=382 Identities=17% Similarity=0.140 Sum_probs=288.6 Q ss_pred Cc--hHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc-cccccchhh Q lcl|NC_019933. 1 MS--DINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGA-GGDVQHISI 77 (394) Q Consensus 1 Mk--~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~-~~~~~~~~~ 77 (394) |. +|++|+++++++.++++.+.++.. +.++++|.+++++++..+++.++++|++..+.......... ......... T Consensus 1 m~~~~l~~l~e~r~~~~~e~~~L~~~~~-~~~lt~e~~~~~~~l~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (390) T protein:vir:62 1 MDATTLSANFEARERATAELRTLTDEFA-GKEMTDEAREKEERLITAVSDYDARIKRGIEAIKAIDPVTSLLSGLQGSGS 79 (390) T ss_pred CChhHHHHHHHHHHHHHHHHHHHHHHhh-cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Confidence 55 699999999999999999887653 45688899999999999999999999865544433221111 000000000 Q ss_pred hhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhcccccccc-Ccee Q lcl|NC_019933. 78 GQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEG-NTLE 156 (394) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~ 156 (394) ............+.+.+........... ......+...+|++++|+.+...|++.++..++++++++++++++ +.+. T Consensus 80 ~~~~~~~~~~~~~~r~~~~~~~r~~~~~--~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~ 157 (390) T protein:vir:62 80 GAQRSADVDDDATLRAGNLGEARSFEFA--PEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGATTFTTSDANPLD 157 (390) T ss_pred cchhhcchHHHHHHhhhhhhhhHHHHhh--hhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeE Confidence 0001111111111111111111111111 111122222334444444444455566777778888999999865 4588 Q ss_pred EEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_019933. 157 YVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSA-QLQSFINARLLRGLEVVEENQLLNGNG 235 (394) Q Consensus 157 ~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~-~~~~~i~~~la~a~~~~~d~a~l~g~g 235 (394) +|+.++. +.+.|++|++.+|+++++|+++++++++++++++||+|+++|+. +++++|.++|+++++.++|.+||+|+| T Consensus 158 ~p~~~~~-~~a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~G 236 (390) T protein:vir:62 158 FTVITGR-SSASIVGETAEIPESYPATAQRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFITGTG 236 (390) T ss_pred EEEEcCC-cceeeecccccccccccceeeeEeeeeeEEeehHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhhhccCC Confidence 9998764 67889999999999999999999999999999999999999985 899999999999999999999999987 Q ss_pred CCccccccccccccccc--cccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcc-cCCC Q lcl|NC_019933. 236 TGQNLLGLLPQATAFAA--PITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQ-GTLA 312 (394) Q Consensus 236 ~~~~~~Gi~~~~~~~~~--~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~-~~~~ 312 (394) .|.||++....... ....++..+++++++++..+...+..+++|+||+.++..|++|||++|+|||++.. .+.+ T Consensus 237 ---~p~Gi~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~g~~ 313 (390) T protein:vir:62 237 ---QPRGILTDASPATATFLATDTDSKVSDALIDLFHEVPSAYRANAKYVVNDLRAAQMRKLKDANGQYLWQSGLTVGAP 313 (390) T ss_pred ---ccccccccccccccceecccccccchHHHHHHHHhhhhhhhcCCEEEEchHHHHHHHHhhccCCCeeecCCcCCCcc Confidence 37899887654332 33344566899999999999999999999999999999999999999999997654 4555 Q ss_pred ceeecceEEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecCCC Q lcl|NC_019933. 313 PTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAAA 392 (394) Q Consensus 313 ~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~ 392 (394) .+|+|+||++++.+|++.++||||+. |.++.+.++.+..+.+. +|.+|++.||++.|+|+++.+|+||++|+++++| T Consensus 314 ~~l~G~Pv~~~~~~p~~~i~~gd~s~-~~i~~~~~~~v~~~~~~--~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~~~~a 390 (390) T protein:vir:62 314 SLFNGKVVETDDGMPADKILFADLSK-YRVRFAGSLRVDRSVDA--KFSTDQIVYRFLQRADGLLVDARGAKVLTVTPGA 390 (390) T ss_pred ceecccceEEecCCCCccEEEeeccc-eeEEeecceEEEeeccc--cccCCcEEEEEEEEeCcEeechhheEEEEeecCC Confidence 68999999999999999999999986 67888899888887654 5899999999999999999999999999999999 No 21 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=100.00 E-value=3.6e-64 Score=368.42 Aligned_cols=387 Identities=16% Similarity=0.166 Sum_probs=301.4 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc-------ccccc Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGA-------GGDVQ 73 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~-------~~~~~ 73 (394) |+ |+||+++++++.++++++.++.. +..+++|.+++++++.+++++++++|++.++.......... ..... T Consensus 1 M~-l~eL~e~r~~l~~e~~~l~~k~~-~~~~t~e~~~~~~~~~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~ 78 (409) T protein:vir:45 1 MK-LHELKQKRNTIATDMRALNEKIG-DNAWTEEQRTEWNKAKSELEALDERIAREEELRRQDQAYIESNEEEQRQNLDP 78 (409) T ss_pred CC-HHHHHHHHHHHHHHHHHHHHHhh-cCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcccCCC Confidence 99 89999999999999999988653 34578889999999999999999998866544332221110 00000 Q ss_pred chhhhhhhhhHHHHHHHHHHhhhhhhhhHHHHH--HHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccc Q lcl|NC_019933. 74 HISIGQQFVNSDSFKAMAESGGQRGRAEINIKA--AITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTME 151 (394) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~ 151 (394) .............+..+.+.... .....+++. .......+++++||++||+++.++|++.+++.++|+++|++++++ T Consensus 79 ~~~~~~~~~~~~a~~~~l~~~~~-~~~~~e~~~~~~~~a~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~ 157 (409) T protein:vir:45 79 ENNSQQDEKRAQVFDKWMRHGAS-ELTSEERKALRELRAQGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQILTTS 157 (409) T ss_pred CCcchhhHHHHHHHHHHHHhhhh-hccHHHHHHHHHHhhccCccCcCCceeccHhHHHHHHHHHHhhhhhhhhceeeecC Confidence 00000111112223333322211 122223332 234555667778999999999999999999999999999999997 Q ss_pred cCc-eeEEEEcCcccccceecCCccccccccceeeEEeeeeeEE-EeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 152 GNT-LEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIA-HWMKASRQILSDS-AQLQSFINARLLRGLEVVEEN 228 (394) Q Consensus 152 ~~~-~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~-~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~ 228 (394) ++. ..+|+..+....++|++||+.+|+++++|+++++.++|++ ++++||+|+++|+ +++++||.++|+++++.++|. T Consensus 158 ~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~f~~~~l~~~k~~~~~i~is~ell~ds~~~l~~~i~~~la~a~~~~~~~ 237 (409) T protein:vir:45 158 DGRTMEWATADGTSEVGVLLGENEEAGEEDTDFGMGSLGALKMTSKIIRVSNELLQDSAIDMEAYLARRIAERIGRGEAR 237 (409) T ss_pred CCceEEEEeeccCccccccccccccccccccccceeeeeeeeeeeeehhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHH Confidence 654 4556655555667899999999999999999999999985 5789999999998 599999999999999999999 Q ss_pred HHhhccCCC--ccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCee--EeCHHHHHHHHHhhccCCcccc Q lcl|NC_019933. 229 QLLNGNGTG--QNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGI--VLNPADWAGIELLKDTQGRYIL 304 (394) Q Consensus 229 a~l~g~g~~--~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~--~~~~~~~~~l~~lkd~~G~~~~ 304 (394) +||+|+|++ ..|.||++..+..... ..++..+++++.+++..++..+..++.| +||+.++.+|++|||++|+|+| T Consensus 238 a~l~G~G~~~~~~p~Gil~~~~~~~~~-~~~~~~~~d~i~~l~~~l~~~~~~~a~~~~~~n~~~~~~l~~lkd~~G~~i~ 316 (409) T protein:vir:45 238 YLIQGTGAGTPKQPKGLAASVTGTTQT-AAANAVKWQEILALKHSIDPAYRRGPKFRLAFNDNTLKLISEMEDGQGRPLW 316 (409) T ss_pred HhhccCCCCCccccceeeecccccccc-ccccccchHHHHHHHHhhhhhhccCCeEEEEECHHHHHHHHHhhcCCCceee Confidence 999999876 4689999876654333 3456678899999999999999888765 7799999999999999999999 Q ss_pred cCc-ccCCCceeecceEEEcCCCCc-----CceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEe Q lcl|NC_019933. 305 GNP-QGTLAPTLWGLPVVATQAMAV-----GQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVY 378 (394) Q Consensus 305 ~~~-~~~~~~~l~G~pv~~~~~~p~-----~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~ 378 (394) ++. ..+.+.+|+|+||++++.||. ..++||||+. |.++.+.++.+++..+. ++.+|++.||++.|+|+++. T Consensus 317 ~~~~~~~~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~-~~i~~~~~~~~~~~~d~--~~~~~~~~~~~~~r~d~~~~ 393 (409) T protein:vir:45 317 LPDIVGVAPASVLNVPYVIDQEIDDIGAGKKFMFCGDFDR-FIIRRVRYMILKRLVER--YAEYDQTGFLAFHRFDCILE 393 (409) T ss_pred ccCcCCCCCceecceeeEEecCcCCccCCccEEEEeehhh-hheeeccceEEEEeecc--cccCCcEEEEEEEEeccEee Confidence 754 445567899999999999985 3478899997 55778888888887655 47889999999999999999 Q ss_pred cccceEEEEecCCCCC Q lcl|NC_019933. 379 RPESFIKGSLAAAAGT 394 (394) Q Consensus 379 ~~~a~~~l~~~~a~~~ 394 (394) +|+||+++++++++|. T Consensus 394 ~~~A~~~l~~k~s~~~ 409 (409) T protein:vir:45 394 DTSAIKALVGKGSVGG 409 (409) T ss_pred chhheEEEEeccCCCC Confidence 9999999999999888 No 22 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=100.00 E-value=5.3e-64 Score=367.46 Aligned_cols=386 Identities=17% Similarity=0.186 Sum_probs=293.2 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc-------c- Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGD-------V- 72 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~-------~- 72 (394) |+ |+||+++++++.++++++.+.......++++..++++++++++++++.+|+++++............. . T Consensus 1 M~-l~eL~~~r~~~~~~~~~l~~~~~e~~~l~~ee~~~~~~l~~ei~~l~~~i~~~e~~e~~~~~~~~~~~~~~~~~~~~ 79 (435) T protein:vir:80 1 MN-VNELRRERAAVNQRVQALAQIEVGGTALSVEQQAEFDQLSSKFNELTAQIERAEAAERMAAAAAVPVDPNPAAVTAS 79 (435) T ss_pred CC-HHHHHHHHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccchhhhhccc Confidence 86 79999999999999999877655445678889999999999999999999987754332211111000 0 Q ss_pred c-----chhhhhhhhhHHHHHHHHHHhhhh-h---------hhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhh Q lcl|NC_019933. 73 Q-----HISIGQQFVNSDSFKAMAESGGQR-G---------RAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQR 137 (394) Q Consensus 73 ~-----~~~~~~~~~~~~~~~~~~~~~~~~-~---------~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~ 137 (394) . .+.... ......+..+.+..... + .............++++...||++||+++.++|++.+++ T Consensus 80 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lvP~~~~~~ii~~l~~ 158 (435) T protein:vir:80 80 AAAPVYAQPKAP-EVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRP 158 (435) T ss_pred cccccccccchh-hhhHHHHHHHHHHHHhccchhHHHHHHHHhhhhhhhhhhhhcccCCCCCccccchhHHHHHHHHHhh Confidence 0 000000 00111111111110000 0 000001111233445566778999999999999999999 Q ss_pred hhhHHHh-ccccccccCceeEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHH---HHHHHH Q lcl|NC_019933. 138 RMTIRSL-LAQGTMEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDS---AQLQSF 213 (394) Q Consensus 138 ~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s---~~~~~~ 213 (394) .++|+++ ++++|+..+.+++|+.++. +.+.|++|++.+|+++++|++|++.+++++++++||+|+++|+ ++++++ T Consensus 159 ~~~i~~~~~~~v~~~~~~~~~p~~~~~-~~a~~v~E~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~l~~~ 237 (435) T protein:vir:80 159 KSVVRKLGARTLPLSNGNITIPRLKGG-AIVGYIGADTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVDQI 237 (435) T ss_pred hchhhhccceeeecCCCceEEEEEeCC-cceeeeccCccccccccceeeEEEeeEEEEEeehhhHHHHHhhcccHHHHHH Confidence 9999998 7888988888999999764 6788999999999999999999999999999999999999987 369999 Q ss_pred HHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccc---cccchHHHHHHHHHHhhhh--cCCCCeeEeCHHH Q lcl|NC_019933. 214 INARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITV---ANATAVDRLRLALLQAQLA--EFPATGIVLNPAD 288 (394) Q Consensus 214 i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~---~~~~~~~~i~~~~~~~~~~--~~~~~~~~~~~~~ 288 (394) |.++|++++++++|.+||+|+|+++.|.||++.+......... +......++.+++..+... +..+++|+|||.+ T Consensus 238 i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~vmn~~~ 317 (435) T protein:vir:80 238 VVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPGNVITASDGSTLQKIETDLGKAILALENADANLTQPGWIMAPRT 317 (435) T ss_pred HHHHHHHHHHHHHHHHhhccCCCCCcccceeecccccceeecccccchhhHHHHHHHHHHHhhccccccccCEEEEcHHH Confidence 9999999999999999999999999999999876554433222 2223345667766666544 4567899999999 Q ss_pred HHHHHHhhccCCcccccCcccCCCceeecceEEEcCCCCcC--------ceEEeeccceEEEEeecceEEEEecccc--- Q lcl|NC_019933. 289 WAGIELLKDTQGRYILGNPQGTLAPTLWGLPVVATQAMAVG--------QFLTGAFDAGAQVFDRWAARVEVATENQ--- 357 (394) Q Consensus 289 ~~~l~~lkd~~G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~--------~~~~gd~~~~~~~~~~~~~~i~~~~~~~--- 357 (394) +.+|++++|++|+|+|+.. .+++|+|+||++++.||.+ .++||||+. ++++++.+++++++++.. T Consensus 318 ~~~L~~lkd~~G~~l~~~~---~~~~l~G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~-~~i~~~~~~~i~~~~~~~~~~ 393 (435) T protein:vir:80 318 FRFLEGLRDGNGNKVYPEL---ANGMLKGYPVGKTTQVPINLGEAGKESEIYFTDFGD-VFIGEEETLEIDYSKEATYKD 393 (435) T ss_pred HHHHHhhhccCCceeccCC---CCCeEeeeeeEEeccccccccCCCCcceEEEEEccc-EEEEeecceEEEEeccccccc Confidence 9999999999999999643 3458999999999999863 589999997 568899999999988764 Q ss_pred ------hhhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|NC_019933. 358 ------DDFIKNMVTILAEERLALAVYRPESFIKGSLAAAAG 393 (394) Q Consensus 358 ------~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~ 393 (394) .+|++|++.||++.|+||++.+|+||++++=.+-+. T Consensus 394 ~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:80 394 ADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSGVAWGA 435 (435) T ss_pred cccchhhhhhcCcceeeeeeeeCcEeecccceEEEeccCCCC Confidence 459999999999999999999999999998444433 No 23 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=100.00 E-value=2e-63 Score=364.34 Aligned_cols=391 Identities=27% Similarity=0.353 Sum_probs=277.8 Q ss_pred Cch-HHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhhhh Q lcl|NC_019933. 1 MSD-INAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQHISIGQ 79 (394) Q Consensus 1 Mk~-i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~ 79 (394) ||+ .+...++..+..++++...++.....+...+..+++..+.+...++++................ .....+...+ T Consensus 2 ~ke~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~ 79 (413) T protein:vir:81 2 VKEAGDAPTNAQVAEIAEVKSMVEQFKADEDAKRERAKSVKANQDFLRELQEATAGSVDSEKSGELTR--KGEGYKSIGE 79 (413) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhHHHhhhHhh--hhhhhhhhhh Confidence 554 2333333344445555555554444333333333333222222222221111000000000000 0000111111 Q ss_pred hhhh-----HHHHHHHH-HHhhhhhhhhHHHHHHH-hhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhcccccccc Q lcl|NC_019933. 80 QFVN-----SDSFKAMA-ESGGQRGRAEINIKAAI-TSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEG 152 (394) Q Consensus 80 ~~~~-----~~~~~~~~-~~~~~~~~~~~~~~~~~-~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~ 152 (394) ...+ ........ ...........+.++.. .....++++.+|.++|+++.++|++.+++.++|+++++++|+++ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~~~~~~ 159 (413) T protein:vir:81 80 FFAKRAGDQIKQQAGGAQLNYSVGEYVAPRVKAASDPASTATLTDEFQGGYGTTWNRNIIYRRREKLVVADLMDNLTMTN 159 (413) T ss_pred hhhhhhhhHHHHHHHHHHhhhhhhhhhhhHHHhhhhhhhhcccccccccccchhhHHHHHHHHhhhhhHHhhcceeeccC Confidence 1000 00000000 00111111122222222 22334555678999999999999999999999999999999999 Q ss_pred CceeEEEEcCcc---cccceecCCccccccc-cceeeEEeeeeeEEEeehhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 153 NTLEYVRETGFT---NAAAPVAEGAQKPESS-LRFDLVQTSAKVIAHWMKASRQILSDSAQLQSFINARLLRGLEVVEEN 228 (394) Q Consensus 153 ~~~~~~~~~~~~---~~~~~~~eg~~~~~~~-~~~~~i~~~~~k~~~~~~is~e~l~~s~~~~~~i~~~la~a~~~~~d~ 228 (394) +.+++|+..... ..+.|++||+.+|+++ ++|+++++.+++++++++||+|+++|++++++||++.|++++++++|+ T Consensus 160 ~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~l~~~i~~~la~~~~~~~d~ 239 (413) T protein:vir:81 160 TTIKYLMEKANRVVEGGFKTVAEGGKKPYMRFADFDIVTESLSKIAGLTKITDEMIEDYDFLVSYINARLLEELAIEEER 239 (413) T ss_pred CceeEEEeccccccccccceecCcccccccCcccceeeEeeeeeEEEeehhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 989999987542 3578999999999987 689999999999999999999999999999999999999999999999 Q ss_pred HHhhccCCCccccccccccccccccccccccchHHHHHHHHHHhhhh-cCCCCeeEeCHHHHHHHHHhhccCCcccccCc Q lcl|NC_019933. 229 QLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLA-EFPATGIVLNPADWAGIELLKDTQGRYILGNP 307 (394) Q Consensus 229 a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~-~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~ 307 (394) +||+|+|++++|.||++.++..+.... ++...++++..++..+... ++.+++|+|||.+|.+|++|||++|+|||.+. T Consensus 240 ~~l~G~G~~~~~~Gi~~~~~~~~~~~~-~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~ 318 (413) T protein:vir:81 240 QLLLGDGTGNNLTGLLKRDGIQTLAVS-NKDELADSIYKAMTNISLATPFQADALVINPLDYQELRLAKDANGQYYGGGV 318 (413) T ss_pred HHhccCCCCCccccccccccccccccc-ccchhHHHHHHHHHHhhhhccCCCcEEEEcHHHHHHHHHhhccCCceecccc Confidence 999999999999999998877665543 3445677888887776554 34566799999999999999999999999765 Q ss_pred ccC--------CCceeecceEEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEec Q lcl|NC_019933. 308 QGT--------LAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYR 379 (394) Q Consensus 308 ~~~--------~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~ 379 (394) ... ..++|+|+||++++.+|+++++||||+.+|+++++.+++++++++...+|.+|++.||+++|+|+.+.+ T Consensus 319 ~~~~~~~~~~~~~~~l~G~pv~~s~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~~~r~d~~~~~ 398 (413) T protein:vir:81 319 FQGQYGSGGIMLDPAPWGLRTVQSQVVPVGKPVVGAFRSAASVLRKGGVRIDSTNTNVDDFENNLITVRAEERVGLMVTF 398 (413) T ss_pred ccccccccccccCceecceeeEEcCCCCcccEEEEecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEec Confidence 432 235899999999999999999999999999999999999999999888999999999999999999999 Q ss_pred ccceEEEEecCCCCC Q lcl|NC_019933. 380 PESFIKGSLAAAAGT 394 (394) Q Consensus 380 ~~a~~~l~~~~a~~~ 394 (394) |+||++++++++..- T Consensus 399 ~~a~~~l~~~~~~~p 413 (413) T protein:vir:81 399 PEAIVQLDVAEVVTP 413 (413) T ss_pred ccceEEEEecCCCCC Confidence 999999998776666 No 24 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=100.00 E-value=5.7e-64 Score=367.33 Aligned_cols=373 Identities=14% Similarity=0.181 Sum_probs=276.0 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc-------ccccc Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGA-------GGDVQ 73 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~-------~~~~~ 73 (394) |++|+||++++.++.++++++.++.....+......++++++.+++..+.+++++++.++...+.... ..... T Consensus 4 ~m~l~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (408) T protein:vir:10 4 KLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLN 83 (408) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc Confidence 66899999999999999888877654432211112223333444444444444433333333222111 11111 Q ss_pred chhhhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccC Q lcl|NC_019933. 74 HISIGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGN 153 (394) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~ 153 (394) .............+..+.+... .... .........+++++||++||+++.++||+.+++.++|+++++++|+++. T Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~----~~~~-~~~~~a~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~ 158 (408) T protein:vir:10 84 KSENELKDKFVKDFVNMVRNPM----AFMN-TVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTS 158 (408) T ss_pred cchhhhHHHHHHHHHHHhhcch----hhhh-hhhhhhhhcccccCCceeccHhHHHHHHHHHHhhchhhhhcceeeccCC Confidence 1010000001111111111111 1111 1123445566777899999999999999999999999999999999876 Q ss_pred ceeEE--EEcCcccccceecCCcccccc-ccceeeEEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 154 TLEYV--RETGFTNAAAPVAEGAQKPES-SLRFDLVQTSAKVIAHWMKASRQILSDSA-QLQSFINARLLRGLEVVEENQ 229 (394) Q Consensus 154 ~~~~~--~~~~~~~~~~~~~eg~~~~~~-~~~~~~i~~~~~k~~~~~~is~e~l~~s~-~~~~~i~~~la~a~~~~~d~a 229 (394) ...+| +..+..+.+.|++||+.+|++ .++|++|++.+++++++++||+|+++|+. ++.++|.+.|+++++.++|.+ T Consensus 159 ~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~ 238 (408) T protein:vir:10 159 NGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQA 238 (408) T ss_pred cceEEEeeccccccceeeecCccccccccCcceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHH Confidence 55554 444445678899999999975 58999999999999999999999999985 899999999999999999999 Q ss_pred HhhccCCCccccccccccccccccccccccchHHHHHHHH-HHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCc- Q lcl|NC_019933. 230 LLNGNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLAL-LQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNP- 307 (394) Q Consensus 230 ~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~- 307 (394) |++|+|++.+.. +..+++++.+++ ..++..+..++.|+|||.+|..|+++||++|+|+|++. T Consensus 239 il~g~g~~~~~~----------------~~~~~~~l~~~~~~~~~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~ 302 (408) T protein:vir:10 239 IIEVMKAAPKKP----------------TIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDP 302 (408) T ss_pred Hhhccccccccc----------------ccccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCceEeccCc Confidence 999998765432 234578888865 57888888899999999999999999999999999754 Q ss_pred ccCCCceeecceEEEcC--CCCcC-----ceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecc Q lcl|NC_019933. 308 QGTLAPTLWGLPVVATQ--AMAVG-----QFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRP 380 (394) Q Consensus 308 ~~~~~~~l~G~pv~~~~--~~p~~-----~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~ 380 (394) ..+.+++|+|+||++++ .+|.. .++||||+.+|.++++.+++++++++.+..|.+|++.||++.|+||++.+| T Consensus 303 ~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~ 382 (408) T protein:vir:10 303 TKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDS 382 (408) T ss_pred CCCCCceecceeeEEecccccCccCCCceEEEEEehhccEEEEEecceEEEEcccccchhhcCceEEEEEEeeccEEecc Confidence 45566799999999865 45643 279999999999999999999999999899999999999999999999999 Q ss_pred cceEEEEecCCCCC Q lcl|NC_019933. 381 ESFIKGSLAAAAGT 394 (394) Q Consensus 381 ~a~~~l~~~~a~~~ 394 (394) +||++++++++++. T Consensus 383 ~a~~~~~~~~~~~~ 396 (408) T protein:vir:10 383 EALVAGSFSAIADQ 396 (408) T ss_pred ccEEEEEeeccccC Confidence 99999999997766 No 25 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=100.00 E-value=9.9e-64 Score=366.00 Aligned_cols=376 Identities=14% Similarity=0.178 Sum_probs=276.9 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc----hh Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQH----IS 76 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~----~~ 76 (394) |.+|+||++++.++.++++++.++.....+......+++.++.++++.++++++++++++...+.......... .. T Consensus 4 ~m~i~el~~~~~~~~~~~~~~~~e~~~~~~~~~~~~e~i~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (408) T protein:vir:74 4 KLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLN 83 (408) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Confidence 66999999999999999988877654433211112223344444444444444444433333222111110000 00 Q ss_pred hhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCce- Q lcl|NC_019933. 77 IGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTL- 155 (394) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~- 155 (394) .............+.. ........ .......+...+++++||++||+++.+.|++.+++.++|+++|+++|++++.. T Consensus 84 ~~~~~~~~~~~~~~~~-~~~~~~~~-~~~~~~~a~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~ 161 (408) T protein:vir:74 84 KSENELKDKFVKDFVN-MVRNPMAF-LNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSSGS 161 (408) T ss_pred chhhhhHHHHHHHHHH-HHhcchhh-hhhhhhhhhcccccCCCceeechhHhhHHHHHHhhhcchhhhcceeeccCCcce Confidence 0000011111111111 11111000 01112334456667789999999999999999999999999999999987654 Q ss_pred -eEEEEcCcccccceecCCccccc-cccceeeEEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019933. 156 -EYVRETGFTNAAAPVAEGAQKPE-SSLRFDLVQTSAKVIAHWMKASRQILSDSA-QLQSFINARLLRGLEVVEENQLLN 232 (394) Q Consensus 156 -~~~~~~~~~~~~~~~~eg~~~~~-~~~~~~~i~~~~~k~~~~~~is~e~l~~s~-~~~~~i~~~la~a~~~~~d~a~l~ 232 (394) .+++..+....++|++|++.+|+ ++++|++|++++++++++++||+|+++|+. +++++|.+.|++++++++|.+||+ T Consensus 162 ~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~ 241 (408) T protein:vir:74 162 RVYEKWTDVTPLKAMDEEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIA 241 (408) T ss_pred EEEEeecCCcccccccccccccccccccceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 45555565667899999999997 569999999999999999999999999985 899999999999999999999999 Q ss_pred ccCCCccccccccccccccccccccccchHHHHHHHH-HHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCc-ccC Q lcl|NC_019933. 233 GNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLAL-LQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNP-QGT 310 (394) Q Consensus 233 g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~-~~~ 310 (394) |+|++.+..+ ..+++++.+++ ..++..+..++.|+|||.+|.+|+++||++|+|+|++. ..+ T Consensus 242 G~G~~~~~~~----------------~~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~ 305 (408) T protein:vir:74 242 AMGTVPKKPT----------------IANFDDVITMINTSVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKP 305 (408) T ss_pred cccccccccc----------------cccHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceEeccCcCCC Confidence 9988765432 23467787765 58888898999999999999999999999999999754 455 Q ss_pred CCceeecceEEEcC--CCCc-----CceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccce Q lcl|NC_019933. 311 LAPTLWGLPVVATQ--AMAV-----GQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESF 383 (394) Q Consensus 311 ~~~~l~G~pv~~~~--~~p~-----~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~ 383 (394) .+++|+|+||++++ .+|. ..++||||+.+|.++++.+++++++++.+..|.+|++.||++.|+||++++|+|| T Consensus 306 ~~~~l~G~pV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~ 385 (408) T protein:vir:74 306 NSYLIKGKQVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEAL 385 (408) T ss_pred CCceecceeeEEecCcccccccCCcceEEEEehhccEEEEEecceEEEEeccccchhhcceeeEEEEEeeCcEEecccce Confidence 56799999999865 3553 3479999999999999999999999998889999999999999999999999999 Q ss_pred EEEEecCCCCC Q lcl|NC_019933. 384 IKGSLAAAAGT 394 (394) Q Consensus 384 ~~l~~~~a~~~ 394 (394) +++++++.++. T Consensus 386 ~~~~~~~~~~~ 396 (408) T protein:vir:74 386 VAGSFTAIADQ 396 (408) T ss_pred EEEEeecccCC Confidence 99999777766 No 26 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=100.00 E-value=5.6e-63 Score=361.88 Aligned_cols=388 Identities=29% Similarity=0.451 Sum_probs=287.2 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-------hccccccc Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEG-------NGAGGDVQ 73 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~-------~~~~~~~~ 73 (394) |+.|+|+++++.+..++.++..++..+ ..++.+...+++.++.+++..+++.++......+. ........ T Consensus 4 ~~~lee~~a~l~~~~~~~~~~~~~~~~---~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (419) T protein:vir:94 4 TPTLEEQRAALLARLDDTSLTTEQVQE---IVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAGT 80 (419) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Confidence 666777777777766666655554432 33444444455555555555554443333222221 11222222 Q ss_pred chhhhhhhhhHHHHHHHHHHhhhhhhhhHHHHHH------Hhhccccc-CCcCccccchhhhhHHHhhhhhhhhHHHhcc Q lcl|NC_019933. 74 HISIGQQFVNSDSFKAMAESGGQRGRAEINIKAA------ITSLSTNA-DGSAGATVQTTRLPGILELPQRRMTIRSLLA 146 (394) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~-~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~ 146 (394) .+..++.......+........ ......+.+.. ......+. ...++.++|..+.+.|+..+...+.|+++++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~ 159 (419) T protein:vir:94 81 FRSLAQRFADSDGLREYRARDK-RGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLD 159 (419) T ss_pred ccchhhhhhhHHHHHHHHHhhh-hhhhhHHHHHHHHHHhhccccccccccCCcccccchhhhHHHHHHHhhhhhhhhcce Confidence 2333333333333332222211 11111111111 11112222 2334566677777777778888889999999 Q ss_pred ccccccCceeEEEEcCcc-------cccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 147 QGTMEGNTLEYVRETGFT-------NAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSAQLQSFINARLL 219 (394) Q Consensus 147 ~~~~~~~~~~~~~~~~~~-------~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~~~~~~i~~~la 219 (394) +++++++.+.+|+.++.+ +.+.|++||+.+|+++++|+++++.+++++++++||+|+++|+++++++|.++|+ T Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~~l~~~i~~~la 239 (419) T protein:vir:94 160 QQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNSQLMGYIQGRLT 239 (419) T ss_pred eeeccCCceeeeeeccccccccccCcccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHhHHHHHHHHHHHHH Confidence 999999999999876532 3467999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhccCCCcccccccccccccccc-----ccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHH Q lcl|NC_019933. 220 RGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAP-----ITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIEL 294 (394) Q Consensus 220 ~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~-----~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 294 (394) +++++++|.+||+|+|++ .|.|+++.++..+.. ...+....++++.+++..+...+..+++|+|||.+|..|++ T Consensus 240 ~a~~~~~d~aii~G~G~~-~p~Gi~~~~~~~~~~~~~~~~~~t~~~~~~~l~~~~~~~~~~~~~~~~~v~n~~~~~~l~~ 318 (419) T protein:vir:94 240 YGLRFLRDRQLLNGNGST-EMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIEL 318 (419) T ss_pred HHHHHHHHHHHHhccCcc-cccceecccccccccccccccccccchhHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHH Confidence 999999999999999975 688998876654332 22345557899999999999999999999999999999999 Q ss_pred hhccCCccc-cc-CcccCCCceeecceEEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEE Q lcl|NC_019933. 295 LKDTQGRYI-LG-NPQGTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEER 372 (394) Q Consensus 295 lkd~~G~~~-~~-~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~ 372 (394) ++|++|+++ ++ ...++.+++|+|+||++++++|+++++||||+.+|+++++.+++++++++...+|.+|++.||++.| T Consensus 319 ~k~~~~~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~~~r 398 (419) T protein:vir:94 319 DQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFR 398 (419) T ss_pred HhhcCCCceeecCCcccCCCccccceeeEEcCCCCCccEEEeeccceEEEEEecceEEEEeccccchhhcCcEEEEEEEe Confidence 999877754 33 4455667899999999999999999999999999999999999999999888899999999999999 Q ss_pred eccEEecccceEEEEecCCCC Q lcl|NC_019933. 373 LALAVYRPESFIKGSLAAAAG 393 (394) Q Consensus 373 ~d~~v~~~~a~~~l~~~~a~~ 393 (394) +|+++++|+||++++++++.. T Consensus 399 ~d~~v~~~~a~~~~~~~aa~~ 419 (419) T protein:vir:94 399 ANLAVYQPKAFVRVTFAAATT 419 (419) T ss_pred eccEEeccccEEEEEeccCCC Confidence 999999999999999998887 No 27 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=100.00 E-value=6.4e-63 Score=361.56 Aligned_cols=386 Identities=13% Similarity=0.166 Sum_probs=291.3 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchh---- Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQHIS---- 76 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~---- 76 (394) ||.++||++++.++.+++.+..++.... +.++..++++++.+++++++.+|+++++................+. T Consensus 1 mk~~~el~~~l~el~~~~~~~~~e~~~~--l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (415) T protein:vir:79 1 MKTKEELQSEISDIKRQIDLKVKYATRA--LNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVN 78 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHH--hchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc Confidence 9999999999999988887766554332 3445556777888888888888887776655543322111111000 Q ss_pred hhhhhhhHHHHHHHHHHhhhhhhhhHHHHHH--------HhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhcccc Q lcl|NC_019933. 77 IGQQFVNSDSFKAMAESGGQRGRAEINIKAA--------ITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQG 148 (394) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~ 148 (394) ..........................+.++. .......++..||.+||+++.+.|++.+++.++|+++++++ T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~ 158 (415) T protein:vir:79 79 EARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) T ss_pred hhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheeee Confidence 0000000001111111111111111111111 11122334556899999999999999999999999999999 Q ss_pred ccccCceeEEEEc-CcccccceecCCcccccc-ccceeeEEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHH Q lcl|NC_019933. 149 TMEGNTLEYVRET-GFTNAAAPVAEGAQKPES-SLRFDLVQTSAKVIAHWMKASRQILSDSA-QLQSFINARLLRGLEVV 225 (394) Q Consensus 149 ~~~~~~~~~~~~~-~~~~~~~~~~eg~~~~~~-~~~~~~i~~~~~k~~~~~~is~e~l~~s~-~~~~~i~~~la~a~~~~ 225 (394) +|+++.+++|+.. .....+.|++|++.+|++ .++|+++++.+++++++++||+|+++++. ++++||.+.|+++++++ T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~ 238 (415) T protein:vir:79 159 RVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAAT 238 (415) T ss_pred eccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHH Confidence 9987766666544 223578899999999975 58999999999999999999999999985 89999999999999999 Q ss_pred HHHHHhhccCCCccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCccccc Q lcl|NC_019933. 226 EENQLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILG 305 (394) Q Consensus 226 ~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~ 305 (394) +|.++++|+|++.++.++..... .......++..++++|.+++..+...++.+++|+|||.+|..|+++||++|+|+|. T Consensus 239 ~~~~il~g~g~g~~~~~~~~~~~-~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~ 317 (415) T protein:vir:79 239 RNKAIIDVITKGSTGSTSSGFEK-EGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQ 317 (415) T ss_pred HHHHHhhccccCccccccccccc-cccccccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCceeec Confidence 99999999998887665554333 33344556677899999999999999999999999999999999999999999996 Q ss_pred Cc-ccCCCceeecceEEEcCCCCcCc-----eEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEec Q lcl|NC_019933. 306 NP-QGTLAPTLWGLPVVATQAMAVGQ-----FLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYR 379 (394) Q Consensus 306 ~~-~~~~~~~l~G~pv~~~~~~p~~~-----~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~ 379 (394) +. .++.+++|+|+||++++.+|.+. ++||||+.+|.++++.++++++++ |..+.+.+|+++|+|+++.+ T Consensus 318 ~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~-----~~~~~~~~~~~~r~d~~v~~ 392 (415) T protein:vir:79 318 PDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----YMHFGECLMIAVRQDCRILD 392 (415) T ss_pred cCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEec-----cccCceEEEEEEEeccEEec Confidence 54 45566799999999999988643 799999998889999999998765 45667899999999999999 Q ss_pred ccceEEEEecCCCCC Q lcl|NC_019933. 380 PESFIKGSLAAAAGT 394 (394) Q Consensus 380 ~~a~~~l~~~~a~~~ 394 (394) |+||++++++++++. T Consensus 393 ~~a~~~~~~~~~~~~ 407 (415) T protein:vir:79 393 YKSAIVIEYDDSERG 407 (415) T ss_pred cccEEEEEEeccCCC Confidence 999999999988877 No 28 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=100.00 E-value=6.4e-63 Score=361.56 Aligned_cols=386 Identities=13% Similarity=0.166 Sum_probs=291.3 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchh---- Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQHIS---- 76 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~---- 76 (394) ||.++||++++.++.+++.+..++.... +.++..++++++.+++++++.+|+++++................+. T Consensus 1 mk~~~el~~~l~el~~~~~~~~~e~~~~--l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (415) T protein:vir:98 1 MKTKEELQSEISDIKRQIDLKVKYATRA--LNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVN 78 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHH--hchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc Confidence 9999999999999988887766554332 3445556777888888888888887776655543322111111000 Q ss_pred hhhhhhhHHHHHHHHHHhhhhhhhhHHHHHH--------HhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhcccc Q lcl|NC_019933. 77 IGQQFVNSDSFKAMAESGGQRGRAEINIKAA--------ITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQG 148 (394) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~ 148 (394) ..........................+.++. .......++..||.+||+++.+.|++.+++.++|+++++++ T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~ 158 (415) T protein:vir:98 79 EARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) T ss_pred hhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheeee Confidence 0000000001111111111111111111111 11122334556899999999999999999999999999999 Q ss_pred ccccCceeEEEEc-CcccccceecCCcccccc-ccceeeEEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHH Q lcl|NC_019933. 149 TMEGNTLEYVRET-GFTNAAAPVAEGAQKPES-SLRFDLVQTSAKVIAHWMKASRQILSDSA-QLQSFINARLLRGLEVV 225 (394) Q Consensus 149 ~~~~~~~~~~~~~-~~~~~~~~~~eg~~~~~~-~~~~~~i~~~~~k~~~~~~is~e~l~~s~-~~~~~i~~~la~a~~~~ 225 (394) +|+++.+++|+.. .....+.|++|++.+|++ .++|+++++.+++++++++||+|+++++. ++++||.+.|+++++++ T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~ 238 (415) T protein:vir:98 159 RVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAAT 238 (415) T ss_pred eccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHH Confidence 9987766666544 223578899999999975 58999999999999999999999999985 89999999999999999 Q ss_pred HHHHHhhccCCCccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCccccc Q lcl|NC_019933. 226 EENQLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILG 305 (394) Q Consensus 226 ~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~ 305 (394) +|.++++|+|++.++.++..... .......++..++++|.+++..+...++.+++|+|||.+|..|+++||++|+|+|. T Consensus 239 ~~~~il~g~g~g~~~~~~~~~~~-~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~ 317 (415) T protein:vir:98 239 RNKAIIDVITKGSTGSTSSGFEK-EGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQ 317 (415) T ss_pred HHHHHhhccccCccccccccccc-cccccccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCceeec Confidence 99999999998887665554333 33344556677899999999999999999999999999999999999999999996 Q ss_pred Cc-ccCCCceeecceEEEcCCCCcCc-----eEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEec Q lcl|NC_019933. 306 NP-QGTLAPTLWGLPVVATQAMAVGQ-----FLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYR 379 (394) Q Consensus 306 ~~-~~~~~~~l~G~pv~~~~~~p~~~-----~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~ 379 (394) +. .++.+++|+|+||++++.+|.+. ++||||+.+|.++++.++++++++ |..+.+.+|+++|+|+++.+ T Consensus 318 ~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~-----~~~~~~~~~~~~r~d~~v~~ 392 (415) T protein:vir:98 318 PDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----YMHFGECLMIAVRQDCRILD 392 (415) T ss_pred cCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEec-----cccCceEEEEEEEeccEEec Confidence 54 45566799999999999988643 799999998889999999998765 45667899999999999999 Q ss_pred ccceEEEEecCCCCC Q lcl|NC_019933. 380 PESFIKGSLAAAAGT 394 (394) Q Consensus 380 ~~a~~~l~~~~a~~~ 394 (394) |+||++++++++++. T Consensus 393 ~~a~~~~~~~~~~~~ 407 (415) T protein:vir:98 393 YKSAIVIEYDDSERG 407 (415) T ss_pred cccEEEEEEeccCCC Confidence 999999999988877 No 29 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=100.00 E-value=6.4e-63 Score=361.56 Aligned_cols=386 Identities=13% Similarity=0.166 Sum_probs=291.3 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchh---- Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQHIS---- 76 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~---- 76 (394) ||.++||++++.++.+++.+..++.... +.++..++++++.+++++++.+|+++++................+. T Consensus 1 mk~~~el~~~l~el~~~~~~~~~e~~~~--l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (415) T protein:vir:81 1 MKTKEELQSEISDIKRQIDLKVKYATRA--LNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVN 78 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHH--hchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc Confidence 9999999999999988887766554332 3445556777888888888888887776655543322111111000 Q ss_pred hhhhhhhHHHHHHHHHHhhhhhhhhHHHHHH--------HhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhcccc Q lcl|NC_019933. 77 IGQQFVNSDSFKAMAESGGQRGRAEINIKAA--------ITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQG 148 (394) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~ 148 (394) ..........................+.++. .......++..||.+||+++.+.|++.+++.++|+++++++ T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~ 158 (415) T protein:vir:81 79 EARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) T ss_pred hhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheeee Confidence 0000000001111111111111111111111 11122334556899999999999999999999999999999 Q ss_pred ccccCceeEEEEc-CcccccceecCCcccccc-ccceeeEEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHH Q lcl|NC_019933. 149 TMEGNTLEYVRET-GFTNAAAPVAEGAQKPES-SLRFDLVQTSAKVIAHWMKASRQILSDSA-QLQSFINARLLRGLEVV 225 (394) Q Consensus 149 ~~~~~~~~~~~~~-~~~~~~~~~~eg~~~~~~-~~~~~~i~~~~~k~~~~~~is~e~l~~s~-~~~~~i~~~la~a~~~~ 225 (394) +|+++.+++|+.. .....+.|++|++.+|++ .++|+++++.+++++++++||+|+++++. ++++||.+.|+++++++ T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~ 238 (415) T protein:vir:81 159 RVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAAT 238 (415) T ss_pred eccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHH Confidence 9987766666544 223578899999999975 58999999999999999999999999985 89999999999999999 Q ss_pred HHHHHhhccCCCccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCccccc Q lcl|NC_019933. 226 EENQLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILG 305 (394) Q Consensus 226 ~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~ 305 (394) +|.++++|+|++.++.++..... .......++..++++|.+++..+...++.+++|+|||.+|..|+++||++|+|+|. T Consensus 239 ~~~~il~g~g~g~~~~~~~~~~~-~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~ 317 (415) T protein:vir:81 239 RNKAIIDVITKGSTGSTSSGFEK-EGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQ 317 (415) T ss_pred HHHHHhhccccCccccccccccc-cccccccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCceeec Confidence 99999999998887665554333 33344556677899999999999999999999999999999999999999999996 Q ss_pred Cc-ccCCCceeecceEEEcCCCCcCc-----eEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEec Q lcl|NC_019933. 306 NP-QGTLAPTLWGLPVVATQAMAVGQ-----FLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYR 379 (394) Q Consensus 306 ~~-~~~~~~~l~G~pv~~~~~~p~~~-----~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~ 379 (394) +. .++.+++|+|+||++++.+|.+. ++||||+.+|.++++.++++++++ |..+.+.+|+++|+|+++.+ T Consensus 318 ~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~-----~~~~~~~~~~~~r~d~~v~~ 392 (415) T protein:vir:81 318 PDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----YMHFGECLMIAVRQDCRILD 392 (415) T ss_pred cCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEec-----cccCceEEEEEEEeccEEec Confidence 54 45566799999999999988643 799999998889999999998765 45667899999999999999 Q ss_pred ccceEEEEecCCCCC Q lcl|NC_019933. 380 PESFIKGSLAAAAGT 394 (394) Q Consensus 380 ~~a~~~l~~~~a~~~ 394 (394) |+||++++++++++. T Consensus 393 ~~a~~~~~~~~~~~~ 407 (415) T protein:vir:81 393 YKSAIVIEYDDSERG 407 (415) T ss_pred cccEEEEEEeccCCC Confidence 999999999988877 No 30 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=100.00 E-value=1.7e-62 Score=359.19 Aligned_cols=386 Identities=12% Similarity=0.159 Sum_probs=286.9 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc---ccccchh- Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAG---GDVQHIS- 76 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~---~~~~~~~- 76 (394) ||.++|+++++.++.+++.+..++.... +.++..++++++.+++++++.+|+++++.+...+..... ....... T Consensus 1 mk~~~em~~~l~el~~~~~~~~~e~~~~--~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (415) T protein:vir:46 1 MKTKEELQSEISDIKRQIDLKVKYATRA--LNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVN 78 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHH--hchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Confidence 9999999999988888776655544332 233344456666667777777766655554433322111 0000000 Q ss_pred hhhhhhhHHHHHHHHHHhhhhhhhhHHHHHH--------HhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhcccc Q lcl|NC_019933. 77 IGQQFVNSDSFKAMAESGGQRGRAEINIKAA--------ITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQG 148 (394) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~ 148 (394) ..........................+.++. .......+++.||.+||+++.+.|++.+++.++|+++++++ T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~ 158 (415) T protein:vir:46 79 EARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) T ss_pred hhhhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhccee Confidence 0000011111111111111111111111111 11122334557889999999999999999999999999999 Q ss_pred ccccCceeEEEEcC-cccccceecCCccccc-cccceeeEEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHH Q lcl|NC_019933. 149 TMEGNTLEYVRETG-FTNAAAPVAEGAQKPE-SSLRFDLVQTSAKVIAHWMKASRQILSDSA-QLQSFINARLLRGLEVV 225 (394) Q Consensus 149 ~~~~~~~~~~~~~~-~~~~~~~~~eg~~~~~-~~~~~~~i~~~~~k~~~~~~is~e~l~~s~-~~~~~i~~~la~a~~~~ 225 (394) |++++..++|+... ....+.|++||+.+|+ +.++|++|++.+++++++++||+|+++|++ ++++||.+.|+++++++ T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~ 238 (415) T protein:vir:46 159 RVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAAT 238 (415) T ss_pred eccCCceeEEEEEecCCcceeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHH Confidence 99988888876643 2356789999999997 568999999999999999999999999985 89999999999999999 Q ss_pred HHHHHhhccCCCccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCccccc Q lcl|NC_019933. 226 EENQLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILG 305 (394) Q Consensus 226 ~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~ 305 (394) +|.+||+|+|++.++.++...... ......++..+++++.+++..+...++.+++|+|||.+|..|++++|++|+|+|. T Consensus 239 ~d~~il~g~g~g~~~~~~~~~~~~-~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~ 317 (415) T protein:vir:46 239 RNKAIIDVITKGSTGSTSSGFEKE-GKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQ 317 (415) T ss_pred HHHHHhhccccCCccccccccccc-cceeccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeec Confidence 999999999988877665544332 3334456667899999999999999999999999999999999999999999996 Q ss_pred C-cccCCCceeecceEEEcCCCCcC-----ceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEec Q lcl|NC_019933. 306 N-PQGTLAPTLWGLPVVATQAMAVG-----QFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYR 379 (394) Q Consensus 306 ~-~~~~~~~~l~G~pv~~~~~~p~~-----~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~ 379 (394) + ..++.+++|+|+||++++.+|.+ .++||||+.+|.++++.+++++.++ |.++.+.+|+++|+|+++.+ T Consensus 318 ~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~-----~~~~~~~~~~~~r~d~~v~~ 392 (415) T protein:vir:46 318 PDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----YMHFGECLMIAVRQDCRILD 392 (415) T ss_pred cCcCCCCCccccceeeEEeccccccCCCccEEEEEehhccEEEEeecceEEEeec-----cccCceEEEEEEEeccEEec Confidence 4 45566679999999999998854 3799999998999999999887764 56678899999999999999 Q ss_pred ccceEEEEecCCCCC Q lcl|NC_019933. 380 PESFIKGSLAAAAGT 394 (394) Q Consensus 380 ~~a~~~l~~~~a~~~ 394 (394) |+||++++++++++- T Consensus 393 ~~a~~~~~~~~~~~~ 407 (415) T protein:vir:46 393 YKSAIVIEYDDSERG 407 (415) T ss_pred cccEEEEEeeccCCC Confidence 999999998887766 No 31 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=100.00 E-value=1.7e-62 Score=359.19 Aligned_cols=386 Identities=12% Similarity=0.159 Sum_probs=286.9 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc---ccccchh- Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAG---GDVQHIS- 76 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~---~~~~~~~- 76 (394) ||.++|+++++.++.+++.+..++.... +.++..++++++.+++++++.+|+++++.+...+..... ....... T Consensus 1 mk~~~em~~~l~el~~~~~~~~~e~~~~--~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (415) T protein:vir:47 1 MKTKEELQSEISDIKRQIDLKVKYATRA--LNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVN 78 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHH--hchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Confidence 9999999999988888776655544332 233344456666667777777766655554433322111 0000000 Q ss_pred hhhhhhhHHHHHHHHHHhhhhhhhhHHHHHH--------HhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhcccc Q lcl|NC_019933. 77 IGQQFVNSDSFKAMAESGGQRGRAEINIKAA--------ITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQG 148 (394) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~ 148 (394) ..........................+.++. .......+++.||.+||+++.+.|++.+++.++|+++++++ T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~ 158 (415) T protein:vir:47 79 EARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) T ss_pred hhhhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhccee Confidence 0000011111111111111111111111111 11122334557889999999999999999999999999999 Q ss_pred ccccCceeEEEEcC-cccccceecCCccccc-cccceeeEEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHH Q lcl|NC_019933. 149 TMEGNTLEYVRETG-FTNAAAPVAEGAQKPE-SSLRFDLVQTSAKVIAHWMKASRQILSDSA-QLQSFINARLLRGLEVV 225 (394) Q Consensus 149 ~~~~~~~~~~~~~~-~~~~~~~~~eg~~~~~-~~~~~~~i~~~~~k~~~~~~is~e~l~~s~-~~~~~i~~~la~a~~~~ 225 (394) |++++..++|+... ....+.|++||+.+|+ +.++|++|++.+++++++++||+|+++|++ ++++||.+.|+++++++ T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~ 238 (415) T protein:vir:47 159 RVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAAT 238 (415) T ss_pred eccCCceeEEEEEecCCcceeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHH Confidence 99988888876643 2356789999999997 568999999999999999999999999985 89999999999999999 Q ss_pred HHHHHhhccCCCccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCccccc Q lcl|NC_019933. 226 EENQLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILG 305 (394) Q Consensus 226 ~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~ 305 (394) +|.+||+|+|++.++.++...... ......++..+++++.+++..+...++.+++|+|||.+|..|++++|++|+|+|. T Consensus 239 ~d~~il~g~g~g~~~~~~~~~~~~-~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~ 317 (415) T protein:vir:47 239 RNKAIIDVITKGSTGSTSSGFEKE-GKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQ 317 (415) T ss_pred HHHHHhhccccCCccccccccccc-cceeccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeec Confidence 999999999988877665544332 3334456667899999999999999999999999999999999999999999996 Q ss_pred C-cccCCCceeecceEEEcCCCCcC-----ceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEec Q lcl|NC_019933. 306 N-PQGTLAPTLWGLPVVATQAMAVG-----QFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYR 379 (394) Q Consensus 306 ~-~~~~~~~~l~G~pv~~~~~~p~~-----~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~ 379 (394) + ..++.+++|+|+||++++.+|.+ .++||||+.+|.++++.+++++.++ |.++.+.+|+++|+|+++.+ T Consensus 318 ~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~-----~~~~~~~~~~~~r~d~~v~~ 392 (415) T protein:vir:47 318 PDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----YMHFGECLMIAVRQDCRILD 392 (415) T ss_pred cCcCCCCCccccceeeEEeccccccCCCccEEEEEehhccEEEEeecceEEEeec-----cccCceEEEEEEEeccEEec Confidence 4 45566679999999999998854 3799999998999999999887764 56678899999999999999 Q ss_pred ccceEEEEecCCCCC Q lcl|NC_019933. 380 PESFIKGSLAAAAGT 394 (394) Q Consensus 380 ~~a~~~l~~~~a~~~ 394 (394) |+||++++++++++- T Consensus 393 ~~a~~~~~~~~~~~~ 407 (415) T protein:vir:47 393 YKSAIVIEYDDSERG 407 (415) T ss_pred cccEEEEEeeccCCC Confidence 999999998887766 No 32 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=100.00 E-value=9.9e-64 Score=366.00 Aligned_cols=372 Identities=21% Similarity=0.211 Sum_probs=276.1 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhh-hhhHHHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQ-ELNASVRAKVDELLM---------AQGALQADLKAAQQRIAEVEGNGAGG 70 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~-~~~~e~~~~~~~~~~---------~~~~l~~~i~~~e~~~~~~~~~~~~~ 70 (394) |+ +.|.+.+++..+++++++++..+.. ++.++..++++++++ +++.+..+++.++..+.+........ T Consensus 21 ~~--~~l~e~ra~~~~e~~~l~~~~~~~~~~~k~~~~~~~~~~~~~~~~~e~~~~~~~~~~ei~~~~~~~~~~~~~~~~~ 98 (425) T protein:vir:10 21 VP--RGIISVRAEGPTEVKALIENLQKAFHDFKAEHTKQLDAVKAGLPTSDALAKVDKVSADLEALQAAVDEANIKIAAA 98 (425) T ss_pred hh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 55 4555555555566666666654443 233333333433322 22333333333333322221111100 Q ss_pred ---cccchhhhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccc Q lcl|NC_019933. 71 ---DVQHISIGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQ 147 (394) Q Consensus 71 ---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~ 147 (394) ....+..... .....+..+. ...+.+ ..++.+++++||++||+++.+.|++.+++.++|+++|++ T Consensus 99 ~~~~~~~~~~~~~-~~~~af~~~l--------~~~e~~---~al~~~t~~~gG~lvP~~~~~~ii~~~~~~s~l~~l~~~ 166 (425) T protein:vir:10 99 QMGANGVKPLRDP-EYTEAFKAHV--------KRGDVQ---AALNKGEDSEGGYLTPIEWDRTITNKLVLISPMRQLCRV 166 (425) T ss_pred hcccccccccccH-HHHHHHHHHh--------hhhhhH---HHhhcCcCCCCceeccHhHHHHHHHHHHhhhhhhhhcee Confidence 0000111000 0011111111 111222 334556778899999999999999999999999999999 Q ss_pred cccccCceeEEEEcCcccccceecCCccccccc-cceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 148 GTMEGNTLEYVRETGFTNAAAPVAEGAQKPESS-LRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVV 225 (394) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~-~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~ 225 (394) +|++++.+++|+..+. +.+.|++|++.+|+++ ++|+++++.+++++++++||+|+++|+ ++++++|.++|+++++++ T Consensus 167 ~~~~~~~~~~~~~~~~-~~a~wv~E~~~~~~~~~~~f~~v~~~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~ai~~~ 245 (425) T protein:vir:10 167 QPVSKAGFSKLFNMGG-TTSGWVGEASQRPQTNAATFQPLSFASGEIYANPAATQQILDDAEIDLESWLATEVQTEFAKQ 245 (425) T ss_pred eeccCCceEEEEEcCC-cceeeeccccccccccccccceeeeeheeeEeehHhHHHHHhcchhHHHHHHHHHHHHHHHHH Confidence 9999888999998775 6789999999999876 799999999999999999999999998 699999999999999999 Q ss_pred HHHHHhhccCCCccccccccccccccc------------cccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHH Q lcl|NC_019933. 226 EENQLLNGNGTGQNLLGLLPQATAFAA------------PITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIE 293 (394) Q Consensus 226 ~d~a~l~g~g~~~~~~Gi~~~~~~~~~------------~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 293 (394) +|.+||+|+|++ .|.||++..+..+. ....++..++++|++++..+++.+..+++|+|||.+|.+|+ T Consensus 246 ~d~~~l~G~G~~-~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~~~~a~~vmn~~~~~~L~ 324 (425) T protein:vir:10 246 EGKAFLAGDGTN-KPNGLLTYIAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVYDLPSAFTGNARFAMNRNTQRQVR 324 (425) T ss_pred HHhhhhcccCCC-CcceeeeccccccccccccccccccccccccccccHHHHHHHHhhhhhhhccCCEEEEchHHHHHHH Confidence 999999999964 68999887554322 12234566899999999999999999999999999999999 Q ss_pred HhhccCCcccccCc-ccCCCceeecceEEEcCCCCc-----CceEEeeccceEEEEeecceEEEEecccchhhhcCcEEE Q lcl|NC_019933. 294 LLKDTQGRYILGNP-QGTLAPTLWGLPVVATQAMAV-----GQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTI 367 (394) Q Consensus 294 ~lkd~~G~~~~~~~-~~~~~~~l~G~pv~~~~~~p~-----~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 367 (394) +++|++|+|+|++. ..+.+++|+|+||++++.||. ..++||||+.+|.++++.++++..++ ++.+|++.| T Consensus 325 ~lkD~~G~~l~~~~~~~g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~v~~d~----~~~~~~~~~ 400 (425) T protein:vir:10 325 KLKDGQGNYLWQPSYVAGQPATLAGYPVTEVPDMPDVAANSTPILFGDFQQTYLIIDRIGVRVLRDP----YTAKPYVLF 400 (425) T ss_pred HhhcCCCceeeccCccCCCCceecceeeEEecCcCCccCCccEEEEEehhccEEEEEecceEEEecc----cccCCcEEE Confidence 99999999999755 445567899999999999984 34789999998999999887765433 366899999 Q ss_pred EEEEEeccEEecccceEEEEecCCC Q lcl|NC_019933. 368 LAEERLALAVYRPESFIKGSLAAAA 392 (394) Q Consensus 368 ~~~~~~d~~v~~~~a~~~l~~~~a~ 392 (394) +++.|+|+++.+|+||++++++++- T Consensus 401 ~~~~r~d~~v~~~~A~~~l~~~as~ 425 (425) T protein:vir:10 401 YTTKRVGGGLLNPEPMRAMKVAASE 425 (425) T ss_pred EEEEEeccEeecccceEEEEeeccC Confidence 9999999999999999999999999 No 33 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=100.00 E-value=2.7e-62 Score=358.09 Aligned_cols=386 Identities=13% Similarity=0.164 Sum_probs=289.4 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchh---h Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQHIS---I 77 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~---~ 77 (394) ||.+++|++++.++.+++.+..++.... +.++..++++++.+++++++.+|+++++................+. . T Consensus 1 mk~~~el~~~l~el~~~~~~~~~~~~~~--~~~~~~e~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (415) T protein:vir:94 1 MKTKEELQSEISDIKRQIDLKVKYATRA--LNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVN 78 (415) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHH--hchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Confidence 9999999999998888876665543332 3445556777788888888888876665554443321111000000 0 Q ss_pred -hhhhhhHHHHHHHHHHhhhhhhhhHHHHH--------HHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhcccc Q lcl|NC_019933. 78 -GQQFVNSDSFKAMAESGGQRGRAEINIKA--------AITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQG 148 (394) Q Consensus 78 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~ 148 (394) .........................+.++ ........+++.||.+||+++.+.|++.+++.++|+++++++ T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~ 158 (415) T protein:vir:94 79 EASTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) T ss_pred chhhHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHhhhhhhhhhhccccccccccCcHHHHHHHHHHHHhhhhhhhhccee Confidence 00000000011111111111111111111 111223334567899999999999999999999999999999 Q ss_pred ccccCceeEEEEcC-cccccceecCCcccccc-ccceeeEEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHH Q lcl|NC_019933. 149 TMEGNTLEYVRETG-FTNAAAPVAEGAQKPES-SLRFDLVQTSAKVIAHWMKASRQILSDSA-QLQSFINARLLRGLEVV 225 (394) Q Consensus 149 ~~~~~~~~~~~~~~-~~~~~~~~~eg~~~~~~-~~~~~~i~~~~~k~~~~~~is~e~l~~s~-~~~~~i~~~la~a~~~~ 225 (394) +++++..++|+... ..+.+.|++||+.+|++ .++|++|++.+++++++++||+|+++|++ +++++|.++|+++++++ T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~ 238 (415) T protein:vir:94 159 RVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAAT 238 (415) T ss_pred eccCCceeEEEEeecCCccceeccccccccccccccceeeEeeheeeeeechhhHHHHhhchHHHHHHHHHHHHHHHHHH Confidence 99877666665432 24578899999999975 68999999999999999999999999985 89999999999999999 Q ss_pred HHHHHhhccCCCccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCccccc Q lcl|NC_019933. 226 EENQLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILG 305 (394) Q Consensus 226 ~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~ 305 (394) +|.+|++|+|++.++.++..... .......++..++++|.+++..+...++.+++|+|||.+|.+|+++||++|+|+|. T Consensus 239 ~~~~il~g~g~g~~~~~~~~~~~-~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~ 317 (415) T protein:vir:94 239 RNKAIIDVITKGSTGSTSSGFEK-EGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQ 317 (415) T ss_pred HHHHHhhccccCccccccccccc-cccccccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeec Confidence 99999999998887766554333 23334455667899999999999999999999999999999999999999999996 Q ss_pred C-cccCCCceeecceEEEcCCCCcCc-----eEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEec Q lcl|NC_019933. 306 N-PQGTLAPTLWGLPVVATQAMAVGQ-----FLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYR 379 (394) Q Consensus 306 ~-~~~~~~~~l~G~pv~~~~~~p~~~-----~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~ 379 (394) + ..++.+++|+|+||++++.+|.+. ++||||+.+|.++++.+++++.++ |.++.+.+|++.|+|+++.+ T Consensus 318 ~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~-----~~~~~~~~r~~~r~d~~~~~ 392 (415) T protein:vir:94 318 PDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----YMHFGECLMIAVRQDCRILD 392 (415) T ss_pred cCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEec-----cccCceEEEEEEEeccEEec Confidence 4 455566799999999999998654 799999998999999999987754 45678899999999999999 Q ss_pred ccceEEEEecCCCCC Q lcl|NC_019933. 380 PESFIKGSLAAAAGT 394 (394) Q Consensus 380 ~~a~~~l~~~~a~~~ 394 (394) |+||++++++++++. T Consensus 393 ~~a~~~~~~~~~~~~ 407 (415) T protein:vir:94 393 YKSAIVIEYDDSERG 407 (415) T ss_pred cccEEEEEEeccCCC Confidence 999999999888777 No 34 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=100.00 E-value=3.2e-62 Score=357.72 Aligned_cols=373 Identities=15% Similarity=0.191 Sum_probs=273.5 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc-------ccccc Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGA-------GGDVQ 73 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~-------~~~~~ 73 (394) +.+|+||++++.++.++++++.++............+++.++.+++++++.+++.+++++...+.... ..... T Consensus 4 ~m~l~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (404) T protein:vir:39 4 KLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLN 83 (404) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc Confidence 22799999999999999888877654432221112223333333444444444433333333222111 11111 Q ss_pred chhhhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccC Q lcl|NC_019933. 74 HISIGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGN 153 (394) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~ 153 (394) .............+..+.+.. .... ..........+++++||++||+++.+.|++.+++.++|+++|+++|++++ T Consensus 84 ~~~~~~~~~~~~~~~~~~~~~----~~~~-~~~e~~a~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~ 158 (404) T protein:vir:39 84 KSEYELKDKFVKEFVNMVRNP----MAFL-NTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTS 158 (404) T ss_pred cchhhhHHHHHHHHHHHHhcc----hhhh-hhhhhhhhhcccccCCceeccHHHHHHHHHHHHhhhhHHhhcceeeccCC Confidence 111111111111122111111 1111 01123344556677899999999999999999999999999999999876 Q ss_pred ceeEE--EEcCcccccceecCCccccc-cccceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 154 TLEYV--RETGFTNAAAPVAEGAQKPE-SSLRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQ 229 (394) Q Consensus 154 ~~~~~--~~~~~~~~~~~~~eg~~~~~-~~~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a 229 (394) ...+| +..+..+.+.|++||+.+|+ ++++|+++++.+++++++++||+|+++++ .+++++|.++|++++++++|.+ T Consensus 159 ~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~ 238 (404) T protein:vir:39 159 NGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQA 238 (404) T ss_pred cceEEEEeecCCccceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHH Confidence 55544 44455567899999999997 57999999999999999999999999998 5899999999999999999999 Q ss_pred HhhccCCCccccccccccccccccccccccchHHHHHHHHH-HhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCc- Q lcl|NC_019933. 230 LLNGNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLALL-QAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNP- 307 (394) Q Consensus 230 ~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~-~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~- 307 (394) +|+|+|++.+.. +..+++++.+++. .+...+..+++|+|||.+|..|+++||++|+|+|++. T Consensus 239 il~g~g~~~~~~----------------~~~~~~~i~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~ 302 (404) T protein:vir:39 239 IIAAMGTVPKKP----------------TIAKFDDVITMINTSVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDP 302 (404) T ss_pred HHhccccccccc----------------ccccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCceeeccCc Confidence 999998765432 2234677777654 6777788889999999999999999999999999754 Q ss_pred ccCCCceeecceEEEcCC--CCc-----CceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecc Q lcl|NC_019933. 308 QGTLAPTLWGLPVVATQA--MAV-----GQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRP 380 (394) Q Consensus 308 ~~~~~~~l~G~pv~~~~~--~p~-----~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~ 380 (394) .++.+++|+|+||+++++ +|. ..+++|||++++.++++.+++++++++...+|++|++.||++.|+|+.+++| T Consensus 303 ~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~ 382 (404) T protein:vir:39 303 TKPNSYLIKGKKVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKTTDS 382 (404) T ss_pred CCCCcceecceeEEEecccccCccCCCccEEEEEeccccEEEEeecceEEEEeccchhhhhhceeeEEEEeeeccEEecc Confidence 455667999999998654 443 2489999999999999999999999998889999999999999999999999 Q ss_pred cceEEEEecCCCCC Q lcl|NC_019933. 381 ESFIKGSLAAAAGT 394 (394) Q Consensus 381 ~a~~~l~~~~a~~~ 394 (394) +||++++++++++. T Consensus 383 ~a~~~~~~~~~a~~ 396 (404) T protein:vir:39 383 EALVAGSFTAIADQ 396 (404) T ss_pred cceEEEEeeccccC Confidence 99999999888876 No 35 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=100.00 E-value=5.4e-62 Score=356.46 Aligned_cols=366 Identities=19% Similarity=0.250 Sum_probs=273.2 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhh---------hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQ---------ELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGD 71 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~---------~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~ 71 (394) |. +.|++++++++.+++++..++...+. ++.++..++.+++..+++++++++++++++.+.... .. T Consensus 1 m~-~~e~~~~~~~~~~~l~~~~~~~~~e~~~~~e~~~~~~~~~~~~~~~e~~~~~~~l~~~~~~~e~~~~~~~~----~~ 75 (379) T protein:vir:10 1 ME-ALEIKVALEAIKGQVDSKSSAQALEVKGLIEALEAKMTSEKDLAVNELKSDMAALQAHADKLDVKLKEKAK----SE 75 (379) T ss_pred CC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc----cc Confidence 76 66677777777776665544332221 122333444555555555555555554444332211 11 Q ss_pred ccchhhhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccc Q lcl|NC_019933. 72 VQHISIGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTME 151 (394) Q Consensus 72 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~ 151 (394) ...+......... .......+..... ........+++++++.++|+.+...|++.+++.++|+++|++++++ T Consensus 76 ~~~~~~~~~~~~~-----~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~~ 147 (379) T protein:vir:10 76 DKSDSLVKSITEN-----FNDIKEVRNGKSI---QVKAVGDMTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGAVSIS 147 (379) T ss_pred ccchhHHHHHHHH-----HHhHHHHHhhhhh---hhhhhcccccCCCCccccchhhhhHHHHhHHhhhhHHhhceeeecc Confidence 1111111111100 0000000000000 0111223344455666899999999999999999999999999999 Q ss_pred cCceeEEEEcCcc-cccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 152 GNTLEYVRETGFT-NAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSAQLQSFINARLLRGLEVVEENQL 230 (394) Q Consensus 152 ~~~~~~~~~~~~~-~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~~~~~~i~~~la~a~~~~~d~a~ 230 (394) ++.++||+.++.+ ....|++||+.+|+++++|++|++.+++++++++||+|+++|++++.+||.++|+++++.++|.+| T Consensus 148 ~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~D~~~l~~~i~~~la~~~~~~~~~~~ 227 (379) T protein:vir:10 148 GGTYTFVRENGAGEGAIGAQVEGATKGQKDYDISMIDVNTDFIAGFTRYSKKMANNLPFLTSFIPNALRRDYAKAENAAF 227 (379) T ss_pred CCceEEEEeecCCCcccccccCCccccccccceeeeEeeeeeEEeeehhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999987543 456799999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhccCCCccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcc-- Q lcl|NC_019933. 231 LNGNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQ-- 308 (394) Q Consensus 231 l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~-- 308 (394) +.|.+++.. .+ ..+.++...++++.+++..+...++.+++|+|||.+|..|+++||++|+|++++.. T Consensus 228 ~~g~~~~~~-~~----------~~~~~~~~~~d~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~ 296 (379) T protein:vir:10 228 NAVLAANAT-AS----------TEIITNKNKVEMLINEIAKQENLDFPVTAIVLRPTDYYDILVTQKSVGAGYGLPGVVT 296 (379) T ss_pred hcccccccc-cc----------cccccCcccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCceeccCCccC Confidence 988775432 11 11234455678999999999999999999999999999999999999999997543 Q ss_pred -cCCCceeecceEEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEE Q lcl|NC_019933. 309 -GTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGS 387 (394) Q Consensus 309 -~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~ 387 (394) .+++++|+|+||++++.||+++++||||+.+ .++.+.++++++.++...+|.+|++.||++.|+|+.+++|+||++++ T Consensus 297 ~~~~~~~l~G~pvv~s~~~~ag~~~~gdf~~~-~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~R~~~~v~~p~a~v~~~ 375 (379) T protein:vir:10 297 QDNGVLRINGIPLFRATWLAANKYYVGDWTRV-TKVTTEGLSLEFSEVEGTNFVKNNITARIEAQVALAVEQPAALIFGD 375 (379) T ss_pred CCCCcceecceeeEecCCCCCCceEEeecccE-EEEEEeceEEEEeecccccccCCcEEEEEEEEeccEEecCccEEEEE Confidence 3445689999999999999999999999985 45577899999999888899999999999999999999999999999 Q ss_pred ecCC Q lcl|NC_019933. 388 LAAA 391 (394) Q Consensus 388 ~~~a 391 (394) +++. T Consensus 376 ~~~~ 379 (379) T protein:vir:10 376 FTAV 379 (379) T ss_pred ecCC Confidence 9999 No 36 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=100.00 E-value=5.2e-62 Score=356.58 Aligned_cols=353 Identities=18% Similarity=0.238 Sum_probs=275.4 Q ss_pred Cc-hHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhhhh Q lcl|NC_019933. 1 MS-DINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQHISIGQ 79 (394) Q Consensus 1 Mk-~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~ 79 (394) |+ +|+++.++++++.++++.+.++. ..++++++.++++.++++++++++..................... T Consensus 1 M~k~l~~l~e~~~~~~~e~~~~~~~~---------~~e~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 71 (371) T protein:vir:81 1 MPKELRELLEQINNKKEEARKLLAEN---------KIEEAKKLKEEIVALQEKFDVAKELYEEQKQTIEDKEPLKPTVQV 71 (371) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHH---------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchhh Confidence 98 68888888888888877765432 223566777777777777776665544433322222111111111 Q ss_pred hhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEE Q lcl|NC_019933. 80 QFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVR 159 (394) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~ 159 (394) .......+..+.+ .....+...+++++||++||+++.++|++.+++.++|+++++++|++++..++++ T Consensus 72 ~~~~~~~~~~~l~------------~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~ 139 (371) T protein:vir:81 72 KENEVEAFVNHIR------------TRFRNAMSEGSNQDGGYTVPQDIQTRINELRESKDALQNLITVEPVTTLSGSRVF 139 (371) T ss_pred HHHHHHHHHHHHH------------HHHHHhhccCCCccCceeecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEE Confidence 1111111111111 1122345566778899999999999999999999999999999999876666554 Q ss_pred EcC-cccccceecCCccccc-cccceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCC Q lcl|NC_019933. 160 ETG-FTNAAAPVAEGAQKPE-SSLRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGT 236 (394) Q Consensus 160 ~~~-~~~~~~~~~eg~~~~~-~~~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~ 236 (394) ... ..+.+.|++||+.+|+ ++++|+++++++++++++++||+|+++|+ +++++||.+.|++++++++|.+|++|+|+ T Consensus 140 ~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~a~~~~~~~~i~~g~g~ 219 (371) T protein:vir:81 140 KKRSQQTGFVEVAEGAAIGEKATPQFTLLQYQVKKYAGFFRVTNELLNDSTEAIVNTLVRWIGDESRVTRNGLIINVLNT 219 (371) T ss_pred EeecCCcceeeeccccccccccccceeeEEeeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 432 2357889999999986 67999999999999999999999999998 58999999999999999999999999886 Q ss_pred CccccccccccccccccccccccchHHHHHHHH-HHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccC-cccCCCce Q lcl|NC_019933. 237 GQNLLGLLPQATAFAAPITVANATAVDRLRLAL-LQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGN-PQGTLAPT 314 (394) Q Consensus 237 ~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~-~~~~~~~~ 314 (394) +.+. +..+++++..++ ..+...+..+++|+|||.+|.+|+++||++|+|+|++ ..++.+++ T Consensus 220 ~~~~-----------------~~~~~~~i~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~ 282 (371) T protein:vir:81 220 KAKT-----------------AIADLDGLKQIINVQLDPVFRSTSSVIVNQDAFNWLDTLKDQNGQYLLQPSISSPTGRQ 282 (371) T ss_pred cccc-----------------ccccHHHHHHHHHhhcchhhhcCCEEEEcHHHHHHHHHhhccCCCeeeecccCCCCCce Confidence 5431 223456666655 5677888889999999999999999999999999975 45566689 Q ss_pred eecceEEEcCCCCcC------------ceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccc Q lcl|NC_019933. 315 LWGLPVVATQAMAVG------------QFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPES 382 (394) Q Consensus 315 l~G~pv~~~~~~p~~------------~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a 382 (394) |+|+||++++.+|.+ .++||||+.++.++++.+++++++++...+|++|++.||++.|+||++++|+| T Consensus 283 l~G~pV~~~~~~~~~~~~~~~~~~~~~~i~~Gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~v~~~~~~r~d~~~~~~~a 362 (371) T protein:vir:81 283 LLGLPVVIVSNKVLANRVDGGTGAQFAPIIVGDLKEAVVMFDRQRTEIMSSNVAMDAFETDATLWRAIERMDVKMRDDEA 362 (371) T ss_pred ecceeEEEecccccCccccccccCCcceEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccc Confidence 999999999988733 47899999999999999999999999888999999999999999999999999 Q ss_pred eEEEEecCC Q lcl|NC_019933. 383 FIKGSLAAA 391 (394) Q Consensus 383 ~~~l~~~~a 391 (394) |+++++++| T Consensus 363 ~~~~~~~~A 371 (371) T protein:vir:81 363 FVFGEVQLA 371 (371) T ss_pred eEEEEEecC Confidence 999999999 No 37 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=100.00 E-value=7.7e-62 Score=355.64 Aligned_cols=385 Identities=15% Similarity=0.220 Sum_probs=263.7 Q ss_pred Cc------hHHHHHHHHHHHHHHHHHHH----------HHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 1 MS------DINAINSTLANISDSLKAHA----------DRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVE 64 (394) Q Consensus 1 Mk------~i~el~~~~~~~~~~~k~~~----------e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~ 64 (394) || ++++++.++.++.++.+++. +++..+++ ...+.++++.+.++.+++.+.+.+++.+..... T Consensus 3 ~~~~~~~~el~~~~~~l~el~~~~~el~~~~~el~~~~e~ak~eee-~~~l~~ei~~le~e~~~l~~~~~~le~~~~~~~ 81 (425) T protein:vir:95 3 LRQLMLTKKIEQRKAALDELVKREQELQAKAAELEQAIEEAQTEEE-VSAVEEEVAKLEDERNELNEKKSKLEGEIAQLE 81 (425) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11 12333333333333222221 11111111 111222233333333344333333333222221 Q ss_pred h-------hcccccccchhhh--hhhhhH--HHHHHHHHHhhhhhhhhHHHHH-HHhhcccccCCcCccccchhhhhHHH Q lcl|NC_019933. 65 G-------NGAGGDVQHISIG--QQFVNS--DSFKAMAESGGQRGRAEINIKA-AITSLSTNADGSAGATVQTTRLPGIL 132 (394) Q Consensus 65 ~-------~~~~~~~~~~~~~--~~~~~~--~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~g~~ip~~~~~~ii 132 (394) . ............+ ...... ............. ...+.+. .......++++.||++||+++.+.|+ T Consensus 82 ~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~Ii 159 (425) T protein:vir:95 82 DELEQINSKQPSNQSRQKMQGSKGDVVEMNRLQVREMLKTGEYY--KRSEVVEFYEKFRNLRAVAGGELTIPEVVVNRIM 159 (425) T ss_pred HHHHHhhhhccchhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhh--hhhHHHHHHHHHHhhcccccCceeccHHHHHHHH Confidence 1 1111000000000 000000 0011111111110 1111111 11122234456789999999999999 Q ss_pred hhhhhhhhHHHhccccccccCceeEEEEcCcccccceecCCccccccc-cceeeEEeeeeeEEEeehhhHHHHHHHH-HH Q lcl|NC_019933. 133 ELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTNAAAPVAEGAQKPESS-LRFDLVQTSAKVIAHWMKASRQILSDSA-QL 210 (394) Q Consensus 133 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~-~~~~~i~~~~~k~~~~~~is~e~l~~s~-~~ 210 (394) +.+++.++|+++|+++|++|+ +++|+..+ .+.++|++|++.+|+++ ++|++|++++++++++++||+|+++|++ ++ T Consensus 160 ~~l~~~~~i~~~~~~~~~~g~-~~ip~~~~-~~~a~~v~E~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~l 237 (425) T protein:vir:95 160 DIMGDYTTLYPLVDKIRVKGT-TRILVDTD-TSPATWIEQSGALPTGDVGTIASIDFDGFKVGKVTFVDNYLLQDSIINL 237 (425) T ss_pred HHHHhhhhHHHhhceeecCce-eEEEEecC-CccccccccccccccccccccceeeeeheeeeeeehhhHHHHhccHHHH Confidence 999999999999999998764 78999766 47889999999999887 6899999999999999999999999985 89 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhccCCC-ccccccccccccccccccccccchHHHHHHHHHHhhhhcC--CCCeeEeCHH Q lcl|NC_019933. 211 QSFINARLLRGLEVVEENQLLNGNGTG-QNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEF--PATGIVLNPA 287 (394) Q Consensus 211 ~~~i~~~la~a~~~~~d~a~l~g~g~~-~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~--~~~~~~~~~~ 287 (394) ++||.++|++++++++|.++|+|+|++ +.|.||++...........++..+++++.+++..+...+. .+++|+||+. T Consensus 238 ~~~i~~~l~~~i~~~~d~~il~G~G~~~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~ 317 (425) T protein:vir:95 238 DDYVTKKIARAIAKALDLAIVKGTGAANKQPLGIIPSLPPENQVTVEADNNLLKNLVKQIGLIDTGDDSVGEIVAVMKRS 317 (425) T ss_pred HHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccccccccccccchHHHHHHHHHhhhhhccccCceEEEEeCh Confidence 999999999999999999999999986 4689999875555444455677789999999888877654 4668999999 Q ss_pred HH-H---HHHHhhccCCcccccCcccCCCceeecceEEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcC Q lcl|NC_019933. 288 DW-A---GIELLKDTQGRYILGNPQGTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKN 363 (394) Q Consensus 288 ~~-~---~l~~lkd~~G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~ 363 (394) ++ . .|+.++|++|+|+|+.+. ...++|+|+||++++.||++.++||||+. |+++++.++++.++++. +|.+| T Consensus 318 ~~~~~l~~l~~~kd~~g~~i~~~~~-~~~~~l~G~pvv~~~~~~~~~i~~Gd~~~-~~~~~~~~~~i~~~~~~--~f~~~ 393 (425) T protein:vir:95 318 TYYNRLVEFSIQVDSNGNVVGKLPN-LRTPDLLGLRVVFNNFLDDDTVLFGEFEQ-YTLVERENITIDSSTHV--KFTED 393 (425) T ss_pred HHHHHHHHHHhhcCCCCceeeccCC-CCCccccceeeEEcCcCCCccEEEEeccc-EEEEeecceEEEeeccc--ccccC Confidence 84 3 456788999999997654 44678999999999999999999999997 77888999999988764 68999 Q ss_pred cEEEEEEEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 364 MVTILAEERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 364 ~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) ++.||++.|+|+++.+|+||++++++++..- T Consensus 394 ~~~~~~~~r~d~~~~~~~a~~~~~i~~~~~g 424 (425) T protein:vir:95 394 QTAFRGKGRFDGKPVKPEAFVLVTITDPVQG 424 (425) T ss_pred ceEEEEEEeeCcEeecccceEEEEecCcCCC Confidence 9999999999999999999999999985444 No 38 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=100.00 E-value=7.4e-61 Score=350.24 Aligned_cols=367 Identities=15% Similarity=0.201 Sum_probs=274.9 Q ss_pred Cc-hHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-cccccccchhhh Q lcl|NC_019933. 1 MS-DINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGN-GAGGDVQHISIG 78 (394) Q Consensus 1 Mk-~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~-~~~~~~~~~~~~ 78 (394) |+ +|+||+++++++.++++.+.++. ..++++++.++++.|+++|+..++........ ........+... T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~---------~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 71 (392) T protein:vir:10 1 MSKELRELLAKLEGKKEEVRSLMGED---------KVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVD 71 (392) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHH---------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCcc Confidence 99 59999999999999988876542 22456667777777777776544332222111 111111111111 Q ss_pred hhhhhHHHHHHHHHHhhhhhhhh-H-HHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCcee Q lcl|NC_019933. 79 QQFVNSDSFKAMAESGGQRGRAE-I-NIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLE 156 (394) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~-~-~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 156 (394) ........+..+.+......... . ........+..+++++||++||+++.+.|++.+++.++|+++++++++++++.+ T Consensus 72 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ 151 (392) T protein:vir:10 72 GEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGS 151 (392) T ss_pred chHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCcee Confidence 11111112222221111111100 0 011122344555667899999999999999999999999999999999876655 Q ss_pred --EEEEcCcccccceecCCcccccc-ccceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019933. 157 --YVRETGFTNAAAPVAEGAQKPES-SLRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLN 232 (394) Q Consensus 157 --~~~~~~~~~~~~~~~eg~~~~~~-~~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~ 232 (394) +|+..+ .+.+.|++||+.+|++ .++|++|++.+++++++++||+|+++++ ++++++|.+.|++++++++|.+|++ T Consensus 152 ~~~~~~~~-~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~ 230 (392) T protein:vir:10 152 RVLEKNSD-MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILG 230 (392) T ss_pred EEEEeecC-CccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 455443 3578899999999976 5899999999999999999999999998 5899999999999999999999999 Q ss_pred ccCCCccccccccccccccccccccccchHHHHHHHH-HHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCc-ccC Q lcl|NC_019933. 233 GNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLAL-LQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNP-QGT 310 (394) Q Consensus 233 g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~-~~~ 310 (394) |+|++.++ +..+++++.+++ ..+...+..+++|+|||.+|..|+++||++|+|+|++. ..+ T Consensus 231 g~g~~~~~-----------------~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~ 293 (392) T protein:vir:10 231 VIEKLTKQ-----------------AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQK 293 (392) T ss_pred cccccccc-----------------CccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCC Confidence 98865432 234578888876 57888888999999999999999999999999999654 455 Q ss_pred CCceeecceEEE-cCCC-C--------cCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecc Q lcl|NC_019933. 311 LAPTLWGLPVVA-TQAM-A--------VGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRP 380 (394) Q Consensus 311 ~~~~l~G~pv~~-~~~~-p--------~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~ 380 (394) .+++|+|+|+++ ++.+ | ...++||||+.+|.++.+.+++++++++...+|++|++.||++.|+||++++| T Consensus 294 ~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~ 373 (392) T protein:vir:10 294 NKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDN 373 (392) T ss_pred ccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecc Confidence 667999986554 3322 1 12378999999999999999999999988888999999999999999999999 Q ss_pred cceEEEEecCCCCC Q lcl|NC_019933. 381 ESFIKGSLAAAAGT 394 (394) Q Consensus 381 ~a~~~l~~~~a~~~ 394 (394) +||+++++++++++ T Consensus 374 ~a~~~l~~~~~a~~ 387 (392) T protein:vir:10 374 EAAVYGEIDLSAPV 387 (392) T ss_pred cceEEEEecccccc Confidence 99999999999999 No 39 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=100.00 E-value=7.4e-61 Score=350.24 Aligned_cols=367 Identities=15% Similarity=0.201 Sum_probs=274.9 Q ss_pred Cc-hHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-cccccccchhhh Q lcl|NC_019933. 1 MS-DINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGN-GAGGDVQHISIG 78 (394) Q Consensus 1 Mk-~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~-~~~~~~~~~~~~ 78 (394) |+ +|+||+++++++.++++.+.++. ..++++++.++++.|+++|+..++........ ........+... T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~---------~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 71 (392) T protein:vir:10 1 MSKELRELLAKLEGKKEEVRSLMGED---------KVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVD 71 (392) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHH---------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCcc Confidence 99 59999999999999988876542 22456667777777777776544332222111 111111111111 Q ss_pred hhhhhHHHHHHHHHHhhhhhhhh-H-HHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCcee Q lcl|NC_019933. 79 QQFVNSDSFKAMAESGGQRGRAE-I-NIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLE 156 (394) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~-~-~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 156 (394) ........+..+.+......... . ........+..+++++||++||+++.+.|++.+++.++|+++++++++++++.+ T Consensus 72 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ 151 (392) T protein:vir:10 72 GEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGS 151 (392) T ss_pred chHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCcee Confidence 11111112222221111111100 0 011122344555667899999999999999999999999999999999876655 Q ss_pred --EEEEcCcccccceecCCcccccc-ccceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019933. 157 --YVRETGFTNAAAPVAEGAQKPES-SLRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLN 232 (394) Q Consensus 157 --~~~~~~~~~~~~~~~eg~~~~~~-~~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~ 232 (394) +|+..+ .+.+.|++||+.+|++ .++|++|++.+++++++++||+|+++++ ++++++|.+.|++++++++|.+|++ T Consensus 152 ~~~~~~~~-~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~ 230 (392) T protein:vir:10 152 RVLEKNSD-MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILG 230 (392) T ss_pred EEEEeecC-CccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 455443 3578899999999976 5899999999999999999999999998 5899999999999999999999999 Q ss_pred ccCCCccccccccccccccccccccccchHHHHHHHH-HHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCc-ccC Q lcl|NC_019933. 233 GNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLAL-LQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNP-QGT 310 (394) Q Consensus 233 g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~-~~~ 310 (394) |+|++.++ +..+++++.+++ ..+...+..+++|+|||.+|..|+++||++|+|+|++. ..+ T Consensus 231 g~g~~~~~-----------------~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~ 293 (392) T protein:vir:10 231 VIEKLTKQ-----------------AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQK 293 (392) T ss_pred cccccccc-----------------CccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCC Confidence 98865432 234578888876 57888888999999999999999999999999999654 455 Q ss_pred CCceeecceEEE-cCCC-C--------cCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecc Q lcl|NC_019933. 311 LAPTLWGLPVVA-TQAM-A--------VGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRP 380 (394) Q Consensus 311 ~~~~l~G~pv~~-~~~~-p--------~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~ 380 (394) .+++|+|+|+++ ++.+ | ...++||||+.+|.++.+.+++++++++...+|++|++.||++.|+||++++| T Consensus 294 ~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~ 373 (392) T protein:vir:10 294 NKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDN 373 (392) T ss_pred ccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecc Confidence 667999986554 3322 1 12378999999999999999999999988888999999999999999999999 Q ss_pred cceEEEEecCCCCC Q lcl|NC_019933. 381 ESFIKGSLAAAAGT 394 (394) Q Consensus 381 ~a~~~l~~~~a~~~ 394 (394) +||+++++++++++ T Consensus 374 ~a~~~l~~~~~a~~ 387 (392) T protein:vir:10 374 EAAVYGEIDLSAPV 387 (392) T ss_pred cceEEEEecccccc Confidence 99999999999999 No 40 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=100.00 E-value=7.4e-61 Score=350.24 Aligned_cols=367 Identities=15% Similarity=0.201 Sum_probs=274.9 Q ss_pred Cc-hHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-cccccccchhhh Q lcl|NC_019933. 1 MS-DINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGN-GAGGDVQHISIG 78 (394) Q Consensus 1 Mk-~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~-~~~~~~~~~~~~ 78 (394) |+ +|+||+++++++.++++.+.++. ..++++++.++++.|+++|+..++........ ........+... T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~---------~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 71 (392) T protein:vir:10 1 MSKELRELLAKLEGKKEEVRSLMGED---------KVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVD 71 (392) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHH---------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCcc Confidence 99 59999999999999988876542 22456667777777777776544332222111 111111111111 Q ss_pred hhhhhHHHHHHHHHHhhhhhhhh-H-HHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCcee Q lcl|NC_019933. 79 QQFVNSDSFKAMAESGGQRGRAE-I-NIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLE 156 (394) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~-~-~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 156 (394) ........+..+.+......... . ........+..+++++||++||+++.+.|++.+++.++|+++++++++++++.+ T Consensus 72 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ 151 (392) T protein:vir:10 72 GEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGS 151 (392) T ss_pred chHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCcee Confidence 11111112222221111111100 0 011122344555667899999999999999999999999999999999876655 Q ss_pred --EEEEcCcccccceecCCcccccc-ccceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019933. 157 --YVRETGFTNAAAPVAEGAQKPES-SLRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLN 232 (394) Q Consensus 157 --~~~~~~~~~~~~~~~eg~~~~~~-~~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~ 232 (394) +|+..+ .+.+.|++||+.+|++ .++|++|++.+++++++++||+|+++++ ++++++|.+.|++++++++|.+|++ T Consensus 152 ~~~~~~~~-~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~ 230 (392) T protein:vir:10 152 RVLEKNSD-MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILG 230 (392) T ss_pred EEEEeecC-CccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 455443 3578899999999976 5899999999999999999999999998 5899999999999999999999999 Q ss_pred ccCCCccccccccccccccccccccccchHHHHHHHH-HHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCc-ccC Q lcl|NC_019933. 233 GNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLAL-LQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNP-QGT 310 (394) Q Consensus 233 g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~-~~~ 310 (394) |+|++.++ +..+++++.+++ ..+...+..+++|+|||.+|..|+++||++|+|+|++. ..+ T Consensus 231 g~g~~~~~-----------------~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~ 293 (392) T protein:vir:10 231 VIEKLTKQ-----------------AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQK 293 (392) T ss_pred cccccccc-----------------CccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCC Confidence 98865432 234578888876 57888888999999999999999999999999999654 455 Q ss_pred CCceeecceEEE-cCCC-C--------cCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecc Q lcl|NC_019933. 311 LAPTLWGLPVVA-TQAM-A--------VGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRP 380 (394) Q Consensus 311 ~~~~l~G~pv~~-~~~~-p--------~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~ 380 (394) .+++|+|+|+++ ++.+ | ...++||||+.+|.++.+.+++++++++...+|++|++.||++.|+||++++| T Consensus 294 ~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~ 373 (392) T protein:vir:10 294 NKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDN 373 (392) T ss_pred ccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecc Confidence 667999986554 3322 1 12378999999999999999999999988888999999999999999999999 Q ss_pred cceEEEEecCCCCC Q lcl|NC_019933. 381 ESFIKGSLAAAAGT 394 (394) Q Consensus 381 ~a~~~l~~~~a~~~ 394 (394) +||+++++++++++ T Consensus 374 ~a~~~l~~~~~a~~ 387 (392) T protein:vir:10 374 EAAVYGEIDLSAPV 387 (392) T ss_pred cceEEEEecccccc Confidence 99999999999999 No 41 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=100.00 E-value=7.4e-61 Score=350.24 Aligned_cols=367 Identities=15% Similarity=0.201 Sum_probs=274.9 Q ss_pred Cc-hHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-cccccccchhhh Q lcl|NC_019933. 1 MS-DINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGN-GAGGDVQHISIG 78 (394) Q Consensus 1 Mk-~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~-~~~~~~~~~~~~ 78 (394) |+ +|+||+++++++.++++.+.++. ..++++++.++++.|+++|+..++........ ........+... T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~---------~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 71 (392) T protein:vir:10 1 MSKELRELLAKLEGKKEEVRSLMGED---------KVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVD 71 (392) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHH---------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCcc Confidence 99 59999999999999988876542 22456667777777777776544332222111 111111111111 Q ss_pred hhhhhHHHHHHHHHHhhhhhhhh-H-HHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCcee Q lcl|NC_019933. 79 QQFVNSDSFKAMAESGGQRGRAE-I-NIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLE 156 (394) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~-~-~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 156 (394) ........+..+.+......... . ........+..+++++||++||+++.+.|++.+++.++|+++++++++++++.+ T Consensus 72 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ 151 (392) T protein:vir:10 72 GEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGS 151 (392) T ss_pred chHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCcee Confidence 11111112222221111111100 0 011122344555667899999999999999999999999999999999876655 Q ss_pred --EEEEcCcccccceecCCcccccc-ccceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019933. 157 --YVRETGFTNAAAPVAEGAQKPES-SLRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLN 232 (394) Q Consensus 157 --~~~~~~~~~~~~~~~eg~~~~~~-~~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~ 232 (394) +|+..+ .+.+.|++||+.+|++ .++|++|++.+++++++++||+|+++++ ++++++|.+.|++++++++|.+|++ T Consensus 152 ~~~~~~~~-~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~ 230 (392) T protein:vir:10 152 RVLEKNSD-MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILG 230 (392) T ss_pred EEEEeecC-CccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 455443 3578899999999976 5899999999999999999999999998 5899999999999999999999999 Q ss_pred ccCCCccccccccccccccccccccccchHHHHHHHH-HHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCc-ccC Q lcl|NC_019933. 233 GNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLAL-LQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNP-QGT 310 (394) Q Consensus 233 g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~-~~~ 310 (394) |+|++.++ +..+++++.+++ ..+...+..+++|+|||.+|..|+++||++|+|+|++. ..+ T Consensus 231 g~g~~~~~-----------------~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~ 293 (392) T protein:vir:10 231 VIEKLTKQ-----------------AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQK 293 (392) T ss_pred cccccccc-----------------CccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCC Confidence 98865432 234578888876 57888888999999999999999999999999999654 455 Q ss_pred CCceeecceEEE-cCCC-C--------cCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecc Q lcl|NC_019933. 311 LAPTLWGLPVVA-TQAM-A--------VGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRP 380 (394) Q Consensus 311 ~~~~l~G~pv~~-~~~~-p--------~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~ 380 (394) .+++|+|+|+++ ++.+ | ...++||||+.+|.++.+.+++++++++...+|++|++.||++.|+||++++| T Consensus 294 ~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~ 373 (392) T protein:vir:10 294 NKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDN 373 (392) T ss_pred ccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecc Confidence 667999986554 3322 1 12378999999999999999999999988888999999999999999999999 Q ss_pred cceEEEEecCCCCC Q lcl|NC_019933. 381 ESFIKGSLAAAAGT 394 (394) Q Consensus 381 ~a~~~l~~~~a~~~ 394 (394) +||+++++++++++ T Consensus 374 ~a~~~l~~~~~a~~ 387 (392) T protein:vir:10 374 EAAVYGEIDLSAPV 387 (392) T ss_pred cceEEEEecccccc Confidence 99999999999999 No 42 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=100.00 E-value=6.2e-61 Score=350.66 Aligned_cols=386 Identities=18% Similarity=0.182 Sum_probs=273.7 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhh----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc-- Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQEL----NASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQH-- 74 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~----~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~-- 74 (394) |+ |+||++++.+..++.++.+.......++ .+..+++++++.+++..+.++++++++................ T Consensus 1 M~-l~el~~~~~~~~~~~~a~l~~~~~~~~~~~ee~~~~~~e~~~l~~~~~~l~~~i~~le~~~~~~~~~~~~~~~~~~~ 79 (434) T protein:vir:62 1 MN-LKEILNASLTRTKSRLAELQGKVEKNEVRSEELAAVKAEVEQLTKEIQTISEELAKLEEKEKEEDPAKKKDDDPEKK 79 (434) T ss_pred CC-HHHHHHHHHHHHHHHHHHHHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcchhhhh Confidence 87 7777777777666665544433332222 3334567778888888888877766554433222211110000 Q ss_pred ------hhhhhhhhhHHHHHHH-----HHHh---hhhhhhhHHHHHH------------HhhcccccCCcCccccchhhh Q lcl|NC_019933. 75 ------ISIGQQFVNSDSFKAM-----AESG---GQRGRAEINIKAA------------ITSLSTNADGSAGATVQTTRL 128 (394) Q Consensus 75 ------~~~~~~~~~~~~~~~~-----~~~~---~~~~~~~~~~~~~------------~~~~~~~~~~~~g~~ip~~~~ 128 (394) +............... .... ........+.+.. .....+.++++||++||+++. T Consensus 80 ~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~r~a~~~~l~~~~~~~e~~a~~~~t~~GG~lvP~~~~ 159 (434) T protein:vir:62 80 EDPTAKENPNEKTELSEEQRSAISASIAAALSTKGHRTNKETEIRSVFANYIVGNIDEKEARALGLVTGNGSVTIPDFLS 159 (434) T ss_pred cchhhhcchhhhHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHHHHHHhccccchhhhhhhcccccccceecchhhH Confidence 0000000000000000 0000 0000000011100 111122344678999999999 Q ss_pred hHHHhhhhhhhhHHHhccccccccCceeEEEEcCcccccc--eecCCccccccccceeeEEeeeeeEEEeehhhHHHHHH Q lcl|NC_019933. 129 PGILELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTNAAA--PVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSD 206 (394) Q Consensus 129 ~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~ 206 (394) +.|++.++++++|+++++++++++ .+++|+....+...| +.++++.+|+++++|++|++.+++++++++||+|+++| T Consensus 160 ~~Ii~~l~~~~~i~~~~~~~~~~~-~~~~p~~~~~~~a~~~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~d 238 (434) T protein:vir:62 160 KEIITYAQEENFLRRLGTGVKTKE-NIKYPVLVKKAEAQGHKNERTNNEMPETDIEFDEIELSPTEFDALATVTKKLLAR 238 (434) T ss_pred HHHHHhhhhhhhhhhhcceeccCC-ceEEEEEecCCcccceecccccccccccccceeeEEeeheeeEeehhhHHHHHhc Confidence 999999999999999999988765 488998765543333 35678899999999999999999999999999999999 Q ss_pred HH-HHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeC Q lcl|NC_019933. 207 SA-QLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLN 285 (394) Q Consensus 207 s~-~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~ 285 (394) +. ++++||.++|+++++.++|.+||+|+|+++++.|+++..+.. ...++..++++|++++..+...+..+++|+|| T Consensus 239 s~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~g~~~~~~~~---~~~~~~~~~d~l~~l~~~l~~~~~~~a~~v~n 315 (434) T protein:vir:62 239 TGLPIEQIVMDELKKAYVRKETQYMVNGDEANNINDGALAKKAVE---FKTDEKNLYDALVKMKNTPVKEVRKKARWVLN 315 (434) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccceeeccccc---ccccccchhhHHHHHHhhcchhhhcCCEEEEc Confidence 85 899999999999999999999999999999999998765443 33445668999999999999999999999999 Q ss_pred HHHHHHHHHhhccCCcccccCcc---cCCCceeecceEEEcCCCCcCc------eEEeeccceEEEEeec-ceEEEEecc Q lcl|NC_019933. 286 PADWAGIELLKDTQGRYILGNPQ---GTLAPTLWGLPVVATQAMAVGQ------FLTGAFDAGAQVFDRW-AARVEVATE 355 (394) Q Consensus 286 ~~~~~~l~~lkd~~G~~~~~~~~---~~~~~~l~G~pv~~~~~~p~~~------~~~gd~~~~~~~~~~~-~~~i~~~~~ 355 (394) |.+|.+|++|||++|+|+|++.. ++.+++|+|+||++++.+|.+. ++||||+.+ +++++. .++++.+. T Consensus 316 ~~~~~~L~~lkd~~G~~l~~~~~~~~~g~~~tl~G~pV~~~~~~~~~~~~~~~~i~~Gdfs~~-~i~~~~g~~~i~~~~- 393 (434) T protein:vir:62 316 TAALTKIETMKTDDGFPLLRPFNQAEGGIGYTLLGFPVEEEDAIDIPDSPDTPVFYFGDFSKF-YIQDVIGSLEVQKLV- 393 (434) T ss_pred HHHHHHHHHhhccCCCEeeccCCCccCCCCceecceeeEEecCccCccCCCceEEEEeeccce-EEEEeeceeEEEeeh- Confidence 99999999999999999997533 3445689999999999998654 789999964 566665 45566654 Q ss_pred cchhhhcCcEEEEEEEEeccEEec-ccceEEEE--ecCCCCC Q lcl|NC_019933. 356 NQDDFIKNMVTILAEERLALAVYR-PESFIKGS--LAAAAGT 394 (394) Q Consensus 356 ~~~~~~~~~~~~~~~~~~d~~v~~-~~a~~~l~--~~~a~~~ 394 (394) ..+|.+|++.||++.|+|+++.+ |.++++++ .++|++. T Consensus 394 -~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:62 394 -ELFSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYVLKAPTGA 434 (434) T ss_pred -hhhcccCceEEEEEeeecceeecCcccceEEEEEeccCCCC Confidence 45678999999999999999776 88877665 4455444 No 43 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=100.00 E-value=1.7e-60 Score=348.31 Aligned_cols=369 Identities=14% Similarity=0.160 Sum_probs=269.0 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhh-----hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccch Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQE-----LNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQHI 75 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~-----~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~ 75 (394) |. |+||++++.++.++++++.++..+... ...+..++++++.++++.+++..+..+.................. T Consensus 1 M~-~~eL~~~~~~~~~~~~~l~e~~~~~~~~~~~~~~~~~~ee~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (395) T protein:vir:38 1 MN-INQLKDAFDMAGQKVQDLEDKRAQFAIDLGNDASSHSVDDINKLNASLKNAKMAQELAKSAYEDARANLNAEPVNKK 79 (395) T ss_pred CC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccc Confidence 76 688888888888888887766543221 112222334444444444444433333222222211111111111 Q ss_pred hhhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCce Q lcl|NC_019933. 76 SIGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTL 155 (394) Q Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~ 155 (394) ... ..........+..... ...+.... ....++++||++||+++.++|++.+++.++|+++|+++|++++.. T Consensus 80 ~~~-~~~~~~~~~~~~~~~~------~~~~~~~~-~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~ 151 (395) T protein:vir:38 80 PLP-VKDGKPDAQAMKNQFV------KDFKNLVT-SGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVENVTTSHG 151 (395) T ss_pred ccc-hhhhhHHHHHHHHHHH------HHHHHHHh-hccCccCCCceecchhHhhHHHHHHHhhcchhhhcceeeccCCcc Confidence 110 0011111111111110 11111111 233445678999999999999999999999999999999886555 Q ss_pred eE--EEEcCcccccceecCCcccccc-ccceeeEEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019933. 156 EY--VRETGFTNAAAPVAEGAQKPES-SLRFDLVQTSAKVIAHWMKASRQILSDSA-QLQSFINARLLRGLEVVEENQLL 231 (394) Q Consensus 156 ~~--~~~~~~~~~~~~~~eg~~~~~~-~~~~~~i~~~~~k~~~~~~is~e~l~~s~-~~~~~i~~~la~a~~~~~d~a~l 231 (394) .+ ++..+..+.++|++||+.+|++ .++|++|++++++++++++||+|+++|++ +++++|.+.|++++++++|.+|+ T Consensus 152 ~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il 231 (395) T protein:vir:38 152 SRVYEKLADITPLKDLDDESALIGDNDDPELTVVKYLIHRYAGITTVTNTLLKDTVDNIIQWLVNWAAKKDVVTRNAKIL 231 (395) T ss_pred eEEEEeeccCCccccccccccccccccccceeeEEeeeeeeEeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 54 4555555678999999999976 58999999999999999999999999985 89999999999999999999999 Q ss_pred hccCCCccccccccccccccccccccccchHHHHHHHHH-HhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCc-cc Q lcl|NC_019933. 232 NGNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLALL-QAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNP-QG 309 (394) Q Consensus 232 ~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~-~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~-~~ 309 (394) +|+|++.++.++ .+++++.+++. .+...+..+++|+|||.+|..|++++|++|+|+|++. .+ T Consensus 232 ~g~g~~~~~~~~----------------~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~ 295 (395) T protein:vir:38 232 EVMGKAPKKPTI----------------SQFDNIKDLENNTLDPAIESTSSFITNQSGYNILSKVKDADGRYLMQPDVTS 295 (395) T ss_pred hccccccccccc----------------ccHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceeeccCcCC Confidence 999987765432 24567776654 6788888999999999999999999999999999654 45 Q ss_pred CCCceeecceEEEcCCCCc------CceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccce Q lcl|NC_019933. 310 TLAPTLWGLPVVATQAMAV------GQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESF 383 (394) Q Consensus 310 ~~~~~l~G~pv~~~~~~p~------~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~ 383 (394) +.+++|+|+||+++++++. ..++||||+.+|.++++.+++++++++.+.+|.+|++.||++.|+|+++.+|+|| T Consensus 296 ~~~~~l~G~pV~~~~~~~~~~~~~~~~i~~gd~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~ 375 (395) T protein:vir:38 296 PDKYLIDGKPVIRIADKWLPDVSGSHPLYFGDLKQGITLFDRQQMQIDTTNVGAGSFEHDTTKLRFIDRFDVQLIDDGAF 375 (395) T ss_pred CCcceeccceeEEecccccCcCCCcceEEEEeccccEEEEEecceEEEEeccccchhhcCceEEEEEEeeccEEecccce Confidence 5567999999999876432 3479999999899999999999999988888999999999999999999999999 Q ss_pred EEEEecCCCCC Q lcl|NC_019933. 384 IKGSLAAAAGT 394 (394) Q Consensus 384 ~~l~~~~a~~~ 394 (394) +++++++++.. T Consensus 376 ~~~~~~~~~~~ 386 (395) T protein:vir:38 376 AAASFKTVANQ 386 (395) T ss_pred EEEEeecccCC Confidence 99999877655 No 44 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=100.00 E-value=7e-60 Score=344.90 Aligned_cols=368 Identities=17% Similarity=0.211 Sum_probs=266.5 Q ss_pred Cc-hHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHhhcccccc Q lcl|NC_019933. 1 MS-DINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRI-------AEVEGNGAGGDV 72 (394) Q Consensus 1 Mk-~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~-------~~~~~~~~~~~~ 72 (394) |+ +|+||+++++++.++++.+.++...++ .+...++++++.++++.+++..+...... ............ T Consensus 5 m~k~l~el~~~~~~~~~~~~~~~~~~~~ee--~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 82 (397) T protein:vir:12 5 MSKKEIALRQQFTEKKQQADKALQEGNTDE--ARALLDEVKQLKNQIELMTEGRSLDVPDLPGGVNFVPEQERNPEGQRS 82 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccc Confidence 44 588999999999888888765443221 12222333333333333332222111111 111111111111 Q ss_pred cchhhhhhhhhHHHHHHHHHHhhhhhhhhHHHHH-----HHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccc Q lcl|NC_019933. 73 QHISIGQQFVNSDSFKAMAESGGQRGRAEINIKA-----AITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQ 147 (394) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~ 147 (394) ..... ..........+... ........+.+. ....+..+++++||++||+++.+.|++.+++.++|++++++ T Consensus 83 ~~~~~--~~~~~~~~~a~~~~-~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~~~ 159 (397) T protein:vir:12 83 QGQGN--EERQQQYSKAFLKG-LRGKRLTDEERDLLDSPEFRAMSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQYVTV 159 (397) T ss_pred ccchh--hHHHHHHHHHHHHH-HhccCCcHHHHHHHhhhhhhhccccccccCcccCchhHHHHHHHhhhhhhhHHhhcce Confidence 11111 00111111111111 111111112221 23345566778899999999999999999999999999999 Q ss_pred cccccCc--eeEEEEcCcccccceecCCcccccc-ccceeeEEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHH Q lcl|NC_019933. 148 GTMEGNT--LEYVRETGFTNAAAPVAEGAQKPES-SLRFDLVQTSAKVIAHWMKASRQILSDSA-QLQSFINARLLRGLE 223 (394) Q Consensus 148 ~~~~~~~--~~~~~~~~~~~~~~~~~eg~~~~~~-~~~~~~i~~~~~k~~~~~~is~e~l~~s~-~~~~~i~~~la~a~~ 223 (394) +|+++++ +.+|+..+. +.+.|++||+.+|++ .++|+.|++.+++++++++||+|+++|++ +++++|.+.|+++++ T Consensus 160 ~~~~~~~~~~~~~~~~~~-~~a~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~is~e~l~ds~~~l~~~i~~~l~~~~~ 238 (397) T protein:vir:12 160 EPVTTRSGTRLLEKNADM-VPFSPVEELGNLPEIDQPRFTKVSYSIIDYGGIMTLSNSMLNDSDQAIMTYVAKWFAKKSV 238 (397) T ss_pred eeccCCceeEEEEEecCC-cceeeecccccccccccccceeEEeeheeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHH Confidence 9988644 455665554 578899999999975 68999999999999999999999999985 899999999999999 Q ss_pred HHHHHHHhhccCCCccccccccccccccccccccccchHHHHHHHH-HHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcc Q lcl|NC_019933. 224 VVEENQLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLAL-LQAQLAEFPATGIVLNPADWAGIELLKDTQGRY 302 (394) Q Consensus 224 ~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~ 302 (394) +++|.+|++|+|++.+ .|+ .+++++.+++ ..++..+..+++|+|||.+|.+|+++||++|+| T Consensus 239 ~~~d~~il~G~g~~~~-~g~----------------~~~~~i~~~~~~~l~~~~~~~a~~~~n~~~~~~L~~lkd~~G~~ 301 (397) T protein:vir:12 239 VTRNNLILAAIASLKK-VDI----------------DGLDGIKKALNVTLDPMVAPGSIVLTNQDGYDWLDTLKDGTGRY 301 (397) T ss_pred HHHHHHHHhccccccc-ccc----------------ccHHHHHHHHhhccchhhhCCCEEEEcHHHHHHHHHhhccCCce Confidence 9999999999987653 232 3467788766 588889999999999999999999999999999 Q ss_pred cccC-cccCCCceeecceEEEcCC-CCc-----CceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEecc Q lcl|NC_019933. 303 ILGN-PQGTLAPTLWGLPVVATQA-MAV-----GQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLAL 375 (394) Q Consensus 303 ~~~~-~~~~~~~~l~G~pv~~~~~-~p~-----~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~ 375 (394) +|++ ..++.+++|+|+||++++. +|. ..++||||+.++.++++.+++++++++.+.+|.+|++.||++.|+|| T Consensus 302 l~~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~ 381 (397) T protein:vir:12 302 LLQPDPTNPTKKLLDGRPVVPFTNRVLKTQKGKAPLIIGNLKEAIVLFDREQQSIASTDTGAGAFETNSTKVRGIEREDV 381 (397) T ss_pred eecccccCCCCccccceeeEEecccccccCCCccEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeecc Confidence 9975 4455667999999987665 332 23899999998999999999999999989899999999999999999 Q ss_pred EEecccceEEEEecCC Q lcl|NC_019933. 376 AVYRPESFIKGSLAAA 391 (394) Q Consensus 376 ~v~~~~a~~~l~~~~a 391 (394) ++++|+||+++++++- T Consensus 382 ~~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 382 RKWDEDAVVFGQITVE 397 (397) T ss_pred EEecccceEEEEEeeC Confidence 9999999999999988 No 45 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=100.00 E-value=1.7e-60 Score=348.26 Aligned_cols=372 Identities=15% Similarity=0.136 Sum_probs=265.9 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhh-------hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQ-------ELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQ 73 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~-------~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~ 73 (394) ||+|.||++++.++.++++++.++..... +...+.+.+++.+.+++++++.++++++................ T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:26 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCC Confidence 99999999999999999888776543221 11223344455555555555555554443332221111111111 Q ss_pred chhhhhhhhhHHHHHHHHHHhh---hhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhcccccc Q lcl|NC_019933. 74 HISIGQQFVNSDSFKAMAESGG---QRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTM 150 (394) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~ 150 (394) ....... ...+..+.+... ..................+++++||++||+++.++|++.++++++|+++++++++ T Consensus 81 ~~~~~~~---~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~ 157 (387) T protein:vir:26 81 LSDNEKM---VKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNI 157 (387) T ss_pred CchhHHH---HHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeec Confidence 0000010 011111111111 1111111122233455667778899999999999999999999999999999888 Q ss_pred ccCceeEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 151 EGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQ 229 (394) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a 229 (394) ++ ..+|+.......++|++||+.+++++++|+++++.+++++++++||+|+++|| +++++||.++|+++++++++.. T Consensus 158 ~~--~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~ 235 (387) T protein:vir:26 158 KG--LEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKD 235 (387) T ss_pred CC--ceeeeeeccCCccccccccccccccccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHh Confidence 75 46777665556789999999999999999999999999999999999999998 5999999999999999997765 Q ss_pred Hh-hccCCCccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcc Q lcl|NC_019933. 230 LL-NGNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQ 308 (394) Q Consensus 230 ~l-~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~ 308 (394) +| .|+|++ .|.|++...+... .++..++++|++++..++..|..+++|+||+.++..+.++++..|+|++. T Consensus 236 ~~~~g~g~g-~~~g~~~~~~~~~----~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~--- 307 (387) T protein:vir:26 236 ALAVSPKSG-LEHMSFYNGSVKE----VEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD--- 307 (387) T ss_pred HhhcCCCcc-ccceeeecccccc----ccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc--- Confidence 44 555544 4667765544332 23455789999999999999999999999999998877776667777774 Q ss_pred cCCCceeecceEEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEe Q lcl|NC_019933. 309 GTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSL 388 (394) Q Consensus 309 ~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~ 388 (394) +.+.+|+|+||++++.++ +++||||+.+|..++ ++.+...++ ..+|++.|++..|+|+++++|+||+++++ T Consensus 308 -~~~~~llG~PV~~~~~~~--~~~~GDf~~~~~~~~--~~~~~~~~~----~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ 378 (387) T protein:vir:26 308 -TPAEKVFGKPVVFTDAAV--KPIVGDFNYFGINYD--GTTYDTDKD----VKKGEYLFVLTAWYDQQRTLDSAFRIAKA 378 (387) T ss_pred -cCCccccccceEEecCCC--ceeeechhhhhhhhh--hhhheeccc----ccCCceEEEEEEEeCcEeechhheEEEEe Confidence 234589999999999865 579999998765543 344443332 34789999999999999999999999999 Q ss_pred cCCCCC Q lcl|NC_019933. 389 AAAAGT 394 (394) Q Consensus 389 ~~a~~~ 394 (394) ++++++ T Consensus 379 ka~~~~ 384 (387) T protein:vir:26 379 KENTGP 384 (387) T ss_pred ecCCCC Confidence 888888 No 46 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=100.00 E-value=1.7e-60 Score=348.26 Aligned_cols=372 Identities=15% Similarity=0.136 Sum_probs=265.9 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhh-------hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQ-------ELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQ 73 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~-------~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~ 73 (394) ||+|.||++++.++.++++++.++..... +...+.+.+++.+.+++++++.++++++................ T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:96 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCC Confidence 99999999999999999888776543221 11223344455555555555555554443332221111111111 Q ss_pred chhhhhhhhhHHHHHHHHHHhh---hhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhcccccc Q lcl|NC_019933. 74 HISIGQQFVNSDSFKAMAESGG---QRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTM 150 (394) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~ 150 (394) ....... ...+..+.+... ..................+++++||++||+++.++|++.++++++|+++++++++ T Consensus 81 ~~~~~~~---~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~ 157 (387) T protein:vir:96 81 LSDNEKM---VKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNI 157 (387) T ss_pred CchhHHH---HHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeec Confidence 0000010 011111111111 1111111122233455667778899999999999999999999999999999888 Q ss_pred ccCceeEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 151 EGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQ 229 (394) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a 229 (394) ++ ..+|+.......++|++||+.+++++++|+++++.+++++++++||+|+++|| +++++||.++|+++++++++.. T Consensus 158 ~~--~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~ 235 (387) T protein:vir:96 158 KG--LEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKD 235 (387) T ss_pred CC--ceeeeeeccCCccccccccccccccccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHh Confidence 75 46777665556789999999999999999999999999999999999999998 5999999999999999997765 Q ss_pred Hh-hccCCCccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcc Q lcl|NC_019933. 230 LL-NGNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQ 308 (394) Q Consensus 230 ~l-~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~ 308 (394) +| .|+|++ .|.|++...+... .++..++++|++++..++..|..+++|+||+.++..+.++++..|+|++. T Consensus 236 ~~~~g~g~g-~~~g~~~~~~~~~----~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~--- 307 (387) T protein:vir:96 236 ALAVSPKSG-LEHMSFYNGSVKE----VEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD--- 307 (387) T ss_pred HhhcCCCcc-ccceeeecccccc----ccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc--- Confidence 44 555544 4667765544332 23455789999999999999999999999999998877776667777774 Q ss_pred cCCCceeecceEEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEe Q lcl|NC_019933. 309 GTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSL 388 (394) Q Consensus 309 ~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~ 388 (394) +.+.+|+|+||++++.++ +++||||+.+|..++ ++.+...++ ..+|++.|++..|+|+++++|+||+++++ T Consensus 308 -~~~~~llG~PV~~~~~~~--~~~~GDf~~~~~~~~--~~~~~~~~~----~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ 378 (387) T protein:vir:96 308 -TPAEKVFGKPVVFTDAAV--KPIVGDFNYFGINYD--GTTYDTDKD----VKKGEYLFVLTAWYDQQRTLDSAFRIAKA 378 (387) T ss_pred -cCCccccccceEEecCCC--ceeeechhhhhhhhh--hhhheeccc----ccCCceEEEEEEEeCcEeechhheEEEEe Confidence 234589999999999865 579999998765543 344443332 34789999999999999999999999999 Q ss_pred cCCCCC Q lcl|NC_019933. 389 AAAAGT 394 (394) Q Consensus 389 ~~a~~~ 394 (394) ++++++ T Consensus 379 ka~~~~ 384 (387) T protein:vir:96 379 KENTGP 384 (387) T ss_pred ecCCCC Confidence 888888 No 47 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=100.00 E-value=1.7e-60 Score=348.26 Aligned_cols=372 Identities=15% Similarity=0.136 Sum_probs=265.9 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhh-------hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQ-------ELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQ 73 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~-------~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~ 73 (394) ||+|.||++++.++.++++++.++..... +...+.+.+++.+.+++++++.++++++................ T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:94 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCC Confidence 99999999999999999888776543221 11223344455555555555555554443332221111111111 Q ss_pred chhhhhhhhhHHHHHHHHHHhh---hhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhcccccc Q lcl|NC_019933. 74 HISIGQQFVNSDSFKAMAESGG---QRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTM 150 (394) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~ 150 (394) ....... ...+..+.+... ..................+++++||++||+++.++|++.++++++|+++++++++ T Consensus 81 ~~~~~~~---~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~ 157 (387) T protein:vir:94 81 LSDNEKM---VKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNI 157 (387) T ss_pred CchhHHH---HHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeec Confidence 0000010 011111111111 1111111122233455667778899999999999999999999999999999888 Q ss_pred ccCceeEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 151 EGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQ 229 (394) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a 229 (394) ++ ..+|+.......++|++||+.+++++++|+++++.+++++++++||+|+++|| +++++||.++|+++++++++.. T Consensus 158 ~~--~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~ 235 (387) T protein:vir:94 158 KG--LEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKD 235 (387) T ss_pred CC--ceeeeeeccCCccccccccccccccccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHh Confidence 75 46777665556789999999999999999999999999999999999999998 5999999999999999997765 Q ss_pred Hh-hccCCCccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcc Q lcl|NC_019933. 230 LL-NGNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQ 308 (394) Q Consensus 230 ~l-~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~ 308 (394) +| .|+|++ .|.|++...+... .++..++++|++++..++..|..+++|+||+.++..+.++++..|+|++. T Consensus 236 ~~~~g~g~g-~~~g~~~~~~~~~----~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~--- 307 (387) T protein:vir:94 236 ALAVSPKSG-LEHMSFYNGSVKE----VEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD--- 307 (387) T ss_pred HhhcCCCcc-ccceeeecccccc----ccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc--- Confidence 44 555544 4667765544332 23455789999999999999999999999999998877776667777774 Q ss_pred cCCCceeecceEEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEe Q lcl|NC_019933. 309 GTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSL 388 (394) Q Consensus 309 ~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~ 388 (394) +.+.+|+|+||++++.++ +++||||+.+|..++ ++.+...++ ..+|++.|++..|+|+++++|+||+++++ T Consensus 308 -~~~~~llG~PV~~~~~~~--~~~~GDf~~~~~~~~--~~~~~~~~~----~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ 378 (387) T protein:vir:94 308 -TPAEKVFGKPVVFTDAAV--KPIVGDFNYFGINYD--GTTYDTDKD----VKKGEYLFVLTAWYDQQRTLDSAFRIAKA 378 (387) T ss_pred -cCCccccccceEEecCCC--ceeeechhhhhhhhh--hhhheeccc----ccCCceEEEEEEEeCcEeechhheEEEEe Confidence 234589999999999865 579999998765543 344443332 34789999999999999999999999999 Q ss_pred cCCCCC Q lcl|NC_019933. 389 AAAAGT 394 (394) Q Consensus 389 ~~a~~~ 394 (394) ++++++ T Consensus 379 ka~~~~ 384 (387) T protein:vir:94 379 KENTGP 384 (387) T ss_pred ecCCCC Confidence 888888 No 48 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=100.00 E-value=1.6e-59 Score=342.87 Aligned_cols=379 Identities=16% Similarity=0.159 Sum_probs=254.0 Q ss_pred CchHHH-------------HHHHH----HHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH---- Q lcl|NC_019933. 1 MSDINA-------------INSTL----ANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQR---- 59 (394) Q Consensus 1 Mk~i~e-------------l~~~~----~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~---- 59 (394) |.++.+ +.+.+ .+...++.+..++ ...|.++++++..++++.+.+++++..+. T Consensus 14 ~~e~a~~~~~~~~~~k~~e~~~~~ke~~~~~l~~~~e~~~k------~~~E~~~~le~~~ee~k~l~ee~~~~~~~~a~~ 87 (458) T protein:vir:10 14 LGDLAKSLEGLTAAQKAQEAERMRKEQEEKELARMNDLVSK------AVGEDRKRLEEALELVKSLDEKSKKSNELFAQT 87 (458) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111111 11100 0001111111111 11112222222222222222221111110 Q ss_pred ------------------HHHHHhhcccccccch---hhhhhhhhHHHHHHHHHHhhhhhhhhHH--HHHHHhhcccccC Q lcl|NC_019933. 60 ------------------IAEVEGNGAGGDVQHI---SIGQQFVNSDSFKAMAESGGQRGRAEIN--IKAAITSLSTNAD 116 (394) Q Consensus 60 ------------------~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~ 116 (394) ....+..........+ ..............+............+ .+........++. T Consensus 88 ~e~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~ 167 (458) T protein:vir:10 88 VEKQQETIVGLQDEIKSLLTAREGRSFVGDSVAKALYGTQENFEDEVEKLVLLSYVMEKGVFETEHGQRHLKAVNQSSSV 167 (458) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccchhhhhhHHHHHHHHHHHHHHHhhccchhhhhhhhhhhhhhcccC Confidence 0000000000000000 0000000111111111111111111111 1111122233445 Q ss_pred CcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEcCcccccceecCCcccccc------ccceeeEEeee Q lcl|NC_019933. 117 GSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTNAAAPVAEGAQKPES------SLRFDLVQTSA 190 (394) Q Consensus 117 ~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~------~~~~~~i~~~~ 190 (394) +.||.++|+++.+.|++.+++.++|+++++++|++++.+.+|+..+. +.+.|++|++.++++ +++|+++++.+ T Consensus 168 ~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~-~~a~~v~e~~~~~~~~~~~~~~~~~~~i~~~~ 246 (458) T protein:vir:10 168 EVSSESYETIFSQRIIRDLQKELVVGALFEELPMSSKILTMLVEPDA-GKATWVAASTYGTDTTTGEEVKGALKEIHFST 246 (458) T ss_pred ccccceehhhHhHHHHHHHHhhhhHHhhcceeecCCcceEEEEecCC-cceeecccccccccccccccccccceeeEeee Confidence 67899999999999999999999999999999999998999997764 678999999888864 57899999999 Q ss_pred eeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccc------cccccccchHH Q lcl|NC_019933. 191 KVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAA------PITVANATAVD 263 (394) Q Consensus 191 ~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~------~~~~~~~~~~~ 263 (394) ++++++++||+++++|+ +++++||.++|++++++++|.+||+|+|++ .|.||++.++.... +....+..+++ T Consensus 247 ~k~~~~v~is~ell~ds~~~~~~~i~~~l~~~i~~~~d~~~l~G~G~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (458) T protein:vir:10 247 YKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMTGDGSG-KPKGLLTLASEDSAKVVTEAKADGSVLVTAK 325 (458) T ss_pred eeEEeeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcCCCCC-ccceeeecccccccceeecccccccccccHH Confidence 99999999999999998 589999999999999999999999999974 68899887654332 22233455799 Q ss_pred HHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcc-----cCCCceeecceEEEcCCCCcC----ceEEe Q lcl|NC_019933. 264 RLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQ-----GTLAPTLWGLPVVATQAMAVG----QFLTG 334 (394) Q Consensus 264 ~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~-----~~~~~~l~G~pv~~~~~~p~~----~~~~g 334 (394) +|++++..+...+..+++|+|||.+|..|++++|++|+|+++... .+.+++|+|+||++++.||++ .++|| T Consensus 326 ~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~~~~~~ 405 (458) T protein:vir:10 326 TISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDSVKLQGQVGRIYGLPVVVSEYFPAKANSAEFAVI 405 (458) T ss_pred HHHHHHHhhhhhhcCCCEEEEcHHHHHHHHhhcccCCceeeccccccccccCcCceecceeeEEccccccccCCcceEEE Confidence 999999999999999999999999999999999999999986432 233468999999999999874 57899 Q ss_pred eccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|NC_019933. 335 AFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAA 391 (394) Q Consensus 335 d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a 391 (394) ||+++|.++++.++++..+++ +.+|++.||++.|+|+.+++|+||++.+++++ T Consensus 406 ~f~~~~~~~~~~~~~v~~d~~----~~~~~~~~~~~~r~~~~v~~~~a~v~~~~aa~ 458 (458) T protein:vir:10 406 VYKDNFVMPRQRAVTVERERQ----AGKQRDAYYVTQRVNLQRYFANGVVSGTYAAS 458 (458) T ss_pred EecccEEEEEeeceEEEeecc----cCCCceEEEEEEEecceEecccceEEEeeccC Confidence 999889999999998876543 45889999999999999999999999999988 No 49 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=100.00 E-value=1.1e-59 Score=343.93 Aligned_cols=369 Identities=13% Similarity=0.087 Sum_probs=270.8 Q ss_pred Cc-hHHHHHHHHHHHHHHHHHHHHHHHhhhh-----hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Q lcl|NC_019933. 1 MS-DINAINSTLANISDSLKAHADRAVKDQE-----LNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQH 74 (394) Q Consensus 1 Mk-~i~el~~~~~~~~~~~k~~~e~~~~~~~-----~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~ 74 (394) |+ +|++|+++++++.++.+++.++.....+ ...+..++++++.++++.++++++..+................. T Consensus 3 ~~e~lkel~~~~~el~~~~~~~~~~~~~~~~e~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 82 (421) T protein:vir:13 3 LFERLKELRAKKKELEEKRCGIVEEIRSLAKEKKEEEARSKALEREKIEARMEIIEEEIESVMTAIDEERKNTNFTGGRV 82 (421) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Confidence 44 4888999998888888887776654322 12222333444444444444444433333222221111111111 Q ss_pred hh--hhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhcccccccc Q lcl|NC_019933. 75 IS--IGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEG 152 (394) Q Consensus 75 ~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~ 152 (394) .. .............+.. .........+.++ ..++++||++||+++...|++.+++.++|+++|+++|+++ T Consensus 83 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~ra------~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~l~~~~~~~~ 155 (421) T protein:vir:13 83 IINGDSKEEKRSLQLSAMSK-TIRGIQLSEEERD------IMSSTNNGAVIPQEFVNEFEKLKEGYPSLKEHCHVIPVNR 155 (421) T ss_pred ccccchhHHHHHHHHHHHHH-hhhccchhHHHhh------ccccCCcceecchhhHHHHHHHHHhhhhhhhhceeeeccC Confidence 11 1111111111111111 1111111122222 2334568999999999999999999999999999999999 Q ss_pred CceeEEEEcCccc-ccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 153 NTLEYVRETGFTN-AAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSA-QLQSFINARLLRGLEVVEENQL 230 (394) Q Consensus 153 ~~~~~~~~~~~~~-~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~-~~~~~i~~~la~a~~~~~d~a~ 230 (394) +++++|+...... ...|++|++.+|+++++|++|++.+++++++++||+|+++|++ ++++||.++|++++..++|..+ T Consensus 156 ~~~~~~~~~~~~~~~~~~~~E~~~~~~s~~~f~~i~~~~~k~~~~v~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i 235 (421) T protein:vir:13 156 NAGKMPVRAGASVDKLANLAKDTELVKAMLKTQPMAYDIDDYGLLAPIDNSLLEDSEINFLEFVNEEFAEFAVNTENAEI 235 (421) T ss_pred CceEEEEeecCCccceeeccccccccccccceeEEEeeeeeeEeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHhhhhH Confidence 9999998776533 3567999999999999999999999999999999999999985 7999999999999999999888 Q ss_pred hhccCCCccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcccC Q lcl|NC_019933. 231 LNGNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQGT 310 (394) Q Consensus 231 l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~ 310 (394) ++ .|.|+++. ++..++++|.+++..+..+++.+++|+|||.+|.+|++++|++|+|+|+++..+ T Consensus 236 ~~------~~~g~~~~----------~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~ 299 (421) T protein:vir:13 236 VK------QAKAVLAE----------ETINDYAGLVKTINSLVPNARKRAIIVTNSDGRAYLDGLMDKQGRPLLKELSDG 299 (421) T ss_pred hh------hhhhcccc----------ccccchHHHHHHHHHhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceeecCcCCC Confidence 74 25666543 233468999999999999999999999999999999999999999999888777 Q ss_pred CCceeecceEEEcCCCCcC-----ceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEE Q lcl|NC_019933. 311 LAPTLWGLPVVATQAMAVG-----QFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIK 385 (394) Q Consensus 311 ~~~~l~G~pv~~~~~~p~~-----~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~ 385 (394) ++++|+|+||++++++|.+ .++||||+.+|.++++.+++++++++. +|.+|++.+|+..|+|+++.+++||+. T Consensus 300 ~~~tl~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~--~f~~~~~~~r~~~r~d~~~~~~~a~~~ 377 (421) T protein:vir:13 300 GDLVFKGRPVIELEESIFDVGDETKFIVSDFKTLIKFMDRKQYLIDQSKEA--GYTKNETIARIIERFDVNSPLDKSSDA 377 (421) T ss_pred CCceecceeeEEeccccccCCCceEEEEEeccccEEEEEecceEEEeeccc--ccccCeeEEEEEeeecceeecchhhhe Confidence 8889999999999998854 368999999999999999999987764 699999999999999999999999866 Q ss_pred EEecCCCCC Q lcl|NC_019933. 386 GSLAAAAGT 394 (394) Q Consensus 386 l~~~~a~~~ 394 (394) ++....+.. T Consensus 378 ~~~~~~~a~ 386 (421) T protein:vir:13 378 EKIRKFGVI 386 (421) T ss_pred eeeccccee Confidence 654432222 No 50 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=100.00 E-value=4.3e-60 Score=346.07 Aligned_cols=372 Identities=15% Similarity=0.136 Sum_probs=265.2 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhh---h----hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQ---E----LNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQ 73 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~---~----~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~ 73 (394) ||+|.||++++.++.++++++.++..... + ...+.+.+++.+.++++++++++++++................ T Consensus 16 mk~l~el~~~~~e~~~~~~~~~~el~~~~~~~~~~~ee~~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 95 (402) T protein:vir:93 16 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQS 95 (402) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCC Confidence 99999999999999998888766543321 1 1223344455555555566665555544333222211111111 Q ss_pred chhhhhhhhhHHHHHHHHHHhhhh---hhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhcccccc Q lcl|NC_019933. 74 HISIGQQFVNSDSFKAMAESGGQR---GRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTM 150 (394) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~ 150 (394) ....+. ....+..+.+..... .............+..+++++||++||+++...|++.++++++|+++|+++++ T Consensus 96 -~~~~~~--~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~t~~~GG~lIP~~~~~~Ii~~~~~~~~l~~~~~v~~~ 172 (402) T protein:vir:93 96 -LSDNEK--MVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNI 172 (402) T ss_pred -CchhHH--HHHHHHHHHHHHHhhhhHHHHHHhHHHHHhhhccCCCcCCccccchhHHHHHHHhHHhhhhhhhhceeeec Confidence 010000 001111111111111 11111122234556677778899999999999999999999999999999888 Q ss_pred ccCceeEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 151 EGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQ 229 (394) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a 229 (394) ++ ..+|+.......++|++||+.+|+++++|++|++.+++++++++||+|+++|| .++++||.++|+++++++++.. T Consensus 173 ~~--~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~i~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~ 250 (402) T protein:vir:93 173 KG--LEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKD 250 (402) T ss_pred CC--ceeeeeeccCCccccccccccccccccccceeeecceeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHh Confidence 65 45777665556789999999999999999999999999999999999999998 5899999999999999998765 Q ss_pred H-hhccCCCccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcc Q lcl|NC_019933. 230 L-LNGNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQ 308 (394) Q Consensus 230 ~-l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~ 308 (394) + ..|+|++ .|.|++..++... .++...+++|++++.+++..|..+++|+||+.++..+..+++..|++++. T Consensus 251 ~~~~g~g~g-~p~g~~~~~~~~~----~~~~~~~d~l~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~d~~~~~~~--- 322 (402) T protein:vir:93 251 ALAVSPKSG-LEHMSFYNGSVKE----VEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD--- 322 (402) T ss_pred HhhcCCCcc-ccceeeecccccc----ccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc--- Confidence 4 4555544 5677776544333 23455689999999999999999999999999988877666666777763 Q ss_pred cCCCceeecceEEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEe Q lcl|NC_019933. 309 GTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSL 388 (394) Q Consensus 309 ~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~ 388 (394) +.+.+|+|+||++++.++ +++||||+.+|.++++ +.+.... +..+|++.|++..|+|+++.+|+||+++++ T Consensus 323 -~~~~~llG~PV~~t~~~~--~i~~GDf~~~~~~~~~--~~~~~~~----~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~i 393 (402) T protein:vir:93 323 -TPAEKVFGKPVVFTDAAV--KPIVGDFNYFGINYDG--TTYDTDK----DVKKGEYLFVLTAWYDQQRTLDSAFRIAKA 393 (402) T ss_pred -cCCccccccceEEecCCC--ceeeechhhhhhhhhh--hhhhhhh----cccCCceEEEEEEEeCcEEechhheEEEEe Confidence 234589999999999865 5799999987765543 3333332 234689999999999999999999999999 Q ss_pred cCCCCC Q lcl|NC_019933. 389 AAAAGT 394 (394) Q Consensus 389 ~~a~~~ 394 (394) ++++++ T Consensus 394 k~~~~~ 399 (402) T protein:vir:93 394 KENTGP 399 (402) T ss_pred ecCCCC Confidence 888887 No 51 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=100.00 E-value=1.9e-59 Score=342.54 Aligned_cols=371 Identities=15% Similarity=0.153 Sum_probs=260.9 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhh---h----hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQ---E----LNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQ 73 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~---~----~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~ 73 (394) ||+|.||++++.++.++++.+.++..... + ...+.+.+++.+.++++.++.++++++................ T Consensus 1 Mk~l~el~~~~~e~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:93 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVKDIEEKEKAKVKDTGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccCCC Confidence 99999999999999998888766543322 1 1222333444445555555555444332222111111111110 Q ss_pred chhhhhhhhhHHHHHHHHHHhhh---hhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhcccccc Q lcl|NC_019933. 74 HISIGQQFVNSDSFKAMAESGGQ---RGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTM 150 (394) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~ 150 (394) ....+. ....+..+.+.... ........+.....+..+++++||++||+++.+.|++.++++++|+++++++++ T Consensus 81 -~~~~~~--~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~al~~~t~s~gG~~IP~~~~~~Ii~~~~~~~~l~~~~~v~~~ 157 (387) T protein:vir:93 81 -LNDHEK--MVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNI 157 (387) T ss_pred -cchhhH--HHHHHHHHHHHHhhhhhhhhhhhhhHHHHHhhccCcCCCCceeechhHHHHHHHHHHhhchhhhheeeeec Confidence 000000 01111111111111 111112222334556677788899999999999999999999999999999988 Q ss_pred ccCceeEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 151 EGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQ 229 (394) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a 229 (394) ++ ..+|+.......+.|++||+..++++++|+++++.+++++++++||+|+++|| .++++||.++|+++++++++.. T Consensus 158 ~~--~~~p~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~ 235 (387) T protein:vir:93 158 KG--LEIPRVSYTLDDDDFITDVETAKELKLKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKD 235 (387) T ss_pred CC--ceEEEEeecCCccccccCcccccccccccceeeeeheeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHh Confidence 75 46777655456788999999999999999999999999999999999999998 5899999999999999998765 Q ss_pred Hh-hccCCCccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHH-HhhccCCcccccCc Q lcl|NC_019933. 230 LL-NGNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIE-LLKDTQGRYILGNP 307 (394) Q Consensus 230 ~l-~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~-~lkd~~G~~~~~~~ 307 (394) +| .|+|++ .|.|++..++... .++..++++|++++..++..+..+++|+||+.++..+. +++|.+| +++. T Consensus 236 ~~~~g~g~g-~p~g~l~~~~~~~----v~~~~~~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~~~~~~~d~~~-~~~~-- 307 (387) T protein:vir:93 236 ALAVSPKSG-LDHMSFYNGSVKE----VEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTT-NFFD-- 307 (387) T ss_pred HhhcCCCcc-ccceeeecccccc----ccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCC-cccc-- Confidence 44 555544 4667766544332 23455689999999999999999999999999987764 5555555 4442 Q ss_pred ccCCCceeecceEEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEE Q lcl|NC_019933. 308 QGTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGS 387 (394) Q Consensus 308 ~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~ 387 (394) +.+.+|+|+||++++.++ .++||||+.++..+. ++.+... .++.++.+.|++..|+|+++.+|+||++++ T Consensus 308 --~~~~~llG~PV~~~~~~~--~~~~GDf~~~~~~~~--~~~~~~~----~~~~~~~~~~~~~~r~d~~v~~~eA~~~l~ 377 (387) T protein:vir:93 308 --TPAEKVFGKPVVFTDAAV--KPIVGDFNYFGINYD--GTTYDTD----KDVKKGEYLFVLTAWYDQQRTLDSAFRIAK 377 (387) T ss_pred --cCCccccccceEEecCCC--ceeeeehhhhheehh--hheeeec----ccccCCceeEEEEeeeCceeechhheEEEE Confidence 234589999999999865 479999998765543 4444433 235689999999999999999999999999 Q ss_pred ecCCCCC Q lcl|NC_019933. 388 LAAAAGT 394 (394) Q Consensus 388 ~~~a~~~ 394 (394) +++++++ T Consensus 378 ~k~~~~~ 384 (387) T protein:vir:93 378 AKENTGS 384 (387) T ss_pred eecCCCC Confidence 9888888 No 52 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=100.00 E-value=7.8e-59 Score=339.15 Aligned_cols=365 Identities=15% Similarity=0.135 Sum_probs=268.9 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhh----hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchh Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQE----LNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQHIS 76 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~----~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~ 76 (394) |++|+++++++++..+++++.+++...+.+ ...++.++++.+..++++++.+++.++.................+. T Consensus 1 M~~l~~l~~~~~~~~~e~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~ 80 (394) T protein:vir:10 1 MDKLQTLFNEVSAKCADLNAQLNAKLQDENASVDDFQKIKDDLTAAKARRDAINDQIKDLEAENKANSDPDKPVDNAQPN 80 (394) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHhhhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhhhhccc Confidence 999999999999999999998876554432 2334455666677777777776665544333222111111111111 Q ss_pred hhhhh-----hhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccc Q lcl|NC_019933. 77 IGQQF-----VNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTME 151 (394) Q Consensus 77 ~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~ 151 (394) ..+.. .....+..+.+.. ...........++++||++||+++..+|++.++++++|+++|+++|++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~l~~~---------~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~ 151 (394) T protein:vir:10 81 GTDLKKKPIDAKKKAINDFIHSH---------GKVIDNAAGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVT 151 (394) T ss_pred ccchhhhHHHHHHHHHHHHHhcc---------chhhhhhhcccccccCceeccHHHHHHHHHHHHhhhhhhhhceeeecc Confidence 10000 0111111111111 111123345566788999999999999999999999999999999999 Q ss_pred cCceeEEEEcCcccccceecCCccccc-cccceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 152 GNTLEYVRETGFTNAAAPVAEGAQKPE-SSLRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQ 229 (394) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~eg~~~~~-~~~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a 229 (394) ++++++|+.....+.+.|++|++.+|+ ++++|++|++.+++++++++||+|+++|+ ++++++|.+.|+++++.++|.+ T Consensus 152 ~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~ 231 (394) T protein:vir:10 152 TPKGTYPILKRATDRFSSVAELAENPALAEPEFEQVDWSVSTYRGAIPLSEEAIADSAVDLTSLVGQSINEKSVNTYNAM 231 (394) T ss_pred CCceEEEEEecCCCccccccccccccccccccceeEEeeeeeeEeeehhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHH Confidence 988999988766677889999999996 67999999999999999999999999998 5899999999999999999999 Q ss_pred HhhccCCCccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCccc Q lcl|NC_019933. 230 LLNGNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQG 309 (394) Q Consensus 230 ~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~ 309 (394) |++|.|++.+. ...+..+++++.+++.......+ +++|+|||++|.+|++|+|++|+|+|++... T Consensus 232 il~g~g~~~~~--------------~~~~~~~~d~l~~~~~~~~~~~~-~a~~vmn~~~~~~l~~lkd~~G~~i~~~~~~ 296 (394) T protein:vir:10 232 IAPVLQSFTAK--------------ATTTDTLVDSLKHILNVDLDPAY-SRALVVTQSLFNTLDTLKDKNGRYLLHDASD 296 (394) T ss_pred Hhhcccccccc--------------cccccccHHHHHHHHHhhhhhhc-cCEEEecHHHHHHHHHhhccCCCeeeecccc Confidence 99998765431 12334567778776643333333 5799999999999999999999999975432 Q ss_pred -----CCCceeecceEEEcCCC--Cc----CceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEe Q lcl|NC_019933. 310 -----TLAPTLWGLPVVATQAM--AV----GQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVY 378 (394) Q Consensus 310 -----~~~~~l~G~pv~~~~~~--p~----~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~ 378 (394) +.+++|+|+||++++.. |. ..++||||+.+|.++++.++++.++++.+ |. +.++++.|+|++++ T Consensus 297 ~~~~~~~~~~L~G~PV~~~~~~~~~~~~~~~~i~~gd~s~~~~~~~~~~~~v~~~~~~~--~~---~~~~~~~r~d~~~~ 371 (394) T protein:vir:10 297 SITDGTAKGTVLGVPVYVVGDALLGSAAGDQKAFVGDLKRGVLFADRQQVTLAWEDSKI--YG---RYLGAAFRFGVKQA 371 (394) T ss_pred ccccCCcccccccceeEEecccccCCCCCceEEEEeeccccEEEEeecceEEEEecccc--cc---eeEEEEEEeccEEe Confidence 23358999999986643 32 23799999998999999999998866433 33 46899999999999 Q ss_pred cccceEEEEecCCCCC Q lcl|NC_019933. 379 RPESFIKGSLAAAAGT 394 (394) Q Consensus 379 ~~~a~~~l~~~~a~~~ 394 (394) +|+||+.++++++++- T Consensus 372 ~~~ai~~~~~~~~~~~ 387 (394) T protein:vir:10 372 DSNAGYFVTNTDAASG 387 (394) T ss_pred ccccEEEEEeecccCC Confidence 9999999997766554 No 53 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=100.00 E-value=1.1e-58 Score=338.34 Aligned_cols=367 Identities=14% Similarity=0.093 Sum_probs=262.9 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhh--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccch--h Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQ--ELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQHI--S 76 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~--~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~--~ 76 (394) .++|+||++++.++.++++++.++...+. ....+++++++++.+++++++++++..+.................. . T Consensus 11 ~~~l~el~~~~~~~~~e~r~~~e~~~~~~~~~~~~e~~~~~~~l~~ei~~l~e~~~~~~~~~~~~~~~~~~~~~~~~~~~ 90 (400) T protein:vir:38 11 KKQLDEKRSALPAMKTELRSLLEGEDSEENLKKAEGVRAKYDKAGKEIKDLEEKRDLYEAALKGNEQSSGKKPDHPEEHS 90 (400) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhhhh Confidence 22355555566666666665555443322 2345556677777777777777776554444333322222111111 0 Q ss_pred hhhhh---hhH-----HHHHHH---HHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhc Q lcl|NC_019933. 77 IGQQF---VNS-----DSFKAM---AESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLL 145 (394) Q Consensus 77 ~~~~~---~~~-----~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~ 145 (394) ..... ... ...... ............+.++.... ..++++||++||+++.+.|++.+++.++|++++ T Consensus 91 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~ 168 (400) T protein:vir:38 91 YRDALNAYLHTRGRNTDGVNFEKTDVGTFAVLRAVPTDASDAVNA--GVKAADAASTIPETISNTPQRELQTVVDLKPFT 168 (400) T ss_pred HHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhh--cccccCCcccccHHHHHHHHHHHHhhhhhhhcc Confidence 00000 000 000000 00000001111112222222 234566899999999999999999999999999 Q ss_pred cccccccCceeEEEEcCcccccceecCCccccc-cccceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHH Q lcl|NC_019933. 146 AQGTMEGNTLEYVRETGFTNAAAPVAEGAQKPE-SSLRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLE 223 (394) Q Consensus 146 ~~~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~-~~~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~ 223 (394) +++|++++++++|+.....+.+.|++|++.+|+ ++++|++|++.+++++++++||+|+++|+ +++++||.+.++++++ T Consensus 169 ~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~ 248 (400) T protein:vir:38 169 NVFQASTQKGTYPTVANATTKMVTVAELEKNPAMAKPEFKPVNWSVETYRQALPVSQESIDDSAIDLVGLIAQNGQQIKV 248 (400) T ss_pred eeEeccCcceEEEEEecCCCccccccccccccccccccceeeEeehhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHH Confidence 999999988999998866677888999988886 68999999999999999999999999998 5899999999999999 Q ss_pred HHHHHHHhhccCCCccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCccc Q lcl|NC_019933. 224 VVEENQLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYI 303 (394) Q Consensus 224 ~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~ 303 (394) .++|.++++|.|++.. .+..+++++.+++..... ...+++|+|||.+|..|+++||++|+|+ T Consensus 249 ~~~~~~i~~~~~~~~~-----------------~~~~~~~~~~~~~~~~~~-~~~~a~~v~~~~~~~~l~~lkd~~G~~i 310 (400) T protein:vir:38 249 NTTNGAVATLLKGFTA-----------------KTISSVDDLKHINNVDLD-PAYSRVIIASQSFYNFLDTVKDGNGRYL 310 (400) T ss_pred HHHHHhhhhccccccc-----------------cccccHHHHHHHHHhhhh-hhhCcEEEEcHHHHHHHHHhhccCCCee Confidence 9999999998775432 223356777777654333 3347899999999999999999999999 Q ss_pred ccC-cccCCCceeecceEEEcCCCCcC-----ceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEE Q lcl|NC_019933. 304 LGN-PQGTLAPTLWGLPVVATQAMAVG-----QFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAV 377 (394) Q Consensus 304 ~~~-~~~~~~~~l~G~pv~~~~~~p~~-----~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v 377 (394) |++ ..++.+++|+|+||++++.+|.+ .++||||+.+|.++++.++++.++++.+ +...+|+++|+|+++ T Consensus 311 ~~~~~~~~~~~~l~G~pv~~~~~~~~~~~g~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~r~d~~~ 385 (400) T protein:vir:38 311 LQDSILTPSGKSVLGMPIAVVSDDTLGAAGEAHAFLGDIKRAILFANRADFMVRWVDDQI-----YGQFLQAGMRFGVSV 385 (400) T ss_pred eecCcCCCCccccccceeEEecccccCCCCceEEEEEeccccEEEEeecceEEEEecccc-----cceeEEEEEEeccEE Confidence 964 45566679999999999988754 3799999999999999999998876543 346899999999999 Q ss_pred ecccceEEEEecCCC Q lcl|NC_019933. 378 YRPESFIKGSLAAAA 392 (394) Q Consensus 378 ~~~~a~~~l~~~~a~ 392 (394) .+|+||+.++++++| T Consensus 386 ~~~~a~~~l~~~~~a 400 (400) T protein:vir:38 386 ADEKAGYFLTYTPKA 400 (400) T ss_pred ecccceEEEEeecCC Confidence 999999999999999 No 54 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=100.00 E-value=1.4e-58 Score=337.68 Aligned_cols=367 Identities=15% Similarity=0.147 Sum_probs=266.4 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhh----hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchh Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQE----LNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQHIS 76 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~----~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~ 76 (394) |++|+++.+++++..+++++.+++...+.+ ...++.++++++.+++++++++++.++................... T Consensus 1 meeL~~~~~~~~~~~~e~~~~l~~~~~~~~~~~e~~~~l~~ei~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (389) T protein:vir:10 1 MDKLQTLFNDVSAKCADLNAQLNAKLQDENASVDDFQKIKDDLTAAKARRDAINDQIKALEAEKPAEPKTEPKDDGSKKG 80 (389) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccc Confidence 999999999999999998888776554432 2334445566666666666666665544332221111111111100 Q ss_pred hhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCcee Q lcl|NC_019933. 77 IGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLE 156 (394) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 156 (394) .................... ... .....+..+++++||++||+++...|++.++++++|+++|+++|+++++++ T Consensus 81 ~~~~~~~~~~~~~~~~~~lr---~~~---~~~~~~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~ 154 (389) T protein:vir:10 81 TDLSKKPIDAKKKAINDFIH---SHG---KVIDATSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGT 154 (389) T ss_pred cccchhHHHHHHHHHHHHhh---cch---hhhhhhcccccCCcceeehHHHHHHHHHHHHhhhhHHhhcceeeccCCeeE Confidence 00000000000111111111 011 112334456667899999999999999999999999999999999998899 Q ss_pred EEEEcCcccccceecCCccccc-cccceeeEEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_019933. 157 YVRETGFTNAAAPVAEGAQKPE-SSLRFDLVQTSAKVIAHWMKASRQILSDSA-QLQSFINARLLRGLEVVEENQLLNGN 234 (394) Q Consensus 157 ~~~~~~~~~~~~~~~eg~~~~~-~~~~~~~i~~~~~k~~~~~~is~e~l~~s~-~~~~~i~~~la~a~~~~~d~a~l~g~ 234 (394) +|+....+..+.|++|++.+|+ ++++|++|++.+++++++++||+|+++|+. ++++||.+.|++++++++|.+|++|. T Consensus 155 ~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~ 234 (389) T protein:vir:10 155 YPILKRATDRFSSVAELAENPKLAEPEFNKVDWSVATYRGAIPLSEEAIADSAVDLTALVGQSIKEKSVNTYNAMIAPVL 234 (389) T ss_pred EEEEecCCCccccccccccccccccccceeeeeeheeeEeeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhhh Confidence 9998876667778888888885 789999999999999999999999999984 89999999999999999999999987 Q ss_pred CCCccccccccccccccccccccccchHHHHHHHHH-HhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCccc---- Q lcl|NC_019933. 235 GTGQNLLGLLPQATAFAAPITVANATAVDRLRLALL-QAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQG---- 309 (394) Q Consensus 235 g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~-~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~---- 309 (394) +++.+ .+..+..+++++.+++. .++..+ +++|+|||.+|..|+++||++|+|+|++... T Consensus 235 ~~~~~--------------~~~~~~~~~d~l~~~~~~~~~~~~--~a~~~~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~ 298 (389) T protein:vir:10 235 QSFTA--------------KKTTTDTLVDSLKHILNVDLDPAY--SRALVVTQSLFNTLDTLKDKNGRYLLHDASDSITD 298 (389) T ss_pred ccccc--------------ccccccccHHHHHHHHHhhhhhhh--CcEEEecHHHHHHHHHhhccCCCeeeecCcccccc Confidence 65432 12234556788877764 444443 6799999999999999999999999975432 Q ss_pred -CCCceeecceEEEcCC-CCcC-----ceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccc Q lcl|NC_019933. 310 -TLAPTLWGLPVVATQA-MAVG-----QFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPES 382 (394) Q Consensus 310 -~~~~~l~G~pv~~~~~-~p~~-----~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a 382 (394) +.+++|+|+||++++. ++.. .++||||+++|.++++.++++.++++.+ | ...+|++.|+|+++.+|+| T Consensus 299 ~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~--~---~~~~~~~~r~d~~~~~~~a 373 (389) T protein:vir:10 299 GTAKGTILGVPVYVVGDTLLGSLAGDQKAFVGDLKRGVLFTDRQQVTLAWEDSKI--Y---GKYLGAAFRFGVQKADSKA 373 (389) T ss_pred cccccccccceeEEecccccCCCCCceEEEEeeccccEEEEeecceEEEeecccc--c---cceEEEEEEeccEEecccc Confidence 3345899999987654 3322 3799999999999999999998876432 3 3578999999999999999 Q ss_pred eEEEEecCCCCC Q lcl|NC_019933. 383 FIKGSLAAAAGT 394 (394) Q Consensus 383 ~~~l~~~~a~~~ 394 (394) |+.+++++++++ T Consensus 374 ~~~~~~~~~~~~ 385 (389) T protein:vir:10 374 GYFVTNTDVPGS 385 (389) T ss_pred eEEEEeeccCCC Confidence 999998866666 No 55 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=100.00 E-value=6.2e-58 Score=334.20 Aligned_cols=369 Identities=13% Similarity=0.128 Sum_probs=265.3 Q ss_pred Cc-hHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc---ccchh Q lcl|NC_019933. 1 MS-DINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGD---VQHIS 76 (394) Q Consensus 1 Mk-~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~---~~~~~ 76 (394) |+ +|+||++++.++.+++.+..++.... ..++..++++++..+++.++++++++++++...+....... ..... T Consensus 2 ~~~~l~el~~~l~e~~~~i~~~~~e~~~~--~~~~~~~~~~~l~~eie~l~~ei~~l~~~~~~~e~~~e~~~~~~~~~~~ 79 (394) T protein:vir:97 2 FEEKIKEIKATIADLNNTIVTKTAQVKNA--LESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKE 79 (394) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHh--hchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccc Confidence 44 59999988888888877665544332 23344455666666666666666665555444332211100 00000 Q ss_pred h-hhhhhhHHHHHHHHHHhh------hhhhhhHHH-------HHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHH Q lcl|NC_019933. 77 I-GQQFVNSDSFKAMAESGG------QRGRAEINI-------KAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIR 142 (394) Q Consensus 77 ~-~~~~~~~~~~~~~~~~~~------~~~~~~~~~-------~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~ 142 (394) . .........+..+.+... .......+. ..........+..+||+++|+++.+.|++.+++.++|+ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~liP~~~~~~ii~~~~~~~~l~ 159 (394) T protein:vir:97 80 VTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLK 159 (394) T ss_pred cchhhHHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHhhhhhhhhccccccccccccChHHHHHHHHHHhhhhhhhh Confidence 0 000000000011100000 000000000 11112222345567899999999999999999999999 Q ss_pred HhccccccccCceeEEEEcCcccccceecCCccccc-cccceeeEEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHH Q lcl|NC_019933. 143 SLLAQGTMEGNTLEYVRETGFTNAAAPVAEGAQKPE-SSLRFDLVQTSAKVIAHWMKASRQILSDSA-QLQSFINARLLR 220 (394) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~-~~~~~~~i~~~~~k~~~~~~is~e~l~~s~-~~~~~i~~~la~ 220 (394) ++++++|++++++++|+....++.++|++||+.+|+ ++++|+.|++.+++++++++||+|+++|+. ++++||.+.|++ T Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~i~is~ell~ds~~~~~~~i~~~la~ 239 (394) T protein:vir:97 160 PFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQ 239 (394) T ss_pred hhceeeeccCcceEEEEEecCCCccceecccccccccccccceeEEeehhheeeehhhHHHHHhhhhHHHHHHHHHHHHH Confidence 999999999988999998776678899999999997 569999999999999999999999999985 899999999999 Q ss_pred HHHHHHHHHHhhccCCCccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCC Q lcl|NC_019933. 221 GLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQG 300 (394) Q Consensus 221 a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G 300 (394) ++++++|.+|++|.+++.+ .+..+++++.+++..... ...++.|+|||.+|..|++++|++| T Consensus 240 ~~~~~~~~~i~~g~~~~~~-----------------~~~~~~~~~~~~~~~~~~-~~~~a~~v~n~~~~~~l~~lkd~~G 301 (394) T protein:vir:97 240 IKVNTTNDAIAKVLKSFTT-----------------KTVKNLDEIKALLNGGFD-PAYNVSLIVSQSFYQTLDTLKDGNG 301 (394) T ss_pred HHHHHHHHHHhhccccccc-----------------cccccHHHHHHHHHhhhh-hhhCCEEEEcHHHHHHHHHhhccCC Confidence 9999999999988654322 233457788777755433 3446789999999999999999999 Q ss_pred cccccC-cccCCCceeecceEEEcC--CCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEE Q lcl|NC_019933. 301 RYILGN-PQGTLAPTLWGLPVVATQ--AMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAV 377 (394) Q Consensus 301 ~~~~~~-~~~~~~~~l~G~pv~~~~--~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v 377 (394) +|+|++ +.++.+++|+|+||++++ .++.+.++||||+.+|.++++.+++++++++. .+...+|++.|+|+++ T Consensus 302 ~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~r~d~~v 376 (394) T protein:vir:97 302 RYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNE-----IYGQYLQAVLRFGVSK 376 (394) T ss_pred CeeeecCcCCCCCceeccceeEEecccccCCccEEEeeccccEEEEEecceEEEEeccc-----ccceeEEEEEEEccEE Confidence 999964 455566799999999854 57778899999999899999999999876643 3346899999999999 Q ss_pred ecccceEEEEecCCCCC Q lcl|NC_019933. 378 YRPESFIKGSLAAAAGT 394 (394) Q Consensus 378 ~~~~a~~~l~~~~a~~~ 394 (394) .+|+||+.+++++++-- T Consensus 377 ~~~~a~~~~~~~~~~~p 393 (394) T protein:vir:97 377 VDDKAGYYVTFTPEPLP 393 (394) T ss_pred ecccceEEEEecccccC Confidence 99999999999755544 No 56 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=100.00 E-value=7.3e-58 Score=333.82 Aligned_cols=390 Identities=18% Similarity=0.180 Sum_probs=261.7 Q ss_pred Cch-HHHHHHHHHHHHHHHHHHHHHHHhhhh--hhHH----HHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHhh---- Q lcl|NC_019933. 1 MSD-INAINSTLANISDSLKAHADRAVKDQE--LNAS----VRAKVDELLMAQGALQADLKAAQ---QRIAEVEGN---- 66 (394) Q Consensus 1 Mk~-i~el~~~~~~~~~~~k~~~e~~~~~~~--~~~e----~~~~~~~~~~~~~~l~~~i~~~e---~~~~~~~~~---- 66 (394) |+. |++|++++.++.+++++++++...+.+ ..++ .+++++++.++++.+++.+++++ ......... T Consensus 8 m~~~i~eL~e~r~~l~~e~~~l~d~ak~e~~~~~~~~e~~e~~a~~~el~~ei~~le~~~~~~~~~~~~~~~~~~~~~~~ 87 (477) T protein:vir:84 8 LRALRAAAVEAVATLKAERQAIADGAKAEERAALSADETAEFRAKSASIKAELDKVEDLDEQIRELESEIERSGKLEAET 87 (477) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhh Confidence 553 677777777777777777776654422 2333 33344455555554443333222 222111000 Q ss_pred --ccc--ccccchhhhhhhhhHHHHHHH----------------HHHhhhhhhhhHHH---HH-HHhhcccccCCcCccc Q lcl|NC_019933. 67 --GAG--GDVQHISIGQQFVNSDSFKAM----------------AESGGQRGRAEINI---KA-AITSLSTNADGSAGAT 122 (394) Q Consensus 67 --~~~--~~~~~~~~~~~~~~~~~~~~~----------------~~~~~~~~~~~~~~---~~-~~~~~~~~~~~~~g~~ 122 (394) ... .....+............... .+............ +. ......+++++.||.+ T Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~l 167 (477) T protein:vir:84 88 KTVRKATVEVNEALTYEKGNGQSYFRDLAMQTVGMADEPAKERLRRHMVDVESDKEIRKIAKVGEEYRDLDRNGGTGGYA 167 (477) T ss_pred hhhcccccccccchhhhhhHHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHhhhhhccccccCCCccee Confidence 000 000000000000000000000 00000000000000 00 1122223445667888 Q ss_pred cchhh-hhHHHhhhhhhhhHHHhcccccccc--CceeEEEEcCcccccceecCCcc-----ccccccceeeEEeeeeeEE Q lcl|NC_019933. 123 VQTTR-LPGILELPQRRMTIRSLLAQGTMEG--NTLEYVRETGFTNAAAPVAEGAQ-----KPESSLRFDLVQTSAKVIA 194 (394) Q Consensus 123 ip~~~-~~~ii~~~~~~~~l~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~eg~~-----~~~~~~~~~~i~~~~~k~~ 194 (394) +|+++ .+.|++.+++.++|+++++.+++++ +.+++|+..+....++|++||+. +|+++++|+++++++++++ T Consensus 168 v~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~~~f~~i~~~~~k~~ 247 (477) T protein:vir:84 168 VPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTGTSTAIQAADNAALTAPSAHEVDLTDGFVQANVKTIA 247 (477) T ss_pred eccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecCcceeeeeccCcccccccccccccceeeEEEeeeeEE Confidence 88775 6789999999999999999998865 46789998777777899999864 5788999999999999999 Q ss_pred EeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCCcccccccccccccccccccc------ccchHHHHHH Q lcl|NC_019933. 195 HWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVA------NATAVDRLRL 267 (394) Q Consensus 195 ~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~------~~~~~~~i~~ 267 (394) ++++||+|+++|+ +++++||.++|+++++.++|.+||+|+|+++.|.||++.++....+.+.. ....++++.+ T Consensus 248 ~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~~p~Gi~~~~~~~~~~~~~~~~t~~~~~~~~~~i~~ 327 (477) T protein:vir:84 248 GQQGIAIQLLDQAAVSVDEFVFRDLAADYANKLNVQVISGTGSNNQVVGVRATAGITQVTATSAGSALEKHQIIYQKIAD 327 (477) T ss_pred eeeHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHhccCCCCCccceeeeccccccccccccccchhhHHHHHHHHHH Confidence 9999999999997 59999999999999999999999999999989999998876654433322 1234666777 Q ss_pred HHHHhhhhcCCC-CeeEeCHHHHHHHHHhhccCCcccccCcc--------------cCCCceeecceEEEcCCCCcC--- Q lcl|NC_019933. 268 ALLQAQLAEFPA-TGIVLNPADWAGIELLKDTQGRYILGNPQ--------------GTLAPTLWGLPVVATQAMAVG--- 329 (394) Q Consensus 268 ~~~~~~~~~~~~-~~~~~~~~~~~~l~~lkd~~G~~~~~~~~--------------~~~~~~l~G~pv~~~~~~p~~--- 329 (394) ++..+...+..+ +.|+|||.+|..|++++|++|+|+|++.. ....++|+|+||++++.||++ T Consensus 328 ~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G~pVv~s~~~p~~~~~ 407 (477) T protein:vir:84 328 AIQRVHTSRFLEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHGLPVVTDPTLPTTLGT 407 (477) T ss_pred HHhhccccccCCccEEEEcHHHHHHHHHhhccCCCeeeecCcccccccccccccccccccchhcccceEecCcccccccc Confidence 777787777654 47999999999999999999999996532 223468999999999999964 Q ss_pred -----ceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEecc-EEecccceEEEEecC-CCCC Q lcl|NC_019933. 330 -----QFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLAL-AVYRPESFIKGSLAA-AAGT 394 (394) Q Consensus 330 -----~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~-~v~~~~a~~~l~~~~-a~~~ 394 (394) .++||||+. ++++. .++.++++++.+ ..++.+.|+.+.++++ .+++|+||++++.++ ++|| T Consensus 408 ~~d~~~i~~gd~~~-~~i~~-~~~~~~~~~~~~--~~~~~~~~~v~~~~~~~~~r~~~afv~~t~~~~~~~~ 475 (477) T protein:vir:84 408 GTDQDVIHVLRASD-LALFE-SSVRMRALQETR--AENLSVLLQVYGYLAFTAARFPQSVVEIGGTALTAPT 475 (477) T ss_pred cCCcceEEEEEece-EEEEe-eceeEEeccccc--cccceeeeeehhhhhhhhhccccceEEeecccccccc Confidence 478999986 44554 467787777655 4577888888888886 445699999999654 5666 No 57 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=100.00 E-value=2.2e-57 Score=331.20 Aligned_cols=354 Identities=15% Similarity=0.145 Sum_probs=250.9 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhcccccccchhhhh Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLK-AAQQRIAEVEGNGAGGDVQHISIGQ 79 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~-~~e~~~~~~~~~~~~~~~~~~~~~~ 79 (394) ||+|+|+.++..++.+++.+.++..... .+..+.++++. ..++.++. +.+....... T Consensus 1 ik~L~e~~~e~~e~~~~~~~~~~~~~~~----~e~~~~~~~~~---~~~~~~~~~~~~~~~~~~~--------------- 58 (390) T protein:vir:40 1 MNNLDKKDSETLNISTAFLNAIKEGATE----AEQVTAFTNMA---EQIQNNIIAQARKEVNREM--------------- 58 (390) T ss_pred CchHHHHHHHHHHHHHHHHHHHhhhhhH----HHHHHHHHHHH---HHHHHHHHHHHHHHHHHHH--------------- Confidence 8888888877766555443333221111 11122222221 11111111 0000000000 Q ss_pred hhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEE Q lcl|NC_019933. 80 QFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVR 159 (394) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~ 159 (394) .......... .......+++........+++++||++||+++.++|++.+++.++|+++|+++|++++...+|+ T Consensus 59 -----~~~~~~~~~~-~~~l~~~~r~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~~~~i~~ 132 (390) T protein:vir:40 59 -----NDNNVLASRG-ANALTSDESKYYNEVIAGNGFAGVTALLPPTVFERVFEDLTVEHPLLSKINFVNTTATTEWIIS 132 (390) T ss_pred -----HHHHHHHhcC-chhccHHHHHHHHHHHhccCcccCcccccHHHHHHHHHHHHhhhhhhhhceeeecCCceeEEEE Confidence 0000000000 0001111222222333445667899999999999999999999999999999999998899999 Q ss_pred EcCcccccceecCCccccc-cccceeeEEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhccCCC Q lcl|NC_019933. 160 ETGFTNAAAPVAEGAQKPE-SSLRFDLVQTSAKVIAHWMKASRQILSDSA-QLQSFINARLLRGLEVVEENQLLNGNGTG 237 (394) Q Consensus 160 ~~~~~~~~~~~~eg~~~~~-~~~~~~~i~~~~~k~~~~~~is~e~l~~s~-~~~~~i~~~la~a~~~~~d~a~l~g~g~~ 237 (394) ..+ .+.++|++|++.+++ ++++|+++++.+++++++++||+|+++|++ ++++||+++|+++++.++|.+||+|+|++ T Consensus 133 ~~~-~~~a~~~~E~~~~~~~~~~~f~~i~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~i~~~~~~a~l~G~G~~ 211 (390) T protein:vir:40 133 VGD-VATAWWGPLCAEIKEVLDNGFDKIQTGMYKLSAYIPVCNAMLDLGPSWLDQYVRTILGEAMALGLEAGIVNGSGKD 211 (390) T ss_pred EcC-CcceeeeccccccCccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhcccCCC Confidence 876 468999999988874 689999999999999999999999999996 79999999999999999999999999965 Q ss_pred ccccccccccccccccc---cccccchHHHHHHHHHHhhh-------hcCCCCeeEeCHHHH-H---HHHHhhccCCccc Q lcl|NC_019933. 238 QNLLGLLPQATAFAAPI---TVANATAVDRLRLALLQAQL-------AEFPATGIVLNPADW-A---GIELLKDTQGRYI 303 (394) Q Consensus 238 ~~~~Gi~~~~~~~~~~~---~~~~~~~~~~i~~~~~~~~~-------~~~~~~~~~~~~~~~-~---~l~~lkd~~G~~~ 303 (394) .|.||++..+..+... ......+..+..++...+.. ....+++|+||+.++ . .+++++|.+|+|+ T Consensus 212 -~P~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~~~~~~~d~~G~~v 290 (390) T protein:vir:40 212 -QPIGMMRDLNNVTAGEHPVKTATPLTDLTPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIYAATSYMTPQGVWV 290 (390) T ss_pred -ccceeeeccccccccccccccccccchhhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHHHHhhccCCCCccc Confidence 5789988654333211 12222334444444433333 234678899999884 3 4457899999998 Q ss_pred ccCcccCCCceeecceEEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccce Q lcl|NC_019933. 304 LGNPQGTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESF 383 (394) Q Consensus 304 ~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~ 383 (394) +.. .++|+||++++.||+++++||||+. |.++++.++++.+++ +.+|.+|++.||++.|+|+++++++|| T Consensus 291 ~~~-------~~~g~pvv~~~~~p~~~i~~Gd~s~-~~i~~~~~~~v~~~~--~~~f~~~~~~~r~~~r~dg~v~~~~A~ 360 (390) T protein:vir:40 291 TGI-------LPVPLEIVQSVAVPVGKAVAGRAKD-YFMGIGSEQVIRTST--EYRLLDDETLYYAKQYANGRPKDNSSF 360 (390) T ss_pred ccc-------CCCceeEEEcCCCCCCcEEEEeece-EEEEeecceEEEecc--hhhhhcCcEEEEEEEEeCCEEecccce Confidence 742 3479999999999999999999997 678899999998876 446899999999999999999999999 Q ss_pred EEEEecCCCCC Q lcl|NC_019933. 384 IKGSLAAAAGT 394 (394) Q Consensus 384 ~~l~~~~a~~~ 394 (394) +++++++++|+ T Consensus 361 ~~l~~~~~~~~ 371 (390) T protein:vir:40 361 LVFDITGLEGS 371 (390) T ss_pred EEEEeeccCCC Confidence 99999999997 No 58 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=100.00 E-value=6.9e-57 Score=328.49 Aligned_cols=385 Identities=14% Similarity=0.150 Sum_probs=275.0 Q ss_pred Cc---hHHHHHHHHHHHHHHHHHHHHHHHhhh-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc----- Q lcl|NC_019933. 1 MS---DINAINSTLANISDSLKAHADRAVKDQ-ELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGD----- 71 (394) Q Consensus 1 Mk---~i~el~~~~~~~~~~~k~~~e~~~~~~-~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~----- 71 (394) |+ .|+++++++.++.++++++.++...+. .+.++..++++++..++++++.+|+++++............. T Consensus 193 ~~~~e~i~~l~~~ra~~~~~~~~l~~~a~~~g~~l~aee~~~~d~l~aei~~l~~~i~r~e~~e~~~a~~a~pv~~~~~~ 272 (645) T protein:vir:93 193 MNIGEQIKSFENKRAALAASLEEVMTKAAEEGRTLDVEEEEHYDNTAAEIRQVDAHLKRLRELEAGKAATAQPVKQAGNG 272 (645) T ss_pred cchhhhhhhhhHHHHHHHHHhhhhhhhHhhhccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc Confidence 33 378888899888888888777665554 467888899999999999999999877654333221111000 Q ss_pred -ccc-h-----hhhhhhhhHHHHHH------------------HHHHhhhhhhhhHHHHHHHh-hcccccCCcCccccch Q lcl|NC_019933. 72 -VQH-I-----SIGQQFVNSDSFKA------------------MAESGGQRGRAEINIKAAIT-SLSTNADGSAGATVQT 125 (394) Q Consensus 72 -~~~-~-----~~~~~~~~~~~~~~------------------~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~g~~ip~ 125 (394) ... . .......+...+.. +...+..........+..+. ...+++...||+++|+ T Consensus 273 ~~~~~~~~~~~~~~~~~~kg~~f~~~~~al~~~~g~~~~a~e~a~~~~~~~~~~~~~~~~a~~~~~~~~~~~~Gg~~vp~ 352 (645) T protein:vir:93 273 NVAAVASAPVIRVEQKLDKGIGFARFAKSLAAAKGVRSEALEVARRQYPDDSRLHHVLKSAVGAGTTTDPQWAGSLSEYQ 352 (645) T ss_pred ccccccccccccchhhhhhhhhHHHHHHHHHhcccchhHHHHHHHhhcccchhhhhhhhhhhhccccccccccCCccCch Confidence 000 0 00000001111111 11111111112222232322 2333445568899999 Q ss_pred hhhhHHHhhhhhhhhHHHhccccccc----cCceeEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhH Q lcl|NC_019933. 126 TRLPGILELPQRRMTIRSLLAQGTME----GNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASR 201 (394) Q Consensus 126 ~~~~~ii~~~~~~~~l~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ 201 (394) .+..+||+.+++.++++++......+ .+.+++|++++. +.++|++||+.+|+++++|++++++++|+++++++|+ T Consensus 353 ~~~~~ii~~l~~~svv~~l~~~~~~~~~~~~~~~~ip~~t~~-~~a~wv~Eg~~~~~s~~~f~~v~l~~~kla~~~~iS~ 431 (645) T protein:vir:93 353 EYAQDFIDYLRPQTIIGRFGQGGIPALRQVPFNIRVHAQVSG-GAAGWVGEGKTKPLTKFDFESITFSHAKVSAIAVLTE 431 (645) T ss_pred hhHHHHHHhhhhhhhHHhhccccccccccccCceeeeeeecC-cceEEeccCccccccccceeEEEEeeEEEEEeehhHH Confidence 99999999999999999986543221 135789998875 6889999999999999999999999999999999999 Q ss_pred HHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCC---ccccccccccccccccccccccchHHHHHHHHHHhhhhcC Q lcl|NC_019933. 202 QILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGTG---QNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEF 277 (394) Q Consensus 202 e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~---~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~ 277 (394) |+++++ ++++++|.+.|+++++.++|.+||+|+|++ ..|.|++..... ..++.....++..++..+..++. T Consensus 432 ell~ds~~~~~~~i~~~l~~aia~~~d~a~l~g~g~~~~~~~p~gi~~~~~~-----~~~~~~~~~d~~~~~~~~~~a~~ 506 (645) T protein:vir:93 432 ELIRFSSPAADALVRNALAEAVVARLDTDFVDPKKAAVADVSPASITHDVKG-----TASSGNPDADAEAAFGQFVAANL 506 (645) T ss_pred HHHhhchHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccCCccccceeccccc-----cccccchHHHHHHHHHHHHhcCC Confidence 999987 699999999999999999999999987764 347777653221 12233355677777777665543 Q ss_pred --CCCeeEeCHHHHHHHHHhhccCCcccccCcccCCCceeecceEEEcCCCCcCceEEeeccceEEEEeecceEEEEecc Q lcl|NC_019933. 278 --PATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATE 355 (394) Q Consensus 278 --~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~ 355 (394) .+++|+|||.++..|+++||++|+|+|+.. ...+++|+|+||++++.+|++ +++|||+. ++++...++.+.++.+ T Consensus 507 ~~~~a~~vmn~~~~~~L~~lkd~~G~~~~~~~-~~~~~tL~G~PV~~s~~vp~~-~~~gd~s~-~~ig~~~~v~i~~s~~ 583 (645) T protein:vir:93 507 QPTGAVWLMSSTNALALSMRKNALGQKEYPDM-TLLGGSFQGLPVIVSQYVGDQ-LVLVNAPD-IYLADDGGVAVDMSRE 583 (645) T ss_pred CccccEEEEcHHHHHHHHhccccCCceeecCC-CCCCceeeceeeEEeccCCcc-eeEecccc-EEEEEecceEEEeecc Confidence 456899999999999999999999999543 444579999999999999975 67899986 4556667777666543 Q ss_pred cc--------------------hhhhcCcEEEEEEEEeccEEecccceEEEE---ecCCCCC Q lcl|NC_019933. 356 NQ--------------------DDFIKNMVTILAEERLALAVYRPESFIKGS---LAAAAGT 394 (394) Q Consensus 356 ~~--------------------~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~---~~~a~~~ 394 (394) .. .+|++|++++|+.+|+||+++||+||++|+ +.++.|- T Consensus 584 a~~~~~~~~~~~~~~~~~~~~v~lf~~d~vaira~~r~d~~~~~p~a~~~lt~~~~g~~~~~ 645 (645) T protein:vir:93 584 ASLEMQSEPTGDSTTPSPVELVSMFQTGSVAIRAERWINWRRRRTAAVAVITGVNYGSASGG 645 (645) T ss_pred eeEEEeecccccccccccccchhHhhcCceEEEEEEEEcceeeCccceEEEecccCCcccCC Confidence 22 249999999999999999999999999997 3333333 No 59 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=100.00 E-value=4.5e-57 Score=329.52 Aligned_cols=373 Identities=12% Similarity=0.071 Sum_probs=256.9 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhh-------hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc--- Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQ-------ELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGG--- 70 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~-------~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~--- 70 (394) || |+||+++++++.++++...++..... +.....+.+++++.++++++++++++.+............. T Consensus 1 Mk-i~elk~el~~~~~el~~~~~elr~~~~~~~~~~~el~~~~~e~~~~~~ei~el~~~l~~~~~~~~~~~e~~~~~~~~ 79 (437) T protein:vir:10 1 MK-IEKLKKDLATKTAELNTKKAEIRSFTESEDKTIDEVKAGMTEIKEKEDEIKEIRSNIEVLEQASALKVEEKRDDSDL 79 (437) T ss_pred CC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99 88999888887777766554433221 12222333445555555555555544332221111000000 Q ss_pred ---------c--c---cchhhhhhhh---------hHHHHHHHH-----HHhhhhhhhhHH--------HHHHHhhcccc Q lcl|NC_019933. 71 ---------D--V---QHISIGQQFV---------NSDSFKAMA-----ESGGQRGRAEIN--------IKAAITSLSTN 114 (394) Q Consensus 71 ---------~--~---~~~~~~~~~~---------~~~~~~~~~-----~~~~~~~~~~~~--------~~~~~~~~~~~ 114 (394) . . ..+....... ......... ...........+ ........... T Consensus 80 ~~~e~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~ 159 (437) T protein:vir:10 80 VAPELEENSADNEEDDPEKLKTETKSEAEKDKKTVKDEEKRDAGGLQDMKLKVGGEIADKKVTAFADYLKTGEVRDVTGI 159 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHhHHHHHHHHHHHHhhhhhhHHHHHhhhhhhhhhc Confidence 0 0 0000000000 000000000 000000000000 01112234445 Q ss_pred cCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEcCcccccceecCCccccc-cccceeeEEeeeeeE Q lcl|NC_019933. 115 ADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTNAAAPVAEGAQKPE-SSLRFDLVQTSAKVI 193 (394) Q Consensus 115 ~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~-~~~~~~~i~~~~~k~ 193 (394) ++.++|++||+++...|.. +++.++|+.++++++++++.+++|+.....+.+.|++|++..|+ ++++|++|++.++++ T Consensus 160 ~~~~~g~lvp~~~~~~i~~-~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~e~~~~~~~~v~~~~~k~ 238 (437) T protein:vir:10 160 ALKDGKVIIPETILTPEKE-VHQFPRLGSLVRTESVTTTTGKLPIFNNSTDLLTAHTEYGQTTKNATPVITPILWDLKTY 238 (437) T ss_pred ccccccccchHHHHHHHHH-hhhhhhhhhcceeEeeccCceeeEEeeccccccccccccccccccccccceeeeeehhhe Confidence 6678999999999876654 57888999999999999888999998776678899999999996 568999999999999 Q ss_pred EEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccccccchHHHHHHHH-HH Q lcl|NC_019933. 194 AHWMKASRQILSDSA-QLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLAL-LQ 271 (394) Q Consensus 194 ~~~~~is~e~l~~s~-~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~-~~ 271 (394) +++++||+|+++|++ ++.+||.+.|+++++.++|.+|++|+|++.+. .++..+++++.+++ .. T Consensus 239 ~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~g~g~~~~~---------------~~~~~~~~~~~~~~~~~ 303 (437) T protein:vir:10 239 TGGYVFSQELISDSSYDWQAELQSRLIELRDNTDDSLIITALTDGIKK---------------TTSTYLLGDLKKVLNVT 303 (437) T ss_pred eeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc---------------cccccchhhHHHHHHhh Confidence 999999999999985 89999999999999999999999998765421 12233456666655 37 Q ss_pred hhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCc-ccCCCceeecceEEEcCCC--CcC-----ceEEeeccceEEEE Q lcl|NC_019933. 272 AQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNP-QGTLAPTLWGLPVVATQAM--AVG-----QFLTGAFDAGAQVF 343 (394) Q Consensus 272 ~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~-~~~~~~~l~G~pv~~~~~~--p~~-----~~~~gd~~~~~~~~ 343 (394) ++..+..+++|+|||.+|..|++++|++|+|+|.+. ..+.+++|+|+||++++++ |.. .++||||+.+|.++ T Consensus 304 l~~~~~~~~~~~~~~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~ 383 (437) T protein:vir:10 304 LKPQDSAAASIVMSQSAYNLFDMATDAMGRPLLQPNVTAATGYTLLGKTVVIVDDKLFPSASAGDVNIVVAPLKKAVINF 383 (437) T ss_pred hhhhhhcCCEEEEcHHHHHHHHHhhccCCCeeeccCccCCCCcccccceeEEecccccCCcCCCceEEEEeeccccEEEE Confidence 888888899999999999999999999999999654 4455679999999997754 432 37999999999999 Q ss_pred eecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 344 DRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 344 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) ++.++++.++.. +..+.+.+++.+|+||++++|+||++|+.+.++-| T Consensus 384 ~r~~~~~~~~~~----~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~~~~~ 430 (437) T protein:vir:10 384 KLTEITGQFQDT----YDIWYKQLGIFLRQNVVQASKDLIVNLTGKLKAVT 430 (437) T ss_pred eeeceEEEEecc----cccccceeeEEEEEccEEecccceEEEEeeccccc Confidence 999999987643 44566789999999999999999999986654444 No 60 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=100.00 E-value=1e-55 Score=322.02 Aligned_cols=367 Identities=14% Similarity=0.140 Sum_probs=251.3 Q ss_pred Cc--------hHHHHHHHHHHHHHHHHHHHHHHHhh---------hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 1 MS--------DINAINSTLANISDSLKAHADRAVKD---------QELNASVRAKVDELLMAQGALQADLKAAQQRIAEV 63 (394) Q Consensus 1 Mk--------~i~el~~~~~~~~~~~k~~~e~~~~~---------~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~ 63 (394) |+ ++++++++++++.++.+++.++.... .+..++..++++++..++..+++++++++...... T Consensus 1 m~~k~~~l~~~~~el~~~l~eL~e~~~~l~~~~~el~~~~ee~~~~e~~~~~~~~~~~l~~~i~~l~~~i~~~~~~~~~l 80 (397) T protein:vir:96 1 MALKQLILNKQIKERSSEIDKLLSQRSDLEKQENDLERALEEAKTDEEISTVSDSADDLEKQVKDLDEKIAELQKEKQDL 80 (397) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22 23333333333333222222111110 12234455566667777777777776666555444 Q ss_pred Hhhcccccccch--hhhhhhhhHHHHHH-HHHHhhhhhhhhHHHHH-HHhhcccccCCcCccccchhhhhHHHhhhhhhh Q lcl|NC_019933. 64 EGNGAGGDVQHI--SIGQQFVNSDSFKA-MAESGGQRGRAEINIKA-AITSLSTNADGSAGATVQTTRLPGILELPQRRM 139 (394) Q Consensus 64 ~~~~~~~~~~~~--~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~ 139 (394) ............ .............. ...............+. ........+..++|+++|+++...|++ +.+.. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~-~~~~~ 159 (397) T protein:vir:96 81 EDELAKAADPTDQKPKDGEKRKMKKFKVTEEELAEKRSAINAFVKSKGAEKRDGFTSVEGGALIPQELLQPQLE-PKDIV 159 (397) T ss_pred HHHHHhhhhhhhhhhHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHhhhhhhhhcccccccccchhHHHHHHHHH-hhhhh Confidence 332211111110 00000000000000 00000001111111111 112233445677899999999999997 57888 Q ss_pred hHHHhccccccccCceeEEEEcCcccccceecCCccccc-cccceeeEEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHH Q lcl|NC_019933. 140 TIRSLLAQGTMEGNTLEYVRETGFTNAAAPVAEGAQKPE-SSLRFDLVQTSAKVIAHWMKASRQILSDSA-QLQSFINAR 217 (394) Q Consensus 140 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~-~~~~~~~i~~~~~k~~~~~~is~e~l~~s~-~~~~~i~~~ 217 (394) +++++|++++++++++.+|+....+..+.|+.|++..|+ ++++|++|++.++++++++++|+++++|++ +++++|.+. T Consensus 160 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~~~~~~~~~s~ell~ds~~~l~~~i~~~ 239 (397) T protein:vir:96 160 DLSKYVRSVPVNSASGKFPVISKSGSKMATVQQLEKNPQLANPKMVEIDYSVATRRGYIPISQEMIDDASYDVTGLIADE 239 (397) T ss_pred hHHHhhhhccccccceeEEEEeccCCccccccccccccccccccccceeecHhHhhcchhhHHHHHhhhHHHHHHHHHHH Confidence 999999999999988999987766667778889988886 689999999999999999999999999985 899999999 Q ss_pred HHHHHHHHHHHHHhhccCCCccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhc Q lcl|NC_019933. 218 LLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKD 297 (394) Q Consensus 218 la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd 297 (394) ++++++.++|.+|++|+|.+.+ .+..+++++.+++......+ .+++|+|||.+|..|++++| T Consensus 240 l~~~~~~~~~~~i~~g~g~~~~-----------------~~~~~~d~~~~~~~~~~~~~-~~a~~v~n~~~~~~l~~lkd 301 (397) T protein:vir:96 240 IQDQSLNTKNADIAAVLKTATA-----------------KSVVGVDGLKDLINKEIKKV-YDVKLFISASMYSELDKLKD 301 (397) T ss_pred HHHHHHHHHHHHHhhccccccc-----------------ccccchHHHHHHHHHhhhhh-cCcEEEEcHHHHHHHHHhhc Confidence 9999999999999999876542 23346788888876655444 47899999999999999999 Q ss_pred cCCcccccC-cccCCCceeecceEEEcCCCCc------CceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEE Q lcl|NC_019933. 298 TQGRYILGN-PQGTLAPTLWGLPVVATQAMAV------GQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAE 370 (394) Q Consensus 298 ~~G~~~~~~-~~~~~~~~l~G~pv~~~~~~p~------~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~ 370 (394) ++|+|+|++ ..++.+++|+|+||++++..+. ..++||||+.+|.++++.++++..+++. .+.+.+|++ T Consensus 302 ~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~ 376 (397) T protein:vir:96 302 KNGRYLLQDSITAASGKQLLGKEVVVLDDDVIGKSVGNVVGFIGDAKAFASFFDRKQVSVSWVDNN-----IYGQLLAGI 376 (397) T ss_pred cCCCeEeccCccCCCcccccccceEEecccccCCCCCceEEEEeehhcceEeEeecceEEEEeccc-----ccceeEEEE Confidence 999999964 4555667999999998664322 2479999999899999999999876643 235689999 Q ss_pred EEeccEEecccceEEEEecCC Q lcl|NC_019933. 371 ERLALAVYRPESFIKGSLAAA 391 (394) Q Consensus 371 ~~~d~~v~~~~a~~~l~~~~a 391 (394) +|+|+++++|+||+++++++| T Consensus 377 ~r~d~~~~~~~a~~~~~~~~a 397 (397) T protein:vir:96 377 IRYDVKATDKKAGFYVTFTIG 397 (397) T ss_pred EEEccEEecccceEEEEeecC Confidence 999999999999999999999 No 61 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=100.00 E-value=5e-57 Score=329.23 Aligned_cols=282 Identities=17% Similarity=0.159 Sum_probs=249.2 Q ss_pred HHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEcCcccccceecCCccccccccceee Q lcl|NC_019933. 106 AAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDL 185 (394) Q Consensus 106 ~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~ 185 (394) --.++....+++++|.+||+++.++|++.+++.++|+++++++|++++..++|+.++ +.+.|++|++.+|+++++|++ T Consensus 1 ~g~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~--~~a~~v~E~~~~~~~~~~f~~ 78 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKAVPMTKPEEEFTFMSG--VGAFWVDEAERIQTSKPTFTK 78 (299) T ss_pred CCcCCCcccccCCCceecchhHHHHHHHHHHhcchhhhhceeeecCCCcEEEEEEcC--CceeeeecCccccccccceeE Confidence 113344455667788999999999999999999999999999999999999998754 568899999999999999999 Q ss_pred EEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccccccchHHH Q lcl|NC_019933. 186 VQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDR 264 (394) Q Consensus 186 i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~ 264 (394) +++.+++++++++||+|+++++ .+++++|.+.|++++++++|.++|+|+|+++ |.|+++.+..... ....+..++++ T Consensus 79 v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~-~~gil~~~~~~~~-~~~~~~~~~~~ 156 (299) T protein:vir:41 79 AKMRSKKMGVIIPTTKENLNYSVTNFFSLMQAEIVEAFYKKFDQAVFTGVESPY-NWNILKSATDASN-LVEETANKYDD 156 (299) T ss_pred EEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcc-cccccccccccce-eeccccccHHH Confidence 9999999999999999999988 5899999999999999999999999998754 5688775443332 23445668999 Q ss_pred HHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcccCCCceeecceEEEcCCCCcCc----eEEeeccceE Q lcl|NC_019933. 265 LRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPTLWGLPVVATQAMAVGQ----FLTGAFDAGA 340 (394) Q Consensus 265 i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~----~~~gd~~~~~ 340 (394) +.+++.++...+..+++|+|||+++.+|++++|++|+|+|.+....+.++|+|+||++++.+|.+. ++||||+. + T Consensus 157 l~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~gdfs~-~ 235 (299) T protein:vir:41 157 LNEAIGLIEAEDLEPNGIATIRKQRVKYRSTKDGNGMPIFNTATSNGVDDVLGLPIAYTPKYTFGDKDISELVGDWNQ-A 235 (299) T ss_pred HHHHHHhhhcccCCcCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCceecceeeEEecccCCCCCceEEEEEeccc-E Confidence 999999999999999999999999999999999999999987777777899999999999999876 89999987 5 Q ss_pred EEEeecceEEEEecccc------------hhhhcCcEEEEEEEEeccEEecccceEEEEecCCC Q lcl|NC_019933. 341 QVFDRWAARVEVATENQ------------DDFIKNMVTILAEERLALAVYRPESFIKGSLAAAA 392 (394) Q Consensus 341 ~~~~~~~~~i~~~~~~~------------~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~ 392 (394) .++.+.+++++++++.+ ..|++|++.+|++.|+||++.+|+||++++.+++. T Consensus 236 ~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~~l~~~aa~ 299 (299) T protein:vir:41 236 YYGILRGVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFEVGFMVVKDEAFSAVQPKAGN 299 (299) T ss_pred EEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccCC Confidence 68899999999887653 24899999999999999999999999999999999 No 62 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=100.00 E-value=5.4e-57 Score=329.06 Aligned_cols=290 Identities=14% Similarity=0.135 Sum_probs=246.8 Q ss_pred hhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEcCcccccceecCCcccccc Q lcl|NC_019933. 100 AEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTNAAAPVAEGAQKPES 179 (394) Q Consensus 100 ~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~ 179 (394) +..+. ..+.....+.++|.++|+++.++|++.+++.++|++++++++++++.+++|+..+. +.+.|++||+.+|++ T Consensus 1 m~~~~---~~a~~~~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~-~~a~~v~Eg~~~~~~ 76 (330) T protein:vir:77 1 MAGST---VPSTQVALTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARKVPMGPTGISIPHWTGA-VSASWTGEAERKPIT 76 (330) T ss_pred Ccccc---cchhhccccCCCcceechhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcCC-cceeEecCCCccccc Confidence 11111 11222333455677888899999999999999999999999999988999998764 678899999999999 Q ss_pred ccceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCCcccccccccccccc------- Q lcl|NC_019933. 180 SLRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFA------- 251 (394) Q Consensus 180 ~~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~------- 251 (394) +++|++++++++|++++++||+|+++++ .+++++|.++|++++++++|.++|+|+|+++++.|+++...... T Consensus 77 ~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~~~~~~~~~~~ 156 (330) T protein:vir:77 77 KGSFGKQELEPVKITTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETTKVVSLADTNL 156 (330) T ss_pred cceeeEEEEeEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccccccccccccceeecccc Confidence 9999999999999999999999999988 58999999999999999999999999999999999987653322 Q ss_pred ccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcccC------CCceeecceEEEcCC Q lcl|NC_019933. 252 APITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQGT------LAPTLWGLPVVATQA 325 (394) Q Consensus 252 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~------~~~~l~G~pv~~~~~ 325 (394) .+........++++.+++..+...+..+++|+|||+++..|+++||++|+|+|++.... .+++|+|+||++++. T Consensus 157 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~PV~~~~~ 236 (330) T protein:vir:77 157 TTASGPQGNAYLAVNNALSLLVNSGKKWTGTLLDNVTEPILNTAVDGNGRPLFVESTYTEQVGAIREGRILGRPTYVADN 236 (330) T ss_pred cccccccchhHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHHhccCCceeecCccccccccccCCceecceeeEEecc Confidence 22223345568899999999999999999999999999999999999999999754322 345899999999999 Q ss_pred CCcCc------eEEeeccceEEEEeecceEEEEecccc----------------hhhhcCcEEEEEEEEeccEEecccce Q lcl|NC_019933. 326 MAVGQ------FLTGAFDAGAQVFDRWAARVEVATENQ----------------DDFIKNMVTILAEERLALAVYRPESF 383 (394) Q Consensus 326 ~p~~~------~~~gd~~~~~~~~~~~~~~i~~~~~~~----------------~~~~~~~~~~~~~~~~d~~v~~~~a~ 383 (394) ||++. +++|||+. +.++++.+++++++++.+ ..|++|++.||++.|+|+++.+|+|| T Consensus 237 ~p~~~~~~~~~~~~gd~s~-~~i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~ 315 (330) T protein:vir:77 237 VVNGTVGNRVVGVMGDFSQ-VIWGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLWQHNMVAVRCEAEFAFMVNDKDAF 315 (330) T ss_pred ccCCCCCCccEEEEEecce-EEEEEecCcEEEEeecceeeecccccccccccccchhhcCcEEEEEEEEeccEEecccce Confidence 99764 78999997 468888999999877643 45999999999999999999999999 Q ss_pred EEEEecCCCCC Q lcl|NC_019933. 384 IKGSLAAAAGT 394 (394) Q Consensus 384 ~~l~~~~a~~~ 394 (394) ++++.++++.+ T Consensus 316 ~~i~~~~~~~~ 326 (330) T protein:vir:77 316 VKLTDQVAGTD 326 (330) T ss_pred EEEEeccCCcC Confidence 99999998888 No 63 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=100.00 E-value=1.3e-56 Score=326.96 Aligned_cols=303 Identities=15% Similarity=0.081 Sum_probs=252.3 Q ss_pred hhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCcee Q lcl|NC_019933. 77 IGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLE 156 (394) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 156 (394) |.+........ ....................++++|.+||+++.++|++.+++.++|+++++++|++++.++ T Consensus 1 ~~~~~~~~~~~--------~~f~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ 72 (324) T protein:vir:97 1 MEQTQKLKLNL--------QHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKK 72 (324) T ss_pred CccchhHHHHH--------HHHHHhhhhhhhhccccccccCCCcceechhHHHHHHHHHHhhcchhhhcceeeccCCceE Confidence 11111111111 1111122222334445555667789999999999999999999999999999999999999 Q ss_pred EEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_019933. 157 YVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNG 235 (394) Q Consensus 157 ~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g 235 (394) +|+.++. +.+.|++||+.+|+++++|+++++.++|++++++||+|+++++ .+++++|.+.|++++++++|+++|+|+| T Consensus 73 ip~~~~~-~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~g 151 (324) T protein:vir:97 73 FTFWADK-PGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQG 151 (324) T ss_pred EEEEecC-cceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhccCC Confidence 9998764 6789999999999999999999999999999999999999998 5899999999999999999999999999 Q ss_pred CCccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcccCCCcee Q lcl|NC_019933. 236 TGQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPTL 315 (394) Q Consensus 236 ~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~l 315 (394) ++..+.|+++...... ....+..++++|.+++..+...++.+++|+|||.++..|++++|++|+|+|.. ..+++| T Consensus 152 ~~~~~~gi~~~~~~~~--~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~g~~~~~~---~~~~tl 226 (324) T protein:vir:97 152 NNPFGKSIAQSIEKTN--KVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD---RNSDTL 226 (324) T ss_pred CCccCccccccccccc--eeccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhcCCCceeecC---CCCccc Confidence 9988999887544322 33456678999999999999999999999999999999999999999999853 345689 Q ss_pred ecceEEEcCCC--CcCceEEeeccceEEEEeecceEEEEecccc------------hhhhcCcEEEEEEEEeccEEeccc Q lcl|NC_019933. 316 WGLPVVATQAM--AVGQFLTGAFDAGAQVFDRWAARVEVATENQ------------DDFIKNMVTILAEERLALAVYRPE 381 (394) Q Consensus 316 ~G~pv~~~~~~--p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~~~~~~~~~~~~~~~d~~v~~~~ 381 (394) +|+||+.++.+ +.+.++||||++ +.++++.+++++++++.. .+|++|++.||++.|+|+++.+|+ T Consensus 227 ~G~PV~~~~~~~~~~~~~~~gd~~~-~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~ 305 (324) T protein:vir:97 227 DGLPVVNLKSSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDK 305 (324) T ss_pred cceeeEeecCCCCCcceEEEEeccc-EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEeccc Confidence 99999998865 456789999997 557888999999987642 469999999999999999999999 Q ss_pred ceEEEEecCCCCC Q lcl|NC_019933. 382 SFIKGSLAAAAGT 394 (394) Q Consensus 382 a~~~l~~~~a~~~ 394 (394) ||++++.+.+..+ T Consensus 306 a~~~l~~~~~~~~ 318 (324) T protein:vir:97 306 AFAKLVPADKKTD 318 (324) T ss_pred ceEEEEeccCCCC Confidence 9999999888777 No 64 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=100.00 E-value=1.4e-56 Score=326.84 Aligned_cols=279 Identities=15% Similarity=0.118 Sum_probs=240.2 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEcCcccccceecCCccccccccceeeEEeee Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSA 190 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~ 190 (394) |.. +++++|.++|+++..+|++.+++.++++++++++|++++..++|+..+. +.+.|++||+.+|+++++|+++++++ T Consensus 1 ma~-~t~~~G~lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~~p~~~~~-~~a~wv~Eg~~~~~s~~~f~~v~l~~ 78 (300) T protein:vir:95 1 MSE-AQLSKGNLFNPELVTKVINKVKGHSSIAKLSPQKPIPFNGQREFVFDFD-SDIDIVAENGKKTHGGVSLDPVTIVP 78 (300) T ss_pred Ccc-cccCCcceechhhHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEecC-cceEEeeCCcccccccccceeeEeee Confidence 333 3445678999999999999999999999999999999888999998775 67899999999999999999999999 Q ss_pred eeEEEeehhhHHHHH---HH-HHHHHHHHHHHHHHHHHHHHHHHhhcc----CCCccccccccccccccccccccccchH Q lcl|NC_019933. 191 KVIAHWMKASRQILS---DS-AQLQSFINARLLRGLEVVEENQLLNGN----GTGQNLLGLLPQATAFAAPITVANATAV 262 (394) Q Consensus 191 ~k~~~~~~is~e~l~---~s-~~~~~~i~~~la~a~~~~~d~a~l~g~----g~~~~~~Gi~~~~~~~~~~~~~~~~~~~ 262 (394) +|++++++||+|+++ ++ .+++++|.++|++++++++|.++|+|+ |.+..+.|.....+.........+...+ T Consensus 79 ~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (300) T protein:vir:95 79 LKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKKVTQTVPFKDTNPD 158 (300) T ss_pred EEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccccceeecccccchH Confidence 999999999999995 33 589999999999999999999999984 4444555655554544444445566778 Q ss_pred HHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCc-ccCCCceeecceEEEcCCCCcCc------eEEee Q lcl|NC_019933. 263 DRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNP-QGTLAPTLWGLPVVATQAMAVGQ------FLTGA 335 (394) Q Consensus 263 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~-~~~~~~~l~G~pv~~~~~~p~~~------~~~gd 335 (394) +++.+++..+...++.+++|+|||.++.+|+++||++|+|+|+.. .+..+++|+|+||++++.+|.+. +++|| T Consensus 159 ~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~GD 238 (300) T protein:vir:95 159 ESMEDAVGMIDGSERDITGAILDPIFTTALSKMKNAEGGKLYPELAWGGVPDAINGLAVDKNRTVSYSQTDPKNTAIVGD 238 (300) T ss_pred HHHHHHHHHhhhcCCCccEEEECHHHHHHHHHhhccCCCeeccCccccCCCceecceeeEEecCCCCCCCCCccEEEEee Confidence 999999999999999999999999999999999999999999643 44567899999999999998653 67899 Q ss_pred ccceEEEEeecceEEEEecccc------hhhhcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|NC_019933. 336 FDAGAQVFDRWAARVEVATENQ------DDFIKNMVTILAEERLALAVYRPESFIKGSLAAA 391 (394) Q Consensus 336 ~~~~~~~~~~~~~~i~~~~~~~------~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a 391 (394) |+.++.++.+.+++++++++.. .+|++|++.+|++.|+||++.+|+||++++.++- T Consensus 239 f~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~g 300 (300) T protein:vir:95 239 FETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYIGWGIMDAASFARIVKTGG 300 (300) T ss_pred ccceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEeecceeecccceEEEecCCC Confidence 9988888889999999987644 2599999999999999999999999999876655 No 65 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=100.00 E-value=4.7e-55 Score=318.43 Aligned_cols=385 Identities=14% Similarity=0.113 Sum_probs=251.8 Q ss_pred CchHHHHHHHHHHHHHHHHH---HHHHHHhhhh------hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKA---HADRAVKDQE------LNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGD 71 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~---~~e~~~~~~~------~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~ 71 (394) ..+|.+|++++.++.++.++ .+++...+++ .......+.+++.+++..++.+|+++++++.+.+....... T Consensus 16 ~~~l~el~e~~~~l~k~~~el~~~l~ea~~~ee~~~~ee~i~~l~~~~~el~e~~~~l~~ei~~le~el~e~~~~~~~~~ 95 (466) T protein:vir:80 16 KAALAELLEQEKALQKRSEELEAAIDEANTDEEIAVVEDEINKLEGEKTELEEKKSKLEGEIKELENELEQLNNKEPKNN 95 (466) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccC Confidence 22334444444333322222 2222222211 11222233334444555555566666655555544332222 Q ss_pred ccchhhhhhhhhH-----HHHHHHHHHhh----hhhhhhHHHHHH----Hhhcc-cccCCcCccccchhhhhHHHhhhhh Q lcl|NC_019933. 72 VQHISIGQQFVNS-----DSFKAMAESGG----QRGRAEINIKAA----ITSLS-TNADGSAGATVQTTRLPGILELPQR 137 (394) Q Consensus 72 ~~~~~~~~~~~~~-----~~~~~~~~~~~----~~~~~~~~~~~~----~~~~~-~~~~~~~g~~ip~~~~~~ii~~~~~ 137 (394) ............. .......+... .......+.+.. ..... ..+.++++.++|+++.+.|++.+++ T Consensus 96 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~vP~~~~~~i~~~l~~ 175 (466) T protein:vir:80 96 SEPAQVSGARTQQFVGGETRMKGFFRNMPYEQRAALIARSEVKEFLAQVRTLAQQKRAVSGAELTIPDVMLELLRDNMHR 175 (466) T ss_pred chhHHHHhhhhhHHhhHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhccccccccHHHHHHHHHhhhh Confidence 2111111100000 00000000000 000011111111 11111 2233456789999999999999999 Q ss_pred hhhHHHhccccccccCceeEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHHH-HHHHHHHH Q lcl|NC_019933. 138 RMTIRSLLAQGTMEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSA-QLQSFINA 216 (394) Q Consensus 138 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~-~~~~~i~~ 216 (394) .++|++++++.++++. .++|+... .+.+.|++||+.+|+++++|++|++.+++++++++||+|+++|++ ++++||.. T Consensus 176 ~~~l~~~~~v~~~~g~-~~~~~~~~-~~~a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~ 253 (466) T protein:vir:80 176 YSKLISKVRLRPLKGT-ARQNIAGA-IPEGVWTEAVANLNELSLSFSQIEVDGYKVGGFIPIPNSTLEDSDLNLADEILD 253 (466) T ss_pred hhhhhhheeeeecCce-eEeeeecC-CcceeecccccccccccccccceeecceeeeeehhhhHHHHhcchHHHHHHHHH Confidence 9999999999998764 67787654 467889999999999999999999999999999999999999995 89999999 Q ss_pred HHHHHHHHHHHHHHhhccCCCccccccccccccccccccc------cccc-----------------hHHHHHHHHHHhh Q lcl|NC_019933. 217 RLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITV------ANAT-----------------AVDRLRLALLQAQ 273 (394) Q Consensus 217 ~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~------~~~~-----------------~~~~i~~~~~~~~ 273 (394) +|+++++.++|.+||+|+|+++ |.||++.....+..... .... .+.++...+.... T Consensus 254 ~la~~~~~~~~~ail~G~G~~~-P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 332 (466) T protein:vir:80 254 AIGQAIGFALDKAILYGTGTKM-PVGIVTRLAQTTQPPNWGTKAPAWTNLSTTNLLKIDPTGKSAEEFFSELVLKLSKAR 332 (466) T ss_pred HHHHHHHHHHhhheeeccCCCC-cceeeecccccccccccccccccccccchhhhhhhhhhccchhhHHHHHHHHHHhhh Confidence 9999999999999999999876 67998764432221110 0001 1222222223333 Q ss_pred hhcCCC-CeeEeCHHHHHHHHHhh---ccCCcccccCcccCCCceeecceEEEcCCCCcCceEEeeccceEEEEeecceE Q lcl|NC_019933. 274 LAEFPA-TGIVLNPADWAGIELLK---DTQGRYILGNPQGTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAAR 349 (394) Q Consensus 274 ~~~~~~-~~~~~~~~~~~~l~~lk---d~~G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~ 349 (394) ..+..+ ..|+||+.++..|..++ +.+|.+++... .++.++|+||+.++.||++++++|||+. |.++++.+++ T Consensus 333 ~~~~~~~~~w~~~~~~~~~l~~~~~~~~~~g~~~~~~~---~~~~i~G~pvv~s~~~~~~~~~~g~~~~-y~i~~r~~~~ 408 (466) T protein:vir:80 333 ANYSNGMKFWAMSSNTHAVLMSKAITFNSAGALVASLN---NTMPIVGGDIVILDFIPDNDIIGGYGSL-YLLAERADIK 408 (466) T ss_pred ccccCCceeEEecchhHHHhhcccccccCCccccccCC---CcccccccceeecCccCccceeeecccc-EEEEeecceE Confidence 444444 46999999999998887 56777776432 2335999999999999999999999986 6799999999 Q ss_pred EEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 350 VEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 350 i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) +..+.+ ..|.+|++.||+..|+|+++++++||++++++..++. T Consensus 409 i~~~~~--~~f~~d~~~~r~~~r~dg~~~~~~afv~~~~~~~~~~ 451 (466) T protein:vir:80 409 LAQSEH--VRFIEDQTVFKGTARYDGKPVFGEGFVAVNIANANPT 451 (466) T ss_pred EEechh--hhhhcCcEEEEEEEEEccEEeccCceEEEEecCCCcc Confidence 988764 5689999999999999999999999999999887776 No 66 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=100.00 E-value=3.6e-56 Score=324.53 Aligned_cols=303 Identities=14% Similarity=0.082 Sum_probs=251.0 Q ss_pred hhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCcee Q lcl|NC_019933. 77 IGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLE 156 (394) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 156 (394) +.+........+ ............++.....++.++.+||+++.++|++.+++.++|+++++++|++++.++ T Consensus 1 ~~~~~~~~~~~~--------~f~~~~~~~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ 72 (324) T protein:vir:93 1 MEQTQKLKLNLQ--------HFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKK 72 (324) T ss_pred CchhHHHHHHHH--------HHHHhhhhhhhcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceE Confidence 211111111111 111122222233444455556677899999999999999999999999999999998899 Q ss_pred EEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_019933. 157 YVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNG 235 (394) Q Consensus 157 ~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g 235 (394) +|+.++. +.+.|++||+.+|+++++|+++++.++|++++++||+|+++++ .+++++|.++|++++++++|.++|+|+| T Consensus 73 ip~~~~~-~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~g 151 (324) T protein:vir:93 73 FTFWADK-PGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQG 151 (324) T ss_pred EEEEecC-cceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcCCC Confidence 9998764 6789999999999999999999999999999999999999998 5899999999999999999999999999 Q ss_pred CCccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcccCCCcee Q lcl|NC_019933. 236 TGQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPTL 315 (394) Q Consensus 236 ~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~l 315 (394) ++..+.|++....... ....+..+++++.+++..++..+..+++|+|||.+|..|++++|++|+|++.. ..+++| T Consensus 152 ~~~~~~~~~~~~~~~~--~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~G~~~~~~---~~~~~l 226 (324) T protein:vir:93 152 NNPFGKSIAQSIEKTN--KVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD---RNSDSL 226 (324) T ss_pred CCCcCccccccccccc--eeccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhCCCCCeeecC---CCCCcc Confidence 9888888877544322 22345668999999999999999999999999999999999999999999853 345689 Q ss_pred ecceEEEcCC--CCcCceEEeeccceEEEEeecceEEEEecccc------------hhhhcCcEEEEEEEEeccEEeccc Q lcl|NC_019933. 316 WGLPVVATQA--MAVGQFLTGAFDAGAQVFDRWAARVEVATENQ------------DDFIKNMVTILAEERLALAVYRPE 381 (394) Q Consensus 316 ~G~pv~~~~~--~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~~~~~~~~~~~~~~~d~~v~~~~ 381 (394) +|+||+.++. .+.+.+++|||+. +.++.+.+++++++++.. .+|++|++.+|+++|+||++.+|+ T Consensus 227 ~G~PVv~~~~~~~~~~~i~~gdfs~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~v~~~~ 305 (324) T protein:vir:93 227 DGLPVVNLKSSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDK 305 (324) T ss_pred cceeeEeecCCCCCcceEEEEecce-EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEeccc Confidence 9999998776 4566789999997 567888999999987743 469999999999999999999999 Q ss_pred ceEEEEecCCCCC Q lcl|NC_019933. 382 SFIKGSLAAAAGT 394 (394) Q Consensus 382 a~~~l~~~~a~~~ 394 (394) ||++|+.+.+.+| T Consensus 306 a~~~l~~a~~~~~ 318 (324) T protein:vir:93 306 AFAKLVPADKRTD 318 (324) T ss_pred ceEEEecccccCC Confidence 9999998777776 No 67 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=100.00 E-value=4.1e-56 Score=324.23 Aligned_cols=303 Identities=14% Similarity=0.082 Sum_probs=250.7 Q ss_pred hhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCcee Q lcl|NC_019933. 77 IGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLE 156 (394) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 156 (394) |.+......... ............++.....++.+|.+||+++.++|++.+++.++|+++++++|++++.++ T Consensus 1 ~~~~~~~~~~~~--------~~~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ 72 (324) T protein:vir:96 1 MEQTQKLKLNLQ--------HFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKK 72 (324) T ss_pred CCcchhhhHHHH--------HHHHHhhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceE Confidence 111111111111 111111122223344455567788999999999999999999999999999999998899 Q ss_pred EEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_019933. 157 YVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNG 235 (394) Q Consensus 157 ~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g 235 (394) +|+.++. +.+.|++||+.+|+++++|+++++.++|++++++||+|+++++ .+++++|.++|++++++++|.++|+|+| T Consensus 73 ~p~~~~~-~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g 151 (324) T protein:vir:96 73 FTFWADK-PGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQG 151 (324) T ss_pred EEEEecC-cceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 9998764 6789999999999999999999999999999999999999988 5899999999999999999999999999 Q ss_pred CCccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcccCCCcee Q lcl|NC_019933. 236 TGQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPTL 315 (394) Q Consensus 236 ~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~l 315 (394) ++..+.|+........ ....+..++++|.+++..+...+..+++|+|||.++..|++++|++|+|++.. ..+++| T Consensus 152 ~~~~~~gi~~~~~~~~--~~~~~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~d~~G~~~~~~---~~~~~l 226 (324) T protein:vir:96 152 NNPFGKSIAQSIEKTN--KVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD---RNSDSL 226 (324) T ss_pred CCCcCccccccccccc--eeccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCCeeecC---CCCCcc Confidence 9988988877544332 22345668999999999999999999999999999999999999999999853 345689 Q ss_pred ecceEEEcCC--CCcCceEEeeccceEEEEeecceEEEEecccc------------hhhhcCcEEEEEEEEeccEEeccc Q lcl|NC_019933. 316 WGLPVVATQA--MAVGQFLTGAFDAGAQVFDRWAARVEVATENQ------------DDFIKNMVTILAEERLALAVYRPE 381 (394) Q Consensus 316 ~G~pv~~~~~--~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~~~~~~~~~~~~~~~d~~v~~~~ 381 (394) +|+||+.++. ++++.+++|||++ +.++.+.+++++++++.. .+|++|++.||+++|+||++.+|+ T Consensus 227 ~G~PV~~~~~~~~~~~~~~~gd~~~-~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~ 305 (324) T protein:vir:96 227 DGLPVVNLKSSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDK 305 (324) T ss_pred cceeeEeeCCCCCCcceEEEEecce-EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEeccc Confidence 9999999776 4566799999997 567889999999987643 469999999999999999999999 Q ss_pred ceEEEEecCCCCC Q lcl|NC_019933. 382 SFIKGSLAAAAGT 394 (394) Q Consensus 382 a~~~l~~~~a~~~ 394 (394) ||++|+.+.+.++ T Consensus 306 A~~~l~~a~~~~~ 318 (324) T protein:vir:96 306 AFAKLVPADKRTD 318 (324) T ss_pred ceEEEecccccCC Confidence 9999998777776 No 68 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=100.00 E-value=4.1e-56 Score=324.23 Aligned_cols=303 Identities=14% Similarity=0.082 Sum_probs=250.7 Q ss_pred hhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCcee Q lcl|NC_019933. 77 IGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLE 156 (394) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 156 (394) |.+......... ............++.....++.+|.+||+++.++|++.+++.++|+++++++|++++.++ T Consensus 1 ~~~~~~~~~~~~--------~~~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ 72 (324) T protein:vir:78 1 MEQTQKLKLNLQ--------HFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKK 72 (324) T ss_pred CCcchhhhHHHH--------HHHHHhhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceE Confidence 111111111111 111111122223344455567788999999999999999999999999999999998899 Q ss_pred EEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_019933. 157 YVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNG 235 (394) Q Consensus 157 ~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g 235 (394) +|+.++. +.+.|++||+.+|+++++|+++++.++|++++++||+|+++++ .+++++|.++|++++++++|.++|+|+| T Consensus 73 ~p~~~~~-~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g 151 (324) T protein:vir:78 73 FTFWADK-PGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQG 151 (324) T ss_pred EEEEecC-cceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 9998764 6789999999999999999999999999999999999999988 5899999999999999999999999999 Q ss_pred CCccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcccCCCcee Q lcl|NC_019933. 236 TGQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPTL 315 (394) Q Consensus 236 ~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~l 315 (394) ++..+.|+........ ....+..++++|.+++..+...+..+++|+|||.++..|++++|++|+|++.. ..+++| T Consensus 152 ~~~~~~gi~~~~~~~~--~~~~~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~d~~G~~~~~~---~~~~~l 226 (324) T protein:vir:78 152 NNPFGKSIAQSIEKTN--KVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD---RNSDSL 226 (324) T ss_pred CCCcCccccccccccc--eeccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCCeeecC---CCCCcc Confidence 9988988877544332 22345668999999999999999999999999999999999999999999853 345689 Q ss_pred ecceEEEcCC--CCcCceEEeeccceEEEEeecceEEEEecccc------------hhhhcCcEEEEEEEEeccEEeccc Q lcl|NC_019933. 316 WGLPVVATQA--MAVGQFLTGAFDAGAQVFDRWAARVEVATENQ------------DDFIKNMVTILAEERLALAVYRPE 381 (394) Q Consensus 316 ~G~pv~~~~~--~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~~~~~~~~~~~~~~~d~~v~~~~ 381 (394) +|+||+.++. ++++.+++|||++ +.++.+.+++++++++.. .+|++|++.||+++|+||++.+|+ T Consensus 227 ~G~PV~~~~~~~~~~~~~~~gd~~~-~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~ 305 (324) T protein:vir:78 227 DGLPVVNLKSSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDK 305 (324) T ss_pred cceeeEeeCCCCCCcceEEEEecce-EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEeccc Confidence 9999999776 4566799999997 567889999999987643 469999999999999999999999 Q ss_pred ceEEEEecCCCCC Q lcl|NC_019933. 382 SFIKGSLAAAAGT 394 (394) Q Consensus 382 a~~~l~~~~a~~~ 394 (394) ||++|+.+.+.++ T Consensus 306 A~~~l~~a~~~~~ 318 (324) T protein:vir:78 306 AFAKLVPADKRTD 318 (324) T ss_pred ceEEEecccccCC Confidence 9999998777776 No 69 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=100.00 E-value=4.3e-55 Score=318.63 Aligned_cols=349 Identities=11% Similarity=0.024 Sum_probs=251.3 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhhhhh Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQHISIGQQ 80 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~ 80 (394) ||+++++++.+.++.+.+++. .. .++....+++ ....++.++.+......+......... +. T Consensus 5 ~k~~~~~~~~~~~l~~~~~~~----~~----~ee~~~~~~~---~~~~~~~~~~~~~~~e~~~~~~~~~~~---~~---- 66 (377) T protein:vir:98 5 LKELPKYREAVAELSAKISAG----AT----SEEQEKLFEA---AFTTMGDEILAKNEEEMERMFDLRDKN---RE---- 66 (377) T ss_pred HHHHHHHHHHHHHHHHHHHhh----hh----hHHHHHHHHH---HHHhHHHHHHHHHHHHHHHHHHhccCC---cc---- Confidence 445555555544443332221 11 1111112222 222333333221100000000000000 00 Q ss_pred hhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEE Q lcl|NC_019933. 81 FVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRE 160 (394) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 160 (394) ....+.+........+++++||++||+++.+.|++.+...++++++|++.++++. .++|+. T Consensus 67 ------------------lt~ee~~~~~~~~~~~~~~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~-~~~~~~ 127 (377) T protein:vir:98 67 ------------------LTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSLR-LKALTA 127 (377) T ss_pred ------------------cCHHHHHHHHHHHhccCCCCCccccCHHHHHHHHHHHHHhhhhhhheeeEecCcc-eEEEEe Confidence 0111222222234456778899999999999999999999999999999998765 789987 Q ss_pred cCcccccceecCCcccc-ccccceeeEEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhccCCCc Q lcl|NC_019933. 161 TGFTNAAAPVAEGAQKP-ESSLRFDLVQTSAKVIAHWMKASRQILSDSA-QLQSFINARLLRGLEVVEENQLLNGNGTGQ 238 (394) Q Consensus 161 ~~~~~~~~~~~eg~~~~-~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~-~~~~~i~~~la~a~~~~~d~a~l~g~g~~~ 238 (394) .+ .+.+.|++|++..+ +++++|+++++.+++++++++||+++|+|++ ++++||++++++++++++|.+|++|+|++ T Consensus 128 ~~-~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~a~~~is~elL~ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~- 205 (377) T protein:vir:98 128 ET-SGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWIKQFITEQLKEAIAVALELAIVKGDGLL- 205 (377) T ss_pred cC-CcceeEeecccccCcccCccceeEeecceeEEeeecccHHhhhccHhHHHHHHHHHHHHHHHHHHhhceEeccCCC- Confidence 65 46788999987665 6789999999999999999999999999986 89999999999999999999999999975 Q ss_pred ccccccccccccccccc-----ccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccC-c----- Q lcl|NC_019933. 239 NLLGLLPQATAFAAPIT-----VANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGN-P----- 307 (394) Q Consensus 239 ~~~Gi~~~~~~~~~~~~-----~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~-~----- 307 (394) .|.||++......+... .+.....+.+.++...+...+..+++|+||+.++..++++||.+|+|+|.. + T Consensus 206 qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~a~~~m~~~t~~~~~klkd~~G~~i~~~n~~~~~~ 285 (377) T protein:vir:98 206 QPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLTPDNAPKKLVPVMKHLSVNDKKRPLKIAGQVKLILNPEDRWA 285 (377) T ss_pred cceeeeecccccccccccccccccccchhhhHhhhhhhchhHHHHHHHHHHHHHHHHHHhhhhccCCceEEEecccchhh Confidence 68999986543322111 122223467778888888888899999999999999999999999999831 1 Q ss_pred ---------ccCCCceeecce--EEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccE Q lcl|NC_019933. 308 ---------QGTLAPTLWGLP--VVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALA 376 (394) Q Consensus 308 ---------~~~~~~~l~G~p--v~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~ 376 (394) ..+...+++|+| |+.++.||+++++||||++ |.++++.+++++.+++ ..|.+|++.|++..|+|++ T Consensus 286 ~~p~~~~~~~~G~~~t~lg~p~~vv~s~~~p~~~i~fgdf~~-Y~i~~r~~~~i~~~~~--~~~~~d~~~f~~~~r~dg~ 362 (377) T protein:vir:98 286 LEAQFTSRNQFGEYVTVLPHGITILESLAVETGKAIAFVANR-YDAFMATASTIEEYDQ--TFAMEDLQLYLTKNYFYGK 362 (377) T ss_pred ccccccccCCCCccccccCCCceEEecCCCCcccEEEEEecc-eeEEeecceEEEeech--hhhhcCceEEEEEEEEcCE Confidence 012223688887 6678899999999999998 8899999999887764 4688999999999999999 Q ss_pred EecccceEEEEecCC Q lcl|NC_019933. 377 VYRPESFIKGSLAAA 391 (394) Q Consensus 377 v~~~~a~~~l~~~~a 391 (394) +++++||++++++.- T Consensus 363 ~~~~~a~~vl~i~~~ 377 (377) T protein:vir:98 363 AKDNHTAALLTLAGG 377 (377) T ss_pred EeccCcEEEEEEecC Confidence 999999999999887 No 70 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=100.00 E-value=2.9e-55 Score=319.58 Aligned_cols=303 Identities=15% Similarity=0.081 Sum_probs=250.4 Q ss_pred hhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCcee Q lcl|NC_019933. 77 IGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLE 156 (394) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 156 (394) |.+........+. ....................++|.+||+++.++|++.+++.++|+++++++|+++++++ T Consensus 1 ~~k~~~~~~~~~~--------~~~~~~~~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ 72 (324) T protein:vir:99 1 MEQTQKLKLNLQH--------FASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYEPMEGTEKK 72 (324) T ss_pred CCCchHhhHHHHH--------HHHHhhhhhhccccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceE Confidence 1111111101111 11112222233344445556677799999999999999999999999999999999999 Q ss_pred EEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_019933. 157 YVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNG 235 (394) Q Consensus 157 ~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g 235 (394) +|+.++ .+.+.|++||+.+|+++++|+++++.++|++++++||+|+++++ .+++++|.+.|++++++++|.++|+|+| T Consensus 73 ~p~~~~-~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G~g 151 (324) T protein:vir:99 73 FTFWAD-KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQG 151 (324) T ss_pred EEEEec-CcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCC Confidence 999876 46789999999999999999999999999999999999999998 5899999999999999999999999999 Q ss_pred CCccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcccCCCcee Q lcl|NC_019933. 236 TGQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPTL 315 (394) Q Consensus 236 ~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~l 315 (394) ++..+.|+.+...... ....+..+++++.+++..+...+..+++|+|||.+|..|++++|++|+|+|.. ..+++| T Consensus 152 ~~~~~~~~~~~~~~~~--~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~g~~~~~~---~~~~~l 226 (324) T protein:vir:99 152 NNPFGKSIAQSIEKTN--KVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD---RNSDTL 226 (324) T ss_pred CCccCccccccccccc--eeccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhcCCCceeecC---CCCccc Confidence 9988888876544322 33445678999999999999999999999999999999999999999999853 345689 Q ss_pred ecceEEEcCCCC--cCceEEeeccceEEEEeecceEEEEecccc------------hhhhcCcEEEEEEEEeccEEeccc Q lcl|NC_019933. 316 WGLPVVATQAMA--VGQFLTGAFDAGAQVFDRWAARVEVATENQ------------DDFIKNMVTILAEERLALAVYRPE 381 (394) Q Consensus 316 ~G~pv~~~~~~p--~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~~~~~~~~~~~~~~~d~~v~~~~ 381 (394) +|+||+.++.++ .+.+++|||+. +.++.+.+++|+++++.. .+|++|++.+|++.|+||++.+|+ T Consensus 227 ~G~PVv~~~~~~~~~~~~i~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~ 305 (324) T protein:vir:99 227 DGLPVVNLKSSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDK 305 (324) T ss_pred cceeEEeecCCCCCcceEEEEeccc-EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEeccc Confidence 999999988765 45689999997 557888999999887643 459999999999999999999999 Q ss_pred ceEEEEecCCCCC Q lcl|NC_019933. 382 SFIKGSLAAAAGT 394 (394) Q Consensus 382 a~~~l~~~~a~~~ 394 (394) ||++++.+.+..+ T Consensus 306 a~~~lt~a~~~~~ 318 (324) T protein:vir:99 306 AFAKLVPADKKTD 318 (324) T ss_pred ceEEEEeccCCCC Confidence 9999999888888 No 71 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=100.00 E-value=2.7e-55 Score=319.73 Aligned_cols=303 Identities=14% Similarity=0.083 Sum_probs=247.8 Q ss_pred hhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCcee Q lcl|NC_019933. 77 IGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLE 156 (394) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 156 (394) +.+........+.+ .......+..+........++|.+||+++.++|++.++++++|+++++++|++++.++ T Consensus 1 ~~~~~~~~~~~~~f--------~~~~~~~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ 72 (324) T protein:vir:96 1 MEQTQKLKLNLQHF--------ASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKK 72 (324) T ss_pred CCcchhhhHHHHHH--------HHhhhhhhhcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceE Confidence 11110011111111 1111111222333334445677799999999999999999999999999999998899 Q ss_pred EEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_019933. 157 YVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNG 235 (394) Q Consensus 157 ~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g 235 (394) +|+.++. +.+.|++||+.+|+++++|+++++.++|++++++||+|+++++ .+++++|.++|++++++++|.++|+|+| T Consensus 73 ~p~~~~~-~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~~~l~G~g 151 (324) T protein:vir:96 73 FTFWADK-PGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQG 151 (324) T ss_pred EEEEecC-cceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCC Confidence 9998764 6789999999999999999999999999999999999999987 5899999999999999999999999999 Q ss_pred CCccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcccCCCcee Q lcl|NC_019933. 236 TGQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPTL 315 (394) Q Consensus 236 ~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~l 315 (394) ++..+.|+....... .....+..++++|.+++.+++..+..+++|+|||.++..|++++|++|+|+++. ..+++| T Consensus 152 ~~~~~~~~~~~~~~~--~~~~~~~~~~~~i~~~~~~i~~~~~~~~~~i~n~~~~~~L~~lkd~~G~~~~~~---~~~~~l 226 (324) T protein:vir:96 152 NNPFGKSIAQSIKKT--NKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD---RNSDSL 226 (324) T ss_pred CCCcCcccccccccc--ceecccccchHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhCCCCCeeecC---CCCCcc Confidence 998888887654332 222345567999999999999999999999999999999999999999999853 345689 Q ss_pred ecceEEEcCCC--CcCceEEeeccceEEEEeecceEEEEecccc------------hhhhcCcEEEEEEEEeccEEeccc Q lcl|NC_019933. 316 WGLPVVATQAM--AVGQFLTGAFDAGAQVFDRWAARVEVATENQ------------DDFIKNMVTILAEERLALAVYRPE 381 (394) Q Consensus 316 ~G~pv~~~~~~--p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~~~~~~~~~~~~~~~d~~v~~~~ 381 (394) +|+||++++.. +.+.+++|||+. +.++.+.+++++++++.. .+|++|++.+|+++|+||++.+|+ T Consensus 227 ~G~PV~~~~~~~~~~~~~~~gd~s~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~n~v~~r~~~r~d~~v~~~~ 305 (324) T protein:vir:96 227 DGLPVVNLKSSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDK 305 (324) T ss_pred cceeeEeecCCCCCcceEEEEecce-EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEeccc Confidence 99999987764 556799999997 567888999999887643 469999999999999999999999 Q ss_pred ceEEEEecCCCCC Q lcl|NC_019933. 382 SFIKGSLAAAAGT 394 (394) Q Consensus 382 a~~~l~~~~a~~~ 394 (394) ||++|+.+.+.++ T Consensus 306 a~~~l~~a~~~~~ 318 (324) T protein:vir:96 306 AFAKLVPADKRTD 318 (324) T ss_pred ceEEEecccccCC Confidence 9999998888887 No 72 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=100.00 E-value=1.2e-54 Score=316.24 Aligned_cols=344 Identities=15% Similarity=0.155 Sum_probs=242.9 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhhhhh Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQHISIGQQ 80 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~ 80 (394) |++|++++++.+.+.++...+. +++++++ .+.+.. ...............+.. . T Consensus 1 ~eei~~l~~~~~~l~~~~~~l~--------------~~~d~~e-------~e~~~~---~~~~~~~~~~~~~~~~~~-~- 54 (352) T protein:vir:78 1 MEDIKQLETEKAGLQQRFNIVE--------------RQVQDIE-------EKEKAK---VKDKGEAYQSLNDNEKLV-K- 54 (352) T ss_pred ChhHHHHHHHHHHHHHHHHHHH--------------HHHHHHH-------HHHHHH---hhhccccccccchhhhHH-H- Confidence 8888888777766644433221 1111111 111110 000000000000000000 0 Q ss_pred hhhHHHHHHHHHHhhhh---hhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeE Q lcl|NC_019933. 81 FVNSDSFKAMAESGGQR---GRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEY 157 (394) Q Consensus 81 ~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 157 (394) .+..+.+..... .............++.+++++||++||+++.++|++.++++++|+++++++++++ ..+ T Consensus 55 -----~~~~~~r~~~~~~~~~~~~~~~~~~~~al~~~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~v~~~~~--~~~ 127 (352) T protein:vir:78 55 -----AKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEI 127 (352) T ss_pred -----HHHHHHHHHhhhhHHHHHHhhHHHHHHHhccCCCCCCceeccHhHHHHHHHHHHhhcchhhheeeEecCC--ceE Confidence 000011110000 0011112223344566777889999999999999999999999999999988765 467 Q ss_pred EEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHH-HHhhccC Q lcl|NC_019933. 158 VRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEEN-QLLNGNG 235 (394) Q Consensus 158 ~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~-a~l~g~g 235 (394) |+.....+.+.|++||+.+|+++++|++|++.+++++++++||+|+++|+ .++++||.++|+++++++.+. ++..|+| T Consensus 128 p~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~i~is~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g 207 (352) T protein:vir:78 128 PRVSYTLDDDDFITDVETAKELKLKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPK 207 (352) T ss_pred EEEecCCCcccccccccccccccccceeeeecceeEEeechhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhhhhcCCC Confidence 77665556789999999999999999999999999999999999999998 599999999999999998655 5556666 Q ss_pred CCccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcccCCCcee Q lcl|NC_019933. 236 TGQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPTL 315 (394) Q Consensus 236 ~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~l 315 (394) ++.+ .|++...+... .++...+++|.+++..++..+..+++|+||+.++..|.++++.+|+|++. +.+.+| T Consensus 208 ~~~~-~g~l~~~~~~~----~t~~~~~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~~~~----~~~~~l 278 (352) T protein:vir:78 208 SGLE-HMSFYNGSVKE----VEGANMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD----TPAEKV 278 (352) T ss_pred Cccc-ccceecccccc----ccccchHHHHHHHHhccChhhhcCCEEEEehHHHHHHHHHHhccCCcccc----cCCccc Confidence 6554 45555444332 23445689999999999999999999999999999999998888999874 234579 Q ss_pred ecceEEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 316 WGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 316 ~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) +|+||++++.++ .++||||+.++... .++.++...+ ..+|++.|++..|+|+++++|+||++++++++++. T Consensus 279 lG~PV~~~~~~~--~~~~Gdf~~~~~~~--~~~~~~~~~~----~~~g~~~f~~~~r~Dg~~~~~eA~~~l~~~a~~~~ 349 (352) T protein:vir:78 279 FGKPVVFTDAAV--KPIVGDFNYFGINY--DGTTYDTDKD----VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKESTGS 349 (352) T ss_pred cccceEEecCCC--ceeEeehhhhhhhh--hhheeeeecc----ccCCeeEEEEEeeeCceeechhheEEEEeecccCC Confidence 999999999865 47999999866544 3444444332 34789999999999999999999999999999998 No 73 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=100.00 E-value=3.5e-55 Score=319.15 Aligned_cols=303 Identities=15% Similarity=0.088 Sum_probs=249.2 Q ss_pred hhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCcee Q lcl|NC_019933. 77 IGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLE 156 (394) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 156 (394) +.+........+.+ .................+..+|.+||+++.++|++.+++.++|+++++++|++++.++ T Consensus 1 ~~~~~~~~~~~~~f--------~~~~~~~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ 72 (324) T protein:vir:10 1 MEQTQKLKLNLQHF--------ASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKK 72 (324) T ss_pred CCCchHHHHHHHHH--------HHHhhccceecccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceE Confidence 11111111111111 1111122223334444555677899999999999999999999999999999999999 Q ss_pred EEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_019933. 157 YVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNG 235 (394) Q Consensus 157 ~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g 235 (394) +|+.++ .+.+.|++||+.+|+++++|+++++.++|++++++||+|+++++ .+++++|.+.|++++++++|.++|+|+| T Consensus 73 ~p~~~~-~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~a~l~G~g 151 (324) T protein:vir:10 73 FTFWAD-KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQG 151 (324) T ss_pred EEEEeC-CcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCC Confidence 999876 46789999999999999999999999999999999999999988 5899999999999999999999999999 Q ss_pred CCccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcccCCCcee Q lcl|NC_019933. 236 TGQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPTL 315 (394) Q Consensus 236 ~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~l 315 (394) ++..+.|+++...... ....+..+++++.+++..+...+..+++|+|||.+|..|++++|++|+|+|.. ..+++| T Consensus 152 ~~~~~~~i~~~~~~~~--~~~~~~~t~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~g~~~~~~---~~~~~l 226 (324) T protein:vir:10 152 NNPFGKSIAQSIEKTN--KVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD---RNSDTL 226 (324) T ss_pred CCccCccccccccccc--eeccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCceeecC---CCCccc Confidence 9988888877544322 33445678999999999999999999999999999999999999999999853 345689 Q ss_pred ecceEEEcCCCC--cCceEEeeccceEEEEeecceEEEEecccc------------hhhhcCcEEEEEEEEeccEEeccc Q lcl|NC_019933. 316 WGLPVVATQAMA--VGQFLTGAFDAGAQVFDRWAARVEVATENQ------------DDFIKNMVTILAEERLALAVYRPE 381 (394) Q Consensus 316 ~G~pv~~~~~~p--~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~~~~~~~~~~~~~~~d~~v~~~~ 381 (394) +|+||+.++.++ .+.+++|||+. +.++.+.+++++++++.. .+|++|++.+|+++|+||++.+|+ T Consensus 227 ~G~PV~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~ 305 (324) T protein:vir:10 227 DGLPVVNLKSSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDK 305 (324) T ss_pred cceeEEeecCCCCCcceEEEEeccc-EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEeccc Confidence 999999987654 56789999987 457888899999887643 469999999999999999999999 Q ss_pred ceEEEEecCCCCC Q lcl|NC_019933. 382 SFIKGSLAAAAGT 394 (394) Q Consensus 382 a~~~l~~~~a~~~ 394 (394) ||++++.+++..+ T Consensus 306 A~~~l~~a~~~~~ 318 (324) T protein:vir:10 306 AFAKLVPADKKTD 318 (324) T ss_pred ceEEEEeccCCCC Confidence 9999998888776 No 74 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=100.00 E-value=3.7e-55 Score=319.01 Aligned_cols=272 Identities=15% Similarity=0.197 Sum_probs=241.3 Q ss_pred HHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCc--eeEEEEcCcccccceecCCccccc-cccce Q lcl|NC_019933. 107 AITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNT--LEYVRETGFTNAAAPVAEGAQKPE-SSLRF 183 (394) Q Consensus 107 ~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~eg~~~~~-~~~~~ 183 (394) .+.++..+++++||++||+++.++|++.++++++|+++++++|+++.. +.+|+.....+.+.|++||+.+|+ ++++| T Consensus 1 ~l~~~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~ 80 (293) T protein:vir:48 1 MLDSKTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADIDDPKL 80 (293) T ss_pred CceeecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCcceeeecCCcccccccccce Confidence 677888888899999999999999999999999999999999987654 556666555567899999999997 56999 Q ss_pred eeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccccccchH Q lcl|NC_019933. 184 DLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVANATAV 262 (394) Q Consensus 184 ~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~ 262 (394) ++++++++|++++++||+|+++|+ .+++++|.+++++++++++|.+|++|.+++.. ..+..++ T Consensus 81 ~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~----------------~~~~~~~ 144 (293) T protein:vir:48 81 SLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGVVDKLPT----------------KPTLTKW 144 (293) T ss_pred eEEEEeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhccccccc----------------cccccCH Confidence 999999999999999999999998 58999999999999999999999998765332 2345578 Q ss_pred HHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCc-ccCCCceeecceEEEcCC--CCc-----CceEEe Q lcl|NC_019933. 263 DRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNP-QGTLAPTLWGLPVVATQA--MAV-----GQFLTG 334 (394) Q Consensus 263 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~-~~~~~~~l~G~pv~~~~~--~p~-----~~~~~g 334 (394) ++|.+++.++...+..+++|+||+.++..|+++||++|+|+|++. ..+.+++|+|+||++++. +|. ..++|| T Consensus 145 d~i~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~l~G~Pv~~~~~~~~~~~~~~~~~~~~g 224 (293) T protein:vir:48 145 DDIIDLEAKVDPAIKQTSFFLTNTSGFTALKKVKNALGDYLMERDVKSPTGYSIAGFAVKEISDRWLPNASSGVMPLYFG 224 (293) T ss_pred HHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceEeecCcCCCCCceecceeeEEecccccCCccCCceEEEEE Confidence 999999999999999999999999999999999999999999754 455667999999987543 443 247999 Q ss_pred eccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 335 AFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 335 d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) ||++++.++++.+++++++++...+|++|++.+|++.|+|+++++|+||+++++++++++ T Consensus 225 d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~~ 284 (293) T protein:vir:48 225 DLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIADQ 284 (293) T ss_pred eccceEEEEEecceEEEEecccchhhhcCeEEEEEEEeeCcEEecccceEEEEeeccccC Confidence 999999999999999999998888899999999999999999999999999999998888 No 75 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=100.00 E-value=6.1e-55 Score=317.81 Aligned_cols=278 Identities=13% Similarity=0.090 Sum_probs=236.3 Q ss_pred cccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEcCcccccceecCCccccccccceeeEEeeeee Q lcl|NC_019933. 113 TNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKV 192 (394) Q Consensus 113 ~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k 192 (394) -.+.++||++||+++.++|++.+++.++|+++++++|++++..++|+.++. +.+.|++||+.+|+++++|+++++.++| T Consensus 1 m~t~t~gg~liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~-~~a~wv~E~~~~~~s~~~f~~v~l~~~k 79 (303) T protein:vir:97 1 MGTETSKASLFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNGSKEFTFTLD-SDIDVVAENGKKTHGGLSLEPVTIVPIK 79 (303) T ss_pred CcccCCCCeEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEecC-cceEEeecCccccccccceeeEEeeeEE Confidence 334456899999999999999999999999999999999988999998775 6789999999999999999999999999 Q ss_pred EEEeehhhHHHHH---HH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCCcc----cccccccccccc-ccccccccchHH Q lcl|NC_019933. 193 IAHWMKASRQILS---DS-AQLQSFINARLLRGLEVVEENQLLNGNGTGQN----LLGLLPQATAFA-APITVANATAVD 263 (394) Q Consensus 193 ~~~~~~is~e~l~---~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~----~~Gi~~~~~~~~-~~~~~~~~~~~~ 263 (394) +++++++|+|+++ ++ .+++++|.+++++++++++|.++|+|+++... +.|+....+..+ .....++...++ T Consensus 80 l~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (303) T protein:vir:97 80 VEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNHFDSKVTQVVKFTESEDADA 159 (303) T ss_pred EEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCccccccccccccccccccccccccccchHH Confidence 9999999999984 33 57999999999999999999999999654332 223222111111 122234556789 Q ss_pred HHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcc--cCCCceeecceEEEcCCCCcC--------ceEE Q lcl|NC_019933. 264 RLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQ--GTLAPTLWGLPVVATQAMAVG--------QFLT 333 (394) Q Consensus 264 ~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~--~~~~~~l~G~pv~~~~~~p~~--------~~~~ 333 (394) ++.+++..+...++.++.|+|||.++.+|+++||++|+|+|.+.. +..+++|+|+||++++.||.. .++| T Consensus 160 ~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~g~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~ 239 (303) T protein:vir:97 160 NIEAAVNLIQGAEGVVTGLAMDTEFSTALAKVTNGEMGPKMYPELAWGANPDSINGLKSSVNTTVGAGADEAESKDLVII 239 (303) T ss_pred HHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCCeEEecCccCCCCCceecceeeEEecccCCccccCCCccEEEE Confidence 999999999988999999999999999999999999999997543 344568999999999999853 3789 Q ss_pred eeccceEEEEeecceEEEEecccc------hhhhcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|NC_019933. 334 GAFDAGAQVFDRWAARVEVATENQ------DDFIKNMVTILAEERLALAVYRPESFIKGSLAAA 391 (394) Q Consensus 334 gd~~~~~~~~~~~~~~i~~~~~~~------~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a 391 (394) |||+..+.++.+.++++++.++.. .+|++|++.+|++.|+||++++|+||++|+.+.. T Consensus 240 Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~~~~ 303 (303) T protein:vir:97 240 GDFESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTKGEV 303 (303) T ss_pred eeccccEEEEEecCcEEEEeeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEeeCCCC Confidence 999988999999999999887643 3599999999999999999999999999998888 No 76 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=100.00 E-value=1.3e-54 Score=316.05 Aligned_cols=275 Identities=17% Similarity=0.112 Sum_probs=234.4 Q ss_pred cCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEcCcccccceecCCccccccccceeeEEeeeeeEE Q lcl|NC_019933. 115 ADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIA 194 (394) Q Consensus 115 ~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~ 194 (394) -..+||.++|+++..+|++.++++++|+++++++|++++..++|+.++. +.+.|++|++.+|+++++|+++++.++|++ T Consensus 1 ma~~gG~lvp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~-~~a~~v~E~~~~~~~~~~f~~v~l~~~k~a 79 (298) T protein:vir:16 1 MVLNKGTLFDPTLVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMD-SEIDVVAESGKKTHGGVTLAPQTMVPIKVE 79 (298) T ss_pred CcccCcceechhHHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEecC-cceEEecCCccccccccceeEEEEeeeeEE Confidence 3355789999999999999999999999999999999888999998775 678999999999999999999999999999 Q ss_pred EeehhhHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHhhccC----CCccccccccccccc--cccccccccchHHH Q lcl|NC_019933. 195 HWMKASRQILSDS----AQLQSFINARLLRGLEVVEENQLLNGNG----TGQNLLGLLPQATAF--AAPITVANATAVDR 264 (394) Q Consensus 195 ~~~~is~e~l~~s----~~~~~~i~~~la~a~~~~~d~a~l~g~g----~~~~~~Gi~~~~~~~--~~~~~~~~~~~~~~ 264 (394) ++++||+|+++++ .+++++|.++|++++++++|.++++|.+ ....+.|+....+.. ...........+++ T Consensus 80 ~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (298) T protein:vir:16 80 YGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGA 159 (298) T ss_pred EeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcccccccccccccccccccccccccccHHHH Confidence 9999999999643 3799999999999999999999999843 333333433322221 12222334445788 Q ss_pred HHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCc-ccCCCceeecceEEEcCCCCcC------ceEEeecc Q lcl|NC_019933. 265 LRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNP-QGTLAPTLWGLPVVATQAMAVG------QFLTGAFD 337 (394) Q Consensus 265 i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~-~~~~~~~l~G~pv~~~~~~p~~------~~~~gd~~ 337 (394) +.+++..+...+..+++|+|||.++..|+++||++|+|+|++. ..+.+++|+|+||++++.+|.+ .+++|||+ T Consensus 160 i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~v~~~~~~~~~~~~~GDfs 239 (298) T protein:vir:16 160 IENAVELLTGVDADVTGIAINPSFRSALAKQKDLQDNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFA 239 (298) T ss_pred HHHHHHHhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCCceecceeeEEecccccccCCCccEEEEeecc Confidence 9999999999999999999999999999999999999999764 4555679999999999999863 57889999 Q ss_pred ceEEEEeecceEEEEecccc------hhhhcCcEEEEEEEEeccEEecccceEEEEecC Q lcl|NC_019933. 338 AGAQVFDRWAARVEVATENQ------DDFIKNMVTILAEERLALAVYRPESFIKGSLAA 390 (394) Q Consensus 338 ~~~~~~~~~~~~i~~~~~~~------~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~ 390 (394) .++.++.+.++++++.++.. .+|++|++.||++.|+||++++|+||++|+-++ T Consensus 240 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 240 NGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred ceEEEEEecCceEEEeeccCCcCcchhhhhcCcEEEEEEEEEccEeecccceEEEeecC Confidence 98888889999999877543 359999999999999999999999999999888 No 77 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=100.00 E-value=1.1e-53 Score=310.83 Aligned_cols=378 Identities=13% Similarity=0.123 Sum_probs=244.4 Q ss_pred CchHH----------------HHHHHHHHHHHHHHHHHHHHHhhh--hhhHHHH---HHHHHHHHHH-HHHHHHHHHHHH Q lcl|NC_019933. 1 MSDIN----------------AINSTLANISDSLKAHADRAVKDQ--ELNASVR---AKVDELLMAQ-GALQADLKAAQQ 58 (394) Q Consensus 1 Mk~i~----------------el~~~~~~~~~~~k~~~e~~~~~~--~~~~e~~---~~~~~~~~~~-~~l~~~i~~~e~ 58 (394) ..... +..+.+.+..++.+++.+...... +...+.. ..+++.+++. +.++........ T Consensus 199 ~~~~r~~~~~a~~~~~~~~~a~~~~~~~~E~~r~~eI~~l~~~~~~~~~~~~ai~~g~sld~~ra~~ld~l~~~~~a~~~ 278 (632) T protein:vir:96 199 QTETRGAETGAKNPAPAASGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFE 278 (632) T ss_pred cccccchhhcccccchhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhHHHHHhccccHHHHHHHHHHHHhhhhhhhhh Confidence 00000 000000000111111111100000 0000000 0011111110 000000000000 Q ss_pred HHHHHHhhccccccc-------chhh-----hhh------------hhhHHHHHHHHHHhhhhh---hhhHHHHHHHhhc Q lcl|NC_019933. 59 RIAEVEGNGAGGDVQ-------HISI-----GQQ------------FVNSDSFKAMAESGGQRG---RAEINIKAAITSL 111 (394) Q Consensus 59 ~~~~~~~~~~~~~~~-------~~~~-----~~~------------~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~ 111 (394) +.............. .+.. .+. .................. ....+ ....... T Consensus 279 ~~~a~~~~~~~~~~~~~~i~~~~re~~~~~l~rai~a~a~~~~~~a~~~~e~a~~~a~~~G~~arg~~~~~~-~l~~ra~ 357 (632) T protein:vir:96 279 KPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHE-VLVQRQL 357 (632) T ss_pred hhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHHhhhccchhhhhhhhHHHHHHHHhhhhhhhhhhhhHH-HHHHhhh Confidence 000000000000000 0000 000 000000000000000000 00000 0112344 Q ss_pred ccccCCcCccccchhh-hhHHHhhhhhhhhHHHh-ccccccccCceeEEEEcCcccccceecCCccccccccceeeEEee Q lcl|NC_019933. 112 STNADGSAGATVQTTR-LPGILELPQRRMTIRSL-LAQGTMEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTS 189 (394) Q Consensus 112 ~~~~~~~~g~~ip~~~-~~~ii~~~~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~ 189 (394) .++++++||++||+++ ...||+.+++.+++.++ ++++|+..+.+++|+.++. +.+.|++|++.+|+++++|+++++. T Consensus 358 ~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l~~~~~~~~~g~~~ip~~~~~-~~a~wv~E~~~~~~s~~~f~~i~l~ 436 (632) T protein:vir:96 358 EKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSG-ANFYWIGEDEDVQDSDFDFTTLSFS 436 (632) T ss_pred hcccccccccccccccchHHHHHHHhhcchhhhhcceEeecCCcceEEEEEeCC-ceeEeecCCccccccccceeeEEee Confidence 5566678999999886 57899999999999998 6778888888999999875 6789999999999999999999999 Q ss_pred eeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccccccchHHHHHHH Q lcl|NC_019933. 190 AKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLA 268 (394) Q Consensus 190 ~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~ 268 (394) +++++++++||+|+++++ ++++++|.+.|+.+++.++|.++|+|+|+++.|.||++.++..+.+. .++..+++++.++ T Consensus 437 ~~k~~~~v~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~-~~~~~~~~~i~~~ 515 (632) T protein:vir:96 437 PKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTY-PAGGVDWASVVDM 515 (632) T ss_pred eeEEEEehhhHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccceeeecccccceec-ccccCCHHHHHHH Confidence 999999999999999987 69999999999999999999999999999899999998877655443 3345678899999 Q ss_pred HHHhhhhcC--CCCeeEeCHHHHHHHHH--hhccCCcccccCcccCCCceeecceEEEcCCCCcCceEEeeccceEEEEe Q lcl|NC_019933. 269 LLQAQLAEF--PATGIVLNPADWAGIEL--LKDTQGRYILGNPQGTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFD 344 (394) Q Consensus 269 ~~~~~~~~~--~~~~~~~~~~~~~~l~~--lkd~~G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~ 344 (394) ...+...+. .+++|+||+.++..|.+ ++|++|+|+|+ +++|+|+||++++.+|+++++||||+. +++++ T Consensus 516 ~~~i~~~~~~~~~~~~~~~~~~~~~l~~~~l~d~~G~~i~~------~~~l~G~pv~~s~~ip~~~~~~gd~s~-~~i~~ 588 (632) T protein:vir:96 516 ETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQ------NNEVNGYRAEASNQIPADTWIFGDWSQ-IVIAM 588 (632) T ss_pred HHHHhhcccccCccEEEEchhHHHHHHHHhccCCCCceeec------CCeecccceEeccccccCcEEEeecce-EEEEE Confidence 888887664 46689999998777764 77999999985 347999999999999999999999997 55778 Q ss_pred ecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecC Q lcl|NC_019933. 345 RWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAA 390 (394) Q Consensus 345 ~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~ 390 (394) +.++.+.++++. ++.+|++.|+++.|+|+++++|++|++++.+| T Consensus 589 ~~~~~i~~~~~~--~~~~~~v~~~~~~~~d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 589 WGVLDLKVDPYT--KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) T ss_pred ecceEEEEcccc--ccccCceEEEEEeecCceeechhhhhheeecC Confidence 888888887754 57899999999999999999999999999999 No 78 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=100.00 E-value=1e-54 Score=316.52 Aligned_cols=288 Identities=14% Similarity=0.131 Sum_probs=241.2 Q ss_pred hhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEcCcccccceecCCcccc Q lcl|NC_019933. 98 GRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTNAAAPVAEGAQKP 177 (394) Q Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~ 177 (394) .....+.+. +....++.+|.++|+++..+|++.+++.++|++++++++++++.+++|++++. +.+.|++|++.+| T Consensus 1 ~g~~~e~~~----~~~~~t~~~~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~-~~a~wv~Eg~~~~ 75 (397) T protein:vir:23 1 MGFSADHSQ----IAQTKDTMFTGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMGATGIVIPHWTGD-VSAQWIGEGDMKP 75 (397) T ss_pred CCcCHHHHH----HhhccCCCCccccchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcCC-cceEEecCCcccc Confidence 112222222 22223334455677788999999999999999999999999988999999874 6789999999999 Q ss_pred ccccceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccc Q lcl|NC_019933. 178 ESSLRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITV 256 (394) Q Consensus 178 ~~~~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~ 256 (394) +++++|+++++.+||++++++||+|+++++ .+++++|+++|++++++++|+++|+|+|+..++.++...... .... T Consensus 76 ~s~~~f~~v~l~~~k~~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~~~~~~~~~~~~~---~~~~ 152 (397) T protein:vir:23 76 ITKGNMTKRDVHPAKIATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAALHGTNAPSAFQGYLDQSNK---TQSI 152 (397) T ss_pred ccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccCCcccccccccccc---eeee Confidence 999999999999999999999999999988 589999999999999999999999999998887777654443 2223 Q ss_pred cccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcccCC------CceeecceEEEcCCCCcCc Q lcl|NC_019933. 257 ANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQGTL------APTLWGLPVVATQAMAVGQ 330 (394) Q Consensus 257 ~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~------~~~l~G~pv~~~~~~p~~~ 330 (394) .+...++++.++...+...+..+++|+||+.++..|+++||++|+|+|++....+ +++|+|+||++++++|+++ T Consensus 153 ~~~~~~~~~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~tl~G~Pv~~s~~~~~g~ 232 (397) T protein:vir:23 153 SPNAYQGLGVSGLTKLVTDGKKWTHTLLDDTVEPVLNGSVDANGRPLFVESTYESLTTPFREGRILGRPTILSDHVAEGD 232 (397) T ss_pred cccchhHHHHHHHHhhhhcccCCCEEEEcHHHHHHHHHhhccCCceeecccccccccccccCceeeeeeEEEeCCCCCCc Confidence 4455678888888999999999999999999999999999999999997654322 3489999999999999886 Q ss_pred e--EEeeccceEEEEeecceEEEEecccc------------hhhhcCcEEEEEEEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 331 F--LTGAFDAGAQVFDRWAARVEVATENQ------------DDFIKNMVTILAEERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 331 ~--~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) . ++|||+. ++++.+.++.++++++.+ .+|++|++.||++.|+||++++|+||++++..+...+ T Consensus 233 ~~~~~gDfs~-~~i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~~~~~ 309 (397) T protein:vir:23 233 VVGYAGDFSQ-IIWGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDPVLTT 309 (397) T ss_pred eEEEEeecce-EEEEEEeceEEEEeeeeeeeeccccccceeeeeeccceeEEEEeeeccceecccceEEEeeccccce Confidence 4 7899997 457788899998876542 4599999999999999999999999999998777666 No 79 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=100.00 E-value=1.1e-54 Score=316.31 Aligned_cols=299 Identities=15% Similarity=0.090 Sum_probs=236.3 Q ss_pred HHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEcCcccccc Q lcl|NC_019933. 89 AMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTNAAA 168 (394) Q Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 168 (394) ......+...+...+ .....+++ ++.+|.++|+++.++|++.+++.++|+++++++|++++.+++|+.++. +.+. T Consensus 1 ~~~~~~r~~~~~~~~---e~~a~~~~-~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~-~~a~ 75 (326) T protein:vir:42 1 MAVNPDRTTPFLGVN---DPKVAQTG-DSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKIPMGTTGQKIPHWTGD-VSAS 75 (326) T ss_pred CCCCccchhhhcCcc---hhhheecc-ccCCcceechhhHHHHHHHHHhcchhhhhcceeeccCCceEEEEEeCC-cceE Confidence 000000000001111 11222333 344566799999999999999999999999999999989999998864 6788 Q ss_pred eecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCCcccccccccc Q lcl|NC_019933. 169 PVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQA 247 (394) Q Consensus 169 ~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~ 247 (394) |++||+.+|+++++|+++++.++|++++++||+|++++| .+++++|.++|++++++++|+++|+|+|++ .|.|+++.. T Consensus 76 ~v~Eg~~~~~~~~~f~~i~~~~~k~~~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~-~p~gi~~~~ 154 (326) T protein:vir:42 76 WIGEGDMKPITKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAINGTDSP-FPTFLAQTT 154 (326) T ss_pred EecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC-ccccccccc Confidence 999999999999999999999999999999999999987 589999999999999999999999999965 567877654 Q ss_pred cccccc----ccccccchHHH--HHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcccCC------Ccee Q lcl|NC_019933. 248 TAFAAP----ITVANATAVDR--LRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQGTL------APTL 315 (394) Q Consensus 248 ~~~~~~----~~~~~~~~~~~--i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~------~~~l 315 (394) ...... .......++.+ +..+...+...+..+++|+|||.++.+|++|||++|+|+|++....+ .+++ T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~l 234 (326) T protein:vir:42 155 KEVSLVDPDGTGSNADLTVYDAVAVNALSLLVNAGKKWTHTLLDDITEPILNGAKDKSGRPLFIESTYTEENSPFRLGRI 234 (326) T ss_pred cccceeecccccccccchhHHHHHHHHHhhhhhhccCccEEEEeHHHHHHHHHhhccCCceeeccccccCccccccCcee Confidence 432221 11122223333 34555666777888899999999999999999999999997644322 3479 Q ss_pred ecceEEEcCCCCcCce--EEeeccceEEEEeecceEEEEecccc------------hhhhcCcEEEEEEEEeccEEeccc Q lcl|NC_019933. 316 WGLPVVATQAMAVGQF--LTGAFDAGAQVFDRWAARVEVATENQ------------DDFIKNMVTILAEERLALAVYRPE 381 (394) Q Consensus 316 ~G~pv~~~~~~p~~~~--~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~~~~~~~~~~~~~~~d~~v~~~~ 381 (394) +|+||++++.+|+++. ++|||+.+ +++.+.++.++++++.+ ..|++|++.||+++|+||++.+|+ T Consensus 235 ~G~pv~~~~~~~~~~~~~~~Gd~s~~-~~~~~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~~d~~v~~~~ 313 (326) T protein:vir:42 235 VARPTILSDHVASGTVVGYQGDFRQL-VWGQVGGLSFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYAFHCNDKD 313 (326) T ss_pred eeeeEEEcCCCCCCceEEEEeecceE-EEEEecceEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccc Confidence 9999999999999874 67999975 47788999998876643 349999999999999999999999 Q ss_pred ceEEEEecCCCCC Q lcl|NC_019933. 382 SFIKGSLAAAAGT 394 (394) Q Consensus 382 a~~~l~~~~a~~~ 394 (394) ||++|+.++++++ T Consensus 314 a~~~l~~~~~~~~ 326 (326) T protein:vir:42 314 AFVKLTNVDATEA 326 (326) T ss_pred ceEEEeeccccCC Confidence 9999999999999 No 80 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=100.00 E-value=1.2e-54 Score=316.11 Aligned_cols=281 Identities=14% Similarity=0.109 Sum_probs=239.7 Q ss_pred HHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEcCcccccceecCCcccccccc Q lcl|NC_019933. 102 INIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTNAAAPVAEGAQKPESSL 181 (394) Q Consensus 102 ~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~ 181 (394) +- .....+.+..++++||.+||+++.++|++.+++.++|+++++++|++++.+++|+.++. +.+.|++|++.+|++++ T Consensus 1 ma-~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~-~~a~~v~E~~~~~~~~~ 78 (304) T protein:vir:94 1 MA-TPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAKG-VGAYWVSETERIQTSKP 78 (304) T ss_pred Cc-ccccccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCC-cceEEeecCcccccccc Confidence 11 11124455566778899999999999999999999999999999999988999998764 67899999999999999 Q ss_pred ceeeEEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhccCCCcccc----ccccccccccccccc Q lcl|NC_019933. 182 RFDLVQTSAKVIAHWMKASRQILSDSA-QLQSFINARLLRGLEVVEENQLLNGNGTGQNLL----GLLPQATAFAAPITV 256 (394) Q Consensus 182 ~~~~i~~~~~k~~~~~~is~e~l~~s~-~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~----Gi~~~~~~~~~~~~~ 256 (394) +|++++++++|++++++||+|+++++. +++++|.++|++++++++|.++|+|+|++.+.. +++...... ..... T Consensus 79 ~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~-~~~~~ 157 (304) T protein:vir:94 79 EYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEK-GNVVT 157 (304) T ss_pred eeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCccccccccccccccccc-ccccc Confidence 999999999999999999999999885 899999999999999999999999999766433 223222222 22223 Q ss_pred cccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcccCCCceeecceEEEcCCCCcC----ceE Q lcl|NC_019933. 257 ANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPTLWGLPVVATQAMAVG----QFL 332 (394) Q Consensus 257 ~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~----~~~ 332 (394) .+..++++|.+++.++...+..+++|+|||.++.+|++++|++|+|+|.. .+++|+|+||++++.+|.. .++ T Consensus 158 ~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~----~~~~l~G~PV~~~~~~~~~~~~~~~~ 233 (304) T protein:vir:94 158 DTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDA----NGNEIMGLPLSYTGADVYDKKKSLAL 233 (304) T ss_pred cccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecC----CCccccceeeEEecccccCCCCcEEE Confidence 45567999999999999999999999999999999999999999999853 3468999999999999853 589 Q ss_pred EeeccceEEEEeecceEEEEecccc--------------hhhhcCcEEEEEEEEeccEEecccceEEEEecC Q lcl|NC_019933. 333 TGAFDAGAQVFDRWAARVEVATENQ--------------DDFIKNMVTILAEERLALAVYRPESFIKGSLAA 390 (394) Q Consensus 333 ~gd~~~~~~~~~~~~~~i~~~~~~~--------------~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~ 390 (394) ||||++ +.++.+.+++++++++.. .+|++|++.||+++|+|+++.+|+||++|+.+- T Consensus 234 ~gd~~~-~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 234 MGDWDY-ARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred EEehhh-EEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 999997 568888999998877632 459999999999999999999999999999988 No 81 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=100.00 E-value=1.2e-54 Score=316.11 Aligned_cols=281 Identities=14% Similarity=0.109 Sum_probs=239.7 Q ss_pred HHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEcCcccccceecCCcccccccc Q lcl|NC_019933. 102 INIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTNAAAPVAEGAQKPESSL 181 (394) Q Consensus 102 ~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~ 181 (394) +- .....+.+..++++||.+||+++.++|++.+++.++|+++++++|++++.+++|+.++. +.+.|++|++.+|++++ T Consensus 1 ma-~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~-~~a~~v~E~~~~~~~~~ 78 (304) T protein:vir:10 1 MA-TPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAKG-VGAYWVSETERIQTSKP 78 (304) T ss_pred Cc-ccccccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCC-cceEEeecCcccccccc Confidence 11 11124455566778899999999999999999999999999999999988999998764 67899999999999999 Q ss_pred ceeeEEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhccCCCcccc----ccccccccccccccc Q lcl|NC_019933. 182 RFDLVQTSAKVIAHWMKASRQILSDSA-QLQSFINARLLRGLEVVEENQLLNGNGTGQNLL----GLLPQATAFAAPITV 256 (394) Q Consensus 182 ~~~~i~~~~~k~~~~~~is~e~l~~s~-~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~----Gi~~~~~~~~~~~~~ 256 (394) +|++++++++|++++++||+|+++++. +++++|.++|++++++++|.++|+|+|++.+.. +++...... ..... T Consensus 79 ~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~-~~~~~ 157 (304) T protein:vir:10 79 EYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEK-GNVVT 157 (304) T ss_pred eeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCccccccccccccccccc-ccccc Confidence 999999999999999999999999885 899999999999999999999999999766433 223222222 22223 Q ss_pred cccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcccCCCceeecceEEEcCCCCcC----ceE Q lcl|NC_019933. 257 ANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPTLWGLPVVATQAMAVG----QFL 332 (394) Q Consensus 257 ~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~----~~~ 332 (394) .+..++++|.+++.++...+..+++|+|||.++.+|++++|++|+|+|.. .+++|+|+||++++.+|.. .++ T Consensus 158 ~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~----~~~~l~G~PV~~~~~~~~~~~~~~~~ 233 (304) T protein:vir:10 158 DTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDA----NGNEIMGLPLSYTGADVYDKKKSLAL 233 (304) T ss_pred cccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecC----CCccccceeeEEecccccCCCCcEEE Confidence 45567999999999999999999999999999999999999999999853 3468999999999999853 589 Q ss_pred EeeccceEEEEeecceEEEEecccc--------------hhhhcCcEEEEEEEEeccEEecccceEEEEecC Q lcl|NC_019933. 333 TGAFDAGAQVFDRWAARVEVATENQ--------------DDFIKNMVTILAEERLALAVYRPESFIKGSLAA 390 (394) Q Consensus 333 ~gd~~~~~~~~~~~~~~i~~~~~~~--------------~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~ 390 (394) ||||++ +.++.+.+++++++++.. .+|++|++.||+++|+|+++.+|+||++|+.+- T Consensus 234 ~gd~~~-~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 234 MGDWDY-ARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred EEehhh-EEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 999997 568888999998877632 459999999999999999999999999999988 No 82 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=100.00 E-value=2.2e-54 Score=314.73 Aligned_cols=294 Identities=12% Similarity=0.077 Sum_probs=243.7 Q ss_pred hhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEcCcccccceecCC Q lcl|NC_019933. 94 GGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTNAAAPVAEG 173 (394) Q Consensus 94 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eg 173 (394) .........+.+ .+...+++++|.+||+++.++|++.+++.++|+++++++|++++.+++|+.++. +.+.|++|| T Consensus 1 ~~~~~~~~~e~~----~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~-~~a~~v~Eg 75 (318) T protein:vir:24 1 MAAGTAFAVDHA----QIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWVGD-VSAQWIGEG 75 (318) T ss_pred CCCCCCCCHHHH----HhhcccCcccceeechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCC-cceEEecCC Confidence 111222222222 223344456777899999999999999999999999999999999999998864 678999999 Q ss_pred ccccccccceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccc Q lcl|NC_019933. 174 AQKPESSLRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAA 252 (394) Q Consensus 174 ~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~ 252 (394) +.+|+++++|+++++.++|+++++++|+|+++++ .+++++|.++|++++++++|.++|+|+|++ .+.|++........ T Consensus 76 ~~~~~~~~~f~~i~~~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~-~~~~~~~~~~~~~~ 154 (318) T protein:vir:24 76 DMKPITKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAMHGTDSP-FPTYIGQTTKAISI 154 (318) T ss_pred ccccccccceeEEEEeeEEEEEeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhhcccCCC-CCcccccccccccc Confidence 9999999999999999999999999999999987 589999999999999999999999999865 45677765544433 Q ss_pred cccccc-cchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcccCC------CceeecceEEEcCC Q lcl|NC_019933. 253 PITVAN-ATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQGTL------APTLWGLPVVATQA 325 (394) Q Consensus 253 ~~~~~~-~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~------~~~l~G~pv~~~~~ 325 (394) ...... ....+.+.++...+...+..+++|+|||.++..|+++||++|+|+|++....+ +..++|+||+.++. T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~ 234 (318) T protein:vir:24 155 ADTTGATTVYDQVAVNGLSLLVNDGKKWTHTLLDDITEPILNGAKDQNGRPLFIESTYGEAASPFRSGRIVARPTILSDH 234 (318) T ss_pred cccccccchHHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhccCCceeecCccccCccccccCceEEEEeeEEeCC Confidence 333333 33445667788888999999999999999999999999999999997543222 24799999999999 Q ss_pred CCcCc--eEEeeccceEEEEeecceEEEEecccc------------hhhhcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|NC_019933. 326 MAVGQ--FLTGAFDAGAQVFDRWAARVEVATENQ------------DDFIKNMVTILAEERLALAVYRPESFIKGSLAAA 391 (394) Q Consensus 326 ~p~~~--~~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a 391 (394) +|.++ +++|||+. +.++++.+++++++++.+ .+|++|++.+|+.+|+||++.+|+||++|+.+++ T Consensus 235 ~~~~~~~~~~gdfs~-~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~a 313 (318) T protein:vir:24 235 VVEGTTVGFMGDFSQ-LIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAEAFVALTNVVS 313 (318) T ss_pred CCCCccEEEEeecce-EEEEEecCeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeecc Confidence 99775 57899987 567888999998877643 4599999999999999999999999999999999 Q ss_pred CCC Q lcl|NC_019933. 392 AGT 394 (394) Q Consensus 392 ~~~ 394 (394) +|. T Consensus 314 ~~~ 316 (318) T protein:vir:24 314 GGG 316 (318) T ss_pred CCC Confidence 999 No 83 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=100.00 E-value=2.5e-54 Score=314.41 Aligned_cols=331 Identities=20% Similarity=0.228 Sum_probs=233.7 Q ss_pred HHHHHHHHHHHHhhc-----ccccccchhhhhhhhhHHHHHHHHHHhhhhhhhhH---H-HHHHHhhcccccCCcCcccc Q lcl|NC_019933. 53 LKAAQQRIAEVEGNG-----AGGDVQHISIGQQFVNSDSFKAMAESGGQRGRAEI---N-IKAAITSLSTNADGSAGATV 123 (394) Q Consensus 53 i~~~e~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~-~~~~~~~~~~~~~~~~g~~i 123 (394) +.+....-.+..... .......+..+ +...... .....+........ . ...........++++||++| T Consensus 1 ~a~~~a~~~~~~~~~~~~~~~~~~~~~kg~~--~~~~~~a-~a~~~g~~~~a~~~a~~~~~~~~~~~a~~~~~~~Gg~lv 77 (366) T protein:vir:57 1 MAAAVAVPVKAHSVAPGIIIKEELQQYKGAG--MTRMVMS-IAAGKGNLADAAKFAATELGDTGLSMAISTAAGSGGALI 77 (366) T ss_pred Ccccccccccccccccccccccccccccchh--HHHHHHH-HHhcccchhHHHHHHHHhhcchhhhhhccccccCCcccc Confidence 000000000000000 00000000000 0000000 00000000000000 0 00001112223445789999 Q ss_pred chhhhhHHHhhhhhhhhHHHh-ccccccccCceeEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHH Q lcl|NC_019933. 124 QTTRLPGILELPQRRMTIRSL-LAQGTMEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQ 202 (394) Q Consensus 124 p~~~~~~ii~~~~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e 202 (394) |+++.++|++.+++.++++++ ++++|+.++.+++|+.++. +.+.|++|++.+|+++++|++|++.++|++++++||+| T Consensus 78 P~~~~~~ii~~l~~~s~l~~lg~~~v~~~~g~~~~p~~t~~-~~a~wv~E~~~~~~s~~~f~~i~~~~~k~~~~~~iS~e 156 (366) T protein:vir:57 78 PQNMQNEVIELLRDRTVVRILGARSIPLPNGNLSMPRLSGG-ATAGYVGEGKDVVATGATFDDVKLSAKTMIALVPVSNQ 156 (366) T ss_pred chhHHHHHHHHHhhhcchhhhceeeeecCCCceEEEEEeCC-cceeeeccCccccccccceeEEEEeeEEEEEeehhhHH Confidence 999999999999999999998 8888988888999999864 68899999999999999999999999999999999999 Q ss_pred HHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccc----cccchHHHH---HHHHHHhhh Q lcl|NC_019933. 203 ILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITV----ANATAVDRL---RLALLQAQL 274 (394) Q Consensus 203 ~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~----~~~~~~~~i---~~~~~~~~~ 274 (394) +++++ ++++++|+++|++++++++|.+||+|+|+++.|.||++.++........ .+....+.+ +.+...... T Consensus 157 ll~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~ 236 (366) T protein:vir:57 157 LIGRAGFNVEQLLLGDILSAIATREDKAFLRDDGTGDTPKGMKAVATAANRLVAWTGTAINLTTIDEYLDSLILKHMDSN 236 (366) T ss_pred HHhhhhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeeccccccceeeccccccchhhHHHHHHHHHHhhhccc Confidence 99988 5999999999999999999999999999999999999876554332221 112222222 233334445 Q ss_pred hcCCCCeeEeCHHHHHHHHHhhccCCcccccCcccCCCceeecceEEEcCCCCcC--------ceEEeeccceEEEEeec Q lcl|NC_019933. 275 AEFPATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPTLWGLPVVATQAMAVG--------QFLTGAFDAGAQVFDRW 346 (394) Q Consensus 275 ~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~--------~~~~gd~~~~~~~~~~~ 346 (394) .+..++.|+||+.++.+|++++|++|+|+|+.. .+++|+|+||++++.||++ .++||||+. ++++++. T Consensus 237 ~~~~~a~~vmn~~~~~~L~~lkd~~G~~l~~~~---~~g~l~G~Pvv~s~~ip~~~~~~~~~~~i~~gdfs~-~~i~~~~ 312 (366) T protein:vir:57 237 SNMIRCGWGLSNRTYMTLFGLRDGNGNKVYPEM---SQGILKGYPIQRTSAIPANLGDDGNESEIYFCDFND-VVIGEDG 312 (366) T ss_pred cccccCEEEecHHHHHHHHhhhccCCceeccCC---CCCeecceeeEEccccccccccCCCccEEEEEecce-EEEEEec Confidence 566788999999999999999999999999643 3458999999999999863 488999986 5688999 Q ss_pred ceEEEEecccc---------hhhhcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|NC_019933. 347 AARVEVATENQ---------DDFIKNMVTILAEERLALAVYRPESFIKGSLAAA 391 (394) Q Consensus 347 ~~~i~~~~~~~---------~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a 391 (394) +++++++++.+ ..|++|++.+|+++|+||+++||+||++++=..= T Consensus 313 ~i~i~~~~ea~~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 313 MMKVDFSTEATYKDADGQLVSAFARNQSLIRVVTEHDIGFRHPEGLVLGTGVIW 366 (366) T ss_pred ceEEEEeeccccccccccchhhhhcCceeEEeeeeeCcEeeccccEEEEecccC Confidence 99999877642 4689999999999999999999999999984333 No 84 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=100.00 E-value=6.6e-53 Score=306.65 Aligned_cols=357 Identities=12% Similarity=0.097 Sum_probs=246.0 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhhhhh Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQHISIGQQ 80 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~ 80 (394) |.++.++.+++.++.+..+++ +++..+....++. .+.+.+.+++++.++.+......+ .. T Consensus 1 mt~~~~~~e~~~~~~e~~~~~-~~~~~~~~~~e~~---~~~~~~~~~~~~~~~~~~~~~e~~--~~-------------- 60 (395) T protein:vir:95 1 MADMKQNNVKLKNYHEHKKQF-ANLVQNGASDEEQ---SKAFGAMFDALSNDLQEEITAEIN--NR-------------- 60 (395) T ss_pred ChhHHHHHHHHHHHHHHHHHH-HHHHhhhhhHHHH---HHHHHHHHHHHHHHHHHHHHHHHH--HH-------------- Confidence 888888777776664443333 3322222222222 222222233333322211110000 00 Q ss_pred hhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEE Q lcl|NC_019933. 81 FVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRE 160 (394) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 160 (394) .........+.......+.+...+....+++++||++||+++.++|++.+++.++|+++|+++++++. .++|+. T Consensus 61 -----~~~~~~~~~r~~~~l~~ee~~~~~~~~~~t~~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v~~~~~~-~~i~~~ 134 (395) T protein:vir:95 61 -----VVDNGILAKRSQDPLTSEERKFFNDINYDVGYTDEKILPETVVERVFDDLQKDHPLLSKINFQNAGIK-TRVIKA 134 (395) T ss_pred -----HHHHHHHhhcCccccchHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCc-eEEEEe Confidence 00000000000011223334444556667788899999999999999999999999999999998764 689987 Q ss_pred cCcccccceecCCccc-cccccceeeEEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhccCCCc Q lcl|NC_019933. 161 TGFTNAAAPVAEGAQK-PESSLRFDLVQTSAKVIAHWMKASRQILSDSA-QLQSFINARLLRGLEVVEENQLLNGNGTGQ 238 (394) Q Consensus 161 ~~~~~~~~~~~eg~~~-~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~-~~~~~i~~~la~a~~~~~d~a~l~g~g~~~ 238 (394) .+. +.++|++|++.. ++++++|+++++.+++++++++||+|+++|++ ++++||.++|+++++.++|.+|++|+|+++ T Consensus 135 ~~~-~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~iS~ell~ds~~~ie~~i~~~la~~ia~~~~~a~i~G~G~~~ 213 (395) T protein:vir:95 135 DPA-GQAVWGKVFGEIKGQLDAAFREENFTQYKLTCFVVLPDDLSTFGPAWIERFVRTQIQEAISVALESAIINGGGAAK 213 (395) T ss_pred cCC-cceEEeecccccCccccccceeeeeceeeEEEeecccHHHHhcchhHHHHHHHHHHHHHHHHHHhhheeeccCCCC Confidence 664 678888886655 57899999999999999999999999999985 899999999999999999999999999985 Q ss_pred -ccccccccccccccc---ccccccchHHHHHHHHHHhh--------------hhcCCCCeeEeCHHHHHHHHHhhccCC Q lcl|NC_019933. 239 -NLLGLLPQATAFAAP---ITVANATAVDRLRLALLQAQ--------------LAEFPATGIVLNPADWAGIELLKDTQG 300 (394) Q Consensus 239 -~~~Gi~~~~~~~~~~---~~~~~~~~~~~i~~~~~~~~--------------~~~~~~~~~~~~~~~~~~l~~lkd~~G 300 (394) .|.||++.....+.. ...++..+++++...+..+. ..+..+..|+||+.++. |..| T Consensus 214 ~qP~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~~------~~~g 287 (395) T protein:vir:95 214 TQPVGLMKDVNTNSGAVTDKASSGTLTFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDSW------DVQA 287 (395) T ss_pred cCceeeeecccccccccccccccchhhhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhhh------hcCC Confidence 699999865443222 11222233343333332222 12345668999999864 5679 Q ss_pred cccccCcccCCCceee--cceEEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEe Q lcl|NC_019933. 301 RYILGNPQGTLAPTLW--GLPVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVY 378 (394) Q Consensus 301 ~~~~~~~~~~~~~~l~--G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~ 378 (394) +|+|++. .+.+.+++ |+||+.++.||+++++||||++ |.++++.+++++++++ .+|.+|++.||+..|+|++++ T Consensus 288 ~~~~~~~-~G~~~~~lg~g~~v~~~~~~p~~~i~fgdfs~-y~i~~r~~~~i~~~~~--~~~~~d~~~f~~~~r~dg~~~ 363 (395) T protein:vir:95 288 RYTYLTA-NGGFVTVLPYNVTIITSEFVPEGKLVAFVTDR-YNAVRGGGLTVKKFDQ--TLALEDAVLFTAKTFAYGQPD 363 (395) T ss_pred cceeccC-CCcceeccCCcceEEEcCCCCCCcEEEEeccc-EEEEEecceEEEeccc--hhhhCCcEEEEEEEEECCEEe Confidence 9999864 33444565 6678999999999999999998 8889999999988765 468899999999999999999 Q ss_pred cccceEEEEecCCCCC Q lcl|NC_019933. 379 RPESFIKGSLAAAAGT 394 (394) Q Consensus 379 ~~~a~~~l~~~~a~~~ 394 (394) +++||++|+++.+..- T Consensus 364 ~~~A~~~l~i~~~~~~ 379 (395) T protein:vir:95 364 DNKASAVYDLKVASAP 379 (395) T ss_pred ccccEEEEEeeccCCC Confidence 9999999987632211 No 85 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=100.00 E-value=2.2e-54 Score=314.71 Aligned_cols=281 Identities=16% Similarity=0.093 Sum_probs=233.2 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEcCcccccceecCCccccccccceeeEEeee Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSA 190 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~ 190 (394) |..++++.||+++|+++..+|++.+++.++|+++++++|++++.+++|+..+. +.+.|++||+.+|+++++|+++++.+ T Consensus 1 Ma~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~~~~~ip~~~~~-~~a~wv~Eg~~~~~s~~~f~~v~l~~ 79 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGV-PRAKIVGEGEVKPSASVDVSAFTAQP 79 (315) T ss_pred CCCCcCCcCceEcchHHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCC-cceEEeeCCccccccccceeeeEeee Confidence 77888889999999999999999999999999999999999888999998764 68899999999999999999999999 Q ss_pred eeEEEeehhhHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHhhccCC--CccccccccccccccccccccccchHH Q lcl|NC_019933. 191 KVIAHWMKASRQILSDSA-----QLQSFINARLLRGLEVVEENQLLNGNGT--GQNLLGLLPQATAFAAPITVANATAVD 263 (394) Q Consensus 191 ~k~~~~~~is~e~l~~s~-----~~~~~i~~~la~a~~~~~d~a~l~g~g~--~~~~~Gi~~~~~~~~~~~~~~~~~~~~ 263 (394) +|++++++||+|+++++. .++++|.++|++++++++|.++|+|++. +..+.|+.......+. ....+...++ T Consensus 80 ~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 158 (315) T protein:vir:80 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSLNKTKN-IVDATDSATA 158 (315) T ss_pred eeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCccccccccccccccc-eeeccccchH Confidence 999999999999998663 3889999999999999999999998753 4456665554332222 2233444678 Q ss_pred HHHHHHHHhhhh-cCCCCeeEeCHHHHHHHHHhhccCCc-----ccccCcccCCCceeecceEEEcCCCCcC-------- Q lcl|NC_019933. 264 RLRLALLQAQLA-EFPATGIVLNPADWAGIELLKDTQGR-----YILGNPQGTLAPTLWGLPVVATQAMAVG-------- 329 (394) Q Consensus 264 ~i~~~~~~~~~~-~~~~~~~~~~~~~~~~l~~lkd~~G~-----~~~~~~~~~~~~~l~G~pv~~~~~~p~~-------- 329 (394) ++.+++..+... +..+++|+|||.++..|++++|.+|+ |+|+....+++++|+|+||++++.||.+ T Consensus 159 d~~~~~~~~~~~~~~~~~~~imn~~~~~~L~~l~~~~g~~~~g~~~~~~~~~g~~~tl~G~PV~~~~~~~~~~~~~~~~~ 238 (315) T protein:vir:80 159 DLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASG 238 (315) T ss_pred HHHHHHHHHhhccCccceEEEEcHHHHHHHHHHhhccCCcccccccccccccCCCceecceeeEecCcCCcccccccccc Confidence 888888777655 44567899999999999999877665 5554444555679999999999999864 Q ss_pred -ceEEeeccceEEEEeecceEEEEecccc------hhhhcCcEEEEEEEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 330 -QFLTGAFDAGAQVFDRWAARVEVATENQ------DDFIKNMVTILAEERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 330 -~~~~gd~~~~~~~~~~~~~~i~~~~~~~------~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) .++||||++ +.+..+.+++++++++.. ..|++|++.||++.|+||++.+|+||++|+.+++... T Consensus 239 ~~~~~GDfs~-~~~g~~~~~~i~i~~~~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~a~~~ 309 (315) T protein:vir:80 239 VKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKP 309 (315) T ss_pred cEEEEeeccc-EEEEEecCeeEEEeccccccCcccchhhcCcEEEEEEEEecceeecccceEEEeeccCCCC Confidence 368899997 556778899998877643 4599999999999999999999999999997775322 No 86 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=100.00 E-value=3.3e-54 Score=313.77 Aligned_cols=275 Identities=17% Similarity=0.118 Sum_probs=234.0 Q ss_pred cCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEcCcccccceecCCccccccccceeeEEeeeeeEE Q lcl|NC_019933. 115 ADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIA 194 (394) Q Consensus 115 ~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~ 194 (394) -..+||.++|+++.++|++.++++++|++++++++++++.+++|+.++. +.+.|++||+.+|+++++|+++++.++|++ T Consensus 1 ma~~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~-~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~ 79 (298) T protein:vir:94 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMD-SEIDVVAESGKKTHGGVTLAPQTMVPIKVE 79 (298) T ss_pred CeeccccccChhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecC-cceEEeeCCccccccccceeEEEEeeeEEE Confidence 3346799999999999999999999999999999999888999998764 678999999999999999999999999999 Q ss_pred EeehhhHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHhhccC----CCccccccccccccc--cccccccccchHHH Q lcl|NC_019933. 195 HWMKASRQILSDS----AQLQSFINARLLRGLEVVEENQLLNGNG----TGQNLLGLLPQATAF--AAPITVANATAVDR 264 (394) Q Consensus 195 ~~~~is~e~l~~s----~~~~~~i~~~la~a~~~~~d~a~l~g~g----~~~~~~Gi~~~~~~~--~~~~~~~~~~~~~~ 264 (394) ++++||+|+++++ .+++++|.++|++++++++|.++|+|.+ ....+.|+....... ...........+++ T Consensus 80 ~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (298) T protein:vir:94 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGA 159 (298) T ss_pred EeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccccccccccccccccccccccccccHHHH Confidence 9999999999643 4799999999999999999999999843 333333332222221 12223334556889 Q ss_pred HHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCc-ccCCCceeecceEEEcCCCCcC------ceEEeecc Q lcl|NC_019933. 265 LRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNP-QGTLAPTLWGLPVVATQAMAVG------QFLTGAFD 337 (394) Q Consensus 265 i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~-~~~~~~~l~G~pv~~~~~~p~~------~~~~gd~~ 337 (394) +.+++..+...+..+++|+|||+++.+|+++||++|+|+|++. .++.+++|+|+||++++.+|.+ .+++|||+ T Consensus 160 i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~G~PV~~~~~v~~~~~~~~~~~~~Gdfs 239 (298) T protein:vir:94 160 IENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFA 239 (298) T ss_pred HHHHHHhhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCCceecceeeEEecccccccCCCccEEEEeecc Confidence 9999999999999999999999999999999999999999764 4455679999999999999853 57889999 Q ss_pred ceEEEEeecceEEEEecccc------hhhhcCcEEEEEEEEeccEEecccceEEEEecC Q lcl|NC_019933. 338 AGAQVFDRWAARVEVATENQ------DDFIKNMVTILAEERLALAVYRPESFIKGSLAA 390 (394) Q Consensus 338 ~~~~~~~~~~~~i~~~~~~~------~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~ 390 (394) .++.++.+.++++++.++.. .+|++|++.||++.|+||++.+|+||++++-++ T Consensus 240 ~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 240 NGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred ceEEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 98888889999999877542 369999999999999999999999999999888 No 87 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=100.00 E-value=6e-54 Score=312.38 Aligned_cols=278 Identities=18% Similarity=0.148 Sum_probs=234.0 Q ss_pred cccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEcCcccccceecCCccccccccceeeEEeeeee Q lcl|NC_019933. 113 TNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKV 192 (394) Q Consensus 113 ~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k 192 (394) --+.++||+++|+++.+.|++.++++++|+++++++|++++..++|+.++. +.+.|++||+.+|+++++|+++++.++| T Consensus 1 mat~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~~~~p~~~~~-~~a~wv~Eg~~~~~~~~~f~~v~l~~~k 79 (311) T protein:vir:81 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAP-PRGEVVGEGAQKSESTATFAPVTAIPRK 79 (311) T ss_pred CceecCCceEcchhHHHHHHHHHHhcchhhhhcceeecCCCceEEEEEeCC-ceeEEeecCcccccccceeeEEEEeeEE Confidence 333455899999999999999999999999999999999888999998765 6789999999999999999999999999 Q ss_pred EEEeehhhHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHhhccC--CCcccccccccccccc--ccccccc-cchHH Q lcl|NC_019933. 193 IAHWMKASRQILSDS----AQLQSFINARLLRGLEVVEENQLLNGNG--TGQNLLGLLPQATAFA--APITVAN-ATAVD 263 (394) Q Consensus 193 ~~~~~~is~e~l~~s----~~~~~~i~~~la~a~~~~~d~a~l~g~g--~~~~~~Gi~~~~~~~~--~~~~~~~-~~~~~ 263 (394) ++++++||+|+++++ .+++++|.+++++++++++|.++++|++ ++..+.|+.+...... ...+..+ ...+. T Consensus 80 l~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~~~~~~~~~~~~ 159 (311) T protein:vir:81 80 VQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPDL 159 (311) T ss_pred EEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccccceeeeecccccchHHH Confidence 999999999999633 3699999999999999999999999964 5566778776543222 2222222 23445 Q ss_pred HHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCc-ccCCCceeecceEEEcCCCCcC------------- Q lcl|NC_019933. 264 RLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNP-QGTLAPTLWGLPVVATQAMAVG------------- 329 (394) Q Consensus 264 ~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~-~~~~~~~l~G~pv~~~~~~p~~------------- 329 (394) ++..++..+...+..+++|+|||.++.+|++|||++|+|+|++. ....+++|+|+||++++.+|.+ T Consensus 160 ~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~G~Pv~~~~~i~~~~~~~~~~~~~~~~ 239 (311) T protein:vir:81 160 AVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRT 239 (311) T ss_pred HHHHHHHHhhhcCCCceEEEEcHHHHHHHHhhhccCCCeeecCccccCCCceecceeEEecccccccccccccccchhcc Confidence 66667777777777888899999999999999999999999754 4455679999999999988753 Q ss_pred -----ceEEeeccceEEEEeecceEEEEecccc-----hhhhcCcEEEEEEEEeccEEecccceEEEEecCCC Q lcl|NC_019933. 330 -----QFLTGAFDAGAQVFDRWAARVEVATENQ-----DDFIKNMVTILAEERLALAVYRPESFIKGSLAAAA 392 (394) Q Consensus 330 -----~~~~gd~~~~~~~~~~~~~~i~~~~~~~-----~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~ 392 (394) .++||||+. +.+..+.+++++++++.. .+|++|++.||++.|+||++.+|+||++|+.+.-| T Consensus 240 ~~~~~~~~~gDfs~-~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~a~~~ 311 (311) T protein:vir:81 240 TNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) T ss_pred cCCccEEEEEeccc-EEEEEeccceEEEeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 368999987 667778899999987643 35999999999999999999999999999988888 No 88 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=100.00 E-value=8.3e-53 Score=306.11 Aligned_cols=342 Identities=12% Similarity=0.107 Sum_probs=236.8 Q ss_pred Cc--hHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhhcccccccchhh Q lcl|NC_019933. 1 MS--DINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAA-QQRIAEVEGNGAGGDVQHISI 77 (394) Q Consensus 1 Mk--~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~-e~~~~~~~~~~~~~~~~~~~~ 77 (394) |+ ..+++.++++++.+.++. ..... +..+..++. +.++..++... +...... T Consensus 1 m~ik~~~~~~~~~~e~~~~~~~----~~~~~----~~~~~~~~~---~~~~~~~~~~~~~~e~~~~-------------- 55 (381) T protein:vir:95 1 MTINLSETFANAKNEFINAVNN----GEPQE----RQNELYGDM---INQLFEETKLQAKAEAERV-------------- 55 (381) T ss_pred CchhhHHHHHHHHHHHHHHHhh----hhhhH----HHHHHHHHH---HHhhhhhHHHHHHHHHHHH-------------- Confidence 44 444444444333332221 11110 000111111 11111111100 0000000 Q ss_pred hhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeE Q lcl|NC_019933. 78 GQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEY 157 (394) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 157 (394) ....+.......+.+...+....+++++||++||+++.+.|++.+++.++|+++|++.++++. .++ T Consensus 56 -------------~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~-~~i 121 (381) T protein:vir:95 56 -------------SSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLR-LKF 121 (381) T ss_pred -------------HHhccCcccccHHHHHHHHHHhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecCcc-eEE Confidence 000000111112223334555667778899999999999999999999999999999998765 689 Q ss_pred EEEcCcccccceecCCcccc-ccccceeeEEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_019933. 158 VRETGFTNAAAPVAEGAQKP-ESSLRFDLVQTSAKVIAHWMKASRQILSDSA-QLQSFINARLLRGLEVVEENQLLNGNG 235 (394) Q Consensus 158 ~~~~~~~~~~~~~~eg~~~~-~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~-~~~~~i~~~la~a~~~~~d~a~l~g~g 235 (394) |+..+ .+.+.|++|++.++ +++++|+++++.+++++++++||+++++|++ ++++||.++|+++++.++|.+|++|+| T Consensus 122 ~~~~~-~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G 200 (381) T protein:vir:95 122 LKSET-SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTG 200 (381) T ss_pred EEecC-CcceeeecccccccccccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEeccC Confidence 98765 47888999988776 5689999999999999999999999999985 899999999999999999999999999 Q ss_pred CCccccccccccccc-cccc------------c-ccccchHHHHHHHHHHhhh-------hcCCCCeeEeCHHHHHHHHH Q lcl|NC_019933. 236 TGQNLLGLLPQATAF-AAPI------------T-VANATAVDRLRLALLQAQL-------AEFPATGIVLNPADWAGIEL 294 (394) Q Consensus 236 ~~~~~~Gi~~~~~~~-~~~~------------~-~~~~~~~~~i~~~~~~~~~-------~~~~~~~~~~~~~~~~~l~~ 294 (394) ++ .|.||++..... ..+. + .+....++.+.++...+.. .+..+..|+|||.++..|+. T Consensus 201 ~~-qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~ 279 (381) T protein:vir:95 201 KD-QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQA 279 (381) T ss_pred CC-CceeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhcc Confidence 76 578998754321 1110 0 1112234444444433332 34556789999999988876 Q ss_pred hh---ccCCcccccCcccCCCceeecceEEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEE Q lcl|NC_019933. 295 LK---DTQGRYILGNPQGTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEE 371 (394) Q Consensus 295 lk---d~~G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 371 (394) ++ +.+|+|++..+ .|++|+.++.||+++++||||+. |.++++.+++++.+++ ..|.+|++.||+.. T Consensus 280 ~~~~~~~~G~~v~~l~--------~g~~vv~s~~~p~~~iifgDfs~-Y~i~~r~~~~i~~~~~--~~~~~d~~~f~a~~ 348 (381) T protein:vir:95 280 QYTHLNANGVYVTALP--------FNLNVIESTVQEAGKVLTYVKGL-YDGYLAGGINVQKFKE--TLALDDMDLYTAKQ 348 (381) T ss_pred ccccCCCCCceeecCC--------CCceEEecCCCCcCcEEEEeccc-EEEEEecccEEEeech--hHhhcCCeEEEEEE Confidence 55 56777765321 46789999999999999999997 8899999999988765 46899999999999 Q ss_pred EeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 372 RLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 372 ~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) |+|+++++++||++++++..+++ T Consensus 349 r~dg~~~~~~A~~v~~l~~~~~~ 371 (381) T protein:vir:95 349 FAYGKAKDNKVAAVWKLDLKGHK 371 (381) T ss_pred EEcCEEecCceEEEEEEEecCCC Confidence 99999999999999998887777 No 89 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=100.00 E-value=8.3e-53 Score=306.11 Aligned_cols=342 Identities=12% Similarity=0.107 Sum_probs=236.8 Q ss_pred Cc--hHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhhcccccccchhh Q lcl|NC_019933. 1 MS--DINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAA-QQRIAEVEGNGAGGDVQHISI 77 (394) Q Consensus 1 Mk--~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~-e~~~~~~~~~~~~~~~~~~~~ 77 (394) |+ ..+++.++++++.+.++. ..... +..+..++. +.++..++... +...... T Consensus 1 m~ik~~~~~~~~~~e~~~~~~~----~~~~~----~~~~~~~~~---~~~~~~~~~~~~~~e~~~~-------------- 55 (381) T protein:vir:10 1 MTINLSETFANAKNEFINAVNN----GEPQE----RQNELYGDM---INQLFEETKLQAKAEAERV-------------- 55 (381) T ss_pred CchhhHHHHHHHHHHHHHHHhh----hhhhH----HHHHHHHHH---HHhhhhhHHHHHHHHHHHH-------------- Confidence 44 444444444333332221 11110 000111111 11111111100 0000000 Q ss_pred hhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeE Q lcl|NC_019933. 78 GQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEY 157 (394) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 157 (394) ....+.......+.+...+....+++++||++||+++.+.|++.+++.++|+++|++.++++. .++ T Consensus 56 -------------~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~-~~i 121 (381) T protein:vir:10 56 -------------SSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLR-LKF 121 (381) T ss_pred -------------HHhccCcccccHHHHHHHHHHhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecCcc-eEE Confidence 000000111112223334555667778899999999999999999999999999999998765 689 Q ss_pred EEEcCcccccceecCCcccc-ccccceeeEEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_019933. 158 VRETGFTNAAAPVAEGAQKP-ESSLRFDLVQTSAKVIAHWMKASRQILSDSA-QLQSFINARLLRGLEVVEENQLLNGNG 235 (394) Q Consensus 158 ~~~~~~~~~~~~~~eg~~~~-~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~-~~~~~i~~~la~a~~~~~d~a~l~g~g 235 (394) |+..+ .+.+.|++|++.++ +++++|+++++.+++++++++||+++++|++ ++++||.++|+++++.++|.+|++|+| T Consensus 122 ~~~~~-~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G 200 (381) T protein:vir:10 122 LKSET-SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTG 200 (381) T ss_pred EEecC-CcceeeecccccccccccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEeccC Confidence 98765 47888999988776 5689999999999999999999999999985 899999999999999999999999999 Q ss_pred CCccccccccccccc-cccc------------c-ccccchHHHHHHHHHHhhh-------hcCCCCeeEeCHHHHHHHHH Q lcl|NC_019933. 236 TGQNLLGLLPQATAF-AAPI------------T-VANATAVDRLRLALLQAQL-------AEFPATGIVLNPADWAGIEL 294 (394) Q Consensus 236 ~~~~~~Gi~~~~~~~-~~~~------------~-~~~~~~~~~i~~~~~~~~~-------~~~~~~~~~~~~~~~~~l~~ 294 (394) ++ .|.||++..... ..+. + .+....++.+.++...+.. .+..+..|+|||.++..|+. T Consensus 201 ~~-qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~ 279 (381) T protein:vir:10 201 KD-QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQA 279 (381) T ss_pred CC-CceeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhcc Confidence 76 578998754321 1110 0 1112234444444433332 34556789999999988876 Q ss_pred hh---ccCCcccccCcccCCCceeecceEEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEE Q lcl|NC_019933. 295 LK---DTQGRYILGNPQGTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEE 371 (394) Q Consensus 295 lk---d~~G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 371 (394) ++ +.+|+|++..+ .|++|+.++.||+++++||||+. |.++++.+++++.+++ ..|.+|++.||+.. T Consensus 280 ~~~~~~~~G~~v~~l~--------~g~~vv~s~~~p~~~iifgDfs~-Y~i~~r~~~~i~~~~~--~~~~~d~~~f~a~~ 348 (381) T protein:vir:10 280 QYTHLNANGVYVTALP--------FNLNVIESTVQEAGKVLTYVKGL-YDGYLAGGINVQKFKE--TLALDDMDLYTAKQ 348 (381) T ss_pred ccccCCCCCceeecCC--------CCceEEecCCCCcCcEEEEeccc-EEEEEecccEEEeech--hHhhcCCeEEEEEE Confidence 55 56777765321 46789999999999999999997 8899999999988765 46899999999999 Q ss_pred EeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 372 RLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 372 ~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) |+|+++++++||++++++..+++ T Consensus 349 r~dg~~~~~~A~~v~~l~~~~~~ 371 (381) T protein:vir:10 349 FAYGKAKDNKVAAVWKLDLKGHK 371 (381) T ss_pred EEcCEEecCceEEEEEEEecCCC Confidence 99999999999999998887777 No 90 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=100.00 E-value=4.2e-53 Score=307.73 Aligned_cols=295 Identities=14% Similarity=0.084 Sum_probs=235.7 Q ss_pred hhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEcCcccccceecCC Q lcl|NC_019933. 94 GGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTNAAAPVAEG 173 (394) Q Consensus 94 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eg 173 (394) +........+.++ +....++.+|.+||+++.++|++.+++.++|++++++++++++.+++|+..+. +.+.|++|+ T Consensus 1 ~~~~~~~~~~~~~----~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~-~~a~~v~E~ 75 (320) T protein:vir:10 1 MAAGTAFQVDHAQ----IAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWIGD-VSAQWIGEG 75 (320) T ss_pred CCCCccCCHHHHH----hhccccccccccccHHHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCC-cceEEecCC Confidence 2222222333332 22333455666899999999999999999999999999999988999998764 678899999 Q ss_pred ccccccccceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCCccc--cccccccccc Q lcl|NC_019933. 174 AQKPESSLRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGTGQNL--LGLLPQATAF 250 (394) Q Consensus 174 ~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~--~Gi~~~~~~~ 250 (394) +.+|+++++|+++++.++|++++++||+|+++++ .+++++|.+.|++++++++|+++|+|+|++.+. .++.+..... T Consensus 76 ~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~ 155 (320) T protein:vir:10 76 DMKPITKGNMTSQNIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALNGTDSPFPTYLAQTTKSVSLA 155 (320) T ss_pred ccccccccceeEEEEeeEEEEEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCcccccccccccce Confidence 9999999999999999999999999999999987 589999999999999999999999999876532 2333322222 Q ss_pred ccccc-ccccch-HHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcccC------CCceeecceEEE Q lcl|NC_019933. 251 AAPIT-VANATA-VDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQGT------LAPTLWGLPVVA 322 (394) Q Consensus 251 ~~~~~-~~~~~~-~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~------~~~~l~G~pv~~ 322 (394) ..... .+.... .+.+.++...+...+..+++|+|||.++.+|+++||++|+|+|++.... ..++++|+||+. T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~~ 235 (320) T protein:vir:10 156 DPGGATASDLTAYDAVAVNGLSLLVNAKKKWTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDENSPFRAGRIVSRPTIL 235 (320) T ss_pred ecccccccccccHHHHHHHHHhhhhcccCCCcEEEEcHHHHHHHHHhhccCCceeeccccccCccccccCceeeeeeeEe Confidence 22111 122222 3356777888888999999999999999999999999999999754322 135799999999 Q ss_pred cCCCCcCc--eEEeeccceEEEEeecceEEEEecccc------------hhhhcCcEEEEEEEEeccEEecccceEEEEe Q lcl|NC_019933. 323 TQAMAVGQ--FLTGAFDAGAQVFDRWAARVEVATENQ------------DDFIKNMVTILAEERLALAVYRPESFIKGSL 388 (394) Q Consensus 323 ~~~~p~~~--~~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~ 388 (394) ++.+|+++ ++||||+. ++++.+.+++++++++.+ .+|++|++.||+++|+|+++.+|+||++|+. T Consensus 236 ~~~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~~~~a~~~l~~ 314 (320) T protein:vir:10 236 SDHVADGTTVGYMGDFRN-VIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFHNNDKDAFVKLTN 314 (320) T ss_pred cCCCCCCceEEEEeecce-EEEEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEEEeeccEEecccceEEEEe Confidence 99999986 56899987 458888999999887644 4599999999999999999999999999996 Q ss_pred cCCCCC Q lcl|NC_019933. 389 AAAAGT 394 (394) Q Consensus 389 ~~a~~~ 394 (394) .++... T Consensus 315 ~~ap~~ 320 (320) T protein:vir:10 315 VVTPDA 320 (320) T ss_pred ccCCCC Confidence 665444 No 91 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=100.00 E-value=1.5e-52 Score=304.66 Aligned_cols=342 Identities=11% Similarity=0.094 Sum_probs=232.9 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhhcccccccchhhhh Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQ-QRIAEVEGNGAGGDVQHISIGQ 79 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e-~~~~~~~~~~~~~~~~~~~~~~ 79 (394) ||..+++++.+.++...++.. ...+ +..+.+++... ++..+..... ....... T Consensus 3 ~kl~~~~~~~~~~~~~~~~~~----~~~~----~~~~~~~~~~~---~~~~~~~~~~~~e~~~~~--------------- 56 (381) T protein:vir:10 3 INLSETFANAKNEFINAVNNG----EPQE----RQNELYGDMIN---QLFEETKLQAKAEAERVS--------------- 56 (381) T ss_pred hhHHHHHHHHHHHHHHHHHhh----hHHH----HHHHHHHHHHH---hhhhhHHHHHHHHHHHHH--------------- Confidence 444555554444433332211 1100 00011111111 1111111000 0000000 Q ss_pred hhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEE Q lcl|NC_019933. 80 QFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVR 159 (394) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~ 159 (394) . ..+.......+.+...+....+++++||++||+++.+.|++.+.+.++|+++|+++++++ ..++|+ T Consensus 57 ---------~---~~~~~~~l~~~e~~~~~~~~~~t~~~Gg~lvP~~~~~~I~~~l~~~spir~~a~v~~~~~-~~~i~~ 123 (381) T protein:vir:10 57 ---------S---LPKSAQTLSANQRNFFMDINKSVGYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLK 123 (381) T ss_pred ---------H---hcccccccCHHHHHHHHHHhhcCCCCCceecCHHHHHHHHHHHHhhcceeeeeeeEecCc-ceEEEe Confidence 0 000001112223333445666777889999999999999999999999999999999866 468888 Q ss_pred EcCcccccceecCCcccc-ccccceeeEEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhccCCC Q lcl|NC_019933. 160 ETGFTNAAAPVAEGAQKP-ESSLRFDLVQTSAKVIAHWMKASRQILSDSA-QLQSFINARLLRGLEVVEENQLLNGNGTG 237 (394) Q Consensus 160 ~~~~~~~~~~~~eg~~~~-~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~-~~~~~i~~~la~a~~~~~d~a~l~g~g~~ 237 (394) .++ .+.++|++|++..+ +++++|+++++.++|++++++||+++|+|++ ++++||+..+++++++++|.+|++|+|++ T Consensus 124 ~~~-~~~a~W~~e~~~~~~~~~~~f~~i~l~~~kl~a~i~is~elL~Ds~~~le~~i~~~la~~~a~~~~~afi~GdG~~ 202 (381) T protein:vir:10 124 SET-SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKD 202 (381) T ss_pred ecC-CcceEEeecccccccccCccceeEeecceeEEeeccccHHHHhccHHHHHHHHHHHHHHHHHHHhhceeEecccCC Confidence 765 46788999977654 6789999999999999999999999999985 89999999999999999999999999976 Q ss_pred cccccccccccccc-ccc------cccccch-------HHHHHHHHHHhh-------hhcCCCCeeEeCHHHHHHHHHhh Q lcl|NC_019933. 238 QNLLGLLPQATAFA-API------TVANATA-------VDRLRLALLQAQ-------LAEFPATGIVLNPADWAGIELLK 296 (394) Q Consensus 238 ~~~~Gi~~~~~~~~-~~~------~~~~~~~-------~~~i~~~~~~~~-------~~~~~~~~~~~~~~~~~~l~~lk 296 (394) .|.||++...... ... ...+..+ +..+...+..+. ..+..+..|+|||.++..|+.++ T Consensus 203 -qP~Gil~~~~~~~~~~~g~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~~~ 281 (381) T protein:vir:10 203 -QPIGLNRQVQKGVSVTDGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQY 281 (381) T ss_pred -CceeeeecCCccccccccccccccccccccccchhhHHHHHHHHHHhhhhhhccccccccCceEEEEchhhHHhhcccc Confidence 5789987432211 110 0111112 222222221111 12445678999999988887544 Q ss_pred ---ccCCcccccCcccCCCceeecceEEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEe Q lcl|NC_019933. 297 ---DTQGRYILGNPQGTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERL 373 (394) Q Consensus 297 ---d~~G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 373 (394) +.+|+|++..+ .|+||+.++.||+++++||||++ |.++++.+++++.+++ .+|.+|++.|++..|+ T Consensus 282 ~~~~~~G~~v~~lp--------~g~~vv~~~~~p~~~i~fGDfs~-Y~i~~r~~~~i~~~~~--~~~~~d~~~f~a~~r~ 350 (381) T protein:vir:10 282 THLNANGVYVTALP--------FNLNVIESTVQEAGKVLTYVKGL-YDGYLAGGINVQKFKE--TLALDDMDLYTAKQFA 350 (381) T ss_pred ccCCCCCceeecCC--------CCceeEEcCCCCcCcEEEEEccc-EEEEEecccEEEeech--hhhhcCceEEEEEEEE Confidence 77888876422 57899999999999999999997 8999999999988765 4688999999999999 Q ss_pred ccEEecccceEEEEecCCCCC Q lcl|NC_019933. 374 ALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 374 d~~v~~~~a~~~l~~~~a~~~ 394 (394) |+++++++||++++++..... T Consensus 351 dG~~~~~~A~~v~~l~~~~~~ 371 (381) T protein:vir:10 351 YGKAKDNKVAAVWKLDLKGHK 371 (381) T ss_pred cCEEecCCcEEEEEEeecCCc Confidence 999999999999998765533 No 92 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=100.00 E-value=8.3e-53 Score=306.10 Aligned_cols=281 Identities=15% Similarity=0.123 Sum_probs=238.1 Q ss_pred hhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCc-eeEEEEcCcccccceecCCccccc Q lcl|NC_019933. 100 AEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNT-LEYVRETGFTNAAAPVAEGAQKPE 178 (394) Q Consensus 100 ~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~eg~~~~~ 178 (394) +..+. ..+.+..+++++|.+||+++.++|++.+++.++|++++++++++++. ..+|+..+ .+.+.|++||+.+|+ T Consensus 1 m~~~~---~~~~~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~-~~~a~~v~Eg~~~~~ 76 (297) T protein:vir:95 1 MTVQT---FNPENVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTD-GISAYWVNETEKIKT 76 (297) T ss_pred CCccc---cccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcC-CceeEEeecCccccc Confidence 12111 12334445567788999999999999999999999999999997654 56777655 467899999999999 Q ss_pred cccceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCCcccccccccccccccccccc Q lcl|NC_019933. 179 SSLRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVA 257 (394) Q Consensus 179 ~~~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~ 257 (394) ++++|++++++++|++++++||+|+++++ .+++++|.+++++++++++|.++|+|+|++ .+.|+++..... ..... T Consensus 77 ~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~-~~~gi~~~~~~~--~~~~~ 153 (297) T protein:vir:95 77 DKPEVVPVTLKAHKLGIILVTSREALNYTWKKFFEDMKPQIVEAFYKKIDEAGLLGHDTP-FANSVAKAAKDA--NKVIG 153 (297) T ss_pred cccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCc-cccccccccccc--ceecc Confidence 99999999999999999999999999988 589999999999999999999999999865 467887654432 23334 Q ss_pred ccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcccCCCceeecceEEEcCC--CCcCceEEee Q lcl|NC_019933. 258 NATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPTLWGLPVVATQA--MAVGQFLTGA 335 (394) Q Consensus 258 ~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~l~G~pv~~~~~--~p~~~~~~gd 335 (394) +..+++++.+++.++...+..+++|+|||.++.+|++++|++|+|+|+. .+++|+|+||+.+.. ++++.+++|| T Consensus 154 ~~~t~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~G~~i~~~----~~~~l~G~Pv~~~~~~~~~~~~~~~gd 229 (297) T protein:vir:95 154 GPINYDNILKLQDALYDADVEPNAFVSKIQNRSALREARDGNKVSIYDK----AANTIDGITTVDLKSARFEKGDLLAGD 229 (297) T ss_pred cccCHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCceeecC----CCCcccceeeEeecCCCCCCceEEEEe Confidence 5668999999999999999999999999999999999999999999853 345799999997654 5778999999 Q ss_pred ccceEEEEeecceEEEEecccc------------hhhhcCcEEEEEEEEeccEEecccceEEEEecCCC Q lcl|NC_019933. 336 FDAGAQVFDRWAARVEVATENQ------------DDFIKNMVTILAEERLALAVYRPESFIKGSLAAAA 392 (394) Q Consensus 336 ~~~~~~~~~~~~~~i~~~~~~~------------~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~ 392 (394) |+. +.++.+.+++++++++.+ .+|++|++.+|+++|+||++.+|+||++|+.++.= T Consensus 230 ~s~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~at~~ 297 (297) T protein:vir:95 230 FDN-LIYGVPYNITYKISEEGQISTITNADGTPINLFEQEMIAIRATMDIAVMITKTDAFAKLTPAERV 297 (297) T ss_pred ccc-EEEEEecCeEEEEeeccccccccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEeecCCC Confidence 997 557888999999887653 45999999999999999999999999999877666 No 93 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=100.00 E-value=1.4e-51 Score=299.44 Aligned_cols=342 Identities=12% Similarity=0.063 Sum_probs=235.7 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhhhhh Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQHISIGQQ 80 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~ 80 (394) |+++++++++++++.+++++. . ..++..+.+++ ....++.++.+......+......... ... T Consensus 5 ~~~~~~~~e~~~~l~~~~~~~-------~-~~e~~~~~~~~---~~~~~~~~~~~~~~~e~~~~~~~~~~~---~~l--- 67 (377) T protein:vir:96 5 LKELPKYREAVAELSAKISAG-------A-TPEEQEKLFEA---AFTTMGDEILAKNEEEMERMFDLRDKN---REL--- 67 (377) T ss_pred HHHHHHHHHHHHHHHHHHhhc-------c-cHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhccCC---ccc--- Confidence 445566665555555444321 1 11122222222 223333333321111000000000000 000 Q ss_pred hhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEE Q lcl|NC_019933. 81 FVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRE 160 (394) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 160 (394) ...+.+........+++++||++||+++.+.|++.+.+.++++++|++.++++ ..++|+. T Consensus 68 -------------------t~ee~~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~-~~~i~~~ 127 (377) T protein:vir:96 68 -------------------TAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKALTA 127 (377) T ss_pred -------------------CHHHHHHHHHHHhcCCCCCCceecCHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEe Confidence 01111111122334567788999999999999999999999999999999865 4789987 Q ss_pred cCcccccceecCCcccc-ccccceeeEEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhccCCCc Q lcl|NC_019933. 161 TGFTNAAAPVAEGAQKP-ESSLRFDLVQTSAKVIAHWMKASRQILSDSA-QLQSFINARLLRGLEVVEENQLLNGNGTGQ 238 (394) Q Consensus 161 ~~~~~~~~~~~eg~~~~-~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~-~~~~~i~~~la~a~~~~~d~a~l~g~g~~~ 238 (394) .+ .+.+.|++|++.++ +++++|+++++.+++++++++||+++|+|++ ++++||++++++++++++|.+|++|+|++ T Consensus 128 ~~-~~~a~wv~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~~ll~ds~~~le~~i~~~l~~~~~~~~~~a~i~G~G~~- 205 (377) T protein:vir:96 128 ET-SGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLL- 205 (377) T ss_pred cC-CcceeEeecccccccccCccceeEeeeeeeEEeechhhHHHhhcchhhHHHHHHHHHHHHHHHHHhhceEeccCCC- Confidence 65 47889999988765 5789999999999999999999999999986 89999999999999999999999999976 Q ss_pred ccccccccccccccccc----------------ccccchHHHHHHHHHHhhhhcC-----------CCCeeEeCHHHHHH Q lcl|NC_019933. 239 NLLGLLPQATAFAAPIT----------------VANATAVDRLRLALLQAQLAEF-----------PATGIVLNPADWAG 291 (394) Q Consensus 239 ~~~Gi~~~~~~~~~~~~----------------~~~~~~~~~i~~~~~~~~~~~~-----------~~~~~~~~~~~~~~ 291 (394) .|.||++.....++... .....+.+.++++...+...+. .+.+|+|||.++.. T Consensus 206 ~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~ 285 (377) T protein:vir:96 206 QPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWT 285 (377) T ss_pred cceeeeeccccccccccccccccceeeccccccccccCChhHHHHHHHHHHHhhccccccccccccCceEEEEchhhHHh Confidence 58899986443222111 1111234455555444443332 34579999999765 Q ss_pred HHHhhccCCcccccCcccCCCceeecce--EEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEE Q lcl|NC_019933. 292 IELLKDTQGRYILGNPQGTLAPTLWGLP--VVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILA 369 (394) Q Consensus 292 l~~lkd~~G~~~~~~~~~~~~~~l~G~p--v~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 369 (394) + .|++.|++.. +...+++|+| ++.++.||+++++||||++ |.++++.+++|+.+++ ..|.+|++.||+ T Consensus 286 ~------~~~~~~~~~~-G~~~~~l~~p~~v~~s~~~p~~~i~fgdf~~-Y~i~~r~~~~i~~~~~--~~~~~d~~~f~~ 355 (377) T protein:vir:96 286 L------EAKFTSRNQF-GEYVTVLPHGITILESLAVETGKAIAFVANR-YDAFMATASTIEEYDQ--TFAMEDLQLYLT 355 (377) T ss_pred c------cccccccCCC-CCceeccCCCceEEecCCCCcccEEEEEcCc-EEEEEecccEEEeehh--hhhhcCCeEEEE Confidence 4 3556655432 2233677766 6778899999999999998 8999999999988765 468899999999 Q ss_pred EEEeccEEecccceEEEEecCC Q lcl|NC_019933. 370 EERLALAVYRPESFIKGSLAAA 391 (394) Q Consensus 370 ~~~~d~~v~~~~a~~~l~~~~a 391 (394) ..|+|+++.+++||++++++.. T Consensus 356 ~~r~dG~~~d~~a~~vl~l~~~ 377 (377) T protein:vir:96 356 KNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred EEEEcCEEecCCcEEEEEEecC Confidence 9999999999999999999888 No 94 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=100.00 E-value=9.8e-52 Score=300.22 Aligned_cols=296 Identities=14% Similarity=0.098 Sum_probs=235.4 Q ss_pred hhhhHHHHHHH--hhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEcCcc-------cccc Q lcl|NC_019933. 98 GRAEINIKAAI--TSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRETGFT-------NAAA 168 (394) Q Consensus 98 ~~~~~~~~~~~--~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~-------~~~~ 168 (394) ...-.+.+... .......++.++.+||+++.++|++.+++.++|+++|+++|++++.+++|+..... ..+. T Consensus 1 ~~~~~e~~~~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~~~~~~ 80 (338) T protein:vir:78 1 MATLNELAPNTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENIPISYGETIIPTTVKRPEVGQVGVGTSN 80 (338) T ss_pred CcchHHhhhhhcccccccceecccccccchHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCccceeecccccc Confidence 11111222211 11112233445679999999999999999999999999999999999999986532 3467 Q ss_pred eecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCC--cccccccc Q lcl|NC_019933. 169 PVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGTG--QNLLGLLP 245 (394) Q Consensus 169 ~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~--~~~~Gi~~ 245 (394) |++||+.+|+++++|+++++.++|++++++||+|+++++ .+++++|.++|++++++++|.++|+|+|++ +.|.|+.+ T Consensus 81 ~~~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~~~gi~~ 160 (338) T protein:vir:78 81 EQREGGTKPLSGTAWDTRSVAPIKLATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTGSALQGIDT 160 (338) T ss_pred cccccccccccccceeEEEEEEEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccc Confidence 889999999999999999999999999999999999987 589999999999999999999999999865 45778776 Q ss_pred ccccccccc----cccccchHHHHHHHHHHhhh-hcCCCCeeEeCHHHHHHHH---HhhccCCcccccCc-ccCCCceee Q lcl|NC_019933. 246 QATAFAAPI----TVANATAVDRLRLALLQAQL-AEFPATGIVLNPADWAGIE---LLKDTQGRYILGNP-QGTLAPTLW 316 (394) Q Consensus 246 ~~~~~~~~~----~~~~~~~~~~i~~~~~~~~~-~~~~~~~~~~~~~~~~~l~---~lkd~~G~~~~~~~-~~~~~~~l~ 316 (394) .......+. .......++++.++...+.. .+...++|+|||.++.+|+ +++|++|+|+|+.. ..+.+++|+ T Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~~~~l~d~~g~~l~~~~~~~~~~~~l~ 240 (338) T protein:vir:78 161 NNVIVNTTNVDYLQTGTTPLLDRFLDGYDLVSANTDVDFNGWAADPRYRARLLRSQAYRDANGNVDPTRINLAASAGDLL 240 (338) T ss_pred ccccccccccccccccchhhHHHHHHHHHHhhhhccccceEEEEchHHHHHHHHHhhhccCCCceeecccccCCCCceee Confidence 544332211 22234567888887776654 3456678999999988775 57899999999754 455667999 Q ss_pred cceEEEcCCCCcC---------ceEEeeccceEEEEeecceEEEEeccc------------chhhhcCcEEEEEEEEecc Q lcl|NC_019933. 317 GLPVVATQAMAVG---------QFLTGAFDAGAQVFDRWAARVEVATEN------------QDDFIKNMVTILAEERLAL 375 (394) Q Consensus 317 G~pv~~~~~~p~~---------~~~~gd~~~~~~~~~~~~~~i~~~~~~------------~~~~~~~~~~~~~~~~~d~ 375 (394) |+||+++++||.+ .+++|||+. +.++++.+++++++++. ..+|++|++.+|++.|+|| T Consensus 241 G~PV~~~~~ip~~~~~~~~~~~~~~~gdfs~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~ 319 (338) T protein:vir:78 241 GLPVQFGKAVGGDLGAATDSKVRVVGGDFSQ-LKYGFADEIRVKMSDTATLTDNTSPTPQTVSMWQTNQIAILIEVTFGW 319 (338) T ss_pred eeeEEEccccCccccccCCcccEEEEEecce-EEEEeecccEEEEeecccccccccccccchhhhhcCcEEEEEEEEecc Confidence 9999999999852 478999987 66889999999988763 2469999999999999999 Q ss_pred EEecccceEEEEecCCCCC Q lcl|NC_019933. 376 AVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 376 ~v~~~~a~~~l~~~~a~~~ 394 (394) ++.||+||++|+.++++.- T Consensus 320 ~v~~~~a~~~l~~~~~~~~ 338 (338) T protein:vir:78 320 LLGDKQAFVKFVDDEDPDA 338 (338) T ss_pred EeecccceEEEecccCCCC Confidence 9999999999998666666 No 95 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=100.00 E-value=1.2e-51 Score=299.66 Aligned_cols=293 Identities=13% Similarity=0.084 Sum_probs=232.5 Q ss_pred hhhhHHHHHHHhh--cccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEcCcccccceecCC-- Q lcl|NC_019933. 98 GRAEINIKAAITS--LSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTNAAAPVAEG-- 173 (394) Q Consensus 98 ~~~~~~~~~~~~~--~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eg-- 173 (394) ...-.+.++.... ......+.++.++|+++.++|++.+++.++|+++++++|++++.+++|+.++. +.+.|++|| T Consensus 1 ~a~l~el~~~~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~~~~~~p~~~~~-~~a~~v~eg~~ 79 (333) T protein:vir:78 1 MATLNELLPNSAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIPISYGETIIPTTVKR-PEVGQVGVGTS 79 (333) T ss_pred CchhHHhhhhcccccccCceecCCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCC-ceeEeecCccc Confidence 1111122211111 11112334556999999999999999999999999999999999999998764 556666554 Q ss_pred ------ccccccccceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCCc--cccccc Q lcl|NC_019933. 174 ------AQKPESSLRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGTGQ--NLLGLL 244 (394) Q Consensus 174 ------~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~--~~~Gi~ 244 (394) +.+|+++++|+++++.++|++++++||+|+++++ .+++++|+++|++++++++|.++|+|+|++. .+.|+. T Consensus 80 ~~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~~~~~g~~ 159 (333) T protein:vir:78 80 NEQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFARMNPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTGSALQGID 159 (333) T ss_pred ccccccccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCCccccccc Confidence 6678999999999999999999999999999987 5899999999999999999999999998754 467776 Q ss_pred cccccccc----cccccccchHHHHHHHHHHhhhhc-CCCCeeEeCHHHHHHHHH---hhccCCcccccCc-ccCCCcee Q lcl|NC_019933. 245 PQATAFAA----PITVANATAVDRLRLALLQAQLAE-FPATGIVLNPADWAGIEL---LKDTQGRYILGNP-QGTLAPTL 315 (394) Q Consensus 245 ~~~~~~~~----~~~~~~~~~~~~i~~~~~~~~~~~-~~~~~~~~~~~~~~~l~~---lkd~~G~~~~~~~-~~~~~~~l 315 (394) +....... .....+..+++++.+++..+...+ ..+++|+|||.++..|++ ++|++|+|+|+.. ....+++| T Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i~~~~~~~~~~~~l 239 (333) T protein:vir:78 160 TDNVIANTTNVDYLQETGDPLLDRLLDGYDLVSANTDVEFNGWAVDPRFRAHLLRAQAYRDANGNVDPSRINLAAQTGDV 239 (333) T ss_pred ccccccccccccccccccchhHHHHHHHHHhhccccccCceEEEEcchHHHHHHHHhhhcCCCCceeecCccccCCCcee Confidence 65443322 223345567889999988876654 456689999999987764 7799999999754 45566799 Q ss_pred ecceEEEcCCCCcC---------ceEEeeccceEEEEeecceEEEEecccc---------hhhhcCcEEEEEEEEeccEE Q lcl|NC_019933. 316 WGLPVVATQAMAVG---------QFLTGAFDAGAQVFDRWAARVEVATENQ---------DDFIKNMVTILAEERLALAV 377 (394) Q Consensus 316 ~G~pv~~~~~~p~~---------~~~~gd~~~~~~~~~~~~~~i~~~~~~~---------~~~~~~~~~~~~~~~~d~~v 377 (394) +|+||++++.+|.+ .+++|||++ +.++.+.+++++++++.. ..|++|++.||++.|+||++ T Consensus 240 ~G~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~~-~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v 318 (333) T protein:vir:78 240 LGLPAQFGRAVGGDLGAAVDSKTRIIGGDFSQ-LKFGFADEIRIKMSDTATLTDSGSATVSMWQTNQIAILIEVTFGWLL 318 (333) T ss_pred eceeeEEccccCCCccccCCCccEEEEEeccc-EEEEEeeccEEEEeccccccccccceeehhhcCcEEEEEEEEEccEE Confidence 99999999999865 489999988 667888999999987632 46999999999999999999 Q ss_pred ecccceEEEEecCCC Q lcl|NC_019933. 378 YRPESFIKGSLAAAA 392 (394) Q Consensus 378 ~~~~a~~~l~~~~a~ 392 (394) ++|+||++++.+.+- T Consensus 319 ~~~~a~~~l~~~~a~ 333 (333) T protein:vir:78 319 GDKQAFVKFVDDEQP 333 (333) T ss_pred ecccceEEEeccCCC Confidence 999999999866665 No 96 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=100.00 E-value=1.8e-51 Score=298.76 Aligned_cols=278 Identities=17% Similarity=0.125 Sum_probs=225.7 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEcCcccccceecCCccccccccceeeEEeee Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSA 190 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~ 190 (394) |.+. ++++|.++|+++.++|++.+++.++|+++++++|++++..++|+.++. +.+.|++||+.+|+++++|+++++.+ T Consensus 1 Mat~-tt~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~~~~~p~~~~~-~~a~wv~Eg~~~~~~~~~f~~v~l~~ 78 (311) T protein:vir:99 1 MATF-GTGNLKNLPRNIADGMVKDVVQGSTVAVLSARKPQRFGNEDIITFNGR-PKAEFVGEGQQKSSTTGEFDFVTSTP 78 (311) T ss_pred Ccee-cCCCceeccHHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCC-ceeEEeecCcccccccceeeEEEEee Confidence 4433 456788999999999999999999999999999999888999999775 57899999999999999999999999 Q ss_pred eeEEEeehhhHHHHH---HH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCC--cccccccccccc--ccccccccc-cch Q lcl|NC_019933. 191 KVIAHWMKASRQILS---DS-AQLQSFINARLLRGLEVVEENQLLNGNGTG--QNLLGLLPQATA--FAAPITVAN-ATA 261 (394) Q Consensus 191 ~k~~~~~~is~e~l~---~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~--~~~~Gi~~~~~~--~~~~~~~~~-~~~ 261 (394) +|++++++||+|+++ ++ .+++++|.++|++++++++|+++|+|+|++ ..+.|+.+.... ..++.+..+ ... T Consensus 79 ~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~ 158 (311) T protein:vir:99 79 KKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASKRVELTADTIANP 158 (311) T ss_pred EEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccceeeccccccchh Confidence 999999999999995 33 579999999999999999999999998754 445555443222 222222222 233 Q ss_pred HHHHHHHHHHhhhh--cCCCCeeEeCHHHHHHHHHhhccCCcccccCc-ccCCCceeecceEEEcCCCCcC--------- Q lcl|NC_019933. 262 VDRLRLALLQAQLA--EFPATGIVLNPADWAGIELLKDTQGRYILGNP-QGTLAPTLWGLPVVATQAMAVG--------- 329 (394) Q Consensus 262 ~~~i~~~~~~~~~~--~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~-~~~~~~~l~G~pv~~~~~~p~~--------- 329 (394) +.++.+++..+... ....+.|+|||.++..|+++||++|+|+|++. .+..+++|+|+||++++.+|.+ T Consensus 159 ~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~Pv~~s~~i~~~~~~~~~~~~ 238 (311) T protein:vir:99 159 DLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTARYTDGRKKFPELGLGIGVSSFEGIDASVSDTVNGGDEADPDDED 238 (311) T ss_pred HHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhhccCCCeeecCcccCCCCceecceeeEeecccccccccccccch Confidence 45555565555444 34456799999999999999999999999754 4455679999999999887632 Q ss_pred -------ceEEeeccceEEEEeecceEEEEecccc-----hhhhcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|NC_019933. 330 -------QFLTGAFDAGAQVFDRWAARVEVATENQ-----DDFIKNMVTILAEERLALAVYRPESFIKGSLAAA 391 (394) Q Consensus 330 -------~~~~gd~~~~~~~~~~~~~~i~~~~~~~-----~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a 391 (394) .+++|||+.++.+..+.++++++.++.. .+|++|++.+|++.|+||.+.|| +|++++.++| T Consensus 239 ~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~r~~~r~d~~v~~~-~~v~~~~~~A 311 (311) T protein:vir:99 239 LDAARAVRGIVGDFANGIHWGVQRDIPVELIKYGDPDGQGDLKRHNQIALRLEIVYGWYVFTD-RFVVIENAVA 311 (311) T ss_pred hhccCcceEEEeeccccEEEEEecCceEEEeecCCCCcchhhhhcCcEEEEEEEeecceecCh-hHeeeecccC Confidence 3578999998888899999999877643 45999999999999999999996 6777777777 No 97 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=100.00 E-value=2.3e-51 Score=298.16 Aligned_cols=276 Identities=13% Similarity=0.106 Sum_probs=226.1 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEcCcccccceecCCcc-----ccccccceee Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTNAAAPVAEGAQ-----KPESSLRFDL 185 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eg~~-----~~~~~~~~~~ 185 (394) +...++++||.+||+++.++|++.+++.++|++++++++++++.+++|+.++. +.+.|++||+. +|.++++|++ T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~-~~a~wv~E~~~~~~~~~~~s~~~f~~ 79 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATL-PEADWVGESATDPKGVKPTSKVTWAN 79 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccCCcEEEEEEeCC-cceEEeecccccccccccccccceee Confidence 77888888999999999999999999999999999999999989999998864 68899999986 5567899999 Q ss_pred EEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCCccc--ccccccccccccc-ccccccch Q lcl|NC_019933. 186 VQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGTGQNL--LGLLPQATAFAAP-ITVANATA 261 (394) Q Consensus 186 i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~--~Gi~~~~~~~~~~-~~~~~~~~ 261 (394) +++.++|++++++||+|+++++ .+++++|.++|++++++++|.+||+|+|++..+ .++.+........ ........ T Consensus 80 i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (305) T protein:vir:25 80 RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVAN 159 (305) T ss_pred EEeeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhheeccCCCCCccccccccccccccccccccccchh Confidence 9999999999999999999998 589999999999999999999999999865432 2222222211111 11122222 Q ss_pred ----HHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcccCCCceeecceEEEcCCCCc----CceEE Q lcl|NC_019933. 262 ----VDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPTLWGLPVVATQAMAV----GQFLT 333 (394) Q Consensus 262 ----~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~l~G~pv~~~~~~p~----~~~~~ 333 (394) ++.+..+...+...+...+.|+|||.++..|+++||++|+|+|++ ++|+|+||++++.+|. +.++| T Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~------~~l~G~Pv~~~~~~~~~~~~~~~~~ 233 (305) T protein:vir:25 160 ESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD------DSFAGFRTFFNRNGAWDADAAIEVI 233 (305) T ss_pred hhHHHHHHHHHHHhhhhcccccceeEecHHHHHHHHHhhccCCceeecC------CcccccceEEcCccCCCCCccEEEE Confidence 333444445555556667789999999999999999999999953 4799999999999874 36899 Q ss_pred eeccceEEEEeecceEEEEeccc--------chhhhcCcEEEEEEEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 334 GAFDAGAQVFDRWAARVEVATEN--------QDDFIKNMVTILAEERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 334 gd~~~~~~~~~~~~~~i~~~~~~--------~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) |||++ ++++.+.+++++++++. +..|++|++.+|++.|+||.+.||+||++++....+.. T Consensus 234 gd~s~-~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~~~~ 301 (305) T protein:vir:25 234 ADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVV 301 (305) T ss_pred Eecce-EEEEEecCeEEEEeeeeeeecCCceeeeeecCcEEEEEEEeecceeeCcccEEEEcccccccc Confidence 99987 67888889999887653 34699999999999999999999999999997655543 No 98 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=100.00 E-value=3e-50 Score=292.05 Aligned_cols=358 Identities=8% Similarity=0.033 Sum_probs=232.8 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhhcccccccchhhhh Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQ-QRIAEVEGNGAGGDVQHISIGQ 79 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e-~~~~~~~~~~~~~~~~~~~~~~ 79 (394) |. .+|+++++++.++.+++.+....+.. .++..+.+++ ....++.++.+.. ......... T Consensus 1 M~--~kl~~~~~~~~e~~~~l~~~~~~~~~-~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~------------- 61 (383) T protein:vir:78 1 MT--IKLKNNLANYEEKRTAFVNAVKNEDT-QEIQNKAYVE---MVDAMAADIMEQAKKEARQEADA------------- 61 (383) T ss_pred Cc--hhHHHHHHHHHHHHHHHHHHHhccCh-HHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHH------------- Confidence 87 44666666666666655544333221 1111111222 2222222222100 000000000 Q ss_pred hhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEE Q lcl|NC_019933. 80 QFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVR 159 (394) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~ 159 (394) +....+.......+.+...+....+++++||++||+++.+.|++.+.+.++|+++|++.++++. .++|+ T Consensus 62 ----------~~~~~~g~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~l~~~~~v~~~~~~-~~i~~ 130 (383) T protein:vir:78 62 ----------YISASRTDKNITNEEIKFFNDINKEVGYKEETLLPQTVVDEIFEDLTTEHPFLASIGMRTTGLR-TKFLK 130 (383) T ss_pred ----------HHHhcCChhhhhHHHHHHHHHHhccCCCCCccccCHHHHHHHHHHHHhhccceeeeeeEecCCc-eEEEE Confidence 0000000001111222334455667788999999999999999999999999999999998775 68998 Q ss_pred EcCcccccceecCCcccc-ccccceeeEEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhccCCC Q lcl|NC_019933. 160 ETGFTNAAAPVAEGAQKP-ESSLRFDLVQTSAKVIAHWMKASRQILSDSA-QLQSFINARLLRGLEVVEENQLLNGNGTG 237 (394) Q Consensus 160 ~~~~~~~~~~~~eg~~~~-~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~-~~~~~i~~~la~a~~~~~d~a~l~g~g~~ 237 (394) ..+. +.+.|++|++.++ .++++|+++++.+++++++++||+++|+|++ ++++||.+++++++++++|.+|++|+|.+ T Consensus 131 ~~~~-~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~i~is~ell~Ds~~~ie~~i~~~l~~~~a~~~~~a~i~G~G~~ 209 (383) T protein:vir:78 131 SETS-GVAVWGKIFGEIKGQLDATFSDEESIQNKLTAFVVVPKDLEKFGPAWVKRFVVTQIEEAFAVALESAYIVGDGND 209 (383) T ss_pred EcCC-cceEEeecccccccccCcceeeEeecceeeEeeccchHHHhhccHHHHHHHHHHHHHHHHHHHHhhheEeccCCC Confidence 7764 6788999977664 6789999999999999999999999999985 89999999999999999999999999965 Q ss_pred cccccccccccccccc-------ccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhc---cCCcccccCc Q lcl|NC_019933. 238 QNLLGLLPQATAFAAP-------ITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKD---TQGRYILGNP 307 (394) Q Consensus 238 ~~~~Gi~~~~~~~~~~-------~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd---~~G~~~~~~~ 307 (394) .|.||++........ ....+..+.+++..+...+. .++.+..|+||..++..+++++. ..+.+.+... T Consensus 210 -qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~ 287 (383) T protein:vir:78 210 -KPIGLNRKVGKGSTVVDGVYAEKAATGTLTFANPKTTVNELT-DVYKYHSVKENGHPLNVAGKVTLLVNPTDAWDVKKQ 287 (383) T ss_pred -CceeeeeccCCcccccccccccccccchhhhhhhHHHHHHHH-HHHhccchhcccchhhhcCceEEEEcCcchhhhccc Confidence 588998754322111 11223334555555444443 33444445555555444444431 1111111110 Q ss_pred -----ccCCCceeecce--EEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecc Q lcl|NC_019933. 308 -----QGTLAPTLWGLP--VVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRP 380 (394) Q Consensus 308 -----~~~~~~~l~G~p--v~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~ 380 (394) .++...+++|+| |+.++.||+++++||||++ |.++++.+++++.++ +..|.+|++.||+..|+|++++++ T Consensus 288 ~~~~~~~G~~~t~l~~~~~iv~s~~~p~~~iifgdfs~-Y~i~~r~~~~i~~~~--~~~f~~d~~~f~~~~r~dG~~~~~ 364 (383) T protein:vir:78 288 YTSLNANGVYVTALPFNLNIIESLFVPEKKAISYVAER-YDALIGGPLDIGTYD--QTLAIEDLNLYAAKQFAYGKAKDD 364 (383) T ss_pred hhccCCCCceeeecCCCceEEecCCCCcccEEEeeccc-eEEEecccceEEecc--hhhhhcCceEEEEEEEEcCEEecC Confidence 111122556555 7789999999999999998 889999999988765 456899999999999999999999 Q ss_pred cceEEEEecC--CCCC Q lcl|NC_019933. 381 ESFIKGSLAA--AAGT 394 (394) Q Consensus 381 ~a~~~l~~~~--a~~~ 394 (394) +||++++++. ..+| T Consensus 365 ~A~~vl~~~~~~~~~~ 380 (383) T protein:vir:78 365 KAAAVWTLNINPAEQT 380 (383) T ss_pred CeEEEEEEEecCCCCC Confidence 9999976553 3333 No 99 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=100.00 E-value=2.6e-40 Score=237.59 Aligned_cols=288 Identities=9% Similarity=-0.017 Sum_probs=222.8 Q ss_pred hhHHHHH--HHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccc-cccCceeEEEEcCcc---cccceecCC Q lcl|NC_019933. 100 AEINIKA--AITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGT-MEGNTLEYVRETGFT---NAAAPVAEG 173 (394) Q Consensus 100 ~~~~~~~--~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~-~~~~~~~~~~~~~~~---~~~~~~~eg 173 (394) ++...+. .....+ .++..||+++|+++ .++++.+++.+++++++++++ +......+|+..... ....|.++. T Consensus 1 ~~~~~~~~~~~k~it-~~d~~gG~L~P~~~-~~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g~~~~~~~~~~~~~ 78 (314) T protein:vir:41 1 MDFLNKPFQITPKID-VPDLGKGILAVQRF-GEFVREVRENSAIIKDARVLNALKSYEVDISRISLGVELEPGRNTSGTK 78 (314) T ss_pred CchhhhHHHhhcccc-cccCCCceeChHHH-HHHHHHHHhccchhhheeeecccCccceeecccccCcccccccccccCC Confidence 1211221 223333 34556899999887 579999999999999999985 466678888865321 234566777 Q ss_pred ccccccccceeeEEeeeeeEEEeehhhHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhhccCCC-------cccccc Q lcl|NC_019933. 174 AQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSA---QLQSFINARLLRGLEVVEENQLLNGNGTG-------QNLLGL 243 (394) Q Consensus 174 ~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~---~~~~~i~~~la~a~~~~~d~a~l~g~g~~-------~~~~Gi 243 (394) ...++++++|+++.+.++++...+.||+|+|+|+. +++++|...+++++++.++.++++|+|+. +.|.|+ T Consensus 79 ~~~~~~~~tf~~~~l~~~kl~~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~~~~~~~p~G~ 158 (314) T protein:vir:41 79 VAPTADEVTVSTNTLEMKELVTKVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSLTTGRELYRINDGW 158 (314) T ss_pred ccCCcccccccceeeeeEEEEEeecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCCcCcccchhcchhh Confidence 88899999999999999999999999999999984 89999999999999999999999999853 357899 Q ss_pred cccccccccccc-ccccchHHHHHHHHHHhhhhcCC---CCeeEeCHHHHHHHHHhhccCCcccccCccc-CCCceeecc Q lcl|NC_019933. 244 LPQATAFAAPIT-VANATAVDRLRLALLQAQLAEFP---ATGIVLNPADWAGIELLKDTQGRYILGNPQG-TLAPTLWGL 318 (394) Q Consensus 244 ~~~~~~~~~~~~-~~~~~~~~~i~~~~~~~~~~~~~---~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~-~~~~~l~G~ 318 (394) ++.+.......+ .+...+.+.+.+++..++..|++ +.+|+||+.++.+++++++.+|+++++.... +.+.+|+|+ T Consensus 159 l~~a~~~~~~~~~~~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~l~~~~~~l~~~~~~~~~~~~l~G~ 238 (314) T protein:vir:41 159 MKLAGNQYTDAEPEDENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQLLVRETGLGDSALIGATGLQYDGI 238 (314) T ss_pred hhhcccceeecCccccccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHHHhccCCcccchhhhCCCCceecce Confidence 987655544332 33445667778899999998875 4579999999999999999999999876543 445579999 Q ss_pred eEEEcCCC-----CcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|NC_019933. 319 PVVATQAM-----AVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAAAG 393 (394) Q Consensus 319 pv~~~~~~-----p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~ 393 (394) ||+.++.| |++.++||||++. .++.+..++++..+ +..++++.|.+..|+|+.+.+++|.++..+..+.+ T Consensus 239 PV~~~~~~~~~~~~~~~i~fgd~~nl-v~~~~~~ir~~~~~----~a~~~~~~~~~~~r~d~~~~~~~aa~~~~~~~~~~ 313 (314) T protein:vir:41 239 PIQYVPALDALGDDKARALLTVPTNL-VYGFWRNIRIEPKR----DAAMRRTEYIASLRADCNYEDENAAVAAVIDMSSG 313 (314) T ss_pred eeEecccccccCCCCceEEEechhhe-EEEeeceeEEeecc----cCcCCeEEEEEEEEeceEEEEcCcEEEEEeeccCC Confidence 99999987 4578999999974 44555555554333 45688999999999999999987776666544444 Q ss_pred C Q lcl|NC_019933. 394 T 394 (394) Q Consensus 394 ~ 394 (394) - T Consensus 314 ~ 314 (314) T protein:vir:41 314 G 314 (314) T ss_pred C Confidence 4 No 100 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=100.00 E-value=2.4e-39 Score=232.33 Aligned_cols=378 Identities=14% Similarity=0.168 Sum_probs=217.7 Q ss_pred CchHHHHHHHHHHHHHHHHHH---HHHHHhhhhhhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhcc--c-cccc Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAH---ADRAVKDQELNASVRAKVDELLMAQGALQ-ADLKAAQQRIAEVEGNGA--G-GDVQ 73 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~---~e~~~~~~~~~~e~~~~~~~~~~~~~~l~-~~i~~~e~~~~~~~~~~~--~-~~~~ 73 (394) ...|..+++.......++++. .++.....+...++..+++++.++..+.. +.+..++.++........ . .... T Consensus 127 ~a~I~~vke~~~~e~~~~~~~~a~~ee~~e~~~k~~el~a~l~~~~~~~~~~~~e~~~~l~a~~~~~~~~~~~~~~~~~~ 206 (517) T protein:vir:97 127 NAVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGVEALK 206 (517) T ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHhhhhccccccc Confidence 233333333322222222211 11111111111222222222222221111 111111111111111110 0 0001 Q ss_pred chhhhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccC Q lcl|NC_019933. 74 HISIGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGN 153 (394) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~ 153 (394) .......+.............. ... ...............+|++.|..+...+...+...+++...+++.+++. T Consensus 207 ~~~~~~~~~~~~~~~~~~~~~~---~~~--~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~i~~- 280 (517) T protein:vir:97 207 VTPEATEFLKTREAEVAYMSAS---LTK--DPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT- 280 (517) T ss_pred ccchhhHHHHHHHHHHHHHHhc---ccc--cccceeeeecccccccccccchHHHHHHHHhhhhhccceeeeeeccccc- Confidence 1111111111100000000000 000 0000111112233457889999999999999999999888877765543 Q ss_pred ceeEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHHH-H----HHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 154 TLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSA-Q----LQSFINARLLRGLEVVEEN 228 (394) Q Consensus 154 ~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~-~----~~~~i~~~la~a~~~~~d~ 228 (394) ..+|.... ...++|+.||+.+|+++++|+++++.++++++++++|+++++|+. + +++||.++|+.++++++|. T Consensus 281 -~~~~~~~~-~~~a~~~~eG~~kp~s~~tf~~~~~~~~~ia~~~~~S~qll~Ds~~dd~~~l~s~i~~~l~~~l~~~ee~ 358 (517) T protein:vir:97 281 -LVVGGDNA-LTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNR 358 (517) T ss_pred -eeeecccc-cceeeeeecCCcccccccceeeEEeeHhhhhhhhhhhHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHH Confidence 44555444 346789999999999999999999999999999999999998773 3 9999999999999999999 Q ss_pred HHhhccCCCccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcc Q lcl|NC_019933. 229 QLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQ 308 (394) Q Consensus 229 a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~ 308 (394) +|++|+|++.++.|+++.+....... ..+..+..+++..+..... ...++.|+|||.+|.+|+++||++|+|||++.. T Consensus 359 a~l~GdGtg~~~~gi~~~a~~~~~~~-~~~~~~~~d~i~~l~~a~~-~a~~a~~vmn~~t~~~I~klKD~~G~Yl~~~~~ 436 (517) T protein:vir:97 359 AIIMGGVTGVSETQIYPVVGDAWATN-VTGTTNIQELLEKLSVATP-KAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGV 436 (517) T ss_pred HHhcccCCCccccccccccccccccc-ccccchHHHHHHHHHHHhh-hccCCEEEECHHHHHHHHHhhcCCCCeeccCcC Confidence 99999999988888887543322211 1222233344333222111 224778999999999999999999999998765 Q ss_pred cC-CCceeecceEEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEE Q lcl|NC_019933. 309 GT-LAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGS 387 (394) Q Consensus 309 ~~-~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~ 387 (394) .. ...+++|..-+. +.++.+...++..+ .|.++.+.++.+..+ .+..+|+..|+..+|+++.|+.|++|+.+. T Consensus 437 ~~~~~~~l~G~~~~~-~~~~~~~~~~~~~~-~y~i~~~~g~~~~~~----fd~~~n~~~f~~~~~~~g~i~~~~r~a~~~ 510 (517) T protein:vir:97 437 SNQTIATHFGFNRLV-QSVAVDEKTAVSLS-GYVTNGSRGMEFEQG----TILVENNKEYLFEMPISGSLEYKGTTAYGT 510 (517) T ss_pred CcccccccCCccccc-cccccCceeEeecc-ccEEEeecceeeeee----eecccCceeEeeeeeeccccccccceEEEE Confidence 43 345778842222 23444555555544 566777666554322 234578999999999999999999999988 Q ss_pred ecCCCCC Q lcl|NC_019933. 388 LAAAAGT 394 (394) Q Consensus 388 ~~~a~~~ 394 (394) +.++..- T Consensus 511 ~~p~~~~ 517 (517) T protein:vir:97 511 YTPPVAG 517 (517) T ss_pred EcCCCCC Confidence 7655444 No 101 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=100.00 E-value=8.2e-40 Score=234.86 Aligned_cols=291 Identities=10% Similarity=-0.001 Sum_probs=216.3 Q ss_pred HHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccc-cccCceeEEEEcCc---c Q lcl|NC_019933. 89 AMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGT-MEGNTLEYVRETGF---T 164 (394) Q Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~-~~~~~~~~~~~~~~---~ 164 (394) .+.......+ .........+ .++.+||+++|+.. ..+++.+.+.+++++++++++ +.+....+++.... . T Consensus 1 ~~~~~~~~~~----~~~~~~k~~t-~~d~~Gg~l~P~~~-~~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~~~~~~ 74 (315) T protein:vir:41 1 MLTIEDIRGG----KPFEIVPKID-VPDLGRGVLSVDRF-GEFVKAVRDSAVIIPEARIDNALKSYEKDISRLSLVLDVG 74 (315) T ss_pred CcccchhhcC----ChhhhhhhcC-CcCCCCceechHHH-HHHHHHHHhhhhhhhhceeeeccccccccccccccCcccc Confidence 1111111111 1111223333 34456888888775 569999999999999999865 44545555543211 1 Q ss_pred cccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhhccCCCc--- Q lcl|NC_019933. 165 NAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSA---QLQSFINARLLRGLEVVEENQLLNGNGTGQ--- 238 (394) Q Consensus 165 ~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~---~~~~~i~~~la~a~~~~~d~a~l~g~g~~~--- 238 (394) ....|.+++...++++++|+++.+.++++.+.+.||+++|+|+. +++++|..++++++++.++.++++|+|.+. T Consensus 75 ~g~~~~~~~~~~~~~~~~f~~~~l~~~~l~~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nGdg~s~~p~ 154 (315) T protein:vir:41 75 PGRDETGQKLAPPESTAEVKTNTLYMREMVTKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYLHGDTSSSDPL 154 (315) T ss_pred cccccccCcCCCCCCccccceeeeceeeeeeeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhhccCCcCcCcc Confidence 23467888888999999999999999999999999999999984 899999999999999999999999998753 Q ss_pred --ccccccccccccccccc---ccccchHHHHHHHHHHhhhhcCC---CCeeEeCHHHHHHHHHhhccCCcccccCcc-c Q lcl|NC_019933. 239 --NLLGLLPQATAFAAPIT---VANATAVDRLRLALLQAQLAEFP---ATGIVLNPADWAGIELLKDTQGRYILGNPQ-G 309 (394) Q Consensus 239 --~~~Gi~~~~~~~~~~~~---~~~~~~~~~i~~~~~~~~~~~~~---~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~-~ 309 (394) .|+|+++.+........ .+...+.+.+.+++..++..|+. +.+|+||+.++.+|++++|++|+|+|++.. . T Consensus 155 ~~~~~G~l~~a~~~~~~~~~~~~a~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~rklk~~~g~~lw~~~~~~ 234 (315) T protein:vir:41 155 LRMSDGWLKLASEKLTESDVDPEAEDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYRDALKGRETGLGDQALTG 234 (315) T ss_pred ccccccceecccccccccccccccccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHHHHhccCCCccccchhhc Confidence 45799887655433222 22334577888999999998874 568999999999999999999999998664 4 Q ss_pred CCCceeecceEEEcCCCC-----cCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceE Q lcl|NC_019933. 310 TLAPTLWGLPVVATQAMA-----VGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFI 384 (394) Q Consensus 310 ~~~~~l~G~pv~~~~~~p-----~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~ 384 (394) +.+.+|+|+||..++.|| ++.++||||++ +.++.+.+++++.+++ ..++.+.|.+..|+|+.+.++++.+ T Consensus 235 g~~~tl~G~PV~~~~~m~~~~~~~~~ilf~d~~n-l~~~~~~~i~i~~~~~----a~~~~~~~~~~~r~d~~~~~~~~~a 309 (315) T protein:vir:41 235 ANSILYDGRPVQYVPALEALNDGKSRALFVVPTQ-LVYGFWRNIKVVPDYD----AEMRLTKYVASLRTDNHYEDEEGAV 309 (315) T ss_pred CCCceecccceEecccccccCCCCccEEEecccc-eEEEeccccEEEeeec----CCCCceEEEEEEEeceeEEecccee Confidence 456789999999999885 56799999987 4556677777765543 3466788999999999888777644 Q ss_pred EEEecC Q lcl|NC_019933. 385 KGSLAA 390 (394) Q Consensus 385 ~l~~~~ 390 (394) +..++. T Consensus 310 ~~~~~v 315 (315) T protein:vir:41 310 SATITV 315 (315) T ss_pred EeeeeC Confidence 433333 No 102 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=100.00 E-value=2e-36 Score=216.32 Aligned_cols=297 Identities=11% Similarity=0.043 Sum_probs=218.5 Q ss_pred hhhhhhhHHHHHHH-hhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEcCcccccceec-C Q lcl|NC_019933. 95 GQRGRAEINIKAAI-TSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTNAAAPVA-E 172 (394) Q Consensus 95 ~~~~~~~~~~~~~~-~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-e 172 (394) ..+........... ....+.++..+|++||+++...|++.+.+.++++++++++++......+|...... ..+|++ + T Consensus 1 ~~~k~~~~~l~~~~~~~~~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~~~~i~~~~~~~-~~~~~~~e 79 (321) T protein:vir:31 1 MASRTINNDLSRITEKNALTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGAKKTRIPTLNIGE-RHRRPQDE 79 (321) T ss_pred CchHHHHHHHHHHHHhccccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccCcceeeeeeccCC-cccccccc Confidence 11111111111111 12222344567889999999999999999999999999999998888899876543 334443 3 Q ss_pred C-ccccccccceeeEEeeeeeEEEeehhhHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHhhccCCCccc-----ccc Q lcl|NC_019933. 173 G-AQKPESSLRFDLVQTSAKVIAHWMKASRQILSDS---AQLQSFINARLLRGLEVVEENQLLNGNGTGQNL-----LGL 243 (394) Q Consensus 173 g-~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s---~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~-----~Gi 243 (394) + ...+.++++|+++.+..+++.+.+.||+++|+++ ++++++|++.++++++..++.++|+|++.+.++ .|+ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~~~~~~n~G~ 159 (321) T protein:vir:31 80 GEWNENESDVSTGTIDISTEKATVAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAANGDEDAEDSFENQNDGF 159 (321) T ss_pred cccccccccceeeeeeeeeEEEEeehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheeeccccCCCcccccchhh Confidence 3 4556788999999999999999999999999987 389999999999999999999999999988765 688 Q ss_pred cccccccccccc-ccccchHHHHHHHHHHhhhhcCC--CCeeEeCHHHHHHHHH-hhccCCcccccCc-ccCCCceeecc Q lcl|NC_019933. 244 LPQATAFAAPIT-VANATAVDRLRLALLQAQLAEFP--ATGIVLNPADWAGIEL-LKDTQGRYILGNP-QGTLAPTLWGL 318 (394) Q Consensus 244 ~~~~~~~~~~~~-~~~~~~~~~i~~~~~~~~~~~~~--~~~~~~~~~~~~~l~~-lkd~~G~~~~~~~-~~~~~~~l~G~ 318 (394) ++.+........ ..+..+++.+.+++..++..++. +.+|+||+.++..++. +++. +.+++... .++.+.+|+|+ T Consensus 160 l~~a~~~~~~~~~~~~~~~~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~l~~~-~~~~~~~~l~~~~~~tl~G~ 238 (321) T protein:vir:31 160 ITVAEGDVETIDAADDILDNDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYHYTLTDR-DTPLGDNVIMGEADVNPFSF 238 (321) T ss_pred hhhhccccccccccccccCHHHHHHHHHhccHhHhcCCCeEEEechHHHHHHHHHHhcC-CCccccchhhccccccccce Confidence 875544333322 23445678899999999998864 4589999999887765 5554 45677543 34455689999 Q ss_pred eEEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchh-hhcCcEEEEEEEEeccEEecccceEEEE-ecCCCCC Q lcl|NC_019933. 319 PVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDD-FIKNMVTILAEERLALAVYRPESFIKGS-LAAAAGT 394 (394) Q Consensus 319 pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~-~~~~~~~~~~~~~~d~~v~~~~a~~~l~-~~~a~~~ 394 (394) ||+.++.||++.++|++|++.++ +...+++++...+.... ...+.+......++|+.|.+++|++.++ ++-+--+ T Consensus 239 pvv~~~~mP~~~il~t~~~nl~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ve~~~a~a~~~~i~~~~~~ 315 (321) T protein:vir:31 239 PIIGSGLWPDDKAMFTDPQNLIY-ALYRDLEIDVLTESDKVSERDLHARYFMRGDDDFAIENTEAVVLAEGLGDPLEH 315 (321) T ss_pred eEEEcCCCCCCcEEEeccccEEE-EEeeccEEEEeecCccccccceeeEeeeeeecceeEeccccEEEEecCCcchhc Confidence 99999999999999999998644 44456677665543321 2234445556668999999999999998 4443333 No 103 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=100.00 E-value=1.8e-34 Score=205.59 Aligned_cols=358 Identities=12% Similarity=0.054 Sum_probs=194.5 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhhhhh Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQHISIGQQ 80 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~ 80 (394) -.++..+++.-....+. ...++..+......+.+.+.+++.++..+++.++++..... ...... ....+..... T Consensus 114 ~a~v~~vks~~~~~e~~--~~~~e~~e~~~e~~e~~~~~~el~akl~el~k~~ee~k~~~---~~~~~~-~~~~~~~~~e 187 (480) T protein:vir:40 114 GAKVTKVREENKGEQEQ--MGANETQEIMKQAIEAGVKVRELEAKVEELNKEREELKKER---EASIPS-EKPEDAERKF 187 (480) T ss_pred hhhhhhhhhhhhhhhhh--hhhHHHHHHHHhhhhhhhhhhhHHHHHHHHHhHHHHHhhhh---hhhccc-cchhhhhhHH Confidence 11122211111100000 00000000000001112223333333333333333221110 000000 0001111000 Q ss_pred hhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEE Q lcl|NC_019933. 81 FVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRE 160 (394) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 160 (394) . ............. ...+........+....++. +|+.+...+.......+++...+..... + T Consensus 188 ~---r~~~~~~~~~~e~----~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~-g-------- 250 (480) T protein:vir:40 188 M---RELGSKMAEMPEQ----GFLREFANGADLNVVNSLGS-ITSKYARKSGIYDGAMKARFQGLTLAED-G-------- 250 (480) T ss_pred H---HHHHHHhccchhh----hhhhhhhhhccccccccccc-cccchhhheeechhhhhhhhhcceeeec-c-------- Confidence 0 0111111111110 11111112222233333343 4555554444443344444333222111 1 Q ss_pred cCcccccceecCCccccccc--cceeeEEee---eeeEEEeehhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_019933. 161 TGFTNAAAPVAEGAQKPESS--LRFDLVQTS---AKVIAHWMKASRQILSDSAQLQSFINARLLRGLEVVEENQLLNGNG 235 (394) Q Consensus 161 ~~~~~~~~~~~eg~~~~~~~--~~~~~i~~~---~~k~~~~~~is~e~l~~s~~~~~~i~~~la~a~~~~~d~a~l~g~g 235 (394) .....|++++...+... ..+.+..+. .++++.....|+++++|++++++||.++|+..++.+++.+|++|+| T Consensus 251 ---~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~~~k~t~~lLDDa~~l~~~i~~~l~~~~~~~ee~a~l~G~g 327 (480) T protein:vir:40 251 ---VDDTFISGTFKAGTDKNKSQTATKRSLRPQMAEAYLQMDKATVRGVNDSGALSEYVMSEMVNRVIQKVEYNMILGSV 327 (480) T ss_pred ---ccceeeeeeeecccccccccccccchhhHHHHHHHHHhHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhccCC Confidence 11233455443333221 233344444 4677888889999999998999999999999999999999999977 Q ss_pred CCc-cccccccccccccccccccccchHHHHHHHHHHhhhhcCCCC-eeEeCHHHHHHHHHhhccCCcccccCcc-cCCC Q lcl|NC_019933. 236 TGQ-NLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPAT-GIVLNPADWAGIELLKDTQGRYILGNPQ-GTLA 312 (394) Q Consensus 236 ~~~-~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~-~~~~~~~~~~~l~~lkd~~G~~~~~~~~-~~~~ 312 (394) ++. .+.|+.......+ .+....+.|.+++..+...+..++ .|+||+.+|+.|++|||++|+|||++.. .+.+ T Consensus 328 ~g~~~~~g~~~~~~~~~-----~~~~~~d~id~L~~al~~~y~~~a~~~vmn~~t~~~I~klKD~~G~Yi~q~~~~~~~~ 402 (480) T protein:vir:40 328 DGSNGFYGLKTATDGWT-----KQIEYTDLFEGITDAVAECSISDAITIVMSPQTFAELRKAKGTDGHSRFNELATKEQI 402 (480) T ss_pred CCccccccceeeccccc-----ccchhHHHHHHHHHhhhHHhhCCCCEEEECHHHHHHHHHhhcCCCCeeccCcccccCc Confidence 664 3566644322111 112223444457778888888777 6999999999999999999999998755 4556 Q ss_pred ceeecceEEEc-CCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|NC_019933. 313 PTLWGLPVVAT-QAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAA 391 (394) Q Consensus 313 ~~l~G~pv~~~-~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a 391 (394) .+|+|+||+++ ..+|.+...++.++.++.++++. ++. .....+..++..|+++.|+++.+..|+|+.+++++.. T Consensus 403 ~~llG~pvv~~~~~~~~~~~~~~~~~~~~~~~d~~---~~~--~~~~~~~~~~~~~~~e~~v~g~~~~~~~~~~~~~~~~ 477 (480) T protein:vir:40 403 AQSFGAVNLETRVWMPKDEVAVYNHDEYVLIGDLN---VEN--YNDFDLRYNVEQWLSETLVGGSIRGKNRSAYLKKKGS 477 (480) T ss_pred ceecccceeeeeccccCCcceeeeCCccEEEEecc---cce--ecccccccchhhhhhhhhhceeeEccccEEEEEeccC Confidence 79999998765 46788888888888878888763 222 2223566889999999999999999999999999999 Q ss_pred CCC Q lcl|NC_019933. 392 AGT 394 (394) Q Consensus 392 ~~~ 394 (394) -|- T Consensus 478 ~~~ 480 (480) T protein:vir:40 478 LGV 480 (480) T ss_pred cCC Confidence 999 No 104 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=100.00 E-value=4.1e-33 Score=198.15 Aligned_cols=263 Identities=17% Similarity=0.157 Sum_probs=211.8 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHHhcccc----ccccCceeEEEEcCcccccceecCCccccccccceeeE Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQG----TMEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLV 186 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i 186 (394) |..+++..+..++|+.+...+++.+.+.+.+.+++.+. ..+|+++++|++.. .+.+.|++||+.+|.++++++++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~-~~~a~~v~eg~~i~~~~~~~~~~ 79 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDY-IGDAEDVAEGEAIPMTQLGFKKT 79 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecC-CCCcccccCCCcccccccccceE Confidence 66555666789999999999999999988887876542 24567899999865 46788999999999999999999 Q ss_pred EeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccccccchHHHH Q lcl|NC_019933. 187 QTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDRL 265 (394) Q Consensus 187 ~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i 265 (394) .+.+++++..+.+|+++..++ +++.+.+.+++++++++++|..++....+. ....++..+++.+ T Consensus 80 ~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a---------------~~~~~~~~t~d~i 144 (272) T protein:vir:98 80 TMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKS---------------TQTVEATATVDGV 144 (272) T ss_pred EEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhccc---------------ccccccccCHHHH Confidence 999999999999999998876 799999999999999999999998642211 1122345578999 Q ss_pred HHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhcc--CC--cccccCcccCCCceeecceEEEcCCCCcCceEEeeccceEE Q lcl|NC_019933. 266 RLALLQAQLAEFPATGIVLNPADWAGIELLKDT--QG--RYILGNPQGTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQ 341 (394) Q Consensus 266 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~--~G--~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~ 341 (394) .++...+...+.....|+|||.++..|++.... .+ .+.......+..++++|+||++++.+|++++++.+.. ++. T Consensus 145 ~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s~~~p~~t~~~~~~~-a~~ 223 (272) T protein:vir:98 145 SKALDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRSRKCPKGTAYMVRKG-ALR 223 (272) T ss_pred HHHHHHHhccCCCccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEcCCCCcceEEEEcCC-eEE Confidence 999999988888889999999999999876422 11 1111222233446899999999999999999888754 566 Q ss_pred EEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 342 VFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 342 ~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) ++.+.++.++.+++. .++...++...|+++++.+|++++++++++|+-- T Consensus 224 ~~~~~~~~ve~~r~~----~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:98 224 IMLKRNTMVETDRDI----TKAINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred EEecCCceeeecccc----ccceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 777788787765543 4677899999999999999999999999988877 No 105 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=100.00 E-value=4.1e-33 Score=198.15 Aligned_cols=263 Identities=17% Similarity=0.157 Sum_probs=211.8 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHHhcccc----ccccCceeEEEEcCcccccceecCCccccccccceeeE Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQG----TMEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLV 186 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i 186 (394) |..+++..+..++|+.+...+++.+.+.+.+.+++.+. ..+|+++++|++.. .+.+.|++||+.+|.++++++++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~-~~~a~~v~eg~~i~~~~~~~~~~ 79 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDY-IGDAEDVAEGEAIPMTQLGFKKT 79 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecC-CCCcccccCCCcccccccccceE Confidence 66555666789999999999999999988887876542 24567899999865 46788999999999999999999 Q ss_pred EeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccccccchHHHH Q lcl|NC_019933. 187 QTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDRL 265 (394) Q Consensus 187 ~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i 265 (394) .+.+++++..+.+|+++..++ +++.+.+.+++++++++++|..++....+. ....++..+++.+ T Consensus 80 ~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a---------------~~~~~~~~t~d~i 144 (272) T protein:vir:30 80 TMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKS---------------TQTVEATATVDGV 144 (272) T ss_pred EEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhccc---------------ccccccccCHHHH Confidence 999999999999999998876 799999999999999999999998642211 1122345578999 Q ss_pred HHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhcc--CC--cccccCcccCCCceeecceEEEcCCCCcCceEEeeccceEE Q lcl|NC_019933. 266 RLALLQAQLAEFPATGIVLNPADWAGIELLKDT--QG--RYILGNPQGTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQ 341 (394) Q Consensus 266 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~--~G--~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~ 341 (394) .++...+...+.....|+|||.++..|++.... .+ .+.......+..++++|+||++++.+|++++++.+.. ++. T Consensus 145 ~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s~~~p~~t~~~~~~~-a~~ 223 (272) T protein:vir:30 145 SKALDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRSRKCPKGTAYMVRKG-ALR 223 (272) T ss_pred HHHHHHHhccCCCccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEcCCCCcceEEEEcCC-eEE Confidence 999999988888889999999999999876422 11 1111222233446899999999999999999888754 566 Q ss_pred EEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 342 VFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 342 ~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) ++.+.++.++.+++. .++...++...|+++++.+|++++++++++|+-- T Consensus 224 ~~~~~~~~ve~~r~~----~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:30 224 IMLKRNTMVETDRDI----TKAINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred EEecCCceeeecccc----ccceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 777788787765543 4677899999999999999999999999988877 No 106 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=99.90 E-value=4e-25 Score=154.36 Aligned_cols=263 Identities=14% Similarity=0.112 Sum_probs=200.6 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccc----cccCceeEEEEcCcccccceecCCccccccccceeeE Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGT----MEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLV 186 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i 186 (394) |....+.-+..++|+.|...+.+.+.+...+.+++...+ -+|.++++|++... +.+.++.||+.++.++.++++. T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~-g~~~~~~eg~~i~~~~it~~~~ 79 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYS-GDAQVVAEGEKIPTDILETKKR 79 (274) T ss_pred CCccceehhheechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccC-CCcccccCCCccccccccccee Confidence 555666667789999999999999988888878776532 13568999998753 5677889999999999999999 Q ss_pred EeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccccccchHHHH Q lcl|NC_019933. 187 QTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDRL 265 (394) Q Consensus 187 ~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i 265 (394) .+..++.+..+.++++....+ .++.+.+.++++.++++.+|..++....++.. ...+...+++.+ T Consensus 80 ~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~--------------~~~~~~~~~d~i 145 (274) T protein:vir:93 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL--------------TVNADITKLNGL 145 (274) T ss_pred EEEeeeecccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------------cccccccCHHHH Confidence 999999999999999987766 68899999999999999999998865432211 012234568899 Q ss_pred HHHHHHhhhhcCCCCeeEeCHHHHHHHHHhh--ccCCccccc--CcccCCCceeecceEEEcCCCCcCceEEeeccceEE Q lcl|NC_019933. 266 RLALLQAQLAEFPATGIVLNPADWAGIELLK--DTQGRYILG--NPQGTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQ 341 (394) Q Consensus 266 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lk--d~~G~~~~~--~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~ 341 (394) .++...+...+.....++|||.++..|++.. +.....-.. ....+.-++++|+||++++.+|.++.++.... ++. T Consensus 146 ~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~g-ai~ 224 (274) T protein:vir:93 146 QSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKG-AVK 224 (274) T ss_pred HHHHHHhhhccCCccEEEeCHHHHHHHHhhhhhcccccccccccceeecccceecCeeEEEcCCCCcceEEEEeCC-eEE Confidence 9999888888778889999999999997532 110000000 11223346899999999999999998887754 555 Q ss_pred EEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 342 VFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 342 ~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) ++...++.++..++. .+....+++..++++++.+|+++++++ ++++|+ T Consensus 225 ~~~~~~~~vE~~Rd~----~~~~d~i~~~~~y~~~~~~~~~~v~~t-~~~~s~ 272 (274) T protein:vir:93 225 LILKRDFFLEVARDA----STKTTALYSDKHYVAYLYDESKAVKIT-KGSGSL 272 (274) T ss_pred EEecCCcccccccch----hhcccEEEEEEEEEEEEEcCCceEEEe-eCcccc Confidence 666677777655543 345679999999999999999999999 444455 No 107 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=99.89 E-value=3.7e-25 Score=154.51 Aligned_cols=261 Identities=19% Similarity=0.158 Sum_probs=199.9 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHHhcccccc----ccCceeEEEEcCcccccceecCCccccccccceeeE Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTM----EGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLV 186 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i 186 (394) |..+.+.-...++|+.|.+.+.+.+.+...+.+++...+. +|.++++|++... +.+.++.||..++..+.+.++. T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~-gda~~~~eg~~i~~~~lt~~~~ 79 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTYI-GDAADVAEGGEISLDKIGTTTK 79 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeeccC-ccccccCCCCccChhhcCCcce Confidence 5555556667889999999999999888888887765432 4678999998754 5667899999999999999999 Q ss_pred EeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccccccchHHHH Q lcl|NC_019933. 187 QTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDRL 265 (394) Q Consensus 187 ~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i 265 (394) .+..++.+..+.++++....+ .++.+.+.++++.++++.+|+.++....+ .....+...+++.+ T Consensus 80 ~~~i~~~~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~---------------~~~~~~~~~~~d~i 144 (272) T protein:vir:36 80 SVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKT---------------TSQTVSTKANVDGV 144 (272) T ss_pred eEeeehhhccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcc---------------ccccccccccHHHH Confidence 999999999999999987666 68999999999999999999988854211 11123445678899 Q ss_pred HHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCc--ccccC-cccCCCceeecceEEEcCCCCcCceEEe---eccce Q lcl|NC_019933. 266 RLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGR--YILGN-PQGTLAPTLWGLPVVATQAMAVGQFLTG---AFDAG 339 (394) Q Consensus 266 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~--~~~~~-~~~~~~~~l~G~pv~~~~~~p~~~~~~g---d~~~~ 339 (394) .++...+...+....+++|||.++..|++.....-. ..... ...+.-++++|++|++++.+|.++.+.. ..+.+ T Consensus 145 ~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~p~~~~~~~~~~~~~gA 224 (272) T protein:vir:36 145 QAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPA 224 (272) T ss_pred HHHHHHhhhcCCCceEEEEcHHHHHHHhcccccccccccccccceeeeccceecCeeEEEeCCCCCCceeEEEEEecccc Confidence 999999988888888999999999999864322111 11111 1122346899999999999999876432 22445 Q ss_pred EEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|NC_019933. 340 AQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAA 391 (394) Q Consensus 340 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a 391 (394) +..+...++.+|..++ ..+....+++..+++.++.+|+++++++++.. T Consensus 225 ~~~~~~~~~~vE~~R~----~~~~~d~i~~~~~y~~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 225 LKLVLKRGVQVETDRD----IVTKTTVITADEHYAAYLYDLTKVVNITFTGV 272 (272) T ss_pred eeeeecCCcccccccc----hhhcCcEEEEEEEEEEEEEcCccEEEEeecCC Confidence 5556666777765443 33556789999999999999999999999998 No 108 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=99.87 E-value=3e-24 Score=149.56 Aligned_cols=299 Identities=15% Similarity=0.120 Sum_probs=214.3 Q ss_pred HHH--HhhhhhhhhHHHHH-HHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEcCcccc Q lcl|NC_019933. 90 MAE--SGGQRGRAEINIKA-AITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTNA 166 (394) Q Consensus 90 ~~~--~~~~~~~~~~~~~~-~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 166 (394) +.+ .....++++.-... =..++.+-+...++.+.|......||+.+.+.+.|++++++.++.++.+.+++.... +. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~p~l~m~alTLaea~~l~~d~~~~~VIE~l~~~s~iL~~lpf~~ve~~~~~~~r~~~l-p~ 79 (330) T protein:vir:94 1 MVRICTPPLRGRWRTLTHQFPELKMPTVTLAESAKLSQDHLVSGLIETIVEVNPLYEMMPFTEIEGNALAYNRENVL-GD 79 (330) T ss_pred CceecCCccccceeehhccccccchhhhhhhHHhhcCchhhHHHHHHhhhccchHHhhcccccccCCcceeeeeecC-Cc Confidence 000 00011111110000 011233334555788999999999999999999999999999998988999998774 78 Q ss_pred cceecCCccccccc-cceeeEEeeeeeEEEeehhhHHHHHH--HH-HHHHHHHHHHHHHHHHHHHHHHhhccCCCccccc Q lcl|NC_019933. 167 AAPVAEGAQKPESS-LRFDLVQTSAKVIAHWMKASRQILSD--SA-QLQSFINARLLRGLEVVEENQLLNGNGTGQNLLG 242 (394) Q Consensus 167 ~~~~~eg~~~~~~~-~~~~~i~~~~~k~~~~~~is~e~l~~--s~-~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~G 242 (394) +.|...++..+.+. .+|.+++...+.+++.+.|.+++.+- ++ +...+-.+...+++.++.+..+|+|+...+.+.| T Consensus 80 a~~r~~n~~~~~~~~~Tf~q~t~~l~~l~~~~~Vd~~iadl~g~~~d~~~~q~~~~ieal~~~~e~~linGDs~~~~F~G 159 (330) T protein:vir:94 80 VQFLAVGGTITAKNPATFTKVTSELTTLIGDAEVNGLIQATRSDFMDQTSVQVASKAKSIGRQYQASMITGDGTGNSFQG 159 (330) T ss_pred ceeeeccccccccCcceeeeeeechhhhhhhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccc Confidence 88998888887765 47999999999999999999999653 33 7788888899999999999999999988888999 Q ss_pred ccccccccccc--ccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccc-cCcc---cCCCceee Q lcl|NC_019933. 243 LLPQATAFAAP--ITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYIL-GNPQ---GTLAPTLW 316 (394) Q Consensus 243 i~~~~~~~~~~--~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~-~~~~---~~~~~~l~ 316 (394) +++........ .+..+..+.+++-.++..+......+++|+||+.+..+|+.+....|++-. +... +....++. T Consensus 160 L~~~~~~~q~i~tg~~gg~~T~d~LDeLl~~v~~~~g~~~~~l~n~a~~r~I~a~~R~~~~~~v~~~~~~~~G~~v~~~~ 239 (330) T protein:vir:94 160 MMGLVAASQTISAGANGGTLTFELLDQLLDLVKDKDGQVDYLMSSFAMRRKYFSLLRALGGAAIGEVMTLPSGRQIPTYR 239 (330) T ss_pred hhhcCCcccEEecCCCCCCCCHHHHHHHHHHhcCCCCCCcEEEechhHHHHHHHHHHhccCCCCCCcccccCCCEEeeeC Confidence 98765443322 223466778888777777776677899999999999999998876665443 2222 22234688 Q ss_pred cceEEEcCCCCcC----------ceEEeecc-----ceEEEEe---ecceEEEEecccchhhhcCcEEEEEEEEeccEEe Q lcl|NC_019933. 317 GLPVVATQAMAVG----------QFLTGAFD-----AGAQVFD---RWAARVEVATENQDDFIKNMVTILAEERLALAVY 378 (394) Q Consensus 317 G~pv~~~~~~p~~----------~~~~gd~~-----~~~~~~~---~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~ 378 (394) |+|++.++.+|.+ .+++..|- +++.... ..+++++. .+..-.++...+++++||+.++. T Consensus 240 GvPi~~~d~ip~~~~~~~~~~ttsIyav~~G~~~~~qgV~Gl~~~g~~glsVr~---~G~~~~k~v~~~~v~~y~~~av~ 316 (330) T protein:vir:94 240 GVPWFVNDFIPSNMTQGTATNATAIFAGTFDDGSNKYGIAGLTARGSAGLRVQN---VGAKENADETITRVKMYCGFANF 316 (330) T ss_pred CeEEEecccccCCCCcccCCCceeEEEEeecccccccceEeecCCCCCcceeee---CCCccccceeeEEEEEeeeeEEe Confidence 9999999988864 34443332 2333322 22455533 22122456788999999999999 Q ss_pred cccceEEEEecCCC Q lcl|NC_019933. 379 RPESFIKGSLAAAA 392 (394) Q Consensus 379 ~~~a~~~l~~~~a~ 392 (394) +++|+++|+--..+ T Consensus 317 ~~~a~~~L~~V~~g 330 (330) T protein:vir:94 317 SQLGLAAIKGLIPG 330 (330) T ss_pred chhheeeeccccCC Confidence 99999999866666 No 109 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=99.86 E-value=4.9e-23 Score=142.88 Aligned_cols=262 Identities=12% Similarity=0.131 Sum_probs=200.0 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccc----cccCceeEEEEcCcccccceecCCccccccccceeeE Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGT----MEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLV 186 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i 186 (394) +....+.-+..++|+.|...+.+.+.....+.+++...+ -+|+++++|++.. .+.+....+|+.++..+.+++.. T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~-~g~~~~~~~g~~i~~~~it~~~~ 79 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTY-SGDAQVIAEGEKIPVDQIGTSKR 79 (274) T ss_pred CCccccchhhhhhhHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeecc-CCCccccCCCCcCchhhccccee Confidence 554455556889999999999999988877777765532 2467899999874 34566788899999999999999 Q ss_pred EeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccccccchHHHH Q lcl|NC_019933. 187 QTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDRL 265 (394) Q Consensus 187 ~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i 265 (394) .+..++.+..+.++++....+ .++.+.+.++++.++++.+|..++....++. ....++..+++.+ T Consensus 80 ~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~--------------~~~~~~~~~~d~i 145 (274) T protein:vir:96 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT--------------LTVEADITKLDGL 145 (274) T ss_pred EEEEEeeeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--------------CCcCcccccHHHH Confidence 999999999999999987666 5888999999999999999998886432211 1112344568999 Q ss_pred HHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCccccc------CcccCCCceeecceEEEcCCCCcCceEEeeccce Q lcl|NC_019933. 266 RLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILG------NPQGTLAPTLWGLPVVATQAMAVGQFLTGAFDAG 339 (394) Q Consensus 266 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~------~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~ 339 (394) .++...+.........++|||..+..|++.... +++-. ....+.-++++|++|++++.+|.++.++... .+ T Consensus 146 ~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~--~f~~~~~~g~~~~~~g~ig~~~G~~Vi~s~~~p~~t~~l~~~-gA 222 (274) T protein:vir:96 146 QTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASD--NFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKK-GA 222 (274) T ss_pred HHHHHHhcccCCCceEEEeCHHHHHHHHhcccc--cccccccccccceeecccceecCeeEEEcCCCCcceEEEEeC-cc Confidence 999988888777888999999999999876311 11111 1112234689999999999999999877664 45 Q ss_pred EEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 340 AQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 340 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) +.++...++.++..++ ..+....+++..+++.++.+|+++++++.+++--- T Consensus 223 ~~~~~~~~~~vE~~Rd----~~~~~d~i~~~~~yg~~~~~~~~vv~~t~~~~~~~ 273 (274) T protein:vir:96 223 VKLITKRDFFLEKDRD----ASRKSTALYSDKHYVAYLYDESKVVKITKGAGDEV 273 (274) T ss_pred eeeeecCCcccccccc----hhhcccEEEEeeEEEEEEEcCccEEEEEcCccccc Confidence 6566667767765443 34567789999999999999999999998877766 No 110 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=99.86 E-value=3.1e-23 Score=144.00 Aligned_cols=262 Identities=11% Similarity=0.117 Sum_probs=200.8 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccc----cccCceeEEEEcCcccccceecCCccccccccceeeE Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGT----MEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLV 186 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i 186 (394) +....+.-...++|+.|.+.+.+.+.+...+.+++...+ .+|.++++|++... +.+..+.||+.++..+.++++. T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~i-gda~~~~eg~~i~~~~lt~~~~ 79 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYS-GDATVVPEGQKIPVDKIETNRR 79 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCC-CccccccCCCccCcccccccee Confidence 554445556788999999999999999888888876543 35778999998754 5667799999999999999999 Q ss_pred EeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccccccchHHHH Q lcl|NC_019933. 187 QTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDRL 265 (394) Q Consensus 187 ~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i 265 (394) ....++.+..+.++++....+ .|....+.++++.++++.+|..++.-...+. . ....+..+++.+ T Consensus 80 ~a~i~~~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~-------------~-~~~~~~~t~d~i 145 (276) T protein:vir:10 80 EAKIHKIGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTK-------------L-TVSADIGTLAGL 145 (276) T ss_pred eEEeehccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccc-------------c-cccccccCHHHH Confidence 999999999999999987766 5888999999999999999998885321110 0 012334578999 Q ss_pred HHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCccccc-----C-cccCCCceeecceEEEcCCCCcCceEEeeccce Q lcl|NC_019933. 266 RLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILG-----N-PQGTLAPTLWGLPVVATQAMAVGQFLTGAFDAG 339 (394) Q Consensus 266 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~-----~-~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~ 339 (394) .++...+........+++|||..+..|+++.... ++.. . ...+.-++++|++|++++.+|.++.++... .+ T Consensus 146 ~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~--f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~-gA 222 (276) T protein:vir:10 146 EAAIDTFDDEDLEPMVLFINPKDAGKLRSSASDN--FTRATELGDNIIVKGAFGEALGAVIVRSKKLDEGEAILAKR-GA 222 (276) T ss_pred HHHHHHhccccCcccEEEEcHHHHHHHHHhcccc--ccccccccccceeccccceecceeEEEcCCCCcceEEEEec-cc Confidence 9999888887778889999999999998754221 1110 0 112234689999999999999999876653 45 Q ss_pred EEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 340 AQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 340 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) +.++...++.++.+++. .+....+++..+++.++.+|..++++++++-+.. T Consensus 223 i~~~~~~~~~vE~dRd~----~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~ 273 (276) T protein:vir:10 223 VKLITKRDFFLETDRDP----STKTTALYSDKHYVAYLYDESKAVKVTKGAGTTD 273 (276) T ss_pred eeeeecCCceeecccch----hhcccEEEEeeEEEEEEEcCcceEEEecCCcCCc Confidence 66677777777665543 3456789999999999999999999996652222 No 111 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=99.86 E-value=7.3e-23 Score=141.94 Aligned_cols=260 Identities=14% Similarity=0.129 Sum_probs=198.1 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccc----cccCceeEEEEcCcccccceecCCccccccccceeeE Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGT----MEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLV 186 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i 186 (394) +....+.-+..++|+.|...+.+.+.....+.+++...+ .+|+++++|++... +.+..+.+|+.++..+.+.++. T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~-g~a~~~~~g~~i~~~~lt~~~~ 79 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYS-GDAQVVAEGEKIPTDILETKKR 79 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCC-CccccccCCCccccccccccee Confidence 555555667889999999999999888777777765533 24778999998743 4566788999999999999999 Q ss_pred EeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccccccchHHHH Q lcl|NC_019933. 187 QTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDRL 265 (394) Q Consensus 187 ~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i 265 (394) .+..++.+..+.++++....+ .++.+.+.++++.++++.+|..++....++.. + ..+...+++.+ T Consensus 80 ~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~-----------~---~~~~~~~~d~i 145 (274) T protein:vir:94 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL-----------T---VNADITKLNGL 145 (274) T ss_pred EEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCc-----------c---ccccccCHHHH Confidence 999999999999999976655 58889999999999999999988864322110 0 11234568999 Q ss_pred HHHHHHhhhhcCCCCeeEeCHHHHHHHHHhh--c----cC-CcccccCcccCCCceeecceEEEcCCCCcCceEEeeccc Q lcl|NC_019933. 266 RLALLQAQLAEFPATGIVLNPADWAGIELLK--D----TQ-GRYILGNPQGTLAPTLWGLPVVATQAMAVGQFLTGAFDA 338 (394) Q Consensus 266 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lk--d----~~-G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~ 338 (394) .++...+.........++|||..+..|++.. + +. |..+ ...+.-++++|++|++++.+|.++.++... . T Consensus 146 ~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~---~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~-g 221 (274) T protein:vir:94 146 QSAIDKFNDEDLEPMVLFVNPLDAGKLRGDASTNFTRATELGDDI---IVKGAFGEALGAIIVRTNKLEAGTAILAKK-G 221 (274) T ss_pred HHHHHHhhccCCCceEEEeCHHHHHHHHhhhhhhccccCcccccc---eeccccceecCeeEEEcCCCCcceEEEEeC-c Confidence 9999998888778889999999999987531 1 11 1111 122334689999999999999999887764 4 Q ss_pred eEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 339 GAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 339 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) ++.++.+.++.++.+++. .+....+++..++++++.+|.++++++ ++.+|+ T Consensus 222 A~~~~~~~~~~vE~~Rd~----~~~~d~i~~~~~y~~~~~~~~~vv~~t-~~~~~~ 272 (274) T protein:vir:94 222 AVKLILKRDFFLEVARDA----STKTTALYSDKHYVAYLYDESKAVKIT-KGSGSL 272 (274) T ss_pred ceEeeecCCceeccccch----hhcccEEEEEEEEEEEEEcCCceEEEe-cCcccc Confidence 566666777777665543 345678999999999999999999999 455555 No 112 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=99.86 E-value=7.3e-23 Score=141.94 Aligned_cols=260 Identities=14% Similarity=0.129 Sum_probs=198.1 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccc----cccCceeEEEEcCcccccceecCCccccccccceeeE Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGT----MEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLV 186 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i 186 (394) +....+.-+..++|+.|...+.+.+.....+.+++...+ .+|+++++|++... +.+..+.+|+.++..+.+.++. T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~-g~a~~~~~g~~i~~~~lt~~~~ 79 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYS-GDAQVVAEGEKIPTDILETKKR 79 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCC-CccccccCCCccccccccccee Confidence 555555667889999999999999888777777765533 24778999998743 4566788999999999999999 Q ss_pred EeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccccccchHHHH Q lcl|NC_019933. 187 QTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDRL 265 (394) Q Consensus 187 ~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i 265 (394) .+..++.+..+.++++....+ .++.+.+.++++.++++.+|..++....++.. + ..+...+++.+ T Consensus 80 ~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~-----------~---~~~~~~~~d~i 145 (274) T protein:vir:97 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL-----------T---VNADITKLNGL 145 (274) T ss_pred EEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCc-----------c---ccccccCHHHH Confidence 999999999999999976655 58889999999999999999988864322110 0 11234568999 Q ss_pred HHHHHHhhhhcCCCCeeEeCHHHHHHHHHhh--c----cC-CcccccCcccCCCceeecceEEEcCCCCcCceEEeeccc Q lcl|NC_019933. 266 RLALLQAQLAEFPATGIVLNPADWAGIELLK--D----TQ-GRYILGNPQGTLAPTLWGLPVVATQAMAVGQFLTGAFDA 338 (394) Q Consensus 266 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lk--d----~~-G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~ 338 (394) .++...+.........++|||..+..|++.. + +. |..+ ...+.-++++|++|++++.+|.++.++... . T Consensus 146 ~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~---~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~-g 221 (274) T protein:vir:97 146 QSAIDKFNDEDLEPMVLFVNPLDAGKLRGDASTNFTRATELGDDI---IVKGAFGEALGAIIVRTNKLEAGTAILAKK-G 221 (274) T ss_pred HHHHHHhhccCCCceEEEeCHHHHHHHHhhhhhhccccCcccccc---eeccccceecCeeEEEcCCCCcceEEEEeC-c Confidence 9999998888778889999999999987531 1 11 1111 122334689999999999999999887764 4 Q ss_pred eEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 339 GAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 339 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) ++.++.+.++.++.+++. .+....+++..++++++.+|.++++++ ++.+|+ T Consensus 222 A~~~~~~~~~~vE~~Rd~----~~~~d~i~~~~~y~~~~~~~~~vv~~t-~~~~~~ 272 (274) T protein:vir:97 222 AVKLILKRDFFLEVARDA----STKTTALYSDKHYVAYLYDESKAVKIT-KGSGSL 272 (274) T ss_pred ceEeeecCCceeccccch----hhcccEEEEEEEEEEEEEcCCceEEEe-cCcccc Confidence 566666777777665543 345678999999999999999999999 455555 No 113 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=99.85 E-value=7e-23 Score=142.03 Aligned_cols=262 Identities=12% Similarity=0.096 Sum_probs=198.8 Q ss_pred hcccccCCcCccccchhhhhHHHhhhhhhhhHHHhcccccc----ccCceeEEEEcCcccccceecCCccccccccceee Q lcl|NC_019933. 110 SLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTM----EGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDL 185 (394) Q Consensus 110 ~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~ 185 (394) -...+.+.-+..++|+.|...+.+.+.+...+.+++.+-+. +|+++++|++... +.+..+.+|+.++..+.++++ T Consensus 1 ~~~~~~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~i-g~a~~~~~g~~i~~~~lt~~~ 79 (275) T protein:vir:96 1 MALENMTKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVYS-GDAKVVPEGEEIPIDLIETKK 79 (275) T ss_pred CCCcccchhhhhhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeeccC-CccccccCCCCcchhhcccce Confidence 11122344456788999999999999998888888765442 4678999998754 566778999999999999999 Q ss_pred EEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccccccchHHH Q lcl|NC_019933. 186 VQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDR 264 (394) Q Consensus 186 i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~ 264 (394) .....++.+..+.++++....+ .++...+.++++.++++.+|..++...+++.. + ..++..+++. T Consensus 80 ~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~~-----------~---~~~~~~~~d~ 145 (275) T protein:vir:96 80 RQATIRKIGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGATL-----------K---VEADITKLAG 145 (275) T ss_pred eeEEeehhcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc-----------c---ccccccCHHH Confidence 9999999999999999987666 57888899999999999999988864322110 0 1234557899 Q ss_pred HHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhcc-------CCcccccCcccCCCceeecceEEEcCCCCcCceEEeecc Q lcl|NC_019933. 265 LRLALLQAQLAEFPATGIVLNPADWAGIELLKDT-------QGRYILGNPQGTLAPTLWGLPVVATQAMAVGQFLTGAFD 337 (394) Q Consensus 265 i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~-------~G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~ 337 (394) +.++...+.........++|||..+..|+++... .|..+ ...+.-++++|++|++++.+|.++.++.. . T Consensus 146 i~dA~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~---~~~G~ig~~~G~~Vi~s~~~p~~t~~i~~-~ 221 (275) T protein:vir:96 146 LQTAIDKFNDEDLEPMVLFVNPLDAGKLRASATDNFTRATLLGDNV---IVKGAFGEALGAIIVRSNKIKEGEAILAK-R 221 (275) T ss_pred HHHHHHHhccccCCccEEEeCHHHHHHHHhcccccccccccccccc---eeccccceecCeeEEEeCCCCcceEEEEe-c Confidence 9999988877777788999999999999876311 11111 11233468999999999999999887665 3 Q ss_pred ceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 338 AGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 338 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) .++.++...++.++.+++. .+....+++..+++.++.+|++++++++++++=- T Consensus 222 gA~~~~~~~~~~vE~~Rd~----~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~ 274 (275) T protein:vir:96 222 GAVKLITKRDFFLETERHA----SHKSTALFSDKHYVAYLYDESKVVKITKSASGLG 274 (275) T ss_pred cceeeeecCCcccccccch----hhcCcEEEEeEEEEEEEEcCccEEEEEecccccC Confidence 4556666667677655543 3556789999999999999999999988665544 No 114 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=99.84 E-value=2e-22 Score=139.55 Aligned_cols=268 Identities=15% Similarity=0.049 Sum_probs=192.5 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccc----cccCceeEEEEcCcccccceecCCccccccccceeeE Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGT----MEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLV 186 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i 186 (394) |...++.-+..++|+.|.+.+.+.+.+...+.+++.... -+|.++++|++... +.+.++.+|+.++..++++++. T Consensus 1 Ma~~~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~-g~a~~~~~g~~i~~~~lt~~~~ 79 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKYI-GDAQDVAEGAAIDYSALETESV 79 (278) T ss_pred CCCcceehhheecHHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeeccC-CcceeecCCCcCccccccccee Confidence 444455557789999999999999988887777765432 24668999998753 4567889999999999999999 Q ss_pred EeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccccccchHHHH Q lcl|NC_019933. 187 QTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDRL 265 (394) Q Consensus 187 ~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i 265 (394) .+..++.+..+.++++....+ .++.+.+.++++.++++.+|..++..-.+... + .. ...........++.+ T Consensus 80 ~~~i~~~~~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~-----~-~~--~~~t~~~~~~~~~~~ 151 (278) T protein:vir:80 80 KHGIKKAGKGVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTL-----E-VK--GAINIGLIDKIENTF 151 (278) T ss_pred eEeeehhhccccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-----c-cc--cccccchhhhHHHHH Confidence 999999999999999987666 58999999999999999999988864321110 0 00 001111122345666 Q ss_pred HHHHHHhhhhcCC-CCeeEeCHHHHHHHHHhhccCC--cccc--cCcccCCCceeecceEEEcCCCCcCceEEeeccceE Q lcl|NC_019933. 266 RLALLQAQLAEFP-ATGIVLNPADWAGIELLKDTQG--RYIL--GNPQGTLAPTLWGLPVVATQAMAVGQFLTGAFDAGA 340 (394) Q Consensus 266 ~~~~~~~~~~~~~-~~~~~~~~~~~~~l~~lkd~~G--~~~~--~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~ 340 (394) .++...+..++.. ...++|||..+..|++....+. ..-. .....+.-++++|++|++++.+|.++.++... .++ T Consensus 152 ~da~~~l~~~~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~-gAi 230 (278) T protein:vir:80 152 TDAPDAIEDESITTTGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVKGAFGELLGWEIVRTKKLADGNALAVKA-GAL 230 (278) T ss_pred HHHHHhhcccCCCcccEEEECHHHHHHHHhhhhhhccccccccccceeeccceeecceeEEEcCCCCcceEEEEec-cce Confidence 6776666655544 3368899999999986532111 0000 11122334689999999999999998877664 355 Q ss_pred EEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecCCC Q lcl|NC_019933. 341 QVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAAA 392 (394) Q Consensus 341 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~ 392 (394) ..+...++.++.+++ ..+....+++..+++.++.+|+++++++..+.. T Consensus 231 ~~~~~~~~~vE~~Rd----~~~~~d~i~~~~~yg~~v~~~~~~v~it~~a~~ 278 (278) T protein:vir:80 231 KTFLKRNLLAESGRD----MDHKLTKFNADQHYAVALVDETKAVKVVPVAGN 278 (278) T ss_pred eeeecCCcccccccc----hhhccceeeeeeEEEEEEEcCcceEEEeeccCC Confidence 566667777765543 335567899999999999999999999887777 No 115 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=99.83 E-value=7.3e-22 Score=136.48 Aligned_cols=260 Identities=13% Similarity=0.122 Sum_probs=196.0 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccc----cccCceeEEEEcCcccccceecCCccccccccceeeE Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGT----MEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLV 186 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i 186 (394) +....+.-+..++|+.|...+.+.+.....+.+++..-. .+|+++++|++... +.+..+.+|+.++..+.+.++. T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~i-g~a~~~~~g~~i~~~~lt~~~~ 79 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYS-GDAQVVAEGEKIPTDILETKKR 79 (274) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecCC-CccccccCCCccchhhccccee Confidence 444455566789999999999998887777777765532 35778999998753 4566788999999999999999 Q ss_pred EeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccccccchHHHH Q lcl|NC_019933. 187 QTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDRL 265 (394) Q Consensus 187 ~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i 265 (394) .+..++.+..+.++++....+ .++.+.+.++++.++++.+|..++....++.. .......+++.+ T Consensus 80 ~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~--------------~~~~~a~~~d~i 145 (274) T protein:vir:12 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL--------------TVNADITKLNGL 145 (274) T ss_pred eEEeeeecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------------cccccccCHHHH Confidence 999999999999999876555 57888899999999999999988864322111 012344578999 Q ss_pred HHHHHHhhhhcCCCCeeEeCHHHHHHHHHhh--c----cC-CcccccCcccCCCceeecceEEEcCCCCcCceEEeeccc Q lcl|NC_019933. 266 RLALLQAQLAEFPATGIVLNPADWAGIELLK--D----TQ-GRYILGNPQGTLAPTLWGLPVVATQAMAVGQFLTGAFDA 338 (394) Q Consensus 266 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lk--d----~~-G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~ 338 (394) .++...+.........++|||..+..|++.. + +. |.. ....+.-++++|++|++++.+|.++.++... . T Consensus 146 ~dA~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g~~---~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~-g 221 (274) T protein:vir:12 146 QSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDD---IIVKGAFGEALGAIIVRSNKLEAGTAILAKK-G 221 (274) T ss_pred HHHHHHhccccccccEEEeCHHHHHHHHhhhhhhcccccccccc---ceecccceeecCeeEEEeCCCCcceEEEEec-c Confidence 9999888877777889999999999987631 1 11 111 1122334679999999999999998765543 4 Q ss_pred eEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 339 GAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 339 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) ++..+...++.+|.+++.. +....+++..++++++.+|+.+++++ ++++|+ T Consensus 222 A~~~~~~~~~~vE~~Rd~~----~~~d~i~~~~~y~~~~~~~~~vv~~t-~~~~~~ 272 (274) T protein:vir:12 222 AVKLILKRDFFLEVARDAS----TKTTALYSDKHYVAYLYDESKAVKIT-KGSGSL 272 (274) T ss_pred ceeeeecCCceeccccchh----hcccEEEeeeEEEEEEEcCCceEEEE-cCCccc Confidence 5556667777776655443 45678999999999999999999999 555566 No 116 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=99.83 E-value=6.7e-22 Score=136.66 Aligned_cols=261 Identities=8% Similarity=0.023 Sum_probs=196.0 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHHhcccccc----ccCceeEEEEcCcccccceecCCccccccccceeeE Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTM----EGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLV 186 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i 186 (394) |..+. -...++|+.+.+.+.+.+.+...+.+++...+. +|.++++|.+.. .+.+..+.||+.++..+.++++- T Consensus 1 Ma~T~--~~d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~-igdae~~~eg~~i~~~~lt~~~~ 77 (270) T protein:vir:95 1 MTQTK--KANLINPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYAY-IGAAEDLQEGVAMDTTQMSMTTT 77 (270) T ss_pred CCcee--hhhhcchHHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeecC-CCccccccCCCccchhhcccchh Confidence 33332 245689999999999999888888888765432 577899999874 45667788999999999999999 Q ss_pred EeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccccccchHHHH Q lcl|NC_019933. 187 QTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDRL 265 (394) Q Consensus 187 ~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i 265 (394) ....++.+..+.++++....+ .|....+.++++..+++++|+.++.-.. | .....+...+++.+ T Consensus 78 ~a~i~~~gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~------~---------a~~~~~~~~t~~~~ 142 (270) T protein:vir:95 78 KVTVKETGKAVEVTQTAIITNVNGTLQEASRQLAMSLADKVEIDYIAELN------K---------SKQTATVSADATGI 142 (270) T ss_pred eeeeehhhCcceecHHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHhc------c---------cccccccccCHHHH Confidence 999999999999999987666 5788889999999999999998874311 1 11112344578899 Q ss_pred HHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCC-cccccCcccCCCceeecceEEEcCCCC-cCceEEeeccceEEEE Q lcl|NC_019933. 266 RLALLQAQLAEFPATGIVLNPADWAGIELLKDTQG-RYILGNPQGTLAPTLWGLPVVATQAMA-VGQFLTGAFDAGAQVF 343 (394) Q Consensus 266 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G-~~~~~~~~~~~~~~l~G~pv~~~~~~p-~~~~~~gd~~~~~~~~ 343 (394) .++...+........+++|||.++..|++...-.+ ++.......+.-++++|++|++++.+| .++.++.. ..++.++ T Consensus 143 ~dA~~~lgd~~~~~~~i~vhs~~~~~Lrk~~~~~~~~~~~~~~~~G~ig~~~G~~Viv~s~~~~~~~~~l~~-~gAi~~~ 221 (270) T protein:vir:95 143 LDAIEVFNSENDEDYVLYVNPKDYNKLVKSLFKVGGNVQDRAISKGDLVEIVGVSDIVKSKRVSENTAFLQR-YGAMEIV 221 (270) T ss_pred HHHHHHhccccCCCcEEEEcHHHHHHHHhhhcccccccccchhcccccceecceeEEEeCCCCCceeEEEEe-ccceeee Confidence 99998888888888899999999999986432111 111111222345689999998877554 55555333 4566677 Q ss_pred eecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 344 DRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 344 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) ...++.+|.+++. .+....+++..++++++.++..+++++++.+.+| T Consensus 222 ~~~~~~vEtdRd~----~~~~d~i~~~~~y~v~~~~~skvv~~t~~~a~~~ 268 (270) T protein:vir:95 222 NKKKPEAYTDFDI----LKRTHLLSTNYHYSVNLKDETGVVKVTFKPSGSL 268 (270) T ss_pred ecCCceeeeccch----hhcccEEEeeeEEEEEEEccceEEEEEecCCCCc Confidence 7777777665543 3556789999999999999999999999888888 No 117 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=99.82 E-value=2.3e-21 Score=133.68 Aligned_cols=259 Identities=13% Similarity=0.119 Sum_probs=194.8 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccc----cccCceeEEEEcCcccccceecCCccccccccceeeE Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGT----MEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLV 186 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i 186 (394) +....+.-+..++|+.|...+.+.+.....+.+++..-+ -+|+++++|++... +.+..+.+|+.++..+.+.++. T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~i-g~a~~~~~g~~i~~~~lt~~~~ 79 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYS-GDAKVVAEGEKIPTDILETKKR 79 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCC-CccccccCCCccchhhccccee Confidence 444444556788899999999998888777777754432 24778999998753 4566788899999999999999 Q ss_pred EeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccccccchHHHH Q lcl|NC_019933. 187 QTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDRL 265 (394) Q Consensus 187 ~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i 265 (394) .+..++.+..+.++++....+ .++.+.+.++++.++++.+|..++.-..++.. + ..++..+++.+ T Consensus 80 ~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~-----------~---~~~~~~~~d~i 145 (274) T protein:vir:95 80 EAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKL-----------T---VEADITKLTGL 145 (274) T ss_pred EEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-----------c---ccccccCHHHH Confidence 999999999999999876655 58889999999999999999988854332111 0 11234568899 Q ss_pred HHHHHHhhhhcCCCCeeEeCHHHHHHHHHhh--c----cC-CcccccCcccCCCceeecceEEEcCCCCcCceEEeeccc Q lcl|NC_019933. 266 RLALLQAQLAEFPATGIVLNPADWAGIELLK--D----TQ-GRYILGNPQGTLAPTLWGLPVVATQAMAVGQFLTGAFDA 338 (394) Q Consensus 266 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lk--d----~~-G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~ 338 (394) .++...+.........++|||..+..|++.. + +. |.. ....+.-++++|++|++++.+|.++.++... . T Consensus 146 ~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~---~~~~G~ig~~~G~~Vi~s~~~~~~t~~l~~~-g 221 (274) T protein:vir:95 146 QTAIDKFNDEDLEPMVLFISPLDAGKLRGDATTNFTRATELGDD---VIVKGAFGEALGAVIVRSNKLEAGTAILAKK-G 221 (274) T ss_pred HHHHHHhccccccccEEEeCHHHHHHHHhhcccccccccccccc---ceeccccceecCeEEEEeCCCCCceEEEEec-c Confidence 9998888877777789999999999987641 1 11 111 1122334689999999999999998765543 3 Q ss_pred eEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 339 GAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 339 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) ++..+...++.++.+++ ..+....+++..++++++.+|+++++++ +.+|| T Consensus 222 A~~~~~~~~~~vE~~Rd----~~~~~d~i~~~~~y~~~~~~~~~~v~~t--k~~~~ 271 (274) T protein:vir:95 222 AVKLITKRDFFLETDRD----PSTKTTALYSDKHYVAYLYDESKAVKIT--KGSGS 271 (274) T ss_pred ceeeeecCCcccccccc----cccccCEEEEeEEEEEEEEcCCcEEEEE--cCCcc Confidence 45556667777765554 3456788999999999999999999998 56667 No 118 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=99.82 E-value=2.3e-21 Score=133.68 Aligned_cols=259 Identities=13% Similarity=0.119 Sum_probs=194.8 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccc----cccCceeEEEEcCcccccceecCCccccccccceeeE Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGT----MEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLV 186 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i 186 (394) +....+.-+..++|+.|...+.+.+.....+.+++..-+ -+|+++++|++... +.+..+.+|+.++..+.+.++. T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~i-g~a~~~~~g~~i~~~~lt~~~~ 79 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYS-GDAKVVAEGEKIPTDILETKKR 79 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCC-CccccccCCCccchhhccccee Confidence 444444556788899999999998888777777754432 24778999998753 4566788899999999999999 Q ss_pred EeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccccccchHHHH Q lcl|NC_019933. 187 QTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDRL 265 (394) Q Consensus 187 ~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i 265 (394) .+..++.+..+.++++....+ .++.+.+.++++.++++.+|..++.-..++.. + ..++..+++.+ T Consensus 80 ~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~-----------~---~~~~~~~~d~i 145 (274) T protein:vir:96 80 EAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKL-----------T---VEADITKLTGL 145 (274) T ss_pred EEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-----------c---ccccccCHHHH Confidence 999999999999999876655 58889999999999999999988854332111 0 11234568899 Q ss_pred HHHHHHhhhhcCCCCeeEeCHHHHHHHHHhh--c----cC-CcccccCcccCCCceeecceEEEcCCCCcCceEEeeccc Q lcl|NC_019933. 266 RLALLQAQLAEFPATGIVLNPADWAGIELLK--D----TQ-GRYILGNPQGTLAPTLWGLPVVATQAMAVGQFLTGAFDA 338 (394) Q Consensus 266 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lk--d----~~-G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~ 338 (394) .++...+.........++|||..+..|++.. + +. |.. ....+.-++++|++|++++.+|.++.++... . T Consensus 146 ~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~---~~~~G~ig~~~G~~Vi~s~~~~~~t~~l~~~-g 221 (274) T protein:vir:96 146 QTAIDKFNDEDLEPMVLFISPLDAGKLRGDATTNFTRATELGDD---VIVKGAFGEALGAVIVRSNKLEAGTAILAKK-G 221 (274) T ss_pred HHHHHHhccccccccEEEeCHHHHHHHHhhcccccccccccccc---ceeccccceecCeEEEEeCCCCCceEEEEec-c Confidence 9998888877777789999999999987641 1 11 111 1122334689999999999999998765543 3 Q ss_pred eEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 339 GAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 339 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) ++..+...++.++.+++ ..+....+++..++++++.+|+++++++ +.+|| T Consensus 222 A~~~~~~~~~~vE~~Rd----~~~~~d~i~~~~~y~~~~~~~~~~v~~t--k~~~~ 271 (274) T protein:vir:96 222 AVKLITKRDFFLETDRD----PSTKTTALYSDKHYVAYLYDESKAVKIT--KGSGS 271 (274) T ss_pred ceeeeecCCcccccccc----cccccCEEEEeEEEEEEEEcCCcEEEEE--cCCcc Confidence 45556667777765554 3456788999999999999999999998 56667 No 119 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=99.81 E-value=2.9e-21 Score=133.16 Aligned_cols=347 Identities=13% Similarity=0.140 Sum_probs=208.4 Q ss_pred Cch-HHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhhhh Q lcl|NC_019933. 1 MSD-INAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQHISIGQ 79 (394) Q Consensus 1 Mk~-i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~ 79 (394) |.+ |++|++.- .++....+..++..+. ++ .+-+++.+.+...-+.....+-+ T Consensus 1 ~~~~~~~~~~~~-------------------~~~~~~~e~k~lr~~m-------e~-~et~~e~~~~~~~~~~~e~el~E 53 (393) T protein:vir:79 1 MENWLKQLKESG-------------------FTETQVQEQKSLRTRM-------ER-GETLAEADANKLALNEEETQILE 53 (393) T ss_pred CchHHHHHHhcc-------------------CchhHHHHHHHHHHHh-------hh-hhhhhhhhhhhhhcchhHHHHHH Confidence 443 22222210 0000000111111111 10 00000000000000000000111 Q ss_pred hhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhcccccc-ccCceeEE Q lcl|NC_019933. 80 QFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTM-EGNTLEYV 158 (394) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~-~~~~~~~~ 158 (394) . +..+.. +...... ++....-++.++..+||..+++-+.+...+-...-+++..+.+ .|.++.+| T Consensus 54 ~------f~Kmm~-----G~~p~~e---V~~~e~mtt~~a~IliP~vis~v~~Eaaepl~~~~kl~qk~~L~~Grsm~F~ 119 (393) T protein:vir:79 54 S------FAKMME-----GETPTNE---VNLREFMATPSAQILIPRVIVGTMREAAEPLYIGTKMLQKIRLKSGQSMIFP 119 (393) T ss_pred H------HHHHhc-----CCCchhh---eehhhhhcCCCcceechhhhhhhhhhcccchhHHHHHHHHHhhhcCcceecc Confidence 1 111100 1111111 1111112345678999999999999998888888888888887 56777777 Q ss_pred EEcCcccccceecCCccccccc---cceeeEEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_019933. 159 RETGFTNAAAPVAEGAQKPESS---LRFDLVQTSAKVIAHWMKASRQILSDSA-QLQSFINARLLRGLEVVEENQLLNGN 234 (394) Q Consensus 159 ~~~~~~~~~~~~~eg~~~~~~~---~~~~~i~~~~~k~~~~~~is~e~l~~s~-~~~~~i~~~la~a~~~~~d~a~l~g~ 234 (394) ... ..-++-++||+..|+-+ .++++|+++..|++..+.+|+|+++||. ++.+++.....++++++.|..++++. T Consensus 120 ~~g--~~Ra~~IgEGgE~~~~sld~~T~dsv~~~~gK~G~~Ia~SqEmIsDSg~Dvin~~l~aA~RaMaRkKee~a~n~f 197 (393) T protein:vir:79 120 SIG--IMRAYDVAEGQEIPEDSIDWQTHESPEIRVGKSGIRLRFTDEMISDSQWDLMSMMIKQAGRAMGRHKEQKAYHQF 197 (393) T ss_pred chh--eeeeccccccccccccchhhhcCCceeEEechhhhhhhhHHHHhhcchHHHHHHHHHHHHHHHHhhhHHHHHhhh Confidence 654 24567799999998865 5799999999999999999999999996 89999999999999999999999998 Q ss_pred CCCcc--cccccccccccccc----ccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHh---hccCCccccc Q lcl|NC_019933. 235 GTGQN--LLGLLPQATAFAAP----ITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELL---KDTQGRYILG 305 (394) Q Consensus 235 g~~~~--~~Gi~~~~~~~~~~----~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~l---kd~~G~~~~~ 305 (394) .+..+ +.++.+....+.+. ....+..+.++++++..++.+..+.++.|+|||-.|+.+++- ....-+++-. T Consensus 198 k~~ghtvfDa~st~t~ahptGr~~~~~qNGTlSleDllDm~~av~~~hyt~svi~MHPLAWnv~AKna~me~~~~na~gN 277 (393) T protein:vir:79 198 RSHGHTVFDNYSTNKLAHTTGLDKNGVQNDTFSAEDFLDLIIAVMANEYTPSDLMMHPLAWTVFAKNELMGSLQANPYGN 277 (393) T ss_pred hcccceeeeccccCccceeecCCccccccccccHHHHHHHHHHHhcccCCcceEEEcCchhhhhhhhhhhcceeeccccc Confidence 88776 66766554444433 345678899999999999999999999999999999998753 2222222210 Q ss_pred -Cccc------CCCcee-----ecceEEEcCCCCcCc------eEEeeccceEEEEeecceEEEEecccchhhhcCcEEE Q lcl|NC_019933. 306 -NPQG------TLAPTL-----WGLPVVATQAMAVGQ------FLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTI 367 (394) Q Consensus 306 -~~~~------~~~~~l-----~G~pv~~~~~~p~~~------~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 367 (394) +.-. -++..| +.+.|++++.+|-.+ .+..|-+..-.+..+.+++.+.. ++-..|.+.+ T Consensus 278 ~~~~~~~ts~algp~~i~~~~~~nlnv~~sPfvp~d~k~~rFd~~~Vd~NnvgvlLV~D~i~tdq~----ddk~rdiq~i 353 (393) T protein:vir:79 278 YPAKGAPSSMALGPDSIQGRLPFNFNVNLSPFIPLDKKSRRFDVYAVDRNNVGVLLVRDDLKTDQW----DEKARGLQNI 353 (393) T ss_pred cCccccchhhhhchhhhccccccceeEEEecccccccccceeeEEEeecCCceEEEEecCcceecc----ccccccceee Confidence 0000 111112 237899999998543 23333332222223334443332 3345678889 Q ss_pred EEEEEeccEEecc-cceEEEEecCCCCC Q lcl|NC_019933. 368 LAEERLALAVYRP-ESFIKGSLAAAAGT 394 (394) Q Consensus 368 ~~~~~~d~~v~~~-~a~~~l~~~~a~~~ 394 (394) +...|+|+.|.+. .|+++.+-=.-+-| T Consensus 354 Kl~ERYG~gvLn~gkaiavakNI~~~k~ 381 (393) T protein:vir:79 354 KMIERYGIGILNEGKAIAVAKNISMDKS 381 (393) T ss_pred eeeeeeceeeeeCCceEEEEecceeecc Confidence 9999999999996 67766652222222 No 120 >protein:vir:93858 Length: 400 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764266;genbank:gi:115315579;genbank:GeneID:5141552 Probab=99.80 E-value=1e-20 Score=130.18 Aligned_cols=376 Identities=12% Similarity=0.053 Sum_probs=237.7 Q ss_pred Cc--hHHHHHHHHHHHHHHHHHHHHHHHhhh-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhh Q lcl|NC_019933. 1 MS--DINAINSTLANISDSLKAHADRAVKDQ-ELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQHISI 77 (394) Q Consensus 1 Mk--~i~el~~~~~~~~~~~k~~~e~~~~~~-~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~ 77 (394) |+ +|.|-++.+....++.-.+..+....+ ...-|-..+.+++++.+.++..+|.+.|..+....+...+.+.-..-+ T Consensus 8 ~~k~~~~ek~~~~~~~~e~~~~lks~~~g~~~~~~~~~~~k~~el~kT~Sel~~ei~k~e~eln~~~E~~Kgk~~mtefL 87 (400) T protein:vir:93 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKMTNFI 87 (400) T ss_pred cccchHHHHHHHHhhhhhhhhhhhhhhhccchhhhhhhchhHHHHHHHHHHhHHHHHHHhhhhhhhhhhcccchhHHHhh Confidence 66 366666655555554444433333322 122233467889999999999999998888887766666554432222 Q ss_pred hhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeE Q lcl|NC_019933. 78 GQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEY 157 (394) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 157 (394) .........++............. .+++.+.....+ ..+....+|..+...|-+.++.+.++++++.+.++++.-... T Consensus 88 kT~~A~~~fa~~l~~nsg~sd~kn-aW~A~l~E~gvt-~td~n~iLP~~il~aIq~al~~~~~~~~f~~v~n~p~l~V~~ 165 (400) T protein:vir:93 88 ESQNAVTEFFDVLKKNSGKSEIKN-AWSAKLAENGVT-ITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSR 165 (400) T ss_pred hhHHHHHHHHHHHHhhcCCcchhh-hhhhhhhhcccc-cCCchhhcchHHHHHHHHhhhccCCcccceeeecCCceeeec Confidence 221111111111111111111111 222322222221 123455889999999999999999999999988886543322 Q ss_pred EEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHH---HHHHHHHHHHHHHHHHHH-HHHHHHhhc Q lcl|NC_019933. 158 VRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSD---SAQLQSFINARLLRGLEV-VEENQLLNG 233 (394) Q Consensus 158 ~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~---s~~~~~~i~~~la~a~~~-~~d~a~l~g 233 (394) +-. . ...++.+--|+.+.++..+|..-++.|.-++.+..+.+-..++ ...+.+|++++|...+.. .++++++-| T Consensus 166 ~~d-t-~~qa~gHk~G~~K~eq~~tl~~rtL~P~~VYk~~~la~~~~~~~~tygaL~nYVm~EL~q~vI~k~Ve~Aii~G 243 (400) T protein:vir:93 166 SFD-S-ANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEG 243 (400) T ss_pred chh-h-hcccceeccCCcccceeeeeeeeccCHHHHHHHhhhhhhhhhccccHHHHHHHHHHHHHHHHHHHHhhhheeec Confidence 322 2 2355667789999999999999999999988888885555543 247899999999999996 579999999 Q ss_pred cCCCcc-----ccccccccccccccccccccchHHHHHHH-HHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCc Q lcl|NC_019933. 234 NGTGQN-----LLGLLPQATAFAAPITVANATAVDRLRLA-LLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNP 307 (394) Q Consensus 234 ~g~~~~-----~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~-~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~ 307 (394) +|+++- .+-|.+-+.. +.....++...+.+++.- ..-+.+.......++++|++|+.|+.++|++|++.|+.. T Consensus 244 dG~Ngf~~~dk~t~Ik~I~~d-t~kt~~a~~~~~qdl~E~~~d~~~~~aad~~~Iv~s~d~~A~L~~lk~a~~~a~f~~~ 322 (400) T protein:vir:93 244 DGTNGFKSIDKEADVKKIKKI-TTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKAEDRKALLDELRQATANANVRIK 322 (400) T ss_pred ccccccCCCcchhhhhhhhhh-hhhhhhcCCccHHHHHHHHHhhhhhccCCceeEEeccchHHHHHHhcCCcceeeeeec Confidence 887641 1111111111 222223455556666554 344444455566899999999999999999999999766 Q ss_pred ccCCC-ceeecce-EEEcCC--CCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccce Q lcl|NC_019933. 308 QGTLA-PTLWGLP-VVATQA--MAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESF 383 (394) Q Consensus 308 ~~~~~-~~l~G~p-v~~~~~--~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~ 383 (394) ....+ .+-+|+- +++... +|...+++ |.+ +++ +..++ .......+..|+..+..+.++++-+.-|++- T Consensus 323 n~d~~IA~~fGv~~Lv~~Tr~~~~kp~V~V-Dek--~~i-~~~~~----~t~~sf~~~tNs~~ilvetlv~Gsi~~~N~~ 394 (400) T protein:vir:93 323 NDDTEIASEVGVDEIIVYTGSKALKPTVLV-DQK--YHI-DMQDL----TKVDAFEWKTNSNMILVETLTSGHVETYNAG 394 (400) T ss_pred cccchhhhhcccceeeeeccCCCCCceeee-ehh--hhc-cccCc----eeccceeeeeccceEEeeeeeccceecccce Confidence 65433 3445653 333444 44444443 644 333 22332 2234456778888999999999999999999 Q ss_pred EEEEec Q lcl|NC_019933. 384 IKGSLA 389 (394) Q Consensus 384 ~~l~~~ 389 (394) ++++++ T Consensus 395 ay~~v~ 400 (400) T protein:vir:93 395 AVITVS 400 (400) T ss_pred eeEeeC Confidence 999998 No 121 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=99.79 E-value=3e-20 Score=127.60 Aligned_cols=278 Identities=15% Similarity=0.137 Sum_probs=196.6 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEcCcccc----cceecCCccccccccceeeE Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTNA----AAPVAEGAQKPESSLRFDLV 186 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~eg~~~~~~~~~~~~i 186 (394) +..-+...++.+.+..+...||+.+.+.+.|++++++.++.++.+.+.|...-+.. ..|.--....+++..+|+++ T Consensus 1 mpaltLaea~k~~~d~l~~~ViE~~~~~s~lL~~LpF~~veg~~~~ynR~~~~~~~~~~~v~~~~~~~g~~~~~~t~~~~ 80 (310) T protein:vir:97 1 MASVTLAESAKLAQDELVAGVIENIITVNRMFDVLPFDSIEGNSLAYNRENVLGDVIMAGVGTTFSGAGAGKAAATFTKV 80 (310) T ss_pred CcccchHHHhhcCcchHHHHHHHHHhccchHHHhCCcccccCCcceeeEeeccCCcccccccccccCCCcccccccccee Confidence 33333444567889999999999999999999999999999999999988653222 12222335556788999999 Q ss_pred EeeeeeEEEeehhhHHHHHH--H-H-HHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccc-cccc-cccccc Q lcl|NC_019933. 187 QTSAKVIAHWMKASRQILSD--S-A-QLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAF-AAPI-TVANAT 260 (394) Q Consensus 187 ~~~~~k~~~~~~is~e~l~~--s-~-~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~-~~~~-~~~~~~ 260 (394) +...+-+++.+.|.+.+.+- + + +...+=.+...+++.++.+..+|+|+.+++++.|+++..... .+.. +..+.. T Consensus 81 ~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a~n~F~GL~~~~~~~q~i~~~~~gg~~ 160 (310) T protein:vir:97 81 NSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGAGNEFAGLIQLCASGQKATTGATGSAI 160 (310) T ss_pred eeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhccccCCCcccchhhcCCccceeecCCCCCCC Confidence 99999999999999877542 2 3 444455677889999999999999999888888998875442 2222 233566 Q ss_pred hHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHh-hccCCcccccCccc---CCCceeecceEEEcCCCCcCc------ Q lcl|NC_019933. 261 AVDRLRLALLQAQLAEFPATGIVLNPADWAGIELL-KDTQGRYILGNPQG---TLAPTLWGLPVVATQAMAVGQ------ 330 (394) Q Consensus 261 ~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~l-kd~~G~~~~~~~~~---~~~~~l~G~pv~~~~~~p~~~------ 330 (394) +.+++-.++..+......+++|+|||++..+|+.+ +..+++.+++.... ....++.|+|++.++.+|.+. T Consensus 161 t~d~LDeLl~~v~~~~g~p~~~l~~~~~~r~i~A~~R~~~~~g~~~~~~~~~G~~v~~~~GiPi~~~d~ip~~~~~~~~~ 240 (310) T protein:vir:97 161 SFAILDELMDLVVDKDGQVDYLTMHARTLRSYKALLRALGGASINEVVELPSGAEVPAYSGTPIFRNDYIPTNQTKGGTT 240 (310) T ss_pred CHHHHHHHHHHHhcCCCCCCEEEecHHHHHHHHHHHHHhcCCCCCCccccCCCCEEeeeCCeEEEEeCccCCCccccccC Confidence 77888777777766677889999999998888754 55555556554332 233578999999999988642 Q ss_pred ----eE---Eeecc--ceEEEEe---ecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|NC_019933. 331 ----FL---TGAFD--AGAQVFD---RWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAA 391 (394) Q Consensus 331 ----~~---~gd~~--~~~~~~~---~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a 391 (394) ++ ||+.. .++.... ..+++++.-- ..-.++...++.++||+.++.+|+|+++|.--.= T Consensus 241 gtTsIya~r~Ge~~~~~Gv~Gl~~~~~~glsVr~~G---~~~~~~v~~~~V~~Y~~~av~~~~A~a~L~~V~~ 310 (310) T protein:vir:97 241 GCTTIFAGTLDDGSRTHGIAGLTATQAAGIQVVDVG---ESEDSDEHIWRVKWYCGLALFSEKGLACADGITN 310 (310) T ss_pred CceeEEEEeeCccccccceeccccCCccceeEEeCC---cccCCcceeEEEEEeeeEEEecccceeeeccccC Confidence 33 33321 2332211 2234444321 1123567789999999999999999999973222 No 122 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=99.78 E-value=4.4e-20 Score=126.69 Aligned_cols=362 Identities=12% Similarity=0.121 Sum_probs=207.7 Q ss_pred CchHH----HHHHHHHHHH------------------HHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 1 MSDIN----AINSTLANIS------------------DSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQ 58 (394) Q Consensus 1 Mk~i~----el~~~~~~~~------------------~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~ 58 (394) |-+-. +...++..+. ...++...+-..+.+...+..+...+...++..++.++...+. T Consensus 1 ~~n~t~a~d~~~RR~~~~L~~~EvSvv~~PAY~nA~vt~vRe~e~~~~~e~~~~~e~~en~~e~~~~~~~~~~E~Rs~~~ 80 (410) T protein:vir:83 1 MGNATTASDEYIRRLENELREKESLVRGIYDRANASNRDVNEEEGQMVAECRGRMEQIKNQMEQAQEVNRIAFETRSKGQ 80 (410) T ss_pred CCCcccchhhHHHHHHHHhhhhheeeeccccccccccccchhhhccccccccCcccchhhhhHHHHHHHHHHHHHHHHHH Confidence 44321 2222222221 0111110011111111122222333334444444444433333 Q ss_pred HHHHHHh----hcccccccchhhhhhhhhHHHHHHHH--HHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHH Q lcl|NC_019933. 59 RIAEVEG----NGAGGDVQHISIGQQFVNSDSFKAMA--ESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGIL 132 (394) Q Consensus 59 ~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii 132 (394) .+...-. .+.....+-++.++ ..+.+. ..+.......++..+ .......+.+...+||++|..+.+ T Consensus 81 ~i~~~~~~~r~~p~~~~veyRSaGE------~lkal~~~~~Gd~~A~~~~e~~r--~a~~~~~Tgd~~~~i~~~~v~d~i 152 (410) T protein:vir:83 81 AVDAAISAMRGSPVGTEVEYRSAGE------YMLDMWNSAQGNASAADRLEVYA--RAADHQKTGDLQGVIPDPIVGPVI 152 (410) T ss_pred HHHhhhccCcCCCCCCCcccccHHH------HHHHHhccCCchHHHHHHHHHHH--HhhccCcccccccccchhHhhhHH Confidence 3322222 22222223333322 222221 111222222223211 122233333445578888999999 Q ss_pred hhhhhhhhHHHhccccccccCceeEEEEcCccccc------ceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHH Q lcl|NC_019933. 133 ELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTNAA------APVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSD 206 (394) Q Consensus 133 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~ 206 (394) +++.+..++..++...|.+|.++.||+.+.....+ ..-.||...+..+++|+.-+...+.++++..+||+.++. T Consensus 153 ~li~q~r~i~slf~tLP~~g~T~eY~v~t~~~tV~~q~~~~kqa~EGd~L~~gKl~~~t~tA~ikTyGGyt~LSRQ~IER 232 (410) T protein:vir:83 153 DFIDSARPLVSTLGTLPLNNATFYRPIVSQRPAVGLQGVAGGASDEKTELDSQKMVIDRLTVNAKTLGGYVNVSRQAIDF 232 (410) T ss_pred HHHhhccchhhhhhhCCCCCCeeEEeeecccccccccccccccccccccccccceeeeeccceeehhcCcccccceeeec Confidence 99999999999999999999999998876543221 335689999999999999999999999999999999997 Q ss_pred H-HHHHHHHHHHHHHHHHHHHHH---HHhhccCCCccccccccccccccccccccccchHHHHHHHHHHhhhh--cCCCC Q lcl|NC_019933. 207 S-AQLQSFINARLLRGLEVVEEN---QLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLA--EFPAT 280 (394) Q Consensus 207 s-~~~~~~i~~~la~a~~~~~d~---a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~--~~~~~ 280 (394) | ..+.+..++.|..+++.+-+. ++|+..-++ ..... ..+.......++++...+..+ +..-. T Consensus 233 s~v~~L~~~lraL~~AYA~atea~vra~L~~t~t~---------~~a~~---~~Tad~~~~~i~da~~~v~da~~~~~~~ 300 (410) T protein:vir:83 233 SSPSALDLVVNGLGQQYAIETEALVGAALASTSTG---------AVGYG---NATADNVASAIWQAAGAVYTAVKGMGRL 300 (410) T ss_pred CChhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh---------hhhhh---hccHHHHHHHHHHHHHHHhhhhccceee Confidence 7 578899999998888887764 445432111 11111 112223334455555566655 44555 Q ss_pred eeEeCHHHHHHHHHhhccCCcccccCcc--------cCCCceeecceEEEcCCCCcCceEEeeccceEEEEeecceEEEE Q lcl|NC_019933. 281 GIVLNPADWAGIELLKDTQGRYILGNPQ--------GTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEV 352 (394) Q Consensus 281 ~~~~~~~~~~~l~~lkd~~G~~~~~~~~--------~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~ 352 (394) .+.++|+++..+.++. ..+++.+.+.. .+-.+.++|.||++.+..+++++.|.|.. ++..+......+++ T Consensus 301 ~i~vS~DVl~~~~~~f-~~~~~~~~dt~Gfg~~~lg~gi~G~~~~ipVvm~~~a~AgTA~f~~~~-Ai~~~eS~~gp~qL 378 (410) T protein:vir:83 301 VIAIAPDVLGDFGPLF-APVNPTNAHSTGFEAGRFGQGVMGSISGIPVVMSAALGSGDAYLFSTA-AIECFEQRVGTLQV 378 (410) T ss_pred eEEechhhhhhcccee-eccCCCCcccccccccccccchhhhhcccceEEecCCCcCeeeEeccc-eeeeeecCCceeEe Confidence 7899999987776653 22333322211 12236789999999999999999999854 68888877666766 Q ss_pred ecccchhhhcCcEEEEEEEEeccEEecccceEEEEec Q lcl|NC_019933. 353 ATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLA 389 (394) Q Consensus 353 ~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~ 389 (394) ..+.-....+ .|- .|+.+.+.++.++.-+.=. T Consensus 379 ~d~~i~nLt~---~yS--gY~a~a~~~~~gliPv~g~ 410 (410) T protein:vir:83 379 VEPSVFGLQV---AYA--GYFSTLVVNEDAIVPLVGS 410 (410) T ss_pred eCCchhhhhh---hhe--eeeeeccccccceeeeccC Confidence 5544322222 222 7778889999888776543 No 123 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=99.73 E-value=1.8e-19 Score=123.31 Aligned_cols=221 Identities=19% Similarity=0.167 Sum_probs=173.1 Q ss_pred ccccccccCceeEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHH Q lcl|NC_019933. 145 LAQGTMEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLE 223 (394) Q Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~ 223 (394) -+-++ .|+++++|.. .+.+.-+.||..++..+.++++.+...++.+..++|+++....+ .|......++++.+++ T Consensus 1 ~~~~~-~Gdtit~P~~---iGda~~v~eG~~i~~~~l~~t~~~atIk~~gk~~~itD~a~l~~~gDp~~ea~~Q~~~~iA 76 (231) T protein:vir:73 1 ENGIN-LANLCEYPND---IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLA 76 (231) T ss_pred Ccccc-CCceEEeccc---ccchhhhcCCCcCChhhccccceeeeEeeeccceeeeHHHHhhccCchHHHHHHHHHHHHH Confidence 22222 2568999965 34567899999999999999999999999999999999987655 5788999999999999 Q ss_pred HHHHHHHhhccCCCccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccC---- Q lcl|NC_019933. 224 VVEENQLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQ---- 299 (394) Q Consensus 224 ~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~---- 299 (394) +++|..++.-..+ .+...+...+++.|.++...+...+..+.+++|||..+..|++..+.. T Consensus 77 ~kvD~di~~~~~~---------------a~l~~~~~~t~d~i~~A~~~fgde~~~~~vivv~p~~~~~Lrk~~~~~~~~~ 141 (231) T protein:vir:73 77 NKVDDDLLKAAKT---------------TSQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGS 141 (231) T ss_pred HhhhHHHHHhhcc---------------ccccccccccHHHHHHHHHHhccccccceEEEEcchHHHhhhhccchhhhhh Confidence 9999988853211 111233456899999999999888878889999999999998754321 Q ss_pred --CcccccCcccCCCceeecceEEEcCCCCcCceEEee---ccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEec Q lcl|NC_019933. 300 --GRYILGNPQGTLAPTLWGLPVVATQAMAVGQFLTGA---FDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLA 374 (394) Q Consensus 300 --G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd---~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d 374 (394) |+.+ ...+.-+.+.|+||++++.+|.+..+..- -+.++.++...++.++.+++ .......+++..+++ T Consensus 142 ~~g~~i---~~~G~iG~i~G~~Vi~S~~~~~~~~~~~~~i~~~gAl~~~~k~~~~vEtdRd----~~~k~~~i~~~~~y~ 214 (231) T protein:vir:73 142 EVGANA---LINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRD----IVTKTTVITADEHYA 214 (231) T ss_pred hhccce---eeecccceEcceEEEEcCCCCCCceeeeeEEeeccceeeeecccceeecccc----ccccccEEEEeEEEE Confidence 2222 22334468999999999999998876433 25677788888888876654 345677899999999 Q ss_pred cEEecccceEEEEecCC Q lcl|NC_019933. 375 LAVYRPESFIKGSLAAA 391 (394) Q Consensus 375 ~~v~~~~a~~~l~~~~a 391 (394) .++.+|..+++++++-. T Consensus 215 v~l~~~~~vv~~t~~g~ 231 (231) T protein:vir:73 215 AYLYDLTKVVNITFTGV 231 (231) T ss_pred EEEEcCccEEEEEeecC Confidence 99999999999999988 No 124 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=99.69 E-value=9e-18 Score=114.04 Aligned_cols=301 Identities=9% Similarity=0.040 Sum_probs=183.1 Q ss_pred hhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCcee Q lcl|NC_019933. 77 IGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLE 156 (394) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 156 (394) ++.. .-.......+. ..+.....+...-+|.++++.....|++.+.+.+++++.++++++...+.. T Consensus 1 ~~~~-------------~~~~~~~n~~~-~~i~k~~it~~~l~~g~L~p~~a~~Fl~~v~~~t~iL~~~r~~~~~s~~~e 66 (360) T protein:vir:99 1 MSSN-------------STIDSVRNQNM-NSLSQKDIGLAELDGFQLPVDVTEEFLERMQKGVQILGMADTMTLARLEME 66 (360) T ss_pred Ccch-------------hHHHHHhhhHH-HHHHhhhccccccCceeecHHHHHHHHHHHhhccchhhhcceeeccccccc Confidence 1100 00000111111 112122222222357788999999999999999999999999999988888 Q ss_pred EEEEcCcccccceecC-CccccccccceeeEEe-eeeeEEEeehhhHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 157 YVRETGFTNAAAPVAE-GAQKPESSLRFDLVQT-SAKVIAHWMKASRQILSDSA-----QLQSFINARLLRGLEVVEENQ 229 (394) Q Consensus 157 ~~~~~~~~~~~~~~~e-g~~~~~~~~~~~~i~~-~~~k~~~~~~is~e~l~~s~-----~~~~~i~~~la~a~~~~~d~a 229 (394) +++...+.-..--..| +.......++...+.+ ..+++...+.++.+.+.+.. .+++.|++.+++++++-++.. T Consensus 67 i~kig~G~r~~r~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l 146 (360) T protein:vir:99 67 VPQFGVPRLSGHTRDEEGSRTENSEAESGSVKFNATDKSYYILVEPKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLM 146 (360) T ss_pred ccccccceeeccccccCCCCCcCCcCccccCccccccceeeEeechHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHH Confidence 8876543212212222 3322224455666665 34567777788888776543 477999999999999999999 Q ss_pred HhhccCCCc-------------ccccccccccccccccc----------------------------cc---ccchHHHH Q lcl|NC_019933. 230 LLNGNGTGQ-------------NLLGLLPQATAFAAPIT----------------------------VA---NATAVDRL 265 (394) Q Consensus 230 ~l~g~g~~~-------------~~~Gi~~~~~~~~~~~~----------------------------~~---~~~~~~~i 265 (394) .++|+.... ...|+++.+........ .. .......+ T Consensus 147 ~~~g~~ds~d~~~~~~~d~fl~~~dGwlKka~~~~~~id~a~d~t~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~lf 226 (360) T protein:vir:99 147 GIRAGASSGNLQSIGGAAELDNTFKGWIARAEGDAQSVDDAGDSTRIGLEDTATADADSMPSIANTDGSGNPQPVDTSLF 226 (360) T ss_pred HhhccchhcccccCcccchhhhhhHHHHHHhhcccchhhccccccccccccccccccccchhhhccccccccccchHHHH Confidence 999875532 13566665532211000 00 01123345 Q ss_pred HHHHHHhhhhcCCC----CeeEeCHHHHHHHHH-hhccCCccccc-CcccCCCceeecceEEEcCCCCcCceEEeeccce Q lcl|NC_019933. 266 RLALLQAQLAEFPA----TGIVLNPADWAGIEL-LKDTQGRYILG-NPQGTLAPTLWGLPVVATQAMAVGQFLTGAFDAG 339 (394) Q Consensus 266 ~~~~~~~~~~~~~~----~~~~~~~~~~~~l~~-lkd~~G~~~~~-~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~ 339 (394) .+++..++..|..+ -+|+||+..+..++. |.+-. .++.. ...+.+.-.++|+|++..+.+|++.++|.++++. T Consensus 227 ~~~~~~Lp~kyr~~~~~~~~~~~s~~~~~~yr~~L~~R~-t~LGd~~l~g~~~~~~~Gipi~~v~~~pd~~~mlT~p~NL 305 (360) T protein:vir:99 227 NETIQTLDSRYRESDAYSPVLMTSPNQVQSYTMSLTERE-DPLGSAVIFGDSDITPFSYDLVGVNGFPDEYMMFTDPNNL 305 (360) T ss_pred HHHHHhcchhhhcCcccceEEEccCchHHHHHHHHhccC-cccchhheecccccccceeeeEEcCCCCCCceEEeccCce Confidence 57777888887643 289999998766654 43322 23322 1233333467899999999999999999999986 Q ss_pred EEEEeecceEEEEecccchhhhcCc-EEEEEEEEeccEEecccceEEEE-ecCCCC Q lcl|NC_019933. 340 AQVFDRWAARVEVATENQDDFIKNM-VTILAEERLALAVYRPESFIKGS-LAAAAG 393 (394) Q Consensus 340 ~~~~~~~~~~i~~~~~~~~~~~~~~-~~~~~~~~~d~~v~~~~a~~~l~-~~~a~~ 393 (394) ++.+ ...++++...++...-.+.. +.+....++|+.+.+++|.++++ +..+.- T Consensus 306 i~g~-~~~iri~~~~e~~~~~~~~~~~~~~~~~~~D~~iee~~Av~~vt~~~~~~~ 360 (360) T protein:vir:99 306 AFGL-YEEMELDQSTDTDKVHEQRLHSRNWLEGQFDFQIKEQQAGVLVTDLETPTA 360 (360) T ss_pred eEEe-eeeeEEeecccchhhhhhceeeeEEEEEEeeEEEEecccEEEEecCCCCCC Confidence 5444 44667765544433222221 33445678999999999999987 111111 No 125 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=99.58 E-value=4.1e-16 Score=104.96 Aligned_cols=258 Identities=12% Similarity=0.033 Sum_probs=163.4 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHHhcccc----ccccCceeEEEEcCcccccceecCCccccccccceeeE Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQG----TMEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLV 186 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i 186 (394) |.. ..++|+.|...+++.+.+.+.+..++..- ...|+++++|+.... ....+..++..++..+.+...+ T Consensus 1 MA~------~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~~~~~~~~~ 73 (273) T protein:vir:10 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAP-TVKDYKAAGRQTSADAISDTGV 73 (273) T ss_pred Ccc------hhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccc-cccccccCCCccCccccccceE Confidence 111 23579999999999999998888876431 234678999987543 3455667777777667777777 Q ss_pred EeeeeeE-EEeehhhHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccccccchHHH Q lcl|NC_019933. 187 QTSAKVI-AHWMKASRQ-ILSDSAQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDR 264 (394) Q Consensus 187 ~~~~~k~-~~~~~is~e-~l~~s~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~ 264 (394) ++...+. +..+.|++. ..+...++.+ +.++++.+++.++|..++.--..... ........+....++. T Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~---------~~~~~~~~~~~~~~~~ 143 (273) T protein:vir:10 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGT---------ALTGSAPTDADDAFDL 143 (273) T ss_pred EEEEeeeeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhcccc---------ccccccccchhHHHHH Confidence 7777553 445567764 3444456777 56678899999999877642211110 0111112233456788 Q ss_pred HHHHHHHhhhhcCC--CCeeEeCHHHHHHHHHhhccCCc--cccc--CcccCCCceeecceEEEcCCCCcCc---eEEee Q lcl|NC_019933. 265 LRLALLQAQLAEFP--ATGIVLNPADWAGIELLKDTQGR--YILG--NPQGTLAPTLWGLPVVATQAMAVGQ---FLTGA 335 (394) Q Consensus 265 i~~~~~~~~~~~~~--~~~~~~~~~~~~~l~~lkd~~G~--~~~~--~~~~~~~~~l~G~pv~~~~~~p~~~---~~~gd 335 (394) |.++...+...+.+ +-.++++|..+..|.+..+.-.+ .... ....+..++|.|++|+.++.+|.+. .+.+- T Consensus 144 i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~~~~~ 223 (273) T protein:vir:10 144 IAKALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFH 223 (273) T ss_pred HHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEecccccCCccEEEEEe Confidence 88888888877764 34789999999998764321111 1101 1123344689999999999999754 23222 Q ss_pred ccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|NC_019933. 336 FDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAA 391 (394) Q Consensus 336 ~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a 391 (394) +.++... .+...++..+... +-...+++..++|+++++|++++.++.+.+ T Consensus 224 -~~A~~~a-~q~~~~e~~r~~~----~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 224 -PSAAAYV-SQIDTVEALRDQD----SFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred -ccceeee-eeeehhhcccCCC----cceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 2333222 2222333333222 224578899999999999999999887766 No 126 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=99.58 E-value=4.1e-16 Score=104.96 Aligned_cols=258 Identities=12% Similarity=0.033 Sum_probs=163.4 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHHhcccc----ccccCceeEEEEcCcccccceecCCccccccccceeeE Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQG----TMEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLV 186 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i 186 (394) |.. ..++|+.|...+++.+.+.+.+..++..- ...|+++++|+.... ....+..++..++..+.+...+ T Consensus 1 MA~------~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~~~~~~~~~ 73 (273) T protein:vir:10 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAP-TVKDYKAAGRQTSADAISDTGV 73 (273) T ss_pred Ccc------hhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccc-cccccccCCCccCccccccceE Confidence 111 23579999999999999998888876431 234678999987543 3455667777777667777777 Q ss_pred EeeeeeE-EEeehhhHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccccccchHHH Q lcl|NC_019933. 187 QTSAKVI-AHWMKASRQ-ILSDSAQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDR 264 (394) Q Consensus 187 ~~~~~k~-~~~~~is~e-~l~~s~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~ 264 (394) ++...+. +..+.|++. ..+...++.+ +.++++.+++.++|..++.--..... ........+....++. T Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~---------~~~~~~~~~~~~~~~~ 143 (273) T protein:vir:10 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGT---------ALTGSAPTDADDAFDL 143 (273) T ss_pred EEEEeeeeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhcccc---------ccccccccchhHHHHH Confidence 7777553 445567764 3444456777 56678899999999877642211110 0111112233456788 Q ss_pred HHHHHHHhhhhcCC--CCeeEeCHHHHHHHHHhhccCCc--cccc--CcccCCCceeecceEEEcCCCCcCc---eEEee Q lcl|NC_019933. 265 LRLALLQAQLAEFP--ATGIVLNPADWAGIELLKDTQGR--YILG--NPQGTLAPTLWGLPVVATQAMAVGQ---FLTGA 335 (394) Q Consensus 265 i~~~~~~~~~~~~~--~~~~~~~~~~~~~l~~lkd~~G~--~~~~--~~~~~~~~~l~G~pv~~~~~~p~~~---~~~gd 335 (394) |.++...+...+.+ +-.++++|..+..|.+..+.-.+ .... ....+..++|.|++|+.++.+|.+. .+.+- T Consensus 144 i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~~~~~ 223 (273) T protein:vir:10 144 IAKALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFH 223 (273) T ss_pred HHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEecccccCCccEEEEEe Confidence 88888888877764 34789999999998764321111 1101 1123344689999999999999754 23222 Q ss_pred ccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|NC_019933. 336 FDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAA 391 (394) Q Consensus 336 ~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a 391 (394) +.++... .+...++..+... +-...+++..++|+++++|++++.++.+.+ T Consensus 224 -~~A~~~a-~q~~~~e~~r~~~----~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 224 -PSAAAYV-SQIDTVEALRDQD----SFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred -ccceeee-eeeehhhcccCCC----cceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 2333222 2222333333222 224578899999999999999999887766 No 127 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=99.58 E-value=4.3e-16 Score=104.82 Aligned_cols=259 Identities=12% Similarity=0.021 Sum_probs=166.1 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHHhccc----cccccCceeEEEEcCcccccceecCCccccccccceeeE Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQ----GTMEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLV 186 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i 186 (394) |.. ..++|+.|...+++.+...+.+.+++.. ....|+++++|+... .....+..+|..++..+++...+ T Consensus 1 MA~------~~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~-~~~~d~~~~~~~~~~~~~~~~~~ 73 (273) T protein:vir:79 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVA-PTVKDYKAAGRQTSADAISDTGV 73 (273) T ss_pred Ccc------hhhhHHHHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecCc-ccccccccCCCccCccccccceE Confidence 211 2367999999999999999888887643 223467899999754 34555677888888888888888 Q ss_pred EeeeeeE-EEeehhhHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccccccchHHH Q lcl|NC_019933. 187 QTSAKVI-AHWMKASRQ-ILSDSAQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDR 264 (394) Q Consensus 187 ~~~~~k~-~~~~~is~e-~l~~s~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~ 264 (394) ++...+. +.-+.|++. ..+...++.+ +.++++.++++++|..++.--..... ..+.....+....++. T Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~vD~~i~~~~~~a~~---------~~~~~~~~~~~~~~~~ 143 (273) T protein:vir:79 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGT---------ALTGSAPSDADDAFDL 143 (273) T ss_pred EEEEeeecccceeeccHHHHhhcccHHH-HHHHHHHHHHHHHHHHHHHHHhhccc---------ccccccccchhhHHHH Confidence 8888663 555677774 3444457776 55678899999999876642211110 0111112233446788 Q ss_pred HHHHHHHhhhhcCC--CCeeEeCHHHHHHHHHhhcc--CCccccc--CcccCCCceeecceEEEcCCCCcCce--EEeec Q lcl|NC_019933. 265 LRLALLQAQLAEFP--ATGIVLNPADWAGIELLKDT--QGRYILG--NPQGTLAPTLWGLPVVATQAMAVGQF--LTGAF 336 (394) Q Consensus 265 i~~~~~~~~~~~~~--~~~~~~~~~~~~~l~~lkd~--~G~~~~~--~~~~~~~~~l~G~pv~~~~~~p~~~~--~~gd~ 336 (394) +.++...+...+.+ +-.++++|..+..|.+..+. +...... ....+..++|.|++|+.++.+|.+.. .+... T Consensus 144 i~~a~~~ld~~~vP~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~i~~s~~lp~~~~~~~~a~~ 223 (273) T protein:vir:79 144 IASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFH 223 (273) T ss_pred HHHHHHHhhhccCCccCcEEEECHHHHHHHhhchhhhhhhhhcccccceeeeEeeEEeceEEEecccccccCceEEEEEe Confidence 88888888877763 34789999999988764321 1111111 12234456899999999999997643 22222 Q ss_pred cceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|NC_019933. 337 DAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAA 391 (394) Q Consensus 337 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a 391 (394) +.++... .....++..+.. .+-...+++..++|+++++|+++++++.+.+ T Consensus 224 ~~A~~~a-~~~~~~e~~r~~----~~~~~~v~~~~~yg~~v~~p~~vv~~~~~g~ 273 (273) T protein:vir:79 224 PSAAAYV-SQIDTVEALRDQ----DSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred ccceeee-eehhhhhcccCc----ccceeeeeeeeeeeeEEecCceEEEEeccCC Confidence 3333222 222233333322 2235678899999999999999999887766 No 128 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=99.51 E-value=1.7e-15 Score=101.53 Aligned_cols=283 Identities=14% Similarity=0.082 Sum_probs=166.0 Q ss_pred HHhhcccccCCcCcccc-------chhhhhHHHhhhhhhhhHHHhcccccc-ccCceeEEEEcC--cccccceecCCccc Q lcl|NC_019933. 107 AITSLSTNADGSAGATV-------QTTRLPGILELPQRRMTIRSLLAQGTM-EGNTLEYVRETG--FTNAAAPVAEGAQK 176 (394) Q Consensus 107 ~~~~~~~~~~~~~g~~i-------p~~~~~~ii~~~~~~~~l~~~~~~~~~-~~~~~~~~~~~~--~~~~~~~~~eg~~~ 176 (394) ..+........+++.+. |+.+-..|.+++...-..-.+++.+.. .++.+.+-+... ....+.-+.||+.+ T Consensus 1 ~~~~~~i~s~~~~~~itv~~ll~~P~~I~~~i~e~~~~~~iad~lf~~~~a~~~~~v~f~~~~p~~~~~d~e~VaEggEi 80 (318) T protein:vir:10 1 MTAPTGIVSVSDGPAITVRELVGNPLWIPTALKKMMVNQFISESLFRNGGANPNGVVAYNEGNPSFLEDDVADVAEFGEI 80 (318) T ss_pred CCCCCcceeeecCCceehHHhhCCchhHHHHHHHHHhccchhhhhhhcccccccceeEEEecccccccCcHhhccCcccc Confidence 11111111112233221 555555666666555544445555433 345555544321 13456778999999 Q ss_pred cccccceeeEEe-eeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccc Q lcl|NC_019933. 177 PESSLRFDLVQT-SAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPI 254 (394) Q Consensus 177 ~~~~~~~~~i~~-~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~ 254 (394) |.+.+.++.-.+ ..+|++..+.||+|++..+ -+..+.....++.++++..|+.++..--+...+. ++.++.+.... T Consensus 81 P~~~~~~G~~~ia~~~K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~--~~~s~~w~~~~ 158 (318) T protein:vir:10 81 PVSAGARGLPRTAFAVKKALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPT--LAVPTAWDNGG 158 (318) T ss_pred cccCCCCCchhhhhhehhccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc--ccCCcCCCCcc Confidence 999999988776 5579999999999999877 4788888899999999999987775321111010 11111111100 Q ss_pred -cccc-cchHHHHHHHHHHh---------hhhcCCCCeeEeCHHHHHHHHH------hhccCCcccccC--cccCCCcee Q lcl|NC_019933. 255 -TVAN-ATAVDRLRLALLQA---------QLAEFPATGIVLNPADWAGIEL------LKDTQGRYILGN--PQGTLAPTL 315 (394) Q Consensus 255 -~~~~-~~~~~~i~~~~~~~---------~~~~~~~~~~~~~~~~~~~l~~------lkd~~G~~~~~~--~~~~~~~~l 315 (394) ...+ ....+.+..+...+ ..-++.++.++|||.+|..|++ +-..++++++.. ..+..++.+ T Consensus 159 ~~~~d~~~A~e~v~~a~~~~~~a~~~~~~~~~GY~pdtIVlhP~~~~~l~~n~~~~~~y~~~a~~~~~~~~~tg~~~g~~ 238 (318) T protein:vir:10 159 KVRTDIAIAIEQISTAAPTAYPAGVGSSDEYFGFIPDTIVMHYALLPILMDNENFMKVYERNANYVSTAPDWTGNFPGSV 238 (318) T ss_pred cccccchhhhhhhhhhhhhhhhhhhhhhhhccCccceeeEECHHHHHHHhcchhhhhhhhccchhhhhccccccccccee Confidence 0000 00111111111111 1224567899999999999954 333456666533 234445688 Q ss_pred ecceEEEcCCCCcCceEEeeccceEEEEeecceEEEEeccc--chhh-hcCcEEEEEEEEeccEEecccceEEEE-ecCC Q lcl|NC_019933. 316 WGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATEN--QDDF-IKNMVTILAEERLALAVYRPESFIKGS-LAAA 391 (394) Q Consensus 316 ~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~--~~~~-~~~~~~~~~~~~~d~~v~~~~a~~~l~-~~~a 391 (394) +|+.|+.++.+|.+++++++-...-.+.+...+...-.+.. ..+. .+..+..++...-...|.+|.|+++|+ +-++ T Consensus 239 lGl~vi~s~~~p~~~alvlq~g~vG~~~d~~pl~~t~~~~egg~~~g~~~~s~~~~~~~~~~~~V~~PkA~~~itgi~~~ 318 (318) T protein:vir:10 239 MGLNVIRSRTFPIDRVLIMERGTVGFYSDTRPLQFTALYPEGNGPNGGPTESYRADASHKRALAVDQPKAALWLTGIVTP 318 (318) T ss_pred eceEEeecCccCCCeeEEEecCCcceeeccccceeeecccCCCCCCCCcchhhheehheeeeeeeeCcceeEEEeeccCC Confidence 99999999999999999988543334444444444332211 1111 233456678888899999999999998 3333 No 129 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=99.41 E-value=1.9e-14 Score=95.84 Aligned_cols=288 Identities=11% Similarity=0.035 Sum_probs=164.5 Q ss_pred HHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccc---cccCceeEEEEcCcccccceecCCccccc Q lcl|NC_019933. 102 INIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGT---MEGNTLEYVRETGFTNAAAPVAEGAQKPE 178 (394) Q Consensus 102 ~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~eg~~~~~ 178 (394) +-.-+-+..... +++.....||+.|..++++.+.+...+.++++-.+ ..|+++++|+... +.+....++..++. T Consensus 1 ~~~~~~~~~~~~-~t~~v~~fipei~s~~i~~~l~~~~v~~~~~~d~~~~~~~Gdtv~ip~~g~--~~~~d~~~~~~i~~ 77 (341) T protein:vir:94 1 MALGNTITGPSI-NTQRGQQFIPEQWLSEVQMFRKAKMLDTSVVKTWGAQVKKGDTFHVPRISE--LGVEDKATDVPVGV 77 (341) T ss_pred Ccchhhhccccc-cchhHHHHHHHHHHHHHHHHHHhhcchhhccccccccccCCceEEEeccCc--ceeeeecCCCcccc Confidence 111111111111 22233456899999999999999988888876443 2367899998642 44556677888877 Q ss_pred cccceeeEEeeeee-EEEeehhhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccc Q lcl|NC_019933. 179 SSLRFDLVQTSAKV-IAHWMKASRQI-LSDSAQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITV 256 (394) Q Consensus 179 ~~~~~~~i~~~~~k-~~~~~~is~e~-l~~s~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~ 256 (394) .+++-..+++...+ .+.-+.|++.- .+.+.++.+.+.++.+.++++++|+.++.--.......+............+. T Consensus 78 ~~~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~~~~~~~~~t~~ 157 (341) T protein:vir:94 78 QPVNDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNVFSSSNGAITGN 157 (341) T ss_pred ccccCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCccccCccccccCc Confidence 77777777777744 35567788754 44556899999999999999999988775322111100000011111111112 Q ss_pred cccchHHHHHHHHHHhhhhcCCC--CeeEeCHHHHHHHHHhhccCCc-cccc-CcccCCCceeecceEEEcCCCCcCceE Q lcl|NC_019933. 257 ANATAVDRLRLALLQAQLAEFPA--TGIVLNPADWAGIELLKDTQGR-YILG-NPQGTLAPTLWGLPVVATQAMAVGQFL 332 (394) Q Consensus 257 ~~~~~~~~i~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~lkd~~G~-~~~~-~~~~~~~~~l~G~pv~~~~~~p~~~~~ 332 (394) ....+++.++++...+...+.+. -.++++|..+..|.+...-... +.-. ....+..+++.|++|+.++.+|.+... T Consensus 158 ~~~~~~~~i~~a~~~Lde~~VP~~gR~lvv~P~~~~~Ll~~~~~~~~~~~g~~~l~~G~ig~i~G~~V~~Sn~lp~~~~~ 237 (341) T protein:vir:94 158 GQAFSFAVFLAARRLLLEADVPEEKIVLLISPGQESALFTIPQFISKDFINNAPIAQGQIGSLMGVRVIRTSLIGNNSAT 237 (341) T ss_pred hhhhhHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhhchhhhhhhccccchhheeeeeeEeceEEEEeccccccccc Confidence 23446788888888888776643 3678999999998753211111 1111 122233468999999999999865421 Q ss_pred E---------------------------eeccce-EEEEeecceE-EE--------------EecccchhhhcCcEEEEE Q lcl|NC_019933. 333 T---------------------------GAFDAG-AQVFDRWAAR-VE--------------VATENQDDFIKNMVTILA 369 (394) Q Consensus 333 ~---------------------------gd~~~~-~~~~~~~~~~-i~--------------~~~~~~~~~~~~~~~~~~ 369 (394) . +++... .+.+.+..+. +. ...+......+=...+++ T Consensus 238 ~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~ 317 (341) T protein:vir:94 238 GWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVVSKAPRVTQSFENREQVWLMVG 317 (341) T ss_pred cccccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhhccccccccccccchhhhhhhhhhh Confidence 0 011100 0111111110 10 001111111122334566 Q ss_pred EEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 370 EERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 370 ~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) ..-+|.++++|++.+.++ +++.| T Consensus 318 ~~~~G~~~lrp~~~v~~~--~~~~~ 340 (341) T protein:vir:94 318 RQAYGARLYRPLHAVNIH--TTGDT 340 (341) T ss_pred hhhhcccccCcceeEEEe--cCcCC Confidence 778899999999975554 44555 No 130 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=99.37 E-value=2.6e-13 Score=89.56 Aligned_cols=265 Identities=12% Similarity=0.067 Sum_probs=169.1 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHH---------hcccc--ccccCceeEEEEcCcccccceecCCcccccc Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRS---------LLAQG--TMEGNTLEYVRETGFTNAAAPVAEGAQKPES 179 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~---------~~~~~--~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~ 179 (394) |.++ .-.-.++|+.+..-+.+...+.+.|++ +.... ..+|..+++|.+...++.+..+.++..++.. T Consensus 1 MA~T--~lsd~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~~v~~~~~i~~~ 78 (324) T protein:vir:59 1 MAYT--KISDVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQVLNDTDDLVPQ 78 (324) T ss_pred CCce--eeeceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCCCcccccCCCcccchh Confidence 4422 234567888888877777777766633 11222 2357789999988765667778899999999 Q ss_pred ccceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccc----cccccc Q lcl|NC_019933. 180 SLRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQAT----AFAAPI 254 (394) Q Consensus 180 ~~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~----~~~~~~ 254 (394) +.+.++-....++.+..+.++++....+ .+....+.++++....+..+..+|..- .|++.... ...+.. T Consensus 79 ~l~t~~~~a~i~~~~k~~~~tD~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l------~g~~~~~~~~~~~~dvsa 152 (324) T protein:vir:59 79 KINAGQDKAVLILRGNAWSSHDLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAEL------AGVFSNDDMKDNKLDISG 152 (324) T ss_pred hcccceeeEEEEeecCceeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHH------HHhhhccccccceeeeec Confidence 9988888888888888899999876545 477888999999999999998776431 11111110 011222 Q ss_pred cccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcccCCCceeecceEEEcCCCCcCc---- Q lcl|NC_019933. 255 TVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPTLWGLPVVATQAMAVGQ---- 330 (394) Q Consensus 255 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~---- 330 (394) ..+...+.+.+.++...+-.....-.+|+||+.++..|++..-- .++.....+..-++++|++|++++.||... T Consensus 153 ~~~~~~s~~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~li--~~~~~s~~~~~i~~~~G~~VivdD~~p~~~~~~~ 230 (324) T protein:vir:59 153 TADGIYSAETFVDASYKLGDHESLLTAIGMHSATMASAVKQDLI--EFVKDSQSGIRFPTYMNKRVIVDDSMPVETLEDG 230 (324) T ss_pred cccceecHHHHHHHHHHhCCcccCcEEEEEchHHHHHHHHhhhh--hhccccccCceeeeecccEEEEeCCCCccccCCC Confidence 22334567888998888877777778999999999999875321 122222222334689999999999998531 Q ss_pred ------eEEeeccceEEEEe-ecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEE--ecCCCCC Q lcl|NC_019933. 331 ------FLTGAFDAGAQVFD-RWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGS--LAAAAGT 394 (394) Q Consensus 331 ------~~~gd~~~~~~~~~-~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~--~~~a~~~ 394 (394) .+|+. .++.... +..+.++.+++. ..+...++...++.+.+ .++..-. .+...|| T Consensus 231 ~~~y~s~l~~~--GAi~~~~~~~~v~vE~dRd~----~~g~~~l~~r~~~~~~p---~G~s~~~~~~~~~sPt 294 (324) T protein:vir:59 231 TKVFTSYLFGA--GALGYAEGQPEVPTETARNA----LGSQDILINRKHFVLHP---RGVKFTENAMAGTTPT 294 (324) T ss_pred CceEEEEEEec--CeEEEeecCCCcceecccCc----cccceEEEEeeEEEeEe---eeEEecccccCCCCCC Confidence 22332 2333333 234555555443 35667777777765444 4443332 2234455 No 131 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=99.36 E-value=8.6e-14 Score=92.21 Aligned_cols=288 Identities=14% Similarity=0.085 Sum_probs=168.4 Q ss_pred hhhhHHHHHHHhhcccccCCcCc--cccchhhhhHHHhhhhhhhhHHHhccccccc-cCceeEEEEcCcccccceecCCc Q lcl|NC_019933. 98 GRAEINIKAAITSLSTNADGSAG--ATVQTTRLPGILELPQRRMTIRSLLAQGTME-GNTLEYVRETGFTNAAAPVAEGA 174 (394) Q Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~g--~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~eg~ 174 (394) .. ....-..+.........++- .+.=+.|..+++......+.++++++..++. |+++.+|+.... .+.....|. T Consensus 1 ~a-~~~~~~~~~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~G~sv~~~~iG~~--~~~~~~~g~ 77 (347) T protein:vir:88 1 MA-NATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRT--KGYYLAPGE 77 (347) T ss_pred CC-CcccchhhhccCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhhccccccccCcceEEEeeecce--eeeeecccc Confidence 00 00000001011111111111 2333889999999999999999999988765 678999987543 455666677 Q ss_pred cccc--cccceeeEEeeeeeEE-EeehhhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhccC--------CCccccc Q lcl|NC_019933. 175 QKPE--SSLRFDLVQTSAKVIA-HWMKASRQI-LSDSAQLQSFINARLLRGLEVVEENQLLNGNG--------TGQNLLG 242 (394) Q Consensus 175 ~~~~--~~~~~~~i~~~~~k~~-~~~~is~e~-l~~s~~~~~~i~~~la~a~~~~~d~a~l~g~g--------~~~~~~G 242 (394) .++. .++..+++++...++- .-..|.+.- .+.+.|+.+.+.++.+.++++..|+.++..-. .+..+.| T Consensus 78 ~l~~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~~~~~~~g 157 (347) T protein:vir:88 78 NLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAG 157 (347) T ss_pred CCCCCCCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCC Confidence 6554 3567787777777643 222333322 22223788899999999999999998874211 1122333 Q ss_pred cccccccc-c-----ccccccccchHHHHHHHHHHhhhhcCCC--CeeEeCHHHHHHHHHhhc-cCCccccc-CcccCCC Q lcl|NC_019933. 243 LLPQATAF-A-----APITVANATAVDRLRLALLQAQLAEFPA--TGIVLNPADWAGIELLKD-TQGRYILG-NPQGTLA 312 (394) Q Consensus 243 i~~~~~~~-~-----~~~~~~~~~~~~~i~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~lkd-~~G~~~~~-~~~~~~~ 312 (394) +....... . .+....+...++.|+++...+...+.+. -.++++|..|..|.+... ..+.+.-. ....+.. T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~v 237 (347) T protein:vir:88 158 LGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNI 237 (347) T ss_pred ccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCHHHHHHHhcchhhhhhhhccccchhccee Confidence 22211111 0 1111123345788888888888777653 378999999988865432 22333221 2233445 Q ss_pred ceeecceEEEcCCCCcCc---e----------------------EEeeccceEEEE-e--------ecceEEEEecccch Q lcl|NC_019933. 313 PTLWGLPVVATQAMAVGQ---F----------------------LTGAFDAGAQVF-D--------RWAARVEVATENQD 358 (394) Q Consensus 313 ~~l~G~pv~~~~~~p~~~---~----------------------~~gd~~~~~~~~-~--------~~~~~i~~~~~~~~ 358 (394) +.+.|++|+.++++|.+. . +.+|++....++ . -.++.++..++.. T Consensus 238 g~i~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~d~~~e~~r~~~- 316 (347) T protein:vir:88 238 RNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPE- 316 (347) T ss_pred eeeccceEEEeecccccccccccccccccccccccccccccccccccccCcEEEEEechhhhhheecccceeeeeechh- Confidence 689999999999998421 1 223444332222 1 1223344433322 Q ss_pred hhhcCcEEEEEEEEeccEEecccceEEEEecCCC Q lcl|NC_019933. 359 DFIKNMVTILAEERLALAVYRPESFIKGSLAAAA 392 (394) Q Consensus 359 ~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~ 392 (394) .| ...+++...+|.++++|++.+.++.+++| T Consensus 317 ~~---~d~i~~~~~~G~~~~rPe~a~~~~~~~a~ 347 (347) T protein:vir:88 317 FQ---ADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) T ss_pred hH---HHHhhhhhhhcCceeccceEEEEEeCCCC Confidence 12 34678899999999999999999999999 No 132 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=99.34 E-value=6.6e-14 Score=92.84 Aligned_cols=281 Identities=14% Similarity=0.079 Sum_probs=164.7 Q ss_pred hhHHHHHHHhhcccccCCcCc-----cccchhhhhHHHhhhhhhhhHHHhccccccc-cCceeEEEEcCcccccceecCC Q lcl|NC_019933. 100 AEINIKAAITSLSTNADGSAG-----ATVQTTRLPGILELPQRRMTIRSLLAQGTME-GNTLEYVRETGFTNAAAPVAEG 173 (394) Q Consensus 100 ~~~~~~~~~~~~~~~~~~~~g-----~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~eg 173 (394) +.. ..........+....+| .+.=+.+..++++.....+.++++++..++. |+++.+|+... ..+.....| T Consensus 1 ~~~-~~~~~~~~~~~~~~~~~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~r~i~~gks~~~~~iG~--~~~~~~~~G 77 (345) T protein:vir:22 1 MAS-MTGGQQMGTNQGKGVVAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVLGR--TQAAYLAPG 77 (345) T ss_pred Ccc-cccchhcccccccccccCCchhHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEEeeecc--eEEEeeecC Confidence 000 00000001111111111 2334788999999999999999999998887 56888998743 456677778 Q ss_pred cccccc--ccceeeEEeeeeeEEEeehhhHHHHHH-----H-HHHHHHHHHHHHHHHHHHHHHHHhhcc--------CCC Q lcl|NC_019933. 174 AQKPES--SLRFDLVQTSAKVIAHWMKASRQILSD-----S-AQLQSFINARLLRGLEVVEENQLLNGN--------GTG 237 (394) Q Consensus 174 ~~~~~~--~~~~~~i~~~~~k~~~~~~is~e~l~~-----s-~~~~~~i~~~la~a~~~~~d~a~l~g~--------g~~ 237 (394) +.+..+ ++..++.+|...++- +++.++.| + .++.+.+.++++.++++..|+.++..- +.+ T Consensus 78 ~~l~~~~~~~~~~e~~ltID~~~----y~~~~VddiD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~ 153 (345) T protein:vir:22 78 ENLDDKRKDIKHTEKVITIDGLL----TADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESKYN 153 (345) T ss_pred CCCCCCCCCcccceEEEEecchh----hhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Confidence 877554 467777555555432 33333322 2 378999999999999999999887311 111 Q ss_pred cccccccccc----ccccc---cccccccchHHHHHHHHHHhhhhcCCCC--eeEeCHHHHHHHHHhhccC-CcccccCc Q lcl|NC_019933. 238 QNLLGLLPQA----TAFAA---PITVANATAVDRLRLALLQAQLAEFPAT--GIVLNPADWAGIELLKDTQ-GRYILGNP 307 (394) Q Consensus 238 ~~~~Gi~~~~----~~~~~---~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~~~~~~~~~~l~~lkd~~-G~~~~~~~ 307 (394) ..|.|.-+.. +.... .........++.|.++...+...+.+.+ .++++|..|..|..-+..+ ..+.-... T Consensus 154 ~~~~~~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~~~~~~~~~~~~~ 233 (345) T protein:vir:22 154 ENIEGLGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAADRVFYCDPDSYSAILAALMPNAANYAALID 233 (345) T ss_pred ccccccccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCccCCEEEeChHHHHHHhccccccccccccccc Confidence 2233322211 11111 1112234568888888888888877644 6799999999886543322 22222111 Q ss_pred c-cCCCceeecceEEEcCCCCcCce-----------------------EEeeccceEEEEe--------ecceEEEEecc Q lcl|NC_019933. 308 Q-GTLAPTLWGLPVVATQAMAVGQF-----------------------LTGAFDAGAQVFD--------RWAARVEVATE 355 (394) Q Consensus 308 ~-~~~~~~l~G~pv~~~~~~p~~~~-----------------------~~gd~~~~~~~~~--------~~~~~i~~~~~ 355 (394) . .+..+.+.|++|+.++.+|.+.+ ..++.+-...++. -.+++++..++ T Consensus 234 ~~~G~V~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~ 313 (345) T protein:vir:22 234 PEKGSIRNVMGFEVVEVPHLTAGGAGTAREGTTGQKHVFPANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERARR 313 (345) T ss_pred cccceEEEEeceEEEecccccccccCccccCcccccccccccccceeeeeccCceEEEEEehhheeeeeeecceeeeeec Confidence 1 22245789999999998874210 1111111112222 22334444443 Q ss_pred cchhhhcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|NC_019933. 356 NQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAA 391 (394) Q Consensus 356 ~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a 391 (394) .. .| ...+++..-+|.++++|+|.+.++++.- T Consensus 314 ~~-~~---~d~I~~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 314 AN-FQ---ADQIIAKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred hh-HH---HHHHHHHHhcCCcccccceeEEEEEeeC Confidence 32 22 2367788899999999999999988777 No 133 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=99.34 E-value=6e-14 Score=93.06 Aligned_cols=283 Identities=11% Similarity=0.019 Sum_probs=165.7 Q ss_pred HHHHhhcccccC---CcCc-cccc-hhhhhHHHhhhhhhhhHHHhccccccc-cCceeEEEEcCcccccceecCCccccc Q lcl|NC_019933. 105 KAAITSLSTNAD---GSAG-ATVQ-TTRLPGILELPQRRMTIRSLLAQGTME-GNTLEYVRETGFTNAAAPVAEGAQKPE 178 (394) Q Consensus 105 ~~~~~~~~~~~~---~~~g-~~ip-~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~eg~~~~~ 178 (394) .+.......+.+ ..++ .-+. +.+..+|.+.....+.+++++++.++. |+++.+|+... ..+.....|+.+.. T Consensus 1 m~~~~~~~~t~~~~~~~~~~~~l~le~~~geV~~af~~~s~~~~~~~~r~i~~G~s~~~~~iG~--~~~~~~~~g~~l~~ 78 (334) T protein:vir:80 1 MTYPAANTHTRPGWGGANSDVSLHIEEHLGLVDASFMYSSKFASWMNVRSLRGTNQLRVDRVGA--STIAGRKAGEELVV 78 (334) T ss_pred CCCCcCCCccccccccccchheehhhhhhhHHHHHHHHhhhhhccceeeeccccceEEEeeecc--eeeeeecCCCCCCC Confidence 110100111111 1122 2344 889999999999999999999998887 67899998743 45667888888888 Q ss_pred cccceeeEEeeeeeEE-EeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhc----cCCCcc----c---ccccc Q lcl|NC_019933. 179 SSLRFDLVQTSAKVIA-HWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNG----NGTGQN----L---LGLLP 245 (394) Q Consensus 179 ~~~~~~~i~~~~~k~~-~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g----~g~~~~----~---~Gi~~ 245 (394) ..+.-++.+|....+- .-..|.+----++ -|+.+.+.++++.++++..|++++.. .....+ + .|+.. T Consensus 79 ~~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~~~~~~G~~~ 158 (334) T protein:vir:80 79 QKNVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLKPAFHDGILL 158 (334) T ss_pred CCcccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccCCcce Confidence 8777888888777633 2222222111112 37999999999999999999977632 111111 0 12222 Q ss_pred cccc--ccccccccccchHHHHHHHHHHhhhhcCC-----CCeeEeCHHHHHHHHHhhccCCc-cccc----CcccCCCc Q lcl|NC_019933. 246 QATA--FAAPITVANATAVDRLRLALLQAQLAEFP-----ATGIVLNPADWAGIELLKDTQGR-YILG----NPQGTLAP 313 (394) Q Consensus 246 ~~~~--~~~~~~~~~~~~~~~i~~~~~~~~~~~~~-----~~~~~~~~~~~~~l~~lkd~~G~-~~~~----~~~~~~~~ 313 (394) .... .+.....+.......++.+...+...+.. .-+++++|..|..|..-..-..+ |... ...++.-+ T Consensus 159 ~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d~~~s~~~~~~~~g~i~ 238 (334) T protein:vir:80 159 PSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNVEFGAKEGGNSFVGGRIA 238 (334) T ss_pred eecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccccceeccccccccccceeEE Confidence 1111 11111122223345555666666666655 23789999999998764321111 1100 01122245 Q ss_pred eeecceEEEcCCCCcCc-----------eEEeeccceEE-EEeecc--------eEEEEecccchhhhcCcEEEEEEEEe Q lcl|NC_019933. 314 TLWGLPVVATQAMAVGQ-----------FLTGAFDAGAQ-VFDRWA--------ARVEVATENQDDFIKNMVTILAEERL 373 (394) Q Consensus 314 ~l~G~pv~~~~~~p~~~-----------~~~gd~~~~~~-~~~~~~--------~~i~~~~~~~~~~~~~~~~~~~~~~~ 373 (394) .++|+||+.++++|... .+-|||+.... ++.+.. ++.+..++.. .| ...+.+..-+ T Consensus 239 ~v~G~~V~~Sn~~P~~~~t~~~~g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~~~-~~---~d~i~~~~a~ 314 (334) T protein:vir:80 239 MLNGVRVVETPRFPQSAITANALGADFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFWEEKK-DF---GHYLDTFQSY 314 (334) T ss_pred EEeceEEEeecCCCCccccccccccccccccccccceEEEEEeCceEEEEEEeecceeeeechh-hH---HHHHHHHHHc Confidence 78999999999999652 33455554332 222222 2222222211 11 1234456778 Q ss_pred ccEEecccceEEEEecCCCC Q lcl|NC_019933. 374 ALAVYRPESFIKGSLAAAAG 393 (394) Q Consensus 374 d~~v~~~~a~~~l~~~~a~~ 393 (394) |.++++|+|+++++++...| T Consensus 315 G~g~lRPeaa~vv~~~~~~~ 334 (334) T protein:vir:80 315 NIGQRRPDAVAVHDITVTNP 334 (334) T ss_pred CCceeccceEEEEEEeeecC Confidence 99999999999999999888 No 134 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=99.31 E-value=7.7e-13 Score=86.98 Aligned_cols=267 Identities=10% Similarity=-0.000 Sum_probs=164.2 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHHh---------ccccccccCceeEEEEcCcccccceecCCc-cccccc Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRSL---------LAQGTMEGNTLEYVRETGFTNAAAPVAEGA-QKPESS 180 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~eg~-~~~~~~ 180 (394) |....+.-.-.++|+.+..-+.+...+.+.|++- ......+|..+++|.+...++.+..+.+|+ .++..+ T Consensus 1 Ma~~~T~l~d~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~~~~l~G~~~~~~dg~~~i~~~k 80 (330) T protein:vir:10 1 MANELTKILDTITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGLLVNMPFWNDLTGDSEVLGNGDKALETGK 80 (330) T ss_pred CCCCceEeeeeechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCccccchhh Confidence 4444444556788988887777777666665432 112223678899999886656676777875 688888 Q ss_pred cceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccc----------c Q lcl|NC_019933. 181 LRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQAT----------A 249 (394) Q Consensus 181 ~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~----------~ 249 (394) .+-++-....++.+..+.++++....+ .+....+.+++++...+..+..++... .|++.... . T Consensus 81 i~t~~~~a~i~~~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~l------~gvf~~~~~~~~~~~~~~~ 154 (330) T protein:vir:10 81 ITAGADIACVLYRGRGWAANELTGVVAGSDPVRAILNRIGAYWLREDQKALIATL------NGIFATGTAGEKGALEETH 154 (330) T ss_pred cccceeEEEEEeecceeeehhhhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHHH------Hhhhhhhhcccchhhhhhh Confidence 888888888888899999999875544 577888999999988888887666421 11211100 0 Q ss_pred ccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcccCCCceeecceEEEcCCCCcC Q lcl|NC_019933. 250 FAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPTLWGLPVVATQAMAVG 329 (394) Q Consensus 250 ~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~ 329 (394) ...........+.+.+.++...+-.....-.+|+||+.++..|++..-- .++.....+..-++++|++|++++.+|.. T Consensus 155 ~~~~~~~~a~~s~~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~li--~~~~~s~~~~~i~~~~G~~VivdD~~p~~ 232 (330) T protein:vir:10 155 VSDQSKASTGIDAGMVLDAKQLLGDSADQVTAIAMHSAVYTKLQKDNLI--QYIQPTTATINIPTYLGYRVIIDDGIAPT 232 (330) T ss_pred eecccccccccCHHHHHHHHHHhccccccceEEEEcHHHHHHHHHhhhh--hhhcccccCcccccccceEEEEeCCCCCC Confidence 0001112334466788888888777766777999999999999874311 11111111223468999999999999843 Q ss_pred c----e-EEeeccceEEEEee---cceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEe----cCCCCC Q lcl|NC_019933. 330 Q----F-LTGAFDAGAQVFDR---WAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSL----AAAAGT 394 (394) Q Consensus 330 ~----~-~~gd~~~~~~~~~~---~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~----~~a~~~ 394 (394) . . +|+. .++.+.+. ..+.++.++ +...+...+....++. .+|..+..-.- +...|| T Consensus 233 ~~~yt~yl~~~--GAi~~~~~~~~~~v~~EtdR----d~~~g~~~l~~r~~~~---~hp~G~s~~~~~~~~~~~sPt 300 (330) T protein:vir:10 233 GDIYTSYLFRT--GSIGLNTGNPSGLTTFETSR----EAAKGNDMIYTRRALV---MHPYGVKWTGAEVDAGNITPS 300 (330) T ss_pred CCceeEEEEec--CceeeecccCCccccccccC----CccccceEEEEeeEEE---eeeeeeeecccccccCcCCcC Confidence 2 1 2222 22223221 113343333 3345666677766644 44555554432 223344 No 135 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=99.31 E-value=1.9e-13 Score=90.34 Aligned_cols=281 Identities=13% Similarity=0.080 Sum_probs=168.0 Q ss_pred HHHHhhcc---c--cc-CCcCc--cccchhhhhHHHhhhhhhhhHHHhccccccc-cCceeEEEEcCcccccceecCCcc Q lcl|NC_019933. 105 KAAITSLS---T--NA-DGSAG--ATVQTTRLPGILELPQRRMTIRSLLAQGTME-GNTLEYVRETGFTNAAAPVAEGAQ 175 (394) Q Consensus 105 ~~~~~~~~---~--~~-~~~~g--~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~eg~~ 175 (394) .+...... + +. ..+|. .+.=+.+..+|.+.+...+.+++++++..+. |+++.+|+... ..+..+..|.. T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~rti~~G~sv~~~~iG~--~~~~~~~~G~~ 78 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGDKLALFLKVFGGEVLTAFTRTSVTMNKHLVRSIQSGKSAQFPVLGR--TKAAYLQPGEN 78 (347) T ss_pred CCccccccccccccccCCcccchHHHHHHHHhHHHHHHHHHHHhhhhhhhheeccccceEEeeeccc--eeEeeeecCcC Confidence 11010000 0 00 01111 1233889999999999999999999988866 67899998754 35667778887 Q ss_pred ccc--cccceeeEEeeeeeEE-EeehhhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhc----c----CCCcccccc Q lcl|NC_019933. 176 KPE--SSLRFDLVQTSAKVIA-HWMKASRQI-LSDSAQLQSFINARLLRGLEVVEENQLLNG----N----GTGQNLLGL 243 (394) Q Consensus 176 ~~~--~~~~~~~i~~~~~k~~-~~~~is~e~-l~~s~~~~~~i~~~la~a~~~~~d~a~l~g----~----g~~~~~~Gi 243 (394) +.. .++..++.++....+- .-..|.+.- .+..-++.+.+.++.+.++++..|+.++.. . ....++.|. T Consensus 79 l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~~~~~~g~ 158 (347) T protein:vir:94 79 LDDKRKDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTANNENIAGL 158 (347) T ss_pred CCCCcCCccccceEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccC Confidence 754 3577887777766642 122232211 111237889999999999999999888631 1 111222222 Q ss_pred cccccc-------ccccccccccchHHHHHHHHHHhhhhcCCC--CeeEeCHHHHHHHHHhhc-cCCccccc-CcccCCC Q lcl|NC_019933. 244 LPQATA-------FAAPITVANATAVDRLRLALLQAQLAEFPA--TGIVLNPADWAGIELLKD-TQGRYILG-NPQGTLA 312 (394) Q Consensus 244 ~~~~~~-------~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~lkd-~~G~~~~~-~~~~~~~ 312 (394) ...... .+.+....+...++.+.++...+...+.+. -.++++|..|..|.+..+ ..+.+-.. ....+.. T Consensus 159 ~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~R~~vv~P~~y~~LLk~~~~~~~~~~~~~~~~~G~V 238 (347) T protein:vir:94 159 GKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSDRVFYTTPDNYSAILAALMPNAANYQALIDPSTGSI 238 (347) T ss_pred CcceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCCCCCEEEeChHHHHHHHHhhccccccccccccccccee Confidence 111110 011111233445788888888888887753 256778999988876433 22332221 2233445 Q ss_pred ceeecceEEEcCCCCcCc-------------------------eEEeeccceEEE-E--------eecceEEEEecccch Q lcl|NC_019933. 313 PTLWGLPVVATQAMAVGQ-------------------------FLTGAFDAGAQV-F--------DRWAARVEVATENQD 358 (394) Q Consensus 313 ~~l~G~pv~~~~~~p~~~-------------------------~~~gd~~~~~~~-~--------~~~~~~i~~~~~~~~ 358 (394) +++.|++|+.++++|... -+=+||+....+ + .-.++.+++.++.. T Consensus 239 ~~v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~~~~~e~~~~~~- 317 (347) T protein:vir:94 239 RNVMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLKDMALERARRAN- 317 (347) T ss_pred EEeeceEEEEcCccccccCcccccccccccccccccccccccccccccccceEEEEechhhhhhhhhcccceeeeechh- Confidence 689999999999998532 122333332222 2 22334444443322 Q ss_pred hhhcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|NC_019933. 359 DFIKNMVTILAEERLALAVYRPESFIKGSLAAA 391 (394) Q Consensus 359 ~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a 391 (394) +-...+.+..-+|..+++|++.+.+..++| T Consensus 318 ---~~~~~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 318 ---FQADQIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred ---hhhhhhhhhhhhcCcccccceeEEEEecCC Confidence 223467788999999999999999999999 No 136 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=99.29 E-value=6.8e-13 Score=87.30 Aligned_cols=283 Identities=10% Similarity=0.017 Sum_probs=166.4 Q ss_pred hhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccc-cCceeEEEEcCcccccceecCCccccc Q lcl|NC_019933. 100 AEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTME-GNTLEYVRETGFTNAAAPVAEGAQKPE 178 (394) Q Consensus 100 ~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~eg~~~~~ 178 (394) +. ...+..+....+..++....| +.+..+|.+.+...+.++++.++.++. |+++.+|+... ..+.+...|+.+.. T Consensus 1 ms-~~~~~tr~~~~~s~~d~al~l-e~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~iG~--~~~~~~~pG~~l~~ 76 (335) T protein:vir:63 1 MS-FLNDLTRPNYAGKNADVDIHL-EEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRLGN--VEAKGRRAGEELER 76 (335) T ss_pred CC-Ccccchhhhcccccchhheeh-hhhhhhHHHHHHhhhhhccccceeeeccceeEEEeeeee--eeeecccCCcCcCC Confidence 00 000001111112222222233 889999999999999999999999886 56789999743 46777888888877 Q ss_pred cccceeeEEeeeeeEEEeehhhHHHHH---HH---HHHHHHHHHHHHHHHHHHHHHHHh----hccCCCcc---cc---- Q lcl|NC_019933. 179 SSLRFDLVQTSAKVIAHWMKASRQILS---DS---AQLQSFINARLLRGLEVVEENQLL----NGNGTGQN---LL---- 241 (394) Q Consensus 179 ~~~~~~~i~~~~~k~~~~~~is~e~l~---~s---~~~~~~i~~~la~a~~~~~d~a~l----~g~g~~~~---~~---- 241 (394) +.+..++..+....+- +++.++. +. -|+.+.+..+++.++++..|+.++ .+.....+ +. T Consensus 77 ~~~~~~k~~itVD~ll----~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~~~~ 152 (335) T protein:vir:63 77 SRVVNDKWNLTVDTLL----YLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSP 152 (335) T ss_pred CCccccceEEEeccee----echhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCCCcCC Confidence 7777788777777644 4444433 22 279999999999999999999775 22222111 11 Q ss_pred ccccccccccccccccccchHHHHHHHHHHhhhhcCCC-----CeeEeCHHHHHHHHHhhccCCc-cccc----CcccCC Q lcl|NC_019933. 242 GLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPA-----TGIVLNPADWAGIELLKDTQGR-YILG----NPQGTL 311 (394) Q Consensus 242 Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~-----~~~~~~~~~~~~l~~lkd~~G~-~~~~----~~~~~~ 311 (394) |+.........+.........+.+.++...+...+.+. -+++++|..|..|..-..--.+ |... ....+. T Consensus 153 G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g~ 232 (335) T protein:vir:63 153 GVLEKLDLTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQATGATNDYVKSR 232 (335) T ss_pred CcceeeeeccCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhccccccccccccccccccccCce Confidence 22211111111111223334455667777777666652 4689999999998864322222 1111 112233 Q ss_pred CceeecceEEEcCCCCcCc-----------eEEeeccceE-EEEeecc--------eEEEEecccchhhhcCcEEEEEEE Q lcl|NC_019933. 312 APTLWGLPVVATQAMAVGQ-----------FLTGAFDAGA-QVFDRWA--------ARVEVATENQDDFIKNMVTILAEE 371 (394) Q Consensus 312 ~~~l~G~pv~~~~~~p~~~-----------~~~gd~~~~~-~~~~~~~--------~~i~~~~~~~~~~~~~~~~~~~~~ 371 (394) ...+.|+||+.++++|.+. .+-+|+.... +++.+.. ++.++.++... | ...+.+.. T Consensus 233 v~~v~Gv~V~~sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~~-~---~~~i~~~~ 308 (335) T protein:vir:63 233 VAILNGVKVLETPRFATKAIAAHPLGRHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNEK-F---SWVLDTFQ 308 (335) T ss_pred eEEeeceEEEeeccCCCCCcccccccccCCccccccceeEEEEEecceEEEEEEeecccceeeccch-h---hHHhHHHH Confidence 4578999999999998543 2334443322 2222222 22222222221 1 23455667 Q ss_pred EeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 372 RLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 372 ~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) -+|..+++|++.+.++.+..+.- T Consensus 309 a~G~g~lRPe~a~~i~~tg~~~~ 331 (335) T protein:vir:63 309 MYNIGARRPDTAGAIELKGIGAF 331 (335) T ss_pred HcCCcccccceEEEEEEcCCCce Confidence 79999999999999998554443 No 137 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=99.28 E-value=1.3e-12 Score=85.70 Aligned_cols=266 Identities=11% Similarity=-0.021 Sum_probs=161.9 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHHh---------ccccccccCceeEEEEcCcccccceecCCcccccccc Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRSL---------LAQGTMEGNTLEYVRETGFTNAAAPVAEGAQKPESSL 181 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~ 181 (394) |.++ .-+-.++|+.+..-+.+...+.+.|++- .....-+|..+++|.+...++....+.++..++..+. T Consensus 1 MA~T--~lsd~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~Gd~~~~~~~~~i~~~ki 78 (351) T protein:vir:15 1 MAET--HLSDLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLTGDPDNWTDSDDIDVNNL 78 (351) T ss_pred CCce--eeeeeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCcccchhee Confidence 4432 2245677888877777766666665441 1122235778999998765556777888999999998 Q ss_pred ceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCCcccccccc---cccc----cccc Q lcl|NC_019933. 182 RFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLP---QATA----FAAP 253 (394) Q Consensus 182 ~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~---~~~~----~~~~ 253 (394) +-++-....+..+..+.++++....+ .+....+.++++....+..+..+|..- .|++. .... .+.. T Consensus 79 tt~~~~a~i~~~~kg~~~tD~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~l------~gv~~~~~~~~~~~~d~t~~ 152 (351) T protein:vir:15 79 TSGKQQGIKFYQTKAYGYTDLGTMISGAPVQETIGNRFAAFWQRADQKTLLSVL------KGVMGVTKIANSKVYDQTKV 152 (351) T ss_pred cccceeEEEEeeccceehhhhhHhhccchHHHHHHHHHHHHHHHHHHHHHHHHH------HHHhhchhhcccceeccccc Confidence 88888888888888899999876555 477888999999999999988776521 11110 0011 0111 Q ss_pred ccccccchHHHHHHHHHHhhhhcC-CCCeeEeCHHHHHHHHHhhccCCcccccCcccCCCceeecceEEEcCCCCcC--- Q lcl|NC_019933. 254 ITVANATAVDRLRLALLQAQLAEF-PATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPTLWGLPVVATQAMAVG--- 329 (394) Q Consensus 254 ~~~~~~~~~~~i~~~~~~~~~~~~-~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~--- 329 (394) ...+...+.+.+.++...+-.... .-.+|+||+.++..|++..--+ ++-....+..-++++|++|++++.+|.. T Consensus 153 ~~~~~~is~~~l~~A~~~~GD~~~~~~~~ivmhS~v~~~L~~~~li~--~~~~s~~~~~i~t~~G~~VivdD~~p~~~~~ 230 (351) T protein:vir:15 153 SPSEPMFGAKGFTGAIGLMGDLQDTAFGAIAVNSATYSLMKVQGLIE--TIQPQNGATPFEAYNGLRIVLDDDIEIDLTD 230 (351) T ss_pred cccccccCHHHHHHHHHHhccccccceEEEEEChHHHHHHHhhhhhh--hccccccCcccceecceEEEEcCCCccccCC Confidence 223344567889998888766533 3579999999999998653111 1111111223468999999999999842 Q ss_pred -------ceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEec----CCCCC Q lcl|NC_019933. 330 -------QFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLA----AAAGT 394 (394) Q Consensus 330 -------~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~----~a~~~ 394 (394) ..+|+. +...+......+++.++.... .++..++...++ +.||..+..-+.. ...|| T Consensus 231 ~~~~~ytsyl~~~---GAi~~~~~~~~ve~~rd~~~~--~g~d~l~~r~~~---~~hp~G~s~~~~~~~~~~~sPt 298 (351) T protein:vir:15 231 KTKPVSTSYIFAP---GAVRYSTNMRSTETKYDPLIN--GGQDVIVQKRVG---TIHVAGTSIKASFSPSKASFPT 298 (351) T ss_pred CCCceeEEEEEec---ceeeeecCCcCcceeecccCC--CCceEEEEeeee---eeeeeeeeecccccccCcCCcC Confidence 112222 222222334445555544332 344455554443 4666666553221 12234 No 138 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=99.27 E-value=9.5e-13 Score=86.48 Aligned_cols=283 Identities=10% Similarity=0.028 Sum_probs=166.6 Q ss_pred hhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccc-cCceeEEEEcCcccccceecCCccccc Q lcl|NC_019933. 100 AEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTME-GNTLEYVRETGFTNAAAPVAEGAQKPE 178 (394) Q Consensus 100 ~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~eg~~~~~ 178 (394) +. ......+....+..++....| +.+..+|.+.....+.++++.++.++. |+++.+|+... ..+.+...|+.+.. T Consensus 1 ms-~~~~~t~~~~~~s~~d~al~l-e~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~iG~--~~~~~~~pG~~l~~ 76 (335) T protein:vir:78 1 MS-FLNDLTRPNYAGKNADVDIHL-EEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRLGN--VEAKGRRAGEELER 76 (335) T ss_pred CC-ccccccccccccccchhhhhh-hhhhhHHHHHHHHhhhhccccceeeeccceeEEEeeeee--eeecccccCcccCC Confidence 00 000001111112222222333 889999999999999999999998886 56899998743 45677788888777 Q ss_pred cccceeeEEeeeeeEEEeehhhHHHHHH-----HH-HHHHHHHHHHHHHHHHHHHHHHhh----ccCCCccc-------c Q lcl|NC_019933. 179 SSLRFDLVQTSAKVIAHWMKASRQILSD-----SA-QLQSFINARLLRGLEVVEENQLLN----GNGTGQNL-------L 241 (394) Q Consensus 179 ~~~~~~~i~~~~~k~~~~~~is~e~l~~-----s~-~~~~~i~~~la~a~~~~~d~a~l~----g~g~~~~~-------~ 241 (394) +.+..++..+....+- +++.++.+ +. |+.+.+.++++.++++..|+.++. +.....+. . T Consensus 77 ~~~~~~k~~itID~ll----~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~~~~ 152 (335) T protein:vir:78 77 SRVVNDKWNLTVDTLL----YLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSP 152 (335) T ss_pred CCcccCCeEEEeccee----echhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCcCC Confidence 7777788777777644 45544432 22 789999999999999999997762 22221111 1 Q ss_pred ccccccccccccccccccchHHHHHHHHHHhhhhcCCC-----CeeEeCHHHHHHHHHhhccCCc-cccc----CcccCC Q lcl|NC_019933. 242 GLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPA-----TGIVLNPADWAGIELLKDTQGR-YILG----NPQGTL 311 (394) Q Consensus 242 Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~-----~~~~~~~~~~~~l~~lkd~~G~-~~~~----~~~~~~ 311 (394) |+.........+.........+.+.++...+...+.+. -+++++|..|..|..-..--.+ |... ....+. T Consensus 153 G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g~ 232 (335) T protein:vir:78 153 GVLEKLDLTGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMSVEYQATGATNDYVKSR 232 (335) T ss_pred CcceeeeeccccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhcccccccccccccccccccccce Confidence 32221111122222234445556666666666665542 3689999999998864322111 1111 112233 Q ss_pred CceeecceEEEcCCCCcCce-----------EEeeccce-EEEEeecc--------eEEEEecccchhhhcCcEEEEEEE Q lcl|NC_019933. 312 APTLWGLPVVATQAMAVGQF-----------LTGAFDAG-AQVFDRWA--------ARVEVATENQDDFIKNMVTILAEE 371 (394) Q Consensus 312 ~~~l~G~pv~~~~~~p~~~~-----------~~gd~~~~-~~~~~~~~--------~~i~~~~~~~~~~~~~~~~~~~~~ 371 (394) ...+.|+||+.++++|.+.+ +=+|+... ..++.+.. +..++.++.. .| ...+.+.. T Consensus 233 v~~v~Gv~V~~Sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~~-~~---~~~i~~~~ 308 (335) T protein:vir:78 233 VAILNGVKVLETPRFATKAISAHPLGRHFNVSAEEAERQIALFLPSKTLITAQVAPVQAKLWEDHD-QF---SWVLDTFQ 308 (335) T ss_pred eEEeeceEEEeeccCCCCCCccccccccCCcccccccceEEEEEecceEEEEEEEecccceeeccc-hh---hHhhhHHH Confidence 45789999999999995421 22344332 22333222 2222222221 12 23455677 Q ss_pred EeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 372 RLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 372 ~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) -+|.++++|++.+.++.+....- T Consensus 309 a~G~g~lRPe~a~~i~~tg~~~~ 331 (335) T protein:vir:78 309 MYNIGARRPDTAGAIELKGIEAF 331 (335) T ss_pred HcCCcccCcceEEEEEecCCCcc Confidence 79999999999999997765544 No 139 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=99.25 E-value=2.9e-12 Score=83.85 Aligned_cols=283 Identities=12% Similarity=0.099 Sum_probs=161.0 Q ss_pred HHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccc-cCceeEEEEcCcccccceecCCccccccc Q lcl|NC_019933. 102 INIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTME-GNTLEYVRETGFTNAAAPVAEGAQKPESS 180 (394) Q Consensus 102 ~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~eg~~~~~~~ 180 (394) +-..+..+....+...+--.+.=+.+..++.+.....+.++++..+.++. |+++.+|+... ..+.+...|+.+.-+. T Consensus 1 ms~~n~~t~~~~~~~~~~~al~le~f~geV~taf~~~s~~~~~~~~rti~~gkS~q~~~iG~--~~~~~~~~G~~ld~~~ 78 (364) T protein:vir:10 1 MSNPNVLTQPAVSASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDVQEVVGTNSVSNKYIGE--TELQVLSPGKSPDASP 78 (364) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEeeeeee--eEEeeeccCcccCCCC Confidence 00000000000111001112233788899999999999999999998886 56899999743 3456666677766666 Q ss_pred cceeeEEeeeeeEEEeehhhHHHHH------HHHH-HHHHHHHHHHHHHHHHHHHHHhh----ccCCCcc---ccccccc Q lcl|NC_019933. 181 LRFDLVQTSAKVIAHWMKASRQILS------DSAQ-LQSFINARLLRGLEVVEENQLLN----GNGTGQN---LLGLLPQ 246 (394) Q Consensus 181 ~~~~~i~~~~~k~~~~~~is~e~l~------~s~~-~~~~i~~~la~a~~~~~d~a~l~----g~g~~~~---~~Gi~~~ 246 (394) +.-++.+|....+- +++.++. +.-+ +.+.+..+++.++++..|+.++. +...... -.++... T Consensus 79 ~~~~k~~itID~ll----~a~~~V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~~~~~ 154 (364) T protein:vir:10 79 TEFDKNRLVVDTTV----IARNTVAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNPRVAG 154 (364) T ss_pred cccCcEEEEeccee----eechhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCCcccC Confidence 77777777777644 3333332 1224 67888889999999999987752 1101100 0111111 Q ss_pred cc------cccccccccccchHHHHHHHHHHhhhhcCCCC--eeEeCHHHHHHHHHhhccCC-cccc---cCcccCCCce Q lcl|NC_019933. 247 AT------AFAAPITVANATAVDRLRLALLQAQLAEFPAT--GIVLNPADWAGIELLKDTQG-RYIL---GNPQGTLAPT 314 (394) Q Consensus 247 ~~------~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~~~~~~~~~~l~~lkd~~G-~~~~---~~~~~~~~~~ 314 (394) .+ ..+.+.........+.+.++...+...+.+.. +++++|..|..|.+-..--. .|.. .....+.... T Consensus 155 ~g~~i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~~R~~vv~P~~y~~Ll~~~~lvn~d~~~~~~~~~~~G~v~~ 234 (364) T protein:vir:10 155 HGFSIHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDTSELCGLMPWTAFNCLRDADRIVDKSYTIAASDNTVDGFVLK 234 (364) T ss_pred CcceeeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCccccEEEeChHHHHHHhcCCccccccccccCCCccccceeEE Confidence 11 11111222333445666677777777776543 78999999988876322111 1111 1112233357 Q ss_pred eecceEEEcCCCCcCc---------------------e--EEeeccc-eEEEEee--------cceEEEEecccchhhhc Q lcl|NC_019933. 315 LWGLPVVATQAMAVGQ---------------------F--LTGAFDA-GAQVFDR--------WAARVEVATENQDDFIK 362 (394) Q Consensus 315 l~G~pv~~~~~~p~~~---------------------~--~~gd~~~-~~~~~~~--------~~~~i~~~~~~~~~~~~ 362 (394) +.|+||+.++.+|... - ..+|+.. ...+|.+ .++..++.++.. + T Consensus 235 v~Gv~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e~~~~~~----~ 310 (364) T protein:vir:10 235 SWNTPIVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISITGDIFYEKK----E 310 (364) T ss_pred EeceEEEeccccccccccccccccccccccccccCCcccccccccceeEEEEEecceEEEEEEecceeeeeeccc----e Confidence 8999999999998420 0 1133332 1233333 334444333222 2 Q ss_pred CcEEEEEEEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 363 NMVTILAEERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 363 ~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) =...+.+..-+|..+++|+|.+.++..++++- T Consensus 311 ~~~~ida~~a~G~g~lRPeaa~~i~~~~~~~~ 342 (364) T protein:vir:10 311 KTWYIDTFLAEGAIPDRWEAVAVVTAADTAEL 342 (364) T ss_pred eeeeeeeehcccCcccCccceEEEEecCCCCC Confidence 23445567779999999999999998888877 No 140 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=99.25 E-value=9.7e-13 Score=86.44 Aligned_cols=283 Identities=14% Similarity=0.075 Sum_probs=161.5 Q ss_pred HHHHhhcc---cc---cCCcCc--cccchhhhhHHHhhhhhhhhHHHhccccccc-cCceeEEEEcCcccccceecCCcc Q lcl|NC_019933. 105 KAAITSLS---TN---ADGSAG--ATVQTTRLPGILELPQRRMTIRSLLAQGTME-GNTLEYVRETGFTNAAAPVAEGAQ 175 (394) Q Consensus 105 ~~~~~~~~---~~---~~~~~g--~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~eg~~ 175 (394) .+...... +. ....+- .+.=+.|..+|...+...+.++++++..++. |+++.+|+... ..+.....|+. T Consensus 1 ~~~~~~~~~~~t~~g~~~~~~~~~al~ie~~~g~V~~~f~~~s~~~~~v~~r~~~~G~sv~i~~iG~--~t~~~~~~g~~ 78 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIGR--TKAAYLKPGEN 78 (347) T ss_pred CCCCccCcccccccccCCcccchHHHHHHHHHHHHHHHHHHHHhhhhhhccccccccceeEeeeccc--eeeeeecCCCC Confidence 11000000 00 001111 1223889999999999999999999987765 67889998754 34566666777 Q ss_pred cccc--ccceeeEEeeeeeEEEe-ehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhc-----cCCCc------cc Q lcl|NC_019933. 176 KPES--SLRFDLVQTSAKVIAHW-MKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNG-----NGTGQ------NL 240 (394) Q Consensus 176 ~~~~--~~~~~~i~~~~~k~~~~-~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g-----~g~~~------~~ 240 (394) ++.+ ++...+.++...++-.+ ..|.+.---++ .++.+.+.++.+.++++..|+.++.- ..... .+ T Consensus 79 l~~~~~~~~~~e~~ltiD~~~y~~~~VddiD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~~ 158 (347) T protein:vir:33 79 LDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGSNENIEGL 158 (347) T ss_pred CCCCCCCCccceEEEEechhhhhhHHHhhHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccc Confidence 6543 35666666655543211 12222111122 37888899999999999999988721 11110 01 Q ss_pred ccc--cccc---ccccccccccccchHHHHHHHHHHhhhhcCC--CCeeEeCHHHHHHHHHhhc-cCCccccc-CcccCC Q lcl|NC_019933. 241 LGL--LPQA---TAFAAPITVANATAVDRLRLALLQAQLAEFP--ATGIVLNPADWAGIELLKD-TQGRYILG-NPQGTL 311 (394) Q Consensus 241 ~Gi--~~~~---~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~--~~~~~~~~~~~~~l~~lkd-~~G~~~~~-~~~~~~ 311 (394) .+. .... .....+........++.|+++...+...+.+ .-.++++|..|..|.+-.. .++.+.-. ....+. T Consensus 159 ~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~d~~~~~~~~~G~ 238 (347) T protein:vir:33 159 GKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQALLDPERGT 238 (347) T ss_pred cccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCcEEEeCHHHHHHHhccccccccccccccccccce Confidence 110 0000 0001111112334577888888888888775 3368999999998875432 22333211 122333 Q ss_pred CceeecceEEEcCCCCcCce----------------------EEeeccce-EEEEe--------ecceEEEEecccchhh Q lcl|NC_019933. 312 APTLWGLPVVATQAMAVGQF----------------------LTGAFDAG-AQVFD--------RWAARVEVATENQDDF 360 (394) Q Consensus 312 ~~~l~G~pv~~~~~~p~~~~----------------------~~gd~~~~-~~~~~--------~~~~~i~~~~~~~~~~ 360 (394) .+.++|++|+.++.+|.+.+ +-++|+.. .+++. ..++.++..++.. T Consensus 239 V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e~~r~~~--- 315 (347) T protein:vir:33 239 IRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRAN--- 315 (347) T ss_pred eEEEeceeEEEecccccCccccccccccccccccccCCcccceeccccceeeeeecchhheeeeeeceeeeeccchh--- Confidence 46899999999999986432 11122111 11222 2233444444332 Q ss_pred hcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|NC_019933. 361 IKNMVTILAEERLALAVYRPESFIKGSLAAAAG 393 (394) Q Consensus 361 ~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~ 393 (394) +-...+++...+|.++++|++.+.++++..+- T Consensus 316 -~~~d~i~~~~~~G~~vlrP~~av~i~~~~~~~ 347 (347) T protein:vir:33 316 -YQADQIIAKYAMGHGGLRPEAAGAIVLPKVSE 347 (347) T ss_pred -hhhHhhhhhhhcCCceecccceEEEecCCCCC Confidence 22345677777899999999999999998888 No 141 >protein:vir:95318 Length: 328 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512264;genbank:gi:89152431;genbank:GeneID:3952987 Probab=99.23 E-value=3.9e-13 Score=88.60 Aligned_cols=226 Identities=11% Similarity=0.061 Sum_probs=155.3 Q ss_pred Hhhcccc--cCC-cCccccchhhhhHHHhhhhhhhhHHHhccccccc-cCceeEEEEcCcccccceecCCccccccccce Q lcl|NC_019933. 108 ITSLSTN--ADG-SAGATVQTTRLPGILELPQRRMTIRSLLAQGTME-GNTLEYVRETGFTNAAAPVAEGAQKPESSLRF 183 (394) Q Consensus 108 ~~~~~~~--~~~-~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~ 183 (394) ....... +.. .+..+-|......|+|.+.+.++|++.+++.... +..+.+.+..+- +.+.|..=++.++.++.++ T Consensus 1 m~~~~~~~~TL~e~Akr~~~d~~~~~VIE~l~~~n~IL~~lpf~e~n~gt~~~~~v~~~L-P~~~fR~lN~g~~~s~~tt 79 (328) T protein:vir:95 1 MAVKGLTALTLADWGKRVDPNGKVDKIIELLGQTNPILQDMPFVEGNLPTGHRTTIRSGL-PSATWRLLNYGVQPSKSTT 79 (328) T ss_pred CCccccccccHHHHHhhhCcchhHHHHHHHHhccchhHhhcceeecccCCcceeeEeecc-CCceeeecCCccCccccee Confidence 1011011 111 1334557778889999999999999999999885 445777787765 7888999999999999999 Q ss_pred eeEEeeeeeEEEeehhhHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhhccCCCc--cccccccc------------ Q lcl|NC_019933. 184 DLVQTSAKVIAHWMKASRQILSDSA---QLQSFINARLLRGLEVVEENQLLNGNGTGQ--NLLGLLPQ------------ 246 (394) Q Consensus 184 ~~i~~~~~k~~~~~~is~e~l~~s~---~~~~~i~~~la~a~~~~~d~a~l~g~g~~~--~~~Gi~~~------------ 246 (394) .+++-..+-+++.+.|.+.+.+... ++...-.....+++++.+...||+|+...+ .+.|+.+. T Consensus 80 ~q~t~~l~ilgg~~eVDr~la~~~Gn~~~~ra~q~~~~~ka~~~~~~~~~iyGdsa~~p~~F~GL~~R~~~~s~~~a~qi 159 (328) T protein:vir:95 80 VQVTDSVGMLETYAEVDKSLADLNGNTAEFRLSEDRAFIEAMNQQMAQTLFYGDSSVNPQQFMGLSSRYSSLSAGNAQNI 159 (328) T ss_pred EEEEEEEEEEecceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCccCChhhhcchhhhcCccccccccce Confidence 9999999999999999999887553 455556667899999999999999865433 23333110 Q ss_pred --c---------------------------------------------------------------------------cc Q lcl|NC_019933. 247 --A---------------------------------------------------------------------------TA 249 (394) Q Consensus 247 --~---------------------------------------------------------------------------~~ 249 (394) + +. T Consensus 160 idaGgtg~~~TSi~~v~~g~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~g~~y~~y~~~~~w~~Gl~i~d~r~vvrI~NI 239 (328) T protein:vir:95 160 IDAGGTGTDNTSIWLVVWGENTVHGIFPKGKKAGIQMEDKGQVTLEDANGGKYEGYRTHYKWDNGLALRDWRYVVRIANI 239 (328) T ss_pred eecccCCCCceEEEEEEEcCCeEEEecccccccCceeeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEecC Confidence 0 00 Q ss_pred c--cccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhh-ccCCcccccCcc-cCCCceeecceEEEcCC Q lcl|NC_019933. 250 F--AAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLK-DTQGRYILGNPQ-GTLAPTLWGLPVVATQA 325 (394) Q Consensus 250 ~--~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lk-d~~G~~~~~~~~-~~~~~~l~G~pv~~~~~ 325 (394) . ..+....+....+.+++++..++.....+.+|+||.+....|++.. +.....+-.... +.....+.|+||..++. T Consensus 240 d~~~l~~~~~~~~l~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~~~~~~~~g~~~t~~~gipir~~da 319 (328) T protein:vir:95 240 DVSNLSEPSSAANIAKLMVKALHRIPNRGMGRPVFYMNRTVGQALDLQSLEKTSLAISVKETEGEWWTSFRGVPIRETDA 319 (328) T ss_pred cccccccccChhhHHHHHHHHHHHhccCCCCcceeehhHHHHHHHHHHHhcCcceeeeeeccCCcceeEECCeEEEEEee Confidence 0 0001122334566667777888877778889999999999998764 333323322222 23334689999999998 Q ss_pred CCcCceEEe Q lcl|NC_019933. 326 MAVGQFLTG 334 (394) Q Consensus 326 ~p~~~~~~g 334 (394) +-.++..+. T Consensus 320 i~~tE~~vv 328 (328) T protein:vir:95 320 LLETEARVV 328 (328) T ss_pred eecCccccC Confidence 765543333 No 142 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=99.21 E-value=3.9e-13 Score=88.64 Aligned_cols=278 Identities=13% Similarity=0.135 Sum_probs=158.9 Q ss_pred HHHHhhccccc-CCcC----c--cccchhhhhHHHhhhhhhhhHHHhccccccc-cCceeEEEEcCcccccceecCCccc Q lcl|NC_019933. 105 KAAITSLSTNA-DGSA----G--ATVQTTRLPGILELPQRRMTIRSLLAQGTME-GNTLEYVRETGFTNAAAPVAEGAQK 176 (394) Q Consensus 105 ~~~~~~~~~~~-~~~~----g--~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~eg~~~ 176 (394) .+..+....++ ...| . .+.=+.+.++++......+.++++++..++. |+++.+|+... ..+..+..|+.+ T Consensus 1 m~~~~~~~~~t~~g~~~~~~d~~al~ik~f~~eV~~~f~~~s~~~~~~~~r~i~~G~sv~i~~iG~--~tv~~~t~G~~l 78 (347) T protein:vir:94 1 MANVPGQKIGTDQGKGKSSSDALALFLKVFAGEVLTAFTRRSVTADKHIVRTIQNGKSAQFPVMGR--TSGVYLAPGERL 78 (347) T ss_pred CCCCCccccccccccCCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccccceEEEecccc--eeeeeecCCCCc Confidence 11000000000 0111 1 2223688899999888888899999988876 66888998743 456667777776 Q ss_pred ccc--ccceeeEEeeeeeEEEeehhhHHHHHH-----H-HHHHHHHHHHHHHHHHHHHHHHHhhcc----C----CCccc Q lcl|NC_019933. 177 PES--SLRFDLVQTSAKVIAHWMKASRQILSD-----S-AQLQSFINARLLRGLEVVEENQLLNGN----G----TGQNL 240 (394) Q Consensus 177 ~~~--~~~~~~i~~~~~k~~~~~~is~e~l~~-----s-~~~~~~i~~~la~a~~~~~d~a~l~g~----g----~~~~~ 240 (394) +.+ +..-.+++|...++- +++.++.+ + .++.+.+.++.+.++++..|+.++.-. . +...+ T Consensus 79 ~~~~~~~~~~e~~itID~~~----~~~~~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~~~~ 154 (347) T protein:vir:94 79 SDKRKGIKHTEKVITIDGLL----TADVMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAASNENI 154 (347) T ss_pred CCCCCCCCcceEEEEecchh----hhhHHhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc Confidence 543 334445445554432 33434332 2 278888999999999999999876311 0 11222 Q ss_pred cccccccccccc------cccccccchHHHHHHHHHHhhhhcCCC--CeeEeCHHHHHHHHHhhccCC-ccccc-CcccC Q lcl|NC_019933. 241 LGLLPQATAFAA------PITVANATAVDRLRLALLQAQLAEFPA--TGIVLNPADWAGIELLKDTQG-RYILG-NPQGT 310 (394) Q Consensus 241 ~Gi~~~~~~~~~------~~~~~~~~~~~~i~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~lkd~~G-~~~~~-~~~~~ 310 (394) .|+......... +........++.|.++...+...+.+. -.++++|..+..|..-+..+. .+.-. ....+ T Consensus 155 ~g~~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G 234 (347) T protein:vir:94 155 AGLGTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILAALMPNAANYAALIDPETG 234 (347) T ss_pred CCCcccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhccchhhhhhcccccccccc Confidence 332211111111 111122345677777777787777653 378999999988754433222 11111 12223 Q ss_pred CCceeecceEEEcCCCCcCce--------------------------EEeeccceEE-EEeec--------ceEEEEecc Q lcl|NC_019933. 311 LAPTLWGLPVVATQAMAVGQF--------------------------LTGAFDAGAQ-VFDRW--------AARVEVATE 355 (394) Q Consensus 311 ~~~~l~G~pv~~~~~~p~~~~--------------------------~~gd~~~~~~-~~~~~--------~~~i~~~~~ 355 (394) ..++++|++|+.++.+|.+.. +-+||+.... ++.+. +++++..++ T Consensus 235 ~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~ 314 (347) T protein:vir:94 235 NIRNVMGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDLALERDRD 314 (347) T ss_pred ceEEEeceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhcccccccchhc Confidence 446899999999999984210 1122222221 22222 223333222 Q ss_pred cchhhhcCcEEEEEEEEeccEEecccceEEEEecCCC Q lcl|NC_019933. 356 NQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAAA 392 (394) Q Consensus 356 ~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~ 392 (394) .. +=...+++...+|.++++|++.+.++.++|- T Consensus 315 ~~----~~~d~i~~~~~~G~~~~rP~~a~~~~~~~A~ 347 (347) T protein:vir:94 315 VD----AQGDLIVGKYAMGHGGLRPEAAGALVFSPAE 347 (347) T ss_pred hh----hHHHHhhhhhhhcCcccccceeEEEEecCCC Confidence 21 1134688899999999999999999999888 No 143 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=99.21 E-value=1.2e-12 Score=86.02 Aligned_cols=277 Identities=14% Similarity=0.094 Sum_probs=159.9 Q ss_pred HHHH-hhcccccCCc---Cc-----cccchhhhhHHHhhhhhhhhHHHhccccccc-cCceeEEEEcCcccccceecCCc Q lcl|NC_019933. 105 KAAI-TSLSTNADGS---AG-----ATVQTTRLPGILELPQRRMTIRSLLAQGTME-GNTLEYVRETGFTNAAAPVAEGA 174 (394) Q Consensus 105 ~~~~-~~~~~~~~~~---~g-----~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~eg~ 174 (394) .+.. .....++... +| .+.=+.|..++++.+...+.+++++++.++. |+++.+|+... ..+..+..|+ T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~g~s~~~~~iG~--~~~~~~~~G~ 78 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVLGR--TQAAYLAPGE 78 (344) T ss_pred CccccccccCCcccCCccCCccchhHHHHHHHHHHHHHHHHHHhhhcccceeeeecccceEEEEeece--eEEEeeecCC Confidence 1111 0000011111 11 1222889999999999999999999998887 56888998744 3456777788 Q ss_pred ccccc--ccceeeEEeeeeeEEEeehhhHHHHHH-----H-HHHHHHHHHHHHHHHHHHHHHHHhhcc----C----CCc Q lcl|NC_019933. 175 QKPES--SLRFDLVQTSAKVIAHWMKASRQILSD-----S-AQLQSFINARLLRGLEVVEENQLLNGN----G----TGQ 238 (394) Q Consensus 175 ~~~~~--~~~~~~i~~~~~k~~~~~~is~e~l~~-----s-~~~~~~i~~~la~a~~~~~d~a~l~g~----g----~~~ 238 (394) .++.+ ++.-.+++|...++- +++.++.| + -++.+.+.++++.++++..|+.++.-- . .+. T Consensus 79 ~l~~t~~~~~~~e~~l~ID~~~----y~~~~VdDiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~~~~~ 154 (344) T protein:vir:10 79 NLDDIRKDIKHTEKVITIDGLL----TADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESQYNE 154 (344) T ss_pred CCCCCCCCcccceEEEEEcchh----hhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Confidence 87654 466677666666532 33333322 2 278899999999999999998876321 1 112 Q ss_pred cccccccc----cccccc---cccccccchHHHHHHHHHHhhhhcCCCC--eeEeCHHHHHHHHHhhccC-CcccccC-c Q lcl|NC_019933. 239 NLLGLLPQ----ATAFAA---PITVANATAVDRLRLALLQAQLAEFPAT--GIVLNPADWAGIELLKDTQ-GRYILGN-P 307 (394) Q Consensus 239 ~~~Gi~~~----~~~~~~---~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~~~~~~~~~~l~~lkd~~-G~~~~~~-~ 307 (394) .|.|.-.. ....+. .....+...++.|.++...+...+.+.. .++++|..|..|..-+..+ ..+.-.. . T Consensus 155 ~~~g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~ 234 (344) T protein:vir:10 155 NITGLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPSSDRVFYCDPDSYSAILAALMPNAANYAALIDP 234 (344) T ss_pred ccccccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCCEEEeChHHHHHHhhcccccccccccccce Confidence 22222111 111111 1111223456778888888888777533 5688999999886533221 2221111 1 Q ss_pred ccCCCceeecceEEEcCCCCcCce------E---------------EeeccceE-EEEe--------ecceEEEEecccc Q lcl|NC_019933. 308 QGTLAPTLWGLPVVATQAMAVGQF------L---------------TGAFDAGA-QVFD--------RWAARVEVATENQ 357 (394) Q Consensus 308 ~~~~~~~l~G~pv~~~~~~p~~~~------~---------------~gd~~~~~-~~~~--------~~~~~i~~~~~~~ 357 (394) ..+..+.+.|++|+.++.+|.+.+ . ..+++... .++. -.++.++..++. T Consensus 235 ~~G~V~~v~G~~V~~Sn~lp~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~~~e~~r~~- 313 (344) T protein:vir:10 235 EKGSIRNVMGFEVVEVPHLTAGGAGTSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLALERARRA- 313 (344) T ss_pred eeeEEEEEeceEEEeccccccccCCcccccccCccccccCCcccceeeecceeEEEeechhhhhhhhhccceeecccch- Confidence 122235789999999999985311 1 11222211 1221 122344443332 Q ss_pred hhhhcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|NC_019933. 358 DDFIKNMVTILAEERLALAVYRPESFIKGSLAAA 391 (394) Q Consensus 358 ~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a 391 (394) .+| ...+++..-+|.++++|++.+.++++.- T Consensus 314 ~~~---~d~i~g~~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 314 NFQ---ADQIIAKYAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred hHH---HHHHHHHhhcccceecccceEEEEeecC Confidence 122 2367788899999999999977777666 No 144 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=99.21 E-value=1e-12 Score=86.28 Aligned_cols=284 Identities=12% Similarity=0.028 Sum_probs=158.3 Q ss_pred HHHHHHHhhcccc------cCCcCc-cccchhhhhHHHhhhhhhhhHHHhccccccc-cCceeEEEEcCcccccceecCC Q lcl|NC_019933. 102 INIKAAITSLSTN------ADGSAG-ATVQTTRLPGILELPQRRMTIRSLLAQGTME-GNTLEYVRETGFTNAAAPVAEG 173 (394) Q Consensus 102 ~~~~~~~~~~~~~------~~~~~g-~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~eg 173 (394) +-.-+.++..+.. ...+.- .+.=+.|..++++.....+.++.+++..++. |+++.+|+... ..+.....| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~~~r~i~~G~tv~i~~ig~--~~~~~~~~g 78 (332) T protein:vir:78 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGK--LSAGYHTPG 78 (332) T ss_pred CcccccccCCccccCCccccccccchhhhhhhhhhhHHHHHHHHhhhhhccccccccccceEEEEeccc--eeEeeecCC Confidence 1100111111111 111111 1333789999999999999999999988776 67899999754 345555556 Q ss_pred ccccc-cccceeeEEeeeeeEE-EeehhhHHHHH-HH-HHHHHHHHHHHHHHHHHHHHHHHhh----ccCCCcccccccc Q lcl|NC_019933. 174 AQKPE-SSLRFDLVQTSAKVIA-HWMKASRQILS-DS-AQLQSFINARLLRGLEVVEENQLLN----GNGTGQNLLGLLP 245 (394) Q Consensus 174 ~~~~~-~~~~~~~i~~~~~k~~-~~~~is~e~l~-~s-~~~~~~i~~~la~a~~~~~d~a~l~----g~g~~~~~~Gi~~ 245 (394) ..+.. .++.-.++++...+.- .-..|.+ +-+ ++ .++.+.+.++.+.++++..|+.++. +.....+..+... T Consensus 79 ~~l~~~~~~~~~~~~l~ID~~ky~~~~Vdd-iD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~~~~~~~g 157 (332) T protein:vir:78 79 TPIVGDAGIKANEKTLVMDDLLVSSQFVYS-LDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPG 157 (332) T ss_pred CCCCCCCCCCCceEEEEEehhhhhHHHHHh-HHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCccccccc Confidence 65533 3455566666655422 1112221 111 12 2789999999999999999987763 2222222222111 Q ss_pred cccc-ccccccccccchHHHHHHHHHHhhhhcCCCC--eeEeCHHHHHHHHHhhccC--Cc-ccccC--cc-cCCCceee Q lcl|NC_019933. 246 QATA-FAAPITVANATAVDRLRLALLQAQLAEFPAT--GIVLNPADWAGIELLKDTQ--GR-YILGN--PQ-GTLAPTLW 316 (394) Q Consensus 246 ~~~~-~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~~~~~~~~~~l~~lkd~~--G~-~~~~~--~~-~~~~~~l~ 316 (394) .... .+.....+....++.|+++...+...+.+.. .++++|..|..|.+.+|.. .+ +.... .. +...+.+. T Consensus 158 ~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~d~~~~n~~~~~~~~~~~~g~~i~~i~ 237 (332) T protein:vir:78 158 GFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIA 237 (332) T ss_pred ccccccCCccccCHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHHhhcCceeeeeeccccccceecceeeeEEe Confidence 1111 0111122344567888899899988888644 4677999999887643321 00 00011 11 11235789 Q ss_pred cceEEEcCCCCcCc--------------eEEeeccce-EEEEeecce--------EEEEecccchhhhcCcEEEEEEEEe Q lcl|NC_019933. 317 GLPVVATQAMAVGQ--------------FLTGAFDAG-AQVFDRWAA--------RVEVATENQDDFIKNMVTILAEERL 373 (394) Q Consensus 317 G~pv~~~~~~p~~~--------------~~~gd~~~~-~~~~~~~~~--------~i~~~~~~~~~~~~~~~~~~~~~~~ 373 (394) |++|+.++.+|... .+-|+|+.. ..++.+..+ .++.... ..+-.+-...+++...+ T Consensus 238 G~~V~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~-~~~~~~~~d~i~~~~~~ 316 (332) T protein:vir:78 238 GIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSG-DFNVQYQGDLIVGKLAM 316 (332) T ss_pred eeEEEecCccccCcccccccccccccccccccccccceEEeecccceeeeeeeccchhhhhc-ccchhhhHhhhhhhhhh Confidence 99999999998532 133344332 223333322 2322110 00111112356677789 Q ss_pred ccEEecccceEEEEec Q lcl|NC_019933. 374 ALAVYRPESFIKGSLA 389 (394) Q Consensus 374 d~~v~~~~a~~~l~~~ 389 (394) |.++++|++++.++-+ T Consensus 317 G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 317 GCGSLRTSVAGSFQAA 332 (332) T ss_pred cCceecccceEEEeeC Confidence 9999999999999887 No 145 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=99.15 E-value=1.7e-11 Score=79.59 Aligned_cols=283 Identities=13% Similarity=0.075 Sum_probs=157.3 Q ss_pred HHHH-hhccc-ccCCcCc------cccchhhhhHHHhhhhhhhhHHHhccccccc-cCceeEEEEcCcccccceecCCcc Q lcl|NC_019933. 105 KAAI-TSLST-NADGSAG------ATVQTTRLPGILELPQRRMTIRSLLAQGTME-GNTLEYVRETGFTNAAAPVAEGAQ 175 (394) Q Consensus 105 ~~~~-~~~~~-~~~~~~g------~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~eg~~ 175 (394) .+.. ..... +....+| .+.=+.+..+++......+.++++++..++. |+++.+|+... ..+.....|.. T Consensus 1 ma~~~~~~~~~t~~~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~~~~~G~sv~i~~ig~--~t~~~~~~g~~ 78 (347) T protein:vir:15 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIGR--TKAAYLKPGEN 78 (347) T ss_pred CCccccCCccccccccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhccccccccccceeEeeeccc--eeeeeeccCCC Confidence 1100 00000 0011111 1223677888889889999999999887765 67899998754 45666677777 Q ss_pred cccc--ccceeeEEeeeeeEEE-eehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccC--------CCcccc-- Q lcl|NC_019933. 176 KPES--SLRFDLVQTSAKVIAH-WMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNG--------TGQNLL-- 241 (394) Q Consensus 176 ~~~~--~~~~~~i~~~~~k~~~-~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g--------~~~~~~-- 241 (394) ++.+ +++..+.+|...+.-. -..|.+----++ .++.+.+.++.+.++++..|+.++.--. +...+. T Consensus 79 l~~~~~~~~~~e~~ltID~~~~~~~~VddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~~ 158 (347) T protein:vir:15 79 LDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDASNENIEGL 158 (347) T ss_pred CCCCCCCCccceEEEEechhhhhhHHhhhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Confidence 6543 4566776666554322 112222111122 3788999999999999999998873210 001110 Q ss_pred ---cccccccccccc---ccccccchHHHHHHHHHHhhhhcCC--CCeeEeCHHHHHHHHHhhccC-Cccccc-CcccCC Q lcl|NC_019933. 242 ---GLLPQATAFAAP---ITVANATAVDRLRLALLQAQLAEFP--ATGIVLNPADWAGIELLKDTQ-GRYILG-NPQGTL 311 (394) Q Consensus 242 ---Gi~~~~~~~~~~---~~~~~~~~~~~i~~~~~~~~~~~~~--~~~~~~~~~~~~~l~~lkd~~-G~~~~~-~~~~~~ 311 (394) ++.......+.. ........++.+.++...+...+.+ .-.++++|..|..|.+-.+.. ..+.-. ....+. T Consensus 159 g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL~~~~~~~~d~~~~~~~~~G~ 238 (347) T protein:vir:15 159 GKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQALIDHERGT 238 (347) T ss_pred CccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhcccccccccccccccccceE Confidence 111111111100 0011223356666666677777764 225788999999987543322 222111 122233 Q ss_pred CceeecceEEEcCCCCcCce----------------------EEeeccc-eEEE--------EeecceEEEEecccchhh Q lcl|NC_019933. 312 APTLWGLPVVATQAMAVGQF----------------------LTGAFDA-GAQV--------FDRWAARVEVATENQDDF 360 (394) Q Consensus 312 ~~~l~G~pv~~~~~~p~~~~----------------------~~gd~~~-~~~~--------~~~~~~~i~~~~~~~~~~ 360 (394) .+.++|++|+.++.+|.+.+ +-++|.. ..++ +...++.++..++.. T Consensus 239 Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e~~~~~~--- 315 (347) T protein:vir:15 239 IRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRAN--- 315 (347) T ss_pred EEEEeceEEEecccccccccccccccccccccccccccccceeeeccccceeeeeccceeeeeEeeceeeeecccch--- Confidence 46799999999999985321 0111111 1112 222333444444332 Q ss_pred hcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|NC_019933. 361 IKNMVTILAEERLALAVYRPESFIKGSLAAAAG 393 (394) Q Consensus 361 ~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~ 393 (394) +-...+++...+|.++++|++.+.++++..+- T Consensus 316 -~~~d~i~~~~~~G~~vlrP~~av~~~~~~~~~ 347 (347) T protein:vir:15 316 -YQADQIIAKYAMGHGGLRPEAAGAIVLPKVSE 347 (347) T ss_pred -hhhhhhehhhhcCCceeccccEEEEecCCCCC Confidence 22345677777899999999999999988888 No 146 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=99.15 E-value=3e-11 Score=78.27 Aligned_cols=288 Identities=13% Similarity=0.105 Sum_probs=164.5 Q ss_pred hhHHHHHHHhhcccccC-CcCc-----cccchhhhhHHHhhhhhhhhHHHhccccccc-cCceeEEEEcCcccccceecC Q lcl|NC_019933. 100 AEINIKAAITSLSTNAD-GSAG-----ATVQTTRLPGILELPQRRMTIRSLLAQGTME-GNTLEYVRETGFTNAAAPVAE 172 (394) Q Consensus 100 ~~~~~~~~~~~~~~~~~-~~~g-----~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~e 172 (394) +..--.+.+...+.++. ..+| .+.=+.+..++...+...+.++++++..++. |+++.+|+... ..+..... T Consensus 1 ~~~~~~~~~~~~n~~t~~~~~~~~~~~al~le~f~geV~~~f~~~si~~~~~~~rti~~Gksv~f~~iG~--~t~~~~t~ 78 (375) T protein:vir:10 1 MANANQVALGRSNLSTGTGYGGATDKYALYLKLFSGEMFKGFQHETIARDLVTKRTLKNGKSLQFIYTGR--MTSSFHTP 78 (375) T ss_pred CccccccccCccccCCccccccccchHHHHHHHHhHHHHHHHHHHHhhhccccccccccCceEEEEeeee--eEEeeecC Confidence 00000001111111111 1111 2334788899999999999999999988887 56888999744 34555666 Q ss_pred Ccccc---ccccceeeEEeeeeeEEEeehhhHHHHHH-----H-HHHHHHHHHHHHHHHHHHHHHHHhh----ccCCCcc Q lcl|NC_019933. 173 GAQKP---ESSLRFDLVQTSAKVIAHWMKASRQILSD-----S-AQLQSFINARLLRGLEVVEENQLLN----GNGTGQN 239 (394) Q Consensus 173 g~~~~---~~~~~~~~i~~~~~k~~~~~~is~e~l~~-----s-~~~~~~i~~~la~a~~~~~d~a~l~----g~g~~~~ 239 (394) |+.+. ..++...+.+|...++- +++.++.| + .++.+.+.++.+.++++..|+.++. +.....+ T Consensus 79 G~~i~~~~~~d~~~te~~l~ID~~~----y~~~~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~~p 154 (375) T protein:vir:10 79 GTPILGNADKAPPVAEKTIVMDDLL----ISSAFVYDLDETLAHYELRGEISKKIGYALAEKYDRLIFRSITRGARSASP 154 (375) T ss_pred CcCcCCccccCCCCCceEEEecchh----hhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccc Confidence 66653 22444444444444321 33333332 2 3799999999999999999988773 2111111 Q ss_pred --------ccccccccc-cccccccccccchHHHHHHHHHHhhhhcCCC--CeeEeCHHHHHHHHHhhccCC----cccc Q lcl|NC_019933. 240 --------LLGLLPQAT-AFAAPITVANATAVDRLRLALLQAQLAEFPA--TGIVLNPADWAGIELLKDTQG----RYIL 304 (394) Q Consensus 240 --------~~Gi~~~~~-~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~lkd~~G----~~~~ 304 (394) +.|.....+ ........+....++.|.++...+...+.+. -.++++|..|..|..-+|.+. .+.- T Consensus 155 ~~~~~~~~~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~d~~~~~n~d~~~ 234 (375) T protein:vir:10 155 VSATNFVEPGGTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEKGVSSQGRCAVLNPRQYYALIQDIGSNGLVNRDVQG 234 (375) T ss_pred cccccccccCcceeeeccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeChHHHHHHHhcCCccceeeecccc Confidence 111111111 1111222345567888999888888887763 367999999998876554321 1111 Q ss_pred cC-cccCCCceeecceEEEcCCCCcCce-------------------------------------EEeec---cc-eEEE Q lcl|NC_019933. 305 GN-PQGTLAPTLWGLPVVATQAMAVGQF-------------------------------------LTGAF---DA-GAQV 342 (394) Q Consensus 305 ~~-~~~~~~~~l~G~pv~~~~~~p~~~~-------------------------------------~~gd~---~~-~~~~ 342 (394) .. ..++....+.|++|+.++.+|...+ |-+|| +. ...+ T Consensus 235 ~~~~~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~~~~~~~ 314 (375) T protein:vir:10 235 SALQSGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAKSCGLI 314 (375) T ss_pred cceeccceEEEEeceEEEEeccccccccccccccccccccchhhhhccccccCCcceeeccccccccccccccCceEEEE Confidence 10 1122235789999999999984321 12233 11 1122 Q ss_pred Eee--------cceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 343 FDR--------WAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 343 ~~~--------~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) +.+ .++++++....+ ...+-...+.+..-+|..+.+|+|.+.|+..++++. T Consensus 315 ~~~~A~g~v~~~~~~~~~~~~~~-~~~~q~~~i~~~~a~G~~~lrp~~av~l~~~~~~~~ 373 (375) T protein:vir:10 315 FQKEAAGVVEAIGPQVQVTNGDV-SVIYQGDVILGRMAMGADYLNPAAAVELYIGATAPS 373 (375) T ss_pred Echhheeeeeeeccccccccchh-hheeeeeeeeeeeeeccCccCceeEEEEecCcCccc Confidence 222 333333322111 122234567788889999999999999999988887 No 147 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=99.13 E-value=1e-11 Score=80.83 Aligned_cols=261 Identities=11% Similarity=0.057 Sum_probs=150.1 Q ss_pred cccccCCcCccccchh---hhhHHHhhhhhhhhHHHhccccccc-cCceeEEEEcCcccccceecCCcccccccccee-- Q lcl|NC_019933. 111 LSTNADGSAGATVQTT---RLPGILELPQRRMTIRSLLAQGTME-GNTLEYVRETGFTNAAAPVAEGAQKPESSLRFD-- 184 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~---~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~-- 184 (394) |.....+..--+.++. +...|-..+..-..++...+..|+. |.++++|++... ..+.-|+||+.+|.++.+.. T Consensus 1 mAe~nlt~~~dL~~~~sidfv~~f~~~i~~L~~~Lgi~r~~p~a~G~tIt~pK~~~t-gda~dVaEGe~Iplskvt~~~~ 79 (295) T protein:vir:99 1 MAEKNLNTMADLGDIKSIDFVNKFSKNINDLLKLLGVTRRETLTNDLKIQTYKWEVT-LDQTDPGEGETIPLSKVTRTKD 79 (295) T ss_pred CCCcccccHhhccCceeehhhHHhhhhHHHHHHHhccccccccccCCeEEeeeeeee-cccccccCCcccchhhheeeee Confidence 2222222222233332 2222322223333444445778886 678999998754 56777999999999999876 Q ss_pred -eEEeeeeeEEEeehhhHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccccccch Q lcl|NC_019933. 185 -LVQTSAKVIAHWMKASRQILSDSA--QLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVANATA 261 (394) Q Consensus 185 -~i~~~~~k~~~~~~is~e~l~~s~--~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~ 261 (394) ..+++.+|++..+ |.|.++.+. +-...-.++|..+++.++|..||.-..++.. +. .+..-... T Consensus 80 ~t~t~kikK~rK~t--TdEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lktat~-----------t~-tg~~lq~a 145 (295) T protein:vir:99 80 KDYTVKWFKKRRAT--TAEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKTKPT-----------KV-KGVGLQKA 145 (295) T ss_pred eeeEEEeeeecccc--cHHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhccCce-----------ee-ehhhHHHH Confidence 4778888888754 999997664 6777888999999999999999974322110 00 01111224 Q ss_pred HHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccC--CcccccCcccCCCceeecce-EEEcCCCCcCceEEeeccc Q lcl|NC_019933. 262 VDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQ--GRYILGNPQGTLAPTLWGLP-VVATQAMAVGQFLTGAFDA 338 (394) Q Consensus 262 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~--G~~~~~~~~~~~~~~l~G~p-v~~~~~~p~~~~~~gd~~~ 338 (394) ++.+..........+..+.++++||...+.|++-..-+ ..-.|.. .---.++|.. |+++..+|+|+++.--..+ T Consensus 146 ~a~~~~al~~f~Ee~~~~~V~FVnP~D~a~yl~~A~~~~~~a~~fG~---~~L~nfLG~q~II~S~kv~~G~~~aT~~~N 222 (295) T protein:vir:99 146 LSASWAKLATFNEFEGSPLVSFVSPLDVANYLGDTKVGADASNVFGM---TLLKNFLGMQNVIVMPSVPEGKIYSTAVEN 222 (295) T ss_pred HHHhhhhhhhcccccCCceEEEEehHHHHHHHhccccccchhhhhhh---hhhhhhhccceEEEcccCCCceEEEeeccc Confidence 44444444555555556678999999988877543221 1111100 0001389997 9999999999877543222 Q ss_pred eEEE--EeecceEEEEecccchhhhcCcEEEEEEEE-------------ecc---EEecccceEEEEecCCCCC Q lcl|NC_019933. 339 GAQV--FDRWAARVEVATENQDDFIKNMVTILAEER-------------LAL---AVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 339 ~~~~--~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~-------------~d~---~v~~~~a~~~l~~~~a~~~ 394 (394) +.+ ....+..+. ....+-.|.+.+.+..+ +.+ -+...+++++.++.+++++ T Consensus 223 -i~~ay~~~~~g~l~----~~f~~~~D~tglIg~~h~~~~~~~t~et~~~~~~~lfpE~~dgiv~~tI~~~~~~ 291 (295) T protein:vir:99 223 -LVFASLNVKGGDLG----GLFADFTDETGLIAAARNRQLSNLTYESVFFGANVLFAEIPEGVVEATIEAAAVP 291 (295) T ss_pred -eEEEEecCCchhhh----hhhhhccCcccceEEEeccccceeeehhhhHhHHHhcccccceEEEEEEecCcCC Confidence 111 111110010 11112223333332221 122 2345688999999888888 No 148 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=99.08 E-value=3e-11 Score=78.31 Aligned_cols=285 Identities=12% Similarity=0.014 Sum_probs=156.9 Q ss_pred HHHHhhc-----ccccCCcCccccchhhhhHHHhhhhhhhhHHHhcccccc---ccCceeEEEEcCcccccceecCCccc Q lcl|NC_019933. 105 KAAITSL-----STNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTM---EGNTLEYVRETGFTNAAAPVAEGAQK 176 (394) Q Consensus 105 ~~~~~~~-----~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~eg~~~ 176 (394) .+.+... ..-+++..-..+|+.|..++++.+.+.+.+..++..... .|.++++|+... +.+..+.++.++ T Consensus 1 ~~~~~~~~~~~~~~~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~~~~~~~~GdTV~ip~~g~--~~a~d~~~g~~i 78 (381) T protein:vir:80 1 MATIQGTGGYKGSAVDLSNVQVFIPEVWSSEVRMFRDQKFAALEATKKIPFEGKKGDLIHIPNISR--AAVYDKQPQTPV 78 (381) T ss_pred CceecccccccCcccchhhHHhhhhHHHHHHHHHHHHHhhhhhhccccccceeecCceEEeeccCc--ceeeeecCCCcc Confidence 1111100 001122234577999999999999998888887765433 356888998643 456667888888 Q ss_pred cccccceeeEEeeeeeE-EEeehhhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCc---cc---ccc--ccc Q lcl|NC_019933. 177 PESSLRFDLVQTSAKVI-AHWMKASRQIL-SDSAQLQSFINARLLRGLEVVEENQLLNGNGTGQ---NL---LGL--LPQ 246 (394) Q Consensus 177 ~~~~~~~~~i~~~~~k~-~~~~~is~e~l-~~s~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~---~~---~Gi--~~~ 246 (394) +..+++...+++...+. ..-+.|++.-. +.+.++.+.+.+.++.++++.+|+.++....... .+ .+. ... T Consensus 79 ~~~~~~~~~~~itID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~~t~~~~i~~ 158 (381) T protein:vir:80 79 NLQARTDSEFTFTVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRIYSYDTTLGD 158 (381) T ss_pred cccccCCceEEEEEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccccc Confidence 87777777777777553 34467777544 3445899999999999999999998875321110 00 000 000 Q ss_pred cccccccccccccchHHHHHHHHHHhhhhcCCC--CeeEeCHHHHHHHHHhhccC-Cccccc-CcccCCCceeecceEEE Q lcl|NC_019933. 247 ATAFAAPITVANATAVDRLRLALLQAQLAEFPA--TGIVLNPADWAGIELLKDTQ-GRYILG-NPQGTLAPTLWGLPVVA 322 (394) Q Consensus 247 ~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~lkd~~-G~~~~~-~~~~~~~~~l~G~pv~~ 322 (394) ..............+++.|+++...+...+.+. -.++++|..+..|.+...-. -.+... ....+..++|.|++|+. T Consensus 159 ~~~~~~~t~~~~~~t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~~~~~~ad~~~~~~l~~G~Ig~i~G~~Vv~ 238 (381) T protein:vir:80 159 GTVNAHLTGTPAPLTYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLSINQFISVDFSQVKPVTSGVVGTILGMEVIV 238 (381) T ss_pred cccccccccchhhHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhhchhhhhhhhccchhhhceeeeEEcceEEEe Confidence 011111122234557889999988888877643 37899999999987543211 112111 12233456899999999 Q ss_pred cCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecc-cceEE-----EEecCCCCC Q lcl|NC_019933. 323 TQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRP-ESFIK-----GSLAAAAGT 394 (394) Q Consensus 323 ~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~-~a~~~-----l~~~~a~~~ 394 (394) ++.+|.+.+..-....++-...... +.-..+ ...|..+..+++....+|.++... ..+-. .+...-.+| T Consensus 239 Sn~lp~~~~t~~~~~agap~~~~~~--~~~~~~-~g~~s~~a~av~~~k~yd~~~~~~~~~~~~~~g~~~~~~~~~~~ 313 (381) T protein:vir:80 239 TTQIGINSLTGYVNGQGAPTQPTPG--VLGSPY-LPDQAGTANVVNTGSASDLAVSLSYFGLPVFSGAGATAADGGQT 313 (381) T ss_pred ecccccccccceeeecccccccccc--cccccc-ccccccceeeeeeeeeeceeeeeeeccceeeecceeeecCCCce Confidence 9999975432000000000000000 011111 112333445555555566655332 11111 111111111 No 149 >protein:vir:103759 Length: 330 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024928;genbank:gi:48697198;genbank:GeneID:2846083 Probab=99.06 E-value=5.5e-12 Score=82.29 Aligned_cols=226 Identities=13% Similarity=0.068 Sum_probs=149.0 Q ss_pred HhhcccccC--C-cCccccchhhhhHHHhhhhhhhhHHHhccccccccCc-eeEEEEcCcccccceecCCccccccccce Q lcl|NC_019933. 108 ITSLSTNAD--G-SAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNT-LEYVRETGFTNAAAPVAEGAQKPESSLRF 183 (394) Q Consensus 108 ~~~~~~~~~--~-~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~eg~~~~~~~~~~ 183 (394) ...+..... . .+..+-|......|+|.+.+.++|++.+++......+ ....+..+. |.+.|..=++.++.++.++ T Consensus 1 m~~~~~~a~TL~e~AKr~~~d~~~~~IIE~l~~tn~IL~~lpf~e~N~~tg~~t~vrt~L-P~~~fR~lN~g~~~s~~tt 79 (330) T protein:vir:10 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGL-PTPTWRKLYGGVLPNKSST 79 (330) T ss_pred CCcCCCCcccHHHHHhhcCcchhHHHHHHHHhcCchHHhhcchhhccCCcccceeEEeec-CCchhhhcCCccccccceE Confidence 111111111 1 1233456667789999999999999999987643332 223333443 6788999999999999999 Q ss_pred eeEEeeeeeEEEeehhhHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhhccCCCc--ccccccccc----------- Q lcl|NC_019933. 184 DLVQTSAKVIAHWMKASRQILSDSA---QLQSFINARLLRGLEVVEENQLLNGNGTGQ--NLLGLLPQA----------- 247 (394) Q Consensus 184 ~~i~~~~~k~~~~~~is~e~l~~s~---~~~~~i~~~la~a~~~~~d~a~l~g~g~~~--~~~Gi~~~~----------- 247 (394) .+++-..+-+++.+.|.+.+.+... ++.........+++.+.+...+|+|+...+ .+.|+.+.- T Consensus 80 ~qvt~~l~ilgg~~eVDr~la~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGD~a~~p~~F~GL~kR~~~~ta~~~~qv 159 (330) T protein:vir:10 80 AQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDNV 159 (330) T ss_pred EEEEEEeEEecchhhhhhHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCChhhccchhhhcCCCCCCchhhe Confidence 9999999999999999999987543 466667778999999999999999975432 233331100 Q ss_pred -----------------------------c-------------------cc----------------------------- Q lcl|NC_019933. 248 -----------------------------T-------------------AF----------------------------- 250 (394) Q Consensus 248 -----------------------------~-------------------~~----------------------------- 250 (394) + .. T Consensus 160 IdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~~~dg~gg~y~~~~~~~~w~~Gl~i~d~r~vvRI~ 239 (330) T protein:vir:10 160 IDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVC 239 (330) T ss_pred eeccccccCceEEEEEEEcCCeEEEEcccCccccceeeeccceeeecccCCCCceeEEeeeeeeeeeeEEeCcccEEEEe Confidence 0 00 Q ss_pred ccc-----ccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHh-hccCCccc-ccCcccCCCceeecceEEEc Q lcl|NC_019933. 251 AAP-----ITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELL-KDTQGRYI-LGNPQGTLAPTLWGLPVVAT 323 (394) Q Consensus 251 ~~~-----~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~l-kd~~G~~~-~~~~~~~~~~~l~G~pv~~~ 323 (394) .++ ........++.++.+...++.......+|+||++....|++. .+.+...+ .....+.....+.|+||..+ T Consensus 240 NIdvs~l~~~~~~~~li~lm~~A~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~k~n~~l~~~~~~g~~~t~~~gipir~~ 319 (330) T protein:vir:10 240 NIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQRT 319 (330) T ss_pred ecccccCCCCccHHHHHHHHHHHHHhccCCCCCcceeeechHHHHHHHHHHhhcccceeeeeecCCeeeEEECCeEEEEE Confidence 000 001112244566666677777777888999999999999875 34433222 22222222346899999999 Q ss_pred CCCCcCceEEe Q lcl|NC_019933. 324 QAMAVGQFLTG 334 (394) Q Consensus 324 ~~~p~~~~~~g 334 (394) +.+-.++..+. T Consensus 320 Dail~tE~~vv 330 (330) T protein:vir:10 320 DALLNTESRVV 330 (330) T ss_pred eeeecCccccC Confidence 98765543333 No 150 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=99.01 E-value=7.9e-11 Score=75.96 Aligned_cols=265 Identities=11% Similarity=0.033 Sum_probs=146.4 Q ss_pred HHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccc-cCceeEEEEc--CcccccceecCCccccccccce Q lcl|NC_019933. 107 AITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTME-GNTLEYVRET--GFTNAAAPVAEGAQKPESSLRF 183 (394) Q Consensus 107 ~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~--~~~~~~~~~~eg~~~~~~~~~~ 183 (394) +....+.....+-+...--.+.+.|-..+..-.-++...+..|+. |..++++++. .....+..|+||+.||.++++. T Consensus 1 M~~e~nl~~~~dL~~a~siDF~~~f~~~i~~L~~~LGv~r~~pla~Gt~iktyK~~~~~y~gda~dVaEGe~Iplskvt~ 80 (303) T protein:vir:10 1 MSAENNLINVEALGKAKSIDFANKLGVGLNKLFEALAIQNKIPMNVGSALKQYRFKVEDSEKPNGDVAEGDVIPLTKVTR 80 (303) T ss_pred CCCCcCCcchhhcccceeehhhhhhhhhHHHHHHHhhhhccccccCCceeeeeeeeceeeccccccccCCcccchhhhee Confidence 111111111122222333344555544444444555556788886 4456555543 2235567899999999999886 Q ss_pred e---eEEeeeeeEEEeehhhHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccccc Q lcl|NC_019933. 184 D---LVQTSAKVIAHWMKASRQILSDSA--QLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVAN 258 (394) Q Consensus 184 ~---~i~~~~~k~~~~~~is~e~l~~s~--~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~ 258 (394) . ..++..+|++..+ |.|.++.+. +....--++|..+++.++|+.||.-..++. .+...+... T Consensus 81 ~~~~t~~~~~kK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~lktaT-----------~t~~~t~~t 147 (303) T protein:vir:10 81 EQVDITELQFAKYRKST--SAEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFETLKSAI-----------ENGKRTNKT 147 (303) T ss_pred eecceEEEEeecccccc--cHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhhcc-----------cccccccce Confidence 4 5788899988865 999997663 677778899999999999999996432211 011111223 Q ss_pred cchHHHHHHHHHHhh------hhcCCCCeeEeCHHHHHHHHHhhcc-CCcccccCcccCCCceeecceEEEcCCCCcCce Q lcl|NC_019933. 259 ATAVDRLRLALLQAQ------LAEFPATGIVLNPADWAGIELLKDT-QGRYILGNPQGTLAPTLWGLPVVATQAMAVGQF 331 (394) Q Consensus 259 ~~~~~~i~~~~~~~~------~~~~~~~~~~~~~~~~~~l~~lkd~-~G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~ 331 (394) ..+.+.|..++.... ..+..+.+++|||.+.+.++.-..- ...--|... ---.++|..|+++..+|.|++ T Consensus 148 ~~s~~glq~Al~~~~~kl~~~~ed~~~~V~FvNP~Daa~yl~~A~i~~~~t~fG~n---~L~nfLG~~II~S~kv~~G~~ 224 (303) T protein:vir:10 148 KLSAENLQGALSKGRANLSVLLDDEITPIAFVNPNDTAEYLANGFINSTGAQFGVN---LLTPYVGVKIVEFADVPQGEV 224 (303) T ss_pred eecHHHHHHHHHhhhhhccccccccccEEEEEchHHHHHHhhcCCcchhhhhhhhh---hhhhhhcceEEEeccCCCceE Confidence 344555555544331 2223345899999998887642211 111111100 001389999999999999987 Q ss_pred EEeeccceEEEE---eecceEEEEecccchhhhcCcEEEEEEE-------------Eecc---EEecccceEEEEecCCC Q lcl|NC_019933. 332 LTGAFDAGAQVF---DRWAARVEVATENQDDFIKNMVTILAEE-------------RLAL---AVYRPESFIKGSLAAAA 392 (394) Q Consensus 332 ~~gd~~~~~~~~---~~~~~~i~~~~~~~~~~~~~~~~~~~~~-------------~~d~---~v~~~~a~~~l~~~~a~ 392 (394) +.--..+ +.+. .+.++. ....+..|.+.+.+.. .+.+ -+.+.+++++.++++.- T Consensus 225 ~~T~~~N-i~~ay~~~~g~l~------~~f~~t~D~tglIGv~h~~~~~~~t~eT~~~~~~~lfpE~~dgiv~~ti~~~e 297 (303) T protein:vir:10 225 WMTVAEN-LNVAYANPRGELS------RAFAFATDATGFVGVLHDIQPQRLTSDTIYASAISMFPENIDAVIKVTIKKDE 297 (303) T ss_pred EEeeccc-eEEEEecCchhhh------hhhhhccccccceEEEeccccceeeehhHhHhHHHhcccccceEEEEEEeccc Confidence 7543322 1111 111111 0111222233333221 1122 23446889999997777 Q ss_pred CC Q lcl|NC_019933. 393 GT 394 (394) Q Consensus 393 ~~ 394 (394) .+ T Consensus 298 ~~ 299 (303) T protein:vir:10 298 AG 299 (303) T ss_pred cC Confidence 65 No 151 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=98.99 E-value=2.8e-11 Score=78.42 Aligned_cols=240 Identities=15% Similarity=0.119 Sum_probs=134.8 Q ss_pred hccccccccCceeEEEEcCcccccceecCCccccc--cccceeeEEeeeeeEEEeehhhHHHHHH-----H-HHHHHHHH Q lcl|NC_019933. 144 LLAQGTMEGNTLEYVRETGFTNAAAPVAEGAQKPE--SSLRFDLVQTSAKVIAHWMKASRQILSD-----S-AQLQSFIN 215 (394) Q Consensus 144 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~--~~~~~~~i~~~~~k~~~~~~is~e~l~~-----s-~~~~~~i~ 215 (394) +++.+. +|+++.+|+... ..+.+...|+.+.. .++.-.+.+|...++- +++.++.| + -++.+... T Consensus 1 ~vr~i~-~g~s~~~~~iG~--~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l----~~~~~VdDiD~~qa~~Dlr~e~s 73 (324) T protein:vir:99 1 MTRTIT-SGKSAQFPVMGR--TKARYLKQGQSLDDGREDIKHTEKVITIDGLL----TTDVLIYDIEDAMNHYDVRSEYS 73 (324) T ss_pred Ceeeee-cCceEEEeeeee--eEeccccCCCCcCCCcCCcCcccEEEEecchh----hhhhhhhhHHHHhcCccchhHHH Confidence 333322 367899999743 45666777777643 3344445444433322 33333322 2 37999999 Q ss_pred HHHHHHHHHHHHHHHhhcc------C---CCccccc---cccc-cccccccccccccchHHHHHHHHHHhhhhcCCCC-- Q lcl|NC_019933. 216 ARLLRGLEVVEENQLLNGN------G---TGQNLLG---LLPQ-ATAFAAPITVANATAVDRLRLALLQAQLAEFPAT-- 280 (394) Q Consensus 216 ~~la~a~~~~~d~a~l~g~------g---~~~~~~G---i~~~-~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~-- 280 (394) ++++.++++..|+.++.-- . ...+..| .... ...............++.|.++...+...+.+.. T Consensus 74 ~~~G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~Lde~~VP~~gR 153 (324) T protein:vir:99 74 TQMGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAFAKKYIPAGDR 153 (324) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCC Confidence 9999999999998876320 0 0111111 0000 0111111112233457888888888888877533 Q ss_pred eeEeCHHHHHHHHHhhc-cCCccccc-CcccCCCceeecceEEEcCCCCcCce-------------------------EE Q lcl|NC_019933. 281 GIVLNPADWAGIELLKD-TQGRYILG-NPQGTLAPTLWGLPVVATQAMAVGQF-------------------------LT 333 (394) Q Consensus 281 ~~~~~~~~~~~l~~lkd-~~G~~~~~-~~~~~~~~~l~G~pv~~~~~~p~~~~-------------------------~~ 333 (394) .++++|..+..|..-+. ..+.+.-. ....+..+.+.|++|+.++++|...+ +- T Consensus 154 ~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ky~ 233 (324) T protein:vir:99 154 TFYTDPDTYSAILAALMPNAANYAALIDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTGKMT 233 (324) T ss_pred EEEeChHHHHHHhhcccccccccccccceecceEEEEeceEEEecCCccccccccccccccccccccccccccccccccc Confidence 67999999987764332 22333221 12233446789999999999986321 22 Q ss_pred eeccceE-EEEeecc--------eEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 334 GAFDAGA-QVFDRWA--------ARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 334 gd~~~~~-~~~~~~~--------~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) +|++... .++.+.. +..+..++.. +-...+++..-+|.++++|++.+.+++++.+-+ T Consensus 234 ~d~~~~~gl~~~~~a~~tv~~~~~~~e~~~~~~----~~~d~i~~~~a~G~~~lRPe~a~~v~l~~~~~~ 299 (324) T protein:vir:99 234 VGADNVVGLFVHRSAVATLKLKDMALERARRPE----YQADQIIAKYAMGHGGLRPEAVGAIIFEDGETP 299 (324) T ss_pred cccCceeEEEEehhheEEEeeecceecceechh----hHHHhhhhhhhhcCcccccceEEEEEEccCccc Confidence 3333221 2222222 2344433322 223456777788999999999998887666532 No 152 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=98.99 E-value=3.8e-11 Score=77.72 Aligned_cols=280 Identities=12% Similarity=0.063 Sum_probs=161.5 Q ss_pred cccc-cCCcCcccc-chhhhhHHHhhhhhhhhHHHhccccc-cccCceeEEEEcCcccccceecCCccccccccceeeEE Q lcl|NC_019933. 111 LSTN-ADGSAGATV-QTTRLPGILELPQRRMTIRSLLAQGT-MEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQ 187 (394) Q Consensus 111 ~~~~-~~~~~g~~i-p~~~~~~ii~~~~~~~~l~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~ 187 (394) +.++ .++++-..| |+.|+..|+.-+.+......+.+... -.|++++||+... +...-..+++++.--.++-.+++ T Consensus 1 ~~~~n~ts~~qafi~~EiWsa~il~~l~~~Lv~~~~~~~~d~g~GDtV~InsIg~--~tV~dY~~~~~i~~d~ltt~~~~ 78 (322) T protein:vir:31 1 MSTGNNTSNTQALIVSEIWADEIEDILHEKLLDVNIARVVDFPDGDKLTIPSVGT--PVVRSRPEQGDFTFDNLDTGEIS 78 (322) T ss_pred CCCCCCcccceEEeehhhhHHHHHHHhhhhhhhhhhhcccccCCCCeEEeccccc--cccccccCCCCcccccCCCceEE Confidence 3333 344454555 99999999988887776666555444 2478899998754 34444444555544444444444 Q ss_pred --eeeeeEEEeehhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---h-ccCCC---ccccccccccccccccccccc Q lcl|NC_019933. 188 --TSAKVIAHWMKASRQILSDSAQLQSFINARLLRGLEVVEENQLL---N-GNGTG---QNLLGLLPQATAFAAPITVAN 258 (394) Q Consensus 188 --~~~~k~~~~~~is~e~l~~s~~~~~~i~~~la~a~~~~~d~a~l---~-g~g~~---~~~~Gi~~~~~~~~~~~~~~~ 258 (394) +...|+-++ .|++...+.+.++.+...++.+.+++..+|..+. . |.... +.|.- .+.....-+..+... T Consensus 79 l~IDq~KYfaf-~VdDD~~Qa~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~~p~v-in~~~~~iv~~gt~~ 156 (322) T protein:vir:31 79 IILRDEVYAGN-AISKKLRQDSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQNDPNV-INGVPHRFVGTGTDQ 156 (322) T ss_pred EEEehhhhhcc-ccchhHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCcce-ecCCccceeccCCCc Confidence 444445544 4888777777899999999999999999997653 2 11000 00100 000001112233344 Q ss_pred cchHHHHHHHHHHhhhhcCCCC--eeEeCHHHHHHHHHhh-----ccCCccc--ccCcccCC---CceeecceEEEcCCC Q lcl|NC_019933. 259 ATAVDRLRLALLQAQLAEFPAT--GIVLNPADWAGIELLK-----DTQGRYI--LGNPQGTL---APTLWGLPVVATQAM 326 (394) Q Consensus 259 ~~~~~~i~~~~~~~~~~~~~~~--~~~~~~~~~~~l~~lk-----d~~G~~~--~~~~~~~~---~~~l~G~pv~~~~~~ 326 (394) ...|+.++++..++...+.+.. .+|++|..+..|..+. -.++++. ...+...+ .+.++|+.|+.|+.+ T Consensus 157 ~~ay~~lv~l~~kLdkanVP~~gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG~a~g~~~Vg~~~GF~V~~SN~l 236 (322) T protein:vir:31 157 TMDVTDFSRVNYVMTQSKMPMGGMIGIIDPSVAHHLETITNISNISNNPRWEGIVESGIAPDMQFVRSVYGIDLFVSNLL 236 (322) T ss_pred hhhHHHHHHHHHHhccccCCCCCeEEEeCchhhhhhhhhhhhhhhhccccccccccccchhhHHHHHHHhceeeeeeccc Confidence 5688999999999998888643 4678899877774421 1233332 12222222 368999999999998 Q ss_pred CcCc--eEEee---------ccceEEEEeecceEE-----EE-ecccchhhhcCcEEEEEEEEeccEEecccceEEEEec Q lcl|NC_019933. 327 AVGQ--FLTGA---------FDAGAQVFDRWAARV-----EV-ATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLA 389 (394) Q Consensus 327 p~~~--~~~gd---------~~~~~~~~~~~~~~i-----~~-~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~ 389 (394) +.+. ++.|. ++.+..+.+..-..+ ++ ..+...+-.+..-.+|+..++|.++.+|+.++.|.-. T Consensus 237 ~~~~~~i~aG~d~~~t~ag~~n~f~~~~~~~~~~~~~~~~~l~~~e~~r~~~~~~d~~~~~~~~g~g~~r~e~l~~~~a~ 316 (322) T protein:vir:31 237 ADANETINAGGDARSTTAGKCNMFMNVSDMGLLPFVVAWKEMPTTKSFIDDYNDDLNTATTARWGNGLVRDENLVCVLAN 316 (322) T ss_pred cccccccccCcccccccceeecccccccchhhhhhhhHhhhhhhhhcccCccccccceeeeeeecceeecccceEEEEec Confidence 7542 22221 111111111000000 00 0111112223345688999999999999999998766 Q ss_pred CCCCC Q lcl|NC_019933. 390 AAAGT 394 (394) Q Consensus 390 ~a~~~ 394 (394) ++--| T Consensus 317 ~~~~~ 321 (322) T protein:vir:31 317 ADKVT 321 (322) T ss_pred ccccc Confidence 66666 No 153 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=98.95 E-value=3.5e-11 Score=77.93 Aligned_cols=287 Identities=12% Similarity=0.038 Sum_probs=158.0 Q ss_pred HHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccc-cCceeEEEEcCcccccceecCCccccccc Q lcl|NC_019933. 102 INIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTME-GNTLEYVRETGFTNAAAPVAEGAQKPESS 180 (394) Q Consensus 102 ~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~eg~~~~~~~ 180 (394) +-..+..+....+...+--.+.=+.+..++.+.....+.++++.++.++. |+++.+|+... ..+.+...|+...-+. T Consensus 1 Ms~~n~~t~~~~~~s~~~~al~le~f~geV~taF~~~si~~~~~~vrti~~GkS~qf~~iG~--~~a~y~~~G~~ldg~~ 78 (402) T protein:vir:97 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGE--TELQVLAPGQSPNATP 78 (402) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEEEEEee--eEEeeeccccccCCCC Confidence 00000000000111001112233788899999999999999999998886 46899999743 3456666676666666 Q ss_pred cceeeEEeeeeeEEEeehhhHHHHHH------HHH-HHHHHHHHHHHHHHHHHHHHHhh-----c----cCCCccccccc Q lcl|NC_019933. 181 LRFDLVQTSAKVIAHWMKASRQILSD------SAQ-LQSFINARLLRGLEVVEENQLLN-----G----NGTGQNLLGLL 244 (394) Q Consensus 181 ~~~~~i~~~~~k~~~~~~is~e~l~~------s~~-~~~~i~~~la~a~~~~~d~a~l~-----g----~g~~~~~~Gi~ 244 (394) +..++..|....+- +++.++.+ +-+ +.+.+.++++.++++..|+.++. + .+.+..+.+.- T Consensus 79 ~~~~k~~ItID~lL----~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~~~~~ 154 (402) T protein:vir:97 79 TQADKNQLVIDTTV----IARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKG 154 (402) T ss_pred cccccEEEEeCcee----echhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCcccc Confidence 77777777776544 34443321 224 67888899999999999997753 1 11111222221 Q ss_pred ccccccccc----ccccccchHHHHHHHHHHhhhhcCCCC--eeEeCHHHHHHHHHhhccCC-ccccc---CcccCCCce Q lcl|NC_019933. 245 PQATAFAAP----ITVANATAVDRLRLALLQAQLAEFPAT--GIVLNPADWAGIELLKDTQG-RYILG---NPQGTLAPT 314 (394) Q Consensus 245 ~~~~~~~~~----~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~~~~~~~~~~l~~lkd~~G-~~~~~---~~~~~~~~~ 314 (394) .......+. ....+....+.+.++...+...+.+.+ +++++|..|..|.+-.+--. .|... ....+.... T Consensus 155 ~g~s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~dRv~vv~P~~y~~Ll~~~rl~n~d~~~~~~g~~~~G~v~~ 234 (402) T protein:vir:97 155 HGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLS 234 (402) T ss_pred cccccccccccchhhcCHHHHHHHHHHHHHHHHhcCCCccccEEEeChHHHHHHhhcccccchhhccccCCccccceeEE Confidence 111111111 112333445666777777777666544 78999999998876422111 11111 112333467 Q ss_pred eecceEEEcCCCCcCc---------------e--EEeeccce-EEEEeecce-EEEEec---ccchhhhcCcEEEEEEEE Q lcl|NC_019933. 315 LWGLPVVATQAMAVGQ---------------F--LTGAFDAG-AQVFDRWAA-RVEVAT---ENQDDFIKNMVTILAEER 372 (394) Q Consensus 315 l~G~pv~~~~~~p~~~---------------~--~~gd~~~~-~~~~~~~~~-~i~~~~---~~~~~~~~~~~~~~~~~~ 372 (394) +.|+||+.++++|..- . +-+|+... ..+|.+..+ +++.-+ +...+-.+-...+-+.+- T Consensus 235 v~Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~vT~~~~~d~r~~~~~id~~~a 314 (402) T protein:vir:97 235 SYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMA 314 (402) T ss_pred EeceEEEecCccccccccccccccccCCCCccCCcCcccceeEEEEEecceEEEEEeeccccchhhchhHHHHHHHHHHH Confidence 9999999999998521 1 12455432 233333322 222211 111111111223446677 Q ss_pred eccEEecccceEEEEecCCCCC Q lcl|NC_019933. 373 LALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 373 ~d~~v~~~~a~~~l~~~~a~~~ 394 (394) ++..+++|+|..++..+-..-+ T Consensus 315 ~G~g~~RPeaa~vv~~~~~~t~ 336 (402) T protein:vir:97 315 EGAIPDRWEAVSVVTTKRDATT 336 (402) T ss_pred hCCcccCccceEEEEEeccccc Confidence 8999999999999976653333 No 154 >protein:vir:98525 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996579;genbank:gi:45569510;genbank:GeneID:2767853 Probab=98.92 E-value=6.7e-11 Score=76.36 Aligned_cols=226 Identities=13% Similarity=0.112 Sum_probs=146.4 Q ss_pred HhhcccccCC---cCccccch-hhhhHHHhhhhhhhhHHHhccccccccCc-eeEEEEcCcccccceecCCccccccccc Q lcl|NC_019933. 108 ITSLSTNADG---SAGATVQT-TRLPGILELPQRRMTIRSLLAQGTMEGNT-LEYVRETGFTNAAAPVAEGAQKPESSLR 182 (394) Q Consensus 108 ~~~~~~~~~~---~~g~~ip~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~eg~~~~~~~~~ 182 (394) ......+..+ .+..+-|. .+...|+|.+.+.++|++.+++......+ +...+.++. +.+.|..=++.++.++.+ T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~L-P~~~fR~lN~g~~~s~~t 79 (331) T protein:vir:98 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGL-PTGTWRKLNYGVQPEKSR 79 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEecc-CCchhhccCCccCcccce Confidence 1111011110 11112133 34567999999999999999998765443 344555554 688999999999999999 Q ss_pred eeeEEeeeeeEEEeehhhHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhhccCCCc--cccccccc----------- Q lcl|NC_019933. 183 FDLVQTSAKVIAHWMKASRQILSDSA---QLQSFINARLLRGLEVVEENQLLNGNGTGQ--NLLGLLPQ----------- 246 (394) Q Consensus 183 ~~~i~~~~~k~~~~~~is~e~l~~s~---~~~~~i~~~la~a~~~~~d~a~l~g~g~~~--~~~Gi~~~----------- 246 (394) +.+++-..+-+++.+.|.+.+.+... ++...-...+.+++.+.+...||+|+...+ .+.|+.+. T Consensus 80 t~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~~~q 159 (331) T protein:vir:98 80 TVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQN 159 (331) T ss_pred eEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhccccccccccc Confidence 99999999999999999999887653 456666777899999999999999874422 12222110 Q ss_pred ---c---------------------------------------------------------------------------c Q lcl|NC_019933. 247 ---A---------------------------------------------------------------------------T 248 (394) Q Consensus 247 ---~---------------------------------------------------------------------------~ 248 (394) + + T Consensus 160 ~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~N 239 (331) T protein:vir:98 160 IIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIAN 239 (331) T ss_pred eeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEec Confidence 0 0 Q ss_pred ccccc---ccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhh-ccCCcc-cccCcc-cCCCceeecceEEE Q lcl|NC_019933. 249 AFAAP---ITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLK-DTQGRY-ILGNPQ-GTLAPTLWGLPVVA 322 (394) Q Consensus 249 ~~~~~---~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lk-d~~G~~-~~~~~~-~~~~~~l~G~pv~~ 322 (394) ..... .+.++...++.++.+...++.....+.+|+||.+....|++.. +..... +-.... +.....+.|+||.. T Consensus 240 Idvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipir~ 319 (331) T protein:vir:98 240 VDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRR 319 (331) T ss_pred cchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCeeEEE Confidence 00000 0112233455666777777777777789999999999998763 332212 222222 23334689999999 Q ss_pred cCCCCcCceEEe Q lcl|NC_019933. 323 TQAMAVGQFLTG 334 (394) Q Consensus 323 ~~~~p~~~~~~g 334 (394) ++.+-.++..+. T Consensus 320 ~dai~~tE~~Vv 331 (331) T protein:vir:98 320 TDALLLTEARVV 331 (331) T ss_pred eeeeecCccccC Confidence 998765543333 No 155 >protein:vir:107826 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996627;genbank:gi:45580761;genbank:GeneID:2767902 Probab=98.92 E-value=6.7e-11 Score=76.36 Aligned_cols=226 Identities=13% Similarity=0.112 Sum_probs=146.4 Q ss_pred HhhcccccCC---cCccccch-hhhhHHHhhhhhhhhHHHhccccccccCc-eeEEEEcCcccccceecCCccccccccc Q lcl|NC_019933. 108 ITSLSTNADG---SAGATVQT-TRLPGILELPQRRMTIRSLLAQGTMEGNT-LEYVRETGFTNAAAPVAEGAQKPESSLR 182 (394) Q Consensus 108 ~~~~~~~~~~---~~g~~ip~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~eg~~~~~~~~~ 182 (394) ......+..+ .+..+-|. .+...|+|.+.+.++|++.+++......+ +...+.++. +.+.|..=++.++.++.+ T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~L-P~~~fR~lN~g~~~s~~t 79 (331) T protein:vir:10 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGL-PTGTWRKLNYGVQPEKSR 79 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEecc-CCchhhccCCccCcccce Confidence 1111011110 11112133 34567999999999999999998765443 344555554 688999999999999999 Q ss_pred eeeEEeeeeeEEEeehhhHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhhccCCCc--cccccccc----------- Q lcl|NC_019933. 183 FDLVQTSAKVIAHWMKASRQILSDSA---QLQSFINARLLRGLEVVEENQLLNGNGTGQ--NLLGLLPQ----------- 246 (394) Q Consensus 183 ~~~i~~~~~k~~~~~~is~e~l~~s~---~~~~~i~~~la~a~~~~~d~a~l~g~g~~~--~~~Gi~~~----------- 246 (394) +.+++-..+-+++.+.|.+.+.+... ++...-...+.+++.+.+...||+|+...+ .+.|+.+. T Consensus 80 t~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~~~q 159 (331) T protein:vir:10 80 TVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQN 159 (331) T ss_pred eEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhccccccccccc Confidence 99999999999999999999887653 456666777899999999999999874422 12222110 Q ss_pred ---c---------------------------------------------------------------------------c Q lcl|NC_019933. 247 ---A---------------------------------------------------------------------------T 248 (394) Q Consensus 247 ---~---------------------------------------------------------------------------~ 248 (394) + + T Consensus 160 ~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~N 239 (331) T protein:vir:10 160 IIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIAN 239 (331) T ss_pred eeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEec Confidence 0 0 Q ss_pred ccccc---ccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhh-ccCCcc-cccCcc-cCCCceeecceEEE Q lcl|NC_019933. 249 AFAAP---ITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLK-DTQGRY-ILGNPQ-GTLAPTLWGLPVVA 322 (394) Q Consensus 249 ~~~~~---~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lk-d~~G~~-~~~~~~-~~~~~~l~G~pv~~ 322 (394) ..... .+.++...++.++.+...++.....+.+|+||.+....|++.. +..... +-.... +.....+.|+||.. T Consensus 240 Idvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipir~ 319 (331) T protein:vir:10 240 VDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRR 319 (331) T ss_pred cchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCeeEEE Confidence 00000 0112233455666777777777777789999999999998763 332212 222222 23334689999999 Q ss_pred cCCCCcCceEEe Q lcl|NC_019933. 323 TQAMAVGQFLTG 334 (394) Q Consensus 323 ~~~~p~~~~~~g 334 (394) ++.+-.++..+. T Consensus 320 ~dai~~tE~~Vv 331 (331) T protein:vir:10 320 TDALLLTEARVV 331 (331) T ss_pred eeeeecCccccC Confidence 998765543333 No 156 >protein:vir:107388 Length: 331 # NCBI annotation: Bbp17 # Family: family:all:1903 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958686;genbank:gi:41179378;genbank:GeneID:2717182 Probab=98.92 E-value=6.7e-11 Score=76.36 Aligned_cols=226 Identities=13% Similarity=0.112 Sum_probs=146.4 Q ss_pred HhhcccccCC---cCccccch-hhhhHHHhhhhhhhhHHHhccccccccCc-eeEEEEcCcccccceecCCccccccccc Q lcl|NC_019933. 108 ITSLSTNADG---SAGATVQT-TRLPGILELPQRRMTIRSLLAQGTMEGNT-LEYVRETGFTNAAAPVAEGAQKPESSLR 182 (394) Q Consensus 108 ~~~~~~~~~~---~~g~~ip~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~eg~~~~~~~~~ 182 (394) ......+..+ .+..+-|. .+...|+|.+.+.++|++.+++......+ +...+.++. +.+.|..=++.++.++.+ T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~L-P~~~fR~lN~g~~~s~~t 79 (331) T protein:vir:10 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGL-PTGTWRKLNYGVQPEKSR 79 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEecc-CCchhhccCCccCcccce Confidence 1111011110 11112133 34567999999999999999998765443 344555554 688999999999999999 Q ss_pred eeeEEeeeeeEEEeehhhHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhhccCCCc--cccccccc----------- Q lcl|NC_019933. 183 FDLVQTSAKVIAHWMKASRQILSDSA---QLQSFINARLLRGLEVVEENQLLNGNGTGQ--NLLGLLPQ----------- 246 (394) Q Consensus 183 ~~~i~~~~~k~~~~~~is~e~l~~s~---~~~~~i~~~la~a~~~~~d~a~l~g~g~~~--~~~Gi~~~----------- 246 (394) +.+++-..+-+++.+.|.+.+.+... ++...-...+.+++.+.+...||+|+...+ .+.|+.+. T Consensus 80 t~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~~~q 159 (331) T protein:vir:10 80 TVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQN 159 (331) T ss_pred eEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhccccccccccc Confidence 99999999999999999999887653 456666777899999999999999874422 12222110 Q ss_pred ---c---------------------------------------------------------------------------c Q lcl|NC_019933. 247 ---A---------------------------------------------------------------------------T 248 (394) Q Consensus 247 ---~---------------------------------------------------------------------------~ 248 (394) + + T Consensus 160 ~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~N 239 (331) T protein:vir:10 160 IIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIAN 239 (331) T ss_pred eeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEec Confidence 0 0 Q ss_pred ccccc---ccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhh-ccCCcc-cccCcc-cCCCceeecceEEE Q lcl|NC_019933. 249 AFAAP---ITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLK-DTQGRY-ILGNPQ-GTLAPTLWGLPVVA 322 (394) Q Consensus 249 ~~~~~---~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lk-d~~G~~-~~~~~~-~~~~~~l~G~pv~~ 322 (394) ..... .+.++...++.++.+...++.....+.+|+||.+....|++.. +..... +-.... +.....+.|+||.. T Consensus 240 Idvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipir~ 319 (331) T protein:vir:10 240 VDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRR 319 (331) T ss_pred cchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCeeEEE Confidence 00000 0112233455666777777777777789999999999998763 332212 222222 23334689999999 Q ss_pred cCCCCcCceEEe Q lcl|NC_019933. 323 TQAMAVGQFLTG 334 (394) Q Consensus 323 ~~~~p~~~~~~g 334 (394) ++.+-.++..+. T Consensus 320 ~dai~~tE~~Vv 331 (331) T protein:vir:10 320 TDALLLTEARVV 331 (331) T ss_pred eeeeecCccccC Confidence 998765543333 No 157 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=98.88 E-value=2.6e-10 Score=73.17 Aligned_cols=265 Identities=12% Similarity=0.044 Sum_probs=140.6 Q ss_pred HHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccc-cCcee-EEEEcCcccccceecCCcccccc Q lcl|NC_019933. 102 INIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTME-GNTLE-YVRETGFTNAAAPVAEGAQKPES 179 (394) Q Consensus 102 ~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~-~~~~~~~~~~~~~~~eg~~~~~~ 179 (394) +.-.......+.....+-+...--.+.+.|-..+..-..++...+..||. |.+++ +|.+.. ...+..|+||+.+|.+ T Consensus 1 ~~~~~~~~e~nlt~~~dl~~~~siDf~~~f~~~i~~L~~~LGv~r~~pla~GstIkt~k~~~y-~gda~dVaEGe~Ipls 79 (296) T protein:vir:98 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDV-TLAEGNVPEGEVIPLS 79 (296) T ss_pred CCCccccCcCCCcchhhhhhhhhhhhHHHHhhhHHHHHHHhhhcccccccCCCEEeeccceee-eeccccccCCcccchh Confidence 00000000000111111222222334444444344434445555788887 45674 454554 3567789999999999 Q ss_pred ccceee---EEeeeeeEEEeehhhHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccc Q lcl|NC_019933. 180 SLRFDL---VQTSAKVIAHWMKASRQILSDSA--QLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPI 254 (394) Q Consensus 180 ~~~~~~---i~~~~~k~~~~~~is~e~l~~s~--~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~ 254 (394) +.+... .++..+|++..+ |.|.++.+. +....-.++|...++.++|+.||.-..++.. +. T Consensus 80 kvt~~~~~t~t~~ikK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~LktaT~-------------t~ 144 (296) T protein:vir:98 80 KVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTG-------------TQ 144 (296) T ss_pred hheeeecceEEEEeecccccc--CHHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHHhcccc-------------ee Confidence 998764 778888888885 999997664 6777788999999999999999964322210 00 Q ss_pred cccccchHHHH----HHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcccCC-CceeecceEEEcCCCCcC Q lcl|NC_019933. 255 TVANATAVDRL----RLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQGTL-APTLWGLPVVATQAMAVG 329 (394) Q Consensus 255 ~~~~~~~~~~i----~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~-~~~l~G~pv~~~~~~p~~ 329 (394) ..++...-..+ -++....+..+....++++||...+.+++-..-.- +..++.. -..++|..|+.+..+|.| T Consensus 145 ~~t~~~lQ~Ala~~~~~l~~~feded~~~~V~FVnP~D~a~ylg~a~it~----qt~fG~tyl~nfLG~~II~S~kV~~G 220 (296) T protein:vir:98 145 DALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITT----QTAFGLTYLVDFTGTVIISTNDVTKG 220 (296) T ss_pred eechhhHHHHHHHHhhhhhhhccccCCCceEEEEehHHHHHHhcCCccch----hheechhhhhhccccEEEEcCcCCCc Confidence 01111111111 11223334444456689999999776553221111 1111111 013789999999999999 Q ss_pred ceEEeeccceEEEE-ee--cceEEEEecccchhhhcCcEEEEEEE-------------Eecc---EEecccceEEEEecC Q lcl|NC_019933. 330 QFLTGAFDAGAQVF-DR--WAARVEVATENQDDFIKNMVTILAEE-------------RLAL---AVYRPESFIKGSLAA 390 (394) Q Consensus 330 ~~~~gd~~~~~~~~-~~--~~~~i~~~~~~~~~~~~~~~~~~~~~-------------~~d~---~v~~~~a~~~l~~~~ 390 (394) +++..-..+....+ +. .++.-.. .+..|.+.+.+.. .+.+ -+...+++++.++++ T Consensus 221 ~~~~T~~~Ni~~ay~~~~~~~l~~~f------~~~~d~tglIGv~h~~~~~~~t~eT~~~~~~~lfpE~~dgiv~~tI~~ 294 (296) T protein:vir:98 221 EIWATVPENIIFAYINPNNSELAKEF------NLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTP 294 (296) T ss_pred eEEEeeecceEEEeecccccchhhhh------ccccccccceEEEeccccceeeehhHhHhHHHhcccccceEEEEEecC Confidence 98865443321111 11 1111111 1122233333221 1122 234468899999977 Q ss_pred CC Q lcl|NC_019933. 391 AA 392 (394) Q Consensus 391 a~ 392 (394) +- T Consensus 295 ~~ 296 (296) T protein:vir:98 295 GV 296 (296) T ss_pred CC Confidence 76 No 158 >protein:vir:7324 Length: 335 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848215;genbank:gi:30387386;genbank:GeneID:2641870 Probab=98.86 E-value=1.3e-10 Score=74.83 Aligned_cols=227 Identities=12% Similarity=0.067 Sum_probs=141.7 Q ss_pred HhhcccccC--C-cCccccchhhhhHHHhhhhhhhhHHHhccccccccCc-eeEEEEcCcccccceecCCccccccccce Q lcl|NC_019933. 108 ITSLSTNAD--G-SAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNT-LEYVRETGFTNAAAPVAEGAQKPESSLRF 183 (394) Q Consensus 108 ~~~~~~~~~--~-~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~eg~~~~~~~~~~ 183 (394) ......+.. . .+..+-|......|+|.+.+.++|++.+++......+ ....+..+. |.+.|..=++.++.++.++ T Consensus 1 m~~~~~~a~TL~E~Akr~~~d~~~~~IIE~l~~tneIL~~lpf~e~N~~tg~~~~vrt~L-P~~~fR~lN~g~~~s~~tt 79 (335) T protein:vir:73 1 MALIGQTLPSLLDIYNRTDKNGRIARIVEQLAKTNDILTDAIYVPCNDGSKHKTTIRAGI-PEPVWRRYNQGVQPTKTQT 79 (335) T ss_pred CCcCCCCchhHHHHHhhcCcchhHHHHHHHHhcCchHHhhcchhcccCCcccceeEEEec-CCchhhhcCCccccccceE Confidence 111111111 0 1222345667778999999999999999987643332 223333444 6788999999999999999 Q ss_pred eeEEeeeeeEEEeehhhHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhhccCCCc--cccccccc------------ Q lcl|NC_019933. 184 DLVQTSAKVIAHWMKASRQILSDSA---QLQSFINARLLRGLEVVEENQLLNGNGTGQ--NLLGLLPQ------------ 246 (394) Q Consensus 184 ~~i~~~~~k~~~~~~is~e~l~~s~---~~~~~i~~~la~a~~~~~d~a~l~g~g~~~--~~~Gi~~~------------ 246 (394) .+++-..+-+++.+.|.+.+.+... ++...-.....+++.+.+...+|+|+...+ .+.|+.+. T Consensus 80 ~qvt~~l~ilgg~~eVDr~La~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGDsa~~p~~FdGL~kR~~~~st~~a~~a 159 (335) T protein:vir:73 80 VPVTDTTGMLYDLGFVDKALADRSNNAAAFRVSENMGKLQGFNNKVARYSIYGNTDAEPEAFMGLAPRFNTLSTSKAASA 159 (335) T ss_pred EEEEEEEEEecchhhhhHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCcCCChhhccchhhhhcCccccccCcc Confidence 9999999999999999998877543 466777778999999999999999965443 23343110 Q ss_pred -----cc---------------------------------------------------------------------c--- Q lcl|NC_019933. 247 -----AT---------------------------------------------------------------------A--- 249 (394) Q Consensus 247 -----~~---------------------------------------------------------------------~--- 249 (394) ++ + T Consensus 160 ~~iIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~d~~G~~y~~~~~~~~w~~Gl~i~d~r~vvRI 239 (335) T protein:vir:73 160 ENVFSAGGSGSTNTSIWFMSWGENTAHMIYPEGMVAGFQHEDLGDDLVSDGNGGQFRAYRDEFKWDIGLSVRDWRSISRI 239 (335) T ss_pred cceeeccccccCceEEEEEEEcCCeeEEEcccCccccceeeeccceeeecCCCCEEeEEEeeeeeeeeeEEeCcccEEEE Confidence 00 0 Q ss_pred ccccc------cccccchHHHHHHHHH--HhhhhcCCCCeeEeCHHHHHHHHHhh-ccCCcccccCccc-CCCceeecce Q lcl|NC_019933. 250 FAAPI------TVANATAVDRLRLALL--QAQLAEFPATGIVLNPADWAGIELLK-DTQGRYILGNPQG-TLAPTLWGLP 319 (394) Q Consensus 250 ~~~~~------~~~~~~~~~~i~~~~~--~~~~~~~~~~~~~~~~~~~~~l~~lk-d~~G~~~~~~~~~-~~~~~l~G~p 319 (394) -.++. ..++....+.++.++. .++.......+|+||++....|++.. +.....+-..... ...-.+.|+| T Consensus 240 ~NIdvs~l~~d~~~~~~l~~lmi~a~~~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~l~~~~~~g~~~t~~~gip 319 (335) T protein:vir:73 240 CNIDVTTLTKDASTGADLISMMVDAYYARDVAMLGDGKEVIYANKTIHAWLHKQAMNAKNVNLTIEEYGGKKIVSFLGIP 319 (335) T ss_pred eecccccccccccchhhHHhhHHHHHHHHhccCCCCCceEEEechHHHHHHHHHHhccCceeeeeeccCCceeEEECCeE Confidence 00010 1112223344444442 33444444578999999999998754 4433222222222 2223588999 Q ss_pred EEEcCCCCcCceEEee Q lcl|NC_019933. 320 VVATQAMAVGQFLTGA 335 (394) Q Consensus 320 v~~~~~~p~~~~~~gd 335 (394) |..++.+-.++..+.. T Consensus 320 ir~~Dail~tE~~v~~ 335 (335) T protein:vir:73 320 IRRVDAILNTESAVTA 335 (335) T ss_pred EEEEeeeecCcccccC Confidence 9999987655433222 No 159 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=98.85 E-value=1.7e-10 Score=74.18 Aligned_cols=279 Identities=13% Similarity=0.030 Sum_probs=157.5 Q ss_pred Hhhccccc-CCcC-----ccccchhhhhHHHhhhhhhhhHHHhcccccccc-CceeEEEEcCcccccceecCCccccccc Q lcl|NC_019933. 108 ITSLSTNA-DGSA-----GATVQTTRLPGILELPQRRMTIRSLLAQGTMEG-NTLEYVRETGFTNAAAPVAEGAQKPESS 180 (394) Q Consensus 108 ~~~~~~~~-~~~~-----g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~eg~~~~~~~ 180 (394) ++..+..+ ...+ -.+.=+.+..++.+.....+.++++..+.++.+ +++.+|+... ..+.+...|+.+.-+. T Consensus 1 Ms~~n~~t~~~~~~sg~~~al~Le~f~GeV~taF~~~si~~~~~~vRti~~gkS~qf~~~G~--s~~~~~~pG~~ld~~~ 78 (401) T protein:vir:70 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYLGE--TELQVLAPGQSPAATS 78 (401) T ss_pred CCCCccccccccccccchhHhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEeee--eEeeeecCCCCcCCCC Confidence 11111111 1111 134457788889999999999999999999874 6899999743 4566777777776677 Q ss_pred cceeeEEeeeeeEEEeehhhHHHHH---H--H-HH-HHHHHHHHHHHHHHHHHHHHHhhcc---------CCCccccccc Q lcl|NC_019933. 181 LRFDLVQTSAKVIAHWMKASRQILS---D--S-AQ-LQSFINARLLRGLEVVEENQLLNGN---------GTGQNLLGLL 244 (394) Q Consensus 181 ~~~~~i~~~~~k~~~~~~is~e~l~---~--s-~~-~~~~i~~~la~a~~~~~d~a~l~g~---------g~~~~~~Gi~ 244 (394) +..++..|....+- +++-++. + + -+ +.+.+.++++.++++..|+.++.-- +....|.|.- T Consensus 79 ~~~dK~~ItID~lL----~a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~~p~~~~ 154 (401) T protein:vir:70 79 TQADKNQLVIDATV----IARNTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRTNPRVKG 154 (401) T ss_pred cccccEEEEeCcee----ehhhhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCcCC Confidence 77888777777643 4444332 1 2 14 6788889999999999998664211 1122232221 Q ss_pred ccc----ccccccccccccchHHHHHHHHHHhhhhcCCCC--eeEeCHHHHHHHHHh---hccCCccccc---CcccCCC Q lcl|NC_019933. 245 PQA----TAFAAPITVANATAVDRLRLALLQAQLAEFPAT--GIVLNPADWAGIELL---KDTQGRYILG---NPQGTLA 312 (394) Q Consensus 245 ~~~----~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~~~~~~~~~~l~~l---kd~~G~~~~~---~~~~~~~ 312 (394) ... +.........+....+.+.++...+...+.+.. ++++.|..|..|..- -|.. |-.. ....+.. T Consensus 155 ~G~~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~~r~vvl~pp~~Ys~Ll~~d~L~nrd--~~~s~~g~~~~G~v 232 (401) T protein:vir:70 155 HGFSINVEVAEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKT--YTISQSGATIQGFT 232 (401) T ss_pred CceEEeccccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhcCcccchh--hccccCCccccceE Confidence 111 111111122333456677788778777776544 456667777666432 1111 1111 1122233 Q ss_pred ceeecceEEEcCCCCcCc---------------e--EEeeccceE-EEEeecce-EEEEeccc---chhhhcCcEEEEEE Q lcl|NC_019933. 313 PTLWGLPVVATQAMAVGQ---------------F--LTGAFDAGA-QVFDRWAA-RVEVATEN---QDDFIKNMVTILAE 370 (394) Q Consensus 313 ~~l~G~pv~~~~~~p~~~---------------~--~~gd~~~~~-~~~~~~~~-~i~~~~~~---~~~~~~~~~~~~~~ 370 (394) ..+.|+||+.++.+|.+. . +-+|++... .+|.+..+ .++.-+-. ..+-.+-...+-++ T Consensus 233 ~~vaGv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~~lt~~~~~d~r~~~~~id~~ 312 (401) T protein:vir:70 233 LSSYNCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSIDVTGDIFYEKKEKTYYIDTF 312 (401) T ss_pred EEEeceEEEeeccccccccccccccccccCCCccCCCCccccceeEEEEehhheEEEEeeccccchhhhhhhhHHHHHHH Confidence 468999999999998631 1 124555433 23333322 22221111 11111112334467 Q ss_pred EEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 371 ERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 371 ~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) .-++..+++|+|.++++.+-..-| T Consensus 313 ~a~g~g~~RPeaa~vv~~k~~~~~ 336 (401) T protein:vir:70 313 MAEGAIPDRWEAVSVVTTKRNTTT 336 (401) T ss_pred HHhCCcccchhheEEEeecCcccc Confidence 788999999999999865554333 No 160 >protein:vir:93966 Length: 400 # NCBI annotation: structural protein # Family: family:all:2417 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764320;genbank:gi:115315634;genbank:GeneID:5176553 Probab=98.85 E-value=2.1e-10 Score=73.67 Aligned_cols=373 Identities=13% Similarity=0.087 Sum_probs=192.4 Q ss_pred Cc--hHHHHHHHHHHHHHHHHHHHHHHHhhh-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhh Q lcl|NC_019933. 1 MS--DINAINSTLANISDSLKAHADRAVKDQ-ELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQHISI 77 (394) Q Consensus 1 Mk--~i~el~~~~~~~~~~~k~~~e~~~~~~-~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~ 77 (394) |+ +|.|-++.+....++.-.+..+....+ ...-|-..+.+++++.+.++.-+|.+.++.++..++.+.+...-...+ T Consensus 8 ~~K~~l~EK~~~~a~~~E~~~~LKS~~~G~evknaiedl~K~~EL~~TlS~~~iEI~~~en~LNa~~E~~KGK~kMt~~i 87 (400) T protein:vir:93 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKMTNFI 87 (400) T ss_pred cccchHHHHHHHHhhhhhhhhhhhhhhhcchhhhhhhhchhHHHHHHhHhhcchhhhhhhhhhhhhhhhhhhhHHHHHHH Confidence 66 366666655555554444433333322 122233457889999999999999999999887777766654332222 Q ss_pred hhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccc--cCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCce Q lcl|NC_019933. 78 GQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTN--ADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTL 155 (394) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~ 155 (394) .........+..+..... ......+........ +.++....+|..+...|...+..+.+++..+-+...+.--+ T Consensus 88 ~sq~A~~eF~~vL~~N~G----~S~~k~AW~A~L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~V 163 (400) T protein:vir:93 88 ESQNAVTEFFDVLKKNSG----KSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLV 163 (400) T ss_pred hhHHHHHHHHHHHhccCC----chhhhhhhhhhHhhcCcceeccchhccHHHHHHHHHhhhccCcceeeeeeccchhhhH Confidence 222222222222222111 112222222222222 22445678999999999999999999988665555443222 Q ss_pred eEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHH---H-HHHHHHHHHHHHHHHH-HHHHHHH Q lcl|NC_019933. 156 EYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSD---S-AQLQSFINARLLRGLE-VVEENQL 230 (394) Q Consensus 156 ~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~---s-~~~~~~i~~~la~a~~-~~~d~a~ 230 (394) +... .....+...-.|..+.+...+|..-++.+-.++....+- ++..+ + ..+..+++.+|+.++. +.+|.++ T Consensus 164 ~~s~--~s~~~Aq~HkdGqTK~eqa~~~~~~Tl~~~~VY~~~S~A-e~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~Al 240 (400) T protein:vir:93 164 SRSF--DSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLA-ERVKRLQMSYSELYNLIVAELTQAIVNKIVDLAL 240 (400) T ss_pred Hhhh--hhhhhhhhhccCCccccceeeeeeechhHHHHHHHHHHH-HHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 2111 112355666777877777777777777665544433332 23333 2 3579999999999998 8899999 Q ss_pred hhccCCCcccccccccccccc-----ccccccccc-hHHHHHHHHHHhhhhcCCCCeeEeCHHHHHH-HHHhhccCCccc Q lcl|NC_019933. 231 LNGNGTGQNLLGLLPQATAFA-----APITVANAT-AVDRLRLALLQAQLAEFPATGIVLNPADWAG-IELLKDTQGRYI 303 (394) Q Consensus 231 l~g~g~~~~~~Gi~~~~~~~~-----~~~~~~~~~-~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~-l~~lkd~~G~~~ 303 (394) +-|+|+++ ++.+-+.+.... .....++.+ ..+.|..+..-+.+...+ -.+++......+ |..|+.+..+.- T Consensus 241 V~GDG~N~-f~~~DK~advK~I~~~Ttkaksagktpfadaieeavdfvrptagr-rylivktedrkalldelrqatanah 318 (400) T protein:vir:93 241 VEGDGTNG-FKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGR-RYLIVKTEDRKALLDELRQATANAH 318 (400) T ss_pred heecCCCC-ccchhhHHHHHHHHHHhhhhhhcCCCchhHHHHHHHhhhccCCCc-eEEEEeccchHHHHHHHHhhccccc Confidence 99999875 222222211111 111112222 344555555544433222 245555555443 456665554333 Q ss_pred ccCcccCCC-ceeecce-EEE-cCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecc Q lcl|NC_019933. 304 LGNPQGTLA-PTLWGLP-VVA-TQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRP 380 (394) Q Consensus 304 ~~~~~~~~~-~~l~G~p-v~~-~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~ 380 (394) .....+... ..-.|.. +++ +..-.-...++.|.+. .+ +.++++- -....|..|.-.+..+....+.+... T Consensus 319 vriknddaeiasevgvdeiivytgskalkptvlvdqky--hi-dmqdltk----vdafewktnsnmilvetltsghvety 391 (400) T protein:vir:93 319 VRIKNDDAEIASEVGVDEIIVYTGSKALKPTVLVDQKY--HI-DMQDLTK----VDAFEWKTNSNMILVETLTSGHVETY 391 (400) T ss_pred eEeecchhhhhhhcCcceeeeeeccccccceeeecccc--cc-chhhhhh----hhhheeccCCceEEEeecccCcceee Confidence 222221110 0111221 111 1111111123344332 11 1122211 11223556666666676677777777 Q ss_pred cceEEEEec Q lcl|NC_019933. 381 ESFIKGSLA 389 (394) Q Consensus 381 ~a~~~l~~~ 389 (394) +|-+++++. T Consensus 392 nagavitvs 400 (400) T protein:vir:93 392 NAGAVITVS 400 (400) T ss_pred ccceeEeeC Confidence 777777776 No 161 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=98.80 E-value=2.4e-09 Score=67.81 Aligned_cols=272 Identities=13% Similarity=0.025 Sum_probs=158.1 Q ss_pred cccccCCcCccccc---hhhhhHHHhhhhhhhhHHHhccccc-cc--cCceeEEEEcCcccccceecC-Cccccccccce Q lcl|NC_019933. 111 LSTNADGSAGATVQ---TTRLPGILELPQRRMTIRSLLAQGT-ME--GNTLEYVRETGFTNAAAPVAE-GAQKPESSLRF 183 (394) Q Consensus 111 ~~~~~~~~~g~~ip---~~~~~~ii~~~~~~~~l~~~~~~~~-~~--~~~~~~~~~~~~~~~~~~~~e-g~~~~~~~~~~ 183 (394) ++......+|...- +.+.+.|++...+.-..++++++.. ++ ..++.+...... +.+.|.+. +..+|..+..+ T Consensus 1 ~~~~~a~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~-G~a~~~~~~~~dip~v~~~~ 79 (296) T protein:vir:10 1 MGVDKADAAGIWTVKQLTASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDGV-GIAQIVADYTDDLPLVDALA 79 (296) T ss_pred CcccchhhhHHHHHHHHHHHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeecc-CceeEeCCCccccceeeccc Confidence 22221122333333 2355566776666666666666543 22 234555555443 44555544 56689999999 Q ss_pred eeEEeeeeeEEEeehhhHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccc--- Q lcl|NC_019933. 184 DLVQTSAKVIAHWMKASRQILSDSA----QLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITV--- 256 (394) Q Consensus 184 ~~i~~~~~k~~~~~~is~e~l~~s~----~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~--- 256 (394) .......+.++..+.++.+=++.+. ++..--....++++++.+|+.+|+|+... ...|+++.++........ T Consensus 80 ~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~~-g~~GLlN~p~v~~~~~~~~W~ 158 (296) T protein:vir:10 80 TERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTAH-GIPSVFDYPNINNVVSGGSWS 158 (296) T ss_pred eeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccc-cceeEeecCCCccccccCCcc Confidence 9999999999999999987665442 57888888899999999999999997543 467999877654332221 Q ss_pred cccchHHHHHHHHHHhhhh---cCCCCeeEeCHHHHHHHHHhhccCCcccccCccc-CCCceeecceEEEcCCCCc-Cce Q lcl|NC_019933. 257 ANATAVDRLRLALLQAQLA---EFPATGIVLNPADWAGIELLKDTQGRYILGNPQG-TLAPTLWGLPVVATQAMAV-GQF 331 (394) Q Consensus 257 ~~~~~~~~i~~~~~~~~~~---~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~-~~~~~l~G~pv~~~~~~p~-~~~ 331 (394) +....++||..++..+... ...+..++++|..+..|.......|.-+...... ..+.++.+.|......-.. +.. T Consensus 159 ~~t~i~~Di~~~~~~l~~~s~g~~~p~~l~L~p~~~~~L~~~~~~~~~t~l~~ik~~~~~l~i~~~~~l~~a~~~g~~~~ 238 (296) T protein:vir:10 159 QPTTAVSDITSLLDIIETSTNGQHRATHLLLPTTARRIMQNLVPGTSVSYGEFFRQNNSGVTVEFVQYLNDYNGTGTSAA 238 (296) T ss_pred CHHHHHHHHHHHHHHHHHhhCceecceeEEeCHHHHHHHhhccCCCCccHHHHHHHhcCCceEEEeeeeccCCCCcceEE Confidence 1224477887777655432 3456789999999998875555455333322111 1222445555444332211 112 Q ss_pred EEeec-cceEEEEeecceEEEEecccchhhhcCcEEEEEEEEe-ccEEecccceEEE---Eec Q lcl|NC_019933. 332 LTGAF-DAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERL-ALAVYRPESFIKG---SLA 389 (394) Q Consensus 332 ~~gd~-~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~-d~~v~~~~a~~~l---~~~ 389 (394) ++... .+.+.+.....++. .+- ....-.+.++...++ +..+++|.|++++ +++ T Consensus 239 v~~~~~~~~~~~~v~~~~~~--~~~---e~~~l~~~~~~~~~~~Gv~i~~P~ai~~~dGI~~~ 296 (296) T protein:vir:10 239 IAYEKDPNNMAIEIPEATNA--LPA---QPKDLHFKIPVTSKATGLIVYRPLTMAVMKGITFA 296 (296) T ss_pred EEEEcCCceEEEEcCcceee--ecc---cccCceEEEeeEeeEEEEEEECCceeEEEeeeecC Confidence 22221 22222222222222 111 111223455567766 5889999999999 565 No 162 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=98.77 E-value=1.7e-09 Score=68.71 Aligned_cols=287 Identities=10% Similarity=0.044 Sum_probs=151.9 Q ss_pred HHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhh-hhHHHhccccccccCceeEEEEcCcc-------cccceecCC Q lcl|NC_019933. 102 INIKAAITSLSTNADGSAGATVQTTRLPGILELPQRR-MTIRSLLAQGTMEGNTLEYVRETGFT-------NAAAPVAEG 173 (394) Q Consensus 102 ~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~eg 173 (394) +.+.+.++..-.=++. -....-+++...+.-.+.+. +.|.+-++..+-.++..++-...... ........+ T Consensus 1 ~~~~~~~~~~~~Ms~~-i~~~fv~qy~~~v~~~~qq~~s~L~~tV~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 79 (322) T protein:vir:10 1 MKLNAIMSMLPLIAGD-IDQAFVQTYETTLRILSQQKSAKLKQYCQHKNESSESHNWETLASMDPDAVKRKRSRQQSADG 79 (322) T ss_pred Ccccceeeeeeeeech-hhhHHHHHHHHHHHHHHHHhhhhhhcccccccccccccceeecccccccccccccccccccCc Confidence 1111111110000000 01111255555555555444 44555555433333322211111100 011111111 Q ss_pred c-cccccccceeeEEeeeeeEEEeehhhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCcccccc--cccccc Q lcl|NC_019933. 174 A-QKPESSLRFDLVQTSAKVIAHWMKASRQI-LSDSAQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGL--LPQATA 249 (394) Q Consensus 174 ~-~~~~~~~~~~~i~~~~~k~~~~~~is~e~-l~~s~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi--~~~~~~ 249 (394) . +.|.....++..............|.+.- ++...|..+...+..+.+++++.|..++.+.-......+. ...... T Consensus 80 ~~dtp~~~~~~~~r~~~~~d~~~~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~~~~~gt~v~~~s 159 (322) T protein:vir:10 80 TYPTPVNNKPFAKRRTNVDTYDTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPASIKGTGQPVEFLA 159 (322) T ss_pred ccCCCccccccceEEEeecccccceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhccccccccccccccCC Confidence 1 34544444444444444444456777764 4455688888999999999999999888643211111110 000011 Q ss_pred ccccccccccchHHHHHHHHHHhhhhcCCCC---eeEeCHHHHHHHHHhhcc-CCcccccC-cc-cCCCceeecceEEEc Q lcl|NC_019933. 250 FAAPITVANATAVDRLRLALLQAQLAEFPAT---GIVLNPADWAGIELLKDT-QGRYILGN-PQ-GTLAPTLWGLPVVAT 323 (394) Q Consensus 250 ~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~---~~~~~~~~~~~l~~lkd~-~G~~~~~~-~~-~~~~~~l~G~pv~~~ 323 (394) ...........+++.|+.+...+...+.+.. .++++|..|..|...... +..|.-.. .. .+..++++|+.++.+ T Consensus 160 s~~i~~g~~g~t~~kl~~a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~d~~~ts~D~~~~~~l~~~G~ig~~lGf~~i~s 239 (322) T protein:vir:10 160 TQEIGDGTKPISFDYVTEITERFLENEIEPEVSKVIVIGPTQARKLLQITEATSADYTSAMDLQSKGIITNWMGYTWIVS 239 (322) T ss_pred CcccccCccchhHHHHHHHHHHHHhcCCCCCCCeEEEeCHHHHHHHhcchhhhhhhcccchhhhhcCeeeeeeeEEEEEe Confidence 1111222346678899999888888877643 578999999998765432 23333222 21 233568999999999 Q ss_pred CCCCcCce-----------------EEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEE Q lcl|NC_019933. 324 QAMAVGQF-----------------LTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKG 386 (394) Q Consensus 324 ~~~p~~~~-----------------~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l 386 (394) +.+|.... .|.-.+.++.+....++..++.+.+.. .+...+++...+|..+.+|++++.+ T Consensus 240 ~~lp~~~~t~~~~~~~~~~~~~~~~~~a~~k~Av~~a~~~dv~~~i~~~~~~---~~a~~I~~~~~~Ga~ri~~~gVv~i 316 (322) T protein:vir:10 240 TRLDKFDPTQWGMAAEDGPQGDEIWCIAMTDMALGYHSCKDIWTKVAEDPSA---SFAWRIYSAFTADCVRVEDEHIFKL 316 (322) T ss_pred ccCCccccccccccccCCCCccceeEEEEecCceeEEEeeeeeEEeeccCCc---chhhhhhhhhhhCceEeccCcEEEE Confidence 99984211 112223345555555666666544433 2234567778999999999999999 Q ss_pred EecCCC Q lcl|NC_019933. 387 SLAAAA 392 (394) Q Consensus 387 ~~~~a~ 392 (394) ...-+- T Consensus 317 ~~~e~~ 322 (322) T protein:vir:10 317 RLKNSL 322 (322) T ss_pred EEeccC Confidence 998777 No 163 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=98.76 E-value=8.8e-10 Score=70.23 Aligned_cols=284 Identities=11% Similarity=0.024 Sum_probs=158.3 Q ss_pred Hhhccccc-CCcC-----ccccchhhhhHHHhhhhhhhhHHHhcccccccc-CceeEEEEcCcccccceecCCccccccc Q lcl|NC_019933. 108 ITSLSTNA-DGSA-----GATVQTTRLPGILELPQRRMTIRSLLAQGTMEG-NTLEYVRETGFTNAAAPVAEGAQKPESS 180 (394) Q Consensus 108 ~~~~~~~~-~~~~-----g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~eg~~~~~~~ 180 (394) ++..+..+ ...+ -.+.=+.+..++.+.....+.++++..+.++.+ +++.+|+... ..+.+...|+.+.-+. T Consensus 1 Ms~~n~~t~p~~~gsg~~~aL~Le~f~GeV~taF~~~si~~~~~~vRtI~~gkS~qf~~lG~--s~a~y~~pG~~ldg~~ 78 (400) T protein:vir:10 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYLGE--TELQVLAPGQSPAATS 78 (400) T ss_pred CCCCccccccccccccchhhhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEeee--eEEeeecCCCCcCCCC Confidence 11111111 1111 134457788899999999999999999999874 6799999743 4677788888877667 Q ss_pred cceeeEEeeeeeEE-EeehhhH--HHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhh----cc-----CCCcccccccccc Q lcl|NC_019933. 181 LRFDLVQTSAKVIA-HWMKASR--QILSDSAQ-LQSFINARLLRGLEVVEENQLLN----GN-----GTGQNLLGLLPQA 247 (394) Q Consensus 181 ~~~~~i~~~~~k~~-~~~~is~--e~l~~s~~-~~~~i~~~la~a~~~~~d~a~l~----g~-----g~~~~~~Gi~~~~ 247 (394) +..++..|....+- .-..|.+ |.++ .-| +.+.+.++++.++++..|+.++. +. .....+.|..... T Consensus 79 ~~~dk~~ItIDtLL~a~~~V~dlDd~q~-~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~~~g~~~g~ 157 (400) T protein:vir:10 79 TQADKNQLVIDATVIARNTVAHLHDVQG-DIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTNPRVKGHGF 157 (400) T ss_pred cccCcEEEEeCceeeecchhhhHHHHhh-ccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCcccccc Confidence 77777777777643 2222222 1111 124 78888999999999999987763 11 0111233332222 Q ss_pred cccc----ccccccccchHHHHHHHHHHhhhhcCCCC--eeEeCHHHHHHHHHhhc-cCCcccccC---cccCCCceeec Q lcl|NC_019933. 248 TAFA----APITVANATAVDRLRLALLQAQLAEFPAT--GIVLNPADWAGIELLKD-TQGRYILGN---PQGTLAPTLWG 317 (394) Q Consensus 248 ~~~~----~~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~~~~~~~~~~l~~lkd-~~G~~~~~~---~~~~~~~~l~G 317 (394) .... ......+....+.+.++...+...+.+.. ++++.|..|..|..-.- -+-.+.... ...+....+.| T Consensus 158 s~~v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP~~d~vvl~pp~~Ys~Ll~~dkLvnrdf~~s~~g~~~~g~v~~v~G 237 (400) T protein:vir:10 158 SVNVEVNEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKSYTISQSGATIQGFVLSSYN 237 (400) T ss_pred ceeecccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhCCcccchhccccCCCccccceEEEEec Confidence 1111 11112233345566677777776665533 56777777777753210 000111111 11122246899 Q ss_pred ceEEEcCCCCcCc---------------e--EEeeccceEE-EEeecce-EEEEe---cccchhhhcCcEEEEEEEEecc Q lcl|NC_019933. 318 LPVVATQAMAVGQ---------------F--LTGAFDAGAQ-VFDRWAA-RVEVA---TENQDDFIKNMVTILAEERLAL 375 (394) Q Consensus 318 ~pv~~~~~~p~~~---------------~--~~gd~~~~~~-~~~~~~~-~i~~~---~~~~~~~~~~~~~~~~~~~~d~ 375 (394) +||+.++.+|... . +-+|++.... +|.+..+ .++.- .+...+-.+-...+-+++-++. T Consensus 238 v~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~tvk~~~lt~~~~~d~r~~~~~id~~~a~G~ 317 (400) T protein:vir:10 238 CPVIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALLVGRSIDVIGDIFYEKKEKTYYIDTFMSEGA 317 (400) T ss_pred eEEEeeCcCCcccCcccccccccCCCCccCCccccccceeEEEEehhheEEEEeeccccccccchhhHHHHHHHHHHhCC Confidence 9999999998521 1 2256654332 3333222 22221 1111111122234557788899 Q ss_pred EEecccceEEEEecCCCCC Q lcl|NC_019933. 376 AVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 376 ~v~~~~a~~~l~~~~a~~~ 394 (394) .+++|+|.++++.+-.+-. T Consensus 318 g~~RPeaa~vv~~~~~~~~ 336 (400) T protein:vir:10 318 IPDRWEAVSVVTTKRQSTG 336 (400) T ss_pred cccchhheEEEEecCCccc Confidence 9999999999986543333 No 164 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=98.70 E-value=5.1e-09 Score=66.05 Aligned_cols=266 Identities=14% Similarity=0.080 Sum_probs=134.6 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHHhcccc---cc---ccCceeEEEEcCcccccce----ecCCccccccc Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQG---TM---EGNTLEYVRETGFTNAAAP----VAEGAQKPESS 180 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~---~~---~~~~~~~~~~~~~~~~~~~----~~eg~~~~~~~ 180 (394) |. -..++|+.|..++++.+++...+..++..- .. .|+++++|+..... ...+ .+++..+...+ T Consensus 1 Ma------~~~~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~~~-~~~~~~~~~~~~~~~~~~~ 73 (392) T protein:vir:99 1 MA------NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSR-GHTRKLRGAGAERNLTVSD 73 (392) T ss_pred Cc------cccccHHHHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeeccccc-ceeeeccccccCCcccccc Confidence 21 134889999999999999999988887432 22 36678888754322 1111 23344555555 Q ss_pred cceeeEEeeee-eEEEeehhhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccccc Q lcl|NC_019933. 181 LRFDLVQTSAK-VIAHWMKASRQI-LSDSAQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVAN 258 (394) Q Consensus 181 ~~~~~i~~~~~-k~~~~~~is~e~-l~~s~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~ 258 (394) .+-..+.+... ..+.-+.|+++- .++..++...+.+....+++.++|..++.--...... ........+. T Consensus 74 ~~~~~~~~~id~~k~~~~~i~d~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~~~a~~~--------~~~~~~~~~~ 145 (392) T protein:vir:99 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYE--------AAGAVHEVAP 145 (392) T ss_pred cccceEEEEEeeeeecceeechHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccc--------ccccccccCh Confidence 55555555552 334445567664 4455678778888889999999998877432111110 0111122334 Q ss_pred cchHHHHHHHHHHhhhhcCCCC-eeEeCHHHHHHHHHhhc-----cCCcccccCcccCCCceeecceEEEcCCCCcCceE Q lcl|NC_019933. 259 ATAVDRLRLALLQAQLAEFPAT-GIVLNPADWAGIELLKD-----TQGRYILGNPQGTLAPTLWGLPVVATQAMAVGQFL 332 (394) Q Consensus 259 ~~~~~~i~~~~~~~~~~~~~~~-~~~~~~~~~~~l~~lkd-----~~G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~ 332 (394) ...++.+.++...+...+.+.. .++++|..+..|.+... ..|.-.......+..+++.|++|+.+..+|.+..+ T Consensus 146 ~~~~~~i~~a~~~L~~~~vP~~R~~vv~p~~~~~l~~~~~~~~~~~~g~~~~~~l~~G~vg~i~G~~v~~s~~~~~~t~~ 225 (392) T protein:vir:99 146 DEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAY 225 (392) T ss_pred hhhHHHHHHHHHHHhhcCCCCCCEEEEcHHHHHHHhcccceeecccccchhhhhhhcceeeeeeeeEEEeecccccccce Confidence 5578889998888887766544 67899998888764311 11111111122334468999999999999987765 Q ss_pred EeeccceEEEEeecc-----------------eEEEEecccchhhhcCcEEEEEEEEeccEEec---ccceEEE-EecCC Q lcl|NC_019933. 333 TGAFDAGAQVFDRWA-----------------ARVEVATENQDDFIKNMVTILAEERLALAVYR---PESFIKG-SLAAA 391 (394) Q Consensus 333 ~gd~~~~~~~~~~~~-----------------~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~---~~a~~~l-~~~~a 391 (394) .+..+. +.+..+.. +...+..+....+..+...+ ....+..... ..++... +++.. T Consensus 226 a~~~~a-~~~at~a~v~~~~~~~~~s~s~~~~v~~~~~~~~~~t~~s~~~~v--~~~~g~~~v~~~~~~~~~~~~~~~~~ 302 (392) T protein:vir:99 226 LYHPTA-FIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLI--DTYFGLKVVEDPNGVGFVRARKIHLI 302 (392) T ss_pred eeeccc-cccccccccccccccceeEEecccceecceeecccceeecccccc--ceeEEEEEEeeccccceeeeeeeeee Confidence 443221 11111110 00000000000000000000 0011111111 0011000 00000 Q ss_pred CCC Q lcl|NC_019933. 392 AGT 394 (394) Q Consensus 392 ~~~ 394 (394) ..+ T Consensus 303 ~~~ 305 (392) T protein:vir:99 303 PGS 305 (392) T ss_pred cce Confidence 000 No 165 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=98.69 E-value=7.5e-09 Score=65.12 Aligned_cols=292 Identities=13% Similarity=0.054 Sum_probs=153.3 Q ss_pred HHhhhhhhhhHHHHHHHhhcc-cccCCcCccccc--hhhhhHHHhhhhhhhhHHHhccccccc---cCceeEEEEcCccc Q lcl|NC_019933. 92 ESGGQRGRAEINIKAAITSLS-TNADGSAGATVQ--TTRLPGILELPQRRMTIRSLLAQGTME---GNTLEYVRETGFTN 165 (394) Q Consensus 92 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~g~~ip--~~~~~~ii~~~~~~~~l~~~~~~~~~~---~~~~~~~~~~~~~~ 165 (394) ..+... .......+....+. ...++.|.+++. +.+.+.|++...+....++++++..-. ..++.+...... + T Consensus 1 ~~~~~~-~~~~~~~~~~~~~~~~~~d~~~~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~~~~~~~~et~~~~~~e~~-G 78 (314) T protein:vir:10 1 MAIKFD-AEQAKITTHLEQMGVEKADAAGIWAVSQLTAALNRAYEKEYAENSVVNIFPVTNEIPGHAKYFEYPEFDGV-G 78 (314) T ss_pred CccchH-HHHHHHHHHHHhhcccchhhhHHHHHHHHHHHHHHHhhhhccccccceeeccccCCCCceeEEEeeeeccc-c Confidence 000000 00000111111111 222233344444 345556777666666666666554321 234556655543 4 Q ss_pred ccceecC-CccccccccceeeEEeeeeeEEEeehhhHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHhhccCCCccc Q lcl|NC_019933. 166 AAAPVAE-GAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSA----QLQSFINARLLRGLEVVEENQLLNGNGTGQNL 240 (394) Q Consensus 166 ~~~~~~e-g~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~----~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~ 240 (394) .+.|.+. +..+|..+..++......+.++..+.++..=+..+. ++...-....++++.+.+|+.+|.|+... .. T Consensus 79 ~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~-g~ 157 (314) T protein:vir:10 79 IAQIIADYSDDLPLVDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDKLVWSGSAPH-GI 157 (314) T ss_pred ceeeeCCcccccceeecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccc-cc Confidence 5556655 455899999999999999999999999876555442 57888888899999999999999997654 57 Q ss_pred cccccccccccccccc---cccchHHHHHHHHHHhhhh---cCCCCeeEeCHHHHHHHHHhhccCCcccccCcccCC-Cc Q lcl|NC_019933. 241 LGLLPQATAFAAPITV---ANATAVDRLRLALLQAQLA---EFPATGIVLNPADWAGIELLKDTQGRYILGNPQGTL-AP 313 (394) Q Consensus 241 ~Gi~~~~~~~~~~~~~---~~~~~~~~i~~~~~~~~~~---~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~-~~ 313 (394) .|+++.++........ +....++||..++..+... ...+..++++|..+..|...-+.+|.-+........ +- T Consensus 158 ~GLlN~p~v~~~~~~~~WaT~~ei~~Di~~~~~~l~~~s~g~~~p~~l~Lpp~~~~~L~~~~~~~~~tvl~~l~~n~~~l 237 (314) T protein:vir:10 158 VSVFDQPNINNVVATPNWSVPQNAIDDVTAMIDAVESSTQGLHHVTDILLPASARRVMQGLVPQTNLSYGELFTRNNPGL 237 (314) T ss_pred eeEeecCCCccccCCCCcccHHHHHHHHHHHHHHHHHhcCccccceeEEecHHHHHhhcccccCCCccHHHHHHHhCCCc Confidence 7999877644332221 2222366666666666542 245668999999988775443444433332211111 12 Q ss_pred eeecceEEEcCCCCcCce-EEe-eccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEe-ccEEecccceEEEE-ec Q lcl|NC_019933. 314 TLWGLPVVATQAMAVGQF-LTG-AFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERL-ALAVYRPESFIKGS-LA 389 (394) Q Consensus 314 ~l~G~pv~~~~~~p~~~~-~~g-d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~-d~~v~~~~a~~~l~-~~ 389 (394) +|.+.|-.........+. ++. +-...+.+.....+ ...+-.. ..-.+......++ |..+++|.||++++ ++ T Consensus 238 ~I~~~~el~~ag~~g~~~~v~y~~~~~~~~~~vp~~~--~~l~~e~---~~~~~~~~~~~r~~Gv~i~~P~ai~~~dGI~ 312 (314) T protein:vir:10 238 TIRFLQFLDNYDGAGGKAALAFEKSPLNMSIEIPEVT--NVLPAQP---KDLHFRYPVTSKATGLIVYRPLTMAVIKGIT 312 (314) T ss_pred EEEEcccccccCCCcceEEEEEecCCcEEEEecCccc--eeeccee---cCceEEEcceeeeEEEEEECcceeEeeeeee Confidence 344444444333221111 111 11111111111111 1111100 0111222234454 67788999999775 33 Q ss_pred CC Q lcl|NC_019933. 390 AA 391 (394) Q Consensus 390 ~a 391 (394) =+ T Consensus 313 ~~ 314 (314) T protein:vir:10 313 FA 314 (314) T ss_pred cC Confidence 33 No 166 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=98.68 E-value=1.7e-08 Score=63.24 Aligned_cols=268 Identities=14% Similarity=0.104 Sum_probs=157.5 Q ss_pred cccCCcCccccc--hhhhhHHHhhhhhhhhHHHhccccc-cc--cCceeEEEEcCcccccceecC-CccccccccceeeE Q lcl|NC_019933. 113 TNADGSAGATVQ--TTRLPGILELPQRRMTIRSLLAQGT-ME--GNTLEYVRETGFTNAAAPVAE-GAQKPESSLRFDLV 186 (394) Q Consensus 113 ~~~~~~~g~~ip--~~~~~~ii~~~~~~~~l~~~~~~~~-~~--~~~~~~~~~~~~~~~~~~~~e-g~~~~~~~~~~~~i 186 (394) ..++++|..++. +.+.+.+++.+.+....++++++.. ++ ...+.+...... +.+.+.+. +.++|..+..++.. T Consensus 1 ~~~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~-G~~~~~~~~~~dip~~~~~~~~~ 79 (301) T protein:vir:80 1 MQGKITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTRS-GAAKIIANGADDLPLVDVDMVRK 79 (301) T ss_pred CCccccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeeccc-eeEEEecCcccccccccccceeE Confidence 333344544443 3456677788888777777776643 22 234555554443 45556555 45578889889999 Q ss_pred EeeeeeEEEeehhhHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccc------ Q lcl|NC_019933. 187 QTSAKVIAHWMKASRQILSDSA----QLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITV------ 256 (394) Q Consensus 187 ~~~~~k~~~~~~is~e~l~~s~----~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~------ 256 (394) ......++..+.++..=++.+. ++..--....++++++.+|+.+|+|+.. ....|+++.++........ T Consensus 80 ~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~-~g~~GLlN~p~~~~~~~~~~~~~~~ 158 (301) T protein:vir:80 80 SVPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKK-YAIKGAFEATGIQIDVSPTTGVGNV 158 (301) T ss_pred EEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeeccc-ccceeeecCCCcccccccCcccccc Confidence 9999999999999987665442 5888888899999999999999999764 3467998887653322111 Q ss_pred ------cccchHHHHHHHHHHhhhh---cCCCCeeEeCHHHHHHHHHhh--ccCCcccccCccc-CCCceeecceEEEcC Q lcl|NC_019933. 257 ------ANATAVDRLRLALLQAQLA---EFPATGIVLNPADWAGIELLK--DTQGRYILGNPQG-TLAPTLWGLPVVATQ 324 (394) Q Consensus 257 ------~~~~~~~~i~~~~~~~~~~---~~~~~~~~~~~~~~~~l~~lk--d~~G~~~~~~~~~-~~~~~l~G~pv~~~~ 324 (394) +....+++|..++.++... ...+..++++|..+..|.... +..|..+...... ....++.+.|.+... T Consensus 159 ~~w~~~t~~ei~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~~~tvl~~l~~~~~~~~I~~~p~L~~~ 238 (301) T protein:vir:80 159 SKWEKKTAEQIIDEIGEAHTKITVLPGYGTASLKLCLPPKQFELINKKRYSNEDSRSVLKVLQDNAWFSAIVRVPDLAGM 238 (301) T ss_pred cccccCCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHhhhhccccCCCCeeHHHHHHHHcCcceEEEcceeccC Confidence 1222467777777776432 235678999999999997543 4445444332221 112245555555443 Q ss_pred CCCcCc--eEEeeccceEEEEeecceEEEEecccchhhhcCc-EEEEEEEEe-ccEEecccceEEEE-e Q lcl|NC_019933. 325 AMAVGQ--FLTGAFDAGAQVFDRWAARVEVATENQDDFIKNM-VTILAEERL-ALAVYRPESFIKGS-L 388 (394) Q Consensus 325 ~~p~~~--~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~-~~~~~~~~~-d~~v~~~~a~~~l~-~ 388 (394) .....+ +++.+-.+.+.+.....++. ..- -.++. +..-...++ +..+++|.||++++ + T Consensus 239 g~~g~~~~v~~~~~~d~~~~~v~~~~~~--~~~----e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 239 GTAGSDSFAVIHDSNETAELIIPMDITR--HPE----EYSFPRTKVPFEERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred CCCcccEEEEEecCCcEEEEEecCceee--ecc----eecCceeEeeeeeeeEEEEEEccceEEEEecC Confidence 321111 12222122222222222221 111 11222 122234444 67889999999987 5 No 167 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=98.67 E-value=1.3e-08 Score=63.90 Aligned_cols=291 Identities=10% Similarity=0.034 Sum_probs=156.9 Q ss_pred chhhhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhccc--ccCCcCccccc---hhhhhHHHhhhhhhhhHHHhcccc Q lcl|NC_019933. 74 HISIGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLST--NADGSAGATVQ---TTRLPGILELPQRRMTIRSLLAQG 148 (394) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~g~~ip---~~~~~~ii~~~~~~~~l~~~~~~~ 148 (394) .+.+ .........+......+.. .+....|.+.. +.+.+.+++...+....+.++++. T Consensus 1 ~~~~-----------------~~~~~~~~~~~~~~~~~~~~~da~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~ 63 (319) T protein:vir:10 1 MTTK-----------------KFDEADKSNVEMYLIQAGVKQDAAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVT 63 (319) T ss_pred CCCc-----------------chhHHhhHHHHHHHhhccchhhhhhhhhhHHHHHHHHHHHHHHhhhhcceechhhcccc Confidence 1110 0000000001111111111 11122343333 334556777777777777777654 Q ss_pred c-cc--cCceeEEEEcCcccccceecC-CccccccccceeeEEeeeeeEEEeehhhHHHHHHHH----HHHHHHHHHHHH Q lcl|NC_019933. 149 T-ME--GNTLEYVRETGFTNAAAPVAE-GAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSA----QLQSFINARLLR 220 (394) Q Consensus 149 ~-~~--~~~~~~~~~~~~~~~~~~~~e-g~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~----~~~~~i~~~la~ 220 (394) . ++ ..++.+...... +.+.|.+. +..+|..+..+.......+.++..+.++..=+..+. ++...-....++ T Consensus 64 ~~~~~~~~~~~~~~~~~~-G~a~~~~d~~~dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~ 142 (319) T protein:vir:10 64 TELSPTDKTFEYMTFDKV-GTAQIIADYTDDLPLVDALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQL 142 (319) T ss_pred cCCCCceEEEEeeeeccc-cceeeecCccccccceeccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHH Confidence 3 22 234555555443 45566655 555899998899989999999999999886555442 578888889999 Q ss_pred HHHHHHHHHHhhccCCCcccccccccccccccccc-------ccccchHHHHHHHHHHhhh---hcCCCCeeEeCHHHHH Q lcl|NC_019933. 221 GLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPIT-------VANATAVDRLRLALLQAQL---AEFPATGIVLNPADWA 290 (394) Q Consensus 221 a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~-------~~~~~~~~~i~~~~~~~~~---~~~~~~~~~~~~~~~~ 290 (394) ++++.+|+.+|+|+... ...|+++.++......+ .+....+++|..++..+.. ....+..++++|..+. T Consensus 143 ~~~~~~n~i~f~G~~~~-g~~GLlN~p~~~~~~~~~~~~~~t~t~~~i~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~ 221 (319) T protein:vir:10 143 AHDQLVNRLVFKGSAPH-KIVSVFNHPNITKITSGKWIDVSTMKPETAEAELTQAIETIETITRGQHRATNILIPPSMRK 221 (319) T ss_pred HHHHhhceEEEeecccc-cceeEEeCCCceeeecCCCCCccccCHHHHHHHHHHHHHHHHHhcCceeeceEEEecHHHHH Confidence 99999999999997644 46799998775443222 1222345667666666543 2345778999999999 Q ss_pred HHHHhhccCCcccccCcccC-CCceeecceEEEcCCCCc-CceEEeec-cceEEEEeecceEEEEecccchhhhcCcEEE Q lcl|NC_019933. 291 GIELLKDTQGRYILGNPQGT-LAPTLWGLPVVATQAMAV-GQFLTGAF-DAGAQVFDRWAARVEVATENQDDFIKNMVTI 367 (394) Q Consensus 291 ~l~~lkd~~G~~~~~~~~~~-~~~~l~G~pv~~~~~~p~-~~~~~gd~-~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 367 (394) .|.......|..+....... ..-++.+.|.+....... +..++... ...+.+.....++. .+-... .-.+.. T Consensus 222 ~L~~~~~~~~~t~l~~lk~~~~~l~I~~~pel~~ag~~g~~~~v~y~~~~~~~~~~v~~~~~~--~~~e~~---~l~~~~ 296 (319) T protein:vir:10 222 VLAIRMPETTMSYLDYFKSQNSGIEIDSIAELEDIDGAGTKGVLVYEKNPMNMSIEIPEAFNM--LPAQPK---DLHFKV 296 (319) T ss_pred hhhcccCCCCeeHHHHHHHhcCCceEEEeeeecccCCCcceEEEEEecCCceEEEecCcceee--eeeeec---CceEEE Confidence 99765555554443322211 122455555544332111 11122221 22222222222222 111110 111223 Q ss_pred EEEEEe-ccEEecccceEEEE-e Q lcl|NC_019933. 368 LAEERL-ALAVYRPESFIKGS-L 388 (394) Q Consensus 368 ~~~~~~-d~~v~~~~a~~~l~-~ 388 (394) ....++ +..+++|.||++++ + T Consensus 297 ~~~~r~~Gv~i~~P~ai~~~dGI 319 (319) T protein:vir:10 297 PCTSKCTGLTIYRPMTIVLITGV 319 (319) T ss_pred eeeeeeEEEEEEccceeEeeecC Confidence 334444 46788899999987 5 No 168 >protein:vir:1663 Length: 393 # NCBI annotation: unknown # Family: family:all:2417 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044952;genbank:gi:9629659;genbank:GeneID:1261309 Probab=98.66 E-value=1.1e-09 Score=69.65 Aligned_cols=373 Identities=13% Similarity=0.075 Sum_probs=187.8 Q ss_pred Cc--hHHHHHHHHHHHHHHHHHHHHHHHhhh-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhh Q lcl|NC_019933. 1 MS--DINAINSTLANISDSLKAHADRAVKDQ-ELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQHISI 77 (394) Q Consensus 1 Mk--~i~el~~~~~~~~~~~k~~~e~~~~~~-~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~ 77 (394) |. +|-|-+..+.++.+..-.+..+....+ ...-|-..+.+++++.+.++.-+|.+.++.++..++.+.+...-...+ T Consensus 1 mnkpdliekqnrlaelkennvslksqisgfevknaiedl~K~~ELe~TlSe~~iEI~k~en~LN~~eE~~KGK~kMt~~i 80 (393) T protein:vir:16 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKMTNFI 80 (393) T ss_pred CCCcchhhhhhhhhhhhhcccchhhhccchhhhhhhhhchhHHHHHHhHhhcchhhhhhhhhhhhhhhcchhhHHHHHHH Confidence 65 466656666555543322222222221 112233457788999999999999998888888766665544332222 Q ss_pred hhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccc--cCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCce Q lcl|NC_019933. 78 GQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTN--ADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTL 155 (394) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~ 155 (394) .........+..+... .......++........ +.++....+|..+...|...+..+.+++..+-+...+.--+ T Consensus 81 esq~A~~eF~~vL~~N----~G~S~~k~AW~A~L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~V 156 (393) T protein:vir:16 81 ESQNAVTEFFDVLKKN----SGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLV 156 (393) T ss_pred hhHHHHHHHHHHHhcc----CCchhhhhhhhhhHhhcCcceeccchhccHHHHHHHHHhhhccCcceeeeeeccchhhhH Confidence 2222222222222221 11122222222222222 22445678999999999999999999988665554443222 Q ss_pred eEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHH---H-HHHHHHHHHHHHHHHH-HHHHHHH Q lcl|NC_019933. 156 EYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSD---S-AQLQSFINARLLRGLE-VVEENQL 230 (394) Q Consensus 156 ~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~---s-~~~~~~i~~~la~a~~-~~~d~a~ 230 (394) +... .....+...-.|..+.+...+|..-++.+-.++....+ -++..+ + ..+..+++.+|+.++. +.+|.++ T Consensus 157 ~~s~--~s~~eAq~HkdGqTK~eqa~~~~~~Tl~~~~VY~~~S~-Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~Al 233 (393) T protein:vir:16 157 SRSF--DSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSL-AERVKRLQMSYSELYNLIVAELTQAIVNKIVDLAL 233 (393) T ss_pred Hhhh--hhhhhhhhhccCCccccceeeeeeechhHHHHHHHHHH-HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 2111 11235566667777777777777666666554433333 223333 2 3579999999999998 8899999 Q ss_pred hhccCCCcccccccccccccc-----ccccccccc-hHHHHHHHHHHhhhhcCCCCeeEeCHHHHHH-HHHhhccCCccc Q lcl|NC_019933. 231 LNGNGTGQNLLGLLPQATAFA-----APITVANAT-AVDRLRLALLQAQLAEFPATGIVLNPADWAG-IELLKDTQGRYI 303 (394) Q Consensus 231 l~g~g~~~~~~Gi~~~~~~~~-----~~~~~~~~~-~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~-l~~lkd~~G~~~ 303 (394) +-|+|+++ ++.+-+.+.... .....++.+ ..+.|..+..-+.+...+ -.+++......+ |..|+.+..+.- T Consensus 234 V~GDG~N~-f~~~DK~advK~I~k~Ttkaksagktpfadaieeavdfvrptagr-rylivktedrkalldelrqatanan 311 (393) T protein:vir:16 234 VEGDGTNG-FKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGR-RYLIVKTEDRKALLDELRQATANAN 311 (393) T ss_pred heecCCCC-ccchhhHHHHHHHHHHhhhhhhcCCCchhHHHHHHHhhhccCCCc-eEEEEeccchHHHHHHHHhhhccCc Confidence 99999875 122222111111 111112222 344555555544433222 245555555443 455665443221 Q ss_pred ccCcccCCC-ceeecce-EEEcC-CCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecc Q lcl|NC_019933. 304 LGNPQGTLA-PTLWGLP-VVATQ-AMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRP 380 (394) Q Consensus 304 ~~~~~~~~~-~~l~G~p-v~~~~-~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~ 380 (394) .....+... ..-.|.. +++-. .-.-...++.|.+. .+ +.++++- -....|..|.-.+..+....+.+... T Consensus 312 vriknddteiasevgvdeiivytgskalkptvlvdqky--hi-dmqdltk----vdafewktnsnmilvetltsghvety 384 (393) T protein:vir:16 312 VRIKNDDTEIASEVGVDEIIVYTGSKALKPTVLVDQKY--HI-DMQDLTK----VDAFEWKTNSNMILVETLTSGHVETY 384 (393) T ss_pred eeeeccchhhhhhcCcceeeeeeccccccceeeecccc--cc-chhhhhh----hhhheeccCCceEEEeecccCcceee Confidence 111111110 0111221 11111 11111113344332 11 1122211 11223556666666777777777777 Q ss_pred cceEEEEec Q lcl|NC_019933. 381 ESFIKGSLA 389 (394) Q Consensus 381 ~a~~~l~~~ 389 (394) +|-+++++. T Consensus 385 nagavitvs 393 (393) T protein:vir:16 385 NAGAVITVS 393 (393) T ss_pred ccceeEeeC Confidence 777777776 No 169 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=98.61 E-value=3.1e-08 Score=61.77 Aligned_cols=280 Identities=9% Similarity=-0.041 Sum_probs=162.9 Q ss_pred HhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEcCcccccceecCCcccccccccee-eE Q lcl|NC_019933. 108 ITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFD-LV 186 (394) Q Consensus 108 ~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~-~i 186 (394) +.....+-.+.......+.+.+.|...-....|+.+++......+..+++....-..+.....-||.+.+.....-. .+ T Consensus 1 ma~~~~~~~t~~~~g~~~dl~~~I~~isp~dTPf~S~i~~~~a~~~~~~W~~d~l~~~~~~~~~EG~da~~~~~~~r~~~ 80 (317) T protein:vir:88 1 MATPTNAVSTVEINGKREDLIDIIYNIAPYDTPFMSAIGKGVATAITHEWQTDELRQPGKNTRVEGEDATIKAGSFTTML 80 (317) T ss_pred CCccccceEeeeeeeeeechhhhheecCCccCcceeeecCceecccEEEEEeeecCCccccccccCcccccccccCCEEe Confidence 11111111112234456778888888878889999998887776666666654433344445567776655432111 11 Q ss_pred EeeeeeEEEeehhhHHHHHHH--H--HHHHHHHHHHHHHHHHHHHHHHhhccCC-----C---cccccccccc------- Q lcl|NC_019933. 187 QTSAKVIAHWMKASRQILSDS--A--QLQSFINARLLRGLEVVEENQLLNGNGT-----G---QNLLGLLPQA------- 247 (394) Q Consensus 187 ~~~~~k~~~~~~is~e~l~~s--~--~~~~~i~~~la~a~~~~~d~a~l~g~g~-----~---~~~~Gi~~~~------- 247 (394) .=...-+...+.||..+..-+ . +...+-...-...+.+.+|.++|+|... + ....||+... T Consensus 81 ~N~tQIf~k~v~VSgTa~av~~~G~~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~~Gl~~~i~t~~~~~ 160 (317) T protein:vir:88 81 NNYCQISDETLQVTGTADRVKKAGRKNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQMANIFAYYKTNGSLG 160 (317) T ss_pred ccEEEEEEeEEEEeehhhhhhhcCccchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhhhhHHHHhccCceec Confidence 111112333444555443322 1 4444444555566778899999987521 1 1345654321 Q ss_pred --ccc-------cccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcccCC-C----- Q lcl|NC_019933. 248 --TAF-------AAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQGTL-A----- 312 (394) Q Consensus 248 --~~~-------~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~-~----- 312 (394) +.. ..+.......+.++|.+++.++-..+..+..++|++.....|.++...++.++........ + T Consensus 161 ~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~~Gg~~~~i~v~a~~k~~i~~~~~~~~~~i~~~~~~~~~g~~v~~ 240 (317) T protein:vir:88 161 ANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIWRNGGQANSIQTSSSIKKAISKNMKGRATEITLDASDNRIAQTVDV 240 (317) T ss_pred cCccccccCCCccccccccccccHHHHHHHHHHHHhcCCCCCEEEeChHHHHHHHHHhcCCceeEEEcccCeEEEEEEEE Confidence 110 0111222346888899999999999988889999999999998875444444432111110 0 Q ss_pred --ceeecceEEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecC Q lcl|NC_019933. 313 --PTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAA 390 (394) Q Consensus 313 --~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~ 390 (394) ..+--+.++.+.+||++.+++.|++..-.-+- ..+..+....+. +......+..++..++++.|.+++.--+ T Consensus 241 ~~tdfG~v~ii~~r~lp~~~~~~~D~~~~~l~~L-r~~~~e~laKtG-----d~~k~~i~~E~tLe~~N~~a~a~i~~l~ 314 (317) T protein:vir:88 241 YESDFGKYTIRANRWFHENTLFVFDPKMHSLCYL-RPFFQHELAKTG-----DSEKRQLLVEYTFRVNNEKSGALIRDVV 314 (317) T ss_pred EEeCCeEEEEEeCCCCCCCeEEEEcccccceeec-ccceeeccCCCc-----ccceeEEEEEEEEEEcCccceeEEEEec Confidence 01122588999999999999999875333232 333333322222 3455778889999999999999988544 Q ss_pred CCC Q lcl|NC_019933. 391 AAG 393 (394) Q Consensus 391 a~~ 393 (394) +.= T Consensus 315 ~~~ 317 (317) T protein:vir:88 315 AQL 317 (317) T ss_pred ccC Confidence 444 No 170 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=98.60 E-value=2.9e-08 Score=61.93 Aligned_cols=276 Identities=12% Similarity=0.008 Sum_probs=129.4 Q ss_pred cccc--cCCcCccccchhhhhHHHhhhhhhhhHHHhccc---------cccccCceeEEEEcCcccccceecCCc---cc Q lcl|NC_019933. 111 LSTN--ADGSAGATVQTTRLPGILELPQRRMTIRSLLAQ---------GTMEGNTLEYVRETGFTNAAAPVAEGA---QK 176 (394) Q Consensus 111 ~~~~--~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~eg~---~~ 176 (394) |..- .+.-.-..+|+.+..-+.+...+.+.|++-.-+ ...+|..+++|.+...++...-+.+.. .+ T Consensus 1 M~~~~~~T~l~Dii~pEvF~~Yv~~~~~e~~~l~qSGiv~~d~~l~~~~~~gG~~v~iPf~~~L~g~~~n~~~d~~~~~~ 80 (367) T protein:vir:80 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) T ss_pred CcchhhhhhhhhccchhhhhHHHhhhhhhhhhhhhcceeecCHHHHHHhhcCCCEEEeeeeccCCCCccccCCCCCcccc Confidence 1100 011112456666655555554444444332111 124567788998865544333333322 23 Q ss_pred cccccceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhh---cc---CCCccc-----cccc Q lcl|NC_019933. 177 PESSLRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLN---GN---GTGQNL-----LGLL 244 (394) Q Consensus 177 ~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~---g~---g~~~~~-----~Gi~ 244 (394) +..+.+-++-.-.....+..+..++-...-+ .+....|.++++..-.+...+.+|. |- ...+.. .+.. T Consensus 81 t~~kittg~~~a~v~~r~kaw~~~Dla~~lsG~dpm~~Ia~qva~yW~r~~q~~Lla~L~Gvf~~~~a~~~~~~~~~~~~ 160 (367) T protein:vir:80 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRV 160 (367) T ss_pred cccccccchheeeeehhcccchhhhHHHHhhCchHHHHHHHHHHHHhhhhhHHHHHHHHHHhhccccccchhhhhhhhcc Confidence 3344443333333333333444444332222 3566666666666555555444443 11 000000 0000 Q ss_pred -----cccccccccc-----cccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcccCCCce Q lcl|NC_019933. 245 -----PQATAFAAPI-----TVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPT 314 (394) Q Consensus 245 -----~~~~~~~~~~-----~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~ 314 (394) ...+.++.+. ......+...+.++...+-.....-+.++||+.++..|++++- =.|+-.......-++ T Consensus 161 ~a~~~~~~~~~~~Dis~~t~~~~~~~s~~~~~~A~~~lGD~~~~l~~i~mHS~V~~~L~~~~l--i~~i~~sd~~~~i~t 238 (367) T protein:vir:80 161 PAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDE--IEFIPDSKGQLTIPT 238 (367) T ss_pred ccccccccCceeeeeeccCCCccceecHHHHHHHHHHhccccccccEEEEchHHHHHHHhccc--cccccCCCCccccce Confidence 0011111111 1223456777888877776666677899999999999987641 011111111234468 Q ss_pred eecceEEEcCCCCcC-----ce----EEeeccceEEEEeecc--eEEEEecccchhhhcCcEEEEEEEEeccEEecccce Q lcl|NC_019933. 315 LWGLPVVATQAMAVG-----QF----LTGAFDAGAQVFDRWA--ARVEVATENQDDFIKNMVTILAEERLALAVYRPESF 383 (394) Q Consensus 315 l~G~pv~~~~~~p~~-----~~----~~gd~~~~~~~~~~~~--~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~ 383 (394) ++|++|++++.||-. .. +||.- ...+.... ...+++++.-..--.++..++...+ .+.||.+| T Consensus 239 y~G~~VIvDD~~Pv~~~~a~~~yttYlfg~G---Ai~~~~~~~~~~~E~~Rd~~~~~~gG~d~L~~Rr~---~~~hP~G~ 312 (367) T protein:vir:80 239 YMGKVVIVDDGMPVFGTGADKTYLSILFGGA---AFGYADGAPQVPVAVGRRELRGNGSGLEYILERKE---WIVHPGGF 312 (367) T ss_pred ecceeEEEeCCCcccccCCCceEEEEEEecc---eeeecccCCccceecccchhhhcCCceEEEEeeee---EEeeccee Confidence 999999999999942 22 33332 11121112 2234443332110123444444444 67788887 Q ss_pred EEEEecCCC----------------CC Q lcl|NC_019933. 384 IKGSLAAAA----------------GT 394 (394) Q Consensus 384 ~~l~~~~a~----------------~~ 394 (394) ...+-..++ || T Consensus 313 s~~~~~v~~~~~~~~~~~~~~~~~sPt 339 (367) T protein:vir:80 313 NWLDADVTIPDNTGSPSGITSGPPAIT 339 (367) T ss_pred eecccccccccccccccccccccCCCC Confidence 766543332 33 No 171 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=98.42 E-value=1.3e-07 Score=58.40 Aligned_cols=300 Identities=12% Similarity=0.029 Sum_probs=156.8 Q ss_pred HHHHHHHHhhhhhhhhHHHHHHHhhcc---cccCCcCccccc--hhhhhHHHhhhhhhhhHHHhccccc-cc--cCceeE Q lcl|NC_019933. 86 SFKAMAESGGQRGRAEINIKAAITSLS---TNADGSAGATVQ--TTRLPGILELPQRRMTIRSLLAQGT-ME--GNTLEY 157 (394) Q Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~g~~ip--~~~~~~ii~~~~~~~~l~~~~~~~~-~~--~~~~~~ 157 (394) -+.....++......+...-+....+. ....+.+.+++. +.+.+.|++...+....+.++++.. ++ ..++.+ T Consensus 1 ~~~~~~~~~~~~d~~~~~~~a~~~~~~~~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~~~~~~~~~~t~ 80 (329) T protein:vir:79 1 MRGNIMSKEMKYDEFEANVIANHMQLRGAKNDASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPVTSELSDTDKTFEY 80 (329) T ss_pred CccchhhhhhccchhhhhhHhhhcccccceeccchhhHHHHHHHHHHHHHHHhhhhcccchhhhcccccCCCCceeEEEe Confidence 000111111111111111111111111 111112223332 2355667777777777777776543 22 235556 Q ss_pred EEEcCcccccceecC-CccccccccceeeEEeeeeeEEEeehhhHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019933. 158 VRETGFTNAAAPVAE-GAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSA----QLQSFINARLLRGLEVVEENQLLN 232 (394) Q Consensus 158 ~~~~~~~~~~~~~~e-g~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~----~~~~~i~~~la~a~~~~~d~a~l~ 232 (394) ...... +.+.|.+. +..+|..+..+..-....+.++..+.++..=+..+. ++...-....++++.+.+|+.+|+ T Consensus 81 ~~~~~~-G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~ 159 (329) T protein:vir:79 81 QTFDKV-GHAKIIADYTDDLSTVDALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQLVNHLVFK 159 (329) T ss_pred eeeecc-eeeeeecCcccccceeecccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEe Confidence 655543 45556555 567888888888888888889999999876555442 588888888999999999999999 Q ss_pred ccCCCccccccccccccccccccc---------cccchHHHHHHHHHHhhhh--c-CCCCeeEeCHHHHHHHHHhhccCC Q lcl|NC_019933. 233 GNGTGQNLLGLLPQATAFAAPITV---------ANATAVDRLRLALLQAQLA--E-FPATGIVLNPADWAGIELLKDTQG 300 (394) Q Consensus 233 g~g~~~~~~Gi~~~~~~~~~~~~~---------~~~~~~~~i~~~~~~~~~~--~-~~~~~~~~~~~~~~~l~~lkd~~G 300 (394) |++. ....|+++.++..+...+. +....+++|..++.++... + ..+..++++|..+..|.......| T Consensus 160 G~~~-~g~~GLlN~p~v~~~~~~~~~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~~~p~~L~Lpp~~~~~L~~~~~~~~ 238 (329) T protein:vir:79 160 GSKP-HKIISVFEHPNLTTINSAGWNNAAGTGKKPETAQDELEQAIEKIETLTNGQHRANMILIPPSMRKVLMVRMPETT 238 (329) T ss_pred eccc-ccceeeecCCCccccccCCCCCccccccCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHHhhcccCCCC Confidence 9754 3457999887765433322 1122456777776666543 2 346789999999988865555555 Q ss_pred cccccCcccCC-CceeecceEEEcCCCC-cCceEEeecc-ceEEEEeecceEEEEecccchhhhcCcEEEEEEEEe-ccE Q lcl|NC_019933. 301 RYILGNPQGTL-APTLWGLPVVATQAMA-VGQFLTGAFD-AGAQVFDRWAARVEVATENQDDFIKNMVTILAEERL-ALA 376 (394) Q Consensus 301 ~~~~~~~~~~~-~~~l~G~pv~~~~~~p-~~~~~~gd~~-~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~-d~~ 376 (394) .-+........ .-+|.+.|-+...... .+..++.+.+ ..+.+.....++ ..+-... .-.+......++ +.. T Consensus 239 ~tvl~~lk~~~~~l~I~~~~el~~ag~~g~~~~v~y~~~~~~~~~~vp~~~~--~l~~q~~---~~~~~v~~~~r~~Gv~ 313 (329) T protein:vir:79 239 MSYLDYFKQQNGGITIESISELEDIDGAGTKAALVYEKDPMNMSIEIPEAFN--MLTAQPK---DLHFKVPCTSKCTGLT 313 (329) T ss_pred ccHHHHHHHhCCCcEEEEcccccccCCCCceEEEEEecCCceEEEecCccee--eeeceec---CceEEEceeeeEEEEE Confidence 44432222111 1234444444332211 1122222222 222222112222 2111110 111222234444 577 Q ss_pred EecccceEEEEecCCC Q lcl|NC_019933. 377 VYRPESFIKGSLAAAA 392 (394) Q Consensus 377 v~~~~a~~~l~~~~a~ 392 (394) +++|.||++++==..+ T Consensus 314 i~~P~ai~~~dGI~~~ 329 (329) T protein:vir:79 314 IYRPLTLVLIKGLVVG 329 (329) T ss_pred EECcceeeeeeeeeeC Confidence 8889999988711111 No 172 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=98.31 E-value=2.1e-07 Score=57.18 Aligned_cols=272 Identities=10% Similarity=-0.033 Sum_probs=119.6 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHHhcc-------ccccccCceeEEEEcCccc---ccceecCCccccccc Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLA-------QGTMEGNTLEYVRETGFTN---AAAPVAEGAQKPESS 180 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~-------~~~~~~~~~~~~~~~~~~~---~~~~~~eg~~~~~~~ 180 (394) +..... .+..+......++.+.+...++.... ..++.|.-+.+|.+....+ ....+.+.+.++.++ T Consensus 1 m~lsD~----~vfN~~~~~a~~e~~~q~~~~fn~as~gai~l~~~~~~Gd~~~~pf~~~l~g~~~~~~~~~~~~~vt~~k 76 (325) T protein:vir:95 1 MALSDL----AVYSEYAYSAFSETLRQQVDLFNTATGGAIMLQSAAHQGDFSDVAFFAKVTGGLVRRRNAYGSGTVAEKV 76 (325) T ss_pred Cchhhh----hhhhhhhhhhhhhhhhhhHhhhhhcccceeEeccccccCceeeccccccccccccccccCCCCceeccce Confidence 111111 11233334444454444433333221 2234455566776653211 222233344444444 Q ss_pred c-ceeeEEeeeeeEEEeehhhHHHHHH---H-HHHHHHHHHHHHHHHHHHHHHHHhhccCCC-ccccccccccccccccc Q lcl|NC_019933. 181 L-RFDLVQTSAKVIAHWMKASRQILSD---S-AQLQSFINARLLRGLEVVEENQLLNGNGTG-QNLLGLLPQATAFAAPI 254 (394) Q Consensus 181 ~-~~~~i~~~~~k~~~~~~is~e~l~~---s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~-~~~~Gi~~~~~~~~~~~ 254 (394) . +..++......-.+......+.+.. . ..+...|.+.++++..+.+=+.+|.+.... ..-..... ...+.+. T Consensus 77 itt~~~~av~~~r~~g~~~~d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~~a~~~~~~~v~--dis~~~~ 154 (325) T protein:vir:95 77 LKHLVDTSVKVAAGTPPVRLDPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLNVGLGSVYSALSQVSDVVY--DATANTD 154 (325) T ss_pred eccccceeeEEecccCcccccHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccee--eeecccC Confidence 3 3444444444433333333333221 1 134445555555544433333333221100 00001111 0111111 Q ss_pred cccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcccCCCceeecceEEEcCCCCcCce--- Q lcl|NC_019933. 255 TVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPTLWGLPVVATQAMAVGQF--- 331 (394) Q Consensus 255 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~--- 331 (394) ......+...+.++..++-.....-+.|+||+.++..|.+..-.+...++.......-++++|++|++++.+|.... T Consensus 155 ~~~~~~s~~~l~~A~~klGD~~~~l~~~~MHS~v~~~L~~~~L~~~~~~~~~~g~~~i~t~~G~~VIVdD~~p~~~~g~~ 234 (325) T protein:vir:95 155 AADKLPTWNNLNNGQAKFGDQSSQIAAWIMHSTPMHKLYGSNLTNGERLFTYGTVNVVRDPFGKLLVMTDSPNLFAAGTP 234 (325) T ss_pred cccccccHHHHHHHHHHhcccccceeEEEEchHHHHHHHHhhccccccccccCCcccccccCCcEEEEeCCCCCCCccCc Confidence 12223467888888888766666777999999999999876544444444332222335789999999999985431 Q ss_pred --E--EeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecC--CCCC Q lcl|NC_019933. 332 --L--TGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAA--AAGT 394 (394) Q Consensus 332 --~--~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~--a~~~ 394 (394) + ++--..++.+....+......+... -.+-...++.+.. -+.||.++..- .+. ..|| T Consensus 235 ~~ytty~lg~GAi~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~t---f~lhp~G~sw~-~s~~g~sPt 297 (325) T protein:vir:95 235 NVYHILGLVPGGVLIGQNNDFDANEETKNG--DENIIRTYQAEWS---YNIGVKGFAWD-KANGGKSPT 297 (325) T ss_pred eeEEEEEEecCeEEecCCCCccccccccCc--ccceeeeeeeeee---EEeecceeeee-cccccCCcC Confidence 0 1111122222222222221111111 1122223332221 35678777762 222 3344 No 173 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=98.30 E-value=1.2e-06 Score=53.06 Aligned_cols=263 Identities=10% Similarity=-0.054 Sum_probs=129.6 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccc-----cccCceeEEEEcCcccccceecCCccccccccceee Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGT-----MEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDL 185 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~ 185 (394) |. +.....+-|+.|..++++.+++..++.+++..-. -.|+++++|+..... +.++..+.-.+.+-.. T Consensus 1 m~---~~~N~~ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp~~~~-----v~dg~~~~~~~~te~~ 72 (418) T protein:vir:10 1 MA---VQDNNLLTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKLPYRVK-----SASGRTLVKQPMVDQT 72 (418) T ss_pred CC---ccccccccHHHHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEeeCCcee-----ecccCCccccccccce Confidence 22 2223556699999999999999999988876522 125688888743221 2334444444555455 Q ss_pred EEeeeee-EEEeehhhHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccccccchHH Q lcl|NC_019933. 186 VQTSAKV-IAHWMKASRQ-ILSDSAQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVANATAVD 263 (394) Q Consensus 186 i~~~~~k-~~~~~~is~e-~l~~s~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~ 263 (394) +.+...+ .+.-+.|+++ ...+..++...+.+....+++..+|..++.--.. +.. ......+....++ T Consensus 73 v~l~id~~k~~~~~itD~e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l~~~----------a~~-~~gt~gt~~~~~~ 141 (418) T protein:vir:10 73 IPFKIAYQEHVGLEYTVKDKTLDIMQFSERYLKSGMVQIANQIDRSLALTLKK----------AFH-SSGTPGVRPGAFI 141 (418) T ss_pred EEEEEecccccceeechHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhh----------ccc-ccccCCcCcchHH Confidence 5555422 3334556665 4445568888888889999999999887642111 100 1111112334689 Q ss_pred HHHHHHHHhhhhcCCC-C-e-eEeCHHHHHHHHHhhccCCccc-c-cCcccCCCceeecceEEEcCCCCcCceEEeeccc Q lcl|NC_019933. 264 RLRLALLQAQLAEFPA-T-G-IVLNPADWAGIELLKDTQGRYI-L-GNPQGTLAPTLWGLPVVATQAMAVGQFLTGAFDA 338 (394) Q Consensus 264 ~i~~~~~~~~~~~~~~-~-~-~~~~~~~~~~l~~lkd~~G~~~-~-~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~ 338 (394) ++.++...+...+.+. . . .+++|..+..|.+-.......- - .....+.-+++.|+.|+.++++|..+. |.+.. T Consensus 142 ~i~~a~~~Ld~~~VP~~G~R~lVv~P~~~~~L~~~~~~~~~~~~~~~~lr~G~IG~i~GF~V~~S~nip~~ta--g~~~~ 219 (418) T protein:vir:10 142 DFANAGAKQTTYAVPQDGMRHAVLDPFTCASLSDEVTKLFKESMVEQAYKMGYRGNVAAYEVYESQNLPKHTV--GDHGG 219 (418) T ss_pred HHHHHHHHHHhcCCCCCCceEEEeCHHHHHHHhhhccccccccccchhhheeeeeeeeceEEEEecCCCcccc--ccccc Confidence 9999988888887763 2 4 5799998877753211110000 0 011123346899999999999995432 11110 Q ss_pred --eEEEEeecceEEEEe--ccc-chhhhcCc-EEEEEEE---EeccEE-ecccceEEEEec---CCCCC Q lcl|NC_019933. 339 --GAQVFDRWAARVEVA--TEN-QDDFIKNM-VTILAEE---RLALAV-YRPESFIKGSLA---AAAGT 394 (394) Q Consensus 339 --~~~~~~~~~~~i~~~--~~~-~~~~~~~~-~~~~~~~---~~d~~v-~~~~a~~~l~~~---~a~~~ 394 (394) .+......+..+.++ .-. +.....|. +.|-... .+...+ .+..-|++..-. +++++ T Consensus 220 t~~v~ga~~~~~~~~~~~~t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~~~~~~~f~V~~~~~~~~~~~~ 288 (418) T protein:vir:10 220 TPLVNGTVVNGDTVGFDGGTASTTGFLKAGDVITFGGVFGVNPQNYETTGLLQEFVVLEDVDTDAGGAG 288 (418) T ss_pred ceeeecccccceeEEEeecceeeccceeeccEEEECceeecccccccccccceEEEEEeeccccccCcc Confidence 000000111111110 000 00011111 1111000 000000 012223222211 11111 No 174 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=98.21 E-value=1.9e-06 Score=51.89 Aligned_cols=269 Identities=10% Similarity=0.022 Sum_probs=124.9 Q ss_pred cccccCCcCccccch--hhhhHHHhhhhhhhhHHHhcc---------ccccccCceeEEEEcCcccc--ccee--cCCcc Q lcl|NC_019933. 111 LSTNADGSAGATVQT--TRLPGILELPQRRMTIRSLLA---------QGTMEGNTLEYVRETGFTNA--AAPV--AEGAQ 175 (394) Q Consensus 111 ~~~~~~~~~g~~ip~--~~~~~ii~~~~~~~~l~~~~~---------~~~~~~~~~~~~~~~~~~~~--~~~~--~eg~~ 175 (394) |.++.. .-..+|+ .+..-+.+...+.+.|++-.- ....+|..+++|.+...++. ..+- +..+. T Consensus 1 Ma~T~l--~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~L~g~~e~nv~~D~~~~~ 78 (349) T protein:vir:78 1 MAITTI--GDIVTGNIPVLASYMTEDPVEKTAFFDSGILTSTPYAAEIANGPSNIANLPFWKAIDTSIEPNYSNDVYQDI 78 (349) T ss_pred CCceEE--eeeeccCHHHHHHHHHHhhHHhhhhhhccceeccHHHHHHhhcCCCEEEeeeeecCCCCcccccCCCCcccc Confidence 333322 2346666 355555555555555544211 11234677889988643322 1121 21223 Q ss_pred ccccccceeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccc-------- Q lcl|NC_019933. 176 KPESSLRFDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQ-------- 246 (394) Q Consensus 176 ~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~-------- 246 (394) .+..+.+-++........+..+..++-.-.-+ .+....|.++++....+...+.+|.- ..|++.. T Consensus 79 ~t~~kitt~~~~a~~~~r~kaw~~~Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~------L~Gvf~~~~~a~~~~ 152 (349) T protein:vir:78 79 ATPRAIQTGEMMARVAYLNEGFGQADLTVELTSQNPLQSVASRLDNFWQRQAQRRLIAT------ALGLYNDNVSATDAY 152 (349) T ss_pred cccccccccceeeeeeeeccccchhHHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHH------HHHhhcccccccchh Confidence 34444433332222222233333333222212 35666677777766666655544431 1122211 Q ss_pred --cccccccccccccchHHHHHHHHHHhhhh-----cCCCCeeEeCHHHHHHHHHhhccCCcccccCcccCCCceeecce Q lcl|NC_019933. 247 --ATAFAAPITVANATAVDRLRLALLQAQLA-----EFPATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPTLWGLP 319 (394) Q Consensus 247 --~~~~~~~~~~~~~~~~~~i~~~~~~~~~~-----~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~l~G~p 319 (394) .+..+.........+...++++...+-.. ...-+.++||+.++..|.+++-= .|+-.......-++++|++ T Consensus 153 ~~~~~~t~d~s~~a~~~~~~~~dA~~~lgda~~Gd~~~~lt~i~mHS~v~~~L~~~~li--~~i~~s~~~~~i~ty~G~~ 230 (349) T protein:vir:78 153 HEQNDMVVDVSATLGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLI--DFIRDAENNTMFATYQGYR 230 (349) T ss_pred hhcccceeeeccccCCChhhhhhhHHHHHHHhccccccceeEEEEchHHHHHHHhhhhh--hhccCcccCcccceecCeE Confidence 11112222222234556666665555443 23456899999999998865321 1111111122346899999 Q ss_pred EEEcCCCCcCc---------eEEeeccceEEEEeec-ceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEec Q lcl|NC_019933. 320 VVATQAMAVGQ---------FLTGAFDAGAQVFDRW-AARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLA 389 (394) Q Consensus 320 v~~~~~~p~~~---------~~~gd~~~~~~~~~~~-~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~ 389 (394) |++++.||-.. .+||. .++...+.. ...++..++....-..++..+....++ +.||.++..-.-. T Consensus 231 VivDD~~Pv~~~g~~~~yttylfg~--GAi~~~~~~~~~~~et~rd~~~g~~~G~d~l~~R~~~---~~hp~G~s~~~a~ 305 (349) T protein:vir:78 231 VIVDDSMTVVGQGAQRKFISIIFGQ--GAIGYGEGNPVMPLEYEREASRANGGGVETLWTRKTW---LLHPFGYRFTSAV 305 (349) T ss_pred EEEeCCCccccCCCCceEEEEEeec--ceEEEccCCCccceeeecccccCCcceeEEEEEeeEE---Eeeeeeeeecccc Confidence 99999998421 23443 222222211 123454444322212345556555554 4455565554432 Q ss_pred CC---------CCC Q lcl|NC_019933. 390 AA---------AGT 394 (394) Q Consensus 390 ~a---------~~~ 394 (394) .+ +|| T Consensus 306 v~~~~~~~~~~sPt 319 (349) T protein:vir:78 306 ITGNGTETIARSAS 319 (349) T ss_pred ccCCccccccCCCC Confidence 22 344 No 175 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=98.19 E-value=2.6e-06 Score=51.25 Aligned_cols=268 Identities=10% Similarity=0.039 Sum_probs=124.7 Q ss_pred cccccCCcCccccch--hhhhHHHhhhhhhhhHHHhccc---------cccccCceeEEEEcCccccc--ceecCC--cc Q lcl|NC_019933. 111 LSTNADGSAGATVQT--TRLPGILELPQRRMTIRSLLAQ---------GTMEGNTLEYVRETGFTNAA--APVAEG--AQ 175 (394) Q Consensus 111 ~~~~~~~~~g~~ip~--~~~~~ii~~~~~~~~l~~~~~~---------~~~~~~~~~~~~~~~~~~~~--~~~~eg--~~ 175 (394) |.++.. .-..+|+ .+..-+.+...+.+.|++-.-+ ...+|..+++|.+...++.. .+-+.. +. T Consensus 1 Ma~T~l--~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~l~g~~e~n~~~dt~~~~ 78 (349) T protein:vir:94 1 MAITTI--GNIVTGNIPVLASYMTEDPVEKTAFFNSGILTPTPYAAEIARGPSNIANLPFWKAIDTSIEPNYSNDVYQDI 78 (349) T ss_pred CCceEE--eeeeccChHHHHHHHHHhHHHhhhhhhccceeccHHHHHHHhcCCCEEEeeeeecCCCCcccccCCCCcccc Confidence 333322 2346665 3555555555555555552111 12346678899876433221 122211 22 Q ss_pred ccccccc-eeeEEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCCCcccccccccc------ Q lcl|NC_019933. 176 KPESSLR-FDLVQTSAKVIAHWMKASRQILSDS-AQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQA------ 247 (394) Q Consensus 176 ~~~~~~~-~~~i~~~~~k~~~~~~is~e~l~~s-~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~------ 247 (394) .+.++.+ ..++.....+ +..+..++-.-.-+ .+..+.|.++++....+...+.+|.- +.|++... T Consensus 79 ~t~~kit~~~~~a~~~~r-~kaw~~~Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~------L~Gvf~~~~~~~~~ 151 (349) T protein:vir:94 79 ATPRAIQTGEMMARVAYL-NEGFGQADLTVELTSQNPLQSVASRLDNFWQRQAQRRLIAT------ALGLYNDNVSATDA 151 (349) T ss_pred cccccccccceeeeeeee-ccccchhHHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHH------HHhhhccccccccc Confidence 3334433 3333333322 22222332222212 35666677777776666665555431 12222211 Q ss_pred ----ccccccccccccchHHHHHHHHHHhhhh-----cCCCCeeEeCHHHHHHHHHhhccCCcccccCcccCCCceeecc Q lcl|NC_019933. 248 ----TAFAAPITVANATAVDRLRLALLQAQLA-----EFPATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPTLWGL 318 (394) Q Consensus 248 ----~~~~~~~~~~~~~~~~~i~~~~~~~~~~-----~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~l~G~ 318 (394) ...+.........+...++++...+-.. ...-+.++||+.++..|.+++-=. ++-.......-++++|+ T Consensus 152 ~~~~~~~~~d~~~~a~~~~~~~~~A~~~~Gdaa~Gd~~~~lt~i~mHS~v~~~L~~~~li~--~i~~s~~~~~i~ty~G~ 229 (349) T protein:vir:94 152 YHEQNDMVVDVSATSGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLID--FIRDAENNTMFATYQGY 229 (349) T ss_pred ccccCceeEEecccCCCChhhHHHHHHHHHHHhccccccceeEEEEchHHHHHHHhcchhh--hccCcccCcccceecCc Confidence 1111222222334556666665554433 234568999999999988753210 11111112234689999 Q ss_pred eEEEcCCCCcC---------ceEEeeccceEEEEeec-ceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEe Q lcl|NC_019933. 319 PVVATQAMAVG---------QFLTGAFDAGAQVFDRW-AARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSL 388 (394) Q Consensus 319 pv~~~~~~p~~---------~~~~gd~~~~~~~~~~~-~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~ 388 (394) +|++++.||-. ..+||. .++...+.. ...+++.++....-..++..+....++ +.||.++..-.- T Consensus 230 ~VivDD~~Pv~~~g~~~~yttylfg~--GAi~~~~~~~~~~~E~~rd~~~g~~~G~d~L~~R~~~---~~hp~G~s~~~a 304 (349) T protein:vir:94 230 RVIVDDSMTVVGQDTSRKFISIIFGQ--GAIGYGEGNPEMPLEYEREASRANGGGVETLWTRKTW---LLHPFGYSFTSA 304 (349) T ss_pred EEEEeCCCccccCCCCceEEEEEeec--ceEEeecCCCCcceeeecccccCCcceeEEEEEeeEE---Eeeeeeeeeccc Confidence 99999999842 123443 122222221 223555544432212344555555554 456666655543 Q ss_pred cCC---------CCC Q lcl|NC_019933. 389 AAA---------AGT 394 (394) Q Consensus 389 ~~a---------~~~ 394 (394) ..+ +|| T Consensus 305 ~v~~~~~~~~~~sPt 319 (349) T protein:vir:94 305 VITGNGTETIARSAS 319 (349) T ss_pred ccCCCccccccCCCC Confidence 222 344 No 176 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=98.09 E-value=4.9e-06 Score=49.72 Aligned_cols=378 Identities=12% Similarity=0.079 Sum_probs=162.4 Q ss_pred CchHHHHHHHH-HHHHHHHHHHHHHHHhhhhhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc---ch Q lcl|NC_019933. 1 MSDINAINSTL-ANISDSLKAHADRAVKDQELNASVRAK-VDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQ---HI 75 (394) Q Consensus 1 Mk~i~el~~~~-~~~~~~~k~~~e~~~~~~~~~~e~~~~-~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~---~~ 75 (394) +-+-..++++. ++.++.++++.+..............+ +......+++..+ .+-+.+.+.. .+...... .. T Consensus 224 ~~De~airAq~~aeeraRi~~I~~l~a~Fggr~~~l~~~~l~d~~~s~e~ar~---~il~~l~~~~-~p~~~~~~~~~~~ 299 (652) T protein:vir:79 224 VVDENSIRAQVLAEQKARVNGINDLFAMFGGRYQTLQAQCLADPECSLEQARE---KLLNEMGRES-TPSNKNTPAHIYA 299 (652) T ss_pred cCchhHHHHHHHHHHHHHHHHHHHHHHhhccccchHHHHHhhccCCCHHHHHH---HHHHHHHhhc-CCCCCCcceeEee Confidence 11222222221 222222222222111111000000000 0000001111111 1111111100 00000000 00 Q ss_pred hhhhhhhhHHHHHHHHHH-----------------------hhhhhh--hhHHHHHHHhhcccccCCcCccccchhhhhH Q lcl|NC_019933. 76 SIGQQFVNSDSFKAMAES-----------------------GGQRGR--AEINIKAAITSLSTNADGSAGATVQTTRLPG 130 (394) Q Consensus 76 ~~~~~~~~~~~~~~~~~~-----------------------~~~~~~--~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ 130 (394) ..++.+........+.+. ...++. ........+....+.++++-+.++-...-.. T Consensus 300 ~~g~~~~d~~~~aL~~R~g~~~~~~~~~~~g~~L~elAr~~L~~~G~~~~~~~~~~~v~~A~~hsTsDFp~IL~~~~nk~ 379 (652) T protein:vir:79 300 GNGNFVGDGIRQALMARAGFEKTERDNVYNGMTLREYARMSLTERGIGVSSYNPMQMVGAAFTHSTSDFGNILLDVANKA 379 (652) T ss_pred ccchhhHHHHHHHHHhhcCCcccccCccccCccHHHHHHHHHHhhccCCCCCCHHHHHHHHhhcCcchHHHHHHHHHHHH Confidence 000000000000000000 000000 0001111222222334444444443333333 Q ss_pred HHhhhhh-hhhHHHhcccccccc-CceeEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHH-HHHH Q lcl|NC_019933. 131 ILELPQR-RMTIRSLLAQGTMEG-NTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQI-LSDS 207 (394) Q Consensus 131 ii~~~~~-~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~-l~~s 207 (394) ++..-.. ...+...++..+++. ...+..+..+. +...-|.|++.+......=+..++...++|..+.||||+ ++|- T Consensus 380 l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~~-~~L~~V~E~gEyk~~t~~e~~e~~~l~tyG~~~~iTRqaiINDD 458 (652) T protein:vir:79 380 ILQGWEDAPETYEQWTRKGQLSDFKIAHRVGMGGF-SALRQVREGAEYKYVTTGDKQATIALATYGELFSITRQAIINDD 458 (652) T ss_pred HHHHHhhhHHHHHHHhccCCCccccccceeecCCC-CCccccCCCCccceeeecCccceeeeecccCeeeeehheeeccc Confidence 3333322 234566667666653 34556666554 566678898888776655566789999999999999996 5666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhh---ccCCCc-cccccc-cccccccccccccccchHHHHHHHHHHhhh----hcCC Q lcl|NC_019933. 208 AQLQSFINARLLRGLEVVEENQLLN---GNGTGQ-NLLGLL-PQATAFAAPITVANATAVDRLRLALLQAQL----AEFP 278 (394) Q Consensus 208 ~~~~~~i~~~la~a~~~~~d~a~l~---g~g~~~-~~~Gi~-~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~----~~~~ 278 (394) -+..+-|...++++.++.++..++. +++.-. .=+.++ +..-..-.+...-+...++.-..++..-.. -+.. T Consensus 459 L~a~~~ip~~~g~aA~~~~~~~vy~~l~~Np~~~~DGk~LF~hA~H~Nl~~~aa~~~~~l~~ar~aM~~Qk~g~~~l~i~ 538 (652) T protein:vir:79 459 LNMLTDVPMKLGRAAKSTIADLVYAILTSNPKISTDNVSLFDKAKHANVLESAAMDVASLDKARQLMRVQKEGERHLNIR 538 (652) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccccCCceeecccccccccccccCCHHHHHHHHHHHHHhccCCcccccc Confidence 6788888899999999999865543 332110 011223 211111111111122223333322222221 2345 Q ss_pred CCeeEeCHHHHHHHHHhhccCCcccccCcccCCCceeecc-eEEEcCCCCcCc---eEEeeccc--eEE---EEeecceE Q lcl|NC_019933. 279 ATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPTLWGL-PVVATQAMAVGQ---FLTGAFDA--GAQ---VFDRWAAR 349 (394) Q Consensus 279 ~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~l~G~-pv~~~~~~p~~~---~~~gd~~~--~~~---~~~~~~~~ 349 (394) |..|++.+........+..+...+- .+.+.+....+.|+ .+++++.+..+. -++.+... .+. +-...+.. T Consensus 539 P~~llvp~~le~~a~~ll~s~~v~~-a~~~~~~~Np~~~~~~~i~eprL~~~s~~~wylaa~~~~dtiev~yL~G~~~P~ 617 (652) T protein:vir:79 539 PAFVLVPTAMESVANQVIRSSSVKG-ADINAGIINPVKDFATVIAEPRLDDNSQTTFYLAASKGSDTIEVAYLNGVDTPY 617 (652) T ss_pred ccEEEecchhHHHHHHHhccCCCcc-cccccccccccccccccccccccCCCCcccEEEecCCCCCeEEEEEecCCCCCe Confidence 7788888887666555542221100 01111111224443 666676664422 23333221 111 11123333 Q ss_pred EEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEe Q lcl|NC_019933. 350 VEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSL 388 (394) Q Consensus 350 i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~ 388 (394) ++. ...|..+.+.|++...++.++.|--++++.+- T Consensus 618 ie~----~~gf~~dG~~~kvrlD~G~~~iD~RG~~k~t~ 652 (652) T protein:vir:79 618 IDQ----MEGFSVDGVTTKVRIDAGVAPVDHRGLVKCTA 652 (652) T ss_pred eee----cCCCCcceEEEEEEEeccCceeeccceeeecC Confidence 322 33588999999999999999999999998876 No 177 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=98.03 E-value=6.6e-06 Score=48.99 Aligned_cols=276 Identities=8% Similarity=-0.056 Sum_probs=128.1 Q ss_pred hhHHHHHHHhhc-------ccccCCcCccccchhhhhHHHhhhhhhhhHHH-h-cc--ccccccCceeEEEEcCcccccc Q lcl|NC_019933. 100 AEINIKAAITSL-------STNADGSAGATVQTTRLPGILELPQRRMTIRS-L-LA--QGTMEGNTLEYVRETGFTNAAA 168 (394) Q Consensus 100 ~~~~~~~~~~~~-------~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~-~-~~--~~~~~~~~~~~~~~~~~~~~~~ 168 (394) ...+.++..... ........-....+.+.. +++.+.....+-. + ++ .....|++++||+....+...+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~nt~~l~~k~~~-~LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~DY 79 (319) T protein:vir:97 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHVG-ILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDY 79 (319) T ss_pred CCcccccccceeEeehhhhhccCCCcchHHHHHHHHH-HHHHHHHHhhhhhhcccCcceEeccCcEEEEeeecccccccc Confidence 111111111000 001111122233333333 3333333333221 1 22 2334678999999876432322 Q ss_pred eecCCcccccccc--ceeeEEeeeeeEEEeehhhHHHHHHHH-HH--HHHHHHHHHHHHHHHHHHHHhhccCCCcccccc Q lcl|NC_019933. 169 PVAEGAQKPESSL--RFDLVQTSAKVIAHWMKASRQILSDSA-QL--QSFINARLLRGLEVVEENQLLNGNGTGQNLLGL 243 (394) Q Consensus 169 ~~~eg~~~~~~~~--~~~~i~~~~~k~~~~~~is~e~l~~s~-~~--~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi 243 (394) -.+. .....++ +....++...+.-.+. |..--...+. .+ ...+.+.....++-.+|.-.+..-..+ T Consensus 80 ~R~~--g~~~g~vt~~~~t~tidqdR~~~F~-VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~------ 150 (319) T protein:vir:97 80 KRNA--TNEFDHPKIEETTYFLDQEKYWGRF-VDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARN------ 150 (319) T ss_pred cCCC--CcccCCcccceeEEEeecccccccc-cchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhh------ Confidence 2222 2333344 4444455554433222 1111122222 22 333445555566667776555321111 Q ss_pred ccccccccccccccccchHHHHHHHHHHhhhhcCCCC-eeEeCHHHHHHHHHhhccCCcc-cc-cCcccCCCceeecceE Q lcl|NC_019933. 244 LPQATAFAAPITVANATAVDRLRLALLQAQLAEFPAT-GIVLNPADWAGIELLKDTQGRY-IL-GNPQGTLAPTLWGLPV 320 (394) Q Consensus 244 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~-~~~~~~~~~~~l~~lkd~~G~~-~~-~~~~~~~~~~l~G~pv 320 (394) ++. ..+.+.+....++.|.++...+...+.+.. .++|+|.++..|.+-....... +. .....+..+.|.|++| T Consensus 151 ---a~~-~~~~~~t~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~V 226 (319) T protein:vir:97 151 ---KAK-HLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVI 226 (319) T ss_pred ---ccc-ccccccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceeeeeceeecCeEE Confidence 011 111123345578999999888888766533 5789999988885433211110 11 1223344578999999 Q ss_pred EEcCC--CCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 321 VATQA--MAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 321 ~~~~~--~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) +.++. +..-.+++|-. .+...... --.+++...... +....++...++|..|.++++..++....+.+. T Consensus 227 i~vps~~~k~in~i~~h~-~A~~~~~k-~~~~~~~~p~~~---~~a~~v~gr~y~d~~V~~~k~~~Iy~~~~~~~~ 297 (319) T protein:vir:97 227 VKVPTKLLQGLQAIAVVG-EVLASPIQ-ADLAKTNSNIPG---MFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVA 297 (319) T ss_pred EEecccccccceEEEEcC-Ceeeeeee-eeeeeccCCCcc---ccceeeeeeeeeeeEEeccccceEEEeecCCcc Confidence 97643 33334555543 33332221 112232221111 224678899999999999986555543333333 No 178 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=98.03 E-value=6.6e-06 Score=48.99 Aligned_cols=276 Identities=8% Similarity=-0.056 Sum_probs=128.1 Q ss_pred hhHHHHHHHhhc-------ccccCCcCccccchhhhhHHHhhhhhhhhHHH-h-cc--ccccccCceeEEEEcCcccccc Q lcl|NC_019933. 100 AEINIKAAITSL-------STNADGSAGATVQTTRLPGILELPQRRMTIRS-L-LA--QGTMEGNTLEYVRETGFTNAAA 168 (394) Q Consensus 100 ~~~~~~~~~~~~-------~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~-~-~~--~~~~~~~~~~~~~~~~~~~~~~ 168 (394) ...+.++..... ........-....+.+.. +++.+.....+-. + ++ .....|++++||+....+...+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~nt~~l~~k~~~-~LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~DY 79 (319) T protein:vir:94 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHVG-ILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDY 79 (319) T ss_pred CCcccccccceeEeehhhhhccCCCcchHHHHHHHHH-HHHHHHHHhhhhhhcccCcceEeccCcEEEEeeecccccccc Confidence 111111111000 001111122233333333 3333333333221 1 22 2334678999999876432322 Q ss_pred eecCCcccccccc--ceeeEEeeeeeEEEeehhhHHHHHHHH-HH--HHHHHHHHHHHHHHHHHHHHhhccCCCcccccc Q lcl|NC_019933. 169 PVAEGAQKPESSL--RFDLVQTSAKVIAHWMKASRQILSDSA-QL--QSFINARLLRGLEVVEENQLLNGNGTGQNLLGL 243 (394) Q Consensus 169 ~~~eg~~~~~~~~--~~~~i~~~~~k~~~~~~is~e~l~~s~-~~--~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi 243 (394) -.+. .....++ +....++...+.-.+. |..--...+. .+ ...+.+.....++-.+|.-.+..-..+ T Consensus 80 ~R~~--g~~~g~vt~~~~t~tidqdR~~~F~-VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~------ 150 (319) T protein:vir:94 80 KRNA--TNEFDHPKIEETTYFLDQEKYWGRF-VDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARN------ 150 (319) T ss_pred cCCC--CcccCCcccceeEEEeecccccccc-cchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhh------ Confidence 2222 2333344 4444455554433222 1111122222 22 333445555566667776555321111 Q ss_pred ccccccccccccccccchHHHHHHHHHHhhhhcCCCC-eeEeCHHHHHHHHHhhccCCcc-cc-cCcccCCCceeecceE Q lcl|NC_019933. 244 LPQATAFAAPITVANATAVDRLRLALLQAQLAEFPAT-GIVLNPADWAGIELLKDTQGRY-IL-GNPQGTLAPTLWGLPV 320 (394) Q Consensus 244 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~-~~~~~~~~~~~l~~lkd~~G~~-~~-~~~~~~~~~~l~G~pv 320 (394) ++. ..+.+.+....++.|.++...+...+.+.. .++|+|.++..|.+-....... +. .....+..+.|.|++| T Consensus 151 ---a~~-~~~~~~t~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~V 226 (319) T protein:vir:94 151 ---KAK-HLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVI 226 (319) T ss_pred ---ccc-ccccccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceeeeeceeecCeEE Confidence 011 111123345578999999888888766533 5789999988885433211110 11 1223344578999999 Q ss_pred EEcCC--CCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 321 VATQA--MAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 321 ~~~~~--~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) +.++. +..-.+++|-. .+...... --.+++...... +....++...++|..|.++++..++....+.+. T Consensus 227 i~vps~~~k~in~i~~h~-~A~~~~~k-~~~~~~~~p~~~---~~a~~v~gr~y~d~~V~~~k~~~Iy~~~~~~~~ 297 (319) T protein:vir:94 227 VKVPTKLLQGLQAIAVVG-EVLASPIQ-ADLAKTNSNIPG---MFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVA 297 (319) T ss_pred EEecccccccceEEEEcC-Ceeeeeee-eeeeeccCCCcc---ccceeeeeeeeeeeEEeccccceEEEeecCCcc Confidence 97643 33334555543 33332221 112232221111 224678899999999999986555543333333 No 179 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=98.02 E-value=2.2e-06 Score=51.63 Aligned_cols=261 Identities=10% Similarity=0.024 Sum_probs=125.7 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccc---c----ccCceeEEEEcCcccccceecCCccccccccce Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGT---M----EGNTLEYVRETGFTNAAAPVAEGAQKPESSLRF 183 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~---~----~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~ 183 (394) |. ++. -..||+.|..+.++.+++..++.+++..-. . .|+++++++.......-+-.+.+..+...+..- T Consensus 1 MA-N~l---lT~iP~iia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p~~~~v~d~~~~~~~~~~~~~~~e 76 (423) T protein:vir:35 1 MA-NNL---ESNISQIVLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRPHQFKSERTETGDITGKDKNGLFS 76 (423) T ss_pred Cc-cch---hhhhHHHHHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeCCcceeecccCcCCCCcccccccc Confidence 21 111 124699999999999999999988876522 1 156788887643321111122223333333333 Q ss_pred eeE--EeeeeeEEEeehhhHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccccccc Q lcl|NC_019933. 184 DLV--QTSAKVIAHWMKASRQ-ILSDSAQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVANAT 260 (394) Q Consensus 184 ~~i--~~~~~k~~~~~~is~e-~l~~s~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~ 260 (394) .++ .+.-+|.. .++++++ ...+..++++++...+ .+++..+|..++..--.+.. + .+....+... T Consensus 77 ~~v~l~id~~k~~-a~~v~d~e~~l~i~~~~~~l~~a~-~ala~~vd~~l~~~l~~~a~-----~-----~vgt~~t~~~ 144 (423) T protein:vir:35 77 AKATGKVGKYITV-AVEWTQIEEALKLNQLDQILSPIH-ERMVTDLETELAHFMMNNGA-----L-----SLGSPNTAIK 144 (423) T ss_pred ceeeEEeccceec-cceeCHHHHHhhHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhccc-----c-----ccccccCCcc Confidence 344 44444443 4456664 4445567777776664 77888888877752111110 0 0111112234 Q ss_pred hHHHHHHHHHHhhhhcCCCC--eeEeCHHHHHHHHH----hhccCCcccccCcccCC-CceeecceEEEcCCCCcCceEE Q lcl|NC_019933. 261 AVDRLRLALLQAQLAEFPAT--GIVLNPADWAGIEL----LKDTQGRYILGNPQGTL-APTLWGLPVVATQAMAVGQFLT 333 (394) Q Consensus 261 ~~~~i~~~~~~~~~~~~~~~--~~~~~~~~~~~l~~----lkd~~G~~~~~~~~~~~-~~~l~G~pv~~~~~~p~~~~~~ 333 (394) .++++.++-..+...+.+.. ..+++|..+..|.+ +...++. .-.....++ .+.+.|+.|+.++++|..+.. T Consensus 145 ~~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~-~~~alr~g~i~G~i~GFdv~~Snnvp~~T~g- 222 (423) T protein:vir:35 145 KWADVAQTASFIKDIGIKTGENYAIMDPWSAQRLADAQSGLHAADQL-VRTAWENAQISGNFGGIRALMSNGLASRKQG- 222 (423) T ss_pred hHHHHHHHHHHHHHhcCCcCCCEEEeCHHHHHHHhccccceeccccc-hhHHHhhccceeeecceEEEEcCCCcccccc- Confidence 58889998888887777633 56999999877653 1111110 001111222 368999999999999964321 Q ss_pred eeccceEEE-----------EeecceEE--EEec-ccchhhhcCcEEEEEEEEeccEEecccceE-----------EEEe Q lcl|NC_019933. 334 GAFDAGAQV-----------FDRWAARV--EVAT-ENQDDFIKNMVTILAEERLALAVYRPESFI-----------KGSL 388 (394) Q Consensus 334 gd~~~~~~~-----------~~~~~~~i--~~~~-~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~-----------~l~~ 388 (394) .+...... .......+ .... ..+.....+. ...+.|+...++.--. ...+ T Consensus 223 -t~~~~~~v~~a~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD----~~t~aGv~~v~~~t~~~~~~~~t~~~~~~~V 297 (423) T protein:vir:35 223 -DFDGAITVKTAPNVDYLSVKDSYQFTVALTGATPSKTGFLKAGD----QLKFTSTHWLNQQSKQTLYNGSTAMSFTATV 297 (423) T ss_pred -ccccceeeccccccccccccccccceeeeeeeeeccCCcEEecc----eEEeeeeeeccccccceeecccCCceeEEEE Confidence 11111110 00000001 0000 0111111111 2233343333321111 1112 Q ss_pred c------CCCCC Q lcl|NC_019933. 389 A------AAAGT 394 (394) Q Consensus 389 ~------~a~~~ 394 (394) . +.+.+ T Consensus 298 ~~~~~~~a~g~~ 309 (423) T protein:vir:35 298 LEETNSTASGDV 309 (423) T ss_pred eccccccccCce Confidence 2 11222 No 180 >protein:vir:96792 Length: 315 # NCBI annotation: major capsid protein # Family: family:all:47 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224246;genbank:gi:62362381;genbank:GeneID:3345731 Probab=98.02 E-value=5.7e-06 Score=49.35 Aligned_cols=265 Identities=11% Similarity=0.034 Sum_probs=116.4 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHHhcccc-------ccccCceeEEEEc-CcccccceecCCccccccccc Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQG-------TMEGNTLEYVRET-GFTNAAAPVAEGAQKPESSLR 182 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~-------~~~~~~~~~~~~~-~~~~~~~~~~eg~~~~~~~~~ 182 (394) +.++..++- .+.-+.+....++.+.+...+++..... ++.|+-...+-.. +......-+...+.+..++.+ T Consensus 1 ~~~t~~sdl-~vfn~~~~~a~~e~~~~~~~~Fnaas~Gai~l~~~~~~GDf~~~~ff~i~~~~~~rnv~~~~~~t~~kit 79 (315) T protein:vir:96 1 MATTVNSDL-VIYNDTAQTAYLERNMDNLAVFNENSRAAIGLNSELIEGDLKLRSFYKVGGAIADRDVNSTATVAGTKIA 79 (315) T ss_pred Cceeeecce-eeehhhhhhhHHhhhHHHHHHhhhhcCCcccccccccccccccccccccccchhhcccCCCccccceecc Confidence 333333321 2445566666777766665555543221 2223222222111 100001111112223333322 Q ss_pred -eeeEEeeeeeEEEeeh--hhHHHHHHH---H-HHHHHHHHHHHHHHHHHHHHHHhhccCCCcccccccccccccccccc Q lcl|NC_019933. 183 -FDLVQTSAKVIAHWMK--ASRQILSDS---A-QLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPIT 255 (394) Q Consensus 183 -~~~i~~~~~k~~~~~~--is~e~l~~s---~-~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~ 255 (394) ...+.... ..+.-+ ++...+... | .....|...+..++.+.+=...+.+.... +.. ........ T Consensus 80 ~~~dvaVk~--~~~~~~~~~~~~~~a~~g~dp~~~~~~i~~~~~~~~l~~~l~~~l~~~~aa------i~~-~t~~~~~~ 150 (315) T protein:vir:96 80 ADEMVSVKV--PWKYGPYETTEEAFKRRARSPEEFSMLIGQDMADATMAGWIGYALNALQGA------IGS-NAGMNVSG 150 (315) T ss_pred cccceeEEE--eecCCchhccHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhh------hcc-cccccccc Confidence 22222222 222233 344444422 2 23333444444444443333323221100 000 00011112 Q ss_pred ccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccccCccc---CCCceeecceEEEcCCCCcCceE Q lcl|NC_019933. 256 VANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQG---TLAPTLWGLPVVATQAMAVGQFL 332 (394) Q Consensus 256 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~---~~~~~l~G~pv~~~~~~p~~~~~ 332 (394) .....+...+.++..++-.....-+.|+||..++..|.+ +. --..++..... +.++..+|+||++++.||...++ T Consensus 151 ~~a~~~~~~l~dA~~klGD~~~~l~~~vMHS~v~~~L~~-q~-L~~~~~~~~~~~~~~~~~~~lGkrViVdD~~P~~~~~ 228 (315) T protein:vir:96 151 ELATEGKKVLTKGLRTMGDKASSIAIWVMDSTSYFDIVD-EA-IDNKLYEEAGVVVYGGTPGTLGKPVLVTDQCPATKIF 228 (315) T ss_pred cccccCHHHHHHHHHHhcccccCeeEEEEchHHHHHHHH-hh-hhhhcccccceeEecCcCcccccEEEEECCCCcceee Confidence 334456777888888876666677799999999999876 31 11233332221 12234559999999999986543 Q ss_pred EeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEecc-EEecccceEEEEecCCCCC Q lcl|NC_019933. 333 TGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLAL-AVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 333 ~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~-~v~~~~a~~~l~~~~a~~~ 394 (394) |--..++.+.....+.....+ ..++..+....|..| -+++|.+|..-+.+...|| T Consensus 229 -gl~~GAi~~~~~~~~~~~~~~------~~g~e~l~~~~r~e~tf~l~p~G~sw~~~~~~sPt 284 (315) T protein:vir:96 229 -GLVAGAVMITESQAPGMRSYQ------IDDQENLAIGFRAEGTANVEVLGYKWKTKTNVNPA 284 (315) T ss_pred -eeecceeeecCCCcccccccc------CCCcceeEEEEeeeeEeeeeeeeEEeecCCCcCCC Confidence 322233333322221111111 122334444444444 3567777766544445566 No 181 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=97.93 E-value=7.8e-06 Score=48.59 Aligned_cols=297 Identities=6% Similarity=-0.046 Sum_probs=128.7 Q ss_pred hhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHH-hcc--ccccccC Q lcl|NC_019933. 77 IGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRS-LLA--QGTMEGN 153 (394) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~-~~~--~~~~~~~ 153 (394) +...+....-..+ .......+.....+.... ..+. .-+....-+.+...+-+.+...+.-.. +++ .....|+ T Consensus 1 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~---~~~~-~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~~g~ 75 (329) T protein:vir:10 1 MDGIFITGVKTMN-KEIKNATGKLKLNLQHFA---NKSV-EPGDTLLKNKHVGILEKVTAANSYSAPAVISNDAIFMQGR 75 (329) T ss_pred CCceEEechhhhh-hhhhcccceeEEehhhhc---CCcc-CCchhHHHHHHHHHHHHHHHhhceeeeeecccceeeccCc Confidence 1111110000000 000000011111111110 0000 111122233333333333322221111 122 2345678 Q ss_pred ceeEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHHH-H--HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 154 TLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSA-Q--LQSFINARLLRGLEVVEENQL 230 (394) Q Consensus 154 ~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~-~--~~~~i~~~la~a~~~~~d~a~ 230 (394) +++||+....+...+-.+.|-....-+.++...++...+.-.+. |.+--...+. . +.....+.....++-.+|... T Consensus 76 tVkIp~i~~~gl~DY~R~~g~~~g~vt~~~~t~tidqdR~~~F~-VD~~D~dEtn~~l~a~~i~~~~~~~~v~pEiDay~ 154 (329) T protein:vir:10 76 SFTVIKGDVTELKDYKRNATNEFDHPQIQETTYFLDQEKYWGRF-VDALDRRDTEGNIDINYVVAKQASEVVAPYLDNLR 154 (329) T ss_pred EEEEeeecccccccccCCCCccccccccceeEEEeecccceeee-cchhhHhhhhhhhhHHHHHHHHHHHHhhhHHHHHH Confidence 99999986533222222222222222334444555555533322 2111122222 2 234445556666677778655 Q ss_pred hhccCCCccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCC-eeEeCHHHHHHHHHhhccCCc--ccccCc Q lcl|NC_019933. 231 LNGNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPAT-GIVLNPADWAGIELLKDTQGR--YILGNP 307 (394) Q Consensus 231 l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~-~~~~~~~~~~~l~~lkd~~G~--~~~~~~ 307 (394) +.---.. ++. ..+.+.+....++.|.++...+.....+.. .++++|..+..|.+-...... .--... T Consensus 155 ~skla~~---------a~~-~~~~~~t~~nay~~i~~a~~~Lde~~vp~~Rvl~VtP~~~~~Lk~~~~f~~~~~~~~~~~ 224 (329) T protein:vir:10 155 FATLARN---------KAK-HLTVGSGADAQYDAVLDVSVELDEIGAGASRILFVTPKFYKGIKKFVIELPQGDNRQQVL 224 (329) T ss_pred HHHHHhh---------ccc-ccccccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccce Confidence 5321110 011 111122344578889988888887655433 678999998888652211111 111112 Q ss_pred ccCCCceeecceEEEcCC--CCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEE Q lcl|NC_019933. 308 QGTLAPTLWGLPVVATQA--MAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIK 385 (394) Q Consensus 308 ~~~~~~~l~G~pv~~~~~--~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~ 385 (394) ..+..+.|.|++|+.++. ++.-..++|-.. +...... --.+++..... .++...++...++|..|.++++..+ T Consensus 225 ~~g~Vg~idG~~Ii~vps~~~k~in~ii~~~~-A~~~~~K-~~~~~~~~p~~---~~~a~~v~gr~yyd~~V~~~k~~~I 299 (329) T protein:vir:10 225 GKGVQGELDGFTIVKVPSKMLQGVEAMAVIGE-VMASPIQ-ANEAKLNSNVP---GMFGTLAEQMLYTGAFVPEHLQKYI 299 (329) T ss_pred eeeeeeeecCeEEEEecCCcccceeEEEEcCC-ceeeeee-eeeeeeeCCCC---ccchheeeeeeeeeeEEEccccCEE Confidence 234456899999997654 333344555433 3322222 11233322111 1234688899999999999987665 Q ss_pred EEecCCCCC Q lcl|NC_019933. 386 GSLAAAAGT 394 (394) Q Consensus 386 l~~~~a~~~ 394 (394) +.....+.+ T Consensus 300 ~~~~~~a~~ 308 (329) T protein:vir:10 300 FTIGGKEVE 308 (329) T ss_pred EEecccCcc Confidence 554333333 No 182 >protein:vir:100331 Length: 342 # NCBI annotation: major capsid protein N # Family: family:all:201 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655472;genbank:gi:109289940;genbank:GeneID:4157374 Probab=97.82 E-value=9.2e-06 Score=48.19 Aligned_cols=299 Identities=13% Similarity=0.019 Sum_probs=151.3 Q ss_pred HHHHhhhhhhhhHHHHHHHhhcccccCCcC-ccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEcCcccccc Q lcl|NC_019933. 90 MAESGGQRGRAEINIKAAITSLSTNADGSA-GATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTNAAA 168 (394) Q Consensus 90 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 168 (394) +...-+.....-...-+..+.........+ -+.|-+.....+.+.+.+.+.++++++++++.-......-....++.+. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iag 80 (342) T protein:vir:10 1 MKDLTLEKYNAYLARQAELNNLPFNALATGIKFTVQPSVQQKLYEKVRESSDFLKSISFVFVDEQTGETLGLDSAHTVAS 80 (342) T ss_pred CChHHHHHHHHHHHHHHHHhCCChhHccccceeecChHHHHHHHHHHHHHHHHhccCcccccccceeeEEecccCccccc Confidence 000000000000001111222221111211 3677888888999999999999999999998754333333222222222 Q ss_pred eec---CCccccccccceeeEEeeeeeEEEeehhhHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHhhccCCCc---- Q lcl|NC_019933. 169 PVA---EGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDS---AQLQSFINARLLRGLEVVEENQLLNGNGTGQ---- 238 (394) Q Consensus 169 ~~~---eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s---~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~---- 238 (394) -+. .+...|..-..++.-.+...+.-.-..|+.+.|..+ +++...+++.+.++++.-+=..-|+|..-.. T Consensus 81 rtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~ 160 (342) T protein:vir:10 81 TTDTSGDGERKTTSIAKLVKQTYHCQQINFDTHINYKQLDMWAKFPDFQQKVANVAAKQRKRDLIMIGFNGTSRAATSDR 160 (342) T ss_pred ccccCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeccCCCh Confidence 221 122333333566777788888888888999998866 4789999988888876554445555532211 Q ss_pred --ccc------cccc------------cc-ccccccccccc-cchHH-HHHHHHHH-hhhhcCC--CCeeEeCHHHHH-H Q lcl|NC_019933. 239 --NLL------GLLP------------QA-TAFAAPITVAN-ATAVD-RLRLALLQ-AQLAEFP--ATGIVLNPADWA-G 291 (394) Q Consensus 239 --~~~------Gi~~------------~~-~~~~~~~~~~~-~~~~~-~i~~~~~~-~~~~~~~--~~~~~~~~~~~~-~ 291 (394) +|. |.+. .. ....+..+..+ =..+| .+.++... +++.+.. .-+++|.+.... + T Consensus 161 ~~nPllqDVN~GWlQ~~Re~ap~rv~~~~~~~~~i~iG~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLladk 240 (342) T protein:vir:10 161 NSNPLLQDVAKGWLQKMREDAKERVMNGESTDNQVLVGKGQEYANLDALVMDATEELIDEWHRDDTDLVVITGRKLLADK 240 (342) T ss_pred hhCcCccccchHHHHHHHhhhhhhhcccceeccceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHH Confidence 111 1111 10 01111111111 11233 33555554 4555543 347888888855 2 Q ss_pred HHHhhccCCcccccCcc--cCCCceeecceEEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEE Q lcl|NC_019933. 292 IELLKDTQGRYILGNPQ--GTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILA 369 (394) Q Consensus 292 l~~lkd~~G~~~~~~~~--~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 369 (394) ...+-.....|--.... -....++.|+|.+..+.+|++.+++--+++....+.....+-.+...+. ++.+.-+- T Consensus 241 ~~~l~n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p~----r~rie~y~ 316 (342) T protein:vir:10 241 YFPIVNQQNAPTEELAADIVISQKRIGGLKAVRVPFFPANAILITKLENLAIYVQEGTTRKHIENVPK----KDRIETYE 316 (342) T ss_pred HHHHHhcCCChHHHHHHHHHHhhhhhcCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccc----cccccchh Confidence 22332222221110000 1123579999999999999999999888876555554444433332221 22222222 Q ss_pred EEEeccEEecccceEEEE---ecCCC Q lcl|NC_019933. 370 EERLALAVYRPESFIKGS---LAAAA 392 (394) Q Consensus 370 ~~~~d~~v~~~~a~~~l~---~~~a~ 392 (394) ..--++.|.++.+++.+. ++-+- T Consensus 317 s~Ne~YvVEd~~~~a~iE~i~i~~~~ 342 (342) T protein:vir:10 317 SENIDYVVEDYGCAALIENITLKDKE 342 (342) T ss_pred hhccceeeeccccEEEeecceecCCC Confidence 233445555555555553 22222 No 183 >protein:vir:1829 Length: 355 # NCBI annotation: major capsid protein # Family: family:all:201 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052253;genbank:gi:9634060;genbank:GeneID:1262428 Probab=97.80 E-value=1.3e-05 Score=47.41 Aligned_cols=300 Identities=12% Similarity=0.019 Sum_probs=148.7 Q ss_pred chhhhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccC Q lcl|NC_019933. 74 HISIGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGN 153 (394) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~ 153 (394) .+... ...+..+ ...-+........ ...-.+.+-+.....+.+.+.+.+.++++++++++.-. T Consensus 1 M~~~t-----r~~~~~y-----------~~~~A~~ngv~~~-~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~ 63 (355) T protein:vir:18 1 MRQET-----RFKFNAY-----------LTQLAKLNGISVD-DVSKKFTVEPSVTQTLMNTVQASSAFLQMINILPVAEM 63 (355) T ss_pred CChHH-----HHHHHHH-----------HHHHHHHhCCChh-HccceeccCHHHHHHHHHHHHHHHHHhhcCceeccccc Confidence 00000 0000000 0001111111111 11235677788888999999999999999999988754 Q ss_pred ceeEEEEcCcccccceec---CCccccccccceeeEEeeeeeEEEeehhhHHHHHHH---HHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 154 TLEYVRETGFTNAAAPVA---EGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDS---AQLQSFINARLLRGLEVVEE 227 (394) Q Consensus 154 ~~~~~~~~~~~~~~~~~~---eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s---~~~~~~i~~~la~a~~~~~d 227 (394) .....-....++.+.-+. .....|.....++.-.+..++.-....|+.+.|..+ +++...+++.+.++++.-+= T Consensus 64 ~Ge~i~lgv~g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i 143 (355) T protein:vir:18 64 KGEKIGVGVTGTIASTTDTSGDKERQTADFTALESNKYECNQINFDFHLTYKRLDLWARFQDFQRRIRDAIVQRQALDFI 143 (355) T ss_pred eeeEEeeccCcceeeccccCCCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchh Confidence 433333222222222221 112334444556777788888888888999998866 47899999998888775555 Q ss_pred HHHhhccCCC------cccc------cccc------------ccc-------cccccccccc-cchHH-HHHHHHHH-hh Q lcl|NC_019933. 228 NQLLNGNGTG------QNLL------GLLP------------QAT-------AFAAPITVAN-ATAVD-RLRLALLQ-AQ 273 (394) Q Consensus 228 ~a~l~g~g~~------~~~~------Gi~~------------~~~-------~~~~~~~~~~-~~~~~-~i~~~~~~-~~ 273 (394) ..-|+|..-. .+|. |.+. ... ...+..+..+ =..+| .+.++... ++ T Consensus 144 ~IGfNG~s~A~~Td~~~nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~d~~~~lI~ 223 (355) T protein:vir:18 144 MAGFNGTTRADTSDRVKNPMLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENLDALVMDGTNTLID 223 (355) T ss_pred hhcccceeeeccCChhhCcCccccchhHHHHHHhcchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHhccCC Confidence 5555664311 1221 2211 000 0001111111 11233 33555544 35 Q ss_pred hhcCC--CCeeEeCHHHHH-HHHHhhccCCcccccCccc--CCCceeecceEEEcCCCCcCceEEeeccceEEEEeecce Q lcl|NC_019933. 274 LAEFP--ATGIVLNPADWA-GIELLKDTQGRYILGNPQG--TLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAA 348 (394) Q Consensus 274 ~~~~~--~~~~~~~~~~~~-~l~~lkd~~G~~~~~~~~~--~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~ 348 (394) +.+.. .-+++|.+.... +...|-+..+.|--..... ....++.|+|.+..+.+|++.+++--+++....+..... T Consensus 224 ~~~~~d~dLVvivG~dLla~k~~~l~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~ 303 (355) T protein:vir:18 224 EIYQDDPKLVAIVGRKLLADKYFPLVNKQQENTESLAADIIISQKRIGNLPAVRVPYFPANAVFVTTLENLSIYFMDESH 303 (355) T ss_pred hHHhcCCCEEEEEchhhhHHHHhHHhhccCChHHHHHHHHHHHHHhhCCceeEEccccCCCceEEeeccccEEEEecCcE Confidence 54443 337888888754 3333433333222110001 113579999999999999999999888875555544444 Q ss_pred EEEEecccchhhhcCcEEEEEEEEeccEEecccceEEE---EecCCC------CC Q lcl|NC_019933. 349 RVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKG---SLAAAA------GT 394 (394) Q Consensus 349 ~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l---~~~~a~------~~ 394 (394) +-.+...+. ++.+.-+-..--++.|.++.+++.+ +++.+. |. T Consensus 304 RR~~~d~p~----r~rie~y~s~Ne~YvVEd~~~~a~ieni~~~~~~~~~~~~~g 354 (355) T protein:vir:18 304 RRSIDENPK----KDRVENYESMNIDYVVEAYAAGCLLENITLGDFTAPAAPEGG 354 (355) T ss_pred EEEEEeccc----cccccchhhhcceeeeeccccEEEEeeeeecCCCCcccccCC Confidence 333322221 1111111222233334444443333 222211 11 No 184 >protein:vir:98566 Length: 355 # NCBI annotation: gp5 # Family: family:all:201 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958060;genbank:gi:41057357;genbank:GeneID:2744237 Probab=97.79 E-value=1.4e-05 Score=47.17 Aligned_cols=302 Identities=13% Similarity=0.040 Sum_probs=148.0 Q ss_pred hhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEc Q lcl|NC_019933. 82 VNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRET 161 (394) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 161 (394) ... .-+.....-...-+..+..... ...-.+.|.+.....+.+.+.+.+.++++++++++.-......-.. T Consensus 1 M~~--------~tr~~~~~y~~~~A~~ngv~~~-~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg 71 (355) T protein:vir:98 1 MRP--------ETRFKFNAYLTRVAELNNISTD-DVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVG 71 (355) T ss_pred CCh--------HHHHHHHHHHHHHHHHhCCChh-HccceeecCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEeeec Confidence 000 0000000000011111221111 1123466777888899999999999999999998875443333322 Q ss_pred CcccccceecC---CccccccccceeeEEeeeeeEEEeehhhHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_019933. 162 GFTNAAAPVAE---GAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDS---AQLQSFINARLLRGLEVVEENQLLNGNG 235 (394) Q Consensus 162 ~~~~~~~~~~e---g~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s---~~~~~~i~~~la~a~~~~~d~a~l~g~g 235 (394) ..++.+.-+.- ....|..-..++.-.+..++.-....|+.+.|..+ +++...+++.+.++++.-+=..-|+|.. T Consensus 72 v~g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s 151 (355) T protein:vir:98 72 VTGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTT 151 (355) T ss_pred cCccccccccCCCCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhccccee Confidence 11222222211 12233344556777788888888888999998866 4789999999988877555555556643 Q ss_pred CC------cccc------ccc------------cccc-------cccccccccc-cchHH-HHHHHHHH-hhhhcCC--C Q lcl|NC_019933. 236 TG------QNLL------GLL------------PQAT-------AFAAPITVAN-ATAVD-RLRLALLQ-AQLAEFP--A 279 (394) Q Consensus 236 ~~------~~~~------Gi~------------~~~~-------~~~~~~~~~~-~~~~~-~i~~~~~~-~~~~~~~--~ 279 (394) -. .+|. |.+ +... ...+..+..+ =..+| .+.++... +++.+.. . T Consensus 152 ~A~~Td~~~nPllqDVNkGWlQ~~Re~ap~~v~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~D~~~~lI~~~~~~d~d 231 (355) T protein:vir:98 152 RADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPN 231 (355) T ss_pred eeccCChhhCcCccccchhHHHHHHhcchhhhhhhhcccCccccccceeeCCCCCcccHHHHHHHHHhccCChHHhcCCC Confidence 11 1221 221 1000 0001111111 11233 33555554 3554443 3 Q ss_pred CeeEeCHHHHH-HHHHhhccCCcccccC--cccCCCceeecceEEEcCCCCcCceEEeeccceEEEEeecceEEEEeccc Q lcl|NC_019933. 280 TGIVLNPADWA-GIELLKDTQGRYILGN--PQGTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATEN 356 (394) Q Consensus 280 ~~~~~~~~~~~-~l~~lkd~~G~~~~~~--~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~ 356 (394) -+++|.+.... +...|-+....|--.. ..-....++.|+|.+..+.+|++.+++--+++....+.....+-.+...+ T Consensus 232 LVvivG~dLla~k~~~l~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p 311 (355) T protein:vir:98 232 LVAIVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDENP 311 (355) T ss_pred EEEEEchhhhHHHhhhHhhccCCcHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEecc Confidence 37888888754 3333433322221100 00112357999999999999999999988887555554444433332222 Q ss_pred c----hhhhcCcEEEEEEEEeccEEecccceEEEEec-CCCC-C Q lcl|NC_019933. 357 Q----DDFIKNMVTILAEERLALAVYRPESFIKGSLA-AAAG-T 394 (394) Q Consensus 357 ~----~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~-~a~~-~ 394 (394) . ..|..-..+|.++-+--+...+ .+.....+ ++++ + T Consensus 312 ~r~rie~y~s~Ne~YvVEd~~~~a~ie--nI~~~~~~~~~~~~~ 353 (355) T protein:vir:98 312 KKDRVENYESMNIDYVVEVYAAGCLLE--NITLGDFTAPAAPES 353 (355) T ss_pred ccccccchhhhcceeeeeccccEEEee--ceeeeCCCCCccccc Confidence 1 1122222344443333333332 22222111 1111 1 No 185 >protein:vir:104011 Length: 337 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293748;genbank:gi:72537718;genbank:GeneID:3608142 Probab=97.79 E-value=1.9e-05 Score=46.46 Aligned_cols=294 Identities=13% Similarity=-0.011 Sum_probs=152.7 Q ss_pred hhhhhhhhHH-HHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEcCccccccee-- Q lcl|NC_019933. 94 GGQRGRAEIN-IKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTNAAAPV-- 170 (394) Q Consensus 94 ~~~~~~~~~~-~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 170 (394) +....+.... ....+..........-.+.+.|.....+.+.+.+.+.++++++++++.-......-....++.+.-+ T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrt~t 80 (337) T protein:vir:10 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) T ss_pred CChHHHHHHHHHHHHHHHhcChhhhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeeccCcceeeeecC Confidence 0000000000 0000111111111223456777888889999999999999999998875433333222111222211 Q ss_pred cCCccccccccceeeEEeeeeeEEEeehhhHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHhhccCCC------cccc Q lcl|NC_019933. 171 AEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDS---AQLQSFINARLLRGLEVVEENQLLNGNGTG------QNLL 241 (394) Q Consensus 171 ~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s---~~~~~~i~~~la~a~~~~~d~a~l~g~g~~------~~~~ 241 (394) +.+...|..-..++.-.+...+.-....|+.+.|..+ +++...+++.+.++++.-+=..-|+|..-. .+|. T Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A~~Td~~~nPl 160 (337) T protein:vir:10 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) T ss_pred CCCccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhCcC Confidence 2233333334566777788888888888999999866 489999999999887765555555664311 1121 Q ss_pred ------cc------------ccccccc--cccccccc-cchHHH-HHHHHHH-hhhhcCC--CCeeEeCHHHHH-HHHHh Q lcl|NC_019933. 242 ------GL------------LPQATAF--AAPITVAN-ATAVDR-LRLALLQ-AQLAEFP--ATGIVLNPADWA-GIELL 295 (394) Q Consensus 242 ------Gi------------~~~~~~~--~~~~~~~~-~~~~~~-i~~~~~~-~~~~~~~--~~~~~~~~~~~~-~l~~l 295 (394) |. ++..... .+..+..+ =..+|. +.++... +++.+.. .-+++|.+.... +-..| T Consensus 161 lqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~i~iG~~gdy~nLDalV~D~~~~lI~~~~~~d~~LVvivG~dLladk~~~l 240 (337) T protein:vir:10 161 LQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPI 240 (337) T ss_pred ccccchhHHHHHHhcchhhhhccccccCcceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhHH Confidence 11 1111000 11111111 113333 4555544 4555544 347888888755 22233 Q ss_pred hccCCcccccCcc--cCCCceeecceEEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEe Q lcl|NC_019933. 296 KDTQGRYILGNPQ--GTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERL 373 (394) Q Consensus 296 kd~~G~~~~~~~~--~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 373 (394) -+..+.|--.... -....++.|+|.+..+.+|++.+++--+++....+.....+-.+...+. ++.+.-+-..-- T Consensus 241 ~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~----r~rie~y~s~Ne 316 (337) T protein:vir:10 241 VNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPE----RDRIENYESSND 316 (337) T ss_pred hccCCCcHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEEccc----cccccchhhccc Confidence 3222222110000 0113579999999999999999999888876555554444433322221 222222233344 Q ss_pred ccEEecccceEEEE---ecCC Q lcl|NC_019933. 374 ALAVYRPESFIKGS---LAAA 391 (394) Q Consensus 374 d~~v~~~~a~~~l~---~~~a 391 (394) ++.|.++.+++.+. +..+ T Consensus 317 ~YvVEd~~~~a~ienI~~~~a 337 (337) T protein:vir:10 317 AYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred eeeeeccccEEEEeceeecCC Confidence 55666666666553 4444 No 186 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=97.77 E-value=1.9e-05 Score=46.42 Aligned_cols=263 Identities=11% Similarity=0.035 Sum_probs=121.2 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccc-----c--ccCceeEEEEcCcccccceecCCccccccccce Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGT-----M--EGNTLEYVRETGFTNAAAPVAEGAQKPESSLRF 183 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~-----~--~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~ 183 (394) |. ++. -..+|+.|..++++.+++..++.+++..-. . .|+++++++........+-...+..+...+..- T Consensus 1 Ma-N~l---lT~~p~iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~~~~~~d~~~~~~~~~~~~dl~e 76 (423) T protein:vir:10 1 MP-NNL---DSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLIS 76 (423) T ss_pred Cc-cch---hhhhHHHHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCCceeeeccCCccccccccCcccc Confidence 11 110 113699999999999999999988876521 1 366788877543221111112222333333333 Q ss_pred e--eEEeeeeeEEEeehhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc-CCCcccccccccccccccccccccc Q lcl|NC_019933. 184 D--LVQTSAKVIAHWMKASR-QILSDSAQLQSFINARLLRGLEVVEENQLLNGN-GTGQNLLGLLPQATAFAAPITVANA 259 (394) Q Consensus 184 ~--~i~~~~~k~~~~~~is~-e~l~~s~~~~~~i~~~la~a~~~~~d~a~l~g~-g~~~~~~Gi~~~~~~~~~~~~~~~~ 259 (394) . .+.+.-+|...+ .+++ |+..+..++++++... .++++..+|..++.-. +...+.. ....+.. T Consensus 77 ~~v~l~id~~k~va~-~v~d~E~~~~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~~~~~~-----------gt~~t~~ 143 (423) T protein:vir:10 77 GKATGRVGNYITVAV-EYQQLEEAIKLNQLEEILAPV-RQRIVTDLETELAHFMMNNGALSL-----------GSPNTPI 143 (423) T ss_pred ceeEEEeeceeeeee-eechHHHhcChhhHHHHHHHH-HHHHHHHHHHHHHHHHhhcccccc-----------ccCCccc Confidence 3 355555555444 3544 4544445677666555 6889999998877531 1111110 0011122 Q ss_pred chHHHHHHHHHHhhhhcCCC--CeeEeCHHHHHHHHHhhc--cCCccccc-CcccCC-CceeecceEEEcCCCCcCceE- Q lcl|NC_019933. 260 TAVDRLRLALLQAQLAEFPA--TGIVLNPADWAGIELLKD--TQGRYILG-NPQGTL-APTLWGLPVVATQAMAVGQFL- 332 (394) Q Consensus 260 ~~~~~i~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~lkd--~~G~~~~~-~~~~~~-~~~l~G~pv~~~~~~p~~~~~- 332 (394) ..++++.++-..+...+.+. -..+++|..+..|.+-.. ........ ..-.++ .+++.|+.++.++++|..+.. T Consensus 144 ~a~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnip~~T~gt 223 (423) T protein:vir:10 144 TKWSDVAQTASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIRALMSNGLASRTQGA 223 (423) T ss_pred chHHHHHHHHHHHHhccCCcCCCEEEeChHHHHHHhccccceecccccchhhhhhccceeeecceEEEEeCCCccccccc Confidence 35788888888887776653 367999999877753110 11111101 111122 258999999999999964321 Q ss_pred Eeec---cceEEE-----EeecceEEEEe--ccc-chhhhcCcEEEEEEEEeccEEe--------------cccceEE-- Q lcl|NC_019933. 333 TGAF---DAGAQV-----FDRWAARVEVA--TEN-QDDFIKNMVTILAEERLALAVY--------------RPESFIK-- 385 (394) Q Consensus 333 ~gd~---~~~~~~-----~~~~~~~i~~~--~~~-~~~~~~~~~~~~~~~~~d~~v~--------------~~~a~~~-- 385 (394) ++.. ..+..+ ....+..+.+. .-. +.....+. ...+.|+... ...-|.+ T Consensus 224 ~~~t~~~~~~~~v~~~a~~~a~~~~~~~~~~~~~~~~~l~~GD----~~t~aGv~~v~~~tk~~~~~~~t~~~~~~~v~a 299 (423) T protein:vir:10 224 FGGTLTVKTQPTVTYNAVKDSYQFTVTLTGATASVTGFLKAGD----QVKFTNTYWLQQQTKQALYNGATPISFTATVTA 299 (423) T ss_pred cccceeeeecceeccccccccceeeeeeeeccccccCceeecc----eEEecceeeecccccccccccccCcceEEEEEe Confidence 1100 000000 00111111110 000 00000000 0111111111 0111111 Q ss_pred -----------------------------EEecCCCCC Q lcl|NC_019933. 386 -----------------------------GSLAAAAGT 394 (394) Q Consensus 386 -----------------------------l~~~~a~~~ 394 (394) ++-..++++ T Consensus 300 ~~~~~~~g~~tv~i~p~~i~~~~~~~~~~v~a~~a~~~ 337 (423) T protein:vir:10 300 DANSDSGGDVTVTLSGVPIYDTTNPQYNSVSRQVEAGD 337 (423) T ss_pred eeeeccCCceeeeccCccccccCCcccccccccccCCc Confidence 111111222 No 187 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=97.77 E-value=2e-05 Score=46.29 Aligned_cols=260 Identities=12% Similarity=0.064 Sum_probs=120.7 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccc-----c--ccCceeEEEEcCcccccceecCCccccccccce Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGT-----M--EGNTLEYVRETGFTNAAAPVAEGAQKPESSLRF 183 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~-----~--~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~ 183 (394) |. ++. -..+|+.|..+.++.+++..++.+++.... . .|+++++++........+-...+..+...+..- T Consensus 1 Ma-N~l---lT~ip~iia~~al~~l~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~~~~~~~~~~~~~~~~~~~~l~e 76 (423) T protein:vir:17 1 MP-NNL---DSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLIS 76 (423) T ss_pred Cc-cch---hhhhHHHHHHHHHHHHHhhcccchhhcccCCcchhhcccCCEEEEeeCCcceeecccCcccCCcccCcccc Confidence 21 110 123699999999999999999988876522 1 366888886443211111111222233333332 Q ss_pred e--eEEeeeeeEEEeehhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc-CCCcccccccccccccccccccccc Q lcl|NC_019933. 184 D--LVQTSAKVIAHWMKASR-QILSDSAQLQSFINARLLRGLEVVEENQLLNGN-GTGQNLLGLLPQATAFAAPITVANA 259 (394) Q Consensus 184 ~--~i~~~~~k~~~~~~is~-e~l~~s~~~~~~i~~~la~a~~~~~d~a~l~g~-g~~~~~~Gi~~~~~~~~~~~~~~~~ 259 (394) . .+.+.-+|...+ .+++ |...+..++++++... .++++..+|..++.-. +...+..| ...+.. T Consensus 77 ~~v~l~id~~k~va~-~v~d~E~~~~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~a~~~~g-----------t~~t~~ 143 (423) T protein:vir:17 77 GKATGRVGNYITVAV-EYQQLEEAIKLNQLEEILAPV-RQRIVTDLETELAHFMMNNGALSLG-----------SPNTPI 143 (423) T ss_pred ceeEEEeeceeeeee-eecHHHHhcChhHHHHHHHHH-HHHHHHHHHHHHHHHHhhccccccc-----------cCCccc Confidence 3 355555555544 4555 4444555677665555 6889999998777431 11111100 011122 Q ss_pred chHHHHHHHHHHhhhhcCCC--CeeEeCHHHHHHHHHh----hccC--CcccccCcccCC-CceeecceEEEcCCCCcCc Q lcl|NC_019933. 260 TAVDRLRLALLQAQLAEFPA--TGIVLNPADWAGIELL----KDTQ--GRYILGNPQGTL-APTLWGLPVVATQAMAVGQ 330 (394) Q Consensus 260 ~~~~~i~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~l----kd~~--G~~~~~~~~~~~-~~~l~G~pv~~~~~~p~~~ 330 (394) ..++++.++-..+...+.+. -..+++|..+..|.+- ...+ +.--+ -.++ .+++.|+.|+.++++|..+ T Consensus 144 ~a~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~al---r~g~i~G~i~GFdvy~Snnip~~T 220 (423) T protein:vir:17 144 TKWSDVAQTASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAW---ENAQIPTNFGGIRALMSNGLASRT 220 (423) T ss_pred ccHHHHHHHHHHHHhccCCcCCCEEEeChHHHHHHhccccceecccccchHHH---hhccceeeecceEEEEeCCCcccc Confidence 35888888888888777653 3679999998777531 1111 11111 1122 2589999999999999643 Q ss_pred eE-Eeec---cceEE-----EEee--cceEEEEeccc-chhhhcCcEEEEEEEEeccEEecc--------------cceE Q lcl|NC_019933. 331 FL-TGAF---DAGAQ-----VFDR--WAARVEVATEN-QDDFIKNMVTILAEERLALAVYRP--------------ESFI 384 (394) Q Consensus 331 ~~-~gd~---~~~~~-----~~~~--~~~~i~~~~~~-~~~~~~~~~~~~~~~~~d~~v~~~--------------~a~~ 384 (394) .. ++.. ..+.. .... ..+.+...... +.....+. ...+.|+...++ .-|. T Consensus 221 ~gt~~~t~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD----~~t~aGv~~v~~~tk~v~~~~~t~~~~~~~ 296 (423) T protein:vir:17 221 QGAFGGTLTVKTQPTVTYNAVKDSYQFTVTLTGATTSVTGFLKAGD----QVKFTNTYWLQQQTKQALYNGATPISFTAT 296 (423) T ss_pred ccceeceeeecccccccccccccccceeeeeeeeeeeccCceeecc----eEEecceeeecccccccccccccccceEEE Confidence 21 1110 00000 0000 00001100000 00001111 111222222111 1111 Q ss_pred EE-------------Ee------------------cCCCCC Q lcl|NC_019933. 385 KG-------------SL------------------AAAAGT 394 (394) Q Consensus 385 ~l-------------~~------------------~~a~~~ 394 (394) +. ++ ..++|+ T Consensus 297 v~~~~~~~a~~~~tv~i~p~~i~~~~~~~~~~v~a~~a~~~ 337 (423) T protein:vir:17 297 VTADANSDSSGDVTVTLSGVPIYDTTNPQYNSVSRQVAAGD 337 (423) T ss_pred EEecccccccCceEEEecCccccccCCcccccceecccCCc Confidence 11 10 011111 No 188 >protein:vir:79171 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111033;genbank:gi:134288740;genbank:GeneID:4960690 Probab=97.76 E-value=2.1e-05 Score=46.20 Aligned_cols=294 Identities=13% Similarity=-0.013 Sum_probs=152.2 Q ss_pred hhhhhhhhHH-HHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEcCccccccee-- Q lcl|NC_019933. 94 GGQRGRAEIN-IKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTNAAAPV-- 170 (394) Q Consensus 94 ~~~~~~~~~~-~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 170 (394) +....+.... ....+..........-.+.+.|.....+.+.+.+.+.++++++++++.-......-....++.+.-+ T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrt~t 80 (337) T protein:vir:79 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) T ss_pred CChHHHHHHHHHHHHHHHhcChhhhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeeccCcceeeeecC Confidence 0000000000 0000001111111122456777888889999999999999999998875433333222111222211 Q ss_pred cCCccccccccceeeEEeeeeeEEEeehhhHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHhhccCCC------cccc Q lcl|NC_019933. 171 AEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDS---AQLQSFINARLLRGLEVVEENQLLNGNGTG------QNLL 241 (394) Q Consensus 171 ~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s---~~~~~~i~~~la~a~~~~~d~a~l~g~g~~------~~~~ 241 (394) +.+...|..-..++.-.+...+.-....|+.+.|..+ +++...+++.+.++++.-+=..-|+|..-. .+|. T Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A~~Td~~~nPl 160 (337) T protein:vir:79 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDAWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) T ss_pred CCCccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhCcC Confidence 2233333334566777788888888888999999866 489999999999887765555555664311 1121 Q ss_pred ------cc------------ccccccc--cccccccc-cchHHH-HHHHHHH-hhhhcCC--CCeeEeCHHHHH-HHHHh Q lcl|NC_019933. 242 ------GL------------LPQATAF--AAPITVAN-ATAVDR-LRLALLQ-AQLAEFP--ATGIVLNPADWA-GIELL 295 (394) Q Consensus 242 ------Gi------------~~~~~~~--~~~~~~~~-~~~~~~-i~~~~~~-~~~~~~~--~~~~~~~~~~~~-~l~~l 295 (394) |. ++..... .+..+..+ =..+|. +.++... +++.+.. .-+++|.+.... +-..| T Consensus 161 lqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~i~iG~~gdy~nLDalV~D~~~~lI~~~~~~d~~LVvivG~dLladk~~~l 240 (337) T protein:vir:79 161 LQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVAICGRELLHDKYFPI 240 (337) T ss_pred ccccchhHHHHHHhcchhhhhccccccCcceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhHH Confidence 11 1111000 11111111 113333 4555544 4555544 337888888755 22233 Q ss_pred hccCCcccccCcc--cCCCceeecceEEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEe Q lcl|NC_019933. 296 KDTQGRYILGNPQ--GTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERL 373 (394) Q Consensus 296 kd~~G~~~~~~~~--~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 373 (394) -+..+.|--.... -....++.|+|.+..+.+|++.+++--+++....+.....+-.+...+. ++.+.-+-..-- T Consensus 241 ~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~----r~rie~y~s~Ne 316 (337) T protein:vir:79 241 VNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPE----RDRIENYESSND 316 (337) T ss_pred hccCCCcHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEEccc----cccccchhhccc Confidence 3222222110000 0113579999999999999999999888876555554444433322221 222222233334 Q ss_pred ccEEecccceEEEE---ecCC Q lcl|NC_019933. 374 ALAVYRPESFIKGS---LAAA 391 (394) Q Consensus 374 d~~v~~~~a~~~l~---~~~a 391 (394) ++.|.++.+++.+. +..| T Consensus 317 ~YvVEd~~~~a~ienI~~~~a 337 (337) T protein:vir:79 317 AYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred eeeeeccccEEEEeceeecCC Confidence 55566666665553 4444 No 189 >protein:vir:1153 Length: 338 # NCBI annotation: predicted major capsid protein # Family: family:all:201 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490602;genbank:gi:17313222;genbank:GeneID:927319 Probab=97.76 E-value=2e-05 Score=46.37 Aligned_cols=296 Identities=11% Similarity=0.002 Sum_probs=149.6 Q ss_pred hhhhhhhhH-HHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEcCcccccceec- Q lcl|NC_019933. 94 GGQRGRAEI-NIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTNAAAPVA- 171 (394) Q Consensus 94 ~~~~~~~~~-~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 171 (394) +....+... ..-..+............+.+.|.....+.+.+.+.+.++++++++++.-......-....++.+.-+. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrtdT 80 (338) T protein:vir:11 1 MRNETRKQFDAYLAQLAKLNGVNSAVQTFAVEPSVQQKLEQRIQESSEFLKQINVYGVDELQGEKIGIGVSGTIASRTDT 80 (338) T ss_pred CCHHHHHHHHHHHHHHHHHhCCCcccceeeeCHHHHHHHHHHHHHHHHhhccCceecccceeeeEeeeccCccccccccC Confidence 000000000 000011111111222346778888999999999999999999999998754433332221122222221 Q ss_pred -C-CccccccccceeeEEeeeeeEEEeehhhHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHhhccCCC------ccc Q lcl|NC_019933. 172 -E-GAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDS---AQLQSFINARLLRGLEVVEENQLLNGNGTG------QNL 240 (394) Q Consensus 172 -e-g~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s---~~~~~~i~~~la~a~~~~~d~a~l~g~g~~------~~~ 240 (394) . ++..|..-..++.-.+..++.-....|+.+.|..+ +++...+++.+.++++.-+=..-|+|..-. .+| T Consensus 81 ~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfnG~s~A~~Td~~~nP 160 (338) T protein:vir:11 81 TGDGVRKPRDVSALDNQRYECKHTDFDTAITYAMLDAWAKFPEFQALLRDAILKRQALDRLMIGFNGTSAAATTNRAANP 160 (338) T ss_pred CCCCccccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhCc Confidence 1 11222222356666778888888888999998866 489999999999888765555555664311 122 Q ss_pred c------cc------------ccc-cccccccc--cccc-cchHH-HHHHHHHH-hhhhcCCC--CeeEeCHHHHH-HHH Q lcl|NC_019933. 241 L------GL------------LPQ-ATAFAAPI--TVAN-ATAVD-RLRLALLQ-AQLAEFPA--TGIVLNPADWA-GIE 293 (394) Q Consensus 241 ~------Gi------------~~~-~~~~~~~~--~~~~-~~~~~-~i~~~~~~-~~~~~~~~--~~~~~~~~~~~-~l~ 293 (394) . |. ++. .....+.. +..+ =..+| .+.++... +++.+... -+++|.+.... +.. T Consensus 161 llqDVNkGWlQ~~Re~ap~rv~~~~~~~~~i~i~~g~~gdy~nLDalV~d~~~~lI~~~~~~d~dLVvivG~dLladk~~ 240 (338) T protein:vir:11 161 LLQDVNIGWFQQYRNNAPARVLKEGKTTGKVVVGNGADADYKNLDALVFDVVSSLIDPWHRRDPGLVVILGRELVHDKYF 240 (338) T ss_pred CccccchhHHHHHHhhhhhhhhhcccccceeeecCCCCCccccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHHh Confidence 1 11 111 00001111 1111 12233 33555543 35555433 37888888754 222 Q ss_pred HhhccCCcccccCccc--CCCceeecceEEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEE Q lcl|NC_019933. 294 LLKDTQGRYILGNPQG--TLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEE 371 (394) Q Consensus 294 ~lkd~~G~~~~~~~~~--~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 371 (394) .+-+....|--..... ....++.|+|.+..+.+|++.+++--+++....+.....+-.+...+. ++.+.-+-.. T Consensus 241 ~l~n~~~~ptE~~Aa~~~~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~----r~rie~y~s~ 316 (338) T protein:vir:11 241 PMVNKDQPATEKIATDLILSQKRMGGLPPVEVPYVPEKGLMVTTLKNLSLYWQIGGRRRYLKEVPE----KNRIENYESS 316 (338) T ss_pred HHHhcCCChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccc----cccccchhhh Confidence 3333221111000001 113579999999999999999999888875555544444333322221 2222222223 Q ss_pred EeccEEecccceEEEEecCCCC Q lcl|NC_019933. 372 RLALAVYRPESFIKGSLAAAAG 393 (394) Q Consensus 372 ~~d~~v~~~~a~~~l~~~~a~~ 393 (394) --++.|.++.+++.+.--..+- T Consensus 317 Ne~YvVEd~~~~a~ieni~~~~ 338 (338) T protein:vir:11 317 NDAYVVEDYGLGCLVENIEVAE 338 (338) T ss_pred ccceeeeccccEEEeecceecC Confidence 3344555555555543111111 No 190 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=97.71 E-value=6.4e-06 Score=49.06 Aligned_cols=266 Identities=11% Similarity=0.025 Sum_probs=140.2 Q ss_pred CCcCccccch--hhhhHHHhhhhhhhhHHHhccccc---cccCceeEEEEcCcccccc--ee-cCCccccccccceeeEE Q lcl|NC_019933. 116 DGSAGATVQT--TRLPGILELPQRRMTIRSLLAQGT---MEGNTLEYVRETGFTNAAA--PV-AEGAQKPESSLRFDLVQ 187 (394) Q Consensus 116 ~~~~g~~ip~--~~~~~ii~~~~~~~~l~~~~~~~~---~~~~~~~~~~~~~~~~~~~--~~-~eg~~~~~~~~~~~~i~ 187 (394) -+...+++.+ .+.+.|.+...+.-...+++++.+ ..-.++.+...... +.+. |. +...++|..+..+++-. T Consensus 1 ~~~lafl~~qL~~id~~vye~~~~~~~~~~lipv~t~~~~~~~~~~~~~~d~~-G~a~~~~i~~~a~dip~vd~~~~~~~ 79 (304) T protein:vir:52 1 MSLLAYVKNGLTAVSKDIAETKYPEIVFPQFVYVDQQTAVGITEKLHYGADEH-GSLDDGLITVGTSTLDQVEVGFTPTR 79 (304) T ss_pred CchHHHHHHHHHHHhhhhhccccccchhhhhccccCCCCcccceEEEeeeecc-CcccccccCCcCCccceeecccceeE Confidence 2223344432 233344444444445555555433 22234444444332 3343 54 44578899999999999 Q ss_pred eeeeeEEEeehhhHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccc------- Q lcl|NC_019933. 188 TSAKVIAHWMKASRQILSDSA----QLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITV------- 256 (394) Q Consensus 188 ~~~~k~~~~~~is~e~l~~s~----~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~------- 256 (394) ...+.++..+.+|.+=+..+. ++.+.-.....+++...+|+..+.|+.......|+++.++........ T Consensus 80 ~~i~~~~~~~~y~~~El~~a~~~g~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~~~g~~GllN~p~v~~~~~~~~~a~~~w 159 (304) T protein:vir:52 80 SYIVPWAKSVTWTKPELEQGKLLGLALNTAKIMALNKNAQQTLQKVAFLGHAKDSRLTGLLNNKSVEVYAIKGAAQNTKV 159 (304) T ss_pred EEEEEEeeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHhhhceEEEEeeccccceEEEEeCCCcceeeecCCccCCcc Confidence 999999999999876555442 467766677778889999999999975555678999987765332211 Q ss_pred ---cccchHHHHHHHHHHhhhh---cCCCCeeEeCHHHHHHHHHhh-ccCCcccccCcccCCCceeecce--EEEcCC-- Q lcl|NC_019933. 257 ---ANATAVDRLRLALLQAQLA---EFPATGIVLNPADWAGIELLK-DTQGRYILGNPQGTLAPTLWGLP--VVATQA-- 325 (394) Q Consensus 257 ---~~~~~~~~i~~~~~~~~~~---~~~~~~~~~~~~~~~~l~~lk-d~~G~~~~~~~~~~~~~~l~G~p--v~~~~~-- 325 (394) +..-.+++|..++.++... ...+..++|.|..+..|.... ...+.-++.-...... ...|.| +...++ T Consensus 160 ~~~T~~eI~~di~~~~~~i~~~s~~~~~p~tl~Lpp~~~~~l~~~~~~~~~~Tvl~~l~~n~~-~~~g~~l~I~~v~~~~ 238 (304) T protein:vir:52 160 QAMDFDKAVAFFKEIFLKGMEKTKRIEAPNTFAIDSLDLAHLALVQRANTDTTALEFLTKHLS-AAAGRQVAIKALPSNY 238 (304) T ss_pred ccCCHHHHHHHHHHHHHHHHhccCceecCceEEeCHHHHHHHhhccCCCCCchHHHHHHHhcc-cccCCcceEEEecccc Confidence 1112344444455554322 134668999999999886543 2223222211111111 112333 221111 Q ss_pred CCcC-----ceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEE--EEE-EeccEEecccceEEEEe Q lcl|NC_019933. 326 MAVG-----QFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTIL--AEE-RLALAVYRPESFIKGSL 388 (394) Q Consensus 326 ~p~~-----~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~--~~~-~~d~~v~~~~a~~~l~~ 388 (394) ..++ ..++.+.+.-+.-+. -.+.+.... ...+|...|. ++. ..|+.++.|.+++++.+ T Consensus 239 ~~~g~~g~~r~vvY~~d~~~~~~~-vP~p~~~l~----~q~~~~~~~~vp~~~r~gGv~v~~P~a~~y~D~ 304 (304) T protein:vir:52 239 GTRVTDGKTRAMVYVNSKEHVIFD-VPMSPTVLD----AQPKGLLAFESGLRMAFGGVTFMEPDSALYVDY 304 (304) T ss_pred cccCCCCceEEEEEecChhheEEe-cCccccccc----hhhcCCceEEecceeeeeeEEEEccceeeeecC Confidence 1111 123333222221111 112222222 1223432332 344 45566777999999999 No 191 >protein:vir:79157 Length: 339 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165257;genbank:gi:145708082;genbank:GeneID:5247168 Probab=97.70 E-value=2.4e-05 Score=45.92 Aligned_cols=295 Identities=12% Similarity=0.015 Sum_probs=152.1 Q ss_pred hhhhhhhhHH-HHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEcCccccccee-- Q lcl|NC_019933. 94 GGQRGRAEIN-IKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTNAAAPV-- 170 (394) Q Consensus 94 ~~~~~~~~~~-~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 170 (394) +....+.... ....+..........-.+.|.+.....+.+.+.+.+.++++++++++.-......-....++.+.-+ T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtdt 80 (339) T protein:vir:79 1 MRNDTRRLFAAYKAAIAKLNGVERVDEKFSVAPSVQQKLETKVQESSDFLKSINFYGVPEQEGEKIGLGVSGPVASTTDT 80 (339) T ss_pred CChHHHHHHHHHHHHHHHHhCcccccceeeecHHHHHHHHHHHHHHHHHhccCcccccccceeeEEeeccCcceeecccC Confidence 0000000000 0001111111122234567788888899999999999999999998875433333222111222211 Q ss_pred cCCccccccccceeeEEeeeeeEEEeehhhHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHhhccCCCc------ccc Q lcl|NC_019933. 171 AEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDS---AQLQSFINARLLRGLEVVEENQLLNGNGTGQ------NLL 241 (394) Q Consensus 171 ~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s---~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~------~~~ 241 (394) ..++..|..-..++.-.+...+.-.-..|+.+.|..+ +++...+++.+.++++.-+=..-|+|..-.. +|. T Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPl 160 (339) T protein:vir:79 81 TQQDRETSDISTMDGRRYRCEQTNSDTHITYQKLDAWAKFADFQTRIRDAIIKRQALDRIMIGFNGVSRAATSDRVANPM 160 (339) T ss_pred CCCCcccccccccCCCccEEEEeeeeceecHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeecCCChhhCcC Confidence 1122223322466667777788888888999988865 4789989888888876554445555532211 111 Q ss_pred ------cc------------ccccccc--cccc-ccc-ccchHH-HHHHHHHH-hhhhcCCC--CeeEeCHHHHH-HHHH Q lcl|NC_019933. 242 ------GL------------LPQATAF--AAPI-TVA-NATAVD-RLRLALLQ-AQLAEFPA--TGIVLNPADWA-GIEL 294 (394) Q Consensus 242 ------Gi------------~~~~~~~--~~~~-~~~-~~~~~~-~i~~~~~~-~~~~~~~~--~~~~~~~~~~~-~l~~ 294 (394) |. ++..... .+.. +.. .=..+| .+.++... +++.+... -+++|.+.... +-.. T Consensus 161 lqDVN~GWlQ~~Re~ap~rV~~~g~~~s~~i~~~G~ggdy~NLDalV~d~~~~lId~~~~~d~dLVvivG~dLla~k~~~ 240 (339) T protein:vir:79 161 LQDVNKGWLQNLREQAPQRVMKEGKAAAGKITVGGAGADYGNLDALVYDITNHLVEPWYAEDPDLVVVCGRNLLSDKYFP 240 (339) T ss_pred ccccchhHHHHHHhhhhhhhhccceeccceeEeccCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhhHhhh Confidence 11 1110000 0001 111 111233 33555543 45555543 37888888855 3333 Q ss_pred hhccCCcccccCcc--cCCCceeecceEEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEE Q lcl|NC_019933. 295 LKDTQGRYILGNPQ--GTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEER 372 (394) Q Consensus 295 lkd~~G~~~~~~~~--~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~ 372 (394) |-+....|--.... -....++.|+|.+..+.+|++.+++--+++....+.....+-.+...+. ++.+.-+-..- T Consensus 241 l~n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p~----r~rie~y~s~N 316 (339) T protein:vir:79 241 LVNRDRDPVQQIAADLIISQKRIGNLPAIRVPYFPANGLLVTRLDNLSIYYQEGGRRRTILDNAK----RDRIENYESSN 316 (339) T ss_pred HhhcCCChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEeccc----cccccchhhcc Confidence 33332222110000 0113579999999999999999999888876555554444433332222 22222222333 Q ss_pred eccEEecccceEEE---EecCCC Q lcl|NC_019933. 373 LALAVYRPESFIKG---SLAAAA 392 (394) Q Consensus 373 ~d~~v~~~~a~~~l---~~~~a~ 392 (394) -++.|.++.+++.+ ++..+| T Consensus 317 e~YvVEd~~~~a~iEni~~~~aa 339 (339) T protein:vir:79 317 DAYVIEDLACAAMAENIALAAAA 339 (339) T ss_pred ceeeeeccccEEEeeeeecccCC Confidence 45556666665555 455555 No 192 >protein:vir:78186 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111152;genbank:gi:134288735;genbank:GeneID:4960646 Probab=97.62 E-value=3.6e-05 Score=44.96 Aligned_cols=294 Identities=13% Similarity=-0.008 Sum_probs=152.1 Q ss_pred hhhhhhhhHH-HHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEcCcccccce--e Q lcl|NC_019933. 94 GGQRGRAEIN-IKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTNAAAP--V 170 (394) Q Consensus 94 ~~~~~~~~~~-~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~ 170 (394) +....+.... ....+..........-.+.|.+.....+.+.+.+.+.++++++++++.-......-....++.+.- . T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtdt 80 (337) T protein:vir:78 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) T ss_pred CChHHHHHHHHHHHHHHHhcChhhhcceeecChHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCcceeeeecC Confidence 0000000000 000011111111222356678888889999999999999999999887543333322211122221 1 Q ss_pred cCCccccccccceeeEEeeeeeEEEeehhhHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHhhccCCCc------ccc Q lcl|NC_019933. 171 AEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDS---AQLQSFINARLLRGLEVVEENQLLNGNGTGQ------NLL 241 (394) Q Consensus 171 ~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s---~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~------~~~ 241 (394) +.+...|..-..++.-.+...+.-.-..|+.+.|..+ +++...+++.+.++++.-+=..-|+|..-.. +|. T Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPl 160 (337) T protein:vir:78 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) T ss_pred CCcccccccccccCCCccEEEEeceecccCHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeccCCChhhCcC Confidence 2223333334556777777888888888999998865 4789999988888876554445555532211 111 Q ss_pred ------c------------cccccccc--cccccccc-cchHH-HHHHHHHH-hhhhcCC--CCeeEeCHHHHHH-HHHh Q lcl|NC_019933. 242 ------G------------LLPQATAF--AAPITVAN-ATAVD-RLRLALLQ-AQLAEFP--ATGIVLNPADWAG-IELL 295 (394) Q Consensus 242 ------G------------i~~~~~~~--~~~~~~~~-~~~~~-~i~~~~~~-~~~~~~~--~~~~~~~~~~~~~-l~~l 295 (394) | +++..... .+..+..+ =..+| .+.++... +++.+.. .-+++|.+..... -..+ T Consensus 161 lqDVN~GWlQ~~Re~ap~rVl~~~~~~~~~i~iG~~gdy~NLDalV~d~~~~lI~~~~~~d~dLVvivG~dLladk~~~l 240 (337) T protein:vir:78 161 LQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLIGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPI 240 (337) T ss_pred ccccchHHHHHHHhcchhhhhccccccCCceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHHHHH Confidence 1 11111000 11111111 11233 33555554 4555544 3478888888552 2233 Q ss_pred hccCCcccccCcc--cCCCceeecceEEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEe Q lcl|NC_019933. 296 KDTQGRYILGNPQ--GTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERL 373 (394) Q Consensus 296 kd~~G~~~~~~~~--~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 373 (394) -+..+.|--.... -....++.|+|.+..+.+|++.+++--+++....+.....+-.+...+. ++.+.-+-..-- T Consensus 241 ~n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p~----r~rie~y~s~Ne 316 (337) T protein:vir:78 241 VNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPE----RDRIENYESSND 316 (337) T ss_pred HhcCCCcHHHHHHHHHHHhhhhcCcceEEccccCCCceEEeechhcEEEEecCcEEEEEEeccc----cccccchhhccc Confidence 2222222110000 1123579999999999999999999888876555554444433332222 222222233334 Q ss_pred ccEEecccceEEEE---ecCC Q lcl|NC_019933. 374 ALAVYRPESFIKGS---LAAA 391 (394) Q Consensus 374 d~~v~~~~a~~~l~---~~~a 391 (394) ++.|.++.+++.+. +..+ T Consensus 317 ~YvVEd~~~~a~iEnI~~~~a 337 (337) T protein:vir:78 317 AYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred eeeeeccccEEEEeceeecCC Confidence 55666666666553 4444 No 193 >protein:vir:78777 Length: 358 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285647;genbank:gi:148727153;genbank:GeneID:5220125 Probab=97.61 E-value=2.9e-05 Score=45.50 Aligned_cols=298 Identities=9% Similarity=-0.054 Sum_probs=149.4 Q ss_pred HHHHhhhhhhhhHH----HHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEcCccc Q lcl|NC_019933. 90 MAESGGQRGRAEIN----IKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTN 165 (394) Q Consensus 90 ~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 165 (394) +...+....+.... .-+..+.... ....-.+.|.+.....+.+.+.+.+.++++++++++.-......-....++ T Consensus 1 m~~~M~~~tr~~~~~y~~~~A~~ngv~~-~~~~~~Fsv~p~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~ 79 (358) T protein:vir:78 1 MSQTLTVQAEQRLNKYCDALAKAYGIDI-SKLDKQFSVTGPVETTLRSALLASVEFLGLITCLDVDQIKGQVVQVGVGQL 79 (358) T ss_pred CcccccHHHHHHHHHHHHHHHHHhCCCh-hHccceeeeChHHHHHHHHHHHHHHHHhhcCcccccccceeeEEeecCCcc Confidence 00001111111100 0111111111 111235778888888999999999999999999998754443333322222 Q ss_pred ccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHhhccCCCc- Q lcl|NC_019933. 166 AAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSA------QLQSFINARLLRGLEVVEENQLLNGNGTGQ- 238 (394) Q Consensus 166 ~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~------~~~~~i~~~la~a~~~~~d~a~l~g~g~~~- 238 (394) .+.-+.. ..|.....++.-.+...+.-.-..|+.+.|..++ ++...+++.+.++++.-+=..-|+|..-.. T Consensus 80 iagrt~t--r~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~f~~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~ 157 (358) T protein:vir:78 80 YTGRKKG--GRFKGKVGVDGNTYELTETDSCASLDWATLCTWANAGSEGEFIKLVGEFVNKAFALDMLRVGWNGVSAADD 157 (358) T ss_pred cceecCC--CccccccccCCCccEEEEeceeeeccHHHHHHHHhCCChhHHHHHHHHHHHHHHhhccceecccceeeccC Confidence 2222222 2233445566677777887778889999888654 688888888888776554445555532211 Q ss_pred -----ccc------ccc------------cccc-cccccccc---cccchHHHH-HHHHH-HhhhhcCC--CCeeEeCHH Q lcl|NC_019933. 239 -----NLL------GLL------------PQAT-AFAAPITV---ANATAVDRL-RLALL-QAQLAEFP--ATGIVLNPA 287 (394) Q Consensus 239 -----~~~------Gi~------------~~~~-~~~~~~~~---~~~~~~~~i-~~~~~-~~~~~~~~--~~~~~~~~~ 287 (394) +|. |.+ +... ...+..+. .+=..+|.+ .+++. .+++.+.. .-+++|.+. T Consensus 158 Td~~~nPllqDVN~GWlQ~~Re~a~~~v~~~~~~~~~i~ig~g~~Gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~d 237 (358) T protein:vir:78 158 TDPTANPLGQDVNKGWHQLAREWKGGSQIIKAAAGEKIYFDPDGKGEYKTLDEMASDLINTTIDPLFQQDPRLVVLVGTD 237 (358) T ss_pred CChhhCcCccccchHHHHHHHhhchhhhhccccccCceeecCCCCCccccHHHHHHHHHhccCChHHhcCCCEEEEEchh Confidence 111 111 1000 01111111 111233333 44543 44554444 347888888 Q ss_pred HHH-HHHHhhccCCcccccCcccCCCceeecceEEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEE Q lcl|NC_019933. 288 DWA-GIELLKDTQGRYILGNPQGTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVT 366 (394) Q Consensus 288 ~~~-~l~~lkd~~G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~ 366 (394) ... +-..|-+..+.|--......-..++.|+|.+..+.+|.+.+++--+++....+.....+-.+...+. ++.+. T Consensus 238 Lla~k~~~l~n~~~~pTE~~Aa~~i~k~iGGlpa~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p~----r~riE 313 (358) T protein:vir:78 238 LVAAAQAKLYSEATKPSEQIAAQQLAKSIAGRKAYIPPFFPGKRMVVTTLDNLHCYTQRGTRKRKADDNQD----SKSFD 313 (358) T ss_pred hhhHHhhhHhhcCCCcHHHHHHHHHHHHhCCCeEEEccccCCCceEEeeccccEEEEecCcEEEEEEeccc----ccccc Confidence 855 3333433322221100000111478999999999999999999888876555554444433332221 22222 Q ss_pred EEEEEEeccEEecccceEEEEecC------CCCC Q lcl|NC_019933. 367 ILAEERLALAVYRPESFIKGSLAA------AAGT 394 (394) Q Consensus 367 ~~~~~~~d~~v~~~~a~~~l~~~~------a~~~ 394 (394) -+-..--++.|.++.+++.+.-.. +++. T Consensus 314 ~y~s~Ne~YvVEd~~~~a~iE~i~v~~~~~pa~~ 347 (358) T protein:vir:78 314 NQYWRMEGYALGEHKAYGGFEEADIEIGADPAVL 347 (358) T ss_pred chhhhcceeeeeccccEEEEeeeeeeeCCCCCcc Confidence 222233344555555554443111 1111 No 194 >protein:vir:98856 Length: 343 # NCBI annotation: hypothetical protein # Family: family:all:201 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654732;genbank:gi:109302917;genbank:GeneID:4156061 Probab=97.57 E-value=4.4e-05 Score=44.46 Aligned_cols=299 Identities=11% Similarity=-0.002 Sum_probs=144.3 Q ss_pred HHHHhhhhhhhhHHHHHHHhhccccc-CCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEcCcccccc Q lcl|NC_019933. 90 MAESGGQRGRAEINIKAAITSLSTNA-DGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTNAAA 168 (394) Q Consensus 90 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 168 (394) +...-+.....-...-+......... ..+.-+.|.+.....+.+.+.+.+.++++++++++.--...+.-....+..+. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~q~~g~v~~~~~sg~~t~ 80 (343) T protein:vir:98 1 MNKTAQELFYSLIGDAAEYYGANPALALAGKQFSIEAPKESVLLGAIQQRSNFLEKINCVFSERYQRAIDLRSNRKRHYG 80 (343) T ss_pred CChHHHHHHHHHHHHHHHHhCCccchhccCceeeecHHHHHHHHHHHHHHHHHhhcCceecchhhcceEEEeecCccccC Confidence 00000000000000011111111110 11123678888889999999999999999999888532222211111111111 Q ss_pred e-ecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHH---HH-HHHHHHHHHHHHHHHHHHHHHhhccCCC---ccc Q lcl|NC_019933. 169 P-VAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDS---AQ-LQSFINARLLRGLEVVEENQLLNGNGTG---QNL 240 (394) Q Consensus 169 ~-~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s---~~-~~~~i~~~la~a~~~~~d~a~l~g~g~~---~~~ 240 (394) - ...+...... ..+.-.+...+.-.-..|+-+.|..+ +| +...+++.+.++++.-+=..-|+|..-. .+| T Consensus 81 r~~t~~~~~~~~--~~~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~deF~~r~~~~i~~~~ALD~i~IGfNGts~A~~T~nP 158 (343) T protein:vir:98 81 AHDRRTPIQQRW--TRQVMSMNVSRQIQACLIPWAKLDQWGHLKDKFASLYAEFVQNQIALDMIKIGFYGTSVGTDTSDP 158 (343) T ss_pred ccccCCCccccc--cCCCCccEEEEeeeeeeccHHHHHHhhcChhHHHHHHHHHHHHHHhhccceecccceeeccCCCCc Confidence 1 1111111111 11112466666666777888888765 46 8888888888877654444555553221 222 Q ss_pred c------cc------------cccccc-cccc-ccccc-cchHH-HHHHHHHHhhhhcCC--CCeeEeCHHHHHH-HHHh Q lcl|NC_019933. 241 L------GL------------LPQATA-FAAP-ITVAN-ATAVD-RLRLALLQAQLAEFP--ATGIVLNPADWAG-IELL 295 (394) Q Consensus 241 ~------Gi------------~~~~~~-~~~~-~~~~~-~~~~~-~i~~~~~~~~~~~~~--~~~~~~~~~~~~~-l~~l 295 (394) . |. ++.... .... .+..+ =..+| .+.++...+++.+.. .-+++|.+..... ...+ T Consensus 159 llqDVN~GWLQ~~Re~ap~rVm~~~~~~~~~~~~G~ggdy~NLDalV~D~~~~I~~~~~~d~dLVvivG~dLla~~~~~l 238 (343) T protein:vir:98 159 NLADVNKGWIQFVRENKATQILTQGATSGEIRLFGEGADYVNLDELAYDLKQGLDARHRDAGDLVFLVGADLVAKEASLV 238 (343) T ss_pred chhhcchHHHHHHHhcchhhhhccceeccceeEecCCCCcccHHHHHHHHHhcCchHHhcCCCEEEEEchhhhhhhhhhh Confidence 1 11 111100 0010 11111 11233 335555555555444 3378888887543 3334 Q ss_pred hccCCcccccCcc---cCCCceeecceEEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEE Q lcl|NC_019933. 296 KDTQGRYILGNPQ---GTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEER 372 (394) Q Consensus 296 kd~~G~~~~~~~~---~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~ 372 (394) -+..+++--.... -....++.|+|.+..+.+|++.+++--+++....+.....+-.+...+. ++.+.-+-..- T Consensus 239 ~n~~~~~ptEk~Aa~~~~~~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p~----r~rie~y~s~N 314 (343) T protein:vir:98 239 YKGNGLIATEKAALNTHDLMKSFGGMPAMIVPNMPPRAAIVTSLSNLSIYTQEGSMRRGMKDDDD----KKAVRDSYYRN 314 (343) T ss_pred hhhcCCChHHHHHHHHHHHHHhhCCCeeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccc----cccccchhhhc Confidence 4344432211111 0123578999999999999999999888876555555444443332222 22222233334 Q ss_pred eccEEecccceEEE-----EecCCCCC Q lcl|NC_019933. 373 LALAVYRPESFIKG-----SLAAAAGT 394 (394) Q Consensus 373 ~d~~v~~~~a~~~l-----~~~~a~~~ 394 (394) -++.|.++.+++.+ ++.+.+|+ T Consensus 315 e~YvVEd~~~~a~iE~i~v~~~~~~g~ 341 (343) T protein:vir:98 315 EAYAVEDCGKFMAVDFTKVKLSSGKGT 341 (343) T ss_pred ceeeeeccccEEEeeeeeeeecCCCCC Confidence 45566666666554 33343444 No 195 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=97.54 E-value=4.8e-05 Score=44.28 Aligned_cols=258 Identities=11% Similarity=0.034 Sum_probs=117.0 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccc-----c--ccCceeEEEEcCcccccceecCCccc---cccc Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGT-----M--EGNTLEYVRETGFTNAAAPVAEGAQK---PESS 180 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~-----~--~~~~~~~~~~~~~~~~~~~~~eg~~~---~~~~ 180 (394) |. ++ -..++|+.|..++++.+++..++.+++..-. . .|+++++|+..... +. ...+..+ +..+ T Consensus 1 MA-Ns---l~~l~p~iia~~al~~l~~~lV~~~lV~r~y~~ef~~ak~GDTV~I~~P~~~~--~~-d~~~~~~t~~~~~~ 73 (423) T protein:vir:10 1 MA-NN---LDANVSQIVLKKFLPGFMSDLVLCKTVDRQLLAGEINSSTGDSVSFKRPHQFK--SE-RTMDGDITGKSKNS 73 (423) T ss_pred Cc-cc---cccccHHHHHHHHHHHHHhhcccchhhccCCCccccccccCCEEEEeeCCcee--ee-cccCcccCcccccc Confidence 11 11 1237899999999999999999998886522 1 35678887643221 11 1111111 1112 Q ss_pred cce--eeEEeeeeeEEEeehhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCC-Cccccccccccccccccccc Q lcl|NC_019933. 181 LRF--DLVQTSAKVIAHWMKASR-QILSDSAQLQSFINARLLRGLEVVEENQLLNGNGT-GQNLLGLLPQATAFAAPITV 256 (394) Q Consensus 181 ~~~--~~i~~~~~k~~~~~~is~-e~l~~s~~~~~~i~~~la~a~~~~~d~a~l~g~g~-~~~~~Gi~~~~~~~~~~~~~ 256 (394) ..= ..+.+.-.|... +.+++ |+..+..++++++.. -.++++..+|..+...-.. ..+.-| ... T Consensus 74 l~e~~v~l~id~~k~~a-~~v~d~E~~l~i~~~~~~l~~-A~~aLA~~vd~~ia~~~~~~~~~~vg-----------t~~ 140 (423) T protein:vir:10 74 LISAKATGEVGNYITVA-VEYRQIEEALKLNQLDQILVP-INERMVTDLETELALFMMKHGALSLG-----------SPN 140 (423) T ss_pred cccceEEEEecceeeee-eeeChHHHhcChhHHHHHHHH-HHHHHHHHHHHHHHHHhhhccccccc-----------ccc Confidence 222 234455555443 34554 454445577765544 4788999999877532111 111100 011 Q ss_pred cccchHHHHHHHHHHhhhhcCCC--CeeEeCHHHHHHHHH-h---hccCCcccccCcccC-CCceeecceEEEcCCCCc- Q lcl|NC_019933. 257 ANATAVDRLRLALLQAQLAEFPA--TGIVLNPADWAGIEL-L---KDTQGRYILGNPQGT-LAPTLWGLPVVATQAMAV- 328 (394) Q Consensus 257 ~~~~~~~~i~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~-l---kd~~G~~~~~~~~~~-~~~~l~G~pv~~~~~~p~- 328 (394) +....|+++.++-..+...+.+. -..+++|..+..|.+ + ...++..- ...-.+ ..+++.|+.++.++.+|. T Consensus 141 t~~~a~~~~a~a~~~L~~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~-~alr~~~i~G~~~GFdi~~Sn~vp~~ 219 (423) T protein:vir:10 141 TPIKKWSDVAQTASFLKDLGINSGENYAVMDPWAAQRLADAQSGLHVSEQLVR-TAWENAQISGNFGGIRALMSNGLASR 219 (423) T ss_pred cccccHHHHHHHHHHHhhccCCcCCCEEEeCHHHHHHHhhhhhhhccccccch-HHHHhcccceeecceEEEEecCCccc Confidence 12234788888877777776653 367999999887753 2 22211100 111112 236899999999999984 Q ss_pred --Cce-EEeeccceEEEEeecc--------eEEEEecccchh-hhcCcEEEEEEEEeccEEecc--------------cc Q lcl|NC_019933. 329 --GQF-LTGAFDAGAQVFDRWA--------ARVEVATENQDD-FIKNMVTILAEERLALAVYRP--------------ES 382 (394) Q Consensus 329 --~~~-~~gd~~~~~~~~~~~~--------~~i~~~~~~~~~-~~~~~~~~~~~~~~d~~v~~~--------------~a 382 (394) ++. ..+-.+.++.+ .+.. .+.......... ...+. ...+.|+...|+ .- T Consensus 220 T~g~~~ga~~~~~~~~v-t~a~~~~~~~~~~~~~~~T~s~~g~l~~GD----~~t~aGv~~v~~~tk~~l~~~~~~~~~~ 294 (423) T protein:vir:10 220 TQGAFGGKLTVKGTPEV-NYDSVKDSYAFTATLTGATASKKGFLKVGD----QLQFDDTHWLNQQSKQTLYNGASALSFT 294 (423) T ss_pred ccccccceeeeeeeeEE-EecccccccccccceeeccceeceeEEecc----eEeecceeeecccccceeecccCCcceE Confidence 221 00000111111 1000 000000000000 00000 111111111111 11 Q ss_pred eEEE-------------Ee------------------cCCCCC Q lcl|NC_019933. 383 FIKG-------------SL------------------AAAAGT 394 (394) Q Consensus 383 ~~~l-------------~~------------------~~a~~~ 394 (394) |.+. ++ ..++|+ T Consensus 295 ~~V~~~~~~~a~~~~tv~i~p~~~~~~~~~~~~~V~a~~a~~~ 337 (423) T protein:vir:10 295 ATVMEDANAHSSGDVTVKISGVPIFDAGYPQYNAVDRLLAEGD 337 (423) T ss_pred EEEEecccccccCceEEEeccccccccCcccccceeccccCCc Confidence 1111 11 001111 No 196 >protein:vir:6061 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878202;genbank:gi:33438901;genbank:GeneID:1457736 Probab=97.52 E-value=4.1e-05 Score=44.61 Aligned_cols=302 Identities=11% Similarity=0.032 Sum_probs=148.6 Q ss_pred chhhhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccC Q lcl|NC_019933. 74 HISIGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGN 153 (394) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~ 153 (394) .+... ...+..+ ...-+..+..... ...-.+.|-+.....+.+.+.+.+.++++++++++.-. T Consensus 1 M~~~t-----r~~~~~y-----------~~~~A~~ngv~~~-d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~ 63 (357) T protein:vir:60 1 MRQET-----RFKFNAY-----------LSRVAELNGIDAG-DVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEM 63 (357) T ss_pred CChHH-----HHHHHHH-----------HHHHHHHhCCChH-HhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccc Confidence 00000 0000000 0001111111111 11235667788888999999999999999999988754 Q ss_pred ceeEEEEcCcccccceec--CC-ccccccccceeeEEeeeeeEEEeehhhHHHHHHH---HHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 154 TLEYVRETGFTNAAAPVA--EG-AQKPESSLRFDLVQTSAKVIAHWMKASRQILSDS---AQLQSFINARLLRGLEVVEE 227 (394) Q Consensus 154 ~~~~~~~~~~~~~~~~~~--eg-~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s---~~~~~~i~~~la~a~~~~~d 227 (394) .....-....++.+.-+. .+ ...|..-..++.-.+...+.-.-..|+.+.|..+ +++...+++.+.++++.-+= T Consensus 64 ~Ge~i~lg~~g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i 143 (357) T protein:vir:60 64 KGEKIGIGVTGSIASTTDTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDLI 143 (357) T ss_pred eeeEEecccCcccccccccCCCCCcccccccccCCCccEEEEeeeeccccHHHHHHHhcChhHHHHHHHHHHHHHhhccc Confidence 433333222222222221 11 2223222456677778888888888999998866 47888888888888765544 Q ss_pred HHHhhccCCCc------cc------cccc------------ccc----cccc---ccccccc-cchHH-HHHHHHHH-hh Q lcl|NC_019933. 228 NQLLNGNGTGQ------NL------LGLL------------PQA----TAFA---APITVAN-ATAVD-RLRLALLQ-AQ 273 (394) Q Consensus 228 ~a~l~g~g~~~------~~------~Gi~------------~~~----~~~~---~~~~~~~-~~~~~-~i~~~~~~-~~ 273 (394) ..-|+|..-.. +| .|.+ +.. +... +..+..+ =..+| .+.++... ++ T Consensus 144 ~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~ 223 (357) T protein:vir:60 144 MAGFNGVRRAETSDRSSNQMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATNNLIE 223 (357) T ss_pred eecccceeeeccCChhhCcCccccchhHHHHHHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHhccCC Confidence 45555532211 11 1221 100 0000 1111111 11333 33555554 45 Q ss_pred hhcCC--CCeeEeCHHHHH-HHHHhhccCCcccccCcc--cCCCceeecceEEEcCCCCcCceEEeeccceEEEEeecce Q lcl|NC_019933. 274 LAEFP--ATGIVLNPADWA-GIELLKDTQGRYILGNPQ--GTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAA 348 (394) Q Consensus 274 ~~~~~--~~~~~~~~~~~~-~l~~lkd~~G~~~~~~~~--~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~ 348 (394) +.+.. .-+++|.+.... +...|-+..+.|--.... -....++.|+|.+..+.+|.+.+++--+++....+..... T Consensus 224 ~~~~~d~dLVvivG~dLla~k~~~l~n~~~~pTE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~ 303 (357) T protein:vir:60 224 PWYQEDPDLVVIVGRQLLADKYFPIVNREQDNSEMLAADVIISQKRIGNLPAVRVPYFPADAMLITKLENLSIYYMDDSH 303 (357) T ss_pred hHHhcCCCEEEEEchhhhhHHhhhHhhcCCChHHHHHHHHHHHhhhhcCcceEEccccCCCceEEeeccccEEEEecCcE Confidence 55543 347888888855 233333332222110000 0123579999999999999999999888876555554444 Q ss_pred EEEEecccc----hhhhcCcEEEEEEEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 349 RVEVATENQ----DDFIKNMVTILAEERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 349 ~i~~~~~~~----~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) +-.+...+. ..|..-..+|.++-+--+...+. +.+...+.+++. T Consensus 304 RR~~~d~p~r~riE~y~s~Ne~YvVEd~~~~a~iE~--i~~~~~~~pa~~ 351 (357) T protein:vir:60 304 RRVIEENPKLDRVENYESMNIDYVVEDYAAGCLVEK--IKVGDFSTPAKA 351 (357) T ss_pred EEEEEeccccccccchhhhcceeeeeccccEEEeee--eeeccCcccccC Confidence 433322221 11222223344333333333321 222223333333 No 197 >protein:vir:270 Length: 341 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536650;genbank:gi:17975128;genbank:GeneID:929084 Probab=97.49 E-value=2.7e-05 Score=45.62 Aligned_cols=295 Identities=9% Similarity=-0.043 Sum_probs=144.2 Q ss_pred hhhhhhhHHHHHHHHHHhhhhhhhhHH-HHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCce Q lcl|NC_019933. 77 IGQQFVNSDSFKAMAESGGQRGRAEIN-IKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTL 155 (394) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~ 155 (394) +.+ .+....+.... .-..+.....-......+.|.|.....+.+.+.+.+.+++.++++++.-... T Consensus 1 m~~-------------~m~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~G 67 (341) T protein:vir:27 1 MSQ-------------ILTQSAREYMDNFAQQLAKSYGVSNVAELFNVSPQLETKLRAAITESAEFLKMITVTTVDQIEG 67 (341) T ss_pred Ccc-------------cccHHHHHHHHHHHHHHHHHcCcccccceEeecHHHHHHHHHHHHhhHHhhhcCccccccceee Confidence 100 00000000000 0000111111111223466777888999999999999999999998875433 Q ss_pred eEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHH------HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 156 EYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDS------AQLQSFINARLLRGLEVVEENQ 229 (394) Q Consensus 156 ~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s------~~~~~~i~~~la~a~~~~~d~a 229 (394) ...-....++.+.-+. ++..|. ++.++...+...+.-.-+.|+.+.|..+ +++...+.+.+.++++.-+=.. T Consensus 68 e~v~lg~~g~iagrtd-t~R~~r-~~~l~~~~Y~c~qtn~dt~i~y~~lDaWA~~g~~~dF~~r~~~~i~~~~ALD~i~I 145 (341) T protein:vir:27 68 QVVDVGVSGLYTGRKA-GGRFTK-QVGVGGHKYKLAETDSCAAITWAMLCQWANQGGRDQFMKHLTEFSNQMFALDIMRI 145 (341) T ss_pred eEeecccccceeeccC-CCceec-ccccCCcceEEEEeeeeeeecHHHHHHHHhcCCChHHHHHHHHHHHHHHhhhhhhh Confidence 3332221122222222 223222 2466677777777777788888887643 4688888888888876555555 Q ss_pred HhhccCCC------cccc------ccccc----cc----cc-cccccccc-cchHH-HHHHHHHH-hhhhcCCC--CeeE Q lcl|NC_019933. 230 LLNGNGTG------QNLL------GLLPQ----AT----AF-AAPITVAN-ATAVD-RLRLALLQ-AQLAEFPA--TGIV 283 (394) Q Consensus 230 ~l~g~g~~------~~~~------Gi~~~----~~----~~-~~~~~~~~-~~~~~-~i~~~~~~-~~~~~~~~--~~~~ 283 (394) -|+|..-. .+|. |.+.. +. .. ....+..+ =..+| .+.++... +++.+... -+++ T Consensus 146 GfnGts~A~~Td~~anPllqDVNkGWlQ~~Re~a~~rVl~~~~~~~g~~gdy~nLDAlV~D~~~~lI~~~~~~d~dLVvi 225 (341) T protein:vir:27 146 GWNGVSAEADTDPSANPLGQDVNEGWIAFVKNRKASQVVDVDVYFDETNGDYRTLDAMASDIINNQIHPMFRNDPRLTVF 225 (341) T ss_pred cccceeeccCCChhhcccccccchhHHHHHHhhcccceeccceeeccCCCccccHHHHHHHHHhcccChHHhcCCCEEEE Confidence 55664311 1121 11111 00 00 01111111 11234 34555544 35554432 3788 Q ss_pred eCHHHHH-HHHHhhccCCcccccCcccCCCceeecceEEEcCCCCcCceEEeeccceEEEEeecceEEEEecccc-hhhh Q lcl|NC_019933. 284 LNPADWA-GIELLKDTQGRYILGNPQGTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQ-DDFI 361 (394) Q Consensus 284 ~~~~~~~-~l~~lkd~~G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~-~~~~ 361 (394) |.+.... +-..|-+....|--......-..++.|+|.+..+.+|.+.+++--+++....+.....+-.+...+. +.++ T Consensus 226 vG~dLla~k~~~l~n~~~~ptE~~Aa~~i~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie 305 (341) T protein:vir:27 226 VGSGLIGAAQAKLYDKADKPSEQIAAQKLDKTIAGRPAYVPPFLPDNAMVVTIPENLQVLTQHGTAQRKAKHESDRKRSK 305 (341) T ss_pred EchhhhhhhhhhhhccCCCCHHHHHHHHHHHhhCCCeEEEccccCCCceEEeeccceEEEEecCcEEEEEEecccccccc Confidence 8887755 3333332221111000000113589999999999999999999888875555544444333322222 1222 Q ss_pred cCcEEEEEEEEeccEEecccceE---EEEecCCCCC Q lcl|NC_019933. 362 KNMVTILAEERLALAVYRPESFI---KGSLAAAAGT 394 (394) Q Consensus 362 ~~~~~~~~~~~~d~~v~~~~a~~---~l~~~~a~~~ 394 (394) .-.- ++.|.+..+|+ .-+++-++|+ T Consensus 306 ~yes--------~YvVEdyg~~~~~~~~~vkl~~~~ 333 (341) T protein:vir:27 306 THTG--------AWKVTQWVCWKRSPLTTQKKSTSA 333 (341) T ss_pred chhh--------hheeehhhhhhhccccccccCccc Confidence 1111 24444444433 3334444444 No 198 >protein:vir:5694 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839853;genbank:gi:30065708;genbank:GeneID:1260602 Probab=97.41 E-value=6.4e-05 Score=43.57 Aligned_cols=300 Identities=10% Similarity=0.001 Sum_probs=148.1 Q ss_pred chhhhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccC Q lcl|NC_019933. 74 HISIGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGN 153 (394) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~ 153 (394) .+... ...+..+ ...-+..+..... ...-.+.|-+.....+.+.+.+.+.++++++++++.-. T Consensus 1 M~~~t-----r~~~~~y-----------~~~~A~~ngv~~~-d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~ 63 (357) T protein:vir:56 1 MRQET-----RFKFNAY-----------LSRVAELNGIDAG-DVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEM 63 (357) T ss_pred CChHH-----HHHHHHH-----------HHHHHHHhCCChH-HhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccc Confidence 00000 0000000 0001111111111 11235667888888999999999999999999988754 Q ss_pred ceeEEEEcCcccccceec--CC-ccccccccceeeEEeeeeeEEEeehhhHHHHHHH---HHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 154 TLEYVRETGFTNAAAPVA--EG-AQKPESSLRFDLVQTSAKVIAHWMKASRQILSDS---AQLQSFINARLLRGLEVVEE 227 (394) Q Consensus 154 ~~~~~~~~~~~~~~~~~~--eg-~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s---~~~~~~i~~~la~a~~~~~d 227 (394) .....-....++.+.-+. .+ ...|..-..++.-.+...+.-.-..|+.+.|..+ +++...+++.+.++++.-+= T Consensus 64 ~Ge~i~lg~~g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i 143 (357) T protein:vir:56 64 KGEKIGIGVTGSIASTTDTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDFI 143 (357) T ss_pred eeeEEecccCccccccccCCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccc Confidence 433333221222222221 11 2222222456677778888888888999998866 47888888888888765544 Q ss_pred HHHhhccCCCc------cc------cccc------------ccc----cccc---ccccccc-cchHH-HHHHHHHH-hh Q lcl|NC_019933. 228 NQLLNGNGTGQ------NL------LGLL------------PQA----TAFA---APITVAN-ATAVD-RLRLALLQ-AQ 273 (394) Q Consensus 228 ~a~l~g~g~~~------~~------~Gi~------------~~~----~~~~---~~~~~~~-~~~~~-~i~~~~~~-~~ 273 (394) ..-|+|..-.. +| .|.+ +.. +... +..+..+ =..+| .+.++... ++ T Consensus 144 ~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~ 223 (357) T protein:vir:56 144 MAGFNGVKRAETSDRSSNPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATNNLIE 223 (357) T ss_pred eecccceeeeccCChhhCcCccccchhHHHHHHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHhccCC Confidence 45555532211 11 1221 100 0000 1111111 11333 33555554 45 Q ss_pred hhcCC--CCeeEeCHHHHH-HHHHhhccCCcccccCcc--cCCCceeecceEEEcCCCCcCceEEeeccceEEEEeecce Q lcl|NC_019933. 274 LAEFP--ATGIVLNPADWA-GIELLKDTQGRYILGNPQ--GTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAA 348 (394) Q Consensus 274 ~~~~~--~~~~~~~~~~~~-~l~~lkd~~G~~~~~~~~--~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~ 348 (394) +.+.. .-+++|.+.... +...|-+..+.|--.... -....++.|+|.+..+.+|.+.+++--+++....+..... T Consensus 224 ~~~~~d~dLVvivG~dLla~k~~~l~n~~~~pTE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~ 303 (357) T protein:vir:56 224 PWYQEDPDLVVIVGRQLLADKYFPIVNKEQDNSEMLAADVIISQKRIGNLPAVRVPYFPADAMLITKLENLSIYYMDDSH 303 (357) T ss_pred hHHhcCCCEEEEEchhhhhhhhhhHhhccCChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcE Confidence 55543 347888888855 333343333322211000 0113579999999999999999999888876555554444 Q ss_pred EEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEE------ecCCCCC Q lcl|NC_019933. 349 RVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGS------LAAAAGT 394 (394) Q Consensus 349 ~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~------~~~a~~~ 394 (394) +-.+...+. ++.+.-+-..--++.|.++.+++.+. .+.++.. T Consensus 304 RR~~~d~p~----r~riE~y~s~Ne~YvVEd~~~~a~iE~i~i~~~~~~~~~ 351 (357) T protein:vir:56 304 RRVIEENPK----LDRVENYESMNIDYVVEDYAAGCLVEKIKVGDFSTPAKA 351 (357) T ss_pred EEEEEeccc----cccccchhhhcceeeeeccccEEEeeeeeeccCCCCccc Confidence 433322221 11222222222333444444444332 2222222 No 199 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=97.40 E-value=4.6e-05 Score=44.38 Aligned_cols=291 Identities=12% Similarity=0.084 Sum_probs=146.3 Q ss_pred hHHHHHHHhhcccccCCcCccccch-hhhhHHHhhhhhhhhHHHhcccccccc---CceeEEEEcCcccccceecCCcc- Q lcl|NC_019933. 101 EINIKAAITSLSTNADGSAGATVQT-TRLPGILELPQRRMTIRSLLAQGTMEG---NTLEYVRETGFTNAAAPVAEGAQ- 175 (394) Q Consensus 101 ~~~~~~~~~~~~~~~~~~~g~~ip~-~~~~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~eg~~- 175 (394) -++..+-.....++..++.|--+.+ -+....+....+...+.++....|++. .++++.+-...........||-+ T Consensus 1 ~~~~~a~~~~~~~s~~g~~~~~~~t~y~~~k~L~~Aa~~lv~~~fA~~~piPkn~GkTIk~r~y~pl~~~~~pl~eGv~a 80 (401) T protein:vir:95 1 MLNYNAPTDGQKSSIDGANSDQMQTFFWLKKAIITARKEQYFMPLASVTNMPKHYGKTIKVYEYVPLLDDRNINDQGIDA 80 (401) T ss_pred CCccCCCcccccccccccccceeeehhhHHHHHhhhhhhhhhhhcccccccccccCCeEEEEecccccccccchhcCCCc Confidence 1111111111112222222333333 234555555566678888888999884 34444333322222222223321 Q ss_pred ---------------------------------ccccccceeeEEeeeeeEEEeehhhHHHHH-HH-HHHHHHH-HHHHH Q lcl|NC_019933. 176 ---------------------------------KPESSLRFDLVQTSAKVIAHWMKASRQILS-DS-AQLQSFI-NARLL 219 (394) Q Consensus 176 ---------------------------------~~~~~~~~~~i~~~~~k~~~~~~is~e~l~-~s-~~~~~~i-~~~la 219 (394) +..-..+-..+..+.++++.+.++|+++.. ++ +.+..-+ .+.|. T Consensus 81 ~G~~~~~g~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~~D~~l~~h~s~ell~ 160 (401) T protein:vir:95 81 SGATIVNGNLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFDSDDGLMEHLSRELMN 160 (401) T ss_pred ccccccCccccccccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhhcchHHHHHHHHHHhh Confidence 111122334466778999999999999876 33 3455433 34444 Q ss_pred HHHHHHHH---HHHhhccCCCccccccccccccccccccccccchHHHHHHHHHHhhhhc------------------CC Q lcl|NC_019933. 220 RGLEVVEE---NQLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAE------------------FP 278 (394) Q Consensus 220 ~a~~~~~d---~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~------------------~~ 278 (394) .+..+.+| ..+|+..+.---+ |-...-...+....+.+..+++++..+...+.... .. T Consensus 161 g~~~~t~d~i~~dll~ag~~viyA-g~ats~At~~~~~~~~t~vt~~~l~rl~~~L~~nRapk~t~~i~~s~~~dTk~i~ 239 (401) T protein:vir:95 161 GATQITEAVLQKDLLAAAGTVLYA-GAATSDATITGEGSTPSVVSYKNLMRLDQILTENRTPTQTTIITGSRMIDTKVIG 239 (401) T ss_pred hhhhhHHHHHHHHHHhhcCeeecC-CccceeeeccccccccceechhHHHHHHHHHHhcccccchhhhhhhhccCccccc Confidence 44444333 5666543211111 11110111122344556678888888776666421 12 Q ss_pred CCe-eEeCHHHHHHHHHhhccCCcccccCc---------ccCCCceeecceEEEcCCCC--------cC----------- Q lcl|NC_019933. 279 ATG-IVLNPADWAGIELLKDTQGRYILGNP---------QGTLAPTLWGLPVVATQAMA--------VG----------- 329 (394) Q Consensus 279 ~~~-~~~~~~~~~~l~~lkd~~G~~~~~~~---------~~~~~~~l~G~pv~~~~~~p--------~~----------- 329 (394) ++. -+||+.+-..|+.++|-.|+|-|-+. ..+.-+.+.+++++.++.+- +. T Consensus 240 ~s~va~~h~~L~~di~a~~D~~~~~~fi~v~kYa~~~~i~~gEiG~i~~vR~i~~p~~~~w~~ag~~a~~~~~~y~~~~~ 319 (401) T protein:vir:95 240 ATRVMYVGSELVPELKAMKDLFGNKAFIETQHYADAGTIMNGEVGSIDKFRIIQVPEMLHWAGAGAQATGANPGYRTSMV 319 (401) T ss_pred cceEEEEecCchhHHHHHHHhcCCCCceehhhcCCccccccccccccCceeEEecccceeecCCcccccccccccccccc Confidence 332 48899999999999998887766322 11233567889999888642 10 Q ss_pred ----------ceEEeeccceEEEEeecce----EEEEecccch----hhhcCcEEEE-EEEEeccEEecccceEEEEecC Q lcl|NC_019933. 330 ----------QFLTGAFDAGAQVFDRWAA----RVEVATENQD----DFIKNMVTIL-AEERLALAVYRPESFIKGSLAA 390 (394) Q Consensus 330 ----------~~~~gd~~~~~~~~~~~~~----~i~~~~~~~~----~~~~~~~~~~-~~~~~d~~v~~~~a~~~l~~~~ 390 (394) ..++|.-..+..-+...+. .+-+..-.+. .=.-|+.++. ..++.++.+.+++-+++++-.+ T Consensus 320 ~~gg~~dVyp~lV~G~dAf~~~~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQ~g~vgwK~~~a~~vL~~e~m~~ies~a 399 (401) T protein:vir:95 320 SGQEHYDVYPMLVVGDDSFTSIGFQTDGKSLKFTVMTKMPGKETADRNDPYGETGFSSIKWYYGILVKRPERLALIKTVA 399 (401) T ss_pred cCCCcceeeeeeEEccccceecccccCCccccceeEeecCCcCCCCCCCcccceehhhhhhhhhhheeccceeEEEEeec Confidence 1233443322222222221 2222221110 0012333333 3467788899999999988766 Q ss_pred CC Q lcl|NC_019933. 391 AA 392 (394) Q Consensus 391 a~ 392 (394) .- T Consensus 400 ~~ 401 (401) T protein:vir:95 400 PL 401 (401) T ss_pred CC Confidence 66 No 200 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=97.38 E-value=2.4e-05 Score=45.92 Aligned_cols=183 Identities=14% Similarity=0.013 Sum_probs=98.7 Q ss_pred EEeehhhHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHhh----ccCCCccccccccccc-cccccccccccchH Q lcl|NC_019933. 194 AHWMKASRQILSDS------AQLQSFINARLLRGLEVVEENQLLN----GNGTGQNLLGLLPQAT-AFAAPITVANATAV 262 (394) Q Consensus 194 ~~~~~is~e~l~~s------~~~~~~i~~~la~a~~~~~d~a~l~----g~g~~~~~~Gi~~~~~-~~~~~~~~~~~~~~ 262 (394) --..-+|+.++.|- -++.+...++++.++++..|+.++. +.....+..+-..... ......+.++...+ T Consensus 1 iD~lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~~g~~~~~~a~~t~~~~~l~ 80 (221) T protein:vir:17 1 MDDLLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQDGGFSVNIGAGNTNNAQAIV 80 (221) T ss_pred CCcchhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccccccCcceeccccccCCHHHHH Confidence 22334667666543 2688999999999999999988764 2222222111100000 11112223445567 Q ss_pred HHHHHHHHHhhhhcCCCC--eeEeCHHHHHHHHHhhcc-CCcccccC----cccC-CCceeecceEEEcCCCCcC--ceE Q lcl|NC_019933. 263 DRLRLALLQAQLAEFPAT--GIVLNPADWAGIELLKDT-QGRYILGN----PQGT-LAPTLWGLPVVATQAMAVG--QFL 332 (394) Q Consensus 263 ~~i~~~~~~~~~~~~~~~--~~~~~~~~~~~l~~lkd~-~G~~~~~~----~~~~-~~~~l~G~pv~~~~~~p~~--~~~ 332 (394) +.|.++...+...+.+.. .++++|..+..|.+-.|. -.+.-+.. ...+ ..+.+.|++|+.++++|.. +-+ T Consensus 81 dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~~~~g~~i~~v~G~~V~~SnnlP~~~gt~~ 160 (221) T protein:vir:17 81 DGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNREIGNTQGDMNTGKGLYVNAGIRIYKSNVLASLYGTNL 160 (221) T ss_pred HHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeeeecccccccccccceeeeecCcEEEEeccCCccccccc Confidence 888888888888888644 467799888777643221 11111111 1111 2356899999999999963 222 Q ss_pred EeeccceEEEEeecceEEEEecccchhhhcCcEEEEE-EEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 333 TGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILA-EERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 333 ~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~-~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) ..+...+..- ... ...||. ..-.-+.+.+++|+..+|+=.+-+- T Consensus 161 ~~~ag~~~~~----~~~--------------~~~yr~~fs~~~glv~~~~Avgtvkl~~~~~~ 205 (221) T protein:vir:17 161 VTDPGDATTS----GEN--------------NGSYRPAITDRAGLVFHKEAADTVEVLLPPSR 205 (221) T ss_pred ccCCcccccc----ccc--------------cccccccccceEEEEEcchheeeeeeecCCCC Confidence 2222111000 000 001111 1112267888999888886555444 No 201 >protein:vir:2016 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046760;genbank:gi:9630331;genbank:GeneID:1261541 Probab=97.34 E-value=8.9e-05 Score=42.78 Aligned_cols=300 Identities=11% Similarity=0.002 Sum_probs=148.4 Q ss_pred chhhhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccC Q lcl|NC_019933. 74 HISIGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGN 153 (394) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~ 153 (394) .+... ...+..+ ...-+..+..... ...-.+.|-+.....+.+.+.+.+.++++++++++.-. T Consensus 1 M~~~t-----r~~~~~y-----------~~~~A~~ngv~~~-d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~ 63 (357) T protein:vir:20 1 MRQET-----RFKFNAY-----------LSRVAELNGIDAG-DVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEM 63 (357) T ss_pred CChHH-----HHHHHHH-----------HHHHHHHhCCChH-HhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccc Confidence 00000 0000000 0001111111111 11235667788888999999999999999999988754 Q ss_pred ceeEEEEcCcccccceec--CC-ccccccccceeeEEeeeeeEEEeehhhHHHHHHH---HHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 154 TLEYVRETGFTNAAAPVA--EG-AQKPESSLRFDLVQTSAKVIAHWMKASRQILSDS---AQLQSFINARLLRGLEVVEE 227 (394) Q Consensus 154 ~~~~~~~~~~~~~~~~~~--eg-~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s---~~~~~~i~~~la~a~~~~~d 227 (394) .....-....++.+.-+. .+ ...|..-..++.-.+...+.-.-..|+.+.|..+ +++...+++.+.++++.-+= T Consensus 64 ~Ge~i~lg~~g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i 143 (357) T protein:vir:20 64 KGEKIGIGVTGSIASTTDTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRIRNAIIKRQSLDFI 143 (357) T ss_pred eeeEEecccCccccccccCCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccc Confidence 433333222222222221 11 2222222356677778888888888999998865 47888888888888765544 Q ss_pred HHHhhccCCCc------cc------cccc------------ccc----cccc---ccccccc-cchHH-HHHHHHHH-hh Q lcl|NC_019933. 228 NQLLNGNGTGQ------NL------LGLL------------PQA----TAFA---APITVAN-ATAVD-RLRLALLQ-AQ 273 (394) Q Consensus 228 ~a~l~g~g~~~------~~------~Gi~------------~~~----~~~~---~~~~~~~-~~~~~-~i~~~~~~-~~ 273 (394) ..-|+|..-.. +| .|.+ +.. +... +..+..+ =..+| .+.++... ++ T Consensus 144 ~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~ 223 (357) T protein:vir:20 144 MAGFNGVKRAETSDRSSNPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGRTTSEVIRVGKGGDYASLDALVMDATNNLIE 223 (357) T ss_pred eecccceeeeccCChhhCcCccccchhHHHHHHhhchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHhccCC Confidence 45555532211 11 1221 110 0000 1111111 11333 33555554 45 Q ss_pred hhcCC--CCeeEeCHHHHH-HHHHhhccCCcccccCcc--cCCCceeecceEEEcCCCCcCceEEeeccceEEEEeecce Q lcl|NC_019933. 274 LAEFP--ATGIVLNPADWA-GIELLKDTQGRYILGNPQ--GTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAA 348 (394) Q Consensus 274 ~~~~~--~~~~~~~~~~~~-~l~~lkd~~G~~~~~~~~--~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~ 348 (394) +.+.. .-+++|.+.... +...|-+..+.|--.... -....++.|+|.+..+.+|.+.+++--+++....+..... T Consensus 224 ~~~~~d~dLVvivG~dLla~k~~~l~n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~L~NLsIY~Q~gs~ 303 (357) T protein:vir:20 224 PWYQEDPDLVVIVGRQLLADKYFPIVNKEQDNSEMLAADVIISQKRIGNLPAVRVPYFPADAMLITKLENLSIYYMDDSH 303 (357) T ss_pred hHHhcCCCEEEEEchhhhhhhhhhHhhccCChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcE Confidence 55543 347888888855 333343333322211000 0113579999999999999999999888876555554444 Q ss_pred EEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEE------ecCCCCC Q lcl|NC_019933. 349 RVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGS------LAAAAGT 394 (394) Q Consensus 349 ~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~------~~~a~~~ 394 (394) +-.+...+. ++.+.-+-..--++.|.++.+++.+. .+.+++. T Consensus 304 RR~~~d~p~----r~riE~y~s~Ne~YvVEd~~~~a~iE~i~~~~~~~p~~~ 351 (357) T protein:vir:20 304 RRVIEENPK----LDRVENYESMNIDYVVEDYAAGCLVEKIKVGDFSTPAKA 351 (357) T ss_pred EEEEEeccc----cccccchhhhcceeeeeccccEEEeeeeeeccccCCccC Confidence 433322221 11222222223334444444444432 2222222 No 202 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=97.33 E-value=3.5e-05 Score=44.98 Aligned_cols=306 Identities=10% Similarity=0.049 Sum_probs=141.5 Q ss_pred HHHHHHHHHHHHHHhhcccccccchhhhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccc--cchhhh Q lcl|NC_019933. 51 ADLKAAQQRIAEVEGNGAGGDVQHISIGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGAT--VQTTRL 128 (394) Q Consensus 51 ~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~--ip~~~~ 128 (394) -.|.---..+++++......+...... .......+...+.......++..+.|.. ..+-+. T Consensus 1 ~~~~~~~~~~~~l~~~g~~~~~~~~~~-----------------~~~~~~~~a~d~~~~~~~~~~~~~~~i~a~~~~~i~ 63 (339) T protein:vir:94 1 MSINNDRTDIKQLEKVGIIFDGYSPKS-----------------ISSEVSAYAMDAVNLTPTLQTTANAGIPAWMTTFVD 63 (339) T ss_pred CceechHHHHHHHHhhceeeccchhhh-----------------cchhhHhhhccccccccccccccccchhhhhhhhhc Confidence 000000001111111111000000000 0000000000011111112222233331 233344 Q ss_pred hHHHhhhhhhhhHHHhcccccccc---CceeEEEEcCcccccceecCCccccccccc--eeeEEeeeeeEEEeehhhHH- Q lcl|NC_019933. 129 PGILELPQRRMTIRSLLAQGTMEG---NTLEYVRETGFTNAAAPVAEGAQKPESSLR--FDLVQTSAKVIAHWMKASRQ- 202 (394) Q Consensus 129 ~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~--~~~i~~~~~k~~~~~~is~e- 202 (394) +.|++...+......++++.+.+. .+++++..... +.+.+.+.+++.|..+.. +.+.++.... ..+.++.. T Consensus 64 ~~vy~~~~~~~~~~~l~pv~t~g~w~~~t~~y~~~e~~-G~a~~ygd~ad~Pl~~~~v~~~~~~v~~~~--~g~~y~~~E 140 (339) T protein:vir:94 64 RRVIDIQLAPMAAAKIFPEVKKGDWTTTYGVFIIAEPV-GQVATYSDWSANGMSKANVNFESRQNYRYQ--TWTEYGDLE 140 (339) T ss_pred hhheeecccccchhhhcccccCCCCcccEEEEeeeecc-cceEEcccccCCCcccccceeeEEeEEEEE--EEEeecHHH Confidence 567777777778888888877653 35677766554 566677888888887754 5555544444 44455543 Q ss_pred HHHHHH---HHHHHHHHHHHHHHHHHHHHHHhhccCCCcccccccccccccccccc------ccccchHHHHHHHHHHhh Q lcl|NC_019933. 203 ILSDSA---QLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPIT------VANATAVDRLRLALLQAQ 273 (394) Q Consensus 203 ~l~~s~---~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~------~~~~~~~~~i~~~~~~~~ 273 (394) +..... ++.+.-....++++.+.+|+..+.|+.. ....|+++.++....... ++..-.++||..++..+. T Consensus 141 ~~~A~~~g~~l~~~Ka~aA~~al~~~~N~i~~~Gd~~-~~~~GLlN~P~l~~~v~~s~~Wa~kT~~eI~~Di~~~~~~l~ 219 (339) T protein:vir:94 141 MATYGEAGIDYVARQEISASLVMAKFANSSYLLGVAG-IANYGLMNDPSLPAPVAATVNWATAAPEDIANDVVAMVGRLI 219 (339) T ss_pred HHHHHhhCCChHHHHHHHHHHHHHHhhceEEeeeecc-cceEEEEeCCCccccccCCCCcccCCHHHHHHHHHHHHHHHH Confidence 333222 6788888889999999999988888754 345788887665332211 122223566666666654 Q ss_pred hhcC------CCCeeEeCHHHHHHHHHhhccCCcccccCcccCCCceeecceEEEcCCCCcCceEEeeccceEEEEe--- Q lcl|NC_019933. 274 LAEF------PATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFD--- 344 (394) Q Consensus 274 ~~~~------~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~--- 344 (394) ..-. .+..++|.|..+..|... +..|.-++.-.... +-++.+...+.+.... |+- .+++.. T Consensus 220 ~~s~g~~~~~~~~~L~LP~~~~~~L~~~-n~~~~Tvl~~lk~n----~pnl~i~~~~el~~a~---g~~--~~~~~~~~~ 289 (339) T protein:vir:94 220 SQSGGLITGQERMVMALAPSALNNVNRT-NNFGLSAGAKIAQT----YPNIQFVAVPEFDTAS---GRL--VQLWVPEVN 289 (339) T ss_pred HhcCCeeeeccCcEEEecHHHHHhcccC-CcCCccHHHHHHHh----cCCcEEEEccccccCC---Cce--EEEEEEecc Confidence 3321 244789999999888643 44343332211111 1123444444332110 110 011100 Q ss_pred -ecceEEEEe----cccchhhhcCcEEEEEEEE-eccEEecccceEEEE-e Q lcl|NC_019933. 345 -RWAARVEVA----TENQDDFIKNMVTILAEER-LALAVYRPESFIKGS-L 388 (394) Q Consensus 345 -~~~~~i~~~----~~~~~~~~~~~~~~~~~~~-~d~~v~~~~a~~~l~-~ 388 (394) .....+.+- ..+- ....-.+..-+..| .|..+++|.||++++ + T Consensus 290 ~~~~~~~~~p~~~~~lpv-q~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 339 (339) T protein:vir:94 290 GQPTGEVAFAEKLRSHSI-ERYSTTTRQKHSGATFGAVIYQPWAVTQELGV 339 (339) T ss_pred CCcceEEEcchhhhcccc-EEcCceEEecceeeeeeEEEEccceeeeeecC Confidence 001111110 0000 00111122234444 666777899998887 5 No 203 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=97.31 E-value=9.8e-05 Score=42.55 Aligned_cols=375 Identities=14% Similarity=0.079 Sum_probs=150.4 Q ss_pred Cc--------------------------------------hHHHHHHHHHHHH-HHHHHHHHHHHhhhhhhHHHHHHHHH Q lcl|NC_019933. 1 MS--------------------------------------DINAINSTLANIS-DSLKAHADRAVKDQELNASVRAKVDE 41 (394) Q Consensus 1 Mk--------------------------------------~i~el~~~~~~~~-~~~k~~~e~~~~~~~~~~e~~~~~~~ 41 (394) |+ +-.+++++..+.. ++.+.+...........+++..+ T Consensus 220 ~p~~l~~~~~~~~~~p~~~~~~PaPTPaaaaPaaP~aaap~~adirA~~~aae~~r~aaI~a~fa~f~~~~a~l~a~--- 296 (693) T protein:vir:95 220 MPEALKTLLAPRAQTPAAPANTPAPTPASAAPAAPVAAAPTEADIRARILAEESGRRSAITAAFGAFSTGHAELLAT--- 296 (693) T ss_pred hHHHHHHHHhhhcccccccccCcccCccCCCCCCCccCCCCcchhhHHHHHHHHHHHHHHHHHHHhccCChHHHHHH--- Confidence 22 1111111111111 11111111100000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHH-HHHHhhccc--c----cccchhhhhhhhhHHHHHHHHHHh-------------------- Q lcl|NC_019933. 42 LLMAQGALQADLKAAQQRI-AEVEGNGAG--G----DVQHISIGQQFVNSDSFKAMAESG-------------------- 94 (394) Q Consensus 42 ~~~~~~~l~~~i~~~e~~~-~~~~~~~~~--~----~~~~~~~~~~~~~~~~~~~~~~~~-------------------- 94 (394) .+....-.++++.+++ +.+.....+ . .......+...........+.+.+ T Consensus 297 ---~l~d~~~s~d~ar~~lL~~l~~~~~p~~~~~~~~~~~~~~g~~~~d~~~~al~~R~g~~~~~~~n~~~g~~L~elAr 373 (693) T protein:vir:95 297 ---CLNDMNITVDQAREKLLAAIGADTQPAAALSAGAHIHAGNGNLVGDSVRASVLARIGRGERQADNAYNGMTLRELAR 373 (693) T ss_pred ---HHhhcCCCHHHHHHHHHHHHhhccCCCCCcCcCccccCCchhHHHHHHHHHHHHhcCcccccCCccccCCcHHHHHH Confidence 0000000111111111 111110000 0 000000000000000000000000 Q ss_pred ---hhhhh--hhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhh-hhhHHHhcccccccc-CceeEEEEcCccccc Q lcl|NC_019933. 95 ---GQRGR--AEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQR-RMTIRSLLAQGTMEG-NTLEYVRETGFTNAA 167 (394) Q Consensus 95 ---~~~~~--~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~-~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~ 167 (394) ..++. ........+....+.++++-+.++-...-..++..-.. ...+...++..+++. ...+..+..+. +.. T Consensus 374 ~~L~~rg~~~~~~~~~~~~~~a~~htTSDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~~-~~L 452 (693) T protein:vir:95 374 ASLVDRGIGVASLNAPQMVGLAFTHTSSDFGLILLDVANKSVLAGWEEAEETFPLWTKSGILTDFKPARRVGLGEF-SSL 452 (693) T ss_pred HHHHhcCCccCCCCHHHHHHHHHhcCcchhHHHHHHHHHHHHHHHHHhhhhHHHHHhccCCCCcccccceeecCCC-CCh Confidence 00000 00011111222222334443333333333333322222 233455555555543 33445555443 455 Q ss_pred ceecCCccccccccceeeEEeeeeeEEEeehhhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhh---ccCCCcccccc Q lcl|NC_019933. 168 APVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQIL-SDSAQLQSFINARLLRGLEVVEENQLLN---GNGTGQNLLGL 243 (394) Q Consensus 168 ~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l-~~s~~~~~~i~~~la~a~~~~~d~a~l~---g~g~~~~~~Gi 243 (394) .-|.|++.+......=..-++...++|..+.|||+++ +|--++.+.+...++++.++.++..++. +++.-..=+.+ T Consensus 453 ~~V~E~gEyk~~t~~e~~e~~~l~tyG~~~~iTRqaiINDDLga~~~ip~~~g~aA~~~~~~~vy~~L~~Np~m~DGk~L 532 (693) T protein:vir:95 453 RQVREGAEYKYVTLGERGEQIILATYGELFSITRQAIINDDLQMLSDIPFKLGQAAKATIGDLVYAVLTGNPAMSDGKTL 532 (693) T ss_pred hhcCCCCceeeeecCCccceeehhhcCCeeeecHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccCCcce Confidence 6678888776554443445678899999999999975 6666778888889999999999875553 22211111122 Q ss_pred ccccccccccc-cccccchHHHHHHH---HHHhh---------hhcCCCCeeEeCHHHHHHHHHhhccCCcccccCcccC Q lcl|NC_019933. 244 LPQATAFAAPI-TVANATAVDRLRLA---LLQAQ---------LAEFPATGIVLNPADWAGIELLKDTQGRYILGNPQGT 310 (394) Q Consensus 244 ~~~~~~~~~~~-~~~~~~~~~~i~~~---~~~~~---------~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~ 310 (394) +.. .+..-. +.....+.+.+-.+ +..-. .-+..+..|++.+........+-.+...|-- +.+.+ T Consensus 533 Fha--dH~Nl~tga~sals~~sl~~a~~am~~qk~~~~~~~g~~L~i~P~~llvP~~le~~a~~l~~s~~~~~a-~~~~~ 609 (693) T protein:vir:95 533 FHA--DHSNLLTGAASALSIDSLSKAKTQMATQKAQVEKGKGRTLNIRPGFVLTPVALEDKANQIINSESVPGA-DVNSG 609 (693) T ss_pred eec--cccccccccccccChHHHHHHHHHHHHhhcchhccCCceeecccceEEecchHHHHHHHHhcccccccc-ccccc Confidence 222 111111 11223344444333 22221 1234677888888777666665533321110 01111 Q ss_pred CCceeecc-eEEEcCCCCcC--ce-E-Eeeccc-eEE---EEeecceEEEEecccchhhhcCcEEEEEEEEeccEEeccc Q lcl|NC_019933. 311 LAPTLWGL-PVVATQAMAVG--QF-L-TGAFDA-GAQ---VFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPE 381 (394) Q Consensus 311 ~~~~l~G~-pv~~~~~~p~~--~~-~-~gd~~~-~~~---~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~ 381 (394) ....+.|+ .|+..+.+.+. +. + +.|... .+. +-..++..++. ...|..+.+.|++...++.++.|-- T Consensus 610 ~~NP~~~~~~vi~~prL~~~s~~~Wyl~a~~~~dtie~~yL~G~~~P~ie~----~~gf~~dG~~~kvr~D~G~~~iD~R 685 (693) T protein:vir:95 610 IVNPIRAFAQVIGEPRLDDASATAWYMAAKKGSDTIEVAYLDGVDTPYLEQ----QEGFTVDGVASKVRIDAGVAPLDFR 685 (693) T ss_pred cccchhccccccccceecCCCCCceEEecCCCCCeEEEEEecCCCCCeEee----cCCCCcceEEEEEEEeccCceeecc Confidence 11124443 56666666431 21 2 222211 111 11122333332 3358899999999999999999988 Q ss_pred ceEEEEec Q lcl|NC_019933. 382 SFIKGSLA 389 (394) Q Consensus 382 a~~~l~~~ 389 (394) ++++-.=+ T Consensus 686 g~~kn~GA 693 (693) T protein:vir:95 686 GLQKSNGA 693 (693) T ss_pred ccccCCCC Confidence 77653222 No 204 >protein:vir:861 Length: 318 # NCBI annotation: putative minor structural protein # Family: family:all:2417 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047120;genbank:gi:9630573;genbank:GeneID:1261764 Probab=97.30 E-value=7.7e-06 Score=48.62 Aligned_cols=300 Identities=10% Similarity=0.053 Sum_probs=143.0 Q ss_pred hhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccc--cCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCc Q lcl|NC_019933. 77 IGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTN--ADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNT 154 (394) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 154 (394) +........+...+.+-...........++........ +.++....+|..+...|...+..+.++++.+-+...+.-- T Consensus 1 mtn~iesq~A~~eF~~vL~~N~G~S~~k~AW~A~L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~ 80 (318) T protein:vir:86 1 MTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALL 80 (318) T ss_pred CcchhhhhHHHHHHHHHHhccCCchhhhhhhhhhhhhcCceeeccchhccHHHHHHHHHhhhccCcceeeeeeccchhhh Confidence 21111111122222221111111111122222122211 1244567899999999999999999998866555544322 Q ss_pred eeEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHH---H-HHHHHHHHHHHHHHHH-HHHHHH Q lcl|NC_019933. 155 LEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSD---S-AQLQSFINARLLRGLE-VVEENQ 229 (394) Q Consensus 155 ~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~---s-~~~~~~i~~~la~a~~-~~~d~a 229 (394) ++....+ ...+...-.|..+.+...+|..-++.+-.++....+ -++..+ + ..+..+++.+|+.++. +.+|.+ T Consensus 81 V~~s~~s--~AeAq~HkdGqTK~eqa~~~~~~Tl~~~~VY~~~S~-Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~A 157 (318) T protein:vir:86 81 VSRSFDS--SAEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSL-AERVKRLQMSYSELYNLIVAELTQAIVNKIVDLA 157 (318) T ss_pred hhhhhhh--hhhhhhhccCCccccceeeeeeechhHHHHHHHHHH-HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhh Confidence 2222222 244556667777777777777667766554433333 233333 2 3579999999999998 889999 Q ss_pred HhhccCCCccccccccccccc------cccccccccc-hHHHHHHHHHHhhhhcCCCCeeEeCHHHHHH-HHHhhccCCc Q lcl|NC_019933. 230 LLNGNGTGQNLLGLLPQATAF------AAPITVANAT-AVDRLRLALLQAQLAEFPATGIVLNPADWAG-IELLKDTQGR 301 (394) Q Consensus 230 ~l~g~g~~~~~~Gi~~~~~~~------~~~~~~~~~~-~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~-l~~lkd~~G~ 301 (394) ++-|+|+++ ++.+-+.+... +... .++.+ ....|..+..-+.+-..+ -.+++......+ |..|+.+..+ T Consensus 158 lV~GDG~N~-f~~~DK~advK~I~k~Ttkak-sagttpfanaieeavdfvrptagr-rylivkaedrkalldelrqatan 234 (318) T protein:vir:86 158 LVEGDGSNG-FKSIDKEADVKKIKKITTKAK-SAGTTPFANAIEEAVDFVRPTAGR-RYLIVKAEDRKALLDELRQATAN 234 (318) T ss_pred heeecCCCC-ccchhhHHHHHHHHHHhhhhh-ccCCCchhhHHHHHHhhhccCCCc-eEEEEeecchHHHHHHHHhhccc Confidence 999998765 22222221111 1111 12222 233444444444332222 145555555443 4566655544 Q ss_pred ccccCcccCCC-ceeecce-EEEcCC-CCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEe Q lcl|NC_019933. 302 YILGNPQGTLA-PTLWGLP-VVATQA-MAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVY 378 (394) Q Consensus 302 ~~~~~~~~~~~-~~l~G~p-v~~~~~-~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~ 378 (394) .-.....+... ..-.|.. +++-.. -.-...++.|.+. .+ +.++++- -....|..|.-.+..+..-.+.+. T Consensus 235 ahvriknddteiasevgvdeiivytgskalkptvlvdqky--hi-dmqdltk----vdafewktnsnmilvetltsghve 307 (318) T protein:vir:86 235 AHVRIKNDDTEIASEVGVDEIIVYTGSKALKPTVLVDQKY--HI-DMQDLTK----VDAFEWKTNSNMILVETLTSGHVE 307 (318) T ss_pred ceeEEeccchhhhhhcCcceeeeeeccccccceeeeccce--ec-chhhhhh----hhcceeccCCceEEEeecccCcce Confidence 33222222110 0111221 111111 1111123344332 11 1122211 112235566666666666777777 Q ss_pred cccceEEEEec Q lcl|NC_019933. 379 RPESFIKGSLA 389 (394) Q Consensus 379 ~~~a~~~l~~~ 389 (394) ..+|=+++++. T Consensus 308 tynagavitvs 318 (318) T protein:vir:86 308 TYNAGAVITVS 318 (318) T ss_pred eecCceeEEeC Confidence 77777777776 No 205 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=97.15 E-value=6.5e-05 Score=43.55 Aligned_cols=302 Identities=15% Similarity=0.130 Sum_probs=141.7 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcccccccchhhhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccch Q lcl|NC_019933. 46 QGALQADLKAAQQRIAEVEGNGAGGDVQHISIGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQT 125 (394) Q Consensus 46 ~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~ 125 (394) ++..+ .+..++.-....+. +.. ........+...+......-.+++.+ -||. T Consensus 1 ~~~~~--------~~~~l~~~gi~~~~---~~~---------------~~~~~~~~~~~da~d~~~~~~~~~~~--~~~~ 52 (336) T protein:vir:36 1 MRDAQ--------RIQNLARAGVILPR---SVQ---------------NVSTPLTEYAMDAADLSPHLSSTGSS--GIPN 52 (336) T ss_pred CchHH--------HHHHHhhcCeeecc---hhh---------------hhhhHHHHhhhhhhhccCccccCCCc--chHH Confidence 00000 00000000000000 000 00000000001011111111112222 2343 Q ss_pred hhh----hHHHhhhhhhhhHHHhcccccccc---CceeEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeeh Q lcl|NC_019933. 126 TRL----PGILELPQRRMTIRSLLAQGTMEG---NTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMK 198 (394) Q Consensus 126 ~~~----~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~ 198 (394) .+. +.+++.+........++++...+. ....++.... .+.+.+.+.+.+.|..+......+-..+.++..+. T Consensus 53 ~l~~~i~p~~~~~~~~~~~~~~l~pv~t~g~W~~~~~~~~~~e~-~G~a~~ygd~~D~P~~d~~~~~~~~~v~~~~~g~~ 131 (336) T protein:vir:36 53 YLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEP-TTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTR 131 (336) T ss_pred HHHHhhccceEeeecchhhhhhhccccccCCccceeEEEeeeec-eeeEEEeeccCCCceeecccceeeeeEEEEEeeee Confidence 332 355666666777777777766542 2344455443 35566778889999999877777788899999999 Q ss_pred hhH-HHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccc---cccc----cccchHHHHHH Q lcl|NC_019933. 199 ASR-QILSDSA---QLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAA---PITV----ANATAVDRLRL 267 (394) Q Consensus 199 is~-e~l~~s~---~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~---~~~~----~~~~~~~~i~~ 267 (394) ++. |+..... ++.+.-....++++.+.+|+..+.|+.. ....|+++.+..... +... +....++||.. T Consensus 132 yg~~E~~~Aa~~~~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~-~~~yGllNdP~l~a~~t~~t~~~~~~t~~ei~~Di~~ 210 (336) T protein:vir:36 132 WGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVA 210 (336) T ss_pred eCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCcEEEEeccc-cceEEEEecCCCccccccCCCcccccCHHHHHHHHHH Confidence 984 5554332 5777788888888999999888888754 345688887655321 1111 11234667776 Q ss_pred HHHHhhhhc------CCCCeeEeCHHHHHHHHHhhccCCcccccCcccCCCceeecceEEEcCCCCcCceEEeeccceEE Q lcl|NC_019933. 268 ALLQAQLAE------FPATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQ 341 (394) Q Consensus 268 ~~~~~~~~~------~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~ 341 (394) ++..+...- ..+..++|.+..+..|.. ++..|.-+..-.... +-++.+...+.+.... |+. .++ T Consensus 211 ~~~~l~~qt~G~i~~~~~~tL~LP~~~~~~Ls~-~n~~g~Tvl~~lk~n----~Pnl~i~t~pEl~~a~---g~~--~~l 280 (336) T protein:vir:36 211 LFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAKLKDI----FPKLEFVTIPEYDTAS---GRL--VQL 280 (336) T ss_pred HHHHHHHhcCCeeeeccccEEEechHHHHhccC-CCccCccHHHHHHHh----cCccEEEEccccccCC---Cce--EEE Confidence 666665432 236689999998888753 333343232211111 1122333333221110 110 111 Q ss_pred EEee-cc---eEEEEecccc--hhhh--cCcEEEEEEEE-eccEEecccceEEEE-e Q lcl|NC_019933. 342 VFDR-WA---ARVEVATENQ--DDFI--KNMVTILAEER-LALAVYRPESFIKGS-L 388 (394) Q Consensus 342 ~~~~-~~---~~i~~~~~~~--~~~~--~~~~~~~~~~~-~d~~v~~~~a~~~l~-~ 388 (394) ++.. .+ ..+.+ ++.. ...+ .-.+..-+..+ .|..+++|.||++++ + T Consensus 281 ~~~~~~~~~t~~~~~-p~~~~~l~vq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:36 281 WAPRVEGKDTATCGF-TEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred EEEecCCCcceeeec-chhhhccceeecCceeEeccccceeeeeeeccchheeeecC Confidence 1110 00 11111 0000 0001 11122223444 455566799998887 5 No 206 >protein:vir:3746 Length: 336 # NCBI annotation: orf15 # Family: family:all:201 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043487;genbank:gi:9628622;genbank:GeneID:1261135 Probab=97.10 E-value=0.00017 Score=41.26 Aligned_cols=295 Identities=6% Similarity=-0.042 Sum_probs=141.7 Q ss_pred hhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCc-CccccchhhhhHHHhhhhhhhhHHHhccccccccCce Q lcl|NC_019933. 77 IGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGS-AGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTL 155 (394) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~ 155 (394) +.+ ..+..+. ..-+............ --+.+.|.....+.+.+.+.+.++++++++++.-... T Consensus 1 mtr-----~~~~~y~-----------~~~A~~ngv~~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~G 64 (336) T protein:vir:37 1 MNK-----QAYYALA-----------AALAKHFNQPLDSVLRGESFALKAPEAALLGENIQQRSDFLKQINMIQVAHTKG 64 (336) T ss_pred CcH-----HHHHHHH-----------HHHHHHhCCChhhhccCceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccc Confidence 000 0000000 0001111111111111 2477888899999999999999999999998875433 Q ss_pred eEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHh-- Q lcl|NC_019933. 156 EYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSAQLQSFINARLLRGLEV--VEENQLL-- 231 (394) Q Consensus 156 ~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~~~~~~i~~~la~a~~~--~~d~a~l-- 231 (394) ...-....++.+.-+. ++..| .++..+.-.+...+.-....|+.+.|..++.+.++..+.+...+.+ ++|...+ T Consensus 65 e~v~lg~~g~iagrtd-t~R~~-~~~~l~~~~Y~c~qTn~dt~i~y~~LD~WA~~~df~~~~~~~~~~r~iALD~i~IGf 142 (336) T protein:vir:37 65 QKLFGATEKGVTGRKQ-TGRNL-ANLDHTQNGFELAETDSGIIVPWALFDSFAIFKDRLVELYSEYFQNQVALDILQIGW 142 (336) T ss_pred eEeeeccCcccccccC-CCccc-cccCcCCcccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHHhhchhhhcc Confidence 3322221112222111 22222 2246667777788888888899999987753333332333333332 3554444 Q ss_pred hccCC---Cccccc------c------------cccccccc---ccccccc-cchHHH-HHHHHHHhhhhcCC--CCeeE Q lcl|NC_019933. 232 NGNGT---GQNLLG------L------------LPQATAFA---APITVAN-ATAVDR-LRLALLQAQLAEFP--ATGIV 283 (394) Q Consensus 232 ~g~g~---~~~~~G------i------------~~~~~~~~---~~~~~~~-~~~~~~-i~~~~~~~~~~~~~--~~~~~ 283 (394) +|... +.+|.+ . ++.....+ ...+..+ =..+|. +.++...+++.+.. .-+++ T Consensus 143 nG~s~A~~TdnPllqDVNkGWlQ~~Re~a~~~v~~~~~~~~g~i~~~G~~gdy~NLDalV~D~~~~I~~~~~~d~dLVvi 222 (336) T protein:vir:37 143 NGQSVADNTTKADLSDVNKGWLKLLQEQRAANFMTESTKSSGKITIFGDNADYANLDDLAFDLKQGLDFRHQNRNDLVFL 222 (336) T ss_pred cceeeccCCCCCcccccchhHHHHHHhccchhhcccccccCCceEEecCCCCcccHHHHHHHHHhcCchHHhcCCCeEEE Confidence 44321 112221 1 11110000 1111111 123333 46666666665554 33788 Q ss_pred eCHHHHH-HHHHhhccCCc-ccccCcc---cCCCceeecceEEEcCCCCcCceEEeeccceEEEEeecceEEEEecccch Q lcl|NC_019933. 284 LNPADWA-GIELLKDTQGR-YILGNPQ---GTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQD 358 (394) Q Consensus 284 ~~~~~~~-~l~~lkd~~G~-~~~~~~~---~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~ 358 (394) |.+.... +...|-...|. |- .... -....++.|+|.+..+.+|++.+++--+++....+.....+-.+...+. T Consensus 223 vG~dLla~~~~~l~~~~~~~Pt-E~~Aa~~~~~~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~- 300 (336) T protein:vir:37 223 VGADLVSKETKLIQQKHGLTPT-EKAALGSHNLMGSFGGMNAITPPNFPARAAAVTTLKNLSVYTEAESVRRSLRNDED- 300 (336) T ss_pred EchhhhhhhhhhhhhhcCCCHH-HHHHHHHHHHHHhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEEccc- Confidence 8887744 23334444432 21 1000 0123579999999999999999999888876555544444333322221 Q ss_pred hhhcCcEEEEEEEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 359 DFIKNMVTILAEERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 359 ~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) ++.+.-+-..--++.|.++.+++.+.-....=. T Consensus 301 ---r~rie~y~s~Ne~YvVEd~~~~a~iE~i~v~~~ 333 (336) T protein:vir:37 301 ---KKGLVTSYYRQEGYVVEDLGLMTAIDHTKVKLN 333 (336) T ss_pred ---cccccchhhhcceeeeeccccEEEeeeeeeeec Confidence 222222222333445555555554431111111 No 207 >protein:vir:348 Length: 321 # NCBI annotation: major virion structural protein # Family: family:all:3198 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203462;genbank:gi:15320618;genbank:GeneID:921734 Probab=97.00 E-value=8.4e-05 Score=42.91 Aligned_cols=277 Identities=11% Similarity=-0.008 Sum_probs=140.6 Q ss_pred HHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhcc----cccccc-CceeEEEEcCcccccce-ecCCcc Q lcl|NC_019933. 102 INIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLA----QGTMEG-NTLEYVRETGFTNAAAP-VAEGAQ 175 (394) Q Consensus 102 ~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~----~~~~~~-~~~~~~~~~~~~~~~~~-~~eg~~ 175 (394) +.... +....+. .=...+..+.+.+-..++|+.... +.+..| .++..|........+.| .++-.- T Consensus 1 mp~~~-lsel~t~--------tl~~rs~~~~D~v~~~n~LL~~L~~kG~~~~~~gg~~I~~~l~y~~~s~~~wy~Gyd~l 71 (321) T protein:vir:34 1 MPFPN-ISDIITT--------TIESRSGVIADNVTKNNAILARLAKRGKPRLVSGGYTILEELSFSGNSNGGWYSGYDVL 71 (321) T ss_pred CCCch-HHHHHHH--------HHHhhcchhhhhhhcccHHHHHHHhcCcccccCCCeeEEEEEeeccCcceeEEEeeeee Confidence 00000 0000000 011223344555556666655543 233444 45666666654455555 444333 Q ss_pred ccccccceeeEEeeeeeEEEeehhhHH-HHHHHH-----HHHHHHHHHHHHHHHHHHHHHHhhccCCC---cccccc--- Q lcl|NC_019933. 176 KPESSLRFDLVQTSAKVIAHWMKASRQ-ILSDSA-----QLQSFINARLLRGLEVVEENQLLNGNGTG---QNLLGL--- 243 (394) Q Consensus 176 ~~~~~~~~~~i~~~~~k~~~~~~is~e-~l~~s~-----~~~~~i~~~la~a~~~~~d~a~l~g~g~~---~~~~Gi--- 243 (394) ...-...|.+-++.++.+++.+.||-. +++.+. ++...-.+...+.+...++..+.. +|++ ....|+ T Consensus 72 ~~~p~d~~~~Aef~wk~aa~~~~isg~e~l~n~g~~~~idll~~~~~~ae~t~~n~l~~~l~s-dGTa~g~~~i~GL~~l 150 (321) T protein:vir:34 72 PTAPQDVISSAEYALKQYAVPVVISGLEMLQNSGKEAQLDLLEARMNVAEATMANDISAALYG-DGTAFGGRAINGLDGA 150 (321) T ss_pred ccchhhhccccccchhheeEeeEEehhHHhhccchHHHHHHHHHHHHHHHHHHHhhhhHhhhc-cccccccchhhhhhhh Confidence 333446799999999999999999975 565542 455555556677777888877664 4442 333333 Q ss_pred cccc-ccccc--------------cccccccchHHHHH----HHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccc Q lcl|NC_019933. 244 LPQA-TAFAA--------------PITVANATAVDRLR----LALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYIL 304 (394) Q Consensus 244 ~~~~-~~~~~--------------~~~~~~~~~~~~i~----~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~ 304 (394) .+.. ++.+. .....+..+...+. .++.+.--....|+.|+++...|...+.-....-|+-- T Consensus 151 v~~~p~tGtvGGIdra~~~~WRn~~~d~~~~~t~~tl~~~m~~~w~~~~Rg~~~PDlii~~~~~y~~y~~s~q~~qR~~~ 230 (321) T protein:vir:34 151 VPVDPTVGTYGGINRALWPFWRSQVEDMAAVATINTIQPAMTKLWSRCVRGADMPDLIMSGNDAWTTYSNSLQVLQRFTS 230 (321) T ss_pred cccCCCCceeccccccchhhhhhhhhhhhhcccHHHHHHHHHHHHHhhccCCCCccEEEechHHHHHHHHhhheeeeecc Confidence 2211 11000 00011112222232 33333333344677899999988876653322222222 Q ss_pred cCccc-C-CCceeecceEEEcC----CCCcCceEEeeccceEEEEeecceEEEEecccc-hhhhcCcEEEEEEEEeccEE Q lcl|NC_019933. 305 GNPQG-T-LAPTLWGLPVVATQ----AMAVGQFLTGAFDAGAQVFDRWAARVEVATENQ-DDFIKNMVTILAEERLALAV 377 (394) Q Consensus 305 ~~~~~-~-~~~~l~G~pv~~~~----~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~-~~~~~~~~~~~~~~~~d~~v 377 (394) ..... + ..=.+.|..|+.++ .+|+++.+|.|.+. +.+....+-.+....... ..+.++.+.-...++....+ T Consensus 231 ~~~a~~Gf~~Lky~~~div~D~~~g~~~pan~~yfiNT~y-l~~r~h~~~~~~pi~p~r~~~~NqdA~~q~I~~~GnL~~ 309 (321) T protein:vir:34 231 AEEANLGFRSLKFLSTDVVLDGGIGGFAGANTMYFLNTKY-LHFRPHKDRNMVPLSPSRRAAFNQDAEAQILAWAGNLTC 309 (321) T ss_pred cccccccceeeeeeeEEEEEeCCCCCCccccceeeeecce-EEEEEcCCCceeecCcccccccchhHHhhhhhhhheeee Confidence 11111 1 12246788888887 58999999999874 445443333333322211 12234444444555666666 Q ss_pred ecccceEEEEec Q lcl|NC_019933. 378 YRPESFIKGSLA 389 (394) Q Consensus 378 ~~~~a~~~l~~~ 389 (394) -++.+=.+|+-- T Consensus 310 sn~~~~~vL~~~ 321 (321) T protein:vir:34 310 SGAQFQGRLIAE 321 (321) T ss_pred ecccceeEEeeC Confidence 666665555544 No 208 >protein:vir:3783 Length: 336 # NCBI annotation: capsid # Family: family:all:201 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536823;genbank:gi:17981832;genbank:GeneID:929211 Probab=96.97 E-value=0.00023 Score=40.51 Aligned_cols=294 Identities=6% Similarity=-0.062 Sum_probs=140.8 Q ss_pred HHHHhhhhhhhhHHH--HHHHhhccccc-CCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEcCcccc Q lcl|NC_019933. 90 MAESGGQRGRAEINI--KAAITSLSTNA-DGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRETGFTNA 166 (394) Q Consensus 90 ~~~~~~~~~~~~~~~--~~~~~~~~~~~-~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 166 (394) +.+ ....... -+......... ...--+.+.+.....+.+.+.+.+.++++++++++.-......-....++. T Consensus 1 mtr-----~~~~~y~~~~A~~ngv~~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~i 75 (336) T protein:vir:37 1 MNK-----QAYYALAAALAKHFNQPLDSVLRGESFALKAPEAALLGENIQQRSDFLKGINMVQVAHTKGTKLFGATEKGV 75 (336) T ss_pred CcH-----HHHHHHHHHHHHHhCCChhhhcccceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEEeeccCccc Confidence 000 0001000 01111111111 111247788888999999999999999999999887543333222211122 Q ss_pred cceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHh--hccCC---Ccc Q lcl|NC_019933. 167 AAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSAQLQSFINARLLRGLEV--VEENQLL--NGNGT---GQN 239 (394) Q Consensus 167 ~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~~~~~~i~~~la~a~~~--~~d~a~l--~g~g~---~~~ 239 (394) +.-+.-+.. -.....+.-.+..++.-....|+.+.|..++.+.++..+.+...+.+ ++|...+ +|... +++ T Consensus 76 agrtdt~r~--r~~~~l~~~~Y~c~qTn~dt~i~y~~LD~WA~~~d~~~~~~~~~~~r~iALD~i~IGfnG~s~A~~Tdn 153 (336) T protein:vir:37 76 TGRKQTGRN--LATLDHSQNGYELSETDSGILVNWSLFDSFAIFKDRLVELYSEYFQNQVALDILQIGWNGQSVATNTTK 153 (336) T ss_pred ccccCCCCC--ccccCCCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHHhcchhhhcccceeeccCCCC Confidence 222221211 11234555667777777788899999987753333333333333332 3555444 44221 123 Q ss_pred ccc------c------------cccccccc---ccccccc-cchHHH-HHHHHHHhhhhcCC--CCeeEeCHHHHH-HHH Q lcl|NC_019933. 240 LLG------L------------LPQATAFA---APITVAN-ATAVDR-LRLALLQAQLAEFP--ATGIVLNPADWA-GIE 293 (394) Q Consensus 240 ~~G------i------------~~~~~~~~---~~~~~~~-~~~~~~-i~~~~~~~~~~~~~--~~~~~~~~~~~~-~l~ 293 (394) |.+ . ++.....+ ...+..+ =..+|. +.++...+++.+.. .-+++|.+.... +.. T Consensus 154 PllqDVNkGWlQ~~Re~a~~~v~~~~~~~~g~i~~~G~~gdy~NLDalV~D~~~~I~~~~~~d~dLVvivG~dLla~~~~ 233 (336) T protein:vir:37 154 TDLSDVNKGWLKLLQEQRAANFMTESTKSSGKITIFGDNADYANLDDLAFDLKQGLDFRHQNRNDLVFLVGADLVSKETK 233 (336) T ss_pred ccccccchhHHHHHHhccchhhcccccccCCceEEecCCCCcccHHHHHHHHHhccchHHhcCCCeEEEEchhhhhhhhh Confidence 321 1 11110000 1111111 123333 46666666665554 337888887744 233 Q ss_pred HhhccCCcccccCcc---cCCCceeecceEEEcCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEE Q lcl|NC_019933. 294 LLKDTQGRYILGNPQ---GTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAE 370 (394) Q Consensus 294 ~lkd~~G~~~~~~~~---~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~ 370 (394) .|-...|...-.... -....++.|+|.+..+.+|++.+++--+++....+.....+-.+...+. ++.+.-+-. T Consensus 234 ~l~~~~~~~PtE~~Aa~~~~~~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~----r~rie~y~s 309 (336) T protein:vir:37 234 LIQQKHGLTPTEKAALGSHNLMGSFGGMNAITPPNFPARAAAVTTLKNLSVYTEAESVRRSLRNDED----KKGLVTSYY 309 (336) T ss_pred hhhhhcCCCHHHHHHHHHHHHHHhhCCceEEEccccCCCceEEeeccccEEEEecCcEEEEEEEccc----cccccchhh Confidence 344443321111000 0123578999999999999999999888875555544444433322221 222222222 Q ss_pred EEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 371 ERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 371 ~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) .--++.|.++.+++.+.-....=. T Consensus 310 ~Ne~YvVEd~~~~a~iE~i~v~~~ 333 (336) T protein:vir:37 310 RQEGYVVEDLGLMTAIDHTKVKLN 333 (336) T ss_pred hcceeeeeccccEEEeeeeeeecc Confidence 334455555555555532211111 No 209 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=96.77 E-value=0.00025 Score=40.30 Aligned_cols=302 Identities=15% Similarity=0.144 Sum_probs=143.4 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcccccccchhhhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccch Q lcl|NC_019933. 46 QGALQADLKAAQQRIAEVEGNGAGGDVQHISIGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQT 125 (394) Q Consensus 46 ~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~ 125 (394) ++..+ .+..++.-...-+...+.+... ...+..-+........+.++.| ||. T Consensus 1 ~~~~~--------~~~~l~~~gi~~~~~~~~~~~~------------------~~~~a~da~d~~~~~~t~~~~g--~~~ 52 (336) T protein:vir:78 1 MRDAQ--------RIQNLARAGVILPRSVKNVSTP------------------LAEYAMDAADLSPHLSSTGSSG--IPN 52 (336) T ss_pred CchHH--------HHHHHhccCeecchhhhhhhHH------------------HHHHHHhhhhhccccccCCCcc--hHH Confidence 00000 0111111000000000000000 0000011111111112222222 232 Q ss_pred ---hhh-hHHHhhhhhhhhHHHhcccccccc---CceeEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeeh Q lcl|NC_019933. 126 ---TRL-PGILELPQRRMTIRSLLAQGTMEG---NTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMK 198 (394) Q Consensus 126 ---~~~-~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~ 198 (394) .+. +.+++.+........++++..++. .++.++.... .+.+.+.+.+.+.|..+...+..+-..+.++..+. T Consensus 53 ~l~~~i~p~~~~~~~~~~~~~~l~~v~t~g~W~~~~~~~~~~e~-~G~a~~ygd~~D~P~vd~~~~~~~~~v~~~~~g~~ 131 (336) T protein:vir:78 53 YLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEP-TTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTR 131 (336) T ss_pred HHHHhcccceeeehhhhhhhhhhcccccCCCccccEEEEeeeec-ceeeEEeecccCCCeeecceeeEEEEEEEEEeeee Confidence 222 455566666667777777766532 2445555444 35666778899999999999999999999999999 Q ss_pred hhHHHHHHH-H---HHHHHHHHHHHHHHHHHHHHHHhhccCCCcccccccccccccccccc-------ccccchHHHHHH Q lcl|NC_019933. 199 ASRQILSDS-A---QLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPIT-------VANATAVDRLRL 267 (394) Q Consensus 199 is~e~l~~s-~---~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~-------~~~~~~~~~i~~ 267 (394) ++.+=+..+ . ++.+.-....++++.+.+|...+.|++. ....|+++.+........ ++....++||.. T Consensus 132 yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~~-~~~~GllN~P~l~a~~t~~~~~w~~~T~~~I~~Di~~ 210 (336) T protein:vir:78 132 WGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVT 210 (336) T ss_pred ecHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCeEEEEeccc-cceEEEEeCCCCCcccccCcCcccccCHHHHHHHHHH Confidence 996544433 2 5778888888888999999888888754 446788887655322111 112234556666 Q ss_pred HHHHhhhhc------CCCCeeEeCHHHHHHHHHhhccCCcccccCcccCCCceeecceEEEcCCCCcCceEEeeccceEE Q lcl|NC_019933. 268 ALLQAQLAE------FPATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQ 341 (394) Q Consensus 268 ~~~~~~~~~------~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~ 341 (394) ++..+...- ..+..++|.+..+..|.. ++..|--+..-.... +-++.+...+.+... -|+- .++ T Consensus 211 ~~~~l~~qt~g~~~~~~~~tL~Lp~~~~~~L~~-~n~~g~tv~~~lk~n----~Pnl~i~t~pel~~A---gg~~--~~~ 280 (336) T protein:vir:78 211 LFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAKLKEI----FPKLEFVTIPEYDTA---SGRL--VQL 280 (336) T ss_pred HHHHHHHhcCCeeeeccceEEEechHHHHhccC-CCccCccHHHHHHHh----cCccEEEEccccccc---Ccce--EEE Confidence 665554332 124478999999888864 333343222111111 112334433333211 0110 011 Q ss_pred EEee----cceEEEEe----cccchhhhcCcEEEEEEEE-eccEEecccceEEEE-e Q lcl|NC_019933. 342 VFDR----WAARVEVA----TENQDDFIKNMVTILAEER-LALAVYRPESFIKGS-L 388 (394) Q Consensus 342 ~~~~----~~~~i~~~----~~~~~~~~~~~~~~~~~~~-~d~~v~~~~a~~~l~-~ 388 (394) +... .-..+.+- ..+. ......+..-+..| .|..+++|.||++++ + T Consensus 281 ~~~~~~~~~t~~~~~p~~f~~lpv-q~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:78 281 WAPRVEGKDTATCGFTEKMRAHSI-ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred EEeeccCCcceeeecchhhhccce-eecCceeEeccccceeeeeeeccchheeeccC Confidence 1000 00111110 0000 00011122223344 455566789988876 4 No 210 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=96.62 E-value=0.00033 Score=39.69 Aligned_cols=302 Identities=14% Similarity=0.116 Sum_probs=141.9 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcccccccchhhhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccch Q lcl|NC_019933. 46 QGALQADLKAAQQRIAEVEGNGAGGDVQHISIGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQT 125 (394) Q Consensus 46 ~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~ 125 (394) ++..+ .+..++.-....+..... .......+...+........+++++ .||. T Consensus 1 ~~~~~--------~~~~l~~~gi~~~~~~~~------------------~~~~~~~~~~da~d~~~~~~~~~~~--~i~~ 52 (336) T protein:vir:10 1 MRDAQ--------RIQNLARAGVILPRSVQN------------------VSTPLTEYAMDAADLSPHLSSTGSS--GIPN 52 (336) T ss_pred CchHH--------HHHHHhhcCeeecchhhh------------------hhhhHHHhhhhhhhccCccccCCCc--hhHH Confidence 00000 000011000000000000 0000000111111111111122222 2333 Q ss_pred h---hh-hHHHhhhhhhhhHHHhcccccccc---CceeEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeeh Q lcl|NC_019933. 126 T---RL-PGILELPQRRMTIRSLLAQGTMEG---NTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMK 198 (394) Q Consensus 126 ~---~~-~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~ 198 (394) . +. +.+++.+........++++...+. ....++.... .+.+.+.+.+.+.|..+......+-..+.++..+. T Consensus 53 ~l~~~i~p~~~~~~~~p~~a~~l~pv~t~g~W~~~~~~~~~~e~-~G~a~~ygd~~D~P~~d~~~~~~~~~v~~~~~g~~ 131 (336) T protein:vir:10 53 YLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEP-TTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTR 131 (336) T ss_pred HHHhhcccceeeehhhhhhhhhhccccccCCccceeEEEeeeec-eeeEEEeeccCCCceeecccceeeeeEEEEEeeee Confidence 2 22 445566666666777777766542 2344455443 35566778889999999877777788999999999 Q ss_pred hhH-HHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhhccCCCcccccccccccccccc---ccc----cccchHHHHHH Q lcl|NC_019933. 199 ASR-QILSDSA---QLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAP---ITV----ANATAVDRLRL 267 (394) Q Consensus 199 is~-e~l~~s~---~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~---~~~----~~~~~~~~i~~ 267 (394) ++. |+..... ++.+.-....++++.+.+|+..+.|++. ....|+++.+...... ... +....++||.. T Consensus 132 yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~-~~~yGllN~P~l~a~~t~~t~~~~~~t~eei~~Di~~ 210 (336) T protein:vir:10 132 WGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVA 210 (336) T ss_pred eCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCcEEEEeccc-cceEEEEeCCCCccccccCCCcccccCHHHHHHHHHH Confidence 995 4544332 6778888888899999999888888754 3456888876553211 111 11224667776 Q ss_pred HHHHhhhhc------CCCCeeEeCHHHHHHHHHhhccCCcccccCcccCCCceeecceEEEcCCCCcCceEEeeccceEE Q lcl|NC_019933. 268 ALLQAQLAE------FPATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQ 341 (394) Q Consensus 268 ~~~~~~~~~------~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~ 341 (394) ++..+...- ..+..++|.+..+..|.. ++..|.-+..-.... +-++.+...+.+.... |+. .++ T Consensus 211 ~~~~l~~qs~G~i~~~~~~tL~LP~~~~~~Ls~-~n~~g~Tvl~~lk~n----~Pnl~i~t~pEl~~a~---G~~--~~l 280 (336) T protein:vir:10 211 LFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAKLKDI----FPKLEFVTIPEYDTAS---GRL--VQL 280 (336) T ss_pred HHHHHHHhcCCeecccCcceEEecHHHHHhccC-CCccCccHHHHHHHh----cCccEEEEccccccCC---Cce--EEE Confidence 666665432 236689999998888753 333343232211111 1122333333221110 110 111 Q ss_pred EEee-cc---eEEEEe----cccchhhhcCcEEEEEEEE-eccEEecccceEEEE-e Q lcl|NC_019933. 342 VFDR-WA---ARVEVA----TENQDDFIKNMVTILAEER-LALAVYRPESFIKGS-L 388 (394) Q Consensus 342 ~~~~-~~---~~i~~~----~~~~~~~~~~~~~~~~~~~-~d~~v~~~~a~~~l~-~ 388 (394) ++.. .+ ..+.+- ..+- ....-.+..-+..+ .|..+++|.||++++ + T Consensus 281 ~~~~~~~~~t~~~~~p~~~~~l~v-q~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:10 281 WAPRVEGKDTATCGFTEKMRAHSI-ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred EEEecCCCcceeeecchhhhccce-eecCceeEeccccceeeeeeeccchheeeecC Confidence 1110 00 111100 0000 00011122223444 455566799998887 5 No 211 >protein:vir:95603 Length: 463 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240903;genbank:gi:66394965;genbank:GeneID:5132544 Probab=96.31 E-value=0.00033 Score=39.64 Aligned_cols=291 Identities=14% Similarity=0.009 Sum_probs=136.1 Q ss_pred hhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccc--cCCcCccccchhhhhHHHhhhh--hhhhHHHhcccccccc Q lcl|NC_019933. 77 IGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTN--ADGSAGATVQTTRLPGILELPQ--RRMTIRSLLAQGTMEG 152 (394) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~g~~ip~~~~~~ii~~~~--~~~~l~~~~~~~~~~~ 152 (394) +.........-..... ...+.-.++....-.++ +-.+|+.+=-+.+..+|..+.. +.-.++.-+...+..+ T Consensus 1 ~~~~~~~~~~~~~~~~-----~~~e~~~KS~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~k~~a~S 75 (463) T protein:vir:95 1 MTIEKNLSDVQQKYAD-----QFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQS 75 (463) T ss_pred CCcccccchHHHHHHh-----hhhHHHHHHhhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhh Confidence 0000000000000000 00001112221111111 1123455545555555544322 3334566666666665 Q ss_pred CceeEEEEc--CcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHH-HHHH-HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 153 NTLEYVRET--GFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQI-LSDS-AQLQSFINARLLRGLEVVEEN 228 (394) Q Consensus 153 ~~~~~~~~~--~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~-l~~s-~~~~~~i~~~la~a~~~~~d~ 228 (394) --.++-... +..+-+.+++|++..+.+++.+.......|-++....+|.-+ +.++ .+.+....++-.-.++..++. T Consensus 76 TV~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~dai~~ia~tiE~ 155 (463) T protein:vir:95 76 TVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEW 155 (463) T ss_pred hhhhheeeeccCccccccccccccccccCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHHHHHHHHHHHHHH Confidence 433333333 333456789999999999999999999999999888888764 3344 478888889999999999999 Q ss_pred HHhhccCCC--------ccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCC Q lcl|NC_019933. 229 QLLNGNGTG--------QNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQG 300 (394) Q Consensus 229 a~l~g~g~~--------~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G 300 (394) ++|.|+..= -++.||.+.-....+-.......+.+.|..+-..+...+..++-++|+..+.+.|..---.. T Consensus 156 a~FyGds~l~~~~~~~gleFDGl~~lId~enviDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~vka~f~~~~l~~- 234 (463) T protein:vir:95 156 ASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGR- 234 (463) T ss_pred HHhhhhhccCCCcCccccchhhhhhhcCCCCeeecCCCcccHHHHhhhhhhhhcccCChhheecchHHHHHHHHHhcCc- Confidence 999986432 24667755433222222223444555566565666677778888999999988887322111 Q ss_pred cccccCcccCCCceeecceEEE------------cCCCCcCceEEeeccceEEEE--eecceEEEEecccchhhhcCcEE Q lcl|NC_019933. 301 RYILGNPQGTLAPTLWGLPVVA------------TQAMAVGQFLTGAFDAGAQVF--DRWAARVEVATENQDDFIKNMVT 366 (394) Q Consensus 301 ~~~~~~~~~~~~~~l~G~pv~~------------~~~~p~~~~~~gd~~~~~~~~--~~~~~~i~~~~~~~~~~~~~~~~ 366 (394) +..+...+ .+....|+||-- +..|. +..++ |.+.-.... ....++..+.... T Consensus 235 qrv~~~~N--~~~~~~G~~v~~f~s~~G~I~L~~s~~m~-~~~il-~~~~~~~p~ap~~~~~tatv~~~~---------- 300 (463) T protein:vir:95 235 QMQLMQDN--SGNVNTGYSVNGFYSSRGFIKLHGSTVME-NELIL-DESLQPLPNAPQPAKVTATVETKQ---------- 300 (463) T ss_pred eEEEEcCC--CCceeeeeeccceeeeeeeeeeCCceecC-Ccccc-cchhhcCCCCccCceeEEEEeecc---------- Confidence 11111111 111233443321 00111 11111 111000000 0001111221111 Q ss_pred EEEEEEeccEEecccceEEEEec--------CCCCC Q lcl|NC_019933. 367 ILAEERLALAVYRPESFIKGSLA--------AAAGT 394 (394) Q Consensus 367 ~~~~~~~d~~v~~~~a~~~l~~~--------~a~~~ 394 (394) .+...++...+..+++ -+++| T Consensus 301 -------~~~~~~~~~~a~~~Y~vv~~s~~geS~pS 329 (463) T protein:vir:95 301 -------KGAFENEEDRAGLSYKVVVNSDDAQSAPS 329 (463) T ss_pred -------CCCCCCcccccceEEEEEEECCCCCcccc Confidence 1111111111111111 11122 No 212 >protein:vir:99311 Length: 463 # NCBI annotation: putative capsid protein # Family: family:all:2450 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024474;genbank:gi:48696433;genbank:GeneID:2948039 Probab=96.31 E-value=0.00033 Score=39.64 Aligned_cols=291 Identities=14% Similarity=0.009 Sum_probs=136.1 Q ss_pred hhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccc--cCCcCccccchhhhhHHHhhhh--hhhhHHHhcccccccc Q lcl|NC_019933. 77 IGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTN--ADGSAGATVQTTRLPGILELPQ--RRMTIRSLLAQGTMEG 152 (394) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~g~~ip~~~~~~ii~~~~--~~~~l~~~~~~~~~~~ 152 (394) +.........-..... ...+.-.++....-.++ +-.+|+.+=-+.+..+|..+.. +.-.++.-+...+..+ T Consensus 1 ~~~~~~~~~~~~~~~~-----~~~e~~~KS~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~k~~a~S 75 (463) T protein:vir:99 1 MTIEKNLSDVQQKYAD-----QFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQS 75 (463) T ss_pred CCcccccchHHHHHHh-----hhhHHHHHHhhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhh Confidence 0000000000000000 00001112221111111 1123455545555555544322 3334566666666665 Q ss_pred CceeEEEEc--CcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHH-HHHH-HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 153 NTLEYVRET--GFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQI-LSDS-AQLQSFINARLLRGLEVVEEN 228 (394) Q Consensus 153 ~~~~~~~~~--~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~-l~~s-~~~~~~i~~~la~a~~~~~d~ 228 (394) --.++-... +..+-+.+++|++..+.+++.+.......|-++....+|.-+ +.++ .+.+....++-.-.++..++. T Consensus 76 TV~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~dai~~ia~tiE~ 155 (463) T protein:vir:99 76 TVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEW 155 (463) T ss_pred hhhhheeeeccCccccccccccccccccCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHHHHHHHHHHHHHH Confidence 433333333 333456789999999999999999999999999888888764 3344 478888889999999999999 Q ss_pred HHhhccCCC--------ccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCC Q lcl|NC_019933. 229 QLLNGNGTG--------QNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQG 300 (394) Q Consensus 229 a~l~g~g~~--------~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G 300 (394) ++|.|+..= -++.||.+.-....+-.......+.+.|..+-..+...+..++-++|+..+.+.|..---.. T Consensus 156 a~FyGds~l~~~~~~~gleFDGl~~lId~enviDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~vka~f~~~~l~~- 234 (463) T protein:vir:99 156 ASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGR- 234 (463) T ss_pred HHhhhhhccCCCcCccccchhhhhhhcCCCCeeecCCCcccHHHHhhhhhhhhcccCChhheecchHHHHHHHHHhcCc- Confidence 999986432 24667755433222222223444555566565666677778888999999988887322111 Q ss_pred cccccCcccCCCceeecceEEE------------cCCCCcCceEEeeccceEEEE--eecceEEEEecccchhhhcCcEE Q lcl|NC_019933. 301 RYILGNPQGTLAPTLWGLPVVA------------TQAMAVGQFLTGAFDAGAQVF--DRWAARVEVATENQDDFIKNMVT 366 (394) Q Consensus 301 ~~~~~~~~~~~~~~l~G~pv~~------------~~~~p~~~~~~gd~~~~~~~~--~~~~~~i~~~~~~~~~~~~~~~~ 366 (394) +..+...+ .+....|+||-- +..|. +..++ |.+.-.... ....++..+.... T Consensus 235 qrv~~~~N--~~~~~~G~~v~~f~s~~G~I~L~~s~~m~-~~~il-~~~~~~~p~ap~~~~~tatv~~~~---------- 300 (463) T protein:vir:99 235 QMQLMQDN--SGNVNTGYSVNGFYSSRGFIKLHGSTVME-NELIL-DESLQPLPNAPQPAKVTATVETKQ---------- 300 (463) T ss_pred eEEEEcCC--CCceeeeeeccceeeeeeeeeeCCceecC-Ccccc-cchhhcCCCCccCceeEEEEeecc---------- Confidence 11111111 111233443321 00111 11111 111000000 0001111221111 Q ss_pred EEEEEEeccEEecccceEEEEec--------CCCCC Q lcl|NC_019933. 367 ILAEERLALAVYRPESFIKGSLA--------AAAGT 394 (394) Q Consensus 367 ~~~~~~~d~~v~~~~a~~~l~~~--------~a~~~ 394 (394) .+...++...+..+++ -+++| T Consensus 301 -------~~~~~~~~~~a~~~Y~vv~~s~~geS~pS 329 (463) T protein:vir:99 301 -------KGAFENEEDRAGLSYKVVVNSDDAQSAPS 329 (463) T ss_pred -------CCCCCCcccccceEEEEEEECCCCCcccc Confidence 1111111111111111 11122 No 213 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=96.15 E-value=0.00094 Score=37.17 Aligned_cols=265 Identities=5% Similarity=-0.058 Sum_probs=127.0 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHHhccc------cccccCceeEEEEcCcccccceecC-Cccccccccce Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQ------GTMEGNTLEYVRETGFTNAAAPVAE-GAQKPESSLRF 183 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~e-g~~~~~~~~~~ 183 (394) |.. .-..+.|...+.+.+...+....++.. ...+|+++++|+....+...+-.+. |......+.++ T Consensus 1 MA~-------~n~a~~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~~~gg~tVkI~~i~~~gl~DY~R~~~g~~~g~~~~~~ 73 (299) T protein:vir:79 1 MAA-------LNYAKEYSNVLAQAYPYTLNFGDLYATPNNGRYRWTGSKTIEIPTISTTGRVDSNRDTIAVAQRNYDNAW 73 (299) T ss_pred Ccc-------chhHHHHHHHHHHHHHhhceeeeeccCcccceeeecCCCEEEEeccccccccccccCCCcccccccCcce Confidence 110 112466777777777777665554332 1234678999998654322222222 22222334566 Q ss_pred eeEEeeeeeEEEeehhhHHHHHHH-H--HHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccccccc Q lcl|NC_019933. 184 DLVQTSAKVIAHWMKASRQILSDS-A--QLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVANAT 260 (394) Q Consensus 184 ~~i~~~~~k~~~~~~is~e~l~~s-~--~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~ 260 (394) ...++...+.-.+. |..--.+.+ . .+...+.+.....++-.+|.-.+..--++..- .+........+... T Consensus 74 ~t~~ldqdr~~~f~-vD~~Dvdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl~~~a~~------~g~~~~~~~~T~~n 146 (299) T protein:vir:79 74 EPKVLTNQRKWSTL-VHPADINQTNYVASIGNITKVYNEEQKFPEMDAYCISKIYADWTA------LGNTADTTVLTTTN 146 (299) T ss_pred eEEEeeccccceec-cchhhHHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHHHhhhh------cCCcccccccCHHH Confidence 67777777644332 221111112 1 23333344444555556676555321111000 01111112223455 Q ss_pred hHHHHHHHHHHhhhhcCCC--CeeEeCHHHHHHHHHhhcc--CCccccc-CcccCCCceeecceEEEcC--CCCcC---- Q lcl|NC_019933. 261 AVDRLRLALLQAQLAEFPA--TGIVLNPADWAGIELLKDT--QGRYILG-NPQGTLAPTLWGLPVVATQ--AMAVG---- 329 (394) Q Consensus 261 ~~~~i~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~lkd~--~G~~~~~-~~~~~~~~~l~G~pv~~~~--~~p~~---- 329 (394) .++.|.++...+.....+. -.++|+|.++..|..-..- ....... ....+..+.|.|+||+..+ .|+.. T Consensus 147 ~y~~i~~~~~~lde~~vP~~~rvl~vtp~~~~~L~~~~~f~k~~~~~~~~~~~~g~Vg~idG~~Ii~Vps~r~~t~~~~~ 226 (299) T protein:vir:79 147 VLEVFDKLMEKMTEARVPENGRILYVTPVVNTLIKNAKEIQRTVNIKDAGTSLNRQTTDIDTVKIIKVPSNLMKTAYDFT 226 (299) T ss_pred HHHHHHHHHHHHHhcCCCCCCeEEEeCHHHHHHHhhchhhhcccccccccceeeeeeeeecceEEEEechhhcCccceec Confidence 7899999999999887753 3679999998888653321 1111111 1233445689999998743 34321 Q ss_pred ------------ceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecc-cceEEEEecCCCC Q lcl|NC_019933. 330 ------------QFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRP-ESFIKGSLAAAAG 393 (394) Q Consensus 330 ------------~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~-~a~~~l~~~~a~~ 393 (394) ..+++- ..+..-.... -.+.+.. +... +++-..+.-..|.|.-+.+. ..-.++.+++|.+ T Consensus 227 ~G~~~~~~ak~in~ii~~-~~a~~~~~K~-~~~~~~~-P~~~-~~~~~~~~~r~y~d~~v~~nk~~~i~~~~~~a~~ 299 (299) T protein:vir:79 227 TGWKVGAGAKQIFMSLVH-PSAIITPVSY-QFSKLDE-PTAV-TEGKYFYFEESFEDVFILNKKADAIQFVVEGAGA 299 (299) T ss_pred cCccccCcccccceEEEc-CCeeeeeEee-eeEEeec-CCCC-CccceeeeeeeeeeeeeeccccCeEEEEeeecCC Confidence 112222 1222111111 1122221 2211 22222344466666666654 3344677777777 No 214 >protein:vir:95451 Length: 313 # NCBI annotation: hypothetical protein ORF044 # Family: family:all:11728 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294637;genbank:gi:149408203;genbank:GeneID:5237018 Probab=96.06 E-value=0.0011 Score=36.91 Aligned_cols=271 Identities=15% Similarity=0.111 Sum_probs=144.8 Q ss_pred cccCCcCcccc-chhhhhHHHhhhhhhhhHHHhcc-cccc-ccCceeEEEEcCcccccceecCCccccccccceeeEEee Q lcl|NC_019933. 113 TNADGSAGATV-QTTRLPGILELPQRRMTIRSLLA-QGTM-EGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTS 189 (394) Q Consensus 113 ~~~~~~~g~~i-p~~~~~~ii~~~~~~~~l~~~~~-~~~~-~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~ 189 (394) ..-+++.-.+| -+.|+..|...+.+...=..+.+ +... +|.++.+|...+ +...-..|-++........++|++. T Consensus 1 ~~~TSNT~A~I~SE~~s~~I~~~LH~~LL~~~~~R~V~DF~~G~~L~I~tiGs--~~~~~~~E~~~~~~~~i~TGEIt~~ 78 (313) T protein:vir:95 1 MQLTSNTRAFIESEQYSKFILLNLHDGLLPETFYRNVSDFGSGETLHIKTIGS--VTLQEAEEDTPLIYNPIETGEITFQ 78 (313) T ss_pred CcccccchheehhhhHHHHHHHHhhccccchhhhhhhccCCCCCEEEecccCc--eeeeccccCCCeeecccccceEEEE Confidence 22223333444 45566555554444322122222 2222 356777776533 3444455556666667788899999 Q ss_pred eeeEEEee-hhhHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhhc-------cCCCccccccccccccccccccccc Q lcl|NC_019933. 190 AKVIAHWM-KASRQILSDSA---QLQSFINARLLRGLEVVEENQLLNG-------NGTGQNLLGLLPQATAFAAPITVAN 258 (394) Q Consensus 190 ~~k~~~~~-~is~e~l~~s~---~~~~~i~~~la~a~~~~~d~a~l~g-------~g~~~~~~Gi~~~~~~~~~~~~~~~ 258 (394) ...+.+-. +||+.+-+|+- .+-+.+.-+-+++|....+..++.. ...-....|+-. .-+..+..+ T Consensus 79 i~~Y~G~A~~vt~~LR~D~~~I~~~~A~~~AE~~RAI~E~~~TD~L~~G~~~FA~~~~P~~vNG~PH----~~V~~~T~~ 154 (313) T protein:vir:95 79 ITEYKGDAWYVTDDLREDGTDIDRLMAERAAESTRAIQETFETDFLKTGAEYFAANPGPHNVNGFPH----VIVSAETNG 154 (313) T ss_pred EEeecCChhhhhhhhhhcchhHHHHhhhcchhhHHHHHHHHhhHHHhhchhhhccCCCCcccccccc----eEEeccCCc Confidence 99877654 89999999874 4666666677788887777666642 111112222211 112233445 Q ss_pred cchHHHHHHHHHHhhhhcCC--CCeeEeCHHHHHHHHHhhc------cCCcccccCcccCCC---ceeecceEEEcCCCC Q lcl|NC_019933. 259 ATAVDRLRLALLQAQLAEFP--ATGIVLNPADWAGIELLKD------TQGRYILGNPQGTLA---PTLWGLPVVATQAMA 327 (394) Q Consensus 259 ~~~~~~i~~~~~~~~~~~~~--~~~~~~~~~~~~~l~~lkd------~~G~~~~~~~~~~~~---~~l~G~pv~~~~~~p 327 (394) ...+.+++.+...+..+..+ .-.+++.|.....|..+.. .+|++|.......+. ..+.|..+.+++.+. T Consensus 155 ~~~~~~~~~~~~~~~~a~~P~~G~v~IvDP~~~~~L~~l~~It~~vt~~~k~I~ESG~A~~~~Fi~~~YG~Di~~SN~L~ 234 (313) T protein:vir:95 155 VFALKHLIAMRLAFDKANVPAEGRVFIVDPVAEATLNGLVTITHDVTDFGKMILESGMARGQRFIMNLYGWDILTSNRLH 234 (313) T ss_pred eehhhHHHHhhhhhhhccCCccceEEEEcchhhhhhhhhheeecccccccceeeeccCCchhHHHHHHhhhhhhhhhhhh Confidence 55677777776666665554 3478999999888876642 346777665554433 257888888887553 Q ss_pred c---------CceEEeeccceEEEEeecceEEEEec------cc--chhhhcCcEEEEEEEEeccEEecccceEEEEecC Q lcl|NC_019933. 328 V---------GQFLTGAFDAGAQVFDRWAARVEVAT------EN--QDDFIKNMVTILAEERLALAVYRPESFIKGSLAA 390 (394) Q Consensus 328 ~---------~~~~~gd~~~~~~~~~~~~~~i~~~~------~~--~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~ 390 (394) . +..+.|+.... +.+.+-..+.... +. .++-..+.+. ...|+|+.+.+.+-+..+-..+ T Consensus 235 ~AN~~D~~tT~~G~~~NlFM~--i~D~~~~P~~~AWr~MP~s~~~~~~~~~~~~~~--~~~R~G~Gi~R~~~L~~~~~~A 310 (313) T protein:vir:95 235 VANYNDGTTTGNGYVGNLFMC--ILDDQTKPIMGAWRRMPKSEGERNKDRARDEHV--VRCRYGFGIQRLDTLGLLATSA 310 (313) T ss_pred hccccccccccCceeeeeeee--eecccccceeeeeccccccccccccccccccce--eeeeecccceeecceeEEEecc Confidence 2 22344443211 1111111111100 00 0111122333 4556777777776665554333 Q ss_pred CCC Q lcl|NC_019933. 391 AAG 393 (394) Q Consensus 391 a~~ 393 (394) .+- T Consensus 311 ~~~ 313 (313) T protein:vir:95 311 TAY 313 (313) T ss_pred ccC Confidence 333 No 215 >protein:vir:106734 Length: 336 # NCBI annotation: gp13 # Family: family:all:1653 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944321;genbank:gi:38638620;genbank:GeneID:2657363 Probab=96.05 E-value=0.00054 Score=38.50 Aligned_cols=302 Identities=16% Similarity=0.139 Sum_probs=139.6 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcccccccchhhhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccch Q lcl|NC_019933. 46 QGALQADLKAAQQRIAEVEGNGAGGDVQHISIGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQT 125 (394) Q Consensus 46 ~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~ 125 (394) ++..+ .+..++.-...-+...+.+... ...+..-+........+.++.| ||. T Consensus 1 ~~~~~--------~~~~l~~~gi~~~~~~~~~~~~------------------~~~~a~da~d~~~~~~t~~~~g--~~~ 52 (336) T protein:vir:10 1 MRDAQ--------RIQNLARAGVILPRSVKNVSTP------------------LAEYAMDAADLSPHLSSTGSSG--IPN 52 (336) T ss_pred CchHH--------HHHHHhccCeecchhhhhhhHH------------------HHHHHHhhhhhccccccCCCcc--hHH Confidence 00000 0111111000000000000000 0000011111111112222222 232 Q ss_pred h---hh-hHHHhhhhhhhhHHHhcccccccc---CceeEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeeh Q lcl|NC_019933. 126 T---RL-PGILELPQRRMTIRSLLAQGTMEG---NTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMK 198 (394) Q Consensus 126 ~---~~-~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~ 198 (394) . +. +.+++.+........++++.+.+. ....++.... .+.+.+.+.+.++|..+.....-.-..+.++..+. T Consensus 53 ~l~~~i~p~~~~~~~~~~~~~~l~~v~t~g~w~~~~~~~~~~e~-~G~a~~ygd~~d~P~~d~~~~~~~~~v~~~~~g~~ 131 (336) T protein:vir:10 53 YLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEP-TTKVATYGDYSSDGDSGTNINYPQRQSYFFQTWTR 131 (336) T ss_pred HHHhhcCcceeeeeechhchhhhcccccCCCcceeeEEEEeeee-eeeEEEccccCCCcceeeeeeeeeeeEEEEEEEEe Confidence 2 22 345555666666677776665432 2233444443 34455667788999999888888888999999999 Q ss_pred hhHHHHHHH-H---HHHHHHHHHHHHHHHHHHHHHHhhccCCCcccccccccccccccccc-------ccccchHHHHHH Q lcl|NC_019933. 199 ASRQILSDS-A---QLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPIT-------VANATAVDRLRL 267 (394) Q Consensus 199 is~e~l~~s-~---~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~-------~~~~~~~~~i~~ 267 (394) ++.+=+... . ++.+.-....++++.+.++...+.|+.. ....|+++.+........ ++....++||.. T Consensus 132 yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~~-~~~~GllN~P~l~a~~t~~~~~w~~~T~~eI~~Di~~ 210 (336) T protein:vir:10 132 WGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVT 210 (336) T ss_pred eCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCeEEEEeecc-cceEEEeecCCCCcccccCcCcccccCHHHHHHHHHH Confidence 996544433 2 5778888888888889999888888764 346688887655322111 122334566666 Q ss_pred HHHHhhhhc------CCCCeeEeCHHHHHHHHHhhccCCcccccCcccCCCceeecceEEEcCCCCcCceEEeeccceEE Q lcl|NC_019933. 268 ALLQAQLAE------FPATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQ 341 (394) Q Consensus 268 ~~~~~~~~~------~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~ 341 (394) ++..+...- ..+..++|.+..+..|.. ++..|--+..-.... +-++.+...+.+... -|+- .++ T Consensus 211 ~~~~l~~qt~g~i~~~~~~tL~Lp~~~~~~L~~-~n~~g~tv~~~lk~n----~Pnl~i~t~pel~~A---gg~~--~~~ 280 (336) T protein:vir:10 211 LFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAKLKEI----FPKLEFVTIPEYDTA---SGRL--VQL 280 (336) T ss_pred HHHHHHHhcCCeeeeccceEEEechHHHHhccC-CCccCccHHHHHHHh----CCccEEEEccccccc---CCce--EEE Confidence 666554332 124478999999988864 333443222211111 112334443333211 0110 111 Q ss_pred EEee----cceEEEEecccc----hhhhcCcEEEEEEEE-eccEEecccceEEEE-e Q lcl|NC_019933. 342 VFDR----WAARVEVATENQ----DDFIKNMVTILAEER-LALAVYRPESFIKGS-L 388 (394) Q Consensus 342 ~~~~----~~~~i~~~~~~~----~~~~~~~~~~~~~~~-~d~~v~~~~a~~~l~-~ 388 (394) +... .-..+.+ ++.. .....-.+..-+..| .|..+++|.||++++ + T Consensus 281 ~~~~~~~~~t~~~~~-P~~f~~lpvq~~~~~~~v~~~~rt~Gv~i~rP~ai~~~~GI 336 (336) T protein:vir:10 281 WAPRVEGKDTATCGF-TEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMLGV 336 (336) T ss_pred EEecccCCcceeeec-ChhhhccceeecCceeEeccccceeeeeeeccchheeeccC Confidence 1000 0011111 0000 000011122223344 455566688888876 4 No 216 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=95.81 E-value=0.0014 Score=36.19 Aligned_cols=268 Identities=12% Similarity=0.028 Sum_probs=131.3 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhh-hHHHhccccccccCceeEEEEcCcccccceecCCccccccccceeeEEee Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRM-TIRSLLAQGTMEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTS 189 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~-~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~ 189 (394) +..+.. .- .++-..+...+.+...... ...++++.++.+....++......+.-..|.+| .+...+.=...++. T Consensus 1 m~it~~-~l-~~l~~~~~~~~~~~y~~a~~~~~~~a~~~~sdf~~~~~~~lg~~p~l~e~~Ge---~~~~~l~~~~~~i~ 75 (302) T protein:vir:10 1 MLINKQ-SL-NAAFVAIKTIFNNAFAAAPTTWQKIAMEVPSNTSSNDYKWLSTFPKMRRWIGA---KVVKNLKAYKYVVE 75 (302) T ss_pred CcccHH-HH-HHHHHHHHHHHHHHHHhhhhhhhceeeecCCCcceeeceecCCCCCccccccc---eeeccccccceeEE Confidence 111100 00 0111112222333222222 344555666655556667776665544556543 44555666667899 Q ss_pred eeeEEEeehhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhcc--C-C-----Cc------ccccccccccccc--- Q lcl|NC_019933. 190 AKVIAHWMKASRQILS-DSAQLQSFINARLLRGLEVVEENQLLNGN--G-T-----GQ------NLLGLLPQATAFA--- 251 (394) Q Consensus 190 ~~k~~~~~~is~e~l~-~s~~~~~~i~~~la~a~~~~~d~a~l~g~--g-~-----~~------~~~Gi~~~~~~~~--- 251 (394) .++++..+.|||+.+. |.-.+..-+...++++.++..|+.++.-- | + +. ++.|--...+... T Consensus 76 ~~~~g~~v~i~R~~i~nDdlg~~~~~~~~~G~aaa~~~~~lv~~~L~~g~~~~~~DG~~fF~~dH~~g~~~~~N~g~~~~ 155 (302) T protein:vir:10 76 NEDFEATVEVDRNDIEDDQIGIYSPQAKMAGYSAAQLPDELVYEAVNGAFTKPCFDGQYFIDTDHPVGDASVSNKGTAPL 155 (302) T ss_pred eecccceecccHHhhcccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCCcccCCcceecccccccccccccccchhh Confidence 9999999999999876 44577888889999999999998766421 1 1 11 1211111111100 Q ss_pred -ccccccccchHHHHHHHHHHhhhhc-----CCCCeeEeCHHHHHHHHHhhccCCcccccCcccCCCceeecceEEEcCC Q lcl|NC_019933. 252 -APITVANATAVDRLRLALLQAQLAE-----FPATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPTLWGLPVVATQA 325 (394) Q Consensus 252 -~~~~~~~~~~~~~i~~~~~~~~~~~-----~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~l~G~pv~~~~~ 325 (394) .+........++....++....... ..+..+++.|.....-+.+-.. ++.- .+...+...-+.+++++. T Consensus 156 ~~~~~~l~~~~~~aa~~am~~~k~~~G~~L~i~P~~LiVp~~le~~A~~ll~~-~~~~----~g~~Np~~g~~~~vv~p~ 230 (302) T protein:vir:10 156 SNASQAAAKAGYGAARTAMKKFKDEEGRSLNVSPNVLLVGPALEDVAKMLLTN-PKLA----DNTPNPYVGTAELVVDGR 230 (302) T ss_pred hhcccccchHHHHHHHHHHHHHhhhcccccccCCCEEEecchhHHHHHHHhhc-cccC----CCCcceeccceEEEEeec Confidence 0111122334445555555554333 3466788887776655554321 2211 111122122257777888 Q ss_pred CCcCce--EEeeccc--eEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccc-----eEEEEecCCC Q lcl|NC_019933. 326 MAVGQF--LTGAFDA--GAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPES-----FIKGSLAAAA 392 (394) Q Consensus 326 ~p~~~~--~~gd~~~--~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a-----~~~l~~~~a~ 392 (394) +.+++. ++.|.+. .+++-.++...++. ...|..+.+.++....+++..+-.-+ +++.+..+++ T Consensus 231 L~s~~aWyL~a~~~~i~~~~l~g~~~P~~~~----~~~~~~dgv~~k~~~d~Gvd~R~~~G~~~wq~a~~s~g~~~ 302 (302) T protein:vir:10 231 IESDTAWFLLDTTKPVKPFIFQPRKQPEFVS----QVNLDSDDVFNLRKLKFGAEARAAAGYGFWQLAYGSTGTGA 302 (302) T ss_pred cCCCCceEEEecCCccceEEEcCccccEEEe----ccCCCCCceEEEEEEEEeeeeeeecchhhhhhhhccCccCC Confidence 866543 3334332 11222233444433 22456666777766666643332221 1233333333 No 217 >protein:vir:96079 Length: 382 # NCBI annotation: hypothetical protein ORF023 # Family: family:all:1653 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294440;genbank:gi:149408337;genbank:GeneID:5237198 Probab=95.70 E-value=0.001 Score=36.93 Aligned_cols=322 Identities=12% Similarity=0.078 Sum_probs=136.9 Q ss_pred HHHH-HhhcccccccchhhhhhhhhHHHHHHHHHHhhhhhh--hhHHHHHHH--------hhcc---cccCCcCccccch Q lcl|NC_019933. 60 IAEV-EGNGAGGDVQHISIGQQFVNSDSFKAMAESGGQRGR--AEINIKAAI--------TSLS---TNADGSAGATVQT 125 (394) Q Consensus 60 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~--------~~~~---~~~~~~~g~~ip~ 125 (394) +... ..+..-..+..+.+.........+..+.+.+..-.. .....+... .++. .+..+.++.-||- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~~~~g~p~ 80 (382) T protein:vir:96 1 MSHISKTHSRLAGRHAKPFDLKNVTHEAVAALGRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTTPSIPTPI 80 (382) T ss_pred CCCcceeeeecCCccccchhhhcccHHHHHHHhccccccCcccchhHhhhhhhhhhhhhhcccccccCCccccCCccHHH Confidence 0000 000000011111111111111111111111100000 000011110 0111 1111222333454 Q ss_pred h----hhhHHHhhhhhhhhHHHhcccccccc---CceeEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeeh Q lcl|NC_019933. 126 T----RLPGILELPQRRMTIRSLLAQGTMEG---NTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMK 198 (394) Q Consensus 126 ~----~~~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~ 198 (394) . +.+.|++-+........++++...+. .++.++..... +.+.+.+.+.+.|..+......+-..+.+...+. T Consensus 81 ~~l~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~t~ty~~~e~~-G~A~~ygd~~D~Pl~d~~~~~~~r~v~~~~~g~~ 159 (382) T protein:vir:96 81 QFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPA-GTAVEYGDHTNIPLTSWNANFERRTIVRGELGLL 159 (382) T ss_pred HHHhhhhhhhhhhhhhhhhhhhhccccccCCccceEEEEeeeecc-cceEEeecccCCCccccccceeEEEEEEEEEeee Confidence 4 44456666666667777777766432 24456555443 5566778888889887665555555566666666 Q ss_pred hh-HHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhhccCC--Ccccccccccccccccc---cc----ccccchHHHH Q lcl|NC_019933. 199 AS-RQILSDSA---QLQSFINARLLRGLEVVEENQLLNGNGT--GQNLLGLLPQATAFAAP---IT----VANATAVDRL 265 (394) Q Consensus 199 is-~e~l~~s~---~~~~~i~~~la~a~~~~~d~a~l~g~g~--~~~~~Gi~~~~~~~~~~---~~----~~~~~~~~~i 265 (394) ++ .|+...+. ++.+.-....++++.+.+|+..|.|+.. .+...|+++.+...+.. .. ++....++|| T Consensus 160 yg~lE~~rAa~~~~~l~~~Ka~aA~~ale~~~N~i~f~G~~~g~~~~~yGllNdP~l~a~~t~a~~~Wa~kT~~eI~~Di 239 (382) T protein:vir:96 160 VGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPPFQTPPSQGWATADWAGIIGDI 239 (382) T ss_pred ecHHHHHHHHhhCCCcHHHHHHHHHHHHHHhhceEEEEeeecCcCcceEEEEeCCCcccccccCCCCcccccHHHHHHHH Confidence 64 45555432 5677777888888899999999988533 34567998877643211 11 1222235566 Q ss_pred HHHHHHhhhhcC-------CCCeeEeCHHHHHHHHHhhccCCcccccCcccCCCceeecceEEEcCCCCcCceEEeeccc Q lcl|NC_019933. 266 RLALLQAQLAEF-------PATGIVLNPADWAGIELLKDTQGRYILGNPQGTLAPTLWGLPVVATQAMAVGQFLTGAFDA 338 (394) Q Consensus 266 ~~~~~~~~~~~~-------~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~ 338 (394) ..++..+...-. .+..+++.+..+..|.. .+..|--++.-.... +-++.+...+.+... ..-|.-+. T Consensus 240 ~~l~~~i~~qt~G~~~~~~~~~~L~LP~~~~~~Ls~-~n~~g~Tvl~~lk~n----~Pnl~i~t~peL~~a-~~~g~g~~ 313 (382) T protein:vir:96 240 REAVRQLRIQSQDQIDPKAEKITMALATSKVDYLSV-TTPYGISVSDWIEQT----YPKMRIVSAPELSGV-QMQGKTPE 313 (382) T ss_pred HHHHHHHHhccCCeeeecccceEEeechHHHhhccc-cCccCccHHHHHHHh----cCCcEEEEccccccc-cCCCccce Confidence 666666543321 12257888888877753 233332222111111 112333333222100 00000000 Q ss_pred eEEEEeecceEE--EEecccchhhhc--------CcEEEE--------EEEEeccEEecccceEEEE-e Q lcl|NC_019933. 339 GAQVFDRWAARV--EVATENQDDFIK--------NMVTIL--------AEERLALAVYRPESFIKGS-L 388 (394) Q Consensus 339 ~~~~~~~~~~~i--~~~~~~~~~~~~--------~~~~~~--------~~~~~d~~v~~~~a~~~l~-~ 388 (394) .........+.. ....+....|.+ -.+..+ .....|+.+++|.||++++ + T Consensus 314 ~~~~~~~~e~~~~~~~s~~~p~~f~q~~p~~~~~l~ve~~~~~~~~~~s~~t~Gv~i~~P~ai~~~~GI 382 (382) T protein:vir:96 314 DALVLFVEEVDASVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) T ss_pred eEEEEecchhhhhcccccccCcceeccccceeeeccceeecceeEeccccceeeeEEEcchhhhhccCC Confidence 000000000000 000000000100 000001 1234677788899988876 4 No 218 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=95.45 E-value=0.0012 Score=36.56 Aligned_cols=324 Identities=13% Similarity=0.057 Sum_probs=143.5 Q ss_pred HHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhhhhhhhhHHHHHHHHHHhhhh Q lcl|NC_019933. 18 LKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQHISIGQQFVNSDSFKAMAESGGQR 97 (394) Q Consensus 18 ~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 97 (394) |+ ..+..... ..+.....-.+.........+..+.+.+..- T Consensus 1 ~~-------------------------~~~~~~~~--------------~~~~~~~~~~~~~~~~~~~~~~~l~~~gi~~ 41 (379) T protein:vir:10 1 MP-------------------------QISKIHSS--------------LNARQMTQMVMDSADVTLDNLKHLESYGIHL 41 (379) T ss_pred CC-------------------------Ccceeeee--------------cCccccchhhhccccccHHHHHHHHhcCccc Confidence 00 00000000 0000000000000000111111111111100 Q ss_pred h-hhhHHHHHHHhhcccc----------cC-CcCccccc---hhhhhHHHhhhhhhhhHHHhcccccccc---CceeEEE Q lcl|NC_019933. 98 G-RAEINIKAAITSLSTN----------AD-GSAGATVQ---TTRLPGILELPQRRMTIRSLLAQGTMEG---NTLEYVR 159 (394) Q Consensus 98 ~-~~~~~~~~~~~~~~~~----------~~-~~~g~~ip---~~~~~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~ 159 (394) . .......+..-++... .. +.+..-+| +.|.+.+++.+-....+..++++.+.+. ....++. T Consensus 42 ~~~~~~~~~~~~~amd~~~~~~~~~~~~~l~~~~~~g~~~~l~~~~p~~i~~~tap~~a~~l~pv~t~g~W~~~~~~~~v 121 (379) T protein:vir:10 42 NGRKNKLFELMQFAMDSNDIGPIPTPLSPLSPVSIPGLIQFLQNWLPGHVRILTAVREADEFLGLSTVGQWDDEQIVQRV 121 (379) T ss_pred cchhhhhhhhhhhhhccccccccccccCccccccccchHHHHHhhcchHHHHHhhhhhhhhhcccccCCCceeeeEEEee Confidence 0 0000000000001100 00 00111112 2345677787777777888877766532 2344455 Q ss_pred EcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_019933. 160 ETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSA----QLQSFINARLLRGLEVVEENQLLNGNG 235 (394) Q Consensus 160 ~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~----~~~~~i~~~la~a~~~~~d~a~l~g~g 235 (394) .... +.+.+.+.+.+.|..+.......-..+.++..+.++..=+..+. ++.+.-....++++...+|+..|.|.+ T Consensus 122 ~e~~-G~A~~ygd~~d~pl~d~~~~~~~r~v~~~~~g~~yg~~El~~Aa~~g~~l~~~Ka~aA~~ale~~~N~i~f~G~~ 200 (379) T protein:vir:10 122 LEGL-GTAQPYTDGGNMALMSWTPTFETRTVVRFEAGLQVAPLEEARSSRVQVSSADEKRAMVGEALEVQRNRVAFYGYN 200 (379) T ss_pred eeee-eeeEEeccccCCCeeeeeeeeeeeeeEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeec Confidence 4433 45666788888898887777767777778888888765333332 688888889999999999999999964 Q ss_pred C-Cccccccccccccccc---ccc---------ccccchHHHHHHHHHHhhhhc-------CCCCeeEeCHHHHHHHHHh Q lcl|NC_019933. 236 T-GQNLLGLLPQATAFAA---PIT---------VANATAVDRLRLALLQAQLAE-------FPATGIVLNPADWAGIELL 295 (394) Q Consensus 236 ~-~~~~~Gi~~~~~~~~~---~~~---------~~~~~~~~~i~~~~~~~~~~~-------~~~~~~~~~~~~~~~l~~l 295 (394) . +....|+++.++.... +.+ ++..-.++||..++..+...- ..+..+++.|..+..|..- T Consensus 201 d~~~~~yGllNdP~l~a~~t~atg~~~~t~Wa~kT~~eI~~Di~~~~~~l~~qs~g~~~~~~~~~tL~LP~~~~~~L~~~ 280 (379) T protein:vir:10 201 DGSGRTFGFLNDPNLPAYVAVPNGAGGSPLWAQKTTLEIIADLRNGLTALQVQSMGRIKSNKTPITIGIPNAYENYITTP 280 (379) T ss_pred CCCcceEEEEeCCCCcccccccCCcccccccccCCHHHHHHHHHHHHHHHHHhhCCeecccccceeEEecHHHHHhhccc Confidence 3 4456788887655321 111 112223556666555543221 1233788999998888643 Q ss_pred hccCCcccccCcccCCCceeecceEEEcCCCCcCceEEeeccceEEEEee-cce------EE-EEecccch--hhhcC-- Q lcl|NC_019933. 296 KDTQGRYILGNPQGTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVFDR-WAA------RV-EVATENQD--DFIKN-- 363 (394) Q Consensus 296 kd~~G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~-~~~------~i-~~~~~~~~--~~~~~-- 363 (394) +..|--++.-.... +-++.+...+.+.... |.-+..+++.+. .+. .+ ...++... ..+.. T Consensus 281 -n~~g~Tvl~~lk~n----~Pnl~i~t~pEL~~ag---gg~~~~~~~~~~~~~~~t~~~~~~~~~~p~k~~~l~ve~~~~ 352 (379) T protein:vir:10 281 -TELGYSVAQYMRES----YPNVTFVSAPELNDAN---GGSSAIYYYADAVENNGTDDGRTWLQVVPTKMFTLGVEKKIK 352 (379) T ss_pred -cccCccHHHHHHHh----cCCcEEEEcccccccC---CCccEEEEEeeccCCCccCCcceEEEecchhhhhccceecCc Confidence 33333222211111 1123344333332100 111111122111 000 00 11111100 00001 Q ss_pred cEEEEE-EEEeccEEecccceEEEEec Q lcl|NC_019933. 364 MVTILA-EERLALAVYRPESFIKGSLA 389 (394) Q Consensus 364 ~~~~~~-~~~~d~~v~~~~a~~~l~~~ 389 (394) .+..-. ....|..+++|.||+++.=+ T Consensus 353 ~~~~~~~~rt~Gv~ir~P~Ai~~~~G~ 379 (379) T protein:vir:10 353 GYAEGYTNATAGAMLKRPFATYRQTGA 379 (379) T ss_pred eeEeccccceeeeeeecchhhheecCC Confidence 111122 33356667779999888655 No 219 >protein:vir:93696 Length: 364 # NCBI annotation: Bcep22gp55 # Family: family:all:974 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944284;genbank:gi:38640361;genbank:GeneID:2658350 Probab=95.36 E-value=0.0022 Score=35.13 Aligned_cols=276 Identities=12% Similarity=0.062 Sum_probs=139.7 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHH-hccccc---------c---ccCceeEEEEcCcccccceecCCccc- Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRS-LLAQGT---------M---EGNTLEYVRETGFTNAAAPVAEGAQK- 176 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~-~~~~~~---------~---~~~~~~~~~~~~~~~~~~~~~eg~~~- 176 (394) +..+....+.-.....|+..++....+.+++.. +.+.-. + .|..+++.-.... ....|.+++.. T Consensus 1 Ma~T~~~~~~p~a~~~ws~~l~~~~~~~s~f~~~l~G~~~~~~I~~~~dL~k~~Gd~v~f~L~~~L--~g~gv~Gd~~le 78 (364) T protein:vir:93 1 MSQTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDRITFDLSVHL--RGKPTYGDARVE 78 (364) T ss_pred CceeccCcCCHHHHHHHHHHHHHHHHhhCccccccccCCCCCcEEEeeecCCCCCceEEeeeeeec--ccCCcccCceee Confidence 444444444344456778788877777777665 433110 0 1222332221111 12222222222 Q ss_pred -cccccceeeEEeeeeeEEEeehhhHHHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHhh-ccCCC---------cccccc Q lcl|NC_019933. 177 -PESSLRFDLVQTSAKVIAHWMKASRQILS-DSA-QLQSFINARLLRGLEVVEENQLLN-GNGTG---------QNLLGL 243 (394) Q Consensus 177 -~~~~~~~~~i~~~~~k~~~~~~is~e~l~-~s~-~~~~~i~~~la~a~~~~~d~a~l~-g~g~~---------~~~~Gi 243 (394) .+..++|.+-.+.+..+...+.....+-+ -++ +|...-++.|+..+.+..|..+|. -.|.. ..+.+. T Consensus 79 Gnee~L~~~~~~i~idq~r~~V~~~g~ms~qRt~~dlr~~ar~~L~~w~~~~~d~~~f~~laGarg~~~~~~~~~~~~~~ 158 (364) T protein:vir:93 79 GKEESLRFYQDEVRIDQVRHSVSAGGRMSRKRTVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFIETPDFTGY 158 (364) T ss_pred ccccceeEEeeEEEEeeccccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccCcccc Confidence 23456777777777777766665555443 344 899999999999999999986653 22211 001111 Q ss_pred c-cc---c----------ccccccccccccchHHHHHHHHHHhhhhcCC----------------CCeeEeCHHHHHHHH Q lcl|NC_019933. 244 L-PQ---A----------TAFAAPITVANATAVDRLRLALLQAQLAEFP----------------ATGIVLNPADWAGIE 293 (394) Q Consensus 244 ~-~~---~----------~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~----------------~~~~~~~~~~~~~l~ 293 (394) . +. + .........++..+++.|..+...+...... .=.++|||..+..|+ T Consensus 159 ~~N~v~aPt~~r~~~~~~at~~~~l~stD~~sl~~id~a~~~a~~~~~~~~~~~~~~Pv~~~g~~~yV~~l~p~q~~~Lr 238 (364) T protein:vir:93 159 AGNPLDAPDVDHLLYGGVATSKASLAATDIMAPLVIEKAVEKAAMMQAENPDVANMVPVSIDGDDHYVCVMSEYQATDMR 238 (364) T ss_pred cccccCCCCCCcEEeccccCchhhccccccccHHHHHHHHHHHHHhCCCCCCCcccceeEecCcceeEEEEcchhhhhhh Confidence 0 00 0 0112223344556677776666655443211 116799999988887 Q ss_pred Hhhcc--------------CCcccccCcccCCCceeecceEEEcCCCCcC-------------ceEEeeccceEEEEeec Q lcl|NC_019933. 294 LLKDT--------------QGRYILGNPQGTLAPTLWGLPVVATQAMAVG-------------QFLTGAFDAGAQVFDRW 346 (394) Q Consensus 294 ~lkd~--------------~G~~~~~~~~~~~~~~l~G~pv~~~~~~p~~-------------~~~~gd~~~~~~~~~~~ 346 (394) +-.+. ..+|||. +.-+.+.|.+|+....++.. ..++|-..-++.++... T Consensus 239 ~~t~~~w~d~qk~A~~~~g~~nPlF~----G~~gm~ngvii~~~~~vi~~~~~~~~~~v~~~ralllGaQA~~~a~g~~~ 314 (364) T protein:vir:93 239 TAAGGTWIDFQKAAAAAEGRNNPIFK----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTAN 314 (364) T ss_pred hcCCHHHHHHHHHhhhcccccCCcee----cCeeeEcCeEEeccCCcccccccccCccccchhhheecceeeEEEeecCC Confidence 43321 1133432 33457788888776655321 12334433222222233 Q ss_pred ceEEEEecccchhhhcCcEEEEEEEEeccEEec----ccceEEEEecCCCCC Q lcl|NC_019933. 347 AARVEVATENQDDFIKNMVTILAEERLALAVYR----PESFIKGSLAAAAGT 394 (394) Q Consensus 347 ~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~----~~a~~~l~~~~a~~~ 394 (394) +....+..+..++ .|...+-+...+|++-.+ .-++..|...+.++| T Consensus 315 g~~~~w~Ee~~D~--gn~~~i~~~~i~G~kK~rF~~~DfGvi~idtaa~~~~ 364 (364) T protein:vir:93 315 GLRFDWEETVKDY--GNEPAIAAGFIAGMKKARFNNKDFGVISIDTAAKKHS 364 (364) T ss_pred CCCceeeecccCC--CCchhhhhhhHhhhhhcccCCccceEEEecccccccC Confidence 4444444333322 234445555555555444 344566666777777 No 220 >protein:vir:96666 Length: 462 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238545;genbank:gi:66391271;genbank:GeneID:5130448 Probab=94.84 E-value=0.0034 Score=34.14 Aligned_cols=309 Identities=12% Similarity=-0.012 Sum_probs=141.0 Q ss_pred cccccccchhhhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccc-cC-CcCccccchhhhhHHHhhhh--hhhhHH Q lcl|NC_019933. 67 GAGGDVQHISIGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTN-AD-GSAGATVQTTRLPGILELPQ--RRMTIR 142 (394) Q Consensus 67 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-~~~g~~ip~~~~~~ii~~~~--~~~~l~ 142 (394) .... .+. +.... .......+.-.++....-.++ .+ .++|.+=.+.+.++|..+.. +.-.++ T Consensus 1 ~~~~-~~~-~~~~~-------------~~~~~~~e~~~KS~~tg~g~~p~~q~~~gAlR~esL~~~i~~Lt~~~~~~~~~ 65 (462) T protein:vir:96 1 MHKD-TNL-TAEQN-------------KYADKFQEEVMKSYQTGYGITPDTQVDAGALRREILDDQITMLTWTQDDLIFY 65 (462) T ss_pred Cccc-ccc-chhhh-------------hhhchhhHHHHHHHhcCCCcCCccccccchhhhhhhhhhhheeeecccchhhh Confidence 0000 000 00000 000000011112221111111 11 23444444555555544332 233456 Q ss_pred HhccccccccCceeEEEEc--CcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHH--HHHHHHHHHHHHHHH Q lcl|NC_019933. 143 SLLAQGTMEGNTLEYVRET--GFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQI--LSDSAQLQSFINARL 218 (394) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~--l~~s~~~~~~i~~~l 218 (394) .-+...+..+--.++-... +..+-+.+++|++..+.+++.+.......|-++..-.+|..+ .+...+......++- T Consensus 66 ~~i~k~~a~sTv~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~R~~~~~k~l~~t~~vsi~~tl~n~~~d~~~~~~~da 145 (462) T protein:vir:96 66 REISRRPAQSTVQKYDVYLRHGNVGHSRFVREVGVAPVSDPNIRQKTVEMKYVSDTKNLSIASTLVNNIQDPMQILTEDA 145 (462) T ss_pred hhcCCchhhhhhhhheeeeccCccccccccccccccccCCCceEEEEEEEEEEeeeeeechhhhhccchhhHHHHHHHHH Confidence 6666666665433333332 333456789999999999999999999999999888888764 333357778888999 Q ss_pred HHHHHHHHHHHHhhccCCC--------ccccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHH Q lcl|NC_019933. 219 LRGLEVVEENQLLNGNGTG--------QNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWA 290 (394) Q Consensus 219 a~a~~~~~d~a~l~g~g~~--------~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 290 (394) ...++..++.++|.|+..= -++.||.+.-....+-.......+.+.|..+-..+...+..++-++|+..+.+ T Consensus 146 i~~~a~tiE~a~Fygds~l~~~~~~~gleFDGl~~lI~~~NViDarG~~Ls~~~ln~aa~~i~~~fGt~TD~~~p~~v~a 225 (462) T protein:vir:96 146 IAVVAKTIEWASFYGDASLTADPTGQGLEFDGLAKLIDKDNVIDAKGESLTETLLNRSAVLIGKSFGTATDAYMPIGVHA 225 (462) T ss_pred HHHHHHHHHHHHhhhhcccCCCccccccchhhhhhhcCCCceeecCCCCccHHHHhhhhhhcccccCChhheecchHHHH Confidence 9999999999999987432 34677655433322222223444555555555566677778888999999988 Q ss_pred HHHH-hh-------ccC-CcccccCcc-----cC-----CCceeecceEEEcC------CCCcC-----------ceEEe Q lcl|NC_019933. 291 GIEL-LK-------DTQ-GRYILGNPQ-----GT-----LAPTLWGLPVVATQ------AMAVG-----------QFLTG 334 (394) Q Consensus 291 ~l~~-lk-------d~~-G~~~~~~~~-----~~-----~~~~l~G~pv~~~~------~~p~~-----------~~~~g 334 (394) .|.. .- .++ |+....-.. .. .+.++++.|.+... .+|.. ...|+ T Consensus 226 ~f~~~~l~~qrv~~~~n~g~~~~G~~v~~f~s~~G~I~L~~s~~m~~~~i~~~~~~~~p~ap~~~~vsaTv~t~~~g~f~ 305 (462) T protein:vir:96 226 DFVNSVLGRQMQLMQDNSGNVNAGYNVQGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPATVKATVETGKKGLFT 305 (462) T ss_pred HHHHhhcCceEEEEcCCCCceeeeeeccceeeeeeeeeeCCceecCcccccccccccCCCCCCCCceeEEEEeCCCCCCC Confidence 8873 11 122 211110000 00 11122233333221 22221 11223 Q ss_pred eccceEEEEeecceEEEEecccchhh----------hcCcEEEEEEEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 335 AFDAGAQVFDRWAARVEVATENQDDF----------IKNMVTILAEERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 335 d~~~~~~~~~~~~~~i~~~~~~~~~~----------~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) |........+. + ..++.+....- ..+.+.+-..+- .+.-..|+-+.+.+....+|+ T Consensus 306 ~~~d~~~y~Y~--V-~avs~dgeS~PS~~VtaTva~~~~gv~ltIt~~-a~~~~~~~~~~IYRk~~~sg~ 371 (462) T protein:vir:96 306 DEHDRAELTYK--V-VVNSDDAQSAPSEAVTATVNNATDGVKLEISVN-AMYQQQPQFVSIYRQGRKTGD 371 (462) T ss_pred CccCceeEEEE--E-EEECCCCccccceeeEeeeecccccceEEEEEc-CCccccceEEEEEeecCCccc Confidence 32110000000 0 00011100000 001111111111 111112233444444333333 No 221 >protein:vir:80835 Length: 464 # NCBI annotation: putative major capsid protein # Family: family:all:2450 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504125;genbank:gi:158079312;genbank:GeneID:5666484 Probab=94.37 E-value=0.0046 Score=33.39 Aligned_cols=308 Identities=13% Similarity=-0.020 Sum_probs=132.6 Q ss_pred hhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhccccc--CCcCccccchhhhhHHHhhhh--hhhhHHHhcccccccc Q lcl|NC_019933. 77 IGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNA--DGSAGATVQTTRLPGILELPQ--RRMTIRSLLAQGTMEG 152 (394) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~g~~ip~~~~~~ii~~~~--~~~~l~~~~~~~~~~~ 152 (394) +....... .......+...++....-.++. -.+|+.+=.+.+.++|-.+.. +.-.++.-+...+..+ T Consensus 1 ~~~~~n~~---------~~~~~~~e~~~Ks~ttgy~~~p~~q~~~~AlRrEsL~~~i~~Lt~~~~~f~f~~di~k~~a~S 71 (464) T protein:vir:80 1 MTEKKNTE---------RQLTSVQEEVIKGFTTGYGITPESQTDAAALRREFLDDQITMLTWADGDLSFYRDITKRPATS 71 (464) T ss_pred CCcchhhH---------hhcCcccHHHHHHHHhCCccCcccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhh Confidence 10000000 0000111111233332222221 123455555556666544332 3335566666667665 Q ss_pred CceeEEEEc--CcccccceecCCccccccccceeeEEeeeeeEEEeehhhH--HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 153 NTLEYVRET--GFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASR--QILSDSAQLQSFINARLLRGLEVVEEN 228 (394) Q Consensus 153 ~~~~~~~~~--~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~--e~l~~s~~~~~~i~~~la~a~~~~~d~ 228 (394) --.+|-... +..+-+.+++|++..+.+++.+.......|-+...=-+|- .+.+.-.+-.....++-.-.++..++. T Consensus 72 TV~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~Kfl~~~r~vsia~~lvn~~~d~~~~~~~dai~~va~tiE~ 151 (464) T protein:vir:80 72 TVAKYDVYLAHGRVGHTRFTREIGVAPISDPNLRQKTVNMKYVSDTKNMSIATGLVNNIEDPMRILTDDAISVVAKTIEW 151 (464) T ss_pred hhhhhheeeccCccccccccccccccccCCCceEEEEEEeeeeecceeeeeehhhhcchhhHHHHHHHHHHHHHHHHHHH Confidence 433333332 3334567899999999999999999888775554333333 334433466667777888889999999 Q ss_pred HHhhccCCCc---------cccccccccccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHH-HHhhcc Q lcl|NC_019933. 229 QLLNGNGTGQ---------NLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGI-ELLKDT 298 (394) Q Consensus 229 a~l~g~g~~~---------~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l-~~lkd~ 298 (394) ++|.|+..=. ++.||.+.-....+-.......+.+.|..+-..+...+..++-++|+..+.+.+ ...-+. T Consensus 152 a~FyGds~l~~~~~~~~gleFDGl~~lI~~~NViDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~v~a~f~n~~l~~ 231 (464) T protein:vir:80 152 ASFYGDSDLSENPDAGSGLEFDGLAKLIDKHNVLDAKGASLTEALLNQASVLVGKGYGTPTDAYMPIGVQADFVNQQLDR 231 (464) T ss_pred HHhhhccccCCCCCCccccchhhhHhhcCCCceeecCCCCcCHHHHhhhhhhhhcccCChhhcccchhHHHHHHhhhcCc Confidence 9999874322 466766544333222223334455556556566667777888899999998765 433222 Q ss_pred CCcccccCccc-------------CCCceeecceEEEcCC-----------CCcC-----------ceEEe------ecc Q lcl|NC_019933. 299 QGRYILGNPQG-------------TLAPTLWGLPVVATQA-----------MAVG-----------QFLTG------AFD 337 (394) Q Consensus 299 ~G~~~~~~~~~-------------~~~~~l~G~pv~~~~~-----------~p~~-----------~~~~g------d~~ 337 (394) +-+.+..+... .+.-.|.|--+...+. .|+. ...|+ +.+ T Consensus 232 q~~~~~~n~~~~~~G~~v~~f~sa~G~i~L~~s~~m~~~~~ld~~~~~~~~apaapsvt~tv~~~~~g~f~~~~~~~~~~ 311 (464) T protein:vir:80 232 QVQVISDNGQNATMGFNVKGFNSARGFIRLHGSTVMELEQILDENRMQLPNAPQKATVKATLEAGTKGKFRDEDLTIDTE 311 (464) T ss_pred eeEEEcCCCCcceeeeecccccccccceeccCccccCcccccccccccCCCCcCCceeEEEecCCcccCCccccccceeE Confidence 21111100000 0001122221111111 1110 01111 111 Q ss_pred ceEEEEeecceEEEEe-cccchhhhcCcEEEEEEEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 338 AGAQVFDRWAARVEVA-TENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 338 ~~~~~~~~~~~~i~~~-~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) ..+...+..+-++-.. -.....-....+.+.... -.+.-..|+-+.+......+|. T Consensus 312 Ykv~~vn~~GeS~ps~~~~~ti~~~~~~V~l~it~-~~~~~~~p~yv~IYR~~~~~g~ 368 (464) T protein:vir:80 312 YKVVVVSDDAESAPSDVASVVIDDKKKQVKLEITI-NNMYQARPQYVAIYRKGLETGL 368 (464) T ss_pred EEEEEECCCCccccceeeeeeecCcccEEEEEEEe-CCccccccceEEEEeecCCCCc Confidence 1111111111000000 000000011112222111 1111111233333333333222 No 222 >protein:vir:94870 Length: 318 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762518;genbank:gi:115304217;genbank:GeneID:5141183 Probab=94.02 E-value=0.0044 Score=33.50 Aligned_cols=301 Identities=13% Similarity=0.065 Sum_probs=147.5 Q ss_pred cccccccchhhhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhh---cccccCCcCccccchhhhhHHHhhhhhhhhHHH Q lcl|NC_019933. 67 GAGGDVQHISIGQQFVNSDSFKAMAESGGQRGRAEINIKAAITS---LSTNADGSAGATVQTTRLPGILELPQRRMTIRS 143 (394) Q Consensus 67 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~ 143 (394) ..+..... ..-...+..+... ....+.+++-++ -+.-+.++.-+-+|-.+...|...+...+|++. T Consensus 1 mtnfiesq------navteffdvlkkn-----sgkseiknawnaklaengvtitdttfqlprklvesintallntnpvfk 69 (318) T protein:vir:94 1 MTNFIESQ------NAVTEFFDVLKKN-----SGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFK 69 (318) T ss_pred Cccchhhh------hhHHHHHHHHhcc-----cChhhhhhhhhhhhhhCCceeecchhhhHHHHHHhhhhhhccCCccee Confidence 11111100 1111111111111 111122221111 111122334566788888889888999999988 Q ss_pred hccccccccCceeEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHH--HHHH-HHHHHHHHHHHHH Q lcl|NC_019933. 144 LLAQGTMEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQI--LSDS-AQLQSFINARLLR 220 (394) Q Consensus 144 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~--l~~s-~~~~~~i~~~la~ 220 (394) .+-+.+++..- ..|.......+...-.|+.+.+...++.--++.|-.++....+...+ +++| ..+...|..++.. T Consensus 70 vfhvtnvgall--vsrsfdssneaqvhkdgqtkteqaatltidtlepvmvyklqslaervkrlqmsyselynlivaeltq 147 (318) T protein:vir:94 70 VFHVTNVGALL--VSRSFDSSNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQ 147 (318) T ss_pred eeeehhhhhee--eeccccccchhhhhcccccccccceeeeecccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHH Confidence 77776665432 22222334566777788888887777776677777776666666554 4455 5788889999999 Q ss_pred HHHHHH-HHHHhhccCCCccccccccccccccc-----cccccccc-hHHHHHHHHHHhhhhcCCCCeeEeCHHHHHH-H Q lcl|NC_019933. 221 GLEVVE-ENQLLNGNGTGQNLLGLLPQATAFAA-----PITVANAT-AVDRLRLALLQAQLAEFPATGIVLNPADWAG-I 292 (394) Q Consensus 221 a~~~~~-d~a~l~g~g~~~~~~Gi~~~~~~~~~-----~~~~~~~~-~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~-l 292 (394) +|..++ |-+++-|+|++. ++.|-..+..... ....++.+ ..+.|..+..-+.+-..+ -.+++......+ | T Consensus 148 aivnkivdlalvegdgtng-fksidkeadvkkikkittkaksagktpfadaieeavdfvrptagr-rylivktedrkall 225 (318) T protein:vir:94 148 AIVNKIVDLALVEGDGTNG-FKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGR-RYLIVKTEDRKALL 225 (318) T ss_pred HHHhhhhheeeeecCCcch-hhhhchhhhHHHHHHhhhhhhhcCCCchhHHHHHHHhhhccCCCc-eEEEEeccchHHHH Confidence 988775 778888988754 4445444433211 11112333 344555555444433222 245555555443 4 Q ss_pred HHhhccCCcccccCcccCCC-ceeecce-EEE-cCCCCcCceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEE Q lcl|NC_019933. 293 ELLKDTQGRYILGNPQGTLA-PTLWGLP-VVA-TQAMAVGQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILA 369 (394) Q Consensus 293 ~~lkd~~G~~~~~~~~~~~~-~~l~G~p-v~~-~~~~p~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 369 (394) ..++.+..+.-.....+... ..-.|.. +++ +..-.-...++.|.+. .+ +.++++- -....|..|.-.+.. T Consensus 226 delrqatananvriknddteiasevgvdeiivytgskavkptvlvdqky--hi-dmqdltk----vdafewktnsnmilv 298 (318) T protein:vir:94 226 DELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSKAVKPTVLVDQKY--HI-DMQDLTK----VDAFEWKTNSNMILV 298 (318) T ss_pred HHHHhhhcccceEEeccchhhhhhcCcceeEEeeccccccceeEeccce--ec-chhhhhh----hhceeeccCCceEEE Confidence 55665443221111111110 0111221 111 1111111123344432 11 1122211 112235566666666 Q ss_pred EEEeccEEecccceEEEEec Q lcl|NC_019933. 370 EERLALAVYRPESFIKGSLA 389 (394) Q Consensus 370 ~~~~d~~v~~~~a~~~l~~~ 389 (394) +..-.+.+...+|=+++++. T Consensus 299 etltsghvetynagavitvs 318 (318) T protein:vir:94 299 ETLTSGHVETYNAGAVITVS 318 (318) T ss_pred EecccCcceeecCceeEEeC Confidence 66677777777777777776 No 223 >protein:vir:99576 Length: 388 # NCBI annotation: hypothetical protein # Family: family:all:1653 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039801;genbank:gi:126011051;genbank:GeneID:4818271 Probab=93.90 E-value=0.00076 Score=37.70 Aligned_cols=326 Identities=13% Similarity=0.066 Sum_probs=137.3 Q ss_pred HHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhhhhhh----hhHHHHHHHHHH Q lcl|NC_019933. 18 LKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQHISIGQQF----VNSDSFKAMAES 93 (394) Q Consensus 18 ~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~ 93 (394) |+. +...-..| .+.......+.... ........+.+. T Consensus 1 ~~~-------------------------~~~~~~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 41 (388) T protein:vir:99 1 MKQ-------------------------LSKVHQSL--------------AGRSVRAFDMANGKADYRLTDMAVRELKKF 41 (388) T ss_pred CCC-------------------------ccceeeec--------------CCcccchhhhhcCCcceeeechhhHhhhhc Confidence 000 00000000 00000000000000 000000001110 Q ss_pred hhhhhhhh----------HHHHH-HHhhcccccCCcCccccchhhhh----HHHhhhhhhhhHHHhcccccccc---Cce Q lcl|NC_019933. 94 GGQRGRAE----------INIKA-AITSLSTNADGSAGATVQTTRLP----GILELPQRRMTIRSLLAQGTMEG---NTL 155 (394) Q Consensus 94 ~~~~~~~~----------~~~~~-~~~~~~~~~~~~~g~~ip~~~~~----~ii~~~~~~~~l~~~~~~~~~~~---~~~ 155 (394) +..-.... ..... +..+...+..+.++.-||-.+.+ .|++-+........++++.+.+. ..+ T Consensus 42 g~~~~~~~~~~~~~~~~~~~~~~~a~da~~~~~~t~~~~gip~~~~~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~~~ 121 (388) T protein:vir:99 42 GLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEI 121 (388) T ss_pred ceeccCccchhhhhhhhhhhhhhcccCcccccccccCcccHHHHHhhhhccceeeeeechhhhhhhccccccCCccceeE Confidence 00000000 00000 00011111122233335554444 55666666666677777766432 244 Q ss_pred eEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019933. 156 EYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSA----QLQSFINARLLRGLEVVEENQLL 231 (394) Q Consensus 156 ~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~----~~~~~i~~~la~a~~~~~d~a~l 231 (394) .++.... .+.+.+.+.+.++|..+......+-..+.++..+.++.+=+..+. ++.+.-....++++.+.+|+..| T Consensus 122 ~f~v~e~-~G~A~~ygd~~D~Pl~d~~~~~~~r~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~AA~~ale~~~N~i~f 200 (388) T protein:vir:99 122 VQGIVEP-AGTAMEYGDLTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGF 200 (388) T ss_pred EEeeeec-ceeEEEeecccCCCceeccceeeeeeEEEEEeeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHhhhceEEE Confidence 5555443 355667788889999887766666677777777778765444332 67888888889999999999999 Q ss_pred hccCCC--cccccccccccccccccc-----------ccccchHHHHHHHHHHhhhhcC-------CCCeeEeCHHHHHH Q lcl|NC_019933. 232 NGNGTG--QNLLGLLPQATAFAAPIT-----------VANATAVDRLRLALLQAQLAEF-------PATGIVLNPADWAG 291 (394) Q Consensus 232 ~g~g~~--~~~~Gi~~~~~~~~~~~~-----------~~~~~~~~~i~~~~~~~~~~~~-------~~~~~~~~~~~~~~ 291 (394) .|.... ....|+++.++....... ++..-.++||..++..+...-. .+..+++.+..+.. T Consensus 201 ~G~~g~~~~~~yGllNdP~l~a~v~at~~~~~~~Wa~kT~~eI~~Di~~~~~~i~~qs~g~~~~~~~~~tL~LP~~~~~~ 280 (388) T protein:vir:99 201 YGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDM 280 (388) T ss_pred EeecCCCccceEEEeeCCCcccccccccCCcCcccccCCHHHHHHHHHHHHHHHHHhcCCeeeecccceEEEechHHHHh Confidence 885432 346788887554321110 1222235666666666543321 12267888888888 Q ss_pred HHHhhccCCcccccCcccCCCceeecceEEEcCCCC-----cC-ce-E-EeeccceEEEE-eecceEEEE-ecccc--hh Q lcl|NC_019933. 292 IELLKDTQGRYILGNPQGTLAPTLWGLPVVATQAMA-----VG-QF-L-TGAFDAGAQVF-DRWAARVEV-ATENQ--DD 359 (394) Q Consensus 292 l~~lkd~~G~~~~~~~~~~~~~~l~G~pv~~~~~~p-----~~-~~-~-~gd~~~~~~~~-~~~~~~i~~-~~~~~--~~ 359 (394) |... +..|--++.-.... +-++.+...+.+. .+ .. + +.+--...... .....+... .+... .. T Consensus 281 Ls~~-n~~g~Tvl~~lk~n----~Pnl~i~t~pEl~~a~~tgg~~~~~~~~~~~~~~~~~~~~~~~t~~~~~p~~~~~l~ 355 (388) T protein:vir:99 281 LSVV-TDLGISVRDWLKQT----YPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLG 355 (388) T ss_pred cccc-CcCCccHHHHHHHh----cCCcEEEEecccccccccCCceeEEEEecccccccccCccCcceeEEeccccccccc Confidence 8533 33332222111111 1123333222221 01 11 0 00000000000 000000000 00000 00 Q ss_pred hhcCcEEE--E-EEEEeccEEecccceEEEE-e Q lcl|NC_019933. 360 FIKNMVTI--L-AEERLALAVYRPESFIKGS-L 388 (394) Q Consensus 360 ~~~~~~~~--~-~~~~~d~~v~~~~a~~~l~-~ 388 (394) .+.....| - .....|..+++|.||++++ + T Consensus 356 vq~~~~~~~~~~~~rt~Gv~ir~P~Ai~~~~GI 388 (388) T protein:vir:99 356 VEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) T ss_pred ceecCceeEeccccceeeeEEeccchhheeccC Confidence 01111111 1 2233566777899988876 4 No 224 >protein:vir:63741 Length: 468 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547622;genbank:GeneID:3783474 Probab=92.62 E-value=0.011 Score=31.40 Aligned_cols=294 Identities=11% Similarity=-0.019 Sum_probs=134.4 Q ss_pred ccccccchhhhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccc--cCCcCccccchhhhhHHHhhhhh--hhhHHH Q lcl|NC_019933. 68 AGGDVQHISIGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTN--ADGSAGATVQTTRLPGILELPQR--RMTIRS 143 (394) Q Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~g~~ip~~~~~~ii~~~~~--~~~l~~ 143 (394) .+...+...+.... ....++.-.++....-.++ +-.+|+.+=-+.+..+|..+... .-.++. T Consensus 1 ~~~~~~~~~~~~~~--------------~~~~~e~~~Ks~~agy~~~p~~q~~~~AlR~EsL~~~i~~L~~~~~~f~~~~ 66 (468) T protein:vir:63 1 MPKNNKEEEVKEVN--------------LNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYK 66 (468) T ss_pred CCCCcchhhccccC--------------hhHHHHHHHHHHHcCcccCCccccCcchhhhhhhhhhhheeeecccchhhhh Confidence 11100100010000 0011111122222222111 11234444455566666544333 234455 Q ss_pred hccccccccCceeEEEEc--CcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHH--HHHHHHHHHHHHHHHH Q lcl|NC_019933. 144 LLAQGTMEGNTLEYVRET--GFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQI--LSDSAQLQSFINARLL 219 (394) Q Consensus 144 ~~~~~~~~~~~~~~~~~~--~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~--l~~s~~~~~~i~~~la 219 (394) -+...+..+--.+|-... +..+-+.+++|++..+.+++.+.......|-++....+|.-+ .+.-.+......++-. T Consensus 67 di~k~~a~stv~~y~~~~~~G~~g~~~f~~E~g~~~~~~~~~~r~~~~~k~l~~~~~vs~~~~l~n~i~d~~~~~~~~ai 146 (468) T protein:vir:63 67 DIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAI 146 (468) T ss_pred hcccchhhhhhhhheeeeccCccccccccccccccccCCCceEEEEEEeeeeeeeeeehhhhhhhcchhhHHHHHHHHHH Confidence 555566554333333332 333456789999999999999999999999999988888764 3333477788888999 Q ss_pred HHHHHHHHHHHhhccCCCc---------ccccccccccccccccccc-ccchHHHHHHHHHHhhhhcCCCCeeEeCHHHH Q lcl|NC_019933. 220 RGLEVVEENQLLNGNGTGQ---------NLLGLLPQATAFAAPITVA-NATAVDRLRLALLQAQLAEFPATGIVLNPADW 289 (394) Q Consensus 220 ~a~~~~~d~a~l~g~g~~~---------~~~Gi~~~~~~~~~~~~~~-~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 289 (394) ..++..++.++|.|+..-. .+.||...-... .....- ...+...|..+...+-..+..++-++|+..+. T Consensus 147 ~~~a~tiE~a~FyGds~l~~s~~~~~glqfDGi~~li~~e-nviDa~G~~ls~~~lneaa~~i~~gfG~~td~~~~~~v~ 225 (468) T protein:vir:63 147 VNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQD-NVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQ 225 (468) T ss_pred HHHHHHHHHHhhhcccccccCCCccccccccceeEEecCC-ceeccCCCccCHHHHHHHhhhccccccChhhhhcchhHH Confidence 9999999999999875431 345665433221 111122 22344444444444445566777788998887 Q ss_pred HHH-HHhhccCCcccccCcccCCCceeecceEEEcCCCCc-------CceEEeeccceEEEEeecceEEEEecccchhhh Q lcl|NC_019933. 290 AGI-ELLKDTQGRYILGNPQGTLAPTLWGLPVVATQAMAV-------GQFLTGAFDAGAQVFDRWAARVEVATENQDDFI 361 (394) Q Consensus 290 ~~l-~~lkd~~G~~~~~~~~~~~~~~l~G~pv~~~~~~p~-------~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~ 361 (394) +.| ...-.. +..+. .+.+.....|.||- ..+.+ +..++++... .-.+..+... .+.+ T Consensus 226 a~~~~~~L~~--q~~v~--~~n~~~~~~G~~v~--g~~sa~G~I~l~gs~il~~~~~--l~~~~~~~~~--Apsp----- 290 (468) T protein:vir:63 226 ADFVNQQLSK--QTQLV--RDNGNNVSVGFNIQ--GFHSARGFIKLHGSTVMENEQI--LDERILALPT--APQP----- 290 (468) T ss_pred hhhhhhhcCc--eEEEE--cCCCCceeeeeccc--ceecceeeeeecCceeeccccC--CCcccccccc--cccC----- Confidence 666 222211 12221 12223345666662 11211 1122222211 0000000000 0000 Q ss_pred cCcEEEEEEEEeccEEec-ccceEEEEecCCCCC Q lcl|NC_019933. 362 KNMVTILAEERLALAVYR-PESFIKGSLAAAAGT 394 (394) Q Consensus 362 ~~~~~~~~~~~~d~~v~~-~~a~~~l~~~~a~~~ 394 (394) . .+-+....+++-.+ ....+..+++.+.-+ T Consensus 291 -~--~vsaT~~~~~~g~~~~~~~a~y~Y~v~~vs 321 (468) T protein:vir:63 291 -A--KVTATQEAGKKGQFRAEDLAAHEYKVVVSS 321 (468) T ss_pred -C--ccceeeecccCCcccCCCcceEEEEEEEEC Confidence 0 11112222222111 111111222222222 No 225 >protein:vir:80491 Length: 467 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468466;genbank:gi:157325041;genbank:GeneID:5601449 Probab=92.46 E-value=0.011 Score=31.26 Aligned_cols=293 Identities=11% Similarity=-0.029 Sum_probs=135.0 Q ss_pred ccccchhhhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccc--cCCcCccccchhhhhHHHhhhhh--hhhHHHhc Q lcl|NC_019933. 70 GDVQHISIGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTN--ADGSAGATVQTTRLPGILELPQR--RMTIRSLL 145 (394) Q Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~g~~ip~~~~~~ii~~~~~--~~~l~~~~ 145 (394) -+...+..- ........++.-.|+....-.++ +-.+|+.+=-+.+..+|..+... .-.++.-+ T Consensus 1 ~~~~~~~~~-------------~~~n~~~~~e~~~Ks~~agy~~~p~tq~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~di 67 (467) T protein:vir:80 1 MPKNNKEEV-------------KEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDI 67 (467) T ss_pred CCCcchhhh-------------hhcccccCHHHHHHHHHcccccCCccccCcchhhhhhhhhhhheeeccccchhhhhhc Confidence 000000000 00001111222233322222211 11234555455566666544333 23345555 Q ss_pred cccccccCceeEEEEc--CcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHH--HHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 146 AQGTMEGNTLEYVRET--GFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQI--LSDSAQLQSFINARLLRG 221 (394) Q Consensus 146 ~~~~~~~~~~~~~~~~--~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~--l~~s~~~~~~i~~~la~a 221 (394) ...+..+--.+|-... +..+-+.+++|++..+.+++.+.......|-++....+|.-+ .+.-.+......++-... T Consensus 68 ~k~~a~stv~~y~~~~~~G~~g~~~f~~E~g~~~~~~~~~~r~~~~~k~l~~~~~vs~~~~l~n~i~d~~~~~~~~ai~~ 147 (467) T protein:vir:80 68 AKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVN 147 (467) T ss_pred ccchhhhhhhhheeeeccCccccccccccccccccCCCceEEEEEEeeeeeeeeeehhhhhhhcchhhHHHHHHHHHHHH Confidence 5556554333333332 333456789999999999999999999999999988888764 333347778888899999 Q ss_pred HHHHHHHHHhhccCCCc---------ccccccccccccccccccc-ccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHH Q lcl|NC_019933. 222 LEVVEENQLLNGNGTGQ---------NLLGLLPQATAFAAPITVA-NATAVDRLRLALLQAQLAEFPATGIVLNPADWAG 291 (394) Q Consensus 222 ~~~~~d~a~l~g~g~~~---------~~~Gi~~~~~~~~~~~~~~-~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 291 (394) ++..++.++|.|+..-. .+.||...-... .....- ...+...|..+...+-..+..++-++|+..+.+. T Consensus 148 ~a~tiE~a~FyGds~l~~s~~~~~glqfDGi~~li~~e-nviDa~G~~ls~~~lneaa~~i~~gfG~~td~~~p~~v~a~ 226 (467) T protein:vir:80 148 IAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQD-NVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQAD 226 (467) T ss_pred HHHHHHHHhhhcccccccCCCccccccccceeEEecCC-ceeccCCCccCHHHHHHHhhhccccccChhhhhcchhHHhh Confidence 99999999999875431 345665433221 111122 2234444444444444556677778899888766 Q ss_pred H-HHhhccCCcccccCcccCCCceeecceEEEcCCCCc-------CceEEeeccceEEEEeecceEEEEecccchhhhcC Q lcl|NC_019933. 292 I-ELLKDTQGRYILGNPQGTLAPTLWGLPVVATQAMAV-------GQFLTGAFDAGAQVFDRWAARVEVATENQDDFIKN 363 (394) Q Consensus 292 l-~~lkd~~G~~~~~~~~~~~~~~l~G~pv~~~~~~p~-------~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~ 363 (394) | ...-.. +..+. .+.+.....|.||- ..+.+ +..++++... .-.+..+... .+.+ . T Consensus 227 ~~~~~L~~--q~~v~--~~n~~~~~~G~~v~--g~~sa~G~I~l~gs~il~~~~~--l~~~~~~~~~--Apsp------~ 290 (467) T protein:vir:80 227 FVNQQLSK--QTQLV--RDNGNNVSVGFNIQ--GFHSARGFIKLHGSTVMENEQI--LDERILALPT--APQP------A 290 (467) T ss_pred hhhhhcCc--eEEEE--cCCCCceeeeeccc--ceecceeeeeecCceeeccccC--CCcccccccc--cccC------C Confidence 6 222211 12221 12223345666662 11211 1122222211 0000000000 0000 0 Q ss_pred cEEEEEEEEeccEEec-ccceEEEEecCCCCC Q lcl|NC_019933. 364 MVTILAEERLALAVYR-PESFIKGSLAAAAGT 394 (394) Q Consensus 364 ~~~~~~~~~~d~~v~~-~~a~~~l~~~~a~~~ 394 (394) .+-+....+++-.+ ....+..+++.+.-+ T Consensus 291 --~vsaT~~~~~~g~~~~~~~a~y~Y~v~~vs 320 (467) T protein:vir:80 291 --KVTATQEAGKKGQFRAEDLAAHEYKVVVSS 320 (467) T ss_pred --ccceeeecccCCcccCCCcceEEEEEEEEC Confidence 11112222222111 111111222222222 No 226 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=90.73 E-value=0.019 Score=29.98 Aligned_cols=261 Identities=10% Similarity=-0.040 Sum_probs=122.7 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHHhc--cccccccCceeEEEEcCcccccceecCCccccccccceeeEEe Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRSLL--AQGTMEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQT 188 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~ 188 (394) |.... -+.+...+.+.+...+....+. ...-.+|+++++|+....+...+-.+.|-..+.-+.++...++ T Consensus 1 Main~--------a~~~~~~Ld~~~~~~~~t~~l~~~~~~~~ggktVkI~~i~~~gl~DY~R~~g~~~g~v~~~~et~tl 72 (290) T protein:vir:78 1 MAINY--------VDKYGKELDQKLVFGTYTNELETPNLLWLDAKTFKIQTITTTGLKAHTRNKGYNEGSASNTNKSYTI 72 (290) T ss_pred CchhH--------HHHHHHHHHHHHHhhheeeeccccceeeccCCEEEEeeeccCcccccccCCCcccCccccceeeEEe Confidence 11100 1234444444444443322222 1222456789999987543333333333333334455666666 Q ss_pred eeeeEEEeehhhHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccccccchHHHH Q lcl|NC_019933. 189 SAKVIAHWMKASRQILSDS---AQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVANATAVDRL 265 (394) Q Consensus 189 ~~~k~~~~~~is~e~l~~s---~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i 265 (394) ...+.-.+. |-.-=++.+ ..+.....+.....++-.+|.-.+.---+.....+ ...+.+.+....++.+ T Consensus 73 ~qdR~~~F~-vD~~DvDEt~~~~~~~nv~~ef~~~~v~PEiDayr~skla~~a~~~~-------~~~~~t~t~~n~~~~i 144 (290) T protein:vir:78 73 DFDRDVEFF-VDVMDVDETGQALSAANVTKEFNSRHAGPEMDAYRFSKLATAAKTNS-------NSVAEEITKDNVFTKL 144 (290) T ss_pred eccccceee-ccccchhHHhhhhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhhhccC-------cccccccCHHHHHHHH Confidence 666643332 110001211 24556666677777777788665531100000000 0111123445678888 Q ss_pred HHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcc----cccCcccCCCceeecceEEEcCC---C-----------C Q lcl|NC_019933. 266 RLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRY----ILGNPQGTLAPTLWGLPVVATQA---M-----------A 327 (394) Q Consensus 266 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~----~~~~~~~~~~~~l~G~pv~~~~~---~-----------p 327 (394) .+++..+......+-.++|+|.++..|..-..-.... .-+....+..+.|.|.+|+..+. + + T Consensus 145 ~~~~~~ldevp~~~rvl~vtp~~~~lL~~~~~f~r~~~~~~~~~~~i~~~V~~idG~~ii~vps~~r~~t~~~f~~G~~~ 224 (290) T protein:vir:78 145 KAAIRKVKKYGTQNLVMYVSPDVMAALELSDDFVRAINVQNIGPSSIETRITAIDGTRIVEVEAEDRFYDTFDFTDGYKP 224 (290) T ss_pred HHHHHHHHhcCCCCeEEEECHHHHHHHhhChhhhccccccccccccccceeeeecCcEEEEecccchhhhhhhhcccccc Confidence 8888888766555667899999988876433222211 11122334457899999987552 1 1 Q ss_pred cC-----ceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEeccc-ceEEEEecC Q lcl|NC_019933. 328 VG-----QFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPE-SFIKGSLAA 390 (394) Q Consensus 328 ~~-----~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~-a~~~l~~~~ 390 (394) .. ..+++.. .+..-..... .+.+.. ++.+-+-+...+.-..|.|.-+.+.. .-.++.... T Consensus 225 ~~~ak~in~ii~~~-~a~i~~~K~~-~~~~~~-P~~~~~~d~~~~~~r~y~d~~v~~nk~~~i~~~~~~ 290 (290) T protein:vir:78 225 AAGAKKLNFLLVNK-GSVVGGAKHA-SIYLHA-PGSVGQGDGWLYQYRVYHDIFVLDQQKDGVIASTEV 290 (290) T ss_pred cCCccceeEEEEcC-Cceeeeeeee-EEEeeC-CCCCcCcceeeeeeeeeeeeeeeccccCeeEEEeeC Confidence 10 1122221 1111111111 222222 22222234456666777777777643 233333333 No 227 >protein:vir:7214 Length: 521 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049787;genbank:gi:9632597;genbank:GeneID:1258751 Probab=90.29 E-value=0.022 Score=29.71 Aligned_cols=356 Identities=10% Similarity=0.038 Sum_probs=129.5 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhhhhh Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQHISIGQQ 80 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~ 80 (394) ||+-+.|.+++.-+.+. +.+-+....... ..+++ +|++.......+...+.+.... T Consensus 3 ~~~~~~l~~kw~p~l~~-~~~~~i~~~~~~----~~a~~----------------~enq~~~~~~~~~~~~~~~~~~--- 58 (521) T protein:vir:72 3 IKTKAELLNKWKPLLEG-EGLPEIANSKQA----IIAKI----------------FENQEKDFQTAPEYKDEKIAQA--- 58 (521) T ss_pred cchhHHHHHhhhhhhcc-CCCCccccchhh----hhhhh----------------hhhhhhhhhhcccccchHHHHH--- Confidence 88888899999877654 111111100000 00010 0000000000000000000000 Q ss_pred hhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEE Q lcl|NC_019933. 81 FVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRE 160 (394) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 160 (394) +........... ..... -.....++++.+-.-..|.++ .++++........+++.+.||++.+.-+.-. T Consensus 59 ------~~~~l~e~~~~~--~~~~~--~~~iaes~~t~~v~~~~P~Li-~lvRra~p~LIa~DIwGVQPMTgPTGLIFAM 127 (521) T protein:vir:72 59 ------FGSFLTEAEIGG--DHGYN--ATNIAAGQTSGAVTQIGPAVM-GMVRRAIPNLIAFDICGVQPMNSPTGQVFAL 127 (521) T ss_pred ------HhhhhhhhcccC--ccccC--cccccccccccccccCCchhh-hHHHHHHhhhhhhhceeeccCCchhhhheee Confidence 000000000000 00000 000001111110001111111 1222223344455666666665432211100 Q ss_pred c----Ccc--------------cccc------------------------------------------------------ Q lcl|NC_019933. 161 T----GFT--------------NAAA------------------------------------------------------ 168 (394) Q Consensus 161 ~----~~~--------------~~~~------------------------------------------------------ 168 (394) . ... +.+. T Consensus 128 RsrY~~q~~~~~g~ea~~~e~~~da~fSG~~~~~~~~~~~~~~~~a~Gd~~~~~~~~~gt~~~~~~~~~~~~~g~t~~~~ 207 (521) T protein:vir:72 128 RAVYGKDPVAAGAKEAFHPMYGPDAMFSGQGAAKKFPALAASTQTTVGDIYTHFFQETGTVYLQASVQVTIDAGATDAAK 207 (521) T ss_pred eeeecCCCCCcccccccchhcccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCCccc Confidence 0 000 0000 Q ss_pred -----------------------eecC---------CccccccccceeeEEeeeeeEEEeehhhHHHHHHHH-----HHH Q lcl|NC_019933. 169 -----------------------PVAE---------GAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSA-----QLQ 211 (394) Q Consensus 169 -----------------------~~~e---------g~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~-----~~~ 211 (394) -..| +...++-..++++++...+.-+=...+|-||.+|-- |.+ T Consensus 208 t~~~v~~~~~a~~~y~~g~gm~Ta~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAE 287 (521) T protein:vir:72 208 LDAEIKKQMEAGALVEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDAD 287 (521) T ss_pred cccccccccccCceeeeecccchhhhhhhcccCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChH Confidence 0000 111233344556666666666667789999999862 688 Q ss_pred HHHHHHHHHHHHHHHHHHHhhccCCCcc--cccccc----cccccccccccc---ccchHH-------HHHHHHHHhh-- Q lcl|NC_019933. 212 SFINARLLRGLEVVEENQLLNGNGTGQN--LLGLLP----QATAFAAPITVA---NATAVD-------RLRLALLQAQ-- 273 (394) Q Consensus 212 ~~i~~~la~a~~~~~d~a~l~g~g~~~~--~~Gi~~----~~~~~~~~~~~~---~~~~~~-------~i~~~~~~~~-- 273 (394) +.|.+-|+..|...|++.+|.--..... ..|+.. .+++.......+ ..-..+ .|......+. T Consensus 288 tELaNILSTEImlEINReii~~i~~sa~~g~~g~t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~ 367 (521) T protein:vir:72 288 AELSGILATEIMLEINREVVDWINYSAQVGKSGMTLTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQ 367 (521) T ss_pred HHHHHHHHHHHHHHhhHHHhhhhhheeeeeeeeeeeccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHh Confidence 9999999999999999988853211111 112211 111111111111 111112 2222222222 Q ss_pred hhcCCCCeeEeCHHHHHHHHHhh--cc-CCc---ccc-cCccc-CCCcee-ecceEEEcCCCCcCceEEeeccceEE--- Q lcl|NC_019933. 274 LAEFPATGIVLNPADWAGIELLK--DT-QGR---YIL-GNPQG-TLAPTL-WGLPVVATQAMAVGQFLTGAFDAGAQ--- 341 (394) Q Consensus 274 ~~~~~~~~~~~~~~~~~~l~~lk--d~-~G~---~~~-~~~~~-~~~~~l-~G~pv~~~~~~p~~~~~~gd~~~~~~--- 341 (394) ......+.+++|++....|...- +. .+. --| .+... ...+.| .|++|+.++..|.+-+++|---...+ T Consensus 368 T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~g 447 (521) T protein:vir:72 368 TGRGEGNFIIASRNVVNVLASVDTGISYAAQGLATGFSTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGPNEMDAG 447 (521) T ss_pred cccccceEEEEchHHHHHHhhcccccccccccccccccccCCCceEEEEccCceEEEecCCCCcceEEEEEeCCcccccc Confidence 22345668999999998888531 11 110 001 11110 112344 46899999999888777664210000 Q ss_pred EEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccc-------eEEEE---ecCCCCC Q lcl|NC_019933. 342 VFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPES-------FIKGS---LAAAAGT 394 (394) Q Consensus 342 ~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a-------~~~l~---~~~a~~~ 394 (394) +|...=+.+.. ....+-.+-+-.+-...|++..+ +|=+ ..+++ ....++- T Consensus 448 lfyaPYv~l~~--~~~~dp~sfqP~~g~~tRY~l~~-NP~~~~~~~~~a~~i~~~~~~~~a~~ 507 (521) T protein:vir:72 448 IYYAPYVALTP--LRGSDPKNFQPVMGFKTRYGIGI-NPFAESAAQAPASRIQSGMPSILNSL 507 (521) T ss_pred eeecccccccc--ccccCCccccceeeeeeeeceee-cCcccccCcccceeecCcChhhhcCc Confidence 01000000100 00001111122333444444432 2311 12221 1111111 No 228 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=89.02 E-value=0.029 Score=29.03 Aligned_cols=365 Identities=11% Similarity=0.017 Sum_probs=107.0 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---Hhhcccccccc--- Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEV---EGNGAGGDVQH--- 74 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~---~~~~~~~~~~~--- 74 (394) ++.+.+++++++++.+++++..+......+...+++.+++++.+++++++.++.+.+...... ........... T Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 90 (408) T protein:vir:10 11 NEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLNKSENELK 90 (408) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhhhH Confidence 667888888888888888776666555555667788888888888888888887766443211 11211111111 Q ss_pred hhhhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhccccc----CC-cCccccchhhhhHHHhhhhhhhhHHHhccccc Q lcl|NC_019933. 75 ISIGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNA----DG-SAGATVQTTRLPGILELPQRRMTIRSLLAQGT 149 (394) Q Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~-~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~ 149 (394) ......+............. .+.++......... +. -...++........+..+...-++......++ T Consensus 91 ~~~~~~~~~~~~~~~~~~~~-------~~~~a~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~ 163 (408) T protein:vir:10 91 DKFVKDFVNMVRNPMAFMNT-------VSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRV 163 (408) T ss_pred HHHHHHHHHHhhcchhhhhh-------hhhhhhhcccccCCceeccHhHHHHHHHHHHhhchhhhhcceeeccCCcceEE Confidence 11111111110000000000 01112111111110 11 11112211111111221111112211111222 Q ss_pred cc---cCceeEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 150 ME---GNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSAQLQSFINARLLRGLEVVE 226 (394) Q Consensus 150 ~~---~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~~~~~~i~~~la~a~~~~~ 226 (394) +. +.... ..+.+ ...-..+.........+|..-.+...-.-..--+.+...+-...+...+...++.++..++ T Consensus 164 ~~~~~~~~~~-a~~v~---E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~i 239 (408) T protein:vir:10 164 YEKWTDVTPL-TVMDA---EDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAI 239 (408) T ss_pred Eeeccccccc-eeeec---CccccccccCcceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHH Confidence 21 11111 11111 1111222222233444444433322111111112222222222344444555555555444 Q ss_pred HHHHhhccCC--Cccccccccccc--c-cc---ccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhcc Q lcl|NC_019933. 227 ENQLLNGNGT--GQNLLGLLPQAT--A-FA---APITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDT 298 (394) Q Consensus 227 d~a~l~g~g~--~~~~~Gi~~~~~--~-~~---~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~ 298 (394) =...=.+... ......+..... . .. ...-.....++..| ..+.....++ .|.-++..-. -. -- T Consensus 240 l~g~g~~~~~~~~~~~~~l~~~~~~~~~~~~~~~a~~v~n~~~~~~l----~~lkd~~G~~-i~~~~~~~~~-~~---~l 310 (408) T protein:vir:10 240 IEVMKAAPKKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKL----ALVKTAEGKY-LLEPDPTKPN-SY---LI 310 (408) T ss_pred hhcccccccccccccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHH----HHhhccCCce-EeccCcCCCC-Cc---ee Confidence 3221111111 111222221100 0 00 00001122233333 3333322211 1211111100 00 01 Q ss_pred CCcccccCcc--cCCCceeecceEEEcCCCCcCceEEeeccceEE--------EEeecceEEEEecccchhhhcCcEEEE Q lcl|NC_019933. 299 QGRYILGNPQ--GTLAPTLWGLPVVATQAMAVGQFLTGAFDAGAQ--------VFDRWAARVEVATENQDDFIKNMVTIL 368 (394) Q Consensus 299 ~G~~~~~~~~--~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~--------~~~~~~~~i~~~~~~~~~~~~~~~~~~ 368 (394) .|.|+..... .+..+ -.-.++++-+. - .-+.+++....-. .|......+..... -|-..+. T Consensus 311 ~G~PV~~~~~~~~~~~~-~~~~~i~~gd~-~-~~~~~~~~~~~~v~~~~~~~~~f~~~~~~~r~~~r------~d~~v~~ 381 (408) T protein:vir:10 311 KGKQVIVVADRWLPNTG-STVYPLYYGDM-S-QAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDR------FDVKATD 381 (408) T ss_pred cceeeEEecccccCccC-CCceEEEEEeh-h-ccEEEEEecceEEEEcccccchhhcCceEEEEEEe------eccEEec Confidence 3554432110 00000 00111221111 0 0011222111000 01111111111000 0000111 Q ss_pred EEEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 369 AEERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 369 ~~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) -...+-+.+.-...-.-.+.+.++++ T Consensus 382 ~~a~~~~~~~~~~~~~~~~~~~~~~~ 407 (408) T protein:vir:10 382 SEALVAGSFSAIADQVGNFKTTTSTA 407 (408) T ss_pred cccEEEEEeeccccCCCCCCCCCccc Confidence 11111111111112222233334444 No 229 >protein:vir:103463 Length: 521 # NCBI annotation: major head subunit precursor # Family: family:all:364 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803115;genbank:gi:116326395;genbank:GeneID:4405492 Probab=88.45 E-value=0.032 Score=28.76 Aligned_cols=351 Identities=12% Similarity=0.059 Sum_probs=132.8 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-chhhhh Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQ-HISIGQ 79 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~-~~~~~~ 79 (394) ||+-+.|.+++.-+.+- +.+-+..... +.+-++| +|++.......+...+.+ ..+.++ T Consensus 3 ~~~~~~l~~kw~p~l~~-~~~~~i~~~~------------------~~~~a~~--~enq~~~~~~~~~~~~~~~~~~~~~ 61 (521) T protein:vir:10 3 IKTKAELLNKWKPLLEG-EGLPEIANSK------------------QAIIAKI--FENQEKDFQTAPEYKDEKIAQAFGS 61 (521) T ss_pred cchhHHHHHhhhhhhcc-CCCCccccch------------------hhhhhhh--hhhhhhhhhhccccchhHHHHHHhh Confidence 88888899999877654 1111110000 0000000 000100011100000000 000000 Q ss_pred hhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEE Q lcl|NC_019933. 80 QFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVR 159 (394) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~ 159 (394) . .....-.+...... .....++++.+-.-..|.++ .++++........+++.+.||++.+.-+.- T Consensus 62 ~----------l~e~~~~~~~~~~~----~~i~es~~t~~v~~~~P~Li-~lvRra~p~LIa~DIwGVQPMTgPTGLIFA 126 (521) T protein:vir:10 62 F----------LTEAEIGGDHGYNA----TNIAAGQTSGAVTQIGPAVM-GMVRRAIPNLIAFDICGVQPMNSPTGQVFA 126 (521) T ss_pred h----------hhhhcccCcccccc----ccccccccccccccCCchhh-hHHHHHHhhhhhhhceeeccCCchhhhhee Confidence 0 00000000000000 00011111111011111111 122223344555666777776654322110 Q ss_pred EcC---cc---------------cccce---------------------------------------------------- Q lcl|NC_019933. 160 ETG---FT---------------NAAAP---------------------------------------------------- 169 (394) Q Consensus 160 ~~~---~~---------------~~~~~---------------------------------------------------- 169 (394) ..+ .. +.+.| T Consensus 127 MRsrY~~q~~~~~g~eaf~~~~~ada~fSG~~~at~~s~~~~~~~~~~Gd~~~~~~~~~g~~~~~~~~~~t~~~t~~d~~ 206 (521) T protein:vir:10 127 LRAVYGKDPIAAGAKEAFHPMYGPDAMFSGQGAAKKFAALAASTQTTVGDIYTHFFQDTGTVYLQASAQVTISSTADDAA 206 (521) T ss_pred eeeeccCCccccccccccchhccccccccccccccccccccccccccccccccccccccccceecccccccCCCcccccc Confidence 000 00 00000 Q ss_pred -----------------ecC-----------------CccccccccceeeEEeeeeeEEEeehhhHHHHHHHH-----HH Q lcl|NC_019933. 170 -----------------VAE-----------------GAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSA-----QL 210 (394) Q Consensus 170 -----------------~~e-----------------g~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~-----~~ 210 (394) +++ +...++-..++++++...+.-+=...+|-||.+|-- |. T Consensus 207 ~~~~~~~~~~~~~~~y~~~~GmsTa~aEal~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDA 286 (521) T protein:vir:10 207 KLDAEIKKQMEAGALVEIAEGMATSIAELQESFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDA 286 (521) T ss_pred cccccccccccccceeecccccchhhHhhhccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCCh Confidence 000 112334445566666666666667789999999862 68 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhccCCCcc--cccccc----cccccccccccc---ccchHH-------HHHHHHHHhh- Q lcl|NC_019933. 211 QSFINARLLRGLEVVEENQLLNGNGTGQN--LLGLLP----QATAFAAPITVA---NATAVD-------RLRLALLQAQ- 273 (394) Q Consensus 211 ~~~i~~~la~a~~~~~d~a~l~g~g~~~~--~~Gi~~----~~~~~~~~~~~~---~~~~~~-------~i~~~~~~~~- 273 (394) ++.|.+-|+..|...|++.+|.--..... ..|+.. .+++.......+ ..-..+ .|......+. T Consensus 287 EtELaNILSTEImlEINReii~~i~~sa~~~~~g~t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~ 366 (521) T protein:vir:10 287 DAELSGILATEIMLEINREVVDWINYSAQVGKSGMTLTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIAR 366 (521) T ss_pred HHHHHHHHHHHHHHHhhHHHhhhhhheeeeeeeeeeeccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHH Confidence 89999999999999999988853211111 112221 111111111111 111112 2222222222 Q ss_pred -hhcCCCCeeEeCHHHHHHHHHhh--c---cCC-cccccCcccCCC----cee-ecceEEEcCCCCcCceEEeeccceEE Q lcl|NC_019933. 274 -LAEFPATGIVLNPADWAGIELLK--D---TQG-RYILGNPQGTLA----PTL-WGLPVVATQAMAVGQFLTGAFDAGAQ 341 (394) Q Consensus 274 -~~~~~~~~~~~~~~~~~~l~~lk--d---~~G-~~~~~~~~~~~~----~~l-~G~pv~~~~~~p~~~~~~gd~~~~~~ 341 (394) ......+.+++|++....|...- + +.| ..-|. .+.+. +.| .|++|+.++..|.+-+++|---...+ T Consensus 367 ~T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~g~~--~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~ 444 (521) T protein:vir:10 367 QTGRGEGNFIIASRNVVNVLASVDTGISYAAQGLATGFN--TDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGPNEM 444 (521) T ss_pred hcccccceEEEEchHHHHHHhhccccccccccccccccc--ccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCccc Confidence 22345668999999999888631 1 111 01111 11122 344 36899999999888777663210000 Q ss_pred ---EEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEE---------Eec---C-CC-CC Q lcl|NC_019933. 342 ---VFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKG---------SLA---A-AA-GT 394 (394) Q Consensus 342 ---~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l---------~~~---~-a~-~~ 394 (394) +|...=+.+.. ....+-.+-+-.+-...|++..+ +| |+.. +-. . ++ +. T Consensus 445 ~~glfyaPYv~l~~--~~~~dp~sfqP~~g~~tRY~l~~-NP--~~~~~~~~~~~~i~~~~~~~~a~~~~ 509 (521) T protein:vir:10 445 DAGIYYAPYVALTP--LRGSDPKNFQPVMGFKTRYGIGI-NP--FAESAAQAPASRIQSGMPSILNSLGK 509 (521) T ss_pred ccceeecccccccc--ccccCCccccceeeeeeeeceee-cC--cccccCCccceeecccchhhhccccc Confidence 01000000100 00011111222333444554433 33 2211 100 0 01 11 No 230 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=85.05 E-value=0.056 Score=27.46 Aligned_cols=266 Identities=7% Similarity=-0.023 Sum_probs=116.6 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhh----HHHhcc---ccccccCceeEEEEcCcccccce-ecCCccc-ccccc Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMT----IRSLLA---QGTMEGNTLEYVRETGFTNAAAP-VAEGAQK-PESSL 181 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~----l~~~~~---~~~~~~~~~~~~~~~~~~~~~~~-~~eg~~~-~~~~~ 181 (394) |... .-+.+...+.+.+...+. +..-.. +.-.++.++++|+.+...+...+ ...|-.. ..-+. T Consensus 1 Main--------ya~~~~~~Ld~~~~~~~lts~~l~~~~~~~~v~~~ggktVkIp~is~tsGl~DY~R~~g~~~~g~v~~ 72 (346) T protein:vir:10 1 MTIN--------YAEKYQAAVQQAFYDGHLYSAELWNSPSNSIIKFDGAKHIKVPRLEITSGRKDRQRRTITTPVANYSN 72 (346) T ss_pred Ccch--------hHHHHHHHHHHHHHhhhccchhhcccccccceEecCCCEEEEEEeeeecccccccccCCccccccccc Confidence 1111 012344455444444321 111111 12245678999998532223333 2223222 22345 Q ss_pred ceeeEEeeeeeEEEeehhhHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccccc Q lcl|NC_019933. 182 RFDLVQTSAKVIAHWMKASRQILSDS---AQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVAN 258 (394) Q Consensus 182 ~~~~i~~~~~k~~~~~~is~e~l~~s---~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~ 258 (394) ++...++...+.-.+. |..-=++.+ ..+...+.+.....+.=.+|.-.|.---+.. .+ .......+.+.+. T Consensus 73 ~~et~tl~qDR~~~F~-vD~mDvDETn~~~~~anv~~ef~r~~vvPEiDayrfskLa~~a--~~---~~~~~~~~~a~T~ 146 (346) T protein:vir:10 73 DWDSYELKNERYWSTL-VDPSDIDETNMVVSLANITKQFNLDSKMPEKDRYMFSHLYSGK--EA---AHDGGITTNTLDE 146 (346) T ss_pred ceeEEEeeccccceec-ccccchHHHHHHhHHHHHHHHHHHHhhcchhhHHHHHHHHHhh--hh---hccccccccccCH Confidence 5666666666533322 111001112 1233333344444444466755442110000 00 0001111122244 Q ss_pred cchHHHHHHHHHHhhhhcCC--CCeeEeCHHHHHHHHHhhccCCcc-cc-cCcccCCCceeecceEEEcC--CCCc---- Q lcl|NC_019933. 259 ATAVDRLRLALLQAQLAEFP--ATGIVLNPADWAGIELLKDTQGRY-IL-GNPQGTLAPTLWGLPVVATQ--AMAV---- 328 (394) Q Consensus 259 ~~~~~~i~~~~~~~~~~~~~--~~~~~~~~~~~~~l~~lkd~~G~~-~~-~~~~~~~~~~l~G~pv~~~~--~~p~---- 328 (394) ...++.+.+++..+.....+ +-.++|+|.++..|..-..-+... +. .....+..+.|.|+||+..+ .|+. T Consensus 147 ~ni~~~i~~~~~~lde~~vp~~~rvl~vTp~~~~lLk~s~~f~k~~~v~~~~~i~~~V~siDGv~Ii~VPs~r~~t~~~f 226 (346) T protein:vir:10 147 KNILPAFDNMMLDFDEARIPSTNRILYVTPKTNAILKRAEAMNRALTLKDPNNIQRTVYSLDDVTIRVVPSDLMQTAYDF 226 (346) T ss_pred HHHHHHHHHHHHHHHHccCCCCCeEEEECHHHHHHHhhchhheeccccccccccceeeeeecCeEEEEcchhhcccchhh Confidence 55788888998899877664 446899999988776433221111 11 11223445689999998743 2321 Q ss_pred --C----------ceEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEeccc--c-eEEEEecCCCC Q lcl|NC_019933. 329 --G----------QFLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPE--S-FIKGSLAAAAG 393 (394) Q Consensus 329 --~----------~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~--a-~~~l~~~~a~~ 393 (394) | ..+++. ..+..-..... .+.+.. +.. -..|...+.-..|.|.-|.+.. + ++-++-+++++ T Consensus 227 ~~G~~~~t~ak~INfiiv~-~~A~ia~~K~~-~~~if~-P~~-~~~g~~l~~~R~Y~D~fv~~nk~~~Iyv~~~~a~~~~ 302 (346) T protein:vir:10 227 SDGSKIIDTAKQIEMFLIY-NGVQIAPEKYS-FVGFDQ-PSA-ATSGNYLYYEQSYDDVLLLNTKTKGIQFVVSDKPKKD 302 (346) T ss_pred ccCccccCCccceeEEEEC-Cceeeeeeeee-eeEeeC-CCC-CcccceeeeeeeeeeeeeeccccceEEEeeecccccC Confidence 1 011121 11111111111 122221 122 2445566667777887777743 2 33344444444 Q ss_pred C Q lcl|NC_019933. 394 T 394 (394) Q Consensus 394 ~ 394 (394) . T Consensus 303 ~ 303 (346) T protein:vir:10 303 Q 303 (346) T ss_pred c Confidence 4 No 231 >protein:vir:98143 Length: 524 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239203;genbank:gi:66391678;genbank:GeneID:3416245 Probab=84.83 E-value=0.057 Score=27.39 Aligned_cols=355 Identities=13% Similarity=0.087 Sum_probs=130.9 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc-ccccchhhhh Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAG-GDVQHISIGQ 79 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~-~~~~~~~~~~ 79 (394) |++-++|+++|.-+.+.-+.+-+-..+..+ ...+++ . |++.......... .+.-..+++. T Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~---~~~a~l---l-------------enq~~~~~~~~~~~~~~~~~~~~~ 61 (524) T protein:vir:98 1 MSKKNELMEKWNDLLESQEGLPDIATKSKK---QLVAAI---L-------------EAQEKDAETDPVYRDEKIVESFGG 61 (524) T ss_pred CcchHHHHHHhHHHhcCCcCcchhcchhhH---HHHHHH---H-------------hhHHHHHhcCccccchHHHHhhhc Confidence 999999999998887542221111000000 000010 0 0000000000000 0000000000 Q ss_pred hhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeE-- Q lcl|NC_019933. 80 QFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEY-- 157 (394) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~-- 157 (394) ...+ ....+. ... +-. ....++++.+-.-..|.++ .++++........+++.+.||++.+.-+ T Consensus 62 ~l~e----------a~~~~~--~~~-~~~-~i~~s~~t~~v~~~~P~Li-~lvRra~p~LIa~DIwGVQPMTgPTGLIFA 126 (524) T protein:vir:98 62 FLAE----------AEIAGD--HNY-DQT-NIASGKSSGAITNIGPAVI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFA 126 (524) T ss_pred cccc----------cccccc--ccc-ccc-cccccccccccccccchhh-hHHHHHHHhhhhhhhheeccCCchhhhhhh Confidence 0000 000000 000 000 0000000000000111111 1122223334445555555554432111 Q ss_pred -----EEEcCcccc---------------c-------------------------------------------------- Q lcl|NC_019933. 158 -----VRETGFTNA---------------A-------------------------------------------------- 167 (394) Q Consensus 158 -----~~~~~~~~~---------------~-------------------------------------------------- 167 (394) ......... . T Consensus 127 mRsrY~n~~~~~gteA~~nEAf~~~ye~dt~fSG~g~~t~~s~~~~g~~~~~g~~~~~~~~~~g~~~~~~~~~g~~~~tg 206 (524) T protein:vir:98 127 LRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTG 206 (524) T ss_pred hheeecCCCCCcccccccccccccccccccccCCccccccccccccccccccccccccccccccceeccccccCcccccc Confidence 000000000 0 Q ss_pred ---------------------------ceecC---------CccccccccceeeEEeeeeeEEEeehhhHHHHHHHH--- Q lcl|NC_019933. 168 ---------------------------APVAE---------GAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSA--- 208 (394) Q Consensus 168 ---------------------------~~~~e---------g~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~--- 208 (394) .-..| +...++-..++++++...+.-+=...+|-||.+|-- T Consensus 207 t~p~~~~~a~~~~~~~g~~~~~~~GmsTA~aEaL~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVH 286 (524) T protein:vir:98 207 ADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVH 286 (524) T ss_pred cccccccccccccccccceeecccccchhhhhhhccCCCCccccccceeeEEEEEEEeeecccccccccHHHHHHHHHhc Confidence 00001 122344445566666666666667789999999862 Q ss_pred --HHHHHHHHHHHHHHHHHHHHHHhhccCCCcc--ccccccc----cccccccccc---cc-------cchHHHHHHHHH Q lcl|NC_019933. 209 --QLQSFINARLLRGLEVVEENQLLNGNGTGQN--LLGLLPQ----ATAFAAPITV---AN-------ATAVDRLRLALL 270 (394) Q Consensus 209 --~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~--~~Gi~~~----~~~~~~~~~~---~~-------~~~~~~i~~~~~ 270 (394) |.++.|.+-|+..|...|++.||.--..... ..|+.+. ++...-.... .+ ...+-.+..... T Consensus 287 GLDAEtELsNILSTEImlEINReii~~i~~~a~~~~~g~t~~~~~~~G~~dl~~~~d~~~~r~~~e~~~~L~~~i~~~an 366 (524) T protein:vir:98 287 GMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEAN 366 (524) T ss_pred CCChHHHHHHHHHHHHHHHhhHHHHHHHhhhheeceeecccccccccceeeccccccccccchhHHHHHHHHHHHHHHHH Confidence 6889999999999999999888853211111 1222211 1111111110 01 111222222222 Q ss_pred Hhh--hhcCCCCeeEeCHHHHHHHHHh----hccCCcccccCcccCCC----cee-ecceEEEcCCCCcCceEEeeccce Q lcl|NC_019933. 271 QAQ--LAEFPATGIVLNPADWAGIELL----KDTQGRYILGNPQGTLA----PTL-WGLPVVATQAMAVGQFLTGAFDAG 339 (394) Q Consensus 271 ~~~--~~~~~~~~~~~~~~~~~~l~~l----kd~~G~~~~~~~~~~~~----~~l-~G~pv~~~~~~p~~~~~~gd~~~~ 339 (394) .+. ..+...+.+++|++....|..+ .+..|.-.-....+.++ +.| .|++|+.++..|.+-+++|---.. T Consensus 367 ~I~~~T~rg~~n~~i~S~~Va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~ 446 (524) T protein:vir:98 367 EIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDN 446 (524) T ss_pred HHHHhhccccccEEEEchHHHHHHhhhhcccccccchhhcccccCCccceEEEEecCceEEEecCCCCcceEEEEeeCCc Confidence 222 2334577899999999888863 12222111000111111 344 368999999998887776532100 Q ss_pred EE---EEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecCC-----CCC Q lcl|NC_019933. 340 AQ---VFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAA-----AGT 394 (394) Q Consensus 340 ~~---~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a-----~~~ 394 (394) .+ +|...=+.+.. ....+-.+-+-.+-...|++..+ +| |+...-.+. .|+ T Consensus 447 ~~~~glfyaPYv~l~~--~~~~dp~sfqP~~g~~tRY~l~~-NP--~~~~~~~~~~~ri~~g~ 504 (524) T protein:vir:98 447 EMDAGIYYAPYVALTP--LRGSDPKNFQPVMGFKTRYGIGI-NP--FANSRSQAPADRITSGM 504 (524) T ss_pred ccccceeecccccccc--ccccCCccccceeeeeeeeceee-cC--cccccCCccccccccCc Confidence 00 01000000100 00001111122333344444332 33 221111110 111 No 232 >protein:vir:80986 Length: 528 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469506;genbank:gi:157311463;genbank:GeneID:5602119 Probab=84.23 E-value=0.062 Score=27.20 Aligned_cols=346 Identities=14% Similarity=0.078 Sum_probs=133.3 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-chhhhh Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQ-HISIGQ 79 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~-~~~~~~ 79 (394) |++-++|+++|.-+.+... + .+++.... +.+-++| +|++.......+.-.+.+ ..+++. T Consensus 1 ~~~~~~l~~kw~p~l~~~~-~-----------~~i~~~~~------~~~~a~l--lenq~~~~~~~~~~~~~~~~~~~~~ 60 (528) T protein:vir:80 1 MKTTKELMEKWSPLLENEK-L-----------PEIATASK------QKLVAKI--LESQEADFAVDPIYKDEKVVEAFGG 60 (528) T ss_pred CcchHHHHHhhhHhhcCCc-c-----------chhcchhh------hhhhhhh--hhhhhHHhhccccccchHHHHhhhh Confidence 9999999999988765311 0 00000000 0000000 011111111110000000 000000 Q ss_pred hhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHH---HhhhhhhhhHHHhccccccccCcee Q lcl|NC_019933. 80 QFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGI---LELPQRRMTIRSLLAQGTMEGNTLE 156 (394) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~i---i~~~~~~~~l~~~~~~~~~~~~~~~ 156 (394) +.......+........ ...++++ +.+. .+.+.+ +++........+++.+.||++.+.- T Consensus 61 ----------~l~ea~~~~~~~~~~~~----i~es~~t-~~v~---~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGL 122 (528) T protein:vir:80 61 ----------FIAEAEVAGDHGYDASQ----IAAGQTT-GAIT---NVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQ 122 (528) T ss_pred ----------hccccccccccCCcccc----ccccccc-cccc---cCCchhhhHHHHHHhhhhhhhhheeccCCchhhh Confidence 00000000000000000 0001111 1100 122222 2223345555667777777654211 Q ss_pred EEEEc--C-cc--------------------------------------------------------------------- Q lcl|NC_019933. 157 YVRET--G-FT--------------------------------------------------------------------- 164 (394) Q Consensus 157 ~~~~~--~-~~--------------------------------------------------------------------- 164 (394) +.-.. . .. T Consensus 123 IFAMRsrY~~~~~~~~~~ea~~~~~~~da~fS~~~t~~~a~~~ea~t~fs~~~~~~~~~~G~~~~~t~~~tg~~~~~~~~ 202 (528) T protein:vir:80 123 IFAIRSVYGPNPLASQAKEAFHPMYAPDAFHSSLAAKGAAVGSPTGTPFAKLAIGTQIEAGDIVHHTFAETGIAYLQNVT 202 (528) T ss_pred heeeeeeecCCccccccccccccccccccccccccccccccccccccccccccccccccccceecccccccccccccccc Confidence 11000 0 00 Q ss_pred --------------------------------cccceecC---------CccccccccceeeEEeeeeeEEEeehhhHHH Q lcl|NC_019933. 165 --------------------------------NAAAPVAE---------GAQKPESSLRFDLVQTSAKVIAHWMKASRQI 203 (394) Q Consensus 165 --------------------------------~~~~~~~e---------g~~~~~~~~~~~~i~~~~~k~~~~~~is~e~ 203 (394) +-..-.+| +...++-..++++++...+.-+=...+|-|| T Consensus 203 ~~~~~~~~~gt~~~~~~~~~~~~~~~~~~~~~Gm~Ta~AE~le~lg~ss~~~f~EMaFsIEKvTVtAKSRaLKAEYTiEL 282 (528) T protein:vir:80 203 AEQVTPTKAGSESEDEVVMKLMEEGKLAEIAFGMATSIAEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEV 282 (528) T ss_pred ccccCccccCCcccccccccccccccccccccccchhhhhhhcccCCCccccccceeeEEEEEEEeeeccceeccccHHH Confidence 00000001 1123444455666666666666677899999 Q ss_pred HHHHH-----HHHHHHHHHHHHHHHHHHHHHHhhccCCCcc--ccccc----ccccccccccccc--c-cchHHH----- Q lcl|NC_019933. 204 LSDSA-----QLQSFINARLLRGLEVVEENQLLNGNGTGQN--LLGLL----PQATAFAAPITVA--N-ATAVDR----- 264 (394) Q Consensus 204 l~~s~-----~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~--~~Gi~----~~~~~~~~~~~~~--~-~~~~~~----- 264 (394) .+|-- |.++.|.+-|+..|...|++.||.--..... .+|+. ..++........+ + --..+. T Consensus 283 AQDLKAIHGLDAEtELaNILStEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~~dl~~~~d~~g~r~~~e~~k~L~ 362 (528) T protein:vir:80 283 AQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLI 362 (528) T ss_pred HHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhheeeeeeeeeeeccccccceeeccccccccccchhHHHHHHHH Confidence 99862 6899999999999999999998743211111 11111 1111111111000 0 011222 Q ss_pred --HHHHHHHhh--hhcCCCCeeEeCHHHHHHHHHh-----hccCC-cccccCcccC--CCceee-cceEEEcCCCCcCce Q lcl|NC_019933. 265 --LRLALLQAQ--LAEFPATGIVLNPADWAGIELL-----KDTQG-RYILGNPQGT--LAPTLW-GLPVVATQAMAVGQF 331 (394) Q Consensus 265 --i~~~~~~~~--~~~~~~~~~~~~~~~~~~l~~l-----kd~~G-~~~~~~~~~~--~~~~l~-G~pv~~~~~~p~~~~ 331 (394) +......+. ..+...+.+++|++....|... ....| +..+...... ..+.|. |++|+.++..|.+-+ T Consensus 363 ~~i~~~an~I~~~T~~~~gn~vi~S~~Va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~ 442 (528) T protein:vir:80 363 YQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAKGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYF 442 (528) T ss_pred HHHHHHHHHHHHhhccccccEEEEchHHHHHHhhccccccccccccccccccCCCCceEEEEecCceEEEecCCCCcceE Confidence 222223332 2234457899999999888653 11111 1212111111 123453 689999999988877 Q ss_pred EEeeccc-----eEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEE--------------------- Q lcl|NC_019933. 332 LTGAFDA-----GAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIK--------------------- 385 (394) Q Consensus 332 ~~gd~~~-----~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~--------------------- 385 (394) ++|---. +++...--.+......++. .| +-.+-...|++..+ +| |+. T Consensus 443 ~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~-sf---qP~~g~~tRY~l~~-NP--~~~~~~~~~~~r~~~g~~~~~~ag 515 (528) T protein:vir:80 443 TVGYKGDNEMDAGIYYAPYVALTPLRATDPQ-SF---HPVLGFKTRYGIGI-NP--FADSKSQAPSARITSGMLSKDSVG 515 (528) T ss_pred EEEEeCCcccccceeecccccceeeEeeCCc-cc---cceeeeeeeeceee-cC--cccccCCcccccccccchhhhhcC Confidence 6653210 0000000011111111111 11 12233334444332 22 211 Q ss_pred -------EEecCC Q lcl|NC_019933. 386 -------GSLAAA 391 (394) Q Consensus 386 -------l~~~~a 391 (394) +-+|-- T Consensus 516 ~n~~~r~~~Vk~~ 528 (528) T protein:vir:80 516 KNAYFRRVWVKGC 528 (528) T ss_pred ccceeEEeeeccC Confidence 111111 No 233 >protein:vir:6901 Length: 522 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861877;genbank:gi:32453668;genbank:GeneID:1494303 Probab=83.95 E-value=0.064 Score=27.12 Aligned_cols=350 Identities=12% Similarity=0.050 Sum_probs=129.4 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-chhhhh Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQ-HISIGQ 79 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~-~~~~~~ 79 (394) ||+-++|.++|.-+.+..- +-+........+ +++ +|++.......+.-.+.+ ..+++. T Consensus 4 ~~~~e~l~~kw~p~l~~~~-~~~~~~~~~~~~----a~l----------------~enq~~~~~~~~~~~~~~~~~~~~~ 62 (522) T protein:vir:69 4 IKTKAQLVDKWKELLEGEG-LPEIANSKQAII----AKI----------------FENQEKDFEVSPEYKDEKIAQAFGS 62 (522) T ss_pred cchHHHHHHhhHHHhcCCC-CCccccchhhhh----hhh----------------hhhhhHHhhcccccchhHHHHhhhh Confidence 5555667777766654310 000000000000 000 001100010000000000 000000 Q ss_pred hhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhH---HHhhhhhhhhHHHhccccccccCcee Q lcl|NC_019933. 80 QFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPG---ILELPQRRMTIRSLLAQGTMEGNTLE 156 (394) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~---ii~~~~~~~~l~~~~~~~~~~~~~~~ 156 (394) +.......+.......+ ...++++ +.+ ..+.+. ++++........+++.+.||++.+.- T Consensus 63 ----------~l~ea~~~~~~~~~~~~----i~es~~t-~~v---~~~~P~li~lvrRa~p~LIa~DIwGVQPMTgPTGL 124 (522) T protein:vir:69 63 ----------FLTEAEIGGDHGYNAQN----IAAGQTS-GAV---TQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQ 124 (522) T ss_pred ----------hhhhhccccccCCCccc----ccccccc-ccc---ccccchHHHHHHHHHhhhhhhhceeeccCCchhhh Confidence 00000000000000000 0011110 000 011122 22233334445566666666553321 Q ss_pred EEEEc----Ccc--------------cc---------------------------------------------------- Q lcl|NC_019933. 157 YVRET----GFT--------------NA---------------------------------------------------- 166 (394) Q Consensus 157 ~~~~~----~~~--------------~~---------------------------------------------------- 166 (394) +.-.. ... +. T Consensus 125 IFAMRsrY~~q~~~~~~~eaf~~~neadt~fSG~~~~t~~~~~~~~~~t~~G~~~~~~~~~~gt~~~~~~a~~t~~~t~~ 204 (522) T protein:vir:69 125 VFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGQGAAKKFPALAASTQTKVGDIYTHFFQETGTVYLQASAQVTISSSAD 204 (522) T ss_pred heeeeeeccCCcccCccccccccccccccccccccccccccccccccccccccccccccccccceeeecccCCcCCCCCc Confidence 10000 000 00 Q ss_pred -------------------------cceecC---------CccccccccceeeEEeeeeeEEEeehhhHHHHHHHH---- Q lcl|NC_019933. 167 -------------------------AAPVAE---------GAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSA---- 208 (394) Q Consensus 167 -------------------------~~~~~e---------g~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~---- 208 (394) ..-.+| +...++-..++++++...+.-+=...+|-||.+|-- T Consensus 205 ~~~~~~~ai~s~~~~~~~y~~g~GmsTa~aEal~~lggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAIHG 284 (522) T protein:vir:69 205 DAAKLDAEIIKQMEAGALVEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHG 284 (522) T ss_pred ccccccchhccccccccceeeccccchhhhhhcccCCCCcccchhhhcceEeeEEEeeecccccccccHHHHHHHHHhcC Confidence 000011 112444556667777777777777889999999862 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHhhccCCCcc--cccccc----cccccccccccc---ccch-------HHHHHHHHHH Q lcl|NC_019933. 209 -QLQSFINARLLRGLEVVEENQLLNGNGTGQN--LLGLLP----QATAFAAPITVA---NATA-------VDRLRLALLQ 271 (394) Q Consensus 209 -~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~--~~Gi~~----~~~~~~~~~~~~---~~~~-------~~~i~~~~~~ 271 (394) |.++.|.+-|+..|...|++.+|.--..... ..|+.. .++......... +--. +-.|...... T Consensus 285 LDAEtELaNILSTEImlEINReii~~i~~sa~~~~~g~t~~~~~~~Gv~Dl~~~~~~~~~rw~~e~~k~L~~~i~~~an~ 364 (522) T protein:vir:69 285 MDADAELSGILATEIMLEINREVVDWINYSAQVGKSGMTNIVGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVE 364 (522) T ss_pred CChHHHHHHHHHHHHHHHhhHHHHhhhhhhheeeccccccccccccceeecccccccccchhHHHHHHHHHHHHHHHHHH Confidence 6789999999999999999888853211111 122221 111111111111 1111 2222222223 Q ss_pred hh--hhcCCCCeeEeCHHHHHHHHHhh--c---cCC-cccccCcccC--CCcee-ecceEEEcCCCCcCceEEeeccceE Q lcl|NC_019933. 272 AQ--LAEFPATGIVLNPADWAGIELLK--D---TQG-RYILGNPQGT--LAPTL-WGLPVVATQAMAVGQFLTGAFDAGA 340 (394) Q Consensus 272 ~~--~~~~~~~~~~~~~~~~~~l~~lk--d---~~G-~~~~~~~~~~--~~~~l-~G~pv~~~~~~p~~~~~~gd~~~~~ 340 (394) +. ......+.+++|++....|...- | +.| ..-|...... .-+.| .|++|+.++..|.+-+++|---... T Consensus 365 i~~~T~rg~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~ 444 (522) T protein:vir:69 365 IARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQGLASGFNTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGANE 444 (522) T ss_pred HHHhcccccccEEEEchhHHHHHhhcccccccccccccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcc Confidence 32 22345678999999999887531 1 111 1111111111 11344 3689999999988877766321000 Q ss_pred ---EEEee--cceEEEEecccchhhhcCcEEEEEEEEeccEEecccc-------eEEEEecC-----CCCC Q lcl|NC_019933. 341 ---QVFDR--WAARVEVATENQDDFIKNMVTILAEERLALAVYRPES-------FIKGSLAA-----AAGT 394 (394) Q Consensus 341 ---~~~~~--~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a-------~~~l~~~~-----a~~~ 394 (394) -+|.. ..+......++ .+-+-.+-...|++..+ +|=+ .+++.-.. .++. T Consensus 445 ~~~glfyaPYv~l~~~~~~dp----~sfqP~~g~~tRY~l~v-NP~~~~~~~~~~~ri~~g~p~~~~~~~~ 510 (522) T protein:vir:69 445 MDAGIYYAPYVALTPLRGSDP----KNFQPVMGFKTRYGIGV-NPFAESSLQAPGARIQSGMPSILNSLGK 510 (522) T ss_pred cccceeeccccccccccccCC----ccccceeeeeeeeceee-cCcccccCCcccceeecccchhhcccCC Confidence 00100 00111111111 11122333344444332 2200 11222111 1111 No 234 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=81.04 E-value=0.089 Score=26.34 Aligned_cols=259 Identities=12% Similarity=0.045 Sum_probs=115.4 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHHhcc------ccccccCceeEEEEcC-cccccceecCCccccccccce Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLA------QGTMEGNTLEYVRETG-FTNAAAPVAEGAQKPESSLRF 183 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~------~~~~~~~~~~~~~~~~-~~~~~~~~~eg~~~~~~~~~~ 183 (394) |. ...-+.+...+.+.+...+....++. +...+++++++|+... .+-..+-.+.|-...+-..++ T Consensus 1 Ma--------in~~~k~~~~ld~~~~~~~~~~~l~~~~n~~~~~~~gak~VkIp~ist~~gl~dY~R~~g~~~g~v~~~~ 72 (285) T protein:vir:79 1 MT--------VVLDSKDLARIDEEYKADSQVWSYLTGGNGVTQRFRGHNEVRINKLSGFVDATAYKRGQDNARKTISVGK 72 (285) T ss_pred Cc--------chhhHHHHHHHHHHHHHhhhhhhhcccCCcceeEecCCCEEEEeeecccccccccccccCccccccceee Confidence 11 01123345555555555444444432 2234567899999853 223333334343333344555 Q ss_pred eeEEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHH-HHHHHHHHHHHHHhhccCCCccccccccccccccccccccccch Q lcl|NC_019933. 184 DLVQTSAKVIAHWMKASRQILSDSA-QLQSFINAR-LLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVANATA 261 (394) Q Consensus 184 ~~i~~~~~k~~~~~~is~e~l~~s~-~~~~~i~~~-la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~~ 261 (394) ...++...+.-.+. |..-=.+.+. .-...++.+ ....+.=.+|.-.|..--+ .++.. .+.+.+.... T Consensus 73 et~tl~~DR~~~f~-iD~mDvdEn~~~~~~ni~~ef~~~~vvPEiDayrfskla~---------~a~~~-~~~~~T~~nv 141 (285) T protein:vir:79 73 ETVKLTHEDWFGYD-LDQFDMDENGAYTVENVVREHNKMITIPHRDKVAVQKLFD---------SAAKK-ATDSITKDNA 141 (285) T ss_pred eEEEeeccccceec-ccccchhhhhhhhHHHHHHHHHhhhhcchhhHHHHHHHHh---------hcccc-cccccCHHHH Confidence 55666655533222 1100011111 112233333 3333444566544431100 01111 1112244557 Q ss_pred HHHHHHHHHHhhhhcCC-CCeeEeCHHHHHHHHHhhccCCc-----ccccCcccCCCceeec-ceEEEcC--CCCcCc-- Q lcl|NC_019933. 262 VDRLRLALLQAQLAEFP-ATGIVLNPADWAGIELLKDTQGR-----YILGNPQGTLAPTLWG-LPVVATQ--AMAVGQ-- 330 (394) Q Consensus 262 ~~~i~~~~~~~~~~~~~-~~~~~~~~~~~~~l~~lkd~~G~-----~~~~~~~~~~~~~l~G-~pv~~~~--~~p~~~-- 330 (394) ++.+..++..+...+.+ +-.++|+|.++..|..-+.-... ...........+.|.| .|++..+ .|+... T Consensus 142 ~~~i~~~~~~lde~~vp~~rvl~vTp~~~~~Lk~s~~~~r~~~~~~~~~~~~i~~~V~~lDg~v~ii~Vps~r~kt~~~~ 221 (285) T protein:vir:79 142 LDAYDTAEAYMFDNEVPGGFVMFVSSAYYTALKQSAAVTRTFSTDGTMVINGIDRRVAQLDGGVPIVRVSSDRLKGLGIT 221 (285) T ss_pred HHHHHHHHHHHHHcCCCCceEEEEChHHHHHHHhhhhhheecccccceeccceeeeeccccceeEEEEcchhhccCcCcc Confidence 88888888888887664 44679999998877754332211 1111122233467888 8998753 343211 Q ss_pred ----eEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecc--cceEEEEecCCC Q lcl|NC_019933. 331 ----FLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRP--ESFIKGSLAAAA 392 (394) Q Consensus 331 ----~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~--~a~~~l~~~~a~ 392 (394) .++...+ +..-........-++++. .-.-|...+.-..|.|.-+.+. +++.+- .+++- T Consensus 222 k~Infiiv~~~-a~i~~~K~~~~~~f~P~~--~~~~d~~~~~~R~Y~d~fv~~nk~~~Iy~~-~~a~~ 285 (285) T protein:vir:79 222 NHVNFILTPLS-AIAPIVKYDSVSVIDPST--DRSGNRWTIKGLSYYDAIVLDNAKKGIYVA-ATAGV 285 (285) T ss_pred hhccEEEecCc-eeccceeeeeeEeECCCC--CCCcceeeeeeeeeeeeeehhhccceeeee-ecccC Confidence 1222221 222122212111122221 2123345566667777777664 334333 33333 No 235 >protein:vir:99888 Length: 309 # NCBI annotation: capsid protein # Family: family:all:908 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164075;genbank:gi:56692607;genbank:GeneID:3192616 Probab=78.63 E-value=0.11 Score=25.79 Aligned_cols=264 Identities=11% Similarity=-0.026 Sum_probs=112.4 Q ss_pred CCcCccccchhhhhHHHhhhhhh-hhHHHhccccccccCceeEEEEcCcc---cccceecCCccccccccceeeEEeeee Q lcl|NC_019933. 116 DGSAGATVQTTRLPGILELPQRR-MTIRSLLAQGTMEGNTLEYVRETGFT---NAAAPVAEGAQKPESSLRFDLVQTSAK 191 (394) Q Consensus 116 ~~~~g~~ip~~~~~~ii~~~~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~eg~~~~~~~~~~~~i~~~~~ 191 (394) -+++-.++ ......+---.++. ..--.+++.+|++....+|+...... ....-++.++....-.++...-+...+ T Consensus 1 ~~~~~~~~-dp~LT~~A~gy~n~~~Ia~~l~P~vpV~~~~~~~~~f~~~e~F~~~~t~r~~~~~~~~v~~~~~~~~~~~~ 79 (309) T protein:vir:99 1 MSNAPFPI-DPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDETGSTE 79 (309) T ss_pred CCCCCcCc-CHhHHHHHhhccChhhhhhhcCCccccCccccceeeechhhcccccchhhccCCCcceEeecccCceeeec Confidence 22222222 22233332211222 22234578888887778888764321 111223444433333344444455555 Q ss_pred eEEEeehhhHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhh--ccCCCcccccc-ccccccccccccccccchHHHH Q lcl|NC_019933. 192 VIAHWMKASRQILSDSA---QLQSFINARLLRGLEVVEENQLLN--GNGTGQNLLGL-LPQATAFAAPITVANATAVDRL 265 (394) Q Consensus 192 k~~~~~~is~e~l~~s~---~~~~~i~~~la~a~~~~~d~a~l~--g~g~~~~~~Gi-~~~~~~~~~~~~~~~~~~~~~i 265 (394) ..+-..+|..+-..+.+ +.++...+.+.+.|....|..+-. -+. .+.+.+= .+.++ +......+...+.+| T Consensus 80 ~~~L~~~i~~~~~~~a~~~~d~~~~Av~~l~~~i~l~rE~~~A~lv~~~-a~y~~~~k~~Lsg--t~~wsd~~SDPi~~i 156 (309) T protein:vir:99 80 DHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSP-NSYAAGNKTTLSG--ADQWSDPTSNPLPVI 156 (309) T ss_pred ccceeecCCchhhhhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCh-hhcCCCceEEecC--ccccCCCCCCcHHHH Confidence 55555677776655443 567777777777776666532221 111 1112220 00111 111223344456666 Q ss_pred HHHHHHhhhhcCCCCeeEeCHHHHHHHHH---h----hccCCcccccCcccCCCceeecc-eEEEcCCC-----Cc---- Q lcl|NC_019933. 266 RLALLQAQLAEFPATGIVLNPADWAGIEL---L----KDTQGRYILGNPQGTLAPTLWGL-PVVATQAM-----AV---- 328 (394) Q Consensus 266 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~---l----kd~~G~~~~~~~~~~~~~~l~G~-pv~~~~~~-----p~---- 328 (394) .+....+ ...++.++|...+|.+|+. + +...++.-.-.+ ..-..|+|+ .|++.... +. T Consensus 157 ~~~~~~~---g~~PN~~vlg~~~~~~l~~hp~i~~~ik~~~~~~g~it~--~~la~l~~ve~V~vg~a~~n~a~~g~~~~ 231 (309) T protein:vir:99 157 TDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPM--AFLQELLELDAIYIGEARLNIARPGQNPN 231 (309) T ss_pred HHHHHhh---CCCcceEEechHHHHHHhhCHHHHHHhcCCCccccccCH--HHHHHHhCcceEEeecceeeccccccccc Confidence 6655443 6788899999999998864 2 222221100000 000123343 23322111 00 Q ss_pred ------CceEE----------eeccceEEE--EeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecC Q lcl|NC_019933. 329 ------GQFLT----------GAFDAGAQV--FDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAA 390 (394) Q Consensus 329 ------~~~~~----------gd~~~~~~~--~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~ 390 (394) +.+.+ ...+.+|.. ..+....+. +++ .-..+...+|+...+.-.+.-+++=..++=.. T Consensus 232 ~~~iwg~~~~L~y~~~~~~~~~~ps~G~t~~~~~r~~g~~~-d~~---~~~~g~~~vr~~~~~k~~i~~~d~G~li~~~v 307 (309) T protein:vir:99 232 LIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIA-DPN---IGLRGGQRVRVGESVKELVTAPDLGFFFENAV 307 (309) T ss_pred cccccCCcEEEEEcCCCCCCcccccccceeecccccCCcee-eee---eccCCceEEEEeccccchhcchhcchhhhhcc Confidence 00000 111222211 112111111 110 01133344555555555555555555555444 Q ss_pred CC Q lcl|NC_019933. 391 AA 392 (394) Q Consensus 391 a~ 392 (394) |+ T Consensus 308 a~ 309 (309) T protein:vir:99 308 AA 309 (309) T ss_pred cC Confidence 44 No 236 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=76.75 E-value=0.13 Score=25.40 Aligned_cols=370 Identities=11% Similarity=0.032 Sum_probs=111.2 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhhhhh Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQHISIGQQ 80 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~ 80 (394) ++.+.++.++++++.+++++..+......+...+++++++++.++.++++.++++.+....................... T Consensus 11 ~~~~~~~~~~~~~~~~e~~~~~~~~~~~~e~i~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 90 (408) T protein:vir:74 11 NEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLNKSENELK 90 (408) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccchhhhhH Confidence 44688888888888888887777766666667888899999999999999888776655443332222222211111111 Q ss_pred hhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHH-----HhhhhhhhhHHHhccccccc---c Q lcl|NC_019933. 81 FVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGI-----LELPQRRMTIRSLLAQGTME---G 152 (394) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~i-----i~~~~~~~~l~~~~~~~~~~---~ 152 (394) ......+..+.+..... ....+.++............-...+...+...+ +..+...-++-.....+++. + T Consensus 91 ~~~~~~~~~~~~~~~~~-~~~~~~~a~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 169 (408) T protein:vir:74 91 DKFVKDFVNMVRNPMAF-LNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSSGSRVYEKWTD 169 (408) T ss_pred HHHHHHHHHHHhcchhh-hhhhhhhhhcccccCCCceeechhHhhHHHHHHhhhcchhhhcceeeccCCcceEEEEeecC Confidence 11112222222222211 112233333222222111111112222232222 21111111111111111211 1 Q ss_pred CceeEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHh Q lcl|NC_019933. 153 NTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSAQLQSFINARLLRGLEVVEEN-QLL 231 (394) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~~~~~~i~~~la~a~~~~~d~-a~l 231 (394) ....-... ....-..+.........++..-.+...-.-..--+.+...+--..+...|...++.++..++=. .-- T Consensus 170 ~~~~~~~v----~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~G~ 245 (408) T protein:vir:74 170 VTPLKAMD----EEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAMGT 245 (408) T ss_pred Cccccccc----ccccccccccccceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccc Confidence 11100111 1111122233333333333333322221111111222122212233444444444444444311 000 Q ss_pred h-ccCCCccccccccccc--cc----cccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCcccc Q lcl|NC_019933. 232 N-GNGTGQNLLGLLPQAT--AF----AAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYIL 304 (394) Q Consensus 232 ~-g~g~~~~~~Gi~~~~~--~~----~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~~ 304 (394) + ..+......++..... .. ....-.....++..|. .+..... ..+..+.....-. ..-.|.|+. T Consensus 246 ~~~~~~~~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~l~----~lkd~~G---~~l~~~~~~~~~~--~~l~G~pV~ 316 (408) T protein:vir:74 246 VPKKPTIANFDDVITMINTSVDPAIIATSSLLTNQSGLNKLA----LVKTAEG---KYLLEPDPTKPNS--YLIKGKQVI 316 (408) T ss_pred cccccccccHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHH----HhhcCCC---ceEeccCcCCCCC--ceecceeeE Confidence 0 1111122223322110 00 0011112233333333 3333222 2222222100000 001344443 Q ss_pred cCcccCCCcee-ec-ceEEEcCCCCcCceEEeeccceEE--------EEeecceEEEEecc-cchhhhcCcEEEEEEEEe Q lcl|NC_019933. 305 GNPQGTLAPTL-WG-LPVVATQAMAVGQFLTGAFDAGAQ--------VFDRWAARVEVATE-NQDDFIKNMVTILAEERL 373 (394) Q Consensus 305 ~~~~~~~~~~l-~G-~pv~~~~~~p~~~~~~gd~~~~~~--------~~~~~~~~i~~~~~-~~~~~~~~~~~~~~~~~~ 373 (394) ....... +.. .+ .++++-+. . ..+.+++....-. .|......+..... +......+ +...+ T Consensus 317 ~~~~~~~-~~~~~~~~~i~~gd~-~-~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~-----a~~~~ 388 (408) T protein:vir:74 317 VVADRWL-PNSGSTVYPLYYGDM-S-QAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSE-----ALVAG 388 (408) T ss_pred EecCccc-ccccCCcceEEEEeh-h-ccEEEEEecceEEEEeccccchhhcceeeEEEEEeeCcEEeccc-----ceEEE Confidence 1110000 000 00 11111110 0 0011111111000 01111111111000 00000011 11111 Q ss_pred ccEEecccceEEEEecCCCCC Q lcl|NC_019933. 374 ALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 374 d~~v~~~~a~~~l~~~~a~~~ 394 (394) .+...-+. .--+.+.++.+ T Consensus 389 ~~~~~~~~--~~~~~~~~~~~ 407 (408) T protein:vir:74 389 SFTAIADQ--VGNFKTTTSTA 407 (408) T ss_pred EeecccCC--CCCCCCCcccc Confidence 11111110 00011111111 No 237 >protein:vir:2736 Length: 348 # NCBI annotation: putative structural protein # Family: family:all:1083 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695109;genbank:gi:23455878;genbank:GeneID:955608 Probab=76.58 E-value=0.13 Score=25.37 Aligned_cols=276 Identities=15% Similarity=0.119 Sum_probs=122.0 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHH--HhccccccccCceeEEEEc-CcccccceecCCcccccc-ccceeeE Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIR--SLLAQGTMEGNTLEYVRET-GFTNAAAPVAEGAQKPES-SLRFDLV 186 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~--~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~eg~~~~~~-~~~~~~i 186 (394) +..- -...-|..+..-|.++.....+++ .+++..++.+..+.+.... .....+.++..+.+.+.. ...+... T Consensus 1 M~~i----~d~f~~~~l~~~v~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~r~~~~~~ 76 (348) T protein:vir:27 1 MGLI----YDKVTASNIAGYFNALQENVSSTLGESIFPARKQLGTKLSYIKGASGQSVALKAAAFDTNVTIRDRVSAEMH 76 (348) T ss_pred Ccch----hhhcCHHHHHHHHHhccchhhhhhHhhcCCCccccceeEEEEeeccCceeEeeeecCCCCcceecccceeee Confidence 1100 011223333333333333333332 3455555544444443322 222335677766554443 3456777 Q ss_pred EeeeeeEEEeehhhHHHHHH--------HHH----HHHHH---HHHHHHHHHHHHHHHHh----hcc----CCCccc--- Q lcl|NC_019933. 187 QTSAKVIAHWMKASRQILSD--------SAQ----LQSFI---NARLLRGLEVVEENQLL----NGN----GTGQNL--- 240 (394) Q Consensus 187 ~~~~~k~~~~~~is~e~l~~--------s~~----~~~~i---~~~la~a~~~~~d~a~l----~g~----g~~~~~--- 240 (394) ++.+-.++-...++..-++. ++. +...+ ...+.+.+.+.+|..+. +|. +.+... T Consensus 77 ~~~~p~i~~~~~i~~~d~~~~~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~~al~~Gki~i~~~~~~~~vd 156 (348) T protein:vir:27 77 DEQMPFFKEAMLVKENDRQQLNLVKDSGNAVLVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFTSDGVNKDID 156 (348) T ss_pred eeecCccccccccCHHHHHHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEEecCCeeEEEe Confidence 77777776666666443221 111 11222 22344555555554333 221 111100 Q ss_pred cccccc-cccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHH---hhccC----Cc--ccccCcccC Q lcl|NC_019933. 241 LGLLPQ-ATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIEL---LKDTQ----GR--YILGNPQGT 310 (394) Q Consensus 241 ~Gi~~~-~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~---lkd~~----G~--~~~~~~~~~ 310 (394) .|.-.. ..+.+...+.++...+.+|.+....+...+..+..++|++.+|.+|++ +++.- +. .+-+..... T Consensus 157 fg~~~~~~~t~~~~W~~~~adp~~di~~~~~~~~~~G~~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~~~i~~~~~~~ 236 (348) T protein:vir:27 157 YGVKPDHKKQVSKSWAEPGATPLADLEDAIETARELGLNPERAVMNAKTFGLIRKAASTVKVIKPLAGDGSAVTKAELEN 236 (348) T ss_pred ecCCcccceeeeeccCCCCCCHHHHHHHHHHHHHhcCCcccEEEECHHHHHHHhcCHHHHHHhcccCccccccCHHHHHH Confidence 011000 001112344456667888888877777677788899999999999874 33221 11 110000100 Q ss_pred CCceeecceEEEcC------------CCCcCceEEeecc-ceEEEEee------------cceE-------EEEecccch Q lcl|NC_019933. 311 LAPTLWGLPVVATQ------------AMAVGQFLTGAFD-AGAQVFDR------------WAAR-------VEVATENQD 358 (394) Q Consensus 311 ~~~~l~G~pv~~~~------------~~p~~~~~~gd~~-~~~~~~~~------------~~~~-------i~~~~~~~~ 358 (394) .-+++.|++|++-+ .+|++.++++-.. .+...+.. .... +-+..+... T Consensus 237 ~~~~~~g~~i~~yd~~y~d~~G~~~~~~p~~~vvl~~~~~~G~~~yG~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (348) T protein:vir:27 237 YIADNFGVSIVLENGTYRNDKGEVSKFYPDGHLTLIPNGPLGNTVFGTTPEESDLFADNTVNAEVEIVDNGIAVTTTKTT 316 (348) T ss_pred HHHhhcCceEEEEeeEEEcCCCcCcccccCCeEEEEcCCcceeEEeccCcchhhhhhccccccceeeeCCeeEEEeeecC Confidence 11234455555422 3455665544321 12222110 0000 000000000 Q ss_pred hhhcCcEEEEEEEEeccEEecccceEEEEecCCC Q lcl|NC_019933. 359 DFIKNMVTILAEERLALAVYRPESFIKGSLAAAA 392 (394) Q Consensus 359 ~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~ 392 (394) .--...+.+....=-.+.+++++.++|+-++- T Consensus 317 --dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl~~~ 348 (348) T protein:vir:27 317 --DPVNVQTKVSMVALPSFERLDDVYMLTVIPAV 348 (348) T ss_pred --CCceEEEEEeeeeeccccCCCcEEEEEEecCC Confidence 01123444555555667778999999877777 No 238 >protein:vir:102335 Length: 312 # NCBI annotation: putative capsid protein # Family: family:all:701 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529560;genbank:gi:90592716;genbank:GeneID:3974467 Probab=75.57 E-value=0.15 Score=25.18 Aligned_cols=265 Identities=8% Similarity=0.020 Sum_probs=116.9 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHHHhc----c-ccccccCceeEEEEcCcccccceecCCcccccc--ccce Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIRSLL----A-QGTMEGNTLEYVRETGFTNAAAPVAEGAQKPES--SLRF 183 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~----~-~~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~--~~~~ 183 (394) |. ++ .-..+.+...+-+.+...+ +.... . +.-.++.++++|+....+-..+-...+...... +.++ T Consensus 1 Ma-nt-----l~ya~~~~~~LD~~~~~~~-~s~~l~~~~~~v~~~ggktVkIp~i~~~gl~DY~R~~g~~~~~g~v~~~~ 73 (312) T protein:vir:10 1 MA-NT-----LAYGQVLQQGLDKQATQEL-LTGWMDSNAKQIKYEGGKEVKIGKLSTDGLGDYSRGSANAYVGGDVKFEY 73 (312) T ss_pred CC-cc-----hhHHHHHHHHHHHHHHhhh-ccccccCCCceEEEecCcEEEEEeeecccccccccccCCccccccccccc Confidence 11 00 1112334444433333322 21111 1 212456789999986543233323223222322 3455 Q ss_pred eeEEeeeeeEEEeehhhHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHhhccCCCccccccccccccccccccccccc Q lcl|NC_019933. 184 DLVQTSAKVIAHWMKASRQILSDS---AQLQSFINARLLRGLEVVEENQLLNGNGTGQNLLGLLPQATAFAAPITVANAT 260 (394) Q Consensus 184 ~~i~~~~~k~~~~~~is~e~l~~s---~~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~~~Gi~~~~~~~~~~~~~~~~~ 260 (394) ...++...+.-.+. |..-=++.+ ..+.....+.....+.=.+|.-.|.---......+ ..+..+...+.+... T Consensus 74 et~tl~qDR~~~F~-vD~mDvDETn~~~s~anv~~ef~r~~vvPEiDayrfskla~~a~~~~---~~~~~~~~~~~T~~n 149 (312) T protein:vir:10 74 ETKTMTQDRGRKFT-LDAMDVDETNFLVTATTVMGEFQRLKVIPEIDAYRLSRLATIAIGIK---GDTNVEYSYSVNSST 149 (312) T ss_pred eeEEeeecccceee-ccccchhhHhhHHHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhccc---cccccccccccCHHH Confidence 56666666533222 111012222 13444555556666666677766642110000000 000111122234555 Q ss_pred hHHHHHHHHHHhhhhcCC-CCeeEeCHHHHHHHHHhhccCCcccc----cCcccCCCceeecceEEEcCCCCcCceE-Ee Q lcl|NC_019933. 261 AVDRLRLALLQAQLAEFP-ATGIVLNPADWAGIELLKDTQGRYIL----GNPQGTLAPTLWGLPVVATQAMAVGQFL-TG 334 (394) Q Consensus 261 ~~~~i~~~~~~~~~~~~~-~~~~~~~~~~~~~l~~lkd~~G~~~~----~~~~~~~~~~l~G~pv~~~~~~p~~~~~-~g 334 (394) .++.|..++..+.....+ +-.++|+|.++..|.+- ...+... +.......+.|.|+||+..+ .+... -. T Consensus 150 i~~~i~~~~~~lde~~vp~~rvl~vTp~~~~lLk~~--~~~~~~~~~~~~~~i~~~V~~iDgv~Ii~VP---s~r~~t~~ 224 (312) T protein:vir:10 150 IINKIKTGIKIIRENGYNGPLVCHLTYDSMFAIEEK--VLEKLTAVTFAQGGIQTQVPSIDGCALIKTP---QNRMYSSI 224 (312) T ss_pred HHHHHHHHHHHHHHccCCCceEEEeChHHHHHHhhh--hhceecccccccceeeeeeeeecccEEEEch---hhhcccee Confidence 788888888888887665 44689999887666642 1111111 11223344679999999744 22110 00 Q ss_pred eccce--------------------EEEEeecceE--------EEEecccchhhhcCcEEEEEEEEeccEEecc--cce- Q lcl|NC_019933. 335 AFDAG--------------------AQVFDRWAAR--------VEVATENQDDFIKNMVTILAEERLALAVYRP--ESF- 383 (394) Q Consensus 335 d~~~~--------------------~~~~~~~~~~--------i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~--~a~- 383 (394) +|..+ ++++. .... +.+. .+...-..|...+.-..|.|.-|.+. +++ T Consensus 225 ~f~dG~t~~~~~gg~~~~~~ak~INfiiv~-~~a~i~~~K~~~~~if-~P~~~~~~d~~~~~~R~Y~D~fv~~nk~~~Iy 302 (312) T protein:vir:10 225 LLNDGTTSNQTAGGYLKGTKALDTNFIIAP-VDVPLAITKQDKMRIF-DPETNQTANAWSMDYRRYHDLWVTDNKANSVY 302 (312) T ss_pred eeccCcccccccCceeecCcccccceEEeC-Cceeeceeeeeeeeee-CCCCCCCcceeeeeeeeeeeeeeeccccCeEE Confidence 11100 11111 1111 1121 12222223345666677777777764 333 Q ss_pred EEEEecCCCC Q lcl|NC_019933. 384 IKGSLAAAAG 393 (394) Q Consensus 384 ~~l~~~~a~~ 393 (394) +-++-+.+.| T Consensus 303 v~~k~a~~~~ 312 (312) T protein:vir:10 303 ANFKDAKPVG 312 (312) T ss_pred EEeecccCCC Confidence 4455555556 No 239 >protein:vir:100851 Length: 514 # NCBI annotation: hypothetical protein # Family: family:all:2450 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164744;genbank:gi:56693157;genbank:GeneID:3197484 Probab=73.82 E-value=0.17 Score=24.86 Aligned_cols=324 Identities=12% Similarity=0.027 Sum_probs=132.0 Q ss_pred HHHHHHHHHHHhh-cccccccchhhhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcc-cc-cCCcCccccc-hhhhh Q lcl|NC_019933. 54 KAAQQRIAEVEGN-GAGGDVQHISIGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLS-TN-ADGSAGATVQ-TTRLP 129 (394) Q Consensus 54 ~~~e~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-~~~~~g~~ip-~~~~~ 129 (394) .-.+++......+ --++++...-... +.+... ...-....+.+..+.- ++ ++..+|..+. +.+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~--------~~~~~~~~k~a~t~gy~~~~~~~t~gaAlR~EsLd~ 69 (514) T protein:vir:10 1 MYTQDKTKDIMKKSFFGGDRAVAFDTN---KEDILN--------ENLPENVKKSAFTAGHSITPDTQTDGAANRIESLNR 69 (514) T ss_pred CCccchhhHHHhhhhcccceeeeecCc---HHHHHH--------HhcchhhhhhhhccccccCCccccCccchhhhhhcc Confidence 0000000000000 0000000000000 000000 0000000111111111 11 1122333333 33333 Q ss_pred HHHhh--hhhhhhHHHhccccccccCceeEEEEc--CcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHH-- Q lcl|NC_019933. 130 GILEL--PQRRMTIRSLLAQGTMEGNTLEYVRET--GFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQI-- 203 (394) Q Consensus 130 ~ii~~--~~~~~~l~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~-- 203 (394) ++..+ ..+.-.++.-+...+..+--.+|-... +..+-+.+++|++-.+.+++.+....+..+-++....+|.-+ T Consensus 70 ~l~~Lt~~~~~ftf~~~i~k~~a~STV~ey~~~~~~G~~G~~~f~~E~gi~~~~d~~~~rk~~~~k~l~~~~~vS~~~~l 149 (514) T protein:vir:10 70 DLKVTTWGERDFTLYNDIAKQPVDNTVLKYTQYYSHGRTGHSLFQPEIGIGDVNNPNERQRTINIKYIVDTHVTSIALQR 149 (514) T ss_pred ceeEeeecCcchhhhhhcCCchhhHHHhhhhhhcccCcccccccccccccCcCCCcceEEEEEeeeeeeeeeeeeehhhh Confidence 33322 222334566566666655433333322 333456789999999999999999999999888877777654 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCC--------CccccccccccccccccccccccchHHHHHHHHHHhhhh Q lcl|NC_019933. 204 LSDSAQLQSFINARLLRGLEVVEENQLLNGNGT--------GQNLLGLLPQATAFAAPITVANATAVDRLRLALLQAQLA 275 (394) Q Consensus 204 l~~s~~~~~~i~~~la~a~~~~~d~a~l~g~g~--------~~~~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~ 275 (394) .+...+......++-...++..++.++|.|+.. +-.+.||.+.-....+-.......+.+.|..+-..+... T Consensus 150 ~n~i~d~~~~~~~dai~~ia~tiE~a~FyGDs~L~s~~~~~gleFDGl~~lI~~~NvIDarG~~Ls~~~ln~aA~~i~~g 229 (514) T protein:vir:10 150 ANTIVDSLKVQEYAAISTVIKTDEWAMFYGDADLTSGQKGEGLQFDGLFKLIAPENHIDLRGGRLSPAALNMAARKIGEG 229 (514) T ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccCcchhhhHHHhhcCCCeEecCCCCccHHHHhhhhhhhhcc Confidence 343347778888888889999999999987642 234677776654333322223344555555555555566 Q ss_pred cCCCCeeEeCHHHHHHHHHhhccCCcccccCc--------------ccCCCceeecceEEEcCC-CCcCceEEeeccceE Q lcl|NC_019933. 276 EFPATGIVLNPADWAGIELLKDTQGRYILGNP--------------QGTLAPTLWGLPVVATQA-MAVGQFLTGAFDAGA 340 (394) Q Consensus 276 ~~~~~~~~~~~~~~~~l~~lkd~~G~~~~~~~--------------~~~~~~~l~G~pv~~~~~-~p~~~~~~gd~~~~~ 340 (394) +..++-++|+..+.+.|..-....-+-+.+.. ...+.-.|.|--|...++ ++.+ .-.+.+.- T Consensus 230 fGt~TD~ylp~~vka~f~~~~~~~qRV~~~~n~~~~~~G~~v~~f~s~~G~I~L~gs~im~~~n~L~~~-~~~~~~Ap-- 306 (514) T protein:vir:10 230 FGTPTDAYMPIGIKADFVNQHLNGQRVMLPGQTGGMTTGLDIDKFLSAHGSIRIQGSTIMDSDNKLDFD-RPVSPTAP-- 306 (514) T ss_pred cCChhheeCchHHHHHHhhcccCcceEEeecCccceeeeeeccceeEeccceeecCCeeecccccCccC-CccCCcCC-- Confidence 77788899999998877643222222111100 001111222222221111 0000 00000000 Q ss_pred EEEeecceEEEEecccch-----h--------h---hcCcEE-EEEEEE-----------eccEEecccceEEEEecC-C Q lcl|NC_019933. 341 QVFDRWAARVEVATENQD-----D--------F---IKNMVT-ILAEER-----------LALAVYRPESFIKGSLAA-A 391 (394) Q Consensus 341 ~~~~~~~~~i~~~~~~~~-----~--------~---~~~~~~-~~~~~~-----------~d~~v~~~~a~~~l~~~~-a 391 (394) ....+.+.+..+... + | ..+... |++... ++..+....--..|++++ + T Consensus 307 ---~~~~va~svT~~~~g~~~~ad~t~~~g~~~~~~~~g~~~sYaVv~~n~~GeS~ps~~vtaT~a~~~~~i~ltItp~~ 383 (514) T protein:vir:10 307 ---TAPQLSATVTPDGGGLWHEADKTDSKGEVILNKEVGVEQSYVAVMVSRHGDSRPSLVQTATPTKKDDAITLTITPNA 383 (514) T ss_pred ---CCCcceEEEecCcccccCcccccccccccccccccceeEEEEEEEECCCCcccccceeeeeeeccCceEEEEEEecc Confidence 000000111000000 0 0 000000 111111 111111111112333332 2 Q ss_pred CCC Q lcl|NC_019933. 392 AGT 394 (394) Q Consensus 392 ~~~ 394 (394) .|+ T Consensus 384 ~~~ 386 (514) T protein:vir:10 384 MQN 386 (514) T ss_pred Ccc Confidence 222 No 240 >protein:vir:99523 Length: 311 # NCBI annotation: putative protein # Family: family:all:701 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958538;genbank:gi:41179320;genbank:GeneID:2717161 Probab=68.53 E-value=0.24 Score=24.02 Aligned_cols=272 Identities=9% Similarity=-0.036 Sum_probs=110.9 Q ss_pred cCCcCcc-cc--chhhhhHHHhhhhhhhhHHHhc---cc-cccccCceeEEEEcCcccccceecCCccccccccceeeEE Q lcl|NC_019933. 115 ADGSAGA-TV--QTTRLPGILELPQRRMTIRSLL---AQ-GTMEGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQ 187 (394) Q Consensus 115 ~~~~~g~-~i--p~~~~~~ii~~~~~~~~l~~~~---~~-~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~ 187 (394) -++++.. .+ -+.+...+-+.+...+ +-... .. +-.++.++++|+....+-..+-...|-....-+.++...+ T Consensus 1 ~~~~an~mAlnya~~~~~~Ld~~~~~~~-~t~~l~~~~~~~~~Gak~VkIp~i~~~gl~dY~R~~g~~~g~v~~~~et~t 79 (311) T protein:vir:99 1 MPTDAETRGFNYVTKDGNLLDQKITAGL-FTAALGTPEVDLVNGGRSFTLKTISTSGLKDHTRGKGFNSGTISDEKTIYT 79 (311) T ss_pred CCCcchhhHHHHHHHHHHHHHHHHHhhh-cccceecCchheeecCCEEEEEeeeeccccccccccCccccceeeeeeEEE Confidence 1111111 11 2223333333333221 11111 11 1124678999998754333333333322222234555555 Q ss_pred eeeeeEEEeehhhHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhhcc-----CCCcccccccccccccccccccccc Q lcl|NC_019933. 188 TSAKVIAHWMKASRQILSDSA---QLQSFINARLLRGLEVVEENQLLNGN-----GTGQNLLGLLPQATAFAAPITVANA 259 (394) Q Consensus 188 ~~~~k~~~~~~is~e~l~~s~---~~~~~i~~~la~a~~~~~d~a~l~g~-----g~~~~~~Gi~~~~~~~~~~~~~~~~ 259 (394) +...+--.+. |..-=++.+. .+.....+.....+.=.+|.-.|.-- +.+....+..............+.. T Consensus 80 l~~DR~~~f~-vD~mDvdETn~~~~~ani~~~f~r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~~~~~~~~~lt~~ 158 (311) T protein:vir:99 80 MGQDRDVEFY-LDRQDVDETDNELAMANISNVFITEHVQPELDSYRFSKIATSFDNLDGTDTEGTLLAKTHKTEETLDET 158 (311) T ss_pred eeeccceeee-cchhchhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhcccccccchhhhccccccccccCHH Confidence 5555533222 1110122221 23333344444444455665444210 0011110000001111112222334 Q ss_pred chHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhccCCccc----ccCcccCCCceeecceEEEc-CC--CC----- Q lcl|NC_019933. 260 TAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDTQGRYI----LGNPQGTLAPTLWGLPVVAT-QA--MA----- 327 (394) Q Consensus 260 ~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~~----~~~~~~~~~~~l~G~pv~~~-~~--~p----- 327 (394) ..++.|..++..+......+-.++|+|.++..|...+.-+...- -+.......+.|.|.||+.. +. |. T Consensus 159 nvl~~l~~~~~~~~~v~~~~rvl~vTp~~~~lLk~~~~~~r~~~~~~~~~~~i~~~V~~lDgv~Ii~V~ps~r~~t~~~f 238 (311) T protein:vir:99 159 NAYSQLKTGIGKVRKYGTQNLVGYVSSEVMDALERSKEFTRNITNQNVGTTALESRITSIDGVQLIEVYESNRFMTKYDF 238 (311) T ss_pred HHHHHHHHHHHHHHhcCCCCeEEEEChHHHHHHhhchhhheeeecccccccccccccceecCeEEEEecCchhhcchhhh Confidence 45677777777776555556688999999887664332221110 01123344578999998855 32 22 Q ss_pred -cCc----------eEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecc-cceEEEEecCC Q lcl|NC_019933. 328 -VGQ----------FLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRP-ESFIKGSLAAA 391 (394) Q Consensus 328 -~~~----------~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~-~a~~~l~~~~a 391 (394) .|. .++... .+..-... --.+.+.. +...-.-+...+.-..|.|.-|.+. ..-.++.++.| T Consensus 239 t~G~~~~~~ak~INfiiv~~-~a~i~~~K-~~~v~~f~-P~~~~~gd~~l~~~R~Y~D~fv~~nk~~~Iyv~~k~A 311 (311) T protein:vir:99 239 TDGAKPTEDAKAINFLVVAK-PAVISIVK-ENAVFLFA-PGQHTDGDGYLYQNRLYHDLFIKKHKRDGIFVSVKKA 311 (311) T ss_pred cCCccccCcccccceEEeCC-Ceeeeeee-eeeeeeeC-CCCCCCcceeeeeeeeeeeeeeeccccCeEEEeeecC Confidence 110 111111 11111111 11122221 2222223345666677777777764 33445666666 No 241 >protein:vir:96490 Length: 348 # NCBI annotation: head protein # Family: family:all:1083 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238492;genbank:gi:66391768;genbank:GeneID:5176912 Probab=66.57 E-value=0.27 Score=23.74 Aligned_cols=276 Identities=13% Similarity=0.112 Sum_probs=121.1 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhHH--HhccccccccCceeEEEEc-CcccccceecCCcccccc-ccceeeE Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTIR--SLLAQGTMEGNTLEYVRET-GFTNAAAPVAEGAQKPES-SLRFDLV 186 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~--~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~eg~~~~~~-~~~~~~i 186 (394) +..-. ...-+..+..-|-+......+++ .+++..++.+..+.+.... +....+.++..+.+.+.. ...+... T Consensus 1 M~~i~----d~f~~~~l~~~i~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~r~~~~~~ 76 (348) T protein:vir:96 1 MGLIY----DKVTASNIAGYFNTLQENVDSTLGESIFPARKQLGTKLSYIKGASGQSVALKAAAFDTNVTIRDRVSAEIH 76 (348) T ss_pred Ccchh----hccCHHHHHHHHHhcccchhhhhhhhcCCCccccceeEEEEeecCCceeEeeeecCCCCcceecccceeee Confidence 11100 11222233332223333333332 3455555555444443322 223346677776555543 3457777 Q ss_pred EeeeeeEEEeehhhHHHHHH------H---H---HHHHHHHH---HHHHHHHHHHHHH----Hhhcc----CCCcc--c- Q lcl|NC_019933. 187 QTSAKVIAHWMKASRQILSD------S---A---QLQSFINA---RLLRGLEVVEENQ----LLNGN----GTGQN--L- 240 (394) Q Consensus 187 ~~~~~k~~~~~~is~e~l~~------s---~---~~~~~i~~---~la~a~~~~~d~a----~l~g~----g~~~~--~- 240 (394) ++.+-.++-...++..-++. + + .+...+.+ .+.+.+.+.+|.. +.+|. +.+.. . T Consensus 77 ~~~~p~i~~~~~i~~~d~~~l~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~~~~~~~~~vd 156 (348) T protein:vir:96 77 DEQMPFFKEALLVKENDRQQLNLVKDTGNEALINTIVAGIFNDDVTLINGARARLEAMRMQVLATGKIAFTSDGVNKDID 156 (348) T ss_pred eeecCccccccccCHHHHHHHHhhhccCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEeecCCeeEEEe Confidence 77777776666665432211 1 1 12222222 2444555555532 22321 11110 0 Q ss_pred cccccc-cccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHH---hhc----cCCcc--cccCcccC Q lcl|NC_019933. 241 LGLLPQ-ATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIEL---LKD----TQGRY--ILGNPQGT 310 (394) Q Consensus 241 ~Gi~~~-~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~---lkd----~~G~~--~~~~~~~~ 310 (394) .|.-.. ..+.+...+.++...+.+|.+....+...+..+..++|++.+|.+|+. +++ .++.. +-+..... T Consensus 157 fg~~~~~~~t~~~~W~~~~adp~~di~~~~~~~~~~G~~~~~~i~~~~~~~~l~~~~~v~~~~~~~~~~~~~~~~~~~~~ 236 (348) T protein:vir:96 157 YGVKADHKKQVSKSWAEPGATPLADLEDAIETARELGLNPERAIMNAKTFGLIRKAASTVKAIKPLAGDGSSVTKAELQN 236 (348) T ss_pred ccCCcccceeeccccCCCCCCHHHHHHHHHHHHHhcCCcccEEEeCHHHHHHHhcCHHHHHHHhccCCccccccHHHHHH Confidence 011000 001112344455667888887777777667778899999999999863 332 11111 11111111 Q ss_pred CCceeecceEEEcC------------CCCcCceEEeecc-ceEEEEee--cce----------E-------EEEecccch Q lcl|NC_019933. 311 LAPTLWGLPVVATQ------------AMAVGQFLTGAFD-AGAQVFDR--WAA----------R-------VEVATENQD 358 (394) Q Consensus 311 ~~~~l~G~pv~~~~------------~~p~~~~~~gd~~-~~~~~~~~--~~~----------~-------i~~~~~~~~ 358 (394) .-..+.|+++++-+ .+|++.++++-.. .+...+.. ... . +-+..+... T Consensus 237 ~~~~~~g~~i~~y~~~y~d~~G~~~~~~p~~~v~l~~~~~~G~~~yg~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (348) T protein:vir:96 237 YVADNYGVEIVLENGTYRNEKGEVSKFFPDGHLTLIPNGPLGNTVFGTTPEESDLFADNTVNADVEIVDSGIAVTTTKTT 316 (348) T ss_pred HHhhhcCceEEEEccEEEecCCcEeccccCCeEEEEcCCCceeEEeccChhhhhhhhcccccccceecCCeeEEEeeecC Confidence 11234455555422 3455665553221 12211100 000 0 000000000 Q ss_pred hhhcCcEEEEEEEEeccEEecccceEEEEecCCC Q lcl|NC_019933. 359 DFIKNMVTILAEERLALAVYRPESFIKGSLAAAA 392 (394) Q Consensus 359 ~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~ 392 (394) + --...+.+....=-.+.+|+++.++++-++- T Consensus 317 d--P~~~~~~~~s~plPv~~~~~~~~~a~Vl~~~ 348 (348) T protein:vir:96 317 D--PVNVQTKVSMVALPSFERLGDVYMLTVIPGV 348 (348) T ss_pred C--CceEEEEEeeeeeccccCCCcEEEEEEecCC Confidence 0 0123444555555566778999999977777 No 242 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=62.71 E-value=0.33 Score=23.22 Aligned_cols=353 Identities=12% Similarity=0.042 Sum_probs=104.6 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---Hhhcccccccchh- Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEV---EGNGAGGDVQHIS- 76 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~---~~~~~~~~~~~~~- 76 (394) ++++.+++++++++.+++++..+......+..+++.++++++..+..+++.++++.+...... ............. T Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 90 (404) T protein:vir:39 11 NEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLNKSEYELK 90 (404) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhhhH Confidence 667888888888888888777766666566677788888888888888888887655432221 1122211111111 Q ss_pred --hhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhH-----HHhhhhhhhhHHHhccccc Q lcl|NC_019933. 77 --IGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPG-----ILELPQRRMTIRSLLAQGT 149 (394) Q Consensus 77 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~-----ii~~~~~~~~l~~~~~~~~ 149 (394) ..+.+.... ...... .. ..+.++............-...+-..+... .+..+...-++......++ T Consensus 91 ~~~~~~~~~~~--~~~~~~-~~----~~e~~a~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~ 163 (404) T protein:vir:39 91 DKFVKEFVNMV--RNPMAF-LN----TVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRV 163 (404) T ss_pred HHHHHHHHHHH--hcchhh-hh----hhhhhhhhcccccCCceeccHHHHHHHHHHHHhhhhHHhhcceeeccCCcceEE Confidence 111111110 000000 00 001111111111110000111111222221 1211111222211111112 Q ss_pred c---ccCceeEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 150 M---EGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSAQLQSFINARLLRGLEVVE 226 (394) Q Consensus 150 ~---~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~~~~~~i~~~la~a~~~~~ 226 (394) + .+.......... ..-..+.........++..-.+...-.-..--+.+....--..+...+.+.++.++..++ T Consensus 164 ~~~~~~~~~~a~~v~E----g~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~i 239 (404) T protein:vir:39 164 YEKWTDVTPLTVMDAE----DGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAI 239 (404) T ss_pred EEeecCCccceeeecC----ccccccccccceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHH Confidence 1 121111111111 111222333333444444433332211000011111111112334444444444444443 Q ss_pred HHHHhhc--cCCCccccccccccc---c---ccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhcc Q lcl|NC_019933. 227 ENQLLNG--NGTGQNLLGLLPQAT---A---FAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDT 298 (394) Q Consensus 227 d~a~l~g--~g~~~~~~Gi~~~~~---~---~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~ 298 (394) =...=.+ .+......++..... . .....-.....++..|. .+.....+ .+..+.. .+. T Consensus 240 l~g~g~~~~~~~~~~~~~i~~~~~~~~~~~~~~~a~~v~n~~~~~~L~----~lkd~~G~---~l~~~~~-------~~~ 305 (404) T protein:vir:39 240 IAAMGTVPKKPTIAKFDDVITMINTSVDPAIIATSSLLTNQSGLNKLA----LVKTAEGK---YLLEPDP-------TKP 305 (404) T ss_pred HhcccccccccccccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHH----HhhccCCc---eeeccCc-------CCC Confidence 2111111 111222223322110 0 00111112233333333 33322221 1221111 111 Q ss_pred -----CCcccccCcccC-CCceeecceEEEcCCCCcCceEEeeccceEEEE--------eecceEEEEecccchhhhcCc Q lcl|NC_019933. 299 -----QGRYILGNPQGT-LAPTLWGLPVVATQAMAVGQFLTGAFDAGAQVF--------DRWAARVEVATENQDDFIKNM 364 (394) Q Consensus 299 -----~G~~~~~~~~~~-~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~--------~~~~~~i~~~~~~~~~~~~~~ 364 (394) .|.|+....... +.....-.+++.-+. ..-+++++....-..+ ......+.... .-|. T Consensus 306 ~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~--~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~------r~d~ 377 (404) T protein:vir:39 306 NSYLIKGKKVIVVADRWLPNSGSTVYPLYYGDM--SQAITLFDRENMSLLPTNIGAGAFETDTTKIRVID------RFDV 377 (404) T ss_pred CcceecceeEEEecccccCccCCCccEEEEEec--cccEEEEeecceEEEEeccchhhhhhceeeEEEEe------eecc Confidence 244443211000 000000011221111 0011112211100000 00011111100 0000 Q ss_pred EEEEEEEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 365 VTILAEERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 365 ~~~~~~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) ..++-...+ + +..-..+.++|+ T Consensus 378 ~~~~~~a~~---~-----~~~~~~a~~~~~ 399 (404) T protein:vir:39 378 KTTDSEALV---A-----GSFTAIADQVGN 399 (404) T ss_pred EEecccceE---E-----EEeeccccCCCC Confidence 011111111 1 111122333333 No 243 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=60.35 E-value=0.37 Score=22.92 Aligned_cols=360 Identities=10% Similarity=0.012 Sum_probs=98.5 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHh----hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchh Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVK----DQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQHIS 76 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~----~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~ 76 (394) +++|+++.+++.+..+++++.+++... ..+...+++++++.+.++++.+++++++.+................... T Consensus 4 ~~eL~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (397) T protein:vir:49 4 SNELHDLWIAQGDKVENLNEKLNVAMLDDSVSAEELQAIKNERDTAKMKRDLFKEQYTEARANEVANMSEEEKKPLTKNE 83 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccchh Confidence 666555555555555555444443332 2334566777788888888777777765544333222221221111111 Q ss_pred hhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhccccc--CCc-CccccchhhhhHHHhhhhhhhhHHHhccccccc-- Q lcl|NC_019933. 77 IGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNA--DGS-AGATVQTTRLPGILELPQRRMTIRSLLAQGTME-- 151 (394) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~-~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~-- 151 (394) ..........+..+.+........ ..........+. +.. ...++........+..+...-++-.....+++. T Consensus 84 ~~~~~~~~~~~~~~l~~~~~~~~~---~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 160 (397) T protein:vir:49 84 EEVKANFVKDFKNLVRGRYQNLLD---SKTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYEKW 160 (397) T ss_pred hHHHHHHHHHHHHHhhcchhhHHH---hhhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEEee Confidence 111112222233333333221111 111111111100 100 111221111111121111111111111111211 Q ss_pred -cCceeEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 152 -GNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSAQLQSFINARLLRGLEVVEENQL 230 (394) Q Consensus 152 -~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~~~~~~i~~~la~a~~~~~d~a~ 230 (394) +.......... ..-..+.........++..-.+...---..--+.+...+-...+...+...++.++..++=... T Consensus 161 ~~~~~~a~~v~E----~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ail~G~ 236 (397) T protein:vir:49 161 ADITGLAKLDDE----GGQIGQNDDPKLSLIRYAIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAI 236 (397) T ss_pred ccCCcceeeecc----ccccccccccceeeeEeeeeeeEeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 11111111111 1111222222233444443333322110011122222222234555555555555555542211 Q ss_pred hhccCC--Cccccccccccccc-----cccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhcc----- Q lcl|NC_019933. 231 LNGNGT--GQNLLGLLPQATAF-----AAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDT----- 298 (394) Q Consensus 231 l~g~g~--~~~~~Gi~~~~~~~-----~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~----- 298 (394) =.+... .....++....... ..+.-.....++..| ..+..... ..+..|.. .++ T Consensus 237 g~~~~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l----~~lkd~~g---~~l~~~~~-------~~g~~~~l 302 (397) T protein:vir:49 237 GTLPNKPTLAKWDDIIDLQAKVDPAIKQTSLFLTNTSGFTAL----KKVKNAMG---DYLMERDV-------KSPTGYSI 302 (397) T ss_pred ccccccccccCHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHH----HHhhccCC---ceeecccc-------cCCCCcee Confidence 112111 12222332211100 011111222233333 33333322 22332221 111 Q ss_pred CCcccccCcccCC-CceeecceEEEcCCCCcCceEEeeccceEEE--------EeecceEEEEecccchhhhcCcEEEEE Q lcl|NC_019933. 299 QGRYILGNPQGTL-APTLWGLPVVATQAMAVGQFLTGAFDAGAQV--------FDRWAARVEVATENQDDFIKNMVTILA 369 (394) Q Consensus 299 ~G~~~~~~~~~~~-~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~--------~~~~~~~i~~~~~~~~~~~~~~~~~~~ 369 (394) .|.|+........ ...-...++++-+. ...+++++....-.. +......+.... +.+ ...+.. T Consensus 303 ~G~pV~~~~~~~~~~~~~~~~~~~~gd~--~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~--r~d----~~~~~~ 374 (397) T protein:vir:49 303 DGFVVKEISDRFLPNGTGGAMPLYFGDL--KQAVTLFDRQHLSLLSTNIGGGAFETDTTKVRVID--RFD----VVSTDT 374 (397) T ss_pred cceeeEEecccccccccCCceeEEEeec--cceEEEEeecccEEEEeccccchhhcCeeeEEEEE--eec----cEEecc Confidence 2334321100000 00001112221110 011112221110000 011111111100 000 000000 Q ss_pred EEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 370 EERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 370 ~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) .. +.+..-.+.+ +.+++++| T Consensus 375 ~a---~~~~~~~~~~--~~~~~~~~ 394 (397) T protein:vir:49 375 EA---FVPASFKAIA--DQKAKLST 394 (397) T ss_pred cc---eEEEEecccc--cccCcccc Confidence 01 1111111111 11111111 No 244 >protein:vir:3424 Length: 341 # NCBI annotation: capsid component # Family: family:all:1021 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040587;genbank:gi:9626251;genbank:GeneID:2703482 Probab=57.43 E-value=0.43 Score=22.57 Aligned_cols=271 Identities=12% Similarity=0.055 Sum_probs=117.0 Q ss_pred CccccchhhhhHHHhhhhhhhhHHHhc-c-ccccccCceeEEEEcCcccccceecCCcccccc-ccceeeEEeeeeeEEE Q lcl|NC_019933. 119 AGATVQTTRLPGILELPQRRMTIRSLL-A-QGTMEGNTLEYVRETGFTNAAAPVAEGAQKPES-SLRFDLVQTSAKVIAH 195 (394) Q Consensus 119 ~g~~ip~~~~~~ii~~~~~~~~l~~~~-~-~~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~-~~~~~~i~~~~~k~~~ 195 (394) -...-+.++..-+.......+-|++++ + ..+.+...+.+-...+.-..+.++..+.+.+.. .-.+....+.+-.+.- T Consensus 1 ~d~f~~~~L~~~i~~~~~~~~~l~d~~fp~~~~~~t~~v~~~~~~~~~~lap~v~~~~~~~~~~~~~~~~~~~~~p~i~~ 80 (341) T protein:vir:34 1 MSMYTTAQLLAANEQKFKFDPLFLRLFFRESYPFTTEKVYLSQIPGLVNMALYVSPIVSGEVIRSRGGSTSEFTPGYVKP 80 (341) T ss_pred CCCcCHHHHHHHHHhccCccchhHHhcCCcccccccceEEEEEeeCCeeEEEeecCCCCcceeccCceeeeEEecCccCc Confidence 223334455544444444555566653 2 222333334443344444455666665444332 2345555666666665 Q ss_pred eehhhHH-HHHH----------HH--HHHHHHHH---HHHHHHHHHHHHHHh----hcc----CCCcccccc-ccccccc Q lcl|NC_019933. 196 WMKASRQ-ILSD----------SA--QLQSFINA---RLLRGLEVVEENQLL----NGN----GTGQNLLGL-LPQATAF 250 (394) Q Consensus 196 ~~~is~e-~l~~----------s~--~~~~~i~~---~la~a~~~~~d~a~l----~g~----g~~~~~~Gi-~~~~~~~ 250 (394) ...|+-+ +++. ++ .+...+.+ .+.+.+...+|..+. +|. +.+....-+ +.....+ T Consensus 81 ~~~i~~~d~~~r~~g~~~~~~~~~~~~~~~~i~~~l~~l~~~i~~~~E~m~~qaL~~Gki~~~~~g~~~~~vDfg~~~~~ 160 (341) T protein:vir:34 81 KHEVNPQMTLRRLPDEDPQNLADPAYRRRRIIMQNMRDEELAIAQVEEMQAVSAVLKGKYTMTGEAFDPVEVDMGRSEEN 160 (341) T ss_pred cceeCHHHHHHHhhccccccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEEecCCccEEEEEeCCCCcc Confidence 5555543 2211 00 12222222 334456666664333 221 111000000 0000111 Q ss_pred c------ccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHH---hhc-------cCCcccc--cCccc--C Q lcl|NC_019933. 251 A------APITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIEL---LKD-------TQGRYIL--GNPQG--T 310 (394) Q Consensus 251 ~------~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~---lkd-------~~G~~~~--~~~~~--~ 310 (394) . ...+..+....+.+.+....+...+..+..++|++.+|..|.. +++ .+|.... ..... . T Consensus 161 ~~~~t~~~~W~~~~~~~~d~l~di~~~~~~~g~~~~~~i~~~~~~~~l~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (341) T protein:vir:34 161 NITQSGGTEWSKRDKSTYDPTDDIEAYALNASGVVNIIVFDPKGWALFRSFKAVKEKLDTRRGSNSELETAVKDLGKAVS 240 (341) T ss_pred ceEecCCccCCcCCCchHHHHHHHHHHHHhcCCceEEEEeCHHHHHHHhcCHHHHHHHhhccccccccccccccccccee Confidence 1 1122223334455555555566667778889999999998852 221 1111111 00100 1 Q ss_pred CCceeecceEEEcC-----------CCCcCceEEeecc-ceEEEEee-cc-------eE-EEEeccc-chhhhcCcEEEE Q lcl|NC_019933. 311 LAPTLWGLPVVATQ-----------AMAVGQFLTGAFD-AGAQVFDR-WA-------AR-VEVATEN-QDDFIKNMVTIL 368 (394) Q Consensus 311 ~~~~l~G~pv~~~~-----------~~p~~~~~~gd~~-~~~~~~~~-~~-------~~-i~~~~~~-~~~~~~~~~~~~ 368 (394) ..+++.|+++.+-+ .+|++.++++-.. .+...+.. .+ +. ....... ...-......+. T Consensus 241 ~~~~~~g~~i~~y~~~y~ddG~~~~~ip~~~v~l~p~g~~g~~~yg~~~d~~~~~~~~~~~~~~~~~~~~~~dp~~~~~~ 320 (341) T protein:vir:34 241 YKGMYGDVAIVVYSGQYVENGVKKNFLPDNTMVLGNTQARGLRTYGCIQDADAQREGINASARYPKNWVTTGDPAREFTM 320 (341) T ss_pred eeeecCCceEEEEcCEEEECCcEEeeecCCeEEEeeCCCcceEEEeecccccccccceeeeeEeeeeeeecCCCcEEEEE Confidence 11245566665422 2677776654321 11111100 00 00 0000000 000011234455 Q ss_pred EEEEeccEEecccceEEEEec Q lcl|NC_019933. 369 AEERLALAVYRPESFIKGSLA 389 (394) Q Consensus 369 ~~~~~d~~v~~~~a~~~l~~~ 389 (394) +..+.=-.+.+|+++..++++ T Consensus 321 ~~s~pLPv~~~pd~~~~a~V~ 341 (341) T protein:vir:34 321 IQSAPLMLLADPDEFVSVQLA 341 (341) T ss_pred EcccceeeeeCCCcEEEEEeC Confidence 666666677889999999999 No 245 >protein:vir:100603 Length: 529 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656387;genbank:gi:109290138;genbank:GeneID:4156581 Probab=55.22 E-value=0.48 Score=22.31 Aligned_cols=345 Identities=14% Similarity=0.092 Sum_probs=127.4 Q ss_pred hhh-hHHHHHHHHHHHHHHH--HH----HHHHH--HHHHHHHHHHhhcccccccchhhhhhhhhHHHHHHHHHHhhhhhh Q lcl|NC_019933. 29 QEL-NASVRAKVDELLMAQG--AL----QADLK--AAQQRIAEVEGNGAGGDVQHISIGQQFVNSDSFKAMAESGGQRGR 99 (394) Q Consensus 29 ~~~-~~e~~~~~~~~~~~~~--~l----~~~i~--~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 99 (394) +.+ ++++.+++..+++... ++ ++.+- -+|++.....+.+.-.+ ..+.+.+.. .+ ......+. T Consensus 1 ~~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~---~~~~e~~~~-----~l-~e~~~~~~ 71 (529) T protein:vir:10 1 MSLKTKEILNKWTPLLEGEGLPEIAGKNKQALVAQILEAQEKDSKTDPVYRD---DKLIEAFGQ-----SL-MEAEVAGD 71 (529) T ss_pred CccchHHHHHHhhHhhcCCccchhcchhhhhhhhhhhhhHHHHhhcccccch---hhhhhhhhh-----cc-chhhcccc Confidence 211 2335555554443311 01 11110 01111111111110000 000000000 00 00000000 Q ss_pred hhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEE-------EEcC---------- Q lcl|NC_019933. 100 AEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYV-------RETG---------- 162 (394) Q Consensus 100 ~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~-------~~~~---------- 162 (394) .... ... ...++++.+-.-..|.++ .++++........+++.+.||++.+.-+. .... T Consensus 72 ~~~~-~~~---ia~s~~t~~v~~~~P~Li-~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~g~eaf~~ 146 (529) T protein:vir:10 72 HGYD-PTN---IAAGQSSGAITNIGPAVI-GMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEAFHP 146 (529) T ss_pred cccc-ccc---ccccccccccccccchhh-hhHHHHHHhHHhhhhheeccCCchhhhhhhheeeecCCcCCCcccccccc Confidence 0000 000 011111111111112211 22333444556677788888766432210 0000 Q ss_pred ---------------------------------------------------------ccc-------------------- Q lcl|NC_019933. 163 ---------------------------------------------------------FTN-------------------- 165 (394) Q Consensus 163 ---------------------------------------------------------~~~-------------------- 165 (394) ... T Consensus 147 ~~e~dt~~SG~~~~~~~~~~~~~~~~~~t~~~a~~~~~~~~~~~nea~t~~s~~~tg~~~~~g~~~tg~~~~~~~~~~~a 226 (529) T protein:vir:10 147 MYAPDAWHSGLAAKGATTSSDGTPFAALTAGQAVATGDIVYHFFYESGSAYLQNVTGGNVTVGTNETGAALDALVSAKIA 226 (529) T ss_pred cccccccccccccccccccccccccccccccceeeccccceeeecccccccccccccccccccccccCCccccccccccc Confidence 000 Q ss_pred ----------ccceecC---------CccccccccceeeEEeeeeeEEEeehhhHHHHHHHH-----HHHHHHHHHHHHH Q lcl|NC_019933. 166 ----------AAAPVAE---------GAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSA-----QLQSFINARLLRG 221 (394) Q Consensus 166 ----------~~~~~~e---------g~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~-----~~~~~i~~~la~a 221 (394) ...-.+| +...++-..++++++...+.-+=...+|-||.+|-- |.++.|.+-|+.. T Consensus 227 ~~~~~~~~~gmsTa~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAvHGLDAEtELsNILStE 306 (529) T protein:vir:10 227 AGELAEIAEGMATSIAELRQGFNGTTDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANE 306 (529) T ss_pred cccccccccccchhhhhccccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHH Confidence 0000000 112333345566666666666667789999999862 6789999999999 Q ss_pred HHHHHHHHHhhccCCCcc------------cccccccccccccccc-ccc---cchHHHHHHHHHHhh--hhcCCCCeeE Q lcl|NC_019933. 222 LEVVEENQLLNGNGTGQN------------LLGLLPQATAFAAPIT-VAN---ATAVDRLRLALLQAQ--LAEFPATGIV 283 (394) Q Consensus 222 ~~~~~d~a~l~g~g~~~~------------~~Gi~~~~~~~~~~~~-~~~---~~~~~~i~~~~~~~~--~~~~~~~~~~ 283 (394) |...|++.||.--..... ..|++.......+... ... ...+-.+......+. ..+.....++ T Consensus 307 ImlEINReii~~i~~~a~~~~~g~~~~~~~~~gv~d~~~~~d~~~~~~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~vi 386 (529) T protein:vir:10 307 VMLEINREVIDWINYTAQVGKSGWTQTVGSAAGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFII 386 (529) T ss_pred HHHHhhHHHHHHhhhhceeeeeeeeccccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccceEEE Confidence 999999888862211111 1222221111000000 000 111222222222232 2334566899 Q ss_pred eCHHHHHHHHH--hhccCCcccccC--cccCC----Ccee-ecceEEEcCCCCcCceEEeeccceE---EEEee--cceE Q lcl|NC_019933. 284 LNPADWAGIEL--LKDTQGRYILGN--PQGTL----APTL-WGLPVVATQAMAVGQFLTGAFDAGA---QVFDR--WAAR 349 (394) Q Consensus 284 ~~~~~~~~l~~--lkd~~G~~~~~~--~~~~~----~~~l-~G~pv~~~~~~p~~~~~~gd~~~~~---~~~~~--~~~~ 349 (394) +|++....|.. +.+..+..-... ..+.+ -+.| .|++|+.++..|.+-+++|---.-. -+|.. -.+. T Consensus 387 ~S~~Va~~L~~~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv~l~ 466 (529) T protein:vir:10 387 ASRNVVSALALVDAGITPAAQGMASGLNADTTKGVFAGVLGGRYKVYIDQYARQDYFTMGYRGANNLDAGIYYCPYVALT 466 (529) T ss_pred EchHHHHHHhhhccccccccccccccceeecCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccc Confidence 99999999974 233222111110 01112 2344 4689999999988877766321000 00100 0011 Q ss_pred EEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecC-----C--------CCC Q lcl|NC_019933. 350 VEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAA-----A--------AGT 394 (394) Q Consensus 350 i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~-----a--------~~~ 394 (394) .....++ .+-+-.+-...|++..+ +| |+..+-.+ + +|. T Consensus 467 ~~~~~dp----~sfqP~~g~~tRY~l~~-NP--~~~~~~~~~~~r~~~g~~~~~~ag~ 517 (529) T protein:vir:10 467 PLRGSDP----KNFQPVMGFKTRYAIGV-NP--FAESRTQAPTSRISNGMPGAHSVGK 517 (529) T ss_pred cccccCC----Ccccceeeeeeeeceee-cC--ccccccccccccccCCcchhhhcCc Confidence 1111111 11122333344444432 33 22111111 0 111 No 246 >protein:vir:6601 Length: 528 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891732;genbank:gi:33620668;genbank:GeneID:1725275 Probab=47.87 E-value=0.69 Score=21.48 Aligned_cols=347 Identities=16% Similarity=0.101 Sum_probs=131.8 Q ss_pred CchHHHHHHHHHHHHHHHH--HHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-cchhh Q lcl|NC_019933. 1 MSDINAINSTLANISDSLK--AHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDV-QHISI 77 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k--~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~-~~~~~ 77 (394) ||+-++|+++|.-+.+... ++.+...+ ..++ ++ . |++.......+.-.+. -..++ T Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~--~~~a----~l---~-------------enq~~~~~~~~~~~~~~~~~~~ 58 (528) T protein:vir:66 1 MKTTKELMEKWSPLLENEKLPEIATASKQ--KLVA----KI---L-------------ESQEADFAVDPIYKDEKVVEAF 58 (528) T ss_pred CcchHHHHHHhHHhhcCCCcchhcchhhh--hhhh----hh---h-------------hhhHHHhhcccchhhHHHHHhh Confidence 9999999999988765311 00000000 0000 00 0 0000000000000000 00000 Q ss_pred hhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCcee- Q lcl|NC_019933. 78 GQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLE- 156 (394) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~- 156 (394) + .+.......+...... ..+....++ ..-.-+.|.++ .++++........+++.+.||++.+.- T Consensus 59 ~----------~~l~ea~~~~~~~~~~-~~i~es~~t---~~v~~~~P~Li-~lvRRa~p~LIa~DIwGVQPMTgPTGlI 123 (528) T protein:vir:66 59 G----------GFIAEAEVAGDHGYDA-SQIAAGQTT---GAITNVGPAVI-GMVRRAIPNLIAFDICGVQPMSTPTSQI 123 (528) T ss_pred h----------hhhhhhcccccccccc-hhccccccc---cccccCchhHH-HHHHHHHHhhhhhhhheeecCCchhhhh Confidence 0 0000000000000000 000000000 00000111111 122223334445666666666552100 Q ss_pred ------E-------------------------------------------------------EEEcC------------- Q lcl|NC_019933. 157 ------Y-------------------------------------------------------VRETG------------- 162 (394) Q Consensus 157 ------~-------------------------------------------------------~~~~~------------- 162 (394) + ..... T Consensus 124 FAmRs~Y~~~~~~~~~~eAfh~~~g~ea~fsea~t~~a~~gGpTGliFAm~s~y~s~~~g~ea~~nea~t~fs~~~~~~~ 203 (528) T protein:vir:66 124 FAIRSVYGGDPLKSGAREAFHPMYAPDAFHSSLAAKEATVGSPTGTAFAKLTLSQAITAGDIVYHTFAETGIAYLQNVTG 203 (528) T ss_pred eeeeeeecCCcccccccccccccccccccccccccccccccCCccceeecccccccccccceeeecccccceeeeccccc Confidence 0 00000 Q ss_pred --cc-------------------ccc--------ceecC---------CccccccccceeeEEeeeeeEEEeehhhHHHH Q lcl|NC_019933. 163 --FT-------------------NAA--------APVAE---------GAQKPESSLRFDLVQTSAKVIAHWMKASRQIL 204 (394) Q Consensus 163 --~~-------------------~~~--------~~~~e---------g~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l 204 (394) .. ... .-.+| +...++-..++++++...+.-+=...+|-|+. T Consensus 204 ~~~~~~~~g~~~g~~~~~~~~a~~~~~~~~~Gm~Ta~aEale~lg~~s~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELA 283 (528) T protein:vir:66 204 DSVTPQKVGSESEDEVVMKLIEEGKLAEIAFGMATSIAEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVA 283 (528) T ss_pred cccccCcccccccccccccccccccceecccccchhhhhhhcccCCCcccchhhcceEEEeEEEEeeccceeccccHHHH Confidence 00 000 00001 11234445566677777777777788999999 Q ss_pred HHHH-----HHHHHHHHHHHHHHHHHHHHHHhhccCCCcc--ccccc----ccccccccccccc--c-cchHHH------ Q lcl|NC_019933. 205 SDSA-----QLQSFINARLLRGLEVVEENQLLNGNGTGQN--LLGLL----PQATAFAAPITVA--N-ATAVDR------ 264 (394) Q Consensus 205 ~~s~-----~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~--~~Gi~----~~~~~~~~~~~~~--~-~~~~~~------ 264 (394) +|-- |.++.|.+-|+..|...|++.||.--..... .+|+. ..++........+ + --..+. T Consensus 284 QDLKAIHGLDAEtELsNILStEImlEINREii~~i~~~a~~~~~~~t~~~~~~aG~~dl~~~~d~~g~rw~~e~~k~L~~ 363 (528) T protein:vir:66 284 QDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIY 363 (528) T ss_pred HHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhheeeeeeeeeeeccccccceeecccccccccchhHHHHHHHHHH Confidence 9862 6789999999999999999998743211111 11211 1111111111110 0 001222 Q ss_pred -HHHHHHHhh--hhcCCCCeeEeCHHHHHHHHHh-----hccC-CcccccCcccCC----Cceee-cceEEEcCCCCcCc Q lcl|NC_019933. 265 -LRLALLQAQ--LAEFPATGIVLNPADWAGIELL-----KDTQ-GRYILGNPQGTL----APTLW-GLPVVATQAMAVGQ 330 (394) Q Consensus 265 -i~~~~~~~~--~~~~~~~~~~~~~~~~~~l~~l-----kd~~-G~~~~~~~~~~~----~~~l~-G~pv~~~~~~p~~~ 330 (394) +......+. ..+...+.+++|++....|... .+.. .++.+. .+.. .+.|. |++|+.++..|.+- T Consensus 364 ~i~~~an~I~~~T~r~~gn~vi~S~~Va~~L~~~g~~~~~~~~~~~~~~~--~d~~~~~~~G~l~~~~~vy~D~y~~~dy 441 (528) T protein:vir:66 364 QIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAKGLN--TDTTKAVFAGVLAGKYKVFIDQYARQDY 441 (528) T ss_pred HHHHHHHHHHHhhccccccEEEEchHHHHHHhhccccccccccccccccc--cCCCCceeEEEecCceEEEecCCCCcce Confidence 222223332 2234457899999999888753 1111 112221 1122 24454 68999999998887 Q ss_pred eEEeeccc-----eEEEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecccc----------------------- Q lcl|NC_019933. 331 FLTGAFDA-----GAQVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPES----------------------- 382 (394) Q Consensus 331 ~~~gd~~~-----~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a----------------------- 382 (394) +++|---. +++...--.+......++. .| +-.+-...|++..+ +|=+ T Consensus 442 ~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~-sf---qP~~g~~tRY~l~v-NP~~~~~~~~~~~ri~~g~~~~~~ag~ 516 (528) T protein:vir:66 442 FTVGYKGDNEMDAGIYYAPYVALTPLRATDPQ-SF---HPVLGFKTRYGIGI-NPFADSKSQEPSARITSGMLSKDSVGK 516 (528) T ss_pred EEEEEeCCcccccceeecccccceeeEeeCCc-cc---cceeeeeeeeceee-cCcccccCccccccccccchhhhhcCc Confidence 76653210 0000000011111111111 11 12233334444332 2200 Q ss_pred ---eEEEEecCC Q lcl|NC_019933. 383 ---FIKGSLAAA 391 (394) Q Consensus 383 ---~~~l~~~~a 391 (394) ++++-+|-- T Consensus 517 n~~~r~~~Vk~~ 528 (528) T protein:vir:66 517 NAYFRRVWVKGC 528 (528) T ss_pred cceeEEeeeccC Confidence 111111111 No 247 >protein:vir:393 Length: 341 # NCBI annotation: gp8 # Family: family:all:1021 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046903;genbank:gi:9630472;genbank:GeneID:1261647 Probab=47.46 E-value=0.7 Score=21.43 Aligned_cols=271 Identities=13% Similarity=0.053 Sum_probs=115.6 Q ss_pred CccccchhhhhHHHhhhhhhhhHHHhc-c-ccccccCceeEEEEcCcccccceecCCcccccc-ccceeeEEeeeeeEEE Q lcl|NC_019933. 119 AGATVQTTRLPGILELPQRRMTIRSLL-A-QGTMEGNTLEYVRETGFTNAAAPVAEGAQKPES-SLRFDLVQTSAKVIAH 195 (394) Q Consensus 119 ~g~~ip~~~~~~ii~~~~~~~~l~~~~-~-~~~~~~~~~~~~~~~~~~~~~~~~~eg~~~~~~-~~~~~~i~~~~~k~~~ 195 (394) -...-+.++..-|.......+-|++++ + ..+.+...+.+-...+.-..+.++..+.+-+.. .-.+....+.+-.+.- T Consensus 1 ~d~f~~~~L~~~i~~~~~~~~~l~~~~Fp~~~~~~t~~v~~~~~~~~~~lap~v~~~~~~~~~~~~~~~~~~~~~p~i~~ 80 (341) T protein:vir:39 1 MSVYTTAQLLAVNEKKFKFDPLFLRIFFRETYPFSTEKVYLSQIPGLVNMALYVSPIVSGKVIRSRGGSTSEFTPGYVKP 80 (341) T ss_pred CCccCHHHHHHHHHhhcCccchhHhhcCCcccccCcceEEEEEecCCceeeEEecCCCCcceecccceeeeeEeccccCc Confidence 223334455544444444555566653 2 222233334443344444455566665444332 2345555666666655 Q ss_pred eehhhHHHHH-H----------HH--HHHHHHH---HHHHHHHHHHHHHHH----hhcc----CCCcccccc-ccccccc Q lcl|NC_019933. 196 WMKASRQILS-D----------SA--QLQSFIN---ARLLRGLEVVEENQL----LNGN----GTGQNLLGL-LPQATAF 250 (394) Q Consensus 196 ~~~is~e~l~-~----------s~--~~~~~i~---~~la~a~~~~~d~a~----l~g~----g~~~~~~Gi-~~~~~~~ 250 (394) ...++-+-+. . ++ .....+. ..+.+.+...+|..+ ++|. +.+.....+ +.....+ T Consensus 81 ~~~i~~~d~~~r~~g~~~~~~~~~~~~~~~~i~~~~~~l~~~i~~r~E~m~~qaL~~Gki~i~~~g~~~~~vDfg~~~~~ 160 (341) T protein:vir:39 81 KHEVNPLMTLRRLPDEDPQNLADPVYRRRRIILQNMKDEELAIAQVEEKQAVAAVLSGKYTMTGEAFEPVEVDMGRSAGN 160 (341) T ss_pred ccccCHHHHHHHhhcccccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCceEEEcCCCcEEEEeccCCccc Confidence 5555543221 0 00 1111222 234445555555433 3331 111110000 0000011 Q ss_pred c------ccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHH---hhcc----CCcc-cccC-ccc---C-- Q lcl|NC_019933. 251 A------APITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIEL---LKDT----QGRY-ILGN-PQG---T-- 310 (394) Q Consensus 251 ~------~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~---lkd~----~G~~-~~~~-~~~---~-- 310 (394) . ..++..+....+.+.+...-+...+..+..++|++.+|..|.. +++. .+.. .+.. ... + T Consensus 161 ~~~lt~~~~W~~~~~~~~d~l~di~~~~~~~g~~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (341) T protein:vir:39 161 NIVQAGAAAWSSRDKETYDPTDDIEAYALNASGVVNIIVFDPKGWALFRSFKAVKEKLDTRRGSNSELETALKDLGKAVS 240 (341) T ss_pred eeEecCCccCCCCCCchHHHHHHHHHHHHhcCCceEEEEeChHHHHHHhcCHHHHHHHhhcccccccccchhhhhhhHhh Confidence 1 1112222233444444444445556677889999999999863 3321 1110 0010 000 0 Q ss_pred CCceeecceEEEcC-----------CCCcCceEEeecc-ceEEEEee-cce------EEEEecccc---hhhhcCcEEEE Q lcl|NC_019933. 311 LAPTLWGLPVVATQ-----------AMAVGQFLTGAFD-AGAQVFDR-WAA------RVEVATENQ---DDFIKNMVTIL 368 (394) Q Consensus 311 ~~~~l~G~pv~~~~-----------~~p~~~~~~gd~~-~~~~~~~~-~~~------~i~~~~~~~---~~~~~~~~~~~ 368 (394) .-+++.|+++.+-+ .+|++.++++-.. .+...+-. .++ ......... ....-....+. T Consensus 241 ~~~~~~g~~i~~y~~~y~d~g~~~~~ip~~~~~l~p~~~~g~~~yg~~~d~~~~~~~~~~~~~~~~~~~~~~dp~~~~~~ 320 (341) T protein:vir:39 241 YKGMYGDVAIVVYSGQYIENDVKKNYLPDLTMVLGNTQARGLRTYGCILDADAQREGINASTRYPKNWVQTGDPAREFTM 320 (341) T ss_pred hhhhhcCceEEEEccEEEecCcEEeeecCCeEEEeeCCCcceEEEecccchhhcccceeeeeeeeeeeeecCCCcEEEEE Confidence 11235566655522 2567766654321 12211100 000 000000000 00011245566 Q ss_pred EEEEeccEEecccceEEEEec Q lcl|NC_019933. 369 AEERLALAVYRPESFIKGSLA 389 (394) Q Consensus 369 ~~~~~d~~v~~~~a~~~l~~~ 389 (394) +....=-.+.+|+++++++++ T Consensus 321 ~~s~plPv~~~p~~~~~a~V~ 341 (341) T protein:vir:39 321 IQSAPLMLLADPDEFVSVKLA 341 (341) T ss_pred EeccccceeeCCCcEEEEEeC Confidence 666666777889999999999 No 248 >protein:vir:104915 Length: 470 # NCBI annotation: T4-like major capsid protein # Family: family:all:364 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214367;genbank:gi:61806007;genbank:GeneID:3294435 Probab=46.34 E-value=0.74 Score=21.31 Aligned_cols=342 Identities=13% Similarity=0.058 Sum_probs=127.0 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhhhhh Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQHISIGQQ 80 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~ 80 (394) ||+-++|+++|.-+.+..- + .+++.... +.+.+.|- |++.....+.... +.+. T Consensus 3 ~~~~e~l~~kw~p~l~~~~-~-----------~~i~~~~~------~~v~a~l~--enq~~~~~~~~~~-------l~e~ 55 (470) T protein:vir:10 3 MFNSEYLQEKWAPILDYDG-L-----------DPIKDSHR------RSVTAVLL--ENQEKELREERNF-------LSEA 55 (470) T ss_pred cchhHHHHHhhhhhhcCCc-c-----------chhcchhh------hhhhhhhh--hhhHHHHhhccch-------hhhh Confidence 8888888888887765411 0 00000000 00000000 0010000000000 0000 Q ss_pred hhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEE Q lcl|NC_019933. 81 FVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRE 160 (394) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 160 (394) .. ....-+.. ... ...++.+.+-.-..|.++ .++++........+++.+.||++.+.-+.-. T Consensus 56 ~~----------~~~~~~~~----~~~---i~~st~t~~v~~~~P~Li-~lvRra~p~LIa~DIwGVQPMTgPTGLIFAm 117 (470) T protein:vir:10 56 PN----------VNTNSGAT----AGF---SADATAAGPVAGFDPVLI-SLIRRSMPNLVAYDLAGVQPMNGPTGLIFAM 117 (470) T ss_pred hh----------cccccccc----ccc---cccccccccccccCchhh-hhHHHHHhhhhhhhhheeecCCccceeeeEE Confidence 00 00000000 000 001111111011112221 1333344455667778888887654433211 Q ss_pred c----Cccc-cc-------ceec--------------------------------------------------------- Q lcl|NC_019933. 161 T----GFTN-AA-------APVA--------------------------------------------------------- 171 (394) Q Consensus 161 ~----~~~~-~~-------~~~~--------------------------------------------------------- 171 (394) . ...+ .+ .|-+ T Consensus 118 RsrY~n~sG~EaffnEA~T~fSG~~~~~~~~~~~~~~~a~~~g~~~~~~~gt~~~~~~~~~~~a~~~~y~~~~GMsTa~a 197 (470) T protein:vir:10 118 RSRYKTQSGTEALFNEADTAFSGQPDGLDDTSGFTATGANNVGLGTTAQQGSNPGLLNSTAAQTNATDYNVGQGMRTDSA 197 (470) T ss_pred EEEecCCCccceeeecCCcccCcccccccccccccccccccccccccccccccccccccccccccccccccccccchHHh Confidence 1 0000 00 0000 Q ss_pred ------CCccccccccceeeEEeeeeeEEEeehhhHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHhhccCCCcc- Q lcl|NC_019933. 172 ------EGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSA-----QLQSFINARLLRGLEVVEENQLLNGNGTGQN- 239 (394) Q Consensus 172 ------eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~-----~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~- 239 (394) .+...++-..++++++...+.-+-...+|-|+.+|-- |.++.|.+-|+..|...|++.+|.---+... T Consensus 198 E~lg~s~~~~f~EMaFsIeK~tVtAKSRaLKAeYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~~a~~ 277 (470) T protein:vir:10 198 EDLGDGTGDQFNQMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSTEILAEINREVIRTIYNVAEP 277 (470) T ss_pred hhcCCCCCcccceeeeEEEEEEEEeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhhhh Confidence 0111233344555566666666666789999999862 5788888888888888888888753221111 Q ss_pred --ccccccccccccccccccccchHHHHHHHHHHh---------hhhcCCCCeeEeCHHHHHHHHH--hhccCC---ccc Q lcl|NC_019933. 240 --LLGLLPQATAFAAPITVANATAVDRLRLALLQA---------QLAEFPATGIVLNPADWAGIEL--LKDTQG---RYI 303 (394) Q Consensus 240 --~~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~---------~~~~~~~~~~~~~~~~~~~l~~--lkd~~G---~~~ 303 (394) ..|+ ...+.........+....+.+..++..+ .......+.+++|++....|.. +-+..+ ..+ T Consensus 278 ~k~~~~-~~~Gv~Dl~~~~~gr~~~e~~~~l~~~i~~ean~i~~~t~r~~~n~~i~S~~Va~~La~sG~l~~~~~~~~~~ 356 (470) T protein:vir:10 278 GAQANV-AAAGTFDLDTDSNGRWSVEKFKGLIFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANL 356 (470) T ss_pred ceeccc-cccceEEeecccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchhHHhHhhhcccccccccccccc Confidence 1111 1111111111111111222222222222 2334456689999999888853 111110 011 Q ss_pred ccCcccC-CCcee-ecceEEEcCCCC------cCceEEeeccceEE----EEeecceEEEEecccchhhhcCcEEEEEEE Q lcl|NC_019933. 304 LGNPQGT-LAPTL-WGLPVVATQAMA------VGQFLTGAFDAGAQ----VFDRWAARVEVATENQDDFIKNMVTILAEE 371 (394) Q Consensus 304 ~~~~~~~-~~~~l-~G~pv~~~~~~p------~~~~~~gd~~~~~~----~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 371 (394) -.+..+. ..+.| .|++|+.++.+. .+-+++|- +.... +|...= +..+.....+-.+-+-.+-... T Consensus 357 ~~D~t~~~~~G~l~~~~~vy~d~y~~~~~~a~~dy~~vG~-KG~~~~~~glfy~PY--v~l~~~~~~dp~sfqP~~g~~t 433 (470) T protein:vir:10 357 NVDDTGNTFAGILQGKYRVYIDPFSASGGAAATQYYVVGY-KGSSPYDAGLFYCPY--VPLQMVRAVGQDTFQPKIGFKT 433 (470) T ss_pred ccCCCCceEEEEecCceEEEeeccccccCcccccEEEEEE-ecCcceecceeeccc--cccccCCCCCCccccceeeeee Confidence 1111110 11344 358999987544 33333332 21000 010000 1111111111111122333334 Q ss_pred EeccEEecccc-----------------eEEEEecCCC Q lcl|NC_019933. 372 RLALAVYRPES-----------------FIKGSLAAAA 392 (394) Q Consensus 372 ~~d~~v~~~~a-----------------~~~l~~~~a~ 392 (394) |++..+ +|=. |.++.++--- T Consensus 434 RY~l~~-NP~~~~~~~~~~~i~~~~n~y~r~~~v~~l~ 470 (470) T protein:vir:10 434 RYGLVE-NPFSQGTTQGLGTLTRNSNRYYRRVKVANLM 470 (470) T ss_pred eeceee-cCcccCCCcccccccCCCCceeeEEEeeccC Confidence 444332 2211 1111111111 No 249 >protein:vir:106998 Length: 468 # NCBI annotation: major capsid protein gp23 # Family: family:all:364 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195142;genbank:gi:58532919;uniprot:Q5GQN0;genbank:GeneID:3260495 Probab=45.93 E-value=0.75 Score=21.26 Aligned_cols=345 Identities=12% Similarity=0.040 Sum_probs=128.3 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc-cccccchhhhh Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGA-GGDVQHISIGQ 79 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~-~~~~~~~~~~~ 79 (394) |-+-++|+++|.-+.+..-. .++.... .+.+-++|= |++......... -.+....+++. T Consensus 1 ~~~~e~l~~kW~plLe~~~~------------~~i~~~~------k~~i~a~ll--ENQe~~~~~~~~~~~~~~~~~~~~ 60 (468) T protein:vir:10 1 MFNAEHLQEKWSPVLNHGEA------------PAIGDRY------KRAVTSVLL--ENQERFLREERGMLNEVAVNSLGA 60 (468) T ss_pred CcchHHHHHhhhHhhcCCcc------------chhccch------hhhhhhhhh--hhHHHHHhccccccchhhHhhcCC Confidence 88888888888877654110 0000000 000000000 001000100000 00000001100 Q ss_pred hhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEE Q lcl|NC_019933. 80 QFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVR 159 (394) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~ 159 (394) ..+.-...+.. .+++....+ +.|.++ .+.++........+++.+.||++.+.-+.- T Consensus 61 --------------------~~~~~~n~~~~-~~~t~~v~~--~~P~Li-~l~RRa~p~LIa~DIwGVQPMTgPTGLIFA 116 (468) T protein:vir:10 61 --------------------GTIAPAGSALG-SANTGGLAG--FDPVLI-SLVRRAMPNLMAYDVCGVQPMSGPTGLIFA 116 (468) T ss_pred --------------------cccchhhhhhh-hcccccccc--cCchhh-hhHHHHHhhhhhhhceeeecCCccceeeeE Confidence 00000001111 011111111 112221 122233345556777777777765433321 Q ss_pred Ec----Cccc-cc-------cee------------------------------------------------cC-----Cc Q lcl|NC_019933. 160 ET----GFTN-AA-------APV------------------------------------------------AE-----GA 174 (394) Q Consensus 160 ~~----~~~~-~~-------~~~------------------------------------------------~e-----g~ 174 (394) .. ...+ .+ .|- .| +. T Consensus 117 mRsrY~n~~g~EAf~nEadt~fSg~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~~~~~~g~gMsTa~aE~lG~~~~ 196 (468) T protein:vir:10 117 MRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANR 196 (468) T ss_pred EEEEecCCCCccceeccccccccccccccccccccccccccccCCCCCcccccccccccccccccccchHHHhhcCCCCc Confidence 11 0000 00 000 00 11 Q ss_pred cccccccceeeEEeeeeeEEEeehhhHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHhhccCCCcc---ccccccc Q lcl|NC_019933. 175 QKPESSLRFDLVQTSAKVIAHWMKASRQILSDSA-----QLQSFINARLLRGLEVVEENQLLNGNGTGQN---LLGLLPQ 246 (394) Q Consensus 175 ~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~-----~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~---~~Gi~~~ 246 (394) ..++-..++++++...+.-+-...+|-|+.+|-- |.++.|.+-|+..|...+++.+|.---+... ..|+. . T Consensus 197 ~f~EMaFsIeK~tVtAKSRaLKAeYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~va~~~k~~g~~-~ 275 (468) T protein:vir:10 197 LFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVA-N 275 (468) T ss_pred ccceeeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhcHHHHHhHhhhhhheeccccc-c Confidence 2333445566666666666667789999999862 5788888888888888888877753211111 11111 1 Q ss_pred cccccccccccccchHH-------HHHHHHHHh--hhhcCCCCeeEeCHHHHHHHHH---hhccCC---cccccCc-ccC Q lcl|NC_019933. 247 ATAFAAPITVANATAVD-------RLRLALLQA--QLAEFPATGIVLNPADWAGIEL---LKDTQG---RYILGNP-QGT 310 (394) Q Consensus 247 ~~~~~~~~~~~~~~~~~-------~i~~~~~~~--~~~~~~~~~~~~~~~~~~~l~~---lkd~~G---~~~~~~~-~~~ 310 (394) ++........++.-..+ .+......+ .......+.+++|++....|.. +....+ +.-.... .+. T Consensus 276 ~Gv~d~~~~~~~rw~~e~~k~L~~~i~~ean~i~~~T~rg~gn~ii~S~~Va~~L~~sG~l~~~~~~~~~~~~~~~~~D~ 355 (468) T protein:vir:10 276 AGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDD 355 (468) T ss_pred cccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccccccEEEechhHHHHHhhcCcceeccccccccccccccccc Confidence 11111111111111112 122222222 2334566789999999999985 332111 1111000 111 Q ss_pred C----Ccee-ecceEEEcCCCC----cCceEEeeccceE----EEEeecceEEEEecccchhhhcCcEEEEEEEEeccEE Q lcl|NC_019933. 311 L----APTL-WGLPVVATQAMA----VGQFLTGAFDAGA----QVFDRWAARVEVATENQDDFIKNMVTILAEERLALAV 377 (394) Q Consensus 311 ~----~~~l-~G~pv~~~~~~p----~~~~~~gd~~~~~----~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v 377 (394) . .+.| .|++|+.++.+. .+-+++|- +... -+|...=+.+.. ....+-.+-+-.+-...|++..+ T Consensus 356 tg~~~~G~l~~r~~vy~D~Ya~~~s~~dY~~vG~-KG~~~~d~glfyaPYv~l~~--~~~~dp~sfqP~~g~~tRY~l~~ 432 (468) T protein:vir:10 356 TGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGY-KGTSPYDAGLFYCPYVPLQM--VRSIDPNTFQPKIGFKTRYGMVS 432 (468) T ss_pred CcceEEEEecCceEEEEccccccCCccceEEEEE-ecCcceeceeeecccccccc--ccccCCCcccceeeeeeeeceee Confidence 1 1334 368999887653 34444432 1100 001000000110 00001111122333344444332 Q ss_pred ecccce-EEEEecCCCCC Q lcl|NC_019933. 378 YRPESF-IKGSLAAAAGT 394 (394) Q Consensus 378 ~~~~a~-~~l~~~~a~~~ 394 (394) +|=+. ..++-..+.|. T Consensus 433 -NP~~~~~~~~~g~~~~~ 449 (468) T protein:vir:10 433 -NPFVTTNGLYNGTPDGE 449 (468) T ss_pred -cccceeccccCCCcccc Confidence 22111 11111111111 No 250 >protein:vir:10123 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859253;genbank:gi:32171009;genbank:GeneID:2653345 Probab=44.10 E-value=0.82 Score=21.06 Aligned_cols=299 Identities=11% Similarity=0.073 Sum_probs=117.3 Q ss_pred chhhhhhhhhH-HHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhh--hhhhhHHHhcccccc Q lcl|NC_019933. 74 HISIGQQFVNS-DSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELP--QRRMTIRSLLAQGTM 150 (394) Q Consensus 74 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~--~~~~~l~~~~~~~~~ 150 (394) .-..+...... .+...+............-. +....|......+.... ...++|...-.-..- T Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~ 66 (404) T protein:vir:10 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILT--------------EQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQ 66 (404) T ss_pred CCCcCCcchhhhHHHHHHHHHhcCChhHhhhh--------------hhhhhhhhhccchhhccCCCCCccEEEeecCCCC Confidence 11111100000 00000000000000000000 00000000000000000 011111111111112 Q ss_pred ccCceeEEEEcCcccccceecCCccc--cccccceeeEEeeeeeEEEeehhhHHHHH-HHH-HHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 151 EGNTLEYVRETGFTNAAAPVAEGAQK--PESSLRFDLVQTSAKVIAHWMKASRQILS-DSA-QLQSFINARLLRGLEVVE 226 (394) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~eg~~~--~~~~~~~~~i~~~~~k~~~~~~is~e~l~-~s~-~~~~~i~~~la~a~~~~~ 226 (394) .|..+++...... ...+|.+++.. .+..++|.+-.+.+..+...+.....+-+ -++ +|...-++.|...+...+ T Consensus 67 aGd~vtf~L~~~L--~g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~ 144 (404) T protein:vir:10 67 AGDEVTFSIMHKL--SKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQ 144 (404) T ss_pred CCcEEEEeEeeec--ccCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHH Confidence 2444444433211 12223322222 23456777777777777777766665544 344 899999999999999999 Q ss_pred HHHHh---hccCCC-------------ccccc-----cccc---------cccccccccccccchHHHHHHHHHHhhh-- Q lcl|NC_019933. 227 ENQLL---NGNGTG-------------QNLLG-----LLPQ---------ATAFAAPITVANATAVDRLRLALLQAQL-- 274 (394) Q Consensus 227 d~a~l---~g~g~~-------------~~~~G-----i~~~---------~~~~~~~~~~~~~~~~~~i~~~~~~~~~-- 274 (394) |+.+| .|.... ....+ +... ..+.......++..+++.|-.+...+.. T Consensus 145 d~~~~~~laG~rg~~~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~ 224 (404) T protein:vir:10 145 DQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMA 224 (404) T ss_pred HHHHHHHHhccccccccccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhC Confidence 98887 232210 00110 0000 0001111222333344444333333311 Q ss_pred -----hcC---C------CCeeEeCHHHHHHHHHh----------hc------cCCcccccCcccCCCceeecceEEEcC Q lcl|NC_019933. 275 -----AEF---P------ATGIVLNPADWAGIELL----------KD------TQGRYILGNPQGTLAPTLWGLPVVATQ 324 (394) Q Consensus 275 -----~~~---~------~~~~~~~~~~~~~l~~l----------kd------~~G~~~~~~~~~~~~~~l~G~pv~~~~ 324 (394) ... . .=+++|||.-+..|++= +. ...+|||. +.-+.+.|.+|+.-+ T Consensus 225 ~pi~Pv~~~g~~~~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~----G~~gm~ngvii~~~~ 300 (404) T protein:vir:10 225 HPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFK----GECAMWRNILVRKYA 300 (404) T ss_pred CCCcceEeccccccCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCcee----cCeeEEcCEEEEecC Confidence 111 1 12579999998888752 21 11245543 333466776665443 Q ss_pred CCC------------------------cC----c-eEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEecc Q lcl|NC_019933. 325 AMA------------------------VG----Q-FLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLAL 375 (394) Q Consensus 325 ~~p------------------------~~----~-~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~ 375 (394) .+| ++ . .++|-..-++.++...+....+..+..++ .|.+.+-+...+|+ T Consensus 301 ~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~--g~~~~i~~~~i~G~ 378 (404) T protein:vir:10 301 GMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDM--DNRTEIAISWINGL 378 (404) T ss_pred CceeeecccceeeecCCccccccccccccccchhheeecceeEEEEeeccCCCCceeEeecccc--CchhhhhhHHHhhh Confidence 332 00 0 23343322222222233344444443332 23444555555555 Q ss_pred EEec-c------cceEEEEecCCCCC Q lcl|NC_019933. 376 AVYR-P------ESFIKGSLAAAAGT 394 (394) Q Consensus 376 ~v~~-~------~a~~~l~~~~a~~~ 394 (394) +-.+ + +-|-++.+.+++.= T Consensus 379 kK~rF~~~~g~~~DfGvi~idta~~~ 404 (404) T protein:vir:10 379 KKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) T ss_pred hhccccCCCCceeeEEEEEecccccC Confidence 4444 2 35666666666666 No 251 >protein:vir:819 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050552;genbank:gi:9633449;genbank:GeneID:1262254 Probab=44.10 E-value=0.82 Score=21.06 Aligned_cols=299 Identities=11% Similarity=0.073 Sum_probs=117.3 Q ss_pred chhhhhhhhhH-HHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhh--hhhhhHHHhcccccc Q lcl|NC_019933. 74 HISIGQQFVNS-DSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELP--QRRMTIRSLLAQGTM 150 (394) Q Consensus 74 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~--~~~~~l~~~~~~~~~ 150 (394) .-..+...... .+...+............-. +....|......+.... ...++|...-.-..- T Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~ 66 (404) T protein:vir:81 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILT--------------EQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQ 66 (404) T ss_pred CCCcCCcchhhhHHHHHHHHHhcCChhHhhhh--------------hhhhhhhhhccchhhccCCCCCccEEEeecCCCC Confidence 11111100000 00000000000000000000 00000000000000000 011111111111112 Q ss_pred ccCceeEEEEcCcccccceecCCccc--cccccceeeEEeeeeeEEEeehhhHHHHH-HHH-HHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 151 EGNTLEYVRETGFTNAAAPVAEGAQK--PESSLRFDLVQTSAKVIAHWMKASRQILS-DSA-QLQSFINARLLRGLEVVE 226 (394) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~eg~~~--~~~~~~~~~i~~~~~k~~~~~~is~e~l~-~s~-~~~~~i~~~la~a~~~~~ 226 (394) .|..+++...... ...+|.+++.. .+..++|.+-.+.+..+...+.....+-+ -++ +|...-++.|...+...+ T Consensus 67 aGd~vtf~L~~~L--~g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~ 144 (404) T protein:vir:81 67 AGDEVTFSIMHKL--SKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQ 144 (404) T ss_pred CCcEEEEeEeeec--ccCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHH Confidence 2444444433211 12223322222 23456777777777777777766665544 344 899999999999999999 Q ss_pred HHHHh---hccCCC-------------ccccc-----cccc---------cccccccccccccchHHHHHHHHHHhhh-- Q lcl|NC_019933. 227 ENQLL---NGNGTG-------------QNLLG-----LLPQ---------ATAFAAPITVANATAVDRLRLALLQAQL-- 274 (394) Q Consensus 227 d~a~l---~g~g~~-------------~~~~G-----i~~~---------~~~~~~~~~~~~~~~~~~i~~~~~~~~~-- 274 (394) |+.+| .|.... ....+ +... ..+.......++..+++.|-.+...+.. T Consensus 145 d~~~~~~laG~rg~~~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~ 224 (404) T protein:vir:81 145 DQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMA 224 (404) T ss_pred HHHHHHHHhccccccccccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhC Confidence 98887 232210 00110 0000 0001111222333344444333333311 Q ss_pred -----hcC---C------CCeeEeCHHHHHHHHHh----------hc------cCCcccccCcccCCCceeecceEEEcC Q lcl|NC_019933. 275 -----AEF---P------ATGIVLNPADWAGIELL----------KD------TQGRYILGNPQGTLAPTLWGLPVVATQ 324 (394) Q Consensus 275 -----~~~---~------~~~~~~~~~~~~~l~~l----------kd------~~G~~~~~~~~~~~~~~l~G~pv~~~~ 324 (394) ... . .=+++|||.-+..|++= +. ...+|||. +.-+.+.|.+|+.-+ T Consensus 225 ~pi~Pv~~~g~~~~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~----G~~gm~ngvii~~~~ 300 (404) T protein:vir:81 225 HPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFK----GECAMWRNILVRKYA 300 (404) T ss_pred CCCcceEeccccccCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCcee----cCeeEEcCEEEEecC Confidence 111 1 12579999998888752 21 11245543 333466776665443 Q ss_pred CCC------------------------cC----c-eEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEecc Q lcl|NC_019933. 325 AMA------------------------VG----Q-FLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLAL 375 (394) Q Consensus 325 ~~p------------------------~~----~-~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~ 375 (394) .+| ++ . .++|-..-++.++...+....+..+..++ .|.+.+-+...+|+ T Consensus 301 ~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~--g~~~~i~~~~i~G~ 378 (404) T protein:vir:81 301 GMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDM--DNRTEIAISWINGL 378 (404) T ss_pred CceeeecccceeeecCCccccccccccccccchhheeecceeEEEEeeccCCCCceeEeecccc--CchhhhhhHHHhhh Confidence 332 00 0 23343322222222233344444443332 23444555555555 Q ss_pred EEec-c------cceEEEEecCCCCC Q lcl|NC_019933. 376 AVYR-P------ESFIKGSLAAAAGT 394 (394) Q Consensus 376 ~v~~-~------~a~~~l~~~~a~~~ 394 (394) +-.+ + +-|-++.+.+++.= T Consensus 379 kK~rF~~~~g~~~DfGvi~idta~~~ 404 (404) T protein:vir:81 379 KKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) T ss_pred hhccccCCCCceeeEEEEEecccccC Confidence 4444 2 35666666666666 No 252 >protein:vir:3298 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049514;genbank:gi:9632520;genbank:GeneID:1262006 Probab=44.10 E-value=0.82 Score=21.06 Aligned_cols=299 Identities=11% Similarity=0.073 Sum_probs=117.3 Q ss_pred chhhhhhhhhH-HHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhh--hhhhhHHHhcccccc Q lcl|NC_019933. 74 HISIGQQFVNS-DSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELP--QRRMTIRSLLAQGTM 150 (394) Q Consensus 74 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~--~~~~~l~~~~~~~~~ 150 (394) .-..+...... .+...+............-. +....|......+.... ...++|...-.-..- T Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~ 66 (404) T protein:vir:32 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILT--------------EQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQ 66 (404) T ss_pred CCCcCCcchhhhHHHHHHHHHhcCChhHhhhh--------------hhhhhhhhhccchhhccCCCCCccEEEeecCCCC Confidence 11111100000 00000000000000000000 00000000000000000 011111111111112 Q ss_pred ccCceeEEEEcCcccccceecCCccc--cccccceeeEEeeeeeEEEeehhhHHHHH-HHH-HHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 151 EGNTLEYVRETGFTNAAAPVAEGAQK--PESSLRFDLVQTSAKVIAHWMKASRQILS-DSA-QLQSFINARLLRGLEVVE 226 (394) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~eg~~~--~~~~~~~~~i~~~~~k~~~~~~is~e~l~-~s~-~~~~~i~~~la~a~~~~~ 226 (394) .|..+++...... ...+|.+++.. .+..++|.+-.+.+..+...+.....+-+ -++ +|...-++.|...+...+ T Consensus 67 aGd~vtf~L~~~L--~g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~ 144 (404) T protein:vir:32 67 AGDEVTFSIMHKL--SKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQ 144 (404) T ss_pred CCcEEEEeEeeec--ccCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHH Confidence 2444444433211 12223322222 23456777777777777777766665544 344 899999999999999999 Q ss_pred HHHHh---hccCCC-------------ccccc-----cccc---------cccccccccccccchHHHHHHHHHHhhh-- Q lcl|NC_019933. 227 ENQLL---NGNGTG-------------QNLLG-----LLPQ---------ATAFAAPITVANATAVDRLRLALLQAQL-- 274 (394) Q Consensus 227 d~a~l---~g~g~~-------------~~~~G-----i~~~---------~~~~~~~~~~~~~~~~~~i~~~~~~~~~-- 274 (394) |+.+| .|.... ....+ +... ..+.......++..+++.|-.+...+.. T Consensus 145 d~~~~~~laG~rg~~~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~ 224 (404) T protein:vir:32 145 DQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMA 224 (404) T ss_pred HHHHHHHHhccccccccccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhC Confidence 98887 232210 00110 0000 0001111222333344444333333311 Q ss_pred -----hcC---C------CCeeEeCHHHHHHHHHh----------hc------cCCcccccCcccCCCceeecceEEEcC Q lcl|NC_019933. 275 -----AEF---P------ATGIVLNPADWAGIELL----------KD------TQGRYILGNPQGTLAPTLWGLPVVATQ 324 (394) Q Consensus 275 -----~~~---~------~~~~~~~~~~~~~l~~l----------kd------~~G~~~~~~~~~~~~~~l~G~pv~~~~ 324 (394) ... . .=+++|||.-+..|++= +. ...+|||. +.-+.+.|.+|+.-+ T Consensus 225 ~pi~Pv~~~g~~~~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~----G~~gm~ngvii~~~~ 300 (404) T protein:vir:32 225 HPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFK----GECAMWRNILVRKYA 300 (404) T ss_pred CCCcceEeccccccCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCcee----cCeeEEcCEEEEecC Confidence 111 1 12579999998888752 21 11245543 333466776665443 Q ss_pred CCC------------------------cC----c-eEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEecc Q lcl|NC_019933. 325 AMA------------------------VG----Q-FLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLAL 375 (394) Q Consensus 325 ~~p------------------------~~----~-~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~ 375 (394) .+| ++ . .++|-..-++.++...+....+..+..++ .|.+.+-+...+|+ T Consensus 301 ~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~--g~~~~i~~~~i~G~ 378 (404) T protein:vir:32 301 GMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDM--DNRTEIAISWINGL 378 (404) T ss_pred CceeeecccceeeecCCccccccccccccccchhheeecceeEEEEeeccCCCCceeEeecccc--CchhhhhhHHHhhh Confidence 332 00 0 23343322222222233344444443332 23444555555555 Q ss_pred EEec-c------cceEEEEecCCCCC Q lcl|NC_019933. 376 AVYR-P------ESFIKGSLAAAAGT 394 (394) Q Consensus 376 ~v~~-~------~a~~~l~~~~a~~~ 394 (394) +-.+ + +-|-++.+.+++.= T Consensus 379 kK~rF~~~~g~~~DfGvi~idta~~~ 404 (404) T protein:vir:32 379 KKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) T ss_pred hhccccCCCCceeeEEEEEecccccC Confidence 4444 2 35666666666666 No 253 >protein:vir:104439 Length: 404 # NCBI annotation: putative virion structural protein # Family: family:all:974 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794063;genbank:gi:116222008;genbank:GeneID:4397504 Probab=44.10 E-value=0.82 Score=21.06 Aligned_cols=299 Identities=11% Similarity=0.073 Sum_probs=117.3 Q ss_pred chhhhhhhhhH-HHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhhh--hhhhhHHHhcccccc Q lcl|NC_019933. 74 HISIGQQFVNS-DSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILELP--QRRMTIRSLLAQGTM 150 (394) Q Consensus 74 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~--~~~~~l~~~~~~~~~ 150 (394) .-..+...... .+...+............-. +....|......+.... ...++|...-.-..- T Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~ 66 (404) T protein:vir:10 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILT--------------EQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQ 66 (404) T ss_pred CCCcCCcchhhhHHHHHHHHHhcCChhHhhhh--------------hhhhhhhhhccchhhccCCCCCccEEEeecCCCC Confidence 11111100000 00000000000000000000 00000000000000000 011111111111112 Q ss_pred ccCceeEEEEcCcccccceecCCccc--cccccceeeEEeeeeeEEEeehhhHHHHH-HHH-HHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 151 EGNTLEYVRETGFTNAAAPVAEGAQK--PESSLRFDLVQTSAKVIAHWMKASRQILS-DSA-QLQSFINARLLRGLEVVE 226 (394) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~eg~~~--~~~~~~~~~i~~~~~k~~~~~~is~e~l~-~s~-~~~~~i~~~la~a~~~~~ 226 (394) .|..+++...... ...+|.+++.. .+..++|.+-.+.+..+...+.....+-+ -++ +|...-++.|...+...+ T Consensus 67 aGd~vtf~L~~~L--~g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~ 144 (404) T protein:vir:10 67 AGDEVTFSIMHKL--SKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQ 144 (404) T ss_pred CCcEEEEeEeeec--ccCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHH Confidence 2444444433211 12223322222 23456777777777777777766665544 344 899999999999999999 Q ss_pred HHHHh---hccCCC-------------ccccc-----cccc---------cccccccccccccchHHHHHHHHHHhhh-- Q lcl|NC_019933. 227 ENQLL---NGNGTG-------------QNLLG-----LLPQ---------ATAFAAPITVANATAVDRLRLALLQAQL-- 274 (394) Q Consensus 227 d~a~l---~g~g~~-------------~~~~G-----i~~~---------~~~~~~~~~~~~~~~~~~i~~~~~~~~~-- 274 (394) |+.+| .|.... ....+ +... ..+.......++..+++.|-.+...+.. T Consensus 145 d~~~~~~laG~rg~~~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~ 224 (404) T protein:vir:10 145 DQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMA 224 (404) T ss_pred HHHHHHHHhccccccccccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhC Confidence 98887 232210 00110 0000 0001111222333344444333333311 Q ss_pred -----hcC---C------CCeeEeCHHHHHHHHHh----------hc------cCCcccccCcccCCCceeecceEEEcC Q lcl|NC_019933. 275 -----AEF---P------ATGIVLNPADWAGIELL----------KD------TQGRYILGNPQGTLAPTLWGLPVVATQ 324 (394) Q Consensus 275 -----~~~---~------~~~~~~~~~~~~~l~~l----------kd------~~G~~~~~~~~~~~~~~l~G~pv~~~~ 324 (394) ... . .=+++|||.-+..|++= +. ...+|||. +.-+.+.|.+|+.-+ T Consensus 225 ~pi~Pv~~~g~~~~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~----G~~gm~ngvii~~~~ 300 (404) T protein:vir:10 225 HPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFK----GECAMWRNILVRKYA 300 (404) T ss_pred CCCcceEeccccccCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCcee----cCeeEEcCEEEEecC Confidence 111 1 12579999998888752 21 11245543 333466776665443 Q ss_pred CCC------------------------cC----c-eEEeeccceEEEEeecceEEEEecccchhhhcCcEEEEEEEEecc Q lcl|NC_019933. 325 AMA------------------------VG----Q-FLTGAFDAGAQVFDRWAARVEVATENQDDFIKNMVTILAEERLAL 375 (394) Q Consensus 325 ~~p------------------------~~----~-~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~ 375 (394) .+| ++ . .++|-..-++.++...+....+..+..++ .|.+.+-+...+|+ T Consensus 301 ~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~--g~~~~i~~~~i~G~ 378 (404) T protein:vir:10 301 GMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDM--DNRTEIAISWINGL 378 (404) T ss_pred CceeeecccceeeecCCccccccccccccccchhheeecceeEEEEeeccCCCCceeEeecccc--CchhhhhhHHHhhh Confidence 332 00 0 23343322222222233344444443332 23444555555555 Q ss_pred EEec-c------cceEEEEecCCCCC Q lcl|NC_019933. 376 AVYR-P------ESFIKGSLAAAAGT 394 (394) Q Consensus 376 ~v~~-~------~a~~~l~~~~a~~~ 394 (394) +-.+ + +-|-++.+.+++.= T Consensus 379 kK~rF~~~~g~~~DfGvi~idta~~~ 404 (404) T protein:vir:10 379 KKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) T ss_pred hhccccCCCCceeeEEEEEecccccC Confidence 4444 2 35666666666666 No 254 >protein:vir:4902 Length: 348 # NCBI annotation: gp348 # Family: family:all:1083 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056680;genbank:gi:9635015;genbank:GeneID:1262657 Probab=38.06 E-value=1.1 Score=20.39 Aligned_cols=278 Identities=15% Similarity=0.122 Sum_probs=118.3 Q ss_pred cccccCCcCccccchhhhhHHHhhhhhhhhH--HHhccccccccCceeEEEE-cCcccccceecCCccccc-cccceeeE Q lcl|NC_019933. 111 LSTNADGSAGATVQTTRLPGILELPQRRMTI--RSLLAQGTMEGNTLEYVRE-TGFTNAAAPVAEGAQKPE-SSLRFDLV 186 (394) Q Consensus 111 ~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l--~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~eg~~~~~-~~~~~~~i 186 (394) +..- -...-|..+..-|-+......++ -.+++..++.+........ .+....+.++..+.+.+. ....+... T Consensus 1 M~~l----~d~f~~~~l~~~v~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~r~~~~~~ 76 (348) T protein:vir:49 1 MGLI----YDKVTASNIAGYFNALQENVDSTLGESIFPARKQLGTKLSYITGASGQSVALKAAAFDTNVTVRDRVSAEMH 76 (348) T ss_pred Ccch----hhhcCHHHHHHHHHhccccchhhhHhhcCCCccccCceeEEEEeecCceeeeeeecCCCCcceecccceeee Confidence 1100 01112233333222222222222 2234544444444443332 233335667776655443 33456666 Q ss_pred EeeeeeEEEeehhhHHHHH------H--HHH----HHHHHH---HHHHHHHHHHHHHHHh----hcc----CCCccc--- Q lcl|NC_019933. 187 QTSAKVIAHWMKASRQILS------D--SAQ----LQSFIN---ARLLRGLEVVEENQLL----NGN----GTGQNL--- 240 (394) Q Consensus 187 ~~~~~k~~~~~~is~e~l~------~--s~~----~~~~i~---~~la~a~~~~~d~a~l----~g~----g~~~~~--- 240 (394) ++.+-.++-...++..-++ + ++. +...+. ..+.+.+.+.+|..+. +|. +.+... T Consensus 77 ~~~~p~i~~~~~i~~~d~~~l~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~i~~~g~~~~vd 156 (348) T protein:vir:49 77 DEQMPFFKEAMLVKENDRQQLNLVKDSGNAALVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFTSDGVNKDID 156 (348) T ss_pred eeecCccccccccCHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEecCCceEEEe Confidence 7777776666666543211 1 111 222222 2234455555554333 221 111100 Q ss_pred cccccc-cccccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHH---hhc---c-CCcc--cccCcccC Q lcl|NC_019933. 241 LGLLPQ-ATAFAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIEL---LKD---T-QGRY--ILGNPQGT 310 (394) Q Consensus 241 ~Gi~~~-~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~---lkd---~-~G~~--~~~~~~~~ 310 (394) .|+-.. ..+.+...+.++...+.+|.+....+...+..+..++|++.+|..|.. +++ . ++.. +-+..... T Consensus 157 yg~~~~~~~t~~~~W~~~~adp~~di~~~~~~~~~~G~~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~~~i~~~~~~~ 236 (348) T protein:vir:49 157 YGVKPDHKKQVSKSWAEPGATPLADLEDAIETARELGLNPERAVMNAKTFGLIRKAASTVKVIKPLAGDGSSVTKAELDN 236 (348) T ss_pred ecCCcccceeeeeccCCCCCCHHHHHHHHHHHHHhcCCcccEEEeCHHHHHHHhcCHHHHHHhhccCcccccccHHHHHH Confidence 011000 001112344455667788887777777667788899999999999853 222 1 1111 10000111 Q ss_pred CCceeecceEEEcC------------CCCcCceEEeecc-ceEEEEee--c----------ceEEEEecccc--hhhh-c Q lcl|NC_019933. 311 LAPTLWGLPVVATQ------------AMAVGQFLTGAFD-AGAQVFDR--W----------AARVEVATENQ--DDFI-K 362 (394) Q Consensus 311 ~~~~l~G~pv~~~~------------~~p~~~~~~gd~~-~~~~~~~~--~----------~~~i~~~~~~~--~~~~-~ 362 (394) .-..+.|++|++-+ .+|++.++++-.. .+...+.. . ...+....... ..+. . T Consensus 237 ~~~~~~g~~i~~y~~~y~d~dG~~~~~~p~~~v~l~~~~~~G~~~yg~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (348) T protein:vir:49 237 YIADNFGVTVVLENGTYRNEKGEVSKFFPDGHLTLIPNGPLGNTVFGTTPEESDLFADNTVNADVEIVDNGIAVTTTKTT 316 (348) T ss_pred HHHhhcCceEEEEeeEEEecCCcEeeeecCCeEEEecCCCcceeEEecChhhhhhccccccccceeecCCeEEEeeeecC Confidence 11234455555422 3455555554321 11111100 0 00000000000 0011 1 Q ss_pred C--cEEEEEEEEeccEEecccceEEEEecCCC Q lcl|NC_019933. 363 N--MVTILAEERLALAVYRPESFIKGSLAAAA 392 (394) Q Consensus 363 ~--~~~~~~~~~~d~~v~~~~a~~~l~~~~a~ 392 (394) + ...+.+....=-.+.+|+++.++++-++- T Consensus 317 dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl~~~ 348 (348) T protein:vir:49 317 DPVNVQTKVSMVALPSFERLDDVYMLTVIPAV 348 (348) T ss_pred CCceEEEEEeeeccccccCCCcEEEEEEecCC Confidence 1 23444445555566778999999987777 No 255 >protein:vir:104549 Length: 462 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214669;genbank:gi:61806310;genbank:GeneID:3294604 Probab=37.41 E-value=1.1 Score=20.31 Aligned_cols=323 Identities=14% Similarity=0.067 Sum_probs=117.5 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHhhcccccccchhhhhhhhhHHHHHHHHHHhhhhhhhhHHHHHH Q lcl|NC_019933. 30 ELNASVRAKVDELLMAQGALQADLKAA--QQRIAEVEGNGAGGDVQHISIGQQFVNSDSFKAMAESGGQRGRAEINIKAA 107 (394) Q Consensus 30 ~~~~e~~~~~~~~~~~~~~l~~~i~~~--e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 107 (394) =..+++++++..+++.... .+|+.. +.-.+++- .+..+........-.|.-. T Consensus 1 ms~~~l~~~w~~~l~~~~~--~~i~~~~~~~~~~~~~-----------------------enq~~~~~~~~~~l~ea~~- 54 (462) T protein:vir:10 1 MSIQQLQEKWAPVLNHESV--PEIKDSYKKGVVAQLL-----------------------ENQENAIREEGQVLNETLQ- 54 (462) T ss_pred CchHHHHHHhhhhhccccc--chhhhhhHHHHHHHHh-----------------------hhHHHHHHhcccchhcccc- Confidence 0112344444433322110 011100 00000000 0000000000000000000 Q ss_pred HhhcccccCCcCccccchhhhhHHHhh---hhhhhhHHHhccccccccCceeEE----EEcC-------ccccc------ Q lcl|NC_019933. 108 ITSLSTNADGSAGATVQTTRLPGILEL---PQRRMTIRSLLAQGTMEGNTLEYV----RETG-------FTNAA------ 167 (394) Q Consensus 108 ~~~~~~~~~~~~g~~ip~~~~~~ii~~---~~~~~~l~~~~~~~~~~~~~~~~~----~~~~-------~~~~~------ 167 (394) ........+..+. -..+.+.++.+ ..+.....+++.+.||++.+.-+. +-.. ....+ T Consensus 55 ~~g~~~~~~~t~~---~~~~~P~Li~l~Rra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~~~~~~~nq~gtEAlfnEad 131 (462) T protein:vir:10 55 TTGYTTGDTATGP---VAGFDPVLISLIRRSMPQLIAYDVAGVQPMTGPTGLIFAMRSFYGSERRPANSDFREALFNEPN 131 (462) T ss_pred ccCCCcCcccccc---cccccchhhhHHHHHHhhhhhhcceeeecCCcchhhhheeeeeccCCccccccccchhhhccCC Confidence 0000000000110 01123333333 334555677788888776432211 0000 00000 Q ss_pred -ce---------------------------------------------------ecC-------CccccccccceeeEEe Q lcl|NC_019933. 168 -AP---------------------------------------------------VAE-------GAQKPESSLRFDLVQT 188 (394) Q Consensus 168 -~~---------------------------------------------------~~e-------g~~~~~~~~~~~~i~~ 188 (394) .| ..| +...++-..++++++. T Consensus 132 t~fSg~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~g~~~~~~~~~GM~Ta~aE~lg~~s~n~~f~EMaFsIeK~tV 211 (462) T protein:vir:10 132 AGFSGGAGTGLSNYDPTASSSAVNDAEGANPGLLNDSPAGTYEVTGDATGMATATAEALDDSSASTAFREMGFSIEKVTV 211 (462) T ss_pred cCccccccccccccccccccccccccccccceeecCCCccceecccccccccchhccccCCccCCcchhhceeEEEEEEE Confidence 00 000 0123334455566666 Q ss_pred eeeeEEEeehhhHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHhhccCCCc--------ccccccccccccccccc Q lcl|NC_019933. 189 SAKVIAHWMKASRQILSDSA-----QLQSFINARLLRGLEVVEENQLLNGNGTGQ--------NLLGLLPQATAFAAPIT 255 (394) Q Consensus 189 ~~~k~~~~~~is~e~l~~s~-----~~~~~i~~~la~a~~~~~d~a~l~g~g~~~--------~~~Gi~~~~~~~~~~~~ 255 (394) ..+.-+=...+|-|+.+|-- |.++.|.+-|+..|...|++.+|.---+.. ...|++.. ... T Consensus 212 tAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILSTEImlEINReii~~l~~~a~~~k~~~~~~~Gv~dl------~~~ 285 (462) T protein:vir:10 212 TAKSRALKAEYSIEMAQDLKAIHGLDAESELANILSTEILAEINREVVRTIYVNAVKGAIANTATDGIFDL------DVD 285 (462) T ss_pred eeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHhhhhhhheeeecccccccceeee------ccc Confidence 66666667789999999862 578888888888888888888875321111 12233321 111 Q ss_pred ccccchHHHHHHHHHHh---------hhhcCCCCeeEeCHHHHHHHHH---hhcc---CCcccc-c-Cccc-CCCcee-e Q lcl|NC_019933. 256 VANATAVDRLRLALLQA---------QLAEFPATGIVLNPADWAGIEL---LKDT---QGRYIL-G-NPQG-TLAPTL-W 316 (394) Q Consensus 256 ~~~~~~~~~i~~~~~~~---------~~~~~~~~~~~~~~~~~~~l~~---lkd~---~G~~~~-~-~~~~-~~~~~l-~ 316 (394) ..+--.++....++..+ .......+.+++|++....|+. |.-. +++.-. . +.++ ...+.| . T Consensus 286 ~~gr~~~e~~k~l~~qi~~ean~i~~~t~r~~~n~~i~S~~Va~~La~sG~l~~~p~~~~~~~~~~~d~~~~~~~G~l~~ 365 (462) T protein:vir:10 286 SNGRWSVEKFKGLLFQIERDSNAIGQETRRGKGNILICSADVASALGMAGVLDYAPGLQGNSALTGVDDTSSTLVGTLNG 365 (462) T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHhccccceEEEEchhHHHHhhhccchhccccccccccccccccccceeEEEecC Confidence 12222333333333333 2334556789999999888843 1111 111111 0 1111 112344 4 Q ss_pred cceEEEcCCCC----cCceEEeeccceEEEEeecceEEEEecccchhh------hcCcEEEEEEEEeccEEecccceEEE Q lcl|NC_019933. 317 GLPVVATQAMA----VGQFLTGAFDAGAQVFDRWAARVEVATENQDDF------IKNMVTILAEERLALAVYRPESFIKG 386 (394) Q Consensus 317 G~pv~~~~~~p----~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~------~~~~~~~~~~~~~d~~v~~~~a~~~l 386 (394) |++|+.++... .+-+++|- +. -.-. ...+-..++...++ .+-+-.+-...|++..+ +|=+-. + T Consensus 366 r~~vy~D~Y~~~ns~~dy~~vG~-KG-~~~~---~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP~t~~-~ 438 (462) T protein:vir:10 366 RIKVYVDPYSSNVADKHFYVAGY-KG-TSPY---DAGLFYCPYVPLQQVRAINPNTFQPKIGFKTRYGMVS-NPFSGG-L 438 (462) T ss_pred ceEEEEecccCCCcccceEEEEE-eC-Cccc---ccceeeccccccccccccCCccccceeeeeeeeeeee-cCCCCC-c Confidence 68999887643 33344432 21 0000 01111222221110 11111122222222221 110000 0 Q ss_pred Eec---CCCCC Q lcl|NC_019933. 387 SLA---AAAGT 394 (394) Q Consensus 387 ~~~---~a~~~ 394 (394) +.+ -..++ T Consensus 439 ~~~~~~~~~~~ 449 (462) T protein:vir:10 439 TQGSGALTANA 449 (462) T ss_pred CCccccccccC Confidence 000 00011 No 256 >protein:vir:106286 Length: 534 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944113;genbank:gi:38640157;genbank:GeneID:2658034 Probab=36.18 E-value=1.2 Score=20.17 Aligned_cols=355 Identities=15% Similarity=0.049 Sum_probs=131.0 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc---ccccccchhh Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNG---AGGDVQHISI 77 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~---~~~~~~~~~~ 77 (394) |+ .++|.++|.-+.+..- ..|++....+ .+-++| +|++........ ........++ T Consensus 1 ~~-~~~l~~kw~p~l~~~~------------~~~i~~~~~~------~~~a~l--~enq~~~~~~~~~~~~~~~~~~~~~ 59 (534) T protein:vir:10 1 MS-KKSLLKKWQPLVESEG------------MPAIASMKRK------DIVARI--FENQDEDIAHNEGGVYTDQVVVNSM 59 (534) T ss_pred Cc-hhHHHHHhHHhhcCCc------------cccccchhhh------hhhhhh--hhhHHHHHhhhcccccchhhhhhhh Confidence 54 3555555554443200 0000000000 000000 011111110000 0000011111 Q ss_pred hhhhhhHHHHHHHHHHhhhhhhhhHHHHHH----HhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccC Q lcl|NC_019933. 78 GQQFVNSDSFKAMAESGGQRGRAEINIKAA----ITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGN 153 (394) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~ 153 (394) +.. + .........+.+.-.. -.....++++..-.-..|.++ .++++........+++.+.||++. T Consensus 60 ~~~-------~---~~~~~~~l~ea~~~~~~g~~~~~ia~s~~s~~v~~~~P~Li-~lvRra~p~LIa~DIwGVQPMTgP 128 (534) T protein:vir:10 60 VDV-------K---GRIEEARLAEANIGGDHGYDATKIASGETSGSITNVGPAVM-GLVRRAIPQLIAFDICGVQPMTSS 128 (534) T ss_pred hcc-------c---cchhhccccccccccccccccccccccccccccccccchhh-hHHHHHHHhhhhhhhheeccCCch Confidence 100 0 0000000000000000 000011111111111112221 223334445566777888887765 Q ss_pred ceeEEEEc--C--cc--------------ccccee--------------------------------------------- Q lcl|NC_019933. 154 TLEYVRET--G--FT--------------NAAAPV--------------------------------------------- 170 (394) Q Consensus 154 ~~~~~~~~--~--~~--------------~~~~~~--------------------------------------------- 170 (394) +.-+.-.. . .. +.+.|- T Consensus 129 TGLIFAMRsrY~n~~~~~s~~EAf~ne~~adt~fSG~~~a~~~~~~~~~~a~~~g~~~~~~~~~~t~~~~Gt~~~~~~~~ 208 (534) T protein:vir:10 129 TGQVFTLRAIYGGNSQDANAREAFHPTYGPDADFSGRGAAQDIAVFVRGTAVASGAFAKLHIEAATGVQAGTKTVQFIKD 208 (534) T ss_pred hhhheeeeeeecCCCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccc Confidence 32221110 0 00 000000 Q ss_pred -------------------------------------cC---------CccccccccceeeEEeeeeeEEEeehhhHHHH Q lcl|NC_019933. 171 -------------------------------------AE---------GAQKPESSLRFDLVQTSAKVIAHWMKASRQIL 204 (394) Q Consensus 171 -------------------------------------~e---------g~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l 204 (394) +| +...++-..++++++...+.-+=...+|-||. T Consensus 209 ~~v~~~~~~~~~ag~~~~~~~~~~~~y~~~~gm~Ta~AE~lg~~ggs~~~~f~EMsFsIdKvtVtAKSRaLKAEYTiELA 288 (534) T protein:vir:10 209 YAVDALPADQTEAGLAYKWLLANGYAVETSSAMATAFAELQQGFNGSADNEWNEMSFRIDKQVVEAKSRQLKAQYSIEMA 288 (534) T ss_pred cccccccCCccccccccccccccccceecccccchhhHhhhccCCCCcccchhhcceEEEEEEEeeeccceeccccHHHH Confidence 00 01133444566666666666666778999999 Q ss_pred HHHH-----HHHHHHHHHHHHHHHHHHHHHHhhccCCCcc------------ccccccccccccccccccccchHHHHHH Q lcl|NC_019933. 205 SDSA-----QLQSFINARLLRGLEVVEENQLLNGNGTGQN------------LLGLLPQATAFAAPITVANATAVDRLRL 267 (394) Q Consensus 205 ~~s~-----~~~~~i~~~la~a~~~~~d~a~l~g~g~~~~------------~~Gi~~~~~~~~~~~~~~~~~~~~~i~~ 267 (394) +|-- |.++.|.+-|+..|...|++.+|.---.... -.|++........ ..+-...+.+.. T Consensus 289 QDLKAIHGLDAEtELsNILSTEImlEINReii~~l~~~a~~~k~~~~~~~~~~~G~~d~~~~~~~---~~~~~~~e~~~~ 365 (534) T protein:vir:10 289 QDLRAVHGLDADSELSSILANEIMHEINREMVLWINATAKVGKTGWTNMHGGKAGVFDFQDTKDI---RGARWAGESYKA 365 (534) T ss_pred HHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhhhheeecccccccccccceeeeeccccc---cchhHHHHHHHH Confidence 9862 5788888888888888888888753211111 1122221111110 011112233333 Q ss_pred HHHHhh---------hhcCCCCeeEeCHHHHHHHHHh--hcc---CCcccccCcccCC----Cceee-cceEEEcCCCCc Q lcl|NC_019933. 268 ALLQAQ---------LAEFPATGIVLNPADWAGIELL--KDT---QGRYILGNPQGTL----APTLW-GLPVVATQAMAV 328 (394) Q Consensus 268 ~~~~~~---------~~~~~~~~~~~~~~~~~~l~~l--kd~---~G~~~~~~~~~~~----~~~l~-G~pv~~~~~~p~ 328 (394) ++-.+. ......+.+++|++....|+.. -+. .|-..-. ..+.. .++|. |++|+.++..|. T Consensus 366 L~~~i~~~an~i~~~T~rg~~n~~v~S~~Va~~L~~~g~l~~~~~~~~~~~~-~~d~~~~~~~G~l~~~~~vy~D~y~~~ 444 (534) T protein:vir:10 366 LVVQIDKEANEIARQTGRGQGNFIICSRNVAAALGHTDMLMTPAVMGANTTM-NTDTTSSLFAGVLAGKYRVYIDQYAVE 444 (534) T ss_pred HHHHHHHHHHHHHHhhccccccEEEEchhHHHHHhhccchhccccccccccc-cccCCCceEEEEecCceEEEecCCCCc Confidence 332222 2234577899999999888642 111 1111111 11111 23453 689999999998 Q ss_pred CceEEeeccceE---EEEeecceEEEEecccchhhhcCcEEEEEEEEeccEEecc-------cceEEEEec-----CCCC Q lcl|NC_019933. 329 GQFLTGAFDAGA---QVFDRWAARVEVATENQDDFIKNMVTILAEERLALAVYRP-------ESFIKGSLA-----AAAG 393 (394) Q Consensus 329 ~~~~~gd~~~~~---~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~-------~a~~~l~~~-----~a~~ 393 (394) +-+++|---... -+|...= +........+-.+-+-.+-...|++..+ +| .-+.++.-. .-+| T Consensus 445 dy~~vG~KG~~~~~~glfyaPY--v~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP~~~~~~~~~~~~i~~g~~~~~~~ag 521 (534) T protein:vir:10 445 DYFTVGYKGASEMDAGLYYCPY--VALTPLRGTDPKNFQPVLGFKTRYGVKL-HPMADATQNKGFAKISNGMPQHTNMFG 521 (534) T ss_pred ceEEEEEeCCcccccceeeccc--cccccccccCCccccceeeeeeeeceee-cCcccccCCccccccccCCcchhhhcc Confidence 777665321000 0111000 1111011111112222333445555443 33 111122111 1111 Q ss_pred C Q lcl|NC_019933. 394 T 394 (394) Q Consensus 394 ~ 394 (394) . T Consensus 522 ~ 522 (534) T protein:vir:10 522 K 522 (534) T ss_pred c Confidence 1 No 257 >protein:vir:5670 Length: 514 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899609;genbank:gi:34419596;genbank:GeneID:2546039 Probab=35.82 E-value=1.2 Score=20.13 Aligned_cols=341 Identities=16% Similarity=0.093 Sum_probs=123.9 Q ss_pred HHHHHHHHHHHHHHH----HH----HHHHH--HHHHHHHHHHhhcccccccchhhhhhhhhHHHHHHHHHHhhhhhhhhH Q lcl|NC_019933. 33 ASVRAKVDELLMAQG----AL----QADLK--AAQQRIAEVEGNGAGGDVQHISIGQQFVNSDSFKAMAESGGQRGRAEI 102 (394) Q Consensus 33 ~e~~~~~~~~~~~~~----~l----~~~i~--~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 102 (394) -.+.+++.-+++.-. ++ ++.+- -+|++.....+...-.+. .+.+. +..+.......+... T Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~---~~~~~------~~~~l~e~~~~~~~~- 70 (514) T protein:vir:56 1 MNLTEKWKDLLEAEGADMPEIATATKQKIMSKIFENQDRDINNDPMYRDP---QLVEA------FNAGLNEAVVNGDHG- 70 (514) T ss_pred CchhhhhhHHhcccccccccccchhhhhhhhhhhhhHHHHHhcCCcccch---hhhhh------hhccccccccccccc- Confidence 113334433332211 11 11110 011111111111000000 00000 000000000000000 Q ss_pred HHHHHHhhcccccCCcCccccchhhhhHHHhhhhhhhhHHHhccccccccCceeEEEEc----C---cccc--------- Q lcl|NC_019933. 103 NIKAAITSLSTNADGSAGATVQTTRLPGILELPQRRMTIRSLLAQGTMEGNTLEYVRET----G---FTNA--------- 166 (394) Q Consensus 103 ~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~----~---~~~~--------- 166 (394) . .-. ....++++.+-.-+.|.++ .++++........+++.+.||++.+.-+.-.. . .... T Consensus 71 -~-~~~-~ia~s~~t~~v~~~~P~ll-~lvRRa~~~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~tg~EAf~~~nEad 146 (514) T protein:vir:56 71 -Y-DPA-NIAQGVTTGAVTNIGPTVM-GMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQAD 146 (514) T ss_pred -c-ccc-ccccccccccccccchhHH-HHHHHHHHhhhhhhhheeccCCchhhhheeeeeeecCCCcccccccccccccC Confidence 0 000 0001111111111122222 22333445566678888888876532221100 0 0000 Q ss_pred cceec--------------------------------------------------------------------------- Q lcl|NC_019933. 167 AAPVA--------------------------------------------------------------------------- 171 (394) Q Consensus 167 ~~~~~--------------------------------------------------------------------------- 171 (394) ..|-+ T Consensus 147 t~fSG~~~~~~~~~~~~~~~~~~G~~~~~~~t~~~gd~~~~~~~~~~~~~~~~~~~~~~t~~~~~~a~~~~y~~~~Gm~T 226 (514) T protein:vir:56 147 ASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMAT 226 (514) T ss_pred cCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccchhhhhhhhhhh Confidence 00000 Q ss_pred ---C---------CccccccccceeeEEeeeeeEEEeehhhHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHhhc- Q lcl|NC_019933. 172 ---E---------GAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSA-----QLQSFINARLLRGLEVVEENQLLNG- 233 (394) Q Consensus 172 ---e---------g~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~-----~~~~~i~~~la~a~~~~~d~a~l~g- 233 (394) | +...++-..++++++...+.-+=...+|-||.+|-- |.++.|.+-|+..|...|++.+|.- T Consensus 227 a~aEal~~lggs~~~~f~EMaFsIdK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILSTEImlEINReii~~l 306 (514) T protein:vir:56 227 SQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLV 306 (514) T ss_pred hhhhhcccCCCCcccccceeeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHH Confidence 0 111233344555556666665666789999999862 6789999999999999999888521 Q ss_pred --c---CCCcccccccccccccccccccc---ccchHHHHHHHHHHhh---------hhcCCCCeeEeCHHHHHHHHH-- Q lcl|NC_019933. 234 --N---GTGQNLLGLLPQATAFAAPITVA---NATAVDRLRLALLQAQ---------LAEFPATGIVLNPADWAGIEL-- 294 (394) Q Consensus 234 --~---g~~~~~~Gi~~~~~~~~~~~~~~---~~~~~~~i~~~~~~~~---------~~~~~~~~~~~~~~~~~~l~~-- 294 (394) . +......|+-. ++....+...+ +.-..+.+..++-.+. ......+.+++|++....|.. T Consensus 307 ~~~atv~~~~~~~~~~~-~G~~d~~~~~d~~~~~~~~e~~~~l~~~i~~~an~i~~~T~rg~gn~~i~S~~Va~~L~~sg 385 (514) T protein:vir:56 307 NSQAQIGKSGWTQGAGA-AGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTD 385 (514) T ss_pred Hhheeehhccccccccc-ccccccccccccccchHHHHHHHHHHHHHHHHHHHHHhhcccccccEEEEchhHHHHHHhhh Confidence 1 11111122211 12211111111 1112333333332222 233467789999999988874 Q ss_pred hhccC-Ccc-cccC-cccCC----Ccee-ecceEEEcCCCCcCceEEeeccceEE---EEeecceEEEEecccchhhhcC Q lcl|NC_019933. 295 LKDTQ-GRY-ILGN-PQGTL----APTL-WGLPVVATQAMAVGQFLTGAFDAGAQ---VFDRWAARVEVATENQDDFIKN 363 (394) Q Consensus 295 lkd~~-G~~-~~~~-~~~~~----~~~l-~G~pv~~~~~~p~~~~~~gd~~~~~~---~~~~~~~~i~~~~~~~~~~~~~ 363 (394) +.+.. +.. .... ..+.. .+.| .|++|+.++..|.+-+++|---...+ +|...=+.+ ......+-.+- T Consensus 386 ~l~~~~~~g~~~~~~~~d~~~~~~aG~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l--~~~~~~dp~sf 463 (514) T protein:vir:56 386 TLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVPL--TPLRGSDSKNF 463 (514) T ss_pred hhccccccCccccccccccCcceEEEEecCceEEEecCCCCcceEEEEEecCcceecceeecccccc--ccccccCCccc Confidence 11110 100 0000 11111 1344 46899999999987766653210000 010000000 00000011111 Q ss_pred cEEEEEEEEeccEEeccc----c--------------------eEEEEecCC Q lcl|NC_019933. 364 MVTILAEERLALAVYRPE----S--------------------FIKGSLAAA 391 (394) Q Consensus 364 ~~~~~~~~~~d~~v~~~~----a--------------------~~~l~~~~a 391 (394) +-.+-...|++..+ +|= + |.+++++-- T Consensus 464 qP~~g~~tRY~l~~-NPy~~~~~~~~~~~~~~~~~a~~~~n~y~r~v~v~~l 514 (514) T protein:vir:56 464 QPVIGFKTRYGVQV-NPFADPTASATKVGNGAPVAASMGKNAYFRRVFVKGL 514 (514) T ss_pred cceeeeeeeeceee-CCCCCccccccccCCcchhhhcccccceeeeEEEecC Confidence 22233344444332 220 0 111111111 No 258 >protein:vir:102823 Length: 470 # NCBI annotation: major structural protein # Family: family:all:2450 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874086;genbank:gi:118197693;genbank:GeneID:4496015 Probab=35.17 E-value=1.2 Score=20.06 Aligned_cols=294 Identities=11% Similarity=0.002 Sum_probs=126.4 Q ss_pred hhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhcccccCCcCccccchhhhhHHHhh--hhhhhhHHHhccccccccCc Q lcl|NC_019933. 77 IGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNADGSAGATVQTTRLPGILEL--PQRRMTIRSLLAQGTMEGNT 154 (394) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii~~--~~~~~~l~~~~~~~~~~~~~ 154 (394) +.-... +-..+.-.++ ++. ....|+.+=-+.+.+++..+ ..+.-.++.-+...+..+-- T Consensus 1 ~~~~~~--------------~~~~~a~~~a-l~~----a~~~g~AlR~EsLd~~l~~lt~~~~~ftf~~~i~k~~a~STV 61 (470) T protein:vir:10 1 MPYEHL--------------KHLDEATLKA-LNA----AGQVAESLEREDLEPEVTQLNVLDTPLTDLLSKNAVKAKAYE 61 (470) T ss_pred CChhHh--------------hhhhHHHHHH-HHH----hhhcchhhhhhhhccceeEeeecCccchhhhhcCCchhhhHh Confidence 000000 0000011111 111 11223332222233333221 12233445555555655433 Q ss_pred eeEEEEcC--cccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHH---HHH-HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 155 LEYVRETG--FTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQIL---SDS-AQLQSFINARLLRGLEVVEEN 228 (394) Q Consensus 155 ~~~~~~~~--~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l---~~s-~~~~~~i~~~la~a~~~~~d~ 228 (394) .+|-...+ +-....+..|++-.+.+++.+.......|-++....+|.-++ +.. .+++....++-.-.++..++. T Consensus 62 ~ey~~~~~rhG~~g~s~~~E~~l~~~~d~~~~Rr~v~~K~l~~~~~VT~~a~~~~~n~v~d~~~~~~~dai~~ia~tiE~ 141 (470) T protein:vir:10 62 HEYNVVTARHDKIGYAAFREGGLPRTVEVNVVRRRIRPMLVGHRITVTELATRTTQNGVMQIDELVKREKMIAVANEFEY 141 (470) T ss_pred hhhhhhccccccccceeecccccCccCCCceEEEEEEEEEEeecchhhhhhhhhhhccccchHHHHHHHHHHHHHHHHHh Confidence 33322221 112333568999999999999999999999999999998753 222 378888888888899999999 Q ss_pred HHhhccCC----------Ccccccccccccc--cccccc-ccccchHHHHHHHHHHhh--hhcCCCCeeEeCHHHHHHHH Q lcl|NC_019933. 229 QLLNGNGT----------GQNLLGLLPQATA--FAAPIT-VANATAVDRLRLALLQAQ--LAEFPATGIVLNPADWAGIE 293 (394) Q Consensus 229 a~l~g~g~----------~~~~~Gi~~~~~~--~~~~~~-~~~~~~~~~i~~~~~~~~--~~~~~~~~~~~~~~~~~~l~ 293 (394) ++|.|+.. +-++.||.+.-.- ...... .....+.+.|..+...+. ..+..++-++|+..+.+.|. T Consensus 142 a~FyGDs~l~s~~~g~~~gleFDGl~~lId~~~~~NViDarG~~Ls~~~L~~aa~~I~~~~~fGt~TD~~lp~~vka~f~ 221 (470) T protein:vir:10 142 LAFYGDNLLGDDVPGSPNNLQQDGIINIIKRGAPQNVLDAGGRPLSIDLLWEAESRVVSTQAFANPTAVFISYVDKLNLQ 221 (470) T ss_pred hhhhhccccccccCcccCceeccchhhhccCCCCccccccCCCCccHHHHHHHHhhhcccccccChhhhccchhHHHHHH Confidence 99999641 1246777543211 111111 223335555665555553 46777888999999988887 Q ss_pred HhhccCCcccccC-cc------------c-CCCceeecceEE----------EcCC---CCcCceEE------------- Q lcl|NC_019933. 294 LLKDTQGRYILGN-PQ------------G-TLAPTLWGLPVV----------ATQA---MAVGQFLT------------- 333 (394) Q Consensus 294 ~lkd~~G~~~~~~-~~------------~-~~~~~l~G~pv~----------~~~~---~p~~~~~~------------- 333 (394) .-....-+.+.+. +. + .+.-.|.|--+. .... +++..+.. T Consensus 222 ~~~~~~qRv~~~~N~~~~~~G~~v~~f~sa~G~I~L~~s~~m~~~~k~~p~~l~~~v~~~aAP~~~~tv~~t~~~~a~~~ 301 (470) T protein:vir:10 222 ASFYQISRVMTTADRRAGLLGADAQSYIGVRGEHSLYPSQFLGDFHKFNPARFGAEVGDFAAPSNSWTVSTTDNFVTLPY 301 (470) T ss_pred HhhcCceEEEEecCCCceeeeeeccceeeeeeeeeecccccccchhhcCcccCCcccCCcccCceeEEeecCCCceeecc Confidence 5332222222210 00 0 000012222221 1111 11111000 Q ss_pred ----eecc------ceEEEE--eecceEEEEecccchhhhcCcEEEEEEEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 334 ----GAFD------AGAQVF--DRWAARVEVATENQDDFIKNMVTILAEERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 334 ----gd~~------~~~~~~--~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) |+|. ..|.+. +...-+.-++-....--..+.+.+.+... .+++-+.+....+.++. T Consensus 302 ~sk~g~~~~~~v~sy~y~v~~~~gds~s~~v~vt~t~~~v~kgv~ltI~~~-----~~v~yv~IYRk~~~s~~ 369 (470) T protein:vir:10 302 NSGLGDPANTTVYSYAFKAANFYGESAAKYIDVYIDSTEAGKGVRFQFHGL-----VNVKWLDVYRKDPGSQE 369 (470) T ss_pred cCCCCcccCcceeEEEEEEEEecCCCCcceEEEEEeeehhcceeEEEEecC-----CCCcEEEEEeecCCCCc Confidence 0000 001110 00000000000000000011111211100 12333333333333333 No 259 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=28.52 E-value=1.7 Score=19.26 Aligned_cols=349 Identities=10% Similarity=0.063 Sum_probs=92.6 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHh----hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchh Q lcl|NC_019933. 1 MSDINAINSTLANISDSLKAHADRAVK----DQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQHIS 76 (394) Q Consensus 1 Mk~i~el~~~~~~~~~~~k~~~e~~~~----~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~ 76 (394) +++|++.++++.+..+++++.++.... ..+..++++.+++.+.++++.++..++..+................... T Consensus 4 ~~el~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (397) T protein:vir:48 4 SNELHDLWVAQGDKVENLNEKLNVAMLDDSVTAEELQAIKNERDTAKMKRDMFKEQYTEARANEVVNMSEEEKKPLTKSE 83 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhccccccchh Confidence 666555555555544444444433322 2234456677777788888877777665443332222221111111111 Q ss_pred hhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhccccc--CC-cCccccchhhhhHHHhhhhhhhhHHHhcccccc--- Q lcl|NC_019933. 77 IGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNA--DG-SAGATVQTTRLPGILELPQRRMTIRSLLAQGTM--- 150 (394) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~-~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~--- 150 (394) ..........+..+.+....... +.+........+. +. -...++........+..+...-++-......++ T Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 160 (397) T protein:vir:48 84 EEVKAGFVKDFKNLVRGRYQNLL---DSKTDASGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVENVTTLTGSRVYEKW 160 (397) T ss_pred hHHHHHHHHHHHHHHhhhhhHHH---HHhhccCCccccccccHHHHHHHHHHHHHHHHHHhhhceeeccCCcceEEEEee Confidence 11111122222223233322111 1111111111111 11 111111111111122111111112111111111 Q ss_pred ccCceeEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 151 EGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSAQLQSFINARLLRGLEVVEENQL 230 (394) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~~~~~~i~~~la~a~~~~~d~a~ 230 (394) .+... ...+.. ...-..+.........++..-.+...----.--+.+....-...+...+...++.++..++=... T Consensus 161 ~~~~~-~a~~v~---E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~v~~~l~~~~~~~~d~~il~G~ 236 (397) T protein:vir:48 161 ADITG-LAKLDD---EAGSIGTNDDPKLYPIRYAIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAI 236 (397) T ss_pred cCCCc-ceeeec---cccccccccccceeeEEeeheeeeeehhhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 11111 111111 11112223222334444433332222110000122212221223444445555555554432111 Q ss_pred hhcc--CCCcccccccccccc-----ccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhcc----- Q lcl|NC_019933. 231 LNGN--GTGQNLLGLLPQATA-----FAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDT----- 298 (394) Q Consensus 231 l~g~--g~~~~~~Gi~~~~~~-----~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~----- 298 (394) =.+. +......+|...-.. .....-.....++..| ..+..... ..+..+.. ... T Consensus 237 g~~~~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L----~~lkd~~G---~~i~~~~~-------~~~~~~~l 302 (397) T protein:vir:48 237 ATLPTKPTLTKWDDIIDLQAKVDPAIKQTSFFLTNTSGFTAL----KKVKNAFG---DYLMERDV-------KSPTGYSI 302 (397) T ss_pred cccccccccccHHHHHHHHHHhhhhhcCCCEEEECHHHHHHH----HHhhcCCC---ceeeccCc-------CCCCCcee Confidence 1111 111122222211000 0011111222233333 23332222 12222211 111 Q ss_pred CCcccccCc----cc---CCCceeec-------------ceEEEcCCCCcCceEEeeccc---eEEEEeecceEEEEecc Q lcl|NC_019933. 299 QGRYILGNP----QG---TLAPTLWG-------------LPVVATQAMAVGQFLTGAFDA---GAQVFDRWAARVEVATE 355 (394) Q Consensus 299 ~G~~~~~~~----~~---~~~~~l~G-------------~pv~~~~~~p~~~~~~gd~~~---~~~~~~~~~~~i~~~~~ 355 (394) .|.|+.... .. +....++| +.+..++... .+|.. ++....+-+..+. .+ T Consensus 303 ~G~PV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------~~~~~~~~~~r~~~r~d~~~~-~~- 374 (397) T protein:vir:48 303 DGFAVKEVADRWLANASSGAMPLYFGDLKQAVTLFDRQQMSLLSTNIGG------GAFETDTTKIRVIDRFDVVAT-DT- 374 (397) T ss_pred ccceeEEecccccCCcCCCceEEEEEeccceEEEEeecceEEEEeccch------hhhhcCceeEEEEeeeccEEe-cc- Confidence 244332100 00 00011222 2222211100 00100 0111111111110 00 Q ss_pred cchhhhcCcEEEEEEEEeccEEecccceEEEEe Q lcl|NC_019933. 356 NQDDFIKNMVTILAEERLALAVYRPESFIKGSL 388 (394) Q Consensus 356 ~~~~~~~~~~~~~~~~~~d~~v~~~~a~~~l~~ 388 (394) -.+... -+.....-+..+..+-+ T Consensus 375 ---------~a~~~~-~~~~~~~~~~~~~~~~~ 397 (397) T protein:vir:48 375 ---------ESFVPA-SFKAIADQKGNLGSTAV 397 (397) T ss_pred ---------cceEEE-EecccccCCCCccccCC Confidence 000000 00000000000000000 No 260 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=24.63 E-value=2.2 Score=18.76 Aligned_cols=347 Identities=10% Similarity=-0.007 Sum_probs=100.8 Q ss_pred Cc----hHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchh Q lcl|NC_019933. 1 MS----DINAINSTLANISDSLKAHADRAVKDQELNASVRAKVDELLMAQGALQADLKAAQQRIAEVEGNGAGGDVQHIS 76 (394) Q Consensus 1 Mk----~i~el~~~~~~~~~~~k~~~e~~~~~~~~~~e~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~ 76 (394) ++ .+++++++++++.++.+..........+...++.++++.+.++++.++.++.+.+................... T Consensus 4 ~~el~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ee~~~~~~~i~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (397) T protein:vir:49 4 SNELHDLWVAQGDKVENLNEKLNVAMLDDSVSAEELQAIKNERDTAKMKRDMFKEQYTEARANEVANMSEEEKKPLTKSE 83 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccch Confidence 33 35566666666666555544333333344566777788888887777777665443333222221111111111 Q ss_pred hhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHHhhccccc--CC-cCccccchhhhhHHHhhhhhhhhHHHhcccccc--- Q lcl|NC_019933. 77 IGQQFVNSDSFKAMAESGGQRGRAEINIKAAITSLSTNA--DG-SAGATVQTTRLPGILELPQRRMTIRSLLAQGTM--- 150 (394) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~-~~g~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~--- 150 (394) ..........+..+.+....... +.+........+. +. -...++........+..+...-++-......++ T Consensus 84 ~~~~~~~~~~~~~~l~~~~~~~~---~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 160 (397) T protein:vir:49 84 EEVKAGFVKDFKNLVRGRYQNLL---DSKTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVENVTTLTGSRVYEKW 160 (397) T ss_pred hHHHHHHHHHHHHHHhcchhHHH---HHhhccccccCcccccHhHHHHHHHHHHhhhhHHhhhceeecccCccceEEEee Confidence 11111222233333333322111 1111111111111 11 112222222222222222222222211111111 Q ss_pred ccCceeEEEEcCcccccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019933. 151 EGNTLEYVRETGFTNAAAPVAEGAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSAQLQSFINARLLRGLEVVEENQL 230 (394) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~~~~~~i~~~la~a~~~~~d~a~ 230 (394) ........... ...-..+.........++..-.+...-.-..--+.+....-...+...+...++.++..++=... T Consensus 161 ~~~~~~a~~v~----E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ai~~G~ 236 (397) T protein:vir:49 161 TDITGLANIDD----EAGKIADVDDPKLSLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAI 236 (397) T ss_pred ccCCcceeeec----CccccccccccceeeEEeeeeeEEeeehhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 11111111111 11112233333344444444433332111111122222222233455555555555555543221 Q ss_pred hhccC--CCcccccccccccc-----ccccccccccchHHHHHHHHHHhhhhcCCCCeeEeCHHHHHHHHHhhcc----- Q lcl|NC_019933. 231 LNGNG--TGQNLLGLLPQATA-----FAAPITVANATAVDRLRLALLQAQLAEFPATGIVLNPADWAGIELLKDT----- 298 (394) Q Consensus 231 l~g~g--~~~~~~Gi~~~~~~-----~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~lkd~----- 298 (394) =.+.. ......++...-.. ...+.-.....++..|. .+..... ..+..|.. ... T Consensus 237 g~~~~~~~~~~~d~i~~~~~~l~~~~~~~a~~vmn~~~~~~l~----~lkd~~G---~~l~~~~~-------~~~~~~~l 302 (397) T protein:vir:49 237 AALPTKPTLTKWDDIIDLEAKVDPAIKQTSFFLTNTSGFTALK----KVKNALG---DYLMERDV-------KSPTGYSI 302 (397) T ss_pred cccccccccccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHH----HhhcCCC---ceeeccCc-------CCCCCcee Confidence 11111 11122222211100 01111122333343333 3332222 12222211 111 Q ss_pred CCcccccCcccC-CCceeecceEEEcCCCCcCceEEeeccc------------------eEEEEeecceEEEEecccchh Q lcl|NC_019933. 299 QGRYILGNPQGT-LAPTLWGLPVVATQAMAVGQFLTGAFDA------------------GAQVFDRWAARVEVATENQDD 359 (394) Q Consensus 299 ~G~~~~~~~~~~-~~~~l~G~pv~~~~~~p~~~~~~gd~~~------------------~~~~~~~~~~~i~~~~~~~~~ 359 (394) .|.|+....... ..+...-.++++-+ +.. .+.+++... .+....+-+..+. T Consensus 303 ~G~PV~~~~~~~~~~~~~~~~~i~~gd-~~~-~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~-------- 372 (397) T protein:vir:49 303 DGFAVKEVADRWLANGTGGAMPLYFGD-LKQ-AVTLFDRQHMSLLSTNIGGGAFETDTTKVRVIDRFDVVAT-------- 372 (397) T ss_pred cceeeEEecccccccccCCceeEEEee-ccc-eEEEEeecceEEEEeccccchhhcCceeEEEEeeeCcEEe-------- Confidence 233332110000 00000001111111 000 011111110 0111111111110 Q ss_pred hhcCcEEEEEEEEeccEEecccceEEEEecCCCCC Q lcl|NC_019933. 360 FIKNMVTILAEERLALAVYRPESFIKGSLAAAAGT 394 (394) Q Consensus 360 ~~~~~~~~~~~~~~d~~v~~~~a~~~l~~~~a~~~ 394 (394) ..+ +... +..-..+.+.|+ T Consensus 373 -~~~-----a~~~----------~~~~~~~~~~~~ 391 (397) T protein:vir:49 373 -DTE-----AFVP----------ASFKAIADQKGN 391 (397) T ss_pred -ccc-----ceEE----------EEeecccCCCCC Confidence 000 0111 111111122222 No 261 >protein:vir:101039 Length: 529 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932516;genbank:gi:37651642;genbank:GeneID:2610532 Probab=22.21 E-value=2.5 Score=18.42 Aligned_cols=343 Identities=14% Similarity=0.055 Sum_probs=122.1 Q ss_pred hh-hhHHHHHHHHHHHHHHH------HHHHHHH--HHHHHHHHHHhhcccccccchhhhhhhhhHHHHHHHHHHhhhhhh Q lcl|NC_019933. 29 QE-LNASVRAKVDELLMAQG------ALQADLK--AAQQRIAEVEGNGAGGDVQHISIGQQFVNSDSFKAMAESGGQRGR 99 (394) Q Consensus 29 ~~-~~~e~~~~~~~~~~~~~------~l~~~i~--~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 99 (394) +. ..+++.+++..+++... .-++.+- -+|++.........-.+ ..+.+.+.. ........+. T Consensus 1 ~~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~---~~~~e~~~~------~l~~~~~~~~ 71 (529) T protein:vir:10 1 MSLKNKEILNKWTPLLEGEGLPEIAGKNKQALVAQILEAQEKDSKSDPVYRD---DKLIEAFGQ------SLMEAEVAGD 71 (529) T ss_pred CcccHHHHHHHhHHHhcCCccchhccchhhhhhhhhhhhhHHHHhhccccch---hhhhhhhhc------ccchhhcccc Confidence 11 12234444444333210 0000010 01111111111110000 000000000 0000000000 Q ss_pred hhHHHHHHHhhcccccCCcCccccchhhhhHHH---hhhhhhhhHHHhccccccccCceeEEEEcC-------------- Q lcl|NC_019933. 100 AEINIKAAITSLSTNADGSAGATVQTTRLPGIL---ELPQRRMTIRSLLAQGTMEGNTLEYVRETG-------------- 162 (394) Q Consensus 100 ~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~ii---~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~-------------- 162 (394) .. .-...+. .++. ++.+ ..+.+.++ ++........+++.+.||++.+.-+.-..+ T Consensus 72 ~~-~~~~~i~---est~-t~~v---~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~ea 143 (529) T protein:vir:10 72 HG-YDPTNIA---AGQS-SGAI---TNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEA 143 (529) T ss_pred cc-ccccccc---cccc-cccc---cccCchhhhhHHHHHhhhhhheeeeeecCCchhhhhhhhheeecCCccccccccc Confidence 00 0000000 0111 1111 11222222 333445556777788787653211100000 Q ss_pred ------------------------------------------------------------cccc---------------- Q lcl|NC_019933. 163 ------------------------------------------------------------FTNA---------------- 166 (394) Q Consensus 163 ------------------------------------------------------------~~~~---------------- 166 (394) .... T Consensus 144 f~~~y~Pda~~sga~~~ga~~~~~~~~~~~~t~~~~~a~~~g~ea~f~ea~t~fs~~~~g~~~~~g~~~~~~~~~~~~~~ 223 (529) T protein:vir:10 144 FHPMYAPDAWHSSLATKGATTTTDGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGASVTVGTNETGEALDKLINA 223 (529) T ss_pred cccccccccccccccccccccccCccccccccccccccccCcceeeeecccceecccccccccccCccccCccccccccc Confidence 0000 Q ss_pred --------------cceecC---------CccccccccceeeEEeeeeeEEEeehhhHHHHHHHH-----HHHHHHHHHH Q lcl|NC_019933. 167 --------------AAPVAE---------GAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSA-----QLQSFINARL 218 (394) Q Consensus 167 --------------~~~~~e---------g~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~-----~~~~~i~~~l 218 (394) ..-..| +...++-..++++++...+.-+=...+|-|+.+|-- |.++.|.+-| T Consensus 224 ~~a~~~~~~~~~Gm~Ta~aEaL~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNIL 303 (529) T protein:vir:10 224 AIGEGKLAEIAEGMATSIAELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGIL 303 (529) T ss_pred ccccccccccccccchhhhhccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHH Confidence 000000 011333445566666666666667789999999862 5788888888 Q ss_pred HHHHHHHHHHHHhhccCCCc------------ccccccccccccccccc-ccc---cchHHHHHHHHHHhh--hhcCCCC Q lcl|NC_019933. 219 LRGLEVVEENQLLNGNGTGQ------------NLLGLLPQATAFAAPIT-VAN---ATAVDRLRLALLQAQ--LAEFPAT 280 (394) Q Consensus 219 a~a~~~~~d~a~l~g~g~~~------------~~~Gi~~~~~~~~~~~~-~~~---~~~~~~i~~~~~~~~--~~~~~~~ 280 (394) +..|...|++.||.---+.. ...|++.......+... ... ...+-.+......+. ..+.... T Consensus 304 StEImlEINReii~~l~~~a~~~k~~g~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~k~L~~~i~~~an~I~~~T~rg~~n 383 (529) T protein:vir:10 304 ANEVMLEINREVIDWINYTAQVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGN 383 (529) T ss_pred HHHHHHHhhHHHHHhHhhhhhhhhcccccccccccceeecccCccccccchHHHHHHHHHHHHHHHHHHHHHhhccccce Confidence 88888888888775321111 11222221111000000 000 112222222222332 2334566 Q ss_pred eeEeCHHHHHHHHHh--hccCCcccccC--cccCC----Ccee-ecceEEEcCCCCcCceEEeeccceE---EEEeecce Q lcl|NC_019933. 281 GIVLNPADWAGIELL--KDTQGRYILGN--PQGTL----APTL-WGLPVVATQAMAVGQFLTGAFDAGA---QVFDRWAA 348 (394) Q Consensus 281 ~~~~~~~~~~~l~~l--kd~~G~~~~~~--~~~~~----~~~l-~G~pv~~~~~~p~~~~~~gd~~~~~---~~~~~~~~ 348 (394) .+++|++....|... .+..+.+-+.. ..+.. -+.| .|++|+.++..|.+-+++|---.-. -+|...=+ T Consensus 384 ~vi~S~~Va~~L~~~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv 463 (529) T protein:vir:10 384 FIIASRNVVSALALIDTNISPAAQGMASGLNADTTKGVFAGILGGRYKVYIDQYARQDYFTMGYRGANNLDAGIYYCPYV 463 (529) T ss_pred EEEEchHHHHHHHhhhhhccccccccccccccccCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeecccc Confidence 899999999988742 22111000000 01111 2344 3589999999988877766321000 00000000 Q ss_pred EEEEecccchhhhcCcEEEEEEEEeccEEecccc--------------------------eEEEEecCC Q lcl|NC_019933. 349 RVEVATENQDDFIKNMVTILAEERLALAVYRPES--------------------------FIKGSLAAA 391 (394) Q Consensus 349 ~i~~~~~~~~~~~~~~~~~~~~~~~d~~v~~~~a--------------------------~~~l~~~~a 391 (394) .+... ...+-.+-+-.+-...|++..+ +|=+ ++++.+|-- T Consensus 464 ~l~~~--~~~dp~sfqP~~g~~tRY~l~~-NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~~~Vk~l 529 (529) T protein:vir:10 464 ALTPL--RGSDPKNFQPVMGFKTRYAIGV-NPFAESRTQAPQGRITSGMPGVNSVGKNAYFRRVWVKGL 529 (529) T ss_pred ccccc--cccCCCcccceeeeeeeeceee-cCccccccccccccccCCcchhhhcCccceeEEeeeccC Confidence 01000 0001111122233334444332 2210 111111111 No 262 >protein:vir:101811 Length: 529 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238888;genbank:gi:66391963;genbank:GeneID:3416638 Probab=21.07 E-value=2.7 Score=18.25 Aligned_cols=338 Identities=15% Similarity=0.076 Sum_probs=122.5 Q ss_pred hhh-hHHHHHHHHHHHHHHH--HH----HHHHH--HHHHHHHHHHhhcccccccchhhhhhhhhHHHHHHHHHHhhhhhh Q lcl|NC_019933. 29 QEL-NASVRAKVDELLMAQG--AL----QADLK--AAQQRIAEVEGNGAGGDVQHISIGQQFVNSDSFKAMAESGGQRGR 99 (394) Q Consensus 29 ~~~-~~e~~~~~~~~~~~~~--~l----~~~i~--~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 99 (394) +.+ .+++.+++..+++... ++ ++.+- -+|++.........-.+ ..+.+.+. .........+. T Consensus 1 ~~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~---~~~~e~~~------~~l~e~~~~~~ 71 (529) T protein:vir:10 1 MSLKNKEILNKWTPLLEGEGLPEIAGKNKQALVAQILEAQEKDSKSDPVYRD---DKLIEAFG------QSLMEAEVAGD 71 (529) T ss_pred CccchHHHHHHhhHhhcCCccchhccchhhhhhhhhhhhhHHHHhcccccch---hhhhhhhh------ccchhhccccc Confidence 211 2234555554443311 00 11110 01111111111110000 00000000 00000000000 Q ss_pred hhHHHHHHHhhcccccCCcCccccchhhhhHH---HhhhhhhhhHHHhccccccccCceeEEEEcC----c--c------ Q lcl|NC_019933. 100 AEINIKAAITSLSTNADGSAGATVQTTRLPGI---LELPQRRMTIRSLLAQGTMEGNTLEYVRETG----F--T------ 164 (394) Q Consensus 100 ~~~~~~~~~~~~~~~~~~~~g~~ip~~~~~~i---i~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~----~--~------ 164 (394) .. .-...+ ..++. ++.+ ..+.+.+ +++........+++.+.||++.+.-+.-..+ . . T Consensus 72 ~~-~~~~~i---~~st~-t~~v---~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~ea 143 (529) T protein:vir:10 72 HG-YDPTNI---AAGQS-SGAI---TNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEA 143 (529) T ss_pred cc-cccccc---ccccc-cccc---cccCchhhhhHHHHHHhhhhhhhheeccCCchhhhhheeeeeecCCccccccccc Confidence 00 000000 01111 1111 1122223 3334455666788888888764322110000 0 0 Q ss_pred -----------------------------------------------------------cccce---------------- Q lcl|NC_019933. 165 -----------------------------------------------------------NAAAP---------------- 169 (394) Q Consensus 165 -----------------------------------------------------------~~~~~---------------- 169 (394) ..... T Consensus 144 f~~~~~pda~~sga~~~ga~t~~~~t~~~~~ta~~~~a~g~g~ea~f~ea~t~fs~~~~g~~~~~g~~~t~~~~~~~~~~ 223 (529) T protein:vir:10 144 FHPMYAPDAWHSSLATKGATTTTDGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGASVTVGTNETGEALDKLINA 223 (529) T ss_pred ccccccccccccccccccccccccccccccccccccccccccceeeecccCceeeccccccccccCccccCccccccccc Confidence 00000 Q ss_pred ---------e--------cC---------CccccccccceeeEEeeeeeEEEeehhhHHHHHHHH-----HHHHHHHHHH Q lcl|NC_019933. 170 ---------V--------AE---------GAQKPESSLRFDLVQTSAKVIAHWMKASRQILSDSA-----QLQSFINARL 218 (394) Q Consensus 170 ---------~--------~e---------g~~~~~~~~~~~~i~~~~~k~~~~~~is~e~l~~s~-----~~~~~i~~~l 218 (394) . +| +...++-..++++++...+.-+=...+|-||.+|-- |.++.|.+-| T Consensus 224 ~~a~~~~~~~~~GmsTa~aEaL~~~ggss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNIL 303 (529) T protein:vir:10 224 AIGEGKLAEIAEGMATSIAELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGIL 303 (529) T ss_pred ccccccccccccchhhhhhhccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHH Confidence 0 00 011233344556666666666667789999999862 5788888888 Q ss_pred HHHHHHHHHHHHhhccCCCc--------c----cccccccccccccccc-ccc---cchHHHHHHHHHHhh--hhcCCCC Q lcl|NC_019933. 219 LRGLEVVEENQLLNGNGTGQ--------N----LLGLLPQATAFAAPIT-VAN---ATAVDRLRLALLQAQ--LAEFPAT 280 (394) Q Consensus 219 a~a~~~~~d~a~l~g~g~~~--------~----~~Gi~~~~~~~~~~~~-~~~---~~~~~~i~~~~~~~~--~~~~~~~ 280 (394) +..|...|++.||.---+.. . -.|++.......+... ... ...+-.+......+. ..+.... T Consensus 304 StEImlEINReii~~l~~~a~~~~~~~~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~~~L~~~i~~~an~I~~~T~rg~~n 383 (529) T protein:vir:10 304 ANEVMLEINREVIDWINYTAQVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGN 383 (529) T ss_pred HHHHHHHhhHHHHHHHhhhhhhhccccccccccccceeecccCccccccchHHHHHHHHHHHHHHHHHHHHHhhccccce Confidence 88888888887775321111 1 1122221111000000 000 111222222222332 2334566 Q ss_pred eeEeCHHHHHHHHHh--hc-------cCCcccccCccc-CCCcee-ecceEEEcCCCCcCceEEeeccceEEEEeecceE Q lcl|NC_019933. 281 GIVLNPADWAGIELL--KD-------TQGRYILGNPQG-TLAPTL-WGLPVVATQAMAVGQFLTGAFDAGAQVFDRWAAR 349 (394) Q Consensus 281 ~~~~~~~~~~~l~~l--kd-------~~G~~~~~~~~~-~~~~~l-~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~ 349 (394) .+++|++....|... .. .+|. ..+.+. ..-+.| .|++|+.++..|.+-+++|---. .-.+ .. T Consensus 384 ~vi~S~~Va~~L~~~~~~~~~~~~~~~sg~--~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~--~~~~---~g 456 (529) T protein:vir:10 384 FIIASRNVVSALALIDTNISPAAQGMASGL--NADTTKGVFAGILGGRYKVYIDQYARQDYFTMGYRGA--NNLD---AG 456 (529) T ss_pred EEEEchHHHHHHHhhccccccccccccccc--ccccCCceEEEEecCceEEEecCCCCcceEEEEEeCC--cccc---cc Confidence 899999999988742 11 1111 011110 112344 35899999999888777663210 0000 00 Q ss_pred EEEecccch------hhhcCcEEEEEEEEeccEEecccc--------------------------eEEEEecCC Q lcl|NC_019933. 350 VEVATENQD------DFIKNMVTILAEERLALAVYRPES--------------------------FIKGSLAAA 391 (394) Q Consensus 350 i~~~~~~~~------~~~~~~~~~~~~~~~d~~v~~~~a--------------------------~~~l~~~~a 391 (394) +-..++... +-.+-+-.+-...|++..+ +|=+ ++++.+|-- T Consensus 457 lfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~~~Vk~l 529 (529) T protein:vir:10 457 IYYCPYVALTPLRGFDPKNFQPVMGFKTRYAIGV-NPFAESRTQAPQGRITSGMPGVNSVGKNAYFRRVWVKGL 529 (529) T ss_pred eeeccccccccccccCCCcccceeeeeeeeceee-cCccccccccccccccCCcchhhhcCccceeEEeeeccC Confidence 111111110 0011112222333333322 1200 111111111 Done!