Query lcl|NC_020078.1_cdsid_YP_007348355.1 [gene=GAP227_35] [protein=major capsid protein] [protein_id=YP_007348355.1] [location=22834..23853] Match_columns 339 No_of_seqs 119 out of 136 Neff 7.2 Searched_HMMs 1612 Date Thu Nov 7 19:19:31 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_36 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_36_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:80213 Length: 334 100.0 4E-102 2E-105 576.6 30.7 330 1-339 1-332 (334) 2 protein:vir:6324 Length: 335 # 100.0 3E-102 2E-105 577.2 28.9 326 1-339 1-328 (335) 3 protein:vir:78935 Length: 335 100.0 7E-102 5E-105 575.1 29.7 326 1-339 1-328 (335) 4 protein:vir:103323 Length: 364 100.0 5E-101 3E-104 570.4 26.8 327 1-339 1-339 (364) 5 protein:vir:10450 Length: 344 100.0 3E-100 2E-103 566.4 29.9 330 1-339 1-344 (344) 6 protein:vir:94576 Length: 347 100.0 8E-100 5E-103 563.9 28.5 327 5-339 1-347 (347) 7 protein:vir:2201 Length: 345 # 100.0 7.1E-99 4E-102 558.7 31.5 330 1-339 1-345 (345) 8 protein:vir:97031 Length: 402 100.0 4.4E-99 3E-102 559.8 29.7 329 1-339 1-337 (402) 9 protein:vir:100057 Length: 375 100.0 1.4E-99 9E-103 562.5 26.8 333 1-339 1-370 (375) 10 protein:vir:78739 Length: 332 100.0 1.6E-98 1E-101 556.8 29.8 325 1-337 1-332 (332) 11 protein:vir:3364 Length: 347 # 100.0 2.9E-98 2E-101 555.3 30.9 328 5-339 1-345 (347) 12 protein:vir:94711 Length: 347 100.0 1.7E-97 1E-100 551.2 29.1 328 1-339 1-346 (347) 13 protein:vir:8885 Length: 347 # 100.0 3.9E-97 2E-100 549.1 29.4 327 1-339 1-346 (347) 14 protein:vir:1541 Length: 347 # 100.0 2.8E-96 1.8E-99 544.4 30.4 330 5-339 1-345 (347) 15 protein:vir:7019 Length: 401 # 100.0 1.1E-95 7.1E-99 541.1 29.1 327 1-339 1-333 (401) 16 protein:vir:105645 Length: 400 100.0 2.1E-95 1.3E-98 539.7 29.2 327 1-339 1-333 (400) 17 protein:vir:99675 Length: 324 100.0 8.7E-86 5.4E-89 486.9 26.5 284 52-339 1-296 (324) 18 protein:vir:94622 Length: 341 100.0 6E-70 3.7E-73 400.1 26.2 312 1-339 1-339 (341) 19 protein:vir:80180 Length: 381 100.0 1.1E-63 6.7E-67 365.8 22.3 324 1-339 1-381 (381) 20 protein:vir:102605 Length: 273 100.0 1.3E-61 8E-65 354.4 20.3 267 24-339 1-273 (273) 21 protein:vir:105822 Length: 273 100.0 1.3E-61 8E-65 354.4 20.3 267 24-339 1-273 (273) 22 protein:vir:3136 Length: 322 # 100.0 2E-62 1.3E-65 358.8 12.4 301 1-339 1-318 (322) 23 protein:vir:7990 Length: 273 # 100.0 2.8E-59 1.7E-62 341.6 20.1 267 24-339 1-273 (273) 24 protein:vir:102655 Length: 322 100.0 5.4E-55 3.4E-58 318.1 24.8 309 1-339 1-321 (322) 25 protein:vir:1781 Length: 221 # 100.0 2.8E-53 1.7E-56 308.7 17.3 213 96-331 1-221 (221) 26 protein:vir:80930 Length: 278 100.0 3.9E-46 2.4E-49 269.5 19.8 270 18-339 1-277 (278) 27 protein:vir:107120 Length: 329 100.0 3.4E-45 2.1E-48 264.4 21.0 284 4-339 1-306 (329) 28 protein:vir:97331 Length: 319 100.0 8.6E-45 5.3E-48 262.2 21.3 286 1-339 1-294 (319) 29 protein:vir:94800 Length: 319 100.0 8.6E-45 5.3E-48 262.2 21.3 286 1-339 1-294 (319) 30 protein:vir:96123 Length: 274 100.0 2.6E-44 1.6E-47 259.5 19.0 263 18-339 1-270 (274) 31 protein:vir:94494 Length: 274 100.0 4.1E-43 2.6E-46 252.9 18.5 263 18-339 1-270 (274) 32 protein:vir:97433 Length: 274 100.0 4.1E-43 2.6E-46 252.9 18.5 263 18-339 1-270 (274) 33 protein:vir:93742 Length: 274 100.0 3.7E-43 2.3E-46 253.2 17.9 263 18-339 1-270 (274) 34 protein:vir:1239 Length: 274 # 100.0 7.9E-43 4.9E-46 251.4 18.3 263 18-339 1-270 (274) 35 protein:vir:96833 Length: 275 100.0 6.8E-42 4.2E-45 246.3 19.1 263 18-339 1-271 (275) 36 protein:vir:3613 Length: 272 # 100.0 7.1E-42 4.4E-45 246.2 18.1 267 18-339 1-272 (272) 37 protein:vir:95898 Length: 274 100.0 6.2E-42 3.8E-45 246.5 17.0 263 18-339 1-270 (274) 38 protein:vir:96262 Length: 274 100.0 6.2E-42 3.8E-45 246.5 17.0 263 18-339 1-270 (274) 39 protein:vir:99075 Length: 392 100.0 2.4E-41 1.5E-44 243.3 17.8 282 24-339 1-314 (392) 40 protein:vir:108303 Length: 418 100.0 5.9E-41 3.7E-44 241.1 18.4 286 1-339 1-417 (418) 41 protein:vir:79008 Length: 299 100.0 2.3E-38 1.4E-41 227.0 21.1 285 20-339 1-297 (299) 42 protein:vir:3525 Length: 423 # 100.0 1.4E-38 8.6E-42 228.1 19.4 288 21-338 1-423 (423) 43 protein:vir:174 Length: 423 # 100.0 1.1E-38 6.6E-42 228.7 18.7 286 21-338 1-423 (423) 44 protein:vir:105374 Length: 423 100.0 2.3E-38 1.4E-41 227.0 19.2 288 21-338 1-423 (423) 45 protein:vir:9820 Length: 272 # 100.0 8.6E-38 5.3E-41 223.8 16.6 262 18-339 1-269 (272) 46 protein:vir:3033 Length: 272 # 100.0 8.6E-38 5.3E-41 223.8 16.6 262 18-339 1-269 (272) 47 protein:vir:105334 Length: 276 100.0 2.4E-37 1.5E-40 221.3 18.4 263 18-339 1-270 (276) 48 protein:vir:105522 Length: 423 100.0 1.9E-35 1.2E-38 210.9 17.4 290 21-338 1-423 (423) 49 protein:vir:78920 Length: 290 100.0 1E-33 6.4E-37 201.4 20.0 281 24-339 1-289 (290) 50 protein:vir:105464 Length: 346 99.9 1.7E-29 1E-32 178.3 19.6 284 24-339 1-298 (346) 51 protein:vir:95107 Length: 270 99.9 6.2E-30 3.9E-33 180.7 16.3 260 20-339 1-265 (270) 52 protein:vir:739 Length: 231 # 99.9 3.4E-30 2.1E-33 182.1 14.1 231 53-339 1-231 (231) 53 protein:vir:102335 Length: 312 99.9 1.4E-28 8.9E-32 173.2 19.8 292 20-339 1-308 (312) 54 protein:vir:99523 Length: 311 99.9 2.5E-24 1.6E-27 150.0 18.9 297 8-339 1-311 (311) 55 protein:vir:79712 Length: 285 99.9 4.8E-24 3E-27 148.4 16.9 268 26-339 1-283 (285) 56 protein:vir:78090 Length: 302 99.8 8.9E-21 5.5E-24 130.5 18.3 283 20-339 1-300 (302) 57 protein:vir:9265 Length: 430 # 99.5 3.3E-16 2.1E-19 105.4 16.1 286 18-339 1-430 (430) 58 protein:vir:100939 Length: 430 99.5 3.3E-16 2.1E-19 105.4 16.1 286 18-339 1-430 (430) 59 protein:vir:78523 Length: 338 99.5 6E-16 3.7E-19 104.0 17.4 306 1-339 1-335 (338) 60 protein:vir:2106 Length: 430 # 99.5 6.3E-16 3.9E-19 103.9 16.1 291 18-339 1-430 (430) 61 protein:vir:7771 Length: 330 # 99.5 7.1E-15 4.4E-18 98.2 17.7 297 3-339 1-323 (330) 62 protein:vir:41 Length: 299 # N 99.5 7.7E-15 4.8E-18 98.0 17.3 281 5-339 1-298 (299) 63 protein:vir:78223 Length: 333 99.4 9.8E-15 6.1E-18 97.4 17.3 307 1-339 1-332 (333) 64 protein:vir:4339 Length: 395 # 99.4 9.6E-15 5.9E-18 97.4 16.8 292 1-339 98-395 (395) 65 protein:vir:191 Length: 385 # 99.4 9.3E-15 5.8E-18 97.5 14.8 289 1-339 86-384 (385) 66 protein:vir:1886 Length: 385 # 99.4 9.3E-15 5.8E-18 97.5 14.8 289 1-339 86-384 (385) 67 protein:vir:9410 Length: 415 # 99.4 1.5E-14 9.2E-18 96.4 14.7 296 1-339 106-404 (415) 68 protein:vir:4600 Length: 415 # 99.4 2.7E-14 1.7E-17 95.0 14.8 296 1-339 99-404 (415) 69 protein:vir:4700 Length: 415 # 99.4 2.7E-14 1.7E-17 95.0 14.8 296 1-339 99-404 (415) 70 protein:vir:80684 Length: 315 99.4 3.8E-14 2.3E-17 94.2 15.6 285 1-339 1-306 (315) 71 protein:vir:94142 Length: 304 99.4 8.4E-14 5.2E-17 92.3 17.3 283 1-338 1-304 (304) 72 protein:vir:105905 Length: 304 99.4 8.4E-14 5.2E-17 92.3 17.3 283 1-338 1-304 (304) 73 protein:vir:1328 Length: 392 # 99.4 4.3E-14 2.7E-17 93.9 15.6 293 1-339 91-391 (392) 74 protein:vir:81100 Length: 415 99.4 4.2E-14 2.6E-17 93.9 15.4 296 1-339 89-404 (415) 75 protein:vir:79987 Length: 415 99.4 4.2E-14 2.6E-17 93.9 15.4 296 1-339 89-404 (415) 76 protein:vir:98339 Length: 415 99.4 4.2E-14 2.6E-17 93.9 15.4 296 1-339 89-404 (415) 77 protein:vir:100247 Length: 425 99.3 3.6E-14 2.2E-17 94.3 14.3 301 1-339 114-424 (425) 78 protein:vir:95451 Length: 313 99.3 8.3E-15 5.2E-18 97.8 10.7 299 19-339 1-311 (313) 79 protein:vir:97053 Length: 390 99.3 6.1E-14 3.8E-17 93.0 15.4 289 1-337 92-390 (390) 80 protein:vir:104085 Length: 320 99.3 1.4E-13 8.9E-17 91.0 17.2 291 3-339 1-317 (320) 81 protein:vir:8102 Length: 543 # 99.3 1.4E-13 8.6E-17 91.1 17.0 298 1-339 235-542 (543) 82 protein:vir:6242 Length: 390 # 99.3 4.3E-14 2.6E-17 93.9 14.0 287 1-339 76-389 (390) 83 protein:vir:8187 Length: 311 # 99.3 1.9E-13 1.2E-16 90.3 17.4 288 20-339 1-310 (311) 84 protein:vir:9309 Length: 324 # 99.3 1.6E-13 9.7E-17 90.8 16.8 282 1-339 17-315 (324) 85 protein:vir:78830 Length: 324 99.3 9.6E-14 5.9E-17 92.0 14.9 288 1-339 9-315 (324) 86 protein:vir:96392 Length: 324 99.3 9.6E-14 5.9E-17 92.0 14.9 288 1-339 9-315 (324) 87 protein:vir:96223 Length: 324 99.3 1.5E-13 9.2E-17 90.9 15.0 283 1-339 16-315 (324) 88 protein:vir:103955 Length: 324 99.3 2.7E-13 1.7E-16 89.5 15.1 280 1-339 16-315 (324) 89 protein:vir:3870 Length: 400 # 99.3 1.9E-13 1.2E-16 90.3 14.0 276 1-339 120-399 (400) 90 protein:vir:95763 Length: 297 99.3 4.9E-13 3E-16 88.1 16.2 277 4-339 1-296 (297) 91 protein:vir:94771 Length: 298 99.3 6.7E-13 4.1E-16 87.3 16.8 281 18-338 1-298 (298) 92 protein:vir:1638 Length: 298 # 99.3 5.8E-13 3.6E-16 87.7 16.4 282 18-338 1-298 (298) 93 protein:vir:4856 Length: 293 # 99.3 5.7E-13 3.6E-16 87.7 16.3 273 13-339 1-281 (293) 94 protein:vir:97148 Length: 324 99.3 3.2E-13 2E-16 89.1 14.9 286 1-339 9-315 (324) 95 protein:vir:10364 Length: 390 99.3 4.2E-13 2.6E-16 88.5 15.0 289 1-337 92-390 (390) 96 protein:vir:104256 Length: 458 99.2 1.1E-12 6.8E-16 86.2 16.7 300 1-339 142-458 (458) 97 protein:vir:81070 Length: 390 99.2 5.6E-13 3.5E-16 87.7 15.0 289 1-337 95-390 (390) 98 protein:vir:94673 Length: 419 99.2 5E-13 3.1E-16 88.0 14.6 296 1-339 103-417 (419) 99 protein:vir:2344 Length: 397 # 99.2 1.6E-12 9.8E-16 85.3 16.9 285 1-339 1-306 (397) 100 protein:vir:4830 Length: 397 # 99.2 9.4E-13 5.8E-16 86.5 15.6 285 1-339 91-385 (397) 101 protein:vir:100135 Length: 418 99.2 7.7E-13 4.8E-16 87.0 15.1 289 1-339 120-415 (418) 102 protein:vir:485 Length: 407 # 99.2 1.2E-12 7.2E-16 86.0 16.0 300 1-339 83-400 (407) 103 protein:vir:81160 Length: 371 99.2 3.2E-12 2E-15 83.6 18.4 283 1-339 76-371 (371) 104 protein:vir:4511 Length: 409 # 99.2 6.8E-13 4.2E-16 87.3 14.6 294 1-339 93-406 (409) 105 protein:vir:99749 Length: 324 99.2 9E-13 5.6E-16 86.6 15.2 281 1-339 16-315 (324) 106 protein:vir:9759 Length: 303 # 99.2 1.5E-12 9.6E-16 85.3 16.3 286 20-339 1-303 (303) 107 protein:vir:9574 Length: 300 # 99.2 1.7E-12 1E-15 85.1 16.5 284 18-339 1-300 (300) 108 protein:vir:2430 Length: 318 # 99.2 2.3E-12 1.4E-15 84.4 16.7 288 1-339 1-313 (318) 109 protein:vir:4456 Length: 401 # 99.2 2.1E-12 1.3E-15 84.6 15.5 300 1-339 84-401 (401) 110 protein:vir:81227 Length: 413 99.2 3.7E-12 2.3E-15 83.2 16.1 290 1-339 103-410 (413) 111 protein:vir:101607 Length: 379 99.1 3E-12 1.8E-15 83.8 14.8 282 1-339 88-379 (379) 112 protein:vir:5739 Length: 366 # 99.1 1.2E-11 7.4E-15 80.5 17.3 300 1-339 37-364 (366) 113 protein:vir:4953 Length: 397 # 99.1 3.3E-12 2.1E-15 83.5 14.2 284 1-339 91-385 (397) 114 protein:vir:1583 Length: 351 # 99.1 2.3E-12 1.4E-15 84.4 13.2 280 20-339 1-299 (351) 115 protein:vir:5974 Length: 324 # 99.1 5.9E-12 3.6E-15 82.2 15.5 277 20-339 1-303 (324) 116 protein:vir:3991 Length: 404 # 99.1 1.6E-11 9.7E-15 79.8 17.7 286 1-339 99-393 (404) 117 protein:vir:4997 Length: 397 # 99.1 3.1E-12 1.9E-15 83.7 13.7 284 1-339 91-385 (397) 118 protein:vir:99920 Length: 311 99.1 1.9E-11 1.2E-14 79.3 17.1 293 1-339 1-311 (311) 119 protein:vir:105610 Length: 430 99.1 9.6E-11 6E-14 75.5 20.9 323 9-339 1-422 (430) 120 protein:vir:1433 Length: 435 # 99.1 3.7E-11 2.3E-14 77.8 18.4 299 1-339 103-433 (435) 121 protein:vir:102119 Length: 404 99.1 1.6E-11 1E-14 79.7 16.4 296 1-339 85-400 (404) 122 protein:vir:1268 Length: 397 # 99.1 1.7E-11 1.1E-14 79.6 16.3 283 1-339 99-397 (397) 123 protein:vir:100172 Length: 394 99.1 2.6E-11 1.6E-14 78.6 16.9 282 1-339 97-384 (394) 124 protein:vir:8420 Length: 477 # 99.1 5E-11 3.1E-14 77.0 18.2 305 1-339 138-471 (477) 125 protein:vir:95376 Length: 425 99.1 6E-12 3.7E-15 82.1 13.1 289 1-339 119-421 (425) 126 protein:vir:4226 Length: 326 # 99.1 3.7E-11 2.3E-14 77.8 17.4 299 1-339 1-323 (326) 127 protein:vir:80376 Length: 435 99.1 4.2E-11 2.6E-14 77.5 17.6 295 1-339 103-431 (435) 128 protein:vir:105038 Length: 428 99.0 8.5E-11 5.3E-14 75.8 18.4 297 1-339 98-426 (428) 129 protein:vir:93696 Length: 364 99.0 2.7E-10 1.7E-13 73.1 21.0 304 11-339 1-359 (364) 130 protein:vir:4092 Length: 390 # 99.0 2.9E-11 1.8E-14 78.4 15.7 293 1-339 63-368 (390) 131 protein:vir:1025 Length: 408 # 99.0 3E-11 1.9E-14 78.3 15.4 284 1-339 101-393 (408) 132 protein:vir:100884 Length: 389 99.0 3.2E-11 2E-14 78.2 14.6 283 1-339 88-382 (389) 133 protein:vir:9704 Length: 394 # 99.0 3.1E-11 1.9E-14 78.2 14.5 273 1-339 113-390 (394) 134 protein:vir:7409 Length: 408 # 99.0 7.2E-11 4.5E-14 76.2 16.0 286 1-339 94-393 (408) 135 protein:vir:101650 Length: 497 99.0 1.2E-10 7.5E-14 75.0 16.3 304 1-339 131-493 (497) 136 protein:vir:7855 Length: 497 # 99.0 1.2E-10 7.5E-14 75.0 16.3 304 1-339 131-493 (497) 137 protein:vir:108211 Length: 318 98.9 1E-10 6.5E-14 75.3 15.8 286 1-339 8-317 (318) 138 protein:vir:3845 Length: 395 # 98.9 2.8E-10 1.8E-13 72.9 18.0 281 1-339 94-383 (395) 139 protein:vir:962 Length: 397 # 98.9 1.6E-10 9.6E-14 74.4 16.6 275 1-339 121-397 (397) 140 protein:vir:102873 Length: 392 98.9 1E-10 6.3E-14 75.4 15.5 287 1-339 81-384 (392) 141 protein:vir:107593 Length: 392 98.9 1E-10 6.3E-14 75.4 15.5 287 1-339 81-384 (392) 142 protein:vir:105004 Length: 392 98.9 1E-10 6.3E-14 75.4 15.5 287 1-339 81-384 (392) 143 protein:vir:102082 Length: 392 98.9 1E-10 6.3E-14 75.4 15.5 287 1-339 81-384 (392) 144 protein:vir:96762 Length: 632 98.9 3.5E-11 2.2E-14 77.9 12.7 285 1-338 332-632 (632) 145 protein:vir:6212 Length: 434 # 98.9 9.3E-11 5.8E-14 75.6 14.8 292 1-339 124-431 (434) 146 protein:vir:2504 Length: 305 # 98.9 2.9E-10 1.8E-13 72.9 16.8 278 13-339 1-298 (305) 147 protein:vir:1084 Length: 437 # 98.9 2E-10 1.2E-13 73.8 15.4 284 1-339 141-427 (437) 148 protein:vir:10123 Length: 404 98.9 1.2E-09 7.5E-13 69.5 19.5 320 1-339 1-401 (404) 149 protein:vir:104439 Length: 404 98.9 1.2E-09 7.5E-13 69.5 19.5 320 1-339 1-401 (404) 150 protein:vir:819 Length: 404 # 98.9 1.2E-09 7.5E-13 69.5 19.5 320 1-339 1-401 (404) 151 protein:vir:3298 Length: 404 # 98.9 1.2E-09 7.5E-13 69.5 19.5 320 1-339 1-401 (404) 152 protein:vir:102944 Length: 330 98.9 1.2E-10 7.3E-14 75.0 13.0 281 18-339 1-309 (330) 153 protein:vir:9361 Length: 402 # 98.8 3.4E-11 2.1E-14 77.9 8.4 277 1-339 117-396 (402) 154 protein:vir:2770 Length: 318 # 98.8 1.6E-09 9.7E-13 68.9 17.5 251 1-283 1-318 (318) 155 protein:vir:93881 Length: 387 98.8 7.6E-11 4.7E-14 76.1 10.3 277 1-339 102-381 (387) 156 protein:vir:1383 Length: 421 # 98.8 3.7E-10 2.3E-13 72.3 13.0 278 1-339 101-383 (421) 157 protein:vir:9875 Length: 296 # 98.7 8.5E-09 5.3E-12 64.8 19.5 272 2-339 1-295 (296) 158 protein:vir:78640 Length: 352 98.7 1.4E-10 8.5E-14 74.7 9.1 277 1-339 61-346 (352) 159 protein:vir:94424 Length: 387 98.7 6.8E-11 4.2E-14 76.3 7.3 277 1-339 102-381 (387) 160 protein:vir:96978 Length: 387 98.7 6.8E-11 4.2E-14 76.3 7.3 277 1-339 102-381 (387) 161 protein:vir:2685 Length: 387 # 98.7 6.8E-11 4.2E-14 76.3 7.3 277 1-339 102-381 (387) 162 protein:vir:9927 Length: 295 # 98.7 1.9E-09 1.2E-12 68.4 15.2 267 1-339 1-288 (295) 163 protein:vir:93616 Length: 645 98.7 6.4E-09 4E-12 65.5 17.5 294 1-339 300-637 (645) 164 protein:vir:101291 Length: 381 98.5 4.8E-09 3E-12 66.2 12.3 288 1-339 56-368 (381) 165 protein:vir:9509 Length: 381 # 98.5 4.8E-09 3E-12 66.2 12.3 288 1-339 56-368 (381) 166 protein:vir:9643 Length: 377 # 98.5 2.1E-08 1.3E-11 62.6 15.2 295 1-339 58-377 (377) 167 protein:vir:4159 Length: 315 # 98.5 1.8E-08 1.1E-11 63.0 14.2 304 1-338 1-315 (315) 168 protein:vir:100632 Length: 381 98.4 1.7E-08 1.1E-11 63.2 12.4 286 1-339 56-368 (381) 169 protein:vir:78350 Length: 383 98.4 1.8E-08 1.1E-11 63.1 12.2 288 1-339 59-375 (383) 170 protein:vir:4197 Length: 314 # 98.3 1.2E-07 7.7E-11 58.4 15.6 299 1-339 1-311 (314) 171 protein:vir:80128 Length: 466 98.3 4.2E-08 2.6E-11 61.0 12.6 291 1-339 123-448 (466) 172 protein:vir:3158 Length: 321 # 98.3 1.6E-07 1E-10 57.8 15.1 296 1-339 1-312 (321) 173 protein:vir:95963 Length: 395 98.3 1.2E-07 7.7E-11 58.4 14.4 287 1-339 64-376 (395) 174 protein:vir:95875 Length: 401 98.2 9.8E-07 6.1E-10 53.5 19.1 316 1-339 1-400 (401) 175 protein:vir:98635 Length: 377 98.2 1.1E-07 6.9E-11 58.7 13.6 287 1-339 58-377 (377) 176 protein:vir:106647 Length: 303 98.2 4.8E-07 3E-10 55.2 16.5 271 1-339 1-296 (303) 177 protein:vir:80446 Length: 367 97.7 7.8E-06 4.8E-09 48.6 15.2 289 8-339 1-348 (367) 178 protein:vir:79928 Length: 393 97.7 2.5E-06 1.5E-09 51.3 12.5 304 1-339 56-377 (393) 179 protein:vir:80068 Length: 301 97.2 0.00014 8.6E-08 41.7 18.0 284 17-339 1-301 (301) 180 protein:vir:78387 Length: 349 96.4 0.00064 4E-07 38.1 16.3 285 1-339 1-319 (349) 181 protein:vir:103285 Length: 296 96.3 0.00059 3.6E-07 38.3 13.4 278 1-339 1-296 (296) 182 protein:vir:107687 Length: 319 95.2 0.0015 9.1E-07 36.1 11.4 295 1-339 1-319 (319) 183 protein:vir:98871 Length: 314 95.2 0.0026 1.6E-06 34.7 15.9 285 4-339 1-311 (314) 184 protein:vir:94989 Length: 349 95.0 0.003 1.8E-06 34.4 16.3 285 1-339 1-328 (349) 185 protein:vir:3969 Length: 287 # 94.3 0.0047 2.9E-06 33.4 15.8 263 23-339 1-286 (287) 186 protein:vir:94528 Length: 286 94.3 0.0047 2.9E-06 33.3 16.3 266 8-339 1-286 (286) 187 protein:vir:95512 Length: 693 94.3 0.0048 3E-06 33.3 12.3 293 1-339 366-693 (693) 188 protein:vir:97255 Length: 310 93.9 0.0061 3.8E-06 32.7 17.2 289 8-339 1-310 (310) 189 protein:vir:94933 Length: 330 92.6 0.011 6.7E-06 31.3 15.7 300 1-339 13-329 (330) 190 protein:vir:8324 Length: 410 # 91.9 0.014 8.6E-06 30.8 12.4 272 1-337 108-410 (410) 191 protein:vir:95131 Length: 325 91.3 0.017 1E-05 30.3 15.4 282 1-339 1-298 (325) 192 protein:vir:79548 Length: 652 88.3 0.033 2E-05 28.7 11.5 292 1-338 331-652 (652) 193 protein:vir:4074 Length: 480 # 88.1 0.035 2.1E-05 28.6 17.3 276 1-339 168-477 (480) 194 protein:vir:97397 Length: 517 88.1 0.035 2.2E-05 28.6 16.5 277 1-339 222-514 (517) 195 protein:vir:103886 Length: 302 85.9 0.049 3.1E-05 27.8 14.6 271 1-339 1-302 (302) 196 protein:vir:10324 Length: 320 83.0 0.046 2.8E-05 27.9 7.9 292 11-339 1-317 (320) 197 protein:vir:104342 Length: 314 79.4 0.11 6.5E-05 25.9 13.8 293 1-339 1-314 (314) 198 protein:vir:78148 Length: 123 74.7 0.03 1.8E-05 29.0 4.2 110 208-339 1-123 (123) 199 protein:vir:4786 Length: 295 # 72.7 0.18 0.00011 24.7 16.0 265 8-327 1-295 (295) 200 protein:vir:99888 Length: 309 65.7 0.22 0.00014 24.2 6.9 286 3-339 1-297 (309) 201 protein:vir:99424 Length: 360 61.5 0.35 0.00022 23.1 13.3 299 1-339 15-360 (360) 202 protein:vir:1991 Length: 305 # 52.1 0.56 0.00035 22.0 12.9 212 1-255 1-305 (305) 203 protein:vir:107732 Length: 379 51.2 0.59 0.00037 21.8 13.3 296 1-339 56-379 (379) 204 protein:vir:79078 Length: 307 49.8 0.63 0.00039 21.7 9.9 285 1-339 1-305 (307) 205 protein:vir:79642 Length: 329 45.4 0.77 0.00048 21.2 18.8 297 1-337 8-329 (329) 206 protein:vir:103181 Length: 457 32.8 1.4 0.00087 19.8 11.5 309 1-339 76-439 (457) 207 protein:vir:96079 Length: 382 26.0 2 0.0012 18.9 15.6 301 1-338 36-382 (382) 208 protein:vir:5942 Length: 523 # 20.4 2.8 0.0017 18.1 12.4 310 1-339 162-521 (523) No 1 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=100.00 E-value=3.7e-102 Score=576.64 Aligned_cols=330 Identities=21% Similarity=0.277 Sum_probs=299.9 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccccceeeeccCCCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISKAKLQKIAPGTT 80 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~~t~~~~~~g~~ 80 (339) ||--.+ .+++||+|++.+.+ ++||||+|+|||+++|++++||++++++|+|++|||+||+++|++++++|+||++ T Consensus 1 m~~~~~----~~~t~~~~~~~~~~-~~l~le~~~geV~~af~~~s~~~~~~~~r~i~~G~s~~~~~iG~~~~~~~~~g~~ 75 (334) T protein:vir:80 1 MTYPAA----NTHTRPGWGGANSD-VSLHIEEHLGLVDASFMYSSKFASWMNVRSLRGTNQLRVDRVGASTIAGRKAGEE 75 (334) T ss_pred CCCCcC----CCccccccccccch-heehhhhhhhHHHHHHHHhhhhhccceeeeccccceEEEeeecceeeeeecCCCC Confidence 663222 57899999966554 7799999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccc Q lcl|NC_020078. 81 PPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMP 160 (339) Q Consensus 81 i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~ 160 (339) |++++ +.+++++|+||+++|++++|||||+||+++|+|+|+++|+||+||+++||+|+++++|||+...+.+..++..+ T Consensus 76 l~~~~-~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~~~~~~ 154 (334) T protein:vir:80 76 LVVQK-NVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLKPAFHD 154 (334) T ss_pred CCCCC-cccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccC Confidence 99875 67899999999999999999999999999999999999999999999999999999999999999888888878 Q ss_pred CccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCC-cCCeEEEECHHHHHHHhcccchhhhccccccc-ceeec Q lcl|NC_020078. 161 GHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPN-EEDMILVLPPAAFTALMQAEHITNGEYVTSAG-ETLNT 238 (339) Q Consensus 161 g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p-~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~-~~l~~ 238 (339) |+.....++......++++++|+++|+++++.|+|+|||.+ ..+||+||+|++|++||+++||+|+||+++++ ..+.+ T Consensus 155 G~~~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d~~~s~~~~~~~~ 234 (334) T protein:vir:80 155 GILLPSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNVEFGAKEGGNSFVG 234 (334) T ss_pred CcceeecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccccceeccccccccccc Confidence 76666656555666778999999999999999999999422 47899999999999999999999999987653 44789 Q ss_pred ceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhHHHHHHH Q lcl|NC_020078. 239 KYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWFIDSW 318 (339) Q Consensus 239 G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~i~g~ 318 (339) |.|++++||+|++|||+|.+ +++.|+++. +.+.|+++|+.++++|+|++||++++++++++|+||++++|+|+|+|+ T Consensus 235 g~i~~v~G~~V~~Sn~~P~~-~~t~~~~g~--~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~~d~i~~~ 311 (334) T protein:vir:80 235 GRIAMLNGVRVVETPRFPQS-AITANALGA--DFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFWEEKKDFGHYLDTF 311 (334) T ss_pred eeEEEEeceEEEeecCCCCc-ccccccccc--ccccccccccceEEEEEeCceEEEEEEeecceeeeechhhHHHHHHHH Confidence 99999999999999999954 677777654 456789999999999999999999999999999999999999999999 Q ss_pred HHhCCccccccceEEEEecCC Q lcl|NC_020078. 319 LAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 319 ~~~Ga~v~rPe~~v~i~~~~a 339 (339) ++|||+++||||+++++++.- T Consensus 312 ~a~G~g~lRPeaa~vv~~~~~ 332 (334) T protein:vir:80 312 QSYNIGQRRPDAVAVHDITVT 332 (334) T ss_pred HHcCCceeccceEEEEEEeee Confidence 999999999999999999999 No 2 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=100.00 E-value=3e-102 Score=577.20 Aligned_cols=326 Identities=24% Similarity=0.337 Sum_probs=297.3 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccccceeeeccCCCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISKAKLQKIAPGTT 80 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~~t~~~~~~g~~ 80 (339) || +| -|+|||||++.++|. +||||+|+|||+++|+|+++|++++++|+|++|||+|||++|+.++++|+||++ T Consensus 1 ms-----~~-~~~tr~~~~~s~~d~-al~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~iG~~~~~~~~pG~~ 73 (335) T protein:vir:63 1 MS-----FL-NDLTRPNYAGKNADV-DIHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRLGNVEAKGRRAGEE 73 (335) T ss_pred CC-----Cc-ccchhhhcccccchh-heehhhhhhhHHHHHHhhhhhccccceeeeccceeEEEeeeeeeeeecccCCcC Confidence 65 33 589999998777775 799999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccc Q lcl|NC_020078. 81 PPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMP 160 (339) Q Consensus 81 i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~ 160 (339) |+++. +.++|++|+||+++|++++|||||+||++||+|+|+++|+||+||+++||+|++++++||+...+....++..+ T Consensus 74 l~~~~-~~~~k~~itVD~ll~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~~~~ 152 (335) T protein:vir:63 74 LERSR-VVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSP 152 (335) T ss_pred cCCCC-ccccceEEEecceeechhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCCCcCC Confidence 99875 56789999999999999999999999999999999999999999999999999999999999999888888777 Q ss_pred CccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCC-cCCeEEEECHHHHHHHhcccchhhhcccccccc-eeec Q lcl|NC_020078. 161 GHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPN-EEDMILVLPPAAFTALMQAEHITNGEYVTSAGE-TLNT 238 (339) Q Consensus 161 g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p-~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~-~l~~ 238 (339) |+.... .+++.+...++++|+++|+++.++|+|+|||.+ .++||++|+|++|++||++++|+|++|+.+++. .+.+ T Consensus 153 G~~~~~--~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~ 230 (335) T protein:vir:63 153 GVLEKL--DLTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQATGATNDYVK 230 (335) T ss_pred Ccceee--eeccCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhccccccccccccccccccccC Confidence 754433 344556667899999999999999999999432 367999999999999999999999999877653 4789 Q ss_pred ceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhHHHHHHH Q lcl|NC_020078. 239 KYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWFIDSW 318 (339) Q Consensus 239 G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~i~g~ 318 (339) |.|++++||+|++|||+|. .++++|+++.++|+++ +++++++|++||++|+++++++++++|.||++++|+|+|+++ T Consensus 231 g~v~~v~Gv~V~~sn~lP~-~~~t~~~lg~a~n~~~--~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~~~~~~i~~~ 307 (335) T protein:vir:63 231 SRVAILNGVKVLETPRFAT-KAIAAHPLGRHFNVSA--EESERQIALFLPSKTLITAQVAPVQAKLWEDNEKFSWVLDTF 307 (335) T ss_pred ceeEEeeceEEEeeccCCC-CCcccccccccCCccc--cccceeEEEEEecceEEEEEEeecccceeeccchhhHHhHHH Confidence 9999999999999999994 5789999998887654 577899999999999999999999999999999999999999 Q ss_pred HHhCCccccccceEEEEecCC Q lcl|NC_020078. 319 LAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 319 ~~~Ga~v~rPe~~v~i~~~~a 339 (339) ++|||+++||||+++|++++. T Consensus 308 ~a~G~g~lRPe~a~~i~~tg~ 328 (335) T protein:vir:63 308 QMYNIGARRPDTAGAIELKGI 328 (335) T ss_pred HHcCCcccccceEEEEEEcCC Confidence 999999999999999999998 No 3 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=100.00 E-value=7.3e-102 Score=575.05 Aligned_cols=326 Identities=24% Similarity=0.326 Sum_probs=298.7 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccccceeeeccCCCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISKAKLQKIAPGTT 80 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~~t~~~~~~g~~ 80 (339) || +| -|+|||||++.++|. +||||+|+|||+++|++++||++++++|+|++|||+|||++|+.+++||+||++ T Consensus 1 ms-----~~-~~~t~~~~~~s~~d~-al~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~iG~~~~~~~~pG~~ 73 (335) T protein:vir:78 1 MS-----FL-NDLTRPNYAGKNADV-DIHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRLGNVEAKGRRAGEE 73 (335) T ss_pred CC-----cc-ccccccccccccchh-hhhhhhhhhHHHHHHHHhhhhccccceeeeccceeEEEeeeeeeeecccccCcc Confidence 65 34 479999998766665 899999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccc Q lcl|NC_020078. 81 PPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMP 160 (339) Q Consensus 81 i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~ 160 (339) |+++. +.++|++|+||+++|++++|||||+||+|||+|+|+++|+|++||+++||++++++++||+...++...++..+ T Consensus 74 l~~~~-~~~~k~~itID~ll~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~~~~ 152 (335) T protein:vir:78 74 LERSR-VVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSP 152 (335) T ss_pred cCCCC-cccCCeEEEecceeechhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCcCC Confidence 99875 67799999999999999999999999999999999999999999999999999999999999999988888877 Q ss_pred CccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCC-cCCeEEEECHHHHHHHhcccchhhhcccccccc-eeec Q lcl|NC_020078. 161 GHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPN-EEDMILVLPPAAFTALMQAEHITNGEYVTSAGE-TLNT 238 (339) Q Consensus 161 g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p-~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~-~l~~ 238 (339) |+.....+ .+.+...++++|+++++++.++|+|+|||.+ .++||++|+|++|++||++++|+|++|+.+++. .+.+ T Consensus 153 G~~~~~~~--tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~ 230 (335) T protein:vir:78 153 GVLEKLDL--TGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMSVEYQATGATNDYVK 230 (335) T ss_pred Ccceeeee--ccccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhccccccccccccccccccccc Confidence 76554443 3455567899999999999999999999532 358999999999999999999999999877653 4789 Q ss_pred ceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhHHHHHHH Q lcl|NC_020078. 239 KYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWFIDSW 318 (339) Q Consensus 239 G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~i~g~ 318 (339) |.|++++||+|++|||+|. .++++|+++.+||+++| +++.++|++||++|+++++++++++|+||++++|+|+|+++ T Consensus 231 g~v~~v~Gv~V~~Sn~lP~-~~~t~~~lg~a~n~~~~--d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~~~~i~~~ 307 (335) T protein:vir:78 231 SRVAILNGVKVLETPRFAT-KAISAHPLGRHFNVSAE--EAERQIALFLPSKTLITAQVAPVQAKLWEDHDQFSWVLDTF 307 (335) T ss_pred ceeEEeeceEEEeeccCCC-CCCccccccccCCcccc--cccceEEEEEecceEEEEEEEecccceeeccchhhHhhhHH Confidence 9999999999999999995 57899999999988776 67788999999999999999999999999999999999999 Q ss_pred HHhCCccccccceEEEEecCC Q lcl|NC_020078. 319 LAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 319 ~~~Ga~v~rPe~~v~i~~~~a 339 (339) ++|||+++||||+|+|++++. T Consensus 308 ~a~G~g~lRPe~a~~i~~tg~ 328 (335) T protein:vir:78 308 QMYNIGARRPDTAGAIELKGI 328 (335) T ss_pred HHcCCcccCcceEEEEEecCC Confidence 999999999999999999999 No 4 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=100.00 E-value=5.1e-101 Score=570.42 Aligned_cols=327 Identities=26% Similarity=0.409 Sum_probs=289.5 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccccceeeeccCCCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISKAKLQKIAPGTT 80 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~~t~~~~~~g~~ 80 (339) ||. | -+++||+|+ +++++++||||+|+|||+++|+++++|+++|++|+|++|||+|||++|+.+++||+||++ T Consensus 1 ms~-----~-n~~t~~~~~-~~~~~~al~le~f~geV~taf~~~s~~~~~~~~rti~~gkS~q~~~iG~~~~~~~~~G~~ 73 (364) T protein:vir:10 1 MSN-----P-NVLTQPAVS-ASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDVQEVVGTNSVSNKYIGETELQVLSPGKS 73 (364) T ss_pred CCC-----c-ccccccccc-cccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEeeeeeeeEEeeeccCcc Confidence 653 2 368889887 667889999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcc-hHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccc Q lcl|NC_020078. 81 PPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFD-YQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQM 159 (339) Q Consensus 81 i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d-~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~ 159 (339) |+++ .+.++|++|+||+++|++++|||||++|+||| +|+||++|+|||||+++||+|++++.++|.....+....+. T Consensus 74 ld~~-~~~~~k~~itID~ll~a~~~V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~~- 151 (364) T protein:vir:10 74 PDAS-PTEFDKNRLVVDTTVIARNTVAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNPR- 151 (364) T ss_pred cCCC-CcccCcEEEEecceeeechhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCCc- Confidence 9875 57789999999999999999999999999999 89999999999999999999998887766432222221112 Q ss_pred cCccccc--cccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceee Q lcl|NC_020078. 160 PGHSGGN--VVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLN 237 (339) Q Consensus 160 ~g~~~~~--~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~ 237 (339) +.+.|. .+.....+.++++++|+++|++++++|+|||| |.+|||+||+|++|++||++++|+|++|+.++++.+. T Consensus 152 -~~~~g~~i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdV--P~~~R~~vv~P~~y~~Ll~~~~lvn~d~~~~~~~~~~ 228 (364) T protein:vir:10 152 -VAGHGFSIHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEV--DTSELCGLMPWTAFNCLRDADRIVDKSYTIAASDNTV 228 (364) T ss_pred -ccCCcceeeecccCcchhhhHHHHHHHHHHHHHHHhhcCC--CccccEEEeChHHHHHHhcCCccccccccccCCCccc Confidence 222222 22333455678899999999999999999999 7789999999999999999999999999877666788 Q ss_pred cceeEEEeceEEEEeccccc-------cccccccccCCCccccccc--cccceEEEEEeccceeEEEEEeeeeEEeeech Q lcl|NC_020078. 238 TKYMFAAFGVPVITSNNAVF-------GKTITDHLLSNANNEKAYD--GDFKDIVAQMFSPKALLAGSTIPVTSKIFFDD 308 (339) Q Consensus 238 ~G~v~~i~G~~V~~Snnlp~-------~~~~~~~~l~~~~~~~~y~--~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~ 308 (339) +|+|++++||+|++|||+|. +..+++|++++++++++|. +++++.++++|||+|+++++++++++|.||++ T Consensus 229 ~G~v~~v~Gv~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e~~~~~ 308 (364) T protein:vir:10 229 DGFVLKSWNTPIVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISITGDIFYEK 308 (364) T ss_pred cceeEEEeceEEEeccccccccccccccccccccccccccCCcccccccccceeEEEEEecceEEEEEEecceeeeeecc Confidence 99999999999999999995 3467899999999999998 67779999999999999999999999999999 Q ss_pred hhhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 309 LSKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 309 ~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) ++|+|+|+++++|||+++||||+++|.+.++ T Consensus 309 ~~~~~~ida~~a~G~g~lRPeaa~~i~~~~~ 339 (364) T protein:vir:10 309 KEKTWYIDTFLAEGAIPDRWEAVAVVTAADT 339 (364) T ss_pred ceeeeeeeeehcccCcccCccceEEEEecCC Confidence 9999999999999999999999999999888 No 5 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=100.00 E-value=2.8e-100 Score=566.36 Aligned_cols=330 Identities=15% Similarity=0.137 Sum_probs=285.6 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccccceeeeccCCCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISKAKLQKIAPGTT 80 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~~t~~~~~~g~~ 80 (339) |.=. =+.|+-|+.|.++.++++++++||||+|+|||+++|+|+|+|+++|++|+|++|||+|||++|++++++|+||++ T Consensus 1 ma~~-~~~~~~n~~~~~~~~~~~~~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~g~s~~~~~iG~~~~~~~~~G~~ 79 (344) T protein:vir:10 1 MANM-TGGQQLGTNQGKDVMAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVLGRTQAAYLAPGEN 79 (344) T ss_pred Cccc-cccccCCcccCCccCCccchhHHHHHHHHHHHHHHHHHHhhhcccceeeeecccceEEEEeeceeEEEeeecCCC Confidence 3211 134666777776788999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCC-CCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccc Q lcl|NC_020078. 81 PPPS-TEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQM 159 (339) Q Consensus 81 i~~~-~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~ 159 (339) |++. ++++++|++|+||++|||+|+|||+|++|+++|+|+++++++||+||+++|++|++++++++....+.... . T Consensus 80 l~~t~~~~~~~e~~l~ID~~~y~~~~VdDiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~~~~~~---~ 156 (344) T protein:vir:10 80 LDDIRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESQYNEN---I 156 (344) T ss_pred CCCCCCCcccceEEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc---c Confidence 9875 56789999999999999999999999999999999999999999999999999999999998877665543 3 Q ss_pred cCccccccccccCc-----cccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccc Q lcl|NC_020078. 160 PGHSGGNVVTLAGA-----NDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGE 234 (339) Q Consensus 160 ~g~~~~~~~~~~~~-----~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~ 234 (339) +|.+.++++..... ..+..++++|++|+++.++|+|++| |.+|||+||+|++|++||++++|++.+|.++. T Consensus 157 ~g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~V--P~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~-- 232 (344) T protein:vir:10 157 TGLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYV--PSSDRVFYCDPDSYSAILAALMPNAANYAALI-- 232 (344) T ss_pred ccccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCC--CccCCEEEeChHHHHHHhhccccccccccccc-- Confidence 34444444332222 2334457899999999999999999 78899999999999999999999999987654 Q ss_pred eeecceeEEEeceEEEEeccccccccccccccCCCccccc--------cccccceEEEEEeccceeEEEEEeeeeEEeee Q lcl|NC_020078. 235 TLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKA--------YDGDFKDIVAQMFSPKALLAGSTIPVTSKIFF 306 (339) Q Consensus 235 ~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~--------y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~ 306 (339) .+.+|.|++++||+||+|||+|.+ .++.+..+.+++.+. |.+++++++|++|||+|+++++++++++|.+| T Consensus 233 ~~~~G~V~~v~G~~V~~Sn~lp~~-~~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~~~e~~r 311 (344) T protein:vir:10 233 DPEKGSIRNVMGFEVVEVPHLTAG-GAGTSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLALERAR 311 (344) T ss_pred ceeeeEEEEEeceEEEeccccccc-cCCcccccccCccccccCCcccceeeecceeEEEeechhhhhhhhhccceeeccc Confidence 478999999999999999999965 344455555544444 44589999999999999999999999999999 Q ss_pred chhhhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 307 DDLSKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 307 ~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) ++++|+|+|+|+|+||||++||||+++|+++.- T Consensus 312 ~~~~~~d~i~g~~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 312 RANFQADQIIAKYAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred chhHHHHHHHHHhhcccceecccceEEEEeecC Confidence 999999999999999999999999999998888 No 6 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=100.00 E-value=7.8e-100 Score=563.92 Aligned_cols=327 Identities=16% Similarity=0.143 Sum_probs=287.7 Q ss_pred cCcccCCC--cccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccccceeeeccCCCCCC Q lcl|NC_020078. 5 DGQTPSYD--VTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISKAKLQKIAPGTTPP 82 (339) Q Consensus 5 ~~~~~~~~--~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~~t~~~~~~g~~i~ 82 (339) -++.++-+ -|||||+++++|+++||||+|+|||+++|+|+|+|+++|++|+|++|||++||++|++++.+|+||++++ T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~rti~~G~sv~~~~iG~~~~~~~~~G~~l~ 80 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGDKLALFLKVFGGEVLTAFTRTSVTMNKHLVRSIQSGKSAQFPVLGRTKAAYLQPGENLD 80 (347) T ss_pred CCccccccccccccccCCcccchHHHHHHHHhHHHHHHHHHHHhhhhhhhheeccccceEEeeeccceeEeeeecCcCCC Confidence 45666654 5999999999999999999999999999999999999999999999999999999999999999999998 Q ss_pred CC-CCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccC Q lcl|NC_020078. 83 PS-TEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPG 161 (339) Q Consensus 83 ~~-~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g 161 (339) .+ ++++++|++|+||+++||+|+|||+|++|+++|+|+++++++||+||+++||+|++++++++....+.. ...+| T Consensus 81 ~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~~---~~~~g 157 (347) T protein:vir:94 81 DKRKDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTANN---ENIAG 157 (347) T ss_pred CCcCCccccceEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc---ccccc Confidence 75 467899999999999999999999999999999999999999999999999999999999987654433 33555 Q ss_pred ccccccccccC-----ccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhccccccccee Q lcl|NC_020078. 162 HSGGNVVTLAG-----ANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETL 236 (339) Q Consensus 162 ~~~~~~~~~~~-----~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l 236 (339) .++++.+.... .+.+.++.++|++|+++.++|+|+|| |+++||+||+|++|+.||+..++...+|.... .+ T Consensus 158 ~~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dV--P~~~R~~vv~P~~y~~LLk~~~~~~~~~~~~~--~~ 233 (347) T protein:vir:94 158 LGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYV--PSSDRVFYTTPDNYSAILAALMPNAANYQALI--DP 233 (347) T ss_pred CCcceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCC--CCCCCEEEeChHHHHHHHHhhccccccccccc--cc Confidence 55555544322 23345688999999999999999999 78899999999999999998888877775543 47 Q ss_pred ecceeEEEeceEEEEeccccccccccccccCC------------CccccccccccceEEEEEeccceeEEEEEeeeeEEe Q lcl|NC_020078. 237 NTKYMFAAFGVPVITSNNAVFGKTITDHLLSN------------ANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKI 304 (339) Q Consensus 237 ~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~------------~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~ 304 (339) ++|.|++++||+||+|||+|.... +.+.++. ....++|+++|+++++++|||+|+++++++++++|. T Consensus 234 ~~G~V~~v~G~~V~~Sn~~p~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~~~~~e~ 312 (347) T protein:vir:94 234 STGSIRNVMGFEVIEVPHLTAGGA-GDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLKDMALER 312 (347) T ss_pred ccceeEEeeceEEEEcCccccccC-cccccccccccccccccccccccccccccccceEEEEechhhhhhhhhcccceee Confidence 899999999999999999996432 2222221 234578999999999999999999999999999999 Q ss_pred eechhhhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 305 FFDDLSKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 305 ~~~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) +|++++|+|+|+|+++|||+++||||+|+|.+++| T Consensus 313 ~~~~~~~~~~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 313 ARRANFQADQIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred eechhhhhhhhhhhhhhcCcccccceeEEEEecCC Confidence 99999999999999999999999999999999999 No 7 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=100.00 E-value=7.1e-99 Score=558.68 Aligned_cols=330 Identities=15% Similarity=0.149 Sum_probs=283.3 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccccceeeeccCCCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISKAKLQKIAPGTT 80 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~~t~~~~~~g~~ 80 (339) |.-..|.-..-+.+||||+ +++|+++||||+|+|||+++|+|+|+|+++|++|+|++|||++||++|++++++|+||++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~-~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~r~i~~gks~~~~~iG~~~~~~~~~G~~ 79 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVV-AAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVLGRTQAAYLAPGEN 79 (345) T ss_pred Ccccccchhcccccccccc-cCCchhHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEEeeecceEEEeeecCCC Confidence 6666666666688999987 688999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCC-CCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccc Q lcl|NC_020078. 81 PPPS-TEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQM 159 (339) Q Consensus 81 i~~~-~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~ 159 (339) |+++ +.++++|++|+||+++||+|+|||||+||+++|+|+++++|+||+||+++||+|+++++|+|+..++....+ T Consensus 80 l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~~~~--- 156 (345) T protein:vir:22 80 LDDKRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESKYNENI--- 156 (345) T ss_pred CCCCCCCcccceEEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc--- Confidence 9885 357889999999999999999999999999999999999999999999999999999999998877664333 Q ss_pred cCccccccccccCc-----cccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccc Q lcl|NC_020078. 160 PGHSGGNVVTLAGA-----NDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGE 234 (339) Q Consensus 160 ~g~~~~~~~~~~~~-----~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~ 234 (339) .|.+.+..+..... ...+.+.++|++|++|.++|+|+|| |.+|||+||+|++|++|+++++|++.+|.++.. T Consensus 157 ~~~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~V--P~~~R~~vv~P~~y~~Ll~~~~~~~~~~~~~~~- 233 (345) T protein:vir:22 157 EGLGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYV--PAADRVFYCDPDSYSAILAALMPNAANYAALID- 233 (345) T ss_pred cccccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCC--CccCCEEEeChHHHHHHhccccccccccccccc- Confidence 33444443332222 1234567899999999999999999 667999999999999999999999999986553 Q ss_pred eeecceeEEEeceEEEEecccccccccc--------ccccC-CCccccccccccceEEEEEeccceeEEEEEeeeeEEee Q lcl|NC_020078. 235 TLNTKYMFAAFGVPVITSNNAVFGKTIT--------DHLLS-NANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIF 305 (339) Q Consensus 235 ~l~~G~v~~i~G~~V~~Snnlp~~~~~~--------~~~l~-~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~ 305 (339) ..+|.|++++||+||+|||+|.+.... +|.++ ..++.+ +....++++|++|||+|+++++++++++|.+ T Consensus 234 -~~~G~V~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~g~~~-~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~ 311 (345) T protein:vir:22 234 -PEKGSIRNVMGFEVVEVPHLTAGGAGTAREGTTGQKHVFPANKGEGN-VKVAKDNVIGLFMHRSAVGTVKLRDLALERA 311 (345) T ss_pred -cccceEEEEeceEEEecccccccccCccccCccccccccccccccee-eeeccCceEEEEEehhheeeeeeecceeeee Confidence 578999999999999999999643221 12222 122223 3445688999999999999999999999999 Q ss_pred echhhhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 306 FDDLSKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 306 ~~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) |++++|+|+|+|+++|||+++||||+++|++.-- T Consensus 312 r~~~~~~d~I~~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 312 RRANFQADQIIAKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred echhHHHHHHHHHHhcCCcccccceeEEEEEeeC Confidence 9999999999999999999999999999988877 No 8 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=100.00 E-value=4.4e-99 Score=559.84 Aligned_cols=329 Identities=28% Similarity=0.439 Sum_probs=287.6 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccccceeeeccCCCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISKAKLQKIAPGTT 80 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~~t~~~~~~g~~ 80 (339) ||. | -+++||+|+ +++++++||||+|+|||+++|+++++|+++|++|+|++|||+||+++|+++++||+||++ T Consensus 1 Ms~-----~-n~~t~~~~~-~s~~~~al~le~f~geV~taF~~~si~~~~~~vrti~~GkS~qf~~iG~~~a~y~~~G~~ 73 (402) T protein:vir:97 1 MST-----P-NTLTNVAVS-ASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGETELQVLAPGQS 73 (402) T ss_pred CCC-----c-ccccccccc-cccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEEEEEeeeEEeeeccccc Confidence 653 2 368888887 667889999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcc-hHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccc Q lcl|NC_020078. 81 PPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFD-YQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQM 159 (339) Q Consensus 81 i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d-~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~ 159 (339) ++++ .+.++|++|+||+++|++++|||||++|+||| +|+++++|+|++||+++||+|++++..+++....+....+.. T Consensus 74 ldg~-~~~~~k~~ItID~lL~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~~~ 152 (402) T protein:vir:97 74 PNAT-PTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRV 152 (402) T ss_pred cCCC-CcccccEEEEeCceeechhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCcc Confidence 9875 57789999999999999999999999999999 899999999999999999999888887776432222111111 Q ss_pred cCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceeecc Q lcl|NC_020078. 160 PGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLNTK 239 (339) Q Consensus 160 ~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~~G 239 (339) .+.+..-.+..+....++++++|+++|++++++|+|||| |.+|||++|+|++|++||++++|+|++|+.++++.+.+| T Consensus 153 ~~~g~s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdV--P~~dRv~vv~P~~y~~Ll~~~rl~n~d~~~~~~g~~~~G 230 (402) T protein:vir:97 153 KGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEV--DISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATING 230 (402) T ss_pred cccccccccccccchhhcCHHHHHHHHHHHHHHHHhcCC--CccccEEEeChHHHHHHhhcccccchhhccccCCccccc Confidence 111111112223344468999999999999999999999 778999999999999999999999999986666678899 Q ss_pred eeEEEeceEEEEecccccc-ccccccccCCCccccccc--cccceEEEEEeccceeEEEEEeeeeEEeeechhhhHHHHH Q lcl|NC_020078. 240 YMFAAFGVPVITSNNAVFG-KTITDHLLSNANNEKAYD--GDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWFID 316 (339) Q Consensus 240 ~v~~i~G~~V~~Snnlp~~-~~~~~~~l~~~~~~~~y~--~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~i~ 316 (339) .|++++||+||+|||+|.. .++++|.+++++++++|+ ++++..+|++|||+|++++|++++++|.||++++|+|+|+ T Consensus 231 ~v~~v~Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~vT~~~~~d~r~~~~~id 310 (402) T protein:vir:97 231 FVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYID 310 (402) T ss_pred eeEEEeceEEEecCccccccccccccccccCCCCccCCcCcccceeEEEEEecceEEEEEeeccccchhhchhHHHHHHH Confidence 9999999999999999964 578999999999999888 8999999999999999999999999999999999999999 Q ss_pred HHHHhCCccccccceEEEEecC----C Q lcl|NC_020078. 317 SWLAFGVTINRTEYAGVIKLPA----A 339 (339) Q Consensus 317 g~~~~Ga~v~rPe~~v~i~~~~----a 339 (339) ++++|||+++||||+++|++-- + T Consensus 311 ~~~a~G~g~~RPeaa~vv~~~~~~t~~ 337 (402) T protein:vir:97 311 TFMAEGAIPDRWEAVSVVTTKRDATTG 337 (402) T ss_pred HHHHhCCcccCccceEEEEEecccccc Confidence 9999999999999999996532 2 No 9 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=100.00 E-value=1.4e-99 Score=562.46 Aligned_cols=333 Identities=18% Similarity=0.169 Sum_probs=294.0 Q ss_pred Ccccc-CcccCC-CcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccccceeeeccCC Q lcl|NC_020078. 1 MSIFD-GQTPSY-DVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISKAKLQKIAPG 78 (339) Q Consensus 1 ~~~~~-~~~~~~-~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~~t~~~~~~g 78 (339) ||.-. ..++.- ..+||||+ +++++++||||+|+|||+++|+++|+++++|++|+|++|||++|+++|++++++|+|| T Consensus 1 ~~~~~~~~~~~~n~~t~~~~~-~~~~~~al~le~f~geV~~~f~~~si~~~~~~~rti~~Gksv~f~~iG~~t~~~~t~G 79 (375) T protein:vir:10 1 MANANQVALGRSNLSTGTGYG-GATDKYALYLKLFSGEMFKGFQHETIARDLVTKRTLKNGKSLQFIYTGRMTSSFHTPG 79 (375) T ss_pred CccccccccCccccCCccccc-cccchHHHHHHHHhHHHHHHHHHHHhhhccccccccccCceEEEEeeeeeEEeeecCC Confidence 65432 334443 47788777 6779999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCCC--CCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Q lcl|NC_020078. 79 TTPPPST--EPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTA 156 (339) Q Consensus 79 ~~i~~~~--~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~ 156 (339) ++|++++ .+++++++|+||++|||+|+|||||++|+++|+|+++++|+||+||+++|++|+++++|||+...+....+ T Consensus 80 ~~i~~~~~~d~~~te~~l~ID~~~y~~~~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~~p~~~~~ 159 (375) T protein:vir:10 80 TPILGNADKAPPVAEKTIVMDDLLISSAFVYDLDETLAHYELRGEISKKIGYALAEKYDRLIFRSITRGARSASPVSATN 159 (375) T ss_pred cCcCCccccCCCCCceEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Confidence 9998875 46678999999999999999999999999999999999999999999999999999999999998888777 Q ss_pred ccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcc---cchhhhccccccc Q lcl|NC_020078. 157 AQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQA---EHITNGEYVTSAG 233 (339) Q Consensus 157 ~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~---~~~~n~d~~~~~~ 233 (339) ...+|..............+++++++|++|+++.++|+|++| |.++||+||+|++|++||++ ++|+|++|.+++ T Consensus 160 ~~~~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~V--P~~~R~~vv~P~~y~~Ll~~~d~~~~~n~d~~~~~- 236 (375) T protein:vir:10 160 FVEPGGTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEKGV--SSQGRCAVLNPRQYYALIQDIGSNGLVNRDVQGSA- 236 (375) T ss_pred ccccCcceeeeccccccccccCHHHHHHHHHHHHHHHhhcCC--CCCCCEEEeChHHHHHHHhcCCccceeeecccccc- Confidence 777765554444445556668899999999999999999999 77899999999999999986 689999996554 Q ss_pred ceeecceeEEEeceEEEEecccccccccccc------------------------ccCCCcccccccccc---ceEEEEE Q lcl|NC_020078. 234 ETLNTKYMFAAFGVPVITSNNAVFGKTITDH------------------------LLSNANNEKAYDGDF---KDIVAQM 286 (339) Q Consensus 234 ~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~------------------------~l~~~~~~~~y~~~~---~~~~~~~ 286 (339) ...+|.|++++||+||+|||+|..+. ++| .+..+|+.++|++++ ++++|++ T Consensus 237 -~~~~g~v~~i~Gv~V~~Sn~lP~~~~-~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~~~~~~~ 314 (375) T protein:vir:10 237 -LQSGNGVIEIAGIHIYKSMNIPFLGK-YGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAKSCGLI 314 (375) T ss_pred -eeccceEEEEeceEEEEecccccccc-ccccccccccccchhhhhccccccCCcceeeccccccccccccccCceEEEE Confidence 46689999999999999999996532 222 224566778999998 9999999 Q ss_pred eccceeEEEEEeeeeEEee---echhhhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 287 FSPKALLAGSTIPVTSKIF---FDDLSKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 287 ~h~~A~~~~~~~~~~~e~~---~~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) |||+|+|+++++++++|++ |+.++|+|+|+++++|||+++||||+|+|++++. T Consensus 315 ~~~~A~g~v~~~~~~~~~~~~~~~~~~q~~~i~~~~a~G~~~lrp~~av~l~~~~~ 370 (375) T protein:vir:10 315 FQKEAAGVVEAIGPQVQVTNGDVSVIYQGDVILGRMAMGADYLNPAAAVELYIGAT 370 (375) T ss_pred EchhheeeeeeeccccccccchhhheeeeeeeeeeeeeccCccCceeEEEEecCcC Confidence 9999999999999999998 5899999999999999999999999999999966 No 10 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=100.00 E-value=1.6e-98 Score=556.79 Aligned_cols=325 Identities=21% Similarity=0.253 Sum_probs=289.7 Q ss_pred CccccCcccCCCcccCCccCcccchh-HHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccccceeeeccCCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPL-ADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISKAKLQKIAPGT 79 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~-a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~~t~~~~~~g~ 79 (339) |+... -+-.-|..|+||+++++|++ |+|||+|+|||+++|+|+|+|+++++.|++++||||||+++|++++++|++|+ T Consensus 1 ~~~~~-~~~~~~~~~~~~~~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~~~r~i~~G~tv~i~~ig~~~~~~~~~g~ 79 (332) T protein:vir:78 1 MTTLS-NFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLSAGYHTPGT 79 (332) T ss_pred Ccccc-cccCCccccCCccccccccchhhhhhhhhhhHHHHHHHHhhhhhccccccccccceEEEEeccceeEeeecCCC Confidence 55432 23344788888889999976 99999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccc Q lcl|NC_020078. 80 TPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQM 159 (339) Q Consensus 80 ~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~ 159 (339) +|++++.+++++++|+|||.|||+|+|||+|++|+++|+|+++++++||+||+++|++|++++++||+...+.. T Consensus 80 ~l~~~~~~~~~~~~l~ID~~ky~~~~VddiD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~~~~------ 153 (332) T protein:vir:78 80 PIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVT------ 153 (332) T ss_pred CCCCCCCCCCceEEEEEehhhhhHHHHHhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCccc------ Confidence 99987778899999999999999999999999999999999999999999999999999999999987554333 Q ss_pred cCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhc--ccchhhhcccccccceee Q lcl|NC_020078. 160 PGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQ--AEHITNGEYVTSAGETLN 237 (339) Q Consensus 160 ~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~--~~~~~n~d~~~~~~~~l~ 237 (339) +..+++.+.. +++.++++.++|++|++|+++|+|+|| |.+|||+||+|++|++||+ |++|+|++++++++ .++ T Consensus 154 -~~~g~~~~~~-~~~~~~~~~~~~~~i~~a~~~Lde~~V--P~~gR~~vv~P~~y~~Ll~~~d~~~~n~~~~~~~~-~~~ 228 (332) T protein:vir:78 154 -GEPGGFHVNI-GAGNTNDAQAIVDGFFEAAAVLDERSA--PQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQG-DMN 228 (332) T ss_pred -cccccccccc-CCccccCHHHHHHHHHHHHHHHhhcCC--CccCCEEEeCHHHHHHHHhhcCceeeeeecccccc-cee Confidence 3333444433 344568899999999999999999999 7789999999999999998 89999999988776 467 Q ss_pred cce-eEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEe---eechhhhHH Q lcl|NC_020078. 238 TKY-MFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKI---FFDDLSKLW 313 (339) Q Consensus 238 ~G~-v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~---~~~~~~~~d 313 (339) +|. |++++||+||+|||+|.+....+...+.+++.++|+++|++++|++|||+|+++++++++++|. .|++++|+| T Consensus 229 ~g~~i~~i~G~~V~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~~~~~~~~~d 308 (332) T protein:vir:78 229 SGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGD 308 (332) T ss_pred cceeeeEEeeeEEEecCccccCcccccccccccccccccccccccceEEeecccceeeeeeeccchhhhhcccchhhhHh Confidence 775 8999999999999999876666666777888999999999999999999999999999997775 678999999 Q ss_pred HHHHHHHhCCccccccceEEEEec Q lcl|NC_020078. 314 FIDSWLAFGVTINRTEYAGVIKLP 337 (339) Q Consensus 314 ~i~g~~~~Ga~v~rPe~~v~i~~~ 337 (339) +|+|+++||++++||||+++|+.. T Consensus 309 ~i~~~~~~G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 309 LIVGKLAMGCGSLRTSVAGSFQAA 332 (332) T ss_pred hhhhhhhhcCceecccceEEEeeC Confidence 999999999999999999999877 No 11 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=100.00 E-value=2.9e-98 Score=555.34 Aligned_cols=328 Identities=16% Similarity=0.125 Sum_probs=284.4 Q ss_pred cCcccCCC--cccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccccceeeeccCCCCCC Q lcl|NC_020078. 5 DGQTPSYD--VTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISKAKLQKIAPGTTPP 82 (339) Q Consensus 5 ~~~~~~~~--~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~~t~~~~~~g~~i~ 82 (339) -++.|+-+ .|||||+++++|+++||||+|+|||+++|+++|+|++++++|++++|||+||+++|++++++|++|++++ T Consensus 1 ~~~~~~~~~~~t~~g~~~~~~~~~al~ie~~~g~V~~~f~~~s~~~~~v~~r~~~~G~sv~i~~iG~~t~~~~~~g~~l~ 80 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIGRTKAAYLKPGENLD 80 (347) T ss_pred CCCCccCcccccccccCCcccchHHHHHHHHHHHHHHHHHHHHhhhhhhccccccccceeEeeeccceeeeeecCCCCCC Confidence 56777776 6999999999999999999999999999999999999999999999999999999999999999999998 Q ss_pred CC-CCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-cccccccc Q lcl|NC_020078. 83 PS-TEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSP-YGTAAQMP 160 (339) Q Consensus 83 ~~-~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~-~~~~~~~~ 160 (339) .+ +.+++++++|+||+.+||+|+|||+|++|+++|+|+++++++|++||+++|++|++++.+++.....+ ...++ . T Consensus 81 ~~~~~~~~~e~~ltiD~~~y~~~~VddiD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~--~ 158 (347) T protein:vir:33 81 DKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGSNENIEG--L 158 (347) T ss_pred CCCCCCccceEEEEechhhhhhHHHhhHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccc--c Confidence 75 45788999999999999999999999999999999999999999999999999999998766544322 22222 2 Q ss_pred CccccccccccCccccc----cHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhccccccccee Q lcl|NC_020078. 161 GHSGGNVVTLAGANDYK----DPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETL 236 (339) Q Consensus 161 g~~~~~~~~~~~~~~~~----~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l 236 (339) |...++.....+++..+ ++.++|++|++++++|+|+|| |.++||+||+|++|++||++++|++++|.++. .+ T Consensus 159 ~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~V--P~~gR~~vv~P~~y~~Ll~~~~~~~~d~~~~~--~~ 234 (347) T protein:vir:33 159 GKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYV--PAADRTFYTTPDNYSAILAALMPNAANYQALL--DP 234 (347) T ss_pred cccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCC--CccCcEEEeCHHHHHHHhcccccccccccccc--cc Confidence 33333443333333333 467899999999999999999 77899999999999999999999999997643 57 Q ss_pred ecceeEEEeceEEEEeccccccccccccccC-CCccccc--------cccccceEEEEEeccceeEEEEEeeeeEEeeec Q lcl|NC_020078. 237 NTKYMFAAFGVPVITSNNAVFGKTITDHLLS-NANNEKA--------YDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFD 307 (339) Q Consensus 237 ~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~-~~~~~~~--------y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~ 307 (339) .+|.|++++||+||+|||+|.+. +++|.++ .+|+.+. +.++|++.+||+|||+|+|+++++++++|.+|+ T Consensus 235 ~~G~V~~i~G~~V~~Sn~lp~~~-~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e~~r~ 313 (347) T protein:vir:33 235 ERGTIRNVMGFEVVEVPHLTAGG-AGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARR 313 (347) T ss_pred ccceeEEEeceeEEEecccccCc-cccccccccccccccccCCcccceeccccceeeeeecchhheeeeeeceeeeeccc Confidence 89999999999999999999764 4444443 2344444 456677889999999999999999999999999 Q ss_pred hhhhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 308 DLSKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 308 ~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) +++|+|+|+|+++||||++||||+|+|+++.- T Consensus 314 ~~~~~d~i~~~~~~G~~vlrP~~av~i~~~~~ 345 (347) T protein:vir:33 314 ANYQADQIIAKYAMGHGGLRPEAAGAIVLPKV 345 (347) T ss_pred hhhhhHhhhhhhhcCCceecccceEEEecCCC Confidence 99999999999999999999999999999988 No 12 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=100.00 E-value=1.7e-97 Score=551.16 Aligned_cols=328 Identities=16% Similarity=0.149 Sum_probs=286.9 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccccceeeeccCCCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISKAKLQKIAPGTT 80 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~~t~~~~~~g~~ 80 (339) |+-- -|+-..|||||++.++|+.+||||+|++||+++|+++|+|+++|++|+|++|||+|||++|++++++|+||++ T Consensus 1 m~~~---~~~~~~t~~g~~~~~~d~~al~ik~f~~eV~~~f~~~s~~~~~~~~r~i~~G~sv~i~~iG~~tv~~~t~G~~ 77 (347) T protein:vir:94 1 MANV---PGQKIGTDQGKGKSSSDALALFLKVFAGEVLTAFTRRSVTADKHIVRTIQNGKSAQFPVMGRTSGVYLAPGER 77 (347) T ss_pred CCCC---CccccccccccCCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccccceEEEecccceeeeeecCCCC Confidence 6543 3566779999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCC-CCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccc Q lcl|NC_020078. 81 PPPS-TEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQM 159 (339) Q Consensus 81 i~~~-~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~ 159 (339) |+++ +.+++++++|+||+.+|++|+|||+|++|+++|+|+++++++|++||+++|++|++++++.+....++ .... T Consensus 78 l~~~~~~~~~~e~~itID~~~~~~~~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~---~~~~ 154 (347) T protein:vir:94 78 LSDKRKGIKHTEKVITIDGLLTADVMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAAS---NENI 154 (347) T ss_pred cCCCCCCCCcceEEEEecchhhhhHHhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc---cccc Confidence 9774 45788999999999999999999999999999999999999999999999999999998766444333 3444 Q ss_pred cCccccccccccCcccc----ccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccce Q lcl|NC_020078. 160 PGHSGGNVVTLAGANDY----KDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGET 235 (339) Q Consensus 160 ~g~~~~~~~~~~~~~~~----~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~ 235 (339) +|.+.++++.....+.. .++++++++|+++.++|+|+|| |.+|||+||+||+|++||++++|++.+|.+++ . T Consensus 155 ~g~~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~V--P~~~R~~vv~P~~~~~Ll~~~~~~~~~~~~~~--~ 230 (347) T protein:vir:94 155 AGLGTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYV--PAGDRYFYTTPDNYSAILAALMPNAANYAALI--D 230 (347) T ss_pred CCCcccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCC--CCCCcEEEeCHHHHHHHhccchhhhhhccccc--c Confidence 55556666655444444 4568899999999999999999 77899999999999999999999999987754 4 Q ss_pred eecceeEEEeceEEEEeccccccccccccccC-----CCcc--------ccccccccceEEEEEeccceeEEEEEeeeeE Q lcl|NC_020078. 236 LNTKYMFAAFGVPVITSNNAVFGKTITDHLLS-----NANN--------EKAYDGDFKDIVAQMFSPKALLAGSTIPVTS 302 (339) Q Consensus 236 l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~-----~~~~--------~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~ 302 (339) +++|.|++++||+||+|||+|.... +.++.+ .+|+ ...|.++|+++++++|||+|+++|+++++++ T Consensus 231 ~~~G~Vg~i~G~~V~~Sn~lp~~~~-t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~ 309 (347) T protein:vir:94 231 PETGNIRNVMGFVVVEVPHLVQGGA-GETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDLAL 309 (347) T ss_pred ccccceEEEeceEEEecCccccccc-ccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhcccccc Confidence 7899999999999999999995432 222221 1221 2458899999999999999999999999999 Q ss_pred EeeechhhhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 303 KIFFDDLSKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 303 e~~~~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) |.+|++++|+|+|+|+++||||++||||+|+|++++| T Consensus 310 e~~r~~~~~~d~i~~~~~~G~~~~rP~~a~~~~~~~A 346 (347) T protein:vir:94 310 ERDRDVDAQGDLIVGKYAMGHGGLRPEAAGALVFSPA 346 (347) T ss_pred cchhchhhHHHHhhhhhhhcCcccccceeEEEEecCC Confidence 9999999999999999999999999999999999999 No 13 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=100.00 E-value=3.9e-97 Score=549.12 Aligned_cols=327 Identities=17% Similarity=0.165 Sum_probs=289.2 Q ss_pred CccccCcccCC-Cc-ccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccccceeeeccCC Q lcl|NC_020078. 1 MSIFDGQTPSY-DV-TRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISKAKLQKIAPG 78 (339) Q Consensus 1 ~~~~~~~~~~~-~~-~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~~t~~~~~~g 78 (339) |. +.|+- ++ +||||++.++|+.+||||+|+|||+++|+++|+|+++|++|++++|||+|||++|+.++.+|++| T Consensus 1 ~a----~~~~~~~~~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~G~sv~~~~iG~~~~~~~~~g 76 (347) T protein:vir:88 1 MA----NATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTKGYYLAPG 76 (347) T ss_pred CC----CcccchhhhccCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhhccccccccCcceEEEeeecceeeeeeccc Confidence 43 34454 33 99999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCC-CCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Q lcl|NC_020078. 79 TTPPPS-TEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAA 157 (339) Q Consensus 79 ~~i~~~-~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~ 157 (339) ++|+.+ +++++++++|+||+.+||+|+|||+|++|+++|+|+++++++|++||+++|++|+++++++++...+ ... T Consensus 77 ~~l~~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~---~~~ 153 (347) T protein:vir:88 77 ENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAA---SNE 153 (347) T ss_pred cCCCCCCCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc---ccc Confidence 999875 5678899999999999999999999999999999999999999999999999999999998875433 455 Q ss_pred cccCccccccccccCccccc----cHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhccccccc Q lcl|NC_020078. 158 QMPGHSGGNVVTLAGANDYK----DPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAG 233 (339) Q Consensus 158 ~~~g~~~~~~~~~~~~~~~~----~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~ 233 (339) .++|++.++.+..+.+.+.. .+..+|++|+++.++|+|++| |.++||+||+|++|++||+++++++.+|.... T Consensus 154 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~V--P~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~- 230 (347) T protein:vir:88 154 NIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYV--PAGDRRFYCAPEDYSAILSALMPNAANYAALI- 230 (347) T ss_pred ccCCccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCC--CCCCCEEEeCHHHHHHHhcchhhhhhhhcccc- Confidence 66777777666655444433 456789999999999999999 88899999999999999999999999987554 Q ss_pred ceeecceeEEEeceEEEEeccccccccccccccCC------------CccccccccccceEEEEEeccceeEEEEEeeee Q lcl|NC_020078. 234 ETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSN------------ANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVT 301 (339) Q Consensus 234 ~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~------------~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~ 301 (339) .+++|.|++++||+|++|||+|.+.. ..++.+. .+..+.|.+++++.++++||++|+|++++++++ T Consensus 231 -~~~~G~vg~i~G~~V~~s~nlp~~~~-~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~d~~ 308 (347) T protein:vir:88 231 -DPETGNIRNVMGFEVIEVPHLTVGGA-GDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMA 308 (347) T ss_pred -chhcceeeeeccceEEEeeccccccc-ccccccccccccccccccccccccccccccCcEEEEEechhhhhheecccce Confidence 47899999999999999999995432 2233222 234567999999999999999999999999999 Q ss_pred EEeeechhhhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 302 SKIFFDDLSKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 302 ~e~~~~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) +|.+|++++|+|+|+|+++|||+++||||+|+|+++.| T Consensus 309 ~e~~r~~~~~~d~i~~~~~~G~~~~rPe~a~~~~~~~a 346 (347) T protein:vir:88 309 LERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPA 346 (347) T ss_pred eeeeechhhHHHHhhhhhhhcCceeccceEEEEEeCCC Confidence 99999999999999999999999999999999999999 No 14 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=100.00 E-value=2.8e-96 Score=544.41 Aligned_cols=330 Identities=16% Similarity=0.129 Sum_probs=288.1 Q ss_pred cCcccCCC--cccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccccceeeeccCCCCCC Q lcl|NC_020078. 5 DGQTPSYD--VTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISKAKLQKIAPGTTPP 82 (339) Q Consensus 5 ~~~~~~~~--~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~~t~~~~~~g~~i~ 82 (339) -++.++-+ .|||||+++++|++++|||+|++||+++|+++|+|++++++|++++|||+|||++|++++++|++|++++ T Consensus 1 ma~~~~~~~~~t~~~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~~~~~G~sv~i~~ig~~t~~~~~~g~~l~ 80 (347) T protein:vir:15 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIGRTKAAYLKPGENLD 80 (347) T ss_pred CCccccCCccccccccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhccccccccccceeEeeeccceeeeeeccCCCCC Confidence 46666654 5999999999999999999999999999999999999999999999999999999999999999999997 Q ss_pred CC-CCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccC Q lcl|NC_020078. 83 PS-TEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPG 161 (339) Q Consensus 83 ~~-~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g 161 (339) .+ +.+++++++|+||+.+||+|+|||+|++|+++|+|+++++++||+||+++|++|++++++++.. .+....+...+| T Consensus 81 ~~~~~~~~~e~~ltID~~~~~~~~VddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~-~~~~~~~~~~~g 159 (347) T protein:vir:15 81 DKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNL-PDASNENIEGLG 159 (347) T ss_pred CCCCCCccceEEEEechhhhhhHHhhhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-cccccccccccC Confidence 64 3578899999999999999999999999999999999999999999999999999999986643 344455555556 Q ss_pred ccccccccccCcccccc----HHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceee Q lcl|NC_020078. 162 HSGGNVVTLAGANDYKD----PAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLN 237 (339) Q Consensus 162 ~~~~~~~~~~~~~~~~~----~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~ 237 (339) +..........+++.++ +.+++++|++++++|+|+|| |.+|||+||+|++|++||++++|++++|.++. .++ T Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~V--P~~gR~~vv~P~~y~~LL~~~~~~~~d~~~~~--~~~ 235 (347) T protein:vir:15 160 KPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYV--PAADRTFYTTPDNYSAILAALMPNAANYQALI--DHE 235 (347) T ss_pred ccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCC--CccCCEEEeCHHHHHHHhcccccccccccccc--ccc Confidence 66655555555555554 46789999999999999999 77899999999999999999999999997654 478 Q ss_pred cceeEEEeceEEEEeccccccccccccccCCCcccccc--------ccccceEEEEEeccceeEEEEEeeeeEEeeechh Q lcl|NC_020078. 238 TKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAY--------DGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDL 309 (339) Q Consensus 238 ~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y--------~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~ 309 (339) +|.|++++||+||+|||+|......+.....+|+.+.| .+.|++.++|+||++|+++++++++++|.+|+++ T Consensus 236 ~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e~~~~~~ 315 (347) T protein:vir:15 236 RGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRAN 315 (347) T ss_pred ceEEEEEeceEEEecccccccccccccccccccccccccccccceeeeccccceeeeeccceeeeeEeeceeeeecccch Confidence 99999999999999999997644333333334444444 4566778899999999999999999999999999 Q ss_pred hhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 310 SKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 310 ~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) +|+|+|+++++||||++||||+|+|+++.- T Consensus 316 ~~~d~i~~~~~~G~~vlrP~~av~~~~~~~ 345 (347) T protein:vir:15 316 YQADQIIAKYAMGHGGLRPEAAGAIVLPKV 345 (347) T ss_pred hhhhhhehhhhcCCceeccccEEEEecCCC Confidence 999999999999999999999999999988 No 15 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=100.00 E-value=1.1e-95 Score=541.07 Aligned_cols=327 Identities=28% Similarity=0.430 Sum_probs=285.5 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccccceeeeccCCCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISKAKLQKIAPGTT 80 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~~t~~~~~~g~~ 80 (339) ||- | -+++||||+ +++++++||||+|+|||+++|+++++|++++++|+|++|||+||+++|+.++++|+||++ T Consensus 1 Ms~-----~-n~~t~~~~~-~sg~~~al~Le~f~GeV~taF~~~si~~~~~~vRti~~gkS~qf~~~G~s~~~~~~pG~~ 73 (401) T protein:vir:70 1 MST-----P-NNLTNVAVS-ASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYLGETELQVLAPGQS 73 (401) T ss_pred CCC-----C-ccccccccc-cccchhHhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEeeeeEeeeecCCCC Confidence 542 2 478999888 666999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcc-hHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccc Q lcl|NC_020078. 81 PPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFD-YQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQM 159 (339) Q Consensus 81 i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d-~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~ 159 (339) |+++ .+.++|++|+||+++|++++|+|||++|+||| +|+||++++|+|||+++||+|++++..|++....+. .... T Consensus 74 ld~~-~~~~dK~~ItID~lL~a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~--~~~p 150 (401) T protein:vir:70 74 PAAT-STQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAK--RTNP 150 (401) T ss_pred cCCC-CcccccEEEEeCceeehhhhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc--ccCC Confidence 9875 57889999999999999999999999999999 999999999999999999999888888876433222 2233 Q ss_pred cCccccccccccCc--cccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceee Q lcl|NC_020078. 160 PGHSGGNVVTLAGA--NDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLN 237 (339) Q Consensus 160 ~g~~~~~~~~~~~~--~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~ 237 (339) .+.+.|..+...++ ..+.++++|+++|+++++.|+|||| |.++++++.+|.+|++|+++++++|++|+.++++... T Consensus 151 ~~~~~G~~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdV--P~~r~vvl~pp~~Ys~Ll~~d~L~nrd~~~s~~g~~~ 228 (401) T protein:vir:70 151 RVKGHGFSINVEVAEGEALVNPQYVMAAVEFALEQQLEQEV--DISDVAILMPWRYFNVLRDADRIVDKTYTISQSGATI 228 (401) T ss_pred CcCCCceEEeccccccccccCHHHHHHHHHHHHHHHHhcCC--CccceEEEcCHHHHHHHHhcCcccchhhccccCCccc Confidence 34444445444444 4557999999999999999999999 5444444458888889999999999999877666788 Q ss_pred cceeEEEeceEEEEeccccccc-cccccccCCCccccccc--cccceEEEEEeccceeEEEEEeeeeEEeeechhhhHHH Q lcl|NC_020078. 238 TKYMFAAFGVPVITSNNAVFGK-TITDHLLSNANNEKAYD--GDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWF 314 (339) Q Consensus 238 ~G~v~~i~G~~V~~Snnlp~~~-~~~~~~l~~~~~~~~y~--~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~ 314 (339) +|.|.+++||+||+|||+|.+. ++++|.+++++++++|. +++++.++++|||+|++++|+++++.|.||++++|+|+ T Consensus 229 ~G~v~~vaGv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~~lt~~~~~d~r~~~~~ 308 (401) T protein:vir:70 229 QGFTLSSYNCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSIDVTGDIFYEKKEKTYY 308 (401) T ss_pred cceEEEEeceEEEeeccccccccccccccccccCCCccCCCCccccceeEEEEehhheEEEEeeccccchhhhhhhhHHH Confidence 9999999999999999999743 68999999999999988 89999999999999999999999999999999999999 Q ss_pred HHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 315 IDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 315 i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) |+++++|||+++||||++++++.-- T Consensus 309 id~~~a~g~g~~RPeaa~vv~~k~~ 333 (401) T protein:vir:70 309 IDTFMAEGAIPDRWEAVSVVTTKRN 333 (401) T ss_pred HHHHHHhCCcccchhheEEEeecCc Confidence 9999999999999999998743221 No 16 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=100.00 E-value=2.1e-95 Score=539.66 Aligned_cols=327 Identities=27% Similarity=0.422 Sum_probs=286.7 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccccceeeeccCCCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISKAKLQKIAPGTT 80 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~~t~~~~~~g~~ 80 (339) ||- | -+++||||+ +++++++||||+|+|||+++|+++++|++++++|+|++|||+||+++|++++++|+||++ T Consensus 1 Ms~-----~-n~~t~p~~~-gsg~~~aL~Le~f~GeV~taF~~~si~~~~~~vRtI~~gkS~qf~~lG~s~a~y~~pG~~ 73 (400) T protein:vir:10 1 MST-----P-NNLTNVAVS-ASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYLGETELQVLAPGQS 73 (400) T ss_pred CCC-----C-ccccccccc-cccchhhhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEeeeeEEeeecCCCC Confidence 542 2 478999888 666999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcc-hHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccc Q lcl|NC_020078. 81 PPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFD-YQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQM 159 (339) Q Consensus 81 i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d-~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~ 159 (339) |+++ .+.++|++|+||+++|++++|||||++|+||| +|+||++|+|++||+++||+|++++.++++.. .....++. T Consensus 74 ldg~-~~~~dk~~ItIDtLL~a~~~V~dlDd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~--t~~~~~~~ 150 (400) T protein:vir:10 74 PAAT-STQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIAN--TQAKRTNP 150 (400) T ss_pred cCCC-CcccCcEEEEeCceeeecchhhhHHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccc--cccccccC Confidence 9876 47889999999999999999999999999999 99999999999999999999999888877532 22222233 Q ss_pred cCcc--ccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceee Q lcl|NC_020078. 160 PGHS--GGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLN 237 (339) Q Consensus 160 ~g~~--~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~ 237 (339) .|.. ....+........+++++|+++|+++++.|+||+| |.++++++++|.+|++|+++++|+|++|+.++++... T Consensus 151 ~g~~~g~s~~v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdV--P~~d~vvl~pp~~Ys~Ll~~dkLvnrdf~~s~~g~~~ 228 (400) T protein:vir:10 151 RVKGHGFSVNVEVNEGEALVNPQYVMAAVEFALEQQLEQEV--DISDVAILMPWRYFNVLRDADRIVDKSYTISQSGATI 228 (400) T ss_pred CccccccceeecccccccccCHHHHHHHHHHHHHHHHhcCC--CccceEEEcCHHHHHHHHhCCcccchhccccCCCccc Confidence 3322 22333334445557999999999999999999999 4566777788899999999999999999877666688 Q ss_pred cceeEEEeceEEEEecccccc-ccccccccCCCccccccc--cccceEEEEEeccceeEEEEEeeeeEEeeechhhhHHH Q lcl|NC_020078. 238 TKYMFAAFGVPVITSNNAVFG-KTITDHLLSNANNEKAYD--GDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWF 314 (339) Q Consensus 238 ~G~v~~i~G~~V~~Snnlp~~-~~~~~~~l~~~~~~~~y~--~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~ 314 (339) .|+|.+++|++|++|||+|.. ..+++|.++.++++++|+ ++++++++++|||+|++++|++++++|.||++++|+|+ T Consensus 229 ~g~v~~v~Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~tvk~~~lt~~~~~d~r~~~~~ 308 (400) T protein:vir:10 229 QGFVLSSYNCPVIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALLVGRSIDVIGDIFYEKKEKTYY 308 (400) T ss_pred cceEEEEeceEEEeeCcCCcccCcccccccccCCCCccCCccccccceeEEEEehhheEEEEeeccccccccchhhHHHH Confidence 999999999999999999964 457899999999999998 89999999999999999999999999999999999999 Q ss_pred HHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 315 IDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 315 i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) |+++++|||+++||||++++++.-- T Consensus 309 id~~~a~G~g~~RPeaa~vv~~~~~ 333 (400) T protein:vir:10 309 IDTFMSEGAIPDRWEAVSVVTTKRQ 333 (400) T ss_pred HHHHHHhCCcccchhheEEEEecCC Confidence 9999999999999999999976322 No 17 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=100.00 E-value=8.7e-86 Score=486.92 Aligned_cols=284 Identities=13% Similarity=0.113 Sum_probs=252.3 Q ss_pred ccccccccceEEEeccccceeeeccCCCCCCCC-CCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHH Q lcl|NC_020078. 52 PVRSVRGTSTISNRGISKAKLQKIAPGTTPPPS-TEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEI 130 (339) Q Consensus 52 ~~r~i~~G~tv~i~~iG~~t~~~~~~g~~i~~~-~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aL 130 (339) ++|+|++|||+||+++|++++++|+||++|+++ ++++++|++|+||++|||+|+|||+|+||+++|+|+++++|+||+| T Consensus 1 ~vr~i~~g~s~~~~~iG~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~~~VdDiD~~qa~~Dlr~e~s~~~G~aL 80 (324) T protein:vir:99 1 MTRTITSGKSAQFPVMGRTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDVLIYDIEDAMNHYDVRSEYSTQMGEAL 80 (324) T ss_pred CeeeeecCceEEEeeeeeeEeccccCCCCcCCCcCCcCcccEEEEecchhhhhhhhhhHHHHhcCccchhHHHHHHHHHH Confidence 889999999999999999999999999999763 5688999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhhcccccccccccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEEC Q lcl|NC_020078. 131 ANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLP 210 (339) Q Consensus 131 A~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~ 210 (339) |+.+||+|++++++++....+....+....|+.............+.+++++|++|+++.++|+|+|| |.+|||+||+ T Consensus 81 A~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~Lde~~V--P~~gR~~vv~ 158 (324) T protein:vir:99 81 AMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAFAKKYI--PAGDRTFYTD 158 (324) T ss_pred HHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHHHHHhhcCC--CCCCCEEEeC Confidence 99999999999999888877776666665555444444444555667889999999999999999999 7789999999 Q ss_pred HHHHHHHhcccchhhhcccccccceeecceeEEEeceEEEEeccccccc--------cccccccCCCcccc---cccccc Q lcl|NC_020078. 211 PAAFTALMQAEHITNGEYVTSAGETLNTKYMFAAFGVPVITSNNAVFGK--------TITDHLLSNANNEK---AYDGDF 279 (339) Q Consensus 211 P~~~~~Ll~~~~~~n~d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~--------~~~~~~l~~~~~~~---~y~~~~ 279 (339) |++|++|++++++++.+|.+.+ .+++|.|++++||+||+|||+|... ++++|.++.+++.+ +|++++ T Consensus 159 P~~y~~Ll~~~~~~~~~~~~~~--~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ky~~d~ 236 (324) T protein:vir:99 159 PDTYSAILAALMPNAANYAALI--DPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTGKMTVGA 236 (324) T ss_pred hHHHHHHhhccccccccccccc--ceecceEEEEeceEEEecCCcccccccccccccccccccccccccccccccccccc Confidence 9999999999999998887654 4889999999999999999999642 23445565555543 699999 Q ss_pred ceEEEEEeccceeEEEEEeeeeEEeeechhhhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 280 KDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 280 ~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) +.++||+||++|+++++++++++|.+|++++|+|+|+|+|+|||+++||||+++|++++- T Consensus 237 ~~~~gl~~~~~a~~tv~~~~~~~e~~~~~~~~~d~i~~~~a~G~~~lRPe~a~~v~l~~~ 296 (324) T protein:vir:99 237 DNVVGLFVHRSAVATLKLKDMALERARRPEYQADQIIAKYAMGHGGLRPEAVGAIIFEDG 296 (324) T ss_pred CceeEEEEehhheEEEeeecceecceechhhHHHhhhhhhhhcCcccccceEEEEEEccC Confidence 999999999999999999999999999999999999999999999999999998887665 No 18 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=100.00 E-value=6e-70 Score=400.10 Aligned_cols=312 Identities=15% Similarity=0.147 Sum_probs=254.0 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhcccccccc--ccccceEEEeccccceeeeccCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRS--VRGTSTISNRGISKAKLQKIAPG 78 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~--i~~G~tv~i~~iG~~t~~~~~~g 78 (339) ||.-. ++|+|.. .+..+.++.-|+|+++|++.|+++++++++++.++ +++|+|||||++|++++++|++| T Consensus 1 ~~~~~------~~~~~~~--~t~~v~~fipei~s~~i~~~l~~~~v~~~~~~d~~~~~~~Gdtv~ip~~g~~~~~d~~~~ 72 (341) T protein:vir:94 1 MALGN------TITGPSI--NTQRGQQFIPEQWLSEVQMFRKAKMLDTSVVKTWGAQVKKGDTFHVPRISELGVEDKATD 72 (341) T ss_pred Ccchh------hhccccc--cchhHHHHHHHHHHHHHHHHHHhhcchhhccccccccccCCceEEEeccCcceeeeecCC Confidence 54321 4555543 45556654449999999999999999999998764 56799999999999999999999 Q ss_pred CCCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccc Q lcl|NC_020078. 79 TTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQ 158 (339) Q Consensus 79 ~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~ 158 (339) .+++.+ ++++++.+|+||+.+|+++.|+|+|+.|+++|+|++++++++++||+++|+.|+..+..++....+ T Consensus 73 ~~i~~~-~~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~------- 144 (341) T protein:vir:94 73 VPVGVQ-PVNDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQ------- 144 (341) T ss_pred Cccccc-cccCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccC------- Confidence 999875 466789999999999999999999999999999999999999999999999998877654422110 Q ss_pred ccCccccccccccCccccccHH-HHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceee Q lcl|NC_020078. 159 MPGHSGGNVVTLAGANDYKDPA-KLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLN 237 (339) Q Consensus 159 ~~g~~~~~~~~~~~~~~~~~~~-~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~ 237 (339) .+ ..........+++ ..++.|+++.++|+|++| |.++||+||+|++|+.|+++++|+++++.++. .++ T Consensus 145 ~~-------~~~~~~~~t~~~~~~~~~~i~~a~~~Lde~~V--P~~gR~lvv~P~~~~~Ll~~~~~~~~~~~g~~--~l~ 213 (341) T protein:vir:94 145 NV-------FSSSNGAITGNGQAFSFAVFLAARRLLLEADV--PEEKIVLLISPGQESALFTIPQFISKDFINNA--PIA 213 (341) T ss_pred cc-------ccCccccccCchhhhhHHHHHHHHHHHhhcCC--CccCCEEEeCHHHHHHHhhchhhhhhhccccc--hhh Confidence 00 1111111112223 357889999999999999 77899999999999999999999999998764 488 Q ss_pred cceeEEEeceEEEEeccccccccccccc-------------cCCCccccccccccceEEEEEeccceeEEEEEee----- Q lcl|NC_020078. 238 TKYMFAAFGVPVITSNNAVFGKTITDHL-------------LSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIP----- 299 (339) Q Consensus 238 ~G~v~~i~G~~V~~Snnlp~~~~~~~~~-------------l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~----- 299 (339) +|.|++++||+|++||++|.+....... ..+.....+|++++..+.||++|++|++.++.++ T Consensus 214 ~G~ig~i~G~~V~~Sn~lp~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~ 293 (341) T protein:vir:94 214 QGQIGSLMGVRVIRTSLIGNNSATGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAA 293 (341) T ss_pred eeeeeeEeceEEEEeccccccccccccccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhh Confidence 9999999999999999999754332111 1122334578889999999999999999998666 Q ss_pred ------eeEEeeechhhhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 300 ------VTSKIFFDDLSKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 300 ------~~~e~~~~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) +.+|..|++++|+|+|+|+++||||++||||+|+|+++++ T Consensus 294 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~G~~~lrp~~~v~~~~~~~ 339 (341) T protein:vir:94 294 AVVSKAPRVTQSFENREQVWLMVGRQAYGARLYRPLHAVNIHTTGD 339 (341) T ss_pred ccccccccccccchhhhhhhhhhhhhhhcccccCcceeEEEecCcC Confidence 5667778999999999999999999999999999999999 No 19 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=100.00 E-value=1.1e-63 Score=365.80 Aligned_cols=324 Identities=13% Similarity=0.073 Sum_probs=256.0 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccc--cccceEEEeccccceeeeccCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSV--RGTSTISNRGISKAKLQKIAPG 78 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i--~~G~tv~i~~iG~~t~~~~~~g 78 (339) |.-..|. -..-+++....++.++..|+|+++|++.|++.+++.++++.++. +.|+|||||++|++++.+|++| T Consensus 1 ~~~~~~~-----~~~~~~~~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~~~~~~~~GdTV~ip~~g~~~a~d~~~g 75 (381) T protein:vir:80 1 MATIQGT-----GGYKGSAVDLSNVQVFIPEVWSSEVRMFRDQKFAALEATKKIPFEGKKGDLIHIPNISRAAVYDKQPQ 75 (381) T ss_pred Cceeccc-----ccccCcccchhhHHhhhhHHHHHHHHHHHHHhhhhhhccccccceeecCceEEeeccCcceeeeecCC Confidence 5544443 33444666667776655599999999999999999999887644 6799999999999999999999 Q ss_pred CCCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccc Q lcl|NC_020078. 79 TTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQ 158 (339) Q Consensus 79 ~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~ 158 (339) ++++.++ +..++++++||+.+|+++.|+|+|++|+++|+|++++++++++||+++|+.|+..+.+......+. . T Consensus 76 ~~i~~~~-~~~~~~~itID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~-----~ 149 (381) T protein:vir:80 76 TPVNLQA-RTDSEFTFTVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQR-----I 149 (381) T ss_pred Ccccccc-cCCceEEEEEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc-----c Confidence 9998764 567789999999999999999999999999999999999999999999999988876544322111 1 Q ss_pred ccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceeec Q lcl|NC_020078. 159 MPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLNT 238 (339) Q Consensus 159 ~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~~ 238 (339) .++................+....++.|++|.++|+|++| |.++||+||+|++|+.||++++|++++|.++. .+++ T Consensus 150 ~t~~~~i~~~~~~~~~t~~~~~~t~~~i~~a~~~Lde~~V--P~egR~lvv~P~~~~~Ll~~~~~~~ad~~~~~--~l~~ 225 (381) T protein:vir:80 150 YSYDTTLGDGTVNAHLTGTPAPLTYAALLLAKQKLDEADV--PQEGRIVMVSPAQYIDLLSINQFISVDFSQVK--PVTS 225 (381) T ss_pred ccccccccccccccccccchhhHHHHHHHHHHHHHhhcCC--CcCCcEEEeCHHHHHHHhhchhhhhhhhccch--hhhc Confidence 1111111111111122234456788999999999999999 77899999999999999999999999987654 5899 Q ss_pred ceeEEEeceEEEEeccccccccccccccC-------CCcccccccccc-------------------------------- Q lcl|NC_020078. 239 KYMFAAFGVPVITSNNAVFGKTITDHLLS-------NANNEKAYDGDF-------------------------------- 279 (339) Q Consensus 239 G~v~~i~G~~V~~Snnlp~~~~~~~~~l~-------~~~~~~~y~~~~-------------------------------- 279 (339) |.|++++||+|++||++|......++... ...+++.|.+++ T Consensus 226 G~Ig~i~G~~Vv~Sn~lp~~~~t~~~~~agap~~~~~~~~~~~~~g~~s~~a~av~~~k~yd~~~~~~~~~~~~~~g~~~ 305 (381) T protein:vir:80 226 GVVGTILGMEVIVTTQIGINSLTGYVNGQGAPTQPTPGVLGSPYLPDQAGTANVVNTGSASDLAVSLSYFGLPVFSGAGA 305 (381) T ss_pred eeeeEEcceEEEeecccccccccceeeeccccccccccccccccccccccceeeeeeeeeeceeeeeeeccceeeeccee Confidence 99999999999999999964332211111 111233333322 Q ss_pred ----------------ceEEEEEeccceeEEEEEeeeeEEeeechhhhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 280 ----------------KDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 280 ----------------~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) ++..|+++|+++.+.+.++.++++..+...+++|+|.|+++||++++||.++|+|++++- T Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 381 (381) T protein:vir:80 306 TAADGGQTLGSFGGANRWATAVVCHPDWLAVGVQQNVKSESSRETMYLADAFVTSCVYGAKVFRPDHCVLLHTSGI 381 (381) T ss_pred eecCCCceeeeehhhhhhhhhcccccccccccceeEeecccchhheeehhhhhhhhhhccccccchhhhhhhhcCC Confidence 456688899999999999999998899999999999999999999999999999999999 No 20 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=100.00 E-value=1.3e-61 Score=354.41 Aligned_cols=267 Identities=15% Similarity=0.121 Sum_probs=227.8 Q ss_pred chhHHHH-HHHHHHHHHHHHHHhhhccccccc---cccccceEEEeccccceeeeccC-CCCCCCCCCCCccceEEEEee Q lcl|NC_020078. 24 DPLADVT-EQFTGTVEGTIKRRSIMAGFVPVR---SVRGTSTISNRGISKAKLQKIAP-GTTPPPSTEPHTSKIFLKIDT 98 (339) Q Consensus 24 ~~~a~~i-e~~~g~v~~~f~~~sv~~~~v~~r---~i~~G~tv~i~~iG~~t~~~~~~-g~~i~~~~~~~~~~~~l~ID~ 98 (339) =....|+ |+|+++|++.|++.+++.++++++ +++.|+||+||++|++++.+|++ +.++.. +.+..++.+++||+ T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~-~~~~~~~~~~tid~ 79 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSA-DAISDTGVDLLIDQ 79 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccCc-cccccceEEEEEee Confidence 2223566 999999999999999999999875 67889999999999999999986 444544 45777889999999 Q ss_pred hhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCccccccccccCcccccc Q lcl|NC_020078. 99 VIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAGANDYKD 178 (339) Q Consensus 99 ~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 178 (339) .+|+++.|+|+|+.|+++|+++ ++++++++||+.+|+.++..+..++.. . ..+...+ T Consensus 80 ~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~~-------------------~---~~~~~~~ 136 (273) T protein:vir:10 80 EKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTA-------------------L---TGSAPTD 136 (273) T ss_pred eeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhccccc-------------------c---ccccccc Confidence 9999999999999999999865 999999999999999999877653310 0 1122345 Q ss_pred HHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccc-hhhhcccccccceeecceeEEEeceEEEEeccccc Q lcl|NC_020078. 179 PAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEH-ITNGEYVTSAGETLNTKYMFAAFGVPVITSNNAVF 257 (339) Q Consensus 179 ~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~-~~n~d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~ 257 (339) +.+++++|++++++|++++| |.++||+||+|++|+.|++++. +.+.++.++. ..+++|.|++++||+|++||++|. T Consensus 137 ~~~~~~~i~~a~~~ld~~~v--P~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~-~~l~~G~ig~i~G~~v~~s~~lp~ 213 (273) T protein:vir:10 137 ADDAFDLIAKALKELTKANV--PNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDA-AGLRAGTIGNLLGARIVESNNLRD 213 (273) T ss_pred hhHHHHHHHHHHHHhhhcCC--CcCCCEEEECHHHHHHHhcchhhhhhhhccccc-cceeeeeeeEEeceEEEEeccccc Confidence 67889999999999999999 7789999999999999999876 5566766554 458999999999999999999984 Q ss_pred cccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhHHHHHHHHHhCCccccccceEEEEec Q lcl|NC_020078. 258 GKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWFIDSWLAFGVTINRTEYAGVIKLP 337 (339) Q Consensus 258 ~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~ 337 (339) +.. ..++++|++|++++++++ ++|..|++++|+|+|+|+++||++++|||++++|+.+ T Consensus 214 ~~~---------------------~~~~~~~~~A~~~a~q~~-~~e~~r~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~ 271 (273) T protein:vir:10 214 TDD---------------------EQFVAFHPSAAAYVSQID-TVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKT 271 (273) T ss_pred CCc---------------------cEEEEEeccceeeeeeee-hhhcccCCCcceeeeeeeeeeeeeEeccceEEEEecc Confidence 311 124789999999999876 9999999999999999999999999999999999999 Q ss_pred CC Q lcl|NC_020078. 338 AA 339 (339) Q Consensus 338 ~a 339 (339) ++ T Consensus 272 g~ 273 (273) T protein:vir:10 272 GS 273 (273) T ss_pred CC Confidence 99 No 21 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=100.00 E-value=1.3e-61 Score=354.41 Aligned_cols=267 Identities=15% Similarity=0.121 Sum_probs=227.8 Q ss_pred chhHHHH-HHHHHHHHHHHHHHhhhccccccc---cccccceEEEeccccceeeeccC-CCCCCCCCCCCccceEEEEee Q lcl|NC_020078. 24 DPLADVT-EQFTGTVEGTIKRRSIMAGFVPVR---SVRGTSTISNRGISKAKLQKIAP-GTTPPPSTEPHTSKIFLKIDT 98 (339) Q Consensus 24 ~~~a~~i-e~~~g~v~~~f~~~sv~~~~v~~r---~i~~G~tv~i~~iG~~t~~~~~~-g~~i~~~~~~~~~~~~l~ID~ 98 (339) =....|+ |+|+++|++.|++.+++.++++++ +++.|+||+||++|++++.+|++ +.++.. +.+..++.+++||+ T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~-~~~~~~~~~~tid~ 79 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSA-DAISDTGVDLLIDQ 79 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccCc-cccccceEEEEEee Confidence 2223566 999999999999999999999875 67889999999999999999986 444544 45777889999999 Q ss_pred hhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCccccccccccCcccccc Q lcl|NC_020078. 99 VIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAGANDYKD 178 (339) Q Consensus 99 ~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 178 (339) .+|+++.|+|+|+.|+++|+++ ++++++++||+.+|+.++..+..++.. . ..+...+ T Consensus 80 ~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~~-------------------~---~~~~~~~ 136 (273) T protein:vir:10 80 EKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTA-------------------L---TGSAPTD 136 (273) T ss_pred eeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhccccc-------------------c---ccccccc Confidence 9999999999999999999865 999999999999999999877653310 0 1122345 Q ss_pred HHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccc-hhhhcccccccceeecceeEEEeceEEEEeccccc Q lcl|NC_020078. 179 PAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEH-ITNGEYVTSAGETLNTKYMFAAFGVPVITSNNAVF 257 (339) Q Consensus 179 ~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~-~~n~d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~ 257 (339) +.+++++|++++++|++++| |.++||+||+|++|+.|++++. +.+.++.++. ..+++|.|++++||+|++||++|. T Consensus 137 ~~~~~~~i~~a~~~ld~~~v--P~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~-~~l~~G~ig~i~G~~v~~s~~lp~ 213 (273) T protein:vir:10 137 ADDAFDLIAKALKELTKANV--PNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDA-AGLRAGTIGNLLGARIVESNNLRD 213 (273) T ss_pred hhHHHHHHHHHHHHhhhcCC--CcCCCEEEECHHHHHHHhcchhhhhhhhccccc-cceeeeeeeEEeceEEEEeccccc Confidence 67889999999999999999 7789999999999999999876 5566766554 458999999999999999999984 Q ss_pred cccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhHHHHHHHHHhCCccccccceEEEEec Q lcl|NC_020078. 258 GKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWFIDSWLAFGVTINRTEYAGVIKLP 337 (339) Q Consensus 258 ~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~ 337 (339) +.. ..++++|++|++++++++ ++|..|++++|+|+|+|+++||++++|||++++|+.+ T Consensus 214 ~~~---------------------~~~~~~~~~A~~~a~q~~-~~e~~r~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~ 271 (273) T protein:vir:10 214 TDD---------------------EQFVAFHPSAAAYVSQID-TVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKT 271 (273) T ss_pred CCc---------------------cEEEEEeccceeeeeeee-hhhcccCCCcceeeeeeeeeeeeeEeccceEEEEecc Confidence 311 124789999999999876 9999999999999999999999999999999999999 Q ss_pred CC Q lcl|NC_020078. 338 AA 339 (339) Q Consensus 338 ~a 339 (339) ++ T Consensus 272 g~ 273 (273) T protein:vir:10 272 GS 273 (273) T ss_pred CC Confidence 99 No 22 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=100.00 E-value=2e-62 Score=358.82 Aligned_cols=301 Identities=12% Similarity=0.056 Sum_probs=232.2 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHH-HHHHHHHHHHHHHHhhhccccccccccccceEEEeccccceeeeccCCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVT-EQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISKAKLQKIAPGT 79 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~i-e~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~~t~~~~~~g~ 79 (339) || -+|.+....++|. |+||.+++..+++..++.++.++...+.|+|||||+||++++++|++++ T Consensus 1 ~~---------------~~n~ts~~qafi~~EiWsa~il~~l~~~Lv~~~~~~~~d~g~GDtV~InsIg~~tV~dY~~~~ 65 (322) T protein:vir:31 1 MS---------------TGNNTSNTQALIVSEIWADEIEDILHEKLLDVNIARVVDFPDGDKLTIPSVGTPVVRSRPEQG 65 (322) T ss_pred CC---------------CCCCcccceEEeehhhhHHHHHHHhhhhhhhhhhhcccccCCCCeEEeccccccccccccCCC Confidence 32 3445566666773 9999999999999999999988777778999999999999999999999 Q ss_pred CCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccc Q lcl|NC_020078. 80 TPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQM 159 (339) Q Consensus 80 ~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~ 159 (339) +++.+ ++++++.+|+|||.|||+|.||| |++|+..|+++++++++||+||+.+|+++...|..+|...+... .+..+ T Consensus 66 ~i~~d-~ltt~~~~l~IDq~KYfaf~VdD-D~~Qa~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~-~p~vi 142 (322) T protein:vir:31 66 DFTFD-NLDTGEISIILRDEVYAGNAISK-KLRQDSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQN-DPNVI 142 (322) T ss_pred Ccccc-cCCCceEEEEEehhhhhccccch-hHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccC-Cccee Confidence 99875 47888999999999999999999 99999999999999999999999999999887876652211110 00011 Q ss_pred cCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHH---------HHhcccchhhhcccc Q lcl|NC_020078. 160 PGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFT---------ALMQAEHITNGEYVT 230 (339) Q Consensus 160 ~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~---------~Ll~~~~~~n~d~~~ 230 (339) .|.. ..+ .+..+++...|+.|++++.+|+|+|| |..|||+||+|+++. +|++++||+..+-+| T Consensus 143 n~~~--~~i----v~~gt~~~~ay~~lv~l~~kLdkanV--P~~gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG 214 (322) T protein:vir:31 143 NGVP--HRF----VGTGTDQTMDVTDFSRVNYVMTQSKM--PMGGMIGIIDPSVAHHLETITNISNISNNPRWEGIVESG 214 (322) T ss_pred cCCc--cce----eccCCCchhhHHHHHHHHHHhccccC--CCCCeEEEeCchhhhhhhhhhhhhhhhcccccccccccc Confidence 1111 111 12234667788999999999999999 888999999999965 568899998654444 Q ss_pred cccceeecceeEEEeceEEEEeccccccc--cccccccCCCccccccccccceEEEEEe-----ccceeEEEEEeeeeEE Q lcl|NC_020078. 231 SAGETLNTKYMFAAFGVPVITSNNAVFGK--TITDHLLSNANNEKAYDGDFKDIVAQMF-----SPKALLAGSTIPVTSK 303 (339) Q Consensus 231 ~~~~~l~~G~v~~i~G~~V~~Snnlp~~~--~~~~~~l~~~~~~~~y~~~~~~~~~~~~-----h~~A~~~~~~~~~~~e 303 (339) ... .++ .|++++||+||+||++|... ..+++.....+ . .++-++++ +...++..++++ ++| T Consensus 215 ~a~-g~~--~Vg~~~GF~V~~SN~l~~~~~~i~aG~d~~~t~-----a---g~~n~f~~~~~~~~~~~~~~~~~l~-~~e 282 (322) T protein:vir:31 215 IAP-DMQ--FVRSVYGIDLFVSNLLADANETINAGGDARSTT-----A---GKCNMFMNVSDMGLLPFVVAWKEMP-TTK 282 (322) T ss_pred chh-hHH--HHHHHhceeeeeeccccccccccccCccccccc-----c---eeecccccccchhhhhhhhHhhhhh-hhh Confidence 322 122 49999999999999997421 11111111111 1 12222333 666777888876 889 Q ss_pred eeechhhhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 304 IFFDDLSKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 304 ~~~~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .+|++++|+|.++|+++||++++|||.+++|..+++ T Consensus 283 ~~r~~~~~~d~~~~~~~~g~g~~r~e~l~~~~a~~~ 318 (322) T protein:vir:31 283 SFIDDYNDDLNTATTARWGNGLVRDENLVCVLANAD 318 (322) T ss_pred cccCccccccceeeeeeecceeecccceEEEEeccc Confidence 999999999999999999999999999999999999 No 23 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=100.00 E-value=2.8e-59 Score=341.63 Aligned_cols=267 Identities=15% Similarity=0.115 Sum_probs=226.4 Q ss_pred chhHHHH-HHHHHHHHHHHHHHhhhccccccc---cccccceEEEeccccceeeeccC-CCCCCCCCCCCccceEEEEee Q lcl|NC_020078. 24 DPLADVT-EQFTGTVEGTIKRRSIMAGFVPVR---SVRGTSTISNRGISKAKLQKIAP-GTTPPPSTEPHTSKIFLKIDT 98 (339) Q Consensus 24 ~~~a~~i-e~~~g~v~~~f~~~sv~~~~v~~r---~i~~G~tv~i~~iG~~t~~~~~~-g~~i~~~~~~~~~~~~l~ID~ 98 (339) =....|+ |+|+++|++.|++.+++.++++++ ..+.|+||+||++|.+++.+|.+ |.++.. +.++.++.+++||+ T Consensus 1 MA~~~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~~~~~d~~~~~~~~~~-~~~~~~~~~~tid~ 79 (273) T protein:vir:79 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSA-DAISDTGVDLLIDQ 79 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecCcccccccccCCCccCc-cccccceEEEEEee Confidence 1223466 999999999999999999998776 34569999999999999999874 555654 45677889999999 Q ss_pred hhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCccccccccccCcccccc Q lcl|NC_020078. 99 VIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAGANDYKD 178 (339) Q Consensus 99 ~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 178 (339) .+|+++.|+|+|+.|+++|++ +++++++++||+.+|+.++..+..++... ..+...+ T Consensus 80 ~~~~~~~i~d~d~~~~~~~~~-~~~~~~~~ala~~vD~~i~~~~~~a~~~~----------------------~~~~~~~ 136 (273) T protein:vir:79 80 EKSIDFLVDDIDRVQVAGSLE-AYTRAGATALATDTDKFIADMLVDNGTAL----------------------TGSAPSD 136 (273) T ss_pred ecccceeeccHHHHhhcccHH-HHHHHHHHHHHHHHHHHHHHHHhhccccc----------------------ccccccc Confidence 999999999999999999997 59999999999999999987775432100 1112345 Q ss_pred HHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhccc-chhhhcccccccceeecceeEEEeceEEEEeccccc Q lcl|NC_020078. 179 PAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAE-HITNGEYVTSAGETLNTKYMFAAFGVPVITSNNAVF 257 (339) Q Consensus 179 ~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~-~~~n~d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~ 257 (339) +.+++++|.++.++|++++| |.++||+||+|++|+.||+++ +|.++++.++. ..+++|.||+++||+|++||++|. T Consensus 137 ~~~~~~~i~~a~~~ld~~~v--P~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~-~~l~~G~ig~~~G~~i~~s~~lp~ 213 (273) T protein:vir:79 137 ADDAFDLIASALKELTKANV--PNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDA-AGLRAGTIGNLLGARIVESNNLRD 213 (273) T ss_pred hhhHHHHHHHHHHHhhhccC--CccCcEEEECHHHHHHHhhchhhhhhhhhcccc-cceeeeEeeEEeceEEEecccccc Confidence 66788999999999999999 778999999999999999987 47788876654 358999999999999999999985 Q ss_pred cccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhHHHHHHHHHhCCccccccceEEEEec Q lcl|NC_020078. 258 GKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWFIDSWLAFGVTINRTEYAGVIKLP 337 (339) Q Consensus 258 ~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~ 337 (339) +.. ..++.+|++|++++++++ ++|..|++++|+++|+|+++||++++|||++++|+.+ T Consensus 214 ~~~---------------------~~~~a~~~~A~~~a~~~~-~~e~~r~~~~~~~~v~~~~~yg~~v~~p~~vv~~~~~ 271 (273) T protein:vir:79 214 TDD---------------------EQFVAFHPSAAAYVSQID-TVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKT 271 (273) T ss_pred cCc---------------------eEEEEEeccceeeeeehh-hhhcccCcccceeeeeeeeeeeeEEecCceEEEEecc Confidence 311 124678999999999876 8999999999999999999999999999999999999 Q ss_pred CC Q lcl|NC_020078. 338 AA 339 (339) Q Consensus 338 ~a 339 (339) ++ T Consensus 272 g~ 273 (273) T protein:vir:79 272 GS 273 (273) T ss_pred CC Confidence 99 No 24 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=100.00 E-value=5.4e-55 Score=318.08 Aligned_cols=309 Identities=16% Similarity=0.111 Sum_probs=221.2 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHH-Hhhhcccccccccc-ccce------EEEecccccee Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKR-RSIMAGFVPVRSVR-GTST------ISNRGISKAKL 72 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~-~sv~~~~v~~r~i~-~G~t------v~i~~iG~~t~ 72 (339) |.+ |..=.+--.=++++.+.|+|+|+.+|+..||. .++|++.|+.++-. ++++ +.++.+++..+ T Consensus 1 ~~~--------~~~~~~~~~Ms~~i~~~fv~qy~~~v~~~~qq~~s~L~~tV~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 72 (322) T protein:vir:10 1 MKL--------NAIMSMLPLIAGDIDQAFVQTYETTLRILSQQKSAKLKQYCQHKNESSESHNWETLASMDPDAVKRKRS 72 (322) T ss_pred Ccc--------cceeeeeeeeechhhhHHHHHHHHHHHHHHHHhhhhhhcccccccccccccceeecccccccccccccc Confidence 211 11111111123467889999999999999995 99999999988544 4444 44455666666 Q ss_pred eeccCCCCCCCC-CCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Q lcl|NC_020078. 73 QKIAPGTTPPPS-TEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDS 151 (339) Q Consensus 73 ~~~~~g~~i~~~-~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~ 151 (339) ..+.+.+.++-+ ...++..+ .++++.+|+++.|||+|++|+++|++++|++++++||+|.+|+.|+..+...|.. T Consensus 73 ~~~~~d~~~dtp~~~~~~~~r-~~~~~d~~~~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~~--- 148 (322) T protein:vir:10 73 RQQSADGTYPTPVNNKPFAKR-RTNVDTYDTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPASI--- 148 (322) T ss_pred cccccCcccCCCccccccceE-EEeecccccceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhccccc--- Confidence 666655543322 23344443 4555667999999999999999999999999999999999999987666554421 Q ss_pred cccccccccCccccccccccCc--cccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhccc Q lcl|NC_020078. 152 PYGTAAQMPGHSGGNVVTLAGA--NDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYV 229 (339) Q Consensus 152 ~~~~~~~~~g~~~~~~~~~~~~--~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~ 229 (339) +..+..+...... .+.+. .-.+++|++|.++|+|++|| ++.+||+||+|++|+.||++++|+++||. T Consensus 149 ---------~~~gt~v~~~ss~~i~~g~~-g~t~~kl~~a~~~l~~~dvp-~d~~R~~vv~p~~~~~LL~d~~~ts~D~~ 217 (322) T protein:vir:10 149 ---------KGTGQPVEFLATQEIGDGTK-PISFDYVTEITERFLENEIE-PEVSKVIVIGPTQARKLLQITEATSADYT 217 (322) T ss_pred ---------cccccccccCCCcccccCcc-chhHHHHHHHHHHHHhcCCC-CCCCeEEEeCHHHHHHHhcchhhhhhhcc Confidence 1111111111000 00000 11246788999999999993 34579999999999999999999999999 Q ss_pred ccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeee-ch Q lcl|NC_020078. 230 TSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFF-DD 308 (339) Q Consensus 230 ~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~-~~ 308 (339) +... ..++|.|++++||.|++||++|... .+..+.+... ..+ -....|++||++|+++++++++++|+++ ++ T Consensus 218 ~~~~-l~~~G~ig~~lGf~~i~s~~lp~~~-~t~~~~~~~~----~~~-~~~~~~~a~~k~Av~~a~~~dv~~~i~~~~~ 290 (322) T protein:vir:10 218 SAMD-LQSKGIITNWMGYTWIVSTRLDKFD-PTQWGMAAED----GPQ-GDEIWCIAMTDMALGYHSCKDIWTKVAEDPS 290 (322) T ss_pred cchh-hhhcCeeeeeeeEEEEEeccCCccc-cccccccccC----CCC-ccceeEEEEecCceeEEEeeeeeEEeeccCC Confidence 7653 3467999999999999999999542 2222211111 111 1234589999999999999999999876 55 Q ss_pred hhhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 309 LSKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 309 ~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) +.++|+|.++++|||++++|+++++|+..-+ T Consensus 291 ~~~a~~I~~~~~~Ga~ri~~~gVv~i~~~e~ 321 (322) T protein:vir:10 291 ASFAWRIYSAFTADCVRVEDEHIFKLRLKNS 321 (322) T ss_pred cchhhhhhhhhhhCceEeccCcEEEEEEecc Confidence 6779999999999999999999999999888 No 25 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=100.00 E-value=2.8e-53 Score=308.74 Aligned_cols=213 Identities=20% Similarity=0.276 Sum_probs=169.1 Q ss_pred EeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCccccccccccCccc Q lcl|NC_020078. 96 IDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAGAND 175 (339) Q Consensus 96 ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 175 (339) ||++++++|+|||+|++|+|+|+|+++++|+||+||+++|++|++++++||....+... +.+++.. . ..++. T Consensus 1 iD~lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~------~~~g~~~-~-~~a~~ 72 (221) T protein:vir:17 1 MDDLLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTG------QDGGFSV-N-IGAGN 72 (221) T ss_pred CCcchhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccc------cccCcce-e-ccccc Confidence 99999999999999999999999999999999999999999999999999876544322 1112221 1 13456 Q ss_pred cccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhc--ccchhhhcccccccceeecc-eeEEEeceEEEEe Q lcl|NC_020078. 176 YKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQ--AEHITNGEYVTSAGETLNTK-YMFAAFGVPVITS 252 (339) Q Consensus 176 ~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~--~~~~~n~d~~~~~~~~l~~G-~v~~i~G~~V~~S 252 (339) ++++++||++|++|.++|+|+|| |.+|||+||+|++|+.||+ +++++|++++++++. +++| .|++++||+||+| T Consensus 73 t~~~~~l~dai~~a~~~LdekdV--P~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~-~~~g~~i~~v~G~~V~~S 149 (221) T protein:vir:17 73 TNNAQAIVDGFFEAAAVLDERSA--PMDGRVAVLSPRQYYSLISSVDTNILNREIGNTQGD-MNTGKGLYVNAGIRIYKS 149 (221) T ss_pred cCCHHHHHHHHHHHHHHHhhcCC--CCCCCEEEeCcHHHHHHHHhcCcceeeeeccccccc-ccccceeeeecCcEEEEe Confidence 68899999999999999999999 7799999999999999987 588999999987764 7777 5999999999999 Q ss_pred cccccccccc-----ccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhHHHHHHHHHhCCcccc Q lcl|NC_020078. 253 NNAVFGKTIT-----DHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWFIDSWLAFGVTINR 327 (339) Q Consensus 253 nnlp~~~~~~-----~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~i~g~~~~Ga~v~r 327 (339) ||+|.+.+.. ++......+.++|+++|++++|++|||+|+|++|++...+ |++--.+ + ..+.| T Consensus 150 nnlP~~~gt~~~~~ag~~~~~~~~~~~yr~~fs~~~glv~~~~Avgtvkl~~~~~---~~~~~~~-------~--~~~~~ 217 (221) T protein:vir:17 150 NVLASLYGTNLVTDPGDATTSGENNGSYRPAITDRAGLVFHKEAADTVEVLLPPS---RPPLVIS-------M--FSIRR 217 (221) T ss_pred ccCCcccccccccCCccccccccccccccccccceEEEEEcchheeeeeeecCCC---CCceeee-------e--eeccC Confidence 9999754432 3334445567899999999999999999999999987543 2221100 0 01334 Q ss_pred ccce Q lcl|NC_020078. 328 TEYA 331 (339) Q Consensus 328 Pe~~ 331 (339) |+-- T Consensus 218 ~~~~ 221 (221) T protein:vir:17 218 PDRR 221 (221) T ss_pred CCCC Confidence 4433 No 26 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=100.00 E-value=3.9e-46 Score=269.52 Aligned_cols=270 Identities=13% Similarity=0.035 Sum_probs=223.1 Q ss_pred ccCcccchhHHHH-HHHHHHHHHHHHHHhhhccccccc-cc--cccceEEEeccccc-eeeeccCCCCCCCCCCCCccce Q lcl|NC_020078. 18 QRHGAGDPLADVT-EQFTGTVEGTIKRRSIMAGFVPVR-SV--RGTSTISNRGISKA-KLQKIAPGTTPPPSTEPHTSKI 92 (339) Q Consensus 18 ~~~~~~~~~a~~i-e~~~g~v~~~f~~~sv~~~~v~~r-~i--~~G~tv~i~~iG~~-t~~~~~~g~~i~~~~~~~~~~~ 92 (339) ..+..+-...+|+ |+|+..|++.|.+..++.++.... ++ ++|++|+||+++.. .+.+|..|+.|+.. .+++++. T Consensus 1 Ma~~~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~~~~~g~~i~~~-~lt~~~~ 79 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQDVAEGAAIDYS-ALETESV 79 (278) T ss_pred CCCcceehhheecHHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeeccCCcceeecCCCcCccc-cccccee Confidence 3333444444666 999999999999999999987655 44 45999999998764 57889999999875 5788899 Q ss_pred EEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCccccccccccC Q lcl|NC_020078. 93 FLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAG 172 (339) Q Consensus 93 ~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~ 172 (339) +++|++..+ +|.++|+|..|+..|++++++++++++|++..|+.++..+..+... ... T Consensus 80 ~~~i~~~~~-a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~~---------------------~~~ 137 (278) T protein:vir:80 80 KHGIKKAGK-GVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTLE---------------------VKG 137 (278) T ss_pred eEeeehhhc-cccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhccccc---------------------ccc Confidence 999998654 8999999999999999999999999999999999998877543210 011 Q ss_pred ccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhccc--chhhhcccccccceeecceeEEEeceEEE Q lcl|NC_020078. 173 ANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAE--HITNGEYVTSAGETLNTKYMFAAFGVPVI 250 (339) Q Consensus 173 ~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~--~~~n~d~~~~~~~~l~~G~v~~i~G~~V~ 250 (339) +.........++.+.++..+|+++++ |. .|+++|+|++|+.|++++ +|+.....+ ...+++|.|++++||+|+ T Consensus 138 ~~t~~~~~~~~~~~~da~~~l~~~~~--~~-~~~ivv~p~~~~~L~k~~~~~~~~~~~~g--~~~~~~G~ig~~~G~~Vi 212 (278) T protein:vir:80 138 AINIGLIDKIENTFTDAPDAIEDESI--TT-TGVLFLNYKDTAKLREEAAGSWTKASQLG--DDLLVKGAFGELLGWEIV 212 (278) T ss_pred ccccchhhhHHHHHHHHHHhhcccCC--Cc-ccEEEECHHHHHHHHhhhhhhcccccccc--ccceeeccceeecceeEE Confidence 11122345567889999999999999 43 578999999999999875 677654333 335889999999999999 Q ss_pred EeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhHHHHHHHHHhCCccccccc Q lcl|NC_020078. 251 TSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWFIDSWLAFGVTINRTEY 330 (339) Q Consensus 251 ~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~i~g~~~~Ga~v~rPe~ 330 (339) +||++|.. .++++|+.|++++..+++++|.+|++++++|.|++++.||++++||++ T Consensus 213 ~s~~~p~~------------------------t~~l~~~gAi~~~~~~~~~vE~~Rd~~~~~d~i~~~~~yg~~v~~~~~ 268 (278) T protein:vir:80 213 RTKKLADG------------------------NALAVKAGALKTFLKRNLLAESGRDMDHKLTKFNADQHYAVALVDETK 268 (278) T ss_pred EcCCCCcc------------------------eEEEEeccceeeeecCCcccccccchhhccceeeeeeEEEEEEEcCcc Confidence 99999832 257889999999999999999999999999999999999999999999 Q ss_pred eEEEEecCC Q lcl|NC_020078. 331 AGVIKLPAA 339 (339) Q Consensus 331 ~v~i~~~~a 339 (339) +++|...++ T Consensus 269 ~v~it~~a~ 277 (278) T protein:vir:80 269 AVKVVPVAG 277 (278) T ss_pred eEEEeeccC Confidence 999999999 No 27 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=100.00 E-value=3.4e-45 Score=264.36 Aligned_cols=284 Identities=11% Similarity=0.036 Sum_probs=228.4 Q ss_pred ccCcccCC---------------CcccCCccCcccchhHHHH-HHHHHHHHHHHHHHhhhcc-ccccc-cccccceEEEe Q lcl|NC_020078. 4 FDGQTPSY---------------DVTRPNQRHGAGDPLADVT-EQFTGTVEGTIKRRSIMAG-FVPVR-SVRGTSTISNR 65 (339) Q Consensus 4 ~~~~~~~~---------------~~~r~~~~~~~~~~~a~~i-e~~~g~v~~~f~~~sv~~~-~v~~r-~i~~G~tv~i~ 65 (339) .||-|-|. .|.-.|..+.+-.++.+-+ |+|++.|++.|...+.... ++|.+ +..+|++|+|| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~~g~tVkIp 80 (329) T protein:vir:10 1 MDGIFITGVKTMNKEIKNATGKLKLNLQHFANKSVEPGDTLLKNKHVGILEKVTAANSYSAPAVISNDAIFMQGRSFTVI 80 (329) T ss_pred CCceEEechhhhhhhhhcccceeEEehhhhcCCccCCchhHHHHHHHHHHHHHHHhhceeeeeecccceeeccCcEEEEe Confidence 66766542 1455677888999997655 9999999999988776654 45533 56689999999 Q ss_pred ccccceeeeccCCCCCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcch--HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020078. 66 GISKAKLQKIAPGTTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDY--QGEVAREQGQEIANMYDETFFIMAA 143 (339) Q Consensus 66 ~iG~~t~~~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~--~~~~~~~~g~aLA~~~D~~i~~~l~ 143 (339) +++.+.++||++++..... .++.+..+++|||.+||.|.||++|..|++.++ ...+.+.+.+.++..+|.+.+..++ T Consensus 81 ~i~~~gl~DY~R~~g~~~g-~vt~~~~t~tidqdR~~~F~VD~~D~dEtn~~l~a~~i~~~~~~~~v~pEiDay~~skla 159 (329) T protein:vir:10 81 KGDVTELKDYKRNATNEFD-HPQIQETTYFLDQEKYWGRFVDALDRRDTEGNIDINYVVAKQASEVVAPYLDNLRFATLA 159 (329) T ss_pred eecccccccccCCCCcccc-ccccceeEEEeecccceeeecchhhHhhhhhhhhHHHHHHHHHHHHhhhHHHHHHHHHHH Confidence 9999999999999888764 577889999999999999999999999998776 4556778999999999999998886 Q ss_pred hhcccccccccccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccch Q lcl|NC_020078. 144 KAAIASDSPYGTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHI 223 (339) Q Consensus 144 ~aA~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~ 223 (339) ..|.. . .+...+++++|++|+++.++|+|++| | ++||++|+|++|.+|+++++| T Consensus 160 ~~a~~------------------~-----~~~~~t~~nay~~i~~a~~~Lde~~v--p-~~Rvl~VtP~~~~~Lk~~~~f 213 (329) T protein:vir:10 160 RNKAK------------------H-----LTVGSGADAQYDAVLDVSVELDEIGA--G-ASRILFVTPKFYKGIKKFVIE 213 (329) T ss_pred hhccc------------------c-----cccccCHHHHHHHHHHHHHHHHhcCC--C-CCcEEEeCHHHHHHHHhhhhh Confidence 54321 0 11224578899999999999999987 6 489999999999999999999 Q ss_pred hhhcccccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEE Q lcl|NC_020078. 224 TNGEYVTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSK 303 (339) Q Consensus 224 ~n~d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e 303 (339) +... .. ....+.+|.|++++||+|+++++.... .+-.++.|++|+.++.+.+ .+| T Consensus 214 ~~~~-~~-~~~~~~~g~Vg~idG~~Ii~vps~~~k----------------------~in~ii~~~~A~~~~~K~~-~~~ 268 (329) T protein:vir:10 214 LPQG-DN-RQQVLGKGVQGELDGFTIVKVPSKMLQ----------------------GVEAMAVIGEVMASPIQAN-EAK 268 (329) T ss_pred hccc-cc-cccceeeeeeeeecCeEEEEecCCccc----------------------ceeEEEEcCCceeeeeeee-eee Confidence 8653 22 233578999999999999998665321 1224788999999999998 889 Q ss_pred eeec-hhhhHHHHHHHHHhCCccccccceEEEE-ecCC Q lcl|NC_020078. 304 IFFD-DLSKLWFIDSWLAFGVTINRTEYAGVIK-LPAA 339 (339) Q Consensus 304 ~~~~-~~~~~d~i~g~~~~Ga~v~rPe~~v~i~-~~~a 339 (339) ++++ +++|+|++++++.||++|+||++.+++. ...| T Consensus 269 ~~~p~~~~~a~~v~gr~yyd~~V~~~k~~~I~~~~~~a 306 (329) T protein:vir:10 269 LNSNVPGMFGTLAEQMLYTGAFVPEHLQKYIFTIGGKE 306 (329) T ss_pred eeCCCCccchheeeeeeeeeeEEEccccCEEEEecccC Confidence 9874 8899999999999999999999766554 2222 No 28 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=100.00 E-value=8.6e-45 Score=262.17 Aligned_cols=286 Identities=12% Similarity=0.045 Sum_probs=224.4 Q ss_pred Cc--cccCcccCCCcccCCccCcccchhHHHH-HHHHHHHHHHHHHHhhhccc-cccc-cccccceEEEeccccceeeec Q lcl|NC_020078. 1 MS--IFDGQTPSYDVTRPNQRHGAGDPLADVT-EQFTGTVEGTIKRRSIMAGF-VPVR-SVRGTSTISNRGISKAKLQKI 75 (339) Q Consensus 1 ~~--~~~~~~~~~~~~r~~~~~~~~~~~a~~i-e~~~g~v~~~f~~~sv~~~~-v~~r-~i~~G~tv~i~~iG~~t~~~~ 75 (339) |. |-.. .--.-|.-.|..+.+-+++.+.+ |.|++.|++.+...+....+ +|.+ +..+|++|+||+++.+.++|| T Consensus 1 ~~~~~~~~-~~~~~~~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~DY 79 (319) T protein:vir:97 1 MNKTIKNA-TGMLKLNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDY 79 (319) T ss_pred CCcccccc-cceeEeehhhhhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhcccCcceEeccCcEEEEeeecccccccc Confidence 21 1110 00112444566778888887766 99999999877777766543 4543 566899999999999999999 Q ss_pred cCCCCCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcch--HHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Q lcl|NC_020078. 76 APGTTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDY--QGEVAREQGQEIANMYDETFFIMAAKAAIASDSPY 153 (339) Q Consensus 76 ~~g~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~--~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~ 153 (339) ++++..... .++.+..+++|||.+||.|.||++|..|++.++ ...+.+.+...++..+|.+.+..++..|.. T Consensus 80 ~R~~g~~~g-~vt~~~~t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~----- 153 (319) T protein:vir:97 80 KRNATNEFD-HPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAK----- 153 (319) T ss_pred cCCCCcccC-CcccceeEEEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhccc----- Confidence 999888764 577889999999999999999999999998876 456678889999999999998887654321 Q ss_pred cccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhccccccc Q lcl|NC_020078. 154 GTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAG 233 (339) Q Consensus 154 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~ 233 (339) . .+...+++++|++|+++.++|+|++| | ++||++|+|++|.+|+++++|+...-.+ . T Consensus 154 -------------~-----~~~~~t~~n~y~~i~~a~~~Lde~~V--P-~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~--~ 210 (319) T protein:vir:97 154 -------------H-----LTVGTGSDAQYDAVLDVSVELDEIKA--P-ENRVLFVSPTFYKGIKKFVIALPQGDTR--Q 210 (319) T ss_pred -------------c-----cccccCHHHHHHHHHHHHHHHHhcCC--C-CCcEEEeCHHHHHHHHhhhhhhcccccc--c Confidence 0 11224568899999999999999999 7 5899999999999999999998754332 2 Q ss_pred ceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeee-chhhhH Q lcl|NC_020078. 234 ETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFF-DDLSKL 312 (339) Q Consensus 234 ~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~-~~~~~~ 312 (339) ..+.+|.|++++||+|+++++-.. ..+-.++.|++|+.++.+.+ .+|+++ .+++|+ T Consensus 211 ~~~~~g~Vg~idG~~Vi~vps~~~----------------------k~in~i~~h~~A~~~~~k~~-~~~~~~p~~~~~a 267 (319) T protein:vir:97 211 QVLGKGVQGELDGFVIVKVPTKLL----------------------QGLQAIAVVGEVLASPIQAD-LAKTNSNIPGMFG 267 (319) T ss_pred cceeeeeceeecCeEEEEeccccc----------------------ccceEEEEcCCeeeeeeeee-eeeccCCCccccc Confidence 457899999999999999765431 11224788999999999988 789887 588999 Q ss_pred HHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 313 WFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 313 d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) |+++++++||++|+||+..+++....+ T Consensus 268 ~~v~gr~y~d~~V~~~k~~~Iy~~~~~ 294 (319) T protein:vir:97 268 TLAEQLLYTGAFVPEHLQKYIFTIGGT 294 (319) T ss_pred eeeeeeeeeeeEEeccccceEEEeecC Confidence 999999999999999998777654333 No 29 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=100.00 E-value=8.6e-45 Score=262.17 Aligned_cols=286 Identities=12% Similarity=0.045 Sum_probs=224.4 Q ss_pred Cc--cccCcccCCCcccCCccCcccchhHHHH-HHHHHHHHHHHHHHhhhccc-cccc-cccccceEEEeccccceeeec Q lcl|NC_020078. 1 MS--IFDGQTPSYDVTRPNQRHGAGDPLADVT-EQFTGTVEGTIKRRSIMAGF-VPVR-SVRGTSTISNRGISKAKLQKI 75 (339) Q Consensus 1 ~~--~~~~~~~~~~~~r~~~~~~~~~~~a~~i-e~~~g~v~~~f~~~sv~~~~-v~~r-~i~~G~tv~i~~iG~~t~~~~ 75 (339) |. |-.. .--.-|.-.|..+.+-+++.+.+ |.|++.|++.+...+....+ +|.+ +..+|++|+||+++.+.++|| T Consensus 1 ~~~~~~~~-~~~~~~~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~DY 79 (319) T protein:vir:94 1 MNKTIKNA-TGMLKLNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDY 79 (319) T ss_pred CCcccccc-cceeEeehhhhhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhcccCcceEeccCcEEEEeeecccccccc Confidence 21 1110 00112444566778888887766 99999999877777766543 4543 566899999999999999999 Q ss_pred cCCCCCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcch--HHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Q lcl|NC_020078. 76 APGTTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDY--QGEVAREQGQEIANMYDETFFIMAAKAAIASDSPY 153 (339) Q Consensus 76 ~~g~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~--~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~ 153 (339) ++++..... .++.+..+++|||.+||.|.||++|..|++.++ ...+.+.+...++..+|.+.+..++..|.. T Consensus 80 ~R~~g~~~g-~vt~~~~t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~----- 153 (319) T protein:vir:94 80 KRNATNEFD-HPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAK----- 153 (319) T ss_pred cCCCCcccC-CcccceeEEEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhccc----- Confidence 999888764 577889999999999999999999999998876 456678889999999999998887654321 Q ss_pred cccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhccccccc Q lcl|NC_020078. 154 GTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAG 233 (339) Q Consensus 154 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~ 233 (339) . .+...+++++|++|+++.++|+|++| | ++||++|+|++|.+|+++++|+...-.+ . T Consensus 154 -------------~-----~~~~~t~~n~y~~i~~a~~~Lde~~V--P-~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~--~ 210 (319) T protein:vir:94 154 -------------H-----LTVGTGSDAQYDAVLDVSVELDEIKA--P-ENRVLFVSPTFYKGIKKFVIALPQGDTR--Q 210 (319) T ss_pred -------------c-----cccccCHHHHHHHHHHHHHHHHhcCC--C-CCcEEEeCHHHHHHHHhhhhhhcccccc--c Confidence 0 11224568899999999999999999 7 5899999999999999999998754332 2 Q ss_pred ceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeee-chhhhH Q lcl|NC_020078. 234 ETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFF-DDLSKL 312 (339) Q Consensus 234 ~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~-~~~~~~ 312 (339) ..+.+|.|++++||+|+++++-.. ..+-.++.|++|+.++.+.+ .+|+++ .+++|+ T Consensus 211 ~~~~~g~Vg~idG~~Vi~vps~~~----------------------k~in~i~~h~~A~~~~~k~~-~~~~~~p~~~~~a 267 (319) T protein:vir:94 211 QVLGKGVQGELDGFVIVKVPTKLL----------------------QGLQAIAVVGEVLASPIQAD-LAKTNSNIPGMFG 267 (319) T ss_pred cceeeeeceeecCeEEEEeccccc----------------------ccceEEEEcCCeeeeeeeee-eeeccCCCccccc Confidence 457899999999999999765431 11224788999999999988 789887 588999 Q ss_pred HHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 313 WFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 313 d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) |+++++++||++|+||+..+++....+ T Consensus 268 ~~v~gr~y~d~~V~~~k~~~Iy~~~~~ 294 (319) T protein:vir:94 268 TLAEQLLYTGAFVPEHLQKYIFTIGGT 294 (319) T ss_pred eeeeeeeeeeeEEeccccceEEEeecC Confidence 999999999999999998777654333 No 30 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=100.00 E-value=2.6e-44 Score=259.51 Aligned_cols=263 Identities=18% Similarity=0.139 Sum_probs=219.3 Q ss_pred ccCcccchhHHHH-HHHHHHHHHHHHHHhhhccccccc-cc--cccceEEEecccc-ceeeeccCCCCCCCCCCCCccce Q lcl|NC_020078. 18 QRHGAGDPLADVT-EQFTGTVEGTIKRRSIMAGFVPVR-SV--RGTSTISNRGISK-AKLQKIAPGTTPPPSTEPHTSKI 92 (339) Q Consensus 18 ~~~~~~~~~a~~i-e~~~g~v~~~f~~~sv~~~~v~~r-~i--~~G~tv~i~~iG~-~t~~~~~~g~~i~~~~~~~~~~~ 92 (339) ..+..+-...+++ |+|+..|.+.+++..++.++++.. ++ ++|++|+||+++. ..+.+|..|++++.. .+.+++. T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~~g~~i~~~-~it~~~~ 79 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVD-QIGTSKR 79 (274) T ss_pred CCccccchhhhhhhHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCccccCCCCcCchh-hccccee Confidence 3333343334555 999999999999999999998776 33 3599999999885 488899999999875 5678889 Q ss_pred EEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCccccccccccC Q lcl|NC_020078. 93 FLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAG 172 (339) Q Consensus 93 ~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~ 172 (339) +++|++ .+++|.++|++..|+..|++++++++++++||+.+|+.++..+..+.. .. T Consensus 80 ~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~~-----------------------~~ 135 (274) T protein:vir:96 80 EAKVRK-IGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL-----------------------TV 135 (274) T ss_pred EEEEEe-eeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-----------------------Cc Confidence 999988 578999999999999999999999999999999999999876643211 00 Q ss_pred ccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhccc--chhhhcccccccceeecceeEEEeceEEE Q lcl|NC_020078. 173 ANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAE--HITNGEYVTSAGETLNTKYMFAAFGVPVI 250 (339) Q Consensus 173 ~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~--~~~n~d~~~~~~~~l~~G~v~~i~G~~V~ 250 (339) .+...+ ++.|.+|.++|+++++ .+||++|+|++|..|+++. +|+...- .+...+++|.|++++||+|+ T Consensus 136 ~~~~~~----~d~i~dA~~~l~d~~~----~~~~ivv~p~~~~~L~k~~~~~f~~~~~--~g~~~~~~g~ig~~~G~~Vi 205 (274) T protein:vir:96 136 EADITK----LDGLQTAIDKFNDEDL----EPMVLFVNPLDAGGLRTSASDNFTRPTQ--LGDNIIVKGAFGEALGAVIV 205 (274) T ss_pred Cccccc----HHHHHHHHHHhcccCC----CceEEEeCHHHHHHHHhccccccccccc--ccccceeecccceecCeeEE Confidence 011122 5788899999998876 4699999999999999974 6776532 23346899999999999999 Q ss_pred EeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhHHHHHHHHHhCCccccccc Q lcl|NC_020078. 251 TSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWFIDSWLAFGVTINRTEY 330 (339) Q Consensus 251 ~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~i~g~~~~Ga~v~rPe~ 330 (339) +||++|.+ .++++++.|++++..+++++|..|++++++|.|.+++.||++++||++ T Consensus 206 ~s~~~p~~------------------------t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~yg~~~~~~~~ 261 (274) T protein:vir:96 206 RSNKLNKG------------------------EALLAKKGAVKLITKRDFFLEKDRDASRKSTALYSDKHYVAYLYDESK 261 (274) T ss_pred EcCCCCcc------------------------eEEEEeCcceeeeecCCcccccccchhhcccEEEEeeEEEEEEEcCcc Confidence 99999842 257899999999999999999999999999999999999999999999 Q ss_pred eEEEEecCC Q lcl|NC_020078. 331 AGVIKLPAA 339 (339) Q Consensus 331 ~v~i~~~~a 339 (339) +++|...+| T Consensus 262 vv~~t~~~~ 270 (274) T protein:vir:96 262 VVKITKGAG 270 (274) T ss_pred EEEEEcCcc Confidence 999999999 No 31 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=100.00 E-value=4.1e-43 Score=252.94 Aligned_cols=263 Identities=18% Similarity=0.155 Sum_probs=218.5 Q ss_pred ccCcccchhHHHH-HHHHHHHHHHHHHHhhhccccccc-ccc--ccceEEEeccccc-eeeeccCCCCCCCCCCCCccce Q lcl|NC_020078. 18 QRHGAGDPLADVT-EQFTGTVEGTIKRRSIMAGFVPVR-SVR--GTSTISNRGISKA-KLQKIAPGTTPPPSTEPHTSKI 92 (339) Q Consensus 18 ~~~~~~~~~a~~i-e~~~g~v~~~f~~~sv~~~~v~~r-~i~--~G~tv~i~~iG~~-t~~~~~~g~~i~~~~~~~~~~~ 92 (339) ..++.+-...+++ |+|+.+|.+.+++..++.+++... ++. +|++|+||+++.+ .+.+|..|++|+.. .+.+++. T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~-~lt~~~~ 79 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTD-ILETKKR 79 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccccc-cccccee Confidence 3333333333444 999999999999999999998876 444 5999999997753 67899999999865 5778889 Q ss_pred EEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCccccccccccC Q lcl|NC_020078. 93 FLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAG 172 (339) Q Consensus 93 ~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~ 172 (339) +++|++ .+++|.++|++..|+..|++++.+++++++||+.+|+.++..+.+++... T Consensus 80 ~~~i~~-~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~----------------------- 135 (274) T protein:vir:94 80 EAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTV----------------------- 135 (274) T ss_pred EEEeee-ecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCccc----------------------- Confidence 999988 56799999999999999999999999999999999999987775432110 Q ss_pred ccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhccc--chhhhcccccccceeecceeEEEeceEEE Q lcl|NC_020078. 173 ANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAE--HITNGEYVTSAGETLNTKYMFAAFGVPVI 250 (339) Q Consensus 173 ~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~--~~~n~d~~~~~~~~l~~G~v~~i~G~~V~ 250 (339) .+..++ ++.|.+|.++|+++++ .+||++|+|++|+.|++++ +|++.. ..+...+++|.|++++||+|+ T Consensus 136 ~~~~~~----~d~i~dA~~~l~d~~~----~~~~ivv~p~~~~~L~k~~~~~f~~~s--~~g~~~~~~G~ig~~~G~~Vi 205 (274) T protein:vir:94 136 NADITK----LNGLQSAIDKFNDEDL----EPMVLFVNPLDAGKLRGDASTNFTRAT--ELGDDIIVKGAFGEALGAIIV 205 (274) T ss_pred cccccC----HHHHHHHHHHhhccCC----CceEEEeCHHHHHHHHhhhhhhccccC--cccccceeccccceecCeeEE Confidence 011222 5678888999988765 4699999999999999985 777653 333446899999999999999 Q ss_pred EeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhHHHHHHHHHhCCccccccc Q lcl|NC_020078. 251 TSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWFIDSWLAFGVTINRTEY 330 (339) Q Consensus 251 ~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~i~g~~~~Ga~v~rPe~ 330 (339) +||++|.. .++++++.|++++..+++.+|..|++++++|.|.+++.||+++++|++ T Consensus 206 ~s~~~p~~------------------------t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~ 261 (274) T protein:vir:94 206 RTNKLEAG------------------------TAILAKKGAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESK 261 (274) T ss_pred EcCCCCcc------------------------eEEEEeCcceEeeecCCceeccccchhhcccEEEEEEEEEEEEEcCCc Confidence 99999832 257899999999999999999999999999999999999999999999 Q ss_pred eEEEEecCC Q lcl|NC_020078. 331 AGVIKLPAA 339 (339) Q Consensus 331 ~v~i~~~~a 339 (339) ++.+..++| T Consensus 262 vv~~t~~~~ 270 (274) T protein:vir:94 262 AVKITKGSG 270 (274) T ss_pred eEEEecCcc Confidence 999999999 No 32 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=100.00 E-value=4.1e-43 Score=252.94 Aligned_cols=263 Identities=18% Similarity=0.155 Sum_probs=218.5 Q ss_pred ccCcccchhHHHH-HHHHHHHHHHHHHHhhhccccccc-ccc--ccceEEEeccccc-eeeeccCCCCCCCCCCCCccce Q lcl|NC_020078. 18 QRHGAGDPLADVT-EQFTGTVEGTIKRRSIMAGFVPVR-SVR--GTSTISNRGISKA-KLQKIAPGTTPPPSTEPHTSKI 92 (339) Q Consensus 18 ~~~~~~~~~a~~i-e~~~g~v~~~f~~~sv~~~~v~~r-~i~--~G~tv~i~~iG~~-t~~~~~~g~~i~~~~~~~~~~~ 92 (339) ..++.+-...+++ |+|+.+|.+.+++..++.+++... ++. +|++|+||+++.+ .+.+|..|++|+.. .+.+++. T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~-~lt~~~~ 79 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTD-ILETKKR 79 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccccc-cccccee Confidence 3333333333444 999999999999999999998876 444 5999999997753 67899999999865 5778889 Q ss_pred EEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCccccccccccC Q lcl|NC_020078. 93 FLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAG 172 (339) Q Consensus 93 ~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~ 172 (339) +++|++ .+++|.++|++..|+..|++++.+++++++||+.+|+.++..+.+++... T Consensus 80 ~~~i~~-~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~----------------------- 135 (274) T protein:vir:97 80 EAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTV----------------------- 135 (274) T ss_pred EEEeee-ecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCccc----------------------- Confidence 999988 56799999999999999999999999999999999999987775432110 Q ss_pred ccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhccc--chhhhcccccccceeecceeEEEeceEEE Q lcl|NC_020078. 173 ANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAE--HITNGEYVTSAGETLNTKYMFAAFGVPVI 250 (339) Q Consensus 173 ~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~--~~~n~d~~~~~~~~l~~G~v~~i~G~~V~ 250 (339) .+..++ ++.|.+|.++|+++++ .+||++|+|++|+.|++++ +|++.. ..+...+++|.|++++||+|+ T Consensus 136 ~~~~~~----~d~i~dA~~~l~d~~~----~~~~ivv~p~~~~~L~k~~~~~f~~~s--~~g~~~~~~G~ig~~~G~~Vi 205 (274) T protein:vir:97 136 NADITK----LNGLQSAIDKFNDEDL----EPMVLFVNPLDAGKLRGDASTNFTRAT--ELGDDIIVKGAFGEALGAIIV 205 (274) T ss_pred cccccC----HHHHHHHHHHhhccCC----CceEEEeCHHHHHHHHhhhhhhccccC--cccccceeccccceecCeeEE Confidence 011222 5678888999988765 4699999999999999985 777653 333446899999999999999 Q ss_pred EeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhHHHHHHHHHhCCccccccc Q lcl|NC_020078. 251 TSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWFIDSWLAFGVTINRTEY 330 (339) Q Consensus 251 ~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~i~g~~~~Ga~v~rPe~ 330 (339) +||++|.. .++++++.|++++..+++.+|..|++++++|.|.+++.||+++++|++ T Consensus 206 ~s~~~p~~------------------------t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~ 261 (274) T protein:vir:97 206 RTNKLEAG------------------------TAILAKKGAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESK 261 (274) T ss_pred EcCCCCcc------------------------eEEEEeCcceEeeecCCceeccccchhhcccEEEEEEEEEEEEEcCCc Confidence 99999832 257899999999999999999999999999999999999999999999 Q ss_pred eEEEEecCC Q lcl|NC_020078. 331 AGVIKLPAA 339 (339) Q Consensus 331 ~v~i~~~~a 339 (339) ++.+..++| T Consensus 262 vv~~t~~~~ 270 (274) T protein:vir:97 262 AVKITKGSG 270 (274) T ss_pred eEEEecCcc Confidence 999999999 No 33 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=100.00 E-value=3.7e-43 Score=253.24 Aligned_cols=263 Identities=18% Similarity=0.153 Sum_probs=218.6 Q ss_pred ccCcccchhHHHH-HHHHHHHHHHHHHHhhhccccccc-ccc--ccceEEEecccc-ceeeeccCCCCCCCCCCCCccce Q lcl|NC_020078. 18 QRHGAGDPLADVT-EQFTGTVEGTIKRRSIMAGFVPVR-SVR--GTSTISNRGISK-AKLQKIAPGTTPPPSTEPHTSKI 92 (339) Q Consensus 18 ~~~~~~~~~a~~i-e~~~g~v~~~f~~~sv~~~~v~~r-~i~--~G~tv~i~~iG~-~t~~~~~~g~~i~~~~~~~~~~~ 92 (339) ..+..+-...+++ |+|+..|++.+.+..++.+++... ++. +|++|+||+++. ..+++|..|++++.. .+.+++. T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~eg~~i~~~-~it~~~~ 79 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTD-ILETKKR 79 (274) T ss_pred CCccceehhheechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCcccccCCCccccc-cccccee Confidence 3333333334455 999999999999999999998775 444 599999999875 478899999999875 5678889 Q ss_pred EEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCccccccccccC Q lcl|NC_020078. 93 FLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAG 172 (339) Q Consensus 93 ~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~ 172 (339) +++|++ .++.|.++|++..|+..|++++.+++++++|++.+|+.++..+.++... . T Consensus 80 ~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~~-----------------------~ 135 (274) T protein:vir:93 80 EAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT-----------------------V 135 (274) T ss_pred EEEeee-ecccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc-----------------------c Confidence 999987 5689999999999999999999999999999999999998776543210 0 Q ss_pred ccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhccc--chhhhcccccccceeecceeEEEeceEEE Q lcl|NC_020078. 173 ANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAE--HITNGEYVTSAGETLNTKYMFAAFGVPVI 250 (339) Q Consensus 173 ~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~--~~~n~d~~~~~~~~l~~G~v~~i~G~~V~ 250 (339) .+..++ ++.|.+|..+|+++++ .+||++|+|++|+.|++++ +|++.. ..+...+++|.|++++||+|+ T Consensus 136 ~~~~~~----~d~i~dA~~~l~d~~~----~~~~ivv~p~~~~~L~k~~~~~f~~~s--~~g~~~~~~G~ig~~~G~~Vi 205 (274) T protein:vir:93 136 NADITK----LNGLQSAIDKFNDEDL----EPMVLFINPLDAGKLRGDASTNFTRAT--ELGDDIIVKGAFGEALGAIIV 205 (274) T ss_pred cccccC----HHHHHHHHHHhhhccC----CccEEEeCHHHHHHHHhhhhhcccccc--cccccceeecccceecCeeEE Confidence 111222 4677888899988765 4699999999999999986 666553 333446899999999999999 Q ss_pred EeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhHHHHHHHHHhCCccccccc Q lcl|NC_020078. 251 TSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWFIDSWLAFGVTINRTEY 330 (339) Q Consensus 251 ~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~i~g~~~~Ga~v~rPe~ 330 (339) +||++|.. .++++|+.|++++..+++.+|..|++++++|.|++++.||++++||++ T Consensus 206 ~s~~~p~~------------------------t~~l~~~gai~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~ 261 (274) T protein:vir:93 206 RTNKLEAG------------------------TAILAKKGAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESK 261 (274) T ss_pred EcCCCCcc------------------------eEEEEeCCeEEEEecCCcccccccchhhcccEEEEEEEEEEEEEcCCc Confidence 99999832 257899999999999999999999999999999999999999999999 Q ss_pred eEEEEecCC Q lcl|NC_020078. 331 AGVIKLPAA 339 (339) Q Consensus 331 ~v~i~~~~a 339 (339) ++.++..+| T Consensus 262 ~v~~t~~~~ 270 (274) T protein:vir:93 262 AVKITKGSG 270 (274) T ss_pred eEEEeeCcc Confidence 999999999 No 34 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=100.00 E-value=7.9e-43 Score=251.40 Aligned_cols=263 Identities=18% Similarity=0.154 Sum_probs=218.2 Q ss_pred ccCcccchhHHHH-HHHHHHHHHHHHHHhhhccccccc-cc--cccceEEEeccccc-eeeeccCCCCCCCCCCCCccce Q lcl|NC_020078. 18 QRHGAGDPLADVT-EQFTGTVEGTIKRRSIMAGFVPVR-SV--RGTSTISNRGISKA-KLQKIAPGTTPPPSTEPHTSKI 92 (339) Q Consensus 18 ~~~~~~~~~a~~i-e~~~g~v~~~f~~~sv~~~~v~~r-~i--~~G~tv~i~~iG~~-t~~~~~~g~~i~~~~~~~~~~~ 92 (339) ..++.+-...+++ |+|+..|.+.+.+..++.+++.+. ++ ++|++|+||..+.. .+.+|..|+.|+.. .+.+++. T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~-~lt~~~~ 79 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTD-ILETKKR 79 (274) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccchh-hccccee Confidence 3333333333444 999999999999999999998886 44 45999999997653 67899999999865 5788889 Q ss_pred EEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCccccccccccC Q lcl|NC_020078. 93 FLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAG 172 (339) Q Consensus 93 ~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~ 172 (339) +++|++ .++.|.++|++..|+..|++++.+++++++||+.+|+.++..+.++.... T Consensus 80 ~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~~~----------------------- 135 (274) T protein:vir:12 80 EAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTV----------------------- 135 (274) T ss_pred eEEeee-ecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccc----------------------- Confidence 999988 68899999999999999999999999999999999999987765432100 Q ss_pred ccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhccc--chhhhcccccccceeecceeEEEeceEEE Q lcl|NC_020078. 173 ANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAE--HITNGEYVTSAGETLNTKYMFAAFGVPVI 250 (339) Q Consensus 173 ~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~--~~~n~d~~~~~~~~l~~G~v~~i~G~~V~ 250 (339) ...+.+ ++.|.+|.++|.+++. .+||++|+|++|+.|++++ +|++..- .+...+++|.|++++||+|+ T Consensus 136 ~~~a~~----~d~i~dA~~~lgd~~~----~~~~ivv~p~~~~~L~k~~~~~fv~~s~--~g~~~~~~G~ig~~~G~~Vi 205 (274) T protein:vir:12 136 NADITK----LNGLQSAIDKFNDEDL----EPMVLFINPLDAGKLRGDASTNFTRATE--LGDDIIVKGAFGEALGAIIV 205 (274) T ss_pred cccccC----HHHHHHHHHHhccccc----cccEEEeCHHHHHHHHhhhhhhcccccc--ccccceecccceeecCeeEE Confidence 011122 5678888999987654 5799999999999999985 7876532 23346899999999999999 Q ss_pred EeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhHHHHHHHHHhCCccccccc Q lcl|NC_020078. 251 TSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWFIDSWLAFGVTINRTEY 330 (339) Q Consensus 251 ~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~i~g~~~~Ga~v~rPe~ 330 (339) +||++|.. .++++++.|++++..+++++|..|++++++|.|.+++.||++++||+. T Consensus 206 ~s~~~p~~------------------------t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~ 261 (274) T protein:vir:12 206 RSNKLEAG------------------------TAILAKKGAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESK 261 (274) T ss_pred EeCCCCcc------------------------eEEEEeccceeeeecCCceeccccchhhcccEEEeeeEEEEEEEcCCc Confidence 99999832 257889999999999999999999999999999999999999999999 Q ss_pred eEEEEecCC Q lcl|NC_020078. 331 AGVIKLPAA 339 (339) Q Consensus 331 ~v~i~~~~a 339 (339) ++++..++| T Consensus 262 vv~~t~~~~ 270 (274) T protein:vir:12 262 AVKITKGSG 270 (274) T ss_pred eEEEEcCCc Confidence 999999999 No 35 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=100.00 E-value=6.8e-42 Score=246.29 Aligned_cols=263 Identities=17% Similarity=0.123 Sum_probs=216.4 Q ss_pred ccCcccchhH-HH-HHHHHHHHHHHHHHHhhhccccccc-ccc--ccceEEEeccccc-eeeeccCCCCCCCCCCCCccc Q lcl|NC_020078. 18 QRHGAGDPLA-DV-TEQFTGTVEGTIKRRSIMAGFVPVR-SVR--GTSTISNRGISKA-KLQKIAPGTTPPPSTEPHTSK 91 (339) Q Consensus 18 ~~~~~~~~~a-~~-ie~~~g~v~~~f~~~sv~~~~v~~r-~i~--~G~tv~i~~iG~~-t~~~~~~g~~i~~~~~~~~~~ 91 (339) ....+-...+ ++ -|+|+..|.+.+.+..+|.+++.+. ++. +|++|+||+.... .+.+|..|++|+.. .+++++ T Consensus 1 ~~~~~~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~-~lt~~~ 79 (275) T protein:vir:96 1 MALENMTKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAKVVPEGEEIPID-LIETKK 79 (275) T ss_pred CCCcccchhhhhhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeeccCCccccccCCCCcchh-hcccce Confidence 3322222332 33 3999999999999999999998765 444 4999999997753 67889999999875 577888 Q ss_pred eEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCcccccccccc Q lcl|NC_020078. 92 IFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLA 171 (339) Q Consensus 92 ~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~ 171 (339) .+++|.+ .++.|.++|++..|+..|++.+.+++++++||+++|+.++..+.++... T Consensus 80 ~~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~~~----------------------- 135 (275) T protein:vir:96 80 RQATIRK-IGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGATLK----------------------- 135 (275) T ss_pred eeEEeeh-hcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc----------------------- Confidence 8899955 6999999999999999999999999999999999999998776542210 Q ss_pred CccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhccc--chhhhcccccccceeecceeEEEeceEE Q lcl|NC_020078. 172 GANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAE--HITNGEYVTSAGETLNTKYMFAAFGVPV 249 (339) Q Consensus 172 ~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~--~~~n~d~~~~~~~~l~~G~v~~i~G~~V 249 (339) ..+...+ ++.|.++..+|.+.+. .+||++|+|++|+.|+++. +|+..+..+ ...+++|.|++++|++| T Consensus 136 ~~~~~~~----~d~i~dA~~~lgd~~~----~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g--~~~~~~G~ig~~~G~~V 205 (275) T protein:vir:96 136 VEADITK----LAGLQTAIDKFNDEDL----EPMVLFVNPLDAGKLRASATDNFTRATLLG--DNVIVKGAFGEALGAII 205 (275) T ss_pred ccccccC----HHHHHHHHHHhccccC----CccEEEeCHHHHHHHHhccccccccccccc--ccceeccccceecCeeE Confidence 0111222 5677888899987654 4799999999999998874 788765444 34689999999999999 Q ss_pred EEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhHHHHHHHHHhCCcccccc Q lcl|NC_020078. 250 ITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWFIDSWLAFGVTINRTE 329 (339) Q Consensus 250 ~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~i~g~~~~Ga~v~rPe 329 (339) ++||++|.+ .++++++.|++++..+++++|..|++++++|.|++++.||++++||+ T Consensus 206 i~s~~~p~~------------------------t~~i~~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~ 261 (275) T protein:vir:96 206 VRSNKIKEG------------------------EAILAKRGAVKLITKRDFFLETERHASHKSTALFSDKHYVAYLYDES 261 (275) T ss_pred EEeCCCCcc------------------------eEEEEeccceeeeecCCcccccccchhhcCcEEEEeEEEEEEEEcCc Confidence 999999732 25788999999999999999999999999999999999999999999 Q ss_pred ceEEEEecCC Q lcl|NC_020078. 330 YAGVIKLPAA 339 (339) Q Consensus 330 ~~v~i~~~~a 339 (339) ++++++++.| T Consensus 262 ~vv~~t~~~~ 271 (275) T protein:vir:96 262 KVVKITKSAS 271 (275) T ss_pred cEEEEEeccc Confidence 9999999999 No 36 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=100.00 E-value=7.1e-42 Score=246.19 Aligned_cols=267 Identities=13% Similarity=0.088 Sum_probs=217.3 Q ss_pred ccCcccchhHHH-HHHHHHHHHHHHHHHhhhccccccc-ccc--ccceEEEeccccc-eeeeccCCCCCCCCCCCCccce Q lcl|NC_020078. 18 QRHGAGDPLADV-TEQFTGTVEGTIKRRSIMAGFVPVR-SVR--GTSTISNRGISKA-KLQKIAPGTTPPPSTEPHTSKI 92 (339) Q Consensus 18 ~~~~~~~~~a~~-ie~~~g~v~~~f~~~sv~~~~v~~r-~i~--~G~tv~i~~iG~~-t~~~~~~g~~i~~~~~~~~~~~ 92 (339) ..+..+-...++ -|+|+..|.+.|.+.+++.+++... ++. .|++|+||+.+.+ ...++..|.+++.. .++.++. T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~~~~eg~~i~~~-~lt~~~~ 79 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAEGGEISLD-KIGTTTK 79 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeeccCccccccCCCCccChh-hcCCcce Confidence 333333333344 4999999999999999999998765 444 4999999998765 45679999999865 4678889 Q ss_pred EEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCccccccccccC Q lcl|NC_020078. 93 FLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAG 172 (339) Q Consensus 93 ~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~ 172 (339) +++|++. ..+|.++|+|+.|+..|++++++++++++||+.+|+.++..+..+.. . T Consensus 80 ~~~i~~~-~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~~~---------------------~--- 134 (272) T protein:vir:36 80 SVTIKKA-AKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTTSQ---------------------T--- 134 (272) T ss_pred eEeeehh-hccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc---------------------c--- Confidence 9999875 56899999999999999999999999999999999998866532210 0 Q ss_pred ccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceeecceeEEEeceEEEEe Q lcl|NC_020078. 173 ANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLNTKYMFAAFGVPVITS 252 (339) Q Consensus 173 ~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~~G~v~~i~G~~V~~S 252 (339) .+...-++.|.+|..+|.++++ + .||++|+|++|+.|+++.+|.+.. ...+...+++|.|++++|++|++| T Consensus 135 ----~~~~~~~d~i~~A~~~lgd~~~--~--~~~ivv~p~~~~~L~k~~~~~~~~-~~~~~~~~~~G~ig~~~G~~Vv~s 205 (272) T protein:vir:36 135 ----VSTKANVDGVQAALDIFNDEDA--Q--AYVLIVNPKDAAKIRKDANAKNIG-SEVGANALINGTYADVLGAQIVRS 205 (272) T ss_pred ----ccccccHHHHHHHHHHhhhcCC--C--ceEEEEcHHHHHHHhccccccccc-ccccccceeeeccceecCeeEEEe Confidence 0011124678889999998876 2 589999999999999999987763 222344688999999999999999 Q ss_pred ccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhHHHHHHHHHhCCccccccceE Q lcl|NC_020078. 253 NNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWFIDSWLAFGVTINRTEYAG 332 (339) Q Consensus 253 nnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~i~g~~~~Ga~v~rPe~~v 332 (339) |++|.+.. ...++++++.|+++...+++++|..|++++|+|.|++++.||++++||++++ T Consensus 206 ~~~p~~~~--------------------~~~~~~~~~gA~~~~~~~~~~vE~~R~~~~~~d~i~~~~~y~~~v~~~~~vv 265 (272) T protein:vir:36 206 KKLAEGSA--------------------LMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVV 265 (272) T ss_pred CCCCCCce--------------------eEEEEEecccceeeeecCCcccccccchhhcCcEEEEEEEEEEEEEcCccEE Confidence 99984321 1235778999999999999999999999999999999999999999999999 Q ss_pred EEEecCC Q lcl|NC_020078. 333 VIKLPAA 339 (339) Q Consensus 333 ~i~~~~a 339 (339) ++.+.+- T Consensus 266 ~~t~~g~ 272 (272) T protein:vir:36 266 NITFTGV 272 (272) T ss_pred EEeecCC Confidence 9988888 No 37 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=100.00 E-value=6.2e-42 Score=246.51 Aligned_cols=263 Identities=17% Similarity=0.154 Sum_probs=215.6 Q ss_pred ccCcccchhHHHH-HHHHHHHHHHHHHHhhhccccccc-ccc--ccceEEEecccc-ceeeeccCCCCCCCCCCCCccce Q lcl|NC_020078. 18 QRHGAGDPLADVT-EQFTGTVEGTIKRRSIMAGFVPVR-SVR--GTSTISNRGISK-AKLQKIAPGTTPPPSTEPHTSKI 92 (339) Q Consensus 18 ~~~~~~~~~a~~i-e~~~g~v~~~f~~~sv~~~~v~~r-~i~--~G~tv~i~~iG~-~t~~~~~~g~~i~~~~~~~~~~~ 92 (339) ..++.+-...+++ |+|+.+|++.+.+..++.+++... ++. +|++|+||+... ..+.+|..|+.|+.. .+.+++. T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~-~lt~~~~ 79 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPTD-ILETKKR 79 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccchh-hccccee Confidence 3333333344554 999999999999999999997554 454 499999999775 367889999999875 5788889 Q ss_pred EEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCccccccccccC Q lcl|NC_020078. 93 FLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAG 172 (339) Q Consensus 93 ~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~ 172 (339) +++|++ .+++|.++|+|..|+..|++++++++++++||+.+|+.++..+.++.... T Consensus 80 ~~~i~~-~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~----------------------- 135 (274) T protein:vir:95 80 EAKIRK-IAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLTV----------------------- 135 (274) T ss_pred EEEeee-eecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc----------------------- Confidence 999988 58899999999999999999999999999999999999987765432110 Q ss_pred ccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhccc--chhhhcccccccceeecceeEEEeceEEE Q lcl|NC_020078. 173 ANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAE--HITNGEYVTSAGETLNTKYMFAAFGVPVI 250 (339) Q Consensus 173 ~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~--~~~n~d~~~~~~~~l~~G~v~~i~G~~V~ 250 (339) .+...+ ++.|.+|.++|.+++. .+||++|+|++|+.|++++ +|+... ..+...+++|.|++++||+|+ T Consensus 136 ~~~~~~----~d~i~~A~~~lgd~~~----~~~~ivv~p~~~~~L~k~~~~~f~~~s--~~g~~~~~~G~ig~~~G~~Vi 205 (274) T protein:vir:95 136 EADITK----LTGLQTAIDKFNDEDL----EPMVLFISPLDAGKLRGDATTNFTRAT--ELGDDVIVKGAFGEALGAVIV 205 (274) T ss_pred cccccC----HHHHHHHHHHhccccc----cccEEEeCHHHHHHHHhhccccccccc--cccccceeccccceecCeEEE Confidence 011122 5678888999987764 4799999999999999986 677643 223346899999999999999 Q ss_pred EeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhHHHHHHHHHhCCccccccc Q lcl|NC_020078. 251 TSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWFIDSWLAFGVTINRTEY 330 (339) Q Consensus 251 ~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~i~g~~~~Ga~v~rPe~ 330 (339) +||++|.. .++++++.|+++...+++.+|..|++++++|.|.+++.||++++||++ T Consensus 206 ~s~~~~~~------------------------t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~ 261 (274) T protein:vir:95 206 RSNKLEAG------------------------TAILAKKGAVKLITKRDFFLETDRDPSTKTTALYSDKHYVAYLYDESK 261 (274) T ss_pred EeCCCCCc------------------------eEEEEeccceeeeecCCcccccccccccccCEEEEeEEEEEEEEcCCc Confidence 99999732 257889999999999999999999999999999999999999999999 Q ss_pred eEEEEecCC Q lcl|NC_020078. 331 AGVIKLPAA 339 (339) Q Consensus 331 ~v~i~~~~a 339 (339) ++.++..+- T Consensus 262 ~v~~tk~~~ 270 (274) T protein:vir:95 262 AVKITKGSG 270 (274) T ss_pred EEEEEcCCc Confidence 999986555 No 38 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=100.00 E-value=6.2e-42 Score=246.51 Aligned_cols=263 Identities=17% Similarity=0.154 Sum_probs=215.6 Q ss_pred ccCcccchhHHHH-HHHHHHHHHHHHHHhhhccccccc-ccc--ccceEEEecccc-ceeeeccCCCCCCCCCCCCccce Q lcl|NC_020078. 18 QRHGAGDPLADVT-EQFTGTVEGTIKRRSIMAGFVPVR-SVR--GTSTISNRGISK-AKLQKIAPGTTPPPSTEPHTSKI 92 (339) Q Consensus 18 ~~~~~~~~~a~~i-e~~~g~v~~~f~~~sv~~~~v~~r-~i~--~G~tv~i~~iG~-~t~~~~~~g~~i~~~~~~~~~~~ 92 (339) ..++.+-...+++ |+|+.+|++.+.+..++.+++... ++. +|++|+||+... ..+.+|..|+.|+.. .+.+++. T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~-~lt~~~~ 79 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPTD-ILETKKR 79 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccchh-hccccee Confidence 3333333344554 999999999999999999997554 454 499999999775 367889999999875 5788889 Q ss_pred EEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCccccccccccC Q lcl|NC_020078. 93 FLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAG 172 (339) Q Consensus 93 ~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~ 172 (339) +++|++ .+++|.++|+|..|+..|++++++++++++||+.+|+.++..+.++.... T Consensus 80 ~~~i~~-~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~----------------------- 135 (274) T protein:vir:96 80 EAKIRK-IAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLTV----------------------- 135 (274) T ss_pred EEEeee-eecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc----------------------- Confidence 999988 58899999999999999999999999999999999999987765432110 Q ss_pred ccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhccc--chhhhcccccccceeecceeEEEeceEEE Q lcl|NC_020078. 173 ANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAE--HITNGEYVTSAGETLNTKYMFAAFGVPVI 250 (339) Q Consensus 173 ~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~--~~~n~d~~~~~~~~l~~G~v~~i~G~~V~ 250 (339) .+...+ ++.|.+|.++|.+++. .+||++|+|++|+.|++++ +|+... ..+...+++|.|++++||+|+ T Consensus 136 ~~~~~~----~d~i~~A~~~lgd~~~----~~~~ivv~p~~~~~L~k~~~~~f~~~s--~~g~~~~~~G~ig~~~G~~Vi 205 (274) T protein:vir:96 136 EADITK----LTGLQTAIDKFNDEDL----EPMVLFISPLDAGKLRGDATTNFTRAT--ELGDDVIVKGAFGEALGAVIV 205 (274) T ss_pred cccccC----HHHHHHHHHHhccccc----cccEEEeCHHHHHHHHhhccccccccc--cccccceeccccceecCeEEE Confidence 011122 5678888999987764 4799999999999999986 677643 223346899999999999999 Q ss_pred EeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhHHHHHHHHHhCCccccccc Q lcl|NC_020078. 251 TSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWFIDSWLAFGVTINRTEY 330 (339) Q Consensus 251 ~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~i~g~~~~Ga~v~rPe~ 330 (339) +||++|.. .++++++.|+++...+++.+|..|++++++|.|.+++.||++++||++ T Consensus 206 ~s~~~~~~------------------------t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~ 261 (274) T protein:vir:96 206 RSNKLEAG------------------------TAILAKKGAVKLITKRDFFLETDRDPSTKTTALYSDKHYVAYLYDESK 261 (274) T ss_pred EeCCCCCc------------------------eEEEEeccceeeeecCCcccccccccccccCEEEEeEEEEEEEEcCCc Confidence 99999732 257889999999999999999999999999999999999999999999 Q ss_pred eEEEEecCC Q lcl|NC_020078. 331 AGVIKLPAA 339 (339) Q Consensus 331 ~v~i~~~~a 339 (339) ++.++..+- T Consensus 262 ~v~~tk~~~ 270 (274) T protein:vir:96 262 AVKITKGSG 270 (274) T ss_pred EEEEEcCCc Confidence 999986555 No 39 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=100.00 E-value=2.4e-41 Score=243.31 Aligned_cols=282 Identities=12% Similarity=0.042 Sum_probs=188.5 Q ss_pred chhHHHH-HHHHHHHHHHHHHHhhhccccccc---ccc--ccceEEEeccccceeeeccC-----CCCCCCCCCCCccce Q lcl|NC_020078. 24 DPLADVT-EQFTGTVEGTIKRRSIMAGFVPVR---SVR--GTSTISNRGISKAKLQKIAP-----GTTPPPSTEPHTSKI 92 (339) Q Consensus 24 ~~~a~~i-e~~~g~v~~~f~~~sv~~~~v~~r---~i~--~G~tv~i~~iG~~t~~~~~~-----g~~i~~~~~~~~~~~ 92 (339) =.+++|+ |+|+.++++.|++.++|.++++++ +++ .|++|+|++.+..++.+|++ +.++..+ ++..++. T Consensus 1 Ma~~~~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 79 (392) T protein:vir:99 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVS-DFTEDSF 79 (392) T ss_pred CccccccHHHHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeecccccceeeeccccccCCccccc-ccccceE Confidence 1234555 999999999999999999999876 664 59999999999999999864 4556654 4667889 Q ss_pred EEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCccccccccccC Q lcl|NC_020078. 93 FLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAG 172 (339) Q Consensus 93 ~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~ 172 (339) +++||+.+|++|.|+|.|+.|...|++.++.++++++||+.+|++++..+..+... ... T Consensus 80 ~~~id~~k~~~~~i~d~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~~~a~~~---------------------~~~ 138 (392) T protein:vir:99 80 PVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYE---------------------AAG 138 (392) T ss_pred EEEEeeeeecceeechHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccc---------------------ccc Confidence 99999999999999999999999999999999999999999999998766532210 011 Q ss_pred ccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhccccccc-ceeecceeEEEeceEEEE Q lcl|NC_020078. 173 ANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAG-ETLNTKYMFAAFGVPVIT 251 (339) Q Consensus 173 ~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~-~~l~~G~v~~i~G~~V~~ 251 (339) .....++...|+.|.+++++|+|++| |. |||++|+|++|+.|+++++|++.++.+... ..+++|.|++++||+||+ T Consensus 139 ~~~~~~~~~~~~~i~~a~~~L~~~~v--P~-~R~~vv~p~~~~~l~~~~~~~~~~~~g~~~~~~l~~G~vg~i~G~~v~~ 215 (392) T protein:vir:99 139 AVHEVAPDEFFKGVNGARRALNELYI--PQ-GRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVE 215 (392) T ss_pred cccccChhhhHHHHHHHHHHHhhcCC--CC-CCEEEEcHHHHHHHhcccceeecccccchhhhhhhcceeeeeeeeEEEe Confidence 22335677889999999999999999 64 799999999999999999999998877653 348899999999999999 Q ss_pred eccccccccccccccCCCcccc--ccccccceEEEEEeccceeEEEEEeeeeEEee--echhhhHHHHHHHHHhCCcccc Q lcl|NC_020078. 252 SNNAVFGKTITDHLLSNANNEK--AYDGDFKDIVAQMFSPKALLAGSTIPVTSKIF--FDDLSKLWFIDSWLAFGVTINR 327 (339) Q Consensus 252 Snnlp~~~~~~~~~l~~~~~~~--~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~--~~~~~~~d~i~g~~~~Ga~v~r 327 (339) |+++|.+.....|..+...-.. ........ +...... ..+..... ++....++..--....|.+.+. T Consensus 216 s~~~~~~t~~a~~~~a~~~at~a~v~~~~~~~--~~s~s~~-------~~v~~~~~~~~~~t~~s~~~~v~~~~g~~~v~ 286 (392) T protein:vir:99 216 STLIPHGDAYLYHPTAFIMATRAPAPPMGAVR--STAISGD-------QRIAMRWLVDYDSTITSNRSLIDTYFGLKVVE 286 (392) T ss_pred ecccccccceeeeccccccccccccccccccc--eeEEecc-------cceecceeecccceeeccccccceeEEEEEEe Confidence 9999977655544432211000 00000000 0000000 00001111 1111111111111122222221 Q ss_pred ccceEEEEe------c----------CC Q lcl|NC_020078. 328 TEYAGVIKL------P----------AA 339 (339) Q Consensus 328 Pe~~v~i~~------~----------~a 339 (339) -.+...+.. . .+ T Consensus 287 ~~~~~~~~~~~~~~~~~~~v~v~~v~~~ 314 (392) T protein:vir:99 287 DPNGVGFVRARKIHLIPGSIEVAPEAGA 314 (392) T ss_pred eccccceeeeeeeeeecceeeeeeeecc Confidence 111000000 0 00 No 40 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=100.00 E-value=5.9e-41 Score=241.12 Aligned_cols=286 Identities=15% Similarity=0.014 Sum_probs=198.8 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHH--HHHHHHHHHHHHHHhhhccccccc---ccc-ccceEEEeccccceeee Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVT--EQFTGTVEGTIKRRSIMAGFVPVR---SVR-GTSTISNRGISKAKLQK 74 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~i--e~~~g~v~~~f~~~sv~~~~v~~r---~i~-~G~tv~i~~iG~~t~~~ 74 (339) |.. +..-|| |+|+.++++.|++.+||.++++++ ++. .|+||+||+.+..++++ T Consensus 1 m~~---------------------~~N~~ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp~~~~v~d 59 (418) T protein:vir:10 1 MAV---------------------QDNNLLTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKLPYRVKSAS 59 (418) T ss_pred CCc---------------------cccccccHHHHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEeeCCceeecc Confidence 111 111233 799999999999999999999875 443 49999999999999998 Q ss_pred ccCCCCCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Q lcl|NC_020078. 75 IAPGTTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYG 154 (339) Q Consensus 75 ~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~ 154 (339) +. ++..+ ++...+..|+||+.||++|.|+|.|++|...|++++++++++++||+.+|+.++..+..++.. T Consensus 60 g~---~~~~~-~~te~~v~l~id~~k~~~~~itD~e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l~~~a~~~------ 129 (418) T protein:vir:10 60 GR---TLVKQ-PMVDQTIPFKIAYQEHVGLEYTVKDKTLDIMQFSERYLKSGMVQIANQIDRSLALTLKKAFHS------ 129 (418) T ss_pred cC---Ccccc-ccccceEEEEEecccccceeechHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc------ Confidence 65 46554 456678899999999999999999999999999999999999999999999998765443210 Q ss_pred ccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcC-CeEEEECHHHHHHHhcccchhhhccccccc Q lcl|NC_020078. 155 TAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEE-DMILVLPPAAFTALMQAEHITNGEYVTSAG 233 (339) Q Consensus 155 ~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~-~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~ 233 (339) .++. + .+...|+.|.+++.+|++++| |.+ +||+||+|++|..|++++++.... .+. . T Consensus 130 ---------~gt~----g-----t~~~~~~~i~~a~~~Ld~~~V--P~~G~R~lVv~P~~~~~L~~~~~~~~~~-~~~-~ 187 (418) T protein:vir:10 130 ---------SGTP----G-----VRPGAFIDFANAGAKQTTYAV--PQDGMRHAVLDPFTCASLSDEVTKLFKE-SMV-E 187 (418) T ss_pred ---------cccC----C-----cCcchHHHHHHHHHHHHhcCC--CCCCceEEEeCHHHHHHHhhhccccccc-ccc-c Confidence 0100 0 011237889999999999999 766 599999999999999988775432 222 3 Q ss_pred ceeecceeEEEeceEEEEeccccccccccccc-------------c-------C-----CCccc---------------- Q lcl|NC_020078. 234 ETLNTKYMFAAFGVPVITSNNAVFGKTITDHL-------------L-------S-----NANNE---------------- 272 (339) Q Consensus 234 ~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~-------------l-------~-----~~~~~---------------- 272 (339) ..+++|.|++++||+||+|||+|..+.++.+. + + ..|+. T Consensus 188 ~~lr~G~IG~i~GF~V~~S~nip~~tag~~~~t~~v~ga~~~~~~~~~~~~t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~ 267 (418) T protein:vir:10 188 QAYKMGYRGNVAAYEVYESQNLPKHTVGDHGGTPLVNGTVVNGDTVGFDGGTASTTGFLKAGDVITFGGVFGVNPQNYET 267 (418) T ss_pred hhhheeeeeeeeceEEEEecCCCcccccccccceeeecccccceeEEEeecceeeccceeeccEEEECceeecccccccc Confidence 46999999999999999999999533222111 0 0 00110 Q ss_pred ----cccccc----------c-------------------------------------------------ceEEEEEecc Q lcl|NC_020078. 273 ----KAYDGD----------F-------------------------------------------------KDIVAQMFSP 289 (339) Q Consensus 273 ----~~y~~~----------~-------------------------------------------------~~~~~~~~h~ 289 (339) ..|.+. + ....-++||+ T Consensus 268 ~~~~~~f~V~~~~~~~~~~~~tv~i~p~~~~~~~~~~~~~~~~~~~~~~~~v~a~~a~~~~it~~~~a~~~~~~nl~f~~ 347 (418) T protein:vir:10 268 TGLLQEFVVLEDVDTDAGGAGSIKISPSLNDGTATINNENGDPVSLTAYQNVTALPADNAPITVLGAANTTYEQNYLFHR 347 (418) T ss_pred cccceEEEEEeeccccccCcceeEeccccccccccccccccccccccCCCcccccccCcceeeeecccccceeeeeeeec Confidence 001110 0 0112389999 Q ss_pred ceeEEEEEee--e-----eEE-----------e--eechhhhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 290 KALLAGSTIP--V-----TSK-----------I--FFDDLSKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 290 ~A~~~~~~~~--~-----~~e-----------~--~~~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) +|+.++...- . +.. + +++.+..-+.++==..||.+.+|||.++.|.=++| T Consensus 348 ~a~~l~~~~l~~p~g~~~~~~~~~~~~G~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~~~~g~~~ 417 (418) T protein:vir:10 348 DAIALAMIDLELPQSAVIKSRAADPETGLSLTLTGAYDINEQSEIHRIDAVWGADMIYGELALRLWGAAS 417 (418) T ss_pred ceEEEEEeeccCCCCCCcceEEEeccCCeEEEEEEcccccccceEEEEEeecCceeecccceEEEEeecC Confidence 9987765432 0 111 1 11121122222112389999999999877766666 No 41 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=100.00 E-value=2.3e-38 Score=226.98 Aligned_cols=285 Identities=15% Similarity=0.053 Sum_probs=198.9 Q ss_pred CcccchhHHHHHHHHHHHHHHHHHHhhhccccccc---cc--cccceEEEeccccceeeeccCCCCCCCCCCCCccceEE Q lcl|NC_020078. 20 HGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVR---SV--RGTSTISNRGISKAKLQKIAPGTTPPPSTEPHTSKIFL 94 (339) Q Consensus 20 ~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r---~i--~~G~tv~i~~iG~~t~~~~~~g~~i~~~~~~~~~~~~l 94 (339) .+ ..+ |.|+|+++|++.|.+.+++..+.+.. ++ .+|++|+||+++.+.++||++++....+..++.+..++ T Consensus 1 MA---~~n-~a~~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~~~gg~tVkI~~i~~~gl~DY~R~~~g~~~g~~~~~~~t~ 76 (299) T protein:vir:79 1 MA---ALN-YAKEYSNVLAQAYPYTLNFGDLYATPNNGRYRWTGSKTIEIPTISTTGRVDSNRDTIAVAQRNYDNAWEPK 76 (299) T ss_pred Cc---cch-hHHHHHHHHHHHHHhhceeeeeccCcccceeeecCCCEEEEeccccccccccccCCCcccccccCcceeEE Confidence 22 121 67999999999999999999877654 34 46899999999999999999987544444567788999 Q ss_pred EEeehhhhhhhHHHHHHHhcCcc--hHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCccccccccccC Q lcl|NC_020078. 95 KIDTVIIARNAEPMLDEFQTDFD--YQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAG 172 (339) Q Consensus 95 ~ID~~~y~~~~vdd~D~~q~~~d--~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~ 172 (339) +|||.+||.|.||++|..|++.. .-..+.+.+.+.++-.+|.+.+..|+..|.. .+. .. T Consensus 77 ~ldqdr~~~f~vD~~Dvdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl~~~a~~---------------~g~----~~ 137 (299) T protein:vir:79 77 VLTNQRKWSTLVHPADINQTNYVASIGNITKVYNEEQKFPEMDAYCISKIYADWTA---------------LGN----TA 137 (299) T ss_pred EeeccccceeccchhhHHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHHHhhhh---------------cCC----cc Confidence 99999999999996666665544 4444556677788889999988888654321 011 12 Q ss_pred ccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceeecceeEEEeceEEEE- Q lcl|NC_020078. 173 ANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLNTKYMFAAFGVPVIT- 251 (339) Q Consensus 173 ~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~~G~v~~i~G~~V~~- 251 (339) .....+++++|++|+++.++|+|++| |.++||++|+|++|.+|+++++|+...=. .......+|.|++++||+|++ T Consensus 138 ~~~~~T~~n~y~~i~~~~~~lde~~v--P~~~rvl~vtp~~~~~L~~~~~f~k~~~~-~~~~~~~~g~Vg~idG~~Ii~V 214 (299) T protein:vir:79 138 DTTVLTTTNVLEVFDKLMEKMTEARV--PENGRILYVTPVVNTLIKNAKEIQRTVNI-KDAGTSLNRQTTDIDTVKIIKV 214 (299) T ss_pred cccccCHHHHHHHHHHHHHHHHhcCC--CCCCeEEEeCHHHHHHHhhchhhhccccc-ccccceeeeeeeeecceEEEEe Confidence 23335678999999999999999999 77899999999999999999999755322 223346799999999999997 Q ss_pred -eccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhh-HHHHHHHHHhC-Cccccc Q lcl|NC_020078. 252 -SNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSK-LWFIDSWLAFG-VTINRT 328 (339) Q Consensus 252 -Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~-~d~i~g~~~~G-a~v~rP 328 (339) |++++..-..+....+.. ...++--++.|++|+......+ .++++.|...+ +|+......|+ +-|+.- T Consensus 215 ps~r~~t~~~~~~G~~~~~--------~ak~in~ii~~~~a~~~~~K~~-~~~~~~P~~~~~~~~~~~~r~y~d~~v~~n 285 (299) T protein:vir:79 215 PSNLMKTAYDFTTGWKVGA--------GAKQIFMSLVHPSAIITPVSYQ-FSKLDEPTAVTEGKYFYFEESFEDVFILNK 285 (299) T ss_pred chhhcCccceeccCccccC--------cccccceEEEcCCeeeeeEeee-eEEeecCCCCCccceeeeeeeeeeeeeecc Confidence 677763222111111111 1223335788999998888777 77887765433 33333344444 333333 Q ss_pred cceE-EEEecCC Q lcl|NC_020078. 329 EYAG-VIKLPAA 339 (339) Q Consensus 329 e~~v-~i~~~~a 339 (339) ..-+ -+...+| T Consensus 286 k~~~i~~~~~~a 297 (299) T protein:vir:79 286 KADAIQFVVEGA 297 (299) T ss_pred ccCeEEEEeeec Confidence 3222 2333444 No 42 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=100.00 E-value=1.4e-38 Score=228.14 Aligned_cols=288 Identities=9% Similarity=0.005 Sum_probs=198.4 Q ss_pred cccchhHHHH-HHHHHHHHHHHHHHhhhccccccc---cc---cccceEEEeccccceeeeccCC--CCCCCCCCCCccc Q lcl|NC_020078. 21 GAGDPLADVT-EQFTGTVEGTIKRRSIMAGFVPVR---SV---RGTSTISNRGISKAKLQKIAPG--TTPPPSTEPHTSK 91 (339) Q Consensus 21 ~~~~~~a~~i-e~~~g~v~~~f~~~sv~~~~v~~r---~i---~~G~tv~i~~iG~~t~~~~~~g--~~i~~~~~~~~~~ 91 (339) =+ +-...|| ++|+.++++.|++.+||.++|+++ ++ +.|+||+|++.+..++++|.++ +.+..+ ++...+ T Consensus 1 MA-N~llT~iP~iia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p~~~~v~d~~~~~~~~~~~~-~~~e~~ 78 (423) T protein:vir:35 1 MA-NNLESNISQIVLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRPHQFKSERTETGDITGKDKN-GLFSAK 78 (423) T ss_pred Cc-cchhhhhHHHHHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeCCcceeecccCcCCCCcccc-ccccce Confidence 11 2222464 999999999999999999999875 44 3499999999999999999864 456653 455667 Q ss_pred eEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCcccccccccc Q lcl|NC_020078. 92 IFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLA 171 (339) Q Consensus 92 ~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~ 171 (339) ..|+||+.||++|.++|.|++|..-|+. ...+.++++|++.+|+.++..+..++-. ..| +. T Consensus 79 v~l~id~~k~~a~~v~d~e~~l~i~~~~-~~l~~a~~ala~~vd~~l~~~l~~~a~~----------~vg----t~---- 139 (423) T protein:vir:35 79 ATGKVGKYITVAVEWTQIEEALKLNQLD-QILSPIHERMVTDLETELAHFMMNNGAL----------SLG----SP---- 139 (423) T ss_pred eeEEeccceeccceeCHHHHHhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhcccc----------ccc----cc---- Confidence 8899999999999999999999888885 5777889999999999998777654311 001 10 Q ss_pred CccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhccc-chhhhcccccccceeeccee-EEEeceEE Q lcl|NC_020078. 172 GANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAE-HITNGEYVTSAGETLNTKYM-FAAFGVPV 249 (339) Q Consensus 172 ~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~-~~~n~d~~~~~~~~l~~G~v-~~i~G~~V 249 (339) .++..-|+.|.+++.+|++++| |.++||+||+|++|..|++++ +|.+.+-. ....+++|.| |+++||+| T Consensus 140 -----~t~~~~~~~i~~a~~~Ld~~~v--P~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~--~~~alr~g~i~G~i~GFdv 210 (423) T protein:vir:35 140 -----NTAIKKWADVAQTASFIKDIGI--KTGENYAIMDPWSAQRLADAQSGLHAADQL--VRTAWENAQISGNFGGIRA 210 (423) T ss_pred -----cCCcchHHHHHHHHHHHHHhcC--CcCCCEEEeCHHHHHHHhccccceeccccc--hhHHHhhccceeeecceEE Confidence 0111236889999999999999 778999999999999998755 56655422 3345888876 99999999 Q ss_pred EEeccccccccccccccC-----------------------------C-----Ccccccccc------------------ Q lcl|NC_020078. 250 ITSNNAVFGKTITDHLLS-----------------------------N-----ANNEKAYDG------------------ 277 (339) Q Consensus 250 ~~Snnlp~~~~~~~~~l~-----------------------------~-----~~~~~~y~~------------------ 277 (339) |+|||+|..+.++.+... . .|+...|.+ T Consensus 211 ~~Snnvp~~T~gt~~~~~~v~~a~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~t~aGv~~v~~~t~~~~~~~~t~ 290 (423) T protein:vir:35 211 LMSNGLASRKQGDFDGAITVKTAPNVDYLSVKDSYQFTVALTGATPSKTGFLKAGDQLKFTSTHWLNQQSKQTLYNGSTA 290 (423) T ss_pred EEcCCCccccccccccceeeccccccccccccccccceeeeeeeeeccCCcEEecceEEeeeeeeccccccceeecccCC Confidence 999999964333322100 0 001100111 Q ss_pred -------c------------------------------------------------cceEEEEEeccceeEEEEEeeeeE Q lcl|NC_020078. 278 -------D------------------------------------------------FKDIVAQMFSPKALLAGSTIPVTS 302 (339) Q Consensus 278 -------~------------------------------------------------~~~~~~~~~h~~A~~~~~~~~~~~ 302 (339) . ......|+|||+|++++...-... T Consensus 291 ~~~~~~V~~~~~~~a~g~~~v~i~p~~~~~~~~~~~~~v~a~~a~~~~vt~~~~a~~~~~~nl~~~~~a~~l~~~~l~~~ 370 (423) T protein:vir:35 291 MSFTATVLEETNSTASGDVTVKLSGVPIYDEKNSQYNAVDAKVKAGDAVSIIGTAKQQMKPNLFYNKFFCGLGTIPLPKL 370 (423) T ss_pred ceeEEEEeccccccccCceeEEccccccccCCCcccccccccccCCceeeeeecCCCceeEEEeecCceeEEEEEccccC Confidence 0 012245899999998876532211 Q ss_pred -----------------EeeechhhhHHHHHHHHHhCCccccccceEEEEecC Q lcl|NC_020078. 303 -----------------KIFFDDLSKLWFIDSWLAFGVTINRTEYAGVIKLPA 338 (339) Q Consensus 303 -----------------e~~~~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~ 338 (339) ..+++.+..-..++==..||.+.+|||.++-+.=.- T Consensus 371 ~~~~~~~~~~~g~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~~~~g~~ 423 (423) T protein:vir:35 371 HSLDSAVATYEGFSIRVHKYADGDANKQMMRFDLLPAYVCFNPHMGGQFFGNP 423 (423) T ss_pred CccceeeccccCceEEEEEeeccccCceEEEEEeecceeeecccceEEEEecC Confidence 112222211111111236999999999986664333 No 43 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=100.00 E-value=1.1e-38 Score=228.75 Aligned_cols=286 Identities=11% Similarity=0.011 Sum_probs=195.7 Q ss_pred cccchhHHH-HHHHHHHHHHHHHHHhhhccccccc---cc---cccceEEEeccccceeeeccCCC--CCCCCCCCCccc Q lcl|NC_020078. 21 GAGDPLADV-TEQFTGTVEGTIKRRSIMAGFVPVR---SV---RGTSTISNRGISKAKLQKIAPGT--TPPPSTEPHTSK 91 (339) Q Consensus 21 ~~~~~~a~~-ie~~~g~v~~~f~~~sv~~~~v~~r---~i---~~G~tv~i~~iG~~t~~~~~~g~--~i~~~~~~~~~~ 91 (339) =+...+ .| .++|+.++++.|++.+||.++|+++ ++ +.|+||+|++.+..++++|.... .+.. +++...+ T Consensus 1 MaN~ll-T~ip~iia~~al~~l~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~~~~~~~~~~~~~~~~~~-~~l~e~~ 78 (423) T protein:vir:17 1 MPNNLD-SNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNK-NNLISGK 78 (423) T ss_pred Cccchh-hhhHHHHHHHHHHHHHhhcccchhhcccCCcchhhcccCCEEEEeeCCcceeecccCcccCCccc-Cccccce Confidence 112222 35 4999999999999999999999876 33 36999999999999999997532 3443 3455667 Q ss_pred eEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCcccccccccc Q lcl|NC_020078. 92 IFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLA 171 (339) Q Consensus 92 ~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~ 171 (339) ..|+||+.||++|.++|.|++|..-|+ +++.+.++++||+.+|+.++..+.+.+... . ++. T Consensus 79 v~l~id~~k~va~~v~d~E~~~~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~a~~~----------~----gt~---- 139 (423) T protein:vir:17 79 ATGRVGNYITVAVEYQQLEEAIKLNQL-EEILAPVRQRIVTDLETELAHFMMNNGALS----------L----GSP---- 139 (423) T ss_pred eEEEeeceeeeeeeecHHHHhcChhHH-HHHHHHHHHHHHHHHHHHHHHHHhhccccc----------c----ccC---- Confidence 899999999999999999999766666 789999999999999999987765533110 0 000 Q ss_pred CccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccc-hhhhcccccccceeeccee-EEEeceEE Q lcl|NC_020078. 172 GANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEH-ITNGEYVTSAGETLNTKYM-FAAFGVPV 249 (339) Q Consensus 172 ~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~-~~n~d~~~~~~~~l~~G~v-~~i~G~~V 249 (339) + ++...|+.+.+++.+|++++| |.++||+||+|++|..|+++++ |.+.+ .+ ....+++|.| |+++||+| T Consensus 140 ~-----t~~~a~~~i~~a~~~Ld~~~v--P~~~R~~Vv~p~~~a~Ll~~~~~~~~~~-~~-~~~alr~g~i~G~i~GFdv 210 (423) T protein:vir:17 140 N-----TPITKWSDVAQTASFLKDLGV--NEGENYAVMDPWSAQRLADAQTGLHASD-QL-VRTAWENAQIPTNFGGIRA 210 (423) T ss_pred C-----cccccHHHHHHHHHHHHhccC--CcCCCEEEeChHHHHHHhccccceeccc-cc-chHHHhhccceeeecceEE Confidence 0 011136789999999999999 7789999999999999998765 44443 22 2345899987 89999999 Q ss_pred EEecccccccccccccc-----------------------------C-----CCcccccccc------------------ Q lcl|NC_020078. 250 ITSNNAVFGKTITDHLL-----------------------------S-----NANNEKAYDG------------------ 277 (339) Q Consensus 250 ~~Snnlp~~~~~~~~~l-----------------------------~-----~~~~~~~y~~------------------ 277 (339) |+|||+|..+.++.|.. + ..|+.-.|.+ T Consensus 211 y~Snnip~~T~gt~~~t~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~t~aGv~~v~~~tk~v~~~~~t~ 290 (423) T protein:vir:17 211 LMSNGLASRTQGAFGGTLTVKTQPTVTYNAVKDSYQFTVTLTGATTSVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATP 290 (423) T ss_pred EEeCCCccccccceeceeeecccccccccccccccceeeeeeeeeeeccCceeecceEEecceeeecccccccccccccc Confidence 99999995443332210 0 0011111111 Q ss_pred -------c------------------------------------------------cceEEEEEeccceeEEEEEeeee- Q lcl|NC_020078. 278 -------D------------------------------------------------FKDIVAQMFSPKALLAGSTIPVT- 301 (339) Q Consensus 278 -------~------------------------------------------------~~~~~~~~~h~~A~~~~~~~~~~- 301 (339) . .....-|+|||+|++++...-.. T Consensus 291 ~~~~~~v~~~~~~~a~~~~tv~i~p~~i~~~~~~~~~~v~a~~a~~~~vT~~~~a~~t~~~nl~~~~~a~~l~~~pl~~~ 370 (423) T protein:vir:17 291 ISFTATVTADANSDSSGDVTVTLSGVPIYDTTNPQYNSVSRQVAAGDAVSVVGTASQTMKPNLFYNKFFCGLGSIPLPKL 370 (423) T ss_pred cceEEEEEecccccccCceEEEecCccccccCCcccccceecccCCceeeccccccCCeeEEEEecCcceEEEEEcccCC Confidence 0 01123379999999877642211 Q ss_pred ----------------EEeeechhhhHHHHHHH--HHhCCccccccceEEEEecC Q lcl|NC_020078. 302 ----------------SKIFFDDLSKLWFIDSW--LAFGVTINRTEYAGVIKLPA 338 (339) Q Consensus 302 ----------------~e~~~~~~~~~d~i~g~--~~~Ga~v~rPe~~v~i~~~~ 338 (339) +-.+++.+ .+..... ..||.+.+|||.++-+-=.- T Consensus 371 ~~~~~~~~~~~g~s~r~~~~~d~~--~~~~~~r~d~l~g~~~~~p~~~~~~~g~~ 423 (423) T protein:vir:17 371 HSIDSAVATYEGFSIRVHKYADGD--ANVQKMRFDLLPAYVCFNPHMGGQFFGNP 423 (423) T ss_pred CccceeecccCCcEEEEEEecccc--cceeEEEEEeecceeeeccceEEEEEecC Confidence 11112221 1221122 35999999999996664433 No 44 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=100.00 E-value=2.3e-38 Score=226.97 Aligned_cols=288 Identities=11% Similarity=0.019 Sum_probs=196.7 Q ss_pred cccchhHHH-HHHHHHHHHHHHHHHhhhccccccc---cc---cccceEEEeccccceeeeccCC--CCCCCCCCCCccc Q lcl|NC_020078. 21 GAGDPLADV-TEQFTGTVEGTIKRRSIMAGFVPVR---SV---RGTSTISNRGISKAKLQKIAPG--TTPPPSTEPHTSK 91 (339) Q Consensus 21 ~~~~~~a~~-ie~~~g~v~~~f~~~sv~~~~v~~r---~i---~~G~tv~i~~iG~~t~~~~~~g--~~i~~~~~~~~~~ 91 (339) =+...+ .| .++|+.++++.|++.+|+.++|+++ ++ +.|+||+|++.+..++++|+++ ..+.. +++...+ T Consensus 1 MaN~ll-T~~p~iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~~~~~~d~~~~~~~~~~~-~dl~e~~ 78 (423) T protein:vir:10 1 MPNNLD-SNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNK-NNLISGK 78 (423) T ss_pred Cccchh-hhhHHHHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCCceeeeccCCcccccccc-Cccccce Confidence 112222 34 4999999999999999999999875 34 3599999999999999999964 34444 3466678 Q ss_pred eEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCcccccccccc Q lcl|NC_020078. 92 IFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLA 171 (339) Q Consensus 92 ~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~ 171 (339) ..|+||+.||+.|.++|.|++|..-|+ +.+.+.++++||+.+|+.++..+..++-. .. ++ . T Consensus 79 v~l~id~~k~va~~v~d~E~~~~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~~~~----------~~----gt---~- 139 (423) T protein:vir:10 79 ATGRVGNYITVAVEYQQLEEAIKLNQL-EEILAPVRQRIVTDLETELAHFMMNNGAL----------SL----GS---P- 139 (423) T ss_pred eEEEeeceeeeeeeechHHHhcChhhH-HHHHHHHHHHHHHHHHHHHHHHHhhcccc----------cc----cc---C- Confidence 899999999999999999998766565 88999999999999999998765543210 00 00 0 Q ss_pred CccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccc-hhhhcccccccceeeccee-EEEeceEE Q lcl|NC_020078. 172 GANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEH-ITNGEYVTSAGETLNTKYM-FAAFGVPV 249 (339) Q Consensus 172 ~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~-~~n~d~~~~~~~~l~~G~v-~~i~G~~V 249 (339) + . +...|+.+.+++.+|++++| |..+||+||+|++|..|+++++ |.+.+- .....+++|.| |+++||+| T Consensus 140 ~--t---~~~a~~~i~~a~~~Ld~~~v--P~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~--~~~~alr~g~i~G~i~GFdv 210 (423) T protein:vir:10 140 N--T---PITKWSDVAQTASFLKDLGV--NEGENYAVMDPWSAQRLADAQTGLHASDQ--LVRTAWENAQIPTNFGGIRA 210 (423) T ss_pred C--c---ccchHHHHHHHHHHHHhccC--CcCCCEEEeChHHHHHHhccccceecccc--cchhhhhhccceeeecceEE Confidence 0 0 11236789999999999999 7789999999999999997665 454432 22345899987 89999999 Q ss_pred EEecccccccccccccc-----------------------------C-----CCcccccccc------------------ Q lcl|NC_020078. 250 ITSNNAVFGKTITDHLL-----------------------------S-----NANNEKAYDG------------------ 277 (339) Q Consensus 250 ~~Snnlp~~~~~~~~~l-----------------------------~-----~~~~~~~y~~------------------ 277 (339) |+|||+|..+.++.|.. . ..|+.-.|.+ T Consensus 211 ~~Snnip~~T~gt~~~t~~~~~~~~v~~~a~~~a~~~~~~~~~~~~~~~~~l~~GD~~t~aGv~~v~~~tk~~~~~~~t~ 290 (423) T protein:vir:10 211 LMSNGLASRTQGAFGGTLTVKTQPTVTYNAVKDSYQFTVTLTGATASVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATP 290 (423) T ss_pred EEeCCCccccccccccceeeeecceeccccccccceeeeeeeeccccccCceeecceEEecceeeecccccccccccccC Confidence 99999996444332210 0 0011111111 Q ss_pred -----------------c--------------------------------------cceEEEEEeccceeEEEEEee--- Q lcl|NC_020078. 278 -----------------D--------------------------------------FKDIVAQMFSPKALLAGSTIP--- 299 (339) Q Consensus 278 -----------------~--------------------------------------~~~~~~~~~h~~A~~~~~~~~--- 299 (339) + .....-|+|||+|++++...- T Consensus 291 ~~~~~~v~a~~~~~~~g~~tv~i~p~~i~~~~~~~~~~v~a~~a~~~~vT~~~~a~~t~~~nl~~~~~a~~l~~~pl~~~ 370 (423) T protein:vir:10 291 ISFTATVTADANSDSGGDVTVTLSGVPIYDTTNPQYNSVSRQVEAGDAVSVVGTASQTMKPNLFYNKFFCGLGSIPLPKL 370 (423) T ss_pred cceEEEEEeeeeeccCCceeeeccCccccccCCcccccccccccCCceeeccccccCCeeEEEEecCcceEEEEEcccCC Confidence 0 012234899999998776422 Q ss_pred --------------eeEEeeechhhhHHHHHHHHHhCCccccccceEEEEecC Q lcl|NC_020078. 300 --------------VTSKIFFDDLSKLWFIDSWLAFGVTINRTEYAGVIKLPA 338 (339) Q Consensus 300 --------------~~~e~~~~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~ 338 (339) +.+-.+++.+..-..++==..||.+.+|||.++-+-=.- T Consensus 371 ~~~~~~~~~~~g~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~~~~g~~ 423 (423) T protein:vir:10 371 HSIDSAVATYEGFSIRVHKYADGDANVQKMRFDLLPAYVCFNPHMGGQFFGNP 423 (423) T ss_pred CccceeeccccCceEEEEEeeeccccceEEEEEeecceeeeccceEEEEEecC Confidence 111112222211111111235999999999996664433 No 45 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=100.00 E-value=8.6e-38 Score=223.79 Aligned_cols=262 Identities=15% Similarity=0.120 Sum_probs=212.0 Q ss_pred ccCcccchhHHHH-HHHHHHHHHHHHHHhhhccccccc-ccc--ccceEEEecccc-ceeeeccCCCCCCCCCCCCccce Q lcl|NC_020078. 18 QRHGAGDPLADVT-EQFTGTVEGTIKRRSIMAGFVPVR-SVR--GTSTISNRGISK-AKLQKIAPGTTPPPSTEPHTSKI 92 (339) Q Consensus 18 ~~~~~~~~~a~~i-e~~~g~v~~~f~~~sv~~~~v~~r-~i~--~G~tv~i~~iG~-~t~~~~~~g~~i~~~~~~~~~~~ 92 (339) ..+..+..-++++ |+|+..|.+.+++.+++.+++.+. ++. .|++|+||+++. ..+..+..|+.++.. .+..++. T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~-~~~~~~~ 79 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMT-QLGFKKT 79 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCccccc-ccccceE Confidence 2211222223444 999999999999999999998765 444 599999999874 588899999999865 5778899 Q ss_pred EEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCccccccccccC Q lcl|NC_020078. 93 FLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAG 172 (339) Q Consensus 93 ~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~ 172 (339) ++++++ .+..+.+.|++..|+..|+++++.+++++++++.+|+.++..+.++... + .+ T Consensus 80 ~~~~~~-~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~-------------------~--~~ 137 (272) T protein:vir:98 80 TMTIKK-AGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQT-------------------V--EA 137 (272) T ss_pred EEEeee-eeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-------------------c--cc Confidence 999987 4577999999999999999999999999999999999998765432100 0 00 Q ss_pred ccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhccc--chhhhcccccccceeecceeEEEeceEEE Q lcl|NC_020078. 173 ANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAE--HITNGEYVTSAGETLNTKYMFAAFGVPVI 250 (339) Q Consensus 173 ~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~--~~~n~d~~~~~~~~l~~G~v~~i~G~~V~ 250 (339) .. -++.|.++..+|++.+. ..|+++|+|++|..|+++. ++++. +..+...+++|.+++++|++|+ T Consensus 138 ---~~----t~d~i~da~~~l~~~~~----~~~~~vv~p~~~~~L~k~~~~~~~~~--~~~~~~~~~~g~ig~i~G~~Vi 204 (272) T protein:vir:98 138 ---TA----TVDGVSKALDIFNDEDD----AETVIVMNPADASTLRLDAAKEWLGA--TEVGANRVVSGVYGEVLGVQIV 204 (272) T ss_pred ---cc----CHHHHHHHHHHHhccCC----CccEEEEcHHHHHHHHHhcccccccc--ccccccccccccchhhcCeeEE Confidence 01 15678888899987654 4689999999999999874 44443 3333345789999999999999 Q ss_pred EeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhHHHHHHHHHhCCccccccc Q lcl|NC_020078. 251 TSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWFIDSWLAFGVTINRTEY 330 (339) Q Consensus 251 ~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~i~g~~~~Ga~v~rPe~ 330 (339) +||++|.+ .++++++.|++++...++.+|..|+++++.+.|++++.||.++++|++ T Consensus 205 ~s~~~p~~------------------------t~~~~~~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~~ 260 (272) T protein:vir:98 205 RSRKCPKG------------------------TAYMVRKGALRIMLKRNTMVETDRDITKAINQIVANKHYGVYLYKAEK 260 (272) T ss_pred EcCCCCcc------------------------eEEEEcCCeEEEEecCCceeeeccccccceeEEEEEEEEEEEEEcCCc Confidence 99999832 247889999999999999999999999999999999999999999999 Q ss_pred eEEEEecCC Q lcl|NC_020078. 331 AGVIKLPAA 339 (339) Q Consensus 331 ~v~i~~~~a 339 (339) ++.+++.+| T Consensus 261 vv~~t~~~a 269 (272) T protein:vir:98 261 AVKITLKDA 269 (272) T ss_pred eEEEEeccc Confidence 999999999 No 46 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=100.00 E-value=8.6e-38 Score=223.79 Aligned_cols=262 Identities=15% Similarity=0.120 Sum_probs=212.0 Q ss_pred ccCcccchhHHHH-HHHHHHHHHHHHHHhhhccccccc-ccc--ccceEEEecccc-ceeeeccCCCCCCCCCCCCccce Q lcl|NC_020078. 18 QRHGAGDPLADVT-EQFTGTVEGTIKRRSIMAGFVPVR-SVR--GTSTISNRGISK-AKLQKIAPGTTPPPSTEPHTSKI 92 (339) Q Consensus 18 ~~~~~~~~~a~~i-e~~~g~v~~~f~~~sv~~~~v~~r-~i~--~G~tv~i~~iG~-~t~~~~~~g~~i~~~~~~~~~~~ 92 (339) ..+..+..-++++ |+|+..|.+.+++.+++.+++.+. ++. .|++|+||+++. ..+..+..|+.++.. .+..++. T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~-~~~~~~~ 79 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMT-QLGFKKT 79 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCccccc-ccccceE Confidence 2211222223444 999999999999999999998765 444 599999999874 588899999999865 5778899 Q ss_pred EEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCccccccccccC Q lcl|NC_020078. 93 FLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAG 172 (339) Q Consensus 93 ~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~ 172 (339) ++++++ .+..+.+.|++..|+..|+++++.+++++++++.+|+.++..+.++... + .+ T Consensus 80 ~~~~~~-~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~-------------------~--~~ 137 (272) T protein:vir:30 80 TMTIKK-AGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQT-------------------V--EA 137 (272) T ss_pred EEEeee-eeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-------------------c--cc Confidence 999987 4577999999999999999999999999999999999998765432100 0 00 Q ss_pred ccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhccc--chhhhcccccccceeecceeEEEeceEEE Q lcl|NC_020078. 173 ANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAE--HITNGEYVTSAGETLNTKYMFAAFGVPVI 250 (339) Q Consensus 173 ~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~--~~~n~d~~~~~~~~l~~G~v~~i~G~~V~ 250 (339) .. -++.|.++..+|++.+. ..|+++|+|++|..|+++. ++++. +..+...+++|.+++++|++|+ T Consensus 138 ---~~----t~d~i~da~~~l~~~~~----~~~~~vv~p~~~~~L~k~~~~~~~~~--~~~~~~~~~~g~ig~i~G~~Vi 204 (272) T protein:vir:30 138 ---TA----TVDGVSKALDIFNDEDD----AETVIVMNPADASTLRLDAAKEWLGA--TEVGANRVVSGVYGEVLGVQIV 204 (272) T ss_pred ---cc----CHHHHHHHHHHHhccCC----CccEEEEcHHHHHHHHHhcccccccc--ccccccccccccchhhcCeeEE Confidence 01 15678888899987654 4689999999999999874 44443 3333345789999999999999 Q ss_pred EeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhHHHHHHHHHhCCccccccc Q lcl|NC_020078. 251 TSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWFIDSWLAFGVTINRTEY 330 (339) Q Consensus 251 ~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~i~g~~~~Ga~v~rPe~ 330 (339) +||++|.+ .++++++.|++++...++.+|..|+++++.+.|++++.||.++++|++ T Consensus 205 ~s~~~p~~------------------------t~~~~~~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~~ 260 (272) T protein:vir:30 205 RSRKCPKG------------------------TAYMVRKGALRIMLKRNTMVETDRDITKAINQIVANKHYGVYLYKAEK 260 (272) T ss_pred EcCCCCcc------------------------eEEEEcCCeEEEEecCCceeeeccccccceeEEEEEEEEEEEEEcCCc Confidence 99999832 247889999999999999999999999999999999999999999999 Q ss_pred eEEEEecCC Q lcl|NC_020078. 331 AGVIKLPAA 339 (339) Q Consensus 331 ~v~i~~~~a 339 (339) ++.+++.+| T Consensus 261 vv~~t~~~a 269 (272) T protein:vir:30 261 AVKITLKDA 269 (272) T ss_pred eEEEEeccc Confidence 999999999 No 47 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=100.00 E-value=2.4e-37 Score=221.32 Aligned_cols=263 Identities=16% Similarity=0.139 Sum_probs=214.0 Q ss_pred ccCcccchhHHH-HHHHHHHHHHHHHHHhhhccccccc-ccc--ccceEEEeccccc-eeeeccCCCCCCCCCCCCccce Q lcl|NC_020078. 18 QRHGAGDPLADV-TEQFTGTVEGTIKRRSIMAGFVPVR-SVR--GTSTISNRGISKA-KLQKIAPGTTPPPSTEPHTSKI 92 (339) Q Consensus 18 ~~~~~~~~~a~~-ie~~~g~v~~~f~~~sv~~~~v~~r-~i~--~G~tv~i~~iG~~-t~~~~~~g~~i~~~~~~~~~~~ 92 (339) ..+..+-...++ -|+|+..|.+.+++.++|.++..+. ++. +|++|+||..+.. .+.++..|++|+.. .+.+++. T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg~~i~~~-~lt~~~~ 79 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEGQKIPVD-KIETNRR 79 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCCCccccccCCCccCcc-cccccee Confidence 333233333344 4999999999999999999998876 444 5999999988664 66789999999864 5778888 Q ss_pred EEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCccccccccccC Q lcl|NC_020078. 93 FLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAG 172 (339) Q Consensus 93 ~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~ 172 (339) +.+|. ..+..|.++|++..++..|++++.++++|++||+++|+.++..+..+.... T Consensus 80 ~a~i~-~~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~~~~----------------------- 135 (276) T protein:vir:10 80 EAKIH-KIGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTKLTV----------------------- 135 (276) T ss_pred eEEee-hccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccc----------------------- Confidence 99995 479999999999999999999999999999999999999987765422110 Q ss_pred ccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcc--cchhhhcccccccceeecceeEEEeceEEE Q lcl|NC_020078. 173 ANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQA--EHITNGEYVTSAGETLNTKYMFAAFGVPVI 250 (339) Q Consensus 173 ~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~--~~~~n~d~~~~~~~~l~~G~v~~i~G~~V~ 250 (339) .....+ ++.|.++..+|.++++ ..++++|.|++|..|+++ .+|++..-.+ ...+++|.|++++|++|+ T Consensus 136 ~~~~~t----~d~i~~A~~~lgd~~~----~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g--~~~~~~G~ig~~~G~~Vi 205 (276) T protein:vir:10 136 SADIGT----LAGLEAAIDTFDDEDL----EPMVLFINPKDAGKLRSSASDNFTRATELG--DNIIVKGAFGEALGAVIV 205 (276) T ss_pred cccccC----HHHHHHHHHHhccccC----cccEEEEcHHHHHHHHHhcccccccccccc--ccceeccccceecceeEE Confidence 011122 4677888999988765 468999999999999764 6888764333 346899999999999999 Q ss_pred EeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhHHHHHHHHHhCCccccccc Q lcl|NC_020078. 251 TSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWFIDSWLAFGVTINRTEY 330 (339) Q Consensus 251 ~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~i~g~~~~Ga~v~rPe~ 330 (339) +|+++|.. .++++++.|++++..+++++|..|++++++|.|.+++.||+++++|+. T Consensus 206 ~s~~~p~~------------------------t~~l~~~gAi~~~~~~~~~vE~dRd~~~~~d~i~~~~~y~~~~~~~~~ 261 (276) T protein:vir:10 206 RSKKLDEG------------------------EAILAKRGAVKLITKRDFFLETDRDPSTKTTALYSDKHYVAYLYDESK 261 (276) T ss_pred EcCCCCcc------------------------eEEEEeccceeeeecCCceeecccchhhcccEEEEeeEEEEEEEcCcc Confidence 99999732 246889999999999999999999999999999999999999999999 Q ss_pred eEEEEecCC Q lcl|NC_020078. 331 AGVIKLPAA 339 (339) Q Consensus 331 ~v~i~~~~a 339 (339) ++.++..+= T Consensus 262 vv~~t~~~~ 270 (276) T protein:vir:10 262 AVKVTKGAG 270 (276) T ss_pred eEEEecCCc Confidence 999986543 No 48 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=100.00 E-value=1.9e-35 Score=210.88 Aligned_cols=290 Identities=9% Similarity=-0.025 Sum_probs=192.2 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHhhhccccccc---cc---cccceEEEeccccceeeeccCCCCCCCC--CCCCccce Q lcl|NC_020078. 21 GAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVR---SV---RGTSTISNRGISKAKLQKIAPGTTPPPS--TEPHTSKI 92 (339) Q Consensus 21 ~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r---~i---~~G~tv~i~~iG~~t~~~~~~g~~i~~~--~~~~~~~~ 92 (339) =+....++-.++|+.++++.|++.+||.++|++. ++ +.|+||+|++.+..++++.. +..+.+. +++...+. T Consensus 1 MANsl~~l~p~iia~~al~~l~~~lV~~~lV~r~y~~ef~~ak~GDTV~I~~P~~~~~~d~~-~~~~t~~~~~~l~e~~v 79 (423) T protein:vir:10 1 MANNLDANVSQIVLKKFLPGFMSDLVLCKTVDRQLLAGEINSSTGDSVSFKRPHQFKSERTM-DGDITGKSKNSLISAKA 79 (423) T ss_pred CccccccccHHHHHHHHHHHHHhhcccchhhccCCCccccccccCCEEEEeeCCceeeeccc-CcccCcccccccccceE Confidence 2223333445999999999999999999999975 33 35999999999999998743 4444432 23444568 Q ss_pred EEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCccccccccccC Q lcl|NC_020078. 93 FLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAG 172 (339) Q Consensus 93 ~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~ 172 (339) .|+||+.||+.|.++|.|+++.-.|+ +.+.+.+.++||+.+|+.++..+.+.+-. ..| +. .. T Consensus 80 ~l~id~~k~~a~~v~d~E~~l~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~~~~----------~vg----t~---~t 141 (423) T protein:vir:10 80 TGEVGNYITVAVEYRQIEEALKLNQL-DQILVPINERMVTDLETELALFMMKHGAL----------SLG----SP---NT 141 (423) T ss_pred EEEecceeeeeeeeChHHHhcChhHH-HHHHHHHHHHHHHHHHHHHHHHhhhcccc----------ccc----cc---cc Confidence 99999999999999999998666666 78999999999999999997666543311 001 00 00 Q ss_pred ccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceeeccee-EEEeceEEEE Q lcl|NC_020078. 173 ANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLNTKYM-FAAFGVPVIT 251 (339) Q Consensus 173 ~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~~G~v-~~i~G~~V~~ 251 (339) + . .-++.+.+++.+|++++| |..+||+||+|++|..|++++.+......+ ....+++|.| |+++||+||+ T Consensus 142 ~---~---~a~~~~a~a~~~L~~~~v--P~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~-~~~alr~~~i~G~~~GFdi~~ 212 (423) T protein:vir:10 142 P---I---KKWSDVAQTASFLKDLGI--NSGENYAVMDPWAAQRLADAQSGLHVSEQL-VRTAWENAQISGNFGGIRALM 212 (423) T ss_pred c---c---ccHHHHHHHHHHHhhccC--CcCCCEEEeCHHHHHHHhhhhhhhcccccc-chHHHHhcccceeecceEEEE Confidence 0 1 125788999999999999 778999999999999999766544333222 2345888876 9999999999 Q ss_pred eccccccccccc----c----------cc-----------CC----C-----ccc------------------------- Q lcl|NC_020078. 252 SNNAVFGKTITD----H----------LL-----------SN----A-----NNE------------------------- 272 (339) Q Consensus 252 Snnlp~~~~~~~----~----------~l-----------~~----~-----~~~------------------------- 272 (339) ||++|..+.++. + +. +. . |+. T Consensus 213 Sn~vp~~T~g~~~ga~~~~~~~~vt~a~~~~~~~~~~~~~~~T~s~~g~l~~GD~~t~aGv~~v~~~tk~~l~~~~~~~~ 292 (423) T protein:vir:10 213 SNGLASRTQGAFGGKLTVKGTPEVNYDSVKDSYAFTATLTGATASKKGFLKVGDQLQFDDTHWLNQQSKQTLYNGASALS 292 (423) T ss_pred ecCCcccccccccceeeeeeeeEEEecccccccccccceeeccceeceeEEecceEeecceeeecccccceeecccCCcc Confidence 999994222110 0 00 00 0 000 Q ss_pred cccccc------------------------------------------------cceEEEEEeccceeEEEEEee----- Q lcl|NC_020078. 273 KAYDGD------------------------------------------------FKDIVAQMFSPKALLAGSTIP----- 299 (339) Q Consensus 273 ~~y~~~------------------------------------------------~~~~~~~~~h~~A~~~~~~~~----- 299 (339) ..|.+. .....-|+|||+|++++...- T Consensus 293 ~~~~V~~~~~~~a~~~~tv~i~p~~~~~~~~~~~~~V~a~~a~~~~vT~~~~~~~t~~~nl~~~~~a~~l~~~pl~~~~~ 372 (423) T protein:vir:10 293 FTATVMEDANAHSSGDVTVKISGVPIFDAGYPQYNAVDRLLAEGDTVSVIGTSKQAMKPNLFYNKLFCGLGTIPLPKLHS 372 (423) T ss_pred eEEEEEecccccccCceEEEeccccccccCcccccceeccccCCceeEEeeccCCceeEEEEecCcceEEEEEcccCCCc Confidence 011110 011234899999998775422 Q ss_pred ------------eeEEeeechhhhHHHHHHHHHhCCccccccceEEEEecC Q lcl|NC_020078. 300 ------------VTSKIFFDDLSKLWFIDSWLAFGVTINRTEYAGVIKLPA 338 (339) Q Consensus 300 ------------~~~e~~~~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~ 338 (339) +.+-.+++.+..-..++==..||.+.+|||.++-+-=.- T Consensus 373 ~~~~~~~~~g~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~~~~g~~ 423 (423) T protein:vir:10 373 IDSAVATYEGFSIRVHKYADGDANKQMMRFDLLPAYVCYNPHMGGQFFGNP 423 (423) T ss_pred cceeecccccceEEEEEeeeccccceEEEEEeecceeeeccceEEEEEecC Confidence 111222222211111211235999999999986664433 No 49 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=100.00 E-value=1e-33 Score=201.41 Aligned_cols=281 Identities=8% Similarity=0.085 Sum_probs=208.3 Q ss_pred chhHHHHHHHHHHHHHHHHHHhhhccccccc-cccccceEEEeccccceeeeccCCCCCCCCCCCCccceEEEEeehhhh Q lcl|NC_020078. 24 DPLADVTEQFTGTVEGTIKRRSIMAGFVPVR-SVRGTSTISNRGISKAKLQKIAPGTTPPPSTEPHTSKIFLKIDTVIIA 102 (339) Q Consensus 24 ~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r-~i~~G~tv~i~~iG~~t~~~~~~g~~i~~~~~~~~~~~~l~ID~~~y~ 102 (339) =..+ +.|+|+++|++.|.+.+++..+.+.+ ...+|++|+||+|+.+.+++|++++.....+ ++.+..++++||.+|| T Consensus 1 Main-~a~~~~~~Ld~~~~~~~~t~~l~~~~~~~~ggktVkI~~i~~~gl~DY~R~~g~~~g~-v~~~~et~tl~qdR~~ 78 (290) T protein:vir:78 1 MAIN-YVDKYGKELDQKLVFGTYTNELETPNLLWLDAKTFKIQTITTTGLKAHTRNKGYNEGS-ASNTNKSYTIDFDRDV 78 (290) T ss_pred Cchh-HHHHHHHHHHHHHHhhheeeeccccceeeccCCEEEEeeeccCcccccccCCCcccCc-cccceeeEEeeccccc Confidence 1111 44999999999999999999987655 5668999999999999999999999887654 5678889999999999 Q ss_pred hhhHH--HHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCccccccccccCccccccHH Q lcl|NC_020078. 103 RNAEP--MLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAGANDYKDPA 180 (339) Q Consensus 103 ~~~vd--d~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~ 180 (339) .|.|| |+||.|....+.....+.+.+.++-.+|.+.+..|+..|...+ ...+.+.+++ T Consensus 79 ~F~vD~~DvDEt~~~~~~~nv~~ef~~~~v~PEiDayr~skla~~a~~~~--------------------~~~~~t~t~~ 138 (290) T protein:vir:78 79 EFFVDVMDVDETGQALSAANVTKEFNSRHAGPEMDAYRFSKLATAAKTNS--------------------NSVAEEITKD 138 (290) T ss_pred eeeccccchhHHhhhhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhhhccC--------------------cccccccCHH Confidence 99999 9999999999999999999999999999999988876553110 0011235678 Q ss_pred HHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceeecceeEEEeceEEEEec--ccccc Q lcl|NC_020078. 181 KLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLNTKYMFAAFGVPVITSN--NAVFG 258 (339) Q Consensus 181 ~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~~G~v~~i~G~~V~~Sn--nlp~~ 258 (339) ++|++|+++.++|+| | |.++||++|+|++|.+|+++++|+..-=.+..+....+|.|++++||+|++.+ ++-.+ T Consensus 139 n~~~~i~~~~~~lde--v--p~~~rvl~vtp~~~~lL~~~~~f~r~~~~~~~~~~~i~~~V~~idG~~ii~vps~~r~~t 214 (290) T protein:vir:78 139 NVFTKLKAAIRKVKK--Y--GTQNLVMYVSPDVMAALELSDDFVRAINVQNIGPSSIETRITAIDGTRIVEVEAEDRFYD 214 (290) T ss_pred HHHHHHHHHHHHHHh--c--CCCCeEEEECHHHHHHHhhChhhhccccccccccccccceeeeecCcEEEEecccchhhh Confidence 999999999999986 7 56899999999999999999999742111111222348999999999999854 22211 Q ss_pred ccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhh---hHHHHHHHHHhCCccccccceEEEE Q lcl|NC_020078. 259 KTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLS---KLWFIDSWLAFGVTINRTEYAGVIK 335 (339) Q Consensus 259 ~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~---~~d~i~g~~~~Ga~v~rPe~~v~i~ 335 (339) . . ..+. | ..-.....++--++.|++|+......+ .+..+.|... -+|.+..+.-+.+-|+.-.. -.|+ T Consensus 215 ~-~--~f~~--G--~~~~~~ak~in~ii~~~~a~i~~~K~~-~~~~~~P~~~~~~d~~~~~~r~y~d~~v~~nk~-~~i~ 285 (290) T protein:vir:78 215 T-F--DFTD--G--YKPAAGAKKLNFLLVNKGSVVGGAKHA-SIYLHAPGSVGQGDGWLYQYRVYHDIFVLDQQK-DGVI 285 (290) T ss_pred h-h--hhcc--c--ccccCCccceeEEEEcCCceeeeeeee-EEEeeCCCCCcCcceeeeeeeeeeeeeeecccc-CeeE Confidence 1 0 0000 0 000112334456788999998887777 6787776643 35788888877777775443 3444 Q ss_pred ecCC Q lcl|NC_020078. 336 LPAA 339 (339) Q Consensus 336 ~~~a 339 (339) ..++ T Consensus 286 ~~~~ 289 (290) T protein:vir:78 286 ASTE 289 (290) T ss_pred EEee Confidence 4445 No 50 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=99.94 E-value=1.7e-29 Score=178.34 Aligned_cols=284 Identities=11% Similarity=0.032 Sum_probs=195.4 Q ss_pred chhHHHHHHHHHHHHHHHHHHhhhccc-c----ccc-cccccceEEEeccc-cceeeeccCCCCCCCCCCCCccceEEEE Q lcl|NC_020078. 24 DPLADVTEQFTGTVEGTIKRRSIMAGF-V----PVR-SVRGTSTISNRGIS-KAKLQKIAPGTTPPPSTEPHTSKIFLKI 96 (339) Q Consensus 24 ~~~a~~ie~~~g~v~~~f~~~sv~~~~-v----~~r-~i~~G~tv~i~~iG-~~t~~~~~~g~~i~~~~~~~~~~~~l~I 96 (339) =.. -|.++|+.+|++.|...++.... . +.+ ...+|++|+||+|. .+-.++|+++.-......++.+..++++ T Consensus 1 Mai-nya~~~~~~Ld~~~~~~~lts~~l~~~~~~~~v~~~ggktVkIp~is~tsGl~DY~R~~g~~~~g~v~~~~et~tl 79 (346) T protein:vir:10 1 MTI-NYAEKYQAAVQQAFYDGHLYSAELWNSPSNSIIKFDGAKHIKVPRLEITSGRKDRQRRTITTPVANYSNDWDSYEL 79 (346) T ss_pred Ccc-hhHHHHHHHHHHHHHhhhccchhhcccccccceEecCCCEEEEEEeeeecccccccccCCcccccccccceeEEEe Confidence 111 14599999999999887766332 1 222 44689999999995 5679999987766544457788999999 Q ss_pred eehhhhhhhHH--HHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCccccccccccCcc Q lcl|NC_020078. 97 DTVIIARNAEP--MLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAGAN 174 (339) Q Consensus 97 D~~~y~~~~vd--d~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 174 (339) ||.++|.|.|| |+||.+....+...+.+.+....+-.+|.+.|..|+..+...+ .....+ T Consensus 80 ~qDR~~~F~vD~mDvDETn~~~~~anv~~ef~r~~vvPEiDayrfskLa~~a~~~~------------------~~~~~~ 141 (346) T protein:vir:10 80 KNERYWSTLVDPSDIDETNMVVSLANITKQFNLDSKMPEKDRYMFSHLYSGKEAAH------------------DGGITT 141 (346) T ss_pred eccccceecccccchHHHHHHhHHHHHHHHHHHHhhcchhhHHHHHHHHHhhhhhc------------------cccccc Confidence 99999999999 8888876666655555556666777889998888875442110 001112 Q ss_pred ccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceeecceeEEEeceEEEE--e Q lcl|NC_020078. 175 DYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLNTKYMFAAFGVPVIT--S 252 (339) Q Consensus 175 ~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~~G~v~~i~G~~V~~--S 252 (339) .+.+++++|++|+++.++|+|+.| |.++||++|+|++|.+|.++++|...- ...+... -+|.|++++||+|++ | T Consensus 142 ~a~T~~ni~~~i~~~~~~lde~~v--p~~~rvl~vTp~~~~lLk~s~~f~k~~-~v~~~~~-i~~~V~siDGv~Ii~VPs 217 (346) T protein:vir:10 142 NTLDEKNILPAFDNMMLDFDEARI--PSTNRILYVTPKTNAILKRAEAMNRAL-TLKDPNN-IQRTVYSLDDVTIRVVPS 217 (346) T ss_pred cccCHHHHHHHHHHHHHHHHHccC--CCCCeEEEECHHHHHHHhhchhheecc-ccccccc-cceeeeeecCeEEEEcch Confidence 335688999999999999999999 778999999999999999998886432 2222223 489999999999987 5 Q ss_pred ccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechh-hh-HHHHHHHHHhCCccccccc Q lcl|NC_020078. 253 NNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDL-SK-LWFIDSWLAFGVTINRTEY 330 (339) Q Consensus 253 nnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~-~~-~d~i~g~~~~Ga~v~rPe~ 330 (339) ++++-.-..+.+..+ .....++--++.|++|.......+ .++++.+.. .. +|.+..+.-+.+-|+.-.. T Consensus 218 ~r~~t~~~f~~G~~~--------~t~ak~INfiiv~~~A~ia~~K~~-~~~if~P~~~~~g~~l~~~R~Y~D~fv~~nk~ 288 (346) T protein:vir:10 218 DLMQTAYDFSDGSKI--------IDTAKQIEMFLIYNGVQIAPEKYS-FVGFDQPSAATSGNYLYYEQSYDDVLLLNTKT 288 (346) T ss_pred hhcccchhhccCccc--------cCCccceeEEEECCceeeeeeeee-eeEeeCCCCCcccceeeeeeeeeeeeeecccc Confidence 666521111111000 112334456788999998777776 677776653 22 3578888777777776554 Q ss_pred eEE-EEecCC Q lcl|NC_020078. 331 AGV-IKLPAA 339 (339) Q Consensus 331 ~v~-i~~~~a 339 (339) -+. +....| T Consensus 289 ~~Iyv~~~~a 298 (346) T protein:vir:10 289 KGIQFVVSDK 298 (346) T ss_pred ceEEEeeecc Confidence 332 122222 No 51 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=99.94 E-value=6.2e-30 Score=180.69 Aligned_cols=260 Identities=11% Similarity=0.113 Sum_probs=205.4 Q ss_pred Ccccc-hhHHHHHHHHHHHHHHHHHHhhhccccccc-cc--cccceEEEeccccc-eeeeccCCCCCCCCCCCCccceEE Q lcl|NC_020078. 20 HGAGD-PLADVTEQFTGTVEGTIKRRSIMAGFVPVR-SV--RGTSTISNRGISKA-KLQKIAPGTTPPPSTEPHTSKIFL 94 (339) Q Consensus 20 ~~~~~-~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r-~i--~~G~tv~i~~iG~~-t~~~~~~g~~i~~~~~~~~~~~~l 94 (339) ..-+- .+-+.-|+|+..|.+.+.+.++|.++...+ ++ ++|++|+||...-. .+.++..|++|+.. .+.+++.+. T Consensus 1 Ma~T~~~d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~igdae~~~eg~~i~~~-~lt~~~~~a 79 (270) T protein:vir:95 1 MTQTKKANLINPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYAYIGAAEDLQEGVAMDTT-QMSMTTTKV 79 (270) T ss_pred CCceehhhhcchHHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeecCCCccccccCCCccchh-hcccchhee Confidence 22222 222345999999999999999999998887 33 46999999987643 66789999999864 677888889 Q ss_pred EEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCccccccccccCcc Q lcl|NC_020078. 95 KIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAGAN 174 (339) Q Consensus 95 ~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 174 (339) +|-+ ....+.+.|++...+..|++.+.+++++.++|+++|..++..|..+.... + T Consensus 80 ~i~~-~gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~~a~~~~------------------------~ 134 (270) T protein:vir:95 80 TVKE-TGKAVEVTQTAIITNVNGTLQEASRQLAMSLADKVEIDYIAELNKSKQTA------------------------T 134 (270) T ss_pred eeeh-hhCcceecHHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHhccccccc------------------------c Confidence 9954 46899999999998888999999999999999999999887665422110 1 Q ss_pred ccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceeecceeEEEeceEEEEecc Q lcl|NC_020078. 175 DYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLNTKYMFAAFGVPVITSNN 254 (339) Q Consensus 175 ~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~~G~v~~i~G~~V~~Snn 254 (339) ...+ ++.|.++..+|.+.. ....+++|+|+.|..|.++..+.... .+...+++|.++.++|++|+++.+ T Consensus 135 ~~~t----~~~~~dA~~~lgd~~----~~~~~i~vhs~~~~~Lrk~~~~~~~~---~~~~~~~~G~ig~~~G~~Viv~s~ 203 (270) T protein:vir:95 135 VSAD----ATGILDAIEVFNSEN----DEDYVLYVNPKDYNKLVKSLFKVGGN---VQDRAISKGDLVEIVGVSDIVKSK 203 (270) T ss_pred cccC----HHHHHHHHHHhcccc----CCCcEEEEcHHHHHHHHhhhcccccc---cccchhcccccceecceeEEEeCC Confidence 1122 245667778885542 23469999999999999987554333 233457899999999999988776 Q ss_pred ccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhHHHHHHHHHhCCccccccceEEE Q lcl|NC_020078. 255 AVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWFIDSWLAFGVTINRTEYAGVI 334 (339) Q Consensus 255 lp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~i~g~~~~Ga~v~rPe~~v~i 334 (339) .|.. -.++++++.|++++..+++.+|..|+++++.|.|.+.+.||.++++|+.++.+ T Consensus 204 ~~~~-----------------------~~~~l~~~gAi~~~~~~~~~vEtdRd~~~~~d~i~~~~~y~v~~~~~skvv~~ 260 (270) T protein:vir:95 204 RVSE-----------------------NTAFLQRYGAMEIVNKKKPEAYTDFDILKRTHLLSTNYHYSVNLKDETGVVKV 260 (270) T ss_pred CCCc-----------------------eeEEEEeccceeeeecCCceeeeccchhhcccEEEeeeEEEEEEEccceEEEE Confidence 6521 13578999999999999999999999999999999999999999999999999 Q ss_pred EecCC Q lcl|NC_020078. 335 KLPAA 339 (339) Q Consensus 335 ~~~~a 339 (339) .+..| T Consensus 261 t~~~a 265 (270) T protein:vir:95 261 TFKPS 265 (270) T ss_pred EecCC Confidence 98888 No 52 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=99.94 E-value=3.4e-30 Score=182.13 Aligned_cols=231 Identities=12% Similarity=0.070 Sum_probs=186.2 Q ss_pred cccccccceEEEeccccceeeeccCCCCCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHH Q lcl|NC_020078. 53 VRSVRGTSTISNRGISKAKLQKIAPGTTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIAN 132 (339) Q Consensus 53 ~r~i~~G~tv~i~~iG~~t~~~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~ 132 (339) .--+..|+|++||.. -..+.++..|.+|+. +.++.++.+.+|.+ ...+|.|.|++..+...|++.+.+++++.+||+ T Consensus 1 ~~~~~~Gdtit~P~~-iGda~~v~eG~~i~~-~~l~~t~~~atIk~-~gk~~~itD~a~l~~~gDp~~ea~~Q~~~~iA~ 77 (231) T protein:vir:73 1 ENGINLANLCEYPND-IGDAADVAEGGEISL-DKIGTTTKSVTIKK-AAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) T ss_pred CccccCCceEEeccc-ccchhhhcCCCcCCh-hhccccceeeeEee-eccceeeeHHHHhhccCchHHHHHHHHHHHHHH Confidence 122346999999754 335688999999986 45788889999966 478999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhcccccccccccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHH Q lcl|NC_020078. 133 MYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPA 212 (339) Q Consensus 133 ~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~ 212 (339) ++|..++.++.+++.. .+ .+.+ ++.|.++..+|.+.+. ..+|++|+|+ T Consensus 78 kvD~di~~~~~~a~l~-----------------------~~-~~~t----~d~i~~A~~~fgde~~----~~~vivv~p~ 125 (231) T protein:vir:73 78 KVDDDLLKAAKTTSQT-----------------------VS-TKAN----VDGVQAALDIFNDEDA----QAYVLIVNPK 125 (231) T ss_pred hhhHHHHHhhcccccc-----------------------cc-cccc----HHHHHHHHHHhccccc----cceEEEEcch Confidence 9999998766543311 01 1122 4677888888987653 3589999999 Q ss_pred HHHHHhcccchhhhcccccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEecccee Q lcl|NC_020078. 213 AFTALMQAEHITNGEYVTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKAL 292 (339) Q Consensus 213 ~~~~Ll~~~~~~n~d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~ 292 (339) .|+.|.++.++.... +..+...+++|.||.++|++|+.|+++|.+.. |. .+ +++.+.|+ T Consensus 126 ~~~~Lrk~~~~~~~~-~~~g~~i~~~G~iG~i~G~~Vi~S~~~~~~~~--------------~~---~~---~i~~~gAl 184 (231) T protein:vir:73 126 DAAKIRKDANAKNIG-SEVGANALINGTYADVLGAQIVRSKKLAEGSA--------------LM---FK---IVSNSPAL 184 (231) T ss_pred HHHhhhhccchhhhh-hhhccceeeecccceEcceEEEEcCCCCCCce--------------ee---ee---EEeeccce Confidence 999999988876642 22334568999999999999999999984321 10 01 34568999 Q ss_pred EEEEEeeeeEEeeechhhhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 293 LAGSTIPVTSKIFFDDLSKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 293 ~~~~~~~~~~e~~~~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) ++...+++++|..|+++.+++.|.+.+.|++++++|+.++.+.+.+- T Consensus 185 ~~~~k~~~~vEtdRd~~~k~~~i~~~~~y~v~l~~~~~vv~~t~~g~ 231 (231) T protein:vir:73 185 KLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) T ss_pred eeeecccceeeccccccccccEEEEeEEEEEEEEcCccEEEEEeecC Confidence 99999999999999999999999999999999999999999998888 No 53 >protein:vir:102335 Length: 312 # NCBI annotation: putative capsid protein # Family: family:all:701 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529560;genbank:gi:90592716;genbank:GeneID:3974467 Probab=99.94 E-value=1.4e-28 Score=173.22 Aligned_cols=292 Identities=11% Similarity=0.014 Sum_probs=206.8 Q ss_pred CcccchhHHHHHHHHHHHHHHHHHHhhhcccccc-c--cccccceEEEeccccceeeeccCCCCCCC-CCCCCccceEEE Q lcl|NC_020078. 20 HGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPV-R--SVRGTSTISNRGISKAKLQKIAPGTTPPP-STEPHTSKIFLK 95 (339) Q Consensus 20 ~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~-r--~i~~G~tv~i~~iG~~t~~~~~~g~~i~~-~~~~~~~~~~l~ 95 (339) .+ .. . -|.++|+.+|++.|.+.+++..+... . .+.+|++|+||+|.....++|++++.... +..++.+..+++ T Consensus 1 Ma-nt-l-~ya~~~~~~LD~~~~~~~~s~~l~~~~~~v~~~ggktVkIp~i~~~gl~DY~R~~g~~~~~g~v~~~~et~t 77 (312) T protein:vir:10 1 MA-NT-L-AYGQVLQQGLDKQATQELLTGWMDSNAKQIKYEGGKEVKIGKLSTDGLGDYSRGSANAYVGGDVKFEYETKT 77 (312) T ss_pred CC-cc-h-hHHHHHHHHHHHHHHhhhccccccCCCceEEEecCcEEEEEeeecccccccccccCCccccccccccceeEE Confidence 12 11 2 37799999999999999988877422 2 45689999999999999999999876332 234677888999 Q ss_pred EeehhhhhhhHH--HHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCccccccccccCc Q lcl|NC_020078. 96 IDTVIIARNAEP--MLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAGA 173 (339) Q Consensus 96 ID~~~y~~~~vd--d~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~~ 173 (339) ++|.++|.|.|| |+||.+....+...+.+.+.....-.+|.+.|..|+..|...+ +.+. ... T Consensus 78 l~qDR~~~F~vD~mDvDETn~~~s~anv~~ef~r~~vvPEiDayrfskla~~a~~~~------------~~~~----~~~ 141 (312) T protein:vir:10 78 MTQDRGRKFTLDAMDVDETNFLVTATTVMGEFQRLKVIPEIDAYRLSRLATIAIGIK------------GDTN----VEY 141 (312) T ss_pred eeecccceeeccccchhhHhhHHHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhccc------------cccc----ccc Confidence 999999999999 9999988888888888888999999999999988876553211 0011 111 Q ss_pred cccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceeecceeEEEeceEEEEec Q lcl|NC_020078. 174 NDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLNTKYMFAAFGVPVITSN 253 (339) Q Consensus 174 ~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~~G~v~~i~G~~V~~Sn 253 (339) ..+.+++++|++|.++.++|+|..| | ++|+++|+|+++.+|.++..+. ..-...+ ....++.|+++.|++|++.+ T Consensus 142 ~~~~T~~ni~~~i~~~~~~lde~~v--p-~~rvl~vTp~~~~lLk~~~~~~-~~~~~~~-~~~i~~~V~~iDgv~Ii~VP 216 (312) T protein:vir:10 142 SYSVNSSTIINKIKTGIKIIRENGY--N-GPLVCHLTYDSMFAIEEKVLEK-LTAVTFA-QGGIQTQVPSIDGCALIKTP 216 (312) T ss_pred ccccCHHHHHHHHHHHHHHHHHccC--C-CceEEEeChHHHHHHhhhhhce-ecccccc-cceeeeeeeeecccEEEEch Confidence 2335788999999999999999998 7 5999999999997777653222 1111112 22348899999999999843 Q ss_pred --cccc----cccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechh---hhHHHHHHHHHhCCc Q lcl|NC_020078. 254 --NAVF----GKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDL---SKLWFIDSWLAFGVT 324 (339) Q Consensus 254 --nlp~----~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~---~~~d~i~g~~~~Ga~ 324 (339) ++.. +.+.+++... ++..-.....++--++.|++|.......+ .+.++.+.. ..+|.+..+.-+.+- T Consensus 217 s~r~~t~~~f~dG~t~~~~~---gg~~~~~~ak~INfiiv~~~a~i~~~K~~-~~~if~P~~~~~~d~~~~~~R~Y~D~f 292 (312) T protein:vir:10 217 QNRMYSSILLNDGTTSNQTA---GGYLKGTKALDTNFIIAPVDVPLAITKQD-KMRIFDPETNQTANAWSMDYRRYHDLW 292 (312) T ss_pred hhhccceeeeccCccccccc---CceeecCcccccceEEeCCceeeceeeee-eeeeeCCCCCCCcceeeeeeeeeeeee Confidence 3321 1111111111 11111233445556888999987777666 667765543 346899999999998 Q ss_pred cccccceEE-EEecCC Q lcl|NC_020078. 325 INRTEYAGV-IKLPAA 339 (339) Q Consensus 325 v~rPe~~v~-i~~~~a 339 (339) |+.-..-+. +....| T Consensus 293 v~~nk~~~Iyv~~k~a 308 (312) T protein:vir:10 293 VTDNKANSVYANFKDA 308 (312) T ss_pred eeccccCeEEEEeecc Confidence 888776555 666666 No 54 >protein:vir:99523 Length: 311 # NCBI annotation: putative protein # Family: family:all:701 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958538;genbank:gi:41179320;genbank:GeneID:2717161 Probab=99.87 E-value=2.5e-24 Score=149.97 Aligned_cols=297 Identities=13% Similarity=0.114 Sum_probs=197.8 Q ss_pred ccCCCcccCCccCcccchhHH-HHHHHHHHHHHHHHHHhhhccccccc-cc-cccceEEEeccccceeeeccCCCCCCCC Q lcl|NC_020078. 8 TPSYDVTRPNQRHGAGDPLAD-VTEQFTGTVEGTIKRRSIMAGFVPVR-SV-RGTSTISNRGISKAKLQKIAPGTTPPPS 84 (339) Q Consensus 8 ~~~~~~~r~~~~~~~~~~~a~-~ie~~~g~v~~~f~~~sv~~~~v~~r-~i-~~G~tv~i~~iG~~t~~~~~~g~~i~~~ 84 (339) .| ..++.-|+ |.++|+.+|++.|...++...+.+.. .+ .||++|+||+|....+++|++++.... T Consensus 1 ~~-----------~~an~mAlnya~~~~~~Ld~~~~~~~~t~~l~~~~~~~~~Gak~VkIp~i~~~gl~dY~R~~g~~~- 68 (311) T protein:vir:99 1 MP-----------TDAETRGFNYVTKDGNLLDQKITAGLFTAALGTPEVDLVNGGRSFTLKTISTSGLKDHTRGKGFNS- 68 (311) T ss_pred CC-----------CcchhhHHHHHHHHHHHHHHHHHhhhcccceecCchheeecCCEEEEEeeeeccccccccccCccc- Confidence 22 22233344 78999999999999999888876654 34 489999999999999999999987654 Q ss_pred CCCCccceEEEEeehhhhhhhHH--HHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCc Q lcl|NC_020078. 85 TEPHTSKIFLKIDTVIIARNAEP--MLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGH 162 (339) Q Consensus 85 ~~~~~~~~~l~ID~~~y~~~~vd--d~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~ 162 (339) ...+.+..++++++..++.|.|| |+||.......-.-..+..-....=.+|.+-+..|+..|...... . T Consensus 69 g~v~~~~et~tl~~DR~~~f~vD~mDvdETn~~~~~ani~~~f~r~~vvPEiDayrfskla~~a~~~~~~------~--- 139 (311) T protein:vir:99 69 GTISDEKTIYTMGQDRDVEFYLDRQDVDETDNELAMANISNVFITEHVQPELDSYRFSKIATSFDNLDGT------D--- 139 (311) T ss_pred cceeeeeeEEEeeeccceeeecchhchhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhccccc------c--- Confidence 45677889999999999999999 788765444433334444444556678888888777554322110 0 Q ss_pred cccc-cccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhccccc-ccceeecce Q lcl|NC_020078. 163 SGGN-VVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTS-AGETLNTKY 240 (339) Q Consensus 163 ~~~~-~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~-~~~~l~~G~ 240 (339) .... ...........+.+++++.|..+..+|++ | |.++|+++|+|++|.+|...++|... .... .+...-++. T Consensus 140 ~~~~~~~~~~~~~~~lt~~nvl~~l~~~~~~~~~--v--~~~~rvl~vTp~~~~lLk~~~~~~r~-~~~~~~~~~~i~~~ 214 (311) T protein:vir:99 140 TEGTLLAKTHKTEETLDETNAYSQLKTGIGKVRK--Y--GTQNLVGYVSSEVMDALERSKEFTRN-ITNQNVGTTALESR 214 (311) T ss_pred cchhhhccccccccccCHHHHHHHHHHHHHHHHh--c--CCCCeEEEEChHHHHHHhhchhhhee-eecccccccccccc Confidence 0000 01111233456889999999999999986 6 55799999999999988887777531 1111 111223678 Q ss_pred eEEEeceEEEEe---ccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechh---hhHHH Q lcl|NC_020078. 241 MFAAFGVPVITS---NNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDL---SKLWF 314 (339) Q Consensus 241 v~~i~G~~V~~S---nnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~---~~~d~ 314 (339) |+++.|++|++. +++.-.-..+....+. ..+.++--++.|++|.......+ .+..+.|.. ..+|. T Consensus 215 V~~lDgv~Ii~V~ps~r~~t~~~ft~G~~~~--------~~ak~INfiiv~~~a~i~~~K~~-~v~~f~P~~~~~gd~~l 285 (311) T protein:vir:99 215 ITSIDGVQLIEVYESNRFMTKYDFTDGAKPT--------EDAKAINFLVVAKPAVISIVKEN-AVFLFAPGQHTDGDGYL 285 (311) T ss_pred cceecCeEEEEecCchhhcchhhhcCCcccc--------CcccccceEEeCCCeeeeeeeee-eeeeeCCCCCCCcceee Confidence 999999999865 4443111111110010 11234456788999987777665 667665443 34788 Q ss_pred HHHHHHhCCccccccceEE-EEecCC Q lcl|NC_020078. 315 IDSWLAFGVTINRTEYAGV-IKLPAA 339 (339) Q Consensus 315 i~g~~~~Ga~v~rPe~~v~-i~~~~a 339 (339) +..+.-+.+-|+.-..-+. +....| T Consensus 286 ~~~R~Y~D~fv~~nk~~~Iyv~~k~A 311 (311) T protein:vir:99 286 YQNRLYHDLFIKKHKRDGIFVSVKKA 311 (311) T ss_pred eeeeeeeeeeeeccccCeEEEeeecC Confidence 9888888888887766443 555566 No 55 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=99.86 E-value=4.8e-24 Score=148.41 Aligned_cols=268 Identities=13% Similarity=0.082 Sum_probs=184.4 Q ss_pred hHH-HHHHHHHHHHHHHHHHhhhccccccc-----cccccceEEEeccc-cceeeeccCCCCCCCCCCCCccceEEEEee Q lcl|NC_020078. 26 LAD-VTEQFTGTVEGTIKRRSIMAGFVPVR-----SVRGTSTISNRGIS-KAKLQKIAPGTTPPPSTEPHTSKIFLKIDT 98 (339) Q Consensus 26 ~a~-~ie~~~g~v~~~f~~~sv~~~~v~~r-----~i~~G~tv~i~~iG-~~t~~~~~~g~~i~~~~~~~~~~~~l~ID~ 98 (339) =++ +.++|++.|++.|...+++..+.+.. ...||++|+||++. ...+++|+++...... ..+.+..++++++ T Consensus 1 Main~~~k~~~~ld~~~~~~~~~~~l~~~~n~~~~~~~gak~VkIp~ist~~gl~dY~R~~g~~~g-~v~~~~et~tl~~ 79 (285) T protein:vir:79 1 MTVVLDSKDLARIDEEYKADSQVWSYLTGGNGVTQRFRGHNEVRINKLSGFVDATAYKRGQDNARK-TISVGKETVKLTH 79 (285) T ss_pred CcchhhHHHHHHHHHHHHHhhhhhhhcccCCcceeEecCCCEEEEeeecccccccccccccCcccc-ccceeeeEEEeec Confidence 112 45999999999999988888775543 45679999999996 4689999998887654 4577889999999 Q ss_pred hhhhhhhHHHHHHHhcCcchHHHHHHH-HHHHHHHHHHHHHHHHHHhhcccccccccccccccCccccccccccCccccc Q lcl|NC_020078. 99 VIIARNAEPMLDEFQTDFDYQGEVARE-QGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAGANDYK 177 (339) Q Consensus 99 ~~y~~~~vdd~D~~q~~~d~~~~~~~~-~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 177 (339) ..++.|.||.+|.-++..=....++.+ .-....-.+|.+-|..|+..|. .. .+.+. T Consensus 80 DR~~~f~iD~mDvdEn~~~~~~ni~~ef~~~~vvPEiDayrfskla~~a~------------------~~-----~~~~~ 136 (285) T protein:vir:79 80 EDWFGYDLDQFDMDENGAYTVENVVREHNKMITIPHRDKVAVQKLFDSAA------------------KK-----ATDSI 136 (285) T ss_pred cccceecccccchhhhhhhhHHHHHHHHHhhhhcchhhHHHHHHHHhhcc------------------cc-----ccccc Confidence 999999999444433211112222222 2334455788887777764321 00 11234 Q ss_pred cHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccc-cceeecceeEEEec-eEEEEe--c Q lcl|NC_020078. 178 DPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSA-GETLNTKYMFAAFG-VPVITS--N 253 (339) Q Consensus 178 ~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~-~~~l~~G~v~~i~G-~~V~~S--n 253 (339) +.+++|++|.++.++|+|..| | ++||++|+|++|.+|.++++|...-=.... ...--++.|+++.| ++|++. + T Consensus 137 T~~nv~~~i~~~~~~lde~~v--p-~~rvl~vTp~~~~~Lk~s~~~~r~~~~~~~~~~~~i~~~V~~lDg~v~ii~Vps~ 213 (285) T protein:vir:79 137 TKDNALDAYDTAEAYMFDNEV--P-GGFVMFVSSAYYTALKQSAAVTRTFSTDGTMVINGIDRRVAQLDGGVPIVRVSSD 213 (285) T ss_pred CHHHHHHHHHHHHHHHHHcCC--C-CceEEEEChHHHHHHHhhhhhheecccccceeccceeeeeccccceeEEEEcchh Confidence 678999999999999999999 6 589999999999999999888753101111 01112467899998 899984 4 Q ss_pred cccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeech---hhhHHHHHHHHHhCCccccccc Q lcl|NC_020078. 254 NAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDD---LSKLWFIDSWLAFGVTINRTEY 330 (339) Q Consensus 254 nlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~---~~~~d~i~g~~~~Ga~v~rPe~ 330 (339) ++. +... ..++--++.|++|.......+ .+..+.++ ..-+|.+..+.-+++-|+.-.. T Consensus 214 r~k-t~~~-----------------~k~Infiiv~~~a~i~~~K~~-~~~~f~P~~~~~~d~~~~~~R~Y~d~fv~~nk~ 274 (285) T protein:vir:79 214 RLK-GLGI-----------------TNHVNFILTPLSAIAPIVKYD-SVSVIDPSTDRSGNRWTIKGLSYYDAIVLDNAK 274 (285) T ss_pred hcc-CcCc-----------------chhccEEEecCceeccceeee-eeEeECCCCCCCcceeeeeeeeeeeeeehhhcc Confidence 442 1000 123345788999987766665 56666665 3447899999988888887776 Q ss_pred eEEEEecCC Q lcl|NC_020078. 331 AGVIKLPAA 339 (339) Q Consensus 331 ~v~i~~~~a 339 (339) -+...-..| T Consensus 275 ~~Iy~~~~a 283 (285) T protein:vir:79 275 KGIYVAATA 283 (285) T ss_pred ceeeeeecc Confidence 665544444 No 56 >protein:vir:78090 Length: 302 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468790;genbank:gi:157325371;genbank:GeneID:5601852 Probab=99.78 E-value=8.9e-21 Score=130.52 Aligned_cols=283 Identities=15% Similarity=0.096 Sum_probs=189.4 Q ss_pred CcccchhHHHHHHHHHHHHHHHHHHhhhcccccc-c--cccccceEEEeccc-----cceeeeccCCCCCCCCCCCCccc Q lcl|NC_020078. 20 HGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPV-R--SVRGTSTISNRGIS-----KAKLQKIAPGTTPPPSTEPHTSK 91 (339) Q Consensus 20 ~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~-r--~i~~G~tv~i~~iG-----~~t~~~~~~g~~i~~~~~~~~~~ 91 (339) .+ +.. -|.++|+.+|++.|...+++..+... . .+.||++|+||+|- .+-.++|++++..... ..+.+. T Consensus 1 Ma--ntl-~ya~~~~~~Ld~~~~~~~~t~~l~~~~~~v~~~Gak~vkIp~is~~~~~TsGl~dy~R~~g~~~g-~v~~~~ 76 (302) T protein:vir:78 1 MA--NSL-ALAQIYQDNIDKAIAVNSKSAFLEANPNNVQYNGGNTIKIADISFGSGTTGDLKAYNRSTGFTQG-SVTLAW 76 (302) T ss_pred CC--chh-HHHHHHHHHHHHHHHhhhceeecccCCceEEEecCcEEEEEEEEeeccccccccccccccCcccc-ceeeee Confidence 11 212 37799999999999999998887433 2 46689999999995 4578899998866543 356678 Q ss_pred eEEEEeehhhhhhhHH--HHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCcccccccc Q lcl|NC_020078. 92 IFLKIDTVIIARNAEP--MLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVT 169 (339) Q Consensus 92 ~~l~ID~~~y~~~~vd--d~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~ 169 (339) .+.++++..++.|.|| |+||......+-.-+.+..-....=.+|.+-|..|+..|.... + . T Consensus 77 et~tlt~DR~~~f~vD~mDvdETn~~~~~ani~~ef~r~~vvPEiDayrfskla~~a~~~~--------------~-~-- 139 (302) T protein:vir:78 77 SDYTLDYDLAQSFQIDAMDVDETKNLATVGNVLSEYQRTKIVPAIDKYRFTKLANDGTGVG--------------G-V-- 139 (302) T ss_pred eeEEeeeccceeeeccccchhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHHhhhccC--------------c-c-- Confidence 8899999999999999 7777655554444444455666777889988887765442110 0 0 Q ss_pred ccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccc-cceeecceeEEEeceE Q lcl|NC_020078. 170 LAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSA-GETLNTKYMFAAFGVP 248 (339) Q Consensus 170 ~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~-~~~l~~G~v~~i~G~~ 248 (339) ....+...+.++++++|..+.++|+|. ++|+++|+|+.+.+|...+.|... ..... ....-++.|+++.|++ T Consensus 140 ~~~~~~~~t~~nvl~~i~~~~~~~~e~------~~~vl~vtp~~~~~Lk~a~~~~~~-~~~~~~~~~~i~~~V~~lDgv~ 212 (302) T protein:vir:78 140 IDLSKPDASAQALMGDIATAMELVDDS------NQLILVTSPTTLAGLLNTALIRES-KNTQVLRRGEVDTKITFIQDVE 212 (302) T ss_pred ccccccchhHHHHHHHHHHHHHHhhcc------CCeEEEEChHHHHHHhcchhhccc-eeccccccccccceeeeecccE Confidence 001122356789999999999999984 489999999999999887666421 11111 1111267899999999 Q ss_pred EEEec--cccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeech-hhhH--HHHHHHHHhCC Q lcl|NC_020078. 249 VITSN--NAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDD-LSKL--WFIDSWLAFGV 323 (339) Q Consensus 249 V~~Sn--nlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~-~~~~--d~i~g~~~~Ga 323 (339) |++.+ ++...-.-++ . ..-.....++--++.|++|.......+ .+.++.+. ...+ |.+..+.-+.+ T Consensus 213 Ii~VPs~r~~t~~~f~~-----G---~~~~~~ak~INfiiv~~~a~ia~~K~~-~~~if~P~~~~~gd~~l~~~R~Y~D~ 283 (302) T protein:vir:78 213 VLQVPSEYLYDKVAPKV-----G---VPDYTGAKKIPYMIFKRDAPTGIVKTD-KVRVFEPDTNQSADAYKVDLRLYHDL 283 (302) T ss_pred EEEchhhhcccceeccC-----C---ccccCCccceeEEEECCCeeeeeeeee-eeEeeCCCCCCCcceeeeeeeeEeee Confidence 99754 3431111010 0 001122345556888999988777666 67777664 3444 69999988888 Q ss_pred ccccccceEE-EEecCC Q lcl|NC_020078. 324 TINRTEYAGV-IKLPAA 339 (339) Q Consensus 324 ~v~rPe~~v~-i~~~~a 339 (339) -|+.....+. +...+| T Consensus 284 fV~~nk~~gI~~~~~~~ 300 (302) T protein:vir:78 284 IVPKNQRPGIIKASFGT 300 (302) T ss_pred eeeccccCeEEEeeccc Confidence 8887765333 333333 No 57 >protein:vir:9265 Length: 430 # NCBI annotation: 5 # Family: family:all:1412 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720329;genbank:gi:24371587;genbank:GeneID:955820 Probab=99.54 E-value=3.3e-16 Score=105.43 Aligned_cols=286 Identities=12% Similarity=-0.004 Sum_probs=172.2 Q ss_pred ccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccc---c---ccccceEEEeccccceeeeccCCCCCCCCC-CCCcc Q lcl|NC_020078. 18 QRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVR---S---VRGTSTISNRGISKAKLQKIAPGTTPPPST-EPHTS 90 (339) Q Consensus 18 ~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r---~---i~~G~tv~i~~iG~~t~~~~~~g~~i~~~~-~~~~~ 90 (339) ..++ +.-.+++=..|.++.|+..++|...+.+. + -+.|+++.+|.--.....+ |..+..+. .+... T Consensus 1 MAn~----l~~~~~ii~~eal~~l~n~~v~a~~~~~~r~~d~~~~r~Gdti~~p~~~~~~~~~---G~~~t~~~~~i~e~ 73 (430) T protein:vir:92 1 MALN----EGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQE---GWDLTDKATGLLEL 73 (430) T ss_pred Cccc----hhhHHHHHHHHHHHHHhhhhhhhhhhcccCCchhhhhcccceEEecccccccccc---CcccCCCCCccccc Confidence 2222 33355667778889999999999754432 2 2569999887766655544 65555542 22234 Q ss_pred ceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCccccccccc Q lcl|NC_020078. 91 KIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTL 170 (339) Q Consensus 91 ~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~ 170 (339) ...+++|+.+--.|.+.+-+ +...+....+.+.+..+||..+|..++.....-+.. +.+...++ . T Consensus 74 ~v~~~v~~~k~V~~~~~~ke--l~~~~~~~~~i~~Am~~LA~~Vd~dl~~~~~~~~~~----------v~~~~~~t---~ 138 (430) T protein:vir:92 74 NVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSL----------VITSPDAI---G 138 (430) T ss_pred eEEEEEeeeccceEEechhH--hcChhHHHHHhHHHHHHHHHHHHHHHHHHhhhcccc----------cccccccC---C Confidence 67889999998888888755 577777888889999999999999998665432110 00000010 0 Q ss_pred cCccccccHHHHHHHHHHHHHHHHhcCCCCCc-CCeEEEECHHHHHHHhcc-cchhhhcccccccceeecceeEE-Eece Q lcl|NC_020078. 171 AGANDYKDPAKLYAAIASLVEKFLEKDVRPNE-EDMILVLPPAAFTALMQA-EHITNGEYVTSAGETLNTKYMFA-AFGV 247 (339) Q Consensus 171 ~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~-~~R~~vv~P~~~~~Ll~~-~~~~n~d~~~~~~~~l~~G~v~~-i~G~ 247 (339) ..++ .....+-.+.+.|+++.| |. .+|.++++|+.+..|... .++-+++ ......+++|.|++ +.|| T Consensus 139 ~~~~------~~~~~~A~a~~~L~~~~v--P~~~~R~~vldp~~~~~l~~~l~~l~~~~--~~~~~A~r~g~i~~~~~Gf 208 (430) T protein:vir:92 139 TNTA------DAWNFVADAEELMFSREL--NRDMGTSYFFNPQDYKKAGYDLTKRDIFG--RIPEEAYRDGTIQRQVAGF 208 (430) T ss_pred CcCC------cchhhHHHHHHHHHHhcC--CCCCCcEEEeChHHHHHHHhhhccccccc--cchhHHHhhccccccchhh Confidence 0111 123456678889999999 54 589999999999998753 2333332 12334589999997 9999 Q ss_pred E-EEEeccccccccccccccC---------------------------------C-----Ccccc--------------- Q lcl|NC_020078. 248 P-VITSNNAVFGKTITDHLLS---------------------------------N-----ANNEK--------------- 273 (339) Q Consensus 248 ~-V~~Snnlp~~~~~~~~~l~---------------------------------~-----~~~~~--------------- 273 (339) + +++|+++|..+.+++.... . .|+.- T Consensus 209 d~~~~~~~~~~~t~g~~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftiaGV~~v~~~tkq~ 288 (430) T protein:vir:92 209 DDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNV 288 (430) T ss_pred hhhhhcCCcccccCccCcCceeccccccccccceecccccccccccccceeeeecccceecccEEEecceeeeccccccc Confidence 5 7899998853322221110 0 01000 Q ss_pred -----cccccc-----------------------------------------------ceEEEEEeccceeEEEEEee-- Q lcl|NC_020078. 274 -----AYDGDF-----------------------------------------------KDIVAQMFSPKALLAGSTIP-- 299 (339) Q Consensus 274 -----~y~~~~-----------------------------------------------~~~~~~~~h~~A~~~~~~~~-- 299 (339) .|.+.. .-+..++|||+|+..+...- T Consensus 289 ~~~l~~F~Vt~~~~atsv~I~paii~~~~~~~~~~~~~y~nVsaspa~~aavTvv~~a~~~~Nl~fhr~A~aLa~~pL~~ 368 (430) T protein:vir:92 289 LAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPA 368 (430) T ss_pred cCCccEEEEEEecCCceeEEeccccccccccccccccccceeccccccCceeEEeccCCcccceeEcccceEEEEecccC Confidence 011110 00225999999998776542 Q ss_pred -------------------eeEEeeechhhhHH----HH--HHHHHhCCccccccceEEEEe-cCC Q lcl|NC_020078. 300 -------------------VTSKIFFDDLSKLW----FI--DSWLAFGVTINRTEYAGVIKL-PAA 339 (339) Q Consensus 300 -------------------~~~e~~~~~~~~~d----~i--~g~~~~Ga~v~rPe~~v~i~~-~~a 339 (339) +.+..+ ++.| .. +==..||.+.+|||.++++=. .+| T Consensus 369 ~~~~~~~~~~~~~~~~~~Glsirv~----~~yd~~~~~~~~r~DvLyG~~~v~Pe~a~v~l~g~~~ 430 (430) T protein:vir:92 369 NHELFAGMKTTSFSIPDVGLNGIFA----TQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred CCCHHHhhhhheeccccceEEEEEE----EecccccCceEEEEeeeccceecCcceEEEEcCCCCC Confidence 111111 1122 11 111379999999999754433 333 No 58 >protein:vir:100939 Length: 430 # NCBI annotation: Gp5 # Family: family:all:1412 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006408;genbank:gi:46358700;genbank:GeneID:2777089 Probab=99.54 E-value=3.3e-16 Score=105.43 Aligned_cols=286 Identities=12% Similarity=-0.004 Sum_probs=172.2 Q ss_pred ccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccc---c---ccccceEEEeccccceeeeccCCCCCCCCC-CCCcc Q lcl|NC_020078. 18 QRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVR---S---VRGTSTISNRGISKAKLQKIAPGTTPPPST-EPHTS 90 (339) Q Consensus 18 ~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r---~---i~~G~tv~i~~iG~~t~~~~~~g~~i~~~~-~~~~~ 90 (339) ..++ +.-.+++=..|.++.|+..++|...+.+. + -+.|+++.+|.--.....+ |..+..+. .+... T Consensus 1 MAn~----l~~~~~ii~~eal~~l~n~~v~a~~~~~~r~~d~~~~r~Gdti~~p~~~~~~~~~---G~~~t~~~~~i~e~ 73 (430) T protein:vir:10 1 MALN----EGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQE---GWDLTDKATGLLEL 73 (430) T ss_pred Cccc----hhhHHHHHHHHHHHHHhhhhhhhhhhcccCCchhhhhcccceEEecccccccccc---CcccCCCCCccccc Confidence 2222 33355667778889999999999754432 2 2569999887766655544 65555542 22234 Q ss_pred ceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCccccccccc Q lcl|NC_020078. 91 KIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTL 170 (339) Q Consensus 91 ~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~ 170 (339) ...+++|+.+--.|.+.+-+ +...+....+.+.+..+||..+|..++.....-+.. +.+...++ . T Consensus 74 ~v~~~v~~~k~V~~~~~~ke--l~~~~~~~~~i~~Am~~LA~~Vd~dl~~~~~~~~~~----------v~~~~~~t---~ 138 (430) T protein:vir:10 74 NVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSL----------VITSPDAI---G 138 (430) T ss_pred eEEEEEeeeccceEEechhH--hcChhHHHHHhHHHHHHHHHHHHHHHHHHhhhcccc----------cccccccC---C Confidence 67889999998888888755 577777888889999999999999998665432110 00000010 0 Q ss_pred cCccccccHHHHHHHHHHHHHHHHhcCCCCCc-CCeEEEECHHHHHHHhcc-cchhhhcccccccceeecceeEE-Eece Q lcl|NC_020078. 171 AGANDYKDPAKLYAAIASLVEKFLEKDVRPNE-EDMILVLPPAAFTALMQA-EHITNGEYVTSAGETLNTKYMFA-AFGV 247 (339) Q Consensus 171 ~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~-~~R~~vv~P~~~~~Ll~~-~~~~n~d~~~~~~~~l~~G~v~~-i~G~ 247 (339) ..++ .....+-.+.+.|+++.| |. .+|.++++|+.+..|... .++-+++ ......+++|.|++ +.|| T Consensus 139 ~~~~------~~~~~~A~a~~~L~~~~v--P~~~~R~~vldp~~~~~l~~~l~~l~~~~--~~~~~A~r~g~i~~~~~Gf 208 (430) T protein:vir:10 139 TNTA------DAWNFVADAEELMFSREL--NRDMGTSYFFNPQDYKKAGYDLTKRDIFG--RIPEEAYRDGTIQRQVAGF 208 (430) T ss_pred CcCC------cchhhHHHHHHHHHHhcC--CCCCCcEEEeChHHHHHHHhhhccccccc--cchhHHHhhccccccchhh Confidence 0111 123456678889999999 54 589999999999998753 2333332 12334589999997 9999 Q ss_pred E-EEEeccccccccccccccC---------------------------------C-----Ccccc--------------- Q lcl|NC_020078. 248 P-VITSNNAVFGKTITDHLLS---------------------------------N-----ANNEK--------------- 273 (339) Q Consensus 248 ~-V~~Snnlp~~~~~~~~~l~---------------------------------~-----~~~~~--------------- 273 (339) + +++|+++|..+.+++.... . .|+.- T Consensus 209 d~~~~~~~~~~~t~g~~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftiaGV~~v~~~tkq~ 288 (430) T protein:vir:10 209 DDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNV 288 (430) T ss_pred hhhhhcCCcccccCccCcCceeccccccccccceecccccccccccccceeeeecccceecccEEEecceeeeccccccc Confidence 5 7899998853322221110 0 01000 Q ss_pred -----cccccc-----------------------------------------------ceEEEEEeccceeEEEEEee-- Q lcl|NC_020078. 274 -----AYDGDF-----------------------------------------------KDIVAQMFSPKALLAGSTIP-- 299 (339) Q Consensus 274 -----~y~~~~-----------------------------------------------~~~~~~~~h~~A~~~~~~~~-- 299 (339) .|.+.. .-+..++|||+|+..+...- T Consensus 289 ~~~l~~F~Vt~~~~atsv~I~paii~~~~~~~~~~~~~y~nVsaspa~~aavTvv~~a~~~~Nl~fhr~A~aLa~~pL~~ 368 (430) T protein:vir:10 289 LAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPA 368 (430) T ss_pred cCCccEEEEEEecCCceeEEeccccccccccccccccccceeccccccCceeEEeccCCcccceeEcccceEEEEecccC Confidence 011110 00225999999998776542 Q ss_pred -------------------eeEEeeechhhhHH----HH--HHHHHhCCccccccceEEEEe-cCC Q lcl|NC_020078. 300 -------------------VTSKIFFDDLSKLW----FI--DSWLAFGVTINRTEYAGVIKL-PAA 339 (339) Q Consensus 300 -------------------~~~e~~~~~~~~~d----~i--~g~~~~Ga~v~rPe~~v~i~~-~~a 339 (339) +.+..+ ++.| .. +==..||.+.+|||.++++=. .+| T Consensus 369 ~~~~~~~~~~~~~~~~~~Glsirv~----~~yd~~~~~~~~r~DvLyG~~~v~Pe~a~v~l~g~~~ 430 (430) T protein:vir:10 369 NHELFAGMKTTSFSIPDVGLNGIFA----TQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred CCCHHHhhhhheeccccceEEEEEE----EecccccCceEEEEeeeccceecCcceEEEEcCCCCC Confidence 111111 1122 11 111379999999999754433 333 No 59 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=99.54 E-value=6e-16 Score=104.03 Aligned_cols=306 Identities=11% Similarity=-0.008 Sum_probs=168.7 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc---------ccce Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI---------SKAK 71 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i---------G~~t 71 (339) |+...=.-+-..-+.+ ++......-+++.+.|+.++.+..++.++++.++++..+. +++++||++ |..+ T Consensus 1 ~~~~~e~~~~~~~~~~-~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~-~~~~~ip~~~~~~~a~~v~~~~ 78 (338) T protein:vir:78 1 MATLNELAPNTAGSNH-QGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENIPIS-YGETIIPTTVKRPEVGQVGVGT 78 (338) T ss_pred CcchHHhhhhhccccc-ccceecccccccchHHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEecCccceeecccc Confidence 3222211111111111 2333333444777999999999999999999998877655 567777775 2334 Q ss_pred eeeccCCCCCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Q lcl|NC_020078. 72 LQKIAPGTTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDS 151 (339) Q Consensus 72 ~~~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~ 151 (339) +.....|+.++.. .+...+.++..= ..+....|-+-=-.++.+|+.+.+.++.++++++..|+.++.- -... T Consensus 79 ~~~~~Eg~~~~~~-~~~f~~v~l~~~-k~~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~l~G----~g~~-- 150 (338) T protein:vir:78 79 SNEQREGGTKPLS-GTAWDTRSVAPI-KLATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAVFHG----KSPL-- 150 (338) T ss_pred ccccccccccccc-ccceeEEEEEEE-EEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcc----cCCC-- Confidence 4444556666543 344555555442 1222223322111235689999999999999999999987521 1000 Q ss_pred cccccccccCccccccccc--cCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhc-- Q lcl|NC_020078. 152 PYGTAAQMPGHSGGNVVTL--AGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGE-- 227 (339) Q Consensus 152 ~~~~~~~~~g~~~~~~~~~--~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d-- 227 (339) ++....|......... .......+...+++.+.++...+..+ . .......+++|..|..|.+...+.+.+ T Consensus 151 ---~~~~~~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~--~~~~~~~~m~~~~~~~L~~~~~l~d~~g~ 224 (338) T protein:vir:78 151 ---TGSALQGIDTNNVIVNTTNVDYLQTGTTPLLDRFLDGYDLVSAN-T--DVDFNGWAADPRYRARLLRSQAYRDANGN 224 (338) T ss_pred ---ccccccccccccccccccccccccccchhhHHHHHHHHHHhhhh-c--cccceEEEEchHHHHHHHHHhhhccCCCc Confidence 0000111111000000 01111223456678888887776532 2 222346789999999997765554432 Q ss_pred ccccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeec Q lcl|NC_020078. 228 YVTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFD 307 (339) Q Consensus 228 ~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~ 307 (339) |.-. ....+|..++++|.+|+.|+++|.....+ .......|-++|++ .......++..++.++ T Consensus 225 ~l~~--~~~~~~~~~~l~G~PV~~~~~ip~~~~~~-----~~~~~~~~~gdfs~----------~~~~~~~~~~i~~~~~ 287 (338) T protein:vir:78 225 VDPT--RINLAASAGDLLGLPVQFGKAVGGDLGAA-----TDSKVRVVGGDFSQ----------LKYGFADEIRVKMSDT 287 (338) T ss_pred eeec--ccccCCCCceeeeeeEEEccccCcccccc-----CCcccEEEEEecce----------EEEEeecccEEEEeec Confidence 1111 11335667889999999999998532211 11112233344443 2223333445555443 Q ss_pred hh-------------hh-H--HHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 308 DL-------------SK-L--WFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 308 ~~-------------~~-~--d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .. .| . -.+++.+.+|.+++||++.+.|+-..| T Consensus 288 ~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 335 (338) T protein:vir:78 288 ATLTDNTSPTPQTVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVDDED 335 (338) T ss_pred ccccccccccccchhhhhcCcEEEEEEEEeccEeecccceEEEecccC Confidence 21 11 1 235677789999999999999988777 No 60 >protein:vir:2106 Length: 430 # NCBI annotation: coat protein # Family: family:all:1412 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:NP_059630;genbank:gi:9635538;genbank:GeneID:1262831 Probab=99.52 E-value=6.3e-16 Score=103.92 Aligned_cols=291 Identities=13% Similarity=0.033 Sum_probs=167.9 Q ss_pred ccCcccchhHHHHHHHHHHHHHHHHHHhhhcccccc-c--cc---cccceEEEeccccceeeeccCCCCCCCC-CCCCcc Q lcl|NC_020078. 18 QRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPV-R--SV---RGTSTISNRGISKAKLQKIAPGTTPPPS-TEPHTS 90 (339) Q Consensus 18 ~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~-r--~i---~~G~tv~i~~iG~~t~~~~~~g~~i~~~-~~~~~~ 90 (339) ..++-+ -++++=-.|+++.|+..++|..++.+ | +. +.|+++.+|.--.....+ |.++..+ +.+... T Consensus 1 Ma~~~~----~~lti~~~eal~~~~n~lV~a~~~~~~r~~d~~~~r~Gdti~ip~p~~~~~~~---G~~~t~~~~~~~e~ 73 (430) T protein:vir:21 1 MALNEG----QIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQE---GWDLTDKATGLLEL 73 (430) T ss_pred Cccccc----hhhHHHHHHHHHHhhhhhhhhhhhhccCCchhhhhcccceEEeeccccccccc---cccccCCCccceee Confidence 222211 23333338999999999999985332 2 22 569999887654443332 4433332 123334 Q ss_pred ceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCccccccccc Q lcl|NC_020078. 91 KIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTL 170 (339) Q Consensus 91 ~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~ 170 (339) ...+++|+.+--.|.+. .+| +...+....+.+-+..+||..+|..++..+..-+- +..+...++ . T Consensus 74 ~v~~~~~~~~~V~~~~~-~kE-l~~~~~~er~l~pAm~~LA~~Vd~dl~~~~~~~~~----------~v~~~~~~t---~ 138 (430) T protein:vir:21 74 NVAVNMGEPDNDFFQLR-ADD-LRDETAYRRRIQSAARKLANNVELKVANMAAEMGS----------LVITSPDAI---G 138 (430) T ss_pred eEeEEEeeeccceEEee-hhH-hcChhhHHHHHHHHHHHHHHHHHHHHHHHhhhhhh----------ccccccCCC---C Confidence 66789998887666666 333 56777888999999999999999999877644221 110000010 0 Q ss_pred cCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcc-cchhhhcccccccceeecceeEE-EeceE Q lcl|NC_020078. 171 AGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQA-EHITNGEYVTSAGETLNTKYMFA-AFGVP 248 (339) Q Consensus 171 ~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~-~~~~n~d~~~~~~~~l~~G~v~~-i~G~~ 248 (339) ..+++ ....+-.+.+.|+++.||. +.+|.++++|+.+..|... .++-+++ ......+++|.|++ +.||+ T Consensus 139 ~~~~~------~~~~~A~a~~~L~~~~vP~-~~~R~~~~~p~~~~~l~~~l~~~~~~~--~~~~~A~r~g~i~r~~~Gfd 209 (430) T protein:vir:21 139 TNTAD------AWNFVADAEEIMFSRELNR-DMGTSYFFNPQDYKKAGYDLTKRDIFG--RIPEEAYRDGTIQRQVAGFD 209 (430) T ss_pred CCCCc------chhhHHHHHHHHHHhcCCC-CCCcEEEeChHHHHHHhhhhccccccc--cchhHHHhhcccccccchhh Confidence 11111 2355667888899999943 2589999999999998663 3444332 22234589999997 99996 Q ss_pred -EEEeccccccccccccccC---------------------------------C-----Ccccccccc------------ Q lcl|NC_020078. 249 -VITSNNAVFGKTITDHLLS---------------------------------N-----ANNEKAYDG------------ 277 (339) Q Consensus 249 -V~~Snnlp~~~~~~~~~l~---------------------------------~-----~~~~~~y~~------------ 277 (339) +++|+++|..+.+++.... . .|+.-.+++ T Consensus 210 ~~~~s~~~~~~t~gt~t~~tv~gA~~~~~~~~tv~~~g~~~~~d~~~~~it~s~tg~l~~GD~ftiaGV~~v~~itk~~~ 289 (430) T protein:vir:21 210 DVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGMKRGDKISFAGVKFLGQMAKNVL 289 (430) T ss_pred hhhhcCCcccccCccCcCceeccccccccccceeccccccccccccceeeeeecccceecccEEEecceeeecccccccc Confidence 8899999863322221110 0 000000011 Q ss_pred ------------ccc-------------------------------------------eEEEEEeccceeEEEEEee--- Q lcl|NC_020078. 278 ------------DFK-------------------------------------------DIVAQMFSPKALLAGSTIP--- 299 (339) Q Consensus 278 ------------~~~-------------------------------------------~~~~~~~h~~A~~~~~~~~--- 299 (339) +.+ -+..++|||+|+..+...- T Consensus 290 ~~l~qf~V~a~~~~ttv~I~Pai~~~~~~~~~~~~~~y~nVsaspa~~aavT~v~~a~~~~Nl~fh~~A~~La~~pl~~p 369 (430) T protein:vir:21 290 AQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPAN 369 (430) T ss_pred CCcceEEEEEecCCceeEEeecccccccccccccccccceeccccccCceeEEeccCCcccceeEccceeEEEEecccCC Confidence 000 0124999999998776532 Q ss_pred ------------------eeEEeeechhhhHHHH--HHHHHhCCccccccceEEEE-ecCC Q lcl|NC_020078. 300 ------------------VTSKIFFDDLSKLWFI--DSWLAFGVTINRTEYAGVIK-LPAA 339 (339) Q Consensus 300 ------------------~~~e~~~~~~~~~d~i--~g~~~~Ga~v~rPe~~v~i~-~~~a 339 (339) +.++.++.-+-..+.. +==..||.+.+|||.++++= =.+| T Consensus 370 ~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~~~~~~r~DilyG~~~l~Pe~a~v~l~g~~~ 430 (430) T protein:vir:21 370 HELFAGMKTTSFSIPDVGLNGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred CChhHhhheeeeeccccceEEEEEEccccccCceEEEEEeecCccccCcceEEEEcCCCCC Confidence 2222222221011111 11247999999999975443 3333 No 61 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=99.47 E-value=7.1e-15 Score=98.16 Aligned_cols=297 Identities=14% Similarity=0.049 Sum_probs=168.4 Q ss_pred cccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc-ccceeeeccCCCCC Q lcl|NC_020078. 3 IFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI-SKAKLQKIAPGTTP 81 (339) Q Consensus 3 ~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i-G~~t~~~~~~g~~i 81 (339) ...-.+++.+.+-+ ++.-.+..+.++.++.+..+..+++++++++....+ ..++||+. +.+.+.....|+++ T Consensus 1 m~~~~~~a~~~~~t------~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~-~~~~~p~~~~~~~a~~v~Eg~~~ 73 (330) T protein:vir:77 1 MAGSTVPSTQVALT------GDFSAFLTPEQSQDYFAEIEKTSIVQRIARKVPMGP-TGISIPHWTGAVSASWTGEAERK 73 (330) T ss_pred Ccccccchhhcccc------CCCcceechhHHHHHHHHHHhccchhhhcceeeccC-CceEEEEEcCCcceeEecCCCcc Confidence 22222443333333 222223346677889999999999999988766554 45778877 66777888888888 Q ss_pred CCCCCCCccceEEEEeehhhhh-hhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccc Q lcl|NC_020078. 82 PPSTEPHTSKIFLKIDTVIIAR-NAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMP 160 (339) Q Consensus 82 ~~~~~~~~~~~~l~ID~~~y~~-~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~ 160 (339) +.. .+..++.++..- ++.. ..|.+-=-.++.+|+.+.+.++.++++++..|+.++ .+....++.. +... T Consensus 74 ~~~-~~~f~~i~~~~~--k~~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l----~G~g~~~~~~---g~~~ 143 (330) T protein:vir:77 74 PIT-KGSFGKQELEPV--KITTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAI----HGIDKPSAFK---GYLA 143 (330) T ss_pred ccc-cceeeEEEEeEE--EEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhh----cccCCCCccc---cccc Confidence 764 456666666552 3332 233221111356899999999999999999998875 2221111110 0000 Q ss_pred Ccccccccc-ccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhccccc---cccee Q lcl|NC_020078. 161 GHSGGNVVT-LAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTS---AGETL 236 (339) Q Consensus 161 g~~~~~~~~-~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~---~~~~l 236 (339) +........ ....+........++.+.++..++..++.. ....+++|..|..|.+-..- +..|.-. ..... T Consensus 144 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~----~~~~vmn~~~~~~l~~lkd~-~G~~l~~~~~~~~~~ 218 (330) T protein:vir:77 144 ETTKVVSLADTNLTTASGPQGNAYLAVNNALSLLVNSGKK----WTGTLLDNVTEPILNTAVDG-NGRPLFVESTYTEQV 218 (330) T ss_pred cccccceeecccccccccccchhHHHHHHHHHhhhhcCCC----ccEEEEcHHHHHHHHHHhcc-CCceeecCccccccc Confidence 000011111 111112222345677788888888766652 22458999999988752111 0011100 00011 Q ss_pred ecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechh------- Q lcl|NC_020078. 237 NTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDL------- 309 (339) Q Consensus 237 ~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~------- 309 (339) ....-.+++|++|+.|+++|...... +...++...+-+..+...+++.++.++.. T Consensus 219 ~~~~~~~l~G~PV~~~~~~p~~~~~~------------------~~~~~~gd~s~~~i~~~~~~~i~~~~e~~~~~~~~~ 280 (330) T protein:vir:77 219 GAIREGRILGRPTYVADNVVNGTVGN------------------RVVGVMGDFSQVIWGQIGGLSFDVTDQATLDFGEEQ 280 (330) T ss_pred cccCCceecceeeEEeccccCCCCCC------------------ccEEEEEecceEEEEEecCcEEEEeecceeeecccc Confidence 12234578999999999998532111 11223333333334455555566554421 Q ss_pred -----------hh--HHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 310 -----------SK--LWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 310 -----------~~--~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .. .-.+++.+.+|.+++||++.+.|+..+| T Consensus 281 ~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~~ 323 (330) T protein:vir:77 281 GGVWVPKLISLWQHNMVAVRCEAEFAFMVNDKDAFVKLTDQVA 323 (330) T ss_pred cccccccccchhhcCcEEEEEEEEeccEEecccceEEEEeccC Confidence 01 2346778899999999999999988888 No 62 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=99.46 E-value=7.7e-15 Score=97.96 Aligned_cols=281 Identities=11% Similarity=0.095 Sum_probs=165.7 Q ss_pred cCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccccceeeeccCCCCCCCC Q lcl|NC_020078. 5 DGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISKAKLQKIAPGTTPPPS 84 (339) Q Consensus 5 ~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~~t~~~~~~g~~i~~~ 84 (339) -|.= +.....+.+.-.+..+.++.++.+..++.++++.++++..+. |++.++++...+.+..+..|++++.. T Consensus 1 ~g~~-------a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~~~~~~~~~a~~v~E~~~~~~~ 72 (299) T protein:vir:41 1 MGFN-------PDTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKAVPMT-KPEEEFTFMSGVGAFWVDEAERIQTS 72 (299) T ss_pred CCcC-------CCcccccCCCceecchhHHHHHHHHHHhcchhhhhceeeecC-CCcEEEEEEcCCceeeeecCcccccc Confidence 1111 111111111112455999999999999999999998876664 56678888888888888889888764 Q ss_pred CCCCccceEEEEeehhhhhhhHHHHHHH-hcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCcc Q lcl|NC_020078. 85 TEPHTSKIFLKIDTVIIARNAEPMLDEF-QTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHS 163 (339) Q Consensus 85 ~~~~~~~~~l~ID~~~y~~~~vdd~D~~-q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~ 163 (339) .++.++.++..-. .+....|.+ +-. ++..|+.+.+.++.+.++++..|+.++. +.... ...|.- T Consensus 73 -~~~f~~v~l~~~k-~~~~~~is~-ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~----G~g~~--------~~~gil 137 (299) T protein:vir:41 73 -KPTFTKAKMRSKK-MGVIIPTTK-ENLNYSVTNFFSLMQAEIVEAFYKKFDQAVFT----GVESP--------YNWNIL 137 (299) T ss_pred -ccceeEEEEeeEE-EEEeehhhH-HHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhh----cccCc--------cccccc Confidence 4666666666542 222233322 222 2458899999999999999999998762 11100 001111 Q ss_pred ccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceeecceeEE Q lcl|NC_020078. 164 GGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLNTKYMFA 243 (339) Q Consensus 164 ~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~~G~v~~ 243 (339) ... ... .+........++.|.++..+|...+.. + -.++++|..|..|.+-.. .+..|.... .. .+..++ T Consensus 138 ~~~--~~~-~~~~~~~~~~~~~l~~~~~~l~~~~~~-~---~~~v~n~~~~~~L~~lkd-~~G~~l~~~--~~-~~~~~~ 206 (299) T protein:vir:41 138 KSA--TDA-SNLVEETANKYDDLNEAIGLIEAEDLE-P---NGIATIRKQRVKYRSTKD-GNGMPIFNT--AT-SNGVDD 206 (299) T ss_pred ccc--ccc-ceeeccccccHHHHHHHHHhhhcccCC-c---CEEEEcHHHHHHHHHhhc-cCCceeecC--Cc-CCCCce Confidence 000 000 000111122356677787888777663 2 246999999999986321 111111111 11 233468 Q ss_pred EeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechh-------------- Q lcl|NC_020078. 244 AFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDL-------------- 309 (339) Q Consensus 244 i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~-------------- 309 (339) ++|.+|+.++++|.+. +...-|-++|+ -+..+...++..|..++.. T Consensus 207 l~G~PV~~~~~~~~~~----------~~~~~~~gdfs----------~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 266 (299) T protein:vir:41 207 VLGLPIAYTPKYTFGD----------KDISELVGDWN----------QAYYGILRGVEYEILTEATLTTVADETGKPLNL 266 (299) T ss_pred ecceeeEEecccCCCC----------CceEEEEEecc----------cEEEEEecCcEEEEeecccccccccccccchhh Confidence 9999999999998431 11111222222 2223444555666655432 Q ss_pred hhH--HHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 310 SKL--WFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 310 ~~~--d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) ... -.+++..-+|.++++|++.+.|+..+| T Consensus 267 ~~~~~~~~r~~~~~d~~v~~~~A~~~l~~~aa 298 (299) T protein:vir:41 267 AERDMAAIKATFEVGFMVVKDEAFSAVQPKAG 298 (299) T ss_pred hhcCcEEEEEEEEeccEEecccceEEEEeccC Confidence 111 224666778999999999999999999 No 63 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=99.45 E-value=9.8e-15 Score=97.39 Aligned_cols=307 Identities=10% Similarity=0.017 Sum_probs=164.2 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc-ccceeeeccCCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI-SKAKLQKIAPGT 79 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i-G~~t~~~~~~g~ 79 (339) |..+.=.-+...-+.+ .+.......++..++++.++.+..++.++++.+.++..+.+ .+.+||+. +.+++.....|. T Consensus 1 ~a~l~el~~~~~~~~~-~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~-~~~~~p~~~~~~~a~~v~eg~ 78 (333) T protein:vir:78 1 MATLNELLPNSAGSNH-QGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIPISY-GETIIPTTVKRPEVGQVGVGT 78 (333) T ss_pred CchhHHhhhhcccccc-cCceecCCccccchhHHHHHHHHHHhhchhhhhcceeeccC-CceEEEEEeCCceeEeecCcc Confidence 2222111111111111 22222223336779999999999999999999988877665 45567666 445554444343 Q ss_pred CCCC-------CCCCCccceEEEEeehhhhhh-hHHHHHHH-hcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Q lcl|NC_020078. 80 TPPP-------STEPHTSKIFLKIDTVIIARN-AEPMLDEF-QTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASD 150 (339) Q Consensus 80 ~i~~-------~~~~~~~~~~l~ID~~~y~~~-~vdd~D~~-q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~ 150 (339) .... ...+..++.++ ...|+..+ .|-+ +-. ++.+++.+.+.+++++++++..|+.++. +..... T Consensus 79 ~~~~~e~~~~~~~~~~f~~i~l--~~~kl~~~~~is~-ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~----G~g~~~ 151 (333) T protein:vir:78 79 SNEQREGGLKPLSGTAWDTRSV--SPIKLATIVTVSE-EFARMNPSGLYTKLQGDLAYAIGRGIDLAVFH----GKSPLT 151 (333) T ss_pred cccccccccccccccceeEEEE--eeEEEEEeehhhH-HHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhc----ccCCCC Confidence 2211 11233333344 33343332 2222 211 4678899999999999999999998852 111100 Q ss_pred ccccccccccCccccccc-cccC-ccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcc Q lcl|NC_020078. 151 SPYGTAAQMPGHSGGNVV-TLAG-ANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEY 228 (339) Q Consensus 151 ~~~~~~~~~~g~~~~~~~-~~~~-~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~ 228 (339) + ....|....... .... ..........++.|+++...+..+.- ......+++|..|..|++...+.+.+- T Consensus 152 ~-----~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~---~~~~~~vmn~~~~~~L~~~~~~~d~~G 223 (333) T protein:vir:78 152 G-----SALQGIDTDNVIANTTNVDYLQETGDPLLDRLLDGYDLVSANTD---VEFNGWAVDPRFRAHLLRAQAYRDANG 223 (333) T ss_pred C-----cccccccccccccccccccccccccchhHHHHHHHHHhhccccc---cCceEEEEcchHHHHHHHHhhhcCCCC Confidence 0 001111111100 0001 01111223346777777777654421 122356889999999988666554421 Q ss_pred cccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeech Q lcl|NC_020078. 229 VTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDD 308 (339) Q Consensus 229 ~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~ 308 (339) .-.-......|..++++|++|++|+++|..... +...+...|-++|++ +..+...+++.+..++. T Consensus 224 ~~i~~~~~~~~~~~~l~G~Pv~~~~~i~~~~~~-----~~~~~~~~~~gD~~~----------~~~g~~~~~~i~~~~~~ 288 (333) T protein:vir:78 224 NVDPSRINLAAQTGDVLGLPAQFGRAVGGDLGA-----AVDSKTRIIGGDFSQ----------LKFGFADEIRIKMSDTA 288 (333) T ss_pred ceeecCccccCCCceeeceeeEEccccCCCccc-----cCCCccEEEEEeccc----------EEEEEeeccEEEEeccc Confidence 111111234566789999999999999854211 111112223333333 22333344555554432 Q ss_pred ----------hhhH---HHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 309 ----------LSKL---WFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 309 ----------~~~~---d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) +.|. -.+++.+-++.++++|++.+.|+...| T Consensus 289 ~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~a 332 (333) T protein:vir:78 289 TLTDSGSATVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVDDEQ 332 (333) T ss_pred cccccccceeehhhcCcEEEEEEEEEccEEecccceEEEeccCC Confidence 1111 125677889999999999999988888 No 64 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=99.44 E-value=9.6e-15 Score=97.44 Aligned_cols=292 Identities=11% Similarity=0.040 Sum_probs=165.8 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc-c-cceeeeccCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI-S-KAKLQKIAPG 78 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i-G-~~t~~~~~~g 78 (339) +.-.-+... ....|-.....+++.-.+..+.|+.++.+..+..+.+++++++..+.+ .++.+++. + ..++.....| T Consensus 98 ~~~~~~~~~-~~~~~~~~~~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~~-~~~~~~~~~~~~~~a~~v~E~ 175 (395) T protein:vir:43 98 TSSLRGSHR-VSMPRSAITSIDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGTTES-NSVEYVRETGFVNNAAPVSEG 175 (395) T ss_pred HHHhhhhhh-hhhhhhhhcccCCCCccccchhhHHHHHHHHHhhhhHHhhccceecCC-CceEEEEEecCCCceeeecCC Confidence 000000000 011111111122222235668899999999999999999998887754 46778775 3 3566667777 Q ss_pred CCCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccc Q lcl|NC_020078. 79 TTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQ 158 (339) Q Consensus 79 ~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~ 158 (339) +.++.. .+...+.++...+... -..|.+ +-.+...++.+.+.++.+.++++..|..++. +..... . T Consensus 176 ~~~~~~-~~~~~~i~~~~~k~~~-~~~is~-ell~d~~~l~~~v~~~la~a~~~~~d~~~l~----G~g~~~-------~ 241 (395) T protein:vir:43 176 TQKPYS-DLTFELENAPVRTIAH-LFKASR-QILDDASALQSYIDARARYGLMLVEECQLLY----GNGTGA-------N 241 (395) T ss_pred cccccc-ccceeEEEEeeeeEEE-eehhhH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHHHh----ccCCCC-------c Confidence 777654 4566666666643221 112221 1223334688889999999999999998752 221111 1 Q ss_pred ccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceeec Q lcl|NC_020078. 159 MPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLNT 238 (339) Q Consensus 159 ~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~~ 238 (339) ..|............+........++.+.++...+...+.. .-.+|++|..|..|.+-.. .+..|... ...+ T Consensus 242 ~~Gi~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~----~~~~vmn~~~~~~l~~lkd-~~G~~i~~---~~~~ 313 (395) T protein:vir:43 242 LHGIIPQAQAYAPPSGVVVTAEQRIDRIRLAILQAQLAEFP----ASGIVLNPIDWALIELNKD-AENRYIIG---SPQN 313 (395) T ss_pred cccccccccccccccccccccchhHHHHHHHHHhhccccCC----CcEEEEcHHHHHHHHHhhc-cCCceecc---cccc Confidence 11221111111111222233345677788888888766552 1256899999998865321 11122221 1235 Q ss_pred ceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechh-hhHH---H Q lcl|NC_020078. 239 KYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDL-SKLW---F 314 (339) Q Consensus 239 G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~-~~~d---~ 314 (339) |....++|.+|++|+.+|.+.. +-++|+. ++..+...++++++.+... .|.. . T Consensus 314 ~~~~~l~G~pVv~~~~~~~~~~--------------~~gd~~~---------~~~~~~~~~~~i~~~~~~~~~f~~~~~~ 370 (395) T protein:vir:43 314 GTTPTLWRLPVVETQAITQDEF--------------LTGAFSL---------GAQIFDRMDIEVLVSTENDKDFENNMVT 370 (395) T ss_pred CCCceecceeeEEcCCCCCCcE--------------EEEeccc---------eEEEEEecceEEEEeccccchhhcCcEE Confidence 6667899999999999985421 1122222 1222222344556555432 2322 4 Q ss_pred HHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 315 IDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 315 i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) +++.+.+|.++++|++.+.+.+++| T Consensus 371 ~r~~~r~d~~v~~~~a~~~~~~taa 395 (395) T protein:vir:43 371 IRAEERLAFAVYRPEAFVTGSLTAS 395 (395) T ss_pred EEEEEeeccEEecccceEEEEeccC Confidence 5666789999999999999999999 No 65 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=99.41 E-value=9.3e-15 Score=97.51 Aligned_cols=289 Identities=12% Similarity=0.056 Sum_probs=165.2 Q ss_pred Cccc---cCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc-c-cceeeec Q lcl|NC_020078. 1 MSIF---DGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI-S-KAKLQKI 75 (339) Q Consensus 1 ~~~~---~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i-G-~~t~~~~ 75 (339) +..+ .+.+..... |-.....+++.-.+..+.++.++.+.....+.+++++++..+. |.++++++. + ..++... T Consensus 86 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~a~~v 163 (385) T protein:vir:19 86 IKSWDGKQGTFGAKTF-NKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTS-SNALEYVREEVFTNNADVV 163 (385) T ss_pred HHHHHHhhccchhhHH-HhhhccccccCCceecchhhhHHHHHhhhccchhhhcceeccc-CcceEEEEEecCCcceeee Confidence 1111 111111111 1111111222222455888999999999999999998887754 457888886 3 3566666 Q ss_pred cCCCCCCCCCCCCccceEEEEeehhhhhh-hHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Q lcl|NC_020078. 76 APGTTPPPSTEPHTSKIFLKIDTVIIARN-AEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYG 154 (339) Q Consensus 76 ~~g~~i~~~~~~~~~~~~l~ID~~~y~~~-~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~ 154 (339) ..|+.++.. .+...+.++.... +..+ .|.+ +-.+...++.+.+.++.+.++++..|+.++ .+..... T Consensus 164 ~E~~~~~~~-~~~~~~~~~~~~k--~~~~~~is~-ell~d~~~l~~~i~~~la~a~~~~~d~~~l----~G~g~~~---- 231 (385) T protein:vir:19 164 AEKALKPES-DITFSKQTANVKT--IAHWVQASR-QVMDDAPMLQSYINNRLMYGLALKEEGQLL----NGDGTGD---- 231 (385) T ss_pred ccCcccccc-ccceeEEEEeeee--EEEeehhhH-HHHhhHHHHHHHHHHHHHHHHHHHHHHHHH----hccCCCC---- Confidence 777777654 3566666666543 2222 2221 122223568889999999999999998875 2221111 Q ss_pred ccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccc Q lcl|NC_020078. 155 TAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGE 234 (339) Q Consensus 155 ~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~ 234 (339) ...|....... ...+...+....++.|.++..+|...+.. .-.++++|..|..|.+-..- +..|...+ T Consensus 232 ---~~~Gi~~~~~~--~~~~~~~~~~~~~d~i~~~~~~l~~~~~~----~~~~~~~~~~~~~l~~lkd~-~G~~l~~~-- 299 (385) T protein:vir:19 232 ---NLEGLNKVATA--YDTSLNATGDTRADIIAHAIYQVTESEFS----ASGIVLNPRDWHNIALLKDN-EGRYIFGG-- 299 (385) T ss_pred ---ccccccccccc--ccccccccccchHHHHHHHHHhhccccCC----CCEEEEcHHHHHHHHHhhcC-CCceeccC-- Confidence 11121111110 01111112233567788888888766542 22568999999998663221 12222111 Q ss_pred eeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeech-hhh-H Q lcl|NC_020078. 235 TLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDD-LSK-L 312 (339) Q Consensus 235 ~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~-~~~-~ 312 (339) ..+|....++|.+|++|+.+|.+.. +-++| +.++..+..++++++..+.. ..| - T Consensus 300 -~~~~~~~~l~G~pV~~~~~~p~~~~--------------~~gd~---------~~~~~~~~~~~~~v~~~~~~~~~~~~ 355 (385) T protein:vir:19 300 -PQAFTSNIMWGLPVVPTKAQAAGTF--------------TVGGF---------DMASQVWDRMDATVEVSREDRDNFVK 355 (385) T ss_pred -cccCCCceecceeeEEcCcCCCCcE--------------EEeec---------ccEEEEEEecceEEEEeccccchhhc Confidence 2356667899999999999984321 11122 22344445555666665543 222 2 Q ss_pred --HHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 313 --WFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 313 --d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) ..+++.+.+|.++++|++++.+++++| T Consensus 356 ~~~~~~~~~r~~~~v~~~~a~~~~~~~aa 384 (385) T protein:vir:19 356 NMLTILCEERLALAHYRPTAIIKGTFSSG 384 (385) T ss_pred CcEEEEEEEeeccEEecccceEEEEeccC Confidence 245667789999999999999999999 No 66 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=99.41 E-value=9.3e-15 Score=97.51 Aligned_cols=289 Identities=12% Similarity=0.056 Sum_probs=165.2 Q ss_pred Cccc---cCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc-c-cceeeec Q lcl|NC_020078. 1 MSIF---DGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI-S-KAKLQKI 75 (339) Q Consensus 1 ~~~~---~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i-G-~~t~~~~ 75 (339) +..+ .+.+..... |-.....+++.-.+..+.++.++.+.....+.+++++++..+. |.++++++. + ..++... T Consensus 86 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~a~~v 163 (385) T protein:vir:18 86 IKSWDGKQGTFGAKTF-NKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTS-SNALEYVREEVFTNNADVV 163 (385) T ss_pred HHHHHHhhccchhhHH-HhhhccccccCCceecchhhhHHHHHhhhccchhhhcceeccc-CcceEEEEEecCCcceeee Confidence 1111 111111111 1111111222222455888999999999999999998887754 457888886 3 3566666 Q ss_pred cCCCCCCCCCCCCccceEEEEeehhhhhh-hHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Q lcl|NC_020078. 76 APGTTPPPSTEPHTSKIFLKIDTVIIARN-AEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYG 154 (339) Q Consensus 76 ~~g~~i~~~~~~~~~~~~l~ID~~~y~~~-~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~ 154 (339) ..|+.++.. .+...+.++.... +..+ .|.+ +-.+...++.+.+.++.+.++++..|+.++ .+..... T Consensus 164 ~E~~~~~~~-~~~~~~~~~~~~k--~~~~~~is~-ell~d~~~l~~~i~~~la~a~~~~~d~~~l----~G~g~~~---- 231 (385) T protein:vir:18 164 AEKALKPES-DITFSKQTANVKT--IAHWVQASR-QVMDDAPMLQSYINNRLMYGLALKEEGQLL----NGDGTGD---- 231 (385) T ss_pred ccCcccccc-ccceeEEEEeeee--EEEeehhhH-HHHhhHHHHHHHHHHHHHHHHHHHHHHHHH----hccCCCC---- Confidence 777777654 3566666666543 2222 2221 122223568889999999999999998875 2221111 Q ss_pred ccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccc Q lcl|NC_020078. 155 TAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGE 234 (339) Q Consensus 155 ~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~ 234 (339) ...|....... ...+...+....++.|.++..+|...+.. .-.++++|..|..|.+-..- +..|...+ T Consensus 232 ---~~~Gi~~~~~~--~~~~~~~~~~~~~d~i~~~~~~l~~~~~~----~~~~~~~~~~~~~l~~lkd~-~G~~l~~~-- 299 (385) T protein:vir:18 232 ---NLEGLNKVATA--YDTSLNATGDTRADIIAHAIYQVTESEFS----ASGIVLNPRDWHNIALLKDN-EGRYIFGG-- 299 (385) T ss_pred ---ccccccccccc--ccccccccccchHHHHHHHHHhhccccCC----CCEEEEcHHHHHHHHHhhcC-CCceeccC-- Confidence 11121111110 01111112233567788888888766542 22568999999998663221 12222111 Q ss_pred eeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeech-hhh-H Q lcl|NC_020078. 235 TLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDD-LSK-L 312 (339) Q Consensus 235 ~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~-~~~-~ 312 (339) ..+|....++|.+|++|+.+|.+.. +-++| +.++..+..++++++..+.. ..| - T Consensus 300 -~~~~~~~~l~G~pV~~~~~~p~~~~--------------~~gd~---------~~~~~~~~~~~~~v~~~~~~~~~~~~ 355 (385) T protein:vir:18 300 -PQAFTSNIMWGLPVVPTKAQAAGTF--------------TVGGF---------DMASQVWDRMDATVEVSREDRDNFVK 355 (385) T ss_pred -cccCCCceecceeeEEcCcCCCCcE--------------EEeec---------ccEEEEEEecceEEEEeccccchhhc Confidence 2356667899999999999984321 11122 22344445555666665543 222 2 Q ss_pred --HHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 313 --WFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 313 --d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) ..+++.+.+|.++++|++++.+++++| T Consensus 356 ~~~~~~~~~r~~~~v~~~~a~~~~~~~aa 384 (385) T protein:vir:18 356 NMLTILCEERLALAHYRPTAIIKGTFSSG 384 (385) T ss_pred CcEEEEEEEeeccEEecccceEEEEeccC Confidence 245667789999999999999999999 No 67 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=99.39 E-value=1.5e-14 Score=96.41 Aligned_cols=296 Identities=12% Similarity=0.058 Sum_probs=164.8 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhcccccccccccc-ceEEEec-cccceeeeccCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGT-STISNRG-ISKAKLQKIAPG 78 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G-~tv~i~~-iG~~t~~~~~~g 78 (339) ..-|..-.-+-+-.+.+.. ..++--.+.-+.++.++.+..+..+.+++++++..+.+| .++.++. .+...+.....| T Consensus 106 ~~~~~~~~~~~~~~~~~~~-~~~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg 184 (415) T protein:vir:94 106 VRDFTEYLETRNDIQGGSL-KTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEEL 184 (415) T ss_pred HHHHHHHhhhhhhhhhhcc-ccccccccCcHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEeecCCccceecccc Confidence 0001000000011111110 111111133488999999999999999999998887754 3444544 355566677777 Q ss_pred CCCCCCCCCCccceEEEEeehhhhhhh-HHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Q lcl|NC_020078. 79 TTPPPSTEPHTSKIFLKIDTVIIARNA-EPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAA 157 (339) Q Consensus 79 ~~i~~~~~~~~~~~~l~ID~~~y~~~~-vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~ 157 (339) ..++....+...+.++.+- ++..+. |.+-=--++.+|+.+.+.+++++++++..|+.|+.-.-.+ T Consensus 185 ~~~~~~~~~~~~~i~~~~~--k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~il~g~g~g------------ 250 (415) T protein:vir:94 185 EENPELAVKPFFQLAYDIN--THRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKG------------ 250 (415) T ss_pred ccccccccccceeeEeehe--eeeeechhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccC------------ Confidence 7776433344455555443 444332 2221111356899999999999999999999886332110 Q ss_pred cccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceee Q lcl|NC_020078. 158 QMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLN 237 (339) Q Consensus 158 ~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~ 237 (339) ...+........ ......+....|+.|+++..++...+.. + . .+|++|..|..|.+-..- +..|.-. ..+. T Consensus 251 ~~~~~~~~~~~~--~~~~~~~~~~~~~~i~~~~~~~~~~~~~-~--~-~~vmn~~~~~~l~~lkd~-~G~~l~~--~~~~ 321 (415) T protein:vir:94 251 STGSTSSGFEKE--GKKLEVKKAKSLDDIKDAINLNVKPNYE-H--N-VAIVSQTMFAKLDKMKDK-LGNYLIQ--PDVK 321 (415) T ss_pred cccccccccccc--ccccccccccchHHHHHHHHhhhhhccC-C--C-EEEEcHHHHHHHHHhhcc-CCCeeec--cCcC Confidence 000001111100 0111111223356677777777665552 1 2 458899999999652211 1112111 1234 Q ss_pred cceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhHHHHHH Q lcl|NC_020078. 238 TKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWFIDS 317 (339) Q Consensus 238 ~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~i~g 317 (339) +|..++++|.+|+.++++|.+.. ++..-+-++| +.++..+...+++++..+. ..+...+++ T Consensus 322 ~~~~~~l~G~pV~~~~~~~~~~~---------~~~~i~~gd~---------~~~~~~~~~~~~~v~~~~~-~~~~~~~r~ 382 (415) T protein:vir:94 322 EKTQQRLLGAKIEILPDEVLGQK---------GNNTLIIGNL---------KDAIVLFDRSQYQASWTDY-MHFGECLMI 382 (415) T ss_pred CCCCceecceeeEEecccccCCC---------CccEEEEEeh---------hccEEEEeecceEEEEecc-ccCceEEEE Confidence 67778899999999999874321 1111122222 2233344445555554432 344556788 Q ss_pred HHHhCCccccccceEEEEecCC Q lcl|NC_020078. 318 WLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 318 ~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .+-++.++++|++++.++++.+ T Consensus 383 ~~r~d~~~~~~~a~~~~~~~~~ 404 (415) T protein:vir:94 383 AVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred EEEeccEEeccccEEEEEEecc Confidence 8999999999999999999988 No 68 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=99.37 E-value=2.7e-14 Score=95.00 Aligned_cols=296 Identities=13% Similarity=0.077 Sum_probs=160.8 Q ss_pred CccccCcccCC-------CcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccc-eEEEe-ccccce Q lcl|NC_020078. 1 MSIFDGQTPSY-------DVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTS-TISNR-GISKAK 71 (339) Q Consensus 1 ~~~~~~~~~~~-------~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~-tv~i~-~iG~~t 71 (339) ....+.....| +..+.+.. .+.+--.+.-+.|+.++.+..+..+.+++++++..+.+|. ++.+. ..+... T Consensus 99 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 177 (415) T protein:vir:46 99 TKVTSQEVRDFTEYLETRNDIQGGSL-KTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAA 177 (415) T ss_pred hhhhHHHHHHHHHHHhhhhhhhhccc-cccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcc Confidence 00000000000 00000000 1111112444899999999999999999999888776543 33333 334556 Q ss_pred eeeccCCCCCCCCCCCCccceEEEEeehhhhhh-hHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Q lcl|NC_020078. 72 LQKIAPGTTPPPSTEPHTSKIFLKIDTVIIARN-AEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASD 150 (339) Q Consensus 72 ~~~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~-~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~ 150 (339) +.....|..++....+..++.++.. .++..+ .|.+-=-.++.+|+.+.+.+++++++++..|+.|+.-.-.+ T Consensus 178 ~~~v~Eg~~~~~~~~~~~~~v~~~~--~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g----- 250 (415) T protein:vir:46 178 LEKVEELEENPELAVKPFFQLAYDI--NTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKG----- 250 (415) T ss_pred eeecccccccccccccceeeEEeee--eeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccC----- Confidence 6666777777643334445555544 333332 22221112356889999999999999999999886322110 Q ss_pred ccccccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccc Q lcl|NC_020078. 151 SPYGTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVT 230 (339) Q Consensus 151 ~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~ 230 (339) ...+........ ......+....++.|.++...+...... + . .+|++|..|..|.+-.. .+..|.- T Consensus 251 -------~~~~~~~~~~~~--~~~~~~~~~~~~~~i~~~~~~~~~~~~~-~--~-~~v~n~~~~~~L~~lkd-~~G~~i~ 316 (415) T protein:vir:46 251 -------STGSTSSGFEKE--GKKLEVKKAKSLDDIKDAINLNVKPNYE-H--N-VAIVSQTMFAKLDKMKD-KLGNYLI 316 (415) T ss_pred -------Cccccccccccc--cceeccccccchHHHHHHHHhhhhhccC-C--C-EEEEcHHHHHHHHHhhc-cCCCeee Confidence 001111111100 0111111122345666776666554442 1 2 45899999999865211 1122222 Q ss_pred cccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhh Q lcl|NC_020078. 231 SAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLS 310 (339) Q Consensus 231 ~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~ 310 (339) .. .+.+|..++++|++|+.+++.|.+.. ++..-+-++|++ ++..+...+++++..+. .. T Consensus 317 ~~--~~~~~~~~~l~G~pV~~~~~~~~~~~---------~~~~~~~gd~~~---------~~~~~~~~~~~v~~~~~-~~ 375 (415) T protein:vir:46 317 QP--DVKEKTQQRLLGAKIEILPDEVLGQK---------GNNTLIIGNLKD---------AIVLFDRSQYQASWTDY-MH 375 (415) T ss_pred cc--CcCCCCCccccceeeEEeccccccCC---------CccEEEEEehhc---------cEEEEeecceEEEeecc-cc Confidence 11 23466777899999999998874322 111122223332 33334444555554432 33 Q ss_pred hHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 311 KLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 311 ~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) +...+++.+-++.++++|++.+.++++++ T Consensus 376 ~~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:46 376 FGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred CceEEEEEEEeccEEeccccEEEEEeecc Confidence 34457888999999999999999998888 No 69 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=99.37 E-value=2.7e-14 Score=95.00 Aligned_cols=296 Identities=13% Similarity=0.077 Sum_probs=160.8 Q ss_pred CccccCcccCC-------CcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccc-eEEEe-ccccce Q lcl|NC_020078. 1 MSIFDGQTPSY-------DVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTS-TISNR-GISKAK 71 (339) Q Consensus 1 ~~~~~~~~~~~-------~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~-tv~i~-~iG~~t 71 (339) ....+.....| +..+.+.. .+.+--.+.-+.|+.++.+..+..+.+++++++..+.+|. ++.+. ..+... T Consensus 99 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 177 (415) T protein:vir:47 99 TKVTSQEVRDFTEYLETRNDIQGGSL-KTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAA 177 (415) T ss_pred hhhhHHHHHHHHHHHhhhhhhhhccc-cccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcc Confidence 00000000000 00000000 1111112444899999999999999999999888776543 33333 334556 Q ss_pred eeeccCCCCCCCCCCCCccceEEEEeehhhhhh-hHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Q lcl|NC_020078. 72 LQKIAPGTTPPPSTEPHTSKIFLKIDTVIIARN-AEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASD 150 (339) Q Consensus 72 ~~~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~-~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~ 150 (339) +.....|..++....+..++.++.. .++..+ .|.+-=-.++.+|+.+.+.+++++++++..|+.|+.-.-.+ T Consensus 178 ~~~v~Eg~~~~~~~~~~~~~v~~~~--~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g----- 250 (415) T protein:vir:47 178 LEKVEELEENPELAVKPFFQLAYDI--NTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKG----- 250 (415) T ss_pred eeecccccccccccccceeeEEeee--eeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccC----- Confidence 6666777777643334445555544 333332 22221112356889999999999999999999886322110 Q ss_pred ccccccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccc Q lcl|NC_020078. 151 SPYGTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVT 230 (339) Q Consensus 151 ~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~ 230 (339) ...+........ ......+....++.|.++...+...... + . .+|++|..|..|.+-.. .+..|.- T Consensus 251 -------~~~~~~~~~~~~--~~~~~~~~~~~~~~i~~~~~~~~~~~~~-~--~-~~v~n~~~~~~L~~lkd-~~G~~i~ 316 (415) T protein:vir:47 251 -------STGSTSSGFEKE--GKKLEVKKAKSLDDIKDAINLNVKPNYE-H--N-VAIVSQTMFAKLDKMKD-KLGNYLI 316 (415) T ss_pred -------Cccccccccccc--cceeccccccchHHHHHHHHhhhhhccC-C--C-EEEEcHHHHHHHHHhhc-cCCCeee Confidence 001111111100 0111111122345666776666554442 1 2 45899999999865211 1122222 Q ss_pred cccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhh Q lcl|NC_020078. 231 SAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLS 310 (339) Q Consensus 231 ~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~ 310 (339) .. .+.+|..++++|++|+.+++.|.+.. ++..-+-++|++ ++..+...+++++..+. .. T Consensus 317 ~~--~~~~~~~~~l~G~pV~~~~~~~~~~~---------~~~~~~~gd~~~---------~~~~~~~~~~~v~~~~~-~~ 375 (415) T protein:vir:47 317 QP--DVKEKTQQRLLGAKIEILPDEVLGQK---------GNNTLIIGNLKD---------AIVLFDRSQYQASWTDY-MH 375 (415) T ss_pred cc--CcCCCCCccccceeeEEeccccccCC---------CccEEEEEehhc---------cEEEEeecceEEEeecc-cc Confidence 11 23466777899999999998874322 111122223332 33334444555554432 33 Q ss_pred hHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 311 KLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 311 ~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) +...+++.+-++.++++|++.+.++++++ T Consensus 376 ~~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:47 376 FGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred CceEEEEEEEeccEEeccccEEEEEeecc Confidence 34457888999999999999999998888 No 70 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=99.37 E-value=3.8e-14 Score=94.19 Aligned_cols=285 Identities=14% Similarity=0.065 Sum_probs=158.0 Q ss_pred CccccCcccCCCcccCCccCcccchhH-HHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc-ccceeeeccCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLA-DVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI-SKAKLQKIAPG 78 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a-~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i-G~~t~~~~~~g 78 (339) |+ .++++.-. +..++++.++.+..++.|+++.+.++.... +..++||+. |.+.+.-+..| T Consensus 1 Ma-----------------~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~-~~~~~ip~~~~~~~a~wv~Eg 62 (315) T protein:vir:80 1 MA-----------------DDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTI-FGPVKGAVFSGVPRAKIVGEG 62 (315) T ss_pred CC-----------------CCcCCcCceEcchHHHHHHHHHHHhhchhhhhcceeecC-CCceEEEEEeCCcceEEeeCC Confidence 32 11111111 234899999999999999999988766554 456788885 56788888888 Q ss_pred CCCCCCCCCCccceEEEEeehhhhhh-hHHHHHHHhcCcc----hHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Q lcl|NC_020078. 79 TTPPPSTEPHTSKIFLKIDTVIIARN-AEPMLDEFQTDFD----YQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPY 153 (339) Q Consensus 79 ~~i~~~~~~~~~~~~l~ID~~~y~~~-~vdd~D~~q~~~d----~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~ 153 (339) +.++.. .+..++.++.. .|+..+ .|-+-=-.++..| +++.+.++.+++|++.+|+.++. +. ++. T Consensus 63 ~~~~~s-~~~f~~v~l~~--~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~----G~---~~~- 131 (315) T protein:vir:80 63 EVKPSA-SVDVSAFTAQP--IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFH----GI---DPA- 131 (315) T ss_pred cccccc-ccceeeeEeee--eeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheee----cc---CCC- Confidence 888754 45666655543 233222 2221111122333 77889999999999999987752 10 000 Q ss_pred cccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccch----hhhccc Q lcl|NC_020078. 154 GTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHI----TNGEYV 229 (339) Q Consensus 154 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~----~n~d~~ 229 (339) .+....|...... ............++-+.++..++...+...+ . ..+++|..+..|.+-... .+..|. T Consensus 132 -~~~~~~~~~~~~~---~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~--~-~~imn~~~~~~L~~l~~~~g~~~~g~~~ 204 (315) T protein:vir:80 132 -TGKAASAVHTSLN---KTKNIVDATDSATADLVKAVGLIAGAGLQVP--N-GVALDPAFSFALSTEVYPKGSPLAGQPM 204 (315) T ss_pred -CCccccccccccc---cccceeeccccchHHHHHHHHHHhhccCccc--e-EEEEcHHHHHHHHHHhhccCCccccccc Confidence 0011111111100 0011111112234445666666655544322 2 357899999999754221 111111 Q ss_pred ccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechh Q lcl|NC_020078. 230 TSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDL 309 (339) Q Consensus 230 ~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~ 309 (339) - ..+..|..++++|.+|+.|+++|....... ......+-++|++. .++..+. +..|+.++.. T Consensus 205 ~---~~~~~g~~~tl~G~PV~~~~~~~~~~~~~~-----~~~~~~~~GDfs~~--------~~g~~~~--~~i~i~~~~~ 266 (315) T protein:vir:80 205 Y---PAAGFAGLDNWRGLNVGASSTVSGAPEMSP-----ASGVKAIVGDFSRV--------HWGFQRN--FPIELIEYGD 266 (315) T ss_pred c---cccccCCCceecceeeEecCcCCccccccc-----ccccEEEEeecccE--------EEEEecC--eeEEEecccc Confidence 1 123345567899999999999985433211 11122333455442 2333332 3344443321 Q ss_pred -------hh-H--HHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 310 -------SK-L--WFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 310 -------~~-~--d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .| . -.+++.+.+|.++++|++.+.|+-.+| T Consensus 267 ~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~a 306 (315) T protein:vir:80 267 PDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAA 306 (315) T ss_pred ccCcccchhhcCcEEEEEEEEecceeecccceEEEeeccC Confidence 11 1 235667789999999999999998888 No 71 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=99.37 E-value=8.4e-14 Score=92.28 Aligned_cols=283 Identities=12% Similarity=0.060 Sum_probs=166.0 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc-ccceeeeccCCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI-SKAKLQKIAPGT 79 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i-G~~t~~~~~~g~ 79 (339) |. -+.+.+.|++-+..+ -.+.-+.++.++.+..++.+++++++++..+. +++++||+. +...+.-+..++ T Consensus 1 ma--~~~~~~~~~~~t~~g------g~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~E~~ 71 (304) T protein:vir:94 1 MA--TPTYTPGNVILSDFK------NGVIPAEQGTLIMKDIMANSAIMKLAKNEPMT-AQKKKFTYLAKGVGAYWVSETE 71 (304) T ss_pred Cc--ccccccccccccCCC------ceecchhHHHHHHHHHHhccchhhhcceeecc-CCceEEEEEeCCcceEEeecCc Confidence 32 233344444444211 12566889999999999999999998777655 456788887 566777777788 Q ss_pred CCCCCCCCCccceEEEEeehhhhhh-hHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccc Q lcl|NC_020078. 80 TPPPSTEPHTSKIFLKIDTVIIARN-AEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQ 158 (339) Q Consensus 80 ~i~~~~~~~~~~~~l~ID~~~y~~~-~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~ 158 (339) +++.. .+..++.++..- ++..+ .|.+-=-.++.+|+.+.+.++.++++++..|+.++. +.....+ T Consensus 72 ~~~~~-~~~~~~i~~~~~--k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~----G~g~~~~------- 137 (304) T protein:vir:94 72 RIQTS-KPEYAQAEMEAK--KIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIF----GTKSPYN------- 137 (304) T ss_pred ccccc-cceeeEEEEEEE--EEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhee----ccCCCcc------- Confidence 87754 456666666553 33332 232211113568999999999999999999988742 1111100 Q ss_pred ccCcccccccc-ccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceee Q lcl|NC_020078. 159 MPGHSGGNVVT-LAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLN 237 (339) Q Consensus 159 ~~g~~~~~~~~-~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~ 237 (339) ......+.... ........+....|+.|.++..++...+... ..++++|..|..|.+- .+. .+..+- T Consensus 138 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~----~~~v~~~~~~~~L~~l---kd~-----~G~~l~ 205 (304) T protein:vir:94 138 TSTSGKPLVEGAEEKGNVVTDTNNLYVDLSALMATIEDEELDP----NGVLTTRSFRSKMRNA---LDA-----NDRPLF 205 (304) T ss_pred cccccccccccccccccccccccchHHHHHHHHHHhhhccCCc----CEEEEcHHHHHHHHHh---hcc-----CCcEee Confidence 00000010000 0111112233445778888888887776532 2568999999999752 221 122344 Q ss_pred cceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeech--------h Q lcl|NC_020078. 238 TKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDD--------L 309 (339) Q Consensus 238 ~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~--------~ 309 (339) ....++++|.+|+.++++|.... .+..+-++|++. ..+...+++.++.++. + T Consensus 206 ~~~~~~l~G~PV~~~~~~~~~~~----------~~~~~~gd~~~~----------~~~~~~~~~i~~~~e~~~~~~~~~~ 265 (304) T protein:vir:94 206 DANGNEIMGLPLSYTGADVYDKK----------KSLALMGDWDYA----------RYGILQGIEYAISEDATLTTLQASD 265 (304) T ss_pred cCCCccccceeeEEecccccCCC----------CcEEEEEehhhE----------EEEEecceEEEEeecceeeeecccc Confidence 55567899999999999984311 111222333332 1222233333333321 1 Q ss_pred -------hhH---HHHHHHHHhCCccccccceEEEEecC Q lcl|NC_020078. 310 -------SKL---WFIDSWLAFGVTINRTEYAGVIKLPA 338 (339) Q Consensus 310 -------~~~---d~i~g~~~~Ga~v~rPe~~v~i~~~~ 338 (339) .|. -.+++.+.+|..+++|++.+.|+.+- T Consensus 266 ~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 266 ASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred cCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 121 23566778999999999999997766 No 72 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=99.37 E-value=8.4e-14 Score=92.28 Aligned_cols=283 Identities=12% Similarity=0.060 Sum_probs=166.0 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc-ccceeeeccCCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI-SKAKLQKIAPGT 79 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i-G~~t~~~~~~g~ 79 (339) |. -+.+.+.|++-+..+ -.+.-+.++.++.+..++.+++++++++..+. +++++||+. +...+.-+..++ T Consensus 1 ma--~~~~~~~~~~~t~~g------g~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~E~~ 71 (304) T protein:vir:10 1 MA--TPTYTPGNVILSDFK------NGVIPAEQGTLIMKDIMANSAIMKLAKNEPMT-AQKKKFTYLAKGVGAYWVSETE 71 (304) T ss_pred Cc--ccccccccccccCCC------ceecchhHHHHHHHHHHhccchhhhcceeecc-CCceEEEEEeCCcceEEeecCc Confidence 32 233344444444211 12566889999999999999999998777655 456788887 566777777788 Q ss_pred CCCCCCCCCccceEEEEeehhhhhh-hHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccc Q lcl|NC_020078. 80 TPPPSTEPHTSKIFLKIDTVIIARN-AEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQ 158 (339) Q Consensus 80 ~i~~~~~~~~~~~~l~ID~~~y~~~-~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~ 158 (339) +++.. .+..++.++..- ++..+ .|.+-=-.++.+|+.+.+.++.++++++..|+.++. +.....+ T Consensus 72 ~~~~~-~~~~~~i~~~~~--k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~----G~g~~~~------- 137 (304) T protein:vir:10 72 RIQTS-KPEYAQAEMEAK--KIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIF----GTKSPYN------- 137 (304) T ss_pred ccccc-cceeeEEEEEEE--EEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhee----ccCCCcc------- Confidence 87754 456666666553 33332 232211113568999999999999999999988742 1111100 Q ss_pred ccCcccccccc-ccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceee Q lcl|NC_020078. 159 MPGHSGGNVVT-LAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLN 237 (339) Q Consensus 159 ~~g~~~~~~~~-~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~ 237 (339) ......+.... ........+....|+.|.++..++...+... ..++++|..|..|.+- .+. .+..+- T Consensus 138 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~----~~~v~~~~~~~~L~~l---kd~-----~G~~l~ 205 (304) T protein:vir:10 138 TSTSGKPLVEGAEEKGNVVTDTNNLYVDLSALMATIEDEELDP----NGVLTTRSFRSKMRNA---LDA-----NDRPLF 205 (304) T ss_pred cccccccccccccccccccccccchHHHHHHHHHHhhhccCCc----CEEEEcHHHHHHHHHh---hcc-----CCcEee Confidence 00000010000 0111112233445778888888887776532 2568999999999752 221 122344 Q ss_pred cceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeech--------h Q lcl|NC_020078. 238 TKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDD--------L 309 (339) Q Consensus 238 ~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~--------~ 309 (339) ....++++|.+|+.++++|.... .+..+-++|++. ..+...+++.++.++. + T Consensus 206 ~~~~~~l~G~PV~~~~~~~~~~~----------~~~~~~gd~~~~----------~~~~~~~~~i~~~~e~~~~~~~~~~ 265 (304) T protein:vir:10 206 DANGNEIMGLPLSYTGADVYDKK----------KSLALMGDWDYA----------RYGILQGIEYAISEDATLTTLQASD 265 (304) T ss_pred cCCCccccceeeEEecccccCCC----------CcEEEEEehhhE----------EEEEecceEEEEeecceeeeecccc Confidence 55567899999999999984311 111222333332 1222233333333321 1 Q ss_pred -------hhH---HHHHHHHHhCCccccccceEEEEecC Q lcl|NC_020078. 310 -------SKL---WFIDSWLAFGVTINRTEYAGVIKLPA 338 (339) Q Consensus 310 -------~~~---d~i~g~~~~Ga~v~rPe~~v~i~~~~ 338 (339) .|. -.+++.+.+|..+++|++.+.|+.+- T Consensus 266 ~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 266 ASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred cCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 121 23566778999999999999997766 No 73 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=99.36 E-value=4.3e-14 Score=93.85 Aligned_cols=293 Identities=14% Similarity=0.082 Sum_probs=158.6 Q ss_pred CccccCccc---CCC-cccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc-ccceeeec Q lcl|NC_020078. 1 MSIFDGQTP---SYD-VTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI-SKAKLQKI 75 (339) Q Consensus 1 ~~~~~~~~~---~~~-~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i-G~~t~~~~ 75 (339) ..+-.|... ... ..+....-.+++..-+--+++...+.....+.++++.+.++....+|+.+.|++. |.+++.-. T Consensus 91 ~~~r~g~~~~~~~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v 170 (392) T protein:vir:13 91 AVLRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGASTFTTSDANPMDFTVITGRATAGIV 170 (392) T ss_pred HHHhccchhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCcceeee Confidence 000011100 000 0000000011111112226677777777788899999888777777888888776 45677777 Q ss_pred cCCCCCCCCCCCCccceEEEEeehhhhhh-hHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Q lcl|NC_020078. 76 APGTTPPPSTEPHTSKIFLKIDTVIIARN-AEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYG 154 (339) Q Consensus 76 ~~g~~i~~~~~~~~~~~~l~ID~~~y~~~-~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~ 154 (339) ..|+.++.. .+..++.++..- ++..+ .|.+-=--++.+|+.+.+.++.++++++..|+.++. +... + T Consensus 171 ~E~~~~~~~-~~~f~~v~~~~~--k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~----G~Gt-~---- 238 (392) T protein:vir:13 171 GETAEIPES-YPATTQRSMGGF--KYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFLT----GTGT-G---- 238 (392) T ss_pred ccccccccc-ccceeeEEeeee--eEEeeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhc----ccCC-c---- Confidence 888888654 456666666553 33333 232211113677899999999999999999998752 1111 0 Q ss_pred ccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccc Q lcl|NC_020078. 155 TAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGE 234 (339) Q Consensus 155 ~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~ 234 (339) ...|.-..........+........|+.+.++...|..... ....| |++|..+..|.+-.. .+..|--. . T Consensus 239 ---~p~Gil~~~~~~~~~~~~~~~~~~~~d~l~~~~~~l~~~~~---~~a~~-v~n~~~~~~l~~lkd-~~G~~l~~--~ 308 (392) T protein:vir:13 239 ---QPRGILTDATGANAAFGEADADSKVSDALIDLFHEVPSAYR---KNAKF-VVNDLRAAQMRKLKD-ANGQYLWQ--S 308 (392) T ss_pred ---cccccccccccccccccccccccccHHHHHHHHHhhhhhhh---cCCEE-EEcHHHHHHHHHhhc-cCCceeec--C Confidence 01111111100000111111112235566666666654422 12344 789999998864211 11111111 1 Q ss_pred eeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhH-- Q lcl|NC_020078. 235 TLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKL-- 312 (339) Q Consensus 235 ~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~-- 312 (339) .+..|.-.+++|.+|+.++++|.... +-++|+. ...+...++.++..++..... T Consensus 309 ~~~~g~~~~l~G~Pv~~~~~~~~~~i--------------~~Gdf~~----------~~i~~~~~~~i~~~~~~~~~~~~ 364 (392) T protein:vir:13 309 ALTVGAPDTFNGKVVETDDGMPADKV--------------LFADLSK----------YRVRFAGSLRVDRSVDAKFSTDQ 364 (392) T ss_pred CcCCCCCceecceeeEEcCCCCCCcE--------------EEeeccc----------eeEEeecceEEEeeccccccCCc Confidence 23456667899999999999984311 1123322 222233344445444433222 Q ss_pred HHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 313 WFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 313 d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) ..+++..-+|.++++|++.+.+++++| T Consensus 365 ~~~r~~~r~d~~~~~~~A~~~~~~~~a 391 (392) T protein:vir:13 365 IVYRFLQRADGLLVDARGAKVLTVTPA 391 (392) T ss_pred EEEEEEEEeccEEecccceEEEEeecc Confidence 346788899999999999999999999 No 74 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=99.36 E-value=4.2e-14 Score=93.93 Aligned_cols=296 Identities=12% Similarity=0.074 Sum_probs=164.5 Q ss_pred Cccc----------c---CcccCC----CcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccc-eE Q lcl|NC_020078. 1 MSIF----------D---GQTPSY----DVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTS-TI 62 (339) Q Consensus 1 ~~~~----------~---~~~~~~----~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~-tv 62 (339) +..+ . ..|..+ ...+.+.. ..++--.+.-+.|+.++.+..+..+.+++++++..+.++. ++ T Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 167 (415) T protein:vir:81 89 INDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSL-KTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKY 167 (415) T ss_pred HHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccc-cccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeE Confidence 0000 0 000000 00000000 0011112344899999999999999999999988776432 34 Q ss_pred EEe-ccccceeeeccCCCCCCCCCCCCccceEEEEeehhhhhh-hHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020078. 63 SNR-GISKAKLQKIAPGTTPPPSTEPHTSKIFLKIDTVIIARN-AEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFI 140 (339) Q Consensus 63 ~i~-~iG~~t~~~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~-~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~ 140 (339) .++ ..+...+.....|.+++....+..++.++.+- ++..+ .|.+-=-.++.+|+.+.+.++.++++++..|+.++. T Consensus 168 ~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~--k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~ 245 (415) T protein:vir:81 168 PVVRQSEVAALEKVEELEENPELAVKPFFQLAYDIN--THRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIID 245 (415) T ss_pred EEEeecCCccceeeccccccCcccccceeeEEeeee--eeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 444 34556666666677776433344555555553 33332 222211123578899999999999999999998853 Q ss_pred HHHhhcccccccccccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcc Q lcl|NC_020078. 141 MAAKAAIASDSPYGTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQA 220 (339) Q Consensus 141 ~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~ 220 (339) -.-.+ . ..+....... ...+...+....|+.|.++..++...+.. .. .+|++|..|..|.+- T Consensus 246 g~g~g----~--------~~~~~~~~~~--~~~~~~~~~~~~~~~i~~~~~~~~~~~~~---~~-~~v~n~~~~~~l~~l 307 (415) T protein:vir:81 246 VITKG----S--------TGSTSSGFEK--EGKKLEVKKAKSLDDIKDAINLNVKPNYE---HN-VAIVSQTMFAKLDKM 307 (415) T ss_pred ccccC----c--------cccccccccc--cccccccccccchhHHHHHHHhhhhhccC---CC-EEEEcHHHHHHHHHh Confidence 22110 0 0000000000 00111112223356677777777665552 13 458899999999652 Q ss_pred cchhhhcccccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeee Q lcl|NC_020078. 221 EHITNGEYVTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPV 300 (339) Q Consensus 221 ~~~~n~d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~ 300 (339) .. .+..|.-.. .+.+|...+++|++|+.+++.|.... ++..-+-++| +.++......++ T Consensus 308 kd-~~G~~l~~~--~~~~~~~~~l~G~pV~~~~~~~~~~~---------~~~~~~~Gd~---------~~~~~~~~~~~~ 366 (415) T protein:vir:81 308 KD-KLGNYLIQP--DVKEKTQQRLLGAKIEILPDEVLGQK---------GNNTLIIGNL---------KDAIVLFDRSQY 366 (415) T ss_pred hc-cCCceeecc--CcCCCCCceecceeeEEecccccCCC---------CccEEEEEeh---------hccEEEEeecce Confidence 11 111222111 23466777899999999998874321 1111122222 223444555556 Q ss_pred eEEeeechhhhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 301 TSKIFFDDLSKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 301 ~~e~~~~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) +++..+. ..+...+++.+-++.++++|++++.++++++ T Consensus 367 ~v~~~~~-~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:81 367 QASWTDY-MHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred EEEEecc-ccCceEEEEEEEeccEEeccccEEEEEEecc Confidence 6665543 3445567888999999999999999999988 No 75 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=99.36 E-value=4.2e-14 Score=93.93 Aligned_cols=296 Identities=12% Similarity=0.074 Sum_probs=164.5 Q ss_pred Cccc----------c---CcccCC----CcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccc-eE Q lcl|NC_020078. 1 MSIF----------D---GQTPSY----DVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTS-TI 62 (339) Q Consensus 1 ~~~~----------~---~~~~~~----~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~-tv 62 (339) +..+ . ..|..+ ...+.+.. ..++--.+.-+.|+.++.+..+..+.+++++++..+.++. ++ T Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 167 (415) T protein:vir:79 89 INDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSL-KTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKY 167 (415) T ss_pred HHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccc-cccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeE Confidence 0000 0 000000 00000000 0011112344899999999999999999999988776432 34 Q ss_pred EEe-ccccceeeeccCCCCCCCCCCCCccceEEEEeehhhhhh-hHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020078. 63 SNR-GISKAKLQKIAPGTTPPPSTEPHTSKIFLKIDTVIIARN-AEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFI 140 (339) Q Consensus 63 ~i~-~iG~~t~~~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~-~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~ 140 (339) .++ ..+...+.....|.+++....+..++.++.+- ++..+ .|.+-=-.++.+|+.+.+.++.++++++..|+.++. T Consensus 168 ~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~--k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~ 245 (415) T protein:vir:79 168 PVVRQSEVAALEKVEELEENPELAVKPFFQLAYDIN--THRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIID 245 (415) T ss_pred EEEeecCCccceeeccccccCcccccceeeEEeeee--eeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 444 34556666666677776433344555555553 33332 222211123578899999999999999999998853 Q ss_pred HHHhhcccccccccccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcc Q lcl|NC_020078. 141 MAAKAAIASDSPYGTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQA 220 (339) Q Consensus 141 ~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~ 220 (339) -.-.+ . ..+....... ...+...+....|+.|.++..++...+.. .. .+|++|..|..|.+- T Consensus 246 g~g~g----~--------~~~~~~~~~~--~~~~~~~~~~~~~~~i~~~~~~~~~~~~~---~~-~~v~n~~~~~~l~~l 307 (415) T protein:vir:79 246 VITKG----S--------TGSTSSGFEK--EGKKLEVKKAKSLDDIKDAINLNVKPNYE---HN-VAIVSQTMFAKLDKM 307 (415) T ss_pred ccccC----c--------cccccccccc--cccccccccccchhHHHHHHHhhhhhccC---CC-EEEEcHHHHHHHHHh Confidence 22110 0 0000000000 00111112223356677777777665552 13 458899999999652 Q ss_pred cchhhhcccccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeee Q lcl|NC_020078. 221 EHITNGEYVTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPV 300 (339) Q Consensus 221 ~~~~n~d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~ 300 (339) .. .+..|.-.. .+.+|...+++|++|+.+++.|.... ++..-+-++| +.++......++ T Consensus 308 kd-~~G~~l~~~--~~~~~~~~~l~G~pV~~~~~~~~~~~---------~~~~~~~Gd~---------~~~~~~~~~~~~ 366 (415) T protein:vir:79 308 KD-KLGNYLIQP--DVKEKTQQRLLGAKIEILPDEVLGQK---------GNNTLIIGNL---------KDAIVLFDRSQY 366 (415) T ss_pred hc-cCCceeecc--CcCCCCCceecceeeEEecccccCCC---------CccEEEEEeh---------hccEEEEeecce Confidence 11 111222111 23466777899999999998874321 1111122222 223444555556 Q ss_pred eEEeeechhhhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 301 TSKIFFDDLSKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 301 ~~e~~~~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) +++..+. ..+...+++.+-++.++++|++++.++++++ T Consensus 367 ~v~~~~~-~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:79 367 QASWTDY-MHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred EEEEecc-ccCceEEEEEEEeccEEeccccEEEEEEecc Confidence 6665543 3445567888999999999999999999988 No 76 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=99.36 E-value=4.2e-14 Score=93.93 Aligned_cols=296 Identities=12% Similarity=0.074 Sum_probs=164.5 Q ss_pred Cccc----------c---CcccCC----CcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccc-eE Q lcl|NC_020078. 1 MSIF----------D---GQTPSY----DVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTS-TI 62 (339) Q Consensus 1 ~~~~----------~---~~~~~~----~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~-tv 62 (339) +..+ . ..|..+ ...+.+.. ..++--.+.-+.|+.++.+..+..+.+++++++..+.++. ++ T Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 167 (415) T protein:vir:98 89 INDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSL-KTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKY 167 (415) T ss_pred HHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccc-cccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeE Confidence 0000 0 000000 00000000 0011112344899999999999999999999988776432 34 Q ss_pred EEe-ccccceeeeccCCCCCCCCCCCCccceEEEEeehhhhhh-hHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020078. 63 SNR-GISKAKLQKIAPGTTPPPSTEPHTSKIFLKIDTVIIARN-AEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFI 140 (339) Q Consensus 63 ~i~-~iG~~t~~~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~-~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~ 140 (339) .++ ..+...+.....|.+++....+..++.++.+- ++..+ .|.+-=-.++.+|+.+.+.++.++++++..|+.++. T Consensus 168 ~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~--k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~ 245 (415) T protein:vir:98 168 PVVRQSEVAALEKVEELEENPELAVKPFFQLAYDIN--THRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIID 245 (415) T ss_pred EEEeecCCccceeeccccccCcccccceeeEEeeee--eeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 444 34556666666677776433344555555553 33332 222211123578899999999999999999998853 Q ss_pred HHHhhcccccccccccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcc Q lcl|NC_020078. 141 MAAKAAIASDSPYGTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQA 220 (339) Q Consensus 141 ~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~ 220 (339) -.-.+ . ..+....... ...+...+....|+.|.++..++...+.. .. .+|++|..|..|.+- T Consensus 246 g~g~g----~--------~~~~~~~~~~--~~~~~~~~~~~~~~~i~~~~~~~~~~~~~---~~-~~v~n~~~~~~l~~l 307 (415) T protein:vir:98 246 VITKG----S--------TGSTSSGFEK--EGKKLEVKKAKSLDDIKDAINLNVKPNYE---HN-VAIVSQTMFAKLDKM 307 (415) T ss_pred ccccC----c--------cccccccccc--cccccccccccchhHHHHHHHhhhhhccC---CC-EEEEcHHHHHHHHHh Confidence 22110 0 0000000000 00111112223356677777777665552 13 458899999999652 Q ss_pred cchhhhcccccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeee Q lcl|NC_020078. 221 EHITNGEYVTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPV 300 (339) Q Consensus 221 ~~~~n~d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~ 300 (339) .. .+..|.-.. .+.+|...+++|++|+.+++.|.... ++..-+-++| +.++......++ T Consensus 308 kd-~~G~~l~~~--~~~~~~~~~l~G~pV~~~~~~~~~~~---------~~~~~~~Gd~---------~~~~~~~~~~~~ 366 (415) T protein:vir:98 308 KD-KLGNYLIQP--DVKEKTQQRLLGAKIEILPDEVLGQK---------GNNTLIIGNL---------KDAIVLFDRSQY 366 (415) T ss_pred hc-cCCceeecc--CcCCCCCceecceeeEEecccccCCC---------CccEEEEEeh---------hccEEEEeecce Confidence 11 111222111 23466777899999999998874321 1111122222 223444555556 Q ss_pred eEEeeechhhhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 301 TSKIFFDDLSKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 301 ~~e~~~~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) +++..+. ..+...+++.+-++.++++|++++.++++++ T Consensus 367 ~v~~~~~-~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:98 367 QASWTDY-MHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred EEEEecc-ccCceEEEEEEEeccEEeccccEEEEEEecc Confidence 6665543 3445567888999999999999999999988 No 77 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=99.35 E-value=3.6e-14 Score=94.31 Aligned_cols=301 Identities=14% Similarity=0.085 Sum_probs=155.0 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEec-cccceeeeccCCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRG-ISKAKLQKIAPGT 79 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~-iG~~t~~~~~~g~ 79 (339) ...|.+...+....|--..+..++--.+..+.|+.++.+..+..+++++++++..+.+|+ .++++ .+.+++.-...|+ T Consensus 114 ~~af~~~l~~~e~~~al~~~t~~~gG~lvP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~~~~~~~~~a~wv~E~~ 192 (425) T protein:vir:10 114 TEAFKAHVKRGDVQAALNKGEDSEGGYLTPIEWDRTITNKLVLISPMRQLCRVQPVSKAG-FSKLFNMGGTTSGWVGEAS 192 (425) T ss_pred HHHHHHHhhhhhhHHHhhcCcCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeeccCCc-eEEEEEcCCcceeeecccc Confidence 111222111111111100011111111445999999999999999999999887776554 44544 4666666666666 Q ss_pred CCCCCCCCCccceEEEEeehhhhhh-hHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccc Q lcl|NC_020078. 80 TPPPSTEPHTSKIFLKIDTVIIARN-AEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQ 158 (339) Q Consensus 80 ~i~~~~~~~~~~~~l~ID~~~y~~~-~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~ 158 (339) .++....+...+.++.. .++..+ .|.+-=--++.+|+.+.+.++.++++++..|+.++. +.....|. +. T Consensus 193 ~~~~~~~~~f~~v~~~~--~k~~~~i~iS~ell~ds~~~l~~~i~~~la~ai~~~~d~~~l~----G~G~~~p~----Gi 262 (425) T protein:vir:10 193 QRPQTNAATFQPLSFAS--GEIYANPAATQQILDDAEIDLESWLATEVQTEFAKQEGKAFLA----GDGTNKPN----GL 262 (425) T ss_pred ccccccccccceeeeeh--eeeEeehHhHHHHHhcchhHHHHHHHHHHHHHHHHHHHhhhhc----ccCCCCcc----ee Confidence 66543333444445543 333322 222211113568999999999999999999997752 11111110 00 Q ss_pred ccCcccccccc------ccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccc Q lcl|NC_020078. 159 MPGHSGGNVVT------LAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSA 232 (339) Q Consensus 159 ~~g~~~~~~~~------~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~ 232 (339) ......+.... ....+........++.|+++...|...... +-..|++|..|..|.+-..- +..|--. T Consensus 263 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~~~----~a~~vmn~~~~~~L~~lkD~-~G~~l~~- 336 (425) T protein:vir:10 263 LTYIAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVYDLPSAFTG----NARFAMNRNTQRQVRKLKDG-QGNYLWQ- 336 (425) T ss_pred eeccccccccccccccccccccccccccccHHHHHHHHhhhhhhhcc----CCEEEEchHHHHHHHHhhcC-CCceeec- Confidence 00000000000 000001111122345566666666554331 22458999999998652211 1111111 Q ss_pred cceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechh--h Q lcl|NC_020078. 233 GETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDL--S 310 (339) Q Consensus 233 ~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~--~ 310 (339) ..+.+|.-++++|.+|+.++++|........ -+-++|+. ++..+.-+.+ ++.+++. + T Consensus 337 -~~~~~g~~~~l~G~PV~~~~~~p~~~~~~~~---------i~~Gd~~~---------~~~i~~~~~~--~v~~d~~~~~ 395 (425) T protein:vir:10 337 -PSYVAGQPATLAGYPVTEVPDMPDVAANSTP---------ILFGDFQQ---------TYLIIDRIGV--RVLRDPYTAK 395 (425) T ss_pred -cCccCCCCceecceeeEEecCcCCccCCccE---------EEEEehhc---------cEEEEEecce--EEEecccccC Confidence 1244566678999999999999843211110 11123332 2222222222 2333321 1 Q ss_pred hHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 311 KLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 311 ~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) ---.+++..-++.++++|++.+.|++.+| T Consensus 396 ~~~~~~~~~r~d~~v~~~~A~~~l~~~as 424 (425) T protein:vir:10 396 PYVLFYTTKRVGGGLLNPEPMRAMKVAAS 424 (425) T ss_pred CcEEEEEEEEeccEeecccceEEEEeecc Confidence 11235566789999999999999999999 No 78 >protein:vir:95451 Length: 313 # NCBI annotation: hypothetical protein ORF044 # Family: family:all:11728 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294637;genbank:gi:149408203;genbank:GeneID:5237018 Probab=99.35 E-value=8.3e-15 Score=97.77 Aligned_cols=299 Identities=17% Similarity=0.164 Sum_probs=174.8 Q ss_pred cCcccchhHHHH-HHHHHHHHHHHHHHhhhccccc-cccccccceEEEeccccceeeeccCCCCCCCCCCCCccceEEEE Q lcl|NC_020078. 19 RHGAGDPLADVT-EQFTGTVEGTIKRRSIMAGFVP-VRSVRGTSTISNRGISKAKLQKIAPGTTPPPSTEPHTSKIFLKI 96 (339) Q Consensus 19 ~~~~~~~~a~~i-e~~~g~v~~~f~~~sv~~~~v~-~r~i~~G~tv~i~~iG~~t~~~~~~g~~i~~~~~~~~~~~~l~I 96 (339) -+-..+..|+.. |+|+.+++-.+..+.+-.++-+ +-+.-.|++.+|+.+|.++++..-..+++... .+++.|.++.| T Consensus 1 ~~~TSNT~A~I~SE~~s~~I~~~LH~~LL~~~~~R~V~DF~~G~~L~I~tiGs~~~~~~~E~~~~~~~-~i~TGEIt~~i 79 (313) T protein:vir:95 1 MQLTSNTRAFIESEQYSKFILLNLHDGLLPETFYRNVSDFGSGETLHIKTIGSVTLQEAEEDTPLIYN-PIETGEITFQI 79 (313) T ss_pred CcccccchheehhhhHHHHHHHHhhccccchhhhhhhccCCCCCEEEecccCceeeeccccCCCeeec-ccccceEEEEE Confidence 222333344333 9999999988887766555544 34556799999999999999999988998875 57888888888 Q ss_pred eehhhhhhh--H-HHHHHHhcCcc-hHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCccccccccccC Q lcl|NC_020078. 97 DTVIIARNA--E-PMLDEFQTDFD-YQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAG 172 (339) Q Consensus 97 D~~~y~~~~--v-dd~D~~q~~~d-~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~ 172 (339) -. |.... | +|+-+--..+| ++.+...|.+.|+-+.+-...+..- ++.++. .--+-++.|..--.+ T Consensus 80 ~~--Y~G~A~~vt~~LR~D~~~I~~~~A~~~AE~~RAI~E~~~TD~L~~G--~~~FA~--~~~P~~vNG~PH~~V----- 148 (313) T protein:vir:95 80 TE--YKGDAWYVTDDLREDGTDIDRLMAERAAESTRAIQETFETDFLKTG--AEYFAA--NPGPHNVNGFPHVIV----- 148 (313) T ss_pred Ee--ecCChhhhhhhhhhcchhHHHHhhhcchhhHHHHHHHHhhHHHhhc--hhhhcc--CCCCcccccccceEE----- Confidence 65 54443 3 44444334454 7888888999999988877664221 122211 001111222221111 Q ss_pred ccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhh----h-cccccccceee-cceeEEEec Q lcl|NC_020078. 173 ANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITN----G-EYVTSAGETLN-TKYMFAAFG 246 (339) Q Consensus 173 ~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n----~-d~~~~~~~~l~-~G~v~~i~G 246 (339) ++.++..-.++-+....-.+++.++ |.+||+.||+|..-.-|-.-..+.+ - .+.-..+ ..+ ...|.++.| T Consensus 149 -~~~T~~~~~~~~~~~~~~~~~~a~~--P~~G~v~IvDP~~~~~L~~l~~It~~vt~~~k~I~ESG-~A~~~~Fi~~~YG 224 (313) T protein:vir:95 149 -SAETNGVFALKHLIAMRLAFDKANV--PAEGRVFIVDPVAEATLNGLVTITHDVTDFGKMILESG-MARGQRFIMNLYG 224 (313) T ss_pred -eccCCceehhhHHHHhhhhhhhccC--CccceEEEEcchhhhhhhhhheeecccccccceeeecc-CCchhHHHHHHhh Confidence 1223333334456667778899999 8899999999998877754333322 1 0110111 011 135677899 Q ss_pred eEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhHHHHHHHHHhCCccc Q lcl|NC_020078. 247 VPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWFIDSWLAFGVTIN 326 (339) Q Consensus 247 ~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~i~g~~~~Ga~v~ 326 (339) .++++||.|-.. +-+.....+ +.|-+..=+.+.-+-.+--++.=+-++ ++|.++++.+--+--.-...||-++. T Consensus 225 ~Di~~SN~L~~A-N~~D~~tT~----~G~~~NlFM~i~D~~~~P~~~AWr~MP-~s~~~~~~~~~~~~~~~~~R~G~Gi~ 298 (313) T protein:vir:95 225 WDILTSNRLHVA-NYNDGTTTG----NGYVGNLFMCILDDQTKPIMGAWRRMP-KSEGERNKDRARDEHVVRCRYGFGIQ 298 (313) T ss_pred hhhhhhhhhhhc-ccccccccc----Cceeeeeeeeeecccccceeeeecccc-ccccccccccccccceeeeeecccce Confidence 999999988532 211111111 112111001000000111223333343 66777776655555555677999999 Q ss_pred cccceEEEEecCC Q lcl|NC_020078. 327 RTEYAGVIKLPAA 339 (339) Q Consensus 327 rPe~~v~i~~~~a 339 (339) |-|.++++-+.+- T Consensus 299 R~~~L~~~~~~A~ 311 (313) T protein:vir:95 299 RLDTLGLLATSAT 311 (313) T ss_pred eecceeEEEeccc Confidence 9999999877666 No 79 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=99.35 E-value=6.1e-14 Score=93.04 Aligned_cols=289 Identities=11% Similarity=0.036 Sum_probs=168.7 Q ss_pred CccccC---c--ccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccc--cceee Q lcl|NC_020078. 1 MSIFDG---Q--TPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGIS--KAKLQ 73 (339) Q Consensus 1 ~~~~~~---~--~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG--~~t~~ 73 (339) +..+.. . .+--...+......+++.-.+..+.++.++.+..+..+.+++++++..+. +.++.+++.. ..++. T Consensus 92 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~~~~-~~~~~~~~~~~~~~~a~ 170 (390) T protein:vir:97 92 TGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTD-SALIEYVQETGFVNNAA 170 (390) T ss_pred HHHhhhhhhhhhhHHHHHHHhhhcccccccccccchhhhHHHHHHHhhhhhhHhhcceeecc-CCceEEEEEecCCccee Confidence 100000 0 00011222223333444444666889999999999999999998877765 4467777763 35677 Q ss_pred eccCCCCCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Q lcl|NC_020078. 74 KIAPGTTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPY 153 (339) Q Consensus 74 ~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~ 153 (339) ....|+.++.. .+...+.++.....- .-..|.+ .-.+...++.+.+.++.+.++++..|+.++. +... T Consensus 171 ~v~Eg~~~~~~-~~~~~~i~~~~~k~~-~~~~is~-ell~ds~~l~~~i~~~la~a~~~~~d~a~l~----G~g~----- 238 (390) T protein:vir:97 171 IVAEGALKPES-SLKFAKKTDTTHVIA-HTMKATR-QILSDAPQLASYMNNRLIRGLKVKEDAEILR----GTGA----- 238 (390) T ss_pred eecCCcccccc-ccceeEEEEeeeeEE-EeehhhH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHHhh----cCCC----- Confidence 77788887754 456677777765322 1222322 1122235788999999999999999998752 1111 Q ss_pred cccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhccccccc Q lcl|NC_020078. 154 GTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAG 233 (339) Q Consensus 154 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~ 233 (339) +....|..... ..............++.+.++..++.....+. . .+|++|..|..|.+-.. .+..|.-.. T Consensus 239 --~~~p~Gi~~~~--~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~---~-~~v~n~~~~~~L~~lkd-~~G~~l~~~- 308 (390) T protein:vir:97 239 --NDGLLGLIPQA--TTYAAPTTIAGATRVDQLRLAMLQASLAEYPA---S-GIVINPIDWAAIELAKD-ANNQYLIGN- 308 (390) T ss_pred --Cccccceeecc--ccccccccccccchHHHHHHHHHhhccccCCC---C-EEEEcHHHHHHHHHhhc-CCCceeecC- Confidence 11111211110 00111111222344667778888887776632 2 45789999999875321 111121111 Q ss_pred ceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhh-hH Q lcl|NC_020078. 234 ETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLS-KL 312 (339) Q Consensus 234 ~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~-~~ 312 (339) ..++...+++|.+|++|+.+|.+.. +-++| +.++..+...+++++.+++... .. T Consensus 309 --~~~~~~~~l~G~pV~~~~~~~~~~~--------------~~gd~---------~~~~~~~~~~~~~i~~~~~~~~f~~ 363 (390) T protein:vir:97 309 --ARGTLTPTLWGLPVVATQAMAPGEF--------------LVGAF---------DLAAQIFDQWDARVEIGYVNDDFQR 363 (390) T ss_pred --ccCCCCceecceeeEEcCCCCCCcE--------------EEEec---------cceEEEEEecceEEEEeeccccccc Confidence 1244556899999999999984311 11122 2244455566777787765433 33 Q ss_pred HH--HHHHHHhCCccccccceEEEEec Q lcl|NC_020078. 313 WF--IDSWLAFGVTINRTEYAGVIKLP 337 (339) Q Consensus 313 d~--i~g~~~~Ga~v~rPe~~v~i~~~ 337 (339) +. +++.+.||.++++|++++.+.+. T Consensus 364 ~~~~~r~~~r~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 364 NMVTVLAEERLALVVYRPEALITGSFA 390 (390) T ss_pred CcEEEEEEEeeccEEeccccEEEEEeC Confidence 43 56667899999999999999988 No 80 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=99.34 E-value=1.4e-13 Score=90.98 Aligned_cols=291 Identities=11% Similarity=0.012 Sum_probs=160.9 Q ss_pred cccCc-ccCC--CcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc-ccceeeeccCC Q lcl|NC_020078. 3 IFDGQ-TPSY--DVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI-SKAKLQKIAPG 78 (339) Q Consensus 3 ~~~~~-~~~~--~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i-G~~t~~~~~~g 78 (339) .+.|+ |+.- .+.+..- ++.-.+..+.++.++.+..++.+++++++++..+. |.+.+||+. +.+.+.-...| T Consensus 1 ~~~~~~~~~~~~~~~~t~~----~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~~p~~~~~~~a~~v~E~ 75 (320) T protein:vir:10 1 MAAGTAFQVDHAQIAQTGD----TMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMG-TTGQKIPHWIGDVSAQWIGEG 75 (320) T ss_pred CCCCccCCHHHHHhhcccc----ccccccccHHHHHHHHHHHHhccchhhhcceeecc-CCceEEEEEeCCcceEEecCC Confidence 55665 4432 2222211 11112455889999999999999999998876655 456788876 56677777888 Q ss_pred CCCCCCCCCCccceEEEEeehhhhhh-hHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Q lcl|NC_020078. 79 TTPPPSTEPHTSKIFLKIDTVIIARN-AEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAA 157 (339) Q Consensus 79 ~~i~~~~~~~~~~~~l~ID~~~y~~~-~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~ 157 (339) ++++.. .+..++.++.+- ++..+ .|.+-=--++.+|+.+.+.++.++++++.+|+.++. +.... .+. T Consensus 76 ~~~~~~-~~~f~~v~~~~~--k~~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~----G~g~~-----~~~ 143 (320) T protein:vir:10 76 DMKPIT-KGNMTSQNIAPH--KIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALN----GTDSP-----FPT 143 (320) T ss_pred cccccc-ccceeEEEEeeE--EEEEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhc----ccCCC-----CCc Confidence 888754 456666666553 33332 222211113568999999999999999999998752 11100 000 Q ss_pred cccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhc--cc--chhhh-cccccc Q lcl|NC_020078. 158 QMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQ--AE--HITNG-EYVTSA 232 (339) Q Consensus 158 ~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~--~~--~~~n~-d~~~~~ 232 (339) ...+...+......+.....+-..+-+.+.++...+...+.+ .-..+++|..|..|.+ +. +.+-. ...+.. T Consensus 144 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~ 219 (320) T protein:vir:10 144 YLAQTTKSVSLADPGGATASDLTAYDAVAVNGLSLLVNAKKK----WTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDE 219 (320) T ss_pred ccccccccccceecccccccccccHHHHHHHHHhhhhcccCC----CcEEEEcHHHHHHHHHhhccCCceeeccccccCc Confidence 111111111111111111111112223355566666555442 2366899999999965 21 11100 001111 Q ss_pred cceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechh--- Q lcl|NC_020078. 233 GETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDL--- 309 (339) Q Consensus 233 ~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~--- 309 (339) ...+ .-++++|++|+.|+++|.+.. .+++.+.+-+..+...++..++.++.. T Consensus 220 ~~~~---~~~~i~g~pv~~~~~~~~~~~----------------------~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~ 274 (320) T protein:vir:10 220 NSPF---RAGRIVSRPTILSDHVADGTT----------------------VGYMGDFRNVIWGQVGGLSFDVTDQATLNL 274 (320) T ss_pred cccc---cCceeeeeeeEecCCCCCCce----------------------EEEEeecceEEEEEecCeEEEEeecceeee Confidence 1111 224689999999999874310 112222222223444455555554431 Q ss_pred ----------hh---HHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 310 ----------SK---LWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 310 ----------~~---~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .| .-.+++.+-+|.+++||++.+.|+--+| T Consensus 275 ~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~~~~a~~~l~~~~a 317 (320) T protein:vir:10 275 GTPTEPNFVSLWQHNLVAVRVEAEYAFHNNDKDAFVKLTNVVT 317 (320) T ss_pred ccccccccchhhhcCcEEEEEEEeeccEEecccceEEEEeccC Confidence 01 1235677889999999999999986666 No 81 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=99.34 E-value=1.4e-13 Score=91.08 Aligned_cols=298 Identities=11% Similarity=0.034 Sum_probs=152.7 Q ss_pred CccccCcccCCCcccC-CccCcccchhHHHHHHHHHHHH-HHHHHHhhhccccccccccccceEEEec-cccceeeeccC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRP-NQRHGAGDPLADVTEQFTGTVE-GTIKRRSIMAGFVPVRSVRGTSTISNRG-ISKAKLQKIAP 77 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~-~~~~~~~~~~a~~ie~~~g~v~-~~f~~~sv~~~~v~~r~i~~G~tv~i~~-iG~~t~~~~~~ 77 (339) ..+.....-.+...+. +....+|. .+..+.|+.++. ..+...++++.+.++-.. .|+ +.+++ .+.+.+..... T Consensus 235 ~~l~~~e~~~~~~~~~~~~t~~~gg--~lip~~~~~~ii~~~~~~~~~l~~~~~~~~~-~g~-~~~~~~~~~~~a~~v~E 310 (543) T protein:vir:81 235 AILTEEEKRAINEVRAMGLTKADGG--YLVPFQLDPTVIITSNGSLNDIRRFARQVVA-TGD-VWHGVSSAAVQWSWDAE 310 (543) T ss_pred HHhhhhhhhhhhhhhhcccccccCc--ccCchhhhhHHHHHHHhhhchhhhhcccccC-Ccc-eEEEEecCCcceeeccc Confidence 0000111111111111 00111111 134477777765 556677888888775433 344 44554 46667777778 Q ss_pred CCCCCCCCCCCccceEEEEeehhhhhh-hHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Q lcl|NC_020078. 78 GTTPPPSTEPHTSKIFLKIDTVIIARN-AEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTA 156 (339) Q Consensus 78 g~~i~~~~~~~~~~~~l~ID~~~y~~~-~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~ 156 (339) |+.++.. .+...+.++... ++..+ .|.+ +-.+...|+.+.+.++++.++++..|+.|+ .+... + T Consensus 311 g~~~~~~-~~~~~~i~~~~~--k~~~~~~is~-ell~d~~~~~~~i~~~l~~~~~~~~d~ail----~G~Gt-------~ 375 (543) T protein:vir:81 311 FEEVSDD-SPEFGQPEIPVK--KAQGFVPISI-EALQDEANVTETVALLFAEGKDELEAVTLT----TGTGQ-------G 375 (543) T ss_pred Ccccccc-ccccceeeeeee--eeEeeehhhH-HHHhccHHHHHHHHHHHHHHHHHHHHHHHh----ccCCC-------C Confidence 8887654 456666666554 32222 2222 223345799999999999999999999875 12111 1 Q ss_pred ccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhccccccccee Q lcl|NC_020078. 157 AQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETL 236 (339) Q Consensus 157 ~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l 236 (339) ....|.-..........+........++.+.++...|...+- + .-.+|++|..|..|.+-.. .+..|.-. .+ T Consensus 376 ~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~--~--~~~~v~n~~~~~~l~~lkd-~~G~~l~~---~~ 447 (543) T protein:vir:81 376 NQPTGIVTALAGTAAEIAPVTAETFALADVYAVYEQLAARHR--R--QGAWLANNLIYNKIRQFDT-QGGAGLWT---TI 447 (543) T ss_pred cccccchhhcccccccccccccccccHHHHHHHHHhhhcccc--C--CcEEEEcHHHHHHHHHhhc-CCCceecc---Cc Confidence 111111110000000111111122234556666666654432 2 2256899999999975221 11112211 13 Q ss_pred ecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhh----- Q lcl|NC_020078. 237 NTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSK----- 311 (339) Q Consensus 237 ~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~----- 311 (339) .+|..++++|.+|+.++++|..... ....++..-|-++| +-+..+...+++++...+-... T Consensus 448 ~~g~~~~l~G~pv~~~~~~~~~~~~----~~~~~~~~i~~gd~----------~~~~i~~~~~~~i~~~~~~~~~~~~~~ 513 (543) T protein:vir:81 448 GNGEPSQLLGRPVGEAEAMDANWNT----SASADNFVLLYGNF----------QNYVIADRIGMTVEFIPHLFGTNRRPN 513 (543) T ss_pred CCCCCccccceeeEEeccccccccc----cccCCcceEEEeec----------cceeEEeecccEEEEeccccccchhhc Confidence 3555678999999999999854321 11112222222233 3333333344444433221111 Q ss_pred -HHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 312 -LWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 312 -~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .-.+++.+-+|.++++|++.+.+++.+| T Consensus 514 ~~~~~~~~~r~d~~v~~~~A~~~l~~~~~ 542 (543) T protein:vir:81 514 GSRGWFAYYRMGADVVNPNAFRLLNVETA 542 (543) T ss_pred CceEEEEEEeeccEeecccceEEEEeccc Confidence 1234555667999999999999999998 No 82 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=99.34 E-value=4.3e-14 Score=93.88 Aligned_cols=287 Identities=14% Similarity=0.076 Sum_probs=156.0 Q ss_pred Cccc---------------c---CcccCC--CcccCCccCcccchhHHHH-HHHHHHHHHHHHHHhhhcccccccccccc Q lcl|NC_020078. 1 MSIF---------------D---GQTPSY--DVTRPNQRHGAGDPLADVT-EQFTGTVEGTIKRRSIMAGFVPVRSVRGT 59 (339) Q Consensus 1 ~~~~---------------~---~~~~~~--~~~r~~~~~~~~~~~a~~i-e~~~g~v~~~f~~~sv~~~~v~~r~i~~G 59 (339) .... . +..-.. ..... ....+++.. +.+ ++++..+....+..++++++.++....+| T Consensus 76 ~~~~~~~~~~~~~~~~~~r~~~~~~~r~~~~~~~~~-~~t~~~~g~-~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~ 153 (390) T protein:vir:62 76 GSGSGAQRSADVDDDATLRAGNLGEARSFEFAPEKR-DGTKAGNPN-VLSRTLYGQLIAQAVERSAIMRGGATTFTTSDA 153 (390) T ss_pred cccccchhhcchHHHHHHhhhhhhhhHHHHhhhhhh-cccccCCCc-cccccchHHHHHHHHhhhhhhhhcceeeecCCC Confidence 0000 0 000000 00000 001111111 233 55556666666678889988887777777 Q ss_pred ceEEEecc-ccceeeeccCCCCCCCCCCCCccceEEEEeehhhhhh-hHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHH Q lcl|NC_020078. 60 STISNRGI-SKAKLQKIAPGTTPPPSTEPHTSKIFLKIDTVIIARN-AEPMLDEFQTDFDYQGEVAREQGQEIANMYDET 137 (339) Q Consensus 60 ~tv~i~~i-G~~t~~~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~-~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~ 137 (339) +.+.||+. |.+.+.....|+.++.. .+...+.++..- ++..+ .|.+-=--++.+|+.+.+.++.++++++..|+. T Consensus 154 ~~~~~p~~~~~~~a~wv~E~~~~~~~-~~~f~~i~~~~~--k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~ 230 (390) T protein:vir:62 154 NPLDFTVITGRSSASIVGETAEIPES-YPATAQRSMGGF--KYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRH 230 (390) T ss_pred ceeEEEEEcCCcceeeeccccccccc-ccceeeeEeeee--eEEeehHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhh Confidence 88889877 55677777778888764 456777777664 33332 222111113678999999999999999999998 Q ss_pred HHHHHHhhcccccccccccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHH Q lcl|NC_020078. 138 FFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTAL 217 (339) Q Consensus 138 i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~L 217 (339) ++. +.. .| .+.......+......+..+..+ ++.+.++...|..... . +-..|++|..|..| T Consensus 231 ~l~----G~G--~p----~Gi~~~~~~~~~~~~~~~~~~~~----~~~l~~~~~~l~~~~~--~--~a~~vmn~~~~~~L 292 (390) T protein:vir:62 231 FIT----GTG--QP----RGILTDASPATATFLATDTDSKV----SDALIDLFHEVPSAYR--A--NAKYVVNDLRAAQM 292 (390) T ss_pred hhc----cCC--cc----ccccccccccccceecccccccc----hHHHHHHHHhhhhhhh--c--CCEEEEchHHHHHH Confidence 751 111 00 01111111111111111111222 4455566666655433 1 22458899999998 Q ss_pred hc--ccchhhhcccccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEE Q lcl|NC_020078. 218 MQ--AEHITNGEYVTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAG 295 (339) Q Consensus 218 l~--~~~~~n~d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~ 295 (339) .+ |. +..|.-. +.+.+|.-..++|.+|+.++++|.... +-++|+. .... T Consensus 293 ~~lkd~---~g~~l~~--~~~~~g~~~~l~G~Pv~~~~~~p~~~i--------------~~gd~s~----------~~i~ 343 (390) T protein:vir:62 293 RKLKDA---NGQYLWQ--SGLTVGAPSLFNGKVVETDDGMPADKI--------------LFADLSK----------YRVR 343 (390) T ss_pred HHhhcc---CCCeeec--CCcCCCccceecccceEEecCCCCccE--------------EEeeccc----------eeEE Confidence 43 32 1122111 124456667899999999999984311 1123322 1122 Q ss_pred EEeeeeEEeeechhhhHH--HHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 296 STIPVTSKIFFDDLSKLW--FIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 296 ~~~~~~~e~~~~~~~~~d--~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) ...++.++...+....-| .+++.+-+|.++++|+++++|++++| T Consensus 344 ~~~~~~v~~~~~~~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~~~~ 389 (390) T protein:vir:62 344 FAGSLRVDRSVDAKFSTDQIVYRFLQRADGLLVDARGAKVLTVTPG 389 (390) T ss_pred eecceEEEeeccccccCCcEEEEEEEEeCcEeechhheEEEEeecC Confidence 223344444443322222 35788889999999999999999999 No 83 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=99.33 E-value=1.9e-13 Score=90.28 Aligned_cols=288 Identities=13% Similarity=0.033 Sum_probs=159.1 Q ss_pred CcccchhHHH-HHHHHHHHHHHHHHHhhhccccccccccccceEEEecc-ccceeeeccCCCCCCCCCCCCccceEEEEe Q lcl|NC_020078. 20 HGAGDPLADV-TEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI-SKAKLQKIAPGTTPPPSTEPHTSKIFLKID 97 (339) Q Consensus 20 ~~~~~~~a~~-ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i-G~~t~~~~~~g~~i~~~~~~~~~~~~l~ID 97 (339) ..+.+.-... -++|+.++.+..+..++++.+.++..+.+| .++||++ +.+.+.-+..|+.++.. .+..++.+|..- T Consensus 1 mat~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~-~~~~p~~~~~~~a~wv~Eg~~~~~~-~~~f~~v~l~~~ 78 (311) T protein:vir:81 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFG-EQQYMTLTAPPRGEVVGEGAQKSES-TATFAPVTAIPR 78 (311) T ss_pred CceecCCceEcchhHHHHHHHHHHhcchhhhhcceeecCCC-ceEEEEEeCCceeEEeecCcccccc-cceeeEEEEeeE Confidence 2222222223 389999999999999999999887766555 5788886 67788888889888754 456666666553 Q ss_pred ehhhhh-hhHHHHHHHh-----cCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCcccccc-ccc Q lcl|NC_020078. 98 TVIIAR-NAEPMLDEFQ-----TDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNV-VTL 170 (339) Q Consensus 98 ~~~y~~-~~vdd~D~~q-----~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~-~~~ 170 (339) ++.. ..|. ++.. ...++.+.+.++.+++|++.+|+.++.-.- + ..+....|...... ... T Consensus 79 --kl~~~~~iS--~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~-------~--~~~~~~~gi~~~~~~~~~ 145 (311) T protein:vir:81 79 --KVQVTQRFS--QEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGIN-------P--LTGAALSGSPAKILDTTN 145 (311) T ss_pred --EEEEeehhh--HHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhcccc-------C--CCCcccccccccccccce Confidence 3222 2221 2221 345689999999999999999998853210 0 00000111111100 000 Q ss_pred cCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceeecceeEEEeceEEE Q lcl|NC_020078. 171 AGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLNTKYMFAAFGVPVI 250 (339) Q Consensus 171 ~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~~G~v~~i~G~~V~ 250 (339) .......+...++..+.++..++...+.. | ...+++|..+..|.+-.. .+..|.-.. ....+..++++|.+|+ T Consensus 146 ~~~~~~~~~~~~~~~i~~~~~~~~~~~~~-~---~~~vmn~~~~~~l~~lkd-~~G~~l~~~--~~~~~~~~tl~G~Pv~ 218 (311) T protein:vir:81 146 IVELTTGTSATPDLAVEAAVGLVLGDNLS-P---DGVALDNTFSFMLATQRD-SQGRKLYPE--LGFGTDVASFAGLNAA 218 (311) T ss_pred eeeecccccchHHHHHHHHHHHhhhcCCC-c---eEEEEcHHHHHHHHhhhc-cCCCeeecC--ccccCCCceecceeEE Confidence 00111122223344455555556555542 2 346899999999965211 111111111 1224567889999999 Q ss_pred Eeccccccccccccc----cCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechh------hhH---HHHHH Q lcl|NC_020078. 251 TSNNAVFGKTITDHL----LSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDL------SKL---WFIDS 317 (339) Q Consensus 251 ~Snnlp~~~~~~~~~----l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~------~~~---d~i~g 317 (339) .++++|......... ....+....+-++|++ +..+...+++.++.++.. .|. -.+++ T Consensus 219 ~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~gDfs~----------~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~r~ 288 (311) T protein:vir:81 219 VSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSA----------FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRA 288 (311) T ss_pred ecccccccccccccccchhcccCCccEEEEEeccc----------EEEEEeccceEEEeccCCCCcchhhhhcCcEEEEE Confidence 999998543221111 1111111222333333 223333444556554421 121 14667 Q ss_pred HHHhCCccccccceEEEEecCC Q lcl|NC_020078. 318 WLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 318 ~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .+.+|.++++|++.+.|+-..- T Consensus 289 ~~r~d~~v~~~~a~~~l~~a~~ 310 (311) T protein:vir:81 289 EVVYGIGIMSTDAFAVVRDADE 310 (311) T ss_pred EEEeccEeecccceEEEEeecc Confidence 7889999999999888743322 No 84 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=99.33 E-value=1.6e-13 Score=90.80 Aligned_cols=282 Identities=10% Similarity=0.008 Sum_probs=164.2 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc-ccceeeeccCCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI-SKAKLQKIAPGT 79 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i-G~~t~~~~~~g~ 79 (339) ..+..-.|-..+.+ ...+...+..+.|+.++.+..+..+++++++++..+.+ .+++||+. +.+.+.-...|+ T Consensus 17 ~~~~~~~~~a~~~~------~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~-~~~~ip~~~~~~~a~~v~Eg~ 89 (324) T protein:vir:93 17 NNVKPQVFNPDNVM------MHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEG-TEKKFTFWADKPGAYWVGEGQ 89 (324) T ss_pred hhhhhhhccccccc------ccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccC-CceEEEEEecCcceeeecCCc Confidence 22222222222211 11122235569999999999999999999987776554 45778776 677788888888 Q ss_pred CCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccc Q lcl|NC_020078. 80 TPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQM 159 (339) Q Consensus 80 ~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~ 159 (339) .++.. .+..++.++..-+ .+....|.+-=-.++.+|+.+.+.++.++++++..|+.++. +.... ... T Consensus 90 ~~~~~-~~~f~~i~~~~~k-~~~~~~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~----G~g~~-------~~~ 156 (324) T protein:vir:93 90 KIETS-KATWVNATMRAFK-LGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQGNN-------PFG 156 (324) T ss_pred ccccc-ccceeEEEEEeEE-EEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhc----CCCCC-------CcC Confidence 88764 4666666665532 22333343311113568999999999999999999997752 11110 000 Q ss_pred cCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceeecc Q lcl|NC_020078. 160 PGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLNTK 239 (339) Q Consensus 160 ~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~~G 239 (339) .|...... ...........++.|.++...|..++.. + ..++++|..|..|.+- .+. .+...+..+ T Consensus 157 ~~~~~~~~----~~~~~~~~~~~~~~i~~~~~~l~~~~~~-~---~~~v~n~~~~~~L~~l---~d~----~G~~~~~~~ 221 (324) T protein:vir:93 157 KSIAQSIE----KTNKVIKGDFTQDNIIDLEALLEDDELE-A---NAFISKTQNRSLLRKI---VDP----ETKERIYDR 221 (324) T ss_pred cccccccc----ccceeccccccHHHHHHHHHhhhhccCC-C---CEEEEcHHHHHHHHHh---hCC----CCCeeecCC Confidence 11111000 0001111122356677777778776652 2 2568999999998752 221 112223445 Q ss_pred eeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechh---------- Q lcl|NC_020078. 240 YMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDL---------- 309 (339) Q Consensus 240 ~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~---------- 309 (339) ..++++|.+|+.+++.+.+. ...+-++|++ +..+...+++.+..++.. T Consensus 222 ~~~~l~G~PVv~~~~~~~~~------------~~i~~gdfs~----------~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 279 (324) T protein:vir:93 222 NSDSLDGLPVVNLKSSNLKR------------GELITGDFDK----------LIYGIPQLIEYKIDETAQLSTVKNEDGT 279 (324) T ss_pred CCCcccceeeEeecCCCCCc------------ceEEEEecce----------EEEEEecCcEEEEeeccccccccccccc Confidence 66779999999887654221 1122233332 223344455566655431 Q ss_pred ---hh---HHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 310 ---SK---LWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 310 ---~~---~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .| .-.+++.+-+|.++++|++.+.|+...+ T Consensus 280 ~~~~f~~n~~~~r~~~r~d~~v~~~~a~~~l~~a~~ 315 (324) T protein:vir:93 280 PVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred chhhhhcCcEEEEEEEEeccEEecccceEEEecccc Confidence 11 1346777889999999999999875555 No 85 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=99.32 E-value=9.6e-14 Score=91.96 Aligned_cols=288 Identities=10% Similarity=0.020 Sum_probs=161.4 Q ss_pred Cc--cccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc-ccceeeeccC Q lcl|NC_020078. 1 MS--IFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI-SKAKLQKIAP 77 (339) Q Consensus 1 ~~--~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i-G~~t~~~~~~ 77 (339) |. .|-+.-.---..+......+.+...+.-+.|..++.+..+..+.+++++++.++. |.+++||++ +.+.+.-+.. T Consensus 9 ~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~-~~~~~~p~~~~~~~a~~v~E 87 (324) T protein:vir:78 9 LNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGE 87 (324) T ss_pred HHHHHHHHHhhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEecCcceeEecC Confidence 11 1111100000001111111122223455889999999999999999998877755 556888887 6677788888 Q ss_pred CCCCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Q lcl|NC_020078. 78 GTTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAA 157 (339) Q Consensus 78 g~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~ 157 (339) |+.++.. .+..++.++..-+ ...-..|.+-=-.++.+|+.+.+.++.++++++..|+.++. +.... . T Consensus 88 g~~~~~~-~~~~~~v~~~~~k-~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~----G~g~~-------~ 154 (324) T protein:vir:78 88 GQKIETS-KATWVNATMRAFK-LGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQGNN-------P 154 (324) T ss_pred Ccccccc-ccceeEEEEeeEE-EEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhc----cCCCC-------C Confidence 8888764 4666666666532 22223333211113468999999999999999999998752 11110 0 Q ss_pred cccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceee Q lcl|NC_020078. 158 QMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLN 237 (339) Q Consensus 158 ~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~ 237 (339) ...|..... . ...........++.|.++..+|..++.. + ..++++|..|..|.+-. +. .+...+. T Consensus 155 ~~~gi~~~~--~--~~~~~~~~~~t~~~i~~~~~~l~~~~~~-~---~~~vmn~~~~~~L~~l~---d~----~G~~~~~ 219 (324) T protein:vir:78 155 FGKSIAQSI--E--KTNKVIKGDFTQDNIIDLEALLEDDELE-A---NAFISKTQNRSLLRKIV---DP----ETKERIY 219 (324) T ss_pred cCccccccc--c--ccceeccccccHHHHHHHHHhhhhccCC-C---CEEEEcHHHHHHHHHhh---cc----CCCeeec Confidence 001111100 0 0011111122356667777777776652 2 25689999999987532 11 1112234 Q ss_pred cceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechh-------- Q lcl|NC_020078. 238 TKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDL-------- 309 (339) Q Consensus 238 ~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~-------- 309 (339) .+...+++|.+|+.++..+.+ ....+-++|++ +..+...+++.|+.++.. T Consensus 220 ~~~~~~l~G~PV~~~~~~~~~------------~~~~~~gd~~~----------~~~g~~~~~~i~~~~~~~~~~~~~~~ 277 (324) T protein:vir:78 220 DRNSDSLDGLPVVNLKSSNLK------------RGELITGDFDK----------LIYGIPQLIEYKIDETAQLSTVKNED 277 (324) T ss_pred CCCCCcccceeeEeeCCCCCC------------cceEEEEecce----------EEEEEecCcEEEEeeccccccccccc Confidence 566678999999987765421 11112223322 223444555666655431 Q ss_pred -----hh---HHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 310 -----SK---LWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 310 -----~~---~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .| .-.+++.+-+|.+++||++.+.|+...+ T Consensus 278 ~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~ 315 (324) T protein:vir:78 278 GTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred ccchhhhhcCcEEEEEEEEEccEEecccceEEEecccc Confidence 12 2335667789999999999998876444 No 86 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=99.32 E-value=9.6e-14 Score=91.96 Aligned_cols=288 Identities=10% Similarity=0.020 Sum_probs=161.4 Q ss_pred Cc--cccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc-ccceeeeccC Q lcl|NC_020078. 1 MS--IFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI-SKAKLQKIAP 77 (339) Q Consensus 1 ~~--~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i-G~~t~~~~~~ 77 (339) |. .|-+.-.---..+......+.+...+.-+.|..++.+..+..+.+++++++.++. |.+++||++ +.+.+.-+.. T Consensus 9 ~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~-~~~~~~p~~~~~~~a~~v~E 87 (324) T protein:vir:96 9 LNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGE 87 (324) T ss_pred HHHHHHHHHhhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEecCcceeEecC Confidence 11 1111100000001111111122223455889999999999999999998877755 556888887 6677788888 Q ss_pred CCCCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Q lcl|NC_020078. 78 GTTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAA 157 (339) Q Consensus 78 g~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~ 157 (339) |+.++.. .+..++.++..-+ ...-..|.+-=-.++.+|+.+.+.++.++++++..|+.++. +.... . T Consensus 88 g~~~~~~-~~~~~~v~~~~~k-~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~----G~g~~-------~ 154 (324) T protein:vir:96 88 GQKIETS-KATWVNATMRAFK-LGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQGNN-------P 154 (324) T ss_pred Ccccccc-ccceeEEEEeeEE-EEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhc----cCCCC-------C Confidence 8888764 4666666666532 22223333211113468999999999999999999998752 11110 0 Q ss_pred cccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceee Q lcl|NC_020078. 158 QMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLN 237 (339) Q Consensus 158 ~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~ 237 (339) ...|..... . ...........++.|.++..+|..++.. + ..++++|..|..|.+-. +. .+...+. T Consensus 155 ~~~gi~~~~--~--~~~~~~~~~~t~~~i~~~~~~l~~~~~~-~---~~~vmn~~~~~~L~~l~---d~----~G~~~~~ 219 (324) T protein:vir:96 155 FGKSIAQSI--E--KTNKVIKGDFTQDNIIDLEALLEDDELE-A---NAFISKTQNRSLLRKIV---DP----ETKERIY 219 (324) T ss_pred cCccccccc--c--ccceeccccccHHHHHHHHHhhhhccCC-C---CEEEEcHHHHHHHHHhh---cc----CCCeeec Confidence 001111100 0 0011111122356667777777776652 2 25689999999987532 11 1112234 Q ss_pred cceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechh-------- Q lcl|NC_020078. 238 TKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDL-------- 309 (339) Q Consensus 238 ~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~-------- 309 (339) .+...+++|.+|+.++..+.+ ....+-++|++ +..+...+++.|+.++.. T Consensus 220 ~~~~~~l~G~PV~~~~~~~~~------------~~~~~~gd~~~----------~~~g~~~~~~i~~~~~~~~~~~~~~~ 277 (324) T protein:vir:96 220 DRNSDSLDGLPVVNLKSSNLK------------RGELITGDFDK----------LIYGIPQLIEYKIDETAQLSTVKNED 277 (324) T ss_pred CCCCCcccceeeEeeCCCCCC------------cceEEEEecce----------EEEEEecCcEEEEeeccccccccccc Confidence 566678999999987765421 11112223322 223444555666655431 Q ss_pred -----hh---HHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 310 -----SK---LWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 310 -----~~---~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .| .-.+++.+-+|.+++||++.+.|+...+ T Consensus 278 ~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~ 315 (324) T protein:vir:96 278 GTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred ccchhhhhcCcEEEEEEEEEccEEecccceEEEecccc Confidence 12 2335667789999999999998876444 No 87 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=99.30 E-value=1.5e-13 Score=90.91 Aligned_cols=283 Identities=9% Similarity=-0.024 Sum_probs=161.5 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc-ccceeeeccCCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI-SKAKLQKIAPGT 79 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i-G~~t~~~~~~g~ 79 (339) -.+..|+ ..++.....+.+...+.-+.++.++.+..+..+++++++++.++. |.+++||+. +.+.+.-...|+ T Consensus 16 ~~~~~~~-----~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~~~~~-~~~~~~p~~~~~~~a~~v~Eg~ 89 (324) T protein:vir:96 16 SNNVKPQ-----VFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEGQ 89 (324) T ss_pred Hhhhhhh-----hcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEecCcceeeecCCc Confidence 0111111 111111111122223556899999999999999999998877755 456888887 566777778888 Q ss_pred CCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccc Q lcl|NC_020078. 80 TPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQM 159 (339) Q Consensus 80 ~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~ 159 (339) .++.. .+...+.++..-. ...-..|.+-=-.++.+|+.+.+.++.++++++..|+.+|. +.... ... T Consensus 90 ~~~~~-~~~f~~v~~~~~k-~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~~~l~----G~g~~-------~~~ 156 (324) T protein:vir:96 90 KIETS-KATWVNATMRAFK-LGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQGNN-------PFG 156 (324) T ss_pred ccccc-ccceeEEEEEeEE-EEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhh----cCCCC-------CcC Confidence 88754 4566666665532 22223333311113568899999999999999999998762 11110 001 Q ss_pred cCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceeecc Q lcl|NC_020078. 160 PGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLNTK 239 (339) Q Consensus 160 ~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~~G 239 (339) .|....... ..........++.|.++..+|...+.. + -.++++|..|..|.+-. +. .+...+.++ T Consensus 157 ~~~~~~~~~----~~~~~~~~~~~~~i~~~~~~i~~~~~~-~---~~~i~n~~~~~~L~~lk---d~----~G~~~~~~~ 221 (324) T protein:vir:96 157 KSIAQSIKK----TNKVIKGDFTQDNIIDLEALLEDDELE-A---NAFISKTQNRSLLRKIV---DP----ETKERIYDR 221 (324) T ss_pred ccccccccc----cceecccccchHHHHHHHHhhhhccCC-C---CEEEEcHHHHHHHHHhh---CC----CCCeeecCC Confidence 111111000 001111112356677777777766552 2 25689999999987532 11 111223445 Q ss_pred eeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechh---------- Q lcl|NC_020078. 240 YMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDL---------- 309 (339) Q Consensus 240 ~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~---------- 309 (339) ...+++|++|+.++..+.+. ... ++.+.+-+..+...++..+..++.. T Consensus 222 ~~~~l~G~PV~~~~~~~~~~------------~~~----------~~gd~s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 279 (324) T protein:vir:96 222 NSDSLDGLPVVNLKSSNLKR------------GEL----------ITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGT 279 (324) T ss_pred CCCcccceeeEeecCCCCCc------------ceE----------EEEecceEEEEEecCcEEEEeeccccccccccccc Confidence 66779999999887664321 011 2222222333444455556555431 Q ss_pred ---hh---HHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 310 ---SK---LWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 310 ---~~---~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .| .-.+++.+-+|.+++||++.+.|+...+ T Consensus 280 ~~~~~~~n~v~~r~~~r~d~~v~~~~a~~~l~~a~~ 315 (324) T protein:vir:96 280 PVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred chhhhhcCcEEEEEEEEeccEEecccceEEEecccc Confidence 11 1235677889999999999998875555 No 88 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=99.27 E-value=2.7e-13 Score=89.51 Aligned_cols=280 Identities=11% Similarity=0.034 Sum_probs=160.3 Q ss_pred CccccCc-ccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc-ccceeeeccCC Q lcl|NC_020078. 1 MSIFDGQ-TPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI-SKAKLQKIAPG 78 (339) Q Consensus 1 ~~~~~~~-~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i-G~~t~~~~~~g 78 (339) -.+..|+ |-..+. ....+...+.-+.|+.++.+..+..+.+++++++..+.+ .+++||+. +.+.+.-...| T Consensus 16 ~~~~~~~~~~a~~~------~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~-~~~~~p~~~~~~~a~~v~Eg 88 (324) T protein:vir:10 16 SNNVKPQVFNPDNV------MMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEG-TEKKFTFWADKPGAYWVGEG 88 (324) T ss_pred HHhhccceecccce------eccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccC-CceEEEEEeCCcceeEeccC Confidence 1111222 111111 111122225558999999999999999999988777554 46888887 56677888888 Q ss_pred CCCCCCCCCCccceEEEEeehhhhhh-hHHHHHHH-hcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Q lcl|NC_020078. 79 TTPPPSTEPHTSKIFLKIDTVIIARN-AEPMLDEF-QTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTA 156 (339) Q Consensus 79 ~~i~~~~~~~~~~~~l~ID~~~y~~~-~vdd~D~~-q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~ 156 (339) +.++.. .+...+.++..- ++..+ .|.+ +-. ++.+|+.+.+.++.++++++..|+.++. +... + T Consensus 89 ~~~~~~-~~~~~~v~~~~~--k~~~~~~iS~-ell~ds~~~l~~~i~~~l~~ai~~~~d~a~l~----G~g~-------~ 153 (324) T protein:vir:10 89 QKIETS-KATWVNATMRAF--KLGVILPVTK-EFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQGN-------N 153 (324) T ss_pred cccccc-ccceeEEEEeeE--EEEEeehhhH-HHHhcchHHHHHHHHHHHHHHHHHHHHHHhhh----cCCC-------C Confidence 888754 356666666543 33332 2322 112 3458899999999999999999998752 1110 0 Q ss_pred ccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhccccccccee Q lcl|NC_020078. 157 AQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETL 236 (339) Q Consensus 157 ~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l 236 (339) ....|...... ...........++.|.++...|..++... . .++++|..|..|.+- .+. .+ ...+ T Consensus 154 ~~~~~i~~~~~----~~~~~~~~~~t~~~i~~~~~~l~~~~~~~---~-~~v~n~~~~~~L~~l---~d~--~g--~~~~ 218 (324) T protein:vir:10 154 PFGKSIAQSIE----KTNKVIKGDFTQDNIIDLEALLEDDELEA---N-AFISKTQNRSLLRKI---VDP--ET--KERI 218 (324) T ss_pred ccCcccccccc----ccceeccccCCHHHHHHHHHhhhhccCCC---C-EEEEcHHHHHHHHHh---hcc--CC--ceee Confidence 00011111100 01111111223566777888887766532 2 458999999998753 222 11 1123 Q ss_pred ecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechh------- Q lcl|NC_020078. 237 NTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDL------- 309 (339) Q Consensus 237 ~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~------- 309 (339) ..+.-.+++|.+|+.++..+.+. ...+-++|+ -+..+...++..|..++.. T Consensus 219 ~~~~~~~l~G~PV~~~~~~~~~~------------~~~~~gd~~----------~~~~~~~~~~~i~~~~~~~~~~~~~~ 276 (324) T protein:vir:10 219 YDRNSDTLDGLPVVNLKSSNLKR------------GELITGDFD----------KLIYGIPQLIEYKIDETAQLSTVKNE 276 (324) T ss_pred cCCCCccccceeEEeecCCCCCc------------ceEEEEecc----------cEEEEEecCcEEEEeecccccccccc Confidence 34445679999999887654221 111222222 2223344455556554421 Q ss_pred ------hh---HHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 310 ------SK---LWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 310 ------~~---~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .| .-.+++.+.+|.++++|++.+.|+...+ T Consensus 277 ~~~~~~~~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a~~ 315 (324) T protein:vir:10 277 DGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred cccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccC Confidence 11 1335666789999999999999977666 No 89 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=99.27 E-value=1.9e-13 Score=90.35 Aligned_cols=276 Identities=12% Similarity=0.144 Sum_probs=154.6 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc--ccceeeeccCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI--SKAKLQKIAPG 78 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i--G~~t~~~~~~g 78 (339) ..........-...+.+-....|. .+..+.|+.++.+.....+.+++++++.++.++ +..+|.. +...+..+..| T Consensus 120 ~~~~~~~~~~~~~~~~~~~~~~gg--~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~E~ 196 (400) T protein:vir:38 120 AVLRAVPTDASDAVNAGVKAADAA--STIPETISNTPQRELQTVVDLKPFTNVFQASTQ-KGTYPTVANATTKMVTVAEL 196 (400) T ss_pred hhhhhhhHHHHHHHhhcccccCCc--ccccHHHHHHHHHHHHhhhhhhhcceeEeccCc-ceEEEEEecCCCcccccccc Confidence 000000000001111111111111 234489999999999999999999888776544 3445543 45556666666 Q ss_pred CCCCCCCCCCccceEEEEeehhhhhhh-HHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Q lcl|NC_020078. 79 TTPPPSTEPHTSKIFLKIDTVIIARNA-EPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAA 157 (339) Q Consensus 79 ~~i~~~~~~~~~~~~l~ID~~~y~~~~-vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~ 157 (339) +.......+..++.++.. .++..+. |.+-=-.++.+|+.+.+.++.+++|+...|+.|+.-. T Consensus 197 ~~~~~~~~~~f~~i~~~~--~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~~~--------------- 259 (400) T protein:vir:38 197 EKNPAMAKPEFKPVNWSV--ETYRQALPVSQESIDDSAIDLVGLIAQNGQQIKVNTTNGAVATLL--------------- 259 (400) T ss_pred ccccccccccceeeEeeh--hheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhhhhcc--------------- Confidence 666543345555555544 3443322 2221111346789999999999999999998875211 Q ss_pred cccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceee Q lcl|NC_020078. 158 QMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLN 237 (339) Q Consensus 158 ~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~ 237 (339) +.+. ++...+.+ .+.++.... ++ +...-..|++|..|..|.+-.. .+..|.-.. .+. T Consensus 260 -----~~~~------~~~~~~~~----~~~~~~~~~----~~-~~~~a~~v~~~~~~~~l~~lkd-~~G~~i~~~--~~~ 316 (400) T protein:vir:38 260 -----KGFT------AKTISSVD----DLKHINNVD----LD-PAYSRVIIASQSFYNFLDTVKD-GNGRYLLQD--SIL 316 (400) T ss_pred -----cccc------ccccccHH----HHHHHHHhh----hh-hhhCcEEEEcHHHHHHHHHhhc-cCCCeeeec--CcC Confidence 0111 01111222 233332221 11 1123356889999999865211 122222111 234 Q ss_pred cceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEec-cceeEEEEEeeeeEEeeechhhhHHHHH Q lcl|NC_020078. 238 TKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFS-PKALLAGSTIPVTSKIFFDDLSKLWFID 316 (339) Q Consensus 238 ~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h-~~A~~~~~~~~~~~e~~~~~~~~~d~i~ 316 (339) +|..++++|++|+.+++.|... .|+.. .++.. +.++..+...++.++..++ ..+...++ T Consensus 317 ~~~~~~l~G~pv~~~~~~~~~~---------~g~~~----------~~~gd~s~~~~~~~~~~~~~~~~~~-~~~~~~~~ 376 (400) T protein:vir:38 317 TPSGKSVLGMPIAVVSDDTLGA---------AGEAH----------AFLGDIKRAILFANRADFMVRWVDD-QIYGQFLQ 376 (400) T ss_pred CCCccccccceeEEecccccCC---------CCceE----------EEEEeccccEEEEeecceEEEEecc-cccceeEE Confidence 5666789999999999887432 11111 12222 2234444445556665544 45777899 Q ss_pred HHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 317 SWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 317 g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) +++-+|.++++|++.+.|+++++ T Consensus 377 ~~~r~d~~~~~~~a~~~l~~~~~ 399 (400) T protein:vir:38 377 AGMRFGVSVADEKAGYFLTYTPK 399 (400) T ss_pred EEEEeccEEecccceEEEEeecC Confidence 99999999999999999999888 No 90 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=99.27 E-value=4.9e-13 Score=88.08 Aligned_cols=277 Identities=14% Similarity=0.108 Sum_probs=160.6 Q ss_pred ccCc-ccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc-ccceeeeccCCCCC Q lcl|NC_020078. 4 FDGQ-TPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI-SKAKLQKIAPGTTP 81 (339) Q Consensus 4 ~~~~-~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i-G~~t~~~~~~g~~i 81 (339) ...+ |=..|.+-. ++...+..++|+.++.+..+..++++.++++..+.++....+++. +.+.+.....|+.+ T Consensus 1 m~~~~~~~~~~~~t------~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~ 74 (297) T protein:vir:95 1 MTVQTFNPENVLVS------QKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTDGISAYWVNETEKI 74 (297) T ss_pred CCcccccccccccc------CCCcceechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcCCceeEEeecCccc Confidence 1222 212233222 222235669999999999999999999988877766555666655 45677888888888 Q ss_pred CCCCCCCccceEEEEeehhhhh-hhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccc Q lcl|NC_020078. 82 PPSTEPHTSKIFLKIDTVIIAR-NAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMP 160 (339) Q Consensus 82 ~~~~~~~~~~~~l~ID~~~y~~-~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~ 160 (339) +.. .+...+.++..- ++.. ..|.+-=-.++..|+.+.+.++.++++++..|+.++. +.....+ . T Consensus 75 ~~~-~~~f~~v~l~~~--k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~----G~g~~~~--------~ 139 (297) T protein:vir:95 75 KTD-KPEVVPVTLKAH--KLGIILVTSREALNYTWKKFFEDMKPQIVEAFYKKIDEAGLL----GHDTPFA--------N 139 (297) T ss_pred ccc-ccceeEEEEeeE--EEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhc----ccCCccc--------c Confidence 754 466666666653 3333 3333311113568999999999999999999998862 1111100 0 Q ss_pred CccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceeecce Q lcl|NC_020078. 161 GHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLNTKY 240 (339) Q Consensus 161 g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~~G~ 240 (339) |.............+..+ ++.|+++..+|...+... . .++++|..|..|.+ +.+. .+..+-++. T Consensus 140 gi~~~~~~~~~~~~~~~t----~~~i~~~~~~l~~~~~~~---~-~~v~~~~~~~~L~~---l~d~-----~G~~i~~~~ 203 (297) T protein:vir:95 140 SVAKAAKDANKVIGGPIN----YDNILKLQDALYDADVEP---N-AFVSKIQNRSALRE---ARDG-----NKVSIYDKA 203 (297) T ss_pred cccccccccceecccccC----HHHHHHHHHHhhhccCCc---C-EEEEcHHHHHHHHH---hhcc-----CCceeecCC Confidence 111000000011111122 455667777777666532 2 46889999999875 2221 122344556 Q ss_pred eEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechh----------- Q lcl|NC_020078. 241 MFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDL----------- 309 (339) Q Consensus 241 v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~----------- 309 (339) .+.++|.+|+.+++.+... + ..+-++|+ .+..+...++..++.++.. T Consensus 204 ~~~l~G~Pv~~~~~~~~~~---~---------~~~~gd~s----------~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ 261 (297) T protein:vir:95 204 ANTIDGITTVDLKSARFEK---G---------DLLAGDFD----------NLIYGVPYNITYKISEEGQISTITNADGTP 261 (297) T ss_pred CCcccceeeEeecCCCCCC---c---------eEEEEecc----------cEEEEEecCeEEEEeeccccccccccCccc Confidence 6789999999887654221 0 11122222 2223444455556555431 Q ss_pred --hh---HHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 310 --SK---LWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 310 --~~---~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .| .-.+++.+.+|.++++|++.+.|+..+= T Consensus 262 ~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~at~ 296 (297) T protein:vir:95 262 INLFEQEMIAIRATMDIAVMITKTDAFAKLTPAER 296 (297) T ss_pred hhhhhcCcEEEEEEEEeccEeecccceEEEeecCC Confidence 01 1235666789999999999998854333 No 91 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=99.26 E-value=6.7e-13 Score=87.34 Aligned_cols=281 Identities=14% Similarity=0.066 Sum_probs=161.4 Q ss_pred ccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc-ccceeeeccCCCCCCCCCCCCccceEEEE Q lcl|NC_020078. 18 QRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI-SKAKLQKIAPGTTPPPSTEPHTSKIFLKI 96 (339) Q Consensus 18 ~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i-G~~t~~~~~~g~~i~~~~~~~~~~~~l~I 96 (339) ..-.+|. +..++++.++.+..++.++++.+.++..+.+| +++||++ +.+.+.-+..|++++.. .+..++.++.. T Consensus 1 ma~~gG~---lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~-~~~~p~~~~~~~a~~v~Eg~~~~~~-~~~f~~v~l~~ 75 (298) T protein:vir:94 1 MVLNKGT---LFDPELVTDLISKVAGKSSIARLSAQKPIPFN-GEKVFTFTMDSEIDVVAESGKKTHG-GVTLAPQTMVP 75 (298) T ss_pred Ceecccc---ccChhHHHHHHHHHHhhchhhhhcceeeccCC-ceEEEEEecCcceEEeeCCcccccc-ccceeEEEEee Confidence 1111121 34488999999999999999999887766554 5788887 67788888888888753 45566666654 Q ss_pred eehhhh-hhhHHHHHHHh-----cCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCccccccccc Q lcl|NC_020078. 97 DTVIIA-RNAEPMLDEFQ-----TDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTL 170 (339) Q Consensus 97 D~~~y~-~~~vdd~D~~q-----~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~ 170 (339) - ++. ...|. ++.. ...++.+.+.++.+++|++.+|+.++.-. ++.........+......... T Consensus 76 ~--k~~~~~~iS--~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~-------~~~~g~~~~~~~~~~~~~~~~ 144 (298) T protein:vir:94 76 I--KVEYGARIS--DEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGV-------NPRLGTASAVIGTNHFDSKVT 144 (298) T ss_pred e--EEEEeeehh--HHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhccc-------ccCCCcccccccccccccccc Confidence 3 322 22232 2221 23568899999999999999998886321 001111111111000000000 Q ss_pred cCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceeecceeEEEeceEEE Q lcl|NC_020078. 171 AGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLNTKYMFAAFGVPVI 250 (339) Q Consensus 171 ~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~~G~v~~i~G~~V~ 250 (339) ...........+++.+.++..+|...+.. + ...+++|..+..|.+-..- +..|.-.. ...+|..++++|++|+ T Consensus 145 ~~~~~~~~~~~~~~~i~~~~~~~~~~~~~-~---~~~vmn~~~~~~l~~lkd~-~G~~l~~~--~~~~~~~~tl~G~PV~ 217 (298) T protein:vir:94 145 QKVEAPRGIADPNGAIENAVELLTGVDAD-V---TGIAINPSFRSALAKQKDL-QGNALFPE--LKWGATPDTINGLPVD 217 (298) T ss_pred cccccccccccHHHHHHHHHHhhhhcCCC-c---cEEEEcHHHHHHHHHhhcc-CCCeeecC--cccCCCCceecceeeE Confidence 00111122344667788888888877663 2 2579999999998652211 11111111 1235666789999999 Q ss_pred EeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeec--hh-----hh-HH--HHHHHHH Q lcl|NC_020078. 251 TSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFD--DL-----SK-LW--FIDSWLA 320 (339) Q Consensus 251 ~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~--~~-----~~-~d--~i~g~~~ 320 (339) .|+++|.....+ ....+-++|++. +......+++.++.+. ++ .| .+ .+++.+. T Consensus 218 ~~~~v~~~~~~~--------~~~~~~Gdfs~~---------~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r 280 (298) T protein:vir:94 218 VNKTVSDMSLTQ--------RDRAIIGDFANG---------FKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELF 280 (298) T ss_pred EecccccccCCC--------ccEEEEeeccce---------EEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEE Confidence 999998532111 111233333332 2222233444454432 11 12 11 2566778 Q ss_pred hCCccccccceEEEEecC Q lcl|NC_020078. 321 FGVTINRTEYAGVIKLPA 338 (339) Q Consensus 321 ~Ga~v~rPe~~v~i~~~~ 338 (339) +|.+++||++.+.|+-.. T Consensus 281 ~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 281 LGWGILDATKFARVTEAN 298 (298) T ss_pred eccEeecccceEEEEecC Confidence 999999999999986666 No 92 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=99.26 E-value=5.8e-13 Score=87.65 Aligned_cols=282 Identities=13% Similarity=0.074 Sum_probs=160.9 Q ss_pred ccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc-ccceeeeccCCCCCCCCCCCCccceEEEE Q lcl|NC_020078. 18 QRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI-SKAKLQKIAPGTTPPPSTEPHTSKIFLKI 96 (339) Q Consensus 18 ~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i-G~~t~~~~~~g~~i~~~~~~~~~~~~l~I 96 (339) ....+| .+..+.++.++.+..+..+++++++++....+|+ +.||+. +.+.+.-+..|++++.. .+..++.++.. T Consensus 1 ma~~gG---~lvp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~-~~ip~~~~~~~a~~v~E~~~~~~~-~~~f~~v~l~~ 75 (298) T protein:vir:16 1 MVLNKG---TLFDPTLVTDLISKVAGKSSIARLSAQKPIPFNG-EKVFTFTMDSEIDVVAESGKKTHG-GVTLAPQTMVP 75 (298) T ss_pred CcccCc---ceechhHHHHHHHHHHhhhhhhhhcceeeccCCc-eEEEEEecCcceEEecCCcccccc-ccceeEEEEee Confidence 221222 2455788999999999999999998877766544 567775 67788888888888754 35555555544 Q ss_pred eehhhhhhhHHHHHHH-----hcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCcccccccccc Q lcl|NC_020078. 97 DTVIIARNAEPMLDEF-----QTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLA 171 (339) Q Consensus 97 D~~~y~~~~vdd~D~~-----q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~ 171 (339) .++... +.==++. ....++.+.+.++.++++++..|+.++.-. .+...++....+.......... T Consensus 76 --~k~a~~-~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~-------~~~~g~~~~~~~~~~~~~~~~~ 145 (298) T protein:vir:16 76 --IKVEYG-ARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGV-------NPRLGTASAVIGTNHFDSKVTQ 145 (298) T ss_pred --eeEEEe-ehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccc-------cCCCCccccccccccccccccc Confidence 343322 2212222 234678999999999999999999886321 0111111111111000000000 Q ss_pred CccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceeecceeEEEeceEEEE Q lcl|NC_020078. 172 GANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLNTKYMFAAFGVPVIT 251 (339) Q Consensus 172 ~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~~G~v~~i~G~~V~~ 251 (339) ..........+++.|.++..++...+.+ + . ..+++|..+..|.+-..-. ..|.-.. ...+|..++++|.+|+. T Consensus 146 ~~~~~~~~~~~~~~i~~~~~~~~~~~~~-~--~-~~vmn~~~~~~l~~lkd~~-G~~i~~~--~~~~~~~~~l~G~PV~~ 218 (298) T protein:vir:16 146 KVEAPRGIADPNGAIENAVELLTGVDAD-V--T-GIAINPSFRSALAKQKDLQ-DNALFPE--LKWGATPDTINGLPVDV 218 (298) T ss_pred ccccccccccHHHHHHHHHHHhhhcCCC-c--c-EEEEcHHHHHHHHHhhccC-CCeeecC--cccCCCCceecceeeEE Confidence 1111112234566788888888777663 2 2 3688999999987632211 1121111 12356667899999999 Q ss_pred eccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeech--h-----hhH---HHHHHHHHh Q lcl|NC_020078. 252 SNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDD--L-----SKL---WFIDSWLAF 321 (339) Q Consensus 252 Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~--~-----~~~---d~i~g~~~~ 321 (339) ++++|.....+ ....+-++|++. +......+++.++.+.. . +|. -.+++.+.+ T Consensus 219 ~~~v~~~~~~~--------~~~~~~GDfs~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r~ 281 (298) T protein:vir:16 219 NKTVSDMSLTQ--------RDRAIIGDFANG---------FKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFL 281 (298) T ss_pred ecccccccCCC--------ccEEEEeeccce---------EEEEEecCceEEEeeccCCcCcchhhhhcCcEEEEEEEEE Confidence 99998432111 111233344332 22222233444544432 1 121 225667789 Q ss_pred CCccccccceEEEEecC Q lcl|NC_020078. 322 GVTINRTEYAGVIKLPA 338 (339) Q Consensus 322 Ga~v~rPe~~v~i~~~~ 338 (339) |.+++||++.+.|+-.. T Consensus 282 d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 282 GWGILDATKFARVTEAN 298 (298) T ss_pred ccEeecccceEEEeecC Confidence 99999999999997666 No 93 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=99.26 E-value=5.7e-13 Score=87.70 Aligned_cols=273 Identities=10% Similarity=0.018 Sum_probs=161.6 Q ss_pred cccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccc-cceEEEeccc--cceeeeccCCCCCCCCCCCCc Q lcl|NC_020078. 13 VTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRG-TSTISNRGIS--KAKLQKIAPGTTPPPSTEPHT 89 (339) Q Consensus 13 ~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~-G~tv~i~~iG--~~t~~~~~~g~~i~~~~~~~~ 89 (339) +-|.......++--.+.-++|+.++.+..+..+.+++++++..+.+ ..+..|+... ...+.....|+.+.....++. T Consensus 1 ~l~~~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~ 80 (293) T protein:vir:48 1 MLDSKTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADIDDPKL 80 (293) T ss_pred CceeecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCcceeeecCCcccccccccce Confidence 4444333333222234559999999999999999999988877654 3466676553 345666667777764334556 Q ss_pred cceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCcccccccc Q lcl|NC_020078. 90 SKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVT 169 (339) Q Consensus 90 ~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~ 169 (339) .+.++...... ....|.+-=-.++.+|+.+.+.++.++++++..|+.|+.-+-+ . T Consensus 81 ~~i~l~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~--------------------~---- 135 (293) T protein:vir:48 81 SLIKYTIKRYA-GISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGVVDK--------------------L---- 135 (293) T ss_pred eEEEEeeeEEE-EeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhcccc--------------------c---- Confidence 66666664222 2223332111245689999999999999999999988632110 0 Q ss_pred ccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceeecceeEEEeceEE Q lcl|NC_020078. 170 LAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLNTKYMFAAFGVPV 249 (339) Q Consensus 170 ~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~~G~v~~i~G~~V 249 (339) ...+...+. +.|.++..+|..... + .-..+++|..|..|.+-.+- +..|.-.. .+.+|...+++|.+| T Consensus 136 -~~~~~~~~~----d~i~~~~~~l~~~~~--~--~a~~vmn~~~~~~L~~lkd~-~g~~l~~~--~~~~~~~~~l~G~Pv 203 (293) T protein:vir:48 136 -PTKPTLTKW----DDIIDLEAKVDPAIK--Q--TSFFLTNTSGFTALKKVKNA-LGDYLMER--DVKSPTGYSIAGFAV 203 (293) T ss_pred -cccccccCH----HHHHHHHHhhhhhhc--C--CCEEEEcHHHHHHHHHhhcc-CCceEeec--CcCCCCCceecceee Confidence 011122333 445566666654433 2 22457899999998552211 11221111 244567778999999 Q ss_pred EEeccccccccccccccCCCccccccccccceEEEEEec-cceeEEEEEeeeeEEeeech-hhhH---HHHHHHHHhCCc Q lcl|NC_020078. 250 ITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFS-PKALLAGSTIPVTSKIFFDD-LSKL---WFIDSWLAFGVT 324 (339) Q Consensus 250 ~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h-~~A~~~~~~~~~~~e~~~~~-~~~~---d~i~g~~~~Ga~ 324 (339) +.+.+.+.... ..+... .++.. ++++..+...+++.+..+.. +.|. -.+++.+-+|.+ T Consensus 204 ~~~~~~~~~~~-------~~~~~~----------~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~ 266 (293) T protein:vir:48 204 KEISDRWLPNA-------SSGVMP----------LYFGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKVRVIDRFDVV 266 (293) T ss_pred EEecccccCCc-------cCCceE----------EEEEeccceEEEEEecceEEEEecccchhhhcCeEEEEEEEeeCcE Confidence 88655432110 111111 12222 33455555556666665533 2332 346677789999 Q ss_pred cccccceEEEEecCC Q lcl|NC_020078. 325 INRTEYAGVIKLPAA 339 (339) Q Consensus 325 v~rPe~~v~i~~~~a 339 (339) +++|++.+.++++++ T Consensus 267 ~~~~~a~~~l~~~~~ 281 (293) T protein:vir:48 267 ATDTEAFVPASFKAI 281 (293) T ss_pred EecccceEEEEeecc Confidence 999999999998887 No 94 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=99.26 E-value=3.2e-13 Score=89.07 Aligned_cols=286 Identities=9% Similarity=0.001 Sum_probs=162.5 Q ss_pred CccccCcccCCC----cccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc-ccceeeec Q lcl|NC_020078. 1 MSIFDGQTPSYD----VTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI-SKAKLQKI 75 (339) Q Consensus 1 ~~~~~~~~~~~~----~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i-G~~t~~~~ 75 (339) |.+ |+|=..+ ..|+.-...+.+...+.-+.|+.++.+..+..+++++++++-... |.+++||+. +.+.+.-. T Consensus 9 ~~~--~~f~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~ip~~~~~~~a~~v 85 (324) T protein:vir:97 9 LNL--QHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWV 85 (324) T ss_pred HHH--HHHHHhhhhhhhhccccccccCCCcceechhHHHHHHHHHHhhcchhhhcceeecc-CCceEEEEEecCcceeEe Confidence 111 1121111 111111111122223455899999999999999999998766654 556888887 56677777 Q ss_pred cCCCCCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Q lcl|NC_020078. 76 APGTTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGT 155 (339) Q Consensus 76 ~~g~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~ 155 (339) ..|+.++.. .+..++.++..-+ ...-..|.+-=-.++.+++.+.+.++.++++++..|+.++. +.... T Consensus 86 ~Eg~~~~~~-~~~f~~v~~~~~k-~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~----G~g~~------ 153 (324) T protein:vir:97 86 GEGQKIETS-KATWVNATMRAFK-LGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQGNN------ 153 (324) T ss_pred ccCcccccc-ccceeEEEEeeEE-EEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhc----cCCCC------ Confidence 788888754 4566666665532 22223333311113468899999999999999999998752 11100 Q ss_pred cccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccce Q lcl|NC_020078. 156 AAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGET 235 (339) Q Consensus 156 ~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~ 235 (339) ....|.... .. ...........++.|.++...|...+.. + -.++++|..|..|.+- .+. .| ... T Consensus 154 -~~~~gi~~~--~~--~~~~~~~~~~~~~~i~~~~~~l~~~~~~-~---~~~v~n~~~~~~L~~l---kd~--~g--~~~ 217 (324) T protein:vir:97 154 -PFGKSIAQS--IE--KTNKVIKGDFTQDNIIDLEALLEDDELE-A---NAFISKTQNRSLLRKI---VDP--ET--KER 217 (324) T ss_pred -ccCcccccc--cc--ccceeccccCCHHHHHHHHHhhhhccCC-C---CEEEEcHHHHHHHHHh---hcC--CC--cee Confidence 000111100 00 0011111122356677777788776652 2 2458999999988652 211 11 112 Q ss_pred eecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechh------ Q lcl|NC_020078. 236 LNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDL------ 309 (339) Q Consensus 236 l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~------ 309 (339) +..+.-+.++|.+|+.++.++.+. +..+ +...+-+..+...+++.|+.++.. T Consensus 218 ~~~~~~~tl~G~PV~~~~~~~~~~------------~~~~----------~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~ 275 (324) T protein:vir:97 218 IYDRNSDTLDGLPVVNLKSSNLKR------------GELI----------TGDFDKLIYGIPQLIEYKIDETAQLSTVKN 275 (324) T ss_pred ecCCCCccccceeeEeecCCCCCc------------ceEE----------EEecccEEEEEecCcEEEEeeccccccccc Confidence 334455679999999987765321 1112 222222334445566667665532 Q ss_pred -------hh---HHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 310 -------SK---LWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 310 -------~~---~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .| .-.+++.+-+|.++++|++.+.|+...+ T Consensus 276 ~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 315 (324) T protein:vir:97 276 EDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred ccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccC Confidence 11 1234566779999999999999988777 No 95 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=99.25 E-value=4.2e-13 Score=88.46 Aligned_cols=289 Identities=11% Similarity=0.036 Sum_probs=160.8 Q ss_pred CccccCcccC-----CCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccc--cceee Q lcl|NC_020078. 1 MSIFDGQTPS-----YDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGIS--KAKLQ 73 (339) Q Consensus 1 ~~~~~~~~~~-----~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG--~~t~~ 73 (339) +....+..-. -...+......+++.-.+....+..++.+.....+.+++++++.++.+ .++.+++.. ..++. T Consensus 92 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~a~ 170 (390) T protein:vir:10 92 AGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITQPDARLTVRDLIGSGRTDS-ALIEYVQETGFVNNAA 170 (390) T ss_pred HHhhhhhhhhhhhHHHHHHHhhhcccccccccccchhHHHHHHHHHHhhchhhhhcceeeccC-CceEEEEEecCCccee Confidence 0000000000 011111122222222335567777888888888888899888777654 467777753 35666 Q ss_pred eccCCCCCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Q lcl|NC_020078. 74 KIAPGTTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPY 153 (339) Q Consensus 74 ~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~ 153 (339) ....|+.++.. .+..+++++.+.+.. ..+.|.+ .-.+...++.+.+.++.+.++++..|+.++ .+... T Consensus 171 ~v~Eg~~~~~~-~~~~~~i~~~~~k~~-~~~~is~-ell~d~~~l~~~i~~~l~~~~~~~~~~~il----~G~G~----- 238 (390) T protein:vir:10 171 IVAEGALKPES-SLKFAKKTDTTHVIA-HTMKATR-QILSDAPQLASYMNNRLIRGLKVKEDAEIL----RGTGA----- 238 (390) T ss_pred eecCCcccccc-ccceeEEEEeeEEEE-EeehhhH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHHh----hcCCC----- Confidence 66778777654 456667777665321 1222222 112223578899999999999999998875 12111 Q ss_pred cccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhccccccc Q lcl|NC_020078. 154 GTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAG 233 (339) Q Consensus 154 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~ 233 (339) +....|...... ....+........++.+.++...|...+.+ . -.+|++|..|..|.+-.. .+..|..... T Consensus 239 --~~~p~Gi~~~~~--~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~--~--~~~v~n~~~~~~L~~lkd-~~g~~l~~~~ 309 (390) T protein:vir:10 239 --NDGLLGLIPQAT--TYAAPTTIAGATRVDQLRLAMLQASLAEYP--A--SGIVINPIDWAAIELAKD-ANNQYLIGNA 309 (390) T ss_pred --Cccccccccccc--cccccccccccchHHHHHHHHHhhccccCC--C--CEEEEcHHHHHHHHHhhc-CCCceeecCC Confidence 111112111110 011111122233466778888888777662 2 246899999999875321 1112221111 Q ss_pred ceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhh-hH Q lcl|NC_020078. 234 ETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLS-KL 312 (339) Q Consensus 234 ~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~-~~ 312 (339) .++...+++|.+|+.|+.+|.+. .+-++|+. ++..+...++.++..+...+ .. T Consensus 310 ---~~~~~~~l~G~pv~~~~~~p~~~--------------~~~gdf~~---------~~~~~~~~~~~i~~~~~~~~~~~ 363 (390) T protein:vir:10 310 ---RGTLTPTLWGLPVVATQAMAPGE--------------FLVGAFDL---------AAQIFDQWDARVEIGYVNDDFQR 363 (390) T ss_pred ---cCcCCceecceeeEEcCCCCCCc--------------EEEEeccc---------eEEEEEecceEEEEeeccccccc Confidence 13344679999999999998432 12233332 23333445556666665433 23 Q ss_pred H--HHHHHHHhCCccccccceEEEEec Q lcl|NC_020078. 313 W--FIDSWLAFGVTINRTEYAGVIKLP 337 (339) Q Consensus 313 d--~i~g~~~~Ga~v~rPe~~v~i~~~ 337 (339) + .+++.+.++.++++|++.+.+.+. T Consensus 364 ~~~~~r~~~r~d~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 364 NMVTVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred CcEEEEEEEeeccEEeccccEEEEEeC Confidence 4 455667899999999999999888 No 96 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=99.24 E-value=1.1e-12 Score=86.15 Aligned_cols=300 Identities=12% Similarity=0.009 Sum_probs=151.1 Q ss_pred CccccCcccCC-CcccC---CccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc-ccceeeec Q lcl|NC_020078. 1 MSIFDGQTPSY-DVTRP---NQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI-SKAKLQKI 75 (339) Q Consensus 1 ~~~~~~~~~~~-~~~r~---~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i-G~~t~~~~ 75 (339) .-...+.++.. ..... .......+...+..+.|+.++.+..+..++++++.++..+.+| ...+++. +.+.+... T Consensus 142 ~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~a~~v 220 (458) T protein:vir:10 142 YVMEKGVFETEHGQRHLKAVNQSSSVEVSSESYETIFSQRIIRDLQKELVVGALFEELPMSSK-ILTMLVEPDAGKATWV 220 (458) T ss_pred HHHhhccchhhhhhhhhhhhhhcccCccccceehhhHhHHHHHHHHhhhhHHhhcceeecCCc-ceEEEEecCCcceeec Confidence 00111222221 00000 0011111223356689999999999999999999887766654 4455543 45555555 Q ss_pred cCCCCCCCCC-----CCCccceEEEEeehhhhhh-hHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_020078. 76 APGTTPPPST-----EPHTSKIFLKIDTVIIARN-AEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIAS 149 (339) Q Consensus 76 ~~g~~i~~~~-----~~~~~~~~l~ID~~~y~~~-~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~ 149 (339) ..++..+... .+...+++ +...++..+ .|.+-=--++.+++.+.+.++++++|++..|+.++. +.... T Consensus 221 ~e~~~~~~~~~~~~~~~~~~~i~--~~~~k~~~~v~is~ell~ds~~~~~~~i~~~l~~~i~~~~d~~~l~----G~G~~ 294 (458) T protein:vir:10 221 AASTYGTDTTTGEEVKGALKEIH--FSTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMT----GDGSG 294 (458) T ss_pred ccccccccccccccccccceeeE--eeeeeEEeeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhc----CCCCC Confidence 5555444321 12233333 333344333 221111112458899999999999999999998752 21111 Q ss_pred cccccccccccCcccccccccc--CccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhc Q lcl|NC_020078. 150 DSPYGTAAQMPGHSGGNVVTLA--GANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGE 227 (339) Q Consensus 150 ~~~~~~~~~~~g~~~~~~~~~~--~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d 227 (339) .|..-.. .++...+...... ...+..+ |+.|.++...|...+... . ..|++|..|..|.+-.. .+.. T Consensus 295 ~p~Gi~~--~~~~~~~~~~~~~~~~~~~~~~----~~~i~~~~~~l~~~~~~~---~-~~v~~~~~~~~l~~lkd-~~G~ 363 (458) T protein:vir:10 295 KPKGLLT--LASEDSAKVVTEAKADGSVLVT----AKTISKLRRKLGRHGLKL---S-KLVLIVSMDAYYDLLED-EEWQ 363 (458) T ss_pred ccceeee--cccccccceeeccccccccccc----HHHHHHHHHhhhhhhcCC---C-EEEEcHHHHHHHHhhcc-cCCc Confidence 1100000 0000111111100 0111122 456667777777665421 2 45899999998754211 1111 Q ss_pred ccc--cccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEee Q lcl|NC_020078. 228 YVT--SAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIF 305 (339) Q Consensus 228 ~~~--~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~ 305 (339) |.. ........|...+++|.+|+.++.+|...+.+. ..+|.| .++...+...+++++ T Consensus 364 ~i~~~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~~-----------------~~~~~f--~~~~~~~~~~~~~v~-- 422 (458) T protein:vir:10 364 DVAQVGNDSVKLQGQVGRIYGLPVVVSEYFPAKANSAE-----------------FAVIVY--KDNFVMPRQRAVTVE-- 422 (458) T ss_pred eeeccccccccccCcCceecceeeEEccccccccCCcc-----------------eEEEEe--cccEEEEEeeceEEE-- Confidence 111 011123345667899999999999985321111 111211 122223333444433 Q ss_pred echhhhH--HHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 306 FDDLSKL--WFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 306 ~~~~~~~--d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) +++-... -.++..+-+|-.+.+|++.|...+++| T Consensus 423 ~d~~~~~~~~~~~~~~r~~~~v~~~~a~v~~~~aa~ 458 (458) T protein:vir:10 423 RERQAGKQRDAYYVTQRVNLQRYFANGVVSGTYAAS 458 (458) T ss_pred eecccCCCceEEEEEEEecceEecccceEEEeeccC Confidence 2222111 224556678999999999999999999 No 97 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=99.24 E-value=5.6e-13 Score=87.73 Aligned_cols=289 Identities=11% Similarity=0.014 Sum_probs=163.1 Q ss_pred CccccCcc-cCC-CcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccc--cceeeecc Q lcl|NC_020078. 1 MSIFDGQT-PSY-DVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGIS--KAKLQKIA 76 (339) Q Consensus 1 ~~~~~~~~-~~~-~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG--~~t~~~~~ 76 (339) +-...++. +.. ...+......+++.-.+..++|..++.+.....+.+++++++..+. +.++++++.. ..++.-.. T Consensus 95 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~a~~v~ 173 (390) T protein:vir:81 95 WNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTD-SALIEYVQETGFVNNAAIVA 173 (390) T ss_pred HhhhhhhhhhHHHHHHHhhccccccCCcceechhhhHHHHHHHhhhhhhhhhcceeecc-CCceEEEEEecCCcceeeec Confidence 00000000 000 1111111122233333556788889999999999999998876654 4567777763 34666677 Q ss_pred CCCCCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Q lcl|NC_020078. 77 PGTTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTA 156 (339) Q Consensus 77 ~g~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~ 156 (339) .|+.++.. .+..++.++.+.+.- ....|.+ .-.+...++.+.+.++.+.++++..|+.++. +.. ++ T Consensus 174 Eg~~~~~~-~~~~~~i~~~~~k~~-~~~~is~-ell~d~~~~~~~i~~~l~~~~~~~~d~a~l~----G~g-------~~ 239 (390) T protein:vir:81 174 EGALKPES-SLKFAKKTDTTHVIA-HTMKATR-QILSDAPQLASYMNNRLIRGLKVKEDAEILR----GTG-------AN 239 (390) T ss_pred CCcccccc-cceeeEEEEeeeEEE-EeehhhH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHHHh----cCC-------CC Confidence 78887654 456666666654221 1122221 1222235788999999999999999998752 111 11 Q ss_pred ccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhccccccccee Q lcl|NC_020078. 157 AQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETL 236 (339) Q Consensus 157 ~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l 236 (339) ....|....... ............++.+.++..+|...+.. + . .+|++|..|..|.+-.. .+..|.-.. . T Consensus 240 ~~~~Gi~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~--~-~~v~~~~~~~~l~~lkd-~~G~~l~~~---~ 309 (390) T protein:vir:81 240 DGLLGLIPQATT--YAAPTTIAGATRVDQLRLAMLQASLAEYN-P--S-GIVINPIDWAAIELAKD-ANNQYLIGN---A 309 (390) T ss_pred Ccccceeecccc--cccccccccchhHHHHHHHHHhhccccCC-C--C-EEEEcHHHHHHHHHhhc-CCCceeecC---c Confidence 111121111110 11111122233456777888888776652 1 2 45889999998865321 111121111 1 Q ss_pred ecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhH-H-- Q lcl|NC_020078. 237 NTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKL-W-- 313 (339) Q Consensus 237 ~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~-d-- 313 (339) .++...+++|.+|++|+++|.+.. +-++|++ ++..+...+++++..+....|. + T Consensus 310 ~~~~~~~l~G~pv~~~~~~p~~~~--------------~~gd~~~---------~~~~~~~~~~~v~~~~~~~~~~~~~v 366 (390) T protein:vir:81 310 RGTLTPTLWGLPVVATQAMAPGEF--------------LVGAFDL---------AAQIFDQWDARVEIGYVGEDFQRNMI 366 (390) T ss_pred ccccCceecceeeEEcCCCCCCcE--------------EEEehhc---------eEEEEEecceEEEEecccchhhcCcE Confidence 234456899999999999984321 1122222 3333444566777766555443 3 Q ss_pred HHHHHHHhCCccccccceEEEEec Q lcl|NC_020078. 314 FIDSWLAFGVTINRTEYAGVIKLP 337 (339) Q Consensus 314 ~i~g~~~~Ga~v~rPe~~v~i~~~ 337 (339) .+++.+-++.++++|++.+.+.+. T Consensus 367 ~~r~~~r~d~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 367 TVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred EEEEEEeeccEEecccceEEEEeC Confidence 456888999999999999999888 No 98 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=99.23 E-value=5e-13 Score=88.02 Aligned_cols=296 Identities=11% Similarity=0.059 Sum_probs=156.8 Q ss_pred CccccCcccCC----CcccCCccCcccchhHHHH-HHHHHHHHHHHHHHhhhccccccccccccceEEEeccccce---- Q lcl|NC_020078. 1 MSIFDGQTPSY----DVTRPNQRHGAGDPLADVT-EQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISKAK---- 71 (339) Q Consensus 1 ~~~~~~~~~~~----~~~r~~~~~~~~~~~a~~i-e~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~~t---- 71 (339) ...+.+.+... ...+....+...+.....+ +.+++.+....+..+++++++++.... +++++|++....+ T Consensus 103 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 181 (419) T protein:vir:94 103 RGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNAD-YNVLEYIRDTSGTAGAG 181 (419) T ss_pred hhhhhHHHHHHHHHHhhccccccccccCCcccccchhhhHHHHHHHhhhhhhhhcceeeecc-CCceeeeeecccccccc Confidence 00111111110 0111111211122222223 667777776666677788887766543 5667777653322 Q ss_pred -----eeeccCCCCCCCCCCCCccceEEEEeehhhhh-hhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_020078. 72 -----LQKIAPGTTPPPSTEPHTSKIFLKIDTVIIAR-NAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKA 145 (339) Q Consensus 72 -----~~~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~-~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~a 145 (339) +.....|+.++.. .+...++++.+. ++.. +.|.+ .-.+...++.+.+.++.+.++++..|+.|+. + T Consensus 182 ~~~~~a~~v~Eg~~~~~~-~~~~~~i~~~~~--k~~~~~~is~-ell~d~~~l~~~i~~~la~a~~~~~d~aii~----G 253 (419) T protein:vir:94 182 STWNKAAVVPEGTAKPQS-TLSFDTITTTLK--TVAHWLPITR-QAADDNSQLMGYIQGRLTYGLRFLRDRQLLN----G 253 (419) T ss_pred ccCcccceecCCcccccc-ccceeeEEeeee--eEEEeehhhH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHHHh----c Confidence 2333445555432 344555555543 2222 22221 1112224688889999999999999998852 2 Q ss_pred cccccccccccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhh Q lcl|NC_020078. 146 AIASDSPYGTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITN 225 (339) Q Consensus 146 A~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n 225 (339) .....|. +...- .+.........+...+....++.|.++...+...+.. + -.++++|..|..|++-..-.. T Consensus 254 ~G~~~p~----Gi~~~-~~~~~~~~~~~~~~~t~~~~~~~l~~~~~~~~~~~~~-~---~~~v~n~~~~~~l~~~k~~~~ 324 (419) T protein:vir:94 254 NGSTEMQ----GILTT-PGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFP-P---DGVVVHPQDWESIELDQAPGS 324 (419) T ss_pred cCccccc----ceecc-cccccccccccccccccchhHHHHHHHHHhhhhccCC-C---CEEEEcHHHHHHHHHHhhcCC Confidence 1111110 00000 0011111112222333445677888888888766652 2 256999999999875432222 Q ss_pred hcccccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEee Q lcl|NC_020078. 226 GEYVTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIF 305 (339) Q Consensus 226 ~d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~ 305 (339) ..|--.. ...+|...+++|++|+.++.+|.+. .+-++|+ .+...+...+++++.. T Consensus 325 ~~~~~~~--~~~~~~~~~l~G~pV~~~~~~~~~~--------------~~~gd~~---------~~~~~~~~~~~~v~~~ 379 (419) T protein:vir:94 325 GVFRVIA--NVQGEATPRIWGLNVVSTVAIAQGT--------------ALVGGFR---------QGATLWSRQGITVLMT 379 (419) T ss_pred CceeecC--CcccCCCccccceeeEEcCCCCCcc--------------EEEeecc---------ceEEEEEecceEEEEe Confidence 2221111 2345667789999999999998432 1112222 2222333445566665 Q ss_pred echh-hhH---HHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 306 FDDL-SKL---WFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 306 ~~~~-~~~---d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) ++.. .|. ..+++.+.+|.++++|++.+.+++++| T Consensus 380 ~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~~aa 417 (419) T protein:vir:94 380 DSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAA 417 (419) T ss_pred ccccchhhcCcEEEEEEEeeccEEeccccEEEEEeccC Confidence 5432 332 346788899999999999999999999 No 99 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=99.22 E-value=1.6e-12 Score=85.27 Aligned_cols=285 Identities=13% Similarity=0.076 Sum_probs=156.8 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc-ccceeeeccCCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI-SKAKLQKIAPGT 79 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i-G~~t~~~~~~g~ 79 (339) |. --|- .|.......++.-.+..+++..++.+..++.+.+++++++..+. +.+.+||+. +.+.+.-...|+ T Consensus 1 ~g----~~~e---~~~~~~~~t~~~~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~-~~~~~ip~~~~~~~a~wv~Eg~ 72 (397) T protein:vir:23 1 MG----FSAD---HSQIAQTKDTMFTGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMG-ATGIVIPHWTGDVSAQWIGEGD 72 (397) T ss_pred CC----cCHH---HHHHhhccCCCCccccchhHHHHHHHHHHhccchhhhcceeecc-CCceEEEEEcCCcceEEecCCc Confidence 22 1111 11111111112222344677888888888999999998877755 456788876 456667677777 Q ss_pred CCCCCCCCCccceEEEEeehhhhh-hhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccc Q lcl|NC_020078. 80 TPPPSTEPHTSKIFLKIDTVIIAR-NAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQ 158 (339) Q Consensus 80 ~i~~~~~~~~~~~~l~ID~~~y~~-~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~ 158 (339) .++.. .+...+.++..- ++.. ..|.+-=-.++.+|+.+.+.++.+++++++.|+.++. +.. .+.. T Consensus 73 ~~~~s-~~~f~~v~l~~~--k~~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~----G~g-------t~~~ 138 (397) T protein:vir:23 73 MKPIT-KGNMTKRDVHPA--KIATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAALH----GTN-------APSA 138 (397) T ss_pred ccccc-ccceeEEEEeeE--EEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhh----ccc-------CCcc Confidence 77654 456666666553 3333 3333211124568999999999999999999998852 110 0000 Q ss_pred ccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhccc--c-cccce Q lcl|NC_020078. 159 MPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYV--T-SAGET 235 (339) Q Consensus 159 ~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~--~-~~~~~ 235 (339) ..+... .............++.+.++..+|.+..-. .-..+++|..|..|.+-..- +..|. . ..... T Consensus 139 ~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~----~a~~vmn~~~~~~L~~lkd~-~G~~i~~~~~~~~~ 208 (397) T protein:vir:23 139 FQGYLD-----QSNKTQSISPNAYQGLGVSGLTKLVTDGKK----WTHTLLDDTVEPVLNGSVDA-NGRPLFVESTYESL 208 (397) T ss_pred cccccc-----cccceeeecccchhHHHHHHHHhhhhcccC----CCEEEEcHHHHHHHHHhhcc-CCceeecccccccc Confidence 111111 111111112222344556666667665442 22459999999999763211 11111 0 00111 Q ss_pred eecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechh------ Q lcl|NC_020078. 236 LNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDL------ 309 (339) Q Consensus 236 l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~------ 309 (339) ...+..++++|.+|+.++++|.+. ...+-++|++.. .+...++..|+.++.. T Consensus 209 ~~~~~~~tl~G~Pv~~s~~~~~g~------------~~~~~gDfs~~~----------i~~~~~i~i~~~~e~~~~~~~~ 266 (397) T protein:vir:23 209 TTPFREGRILGRPTILSDHVAEGD------------VVGYAGDFSQII----------WGQVGGLSFDVTDQATLNLGSQ 266 (397) T ss_pred cccccCceeeeeeEEEeCCCCCCc------------eEEEEeecceEE----------EEEEeceEEEEeeeeeeeeccc Confidence 112344679999999999998421 112333444321 2222233334333221 Q ss_pred --------hhH--HHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 310 --------SKL--WFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 310 --------~~~--d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) ... -.++..+.++.++++|++.+.++.... T Consensus 267 ~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~~ 306 (397) T protein:vir:23 267 ESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDPV 306 (397) T ss_pred cccceeeeeeccceeEEEEeeeccceecccceEEEeeccc Confidence 111 245677889999999999999998776 No 100 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=99.22 E-value=9.4e-13 Score=86.52 Aligned_cols=285 Identities=9% Similarity=0.019 Sum_probs=158.0 Q ss_pred CccccCcccCCCcccCCcc--CcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccc--cceEEEeccc-cceeeec Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQR--HGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRG--TSTISNRGIS-KAKLQKI 75 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~--~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~--G~tv~i~~iG-~~t~~~~ 75 (339) ..-|..-............ ..+++--.+.-+.|+.++.+..+..+.+++++++..+.+ |+....+... ...+... T Consensus 91 ~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v 170 (397) T protein:vir:48 91 VKDFKNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLD 170 (397) T ss_pred HHHHHHHHhhhhhHHHHHhhccCCccccccccHHHHHHHHHHHHHHHHHHhhhceeeccCCcceEEEEeecCCCcceeee Confidence 0000000000000001111 111111224559999999999999999999998887764 3333333332 2345556 Q ss_pred cCCCCCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Q lcl|NC_020078. 76 APGTTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGT 155 (339) Q Consensus 76 ~~g~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~ 155 (339) ..|+.+.....+...+.++...+. +....|.+-=--++.+|+.+.+.++.++++++..|+.|+.- T Consensus 171 ~E~~~~~~~~~~~~~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~v~~~l~~~~~~~~d~~il~G-------------- 235 (397) T protein:vir:48 171 DEAGSIGTNDDPKLYPIRYAIKRY-AGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEA-------------- 235 (397) T ss_pred ccccccccccccceeeEEeeheee-eeehhhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhc-------------- Confidence 667766543344556666666422 22223332111235789999999999999999999988521 Q ss_pred cccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccce Q lcl|NC_020078. 156 AAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGET 235 (339) Q Consensus 156 ~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~ 235 (339) .+.+. ..+..++ ++.|.++..+|..... +. . .++++|..|..|.+-..-. ..|.-. .. T Consensus 236 ------~g~~~-----~~~~~~~----~d~i~~~~~~l~~~~~--~~-a-~~v~n~~~~~~L~~lkd~~-G~~i~~--~~ 293 (397) T protein:vir:48 236 ------IATLP-----TKPTLTK----WDDIIDLQAKVDPAIK--QT-S-FFLTNTSGFTALKKVKNAF-GDYLME--RD 293 (397) T ss_pred ------ccccc-----ccccccc----HHHHHHHHHHhhhhhc--CC-C-EEEECHHHHHHHHHhhcCC-Cceeec--cC Confidence 11111 1111222 3456666667766544 21 2 5589999999996532111 112111 12 Q ss_pred eecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEec-cceeEEEEEeeeeEEeeech-hhhH- Q lcl|NC_020078. 236 LNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFS-PKALLAGSTIPVTSKIFFDD-LSKL- 312 (339) Q Consensus 236 l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h-~~A~~~~~~~~~~~e~~~~~-~~~~- 312 (339) +.+|.-..++|++|+.+.+.+.... ..+.. .-++.. +.++..+....+.++..+.. ++|. T Consensus 294 ~~~~~~~~l~G~PV~~~~~~~~~~~-------~~~~~----------~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~ 356 (397) T protein:vir:48 294 VKSPTGYSIDGFAVKEVADRWLANA-------SSGAM----------PLYFGDLKQAVTLFDRQQMSLLSTNIGGGAFET 356 (397) T ss_pred cCCCCCceeccceeEEecccccCCc-------CCCce----------EEEEEeccceEEEEeecceEEEEeccchhhhhc Confidence 4466677899999998654322111 01111 112222 33555555566666766544 2333 Q ss_pred --HHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 313 --WFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 313 --d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) ..+++.+.++.++++|++.+.+.++++ T Consensus 357 ~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 385 (397) T protein:vir:48 357 DTTKIRVIDRFDVVATDTESFVPASFKAI 385 (397) T ss_pred CceeEEEEeeeccEEecccceEEEEeccc Confidence 356788889999999999999998888 No 101 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=99.22 E-value=7.7e-13 Score=86.98 Aligned_cols=289 Identities=11% Similarity=0.055 Sum_probs=157.1 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccc--cceeeeccCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGIS--KAKLQKIAPG 78 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG--~~t~~~~~~g 78 (339) +..-.......+..+.. ..+.++...+..+.|+.++.......+.++++++...+. |.++.+++.. ..++.....| T Consensus 120 ~~~~~~~~~~~~~~~~~-~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~a~~v~E~ 197 (418) T protein:vir:10 120 VRVRVDRKSIMNVPATV-GSGVSGSNSLVVADRQAGIIAPPQRKMTIRDLLMPGQTS-SSSIEYTVETGFTNNAAAVAEG 197 (418) T ss_pred hhhhhHHHHHHHhhhhc-cCCCCCCccccchhHHHHHHHHHhhhhhHHhhcceeecc-CCceeEEEEecCCCceeeeccC Confidence 00000000001111111 111122223566999999999999999999998877765 5567777753 3466666677 Q ss_pred CCCCCCCCCCccceEEEEeehhhhhh-hHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Q lcl|NC_020078. 79 TTPPPSTEPHTSKIFLKIDTVIIARN-AEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAA 157 (339) Q Consensus 79 ~~i~~~~~~~~~~~~l~ID~~~y~~~-~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~ 157 (339) +.++.. .++.+++++... ++..+ .|.+ +-.+...++.+.+.++.+.++++..|+.++. +... +. T Consensus 198 ~~~~~~-~~~f~~v~~~~~--k~~~~~~is~-ell~ds~~l~~~i~~~l~~a~~~~~d~a~l~----G~g~-------~~ 262 (418) T protein:vir:10 198 AQKPTS-DLKFNLKNQPVR--TIAHLFKASR-QILDDAPALQSYIDGRARYGLQLTEEGQILK----GDGT-------GA 262 (418) T ss_pred cccccc-ccceeeEEEeee--eEEEeehhhH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHHhc----cCCC-------Cc Confidence 776643 355666655554 32222 1211 1222335788999999999999999998752 1111 11 Q ss_pred cccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceee Q lcl|NC_020078. 158 QMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLN 237 (339) Q Consensus 158 ~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~ 237 (339) ...|........ ..+...+....++.|+++...+...+.. + . .+|++|..|..|.+-.. .+..|... ... T Consensus 263 ~p~Gi~~~~~~~--~~~~~~~~~~~~~~i~~~~~~~~~~~~~-~--~-~~v~n~~~~~~L~~lkd-~~G~~i~~---~~~ 332 (418) T protein:vir:10 263 NILGILPQASAF--MPSITLANATPIDKIRLALLQAVLAEFP-A--T-GIVLNPIDWASIELTKD-SQGRYIVG---NPV 332 (418) T ss_pred cccccccccccc--cccccccccccHHHHHHHHHhhccccCC-C--C-EEEEcHHHHHHHHHhhc-CCCceecc---ccc Confidence 111211111100 0011111122345566666666555442 1 2 35789999998865221 11122221 123 Q ss_pred cceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechh-hhHH--- Q lcl|NC_020078. 238 TKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDL-SKLW--- 313 (339) Q Consensus 238 ~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~-~~~d--- 313 (339) +|..++++|++|+.|+++|.+.. +-++|+. ++..+...++++++.++.. .|.. T Consensus 333 ~~~~~~l~G~pV~~~~~~p~~~~--------------~~gd~s~---------~~~~~~~~~~~i~~~~~~~~~f~~~~~ 389 (418) T protein:vir:10 333 NGTTPRLWNLPVVETQAMTANEF--------------LVGAFSM---------AAQIFDRMEIEVLLSTENVDDFEKNMV 389 (418) T ss_pred cCCCceecceeeEEcCCCCCCcE--------------EEeeccc---------eEEEEEecceEEEEecccchhhhcCce Confidence 45667899999999999984321 1122222 2222333445555554432 2322 Q ss_pred HHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 314 FIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 314 ~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .+++.+-++.++++|++.+.+.++++ T Consensus 390 ~~r~~~~~d~~~~~~~a~~~~~~~~~ 415 (418) T protein:vir:10 390 SIRAEERLALAVYRPESFVTGALVEQ 415 (418) T ss_pred EEEEEEeeccEEecccceEEEEeccC Confidence 45566779999999999999999888 No 102 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=99.22 E-value=1.2e-12 Score=86.02 Aligned_cols=300 Identities=12% Similarity=0.059 Sum_probs=153.3 Q ss_pred CccccCcccC---CCcccC--CccCcccchh--HHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEec-ccccee Q lcl|NC_020078. 1 MSIFDGQTPS---YDVTRP--NQRHGAGDPL--ADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRG-ISKAKL 72 (339) Q Consensus 1 ~~~~~~~~~~---~~~~r~--~~~~~~~~~~--a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~-iG~~t~ 72 (339) ..-|..-+.+ ..++.. -.-+.+++.+ .+.-+.|+.++.+..+..+++++++++.+..++ +..+++ .+.+++ T Consensus 83 ~~a~~~~l~~g~~~~~~~~e~~a~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~a 161 (407) T protein:vir:48 83 KEAFIGFMRKGREDGLRELERKALQVGNDEDGGYAIPEELDRTILTLLKDEVVMRQEATVITLGGS-DYKKLVNLGGTTS 161 (407) T ss_pred HHHHHHHHhccchhhhhHHHHHhhhcccCCCCcccccHhHHHHHHHHHHhhhhhhhhceeeecCCC-ceEEEEecCCcce Confidence 0000000000 000000 0000011111 134489999999999999999999887776655 455544 466676 Q ss_pred eeccCCCCCCCCCCCCccceEEEEeehhhhhhh-HHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Q lcl|NC_020078. 73 QKIAPGTTPPPSTEPHTSKIFLKIDTVIIARNA-EPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDS 151 (339) Q Consensus 73 ~~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~~-vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~ 151 (339) .-...|+..+....+...+.++.+- ++..+. |.+-=-.++.+|+.+.+.++.++++++..|+.++. +.....| T Consensus 162 ~~v~E~~~~~~~~~~~f~~i~~~~~--k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~~~a~l~----G~G~~~p 235 (407) T protein:vir:48 162 GWVGETDARPETATSKLGLIEPFMG--EIYGNPQATQKMLDDAFFNVEDWINSELALEFAEQEEIAFTS----GDGSKKP 235 (407) T ss_pred eeecccccccccccccceeEEeeee--eeEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhc----cCCCCcc Confidence 6666667665443344455555443 443322 21111113567999999999999999999987641 1111111 Q ss_pred cc-------cccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchh Q lcl|NC_020078. 152 PY-------GTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHIT 224 (339) Q Consensus 152 ~~-------~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~ 224 (339) .. .........+....+. .+.....+ ++.|.++...|..... + ... .|++|..|..|.+-..- T Consensus 236 ~Gil~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~----~d~i~~l~~~l~~~~~--~-~a~-~v~n~~~~~~L~~lkD~- 305 (407) T protein:vir:48 236 KGFLAYESTDEDDKTRAFGKLQHIA-SGAASGVT----ADAIIKLIYTLRKAHR--S-GAK-FMMNNSSLFAIRLLKDN- 305 (407) T ss_pred ceeeecccccccccccccccccccc-cccccccC----hHHHHHHHHhhchhhh--c-CCE-EEEcHHHHHHHHHhhcc- Confidence 00 0000000000000000 11111122 4556666666665543 2 123 47999999998542111 Q ss_pred hhcccccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEe Q lcl|NC_020078. 225 NGEYVTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKI 304 (339) Q Consensus 225 n~d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~ 304 (339) +..|.-. ..+.+|...+++|.+|+.++++|...... ..-+-++|+. ++..+.-+.++ + T Consensus 306 ~Gr~l~~--~~~~~g~~~~l~G~PV~~~~~~p~~~~~~---------~~i~~Gd~~~---------~~~i~~~~~~~--i 363 (407) T protein:vir:48 306 DGNYLWR--PGIELGQPSSLAGYGIVENEQMPDIAADA---------KAIAFGNFKR---------GYTIVDRIGTR--I 363 (407) T ss_pred CCceeec--cCcCCCCCceecceeeEEecCcCCccCCc---------cEEEEEeccc---------cEEEEEeeceE--E Confidence 1112111 12345677789999999999998532111 1111122222 22223223333 3 Q ss_pred eechh--hhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 305 FFDDL--SKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 305 ~~~~~--~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .+++- +-.-.+++.+.+|.++++|++.+.|+.++| T Consensus 364 ~~d~~~~~~~~~~~~~~r~d~~v~~~~a~~~l~~~aa 400 (407) T protein:vir:48 364 LRDPYTNKPFVGFYTTKRTGGMLVDSQAIKLMKIGAA 400 (407) T ss_pred EeeccccCCcEEEEEEEEeccEEecccceEEEEeecc Confidence 33321 111235677789999999999999999999 No 103 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=99.22 E-value=3.2e-12 Score=83.59 Aligned_cols=283 Identities=11% Similarity=0.059 Sum_probs=156.5 Q ss_pred Cccc-----cCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhcccccccccccc-ceEEEeccc-cceee Q lcl|NC_020078. 1 MSIF-----DGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGT-STISNRGIS-KAKLQ 73 (339) Q Consensus 1 ~~~~-----~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G-~tv~i~~iG-~~t~~ 73 (339) +.-| .|.-.+.+... ..+--.+.-+.|+.++.+..+..+++++++++..+.++ -++.++..+ .+.+. T Consensus 76 ~~~~~~~l~~~~~~a~~~~t------~~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~~~~~~~a~ 149 (371) T protein:vir:81 76 VEAFVNHIRTRFRNAMSEGS------NQDGGYTVPQDIQTRINELRESKDALQNLITVEPVTTLSGSRVFKKRSQQTGFV 149 (371) T ss_pred HHHHHHHHHHHHHHhhccCC------CccCceeecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccee Confidence 0000 00000011000 01111134488999999999999999999988877643 344455544 46777 Q ss_pred eccCCCCCCCCCCCCccceEEEEeehhhhhh-hHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Q lcl|NC_020078. 74 KIAPGTTPPPSTEPHTSKIFLKIDTVIIARN-AEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSP 152 (339) Q Consensus 74 ~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~-~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~ 152 (339) ....|+.+.....+...+.++... ++... .|.+-=--++.+|+.+.+.++.++++++..|+.++.-. T Consensus 150 ~v~Eg~~~~~~~~~~f~~i~~~~~--k~~~~~~iS~ell~ds~~~l~~~i~~~l~~a~~~~~~~~i~~g~---------- 217 (371) T protein:vir:81 150 EVAEGAAIGEKATPQFTLLQYQVK--KYAGFFRVTNELLNDSTEAIVNTLVRWIGDESRVTRNGLIINVL---------- 217 (371) T ss_pred eeccccccccccccceeeEEeeee--EEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhc---------- Confidence 788888775433345566666554 33332 22221111245789999999999999999998875211 Q ss_pred ccccccccCccccccccccCccccccHHHHHHHHHHHH-HHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhccccc Q lcl|NC_020078. 153 YGTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLV-EKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTS 231 (339) Q Consensus 153 ~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~-~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~ 231 (339) +.+ .++...+.+. +..+. ..|....- + +=..+++|..|..|.+-..- +..|--. T Consensus 218 ----------g~~------~~~~~~~~~~----i~~~~~~~l~~~~~--~--~a~~vmn~~~~~~L~~lkd~-~g~~l~~ 272 (371) T protein:vir:81 218 ----------NTK------AKTAIADLDG----LKQIINVQLDPVFR--S--TSSVIVNQDAFNWLDTLKDQ-NGQYLLQ 272 (371) T ss_pred ----------ccc------cccccccHHH----HHHHHHhhcchhhh--c--CCEEEEcHHHHHHHHHhhcc-CCCeeee Confidence 000 0111122222 33322 23333221 1 22568999999998653211 1122111 Q ss_pred ccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeech-hh Q lcl|NC_020078. 232 AGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDD-LS 310 (339) Q Consensus 232 ~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~-~~ 310 (339) . .+..|..++++|.+|+.++++|.+........+ +...-+-++| ++++..+....++++..+.. +. T Consensus 273 ~--~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~--~~~~i~~Gd~---------~~~~~~~~~~~~~i~~~~~~~~~ 339 (371) T protein:vir:81 273 P--SISSPTGRQLLGLPVVIVSNKVLANRVDGGTGA--QFAPIIVGDL---------KEAVVMFDRQRTEIMSSNVAMDA 339 (371) T ss_pred c--ccCCCCCceecceeEEEecccccCccccccccC--CcceEEEEeh---------hceEEEEeecceEEEEeccccch Confidence 1 234566788999999999999854322211111 1111122222 22344444455555554443 23 Q ss_pred h---HHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 311 K---LWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 311 ~---~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) | .-.+++.+.+|.++++|++.+.+++++| T Consensus 340 f~~~~v~~~~~~r~d~~~~~~~a~~~~~~~~A 371 (371) T protein:vir:81 340 FETDATLWRAIERMDVKMRDDEAFVFGEVQLA 371 (371) T ss_pred hhcCceEEEEEEeeccEEecccceEEEEEecC Confidence 3 2357788889999999999999999999 No 104 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=99.22 E-value=6.8e-13 Score=87.30 Aligned_cols=294 Identities=12% Similarity=0.034 Sum_probs=153.5 Q ss_pred Cc-cccCccc--C-----CCcccCCccCcccchh--HHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccccc Q lcl|NC_020078. 1 MS-IFDGQTP--S-----YDVTRPNQRHGAGDPL--ADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISKA 70 (339) Q Consensus 1 ~~-~~~~~~~--~-----~~~~r~~~~~~~~~~~--a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~~ 70 (339) +. +..|.-+ . +.-.|. .+.+++.+ .+..++|+.++.+..+..+.+++++++.++.++..+.++..+.. T Consensus 93 ~~~l~~~~~~~~~~e~~~~~~~~a--~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 170 (409) T protein:vir:45 93 DKWMRHGASELTSEERKALRELRA--QGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQILTTSDGRTMEWATADGT 170 (409) T ss_pred HHHHHhhhhhccHHHHHHHHHHhh--ccCccCcCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEeeccC Confidence 00 0001000 0 000010 11111111 13448999999999999999999999888888888888877543 Q ss_pred -e-eeeccCCCCCCCCCCCCccceEEEEeehhhhh--hhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_020078. 71 -K-LQKIAPGTTPPPSTEPHTSKIFLKIDTVIIAR--NAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAA 146 (339) Q Consensus 71 -t-~~~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~--~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA 146 (339) . ......|+.++. ..++..+.++ ...++.. +.|.+-=--++.+|+.+.+.++.++++++..|+.|+. +. T Consensus 171 ~~~~~~v~E~~~~~~-~~~~f~~~~l--~~~k~~~~~i~is~ell~ds~~~l~~~i~~~la~a~~~~~~~a~l~----G~ 243 (409) T protein:vir:45 171 SEVGVLLGENEEAGE-EDTDFGMGSL--GALKMTSKIIRVSNELLQDSAIDMEAYLARRIAERIGRGEARYLIQ----GT 243 (409) T ss_pred ccccccccccccccc-cccccceeee--eeeeeeeeehhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhc----cC Confidence 2 334444555543 3344554444 3334322 2232222223568999999999999999999998751 11 Q ss_pred ccccccccccccccCcccccc-ccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeE-EEECHHHHHHHhc--ccc Q lcl|NC_020078. 147 IASDSPYGTAAQMPGHSGGNV-VTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMI-LVLPPAAFTALMQ--AEH 222 (339) Q Consensus 147 ~~~~~~~~~~~~~~g~~~~~~-~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~-~vv~P~~~~~Ll~--~~~ 222 (339) ... ......|...... ......+...+. +.|.++...|..... ....| ++++|..|..|.+ |.. T Consensus 244 G~~-----~~~~p~Gil~~~~~~~~~~~~~~~~~----d~i~~l~~~l~~~~~---~~a~~~~~~n~~~~~~l~~lkd~~ 311 (409) T protein:vir:45 244 GAG-----TPKQPKGLAASVTGTTQTAAANAVKW----QEILALKHSIDPAYR---RGPKFRLAFNDNTLKLISEMEDGQ 311 (409) T ss_pred CCC-----Cccccceeeeccccccccccccccch----HHHHHHHHhhhhhhc---cCCeEEEEECHHHHHHHHHhhcCC Confidence 100 0001111111100 111112222333 445566666654432 12346 4679999988743 321 Q ss_pred hhhhcccccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeE Q lcl|NC_020078. 223 ITNGEYVTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTS 302 (339) Q Consensus 223 ~~n~d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~ 302 (339) ..|.-.. .+.+|...+++|.+|+.++++|...... ..-+-++|++. ++. ....+.. T Consensus 312 ---G~~i~~~--~~~~~~~~~l~G~PV~~~~~~p~~~~~~---------~~i~~Gd~~~~--~i~--------~~~~~~~ 367 (409) T protein:vir:45 312 ---GRPLWLP--DIVGVAPASVLNVPYVIDQEIDDIGAGK---------KFMFCGDFDRF--IIR--------RVRYMIL 367 (409) T ss_pred ---Cceeecc--CcCCCCCceecceeeEEecCcCCccCCc---------cEEEEeehhhh--hee--------eccceEE Confidence 1121111 1334556789999999999998432111 00111233321 111 2222333 Q ss_pred EeeechhhhHH--HHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 303 KIFFDDLSKLW--FIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 303 e~~~~~~~~~d--~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) +...++...-+ .+++.+-||.++++|++++.++..+| T Consensus 368 ~~~~d~~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~k~s 406 (409) T protein:vir:45 368 KRLVERYAEYDQTGFLAFHRFDCILEDTSAIKALVGKGS 406 (409) T ss_pred EEeecccccCCcEEEEEEEEeccEeechhheEEEEeccC Confidence 44443321112 37788899999999999999999777 No 105 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=99.22 E-value=9e-13 Score=86.63 Aligned_cols=281 Identities=10% Similarity=0.010 Sum_probs=162.0 Q ss_pred CccccCc-ccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc-ccceeeeccCC Q lcl|NC_020078. 1 MSIFDGQ-TPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI-SKAKLQKIAPG 78 (339) Q Consensus 1 ~~~~~~~-~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i-G~~t~~~~~~g 78 (339) -....|+ |-..+. ....+...+.-+.|+.++.+..++.+++++++++..+. |.+++||+. +.+.+.-...| T Consensus 16 ~~~~~~~~~~a~~~------~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~~p~~~~~~~a~~v~Eg 88 (324) T protein:vir:99 16 SNNVKPQVFNPDNV------MMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYEPME-GTEKKFTFWADKPGAYWVGEG 88 (324) T ss_pred HHhhhhhhccccce------eccCCCcceechhHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEecCcceeEeccC Confidence 1111221 111111 11112223556899999999999999999998877755 456888887 55677777888 Q ss_pred CCCCCCCCCCccceEEEEeehhhhh-hhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Q lcl|NC_020078. 79 TTPPPSTEPHTSKIFLKIDTVIIAR-NAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAA 157 (339) Q Consensus 79 ~~i~~~~~~~~~~~~l~ID~~~y~~-~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~ 157 (339) +.++.. .+..++.++..- ++.. ..|.+-=-.++.+|+.+.+.++.++++++..|+.++. +... +. T Consensus 89 ~~~~~~-~~~~~~v~~~~~--k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~----G~g~-------~~ 154 (324) T protein:vir:99 89 QKIETS-KATWVNATMRAF--KLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQGN-------NP 154 (324) T ss_pred cccccc-ccceeEEEEeeE--EEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhh----cCCC-------Cc Confidence 888754 456666666553 3332 2333211113458899999999999999999998752 1111 00 Q ss_pred cccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceee Q lcl|NC_020078. 158 QMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLN 237 (339) Q Consensus 158 ~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~ 237 (339) ...|..... ...+........++.|.++...|..++... . .++++|..|..|.+- .+. .+ ...+. T Consensus 155 ~~~~~~~~~----~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~---~-~~v~n~~~~~~L~~l---~d~--~g--~~~~~ 219 (324) T protein:vir:99 155 FGKSIAQSI----EKTNKVIKGDFTQDNIIDLEALLEDDELEA---N-AFISKTQNRSLLRKI---VDP--ET--KERIY 219 (324) T ss_pred cCccccccc----cccceeccccCCHHHHHHHHHhhhhccCCC---C-EEEEcHHHHHHHHHh---hcC--CC--ceeec Confidence 011111110 111111111223566777888887766532 2 458999999998752 222 11 11233 Q ss_pred cceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechh-------- Q lcl|NC_020078. 238 TKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDL-------- 309 (339) Q Consensus 238 ~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~-------- 309 (339) .+.-.+++|.+|+.++..+.+. ...+-++|+ -+..+...+++.|..++.. T Consensus 220 ~~~~~~l~G~PVv~~~~~~~~~------------~~~i~gd~~----------~~~~~~~~~~~i~~~~~~~~~~~~~~~ 277 (324) T protein:vir:99 220 DRNSDTLDGLPVVNLKSSNLKR------------GELITGDFD----------KLIYGIPQLIEYKIDETAQLSTVKNED 277 (324) T ss_pred CCCCccccceeEEeecCCCCCc------------ceEEEEecc----------cEEEEEecCcEEEEeeccccccccccc Confidence 4445679999999988765321 111222222 2223344455566655431 Q ss_pred -----hh---HHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 310 -----SK---LWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 310 -----~~---~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .| .-.+++.+-+|.+++||++.+.|+...+ T Consensus 278 ~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a~~ 315 (324) T protein:vir:99 278 GTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred ccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccC Confidence 11 1335667789999999999999987666 No 106 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=99.21 E-value=1.5e-12 Score=85.33 Aligned_cols=286 Identities=11% Similarity=0.035 Sum_probs=158.8 Q ss_pred Ccccchh-HHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc-ccceeeeccCCCCCCCCCCCCccceEEEEe Q lcl|NC_020078. 20 HGAGDPL-ADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI-SKAKLQKIAPGTTPPPSTEPHTSKIFLKID 97 (339) Q Consensus 20 ~~~~~~~-a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i-G~~t~~~~~~g~~i~~~~~~~~~~~~l~ID 97 (339) .+..+.- .+.-++++.++.+..+..|+++.+.++..+.+ .+++||+. +.+.+.....|+.++.. .+..++.++..- T Consensus 1 m~t~t~gg~liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~-~~~~ip~~~~~~~a~wv~E~~~~~~s-~~~f~~v~l~~~ 78 (303) T protein:vir:97 1 MGTETSKASLFDKHLVSDLINKVKGHSSLAKLSSQKPIPF-NGSKEFTFTLDSDIDVVAENGKKTHG-GLSLEPVTIVPI 78 (303) T ss_pred CcccCCCCeEcchhHHHHHHHHHHhhchhhhhcceeecCC-CceEEEEEecCcceEEeecCcccccc-ccceeeEEeeeE Confidence 2221111 13448999999999999999999988776654 45678775 66788888888888754 455556666542 Q ss_pred ehhhhhhhHHHHHHH-----hcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCccccccccccC Q lcl|NC_020078. 98 TVIIARNAEPMLDEF-----QTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAG 172 (339) Q Consensus 98 ~~~y~~~~vdd~D~~-----q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~ 172 (339) ..+....|- ++. -...++.+.+.++.+++|++..|+.++.-. .+....+....+...-.. ..+. T Consensus 79 -kl~~~~~iS--~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~-------~~~~g~~~~~~~~~~~~~-~~~~ 147 (303) T protein:vir:97 79 -KVEYGARLS--DEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGI-------NPRTKKASDVIGTNHFDS-KVTQ 147 (303) T ss_pred -EEEEeehhh--HHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhccc-------ccCCcccccccccccccc-cccc Confidence 222222222 222 134678899999999999999999886321 111111111111110000 0011 Q ss_pred ccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceeecceeEEEeceEEEEe Q lcl|NC_020078. 173 ANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLNTKYMFAAFGVPVITS 252 (339) Q Consensus 173 ~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~~G~v~~i~G~~V~~S 252 (339) ....++....++.|.++..++...+.. | ..++++|..+..|.+-..-.. .|--.. ..-..+..++++|.+|+.| T Consensus 148 ~~~~~~~~~~~~~i~~~~~~~~~~~~~-~---~~~vmn~~~~~~L~~lkd~~g-~~~~~~-~~~~~~~~~~l~G~Pv~~s 221 (303) T protein:vir:97 148 VVKFTESEDADANIEAAVNLIQGAEGV-V---TGLAMDTEFSTALAKVTNGEM-GPKMYP-ELAWGANPDSINGLKSSVN 221 (303) T ss_pred ccccccccchHHHHHHHHHHHhhcCCC-c---cEEEEcHHHHHHHHHhhccCC-CeEEec-CccCCCCCceecceeeEEe Confidence 111222334567778888877666653 2 246889999999865211111 111000 0011245568999999999 Q ss_pred ccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeee--chh-----hhH---HHHHHHHHhC Q lcl|NC_020078. 253 NNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFF--DDL-----SKL---WFIDSWLAFG 322 (339) Q Consensus 253 nnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~--~~~-----~~~---d~i~g~~~~G 322 (339) +++|...... .+....|-++|... +......+++.|+.. +++ .|. -.+++...++ T Consensus 222 ~~v~~~~~~~------~~~~~~~~Gdf~~~---------~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~~~r~~~r~~ 286 (303) T protein:vir:97 222 TTVGAGADEA------ESKDLVIIGDFESM---------FKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQIYLRAEAYIG 286 (303) T ss_pred cccCCccccC------CCccEEEEeecccc---------EEEEEecCcEEEEeeccCCCCcchhhhhcCcEEEEEEEEec Confidence 9998432111 11122233444332 222222333444432 111 121 1467778899 Q ss_pred CccccccceEEEEecCC Q lcl|NC_020078. 323 VTINRTEYAGVIKLPAA 339 (339) Q Consensus 323 a~v~rPe~~v~i~~~~a 339 (339) .++++|++.+.|+-.-= T Consensus 287 ~~v~~p~af~~l~~~~~ 303 (303) T protein:vir:97 287 WGILDAKSFARVTKGEV 303 (303) T ss_pred cEeecccceEEeeCCCC Confidence 99999999998865444 No 107 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=99.21 E-value=1.7e-12 Score=85.13 Aligned_cols=284 Identities=13% Similarity=0.027 Sum_probs=157.9 Q ss_pred ccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc-ccceeeeccCCCCCCCCCCCCccceEEEE Q lcl|NC_020078. 18 QRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI-SKAKLQKIAPGTTPPPSTEPHTSKIFLKI 96 (339) Q Consensus 18 ~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i-G~~t~~~~~~g~~i~~~~~~~~~~~~l~I 96 (339) ....+.+.-.+..++++.++.+..+..|+++.+.++..+.+| .+.||++ +.+.+.-...|+.++.. .+..++.++.. T Consensus 1 ma~~t~~~G~lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~-~~~~p~~~~~~~a~wv~Eg~~~~~s-~~~f~~v~l~~ 78 (300) T protein:vir:95 1 MSEAQLSKGNLFNPELVTKVINKVKGHSSIAKLSPQKPIPFN-GQREFVFDFDSDIDIVAENGKKTHG-GVSLDPVTIVP 78 (300) T ss_pred CcccccCCcceechhhHHHHHHHHHhhhhhhhhcceeeccCC-ceEEEEEecCcceEEeeCCcccccc-cccceeeEeee Confidence 222222222245588999999999999999998877766554 4567764 56777777778877654 35666666654 Q ss_pred eehhhhhhhHHHHHHHh-----cCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCcccccccccc Q lcl|NC_020078. 97 DTVIIARNAEPMLDEFQ-----TDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLA 171 (339) Q Consensus 97 D~~~y~~~~vdd~D~~q-----~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~ 171 (339) - ..+.-..|- ++.+ ...++.+.+.++.++++++..|+.++.-. ++....+....+... ..... T Consensus 79 ~-k~~~~~~iS--~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~-------~~~~g~~~~~~~~~~--~~~~~ 146 (300) T protein:vir:95 79 L-KVEYGARVS--DEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGI-------NPRTKQASTIIGDNC--FDKKV 146 (300) T ss_pred E-EEEEeehhh--HHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcc-------cCCCCCCcccccccc--ccccc Confidence 2 122222222 2222 24788999999999999999999886211 011111111111000 00000 Q ss_pred CccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceeecceeEEEeceEEEE Q lcl|NC_020078. 172 GANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLNTKYMFAAFGVPVIT 251 (339) Q Consensus 172 ~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~~G~v~~i~G~~V~~ 251 (339) ......+....++.|.++..++...+.. | . ..+++|..+..|.+-..- +..|.-.. ...+|..++++|.+|+. T Consensus 147 ~~~~~~~~~~~~~~i~~~~~~~~~~~~~-~--~-~~vmn~~~~~~L~~lkd~-~G~~i~~~--~~~~~~~~~l~G~Pv~~ 219 (300) T protein:vir:95 147 TQTVPFKDTNPDESMEDAVGMIDGSERD-I--T-GAILDPIFTTALSKMKNA-EGGKLYPE--LAWGGVPDAINGLAVDK 219 (300) T ss_pred ceeecccccchHHHHHHHHHHhhhcCCC-c--c-EEEECHHHHHHHHHhhcc-CCCeeccC--ccccCCCceecceeeEE Confidence 0011112234456777888888766653 2 2 458999999998653211 11111111 12345678899999999 Q ss_pred eccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeee--chh-----hhH---HHHHHHHHh Q lcl|NC_020078. 252 SNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFF--DDL-----SKL---WFIDSWLAF 321 (339) Q Consensus 252 Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~--~~~-----~~~---d~i~g~~~~ 321 (339) |+.+|.....+. ...+-++|++.+- ++.- .+++.++.. +++ .|. -.+++.+.+ T Consensus 220 s~~v~~~~~~~~--------~~~~~GDf~~~~~-------~~~~--~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~ 282 (300) T protein:vir:95 220 NRTVSYSQTDPK--------NTAIVGDFETMFK-------WGYA--KEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYI 282 (300) T ss_pred ecCCCCCCCCCc--------cEEEEeeccceEE-------EEEe--cccEEEEeeccCCCCcchhhhhcCcEEEEEEEee Confidence 999985322111 1123344443221 1111 222333322 211 121 335677789 Q ss_pred CCccccccceEEEEecCC Q lcl|NC_020078. 322 GVTINRTEYAGVIKLPAA 339 (339) Q Consensus 322 Ga~v~rPe~~v~i~~~~a 339 (339) |.++++|++.+.|+-.+- T Consensus 283 d~~v~~~~a~~~l~~~~g 300 (300) T protein:vir:95 283 GWGIMDAASFARIVKTGG 300 (300) T ss_pred cceeecccceEEEecCCC Confidence 999999999999855444 No 108 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=99.20 E-value=2.3e-12 Score=84.37 Aligned_cols=288 Identities=12% Similarity=0.043 Sum_probs=155.3 Q ss_pred CccccCcccC-CCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc-ccceeeeccCC Q lcl|NC_020078. 1 MSIFDGQTPS-YDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI-SKAKLQKIAPG 78 (339) Q Consensus 1 ~~~~~~~~~~-~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i-G~~t~~~~~~g 78 (339) |.-=...=|. ....+. ++++.-.+..++++.++.+..++.+++++++++..+. +.+++||+. +.+.++-...| T Consensus 1 ~~~~~~~~~e~~~~~~~----~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg 75 (318) T protein:vir:24 1 MAAGTAFAVDHAQIAQT----GDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMG-TTGQKIPHWVGDVSAQWIGEG 75 (318) T ss_pred CCCCCCCCHHHHHhhcc----cCcccceeechhHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEeCCcceEEecCC Confidence 1111111111 011111 1112222445889999999999999999998876665 456778765 56778888888 Q ss_pred CCCCCCCCCCccceEEEEeehhhhhh-hHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Q lcl|NC_020078. 79 TTPPPSTEPHTSKIFLKIDTVIIARN-AEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAA 157 (339) Q Consensus 79 ~~i~~~~~~~~~~~~l~ID~~~y~~~-~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~ 157 (339) ++++.. .+..++.++..- ++... .|.+-=-.++.+|+.+.+.++++.++++..|+.++. +.... T Consensus 76 ~~~~~~-~~~f~~i~~~~~--k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~----G~g~~-------- 140 (318) T protein:vir:24 76 DMKPIT-KGNMTSQTIAPH--KIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAMH----GTDSP-------- 140 (318) T ss_pred cccccc-ccceeEEEEeeE--EEEEeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhhc----ccCCC-------- Confidence 888764 355565555442 33322 222211113568899999999999999999998751 11100 Q ss_pred cccCcccccc-ccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhh-----ccccc Q lcl|NC_020078. 158 QMPGHSGGNV-VTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNG-----EYVTS 231 (339) Q Consensus 158 ~~~g~~~~~~-~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~-----d~~~~ 231 (339) ...|...... +.....+. ......+.+.++...+...+. ..-..+++|..|..|.+-..-..+ +..+. T Consensus 141 ~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~----~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~ 214 (318) T protein:vir:24 141 FPTYIGQTTKAISIADTTG--ATTVYDQVAVNGLSLLVNDGK----KWTHTLLDDITEPILNGAKDQNGRPLFIESTYGE 214 (318) T ss_pred CCccccccccccccccccc--ccchHHHHHHHHHHhhccccC----CCCEEEEcHHHHHHHHHhhccCCceeecCccccC Confidence 0011111110 11111111 111122334445555544443 123569999999999642111111 11111 Q ss_pred ccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechh-- Q lcl|NC_020078. 232 AGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDL-- 309 (339) Q Consensus 232 ~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~-- 309 (339) .. .......+.|++|+.++++|.+. ...+-++|+. +..+...++..|..++.. T Consensus 215 ~~---~~~~~~~i~g~pv~~~~~~~~~~------------~~~~~gdfs~----------~~~~~~~~l~i~~~~~~~~~ 269 (318) T protein:vir:24 215 AA---SPFRSGRIVARPTILSDHVVEGT------------TVGFMGDFSQ----------LIWGQIGGLSFDVTDQATLN 269 (318) T ss_pred cc---ccccCceEEEEeeEEeCCCCCCc------------cEEEEeecce----------EEEEEecCeEEEEeecccee Confidence 11 11122468999999999987431 1112223332 333444455555544321 Q ss_pred ------------hh--HHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 310 ------------SK--LWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 310 ------------~~--~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .. .-.+++.+.+|.+++||++.+.|+.-+| T Consensus 270 ~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~a 313 (318) T protein:vir:24 270 LGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAEAFVALTNVVS 313 (318) T ss_pred ccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeecc Confidence 11 1335788899999999999999988877 No 109 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=99.18 E-value=2.1e-12 Score=84.64 Aligned_cols=300 Identities=13% Similarity=0.039 Sum_probs=148.2 Q ss_pred CccccCcc-----cCC-Cc-ccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEec-ccccee Q lcl|NC_020078. 1 MSIFDGQT-----PSY-DV-TRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRG-ISKAKL 72 (339) Q Consensus 1 ~~~~~~~~-----~~~-~~-~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~-iG~~t~ 72 (339) ...|.+-. ... .+ .|.-..+...+--.+..+.|+.++.+..+..+++++++++..+.++ +..++. .+.+.+ T Consensus 84 ~~a~~~~lr~~~~~~~~~~e~~a~~~~~~~~GG~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~a 162 (401) T protein:vir:44 84 KDAFVGFLRKGREDGLRDLERKALQVGTDEDGGYAVPEELDRSILSLLKDEVVMRQEATVITVGGS-DYKKLVNLGGTAS 162 (401) T ss_pred HHHHHHHHhhhhhhhhHHHHHHHhhcCCCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCC-ceEEEEecCCccc Confidence 00000000 000 00 0000000000001133489999999999999999999887776544 444544 455555 Q ss_pred eeccCCCCCCCCCCCCccceEEEEeehhhhhhh-HHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Q lcl|NC_020078. 73 QKIAPGTTPPPSTEPHTSKIFLKIDTVIIARNA-EPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDS 151 (339) Q Consensus 73 ~~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~~-vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~ 151 (339) .-...|...+....+..++.++.+ .++..+. |.+-=-.++.+|+.+.+.++.++++++..|+.++. +.....| T Consensus 163 ~wv~E~~~~~~~~~~~~~~v~~~~--~k~~~~~~iS~ell~ds~~~l~~~i~~~la~ai~~~~~~~~l~----G~G~~~p 236 (401) T protein:vir:44 163 GWVGETDTRSQTATSRLGLIEPFM--GEIYGNPQATQKMLDDAFFNVEAWINSELATEFAEQEEIAFTT----GDGTKKP 236 (401) T ss_pred eeeccccccCccccccceeeeeeh--hheeeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhc----cCCCCcc Confidence 444445555433333444444443 3433322 21111113467899999999999999999987752 1111111 Q ss_pred ccc-------ccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchh Q lcl|NC_020078. 152 PYG-------TAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHIT 224 (339) Q Consensus 152 ~~~-------~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~ 224 (339) ..- ........+....+ .++.....+ |+.++++...|..... .+-..+++|..|..|.+-..-. T Consensus 237 ~Gil~~~~~~~~~~~~~~~~~~~~-~t~~~~~~~----~d~i~~~~~~l~~~~~----~~a~~v~n~~~~~~L~~lkd~~ 307 (401) T protein:vir:44 237 KGFLAYESTEESDKARAFGKLQHI-VSGEATAVT----ADAIIKLIYTLRKAHR----TGAKFMMNNNSLFAIRLLKDTE 307 (401) T ss_pred ceeecccccccccccccccccccc-ccccccccC----HHHHHHHHHhcchhhh----cCCEEEEcHHHHHHHHHhhccC Confidence 000 00000000000000 011111122 4556666666654332 1224589999999985421111 Q ss_pred hhcccccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEe Q lcl|NC_020078. 225 NGEYVTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKI 304 (339) Q Consensus 225 n~d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~ 304 (339) . .|--. ..+.+|...+++|.+|+.++++|...... ..-+-++|+. ++..+.-+.+++ T Consensus 308 G-~~l~~--~~~~~g~~~~l~G~PVv~~~~~p~~~~~~---------~~i~~Gd~~~---------~~~i~~~~~~~~-- 364 (401) T protein:vir:44 308 G-NYLWR--PGLELGQPSSLAGYGIAENEQMPDIAADA---------KAIAFGNFKR---------GYTIVDRIGTRI-- 364 (401) T ss_pred C-ceeec--CCcCCCCCceecceeeEEecCcCCccCCc---------cEEEEeehhc---------cEEEEEecceEE-- Confidence 1 11111 11345666789999999999998432111 1111123322 222333333333 Q ss_pred eechhhhHH--HHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 305 FFDDLSKLW--FIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 305 ~~~~~~~~d--~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .+++-...+ .+++.+-+|.++++|++.+.|++.+| T Consensus 365 ~~~~~~~~~~v~~~a~~r~d~~~~~~~a~~~l~~~aa 401 (401) T protein:vir:44 365 LRDPYTNKPFVGFYTTKRTGGMLVDSQAIKLLKIAAA 401 (401) T ss_pred eeeccccCCcEEEEEEEEeccEEecccceEEEEeecC Confidence 233221122 25667789999999999999999999 No 110 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=99.16 E-value=3.7e-12 Score=83.25 Aligned_cols=290 Identities=11% Similarity=0.037 Sum_probs=155.1 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecccc-----ceeeec Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISK-----AKLQKI 75 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~-----~t~~~~ 75 (339) .....+.+....- +-...+..++...+..+.|+.++.+.....+.+++++++....+ .++.+++... ..+... T Consensus 103 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~~a~~v 180 (413) T protein:vir:81 103 GEYVAPRVKAASD-PASTATLTDEFQGGYGTTWNRNIIYRRREKLVVADLMDNLTMTN-TTIKYLMEKANRVVEGGFKTV 180 (413) T ss_pred hhhhhhHHHhhhh-hhhhcccccccccccchhhHHHHHHHHhhhhhHHhhcceeeccC-CceeEEEecccccccccccee Confidence 0011111111100 11122233344445669999999999999999999988777654 4566665432 233445 Q ss_pred cCCCCCCCCCCCCccceEEEEeeh----hhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Q lcl|NC_020078. 76 APGTTPPPSTEPHTSKIFLKIDTV----IIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDS 151 (339) Q Consensus 76 ~~g~~i~~~~~~~~~~~~l~ID~~----~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~ 151 (339) ..|+.+.....+..++.++.+.+. .+.+..++| ...+-+.+.++.++++++..|+.++. +.. T Consensus 181 ~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~d------s~~l~~~i~~~la~~~~~~~d~~~l~----G~G---- 246 (413) T protein:vir:81 181 AEGGKKPYMRFADFDIVTESLSKIAGLTKITDEMIED------YDFLVSYINARLLEELAIEEERQLLL----GDG---- 246 (413) T ss_pred cCcccccccCcccceeeEeeeeeEEEeehhhHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHhc----cCC---- Confidence 556666543323345555555422 222222222 12477888899999999999998752 111 Q ss_pred cccccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhc--cc--chhhh- Q lcl|NC_020078. 152 PYGTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQ--AE--HITNG- 226 (339) Q Consensus 152 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~--~~--~~~n~- 226 (339) ++....|.... .........+...+++.+.++...+..+....+ .. +|++|..|..|.+ |. +++-. T Consensus 247 ---~~~~~~Gi~~~---~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~-~vmn~~~~~~l~~lkd~~G~~l~~~ 317 (413) T protein:vir:81 247 ---TGNNLTGLLKR---DGIQTLAVSNKDELADSIYKAMTNISLATPFQA--DA-LVINPLDYQELRLAKDANGQYYGGG 317 (413) T ss_pred ---CCCcccccccc---cccccccccccchhHHHHHHHHHHhhhhccCCC--cE-EEEcHHHHHHHHHhhccCCceeccc Confidence 11111121110 111111122334556677777666554443222 33 5889999998854 32 11111 Q ss_pred cccccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeee Q lcl|NC_020078. 227 EYVTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFF 306 (339) Q Consensus 227 d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~ 306 (339) ...+..+ .-..+...+++|.+|+.|+.+|.+. .+-++|++ ++..+...++++++.+ T Consensus 318 ~~~~~~~-~~~~~~~~~l~G~pv~~s~~~~~~~--------------~~~gd~~~---------~~~~~~~~~~~v~~~~ 373 (413) T protein:vir:81 318 VFQGQYG-SGGIMLDPAPWGLRTVQSQVVPVGK--------------PVVGAFRS---------AASVLRKGGVRIDSTN 373 (413) T ss_pred ccccccc-ccccccCceecceeeEEcCCCCccc--------------EEEEeccc---------EEEEEEecceEEEEec Confidence 1111111 1112233579999999999998431 11122222 2223333455666655 Q ss_pred chh-hh-HH--HHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 307 DDL-SK-LW--FIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 307 ~~~-~~-~d--~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) ... .| -+ .+++.+.++..+.+|++.+.+++++| T Consensus 374 ~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~ 410 (413) T protein:vir:81 374 TNVDDFENNLITVRAEERVGLMVTFPEAIVQLDVAEV 410 (413) T ss_pred cccchhhcCcEEEEEEEeeccEEecccceEEEEecCC Confidence 432 33 22 56677789999999999999999998 No 111 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=99.14 E-value=3e-12 Score=83.78 Aligned_cols=282 Identities=11% Similarity=0.016 Sum_probs=151.5 Q ss_pred CccccCcccCC---CcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc-c--cceeee Q lcl|NC_020078. 1 MSIFDGQTPSY---DVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI-S--KAKLQK 74 (339) Q Consensus 1 ~~~~~~~~~~~---~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i-G--~~t~~~ 74 (339) ...+.+.+... .+.--+.....++.-.+..+.|+.++.+...+.+.+++++++.++. +.++.|++. | ...... T Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~ 166 (379) T protein:vir:10 88 NFNDIKEVRNGKSIQVKAVGDMTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGAVSIS-GGTYTFVRENGAGEGAIGA 166 (379) T ss_pred HHHhHHHHHhhhhhhhhhhcccccCCCCccccchhhhhHHHHhHHhhhhHHhhceeeecc-CCceEEEEeecCCCccccc Confidence 00000000000 0000001111122222456889999999998999999998877664 456777764 2 233334 Q ss_pred ccCCCCCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Q lcl|NC_020078. 75 IAPGTTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYG 154 (339) Q Consensus 75 ~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~ 154 (339) ...|+..+.. .+...++++.+. +|..+..-.-.-.+...++.+.+.++.++++++..|+.++.-+- T Consensus 167 v~Eg~~~~~~-~~~f~~i~~~~~--k~~~~~~iS~ell~D~~~l~~~i~~~la~~~~~~~~~~~~~g~~----------- 232 (379) T protein:vir:10 167 QVEGATKGQK-DYDISMIDVNTD--FIAGFTRYSKKMANNLPFLTSFIPNALRRDYAKAENAAFNAVLA----------- 232 (379) T ss_pred ccCCcccccc-ccceeeeEeeee--eEEeeehhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHhcccc----------- Confidence 4556666543 355565555543 44333221111122234588888899999999999987742110 Q ss_pred ccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccc Q lcl|NC_020078. 155 TAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGE 234 (339) Q Consensus 155 ~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~ 234 (339) ..........+..+. ++.|.++...+...+.. + + .+|++|..|..|.+-.. .+..|-...+. T Consensus 233 ---------~~~~~~~~~~~~~~~----~d~i~~~~~~~~~~~~~-~--~-~~vmn~~~~~~l~~lkd-~~G~~l~~~~~ 294 (379) T protein:vir:10 233 ---------ANATASTEIITNKNK----VEMLINEIAKQENLDFP-V--T-AIVLRPTDYYDILVTQK-SVGAGYGLPGV 294 (379) T ss_pred ---------cccccccccccCccc----HHHHHHHHHhhhhccCC-C--C-EEEEcHHHHHHHHHhhc-cCCceeccCCc Confidence 000000111111122 34556666666655542 1 3 35789999999865321 12223222222 Q ss_pred eeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechh-hhHH Q lcl|NC_020078. 235 TLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDL-SKLW 313 (339) Q Consensus 235 ~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~-~~~d 313 (339) ...+|...+++|++|+.|+.+|.+. -+-++|+.. ++.. -+++..+..++.. .|.. T Consensus 295 ~~~~~~~~~l~G~pvv~s~~~~ag~--------------~~~gdf~~~--------~~~~--~~~~~i~~~~~~~~~f~~ 350 (379) T protein:vir:10 295 VTQDNGVLRINGIPLFRATWLAANK--------------YYVGDWTRV--------TKVT--TEGLSLEFSEVEGTNFVK 350 (379) T ss_pred cCCCCCcceecceeeEecCCCCCCc--------------eEEeecccE--------EEEE--EeceEEEEeecccccccC Confidence 2234555689999999999987431 122333331 1222 2334556665542 3332 Q ss_pred ---HHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 314 ---FIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 314 ---~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .+++.+-+|.++++|++.+.+.+++= T Consensus 351 ~~~~~r~~~R~~~~v~~p~a~v~~~~~~~ 379 (379) T protein:vir:10 351 NNITARIEAQVALAVEQPAALIFGDFTAV 379 (379) T ss_pred CcEEEEEEEEeccEEecCccEEEEEecCC Confidence 46667789999999999999999888 No 112 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=99.13 E-value=1.2e-11 Score=80.48 Aligned_cols=300 Identities=14% Similarity=0.087 Sum_probs=146.7 Q ss_pred Ccc--ccCc-----------ccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccc-cccccccccceEEEec Q lcl|NC_020078. 1 MSI--FDGQ-----------TPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGF-VPVRSVRGTSTISNRG 66 (339) Q Consensus 1 ~~~--~~~~-----------~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~-v~~r~i~~G~tv~i~~ 66 (339) +.+ -.|. +....+.|. ....+++--.+.-+++..++.+..+..++++.+ .++-....| .+++|+ T Consensus 37 ~a~a~~~g~~~~a~~~a~~~~~~~~~~~a-~~~~~~~Gg~lvP~~~~~~ii~~l~~~s~l~~lg~~~v~~~~g-~~~~p~ 114 (366) T protein:vir:57 37 MSIAAGKGNLADAAKFAATELGDTGLSMA-ISTAAGSGGALIPQNMQNEVIELLRDRTVVRILGARSIPLPNG-NLSMPR 114 (366) T ss_pred HHHHhcccchhHHHHHHHHhhcchhhhhh-ccccccCCccccchhHHHHHHHHHhhhcchhhhceeeeecCCC-ceEEEE Confidence 000 0010 111111111 000111111123478899999888888888876 443333345 477877 Q ss_pred c-ccceeeeccCCCCCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_020078. 67 I-SKAKLQKIAPGTTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKA 145 (339) Q Consensus 67 i-G~~t~~~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~a 145 (339) . +.+.+.-...|++++.. .+..++.++..- ..+.-..|.+-=--++.+++.+.+.+++++++++..|+.++. + T Consensus 115 ~t~~~~a~wv~E~~~~~~s-~~~f~~i~~~~~-k~~~~~~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~----G 188 (366) T protein:vir:57 115 LSGGATAGYVGEGKDVVAT-GATFDDVKLSAK-TMIALVPVSNQLIGRAGFNVEQLLLGDILSAIATREDKAFLR----D 188 (366) T ss_pred EeCCcceeeeccCcccccc-ccceeEEEEeeE-EEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhc----c Confidence 6 56677777788888754 455666665542 222222222211124678999999999999999999987752 2 Q ss_pred cccccccccccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhh Q lcl|NC_020078. 146 AIASDSPYGTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITN 225 (339) Q Consensus 146 A~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n 225 (339) ....+.+. +.....+... ..........+...+.+.+..+......++.. ....+| +++|..|..|.+-.. .+ T Consensus 189 ~G~~~~p~---Gi~~~~~~~~-~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~-~~~a~~-vmn~~~~~~L~~lkd-~~ 261 (366) T protein:vir:57 189 DGTGDTPK---GMKAVATAAN-RLVAWTGTAINLTTIDEYLDSLILKHMDSNSN-MIRCGW-GLSNRTYMTLFGLRD-GN 261 (366) T ss_pred CCCCcccc---ceeecccccc-ceeeccccccchhhHHHHHHHHHHhhhccccc-cccCEE-EecHHHHHHHHhhhc-cC Confidence 11111111 0000000000 00111111112222211111122222222221 112334 799999999865211 11 Q ss_pred hcccccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEee Q lcl|NC_020078. 226 GEYVTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIF 305 (339) Q Consensus 226 ~d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~ 305 (339) ..|. +....-+.++|++|++|+++|......+ +...-|-++|+. +..+.-.++..++. T Consensus 262 G~~l------~~~~~~g~l~G~Pvv~s~~ip~~~~~~~------~~~~i~~gdfs~----------~~i~~~~~i~i~~~ 319 (366) T protein:vir:57 262 GNKV------YPEMSQGILKGYPIQRTSAIPANLGDDG------NESEIYFCDFND----------VVIGEDGMMKVDFS 319 (366) T ss_pred Ccee------ccCCCCCeecceeeEEccccccccccCC------CccEEEEEecce----------EEEEEecceEEEEe Confidence 1111 1111224689999999999986422111 111112233333 22334444555555 Q ss_pred echhh-----------hH--HHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 306 FDDLS-----------KL--WFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 306 ~~~~~-----------~~--d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) ++..+ +. -.++..+-++-+++||++.+.+ +++ T Consensus 320 ~ea~~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~l--t~~ 364 (366) T protein:vir:57 320 TEATYKDADGQLVSAFARNQSLIRVVTEHDIGFRHPEGLVLG--TGV 364 (366) T ss_pred eccccccccccchhhhhcCceeEEeeeeeCcEeeccccEEEE--ecc Confidence 54311 11 2577888899999999999888 666 No 113 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=99.12 E-value=3.3e-12 Score=83.50 Aligned_cols=284 Identities=11% Similarity=0.037 Sum_probs=158.2 Q ss_pred CccccCcccCC--CcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccc-cceEEEecc--ccceeeec Q lcl|NC_020078. 1 MSIFDGQTPSY--DVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRG-TSTISNRGI--SKAKLQKI 75 (339) Q Consensus 1 ~~~~~~~~~~~--~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~-G~tv~i~~i--G~~t~~~~ 75 (339) +.-|......- +....-....+++--.+.-+.|+.++.+..+..+.+++++++..+.+ ..+..+++. +...+... T Consensus 91 ~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v 170 (397) T protein:vir:49 91 VKDFKNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANID 170 (397) T ss_pred HHHHHHHHhcchhHHHHHhhccccccCcccccHhHHHHHHHHHHhhhhHHhhhceeecccCccceEEEeeccCCcceeee Confidence 11111100000 00000000111111223458999999999999999999998887764 223445544 33456777 Q ss_pred cCCCCCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Q lcl|NC_020078. 76 APGTTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGT 155 (339) Q Consensus 76 ~~g~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~ 155 (339) ..|+.++....+..++.++.... .+....|.+-=-.++.+|+.+.+.++.+++|++..|+.|+.-. T Consensus 171 ~E~~~~~~~~~~~~~~i~~~~~k-~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ai~~G~------------- 236 (397) T protein:vir:49 171 DEAGKIADVDDPKLSLIKYTIKR-YAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAI------------- 236 (397) T ss_pred cCccccccccccceeeEEeeeee-EEeeehhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhc------------- Confidence 77777764334556666666632 2222233321112356899999999999999999999875211 Q ss_pred cccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccce Q lcl|NC_020078. 156 AAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGET 235 (339) Q Consensus 156 ~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~ 235 (339) +.+. ..+..++ ++.|.++...|..... + .-.++++|..|..|.+-..- +..|.-.. . T Consensus 237 -------g~~~-----~~~~~~~----~d~i~~~~~~l~~~~~--~--~a~~vmn~~~~~~l~~lkd~-~G~~l~~~--~ 293 (397) T protein:vir:49 237 -------AALP-----TKPTLTK----WDDIIDLEAKVDPAIK--Q--TSFFLTNTSGFTALKKVKNA-LGDYLMER--D 293 (397) T ss_pred -------cccc-----ccccccc----HHHHHHHHHhhhhhhc--C--CCEEEEcHHHHHHHHHhhcC-CCceeecc--C Confidence 1010 0111122 3455666666665544 2 23568999999999653211 11222111 2 Q ss_pred eecceeEEEeceEEEEecc--ccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeech-hhh- Q lcl|NC_020078. 236 LNTKYMFAAFGVPVITSNN--AVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDD-LSK- 311 (339) Q Consensus 236 l~~G~v~~i~G~~V~~Snn--lp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~-~~~- 311 (339) +.+|.-..++|++|+.+.+ +|... .++..-+-++ -+.++..+...++.++..+.. +.| T Consensus 294 ~~~~~~~~l~G~PV~~~~~~~~~~~~---------~~~~~i~~gd---------~~~~~~~~~~~~~~i~~~~~~~~~~~ 355 (397) T protein:vir:49 294 VKSPTGYSIDGFAVKEVADRWLANGT---------GGAMPLYFGD---------LKQAVTLFDRQHMSLLSTNIGGGAFE 355 (397) T ss_pred cCCCCCceecceeeEEeccccccccc---------CCceeEEEee---------ccceEEEEeecceEEEEeccccchhh Confidence 3456667899999997544 33111 1111111111 133455555556666665432 222 Q ss_pred --HHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 312 --LWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 312 --~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) ...+++.+.+|.++++|++.+.++++++ T Consensus 356 ~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 385 (397) T protein:vir:49 356 TDTTKVRVIDRFDVVATDTEAFVPASFKAI 385 (397) T ss_pred cCceeEEEEeeeCcEEecccceEEEEeecc Confidence 3357788899999999999999999888 No 114 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=99.12 E-value=2.3e-12 Score=84.44 Aligned_cols=280 Identities=9% Similarity=0.055 Sum_probs=168.5 Q ss_pred CcccchhHHHH-HHHHHHHHHHHHHHhhhcc---ccccccc-----cccceEEEeccccc--eeeeccCCCCCCCCCCCC Q lcl|NC_020078. 20 HGAGDPLADVT-EQFTGTVEGTIKRRSIMAG---FVPVRSV-----RGTSTISNRGISKA--KLQKIAPGTTPPPSTEPH 88 (339) Q Consensus 20 ~~~~~~~a~~i-e~~~g~v~~~f~~~sv~~~---~v~~r~i-----~~G~tv~i~~iG~~--t~~~~~~g~~i~~~~~~~ 88 (339) .+.+-...+++ |+|...|.+.+.+.+.|.+ +++..+| .+|+++.||..+.. ...++..+++|+.. .+. T Consensus 1 MA~T~lsd~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~Gd~~~~~~~~~i~~~-kit 79 (351) T protein:vir:15 1 MAETHLSDLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLTGDPDNWTDSDDIDVN-NLT 79 (351) T ss_pred CCceeeeeeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCcccchh-eec Confidence 22222112444 9999999998888777643 2332222 35999999999864 68899999999864 455 Q ss_pred ccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCccccccc Q lcl|NC_020078. 89 TSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVV 168 (339) Q Consensus 89 ~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~ 168 (339) +.+..-+|=. .--.+.+.|+...-+--|++.++.++.+...++..+..++..| +++..... ...++..... T Consensus 80 t~~~~a~i~~-~~kg~~~tD~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~l-~gv~~~~~------~~~~~~~d~t- 150 (351) T protein:vir:15 80 SGKQQGIKFY-QTKAYGYTDLGTMISGAPVQETIGNRFAAFWQRADQKTLLSVL-KGVMGVTK------IANSKVYDQT- 150 (351) T ss_pred ccceeEEEEe-eccceehhhhhHhhccchHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhhchh------hcccceeccc- Confidence 5555555522 3335888999888888899999999999999999988887765 33322110 0000000000 Q ss_pred cccCccccccHHHHHHHHHHHHHHHHh-cCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceeecceeEEEece Q lcl|NC_020078. 169 TLAGANDYKDPAKLYAAIASLVEKFLE-KDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLNTKYMFAAFGV 247 (339) Q Consensus 169 ~~~~~~~~~~~~~l~~ai~~a~~~L~e-~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~~G~v~~i~G~ 247 (339) ...+.....++ +.|.++.++|.+ ++- .-..+++.|..|..|.+.. +++. ..... .++.|+.++|. T Consensus 151 ~~~~~~~~is~----~~l~~A~~~~GD~~~~----~~~~ivmhS~v~~~L~~~~-li~~--~~~s~---~~~~i~t~~G~ 216 (351) T protein:vir:15 151 KVSPSEPMFGA----KGFTGAIGLMGDLQDT----AFGAIAVNSATYSLMKVQG-LIET--IQPQN---GATPFEAYNGL 216 (351) T ss_pred cccccccccCH----HHHHHHHHHhcccccc----ceEEEEEChHHHHHHHhhh-hhhh--ccccc---cCcccceecce Confidence 01111222333 456677777743 321 1257789999999998763 4332 22111 14578999999 Q ss_pred EEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhh-------HHHHHHHHH Q lcl|NC_020078. 248 PVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSK-------LWFIDSWLA 320 (339) Q Consensus 248 ~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~-------~d~i~g~~~ 320 (339) +|+++..+|.... .++. .+-.++++-+-|+++.+..+ .+|..|++... ....-.+|. T Consensus 217 ~VivdD~~p~~~~--------~~~~-------~~ytsyl~~~GAi~~~~~~~-~ve~~rd~~~~~g~d~l~~r~~~~~hp 280 (351) T protein:vir:15 217 RIVLDDDIEIDLT--------DKTK-------PVSTSYIFAPGAVRYSTNMR-STETKYDPLINGGQDVIVQKRVGTIHV 280 (351) T ss_pred EEEEcCCCccccC--------CCCC-------ceeEEEEEecceeeeecCCc-CcceeecccCCCCceEEEEeeeeeeee Confidence 9999999884311 1111 12246788899999887665 68888877532 233344566 Q ss_pred hCCccccccceEEEEecCC Q lcl|NC_020078. 321 FGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 321 ~Ga~v~rPe~~v~i~~~~a 339 (339) +|.+--.+.-.....-|+- T Consensus 281 ~G~s~~~~~~~~~~~sPt~ 299 (351) T protein:vir:15 281 AGTSIKASFSPSKASFPTI 299 (351) T ss_pred eeeeecccccccCcCCcCh Confidence 6666543322111111111 No 115 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=99.12 E-value=5.9e-12 Score=82.16 Aligned_cols=277 Identities=12% Similarity=0.059 Sum_probs=170.9 Q ss_pred CcccchhHHHH-HHHHHHHHHHHHHHhhhcc--c-cc---ccc---c-cccceEEEeccccc--eeeeccCCCCCCCCCC Q lcl|NC_020078. 20 HGAGDPLADVT-EQFTGTVEGTIKRRSIMAG--F-VP---VRS---V-RGTSTISNRGISKA--KLQKIAPGTTPPPSTE 86 (339) Q Consensus 20 ~~~~~~~a~~i-e~~~g~v~~~f~~~sv~~~--~-v~---~r~---i-~~G~tv~i~~iG~~--t~~~~~~g~~i~~~~~ 86 (339) .+.+-...+++ |+|...|.+...+.+.|.+ . ++ .++ . .+|+++.+|..+.. ...++..+++++.. . T Consensus 1 MA~T~lsd~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~~v~~~~~i~~~-~ 79 (324) T protein:vir:59 1 MAYTKISDVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQVLNDTDDLVPQ-K 79 (324) T ss_pred CCceeeeceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCCCcccccCCCcccchh-h Confidence 22222122444 9999999998888877743 1 11 121 1 36999999999874 67889989999864 4 Q ss_pred CCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCccccc Q lcl|NC_020078. 87 PHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGN 166 (339) Q Consensus 87 ~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~ 166 (339) +.+.+..-+| ....-.+.+.|+-...+--|++.+++++.+..+++..|..++..|. ++...+... .. T Consensus 80 l~t~~~~a~i-~~~~k~~~~tD~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l~-g~~~~~~~~-----------~~ 146 (324) T protein:vir:59 80 INAGQDKAVL-ILRGNAWSSHDLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAELA-GVFSNDDMK-----------DN 146 (324) T ss_pred cccceeeEEE-EeecCceeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHH-Hhhhccccc-----------cc Confidence 5555555555 3466678889988888888999999999999999999988877664 333222110 01 Q ss_pred cccccCc-cccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceeecceeEEEe Q lcl|NC_020078. 167 VVTLAGA-NDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLNTKYMFAAF 245 (339) Q Consensus 167 ~~~~~~~-~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~~G~v~~i~ 245 (339) .....++ ....++ +.|.++.++|.++.- .-..++|.|..|..|.+.. +++. ..... .++.|+.++ T Consensus 147 ~~dvsa~~~~~~s~----~~l~~A~~~~GD~~~----~~~~ivmhS~v~~~L~~~~-li~~--~~~s~---~~~~i~~~~ 212 (324) T protein:vir:59 147 KLDISGTADGIYSA----ETFVDASYKLGDHES----LLTAIGMHSATMASAVKQD-LIEF--VKDSQ---SGIRFPTYM 212 (324) T ss_pred eeeeeccccceecH----HHHHHHHHHhCCccc----CcEEEEEchHHHHHHHHhh-hhhh--ccccc---cCceeeeec Confidence 1111111 112233 456677777755431 2358899999999999863 4432 21111 245789999 Q ss_pred ceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEe-eeeEEeeechhhhHHHHHH-----HH Q lcl|NC_020078. 246 GVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTI-PVTSKIFFDDLSKLWFIDS-----WL 319 (339) Q Consensus 246 G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~-~~~~e~~~~~~~~~d~i~g-----~~ 319 (339) |.+|+++..+|.... ++.. ..-.++.+.+-|+++.... ++.+|..|++..-.+.+.. ++ T Consensus 213 G~~VivdD~~p~~~~--------~~~~-------~~y~s~l~~~GAi~~~~~~~~v~vE~dRd~~~g~~~l~~r~~~~~~ 277 (324) T protein:vir:59 213 NKRVIVDDSMPVETL--------EDGT-------KVFTSYLFGAGALGYAEGQPEVPTETARNALGSQDILINRKHFVLH 277 (324) T ss_pred ccEEEEeCCCCcccc--------CCCC-------ceEEEEEEecCeEEEeecCCCcceecccCccccceEEEEeeEEEeE Confidence 999999999984211 1111 2234688889999998865 4778999998655554433 33 Q ss_pred HhCCccccccceE------EEEecCC Q lcl|NC_020078. 320 AFGVTINRTEYAG------VIKLPAA 339 (339) Q Consensus 320 ~~Ga~v~rPe~~v------~i~~~~a 339 (339) .+|.+-....-.+ +|..++- T Consensus 278 p~G~s~~~~~~~~~sPt~~~L~~~~N 303 (324) T protein:vir:59 278 PRGVKFTENAMAGTTPTDEELANGAN 303 (324) T ss_pred eeeEEecccccCCCCCChhhhcCCcc Confidence 4443333211100 0100000 No 116 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=99.12 E-value=1.6e-11 Score=79.83 Aligned_cols=286 Identities=10% Similarity=0.009 Sum_probs=154.6 Q ss_pred CccccCcc-cCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhcccccccccccc-ceEEEeccc--cceeeecc Q lcl|NC_020078. 1 MSIFDGQT-PSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGT-STISNRGIS--KAKLQKIA 76 (339) Q Consensus 1 ~~~~~~~~-~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G-~tv~i~~iG--~~t~~~~~ 76 (339) ..+-.|.. ......|.-.....++--.+.-+.|+.++.+..+..+.++++++...+.++ .++.+++.. ...+.... T Consensus 99 ~~~~~~~~~~~~~e~~a~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~ 178 (404) T protein:vir:39 99 NMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDA 178 (404) T ss_pred HHHhcchhhhhhhhhhhhhcccccCCceeccHHHHHHHHHHHHhhhhHHhhcceeeccCCcceEEEEeecCCccceeeec Confidence 00000100 000111111111111111245599999999999999999999988877653 344454443 34555666 Q ss_pred CCCCCCCCCCCCccceEEEEeehhhhh-hhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Q lcl|NC_020078. 77 PGTTPPPSTEPHTSKIFLKIDTVIIAR-NAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGT 155 (339) Q Consensus 77 ~g~~i~~~~~~~~~~~~l~ID~~~y~~-~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~ 155 (339) .|+.++....+..++.++.+. ++.. +.|.+-=--++.+|+.+.+.++.++++++..|+.|+.- T Consensus 179 Eg~~~~~~~~~~f~~i~~~~~--k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~g-------------- 242 (404) T protein:vir:39 179 EDGKIPDLDNPRLTIIKYLIK--RYAGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAA-------------- 242 (404) T ss_pred CccccccccccceeeEEeeee--eEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhc-------------- Confidence 677765433455566666664 3222 23332111235688999999999999999999987521 Q ss_pred cccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccce Q lcl|NC_020078. 156 AAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGET 235 (339) Q Consensus 156 ~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~ 235 (339) .+.+. ..+...+.+.+.+++. ..+... +. + +=..|++|..|..|.+-..- +..|.-.. . T Consensus 243 ------~g~~~-----~~~~~~~~~~i~~~~~---~~~~~~-~~-~--~a~~v~n~~~~~~L~~lkd~-~G~~l~~~--~ 301 (404) T protein:vir:39 243 ------MGTVP-----KKPTIAKFDDVITMIN---TSVDPA-II-A--TSSLLTNQSGLNKLALVKTA-EGKYLLEP--D 301 (404) T ss_pred ------ccccc-----cccccccHHHHHHHHH---Hhhhhh-hc-c--CCEEEEcHHHHHHHHHhhcc-CCceeecc--C Confidence 01110 1111223333333322 123322 21 1 22568999999999753211 11121111 1 Q ss_pred eecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechh-hh--- Q lcl|NC_020078. 236 LNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDL-SK--- 311 (339) Q Consensus 236 l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~-~~--- 311 (339) +.+|...+++|++|+.+.+.+.... ..+...-|-++| +.++..+...++++++.+... .| T Consensus 302 ~~~~~~~~l~G~pV~~~~~~~~~~~-------~~~~~~~~~gd~---------~~~~~~~~~~~~~i~~~~~~~~~~~~~ 365 (404) T protein:vir:39 302 PTKPNSYLIKGKKVIVVADRWLPNS-------GSTVYPLYYGDM---------SQAITLFDRENMSLLPTNIGAGAFETD 365 (404) T ss_pred cCCCCcceecceeEEEecccccCcc-------CCCccEEEEEec---------cccEEEEeecceEEEEeccchhhhhhc Confidence 3355667899999998765432211 011111111222 234444555666666655432 22 Q ss_pred HHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 312 LWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 312 ~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) ...+++.+.||.++++|++.+.++++++ T Consensus 366 ~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 393 (404) T protein:vir:39 366 TTKIRVIDRFDVKTTDSEALVAGSFTAI 393 (404) T ss_pred eeeEEEEeeeccEEecccceEEEEeecc Confidence 2357788899999999999999998877 No 117 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=99.12 E-value=3.1e-12 Score=83.71 Aligned_cols=284 Identities=10% Similarity=-0.002 Sum_probs=154.9 Q ss_pred CccccCcccCCCcccCCc--cCcccchhHHHHHHHHHHHHHHHHHHhhhcccccccccccc-ceEEEecccc--ceeeec Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQ--RHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGT-STISNRGISK--AKLQKI 75 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~--~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G-~tv~i~~iG~--~t~~~~ 75 (339) ..-|.............. ....++--.+.-+.|+.++.+..+..+.+++++++..+..+ .++.+++... ..+... T Consensus 91 ~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v 170 (397) T protein:vir:49 91 VKDFKNLVRGRYQNLLDSKTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLD 170 (397) T ss_pred HHHHHHHhhcchhhHHHhhhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEEeeccCCcceeee Confidence 111111111100000001 01111111234599999999999999999999988877653 3445555432 344444 Q ss_pred cCCCCCCCCCCCCccceEEEEeehhhhh-hhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Q lcl|NC_020078. 76 APGTTPPPSTEPHTSKIFLKIDTVIIAR-NAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYG 154 (339) Q Consensus 76 ~~g~~i~~~~~~~~~~~~l~ID~~~y~~-~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~ 154 (339) ..|+.+.....+..+++++... ++.. ..|.+-=-.++.+|+.+.+.+++++++++..|+.|+.- T Consensus 171 ~E~~~~~~~~~~~~~~v~~~~~--k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ail~G------------- 235 (397) T protein:vir:49 171 DEGGQIGQNDDPKLSLIRYAIK--RYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEA------------- 235 (397) T ss_pred ccccccccccccceeeeEeeee--eeEeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhc------------- Confidence 4566665433344456666554 3322 22322111235689999999999999999999987521 Q ss_pred ccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccc Q lcl|NC_020078. 155 TAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGE 234 (339) Q Consensus 155 ~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~ 234 (339) .+.+. ..+...+. +.|.++...|.....+ .-..+++|..|..|.+-..= +..|.-. . T Consensus 236 -------~g~~~-----~~~~~~~~----d~i~~~~~~l~~~~~~----~a~~v~n~~~~~~l~~lkd~-~g~~l~~--~ 292 (397) T protein:vir:49 236 -------IGTLP-----NKPTLAKW----DDIIDLQAKVDPAIKQ----TSLFLTNTSGFTALKKVKNA-MGDYLME--R 292 (397) T ss_pred -------ccccc-----ccccccCH----HHHHHHHHhhhhhhcC----CCEEEEcHHHHHHHHHhhcc-CCceeec--c Confidence 11111 11122233 4455666667665542 22568999999988652111 1111111 1 Q ss_pred eeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEe-ccceeEEEEEeeeeEEeeechh-hh- Q lcl|NC_020078. 235 TLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMF-SPKALLAGSTIPVTSKIFFDDL-SK- 311 (339) Q Consensus 235 ~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~-h~~A~~~~~~~~~~~e~~~~~~-~~- 311 (339) .+.+|.-.+++|++|+.+.+.+.... ..+.. ..++. .+.++..+....++++..+... .| T Consensus 293 ~~~~g~~~~l~G~pV~~~~~~~~~~~-------~~~~~----------~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~ 355 (397) T protein:vir:49 293 DVKSPTGYSIDGFVVKEISDRFLPNG-------TGGAM----------PLYFGDLKQAVTLFDRQHLSLLSTNIGGGAFE 355 (397) T ss_pred cccCCCCceecceeeEEecccccccc-------cCCce----------eEEEeeccceEEEEeecccEEEEeccccchhh Confidence 23456667899999987654321110 00111 01122 2445656666666666654332 22 Q ss_pred --HHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 312 --LWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 312 --~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) ...+++.+.+|.++++|++.+.+++++. T Consensus 356 ~~~~~~~~~~r~d~~~~~~~a~~~~~~~~~ 385 (397) T protein:vir:49 356 TDTTKVRVIDRFDVVSTDTEAFVPASFKAI 385 (397) T ss_pred cCeeeEEEEEeeccEEecccceEEEEeccc Confidence 2346778889999999999999988777 No 118 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=99.09 E-value=1.9e-11 Score=79.32 Aligned_cols=293 Identities=13% Similarity=0.045 Sum_probs=150.3 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc-ccceeeeccCCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI-SKAKLQKIAPGT 79 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i-G~~t~~~~~~g~ 79 (339) |. ..+.+--.+.-++++.++.+..+..++++.+.++..... ..++||++ +.+++.-...|+ T Consensus 1 Ma-----------------t~tt~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~-~~~~~p~~~~~~~a~wv~Eg~ 62 (311) T protein:vir:99 1 MA-----------------TFGTGNLKNLPRNIADGMVKDVVQGSTVAVLSARKPQRF-GNEDIITFNGRPKAEFVGEGQ 62 (311) T ss_pred Cc-----------------eecCCCceeccHHHHHHHHHHHHhhchhhhhcceeeccC-CceEEEEEeCCceeEEeecCc Confidence 21 111111113348899999999999999999987665554 44688887 677888888888 Q ss_pred CCCCCCCCCccceEEEEeehhhhhhhHHHHHHH-----hcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Q lcl|NC_020078. 80 TPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEF-----QTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYG 154 (339) Q Consensus 80 ~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~-----q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~ 154 (339) +++.. .+...+.++.. .|+..+ +.==++. ++..|+.+.+.+++++++++.+|+.++.-- .. . T Consensus 63 ~~~~~-~~~f~~v~l~~--~k~~~~-~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~----g~-----~ 129 (311) T protein:vir:99 63 QKSST-TGEFDFVTSTP--KKAQVT-MRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRI----NP-----L 129 (311) T ss_pred ccccc-cceeeEEEEee--EEEEEe-ehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhccc----Cc-----c Confidence 88754 35556555544 233322 2222222 245789999999999999999999886211 00 0 Q ss_pred ccccccCcccccccc-ccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhccccccc Q lcl|NC_020078. 155 TAAQMPGHSGGNVVT-LAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAG 233 (339) Q Consensus 155 ~~~~~~g~~~~~~~~-~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~ 233 (339) ++....|........ ...+....+...++.-+..+...+.......+. .. .+++|..+..|.+-.. .+..|.-.. T Consensus 130 ~g~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~-~~-~vmn~~~~~~L~~lkd-~~G~~l~~~- 205 (311) T protein:vir:99 130 TGTVIPGWSNYLGAASKRVELTADTIANPDLAIEAAVGLLVANGHPTPV-NG-LALHPSIAWGLSTARY-TDGRKKFPE- 205 (311) T ss_pred cCccccccccccccccceeeccccccchhHHHHHHHHHHHhhhccCCCc-cE-EEEcHHHHHHHHhhhc-cCCCeeecC- Confidence 001111111100000 000001111122233344444444333221121 22 5889999999965221 111121111 Q ss_pred ceeecceeEEEeceEEEEeccccccccccccccC--CCccccccccccceEEEEEeccceeEEEEEeeeeEEeeec--hh Q lcl|NC_020078. 234 ETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLS--NANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFD--DL 309 (339) Q Consensus 234 ~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~--~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~--~~ 309 (339) ....+..++++|++|+.|+++|........... .......+-++|+. .+-....++++.+..+. .+ T Consensus 206 -~~~~~~~~~l~G~Pv~~s~~i~~~~~~~~~~~~~~~~~~~~~~~Gdf~~---------~~~~~~~~~~~~~~~~~~~~~ 275 (311) T protein:vir:99 206 -LGLGIGVSSFEGIDASVSDTVNGGDEADPDDEDLDAARAVRGIVGDFAN---------GIHWGVQRDIPVELIKYGDPD 275 (311) T ss_pred -cccCCCCceecceeeEeecccccccccccccchhhccCcceEEEeeccc---------cEEEEEecCceEEEeecCCCC Confidence 122455678999999999999854322111111 00111112223322 22222233334444332 12 Q ss_pred hh-----HH--HHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 310 SK-----LW--FIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 310 ~~-----~d--~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .. .| .+++...+|..+++|++ +.+...+| T Consensus 276 ~~~~~~~~d~~~~r~~~r~d~~v~~~~~-v~~~~~~A 311 (311) T protein:vir:99 276 GQGDLKRHNQIALRLEIVYGWYVFTDRF-VVIENAVA 311 (311) T ss_pred cchhhhhcCcEEEEEEEeecceecChhH-eeeecccC Confidence 11 22 25777889999988764 45555566 No 119 >protein:vir:105610 Length: 430 # NCBI annotation: virion structural protein # Family: family:all:974 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164307;genbank:gi:56692923;genbank:GeneID:3197221 Probab=99.09 E-value=9.6e-11 Score=75.51 Aligned_cols=323 Identities=11% Similarity=0.066 Sum_probs=171.9 Q ss_pred cCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHH----hhhcc----------------------cccccccc--ccc Q lcl|NC_020078. 9 PSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRR----SIMAG----------------------FVPVRSVR--GTS 60 (339) Q Consensus 9 ~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~----sv~~~----------------------~v~~r~i~--~G~ 60 (339) -|..-|+. +.+++. -.++|+.-|.++-.+. .+|.+ +++..++. .|+ T Consensus 1 ~~~a~T~~----~~~~p~--a~~~ws~~l~~~~~k~~~~~~kl~G~~~~~~~~~~~~~~~~ts~~~pI~r~~dL~K~~GD 74 (430) T protein:vir:10 1 MTASKTTM----RYGDPN--AMIQQAAGLFALCQGRNSTLNRLTGKMPSGTSDAEKKTKGQSSLELPIVQAQDLGRNKGD 74 (430) T ss_pred Ccceeeec----ccCChh--HHHHHHHHHHHHHhhhhhhHHHhhccccccccchhhhccCCCCCCccEEEeccCCCCCcc Confidence 11122222 445555 3688998887776553 22223 66666775 399 Q ss_pred eEEEeccccceeeeccCCCCCCCCC-CCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020078. 61 TISNRGISKAKLQKIAPGTTPPPST-EPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFF 139 (339) Q Consensus 61 tv~i~~iG~~t~~~~~~g~~i~~~~-~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~ 139 (339) +|.|+-+...+-.....++.+++.+ .++...-.|.|||.--.-..=..+++-.+-+|+|++.-..++.=+++..||.+| T Consensus 75 ~Vtf~L~~~L~g~gv~Gd~~lEGnee~L~~~~d~l~IDq~R~~V~~gg~msqQRt~~dlR~~ar~~L~~w~~~~~Dq~~~ 154 (430) T protein:vir:10 75 EVRFHFVQPANAFPIMGSEYAEGKGTGLKIGSDQLRVNQARFPVDLGDVMSQIRNPYDLRRLGRPKAKWFMDAYLDQSML 154 (430) T ss_pred EEEEeEeeccccCceecCceeeccccceEEEeeEEEEeeeccccccCCchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHH Confidence 9999988877766666667777764 356667789999865421111345665678999999999999999999999999 Q ss_pred HHHHhhcc--------------------ccccccccc---ccccCccccccccccCcc-ccccHHHH-HHHHHHHHHHHH Q lcl|NC_020078. 140 IMAAKAAI--------------------ASDSPYGTA---AQMPGHSGGNVVTLAGAN-DYKDPAKL-YAAIASLVEKFL 194 (339) Q Consensus 140 ~~l~~aA~--------------------~~~~~~~~~---~~~~g~~~~~~~~~~~~~-~~~~~~~l-~~ai~~a~~~L~ 194 (339) ..|..+-. +.++..... -..++...-+.....+.. ..++.+.+ ++.|.++...++ T Consensus 155 v~laGarg~~~~~~~~~~~~~~~~~~~~~~N~v~aPt~nrh~~~~G~at~~~~~~~~~~sl~stD~~s~~~id~a~~~a~ 234 (430) T protein:vir:10 155 VHLAGARGNHYNKEWCLPLETHPKLADMLVNRVKAPTKNRHFVASADAITGVAPNAGEYNITTADVLDVDVVDSIATYMD 234 (430) T ss_pred HHHhhhhcccccccccccccCCcchhhhhccccCCCCCceeEeecccccccccccccccchhhhcccCHHHHHHHHHHHH Confidence 99864311 011111000 000010000000000001 11112222 355666777776 Q ss_pred hcCCCCC-----cCC-------eEEEECHHHHHHHhcccchhh----hc-cccc-ccceeecceeEEEeceEEEEeccc- Q lcl|NC_020078. 195 EKDVRPN-----EED-------MILVLPPAAFTALMQAEHITN----GE-YVTS-AGETLNTKYMFAAFGVPVITSNNA- 255 (339) Q Consensus 195 e~dV~~p-----~~~-------R~~vv~P~~~~~Ll~~~~~~n----~d-~~~~-~~~~l~~G~v~~i~G~~V~~Snnl- 255 (339) ..+.|.+ .+. +++++.|.+|..|..++.+.. +. +... ...+|..|.++.++|+-|++-.+. T Consensus 235 ~~~~~i~Pv~v~gd~~~g~~~~yV~~~~p~q~~~Lr~dt~~~~wq~~~~a~a~~g~~nPlF~G~~gm~ngvii~~~~~vi 314 (430) T protein:vir:10 235 QIELPPPPVKFEGDEAAEDSPIRVLLCSPAQYNSFAKQEKFRSWQAAALARASNAKQHPIFRVDAGLWSNTLIIKMPKPI 314 (430) T ss_pred hhCCCCcceEeecccccCCccEEEEEechHHHHHHhhCcchHHHHHHHHHhhcccccCCceecceeeecCeEEecCCcee Confidence 6543211 122 788899999999999998742 21 1221 234688999999999999986432 Q ss_pred cccccccccccCCCc--------cccccccccceEEEEEeccceeEEEEEee-------eeEEeee-chhhhHHHHHHHH Q lcl|NC_020078. 256 VFGKTITDHLLSNAN--------NEKAYDGDFKDIVAQMFSPKALLAGSTIP-------VTSKIFF-DDLSKLWFIDSWL 319 (339) Q Consensus 256 p~~~~~~~~~l~~~~--------~~~~y~~~~~~~~~~~~h~~A~~~~~~~~-------~~~e~~~-~~~~~~d~i~g~~ 319 (339) .+..+......+... -...+.+...-.-++++-..|++.+-... ...|-.. ..+++ .|-... T Consensus 315 rf~~g~~~~~~a~~~~~~~~~~~~~a~~~~~~~v~RalllGaQA~~~A~g~~~~~g~~f~w~Ee~~D~g~~~--~i~~~~ 392 (430) T protein:vir:10 315 RFYAGDTIKYCAAYNSEAESSAVVSDSFGNQYAVDRALLLGGQALAQAWAASEHSGMPFFWSEKDMDHGDKL--ELLIGA 392 (430) T ss_pred eecCCCccccccCCcccccccccccccccccccchhhhhccchhheeeeeccCCCCcceeeeeeccccCchh--hhhhhH Confidence 111110000000000 00011111112223455555554443332 1223222 22222 233345 Q ss_pred HhCCcccccc----------ceEEEEecCC Q lcl|NC_020078. 320 AFGVTINRTE----------YAGVIKLPAA 339 (339) Q Consensus 320 ~~Ga~v~rPe----------~~v~i~~~~a 339 (339) ++|.+=.|=. =-++|.+++| T Consensus 393 i~G~kK~rF~~~~~~~~~~~DfGvi~idta 422 (430) T protein:vir:10 393 ILGCSKIRFAVEATNGLEYTDHGVMAIDTA 422 (430) T ss_pred HhccceeeecCCCCCCceeeeeEEEEhhhh Confidence 5565555542 3677777777 No 120 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=99.09 E-value=3.7e-11 Score=77.78 Aligned_cols=299 Identities=14% Similarity=0.103 Sum_probs=151.9 Q ss_pred CccccC----ccc------C-CCcccCCccCcccchh--HHHHHHHHHHHHHHHHHHhhhccc-cccccccccceEEEec Q lcl|NC_020078. 1 MSIFDG----QTP------S-YDVTRPNQRHGAGDPL--ADVTEQFTGTVEGTIKRRSIMAGF-VPVRSVRGTSTISNRG 66 (339) Q Consensus 1 ~~~~~~----~~~------~-~~~~r~~~~~~~~~~~--a~~ie~~~g~v~~~f~~~sv~~~~-v~~r~i~~G~tv~i~~ 66 (339) +++..+ +-+ . +.-.+....+..++.. .+.-+.++.++.+..+..++++.+ ++..+...| .+++|+ T Consensus 103 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~-~~~~p~ 181 (435) T protein:vir:14 103 RALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNG-NITIPR 181 (435) T ss_pred HHHHhhcchhhHHHHHHHhhhhhhhhhhhcccCCcCCCccccchhHHHHHHHHHhhhchhhhhcceeeecCCC-ceEEEE Confidence 000000 000 0 0000111111111111 133478888998888888888876 433343344 578888 Q ss_pred c-ccceeeeccCCCCCCCCCCCCccceEEEEeehhhhh-hhHH--HHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020078. 67 I-SKAKLQKIAPGTTPPPSTEPHTSKIFLKIDTVIIAR-NAEP--MLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMA 142 (339) Q Consensus 67 i-G~~t~~~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~-~~vd--d~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l 142 (339) + +.+.+.-...|+.++.. .+...+.++..- ++.. +.|. -++++....++.+.+.++.++++++..|+.++ T Consensus 182 ~~~~~~a~~v~E~~~~~~~-~~~f~~i~~~~~--k~~~~~~iS~ell~ds~~~~~l~~~i~~~l~~ai~~~~d~a~l--- 255 (435) T protein:vir:14 182 LKGGAIVGYIGADTDIPTT-QQQFDDLKLTAK--KMAALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFI--- 255 (435) T ss_pred EeCCcceeeeccCcccccc-ccceeEEEeeeE--EEEEeehhhHHHHHhhccCHHHHHHHHHHHHHHHHHHHHHHhh--- Confidence 7 55666666667766643 355555555553 3322 3332 23333223458899999999999999999885 Q ss_pred HhhcccccccccccccccCcccccc-ccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhccc Q lcl|NC_020078. 143 AKAAIASDSPYGTAAQMPGHSGGNV-VTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAE 221 (339) Q Consensus 143 ~~aA~~~~~~~~~~~~~~g~~~~~~-~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~ 221 (339) .+.... ....|...... ..........+.+.+++.+.++...+...+.. ......|++|..|..|.+-. T Consensus 256 -~G~G~~-------~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~--~~~~~~v~n~~~~~~L~~lk 325 (435) T protein:vir:14 256 -RDDGTA-------NTPKGLRFWALPSNVITASDASTLQKIETDLGKVILALENADAN--LTQPGWIMAPRTFRFLEGLR 325 (435) T ss_pred -ccCCCC-------ccccceeecccccceeccccccchhhHHHHHHHHHHHhhhcccc--ccCCEEEEcHHHHHHHHHhh Confidence 111111 11122211111 11112222234444555666666666544331 12335689999999886532 Q ss_pred chhhhcccccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeee Q lcl|NC_020078. 222 HITNGEYVTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVT 301 (339) Q Consensus 222 ~~~n~d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~ 301 (339) . .+..|.-. .+.. +.++|.+|++|+.+|......+. ...-+-++|+.. ..+.-.+++ T Consensus 326 d-~~G~~l~~---~~~~---g~l~G~Pv~~~~~~p~~~~~~~~------~~~i~~gd~s~~----------~i~~~~~~~ 382 (435) T protein:vir:14 326 D-GNGNKVYP---ELAN---GMLKGYPVGKTTQVPINLGETGK------ESEIYFTDFGDV----------FIGEEETLE 382 (435) T ss_pred c-cCCceecc---CCCC---CeeecceeEeeccccccccCCCc------cceEEEeecccE----------EEEEecccE Confidence 1 22222111 1122 36899999999999854221111 011222333332 122333444 Q ss_pred EEeeechh----------hh---HHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 302 SKIFFDDL----------SK---LWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 302 ~e~~~~~~----------~~---~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .+..++.. .| .-.+++.+.++.++.||++++.|.=-+. T Consensus 383 ~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~ 433 (435) T protein:vir:14 383 IDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLAGVAW 433 (435) T ss_pred EEEeccccccccccchhhhhhcChhheeeeeeeCceeecccceEEEecCCC Confidence 45444321 12 2567889999999999999887743332 No 121 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=99.08 E-value=1.6e-11 Score=79.70 Aligned_cols=296 Identities=10% Similarity=0.025 Sum_probs=156.0 Q ss_pred Ccccc--------CcccCC-CcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccc-cceEEEec-ccc Q lcl|NC_020078. 1 MSIFD--------GQTPSY-DVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRG-TSTISNRG-ISK 69 (339) Q Consensus 1 ~~~~~--------~~~~~~-~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~-G~tv~i~~-iG~ 69 (339) .+..+ ..+.-. .-.|.-..+..++--.+.-+.|+.++.+..+..+.++++++...+.+ ..++.+++ .+. T Consensus 85 ~~~~~~~~~~~~~~~~~~~~~e~~a~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g~~~~~~~~~~ 164 (404) T protein:vir:10 85 RAIADNLLKQKNQRGLNLSEKEINAISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEPVFTRSGSRTYEKRSKQ 164 (404) T ss_pred HHHHHHHHHHHHhhhhcchhhHHhhhccccCCCCceeechhHHHHHHHHHhhhhhHhhhhceeeccCCccceEEEEecCC Confidence 00000 011100 01111000011111113348899999999999999999999888864 33555655 466 Q ss_pred ceeeeccCCCCCCCCC-CCCccceEEEEeehhhhhh-hHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_020078. 70 AKLQKIAPGTTPPPST-EPHTSKIFLKIDTVIIARN-AEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAI 147 (339) Q Consensus 70 ~t~~~~~~g~~i~~~~-~~~~~~~~l~ID~~~y~~~-~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~ 147 (339) ..+.....|+...... .+..++.++... ++..+ .|.+-=--++.+++.+.+.++.++++++..|+.|+ .+.. T Consensus 165 ~~~~~v~e~~~~~~~~~~~~f~~i~~~~~--k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il----~G~g 238 (404) T protein:vir:10 165 KPMKPLSENQQIPTNGDNGKLERFNFKLK--DLADFMSIPNDLLKFADKSLEDWIINWFVDKVRITRNAEIL----YGAG 238 (404) T ss_pred cceeeccccccccccccccceeeeEeehe--eeEeeehhhHHHHhhcHHHHHHHHHHHHHHHHHHHHHHHHh----hcCC Confidence 7777777777765432 233444444443 33322 22221111356789999999999999999999875 1211 Q ss_pred cccccccccccccCccccccccccCccccccHHHHHHHHHHHHH-HHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhh Q lcl|NC_020078. 148 ASDSPYGTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVE-KFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNG 226 (339) Q Consensus 148 ~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~-~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~ 226 (339) ... ...|........ +...+....++.+..+.. .|... .. + .. .+|++|..|..|.+-..- +. T Consensus 239 ~~~-------~~~gi~~~~~~~----~~~~~~~~~~~~~~~~~~~~l~~~-~~-~-~~-~~v~n~~~~~~L~~lkd~-~G 302 (404) T protein:vir:10 239 GDE-------HATGIMTANKFK----KITLPKSPALKDFKKCKNVELLNV-FK-A-TS-SWIVNQDGFNYLDSLEDK-TG 302 (404) T ss_pred CCC-------cccceeeccccc----eeeccccccHHHHHHHHHhhhhcc-cc-C-CC-EEEEcHHHHHHHHHhhcc-CC Confidence 111 111111111111 111111122344444433 34333 21 2 12 468999999998663221 22 Q ss_pred cccccccceeecceeEEEeceEEEEec-cccccccccccccCCCccccccccccceEEEEEec-cceeEEEEEeeeeEEe Q lcl|NC_020078. 227 EYVTSAGETLNTKYMFAAFGVPVITSN-NAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFS-PKALLAGSTIPVTSKI 304 (339) Q Consensus 227 d~~~~~~~~l~~G~v~~i~G~~V~~Sn-nlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h-~~A~~~~~~~~~~~e~ 304 (339) .|.-.. .+.+|...+++|.+|+.+. .+|... .+ ..+.++.+ +.++..+....++++. T Consensus 303 ~~l~~~--~~~~~~~~~l~G~PV~~~~~~~~~~~---------~~----------~~~~~~gd~s~~~~~~~~~~~~i~~ 361 (404) T protein:vir:10 303 RPYLQP--DPKDPTQYRFLGLPVIELPNDLLLST---------ES----------AIPVLLGDTKEAYKYVSDGAYELAT 361 (404) T ss_pred ceeecc--CcCCCCCccccceeeEEecccccCCC---------CC----------ccEEEEEeccccEEEEEecceEEEE Confidence 222211 2345666789999998643 343221 11 11122332 3345455555666665 Q ss_pred eech-hhh---HHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 305 FFDD-LSK---LWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 305 ~~~~-~~~---~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .++. ..| .-.+++.+.+|.++++|++.+.+++++| T Consensus 362 ~~~~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~a 400 (404) T protein:vir:10 362 TNIGAGAFETNTTKARIIMRIDGNVKDSEALLIAEIPVE 400 (404) T ss_pred eccccchhhcCceEEEEEEeeccEEecccceEEEEeecc Confidence 5443 222 2347888999999999999999999999 No 122 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=99.08 E-value=1.7e-11 Score=79.57 Aligned_cols=283 Identities=10% Similarity=-0.012 Sum_probs=156.6 Q ss_pred CccccCcccC-C-------CcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhcccccccccccc-ceEEEec-cccc Q lcl|NC_020078. 1 MSIFDGQTPS-Y-------DVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGT-STISNRG-ISKA 70 (339) Q Consensus 1 ~~~~~~~~~~-~-------~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G-~tv~i~~-iG~~ 70 (339) ..-+.|.... . --.|.......++--.+.-+.|+.++.+.....+++++++++..+.++ ..+.+++ .+.+ T Consensus 99 ~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 178 (397) T protein:vir:12 99 LKGLRGKRLTDEERDLLDSPEFRAMSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQYVTVEPVTTRSGTRLLEKNADMV 178 (397) T ss_pred HHHHhccCCcHHHHHHHhhhhhhhccccccccCcccCchhHHHHHHHhhhhhhhHHhhcceeeccCCceeEEEEEecCCc Confidence 0001111100 0 011111111111111234599999999999999999999888877642 3454544 4667 Q ss_pred eeeeccCCCCCCCCCCCCccceEEEEeehhhhhh-hHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_020078. 71 KLQKIAPGTTPPPSTEPHTSKIFLKIDTVIIARN-AEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIAS 149 (339) Q Consensus 71 t~~~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~-~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~ 149 (339) .+..+..|+.++....+..++.++... ++..+ .|.+-=--.+.+|+.+.+.++.+++|++..|+.|+.-. T Consensus 179 ~a~~v~Eg~~~~~~~~~~~~~v~~~~~--k~~~~~~is~e~l~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~------- 249 (397) T protein:vir:12 179 PFSPVEELGNLPEIDQPRFTKVSYSII--DYGGIMTLSNSMLNDSDQAIMTYVAKWFAKKSVVTRNNLILAAI------- 249 (397) T ss_pred ceeeecccccccccccccceeEEeehe--eeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhcc------- Confidence 788888888776444455566666554 43332 22221112356789999999999999999999875210 Q ss_pred cccccccccccCccccccccccCccccccHHHHHHHHHHHH-HHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcc Q lcl|NC_020078. 150 DSPYGTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLV-EKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEY 228 (339) Q Consensus 150 ~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~-~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~ 228 (339) +.+. +....+.+ .+.++. ..|....- .+=..+++|..|..|.+-..- +..| T Consensus 250 -------------g~~~------~~g~~~~~----~i~~~~~~~l~~~~~----~~a~~~~n~~~~~~L~~lkd~-~G~~ 301 (397) T protein:vir:12 250 -------------ASLK------KVDIDGLD----GIKKALNVTLDPMVA----PGSIVLTNQDGYDWLDTLKDG-TGRY 301 (397) T ss_pred -------------cccc------ccccccHH----HHHHHHhhccchhhh----CCCEEEEcHHHHHHHHHhhcc-CCce Confidence 1111 11112223 333333 23433221 122468999999998652111 1222 Q ss_pred cccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeech Q lcl|NC_020078. 229 VTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDD 308 (339) Q Consensus 229 ~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~ 308 (339) .-.. .+.+|.-.+++|.+|+++++...+.. .+...-+-++| +.++..+....+..+..+.. T Consensus 302 l~~~--~~~~g~~~~l~G~pv~~~~~~~~~~~--------~~~~~~~~gd~---------~~~~~~~~~~~~~i~~~~~~ 362 (397) T protein:vir:12 302 LLQP--DPTNPTKKLLDGRPVVPFTNRVLKTQ--------KGKAPLIIGNL---------KEAIVLFDREQQSIASTDTG 362 (397) T ss_pred eecc--cccCCCCccccceeeEEecccccccC--------CCccEEEEEeh---------hceEEEEeecceEEEEeccc Confidence 2111 23466667899999999887532211 11111111222 23444444455556655433 Q ss_pred h-hh---HHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 309 L-SK---LWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 309 ~-~~---~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) . .| ...+++.+-++.++++|++.+.+++++= T Consensus 363 ~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 363 AGAFETNSTKVRGIEREDVRKWDEDAVVFGQITVE 397 (397) T ss_pred cchhhcCceEEEEEEeeccEEecccceEEEEEeeC Confidence 2 12 3467888899999999999999999888 No 123 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=99.07 E-value=2.6e-11 Score=78.64 Aligned_cols=282 Identities=13% Similarity=0.134 Sum_probs=149.8 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc--ccceeeeccCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI--SKAKLQKIAPG 78 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i--G~~t~~~~~~g 78 (339) .....+.. ....+......+++--.+..+.|+.++.+..+..+++++++++..+.++ +.+++.. +...+.....+ T Consensus 97 ~~~l~~~~--~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~E~ 173 (394) T protein:vir:10 97 NDFIHSHG--KVIDNAAGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTP-KGTYPILKRATDRFSSVAEL 173 (394) T ss_pred HHHHhccc--hhhhhhhcccccccCceeccHHHHHHHHHHHHhhhhhhhhceeeeccCC-ceEEEEEecCCCcccccccc Confidence 00000000 0000000001111111234489999999999999999999887776543 4555544 44555666666 Q ss_pred CCCCCCCCCCccceEEEEeehhhhhh-hHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Q lcl|NC_020078. 79 TTPPPSTEPHTSKIFLKIDTVIIARN-AEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAA 157 (339) Q Consensus 79 ~~i~~~~~~~~~~~~l~ID~~~y~~~-~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~ 157 (339) ++......+...++++.+- ++..+ .|.+-=-.++.+|+.+.+.+++++++++..|+.|+.-. T Consensus 174 ~~~~~~~~~~~~~v~l~~~--k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~--------------- 236 (394) T protein:vir:10 174 AENPALAEPEFEQVDWSVS--TYRGAIPLSEEAIADSAVDLTSLVGQSINEKSVNTYNAMIAPVL--------------- 236 (394) T ss_pred ccccccccccceeEEeeee--eeEeeehhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcc--------------- Confidence 6665333455566666553 33332 22222122356899999999999999999998875221 Q ss_pred cccCccccccccccCccccccHHHHHHHHHHHHH-HHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcc--cccccc Q lcl|NC_020078. 158 QMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVE-KFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEY--VTSAGE 234 (339) Q Consensus 158 ~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~-~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~--~~~~~~ 234 (339) +.+.. .+.+.... ++.|.++.. .++... .-.+|++|..|..|.+-..- +..| ...... T Consensus 237 -----g~~~~---~~~~~~~~----~d~l~~~~~~~~~~~~------~a~~vmn~~~~~~l~~lkd~-~G~~i~~~~~~~ 297 (394) T protein:vir:10 237 -----QSFTA---KATTTDTL----VDSLKHILNVDLDPAY------SRALVVTQSLFNTLDTLKDK-NGRYLLHDASDS 297 (394) T ss_pred -----ccccc---cccccccc----HHHHHHHHHhhhhhhc------cCEEEecHHHHHHHHHhhcc-CCCeeeeccccc Confidence 01110 01111122 234444433 232221 22568999999998752111 1111 111111 Q ss_pred eeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhHHH Q lcl|NC_020078. 235 TLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWF 314 (339) Q Consensus 235 ~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~ 314 (339) ....|.-++++|.+|+.+++...... .++..-+-++| +.++..+...+++++..++ ..|... T Consensus 298 ~~~~~~~~~L~G~PV~~~~~~~~~~~--------~~~~~i~~gd~---------s~~~~~~~~~~~~v~~~~~-~~~~~~ 359 (394) T protein:vir:10 298 ITDGTAKGTVLGVPVYVVGDALLGSA--------AGDQKAFVGDL---------KRGVLFADRQQVTLAWEDS-KIYGRY 359 (394) T ss_pred cccCCcccccccceeEEecccccCCC--------CCceEEEEeec---------cccEEEEeecceEEEEecc-ccccee Confidence 11234446799999998765422111 11111111222 2233344445555554443 445566 Q ss_pred HHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 315 IDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 315 i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) +++.+-++.++++|++++.|+.+.+ T Consensus 360 ~~~~~r~d~~~~~~~ai~~~~~~~~ 384 (394) T protein:vir:10 360 LGAAFRFGVKQADSNAGYFVTNTDA 384 (394) T ss_pred EEEEEEeccEEeccccEEEEEeecc Confidence 7888889999999999999988888 No 124 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=99.06 E-value=5e-11 Score=77.05 Aligned_cols=305 Identities=14% Similarity=0.076 Sum_probs=143.8 Q ss_pred CccccCccc---CCCcccCCccCcccchhHHHH-HHHHHHHHHHHHHHhhhccccccccccc-cceEEEeccccc-e-ee Q lcl|NC_020078. 1 MSIFDGQTP---SYDVTRPNQRHGAGDPLADVT-EQFTGTVEGTIKRRSIMAGFVPVRSVRG-TSTISNRGISKA-K-LQ 73 (339) Q Consensus 1 ~~~~~~~~~---~~~~~r~~~~~~~~~~~a~~i-e~~~g~v~~~f~~~sv~~~~v~~r~i~~-G~tv~i~~iG~~-t-~~ 73 (339) ...+.+... .....|. ....++.--.+.+ +....++.+..+..+++++++++..+.+ +.++.||++-.. . .. T Consensus 138 ~~~~~~~~~~~~~~~~~~~-~~~~~~~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~ 216 (477) T protein:vir:84 138 VESDKEIRKIAKVGEEYRD-LDRNGGTGGYAVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTGTSTAI 216 (477) T ss_pred hhhhhhHHHHHHhhhhhcc-ccccCCCcceeeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecCcceee Confidence 000000000 0001111 1000100011333 4457788888888899999999888875 668999986332 2 22 Q ss_pred eccCCCCCCCCC----CCCccceEEEEeehhhhhhhHHHHHH-HhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_020078. 74 KIAPGTTPPPST----EPHTSKIFLKIDTVIIARNAEPMLDE-FQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIA 148 (339) Q Consensus 74 ~~~~g~~i~~~~----~~~~~~~~l~ID~~~y~~~~vdd~D~-~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~ 148 (339) -...|..++... .+.....++ +-.++.++..-.-.- -++.+++.+.+.++.++++++..|+.++ .+... T Consensus 217 ~~~Eg~~~~~~~~~~s~~~f~~i~~--~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~~l----~G~Gt 290 (477) T protein:vir:84 217 QAADNAALTAPSAHEVDLTDGFVQA--NVKTIAGQQGIAIQLLDQAAVSVDEFVFRDLAADYANKLNVQVI----SGTGS 290 (477) T ss_pred eeccCcccccccccccccceeeEEE--eeeeEEeeeHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHh----ccCCC Confidence 233344333211 122333333 333443332212111 2357899999999999999999998775 22211 Q ss_pred ccccccccccccCccccccccccC-ccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccch---- Q lcl|NC_020078. 149 SDSPYGTAAQMPGHSGGNVVTLAG-ANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHI---- 223 (339) Q Consensus 149 ~~~~~~~~~~~~g~~~~~~~~~~~-~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~---- 223 (339) .+.+ .+.....+.. .+..+. .+...+-..+++.|.++...+...... + ..+.+++|..|..|.+-..- T Consensus 291 ~~~p---~Gi~~~~~~~-~~~~~~~~~t~~~~~~~~~~i~~~~~~~~~~~~~-~--~~~~v~~~~~~~~l~~lkd~~G~~ 363 (477) T protein:vir:84 291 NNQV---VGVRATAGIT-QVTATSAGSALEKHQIIYQKIADAIQRVHTSRFL-E--PEVIVMHPRRWASFHAIFAGDDRP 363 (477) T ss_pred CCcc---ceeeeccccc-cccccccccchhhHHHHHHHHHHHHhhccccccC-C--ccEEEEcHHHHHHHHHhhccCCCe Confidence 1100 0111100011 111111 112223345666677776666544331 1 23568899999888552211 Q ss_pred -hhhcccc-----cccceeecceeEEEeceEEEEeccccccccccccccCCCcccc-ccccccceEEEEEeccceeEEEE Q lcl|NC_020078. 224 -TNGEYVT-----SAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEK-AYDGDFKDIVAQMFSPKALLAGS 296 (339) Q Consensus 224 -~n~d~~~-----~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~-~y~~~~~~~~~~~~h~~A~~~~~ 296 (339) ...++.+ .....+.+|..++++|.+|++|+.+|...+.. ++.. -+-++|+.. ...+ T Consensus 364 l~~~~~~~~~~~~~~~~~~~~~~~~~l~G~pVv~s~~~p~~~~~~-------~d~~~i~~gd~~~~----------~i~~ 426 (477) T protein:vir:84 364 LIVPSGPGFNNLGVLTEVASQRVVGQMHGLPVVTDPTLPTTLGTG-------TDQDVIHVLRASDL----------ALFE 426 (477) T ss_pred eeecCcccccccccccccccccccchhcccceEecCccccccccc-------CCcceEEEEEeceE----------EEEe Confidence 1111111 11122445666789999999999998542211 1111 112223221 1111 Q ss_pred EeeeeEEeeec----hhhhHHHHHHHHHhCCcccc-ccceEEEEecCC Q lcl|NC_020078. 297 TIPVTSKIFFD----DLSKLWFIDSWLAFGVTINR-TEYAGVIKLPAA 339 (339) Q Consensus 297 ~~~~~~e~~~~----~~~~~d~i~g~~~~Ga~v~r-Pe~~v~i~~~~a 339 (339) ..+.+++.+. .......+.+++.+ +.+| |++.+.|.-++. T Consensus 427 -~~~~~~~~~~~~~~~~~~~~~v~~~~~~--~~~r~~~afv~~t~~~~ 471 (477) T protein:vir:84 427 -SSVRMRALQETRAENLSVLLQVYGYLAF--TAARFPQSVVEIGGTAL 471 (477) T ss_pred -eceeEEeccccccccceeeeeehhhhhh--hhhccccceEEeecccc Confidence 1122232211 11111223444444 4666 999999887777 No 125 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=99.06 E-value=6e-12 Score=82.10 Aligned_cols=289 Identities=11% Similarity=0.046 Sum_probs=146.5 Q ss_pred Cccc-----cCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecccc-ceeee Q lcl|NC_020078. 1 MSIF-----DGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISK-AKLQK 74 (339) Q Consensus 1 ~~~~-----~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~-~t~~~ 74 (339) +... +...+..+-.+.. ...++--.+.-+.++.++.+..+..+.+++++++.... |+ ++||+.+. +.+.. T Consensus 119 ~~~~~~~~~~~~~~~~~~~~~~--~~~~~gg~~vP~~~~~~Ii~~l~~~~~i~~~~~~~~~~-g~-~~ip~~~~~~~a~~ 194 (425) T protein:vir:95 119 LKTGEYYKRSEVVEFYEKFRNL--RAVAGGELTIPEVVVNRIMDIMGDYTTLYPLVDKIRVK-GT-TRILVDTDTSPATW 194 (425) T ss_pred HhhhhhhhhhHHHHHHHHHHhh--cccccCceeccHHHHHHHHHHHHhhhhHHHhhceeecC-ce-eEEEEecCCccccc Confidence 1000 0001111111110 11111112445889999999999999999998877653 44 46777654 45556 Q ss_pred ccCCCCCCCCCCCCccceEEEEeehhhhhh-hHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-ccc Q lcl|NC_020078. 75 IAPGTTPPPSTEPHTSKIFLKIDTVIIARN-AEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIAS-DSP 152 (339) Q Consensus 75 ~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~-~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~-~~~ 152 (339) ...|++++....+..++.++.. .++..+ .|.+-=-.++..++.+.+.++.++++++..|+.|+. +.... +.+ T Consensus 195 v~E~~~~~~~~~~~f~~i~l~~--~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~----G~G~~~~~p 268 (425) T protein:vir:95 195 IEQSGALPTGDVGTIASIDFDG--FKVGKVTFVDNYLLQDSIINLDDYVTKKIARAIAKALDLAIVK----GTGAANKQP 268 (425) T ss_pred cccccccccccccccceeeeeh--eeeeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhc----cCCCCcccc Confidence 6677777644333445555544 344332 332211123556899999999999999999998752 11000 000 Q ss_pred ccccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHH-HHHHHhcccchhh--hccc Q lcl|NC_020078. 153 YGTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPA-AFTALMQAEHITN--GEYV 229 (339) Q Consensus 153 ~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~-~~~~Ll~~~~~~n--~d~~ 229 (339) .+..++...... .....+.. .++.+.++...+...... . .+-+.+++|. +|..|.+-..+.+ ..|. T Consensus 269 ---~Gil~~~~~~~~--~~~~~~~~----~~~~~~~~~~~~~~~~~~-~-~~~~~v~~~~~~~~~l~~l~~~kd~~g~~i 337 (425) T protein:vir:95 269 ---LGIIPSLPPENQ--VTVEADNN----LLKNLVKQIGLIDTGDDS-V-GEIVAVMKRSTYYNRLVEFSIQVDSNGNVV 337 (425) T ss_pred ---ceeecccccccc--cccccccc----hHHHHHHHHHhhhhhccc-c-CceEEEEeChHHHHHHHHHHhhcCCCCcee Confidence 011111111100 01111222 334455555544433221 1 2223355555 4544433222211 1222 Q ss_pred ccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechh Q lcl|NC_020078. 230 TSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDL 309 (339) Q Consensus 230 ~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~ 309 (339) .. ..++...+++|.+|+.|+++|.... +-++|+.. .+ +.-.++..+..++. T Consensus 338 ~~----~~~~~~~~l~G~pvv~~~~~~~~~i--------------~~Gd~~~~--------~~--~~~~~~~i~~~~~~- 388 (425) T protein:vir:95 338 GK----LPNLRTPDLLGLRVVFNNFLDDDTV--------------LFGEFEQY--------TL--VERENITIDSSTHV- 388 (425) T ss_pred ec----cCCCCCccccceeeEEcCcCCCccE--------------EEEecccE--------EE--EeecceEEEeeccc- Confidence 11 1245566799999999999984321 11233321 11 22233344444433 Q ss_pred hhH---HHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 310 SKL---WFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 310 ~~~---d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) +|. ..+++.+-++.++++|++.+.++++.- T Consensus 389 ~f~~~~~~~~~~~r~d~~~~~~~a~~~~~i~~~ 421 (425) T protein:vir:95 389 KFTEDQTAFRGKGRFDGKPVKPEAFVLVTITDP 421 (425) T ss_pred ccccCceEEEEEEeeCcEeecccceEEEEecCc Confidence 343 346667788999999999999998885 No 126 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=99.06 E-value=3.7e-11 Score=77.81 Aligned_cols=299 Identities=11% Similarity=0.060 Sum_probs=152.2 Q ss_pred CccccCc---ccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc-ccceeeecc Q lcl|NC_020078. 1 MSIFDGQ---TPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI-SKAKLQKIA 76 (339) Q Consensus 1 ~~~~~~~---~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i-G~~t~~~~~ 76 (339) |.+=+.. |-..+..|--. -..++.-.+..+.++.++.+..++.+.++++.++..+. +++.+||+. +.+.+..+. T Consensus 1 ~~~~~~r~~~~~~~~e~~a~~-~~~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~-~~~~~~p~~~~~~~a~~v~ 78 (326) T protein:vir:42 1 MAVNPDRTTPFLGVNDPKVAQ-TGDSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKIPMG-TTGQKIPHWTGDVSASWIG 78 (326) T ss_pred CCCCccchhhhcCcchhhhee-ccccCCcceechhhHHHHHHHHHhcchhhhhcceeecc-CCceEEEEEeCCcceEEec Confidence 2211100 00011111100 01111112445889999999999999998887766554 556788775 456777788 Q ss_pred CCCCCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Q lcl|NC_020078. 77 PGTTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTA 156 (339) Q Consensus 77 ~g~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~ 156 (339) .|+.++.. .+..++.++..-. ....+.|.+-=-.++.+|+.+.+.++.++++++..|+.++ .+.....+.. T Consensus 79 Eg~~~~~~-~~~f~~i~~~~~k-~~~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l----~G~gs~~p~g--- 149 (326) T protein:vir:42 79 EGDMKPIT-KGNMTSQTIAPHK-IATIFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAI----NGTDSPFPTF--- 149 (326) T ss_pred CCcccccc-ccceeEEEEeeEE-EEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhh----cccCCCcccc--- Confidence 88888764 4667777776642 2233334332222456899999999999999999999875 2221111100 Q ss_pred ccccCccccccccccCccccccHHHHHHH-HHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccc--- Q lcl|NC_020078. 157 AQMPGHSGGNVVTLAGANDYKDPAKLYAA-IASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSA--- 232 (339) Q Consensus 157 ~~~~g~~~~~~~~~~~~~~~~~~~~l~~a-i~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~--- 232 (339) ............ ...+.........+. +..+...+..... ..-..+++|..|..|.+-..- +..|.-.. T Consensus 150 -i~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~a~~v~n~~~~~~L~~lkd~-~G~~l~~~~~~ 222 (326) T protein:vir:42 150 -LAQTTKEVSLVD-PDGTGSNADLTVYDAVAVNALSLLVNAGK----KWTHTLLDDITEPILNGAKDK-SGRPLFIESTY 222 (326) T ss_pred -ccccccccceee-cccccccccchhHHHHHHHHHhhhhhhcc----CccEEEEeHHHHHHHHHhhcc-CCceeeccccc Confidence 000000000000 011100000011111 2222233322221 223457899999999642110 11111000 Q ss_pred cceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechh--- Q lcl|NC_020078. 233 GETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDL--- 309 (339) Q Consensus 233 ~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~--- 309 (339) .........+.+.|++|+.++++|.+. ...+-++|++.. +..+.. +.++..++.. T Consensus 223 ~~~~~~~~~~~l~G~pv~~~~~~~~~~------------~~~~~Gd~s~~~--~~~~~~--------~~v~~~~e~~~~~ 280 (326) T protein:vir:42 223 TEENSPFRLGRIVARPTILSDHVASGT------------VVGYQGDFRQLV--WGQVGG--------LSFDVTDQATLNL 280 (326) T ss_pred cCccccccCceeeeeeEEEcCCCCCCc------------eEEEEeecceEE--EEEecc--------eEEEEeecceeee Confidence 001112234579999999999997431 111333444432 222222 2333222211 Q ss_pred -----------hhH--HHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 310 -----------SKL--WFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 310 -----------~~~--d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) ... -.+++.+-++.+++||++.+.|+--+| T Consensus 281 ~~~~~~~~~~~~~~d~~~~r~~~~~d~~v~~~~a~~~l~~~~~ 323 (326) T protein:vir:42 281 GTPQAPNFVSLWQHNLVAVRVEAEYAFHCNDKDAFVKLTNVDA 323 (326) T ss_pred cccccccchhhhhcCcEEEEEEEEeccEEecccceEEEeeccc Confidence 111 335788899999999999999888777 No 127 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=99.06 E-value=4.2e-11 Score=77.47 Aligned_cols=295 Identities=15% Similarity=0.118 Sum_probs=152.8 Q ss_pred Cccc---------------cCcccCC-CcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccc-cccccccccceEE Q lcl|NC_020078. 1 MSIF---------------DGQTPSY-DVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGF-VPVRSVRGTSTIS 63 (339) Q Consensus 1 ~~~~---------------~~~~~~~-~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~-v~~r~i~~G~tv~ 63 (339) .++. .+..... +....+-. +.|. .+.-+.++.++.+..+..++++.+ .++-+...| .+. T Consensus 103 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~gg--~lvP~~~~~~ii~~l~~~~~i~~~~~~~v~~~~~-~~~ 178 (435) T protein:vir:80 103 RALAAARGDAQLASKLAIERGFGEEVAMSLNTLSP-GAGG--VLVPENLSSEVIELLRPKSVVRKLGARTLPLSNG-NIT 178 (435) T ss_pred HHHHhccchhHHHHHHHHhhhhhhhhhhhhcccCC-CCCc--cccchhHHHHHHHHHhhhchhhhccceeeecCCC-ceE Confidence 0000 0100000 00000000 1111 133478889999988888888876 333333334 477 Q ss_pred Eecc-ccceeeeccCCCCCCCCCCCCccceEEEEeehhhhhhhH--HHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020078. 64 NRGI-SKAKLQKIAPGTTPPPSTEPHTSKIFLKIDTVIIARNAE--PMLDEFQTDFDYQGEVAREQGQEIANMYDETFFI 140 (339) Q Consensus 64 i~~i-G~~t~~~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~~v--dd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~ 140 (339) +|++ +.+.+.-...|+.++. ..+..++.++...+. +..+.| .-++++...+++.+.+.++.++++++..|+.++. T Consensus 179 ~p~~~~~~~a~~v~E~~~~~~-~~~~f~~i~~~~~k~-~~~~~is~ell~ds~~~~~l~~~i~~~l~~a~~~~~d~a~l~ 256 (435) T protein:vir:80 179 IPRLKGGAIVGYIGADTDIPT-TQQQFDDLKLTAKKM-AALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIR 256 (435) T ss_pred EEEEeCCcceeeeccCccccc-cccceeeEEEeeEEE-EEeehhhHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 8777 5666666677777765 345566666655322 222223 2334433456899999999999999999998752 Q ss_pred HHHhhcccccccccccccccCccccccc-cccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhc Q lcl|NC_020078. 141 MAAKAAIASDSPYGTAAQMPGHSGGNVV-TLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQ 219 (339) Q Consensus 141 ~l~~aA~~~~~~~~~~~~~~g~~~~~~~-~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~ 219 (339) +....+ ...|....... ....++...+...++..+.++...|...+.. .... ..|++|..|..|.+ T Consensus 257 ----G~G~~~-------~p~Gi~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~-~~~~-~~vmn~~~~~~L~~ 323 (435) T protein:vir:80 257 ----DDGTAN-------TPKGLRFWALPGNVITASDGSTLQKIETDLGKAILALENADAN-LTQP-GWIMAPRTFRFLEG 323 (435) T ss_pred ----cCCCCC-------cccceeecccccceeecccccchhhHHHHHHHHHHHhhccccc-cccC-EEEEcHHHHHHHHh Confidence 211111 11121111100 0111112222333444456666666555442 2223 44899999998854 Q ss_pred ccchhhhcccccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEee Q lcl|NC_020078. 220 AEHITNGEYVTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIP 299 (339) Q Consensus 220 ~~~~~n~d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~ 299 (339) -.. .+..|.-. .+.. ++++|.+|++++++|......+. .+.-|-++|+. +..+...+ T Consensus 324 lkd-~~G~~l~~---~~~~---~~l~G~pv~~~~~~p~~~~~~~~------~~~i~~gd~s~----------~~i~~~~~ 380 (435) T protein:vir:80 324 LRD-GNGNKVYP---ELAN---GMLKGYPVGKTTQVPINLGEAGK------ESEIYFTDFGD----------VFIGEEET 380 (435) T ss_pred hhc-cCCceecc---CCCC---CeEeeeeeEEeccccccccCCCC------cceEEEEEccc----------EEEEeecc Confidence 211 11222111 1122 36899999999999854221111 11122233333 22334445 Q ss_pred eeEEeeechh----------hh---HHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 300 VTSKIFFDDL----------SK---LWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 300 ~~~e~~~~~~----------~~---~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) ++++..++.. .| .-.+++.+.|+.++.||++++.|. ++ T Consensus 381 ~~i~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~--~~ 431 (435) T protein:vir:80 381 LEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLS--GV 431 (435) T ss_pred eEEEEeccccccccccchhhhhhcCcceeeeeeeeCcEeecccceEEEe--cc Confidence 5666655442 11 245688899999999999998883 33 No 128 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=99.03 E-value=8.5e-11 Score=75.80 Aligned_cols=297 Identities=12% Similarity=0.038 Sum_probs=147.8 Q ss_pred Ccc-------------ccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccc-cccccccccceEEEec Q lcl|NC_020078. 1 MSI-------------FDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGF-VPVRSVRGTSTISNRG 66 (339) Q Consensus 1 ~~~-------------~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~-v~~r~i~~G~tv~i~~ 66 (339) +++ .....+.....|..... ++.--.+.-+.|+.++.+..+..++++.+ +++-+...| .+.||+ T Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~gg~liP~~~~~~ii~~l~~~~~l~~~~~~~~~~~~g-~~~~p~ 175 (428) T protein:vir:10 98 MSIAAAQGNLQDAAKFASDELNDQSVSMAISTA-AGSGGVLIPQNIHSEVIELLRDRTIVRKLGARSIPLPNG-NMSLPR 175 (428) T ss_pred HHHHHhhhhHHHHHHHhhhhhhhhhHhhhhccc-ccCCccccchhHHHHHHHHHhhhchhhhhcceeeecCCc-ceEEEE Confidence 110 00112222333332111 11101123377888888888888898887 332222223 477887 Q ss_pred c-ccceeeeccCCCCCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_020078. 67 I-SKAKLQKIAPGTTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKA 145 (339) Q Consensus 67 i-G~~t~~~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~a 145 (339) + +.+++.....|+.++.. .+..++.++..- ..+..+.|.+-=-.++.+++.+.+.++.+++|++..|+.++. + T Consensus 176 ~~~~~~a~~v~Eg~~~~~~-~~~f~~i~~~~~-k~~~~v~is~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~----G 249 (428) T protein:vir:10 176 LAGGATASYTGENQDAKVS-EARFDDVKLTAK-TMIAMVPISNALIGRAGFNVEQLVLQDILTAISVREDKAFMR----D 249 (428) T ss_pred EeCCcceeeeccCcccccc-ccceeeEEeeeE-EEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhc----c Confidence 6 45677777778887754 456666666553 222223333321224678899999999999999999998751 2 Q ss_pred cccccccccccccccCcc----ccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhccc Q lcl|NC_020078. 146 AIASDSPYGTAAQMPGHS----GGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAE 221 (339) Q Consensus 146 A~~~~~~~~~~~~~~g~~----~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~ 221 (339) .... ....|.- ....+.........+...+. ...++...+............| +++|..|..|.+-. T Consensus 250 ~G~~-------~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~-v~n~~~~~~L~~lk 320 (428) T protein:vir:10 250 DGTG-------DTPIGMKARATQWNRLLPWAADAAVNLDTID-TYLDSIILMSMDGNSNMISSGW-GMSNRTYMKLFGLR 320 (428) T ss_pred CCCC-------ccccccccccccccccccccccccccHHHHH-HHHHHHHHhhhccccccccCEE-EEcHHHHHHHHHhh Confidence 1111 1111111 11111111112222222221 1122211111111111222344 77999999885522 Q ss_pred chhhhcccccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeee Q lcl|NC_020078. 222 HITNGEYVTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVT 301 (339) Q Consensus 222 ~~~n~d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~ 301 (339) . .+..|.-.. ... ++++|.+|+.|+++|......+ ....-|-++| +-+..+...++. T Consensus 321 d-~~G~~i~~~---~~~---g~l~G~pv~~~~~~p~~~~~~~------~~~~i~~gd~----------s~~~i~~~~~i~ 377 (428) T protein:vir:10 321 D-GNGNKVYPE---MAQ---GMLKGYPIQRTSAIPANLGEGG------KESEIYFADF----------NDVVIGEDGNMK 377 (428) T ss_pred c-cCCceeccC---CCC---CeeeceeeEEeccccccccCCC------ccceEEEEec----------ceEEEEEecceE Confidence 1 222221111 122 3689999999999985421111 0111122233 333333444555 Q ss_pred EEeeechhh----------h---HHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 302 SKIFFDDLS----------K---LWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 302 ~e~~~~~~~----------~---~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) ++..++..+ | .-.+++.+.++.++.||++.+.+ ++. T Consensus 378 i~~~~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~--t~~ 426 (428) T protein:vir:10 378 VDFSKEASYIDTDGKLVSAFSRNQSLIRVVTEHDIGFRHPEGLVLG--TGV 426 (428) T ss_pred EEeecccccccccccccchhhcchhheeeeeeeCceeeccceEEEE--ecc Confidence 555554321 1 23578889999999999999888 444 No 129 >protein:vir:93696 Length: 364 # NCBI annotation: Bcep22gp55 # Family: family:all:974 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944284;genbank:gi:38640361;genbank:GeneID:2658350 Probab=99.03 E-value=2.7e-10 Score=73.05 Aligned_cols=304 Identities=13% Similarity=0.075 Sum_probs=167.8 Q ss_pred CCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhcc----------cccccccc--ccceEEEeccccceeeeccCC Q lcl|NC_020078. 11 YDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAG----------FVPVRSVR--GTSTISNRGISKAKLQKIAPG 78 (339) Q Consensus 11 ~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~----------~v~~r~i~--~G~tv~i~~iG~~t~~~~~~g 78 (339) -..|.. +++|+.+ .++|+..|..+-.+.+-|.+ +++..++. .|++|.|.-+...+-.....+ T Consensus 1 Ma~T~~----~~~~p~a--~~~ws~~l~~~~~~~s~f~~~l~G~~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~g~gv~Gd 74 (364) T protein:vir:93 1 MSQTVI----PFGDPKA--VKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDRITFDLSVHLRGKPTYGD 74 (364) T ss_pred Cceecc----CcCCHHH--HHHHHHHHHHHHHhhCccccccccCCCCCcEEEeeecCCCCCceEEeeeeeecccCCcccC Confidence 223333 3466765 49999999998877765554 22323454 399999999988876666767 Q ss_pred CCCCCCC-CCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Q lcl|NC_020078. 79 TTPPPST-EPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAA 157 (339) Q Consensus 79 ~~i~~~~-~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~ 157 (339) +.+++.+ .++....+|+|||.--.-..=..+++-.+-+|+|.+....++.=+++..|+.+|..|..+ +-.+..-.... T Consensus 75 ~~leGnee~L~~~~~~i~idq~r~~V~~~g~ms~qRt~~dlr~~ar~~L~~w~~~~~d~~~f~~laGa-rg~~~~~~~~~ 153 (364) T protein:vir:93 75 ARVEGKEESLRFYQDEVRIDQVRHSVSAGGRMSRKRTVHNIRRIARDRLGDYFYKFTDELLFIYLSGA-RGINLDFIETP 153 (364) T ss_pred ceeeccccceeEEeeEEEEeeccccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc-ccccccccccc Confidence 7787764 366667789999765432111457777789999999999999999999999999888642 21111100000 Q ss_pred cccCcc--------cccccccc--------CccccccHHHHHHHHHHHHHHHHhcCCCCCc-----------CCe-EEEE Q lcl|NC_020078. 158 QMPGHS--------GGNVVTLA--------GANDYKDPAKLYAAIASLVEKFLEKDVRPNE-----------EDM-ILVL 209 (339) Q Consensus 158 ~~~g~~--------~~~~~~~~--------~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~-----------~~R-~~vv 209 (339) ..++.. ...++-.. .+++..+ ++.|..+...+.....+.|+ ++. ++++ T Consensus 154 ~~~~~~~N~v~aPt~~r~~~~~~at~~~~l~stD~~s----l~~id~a~~~a~~~~~~~~~~~~~~Pv~~~g~~~yV~~l 229 (364) T protein:vir:93 154 DFTGYAGNPLDAPDVDHLLYGGVATSKASLAATDIMA----PLVIEKAVEKAAMMQAENPDVANMVPVSIDGDDHYVCVM 229 (364) T ss_pred CcccccccccCCCCCCcEEeccccCchhhcccccccc----HHHHHHHHHHHHHhCCCCCCCcccceeEecCcceeEEEE Confidence 000000 00001110 1112222 34555666655443321111 122 6679 Q ss_pred CHHHHHHHhc--ccchhhhc---c-cccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEE Q lcl|NC_020078. 210 PPAAFTALMQ--AEHITNGE---Y-VTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIV 283 (339) Q Consensus 210 ~P~~~~~Ll~--~~~~~n~d---~-~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~ 283 (339) .|.++..|.. ++.+.+-. . ......++..|.++.+.|+.|++.++++.. +..+..+++. +. - T Consensus 230 ~p~q~~~Lr~~t~~~w~d~qk~A~~~~g~~nPlF~G~~gm~ngvii~~~~~vi~~-----~~~~~~~~v~---~~----r 297 (364) T protein:vir:93 230 SEYQATDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRF-----NDYGAGANVE---AA----R 297 (364) T ss_pred cchhhhhhhhcCCHHHHHHHHHhhhcccccCCceecCeeeEcCeEEeccCCcccc-----cccccCcccc---ch----h Confidence 9999999985 33432211 1 111234588899999999999998887633 1112222221 11 1 Q ss_pred EEEeccceeEEEEEeeeeEEeeech------hhhHHHHHHHHHhCCccccc--cceEEEEecCC Q lcl|NC_020078. 284 AQMFSPKALLAGSTIPVTSKIFFDD------LSKLWFIDSWLAFGVTINRT--EYAGVIKLPAA 339 (339) Q Consensus 284 ~~~~h~~A~~~~~~~~~~~e~~~~~------~~~~d~i~g~~~~Ga~v~rP--e~~v~i~~~~a 339 (339) +|++-..|++.+-...=-+...|.+ ++++ |-....+|.+=.|= +=-++|.+++| T Consensus 298 alllGaQA~~~a~g~~~g~~~~w~Ee~~D~gn~~~--i~~~~i~G~kK~rF~~~DfGvi~idta 359 (364) T protein:vir:93 298 ALFMGRQAGVIAYGTANGLRFDWEETVKDYGNEPA--IAAGFIAGMKKARFNNKDFGVISIDTA 359 (364) T ss_pred hheecceeeEEEeecCCCCCceeeecccCCCCchh--hhhhhHhhhhhcccCCccceEEEeccc Confidence 3555566654443221011222222 2221 22233444444432 33667777776 No 130 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=99.03 E-value=2.9e-11 Score=78.39 Aligned_cols=293 Identities=10% Similarity=-0.039 Sum_probs=148.4 Q ss_pred CccccCcccC-------CCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc-cccee Q lcl|NC_020078. 1 MSIFDGQTPS-------YDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI-SKAKL 72 (339) Q Consensus 1 ~~~~~~~~~~-------~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i-G~~t~ 72 (339) +....|.-.- ++-.+- .++.++--.+.-+.++.++.+..++.+.+++++++..+.+|. ..|++. +..++ T Consensus 63 ~~~~~~~~~l~~~~r~~~~~~~~--~~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~~-~~i~~~~~~~~a 139 (390) T protein:vir:40 63 VLASRGANALTSDESKYYNEVIA--GNGFAGVTALLPPTVFERVFEDLTVEHPLLSKINFVNTTATT-EWIISVGDVATA 139 (390) T ss_pred HHHhcCchhccHHHHHHHHHHHh--ccCcccCcccccHHHHHHHHHHHHhhhhhhhhceeeecCCce-eEEEEEcCCcce Confidence 0000000000 000000 011112222444999999999999999999999888766544 556654 55566 Q ss_pred eeccCCCCCCCCCCCCccceEEEEeehhhhh-hhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Q lcl|NC_020078. 73 QKIAPGTTPPPSTEPHTSKIFLKIDTVIIAR-NAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDS 151 (339) Q Consensus 73 ~~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~-~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~ 151 (339) .....+..+.....+..+++++..- ++.. +.|.+-=-.++.+|+.+.+.++.++++++..|+.++. +.....| T Consensus 140 ~~~~E~~~~~~~~~~~f~~i~l~~~--k~~~~i~iS~ell~ds~~~l~~~i~~~la~~i~~~~~~a~l~----G~G~~~P 213 (390) T protein:vir:40 140 WWGPLCAEIKEVLDNGFDKIQTGMY--KLSAYIPVCNAMLDLGPSWLDQYVRTILGEAMALGLEAGIVN----GSGKDQP 213 (390) T ss_pred eeeccccccCccccccceeeEeeee--eEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhc----ccCCCcc Confidence 6666666665444455666666553 3333 3333222224678899999999999999999998752 2111111 Q ss_pred cccccccccCcccccccc-ccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccc Q lcl|NC_020078. 152 PYGTAAQMPGHSGGNVVT-LAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVT 230 (339) Q Consensus 152 ~~~~~~~~~g~~~~~~~~-~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~ 230 (339) . +........+... ........+...+.+.+..+...+...- ..-..+-..+++|..+..+|+..+.. .. T Consensus 214 ~----Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~-~~~~~~a~~i~n~~t~~~~l~~~~~~----~d 284 (390) T protein:vir:40 214 I----GMMRDLNNVTAGEHPVKTATPLTDLTPATLATKVMLPLTDNG-KKSVSDAILVINPADYWSKIYAATSY----MT 284 (390) T ss_pred c----eeeeccccccccccccccccccchhhHHHHHHHHHHHhhcch-hhhhcCceEEEcchhHHHHHHHHhhc----cC Confidence 0 0000000000000 0001111122222333333333332211 00112334578887765555432211 11 Q ss_pred cccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhh Q lcl|NC_020078. 231 SAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLS 310 (339) Q Consensus 231 ~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~ 310 (339) .++..+... ...|.+|++|+++|.... +-++|+. ...+...+++++..+ +.+ T Consensus 285 ~~G~~v~~~---~~~g~pvv~~~~~p~~~i--------------~~Gd~s~----------~~i~~~~~~~v~~~~-~~~ 336 (390) T protein:vir:40 285 PQGVWVTGI---LPVPLEIVQSVAVPVGKA--------------VAGRAKD----------YFMGIGSEQVIRTST-EYR 336 (390) T ss_pred CCCcccccc---CCCceeEEEcCCCCCCcE--------------EEEeece----------EEEEeecceEEEecc-hhh Confidence 111112111 246999999999984311 1123332 112233445555543 223 Q ss_pred h---HHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 311 K---LWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 311 ~---~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) | ...+++.+-++.++++|++.+.++++++ T Consensus 337 f~~~~~~~r~~~r~dg~v~~~~A~~~l~~~~~ 368 (390) T protein:vir:40 337 LLDDETLYYAKQYANGRPKDNSSFLVFDITGL 368 (390) T ss_pred hhcCcEEEEEEEEeCCEEecccceEEEEeecc Confidence 3 2346788999999999999999999998 No 131 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=99.02 E-value=3e-11 Score=78.27 Aligned_cols=284 Identities=10% Similarity=0.016 Sum_probs=151.5 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccc-cceEEEecccc--ceeeeccC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRG-TSTISNRGISK--AKLQKIAP 77 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~-G~tv~i~~iG~--~t~~~~~~ 77 (339) +.-..+ .....-.|.-..+..++--.+.-+.|+.++.+..+..+.++++++...+.. ..++.++.... ..+..... T Consensus 101 ~~~~~~-~~~~~~~~a~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E 179 (408) T protein:vir:10 101 VRNPMA-FMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAE 179 (408) T ss_pred hhcchh-hhhhhhhhhhhcccccCCceeccHhHHHHHHHHHHhhchhhhhcceeeccCCcceEEEeeccccccceeeecC Confidence 000000 011111111111111111123458999999999999999999998887654 33455555433 34445556 Q ss_pred CCCCCCCCCCCccceEEEEeehhhhhh-hHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Q lcl|NC_020078. 78 GTTPPPSTEPHTSKIFLKIDTVIIARN-AEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTA 156 (339) Q Consensus 78 g~~i~~~~~~~~~~~~l~ID~~~y~~~-~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~ 156 (339) |+.++....+..+++++..- ++..+ .|.+-=--++.+|+.+.+.++.++++++..|+.|+.-. T Consensus 180 ~~~~~~~~~~~~~~i~~~~~--k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~-------------- 243 (408) T protein:vir:10 180 DGKIPDLDNPQLTIIKYLIK--RYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVM-------------- 243 (408) T ss_pred ccccccccCcceeeEEeeee--eEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcc-------------- Confidence 76665433344555555543 33332 22221111356899999999999999999999875211 Q ss_pred ccccCccccccccccCccccccHHHHHHHHHHHH-HHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccce Q lcl|NC_020078. 157 AQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLV-EKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGET 235 (339) Q Consensus 157 ~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~-~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~ 235 (339) +.+. ......+.+ .+.++. ..|... .. .+-.++++|..|..|.+-..- +..|.-.. . T Consensus 244 ------g~~~-----~~~~~~~~~----~l~~~~~~~~~~~-~~---~~a~~v~n~~~~~~l~~lkd~-~G~~i~~~--~ 301 (408) T protein:vir:10 244 ------KAAP-----KKPTIAKFD----DVITMINTAVDPA-II---ATSSLLTNQSGLNKLALVKTA-EGKYLLEP--D 301 (408) T ss_pred ------cccc-----cccccccHH----HHHHHHHHhhhhh-hc---cCCEEEEcHHHHHHHHHhhcc-CCceEecc--C Confidence 1110 011112233 333433 234332 21 122458999999999763322 22222211 1 Q ss_pred eecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeech-hhh--- Q lcl|NC_020078. 236 LNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDD-LSK--- 311 (339) Q Consensus 236 l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~-~~~--- 311 (339) +.+|...+++|++|+.+.+.+.... .+++..-|-+++ +.++..+.-.+++++..+.. ..| T Consensus 302 ~~~~~~~~l~G~PV~~~~~~~~~~~-------~~~~~~i~~gd~---------~~~~~~~~~~~~~v~~~~~~~~~f~~~ 365 (408) T protein:vir:10 302 PTKPNSYLIKGKQVIVVADRWLPNT-------GSTVYPLYYGDM---------SQAITLFDRENMSLLPTNIGAGAFETD 365 (408) T ss_pred cCCCCCceecceeeEEecccccCcc-------CCCceEEEEEeh---------hccEEEEEecceEEEEcccccchhhcC Confidence 3456667899999998765321111 011111111222 23444444455565554432 222 Q ss_pred HHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 312 LWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 312 ~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) ...+++.+.++.++++|++.+.++++++ T Consensus 366 ~~~~r~~~r~d~~v~~~~a~~~~~~~~~ 393 (408) T protein:vir:10 366 TTKIRVIDRFDVKATDSEALVAGSFSAI 393 (408) T ss_pred ceEEEEEEeeccEEeccccEEEEEeecc Confidence 2356778889999999999999998887 No 132 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=99.00 E-value=3.2e-11 Score=78.15 Aligned_cols=283 Identities=13% Similarity=0.117 Sum_probs=150.2 Q ss_pred Cc----cccCcccCC-CcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc--ccceee Q lcl|NC_020078. 1 MS----IFDGQTPSY-DVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI--SKAKLQ 73 (339) Q Consensus 1 ~~----~~~~~~~~~-~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i--G~~t~~ 73 (339) ++ .|....=.. ...+.-....+++.-.+..+.|+.++.+..+..+.+++++++..+.++ +.+++.. +..... T Consensus 88 ~~~~~~~~~~~lr~~~~~~~~~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 166 (389) T protein:vir:10 88 IDAKKKAINDFIHSHGKVIDATSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTP-KGTYPILKRATDRFS 166 (389) T ss_pred HHHHHHHHHHHhhcchhhhhhhcccccCCcceeehHHHHHHHHHHHHhhhhHHhhcceeeccCC-eeEEEEEecCCCccc Confidence 00 000000000 011111111111111133488999999999999999999887776543 3444443 344545 Q ss_pred eccCCCCCCCCCCCCccceEEEEeehhhhhh-hHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Q lcl|NC_020078. 74 KIAPGTTPPPSTEPHTSKIFLKIDTVIIARN-AEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSP 152 (339) Q Consensus 74 ~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~-~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~ 152 (339) ....++.......+...+.++... ++..+ .|.+-=-.++.+|+.+.+.++.+++|++..|..|+.-+- T Consensus 167 ~~~E~~~~~~~~~~~~~~i~~~~~--k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~--------- 235 (389) T protein:vir:10 167 SVAELAENPKLAEPEFNKVDWSVA--TYRGAIPLSEEAIADSAVDLTALVGQSIKEKSVNTYNAMIAPVLQ--------- 235 (389) T ss_pred cccccccccccccccceeeeeehe--eeEeeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhhhc--------- Confidence 666666665433455566666553 32222 222111113567899999999999999999988752211 Q ss_pred ccccccccCccccccccccCccccccHHHHHHHHHHHHH-HHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhccccc Q lcl|NC_020078. 153 YGTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVE-KFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTS 231 (339) Q Consensus 153 ~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~-~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~ 231 (339) .+. ..+.+...+ ++.+.++.. .++.. .+-.++++|..|..|.+-.. .+..|.-. T Consensus 236 -----------~~~---~~~~~~~~~----~d~l~~~~~~~~~~~------~~a~~~~n~~~~~~L~~lkd-~~G~~i~~ 290 (389) T protein:vir:10 236 -----------SFT---AKKTTTDTL----VDSLKHILNVDLDPA------YSRALVVTQSLFNTLDTLKD-KNGRYLLH 290 (389) T ss_pred -----------ccc---ccccccccc----HHHHHHHHHhhhhhh------hCcEEEecHHHHHHHHHhhc-cCCCeeee Confidence 010 111122222 333444432 23221 12356899999999975321 11122111 Q ss_pred c--cceeecceeEEEeceEEEEeccc-cccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeech Q lcl|NC_020078. 232 A--GETLNTKYMFAAFGVPVITSNNA-VFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDD 308 (339) Q Consensus 232 ~--~~~l~~G~v~~i~G~~V~~Snnl-p~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~ 308 (339) . ......|...+++|.+|+.+++. |+.. +++..-+-++|+ .++......+++++..++ T Consensus 291 ~~~~~~~~~~~~~~l~G~pV~~~~~~~~~~~---------~~~~~~~~gd~~---------~~~~~~~~~~~~i~~~~~- 351 (389) T protein:vir:10 291 DASDSITDGTAKGTILGVPVYVVGDTLLGSL---------AGDQKAFVGDLK---------RGVLFTDRQQVTLAWEDS- 351 (389) T ss_pred cCcccccccccccccccceeEEecccccCCC---------CCceEEEEeecc---------ccEEEEeecceEEEeecc- Confidence 1 11112345568999999887653 3221 111111222222 233334444556665543 Q ss_pred hhhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 309 LSKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 309 ~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) ..|...+++.+-+|.++++|++.+.++++.+ T Consensus 352 ~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~~ 382 (389) T protein:vir:10 352 KIYGKYLGAAFRFGVQKADSKAGYFVTNTDV 382 (389) T ss_pred ccccceEEEEEEeccEEecccceEEEEeecc Confidence 4566678889999999999999999998876 No 133 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=99.00 E-value=3.1e-11 Score=78.21 Aligned_cols=273 Identities=15% Similarity=0.170 Sum_probs=149.9 Q ss_pred CccccCcc-cCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc--ccceeeeccC Q lcl|NC_020078. 1 MSIFDGQT-PSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI--SKAKLQKIAP 77 (339) Q Consensus 1 ~~~~~~~~-~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i--G~~t~~~~~~ 77 (339) ...-.+.. ...+..+.+-....|. .+.-+.|+.++.+..+..+.+++++++..+.+|+ .++|.. +..++..... T Consensus 113 ~~~~~~~~~~~~~~~~~~~t~~~gg--~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~v~E 189 (394) T protein:vir:97 113 EVLMPINETTPVEPQKDGIKKENAK--PVSSEEILYTPAREVKTVVDLKPFTTVYQAKKAS-GKYPVLQRATTKMVTVAE 189 (394) T ss_pred HHHHHHHhhhhhhhhcccccccccc--ccChHHHHHHHHHHhhhhhhhhhhceeeeccCcc-eEEEEEecCCCccceecc Confidence 00000000 0011111111111111 1344889999998888899999998887766553 556654 4456666767 Q ss_pred CCCCCCCCCCCccceEEEEeehhhhhh-hHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Q lcl|NC_020078. 78 GTTPPPSTEPHTSKIFLKIDTVIIARN-AEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTA 156 (339) Q Consensus 78 g~~i~~~~~~~~~~~~l~ID~~~y~~~-~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~ 156 (339) |+..+....+..+++++... ++..+ .|.+-=-.++.+|+.+.+.++.+++|++..|+.|+.-+ T Consensus 190 ~~~~~~~~~~~~~~v~l~~~--k~~~~i~is~ell~ds~~~~~~~i~~~la~~~~~~~~~~i~~g~-------------- 253 (394) T protein:vir:97 190 LEKNPALAKPDFKDVAWNID--TYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVL-------------- 253 (394) T ss_pred cccccccccccceeEEeehh--heeeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcc-------------- Confidence 77665433345566666553 33322 22221112356789999999999999999998875321 Q ss_pred ccccCccccccccccCccccccHHHHHHHHHHHHHH-HHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccce Q lcl|NC_020078. 157 AQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEK-FLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGET 235 (339) Q Consensus 157 ~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~-L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~ 235 (339) +.++ +....+.+.+. ++... ++.. ..-..|++|..|..|.+-..- +..|.-.. . T Consensus 254 ------~~~~------~~~~~~~~~~~----~~~~~~~~~~------~~a~~v~n~~~~~~l~~lkd~-~G~~i~~~--~ 308 (394) T protein:vir:97 254 ------KSFT------TKTVKNLDEIK----ALLNGGFDPA------YNVSLIVSQSFYQTLDTLKDG-NGRYLLQD--D 308 (394) T ss_pred ------cccc------ccccccHHHHH----HHHHhhhhhh------hCCEEEEcHHHHHHHHHhhcc-CCCeeeec--C Confidence 0000 11112223333 33222 2211 122357999999998652211 11121111 2 Q ss_pred eecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhHHHH Q lcl|NC_020078. 236 LNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWFI 315 (339) Q Consensus 236 l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~i 315 (339) +.+|.-++++|++|+.+++.+.+. ...+-++|++ ++..+...++.++..++ .++...+ T Consensus 309 ~~~~~~~~l~G~pv~~~~~~~~~~------------~~~~~gd~~~---------~~~~~~~~~~~~~~~~~-~~~~~~~ 366 (394) T protein:vir:97 309 ITAVSGKVLLGKPVFVLSDEVLGA------------NKAFIGDFKR---------GVLFADRKDLGLRWADN-EIYGQYL 366 (394) T ss_pred cCCCCCceeccceeEEecccccCC------------ccEEEeeccc---------cEEEEEecceEEEEecc-cccceeE Confidence 345666789999999976543211 1112233332 22233334455554433 3456678 Q ss_pred HHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 316 DSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 316 ~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) ++.+-+|.++++|++.+.|+++.+ T Consensus 367 ~~~~r~d~~v~~~~a~~~~~~~~~ 390 (394) T protein:vir:97 367 QAVLRFGVSKVDDKAGYYVTFTPE 390 (394) T ss_pred EEEEEEccEEecccceEEEEeccc Confidence 999999999999999999999988 No 134 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=98.98 E-value=7.2e-11 Score=76.17 Aligned_cols=286 Identities=10% Similarity=0.025 Sum_probs=150.2 Q ss_pred CccccCcc------cCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhcccccccccccc-ceEEEecccc-cee Q lcl|NC_020078. 1 MSIFDGQT------PSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGT-STISNRGISK-AKL 72 (339) Q Consensus 1 ~~~~~~~~------~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G-~tv~i~~iG~-~t~ 72 (339) +.-|.+.. ....-.|.-......+--.+..+.|+.++.+..+..+.++++++...+.++ .++.+++... ... T Consensus 94 ~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 173 (408) T protein:vir:74 94 VKDFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSSGSRVYEKWTDVTPL 173 (408) T ss_pred HHHHHHHHhcchhhhhhhhhhhhcccccCCCceeechhHhhHHHHHHhhhcchhhhcceeeccCCcceEEEEeecCCccc Confidence 00011100 000111111111111111134599999999999999999999998887754 3556665543 233 Q ss_pred eecc-CCCCCCCCCCCCccceEEEEeehhhhh-hhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Q lcl|NC_020078. 73 QKIA-PGTTPPPSTEPHTSKIFLKIDTVIIAR-NAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASD 150 (339) Q Consensus 73 ~~~~-~g~~i~~~~~~~~~~~~l~ID~~~y~~-~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~ 150 (339) ..+. .|+.+.....+..++.++... ++.. ..|-+-=--++.+|+.+.+.++.+++|++..|+.|+. T Consensus 174 ~~~v~E~~~~~~~~~~~~~~i~~~~~--k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~---------- 241 (408) T protein:vir:74 174 KAMDEEDGKIPDLDNPRLTIIKYLIK--RYAGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIA---------- 241 (408) T ss_pred ccccccccccccccccceeeEEeeee--eEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhh---------- Confidence 3333 345554333345566666554 3222 2222211113567899999999999999999998752 Q ss_pred ccccccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccc Q lcl|NC_020078. 151 SPYGTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVT 230 (339) Q Consensus 151 ~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~ 230 (339) |.+.+. ..+...+.+.+.+++ ...|..... + +=..|++|..|..|.+-.. .+..|.- T Consensus 242 ----------G~G~~~-----~~~~~~~~~~i~~~~---~~~l~~~~~--~--~a~~v~n~~~~~~l~~lkd-~~G~~l~ 298 (408) T protein:vir:74 242 ----------AMGTVP-----KKPTIANFDDVITMI---NTSVDPAII--A--TSSLLTNQSGLNKLALVKT-AEGKYLL 298 (408) T ss_pred ----------cccccc-----cccccccHHHHHHHH---HHhhhhhhc--C--CCEEEEcHHHHHHHHHhhc-CCCceEe Confidence 111110 111122334443332 234544433 1 1245789999999975221 1122221 Q ss_pred cccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeech-h Q lcl|NC_020078. 231 SAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDD-L 309 (339) Q Consensus 231 ~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~-~ 309 (339) .. .+.+|.-.+++|.+|+.+.+.+..... .+...-+-++ .+.++..+.-.+++++..+.. . T Consensus 299 ~~--~~~~~~~~~l~G~pV~~~~~~~~~~~~-------~~~~~i~~gd---------~~~~~~~~~~~~~~i~~~~~~~~ 360 (408) T protein:vir:74 299 EP--DPTKPNSYLIKGKQVIVVADRWLPNSG-------STVYPLYYGD---------MSQAITLFDRENMSLLPTNIGAG 360 (408) T ss_pred cc--CcCCCCCceecceeeEEecCccccccc-------CCcceEEEEe---------hhccEEEEEecceEEEEeccccc Confidence 11 133455578999999987653211110 1111111112 233444454455566655432 1 Q ss_pred ---hhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 310 ---SKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 310 ---~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) +....+++.+.+|.++++|++.+.+++++. T Consensus 361 ~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 393 (408) T protein:vir:74 361 AFETDTTKIRVIDRFDVKATDSEALVAGSFTAI 393 (408) T ss_pred hhhcceeeEEEEEeeCcEEecccceEEEEeecc Confidence 233447778889999999999999998665 No 135 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=98.95 E-value=1.2e-10 Score=74.95 Aligned_cols=304 Identities=12% Similarity=0.097 Sum_probs=151.8 Q ss_pred CccccCcccCC----CcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc--ccceeee Q lcl|NC_020078. 1 MSIFDGQTPSY----DVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI--SKAKLQK 74 (339) Q Consensus 1 ~~~~~~~~~~~----~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i--G~~t~~~ 74 (339) +.-+.+-|... ...+-...+.+++--.+..+.|+.++.+..++.+.+++++++....+| ++.|++. +...+.. T Consensus 131 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~-~~~~~~~~~~~~~a~w 209 (497) T protein:vir:10 131 AAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSP-NLSYLTESAAHNNAAA 209 (497) T ss_pred HHHHHHHHhhhhhhHHHHHhhhcccCcccccccchhhhHHHHHHHHhhhhHHhhccccccCCC-ceEEEEEcCCCCccee Confidence 00000000000 000000011112222245699999999999999999999988777665 5788874 3457777 Q ss_pred ccCCCCCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Q lcl|NC_020078. 75 IAPGTTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYG 154 (339) Q Consensus 75 ~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~ 154 (339) ...|+.++.. .+..+++++..- ++..+..-.-+=.+...++.+.+.++.++++++..|+.++. +.....|... T Consensus 210 v~E~~~~~~s-~~~f~~i~~~~~--k~a~~~~iS~ell~d~~~l~~~i~~~l~~~i~~~~d~~~l~----G~G~~~p~Gi 282 (497) T protein:vir:10 210 VAEAGTYPFS-SEEFARVYEQVG--KVANALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLA----GGGYPGVNGL 282 (497) T ss_pred eccCcccccc-cccceeeEeeee--eeEeecHhHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhhc----CCCccccccc Confidence 7778877654 456666666543 43333221111112234688889999999999999988752 1111100000 Q ss_pred ccc----cccC-c-ccccccc---------ccCccccc-----------------------------cHHHHHHHHHHHH Q lcl|NC_020078. 155 TAA----QMPG-H-SGGNVVT---------LAGANDYK-----------------------------DPAKLYAAIASLV 190 (339) Q Consensus 155 ~~~----~~~g-~-~~~~~~~---------~~~~~~~~-----------------------------~~~~l~~ai~~a~ 190 (339) .+. ..+. . ....... ........ +...+...++++. T Consensus 283 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 362 (497) T protein:vir:10 283 LQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAF 362 (497) T ss_pred ccccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHH Confidence 000 0000 0 0000000 00000000 0011122233333 Q ss_pred HHHHhcCCCCCcCCeEEEECHHHHHHHhc--cc--chhhh-cccccccceeecceeEEEeceEEEEeccccccccccccc Q lcl|NC_020078. 191 EKFLEKDVRPNEEDMILVLPPAAFTALMQ--AE--HITNG-EYVTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHL 265 (339) Q Consensus 191 ~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~--~~--~~~n~-d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~ 265 (339) ..+.......| + ..+++|..|..|.+ |. +++-. .+.+..+. ..+...+++|.+|++|+.+|.+. T Consensus 363 ~~~~~~~~~~~--~-~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~--~~~~~~~l~G~pV~~t~~~~~~~------ 431 (497) T protein:vir:10 363 VDIQLTLFQTP--N-AVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGN--PVNGGKNIWGVPVVTTPLIPLGT------ 431 (497) T ss_pred hhhhhhcccCC--C-eEEEchHHHHHHHHhhcCCCceeccCcccccccc--cccCCceeeceeeEecCCCCCCc------ Confidence 33322222112 2 45799999998754 32 11111 11111111 12234479999999999998431 Q ss_pred cCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeech-hhh---HHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 266 LSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDD-LSK---LWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 266 l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~-~~~---~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .+-++| ...++..+.-.+++++..... ..| .-.+++..-++..|++|++.+.|.+.++ T Consensus 432 --------~~~Gd~--------~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~ 493 (497) T protein:vir:10 432 --------ILVGHF--------APSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) T ss_pred --------eEEeec--------ccceEEEEEecccEEEeecccchhhhcCcEEEEEEEeecceeeccccEEEEEecCC Confidence 111222 223344444455555554321 222 2236677889999999999999999988 No 136 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=98.95 E-value=1.2e-10 Score=74.95 Aligned_cols=304 Identities=12% Similarity=0.097 Sum_probs=151.8 Q ss_pred CccccCcccCC----CcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc--ccceeee Q lcl|NC_020078. 1 MSIFDGQTPSY----DVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI--SKAKLQK 74 (339) Q Consensus 1 ~~~~~~~~~~~----~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i--G~~t~~~ 74 (339) +.-+.+-|... ...+-...+.+++--.+..+.|+.++.+..++.+.+++++++....+| ++.|++. +...+.. T Consensus 131 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~-~~~~~~~~~~~~~a~w 209 (497) T protein:vir:78 131 AAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSP-NLSYLTESAAHNNAAA 209 (497) T ss_pred HHHHHHHHhhhhhhHHHHHhhhcccCcccccccchhhhHHHHHHHHhhhhHHhhccccccCCC-ceEEEEEcCCCCccee Confidence 00000000000 000000011112222245699999999999999999999988777665 5788874 3457777 Q ss_pred ccCCCCCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Q lcl|NC_020078. 75 IAPGTTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYG 154 (339) Q Consensus 75 ~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~ 154 (339) ...|+.++.. .+..+++++..- ++..+..-.-+=.+...++.+.+.++.++++++..|+.++. +.....|... T Consensus 210 v~E~~~~~~s-~~~f~~i~~~~~--k~a~~~~iS~ell~d~~~l~~~i~~~l~~~i~~~~d~~~l~----G~G~~~p~Gi 282 (497) T protein:vir:78 210 VAEAGTYPFS-SEEFARVYEQVG--KVANALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLA----GGGYPGVNGL 282 (497) T ss_pred eccCcccccc-cccceeeEeeee--eeEeecHhHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhhc----CCCccccccc Confidence 7778877654 456666666543 43333221111112234688889999999999999988752 1111100000 Q ss_pred ccc----cccC-c-ccccccc---------ccCccccc-----------------------------cHHHHHHHHHHHH Q lcl|NC_020078. 155 TAA----QMPG-H-SGGNVVT---------LAGANDYK-----------------------------DPAKLYAAIASLV 190 (339) Q Consensus 155 ~~~----~~~g-~-~~~~~~~---------~~~~~~~~-----------------------------~~~~l~~ai~~a~ 190 (339) .+. ..+. . ....... ........ +...+...++++. T Consensus 283 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 362 (497) T protein:vir:78 283 LQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAF 362 (497) T ss_pred ccccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHH Confidence 000 0000 0 0000000 00000000 0011122233333 Q ss_pred HHHHhcCCCCCcCCeEEEECHHHHHHHhc--cc--chhhh-cccccccceeecceeEEEeceEEEEeccccccccccccc Q lcl|NC_020078. 191 EKFLEKDVRPNEEDMILVLPPAAFTALMQ--AE--HITNG-EYVTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHL 265 (339) Q Consensus 191 ~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~--~~--~~~n~-d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~ 265 (339) ..+.......| + ..+++|..|..|.+ |. +++-. .+.+..+. ..+...+++|.+|++|+.+|.+. T Consensus 363 ~~~~~~~~~~~--~-~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~--~~~~~~~l~G~pV~~t~~~~~~~------ 431 (497) T protein:vir:78 363 VDIQLTLFQTP--N-AVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGN--PVNGGKNIWGVPVVTTPLIPLGT------ 431 (497) T ss_pred hhhhhhcccCC--C-eEEEchHHHHHHHHhhcCCCceeccCcccccccc--cccCCceeeceeeEecCCCCCCc------ Confidence 33322222112 2 45799999998754 32 11111 11111111 12234479999999999998431 Q ss_pred cCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeech-hhh---HHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 266 LSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDD-LSK---LWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 266 l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~-~~~---~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .+-++| ...++..+.-.+++++..... ..| .-.+++..-++..|++|++.+.|.+.++ T Consensus 432 --------~~~Gd~--------~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~ 493 (497) T protein:vir:78 432 --------ILVGHF--------APSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) T ss_pred --------eEEeec--------ccceEEEEEecccEEEeecccchhhhcCcEEEEEEEeecceeeccccEEEEEecCC Confidence 111222 223344444455555554321 222 2236677889999999999999999988 No 137 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=98.95 E-value=1e-10 Score=75.29 Aligned_cols=286 Identities=12% Similarity=0.022 Sum_probs=162.2 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccc-cccccceEEE----eccccceeeec Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVR-SVRGTSTISN----RGISKAKLQKI 75 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r-~i~~G~tv~i----~~iG~~t~~~~ 75 (339) .|+.|| +..++++.=. + ..|-........+...+.++..++ .-+++-+|+| +........+. T Consensus 8 ~s~~~~--~~itv~~ll~-----~------P~~I~~~i~e~~~~~~iad~lf~~~~a~~~~~v~f~~~~p~~~~~d~e~V 74 (318) T protein:vir:10 8 VSVSDG--PAITVRELVG-----N------PLWIPTALKKMMVNQFISESLFRNGGANPNGVVAYNEGNPSFLEDDVADV 74 (318) T ss_pred eeeecC--CceehHHhhC-----C------chhHHHHHHHHHhccchhhhhhhcccccccceeEEEecccccccCcHhhc Confidence 344444 3333333300 1 112222223333444445544434 4556778888 55666778888 Q ss_pred cCCCCCCCCCCCCccceEE-EEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Q lcl|NC_020078. 76 APGTTPPPSTEPHTSKIFL-KIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYG 154 (339) Q Consensus 76 ~~g~~i~~~~~~~~~~~~l-~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~ 154 (339) .+|++++.... .+.+..+ .+. ..=-.+.|-|=-..-+..++.....++++-+++++.|+.++..|..+....-+ T Consensus 75 aEggEiP~~~~-~~G~~~ia~~~-K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~~~--- 149 (318) T protein:vir:10 75 AEFGEIPVSAG-ARGLPRTAFAV-KKALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPTLA--- 149 (318) T ss_pred cCcccccccCC-CCCchhhhhhe-hhccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc--- Confidence 99999876443 3323333 332 22234455544444578999999999999999999999998877655422111 Q ss_pred ccccccCccccccccccCccccccHHHHH-----HHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhccc Q lcl|NC_020078. 155 TAAQMPGHSGGNVVTLAGANDYKDPAKLY-----AAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYV 229 (339) Q Consensus 155 ~~~~~~g~~~~~~~~~~~~~~~~~~~~l~-----~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~ 229 (339) ...+..+.+.+. .+..++..-+ +.+..-....+++- ...--.+|+.|..|..|++++++... |. T Consensus 150 --~s~~w~~~~~~~-----~d~~~A~e~v~~a~~~~~~a~~~~~~~~~---GY~pdtIVlhP~~~~~l~~n~~~~~~-y~ 218 (318) T protein:vir:10 150 --VPTAWDNGGKVR-----TDIAIAIEQISTAAPTAYPAGVGSSDEYF---GFIPDTIVMHYALLPILMDNENFMKV-YE 218 (318) T ss_pred --CCcCCCCccccc-----ccchhhhhhhhhhhhhhhhhhhhhhhhcc---CccceeeEECHHHHHHHhcchhhhhh-hh Confidence 111111111111 0111111111 11111111122221 11223899999999999999887543 43 Q ss_pred ccccce---e-eccee-EEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEE-EEeeeeEE Q lcl|NC_020078. 230 TSAGET---L-NTKYM-FAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAG-STIPVTSK 303 (339) Q Consensus 230 ~~~~~~---l-~~G~v-~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~-~~~~~~~e 303 (339) +.+... . ..|.+ ++++|++|+.|.++|.+. ++++.+..+|+- -..+++++ T Consensus 219 ~~a~~~~~~~~~tg~~~g~~lGl~vi~s~~~p~~~------------------------alvlq~g~vG~~~d~~pl~~t 274 (318) T protein:vir:10 219 RNANYVSTAPDWTGNFPGSVMGLNVIRSRTFPIDR------------------------VLIMERGTVGFYSDTRPLQFT 274 (318) T ss_pred ccchhhhhcccccccccceeeceEEeecCccCCCe------------------------eEEEecCCcceeeccccceee Confidence 332211 0 12333 678999999999998431 356677777654 44567888 Q ss_pred eeech-------hhhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 304 IFFDD-------LSKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 304 ~~~~~-------~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .+|++ ...+|.++.++.....|.+|.++.-|+==.. T Consensus 275 ~~~~egg~~~g~~~~s~~~~~~~~~~~~V~~PkA~~~itgi~~ 317 (318) T protein:vir:10 275 ALYPEGNGPNGGPTESYRADASHKRALAVDQPKAALWLTGIVT 317 (318) T ss_pred ecccCCCCCCCCcchhhheehheeeeeeeeCcceeEEEeeccC Confidence 89977 6789999999999999999988776532222 No 138 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=98.95 E-value=2.8e-10 Score=72.92 Aligned_cols=281 Identities=7% Similarity=-0.014 Sum_probs=148.9 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccc-cceEEEeccccc--eeeeccC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRG-TSTISNRGISKA--KLQKIAP 77 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~-G~tv~i~~iG~~--t~~~~~~ 77 (339) ...|-.-+- +..+.+. ...++--.+.-+.|+.++.+..+..++++++++...+.+ ..++.++..... .+..... T Consensus 94 ~~~~~~~~~--~~~~~~~-~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E 170 (395) T protein:vir:38 94 KNQFVKDFK--NLVTSGT-TGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVENVTTSHGSRVYEKLADITPLKDLDDE 170 (395) T ss_pred HHHHHHHHH--HHHhhcc-CccCCCceecchhHhhHHHHHHHhhcchhhhcceeeccCCcceEEEEeeccCCcccccccc Confidence 000000000 0000000 011111123448899999999999999999988877654 234445544432 3333445 Q ss_pred CCCCCCCCCCCccceEEEEeehhhhhh-hHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Q lcl|NC_020078. 78 GTTPPPSTEPHTSKIFLKIDTVIIARN-AEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTA 156 (339) Q Consensus 78 g~~i~~~~~~~~~~~~l~ID~~~y~~~-~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~ 156 (339) |+.++....+..+++++... ++..+ .|.+-=--.+.+|+.+.+.++.++++++..|+.|+.-. T Consensus 171 ~~~~~~~~~~~f~~v~~~~~--k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~-------------- 234 (395) T protein:vir:38 171 SALIGDNDDPELTVVKYLIH--RYAGITTVTNTLLKDTVDNIIQWLVNWAAKKDVVTRNAKILEVM-------------- 234 (395) T ss_pred ccccccccccceeeEEeeee--eeEeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc-------------- Confidence 66665332344455555443 33322 22221111256789999999999999999998875211 Q ss_pred ccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhccccccccee Q lcl|NC_020078. 157 AQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETL 236 (339) Q Consensus 157 ~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l 236 (339) +.+. +.+...+.+.+.+++. ..|....- .+-.++++|..|..|.+-..- +..|.-.. .+ T Consensus 235 ------g~~~-----~~~~~~~~~~i~~~~~---~~l~~~~~----~~a~~v~n~~~~~~L~~lkd~-~G~~l~~~--~~ 293 (395) T protein:vir:38 235 ------GKAP-----KKPTISQFDNIKDLEN---NTLDPAIE----STSSFITNQSGYNILSKVKDA-DGRYLMQP--DV 293 (395) T ss_pred ------cccc-----cccccccHHHHHHHHH---Hhhhhhhc----CCCEEEEcHHHHHHHHHhhcc-CCceeecc--Cc Confidence 1111 0111122333333221 23333221 123568999999999763211 11121111 23 Q ss_pred ecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEec-cceeEEEEEeeeeEEeeech-hhh--- Q lcl|NC_020078. 237 NTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFS-PKALLAGSTIPVTSKIFFDD-LSK--- 311 (339) Q Consensus 237 ~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h-~~A~~~~~~~~~~~e~~~~~-~~~--- 311 (339) .+|...+++|++|+.+.+.+..... +... .++.. +.++..+...++.++..+.. .+| T Consensus 294 ~~~~~~~l~G~pV~~~~~~~~~~~~--------~~~~----------i~~gd~~~~~~i~~~~~~~i~~~~~~~~~~~~~ 355 (395) T protein:vir:38 294 TSPDKYLIDGKPVIRIADKWLPDVS--------GSHP----------LYFGDLKQGITLFDRQQMQIDTTNVGAGSFEHD 355 (395) T ss_pred CCCCcceeccceeEEecccccCcCC--------Ccce----------EEEEeccccEEEEEecceEEEEeccccchhhcC Confidence 4566678999999999876532110 0111 12222 22444455566666766543 223 Q ss_pred HHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 312 LWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 312 ~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) ...+++...||.++++|++.+.+.++++ T Consensus 356 ~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 383 (395) T protein:vir:38 356 TTKLRFIDRFDVQLIDDGAFAAASFKTV 383 (395) T ss_pred ceEEEEEEeeccEEecccceEEEEeecc Confidence 2456777779999999999999999888 No 139 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=98.95 E-value=1.6e-10 Score=74.36 Aligned_cols=275 Identities=13% Similarity=0.120 Sum_probs=143.9 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhcccccccccccc-ceEEEeccccceeeeccCCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGT-STISNRGISKAKLQKIAPGT 79 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G-~tv~i~~iG~~t~~~~~~g~ 79 (339) .......-+ +........+...+..+.++.++... .....+.+.++...+..+ -.+.++..+...+.....++ T Consensus 121 ~~~~~~~~~-----~~~~~~~~~~~~~~vp~~~~~~i~~~-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~ 194 (397) T protein:vir:96 121 NAFVKSKGA-----EKRDGFTSVEGGALIPQELLQPQLEP-KDIVDLSKYVRSVPVNSASGKFPVISKSGSKMATVQQLE 194 (397) T ss_pred HHHHHhhhh-----hhhhcccccccccchhHHHHHHHHHh-hhhhhHHHhhhhccccccceeEEEEeccCCccccccccc Confidence 110000000 00011112222234457788888764 333344556665555432 23444444555556666666 Q ss_pred CCCCCCCCCccceEEEEeehhhhh-hhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccc Q lcl|NC_020078. 80 TPPPSTEPHTSKIFLKIDTVIIAR-NAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQ 158 (339) Q Consensus 80 ~i~~~~~~~~~~~~l~ID~~~y~~-~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~ 158 (339) .......+...++++.+. ++.. ..|.+---.++.+|+.+.+.++.++++++..|..|+.- T Consensus 195 ~~~~~~~~~~~~i~~~~~--~~~~~~~~s~ell~ds~~~l~~~i~~~l~~~~~~~~~~~i~~g----------------- 255 (397) T protein:vir:96 195 KNPQLANPKMVEIDYSVA--TRRGYIPISQEMIDDASYDVTGLIADEIQDQSLNTKNADIAAV----------------- 255 (397) T ss_pred cccccccccccceeecHh--HhhcchhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhc----------------- Confidence 554333455666666654 3322 22222111234678999999999999999999877521 Q ss_pred ccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceeec Q lcl|NC_020078. 159 MPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLNT 238 (339) Q Consensus 159 ~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~~ 238 (339) .+.+ .++...+.+. |.++....... . . +-..|++|..|..|.+-.. .+..|.-.. .+.+ T Consensus 256 ---~g~~------~~~~~~~~d~----~~~~~~~~~~~-~--~--~a~~v~n~~~~~~l~~lkd-~~G~~~~~~--~~~~ 314 (397) T protein:vir:96 256 ---LKTA------TAKSVVGVDG----LKDLINKEIKK-V--Y--DVKLFISASMYSELDKLKD-KNGRYLLQD--SITA 314 (397) T ss_pred ---cccc------ccccccchHH----HHHHHHHhhhh-h--c--CcEEEEcHHHHHHHHHhhc-cCCCeEecc--CccC Confidence 0111 1111222333 33333322111 1 1 2245999999999865221 122222111 2445 Q ss_pred ceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhHHHHHHH Q lcl|NC_020078. 239 KYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWFIDSW 318 (339) Q Consensus 239 G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~i~g~ 318 (339) |.-.+++|.+|+.+++.+.... .+...-+-++|+. ++......++.++... ...+...++++ T Consensus 315 ~~~~~l~G~pv~~~~~~~~~~~--------~~~~~~~~gd~~~---------~~~~~~~~~~~~~~~~-~~~~~~~~~~~ 376 (397) T protein:vir:96 315 ASGKQLLGKEVVVLDDDVIGKS--------VGNVVGFIGDAKA---------FASFFDRKQVSVSWVD-NNIYGQLLAGI 376 (397) T ss_pred CCcccccccceEEecccccCCC--------CCceEEEEeehhc---------ceEeEeecceEEEEec-ccccceeEEEE Confidence 6667899999999877543211 1111122233332 2233333444545433 34556678999 Q ss_pred HHhCCccccccceEEEEecCC Q lcl|NC_020078. 319 LAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 319 ~~~Ga~v~rPe~~v~i~~~~a 339 (339) +-+|.++++|++.+.|++++| T Consensus 377 ~r~d~~~~~~~a~~~~~~~~a 397 (397) T protein:vir:96 377 IRYDVKATDKKAGFYVTFTIG 397 (397) T ss_pred EEEccEEecccceEEEEeecC Confidence 999999999999999999999 No 140 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=98.94 E-value=1e-10 Score=75.37 Aligned_cols=287 Identities=12% Similarity=0.023 Sum_probs=150.7 Q ss_pred CccccCcccC---------CCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccc-eEEEecc-cc Q lcl|NC_020078. 1 MSIFDGQTPS---------YDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTS-TISNRGI-SK 69 (339) Q Consensus 1 ~~~~~~~~~~---------~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~-tv~i~~i-G~ 69 (339) +....+...+ ....+.......++--.+.-+.|+.++.+..+..++++++++...+.++. +..++.. +. T Consensus 81 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~ 160 (392) T protein:vir:10 81 MKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDM 160 (392) T ss_pred HHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCC Confidence 1111111110 00111111111111111334899999999999999999999988887532 3444443 45 Q ss_pred ceeeeccCCCCCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_020078. 70 AKLQKIAPGTTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIAS 149 (339) Q Consensus 70 ~t~~~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~ 149 (339) +.+.....|..++....+..+++++..-+ .+.-..|.+-=-.++.+|+.+.+.++.++++++..|..++.-. T Consensus 161 ~~a~~v~E~~~~~~~~~~~~~~v~l~~~k-~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~------- 232 (392) T protein:vir:10 161 IPFAEITEMGEIPETDNPKFSNVQYAVKD-RAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVI------- 232 (392) T ss_pred ccceeecccccccccccccceeEEeeeee-EEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc------- Confidence 57777777777764333455666665532 2222333331112356899999999999999999998875211 Q ss_pred cccccccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhccc Q lcl|NC_020078. 150 DSPYGTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYV 229 (339) Q Consensus 150 ~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~ 229 (339) +.+ .+....+.+.+.+++. ..|..... + +-..|++|..|..|.+-.. .+..|. T Consensus 233 -------------g~~------~~~~~~~~d~i~~~~~---~~l~~~~~--~--~a~~vm~~~~~~~L~~lkd-~~G~~l 285 (392) T protein:vir:10 233 -------------EKL------TKQAIKSLDDIKDVLN---VKLDPAIS--P--NAILLTNQDGFNYLDKLKD-KDGKYI 285 (392) T ss_pred -------------ccc------cccCccCHHHHHHHHH---Hhhhhhhc--c--CCEEEEcHHHHHHHHHhhc-cCCCeE Confidence 000 0111223333333321 24443322 2 2346899999999965311 111221 Q ss_pred ccccceeecceeEEEeceEEEE--eccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeec Q lcl|NC_020078. 230 TSAGETLNTKYMFAAFGVPVIT--SNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFD 307 (339) Q Consensus 230 ~~~~~~l~~G~v~~i~G~~V~~--Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~ 307 (339) -.. .+.+|.-.+++|.+++. +++.|..... ..+...-+-++| +.++..+....+..+..+. T Consensus 286 ~~~--~~~~~~~~tllG~~~v~~~~~~~~~~~~~------~~~~~~~~~gdf---------s~~~~i~~~~~~~~~~~~~ 348 (392) T protein:vir:10 286 LQS--DPTQKNKKLFAGTNPVVVVSNRFLKSKGT------TAKKAPLIIGDL---------KEAIVLFKREDMELASTDV 348 (392) T ss_pred eec--CccCCccccccCcccEEEecccccCCCcc------cCCceEEEEEeh---------hceEEEEeecceEEEEecc Confidence 111 13356667899987654 2444432111 111111111122 2234444444555555442 Q ss_pred h-hhh-HH--HHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 308 D-LSK-LW--FIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 308 ~-~~~-~d--~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) . ..| .+ .+++.+.+|.++++|++.+.++++++ T Consensus 349 ~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 384 (392) T protein:vir:10 349 GGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) T ss_pred ccchhhcCceEEEEEEeeccEEecccceEEEEeccc Confidence 2 122 22 37788899999999999999998777 No 141 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=98.94 E-value=1e-10 Score=75.37 Aligned_cols=287 Identities=12% Similarity=0.023 Sum_probs=150.7 Q ss_pred CccccCcccC---------CCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccc-eEEEecc-cc Q lcl|NC_020078. 1 MSIFDGQTPS---------YDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTS-TISNRGI-SK 69 (339) Q Consensus 1 ~~~~~~~~~~---------~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~-tv~i~~i-G~ 69 (339) +....+...+ ....+.......++--.+.-+.|+.++.+..+..++++++++...+.++. +..++.. +. T Consensus 81 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~ 160 (392) T protein:vir:10 81 MKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDM 160 (392) T ss_pred HHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCC Confidence 1111111110 00111111111111111334899999999999999999999988887532 3444443 45 Q ss_pred ceeeeccCCCCCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_020078. 70 AKLQKIAPGTTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIAS 149 (339) Q Consensus 70 ~t~~~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~ 149 (339) +.+.....|..++....+..+++++..-+ .+.-..|.+-=-.++.+|+.+.+.++.++++++..|..++.-. T Consensus 161 ~~a~~v~E~~~~~~~~~~~~~~v~l~~~k-~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~------- 232 (392) T protein:vir:10 161 IPFAEITEMGEIPETDNPKFSNVQYAVKD-RAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVI------- 232 (392) T ss_pred ccceeecccccccccccccceeEEeeeee-EEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc------- Confidence 57777777777764333455666665532 2222333331112356899999999999999999998875211 Q ss_pred cccccccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhccc Q lcl|NC_020078. 150 DSPYGTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYV 229 (339) Q Consensus 150 ~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~ 229 (339) +.+ .+....+.+.+.+++. ..|..... + +-..|++|..|..|.+-.. .+..|. T Consensus 233 -------------g~~------~~~~~~~~d~i~~~~~---~~l~~~~~--~--~a~~vm~~~~~~~L~~lkd-~~G~~l 285 (392) T protein:vir:10 233 -------------EKL------TKQAIKSLDDIKDVLN---VKLDPAIS--P--NAILLTNQDGFNYLDKLKD-KDGKYI 285 (392) T ss_pred -------------ccc------cccCccCHHHHHHHHH---Hhhhhhhc--c--CCEEEEcHHHHHHHHHhhc-cCCCeE Confidence 000 0111223333333321 24443322 2 2346899999999965311 111221 Q ss_pred ccccceeecceeEEEeceEEEE--eccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeec Q lcl|NC_020078. 230 TSAGETLNTKYMFAAFGVPVIT--SNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFD 307 (339) Q Consensus 230 ~~~~~~l~~G~v~~i~G~~V~~--Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~ 307 (339) -.. .+.+|.-.+++|.+++. +++.|..... ..+...-+-++| +.++..+....+..+..+. T Consensus 286 ~~~--~~~~~~~~tllG~~~v~~~~~~~~~~~~~------~~~~~~~~~gdf---------s~~~~i~~~~~~~~~~~~~ 348 (392) T protein:vir:10 286 LQS--DPTQKNKKLFAGTNPVVVVSNRFLKSKGT------TAKKAPLIIGDL---------KEAIVLFKREDMELASTDV 348 (392) T ss_pred eec--CccCCccccccCcccEEEecccccCCCcc------cCCceEEEEEeh---------hceEEEEeecceEEEEecc Confidence 111 13356667899987654 2444432111 111111111122 2234444444555555442 Q ss_pred h-hhh-HH--HHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 308 D-LSK-LW--FIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 308 ~-~~~-~d--~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) . ..| .+ .+++.+.+|.++++|++.+.++++++ T Consensus 349 ~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 384 (392) T protein:vir:10 349 GGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) T ss_pred ccchhhcCceEEEEEEeeccEEecccceEEEEeccc Confidence 2 122 22 37788899999999999999998777 No 142 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=98.94 E-value=1e-10 Score=75.37 Aligned_cols=287 Identities=12% Similarity=0.023 Sum_probs=150.7 Q ss_pred CccccCcccC---------CCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccc-eEEEecc-cc Q lcl|NC_020078. 1 MSIFDGQTPS---------YDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTS-TISNRGI-SK 69 (339) Q Consensus 1 ~~~~~~~~~~---------~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~-tv~i~~i-G~ 69 (339) +....+...+ ....+.......++--.+.-+.|+.++.+..+..++++++++...+.++. +..++.. +. T Consensus 81 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~ 160 (392) T protein:vir:10 81 MKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDM 160 (392) T ss_pred HHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCC Confidence 1111111110 00111111111111111334899999999999999999999988887532 3444443 45 Q ss_pred ceeeeccCCCCCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_020078. 70 AKLQKIAPGTTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIAS 149 (339) Q Consensus 70 ~t~~~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~ 149 (339) +.+.....|..++....+..+++++..-+ .+.-..|.+-=-.++.+|+.+.+.++.++++++..|..++.-. T Consensus 161 ~~a~~v~E~~~~~~~~~~~~~~v~l~~~k-~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~------- 232 (392) T protein:vir:10 161 IPFAEITEMGEIPETDNPKFSNVQYAVKD-RAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVI------- 232 (392) T ss_pred ccceeecccccccccccccceeEEeeeee-EEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc------- Confidence 57777777777764333455666665532 2222333331112356899999999999999999998875211 Q ss_pred cccccccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhccc Q lcl|NC_020078. 150 DSPYGTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYV 229 (339) Q Consensus 150 ~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~ 229 (339) +.+ .+....+.+.+.+++. ..|..... + +-..|++|..|..|.+-.. .+..|. T Consensus 233 -------------g~~------~~~~~~~~d~i~~~~~---~~l~~~~~--~--~a~~vm~~~~~~~L~~lkd-~~G~~l 285 (392) T protein:vir:10 233 -------------EKL------TKQAIKSLDDIKDVLN---VKLDPAIS--P--NAILLTNQDGFNYLDKLKD-KDGKYI 285 (392) T ss_pred -------------ccc------cccCccCHHHHHHHHH---Hhhhhhhc--c--CCEEEEcHHHHHHHHHhhc-cCCCeE Confidence 000 0111223333333321 24443322 2 2346899999999965311 111221 Q ss_pred ccccceeecceeEEEeceEEEE--eccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeec Q lcl|NC_020078. 230 TSAGETLNTKYMFAAFGVPVIT--SNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFD 307 (339) Q Consensus 230 ~~~~~~l~~G~v~~i~G~~V~~--Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~ 307 (339) -.. .+.+|.-.+++|.+++. +++.|..... ..+...-+-++| +.++..+....+..+..+. T Consensus 286 ~~~--~~~~~~~~tllG~~~v~~~~~~~~~~~~~------~~~~~~~~~gdf---------s~~~~i~~~~~~~~~~~~~ 348 (392) T protein:vir:10 286 LQS--DPTQKNKKLFAGTNPVVVVSNRFLKSKGT------TAKKAPLIIGDL---------KEAIVLFKREDMELASTDV 348 (392) T ss_pred eec--CccCCccccccCcccEEEecccccCCCcc------cCCceEEEEEeh---------hceEEEEeecceEEEEecc Confidence 111 13356667899987654 2444432111 111111111122 2234444444555555442 Q ss_pred h-hhh-HH--HHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 308 D-LSK-LW--FIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 308 ~-~~~-~d--~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) . ..| .+ .+++.+.+|.++++|++.+.++++++ T Consensus 349 ~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 384 (392) T protein:vir:10 349 GGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) T ss_pred ccchhhcCceEEEEEEeeccEEecccceEEEEeccc Confidence 2 122 22 37788899999999999999998777 No 143 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=98.94 E-value=1e-10 Score=75.37 Aligned_cols=287 Identities=12% Similarity=0.023 Sum_probs=150.7 Q ss_pred CccccCcccC---------CCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccc-eEEEecc-cc Q lcl|NC_020078. 1 MSIFDGQTPS---------YDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTS-TISNRGI-SK 69 (339) Q Consensus 1 ~~~~~~~~~~---------~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~-tv~i~~i-G~ 69 (339) +....+...+ ....+.......++--.+.-+.|+.++.+..+..++++++++...+.++. +..++.. +. T Consensus 81 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~ 160 (392) T protein:vir:10 81 MKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDM 160 (392) T ss_pred HHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCC Confidence 1111111110 00111111111111111334899999999999999999999988887532 3444443 45 Q ss_pred ceeeeccCCCCCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_020078. 70 AKLQKIAPGTTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIAS 149 (339) Q Consensus 70 ~t~~~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~ 149 (339) +.+.....|..++....+..+++++..-+ .+.-..|.+-=-.++.+|+.+.+.++.++++++..|..++.-. T Consensus 161 ~~a~~v~E~~~~~~~~~~~~~~v~l~~~k-~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~------- 232 (392) T protein:vir:10 161 IPFAEITEMGEIPETDNPKFSNVQYAVKD-RAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVI------- 232 (392) T ss_pred ccceeecccccccccccccceeEEeeeee-EEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc------- Confidence 57777777777764333455666665532 2222333331112356899999999999999999998875211 Q ss_pred cccccccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhccc Q lcl|NC_020078. 150 DSPYGTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYV 229 (339) Q Consensus 150 ~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~ 229 (339) +.+ .+....+.+.+.+++. ..|..... + +-..|++|..|..|.+-.. .+..|. T Consensus 233 -------------g~~------~~~~~~~~d~i~~~~~---~~l~~~~~--~--~a~~vm~~~~~~~L~~lkd-~~G~~l 285 (392) T protein:vir:10 233 -------------EKL------TKQAIKSLDDIKDVLN---VKLDPAIS--P--NAILLTNQDGFNYLDKLKD-KDGKYI 285 (392) T ss_pred -------------ccc------cccCccCHHHHHHHHH---Hhhhhhhc--c--CCEEEEcHHHHHHHHHhhc-cCCCeE Confidence 000 0111223333333321 24443322 2 2346899999999965311 111221 Q ss_pred ccccceeecceeEEEeceEEEE--eccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeec Q lcl|NC_020078. 230 TSAGETLNTKYMFAAFGVPVIT--SNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFD 307 (339) Q Consensus 230 ~~~~~~l~~G~v~~i~G~~V~~--Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~ 307 (339) -.. .+.+|.-.+++|.+++. +++.|..... ..+...-+-++| +.++..+....+..+..+. T Consensus 286 ~~~--~~~~~~~~tllG~~~v~~~~~~~~~~~~~------~~~~~~~~~gdf---------s~~~~i~~~~~~~~~~~~~ 348 (392) T protein:vir:10 286 LQS--DPTQKNKKLFAGTNPVVVVSNRFLKSKGT------TAKKAPLIIGDL---------KEAIVLFKREDMELASTDV 348 (392) T ss_pred eec--CccCCccccccCcccEEEecccccCCCcc------cCCceEEEEEeh---------hceEEEEeecceEEEEecc Confidence 111 13356667899987654 2444432111 111111111122 2234444444555555442 Q ss_pred h-hhh-HH--HHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 308 D-LSK-LW--FIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 308 ~-~~~-~d--~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) . ..| .+ .+++.+.+|.++++|++.+.++++++ T Consensus 349 ~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 384 (392) T protein:vir:10 349 GGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) T ss_pred ccchhhcCceEEEEEEeeccEEecccceEEEEeccc Confidence 2 122 22 37788899999999999999998777 No 144 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=98.94 E-value=3.5e-11 Score=77.88 Aligned_cols=285 Identities=14% Similarity=0.076 Sum_probs=148.4 Q ss_pred Ccc-------ccCc-ccCCC-cccCCccCcccchhHHHH-HHHHHHHHHHHHHHhhhccc-cccccccccceEEEecc-c Q lcl|NC_020078. 1 MSI-------FDGQ-TPSYD-VTRPNQRHGAGDPLADVT-EQFTGTVEGTIKRRSIMAGF-VPVRSVRGTSTISNRGI-S 68 (339) Q Consensus 1 ~~~-------~~~~-~~~~~-~~r~~~~~~~~~~~a~~i-e~~~g~v~~~f~~~sv~~~~-v~~r~i~~G~tv~i~~i-G 68 (339) +.+ -.|. +|.-. +.|....+.+++--.|.. +.++.++.+..+..++++.+ +++-+...| .+.||+. + T Consensus 332 ~~~a~~~G~~arg~~~~~~~l~~ra~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l~~~~~~~~~g-~~~ip~~~~ 410 (632) T protein:vir:96 332 LAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVG-DVDIPKKTS 410 (632) T ss_pred HHHHHhhhhhhhhhhhhHHHHHHhhhhcccccccccccccccchHHHHHHHhhcchhhhhcceEeecCCc-ceEEEEEeC Confidence 000 0010 11100 112111111111111333 34567777777778888776 333333334 5778876 5 Q ss_pred cceeeeccCCCCCCCCCCCCccceEEEEeehhhhhh-hHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_020078. 69 KAKLQKIAPGTTPPPSTEPHTSKIFLKIDTVIIARN-AEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAI 147 (339) Q Consensus 69 ~~t~~~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~-~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~ 147 (339) .+++.....|+.++.. .+..++.++.. .++..+ .|..-=--++.+|+.+.+.+++++++++..|+.++. +.. T Consensus 411 ~~~a~wv~E~~~~~~s-~~~f~~i~l~~--~k~~~~v~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~----G~G 483 (632) T protein:vir:96 411 GANFYWIGEDEDVQDS-DFDFTTLSFSP--KTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLT----GTG 483 (632) T ss_pred CceeEeecCCcccccc-ccceeeEEeee--eEEEEehhhHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhhc----ccC Confidence 6677777778877754 45555555554 333332 222211124678999999999999999999998752 211 Q ss_pred cccccccccccccCccccccc-cccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhh Q lcl|NC_020078. 148 ASDSPYGTAAQMPGHSGGNVV-TLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNG 226 (339) Q Consensus 148 ~~~~~~~~~~~~~g~~~~~~~-~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~ 226 (339) ..+ ...|......+ .....+...+ ++.+.++..++...++.. ..-..+++|..+..|.... +.+ T Consensus 484 ~~~-------~p~Gi~~~~~~~~~~~~~~~~~----~~~i~~~~~~i~~~~~~~--~~~~~~~~~~~~~~l~~~~-l~d- 548 (632) T protein:vir:96 484 LAN-------DPVGLLNMTGVPALTYPAGGVD----WASVVDMETKISTFNADA--GRLAYLTSVTQRGAAKKAQ-VFD- 548 (632) T ss_pred CCC-------ccceeeecccccceecccccCC----HHHHHHHHHHHhhccccc--CccEEEEchhHHHHHHHHh-ccC- Confidence 111 11121111111 1111111122 345666777776666632 2345578998887776532 221 Q ss_pred cccccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeee Q lcl|NC_020078. 227 EYVTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFF 306 (339) Q Consensus 227 d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~ 306 (339) +.+...+.. +.++|.+|+.||++|.... +-++|+... ++... .+..++.+ T Consensus 549 ---~~G~~i~~~---~~l~G~pv~~s~~ip~~~~--------------~~gd~s~~~--------i~~~~--~~~i~~~~ 598 (632) T protein:vir:96 549 ---NTGERIWQN---NEVNGYRAEASNQIPADTW--------------IFGDWSQIV--------IAMWG--VLDLKVDP 598 (632) T ss_pred ---CCCceeecC---CeecccceEeccccccCcE--------------EEeecceEE--------EEEec--ceEEEEcc Confidence 112222223 3679999999999985421 112333321 12222 22223222 Q ss_pred --chhhhHHHHHHHHHhCCccccccceEEEEecC Q lcl|NC_020078. 307 --DDLSKLWFIDSWLAFGVTINRTEYAGVIKLPA 338 (339) Q Consensus 307 --~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~ 338 (339) ....=.-.+++++-++.++++|++.+.++..| T Consensus 599 ~~~~~~~~v~~~~~~~~d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 599 YTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) T ss_pred ccccccCceEEEEEeecCceeechhhhhheeecC Confidence 11122235677889999999999999999888 No 145 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=98.93 E-value=9.3e-11 Score=75.57 Aligned_cols=292 Identities=14% Similarity=0.094 Sum_probs=142.1 Q ss_pred Ccccc----CcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc-ccceeeec Q lcl|NC_020078. 1 MSIFD----GQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI-SKAKLQKI 75 (339) Q Consensus 1 ~~~~~----~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i-G~~t~~~~ 75 (339) .+.|. |.... .-.|. .+..+++--.+.-+.|+.++.+..+..++++.++++.... | .+.||++ +..++... T Consensus 124 r~a~~~~l~~~~~~-~e~~a-~~~~t~~GG~lvP~~~~~~Ii~~l~~~~~i~~~~~~~~~~-~-~~~~p~~~~~~~a~~~ 199 (434) T protein:vir:62 124 RSVFANYIVGNIDE-KEARA-LGLVTGNGSVTIPDFLSKEIITYAQEENFLRRLGTGVKTK-E-NIKYPVLVKKAEAQGH 199 (434) T ss_pred HHHHHHHhccccch-hhhhh-hcccccccceecchhhHHHHHHhhhhhhhhhhhcceeccC-C-ceEEEEEecCCcccce Confidence 00111 11000 00011 0111111111334899999999999999999988765433 3 4677765 23333222 Q ss_pred ---cCCCCCCCCCCCCccceEEEEeehhhhhhhHHHHHHH-hcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Q lcl|NC_020078. 76 ---APGTTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEF-QTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDS 151 (339) Q Consensus 76 ---~~g~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~-q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~ 151 (339) ..|+.++. ..+...+.++.. .++..+..-.-+-. ++.+|+.+.+.++.+++|++..|+.++ .+.... T Consensus 200 ~~~~e~~~~~~-~~~~f~~v~~~~--~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~d~~~l----~G~G~~-- 270 (434) T protein:vir:62 200 KNERTNNEMPE-TDIEFDEIELSP--TEFDALATVTKKLLARTGLPIEQIVMDELKKAYVRKETQYMV----NGDEAN-- 270 (434) T ss_pred ecccccccccc-cccceeeEEeeh--eeeEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHh----ccCCCC-- Confidence 22333332 234444444544 34333322111111 356899999999999999999998875 222111 Q ss_pred cccccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhccccc Q lcl|NC_020078. 152 PYGTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTS 231 (339) Q Consensus 152 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~ 231 (339) ....|......+ +...+....++.|+++...|..... + ...| |++|..|..|.+-. -.+..|.-. T Consensus 271 -----~~~~g~~~~~~~-----~~~~~~~~~~d~l~~l~~~l~~~~~--~-~a~~-v~n~~~~~~L~~lk-d~~G~~l~~ 335 (434) T protein:vir:62 271 -----NINDGALAKKAV-----EFKTDEKNLYDALVKMKNTPVKEVR--K-KARW-VLNTAALTKIETMK-TDDGFPLLR 335 (434) T ss_pred -----ccccceeecccc-----cccccccchhhHHHHHHhhcchhhh--c-CCEE-EEcHHHHHHHHHhh-ccCCCEeec Confidence 111111111111 1112223456777778777765432 2 2344 78999999985421 112222211 Q ss_pred ccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhh Q lcl|NC_020078. 232 AGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSK 311 (339) Q Consensus 232 ~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~ 311 (339) .......|.-.+++|.+|+.++.+|....+... .-|-++|+.- ..+.-+. ..++.+....| T Consensus 336 ~~~~~~~g~~~tl~G~pV~~~~~~~~~~~~~~~--------~i~~Gdfs~~----------~i~~~~g-~~~i~~~~~~~ 396 (434) T protein:vir:62 336 PFNQAEGGIGYTLLGFPVEEEDAIDIPDSPDTP--------VFYFGDFSKF----------YIQDVIG-SLEVQKLVELF 396 (434) T ss_pred cCCCccCCCCceecceeeEEecCccCccCCCce--------EEEEeeccce----------EEEEeec-eeEEEeehhhh Confidence 111233455567999999999998743211100 0111233321 1111111 11222322223 Q ss_pred HH----HHHHHHHhCCcccc-ccceEEEEec--CC Q lcl|NC_020078. 312 LW----FIDSWLAFGVTINR-TEYAGVIKLP--AA 339 (339) Q Consensus 312 ~d----~i~g~~~~Ga~v~r-Pe~~v~i~~~--~a 339 (339) .. .++++.-+..++++ |++..++++. +| T Consensus 397 ~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~~~ 431 (434) T protein:vir:62 397 SRTNRVGFRIWNLLDAQLIHSPFEVPVYKYVLKAP 431 (434) T ss_pred cccCceEEEEEeeecceeecCcccceEEEEEeccC Confidence 22 25666777788775 9988877544 33 No 146 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=98.91 E-value=2.9e-10 Score=72.90 Aligned_cols=278 Identities=11% Similarity=0.027 Sum_probs=145.3 Q ss_pred cccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccc-cceeeeccCCCCCCCCC----CC Q lcl|NC_020078. 13 VTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGIS-KAKLQKIAPGTTPPPST----EP 87 (339) Q Consensus 13 ~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG-~~t~~~~~~g~~i~~~~----~~ 87 (339) ++- ..+++.-.+.-+.++.++.+..++.+++++++++.++.+ .+.+||+.. .+.+.-...|+..+... .+ T Consensus 1 ma~----~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~-~~~~~p~~~~~~~a~wv~E~~~~~~~~~~~s~~ 75 (305) T protein:vir:25 1 MAD----ISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGT-KTTHLPVLATLPEADWVGESATDPKGVKPTSKV 75 (305) T ss_pred CCC----ccCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccC-CcEEEEEEeCCcceEEeeccccccccccccccc Confidence 111 112222224458899999999999999999998877654 467787764 45666666666654322 22 Q ss_pred CccceEEEEeehhhhhh-hHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCccccc Q lcl|NC_020078. 88 HTSKIFLKIDTVIIARN-AEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGN 166 (339) Q Consensus 88 ~~~~~~l~ID~~~y~~~-~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~ 166 (339) ...+.++. ..|+..+ .|.+-=--++.+|+.+.+.++.++++++..|+.++. +...... .......+... T Consensus 76 ~f~~i~~~--~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~~----G~g~~~~----~~~~~~~~~~~ 145 (305) T protein:vir:25 76 TWANRTLV--AEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIF----GTDKPAS----WVSPALIPAAV 145 (305) T ss_pred ceeeEEee--eEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhhee----ccCCCCC----ccccccccccc Confidence 23333333 2333322 222211113568899999999999999999998862 1100000 00000000000 Q ss_pred c--ccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceeecceeEEE Q lcl|NC_020078. 167 V--VTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLNTKYMFAA 244 (339) Q Consensus 167 ~--~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~~G~v~~i 244 (339) . ..............+++.+..+...+....- .+ .. ++++|..|..|.+ +.+. .+...+.. ..+ T Consensus 146 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~--~~-~v~~~~~~~~l~~---lkd~----~G~~i~~~---~~l 211 (305) T protein:vir:25 146 TAGQAVEVVGGVANESDIVGATNRAAKAVASAGW-AP--DT-LLSSLALRYEVAN---IRDA----NGNPVFRD---DSF 211 (305) T ss_pred cccccccccccchhhhHHHHHHHHHHHhhhhccc-cc--ce-eEecHHHHHHHHH---hhcc----CCceeecC---Ccc Confidence 0 0111111222234456666666555543221 11 12 5789999999864 2221 11111222 368 Q ss_pred eceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeech---------hhh---H Q lcl|NC_020078. 245 FGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDD---------LSK---L 312 (339) Q Consensus 245 ~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~---------~~~---~ 312 (339) +|.+|+.+++.|.... ....+-++|++. ..+...++..+..++- ..| . T Consensus 212 ~G~Pv~~~~~~~~~~~----------~~~~~~gd~s~~----------~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~ 271 (305) T protein:vir:25 212 AGFRTFFNRNGAWDAD----------AAIEVIADSSRV----------KIGVRQDITVKFLDQATLGTGENQINLAERDM 271 (305) T ss_pred cccceEEcCccCCCCC----------ccEEEEEecceE----------EEEEecCeEEEEeeeeeeecCCceeeeeecCc Confidence 9999999998863211 112223333332 2222233333433321 011 1 Q ss_pred HHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 313 WFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 313 d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) -.+|....+|..++||++++.+..... T Consensus 272 ~~~R~~~r~~~~v~~p~a~v~~~~~~~ 298 (305) T protein:vir:25 272 VALRLKARFAYVLGVSATAQGANKTPV 298 (305) T ss_pred EEEEEEEeecceeeCcccEEEEccccc Confidence 135566778999999999987765433 No 147 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=98.89 E-value=2e-10 Score=73.76 Aligned_cols=284 Identities=12% Similarity=0.062 Sum_probs=140.6 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc--ccceeeeccCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI--SKAKLQKIAPG 78 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i--G~~t~~~~~~g 78 (339) +.-|....-. .-.|..+.....+.-.+..+.++..+... ...+.++.++++.....| +..++.. +...+.....+ T Consensus 141 ~~~~~~~~~~-~e~~~~~~~~~~~~g~lvp~~~~~~i~~~-~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~e~ 217 (437) T protein:vir:10 141 VTAFADYLKT-GEVRDVTGIALKDGKVIIPETILTPEKEV-HQFPRLGSLVRTESVTTT-TGKLPIFNNSTDLLTAHTEY 217 (437) T ss_pred hhhhHHHHHh-hhhhhhhhcccccccccchHHHHHHHHHh-hhhhhhhhcceeEeeccC-ceeeEEeecccccccccccc Confidence 1111100000 00111111111111113337777777654 455566777776655543 3445444 33455566656 Q ss_pred CCCCCCCCCCccceEEEEeehhhhhh-hHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Q lcl|NC_020078. 79 TTPPPSTEPHTSKIFLKIDTVIIARN-AEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAA 157 (339) Q Consensus 79 ~~i~~~~~~~~~~~~l~ID~~~y~~~-~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~ 157 (339) +.++....+..+++++.. .++..+ .|..-=-..+.+|+.+.+.++.+++|++..|..|+.-. T Consensus 218 ~~~~e~~~~~~~~v~~~~--~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~g~--------------- 280 (437) T protein:vir:10 218 GQTTKNATPVITPILWDL--KTYTGGYVFSQELISDSSYDWQAELQSRLIELRDNTDDSLIITAL--------------- 280 (437) T ss_pred ccccccccccceeeeeeh--hheeeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhh--------------- Confidence 655433334445544443 333322 22111011346789999999999999999998875311 Q ss_pred cccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceee Q lcl|NC_020078. 158 QMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLN 237 (339) Q Consensus 158 ~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~ 237 (339) +.+.. +.+.....+.+.+++. ..|+.... + +-..|++|..|..|.+-.. .+..|.-.. .+. T Consensus 281 -----g~~~~----~~~~~~~~~~~~~~~~---~~l~~~~~--~--~~~~~~~~~~~~~l~~lkd-~~g~~~~~~--~~~ 341 (437) T protein:vir:10 281 -----TDGIK----KTTSTYLLGDLKKVLN---VTLKPQDS--A--AASIVMSQSAYNLFDMATD-AMGRPLLQP--NVT 341 (437) T ss_pred -----ccccc----ccccccchhhHHHHHH---hhhhhhhh--c--CCEEEEcHHHHHHHHHhhc-cCCCeeecc--Ccc Confidence 01110 0011111122333221 23444332 1 2245999999999865321 111222111 234 Q ss_pred cceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhHHHHHH Q lcl|NC_020078. 238 TKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWFIDS 317 (339) Q Consensus 238 ~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~i~g 317 (339) +|.-.+++|.+|+.+++.+... ..+++..-+-++|++ ++..+.-.+++++...+-..+...+++ T Consensus 342 ~~~~~~l~G~pv~~~~~~~~~~-------~~~~~~~~~~gd~~~---------~~~~~~r~~~~~~~~~~~~~~~~~~~~ 405 (437) T protein:vir:10 342 AATGYTLLGKTVVIVDDKLFPS-------ASAGDVNIVVAPLKK---------AVINFKLTEITGQFQDTYDIWYKQLGI 405 (437) T ss_pred CCCCcccccceeEEecccccCC-------cCCCceEEEEeeccc---------cEEEEeeeceEEEEecccccccceeeE Confidence 5666789999999976542111 112222223333332 333333344555555444556667778 Q ss_pred HHHhCCccccccceEEEEecCC Q lcl|NC_020078. 318 WLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 318 ~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .+-|+.++++|++.+.|..... T Consensus 406 ~~r~d~~~~~~~a~~~l~~~~~ 427 (437) T protein:vir:10 406 FLRQNVVQASKDLIVNLTGKLK 427 (437) T ss_pred EEEEccEEecccceEEEEeecc Confidence 8889999999999998873322 No 148 >protein:vir:10123 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859253;genbank:gi:32171009;genbank:GeneID:2653345 Probab=98.89 E-value=1.2e-09 Score=69.46 Aligned_cols=320 Identities=13% Similarity=0.041 Sum_probs=166.6 Q ss_pred Cccc----------cCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhh---------ccccccccccc--c Q lcl|NC_020078. 1 MSIF----------DGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIM---------AGFVPVRSVRG--T 59 (339) Q Consensus 1 ~~~~----------~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~---------~~~v~~r~i~~--G 59 (339) |.-| -|+|-.++..+ .+++.|++.+...=+..+-+ ...++..++.+ | T Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aG 68 (404) T protein:vir:10 1 MTTVTSAQANKLYQVALFTAANRNR------------SMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAG 68 (404) T ss_pred CCCcCCcchhhhHHHHHHHHHhcCC------------hhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCC Confidence 3221 12221122222 25677777654433221111 23344445543 9 Q ss_pred ceEEEeccccceeeeccCCCCCCCCC-CCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020078. 60 STISNRGISKAKLQKIAPGTTPPPST-EPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETF 138 (339) Q Consensus 60 ~tv~i~~iG~~t~~~~~~g~~i~~~~-~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i 138 (339) ++|.|.-+...+-.....++.+++.+ .++....+|.|||.--.-..=..+++-.+-+|+|++.-..++.-+++..||.+ T Consensus 69 d~vtf~L~~~L~g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~ 148 (404) T protein:vir:10 69 DEVTFSIMHKLSKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCA 148 (404) T ss_pred cEEEEeEeeecccCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999998888766666677787764 36666778999987644222256666667899999999999999999999999 Q ss_pred HHHHHhhccccccc------------cc---cccc-ccCccccccccccCccc---cccHHHH-HHHHHHHHHHHHhcCC Q lcl|NC_020078. 139 FIMAAKAAIASDSP------------YG---TAAQ-MPGHSGGNVVTLAGAND---YKDPAKL-YAAIASLVEKFLEKDV 198 (339) Q Consensus 139 ~~~l~~aA~~~~~~------------~~---~~~~-~~g~~~~~~~~~~~~~~---~~~~~~l-~~ai~~a~~~L~e~dV 198 (339) |..|.. ++..... .. .+.. .|... ..+-...++. ..+.+.+ ++.|.++.+++++.-. T Consensus 149 ~~~laG-~rg~~~n~~~~vp~~~~~~~~~~~~N~v~APt~~--r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~ 225 (404) T protein:vir:10 149 IVHLAG-ARGDFVADDTILPTAEHPEFKKIMINDVLPPTHD--RHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAH 225 (404) T ss_pred HHHHhc-cccccccccceeeccccccccceeecccCCCCCC--cEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCC Confidence 988874 3320000 00 0000 00000 0000000000 0111111 2344456666654322 Q ss_pred CCC-----cCC-------eEEEECHHHHHHHhcccc---hhhh---cccc--cccceeecceeEEEeceEEEEecccccc Q lcl|NC_020078. 199 RPN-----EED-------MILVLPPAAFTALMQAEH---ITNG---EYVT--SAGETLNTKYMFAAFGVPVITSNNAVFG 258 (339) Q Consensus 199 ~~p-----~~~-------R~~vv~P~~~~~Ll~~~~---~~n~---d~~~--~~~~~l~~G~v~~i~G~~V~~Snnlp~~ 258 (339) |.+ .++ +++++.|.+|..|..++. +.+. -..+ ....+|..|.++.+.|+.|.+-.+.|-- T Consensus 226 pi~Pv~~~g~~~~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Ir 305 (404) T protein:vir:10 226 PLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIR 305 (404) T ss_pred CCcceEeccccccCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceee Confidence 211 222 788899999999999863 2221 1111 1234688999999999999987665521 Q ss_pred c--ccc----ccccCCCccccccccccceEEEEEeccceeEEEEEee------eeEEeeechhhhHHHHHHHHHhCCccc Q lcl|NC_020078. 259 K--TIT----DHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIP------VTSKIFFDDLSKLWFIDSWLAFGVTIN 326 (339) Q Consensus 259 ~--~~~----~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~------~~~e~~~~~~~~~d~i~g~~~~Ga~v~ 326 (339) . ..+ .+..+ ++ .......+.-.-+|++-..|++.+=.+. +.-|.+...++++ |-...++|.+=+ T Consensus 306 f~~g~~~~~~~n~~~-a~-~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~~~--i~~~~i~G~kK~ 381 (404) T protein:vir:10 306 FYQGSKVLVSENNLT-AT-TKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTE--IAISWINGLKKI 381 (404) T ss_pred ecccceeeecCCccc-cc-cccccccccchhheeecceeEEEEeeccCCCCceeEeeccccCchhh--hhhHHHhhhhhc Confidence 1 000 00000 00 0111111222234666666664442221 1112222223332 334556777777 Q ss_pred c-c------cceEEEEecCC Q lcl|NC_020078. 327 R-T------EYAGVIKLPAA 339 (339) Q Consensus 327 r-P------e~~v~i~~~~a 339 (339) | | +--++|.+++| T Consensus 382 rF~~~~g~~~DfGvi~idta 401 (404) T protein:vir:10 382 RFPEKSGKMQDHGVIAVDTA 401 (404) T ss_pred cccCCCCceeeEEEEEeccc Confidence 6 4 35678888888 No 149 >protein:vir:104439 Length: 404 # NCBI annotation: putative virion structural protein # Family: family:all:974 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794063;genbank:gi:116222008;genbank:GeneID:4397504 Probab=98.89 E-value=1.2e-09 Score=69.46 Aligned_cols=320 Identities=13% Similarity=0.041 Sum_probs=166.6 Q ss_pred Cccc----------cCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhh---------ccccccccccc--c Q lcl|NC_020078. 1 MSIF----------DGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIM---------AGFVPVRSVRG--T 59 (339) Q Consensus 1 ~~~~----------~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~---------~~~v~~r~i~~--G 59 (339) |.-| -|+|-.++..+ .+++.|++.+...=+..+-+ ...++..++.+ | T Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aG 68 (404) T protein:vir:10 1 MTTVTSAQANKLYQVALFTAANRNR------------SMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAG 68 (404) T ss_pred CCCcCCcchhhhHHHHHHHHHhcCC------------hhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCC Confidence 3221 12221122222 25677777654433221111 23344445543 9 Q ss_pred ceEEEeccccceeeeccCCCCCCCCC-CCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020078. 60 STISNRGISKAKLQKIAPGTTPPPST-EPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETF 138 (339) Q Consensus 60 ~tv~i~~iG~~t~~~~~~g~~i~~~~-~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i 138 (339) ++|.|.-+...+-.....++.+++.+ .++....+|.|||.--.-..=..+++-.+-+|+|++.-..++.-+++..||.+ T Consensus 69 d~vtf~L~~~L~g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~ 148 (404) T protein:vir:10 69 DEVTFSIMHKLSKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCA 148 (404) T ss_pred cEEEEeEeeecccCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999998888766666677787764 36666778999987644222256666667899999999999999999999999 Q ss_pred HHHHHhhccccccc------------cc---cccc-ccCccccccccccCccc---cccHHHH-HHHHHHHHHHHHhcCC Q lcl|NC_020078. 139 FIMAAKAAIASDSP------------YG---TAAQ-MPGHSGGNVVTLAGAND---YKDPAKL-YAAIASLVEKFLEKDV 198 (339) Q Consensus 139 ~~~l~~aA~~~~~~------------~~---~~~~-~~g~~~~~~~~~~~~~~---~~~~~~l-~~ai~~a~~~L~e~dV 198 (339) |..|.. ++..... .. .+.. .|... ..+-...++. ..+.+.+ ++.|.++.+++++.-. T Consensus 149 ~~~laG-~rg~~~n~~~~vp~~~~~~~~~~~~N~v~APt~~--r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~ 225 (404) T protein:vir:10 149 IVHLAG-ARGDFVADDTILPTAEHPEFKKIMINDVLPPTHD--RHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAH 225 (404) T ss_pred HHHHhc-cccccccccceeeccccccccceeecccCCCCCC--cEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCC Confidence 988874 3320000 00 0000 00000 0000000000 0111111 2344456666654322 Q ss_pred CCC-----cCC-------eEEEECHHHHHHHhcccc---hhhh---cccc--cccceeecceeEEEeceEEEEecccccc Q lcl|NC_020078. 199 RPN-----EED-------MILVLPPAAFTALMQAEH---ITNG---EYVT--SAGETLNTKYMFAAFGVPVITSNNAVFG 258 (339) Q Consensus 199 ~~p-----~~~-------R~~vv~P~~~~~Ll~~~~---~~n~---d~~~--~~~~~l~~G~v~~i~G~~V~~Snnlp~~ 258 (339) |.+ .++ +++++.|.+|..|..++. +.+. -..+ ....+|..|.++.+.|+.|.+-.+.|-- T Consensus 226 pi~Pv~~~g~~~~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Ir 305 (404) T protein:vir:10 226 PLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIR 305 (404) T ss_pred CCcceEeccccccCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceee Confidence 211 222 788899999999999863 2221 1111 1234688999999999999987665521 Q ss_pred c--ccc----ccccCCCccccccccccceEEEEEeccceeEEEEEee------eeEEeeechhhhHHHHHHHHHhCCccc Q lcl|NC_020078. 259 K--TIT----DHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIP------VTSKIFFDDLSKLWFIDSWLAFGVTIN 326 (339) Q Consensus 259 ~--~~~----~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~------~~~e~~~~~~~~~d~i~g~~~~Ga~v~ 326 (339) . ..+ .+..+ ++ .......+.-.-+|++-..|++.+=.+. +.-|.+...++++ |-...++|.+=+ T Consensus 306 f~~g~~~~~~~n~~~-a~-~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~~~--i~~~~i~G~kK~ 381 (404) T protein:vir:10 306 FYQGSKVLVSENNLT-AT-TKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTE--IAISWINGLKKI 381 (404) T ss_pred ecccceeeecCCccc-cc-cccccccccchhheeecceeEEEEeeccCCCCceeEeeccccCchhh--hhhHHHhhhhhc Confidence 1 000 00000 00 0111111222234666666664442221 1112222223332 334556777777 Q ss_pred c-c------cceEEEEecCC Q lcl|NC_020078. 327 R-T------EYAGVIKLPAA 339 (339) Q Consensus 327 r-P------e~~v~i~~~~a 339 (339) | | +--++|.+++| T Consensus 382 rF~~~~g~~~DfGvi~idta 401 (404) T protein:vir:10 382 RFPEKSGKMQDHGVIAVDTA 401 (404) T ss_pred cccCCCCceeeEEEEEeccc Confidence 6 4 35678888888 No 150 >protein:vir:819 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050552;genbank:gi:9633449;genbank:GeneID:1262254 Probab=98.89 E-value=1.2e-09 Score=69.46 Aligned_cols=320 Identities=13% Similarity=0.041 Sum_probs=166.6 Q ss_pred Cccc----------cCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhh---------ccccccccccc--c Q lcl|NC_020078. 1 MSIF----------DGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIM---------AGFVPVRSVRG--T 59 (339) Q Consensus 1 ~~~~----------~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~---------~~~v~~r~i~~--G 59 (339) |.-| -|+|-.++..+ .+++.|++.+...=+..+-+ ...++..++.+ | T Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aG 68 (404) T protein:vir:81 1 MTTVTSAQANKLYQVALFTAANRNR------------SMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAG 68 (404) T ss_pred CCCcCCcchhhhHHHHHHHHHhcCC------------hhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCC Confidence 3221 12221122222 25677777654433221111 23344445543 9 Q ss_pred ceEEEeccccceeeeccCCCCCCCCC-CCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020078. 60 STISNRGISKAKLQKIAPGTTPPPST-EPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETF 138 (339) Q Consensus 60 ~tv~i~~iG~~t~~~~~~g~~i~~~~-~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i 138 (339) ++|.|.-+...+-.....++.+++.+ .++....+|.|||.--.-..=..+++-.+-+|+|++.-..++.-+++..||.+ T Consensus 69 d~vtf~L~~~L~g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~ 148 (404) T protein:vir:81 69 DEVTFSIMHKLSKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCA 148 (404) T ss_pred cEEEEeEeeecccCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999998888766666677787764 36666778999987644222256666667899999999999999999999999 Q ss_pred HHHHHhhccccccc------------cc---cccc-ccCccccccccccCccc---cccHHHH-HHHHHHHHHHHHhcCC Q lcl|NC_020078. 139 FIMAAKAAIASDSP------------YG---TAAQ-MPGHSGGNVVTLAGAND---YKDPAKL-YAAIASLVEKFLEKDV 198 (339) Q Consensus 139 ~~~l~~aA~~~~~~------------~~---~~~~-~~g~~~~~~~~~~~~~~---~~~~~~l-~~ai~~a~~~L~e~dV 198 (339) |..|.. ++..... .. .+.. .|... ..+-...++. ..+.+.+ ++.|.++.+++++.-. T Consensus 149 ~~~laG-~rg~~~n~~~~vp~~~~~~~~~~~~N~v~APt~~--r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~ 225 (404) T protein:vir:81 149 IVHLAG-ARGDFVADDTILPTAEHPEFKKIMINDVLPPTHD--RHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAH 225 (404) T ss_pred HHHHhc-cccccccccceeeccccccccceeecccCCCCCC--cEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCC Confidence 988874 3320000 00 0000 00000 0000000000 0111111 2344456666654322 Q ss_pred CCC-----cCC-------eEEEECHHHHHHHhcccc---hhhh---cccc--cccceeecceeEEEeceEEEEecccccc Q lcl|NC_020078. 199 RPN-----EED-------MILVLPPAAFTALMQAEH---ITNG---EYVT--SAGETLNTKYMFAAFGVPVITSNNAVFG 258 (339) Q Consensus 199 ~~p-----~~~-------R~~vv~P~~~~~Ll~~~~---~~n~---d~~~--~~~~~l~~G~v~~i~G~~V~~Snnlp~~ 258 (339) |.+ .++ +++++.|.+|..|..++. +.+. -..+ ....+|..|.++.+.|+.|.+-.+.|-- T Consensus 226 pi~Pv~~~g~~~~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Ir 305 (404) T protein:vir:81 226 PLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIR 305 (404) T ss_pred CCcceEeccccccCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceee Confidence 211 222 788899999999999863 2221 1111 1234688999999999999987665521 Q ss_pred c--ccc----ccccCCCccccccccccceEEEEEeccceeEEEEEee------eeEEeeechhhhHHHHHHHHHhCCccc Q lcl|NC_020078. 259 K--TIT----DHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIP------VTSKIFFDDLSKLWFIDSWLAFGVTIN 326 (339) Q Consensus 259 ~--~~~----~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~------~~~e~~~~~~~~~d~i~g~~~~Ga~v~ 326 (339) . ..+ .+..+ ++ .......+.-.-+|++-..|++.+=.+. +.-|.+...++++ |-...++|.+=+ T Consensus 306 f~~g~~~~~~~n~~~-a~-~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~~~--i~~~~i~G~kK~ 381 (404) T protein:vir:81 306 FYQGSKVLVSENNLT-AT-TKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTE--IAISWINGLKKI 381 (404) T ss_pred ecccceeeecCCccc-cc-cccccccccchhheeecceeEEEEeeccCCCCceeEeeccccCchhh--hhhHHHhhhhhc Confidence 1 000 00000 00 0111111222234666666664442221 1112222223332 334556777777 Q ss_pred c-c------cceEEEEecCC Q lcl|NC_020078. 327 R-T------EYAGVIKLPAA 339 (339) Q Consensus 327 r-P------e~~v~i~~~~a 339 (339) | | +--++|.+++| T Consensus 382 rF~~~~g~~~DfGvi~idta 401 (404) T protein:vir:81 382 RFPEKSGKMQDHGVIAVDTA 401 (404) T ss_pred cccCCCCceeeEEEEEeccc Confidence 6 4 35678888888 No 151 >protein:vir:3298 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049514;genbank:gi:9632520;genbank:GeneID:1262006 Probab=98.89 E-value=1.2e-09 Score=69.46 Aligned_cols=320 Identities=13% Similarity=0.041 Sum_probs=166.6 Q ss_pred Cccc----------cCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhh---------ccccccccccc--c Q lcl|NC_020078. 1 MSIF----------DGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIM---------AGFVPVRSVRG--T 59 (339) Q Consensus 1 ~~~~----------~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~---------~~~v~~r~i~~--G 59 (339) |.-| -|+|-.++..+ .+++.|++.+...=+..+-+ ...++..++.+ | T Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aG 68 (404) T protein:vir:32 1 MTTVTSAQANKLYQVALFTAANRNR------------SMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAG 68 (404) T ss_pred CCCcCCcchhhhHHHHHHHHHhcCC------------hhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCC Confidence 3221 12221122222 25677777654433221111 23344445543 9 Q ss_pred ceEEEeccccceeeeccCCCCCCCCC-CCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020078. 60 STISNRGISKAKLQKIAPGTTPPPST-EPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETF 138 (339) Q Consensus 60 ~tv~i~~iG~~t~~~~~~g~~i~~~~-~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i 138 (339) ++|.|.-+...+-.....++.+++.+ .++....+|.|||.--.-..=..+++-.+-+|+|++.-..++.-+++..||.+ T Consensus 69 d~vtf~L~~~L~g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~ 148 (404) T protein:vir:32 69 DEVTFSIMHKLSKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCA 148 (404) T ss_pred cEEEEeEeeecccCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999998888766666677787764 36666778999987644222256666667899999999999999999999999 Q ss_pred HHHHHhhccccccc------------cc---cccc-ccCccccccccccCccc---cccHHHH-HHHHHHHHHHHHhcCC Q lcl|NC_020078. 139 FIMAAKAAIASDSP------------YG---TAAQ-MPGHSGGNVVTLAGAND---YKDPAKL-YAAIASLVEKFLEKDV 198 (339) Q Consensus 139 ~~~l~~aA~~~~~~------------~~---~~~~-~~g~~~~~~~~~~~~~~---~~~~~~l-~~ai~~a~~~L~e~dV 198 (339) |..|.. ++..... .. .+.. .|... ..+-...++. ..+.+.+ ++.|.++.+++++.-. T Consensus 149 ~~~laG-~rg~~~n~~~~vp~~~~~~~~~~~~N~v~APt~~--r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~ 225 (404) T protein:vir:32 149 IVHLAG-ARGDFVADDTILPTAEHPEFKKIMINDVLPPTHD--RHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAH 225 (404) T ss_pred HHHHhc-cccccccccceeeccccccccceeecccCCCCCC--cEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCC Confidence 988874 3320000 00 0000 00000 0000000000 0111111 2344456666654322 Q ss_pred CCC-----cCC-------eEEEECHHHHHHHhcccc---hhhh---cccc--cccceeecceeEEEeceEEEEecccccc Q lcl|NC_020078. 199 RPN-----EED-------MILVLPPAAFTALMQAEH---ITNG---EYVT--SAGETLNTKYMFAAFGVPVITSNNAVFG 258 (339) Q Consensus 199 ~~p-----~~~-------R~~vv~P~~~~~Ll~~~~---~~n~---d~~~--~~~~~l~~G~v~~i~G~~V~~Snnlp~~ 258 (339) |.+ .++ +++++.|.+|..|..++. +.+. -..+ ....+|..|.++.+.|+.|.+-.+.|-- T Consensus 226 pi~Pv~~~g~~~~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Ir 305 (404) T protein:vir:32 226 PLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIR 305 (404) T ss_pred CCcceEeccccccCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceee Confidence 211 222 788899999999999863 2221 1111 1234688999999999999987665521 Q ss_pred c--ccc----ccccCCCccccccccccceEEEEEeccceeEEEEEee------eeEEeeechhhhHHHHHHHHHhCCccc Q lcl|NC_020078. 259 K--TIT----DHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIP------VTSKIFFDDLSKLWFIDSWLAFGVTIN 326 (339) Q Consensus 259 ~--~~~----~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~------~~~e~~~~~~~~~d~i~g~~~~Ga~v~ 326 (339) . ..+ .+..+ ++ .......+.-.-+|++-..|++.+=.+. +.-|.+...++++ |-...++|.+=+ T Consensus 306 f~~g~~~~~~~n~~~-a~-~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~~~--i~~~~i~G~kK~ 381 (404) T protein:vir:32 306 FYQGSKVLVSENNLT-AT-TKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTE--IAISWINGLKKI 381 (404) T ss_pred ecccceeeecCCccc-cc-cccccccccchhheeecceeEEEEeeccCCCCceeEeeccccCchhh--hhhHHHhhhhhc Confidence 1 000 00000 00 0111111222234666666664442221 1112222223332 334556777777 Q ss_pred c-c------cceEEEEecCC Q lcl|NC_020078. 327 R-T------EYAGVIKLPAA 339 (339) Q Consensus 327 r-P------e~~v~i~~~~a 339 (339) | | +--++|.+++| T Consensus 382 rF~~~~g~~~DfGvi~idta 401 (404) T protein:vir:32 382 RFPEKSGKMQDHGVIAVDTA 401 (404) T ss_pred cccCCCCceeeEEEEEeccc Confidence 6 4 35678888888 No 152 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=98.86 E-value=1.2e-10 Score=75.02 Aligned_cols=281 Identities=11% Similarity=0.054 Sum_probs=162.7 Q ss_pred ccCcccchhHHHH-HHHHHHHHHHHHHHhhhcc---ccccccc-----cccceEEEeccccc--eeeeccCCC-CCCCCC Q lcl|NC_020078. 18 QRHGAGDPLADVT-EQFTGTVEGTIKRRSIMAG---FVPVRSV-----RGTSTISNRGISKA--KLQKIAPGT-TPPPST 85 (339) Q Consensus 18 ~~~~~~~~~a~~i-e~~~g~v~~~f~~~sv~~~---~v~~r~i-----~~G~tv~i~~iG~~--t~~~~~~g~-~i~~~~ 85 (339) ..+..+-...+++ |+|...|.+...+.+.|.+ +++..++ .+|+++.||..+.. ...++..|. .|+.. T Consensus 1 Ma~~~T~l~d~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~~~~l~G~~~~~~dg~~~i~~~- 79 (330) T protein:vir:10 1 MANELTKILDTITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGLLVNMPFWNDLTGDSEVLGNGDKALETG- 79 (330) T ss_pred CCCCceEeeeeechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCccccchh- Confidence 2211122222444 9999999999888777643 2332222 26999999999865 566776664 68764 Q ss_pred CCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCcccc Q lcl|NC_020078. 86 EPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGG 165 (339) Q Consensus 86 ~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~ 165 (339) .+.+.+..-+| ...--++.+.|+-...+-.|++.++.++.+...++..+..++..| ++........ ..+... .- T Consensus 80 ki~t~~~~a~i-~~~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~l-~gvf~~~~~~-~~~~~~---~~ 153 (330) T protein:vir:10 80 KITAGADIACV-LYRGRGWAANELTGVVAGSDPVRAILNRIGAYWLREDQKALIATL-NGIFATGTAG-EKGALE---ET 153 (330) T ss_pred hcccceeEEEE-EeecceeeehhhhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHHH-Hhhhhhhhcc-cchhhh---hh Confidence 34444444443 222335778899988888899999999999999998877766544 3322211111 111000 00 Q ss_pred ccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceeecceeEEEe Q lcl|NC_020078. 166 NVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLNTKYMFAAF 245 (339) Q Consensus 166 ~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~~G~v~~i~ 245 (339) ......+.....++ +.|.++..+|.++. +.-..+++.|..|..|.+. .+++.- ... ..++.|+.++ T Consensus 154 ~~~~~~~~~a~~s~----~~l~~A~~~~GD~~----~~~~~ivmhS~v~~~L~~~-~li~~~--~~s---~~~~~i~~~~ 219 (330) T protein:vir:10 154 HVSDQSKASTGIDA----GMVLDAKQLLGDSA----DQVTAIAMHSAVYTKLQKD-NLIQYI--QPT---TATINIPTYL 219 (330) T ss_pred heecccccccccCH----HHHHHHHHHhcccc----ccceEEEEcHHHHHHHHHh-hhhhhh--ccc---ccCccccccc Confidence 00111122222343 45666777775543 2346889999999999884 455432 111 1246789999 Q ss_pred ceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEee---eeEEeeechhhhHHHHHH----- Q lcl|NC_020078. 246 GVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIP---VTSKIFFDDLSKLWFIDS----- 317 (339) Q Consensus 246 G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~---~~~e~~~~~~~~~d~i~g----- 317 (339) |.+|+++..+|... ..| .++++.+-|+++.+..+ +..|..|+++.-.+.+.. T Consensus 220 G~~VivdD~~p~~~-------------~~y-------t~yl~~~GAi~~~~~~~~~~v~~EtdRd~~~g~~~l~~r~~~~ 279 (330) T protein:vir:10 220 GYRVIIDDGIAPTG-------------DIY-------TSYLFRTGSIGLNTGNPSGLTTFETSREAAKGNDMIYTRRALV 279 (330) T ss_pred ceEEEEeCCCCCCC-------------Cce-------eEEEEecCceeeecccCCccccccccCCccccceEEEEeeEEE Confidence 99999999998421 111 35677888998887543 677888887654444433 Q ss_pred HHHhCCccccccce--------EEEEecCC Q lcl|NC_020078. 318 WLAFGVTINRTEYA--------GVIKLPAA 339 (339) Q Consensus 318 ~~~~Ga~v~rPe~~--------v~i~~~~a 339 (339) +|.+|.+-..+.-- .+|..++- T Consensus 280 ~hp~G~s~~~~~~~~~~~sPt~~~L~~~~N 309 (330) T protein:vir:10 280 MHPYGVKWTGAEVDAGNITPSNADLAKFKN 309 (330) T ss_pred eeeeeeeecccccccCcCCcChHHhcCCcC Confidence 34444443322100 01111000 No 153 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=98.81 E-value=3.4e-11 Score=77.95 Aligned_cols=277 Identities=11% Similarity=0.054 Sum_probs=142.2 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc--ccceeeeccCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI--SKAKLQKIAPG 78 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i--G~~t~~~~~~g 78 (339) ............+.+.-..+..++--.+..+.|+.++.+..+..+.+++++++.++.+ .++|++ +..++.-...| T Consensus 117 ~~~~~~~~~~~~~~~a~~~~t~~~GG~lIP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~---~~~p~~~~~~~~a~~v~Eg 193 (402) T protein:vir:93 117 NEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG---LEIPRVSYTLDDDDFITDV 193 (402) T ss_pred hhHHHHHHhHHHHHhhhccCCCcCCccccchhHHHHHHHhHHhhhhhhhhceeeecCC---ceeeeeeccCCcccccccc Confidence 0000000000001110000000111123458899999999998899999998876643 334543 34456666667 Q ss_pred CCCCCCCCCCccceEEEEeehhhhhhhHHHHHHH-hcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Q lcl|NC_020078. 79 TTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEF-QTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAA 157 (339) Q Consensus 79 ~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~-q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~ 157 (339) +..+.. .++.+++++.+. ++..+..-.-+-. .+.+|+.+.+.++.++++++..++.+|... . T Consensus 194 ~~~~~~-~~~f~~i~~~~~--k~~~~i~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g---~----------- 256 (402) T protein:vir:93 194 ETAKEL-KAKGDTVKFTTN--KFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVS---P----------- 256 (402) T ss_pred cccccc-ccccceeeecce--eeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcC---C----------- Confidence 766653 355666655553 4444322221112 357889999999999999998776654211 0 Q ss_pred cccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceee Q lcl|NC_020078. 158 QMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLN 237 (339) Q Consensus 158 ~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~ 237 (339) ..|...+.... ......++...++.|.++...|..... . ...| ++++..|..|++-.+ +. +..+- T Consensus 257 -g~g~p~g~~~~--~~~~~~~~~~~~d~l~~~~~~l~~~y~--~-na~~-imn~~t~~~~~~~~~--d~------~~~~~ 321 (402) T protein:vir:93 257 -KSGLEHMSFYN--GSVKEVEGADMYDAIINALADLHEDYR--D-NATI-YMRYADYVKIISVLS--NG------TTNFF 321 (402) T ss_pred -Cccccceeeec--cccccccccchHHHHHHHHhccChhhh--c-CCEE-EEechHHHHHHHHHh--cC------CCccc Confidence 01111111111 111122334457778888777766543 1 3456 566665555443211 11 11233 Q ss_pred cceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhHHHHHH Q lcl|NC_020078. 238 TKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWFIDS 317 (339) Q Consensus 238 ~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~i~g 317 (339) .|.-.+++|.+|+.++..+.. +-++|+.-...+ + .+..+.+++...-.-.+++ T Consensus 322 ~~~~~~llG~PV~~t~~~~~i----------------~~GDf~~~~~~~---~--------~~~~~~~~~~~~~~~~~~~ 374 (402) T protein:vir:93 322 DTPAEKVFGKPVVFTDAAVKP----------------IVGDFNYFGINY---D--------GTTYDTDKDVKKGEYLFVL 374 (402) T ss_pred ccCCccccccceEEecCCCce----------------eeechhhhhhhh---h--------hhhhhhhhcccCCceEEEE Confidence 344457899999998865421 112222211000 0 0112233332211222445 Q ss_pred HHHhCCccccccceEEEEecCC Q lcl|NC_020078. 318 WLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 318 ~~~~Ga~v~rPe~~v~i~~~~a 339 (339) ..-+|.++++|++++.+++.+| T Consensus 375 ~~r~Dg~v~~~~A~~~l~ik~~ 396 (402) T protein:vir:93 375 TAWYDQQRTLDSAFRIAKAKEN 396 (402) T ss_pred EEEeCcEEechhheEEEEeecC Confidence 6679999999999999999888 No 154 >protein:vir:2770 Length: 318 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612887;genbank:gi:20065804;genbank:GeneID:935710 Probab=98.80 E-value=1.6e-09 Score=68.87 Aligned_cols=251 Identities=11% Similarity=0.077 Sum_probs=138.5 Q ss_pred CccccCcccCCCcccCCccCcccchhH--------------HHHHHHHHHHHHHHHHHhhhc---------ccccccccc Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLA--------------DVTEQFTGTVEGTIKRRSIMA---------GFVPVRSVR 57 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a--------------~~ie~~~g~v~~~f~~~sv~~---------~~v~~r~i~ 57 (339) |. +. ..+++.+ ..++.|++.|...-.+.+-+. .+++..++. T Consensus 1 mt------------~~----~~~~~~~~~~~~~ft~~~~~~~~vk~ws~~l~~~~~~~~~~~~~~g~~~~~~I~r~~dL~ 64 (318) T protein:vir:27 1 MT------------TV----TSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLN 64 (318) T ss_pred CC------------cc----CCCChHHHHHHHHHHHHhcCChHHHHHHHhhhhHHHhhhhhhcccCCCCCceEEEeccCC Confidence 21 11 1112221 146889998876655543322 233334564 Q ss_pred --ccceEEEeccccceeeeccCCCCCCCCC-CCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHH Q lcl|NC_020078. 58 --GTSTISNRGISKAKLQKIAPGTTPPPST-EPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMY 134 (339) Q Consensus 58 --~G~tv~i~~iG~~t~~~~~~g~~i~~~~-~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~ 134 (339) .|++|.|.-+...+-.....++.+++.+ .++.....|.|||..-.-..=..+++-.+-+|+|++.-..++.-+++.. T Consensus 65 K~~GD~Vtf~L~~~L~g~gv~Gd~~lEGnee~L~~~~d~l~IDq~r~~V~~gg~msqqRt~~dlR~~ar~~L~~w~~~~~ 144 (318) T protein:vir:27 65 KQAGDEVTFSIMHKLSKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQ 144 (318) T ss_pred CCCccEEEEeEeeccccCccccCceeeccccceEEEeeEEEEeeeccccccccchhhhhhhHHHHHHHHHHHHHHHHHHH Confidence 3999999988877766666677777754 3556667888998764322224566666789999999999999999999 Q ss_pred HHHHHHHHHhhccc-cc----------cccc---cccc-ccCccccccccccCccc---cccHHHH-HHHHHHHHHHHHh Q lcl|NC_020078. 135 DETFFIMAAKAAIA-SD----------SPYG---TAAQ-MPGHSGGNVVTLAGAND---YKDPAKL-YAAIASLVEKFLE 195 (339) Q Consensus 135 D~~i~~~l~~aA~~-~~----------~~~~---~~~~-~~g~~~~~~~~~~~~~~---~~~~~~l-~~ai~~a~~~L~e 195 (339) ||.+|..|..+-.. .+ +-.. .+.. .|... .++-..+++. .++.+.+ ++.|-++.+++++ T Consensus 145 Dq~~~v~laGarg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~--r~~~~g~at~~~~l~stD~~s~~lid~~~~~~~~ 222 (318) T protein:vir:27 145 DQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHD--RHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDE 222 (318) T ss_pred HHHHHHHHhhcccccccccceEecccCccchhhhhcccCCCCCC--cEEeccCccchhhhhhcccccHHHHHHHHHHHHH Confidence 99999988643310 00 0000 0000 00000 0000001110 0111111 2234445555544 Q ss_pred --cCCCC---CcCC-------eEEEECHHHHHHHhcccc------h-hhhccccc-ccceeecceeEEEeceEEEEeccc Q lcl|NC_020078. 196 --KDVRP---NEED-------MILVLPPAAFTALMQAEH------I-TNGEYVTS-AGETLNTKYMFAAFGVPVITSNNA 255 (339) Q Consensus 196 --~dV~~---p~~~-------R~~vv~P~~~~~Ll~~~~------~-~n~d~~~~-~~~~l~~G~v~~i~G~~V~~Snnl 255 (339) ...+| ..++ +++++.|.+|..|..+.. + .++...+. ...+|..|.++.+.|+-|.+-.++ T Consensus 223 ~a~pi~PV~v~g~~~~~~~~~yV~~~~p~q~~~Lrtdt~~~~w~d~q~~A~~r~~g~knPLF~G~~gm~ngvil~~~~~v 302 (318) T protein:vir:27 223 MAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGM 302 (318) T ss_pred hCCCCcceeeccccccCCcceEEEEechHHHHHHhhcCCCHHHHHHHHHHHhcccccCCCceecceeeecCEEEeecCCc Confidence 22211 1122 778899999999998752 2 22332221 234588999999999999998876 Q ss_pred c--ccccccccccCCCccccccccccceEE Q lcl|NC_020078. 256 V--FGKTITDHLLSNANNEKAYDGDFKDIV 283 (339) Q Consensus 256 p--~~~~~~~~~l~~~~~~~~y~~~~~~~~ 283 (339) | +. +|....| +.++ T Consensus 303 pIrf~----------~G~~v~~----~~~~ 318 (318) T protein:vir:27 303 PIRFY----------QGQRFWY----QRIT 318 (318) T ss_pred cEEEc----------CCCeeee----eecC Confidence 5 22 1111111 1111 No 155 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=98.80 E-value=7.6e-11 Score=76.06 Aligned_cols=277 Identities=12% Similarity=0.048 Sum_probs=141.1 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc--ccceeeeccCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI--SKAKLQKIAPG 78 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i--G~~t~~~~~~g 78 (339) -........+....|.-..+..++--.+..+.|+.++.+..+..+.+++++++.++.+ ..+|++ +..++.-...| T Consensus 102 ~~~~~~~~~~~~~~~al~~~t~s~gG~~IP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~---~~~p~~~~~~~~a~~v~E~ 178 (387) T protein:vir:93 102 NEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG---LEIPRVSYTLDDDDFITDV 178 (387) T ss_pred hhhhhhhhhhHHHHHhhccCcCCCCceeechhHHHHHHHHHHhhchhhhheeeeecCC---ceEEEEeecCCccccccCc Confidence 0000001111111111000000010113448889999999988888999988876643 234443 44556666667 Q ss_pred CCCCCCCCCCccceEEEEeehhhhhh-hHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Q lcl|NC_020078. 79 TTPPPSTEPHTSKIFLKIDTVIIARN-AEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAA 157 (339) Q Consensus 79 ~~i~~~~~~~~~~~~l~ID~~~y~~~-~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~ 157 (339) +..+.. .++.+++++. ..++..+ .|.+-=-..+.+|+.+.+.++.++++++..++.+|.. +. T Consensus 179 ~~~~~~-~~~f~~v~~~--~~k~~~~~~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~---g~----------- 241 (387) T protein:vir:93 179 ETAKEL-KLKGDTVKFT--TNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAV---SP----------- 241 (387) T ss_pred cccccc-ccccceeeee--heeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhc---CC----------- Confidence 766653 3555555444 4455553 2332111235689999999999999999877655411 11 Q ss_pred cccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceee Q lcl|NC_020078. 158 QMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLN 237 (339) Q Consensus 158 ~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~ 237 (339) ..|...+.... ......+....++.|.++...|..... . ...| ++++..|..|++-.+ +. +..+- T Consensus 242 -g~g~p~g~l~~--~~~~~v~~~~~~d~i~~~~~~l~~~~~--~-~a~~-~mn~~t~~~~~~~~~--d~------~~~~~ 306 (387) T protein:vir:93 242 -KSGLDHMSFYN--GSVKEVEGADMYDAIINALADLHEDYR--D-NATI-YMRYADYVKIISVLS--NG------TTNFF 306 (387) T ss_pred -Cccccceeeec--cccccccccchHHHHHHHHhccChhhh--c-CCEE-EEechHHHHHHHHHh--cC------CCccc Confidence 00111111100 111122334457778888777766543 1 2456 667766655543211 11 11233 Q ss_pred cceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhHHHHHH Q lcl|NC_020078. 238 TKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWFIDS 317 (339) Q Consensus 238 ~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~i~g 317 (339) .|.-.+++|.+|+.++..+.. +-|+|+.-.. .+ + .+..+.+++...-...+.+ T Consensus 307 ~~~~~~llG~PV~~~~~~~~~----------------~~GDf~~~~~-~~--~--------~~~~~~~~~~~~~~~~~~~ 359 (387) T protein:vir:93 307 DTPAEKVFGKPVVFTDAAVKP----------------IVGDFNYFGI-NY--D--------GTTYDTDKDVKKGEYLFVL 359 (387) T ss_pred ccCCccccccceEEecCCCce----------------eeeehhhhhe-eh--h--------hheeeecccccCCceeEEE Confidence 344457999999998765421 1122322110 00 0 0111222222211223345 Q ss_pred HHHhCCccccccceEEEEecCC Q lcl|NC_020078. 318 WLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 318 ~~~~Ga~v~rPe~~v~i~~~~a 339 (339) ..-||.++++||+.+.+++.+| T Consensus 360 ~~r~d~~v~~~eA~~~l~~k~~ 381 (387) T protein:vir:93 360 TAWYDQQRTLDSAFRIAKAKEN 381 (387) T ss_pred EeeeCceeechhheEEEEeecC Confidence 5678999999999999998777 No 156 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=98.77 E-value=3.7e-10 Score=72.27 Aligned_cols=278 Identities=14% Similarity=0.051 Sum_probs=154.0 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecccc---ceeeeccC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISK---AKLQKIAP 77 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~---~t~~~~~~ 77 (339) ..-..| .+...-.|.+.....|. .+.-+.|+.++.+..+..+.++++++...+.+| ++++++... ..+..... T Consensus 101 ~~~~~~-~~~~~~~ra~~t~~~gg--~liP~~~~~~Ii~~~~~~~~l~~l~~~~~~~~~-~~~~~~~~~~~~~~~~~~~E 176 (421) T protein:vir:13 101 SKTIRG-IQLSEEERDIMSSTNNG--AVIPQEFVNEFEKLKEGYPSLKEHCHVIPVNRN-AGKMPVRAGASVDKLANLAK 176 (421) T ss_pred HHhhhc-cchhHHHhhccccCCcc--eecchhhHHHHHHHHHhhhhhhhhceeeeccCC-ceEEEEeecCCccceeeccc Confidence 000011 11111223222222221 134488999999998889999999887776544 455554432 23444555 Q ss_pred CCCCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Q lcl|NC_020078. 78 GTTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAA 157 (339) Q Consensus 78 g~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~ 157 (339) |..++.. .+...++++.+.+. +.-..|.+-=-.++.+|+.+.+.+++++++++..|+.++..+. T Consensus 177 ~~~~~~s-~~~f~~i~~~~~k~-~~~v~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~~~~-------------- 240 (421) T protein:vir:13 177 DTELVKA-MLKTQPMAYDIDDY-GLLAPIDNSLLEDSEINFLEFVNEEFAEFAVNTENAEIVKQAK-------------- 240 (421) T ss_pred ccccccc-ccceeEEEeeeeee-EeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHhhhhHhhhhh-------------- Confidence 6666543 45555666655421 1222232211123567899999999999999999988764321 Q ss_pred cccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceee Q lcl|NC_020078. 158 QMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLN 237 (339) Q Consensus 158 ~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~ 237 (339) |.. ..+...+ ++.|.++...|..+..+. . .+|++|..|..|.+-.. .+..|.-.. .. T Consensus 241 ---g~~--------~~~~~~~----~d~i~~~~~~l~~~~~~~---a-~~v~n~~~~~~l~~lkd-~~G~~i~~~---~~ 297 (421) T protein:vir:13 241 ---AVL--------AEETIND----YAGLVKTINSLVPNARKR---A-IIVTNSDGRAYLDGLMD-KQGRPLLKE---LS 297 (421) T ss_pred ---hcc--------ccccccc----hHHHHHHHHHhhhhhcCC---C-EEEEcHHHHHHHHHhhc-CCCceeecC---cC Confidence 100 0111122 345556666665554421 2 44789999999864211 122222211 23 Q ss_pred cceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhH--HHH Q lcl|NC_020078. 238 TKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKL--WFI 315 (339) Q Consensus 238 ~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~--d~i 315 (339) +|...+++|.+|+.+++.|.... +....+-++|+ +++......+++++..++....- ..+ T Consensus 298 ~~~~~tl~G~pV~~~~~~~~~~~---------~~~~~~~gd~~---------~~~~~~~~~~~~v~~~~~~~f~~~~~~~ 359 (421) T protein:vir:13 298 DGGDLVFKGRPVIELEESIFDVG---------DETKFIVSDFK---------TLIKFMDRKQYLIDQSKEAGYTKNETIA 359 (421) T ss_pred CCCCceecceeeEEeccccccCC---------CceEEEEEecc---------ccEEEEEecceEEEeecccccccCeeEE Confidence 45667899999999999874321 11111112222 23445555667777766654222 357 Q ss_pred HHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 316 DSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 316 ~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) ++..-++.++++|+++..+..... T Consensus 360 r~~~r~d~~~~~~~a~~~~~~~~~ 383 (421) T protein:vir:13 360 RIIERFDVNSPLDKSSDAEKIRKF 383 (421) T ss_pred EEEeeecceeecchhhheeeeccc Confidence 788899999999999765554432 No 157 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=98.74 E-value=8.5e-09 Score=64.83 Aligned_cols=272 Identities=13% Similarity=0.073 Sum_probs=161.9 Q ss_pred ccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccc--cceeeeccCCC Q lcl|NC_020078. 2 SIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGIS--KAKLQKIAPGT 79 (339) Q Consensus 2 ~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG--~~t~~~~~~g~ 79 (339) -.-.-++|--|++----=+.+=++| |.++|+.-+.+=+ ...+..|...+..|++++++.-. ....++...|+ T Consensus 1 ~~~~~~~~e~nlt~~~dl~~~~siD--f~~~f~~~i~~L~----~~LGv~r~~pla~GstIkt~k~~~y~gda~dVaEGe 74 (296) T protein:vir:98 1 MVTSRTYPEENLIKSTDLKYPITID--VTNKFQENISKLL----EMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGE 74 (296) T ss_pred CCCccccCcCCCcchhhhhhhhhhh--hHHHHhhhHHHHH----HHhhhcccccccCCCEEeeccceeeeeccccccCCc Confidence 3445678887877653322333444 9999998886544 34566676778889999775422 23567888999 Q ss_pred CCCCCCCCCcc---ceEEEEeehhhhhhhHHHHHHH-hc--CcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Q lcl|NC_020078. 80 TPPPSTEPHTS---KIFLKIDTVIIARNAEPMLDEF-QT--DFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPY 153 (339) Q Consensus 80 ~i~~~~~~~~~---~~~l~ID~~~y~~~~vdd~D~~-q~--~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~ 153 (339) .|+.... ... ..++.+. ||..- + =||+ |. ..|...+..+++..++++++|..++..|-.+.. T Consensus 75 ~Iplskv-t~~~~~t~t~~ik--K~rK~-t--TdEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~LktaT~------ 142 (296) T protein:vir:98 75 VIPLSKV-ERKIHSEKKIELK--KYRKA-T--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTG------ 142 (296) T ss_pred ccchhhh-eeeecceEEEEee--ccccc-c--CHHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHHhcccc------ Confidence 9987643 322 3567776 66555 3 4777 54 366999999999999999999999877743221 Q ss_pred cccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhccccccc Q lcl|NC_020078. 154 GTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAG 233 (339) Q Consensus 154 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~ 233 (339) ++ . ++...=...|...+.++..++.+.+- ...+++|+|...+.+|++.++.... T Consensus 143 ------------t~-~---~t~~~lQ~Ala~~~~~l~~~feded~----~~~V~FVnP~D~a~ylg~a~it~qt------ 196 (296) T protein:vir:98 143 ------------TQ-D---ALGAGLQGALASAWGKLQVLFEDYGS----ERAIVFANSLDVAEYIAKAGITTQT------ 196 (296) T ss_pred ------------ee-e---echhhHHHHHHHHhhhhhhhccccCC----CceEEEEehHHHHHHhcCCccchhh------ Confidence 00 0 00000012344556666677755431 2469999999999999998764221 Q ss_pred ceeecc-eeEEEeceEEEEecccccccccc--------------ccccCCCccccccccccceEEEEEeccceeEEEEEe Q lcl|NC_020078. 234 ETLNTK-YMFAAFGVPVITSNNAVFGKTIT--------------DHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTI 298 (339) Q Consensus 234 ~~l~~G-~v~~i~G~~V~~Snnlp~~~~~~--------------~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~ 298 (339) ...+ .+.+++|..|+.|+.+|.+...+ +..++.+++-.. +-+..+|+ .... T Consensus 197 --~fG~tyl~nfLG~~II~S~kV~~G~~~~T~~~Ni~~ay~~~~~~~l~~~f~~~~---d~tglIGv---------~h~~ 262 (296) T protein:vir:98 197 --AFGLTYLVDFTGTVIISTNDVTKGEIWATVPENIIFAYINPNNSELAKEFNLYG---DPTGYIGM---------NHFQ 262 (296) T ss_pred --eechhhhhhccccEEEEcCcCCCceEEEeeecceEEEeecccccchhhhhcccc---ccccceEE---------Eecc Confidence 1122 23348999999999999654322 122333332211 22222221 1111 Q ss_pred eeeEEeeechhhhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 299 PVTSKIFFDDLSKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 299 ~~~~e~~~~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) . .+.--+..-++-|...| +=|+|++++..+++| T Consensus 263 ~-----~~~~t~eT~~~~~~~lf---pE~~dgiv~~tI~~~ 295 (296) T protein:vir:98 263 E-----NTTLTIQTLLVSGMLMY---PERIDGIVKVTLTPG 295 (296) T ss_pred c-----cceeeehhHhHhHHHhc---ccccceEEEEEecCC Confidence 1 11111222233333333 347789999999999 No 158 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=98.72 E-value=1.4e-10 Score=74.66 Aligned_cols=277 Identities=10% Similarity=0.008 Sum_probs=140.1 Q ss_pred Ccccc-CcccC-----CCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc--cccee Q lcl|NC_020078. 1 MSIFD-GQTPS-----YDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI--SKAKL 72 (339) Q Consensus 1 ~~~~~-~~~~~-----~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i--G~~t~ 72 (339) .+... ..+.. -...|....+...+--.+.-+.++.++.+..+..+.+++++++.++.+ . ++|++ +..++ T Consensus 61 r~~~~~~~~~~~~~~~~~~~~al~~~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~v~~~~~-~--~~p~~~~~~~~a 137 (352) T protein:vir:78 61 RHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG-L--EIPRVSYTLDDD 137 (352) T ss_pred HHHhhhhHHHHHHhhHHHHHHHhccCCCCCCceeccHhHHHHHHHHHHhhcchhhheeeEecCC-c--eEEEEecCCCcc Confidence 00000 00000 001111000011111113448899999999999999999988776543 2 34433 33455 Q ss_pred eeccCCCCCCCCCCCCccceEEEEeehhhhhh-hHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Q lcl|NC_020078. 73 QKIAPGTTPPPSTEPHTSKIFLKIDTVIIARN-AEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDS 151 (339) Q Consensus 73 ~~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~-~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~ 151 (339) .-...|+.++.. .+..+++++.+. ++..+ .|.+-=--++.+|+.+.+.++.++++++..++.++. .+.. T Consensus 138 ~~v~E~~~~~~~-~~~f~~v~~~~~--k~~~~i~is~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~---~g~g---- 207 (352) T protein:vir:78 138 DFITDVETAKEL-KLKGDTVKFTTN--KFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALA---VSPK---- 207 (352) T ss_pred cccccccccccc-cccceeeeecce--eEEeechhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhhhh---cCCC---- Confidence 555566666654 356666666554 44443 232211113568999999999999999875554431 1110 Q ss_pred cccccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhccccc Q lcl|NC_020078. 152 PYGTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTS 231 (339) Q Consensus 152 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~ 231 (339) . +...|.. .. ......+....++.|.++...|..... ....| +++|..|..|++-.+=. T Consensus 208 ---~-~~~~g~l--~~----~~~~~~t~~~~~d~i~~~~~~l~~~~~---~~a~~-~mn~~t~~~l~~~~~~~------- 266 (352) T protein:vir:78 208 ---S-GLEHMSF--YN----GSVKEVEGANMYDAIINALADLHEDYR---DNATI-YMRYADYVKIISVLSNG------- 266 (352) T ss_pred ---C-cccccce--ec----cccccccccchHHHHHHHHhccChhhh---cCCEE-EEehHHHHHHHHHHhcc------- Confidence 0 0001100 00 011112223346777777777765433 12345 77888877776522101 Q ss_pred ccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhh Q lcl|NC_020078. 232 AGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSK 311 (339) Q Consensus 232 ~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~ 311 (339) +..+-.|.-.+++|.+|+.++..+.. + -|+|+.-. +-.+ .+..+.+++...- T Consensus 267 -~~~~~~~~~~~llG~PV~~~~~~~~~-------~---------~Gdf~~~~---~~~~--------~~~~~~~~~~~~g 318 (352) T protein:vir:78 267 -TTNFFDTPAEKVFGKPVVFTDAAVKP-------I---------VGDFNYFG---INYD--------GTTYDTDKDVKKG 318 (352) T ss_pred -CCcccccCCccccccceEEecCCCce-------e---------Eeehhhhh---hhhh--------hheeeeeccccCC Confidence 11233444457899999998765421 1 12222110 0001 0112233332221 Q ss_pred HHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 312 LWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 312 ~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) --.+++.+-|+.++++||+.+.+.+.++ T Consensus 319 ~~~f~~~~r~Dg~~~~~eA~~~l~~~a~ 346 (352) T protein:vir:78 319 EYLFVLTAWYDQQRTLDSAFRIAKAKES 346 (352) T ss_pred eeEEEEEeeeCceeechhheEEEEeecc Confidence 2234556788999999999999988888 No 159 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=98.71 E-value=6.8e-11 Score=76.34 Aligned_cols=277 Identities=11% Similarity=0.052 Sum_probs=140.4 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc--ccceeeeccCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI--SKAKLQKIAPG 78 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i--G~~t~~~~~~g 78 (339) ..............+.-..+..++--.+..+.|+.++.+..+..+.+++++++.++.+ .++|++ +..++.-...| T Consensus 102 ~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~---~~~p~~~~~~~~a~~v~Eg 178 (387) T protein:vir:94 102 NEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG---LEIPRVSYTLDDDDFITDV 178 (387) T ss_pred hhHHHHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCC---ceeeeeeccCCcccccccc Confidence 0000000000000000000000000123458899999999988889999988876543 234443 33456566667 Q ss_pred CCCCCCCCCCccceEEEEeehhhhhhhHHHHHHH-hcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Q lcl|NC_020078. 79 TTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEF-QTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAA 157 (339) Q Consensus 79 ~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~-q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~ 157 (339) +..+.. .++.+++++.. .++..+..-.-+-. .+.+|+.+.+.++.++++++..++.+|... . T Consensus 179 ~~~~~~-~~~f~~v~l~~--~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g---~----------- 241 (387) T protein:vir:94 179 ETAKEL-KAKGDTVKFTT--NKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVS---P----------- 241 (387) T ss_pred cccccc-ccccceeeech--heeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcC---C----------- Confidence 766653 35566655544 35444332222212 256889999999999999998776654211 1 Q ss_pred cccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceee Q lcl|NC_020078. 158 QMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLN 237 (339) Q Consensus 158 ~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~ 237 (339) ..|...+.... ......+....++.|.++...|..... . ...| ++++..|..|++-.+ + ++..+- T Consensus 242 -g~g~~~g~~~~--~~~~~~~~~~~~d~i~~~~~~l~~~y~--~-na~~-imn~~t~~~~~~~~~--~------~~~~~~ 306 (387) T protein:vir:94 242 -KSGLEHMSFYN--GSVKEVEGADMYDAIINALADLHEDYR--D-NATI-YMRYADYVKIISVLS--N------GTTNFF 306 (387) T ss_pred -Cccccceeeec--cccccccccchHHHHHHHHhccChhhh--c-CCEE-EEechHHHHHHHHHh--c------CCCccc Confidence 00111111111 111122234456777777777765433 1 2456 566666655543211 1 011233 Q ss_pred cceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhHHHHHH Q lcl|NC_020078. 238 TKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWFIDS 317 (339) Q Consensus 238 ~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~i~g 317 (339) .|.-.+++|.+|+.++..+.- + -++|+.-... + + .+..+.+++...---.+++ T Consensus 307 ~~~~~~llG~PV~~~~~~~~~-------~---------~GDf~~~~~~-~--~--------~~~~~~~~~~~~~~~~~~~ 359 (387) T protein:vir:94 307 DTPAEKVFGKPVVFTDAAVKP-------I---------VGDFNYFGIN-Y--D--------GTTYDTDKDVKKGEYLFVL 359 (387) T ss_pred ccCCccccccceEEecCCCce-------e---------eechhhhhhh-h--h--------hhhheecccccCCceEEEE Confidence 444457899999998865421 1 1122211100 0 0 0111222222111122444 Q ss_pred HHHhCCccccccceEEEEecCC Q lcl|NC_020078. 318 WLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 318 ~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .+-|+.++++|++.+.+++.+| T Consensus 360 ~~r~Dg~v~~~~A~~~l~~ka~ 381 (387) T protein:vir:94 360 TAWYDQQRTLDSAFRIAKAKEN 381 (387) T ss_pred EEEeCcEeechhheEEEEeecC Confidence 5579999999999999999998 No 160 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=98.71 E-value=6.8e-11 Score=76.34 Aligned_cols=277 Identities=11% Similarity=0.052 Sum_probs=140.4 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc--ccceeeeccCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI--SKAKLQKIAPG 78 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i--G~~t~~~~~~g 78 (339) ..............+.-..+..++--.+..+.|+.++.+..+..+.+++++++.++.+ .++|++ +..++.-...| T Consensus 102 ~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~---~~~p~~~~~~~~a~~v~Eg 178 (387) T protein:vir:96 102 NEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG---LEIPRVSYTLDDDDFITDV 178 (387) T ss_pred hhHHHHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCC---ceeeeeeccCCcccccccc Confidence 0000000000000000000000000123458899999999988889999988876543 234443 33456566667 Q ss_pred CCCCCCCCCCccceEEEEeehhhhhhhHHHHHHH-hcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Q lcl|NC_020078. 79 TTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEF-QTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAA 157 (339) Q Consensus 79 ~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~-q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~ 157 (339) +..+.. .++.+++++.. .++..+..-.-+-. .+.+|+.+.+.++.++++++..++.+|... . T Consensus 179 ~~~~~~-~~~f~~v~l~~--~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g---~----------- 241 (387) T protein:vir:96 179 ETAKEL-KAKGDTVKFTT--NKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVS---P----------- 241 (387) T ss_pred cccccc-ccccceeeech--heeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcC---C----------- Confidence 766653 35566655544 35444332222212 256889999999999999998776654211 1 Q ss_pred cccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceee Q lcl|NC_020078. 158 QMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLN 237 (339) Q Consensus 158 ~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~ 237 (339) ..|...+.... ......+....++.|.++...|..... . ...| ++++..|..|++-.+ + ++..+- T Consensus 242 -g~g~~~g~~~~--~~~~~~~~~~~~d~i~~~~~~l~~~y~--~-na~~-imn~~t~~~~~~~~~--~------~~~~~~ 306 (387) T protein:vir:96 242 -KSGLEHMSFYN--GSVKEVEGADMYDAIINALADLHEDYR--D-NATI-YMRYADYVKIISVLS--N------GTTNFF 306 (387) T ss_pred -Cccccceeeec--cccccccccchHHHHHHHHhccChhhh--c-CCEE-EEechHHHHHHHHHh--c------CCCccc Confidence 00111111111 111122234456777777777765433 1 2456 566666655543211 1 011233 Q ss_pred cceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhHHHHHH Q lcl|NC_020078. 238 TKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWFIDS 317 (339) Q Consensus 238 ~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~i~g 317 (339) .|.-.+++|.+|+.++..+.- + -++|+.-... + + .+..+.+++...---.+++ T Consensus 307 ~~~~~~llG~PV~~~~~~~~~-------~---------~GDf~~~~~~-~--~--------~~~~~~~~~~~~~~~~~~~ 359 (387) T protein:vir:96 307 DTPAEKVFGKPVVFTDAAVKP-------I---------VGDFNYFGIN-Y--D--------GTTYDTDKDVKKGEYLFVL 359 (387) T ss_pred ccCCccccccceEEecCCCce-------e---------eechhhhhhh-h--h--------hhhheecccccCCceEEEE Confidence 444457899999998865421 1 1122211100 0 0 0111222222111122444 Q ss_pred HHHhCCccccccceEEEEecCC Q lcl|NC_020078. 318 WLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 318 ~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .+-|+.++++|++.+.+++.+| T Consensus 360 ~~r~Dg~v~~~~A~~~l~~ka~ 381 (387) T protein:vir:96 360 TAWYDQQRTLDSAFRIAKAKEN 381 (387) T ss_pred EEEeCcEeechhheEEEEeecC Confidence 5579999999999999999998 No 161 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=98.71 E-value=6.8e-11 Score=76.34 Aligned_cols=277 Identities=11% Similarity=0.052 Sum_probs=140.4 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc--ccceeeeccCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI--SKAKLQKIAPG 78 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i--G~~t~~~~~~g 78 (339) ..............+.-..+..++--.+..+.|+.++.+..+..+.+++++++.++.+ .++|++ +..++.-...| T Consensus 102 ~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~---~~~p~~~~~~~~a~~v~Eg 178 (387) T protein:vir:26 102 NEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG---LEIPRVSYTLDDDDFITDV 178 (387) T ss_pred hhHHHHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCC---ceeeeeeccCCcccccccc Confidence 0000000000000000000000000123458899999999988889999988876543 234443 33456566667 Q ss_pred CCCCCCCCCCccceEEEEeehhhhhhhHHHHHHH-hcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Q lcl|NC_020078. 79 TTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEF-QTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAA 157 (339) Q Consensus 79 ~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~-q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~ 157 (339) +..+.. .++.+++++.. .++..+..-.-+-. .+.+|+.+.+.++.++++++..++.+|... . T Consensus 179 ~~~~~~-~~~f~~v~l~~--~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g---~----------- 241 (387) T protein:vir:26 179 ETAKEL-KAKGDTVKFTT--NKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVS---P----------- 241 (387) T ss_pred cccccc-ccccceeeech--heeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcC---C----------- Confidence 766653 35566655544 35444332222212 256889999999999999998776654211 1 Q ss_pred cccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceee Q lcl|NC_020078. 158 QMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLN 237 (339) Q Consensus 158 ~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~ 237 (339) ..|...+.... ......+....++.|.++...|..... . ...| ++++..|..|++-.+ + ++..+- T Consensus 242 -g~g~~~g~~~~--~~~~~~~~~~~~d~i~~~~~~l~~~y~--~-na~~-imn~~t~~~~~~~~~--~------~~~~~~ 306 (387) T protein:vir:26 242 -KSGLEHMSFYN--GSVKEVEGADMYDAIINALADLHEDYR--D-NATI-YMRYADYVKIISVLS--N------GTTNFF 306 (387) T ss_pred -Cccccceeeec--cccccccccchHHHHHHHHhccChhhh--c-CCEE-EEechHHHHHHHHHh--c------CCCccc Confidence 00111111111 111122234456777777777765433 1 2456 566666655543211 1 011233 Q ss_pred cceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhHHHHHH Q lcl|NC_020078. 238 TKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWFIDS 317 (339) Q Consensus 238 ~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~i~g 317 (339) .|.-.+++|.+|+.++..+.- + -++|+.-... + + .+..+.+++...---.+++ T Consensus 307 ~~~~~~llG~PV~~~~~~~~~-------~---------~GDf~~~~~~-~--~--------~~~~~~~~~~~~~~~~~~~ 359 (387) T protein:vir:26 307 DTPAEKVFGKPVVFTDAAVKP-------I---------VGDFNYFGIN-Y--D--------GTTYDTDKDVKKGEYLFVL 359 (387) T ss_pred ccCCccccccceEEecCCCce-------e---------eechhhhhhh-h--h--------hhhheecccccCCceEEEE Confidence 444457899999998865421 1 1122211100 0 0 0111222222111122444 Q ss_pred HHHhCCccccccceEEEEecCC Q lcl|NC_020078. 318 WLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 318 ~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .+-|+.++++|++.+.+++.+| T Consensus 360 ~~r~Dg~v~~~~A~~~l~~ka~ 381 (387) T protein:vir:26 360 TAWYDQQRTLDSAFRIAKAKEN 381 (387) T ss_pred EEEeCcEeechhheEEEEeecC Confidence 5579999999999999999998 No 162 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=98.71 E-value=1.9e-09 Score=68.39 Aligned_cols=267 Identities=15% Similarity=0.111 Sum_probs=148.1 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccc-cceeeeccCCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGIS-KAKLQKIAPGT 79 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG-~~t~~~~~~g~ 79 (339) |.= .++.-+.+|..| =++ .|++.|+.-+.+=+ .+.+..|...+..|++++||... ....+++..|+ T Consensus 1 mAe-~nlt~~~dL~~~------~si--dfv~~f~~~i~~L~----~~Lgi~r~~p~a~G~tIt~pK~~~tgda~dVaEGe 67 (295) T protein:vir:99 1 MAE-KNLNTMADLGDI------KSI--DFVNKFSKNINDLL----KLLGVTRRETLTNDLKIQTYKWEVTLDQTDPGEGE 67 (295) T ss_pred CCC-cccccHhhccCc------eee--hhhHHhhhhHHHHH----HHhccccccccccCCeEEeeeeeeecccccccCCc Confidence 221 111111223222 122 39999997665433 34566676778889999999976 34778999999 Q ss_pred CCCCCCCCCc--cceEEEEeehhhhhhhHHHHHHH-hc--CcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Q lcl|NC_020078. 80 TPPPSTEPHT--SKIFLKIDTVIIARNAEPMLDEF-QT--DFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYG 154 (339) Q Consensus 80 ~i~~~~~~~~--~~~~l~ID~~~y~~~~vdd~D~~-q~--~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~ 154 (339) .|+......+ ...++.+. ||... + =||+ |. ..|...|..+++..++++.+|..++..|-.+... T Consensus 68 ~Iplskvt~~~~~t~t~kik--K~rK~-t--TdEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lktat~t------ 136 (295) T protein:vir:99 68 TIPLSKVTRTKDKDYTVKWF--KKRRA-T--TAEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKTKPTK------ 136 (295) T ss_pred ccchhhheeeeeeeeEEEee--eeccc-c--cHHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhccCcee------ Confidence 9987653221 23566665 66554 3 4666 54 3679999999999999999999998777432110 Q ss_pred ccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccc Q lcl|NC_020078. 155 TAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGE 234 (339) Q Consensus 155 ~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~ 234 (339) + ....-+.-++.+..+...+.|.+ + ...+++|+|...+.||++-... ++.. . T Consensus 137 ------------~-------tg~~lq~a~a~~~~al~~f~Ee~-~---~~~V~FVnP~D~a~yl~~A~~~---~~~a--~ 188 (295) T protein:vir:99 137 ------------V-------KGVGLQKALSASWAKLATFNEFE-G---SPLVSFVSPLDVANYLGDTKVG---ADAS--N 188 (295) T ss_pred ------------e-------ehhhHHHHHHHhhhhhhhccccc-C---CceEEEEehHHHHHHHhccccc---cchh--h Confidence 0 00011223344444444444432 1 2359999999999999986542 1111 0 Q ss_pred eeecceeEEEeceE-EEEeccccccccccc--------------cccCCCccccccccccceEEEEEeccceeEEEEEee Q lcl|NC_020078. 235 TLNTKYMFAAFGVP-VITSNNAVFGKTITD--------------HLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIP 299 (339) Q Consensus 235 ~l~~G~v~~i~G~~-V~~Snnlp~~~~~~~--------------~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~ 299 (339) .+---.+.+++|++ |+.|+.+|.+..+.. ..++..++-. .+.+..+|+ ..-.. T Consensus 189 ~fG~~~L~nfLG~q~II~S~kv~~G~~~aT~~~Ni~~ay~~~~~g~l~~~f~~~---~D~tglIg~---------~h~~~ 256 (295) T protein:vir:99 189 VFGMTLLKNFLGMQNVIVMPSVPEGKIYSTAVENLVFASLNVKGGDLGGLFADF---TDETGLIAA---------ARNRQ 256 (295) T ss_pred hhhhhhhhhhhccceEEEcccCCCceEEEeeccceEEEEecCCchhhhhhhhhc---cCcccceEE---------Eeccc Confidence 11112344699997 999999997543321 1122222111 011112221 11111 Q ss_pred eeEEeeechhhhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 300 VTSKIFFDDLSKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 300 ~~~e~~~~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .+.--+..-++-|...| +=|+|++++..+.++ T Consensus 257 -----~~~~t~et~~~~~~~lf---pE~~dgiv~~tI~~~ 288 (295) T protein:vir:99 257 -----LSNLTYESVFFGANVLF---AEIPEGVVEATIEAA 288 (295) T ss_pred -----cceeeehhhhHhHHHhc---ccccceEEEEEEecC Confidence 11111222233333332 347788999888777 No 163 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=98.69 E-value=6.4e-09 Score=65.49 Aligned_cols=294 Identities=14% Similarity=0.076 Sum_probs=144.4 Q ss_pred Ccc--ccCc-----------ccC-CC--------cccCC-c-cCcccchhHHHHHHHHHHHHHHHHHHhhhccccccc-- Q lcl|NC_020078. 1 MSI--FDGQ-----------TPS-YD--------VTRPN-Q-RHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVR-- 54 (339) Q Consensus 1 ~~~--~~~~-----------~~~-~~--------~~r~~-~-~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r-- 54 (339) +++ -.|. ... .. +++.. . ...+|. . +.-+.|++++.+..+..++++.+.... T Consensus 300 ~al~~~~g~~~~a~e~a~~~~~~~~~~~~~~~~a~~~~~~~~~~~~Gg-~-~vp~~~~~~ii~~l~~~svv~~l~~~~~~ 377 (645) T protein:vir:93 300 KSLAAAKGVRSEALEVARRQYPDDSRLHHVLKSAVGAGTTTDPQWAGS-L-SEYQEYAQDFIDYLRPQTIIGRFGQGGIP 377 (645) T ss_pred HHHHhcccchhHHHHHHHhhcccchhhhhhhhhhhhccccccccccCC-c-cCchhhHHHHHHhhhhhhhHHhhcccccc Confidence 111 0110 000 00 00000 0 000111 1 233888899998888888888775432 Q ss_pred cccc-cceEEEecc-ccceeeeccCCCCCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHH Q lcl|NC_020078. 55 SVRG-TSTISNRGI-SKAKLQKIAPGTTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIAN 132 (339) Q Consensus 55 ~i~~-G~tv~i~~i-G~~t~~~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~ 132 (339) ...+ -..++||+. +.+++.....|+.++.. .+..+++++..= .++.-..|.+-=-.++.+|+.+.+.++.+++|++ T Consensus 378 ~~~~~~~~~~ip~~t~~~~a~wv~Eg~~~~~s-~~~f~~v~l~~~-kla~~~~iS~ell~ds~~~~~~~i~~~l~~aia~ 455 (645) T protein:vir:93 378 ALRQVPFNIRVHAQVSGGAAGWVGEGKTKPLT-KFDFESITFSHA-KVSAIAVLTEELIRFSSPAADALVRNALAEAVVA 455 (645) T ss_pred ccccccCceeeeeeecCcceEEeccCcccccc-ccceeEEEEeeE-EEEEeehhHHHHHhhchHHHHHHHHHHHHHHHHH Confidence 1211 124667764 56777777778887754 456666665542 2222222322111146788999999999999999 Q ss_pred HHHHHHHHHHHhhcccccccccccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHH Q lcl|NC_020078. 133 MYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPA 212 (339) Q Consensus 133 ~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~ 212 (339) ..|+.+|.-- .....+....|...+.. ...... ..+.-+..+...|..+++.. .+-..|++|. T Consensus 456 ~~d~a~l~g~--------g~~~~~~~p~gi~~~~~----~~~~~~---~~~~d~~~~~~~~~~a~~~~--~~a~~vmn~~ 518 (645) T protein:vir:93 456 RLDTDFVDPK--------KAAVADVSPASITHDVK----GTASSG---NPDADAEAAFGQFVAANLQP--TGAVWLMSST 518 (645) T ss_pred HHHHHhhcCC--------CcccCCccccceecccc----cccccc---chHHHHHHHHHHHHhcCCCc--cccEEEEcHH Confidence 9998875210 00000000111111110 010111 11223455666676666642 2335588999 Q ss_pred HHHHHhcccchhhhcccccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEecccee Q lcl|NC_020078. 213 AFTALMQAEHITNGEYVTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKAL 292 (339) Q Consensus 213 ~~~~Ll~~~~~~n~d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~ 292 (339) .+..|.+-.. .+..+.-.+ ... .| ++++|.+|+.|+++|... .+ ++++.. ++.....+ T Consensus 519 ~~~~L~~lkd-~~G~~~~~~-~~~-~~--~tL~G~PV~~s~~vp~~~-----~~----------gd~s~~--~ig~~~~v 576 (645) T protein:vir:93 519 NALALSMRKN-ALGQKEYPD-MTL-LG--GSFQGLPVIVSQYVGDQL-----VL----------VNAPDI--YLADDGGV 576 (645) T ss_pred HHHHHHhccc-cCCceeecC-CCC-CC--ceeeceeeEEeccCCcce-----eE----------eccccE--EEEEecce Confidence 9999876422 111111111 011 12 478999999999998421 11 122221 12222222 Q ss_pred EEEEEeeeeEEeeechh--------------hh--HHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 293 LAGSTIPVTSKIFFDDL--------------SK--LWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 293 ~~~~~~~~~~e~~~~~~--------------~~--~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) -.....+.+.++...+. .+ .-.|++.+.++-+++||+++++| +++ T Consensus 577 ~i~~s~~a~~~~~~~~~~~~~~~~~~~~v~lf~~d~vaira~~r~d~~~~~p~a~~~l--t~~ 637 (645) T protein:vir:93 577 AVDMSREASLEMQSEPTGDSTTPSPVELVSMFQTGSVAIRAERWINWRRRRTAAVAVI--TGV 637 (645) T ss_pred EEEeecceeEEEeecccccccccccccchhHhhcCceEEEEEEEEcceeeCccceEEE--ecc Confidence 22222222222221110 11 12467778899999999998876 455 No 164 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=98.52 E-value=4.8e-09 Score=66.17 Aligned_cols=288 Identities=10% Similarity=-0.045 Sum_probs=148.5 Q ss_pred CccccCc---cc----CCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecccc-cee Q lcl|NC_020078. 1 MSIFDGQ---TP----SYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISK-AKL 72 (339) Q Consensus 1 ~~~~~~~---~~----~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~-~t~ 72 (339) +.-..|. .. .+|-.+.+-. ..| ..+..+.++.++.+.....+.+++++++.++. |+ .+|++-.. .++ T Consensus 56 ~~~~~~~~~lt~~e~~~~~~~~~~~~-~~g--g~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~~-~~i~~~~~~~~a 130 (381) T protein:vir:10 56 SSLPKSAQSLSANQRSFFMDINKNVN-YKE--EKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LR-LKFLKSETSGVA 130 (381) T ss_pred HHhccCcccccHHHHHHHHHHhcccC-CCC--ceecCHHHHHHHHHHHHhhccceeheeeEecC-cc-eEEEEecCCcce Confidence 0000000 00 0111111111 111 12445999999999999999999999877754 44 46666543 444 Q ss_pred eeccCCCCCCCCCCCCccceEEEEeehhhhhhhHHHHHHH-hcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Q lcl|NC_020078. 73 QKIAPGTTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEF-QTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDS 151 (339) Q Consensus 73 ~~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~-q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~ 151 (339) .-..-+.++..+..+...++++. ..++..+.-=--+=. .+.+|+.+.+.++.+.++++..|+.++ .+.....| T Consensus 131 ~w~~e~~~~~~~~~~~f~~i~l~--~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i----~G~G~~qP 204 (381) T protein:vir:10 131 VWGKIYGEIKGQLDAAFSEETAI--QNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFL----KGTGKDQP 204 (381) T ss_pred eeecccccccccccccceeeeec--ceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeE----eccCCCCc Confidence 44343445544334444444444 444443322111111 256799999999999999999998764 22211111 Q ss_pred cccccccccCcccccccc--------ccCccccccHHHHHHHHHHHHHHHHhc--CCC-CCcCCeEEEECHHHHHHHhcc Q lcl|NC_020078. 152 PYGTAAQMPGHSGGNVVT--------LAGANDYKDPAKLYAAIASLVEKFLEK--DVR-PNEEDMILVLPPAAFTALMQA 220 (339) Q Consensus 152 ~~~~~~~~~g~~~~~~~~--------~~~~~~~~~~~~l~~ai~~a~~~L~e~--dV~-~p~~~R~~vv~P~~~~~Ll~~ 220 (339) . +.......+.... ..+.....++..+++.+.++...|... +.+ .+..+-+.+++|..+..|+.- T Consensus 205 ~----Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~ 280 (381) T protein:vir:10 205 I----GLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQ 280 (381) T ss_pred e----eeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccc Confidence 1 1111101000000 011111224455566666665555321 111 133455678999998887654 Q ss_pred cchhhhcccccccceeecceeEEE--eceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEe Q lcl|NC_020078. 221 EHITNGEYVTSAGETLNTKYMFAA--FGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTI 298 (339) Q Consensus 221 ~~~~n~d~~~~~~~~l~~G~v~~i--~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~ 298 (339) ..+.+ . +|.-... .|.+|++|+.+|.+. . .-++|+. ...+... T Consensus 281 ~~~~~-----~------~G~~v~~l~~g~~vv~s~~~p~~~-----i---------ifgDfs~----------Y~i~~r~ 325 (381) T protein:vir:10 281 YTHLN-----A------NGVYVTALPFNLNVIESTVQEAGK-----V---------LTYVKGL----------YDGYLAG 325 (381) T ss_pred cccCC-----C------CCceeecCCCCceEEecCCCCcCc-----E---------EEEeccc----------EEEEEec Confidence 32221 1 2222222 366789999887421 1 1122222 1223334 Q ss_pred eeeEEeeechhhhHH---HHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 299 PVTSKIFFDDLSKLW---FIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 299 ~~~~e~~~~~~~~~d---~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .+..+.+.+ .+|.. .+++.+-++.++++|++.+++.++.+ T Consensus 326 ~~~i~~~~~-~~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~ 368 (381) T protein:vir:10 326 GINVQKFKE-TLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) T ss_pred ccEEEeech-hHhhcCCeEEEEEEEEcCEEecCceEEEEEEEec Confidence 445554443 33433 57888899999999999999888886 No 165 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=98.52 E-value=4.8e-09 Score=66.17 Aligned_cols=288 Identities=10% Similarity=-0.045 Sum_probs=148.5 Q ss_pred CccccCc---cc----CCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecccc-cee Q lcl|NC_020078. 1 MSIFDGQ---TP----SYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISK-AKL 72 (339) Q Consensus 1 ~~~~~~~---~~----~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~-~t~ 72 (339) +.-..|. .. .+|-.+.+-. ..| ..+..+.++.++.+.....+.+++++++.++. |+ .+|++-.. .++ T Consensus 56 ~~~~~~~~~lt~~e~~~~~~~~~~~~-~~g--g~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~~-~~i~~~~~~~~a 130 (381) T protein:vir:95 56 SSLPKSAQSLSANQRSFFMDINKNVN-YKE--EKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LR-LKFLKSETSGVA 130 (381) T ss_pred HHhccCcccccHHHHHHHHHHhcccC-CCC--ceecCHHHHHHHHHHHHhhccceeheeeEecC-cc-eEEEEecCCcce Confidence 0000000 00 0111111111 111 12445999999999999999999999877754 44 46666543 444 Q ss_pred eeccCCCCCCCCCCCCccceEEEEeehhhhhhhHHHHHHH-hcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Q lcl|NC_020078. 73 QKIAPGTTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEF-QTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDS 151 (339) Q Consensus 73 ~~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~-q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~ 151 (339) .-..-+.++..+..+...++++. ..++..+.-=--+=. .+.+|+.+.+.++.+.++++..|+.++ .+.....| T Consensus 131 ~w~~e~~~~~~~~~~~f~~i~l~--~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i----~G~G~~qP 204 (381) T protein:vir:95 131 VWGKIYGEIKGQLDAAFSEETAI--QNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFL----KGTGKDQP 204 (381) T ss_pred eeecccccccccccccceeeeec--ceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeE----eccCCCCc Confidence 44343445544334444444444 444443322111111 256799999999999999999998764 22211111 Q ss_pred cccccccccCcccccccc--------ccCccccccHHHHHHHHHHHHHHHHhc--CCC-CCcCCeEEEECHHHHHHHhcc Q lcl|NC_020078. 152 PYGTAAQMPGHSGGNVVT--------LAGANDYKDPAKLYAAIASLVEKFLEK--DVR-PNEEDMILVLPPAAFTALMQA 220 (339) Q Consensus 152 ~~~~~~~~~g~~~~~~~~--------~~~~~~~~~~~~l~~ai~~a~~~L~e~--dV~-~p~~~R~~vv~P~~~~~Ll~~ 220 (339) . +.......+.... ..+.....++..+++.+.++...|... +.+ .+..+-+.+++|..+..|+.- T Consensus 205 ~----Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~ 280 (381) T protein:vir:95 205 I----GLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQ 280 (381) T ss_pred e----eeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccc Confidence 1 1111101000000 011111224455566666665555321 111 133455678999998887654 Q ss_pred cchhhhcccccccceeecceeEEE--eceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEe Q lcl|NC_020078. 221 EHITNGEYVTSAGETLNTKYMFAA--FGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTI 298 (339) Q Consensus 221 ~~~~n~d~~~~~~~~l~~G~v~~i--~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~ 298 (339) ..+.+ . +|.-... .|.+|++|+.+|.+. . .-++|+. ...+... T Consensus 281 ~~~~~-----~------~G~~v~~l~~g~~vv~s~~~p~~~-----i---------ifgDfs~----------Y~i~~r~ 325 (381) T protein:vir:95 281 YTHLN-----A------NGVYVTALPFNLNVIESTVQEAGK-----V---------LTYVKGL----------YDGYLAG 325 (381) T ss_pred cccCC-----C------CCceeecCCCCceEEecCCCCcCc-----E---------EEEeccc----------EEEEEec Confidence 32221 1 2222222 366789999887421 1 1122222 1223334 Q ss_pred eeeEEeeechhhhHH---HHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 299 PVTSKIFFDDLSKLW---FIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 299 ~~~~e~~~~~~~~~d---~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .+..+.+.+ .+|.. .+++.+-++.++++|++.+++.++.+ T Consensus 326 ~~~i~~~~~-~~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~ 368 (381) T protein:vir:95 326 GINVQKFKE-TLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) T ss_pred ccEEEeech-hHhhcCCeEEEEEEEEcCEEecCceEEEEEEEec Confidence 445554443 33433 57888899999999999999888886 No 166 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=98.49 E-value=2.1e-08 Score=62.63 Aligned_cols=295 Identities=9% Similarity=-0.046 Sum_probs=148.3 Q ss_pred CccccCcccCCCccc-----CCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccc-cceeee Q lcl|NC_020078. 1 MSIFDGQTPSYDVTR-----PNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGIS-KAKLQK 74 (339) Q Consensus 1 ~~~~~~~~~~~~~~r-----~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG-~~t~~~ 74 (339) ...-.+.=+-.+=.| .-...+..+--.+.-+.+..++.+...+.+.+++++++.++. | .++|++-. ..++.- T Consensus 58 ~~~~~~~~~lt~ee~~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~-~~~i~~~~~~~~a~w 135 (377) T protein:vir:96 58 FDLRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS-L-RLKALTAETSGTAVW 135 (377) T ss_pred HHhccCCcccCHHHHHHHHHHHhcCCCCCCceecCHHHHHHHHHHHHhhhhhhhhceeEecC-C-ceEEEEecCCcceeE Confidence 000000000000000 000011111112344889999999999999999999987764 3 35566543 345554 Q ss_pred ccCCCCCCCCCCCCccceEEEEeehhhhhh-hHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Q lcl|NC_020078. 75 IAPGTTPPPSTEPHTSKIFLKIDTVIIARN-AEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPY 153 (339) Q Consensus 75 ~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~-~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~ 153 (339) ...+.++.....+...+++|. ..++..+ .|..-=--.+.+|+-+.+.++.+.++++..|+.++. +.....|.. T Consensus 136 v~e~~~~~~~~~~~f~~i~l~--~~kl~~~~~is~~ll~ds~~~le~~i~~~l~~~~~~~~~~a~i~----G~G~~~P~G 209 (377) T protein:vir:96 136 GDIFGEIKGQLKQAFKEQDFS--QFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVK----GNGLLQPVG 209 (377) T ss_pred eecccccccccCccceeEeee--eeeEEeechhhHHHhhcchhhHHHHHHHHHHHHHHHHHhhceEe----ccCCCccee Confidence 444555554334455554444 4444443 232211113678899999999999999999987751 111111100 Q ss_pred -------cccccccCcccccccc---ccCccccccHHHHHHHHHHHHHHHHhcCCCC---CcCCeEEEECHHHHHHHhcc Q lcl|NC_020078. 154 -------GTAAQMPGHSGGNVVT---LAGANDYKDPAKLYAAIASLVEKFLEKDVRP---NEEDMILVLPPAAFTALMQA 220 (339) Q Consensus 154 -------~~~~~~~g~~~~~~~~---~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~---p~~~R~~vv~P~~~~~Ll~~ 220 (339) .......+........ ..+.....+++.+++.+..+...+..+.-.. ....-+.+++|..|..++.. T Consensus 210 il~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~ 289 (377) T protein:vir:96 210 LLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAK 289 (377) T ss_pred eeeccccccccccccccccceeeccccccccccCChhHHHHHHHHHHHhhccccccccccccCceEEEEchhhHHhcccc Confidence 0000001100000000 0011122345666666666655554322100 11223467999888766432 Q ss_pred cchhhhcccccccceeecceeEEEec--eEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEe Q lcl|NC_020078. 221 EHITNGEYVTSAGETLNTKYMFAAFG--VPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTI 298 (339) Q Consensus 221 ~~~~n~d~~~~~~~~l~~G~v~~i~G--~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~ 298 (339) ..+.+ . +|.-..++| .+|++|+.+|.+. .+ -++|+. ...+... T Consensus 290 ~~~~~-----~------~G~~~~~l~~p~~v~~s~~~p~~~-----i~---------fgdf~~----------Y~i~~r~ 334 (377) T protein:vir:96 290 FTSRN-----Q------FGEYVTVLPHGITILESLAVETGK-----AI---------AFVANR----------YDAFMAT 334 (377) T ss_pred ccccC-----C------CCCceeccCCCceEEecCCCCccc-----EE---------EEEcCc----------EEEEEec Confidence 22211 1 233345554 4577888887321 11 111111 2333444 Q ss_pred eeeEEeeechhhhH---HHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 299 PVTSKIFFDDLSKL---WFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 299 ~~~~e~~~~~~~~~---d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .+..+.+++ .+|. ..+++.+-++.++++|++.++|.++.- T Consensus 335 ~~~i~~~~~-~~~~~d~~~f~~~~r~dG~~~d~~a~~vl~l~~~ 377 (377) T protein:vir:96 335 ASTIEEYDQ-TFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred ccEEEeehh-hhhhcCCeEEEEEEEEcCEEecCCcEEEEEEecC Confidence 555555543 2232 347888899999999999999999988 No 167 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=98.46 E-value=1.8e-08 Score=63.04 Aligned_cols=304 Identities=10% Similarity=0.025 Sum_probs=144.9 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHH--HHHHHHHHHHHHHHhhhccccccccccccceEEEecccc--ceeeecc Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVT--EQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISK--AKLQKIA 76 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~i--e~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~--~t~~~~~ 76 (339) |-+.|--. --++.|.-+.-...|...-|+ ++++ ++.+..++.|.++++.++.+..++.+..|+.+|- .....+. T Consensus 1 ~~~~~~~~-~~~~~~~~k~~t~~d~~Gg~l~P~~~~-~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~~~~~~~g~~ 78 (315) T protein:vir:41 1 MLTIEDIR-GGKPFEIVPKIDVPDLGRGVLSVDRFG-EFVKAVRDSAVIIPEARIDNALKSYEKDISRLSLVLDVGPGRD 78 (315) T ss_pred Ccccchhh-cCChhhhhhhcCCcCCCCceechHHHH-HHHHHHHhhhhhhhhceeeeccccccccccccccCcccccccc Confidence 21111000 001111111111222222232 5655 5667788889999998876555556666777653 2222222 Q ss_pred CCCCCCCC--CCCCccceEEEEeehhhhhhhH--HHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Q lcl|NC_020078. 77 PGTTPPPS--TEPHTSKIFLKIDTVIIARNAE--PMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSP 152 (339) Q Consensus 77 ~g~~i~~~--~~~~~~~~~l~ID~~~y~~~~v--dd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~ 152 (339) .+++-... ..+...+..+..-+ +++...| +-+|++.-..|+.+.++.+.+.++++..+.+++ .+-..+..+ T Consensus 79 ~~~~~~~~~~~~~~f~~~~l~~~~-l~~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~----nGdg~s~~p 153 (315) T protein:vir:41 79 ETGQKLAPPESTAEVKTNTLYMRE-MVTKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYL----HGDTSSSDP 153 (315) T ss_pred cccCcCCCCCCccccceeeeceee-eeeeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhh----ccCCcCcCc Confidence 22222111 12344444444432 2222233 455655445799999999999999998887654 221110000 Q ss_pred ccccccccCccc---cccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhccc Q lcl|NC_020078. 153 YGTAAQMPGHSG---GNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYV 229 (339) Q Consensus 153 ~~~~~~~~g~~~---~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~ 229 (339) . .....|.-. ..+.......++ .....+.+.++...|..+-- ....+-..++++..+..|.+ +.+.+-. T Consensus 154 ~--~~~~~G~l~~a~~~~~~~~~~~~a--~~~~~d~l~~l~~sl~~~yr-~~~~~~~~imn~~t~~~~rk---lk~~~g~ 225 (315) T protein:vir:41 154 L--LRMSDGWLKLASEKLTESDVDPEA--EDWPMNLFDTMIESLPTPYR-NNLPNMKFYVTWDIYRAYRD---ALKGRET 225 (315) T ss_pred c--ccccccceeccccccccccccccc--ccccHHHHHHHHHhcChHHh-hcCCceEEEEcHHHHHHHHH---HhccCCC Confidence 0 001112111 000000000011 11112444455555533210 00112245899999987754 2222212 Q ss_pred ccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechh Q lcl|NC_020078. 230 TSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDL 309 (339) Q Consensus 230 ~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~ 309 (339) ......+..|.-..++|.+|+.++++|........ .++.+++-+..+...++..+.+|+.. T Consensus 226 ~lw~~~~~~g~~~tl~G~PV~~~~~m~~~~~~~~~-------------------ilf~d~~nl~~~~~~~i~i~~~~~a~ 286 (315) T protein:vir:41 226 GLGDQALTGANSILYDGRPVQYVPALEALNDGKSR-------------------ALFVVPTQLVYGFWRNIKVVPDYDAE 286 (315) T ss_pred ccccchhhcCCCceecccceEecccccccCCCCcc-------------------EEEecccceEEEeccccEEEeeecCC Confidence 22233456677788999999999999754321111 13333443444555666777777755 Q ss_pred hhHHHHHHHHHhCCccccccceEEEEecC Q lcl|NC_020078. 310 SKLWFIDSWLAFGVTINRTEYAGVIKLPA 338 (339) Q Consensus 310 ~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~ 338 (339) .....+...+-.|.++..++++++-.+.. T Consensus 287 ~~~~~~~~~~r~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 287 MRLTKYVASLRTDNHYEDEEGAVSATITV 315 (315) T ss_pred CCceEEEEEEEeceeEEeccceeEeeeeC Confidence 43344444455677777677755544455 No 168 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=98.39 E-value=1.7e-08 Score=63.19 Aligned_cols=286 Identities=12% Similarity=0.006 Sum_probs=143.7 Q ss_pred CccccCcccC-------CCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccccceee Q lcl|NC_020078. 1 MSIFDGQTPS-------YDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISKAKLQ 73 (339) Q Consensus 1 ~~~~~~~~~~-------~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~~t~~ 73 (339) ...-.|.-.- ++--+.+-. ..|. -+..+.|..++.+.....|.+++++++.+. +|+ .+|++....... T Consensus 56 ~~~~~~~~~l~~~e~~~~~~~~~~t~-~~Gg--~lvP~~~~~~I~~~l~~~spir~~a~v~~~-~~~-~~i~~~~~~~~a 130 (381) T protein:vir:10 56 SSLPKSAQTLSANQRNFFMDINKSVG-YKEE--KLLPEETIDRIFEDLTTNHPLLADLGIKNA-GLR-LKFLKSETSGVA 130 (381) T ss_pred HHhcccccccCHHHHHHHHHHhhcCC-CCCc--eecCHHHHHHHHHHHHhhcceeeeeeeEec-Ccc-eEEEeecCCcce Confidence 0000010000 000011111 1111 144599999999999999999999988776 344 456555433323 Q ss_pred ec-cCCCCCCCCCCCCccceEEEEeehhhhhhh-H--HHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_020078. 74 KI-APGTTPPPSTEPHTSKIFLKIDTVIIARNA-E--PMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIAS 149 (339) Q Consensus 74 ~~-~~g~~i~~~~~~~~~~~~l~ID~~~y~~~~-v--dd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~ 149 (339) .. .-+.++.....+...+ +.+...++..+. | +-+|+ +.+|+-+.+..+.+.++++..|+.++ .+.... T Consensus 131 ~W~~e~~~~~~~~~~~f~~--i~l~~~kl~a~i~is~elL~D--s~~~le~~i~~~la~~~a~~~~~afi----~GdG~~ 202 (381) T protein:vir:10 131 VWGKIYGEIKGQLDAAFSE--ETAIQNKLTAFVVLPKDLNDF--GPAWIERFVRVQIEEAFAVALETAFL----KGTGKD 202 (381) T ss_pred EEeecccccccccCcccee--EeecceeEEeeccccHHHHhc--cHHHHHHHHHHHHHHHHHHHhhceeE----ecccCC Confidence 33 2223343332334444 444444544332 2 22332 46789999999999999999998764 222111 Q ss_pred cccccccccccCcccccccccc--------CccccccHHHHHHHHHHHHHHHHh----cCCCCCcCCeEEEECHHHHHHH Q lcl|NC_020078. 150 DSPYGTAAQMPGHSGGNVVTLA--------GANDYKDPAKLYAAIASLVEKFLE----KDVRPNEEDMILVLPPAAFTAL 217 (339) Q Consensus 150 ~~~~~~~~~~~g~~~~~~~~~~--------~~~~~~~~~~l~~ai~~a~~~L~e----~dV~~p~~~R~~vv~P~~~~~L 217 (339) .|. +..+....+...... +.-...+...+++.+..+...+.- +.. .+..+.+++++|..+..| T Consensus 203 qP~----Gil~~~~~~~~~~~g~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~-~~~~~~~~vmn~~t~~~l 277 (381) T protein:vir:10 203 QPI----GLNRQVQKGVSVTDGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSV-AVKGNVTMVVNPSDAFEV 277 (381) T ss_pred Cce----eeeecCCccccccccccccccccccccccchhhHHHHHHHHHHhhhhhhccccc-cccCceEEEEchhhHHhh Confidence 111 111111111000000 001112334444444433333321 111 234566789999999888 Q ss_pred hcccchhhhcccccccceeecceeEE-EeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEE Q lcl|NC_020078. 218 MQAEHITNGEYVTSAGETLNTKYMFA-AFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGS 296 (339) Q Consensus 218 l~~~~~~n~d~~~~~~~~l~~G~v~~-i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~ 296 (339) +....+.+. ++. + +.. -.|.+|++|+++|... .+ -++|+. ...+. T Consensus 278 ~~~~~~~~~-----~G~-~----v~~lp~g~~vv~~~~~p~~~-----i~---------fGDfs~----------Y~i~~ 323 (381) T protein:vir:10 278 QAQYTHLNA-----NGV-Y----VTALPFNLNVIESTVQEAGK-----VL---------TYVKGL----------YDGYL 323 (381) T ss_pred ccccccCCC-----CCc-e----eecCCCCceeEEcCCCCcCc-----EE---------EEEccc----------EEEEE Confidence 765433221 111 1 111 1478899999887421 11 122222 12233 Q ss_pred EeeeeEEeeechhhhHH---HHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 297 TIPVTSKIFFDDLSKLW---FIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 297 ~~~~~~e~~~~~~~~~d---~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) ...+..+...+ .+|.. .+++++-++.++++|++.+++.++.. T Consensus 324 r~~~~i~~~~~-~~~~~d~~~f~a~~r~dG~~~~~~A~~v~~l~~~ 368 (381) T protein:vir:10 324 AGGINVQKFKE-TLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) T ss_pred ecccEEEeech-hhhhcCceEEEEEEEEcCEEecCCcEEEEEEeec Confidence 34445554443 33432 57888899999999999999888755 No 169 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=98.38 E-value=1.8e-08 Score=63.09 Aligned_cols=288 Identities=8% Similarity=-0.078 Sum_probs=137.0 Q ss_pred Cc----cccCc--ccC-----CCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecccc Q lcl|NC_020078. 1 MS----IFDGQ--TPS-----YDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISK 69 (339) Q Consensus 1 ~~----~~~~~--~~~-----~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~ 69 (339) .. --.|. +.. ++--+.+ ....| -.+..+.|+.++.+...+.+.+++++++.++ +|+ .+|++... T Consensus 59 ~~~~~~~~~g~~~lt~~e~~~~~~~~~~-~~~~g--g~lvP~~~~~~I~~~l~~~s~l~~~~~v~~~-~~~-~~i~~~~~ 133 (383) T protein:vir:78 59 ADAYISASRTDKNITNEEIKFFNDINKE-VGYKE--ETLLPQTVVDEIFEDLTTEHPFLASIGMRTT-GLR-TKFLKSET 133 (383) T ss_pred HHHHHHhcCChhhhhHHHHHHHHHHhcc-CCCCC--ccccCHHHHHHHHHHHHhhccceeeeeeEec-CCc-eEEEEEcC Confidence 00 00000 000 0000010 01111 1244599999999999999999999987775 455 47877755 Q ss_pred ceeeec-cCCCCCCCCCCCCccceEEEEeehhhhh-hhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_020078. 70 AKLQKI-APGTTPPPSTEPHTSKIFLKIDTVIIAR-NAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAI 147 (339) Q Consensus 70 ~t~~~~-~~g~~i~~~~~~~~~~~~l~ID~~~y~~-~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~ 147 (339) .....+ ..+.++.....++..+++|.. .++.. +.|..-=--.+.+|+.+.+.++.+.++++..|+.++ .+.. T Consensus 134 ~~~a~w~~e~~~~~~~~~~~f~~i~l~~--~kl~~~i~is~ell~Ds~~~ie~~i~~~l~~~~a~~~~~a~i----~G~G 207 (383) T protein:vir:78 134 SGVAVWGKIFGEIKGQLDATFSDEESIQ--NKLTAFVVVPKDLEKFGPAWVKRFVVTQIEEAFAVALESAYI----VGDG 207 (383) T ss_pred CcceEEeecccccccccCcceeeEeecc--eeeEeeccchHHHhhccHHHHHHHHHHHHHHHHHHHHhhheE----eccC Confidence 443333 333444433344455555554 33333 333211111256789999999999999999998874 1111 Q ss_pred cccccccccccccCccc-cccccc-------cCccccccHHHHHHHHHHHHH---HHHhcCCCCCcCCeEEEECHHHHHH Q lcl|NC_020078. 148 ASDSPYGTAAQMPGHSG-GNVVTL-------AGANDYKDPAKLYAAIASLVE---KFLEKDVRPNEEDMILVLPPAAFTA 216 (339) Q Consensus 148 ~~~~~~~~~~~~~g~~~-~~~~~~-------~~~~~~~~~~~l~~ai~~a~~---~L~e~dV~~p~~~R~~vv~P~~~~~ 216 (339) ...|. +..+.... ..+... .+.....+...+++.+..+.. .+....-.........+++|..|+. T Consensus 208 ~~qP~----Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~ 283 (383) T protein:vir:78 208 NDKPI----GLNRKVGKGSTVVDGVYAEKAATGTLTFANPKTTVNELTDVYKYHSVKENGHPLNVAGKVTLLVNPTDAWD 283 (383) T ss_pred CCCce----eeeeccCCcccccccccccccccchhhhhhhHHHHHHHHHHHhccchhcccchhhhcCceEEEEcCcchhh Confidence 11111 11110000 000000 001111122233333222211 1111100001112234677765544 Q ss_pred HhcccchhhhcccccccceeecceeEEEece--EEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEE Q lcl|NC_020078. 217 LMQAEHITNGEYVTSAGETLNTKYMFAAFGV--PVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLA 294 (339) Q Consensus 217 Ll~~~~~~n~d~~~~~~~~l~~G~v~~i~G~--~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~ 294 (339) ++...... . .+|.-..++|+ +|++|+++|... .+ -++|+. ... T Consensus 284 ~~~~~~~~-----~------~~G~~~t~l~~~~~iv~s~~~p~~~-----ii---------fgdfs~----------Y~i 328 (383) T protein:vir:78 284 VKKQYTSL-----N------ANGVYVTALPFNLNIIESLFVPEKK-----AI---------SYVAER----------YDA 328 (383) T ss_pred hccchhcc-----C------CCCceeeecCCCceEEecCCCCccc-----EE---------Eeeccc----------eEE Confidence 33211111 0 12333455544 478888887431 11 011111 223 Q ss_pred EEEeeeeEEeeechhhhH---HHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 295 GSTIPVTSKIFFDDLSKL---WFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 295 ~~~~~~~~e~~~~~~~~~---d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) +...++..+.++ +.+|. ..+++.+-++.++++|++.+++.++-+ T Consensus 329 ~~r~~~~i~~~~-~~~f~~d~~~f~~~~r~dG~~~~~~A~~vl~~~~~ 375 (383) T protein:vir:78 329 LIGGPLDIGTYD-QTLAIEDLNLYAAKQFAYGKAKDDKAAAVWTLNIN 375 (383) T ss_pred EecccceEEecc-hhhhhcCceEEEEEEEEcCEEecCCeEEEEEEEec Confidence 344455555543 33443 357888899999999999999888877 No 170 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=98.32 E-value=1.2e-07 Score=58.43 Aligned_cols=299 Identities=10% Similarity=0.015 Sum_probs=154.1 Q ss_pred CccccCcc---cCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccccc--eeeec Q lcl|NC_020078. 1 MSIFDGQT---PSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISKA--KLQKI 75 (339) Q Consensus 1 ~~~~~~~~---~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~~--t~~~~ 75 (339) |-.+.-.| ++.|++..+ ++. |--++++ ++.+..+..+.++++.++.+-.+..+..|+++|.. ..... T Consensus 1 ~~~~~~~~~~~k~it~~d~~--gG~-----L~P~~~~-~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g~~~~~~~ 72 (314) T protein:vir:41 1 MDFLNKPFQITPKIDVPDLG--KGI-----LAVQRFG-EFVREVRENSAIIKDARVLNALKSYEVDISRISLGVELEPGR 72 (314) T ss_pred CchhhhHHHhhcccccccCC--Cce-----eChHHHH-HHHHHHHhccchhhheeeecccCccceeecccccCccccccc Confidence 44433322 344555442 222 2236775 67788999999999998754334456788888742 12222 Q ss_pred cCCCCCCC-C-CCCCccceEEEEeehhhhhhhH--HHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Q lcl|NC_020078. 76 APGTTPPP-S-TEPHTSKIFLKIDTVIIARNAE--PMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDS 151 (339) Q Consensus 76 ~~g~~i~~-~-~~~~~~~~~l~ID~~~y~~~~v--dd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~ 151 (339) ..+++.+. + .++......|..-++.. .+.| +-++++.-..|+.+.++.+.+.++++....+++ .+-....+ T Consensus 73 ~~~~~~~~~~~~~~tf~~~~l~~~kl~~-~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~----nGdg~~~s 147 (314) T protein:vir:41 73 NTSGTKVAPTADEVTVSTNTLEMKELVT-KVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFL----HADSSLTT 147 (314) T ss_pred ccccCCccCCcccccccceeeeeEEEEE-eecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhh----ccccCCcC Confidence 22222211 1 23444455555433222 2233 334444324589999999999999998887654 22111000 Q ss_pred cccccccccCccc---cccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcc Q lcl|NC_020078. 152 PYGTAAQMPGHSG---GNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEY 228 (339) Q Consensus 152 ~~~~~~~~~g~~~---~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~ 228 (339) .........|.-. +.++...+. + .....+.+.++...|..+.-.-+ ..-..+++++.+..+.+ +.+..- T Consensus 148 ~~~~~~~p~G~l~~a~~~~~~~~~~-~---~~~~~~~~~~l~~sl~~~yr~~~-~~~~~~m~~~t~~~~r~---~l~~~~ 219 (314) T protein:vir:41 148 GRELYRINDGWMKLAGNQYTDAEPE-D---ENWPLNLFDGMMDELDTRYLQLK-PRMKFYVSNEIYNGYRK---QLLVRE 219 (314) T ss_pred cccchhcchhhhhhcccceeecCcc-c---cccHHHHHHHHHHhcCchhhcCC-CceEEEecHHHHHHHHH---HHhccC Confidence 0000001223211 111111111 1 12233445555555543211001 12245679999887754 111111 Q ss_pred cccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeech Q lcl|NC_020078. 229 VTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDD 308 (339) Q Consensus 229 ~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~ 308 (339) +......+..|.-..+.|++|+.++.+|..... +...++.+++-+..+-..++..+.+|+. T Consensus 220 ~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~-------------------~~~i~fgd~~nlv~~~~~~ir~~~~~~a 280 (314) T protein:vir:41 220 TGLGDSALIGATGLQYDGIPIQYVPALDALGDD-------------------KARALLTVPTNLVYGFWRNIRIEPKRDA 280 (314) T ss_pred CcccchhhhCCCCceecceeeEecccccccCCC-------------------CceEEEechhheEEEeeceeEEeecccC Confidence 222333455677778999999999998742211 1122445566665666677777777766 Q ss_pred hhhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 309 LSKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 309 ~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) +.-...+...+-+++.+..+++++...+--| T Consensus 281 ~~~~~~~~~~~r~d~~~~~~~aa~~~~~~~~ 311 (314) T protein:vir:41 281 AMRRTEYIASLRADCNYEDENAAVAAVIDMS 311 (314) T ss_pred cCCeEEEEEEEEeceEEEEcCcEEEEEeecc Confidence 5444444455566777877777666655555 No 171 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=98.30 E-value=4.2e-08 Score=61.00 Aligned_cols=291 Identities=11% Similarity=0.034 Sum_probs=137.5 Q ss_pred Ccc-----------ccCcccC-CCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccc Q lcl|NC_020078. 1 MSI-----------FDGQTPS-YDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGIS 68 (339) Q Consensus 1 ~~~-----------~~~~~~~-~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG 68 (339) +.. ..+.+.. -+..+... +.++-..+.-+.++.++.+.....+.+++++++..+.+ .++++.-+ T Consensus 123 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~g~~~~vP~~~~~~i~~~l~~~~~l~~~~~v~~~~g--~~~~~~~~ 198 (466) T protein:vir:80 123 MPYEQRAALIARSEVKEFLAQVRTLAQQKR--AVSGAELTIPDVMLELLRDNMHRYSKLISKVRLRPLKG--TARQNIAG 198 (466) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHhhhhh--hhccccccccHHHHHHHHHhhhhhhhhhhheeeeecCc--eeEeeeec Confidence 000 0000000 01111111 11111123447888889888888888899888777653 34555555 Q ss_pred cc-eeeeccCCCCCCCCCCCCccceEEEEeehhhhhhhHHHHHHH-hcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_020078. 69 KA-KLQKIAPGTTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEF-QTDFDYQGEVAREQGQEIANMYDETFFIMAAKAA 146 (339) Q Consensus 69 ~~-t~~~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~-q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA 146 (339) .. .+.-...|.+++.. .+...++++.+- ++..+..=--+-. .+.+|+-+.+.++.+++++...|+.|+. +. T Consensus 199 ~~~~a~wv~E~~~~~~~-~~~f~~i~~~~~--k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~ail~----G~ 271 (466) T protein:vir:80 199 AIPEGVWTEAVANLNEL-SLSFSQIEVDGY--KVGGFIPIPNSTLEDSDLNLADEILDAIGQAIGFALDKAILY----GT 271 (466) T ss_pred CCcceeecccccccccc-cccccceeecce--eeeeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhheee----cc Confidence 43 33333445555432 344555555443 4444322111111 3557899999999999999999988752 11 Q ss_pred ccccccccccccccCccccccccccCcc----ccccHHHHH----------HHHHHHH---HHHHhcCCCCCcCCeEEEE Q lcl|NC_020078. 147 IASDSPYGTAAQMPGHSGGNVVTLAGAN----DYKDPAKLY----------AAIASLV---EKFLEKDVRPNEEDMILVL 209 (339) Q Consensus 147 ~~~~~~~~~~~~~~g~~~~~~~~~~~~~----~~~~~~~l~----------~ai~~a~---~~L~e~dV~~p~~~R~~vv 209 (339) ....|. +........++....+.. ...++..+. ..+.++. ..+..+. ....-+.++ T Consensus 272 G~~~P~----Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~w~~ 344 (466) T protein:vir:80 272 GTKMPV----GIVTRLAQTTQPPNWGTKAPAWTNLSTTNLLKIDPTGKSAEEFFSELVLKLSKARANY---SNGMKFWAM 344 (466) T ss_pred CCCCcc----eeeecccccccccccccccccccccchhhhhhhhhhccchhhHHHHHHHHHHhhhccc---cCCceeEEe Confidence 111110 011100000000000000 001111111 0111111 1111111 111223467 Q ss_pred CHHHHHHHhcccchhhhc--ccccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEe Q lcl|NC_020078. 210 PPAAFTALMQAEHITNGE--YVTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMF 287 (339) Q Consensus 210 ~P~~~~~Ll~~~~~~n~d--~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~ 287 (339) ++..+..|..-.-..+.. |... ..++ ..++|.+|++|+++|.+. ... ++ T Consensus 345 ~~~~~~~l~~~~~~~~~~g~~~~~----~~~~--~~i~G~pvv~s~~~~~~~-----~~~---------g~--------- 395 (466) T protein:vir:80 345 SSNTHAVLMSKAITFNSAGALVAS----LNNT--MPIVGGDIVILDFIPDND-----IIG---------GY--------- 395 (466) T ss_pred cchhHHHhhcccccccCCcccccc----CCCc--ccccccceeecCccCccc-----eee---------ec--------- Confidence 888888876543222211 1111 1122 248999999999998432 111 11 Q ss_pred ccceeEEEEEeeeeEEeeechhhhH--HHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 288 SPKALLAGSTIPVTSKIFFDDLSKL--WFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 288 h~~A~~~~~~~~~~~e~~~~~~~~~--d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .+....+.-++++++...+....- ..+++.+-++.++++|++.+.+.++.. T Consensus 396 -~~~y~i~~r~~~~i~~~~~~~f~~d~~~~r~~~r~dg~~~~~~afv~~~~~~~ 448 (466) T protein:vir:80 396 -GSLYLLAERADIKLAQSEHVRFIEDQTVFKGTARYDGKPVFGEGFVAVNIANA 448 (466) T ss_pred -cccEEEEeecceEEEechhhhhhcCcEEEEEEEEEccEEeccCceEEEEecCC Confidence 122222233344555443322112 236778889999999999999988777 No 172 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=98.27 E-value=1.6e-07 Score=57.78 Aligned_cols=296 Identities=11% Similarity=0.095 Sum_probs=142.9 Q ss_pred CccccCcccCCCcccCCccC--cccchhHHH--HHHHHHHHHHHHHHHhhhccccccccccccceEEEeccccceeeecc Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRH--GAGDPLADV--TEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISKAKLQKIA 76 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~--~~~~~~a~~--ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~~t~~~~~ 76 (339) || +-.|=. ++.+.-+.+ ..++...-| ...++.++....+..+.++++.++..++.. +-+|+.+|-..-...+ T Consensus 1 ~~--~k~~~~-~l~~~~~~~~~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~-~~~i~~~~~~~~~~~~ 76 (321) T protein:vir:31 1 MA--SRTINN-DLSRITEKNALTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGAK-KTRIPTLNIGERHRRP 76 (321) T ss_pred Cc--hHHHHH-HHHHHHHhccccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccCc-ceeeeeeccCCccccc Confidence 22 111111 122221111 112222212 277788888998899999999888776543 3456665432111111 Q ss_pred --CCCCCCCCCCCCccceEEEEeehhh-hhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Q lcl|NC_020078. 77 --PGTTPPPSTEPHTSKIFLKIDTVII-ARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPY 153 (339) Q Consensus 77 --~g~~i~~~~~~~~~~~~l~ID~~~y-~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~ 153 (339) .++.......+..++.++..-+... ..+.=+-+|++....|+.+.+.+..+.++++..++.++. +-..+.++. T Consensus 77 ~~e~~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~n----Gd~~~~~~~ 152 (321) T protein:vir:31 77 QDEGEWNENESDVSTGTIDISTEKATVAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAAN----GDEDAEDSF 152 (321) T ss_pred ccccccccccccceeeeeeeeeEEEEeehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheee----ccccCCCcc Confidence 1221211122333334443321111 112225567664467999999999999999998886541 111111100 Q ss_pred cccccccCccc---cccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccc Q lcl|NC_020078. 154 GTAAQMPGHSG---GNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVT 230 (339) Q Consensus 154 ~~~~~~~g~~~---~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~ 230 (339) . ....|.-. ........++...+ ++.+.++...|+++.-..+ +-.++++++.+..+++- +.+.+ +. T Consensus 153 ~--~~n~G~l~~a~~~~~~~~~~~~~~~----~d~l~~l~~~l~~~yr~~~--~~v~im~~~~~~~~~~~--l~~~~-~~ 221 (321) T protein:vir:31 153 E--NQNDGFITVAEGDVETIDAADDILD----NDLVIRTIAGLDSKYRARM--NPALIVSEDQLLSYHYT--LTDRD-TP 221 (321) T ss_pred c--ccchhhhhhhccccccccccccccC----HHHHHHHHHhccHhHhcCC--CeEEEechHHHHHHHHH--HhcCC-Cc Confidence 0 00112110 00011111122223 2455566666665432112 33568999987665431 22221 11 Q ss_pred cccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhh Q lcl|NC_020078. 231 SAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLS 310 (339) Q Consensus 231 ~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~ 310 (339) .....+.+|...+++|++|+.++++|.... ++-+.+-+...-..++..+.+++.+. T Consensus 222 ~~~~~l~~~~~~tl~G~pvv~~~~mP~~~i------------------------l~t~~~nl~~~~~~~~~~~~~~~~~~ 277 (321) T protein:vir:31 222 LGDNVIMGEADVNPFSFPIIGSGLWPDDKA------------------------MFTDPQNLIYALYRDLEIDVLTESDK 277 (321) T ss_pred cccchhhccccccccceeEEEcCCCCCCcE------------------------EEeccccEEEEEeeccEEEEeecCcc Confidence 222335566777899999999999985321 22333333334444555566555443 Q ss_pred hH---HHHHHHH--HhCCccccccceEEEE-ecCC Q lcl|NC_020078. 311 KL---WFIDSWL--AFGVTINRTEYAGVIK-LPAA 339 (339) Q Consensus 311 ~~---d~i~g~~--~~Ga~v~rPe~~v~i~-~~~a 339 (339) .. +.+..++ -++..+-++++++.++ ++-+ T Consensus 278 ~~~~~~~~~~~~~~~~~~~ve~~~a~a~~~~i~~~ 312 (321) T protein:vir:31 278 VSERDLHARYFMRGDDDFAIENTEAVVLAEGLGDP 312 (321) T ss_pred ccccceeeEeeeeeecceeEeccccEEEEecCCcc Confidence 22 2233222 2566677777777665 3333 No 173 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=98.26 E-value=1.2e-07 Score=58.44 Aligned_cols=287 Identities=12% Similarity=0.004 Sum_probs=139.1 Q ss_pred Ccc--ccCccc------C-CCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccccc- Q lcl|NC_020078. 1 MSI--FDGQTP------S-YDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISKA- 70 (339) Q Consensus 1 ~~~--~~~~~~------~-~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~~- 70 (339) ..+ ..|.=+ . +|-.+.+-. ..|. .+.-+.++.++.+..++.+.+++++++.++. |+ +.|++.... T Consensus 64 ~~~~~~r~~~~l~~ee~~~~~~~~~~t~-~~gG--~liP~~~~~~Ii~~l~~~s~i~~~~~v~~~~-~~-~~i~~~~~~~ 138 (395) T protein:vir:95 64 NGILAKRSQDPLTSEERKFFNDINYDVG-YTDE--KILPETVVERVFDDLQKDHPLLSKINFQNAG-IK-TRVIKADPAG 138 (395) T ss_pred HHHHhhcCccccchHHHHHHHHHhhccC-CCCc--eeccHHHHHHHHHHHHhhhhhhhhceeEecC-Cc-eEEEEecCCc Confidence 000 000000 0 011111111 1111 1344899999999999999999999877663 43 567776543 Q ss_pred eeeeccCCCCCCCCCCCCccceEEEEeehhhhhh-hHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_020078. 71 KLQKIAPGTTPPPSTEPHTSKIFLKIDTVIIARN-AEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIAS 149 (339) Q Consensus 71 t~~~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~-~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~ 149 (339) ++.-...+.++.....++.+++++.. .++..+ .|..-=--.+.+|+-+.+.++.+.++++..|+.++. +.... T Consensus 139 ~a~w~~e~~~~~~~~~~~f~~i~l~~--~kl~~~~~iS~ell~ds~~~ie~~i~~~la~~ia~~~~~a~i~----G~G~~ 212 (395) T protein:vir:95 139 QAVWGKVFGEIKGQLDAAFREENFTQ--YKLTCFVVLPDDLSTFGPAWIERFVRTQIQEAISVALESAIIN----GGGAA 212 (395) T ss_pred ceEEeecccccCccccccceeeeece--eeEEEeecccHHHHhcchhHHHHHHHHHHHHHHHHHHhhheee----ccCCC Confidence 33332233444433344555555543 333332 232211123568899999999999999999987741 11100 Q ss_pred --cccccccccccCccccc--cccccCcc--ccccHHHHHHHHHHHHHHHHh----cCCCCCcCCeEEEECHHHHHHHhc Q lcl|NC_020078. 150 --DSPYGTAAQMPGHSGGN--VVTLAGAN--DYKDPAKLYAAIASLVEKFLE----KDVRPNEEDMILVLPPAAFTALMQ 219 (339) Q Consensus 150 --~~~~~~~~~~~g~~~~~--~~~~~~~~--~~~~~~~l~~ai~~a~~~L~e----~dV~~p~~~R~~vv~P~~~~~Ll~ 219 (339) .|. +........+ ......+. ...+...+++.+.++...|.. +.. ........+++|..+..+.. T Consensus 213 ~~qP~----Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~-~~~~~~~~~mn~~t~~~~~g 287 (395) T protein:vir:95 213 KTQPV----GLMKDVNTNSGAVTDKASSGTLTFADADTTILELNDVLKNLSVDEKGKEL-KIDGKVALVVNPRDSWDVQA 287 (395) T ss_pred CcCce----eeeecccccccccccccccchhhhhhhHhhHHHHHHHHHhhccccccchh-hhcCceEEEEcchhhhhcCC Confidence 010 0100000000 00000010 111112233334333333311 111 01123355788887764432 Q ss_pred ccchhhhcccccccceeecceeEEEe--ceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEE Q lcl|NC_020078. 220 AEHITNGEYVTSAGETLNTKYMFAAF--GVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGST 297 (339) Q Consensus 220 ~~~~~n~d~~~~~~~~l~~G~v~~i~--G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~ 297 (339) ..- |.. .+|...+++ |.+|++|+.+|.... .-++|+. ...+.. T Consensus 288 ~~~-----~~~------~~G~~~~~lg~g~~v~~~~~~p~~~i--------------~fgdfs~----------y~i~~r 332 (395) T protein:vir:95 288 RYT-----YLT------ANGGFVTVLPYNVTIITSEFVPEGKL--------------VAFVTDR----------YNAVRG 332 (395) T ss_pred cce-----ecc------CCCcceeccCCcceEEEcCCCCCCcE--------------EEEeccc----------EEEEEe Confidence 211 111 134445565 556899999984321 1122322 112233 Q ss_pred eeeeEEeeechhhhH---HHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 298 IPVTSKIFFDDLSKL---WFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 298 ~~~~~e~~~~~~~~~---d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .++.++..++ .+|. ..+++.+-+|.++++|++.++|.++.+ T Consensus 333 ~~~~i~~~~~-~~~~~d~~~f~~~~r~dg~~~~~~A~~~l~i~~~ 376 (395) T protein:vir:95 333 GGLTVKKFDQ-TLALEDAVLFTAKTFAYGQPDDNKASAVYDLKVA 376 (395) T ss_pred cceEEEeccc-hhhhCCcEEEEEEEEECCEEeccccEEEEEeecc Confidence 3444444432 2222 336788889999999999999999877 No 174 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=98.25 E-value=9.8e-07 Score=53.52 Aligned_cols=316 Identities=12% Similarity=0.040 Sum_probs=157.6 Q ss_pred CccccCcccCCCcccCCcc--CcccchhHHHHHHHHHHHHHHHHHHhhhcccccccccc--ccceEEEeccccc-ee-ee Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQR--HGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVR--GTSTISNRGISKA-KL-QK 74 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~--~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~--~G~tv~i~~iG~~-t~-~~ 74 (339) |- .||----++. ..+..-.+....=|-.++|..-.+-.++.++-.++++- +|+|++|.+--.. .. .- T Consensus 1 ~~-------~~~a~~~~~~~s~~g~~~~~~~t~y~~~k~L~~Aa~~lv~~~fA~~~piPkn~GkTIk~r~y~pl~~~~~p 73 (401) T protein:vir:95 1 ML-------NYNAPTDGQKSSIDGANSDQMQTFFWLKKAIITARKEQYFMPLASVTNMPKHYGKTIKVYEYVPLLDDRNI 73 (401) T ss_pred CC-------ccCCCcccccccccccccceeeehhhHHHHHhhhhhhhhhhhcccccccccccCCeEEEEecccccccccc Confidence 21 1221111111 11111223444556667776666668888888888774 5999999765422 21 11 Q ss_pred ccCCCCCCCCCC---------CCcc------------------------ceEEEEeehhhhhhhHHHHHHHhcCcchHHH Q lcl|NC_020078. 75 IAPGTTPPPSTE---------PHTS------------------------KIFLKIDTVIIARNAEPMLDEFQTDFDYQGE 121 (339) Q Consensus 75 ~~~g~~i~~~~~---------~~~~------------------------~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~ 121 (339) .+.|-+..+.+. -+.. ++.-.|-|.=.|..+-|.++..-....+... T Consensus 74 l~eGv~a~G~~~~~g~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~~D~~l~~h 153 (401) T protein:vir:95 74 NDQGIDASGATIVNGNLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFDSDDGLMEH 153 (401) T ss_pred hhcCCCcccccccCccccccccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhhcchHHHHH Confidence 122222222100 0001 1112233333344444555544344557766 Q ss_pred HHHHHHHH-HHHHHHHHHHHHHHhhcccccccccccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCC- Q lcl|NC_020078. 122 VAREQGQE-IANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVR- 199 (339) Q Consensus 122 ~~~~~g~a-LA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~- 199 (339) ++.++-.. -.+..|. +-++++.++...-.+..... ..+.++.+..+ -..-++.+..+..+|.++.-| T Consensus 154 ~s~ell~g~~~~t~d~-i~~dll~ag~~viyAg~ats---------~At~~~~~~~~-t~vt~~~l~rl~~~L~~nRapk 222 (401) T protein:vir:95 154 LSRELMNGATQITEAV-LQKDLLAAAGTVLYAGAATS---------DATITGEGSTP-SVVSYKNLMRLDQILTENRTPT 222 (401) T ss_pred HHHHHhhhhhhhHHHH-HHHHHHhhcCeeecCCccce---------eeecccccccc-ceechhHHHHHHHHHHhccccc Confidence 65554443 3444444 45566654422111111000 01111111111 112245677777888763321 Q ss_pred --------------CCcCCeEEEECH------HHHHHHhcccchhhhcccccccceeecceeEEEeceEEEEeccccc-c Q lcl|NC_020078. 200 --------------PNEEDMILVLPP------AAFTALMQAEHITNGEYVTSAGETLNTKYMFAAFGVPVITSNNAVF-G 258 (339) Q Consensus 200 --------------~p~~~R~~vv~P------~~~~~Ll~~~~~~n~d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~-~ 258 (339) +...-|+.++.| +....|+.++.|+...-.+..+ .+.+|.||++-+|.+++++.+-. . T Consensus 223 ~t~~i~~s~~~dTk~i~~s~va~~h~~L~~di~a~~D~~~~~~fi~v~kYa~~~-~i~~gEiG~i~~vR~i~~p~~~~w~ 301 (401) T protein:vir:95 223 QTTIITGSRMIDTKVIGATRVMYVGSELVPELKAMKDLFGNKAFIETQHYADAG-TIMNGEVGSIDKFRIIQVPEMLHWA 301 (401) T ss_pred chhhhhhhhccCccccccceEEEEecCchhHHHHHHHhcCCCCceehhhcCCcc-ccccccccccCceeEEecccceeec Confidence 133458899888 5557788889999886444544 57899999999999998876431 1 Q ss_pred ccccccccCC-------CccccccccccceEEEEEeccceeEEEEEeeeeE----Ee-e----------echhhhHHHHH Q lcl|NC_020078. 259 KTITDHLLSN-------ANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTS----KI-F----------FDDLSKLWFID 316 (339) Q Consensus 259 ~~~~~~~l~~-------~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~----e~-~----------~~~~~~~d~i~ 316 (339) ..+.....++ .-.+..|++ |. -+++-+.|-+++..+.--. ++ . .||-.+-=.+- T Consensus 302 ~ag~~a~~~~~~y~~~~~~~gg~~dV-yp---~lV~G~dAf~~~~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQ~g~vg 377 (401) T protein:vir:95 302 GAGAQATGANPGYRTSMVSGQEHYDV-YP---MLVVGDDSFTSIGFQTDGKSLKFTVMTKMPGKETADRNDPYGETGFSS 377 (401) T ss_pred CCcccccccccccccccccCCCccee-ee---eeEEccccceecccccCCccccceeEeecCCcCCCCCCCcccceehhh Confidence 1111001100 001122222 12 2456677777665443211 10 1 12223333444 Q ss_pred HHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 317 SWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 317 g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) =++.|++.++|||..+.|+..+- T Consensus 378 wK~~~a~~vL~~e~m~~ies~a~ 400 (401) T protein:vir:95 378 IKWYYGILVKRPERLALIKTVAP 400 (401) T ss_pred hhhhhhhheeccceeEEEEeecC Confidence 57899999999999999987776 No 175 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=98.24 E-value=1.1e-07 Score=58.72 Aligned_cols=287 Identities=9% Similarity=-0.041 Sum_probs=137.7 Q ss_pred CccccCcccC-------CC-cccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc-ccce Q lcl|NC_020078. 1 MSIFDGQTPS-------YD-VTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI-SKAK 71 (339) Q Consensus 1 ~~~~~~~~~~-------~~-~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i-G~~t 71 (339) .....+.=+- ++ +.+.+ +..+-..+..+.+..++.+...+.+.+++++++.++. |+ +++++- +.++ T Consensus 58 ~~~~~~~~~lt~ee~~~~~~~~~~~---~~~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~~-~~~~~~~~~~~ 132 (377) T protein:vir:98 58 FDLRDKNRELTAEEIKFFNDIDKNV---GGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS-LR-LKALTAETSGT 132 (377) T ss_pred HHhccCCcccCHHHHHHHHHHHhcc---CCCCCccccCHHHHHHHHHHHHHhhhhhhheeeEecC-cc-eEEEEecCCcc Confidence 0000000000 00 11111 1111112344889999999999999999999887764 44 466653 4555 Q ss_pred eeeccCCCCCCCCCCCCccceEEEEeehhhhhhhHHHHHHH-hcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Q lcl|NC_020078. 72 LQKIAPGTTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEF-QTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASD 150 (339) Q Consensus 72 ~~~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~-q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~ 150 (339) +.-...+.++.....+... .+.+...++.++..-..+=. .+.+|+-+.+.++.+.++++..|+.++ .+..... T Consensus 133 a~w~~e~~~~~~~~~~~f~--~i~l~~~kl~a~~~is~elL~ds~~~ie~~i~~~la~~~a~~~~~a~i----~G~G~~q 206 (377) T protein:vir:98 133 AVWGDIFGEIKGQLKQAFK--EQDFSQFKLTAFVVIPKDALKFGPKWIKQFITEQLKEAIAVALELAIV----KGDGLLQ 206 (377) T ss_pred eeEeecccccCcccCccce--eEeecceeEEeeecccHHhhhccHhHHHHHHHHHHHHHHHHHHhhceE----eccCCCc Confidence 5554444555433333444 34455555444432221111 356789999999999999999998774 2221111 Q ss_pred ccccccccccCcccccccc---ccCccccccHHHHHHH-----------HHHHHHHHH---hcCCCCCcCCeEE-EECHH Q lcl|NC_020078. 151 SPYGTAAQMPGHSGGNVVT---LAGANDYKDPAKLYAA-----------IASLVEKFL---EKDVRPNEEDMIL-VLPPA 212 (339) Q Consensus 151 ~~~~~~~~~~g~~~~~~~~---~~~~~~~~~~~~l~~a-----------i~~a~~~L~---e~dV~~p~~~R~~-vv~P~ 212 (339) |. +.......+.+.. ....+...+.+.+.+. ...+...+. .+... ...||++ +++|. T Consensus 207 P~----Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~a~~~m~~~t~~~~~klk-d~~G~~i~~~n~~ 281 (377) T protein:vir:98 207 PV----GLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLTPDNAPKKLVPVMKHLSVNDKKRPL-KIAGQVKLILNPE 281 (377) T ss_pred ce----eeeecccccccccccccccccccchhhhHhhhhhhchhHHHHHHHHHHHHHHHHHHhhhh-ccCCceEEEeccc Confidence 11 1111101111100 0011111111112111 111111111 11111 1245544 46776 Q ss_pred HHHHHhcccchhhhcccccccceeecceeEEEeceE--EEEeccccccccccccccCCCccccccccccceEEEEEeccc Q lcl|NC_020078. 213 AFTALMQAEHITNGEYVTSAGETLNTKYMFAAFGVP--VITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPK 290 (339) Q Consensus 213 ~~~~Ll~~~~~~n~d~~~~~~~~l~~G~v~~i~G~~--V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~ 290 (339) -|..++...... ..+|.-..++|++ |++|+++|.... + -++|+. T Consensus 282 ~~~~~~p~~~~~-----------~~~G~~~t~lg~p~~vv~s~~~p~~~i-----~---------fgdf~~--------- 327 (377) T protein:vir:98 282 DRWALEAQFTSR-----------NQFGEYVTVLPHGITILESLAVETGKA-----I---------AFVANR--------- 327 (377) T ss_pred chhhcccccccc-----------CCCCccccccCCCceEEecCCCCcccE-----E---------EEEecc--------- Confidence 655543221110 1234444566655 678888874311 0 011111 Q ss_pred eeEEEEEeeeeEEeeechhhhH---HHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 291 ALLAGSTIPVTSKIFFDDLSKL---WFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 291 A~~~~~~~~~~~e~~~~~~~~~---d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) -..+....+..+.+++ .+|. ..+++.+-+|.++++|++.++|.++.- T Consensus 328 -Y~i~~r~~~~i~~~~~-~~~~~d~~~f~~~~r~dg~~~~~~a~~vl~i~~~ 377 (377) T protein:vir:98 328 -YDAFMATASTIEEYDQ-TFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred -eeEEeecceEEEeech-hhhhcCceEEEEEEEEcCEEeccCcEEEEEEecC Confidence 2233344455554432 2232 347888899999999999999999988 No 176 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=98.21 E-value=4.8e-07 Score=55.24 Aligned_cols=271 Identities=12% Similarity=0.075 Sum_probs=149.5 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccc----cceeeecc Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGIS----KAKLQKIA 76 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG----~~t~~~~~ 76 (339) |.--. |++..--=..+=++| |.++|+.-+.+=++ ..+..|...+..|.+++++... ....++.. T Consensus 1 M~~e~------nl~~~~dL~~a~siD--F~~~f~~~i~~L~~----~LGv~r~~pla~Gt~iktyK~~~~~y~gda~dVa 68 (303) T protein:vir:10 1 MSAEN------NLINVEALGKAKSID--FANKLGVGLNKLFE----ALAIQNKIPMNVGSALKQYRFKVEDSEKPNGDVA 68 (303) T ss_pred CCCCc------CCcchhhcccceeeh--hhhhhhhhHHHHHH----HhhhhccccccCCceeeeeeeeceeecccccccc Confidence 33322 444332222344454 99999998875443 3455555566678777765542 23456788 Q ss_pred CCCCCCCCCCCC--ccceEEEEeehhhhhhhHHHHHHH-hc--CcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Q lcl|NC_020078. 77 PGTTPPPSTEPH--TSKIFLKIDTVIIARNAEPMLDEF-QT--DFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDS 151 (339) Q Consensus 77 ~g~~i~~~~~~~--~~~~~l~ID~~~y~~~~vdd~D~~-q~--~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~ 151 (339) .|+.|+...... ....++.++ ||... + =||+ |. ..|...+.-+++..++++.+|..++..|-++.... T Consensus 69 EGe~Iplskvt~~~~~t~~~~~k--K~rK~-t--TdEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~lktaT~t~-- 141 (303) T protein:vir:10 69 EGDVIPLTKVTREQVDITELQFA--KYRKS-T--SAEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFETLKSAIENG-- 141 (303) T ss_pred CCcccchhhheeeecceEEEEee--ccccc-c--cHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhhccccc-- Confidence 899998765322 124567776 76654 3 5666 54 35699999999999999999999988775432110 Q ss_pred cccccccccCccccccccccCccccccHHHHHHHHHHHHHHH---HhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcc Q lcl|NC_020078. 152 PYGTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKF---LEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEY 228 (339) Q Consensus 152 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L---~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~ 228 (339) ..+. +...+.+.|-+++-....+| +|.++ .-+++|+|.-.+.||++..+... T Consensus 142 -----------------~~t~-~t~~s~~glq~Al~~~~~kl~~~~ed~~-----~~V~FvNP~Daa~yl~~A~i~~~-- 196 (303) T protein:vir:10 142 -----------------KRTN-KTKLSAENLQGALSKGRANLSVLLDDEI-----TPIAFVNPNDTAEYLANGFINST-- 196 (303) T ss_pred -----------------cccc-ceeecHHHHHHHHHhhhhhccccccccc-----cEEEEEchHHHHHHhhcCCcchh-- Confidence 0000 11122344444443333343 34332 24889999999999998776532 Q ss_pred cccccceeecceeEEEeceEEEEecccccccccccc-------------ccCCCccccccccccceEEEEEeccceeEEE Q lcl|NC_020078. 229 VTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDH-------------LLSNANNEKAYDGDFKDIVAQMFSPKALLAG 295 (339) Q Consensus 229 ~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~-------------~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~ 295 (339) ++ .+---.+.+++|+.|+.|+.+|.+...... .++.++ .|.++-+..+|+ . T Consensus 197 -~t---~fG~n~L~nfLG~~II~S~kv~~G~~~~T~~~Ni~~ay~~~~g~l~~~f---~~t~D~tglIGv---------~ 260 (303) T protein:vir:10 197 -GA---QFGVNLLTPYVGVKIVEFADVPQGEVWMTVAENLNVAYANPRGELSRAF---AFATDATGFVGV---------L 260 (303) T ss_pred -hh---hhhhhhhhhhhcceEEEeccCCCceEEEeeccceEEEEecCchhhhhhh---hhccccccceEE---------E Confidence 11 111123446999999999999975432211 111111 122222222221 1 Q ss_pred EEeeeeEEeeechhhhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 296 STIPVTSKIFFDDLSKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 296 ~~~~~~~e~~~~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .... ++.--+..-++-|...| +=|+|++++..++++ T Consensus 261 h~~~-----~~~~t~eT~~~~~~~lf---pE~~dgiv~~ti~~~ 296 (303) T protein:vir:10 261 HDIQ-----PQRLTSDTIYASAISMF---PENIDAVIKVTIKKD 296 (303) T ss_pred eccc-----cceeeehhHhHhHHHhc---ccccceEEEEEEecc Confidence 1111 11111222223333322 347788999988888 No 177 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=97.72 E-value=7.8e-06 Score=48.59 Aligned_cols=289 Identities=12% Similarity=0.040 Sum_probs=154.8 Q ss_pred ccCCC-cccCCccCcccchhHHHH-HHHHHHHHHHHHHHhhhc--cccccc-cc-----cccceEEEecccccee--eec Q lcl|NC_020078. 8 TPSYD-VTRPNQRHGAGDPLADVT-EQFTGTVEGTIKRRSIMA--GFVPVR-SV-----RGTSTISNRGISKAKL--QKI 75 (339) Q Consensus 8 ~~~~~-~~r~~~~~~~~~~~a~~i-e~~~g~v~~~f~~~sv~~--~~v~~r-~i-----~~G~tv~i~~iG~~t~--~~~ 75 (339) .|.|| -||. .| +|+ |+|...|.+.-.+.+-|. +.+..+ ++ .+|+.+.+|..+...- ..| T Consensus 1 M~~~~~~T~l--------~D-ii~pEvF~~Yv~~~~~e~~~l~qSGiv~~d~~l~~~~~~gG~~v~iPf~~~L~g~~~n~ 71 (367) T protein:vir:80 1 MPDFNNQVRL--------VD-AVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNY 71 (367) T ss_pred Ccchhhhhhh--------hh-ccchhhhhHHHhhhhhhhhhhhhcceeecCHHHHHHhhcCCCEEEeeeeccCCCCcccc Confidence 33332 1221 12 344 888888888766655544 222222 33 5799999999987632 233 Q ss_pred cCCC---CCCCCCCCCcc-ceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Q lcl|NC_020078. 76 APGT---TPPPSTEPHTS-KIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDS 151 (339) Q Consensus 76 ~~g~---~i~~~~~~~~~-~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~ 151 (339) ..++ ++++.+ +.+. ..-.++ ..--++...|+-..-+--|+|..+..+.+.--.|. ||.+|...+++....+. T Consensus 72 ~~d~~~~~~t~~k-ittg~~~a~v~--~r~kaw~~~Dla~~lsG~dpm~~Ia~qva~yW~r~-~q~~Lla~L~Gvf~~~~ 147 (367) T protein:vir:80 72 GSDNPNVEAPIDG-LGSGEMKTTKT--WLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQ-WQRRIIAMAVGVYKSNL 147 (367) T ss_pred CCCCCcccccccc-cccchheeeee--hhcccchhhhHHHHhhCchHHHHHHHHHHHHhhhh-hHHHHHHHHHHhhcccc Confidence 2222 233322 2222 222222 23346777899888888899999999988777776 55555666565543322 Q ss_pred ccccccc---------ccCccccccccccCc----cccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHh Q lcl|NC_020078. 152 PYGTAAQ---------MPGHSGGNVVTLAGA----NDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALM 218 (339) Q Consensus 152 ~~~~~~~---------~~g~~~~~~~~~~~~----~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll 218 (339) +...... ..+.....+....+. ....++ +++.+|...|.++. +.=-.++|.+..|..|. T Consensus 148 a~~~~~~~~~~~~~a~~~~~~~~~~~Dis~~t~~~~~~~s~----~~~~~A~~~lGD~~----~~l~~i~mHS~V~~~L~ 219 (367) T protein:vir:80 148 AGNFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNR----EAFVDAAFTMGDHV----GSIAAIAVHSMVYKRMT 219 (367) T ss_pred ccchhhhhhhhccccccccccCceeeeeeccCCCccceecH----HHHHHHHHHhcccc----ccccEEEEchHHHHHHH Confidence 1110000 001111112221111 122344 44556666665432 22357899999999987 Q ss_pred cccchhhhcccccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEe Q lcl|NC_020078. 219 QAEHITNGEYVTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTI 298 (339) Q Consensus 219 ~~~~~~n~d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~ 298 (339) +.. ++. |....+. +..|..++|..|+++..+|.....+ . ..-++++|-+-|+++.+.. T Consensus 220 ~~~-li~--~i~~sd~---~~~i~ty~G~~VIvDD~~Pv~~~~a---------~-------~~yttYlfg~GAi~~~~~~ 277 (367) T protein:vir:80 220 NND-EIE--FIPDSKG---QLTIPTYMGKVVIVDDGMPVFGTGA---------D-------KTYLSILFGGAAFGYADGA 277 (367) T ss_pred hcc-ccc--cccCCCC---ccccceecceeEEEeCCCcccccCC---------C-------ceEEEEEEecceeeecccC Confidence 764 442 3322221 3568999999999999998542111 1 1234788899999988876 Q ss_pred eee-EEeeechhhh----HHHHH-----HHHHhCCccccccceE--------------------EEEecCC Q lcl|NC_020078. 299 PVT-SKIFFDDLSK----LWFID-----SWLAFGVTINRTEYAG--------------------VIKLPAA 339 (339) Q Consensus 299 ~~~-~e~~~~~~~~----~d~i~-----g~~~~Ga~v~rPe~~v--------------------~i~~~~a 339 (339) +.. +|..|++... .|.+. .+|.+|.+-....-+. +|...+- T Consensus 278 ~~~~~E~~Rd~~~~~~gG~d~L~~Rr~~~~hP~G~s~~~~~v~~~~~~~~~~~~~~~~~sPt~~eLa~~~N 348 (367) T protein:vir:80 278 PQVPVAVGRRELRGNGSGLEYILERKEWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLANPDN 348 (367) T ss_pred CccceecccchhhhcCCceEEEEeeeeEEeecceeeecccccccccccccccccccccCCCChHHhcCCcc Confidence 533 5888888642 13322 4566666655221100 0000000 No 178 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=97.72 E-value=2.5e-06 Score=51.31 Aligned_cols=304 Identities=13% Similarity=0.128 Sum_probs=169.7 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccccceeeeccCCCC Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISKAKLQKIAPGTT 80 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~~t~~~~~~g~~ 80 (339) .-...|..|+-- .|.--...+++..=+.-++-++-+.++-.--.+-..++-.-.++.|.+-.|+.+|-.-+.+...|++ T Consensus 56 ~Kmm~G~~p~~e-V~~~e~mtt~~a~IliP~vis~v~~Eaaepl~~~~kl~qk~~L~~Grsm~F~~~g~~Ra~~IgEGgE 134 (393) T protein:vir:79 56 AKMMEGETPTNE-VNLREFMATPSAQILIPRVIVGTMREAAEPLYIGTKMLQKIRLKSGQSMIFPSIGIMRAYDVAEGQE 134 (393) T ss_pred HHHhcCCCchhh-eehhhhhcCCCcceechhhhhhhhhhcccchhHHHHHHHHHhhhcCcceeccchheeeecccccccc Confidence 111239999854 3333334555555445588888888754322222233332356679999999999988899988988 Q ss_pred CCCCCCCCccceEEEEeehhhhhhhHHHHHHHh--cCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcc--ccccccccc Q lcl|NC_020078. 81 PPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQ--TDFDYQGEVAREQGQEIANMYDETFFIMAAKAAI--ASDSPYGTA 156 (339) Q Consensus 81 i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q--~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~--~~~~~~~~~ 156 (339) ++..+--..+.-.+.+-+.|| ...|.--|+.- +.+|+++-..+.++.+|||..|+-++.++-+-+- +..-+..+. T Consensus 135 ~~~~sld~~T~dsv~~~~gK~-G~~Ia~SqEmIsDSg~Dvin~~l~aA~RaMaRkKee~a~n~fk~~ghtvfDa~st~t~ 213 (393) T protein:vir:79 135 IPEDSIDWQTHESPEIRVGKS-GIRLRFTDEMISDSQWDLMSMMIKQAGRAMGRHKEQKAYHQFRSHGHTVFDNYSTNKL 213 (393) T ss_pred ccccchhhhcCCceeEEechh-hhhhhhHHHHhhcchHHHHHHHHHHHHHHHHhhhHHHHHhhhhcccceeeeccccCcc Confidence 876542224444677767776 56666666654 4699999999999999999999999988754332 111111222 Q ss_pred ccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhh---ccccccc Q lcl|NC_020078. 157 AQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNG---EYVTSAG 233 (339) Q Consensus 157 ~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~---d~~~~~~ 233 (339) +..+|.+.... .......+.+.|.++.+ +..- -.| -++++.|=.|+..-|....-.. .|+.-+. T Consensus 214 ahptGr~~~~~-----qNGTlSleDllDm~~av---~~~h--yt~---svi~MHPLAWnv~AKna~me~~~~na~gN~~~ 280 (393) T protein:vir:79 214 AHTTGLDKNGV-----QNDTFSAEDFLDLIIAV---MANE--YTP---SDLMMHPLAWTVFAKNELMGSLQANPYGNYPA 280 (393) T ss_pred ceeecCCcccc-----ccccccHHHHHHHHHHH---hccc--CCc---ceEEEcCchhhhhhhhhhhcceeeccccccCc Confidence 22222221111 11223455555544433 3221 123 4788999888888776332110 1111110 Q ss_pred ceeecceeEEEe-----------ceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeE Q lcl|NC_020078. 234 ETLNTKYMFAAF-----------GVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTS 302 (339) Q Consensus 234 ~~l~~G~v~~i~-----------G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~ 302 (339) ...+ -.+.. ++.|+.|+=+|.-.. +.-+.. |.++ .+.+|++. |+- ++++ T Consensus 281 ~~~~---ts~algp~~i~~~~~~nlnv~~sPfvp~d~k------~~rFd~--~~Vd-~NnvgvlL-------V~D-~i~t 340 (393) T protein:vir:79 281 KGAP---SSMALGPDSIQGRLPFNFNVNLSPFIPLDKK------SRRFDV--YAVD-RNNVGVLL-------VRD-DLKT 340 (393) T ss_pred cccc---hhhhhchhhhccccccceeEEEecccccccc------cceeeE--EEee-cCCceEEE-------Eec-Ccce Confidence 0010 01123 377777776664321 111100 1111 23334433 332 6788 Q ss_pred EeeechhhhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 303 KIFFDDLSKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 303 e~~~~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) +.|.++-+--.-|+-.-.||.+|++--.++.+----. T Consensus 341 dq~ddk~rdiq~iKl~ERYG~gvLn~gkaiavakNI~ 377 (393) T protein:vir:79 341 DQWDEKARGLQNIKMIERYGIGILNEGKAIAVAKNIS 377 (393) T ss_pred eccccccccceeeeeeeeeceeeeeCCceEEEEecce Confidence 8888877666667777899999998776654422111 No 179 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=97.18 E-value=0.00014 Score=41.73 Aligned_cols=284 Identities=12% Similarity=0.027 Sum_probs=134.1 Q ss_pred CccCcccchhHHHHHHHHHHHHHHHHHHhhhcccccccc-cc-ccceEEEecccc-ceeeeccCCCC-CCCCCCCCccce Q lcl|NC_020078. 17 NQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRS-VR-GTSTISNRGISK-AKLQKIAPGTT-PPPSTEPHTSKI 92 (339) Q Consensus 17 ~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~-i~-~G~tv~i~~iG~-~t~~~~~~g~~-i~~~~~~~~~~~ 92 (339) =++-..|-..+-+++.+..+|.+......+.+.++..++ +- +-.++.+...-. ..++-|..+.+ ++.. +.+-.+. T Consensus 1 ~~~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~dip~~-~~~~~~~ 79 (301) T protein:vir:80 1 MQGKITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTRSGAAKIIANGADDLPLV-DVDMVRK 79 (301) T ss_pred CCccccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeeccceeEEEecCcccccccc-cccceeE Confidence 111122223344556667777777777888888877663 32 245555554422 23444443332 2221 1222333 Q ss_pred EEEEee-hhhhhhhHHHHHHHh-cCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccc--cCccccccc Q lcl|NC_020078. 93 FLKIDT-VIIARNAEPMLDEFQ-TDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQM--PGHSGGNVV 168 (339) Q Consensus 93 ~l~ID~-~~y~~~~vdd~D~~q-~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~--~g~~~~~~~ 168 (339) ...|=+ ..-|...+.++..++ ...++-..-...+..++++..|+.+|.= ++.....+.. ++....... T Consensus 80 ~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G--------~~~~g~~GLlN~p~~~~~~~~ 151 (301) T protein:vir:80 80 SVPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRG--------EKKYAIKGAFEATGIQIDVSP 151 (301) T ss_pred EEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeee--------cccccceeeecCCCccccccc Confidence 333311 133456677888886 4788888889999999999999987621 1111111111 111111100 Q ss_pred -cccCcc---ccccHHHHHHHHHHHHHHHHhc--CCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceeecceeE Q lcl|NC_020078. 169 -TLAGAN---DYKDPAKLYAAIASLVEKFLEK--DVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLNTKYMF 242 (339) Q Consensus 169 -~~~~~~---~~~~~~~l~~ai~~a~~~L~e~--dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~~G~v~ 242 (339) ...+.. ...+++++++-|..+..+|.++ .+..| -.++|+|+.|..|..- .++.. .+ ..+..-... T Consensus 152 ~~~~~~~~~w~~~t~~ei~~di~~~~~~l~~~s~g~~~p---~~L~L~p~~~~~L~~~--~~~~~-~~---~tvl~~l~~ 222 (301) T protein:vir:80 152 TTGVGNVSKWEKKTAEQIIDEIGEAHTKITVLPGYGTAS---LKLCLPPKQFELINKK--RYSNE-DS---RSVLKVLQD 222 (301) T ss_pred CcccccccccccCCHHHHHHHHHHHHHHHHHhcCceecc---cEEEecHHHHHhhhhc--cccCC-CC---eeHHHHHHH Confidence 000111 2236889999999999998664 44334 2689999999988631 11100 10 111110001 Q ss_pred EEeceEEEEeccccccccccccccCCCccccccccccceEEEEEe--ccceeEEEEEeeeeEEeeechhhhHHHHHHHH- Q lcl|NC_020078. 243 AAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMF--SPKALLAGSTIPVTSKIFFDDLSKLWFIDSWL- 319 (339) Q Consensus 243 ~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~--h~~A~~~~~~~~~~~e~~~~~~~~~d~i~g~~- 319 (339) +.-+..|...+.+-.. +.. | .. ..+++ .++-+-.+-.++++.-.. ..+...+.+.+.. T Consensus 223 ~~~~~~I~~~p~L~~~-g~~-------g-~~---------~~v~~~~~~d~~~~~v~~~~~~~~~-e~~~~~~~~~~~~r 283 (301) T protein:vir:80 223 NAWFSAIVRVPDLAGM-GTA-------G-SD---------SFAVIHDSNETAELIIPMDITRHPE-EYSFPRTKVPFEER 283 (301) T ss_pred HcCcceEEEcceeccC-CCC-------c-cc---------EEEEEecCCcEEEEEecCceeeecc-eecCceeEeeeeee Confidence 1223455555555321 000 0 00 01111 122222222223221111 1111222333333 Q ss_pred HhCCccccccceEEEEecCC Q lcl|NC_020078. 320 AFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 320 ~~Ga~v~rPe~~v~i~~~~a 339 (339) ..|..+.||++++.+. += T Consensus 284 ~~Gv~i~~P~ai~~~~--GI 301 (301) T protein:vir:80 284 TAGVVVRFPAAIVRVD--GI 301 (301) T ss_pred eEEEEEEccceEEEEe--cC Confidence 3478888998877553 22 No 180 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=96.42 E-value=0.00064 Score=38.09 Aligned_cols=285 Identities=11% Similarity=0.051 Sum_probs=149.1 Q ss_pred CccccCcccCCCcccCCccCcccchhH-HH---HHHHHHHHHHHHHHHhhhc--cccccc-cc-----cccceEEEeccc Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLA-DV---TEQFTGTVEGTIKRRSIMA--GFVPVR-SV-----RGTSTISNRGIS 68 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a-~~---ie~~~g~v~~~f~~~sv~~--~~v~~r-~i-----~~G~tv~i~~iG 68 (339) |. ....+ +. +|+|...|.+.-.+.+-|. +.+..+ ++ .+|+.+.+|..+ T Consensus 1 Ma--------------------~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~ 60 (349) T protein:vir:78 1 MA--------------------ITTIGDIVTGNIPVLASYMTEDPVEKTAFFDSGILTSTPYAAEIANGPSNIANLPFWK 60 (349) T ss_pred CC--------------------ceEEeeeeccCHHHHHHHHHHhhHHhhhhhhccceeccHHHHHHhhcCCCEEEeeeee Confidence 22 11111 11 2467777766655544444 222222 22 469999999998 Q ss_pred ccee--e-ec---cCCCCCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020078. 69 KAKL--Q-KI---APGTTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMA 142 (339) Q Consensus 69 ~~t~--~-~~---~~g~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l 142 (339) ...- . .| .+...+++.. +.+.+..-++ ...-.++...|+=..-+--|+|..+.++.+.--.|. ||..+... T Consensus 61 ~L~g~~e~nv~~D~~~~~~t~~k-itt~~~~a~~-~~r~kaw~~~Dla~~lsG~dpm~~Ia~~va~yW~r~-~q~~Lia~ 137 (349) T protein:vir:78 61 AIDTSIEPNYSNDVYQDIATPRA-IQTGEMMARV-AYLNEGFGQADLTVELTSQNPLQSVASRLDNFWQRQ-AQRRLIAT 137 (349) T ss_pred cCCCCcccccCCCCccccccccc-ccccceeeee-eeeccccchhHHHHHhhCchHHHHHHHHHHHHHhhH-HHHHHHHH Confidence 7542 1 12 2122333322 3333222222 233446777888877777799999999988887776 45555555 Q ss_pred HhhcccccccccccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccc Q lcl|NC_020078. 143 AKAAIASDSPYGTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEH 222 (339) Q Consensus 143 ~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~ 222 (339) +++....+.....+... .+..+....+.+..++..+.++...+...+.. +. ...=-.+++-+..|..|.+.. T Consensus 138 L~Gvf~~~~~a~~~~~~----~~~~t~d~s~~a~~~~~~~~dA~~~lgda~~G-d~--~~~lt~i~mHS~v~~~L~~~~- 209 (349) T protein:vir:78 138 ALGLYNDNVSATDAYHE----QNDMVVDVSATLGFDAGAFIDATQTMGDALMG-NG--GEVLGAIAMHSFVYAQARKAQ- 209 (349) T ss_pred HHHhhcccccccchhhh----cccceeeeccccCCChhhhhhhHHHHHHHhcc-cc--ccceeEEEEchHHHHHHHhhh- Confidence 56554433222111110 01111111122224555555555544443311 11 111247889999999987654 Q ss_pred hhhhcccccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEee-ee Q lcl|NC_020078. 223 ITNGEYVTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIP-VT 301 (339) Q Consensus 223 ~~n~d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~-~~ 301 (339) ++ +|.... -+...|..++|..|+++..+|....+ ... .-++++|-+-|+++....+ +. T Consensus 210 li--~~i~~s---~~~~~i~ty~G~~VivDD~~Pv~~~g---------~~~-------~yttylfg~GAi~~~~~~~~~~ 268 (349) T protein:vir:78 210 LI--DFIRDA---ENNTMFATYQGYRVIVDDSMTVVGQG---------AQR-------KFISIIFGQGAIGYGEGNPVMP 268 (349) T ss_pred hh--hhccCc---ccCcccceecCeEEEEeCCCccccCC---------CCc-------eEEEEEeecceEEEccCCCccc Confidence 43 233211 13456889999999999999853211 112 2246888899999988664 24 Q ss_pred EEeeechhhh----HHH-----HHHHHHhCCccccccceEEE------EecCC Q lcl|NC_020078. 302 SKIFFDDLSK----LWF-----IDSWLAFGVTINRTEYAGVI------KLPAA 339 (339) Q Consensus 302 ~e~~~~~~~~----~d~-----i~g~~~~Ga~v~rPe~~v~i------~~~~a 339 (339) +|..|++... -|. ...+|.+|.+-... .+.- ..+.. T Consensus 269 ~et~rd~~~g~~~G~d~l~~R~~~~~hp~G~s~~~a--~v~~~~~~~~~~sPt 319 (349) T protein:vir:78 269 LEYEREASRANGGGVETLWTRKTWLLHPFGYRFTSA--VITGNGTETIARSAS 319 (349) T ss_pred eeeecccccCCcceeEEEEEeeEEEeeeeeeeeccc--cccCCccccccCCCC Confidence 6777777532 133 33467777766643 1110 00011 No 181 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=96.28 E-value=0.00059 Score=38.30 Aligned_cols=278 Identities=13% Similarity=0.059 Sum_probs=133.8 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHH-HHHHHHHHHHHH----HHhhhcccccccc-ccc-cceEEEec---cccc Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVT-EQFTGTVEGTIK----RRSIMAGFVPVRS-VRG-TSTISNRG---ISKA 70 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~i-e~~~g~v~~~f~----~~sv~~~~v~~r~-i~~-G~tv~i~~---iG~~ 70 (339) |.+ ..+|....|+ +++. +++.... ...+.+.++..++ +-. -.++.+.. .|.. T Consensus 1 ~~~-----------------~~a~~~~~f~~~ql~-~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a 62 (296) T protein:vir:10 1 MGV-----------------DKADAAGIWTVKQLT-ASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDGVGIA 62 (296) T ss_pred Ccc-----------------cchhhhHHHHHHHHH-HHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeeccCce Confidence 211 1223333455 6665 4444433 3456666666553 222 34555444 3444 Q ss_pred eeeeccCCC-CCCCCCCCCccceEEEEee-hhhhhhhHHHHHHHhc-CcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_020078. 71 KLQKIAPGT-TPPPSTEPHTSKIFLKIDT-VIIARNAEPMLDEFQT-DFDYQGEVAREQGQEIANMYDETFFIMAAKAAI 147 (339) Q Consensus 71 t~~~~~~g~-~i~~~~~~~~~~~~l~ID~-~~y~~~~vdd~D~~q~-~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~ 147 (339) + -|..+. +++.. +.+-++....|=. ..-|...+.++..++. ..++-..-...++.++++..|+.+|. T Consensus 63 ~--~~~~~~~dip~v-~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~------- 132 (296) T protein:vir:10 63 Q--IVADYTDDLPLV-DALATERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWS------- 132 (296) T ss_pred e--EeCCCcccccee-eccceeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEe------- Confidence 3 333222 23221 1222333333211 1224445778887765 68888888899999999999997751 Q ss_pred cccccccccccccCccccccccccC-ccccccHHHHHHHHHHHHHHHHhc--CCCCCcCCeEEEECHHHHHHHhcccchh Q lcl|NC_020078. 148 ASDSPYGTAAQMPGHSGGNVVTLAG-ANDYKDPAKLYAAIASLVEKFLEK--DVRPNEEDMILVLPPAAFTALMQAEHIT 224 (339) Q Consensus 148 ~~~~~~~~~~~~~g~~~~~~~~~~~-~~~~~~~~~l~~ai~~a~~~L~e~--dV~~p~~~R~~vv~P~~~~~Ll~~~~~~ 224 (339) -++... +.|.-....++... .++=.++..+++-|..+...|.++ .+..|. .++++|+.|..|... . T Consensus 133 -G~~~~g----~~GLlN~p~v~~~~~~~~W~~~t~i~~Di~~~~~~l~~~s~g~~~p~---~l~L~p~~~~~L~~~---~ 201 (296) T protein:vir:10 133 -GSTAHG----IPSVFDYPNINNVVSGGSWSQPTTAVSDITSLLDIIETSTNGQHRAT---HLLLPTTARRIMQNL---V 201 (296) T ss_pred -eccccc----ceeEeecCCCccccccCCccCHHHHHHHHHHHHHHHHHhhCceecce---eEEeCHHHHHHHhhc---c Confidence 111111 11211111111111 111124567788788887766543 444453 678899999988642 1 Q ss_pred hhcccccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEe Q lcl|NC_020078. 225 NGEYVTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKI 304 (339) Q Consensus 225 n~d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~ 304 (339) + +.+-. +..-.-.+..+..|...+.+... ++. + . +..+++...++-+..+-.++++.-. T Consensus 202 ~----~~~~t-~l~~ik~~~~~l~i~~~~~l~~a-~~~-------g-~-------~~~v~~~~~~~~~~~~v~~~~~~~~ 260 (296) T protein:vir:10 202 P----GTSVS-YGEFFRQNNSGVTVEFVQYLNDY-NGT-------G-T-------SAAIAYEKDPNNMAIEIPEATNALP 260 (296) T ss_pred C----CCCcc-HHHHHHHhcCCceEEEeeeeccC-CCC-------c-c-------eEEEEEEcCCceEEEEcCcceeeec Confidence 1 11111 11101011235555555555321 100 0 0 0111222234444444445554332 Q ss_pred eechhhhHHHHHHHHHh-CCccccccceEEEE-ecCC Q lcl|NC_020078. 305 FFDDLSKLWFIDSWLAF-GVTINRTEYAGVIK-LPAA 339 (339) Q Consensus 305 ~~~~~~~~d~i~g~~~~-Ga~v~rPe~~v~i~-~~~a 339 (339) -.++...+.++..... |..+.||++++.+. +|=| T Consensus 261 -~e~~~l~~~~~~~~~~~Gv~i~~P~ai~~~dGI~~~ 296 (296) T protein:vir:10 261 -AQPKDLHFKIPVTSKATGLIVYRPLTMAVMKGITFA 296 (296) T ss_pred -ccccCceEEEeeEeeEEEEEEECCceeEEEeeeecC Confidence 2335566667777745 69999999988661 2333 No 182 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=95.17 E-value=0.0015 Score=36.12 Aligned_cols=295 Identities=11% Similarity=0.019 Sum_probs=135.2 Q ss_pred CccccCcccC-------CCcccCCccCcccchhHHHH-HHHH---HHHHHHHHHHhhhcccccccc-cccc-ceEEEe-- Q lcl|NC_020078. 1 MSIFDGQTPS-------YDVTRPNQRHGAGDPLADVT-EQFT---GTVEGTIKRRSIMAGFVPVRS-VRGT-STISNR-- 65 (339) Q Consensus 1 ~~~~~~~~~~-------~~~~r~~~~~~~~~~~a~~i-e~~~---g~v~~~f~~~sv~~~~v~~r~-i~~G-~tv~i~-- 65 (339) |+- =+|-- ....+-|....+.+..+.|+ ++|. ..|.+......+.+.++.+++ +--| .++.+. T Consensus 1 ~~~--~~~~~~~~~~~~~~~~~~~~~~da~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~ 78 (319) T protein:vir:10 1 MTT--KKFDEADKSNVEMYLIQAGVKQDAAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVTTELSPTDKTFEYMTF 78 (319) T ss_pred CCC--cchhHHhhHHHHHHHhhccchhhhhhhhhhHHHHHHHHHHHHHHhhhhcceechhhcccccCCCCceEEEEeeee Confidence 332 11111 11112222222323333564 4443 333333334456666666653 2223 344443 Q ss_pred -cccccee-eeccCCCCCCCCCCCCccceEEEEee-hhhhhhhHHHHHHHh-cCcchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020078. 66 -GISKAKL-QKIAPGTTPPPSTEPHTSKIFLKIDT-VIIARNAEPMLDEFQ-TDFDYQGEVAREQGQEIANMYDETFFIM 141 (339) Q Consensus 66 -~iG~~t~-~~~~~g~~i~~~~~~~~~~~~l~ID~-~~y~~~~vdd~D~~q-~~~d~~~~~~~~~g~aLA~~~D~~i~~~ 141 (339) ..|..++ .++. .+++.. +.+-.+....|=. ..-|..-+.++..++ ...++-..-...+..++++..|+.+|.= T Consensus 79 ~~~G~a~~~~d~~--~dip~v-~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G 155 (319) T protein:vir:10 79 DKVGTAQIIADYT--DDLPLV-DALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQLVNRLVFKG 155 (319) T ss_pred ccccceeeecCcc--ccccce-eccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEee Confidence 3455432 2332 223211 1222233332211 123455567888886 4788888889999999999999987521 Q ss_pred HHhhccccccccccccc--ccCccccccccccCccccccHHHHHHHHHHHHHHHHhc--CCCCCcCCeEEEECHHHHHHH Q lcl|NC_020078. 142 AAKAAIASDSPYGTAAQ--MPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEK--DVRPNEEDMILVLPPAAFTAL 217 (339) Q Consensus 142 l~~aA~~~~~~~~~~~~--~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~--dV~~p~~~R~~vv~P~~~~~L 217 (339) ++.....+. .+|....+... .......+++.+++-|..+..+|..+ .+..|. .++++|+.|..| T Consensus 156 --------~~~~g~~GLlN~p~~~~~~~~~-~~~~~t~t~~~i~~di~~~~~~l~~~s~g~~~p~---~L~L~p~~~~~L 223 (319) T protein:vir:10 156 --------SAPHKIVSVFNHPNITKITSGK-WIDVSTMKPETAEAELTQAIETIETITRGQHRAT---NILIPPSMRKVL 223 (319) T ss_pred --------cccccceeEEeCCCceeeecCC-CCCccccCHHHHHHHHHHHHHHHHHhcCceeece---EEEecHHHHHhh Confidence 111111111 12221111110 01112236788998898988888643 454453 788999999988 Q ss_pred hcccchhhhcccccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEE Q lcl|NC_020078. 218 MQAEHITNGEYVTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGST 297 (339) Q Consensus 218 l~~~~~~n~d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~ 297 (339) ..- ..+ +.-+-..-++. +.-+++|...+.+.... +. + + ...+++...++-+..+-. T Consensus 224 ~~~--~~~--~~~t~l~~lk~----~~~~l~I~~~pel~~ag-~~-------g-~-------~~~v~y~~~~~~~~~~v~ 279 (319) T protein:vir:10 224 AIR--MPE--TTMSYLDYFKS----QNSGIEIDSIAELEDID-GA-------G-T-------KGVLVYEKNPMNMSIEIP 279 (319) T ss_pred hcc--cCC--CCeeHHHHHHH----hcCCceEEEeeeecccC-CC-------c-c-------eEEEEEecCCceEEEecC Confidence 531 111 11010111111 12345566666554211 00 0 0 011112223444444434 Q ss_pred eeeeEEeeechhhhHHHHHHHH-HhCCccccccceEEEEecCC Q lcl|NC_020078. 298 IPVTSKIFFDDLSKLWFIDSWL-AFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 298 ~~~~~e~~~~~~~~~d~i~g~~-~~Ga~v~rPe~~v~i~~~~a 339 (339) ++++.... .++...+.+.+.. ..|.-+.||++++.+ .+= T Consensus 280 ~~~~~~~~-e~~~l~~~~~~~~r~~Gv~i~~P~ai~~~--dGI 319 (319) T protein:vir:10 280 EAFNMLPA-QPKDLHFKVPCTSKCTGLTIYRPMTIVLI--TGV 319 (319) T ss_pred cceeeeee-eecCceEEEeeeeeeEEEEEEccceeEee--ecC Confidence 44433321 2344555555544 456889999887655 222 No 183 >protein:vir:98871 Length: 314 # NCBI annotation: major capsid protein # Family: family:all:3269 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164418;genbank:gi:56694908;genbank:GeneID:3197261 Probab=95.15 E-value=0.0026 Score=34.71 Aligned_cols=285 Identities=11% Similarity=0.091 Sum_probs=144.2 Q ss_pred ccCcccCC-CcccCCc-cCcccchh---HHHHHHHHHHHHHHHHHHhhhcccccc-----ccccccceEEEecccc--ce Q lcl|NC_020078. 4 FDGQTPSY-DVTRPNQ-RHGAGDPL---ADVTEQFTGTVEGTIKRRSIMAGFVPV-----RSVRGTSTISNRGISK--AK 71 (339) Q Consensus 4 ~~~~~~~~-~~~r~~~-~~~~~~~~---a~~ie~~~g~v~~~f~~~sv~~~~v~~-----r~i~~G~tv~i~~iG~--~t 71 (339) ..-||--| +|.+-.. ..+..+-+ -.|-|+|.+-+.+-|++++.|++..-- +-+..-++.---...+ +- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~t~N~n~avr~Y~Kqf~glL~~vf~~qa~F~~~FGg~lQalDGV~~N~tafsvKtsD~pVV 80 (314) T protein:vir:98 1 MKKQFKPFLPLNNIQFFASGTANQNKAARSYQKEFRQLLQAVFRSQAYFRDFFGGGIEALDGVQHNDTAFYVKTSDIPVV 80 (314) T ss_pred CcccccccccccceeeeeeccccCccceeeecHHHHHHHHHHHhhHhhhhhhcccceeeccCCCccceEEEEeeccccee Confidence 44555554 4443322 11222211 157899999999999999999865432 2223222221111222 12 Q ss_pred ee-eccCCCCCCCCCCCC------ccceEEEEeeh-hhhhhhH--HHHHHHhcCcchHHH---HHHHHHHHHHHHHHHHH Q lcl|NC_020078. 72 LQ-KIAPGTTPPPSTEPH------TSKIFLKIDTV-IIARNAE--PMLDEFQTDFDYQGE---VAREQGQEIANMYDETF 138 (339) Q Consensus 72 ~~-~~~~g~~i~~~~~~~------~~~~~l~ID~~-~y~~~~v--dd~D~~q~~~d~~~~---~~~~~g~aLA~~~D~~i 138 (339) +. -|..++.....+... .-+..+.+|+. .|..-+. .-||+.-.+-|+-.. -.+.++.|-++.+|..+ T Consensus 81 ig~~Y~TdeNvaFGtGTg~SsRFGprkEi~y~dtdVpY~~~~~iHEGiD~~TVNnd~~aaVAdRL~LQA~Akt~~~n~~~ 160 (314) T protein:vir:98 81 VGNEYNKDENVGFGEGTSRSTRFGPRREIIYQDTPVPYTWEWVYHEGIDKHTVNNDFQAAVADRLDLQANAKIKQFNAQH 160 (314) T ss_pred ecCcccCCCCcccccCCccccccCceeEEEeecccccccccchhhhccccccccCChhHHHHHHHHHHHHHHHHHHHHHH Confidence 22 355544432211110 01122333332 2221111 345555555554443 44567888889998776 Q ss_pred HHHHHhhcccccccccccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHh Q lcl|NC_020078. 139 FIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALM 218 (339) Q Consensus 139 ~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll 218 (339) -..|-..|..+ ...++ .+.+.+-+.+..|.++....+|.. ...+.|.|+.|.+|+ T Consensus 161 Gk~lS~~As~t---------------------e~ltd-~~~d~V~~LF~~as~~yvn~ev~~---~~~AyV~~evYnaii 215 (314) T protein:vir:98 161 SKFISSIAEKT---------------------ETLTD-YSADNVLRLFNELSKYYVNIEAIG---TKAAKVSPELYNAIV 215 (314) T ss_pred HHHHHhhhhhh---------------------hhhhh-cchhhHHHHHHHHHhhhhcceeeE---EEEEEEchhHHhHhh Confidence 44443322110 00011 122333344444555555455533 367789999999999 Q ss_pred cccchhhhcccccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEE-EE Q lcl|NC_020078. 219 QAEHITNGEYVTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAG-ST 297 (339) Q Consensus 219 ~~~~~~n~d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~-~~ 297 (339) .++-.+.+. ++. .++-.-.|.+.-||.|.+.+.--..++ . ++ +++.+-++.+ .. T Consensus 216 D~~l~TsaK--~Ss-aNIDengi~~FkGf~i~e~P~~~~q~g---~------------------ia-~~s~dnig~aftG 270 (314) T protein:vir:98 216 DHPLTTSAK--SSS-ANIDQNGIVNFKGFAIQEIPESMLQSG---D------------------VA-YTYITNIGKAFTG 270 (314) T ss_pred ccccccccc--cce-eeeccCCcceecceEEEecchhhcCCC---c------------------EE-EEccccceeeccc Confidence 998776653 222 234444567889999988765432211 0 00 1111111111 11 Q ss_pred eeeeEEeeechhhhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 298 IPVTSKIFFDDLSKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 298 ~~~~~e~~~~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) +. .+++...++.-+-++.|---||--++.-...+.+++++- T Consensus 271 In-~aR~IesEdF~GValQgAGK~G~~I~edNk~Ai~k~t~t 311 (314) T protein:vir:98 271 IN-TSRIIESEDFDGVALQGAGKAGEFILDDNKKAVAKVTST 311 (314) T ss_pred ce-eeeeeecccccceeeecccccccccccccceeeEEEecC Confidence 11 233344455556667777778888888888888888877 No 184 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=95.00 E-value=0.003 Score=34.43 Aligned_cols=285 Identities=12% Similarity=0.066 Sum_probs=150.7 Q ss_pred Cc---cccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhc--cccccc-cc-----cccceEEEecccc Q lcl|NC_020078. 1 MS---IFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMA--GFVPVR-SV-----RGTSTISNRGISK 69 (339) Q Consensus 1 ~~---~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~--~~v~~r-~i-----~~G~tv~i~~iG~ 69 (339) |. ++|-..|- +|+|...|.+.-.+.+-|. +.+..+ ++ .+|+.+.+|..+. T Consensus 1 Ma~T~l~D~iipe-------------------~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~ 61 (349) T protein:vir:94 1 MAITTIGNIVTGN-------------------IPVLASYMTEDPVEKTAFFNSGILTPTPYAAEIARGPSNIANLPFWKA 61 (349) T ss_pred CCceEEeeeeccC-------------------hHHHHHHHHHhHHHhhhhhhccceeccHHHHHHHhcCCCEEEeeeeec Confidence 22 11111110 2467777766555544444 222222 22 4699999998876 Q ss_pred cee---eeccCCC---CCCCCCCCCccc-eEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020078. 70 AKL---QKIAPGT---TPPPSTEPHTSK-IFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMA 142 (339) Q Consensus 70 ~t~---~~~~~g~---~i~~~~~~~~~~-~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l 142 (339) ..- ..|...+ .+++.+ +.+.+ .-+++ ..-.++...|+=..-+--|+|..+.++.+.--.|. ||..+... T Consensus 62 l~g~~e~n~~~dt~~~~~t~~k-it~~~~~a~~~--~r~kaw~~~Dla~~lsG~dpm~~Ia~~va~yW~r~-~q~~Lia~ 137 (349) T protein:vir:94 62 IDTSIEPNYSNDVYQDIATPRA-IQTGEMMARVA--YLNEGFGQADLTVELTSQNPLQSVASRLDNFWQRQ-AQRRLIAT 137 (349) T ss_pred CCCCcccccCCCCccccccccc-ccccceeeeee--eeccccchhHHHHHhhCchHHHHHHHHHHHHHhhH-HHHHHHHH Confidence 431 1233222 233322 22222 22222 23345777888877777799999999998888877 45555556 Q ss_pred HhhcccccccccccccccCccccccccccCccccccHHHHHHHHHHHHHHHH-h-cCCCCCcCCeEEEECHHHHHHHhcc Q lcl|NC_020078. 143 AKAAIASDSPYGTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFL-E-KDVRPNEEDMILVLPPAAFTALMQA 220 (339) Q Consensus 143 ~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~-e-~dV~~p~~~R~~vv~P~~~~~Ll~~ 220 (339) +++....+.....+.. ..........+.+..++..+.+++..+...+. + ++. =-.+++-+..|..|.+. T Consensus 138 L~Gvf~~~~~~~~~~~----~~~~~~~d~~~~a~~~~~~~~~A~~~~Gdaa~Gd~~~~-----lt~i~mHS~v~~~L~~~ 208 (349) T protein:vir:94 138 ALGLYNDNVSATDAYH----EQNDMVVDVSATSGFDAGAFIDATQTMGDALMGNGGEV-----LGAIAMHSFVYAQARKA 208 (349) T ss_pred HHhhhccccccccccc----ccCceeEEecccCCCChhhHHHHHHHHHHHhccccccc-----eeEEEEchHHHHHHHhc Confidence 6655443322211111 01111111222333456666666655555431 1 111 23678999999998775 Q ss_pred cchhhhcccccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEee- Q lcl|NC_020078. 221 EHITNGEYVTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIP- 299 (339) Q Consensus 221 ~~~~n~d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~- 299 (339) . ++. |.... -+...|..++|..|+++..+|....++ . ..-++++|-+-|+++....+ T Consensus 209 ~-li~--~i~~s---~~~~~i~ty~G~~VivDD~~Pv~~~g~---------~-------~~yttylfg~GAi~~~~~~~~ 266 (349) T protein:vir:94 209 Q-LID--FIRDA---ENNTMFATYQGYRVIVDDSMTVVGQDT---------S-------RKFISIIFGQGAIGYGEGNPE 266 (349) T ss_pred c-hhh--hccCc---ccCcccceecCcEEEEeCCCccccCCC---------C-------ceEEEEEeecceEEeecCCCC Confidence 4 332 22211 124468899999999999998532111 1 12356888899999999863 Q ss_pred eeEEeeechhhh----HHH-----HHHHHHhCCccccccceE-------------EEEecCC Q lcl|NC_020078. 300 VTSKIFFDDLSK----LWF-----IDSWLAFGVTINRTEYAG-------------VIKLPAA 339 (339) Q Consensus 300 ~~~e~~~~~~~~----~d~-----i~g~~~~Ga~v~rPe~~v-------------~i~~~~a 339 (339) +.+|..|++... -|. ...+|.+|.+-..+.... +|...+- T Consensus 267 ~~~E~~rd~~~g~~~G~d~L~~R~~~~~hp~G~s~~~a~v~~~~~~~~~~sPt~aeLa~~~N 328 (349) T protein:vir:94 267 MPLEYEREASRANGGGVETLWTRKTWLLHPFGYSFTSAVITGNGTETIARSASWQDLANAAN 328 (349) T ss_pred cceeeecccccCCcceeEEEEEeeEEEeeeeeeeecccccCCCccccccCCCChHHhcCCcC Confidence 347777877532 133 334667777666421110 0110000 No 185 >protein:vir:3969 Length: 287 # NCBI annotation: major capsid protein # Family: family:all:3269 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663677;genbank:gi:21716114;genbank:GeneID:951200 Probab=94.35 E-value=0.0047 Score=33.36 Aligned_cols=263 Identities=11% Similarity=0.113 Sum_probs=141.6 Q ss_pred cchhHHHHHHHHHHHHHHHHHHhhhcccccc-----ccccccceEEEecccc--ceeeeccCCCCCCCCCCCCc------ Q lcl|NC_020078. 23 GDPLADVTEQFTGTVEGTIKRRSIMAGFVPV-----RSVRGTSTISNRGISK--AKLQKIAPGTTPPPSTEPHT------ 89 (339) Q Consensus 23 ~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~-----r~i~~G~tv~i~~iG~--~t~~~~~~g~~i~~~~~~~~------ 89 (339) -.+. .|-|+|.|-+.+-|++++.|++..-- +-++.-++.-=-...+ +-|+.|..+......+.... T Consensus 1 ~avr-~y~Kq~~glL~~vf~~qa~F~~~FGg~lQ~~DGV~~N~taf~vKtsD~pVVi~~Y~Td~Nv~FGtGTg~ssRFG~ 79 (287) T protein:vir:39 1 MAIK-YFTKQYAGMLPDLFAKKSAFLRAFGGVLQVKDGVTENDTFMELKVSDTDVVIQAYSTDANVGFGSGTGNTSRFGQ 79 (287) T ss_pred CCcc-cccHHHHHHHHHHHHHHHhhhhhcccceeeecCCcccceEEEEEecCcceEEecccCCCCcccccCCCccccccc Confidence 1112 47799999999999999999865432 2233333322122222 24456666555332211110 Q ss_pred cceEEEEee------hhhhhhhHHHHHHHhcCcchH---HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccc Q lcl|NC_020078. 90 SKIFLKIDT------VIIARNAEPMLDEFQTDFDYQ---GEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMP 160 (339) Q Consensus 90 ~~~~l~ID~------~~y~~~~vdd~D~~q~~~d~~---~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~ 160 (339) -+..+.+|+ .+.++. -||+.-.+-|+- .+-.+.++.|-++.+|..+-..|...|..+ T Consensus 80 rkEi~y~dt~V~Y~~~~~ihE---GiD~~TVNnd~~aaVAdRL~Lqa~A~t~~~n~~~Gk~ls~~A~~t----------- 145 (287) T protein:vir:39 80 RKEVKSVNKQVSYDAPLAINE---GIDDFTVNDIKDQVVAERLALHGVAWAQHVDKLLGKLLSDSASET----------- 145 (287) T ss_pred eeEEEEecccccceecccccc---ccccccccCChhHHHHHHHHhHHHHHHHHHHHHHHHHHHhhcchh----------- Confidence 011223332 222333 344444444443 344566788999999987754443332110 Q ss_pred CccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccceeecce Q lcl|NC_020078. 161 GHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLNTKY 240 (339) Q Consensus 161 g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~~G~ 240 (339) . +...+.+.+-+.+.+|..+..+++|... ....+.|+|+.|.+|+.++-.+.+. ++. .++-.-. T Consensus 146 -------~-----~~~~t~d~V~~LF~~a~~~yvNn~v~~~-~~~~AyV~aevYnaiiD~~l~TsaK--~Ss-aNiDen~ 209 (287) T protein:vir:39 146 -------L-----TVKLDEDSVTKLFSDAHKKFVNNNVSIA-VPWVAYVNADIYDLLIDSKLATTAK--NSS-ANVDEQT 209 (287) T ss_pred -------e-----eeeecccchHHHHHHHHHHhhccceeeE-EEEEEEEChhHHhHHhccccccccc--cce-eeeccCC Confidence 0 0012233444556666666666666422 2457789999999999998776653 222 1343445 Q ss_pred eEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEE-EEeeeeEEeeechhhhHHHHHHHH Q lcl|NC_020078. 241 MFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAG-STIPVTSKIFFDDLSKLWFIDSWL 319 (339) Q Consensus 241 v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~-~~~~~~~e~~~~~~~~~d~i~g~~ 319 (339) +.+.-||.+-+.+.--.. ++. + ..|.++-++.+ ..+. .+++...++.-+-++.|-- T Consensus 210 i~kFkGf~l~e~P~~~~q---~g~------------------~-a~fs~dnig~af~GI~-vaR~i~sEdF~GvalQgAg 266 (287) T protein:vir:39 210 LYKFKGFILSELPDEKFQ---LNE------------------G-AYFAADNVGVAGVGIQ-VTRAMDSEDFAGTALQAAA 266 (287) T ss_pred cceecceEEEecchHhhc---cCc------------------E-EEEccccceeecccce-eEEeeecccccceeeeccc Confidence 678899999886632211 100 0 12222222211 1122 3344455566677777777 Q ss_pred HhCCccccccceEEEEecCC Q lcl|NC_020078. 320 AFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 320 ~~Ga~v~rPe~~v~i~~~~a 339 (339) -||--++.-...+.++.+.- T Consensus 267 K~G~~i~e~Nk~Ai~k~t~~ 286 (287) T protein:vir:39 267 KYGKYLPEKNKKAILKATVT 286 (287) T ss_pred ccccccccccceEEEEEecC Confidence 78888887777777776666 No 186 >protein:vir:94528 Length: 286 # NCBI annotation: major head protein # Family: family:all:3269 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223889;genbank:gi:62327101;genbank:GeneID:5075544 Probab=94.33 E-value=0.0047 Score=33.33 Aligned_cols=266 Identities=14% Similarity=0.135 Sum_probs=133.9 Q ss_pred ccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhcccccc----ccccccceEEEecccc--ceeeeccCCCCC Q lcl|NC_020078. 8 TPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPV----RSVRGTSTISNRGISK--AKLQKIAPGTTP 81 (339) Q Consensus 8 ~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~----r~i~~G~tv~i~~iG~--~t~~~~~~g~~i 81 (339) .||.|-..| ++ .|-|+|.+-+.+-|++++.|++..-- +-+..-++.---...+ +-++.|..++.. T Consensus 1 m~t~N~n~a--------vr-~Y~Kqf~glL~~vf~~qa~F~~~fgglQalDGV~~N~tafsvKt~D~pVVig~Y~TdeNv 71 (286) T protein:vir:94 1 MATTNNDLP--------VR-VYSKEFLQLLSTVYQAQSVFTPTFGALQALDGVPNNATAFSVKTNDMAVVVGEYSTDANT 71 (286) T ss_pred CCCCccccc--------ee-ehhHHHHHHHHHHHhhHHHhhhhhcchhhhhCCCccceEEEEeecCcceEEecccCCCcc Confidence 333332221 22 58899999999999999999865432 2223333221112221 244556665554 Q ss_pred CCCCCCCc------cceEEEEeeh-hhhhhhH--HHHHHHhcCcchHHH---HHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_020078. 82 PPSTEPHT------SKIFLKIDTV-IIARNAE--PMLDEFQTDFDYQGE---VAREQGQEIANMYDETFFIMAAKAAIAS 149 (339) Q Consensus 82 ~~~~~~~~------~~~~l~ID~~-~y~~~~v--dd~D~~q~~~d~~~~---~~~~~g~aLA~~~D~~i~~~l~~aA~~~ 149 (339) ...+.... -+..+.+|+. .|..-+. .-+|+.-.+-|+-.. -.+.++.|-++.+|..+-..|..+|. T Consensus 72 ~FGtgTg~SsRFG~rkEi~y~dtdV~Y~~~~~iHEGiD~~TVNnd~~aaVAdRL~lQA~Akt~~~n~~~Gk~ls~~A~-- 149 (286) T protein:vir:94 72 AFGTGTSNSSRFGEMKEVIYADTDVPYTAGWAIHEGLDQMTVNNDLDAAVADRLNLQAQAKTRLFNVAMGEALATAGT-- 149 (286) T ss_pred ccccCCccccccCceeeEEeecccccccccchhhhccccccccCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh-- Confidence 32221110 1122333332 2221111 345555555554443 44567888888888766443332221 Q ss_pred cccccccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhccc Q lcl|NC_020078. 150 DSPYGTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYV 229 (339) Q Consensus 150 ~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~ 229 (339) +...-+.+-+.+..|.++....+|..| .-+.|.|+.|.+|+.++-.+.+. T Consensus 150 -------------------------~t~~~D~V~~LF~~as~~yvn~ev~~~---~~ayV~~evYnaiiD~~l~TsaK-- 199 (286) T protein:vir:94 150 -------------------------DLGAVDDVNALFESAVEKYTDLEVIAP---VRAYVTASVYNAIIDLANVTTAK-- 199 (286) T ss_pred -------------------------hhhhhhhHHHHHHHHHHHhhhhheeee---eEEEEchhHHHHHhccccccccc-- Confidence 000112233334445555555566444 23899999999999998776653 Q ss_pred ccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEE-EEeeeeEEeeech Q lcl|NC_020078. 230 TSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAG-STIPVTSKIFFDD 308 (339) Q Consensus 230 ~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~-~~~~~~~e~~~~~ 308 (339) ++. .++-.-.|.+.-||.|.+.+.-.... .. .+|.++-++.+ ..+. .++....+ T Consensus 200 ~Ss-aNiDengi~~FkGf~i~e~P~~~~~g--~~---------------------aifs~dnig~aftGIn-~aR~IesE 254 (286) T protein:vir:94 200 NSA-VNIDTNGMLSFRGIAITKVPTQYMGG--KA---------------------VIFAPDNVARVFTGIN-IARTIQAI 254 (286) T ss_pred cce-eeeccCCcceecceEEeecchhhccC--ce---------------------EEEccccceeeeccce-eeeeeecc Confidence 222 23444456788999998876422110 00 12222222211 1111 23334444 Q ss_pred hhhHHHHHHHHHhCCccccccceEEEEecC-C Q lcl|NC_020078. 309 LSKLWFIDSWLAFGVTINRTEYAGVIKLPA-A 339 (339) Q Consensus 309 ~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~-a 339 (339) +.-+-.+.|---||--++.-...+.++.+- | T Consensus 255 dF~GValQgAGK~G~~I~edNk~Ai~~~~~k~ 286 (286) T protein:vir:94 255 DFAGVELQGAGKYGTFILDDNKKAIFTATPKA 286 (286) T ss_pred ccCceeeeccccccccccccCceeEEEeecCC Confidence 555556666666777677666655554332 2 No 187 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=94.29 E-value=0.0048 Score=33.28 Aligned_cols=293 Identities=14% Similarity=0.052 Sum_probs=130.0 Q ss_pred Cccc---------cCc-ccCCC----cccCCccCcccchhHHHHHHHHHHHHHHHHH-Hhhhcccccccccc---ccceE Q lcl|NC_020078. 1 MSIF---------DGQ-TPSYD----VTRPNQRHGAGDPLADVTEQFTGTVEGTIKR-RSIMAGFVPVRSVR---GTSTI 62 (339) Q Consensus 1 ~~~~---------~~~-~~~~~----~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~-~sv~~~~v~~r~i~---~G~tv 62 (339) |++. .|. ..+.+ +.|. -.+.++|=-.+....-...|+..|+. ..-++.|.++++++ ..+.+ T Consensus 366 ~~L~elAr~~L~~rg~~~~~~~~~~~~~~a-~~htTSDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~ 444 (693) T protein:vir:95 366 MTLRELARASLVDRGIGVASLNAPQMVGLA-FTHTSSDFGLILLDVANKSVLAGWEEAEETFPLWTKSGILTDFKPARRV 444 (693) T ss_pred CcHHHHHHHHHHhcCCccCCCCHHHHHHHH-HhcCcchhHHHHHHHHHHHHHHHHHhhhhHHHHHhccCCCCccccccee Confidence 2111 111 11111 1110 01222232223335555667777775 56667777766554 34444 Q ss_pred EEeccccceeeeccCCCCCCCCCCCCccceEE---------EEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHH Q lcl|NC_020078. 63 SNRGISKAKLQKIAPGTTPPPSTEPHTSKIFL---------KIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANM 133 (339) Q Consensus 63 ~i~~iG~~t~~~~~~g~~i~~~~~~~~~~~~l---------~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~ 133 (339) ++-..| +......|+++..... .....++ .|...-..+ |||+.. ..+....|.+-++. T Consensus 445 ~lg~~~--~L~~V~E~gEyk~~t~-~e~~e~~~l~tyG~~~~iTRqaiIN---DDLga~-------~~ip~~~g~aA~~~ 511 (693) T protein:vir:95 445 GLGEFS--SLRQVREGAEYKYVTL-GERGEQIILATYGELFSITRQAIIN---DDLQML-------SDIPFKLGQAAKAT 511 (693) T ss_pred ecCCCC--ChhhcCCCCceeeeec-CCccceeehhhcCCeeeecHHhhhc---cchHHH-------HHHHHHHHHHHHHH Confidence 443333 4444444555443221 1111223 333222222 556554 46888899999999 Q ss_pred HHHHHHHHHHhhcccccccccccccccCccccccccccCccccccHHHHHHHHHHHHHHHH-----h-cCCCCCcCCeEE Q lcl|NC_020078. 134 YDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFL-----E-KDVRPNEEDMIL 207 (339) Q Consensus 134 ~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~-----e-~dV~~p~~~R~~ 207 (339) .++.++..|..-.. ....+.-+.++|+.- .+++..+.+-+.|-.+...+...-. + +-+ .-..+|+ T Consensus 512 ~~~~vy~~L~~Np~---m~DGk~LFhadH~Nl----~tga~sals~~sl~~a~~am~~qk~~~~~~~g~~L--~i~P~~l 582 (693) T protein:vir:95 512 IGDLVYAVLTGNPA---MSDGKTLFHADHSNL----LTGAASALSIDSLSKAKTQMATQKAQVEKGKGRTL--NIRPGFV 582 (693) T ss_pred HHHHHHHHHhcCcc---ccCCcceeecccccc----ccccccccChHHHHHHHHHHHHhhcchhccCCcee--ecccceE Confidence 99999977753211 223344444444321 1122233344444433333322211 0 011 1134799 Q ss_pred EECHHHHHHHhcccchhhhcccccccceeecceeEEEece-EEEEeccccccccccccccCCCccccccccccceEEEEE Q lcl|NC_020078. 208 VLPPAAFTALMQAEHITNGEYVTSAGETLNTKYMFAAFGV-PVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQM 286 (339) Q Consensus 208 vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~~G~v~~i~G~-~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~ 286 (339) +|||+......+ +++..+.-.. ....|.+--+.|+ +|+.+.+|.......-...+..+. . +..+ T Consensus 583 lvP~~le~~a~~---l~~s~~~~~a--~~~~~~~NP~~~~~~vi~~prL~~~s~~~Wyl~a~~~~-d------tie~--- 647 (693) T protein:vir:95 583 LTPVALEDKANQ---IINSESVPGA--DVNSGIVNPIRAFAQVIGEPRLDDASATAWYMAAKKGS-D------TIEV--- 647 (693) T ss_pred EecchHHHHHHH---Hhcccccccc--ccccccccchhccccccccceecCCCCCceEEecCCCC-C------eEEE--- Confidence 999988765544 4444332211 1234444445564 677777774321101011110000 0 0111 Q ss_pred eccceeEEEEEe-eeeEEeeechhhhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 287 FSPKALLAGSTI-PVTSKIFFDDLSKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 287 ~h~~A~~~~~~~-~~~~e~~~~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) ++...+ .+.+|.-..-..-+=.++-++=||+++++ .=+..+-++| T Consensus 648 ------~yL~G~~~P~ie~~~gf~~dG~~~kvr~D~G~~~iD--~Rg~~kn~GA 693 (693) T protein:vir:95 648 ------AYLDGVDTPYLEQQEGFTVDGVASKVRIDAGVAPLD--FRGLQKSNGA 693 (693) T ss_pred ------EEecCCCCCeEeecCCCCcceEEEEEEEeccCceee--ccccccCCCC Confidence 111111 11222221111112234457889999994 4566777888 No 188 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=93.88 E-value=0.0061 Score=32.73 Aligned_cols=289 Identities=13% Similarity=0.103 Sum_probs=122.8 Q ss_pred ccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecccc---ceeee-c--cCCCCC Q lcl|NC_020078. 8 TPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISK---AKLQK-I--APGTTP 81 (339) Q Consensus 8 ~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~---~t~~~-~--~~g~~i 81 (339) .|+.+|.-. .-.+.|.. ...|.+.|.+.+-+.....-.++. |++.+.++.-. ..... . ..+... T Consensus 1 mpaltLaea--~k~~~d~l-------~~~ViE~~~~~s~lL~~LpF~~ve-g~~~~ynR~~~~~~~~~~~v~~~~~~~g~ 70 (310) T protein:vir:97 1 MASVTLAES--AKLAQDEL-------VAGVIENIITVNRMFDVLPFDSIE-GNSLAYNRENVLGDVIMAGVGTTFSGAGA 70 (310) T ss_pred CcccchHHH--hhcCcchH-------HHHHHHHHhccchHHHhCCccccc-CCcceeeEeeccCCcccccccccccCCCc Confidence 554444433 33333333 233455555433333433334444 55777776632 22111 0 101111 Q ss_pred CCCCCCCccce--EEEEeehhhhhhhHHHH-HHHh-c-CcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Q lcl|NC_020078. 82 PPSTEPHTSKI--FLKIDTVIIARNAEPML-DEFQ-T-DFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTA 156 (339) Q Consensus 82 ~~~~~~~~~~~--~l~ID~~~y~~~~vdd~-D~~q-~-~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~ 156 (339) .... ...+++ .|.| .--.+.||.. .+.. . -.|.+.+-.+...++|++++...++ .+-...++=.... T Consensus 71 ~~~~-~t~~~~~~~L~i---~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lI----NGD~a~n~F~GL~ 142 (310) T protein:vir:97 71 GKAA-ATFTKVNSNLTT---IMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLI----NGNGAGNEFAGLI 142 (310) T ss_pred cccc-cccceeeeeeee---eeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhh----ccccCCCcccchh Confidence 1100 111111 1222 1122333321 1211 2 2456666678888899988876543 2221111110111 Q ss_pred ccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccc-hhhhcccccccce Q lcl|NC_020078. 157 AQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEH-ITNGEYVTSAGET 235 (339) Q Consensus 157 ~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~-~~n~d~~~~~~~~ 235 (339) .+.+ ....+...+.+...+++.| |.++++.-+ + ..+..+++..|+.+.++..--| ...+.+. ..... T Consensus 143 ~~~~---~~q~i~~~~~gg~~t~d~L-DeLl~~v~~---~----~g~p~~~l~~~~~~r~i~A~~R~~~~~g~~-~~~~~ 210 (310) T protein:vir:97 143 QLCA---SGQKATTGATGSAISFAIL-DELMDLVVD---K----DGQVDYLTMHARTLRSYKALLRALGGASIN-EVVEL 210 (310) T ss_pred hcCC---ccceeecCCCCCCCCHHHH-HHHHHHHhc---C----CCCCCEEEecHHHHHHHHHHHHHhcCCCCC-Ccccc Confidence 1111 1122222222233344322 322222111 1 1233599999986554443222 2222211 11122 Q ss_pred eecceeEEEeceEEEEeccccccccccccccCCCccccccccccce------EEEEEeccceeEEEEEeeeeEEeee--- Q lcl|NC_020078. 236 LNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKD------IVAQMFSPKALLAGSTIPVTSKIFF--- 306 (339) Q Consensus 236 l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~------~~~~~~h~~A~~~~~~~~~~~e~~~--- 306 (339) .....|-.+.|++|+.++-+|.+... ....+..+=|.+.+.. ++|+...... .+.++... T Consensus 211 ~~G~~v~~~~GiPi~~~d~ip~~~~~----~~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~~~~-------glsVr~~G~~~ 279 (310) T protein:vir:97 211 PSGAEVPAYSGTPIFRNDYIPTNQTK----GGTTGCTTIFAGTLDDGSRTHGIAGLTATQAA-------GIQVVDVGESE 279 (310) T ss_pred CCCCEEeeeCCeEEEEeCccCCCccc----cccCCceeEEEEeeCccccccceeccccCCcc-------ceeEEeCCccc Confidence 34557789999999999999864211 1223333445554443 2333211111 12333322 Q ss_pred chhhhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 307 DDLSKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 307 ~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) +..-+.+.|. +-+|+.++.|+++++|+==.= T Consensus 280 ~~~v~~~~V~--~Y~~~av~~~~A~a~L~~V~~ 310 (310) T protein:vir:97 280 DSDEHIWRVK--WYCGLALFSEKGLACADGITN 310 (310) T ss_pred CCcceeEEEE--EeeeEEEecccceeeeccccC Confidence 1222334442 238999999999988853222 No 189 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=92.56 E-value=0.011 Score=31.35 Aligned_cols=300 Identities=11% Similarity=0.066 Sum_probs=132.0 Q ss_pred CccccCcccCCC-cccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecccc-ceeeeccCC Q lcl|NC_020078. 1 MSIFDGQTPSYD-VTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISK-AKLQKIAPG 78 (339) Q Consensus 1 ~~~~~~~~~~~~-~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~-~t~~~~~~g 78 (339) ..-..-|||.-- .+..= .+.-.+.-......|.+.|.+.+-+.....-.++. |++.+.++.-. +++.-+.-+ T Consensus 13 ~~~~~~~~p~l~m~alTL-----aea~~l~~d~~~~~VIE~l~~~s~iL~~lpf~~ve-~~~~~~~r~~~lp~a~~r~~n 86 (330) T protein:vir:94 13 WRTLTHQFPELKMPTVTL-----AESAKLSQDHLVSGLIETIVEVNPLYEMMPFTEIE-GNALAYNRENVLGDVQFLAVG 86 (330) T ss_pred eeehhccccccchhhhhh-----hHHhhcCchhhHHHHHHhhhccchHHhhccccccc-CCcceeeeeecCCcceeeecc Confidence 222345677631 11110 00111222455677777887654444444434443 44566665543 233333334 Q ss_pred CCCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcC-----cchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Q lcl|NC_020078. 79 TTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTD-----FDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPY 153 (339) Q Consensus 79 ~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~-----~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~ 153 (339) +.++... +.+...++.+ +..---+-+||+.-+. .|.+.+-.+...++|++++-..++ .+....+. T Consensus 87 ~~~~~~~--~~Tf~q~t~~--l~~l~~~~~Vd~~iadl~g~~~d~~~~q~~~~ieal~~~~e~~li----nGDs~~~~-- 156 (330) T protein:vir:94 87 GTITAKN--PATFTKVTSE--LTTLIGDAEVNGLIQATRSDFMDQTSVQVASKAKSIGRQYQASMI----TGDGTGNS-- 156 (330) T ss_pred ccccccC--cceeeeeeec--hhhhhhhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhh----ccCCCCcc-- Confidence 4443221 1111122222 2222334466665532 467777778888888887775543 32111010 Q ss_pred cccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccc-hhhhcccccc Q lcl|NC_020078. 154 GTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEH-ITNGEYVTSA 232 (339) Q Consensus 154 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~-~~n~d~~~~~ 232 (339) -.+...-......+...+.+...+++.| |.+.++.-++ +.+.-+++++......+.+-.| ..... .... T Consensus 157 -F~GL~~~~~~~q~i~tg~~gg~~T~d~L-DeLl~~v~~~-------~g~~~~~l~n~a~~r~I~a~~R~~~~~~-v~~~ 226 (330) T protein:vir:94 157 -FQGMMGLVAASQTISAGANGGTLTFELL-DQLLDLVKDK-------DGQVDYLMSSFAMRRKYFSLLRALGGAA-IGEV 226 (330) T ss_pred -ccchhhcCCcccEEecCCCCCCCCHHHH-HHHHHHhcCC-------CCCCcEEEechhHHHHHHHHHHhccCCC-CCCc Confidence 0011111112223333333344455433 3222221111 2223488877776666655333 11111 1111 Q ss_pred cceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccc------eEEEEEeccceeEEEEEeeeeEEeee Q lcl|NC_020078. 233 GETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFK------DIVAQMFSPKALLAGSTIPVTSKIFF 306 (339) Q Consensus 233 ~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~------~~~~~~~h~~A~~~~~~~~~~~e~~~ 306 (339) ...+....|-.+.|++|+.++-+|.+.. . ....+..+=|.+.+. -++|+...-.. .+.++... T Consensus 227 ~~~~~G~~v~~~~GvPi~~~d~ip~~~~-~---~~~~~ttsIyav~~G~~~~~qgV~Gl~~~g~~-------glsVr~~G 295 (330) T protein:vir:94 227 MTLPSGRQIPTYRGVPWFVNDFIPSNMT-Q---GTATNATAIFAGTFDDGSNKYGIAGLTARGSA-------GLRVQNVG 295 (330) T ss_pred ccccCCCEEeeeCCeEEEecccccCCCC-c---ccCCCceeEEEEeecccccccceEeecCCCCC-------cceeeeCC Confidence 1223345778899999999998885421 0 111222233444432 23454322111 12222221 Q ss_pred --chh-hhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 307 --DDL-SKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 307 --~~~-~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) +++ -+.+.| .+-+|..++.|+++++|+==.= T Consensus 296 ~~~~k~v~~~~v--~~y~~~av~~~~a~~~L~~V~~ 329 (330) T protein:vir:94 296 AKENADETITRV--KMYCGFANFSQLGLAAIKGLIP 329 (330) T ss_pred CccccceeeEEE--EEeeeeEEechhheeeeccccC Confidence 111 122333 2347889999999988752211 No 190 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=91.85 E-value=0.014 Score=30.76 Aligned_cols=272 Identities=13% Similarity=0.114 Sum_probs=121.6 Q ss_pred Cccc---cCcccC------CCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccc-cc Q lcl|NC_020078. 1 MSIF---DGQTPS------YDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGIS-KA 70 (339) Q Consensus 1 ~~~~---~~~~~~------~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG-~~ 70 (339) -+++ .|---. +.-.++ ...++|...-.-..|-+.+.+-.+..-...++..+=.. .|.|..-+.+- ++ T Consensus 108 kal~~~~~Gd~~A~~~~e~~r~a~~--~~~Tgd~~~~i~~~~v~d~i~li~q~r~i~slf~tLP~-~g~T~eY~v~t~~~ 184 (410) T protein:vir:83 108 LDMWNSAQGNASAADRLEVYARAAD--HQKTGDLQGVIPDPIVGPVIDFIDSARPLVSTLGTLPL-NNATFYRPIVSQRP 184 (410) T ss_pred HHHhccCCchHHHHHHHHHHHHhhc--cCcccccccccchhHhhhHHHHHhhccchhhhhhhCCC-CCCeeEEeeecccc Confidence 0011 000000 000011 11112221111123444444433332222222221111 26677664442 34 Q ss_pred eeeecc-------CCCCCCCCCCCCccceEEEEeehh----hhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020078. 71 KLQKIA-------PGTTPPPSTEPHTSKIFLKIDTVI----IARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFF 139 (339) Q Consensus 71 t~~~~~-------~g~~i~~~~~~~~~~~~l~ID~~~----y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~ 139 (339) ++..++ .|..|+... +..+..+-.|++.= ..|-.|+ .++....+-.++.++.|-|+..-..+= T Consensus 185 tV~~q~~~~kqa~EGd~L~~gK-l~~~t~tA~ikTyGGyt~LSRQ~IE-----Rs~v~~L~~~lraL~~AYA~atea~vr 258 (410) T protein:vir:83 185 AVGLQGVAGGASDEKTELDSQK-MVIDRLTVNAKTLGGYVNVSRQAID-----FSSPSALDLVVNGLGQQYAIETEALVG 258 (410) T ss_pred cccccccccccccccccccccc-eeeeeccceeehhcCcccccceeee-----cCChhhHHHHHHHHHHHHHHHHHHHHH Confidence 544332 455565543 23344444554321 1111111 133334444445555555555444332 Q ss_pred HHHHhhcccccccccccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhc--CCCCCcCCeEEEECHHHHHHH Q lcl|NC_020078. 140 IMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEK--DVRPNEEDMILVLPPAAFTAL 217 (339) Q Consensus 140 ~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~--dV~~p~~~R~~vv~P~~~~~L 217 (339) ..| .+.+ + ...++..++++++...|.++....+.+ ++ .=+++.|+|+.+..+ T Consensus 259 a~L-~~t~------------t---------~~~a~~~~Tad~~~~~i~da~~~v~da~~~~----~~~~i~vS~DVl~~~ 312 (410) T protein:vir:83 259 AAL-ASTS------------T---------GAVGYGNATADNVASAIWQAAGAVYTAVKGM----GRLVIAIAPDVLGDF 312 (410) T ss_pred HHH-HHhh------------h---------hhhhhhhccHHHHHHHHHHHHHHHhhhhccc----eeeeEEechhhhhhc Confidence 222 1111 0 011233456778888888988888764 66 346889999997665 Q ss_pred hcccchhhhccccccc-c--eeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEE Q lcl|NC_020078. 218 MQAEHITNGEYVTSAG-E--TLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLA 294 (339) Q Consensus 218 l~~~~~~n~d~~~~~~-~--~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~ 294 (339) ..--+-.+.+.+...+ + .+..|.-|.++|++|..+..++.++ ++|+.+.||-. T Consensus 313 ~~~f~~~~~~~~dt~Gfg~~~lg~gi~G~~~~ipVvm~~~a~AgT------------------------A~f~~~~Ai~~ 368 (410) T protein:vir:83 313 GPLFAPVNPTNAHSTGFEAGRFGQGVMGSISGIPVVMSAALGSGD------------------------AYLFSTAAIEC 368 (410) T ss_pred cceeeccCCCCcccccccccccccchhhhhcccceEEecCCCcCe------------------------eeEeccceeee Confidence 5533333333222211 2 2335666889999999999887432 34556666643 Q ss_pred EEEee--eeE---EeeechhhhHHHHHHHHHhCCccccccceEEEEec Q lcl|NC_020078. 295 GSTIP--VTS---KIFFDDLSKLWFIDSWLAFGVTINRTEYAGVIKLP 337 (339) Q Consensus 295 ~~~~~--~~~---e~~~~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~ 337 (339) =++-- ++. .++...+.|+ |+++| .+..|++++=|.=+ T Consensus 369 ~eS~~gp~qL~d~~i~nLt~~yS----gY~a~--a~~~~~gliPv~g~ 410 (410) T protein:vir:83 369 FEQRVGTLQVVEPSVFGLQVAYA----GYFST--LVVNEDAIVPLVGS 410 (410) T ss_pred eecCCceeEeeCCchhhhhhhhe----eeeee--ccccccceeeeccC Confidence 33321 111 2223334444 66644 34556666655433 No 191 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=91.29 E-value=0.017 Score=30.35 Aligned_cols=282 Identities=11% Similarity=0.047 Sum_probs=130.4 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccccc-----eeeec Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISKA-----KLQKI 75 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~~-----t~~~~ 75 (339) ||+||=+- || |-.. -.++|.=.- -+..|...+ -..++.....-.|+-+..|..-.. ...++ T Consensus 1 m~lsD~~v--fN---~~~~-------~a~~e~~~q-~~~~fn~as-~gai~l~~~~~~Gd~~~~pf~~~l~g~~~~~~~~ 66 (325) T protein:vir:95 1 MALSDLAV--YS---EYAY-------SAFSETLRQ-QVDLFNTAT-GGAIMLQSAAHQGDFSDVAFFAKVTGGLVRRRNA 66 (325) T ss_pred Cchhhhhh--hh---hhhh-------hhhhhhhhh-hHhhhhhcc-cceeEeccccccCceeeccccccccccccccccC Confidence 88777542 22 1000 011111000 001111100 001122122224777777665432 33444 Q ss_pred cCCCCCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Q lcl|NC_020078. 76 APGTTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGT 155 (339) Q Consensus 76 ~~g~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~ 155 (339) ...+.+++. .+.+.+..-++=... ..+...|+-....-.|.+++++++.|..+++...+.++..+..+....-. T Consensus 67 ~~~~~vt~~-kitt~~~~av~~~r~-~g~~~~d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~~a~~---- 140 (325) T protein:vir:95 67 YGSGTVAEK-VLKHLVDTSVKVAAG-TPPVRLDPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLNVGLGSVYSALS---- 140 (325) T ss_pred CCCceeccc-eeccccceeeEEecc-cCcccccHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc---- Confidence 444445432 233332222220111 12223444444455778899999999999998887776666543321100 Q ss_pred cccccCcccccccccc----CccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhccccc Q lcl|NC_020078. 156 AAQMPGHSGGNVVTLA----GANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTS 231 (339) Q Consensus 156 ~~~~~g~~~~~~~~~~----~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~ 231 (339) ..+ ..+.... +.+...++ +.+.++.++|..+.- .=..+++....|..|.+. .+++...... T Consensus 141 --~~~----~~v~dis~~~~~~~~~~s~----~~l~~A~~klGD~~~----~l~~~~MHS~v~~~L~~~-~L~~~~~~~~ 205 (325) T protein:vir:95 141 --QVS----DVVYDATANTDAADKLPTW----NNLNNGQAKFGDQSS----QIAAWIMHSTPMHKLYGS-NLTNGERLFT 205 (325) T ss_pred --ccc----cceeeeecccCcccccccH----HHHHHHHHHhccccc----ceeEEEEchHHHHHHHHh-hccccccccc Confidence 000 0011111 11111222 466677777855432 123678999999999874 5655422111 Q ss_pred ccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechh-- Q lcl|NC_020078. 232 AGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDL-- 309 (339) Q Consensus 232 ~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~-- 309 (339) .+.. ..+...+|-+|+++-..|.... +..+.| .++.+.+-|+++.+..++....++... T Consensus 206 ~~g~---~~i~t~~G~~VIVdD~~p~~~~---------g~~~~y-------tty~lg~GAi~~~~~~~~~~~~~~~~~~~ 266 (325) T protein:vir:95 206 YGTV---NVVRDPFGKLLVMTDSPNLFAA---------GTPNVY-------HILGLVPGGVLIGQNNDFDANEETKNGDE 266 (325) T ss_pred cCCc---ccccccCCcEEEEeCCCCCCCc---------cCceeE-------EEEEEecCeEEecCCCCccccccccCccc Confidence 1111 1356789999999998774321 112233 356777888888887776555543221 Q ss_pred hhHHHHH-----HHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 310 SKLWFID-----SWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 310 ~~~d~i~-----g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) +....++ .+|.+|.+-- +..+.. -|+- T Consensus 267 ~~~~~~~~~~tf~lhp~G~sw~--~s~~g~-sPt~ 298 (325) T protein:vir:95 267 NIIRTYQAEWSYNIGVKGFAWD--KANGGK-SPTD 298 (325) T ss_pred ceeeeeeeeeeEEeecceeeee--cccccC-CcCh Confidence 1222122 2355565552 111111 1111 No 192 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=88.34 E-value=0.033 Score=28.71 Aligned_cols=292 Identities=13% Similarity=0.059 Sum_probs=126.2 Q ss_pred Cccc---------cCc-ccCCC----cccCCccCcccchhHHHHHHHHHHHHHHHHH-Hhhhcccccccccc---ccceE Q lcl|NC_020078. 1 MSIF---------DGQ-TPSYD----VTRPNQRHGAGDPLADVTEQFTGTVEGTIKR-RSIMAGFVPVRSVR---GTSTI 62 (339) Q Consensus 1 ~~~~---------~~~-~~~~~----~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~-~sv~~~~v~~r~i~---~G~tv 62 (339) |++. .|. ...++ +.|. -.+.++|--.+....-...++..|+. ..-++.|.++++++ ..+.| T Consensus 331 ~~L~elAr~~L~~~G~~~~~~~~~~~v~~A-~~hsTsDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~ 409 (652) T protein:vir:79 331 MTLREYARMSLTERGIGVSSYNPMQMVGAA-FTHSTSDFGNILLDVANKAILQGWEDAPETYEQWTRKGQLSDFKIAHRV 409 (652) T ss_pred ccHHHHHHHHHHhhccCCCCCCHHHHHHHH-hhcCcchHHHHHHHHHHHHHHHHHhhhHHHHHHHhccCCCcccccccee Confidence 2221 111 11111 1110 11223333333335555566667765 55667777777654 34555 Q ss_pred EEeccccceeeeccCCCCCCCCCCCCccceEEEEeehhhhhh-------hH-HHHHHHhcCcchHHHHHHHHHHHHHHHH Q lcl|NC_020078. 63 SNRGISKAKLQKIAPGTTPPPSTEPHTSKIFLKIDTVIIARN-------AE-PMLDEFQTDFDYQGEVAREQGQEIANMY 134 (339) Q Consensus 63 ~i~~iG~~t~~~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~-------~v-dd~D~~q~~~d~~~~~~~~~g~aLA~~~ 134 (339) ++-.. ++......|+++.... +...+.++.+. .|-+. .| ||++.. ..+.+..|.+-++.. T Consensus 410 ~lg~~--~~L~~V~E~gEyk~~t-~~e~~e~~~l~--tyG~~~~iTRqaiINDDL~a~-------~~ip~~~g~aA~~~~ 477 (652) T protein:vir:79 410 GMGGF--SALRQVREGAEYKYVT-TGDKQATIALA--TYGELFSITRQAIINDDLNML-------TDVPMKLGRAAKSTI 477 (652) T ss_pred ecCCC--CCccccCCCCccceee-ecCccceeeee--cccCeeeeehheeeccchhHH-------HHHHHHHHHHHHHHH Confidence 44333 3444555555555432 22223344443 22221 11 556655 468889999999999 Q ss_pred HHHHHHHHHhhccccccccccccc-ccCccccccccccCccccccHHHHHHHHHHHH-HHHHhcCCCCCcCCeEEEECHH Q lcl|NC_020078. 135 DETFFIMAAKAAIASDSPYGTAAQ-MPGHSGGNVVTLAGANDYKDPAKLYAAIASLV-EKFLEKDVRPNEEDMILVLPPA 212 (339) Q Consensus 135 D~~i~~~l~~aA~~~~~~~~~~~~-~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~-~~L~e~dV~~p~~~R~~vv~P~ 212 (339) ++.+...|..-.... ...+.-+ .+.|+.. . ++.+.+-+.|-.+...++ ++-.+..+ .-..||++|||+ T Consensus 478 ~~~vy~~l~~Np~~~--~DGk~LF~hA~H~Nl-----~-~~aa~~~~~l~~ar~aM~~Qk~g~~~l--~i~P~~llvp~~ 547 (652) T protein:vir:79 478 ADLVYAILTSNPKIS--TDNVSLFDKAKHANV-----L-ESAAMDVASLDKARQLMRVQKEGERHL--NIRPAFVLVPTA 547 (652) T ss_pred HHHHHHHHhcCcccc--cCCceeecccccccc-----c-ccccCCHHHHHHHHHHHHHhccCCccc--cccccEEEecch Confidence 999987775321110 0122222 2222221 1 112333333322222221 11112112 223589999999 Q ss_pred HHHHHhcccchhhhcccccccceeecceeEEEece-EEEEeccccccccccccccCCCccccccccccceEEEEEeccce Q lcl|NC_020078. 213 AFTALMQAEHITNGEYVTSAGETLNTKYMFAAFGV-PVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKA 291 (339) Q Consensus 213 ~~~~Ll~~~~~~n~d~~~~~~~~l~~G~v~~i~G~-~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A 291 (339) ......+ +++..... +.....|.+--+.|+ +|+++.+|....+..- -++...... +..+| T Consensus 548 le~~a~~---ll~s~~v~--~a~~~~~~~Np~~~~~~~i~eprL~~~s~~~w-ylaa~~~~d------tiev~------- 608 (652) T protein:vir:79 548 MESVANQ---VIRSSSVK--GADINAGIINPVKDFATVIAEPRLDDNSQTTF-YLAASKGSD------TIEVA------- 608 (652) T ss_pred hHHHHHH---HhccCCCc--ccccccccccccccccccccccccCCCCcccE-EEecCCCCC------eEEEE------- Confidence 7654433 33322111 111223444445554 7777788754322111 011000000 01111 Q ss_pred eEEEEE-eeeeEEeeechhhhHHHHHHHHHhCCccccccceEEEEecC Q lcl|NC_020078. 292 LLAGST-IPVTSKIFFDDLSKLWFIDSWLAFGVTINRTEYAGVIKLPA 338 (339) Q Consensus 292 ~~~~~~-~~~~~e~~~~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~ 338 (339) +... ..+.+|.-..-.-.+-.++-++=||+++++.-++ .+.++ T Consensus 609 --yL~G~~~P~ie~~~gf~~dG~~~kvrlD~G~~~iD~RG~--~k~t~ 652 (652) T protein:vir:79 609 --YLNGVDTPYIDQMEGFSVDGVTTKVRIDAGVAPVDHRGL--VKCTA 652 (652) T ss_pred --EecCCCCCeeeecCCCCcceEEEEEEEeccCceeeccce--eeecC Confidence 1111 1122332211122233345578899999966554 45566 No 193 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=88.08 E-value=0.035 Score=28.60 Aligned_cols=276 Identities=13% Similarity=0.064 Sum_probs=105.6 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHH----H----------Hhhhccccccccccc-------- Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIK----R----------RSIMAGFVPVRSVRG-------- 58 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~----~----------~sv~~~~v~~r~i~~-------- 58 (339) ..-..+.-+.-.+... . ..+..+ +-+.+.+.-...|- . .++.+.+.+...+.. T Consensus 168 k~~~~~~~~~~~~~~~---~-~~e~r~-~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 242 (480) T protein:vir:40 168 KKEREASIPSEKPEDA---E-RKFMRE-LGSKMAEMPEQGFLREFANGADLNVVNSLGSITSKYARKSGIYDGAMKARFQ 242 (480) T ss_pred hhhhhhhccccchhhh---h-hHHHHH-HHHHhccchhhhhhhhhhhhccccccccccccccchhhheeechhhhhhhhh Confidence 0001111111111000 0 000000 00000000000000 0 011111111111110 Q ss_pred cceEEEeccccc---eee-eccCCCCCCCCCCCCccceEEEEeeh--hhh---hhhHHHHHHHhcCcchHHHHHHHHHHH Q lcl|NC_020078. 59 TSTISNRGISKA---KLQ-KIAPGTTPPPSTEPHTSKIFLKIDTV--IIA---RNAEPMLDEFQTDFDYQGEVAREQGQE 129 (339) Q Consensus 59 G~tv~i~~iG~~---t~~-~~~~g~~i~~~~~~~~~~~~l~ID~~--~y~---~~~vdd~D~~q~~~d~~~~~~~~~g~a 129 (339) ..++.. .|.. .+. ....+...+... .+... ..++. ++. ......+|++ .++.+.+..+.++. T Consensus 243 ~~~~~~--~g~~~~~~~~e~~~~~~~~~~~~---~~~~~-~~~~~v~~l~~~~k~t~~lLDDa---~~l~~~i~~~l~~~ 313 (480) T protein:vir:40 243 GLTLAE--DGVDDTFISGTFKAGTDKNKSQT---ATKRS-LRPQMAEAYLQMDKATVRGVNDS---GALSEYVMSEMVNR 313 (480) T ss_pred cceeee--ccccceeeeeeeecccccccccc---cccch-hhHHHHHHHHHhHHHHHHHhhhh---HHHHHHHHHHHHHH Confidence 011111 1111 111 111121111110 01111 11111 111 1222333432 35888899999999 Q ss_pred HHHHHHHHHHHHHHhhcccccccccccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCe-EEE Q lcl|NC_020078. 130 IANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDM-ILV 208 (339) Q Consensus 130 LA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R-~~v 208 (339) |+++..+.++ .+. ..+.. +..+... ...+.+.....+.+++.|+.+..+-..+ +. ++| T Consensus 314 ~~~~ee~a~l----~G~-------g~g~~--~~~g~~~-~~~~~~~~~~~~d~id~L~~al~~~y~~-------~a~~~v 372 (480) T protein:vir:40 314 VIQKVEYNMI----LGS-------VDGSN--GFYGLKT-ATDGWTKQIEYTDLFEGITDAVAECSIS-------DAITIV 372 (480) T ss_pred HHHHHHHHhh----ccC-------CCCcc--cccccee-ecccccccchhHHHHHHHHHhhhHHhhC-------CCCEEE Confidence 9999998764 110 00000 0011111 1111122233444444444332222111 22 568 Q ss_pred ECHHHHHHHhcccchhhhcccccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEec Q lcl|NC_020078. 209 LPPAAFTALMQAEHITNGEYVTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFS 288 (339) Q Consensus 209 v~P~~~~~Ll~~~~~~n~d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h 288 (339) ++|..|..|.+-.. .+..|.-. ..+..|...+++|.+|+++...... +.++..++ +.| .+++- T Consensus 373 mn~~t~~~I~klKD-~~G~Yi~q--~~~~~~~~~~llG~pvv~~~~~~~~----~~~~~~~~--~~~--------~~~~d 435 (480) T protein:vir:40 373 MSPQTFAELRKAKG-TDGHSRFN--ELATKEQIAQSFGAVNLETRVWMPK----DEVAVYNH--DEY--------VLIGD 435 (480) T ss_pred ECHHHHHHHHHhhc-CCCCeecc--CcccccCcceecccceeeeeccccC----CcceeeeC--Ccc--------EEEEe Confidence 99999998855322 11234322 2355788889999999876543211 11111111 111 11111 Q ss_pred cceeEEEEEeeeeEEeeech--hhhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 289 PKALLAGSTIPVTSKIFFDD--LSKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 289 ~~A~~~~~~~~~~~e~~~~~--~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) + .+|.+++- +.-...+......|..+.+|+++..+++.+. T Consensus 436 ~-----------~~~~~~~~~~~~~~~~~~~e~~v~g~~~~~~~~~~~~~~~~ 477 (480) T protein:vir:40 436 L-----------NVENYNDFDLRYNVEQWLSETLVGGSIRGKNRSAYLKKKGS 477 (480) T ss_pred c-----------ccceecccccccchhhhhhhhhhceeeEccccEEEEEeccC Confidence 1 23333322 2334455556678889999999999999999 No 194 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=88.06 E-value=0.035 Score=28.59 Aligned_cols=277 Identities=10% Similarity=0.092 Sum_probs=102.8 Q ss_pred CccccCc-ccC------CCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEecc-cccee Q lcl|NC_020078. 1 MSIFDGQ-TPS------YDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGI-SKAKL 72 (339) Q Consensus 1 ~~~~~~~-~~~------~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~i-G~~t~ 72 (339) +..+.+. ..+ .++...+.+ +-. -...+...+...+...+.+.+.+++..+. ...++.- ....+ T Consensus 222 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~-----~p~~~~~~i~~~~~~~~~i~~~~~~~~i~---~~~~~~~~~~~~a 292 (517) T protein:vir:97 222 VAYMSASLTKDPKAAWTAELKERGIS-GMP-----APAGILKRIQDAVNDEGSLLPFIRHENLP---TLVVGGDNALTQG 292 (517) T ss_pred HHHHHhcccccccceeeeeccccccc-ccc-----cchHHHHHHHHhhhhhccceeeeeecccc---ceeeeccccccee Confidence 0000000 000 011111110 000 11333445555555555555555544432 2333222 12234 Q ss_pred eeccCCCCCCCCCCCCccceEEEEeehhhhhh-hHH--HHHHHhcCcc----hHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_020078. 73 QKIAPGTTPPPSTEPHTSKIFLKIDTVIIARN-AEP--MLDEFQTDFD----YQGEVAREQGQEIANMYDETFFIMAAKA 145 (339) Q Consensus 73 ~~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~-~vd--d~D~~q~~~d----~~~~~~~~~g~aLA~~~D~~i~~~l~~a 145 (339) ..+..|+..+. .++...++++.+ .++++. .+. .|++ +.+| +.+-+..+..++|+++.++.++ .+ T Consensus 293 ~~~~eG~~kp~-s~~tf~~~~~~~--~~ia~~~~~S~qll~D--s~~dd~~~l~s~i~~~l~~~l~~~ee~a~l----~G 363 (517) T protein:vir:97 293 TGHTTGTDKTE-SNITLQTRVLTP--QYVYKYIKLPKIVMNS--NATDIAGAILTYVMNRLPDMVIMAVNRAII----MG 363 (517) T ss_pred eeeecCCcccc-cccceeeEEeeH--hhhhhhhhhhHHHHHH--hhhccHHHHHHHHHHHHHHHHHHHHHHHHh----cc Confidence 45555555443 234455555544 233222 221 2222 2334 7788999999999999998774 11 Q ss_pred cccccccccccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhh Q lcl|NC_020078. 146 AIASDSPYGTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITN 225 (339) Q Consensus 146 A~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n 225 (339) .. ++.+..|.- .......+........+.+.+..+...+.+. + .. .+|++|..|..|.+-.. .+ T Consensus 364 dG-------tg~~~~gi~--~~a~~~~~~~~~~~~~~~d~i~~l~~a~~~a----~-~a-~~vmn~~t~~~I~klKD-~~ 427 (517) T protein:vir:97 364 GV-------TGVSETQIY--PVVGDAWATNVTGTTNIQELLEKLSVATPKA----A-DS-TLVIHRNDLAAIRFLKD-KN 427 (517) T ss_pred cC-------CCccccccc--ccccccccccccccchHHHHHHHHHHHhhhc----c-CC-EEEECHHHHHHHHHhhc-CC Confidence 10 111111110 0000000111111122223222222223221 1 12 34799999999865322 12 Q ss_pred hcccccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEee Q lcl|NC_020078. 226 GEYVTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIF 305 (339) Q Consensus 226 ~d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~ 305 (339) ..|--.. .+.++.+..++|+.-+.+. ++.. .. + ..+.+.| .++-...+.+ .+.+ T Consensus 428 G~Yl~~~--~~~~~~~~~l~G~~~~~~~-~~~~-----~~-~-~~~~~~y---------------~i~~~~g~~~-~~~f 481 (517) T protein:vir:97 428 GNYVFPV--GVSNQTIATHFGFNRLVQS-VAVD-----EK-T-AVSLSGY---------------VTNGSRGMEF-EQGT 481 (517) T ss_pred CCeeccC--cCCcccccccCCccccccc-cccC-----ce-e-Eeecccc---------------EEEeecceee-eeee Confidence 2232211 1234455556664222211 1100 00 0 0001111 1111110000 0111 Q ss_pred -echhhhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 306 -FDDLSKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 306 -~~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) |..+ +-+ +..-+..|..|+.||.++-.+.+.. T Consensus 482 d~~~n-~~~-f~~~~~~~g~i~~~~r~a~~~~~p~ 514 (517) T protein:vir:97 482 ILVEN-NKE-YLFEMPISGSLEYKGTTAYGTYTPP 514 (517) T ss_pred ecccC-cee-EeeeeeeccccccccceEEEEEcCC Confidence 1111 111 2222455667888888665444443 No 195 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=85.93 E-value=0.049 Score=27.75 Aligned_cols=271 Identities=14% Similarity=0.042 Sum_probs=105.8 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHH-HhhhccccccccccccceEEEecccc-cee----ee Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKR-RSIMAGFVPVRSVRGTSTISNRGISK-AKL----QK 74 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~-~sv~~~~v~~r~i~~G~tv~i~~iG~-~t~----~~ 74 (339) |-| ......+|+ +-+...++++|+. ..-+..+.+ +.-+-.++-+-..+|. +.+ .+ T Consensus 1 m~i-----------------t~~~l~~l~-~~~~~~~~~~y~~a~~~~~~~a~-~~~sdf~~~~~~~lg~~p~l~e~~Ge 61 (302) T protein:vir:10 1 MLI-----------------NKQSLNAAF-VAIKTIFNNAFAAAPTTWQKIAM-EVPSNTSSNDYKWLSTFPKMRRWIGA 61 (302) T ss_pred Ccc-----------------cHHHHHHHH-HHHHHHHHHHHHhhhhhhhceee-ecCCCcceeeceecCCCCCccccccc Confidence 322 111223333 3556666666664 333333332 2112233333344443 122 33 Q ss_pred ccCCCCCCCCC---CCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Q lcl|NC_020078. 75 IAPGTTPPPST---EPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDS 151 (339) Q Consensus 75 ~~~g~~i~~~~---~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~ 151 (339) |+-+ .+.... .+.+=+.++.|+ =.+|..- ++.+-..+.+++|++-++..|+.++..|..+. .... T Consensus 62 ~~~~-~l~~~~~~i~~~~~g~~v~i~--------R~~i~nD--dlg~~~~~~~~~G~aaa~~~~~lv~~~L~~g~-~~~~ 129 (302) T protein:vir:10 62 KVVK-NLKAYKYVVENEDFEATVEVD--------RNDIEDD--QIGIYSPQAKMAGYSAAQLPDELVYEAVNGAF-TKPC 129 (302) T ss_pred eeec-cccccceeEEeecccceeccc--------HHhhccc--ccchhHHHHHHHHHHHHhhHHHHHHHHHhccC-CCcc Confidence 3322 122110 011112233333 1222221 24667789999999999999999998775421 2222 Q ss_pred cccccccccCcccccccc-------ccCccccccHHHHHHHHHHHHHHH---HhcCCCCCcCCeEEEECHHHHHH---Hh Q lcl|NC_020078. 152 PYGTAAQMPGHSGGNVVT-------LAGANDYKDPAKLYAAIASLVEKF---LEKDVRPNEEDMILVLPPAAFTA---LM 218 (339) Q Consensus 152 ~~~~~~~~~g~~~~~~~~-------~~~~~~~~~~~~l~~ai~~a~~~L---~e~dV~~p~~~R~~vv~P~~~~~---Ll 218 (339) ...++...++|..+...- ...++...+++.+-+ .+.+..++ +.+.+ --..+++||+|..... |+ T Consensus 130 ~DG~~fF~~dH~~g~~~~~N~g~~~~~~~~~~l~~~~~~a-a~~am~~~k~~~G~~L--~i~P~~LiVp~~le~~A~~ll 206 (302) T protein:vir:10 130 FDGQYFIDTDHPVGDASVSNKGTAPLSNASQAAAKAGYGA-ARTAMKKFKDEEGRSL--NVSPNVLLVGPALEDVAKMLL 206 (302) T ss_pred cCCcceecccccccccccccccchhhhhcccccchHHHHH-HHHHHHHHhhhccccc--ccCCCEEEecchhHHHHHHHh Confidence 344555555554332210 111122233333322 22332222 11222 2234789999986654 34 Q ss_pred cccchhhhcccccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEE--- Q lcl|NC_020078. 219 QAEHITNGEYVTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAG--- 295 (339) Q Consensus 219 ~~~~~~n~d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~--- 295 (339) .+.... .+.. ..++ |. +++++++.+.... . +-|+..+..+=.. T Consensus 207 ~~~~~~----~g~~-Np~~-g~------~~~vv~p~L~s~~---a-------------------WyL~a~~~~i~~~~l~ 252 (302) T protein:vir:10 207 TNPKLA----DNTP-NPYV-GT------AELVVDGRIESDT---A-------------------WFLLDTTKPVKPFIFQ 252 (302) T ss_pred hccccC----CCCc-ceec-cc------eEEEEeeccCCCC---c-------------------eEEEecCCccceEEEc Confidence 444332 1222 1222 22 5677777663110 0 1111111111000 Q ss_pred EEeeeeEEeeechhhhHHHHHHHHHhCC------ccccccceEEEEecCC Q lcl|NC_020078. 296 STIPVTSKIFFDDLSKLWFIDSWLAFGV------TINRTEYAGVIKLPAA 339 (339) Q Consensus 296 ~~~~~~~e~~~~~~~~~d~i~g~~~~Ga------~v~rPe~~v~i~~~~a 339 (339) ..+.+..|...+.+.-+-+++-.+.||+ +..-|..+-.=+-++| T Consensus 253 g~~~P~~~~~~~~~~dgv~~k~~~d~Gvd~R~~~G~~~wq~a~~s~g~~~ 302 (302) T protein:vir:10 253 PRKQPEFVSQVNLDSDDVFNLRKLKFGAEARAAAGYGFWQLAYGSTGTGA 302 (302) T ss_pred CccccEEEeccCCCCCceEEEEEEEEeeeeeeecchhhhhhhhccCccCC Confidence 0111122221111111222233333332 3333333333333333 No 196 >protein:vir:10324 Length: 320 # NCBI annotation: ORF26 # Family: family:all:570 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758919;genbank:gi:27311193;genbank:GeneID:956155 Probab=82.96 E-value=0.046 Score=27.93 Aligned_cols=292 Identities=10% Similarity=0.006 Sum_probs=88.0 Q ss_pred CCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccccceeeeccCCCCCCCCCCCCcc Q lcl|NC_020078. 11 YDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISKAKLQKIAPGTTPPPSTEPHTS 90 (339) Q Consensus 11 ~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~~t~~~~~~g~~i~~~~~~~~~ 90 (339) .|+ .|+.- +....+|- ....+....+... .+.|.---+|.+-+ |.+-......... T Consensus 1 i~~-~P~~~---g~~~glff-----------~~~~v~T~~V~ie-~~~~~l~lip~v~r--------g~~g~~~~~~~~~ 56 (320) T protein:vir:10 1 MNL-LPVNY---GDSRALFA-----------REKKVRTRTILVE-EKNGVLTLIQSREP--------GSTENVAKRGKRK 56 (320) T ss_pred CCc-CCchh---hhhhhhcc-----------CCCCcccceEEEE-EecCceeeeeccCC--------CCCceeecCCcce Confidence 111 12111 11111111 1111111111111 11222222222211 1110000000000 Q ss_pred ceEEEEeehhhhh---hhHHHH--------HHHhcCcchHHHHHHHHHHHHHHHHHH---HHHHHHHhhccccccccccc Q lcl|NC_020078. 91 KIFLKIDTVIIAR---NAEPML--------DEFQTDFDYQGEVAREQGQEIANMYDE---TFFIMAAKAAIASDSPYGTA 156 (339) Q Consensus 91 ~~~l~ID~~~y~~---~~vdd~--------D~~q~~~d~~~~~~~~~g~aLA~~~D~---~i~~~l~~aA~~~~~~~~~~ 156 (339) .+.+.+ ..+-. +.-+|+ ++.++--+++.+...+ |.+.+|. +...+++++.. .++-.... T Consensus 57 ~~~f~~--p~~~~~d~i~a~eiq~~Ra~G~~~~~~~~~~v~~~l~~----lr~~~~~T~E~m~~~AL~G~i-ldadGtv~ 129 (320) T protein:vir:10 57 VRSFVI--PHLPLEDVILPDEYEGLRGFGTTALAAKSELVKERXET----MKSSHDITHEHLRMGAKKGQI-LDADGTVL 129 (320) T ss_pred EEEEec--ceeccCCccCHHHHcCcccCCCchHHHHHHHHHHHHHH----HHHHHHHHHHHHHHhhhcCeE-EcCCCcEE Confidence 001100 00000 000111 1111111222222222 3333332 22222222221 11100000 Q ss_pred -c--cccCcccccc-ccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhc-cccc Q lcl|NC_020078. 157 -A--QMPGHSGGNV-VTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGE-YVTS 231 (339) Q Consensus 157 -~--~~~g~~~~~~-~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d-~~~~ 231 (339) . ..-|.....+ ..+..++.. -.+++.+.++.+.+.|. .. +..+-.++++|++|.+|+.|+.+...- +... T Consensus 130 ~d~y~~fGi~~~~i~~~l~~a~~d-v~~~~~~~~~~i~~~l~--g~--~~t~v~al~g~~f~~al~~h~~Vke~y~~~~~ 204 (320) T protein:vir:10 130 YDLYAEFGITKKTIYFGLDNKDAN-VAESCRQVLRHVEDNLR--GD--VMKDVSVDVSEEFFDKFIKHASVKEVFLNHEA 204 (320) T ss_pred EechhhhCCccceeEEecCCCCcc-HHHHHHHHHHHHHHHhc--cC--CCCceEEEEChHHHHHHhcCHHHHHHHHhhhh Confidence 0 0012111111 111111111 12344555555555553 22 455667899999999999999876552 1111 Q ss_pred ccceeecc--eeEEEeceEEEEeccccccccccccccCCCccccccccc----cceEEEEEeccceeEEEEEeeeeEEee Q lcl|NC_020078. 232 AGETLNTK--YMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGD----FKDIVAQMFSPKALLAGSTIPVTSKIF 305 (339) Q Consensus 232 ~~~~l~~G--~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~----~~~~~~~~~h~~A~~~~~~~~~~~e~~ 305 (339) +...++.. .-..+.|+.+++=+-......++....-.++.+..+... |....|.+-..+++.+- ..++=...| T Consensus 205 ~~~~l~~~~~~~f~~gGi~~~~Y~g~~~d~~g~~~~~I~~~~~~~~p~g~~~~f~~~~apad~~e~vnt~-g~p~y~k~~ 283 (320) T protein:vir:10 205 AVNRLGGDTRKGFKFGGLIFNENRARHVDEEGKETRFIKAGKGHAFPTGTTNTFFTALAPADFNETAGTL-GKRYYAKME 283 (320) T ss_pred hhhhccccccceEEecCEEEEEcccEEEcCCCCeeEeecCCeeEEEEecCchhheeeecccCcHhhcCCc-ccccccccc Confidence 11122221 223567877776221100001111111111111111110 01000100000111110 111112223 Q ss_pred echhhhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 306 FDDLSKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 306 ~~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .+++.-+..+.+-..-=.-+-||++++-++..++ T Consensus 284 ~~~~~~g~~l~~qS~PLpi~~rP~~lv~~~~~a~ 317 (320) T protein:vir:10 284 PRRMGRGFDLHSQSNVLPMCCRPGVLVELDAAAQ 317 (320) T ss_pred cccCCCeEEEEeeecccccccCcceEEEEEecCC Confidence 3332222222222222245789999999998888 No 197 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=79.38 E-value=0.11 Score=25.95 Aligned_cols=293 Identities=11% Similarity=0.048 Sum_probs=130.5 Q ss_pred Ccc-ccCcccCC--CcccCCccCcccchhHHHH-HHHHHHHHHHHHH----Hhhhcccccccc-cccc-ceEEEec---c Q lcl|NC_020078. 1 MSI-FDGQTPSY--DVTRPNQRHGAGDPLADVT-EQFTGTVEGTIKR----RSIMAGFVPVRS-VRGT-STISNRG---I 67 (339) Q Consensus 1 ~~~-~~~~~~~~--~~~r~~~~~~~~~~~a~~i-e~~~g~v~~~f~~----~sv~~~~v~~r~-i~~G-~tv~i~~---i 67 (339) |.+ ||-..... .+.+. .....|....|+ +++. +|+....+ ..+.+.++..++ +-.+ .++.+.. . T Consensus 1 ~~~~~~~~~~~~~~~~~~~--~~~~~d~~~~fl~~ql~-~id~~v~e~~~~~~~~~~~i~v~~~~~~~~et~~~~~~e~~ 77 (314) T protein:vir:10 1 MAIKFDAEQAKITTHLEQM--GVEKADAAGIWAVSQLT-AALNRAYEKEYAENSVVNIFPVTNEIPGHAKYFEYPEFDGV 77 (314) T ss_pred CccchHHHHHHHHHHHHhh--cccchhhhHHHHHHHHH-HHHHHHhhhhccccccceeeccccCCCCceeEEEeeeeccc Confidence 322 22111111 11111 122223322455 5544 45444432 355555555553 2112 3555433 3 Q ss_pred ccce-eeeccCCCCCCCCCCCCccceEEEEe-ehhhhhhhHHHHHHHhc-CcchHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_020078. 68 SKAK-LQKIAPGTTPPPSTEPHTSKIFLKID-TVIIARNAEPMLDEFQT-DFDYQGEVAREQGQEIANMYDETFFIMAAK 144 (339) Q Consensus 68 G~~t-~~~~~~g~~i~~~~~~~~~~~~l~ID-~~~y~~~~vdd~D~~q~-~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~ 144 (339) |..+ +.+|. .+++. .+.+-.+....|- =..-+..-+.++..++. ..++-..-...+..++++..|+.+|. T Consensus 78 G~a~~~~d~~--~dip~-vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~---- 150 (314) T protein:vir:10 78 GIAQIIADYS--DDLPL-VDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDKLVWS---- 150 (314) T ss_pred cceeeeCCcc--cccce-eecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEe---- Confidence 4443 23332 22322 1122233333321 11223444677777754 68888888889999999999987651 Q ss_pred hcccccccccccccccCccccccccccCc-cccccHHHHHHHHHHHHHHHHhc--CCCCCcCCeEEEECHHHHHHHhccc Q lcl|NC_020078. 145 AAIASDSPYGTAAQMPGHSGGNVVTLAGA-NDYKDPAKLYAAIASLVEKFLEK--DVRPNEEDMILVLPPAAFTALMQAE 221 (339) Q Consensus 145 aA~~~~~~~~~~~~~~g~~~~~~~~~~~~-~~~~~~~~l~~ai~~a~~~L~e~--dV~~p~~~R~~vv~P~~~~~Ll~~~ 221 (339) -++.. ...|.-....++...+ ++=.+++.+++-|..+..+|.++ .+..|. .++++|+.|.+|.. T Consensus 151 ----G~~~~----g~~GLlN~p~v~~~~~~~~WaT~~ei~~Di~~~~~~l~~~s~g~~~p~---~l~Lpp~~~~~L~~-- 217 (314) T protein:vir:10 151 ----GSAPH----GIVSVFDQPNINNVVATPNWSVPQNAIDDVTAMIDAVESSTQGLHHVT---DILLPASARRVMQG-- 217 (314) T ss_pred ----ecccc----cceeEeecCCCccccCCCCcccHHHHHHHHHHHHHHHHHhcCccccce---eEEecHHHHHhhcc-- Confidence 11111 1112211111111111 12236788999999999998764 443442 67899999987742 Q ss_pred chhhhcccccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeee Q lcl|NC_020078. 222 HITNGEYVTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVT 301 (339) Q Consensus 222 ~~~n~d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~ 301 (339) ..+ +++. ++..=...+--++.|...+.+-.. ++. ....+ +...-.++-+.....++++ T Consensus 218 -~~~--~~~~---tvl~~l~~n~~~l~I~~~~el~~a-g~~--------g~~~~-------v~y~~~~~~~~~~vp~~~~ 275 (314) T protein:vir:10 218 -LVP--QTNL---SYGELFTRNNPGLTIRFLQFLDNY-DGA--------GGKAA-------LAFEKSPLNMSIEIPEVTN 275 (314) T ss_pred -ccc--CCCc---cHHHHHHHhCCCcEEEEccccccc-CCC--------cceEE-------EEEecCCcEEEEecCccce Confidence 111 1111 111100001124555555554311 100 01000 0111122233333333333 Q ss_pred EEeeechhhhHHHHHHHHH-hCCccccccceEEE-EecCC Q lcl|NC_020078. 302 SKIFFDDLSKLWFIDSWLA-FGVTINRTEYAGVI-KLPAA 339 (339) Q Consensus 302 ~e~~~~~~~~~d~i~g~~~-~Ga~v~rPe~~v~i-~~~~a 339 (339) .-. ..++...+.+.+... .|..+.||++++.+ =+|=| T Consensus 276 ~l~-~e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~dGI~~~ 314 (314) T protein:vir:10 276 VLP-AQPKDLHFRYPVTSKATGLIVYRPLTMAVIKGITFA 314 (314) T ss_pred eec-ceecCceEEEcceeeeEEEEEECcceeEeeeeeecC Confidence 222 122334555555554 47999999998832 23434 No 198 >protein:vir:78148 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:4955 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294802;genbank:gi:149882823;genbank:GeneID:5309176 Probab=74.74 E-value=0.03 Score=28.95 Aligned_cols=110 Identities=12% Similarity=0.067 Sum_probs=64.7 Q ss_pred EECHHHHHHHhcccchhhhcccc-cccceeecceeEEEeceEEEEecccccccc----------ccccccCCCccccccc Q lcl|NC_020078. 208 VLPPAAFTALMQAEHITNGEYVT-SAGETLNTKYMFAAFGVPVITSNNAVFGKT----------ITDHLLSNANNEKAYD 276 (339) Q Consensus 208 vv~P~~~~~Ll~~~~~~n~d~~~-~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~----------~~~~~l~~~~~~~~y~ 276 (339) +|+--+|..++.++....+ ..- ..+..+..+.-.+++|.+.+.|.|+|++.. +....+... .|. T Consensus 1 vvsdlqfA~~~g~~v~~~a-LpRE~aNp~ltG~lpV~~~GltWl~tpnlpg~~a~vlDst~lGgmaDE~l~~P----gya 75 (123) T protein:vir:78 1 MLSGAQFAKLIGILVDDKA-LPREQANIVLTGSLPVSAYGLTWVTSRHITGTDPWLFDVEQLGGMADEKLLSP----EFA 75 (123) T ss_pred CcchhhHHHHhcchhcccc-cccccCCceEecCcceeeeceeeeecCCCCCCccceeehhhhccccccccCCC----ccc Confidence 5566678888776432211 111 223445555667799999999999995421 011111111 111 Q ss_pred cccceEEEEEeccceeEEEEEeeeeEEeeechh--hhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 277 GDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDL--SKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 277 ~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~--~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) . ... ..++++..|..+ .-.|.|+++.+-=.-++.|.+.+-|+=.+- T Consensus 76 ~-----------~~~------~Gvevkt~Red~~~nD~yriRaRRvTvpiv~EP~Agv~ltg~g~ 123 (123) T protein:vir:78 76 P-----------AGN------TGVEASTERAHQGVKDGYLVRGRRNTVAVVTEPMAGVRLTGTGL 123 (123) T ss_pred C-----------CCC------cceeEEeeccccCCCCceEEeeeecceeEEecCccceEEeeecC Confidence 1 001 113455557666 778899999999999999988877765555 No 199 >protein:vir:4786 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:3269 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150166;swissprot:trembl:q94m45;genbank:gi:15088777;uniprot:Q94M45;genbank:GeneID:955980 Probab=72.67 E-value=0.18 Score=24.67 Aligned_cols=265 Identities=13% Similarity=0.103 Sum_probs=123.4 Q ss_pred ccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhcccccc----ccccccceEEEecccc--ceeeeccCCCCC Q lcl|NC_020078. 8 TPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPV----RSVRGTSTISNRGISK--AKLQKIAPGTTP 81 (339) Q Consensus 8 ~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~----r~i~~G~tv~i~~iG~--~t~~~~~~g~~i 81 (339) .|+ | +.-.+. .|-|+|.+-+.+-|++++.|++..-- +-++.-++.---...+ +-++.|..++.. T Consensus 1 mp~-N--------~n~avr-~Y~Kqf~glL~~vf~~qa~F~~~FGglQalDGV~~N~tafsvKt~D~pVVig~Y~TdeNv 70 (295) T protein:vir:47 1 MPS-N--------QNNAVR-RYEKQYAGILETVFGVRAAFSNALAPIQILDGVQENSKAFSVKTNNTPVVIGEYKTGEND 70 (295) T ss_pred CCC-C--------CCccch-hhhHHHHHHHHHHHhHHHHHhhhhcchhhhhCCCccceEEEEeecCcceEeecccCCCcc Confidence 333 1 111222 58899999999999999999865432 2233333321122222 245567766666 Q ss_pred C-CCC------CCCccceEEEEeeh-hhhhhhH--HHHHHHhcCcchHHH---HHHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_020078. 82 P-PST------EPHTSKIFLKIDTV-IIARNAE--PMLDEFQTDFDYQGE---VAREQGQEIANMYDETFFIMAAKAAIA 148 (339) Q Consensus 82 ~-~~~------~~~~~~~~l~ID~~-~y~~~~v--dd~D~~q~~~d~~~~---~~~~~g~aLA~~~D~~i~~~l~~aA~~ 148 (339) - ..+ .--.-+..+.+|+. .|..-+. .-+|+.-.+-|+-.. -.+.++.|-++.+|..+-..|..+|.. T Consensus 71 agFGtGTg~SsRFG~rkEi~y~dtdV~Y~~~~~iHEGiD~~TVNnd~~aaVAdRL~LQA~Akt~~~n~~~Gk~ls~~A~~ 150 (295) T protein:vir:47 71 GGFGDNSGAQSRFGGVTEVKYENTDVNYDYTLTIHEGLDRYTVNNDLNAAVADRLKLQSEAQTRTVNKRIGKYLSDTATK 150 (295) T ss_pred cccccCCccccccCceeeEEeecccccccccchhhhccccccccCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh Confidence 3 111 11001222334432 2222111 345555555554444 445678888999998765444433211 Q ss_pred ccccccccccccCccccccccccCccccccHHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcc Q lcl|NC_020078. 149 SDSPYGTAAQMPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEY 228 (339) Q Consensus 149 ~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~ 228 (339) + ...++ .+.+.+-+.+..+.++....+|..|. -+.|.|+.|.+|..++-.+.+. T Consensus 151 t---------------------e~~td-~t~d~V~~LF~~as~~yvn~ev~~~~---~AyV~~evYnaiiD~~l~TsaK- 204 (295) T protein:vir:47 151 T---------------------EALAD-FTDDKVKALFNKLSAFYTNNEVTAPI---TVYLRSEFYNAIVDMASVTSAK- 204 (295) T ss_pred h---------------------hhhhc-ccchhHHHHHHHHHHHhhhhheeeee---EEEEchhHHHHHhccccccccc- Confidence 0 01111 22244445555666666666774442 3899999999999998777653 Q ss_pred cccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeech Q lcl|NC_020078. 229 VTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDD 308 (339) Q Consensus 229 ~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~ 308 (339) ++. .++-.-.|.+.-||.+.+.+.--..++ .-...+..+ +|..| ..|-++..+ ++|-+..- T Consensus 205 -~Ss-aNiDengi~~FkGf~i~e~P~~~~q~G-~~aifs~dn------------ig~af--tGIn~aR~I--esEdF~GV 265 (295) T protein:vir:47 205 -GAT-ISLDENGLPKYKGFTLEETPAQYFETG-VIAIFSPNG------------IIIPF--VGISTARVI--EAENFDGV 265 (295) T ss_pred -cce-eeeccCCcceecceEEEeccHhhccCC-cEEEEcccc------------ceeec--ccceeeeee--ecccccch Confidence 222 234444567889999988765433211 000011100 11111 122233332 23322211 Q ss_pred hhh--HHHHHHHHHhC---------Ccccc Q lcl|NC_020078. 309 LSK--LWFIDSWLAFG---------VTINR 327 (339) Q Consensus 309 ~~~--~d~i~g~~~~G---------a~v~r 327 (339) .-| --.+..+++-= --..| T Consensus 266 alQ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 295 (295) T protein:vir:47 266 NCKLLLRVVLTLLMTIRKQFTKLQELLYRR 295 (295) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 100 01111111100 00112 No 200 >protein:vir:99888 Length: 309 # NCBI annotation: capsid protein # Family: family:all:908 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164075;genbank:gi:56692607;genbank:GeneID:3192616 Probab=65.73 E-value=0.22 Score=24.20 Aligned_cols=286 Identities=12% Similarity=0.051 Sum_probs=98.5 Q ss_pred cccCcccCCC-cccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccccceee-----ecc Q lcl|NC_020078. 3 IFDGQTPSYD-VTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISKAKLQ-----KIA 76 (339) Q Consensus 3 ~~~~~~~~~~-~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~~t~~-----~~~ 76 (339) |-.|.||..+ ||.--.+= .+.. ||. .++| +.+.+. ..+.+++..|+..+. ... T Consensus 1 ~~~~~~~~dp~LT~~A~gy--~n~~--~Ia------------~~l~-P~vpV~----~~~~~~~~f~~~e~F~~~~t~r~ 59 (309) T protein:vir:99 1 MSNAPFPIDPELTAIAIAY--RNGR--MIS------------DEVL-PRVPVG----KQEFKFWKYDLAQGFTVPETLVG 59 (309) T ss_pred CCCCCcCcCHhHHHHHhhc--cChh--hhh------------hhcC-CccccC----ccccceeeechhhcccccchhhc Confidence 6678888753 55442211 1111 221 1222 333221 222334444432211 011 Q ss_pred CCCCCCCCCCCCccceEE-EEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Q lcl|NC_020078. 77 PGTTPPPSTEPHTSKIFL-KIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGT 155 (339) Q Consensus 77 ~g~~i~~~~~~~~~~~~l-~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~ 155 (339) ++....-- +...++.+. +.|..+...+.-.++.++...+|++....+.....|.+..+..+...+..++. . T Consensus 60 ~~~~~~~v-~~~~~~~~~~~~~~~L~~~i~~~~~~~a~~~~d~~~~Av~~l~~~i~l~rE~~~A~lv~~~a~-------y 131 (309) T protein:vir:99 60 RKSKPNEV-EFSATDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNS-------Y 131 (309) T ss_pred cCCCcceE-eecccCceeeecccceeecCCchhhhhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcChhh-------c Confidence 11111100 011111111 22222323333446667777899998887776655555544333222222211 0 Q ss_pred cccccCccccccccccCccccccH-HHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhc-cccccc Q lcl|NC_020078. 156 AAQMPGHSGGNVVTLAGANDYKDP-AKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGE-YVTSAG 233 (339) Q Consensus 156 ~~~~~g~~~~~~~~~~~~~~~~~~-~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d-~~~~~~ 233 (339) ..+..++++++..-.++ ...+.-|..+..++.. .| ..++++...|..|+.|++++++- |+..+. T Consensus 132 -------~~~~k~~Lsgt~~wsd~~SDPi~~i~~~~~~~g~----~P---N~~vlg~~~~~~l~~hp~i~~~ik~~~~~~ 197 (309) T protein:vir:99 132 -------AAGNKTTLSGADQWSDPTSNPLPVITDALDSVIL----RP---NIGVLGRRTATILRRHPKIVKAYNGSLGDE 197 (309) T ss_pred -------CCCceEEecCccccCCCCCCcHHHHHHHHHhhCC----Cc---ceEEechHHHHHHhhCHHHHHHhcCCCccc Confidence 11122333333221111 1233334444444421 24 37899999999999999999884 433322 Q ss_pred ceeecceeEEEeceE-EEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhH Q lcl|NC_020078. 234 ETLNTKYMFAAFGVP-VITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKL 312 (339) Q Consensus 234 ~~l~~G~v~~i~G~~-V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~ 312 (339) ..+.--.+..++|++ |++.....++.. .+. ..+-... ..+.+.|++........+....-=-..|..+..+ T Consensus 198 g~it~~~la~l~~ve~V~vg~a~~n~a~-~g~----~~~~~~i---wg~~~~L~y~~~~~~~~~~ps~G~t~~~~~r~~g 269 (309) T protein:vir:99 198 GMVPMAFLQELLELDAIYIGEARLNIAR-PGQ----NPNLIRA---WGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSG 269 (309) T ss_pred cccCHHHHHHHhCcceEEeecceeeccc-ccc----ccccccc---cCCcEEEEEcCCCCCCcccccccceeecccccCC Confidence 222222334467773 554333221110 000 0000000 0001111111111100000000000000111111 Q ss_pred HHHHHHH-HhCCccccccceEEEEecCC Q lcl|NC_020078. 313 WFIDSWL-AFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 313 d~i~g~~-~~Ga~v~rPe~~v~i~~~~a 339 (339) ..++-.+ .=|...+|---.+.=.+.+. T Consensus 270 ~~~d~~~~~~g~~~vr~~~~~k~~i~~~ 297 (309) T protein:vir:99 270 SIADPNIGLRGGQRVRVGESVKELVTAP 297 (309) T ss_pred ceeeeeeccCCceEEEEeccccchhcch Confidence 1110000 00111111000000000000 No 201 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=61.53 E-value=0.35 Score=23.07 Aligned_cols=299 Identities=11% Similarity=0.030 Sum_probs=111.7 Q ss_pred Cc-cccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccccceeeeccC-- Q lcl|NC_020078. 1 MS-IFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISKAKLQKIAP-- 77 (339) Q Consensus 1 ~~-~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~~t~~~~~~-- 77 (339) ++ |..+. .++.-.+ .|-.. -+++..-|. .-+.++.+.+..++- -.+.++..|+.||-....-+.. T Consensus 15 ~~~i~k~~---it~~~l~----~g~L~---p~~a~~Fl~-~v~~~t~iL~~~r~~-~~~s~~~ei~kig~G~r~~r~~~e 82 (360) T protein:vir:99 15 MNSLSQKD---IGLAELD----GFQLP---VDVTEEFLE-RMQKGVQILGMADTM-TLARLEMEVPQFGVPRLSGHTRDE 82 (360) T ss_pred HHHHHhhh---ccccccC----ceeec---HHHHHHHHH-HHhhccchhhhccee-ecccccccccccccceeecccccc Confidence 22 22332 1222221 11111 144444443 334555556666544 2345666677776543222221 Q ss_pred -CCCCCCCCCCCccceEEEEeehhhhh-hhHHHHHH----HhcC--cchHHHHHHHHHHHHHHHHHHHH----------- Q lcl|NC_020078. 78 -GTTPPPSTEPHTSKIFLKIDTVIIAR-NAEPMLDE----FQTD--FDYQGEVAREQGQEIANMYDETF----------- 138 (339) Q Consensus 78 -g~~i~~~~~~~~~~~~l~ID~~~y~~-~~vdd~D~----~q~~--~d~~~~~~~~~g~aLA~~~D~~i----------- 138 (339) |+....+.....+-.....+...++. +..+++-+ -+.. -.+++.++++.|+-|....-+.= T Consensus 83 ~~~~~~~~~~~~~~v~~~~~~~~~~~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l~~~g~~ds~d~~~~~~ 162 (360) T protein:vir:99 83 EGSRTENSEAESGSVKFNATDKSYYILVEPKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLMGIRAGASSGNLQSIGG 162 (360) T ss_pred CCCCCcCCcCccccCccccccceeeEeechHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHHHhhccchhcccccCcc Confidence 22121111111111111222222222 22223222 1111 23667777776665544322211 Q ss_pred -------HHHHHhhccc-ccccccccccccCccc--------cccccccCccccccHHH-HHHHHHHHHHHHHhcCCCCC Q lcl|NC_020078. 139 -------FIMAAKAAIA-SDSPYGTAAQMPGHSG--------GNVVTLAGANDYKDPAK-LYAAIASLVEKFLEKDVRPN 201 (339) Q Consensus 139 -------~~~l~~aA~~-~~~~~~~~~~~~g~~~--------~~~~~~~~~~~~~~~~~-l~~ai~~a~~~L~e~dV~~p 201 (339) ..=+.|-|.. ...... ++-..+.+. ++...........++.. ....+.++.+.|..+----| T Consensus 163 ~d~fl~~~dGwlKka~~~~~~id~-a~d~t~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~lf~~~~~~Lp~kyr~~~ 241 (360) T protein:vir:99 163 AAELDNTFKGWIARAEGDAQSVDD-AGDSTRIGLEDTATADADSMPSIANTDGSGNPQPVDTSLFNETIQTLDSRYRESD 241 (360) T ss_pred cchhhhhhHHHHHHhhcccchhhc-cccccccccccccccccccchhhhccccccccccchHHHHHHHHHhcchhhhcCc Confidence 0001111100 000000 000000000 00000000000001111 11223344444543321001 Q ss_pred cCCeEEEECHHHHHHHhcccchhhhcccccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccce Q lcl|NC_020078. 202 EEDMILVLPPAAFTALMQAEHITNGEYVTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKD 281 (339) Q Consensus 202 ~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~ 281 (339) ...-+.+++|..+....+ .+.+++ ++.++..+.++....+.|++|+..+.+|... T Consensus 242 ~~~~~~~~s~~~~~~yr~--~L~~R~-t~LGd~~l~g~~~~~~~Gipi~~v~~~pd~~---------------------- 296 (360) T protein:vir:99 242 AYSPVLMTSPNQVQSYTM--SLTERE-DPLGSAVIFGDSDITPFSYDLVGVNGFPDEY---------------------- 296 (360) T ss_pred ccceEEEccCchHHHHHH--HHhccC-cccchhheecccccccceeeeEEcCCCCCCc---------------------- Confidence 112245778877655554 345554 4555555666666678999999999998432 Q ss_pred EEEEEeccceeEEEEEeeeeEEeeechhhhHHH---HHHH-HHhCCccccccceEEEE----ecCC Q lcl|NC_020078. 282 IVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWF---IDSW-LAFGVTINRTEYAGVIK----LPAA 339 (339) Q Consensus 282 ~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~---i~g~-~~~Ga~v~rPe~~v~i~----~~~a 339 (339) .++-++.=|..+-..++..+...++.+.++. ++.+ .++=--+++-+.+|++. .+.| T Consensus 297 --~mlT~p~NLi~g~~~~iri~~~~e~~~~~~~~~~~~~~~~~~~D~~iee~~Av~~vt~~~~~~~ 360 (360) T protein:vir:99 297 --MMFTDPNNLAFGLYEEMELDQSTDTDKVHEQRLHSRNWLEGQFDFQIKEQQAGVLVTDLETPTA 360 (360) T ss_pred --eEEeccCceeEEeeeeeEEeecccchhhhhhceeeeEEEEEEeeEEEEecccEEEEecCCCCCC Confidence 2445666666666666665544444332220 1111 01111122332233332 2333 No 202 >protein:vir:1991 Length: 305 # NCBI annotation: major head subunit # Family: family:all:776 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050638;genbank:gi:9633525;genbank:GeneID:2636267 Probab=52.14 E-value=0.56 Score=21.95 Aligned_cols=212 Identities=14% Similarity=0.077 Sum_probs=98.7 Q ss_pred CccccCcccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHH-Hhhhccccccc-----cccccceEEEec----cccc Q lcl|NC_020078. 1 MSIFDGQTPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKR-RSIMAGFVPVR-----SVRGTSTISNRG----ISKA 70 (339) Q Consensus 1 ~~~~~~~~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~-~sv~~~~v~~r-----~i~~G~tv~i~~----iG~~ 70 (339) |-| .....++||. -|....+.+|+. .+-+..+.++. +-+.|=-=+||. ||+. T Consensus 1 M~i-----------------~~~~l~~l~~-~~~~~f~~~~~~a~~~~~~iA~~vpSt~~~~tY~wLg~fP~lrewiGer 62 (305) T protein:vir:19 1 MIV-----------------TPASIKALMT-SWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWVGKR 62 (305) T ss_pred Ccc-----------------CHHHHHHHHH-HHHHHHHHHHhhcCcccceEEeEecCCCCcccccccccCCccchhhcce Confidence 332 1222344443 345555666654 22222222111 111222223332 4666 Q ss_pred eeeeccCCCCCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Q lcl|NC_020078. 71 KLQKIAPGTTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASD 150 (339) Q Consensus 71 t~~~~~~g~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~ 150 (339) .++....+..--.+ +.=+.++-|+ =+||++- ++.+-+.+.+++|++-|..-|+.++..| +++-.+. T Consensus 63 ~i~~l~~~~y~i~N---k~fe~tV~V~--------R~dIeDD--~lG~y~p~~~~~G~~aa~~pd~lv~~lL-~~Gf~~~ 128 (305) T protein:vir:19 63 TIQQMEAHGYSIAN---KTFEGTVGIS--------RDDFEDD--NLGIYAPIFQEMGRSAAVQPDELIFKLL-KDGFTQP 128 (305) T ss_pred eeeeccccceeEee---ccccceeccc--------hhhcccc--ccCchHHHHHHHHHHHhhchhhHHHHHH-HhcCCcc Confidence 66655544432211 2234555565 2355543 3678889999999999999999999866 4444443 Q ss_pred ccccccccccCccc-------cccccccC--------------------------------------------------- Q lcl|NC_020078. 151 SPYGTAAQMPGHSG-------GNVVTLAG--------------------------------------------------- 172 (339) Q Consensus 151 ~~~~~~~~~~g~~~-------~~~~~~~~--------------------------------------------------- 172 (339) -.+.++.+.++|.. |......+ T Consensus 129 cyDGq~FFdtDHpv~~~~~~tg~~~~vsn~~~~~~~~g~~w~Lld~~~~ikP~I~Q~Rk~~~~~~~~~~~d~~vf~~~e~ 208 (305) T protein:vir:19 129 CYDGQNFFDKEHPVYPNVDGTGSAVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEF 208 (305) T ss_pred CCCCCcccCCCCCcccCCcccccccchhhhhcCCCCCCceeeeeecCCcceeEEEecccccceeeccCCCchhhhhhcee Confidence 33444444444422 10000000 Q ss_pred --------------------ccccccHHHHHHHHHHHHHHHHhcCC----CCCcCCeEEEECHHHHHHHhcccchhhhcc Q lcl|NC_020078. 173 --------------------ANDYKDPAKLYAAIASLVEKFLEKDV----RPNEEDMILVLPPAAFTALMQAEHITNGEY 228 (339) Q Consensus 173 --------------------~~~~~~~~~l~~ai~~a~~~L~e~dV----~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~ 228 (339) .+.+.+.+ .+..+++.|....- |---..+++||||.....- .++++++. T Consensus 209 ~ygvd~R~n~Gygfwq~a~gS~~~Ls~~----nl~aar~aM~~qk~d~G~pL~I~P~~LvVPp~LE~~A---~qll~s~~ 281 (305) T protein:vir:19 209 LFGASTRRAAGYGFWQMAVAVKGDLTLD----NLWKGWQLMRSFEGDGGKKLGLKPTHIVVPVGLEKAA---EQLLNREL 281 (305) T ss_pred eeeeeeeeeccccchhheecCCCCCCHH----HHHHHHHHHHhhcCCCCceeeeecCeEEeCchhHHHH---HHHHhhcc Confidence 01112222 33344444433220 1111235899999876554 33555443 Q ss_pred cccccceeecceeEEEec-eEEEEeccc Q lcl|NC_020078. 229 VTSAGETLNTKYMFAAFG-VPVITSNNA 255 (339) Q Consensus 229 ~~~~~~~l~~G~v~~i~G-~~V~~Snnl 255 (339) ...+.. +.+=-+.| +++++++.| T Consensus 282 i~~g~~----~~~Np~~g~~eliV~P~L 305 (305) T protein:vir:19 282 FADGNT----TVSNEMKGKLQLVVADYL 305 (305) T ss_pred cCCccc----cccceecceEEEEecccC Confidence 221111 11101233 678888888 No 203 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=51.16 E-value=0.59 Score=21.84 Aligned_cols=296 Identities=11% Similarity=0.018 Sum_probs=123.3 Q ss_pred CccccCc-ccCCCcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccc---cceEEEec---cccceee Q lcl|NC_020078. 1 MSIFDGQ-TPSYDVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRG---TSTISNRG---ISKAKLQ 73 (339) Q Consensus 1 ~~~~~~~-~~~~~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~---G~tv~i~~---iG~~t~~ 73 (339) |--.|.+ -|| +++-......+|-+. ||.-|--.+.+..-.--+...++.+.+ ++ -+++.|+. .|+.+ T Consensus 56 md~~~~~~~~~-~~~~l~~~~~~g~~~--~l~~~~p~~i~~~tap~~a~~l~pv~t-~g~W~~~~~~~~v~e~~G~A~-- 129 (379) T protein:vir:10 56 MDSNDIGPIPT-PLSPLSPVSIPGLIQ--FLQNWLPGHVRILTAVREADEFLGLST-VGQWDDEQIVQRVLEGLGTAQ-- 129 (379) T ss_pred hcccccccccc-ccCccccccccchHH--HHHhhcchHHHHHhhhhhhhhhccccc-CCCceeeeEEEeeeeeeeeeE-- Confidence 2211111 111 111110111222233 787777766666656666666766665 33 24555554 35543 Q ss_pred eccCCCCCCCCC-CCCccceEEEEeehhhhhhhHHHHHHHh-cCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Q lcl|NC_020078. 74 KIAPGTTPPPST-EPHTSKIFLKIDTVIIARNAEPMLDEFQ-TDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDS 151 (339) Q Consensus 74 ~~~~g~~i~~~~-~~~~~~~~l~ID~~~y~~~~vdd~D~~q-~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~ 151 (339) -|.-+.+++... ..+-.++.+..=+ .-+..-+..+-++| +..++-.+-.+.+..+|.+..|+..|.=. .++ T Consensus 130 ~ygd~~d~pl~d~~~~~~~r~v~~~~-~g~~yg~~El~~Aa~~g~~l~~~Ka~aA~~ale~~~N~i~f~G~------~d~ 202 (379) T protein:vir:10 130 PYTDGGNMALMSWTPTFETRTVVRFE-AGLQVAPLEEARSSRVQVSSADEKRAMVGEALEVQRNRVAFYGY------NDG 202 (379) T ss_pred EeccccCCCeeeeeeeeeeeeeEEEE-EEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEee------cCC Confidence 444344442111 1111122221101 01111223334443 46888888888888888888887553110 000 Q ss_pred cccccc--cccCccccccc-cccCccc---cccHHHHHHHHHHHHHHHHhc--CCCCCc-CCeEEEECHHHHHHHhcccc Q lcl|NC_020078. 152 PYGTAA--QMPGHSGGNVV-TLAGAND---YKDPAKLYAAIASLVEKFLEK--DVRPNE-EDMILVLPPAAFTALMQAEH 222 (339) Q Consensus 152 ~~~~~~--~~~g~~~~~~~-~~~~~~~---~~~~~~l~~ai~~a~~~L~e~--dV~~p~-~~R~~vv~P~~~~~Ll~~~~ 222 (339) .....+ +.|+....... +..+... ..+++.+++-|..+...|-.+ .+..|. ..-.++++|..+..|-.-. T Consensus 203 ~~~~yGllNdP~l~a~~t~atg~~~~t~Wa~kT~~eI~~Di~~~~~~l~~qs~g~~~~~~~~~tL~LP~~~~~~L~~~n- 281 (379) T protein:vir:10 203 SGRTFGFLNDPNLPAYVAVPNGAGGSPLWAQKTTLEIIADLRNGLTALQVQSMGRIKSNKTPITIGIPNAYENYITTPT- 281 (379) T ss_pred CcceEEEEeCCCCcccccccCCcccccccccCCHHHHHHHHHHHHHHHHHhhCCeecccccceeEEecHHHHHhhcccc- Confidence 000001 11222221111 1111111 236788888888887776433 221232 2337889999999996532 Q ss_pred hhhhcccccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEE------E- Q lcl|NC_020078. 223 ITNGEYVTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLA------G- 295 (339) Q Consensus 223 ~~n~d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~------~- 295 (339) +|.-+-.+-++ .+.-|++|...+.+-. .+ ++. ..++++-++.-+- + T Consensus 282 ----~~g~Tvl~~lk----~n~Pnl~i~t~pEL~~-ag--------gg~----------~~~~~~~~~~~~~~t~~~~~~ 334 (379) T protein:vir:10 282 ----ELGYSVAQYMR----ESYPNVTFVSAPELND-AN--------GGS----------SAIYYYADAVENNGTDDGRTW 334 (379) T ss_pred ----ccCccHHHHHH----HhcCCcEEEEcccccc-cC--------CCc----------cEEEEEeeccCCCccCCcceE Confidence 12111111111 1233566666666521 11 000 0112221110000 0 Q ss_pred -EEeeeeEEeee-chhhhHHHHHH-HHHhCCccccccceEEEEecCC Q lcl|NC_020078. 296 -STIPVTSKIFF-DDLSKLWFIDS-WLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 296 -~~~~~~~e~~~-~~~~~~d~i~g-~~~~Ga~v~rPe~~v~i~~~~a 339 (339) ...+.+..... ..+...+.+.+ --..|+-+.||-+++ +..+| T Consensus 335 ~~~~p~k~~~l~ve~~~~~~~~~~~~rt~Gv~ir~P~Ai~--~~~G~ 379 (379) T protein:vir:10 335 LQVVPTKMFTLGVEKKIKGYAEGYTNATAGAMLKRPFATY--RQTGA 379 (379) T ss_pred EEecchhhhhccceecCceeEeccccceeeeeeecchhhh--eecCC Confidence 00000000000 00111222222 235788899996654 45777 No 204 >protein:vir:79078 Length: 307 # NCBI annotation: gp8 # Family: family:all:908 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111208;genbank:gi:134288798;genbank:GeneID:4960752 Probab=49.80 E-value=0.63 Score=21.69 Aligned_cols=285 Identities=11% Similarity=0.073 Sum_probs=100.0 Q ss_pred CccccCcccCCC-cccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccccceEEEeccccceee--e--c Q lcl|NC_020078. 1 MSIFDGQTPSYD-VTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRGTSTISNRGISKAKLQ--K--I 75 (339) Q Consensus 1 ~~~~~~~~~~~~-~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~G~tv~i~~iG~~t~~--~--~ 75 (339) |+-....||..+ ||.--.+=. ++ -|| ++ .+ .+.+.+ ...+.+++.+|+-... + . T Consensus 1 m~~~~~~~~~dp~LT~~A~gy~--n~--~~I----ad--------~l-fP~vpV----~~~~~k~~~f~~e~f~~~~t~r 59 (307) T protein:vir:79 1 MGRLSKLRIVDPVLTNLAIGYT--NA--EFI----GQ--------TL-MPVVEV----EKEGGKIPKFGKESFRLYQTER 59 (307) T ss_pred CCCCCCCcccCHHHHHHHhhcc--ch--hhh----hh--------hc-CCcccc----cccccceeeecccccccccccc Confidence 888888888753 444321111 11 122 11 11 122211 1222333333332111 0 1 Q ss_pred cCCCCCCCCCCCCccceEEEEeehhhhhhhHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Q lcl|NC_020078. 76 APGTTPPPSTEPHTSKIFLKIDTVIIARNAEPMLDEFQTDFDYQGEVAREQGQEIANMYDETFFIMAAKAAIASDSPYGT 155 (339) Q Consensus 76 ~~g~~i~~~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~ 155 (339) .++....--+....+..++.+++.- ....||+.+...+.+|++....+..-..+.+..+ ..+|.....+... T Consensus 60 a~~~~~~~v~~~~~~~~~~~~~~~~-l~~~id~r~~~~~~~~~~~~Av~~l~d~I~l~~E-------~~~A~l~~~~~~y 131 (307) T protein:vir:79 60 ALRAKSNRMNPEDIDSVDVNLDEHD-LEYPIDYREDQESAFPLEQAAVQTATDAIQLRRE-------KMIADLSQNPSSY 131 (307) T ss_pred ccCCCcceeeeeccccccccccccc-hhhcccchhcCCCCCCHHHHHHHHHHHHHHhHHH-------HHHHHHhcccccc Confidence 1121111100001122233333322 2345777777777888877555544333333333 2333222111111 Q ss_pred cccccCccccccccccCcccccc-HHHHHHHHHHHHHHHHhcCCCCCcCCeEEEECHHHHHHHhcccchhhhcccccccc Q lcl|NC_020078. 156 AAQMPGHSGGNVVTLAGANDYKD-PAKLYAAIASLVEKFLEKDVRPNEEDMILVLPPAAFTALMQAEHITNGEYVTSAGE 234 (339) Q Consensus 156 ~~~~~g~~~~~~~~~~~~~~~~~-~~~l~~ai~~a~~~L~e~dV~~p~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~~ 234 (339) ..+..++++++..-.+ ....+.-|.+..+.+.+.--..| ..+++++..|..|+.|++++++=- +.... T Consensus 132 -------~~~~k~tLsgt~~Wsd~~sDPi~di~~~~~ai~~~~g~~P---n~~vlg~~a~~~l~~h~~i~~~lk-~~~~g 200 (307) T protein:vir:79 132 -------AAGNKKQLSATEKFTAANSDPVGVIEDGKEAIRTKIGRRP---NTMVIGASAYKTLKAHPQLIEKIK-YSMKG 200 (307) T ss_pred -------CCCceEEEccCcccCCCCCCcHHHHHHHHHHHHHhhCCcc---ceEEeCHHHHHHHhcCHHHHHHhc-Ccccc Confidence 1122333433321111 12233445555555554433334 478999999999999999988632 22222 Q ss_pred eeecceeEEEeceE-EEEecccccc-------ccccccccC------CCccccccccccceEEEEEeccceeEEEEEeee Q lcl|NC_020078. 235 TLNTKYMFAAFGVP-VITSNNAVFG-------KTITDHLLS------NANNEKAYDGDFKDIVAQMFSPKALLAGSTIPV 300 (339) Q Consensus 235 ~l~~G~v~~i~G~~-V~~Snnlp~~-------~~~~~~~l~------~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~ 300 (339) .+.--.+..++|+. |+.-...... .|+.+-.+. ..+..+-|...|.-+ +.... .. T Consensus 201 ~it~~~la~l~~v~~V~vg~a~y~~~~~~~~~iw~~~~~l~y~~~~~~~~~~~~~~ps~Gyt----~~~~g-------~~ 269 (307) T protein:vir:79 201 IVTVDLLKEIFEVENIAVGEAIYADDKDRFTDIWGANIVLAYVPLQRGGQQRTPYEPSYGYT----LRKKG-------NP 269 (307) T ss_pred ccCHHHHHHHhCceeEEEeeeeeecccccchhcCCCceEEEecccccCCCCCccccccccee----EEecC-------ce Confidence 22222333456665 3222211111 011111110 001111111111110 00000 00 Q ss_pred eEEeeechhhhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 301 TSKIFFDDLSKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 301 ~~e~~~~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .++ .+.+...+|.|+.....=-.++=||+--- +..| T Consensus 270 ~~d-~~~~~~~~~~vrv~~~~~~~i~~~~~G~l--i~~~ 305 (307) T protein:vir:79 270 VVD-TRIEDGKLELVRATDIFRPYLLGADAGYL--ISGI 305 (307) T ss_pred EEe-cccCCCceeEEeecccccceeeccccchh--hccC Confidence 001 11112223332222111111111110000 0111 No 205 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=45.42 E-value=0.77 Score=21.20 Aligned_cols=297 Identities=12% Similarity=0.045 Sum_probs=126.2 Q ss_pred CccccCcccCC---Cccc-CCccCcccchhHHHH-HHH---HHHHHHHHHHHhhhcccccccc-ccc-cceEEEecc--- Q lcl|NC_020078. 1 MSIFDGQTPSY---DVTR-PNQRHGAGDPLADVT-EQF---TGTVEGTIKRRSIMAGFVPVRS-VRG-TSTISNRGI--- 67 (339) Q Consensus 1 ~~~~~~~~~~~---~~~r-~~~~~~~~~~~a~~i-e~~---~g~v~~~f~~~sv~~~~v~~r~-i~~-G~tv~i~~i--- 67 (339) -.++--++..- +..+ ++....+.+. ..|+ ++| ...|.+.-....+.+.++..++ +.- =.++.+..+ T Consensus 8 ~~~~~d~~~~~~~a~~~~~~~~~~~~~~~-~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~~~~~~~~~~t~~~~~~~ 86 (329) T protein:vir:79 8 KEMKYDEFEANVIANHMQLRGAKNDASDM-GIWTSQELHKIKAQAYEKEYPAGSALRVFPVTSELSDTDKTFEYQTFDKV 86 (329) T ss_pred hhhccchhhhhhHhhhcccccceeccchh-hHHHHHHHHHHHHHHHhhhhcccchhhhcccccCCCCceeEEEeeeeecc Confidence 01111112111 1111 1111111222 2455 443 3334333333455566666553 222 245554444 Q ss_pred ccceeeeccCC-CCCCCCCCCCccceEEEEe-ehhhhhhhHHHHHHHh-cCcchHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_020078. 68 SKAKLQKIAPG-TTPPPSTEPHTSKIFLKID-TVIIARNAEPMLDEFQ-TDFDYQGEVAREQGQEIANMYDETFFIMAAK 144 (339) Q Consensus 68 G~~t~~~~~~g-~~i~~~~~~~~~~~~l~ID-~~~y~~~~vdd~D~~q-~~~d~~~~~~~~~g~aLA~~~D~~i~~~l~~ 144 (339) |..+ -|..+ ++++.. +.+-.+....|= -..-+..-+.++..++ +..++-..-...+..++++..|+.+|.= T Consensus 87 G~a~--~~~d~~~dip~v-d~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G--- 160 (329) T protein:vir:79 87 GHAK--IIADYTDDLSTV-DALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQLVNHLVFKG--- 160 (329) T ss_pred eeee--eecCccccccee-ecccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEee--- Confidence 4433 33322 223211 112222222221 1122345567888886 4788888888999999999999877521 Q ss_pred hccccccccccccc--ccCccccccccccC-ccccccHHHHHHHHHHHHHHHHhc--CCCCCcCCeEEEECHHHHHHHhc Q lcl|NC_020078. 145 AAIASDSPYGTAAQ--MPGHSGGNVVTLAG-ANDYKDPAKLYAAIASLVEKFLEK--DVRPNEEDMILVLPPAAFTALMQ 219 (339) Q Consensus 145 aA~~~~~~~~~~~~--~~g~~~~~~~~~~~-~~~~~~~~~l~~ai~~a~~~L~e~--dV~~p~~~R~~vv~P~~~~~Ll~ 219 (339) ++.....+. .++........... .-...+++.+++-|..+..+|.++ .+..| -.++|+|+.|..|.. T Consensus 161 -----~~~~g~~GLlN~p~v~~~~~~~~~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~~~p---~~L~Lpp~~~~~L~~ 232 (329) T protein:vir:79 161 -----SKPHKIISVFEHPNLTTINSAGWNNAAGTGKKPETAQDELEQAIEKIETLTNGQHRA---NMILIPPSMRKVLMV 232 (329) T ss_pred -----cccccceeeecCCCccccccCCCCCccccccCHHHHHHHHHHHHHHHHHhcCceecc---cEEEecHHHHHHhhc Confidence 111111111 11211111000000 011236888999999998888764 33334 268899999988853 Q ss_pred ccchhhhcccccccceeecceeEEEeceEEEEeccccccccccccccCCCccccccccccceEEEEEeccceeEEEEEee Q lcl|NC_020078. 220 AEHITNGEYVTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIP 299 (339) Q Consensus 220 ~~~~~n~d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~ 299 (339) - ..+ +.-+-..-++. +--++.|...+.+- +.+.. .. ...+.+...++-+.....++ T Consensus 233 ~--~~~--~~~tvl~~lk~----~~~~l~I~~~~el~-~ag~~--------g~-------~~~v~y~~~~~~~~~~vp~~ 288 (329) T protein:vir:79 233 R--MPE--TTMSYLDYFKQ----QNGGITIESISELE-DIDGA--------GT-------KAALVYEKDPMNMSIEIPEA 288 (329) T ss_pred c--cCC--CCccHHHHHHH----hCCCcEEEEccccc-ccCCC--------Cc-------eEEEEEecCCceEEEecCcc Confidence 1 111 11111111111 01223344434331 11100 00 00111222233333333333 Q ss_pred eeEEeeechhhhHHHHHHHH-HhCCccccccceEE---EEec Q lcl|NC_020078. 300 VTSKIFFDDLSKLWFIDSWL-AFGVTINRTEYAGV---IKLP 337 (339) Q Consensus 300 ~~~e~~~~~~~~~d~i~g~~-~~Ga~v~rPe~~v~---i~~~ 337 (339) ++.... ..+...+.+.+.. ..|+-+.||++++. |... T Consensus 289 ~~~l~~-q~~~~~~~v~~~~r~~Gv~i~~P~ai~~~dGI~~~ 329 (329) T protein:vir:79 289 FNMLTA-QPKDLHFKVPCTSKCTGLTIYRPLTLVLIKGLVVG 329 (329) T ss_pred eeeeec-eecCceEEEceeeeEEEEEEECcceeeeeeeeeeC Confidence 333221 2233344444444 45688889988763 3344 No 206 >protein:vir:103181 Length: 457 # NCBI annotation: gp135 # Family: family:all:364 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717802;genbank:gi:113200639;genbank:GeneID:4239190 Probab=32.84 E-value=1.4 Score=19.79 Aligned_cols=309 Identities=14% Similarity=0.065 Sum_probs=131.0 Q ss_pred Cc------------cccCcccCCCc------ccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhcccc---cccccccc Q lcl|NC_020078. 1 MS------------IFDGQTPSYDV------TRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFV---PVRSVRGT 59 (339) Q Consensus 1 ~~------------~~~~~~~~~~~------~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v---~~r~i~~G 59 (339) |+ =+-|.=|..-+ -|+.+++..+...+-.-|.|-.|.++.|.-..-..... ......+. T Consensus 76 i~l~Rra~p~LIa~DIwGVQPmTgPTGLIFAmRsrY~~q~~~~~a~~~EAl~nEadt~fSg~~~~~~~~~~~~~~~~~gt 155 (457) T protein:vir:10 76 ISLIRRSMPQLIAYDIAGVQPMTGPTGLIFAMRTNYGAERNPAAAGYDEAFFNEPNAGFSGGPGAYDPGATGVTNDAEGT 155 (457) T ss_pred hhhhHHHHhhhhhhhcceeecCCCcceeeeeeeeeecCccccccccccceeeeccCcccCcccccccccccccccccccc Confidence 00 01122233221 14445444433211122333344444442110000000 00001111 Q ss_pred ceEEEeccccceeeeccC--------CCCCCC-CCCCCccceEEEEeeh--------hhhhhhHHHHHHHhc-C-cchHH Q lcl|NC_020078. 60 STISNRGISKAKLQKIAP--------GTTPPP-STEPHTSKIFLKIDTV--------IIARNAEPMLDEFQT-D-FDYQG 120 (339) Q Consensus 60 ~tv~i~~iG~~t~~~~~~--------g~~i~~-~~~~~~~~~~l~ID~~--------~y~~~~vdd~D~~q~-~-~d~~~ 120 (339) ...-.+--+..+...|.. ++.+.+ .......+....||+. +-+...+.-.-+.++ | .|.-. T Consensus 156 ~~~~~~~~~~~~~~~~~~~~gmsTA~aE~lgd~~~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAiHGLDAEt 235 (457) T protein:vir:10 156 NPALLNDSPAGTYEQADDATGMSTATVEALDDSTANTAFREMGFSIEKVTVTARARALKAEYSIEMAQDLKAIHGLDAEQ 235 (457) T ss_pred cccccCccccccccccccccchhhhhhhccCCCCCccchhhheeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhH Confidence 111010000000001111 112211 0111234555556532 223444444444455 4 89999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCccccccccccCc-cccccHHHH----HHHHHHHHHHHHh Q lcl|NC_020078. 121 EVAREQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAGA-NDYKDPAKL----YAAIASLVEKFLE 195 (339) Q Consensus 121 ~~~~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~~-~~~~~~~~l----~~ai~~a~~~L~e 195 (339) |++.=.+.++...+.+-|++.+..-|... +..+. ...-+..+... ...+.+++. ++-..++....-+ T Consensus 236 ELaNILStEImlEINReii~~l~~~a~~~----~~~~~----~~~gv~dl~~~~~g~~~~e~~k~L~~~i~~ean~i~~~ 307 (457) T protein:vir:10 236 ELANILSTEILAEINREVVRTIYTNAVAG----AQNNT----ATAGVFDLDVDSNGRWSVEKFKGLLFQIERDANAIGHQ 307 (457) T ss_pred HHHHHHHHHHHHHhhHHHHHhHhhhheee----ecccc----ccceeeeeeccccchhhHHHHHHHHHHHHHHHHHHHHh Confidence 99999999999999999999998655321 11111 11111111111 112233322 2222333333322 Q ss_pred cCCCCCcCCeEEEECHHHHHHHhccc--chhhh---cccccccceeecceeEEE-eceEEEEe----ccccccccccccc Q lcl|NC_020078. 196 KDVRPNEEDMILVLPPAAFTALMQAE--HITNG---EYVTSAGETLNTKYMFAA-FGVPVITS----NNAVFGKTITDHL 265 (339) Q Consensus 196 ~dV~~p~~~R~~vv~P~~~~~Ll~~~--~~~n~---d~~~~~~~~l~~G~v~~i-~G~~V~~S----nnlp~~~~~~~~~ 265 (339) .- -..+.|+|.+|+..++|-... ++..+ +-+.+.-+......+|.+ .|++||.- ||-|.--- T Consensus 308 T~---rg~gn~~i~S~~Va~~L~~sg~l~~~p~~~~~~~~~~~d~~~~~~~G~l~~r~~vy~D~Ya~~ns~~dy~----- 379 (457) T protein:vir:10 308 TR---RGKGNILICSADVVSALGMAGVLDYTPALNGNNGLAGVDDTSSTLVGTLNGRIKVYVDPYSANVADKHFY----- 379 (457) T ss_pred hc---cccceEEEEchhHHHHHhhcccccccchhhccccccccccccceeEEEecCCeEEEEecccccCCccceE----- Confidence 22 356889999999999987643 23211 111111111233456664 67888886 55442111 Q ss_pred cCCCccccccccccceEEEEEeccceeEEEEEeeeeEEeeechhhhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 266 LSNANNEKAYDGDFKDIVAQMFSPKALLAGSTIPVTSKIFFDDLSKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 266 l~~~~~~~~y~~~~~~~~~~~~h~~A~~~~~~~~~~~e~~~~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) ...|.+......|+||.|=- .+..+ +.. +|++|--.|-=+..||- +++|.....=.-++. T Consensus 380 ------~vG~KG~~~~~~glfy~PYv----~l~~~--~~~-dp~sfqP~~g~~tRY~l-~~NP~~~~~~~~~~~ 439 (457) T protein:vir:10 380 ------VAGYKGTSPYDAGLFYCPYV----PLQQV--RAI-NPDTFQPKIGFKTRYGM-VSNPFAGGLTQGSGA 439 (457) T ss_pred ------EEEEeCCcceecceeecccc----ccccc--Ccc-CCccccceeeeeeeeee-eeccccccccccccc Confidence 11122333445567776642 22222 222 66667666655666776 667765432222222 No 207 >protein:vir:96079 Length: 382 # NCBI annotation: hypothetical protein ORF023 # Family: family:all:1653 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294440;genbank:gi:149408337;genbank:GeneID:5237198 Probab=25.98 E-value=2 Score=18.94 Aligned_cols=301 Identities=11% Similarity=-0.011 Sum_probs=123.5 Q ss_pred CccccCcccC------------------C-CcccCCccCcccchhHHHHHHHHHHHHHHHHHHhhhccccccccccc--- Q lcl|NC_020078. 1 MSIFDGQTPS------------------Y-DVTRPNQRHGAGDPLADVTEQFTGTVEGTIKRRSIMAGFVPVRSVRG--- 58 (339) Q Consensus 1 ~~~~~~~~~~------------------~-~~~r~~~~~~~~~~~a~~ie~~~g~v~~~f~~~sv~~~~v~~r~i~~--- 58 (339) --.||+.++. + +..=|.-.+ +..+.+-|+.-|...+.+..-.--+.+.++.+.+ ++ T Consensus 36 gi~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~~-~~g~p~~~l~~~~p~~~~~~~~p~~~~~l~pv~t-~g~W~ 113 (382) T protein:vir:96 36 GLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTTP-SIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDT-VGSWE 113 (382) T ss_pred ccccCcccchhHhhhhhhhhhhhhhcccccccCCccccC-CccHHHHHHhhhhhhhhhhhhhhhhhhhhccccc-cCCcc Confidence 1123333211 0 111111111 2234567889999877766666666677777665 33 Q ss_pred cceEEEec---cccceeeeccCCCCCCC-CCCCCccceEEEEeehhhhhhhHHHHHHHh---cCcchHHHHHHHHHHHHH Q lcl|NC_020078. 59 TSTISNRG---ISKAKLQKIAPGTTPPP-STEPHTSKIFLKIDTVIIARNAEPMLDEFQ---TDFDYQGEVAREQGQEIA 131 (339) Q Consensus 59 G~tv~i~~---iG~~t~~~~~~g~~i~~-~~~~~~~~~~l~ID~~~y~~~~vdd~D~~q---~~~d~~~~~~~~~g~aLA 131 (339) -+++.|+. .|+.++ |.-+++++- .-..+-.++++.. .-..+.+.++++++ +.+|+-++-.+.+..+|. T Consensus 114 ~~t~ty~~~e~~G~A~~--ygd~~D~Pl~d~~~~~~~r~v~~---~~~g~~yg~lE~~rAa~~~~~l~~~Ka~aA~~ale 188 (382) T protein:vir:96 114 DQEIVQGIVEPAGTAVE--YGDHTNIPLTSWNANFERRTIVR---GELGLLVGTLEEGRASAIRLNSAETKRQQAAIGLE 188 (382) T ss_pred ceEEEEeeeecccceEE--eecccCCCccccccceeEEEEEE---EEEeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHH Confidence 25666654 466543 333344421 1122223333332 22334445555555 578888888888888888 Q ss_pred HHHHHHHHHHHHhhccccccccccccc--ccCccccccccccCccccccHHHHHHHHHHHHHHHHhcC--CCCCc-CCeE Q lcl|NC_020078. 132 NMYDETFFIMAAKAAIASDSPYGTAAQ--MPGHSGGNVVTLAGANDYKDPAKLYAAIASLVEKFLEKD--VRPNE-EDMI 206 (339) Q Consensus 132 ~~~D~~i~~~l~~aA~~~~~~~~~~~~--~~g~~~~~~~~~~~~~~~~~~~~l~~ai~~a~~~L~e~d--V~~p~-~~R~ 206 (339) ++.|+..|.=-. +.. .....+. .|..+... ....+.-...+++.+++-|..+..+|..+. +..|. .... T Consensus 189 ~~~N~i~f~G~~--~g~---~~~~yGllNdP~l~a~~-t~a~~~Wa~kT~~eI~~Di~~l~~~i~~qt~G~~~~~~~~~~ 262 (382) T protein:vir:96 189 IFRNAIGFYGWQ--SGL---GNRTYGFLNDPNLPPFQ-TPPSQGWATADWAGIIGDIREAVRQLRIQSQDQIDPKAEKIT 262 (382) T ss_pred HhhceEEEEeee--cCc---CcceEEEEeCCCccccc-ccCCCCcccccHHHHHHHHHHHHHHHHhccCCeeeecccceE Confidence 888875531000 000 0000011 11111000 001111233468889988888888885543 21122 2336 Q ss_pred EEECHHHHHHHhcccchhhhcccccccceeecceeEEEeceEEEEeccccccccccccccCCCcccccccc------ccc Q lcl|NC_020078. 207 LVLPPAAFTALMQAEHITNGEYVTSAGETLNTKYMFAAFGVPVITSNNAVFGKTITDHLLSNANNEKAYDG------DFK 280 (339) Q Consensus 207 ~vv~P~~~~~Ll~~~~~~n~d~~~~~~~~l~~G~v~~i~G~~V~~Snnlp~~~~~~~~~l~~~~~~~~y~~------~~~ 280 (339) +++||..|..|-... +|.-+-.+-++. +.-+++|...+.+-. .+..+.... .-..-|.- +-+ T Consensus 263 L~LP~~~~~~Ls~~n-----~~g~Tvl~~lk~----n~Pnl~i~t~peL~~-a~~~g~g~~--~~~~~~~~e~~~~~~~s 330 (382) T protein:vir:96 263 MALATSKVDYLSVTT-----PYGISVSDWIEQ----TYPKMRIVSAPELSG-VQMQGKTPE--DALVLFVEEVDASVDGS 330 (382) T ss_pred EeechHHHhhccccC-----ccCccHHHHHHH----hcCCcEEEEcccccc-ccCCCccce--eEEEEecchhhhhcccc Confidence 889999998885421 121111111111 122344444443311 000000000 00000000 000 Q ss_pred eEEEEEe------ccceeEEEEEeeeeEEeeechhhhHHHHHHHHHhCCccccccceEEEEecC Q lcl|NC_020078. 281 DIVAQMF------SPKALLAGSTIPVTSKIFFDDLSKLWFIDSWLAFGVTINRTEYAGVIKLPA 338 (339) Q Consensus 281 ~~~~~~~------h~~A~~~~~~~~~~~e~~~~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~ 338 (339) ..+...| |..+++.- .....|+.+.. --..|+-+.||.+++-+. .- T Consensus 331 ~~~p~~f~q~~p~~~~~l~ve----~~~~~~~~~~s-------~~t~Gv~i~~P~ai~~~~-GI 382 (382) T protein:vir:96 331 TDGGSVFSQLVQSKFITLGVE----KRAKSYVEDFS-------NGTAGALCKRPWAVVRYL-GI 382 (382) T ss_pred cccCcceeccccceeeeccce----eecceeEeccc-------cceeeeEEEcchhhhhcc-CC Confidence 0011111 11111100 01111111111 134566677776655321 11 No 208 >protein:vir:5942 Length: 523 # NCBI annotation: similar to major head protein # Family: family:all:364 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835728;genbank:gi:30044131 Probab=20.39 E-value=2.8 Score=18.15 Aligned_cols=310 Identities=12% Similarity=0.087 Sum_probs=128.1 Q ss_pred CccccCcccCC-CcccCCcc-------------C-cccchhHHHHHHHHHHH---HHHHHHHhhhccccccccccccc-- Q lcl|NC_020078. 1 MSIFDGQTPSY-DVTRPNQR-------------H-GAGDPLADVTEQFTGTV---EGTIKRRSIMAGFVPVRSVRGTS-- 60 (339) Q Consensus 1 ~~~~~~~~~~~-~~~r~~~~-------------~-~~~~~~a~~ie~~~g~v---~~~f~~~sv~~~~v~~r~i~~G~-- 60 (339) |+ |....++. .-++.+.+ . ..+..-+..++.|.+.+ -+++........-.......+|. T Consensus 162 ~s-~si~k~~vTa~s~agta~~~li~A~~~q~itg~tga~fa~s~~~an~astAss~Al~gEA~t~~sTd~at~~~Gtt~ 240 (523) T protein:vir:59 162 SS-GAVYYVDVPVASLPGVADVNTVRFWQYDDASGDPENTVAYPLPRYNRIVGAVGSALYARLFFVTGSDFATVAGGTPS 240 (523) T ss_pred cc-cceeeeeccccccccccccccccccccccccccccccccchhhccccccccccccccccccccccccccccCCCccc Confidence 11 00000000 01111110 0 01111111122222111 01110000000000000010010 Q ss_pred ----eEEEe-ccccceeeeccCCC-CCCCCCCCCccceEEEEeeh--------hhhhhhHHHHHHHhc-C--cchHHHHH Q lcl|NC_020078. 61 ----TISNR-GISKAKLQKIAPGT-TPPPSTEPHTSKIFLKIDTV--------IIARNAEPMLDEFQT-D--FDYQGEVA 123 (339) Q Consensus 61 ----tv~i~-~iG~~t~~~~~~g~-~i~~~~~~~~~~~~l~ID~~--------~y~~~~vdd~D~~q~-~--~d~~~~~~ 123 (339) ..-.. ..|..+..--+.+. ...........|.-..||+. +-+...+.-.=+.++ | .|.-.|++ T Consensus 241 t~~~~~lyt~~~g~~t~~~~~~~~~~~~~~~~~~~~eM~FsIeK~tVtAkSRaLKAeYT~ELAQDLKAiH~GLDAE~ELa 320 (523) T protein:vir:59 241 TQDLDLVYYIDARNDFEDQSTDPDYPDPGFQSLDIPEINLELRSRPVATKTRKLRAAWTPEAMQDLAAYHKGVDLENEIV 320 (523) T ss_pred ccccccccccccccchhhccccccccccccccccccceeeEEEeEEEeeecccccccccHHHHHHHHHHhcCCChhHHHH Confidence 00000 01111100000000 00011112234555666632 223344444444455 3 89999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccccccccccccccCccccccccccCcccc-ccHH----HHHHHHHHHHHHHHhc-- Q lcl|NC_020078. 124 REQGQEIANMYDETFFIMAAKAAIASDSPYGTAAQMPGHSGGNVVTLAGANDY-KDPA----KLYAAIASLVEKFLEK-- 196 (339) Q Consensus 124 ~~~g~aLA~~~D~~i~~~l~~aA~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~-~~~~----~l~~ai~~a~~~L~e~-- 196 (339) .=++.++...+.+-|++.+..-|... +..+..+ .-+..+...++. +... ..++++..+..++.+- T Consensus 321 nILStEImlEINR~ii~~~~~~a~~~----~~~~~~~----~g~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~n 392 (523) T protein:vir:59 321 TLMSQYIAREIDLEILSTIMAHARRT----DNYGFWS----EVVGEYYDETSGNFVAGNFYGSKQEWLATLMIELNKVSN 392 (523) T ss_pred HHHHHHHHHHhhHHHHHhHhhhheee----eeccccc----cceeeecccccchhhhhhhhhhhHHHHHHHHHHHHHHHH Confidence 99999999999999999998655321 1111110 001111111111 1111 1234444443333211 Q ss_pred CC--CCC-cCCeEEEECHHHHHHHhcccchhhhccccccc-ceeecceeEEE-eceEEEEeccccccccccccccCCCcc Q lcl|NC_020078. 197 DV--RPN-EEDMILVLPPAAFTALMQAEHITNGEYVTSAG-ETLNTKYMFAA-FGVPVITSNNAVFGKTITDHLLSNANN 271 (339) Q Consensus 197 dV--~~p-~~~R~~vv~P~~~~~Ll~~~~~~n~d~~~~~~-~~l~~G~v~~i-~G~~V~~Snnlp~~~~~~~~~l~~~~~ 271 (339) .+ .+- ..+-|+|++|+..++|-..+-+..+......+ ++. .+|.+ .|++||+-++.|.---+- +-.++ T Consensus 393 ~i~~~t~~~~~~~~~~s~~v~~~l~~~~~~~~~~~~~~~~~~~~---~~g~l~~~~~vy~d~~~~~dy~~~----g~k~~ 465 (523) T protein:vir:59 393 RIQQKTAVAGANFLVTSPQVAALLESMPGFTPGNDNRDGGTGIF---YVGMVQGRYRLYKNIYQNQPVIIM----GNQDL 465 (523) T ss_pred HHHHhcccccccEEEEchhHHHHHHhccccccCCccccccccce---eEEEecCceEEEecCCCCcceEEE----Eeccc Confidence 11 111 14669999999999998777664332211111 111 23443 678999888766421111 11111 Q ss_pred ccccccccceEEEEEeccce-eEEEEEeeeeEEeeechhhhHHHHHHHHHhCCccccccceEEEEecCC Q lcl|NC_020078. 272 EKAYDGDFKDIVAQMFSPKA-LLAGSTIPVTSKIFFDDLSKLWFIDSWLAFGVTINRTEYAGVIKLPAA 339 (339) Q Consensus 272 ~~~y~~~~~~~~~~~~h~~A-~~~~~~~~~~~e~~~~~~~~~d~i~g~~~~Ga~v~rPe~~v~i~~~~a 339 (339) .+.|+ .++||.|=- |+. ..+..||++|--.|-=+..||..|.+|...+-|++.-- T Consensus 466 ~~~~~------~~~~y~Py~~l~~-------~~~~~dp~s~qp~~~~~tRY~l~v~nP~~~~~~~~~~~ 521 (523) T protein:vir:59 466 NTPWQ------TGAVYAPYVPLLF-------TPTIVDPVNFSYRRGLMTRYALEVVRPEFYGLLYVKLL 521 (523) T ss_pred CCccc------ccceecccchhhc-------ccccccCCcccceeeeeeehhheecchhHhhhhhhhhc Confidence 12222 366766542 221 12334778887777777889998889988877665433 Done!