Query lcl|NC_015719.1_cdsid_YP_004678755.1 [gene=EnPhK30_gp34] [protein=major capsid protein] [protein_id=YP_004678755.1] [location=21333..22367]
Match_columns 344
No_of_seqs 146 out of 163
Neff 7.8
Searched_HMMs 1612
Date Thu Nov 7 12:53:32 2013
Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_34 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_34_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:8885 Length: 347 # 100.0 9E-104 6E-107 585.4 28.5 344 1-344 1-347 (347)
2 protein:vir:94576 Length: 347 100.0 4E-103 2E-106 582.1 29.0 341 1-343 1-347 (347)
3 protein:vir:10450 Length: 344 100.0 2E-102 1E-105 578.3 29.6 338 1-343 1-344 (344)
4 protein:vir:94711 Length: 347 100.0 9E-102 5E-105 574.6 28.3 342 1-344 1-346 (347)
5 protein:vir:3364 Length: 347 # 100.0 8E-102 5E-105 574.9 27.4 338 1-344 1-346 (347)
6 protein:vir:2201 Length: 345 # 100.0 2E-101 1E-104 572.8 28.1 338 1-343 1-345 (345)
7 protein:vir:1541 Length: 347 # 100.0 2E-100 2E-103 566.7 26.4 341 1-344 1-346 (347)
8 protein:vir:100057 Length: 375 100.0 7.7E-98 5E-101 553.0 25.2 342 1-344 1-371 (375)
9 protein:vir:80213 Length: 334 100.0 1.4E-93 8.5E-97 529.7 26.7 327 1-344 1-333 (334)
10 protein:vir:103323 Length: 364 100.0 2.6E-92 1.6E-95 522.7 26.5 335 1-344 1-340 (364)
11 protein:vir:6324 Length: 335 # 100.0 1.3E-91 8E-95 518.9 25.5 321 1-344 1-329 (335)
12 protein:vir:78935 Length: 335 100.0 8.1E-91 5.1E-94 514.5 25.6 323 1-344 1-329 (335)
13 protein:vir:78739 Length: 332 100.0 1.9E-90 1.2E-93 512.4 24.1 321 1-341 4-332 (332)
14 protein:vir:97031 Length: 402 100.0 6.5E-90 4E-93 509.5 25.8 328 1-344 1-334 (402)
15 protein:vir:7019 Length: 401 # 100.0 2.6E-87 1.6E-90 495.3 25.8 327 1-344 1-334 (401)
16 protein:vir:105645 Length: 400 100.0 6.3E-87 3.9E-90 493.2 26.9 328 1-344 1-334 (400)
17 protein:vir:99675 Length: 324 100.0 2.3E-82 1.4E-85 468.2 23.2 292 50-344 1-297
(324) 18 protein:vir:94622 Length: 341 100.0 3.4E-70 2.1E-73 401.4 25.7 318 1-344 3-340 (341) 19 protein:vir:80180 Length: 381 100.0 2.7E-68 1.7E-71 391.0 22.3 332 1-343 1-381 (381) 20 protein:vir:3136 Length: 322 # 100.0 7E-60 4.3E-63 344.9 16.4 307 1-344 1-319 (322) 21 protein:vir:105822 Length: 273 100.0 1.3E-58 8.3E-62 337.9 21.1 267 1-343 1-273 (273) 22 protein:vir:102605 Length: 273 100.0 1.3E-58 8.3E-62 337.9 21.1 267 1-343 1-273 (273) 23 protein:vir:7990 Length: 273 # 100.0 1.4E-56 8.6E-60 326.8 20.3 267 1-343 1-273 (273) 24 protein:vir:102655 Length: 322 100.0 1E-53 6.3E-57 311.1 21.6 308 1-344 1-322 (322) 25 protein:vir:1781 Length: 221 # 100.0 4.5E-48 2.8E-51 280.1 13.6 217 95-335 1-221 (221) 26 protein:vir:97331 Length: 319 100.0 1E-44 6.4E-48 261.7 20.1 285 1-344 5-295 (319) 27 protein:vir:94800 Length: 319 100.0 1E-44 6.4E-48 261.7 20.1 285 1-344 5-295 (319) 28 protein:vir:107120 Length: 329 100.0 1.6E-44 9.7E-48 260.7 20.0 285 1-344 16-306 (329) 29 protein:vir:80930 Length: 278 100.0 1.7E-44 1E-47 260.6 19.1 272 1-344 1-278 (278) 30 protein:vir:96123 Length: 274 100.0 6.4E-43 4E-46 251.9 18.3 265 1-344 1-271 (274) 31 protein:vir:93742 Length: 274 100.0 2.1E-42 1.3E-45 249.0 17.5 265 1-344 1-271 (274) 32 protein:vir:108303 Length: 418 100.0 1.1E-41 6.5E-45 245.2 20.5 297 1-344 1-418 (418) 33 protein:vir:1239 Length: 274 # 100.0 7.6E-42 4.7E-45 246.0 18.6 265 1-344 1-271 (274) 34 protein:vir:96262 Length: 274 100.0 7.9E-42 4.9E-45 245.9 18.1 265 1-344 1-271 (274) 35 protein:vir:95898 Length: 274 100.0 7.9E-42 4.9E-45 245.9 18.1 265 1-344 1-271 (274) 36 protein:vir:97433 Length: 274 100.0 1.2E-41 7.7E-45 244.8 18.1 265 1-344 1-271 (274) 37 protein:vir:94494 Length: 274 100.0 1.2E-41 7.7E-45 244.8 18.1 265 1-344 1-271 (274) 38 protein:vir:96833 Length: 275 100.0 2.9E-41 1.8E-44 242.8 17.7 266 1-344 1-272 (275) 39 protein:vir:3613 Length: 272 # 100.0 3.7E-41 2.3E-44 242.3 17.6 267 1-343 1-272 (272) 40 protein:vir:99075 Length: 392 100.0 1.4E-39 8.5E-43 
233.6 21.1 287 1-344 1-303 (392) 41 protein:vir:3525 Length: 423 # 100.0 2E-38 1.2E-41 227.3 21.8 298 1-342 1-423 (423) 42 protein:vir:174 Length: 423 # 100.0 5.6E-38 3.5E-41 224.8 22.0 299 1-342 1-423 (423) 43 protein:vir:105374 Length: 423 100.0 6E-38 3.7E-41 224.6 21.8 298 1-342 1-423 (423) 44 protein:vir:105334 Length: 276 100.0 6.3E-38 3.9E-41 224.5 18.2 265 1-344 1-271 (276) 45 protein:vir:79008 Length: 299 100.0 3.6E-37 2.2E-40 220.4 20.9 284 1-344 1-297 (299) 46 protein:vir:105522 Length: 423 100.0 1.8E-36 1.1E-39 216.5 20.8 298 1-342 1-423 (423) 47 protein:vir:3033 Length: 272 # 100.0 6.9E-37 4.3E-40 218.8 17.8 264 1-344 1-270 (272) 48 protein:vir:9820 Length: 272 # 100.0 6.9E-37 4.3E-40 218.8 17.8 264 1-344 1-270 (272) 49 protein:vir:78920 Length: 290 100.0 2.4E-32 1.5E-35 194.0 19.6 280 1-343 1-290 (290) 50 protein:vir:102335 Length: 312 100.0 1.8E-30 1.1E-33 183.6 20.0 300 1-344 1-308 (312) 51 protein:vir:105464 Length: 346 100.0 2.2E-30 1.4E-33 183.2 19.6 285 1-344 1-298 (346) 52 protein:vir:739 Length: 231 # 99.9 9.5E-31 5.9E-34 185.2 13.8 230 51-343 1-231 (231) 53 protein:vir:95107 Length: 270 99.9 5.7E-29 3.5E-32 175.4 17.4 262 1-344 1-266 (270) 54 protein:vir:99523 Length: 311 99.9 6.9E-26 4.3E-29 158.5 19.1 297 17-344 1-311 (311) 55 protein:vir:79712 Length: 285 99.9 2E-25 1.2E-28 156.0 17.4 267 1-344 1-283 (285) 56 protein:vir:95451 Length: 313 99.9 1E-26 6.4E-30 163.1 9.3 300 17-344 1-312 (313) 57 protein:vir:78090 Length: 302 99.9 2.6E-23 1.6E-26 144.4 18.1 285 1-344 1-300 (302) 58 protein:vir:2106 Length: 430 # 99.7 7E-20 4.3E-23 125.6 17.6 300 1-344 1-430 (430) 59 protein:vir:9265 Length: 430 # 99.7 1.1E-19 7E-23 124.4 17.2 300 1-344 1-430 (430) 60 protein:vir:100939 Length: 430 99.7 1.1E-19 7E-23 124.4 17.2 300 1-344 1-430 (430) 61 protein:vir:78523 Length: 338 99.6 2.4E-17 1.5E-20 111.7 18.6 308 1-344 1-336 (338) 62 protein:vir:41 Length: 299 # N 99.6 7.8E-17 4.9E-20 108.9 19.9 282 10-344 1-299 (299) 63 protein:vir:78223 Length: 333 99.6 
1.5E-16 9.3E-20 107.3 19.0 306 1-344 1-332 (333) 64 protein:vir:6242 Length: 390 # 99.6 6.5E-17 4E-20 109.3 15.3 291 1-344 97-390 (390) 65 protein:vir:1328 Length: 392 # 99.6 2.2E-16 1.4E-19 106.4 16.4 293 1-344 97-392 (392) 66 protein:vir:7771 Length: 330 # 99.5 1.1E-15 6.8E-19 102.6 19.0 298 1-344 1-324 (330) 67 protein:vir:98339 Length: 415 99.5 1.9E-15 1.2E-18 101.3 18.2 293 1-344 109-405 (415) 68 protein:vir:81100 Length: 415 99.5 1.9E-15 1.2E-18 101.3 18.2 293 1-344 109-405 (415) 69 protein:vir:79987 Length: 415 99.5 1.9E-15 1.2E-18 101.3 18.2 293 1-344 109-405 (415) 70 protein:vir:4511 Length: 409 # 99.5 3.7E-15 2.3E-18 99.7 18.5 298 1-344 93-407 (409) 71 protein:vir:105905 Length: 304 99.5 5.6E-15 3.5E-18 98.7 19.3 285 1-342 1-304 (304) 72 protein:vir:94142 Length: 304 99.5 5.6E-15 3.5E-18 98.7 19.3 285 1-342 1-304 (304) 73 protein:vir:4700 Length: 415 # 99.5 3.4E-15 2.1E-18 99.9 17.6 296 1-344 106-405 (415) 74 protein:vir:4600 Length: 415 # 99.5 3.4E-15 2.1E-18 99.9 17.6 296 1-344 106-405 (415) 75 protein:vir:96223 Length: 324 99.5 3.6E-15 2.2E-18 99.8 17.4 285 1-344 15-316 (324) 76 protein:vir:8187 Length: 311 # 99.5 7E-15 4.4E-18 98.2 18.9 291 1-344 1-311 (311) 77 protein:vir:9410 Length: 415 # 99.5 2.1E-15 1.3E-18 101.1 15.8 293 1-344 109-405 (415) 78 protein:vir:8102 Length: 543 # 99.5 5.7E-15 3.5E-18 98.7 17.6 295 1-344 237-543 (543) 79 protein:vir:9309 Length: 324 # 99.5 7E-15 4.4E-18 98.2 17.3 279 1-344 21-316 (324) 80 protein:vir:9759 Length: 303 # 99.5 1.4E-14 8.6E-18 96.5 18.9 287 1-343 1-303 (303) 81 protein:vir:1886 Length: 385 # 99.4 6.4E-15 4E-18 98.4 16.4 287 1-344 93-385 (385) 82 protein:vir:191 Length: 385 # 99.4 6.4E-15 4E-18 98.4 16.4 287 1-344 93-385 (385) 83 protein:vir:97053 Length: 390 99.4 8.1E-15 5E-18 97.8 16.6 287 1-341 99-390 (390) 84 protein:vir:94771 Length: 298 99.4 2.3E-14 1.4E-17 95.4 19.0 282 1-342 1-298 (298) 85 protein:vir:96392 Length: 324 99.4 7.9E-15 4.9E-18 97.9 16.2 285 1-344 15-316 (324) 86 protein:vir:78830 
Length: 324 99.4 7.9E-15 4.9E-18 97.9 16.2 285 1-344 15-316 (324) 87 protein:vir:4339 Length: 395 # 99.4 1.9E-14 1.2E-17 95.7 17.4 291 1-343 98-395 (395) 88 protein:vir:3870 Length: 400 # 99.4 8.3E-15 5.2E-18 97.8 14.9 277 1-344 120-400 (400) 89 protein:vir:9574 Length: 300 # 99.4 6.1E-14 3.8E-17 93.0 19.6 284 1-343 1-300 (300) 90 protein:vir:97148 Length: 324 99.4 1.6E-14 1E-17 96.1 16.3 287 1-344 1-316 (324) 91 protein:vir:4856 Length: 293 # 99.4 4.2E-14 2.6E-17 93.9 18.2 274 1-344 1-282 (293) 92 protein:vir:95763 Length: 297 99.4 3.8E-14 2.3E-17 94.2 18.0 279 1-344 1-297 (297) 93 protein:vir:103955 Length: 324 99.4 3E-14 1.9E-17 94.7 17.1 282 1-344 18-316 (324) 94 protein:vir:99749 Length: 324 99.4 3.9E-14 2.4E-17 94.1 17.6 282 1-344 18-316 (324) 95 protein:vir:1638 Length: 298 # 99.4 1E-13 6.3E-17 91.8 19.2 281 1-342 1-298 (298) 96 protein:vir:100135 Length: 418 99.4 5.6E-14 3.5E-17 93.2 17.7 288 1-344 121-416 (418) 97 protein:vir:485 Length: 407 # 99.4 5.5E-14 3.4E-17 93.3 17.2 297 1-344 90-401 (407) 98 protein:vir:104085 Length: 320 99.4 1.1E-13 7E-17 91.6 18.8 295 1-344 1-317 (320) 99 protein:vir:80684 Length: 315 99.4 8E-14 5E-17 92.4 17.7 288 1-344 1-307 (315) 100 protein:vir:104256 Length: 458 99.4 6.9E-14 4.3E-17 92.7 17.0 294 1-343 155-458 (458) 101 protein:vir:4830 Length: 397 # 99.4 7.4E-14 4.6E-17 92.6 17.1 284 1-344 94-386 (397) 102 protein:vir:4997 Length: 397 # 99.4 9E-14 5.6E-17 92.1 17.1 284 1-344 95-386 (397) 103 protein:vir:10364 Length: 390 99.4 8.7E-14 5.4E-17 92.2 16.7 279 1-341 107-390 (390) 104 protein:vir:2344 Length: 397 # 99.4 1.1E-13 6.7E-17 91.7 17.1 285 1-344 1-307 (397) 105 protein:vir:81070 Length: 390 99.3 9.5E-14 5.9E-17 92.0 16.7 279 1-341 107-390 (390) 106 protein:vir:94673 Length: 419 99.3 1E-13 6.2E-17 91.9 16.2 295 1-344 110-418 (419) 107 protein:vir:2430 Length: 318 # 99.3 4.7E-13 2.9E-16 88.2 18.9 278 1-344 14-314 (318) 108 protein:vir:3991 Length: 404 # 99.3 4.9E-13 3E-16 88.1 18.7 285 1-344 98-394 (404) 109 
protein:vir:99920 Length: 311 99.3 5.3E-13 3.3E-16 87.9 18.8 296 1-344 1-311 (311) 110 protein:vir:102119 Length: 404 99.3 3.3E-13 2E-16 89.0 17.6 297 1-344 92-401 (404) 111 protein:vir:4953 Length: 397 # 99.3 2.5E-13 1.5E-16 89.7 16.8 281 1-344 95-386 (397) 112 protein:vir:5974 Length: 324 # 99.3 3.9E-13 2.4E-16 88.6 17.3 280 1-344 1-295 (324) 113 protein:vir:101607 Length: 379 99.3 4.3E-13 2.7E-16 88.4 17.2 273 1-343 98-379 (379) 114 protein:vir:100247 Length: 425 99.3 6.1E-13 3.8E-16 87.5 17.9 296 1-344 117-425 (425) 115 protein:vir:95376 Length: 425 99.3 4.8E-13 3E-16 88.1 16.9 292 1-344 119-422 (425) 116 protein:vir:2504 Length: 305 # 99.3 1.7E-12 1E-15 85.2 19.4 282 1-344 1-299 (305) 117 protein:vir:80376 Length: 435 99.3 1.9E-12 1.2E-15 84.8 19.6 296 1-344 105-434 (435) 118 protein:vir:1383 Length: 421 # 99.3 2.8E-13 1.7E-16 89.4 14.8 275 1-344 104-384 (421) 119 protein:vir:4456 Length: 401 # 99.3 6.3E-13 3.9E-16 87.5 16.6 300 1-343 91-401 (401) 120 protein:vir:100172 Length: 394 99.3 9E-13 5.6E-16 86.6 17.3 278 1-344 101-385 (394) 121 protein:vir:1583 Length: 351 # 99.3 4.6E-13 2.9E-16 88.2 15.5 283 1-344 1-300 (351) 122 protein:vir:7409 Length: 408 # 99.2 1.1E-12 6.9E-16 86.1 17.3 284 1-344 97-394 (408) 123 protein:vir:4226 Length: 326 # 99.2 2.3E-12 1.4E-15 84.4 18.8 291 1-344 1-324 (326) 124 protein:vir:1268 Length: 397 # 99.2 1.3E-12 8.3E-16 85.7 16.9 281 1-343 102-397 (397) 125 protein:vir:6212 Length: 434 # 99.2 8.6E-13 5.3E-16 86.7 15.6 289 1-344 131-430 (434) 126 protein:vir:1433 Length: 435 # 99.2 4.8E-12 3E-15 82.6 19.3 295 1-344 105-435 (435) 127 protein:vir:105038 Length: 428 99.2 7.2E-12 4.5E-15 81.7 20.2 296 1-343 113-428 (428) 128 protein:vir:81160 Length: 371 99.2 3E-12 1.9E-15 83.7 17.9 281 1-343 84-371 (371) 129 protein:vir:1025 Length: 408 # 99.2 2.6E-12 1.6E-15 84.1 17.5 284 1-344 101-394 (408) 130 protein:vir:5739 Length: 366 # 99.2 5.9E-12 3.7E-15 82.1 19.3 294 1-343 52-366 (366) 131 protein:vir:9704 Length: 394 # 99.2 2.3E-12 1.4E-15 
84.4 16.3 275 1-344 115-391 (394) 132 protein:vir:100884 Length: 389 99.2 3.4E-12 2.1E-15 83.5 16.6 280 1-344 95-383 (389) 133 protein:vir:1084 Length: 437 # 99.2 2.6E-12 1.6E-15 84.1 15.8 282 1-344 141-428 (437) 134 protein:vir:102944 Length: 330 99.2 1.1E-12 6.7E-16 86.2 13.6 282 1-344 1-297 (330) 135 protein:vir:81227 Length: 413 99.2 4.3E-12 2.7E-15 82.9 16.7 291 1-344 105-411 (413) 136 protein:vir:3845 Length: 395 # 99.2 6.1E-12 3.8E-15 82.1 17.1 277 1-344 98-384 (395) 137 protein:vir:78640 Length: 352 99.1 3.7E-12 2.3E-15 83.3 14.0 273 1-344 64-347 (352) 138 protein:vir:96762 Length: 632 99.1 1.1E-11 7E-15 80.6 16.3 284 1-342 334-632 (632) 139 protein:vir:102873 Length: 392 99.1 2.7E-11 1.7E-14 78.5 18.4 284 1-344 84-385 (392) 140 protein:vir:105004 Length: 392 99.1 2.7E-11 1.7E-14 78.5 18.4 284 1-344 84-385 (392) 141 protein:vir:107593 Length: 392 99.1 2.7E-11 1.7E-14 78.5 18.4 284 1-344 84-385 (392) 142 protein:vir:102082 Length: 392 99.1 2.7E-11 1.7E-14 78.5 18.4 284 1-344 84-385 (392) 143 protein:vir:4092 Length: 390 # 99.1 3.8E-11 2.4E-14 77.7 18.0 292 1-344 68-369 (390) 144 protein:vir:8420 Length: 477 # 99.0 6.6E-11 4.1E-14 76.4 17.7 300 1-344 148-472 (477) 145 protein:vir:105610 Length: 430 99.0 1.5E-10 9.4E-14 74.4 19.6 319 1-344 1-423 (430) 146 protein:vir:93616 Length: 645 99.0 8E-11 4.9E-14 75.9 17.4 289 1-344 331-641 (645) 147 protein:vir:962 Length: 397 # 99.0 1.5E-11 9.3E-15 79.9 13.2 275 1-343 121-397 (397) 148 protein:vir:7855 Length: 497 # 99.0 7.8E-11 4.9E-14 76.0 16.7 298 1-344 138-494 (497) 149 protein:vir:101650 Length: 497 99.0 7.8E-11 4.9E-14 76.0 16.7 298 1-344 138-494 (497) 150 protein:vir:9361 Length: 402 # 99.0 2.4E-11 1.5E-14 78.8 13.3 273 1-344 114-397 (402) 151 protein:vir:93881 Length: 387 99.0 6.5E-11 4E-14 76.4 14.7 273 1-344 99-382 (387) 152 protein:vir:2770 Length: 318 # 98.9 3.6E-10 2.2E-13 72.4 18.7 257 1-287 1-318 (318) 153 protein:vir:2685 Length: 387 # 98.9 6.5E-11 4E-14 76.4 13.1 273 1-344 99-382 (387) 154 
protein:vir:96978 Length: 387 98.9 6.5E-11 4E-14 76.4 13.1 273 1-344 99-382 (387) 155 protein:vir:94424 Length: 387 98.9 6.5E-11 4E-14 76.4 13.1 273 1-344 99-382 (387) 156 protein:vir:93696 Length: 364 98.9 5.9E-10 3.7E-13 71.2 18.4 310 1-344 1-360 (364) 157 protein:vir:9927 Length: 295 # 98.8 1E-09 6.3E-13 69.9 16.2 271 1-344 1-289 (295) 158 protein:vir:108211 Length: 318 98.8 1.2E-09 7.6E-13 69.4 16.0 293 1-344 1-318 (318) 159 protein:vir:95875 Length: 401 98.8 2.4E-09 1.5E-12 67.8 17.6 317 1-344 1-401 (401) 160 protein:vir:9875 Length: 296 # 98.8 3.9E-09 2.4E-12 66.7 18.5 279 1-344 1-296 (296) 161 protein:vir:9643 Length: 377 # 98.7 5.8E-09 3.6E-12 65.7 16.7 284 1-343 59-377 (377) 162 protein:vir:10123 Length: 404 98.7 1.2E-08 7.5E-12 64.0 18.3 338 1-344 1-402 (404) 163 protein:vir:819 Length: 404 # 98.7 1.2E-08 7.5E-12 64.0 18.3 338 1-344 1-402 (404) 164 protein:vir:104439 Length: 404 98.7 1.2E-08 7.5E-12 64.0 18.3 338 1-344 1-402 (404) 165 protein:vir:3298 Length: 404 # 98.7 1.2E-08 7.5E-12 64.0 18.3 338 1-344 1-402 (404) 166 protein:vir:4197 Length: 314 # 98.6 7.1E-09 4.4E-12 65.3 16.6 299 1-344 1-312 (314) 167 protein:vir:80128 Length: 466 98.6 1.5E-09 9.2E-13 69.0 12.7 293 1-344 123-449 (466) 168 protein:vir:78350 Length: 383 98.6 1.1E-08 6.9E-12 64.2 15.8 286 1-344 64-376 (383) 169 protein:vir:9509 Length: 381 # 98.5 1.4E-08 9E-12 63.6 15.7 285 1-344 57-369 (381) 170 protein:vir:101291 Length: 381 98.5 1.4E-08 9E-12 63.6 15.7 285 1-344 57-369 (381) 171 protein:vir:3158 Length: 321 # 98.5 4.4E-09 2.7E-12 66.4 12.7 295 1-344 1-313 (321) 172 protein:vir:98635 Length: 377 98.5 9.3E-09 5.8E-12 64.6 13.9 284 1-343 59-377 (377) 173 protein:vir:100632 Length: 381 98.5 2.6E-08 1.6E-11 62.1 15.2 283 1-344 57-373 (381) 174 protein:vir:95963 Length: 395 98.5 1.8E-08 1.1E-11 63.1 14.2 288 1-344 61-377 (395) 175 protein:vir:4159 Length: 315 # 98.4 3.1E-08 1.9E-11 61.8 14.9 304 1-342 1-315 (315) 176 protein:vir:106647 Length: 303 98.2 4.4E-07 2.7E-10 55.4 15.8 274 1-344 
1-297 (303) 177 protein:vir:79928 Length: 393 98.0 2.9E-07 1.8E-10 56.4 12.2 303 1-344 59-382 (393) 178 protein:vir:80446 Length: 367 98.0 2.3E-06 1.5E-09 51.4 16.6 297 1-344 1-337 (367) 179 protein:vir:78387 Length: 349 97.3 0.00011 6.8E-08 42.3 18.8 289 1-344 1-321 (349) 180 protein:vir:98871 Length: 314 97.3 4.9E-05 3E-08 44.2 14.3 283 1-344 11-312 (314) 181 protein:vir:97397 Length: 517 97.0 0.00013 7.9E-08 42.0 14.3 282 1-344 226-515 (517) 182 protein:vir:94528 Length: 286 96.9 0.00023 1.4E-07 40.5 15.1 269 1-344 1-286 (286) 183 protein:vir:3969 Length: 287 # 96.9 0.00018 1.1E-07 41.1 14.3 268 20-344 1-287 (287) 184 protein:vir:94989 Length: 349 96.6 0.00045 2.8E-07 38.9 19.3 289 1-344 1-327 (349) 185 protein:vir:107687 Length: 319 96.6 0.00038 2.4E-07 39.3 14.2 294 1-341 1-319 (319) 186 protein:vir:5942 Length: 523 # 96.3 0.00075 4.7E-07 37.7 14.6 313 1-344 162-522 (523) 187 protein:vir:103285 Length: 296 96.3 0.00078 4.8E-07 37.6 14.8 274 16-341 1-296 (296) 188 protein:vir:80068 Length: 301 96.2 0.00083 5.1E-07 37.5 15.7 283 19-341 1-301 (301) 189 protein:vir:103181 Length: 457 93.1 0.0088 5.5E-06 31.9 14.1 310 1-344 97-440 (457) 190 protein:vir:95512 Length: 693 91.9 0.014 8.5E-06 30.8 14.0 296 1-344 371-693 (693) 191 protein:vir:79548 Length: 652 91.5 0.016 9.8E-06 30.5 14.7 291 1-340 336-652 (652) 192 protein:vir:4786 Length: 295 # 90.4 0.021 1.3E-05 29.8 14.0 272 10-331 1-295 (295) 193 protein:vir:104342 Length: 314 90.2 0.022 1.4E-05 29.6 11.9 286 1-341 1-314 (314) 194 protein:vir:94933 Length: 330 90.0 0.023 1.5E-05 29.5 15.6 307 1-344 1-330 (330) 195 protein:vir:4074 Length: 480 # 88.7 0.03 1.9E-05 28.9 14.1 275 1-344 171-478 (480) 196 protein:vir:97255 Length: 310 88.6 0.031 1.9E-05 28.8 16.7 291 1-343 1-310 (310) 197 protein:vir:79642 Length: 329 87.5 0.038 2.4E-05 28.4 14.9 295 1-341 14-329 (329) 198 protein:vir:8324 Length: 410 # 83.2 0.07 4.4E-05 26.9 11.1 267 1-341 127-410 (410) 199 protein:vir:10324 Length: 320 75.9 0.14 8.7E-05 25.2 10.8 281 
10-344 1-318 (320) 200 protein:vir:95131 Length: 325 72.2 0.19 0.00011 24.6 15.4 274 18-344 1-299 (325) 201 protein:vir:104549 Length: 462 68.1 0.24 0.00015 24.0 15.1 302 1-344 97-449 (462) 202 protein:vir:107732 Length: 379 63.8 0.31 0.00019 23.4 14.9 299 1-344 56-379 (379) 203 protein:vir:94070 Length: 339 63.6 0.31 0.0002 23.3 11.8 287 1-341 35-339 (339) 204 protein:vir:79078 Length: 307 61.3 0.36 0.00022 23.0 10.6 285 1-343 1-307 (307) 205 protein:vir:107882 Length: 307 51.4 0.58 0.00036 21.9 12.6 288 1-343 1-307 (307) 206 protein:vir:270 Length: 341 # 48.7 0.66 0.00041 21.6 8.7 292 1-344 1-333 (341) 207 protein:vir:103886 Length: 302 48.5 0.67 0.00041 21.6 14.3 280 1-344 1-301 (302) 208 protein:vir:99888 Length: 309 43.0 0.86 0.00053 20.9 10.0 284 1-344 1-308 (309) 209 protein:vir:78148 Length: 123 41.2 0.62 0.00038 21.7 5.1 116 206-343 1-123 (123) 210 protein:vir:5670 Length: 514 # 34.2 1.3 0.00081 19.9 13.1 299 1-344 133-494 (514) 211 protein:vir:96079 Length: 382 25.0 2.1 0.0013 18.8 16.9 299 1-341 63-382 (382) 212 protein:vir:106286 Length: 534 20.7 2.7 0.0017 18.2 16.4 305 1-344 125-517 (534) 213 protein:vir:6601 Length: 528 # 20.2 2.8 0.0017 18.1 15.5 314 1-344 116-502 (528) No 1 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=100.00 E-value=9.3e-104 Score=585.44 Aligned_cols=344 Identities=74% Similarity=1.124 Sum_probs=324.0 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcceeeeeeCCCCCC Q lcl|NC_015719. 
1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRTKAAYLQPGESLD 80 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~~~ 80 (344) |||++++++.+||+||+++++|+++||||+|+|||+++|+++|+|+++++.|++++||++|||++|++++.+|++|++++ T Consensus 1 ~a~~~~~~~~~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~G~sv~~~~iG~~~~~~~~~g~~l~ 80 (347) T protein:vir:88 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTKGYYLAPGENLD 80 (347) T ss_pred CCCcccchhhhccCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhhccccccccCcceEEEeeecceeeeeeccccCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccC Q lcl|NC_015719. 81 DKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGLGK 160 (344) Q Consensus 81 ~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~~~ 160 (344) ++.+++++++++|+||+++|++|.|||+|++|+++|+|+++++++|++||+++|++|+++++++++.+.+....++|+.. T Consensus 81 ~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~~~~~~~g~~~ 160 (347) T protein:vir:88 81 DKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLGQ 160 (347) T ss_pred CCCCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCCccc Confidence 88888999999999999999999999999999999999999999999999999999999999999988888888888888 Q ss_pred ceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccceeEEE Q lcl|NC_015719. 
161 PSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSIRNV 240 (344) Q Consensus 161 ~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg~i 240 (344) ++.+.+++++..+++..+++++|+.|++|+++|+|++||++|||+||+|++|++||+++++++.++.+...+++|.|+++ T Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~vg~i 240 (347) T protein:vir:88 161 AVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIRNV 240 (347) T ss_pred cccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCHHHHHHHhcchhhhhhhhccccchhcceeeee Confidence 88888888888899999999999999999999999999999999999999999999999999999999889999999999 Q ss_pred eCeEEEEeccccccccccccccccccccccccccccccc---cccccceeEEEecHHHHhhhhhheeeeeeeecchhhhh Q lcl|NC_015719. 241 MGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGG---KVNKENVVGLFQHRSAVGTVKLKDLALERARRAEYQAD 317 (344) Q Consensus 241 ~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~~~d 317 (344) +||+||+|||+|.+..+.++.+.++++++..+.+...-. ..++++.++|+||++|+++++++++++|.+|++++|+| T Consensus 241 ~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~d~~~e~~r~~~~~~d 320 (347) T protein:vir:88 241 MGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEFQAD 320 (347) T ss_pred ccceEEEeecccccccccccccccccccccccccccccccccccccCcEEEEEechhhhhheecccceeeeeechhhHHH Confidence 999999999999988888888777777776554332221 23467799999999999999999999999999999999 Q ss_pred hhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
318 QIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 318 ~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +|+++++||++++||||+++|+++..| T Consensus 321 ~i~~~~~~G~~~~rPe~a~~~~~~~a~ 347 (347) T protein:vir:88 321 QIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) T ss_pred HhhhhhhhcCceeccceEEEEEeCCCC Confidence 999999999999999999999999999 No 2 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=100.00 E-value=3.8e-103 Score=582.09 Aligned_cols=341 Identities=81% Similarity=1.166 Sum_probs=317.8 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcceeeeeeCCCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRTKAAYLQPGESLD 80 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~~~ 80 (344) |||++++++++||+||+++++|+++||||+|+|||+++|+++|+|+++++.|+|++|||++||++|++++.+|+||++++ T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~rti~~G~sv~~~~iG~~~~~~~~~G~~l~ 80 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGDKLALFLKVFGGEVLTAFTRTSVTMNKHLVRSIQSGKSAQFPVLGRTKAAYLQPGENLD 80 (347) T ss_pred CCccccccccccccccCCcccchHHHHHHHHhHHHHHHHHHHHhhhhhhhheeccccceEEeeeccceeEeeeecCcCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccC Q lcl|NC_015719. 81 DKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGLGK 160 (344) Q Consensus 81 ~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~~~ 160 (344) ++.+++++++++|+||+.+|++|.|||+|++|++||+|+++++++|++||+++|++|+++++++++.+.+....+.+.+. 
T Consensus 81 ~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~~~~~~g~~~ 160 (347) T protein:vir:94 81 DKRKDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTANNENIAGLGK 160 (347) T ss_pred CCcCCccccceEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCc Confidence 98888999999999999999999999999999999999999999999999999999999999999998888888888877 Q ss_pred ceeeeccccc-ccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccceeEE Q lcl|NC_015719. 161 PSLLEVGAKA-DLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSIRN 239 (344) Q Consensus 161 ~~~i~~~~~~-~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg~ 239 (344) +..+.++... ..+++...+.++|+.|++|+++|+|++||++|||+||+|++|+.||+...+...++.+...+++|.|++ T Consensus 161 ~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~R~~vv~P~~y~~LLk~~~~~~~~~~~~~~~~~G~V~~ 240 (347) T protein:vir:94 161 AHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSDRVFYTTPDNYSAILAALMPNAANYQALIDPSTGSIRN 240 (347) T ss_pred ceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCCCCCEEEeChHHHHHHHHhhcccccccccccccccceeEE Confidence 7777766544 344556678889999999999999999999999999999999999998788888888888899999999 Q ss_pred EeCeEEEEecccccccccccccccccccccccccccc-----ccccccccceeEEEecHHHHhhhhhheeeeeeeecchh Q lcl|NC_015719. 240 VMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPA-----TGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEY 314 (344) Q Consensus 240 i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~ 314 (344) ++||+||+|||+|.+.++.++.+++.+.+++.|.|+. 
|++ ++++++||+||++|+++++++++++|.+||+++ T Consensus 241 v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~--d~~~~~~l~~~~~A~~tv~~~~~~~e~~~~~~~ 318 (347) T protein:vir:94 241 VMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRV--ALDNVVGLFNHRSAVGTVKLKDMALERARRANF 318 (347) T ss_pred eeceEEEEcCccccccCcccccccccccccccccccccccccccc--cccceEEEEechhhhhhhhhcccceeeeechhh Confidence 9999999999999999999999888887777765443 544 567799999999999999999999999999999 Q ss_pred hhhhhhhhhhhcCceeccccEEEEEecCC Q lcl|NC_015719. 315 QADQIIAKYAMGHGGLRPESAGALVFKAG 343 (344) Q Consensus 315 ~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~ 343 (344) |+|+|+++++|||+++||||+++|+++.- T Consensus 319 ~~~~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 319 QADQIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred hhhhhhhhhhhcCcccccceeEEEEecCC Confidence 99999999999999999999999999887 No 3 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=100.00 E-value=1.9e-102 Score=578.30 Aligned_cols=338 Identities=78% Similarity=1.139 Sum_probs=306.4 Q ss_pred CCCcccccccc--ccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcceeeeeeCCCC Q lcl|NC_015719. 
1 MANMQGGQQLG--TNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRTKAAYLQPGES 78 (344) Q Consensus 1 ma~~~~~~~~~--~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~ 78 (344) |||++++++.+ ++|+| ++++|.++||||+|+|||+++|+++|+++++++.|+|++|||++||++|++++++|+||++ T Consensus 1 ma~~~~~~~~n~~~~~~~-~~~~~~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~g~s~~~~~iG~~~~~~~~~G~~ 79 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDV-MAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVLGRTQAAYLAPGEN 79 (344) T ss_pred CccccccccCCcccCCcc-CCccchhHHHHHHHHHHHHHHHHHHhhhcccceeeeecccceEEEEeeceeEEEeeecCCC Confidence 99999997766 45554 4777888999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccc Q lcl|NC_015719. 79 LDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGL 158 (344) Q Consensus 79 ~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~ 158 (344) ++++.+++++++++|+||+.+|++|.|||+|++|++||+|+++++++|++||+++|++|+++++++++.+++.+..++++ T Consensus 80 l~~t~~~~~~~e~~l~ID~~~y~~~~VdDiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~~~~~~~~g~ 159 (344) T protein:vir:10 80 LDDIRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESQYNENITGL 159 (344) T ss_pred CCCCCCCcccceEEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccc Confidence 99988899999999999999999999999999999999999999999999999999999999999999999999999988 Q ss_pred cCceeeeccc-ccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcccccccccccee Q lcl|NC_015719. 159 GKPSLLEVGA-KADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSI 237 (344) Q Consensus 159 ~~~~~i~~~~-~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V 237 (344) +.++++.... 
+...+++...++++|+.|++|+++|+|++||++|||+||+|++|++||+++++++.+|++++.+++|+| T Consensus 160 ~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~V 239 (344) T protein:vir:10 160 GTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPSSDRVFYCDPDSYSAILAALMPNAANYAALIDPEKGSI 239 (344) T ss_pred cccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCCEEEeChHHHHHHhhcccccccccccccceeeeEE Confidence 8887776554 334566777788999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEeCeEEEEecccccccccccccccccccccccccccccc---ccccccceeEEEecHHHHhhhhhheeeeeeeecchh Q lcl|NC_015719. 238 RNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATG---GKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEY 314 (344) Q Consensus 238 g~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~ 314 (344) ++++||+||+|||+|.++.+.+.++ .++..+.+++.. ..+++++.|||+|||+|+++++++++++|.+|++++ T Consensus 240 ~~v~G~~V~~Sn~lp~~~~~~~~~~----~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~~~e~~r~~~~ 315 (344) T protein:vir:10 240 RNVMGFEVVEVPHLTAGGAGTSREG----TTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLALERARRANF 315 (344) T ss_pred EEEeceEEEeccccccccCCccccc----ccCccccccCCcccceeeecceeEEEeechhhhhhhhhccceeecccchhH Confidence 9999999999999998776655333 223333333322 245778899999999999999999999999999999 Q ss_pred hhhhhhhhhhhcCceeccccEEEEEecCC Q lcl|NC_015719. 315 QADQIIAKYAMGHGGLRPESAGALVFKAG 343 (344) Q Consensus 315 ~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~ 343 (344) |+|+|+|+++||+|++||||+++++++.. 
T Consensus 316 ~~d~i~g~~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 316 QADQIIAKYAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred HHHHHHHHhhcccceecccceEEEEeecC Confidence 99999999999999999999999999999 No 4 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=100.00 E-value=8.6e-102 Score=574.65 Aligned_cols=342 Identities=72% Similarity=1.082 Sum_probs=317.8 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcceeeeeeCCCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRTKAAYLQPGESLD 80 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~~~ 80 (344) |||++ ++..+|||||+++++|+.+||||+|.+||+++|+++|+++++++.|+|++||++|||++|++++++|+||++++ T Consensus 1 m~~~~-~~~~~t~~g~~~~~~d~~al~ik~f~~eV~~~f~~~s~~~~~~~~r~i~~G~sv~i~~iG~~tv~~~t~G~~l~ 79 (347) T protein:vir:94 1 MANVP-GQKIGTDQGKGKSSSDALALFLKVFAGEVLTAFTRRSVTADKHIVRTIQNGKSAQFPVMGRTSGVYLAPGERLS 79 (347) T ss_pred CCCCC-ccccccccccCCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccccceEEEecccceeeeeecCCCCcC Confidence 99986 57789999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccC Q lcl|NC_015719. 81 DKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGLGK 160 (344) Q Consensus 81 ~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~~~ 160 (344) ++++++++++++|+||+++|++|.|||+|++|+++|+|+++++++|++||+++|++|+++++++++++.+.+..+.|++. 
T Consensus 80 ~~~~~~~~~e~~itID~~~~~~~~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~~~~~g~~~ 159 (347) T protein:vir:94 80 DKRKGIKHTEKVITIDGLLTADVMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAASNENIAGLGT 159 (347) T ss_pred CCCCCCCcceEEEEecchhhhhHHhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCcc Confidence 98888999999999999999999999999999999999999999999999999999999999999998888888889989 Q ss_pred ceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccceeEEE Q lcl|NC_015719. 161 PSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSIRNV 240 (344) Q Consensus 161 ~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg~i 240 (344) ++++..+..++..++.+..+++++.|++|+++|+|++||++|||+||+|++|++||+++++++.++.++..+.+|+|+++ T Consensus 160 ~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg~i 239 (347) T protein:vir:94 160 ASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILAALMPNAANYAALIDPETGNIRNV 239 (347) T ss_pred cceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhccchhhhhhccccccccccceEEE Confidence 99999888888888888889999999999999999999999999999999999999999999999999889999999999 Q ss_pred eCeEEEEecccccccccccccccccccccc-cccccccc---ccccccceeEEEecHHHHhhhhhheeeeeeeecchhhh Q lcl|NC_015719. 241 MGFEVVEVPHLTAGGAGDDRPEEGTDASNQ-KHAFPATG---GKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEYQA 316 (344) Q Consensus 241 ~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~-~~~~~~~~---~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~~~ 316 (344) +||+||+|||||..+.+.++.+.+++..+. .+.|+... 
...+++++++|+|||+|+++++++++++|.+|++++|+ T Consensus 240 ~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~~~~~~ 319 (347) T protein:vir:94 240 MGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDLALERDRDVDAQG 319 (347) T ss_pred eceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhcccccccchhchhhHH Confidence 999999999999998888887776554433 33333322 23456789999999999999999999999999999999 Q ss_pred hhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 317 DQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 317 d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) |+|+|+++||+|++||||+|+|+++ .| T Consensus 320 d~i~~~~~~G~~~~rP~~a~~~~~~-~A 346 (347) T protein:vir:94 320 DLIVGKYAMGHGGLRPEAAGALVFS-PA 346 (347) T ss_pred HHhhhhhhhcCcccccceeEEEEec-CC Confidence 9999999999999999999999998 66 No 5 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=100.00 E-value=7.7e-102 Score=574.93 Aligned_cols=338 Identities=81% Similarity=1.149 Sum_probs=304.2 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcceeeeeeCCCCCC Q lcl|NC_015719. 
1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRTKAAYLQPGESLD 80 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~~~ 80 (344) |||++++++++|||||+++++|+++||||+|+|||+++|+++|+++++++.|++++||++|||++|++++++|++|++++ T Consensus 1 ~~~~~~~~~~~t~~g~~~~~~~~~al~ie~~~g~V~~~f~~~s~~~~~v~~r~~~~G~sv~i~~iG~~t~~~~~~g~~l~ 80 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIGRTKAAYLKPGENLD 80 (347) T ss_pred CCCCccCcccccccccCCcccchHHHHHHHHHHHHHHHHHHHHhhhhhhccccccccceeEeeeccceeeeeecCCCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccc-- Q lcl|NC_015719. 81 DKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGL-- 158 (344) Q Consensus 81 ~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~-- 158 (344) ++++++++++++|+||+++||+|.|||+|++|+++|+|+++++++|++||+++|++|+++++++.+.+..+....+++ T Consensus 81 ~~~~~~~~~e~~ltiD~~~y~~~~VddiD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~~~~ 160 (347) T protein:vir:33 81 DKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGSNENIEGLGK 160 (347) T ss_pred CCCCCCccceEEEEechhhhhhHHHhhHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccc Confidence 988889999999999999999999999999999999999999999999999999999999998777655444333333 Q ss_pred cCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccceeE Q lcl|NC_015719. 
159 GKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSIR 238 (344) Q Consensus 159 ~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg 238 (344) ..++.+..++++...++...++++|+.|++|+++|+|++||++|||+||+|++|++||++++|++.+|.+++.+.+|.|+ T Consensus 161 ~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~d~~~~~~~~~G~V~ 240 (347) T protein:vir:33 161 PTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQALLDPERGTIR 240 (347) T ss_pred cccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCcEEEeCHHHHHHHhccccccccccccccccccceeE Confidence 33344555566666677777899999999999999999999999999999999999999999999999988899999999 Q ss_pred EEeCeEEEEeccccccccccccccccccccccccccccccc------cccccceeEEEecHHHHhhhhhheeeeeeeecc Q lcl|NC_015719. 239 NVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGG------KVNKENVVGLFQHRSAVGTVKLKDLALERARRA 312 (344) Q Consensus 239 ~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~ 312 (344) +++||+||+|||||.++++.+..+.. +..++.++. ...+++.+||+||++|+++++++++++|.+|++ T Consensus 241 ~i~G~~V~~Sn~lp~~~~~~~~~~~~------ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e~~r~~ 314 (347) T protein:vir:33 241 NVMGFEVVEVPHLTAGGAGDTREDAP------ADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRA 314 (347) T ss_pred EEeceeEEEecccccCcccccccccc------ccccccccCCcccceeccccceeeeeecchhheeeeeeceeeeeccch Confidence 99999999999999988776654421 112222222 245677899999999999999999999999999 Q ss_pred hhhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
313 EYQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 313 ~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ++|+|+|+|+++||+|++||||+|+|+++.-+ T Consensus 315 ~~~~d~i~~~~~~G~~vlrP~~av~i~~~~~~ 346 (347) T protein:vir:33 315 NYQADQIIAKYAMGHGGLRPEAAGAIVLPKVS 346 (347) T ss_pred hhhhHhhhhhhhcCCceecccceEEEecCCCC Confidence 99999999999999999999999999999999 No 6 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=100.00 E-value=1.9e-101 Score=572.80 Aligned_cols=338 Identities=78% Similarity=1.144 Sum_probs=307.6 Q ss_pred CCCcccccccc--ccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcceeeeeeCCCC Q lcl|NC_015719. 1 MANMQGGQQLG--TNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRTKAAYLQPGES 78 (344) Q Consensus 1 ma~~~~~~~~~--~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~ 78 (344) ||+++++++.+ |||||+ +++|+++||||+|+|||+++|+++|+++++|+.|+|++|||++||++|++++++|+||++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~-~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~r~i~~gks~~~~~iG~~~~~~~~~G~~ 79 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVV-AAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVLGRTQAAYLAPGEN 79 (345) T ss_pred Ccccccchhcccccccccc-cCCchhHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEEeeecceEEEeeecCCC Confidence 99999987766 688887 577889999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccc Q lcl|NC_015719. 
79 LDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGL 158 (344) Q Consensus 79 ~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~ 158 (344) ++++.++++++|++|+||+.+|++|+|||+|++|++||+|+++++|+|++||+++|++|+++++++++.+.+.+..++++ T Consensus 80 l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~~~~~~~ 159 (345) T protein:vir:22 80 LDDKRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESKYNENIEGL 159 (345) T ss_pred CCCCCCCcccceEEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccc Confidence 99988889999999999999999999999999999999999999999999999999999999999999999999999998 Q ss_pred cCceeeeccc-ccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcccccccccccee Q lcl|NC_015719. 159 GKPSLLEVGA-KADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSI 237 (344) Q Consensus 159 ~~~~~i~~~~-~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V 237 (344) +.+..+.... +.+.+++.+.+.++|+.|++|+++|+|++||.+|||+||+|++|++||++++|++.+|++++.+++|+| T Consensus 160 ~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~V 239 (345) T protein:vir:22 160 GTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAADRVFYCDPDSYSAILAALMPNAANYAALIDPEKGSI 239 (345) T ss_pred ccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCccCCEEEeChHHHHHHhccccccccccccccccccceE Confidence 8888776555 445566777788999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEeCeEEEEecccccccccccccccccccccccccccccccc----ccccceeEEEecHHHHhhhhhheeeeeeeecch Q lcl|NC_015719. 238 RNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGK----VNKENVVGLFQHRSAVGTVKLKDLALERARRAE 313 (344) Q Consensus 238 g~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~ 313 (344) ++++||+||+|||+|.+.++....+ +....+.++.+.+. 
...++++||+|||+|+++++++++++|.+|+++ T Consensus 240 ~~i~G~~V~~sn~lp~~~~~~~~~~----~~~~~~~~~~~~g~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~~~ 315 (345) T protein:vir:22 240 RNVMGFEVVEVPHLTAGGAGTAREG----TTGQKHVFPANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERARRAN 315 (345) T ss_pred EEEeceEEEecccccccccCccccC----cccccccccccccceeeeeccCceEEEEEehhheeeeeeecceeeeeechh Confidence 9999999999999998765544332 22334455554443 234778999999999999999999999999999 Q ss_pred hhhhhhhhhhhhcCceeccccEEEEEecCC Q lcl|NC_015719. 314 YQADQIIAKYAMGHGGLRPESAGALVFKAG 343 (344) Q Consensus 314 ~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~ 343 (344) +|+|+|+++++|||+++||||+++|+++-. T Consensus 316 ~~~d~I~~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 316 FQADQIIAKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred HHHHHHHHHHhcCCcccccceeEEEEEeeC Confidence 999999999999999999999999999988 No 7 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=100.00 E-value=2.4e-100 Score=566.71 Aligned_cols=341 Identities=82% Similarity=1.161 Sum_probs=301.8 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcceeeeeeCCCCCC Q lcl|NC_015719. 
1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRTKAAYLQPGESLD 80 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~~~ 80 (344) |||++++++++|||||+++++|+++||||+|+|||+++|+++|+++++++.|++++||++|||++|++++++|++|++++ T Consensus 1 ma~~~~~~~~~t~~~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~~~~~G~sv~i~~ig~~t~~~~~~g~~l~ 80 (347) T protein:vir:15 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIGRTKAAYLKPGENLD 80 (347) T ss_pred CCccccCCccccccccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhccccccccccceeEeeeccceeeeeeccCCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc--c Q lcl|NC_015719. 81 DKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAG--L 158 (344) Q Consensus 81 ~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~--~ 158 (344) ++++++++++++|+||+++|++|.|||+|++|+++|+|+++++++|++||+++|++|++++++++....+......+ . T Consensus 81 ~~~~~~~~~e~~ltID~~~~~~~~VddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~~g~ 160 (347) T protein:vir:15 81 DKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDASNENIEGLGK 160 (347) T ss_pred CCCCCCccceEEEEechhhhhhHHhhhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCc Confidence 98888999999999999999999999999999999999999999999999999999999999876654433333222 2 Q ss_pred cCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccceeE Q lcl|NC_015719. 
159 GKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSIR 238 (344) Q Consensus 159 ~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg 238 (344) ..........+++.+++...+++|++.|++|+++|+|++||++|||+||+|++|+.||+++++++.+|.++..+++|.|+ T Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL~~~~~~~~d~~~~~~~~~G~Vg 240 (347) T protein:vir:15 161 PTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQALIDHERGTIR 240 (347) T ss_pred cccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhcccccccccccccccccceEEE Confidence 22233344556677888888999999999999999999999999999999999999999999999999998999999999 Q ss_pred EEeCeEEEEeccccccccccccccccccccccccccccc---cccccccceeEEEecHHHHhhhhhheeeeeeeecchhh Q lcl|NC_015719. 239 NVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPAT---GGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEYQ 315 (344) Q Consensus 239 ~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~~ 315 (344) +++||+||+|||||..+++.+..... ++..+.+.+- ...+.+++.++|+||++|++++++|++++|.+|++++| T Consensus 241 ~i~G~~V~~Sn~lp~~~~t~~~~~~~---~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e~~~~~~~~ 317 (347) T protein:vir:15 241 NVMGFEVVEVPHLTAGGAGDTREDAP---ADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRANYQ 317 (347) T ss_pred EEeceEEEeccccccccccccccccc---ccccccccccccceeeeccccceeeeeccceeeeeEeeceeeeecccchhh Confidence 99999999999999887765533211 1111111111 01235677899999999999999999999999999999 Q ss_pred hhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
316 ADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 316 ~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +|+|+++++||+|++||||+++|+++.-+ T Consensus 318 ~d~i~~~~~~G~~vlrP~~av~~~~~~~~ 346 (347) T protein:vir:15 318 ADQIIAKYAMGHGGLRPEAAGAIVLPKVS 346 (347) T ss_pred hhhhehhhhcCCceeccccEEEEecCCCC Confidence 99999999999999999999999999999 No 8 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=100.00 E-value=7.7e-98 Score=552.98 Aligned_cols=342 Identities=24% Similarity=0.337 Sum_probs=307.2 Q ss_pred CCCcc----ccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcceeeeeeCC Q lcl|NC_015719. 1 MANMQ----GGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRTKAAYLQPG 76 (344) Q Consensus 1 ma~~~----~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g 76 (344) |++.+ +++|.+|||||+ +++|+++||||+|+|||+++|+++|+++++++.|++++|||++|+++|++++++|+|| T Consensus 1 ~~~~~~~~~~~~n~~t~~~~~-~~~~~~al~le~f~geV~~~f~~~si~~~~~~~rti~~Gksv~f~~iG~~t~~~~t~G 79 (375) T protein:vir:10 1 MANANQVALGRSNLSTGTGYG-GATDKYALYLKLFSGEMFKGFQHETIARDLVTKRTLKNGKSLQFIYTGRMTSSFHTPG 79 (375) T ss_pred CccccccccCccccCCccccc-cccchHHHHHHHHhHHHHHHHHHHHhhhccccccccccCceEEEEeeeeeEEeeecCC Confidence 88877 567889999999 5568889999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCCc-CCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|NC_015719. 77 ESLDDKR-KDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENI 155 (344) Q Consensus 77 ~~~~~~~-~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~ 155 (344) ++++++. 
.++++++++|+||+.+||+|.|||+|++|+++|+|+++++|+|++||+++|++|+++++++++...+....+ T Consensus 80 ~~i~~~~~~d~~~te~~l~ID~~~y~~~~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~~p~~~~~ 159 (375) T protein:vir:10 80 TPILGNADKAPPVAEKTIVMDDLLISSAFVYDLDETLAHYELRGEISKKIGYALAEKYDRLIFRSITRGARSASPVSATN 159 (375) T ss_pred cCcCCccccCCCCCceEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Confidence 9998763 567889999999999999999999999999999999999999999999999999999999999888877766 Q ss_pred ccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhcc---chhhhhcccccccc Q lcl|NC_015719. 156 AGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAA---LMPNAANYAALIDP 232 (344) Q Consensus 156 ~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~---~~~~~~~~~~~~~~ 232 (344) ..+..++.+..++.+.. .....++++|+.|++++++|+|++||++|||+||+|++|++||++ +++++.+++++... T Consensus 160 ~~~~Gg~~i~~~sg~~~-~~~~ta~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~d~~~~~n~d~~~~~~~ 238 (375) T protein:vir:10 160 FVEPGGTQIRVGSGTNE-SDAFTASALVNAFYDAAAAMDEKGVSSQGRCAVLNPRQYYALIQDIGSNGLVNRDVQGSALQ 238 (375) T ss_pred ccccCcceeeecccccc-ccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeChHHHHHHHhcCCccceeeeccccccee Confidence 66667777766544433 233447889999999999999999999999999999999999986 67889999988888 Q ss_pred ccceeEEEeCeEEEEecccccccccccccccccccc-----------------ccccccccccccccc-cceeEEEecHH Q lcl|NC_015719. 233 ERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDAS-----------------NQKHAFPATGGKVNK-ENVVGLFQHRS 294 (344) Q Consensus 233 ~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~-----------------~~~~~~~~~~~~~~~-~~~~gl~~~~~ 294 (344) .+|.|++++||+||+|||+|..+++.+.+++....+ .+...++.|++++.. 
.+++||+|||+ T Consensus 239 ~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~~~~~~~~~~~ 318 (375) T protein:vir:10 239 SGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAKSCGLIFQKE 318 (375) T ss_pred ccceEEEEeceEEEEeccccccccccccccccccccchhhhhccccccCCcceeeccccccccccccccCceEEEEEchh Confidence 999999999999999999999998888776654332 445577889988743 67999999999 Q ss_pred HHhhhhhheeeeeee---ecchhhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 295 AVGTVKLKDLALERA---RRAEYQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 295 Av~~~~~~~~~~e~~---~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) |+++++++++++|.+ |+++||+|+|+++|+|||+++||||+|+|+++++| T Consensus 319 A~g~v~~~~~~~~~~~~~~~~~~q~~~i~~~~a~G~~~lrp~~av~l~~~~~~ 371 (375) T protein:vir:10 319 AAGVVEAIGPQVQVTNGDVSVIYQGDVILGRMAMGADYLNPAAAVELYIGATA 371 (375) T ss_pred heeeeeeeccccccccchhhheeeeeeeeeeeeeccCccCceeEEEEecCcCc Confidence 999999999999987 69999999999999999999999999999999998 No 9 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=100.00 E-value=1.4e-93 Score=529.69 Aligned_cols=327 Identities=17% Similarity=0.132 Sum_probs=287.0 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcceeeeeeCCCCCC Q lcl|NC_015719. 
1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRTKAAYLQPGESLD 80 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~~~ 80 (344) |+|+++ +..|||+|+++++| ++||||+|+|||+++|+++|+|+++++.|+|++|||+|||++|++++++|+||++++ T Consensus 1 m~~~~~--~~~t~~~~~~~~~~-~~l~le~~~geV~~af~~~s~~~~~~~~r~i~~G~s~~~~~iG~~~~~~~~~g~~l~ 77 (334) T protein:vir:80 1 MTYPAA--NTHTRPGWGGANSD-VSLHIEEHLGLVDASFMYSSKFASWMNVRSLRGTNQLRVDRVGASTIAGRKAGEELV 77 (334) T ss_pred CCCCcC--CCccccccccccch-heehhhhhhhHHHHHHHHhhhhhccceeeeccccceEEEeeecceeeeeecCCCCCC Confidence 999987 44599999988777 669999999999999999999999999999999999999999999999999999998 Q ss_pred CCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccC Q lcl|NC_015719. 81 DKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGLGK 160 (344) Q Consensus 81 ~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~~~ 160 (344) ++ ++++++++|+||+.+|++++|||+|++|++||+|+++++|+|++||+++||+|+++++++++.+.+.+..++...+ T Consensus 78 ~~--~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~~~~~~G 155 (334) T protein:vir:80 78 VQ--KNVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLKPAFHDG 155 (334) T ss_pred CC--CcccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccCC Confidence 86 5889999999999999999999999999999999999999999999999999999999999887766544332222 Q ss_pred ceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCc---CCCEEEeCHHHHHHHhccchhhhhcccc---cccccc Q lcl|NC_015719. 161 PSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPA---NDRTFYTTPDVYSAILAALMPNAANYAA---LIDPER 234 (344) Q Consensus 161 ~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~---~gR~~vv~P~~~~~Ll~~~~~~~~~~~~---~~~~~~ 234 (344) +....... 
+...+..+.++.+++++++|++.|+|++||+ .+||+||+|++|++||++++|++.+|++ ...+.+ T Consensus 156 ~~~~~~~~-g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d~~~s~~~~~~~~ 234 (334) T protein:vir:80 156 ILLPSTIS-GLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNVEFGAKEGGNSFVG 234 (334) T ss_pred cceeeccc-ccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccccceeccccccccccc Confidence 22222111 1222334556788999999999999999994 6799999999999999999999999864 345889 Q ss_pred ceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchh Q lcl|NC_015719. 235 GSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEY 314 (344) Q Consensus 235 G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~ 314 (344) |+|++++||+||+|||+|.++++.+..+ ..++.|++++ ...+++|||++|+++++++++++|.+|++++ T Consensus 235 g~i~~v~G~~V~~Sn~~P~~~~t~~~~g---------~~~~~~agd~--t~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~ 303 (334) T protein:vir:80 235 GRIAMLNGVRVVETPRFPQSAITANALG---------ADFNVTDAEV--RRKMITFIPSMALISAQVHPVSAQFWEEKKD 303 (334) T ss_pred eeEEEEeceEEEeecCCCCccccccccc---------cccccccccc--cceEEEEEeCceEEEEEEeecceeeeechhh Confidence 9999999999999999999887766543 3466777765 4589999999999999999999999999999 Q ss_pred hhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
315 QADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 315 ~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) |+|+|+++++||++++||||+++++++-+- T Consensus 304 ~~d~i~~~~a~G~g~lRPeaa~vv~~~~~~ 333 (334) T protein:vir:80 304 FGHYLDTFQSYNIGQRRPDAVAVHDITVTN 333 (334) T ss_pred HHHHHHHHHHcCCceeccceEEEEEEeeec Confidence 999999999999999999999999999998 No 10 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=100.00 E-value=2.6e-92 Score=522.66 Aligned_cols=335 Identities=13% Similarity=0.085 Sum_probs=285.1 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcceeeeeeCCCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRTKAAYLQPGESLD 80 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~~~ 80 (344) |++.+ ..|||||++ ++|.++||||+|+|||+++|+++|++++++++|+|++|||++||++|++++++|+||++++ T Consensus 1 ms~~n----~~t~~~~~~-~~~~~al~le~f~geV~taf~~~s~~~~~~~~rti~~gkS~q~~~iG~~~~~~~~~G~~ld 75 (364) T protein:vir:10 1 MSNPN----VLTQPAVSA-SGEVDSLLIEKFNNRVHEQYLKGENLLQWFDVQEVVGTNSVSNKYIGETELQVLSPGKSPD 75 (364) T ss_pred CCCcc----ccccccccc-ccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEeeeeeeeEEeeeccCcccC Confidence 87764 579999994 4477889999999999999999999999999999999999999999999999999999998 Q ss_pred CCcCCcccceEEEEeeeeeeeceeccchHHHHhChh-HHHHHHHHHHHHHHHHHHHHHHHHHHHhh-hcccccccccccc Q lcl|NC_015719. 81 DKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYD-VRSEYTSQIGESLAMAADGAVLAELAGLI-NLADGVNENIAGL 158 (344) Q Consensus 81 ~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d-~~~~~~~~~~~aLa~~~D~~i~~~~~~~a-~~~~~~~~~~~~~ 158 (344) ++ ++.++|++|+||+.+|++++|+|+|++|++|| +|+++++|+|++||+++||+|++++..++ ..+.+....+.+. 
T Consensus 76 ~~--~~~~~k~~itID~ll~a~~~V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~~~~ 153 (364) T protein:vir:10 76 AS--PTEFDKNRLVVDTTVIARNTVAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNPRVA 153 (364) T ss_pred CC--CcccCcEEEEecceeeechhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCCccc Confidence 74 68899999999999999999999999999999 89999999999999999999998776544 3344444444444 Q ss_pred cCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccc--cccccccce Q lcl|NC_015719. 159 GKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYA--ALIDPERGS 236 (344) Q Consensus 159 ~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~--~~~~~~~G~ 236 (344) +.|..+.++.. ..+..+.+..++++|+++.+.|+|++||.+|||+||+|++|+.||++++|++.+|+ +++.+.+|+ T Consensus 154 ~~g~~i~~~~~--a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~~R~~vv~P~~y~~Ll~~~~lvn~d~~~~~~~~~~~G~ 231 (364) T protein:vir:10 154 GHGFSIHIVGL--ASSFLTSPQYMMAAIEMAMEQQTEQEVDTSELCGLMPWTAFNCLRDADRIVDKSYTIAASDNTVDGF 231 (364) T ss_pred CCcceeeeccc--CcchhhhHHHHHHHHHHHHHHHhhcCCCccccEEEeChHHHHHHhcCCccccccccccCCCccccce Confidence 44544444322 23345567888999999999999999999999999999999999999999999986 567799999 Q ss_pred eEEEeCeEEEEecccccccccccccccc-ccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchhh Q lcl|NC_015719. 237 IRNVMGFEVVEVPHLTAGGAGDDRPEEG-TDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEYQ 315 (344) Q Consensus 237 Vg~i~G~~V~~sn~lp~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~~ 315 (344) |++++||+||+|||||+.++.....+.. 
.+..++...++.|....++..+++++|||+|+++++++++++|.+|++++| T Consensus 232 v~~v~Gv~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e~~~~~~~~ 311 (364) T protein:vir:10 232 VLKSWNTPIVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISITGDIFYEKKEK 311 (364) T ss_pred eEEEeceEEEeccccccccccccccccccccccccccCCcccccccccceeEEEEEecceEEEEEEecceeeeeecccee Confidence 9999999999999999876544332211 111222333455554455667999999999999999999999999999999 Q ss_pred hhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 316 ADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 316 ~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +|+|+++++|||+++||||+++|++.+++ T Consensus 312 ~~~ida~~a~G~g~lRPeaa~~i~~~~~~ 340 (364) T protein:vir:10 312 TWYIDTFLAEGAIPDRWEAVAVVTAADTA 340 (364) T ss_pred eeeeeeehcccCcccCccceEEEEecCCC Confidence 99999999999999999999999999998 No 11 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=100.00 E-value=1.3e-91 Score=518.88 Aligned_cols=321 Identities=18% Similarity=0.179 Sum_probs=282.9 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcceeeeeeCCCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRTKAAYLQPGESLD 80 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~~~ 80 (344) |.|.+ .+|||||+++++|. 
+||||+|+|||+++|+++|++++++++|+|++|||+|||++|+.++++|+||++++ T Consensus 1 ms~~~----~~tr~~~~~s~~d~-al~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~iG~~~~~~~~pG~~l~ 75 (335) T protein:vir:63 1 MSFLN----DLTRPNYAGKNADV-DIHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRLGNVEAKGRRAGEELE 75 (335) T ss_pred CCCcc----cchhhhcccccchh-heehhhhhhhHHHHHHhhhhhccccceeeeccceeEEEeeeeeeeeecccCCcCcC Confidence 87764 47999999999987 79999999999999999999999999999999999999999999999999999999 Q ss_pred CCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccC Q lcl|NC_015719. 81 DKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGLGK 160 (344) Q Consensus 81 ~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~~~ 160 (344) ++ ++.++|++|+||+++|++++|||+|++|++||+|+|+++|+|++||+++||+|+++++++++...+.+... ++.. T Consensus 76 ~~--~~~~~k~~itVD~ll~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~-~~~~ 152 (335) T protein:vir:63 76 RS--RVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLED-AFSP 152 (335) T ss_pred CC--CccccceEEEecceeechhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCC-CcCC Confidence 87 47889999999999999999999999999999999999999999999999999999999988877665433 3222 Q ss_pred cee--eecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCC---CEEEeCHHHHHHHhccchhhhhcccc---cccc Q lcl|NC_015719. 161 PSL--LEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPAND---RTFYTTPDVYSAILAALMPNAANYAA---LIDP 232 (344) Q Consensus 161 ~~~--i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~g---R~~vv~P~~~~~Ll~~~~~~~~~~~~---~~~~ 232 (344) |.. +.+.+.+. 
.+.++++++++.+|.++|+|++||+++ ||++|+|++|++|+++++|++.+|++ ...+ T Consensus 153 G~~~~~~~tg~~~----~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~ 228 (335) T protein:vir:63 153 GVLEKLDLTGLTA----KQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQATGATNDY 228 (335) T ss_pred CcceeeeeccCcc----cccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhccccccccccccccccccc Confidence 322 22222222 234678889999999999999999754 99999999999999999999999863 4568 Q ss_pred ccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecc Q lcl|NC_015719. 233 ERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRA 312 (344) Q Consensus 233 ~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~ 312 (344) .+|+|++++||+|++|||+|+++++.+.++. .|+.|++++ +..++++||++|+++++++++++|.+|++ T Consensus 229 ~~g~v~~v~Gv~V~~sn~lP~~~~t~~~lg~---------a~n~~~~d~--~~~~~~~~~~~Al~t~~~~~vt~e~~~~~ 297 (335) T protein:vir:63 229 VKSRVAILNGVKVLETPRFATKAIAAHPLGR---------HFNVSAEES--ERQIALFLPSKTLITAQVAPVQAKLWEDN 297 (335) T ss_pred cCceeEEeeceEEEeeccCCCCCcccccccc---------cCCcccccc--ceeEEEEEecceEEEEEEeecccceeecc Confidence 9999999999999999999999888776543 356677755 46789999999999999999999999999 Q ss_pred hhhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
313 EYQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 313 ~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ++|+|+|+++++|||+++||||+++++++..- T Consensus 298 ~~~~~~i~~~~a~G~g~lRPe~a~~i~~tg~~ 329 (335) T protein:vir:63 298 EKFSWVLDTFQMYNIGARRPDTAGAIELKGIG 329 (335) T ss_pred chhhHHhHHHHHcCCcccccceEEEEEEcCCC Confidence 99999999999999999999999999997644 No 12 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=100.00 E-value=8.1e-91 Score=514.48 Aligned_cols=323 Identities=17% Similarity=0.147 Sum_probs=284.0 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcceeeeeeCCCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRTKAAYLQPGESLD 80 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~~~ 80 (344) |.|.+ .+|||||+++++|. +||||+|+|||+++|+++|++++++++|+|++|||+|||++|+.++++++||++++ T Consensus 1 ms~~~----~~t~~~~~~s~~d~-al~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~iG~~~~~~~~pG~~l~ 75 (335) T protein:vir:78 1 MSFLN----DLTRPNYAGKNADV-DIHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRLGNVEAKGRRAGEELE 75 (335) T ss_pred CCccc----cccccccccccchh-hhhhhhhhhHHHHHHHHhhhhccccceeeeccceeEEEeeeeeeeecccccCcccC Confidence 87763 57999999998886 79999999999999999999999999999999999999999999999999999999 Q ss_pred CCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccC Q lcl|NC_015719. 
81 DKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGLGK 160 (344) Q Consensus 81 ~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~~~ 160 (344) ++ ++++++++|+||+.+|++++|||+|++|++||+|+++++|+|++||+++||+++++++++++...+.+..++-+.+ T Consensus 76 ~~--~~~~~k~~itID~ll~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~~~~G 153 (335) T protein:vir:78 76 RS--RVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPG 153 (335) T ss_pred CC--CcccCCeEEEecceeechhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCcCCC Confidence 87 5788999999999999999999999999999999999999999999999999999999999887776654432222 Q ss_pred ceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcC---CCEEEeCHHHHHHHhccchhhhhcccc---cccccc Q lcl|NC_015719. 161 PSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPAN---DRTFYTTPDVYSAILAALMPNAANYAA---LIDPER 234 (344) Q Consensus 161 ~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~---gR~~vv~P~~~~~Ll~~~~~~~~~~~~---~~~~~~ 234 (344) ++.....+.. .....+.++++++.++.+.|+|++||+. |||++|+|++|++|+++++|++.+|+. ...+.+ T Consensus 154 ~~~~~~~tg~---~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~ 230 (335) T protein:vir:78 154 VLEKLDLTGL---TAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMSVEYQATGATNDYVK 230 (335) T ss_pred cceeeeeccc---cccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhccccccccccccccccccccc Confidence 2222211111 1233467889999999999999999965 699999999999999999999999863 456899 Q ss_pred ceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchh Q lcl|NC_015719. 
235 GSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEY 314 (344) Q Consensus 235 G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~ 314 (344) |+|++++||+|++|||||.++++.+.+++ .|+.|++++ ++.++++||++|++++++++++.|.+|++++ T Consensus 231 g~v~~v~Gv~V~~Sn~lP~~~~t~~~lg~---------a~n~~~~d~--~~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~ 299 (335) T protein:vir:78 231 SRVAILNGVKVLETPRFATKAISAHPLGR---------HFNVSAEEA--ERQIALFLPSKTLITAQVAPVQAKLWEDHDQ 299 (335) T ss_pred ceeEEeeceEEEeeccCCCCCCccccccc---------cCCcccccc--cceEEEEEecceEEEEEEEecccceeeccch Confidence 99999999999999999999887776543 356666644 5678999999999999999999999999999 Q ss_pred hhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 315 QADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 315 ~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) |+|+|+++++|||+++||||+++|+++..- T Consensus 300 ~~~~i~~~~a~G~g~lRPe~a~~i~~tg~~ 329 (335) T protein:vir:78 300 FSWVLDTFQMYNIGARRPDTAGAIELKGIE 329 (335) T ss_pred hhHhhhHHHHcCCcccCcceEEEEEecCCC Confidence 999999999999999999999999998755 No 13 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=100.00 E-value=1.9e-90 Score=512.45 Aligned_cols=321 Identities=26% Similarity=0.355 Sum_probs=278.6 Q ss_pred CCCccccccccccccccccccchh-hhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcceeeeeeCCCCC Q lcl|NC_015719. 
1 MANMQGGQQLGTNQGKGQSAADKL-ALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRTKAAYLQPGESL 79 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~-~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~~ 79 (344) ++||+..++ .|+||+++++|++ +||||+|+|||+++|+++|+++++++.|++++|+||||+++|++++++|++|+++ T Consensus 4 ~~~~~~~~~--~~~~~~~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~~~r~i~~G~tv~i~~ig~~~~~~~~~g~~l 81 (332) T protein:vir:78 4 LSNFSLPNQ--ANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLSAGYHTPGTPI 81 (332) T ss_pred cccccCCcc--ccCCccccccccchhhhhhhhhhhHHHHHHHHhhhhhccccccccccceEEEEeccceeEeeecCCCCC Confidence 566666555 7899999999975 9999999999999999999999999999999999999999999999999999999 Q ss_pred CCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccc Q lcl|NC_015719. 80 DDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGLG 159 (344) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~~ 159 (344) +++ +++++++++|+||+.+|++|.|||+|++|+++|+|+++++++|++||+++|++|+++++++++...+....+ T Consensus 82 ~~~-~~~~~~~~~l~ID~~ky~~~~VddiD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~~~~~~~---- 156 (332) T protein:vir:78 82 VGD-AGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEP---- 156 (332) T ss_pred CCC-CCCCCceEEEEEehhhhhHHHHHhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCcccccc---- Confidence 875 468999999999999999999999999999999999999999999999999999999999887665554433 Q ss_pred CceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhc--cchhhhhcccc-ccccccce Q lcl|NC_015719. 160 KPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILA--ALMPNAANYAA-LIDPERGS 236 (344) Q Consensus 160 ~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~--~~~~~~~~~~~-~~~~~~G~ 236 (344) .+..+.++.+. .+ .++++|++|++|+++|+|++||.+|||+||+|++|+.||+ +++|++.++.+ ++.+++|. 
T Consensus 157 g~~~~~~~~~~-~~----~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~d~~~~n~~~~~~~~~~~~g~ 231 (332) T protein:vir:78 157 GGFHVNIGAGN-TN----DAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGK 231 (332) T ss_pred cccccccCCcc-cc----CHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHHhhcCceeeeeeccccccceecce Confidence 23333333322 22 2467899999999999999999999999999999999998 78999998866 45688886 Q ss_pred -eEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeee---eeecc Q lcl|NC_015719. 237 -IRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALE---RARRA 312 (344) Q Consensus 237 -Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e---~~~~~ 312 (344) |++++||+||+|||||.++++.+..+ +....++.|+++ +++.++|+||++|+++++.+++++| .+|++ T Consensus 232 ~i~~i~G~~V~~Sn~lp~~~g~~~~~~------~~~~~~n~~~~~--~~~~~~~~~h~~a~~~v~~~~~~~~~t~~~~~~ 303 (332) T protein:vir:78 232 GLYSIAGIRILKSNNLAGLYGQDLSSA------AVTGENNDYQVD--ASALAGLIFHREAAGCIQSVAPTIQTTSGDFNV 303 (332) T ss_pred eeeEEeeeEEEecCccccCcccccccc------cccccccccccc--cccceEEeecccceeeeeeeccchhhhhcccch Confidence 89999999999999999887766543 334456666654 5668899999999999999988665 57899 Q ss_pred hhhhhhhhhhhhhcCceeccccEEEEEec Q lcl|NC_015719. 313 EYQADQIIAKYAMGHGGLRPESAGALVFK 341 (344) Q Consensus 313 ~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~ 341 (344) ++|+|+|+|+++||++++||||+++|++. T Consensus 304 ~~~~d~i~~~~~~G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 304 QYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) T ss_pred hhhHhhhhhhhhhcCceecccceEEEeeC Confidence 99999999999999999999999999999 No 14 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=100.00 E-value=6.5e-90 Score=509.53 Aligned_cols=328 Identities=13% Similarity=0.078 Sum_probs=280.6 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcceeeeeeCCCCCC Q lcl|NC_015719. 
1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRTKAAYLQPGESLD 80 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~~~ 80 (344) |++.+ ..|||||++ ++|.++||||+|+|||+++|+++|++++++++|+|++|||++||++|++++++|+||+.++ T Consensus 1 Ms~~n----~~t~~~~~~-s~~~~al~le~f~geV~taF~~~si~~~~~~vrti~~GkS~qf~~iG~~~a~y~~~G~~ld 75 (402) T protein:vir:97 1 MSTPN----TLTNVAVSA-SGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGETELQVLAPGQSPN 75 (402) T ss_pred CCCcc----ccccccccc-ccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEEEEEeeeEEeeeccccccC Confidence 87764 579999994 4477889999999999999999999999999999999999999999999999999999998 Q ss_pred CCcCCcccceEEEEeeeeeeeceeccchHHHHhChh-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhh-cccccccccccc Q lcl|NC_015719. 81 DKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYD-VRSEYTSQIGESLAMAADGAVLAELAGLIN-LADGVNENIAGL 158 (344) Q Consensus 81 ~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d-~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~-~~~~~~~~~~~~ 158 (344) ++ ++.+++++|+||+.+|++++|+|+|++|++|| +|+++++|+|++||+++||+|++++..+++ .+.++...+.+. T Consensus 76 g~--~~~~~k~~ItID~lL~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~~~~ 153 (402) T protein:vir:97 76 AT--PTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVK 153 (402) T ss_pred CC--CcccccEEEEeCceeechhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCccc Confidence 75 67899999999999999999999999999999 899999999999999999999887765444 345566666665 Q ss_pred cCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccc--cccccccce Q lcl|NC_015719. 159 GKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYA--ALIDPERGS 236 (344) Q Consensus 159 ~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~--~~~~~~~G~ 236 (344) ..++.+....+.. 
...+.+.+++++|+++.++|+|++||.+|||++|+|++|++|+++++|++.+|+ +.+.+.+|+ T Consensus 154 ~~g~s~~~~~t~~--~a~~~~~~l~~ai~~a~~~LdEkdVP~~dRv~vv~P~~y~~Ll~~~rl~n~d~~~~~~g~~~~G~ 231 (402) T protein:vir:97 154 GHGFSINVNVTES--EALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGF 231 (402) T ss_pred ccccccccccccc--hhhcCHHHHHHHHHHHHHHHHhcCCCccccEEEeChHHHHHHhhcccccchhhccccCCccccce Confidence 5555444433222 223456788999999999999999999999999999999999999999999985 566799999 Q ss_pred eEEEeCeEEEEeccccccc--cccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchh Q lcl|NC_015719. 237 IRNVMGFEVVEVPHLTAGG--AGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEY 314 (344) Q Consensus 237 Vg~i~G~~V~~sn~lp~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~ 314 (344) |++++||+||+|||||+.+ ++.+.+ ++....+.|....++..+++++|||+|++++++++++.|.|||+++ T Consensus 232 v~~v~Gv~Vv~SnnlP~~a~~it~~~l-------s~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~vT~~~~~d~r~ 304 (402) T protein:vir:97 232 VLSSYNCPVIPSNRFPTFAQDQAHHLL-------SNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKE 304 (402) T ss_pred eEEEeceEEEecCcccccccccccccc-------ccCCCCccCCcCcccceeEEEEEecceEEEEEeeccccchhhchhH Confidence 9999999999999999864 222222 1122233444334566789999999999999999999999999999 Q ss_pred hhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
315 QADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 315 ~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) |+|+|+++++||++++||||+++++++..+ T Consensus 305 ~~~~id~~~a~G~g~~RPeaa~vv~~~~~~ 334 (402) T protein:vir:97 305 KTYYIDTFMAEGAIPDRWEAVSVVTTKRDA 334 (402) T ss_pred HHHHHHHHHHhCCcccCccceEEEEEeccc Confidence 999999999999999999999999999855 No 15 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=100.00 E-value=2.6e-87 Score=495.32 Aligned_cols=327 Identities=13% Similarity=0.059 Sum_probs=283.1 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcceeeeeeCCCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRTKAAYLQPGESLD 80 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~~~ 80 (344) |++.+ .+|||||+++ +|.++||||+|+|||+++|+++|++++++++|+|++|||++||++|++++++|+||++++ T Consensus 1 Ms~~n----~~t~~~~~~s-g~~~al~Le~f~GeV~taF~~~si~~~~~~vRti~~gkS~qf~~~G~s~~~~~~pG~~ld 75 (401) T protein:vir:70 1 MSTPN----NLTNVAVSAS-GEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYLGETELQVLAPGQSPA 75 (401) T ss_pred CCCCc----cccccccccc-cchhHhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEeeeeEeeeecCCCCcC Confidence 88875 4799999954 477889999999999999999999999999999999999999999999999999999998 Q ss_pred CCcCCcccceEEEEeeeeeeeceeccchHHHHhChh-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhh-cccccccccccc Q lcl|NC_015719. 81 DKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYD-VRSEYTSQIGESLAMAADGAVLAELAGLIN-LADGVNENIAGL 158 (344) Q Consensus 81 ~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d-~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~-~~~~~~~~~~~~ 158 (344) ++ ++.++|++|+||+.+|++++|+|+|++|++|| +|+|+++++|++||+++||+|++.+..++. .+.++...+.+. 
T Consensus 76 ~~--~~~~dK~~ItID~lL~a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~~p~~~ 153 (401) T protein:vir:70 76 AT--STQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRTNPRVK 153 (401) T ss_pred CC--CcccccEEEEeCceeehhhhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCcC Confidence 75 67899999999999999999999999999999 899999999999999999999877754332 456777788888 Q ss_pred cCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEE-eCHHHHHHHhccchhhhhccc--cccccccc Q lcl|NC_015719. 159 GKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFY-TTPDVYSAILAALMPNAANYA--ALIDPERG 235 (344) Q Consensus 159 ~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~v-v~P~~~~~Ll~~~~~~~~~~~--~~~~~~~G 235 (344) +.|..++++.....+ ....++++++|++|...|+|++||.+ |+++ .+|.+|+.|++++++++.+|+ +++.+.+| T Consensus 154 ~~G~~i~v~~~~~~~--~~~~~~l~~ai~dA~~~LdEkdVP~~-r~vvl~pp~~Ys~Ll~~d~L~nrd~~~s~~g~~~~G 230 (401) T protein:vir:70 154 GHGFSINVEVAEGEA--LVNPQYVMAAVEFALEQQLEQEVDIS-DVAILMPWRYFNVLRDADRIVDKTYTISQSGATIQG 230 (401) T ss_pred CCceEEecccccccc--ccCHHHHHHHHHHHHHHHHhcCCCcc-ceEEEcCHHHHHHHHhcCcccchhhccccCCccccc Confidence 888888886654432 23456789999999999999999965 6655 577888899999999999986 55779999 Q ss_pred eeEEEeCeEEEEeccccccc--cccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecch Q lcl|NC_015719. 
236 SIRNVMGFEVVEVPHLTAGG--AGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAE 313 (344) Q Consensus 236 ~Vg~i~G~~V~~sn~lp~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~ 313 (344) +|.+++||+||+|||+|+++ ++.+.+ ++....+.|....++..+++++||++|++++++++++.|.|||++ T Consensus 231 ~v~~vaGv~Vv~SnnlP~~a~~it~~~l-------s~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~~lt~~~~~d~r 303 (401) T protein:vir:70 231 FTLSSYNCPVIPSNRFPKYSQGQTHHLL-------SNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSIDVTGDIFYEKK 303 (401) T ss_pred eEEEEeceEEEeeccccccccccccccc-------cccCCCccCCCCccccceeEEEEehhheEEEEeeccccchhhhhh Confidence 99999999999999999865 333332 122234444444566788999999999999999999999999999 Q ss_pred hhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 314 YQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 314 ~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +|+|+|+++++||++++||||+++++++.+. T Consensus 304 ~~~~~id~~~a~g~g~~RPeaa~vv~~k~~~ 334 (401) T protein:vir:70 304 EKTYYIDTFMAEGAIPDRWEAVSVVTTKRNT 334 (401) T ss_pred hhHHHHHHHHHhCCcccchhheEEEeecCcc Confidence 9999999999999999999999999999996 No 16 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=100.00 E-value=6.3e-87 Score=493.15 Aligned_cols=328 Identities=13% Similarity=0.077 Sum_probs=276.6 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcceeeeeeCCCCCC Q lcl|NC_015719. 
1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRTKAAYLQPGESLD 80 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~~~ 80 (344) |++.+ .+|||||++ ++|.++||||+|+|||+++|+++|++++++++|+|++|||++|+++|++++++|+||++++ T Consensus 1 Ms~~n----~~t~p~~~g-sg~~~aL~Le~f~GeV~taF~~~si~~~~~~vRtI~~gkS~qf~~lG~s~a~y~~pG~~ld 75 (400) T protein:vir:10 1 MSTPN----NLTNVAVSA-SGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYLGETELQVLAPGQSPA 75 (400) T ss_pred CCCCc----ccccccccc-ccchhhhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEeeeeEEeeecCCCCcC Confidence 88875 479999994 4578889999999999999999999999999999999999999999999999999999998 Q ss_pred CCcCCcccceEEEEeeeeeeeceeccchHHHHhChh-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhh-cccccccccccc Q lcl|NC_015719. 81 DKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYD-VRSEYTSQIGESLAMAADGAVLAELAGLIN-LADGVNENIAGL 158 (344) Q Consensus 81 ~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d-~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~-~~~~~~~~~~~~ 158 (344) ++ ++.++|++|+||+.+|++++|+|+|++|++|| +|+|+++|+|++||+++||+|++++..+.. .+..+...+.+. T Consensus 76 g~--~~~~dk~~ItIDtLL~a~~~V~dlDd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~~~g~ 153 (400) T protein:vir:10 76 AT--STQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTNPRVK 153 (400) T ss_pred CC--CcccCcEEEEeCceeeecchhhhHHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCcc Confidence 76 57899999999999999999999999999999 999999999999999999999888765432 233344444444 Q ss_pred cCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccc--cccccccce Q lcl|NC_015719. 159 GKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYA--ALIDPERGS 236 (344) Q Consensus 159 ~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~--~~~~~~~G~ 236 (344) ..+..+.+.+.+.. 
..+.++.+.++|.+|.+.|+|++||.++++++++|.+|++|+.++++++.+|+ +++.+.+|+ T Consensus 154 ~~g~s~~v~~~~~~--~~~~~~~l~~A~~~A~~~LdEkdVP~~d~vvl~pp~~Ys~Ll~~dkLvnrdf~~s~~g~~~~g~ 231 (400) T protein:vir:10 154 GHGFSVNVEVNEGE--ALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKSYTISQSGATIQGF 231 (400) T ss_pred ccccceeecccccc--cccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhCCcccchhccccCCCccccce Confidence 33333333322221 22345778889999999999999997766677788888899999999999986 456799999 Q ss_pred eEEEeCeEEEEeccccccccc--cccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchh Q lcl|NC_015719. 237 IRNVMGFEVVEVPHLTAGGAG--DDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEY 314 (344) Q Consensus 237 Vg~i~G~~V~~sn~lp~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~ 314 (344) |.+++|++||+|||+|+.+.. .+.+ ++...++.|....++..+++++||++|++++|+++++.|.|||+++ T Consensus 232 v~~v~Gv~Iv~Sn~lP~~a~~~~~~~l-------S~a~~G~~y~~t~d~s~~~av~F~~sAv~tvk~~~lt~~~~~d~r~ 304 (400) T protein:vir:10 232 VLSSYNCPVIPSNRFPKYSQGQKHHLL-------SNEDNGYRYDPIAEMNGAIAVLFTADALLVGRSIDVIGDIFYEKKE 304 (400) T ss_pred EEEEeceEEEeeCcCCcccCccccccc-------ccCCCCccCCccccccceeEEEEehhheEEEEeeccccccccchhh Confidence 999999999999999986422 2221 2222344555445677899999999999999999999999999999 Q ss_pred hhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
315 QADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 315 ~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) |+|+|+++++||++++||||+++++++.++ T Consensus 305 ~~~~id~~~a~G~g~~RPeaa~vv~~~~~~ 334 (400) T protein:vir:10 305 KTYYIDTFMSEGAIPDRWEAVSVVTTKRQS 334 (400) T ss_pred HHHHHHHHHHhCCcccchhheEEEEecCCc Confidence 999999999999999999999999999999 No 17 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=100.00 E-value=2.3e-82 Score=468.15 Aligned_cols=292 Identities=63% Similarity=0.904 Sum_probs=253.3 Q ss_pred eeeecccccEEEEeecCcceeeeeeCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHH Q lcl|NC_015719. 50 MQRQISSGKSAQFPVIGRTKAAYLQPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESL 129 (344) Q Consensus 50 ~~~~i~~G~tv~i~~iG~~t~~~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aL 129 (344) ++|+|++|||++||++|++++++|+||++++++++++++++++|+||+.+|++|.|||+|++|++||+|+++++|+|++| T Consensus 1 ~vr~i~~g~s~~~~~iG~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~~~VdDiD~~qa~~Dlr~e~s~~~G~aL 80 (324) T protein:vir:99 1 MTRTITSGKSAQFPVMGRTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDVLIYDIEDAMNHYDVRSEYSTQMGEAL 80 (324) T ss_pred CeeeeecCceEEEeeeeeeEeccccCCCCcCCCcCCcCcccEEEEecchhhhhhhhhhHHHHhcCccchhHHHHHHHHHH Confidence 88999999999999999999999999999999888899999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHhhhcccccccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCH Q lcl|NC_015719. 130 AMAADGAVLAELAGLINLADGVNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTP 209 (344) Q Consensus 130 a~~~D~~i~~~~~~~a~~~~~~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P 209 (344) |+.+|++|++++++.++...+....+....++..+...+++.. 
++...+.++|+.|++|+++|||++||.+|||+||+| T Consensus 81 A~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~-~~~~~~~~~~dai~~a~~~Lde~~VP~~gR~~vv~P 159 (324) T protein:vir:99 81 AMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKE-DPAKYGTQVIQALTYARAAFAKKYIPAGDRTFYTDP 159 (324) T ss_pred HHHHHHHHHHHHHHhhhcccccccCCcccCCccceeccccccc-ccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCh Confidence 9999999999999988877766655555545555444444433 344557789999999999999999999999999999 Q ss_pred HHHHHHhccchhhhhccccccccccceeEEEeCeEEEEeccccccccccccccccccc-----ccccccccccccccccc Q lcl|NC_015719. 210 DVYSAILAALMPNAANYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDA-----SNQKHAFPATGGKVNKE 284 (344) Q Consensus 210 ~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~ 284 (344) ++|++||+++++++.++++++.+++|.|++++||+||+|||+|+..++....+.+..+ +.+.+....|++ +++ T Consensus 160 ~~y~~Ll~~~~~~~~~~~~~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ky~~--d~~ 237 (324) T protein:vir:99 160 DTYSAILAALMPNAANYAALIDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTGKMTV--GAD 237 (324) T ss_pred HHHHHHhhcccccccccccccceecceEEEEeceEEEecCCcccccccccccccccccccccccccccccccccc--ccC Confidence 9999999888899999999999999999999999999999999976665432222111 111112223444 357 Q ss_pred ceeEEEecHHHHhhhhhheeeeeeeecchhhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
285 NVVGLFQHRSAVGTVKLKDLALERARRAEYQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 285 ~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +++||+||++|+++++++++++|.+|++++|+|+|+|+|+|||+++||||+++++++++| T Consensus 238 ~~~gl~~~~~a~~tv~~~~~~~e~~~~~~~~~d~i~~~~a~G~~~lRPe~a~~v~l~~~~ 297 (324) T protein:vir:99 238 NVVGLFVHRSAVATLKLKDMALERARRPEYQADQIIAKYAMGHGGLRPEAVGAIIFEDGE 297 (324) T ss_pred ceeEEEEehhheEEEeeecceecceechhhHHHhhhhhhhhcCcccccceEEEEEEccCc Confidence 899999999999999999999999999999999999999999999999999999999998 No 18 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=100.00 E-value=3.4e-70 Score=401.41 Aligned_cols=318 Identities=17% Similarity=0.153 Sum_probs=258.3 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeee--cccccEEEEeecCcceeeeeeCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQ--ISSGKSAQFPVIGRTKAAYLQPGES 78 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~--i~~G~tv~i~~iG~~t~~~~~~g~~ 78 (344) |+|.-+ |++-.++.+..+..|+|+++|++.|++.++++++++.++ +++|+|||||++|++++++|++|.+ T Consensus 3 ~~~~~~--------~~~~~t~~v~~fipei~s~~i~~~l~~~~v~~~~~~d~~~~~~~Gdtv~ip~~g~~~~~d~~~~~~ 74 (341) T protein:vir:94 3 LGNTIT--------GPSINTQRGQQFIPEQWLSEVQMFRKAKMLDTSVVKTWGAQVKKGDTFHVPRISELGVEDKATDVP 74 (341) T ss_pred chhhhc--------cccccchhHHHHHHHHHHHHHHHHHHhhcchhhccccccccccCCceEEEeccCcceeeeecCCCc Confidence 444333 334456667666669999999999999999999998764 5679999999999999999999998 Q ss_pred CCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccc Q lcl|NC_015719. 79 LDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGL 158 (344) Q Consensus 79 ~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~ 158 (344) ++. 
+++++++++|+||+++|+++.|+|+|+.|+++|+|++++++++++||+++|+.|+..++..+..+.+. T Consensus 75 i~~--~~~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~------- 145 (341) T protein:vir:94 75 VGV--QPVNDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQN------- 145 (341) T ss_pred ccc--ccccCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCc------- Confidence 875 46889999999999999999999999999999999999999999999999999988776543222111 Q ss_pred cCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccceeE Q lcl|NC_015719. 159 GKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSIR 238 (344) Q Consensus 159 ~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg 238 (344) .+... ............|+.|++++++||+++||.+|||+||+|++|+.|+++++|++.++.++..+++|.|+ T Consensus 146 ----~~~~~---~~~~t~~~~~~~~~~i~~a~~~Lde~~VP~~gR~lvv~P~~~~~Ll~~~~~~~~~~~g~~~l~~G~ig 218 (341) T protein:vir:94 146 ----VFSSS---NGAITGNGQAFSFAVFLAARRLLLEADVPEEKIVLLISPGQESALFTIPQFISKDFINNAPIAQGQIG 218 (341) T ss_pred ----cccCc---cccccCchhhhhHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhhchhhhhhhccccchhheeeee Confidence 00000 00011112234578899999999999999999999999999999999999999999988889999999 Q ss_pred EEeCeEEEEeccccccccccccccccccccc-------cccccccccccccccceeEEEecHHHHhhhhh---------- Q lcl|NC_015719. 239 NVMGFEVVEVPHLTAGGAGDDRPEEGTDASN-------QKHAFPATGGKVNKENVVGLFQHRSAVGTVKL---------- 301 (344) Q Consensus 239 ~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~---------- 301 (344) +++||+||+||++|..+.+.++.+.+..... ....+..+.+ .++..+||+||++|++.++. 
T Consensus 219 ~i~G~~V~~Sn~lp~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~--~~~~~~gl~~~~~av~~~k~~~~~~~~~~~ 296 (341) T protein:vir:94 219 SLMGVRVIRTSLIGNNSATGWRNGAPTIAPAEATPGFTGSRYLPKQDS--FTSLPATFTGNSRPVHTAVMCHMDWAAAVV 296 (341) T ss_pred eEeceEEEEeccccccccccccccccceeccccccccccccccccccc--ccccEEEEEEecccccceeeecchhhhccc Confidence 9999999999999998887776655432111 1122333333 45778999999999999874 Q ss_pred -heeeeeeeecchhhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 302 -KDLALERARRAEYQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 302 -~~~~~e~~~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +.+++|..|++.+|+|+|+|+++||||++||||++.|++.+-. T Consensus 297 ~~~~~~~~~~~~~~~~~~i~~~~~~G~~~lrp~~~v~~~~~~~~ 340 (341) T protein:vir:94 297 SKAPRVTQSFENREQVWLMVGRQAYGARLYRPLHAVNIHTTGDT 340 (341) T ss_pred cccccccccchhhhhhhhhhhhhhhcccccCcceeEEEecCcCC Confidence 4477888899999999999999999999999999988887777 No 19 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=100.00 E-value=2.7e-68 Score=390.99 Aligned_cols=332 Identities=17% Similarity=0.197 Sum_probs=272.0 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceee--ecccccEEEEeecCcceeeeeeCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQR--QISSGKSAQFPVIGRTKAAYLQPGES 78 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~--~i~~G~tv~i~~iG~~t~~~~~~g~~ 78 (344) |||+|+. 
+.+.|++.+..+..++..|+|+++|++.|++.+++.++++.+ +.+.|+|||||++|++++.+|++|++ T Consensus 1 ~~~~~~~---~~~~~~~~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~~~~~~~~GdTV~ip~~g~~~a~d~~~g~~ 77 (381) T protein:vir:80 1 MATIQGT---GGYKGSAVDLSNVQVFIPEVWSSEVRMFRDQKFAALEATKKIPFEGKKGDLIHIPNISRAAVYDKQPQTP 77 (381) T ss_pred Cceeccc---ccccCcccchhhHHhhhhHHHHHHHHHHHHHhhhhhhccccccceeecCceEEeeccCcceeeeecCCCc Confidence 9999963 688999999999998778999999999999999999988765 45789999999999999999999998 Q ss_pred CCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccc Q lcl|NC_015719. 79 LDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGL 158 (344) Q Consensus 79 ~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~ 158 (344) ++.+ ++++++++++||+++|+++.|+|+|+.|+++|++++++++++++||+++|+.|+..+.+......+.. . T Consensus 78 i~~~--~~~~~~~~itID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~---~-- 150 (381) T protein:vir:80 78 VNLQ--ARTDSEFTFTVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRI---Y-- 150 (381) T ss_pred cccc--ccCCceEEEEEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc---c-- Confidence 8764 68899999999999999999999999999999999999999999999999999988765443322211 1 Q ss_pred cCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccceeE Q lcl|NC_015719. 159 GKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSIR 238 (344) Q Consensus 159 ~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg 238 (344) ..+..+..+.. 
...........+++.|++|+++||+++||.+|||+||+|++|+.||++++|++.+++++..+++|.|+ T Consensus 151 t~~~~i~~~~~-~~~~t~~~~~~t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~~~~~~ad~~~~~~l~~G~Ig 229 (381) T protein:vir:80 151 SYDTTLGDGTV-NAHLTGTPAPLTYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLSINQFISVDFSQVKPVTSGVVG 229 (381) T ss_pred ccccccccccc-ccccccchhhHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhhchhhhhhhhccchhhhceeee Confidence 11111111111 11111223455789999999999999999999999999999999999999999999988899999999 Q ss_pred EEeCeEEEEeccccccccccccccccccccccc-ccccccccccc----------------------------------- Q lcl|NC_015719. 239 NVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQK-HAFPATGGKVN----------------------------------- 282 (344) Q Consensus 239 ~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~----------------------------------- 282 (344) +++||+||+||++|...++.+....+.+..... .....+.++++ T Consensus 230 ~i~G~~Vv~Sn~lp~~~~t~~~~~agap~~~~~~~~~~~~~g~~s~~a~av~~~k~yd~~~~~~~~~~~~~~g~~~~~~~ 309 (381) T protein:vir:80 230 TILGMEVIVTTQIGINSLTGYVNGQGAPTQPTPGVLGSPYLPDQAGTANVVNTGSASDLAVSLSYFGLPVFSGAGATAAD 309 (381) T ss_pred EEcceEEEeecccccccccceeeeccccccccccccccccccccccceeeeeeeeeeceeeeeeeccceeeecceeeecC Confidence 999999999999999887777665543322211 11122222221 Q ss_pred -----------ccceeEEEecHHHHhhhhhheeeeeeeecchhhhhhhhhhhhhcCceeccccEEEEEecCC Q lcl|NC_015719. 
283 -----------KENVVGLFQHRSAVGTVKLKDLALERARRAEYQADQIIAKYAMGHGGLRPESAGALVFKAG 343 (344) Q Consensus 283 -----------~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~ 343 (344) .+-..|+++|+++.+.+.++.++.+..+...|++|.|+|+++||++++||++++.|+.+.- T Consensus 310 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 381 (381) T protein:vir:80 310 GGQTLGSFGGANRWATAVVCHPDWLAVGVQQNVKSESSRETMYLADAFVTSCVYGAKVFRPDHCVLLHTSGI 381 (381) T ss_pred CCceeeeehhhhhhhhhcccccccccccceeEeecccchhheeehhhhhhhhhhccccccchhhhhhhhcCC Confidence 1333678899999999999999999999999999999999999999999999999999988 No 20 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=100.00 E-value=7e-60 Score=344.91 Aligned_cols=307 Identities=12% Similarity=0.064 Sum_probs=228.7 Q ss_pred CCCccccccccccccccccccchhhhh-HHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcceeeeeeCCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALF-LKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRTKAAYLQPGESL 79 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~-~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~~ 79 (344) |+. ||.+++.+++| .|+|+.+++..++++.+...+.+......|+|||||+||++++++|++++++ T Consensus 1 ~~~-------------~n~ts~~qafi~~EiWsa~il~~l~~~Lv~~~~~~~~d~g~GDtV~InsIg~~tV~dY~~~~~i 67 (322) T protein:vir:31 1 MST-------------GNNTSNTQALIVSEIWADEIEDILHEKLLDVNIARVVDFPDGDKLTIPSVGTPVVRSRPEQGDF 67 (322) T ss_pred CCC-------------CCCcccceEEeehhhhHHHHHHHhhhhhhhhhhhcccccCCCCeEEeccccccccccccCCCCc Confidence 443 22445555677 5999999999998988888888866667899999999999999999999988 Q ss_pred CCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccc Q lcl|NC_015719. 80 DDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGLG 159 (344) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~~ 159 (344) .. 
+++++++.+|+|||.|||+|.||| |++|.++|+++.++++++++|++.+|+++...+..++....... T Consensus 68 ~~--d~ltt~~~~l~IDq~KYfaf~VdD-D~~Qa~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~------- 137 (322) T protein:vir:31 68 TF--DNLDTGEISIILRDEVYAGNAISK-KLRQDSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQN------- 137 (322) T ss_pred cc--ccCCCceEEEEEehhhhhccccch-hHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccC------- Confidence 65 478999999999999999999999 99999999999999999999999999999877775553221111 Q ss_pred CceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHH---------Hhccchhhhhcccccc Q lcl|NC_015719. 160 KPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSA---------ILAALMPNAANYAALI 230 (344) Q Consensus 160 ~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~---------Ll~~~~~~~~~~~~~~ 230 (344) ..+++. +.......+++.....|+.|++++.+|||++||.+|||+||+|+++.. |+++++|+...-.|. T Consensus 138 ~p~vin-~~~~~iv~~gt~~~~ay~~lv~l~~kLdkanVP~~gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG~- 215 (322) T protein:vir:31 138 DPNVIN-GVPHRFVGTGTDQTMDVTDFSRVNYVMTQSKMPMGGMIGIIDPSVAHHLETITNISNISNNPRWEGIVESGI- 215 (322) T ss_pred Ccceec-CCccceeccCCCchhhHHHHHHHHHHhccccCCCCCeEEEeCchhhhhhhhhhhhhhhhccccccccccccc- Confidence 011111 000111112222344589999999999999999999999999999764 577888876543332 Q ss_pred ccccc--eeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeee Q lcl|NC_015719. 231 DPERG--SIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALER 308 (344) Q Consensus 231 ~~~~G--~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~ 308 (344) .+| .||+++||+||+||++|..+ +.+.+|.++... ++.+.-.+ .+.+-+...+..+++.++++.|. 
T Consensus 216 --a~g~~~Vg~~~GF~V~~SN~l~~~~---~~i~aG~d~~~t---~ag~~n~f----~~~~~~~~~~~~~~~~~l~~~e~ 283 (322) T protein:vir:31 216 --APDMQFVRSVYGIDLFVSNLLADAN---ETINAGGDARST---TAGKCNMF----MNVSDMGLLPFVVAWKEMPTTKS 283 (322) T ss_pred --hhhHHHHHHHhceeeeeeccccccc---cccccCcccccc---cceeeccc----ccccchhhhhhhhHhhhhhhhhc Confidence 223 49999999999999997533 222233222211 11111100 01111233455677888889999 Q ss_pred eecchhhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 309 ARRAEYQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 309 ~~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +|++++|+|.++++++||++++|||.+++|.+.+-- T Consensus 284 ~r~~~~~~d~~~~~~~~g~g~~r~e~l~~~~a~~~~ 319 (322) T protein:vir:31 284 FIDDYNDDLNTATTARWGNGLVRDENLVCVLANADK 319 (322) T ss_pred ccCccccccceeeeeeecceeecccceEEEEecccc Confidence 999999999999999999999999999999987766 No 21 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=100.00 E-value=1.3e-58 Score=337.86 Aligned_cols=267 Identities=20% Similarity=0.175 Sum_probs=222.8 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceee---ecccccEEEEeecCcceeeeeeC-C Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQR---QISSGKSAQFPVIGRTKAAYLQP-G 76 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~---~i~~G~tv~i~~iG~~t~~~~~~-g 76 (344) ||+.+ +-.|+|+++|++.|++.+++.++++.. +++.|+|||||++|++++.+|++ + T Consensus 1 MA~~~--------------------~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~ 60 (273) T protein:vir:10 1 MAFNN--------------------FIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAG 60 (273) T ss_pred Ccchh--------------------hhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCC Confidence 66621 235999999999999999999998653 67889999999999999999986 4 Q ss_pred CCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Q lcl|NC_015719. 
77 ESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIA 156 (344) Q Consensus 77 ~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (344) ..++ .+++.+++++++||+.+|+++.|+|+|+.++++|+++ ++++++++||+.+|+.++..+...+.. T Consensus 61 ~~~~--~~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~~--------- 128 (273) T protein:vir:10 61 RQTS--ADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTA--------- 128 (273) T ss_pred CccC--ccccccceEEEEEeeeeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhccccc--------- Confidence 4443 3578899999999999999999999999999999865 999999999999999998776532110 Q ss_pred cccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhh-hhccc-ccccccc Q lcl|NC_015719. 157 GLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPN-AANYA-ALIDPER 234 (344) Q Consensus 157 ~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~-~~~~~-~~~~~~~ 234 (344) . ..++ ......+++.|++|+++|++++||.+|||+||+|++|+.|++++.++ +.+.. +...+++ T Consensus 129 -------~--~~~~-----~~~~~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~ 194 (273) T protein:vir:10 129 -------L--TGSA-----PTDADDAFDLIAKALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRA 194 (273) T ss_pred -------c--cccc-----ccchhHHHHHHHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceee Confidence 0 0000 11124578999999999999999999999999999999999988755 45554 4456899 Q ss_pred ceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchh Q lcl|NC_015719. 235 GSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEY 314 (344) Q Consensus 235 G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~ 314 (344) |.||+++||+||+||++|.++. ..+++||++|+++++++. 
++|..|++++ T Consensus 195 G~ig~i~G~~v~~s~~lp~~~~-----------------------------~~~~~~~~~A~~~a~q~~-~~e~~r~~~~ 244 (273) T protein:vir:10 195 GTIGNLLGARIVESNNLRDTDD-----------------------------EQFVAFHPSAAAYVSQID-TVEALRDQDS 244 (273) T ss_pred eeeeEEeceEEEEecccccCCc-----------------------------cEEEEEeccceeeeeeee-hhhcccCCCc Confidence 9999999999999999996432 123778999999998655 8999999999 Q ss_pred hhhhhhhhhhhcCceeccccEEEEEecCC Q lcl|NC_015719. 315 QADQIIAKYAMGHGGLRPESAGALVFKAG 343 (344) Q Consensus 315 ~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~ 343 (344) |+|.|+|+++||++++|||++++|+.+++ T Consensus 245 ~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 245 FSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred ceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 99999999999999999999999998888 No 22 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=100.00 E-value=1.3e-58 Score=337.86 Aligned_cols=267 Identities=20% Similarity=0.175 Sum_probs=222.8 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceee---ecccccEEEEeecCcceeeeeeC-C Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQR---QISSGKSAQFPVIGRTKAAYLQP-G 76 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~---~i~~G~tv~i~~iG~~t~~~~~~-g 76 (344) ||+.+ +-.|+|+++|++.|++.+++.++++.. +++.|+|||||++|++++.+|++ + T Consensus 1 MA~~~--------------------~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~ 60 (273) T protein:vir:10 1 MAFNN--------------------FIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAG 60 (273) T ss_pred Ccchh--------------------hhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCC Confidence 66621 235999999999999999999998653 67889999999999999999986 4 Q ss_pred CCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Q lcl|NC_015719. 
77 ESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIA 156 (344) Q Consensus 77 ~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (344) ..++ .+++.+++++++||+.+|+++.|+|+|+.++++|+++ ++++++++||+.+|+.++..+...+.. T Consensus 61 ~~~~--~~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~~--------- 128 (273) T protein:vir:10 61 RQTS--ADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTA--------- 128 (273) T ss_pred CccC--ccccccceEEEEEeeeeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhccccc--------- Confidence 4443 3578899999999999999999999999999999865 999999999999999998776532110 Q ss_pred cccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhh-hhccc-ccccccc Q lcl|NC_015719. 157 GLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPN-AANYA-ALIDPER 234 (344) Q Consensus 157 ~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~-~~~~~-~~~~~~~ 234 (344) . ..++ ......+++.|++|+++|++++||.+|||+||+|++|+.|++++.++ +.+.. +...+++ T Consensus 129 -------~--~~~~-----~~~~~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~ 194 (273) T protein:vir:10 129 -------L--TGSA-----PTDADDAFDLIAKALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRA 194 (273) T ss_pred -------c--cccc-----ccchhHHHHHHHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceee Confidence 0 0000 11124578999999999999999999999999999999999988755 45554 4456899 Q ss_pred ceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchh Q lcl|NC_015719. 235 GSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEY 314 (344) Q Consensus 235 G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~ 314 (344) |.||+++||+||+||++|.++. ..+++||++|+++++++. 
++|..|++++ T Consensus 195 G~ig~i~G~~v~~s~~lp~~~~-----------------------------~~~~~~~~~A~~~a~q~~-~~e~~r~~~~ 244 (273) T protein:vir:10 195 GTIGNLLGARIVESNNLRDTDD-----------------------------EQFVAFHPSAAAYVSQID-TVEALRDQDS 244 (273) T ss_pred eeeeEEeceEEEEecccccCCc-----------------------------cEEEEEeccceeeeeeee-hhhcccCCCc Confidence 9999999999999999996432 123778999999998655 8999999999 Q ss_pred hhhhhhhhhhhcCceeccccEEEEEecCC Q lcl|NC_015719. 315 QADQIIAKYAMGHGGLRPESAGALVFKAG 343 (344) Q Consensus 315 ~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~ 343 (344) |+|.|+|+++||++++|||++++|+.+++ T Consensus 245 ~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 245 FSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred ceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 99999999999999999999999998888 No 23 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=100.00 E-value=1.4e-56 Score=326.82 Aligned_cols=267 Identities=20% Similarity=0.171 Sum_probs=221.0 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceee---ecccccEEEEeecCcceeeeeeC-C Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQR---QISSGKSAQFPVIGRTKAAYLQP-G 76 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~---~i~~G~tv~i~~iG~~t~~~~~~-g 76 (344) ||+.+ +..|+|+++|++.|++.+++.++++.. ....|+|||||++|.+++.+|++ | T Consensus 1 MA~~~--------------------~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~~~~~d~~~~~ 60 (273) T protein:vir:79 1 MAFNN--------------------FIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAG 60 (273) T ss_pred Ccchh--------------------hhHHHHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecCcccccccccCC Confidence 77621 235999999999999999999987653 33469999999999999998875 5 Q ss_pred CCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Q lcl|NC_015719. 
77 ESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIA 156 (344) Q Consensus 77 ~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (344) ..++ .+++++++++++||+.+++++.|+|+|+.|+++|++ +++++++++||+++|+.++..+..+... T Consensus 61 ~~~~--~~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~~~-~~~~~~~~ala~~vD~~i~~~~~~a~~~--------- 128 (273) T protein:vir:79 61 RQTS--ADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSLE-AYTRAGATALATDTDKFIADMLVDNGTA--------- 128 (273) T ss_pred CccC--ccccccceEEEEEeeecccceeeccHHHHhhcccHH-HHHHHHHHHHHHHHHHHHHHHHhhcccc--------- Confidence 5454 357899999999999999999999999999999987 5999999999999999998766532110 Q ss_pred cccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccch-hhhhcccc-cccccc Q lcl|NC_015719. 157 GLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALM-PNAANYAA-LIDPER 234 (344) Q Consensus 157 ~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~-~~~~~~~~-~~~~~~ 234 (344) ...+. . .....+++.|.+|+.+||+++||.+|||+||+|++|+.||+++. +.+.++.+ +..+++ T Consensus 129 -------~~~~~---~----~~~~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~~~l~~ 194 (273) T protein:vir:79 129 -------LTGSA---P----SDADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRA 194 (273) T ss_pred -------ccccc---c----cchhhHHHHHHHHHHHhhhccCCccCcEEEECHHHHHHHhhchhhhhhhhhcccccceee Confidence 00000 0 11234688999999999999999999999999999999999876 55666654 456899 Q ss_pred ceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchh Q lcl|NC_015719. 235 GSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEY 314 (344) Q Consensus 235 G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~ 314 (344) |.||+++||+||+||++|.++. ...+.+|++|++.++++. 
++|..|++++ T Consensus 195 G~ig~~~G~~i~~s~~lp~~~~-----------------------------~~~~a~~~~A~~~a~~~~-~~e~~r~~~~ 244 (273) T protein:vir:79 195 GTIGNLLGARIVESNNLRDTDD-----------------------------EQFVAFHPSAAAYVSQID-TVEALRDQDS 244 (273) T ss_pred eEeeEEeceEEEecccccccCc-----------------------------eEEEEEeccceeeeeehh-hhhcccCccc Confidence 9999999999999999996432 113678999999988665 8999999999 Q ss_pred hhhhhhhhhhhcCceeccccEEEEEecCC Q lcl|NC_015719. 315 QADQIIAKYAMGHGGLRPESAGALVFKAG 343 (344) Q Consensus 315 ~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~ 343 (344) |+|.|+|+++||++++|||++++|+.+++ T Consensus 245 ~~~~v~~~~~yg~~v~~p~~vv~~~~~g~ 273 (273) T protein:vir:79 245 FSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred ceeeeeeeeeeeeEEecCceEEEEeccCC Confidence 99999999999999999999999998888 No 24 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=100.00 E-value=1e-53 Score=311.09 Aligned_cols=308 Identities=13% Similarity=0.118 Sum_probs=226.5 Q ss_pred CC--Ccccc-ccccccccccccccchhhhhHHHHhhHHHHHHHH-hhhhcCCceeeecc-cccE------EEEeecCcce Q lcl|NC_015719. 1 MA--NMQGG-QQLGTNQGKGQSAADKLALFLKVFGGEVLTAFAR-TSVTANRHMQRQIS-SGKS------AQFPVIGRTK 69 (344) Q Consensus 1 ma--~~~~~-~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~-~s~~~~~~~~~~i~-~G~t------v~i~~iG~~t 69 (344) |+ ++-++ ..+. .++.+.|+|+|.++|+..||. +|+|++.++.++-. ++.+ +.++.+++.. T Consensus 1 ~~~~~~~~~~~~Ms---------~~i~~~fv~qy~~~v~~~~qq~~s~L~~tV~~~~~~~~~~~~~~~~~~~~~~~~~~~ 71 (322) T protein:vir:10 1 MKLNAIMSMLPLIA---------GDIDQAFVQTYETTLRILSQQKSAKLKQYCQHKNESSESHNWETLASMDPDAVKRKR 71 (322) T ss_pred Ccccceeeeeeeee---------chhhhHHHHHHHHHHHHHHHHhhhhhhcccccccccccccceeeccccccccccccc Confidence 33 11111 2222 245667999999999999995 79999999987643 3333 3334444444 Q ss_pred eeeeeCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|NC_015719. 
70 AAYLQPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLAD 149 (344) Q Consensus 70 ~~~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~ 149 (344) +..+.+.+..+.+...++++.+.+.++++ |+.++|||+|+.|+++|+++++++++++||+|++|+.|+..+.+.+... T Consensus 72 ~~~~~~d~~~dtp~~~~~~~~r~~~~~d~-~~~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~~~- 149 (322) T protein:vir:10 72 SRQQSADGTYPTPVNNKPFAKRRTNVDTY-DTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPASIK- 149 (322) T ss_pred ccccccCcccCCCccccccceEEEeeccc-ccceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhcccccc- Confidence 44444444444444455677777666555 7889999999999999999999999999999999999976555433211 Q ss_pred ccccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCC-CEEEeCHHHHHHHhccchhhhhcccc Q lcl|NC_015719. 150 GVNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPAND-RTFYTTPDVYSAILAALMPNAANYAA 228 (344) Q Consensus 150 ~~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~g-R~~vv~P~~~~~Ll~~~~~~~~~~~~ 228 (344) ..|+.+...+.....+.. ....++.|++|+++|+|++||.++ ||+||+|++|+.||++++|++.+|.+ T Consensus 150 ---------~~gt~v~~~ss~~i~~g~--~g~t~~kl~~a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~d~~~ts~D~~~ 218 (322) T protein:vir:10 150 ---------GTGQPVEFLATQEIGDGT--KPISFDYVTEITERFLENEIEPEVSKVIVIGPTQARKLLQITEATSADYTS 218 (322) T ss_pred ---------ccccccccCCCcccccCc--cchhHHHHHHHHHHHHhcCCCCCCCeEEEeCHHHHHHHhcchhhhhhhccc Confidence 111222222221111111 122478899999999999999765 99999999999999999999999998 Q ss_pred cccc-ccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeee Q lcl|NC_015719. 
229 LIDP-ERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALE 307 (344) Q Consensus 229 ~~~~-~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e 307 (344) ...+ ++|.|++++||+|++||+||..+.+..+.+......+ +...+++||++|+++++.+++++| T Consensus 219 ~~~l~~~G~ig~~lGf~~i~s~~lp~~~~t~~~~~~~~~~~~--------------~~~~~~a~~k~Av~~a~~~dv~~~ 284 (322) T protein:vir:10 219 AMDLQSKGIITNWMGYTWIVSTRLDKFDPTQWGMAAEDGPQG--------------DEIWCIAMTDMALGYHSCKDIWTK 284 (322) T ss_pred chhhhhcCeeeeeeeEEEEEeccCCccccccccccccCCCCc--------------cceeEEEEecCceeEEEeeeeeEE Confidence 8887 5799999999999999999988776665443221111 123358999999999999999999 Q ss_pred eee-cchhhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 308 RAR-RAEYQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 308 ~~~-~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ..+ +.+.++|.|.++++||+++++|+++++|.+.+-= T Consensus 285 i~~~~~~~~a~~I~~~~~~Ga~ri~~~gVv~i~~~e~~ 322 (322) T protein:vir:10 285 VAEDPSASFAWRIYSAFTADCVRVEDEHIFKLRLKNSL 322 (322) T ss_pred eeccCCcchhhhhhhhhhhCceEeccCcEEEEEEeccC Confidence 665 5577799999999999999999999999997766 No 25 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=100.00 E-value=4.5e-48 Score=280.14 Aligned_cols=217 Identities=22% Similarity=0.272 Sum_probs=168.2 Q ss_pred eeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccCceeeeccccccccc Q lcl|NC_015719. 95 IDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGLGKPSLLEVGAKADLTD 174 (344) Q Consensus 95 iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~~~~~~i~~~~~~~~t~ 174 (344) ||++++++|.|||+|++|++||+|+++++|+|++||+++|++|++++++++....+.+..+.+. .. .+.. +.. 
T Consensus 1 iD~lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~~g~---~~-~~~a-~~t-- 73 (221) T protein:vir:17 1 MDDLLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQDGGF---SV-NIGA-GNT-- 73 (221) T ss_pred CCcchhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccccccCc---ce-eccc-ccc-- Confidence 9999999999999999999999999999999999999999999999999888777665544322 11 1111 111 Q ss_pred chhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhc--cchhhhhcccc-ccccccc-eeEEEeCeEEEEecc Q lcl|NC_015719. 175 PVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILA--ALMPNAANYAA-LIDPERG-SIRNVMGFEVVEVPH 250 (344) Q Consensus 175 ~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~--~~~~~~~~~~~-~~~~~~G-~Vg~i~G~~V~~sn~ 250 (344) ..++++|+.|++|+++|||++||++|||+||+|++|+.||+ ++++.+.++++ ...+++| .|++++||+||+||| T Consensus 74 --~~~~~l~dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~~~~g~~i~~v~G~~V~~Snn 151 (221) T protein:vir:17 74 --NNAQAIVDGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNREIGNTQGDMNTGKGLYVNAGIRIYKSNV 151 (221) T ss_pred --CCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeeeecccccccccccceeeeecCcEEEEecc Confidence 23567899999999999999999999999999999888886 46778888765 4568888 499999999999999 Q ss_pred ccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchhhhhhhhhhhhhcCcee Q lcl|NC_015719. 251 LTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEYQADQIIAKYAMGHGGL 330 (344) Q Consensus 251 lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~ 330 (344) +|+.+++.+...++.. ....+..++|+++ +++++||+|||+|++++|++.+- .|+|.-.+ =..++ T Consensus 152 lP~~~gt~~~~~ag~~-~~~~~~~~~yr~~--fs~~~glv~~~~Avgtvkl~~~~---~~~~~~~~---------~~~~~ 216 (221) T protein:vir:17 152 LASLYGTNLVTDPGDA-TTSGENNGSYRPA--ITDRAGLVFHKEAADTVEVLLPP---SRPPLVIS---------MFSIR 216 (221) T ss_pred CCcccccccccCCccc-ccccccccccccc--ccceEEEEEcchheeeeeeecCC---CCCceeee---------eeecc Confidence 9998887766554432 3445556677765 67799999999999999987642 23322111 12456 Q ss_pred ccccE Q lcl|NC_015719. 
331 RPESA 335 (344) Q Consensus 331 Rp~~~ 335 (344) |||.- T Consensus 217 ~~~~~ 221 (221) T protein:vir:17 217 RPDRR 221 (221) T ss_pred CCCCC Confidence 66655 No 26 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=100.00 E-value=1e-44 Score=261.73 Aligned_cols=285 Identities=12% Similarity=0.017 Sum_probs=218.6 Q ss_pred CCCccccccccccccccccccchhhh-hHHHHhhHHHHHHHHhhhhcCC-ce-eeecccccEEEEeecCcceeeeeeCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLAL-FLKVFGGEVLTAFARTSVTANR-HM-QRQISSGKSAQFPVIGRTKAAYLQPGE 77 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l-~~e~f~geV~~~f~~~s~~~~~-~~-~~~i~~G~tv~i~~iG~~t~~~~~~g~ 77 (344) .-|.++--. ..-+|..+.+=++..+ +.|+|++.+++.+...++...+ ++ ...+.+|++||||+++.+.++||++++ T Consensus 5 ~~~~~~~~~-~~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~DY~R~~ 83 (319) T protein:vir:97 5 IKNATGMLK-LNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKRNA 83 (319) T ss_pred cccccceeE-eehhhhhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhcccCcceEeccCcEEEEeeecccccccccCCC Confidence 222222111 1122333333333333 4499999999988888777644 34 246679999999999999999999987 Q ss_pred CCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhH--HHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|NC_015719. 78 SLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDV--RSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENI 155 (344) Q Consensus 78 ~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~--~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~ 155 (344) .... +.++.+..+++||+.+||.|.||++|.+|++.++ ...+.+.+...++..+|.+.+..++..+.. 
T Consensus 84 g~~~--g~vt~~~~t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~-------- 153 (319) T protein:vir:97 84 TNEF--DHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAK-------- 153 (319) T ss_pred Cccc--CCcccceeEEEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhccc-------- Confidence 6654 5689999999999999999999999999998776 455677888899999999988777643211 Q ss_pred ccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccc Q lcl|NC_015719. 156 AGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERG 235 (344) Q Consensus 156 ~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G 235 (344) ..+ +.....++|+.|+++.++|+|++|| ++||++|+|++|.+|+++++|+.....++..+++| T Consensus 154 -------~~~---------~~~t~~n~y~~i~~a~~~Lde~~VP-~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~~g 216 (319) T protein:vir:97 154 -------HLT---------VGTGSDAQYDAVLDVSVELDEIKAP-ENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKG 216 (319) T ss_pred -------ccc---------cccCHHHHHHHHHHHHHHHHhcCCC-CCcEEEeCHHHHHHHHhhhhhhccccccccceeee Confidence 000 0111356799999999999999999 69999999999999999999988766666778999 Q ss_pred eeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeee-cchh Q lcl|NC_015719. 236 SIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERAR-RAEY 314 (344) Q Consensus 236 ~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~-~~~~ 314 (344) .|++++||+|+++|+... .+.-.++.|++|+.++...+ .+|.++ .+.+ T Consensus 217 ~Vg~idG~~Vi~vps~~~------------------------------k~in~i~~h~~A~~~~~k~~-~~~~~~p~~~~ 265 (319) T protein:vir:97 217 VQGELDGFVIVKVPTKLL------------------------------QGLQAIAVVGEVLASPIQAD-LAKTNSNIPGM 265 (319) T ss_pred eceeecCeEEEEeccccc------------------------------ccceEEEEcCCeeeeeeeee-eeeccCCCccc Confidence 999999999999865421 11223788999998877655 688776 5899 Q ss_pred hhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
315 QADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 315 ~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) |+|.|++++.||++|+||+..+++....++ T Consensus 266 ~a~~v~gr~y~d~~V~~~k~~~Iy~~~~~~ 295 (319) T protein:vir:97 266 FGTLAEQLLYTGAFVPEHLQKYIFTIGGTE 295 (319) T ss_pred cceeeeeeeeeeeEEeccccceEEEeecCC Confidence 999999999999999999998888766665 No 27 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=100.00 E-value=1e-44 Score=261.73 Aligned_cols=285 Identities=12% Similarity=0.017 Sum_probs=218.6 Q ss_pred CCCccccccccccccccccccchhhh-hHHHHhhHHHHHHHHhhhhcCC-ce-eeecccccEEEEeecCcceeeeeeCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLAL-FLKVFGGEVLTAFARTSVTANR-HM-QRQISSGKSAQFPVIGRTKAAYLQPGE 77 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l-~~e~f~geV~~~f~~~s~~~~~-~~-~~~i~~G~tv~i~~iG~~t~~~~~~g~ 77 (344) .-|.++--. ..-+|..+.+=++..+ +.|+|++.+++.+...++...+ ++ ...+.+|++||||+++.+.++||++++ T Consensus 5 ~~~~~~~~~-~~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~DY~R~~ 83 (319) T protein:vir:94 5 IKNATGMLK-LNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKRNA 83 (319) T ss_pred cccccceeE-eehhhhhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhcccCcceEeccCcEEEEeeecccccccccCCC Confidence 222222111 1122333333333333 4499999999988888777644 34 246679999999999999999999987 Q ss_pred CCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhH--HHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|NC_015719. 78 SLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDV--RSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENI 155 (344) Q Consensus 78 ~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~--~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~ 155 (344) .... +.++.+..+++||+.+||.|.||++|.+|++.++ ...+.+.+...++..+|.+.+..++..+.. 
T Consensus 84 g~~~--g~vt~~~~t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~-------- 153 (319) T protein:vir:94 84 TNEF--DHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAK-------- 153 (319) T ss_pred Cccc--CCcccceeEEEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhccc-------- Confidence 6654 5689999999999999999999999999998776 455677888899999999988777643211 Q ss_pred ccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccc Q lcl|NC_015719. 156 AGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERG 235 (344) Q Consensus 156 ~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G 235 (344) ..+ +.....++|+.|+++.++|+|++|| ++||++|+|++|.+|+++++|+.....++..+++| T Consensus 154 -------~~~---------~~~t~~n~y~~i~~a~~~Lde~~VP-~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~~g 216 (319) T protein:vir:94 154 -------HLT---------VGTGSDAQYDAVLDVSVELDEIKAP-ENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKG 216 (319) T ss_pred -------ccc---------cccCHHHHHHHHHHHHHHHHhcCCC-CCcEEEeCHHHHHHHHhhhhhhccccccccceeee Confidence 000 0111356799999999999999999 69999999999999999999988766666778999 Q ss_pred eeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeee-cchh Q lcl|NC_015719. 236 SIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERAR-RAEY 314 (344) Q Consensus 236 ~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~-~~~~ 314 (344) .|++++||+|+++|+... .+.-.++.|++|+.++...+ .+|.++ .+.+ T Consensus 217 ~Vg~idG~~Vi~vps~~~------------------------------k~in~i~~h~~A~~~~~k~~-~~~~~~p~~~~ 265 (319) T protein:vir:94 217 VQGELDGFVIVKVPTKLL------------------------------QGLQAIAVVGEVLASPIQAD-LAKTNSNIPGM 265 (319) T ss_pred eceeecCeEEEEeccccc------------------------------ccceEEEEcCCeeeeeeeee-eeeccCCCccc Confidence 999999999999865421 11223788999998877655 688776 5899 Q ss_pred hhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
315 QADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 315 ~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) |+|.|++++.||++|+||+..+++....++ T Consensus 266 ~a~~v~gr~y~d~~V~~~k~~~Iy~~~~~~ 295 (319) T protein:vir:94 266 FGTLAEQLLYTGAFVPEHLQKYIFTIGGTE 295 (319) T ss_pred cceeeeeeeeeeeEEeccccceEEEeecCC Confidence 999999999999999999998888766665 No 28 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=100.00 E-value=1.6e-44 Score=260.75 Aligned_cols=285 Identities=13% Similarity=0.008 Sum_probs=221.1 Q ss_pred CCCccccccccccccccccccchhhh-hHHHHhhHHHHHHHHhhhhcCC-ce-eeecccccEEEEeecCcceeeeeeCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLAL-FLKVFGGEVLTAFARTSVTANR-HM-QRQISSGKSAQFPVIGRTKAAYLQPGE 77 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l-~~e~f~geV~~~f~~~s~~~~~-~~-~~~i~~G~tv~i~~iG~~t~~~~~~g~ 77 (344) .-|.++--. ..-+|..+.+-.+..+ +.|+|++.+++.|...++.... ++ ...+.+|++||||+++.+.++||++++ T Consensus 16 ~~~~~~~~~-~~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~~g~tVkIp~i~~~gl~DY~R~~ 94 (329) T protein:vir:10 16 IKNATGKLK-LNLQHFANKSVEPGDTLLKNKHVGILEKVTAANSYSAPAVISNDAIFMQGRSFTVIKGDVTELKDYKRNA 94 (329) T ss_pred hhcccceeE-EehhhhcCCccCCchhHHHHHHHHHHHHHHHhhceeeeeecccceeeccCcEEEEeeecccccccccCCC Confidence 223222111 1223444444444444 4499999999999988776644 33 246779999999999999999999987 Q ss_pred CCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhH--HHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|NC_015719. 78 SLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDV--RSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENI 155 (344) Q Consensus 78 ~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~--~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~ 155 (344) .... +.++.+..+++||+.+||.|.||++|..|++.++ ...+.+.+.+.++..+|.+.+..++..+.. 
T Consensus 95 g~~~--g~vt~~~~t~tidqdR~~~F~VD~~D~dEtn~~l~a~~i~~~~~~~~v~pEiDay~~skla~~a~~-------- 164 (329) T protein:vir:10 95 TNEF--DHPQIQETTYFLDQEKYWGRFVDALDRRDTEGNIDINYVVAKQASEVVAPYLDNLRFATLARNKAK-------- 164 (329) T ss_pred Cccc--cccccceeEEEeecccceeeecchhhHhhhhhhhhHHHHHHHHHHHHhhhHHHHHHHHHHHhhccc-------- Confidence 6644 5688999999999999999999999999998775 455667789999999999998777643211 Q ss_pred ccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccc Q lcl|NC_015719. 156 AGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERG 235 (344) Q Consensus 156 ~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G 235 (344) ..+ . .....++|+.|+++.++|+|++|| ++||++|+|++|.+|+++++|+......+..+++| T Consensus 165 -------~~~----~-----~~t~~nay~~i~~a~~~Lde~~vp-~~Rvl~VtP~~~~~Lk~~~~f~~~~~~~~~~~~~g 227 (329) T protein:vir:10 165 -------HLT----V-----GSGADAQYDAVLDVSVELDEIGAG-ASRILFVTPKFYKGIKKFVIELPQGDNRQQVLGKG 227 (329) T ss_pred -------ccc----c-----ccCHHHHHHHHHHHHHHHHhcCCC-CCcEEEeCHHHHHHHHhhhhhhccccccccceeee Confidence 000 0 111356799999999999999999 58999999999999999999987665666678999 Q ss_pred eeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeee-cchh Q lcl|NC_015719. 236 SIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERAR-RAEY 314 (344) Q Consensus 236 ~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~-~~~~ 314 (344) .|++++||+|+++|+.... +.-.++.|++|+.++...+ .+|.++ .+.+ T Consensus 228 ~Vg~idG~~Ii~vps~~~k------------------------------~in~ii~~~~A~~~~~K~~-~~~~~~p~~~~ 276 (329) T protein:vir:10 228 VQGELDGFTIVKVPSKMLQ------------------------------GVEAMAVIGEVMASPIQAN-EAKLNSNVPGM 276 (329) T ss_pred eeeeecCeEEEEecCCccc------------------------------ceeEEEEcCCceeeeeeee-eeeeeCCCCcc Confidence 9999999999998764321 1223788999998887666 788876 5899 Q ss_pred hhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
315 QADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 315 ~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ++|.|.+++.||++|+||+..+++.....| T Consensus 277 ~a~~v~gr~yyd~~V~~~k~~~I~~~~~~a 306 (329) T protein:vir:10 277 FGTLAEQMLYTGAFVPEHLQKYIFTIGGKE 306 (329) T ss_pred chheeeeeeeeeeEEEccccCEEEEecccC Confidence 999999999999999999998887765555 No 29 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=100.00 E-value=1.7e-44 Score=260.59 Aligned_cols=272 Identities=16% Similarity=0.122 Sum_probs=223.8 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCcee-eec--ccccEEEEeecCcc-eeeeeeCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQ-RQI--SSGKSAQFPVIGRT-KAAYLQPG 76 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~-~~i--~~G~tv~i~~iG~~-t~~~~~~g 76 (344) |||+++ +... -+..|+|+..|.+.|.+..++.++... +++ +.|++|+||+++.. .+.+|..| T Consensus 1 Ma~~~T------~~~~--------~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~~~~~g 66 (278) T protein:vir:80 1 MADLTT------KLAN--------LIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQDVAEG 66 (278) T ss_pred CCCcce------ehhh--------eecHHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeeccCCcceeecCC Confidence 999653 3322 256699999999999999898887754 334 46999999998754 46789999 Q ss_pred CCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Q lcl|NC_015719. 77 ESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIA 156 (344) Q Consensus 77 ~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (344) +.++. ++++.++.+++|++.. ..|.|+|++..++..|++++++++++++|++++|+.++..+.+.... 
T Consensus 67 ~~i~~--~~lt~~~~~~~i~~~~-~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~~--------- 134 (278) T protein:vir:80 67 AAIDY--SALETESVKHGIKKAG-KGVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTLE--------- 134 (278) T ss_pred CcCcc--cccccceeeEeeehhh-ccccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhccccc--------- Confidence 98875 4789999999999975 58999999999999999999999999999999999998877542110 Q ss_pred cccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccc--hhhhhcccccccccc Q lcl|NC_015719. 157 GLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAAL--MPNAANYAALIDPER 234 (344) Q Consensus 157 ~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~--~~~~~~~~~~~~~~~ 234 (344) ...+. + .......++.|.++..+|+++++|. .|+++|+|++|+.|+++. +|+..+..++..+++ T Consensus 135 ---------~~~~~--t--~~~~~~~~~~~~da~~~l~~~~~~~-~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~ 200 (278) T protein:vir:80 135 ---------VKGAI--N--IGLIDKIENTFTDAPDAIEDESITT-TGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVK 200 (278) T ss_pred ---------ccccc--c--cchhhhHHHHHHHHHHhhcccCCCc-ccEEEECHHHHHHHHhhhhhhccccccccccceee Confidence 00000 0 1112345788999999999999995 578999999999998875 677676667778999 Q ss_pred ceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchh Q lcl|NC_015719. 235 GSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEY 314 (344) Q Consensus 235 G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~ 314 (344) |.|+++.||+||+||++|.+ .++++++.|++++..+++++|.+||+++ T Consensus 201 G~ig~~~G~~Vi~s~~~p~~--------------------------------t~~l~~~gAi~~~~~~~~~vE~~Rd~~~ 248 (278) T protein:vir:80 201 GAFGELLGWEIVRTKKLADG--------------------------------NALAVKAGALKTFLKRNLLAESGRDMDH 248 (278) T ss_pred ccceeecceeEEEcCCCCcc--------------------------------eEEEEeccceeeeecCCcccccccchhh Confidence 99999999999999999842 1256788999999999999999999999 Q ss_pred hhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
315 QADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 315 ~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ++|.|+++++||++++||+++++++..++- T Consensus 249 ~~d~i~~~~~yg~~v~~~~~~v~it~~a~~ 278 (278) T protein:vir:80 249 KLTKFNADQHYAVALVDETKAVKVVPVAGN 278 (278) T ss_pred ccceeeeeeEEEEEEEcCcceEEEeeccCC Confidence 999999999999999999999999998888 No 30 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=100.00 E-value=6.4e-43 Score=251.90 Aligned_cols=265 Identities=16% Similarity=0.167 Sum_probs=220.2 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceee-ec--ccccEEEEeecCc-ceeeeeeCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQR-QI--SSGKSAQFPVIGR-TKAAYLQPG 76 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~-~i--~~G~tv~i~~iG~-~t~~~~~~g 76 (344) |||.++ +.+ | -+..|+|+..|.+.|.+..++.++.... ++ +.|++|+||+++. ..+.+|..| T Consensus 1 ma~~~T------~~~------d--~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~~g 66 (274) T protein:vir:96 1 MAQGTT------KVS------N--LIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEG 66 (274) T ss_pred CCcccc------chh------h--hhhhHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCccccCCC Confidence 999764 222 2 2567999999999999999999887664 23 3599999999875 477899999 Q ss_pred CCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Q lcl|NC_015719. 77 ESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIA 156 (344) Q Consensus 77 ~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (344) +.++. .+++.++.+++|++ .++.|.|+|++..++..|++++++++++++|++++|+.++..+.++.. 
T Consensus 67 ~~i~~--~~it~~~~~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~~---------- 133 (274) T protein:vir:96 67 EKIPV--DQIGTSKREAKVRK-IGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL---------- 133 (274) T ss_pred CcCch--hhcccceeEEEEEe-eeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC---------- Confidence 98875 47899999999988 478899999999999999999999999999999999999876542110 Q ss_pred cccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccc--hhhhhcccccccccc Q lcl|NC_015719. 157 GLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAAL--MPNAANYAALIDPER 234 (344) Q Consensus 157 ~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~--~~~~~~~~~~~~~~~ 234 (344) ... ++ ...++.|++|..+|+++++ .+||++|+|++|..|+++. +|+.....++..+++ T Consensus 134 --------~~~--~~--------~~~~d~i~dA~~~l~d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~ 193 (274) T protein:vir:96 134 --------TVE--AD--------ITKLDGLQTAIDKFNDEDL--EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVK 193 (274) T ss_pred --------CcC--cc--------cccHHHHHHHHHHhcccCC--CceEEEeCHHHHHHHHhcccccccccccccccceee Confidence 000 00 0127889999999999887 6799999999999998874 677766667778999 Q ss_pred ceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchh Q lcl|NC_015719. 235 GSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEY 314 (344) Q Consensus 235 G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~ 314 (344) |.|++++||+|++||++|.+. +++|++.|++++..+++++|.+||+++ T Consensus 194 g~ig~~~G~~Vi~s~~~p~~t--------------------------------~~l~~~gA~~~~~~~~~~vE~~Rd~~~ 241 (274) T protein:vir:96 194 GAFGEALGAVIVRSNKLNKGE--------------------------------ALLAKKGAVKLITKRDFFLEKDRDASR 241 (274) T ss_pred cccceecCeeEEEcCCCCcce--------------------------------EEEEeCcceeeeecCCcccccccchhh Confidence 999999999999999998421 367789999999999999999999999 Q ss_pred hhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
315 QADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 315 ~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ++|.|.++++||++++||+++++++..+.= T Consensus 242 ~~d~i~~~~~yg~~~~~~~~vv~~t~~~~~ 271 (274) T protein:vir:96 242 KSTALYSDKHYVAYLYDESKVVKITKGAGD 271 (274) T ss_pred cccEEEEeeEEEEEEEcCccEEEEEcCccc Confidence 999999999999999999999998765555 No 31 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=100.00 E-value=2.1e-42 Score=249.03 Aligned_cols=265 Identities=18% Similarity=0.161 Sum_probs=221.7 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceee-ec--ccccEEEEeecCc-ceeeeeeCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQR-QI--SSGKSAQFPVIGR-TKAAYLQPG 76 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~-~i--~~G~tv~i~~iG~-~t~~~~~~g 76 (344) |||.+ |+.... +..|+|+..|.+.+.+..++.++.... ++ +.|++|+||+++. ..+++|..| T Consensus 1 ma~~~------T~~~~~--------iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~eg 66 (274) T protein:vir:93 1 MPQGI------TKTSNQ--------IIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEG 66 (274) T ss_pred CCccc------eehhhe--------echHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCcccccCC Confidence 99965 444332 577999999999999999998888763 33 3599999999765 367899999 Q ss_pred CCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Q lcl|NC_015719. 77 ESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIA 156 (344) Q Consensus 77 ~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (344) +.++. +.++.++.+++|++. ++.|.|+|++..++..|++++++++++++|++.+|+.++..+.+... 
T Consensus 67 ~~i~~--~~it~~~~~~~i~~~-~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~---------- 133 (274) T protein:vir:93 67 EKIPT--DILETKKREAKIRKI-AKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL---------- 133 (274) T ss_pred Ccccc--cccccceeEEEeeee-cccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc---------- Confidence 99875 478999999999884 68999999999999999999999999999999999999876643110 Q ss_pred cccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccc--hhhhhcccccccccc Q lcl|NC_015719. 157 GLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAAL--MPNAANYAALIDPER 234 (344) Q Consensus 157 ~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~--~~~~~~~~~~~~~~~ 234 (344) .+. ++.+ .++.|++|..+|+++++ ++||++|+|++|+.|+++. +|+.....++..+++ T Consensus 134 --------~~~--~~~~--------~~d~i~dA~~~l~d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~ 193 (274) T protein:vir:93 134 --------TVN--ADIT--------KLNGLQSAIDKFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVK 193 (274) T ss_pred --------ccc--cccc--------CHHHHHHHHHHhhhccC--CccEEEeCHHHHHHHHhhhhhcccccccccccceee Confidence 000 0011 26778899999999876 6799999999999999885 667766667778999 Q ss_pred ceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchh Q lcl|NC_015719. 235 GSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEY 314 (344) Q Consensus 235 G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~ 314 (344) |.|+++.||+|++||++|.+ .+++|++.|++++..+++++|..||+++ T Consensus 194 G~ig~~~G~~Vi~s~~~p~~--------------------------------t~~l~~~gai~~~~~~~~~vE~~Rd~~~ 241 (274) T protein:vir:93 194 GAFGEALGAIIVRTNKLEAG--------------------------------TAILAKKGAVKLILKRDFFLEVARDAST 241 (274) T ss_pred cccceecCeeEEEcCCCCcc--------------------------------eEEEEeCCeEEEEecCCcccccccchhh Confidence 99999999999999999842 1367788999999999999999999999 Q ss_pred hhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
315 QADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 315 ~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ++|.|+++++||++++||+++++++.++++ T Consensus 242 ~~d~i~~~~~y~~~~~~~~~~v~~t~~~~s 271 (274) T protein:vir:93 242 KTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) T ss_pred cccEEEEEEEEEEEEEcCCceEEEeeCccc Confidence 999999999999999999999999988888 No 32 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=100.00 E-value=1.1e-41 Score=245.24 Aligned_cols=297 Identities=17% Similarity=0.113 Sum_probs=208.8 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceee---ec-ccccEEEEeecCcceeeeeeCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQR---QI-SSGKSAQFPVIGRTKAAYLQPG 76 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~---~i-~~G~tv~i~~iG~~t~~~~~~g 76 (344) ||..+. +. |=.|+|+.++++.|++..++.++++.. ++ +.|+|||||+.+..+++++. T Consensus 1 m~~~~N--~~---------------ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp~~~~v~dg~-- 61 (418) T protein:vir:10 1 MAVQDN--NL---------------LTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKLPYRVKSASGR-- 61 (418) T ss_pred CCcccc--cc---------------ccHHHHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEeeCCceeecccC-- Confidence 776442 10 125799999999999999999988762 33 35999999999999998765 Q ss_pred CCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Q lcl|NC_015719. 77 ESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIA 156 (344) Q Consensus 77 ~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (344) .+. .+++..++.+|+||+.+|++|.|+|.|++|...|++++++++++++||+.+|+.++..+....+. 
T Consensus 62 -~~~--~~~~te~~v~l~id~~k~~~~~itD~e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l~~~a~~~--------- 129 (418) T protein:vir:10 62 -TLV--KQPMVDQTIPFKIAYQEHVGLEYTVKDKTLDIMQFSERYLKSGMVQIANQIDRSLALTLKKAFHS--------- 129 (418) T ss_pred -Ccc--ccccccceEEEEEecccccceeechHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc--------- Confidence 343 35788899999999999999999999999999999999999999999999999998655432111 Q ss_pred cccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCC-CEEEeCHHHHHHHhccchhhhhccccccccccc Q lcl|NC_015719. 157 GLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPAND-RTFYTTPDVYSAILAALMPNAANYAALIDPERG 235 (344) Q Consensus 157 ~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~g-R~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G 235 (344) .++.+ +.+ ..|+.|++++.+|++++||.+| ||+||+|++|+.|++++++......++..+++| T Consensus 130 ---------~gt~g--t~~-----~~~~~i~~a~~~Ld~~~VP~~G~R~lVv~P~~~~~L~~~~~~~~~~~~~~~~lr~G 193 (418) T protein:vir:10 130 ---------SGTPG--VRP-----GAFIDFANAGAKQTTYAVPQDGMRHAVLDPFTCASLSDEVTKLFKESMVEQAYKMG 193 (418) T ss_pred ---------cccCC--cCc-----chHHHHHHHHHHHHhcCCCCCCceEEEeCHHHHHHHhhhccccccccccchhhhee Confidence 01111 111 1278899999999999999985 999999999999999888766555666679999 Q ss_pred eeEEEeCeEEEEecccccccccc-----cccccccccccc-------cccccccccc-ccc------------------- Q lcl|NC_015719. 236 SIRNVMGFEVVEVPHLTAGGAGD-----DRPEEGTDASNQ-------KHAFPATGGK-VNK------------------- 283 (344) Q Consensus 236 ~Vg~i~G~~V~~sn~lp~~~~~~-----~~~~~~~~~~~~-------~~~~~~~~~~-~~~------------------- 283 (344) .||+++||+||+||++|....+. +..+....+.+. .....-..|+ +++ T Consensus 194 ~IG~i~GF~V~~S~nip~~tag~~~~t~~v~ga~~~~~~~~~~~~t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~~~~~~~ 273 (418) T protein:vir:10 194 YRGNVAAYEVYESQNLPKHTVGDHGGTPLVNGTVVNGDTVGFDGGTASTTGFLKAGDVITFGGVFGVNPQNYETTGLLQE 273 (418) T ss_pred eeeeeeceEEEEecCCCcccccccccceeeecccccceeEEEeecceeeccceeeccEEEECceeecccccccccccceE Confidence 99999999999999999633221 111111000000 0000000011 000 Q ss_pred ----------------------------------------------------------------cceeEEEecHHHHhhh Q lcl|NC_015719. 
284 ----------------------------------------------------------------ENVVGLFQHRSAVGTV 299 (344) Q Consensus 284 ----------------------------------------------------------------~~~~gl~~~~~Av~~~ 299 (344) .-..-|+||++|+..+ T Consensus 274 f~V~~~~~~~~~~~~tv~i~p~~~~~~~~~~~~~~~~~~~~~~~~v~a~~a~~~~it~~~~a~~~~~~nl~f~~~a~~l~ 353 (418) T protein:vir:10 274 FVVLEDVDTDAGGAGSIKISPSLNDGTATINNENGDPVSLTAYQNVTALPADNAPITVLGAANTTYEQNYLFHRDAIALA 353 (418) T ss_pred EEEEeeccccccCcceeEeccccccccccccccccccccccCCCcccccccCcceeeeecccccceeeeeeeecceEEEE Confidence 0011289999877544 Q ss_pred hhh--------------------eeeeeeeecchhhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 300 KLK--------------------DLALERARRAEYQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 300 ~~~--------------------~~~~e~~~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ... .+++-.++|.+..-+.++--..||.+.+|||.++.|.=++-+ T Consensus 354 ~~~l~~p~g~~~~~~~~~~~~G~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~~~~g~~~~ 418 (418) T protein:vir:10 354 MIDLELPQSAVIKSRAADPETGLSLTLTGAYDINEQSEIHRIDAVWGADMIYGELALRLWGAASS 418 (418) T ss_pred EeeccCCCCCCcceEEEeccCCeEEEEEEcccccccceEEEEEeecCceeecccceEEEEeecCC Confidence 322 222333466666777777777899999999997555444433 No 33 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=100.00 E-value=7.6e-42 Score=246.01 Aligned_cols=265 Identities=17% Similarity=0.161 Sum_probs=221.0 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceee-ec--ccccEEEEeecCcc-eeeeeeCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQR-QI--SSGKSAQFPVIGRT-KAAYLQPG 76 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~-~i--~~G~tv~i~~iG~~-t~~~~~~g 76 (344) |||.++ +... -+..|+|+.+|.+.|.+..++.++.... ++ +.|++|+||..+.. 
.+.+|..| T Consensus 1 ma~~~T------~l~d--------~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~~~~~g 66 (274) T protein:vir:12 1 MAQGLT------KTSN--------QIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEG 66 (274) T ss_pred CCccee------ehhh--------hhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCC Confidence 999663 3222 3677999999999999888998888874 33 45999999996643 56789999 Q ss_pred CCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Q lcl|NC_015719. 77 ESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIA 156 (344) Q Consensus 77 ~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (344) +.++. +.++.++.+++|++ .++.|.|+|++..++..|++++++++++++|++.+|+.++..+.++.. T Consensus 67 ~~i~~--~~lt~~~~~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~---------- 133 (274) T protein:vir:12 67 EKIPT--DILETKKREAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL---------- 133 (274) T ss_pred Cccch--hhcccceeeEEeee-ecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc---------- Confidence 98865 47899999999988 588999999999999999999999999999999999999876643111 Q ss_pred cccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccc--hhhhhcccccccccc Q lcl|NC_015719. 157 GLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAAL--MPNAANYAALIDPER 234 (344) Q Consensus 157 ~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~--~~~~~~~~~~~~~~~ 234 (344) ... ... ..++.|++|..+|++++. .+||++|+|++|+.|+++. 
+|+.....+...+++ T Consensus 134 --------~~~--~~a--------~~~d~i~dA~~~lgd~~~--~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g~~~~~~ 193 (274) T protein:vir:12 134 --------TVN--ADI--------TKLNGLQSAIDKFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVK 193 (274) T ss_pred --------ccc--ccc--------cCHHHHHHHHHHhccccc--cccEEEeCHHHHHHHHhhhhhhccccccccccceec Confidence 000 000 127788899999998875 7899999999999999985 678776667778999 Q ss_pred ceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchh Q lcl|NC_015719. 235 GSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEY 314 (344) Q Consensus 235 G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~ 314 (344) |.||++.||+||+||++|.+ .+++|++.|++++..+++++|..||+++ T Consensus 194 G~ig~~~G~~Vi~s~~~p~~--------------------------------t~~l~~~gA~~~~~~~~~~vE~~Rd~~~ 241 (274) T protein:vir:12 194 GAFGEALGAIIVRSNKLEAG--------------------------------TAILAKKGAVKLILKRDFFLEVARDAST 241 (274) T ss_pred ccceeecCeeEEEeCCCCcc--------------------------------eEEEEeccceeeeecCCceeccccchhh Confidence 99999999999999999842 1256788899998889999999999999 Q ss_pred hhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 315 QADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 315 ~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +.|.|.+++.||++++||+.+++++...++ T Consensus 242 ~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~ 271 (274) T protein:vir:12 242 KTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) T ss_pred cccEEEeeeEEEEEEEcCCceEEEEcCCcc Confidence 999999999999999999999999977777 No 34 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=100.00 E-value=7.9e-42 Score=245.93 Aligned_cols=265 Identities=17% Similarity=0.147 Sum_probs=219.8 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCcee-eecc--cccEEEEeecCcc-eeeeeeCC Q lcl|NC_015719. 
1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQ-RQIS--SGKSAQFPVIGRT-KAAYLQPG 76 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~-~~i~--~G~tv~i~~iG~~-t~~~~~~g 76 (344) |||.+ |+... -+-.|+|+.+|.+.+.+..++.++... +++. .|++|+||..... .+.+|..| T Consensus 1 m~~~~------T~l~d--------~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g 66 (274) T protein:vir:96 1 MAQGM------TKLTN--------QIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEG 66 (274) T ss_pred CCcce------eehhh--------eechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCC Confidence 99965 33322 145799999999999999999988654 3444 5999999997653 56789999 Q ss_pred CCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Q lcl|NC_015719. 77 ESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIA 156 (344) Q Consensus 77 ~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (344) +.++. +.++.++.+++|++ .++.|.|+|++..++..|++++++++++++||+.+|+.++..+.++.. T Consensus 67 ~~i~~--~~lt~~~~~~~i~~-~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~---------- 133 (274) T protein:vir:96 67 EKIPT--DILETKKREAKIRK-IAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKL---------- 133 (274) T ss_pred Cccch--hhcccceeEEEeee-eecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccc---------- Confidence 98875 47899999999988 488999999999999999999999999999999999999866643110 Q ss_pred cccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccc--hhhhhcccccccccc Q lcl|NC_015719. 157 GLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAAL--MPNAANYAALIDPER 234 (344) Q Consensus 157 ~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~--~~~~~~~~~~~~~~~ 234 (344) .+.. ++ ..++.|.+|..+|++.+. 
.+||++|+|++|+.|++++ +|+..+..+...+++ T Consensus 134 --------~~~~-----~~-----~~~d~i~~A~~~lgd~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~ 193 (274) T protein:vir:96 134 --------TVEA-----DI-----TKLTGLQTAIDKFNDEDL--EPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVK 193 (274) T ss_pred --------cccc-----cc-----cCHHHHHHHHHHhccccc--cccEEEeCHHHHHHHHhhccccccccccccccceec Confidence 0000 00 127788899999998875 6899999999999999986 677776667788999 Q ss_pred ceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchh Q lcl|NC_015719. 235 GSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEY 314 (344) Q Consensus 235 G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~ 314 (344) |.||++.||+||+||++|.. .+++|++.|++++..+++++|..||+++ T Consensus 194 G~ig~~~G~~Vi~s~~~~~~--------------------------------t~~l~~~gA~~~~~~~~~~vE~~Rd~~~ 241 (274) T protein:vir:96 194 GAFGEALGAVIVRSNKLEAG--------------------------------TAILAKKGAVKLITKRDFFLETDRDPST 241 (274) T ss_pred cccceecCeEEEEeCCCCCc--------------------------------eEEEEeccceeeeecCCccccccccccc Confidence 99999999999999998732 1256788899998889999999999999 Q ss_pred hhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 315 QADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 315 ~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +.|.+.++++||++++||+++++++...++ T Consensus 242 ~~d~i~~~~~y~~~~~~~~~~v~~tk~~~~ 271 (274) T protein:vir:96 242 KTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) T ss_pred ccCEEEEeEEEEEEEEcCCcEEEEEcCCcc Confidence 999999999999999999999999988888 No 35 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=100.00 E-value=7.9e-42 Score=245.93 Aligned_cols=265 Identities=17% Similarity=0.147 Sum_probs=219.8 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCcee-eecc--cccEEEEeecCcc-eeeeeeCC Q lcl|NC_015719. 
1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQ-RQIS--SGKSAQFPVIGRT-KAAYLQPG 76 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~-~~i~--~G~tv~i~~iG~~-t~~~~~~g 76 (344) |||.+ |+... -+-.|+|+.+|.+.+.+..++.++... +++. .|++|+||..... .+.+|..| T Consensus 1 m~~~~------T~l~d--------~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g 66 (274) T protein:vir:95 1 MAQGM------TKLTN--------QIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEG 66 (274) T ss_pred CCcce------eehhh--------eechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCC Confidence 99965 33322 145799999999999999999988654 3444 5999999997653 56789999 Q ss_pred CCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Q lcl|NC_015719. 77 ESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIA 156 (344) Q Consensus 77 ~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (344) +.++. +.++.++.+++|++ .++.|.|+|++..++..|++++++++++++||+.+|+.++..+.++.. T Consensus 67 ~~i~~--~~lt~~~~~~~i~~-~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~---------- 133 (274) T protein:vir:95 67 EKIPT--DILETKKREAKIRK-IAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKL---------- 133 (274) T ss_pred Cccch--hhcccceeEEEeee-eecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccc---------- Confidence 98875 47899999999988 488999999999999999999999999999999999999866643110 Q ss_pred cccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccc--hhhhhcccccccccc Q lcl|NC_015719. 157 GLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAAL--MPNAANYAALIDPER 234 (344) Q Consensus 157 ~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~--~~~~~~~~~~~~~~~ 234 (344) .+.. ++ ..++.|.+|..+|++.+. 
.+||++|+|++|+.|++++ +|+..+..+...+++ T Consensus 134 --------~~~~-----~~-----~~~d~i~~A~~~lgd~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~ 193 (274) T protein:vir:95 134 --------TVEA-----DI-----TKLTGLQTAIDKFNDEDL--EPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVK 193 (274) T ss_pred --------cccc-----cc-----cCHHHHHHHHHHhccccc--cccEEEeCHHHHHHHHhhccccccccccccccceec Confidence 0000 00 127788899999998875 6899999999999999986 677776667788999 Q ss_pred ceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchh Q lcl|NC_015719. 235 GSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEY 314 (344) Q Consensus 235 G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~ 314 (344) |.||++.||+||+||++|.. .+++|++.|++++..+++++|..||+++ T Consensus 194 G~ig~~~G~~Vi~s~~~~~~--------------------------------t~~l~~~gA~~~~~~~~~~vE~~Rd~~~ 241 (274) T protein:vir:95 194 GAFGEALGAVIVRSNKLEAG--------------------------------TAILAKKGAVKLITKRDFFLETDRDPST 241 (274) T ss_pred cccceecCeEEEEeCCCCCc--------------------------------eEEEEeccceeeeecCCccccccccccc Confidence 99999999999999998732 1256788899998889999999999999 Q ss_pred hhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 315 QADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 315 ~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +.|.+.++++||++++||+++++++...++ T Consensus 242 ~~d~i~~~~~y~~~~~~~~~~v~~tk~~~~ 271 (274) T protein:vir:95 242 KTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) T ss_pred ccCEEEEeEEEEEEEEcCCcEEEEEcCCcc Confidence 999999999999999999999999988888 No 36 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=100.00 E-value=1.2e-41 Score=244.84 Aligned_cols=265 Identities=17% Similarity=0.164 Sum_probs=221.0 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceee-ec--ccccEEEEeecCcc-eeeeeeCC Q lcl|NC_015719. 
1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQR-QI--SSGKSAQFPVIGRT-KAAYLQPG 76 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~-~i--~~G~tv~i~~iG~~-t~~~~~~g 76 (344) |||.+ |+... -+..|+|+..|.+.+.+..++.++.... ++ +.|++|+||.++.. .+.+|..| T Consensus 1 ma~~~------T~~~d--------~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g 66 (274) T protein:vir:97 1 MPQGL------TKTSD--------QIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEG 66 (274) T ss_pred CCccc------eehhh--------eechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCC Confidence 99955 33333 2677999999999999999999888764 33 35999999997643 56789999 Q ss_pred CCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Q lcl|NC_015719. 77 ESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIA 156 (344) Q Consensus 77 ~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (344) +.++. +.++.++.+++|++. ++.|.|+|++..++..|++++++++++++|++.+|+.++..+.+... T Consensus 67 ~~i~~--~~lt~~~~~~~i~~~-~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~---------- 133 (274) T protein:vir:97 67 EKIPT--DILETKKREAKIRKI-AKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL---------- 133 (274) T ss_pred Ccccc--cccccceeEEEeeee-cceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCc---------- Confidence 99875 478999999999885 58899999999999999999999999999999999999876643110 Q ss_pred cccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccc--hhhhhcccccccccc Q lcl|NC_015719. 157 GLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAAL--MPNAANYAALIDPER 234 (344) Q Consensus 157 ~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~--~~~~~~~~~~~~~~~ 234 (344) .+. ++. ..++.|++|..+|++++. .+||++|+|++|..|+++. 
+|+...-.++..+++ T Consensus 134 --------~~~--~~~--------~~~d~i~dA~~~l~d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~ 193 (274) T protein:vir:97 134 --------TVN--ADI--------TKLNGLQSAIDKFNDEDL--EPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVK 193 (274) T ss_pred --------ccc--ccc--------cCHHHHHHHHHHhhccCC--CceEEEeCHHHHHHHHhhhhhhccccCcccccceec Confidence 000 001 116788899999999876 5799999999999999885 677776667778899 Q ss_pred ceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchh Q lcl|NC_015719. 235 GSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEY 314 (344) Q Consensus 235 G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~ 314 (344) |.|+++.||+|++||++|.+ .+++|++.|++.+..+++.+|..||++. T Consensus 194 G~ig~~~G~~Vi~s~~~p~~--------------------------------t~~l~~~gA~~~~~~~~~~vE~~Rd~~~ 241 (274) T protein:vir:97 194 GAFGEALGAIIVRTNKLEAG--------------------------------TAILAKKGAVKLILKRDFFLEVARDAST 241 (274) T ss_pred cccceecCeeEEEcCCCCcc--------------------------------eEEEEeCcceEeeecCCceeccccchhh Confidence 99999999999999999832 1367788999999999999999999999 Q ss_pred hhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 315 QADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 315 ~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +.|.|.++++||+++++|+.+++++.+..+ T Consensus 242 ~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~ 271 (274) T protein:vir:97 242 KTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) T ss_pred cccEEEEEEEEEEEEEcCCceEEEecCccc Confidence 999999999999999999999999987777 No 37 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=100.00 E-value=1.2e-41 Score=244.84 Aligned_cols=265 Identities=17% Similarity=0.164 Sum_probs=221.0 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceee-ec--ccccEEEEeecCcc-eeeeeeCC Q lcl|NC_015719. 
1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQR-QI--SSGKSAQFPVIGRT-KAAYLQPG 76 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~-~i--~~G~tv~i~~iG~~-t~~~~~~g 76 (344) |||.+ |+... -+..|+|+..|.+.+.+..++.++.... ++ +.|++|+||.++.. .+.+|..| T Consensus 1 ma~~~------T~~~d--------~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g 66 (274) T protein:vir:94 1 MPQGL------TKTSD--------QIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEG 66 (274) T ss_pred CCccc------eehhh--------eechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCC Confidence 99955 33333 2677999999999999999999888764 33 35999999997643 56789999 Q ss_pred CCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Q lcl|NC_015719. 77 ESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIA 156 (344) Q Consensus 77 ~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (344) +.++. +.++.++.+++|++. ++.|.|+|++..++..|++++++++++++|++.+|+.++..+.+... T Consensus 67 ~~i~~--~~lt~~~~~~~i~~~-~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~---------- 133 (274) T protein:vir:94 67 EKIPT--DILETKKREAKIRKI-AKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL---------- 133 (274) T ss_pred Ccccc--cccccceeEEEeeee-cceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCc---------- Confidence 99875 478999999999885 58899999999999999999999999999999999999876643110 Q ss_pred cccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccc--hhhhhcccccccccc Q lcl|NC_015719. 157 GLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAAL--MPNAANYAALIDPER 234 (344) Q Consensus 157 ~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~--~~~~~~~~~~~~~~~ 234 (344) .+. ++. ..++.|++|..+|++++. .+||++|+|++|..|+++. 
+|+...-.++..+++ T Consensus 134 --------~~~--~~~--------~~~d~i~dA~~~l~d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~ 193 (274) T protein:vir:94 134 --------TVN--ADI--------TKLNGLQSAIDKFNDEDL--EPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVK 193 (274) T ss_pred --------ccc--ccc--------cCHHHHHHHHHHhhccCC--CceEEEeCHHHHHHHHhhhhhhccccCcccccceec Confidence 000 001 116788899999999876 5799999999999999885 677776667778899 Q ss_pred ceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchh Q lcl|NC_015719. 235 GSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEY 314 (344) Q Consensus 235 G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~ 314 (344) |.|+++.||+|++||++|.+ .+++|++.|++.+..+++.+|..||++. T Consensus 194 G~ig~~~G~~Vi~s~~~p~~--------------------------------t~~l~~~gA~~~~~~~~~~vE~~Rd~~~ 241 (274) T protein:vir:94 194 GAFGEALGAIIVRTNKLEAG--------------------------------TAILAKKGAVKLILKRDFFLEVARDAST 241 (274) T ss_pred cccceecCeeEEEcCCCCcc--------------------------------eEEEEeCcceEeeecCCceeccccchhh Confidence 99999999999999999832 1367788999999999999999999999 Q ss_pred hhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 315 QADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 315 ~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +.|.|.++++||+++++|+.+++++.+..+ T Consensus 242 ~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~ 271 (274) T protein:vir:94 242 KTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) T ss_pred cccEEEEEEEEEEEEEcCCceEEEecCccc Confidence 999999999999999999999999987777 No 38 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=100.00 E-value=2.9e-41 Score=242.81 Aligned_cols=266 Identities=17% Similarity=0.153 Sum_probs=218.4 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceee-ecc--cccEEEEeecCcc-eeeeeeCC Q lcl|NC_015719. 
1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQR-QIS--SGKSAQFPVIGRT-KAAYLQPG 76 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~-~i~--~G~tv~i~~iG~~-t~~~~~~g 76 (344) ||..+. |+.. +-+..|+|+..|.+.+.+..++.++.... ++. .|++|+||..... .+.+|..| T Consensus 1 ~~~~~~-----T~l~--------d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g 67 (275) T protein:vir:96 1 MALENM-----TKLA--------NMVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAKVVPEG 67 (275) T ss_pred CCCccc-----chhh--------hhhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeeccCCccccccCC Confidence 555442 2222 23578999999999999999999998653 343 5999999997654 56789999 Q ss_pred CCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Q lcl|NC_015719. 77 ESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIA 156 (344) Q Consensus 77 ~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (344) +.++. +.++.++.+.+|.+ .++.|.|+|++..++..|++.+++++++++||+++|+.++..+.++.. T Consensus 68 ~~i~~--~~lt~~~~~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~~---------- 134 (275) T protein:vir:96 68 EEIPI--DLIETKKRQATIRK-IGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGATL---------- 134 (275) T ss_pred CCcch--hhcccceeeEEeeh-hcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc---------- Confidence 99875 46889999999966 589999999999999999999999999999999999999876643110 Q ss_pred cccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccc--hhhhhcccccccccc Q lcl|NC_015719. 157 GLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAAL--MPNAANYAALIDPER 234 (344) Q Consensus 157 ~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~--~~~~~~~~~~~~~~~ 234 (344) ... ++ + ..++.|++|..+|.+.+. .+||++|+|++|..|+++. 
+|+..+..++..+++ T Consensus 135 --------~~~--~~---~-----~~~d~i~dA~~~lgd~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~ 194 (275) T protein:vir:96 135 --------KVE--AD---I-----TKLAGLQTAIDKFNDEDL--EPMVLFVNPLDAGKLRASATDNFTRATLLGDNVIVK 194 (275) T ss_pred --------ccc--cc---c-----cCHHHHHHHHHHhccccC--CccEEEeCHHHHHHHHhcccccccccccccccceec Confidence 000 00 0 127888899999987765 6799999999999998874 788777777788999 Q ss_pred ceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchh Q lcl|NC_015719. 235 GSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEY 314 (344) Q Consensus 235 G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~ 314 (344) |.|+++.||+||+||++|.+ .+++|++.|++++..+++++|..||+++ T Consensus 195 G~ig~~~G~~Vi~s~~~p~~--------------------------------t~~i~~~gA~~~~~~~~~~vE~~Rd~~~ 242 (275) T protein:vir:96 195 GAFGEALGAIIVRSNKIKEG--------------------------------EAILAKRGAVKLITKRDFFLETERHASH 242 (275) T ss_pred cccceecCeeEEEeCCCCcc--------------------------------eEEEEeccceeeeecCCcccccccchhh Confidence 99999999999999999842 1366788899999999999999999999 Q ss_pred hhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 315 QADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 315 ~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +.|.|+++++||++++||++++++++++.- T Consensus 243 ~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~ 272 (275) T protein:vir:96 243 KSTALFSDKHYVAYLYDESKVVKITKSASG 272 (275) T ss_pred cCcEEEEeEEEEEEEEcCccEEEEEecccc Confidence 999999999999999999999999887666 No 39 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=100.00 E-value=3.7e-41 Score=242.26 Aligned_cols=267 Identities=16% Similarity=0.132 Sum_probs=220.9 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceee-ecc--cccEEEEeecCcc-eeeeeeCC Q lcl|NC_015719. 
1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQR-QIS--SGKSAQFPVIGRT-KAAYLQPG 76 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~-~i~--~G~tv~i~~iG~~-t~~~~~~g 76 (344) |||.++ +.. +-+..|+|+..|.+.|.+..++.++.... ++. .|++|+||..+.. ...++..| T Consensus 1 ma~~~T------~~~--------d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~~~~eg 66 (272) T protein:vir:36 1 MSKQKT------TLA--------DLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAEG 66 (272) T ss_pred CCCcce------ehh--------hhhchHHHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeeccCccccccCCC Confidence 998553 222 23678999999999999999999887663 343 4999999997665 34578889 Q ss_pred CCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Q lcl|NC_015719. 77 ESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIA 156 (344) Q Consensus 77 ~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (344) +.++. +.++.++.+++|.+. ...|.|+|++..++..|++++++++++++||+++|+.++..+.+.. T Consensus 67 ~~i~~--~~lt~~~~~~~i~~~-~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~~----------- 132 (272) T protein:vir:36 67 GEISL--DKIGTTTKSVTIKKA-AKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTTS----------- 132 (272) T ss_pred CccCh--hhcCCcceeEeeehh-hccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc----------- Confidence 98875 468899999999886 5789999999999999999999999999999999999986553211 Q ss_pred cccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhh-ccccccccccc Q lcl|NC_015719. 157 GLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAA-NYAALIDPERG 235 (344) Q Consensus 157 ~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~-~~~~~~~~~~G 235 (344) .... ....++.|.+|..+|.+.++| .||++|+|++|+.|+++.++... 
++.+...+++| T Consensus 133 -------~~~~-----------~~~~~d~i~~A~~~lgd~~~~--~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G 192 (272) T protein:vir:36 133 -------QTVS-----------TKANVDGVQAALDIFNDEDAQ--AYVLIVNPKDAAKIRKDANAKNIGSEVGANALING 192 (272) T ss_pred -------cccc-----------ccccHHHHHHHHHHhhhcCCC--ceEEEEcHHHHHHHhcccccccccccccccceeee Confidence 0000 112367889999999999986 58999999999999999988765 46677789999 Q ss_pred eeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchhh Q lcl|NC_015719. 236 SIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEYQ 315 (344) Q Consensus 236 ~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~~ 315 (344) .|++++|++|++||++|.+... ...++|.+.|++++..+++++|..|+++++ T Consensus 193 ~ig~~~G~~Vv~s~~~p~~~~~----------------------------~~~~~~~~gA~~~~~~~~~~vE~~R~~~~~ 244 (272) T protein:vir:36 193 TYADVLGAQIVRSKKLAEGSAL----------------------------MFKIVSNSPALKLVLKRGVQVETDRDIVTK 244 (272) T ss_pred ccceecCeeEEEeCCCCCCcee----------------------------EEEEEecccceeeeecCCcccccccchhhc Confidence 9999999999999999964321 123567888999998999999999999999 Q ss_pred hhhhhhhhhhcCceeccccEEEEEecCC Q lcl|NC_015719. 316 ADQIIAKYAMGHGGLRPESAGALVFKAG 343 (344) Q Consensus 316 ~d~i~~~~~~G~~v~Rp~~~~~l~~~~~ 343 (344) +|.|+++++||++++||+++++++.+.= T Consensus 245 ~d~i~~~~~y~~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 245 TTVITADEHYAAYLYDLTKVVNITFTGV 272 (272) T ss_pred CcEEEEEEEEEEEEEcCccEEEEeecCC Confidence 9999999999999999999999999988 No 40 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=100.00 E-value=1.4e-39 Score=233.63 Aligned_cols=287 Identities=11% Similarity=0.047 Sum_probs=185.7 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceee---ecc--cccEEEEeecCcceeeeeeC Q lcl|NC_015719. 
1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQR---QIS--SGKSAQFPVIGRTKAAYLQP 75 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~---~i~--~G~tv~i~~iG~~t~~~~~~ 75 (344) |||.. +-.|+|+.++++.|++..++.++++.. ++. .|++|||++.+..++.+|++ T Consensus 1 Ma~~~--------------------~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~~~~~~~~~ 60 (392) T protein:vir:99 1 MANAF--------------------SKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKL 60 (392) T ss_pred Ccccc--------------------ccHHHHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeecccccceeeec Confidence 77621 345899999999999999999998653 564 59999999999999999875 Q ss_pred CC---CCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|NC_015719. 76 GE---SLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVN 152 (344) Q Consensus 76 g~---~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~ 152 (344) .. .-+.+.+++.+++++++||+.+|++|.|+|.|+.|...|++.+++++++++||+++|+.++..+....... T Consensus 61 ~~~~~~~~~~~~~~~~~~~~~~id~~k~~~~~i~d~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~~~a~~~~---- 136 (392) T protein:vir:99 61 RGAGAERNLTVSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEA---- 136 (392) T ss_pred cccccCCcccccccccceEEEEEeeeeecceeechHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccc---- Confidence 22 11223357888999999999999999999999999999999999999999999999999987654311100 Q ss_pred cccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccc--- Q lcl|NC_015719. 153 ENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAAL--- 229 (344) Q Consensus 153 ~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~--- 229 (344) ........ ...+|+.|++|+++|+|++||. 
|||+||+|++|+.|+++++|.+.++.++ T Consensus 137 --------------~~~~~~~~----~~~~~~~i~~a~~~L~~~~vP~-~R~~vv~p~~~~~l~~~~~~~~~~~~g~~~~ 197 (392) T protein:vir:99 137 --------------AGAVHEVA----PDEFFKGVNGARRALNELYIPQ-GRVLVVGTAVTEQILNDDRFIKYESQGQSAV 197 (392) T ss_pred --------------cccccccC----hhhhHHHHHHHHHHHhhcCCCC-CCEEEEcHHHHHHHhcccceeecccccchhh Confidence 00011111 2346889999999999999996 8999999999999999999998876554 Q ss_pred cccccceeEEEeCeEEEEeccccccccccccccc-cccccccccccccccccccccceeEEEecHHHHhhhhhheeeeee Q lcl|NC_015719. 230 IDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEE-GTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALER 308 (344) Q Consensus 230 ~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~ 308 (344) ..+++|.||+++||+||+|+++|......+.... ...... ...+..... +..+... .. .-...-. T Consensus 198 ~~l~~G~vg~i~G~~v~~s~~~~~~t~~a~~~~a~~~at~a--~v~~~~~~~-------~~s~s~~----~~-v~~~~~~ 263 (392) T protein:vir:99 198 SALQEARLGRIYGYEIVESTLIPHGDAYLYHPTAFIMATRA--PAPPMGAVR-------STAISGD----QR-IAMRWLV 263 (392) T ss_pred hhhhcceeeeeeeeEEEeecccccccceeeecccccccccc--ccccccccc-------eeEEecc----cc-eecceee Confidence 4589999999999999999999987543332111 110000 000000000 0000000 00 0001112 Q ss_pred eecchhhhhhhhhhhhhcCceeccccEEEEEec----CCC Q lcl|NC_015719. 309 ARRAEYQADQIIAKYAMGHGGLRPESAGALVFK----AGA 344 (344) Q Consensus 309 ~~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~----~~a 344 (344) .++.....|...-....|.+.+.-.+...+... 
..+ T Consensus 264 ~~~~t~~s~~~~v~~~~g~~~v~~~~~~~~~~~~~~~~~~ 303 (392) T protein:vir:99 264 DYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIP 303 (392) T ss_pred cccceeeccccccceeEEEEEEeeccccceeeeeeeeeec Confidence 223333333333233333333322111111000 000 No 41 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=100.00 E-value=2e-38 Score=227.28 Aligned_cols=298 Identities=14% Similarity=0.119 Sum_probs=210.0 Q ss_pred CCCccccccccccccccccccchhhhh-HHHHhhHHHHHHHHhhhhcCCceee---ec---ccccEEEEeecCcceeeee Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALF-LKVFGGEVLTAFARTSVTANRHMQR---QI---SSGKSAQFPVIGRTKAAYL 73 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~-~e~f~geV~~~f~~~s~~~~~~~~~---~i---~~G~tv~i~~iG~~t~~~~ 73 (344) |||.- .-| +|+|+.+.++.|++..++.++++.. ++ +.|+||+|++.++.++++| T Consensus 1 MAN~l-------------------lT~iP~iia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p~~~~v~d~ 61 (423) T protein:vir:35 1 MANNL-------------------ESNISQIVLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRPHQFKSERT 61 (423) T ss_pred Cccch-------------------hhhhHHHHHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeCCcceeecc Confidence 77621 124 5899999999999999999998763 44 3599999999999999999 Q ss_pred eCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|NC_015719. 74 QPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNE 153 (344) Q Consensus 74 ~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~ 153 (344) .++....-..+++...++.|+||+.+|++|.++|.|+.|..-|+ ..+.+.++++|++.+|+.++..+...+... 
T Consensus 62 ~~~~~~~~~~~~~~e~~v~l~id~~k~~a~~v~d~e~~l~i~~~-~~~l~~a~~ala~~vd~~l~~~l~~~a~~~----- 135 (423) T protein:vir:35 62 ETGDITGKDKNGLFSAKATGKVGKYITVAVEWTQIEEALKLNQL-DQILSPIHERMVTDLETELAHFMMNNGALS----- 135 (423) T ss_pred cCcCCCCccccccccceeeEEeccceeccceeCHHHHHhhHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccccc----- Confidence 76432222335777788999999999999999999999988888 467788899999999999987665422110 Q ss_pred ccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccch-hhhhcccccccc Q lcl|NC_015719. 154 NIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALM-PNAANYAALIDP 232 (344) Q Consensus 154 ~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~-~~~~~~~~~~~~ 232 (344) ++..+ +.+ ..|+.|.+++.+|++++||..|||+||+|++|..|+++++ |.+.+..++..+ T Consensus 136 ------------vgt~~--t~~-----~~~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~al 196 (423) T protein:vir:35 136 ------------LGSPN--TAI-----KKWADVAQTASFIKDIGIKTGENYAIMDPWSAQRLADAQSGLHAADQLVRTAW 196 (423) T ss_pred ------------ccccc--CCc-----chHHHHHHHHHHHHHhcCCcCCCEEEeCHHHHHHHhccccceeccccchhHHH Confidence 11111 111 1278899999999999999999999999999999997665 555555566779 Q ss_pred cccee-EEEeCeEEEEecccccccccccccccc----cc----------------------ccc-----ccccccc---- Q lcl|NC_015719. 233 ERGSI-RNVMGFEVVEVPHLTAGGAGDDRPEEG----TD----------------------ASN-----QKHAFPA---- 276 (344) Q Consensus 233 ~~G~V-g~i~G~~V~~sn~lp~~~~~~~~~~~~----~~----------------------~~~-----~~~~~~~---- 276 (344) ++|.| |+++||+||+||++|......+..... .. .+. +...|.+ T Consensus 197 r~g~i~G~i~GFdv~~Snnvp~~T~gt~~~~~~v~~a~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~t~aGv~~v 276 (423) T protein:vir:35 197 ENAQISGNFGGIRALMSNGLASRKQGDFDGAITVKTAPNVDYLSVKDSYQFTVALTGATPSKTGFLKAGDQLKFTSTHWL 276 (423) T ss_pred hhccceeeecceEEEEcCCCccccccccccceeeccccccccccccccccceeeeeeeeeccCCcEEecceEEeeeeeec Confidence 99876 999999999999999632222110000 00 000 0000000 Q ss_pred ---------------------c-------ccc--------------------c----------------cccceeEEEec Q lcl|NC_015719. 
277 ---------------------T-------GGK--------------------V----------------NKENVVGLFQH 292 (344) Q Consensus 277 ---------------------~-------~~~--------------------~----------------~~~~~~gl~~~ 292 (344) . .+. + ...-..-|+|| T Consensus 277 ~~~t~~~~~~~~t~~~~~~~V~~~~~~~a~g~~~v~i~p~~~~~~~~~~~~~v~a~~a~~~~vt~~~~a~~~~~~nl~~~ 356 (423) T protein:vir:35 277 NQQSKQTLYNGSTAMSFTATVLEETNSTASGDVTVKLSGVPIYDEKNSQYNAVDAKVKAGDAVSIIGTAKQQMKPNLFYN 356 (423) T ss_pred cccccceeecccCCceeEEEEeccccccccCceeEEccccccccCCCcccccccccccCCceeeeeecCCCceeEEEeec Confidence 0 000 0 00111447999 Q ss_pred HHHHhhhh-----------------hheeeeeeeecchhhhhhhhhhhhhcCceeccccEEEEEecC Q lcl|NC_015719. 293 RSAVGTVK-----------------LKDLALERARRAEYQADQIIAKYAMGHGGLRPESAGALVFKA 342 (344) Q Consensus 293 ~~Av~~~~-----------------~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~ 342 (344) ++|+..+. ...+++..++|.+..-+.++-=..||.+.+|||.++.+.-.. T Consensus 357 ~~a~~l~~~~l~~~~~~~~~~~~~~g~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~~~~g~~ 423 (423) T protein:vir:35 357 KFFCGLGTIPLPKLHSLDSAVATYEGFSIRVHKYADGDANKQMMRFDLLPAYVCFNPHMGGQFFGNP 423 (423) T ss_pred CceeEEEEEccccCCccceeeccccCceEEEEEeeccccCceEEEEEeecceeeecccceEEEEecC Confidence 98875543 334455556777766666777778999999999998877777 No 42 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=100.00 E-value=5.6e-38 Score=224.82 Aligned_cols=299 Identities=13% Similarity=0.112 Sum_probs=207.8 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceee---ec---ccccEEEEeecCcceeeeee Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQR---QI---SSGKSAQFPVIGRTKAAYLQ 74 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~---~i---~~G~tv~i~~iG~~t~~~~~ 74 (344) |||.- +..-+++|+.++++.|++..++.++++.. ++ +.|+||+|++.+..++++|. 
T Consensus 1 MaN~l------------------lT~ip~iia~~al~~l~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~~~~~~~~~ 62 (423) T protein:vir:17 1 MPNNL------------------DSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTP 62 (423) T ss_pred Cccch------------------hhhhHHHHHHHHHHHHHhhcccchhhcccCCcchhhcccCCEEEEeeCCcceeeccc Confidence 77621 11125899999999999999999998863 33 36999999999999999997 Q ss_pred CCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|NC_015719. 75 PGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNEN 154 (344) Q Consensus 75 ~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~ 154 (344) ......-+.+++...++.|+||+.||++|.++|.|+.+.--|+ +++.+.+.++||+.+|+.++..+.+.+... T Consensus 63 ~~~~~~~~~~~l~e~~v~l~id~~k~va~~v~d~E~~~~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~a~~~------ 135 (423) T protein:vir:17 63 TGDISGQNKNNLISGKATGRVGNYITVAVEYQQLEEAIKLNQL-EEILAPVRQRIVTDLETELAHFMMNNGALS------ 135 (423) T ss_pred CcccCCcccCccccceeEEEeeceeeeeeeecHHHHhcChhHH-HHHHHHHHHHHHHHHHHHHHHHHhhccccc------ Confidence 5332212346777788999999999999999999998665565 889999999999999999886654422110 Q ss_pred cccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchh-hhhccccccccc Q lcl|NC_015719. 155 IAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMP-NAANYAALIDPE 233 (344) Q Consensus 155 ~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~-~~~~~~~~~~~~ 233 (344) .+..+. 
.+ ..|+.+++++.+|++++||.+|||+||+|++|..|++++++ ...+..++..++ T Consensus 136 -----------~gt~~t--~~-----~a~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr 197 (423) T protein:vir:17 136 -----------LGSPNT--PI-----TKWSDVAQTASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAWE 197 (423) T ss_pred -----------cccCCc--cc-----ccHHHHHHHHHHHHhccCCcCCCEEEeChHHHHHHhccccceecccccchHHHh Confidence 001110 11 12788999999999999999999999999999999987754 444555666799 Q ss_pred ccee-EEEeCeEEEEecccccccccccc-----------cccccccc---------------c-----ccccccc----- Q lcl|NC_015719. 234 RGSI-RNVMGFEVVEVPHLTAGGAGDDR-----------PEEGTDAS---------------N-----QKHAFPA----- 276 (344) Q Consensus 234 ~G~V-g~i~G~~V~~sn~lp~~~~~~~~-----------~~~~~~~~---------------~-----~~~~~~~----- 276 (344) +|.| |+++||+||+||++|......+. .+...... + +.-.|.+ T Consensus 198 ~g~i~G~i~GFdvy~Snnip~~T~gt~~~t~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~t~aGv~~v~ 277 (423) T protein:vir:17 198 NAQIPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQPTVTYNAVKDSYQFTVTLTGATTSVTGFLKAGDQVKFTNTYWLQ 277 (423) T ss_pred hccceeeecceEEEEeCCCccccccceeceeeecccccccccccccccceeeeeeeeeeeccCceeecceEEecceeeec Confidence 9987 89999999999999953222211 00000000 0 0000000 Q ss_pred --------------------c-------cc--------------------cccc----------------cceeEEEecH Q lcl|NC_015719. 277 --------------------T-------GG--------------------KVNK----------------ENVVGLFQHR 293 (344) Q Consensus 277 --------------------~-------~~--------------------~~~~----------------~~~~gl~~~~ 293 (344) . .+ .+++ ...+-|+||+ T Consensus 278 ~~tk~v~~~~~t~~~~~~~v~~~~~~~a~~~~tv~i~p~~i~~~~~~~~~~v~a~~a~~~~vT~~~~a~~t~~~nl~~~~ 357 (423) T protein:vir:17 278 QQTKQALYNGATPISFTATVTADANSDSSGDVTVTLSGVPIYDTTNPQYNSVSRQVAAGDAVSVVGTASQTMKPNLFYNK 357 (423) T ss_pred ccccccccccccccceEEEEEecccccccCceEEEecCccccccCCcccccceecccCCceeeccccccCCeeEEEEecC Confidence 0 00 0000 0113379999 Q ss_pred HHHhhhh-----------------hheeeeeeeecchhhhhhhhhhhhhcCceeccccEEEEEecC Q lcl|NC_015719. 
294 SAVGTVK-----------------LKDLALERARRAEYQADQIIAKYAMGHGGLRPESAGALVFKA 342 (344) Q Consensus 294 ~Av~~~~-----------------~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~ 342 (344) +|+..+. ...+++-.+||.+..-..++-=..||.+.+|||.++.+.-.. T Consensus 358 ~a~~l~~~pl~~~~~~~~~~~~~~g~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~~~~g~~ 423 (423) T protein:vir:17 358 FFCGLGSIPLPKLHSIDSAVATYEGFSIRVHKYADGDANVQKMRFDLLPAYVCFNPHMGGQFFGNP 423 (423) T ss_pred cceEEEEEcccCCCccceeecccCCcEEEEEEecccccceeEEEEEeecceeeeccceEEEEEecC Confidence 9876543 334445555666555566777777999999999998887777 No 43 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=100.00 E-value=6e-38 Score=224.63 Aligned_cols=298 Identities=14% Similarity=0.116 Sum_probs=209.4 Q ss_pred CCCccccccccccccccccccchhhhh-HHHHhhHHHHHHHHhhhhcCCceee---ec---ccccEEEEeecCcceeeee Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALF-LKVFGGEVLTAFARTSVTANRHMQR---QI---SSGKSAQFPVIGRTKAAYL 73 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~-~e~f~geV~~~f~~~s~~~~~~~~~---~i---~~G~tv~i~~iG~~t~~~~ 73 (344) |||.- + .| +|+|+.++++.|++..++.++++.. ++ +.|+||+|++.+..++++| T Consensus 1 MaN~l------------------l-T~~p~iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~~~~~~d~ 61 (423) T protein:vir:10 1 MPNNL------------------D-SNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRT 61 (423) T ss_pred Cccch------------------h-hhhHHHHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCCceeeecc Confidence 77621 1 13 5899999999999999999998863 34 3699999999999999999 Q ss_pred eCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|NC_015719. 74 QPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNE 153 (344) Q Consensus 74 ~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~ 153 (344) +++..-.-+.+++...++.|+||+.||++|.++|.|+.+.--|+ +.+++.+.++||+.+|+.++..+...+... 
T Consensus 62 ~~~~~~~~~~~dl~e~~v~l~id~~k~va~~v~d~E~~~~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~~~~~----- 135 (423) T protein:vir:10 62 PTGDISGQNKNNLISGKATGRVGNYITVAVEYQQLEEAIKLNQL-EEILAPVRQRIVTDLETELAHFMMNNGALS----- 135 (423) T ss_pred CCccccccccCccccceeEEEeeceeeeeeeechHHHhcChhhH-HHHHHHHHHHHHHHHHHHHHHHHhhccccc----- Confidence 86432112345788889999999999999999999998655555 889999999999999999876554321110 Q ss_pred ccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchh-hhhcccccccc Q lcl|NC_015719. 154 NIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMP-NAANYAALIDP 232 (344) Q Consensus 154 ~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~-~~~~~~~~~~~ 232 (344) .+.... .+ ..|+.+.+++.+|++++||..|||+||+|++|..|++++++ ...+..++..+ T Consensus 136 ------------~gt~~t--~~-----~a~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~al 196 (423) T protein:vir:10 136 ------------LGSPNT--PI-----TKWSDVAQTASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAW 196 (423) T ss_pred ------------cccCCc--cc-----chHHHHHHHHHHHHhccCCcCCCEEEeChHHHHHHhccccceecccccchhhh Confidence 011111 11 12788999999999999999999999999999999977664 45555666779 Q ss_pred cccee-EEEeCeEEEEecccccccccccc-----------cccccccc--------------------cccccccc---- Q lcl|NC_015719. 233 ERGSI-RNVMGFEVVEVPHLTAGGAGDDR-----------PEEGTDAS--------------------NQKHAFPA---- 276 (344) Q Consensus 233 ~~G~V-g~i~G~~V~~sn~lp~~~~~~~~-----------~~~~~~~~--------------------~~~~~~~~---- 276 (344) ++|.| |+++||+||+||++|......+. .+....+. ++.-.|++ T Consensus 197 r~g~i~G~i~GFdv~~Snnip~~T~gt~~~t~~~~~~~~v~~~a~~~a~~~~~~~~~~~~~~~~~l~~GD~~t~aGv~~v 276 (423) T protein:vir:10 197 ENAQIPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQPTVTYNAVKDSYQFTVTLTGATASVTGFLKAGDQVKFTNTYWL 276 (423) T ss_pred hhccceeeecceEEEEeCCCccccccccccceeeeecceeccccccccceeeeeeeeccccccCceeecceEEecceeee Confidence 99987 89999999999999963222211 00000000 00000000 Q ss_pred ---------------------c-------ccc--------------------ccc----------------cceeEEEec Q lcl|NC_015719. 
277 ---------------------T-------GGK--------------------VNK----------------ENVVGLFQH 292 (344) Q Consensus 277 ---------------------~-------~~~--------------------~~~----------------~~~~gl~~~ 292 (344) . .+. +++ ...+-|+|| T Consensus 277 ~~~tk~~~~~~~t~~~~~~~v~a~~~~~~~g~~tv~i~p~~i~~~~~~~~~~v~a~~a~~~~vT~~~~a~~t~~~nl~~~ 356 (423) T protein:vir:10 277 QQQTKQALYNGATPISFTATVTADANSDSGGDVTVTLSGVPIYDTTNPQYNSVSRQVEAGDAVSVVGTASQTMKPNLFYN 356 (423) T ss_pred cccccccccccccCcceEEEEEeeeeeccCCceeeeccCccccccCCcccccccccccCCceeeccccccCCeeEEEEec Confidence 0 000 000 011337999 Q ss_pred HHHHhhhh-----------------hheeeeeeeecchhhhhhhhhhhhhcCceeccccEEEEEecC Q lcl|NC_015719. 293 RSAVGTVK-----------------LKDLALERARRAEYQADQIIAKYAMGHGGLRPESAGALVFKA 342 (344) Q Consensus 293 ~~Av~~~~-----------------~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~ 342 (344) ++|+..+. ...+++-.+||.+..-..++-=..||.+.+|||.++.+.-.. T Consensus 357 ~~a~~l~~~pl~~~~~~~~~~~~~~g~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~~~~g~~ 423 (423) T protein:vir:10 357 KFFCGLGSIPLPKLHSIDSAVATYEGFSIRVHKYADGDANVQKMRFDLLPAYVCFNPHMGGQFFGNP 423 (423) T ss_pred CcceEEEEEcccCCCccceeeccccCceEEEEEeeeccccceEEEEEeecceeeeccceEEEEEecC Confidence 99875543 344555556777666666777777999999999998887777 No 44 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=100.00 E-value=6.3e-38 Score=224.54 Aligned_cols=265 Identities=17% Similarity=0.164 Sum_probs=219.7 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceee-ec--ccccEEEEeecCcc-eeeeeeCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQR-QI--SSGKSAQFPVIGRT-KAAYLQPG 76 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~-~i--~~G~tv~i~~iG~~-t~~~~~~g 76 (344) |||.++ +. .+-|..|+|+..|.+.+.+..++.++.... ++ ..|++|+||..+.. 
.+.++..| T Consensus 1 Ma~~~T------~l--------~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg 66 (276) T protein:vir:10 1 MAQGTT------TK--------STQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEG 66 (276) T ss_pred CCccee------eh--------hhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCCCccccccCC Confidence 998553 22 223678999999999999999999988764 34 36999999987654 45678889 Q ss_pred CCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Q lcl|NC_015719. 77 ESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIA 156 (344) Q Consensus 77 ~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (344) +.++. +.++.++.+.+|.+ .+..|.++|++..++..|++.+++++++.+||+++|+.++..+..... T Consensus 67 ~~i~~--~~lt~~~~~a~i~~-~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~~---------- 133 (276) T protein:vir:10 67 QKIPV--DKIETNRREAKIHK-IGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTKL---------- 133 (276) T ss_pred CccCc--cccccceeeEEeeh-ccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc---------- Confidence 98875 46889999999965 689999999999999999999999999999999999999866542110 Q ss_pred cccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhcc--chhhhhcccccccccc Q lcl|NC_015719. 157 GLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAA--LMPNAANYAALIDPER 234 (344) Q Consensus 157 ~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~--~~~~~~~~~~~~~~~~ 234 (344) ...+ + . 
..++.|.+|..+|+++++ +.++++|+|++|..|+++ .+|+..+..++..+++ T Consensus 134 --------~~~~--~---~-----~t~d~i~~A~~~lgd~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~ 193 (276) T protein:vir:10 134 --------TVSA--D---I-----GTLAGLEAAIDTFDDEDL--EPMVLFINPKDAGKLRSSASDNFTRATELGDNIIVK 193 (276) T ss_pred --------cccc--c---c-----cCHHHHHHHHHHhccccC--cccEEEEcHHHHHHHHHhccccccccccccccceec Confidence 0000 0 0 126788899999998876 679999999999999764 6788777667778999 Q ss_pred ceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchh Q lcl|NC_015719. 235 GSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEY 314 (344) Q Consensus 235 G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~ 314 (344) |.|+++.|++|++|+++|.+ .+++|++.|++++..+++++|..||+++ T Consensus 194 G~ig~~~G~~Vi~s~~~p~~--------------------------------t~~l~~~gAi~~~~~~~~~vE~dRd~~~ 241 (276) T protein:vir:10 194 GAFGEALGAVIVRSKKLDEG--------------------------------EAILAKRGAVKLITKRDFFLETDRDPST 241 (276) T ss_pred cccceecceeEEEcCCCCcc--------------------------------eEEEEeccceeeeecCCceeecccchhh Confidence 99999999999999999742 1257788899999999999999999999 Q ss_pred hhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
315 QADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 315 ~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +.|.|.+++.||+++++|+.++.++...++ T Consensus 242 ~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~ 271 (276) T protein:vir:10 242 KTTALYSDKHYVAYLYDESKAVKVTKGAGT 271 (276) T ss_pred cccEEEEeeEEEEEEEcCcceEEEecCCcC Confidence 999999999999999999999999988888 No 45 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=100.00 E-value=3.6e-37 Score=220.41 Aligned_cols=284 Identities=13% Similarity=0.074 Sum_probs=188.7 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceee-----ecccccEEEEeecCcceeeeeeC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQR-----QISSGKSAQFPVIGRTKAAYLQP 75 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~-----~i~~G~tv~i~~iG~~t~~~~~~ 75 (344) ||.++ |+|+|+..+++.|...+++..+.+.. .+.+|++||||+++.+.++||++ T Consensus 1 MA~~n---------------------~a~~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~~~gg~tVkI~~i~~~gl~DY~R 59 (299) T protein:vir:79 1 MAALN---------------------YAKEYSNVLAQAYPYTLNFGDLYATPNNGRYRWTGSKTIEIPTISTTGRVDSNR 59 (299) T ss_pred Cccch---------------------hHHHHHHHHHHHHHhhceeeeeccCcccceeeecCCCEEEEecccccccccccc Confidence 66433 67999999999999999988765542 34579999999999999999998 Q ss_pred CCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhH--HHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|NC_015719. 76 GESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDV--RSEYTSQIGESLAMAADGAVLAELAGLINLADGVNE 153 (344) Q Consensus 76 g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~--~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~ 153 (344) ++.... ...++.+..+++|||.+||.|.||++|..|++..+ ...+.+.+.+.++..+|.+.+..|+..+... 
T Consensus 60 ~~~g~~-~g~~~~~~~t~~ldqdr~~~f~vD~~Dvdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl~~~a~~~----- 133 (299) T protein:vir:79 60 DTIAVA-QRNYDNAWEPKVLTNQRKWSTLVHPADINQTNYVASIGNITKVYNEEQKFPEMDAYCISKIYADWTAL----- 133 (299) T ss_pred CCCccc-ccccCcceeEEEeeccccceeccchhhHHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHHHhhhhc----- Confidence 764332 24578889999999999999999988777776554 3334455667778888888887776433211 Q ss_pred ccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhc-ccccccc Q lcl|NC_015719. 154 NIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAAN-YAALIDP 232 (344) Q Consensus 154 ~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~-~~~~~~~ 232 (344) +..+..+.+ .+.++|+.|+++.++|+|++||.+|||++|+|++|.+|+++++|++.. ....... T Consensus 134 -------------g~~~~~~~~--T~~n~y~~i~~~~~~lde~~vP~~~rvl~vtp~~~~~L~~~~~f~k~~~~~~~~~~ 198 (299) T protein:vir:79 134 -------------GNTADTTVL--TTTNVLEVFDKLMEKMTEARVPENGRILYVTPVVNTLIKNAKEIQRTVNIKDAGTS 198 (299) T ss_pred -------------CCccccccc--CHHHHHHHHHHHHHHHHhcCCCCCCeEEEeCHHHHHHHhhchhhhcccccccccce Confidence 001111111 135689999999999999999999999999999999999999998654 3444467 Q ss_pred ccceeEEEeCeEEEE--eccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeee Q lcl|NC_015719. 233 ERGSIRNVMGFEVVE--VPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERAR 310 (344) Q Consensus 233 ~~G~Vg~i~G~~V~~--sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~ 310 (344) .+|.|++++||+|++ |++++..- .+ ..|... .. +..+.-.++.|++|+......+ .+..+ T Consensus 199 ~~g~Vg~idG~~Ii~Vps~r~~t~~--~~--~~G~~~--------~~----~ak~in~ii~~~~a~~~~~K~~-~~~~~- 260 (299) T protein:vir:79 199 LNRQTTDIDTVKIIKVPSNLMKTAY--DF--TTGWKV--------GA----GAKQIFMSLVHPSAIITPVSYQ-FSKLD- 260 (299) T ss_pred eeeeeeeecceEEEEechhhcCccc--ee--ccCccc--------cC----cccccceEEEcCCeeeeeEeee-eEEee- Confidence 899999999999998 45565321 01 111100 00 1123345788999876554333 33333 Q ss_pred cc--hhhhhhhhhhhhhc-CceeccccEEEEEecCCC Q lcl|NC_015719. 
311 RA--EYQADQIIAKYAMG-HGGLRPESAGALVFKAGA 344 (344) Q Consensus 311 ~~--~~~~d~i~~~~~~G-~~v~Rp~~~~~l~~~~~a 344 (344) +| ...+|....-..|+ .=++.-...++..-..+| T Consensus 261 ~P~~~~~~~~~~~~r~y~d~~v~~nk~~~i~~~~~~a 297 (299) T protein:vir:79 261 EPTAVTEGKYFYFEESFEDVFILNKKADAIQFVVEGA 297 (299) T ss_pred cCCCCCccceeeeeeeeeeeeeeccccCeEEEEeeec Confidence 33 33334333322233 334433333333333333 No 46 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=100.00 E-value=1.8e-36 Score=216.54 Aligned_cols=298 Identities=13% Similarity=0.085 Sum_probs=206.5 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceee---ec---ccccEEEEeecCcceeeeee Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQR---QI---SSGKSAQFPVIGRTKAAYLQ 74 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~---~i---~~G~tv~i~~iG~~t~~~~~ 74 (344) |||.-. +|-.|+|+.+.++.|++..++.++++.. ++ +.|+||+|++.+..++++.. T Consensus 1 MANsl~------------------~l~p~iia~~al~~l~~~lV~~~lV~r~y~~ef~~ak~GDTV~I~~P~~~~~~d~~ 62 (423) T protein:vir:10 1 MANNLD------------------ANVSQIVLKKFLPGFMSDLVLCKTVDRQLLAGEINSSTGDSVSFKRPHQFKSERTM 62 (423) T ss_pred Cccccc------------------cccHHHHHHHHHHHHHhhcccchhhccCCCccccccccCCEEEEeeCCceeeeccc Confidence 776221 2566899999999999999999998862 33 35999999999999888754 Q ss_pred CCCCCCC-CcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|NC_015719. 75 PGESLDD-KRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNE 153 (344) Q Consensus 75 ~g~~~~~-~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~ 153 (344) .+ .+.+ ..+++...++.++||+.+|++|.++|.|+.+.--|+ +.+++.+.++||+.+|+.+...+.+.+... 
T Consensus 63 ~~-~~t~~~~~~l~e~~v~l~id~~k~~a~~v~d~E~~l~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~~~~~----- 135 (423) T protein:vir:10 63 DG-DITGKSKNSLISAKATGEVGNYITVAVEYRQIEEALKLNQL-DQILVPINERMVTDLETELALFMMKHGALS----- 135 (423) T ss_pred Cc-ccCcccccccccceEEEEecceeeeeeeeChHHHhcChhHH-HHHHHHHHHHHHHHHHHHHHHHhhhccccc----- Confidence 43 3322 234566678999999999999999999988554455 889999999999999999976665422111 Q ss_pred ccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchh-hhhcccccccc Q lcl|NC_015719. 154 NIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMP-NAANYAALIDP 232 (344) Q Consensus 154 ~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~-~~~~~~~~~~~ 232 (344) ++..+.. +. .|+.+.+++.+|++++||..|||+||+|++|..|++++.+ ...+..++..+ T Consensus 136 ------------vgt~~t~--~~-----a~~~~a~a~~~L~~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~al 196 (423) T protein:vir:10 136 ------------LGSPNTP--IK-----KWSDVAQTASFLKDLGINSGENYAVMDPWAAQRLADAQSGLHVSEQLVRTAW 196 (423) T ss_pred ------------ccccccc--cc-----cHHHHHHHHHHHhhccCCcCCCEEEeCHHHHHHHhhhhhhhccccccchHHH Confidence 0111111 10 1688999999999999999999999999999999876654 44455667779 Q ss_pred cccee-EEEeCeEEEEecccccccccccc-----cccc----cc---------------cc--------------c---- Q lcl|NC_015719. 233 ERGSI-RNVMGFEVVEVPHLTAGGAGDDR-----PEEG----TD---------------AS--------------N---- 269 (344) Q Consensus 233 ~~G~V-g~i~G~~V~~sn~lp~~~~~~~~-----~~~~----~~---------------~~--------------~---- 269 (344) ++|.| |+++||+||+||++|..+.+... .+.. .+ .+ + T Consensus 197 r~~~i~G~~~GFdi~~Sn~vp~~T~g~~~ga~~~~~~~~vt~a~~~~~~~~~~~~~~~T~s~~g~l~~GD~~t~aGv~~v 276 (423) T protein:vir:10 197 ENAQISGNFGGIRALMSNGLASRTQGAFGGKLTVKGTPEVNYDSVKDSYAFTATLTGATASKKGFLKVGDQLQFDDTHWL 276 (423) T ss_pred HhcccceeecceEEEEecCCcccccccccceeeeeeeeEEEecccccccccccceeeccceeceeEEecceEeecceeee Confidence 99977 99999999999999953211110 0000 00 00 0 Q ss_pred --------------ccccccc-------------------cccc--------c----------------cccceeEEEec Q lcl|NC_015719. 
270 --------------QKHAFPA-------------------TGGK--------V----------------NKENVVGLFQH 292 (344) Q Consensus 270 --------------~~~~~~~-------------------~~~~--------~----------------~~~~~~gl~~~ 292 (344) ....|-. -..+ + ...-..-|+|| T Consensus 277 ~~~tk~~l~~~~~~~~~~~~V~~~~~~~a~~~~tv~i~p~~~~~~~~~~~~~V~a~~a~~~~vT~~~~~~~t~~~nl~~~ 356 (423) T protein:vir:10 277 NQQSKQTLYNGASALSFTATVMEDANAHSSGDVTVKISGVPIFDAGYPQYNAVDRLLAEGDTVSVIGTSKQAMKPNLFYN 356 (423) T ss_pred cccccceeecccCCcceEEEEEecccccccCceEEEeccccccccCcccccceeccccCCceeEEeeccCCceeEEEEec Confidence 0000000 0000 0 00011337999 Q ss_pred HHHHhhh-----------------hhheeeeeeeecchhhhhhhhhhhhhcCceeccccEEEEEecC Q lcl|NC_015719. 293 RSAVGTV-----------------KLKDLALERARRAEYQADQIIAKYAMGHGGLRPESAGALVFKA 342 (344) Q Consensus 293 ~~Av~~~-----------------~~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~ 342 (344) ++|+..+ +...+++-.+||.+..-..++-=..||.+.+|||.++.+.-.. T Consensus 357 ~~a~~l~~~pl~~~~~~~~~~~~~~g~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~~~~g~~ 423 (423) T protein:vir:10 357 KLFCGLGTIPLPKLHSIDSAVATYEGFSIRVHKYADGDANKQMMRFDLLPAYVCYNPHMGGQFFGNP 423 (423) T ss_pred CcceEEEEEcccCCCccceeecccccceEEEEEeeeccccceEEEEEeecceeeeccceEEEEEecC Confidence 9887544 3344555566777766666777778999999999998887777 No 47 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=100.00 E-value=6.9e-37 Score=218.84 Aligned_cols=264 Identities=17% Similarity=0.137 Sum_probs=215.4 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceee-ec--ccccEEEEeecCc-ceeeeeeCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQR-QI--SSGKSAQFPVIGR-TKAAYLQPG 76 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~-~i--~~G~tv~i~~iG~-~t~~~~~~g 76 (344) ||+.++ +.+ | -+..|+|+..|.+.+.+.+++.++.... ++ ..|++|+||+.+. 
.++.++..| T Consensus 1 MA~~~T------~~~------~--~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg 66 (272) T protein:vir:30 1 MAVGTT------KMA------Q--MLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEG 66 (272) T ss_pred CCCccc------cch------h--eechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCC Confidence 998653 222 2 3577999999999999999998887763 33 3599999999864 477889889 Q ss_pred CCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Q lcl|NC_015719. 77 ESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIA 156 (344) Q Consensus 77 ~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (344) +.++.. .++.++.++++.+. ...+.|+|.+..++..|+++.+.+++++++++++|+.++..+.+... T Consensus 67 ~~i~~~--~~~~~~~~~~~~~~-~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~---------- 133 (272) T protein:vir:30 67 EAIPMT--QLGFKKTTMTIKKA-GKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQ---------- 133 (272) T ss_pred Cccccc--ccccceEEEEeeee-eeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhccccc---------- Confidence 988754 67889999999885 57799999999999999999999999999999999999865432110 Q ss_pred cccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccc--hhhhhcccccccccc Q lcl|NC_015719. 157 GLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAAL--MPNAANYAALIDPER 234 (344) Q Consensus 157 ~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~--~~~~~~~~~~~~~~~ 234 (344) .... ...++.|.+|..+|++.+. ..|+++|+|++|..|+++. 
+++.....+...+++ T Consensus 134 --------~~~~-----------~~t~d~i~da~~~l~~~~~--~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~ 192 (272) T protein:vir:30 134 --------TVEA-----------TATVDGVSKALDIFNDEDD--AETVIVMNPADASTLRLDAAKEWLGATEVGANRVVS 192 (272) T ss_pred --------cccc-----------ccCHHHHHHHHHHHhccCC--CccEEEEcHHHHHHHHHhcccccccccccccccccc Confidence 0000 0126778889999988764 5689999999999998774 455555555667899 Q ss_pred ceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchh Q lcl|NC_015719. 235 GSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEY 314 (344) Q Consensus 235 G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~ 314 (344) |.|+++.|++|++|+++|.+. +++|++.+++.+..+++++|.+|++++ T Consensus 193 g~ig~i~G~~Vi~s~~~p~~t--------------------------------~~~~~~~a~~~~~~~~~~ve~~r~~~~ 240 (272) T protein:vir:30 193 GVYGEVLGVQIVRSRKCPKGT--------------------------------AYMVRKGALRIMLKRNTMVETDRDITK 240 (272) T ss_pred ccchhhcCeeEEEcCCCCcce--------------------------------EEEEcCCeEEEEecCCceeeecccccc Confidence 999999999999999998421 256788888889899999999999999 Q ss_pred hhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 315 QADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 315 ~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +.|.|+++++||.++++|++++.+++++-+ T Consensus 241 ~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~ 270 (272) T protein:vir:30 241 AINQIVANKHYGVYLYKAEKAVKITLKDAA 270 (272) T ss_pred ceeEEEEEEEEEEEEEcCCceEEEEecccc Confidence 999999999999999999999999999877 No 48 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=100.00 E-value=6.9e-37 Score=218.84 Aligned_cols=264 Identities=17% Similarity=0.137 Sum_probs=215.4 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceee-ec--ccccEEEEeecCc-ceeeeeeCC Q lcl|NC_015719. 
1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQR-QI--SSGKSAQFPVIGR-TKAAYLQPG 76 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~-~i--~~G~tv~i~~iG~-~t~~~~~~g 76 (344) ||+.++ +.+ | -+..|+|+..|.+.+.+.+++.++.... ++ ..|++|+||+.+. .++.++..| T Consensus 1 MA~~~T------~~~------~--~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg 66 (272) T protein:vir:98 1 MAVGTT------KMA------Q--MLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEG 66 (272) T ss_pred CCCccc------cch------h--eechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCC Confidence 998653 222 2 3577999999999999999998887763 33 3599999999864 477889889 Q ss_pred CCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Q lcl|NC_015719. 77 ESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIA 156 (344) Q Consensus 77 ~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (344) +.++.. .++.++.++++.+. ...+.|+|.+..++..|+++.+.+++++++++++|+.++..+.+... T Consensus 67 ~~i~~~--~~~~~~~~~~~~~~-~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~---------- 133 (272) T protein:vir:98 67 EAIPMT--QLGFKKTTMTIKKA-GKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQ---------- 133 (272) T ss_pred Cccccc--ccccceEEEEeeee-eeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhccccc---------- Confidence 988754 67889999999885 57799999999999999999999999999999999999865432110 Q ss_pred cccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccc--hhhhhcccccccccc Q lcl|NC_015719. 157 GLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAAL--MPNAANYAALIDPER 234 (344) Q Consensus 157 ~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~--~~~~~~~~~~~~~~~ 234 (344) .... ...++.|.+|..+|++.+. ..|+++|+|++|..|+++. 
+++.....+...+++ T Consensus 134 --------~~~~-----------~~t~d~i~da~~~l~~~~~--~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~ 192 (272) T protein:vir:98 134 --------TVEA-----------TATVDGVSKALDIFNDEDD--AETVIVMNPADASTLRLDAAKEWLGATEVGANRVVS 192 (272) T ss_pred --------cccc-----------ccCHHHHHHHHHHHhccCC--CccEEEEcHHHHHHHHHhcccccccccccccccccc Confidence 0000 0126778889999988764 5689999999999998774 455555555667899 Q ss_pred ceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchh Q lcl|NC_015719. 235 GSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEY 314 (344) Q Consensus 235 G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~ 314 (344) |.|+++.|++|++|+++|.+. +++|++.+++.+..+++++|.+|++++ T Consensus 193 g~ig~i~G~~Vi~s~~~p~~t--------------------------------~~~~~~~a~~~~~~~~~~ve~~r~~~~ 240 (272) T protein:vir:98 193 GVYGEVLGVQIVRSRKCPKGT--------------------------------AYMVRKGALRIMLKRNTMVETDRDITK 240 (272) T ss_pred ccchhhcCeeEEEcCCCCcce--------------------------------EEEEcCCeEEEEecCCceeeecccccc Confidence 999999999999999998421 256788888889899999999999999 Q ss_pred hhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 315 QADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 315 ~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +.|.|+++++||.++++|++++.+++++-+ T Consensus 241 ~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~ 270 (272) T protein:vir:98 241 AINQIVANKHYGVYLYKAEKAVKITLKDAA 270 (272) T ss_pred ceeEEEEEEEEEEEEEcCCceEEEEecccc Confidence 999999999999999999999999999877 No 49 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=99.96 E-value=2.4e-32 Score=193.98 Aligned_cols=280 Identities=14% Similarity=0.054 Sum_probs=196.2 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCcee-eecccccEEEEeecCcceeeeeeCCCCC Q lcl|NC_015719. 
1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQ-RQISSGKSAQFPVIGRTKAAYLQPGESL 79 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~-~~i~~G~tv~i~~iG~~t~~~~~~g~~~ 79 (344) ||- . |.++|+..+++.|...+++..+.+. ..+.+|++|+||+++.+.+++|++++.. T Consensus 1 Mai---------------------n-~a~~~~~~Ld~~~~~~~~t~~l~~~~~~~~ggktVkI~~i~~~gl~DY~R~~g~ 58 (290) T protein:vir:78 1 MAI---------------------N-YVDKYGKELDQKLVFGTYTNELETPNLLWLDAKTFKIQTITTTGLKAHTRNKGY 58 (290) T ss_pred Cch---------------------h-HHHHHHHHHHHHHHhhheeeeccccceeeccCCEEEEeeeccCcccccccCCCc Confidence 331 1 4589999999999999988777654 3567899999999999999999998866 Q ss_pred CCCcCCcccceEEEEeeeeeeeceecc--chHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc Q lcl|NC_015719. 80 DDKRKDIKHTEKTINIDGLLTADVLIY--DIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAG 157 (344) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Id--d~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~ 157 (344) ... .++.+..+++||+.++|.|.|| |+||.+....+...+.+.+.+.++..+|.+.+..|+..+.... T Consensus 59 ~~g--~v~~~~et~tl~qdR~~~F~vD~~DvDEt~~~~~~~nv~~ef~~~~v~PEiDayr~skla~~a~~~~-------- 128 (290) T protein:vir:78 59 NEG--SASNTNKSYTIDFDRDVEFFVDVMDVDETGQALSAANVTKEFNSRHAGPEMDAYRFSKLATAAKTNS-------- 128 (290) T ss_pred ccC--ccccceeeEEeeccccceeeccccchhHHhhhhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhhhccC-------- Confidence 543 5788899999999999999999 8888888888999999999999999999999887765442111 Q ss_pred ccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccc-c-ccccccc Q lcl|NC_015719. 158 LGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYA-A-LIDPERG 235 (344) Q Consensus 158 ~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~-~-~~~~~~G 235 (344) .. ++ . +...+++|+.|+++.++|+| ||.+|||++|+|++|.+|+++++|+..... . 
.....+| T Consensus 129 ----~~---~~-~-----t~t~~n~~~~i~~~~~~lde--vp~~~rvl~vtp~~~~lL~~~~~f~r~~~~~~~~~~~i~~ 193 (290) T protein:vir:78 129 ----NS---VA-E-----EITKDNVFTKLKAAIRKVKK--YGTQNLVMYVSPDVMAALELSDDFVRAINVQNIGPSSIET 193 (290) T ss_pred ----cc---cc-c-----ccCHHHHHHHHHHHHHHHHh--cCCCCeEEEECHHHHHHHhhChhhhccccccccccccccc Confidence 00 00 0 01135689999999999987 899999999999999999999999864322 2 2234599 Q ss_pred eeEEEeCeEEEEeccc-cccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchh Q lcl|NC_015719. 236 SIRNVMGFEVVEVPHL-TAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEY 314 (344) Q Consensus 236 ~Vg~i~G~~V~~sn~l-p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~ 314 (344) +|++++||+|+++++- .... .+....|+... ....+.-.++.|++|+......+ .+..+ +|.. T Consensus 194 ~V~~idG~~ii~vps~~r~~t--~~~f~~G~~~~------------~~ak~in~ii~~~~a~i~~~K~~-~~~~~-~P~~ 257 (290) T protein:vir:78 194 RITAIDGTRIVEVEAEDRFYD--TFDFTDGYKPA------------AGAKKLNFLLVNKGSVVGGAKHA-SIYLH-APGS 257 (290) T ss_pred eeeeecCcEEEEecccchhhh--hhhhccccccc------------CCccceeEEEEcCCceeeeeeee-EEEee-CCCC Confidence 9999999999997631 1111 11111111111 12234455888998875554333 33333 3322 Q ss_pred --h--hhhhhhhhhhcCceeccccEEEEEecCC Q lcl|NC_015719. 315 --Q--ADQIIAKYAMGHGGLRPESAGALVFKAG 343 (344) Q Consensus 315 --~--~d~i~~~~~~G~~v~Rp~~~~~l~~~~~ 343 (344) . 
+|.+..+.-+..=++.-...+++.-.+= T Consensus 258 ~~~~d~~~~~~r~y~d~~v~~nk~~~i~~~~~~ 290 (290) T protein:vir:78 258 VGQGDGWLYQYRVYHDIFVLDQQKDGVIASTEV 290 (290) T ss_pred CcCcceeeeeeeeeeeeeeeccccCeeEEEeeC Confidence 2 4566666555555555544444433333 No 50 >protein:vir:102335 Length: 312 # NCBI annotation: putative capsid protein # Family: family:all:701 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529560;genbank:gi:90592716;genbank:GeneID:3974467 Probab=99.95 E-value=1.8e-30 Score=183.60 Aligned_cols=300 Identities=10% Similarity=-0.007 Sum_probs=200.2 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCcee---eecccccEEEEeecCcceeeeeeCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQ---RQISSGKSAQFPVIGRTKAAYLQPGE 77 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~---~~i~~G~tv~i~~iG~~t~~~~~~g~ 77 (344) |||.- =|.++|+.++++.|...+++..+... -.+.+|++|+||++....+++|++++ T Consensus 1 Mantl--------------------~ya~~~~~~LD~~~~~~~~s~~l~~~~~~v~~~ggktVkIp~i~~~gl~DY~R~~ 60 (312) T protein:vir:10 1 MANTL--------------------AYGQVLQQGLDKQATQELLTGWMDSNAKQIKYEGGKEVKIGKLSTDGLGDYSRGS 60 (312) T ss_pred CCcch--------------------hHHHHHHHHHHHHHHhhhccccccCCCceEEEecCcEEEEEeeeccccccccccc Confidence 77521 17799999999999999877766422 24678999999999999999999976 Q ss_pred CCCCCcCCcccceEEEEeeeeeeeceecc--chHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|NC_015719. 78 SLDDKRKDIKHTEKTINIDGLLTADVLIY--DIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENI 155 (344) Q Consensus 78 ~~~~~~~~~~~~~~~l~iD~~~~~~~~Id--d~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~ 155 (344) ..-.+...++.+..+++++++++|.|.|| |+||.+....+..-+.+.+.+...-.+|.+.+..|+..+...... 
T Consensus 61 g~~~~~g~v~~~~et~tl~qDR~~~F~vD~mDvDETn~~~s~anv~~ef~r~~vvPEiDayrfskla~~a~~~~~~---- 136 (312) T protein:vir:10 61 ANAYVGGDVKFEYETKTMTQDRGRKFTLDAMDVDETNFLVTATTVMGEFQRLKVIPEIDAYRLSRLATIAIGIKGD---- 136 (312) T ss_pred CCccccccccccceeEEeeecccceeeccccchhhHhhHHHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhccccc---- Confidence 53223346888999999999999999999 888877777777777777889999999999888777544322110 Q ss_pred ccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccc Q lcl|NC_015719. 156 AGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERG 235 (344) Q Consensus 156 ~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G 235 (344) +. . +.... ..+.++|+.|.++.++|+|++|| ++|+++|+|+++.+|.++..+............+| T Consensus 137 -----~~---~---~~~~~--~T~~ni~~~i~~~~~~lde~~vp-~~rvl~vTp~~~~lLk~~~~~~~~~~~~~~~~i~~ 202 (312) T protein:vir:10 137 -----TN---V---EYSYS--VNSSTIINKIKTGIKIIRENGYN-GPLVCHLTYDSMFAIEEKVLEKLTAVTFAQGGIQT 202 (312) T ss_pred -----cc---c---ccccc--cCHHHHHHHHHHHHHHHHHccCC-CceEEEeChHHHHHHhhhhhceecccccccceeee Confidence 00 0 00011 12466899999999999999999 69999999999988877544332222223344699 Q ss_pred eeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeee---ecc Q lcl|NC_015719. 236 SIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERA---RRA 312 (344) Q Consensus 236 ~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~---~~~ 312 (344) +|++++|++|++.|.-..-..-.+- .|...+.+. 
..+.......++-.++.|++|+......+ .+..+ -++ T Consensus 203 ~V~~iDgv~Ii~VPs~r~~t~~~f~--dG~t~~~~~---gg~~~~~~ak~INfiiv~~~a~i~~~K~~-~~~if~P~~~~ 276 (312) T protein:vir:10 203 QVPSIDGCALIKTPQNRMYSSILLN--DGTTSNQTA---GGYLKGTKALDTNFIIAPVDVPLAITKQD-KMRIFDPETNQ 276 (312) T ss_pred eeeeecccEEEEchhhhccceeeec--cCccccccc---CceeecCcccccceEEeCCceeeceeeee-eeeeeCCCCCC Confidence 9999999999997654432211111 111000010 11111123344556888998775444332 23222 122 Q ss_pred hhhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 313 EYQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 313 ~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ...+|.+..+.-+..=|+.-...++..--.+| T Consensus 277 ~~d~~~~~~R~Y~D~fv~~nk~~~Iyv~~k~a 308 (312) T protein:vir:10 277 TANAWSMDYRRYHDLWVTDNKANSVYANFKDA 308 (312) T ss_pred CcceeeeeeeeeeeeeeeccccCeEEEEeecc Confidence 33356777777777777777777775555555 No 51 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=99.95 E-value=2.2e-30 Score=183.20 Aligned_cols=285 Identities=9% Similarity=0.029 Sum_probs=185.4 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCC------ceeeecccccEEEEeecC-cceeeee Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANR------HMQRQISSGKSAQFPVIG-RTKAAYL 73 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~------~~~~~i~~G~tv~i~~iG-~~t~~~~ 73 (344) ||- + |.++|+.++++.|..+++.... .....+.+|++|+||++. 
.+-+++| T Consensus 1 Mai-n---------------------ya~~~~~~Ld~~~~~~~lts~~l~~~~~~~~v~~~ggktVkIp~is~tsGl~DY 58 (346) T protein:vir:10 1 MTI-N---------------------YAEKYQAAVQQAFYDGHLYSAELWNSPSNSIIKFDGAKHIKVPRLEITSGRKDR 58 (346) T ss_pred Ccc-h---------------------hHHHHHHHHHHHHHhhhccchhhcccccccceEecCCCEEEEEEeeeecccccc Confidence 331 1 4589999999999887765332 222355789999999995 5678999 Q ss_pred eCCCCCCCCcCCcccceEEEEeeeeeeeceecc--chHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|NC_015719. 74 QPGESLDDKRKDIKHTEKTINIDGLLTADVLIY--DIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGV 151 (344) Q Consensus 74 ~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Id--d~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~ 151 (344) +++..... ...++.+..+++||++++|.|.|| |+||.+.......-+.+......+-.+|.+.+..|+..+...+ T Consensus 59 ~R~~g~~~-~g~v~~~~et~tl~qDR~~~F~vD~mDvDETn~~~~~anv~~ef~r~~vvPEiDayrfskLa~~a~~~~-- 135 (346) T protein:vir:10 59 QRRTITTP-VANYSNDWDSYELKNERYWSTLVDPSDIDETNMVVSLANITKQFNLDSKMPEKDRYMFSHLYSGKEAAH-- 135 (346) T ss_pred cccCCccc-ccccccceeEEEeeccccceecccccchHHHHHHhHHHHHHHHHHHHhhcchhhHHHHHHHHHhhhhhc-- Confidence 87554432 245788999999999999999999 6666554455555555556666777889888777764332211 Q ss_pred ccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccc Q lcl|NC_015719. 152 NENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALID 231 (344) Q Consensus 152 ~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~ 231 (344) .+.. ..+.+ ...++|+.|.++.++|+|++||.+|||++|+|++|.+|+++++|......++.. 
T Consensus 136 --------~~~~-------~~~a~--T~~ni~~~i~~~~~~lde~~vp~~~rvl~vTp~~~~lLk~s~~f~k~~~v~~~~ 198 (346) T protein:vir:10 136 --------DGGI-------TTNTL--DEKNILPAFDNMMLDFDEARIPSTNRILYVTPKTNAILKRAEAMNRALTLKDPN 198 (346) T ss_pred --------cccc-------ccccc--CHHHHHHHHHHHHHHHHHccCCCCCeEEEECHHHHHHHhhchhheecccccccc Confidence 0000 00111 135689999999999999999999999999999999999999998654434444 Q ss_pred cccceeEEEeCeEEEEe--ccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeee Q lcl|NC_015719. 232 PERGSIRNVMGFEVVEV--PHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERA 309 (344) Q Consensus 232 ~~~G~Vg~i~G~~V~~s--n~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~ 309 (344) ..+|+|++++||+|++. ++++.. +....|.... .+..+.-.++.|++|+......+ .+..+ T Consensus 199 ~i~~~V~siDGv~Ii~VPs~r~~t~----~~f~~G~~~~------------t~ak~INfiiv~~~A~ia~~K~~-~~~if 261 (346) T protein:vir:10 199 NIQRTVYSLDDVTIRVVPSDLMQTA----YDFSDGSKII------------DTAKQIEMFLIYNGVQIAPEKYS-FVGFD 261 (346) T ss_pred ccceeeeeecCeEEEEcchhhcccc----hhhccCcccc------------CCccceeEEEECCceeeeeeeee-eeEee Confidence 46999999999999985 445421 1111111110 01233445788998775444332 23333 Q ss_pred ec-chhhh-hhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
310 RR-AEYQA-DQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 310 ~~-~~~~~-d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) -+ +...+ |.+..+.-+..=|+.-...++..--.+| T Consensus 262 ~P~~~~~g~~l~~~R~Y~D~fv~~nk~~~Iyv~~~~a 298 (346) T protein:vir:10 262 QPSAATSGNYLYYEQSYDDVLLLNTKTKGIQFVVSDK 298 (346) T ss_pred CCCCCcccceeeeeeeeeeeeeeccccceEEEeeecc Confidence 22 23333 4566666666666666555554333333 No 52 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=99.95 E-value=9.5e-31 Score=185.17 Aligned_cols=230 Identities=17% Similarity=0.133 Sum_probs=189.2 Q ss_pred eeecccccEEEEeecCcceeeeeeCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHH Q lcl|NC_015719. 51 QRQISSGKSAQFPVIGRTKAAYLQPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLA 130 (344) Q Consensus 51 ~~~i~~G~tv~i~~iG~~t~~~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa 130 (344) ..-+..|+||+||.. -..+.++..|+.++. +.++.++.+.+|.+. ...|.|+|.+..+...|++.+.+++++.+|| T Consensus 1 ~~~~~~Gdtit~P~~-iGda~~v~eG~~i~~--~~l~~t~~~atIk~~-gk~~~itD~a~l~~~gDp~~ea~~Q~~~~iA 76 (231) T protein:vir:73 1 ENGINLANLCEYPND-IGDAADVAEGGEISL--DKIGTTTKSVTIKKA-AKGTEITDEAALSGYGDPIGESNKQLGLSLA 76 (231) T ss_pred CccccCCceEEeccc-ccchhhhcCCCcCCh--hhccccceeeeEeee-ccceeeeHHHHhhccCchHHHHHHHHHHHHH Confidence 223567999999865 335578899999985 468899999999775 7899999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhhhcccccccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHH Q lcl|NC_015719. 131 MAADGAVLAELAGLINLADGVNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPD 210 (344) Q Consensus 131 ~~~D~~i~~~~~~~a~~~~~~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~ 210 (344) +++|..++..+.+.+ +.. ++. ..++.|.+|..+|.+.+. 
..+|++|+|+ T Consensus 77 ~kvD~di~~~~~~a~------------------l~~--~~~---------~t~d~i~~A~~~fgde~~--~~~vivv~p~ 125 (231) T protein:vir:73 77 NKVDDDLLKAAKTTS------------------QTV--STK---------ANVDGVQAALDIFNDEDA--QAYVLIVNPK 125 (231) T ss_pred HhhhHHHHHhhcccc------------------ccc--ccc---------ccHHHHHHHHHHhccccc--cceEEEEcch Confidence 999999986554211 000 111 127888999999998863 5689999999 Q ss_pred HHHHHhccchhhhh-ccccccccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEE Q lcl|NC_015719. 211 VYSAILAALMPNAA-NYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGL 289 (344) Q Consensus 211 ~~~~Ll~~~~~~~~-~~~~~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl 289 (344) .|+.|.++.++... +..++..+++|.||.+.|++|+.|+++|.+++.. +-+ T Consensus 126 ~~~~Lrk~~~~~~~~~~~g~~i~~~G~iG~i~G~~Vi~S~~~~~~~~~~----------------------------~~~ 177 (231) T protein:vir:73 126 DAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALM----------------------------FKI 177 (231) T ss_pred HHHhhhhccchhhhhhhhccceeeecccceEcceEEEEcCCCCCCceee----------------------------eeE Confidence 99999998887663 4567788999999999999999999998643211 113 Q ss_pred EecHHHHhhhhhheeeeeeeecchhhhhhhhhhhhhcCceeccccEEEEEecCC Q lcl|NC_015719. 
290 FQHRSAVGTVKLKDLALERARRAEYQADQIIAKYAMGHGGLRPESAGALVFKAG 343 (344) Q Consensus 290 ~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~ 343 (344) ++.+.|++.+..+++++|..||++.+.|.|.+.+.|+.++.+|+.++.++++.- T Consensus 178 i~~~gAl~~~~k~~~~vEtdRd~~~k~~~i~~~~~y~v~l~~~~~vv~~t~~g~ 231 (231) T protein:vir:73 178 VSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) T ss_pred EeeccceeeeecccceeeccccccccccEEEEeEEEEEEEEcCccEEEEEeecC Confidence 456788999999999999999999999999999999999999999999999998 No 53 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=99.94 E-value=5.7e-29 Score=175.43 Aligned_cols=262 Identities=15% Similarity=0.137 Sum_probs=205.1 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeee-c--ccccEEEEeecCcc-eeeeeeCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQ-I--SSGKSAQFPVIGRT-KAAYLQPG 76 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~-i--~~G~tv~i~~iG~~-t~~~~~~g 76 (344) ||...- + +-|..|+|+..|.+.+.+..++.++....+ + ..|++|+||..... .+.++..| T Consensus 1 Ma~T~~--------------~--d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~igdae~~~eg 64 (270) T protein:vir:95 1 MTQTKK--------------A--NLINPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYAYIGAAEDLQEG 64 (270) T ss_pred CCceeh--------------h--hhcchHHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeecCCCccccccCC Confidence 776332 1 236889999999999999999998887753 3 46999999987643 55678889 Q ss_pred CCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Q lcl|NC_015719. 77 ESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIA 156 (344) Q Consensus 77 ~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (344) +.++. +.++.++.+.+|-+. ...|.|+|++...+..|++.+.+++.+.+|++++|+.++..+.+... 
T Consensus 65 ~~i~~--~~lt~~~~~a~i~~~-gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~~a~~---------- 131 (270) T protein:vir:95 65 VAMDT--TQMSMTTTKVTVKET-GKAVEVTQTAIITNVNGTLQEASRQLAMSLADKVEIDYIAELNKSKQ---------- 131 (270) T ss_pred Cccch--hhcccchheeeeehh-hCcceecHHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHhccccc---------- Confidence 98874 478889999999664 68899999988888889999999999999999999999866543110 Q ss_pred cccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccce Q lcl|NC_015719. 157 GLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGS 236 (344) Q Consensus 157 ~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~ 236 (344) .. +.. ..++.|.+|..+|.+.. ....+++|+|..|+.|+++..+. ..-.+.+.+++|. T Consensus 132 --------~~--~~~---------~t~~~~~dA~~~lgd~~--~~~~~i~vhs~~~~~Lrk~~~~~-~~~~~~~~~~~G~ 189 (270) T protein:vir:95 132 --------TA--TVS---------ADATGILDAIEVFNSEN--DEDYVLYVNPKDYNKLVKSLFKV-GGNVQDRAISKGD 189 (270) T ss_pred --------cc--ccc---------cCHHHHHHHHHHhcccc--CCCcEEEEcHHHHHHHHhhhccc-ccccccchhcccc Confidence 00 000 01466778888886543 23478999999999999876443 2233456788999 Q ss_pred eEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchhhh Q lcl|NC_015719. 237 IRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEYQA 316 (344) Q Consensus 237 Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~~~ 316 (344) |+.+.|++|+++.+.|.. ..+.+|++.|++.+..+++.+|..||++.+. T Consensus 190 ig~~~G~~Viv~s~~~~~-------------------------------~~~~l~~~gAi~~~~~~~~~vEtdRd~~~~~ 238 (270) T protein:vir:95 190 LVEIVGVSDIVKSKRVSE-------------------------------NTAFLQRYGAMEIVNKKKPEAYTDFDILKRT 238 (270) T ss_pred cceecceeEEEeCCCCCc-------------------------------eeEEEEeccceeeeecCCceeeeccchhhcc Confidence 999999999887665421 1235778999999999999999999999999 Q ss_pred hhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
317 DQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 317 d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) |.+.+.+.||.++.+|+.++.++++.+- T Consensus 239 d~i~~~~~y~v~~~~~skvv~~t~~~a~ 266 (270) T protein:vir:95 239 HLLSTNYHYSVNLKDETGVVKVTFKPSG 266 (270) T ss_pred cEEEeeeEEEEEEEccceEEEEEecCCC Confidence 9999999999999999999999976544 No 54 >protein:vir:99523 Length: 311 # NCBI annotation: putative protein # Family: family:all:701 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958538;genbank:gi:41179320;genbank:GeneID:2717161 Probab=99.90 E-value=6.9e-26 Score=158.54 Aligned_cols=297 Identities=13% Similarity=0.099 Sum_probs=190.0 Q ss_pred cccccchhhh-hHHHHhhHHHHHHHHhhhhcCCceee-ec-ccccEEEEeecCcceeeeeeCCCCCCCCcCCcccceEEE Q lcl|NC_015719. 17 GQSAADKLAL-FLKVFGGEVLTAFARTSVTANRHMQR-QI-SSGKSAQFPVIGRTKAAYLQPGESLDDKRKDIKHTEKTI 93 (344) Q Consensus 17 ~~~~~d~~~l-~~e~f~geV~~~f~~~s~~~~~~~~~-~i-~~G~tv~i~~iG~~t~~~~~~g~~~~~~~~~~~~~~~~l 93 (344) -...++..|| |.++|+.++++.|..++++..+.+.. .+ .||++|+||++....+++|++++... ...++.+..++ T Consensus 1 ~~~~an~mAlnya~~~~~~Ld~~~~~~~~t~~l~~~~~~~~~Gak~VkIp~i~~~gl~dY~R~~g~~--~g~v~~~~et~ 78 (311) T protein:vir:99 1 MPTDAETRGFNYVTKDGNLLDQKITAGLFTAALGTPEVDLVNGGRSFTLKTISTSGLKDHTRGKGFN--SGTISDEKTIY 78 (311) T ss_pred CCCcchhhHHHHHHHHHHHHHHHHHhhhcccceecCchheeecCCEEEEEeeeeccccccccccCcc--ccceeeeeeEE Confidence 1223445555 78999999999999988776665542 34 48999999999999999999977543 46788899999 Q ss_pred EeeeeeeeceeccchHHHHhC--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccCceeeecccccc Q lcl|NC_015719. 94 NIDGLLTADVLIYDIEDAMNH--YDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGLGKPSLLEVGAKAD 171 (344) Q Consensus 94 ~iD~~~~~~~~Idd~D~~q~~--~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~~~~~~i~~~~~~~ 171 (344) ++++++++.|.||-+|..+++ ...-.-+.+.......-.+|.+-+..|+..+...+... . .+... .+.. 
T Consensus 79 tl~~DR~~~f~vD~mDvdETn~~~~~ani~~~f~r~~vvPEiDayrfskla~~a~~~~~~~---~---~~~~~---~~~~ 149 (311) T protein:vir:99 79 TMGQDRDVEFYLDRQDVDETDNELAMANISNVFITEHVQPELDSYRFSKIATSFDNLDGTD---T---EGTLL---AKTH 149 (311) T ss_pred EeeeccceeeecchhchhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhcccccc---c---chhhh---cccc Confidence 999999999999955444443 33333344444555667788888777765443322110 0 00000 0111 Q ss_pred cccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhc-ccc-ccccccceeEEEeCeEEEEe- Q lcl|NC_015719. 172 LTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAAN-YAA-LIDPERGSIRNVMGFEVVEV- 248 (344) Q Consensus 172 ~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~-~~~-~~~~~~G~Vg~i~G~~V~~s- 248 (344) .....-...++++.|..+..++++ ||.++|+++|+|++|.+|++++.|.+.- ... .....++.|+.++|++|+++ T Consensus 150 ~~~~~lt~~nvl~~l~~~~~~~~~--v~~~~rvl~vTp~~~~lLk~~~~~~r~~~~~~~~~~~i~~~V~~lDgv~Ii~V~ 227 (311) T protein:vir:99 150 KTEETLDETNAYSQLKTGIGKVRK--YGTQNLVGYVSSEVMDALERSKEFTRNITNQNVGTTALESRITSIDGVQLIEVY 227 (311) T ss_pred ccccccCHHHHHHHHHHHHHHHHh--cCCCCeEEEEChHHHHHHhhchhhheeeecccccccccccccceecCeEEEEec Confidence 111112246689999999999987 7989999999999999998888776422 111 12235888999999999976 Q ss_pred cc--ccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecc----hhhhhhhhhh Q lcl|NC_015719. 249 PH--LTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRA----EYQADQIIAK 322 (344) Q Consensus 249 n~--lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~----~~~~d~i~~~ 322 (344) |+ +... -.+ ..|..... +..++-.++.|++|+......+ .+. .++| +-.+|.+..+ T Consensus 228 ps~r~~t~--~~f--t~G~~~~~------------~ak~INfiiv~~~a~i~~~K~~-~v~-~f~P~~~~~gd~~l~~~R 289 (311) T protein:vir:99 228 ESNRFMTK--YDF--TDGAKPTE------------DAKAINFLVVAKPAVISIVKEN-AVF-LFAPGQHTDGDGYLYQNR 289 (311) T ss_pred Cchhhcch--hhh--cCCccccC------------cccccceEEeCCCeeeeeeeee-eee-eeCCCCCCCcceeeeeee Confidence 54 4321 111 11111000 1223445788988775443322 222 2333 2236777777 Q ss_pred hhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
323 YAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 323 ~~~G~~v~Rp~~~~~l~~~~~a 344 (344) .-+..=|+.-...++..--.+| T Consensus 290 ~Y~D~fv~~nk~~~Iyv~~k~A 311 (311) T protein:vir:99 290 LYHDLFIKKHKRDGIFVSVKKA 311 (311) T ss_pred eeeeeeeeccccCeEEEeeecC Confidence 7777777777777776666677 No 55 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=99.89 E-value=2e-25 Score=155.98 Aligned_cols=267 Identities=12% Similarity=0.045 Sum_probs=175.2 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCcee-----eecccccEEEEeecCc-ceeeeee Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQ-----RQISSGKSAQFPVIGR-TKAAYLQ 74 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~-----~~i~~G~tv~i~~iG~-~t~~~~~ 74 (344) ||. =+.++|+..+++.|...+++..+... ..+.+|++|+||++.. ..+++|+ T Consensus 1 Mai----------------------n~~~k~~~~ld~~~~~~~~~~~l~~~~n~~~~~~~gak~VkIp~ist~~gl~dY~ 58 (285) T protein:vir:79 1 MTV----------------------VLDSKDLARIDEEYKADSQVWSYLTGGNGVTQRFRGHNEVRINKLSGFVDATAYK 58 (285) T ss_pred Ccc----------------------hhhHHHHHHHHHHHHHhhhhhhhcccCCcceeEecCCCEEEEeeecccccccccc Confidence 332 14589999999999988777766543 3567899999999964 6799999 Q ss_pred CCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|NC_015719. 75 PGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQ-IGESLAMAADGAVLAELAGLINLADGVNE 153 (344) Q Consensus 75 ~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~-~~~aLa~~~D~~i~~~~~~~a~~~~~~~~ 153 (344) ++.... ...++.+..++++++++++.|.||.+|..++..=-.+.++.+ ......-.+|.+.+..|+..+. 
T Consensus 59 R~~g~~--~g~v~~~~et~tl~~DR~~~f~iD~mDvdEn~~~~~~ni~~ef~~~~vvPEiDayrfskla~~a~------- 129 (285) T protein:vir:79 59 RGQDNA--RKTISVGKETVKLTHEDWFGYDLDQFDMDENGAYTVENVVREHNKMITIPHRDKVAVQKLFDSAA------- 129 (285) T ss_pred cccCcc--ccccceeeeEEEeeccccceecccccchhhhhhhhHHHHHHHHHhhhhcchhhHHHHHHHHhhcc------- Confidence 976643 456788999999999999999999666655322123333333 3344455677776665553211 Q ss_pred ccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccc-- Q lcl|NC_015719. 154 NIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALID-- 231 (344) Q Consensus 154 ~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~-- 231 (344) .. .+ ... ...++|+.|.++.++|+|++|| ++||++|+|++|.+|++++.|.+.-..+... T Consensus 130 --------~~---~~-~~~-----T~~nv~~~i~~~~~~lde~~vp-~~rvl~vTp~~~~~Lk~s~~~~r~~~~~~~~~~ 191 (285) T protein:vir:79 130 --------KK---AT-DSI-----TKDNALDAYDTAEAYMFDNEVP-GGFVMFVSSAYYTALKQSAAVTRTFSTDGTMVI 191 (285) T ss_pred --------cc---cc-ccc-----CHHHHHHHHHHHHHHHHHcCCC-CceEEEEChHHHHHHHhhhhhheecccccceec Confidence 00 00 111 1356899999999999999999 6999999999999999999987653322221 Q ss_pred -cccceeEEEeC-eEEEEecc--ccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeee Q lcl|NC_015719. 232 -PERGSIRNVMG-FEVVEVPH--LTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALE 307 (344) Q Consensus 232 -~~~G~Vg~i~G-~~V~~sn~--lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e 307 (344) -.++.|+.++| ++|++.+. ++.... ..+.-.++.|++|+......+ .+. T Consensus 192 ~~i~~~V~~lDg~v~ii~Vps~r~kt~~~--------------------------~k~Infiiv~~~a~i~~~K~~-~~~ 244 (285) T protein:vir:79 192 NGIDRRVAQLDGGVPIVRVSSDRLKGLGI--------------------------TNHVNFILTPLSAIAPIVKYD-SVS 244 (285) T ss_pred cceeeeeccccceeEEEEcchhhccCcCc--------------------------chhccEEEecCceeccceeee-eeE Confidence 24678999999 89998654 432100 012344788998765544333 232 Q ss_pred eeecc-hhh--hhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
308 RARRA-EYQ--ADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 308 ~~~~~-~~~--~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) .+-++ ... +|.+..+.-+..=++.-...++ .+..+| T Consensus 245 ~f~P~~~~~~d~~~~~~R~Y~d~fv~~nk~~~I-y~~~~a 283 (285) T protein:vir:79 245 VIDPSTDRSGNRWTIKGLSYYDAIVLDNAKKGI-YVAATA 283 (285) T ss_pred eECCCCCCCcceeeeeeeeeeeeeehhhcccee-eeeecc Confidence 22222 233 4566666666666665555555 444455 No 56 >protein:vir:95451 Length: 313 # NCBI annotation: hypothetical protein ORF044 # Family: family:all:11728 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294637;genbank:gi:149408203;genbank:GeneID:5237018 Probab=99.89 E-value=1e-26 Score=163.06 Aligned_cols=300 Identities=17% Similarity=0.187 Sum_probs=205.8 Q ss_pred cccccchhhhhH-HHHhhHHHHHHHHhhhhcCCce-eeecccccEEEEeecCcceeeeeeCCCCCCCCcCCcccceEEEE Q lcl|NC_015719. 17 GQSAADKLALFL-KVFGGEVLTAFARTSVTANRHM-QRQISSGKSAQFPVIGRTKAAYLQPGESLDDKRKDIKHTEKTIN 94 (344) Q Consensus 17 ~~~~~d~~~l~~-e~f~geV~~~f~~~s~~~~~~~-~~~i~~G~tv~i~~iG~~t~~~~~~g~~~~~~~~~~~~~~~~l~ 94 (344) -+..++.+|+.. |+|+.+++.-.+++-+-..+.+ +.++-.|++.||+.+|.++++.....+++.. .++++.|.++. T Consensus 1 ~~~TSNT~A~I~SE~~s~~I~~~LH~~LL~~~~~R~V~DF~~G~~L~I~tiGs~~~~~~~E~~~~~~--~~i~TGEIt~~ 78 (313) T protein:vir:95 1 MQLTSNTRAFIESEQYSKFILLNLHDGLLPETFYRNVSDFGSGETLHIKTIGSVTLQEAEEDTPLIY--NPIETGEITFQ 78 (313) T ss_pred CcccccchheehhhhHHHHHHHHhhccccchhhhhhhccCCCCCEEEecccCceeeeccccCCCeee--cccccceEEEE Confidence 224444555444 9999999988877633333333 3355579999999999999988777776665 47999999999 Q ss_pred eeeeeeeceec-cchHHHHhChh-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc-ccccccCceeeecccccc Q lcl|NC_015719. 95 IDGLLTADVLI-YDIEDAMNHYD-VRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNE-NIAGLGKPSLLEVGAKAD 171 (344) Q Consensus 95 iD~~~~~~~~I-dd~D~~q~~~d-~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~-~~~~~~~~~~i~~~~~~~ 171 (344) |.+++-.+++| +|+.+.-..+| ++++...|.++|+.+.+...+|..=. +-++..+.. 
.+.|+ +..+ +++.++ T Consensus 79 i~~Y~G~A~~vt~~LR~D~~~I~~~~A~~~AE~~RAI~E~~~TD~L~~G~--~~FA~~~~P~~vNG~--PH~~-V~~~T~ 153 (313) T protein:vir:95 79 ITEYKGDAWYVTDDLREDGTDIDRLMAERAAESTRAIQETFETDFLKTGA--EYFAANPGPHNVNGF--PHVI-VSAETN 153 (313) T ss_pred EEeecCChhhhhhhhhhcchhHHHHhhhcchhhHHHHHHHHhhHHHhhch--hhhccCCCCcccccc--cceE-EeccCC Confidence 99988888888 45666666676 99999999999999999887764322 222222211 11121 1222 222222 Q ss_pred cccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhh-hcccc------ccccccceeEEEeCeE Q lcl|NC_015719. 172 LTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNA-ANYAA------LIDPERGSIRNVMGFE 244 (344) Q Consensus 172 ~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~-~~~~~------~~~~~~G~Vg~i~G~~ 244 (344) ....+..|..++-.|++.++|.+||+.||+|.....|-.-..+.+ ....+ .......+|.+++|++ T Consensus 154 -------~~~~~~~~~~~~~~~~~a~~P~~G~v~IvDP~~~~~L~~l~~It~~vt~~~k~I~ESG~A~~~~Fi~~~YG~D 226 (313) T protein:vir:95 154 -------GVFALKHLIAMRLAFDKANVPAEGRVFIVDPVAEATLNGLVTITHDVTDFGKMILESGMARGQRFIMNLYGWD 226 (313) T ss_pred -------ceehhhHHHHhhhhhhhccCCccceEEEEcchhhhhhhhhheeecccccccceeeeccCCchhHHHHHHhhhh Confidence 122356788899999999999999999999999888765433332 11111 1123344788999999 Q ss_pred EEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchhhhhhhhhhhh Q lcl|NC_015719. 
245 VVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEYQADQIIAKYA 324 (344) Q Consensus 245 V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~ 324 (344) ++.||.|...+-+ +++++.+-+-++..++- +-.+-..+..++..++++|.+++..+-.|.-...++ T Consensus 227 i~~SN~L~~AN~~--------D~~tT~~G~~~NlFM~i------~D~~~~P~~~AWr~MP~s~~~~~~~~~~~~~~~~~R 292 (313) T protein:vir:95 227 ILTSNRLHVANYN--------DGTTTGNGYVGNLFMCI------LDDQTKPIMGAWRRMPKSEGERNKDRARDEHVVRCR 292 (313) T ss_pred hhhhhhhhhcccc--------ccccccCceeeeeeeee------ecccccceeeeeccccccccccccccccccceeeee Confidence 9999999754432 12221111111111111 112334566788899999999999999999999999 Q ss_pred hcCceeccccEEEEEecCCC Q lcl|NC_015719. 325 MGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 325 ~G~~v~Rp~~~~~l~~~~~a 344 (344) ||.+++|-|.++++.+.++| T Consensus 293 ~G~Gi~R~~~L~~~~~~A~~ 312 (313) T protein:vir:95 293 YGFGIQRLDTLGLLATSATA 312 (313) T ss_pred ecccceeecceeEEEecccc Confidence 99999999999999999999 No 57 >protein:vir:78090 Length: 302 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468790;genbank:gi:157325371;genbank:GeneID:5601852 Probab=99.85 E-value=2.6e-23 Score=144.44 Aligned_cols=285 Identities=15% Similarity=0.121 Sum_probs=184.9 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCcee---eecccccEEEEeecC-----cceeee Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQ---RQISSGKSAQFPVIG-----RTKAAY 72 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~---~~i~~G~tv~i~~iG-----~~t~~~ 72 (344) |||. . =|.++|++++++.|..++++..+... 
-.+.+|++|+||+|- .+-+++ T Consensus 1 Mant-------------------l-~ya~~~~~~Ld~~~~~~~~t~~l~~~~~~v~~~Gak~vkIp~is~~~~~TsGl~d 60 (302) T protein:vir:78 1 MANS-------------------L-ALAQIYQDNIDKAIAVNSKSAFLEANPNNVQYNGGNTIKIADISFGSGTTGDLKA 60 (302) T ss_pred CCch-------------------h-HHHHHHHHHHHHHHHhhhceeecccCCceEEEecCcEEEEEEEEeeccccccccc Confidence 7652 1 17899999999999999987776432 247799999999995 556789 Q ss_pred eeCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChh--HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|NC_015719. 73 LQPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYD--VRSEYTSQIGESLAMAADGAVLAELAGLINLADG 150 (344) Q Consensus 73 ~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d--~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~ 150 (344) |++++... ...++.+..++++++++++.|.||-+|..+++.- .-.-+.+.......-.+|.+-+..|+..+.... T Consensus 61 y~R~~g~~--~g~v~~~~et~tlt~DR~~~f~vD~mDvdETn~~~~~ani~~ef~r~~vvPEiDayrfskla~~a~~~~- 137 (302) T protein:vir:78 61 YNRSTGFT--QGSVTLAWSDYTLDYDLAQSFQIDAMDVDETKNLATVGNVLSEYQRTKIVPAIDKYRFTKLANDGTGVG- 137 (302) T ss_pred cccccCcc--ccceeeeeeeEEeeeccceeeeccccchhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHHhhhccC- Confidence 99977543 3457888899999999999999996655555433 333333345566677788887766664322111 Q ss_pred cccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccc--c Q lcl|NC_015719. 151 VNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYA--A 228 (344) Q Consensus 151 ~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~--~ 228 (344) . ....+.+...+++++++|..+.+.|+++ ++|+++|+|+++.+|++++.+...-.. . 
T Consensus 138 -----------~------~~~~~~~~~t~~nvl~~i~~~~~~~~e~----~~~vl~vtp~~~~~Lk~a~~~~~~~~~~~~ 196 (302) T protein:vir:78 138 -----------G------VIDLSKPDASAQALMGDIATAMELVDDS----NQLILVTSPTTLAGLLNTALIRESKNTQVL 196 (302) T ss_pred -----------c------cccccccchhHHHHHHHHHHHHHHhhcc----CCeEEEEChHHHHHHhcchhhccceecccc Confidence 0 0111112223577899999999999996 589999999999999988777543211 1 Q ss_pred ccccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeee Q lcl|NC_015719. 229 LIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALER 308 (344) Q Consensus 229 ~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~ 308 (344) ...-.+++|+.++|++|+++|.-.....-.+.. |.. ......++-.++.|++|+......+ .+.. T Consensus 197 ~~~~i~~~V~~lDgv~Ii~VPs~r~~t~~~f~~--G~~------------~~~~ak~INfiiv~~~a~ia~~K~~-~~~i 261 (302) T protein:vir:78 197 RRGEVDTKITFIQDVEVLQVPSEYLYDKVAPKV--GVP------------DYTGAKKIPYMIFKRDAPTGIVKTD-KVRV 261 (302) T ss_pred ccccccceeeeecccEEEEchhhhcccceeccC--Ccc------------ccCCccceeEEEECCCeeeeeeeee-eeEe Confidence 122348899999999999977544332222111 111 1112344556888998775544333 2333 Q ss_pred e-ecchhhhh--hhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
309 A-RRAEYQAD--QIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 309 ~-~~~~~~~d--~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) + .++...+| .+..+.-+..=|+.....+++.-.-+| T Consensus 262 f~P~~~~~gd~~l~~~R~Y~D~fV~~nk~~gI~~~~~~~ 300 (302) T protein:vir:78 262 FEPDTNQSADAYKVDLRLYHDLIVPKNQRPGIIKASFGT 300 (302) T ss_pred eCCCCCCCcceeeeeeeeEeeeeeeccccCeEEEeeccc Confidence 3 33456654 666666666666666666666555555 No 58 >protein:vir:2106 Length: 430 # NCBI annotation: coat protein # Family: family:all:1412 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:NP_059630;genbank:gi:9635538;genbank:GeneID:1262831 Probab=99.75 E-value=7e-20 Score=125.61 Aligned_cols=300 Identities=15% Similarity=0.130 Sum_probs=187.8 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCce---eeec---ccccEEEEeecCcceeeeee Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHM---QRQI---SSGKSAQFPVIGRTKAAYLQ 74 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~---~~~i---~~G~tv~i~~iG~~t~~~~~ 74 (344) |||.- +. ++++=-.|+++.|....++..++. ..+. +.|+++++|.--.... . T Consensus 1 Ma~~~---------------~~----~lti~~~eal~~~~n~lV~a~~~~~~r~~d~~~~r~Gdti~ip~p~~~~~---~ 58 (430) T protein:vir:21 1 MALNE---------------GQ----IVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPT---Q 58 (430) T ss_pred Ccccc---------------ch----hhHHHHHHHHHHhhhhhhhhhhhhccCCchhhhhcccceEEeeccccccc---c Confidence 77731 11 223222899999999888887533 2222 5799999886544332 2 Q ss_pred CCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|NC_015719. 75 PGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNEN 154 (344) Q Consensus 75 ~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~ 154 (344) .|.++.+...++......++||+.+-..|.+.+ +| ....|....+.+.+..+||.++|..++..+......... 
T Consensus 59 ~G~~~t~~~~~~~e~~v~~~~~~~~~V~~~~~~-kE-l~~~~~~er~l~pAm~~LA~~Vd~dl~~~~~~~~~~v~~---- 132 (430) T protein:vir:21 59 EGWDLTDKATGLLELNVAVNMGEPDNDFFQLRA-DD-LRDETAYRRRIQSAARKLANNVELKVANMAAEMGSLVIT---- 132 (430) T ss_pred ccccccCCCccceeeeEeEEEeeeccceEEeeh-hH-hcChhhHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcccc---- Confidence 255555555566677889999999988888874 34 567778899999999999999999998765432211110 Q ss_pred cccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcC-CCEEEeCHHHHHHHhcc-chhhhhcccccccc Q lcl|NC_015719. 155 IAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPAN-DRTFYTTPDVYSAILAA-LMPNAANYAALIDP 232 (344) Q Consensus 155 ~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~-gR~~vv~P~~~~~Ll~~-~~~~~~~~~~~~~~ 232 (344) ... +.....++ .++++..+.+.|++..||.+ +|.++++|+.+..|... .++...+-.....+ T Consensus 133 ---~~~------~t~~~~~~-------~~~~~A~a~~~L~~~~vP~~~~R~~~~~p~~~~~l~~~l~~~~~~~~~~~~A~ 196 (430) T protein:vir:21 133 ---SPD------AIGTNTAD-------AWNFVADAEEIMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAY 196 (430) T ss_pred ---ccC------CCCCCCCc-------chhhHHHHHHHHHHhcCCCCCCcEEEeChHHHHHHhhhhccccccccchhHHH Confidence 000 11111111 25777889999999999995 79999999999988653 33444444455678 Q ss_pred ccceeEE-EeCeE-EEEeccccccccccc----ccccc-------------c-------------ccccc-----ccccc Q lcl|NC_015719. 233 ERGSIRN-VMGFE-VVEVPHLTAGGAGDD----RPEEG-------------T-------------DASNQ-----KHAFP 275 (344) Q Consensus 233 ~~G~Vg~-i~G~~-V~~sn~lp~~~~~~~----~~~~~-------------~-------------~~~~~-----~~~~~ 275 (344) ++|.|++ +.||+ +|+++++|....+.. +.+++ . .+++. .-.+. T Consensus 197 r~g~i~r~~~Gfd~~~~s~~~~~~t~gt~t~~tv~gA~~~~~~~~tv~~~g~~~~~d~~~~~it~s~tg~l~~GD~ftia 276 (430) T protein:vir:21 197 RDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGMKRGDKISFA 276 (430) T ss_pred hhcccccccchhhhhhhcCCcccccCccCcCceeccccccccccceeccccccccccccceeeeeecccceecccEEEec Confidence 9999986 99996 789999997321111 11110 0 00000 00000 Q ss_pred c----------------------------------c------------cc--ccc--------------ccceeEEEecH Q lcl|NC_015719. 
276 A----------------------------------T------------GG--KVN--------------KENVVGLFQHR 293 (344) Q Consensus 276 ~----------------------------------~------------~~--~~~--------------~~~~~gl~~~~ 293 (344) + - .. .++ ..-+.-|+||+ T Consensus 277 GV~~v~~itk~~~~~l~qf~V~a~~~~ttv~I~Pai~~~~~~~~~~~~~~y~nVsaspa~~aavT~v~~a~~~~Nl~fh~ 356 (430) T protein:vir:21 277 GVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWAD 356 (430) T ss_pred ceeeeccccccccCCcceEEEEEecCCceeEEeecccccccccccccccccceeccccccCceeEEeccCCcccceeEcc Confidence 0 0 00 000 00012389999 Q ss_pred HHHhhhhhhee---------------------eee--eeecchhhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 294 SAVGTVKLKDL---------------------ALE--RARRAEYQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 294 ~Av~~~~~~~~---------------------~~e--~~~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +|+..+....+ .+. ..+|.+...+.++--..||.+.+|||.++++...++| T Consensus 357 ~A~~La~~pl~~p~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~~~~~~r~DilyG~~~l~Pe~a~v~l~g~~~ 430 (430) T protein:vir:21 357 DAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred ceeEEEEecccCCCChhHhhheeeeeccccceEEEEEEccccccCceEEEEEeecCccccCcceEEEEcCCCCC Confidence 98865543321 111 2244444455666677899999999999999999999 No 59 >protein:vir:9265 Length: 430 # NCBI annotation: 5 # Family: family:all:1412 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720329;genbank:gi:24371587;genbank:GeneID:955820 Probab=99.74 E-value=1.1e-19 Score=124.45 Aligned_cols=300 Identities=14% Similarity=0.109 Sum_probs=188.7 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCcee---ee---cccccEEEEeecCcceeeeee Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQ---RQ---ISSGKSAQFPVIGRTKAAYLQ 74 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~---~~---i~~G~tv~i~~iG~~t~~~~~ 74 (344) |||.- .-.+++-..|+++.|....++...+.. .+ -+.|++|.+|.--..... 
T Consensus 1 MAn~l-------------------~~~~~ii~~eal~~l~n~~v~a~~~~~~r~~d~~~~r~Gdti~~p~~~~~~~~--- 58 (430) T protein:vir:92 1 MALNE-------------------GQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ--- 58 (430) T ss_pred Cccch-------------------hhHHHHHHHHHHHHHhhhhhhhhhhcccCCchhhhhcccceEEeccccccccc--- Confidence 77731 124456778889999988888865332 22 257999988877555432 Q ss_pred CCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|NC_015719. 75 PGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNEN 154 (344) Q Consensus 75 ~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~ 154 (344) .|.++.+...++......++||+.+--.|.+.+-| +...+....+.+.+..+||.++|..++..+........ T Consensus 59 ~G~~~t~~~~~i~e~~v~~~v~~~k~V~~~~~~ke--l~~~~~~~~~i~~Am~~LA~~Vd~dl~~~~~~~~~~v~----- 131 (430) T protein:vir:92 59 EGWDLTDKATGLLELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVI----- 131 (430) T ss_pred cCcccCCCCCccccceEEEEEeeeccceEEechhH--hcChhHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccc----- Confidence 26555555545656788999999999999998744 57777788888999999999999999865432111110 Q ss_pred cccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcC-CCEEEeCHHHHHHHhcc-chhhhhcccccccc Q lcl|NC_015719. 155 IAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPAN-DRTFYTTPDVYSAILAA-LMPNAANYAALIDP 232 (344) Q Consensus 155 ~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~-gR~~vv~P~~~~~Ll~~-~~~~~~~~~~~~~~ 232 (344) +... +.....+ ..+..+.++.+.|++..||.+ +|.++++|+.+..|... 
.++...+-.....+ T Consensus 132 --~~~~------~t~~~~~-------~~~~~~A~a~~~L~~~~vP~~~~R~~vldp~~~~~l~~~l~~l~~~~~~~~~A~ 196 (430) T protein:vir:92 132 --TSPD------AIGTNTA-------DAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAY 196 (430) T ss_pred --cccc------cCCCcCC-------cchhhHHHHHHHHHHhcCCCCCCcEEEeChHHHHHHHhhhccccccccchhHHH Confidence 0000 1111111 125677889999999999995 89999999999998643 22333333345668 Q ss_pred ccceeEE-EeCeE-EEEeccccccccccc----cccccc--------------------------ccccc---------c Q lcl|NC_015719. 233 ERGSIRN-VMGFE-VVEVPHLTAGGAGDD----RPEEGT--------------------------DASNQ---------K 271 (344) Q Consensus 233 ~~G~Vg~-i~G~~-V~~sn~lp~~~~~~~----~~~~~~--------------------------~~~~~---------~ 271 (344) ++|.|++ +.||+ +|+++++|....+.. +.+++. .++++ + T Consensus 197 r~g~i~~~~~Gfd~~~~~~~~~~~t~g~~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftia 276 (430) T protein:vir:92 197 RDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFT 276 (430) T ss_pred hhccccccchhhhhhhhcCCcccccCccCcCceeccccccccccceecccccccccccccceeeeecccceecccEEEec Confidence 9999996 89995 789999987322111 111100 00000 0 Q ss_pred c----------------ccccccc--------------------------------------ccc----ccceeEEEecH Q lcl|NC_015719. 272 H----------------AFPATGG--------------------------------------KVN----KENVVGLFQHR 293 (344) Q Consensus 272 ~----------------~~~~~~~--------------------------------------~~~----~~~~~gl~~~~ 293 (344) + .|..... -++ ..-+.-|+||+ T Consensus 277 GV~~v~~~tkq~~~~l~~F~Vt~~~~atsv~I~paii~~~~~~~~~~~~~y~nVsaspa~~aavTvv~~a~~~~Nl~fhr 356 (430) T protein:vir:92 277 GVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWAD 356 (430) T ss_pred ceeeeccccccccCCccEEEEEEecCCceeEEeccccccccccccccccccceeccccccCceeEEeccCCcccceeEcc Confidence 0 0000000 000 00013489999 Q ss_pred HHHhhhhhhe-----------------------eeeeeeecchhhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
294 SAVGTVKLKD-----------------------LALERARRAEYQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 294 ~Av~~~~~~~-----------------------~~~e~~~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +|+..+.... +++-.++|.+..-..++--..||.+.+|||.++++...++| T Consensus 357 ~A~aLa~~pL~~~~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~~~~~~r~DvLyG~~~v~Pe~a~v~l~g~~~ 430 (430) T protein:vir:92 357 DAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred cceEEEEecccCCCCHHHhhhhheeccccceEEEEEEEecccccCceEEEEeeeccceecCcceEEEEcCCCCC Confidence 9876554332 11112344444445556667799999999999999999999 No 60 >protein:vir:100939 Length: 430 # NCBI annotation: Gp5 # Family: family:all:1412 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006408;genbank:gi:46358700;genbank:GeneID:2777089 Probab=99.74 E-value=1.1e-19 Score=124.45 Aligned_cols=300 Identities=14% Similarity=0.109 Sum_probs=188.7 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCcee---ee---cccccEEEEeecCcceeeeee Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQ---RQ---ISSGKSAQFPVIGRTKAAYLQ 74 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~---~~---i~~G~tv~i~~iG~~t~~~~~ 74 (344) |||.- .-.+++-..|+++.|....++...+.. .+ -+.|++|.+|.--..... T Consensus 1 MAn~l-------------------~~~~~ii~~eal~~l~n~~v~a~~~~~~r~~d~~~~r~Gdti~~p~~~~~~~~--- 58 (430) T protein:vir:10 1 MALNE-------------------GQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ--- 58 (430) T ss_pred Cccch-------------------hhHHHHHHHHHHHHHhhhhhhhhhhcccCCchhhhhcccceEEeccccccccc--- Confidence 77731 124456778889999988888865332 22 257999988877555432 Q ss_pred CCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|NC_015719. 
75 PGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNEN 154 (344) Q Consensus 75 ~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~ 154 (344) .|.++.+...++......++||+.+--.|.+.+-| +...+....+.+.+..+||.++|..++..+........ T Consensus 59 ~G~~~t~~~~~i~e~~v~~~v~~~k~V~~~~~~ke--l~~~~~~~~~i~~Am~~LA~~Vd~dl~~~~~~~~~~v~----- 131 (430) T protein:vir:10 59 EGWDLTDKATGLLELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVI----- 131 (430) T ss_pred cCcccCCCCCccccceEEEEEeeeccceEEechhH--hcChhHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccc----- Confidence 26555555545656788999999999999998744 57777788888999999999999999865432111110 Q ss_pred cccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcC-CCEEEeCHHHHHHHhcc-chhhhhcccccccc Q lcl|NC_015719. 155 IAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPAN-DRTFYTTPDVYSAILAA-LMPNAANYAALIDP 232 (344) Q Consensus 155 ~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~-gR~~vv~P~~~~~Ll~~-~~~~~~~~~~~~~~ 232 (344) +... +.....+ ..+..+.++.+.|++..||.+ +|.++++|+.+..|... .++...+-.....+ T Consensus 132 --~~~~------~t~~~~~-------~~~~~~A~a~~~L~~~~vP~~~~R~~vldp~~~~~l~~~l~~l~~~~~~~~~A~ 196 (430) T protein:vir:10 132 --TSPD------AIGTNTA-------DAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAY 196 (430) T ss_pred --cccc------cCCCcCC-------cchhhHHHHHHHHHHhcCCCCCCcEEEeChHHHHHHHhhhccccccccchhHHH Confidence 0000 1111111 125677889999999999995 89999999999998643 22333333345668 Q ss_pred ccceeEE-EeCeE-EEEeccccccccccc----cccccc--------------------------ccccc---------c Q lcl|NC_015719. 233 ERGSIRN-VMGFE-VVEVPHLTAGGAGDD----RPEEGT--------------------------DASNQ---------K 271 (344) Q Consensus 233 ~~G~Vg~-i~G~~-V~~sn~lp~~~~~~~----~~~~~~--------------------------~~~~~---------~ 271 (344) ++|.|++ +.||+ +|+++++|....+.. +.+++. 
.++++ + T Consensus 197 r~g~i~~~~~Gfd~~~~~~~~~~~t~g~~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftia 276 (430) T protein:vir:10 197 RDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFT 276 (430) T ss_pred hhccccccchhhhhhhhcCCcccccCccCcCceeccccccccccceecccccccccccccceeeeecccceecccEEEec Confidence 9999996 89995 789999987322111 111100 00000 0 Q ss_pred c----------------ccccccc--------------------------------------ccc----ccceeEEEecH Q lcl|NC_015719. 272 H----------------AFPATGG--------------------------------------KVN----KENVVGLFQHR 293 (344) Q Consensus 272 ~----------------~~~~~~~--------------------------------------~~~----~~~~~gl~~~~ 293 (344) + .|..... -++ ..-+.-|+||+ T Consensus 277 GV~~v~~~tkq~~~~l~~F~Vt~~~~atsv~I~paii~~~~~~~~~~~~~y~nVsaspa~~aavTvv~~a~~~~Nl~fhr 356 (430) T protein:vir:10 277 GVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWAD 356 (430) T ss_pred ceeeeccccccccCCccEEEEEEecCCceeEEeccccccccccccccccccceeccccccCceeEEeccCCcccceeEcc Confidence 0 0000000 000 00013489999 Q ss_pred HHHhhhhhhe-----------------------eeeeeeecchhhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 294 SAVGTVKLKD-----------------------LALERARRAEYQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 294 ~Av~~~~~~~-----------------------~~~e~~~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +|+..+.... 
+++-.++|.+..-..++--..||.+.+|||.++++...++| T Consensus 357 ~A~aLa~~pL~~~~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~~~~~~r~DvLyG~~~v~Pe~a~v~l~g~~~ 430 (430) T protein:vir:10 357 DAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred cceEEEEecccCCCCHHHhhhhheeccccceEEEEEEEecccccCceEEEEeeeccceecCcceEEEEcCCCCC Confidence 9876554332 11112344444445556667799999999999999999999 No 61 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=99.64 E-value=2.4e-17 Score=111.70 Aligned_cols=308 Identities=11% Similarity=0.060 Sum_probs=174.8 Q ss_pred CCCcccccccc-ccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecC---------ccee Q lcl|NC_015719. 1 MANMQGGQQLG-TNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIG---------RTKA 70 (344) Q Consensus 1 ma~~~~~~~~~-~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG---------~~t~ 70 (344) ||+++.-.-.. ....++...+..-+|..+.|..++.+..++.|.++.+.+...+. +..++||+.. ..++ T Consensus 1 ~~~~~e~~~~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~-~~~~~ip~~~~~~~a~~v~~~~~ 79 (338) T protein:vir:78 1 MATLNELAPNTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENIPIS-YGETIIPTTVKRPEVGQVGVGTS 79 (338) T ss_pred CcchHHhhhhhcccccccceecccccccchHHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEecCccceeeccccc Confidence 87777431111 12223334444455899999999999999999999999888765 5567787752 2333 Q ss_pred eeeeCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|NC_015719. 71 AYLQPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADG 150 (344) Q Consensus 71 ~~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~ 150 (344) .....|..++.. .++..++++..-+. +.-..|.+-=-.++.+|+.+.+.++.++++++..|+.++..-- . . 
T Consensus 80 ~~~~Eg~~~~~~--~~~f~~v~l~~~k~-~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~l~G~g----~-~- 150 (338) T protein:vir:78 80 NEQREGGTKPLS--GTAWDTRSVAPIKL-ATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAVFHGKS----P-L- 150 (338) T ss_pred cccccccccccc--ccceeEEEEEEEEE-EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccC----C-C- Confidence 344445555432 34556666655333 2334454411224568999999999999999999999873111 0 0 Q ss_pred cccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhh--cccc Q lcl|NC_015719. 151 VNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAA--NYAA 228 (344) Q Consensus 151 ~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~--~~~~ 228 (344) ....+.+........ ..+............++.|.++...+.. +.......++++|..+..|++.....+. .+.- T Consensus 151 ~~~~~~gi~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~m~~~~~~~L~~~~~l~d~~g~~l~ 227 (338) T protein:vir:78 151 TGSALQGIDTNNVIV--NTTNVDYLQTGTTPLLDRFLDGYDLVSA-NTDVDFNGWAADPRYRARLLRSQAYRDANGNVDP 227 (338) T ss_pred ccccccccccccccc--cccccccccccchhhHHHHHHHHHHhhh-hccccceEEEEchHHHHHHHHHhhhccCCCceee Confidence 000111111111110 1111111122234457778877666643 3333445788999999998765444332 2332 Q ss_pred ccccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeee Q lcl|NC_015719. 229 LIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALER 308 (344) Q Consensus 229 ~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~ 308 (344) ......|..++++|.+|+.++++|....... ......+-+++ +-...+..+.++++. 
T Consensus 228 ~~~~~~~~~~~l~G~PV~~~~~ip~~~~~~~-----------~~~~~~~~gdf------------s~~~~~~~~~~~i~~ 284 (338) T protein:vir:78 228 TRINLAASAGDLLGLPVQFGKAVGGDLGAAT-----------DSKVRVVGGDF------------SQLKYGFADEIRVKM 284 (338) T ss_pred cccccCCCCceeeeeeEEEccccCccccccC-----------CcccEEEEEec------------ceEEEEeecccEEEE Confidence 3345567778999999999999986432111 00011122222 111122334455555 Q ss_pred eecch--------------hhh--hhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 309 ARRAE--------------YQA--DQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 309 ~~~~~--------------~~~--d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) .++.. ++. ..++..+++|.+++||++.+.|+-..-+ T Consensus 285 ~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~ 336 (338) T protein:vir:78 285 SDTATLTDNTSPTPQTVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVDDEDP 336 (338) T ss_pred eecccccccccccccchhhhhcCcEEEEEEEEeccEeecccceEEEecccCC Confidence 54321 112 2357788999999999998887764444 No 62 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=99.62 E-value=7.8e-17 Score=108.88 Aligned_cols=282 Identities=14% Similarity=0.108 Sum_probs=177.1 Q ss_pred ccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcceeeeeeCCCCCCCCcCCcccc Q lcl|NC_015719. 10 LGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRTKAAYLQPGESLDDKRKDIKHT 89 (344) Q Consensus 10 ~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~~~~~~~~~~~~ 89 (344) +|.++-....+++.-.+..++++.++.+..++.++++.+.+...+. 
+.+.+++....+.+..+..|+.++.+ .++.+ T Consensus 1 ~g~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~~~~~~~~~a~~v~E~~~~~~~--~~~f~ 77 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKAVPMT-KPEEEFTFMSGVGAFWVDEAERIQTS--KPTFT 77 (299) T ss_pred CCcCCCcccccCCCceecchhHHHHHHHHHHhcchhhhhceeeecC-CCcEEEEEEcCCceeeeecCcccccc--cccee Confidence 3333333333333334678999999999999999999999887764 56688898887888888888888754 45677 Q ss_pred eEEEEeeeeeeeceeccchHHHH-hChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccCceeeeccc Q lcl|NC_015719. 90 EKTINIDGLLTADVLIYDIEDAM-NHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGLGKPSLLEVGA 168 (344) Q Consensus 90 ~~~l~iD~~~~~~~~Idd~D~~q-~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~~~~~~i~~~~ 168 (344) ++++...+. +..+.|.+ +-.+ +..|+.+.+.++.++++++..|+.++.-- .. ..+.|.-. .... T Consensus 78 ~v~l~~~k~-~~~~~is~-ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~----g~-----~~~~gil~----~~~~ 142 (299) T protein:vir:41 78 KAKMRSKKM-GVIIPTTK-ENLNYSVTNFFSLMQAEIVEAFYKKFDQAVFTGV----ES-----PYNWNILK----SATD 142 (299) T ss_pred EEEEeeEEE-EEeehhhH-HHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhhcc----cC-----cccccccc----cccc Confidence 777776554 33355655 3333 46889999999999999999999887311 10 01111100 0000 Q ss_pred ccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccceeEEEeCeEEEEe Q lcl|NC_015719. 169 KADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSIRNVMGFEVVEV 248 (344) Q Consensus 169 ~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg~i~G~~V~~s 248 (344) +.. ........++.|.++...|...+.+. 
-.++++|..|..|.+-..- +..+.-......| .++++|.+|+.+ T Consensus 143 ~~~---~~~~~~~~~~~l~~~~~~l~~~~~~~--~~~v~n~~~~~~L~~lkd~-~G~~l~~~~~~~~-~~~l~G~PV~~~ 215 (299) T protein:vir:41 143 ASN---LVEETANKYDDLNEAIGLIEAEDLEP--NGIATIRKQRVKYRSTKDG-NGMPIFNTATSNG-VDDVLGLPIAYT 215 (299) T ss_pred cce---eeccccccHHHHHHHHHhhhcccCCc--CEEEEcHHHHHHHHHhhcc-CCceeecCCcCCC-CceecceeeEEe Confidence 000 00111123678888888888888753 3578999999998863221 2222222222333 368999999999 Q ss_pred ccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecch--------------h Q lcl|NC_015719. 249 PHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAE--------------Y 314 (344) Q Consensus 249 n~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~--------------~ 314 (344) +++|.+..... . ++...+-+..+..+++++|..++.. + T Consensus 216 ~~~~~~~~~~~----------------~------------~~gdfs~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~ 267 (299) T protein:vir:41 216 PKYTFGDKDIS----------------E------------LVGDWNQAYYGILRGVEYEILTEATLTTVADETGKPLNLA 267 (299) T ss_pred cccCCCCCceE----------------E------------EEEecccEEEEEecCcEEEEeecccccccccccccchhhh Confidence 99985421110 1 1111111122334556666666542 2 Q ss_pred hhhh--hhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 315 QADQ--IIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 315 ~~d~--i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +.|. 
++...++|.++++|++.+.|+.++.- T Consensus 268 ~~~~~~~r~~~~~d~~v~~~~A~~~l~~~aa~ 299 (299) T protein:vir:41 268 ERDMAAIKATFEVGFMVVKDEAFSAVQPKAGN 299 (299) T ss_pred hcCcEEEEEEEEeccEEecccceEEEEeccCC Confidence 3333 46677899999999999999887766 No 63 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=99.60 E-value=1.5e-16 Score=107.34 Aligned_cols=306 Identities=15% Similarity=0.141 Sum_probs=170.4 Q ss_pred CCCcccc--ccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecC-cceeeeeeCCC Q lcl|NC_015719. 1 MANMQGG--QQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIG-RTKAAYLQPGE 77 (344) Q Consensus 1 ma~~~~~--~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG-~~t~~~~~~g~ 77 (344) ||.++.- +..++. .++...+..-++..+++.+++.+..++.+.++.+.+..++.+ ...+||+.. .+++.....|. T Consensus 1 ~a~l~el~~~~~~~~-~~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~-~~~~~p~~~~~~~a~~v~eg~ 78 (333) T protein:vir:78 1 MATLNELLPNSAGSN-HQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIPISY-GETIIPTTVKRPEVGQVGVGT 78 (333) T ss_pred CchhHHhhhhccccc-ccCceecCCccccchhHHHHHHHHHHhhchhhhhcceeeccC-CceEEEEEeCCceeEeecCcc Confidence 7777643 122222 222233333347899999999999999999999998888764 446677763 34444333332 Q ss_pred CCCC------CcCCcccceEEEEeeeeeeec-eeccchHHH-HhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|NC_015719. 78 SLDD------KRKDIKHTEKTINIDGLLTAD-VLIYDIEDA-MNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLAD 149 (344) Q Consensus 78 ~~~~------~~~~~~~~~~~l~iD~~~~~~-~~Idd~D~~-q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~ 149 (344) .... +....+..+.++ ...+... ..|.+ +-. ++..|+.+.+.++.++++++..|+.++..- .... 
T Consensus 79 ~~~~~e~~~~~~~~~~f~~i~l--~~~kl~~~~~is~-ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~G~----g~~~ 151 (333) T protein:vir:78 79 SNEQREGGLKPLSGTAWDTRSV--SPIKLATIVTVSE-EFARMNPSGLYTKLQGDLAYAIGRGIDLAVFHGK----SPLT 151 (333) T ss_pred cccccccccccccccceeEEEE--eeEEEEEeehhhH-HHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhccc----CCCC Confidence 2111 001223344344 4444443 33444 222 467889999999999999999999987311 1100 Q ss_pred ccccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhh--hccc Q lcl|NC_015719. 150 GVNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNA--ANYA 227 (344) Q Consensus 150 ~~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~--~~~~ 227 (344) + ..+.+......+. ..+........+...++.|+++...+..+.- ...-.++++|..|..|++.....+ ..+. T Consensus 152 ~--~~~~g~~~~~~~~--~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~-~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i 226 (333) T protein:vir:78 152 G--SALQGIDTDNVIA--NTTNVDYLQETGDPLLDRLLDGYDLVSANTD-VEFNGWAVDPRFRAHLLRAQAYRDANGNVD 226 (333) T ss_pred C--ccccccccccccc--ccccccccccccchhHHHHHHHHHhhccccc-cCceEEEEcchHHHHHHHHhhhcCCCCcee Confidence 0 0111111111111 1111111122233457888888777665432 233467889999999987554433 2333 Q ss_pred cccccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeee Q lcl|NC_015719. 228 ALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALE 307 (344) Q Consensus 228 ~~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e 307 (344) -......|..++++|++|+.|+++|....... 
......+.+++ +-+..+..+.++++ T Consensus 227 ~~~~~~~~~~~~l~G~Pv~~~~~i~~~~~~~~-----------~~~~~~~~gD~------------~~~~~g~~~~~~i~ 283 (333) T protein:vir:78 227 PSRINLAAQTGDVLGLPAQFGRAVGGDLGAAV-----------DSKTRIIGGDF------------SQLKFGFADEIRIK 283 (333) T ss_pred ecCccccCCCceeeceeeEEccccCCCccccC-----------CCccEEEEEec------------ccEEEEEeeccEEE Confidence 33445567778999999999999996542221 11111222222 11222333445555 Q ss_pred eeecch-----------hhhh--hhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 308 RARRAE-----------YQAD--QIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 308 ~~~~~~-----------~~~d--~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ..++.. ++.| .++..++++.++++|++.+.|+ .++| T Consensus 284 ~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~-~~~a 332 (333) T protein:vir:78 284 MSDTATLTDSGSATVSMWQTNQIAILIEVTFGWLLGDKQAFVKFV-DDEQ 332 (333) T ss_pred EeccccccccccceeehhhcCcEEEEEEEEEccEEecccceEEEe-ccCC Confidence 554321 1222 3677889999999999999874 4444 No 64 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=99.58 E-value=6.5e-17 Score=109.32 Aligned_cols=291 Identities=12% Similarity=0.053 Sum_probs=167.3 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec-CcceeeeeeCCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI-GRTKAAYLQPGESL 79 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (344) +....-+.........+..+++..-+-.+++...+.+..+..++++.+.++....++..+.||+. 
|...+.....++.+ T Consensus 97 ~~~~~r~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~p~~~~~~~a~wv~E~~~~ 176 (390) T protein:vir:62 97 NLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGATTFTTSDANPLDFTVITGRSSASIVGETAEI 176 (390) T ss_pred hhhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCcceeeecccccc Confidence 00000000000000001111222212335566666666677788888888877777778889877 44566667778877 Q ss_pred CCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccc Q lcl|NC_015719. 80 DDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGLG 159 (344) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~~ 159 (344) +.+ .++..++++.+-+.. .-..|.+-=-.++.+|+.+.+.++.++++++..|+.++. +. + .|-|.. T Consensus 177 ~~~--~~~f~~i~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~----G~---G----~p~Gi~ 242 (390) T protein:vir:62 177 PES--YPATAQRSMGGFKYG-FASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFIT----GT---G----QPRGIL 242 (390) T ss_pred ccc--ccceeeeEeeeeeEE-eehHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhhhc----cC---C----cccccc Confidence 664 456777777765543 334455422224677999999999999999999998862 11 1 111111 Q ss_pred CceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccceeEE Q lcl|NC_015719. 160 KPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSIRN 239 (344) Q Consensus 160 ~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg~ 239 (344) ........... ........++.|+++...|+..... +-.+|++|..|..|.+-.. .+..|.-...+..|.... 
T Consensus 243 ~~~~~~~~~~~----~~~~~~~~~~~l~~~~~~l~~~~~~--~a~~vmn~~~~~~L~~lkd-~~g~~l~~~~~~~g~~~~ 315 (390) T protein:vir:62 243 TDASPATATFL----ATDTDSKVSDALIDLFHEVPSAYRA--NAKYVVNDLRAAQMRKLKD-ANGQYLWQSGLTVGAPSL 315 (390) T ss_pred cccccccccee----cccccccchHHHHHHHHhhhhhhhc--CCEEEEchHHHHHHHHhhc-cCCCeeecCCcCCCccce Confidence 10000000000 0011112367778888788766542 3467889999998854211 122333223345676678 Q ss_pred EeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchhhhhh- Q lcl|NC_015719. 240 VMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEYQADQ- 318 (344) Q Consensus 240 i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~~~d~- 318 (344) ++|++|+.++++|...+ .-+++.. .++ +...+++++...+..+..|. T Consensus 316 l~G~Pv~~~~~~p~~~i--------------------~~gd~s~----~~i--------~~~~~~~v~~~~~~~~~~~~~ 363 (390) T protein:vir:62 316 FNGKVVETDDGMPADKI--------------------LFADLSK----YRV--------RFAGSLRVDRSVDAKFSTDQI 363 (390) T ss_pred ecccceEEecCCCCccE--------------------EEeeccc----eeE--------EeecceEEEeeccccccCCcE Confidence 99999999999985321 0111111 111 22334566666555444444 Q ss_pred -hhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 319 -IIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 319 -i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +++.+++|+++++|++..+|++++.| T Consensus 364 ~~~~~~r~d~~~~~~~A~~~l~~~~~a 390 (390) T protein:vir:62 364 VYRFLQRADGLLVDARGAKVLTVTPGA 390 (390) T ss_pred EEEEEEEeCcEeechhheEEEEeecCC Confidence 57889999999999999999999999 No 65 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=99.56 E-value=2.2e-16 Score=106.40 Aligned_cols=293 Identities=12% Similarity=0.044 Sum_probs=167.4 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec-CcceeeeeeCCCCC Q lcl|NC_015719. 
1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI-GRTKAAYLQPGESL 79 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (344) +..-........+...+-.+++.-.+-.+++...+.+...+.++++.+.+.....++..+.||+. |.+++..+..|+.+ T Consensus 97 ~~~~~~~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~ 176 (392) T protein:vir:13 97 NLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGASTFTTSDANPMDFTVITGRATAGIVGETAEI 176 (392) T ss_pred chhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCcceeeecccccc Confidence 00000000000000001111222223345677778888888889988888877777777888776 44566667778777 Q ss_pred CCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccc Q lcl|NC_015719. 80 DDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGLG 159 (344) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~~ 159 (344) +.+ .++.+++++.+-+.. .-..|.+-=-.++.+|+.+.+.++.++++++..|+.++.- .. ...|.|.- T Consensus 177 ~~~--~~~f~~v~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G----~G-----t~~p~Gil 244 (392) T protein:vir:13 177 PES--YPATTQRSMGGFKYG-FASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFLTG----TG-----TGQPRGIL 244 (392) T ss_pred ccc--ccceeeEEeeeeeEE-eeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcc----cC-----Cccccccc Confidence 654 356677677664432 3344554222246778999999999999999999988631 10 01122211 Q ss_pred CceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccceeEE Q lcl|NC_015719. 160 KPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSIRN 239 (344) Q Consensus 160 ~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg~ 239 (344) ..........+ ........|+.|+++...|...... +-.+|++|..+..|..-.. 
.+..|.-...+..|.-.+ T Consensus 245 ~~~~~~~~~~~----~~~~~~~~~d~l~~~~~~l~~~~~~--~a~~v~n~~~~~~l~~lkd-~~G~~l~~~~~~~g~~~~ 317 (392) T protein:vir:13 245 TDATGANAAFG----EADADSKVSDALIDLFHEVPSAYRK--NAKFVVNDLRAAQMRKLKD-ANGQYLWQSALTVGAPDT 317 (392) T ss_pred ccccccccccc----ccccccccHHHHHHHHHhhhhhhhc--CCEEEEcHHHHHHHHHhhc-cCCceeecCCcCCCCCce Confidence 11110000000 0111122367777777777655432 2345779999998764221 122232222345566678 Q ss_pred EeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchhhh--h Q lcl|NC_015719. 240 VMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEYQA--D 317 (344) Q Consensus 240 i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~~~--d 317 (344) ++|.+|+.++++|.+.+ .-|++. . ...+....++++.+.++.+.. . T Consensus 318 l~G~Pv~~~~~~~~~~i--------------------~~Gdf~--~----------~~i~~~~~~~i~~~~~~~~~~~~~ 365 (392) T protein:vir:13 318 FNGKVVETDDGMPADKV--------------------LFADLS--K----------YRVRFAGSLRVDRSVDAKFSTDQI 365 (392) T ss_pred ecceeeEEcCCCCCCcE--------------------EEeecc--c----------eeEEeecceEEEeeccccccCCcE Confidence 99999999999985321 111111 1 111223445666666654443 3 Q ss_pred hhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 318 QIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 318 ~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) .+++..++|+++++|++.+++++++.| T Consensus 366 ~~r~~~r~d~~~~~~~A~~~~~~~~aa 392 (392) T protein:vir:13 366 VYRFLQRADGLLVDARGAKVLTVTPAA 392 (392) T ss_pred EEEEEEEeccEEecccceEEEEeeccC Confidence 568899999999999999999999999 No 66 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=99.54 E-value=1.1e-15 Score=102.59 Aligned_cols=298 Identities=12% Similarity=0.039 Sum_probs=168.9 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec-CcceeeeeeCCCCC Q lcl|NC_015719. 
1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI-GRTKAAYLQPGESL 79 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (344) ||...--+...+-+ ++.-.+.++.+..++.+..++.+.++++.+.....+ ..++||+. +.+.+..+..|+.+ T Consensus 1 m~~~~~~a~~~~~t------~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~-~~~~~p~~~~~~~a~~v~Eg~~~ 73 (330) T protein:vir:77 1 MAGSTVPSTQVALT------GDFSAFLTPEQSQDYFAEIEKTSIVQRIARKVPMGP-TGISIPHWTGAVSASWTGEAERK 73 (330) T ss_pred Ccccccchhhcccc------CCCcceechhHHHHHHHHHHhccchhhhcceeeccC-CceEEEEEcCCcceeEecCCCcc Confidence 87754221111111 122224556778889999999999999998877654 44778876 56677777778887 Q ss_pred CCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccc Q lcl|NC_015719. 80 DDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGLG 159 (344) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~~ 159 (344) +.. .++..++++.+-+. +.-..|.+-=-.++.+|+.+.+.++.++++++..|+.++. +.....++........ T Consensus 74 ~~~--~~~f~~i~~~~~k~-~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~----G~g~~~~~~g~~~~~~ 146 (330) T protein:vir:77 74 PIT--KGSFGKQELEPVKI-TTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIH----GIDKPSAFKGYLAETT 146 (330) T ss_pred ccc--cceeeEEEEeEEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhc----ccCCCCcccccccccc Confidence 754 35666666665433 2334454411223568999999999999999999998862 1111111100000000 Q ss_pred CceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccc-----ccccc Q lcl|NC_015719. 160 KPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAAL-----IDPER 234 (344) Q Consensus 160 ~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~-----~~~~~ 234 (344) ..... ..............+++.|.++...+..++.+. ..++++|..|..|.+-..- +..+.-. ..... 
T Consensus 147 ~~~~~---~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~--~~~vmn~~~~~~l~~lkd~-~G~~l~~~~~~~~~~~~ 220 (330) T protein:vir:77 147 KVVSL---ADTNLTTASGPQGNAYLAVNNALSLLVNSGKKW--TGTLLDNVTEPILNTAVDG-NGRPLFVESTYTEQVGA 220 (330) T ss_pred cccee---ecccccccccccchhHHHHHHHHHhhhhcCCCc--cEEEEcHHHHHHHHHHhcc-CCceeecCccccccccc Confidence 00000 001111111223345788888888888887643 3578999999988753221 1112111 11222 Q ss_pred ceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecch- Q lcl|NC_015719. 235 GSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAE- 313 (344) Q Consensus 235 G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~- 313 (344) ..-++++|++|+.++++|....... ...+-+ ..+.+..+..+.++++..++.. T Consensus 221 ~~~~~l~G~PV~~~~~~p~~~~~~~--------------~~~~~g------------d~s~~~i~~~~~~~i~~~~e~~~ 274 (330) T protein:vir:77 221 IREGRILGRPTYVADNVVNGTVGNR--------------VVGVMG------------DFSQVIWGQIGGLSFDVTDQATL 274 (330) T ss_pred cCCceecceeeEEeccccCCCCCCc--------------cEEEEE------------ecceEEEEEecCcEEEEeeccee Confidence 2335899999999999986432110 111111 1111222333445555544321 Q ss_pred -----------------hh--hhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 314 -----------------YQ--ADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 314 -----------------~~--~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +. .-.++...++|.+++||++.+.|+.+... 
T Consensus 275 ~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~~~ 324 (330) T protein:vir:77 275 DFGEEQGGVWVPKLISLWQHNMVAVRCEAEFAFMVNDKDAFVKLTDQVAG 324 (330) T ss_pred eecccccccccccccchhhcCcEEEEEEEEeccEEecccceEEEEeccCC Confidence 11 23467888999999999998888655533 No 67 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=99.52 E-value=1.9e-15 Score=101.26 Aligned_cols=293 Identities=9% Similarity=0.037 Sum_probs=170.9 Q ss_pred CCCc-cccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeeccccc-EEEEee-cCcceeeeeeCCC Q lcl|NC_015719. 1 MANM-QGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGK-SAQFPV-IGRTKAAYLQPGE 77 (344) Q Consensus 1 ma~~-~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~-tv~i~~-iG~~t~~~~~~g~ 77 (344) +.+. ..... .+.+ ....++--.+..+.|..++.+..+..+.++++.+...+.++. ++.++. .+...+.....|. T Consensus 109 ~~~~~~~~~~--~~~~-~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~ 185 (415) T protein:vir:98 109 FTEYLETRND--IQGG-SLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELE 185 (415) T ss_pred HHHHHhhhhh--hhhc-cccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeecccc Confidence 0000 00000 0000 000111113677899999999999999999999988876432 333443 4555566666676 Q ss_pred CCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc Q lcl|NC_015719. 78 SLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAG 157 (344) Q Consensus 78 ~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~ 157 (344) .++... .++.+++++.+.+.-. -+.|.+-=-.++.+|+.+.+.++.++++++..|+.++..... ..+. . 
T Consensus 186 ~~~~~~-~~~~~~v~~~~~k~~~-~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~----g~~~----~- 254 (415) T protein:vir:98 186 ENPELA-VKPFFQLAYDINTHRG-YFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITK----GSTG----S- 254 (415) T ss_pred ccCccc-ccceeeEEeeeeeeEe-eehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccc----Cccc----c- Confidence 665332 2345666666554432 244544222346788999999999999999999998643211 0000 0 Q ss_pred ccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcccccccccccee Q lcl|NC_015719. 158 LGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSI 237 (344) Q Consensus 158 ~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V 237 (344) ...... ..+..... .....|+.|+++...+...+... -.+|++|..|..|..-.. .+..|.-...+.+|.. T Consensus 255 ~~~~~~-~~~~~~~~-----~~~~~~~~i~~~~~~~~~~~~~~--~~~v~n~~~~~~l~~lkd-~~G~~l~~~~~~~~~~ 325 (415) T protein:vir:98 255 TSSGFE-KEGKKLEV-----KKAKSLDDIKDAINLNVKPNYEH--NVAIVSQTMFAKLDKMKD-KLGNYLIQPDVKEKTQ 325 (415) T ss_pred cccccc-cccccccc-----ccccchhHHHHHHHhhhhhccCC--CEEEEcHHHHHHHHHhhc-cCCceeeccCcCCCCC Confidence 000000 00000111 11123777888888888777642 246789999999875322 2233333334567777 Q ss_pred EEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEec-HHHHhhhhhheeeeeeeecchhhh Q lcl|NC_015719. 238 RNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQH-RSAVGTVKLKDLALERARRAEYQA 316 (344) Q Consensus 238 g~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~-~~Av~~~~~~~~~~e~~~~~~~~~ 316 (344) ++++|++|+.++++|.+..+... .++.. +.++..+....++++..++ ..+. T Consensus 326 ~~l~G~pV~~~~~~~~~~~~~~~---------------------------~~~Gd~~~~~~~~~~~~~~v~~~~~-~~~~ 377 (415) T protein:vir:98 326 QRLLGAKIEILPDEVLGQKGNNT---------------------------LIIGNLKDAIVLFDRSQYQASWTDY-MHFG 377 (415) T ss_pred ceecceeeEEecccccCCCCccE---------------------------EEEEehhccEEEEeecceEEEEecc-ccCc Confidence 89999999999999865432211 11112 1122234445566665544 3344 Q ss_pred hhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
317 DQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 317 d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ..+++.++++.++++|++.+.+.++.++ T Consensus 378 ~~~~~~~r~d~~v~~~~a~~~~~~~~~~ 405 (415) T protein:vir:98 378 ECLMIAVRQDCRILDYKSAIVIEYDDSE 405 (415) T ss_pred eEEEEEEEeccEEeccccEEEEEEeccC Confidence 5688999999999999999999999999 No 68 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=99.52 E-value=1.9e-15 Score=101.26 Aligned_cols=293 Identities=9% Similarity=0.037 Sum_probs=170.9 Q ss_pred CCCc-cccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeeccccc-EEEEee-cCcceeeeeeCCC Q lcl|NC_015719. 1 MANM-QGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGK-SAQFPV-IGRTKAAYLQPGE 77 (344) Q Consensus 1 ma~~-~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~-tv~i~~-iG~~t~~~~~~g~ 77 (344) +.+. ..... .+.+ ....++--.+..+.|..++.+..+..+.++++.+...+.++. ++.++. .+...+.....|. T Consensus 109 ~~~~~~~~~~--~~~~-~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~ 185 (415) T protein:vir:81 109 FTEYLETRND--IQGG-SLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELE 185 (415) T ss_pred HHHHHhhhhh--hhhc-cccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeecccc Confidence 0000 00000 0000 000111113677899999999999999999999988876432 333443 4555566666676 Q ss_pred CCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc Q lcl|NC_015719. 78 SLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAG 157 (344) Q Consensus 78 ~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~ 157 (344) .++... .++.+++++.+.+.-. -+.|.+-=-.++.+|+.+.+.++.++++++..|+.++..... ..+. . 
T Consensus 186 ~~~~~~-~~~~~~v~~~~~k~~~-~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~----g~~~----~- 254 (415) T protein:vir:81 186 ENPELA-VKPFFQLAYDINTHRG-YFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITK----GSTG----S- 254 (415) T ss_pred ccCccc-ccceeeEEeeeeeeEe-eehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccc----Cccc----c- Confidence 665332 2345666666554432 244544222346788999999999999999999998643211 0000 0 Q ss_pred ccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcccccccccccee Q lcl|NC_015719. 158 LGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSI 237 (344) Q Consensus 158 ~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V 237 (344) ...... ..+..... .....|+.|+++...+...+... -.+|++|..|..|..-.. .+..|.-...+.+|.. T Consensus 255 ~~~~~~-~~~~~~~~-----~~~~~~~~i~~~~~~~~~~~~~~--~~~v~n~~~~~~l~~lkd-~~G~~l~~~~~~~~~~ 325 (415) T protein:vir:81 255 TSSGFE-KEGKKLEV-----KKAKSLDDIKDAINLNVKPNYEH--NVAIVSQTMFAKLDKMKD-KLGNYLIQPDVKEKTQ 325 (415) T ss_pred cccccc-cccccccc-----ccccchhHHHHHHHhhhhhccCC--CEEEEcHHHHHHHHHhhc-cCCceeeccCcCCCCC Confidence 000000 00000111 11123777888888888777642 246789999999875322 2233333334567777 Q ss_pred EEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEec-HHHHhhhhhheeeeeeeecchhhh Q lcl|NC_015719. 238 RNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQH-RSAVGTVKLKDLALERARRAEYQA 316 (344) Q Consensus 238 g~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~-~~Av~~~~~~~~~~e~~~~~~~~~ 316 (344) ++++|++|+.++++|.+..+... .++.. +.++..+....++++..++ ..+. T Consensus 326 ~~l~G~pV~~~~~~~~~~~~~~~---------------------------~~~Gd~~~~~~~~~~~~~~v~~~~~-~~~~ 377 (415) T protein:vir:81 326 QRLLGAKIEILPDEVLGQKGNNT---------------------------LIIGNLKDAIVLFDRSQYQASWTDY-MHFG 377 (415) T ss_pred ceecceeeEEecccccCCCCccE---------------------------EEEEehhccEEEEeecceEEEEecc-ccCc Confidence 89999999999999865432211 11112 1122234445566665544 3344 Q ss_pred hhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
317 DQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 317 d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ..+++.++++.++++|++.+.+.++.++ T Consensus 378 ~~~~~~~r~d~~v~~~~a~~~~~~~~~~ 405 (415) T protein:vir:81 378 ECLMIAVRQDCRILDYKSAIVIEYDDSE 405 (415) T ss_pred eEEEEEEEeccEEeccccEEEEEEeccC Confidence 5688999999999999999999999999 No 69 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=99.52 E-value=1.9e-15 Score=101.26 Aligned_cols=293 Identities=9% Similarity=0.037 Sum_probs=170.9 Q ss_pred CCCc-cccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeeccccc-EEEEee-cCcceeeeeeCCC Q lcl|NC_015719. 1 MANM-QGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGK-SAQFPV-IGRTKAAYLQPGE 77 (344) Q Consensus 1 ma~~-~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~-tv~i~~-iG~~t~~~~~~g~ 77 (344) +.+. ..... .+.+ ....++--.+..+.|..++.+..+..+.++++.+...+.++. ++.++. .+...+.....|. T Consensus 109 ~~~~~~~~~~--~~~~-~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~ 185 (415) T protein:vir:79 109 FTEYLETRND--IQGG-SLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELE 185 (415) T ss_pred HHHHHhhhhh--hhhc-cccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeecccc Confidence 0000 00000 0000 000111113677899999999999999999999988876432 333443 4555566666676 Q ss_pred CCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc Q lcl|NC_015719. 78 SLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAG 157 (344) Q Consensus 78 ~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~ 157 (344) .++... .++.+++++.+.+.-. -+.|.+-=-.++.+|+.+.+.++.++++++..|+.++..... ..+. . 
T Consensus 186 ~~~~~~-~~~~~~v~~~~~k~~~-~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~----g~~~----~- 254 (415) T protein:vir:79 186 ENPELA-VKPFFQLAYDINTHRG-YFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITK----GSTG----S- 254 (415) T ss_pred ccCccc-ccceeeEEeeeeeeEe-eehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccc----Cccc----c- Confidence 665332 2345666666554432 244544222346788999999999999999999998643211 0000 0 Q ss_pred ccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcccccccccccee Q lcl|NC_015719. 158 LGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSI 237 (344) Q Consensus 158 ~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V 237 (344) ...... ..+..... .....|+.|+++...+...+... -.+|++|..|..|..-.. .+..|.-...+.+|.. T Consensus 255 ~~~~~~-~~~~~~~~-----~~~~~~~~i~~~~~~~~~~~~~~--~~~v~n~~~~~~l~~lkd-~~G~~l~~~~~~~~~~ 325 (415) T protein:vir:79 255 TSSGFE-KEGKKLEV-----KKAKSLDDIKDAINLNVKPNYEH--NVAIVSQTMFAKLDKMKD-KLGNYLIQPDVKEKTQ 325 (415) T ss_pred cccccc-cccccccc-----ccccchhHHHHHHHhhhhhccCC--CEEEEcHHHHHHHHHhhc-cCCceeeccCcCCCCC Confidence 000000 00000111 11123777888888888777642 246789999999875322 2233333334567777 Q ss_pred EEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEec-HHHHhhhhhheeeeeeeecchhhh Q lcl|NC_015719. 238 RNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQH-RSAVGTVKLKDLALERARRAEYQA 316 (344) Q Consensus 238 g~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~-~~Av~~~~~~~~~~e~~~~~~~~~ 316 (344) ++++|++|+.++++|.+..+... .++.. +.++..+....++++..++ ..+. T Consensus 326 ~~l~G~pV~~~~~~~~~~~~~~~---------------------------~~~Gd~~~~~~~~~~~~~~v~~~~~-~~~~ 377 (415) T protein:vir:79 326 QRLLGAKIEILPDEVLGQKGNNT---------------------------LIIGNLKDAIVLFDRSQYQASWTDY-MHFG 377 (415) T ss_pred ceecceeeEEecccccCCCCccE---------------------------EEEEehhccEEEEeecceEEEEecc-ccCc Confidence 89999999999999865432211 11112 1122234445566665544 3344 Q ss_pred hhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
317 DQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 317 d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ..+++.++++.++++|++.+.+.++.++ T Consensus 378 ~~~~~~~r~d~~v~~~~a~~~~~~~~~~ 405 (415) T protein:vir:79 378 ECLMIAVRQDCRILDYKSAIVIEYDDSE 405 (415) T ss_pred eEEEEEEEeccEEeccccEEEEEEeccC Confidence 5688999999999999999999999999 No 70 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=99.50 E-value=3.7e-15 Score=99.72 Aligned_cols=298 Identities=13% Similarity=0.081 Sum_probs=164.9 Q ss_pred CCCccccccccc-------cc--ccc-ccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcce- Q lcl|NC_015719. 1 MANMQGGQQLGT-------NQ--GKG-QSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRTK- 69 (344) Q Consensus 1 ma~~~~~~~~~~-------~~--g~~-~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t- 69 (344) +.-+..+.+..+ +. ..+ ....+--.+..++|.+++.+..+..+.++++.+..++.++..+.++..+... T Consensus 93 ~~~l~~~~~~~~~~e~~~~~~~~a~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 172 (409) T protein:vir:45 93 DKWMRHGASELTSEERKALRELRAQGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQILTTSDGRTMEWATADGTSE 172 (409) T ss_pred HHHHHhhhhhccHHHHHHHHHHhhccCccCcCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEeeccCcc Confidence 000000000000 00 000 0001111356799999999999999999999998888888888888775432 Q ss_pred -eeeeeCCCCCCCCcCCcccceEEEEeeeeeeec--eeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_015719. 70 -AAYLQPGESLDDKRKDIKHTEKTINIDGLLTAD--VLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLIN 146 (344) Q Consensus 70 -~~~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~--~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~ 146 (344) ......|...+.+ +++...+++ ...+... +.|.+-=-.++.+|+.+.+.++.++++++..|+.|+.- .. 
T Consensus 173 ~~~~v~E~~~~~~~--~~~f~~~~l--~~~k~~~~~i~is~ell~ds~~~l~~~i~~~la~a~~~~~~~a~l~G----~G 244 (409) T protein:vir:45 173 VGVLLGENEEAGEE--DTDFGMGSL--GALKMTSKIIRVSNELLQDSAIDMEAYLARRIAERIGRGEARYLIQG----TG 244 (409) T ss_pred cccccccccccccc--ccccceeee--eeeeeeeeehhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhcc----CC Confidence 2334445555443 334444444 4444432 33554222235689999999999999999999998621 10 Q ss_pred cccccccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCE-EEeCHHHHHHHhccchhhhhc Q lcl|NC_015719. 147 LADGVNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRT-FYTTPDVYSAILAALMPNAAN 225 (344) Q Consensus 147 ~~~~~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~-~vv~P~~~~~Ll~~~~~~~~~ 225 (344) . .....+.|...... ....+..+ ....++.|+++...|..... ....| ++++|..|..|.+-.. .+.. T Consensus 245 ~--~~~~~p~Gil~~~~--~~~~~~~~-----~~~~~d~i~~l~~~l~~~~~-~~a~~~~~~n~~~~~~l~~lkd-~~G~ 313 (409) T protein:vir:45 245 A--GTPKQPKGLAASVT--GTTQTAAA-----NAVKWQEILALKHSIDPAYR-RGPKFRLAFNDNTLKLISEMED-GQGR 313 (409) T ss_pred C--CCccccceeeeccc--cccccccc-----cccchHHHHHHHHhhhhhhc-cCCeEEEEECHHHHHHHHHhhc-CCCc Confidence 0 00011111110000 00001111 11126777888777766654 34467 4679999988754211 1223 Q ss_pred cccccccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheee Q lcl|NC_015719. 226 YAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLA 305 (344) Q Consensus 226 ~~~~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~ 305 (344) |.-...+..|...+++|.+|+.++++|....+...+ .-|++ .. ........+. T Consensus 314 ~i~~~~~~~~~~~~l~G~PV~~~~~~p~~~~~~~~i---------------~~Gd~----------~~--~~i~~~~~~~ 366 (409) T protein:vir:45 314 PLWLPDIVGVAPASVLNVPYVIDQEIDDIGAGKKFM---------------FCGDF----------DR--FIIRRVRYMI 366 (409) T ss_pred eeeccCcCCCCCceecceeeEEecCcCCccCCccEE---------------EEeeh----------hh--hheeeccceE Confidence 332334556766799999999999998533221111 00111 11 1122234445 Q ss_pred eeeeecchhhhhh--hhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
306 LERARRAEYQADQ--IIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 306 ~e~~~~~~~~~d~--i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ++...|+-..-+. |++..++|.++++|++.+.++.++.+ T Consensus 367 ~~~~~d~~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~k~s~ 407 (409) T protein:vir:45 367 LKRLVERYAEYDQTGFLAFHRFDCILEDTSAIKALVGKGSV 407 (409) T ss_pred EEEeecccccCCcEEEEEEEEeccEeechhheEEEEeccCC Confidence 6666654322343 78899999999999999999988887 No 71 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=99.50 E-value=5.6e-15 Score=98.72 Aligned_cols=285 Identities=13% Similarity=0.055 Sum_probs=170.4 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec-CcceeeeeeCCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI-GRTKAAYLQPGESL 79 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (344) ||-..--+...+-+ ++.-.+..+.+..++.+..++.+.++.+.+...+. +..++||+. +...+.-+..+..+ T Consensus 1 ma~~~~~~~~~~~t------~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~E~~~~ 73 (304) T protein:vir:10 1 MATPTYTPGNVILS------DFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMT-AQKKKFTYLAKGVGAYWVSETERI 73 (304) T ss_pred Cccccccccccccc------CCCceecchhHHHHHHHHHHhccchhhhcceeecc-CCceEEEEEeCCcceEEeecCccc Confidence 88744211111111 22224688999999999999999999998887765 455788887 55566677777777 Q ss_pred CCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccc Q lcl|NC_015719. 80 DDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGLG 159 (344) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~~ 159 (344) +.. .++.+++++.+.+.. .-+.|.+-=..++.+|+.+.+.++.++++++..|+.++.- .....+. .... 
T Consensus 74 ~~~--~~~~~~i~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G----~g~~~~~----~~~~ 142 (304) T protein:vir:10 74 QTS--KPEYAQAEMEAKKIG-VIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFG----TKSPYNT----STSG 142 (304) T ss_pred ccc--cceeeEEEEEEEEEE-EeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheec----cCCCccc----cccc Confidence 654 456677777665543 2345554222335689999999999999999999988631 1110011 0011 Q ss_pred CceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccceeEE Q lcl|NC_015719. 160 KPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSIRN 239 (344) Q Consensus 160 ~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg~ 239 (344) .+......... .........|+.|.++...|..++.... .++++|..|..|.+-.. ..+ ..+..+..++ T Consensus 143 ~~~~~~~~~~~---~~~~~~~~~~~~i~~~~~~l~~~~~~~~--~~v~~~~~~~~L~~lkd-----~~G-~~l~~~~~~~ 211 (304) T protein:vir:10 143 KPLVEGAEEKG---NVVTDTNNLYVDLSALMATIEDEELDPN--GVLTTRSFRSKMRNALD-----AND-RPLFDANGNE 211 (304) T ss_pred ccccccccccc---cccccccchHHHHHHHHHHhhhccCCcC--EEEEcHHHHHHHHHhhc-----cCC-cEeecCCCcc Confidence 11110011110 0111123358888899888888876433 57899999999875321 111 1122334578 Q ss_pred EeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecch------ Q lcl|NC_015719. 240 VMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAE------ 313 (344) Q Consensus 240 i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~------ 313 (344) ++|.+|+.++++|........ .-+++ +-+..+..++++++..++.. T Consensus 212 l~G~PV~~~~~~~~~~~~~~~----------------~~gd~------------~~~~~~~~~~~~i~~~~e~~~~~~~~ 263 (304) T protein:vir:10 212 IMGLPLSYTGADVYDKKKSLA----------------LMGDW------------DYARYGILQGIEYAISEDATLTTLQA 263 (304) T ss_pred ccceeeEEecccccCCCCcEE----------------EEEeh------------hhEEEEEecceEEEEeecceeeeecc Confidence 999999999999864322111 11111 11112223334444443321 Q ss_pred ----------hhh--hhhhhhhhhcCceeccccEEEEEecC Q lcl|NC_015719. 
314 ----------YQA--DQIIAKYAMGHGGLRPESAGALVFKA 342 (344) Q Consensus 314 ----------~~~--d~i~~~~~~G~~v~Rp~~~~~l~~~~ 342 (344) +.. -.+++.+++|..+++|++.+.|+..+ T Consensus 264 ~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 264 SDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred cccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 122 34567789999999999999999999 No 72 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=99.50 E-value=5.6e-15 Score=98.72 Aligned_cols=285 Identities=13% Similarity=0.055 Sum_probs=170.4 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec-CcceeeeeeCCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI-GRTKAAYLQPGESL 79 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (344) ||-..--+...+-+ ++.-.+..+.+..++.+..++.+.++.+.+...+. +..++||+. +...+.-+..+..+ T Consensus 1 ma~~~~~~~~~~~t------~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~E~~~~ 73 (304) T protein:vir:94 1 MATPTYTPGNVILS------DFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMT-AQKKKFTYLAKGVGAYWVSETERI 73 (304) T ss_pred Cccccccccccccc------CCCceecchhHHHHHHHHHHhccchhhhcceeecc-CCceEEEEEeCCcceEEeecCccc Confidence 88744211111111 22224688999999999999999999998887765 455788887 55566677777777 Q ss_pred CCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccc Q lcl|NC_015719. 80 DDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGLG 159 (344) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~~ 159 (344) +.. .++.+++++.+.+.. .-+.|.+-=..++.+|+.+.+.++.++++++..|+.++.- .....+. .... 
T Consensus 74 ~~~--~~~~~~i~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G----~g~~~~~----~~~~ 142 (304) T protein:vir:94 74 QTS--KPEYAQAEMEAKKIG-VIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFG----TKSPYNT----STSG 142 (304) T ss_pred ccc--cceeeEEEEEEEEEE-EeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheec----cCCCccc----cccc Confidence 654 456677777665543 2345554222335689999999999999999999988631 1110011 0011 Q ss_pred CceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccceeEE Q lcl|NC_015719. 160 KPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSIRN 239 (344) Q Consensus 160 ~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg~ 239 (344) .+......... .........|+.|.++...|..++.... .++++|..|..|.+-.. ..+ ..+..+..++ T Consensus 143 ~~~~~~~~~~~---~~~~~~~~~~~~i~~~~~~l~~~~~~~~--~~v~~~~~~~~L~~lkd-----~~G-~~l~~~~~~~ 211 (304) T protein:vir:94 143 KPLVEGAEEKG---NVVTDTNNLYVDLSALMATIEDEELDPN--GVLTTRSFRSKMRNALD-----AND-RPLFDANGNE 211 (304) T ss_pred ccccccccccc---cccccccchHHHHHHHHHHhhhccCCcC--EEEEcHHHHHHHHHhhc-----cCC-cEeecCCCcc Confidence 11110011110 0111123358888899888888876433 57899999999875321 111 1122334578 Q ss_pred EeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecch------ Q lcl|NC_015719. 240 VMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAE------ 313 (344) Q Consensus 240 i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~------ 313 (344) ++|.+|+.++++|........ .-+++ +-+..+..++++++..++.. T Consensus 212 l~G~PV~~~~~~~~~~~~~~~----------------~~gd~------------~~~~~~~~~~~~i~~~~e~~~~~~~~ 263 (304) T protein:vir:94 212 IMGLPLSYTGADVYDKKKSLA----------------LMGDW------------DYARYGILQGIEYAISEDATLTTLQA 263 (304) T ss_pred ccceeeEEecccccCCCCcEE----------------EEEeh------------hhEEEEEecceEEEEeecceeeeecc Confidence 999999999999864322111 11111 11112223334444443321 Q ss_pred ----------hhh--hhhhhhhhhcCceeccccEEEEEecC Q lcl|NC_015719. 
314 ----------YQA--DQIIAKYAMGHGGLRPESAGALVFKA 342 (344) Q Consensus 314 ----------~~~--d~i~~~~~~G~~v~Rp~~~~~l~~~~ 342 (344) +.. -.+++.+++|..+++|++.+.|+..+ T Consensus 264 ~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 264 SDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred cccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 122 34567789999999999999999999 No 73 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=99.49 E-value=3.4e-15 Score=99.92 Aligned_cols=296 Identities=8% Similarity=0.028 Sum_probs=168.0 Q ss_pred CCCccccccccccccc-cccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeeccccc-EEEEee-cCcceeeeeeCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGK-GQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGK-SAQFPV-IGRTKAAYLQPGE 77 (344) Q Consensus 1 ma~~~~~~~~~~~~g~-~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~-tv~i~~-iG~~t~~~~~~g~ 77 (344) ...+............ ...+++--.+..+.|.+++.+..+..+.++++++..++.++. ++.++. .+...+..+..|. T Consensus 106 ~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~ 185 (415) T protein:vir:47 106 VRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELE 185 (415) T ss_pred HHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceeeccccc Confidence 0000000000000000 001111123677999999999999999999999988876543 233333 3444556666676 Q ss_pred CCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc Q lcl|NC_015719. 78 SLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAG 157 (344) Q Consensus 78 ~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~ 157 (344) .++... 
..+.+++++..-+.- .-+.|.+-=-.++.+|+.+.+.++.+++|++..|+.|+.....+ .+ .+ T Consensus 186 ~~~~~~-~~~~~~v~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g----~~-----~~ 254 (415) T protein:vir:47 186 ENPELA-VKPFFQLAYDINTHR-GYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKG----ST-----GS 254 (415) T ss_pred cccccc-ccceeeEEeeeeeeE-eeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccC----Cc-----cc Confidence 665331 124455555554432 22445442223456889999999999999999999987432110 00 00 Q ss_pred ccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcccccccccccee Q lcl|NC_015719. 158 LGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSI 237 (344) Q Consensus 158 ~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V 237 (344) ........ ......+ ....++.|+++...+...... .=.+|++|..|..|..-.. .+..|.-...+.+|.. T Consensus 255 ~~~~~~~~-~~~~~~~-----~~~~~~~i~~~~~~~~~~~~~--~~~~v~n~~~~~~L~~lkd-~~G~~i~~~~~~~~~~ 325 (415) T protein:vir:47 255 TSSGFEKE-GKKLEVK-----KAKSLDDIKDAINLNVKPNYE--HNVAIVSQTMFAKLDKMKD-KLGNYLIQPDVKEKTQ 325 (415) T ss_pred cccccccc-cceeccc-----cccchHHHHHHHHhhhhhccC--CCEEEEcHHHHHHHHHhhc-cCCCeeeccCcCCCCC Confidence 00010000 0000000 111267777787777776663 2357899999998865221 2233433334567777 Q ss_pred EEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEec-HHHHhhhhhheeeeeeeecchhhh Q lcl|NC_015719. 238 RNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQH-RSAVGTVKLKDLALERARRAEYQA 316 (344) Q Consensus 238 g~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~-~~Av~~~~~~~~~~e~~~~~~~~~ 316 (344) ++++|++|+.++++|.++.+... .++.. +.++..+..++++++.... ..+. 
T Consensus 326 ~~l~G~pV~~~~~~~~~~~~~~~---------------------------~~~gd~~~~~~~~~~~~~~v~~~~~-~~~~ 377 (415) T protein:vir:47 326 QRLLGAKIEILPDEVLGQKGNNT---------------------------LIIGNLKDAIVLFDRSQYQASWTDY-MHFG 377 (415) T ss_pred ccccceeeEEeccccccCCCccE---------------------------EEEEehhccEEEEeecceEEEeecc-ccCc Confidence 89999999999999865432211 11111 1122233445556665543 3334 Q ss_pred hhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 317 DQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 317 d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ..+++.++++.++++|++.+.+.++..| T Consensus 378 ~~~~~~~r~d~~v~~~~a~~~~~~~~~~ 405 (415) T protein:vir:47 378 ECLMIAVRQDCRILDYKSAIVIEYDDSE 405 (415) T ss_pred eEEEEEEEeccEEeccccEEEEEeeccC Confidence 5678999999999999999999999998 No 74 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=99.49 E-value=3.4e-15 Score=99.92 Aligned_cols=296 Identities=8% Similarity=0.028 Sum_probs=168.0 Q ss_pred CCCccccccccccccc-cccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeeccccc-EEEEee-cCcceeeeeeCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGK-GQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGK-SAQFPV-IGRTKAAYLQPGE 77 (344) Q Consensus 1 ma~~~~~~~~~~~~g~-~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~-tv~i~~-iG~~t~~~~~~g~ 77 (344) ...+............ ...+++--.+..+.|.+++.+..+..+.++++++..++.++. ++.++. .+...+..+..|. T Consensus 106 ~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~ 185 (415) T protein:vir:46 106 VRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELE 185 (415) T ss_pred HHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceeeccccc Confidence 0000000000000000 001111123677999999999999999999999988876543 233333 3444556666676 Q ss_pred CCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc Q lcl|NC_015719. 
78 SLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAG 157 (344) Q Consensus 78 ~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~ 157 (344) .++... ..+.+++++..-+.- .-+.|.+-=-.++.+|+.+.+.++.+++|++..|+.|+.....+ .+ .+ T Consensus 186 ~~~~~~-~~~~~~v~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g----~~-----~~ 254 (415) T protein:vir:46 186 ENPELA-VKPFFQLAYDINTHR-GYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKG----ST-----GS 254 (415) T ss_pred cccccc-ccceeeEEeeeeeeE-eeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccC----Cc-----cc Confidence 665331 124455555554432 22445442223456889999999999999999999987432110 00 00 Q ss_pred ccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcccccccccccee Q lcl|NC_015719. 158 LGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSI 237 (344) Q Consensus 158 ~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V 237 (344) ........ ......+ ....++.|+++...+...... .=.+|++|..|..|..-.. .+..|.-...+.+|.. T Consensus 255 ~~~~~~~~-~~~~~~~-----~~~~~~~i~~~~~~~~~~~~~--~~~~v~n~~~~~~L~~lkd-~~G~~i~~~~~~~~~~ 325 (415) T protein:vir:46 255 TSSGFEKE-GKKLEVK-----KAKSLDDIKDAINLNVKPNYE--HNVAIVSQTMFAKLDKMKD-KLGNYLIQPDVKEKTQ 325 (415) T ss_pred cccccccc-cceeccc-----cccchHHHHHHHHhhhhhccC--CCEEEEcHHHHHHHHHhhc-cCCCeeeccCcCCCCC Confidence 00010000 0000000 111267777787777776663 2357899999998865221 2233433334567777 Q ss_pred EEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEec-HHHHhhhhhheeeeeeeecchhhh Q lcl|NC_015719. 238 RNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQH-RSAVGTVKLKDLALERARRAEYQA 316 (344) Q Consensus 238 g~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~-~~Av~~~~~~~~~~e~~~~~~~~~ 316 (344) ++++|++|+.++++|.++.+... .++.. +.++..+..++++++.... ..+. 
T Consensus 326 ~~l~G~pV~~~~~~~~~~~~~~~---------------------------~~~gd~~~~~~~~~~~~~~v~~~~~-~~~~ 377 (415) T protein:vir:46 326 QRLLGAKIEILPDEVLGQKGNNT---------------------------LIIGNLKDAIVLFDRSQYQASWTDY-MHFG 377 (415) T ss_pred ccccceeeEEeccccccCCCccE---------------------------EEEEehhccEEEEeecceEEEeecc-ccCc Confidence 89999999999999865432211 11111 1122233445556665543 3334 Q ss_pred hhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 317 DQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 317 d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ..+++.++++.++++|++.+.+.++..| T Consensus 378 ~~~~~~~r~d~~v~~~~a~~~~~~~~~~ 405 (415) T protein:vir:46 378 ECLMIAVRQDCRILDYKSAIVIEYDDSE 405 (415) T ss_pred eEEEEEEEeccEEeccccEEEEEeeccC Confidence 5678999999999999999999999998 No 75 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=99.48 E-value=3.6e-15 Score=99.77 Aligned_cols=285 Identities=12% Similarity=0.036 Sum_probs=167.1 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec-CcceeeeeeCCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI-GRTKAAYLQPGESL 79 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (344) +.+.-.++. .+......+.+...+..+.+..++.+..+..|.++++.+..++. +.+++||+. +.+.+.-+..|+.+ T Consensus 15 ~~~~~~~~~--~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~~~~~-~~~~~~p~~~~~~~a~~v~Eg~~~ 91 (324) T protein:vir:96 15 ASNNVKPQV--FNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEGQKI 91 (324) T ss_pred HHhhhhhhh--cccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEecCcceeeecCCccc Confidence 101000000 11111111122233678999999999999999999999888865 456888887 55566667778887 Q ss_pred CCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccc Q lcl|NC_015719. 
80 DDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGLG 159 (344) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~~ 159 (344) +.. .++.+++++..-+.. .-..|.+-=-.++..|+.+.+.++.++++++..|+.+|.-- .. ...+.+.. T Consensus 92 ~~~--~~~f~~v~~~~~k~~-~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~~~l~G~----g~----~~~~~~~~ 160 (324) T protein:vir:96 92 ETS--KATWVNATMRAFKLG-VILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ----GN----NPFGKSIA 160 (324) T ss_pred ccc--ccceeEEEEEeEEEE-EeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcC----CC----CCcCcccc Confidence 754 456777777665543 33556552122346889999999999999999999887311 10 01111110 Q ss_pred CceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccceeEE Q lcl|NC_015719. 160 KPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSIRN 239 (344) Q Consensus 160 ~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg~ 239 (344) .. ...... ...+...++.|+++...|..++.... .++++|..+..|.+-.. -.+.-.+..|..+. T Consensus 161 ~~----~~~~~~----~~~~~~~~~~i~~~~~~i~~~~~~~~--~~i~n~~~~~~L~~lkd-----~~G~~~~~~~~~~~ 225 (324) T protein:vir:96 161 QS----IKKTNK----VIKGDFTQDNIIDLEALLEDDELEAN--AFISKTQNRSLLRKIVD-----PETKERIYDRNSDS 225 (324) T ss_pred cc----ccccce----ecccccchHHHHHHHHhhhhccCCCC--EEEEcHHHHHHHHHhhC-----CCCCeeecCCCCCc Confidence 00 000000 01112236778888888887766433 57899999998875321 12223344566678 Q ss_pred EeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecch------ Q lcl|NC_015719. 240 VMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAE------ 313 (344) Q Consensus 240 i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~------ 313 (344) ++|++|+.++..+.+... . ++.+.+.+..+..+++++|..++.. 
T Consensus 226 l~G~PV~~~~~~~~~~~~------------------~------------~~gd~s~~~~~~~~~~~i~~~~~~~~~~~~~ 275 (324) T protein:vir:96 226 LDGLPVVNLKSSNLKRGE------------------L------------ITGDFDKLIYGIPQLIEYKIDETAQLSTVKN 275 (324) T ss_pred ccceeeEeecCCCCCcce------------------E------------EEEecceEEEEEecCcEEEEeeccccccccc Confidence 999999988766533210 0 1111111122333445566555432 Q ss_pred --------hhh--hhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 314 --------YQA--DQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 314 --------~~~--d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +.. -.++..+++|.+++||++.+.|+..... T Consensus 276 ~~~~~~~~~~~n~v~~r~~~r~d~~v~~~~a~~~l~~a~~~ 316 (324) T protein:vir:96 276 EDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKR 316 (324) T ss_pred ccccchhhhhcCcEEEEEEEEeccEEecccceEEEeccccc Confidence 222 3467888999999999999988854444 No 76 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=99.48 E-value=7e-15 Score=98.18 Aligned_cols=291 Identities=11% Similarity=0.045 Sum_probs=170.9 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec-CcceeeeeeCCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI-GRTKAAYLQPGESL 79 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (344) ||....+.. +-.++|.+++.+..+..|+++.+.+...+.+| .++||+. +.+.+.-+..|+.+ T Consensus 1 mat~~~gg~----------------lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~-~~~~p~~~~~~~a~wv~Eg~~~ 63 (311) T protein:vir:81 1 MVALATGTF----------------QLPKHLVPGVWQKAQGQSVLARLSMAEPQEFG-EQQYMTLTAPPRGEVVGEGAQK 63 (311) T ss_pred CceecCCce----------------EcchhHHHHHHHHHHhcchhhhhcceeecCCC-ceEEEEEeCCceeEEeecCccc Confidence 888665321 45689999999999999999999988777655 4788886 66777778888888 Q ss_pred CCCcCCcccceEEEEeeeeeeeceeccchHHH-----HhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|NC_015719. 
80 DDKRKDIKHTEKTINIDGLLTADVLIYDIEDA-----MNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNEN 154 (344) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~-----q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~ 154 (344) +.. +++.++++|..-+.. .-..|.+ |. ....++.+.+.++.+++|++..|+.++..... ..... T Consensus 64 ~~~--~~~f~~v~l~~~kl~-~~~~iS~--ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~------~~~~~ 132 (311) T protein:vir:81 64 SES--TATFAPVTAIPRKVQ-VTQRFSQ--EVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINP------LTGAA 132 (311) T ss_pred ccc--cceeeEEEEeeEEEE-EeehhhH--HHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccC------CCCcc Confidence 754 456677777664442 2234443 32 23456899999999999999999998742110 00011 Q ss_pred cccccC-----ceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccc Q lcl|NC_015719. 155 IAGLGK-----PSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAAL 229 (344) Q Consensus 155 ~~~~~~-----~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~ 229 (344) +.+... ...+..+. .....++..+.++...+...+... ..++++|..+..|.+-..- +..+.=. T Consensus 133 ~~gi~~~~~~~~~~~~~~~--------~~~~~~~~~i~~~~~~~~~~~~~~--~~~vmn~~~~~~l~~lkd~-~G~~l~~ 201 (311) T protein:vir:81 133 LSGSPAKILDTTNIVELTT--------GTSATPDLAVEAAVGLVLGDNLSP--DGVALDNTFSFMLATQRDS-QGRKLYP 201 (311) T ss_pred cccccccccccceeeeecc--------cccchHHHHHHHHHHHhhhcCCCc--eEEEEcHHHHHHHHhhhcc-CCCeeec Confidence 111111 11111111 111123444555666666655532 3478999999988653211 2222212 Q ss_pred cccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeee Q lcl|NC_015719. 230 IDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERA 309 (344) Q Consensus 230 ~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~ 309 (344) .....|..++++|.+|+.++++|................. .....+.+|+ +-+..+..+.+++|.. 
T Consensus 202 ~~~~~~~~~tl~G~Pv~~~~~i~~~~~~~~~~~~~~~~~~--~~~~~~~gDf------------s~~~i~~~~~~~~~~~ 267 (311) T protein:vir:81 202 ELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTN--PNVKAIAGDF------------SAFRWGVQVSIPLELI 267 (311) T ss_pred CccccCCCceecceeEEecccccccccccccccchhcccC--CccEEEEEec------------ccEEEEEeccceEEEe Confidence 2334566789999999999999865432221111110000 0111112222 1112222334455555 Q ss_pred ecch-------hhhh--hhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 310 RRAE-------YQAD--QIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 310 ~~~~-------~~~d--~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ++.. +..| .+++..++|.++++|++.+.|+-...| T Consensus 268 ~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~a~~~ 311 (311) T protein:vir:81 268 EFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) T ss_pred ccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 4421 2222 466778999999999999999999999 No 77 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=99.48 E-value=2.1e-15 Score=101.09 Aligned_cols=293 Identities=9% Similarity=0.035 Sum_probs=170.2 Q ss_pred CCCc-cccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeeccccc-EEEEeec-CcceeeeeeCCC Q lcl|NC_015719. 1 MANM-QGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGK-SAQFPVI-GRTKAAYLQPGE 77 (344) Q Consensus 1 ma~~-~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~-tv~i~~i-G~~t~~~~~~g~ 77 (344) +.+. ..... .+.+. ...++--.+..+.+.+++++..+..+.++++++...+.++. ++.++.. +...+.....|. 
T Consensus 109 ~~~~~~~~~~--~~~~~-~~~~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~ 185 (415) T protein:vir:94 109 FTEYLETRND--IQGGS-LKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELE 185 (415) T ss_pred HHHHhhhhhh--hhhhc-cccccccccCcHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEeecCCccceeccccc Confidence 0000 00000 00000 01111123466899999999999999999999988876543 4444443 445566666676 Q ss_pred CCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc Q lcl|NC_015719. 78 SLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAG 157 (344) Q Consensus 78 ~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~ 157 (344) .++... .++..++++.+-+.- .-+.|.+-=-.++.+|+.+.+.++.++++++..|+.++.....+ .+ .+ T Consensus 186 ~~~~~~-~~~~~~i~~~~~k~~-~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~il~g~g~g----~~-----~~ 254 (415) T protein:vir:94 186 ENPELA-VKPFFQLAYDINTHR-GYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKG----ST-----GS 254 (415) T ss_pred cccccc-cccceeeEeeheeee-eechhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccC----cc-----cc Confidence 665331 124456555554442 22345442122357889999999999999999999987432210 00 00 Q ss_pred ccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcccccccccccee Q lcl|NC_015719. 158 LGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSI 237 (344) Q Consensus 158 ~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V 237 (344) ...+... .......+ ....|+.|+++...+...+.. .-.+|++|..|..|..-..- +..|.-...+.+|.. 
T Consensus 255 ~~~~~~~-~~~~~~~~-----~~~~~~~i~~~~~~~~~~~~~--~~~~vmn~~~~~~l~~lkd~-~G~~l~~~~~~~~~~ 325 (415) T protein:vir:94 255 TSSGFEK-EGKKLEVK-----KAKSLDDIKDAINLNVKPNYE--HNVAIVSQTMFAKLDKMKDK-LGNYLIQPDVKEKTQ 325 (415) T ss_pred ccccccc-cccccccc-----cccchHHHHHHHHhhhhhccC--CCEEEEcHHHHHHHHHhhcc-CCCeeeccCcCCCCC Confidence 0000000 00001111 111267788888888777764 23578899999998753221 222332334567777 Q ss_pred EEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEec-HHHHhhhhhheeeeeeeecchhhh Q lcl|NC_015719. 238 RNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQH-RSAVGTVKLKDLALERARRAEYQA 316 (344) Q Consensus 238 g~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~-~~Av~~~~~~~~~~e~~~~~~~~~ 316 (344) ++++|.+|+.++++|.+...... .++.. +.++..+....++++..+. ..+. T Consensus 326 ~~l~G~pV~~~~~~~~~~~~~~~---------------------------i~~gd~~~~~~~~~~~~~~v~~~~~-~~~~ 377 (415) T protein:vir:94 326 QRLLGAKIEILPDEVLGQKGNNT---------------------------LIIGNLKDAIVLFDRSQYQASWTDY-MHFG 377 (415) T ss_pred ceecceeeEEecccccCCCCccE---------------------------EEEEehhccEEEEeecceEEEEecc-ccCc Confidence 89999999999999865432211 11111 1122233344556665543 3445 Q ss_pred hhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 317 DQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 317 d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ..+++.++++.++++|++.+.+.++.++ T Consensus 378 ~~~r~~~r~d~~~~~~~a~~~~~~~~~~ 405 (415) T protein:vir:94 378 ECLMIAVRQDCRILDYKSAIVIEYDDSE 405 (415) T ss_pred eEEEEEEEeccEEeccccEEEEEEeccC Confidence 6688999999999999999999999999 No 78 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=99.47 E-value=5.7e-15 Score=98.69 Aligned_cols=295 Identities=13% Similarity=0.042 Sum_probs=160.7 Q ss_pred CCCcccc-ccccccccccccccchhhhhHHHHhhHHH-HHHHHhhhhcCCceeeecccccEEEEee-cCcceeeeeeCCC Q lcl|NC_015719. 
1 MANMQGG-QQLGTNQGKGQSAADKLALFLKVFGGEVL-TAFARTSVTANRHMQRQISSGKSAQFPV-IGRTKAAYLQPGE 77 (344) Q Consensus 1 ma~~~~~-~~~~~~~g~~~~~~d~~~l~~e~f~geV~-~~f~~~s~~~~~~~~~~i~~G~tv~i~~-iG~~t~~~~~~g~ 77 (344) +.....- .......+. ..++--.|.++.|..+++ +.+...+.+..+.++... .|+ +.+|+ .+...+..+..|. T Consensus 237 l~~~e~~~~~~~~~~~~--t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~~~~~-~g~-~~~~~~~~~~~a~~v~Eg~ 312 (543) T protein:vir:81 237 LTEEEKRAINEVRAMGL--TKADGGYLVPFQLDPTVIITSNGSLNDIRRFARQVVA-TGD-VWHGVSSAAVQWSWDAEFE 312 (543) T ss_pred hhhhhhhhhhhhhhccc--ccccCcccCchhhhhHHHHHHHhhhchhhhhcccccC-Ccc-eEEEEecCCcceeecccCc Confidence 0000000 000000000 111111256678887765 666677888887776443 454 44544 4556666677777 Q ss_pred CCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc Q lcl|NC_015719. 78 SLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAG 157 (344) Q Consensus 78 ~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~ 157 (344) .++.. .++..++++.+.+.-.+ +.|.+ +-.+.+.|+.+.+.++.++++++..|+.|+. +.. ....+.| T Consensus 313 ~~~~~--~~~~~~i~~~~~k~~~~-~~is~-ell~d~~~~~~~i~~~l~~~~~~~~d~ail~----G~G----t~~~p~G 380 (543) T protein:vir:81 313 EVSDD--SPEFGQPEIPVKKAQGF-VPISI-EALQDEANVTETVALLFAEGKDELEAVTLTT----GTG----QGNQPTG 380 (543) T ss_pred ccccc--ccccceeeeeeeeeEee-ehhhH-HHHhccHHHHHHHHHHHHHHHHHHHHHHHhc----cCC----CCccccc Confidence 77654 45667777766555433 45655 4445568999999999999999999998862 110 0111222 Q ss_pred ccC---ceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcccccccccc Q lcl|NC_015719. 158 LGK---PSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPER 234 (344) Q Consensus 158 ~~~---~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~ 234 (344) ... +....+.+. +.....++.++++...|...+-+ .-.+|++|..|..|.+-..- +..|.-. 
.+.+ T Consensus 381 i~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~l~~~~~~--~~~~v~n~~~~~~l~~lkd~-~G~~l~~-~~~~ 449 (543) T protein:vir:81 381 IVTALAGTAAEIAPV-------TAETFALADVYAVYEQLAARHRR--QGAWLANNLIYNKIRQFDTQ-GGAGLWT-TIGN 449 (543) T ss_pred chhhccccccccccc-------ccccccHHHHHHHHHhhhccccC--CcEEEEcHHHHHHHHHhhcC-CCceecc-CcCC Confidence 111 111111111 11122367777787777666543 23678999999998753221 2222211 2445 Q ss_pred ceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecc-- Q lcl|NC_015719. 235 GSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRA-- 312 (344) Q Consensus 235 G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~-- 312 (344) |.-++++|.+|+.++++|.+...... ......+-|++ +-+..+...+++++...+. T Consensus 450 g~~~~l~G~pv~~~~~~~~~~~~~~~----------~~~~~i~~gd~------------~~~~i~~~~~~~i~~~~~~~~ 507 (543) T protein:vir:81 450 GEPSQLLGRPVGEAEAMDANWNTSAS----------ADNFVLLYGNF------------QNYVIADRIGMTVEFIPHLFG 507 (543) T ss_pred CCCccccceeeEEecccccccccccc----------CCcceEEEeec------------cceeEEeecccEEEEeccccc Confidence 66678999999999999975432210 00111222222 1111222333344432211 Q ss_pred --h--hhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
313 --E--YQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 313 --~--~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) + ...-.+++..++|.++++|++.+.++++.+| T Consensus 508 ~~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~~~a 543 (543) T protein:vir:81 508 TNRRPNGSRGWFAYYRMGADVVNPNAFRLLNVETAS 543 (543) T ss_pred cchhhcCceEEEEEEeeccEeecccceEEEEecccC Confidence 1 1122457777899999999999999999999 No 79 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=99.46 E-value=7e-15 Score=98.18 Aligned_cols=279 Identities=11% Similarity=0.048 Sum_probs=167.5 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec-CcceeeeeeCCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI-GRTKAAYLQPGESL 79 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (344) +..++. .++ ....+...+..+.+..++.+..+..|.++++.+...+. +..++||+. +.+.+.-+..|+.+ T Consensus 21 ~~~~~a-~~~-------~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~~~ 91 (324) T protein:vir:93 21 PQVFNP-DNV-------MMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEGQKI 91 (324) T ss_pred hhhccc-ccc-------cccCCCcceechhHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEecCcceeeecCCccc Confidence 222211 111 11112223678999999999999999999998887765 445778876 66677777788888 Q ss_pred CCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccc Q lcl|NC_015719. 80 DDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGLG 159 (344) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~~ 159 (344) +.. .++.+++++..-+.. .-+.|.+-=-.++.+|+.+.+.++.++++++..|+.+|.- ... . ..+.+.. 
T Consensus 92 ~~~--~~~f~~i~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G----~g~-~---~~~~~~~ 160 (324) T protein:vir:93 92 ETS--KATWVNATMRAFKLG-VILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILN----QGN-N---PFGKSIA 160 (324) T ss_pred ccc--ccceeEEEEEeEEEE-EeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcC----CCC-C---CcCcccc Confidence 754 356677777665443 3355655222235689999999999999999999988631 110 0 0111110 Q ss_pred CceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccceeEE Q lcl|NC_015719. 160 KPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSIRN 239 (344) Q Consensus 160 ~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg~ 239 (344) .. ...... ...+...++.|.++...|..++... ..++++|..|..|.+-. +-.|.-.+..|..++ T Consensus 161 ~~----~~~~~~----~~~~~~~~~~i~~~~~~l~~~~~~~--~~~v~n~~~~~~L~~l~-----d~~G~~~~~~~~~~~ 225 (324) T protein:vir:93 161 QS----IEKTNK----VIKGDFTQDNIIDLEALLEDDELEA--NAFISKTQNRSLLRKIV-----DPETKERIYDRNSDS 225 (324) T ss_pred cc----ccccce----eccccccHHHHHHHHHhhhhccCCC--CEEEEcHHHHHHHHHhh-----CCCCCeeecCCCCCc Confidence 00 000000 0111223778888888888877633 36889999999887531 222333445566678 Q ss_pred EeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecch------ Q lcl|NC_015719. 240 VMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAE------ 313 (344) Q Consensus 240 i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~------ 313 (344) ++|.+|+.+++.+.+... .+.+++ +-+..+..++++++..++.. T Consensus 226 l~G~PVv~~~~~~~~~~~------------------i~~gdf------------s~~~~~~~~~~~i~~~~~~~~~~~~~ 275 (324) T protein:vir:93 226 LDGLPVVNLKSSNLKRGE------------------LITGDF------------DKLIYGIPQLIEYKIDETAQLSTVKN 275 (324) T ss_pred ccceeeEeecCCCCCcce------------------EEEEec------------ceEEEEEecCcEEEEeeccccccccc Confidence 999999988765432110 111111 11112334455666655431 Q ss_pred --------hh--hhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
314 --------YQ--ADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 314 --------~~--~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +. .-.+++.+++|.++++|++.+.|+....- T Consensus 276 ~~~~~~~~f~~n~~~~r~~~r~d~~v~~~~a~~~l~~a~~~ 316 (324) T protein:vir:93 276 EDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKR 316 (324) T ss_pred ccccchhhhhcCcEEEEEEEEeccEEecccceEEEeccccc Confidence 11 24578888999999999999888633222 No 80 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=99.46 E-value=1.4e-14 Score=96.55 Aligned_cols=287 Identities=10% Similarity=0.017 Sum_probs=164.3 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec-CcceeeeeeCCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI-GRTKAAYLQPGESL 79 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (344) ||..+.+. .+..++++.++.+..+..|.++.+.+...+.+ .+++||+. +.+.+..+..|+.+ T Consensus 1 m~t~t~gg----------------~liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~-~~~~ip~~~~~~~a~wv~E~~~~ 63 (303) T protein:vir:97 1 MGTETSKA----------------SLFDKHLVSDLINKVKGHSSLAKLSSQKPIPF-NGSKEFTFTLDSDIDVVAENGKK 63 (303) T ss_pred CcccCCCC----------------eEcchhHHHHHHHHHHhhchhhhhcceeecCC-CceEEEEEecCcceEEeecCccc Confidence 77543211 25778999999999999999999998887754 45788775 56677777778777 Q ss_pred CCCcCCcccceEEEEeeeeeeeceeccchHHH-----HhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|NC_015719. 80 DDKRKDIKHTEKTINIDGLLTADVLIYDIEDA-----MNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNEN 154 (344) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~-----q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~ 154 (344) +.+ .++-+++++..-+. .....|.+ |. ....++.+.+.++.+++|++..|+.++.... ........ 
T Consensus 64 ~~s--~~~f~~v~l~~~kl-~~~~~iS~--ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~----~~~g~~~~ 134 (303) T protein:vir:97 64 THG--GLSLEPVTIVPIKV-EYGARLSD--EFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGIN----PRTKKASD 134 (303) T ss_pred ccc--ccceeeEEeeeEEE-EEeehhhH--HHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccc----cCCccccc Confidence 654 35566666654333 22234433 32 2356789999999999999999999874321 00000000 Q ss_pred cccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcccccccccc Q lcl|NC_015719. 155 IAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPER 234 (344) Q Consensus 155 ~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~ 234 (344) + .+.....+..+... ........++.|.++...+...+... ..++++|..+..|.+-..-............. T Consensus 135 ~----~~~~~~~~~~~~~~-~~~~~~~~~~~i~~~~~~~~~~~~~~--~~~vmn~~~~~~L~~lkd~~g~~~~~~~~~~~ 207 (303) T protein:vir:97 135 V----IGTNHFDSKVTQVV-KFTESEDADANIEAAVNLIQGAEGVV--TGLAMDTEFSTALAKVTNGEMGPKMYPELAWG 207 (303) T ss_pred c----cccccccccccccc-ccccccchHHHHHHHHHHHhhcCCCc--cEEEEcHHHHHHHHHhhccCCCeEEecCccCC Confidence 0 11111001111000 00111224778888888887776643 34888999999887532221111111111223 Q ss_pred ceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeee--cc Q lcl|NC_015719. 235 GSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERAR--RA 312 (344) Q Consensus 235 G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~--~~ 312 (344) +..++++|.+|+.|+++|........ ....+-+++ ++++.....+.+++|... ++ T Consensus 208 ~~~~~l~G~Pv~~s~~v~~~~~~~~~------------~~~~~~Gdf-----------~~~~~~~~~~~~~~~~~~~~~~ 264 (303) T protein:vir:97 208 ANPDSINGLKSSVNTTVGAGADEAES------------KDLVIIGDF-----------ESMFKWGYAKQIPMEIIKYGDP 264 (303) T ss_pred CCCceecceeeEEecccCCccccCCC------------ccEEEEeec-----------cccEEEEEecCcEEEEeeccCC Confidence 45578999999999999864322110 001111111 111112223334444432 11 Q ss_pred h------hhhh--hhhhhhhhcCceeccccEEEEEecCC Q lcl|NC_015719. 
313 E------YQAD--QIIAKYAMGHGGLRPESAGALVFKAG 343 (344) Q Consensus 313 ~------~~~d--~i~~~~~~G~~v~Rp~~~~~l~~~~~ 343 (344) + +..| .+++..+++.++++|++.+.|+-..= T Consensus 265 d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~~~~ 303 (303) T protein:vir:97 265 DNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTKGEV 303 (303) T ss_pred CCcchhhhhcCcEEEEEEEEeccEeecccceEEeeCCCC Confidence 1 2222 47778899999999999998876666 No 81 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=99.45 E-value=6.4e-15 Score=98.39 Aligned_cols=287 Identities=16% Similarity=0.145 Sum_probs=167.6 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecC--cceeeeeeCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIG--RTKAAYLQPGES 78 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG--~~t~~~~~~g~~ 78 (344) +.+..... .+..-....++.-.+.++.+..++.+.....+.++.+++...+. +.++++|+.. ..++..+..|+. T Consensus 93 ~~~~~~~~---~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~a~~v~E~~~ 168 (385) T protein:vir:18 93 QGTFGAKT---FNKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTS-SNALEYVREEVFTNNADVVAEKAL 168 (385) T ss_pred hccchhhH---HHhhhccccccCCceecchhhhHHHHHhhhccchhhhcceeccc-CcceEEEEEecCCcceeeeccCcc Confidence 11100000 00000111111112456788899999999999999998887764 5578888863 345555666777 Q ss_pred CCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccc Q lcl|NC_015719. 79 LDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGL 158 (344) Q Consensus 79 ~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~ 158 (344) ++.. .++..++++.+.+.-. .+.|.+ +-.+...++.+.+.++.++++++..|+.++.- .. .+..+.|. 
T Consensus 169 ~~~~--~~~~~~~~~~~~k~~~-~~~is~-ell~d~~~l~~~i~~~la~a~~~~~d~~~l~G----~g----~~~~~~Gi 236 (385) T protein:vir:18 169 KPES--DITFSKQTANVKTIAH-WVQASR-QVMDDAPMLQSYINNRLMYGLALKEEGQLLNG----DG----TGDNLEGL 236 (385) T ss_pred cccc--ccceeEEEEeeeeEEE-eehhhH-HHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhc----cC----CCCccccc Confidence 7654 3566777777766543 345654 33344566889999999999999999988621 11 11111111 Q ss_pred cCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccceeE Q lcl|NC_015719. 159 GKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSIR 238 (344) Q Consensus 159 ~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg 238 (344) . ........+ ........++.|.++...|...+.+. -.++++|..|..|.+-..- +..+.-. ....|..+ T Consensus 237 ~-----~~~~~~~~~-~~~~~~~~~d~i~~~~~~l~~~~~~~--~~~~~~~~~~~~l~~lkd~-~G~~l~~-~~~~~~~~ 306 (385) T protein:vir:18 237 N-----KVATAYDTS-LNATGDTRADIIAHAIYQVTESEFSA--SGIVLNPRDWHNIALLKDN-EGRYIFG-GPQAFTSN 306 (385) T ss_pred c-----ccccccccc-ccccccchHHHHHHHHHhhccccCCC--CEEEEcHHHHHHHHHhhcC-CCceecc-CcccCCCc Confidence 1 111100000 01112234788888888887776543 3678999999988753321 2222221 23466678 Q ss_pred EEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecch--hhh Q lcl|NC_015719. 239 NVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAE--YQA 316 (344) Q Consensus 239 ~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~--~~~ 316 (344) .++|.+|+.|+.+|.+... -+++ .. ..++ +..+.++++..+... +.. T Consensus 307 ~l~G~pV~~~~~~p~~~~~--------------------~gd~--~~-~~~~--------~~~~~~~v~~~~~~~~~~~~ 355 (385) T protein:vir:18 307 IMWGLPVVPTKAQAAGTFT--------------------VGGF--DM-ASQV--------WDRMDATVEVSREDRDNFVK 355 (385) T ss_pred eecceeeEEcCcCCCCcEE--------------------Eeec--cc-EEEE--------EEecceEEEEeccccchhhc Confidence 9999999999999854210 0111 11 1112 223344555544331 222 Q ss_pred h--hhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
317 D--QIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 317 d--~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) + .++..+++|.++++|++.+.+++++.| T Consensus 356 ~~~~~~~~~r~~~~v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:18 356 NMLTILCEERLALAHYRPTAIIKGTFSSGS 385 (385) T ss_pred CcEEEEEEEeeccEEecccceEEEEeccCC Confidence 3 457788999999999999999999999 No 82 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=99.45 E-value=6.4e-15 Score=98.39 Aligned_cols=287 Identities=16% Similarity=0.145 Sum_probs=167.6 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecC--cceeeeeeCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIG--RTKAAYLQPGES 78 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG--~~t~~~~~~g~~ 78 (344) +.+..... .+..-....++.-.+.++.+..++.+.....+.++.+++...+. +.++++|+.. ..++..+..|+. T Consensus 93 ~~~~~~~~---~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~a~~v~E~~~ 168 (385) T protein:vir:19 93 QGTFGAKT---FNKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTS-SNALEYVREEVFTNNADVVAEKAL 168 (385) T ss_pred hccchhhH---HHhhhccccccCCceecchhhhHHHHHhhhccchhhhcceeccc-CcceEEEEEecCCcceeeeccCcc Confidence 11100000 00000111111112456788899999999999999998887764 5578888863 345555666777 Q ss_pred CCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccc Q lcl|NC_015719. 79 LDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGL 158 (344) Q Consensus 79 ~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~ 158 (344) ++.. .++..++++.+.+.-. .+.|.+ +-.+...++.+.+.++.++++++..|+.++.- .. .+..+.|. 
T Consensus 169 ~~~~--~~~~~~~~~~~~k~~~-~~~is~-ell~d~~~l~~~i~~~la~a~~~~~d~~~l~G----~g----~~~~~~Gi 236 (385) T protein:vir:19 169 KPES--DITFSKQTANVKTIAH-WVQASR-QVMDDAPMLQSYINNRLMYGLALKEEGQLLNG----DG----TGDNLEGL 236 (385) T ss_pred cccc--ccceeEEEEeeeeEEE-eehhhH-HHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhc----cC----CCCccccc Confidence 7654 3566777777766543 345654 33344566889999999999999999988621 11 11111111 Q ss_pred cCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccceeE Q lcl|NC_015719. 159 GKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSIR 238 (344) Q Consensus 159 ~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg 238 (344) . ........+ ........++.|.++...|...+.+. -.++++|..|..|.+-..- +..+.-. ....|..+ T Consensus 237 ~-----~~~~~~~~~-~~~~~~~~~d~i~~~~~~l~~~~~~~--~~~~~~~~~~~~l~~lkd~-~G~~l~~-~~~~~~~~ 306 (385) T protein:vir:19 237 N-----KVATAYDTS-LNATGDTRADIIAHAIYQVTESEFSA--SGIVLNPRDWHNIALLKDN-EGRYIFG-GPQAFTSN 306 (385) T ss_pred c-----ccccccccc-ccccccchHHHHHHHHHhhccccCCC--CEEEEcHHHHHHHHHhhcC-CCceecc-CcccCCCc Confidence 1 111100000 01112234788888888887776543 3678999999988753321 2222221 23466678 Q ss_pred EEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecch--hhh Q lcl|NC_015719. 239 NVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAE--YQA 316 (344) Q Consensus 239 ~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~--~~~ 316 (344) .++|.+|+.|+.+|.+... -+++ .. ..++ +..+.++++..+... +.. T Consensus 307 ~l~G~pV~~~~~~p~~~~~--------------------~gd~--~~-~~~~--------~~~~~~~v~~~~~~~~~~~~ 355 (385) T protein:vir:19 307 IMWGLPVVPTKAQAAGTFT--------------------VGGF--DM-ASQV--------WDRMDATVEVSREDRDNFVK 355 (385) T ss_pred eecceeeEEcCcCCCCcEE--------------------Eeec--cc-EEEE--------EEecceEEEEeccccchhhc Confidence 9999999999999854210 0111 11 1112 223344555544331 222 Q ss_pred h--hhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
317 D--QIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 317 d--~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) + .++..+++|.++++|++.+.+++++.| T Consensus 356 ~~~~~~~~~r~~~~v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:19 356 NMLTILCEERLALAHYRPTAIIKGTFSSGS 385 (385) T ss_pred CcEEEEEEEeeccEEecccceEEEEeccCC Confidence 3 457788999999999999999999999 No 83 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=99.44 E-value=8.1e-15 Score=97.84 Aligned_cols=287 Identities=16% Similarity=0.105 Sum_probs=172.2 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecC--cceeeeeeCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIG--RTKAAYLQPGES 78 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG--~~t~~~~~~g~~ 78 (344) ............+.+.....++.-.+.++.+...+.+..+..+.++++++...+. +.++++++.. ..++..+..|+. T Consensus 99 ~~~~~~~~~~~~~~~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~~~~-~~~~~~~~~~~~~~~a~~v~Eg~~ 177 (390) T protein:vir:97 99 SARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTD-SALIEYVQETGFVNNAAIVAEGAL 177 (390) T ss_pred hhhhhhHHHHHHHhhhcccccccccccchhhhHHHHHHHhhhhhhHhhcceeecc-CCceEEEEEecCCcceeeecCCcc Confidence 0000000000011111222233334677889999999999999999998887775 4457777763 345666777887 Q ss_pred CCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccc Q lcl|NC_015719. 79 LDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGL 158 (344) Q Consensus 79 ~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~ 158 (344) ++.. .++..++++.+.+.. .-..|.+ +-.+.+.++.+.+.++.++++++..|+.++.. . ..+..+.|. 
T Consensus 178 ~~~~--~~~~~~i~~~~~k~~-~~~~is~-ell~ds~~l~~~i~~~la~a~~~~~d~a~l~G----~----g~~~~p~Gi 245 (390) T protein:vir:97 178 KPES--SLKFAKKTDTTHVIA-HTMKATR-QILSDAPQLASYMNNRLIRGLKVKEDAEILRG----T----GANDGLLGL 245 (390) T ss_pred cccc--ccceeEEEEeeeeEE-EeehhhH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhc----C----CCCccccce Confidence 7654 456777778776654 3345655 23334567999999999999999999988631 1 111112221 Q ss_pred cCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccceeE Q lcl|NC_015719. 159 GKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSIR 238 (344) Q Consensus 159 ~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg 238 (344) .. ..+..+..+ ...+...++.|.++...+...+.+.. .+|++|..|..|.+-.. .+..|.-.. ...|..+ T Consensus 246 ~~----~~~~~~~~~--~~~~~~~~d~~~~~~~~~~~~~~~~~--~~v~n~~~~~~L~~lkd-~~G~~l~~~-~~~~~~~ 315 (390) T protein:vir:97 246 IP----QATTYAAPT--TIAGATRVDQLRLAMLQASLAEYPAS--GIVINPIDWAAIELAKD-ANNQYLIGN-ARGTLTP 315 (390) T ss_pred ee----ccccccccc--cccccchHHHHHHHHHhhccccCCCC--EEEEcHHHHHHHHHhhc-CCCceeecC-ccCCCCc Confidence 11 111111111 11123346788888888888888644 56789999998875332 122222111 2345557 Q ss_pred EEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecc-hhhhh Q lcl|NC_015719. 239 NVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRA-EYQAD 317 (344) Q Consensus 239 ~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~-~~~~d 317 (344) +++|.+|+.|+.+|.+.. .-+++ +.+...+..+.++++..++. .+..+ T Consensus 316 ~l~G~pV~~~~~~~~~~~--------------------~~gd~-----------~~~~~~~~~~~~~i~~~~~~~~f~~~ 364 (390) T protein:vir:97 316 TLWGLPVVATQAMAPGEF--------------------LVGAF-----------DLAAQIFDQWDARVEIGYVNDDFQRN 364 (390) T ss_pred eecceeeEEcCCCCCCcE--------------------EEEec-----------cceEEEEEecceEEEEeecccccccC Confidence 899999999999985321 01111 11222233455677777654 44455 Q ss_pred h--hhhhhhhcCceeccccEEEEEec Q lcl|NC_015719. 
318 Q--IIAKYAMGHGGLRPESAGALVFK 341 (344) Q Consensus 318 ~--i~~~~~~G~~v~Rp~~~~~l~~~ 341 (344) . ++...+||.++++|++.+.+.+. T Consensus 365 ~~~~r~~~r~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 365 MVTVLAEERLALVVYRPEALITGSFA 390 (390) T ss_pred cEEEEEEEeeccEEeccccEEEEEeC Confidence 4 66778999999999999999999 No 84 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=99.44 E-value=2.3e-14 Score=95.41 Aligned_cols=282 Identities=12% Similarity=0.016 Sum_probs=167.8 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec-CcceeeeeeCCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI-GRTKAAYLQPGESL 79 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (344) ||- ++ | .+..++|..++.+..++.|+++.+.+...+.+| .++||++ +.+++..+..|+.+ T Consensus 1 ma~-~g--------------G---~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~-~~~~p~~~~~~~a~~v~Eg~~~ 61 (298) T protein:vir:94 1 MVL-NK--------------G---TLFDPELVTDLISKVAGKSSIARLSAQKPIPFN-GEKVFTFTMDSEIDVVAESGKK 61 (298) T ss_pred Cee-cc--------------c---cccChhHHHHHHHHHHhhchhhhhcceeeccCC-ceEEEEEecCcceEEeeCCccc Confidence 655 21 1 156688999999999999999999887777654 4788886 66677888888877 Q ss_pred CCCcCCcccceEEEEeeeeeeeceeccchHHHH-----hChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|NC_015719. 80 DDKRKDIKHTEKTINIDGLLTADVLIYDIEDAM-----NHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNEN 154 (344) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q-----~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~ 154 (344) +.. .++.+++++..-+.. ..+.|.+ |.. ...++.+.+.++.+++|++.+|+.++..... . .... 
T Consensus 62 ~~~--~~~f~~v~l~~~k~~-~~~~iS~--ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~----~--~g~~ 130 (298) T protein:vir:94 62 THG--GVTLAPQTMVPIKVE-YGARISD--EFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNP----R--LGTA 130 (298) T ss_pred ccc--ccceeEEEEeeeEEE-EeeehhH--HHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhccccc----C--CCcc Confidence 754 456677777654443 3344543 222 3457889999999999999999998742110 0 0000 Q ss_pred cccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcccccccccc Q lcl|NC_015719. 155 IAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPER 234 (344) Q Consensus 155 ~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~ 234 (344) ..+. +........+...........+++.|.++..+|..++.... .++++|..+..|.+-..- +..+.-...... T Consensus 131 ~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~vmn~~~~~~l~~lkd~-~G~~l~~~~~~~ 205 (298) T protein:vir:94 131 SAVI--GTNHFDSKVTQKVEAPRGIADPNGAIENAVELLTGVDADVT--GIAINPSFRSALAKQKDL-QGNALFPELKWG 205 (298) T ss_pred cccc--cccccccccccccccccccccHHHHHHHHHHhhhhcCCCcc--EEEEcHHHHHHHHHhhcc-CCCeeecCcccC Confidence 1110 00000111111111112223457788899999988887533 689999999988653221 222322234456 Q ss_pred ceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeec--c Q lcl|NC_015719. 235 GSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARR--A 312 (344) Q Consensus 235 G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~--~ 312 (344) |..++++|++|+.++++|....+.. ...+.+++. .++..+..++++++..+. + T Consensus 206 ~~~~tl~G~PV~~~~~v~~~~~~~~--------------~~~~~Gdfs-----------~~~~~~~~~~~~~~~~~~~~~ 260 (298) T protein:vir:94 206 ATPDTINGLPVDVNKTVSDMSLTQR--------------DRAIIGDFA-----------NGFKWGYAKEVPLEVIQYGDP 260 (298) T ss_pred CCCceecceeeEEecccccccCCCc--------------cEEEEeecc-----------ceEEEEEecCceEEEeecCCC Confidence 7778999999999999985432110 011112211 111112223344444332 2 Q ss_pred h------hhhh--hhhhhhhhcCceeccccEEEEEecC Q lcl|NC_015719. 
313 E------YQAD--QIIAKYAMGHGGLRPESAGALVFKA 342 (344) Q Consensus 313 ~------~~~d--~i~~~~~~G~~v~Rp~~~~~l~~~~ 342 (344) + ++.| .+++.+++|.+++||++.+.|+-.- T Consensus 261 d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 261 DNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred cCcchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 1 2223 3677889999999999988886544 No 85 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=99.44 E-value=7.9e-15 Score=97.89 Aligned_cols=285 Identities=11% Similarity=0.031 Sum_probs=167.5 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec-CcceeeeeeCCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI-GRTKAAYLQPGESL 79 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (344) +.+...... .+......+++...+..+.|..++.+..+..|.++.+.+..++. |.+++||+. +.+.+.-+..|+.+ T Consensus 15 ~~~~~~~~~--~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~-~~~~~~p~~~~~~~a~~v~Eg~~~ 91 (324) T protein:vir:96 15 ASNNVKPQV--FNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEGQKI 91 (324) T ss_pred HHHhhhhhh--hccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEecCcceeEecCCccc Confidence 111111000 00001111222234677899999999999999999998887765 556888887 55666777778888 Q ss_pred CCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccc Q lcl|NC_015719. 80 DDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGLG 159 (344) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~~ 159 (344) +.. .++.+++++..-+.. .-..|.+-=-.++.+|+.+.+.++.++++++..|+.++.-- ... ..+.+.. 
T Consensus 92 ~~~--~~~~~~v~~~~~k~~-~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~----g~~----~~~~gi~ 160 (324) T protein:vir:96 92 ETS--KATWVNATMRAFKLG-VILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ----GNN----PFGKSIA 160 (324) T ss_pred ccc--ccceeEEEEeeEEEE-EeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccC----CCC----CcCcccc Confidence 754 456777777765443 33456552222346899999999999999999999886311 100 1111111 Q ss_pred CceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccceeEE Q lcl|NC_015719. 160 KPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSIRN 239 (344) Q Consensus 160 ~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg~ 239 (344) .. . ...... ..+...++.|+++...|..++... ..++++|..|..|.+-.. -.+...+..|..++ T Consensus 161 ~~--~--~~~~~~----~~~~~t~~~i~~~~~~l~~~~~~~--~~~vmn~~~~~~L~~l~d-----~~G~~~~~~~~~~~ 225 (324) T protein:vir:96 161 QS--I--EKTNKV----IKGDFTQDNIIDLEALLEDDELEA--NAFISKTQNRSLLRKIVD-----PETKERIYDRNSDS 225 (324) T ss_pred cc--c--ccccee----ccccccHHHHHHHHHhhhhccCCC--CEEEEcHHHHHHHHHhhc-----cCCCeeecCCCCCc Confidence 00 0 000000 111223778888888888877643 357899999998875321 12223345566678 Q ss_pred EeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecch------ Q lcl|NC_015719. 240 VMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAE------ 313 (344) Q Consensus 240 i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~------ 313 (344) ++|.+|+.++..+.+... ...++ .+-+..+..+++++|..++.. T Consensus 226 l~G~PV~~~~~~~~~~~~------------------~~~gd------------~~~~~~g~~~~~~i~~~~~~~~~~~~~ 275 (324) T protein:vir:96 226 LDGLPVVNLKSSNLKRGE------------------LITGD------------FDKLIYGIPQLIEYKIDETAQLSTVKN 275 (324) T ss_pred ccceeeEeeCCCCCCcce------------------EEEEe------------cceEEEEEecCcEEEEeeccccccccc Confidence 999999988765432110 11111 111112334455666655431 Q ss_pred --------hh--hhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
314 --------YQ--ADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 314 --------~~--~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +. .-.+++.+++|.+++||++.+.|+..... T Consensus 276 ~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~ 316 (324) T protein:vir:96 276 EDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKR 316 (324) T ss_pred ccccchhhhhcCcEEEEEEEEEccEEecccceEEEeccccc Confidence 12 24456778899999999999888753333 No 86 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=99.44 E-value=7.9e-15 Score=97.89 Aligned_cols=285 Identities=11% Similarity=0.031 Sum_probs=167.5 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec-CcceeeeeeCCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI-GRTKAAYLQPGESL 79 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (344) +.+...... .+......+++...+..+.|..++.+..+..|.++.+.+..++. |.+++||+. +.+.+.-+..|+.+ T Consensus 15 ~~~~~~~~~--~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~-~~~~~~p~~~~~~~a~~v~Eg~~~ 91 (324) T protein:vir:78 15 ASNNVKPQV--FNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEGQKI 91 (324) T ss_pred HHHhhhhhh--hccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEecCcceeEecCCccc Confidence 111111000 00001111222234677899999999999999999998887765 556888887 55666777778888 Q ss_pred CCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccc Q lcl|NC_015719. 80 DDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGLG 159 (344) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~~ 159 (344) +.. .++.+++++..-+.. .-..|.+-=-.++.+|+.+.+.++.++++++..|+.++.-- ... ..+.+.. 
T Consensus 92 ~~~--~~~~~~v~~~~~k~~-~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~----g~~----~~~~gi~ 160 (324) T protein:vir:78 92 ETS--KATWVNATMRAFKLG-VILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ----GNN----PFGKSIA 160 (324) T ss_pred ccc--ccceeEEEEeeEEEE-EeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccC----CCC----CcCcccc Confidence 754 456777777765443 33456552222346899999999999999999999886311 100 1111111 Q ss_pred CceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccceeEE Q lcl|NC_015719. 160 KPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSIRN 239 (344) Q Consensus 160 ~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg~ 239 (344) .. . ...... ..+...++.|+++...|..++... ..++++|..|..|.+-.. -.+...+..|..++ T Consensus 161 ~~--~--~~~~~~----~~~~~t~~~i~~~~~~l~~~~~~~--~~~vmn~~~~~~L~~l~d-----~~G~~~~~~~~~~~ 225 (324) T protein:vir:78 161 QS--I--EKTNKV----IKGDFTQDNIIDLEALLEDDELEA--NAFISKTQNRSLLRKIVD-----PETKERIYDRNSDS 225 (324) T ss_pred cc--c--ccccee----ccccccHHHHHHHHHhhhhccCCC--CEEEEcHHHHHHHHHhhc-----cCCCeeecCCCCCc Confidence 00 0 000000 111223778888888888877643 357899999998875321 12223345566678 Q ss_pred EeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecch------ Q lcl|NC_015719. 240 VMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAE------ 313 (344) Q Consensus 240 i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~------ 313 (344) ++|.+|+.++..+.+... ...++ .+-+..+..+++++|..++.. T Consensus 226 l~G~PV~~~~~~~~~~~~------------------~~~gd------------~~~~~~g~~~~~~i~~~~~~~~~~~~~ 275 (324) T protein:vir:78 226 LDGLPVVNLKSSNLKRGE------------------LITGD------------FDKLIYGIPQLIEYKIDETAQLSTVKN 275 (324) T ss_pred ccceeeEeeCCCCCCcce------------------EEEEe------------cceEEEEEecCcEEEEeeccccccccc Confidence 999999988765432110 11111 111112334455666655431 Q ss_pred --------hh--hhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
314 --------YQ--ADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 314 --------~~--~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +. .-.+++.+++|.+++||++.+.|+..... T Consensus 276 ~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~ 316 (324) T protein:vir:78 276 EDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKR 316 (324) T ss_pred ccccchhhhhcCcEEEEEEEEEccEEecccceEEEeccccc Confidence 12 24456778899999999999888753333 No 87 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=99.43 E-value=1.9e-14 Score=95.75 Aligned_cols=291 Identities=14% Similarity=0.108 Sum_probs=168.5 Q ss_pred CCCcccccccc-ccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec-C-cceeeeeeCCC Q lcl|NC_015719. 1 MANMQGGQQLG-TNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI-G-RTKAAYLQPGE 77 (344) Q Consensus 1 ma~~~~~~~~~-~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G-~~t~~~~~~g~ 77 (344) |.....+.... -+......+++.-.+.++.|+.++.+..+..+.++++++...+. +.++.+++. + ..++..+..|+ T Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~-~~~~~~~~~~~~~~~a~~v~E~~ 176 (395) T protein:vir:43 98 TSSLRGSHRVSMPRSAITSIDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGTTE-SNSVEYVRETGFVNNAAPVSEGT 176 (395) T ss_pred HHHhhhhhhhhhhhhhhcccCCCCccccchhhHHHHHHHHHhhhhHHhhccceecC-CCceEEEEEecCCCceeeecCCc Confidence 11111111000 00000011111223677889999999999999999999988875 456778875 3 34555666677 Q ss_pred CCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc Q lcl|NC_015719. 78 SLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAG 157 (344) Q Consensus 78 ~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~ 157 (344) .++.. .++.+++++.+.+...+ +.|++ +-.+...++.+.+.++.+.++++..|..++.. ... 
+..+.| T Consensus 177 ~~~~~--~~~~~~i~~~~~k~~~~-~~is~-ell~d~~~l~~~v~~~la~a~~~~~d~~~l~G----~g~----~~~~~G 244 (395) T protein:vir:43 177 QKPYS--DLTFELENAPVRTIAHL-FKASR-QILDDASALQSYIDARARYGLMLVEECQLLYG----NGT----GANLHG 244 (395) T ss_pred ccccc--ccceeEEEEeeeeEEEe-ehhhH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhc----cCC----CCcccc Confidence 76644 45667777777666433 45654 33444557888889999999999999988631 111 111111 Q ss_pred ccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcccccccccccee Q lcl|NC_015719. 158 LGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSI 237 (344) Q Consensus 158 ~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V 237 (344) ......+. .............++.|.++...+...+.+. -.+|++|..|..|.+-..- +..|.-. ...+|.. T Consensus 245 i~~~~~~~----~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~--~~~vmn~~~~~~l~~lkd~-~G~~i~~-~~~~~~~ 316 (395) T protein:vir:43 245 IIPQAQAY----APPSGVVVTAEQRIDRIRLAILQAQLAEFPA--SGIVLNPIDWALIELNKDA-ENRYIIG-SPQNGTT 316 (395) T ss_pred cccccccc----ccccccccccchhHHHHHHHHHhhccccCCC--cEEEEcHHHHHHHHHhhcc-CCceecc-ccccCCC Confidence 11111000 0000111112335788888888888777643 3678999999988653221 2223222 2456667 Q ss_pred EEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecc--hhh Q lcl|NC_015719. 238 RNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRA--EYQ 315 (344) Q Consensus 238 g~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~--~~~ 315 (344) +.++|.+|+.++.+|.+.. . -+++ .. ..++ +....++++..+.. .+. T Consensus 317 ~~l~G~pVv~~~~~~~~~~---~-----------------~gd~--~~-~~~~--------~~~~~~~i~~~~~~~~~f~ 365 (395) T protein:vir:43 317 PTLWRLPVVETQAITQDEF---L-----------------TGAF--SL-GAQI--------FDRMDIEVLVSTENDKDFE 365 (395) T ss_pred ceecceeeEEcCCCCCCcE---E-----------------EEec--cc-eEEE--------EEecceEEEEeccccchhh Confidence 7899999999999985431 0 0111 11 0111 11223445555432 222 Q ss_pred hh--hhhhhhhhcCceeccccEEEEEecCC Q lcl|NC_015719. 
316 AD--QIIAKYAMGHGGLRPESAGALVFKAG 343 (344) Q Consensus 316 ~d--~i~~~~~~G~~v~Rp~~~~~l~~~~~ 343 (344) .+ .++...++|.++++|++.+.+.+++. T Consensus 366 ~~~~~~r~~~r~d~~v~~~~a~~~~~~taa 395 (395) T protein:vir:43 366 NNMVTIRAEERLAFAVYRPEAFVTGSLTAS 395 (395) T ss_pred cCcEEEEEEEeeccEEecccceEEEEeccC Confidence 33 56777899999999999999988877 No 88 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=99.42 E-value=8.3e-15 Score=97.78 Aligned_cols=277 Identities=13% Similarity=0.093 Sum_probs=159.4 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec--CcceeeeeeCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI--GRTKAAYLQPGES 78 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i--G~~t~~~~~~g~~ 78 (344) +...............+..+++--.+..+.|..++++.....+.++++.++.++.++ +..+|.. +...+..+..|.. T Consensus 120 ~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~E~~~ 198 (400) T protein:vir:38 120 AVLRAVPTDASDAVNAGVKAADAASTIPETISNTPQRELQTVVDLKPFTNVFQASTQ-KGTYPTVANATTKMVTVAELEK 198 (400) T ss_pred hhhhhhhHHHHHHHhhcccccCCcccccHHHHHHHHHHHHhhhhhhhcceeEeccCc-ceEEEEEecCCCcccccccccc Confidence 000000000000000011111112356799999999999999999999998877544 3455544 4444555655655 Q ss_pred CCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccc Q lcl|NC_015719. 79 LDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGL 158 (344) Q Consensus 79 ~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~ 158 (344) .+.. ..+...++++.+-+.- .-+.|.+-=-.++.+|+.+.+.++.+++|+...|+.|+.... 
T Consensus 199 ~~~~-~~~~f~~i~~~~~k~~-~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~~~~---------------- 260 (400) T protein:vir:38 199 NPAM-AKPEFKPVNWSVETYR-QALPVSQESIDDSAIDLVGLIAQNGQQIKVNTTNGAVATLLK---------------- 260 (400) T ss_pred cccc-ccccceeeEeehhhee-eehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhhhhccc---------------- Confidence 5432 1234556565553332 223444311224568899999999999999999988863110 Q ss_pred cCceeeecccccccccchhhHHHHHHHHHHHHH-HHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcccccccccccee Q lcl|NC_015719. 159 GKPSLLEVGAKADLTDPVKLGQAVIAQLTIARA-ALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSI 237 (344) Q Consensus 159 ~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~-~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V 237 (344) .++ ..... .++.|.++.. .++ |...-.+|++|..|..|.+-.. .+..|.-...+..|.- T Consensus 261 -~~~------~~~~~--------~~~~~~~~~~~~~~----~~~~a~~v~~~~~~~~l~~lkd-~~G~~i~~~~~~~~~~ 320 (400) T protein:vir:38 261 -GFT------AKTIS--------SVDDLKHINNVDLD----PAYSRVIIASQSFYNFLDTVKD-GNGRYLLQDSILTPSG 320 (400) T ss_pred -ccc------ccccc--------cHHHHHHHHHhhhh----hhhCcEEEEcHHHHHHHHHhhc-cCCCeeeecCcCCCCc Confidence 000 00000 0333443322 222 1224567889999998875322 1223332234556767 Q ss_pred EEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEec-HHHHhhhhhheeeeeeeecchhhh Q lcl|NC_015719. 238 RNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQH-RSAVGTVKLKDLALERARRAEYQA 316 (344) Q Consensus 238 g~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~-~~Av~~~~~~~~~~e~~~~~~~~~ 316 (344) ++++|++|+.+++.|..+.+... .++.. +.++..+..+.++++..++ .++. T Consensus 321 ~~l~G~pv~~~~~~~~~~~g~~~---------------------------~~~gd~s~~~~~~~~~~~~~~~~~~-~~~~ 372 (400) T protein:vir:38 321 KSVLGMPIAVVSDDTLGAAGEAH---------------------------AFLGDIKRAILFANRADFMVRWVDD-QIYG 372 (400) T ss_pred cccccceeEEecccccCCCCceE---------------------------EEEEeccccEEEEeecceEEEEecc-cccc Confidence 79999999999998864432211 11212 1223333345566666654 5567 Q ss_pred hhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
317 DQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 317 d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ..+++.+++|+++++|++.+.|+++..| T Consensus 373 ~~~~~~~r~d~~~~~~~a~~~l~~~~~a 400 (400) T protein:vir:38 373 QFLQAGMRFGVSVADEKAGYFLTYTPKA 400 (400) T ss_pred eeEEEEEEeccEEecccceEEEEeecCC Confidence 7899999999999999999999999999 No 89 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=99.42 E-value=6.1e-14 Score=93.02 Aligned_cols=284 Identities=13% Similarity=0.036 Sum_probs=166.4 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec-CcceeeeeeCCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI-GRTKAAYLQPGESL 79 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (344) ||..+... | .+..++++.++.+..++.|.++.+.+...+.+| .+.+|+. +.+.+.-+..|+.+ T Consensus 1 ma~~t~~~------G---------~lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~-~~~~p~~~~~~~a~wv~Eg~~~ 64 (300) T protein:vir:95 1 MSEAQLSK------G---------NLFNPELVTKVINKVKGHSSIAKLSPQKPIPFN-GQREFVFDFDSDIDIVAENGKK 64 (300) T ss_pred CcccccCC------c---------ceechhhHHHHHHHHHhhhhhhhhcceeeccCC-ceEEEEEecCcceEEeeCCccc Confidence 99866421 1 156788999999999999999888877776554 4667764 55666777777777 Q ss_pred CCCcCCcccceEEEEeeeeeeeceeccchHHHH-----hChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|NC_015719. 80 DDKRKDIKHTEKTINIDGLLTADVLIYDIEDAM-----NHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNEN 154 (344) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q-----~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~ 154 (344) +.+ .++.+++++..-+. +.-..|.+ |.. ...|+.+.+.++.++++++..|+.++.-.. ...+.... 
T Consensus 65 ~~s--~~~f~~v~l~~~k~-~~~~~iS~--ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~----~~~g~~~~ 135 (300) T protein:vir:95 65 THG--GVSLDPVTIVPLKV-EYGARVSD--EFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGIN----PRTKQAST 135 (300) T ss_pred ccc--cccceeeEeeeEEE-EEeehhhH--HHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhccc----CCCCCCcc Confidence 654 35666766665433 23344543 322 347899999999999999999999973211 00000000 Q ss_pred cccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcccccccccc Q lcl|NC_015719. 155 IAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPER 234 (344) Q Consensus 155 ~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~ 234 (344) +.+ .....+.... +. ...+...++.|.++...+...+... ..++++|..+..|.+-..- +..+.-...... T Consensus 136 ~~~----~~~~~~~~~~-~~-~~~~~~~~~~i~~~~~~~~~~~~~~--~~~vmn~~~~~~L~~lkd~-~G~~i~~~~~~~ 206 (300) T protein:vir:95 136 IIG----DNCFDKKVTQ-TV-PFKDTNPDESMEDAVGMIDGSERDI--TGAILDPIFTTALSKMKNA-EGGKLYPELAWG 206 (300) T ss_pred ccc----ccccccccce-ee-cccccchHHHHHHHHHHhhhcCCCc--cEEEECHHHHHHHHHhhcc-CCCeeccCcccc Confidence 100 0000000000 00 0011223677888888888776532 2578999999988653222 122221233445 Q ss_pred ceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeee--ecc Q lcl|NC_015719. 235 GSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERA--RRA 312 (344) Q Consensus 235 G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~--~~~ 312 (344) |..++++|.+|+.|+.+|......... .+.+|+. +. + ..+..+.++++.. .++ T Consensus 207 ~~~~~l~G~Pv~~s~~v~~~~~~~~~~--------------~~~GDf~--~~--~-------~~~~~~~~~~~v~~~~~~ 261 (300) T protein:vir:95 207 GVPDAINGLAVDKNRTVSYSQTDPKNT--------------AIVGDFE--TM--F-------KWGYAKEVPMEIIKYGDP 261 (300) T ss_pred CCCceecceeeEEecCCCCCCCCCccE--------------EEEeecc--ce--E-------EEEEecccEEEEeeccCC Confidence 677899999999999998654322110 1112211 11 1 0111222233332 222 Q ss_pred h------hhh--hhhhhhhhhcCceeccccEEEEEecCC Q lcl|NC_015719. 
313 E------YQA--DQIIAKYAMGHGGLRPESAGALVFKAG 343 (344) Q Consensus 313 ~------~~~--d~i~~~~~~G~~v~Rp~~~~~l~~~~~ 343 (344) + ++. -.++..+++|.++++|++.+.|+-.+| T Consensus 262 d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~g 300 (300) T protein:vir:95 262 DNSGRDLKGYNQIYIRCEAYIGWGIMDAASFARIVKTGG 300 (300) T ss_pred CCcchhhhhcCcEEEEEEEeecceeecccceEEEecCCC Confidence 1 222 345788899999999999999999999 No 90 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=99.41 E-value=1.6e-14 Score=96.15 Aligned_cols=287 Identities=11% Similarity=0.020 Sum_probs=167.4 Q ss_pred CCCcccccc------------ccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec-Cc Q lcl|NC_015719. 1 MANMQGGQQ------------LGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI-GR 67 (344) Q Consensus 1 ma~~~~~~~------------~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~ 67 (344) |=..+.... ...+........+...+..+.|..++.+..++.+.++.+.+...+. +.+++||+. +. T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~ip~~~~~ 79 (324) T protein:vir:97 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADK 79 (324) T ss_pred CccchhHHHHHHHHHHhhhhhhhhccccccccCCCcceechhHHHHHHHHHHhhcchhhhcceeecc-CCceEEEEEecC Confidence 111110000 0001111111122234677999999999999999999998877764 556888887 55 Q ss_pred ceeeeeeCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|NC_015719. 68 TKAAYLQPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINL 147 (344) Q Consensus 68 ~t~~~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~ 147 (344) +.+.-+..|+.++.. .++.+++++..-+.. .-..|.+---.++.+++.+.+.++.++++++..|+.++.- .. 
T Consensus 80 ~~a~~v~Eg~~~~~~--~~~f~~v~~~~~k~~-~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G----~g- 151 (324) T protein:vir:97 80 PGAYWVGEGQKIETS--KATWVNATMRAFKLG-VILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILN----QG- 151 (324) T ss_pred cceeEeccCcccccc--ccceeEEEEeeEEEE-EeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcc----CC- Confidence 566667778877654 456777777665443 3345655222234688999999999999999999998731 10 Q ss_pred ccccccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccc Q lcl|NC_015719. 148 ADGVNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYA 227 (344) Q Consensus 148 ~~~~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~ 227 (344) . ...+.+. .-........ ......++.|+++...|...+... -.++++|..|..|.+-.. .. T Consensus 152 ~---~~~~~gi----~~~~~~~~~~----~~~~~~~~~i~~~~~~l~~~~~~~--~~~v~n~~~~~~L~~lkd-----~~ 213 (324) T protein:vir:97 152 N---NPFGKSI----AQSIEKTNKV----IKGDFTQDNIIDLEALLEDDELEA--NAFISKTQNRSLLRKIVD-----PE 213 (324) T ss_pred C---CccCccc----ccccccccee----ccccCCHHHHHHHHHhhhhccCCC--CEEEEcHHHHHHHHHhhc-----CC Confidence 0 0011111 0000111111 111223778888888888877643 357899999998875321 12 Q ss_pred cccccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeee Q lcl|NC_015719. 228 ALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALE 307 (344) Q Consensus 228 ~~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e 307 (344) +...+..+.-+.++|.+|+.++..+.+... .++...+-+..+..+++++| T Consensus 214 g~~~~~~~~~~tl~G~PV~~~~~~~~~~~~------------------------------~~~gd~~~~~i~~~~~~~i~ 263 (324) T protein:vir:97 214 TKERIYDRNSDTLDGLPVVNLKSSNLKRGE------------------------------LITGDFDKLIYGIPQLIEYK 263 (324) T ss_pred CceeecCCCCccccceeeEeecCCCCCcce------------------------------EEEEecccEEEEEecCcEEE Confidence 223334455568999999998776543211 01111111122334556666 Q ss_pred eeecch--------------hhh--hhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
308 RARRAE--------------YQA--DQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 308 ~~~~~~--------------~~~--d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ..++.. ++. -.++..+++|.++++|++.+.|+..... T Consensus 264 ~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~~~~ 316 (324) T protein:vir:97 264 IDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKK 316 (324) T ss_pred EeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccCC Confidence 665432 222 3456678899999999999998876554 No 91 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=99.41 E-value=4.2e-14 Score=93.93 Aligned_cols=274 Identities=12% Similarity=0.052 Sum_probs=166.7 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeeccc-ccEEEEeecC--cceeeeeeCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISS-GKSAQFPVIG--RTKAAYLQPGE 77 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~-G~tv~i~~iG--~~t~~~~~~g~ 77 (344) |.+..... |-. +--.+..++|..++.+..+..+.++++.+...+.+ ..+..|+... ...+.....|+ T Consensus 1 ~l~~~~~~---t~~-------~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~ 70 (293) T protein:vir:48 1 MLDSKTDH---SGS-------DAGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAG 70 (293) T ss_pred Cceeeccc---ccC-------cCceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCcceeeecCCc Confidence 33322111 111 11135779999999999999999999988877653 3356666543 33455666677 Q ss_pred CCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc Q lcl|NC_015719. 78 SLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAG 157 (344) Q Consensus 78 ~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~ 157 (344) .++.+. .++..++++...+.. 
..+.|.+-=-.++.+|+.+.+.++.++++++..|+.|+..+.+ T Consensus 71 ~~~~~~-~~~~~~i~l~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~-------------- 134 (293) T protein:vir:48 71 KIADID-DPKLSLIKYTIKRYA-GISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGVVDK-------------- 134 (293) T ss_pred cccccc-ccceeEEEEeeeEEE-EeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhcccc-------------- Confidence 765432 245567777665553 2345654222345789999999999999999999988742210 Q ss_pred ccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcccccccccccee Q lcl|NC_015719. 158 LGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSI 237 (344) Q Consensus 158 ~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V 237 (344) ++ .... ..-|+.|.++...|..+..+ .-.++++|..|..|.+-..- +..+.-...+.+|.. T Consensus 135 ---~~-----~~~~--------~~~~d~i~~~~~~l~~~~~~--~a~~vmn~~~~~~L~~lkd~-~g~~l~~~~~~~~~~ 195 (293) T protein:vir:48 135 ---LP-----TKPT--------LTKWDDIIDLEAKVDPAIKQ--TSFFLTNTSGFTALKKVKNA-LGDYLMERDVKSPTG 195 (293) T ss_pred ---cc-----cccc--------ccCHHHHHHHHHhhhhhhcC--CCEEEEcHHHHHHHHHhhcc-CCceEeecCcCCCCC Confidence 00 0000 01267788888888766553 33667899999988653322 222322334567777 Q ss_pred EEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEec-HHHHhhhhhheeeeeeeecc-hh- Q lcl|NC_015719. 238 RNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQH-RSAVGTVKLKDLALERARRA-EY- 314 (344) Q Consensus 238 g~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~-~~Av~~~~~~~~~~e~~~~~-~~- 314 (344) ++++|.+|+.+.+.+.+..+.. +.+.++.. ++++..+....++++..+.. ++ T Consensus 196 ~~l~G~Pv~~~~~~~~~~~~~~-------------------------~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~ 250 (293) T protein:vir:48 196 YSIAGFAVKEISDRWLPNASSG-------------------------VMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGAF 250 (293) T ss_pred ceecceeeEEecccccCCccCC-------------------------ceEEEEEeccceEEEEEecceEEEEecccchhh Confidence 8999999998766544322110 01112221 22333344455666665532 22 Q ss_pred hh--hhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
315 QA--DQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 315 ~~--d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) .. -.++...++|.++++|++.+.+++++.+ T Consensus 251 ~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~ 282 (293) T protein:vir:48 251 ETDTTKVRVIDRFDVVATDTEAFVPASFKAIA 282 (293) T ss_pred hcCeEEEEEEEeeCcEEecccceEEEEeeccc Confidence 22 3478889999999999999999988877 No 92 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=99.41 E-value=3.8e-14 Score=94.19 Aligned_cols=279 Identities=13% Similarity=0.052 Sum_probs=165.9 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec-CcceeeeeeCCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI-GRTKAAYLQPGESL 79 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (344) |.-..- ++.-...+++.-.|..++|..++.+.....+.++.+.+...+.++..+.++.. +.+.+..+..|+.+ T Consensus 1 m~~~~~------~~~~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~ 74 (297) T protein:vir:95 1 MTVQTF------NPENVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTDGISAYWVNETEKI 74 (297) T ss_pred CCcccc------ccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcCCceeEEeecCccc Confidence 433211 11111112222347889999999999999999999988877655544556644 55677778888888 Q ss_pred CCCcCCcccceEEEEeeeeeeeceeccchHHHH-hChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccc Q lcl|NC_015719. 80 DDKRKDIKHTEKTINIDGLLTADVLIYDIEDAM-NHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGL 158 (344) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q-~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~ 158 (344) +.. +++.+++++...+. .....|.+ +-.+ +..|+.+.+.++.++++++..|+.++.- ..... +.+. 
T Consensus 75 ~~~--~~~f~~v~l~~~k~-~~~~~is~-ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G----~g~~~-----~~gi 141 (297) T protein:vir:95 75 KTD--KPEVVPVTLKAHKL-GIILVTSR-EALNYTWKKFFEDMKPQIVEAFYKKIDEAGLLG----HDTPF-----ANSV 141 (297) T ss_pred ccc--ccceeEEEEeeEEE-EEeehhhH-HHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcc----cCCcc-----cccc Confidence 754 35677777766554 23355655 3333 5688999999999999999999998721 11111 1110 Q ss_pred cCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccceeE Q lcl|NC_015719. 159 GKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSIR 238 (344) Q Consensus 159 ~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg 238 (344) .. ....... ... ....|+.|+++..+|..++.+.. .++++|..|..|.+-.. ..| ..+.++..+ T Consensus 142 ~~----~~~~~~~-~~~---~~~t~~~i~~~~~~l~~~~~~~~--~~v~~~~~~~~L~~l~d-----~~G-~~i~~~~~~ 205 (297) T protein:vir:95 142 AK----AAKDANK-VIG---GPINYDNILKLQDALYDADVEPN--AFVSKIQNRSALREARD-----GNK-VSIYDKAAN 205 (297) T ss_pred cc----cccccce-ecc---cccCHHHHHHHHHHhhhccCCcC--EEEEcHHHHHHHHHhhc-----cCC-ceeecCCCC Confidence 00 0000000 000 11137788888888988877543 57889999998875211 111 122345557 Q ss_pred EEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecch----- Q lcl|NC_015719. 239 NVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAE----- 313 (344) Q Consensus 239 ~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~----- 313 (344) +++|.+|+.+++.+..... . ++...+.+..+...+++++..++.. T Consensus 206 ~l~G~Pv~~~~~~~~~~~~------------------~------------~~gd~s~~~~~~~~~~~i~~~~~~~~~~~~ 255 (297) T protein:vir:95 206 TIDGITTVDLKSARFEKGD------------------L------------LAGDFDNLIYGVPYNITYKISEEGQISTIT 255 (297) T ss_pred cccceeeEeecCCCCCCce------------------E------------EEEecccEEEEEecCeEEEEeecccccccc Confidence 8999999988765432211 0 1111111112333445555554431 Q ss_pred ---------hhhh--hhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
314 ---------YQAD--QIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 314 ---------~~~d--~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ++.| .++...++|.++++|++.+.|+...+- T Consensus 256 ~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~at~~ 297 (297) T protein:vir:95 256 NADGTPINLFEQEMIAIRATMDIAVMITKTDAFAKLTPAERV 297 (297) T ss_pred ccCccchhhhhcCcEEEEEEEEeccEeecccceEEEeecCCC Confidence 2223 356678999999999999998876666 No 93 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=99.40 E-value=3e-14 Score=94.68 Aligned_cols=282 Identities=12% Similarity=0.038 Sum_probs=163.8 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec-CcceeeeeeCCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI-GRTKAAYLQPGESL 79 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (344) |...+.. +........+...+..+.+..++.+...+.|.++.+.+..++. +.+++||+. +...+.-+..|..+ T Consensus 18 ~~~~~~~-----~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~~p~~~~~~~a~~v~Eg~~~ 91 (324) T protein:vir:10 18 NVKPQVF-----NPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEGQKI 91 (324) T ss_pred hhcccee-----cccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEeCCcceeEeccCccc Confidence 2111110 0010111222223678999999999999999999998887765 446888887 55667777788887 Q ss_pred CCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccc Q lcl|NC_015719. 80 DDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGLG 159 (344) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~~ 159 (344) +.. .++.+++++..-+.- .-..|.+-=-.++..|+.+.+.++.++++++..|+.++.-- .. +..+.+.. 
T Consensus 92 ~~~--~~~~~~v~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~a~l~G~----g~----~~~~~~i~ 160 (324) T protein:vir:10 92 ETS--KATWVNATMRAFKLG-VILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ----GN----NPFGKSIA 160 (324) T ss_pred ccc--ccceeEEEEeeEEEE-EeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcC----CC----CccCcccc Confidence 754 356677666654432 33445541122346889999999999999999999886311 10 01111111 Q ss_pred CceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccceeEE Q lcl|NC_015719. 160 KPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSIRN 239 (344) Q Consensus 160 ~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg~ 239 (344) .. +. ..... ..+...++.|.++...|..++.... .++++|..|..|.+-. +..+...+..|.-++ T Consensus 161 ~~--~~--~~~~~----~~~~~t~~~i~~~~~~l~~~~~~~~--~~v~n~~~~~~L~~l~-----d~~g~~~~~~~~~~~ 225 (324) T protein:vir:10 161 QS--IE--KTNKV----IKGDFTQDNIIDLEALLEDDELEAN--AFISKTQNRSLLRKIV-----DPETKERIYDRNSDT 225 (324) T ss_pred cc--cc--cccee----ccccCCHHHHHHHHHhhhhccCCCC--EEEEcHHHHHHHHHhh-----ccCCceeecCCCCcc Confidence 00 00 00100 1111236788888888888776333 5789999999887532 122223334455568 Q ss_pred EeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecch------ Q lcl|NC_015719. 240 VMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAE------ 313 (344) Q Consensus 240 i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~------ 313 (344) ++|.+|+.++..+.+... . ++...+-+..+..+++++|..++.. T Consensus 226 l~G~PV~~~~~~~~~~~~------------------~------------~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~ 275 (324) T protein:vir:10 226 LDGLPVVNLKSSNLKRGE------------------L------------ITGDFDKLIYGIPQLIEYKIDETAQLSTVKN 275 (324) T ss_pred ccceeEEeecCCCCCcce------------------E------------EEEecccEEEEEecCcEEEEeeccccccccc Confidence 999999988765532210 0 1111111112233445566554421 Q ss_pred --------hhh--hhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
314 --------YQA--DQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 314 --------~~~--d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +.. -.++..+++|.++++|++.+.|+..... T Consensus 276 ~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a~~~ 316 (324) T protein:vir:10 276 EDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKK 316 (324) T ss_pred ccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccCC Confidence 122 3456778899999999998888654333 No 94 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=99.40 E-value=3.9e-14 Score=94.11 Aligned_cols=282 Identities=12% Similarity=0.032 Sum_probs=166.1 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec-CcceeeeeeCCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI-GRTKAAYLQPGESL 79 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (344) |...+.. +........+...+..+.|..++.+...+.+.++.+.+..++. +.+++||+. +...+.-...|..+ T Consensus 18 ~~~~~~~-----~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~~p~~~~~~~a~~v~Eg~~~ 91 (324) T protein:vir:99 18 NVKPQVF-----NPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYEPME-GTEKKFTFWADKPGAYWVGEGQKI 91 (324) T ss_pred hhhhhhc-----cccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEecCcceeEeccCccc Confidence 1111110 0000111122223678999999999999999999998888765 456888886 45566777778887 Q ss_pred CCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccc Q lcl|NC_015719. 80 DDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGLG 159 (344) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~~ 159 (344) +.. .++.+++++..-+.- .-..|.+-=-.++..|+.+.+.++.++++++..|+.++.- ... +..+.+.. 
T Consensus 92 ~~~--~~~~~~v~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G----~g~----~~~~~~~~ 160 (324) T protein:vir:99 92 ETS--KATWVNATMRAFKLG-VILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILN----QGN----NPFGKSIA 160 (324) T ss_pred ccc--ccceeEEEEeeEEEE-EeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhc----CCC----CccCcccc Confidence 754 456677777664443 3345554212234688999999999999999999988631 110 01111111 Q ss_pred CceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccceeEE Q lcl|NC_015719. 160 KPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSIRN 239 (344) Q Consensus 160 ~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg~ 239 (344) . ........ ......++.|+++...|..++.... .++++|..|..|.+-. +..+...+..+.-++ T Consensus 161 ~----~~~~~~~~----~~~~~~~~~i~~~~~~l~~~~~~~~--~~v~n~~~~~~L~~l~-----d~~g~~~~~~~~~~~ 225 (324) T protein:vir:99 161 Q----SIEKTNKV----IKGDFTQDNIIDLEALLEDDELEAN--AFISKTQNRSLLRKIV-----DPETKERIYDRNSDT 225 (324) T ss_pred c----ccccccee----ccccCCHHHHHHHHHhhhhccCCCC--EEEEcHHHHHHHHHhh-----cCCCceeecCCCCcc Confidence 0 00111111 1112236788889888988776333 5789999999887532 222223334444568 Q ss_pred EeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecch------ Q lcl|NC_015719. 240 VMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAE------ 313 (344) Q Consensus 240 i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~------ 313 (344) ++|.+|+.++..+.+... . ++...+-+..+..+++++|..++.. T Consensus 226 l~G~PVv~~~~~~~~~~~------------------~------------i~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~ 275 (324) T protein:vir:99 226 LDGLPVVNLKSSNLKRGE------------------L------------ITGDFDKLIYGIPQLIEYKIDETAQLSTVKN 275 (324) T ss_pred ccceeEEeecCCCCCcce------------------E------------EEEecccEEEEEecCcEEEEeeccccccccc Confidence 999999998876543211 0 1111111112333445666655431 Q ss_pred --------hhh--hhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
314 --------YQA--DQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 314 --------~~~--d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ++. -.++..+++|.+++||++.+.|+..... T Consensus 276 ~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a~~~ 316 (324) T protein:vir:99 276 EDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKK 316 (324) T ss_pred ccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccCC Confidence 122 3456778899999999999998765544 No 95 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=99.39 E-value=1e-13 Score=91.80 Aligned_cols=281 Identities=12% Similarity=0.016 Sum_probs=164.9 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec-CcceeeeeeCCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI-GRTKAAYLQPGESL 79 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (344) ||...+ .|.++.+..++.+..+..|+++.+.+...+.+|+ +.||+. +.+++..+..|+.+ T Consensus 1 ma~~gG------------------~lvp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~-~~ip~~~~~~~a~~v~E~~~~ 61 (298) T protein:vir:16 1 MVLNKG------------------TLFDPTLVTDLISKVAGKSSIARLSAQKPIPFNG-EKVFTFTMDSEIDVVAESGKK 61 (298) T ss_pred CcccCc------------------ceechhHHHHHHHHHHhhhhhhhhcceeeccCCc-eEEEEEecCcceEEecCCccc Confidence 664221 1566788899999999999999999888776554 677774 56677778778777 Q ss_pred CCCcCCcccceEEEEeeeeeeec-eeccchHHH-----HhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|NC_015719. 80 DDKRKDIKHTEKTINIDGLLTAD-VLIYDIEDA-----MNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNE 153 (344) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~-~~Idd~D~~-----q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~ 153 (344) +.+ +++..++++.. .++.. ..|.+ |. .+..++.+.+.++.++++++..|+.++.... +... 
T Consensus 62 ~~~--~~~f~~v~l~~--~k~a~~~~iS~--ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~-------~~~g 128 (298) T protein:vir:16 62 THG--GVTLAPQTMVP--IKVEYGARISD--EFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVN-------PRLG 128 (298) T ss_pred ccc--ccceeEEEEee--eeEEEeehhhH--HHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhcccc-------CCCC Confidence 754 34556656655 33333 34433 32 2346789999999999999999999874211 0111 Q ss_pred ccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccc Q lcl|NC_015719. 154 NIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPE 233 (344) Q Consensus 154 ~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~ 233 (344) .+.+. .+.....+..+............++.|.++...+..++.+.. .++++|..+..|.+-..- +..+.-..... T Consensus 129 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~vmn~~~~~~l~~lkd~-~G~~i~~~~~~ 204 (298) T protein:vir:16 129 TASAV-IGTNHFDSKVTQKVEAPRGIADPNGAIENAVELLTGVDADVT--GIAINPSFRSALAKQKDL-QDNALFPELKW 204 (298) T ss_pred ccccc-ccccccccccccccccccccccHHHHHHHHHHHhhhcCCCcc--EEEEcHHHHHHHHHhhcc-CCCeeecCccc Confidence 11100 011011111111111112223346778888888888887543 477899999988764322 22232223445 Q ss_pred cceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeec-- Q lcl|NC_015719. 234 RGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARR-- 311 (344) Q Consensus 234 ~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~-- 311 (344) .|..++++|.+|+.++++|....+.. ...+.|++. . ++.......++++..++ T Consensus 205 ~~~~~~l~G~PV~~~~~v~~~~~~~~--------------~~~~~GDfs--~---------~~~~~~~~~~~~~~~~~~~ 259 (298) T protein:vir:16 205 GATPDTINGLPVDVNKTVSDMSLTQR--------------DRAIIGDFA--N---------GFKWGYAKEVPLEVIQYGD 259 (298) T ss_pred CCCCceecceeeEEecccccccCCCc--------------cEEEEeecc--c---------eEEEEEecCceEEEeeccC Confidence 67778999999999999985432110 111222221 1 11111222334444332 Q ss_pred ch------hhh--hhhhhhhhhcCceeccccEEEEEecC Q lcl|NC_015719. 
312 AE------YQA--DQIIAKYAMGHGGLRPESAGALVFKA 342 (344) Q Consensus 312 ~~------~~~--d~i~~~~~~G~~v~Rp~~~~~l~~~~ 342 (344) +. ++. -.++..+++|.+++||++.+.|+-.- T Consensus 260 ~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 260 PDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred CcCcchhhhhcCcEEEEEEEEEccEeecccceEEEeecC Confidence 21 122 33677889999999999988885444 No 96 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=99.39 E-value=5.6e-14 Score=93.24 Aligned_cols=288 Identities=15% Similarity=0.122 Sum_probs=164.3 Q ss_pred CCCcccc--ccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCc--ceeeeeeCC Q lcl|NC_015719. 1 MANMQGG--QQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGR--TKAAYLQPG 76 (344) Q Consensus 1 ma~~~~~--~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~--~t~~~~~~g 76 (344) |...... .....+.+. ..++.-.+..+.|+.++.+.....+.++++++...+. +.++.+++... .++.....| T Consensus 121 ~~~~~~~~~~~~~~~~~~--~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~a~~v~E~ 197 (418) T protein:vir:10 121 RVRVDRKSIMNVPATVGS--GVSGSNSLVVADRQAGIIAPPQRKMTIRDLLMPGQTS-SSSIEYTVETGFTNNAAAVAEG 197 (418) T ss_pred hhhhHHHHHHHhhhhccC--CCCCCccccchhHHHHHHHHHhhhhhHHhhcceeecc-CCceeEEEEecCCCceeeeccC Confidence 1110000 000000111 1112223678999999999999999999999887765 55677777533 455556667 Q ss_pred CCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Q lcl|NC_015719. 77 ESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIA 156 (344) Q Consensus 77 ~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (344) +.++.. .++.+++++...+... -..|.+ +-.+.+.|+.+.+.++.++++++..|+.++.- .. .+..|. 
T Consensus 198 ~~~~~~--~~~f~~v~~~~~k~~~-~~~is~-ell~ds~~l~~~i~~~l~~a~~~~~d~a~l~G----~g----~~~~p~ 265 (418) T protein:vir:10 198 AQKPTS--DLKFNLKNQPVRTIAH-LFKASR-QILDDAPALQSYIDGRARYGLQLTEEGQILKG----DG----TGANIL 265 (418) T ss_pred cccccc--ccceeeEEEeeeeEEE-eehhhH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHHhcc----CC----CCcccc Confidence 766543 3566676666655433 244554 33444568999999999999999999988621 10 111122 Q ss_pred cccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccce Q lcl|NC_015719. 157 GLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGS 236 (344) Q Consensus 157 ~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~ 236 (344) |............+ ......++.|+++...+...+.+.. .+|++|..|..|.+-..- +..|.-. ...+|. T Consensus 266 Gi~~~~~~~~~~~~------~~~~~~~~~i~~~~~~~~~~~~~~~--~~v~n~~~~~~L~~lkd~-~G~~i~~-~~~~~~ 335 (418) T protein:vir:10 266 GILPQASAFMPSIT------LANATPIDKIRLALLQAVLAEFPAT--GIVLNPIDWASIELTKDS-QGRYIVG-NPVNGT 335 (418) T ss_pred cccccccccccccc------ccccccHHHHHHHHHhhccccCCCC--EEEEcHHHHHHHHHhhcC-CCceecc-ccccCC Confidence 21111110000000 0011236677777777776665433 477899999988653221 2223222 234566 Q ss_pred eEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecch--h Q lcl|NC_015719. 237 IRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAE--Y 314 (344) Q Consensus 237 Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~--~ 314 (344) .++++|++|+.|+++|.+.. . -+++ .. ..+++ ....++++..++.. + T Consensus 336 ~~~l~G~pV~~~~~~p~~~~---~-----------------~gd~--s~-~~~~~--------~~~~~~i~~~~~~~~~f 384 (418) T protein:vir:10 336 TPRLWNLPVVETQAMTANEF---L-----------------VGAF--SM-AAQIF--------DRMEIEVLLSTENVDDF 384 (418) T ss_pred CceecceeeEEcCCCCCCcE---E-----------------Eeec--cc-eEEEE--------EecceEEEEecccchhh Confidence 78999999999999985431 0 0111 10 11112 12334555444332 2 Q ss_pred hh--hhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
315 QA--DQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 315 ~~--d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) .. -.+++.++++.++++|++.+.+..+.+| T Consensus 385 ~~~~~~~r~~~~~d~~~~~~~a~~~~~~~~~~ 416 (418) T protein:vir:10 385 EKNMVSIRAEERLALAVYRPESFVTGALVEQA 416 (418) T ss_pred hcCceEEEEEEeeccEEecccceEEEEeccCC Confidence 22 2466778899999999999999999988 No 97 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=99.38 E-value=5.5e-14 Score=93.27 Aligned_cols=297 Identities=11% Similarity=0.008 Sum_probs=161.8 Q ss_pred CCCcccccc-----ccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEee-cCcceeeeee Q lcl|NC_015719. 1 MANMQGGQQ-----LGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPV-IGRTKAAYLQ 74 (344) Q Consensus 1 ma~~~~~~~-----~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~-iG~~t~~~~~ 74 (344) |.+...... -....+....+| .+..+.|..++.+..+..+.++++.+..++.++ .+.+++ .+.+++.-.. T Consensus 90 l~~g~~~~~~~~e~~a~~~~t~~~gG---~~iP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~a~~v~ 165 (407) T protein:vir:48 90 MRKGREDGLRELERKALQVGNDEDGG---YAIPEELDRTILTLLKDEVVMRQEATVITLGGS-DYKKLVNLGGTTSGWVG 165 (407) T ss_pred HhccchhhhhHHHHHhhhcccCCCCc---ccccHhHHHHHHHHHHhhhhhhhhceeeecCCC-ceEEEEecCCcceeeec Confidence 211110000 000001110111 256799999999999999999999888777655 455554 4555666566 Q ss_pred CCCCCCCCcCCcccceEEEEeeeeeeec-eeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|NC_015719. 75 PGESLDDKRKDIKHTEKTINIDGLLTAD-VLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNE 153 (344) Q Consensus 75 ~g~~~~~~~~~~~~~~~~l~iD~~~~~~-~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~ 153 (344) .|...+.+.. .+..++++.+- ++.. +.|.+-=-.++.+|+.+.+.++.++++++..|+.++.- ... . 
T Consensus 166 E~~~~~~~~~-~~f~~i~~~~~--k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~~~a~l~G----~G~-----~ 233 (407) T protein:vir:48 166 ETDARPETAT-SKLGLIEPFMG--EIYGNPQATQKMLDDAFFNVEDWINSELALEFAEQEEIAFTSG----DGS-----K 233 (407) T ss_pred cccccccccc-ccceeEEeeee--eeEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhcc----CCC-----C Confidence 6666654321 24455555553 3333 34544222235679999999999999999999987521 100 1 Q ss_pred ccccccCceeeeccc----ccc--cccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccc Q lcl|NC_015719. 154 NIAGLGKPSLLEVGA----KAD--LTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYA 227 (344) Q Consensus 154 ~~~~~~~~~~i~~~~----~~~--~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~ 227 (344) .|.|.-......... .+. ...........++.|+++...|..+..+. -.+|++|..|..|.+-..- +..|. T Consensus 234 ~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~~~--a~~v~n~~~~~~L~~lkD~-~Gr~l 310 (407) T protein:vir:48 234 KPKGFLAYESTDEDDKTRAFGKLQHIASGAASGVTADAIIKLIYTLRKAHRSG--AKFMMNNSSLFAIRLLKDN-DGNYL 310 (407) T ss_pred ccceeeecccccccccccccccccccccccccccChHHHHHHHHhhchhhhcC--CEEEEcHHHHHHHHHhhcc-CCcee Confidence 111111000000000 000 00011111223678888888887776642 2467999999988642211 22222 Q ss_pred cccccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeee Q lcl|NC_015719. 228 ALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALE 307 (344) Q Consensus 228 ~~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e 307 (344) =...+..|..++++|.+|+.++++|....+...+ .-|++. .+...+..+.++++ T Consensus 311 ~~~~~~~g~~~~l~G~PV~~~~~~p~~~~~~~~i---------------~~Gd~~-----------~~~~i~~~~~~~i~ 364 (407) T protein:vir:48 311 WRPGIELGQPSSLAGYGIVENEQMPDIAADAKAI---------------AFGNFK-----------RGYTIVDRIGTRIL 364 (407) T ss_pred eccCcCCCCCceecceeeEEecCcCCccCCccEE---------------EEEecc-----------ccEEEEEeeceEEE Confidence 2223456777899999999999998643222211 111111 01111122222332 Q ss_pred eeecch--hhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
308 RARRAE--YQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 308 ~~~~~~--~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +|+- .---.+++.+++|+++++|++.+.|+.++.| T Consensus 365 --~d~~~~~~~~~~~~~~r~d~~v~~~~a~~~l~~~aa~ 401 (407) T protein:vir:48 365 --RDPYTNKPFVGFYTTKRTGGMLVDSQAIKLMKIGAAT 401 (407) T ss_pred --eeccccCCcEEEEEEEEeccEEecccceEEEEeeccC Confidence 3322 1223477888999999999999999999888 No 98 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=99.38 E-value=1.1e-13 Score=91.58 Aligned_cols=295 Identities=12% Similarity=0.018 Sum_probs=161.0 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec-CcceeeeeeCCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI-GRTKAAYLQPGESL 79 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (344) ||--+... ...|.-..-.+++.-.+..+.+..+|.+..++.+.++.+.+...+. +.+.+||+. +.+.+.-+..|+.+ T Consensus 1 ~~~~~~~~-~~~~~~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~~p~~~~~~~a~~v~E~~~~ 78 (320) T protein:vir:10 1 MAAGTAFQ-VDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMG-TTGQKIPHWIGDVSAQWIGEGDMK 78 (320) T ss_pred CCCCccCC-HHHHHhhccccccccccccHHHHHHHHHHHHhccchhhhcceeecc-CCceEEEEEeCCcceEEecCCccc Confidence 55433321 1122111111122222567889999999999999999998887765 455788876 55567777778887 Q ss_pred CCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccc Q lcl|NC_015719. 80 DDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGLG 159 (344) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~~ 159 (344) +.. .++.+++++.+-+.- .-+.|.+-=-.++..|+.+.+.++.++++++..|+.++.- .... ....+.+.. 
T Consensus 79 ~~~--~~~f~~v~~~~~k~~-~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~G----~g~~--~~~~~~~~~ 149 (320) T protein:vir:10 79 PIT--KGNMTSQNIAPHKIA-TIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALNG----TDSP--FPTYLAQTT 149 (320) T ss_pred ccc--ccceeEEEEeeEEEE-EeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhcc----cCCC--CCccccccc Confidence 754 456677666664432 3345554222235789999999999999999999998621 1100 000111111 Q ss_pred CceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccc-----ccccc Q lcl|NC_015719. 160 KPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAAL-----IDPER 234 (344) Q Consensus 160 ~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~-----~~~~~ 234 (344) .+... ...+..+... ....-+.+.++...+...+.+ .-+++++|..|..|.+-..- +..+... ..... T Consensus 150 ~~~~~--~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~--~~~~v~n~~~~~~L~~lkd~-~G~~l~~~~~~~~~~~~ 222 (320) T protein:vir:10 150 KSVSL--ADPGGATASD--LTAYDAVAVNGLSLLVNAKKK--WTHTLLDDIVEPILNGAKDK-NGRPLFIESTYTDENSP 222 (320) T ss_pred ccccc--eecccccccc--cccHHHHHHHHHhhhhcccCC--CcEEEEcHHHHHHHHHhhcc-CCceeeccccccCcccc Confidence 11110 0111111000 111123455566666655543 34788999999999753221 1111111 11111 Q ss_pred ceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecch- Q lcl|NC_015719. 235 GSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAE- 313 (344) Q Consensus 235 G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~- 313 (344) ..-++++|++|+.++++|.+... +++.+.+-+..+....+++|..++.. T Consensus 223 ~~~~~i~g~pv~~~~~~~~~~~~------------------------------~~~gd~~~~~~~~~~~~~i~~~~~~~~ 272 (320) T protein:vir:10 223 FRAGRIVSRPTILSDHVADGTTV------------------------------GYMGDFRNVIWGQVGGLSFDVTDQATL 272 (320) T ss_pred ccCceeeeeeeEecCCCCCCceE------------------------------EEEeecceEEEEEecCeEEEEeeccee Confidence 11247899999999998753211 01111111112333445555554432 Q ss_pred -------------hhh--hhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
314 -------------YQA--DQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 314 -------------~~~--d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +.. -.++..+++|.+++||++.+.|+ ..+| T Consensus 273 ~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~~~~a~~~l~-~~~a 317 (320) T protein:vir:10 273 NLGTPTEPNFVSLWQHNLVAVRVEAEYAFHNNDKDAFVKLT-NVVT 317 (320) T ss_pred eeccccccccchhhhcCcEEEEEEEeeccEEecccceEEEE-eccC Confidence 112 33577889999999999988886 2334 No 99 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=99.38 E-value=8e-14 Score=92.37 Aligned_cols=288 Identities=14% Similarity=0.081 Sum_probs=161.6 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec-CcceeeeeeCCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI-GRTKAAYLQPGESL 79 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (344) ||...+. .|. .+..+++++++.+..++.|+++.+.++.... +..++||+. |.+++.-+..|+.+ T Consensus 1 Ma~~~~~------~gg--------~~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~-~~~~~ip~~~~~~~a~wv~Eg~~~ 65 (315) T protein:vir:80 1 MADDFLS------AGK--------LELPGSMIGAVRDRAIDSGVLAKLSPEQPTI-FGPVKGAVFSGVPRAKIVGEGEVK 65 (315) T ss_pred CCCCcCC------cCc--------eEcchHHHHHHHHHHHhhchhhhhcceeecC-CCceEEEEEeCCcceEEeeCCccc Confidence 8864421 111 2567899999999999999999988877654 445788885 56677777778877 Q ss_pred CCCcCCcccceEEEEeeeeeeec-eeccchHHHHhChh----HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|NC_015719. 80 DDKRKDIKHTEKTINIDGLLTAD-VLIYDIEDAMNHYD----VRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNEN 154 (344) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~-~~Idd~D~~q~~~d----~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~ 154 (344) +.+ +.+.+++++.. .+... ..|.+-=-.++..| +.+.+.++.+++|++.+|+.++.- .... .... 
T Consensus 66 ~~s--~~~f~~v~l~~--~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G----~~~~--~~~~ 135 (315) T protein:vir:80 66 PSA--SVDVSAFTAQP--IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHG----IDPA--TGKA 135 (315) T ss_pred ccc--ccceeeeEeee--eeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeec----cCCC--CCcc Confidence 754 35666666654 33332 34433111112233 678889999999999999888631 1000 0001 Q ss_pred cccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccc---cc Q lcl|NC_015719. 155 IAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAAL---ID 231 (344) Q Consensus 155 ~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~---~~ 231 (344) +.+... .+. ..+. ........++.|.++...+..++.-.. ..++++|..+..|.+-......+..+. .. T Consensus 136 ~~~~~~--~~~--~~~~---~~~~~~~~~~d~~~~~~~~~~~~~~~~-~~~imn~~~~~~L~~l~~~~g~~~~g~~~~~~ 207 (315) T protein:vir:80 136 ASAVHT--SLN--KTKN---IVDATDSATADLVKAVGLIAGAGLQVP-NGVALDPAFSFALSTEVYPKGSPLAGQPMYPA 207 (315) T ss_pred cccccc--ccc--cccc---eeeccccchHHHHHHHHHHhhccCccc-eEEEEcHHHHHHHHHHhhccCCcccccccccc Confidence 111110 000 0000 011111235667777777766655333 347789999999875432222221111 12 Q ss_pred cccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeec Q lcl|NC_015719. 232 PERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARR 311 (344) Q Consensus 232 ~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~ 311 (344) +..|..++++|.+|+.++++|....... ......+.||+.. -.++. ...+++|..++ T Consensus 208 ~~~g~~~tl~G~PV~~~~~~~~~~~~~~-----------~~~~~~~~GDfs~----------~~~g~--~~~~~i~i~~~ 264 (315) T protein:vir:80 208 AGFAGLDNWRGLNVGASSTVSGAPEMSP-----------ASGVKAIVGDFSR----------VHWGF--QRNFPIELIEY 264 (315) T ss_pred cccCCCceecceeeEecCcCCccccccc-----------ccccEEEEeeccc----------EEEEE--ecCeeEEEecc Confidence 3455567999999999999986543221 1111122233221 11111 22334444332 Q ss_pred ch--------hhhh--hhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
312 AE--------YQAD--QIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 312 ~~--------~~~d--~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) .. ++.| .+++..++|.++++|++.+.|+.++.. T Consensus 265 ~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~a~ 307 (315) T protein:vir:80 265 GDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAP 307 (315) T ss_pred ccccCcccchhhcCcEEEEEEEEecceeecccceEEEeeccCC Confidence 11 2222 456778999999999999999865544 No 100 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=99.37 E-value=6.9e-14 Score=92.74 Aligned_cols=294 Identities=12% Similarity=0.053 Sum_probs=155.3 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEee-cCcceeeeeeCCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPV-IGRTKAAYLQPGESL 79 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~-iG~~t~~~~~~g~~~ 79 (344) +..+.. ...+ ....+...+..+.+..++.+..+..+.++.+.+...+.++ ..++++ .+.+.+..+..+... T Consensus 155 ~~~~~a-----~~~~--~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~a~~v~e~~~~ 226 (458) T protein:vir:10 155 QRHLKA-----VNQS--SSVEVSSESYETIFSQRIIRDLQKELVVGALFEELPMSSK-ILTMLVEPDAGKATWVAASTYG 226 (458) T ss_pred hhhhhh-----hhhc--ccCccccceehhhHhHHHHHHHHhhhhHHhhcceeecCCc-ceEEEEecCCcceeeccccccc Confidence 111000 0000 1111223367889999999999999999988888777554 455554 344454444445444 Q ss_pred CCCc----CCcccceEEEEeeeeeeec-eeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|NC_015719. 80 DDKR----KDIKHTEKTINIDGLLTAD-VLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNEN 154 (344) Q Consensus 80 ~~~~----~~~~~~~~~l~iD~~~~~~-~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~ 154 (344) +.+. ...+..+++ +...++.. +.|.+-=-.++.+++.+.+..+.+++|++..|+.++.- .. + .. 
T Consensus 227 ~~~~~~~~~~~~~~~i~--~~~~k~~~~v~is~ell~ds~~~~~~~i~~~l~~~i~~~~d~~~l~G----~G-~----~~ 295 (458) T protein:vir:10 227 TDTTTGEEVKGALKEIH--FSTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMTG----DG-S----GK 295 (458) T ss_pred ccccccccccccceeeE--eeeeeEEeeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcC----CC-C----Cc Confidence 4321 112334444 44444444 34544212234589999999999999999999988631 10 0 11 Q ss_pred cccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcccc----cc Q lcl|NC_015719. 155 IAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAA----LI 230 (344) Q Consensus 155 ~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~----~~ 230 (344) |.|.........+...............++.|+++...|...+.. .-.+|++|..|..|..-..- +..+.. .. T Consensus 296 p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~--~~~~v~~~~~~~~l~~lkd~-~G~~i~~~~~~~ 372 (458) T protein:vir:10 296 PKGLLTLASEDSAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLK--LSKLVLIVSMDAYYDLLEDE-EWQDVAQVGNDS 372 (458) T ss_pred cceeeecccccccceeecccccccccccHHHHHHHHHhhhhhhcC--CCEEEEcHHHHHHHHhhccc-CCceeecccccc Confidence 111111110000000000011111122367788888888877653 34568899999887642211 222221 12 Q ss_pred ccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeee Q lcl|NC_015719. 231 DPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERAR 310 (344) Q Consensus 231 ~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~ 310 (344) ....|...+++|.+|+.++.+|..+...... +|. |. +....+....++++... T Consensus 373 ~~~~~~~~~l~G~pv~~~~~~p~~~~~~~~~-------------------------~~~-f~-~~~~~~~~~~~~v~~d~ 425 (458) T protein:vir:10 373 VKLQGQVGRIYGLPVVVSEYFPAKANSAEFA-------------------------VIV-YK-DNFVMPRQRAVTVERER 425 (458) T ss_pred ccccCcCceecceeeEEccccccccCCcceE-------------------------EEE-ec-ccEEEEEeeceEEEeec Confidence 3445666789999999999998643222110 010 00 11112222333443221 Q ss_pred cchhhhhhhhhhhhhcCceeccccEEEEEecCC Q lcl|NC_015719. 
311 RAEYQADQIIAKYAMGHGGLRPESAGALVFKAG 343 (344) Q Consensus 311 ~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~ 343 (344) -...-.-.++...++|..+.+|++.+..++++- T Consensus 426 ~~~~~~~~~~~~~r~~~~v~~~~a~v~~~~aa~ 458 (458) T protein:vir:10 426 QAGKQRDAYYVTQRVNLQRYFANGVVSGTYAAS 458 (458) T ss_pred ccCCCceEEEEEEEecceEecccceEEEeeccC Confidence 111112336777899999999999988777666 No 101 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=99.37 E-value=7.4e-14 Score=92.56 Aligned_cols=284 Identities=14% Similarity=0.046 Sum_probs=163.3 Q ss_pred CCC-ccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeeccc--ccEEEEeec-CcceeeeeeCC Q lcl|NC_015719. 1 MAN-MQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISS--GKSAQFPVI-GRTKAAYLQPG 76 (344) Q Consensus 1 ma~-~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~--G~tv~i~~i-G~~t~~~~~~g 76 (344) +.. +.+.............+++--.+..+.|..+|++..+..+.++++++...+.+ |+....+.. +...+.....| T Consensus 94 ~~~~~~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~ 173 (397) T protein:vir:48 94 FKNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEA 173 (397) T ss_pred HHHHHhhhhhHHHHHhhccCCccccccccHHHHHHHHHHHHHHHHHHhhhceeeccCCcceEEEEeecCCCcceeeeccc Confidence 000 00000000000000111122235779999999999999999999988877653 333322222 22234445556 Q ss_pred CCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Q lcl|NC_015719. 77 ESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIA 156 (344) Q Consensus 77 ~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (344) +.++.. ..++..++++.+.+.. ....|.+-=-.++.+|+.+.+.++.+++|++..|+.|+... 
T Consensus 174 ~~~~~~-~~~~~~~v~~~~~k~~-~~~~iS~ell~ds~~~l~~~v~~~l~~~~~~~~d~~il~G~--------------- 236 (397) T protein:vir:48 174 GSIGTN-DDPKLYPIRYAIKRYA-GISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAI--------------- 236 (397) T ss_pred cccccc-cccceeeEEeeheeee-eehhhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcc--------------- Confidence 666533 1245567677665543 33456542223467899999999999999999999986311 Q ss_pred cccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccce Q lcl|NC_015719. 157 GLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGS 236 (344) Q Consensus 157 ~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~ 236 (344) +.+. ..++. .-++.|+++...|.....+. =.++++|..|..|.+-..- +..+.-...+..|. T Consensus 237 --g~~~--~~~~~-----------~~~d~i~~~~~~l~~~~~~~--a~~v~n~~~~~~L~~lkd~-~G~~i~~~~~~~~~ 298 (397) T protein:vir:48 237 --ATLP--TKPTL-----------TKWDDIIDLQAKVDPAIKQT--SFFLTNTSGFTALKKVKNA-FGDYLMERDVKSPT 298 (397) T ss_pred --cccc--ccccc-----------ccHHHHHHHHHHhhhhhcCC--CEEEECHHHHHHHHHhhcC-CCceeeccCcCCCC Confidence 0111 00000 12677788888888777643 3667899999998763222 22232223456777 Q ss_pred eEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEec-HHHHhhhhhheeeeeeeecch-h Q lcl|NC_015719. 237 IRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQH-RSAVGTVKLKDLALERARRAE-Y 314 (344) Q Consensus 237 Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~-~~Av~~~~~~~~~~e~~~~~~-~ 314 (344) -+.++|++|+.+.+.+..+.+.. ..+.++.. +.++..+....++++..+... + T Consensus 299 ~~~l~G~PV~~~~~~~~~~~~~~-------------------------~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~ 353 (397) T protein:vir:48 299 GYSIDGFAVKEVADRWLANASSG-------------------------AMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGA 353 (397) T ss_pred CceeccceeEEecccccCCcCCC-------------------------ceEEEEEeccceEEEEeecceEEEEeccchhh Confidence 78999999998765433221110 01112222 223333444455666655332 2 Q ss_pred h---hhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
315 Q---ADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 315 ~---~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) | .-.+++.++++.++++|++.+.+++++.+ T Consensus 354 ~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~ 386 (397) T protein:vir:48 354 FETDTTKIRVIDRFDVVATDTESFVPASFKAIA 386 (397) T ss_pred hhcCceeEEEEeeeccEEecccceEEEEecccc Confidence 2 24678889999999999999999999888 No 102 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=99.36 E-value=9e-14 Score=92.10 Aligned_cols=284 Identities=13% Similarity=0.055 Sum_probs=162.2 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeeccccc-EEEEeecCc--ceeeeeeCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGK-SAQFPVIGR--TKAAYLQPGE 77 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~-tv~i~~iG~--~t~~~~~~g~ 77 (344) ..-+.++....-+.......++--.+..+.|..++.+..+..+.+++++++..+..+. ++.++.... ..+.....|. T Consensus 95 ~~~l~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~ 174 (397) T protein:vir:49 95 KNLVRGRYQNLLDSKTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEGG 174 (397) T ss_pred HHHhhcchhhHHHhhhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEEeeccCCcceeeecccc Confidence 0000000000000000011111123567999999999999999999998888776432 344554432 2344444566 Q ss_pred CCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc Q lcl|NC_015719. 78 SLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAG 157 (344) Q Consensus 78 ~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~ 157 (344) .++... ..+.+++++.+.+.- .-+.|.+-=-.++.+|+.+.+.++.+++|++..|+.|+... 
T Consensus 175 ~~~~~~-~~~~~~v~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ail~G~---------------- 236 (397) T protein:vir:49 175 QIGQND-DPKLSLIRYAIKRYA-GISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAI---------------- 236 (397) T ss_pred cccccc-ccceeeeEeeeeeeE-eehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcc---------------- Confidence 654321 134466666665543 22445542223467899999999999999999999886311 Q ss_pred ccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcccccccccccee Q lcl|NC_015719. 158 LGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSI 237 (344) Q Consensus 158 ~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V 237 (344) +.++. .++. .-++.|.++...|+....+. -.+|++|..|..|.+-..- +..|.-...+..|.- T Consensus 237 -g~~~~--~~~~-----------~~~d~i~~~~~~l~~~~~~~--a~~v~n~~~~~~l~~lkd~-~g~~l~~~~~~~g~~ 299 (397) T protein:vir:49 237 -GTLPN--KPTL-----------AKWDDIIDLQAKVDPAIKQT--SLFLTNTSGFTALKKVKNA-MGDYLMERDVKSPTG 299 (397) T ss_pred -ccccc--cccc-----------cCHHHHHHHHHhhhhhhcCC--CEEEEcHHHHHHHHHhhcc-CCceeecccccCCCC Confidence 01110 0000 01677888888888877653 4778999999988653211 222222223456767 Q ss_pred EEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEe-cHHHHhhhhhheeeeeeeecc--hh Q lcl|NC_015719. 238 RNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQ-HRSAVGTVKLKDLALERARRA--EY 314 (344) Q Consensus 238 g~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~-~~~Av~~~~~~~~~~e~~~~~--~~ 314 (344) ++++|++|+.+.+.+.+..+.. ....++. .+.++..+....++++..+.. .+ T Consensus 300 ~~l~G~pV~~~~~~~~~~~~~~-------------------------~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~ 354 (397) T protein:vir:49 300 YSIDGFVVKEISDRFLPNGTGG-------------------------AMPLYFGDLKQAVTLFDRQHLSLLSTNIGGGAF 354 (397) T ss_pred ceecceeeEEecccccccccCC-------------------------ceeEEEeeccceEEEEeecccEEEEeccccchh Confidence 7999999998765443221110 0011111 122333444455566655432 12 Q ss_pred h--hhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
315 Q--ADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 315 ~--~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) . ...+++..++|.++++|++.+.++++++| T Consensus 355 ~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~~~ 386 (397) T protein:vir:49 355 ETDTTKVRVIDRFDVVSTDTEAFVPASFKAIA 386 (397) T ss_pred hcCeeeEEEEEeeccEEecccceEEEEecccc Confidence 2 23478889999999999999999999988 No 103 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=99.35 E-value=8.7e-14 Score=92.17 Aligned_cols=279 Identities=16% Similarity=0.104 Sum_probs=163.0 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecC--cceeeeeeCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIG--RTKAAYLQPGES 78 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG--~~t~~~~~~g~~ 78 (344) ++..+.. ....+++.-.+.+..+...+.+.....+.++++++..++.+ .++++|+.. ..++.....|+. T Consensus 107 ~~~~~~~--------~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~a~~v~Eg~~ 177 (390) T protein:vir:10 107 KAALNTA--------STDAAGSAGALTTPNRLPGFITQPDARLTVRDLIGSGRTDS-ALIEYVQETGFVNNAAIVAEGAL 177 (390) T ss_pred HHHHHhh--------hcccccccccccchhHHHHHHHHHHhhchhhhhcceeeccC-CceEEEEEecCCcceeeecCCcc Confidence 1111111 11111111235666777778888888888888888877654 467888763 345566667777 Q ss_pred CCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccc Q lcl|NC_015719. 79 LDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGL 158 (344) Q Consensus 79 ~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~ 158 (344) ++.. +++..++++.+.+.. .-+.|.+ +-.+...++.+.+.++.+.++++..|+.++.- .. .+..+.|. 
T Consensus 178 ~~~~--~~~~~~i~~~~~k~~-~~~~is~-ell~d~~~l~~~i~~~l~~~~~~~~~~~il~G----~G----~~~~p~Gi 245 (390) T protein:vir:10 178 KPES--SLKFAKKTDTTHVIA-HTMKATR-QILSDAPQLASYMNNRLIRGLKVKEDAEILRG----TG----ANDGLLGL 245 (390) T ss_pred cccc--ccceeEEEEeeEEEE-EeehhhH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhc----CC----CCcccccc Confidence 6653 456677777776553 2345554 23344568899999999999999999988621 11 11112221 Q ss_pred cCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccceeE Q lcl|NC_015719. 159 GKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSIR 238 (344) Q Consensus 159 ~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg 238 (344) ..... ....+ ....+...++.+.++...|...+.+.. .+|++|..|..|.+-..- +..|.-... ..+..+ T Consensus 246 ~~~~~-----~~~~~-~~~~~~~~~~~~~~~~~~l~~~~~~~~--~~v~n~~~~~~L~~lkd~-~g~~l~~~~-~~~~~~ 315 (390) T protein:vir:10 246 IPQAT-----TYAAP-TTIAGATRVDQLRLAMLQASLAEYPAS--GIVINPIDWAAIELAKDA-NNQYLIGNA-RGTLTP 315 (390) T ss_pred ccccc-----ccccc-ccccccchHHHHHHHHHhhccccCCCC--EEEEcHHHHHHHHHhhcC-CCceeecCC-cCcCCc Confidence 11110 00000 011122346778888888888887644 467999999988753321 222221111 233346 Q ss_pred EEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecc-hhhhh Q lcl|NC_015719. 239 NVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRA-EYQAD 317 (344) Q Consensus 239 ~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~-~~~~d 317 (344) .++|.+|+.++.+|.+.. +-+++. . ++..+..+.++++..+.. .+..| T Consensus 316 ~l~G~pv~~~~~~p~~~~--------------------~~gdf~--~---------~~~~~~~~~~~i~~~~~~~~~~~~ 364 (390) T protein:vir:10 316 TLWGLPVVATQAMAPGEF--------------------LVGAFD--L---------AAQIFDQWDARVEIGYVNDDFQRN 364 (390) T ss_pred eecceeeEEcCCCCCCcE--------------------EEEecc--c---------eEEEEEecceEEEEeecccccccC Confidence 899999999999985321 111111 1 111122344566666543 33445 Q ss_pred --hhhhhhhhcCceeccccEEEEEec Q lcl|NC_015719. 
318 --QIIAKYAMGHGGLRPESAGALVFK 341 (344) Q Consensus 318 --~i~~~~~~G~~v~Rp~~~~~l~~~ 341 (344) .+++..+++.++++|++.+.+.+. T Consensus 365 ~~~~r~~~r~d~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 365 MVTVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred cEEEEEEEeeccEEeccccEEEEEeC Confidence 456778999999999999999999 No 104 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=99.35 E-value=1.1e-13 Score=91.69 Aligned_cols=285 Identities=12% Similarity=0.060 Sum_probs=160.8 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec-CcceeeeeeCCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI-GRTKAAYLQPGESL 79 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (344) |.- ++ ..+.......++.-.+.++++..++.+..++.+.++++.+...+. +.+++||+. +.+.+.-+..|..+ T Consensus 1 ~g~-~~----e~~~~~~~~t~~~~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~-~~~~~ip~~~~~~~a~wv~Eg~~~ 74 (397) T protein:vir:23 1 MGF-SA----DHSQIAQTKDTMFTGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMG-ATGIVIPHWTGDVSAQWIGEGDMK 74 (397) T ss_pred CCc-CH----HHHHHhhccCCCCccccchhHHHHHHHHHHhccchhhhcceeecc-CCceEEEEEcCCcceEEecCCccc Confidence 321 11 111111111111112455667788888888889999998887765 455788876 44555666667777 Q ss_pred CCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccc Q lcl|NC_015719. 80 DDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGLG 159 (344) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~~ 159 (344) +.. .++..++++.+-+. ..-+.|.+-=-.++.+|+.+.+.++.+++|++..|+.++.-- ... 
..+.+ T Consensus 75 ~~s--~~~f~~v~l~~~k~-~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~----gt~----~~~~~-- 141 (397) T protein:vir:23 75 PIT--KGNMTKRDVHPAKI-ATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAALHGT----NAP----SAFQG-- 141 (397) T ss_pred ccc--ccceeEEEEeeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcc----cCC----ccccc-- Confidence 654 45666766666443 233455542222456899999999999999999999986311 100 00111 Q ss_pred CceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccc-----ccccccc Q lcl|NC_015719. 160 KPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYA-----ALIDPER 234 (344) Q Consensus 160 ~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~-----~~~~~~~ 234 (344) ............ ....++.++++...|.+...+ .-.++++|..|..|.+-..- +..+. ....... T Consensus 142 ---~~~~~~~~~~~~----~~~~~~~~~~~~~~l~~~~~~--~a~~vmn~~~~~~L~~lkd~-~G~~i~~~~~~~~~~~~ 211 (397) T protein:vir:23 142 ---YLDQSNKTQSIS----PNAYQGLGVSGLTKLVTDGKK--WTHTLLDDTVEPVLNGSVDA-NGRPLFVESTYESLTTP 211 (397) T ss_pred ---ccccccceeeec----ccchhHHHHHHHHhhhhcccC--CCEEEEcHHHHHHHHHhhcc-CCceeeccccccccccc Confidence 111111111111 112245566677777777654 24578999999988863222 11221 1112223 Q ss_pred ceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecch- Q lcl|NC_015719. 235 GSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAE- 313 (344) Q Consensus 235 G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~- 313 (344) +..+++.|.+|+.++++|.+... .+.+++ ... ..+..+.+.+|..++.. T Consensus 212 ~~~~tl~G~Pv~~s~~~~~g~~~------------------~~~gDf--s~~----------~i~~~~~i~i~~~~e~~~ 261 (397) T protein:vir:23 212 FREGRILGRPTILSDHVAEGDVV------------------GYAGDF--SQI----------IWGQVGGLSFDVTDQATL 261 (397) T ss_pred ccCceeeeeeEEEeCCCCCCceE------------------EEEeec--ceE----------EEEEEeceEEEEeeeeee Confidence 34468999999999999854311 011111 111 11222334455444322 Q ss_pred -------------hhhh--hhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
314 -------------YQAD--QIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 314 -------------~~~d--~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +..| .++..++++.++++|++.+.+...... T Consensus 262 ~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~~~ 307 (397) T protein:vir:23 262 NLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDPVL 307 (397) T ss_pred eeccccccceeeeeeccceeEEEEeeeccceecccceEEEeecccc Confidence 2223 457778999999999999999887666 No 105 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=99.35 E-value=9.5e-14 Score=91.97 Aligned_cols=279 Identities=15% Similarity=0.092 Sum_probs=166.4 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCc--ceeeeeeCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGR--TKAAYLQPGES 78 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~--~t~~~~~~g~~ 78 (344) .+..+. ......++.-.+.++++...+.+.....+.++++++...+. +.++++++... .++..+..|+. T Consensus 107 ~~~~~~--------~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~a~~v~Eg~~ 177 (390) T protein:vir:81 107 KAALNT--------ASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTD-SALIEYVQETGFVNNAAIVAEGAL 177 (390) T ss_pred HHHHHh--------hccccccCCcceechhhhHHHHHHHhhhhhhhhhcceeecc-CCceEEEEEecCCcceeeecCCcc Confidence 011110 01111222223567788888999999899999998877765 45677777633 45666777777 Q ss_pred CCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccc Q lcl|NC_015719. 79 LDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGL 158 (344) Q Consensus 79 ~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~ 158 (344) ++.. .++.+++++.+.+... -..|.+ +-.+.+.++.+.+.++.+.++++..|+.++.. . ..+..+.|. 
T Consensus 178 ~~~~--~~~~~~i~~~~~k~~~-~~~is~-ell~d~~~~~~~i~~~l~~~~~~~~d~a~l~G----~----g~~~~~~Gi 245 (390) T protein:vir:81 178 KPES--SLKFAKKTDTTHVIAH-TMKATR-QILSDAPQLASYMNNRLIRGLKVKEDAEILRG----T----GANDGLLGL 245 (390) T ss_pred cccc--cceeeEEEEeeeEEEE-eehhhH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhc----C----CCCCcccce Confidence 7654 3566777777665543 345554 33344568999999999999999999988631 1 011112221 Q ss_pred cCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccceeE Q lcl|NC_015719. 159 GKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSIR 238 (344) Q Consensus 159 ~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg 238 (344) . ........ .........++.|.++...|...+.+.. .+|++|..|..|.+-..- +..|.-. ....|... T Consensus 246 ~-----~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~v~~~~~~~~l~~lkd~-~G~~l~~-~~~~~~~~ 315 (390) T protein:vir:81 246 I-----PQATTYAA-PTTIAGATRVDQLRLAMLQASLAEYNPS--GIVINPIDWAAIELAKDA-NNQYLIG-NARGTLTP 315 (390) T ss_pred e-----eccccccc-ccccccchhHHHHHHHHHhhccccCCCC--EEEEcHHHHHHHHHhhcC-CCceeec-CcccccCc Confidence 1 11000000 0011122346788888888888887544 567899999988753321 1222211 12244456 Q ss_pred EEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchhh-hh Q lcl|NC_015719. 239 NVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEYQ-AD 317 (344) Q Consensus 239 ~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~~-~d 317 (344) .++|.+|+.++.+|.+.. .-+++ +. ++..+....++++..+...+| .| T Consensus 316 ~l~G~pv~~~~~~p~~~~--------------------~~gd~--~~---------~~~~~~~~~~~v~~~~~~~~~~~~ 364 (390) T protein:vir:81 316 TLWGLPVVATQAMAPGEF--------------------LVGAF--DL---------AAQIFDQWDARVEIGYVGEDFQRN 364 (390) T ss_pred eecceeeEEcCCCCCCcE--------------------EEEeh--hc---------eEEEEEecceEEEEecccchhhcC Confidence 899999999999985421 11111 11 111122345677776654433 34 Q ss_pred --hhhhhhhhcCceeccccEEEEEec Q lcl|NC_015719. 
318 --QIIAKYAMGHGGLRPESAGALVFK 341 (344) Q Consensus 318 --~i~~~~~~G~~v~Rp~~~~~l~~~ 341 (344) .++...+++.++++|++.+.+++. T Consensus 365 ~v~~r~~~r~d~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 365 MITVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred cEEEEEEEeeccEEecccceEEEEeC Confidence 467888999999999999999999 No 106 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=99.34 E-value=1e-13 Score=91.86 Aligned_cols=295 Identities=12% Similarity=0.129 Sum_probs=160.5 Q ss_pred CCCccccccccccccccccccc-hhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcce---------e Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAAD-KLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRTK---------A 70 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d-~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t---------~ 70 (344) +.......... .....+...+ ...+..+.+.+.+.......+.++++.+..... +.++++++....+ + T Consensus 110 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~a 187 (419) T protein:vir:94 110 MRDIDPNRLLS-RDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNAD-YNVLEYIRDTSGTAGAGSTWNKA 187 (419) T ss_pred HHHHHHHHhhc-cccccccccCCcccccchhhhHHHHHHHhhhhhhhhcceeeecc-CCceeeeeeccccccccccCccc Confidence 00000000000 0011111111 112344677888887777777788888877654 5567777653322 2 Q ss_pred eeeeCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|NC_015719. 71 AYLQPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADG 150 (344) Q Consensus 71 ~~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~ 150 (344) ..+..|+.++.. .++..++++.+.+.-. -+.|.+ +-.+...++.+.+.++.++++++..|+.|+. +... 
T Consensus 188 ~~v~Eg~~~~~~--~~~~~~i~~~~~k~~~-~~~is~-ell~d~~~l~~~i~~~la~a~~~~~d~aii~----G~G~--- 256 (419) T protein:vir:94 188 AVVPEGTAKPQS--TLSFDTITTTLKTVAH-WLPITR-QAADDNSQLMGYIQGRLTYGLRFLRDRQLLN----GNGS--- 256 (419) T ss_pred ceecCCcccccc--ccceeeEEeeeeeEEE-eehhhH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHHHh----ccCc--- Confidence 233345555432 3455666666655432 245554 3333445788889999999999999999863 1110 Q ss_pred cccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcccccc Q lcl|NC_015719. 151 VNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALI 230 (344) Q Consensus 151 ~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~ 230 (344) ..|.|......+........+. .......++.|+++...+...+.+.. .++++|..|..|+.-..-....+.-.. T Consensus 257 --~~p~Gi~~~~~~~~~~~~~~~~-~~t~~~~~~~l~~~~~~~~~~~~~~~--~~v~n~~~~~~l~~~k~~~~~~~~~~~ 331 (419) T protein:vir:94 257 --TEMQGILTTPGIGTYQQPKPTA-PATDEPPLVDIRRAKTVAEIAGFPPD--GVVVHPQDWESIELDQAPGSGVFRVIA 331 (419) T ss_pred --ccccceeccccccccccccccc-ccccchhHHHHHHHHHhhhhccCCCC--EEEEcHHHHHHHHHHhhcCCCceeecC Confidence 1122211111010000000010 11123357889999988888777433 678999999998764333333332233 Q ss_pred ccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeee Q lcl|NC_015719. 231 DPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERAR 310 (344) Q Consensus 231 ~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~ 310 (344) ....|..+.++|++|+.++.+|.+.. +-+++ .. ..+. +..+.++++..+ T Consensus 332 ~~~~~~~~~l~G~pV~~~~~~~~~~~--------------------~~gd~--~~-~~~~--------~~~~~~~v~~~~ 380 (419) T protein:vir:94 332 NVQGEATPRIWGLNVVSTVAIAQGTA--------------------LVGGF--RQ-GATL--------WSRQGITVLMTD 380 (419) T ss_pred CcccCCCccccceeeEEcCCCCCccE--------------------EEeec--cc-eEEE--------EEecceEEEEec Confidence 44567778999999999999985321 01111 11 0111 223345555554 Q ss_pred cch-hh---hhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
311 RAE-YQ---ADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 311 ~~~-~~---~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ... +| ...++...++|.++++|++.+.+++++-= T Consensus 381 ~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~~aa~ 418 (419) T protein:vir:94 381 SHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAAT 418 (419) T ss_pred cccchhhcCcEEEEEEEeeccEEeccccEEEEEeccCC Confidence 332 22 23567888999999999999988876544 No 107 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=99.32 E-value=4.7e-13 Score=88.19 Aligned_cols=278 Identities=11% Similarity=0.036 Sum_probs=157.0 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec-CcceeeeeeCCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI-GRTKAAYLQPGESL 79 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (344) |++... .+.-.+..+++..+|.+..++.++++.+.+...+. +.+++||+. +.+.+.-+..|+.+ T Consensus 14 ~~~~~~--------------~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~~~ 78 (318) T protein:vir:24 14 IAQTGD--------------TMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMG-TTGQKIPHWVGDVSAQWIGEGDMK 78 (318) T ss_pred hhcccC--------------cccceeechhHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEeCCcceEEecCCccc Confidence 322211 11112567889999999999999999998887765 455778765 55677777778887 Q ss_pred CCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccc Q lcl|NC_015719. 80 DDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGLG 159 (344) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~~ 159 (344) +.. .++.+++++..-+.. .-..|.+-=-.++.+|+.+.+.++.++++++.+|+.++.- .... .+.+.. 
T Consensus 79 ~~~--~~~f~~i~~~~~k~~-~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G----~g~~-----~~~~~~ 146 (318) T protein:vir:24 79 PIT--KGNMTSQTIAPHKIA-TIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAMHG----TDSP-----FPTYIG 146 (318) T ss_pred ccc--ccceeEEEEeeEEEE-EeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhhcc----cCCC-----CCcccc Confidence 754 355666666554432 2234544111235678999999999999999999998621 1110 111111 Q ss_pred Cce-eeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccc----- Q lcl|NC_015719. 160 KPS-LLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPE----- 233 (344) Q Consensus 160 ~~~-~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~----- 233 (344) ... .+..+.. +... ....+.+.++...+...+. ..-.++++|..|..|.+-..- +..+.-..... T Consensus 147 ~~~~~~~~~~~---~~~~---~~~~~~~~~~~~~~~~~~~--~~~~~v~n~~~~~~L~~lkd~-~G~~l~~~~~~~~~~~ 217 (318) T protein:vir:24 147 QTTKAISIADT---TGAT---TVYDQVAVNGLSLLVNDGK--KWTHTLLDDITEPILNGAKDQ-NGRPLFIESTYGEAAS 217 (318) T ss_pred ccccccccccc---cccc---chHHHHHHHHHHhhccccC--CCCEEEEcHHHHHHHHHhhcc-CCceeecCccccCccc Confidence 100 0111111 1111 1112334455555544443 334679999999988753222 12221111111 Q ss_pred cceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecch Q lcl|NC_015719. 234 RGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAE 313 (344) Q Consensus 234 ~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~ 313 (344) ...-+.+.|++|+.++++|.+.... ..+++ +.+..+..+++.+|..++.. T Consensus 218 ~~~~~~i~g~pv~~~~~~~~~~~~~------------------~~gdf------------s~~~~~~~~~l~i~~~~~~~ 267 (318) T protein:vir:24 218 PFRSGRIVARPTILSDHVVEGTTVG------------------FMGDF------------SQLIWGQIGGLSFDVTDQAT 267 (318) T ss_pred cccCceEEEEeeEEeCCCCCCccEE------------------EEeec------------ceEEEEEecCeEEEEeeccc Confidence 1122578999999999887532110 01111 11112233445565555432 Q ss_pred --------------hhh--hhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
314 --------------YQA--DQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 314 --------------~~~--d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +.. -.++..+++|.+++||++.+.|+...-+ T Consensus 268 ~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~a~ 314 (318) T protein:vir:24 268 LNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAEAFVALTNVVSG 314 (318) T ss_pred eeccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeeccC Confidence 222 3357889999999999999998876555 No 108 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=99.32 E-value=4.9e-13 Score=88.06 Aligned_cols=285 Identities=11% Similarity=0.058 Sum_probs=159.7 Q ss_pred CCCcccccc----ccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccc-cEEEEeecCc--ceeeee Q lcl|NC_015719. 1 MANMQGGQQ----LGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSG-KSAQFPVIGR--TKAAYL 73 (344) Q Consensus 1 ma~~~~~~~----~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G-~tv~i~~iG~--~t~~~~ 73 (344) +..+..+.. ...|.-.....++--.+..+.|..++.+..+..+.++++++...+.++ .++.++.... ..+..+ T Consensus 98 ~~~~~~~~~~~~~~e~~a~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v 177 (404) T protein:vir:39 98 VNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMD 177 (404) T ss_pred HHHHhcchhhhhhhhhhhhhcccccCCceeccHHHHHHHHHHHHhhhhHHhhcceeeccCCcceEEEEeecCCccceeee Confidence 000000000 001111111111222357899999999999999999999988877643 2344444332 344456 Q ss_pred eCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|NC_015719. 74 QPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNE 153 (344) Q Consensus 74 ~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~ 153 (344) ..|+.++... .++..++++.+.+.. ..+.|.+-=-..+.+|+.+.+..+.++++++..|+.|+.-. 
T Consensus 178 ~Eg~~~~~~~-~~~f~~i~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~g~------------ 243 (404) T protein:vir:39 178 AEDGKIPDLD-NPRLTIIKYLIKRYA-GIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAM------------ 243 (404) T ss_pred cCcccccccc-ccceeeEEeeeeeEE-eeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhcc------------ Confidence 5666665321 245667777776554 33456552223357889999999999999999999886311 Q ss_pred ccccccCceeeecccccccccchhhHHHHHHHHHHHHH-HHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcccccccc Q lcl|NC_015719. 154 NIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARA-ALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDP 232 (344) Q Consensus 154 ~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~-~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~ 232 (344) +.++ ..+... -++.+.++.. .++....+ +-.+|++|..|..|..-..- +..|.-...+ T Consensus 244 -----g~~~--~~~~~~-----------~~~~i~~~~~~~~~~~~~~--~a~~v~n~~~~~~L~~lkd~-~G~~l~~~~~ 302 (404) T protein:vir:39 244 -----GTVP--KKPTIA-----------KFDDVITMINTSVDPAIIA--TSSLLTNQSGLNKLALVKTA-EGKYLLEPDP 302 (404) T ss_pred -----cccc--cccccc-----------cHHHHHHHHHHhhhhhhcc--CCEEEEcHHHHHHHHHhhcc-CCceeeccCc Confidence 0111 011110 1344444433 33333222 34688999999999853221 2223322334 Q ss_pred ccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecc Q lcl|NC_015719. 233 ERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRA 312 (344) Q Consensus 233 ~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~ 312 (344) ..|..++++|++|+.+.+.+.+..+.. ....+.+++ +.++..+..+.++++..+.. T Consensus 303 ~~~~~~~l~G~pV~~~~~~~~~~~~~~-------------~~~~~~gd~-----------~~~~~~~~~~~~~i~~~~~~ 358 (404) T protein:vir:39 303 TKPNSYLIKGKKVIVVADRWLPNSGST-------------VYPLYYGDM-----------SQAITLFDRENMSLLPTNIG 358 (404) T ss_pred CCCCcceecceeEEEecccccCccCCC-------------ccEEEEEec-----------cccEEEEeecceEEEEeccc Confidence 566667999999998876443322110 001111111 12233333455566665543 Q ss_pred h----hhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
313 E----YQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 313 ~----~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) . .....++..++||.++++|++.+.+.+++.| T Consensus 359 ~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~a 394 (404) T protein:vir:39 359 AGAFETDTTKIRVIDRFDVKTTDSEALVAGSFTAIA 394 (404) T ss_pred hhhhhhceeeEEEEeeeccEEecccceEEEEeeccc Confidence 2 2224578889999999999999999988877 No 109 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=99.31 E-value=5.3e-13 Score=87.87 Aligned_cols=296 Identities=13% Similarity=0.078 Sum_probs=157.2 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec-CcceeeeeeCCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI-GRTKAAYLQPGESL 79 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (344) ||+.+.... .+..++|+.++.+..+..|+++.+.+...+.+| .++||+. +.+++.-+..|+.+ T Consensus 1 Mat~tt~~g---------------~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~-~~~~p~~~~~~~a~wv~Eg~~~ 64 (311) T protein:vir:99 1 MATFGTGNL---------------KNLPRNIADGMVKDVVQGSTVAVLSARKPQRFG-NEDIITFNGRPKAEFVGEGQQK 64 (311) T ss_pred CceecCCCc---------------eeccHHHHHHHHHHHHhhchhhhhcceeeccCC-ceEEEEEeCCceeEEeecCccc Confidence 997664111 146789999999999999999999887776544 4688887 66777777788888 Q ss_pred CCCcCCcccceEEEEeeeeeeeceeccchHHH-----HhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|NC_015719. 80 DDKRKDIKHTEKTINIDGLLTADVLIYDIEDA-----MNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNEN 154 (344) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~-----q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~ 154 (344) +.. .++..++++..-+. ..-+.|.+ |. ++..|+.+.+.++.+++|++.+|+.++.-... ..... 
T Consensus 65 ~~~--~~~f~~v~l~~~k~-~~~~~iS~--ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~------~~g~~ 133 (311) T protein:vir:99 65 SST--TGEFDFVTSTPKKA-QVTMRFNE--EVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINP------LTGTV 133 (311) T ss_pred ccc--cceeeEEEEeeEEE-EEeehhhH--HHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCc------ccCcc Confidence 754 34566666655332 22244443 32 34678999999999999999999998732110 00000 Q ss_pred cccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcccccccccc Q lcl|NC_015719. 155 IAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPER 234 (344) Q Consensus 155 ~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~ 234 (344) +.+. ...+... ....+.........++.+..+...+...+....---++++|..+..|.+-..- +..|.-...... T Consensus 134 ~~g~--~~~~~~~-~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~-~G~~l~~~~~~~ 209 (311) T protein:vir:99 134 IPGW--SNYLGAA-SKRVELTADTIANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTARYT-DGRKKFPELGLG 209 (311) T ss_pred cccc--ccccccc-cceeeccccccchhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhhcc-CCCeeecCcccC Confidence 1110 0001000 11111111111112333444444444443321111278899999988653221 122222223345 Q ss_pred ceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeec--c Q lcl|NC_015719. 235 GSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARR--A 312 (344) Q Consensus 235 G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~--~ 312 (344) +..++++|++|+.|+++|........... ...++ ....+.|++. ..+.....+.++++..+. 
+ T Consensus 210 ~~~~~l~G~Pv~~s~~i~~~~~~~~~~~~--~~~~~--~~~~~~Gdf~-----------~~~~~~~~~~~~~~~~~~~~~ 274 (311) T protein:vir:99 210 IGVSSFEGIDASVSDTVNGGDEADPDDED--LDAAR--AVRGIVGDFA-----------NGIHWGVQRDIPVELIKYGDP 274 (311) T ss_pred CCCceecceeeEeecccccccccccccch--hhccC--cceEEEeecc-----------ccEEEEEecCceEEEeecCCC Confidence 55679999999999999865432211000 00000 0111112211 111111223333433321 1 Q ss_pred h-----hhhhh--hhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 313 E-----YQADQ--IIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 313 ~-----~~~d~--i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) + +..|. +++..++|.++++|+++.... +.| T Consensus 275 ~~~~~~~~~d~~~~r~~~r~d~~v~~~~~v~~~~--~~A 311 (311) T protein:vir:99 275 DGQGDLKRHNQIALRLEIVYGWYVFTDRFVVIEN--AVA 311 (311) T ss_pred CcchhhhhcCcEEEEEEEeecceecChhHeeeec--ccC Confidence 1 33343 477889999999986655444 444 No 110 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=99.31 E-value=3.3e-13 Score=89.02 Aligned_cols=297 Identities=10% Similarity=0.062 Sum_probs=162.2 Q ss_pred CCCcccccc-cc---ccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccc-cEEEEee-cCcceeeeee Q lcl|NC_015719. 1 MANMQGGQQ-LG---TNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSG-KSAQFPV-IGRTKAAYLQ 74 (344) Q Consensus 1 ma~~~~~~~-~~---~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G-~tv~i~~-iG~~t~~~~~ 74 (344) +......+. .. .|--.....++--.+..+.|.+++.+..+..+.++++.+..++.++ ..+.+++ .+...+.... 
T Consensus 92 ~~~~~~~~~~~~~~e~~a~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g~~~~~~~~~~~~~~~v~ 171 (404) T protein:vir:10 92 LKQKNQRGLNLSEKEINAISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEPVFTRSGSRTYEKRSKQKPMKPLS 171 (404) T ss_pred HHHHHhhhhcchhhHHhhhccccCCCCceeechhHHHHHHHHHhhhhhHhhhhceeeccCCccceEEEEecCCcceeecc Confidence 111000000 00 0000000111112246689999999999899999999988887632 2455555 4666677777 Q ss_pred CCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|NC_015719. 75 PGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNEN 154 (344) Q Consensus 75 ~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~ 154 (344) .|+..+.+...++.+++++...+.. .-..|.+-=-.++.+++.+.+.++.++++++..|+.|+.- ... +.. T Consensus 172 e~~~~~~~~~~~~f~~i~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~G----~g~----~~~ 242 (404) T protein:vir:10 172 ENQQIPTNGDNGKLERFNFKLKDLA-DFMSIPNDLLKFADKSLEDWIINWFVDKVRITRNAEILYG----AGG----DEH 242 (404) T ss_pred ccccccccccccceeeeEeeheeeE-eeehhhHHHHhhcHHHHHHHHHHHHHHHHHHHHHHHHhhc----CCC----CCc Confidence 7777665433345566666554442 2244554212235678999999999999999999988621 111 111 Q ss_pred cccccCceeeecccccccccchhhHHHHHHHHHHHHH-HHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccc Q lcl|NC_015719. 155 IAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARA-ALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPE 233 (344) Q Consensus 155 ~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~-~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~ 233 (344) +.|......+ ...+... ...++.+..+.. .|....-+ +-.+|++|..|..|.+-... +..|.-...+. 
T Consensus 243 ~~gi~~~~~~-----~~~~~~~---~~~~~~~~~~~~~~l~~~~~~--~~~~v~n~~~~~~L~~lkd~-~G~~l~~~~~~ 311 (404) T protein:vir:10 243 ATGIMTANKF-----KKITLPK---SPALKDFKKCKNVELLNVFKA--TSSWIVNQDGFNYLDSLEDK-TGRPYLQPDPK 311 (404) T ss_pred ccceeecccc-----ceeeccc---cccHHHHHHHHHhhhhccccC--CCEEEEcHHHHHHHHHhhcc-CCceeeccCcC Confidence 1111111000 0000000 012455554443 34333322 23578999999988763222 23333223455 Q ss_pred cceeEEEeCeEEEEecc-ccccccccccccccccccccccccccccccccccceeEEEec-HHHHhhhhhheeeeeeeec Q lcl|NC_015719. 234 RGSIRNVMGFEVVEVPH-LTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQH-RSAVGTVKLKDLALERARR 311 (344) Q Consensus 234 ~G~Vg~i~G~~V~~sn~-lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~-~~Av~~~~~~~~~~e~~~~ 311 (344) .|...+++|.+|+.+++ +|..+.+. .+.++.. ++++..+....++++..++ T Consensus 312 ~~~~~~l~G~PV~~~~~~~~~~~~~~---------------------------~~~~~gd~s~~~~~~~~~~~~i~~~~~ 364 (404) T protein:vir:10 312 DPTQYRFLGLPVIELPNDLLLSTESA---------------------------IPVLLGDTKEAYKYVSDGAYELATTNI 364 (404) T ss_pred CCCCccccceeeEEecccccCCCCCc---------------------------cEEEEEeccccEEEEEecceEEEEecc Confidence 67778999999986544 33221111 1112222 2233334444556666544 Q ss_pred c--hh--hhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
312 A--EY--QADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 312 ~--~~--~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) + .+ -.-.+++.+++|.++++|++.+.++++..| T Consensus 365 ~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~aa 401 (404) T protein:vir:10 365 GAGAFETNTTKARIIMRIDGNVKDSEALLIAEIPVES 401 (404) T ss_pred ccchhhcCceEEEEEEeeccEEecccceEEEEeeccc Confidence 3 12 223588999999999999999999999999 No 111 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=99.31 E-value=2.5e-13 Score=89.72 Aligned_cols=281 Identities=14% Similarity=0.081 Sum_probs=163.6 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeeccc--ccEEEEeecC--cceeeeeeCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISS--GKSAQFPVIG--RTKAAYLQPG 76 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~--G~tv~i~~iG--~~t~~~~~~g 76 (344) ...+.++............+++--.+..+.|..++.+..+..+.++++.+...+.+ |+ ..++... ...+..+..| T Consensus 95 ~~~l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~-~~~~~~~~~~~~a~~v~E~ 173 (397) T protein:vir:49 95 KNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVENVTTLTGS-RVYEKWTDITGLANIDDEA 173 (397) T ss_pred HHHHhcchhHHHHHhhccccccCcccccHhHHHHHHHHHHhhhhHHhhhceeecccCccc-eEEEeeccCCcceeeecCc Confidence 00000000000000001111222235679999999999999999999988887753 33 4455433 3445666667 Q ss_pred CCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Q lcl|NC_015719. 77 ESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIA 156 (344) Q Consensus 77 ~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (344) ..++... .++..++++.+.+.. .-..|.+-=-.++.+|+.+.+.++.+++|++..|+.|+.... 
T Consensus 174 ~~~~~~~-~~~~~~i~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ai~~G~g-------------- 237 (397) T protein:vir:49 174 GKIADVD-DPKLSLIKYTIKRYA-GISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIA-------------- 237 (397) T ss_pred ccccccc-ccceeeEEeeeeeEE-eeehhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcc-------------- Confidence 7765322 245567677664443 334565422234568999999999999999999998863211 Q ss_pred cccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccce Q lcl|NC_015719. 157 GLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGS 236 (344) Q Consensus 157 ~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~ 236 (344) .++ ..++. .-++.|.++...|..+..+. -.+|++|..|..|..-..- +..|.-...+..|. T Consensus 238 ---~~~--~~~~~-----------~~~d~i~~~~~~l~~~~~~~--a~~vmn~~~~~~l~~lkd~-~G~~l~~~~~~~~~ 298 (397) T protein:vir:49 238 ---ALP--TKPTL-----------TKWDDIIDLEAKVDPAIKQT--SFFLTNTSGFTALKKVKNA-LGDYLMERDVKSPT 298 (397) T ss_pred ---ccc--ccccc-----------ccHHHHHHHHHhhhhhhcCC--CEEEEcHHHHHHHHHhhcC-CCceeeccCcCCCC Confidence 000 00000 01567788888888877643 4678999999988753222 22332222355677 Q ss_pred eEEEeCeEEEEecc--ccccccccccccccccccccccccccccccccccceeEEEec-HHHHhhhhhheeeeeeeecc- Q lcl|NC_015719. 237 IRNVMGFEVVEVPH--LTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQH-RSAVGTVKLKDLALERARRA- 312 (344) Q Consensus 237 Vg~i~G~~V~~sn~--lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~-~~Av~~~~~~~~~~e~~~~~- 312 (344) -+.++|++|+.+.+ +|..+.... +.++.. +.++..+..+.++++..+.. T Consensus 299 ~~~l~G~PV~~~~~~~~~~~~~~~~---------------------------~i~~gd~~~~~~~~~~~~~~i~~~~~~~ 351 (397) T protein:vir:49 299 GYSIDGFAVKEVADRWLANGTGGAM---------------------------PLYFGDLKQAVTLFDRQHMSLLSTNIGG 351 (397) T ss_pred CceecceeeEEecccccccccCCce---------------------------eEEEeeccceEEEEeecceEEEEecccc Confidence 78999999998655 332221110 111111 12333334455566654422 Q ss_pred h-h--hhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
313 E-Y--QADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 313 ~-~--~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) + + ....+++..+++.++++|++.+.+++++.+ T Consensus 352 ~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~ 386 (397) T protein:vir:49 352 GAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIA 386 (397) T ss_pred chhhcCceeEEEEeeeCcEEecccceEEEEeeccc Confidence 1 2 223578889999999999999999999887 No 112 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=99.30 E-value=3.9e-13 Score=88.61 Aligned_cols=280 Identities=15% Similarity=0.086 Sum_probs=173.8 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhc--CCce----eee----cccccEEEEeecCcc-- Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTA--NRHM----QRQ----ISSGKSAQFPVIGRT-- 68 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~--~~~~----~~~----i~~G~tv~i~~iG~~-- 68 (344) ||. |+.. | -+-.|+|...|.+...+.+.|. +.+. ..+ -.+|+++.+|..+.. T Consensus 1 MA~--------T~ls------d--~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~G 64 (324) T protein:vir:59 1 MAY--------TKIS------D--VIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDG 64 (324) T ss_pred CCc--------eeee------c--eechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCCC Confidence 884 2221 1 1455999999999888887662 2221 111 237999999999875 Q ss_pred eeeeeeCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc Q lcl|NC_015719. 69 KAAYLQPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLA 148 (344) Q Consensus 69 t~~~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~ 148 (344) ...++..++.++. ..+..++..-+|= .....+.+.|+....+--|++.++.++.+..+++..+..++..|.+..... 
T Consensus 65 d~~~v~~~~~i~~--~~l~t~~~~a~i~-~~~k~~~~tD~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l~g~~~~~ 141 (324) T protein:vir:59 65 DSQVLNDTDDLVP--QKINAGQDKAVLI-LRGNAWSSHDLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAELAGVFSND 141 (324) T ss_pred cccccCCCcccch--hhcccceeeEEEE-eecCceeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc Confidence 4677887887764 4577777665553 567788999988888888999999999999999999999988775433221 Q ss_pred cccccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcccc Q lcl|NC_015719. 149 DGVNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAA 228 (344) Q Consensus 149 ~~~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~ 228 (344) .. ....+++.++.+. ..-.+.|.+|..+|.++. ..-..+++.|..|..|.+........+.. T Consensus 142 ~~---------~~~~~dvsa~~~~-------~~s~~~l~~A~~~~GD~~--~~~~~ivmhS~v~~~L~~~~li~~~~~s~ 203 (324) T protein:vir:59 142 DM---------KDNKLDISGTADG-------IYSAETFVDASYKLGDHE--SLLTAIGMHSATMASAVKQDLIEFVKDSQ 203 (324) T ss_pred cc---------ccceeeeeccccc-------eecHHHHHHHHHHhCCcc--cCcEEEEEchHHHHHHHHhhhhhhccccc Confidence 11 1122222222111 011466778888887753 23357889999999998764322122211 Q ss_pred ccccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhh-eeeee Q lcl|NC_015719. 229 LIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLK-DLALE 307 (344) Q Consensus 229 ~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~-~~~~e 307 (344) .++.|+.+.|.+|+.+..+|....++. . .....+++-+.|++....+ ++.+| T Consensus 204 ----~~~~i~~~~G~~VivdD~~p~~~~~~~-----------~------------~~y~s~l~~~GAi~~~~~~~~v~vE 256 (324) T protein:vir:59 204 ----SGIRFPTYMNKRVIVDDSMPVETLEDG-----------T------------KVFTSYLFGAGALGYAEGQPEVPTE 256 (324) T ss_pred ----cCceeeeecccEEEEeCCCCccccCCC-----------C------------ceEEEEEEecCeEEEeecCCCccee Confidence 246789999999999999986322110 0 1112244556666666543 46789 Q ss_pred eeecchhhhhhhhhhhhhcCcee--ccccEEEEEecCCC Q lcl|NC_015719. 
308 RARRAEYQADQIIAKYAMGHGGL--RPESAGALVFKAGA 344 (344) Q Consensus 308 ~~~~~~~~~d~i~~~~~~G~~v~--Rp~~~~~l~~~~~a 344 (344) ..|++..-.|.+...+.|...+. ......+-....+- T Consensus 257 ~dRd~~~g~~~l~~r~~~~~~p~G~s~~~~~~~~~sPt~ 295 (324) T protein:vir:59 257 TARNALGSQDILINRKHFVLHPRGVKFTENAMAGTTPTD 295 (324) T ss_pred cccCccccceEEEEeeEEEeEeeeEEecccccCCCCCCh Confidence 99998877777766666554332 11100000001110 No 113 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=99.29 E-value=4.3e-13 Score=88.38 Aligned_cols=273 Identities=15% Similarity=0.028 Sum_probs=156.3 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecC---cceeeeeeCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIG---RTKAAYLQPGE 77 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG---~~t~~~~~~g~ 77 (344) +..... .-.+.....++.-.+.++.|..++++.-.+.+.++++.+..++. +.++.|++.- .........|+ T Consensus 98 ~~~~~~-----~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~v~Eg~ 171 (379) T protein:vir:10 98 GKSIQV-----KAVGDMTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGAVSIS-GGTYTFVRENGAGEGAIGAQVEGA 171 (379) T ss_pred hhhhhh-----hhhcccccCCCCccccchhhhhHHHHhHHhhhhHHhhceeeecc-CCceEEEEeecCCCcccccccCCc Confidence 111100 00011111222223567889999999998888999998887765 4557887642 22333445666 Q ss_pred CCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc Q lcl|NC_015719. 78 SLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAG 157 (344) Q Consensus 78 ~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~ 157 (344) ..+.. .++.+++++.+.++-- -+.|.+ +-.+...++.+.+..+.+++|++..|+.++.-+. 
T Consensus 172 ~~~~~--~~~f~~i~~~~~k~~~-~~~iS~-ell~D~~~l~~~i~~~la~~~~~~~~~~~~~g~~--------------- 232 (379) T protein:vir:10 172 TKGQK--DYDISMIDVNTDFIAG-FTRYSK-KMANNLPFLTSFIPNALRRDYAKAENAAFNAVLA--------------- 232 (379) T ss_pred ccccc--ccceeeeEeeeeeEEe-eehhhH-HHHhhHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------------- Confidence 66543 3566776666655532 234543 2233344578888888999999999988753110 Q ss_pred ccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcccc--ccccccc Q lcl|NC_015719. 158 LGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAA--LIDPERG 235 (344) Q Consensus 158 ~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~--~~~~~~G 235 (344) .+........+. ...++.|.++...+...+.+.. .+|++|..|..|.+-..- +..|.. ......| T Consensus 233 --~~~~~~~~~~~~--------~~~~d~i~~~~~~~~~~~~~~~--~~vmn~~~~~~l~~lkd~-~G~~l~~~~~~~~~~ 299 (379) T protein:vir:10 233 --ANATASTEIITN--------KNKVEMLINEIAKQENLDFPVT--AIVLRPTDYYDILVTQKS-VGAGYGLPGVVTQDN 299 (379) T ss_pred --cccccccccccC--------cccHHHHHHHHHhhhhccCCCC--EEEEcHHHHHHHHHhhcc-CCceeccCCccCCCC Confidence 000000000111 1124667777777777766433 467899999988653322 223322 1123356 Q ss_pred eeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecch-h Q lcl|NC_015719. 236 SIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAE-Y 314 (344) Q Consensus 236 ~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~-~ 314 (344) ...+++|++|+.|+.+|.+.. +-+++. . +.++ ..+.++++..++.. + T Consensus 300 ~~~~l~G~pvv~s~~~~ag~~--------------------~~gdf~--~-~~~~---------~~~~~~i~~~~~~~~~ 347 (379) T protein:vir:10 300 GVLRINGIPLFRATWLAANKY--------------------YVGDWT--R-VTKV---------TTEGLSLEFSEVEGTN 347 (379) T ss_pred CcceecceeeEecCCCCCCce--------------------EEeecc--c-EEEE---------EEeceEEEEeeccccc Confidence 666899999999999975321 111111 1 1111 12334566655542 2 Q ss_pred hh---hhhhhhhhhcCceeccccEEEEEecCC Q lcl|NC_015719. 
315 QA---DQIIAKYAMGHGGLRPESAGALVFKAG 343 (344) Q Consensus 315 ~~---d~i~~~~~~G~~v~Rp~~~~~l~~~~~ 343 (344) |. -.+++..++|.++++|++.+.+.+++= T Consensus 348 f~~~~~~~r~~~R~~~~v~~p~a~v~~~~~~~ 379 (379) T protein:vir:10 348 FVKNNITARIEAQVALAVEQPAALIFGDFTAV 379 (379) T ss_pred ccCCcEEEEEEEEeccEEecCccEEEEEecCC Confidence 22 356677899999999999999888877 No 114 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=99.29 E-value=6.1e-13 Score=87.55 Aligned_cols=296 Identities=13% Similarity=0.067 Sum_probs=158.5 Q ss_pred CCC-ccccc--cccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEee-cCcceeeeeeCC Q lcl|NC_015719. 1 MAN-MQGGQ--QLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPV-IGRTKAAYLQPG 76 (344) Q Consensus 1 ma~-~~~~~--~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~-iG~~t~~~~~~g 76 (344) +.+ +..+. .. -+.| ..++--.+..+.|..++.+..+..+.++++.+..++.+++ +++|. .+.+++.-...| T Consensus 117 f~~~l~~~e~~~a-l~~~---t~~~gG~lvP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~~~~~~~~~a~wv~E~ 191 (425) T protein:vir:10 117 FKAHVKRGDVQAA-LNKG---EDSEGGYLTPIEWDRTITNKLVLISPMRQLCRVQPVSKAG-FSKLFNMGGTTSGWVGEA 191 (425) T ss_pred HHHHhhhhhhHHH-hhcC---cCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeeccCCc-eEEEEEcCCcceeeeccc Confidence 000 00000 00 0000 1111112567999999999999999999999888776554 55554 455566555566 Q ss_pred CCCCCCcCCcccceEEEEeeeeeeec-eeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|NC_015719. 77 ESLDDKRKDIKHTEKTINIDGLLTAD-VLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENI 155 (344) Q Consensus 77 ~~~~~~~~~~~~~~~~l~iD~~~~~~-~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~ 155 (344) ..++.+. ..+..++++.. .++.. ..|.+-=-.++.+|+.+.+.++.++++++..|+.++.- .. . 
..| T Consensus 192 ~~~~~~~-~~~f~~v~~~~--~k~~~~i~iS~ell~ds~~~l~~~i~~~la~ai~~~~d~~~l~G----~G-~----~~p 259 (425) T protein:vir:10 192 SQRPQTN-AATFQPLSFAS--GEIYANPAATQQILDDAEIDLESWLATEVQTEFAKQEGKAFLAG----DG-T----NKP 259 (425) T ss_pred ccccccc-ccccceeeeeh--eeeEeehHhHHHHHhcchhHHHHHHHHHHHHHHHHHHHhhhhcc----cC-C----CCc Confidence 6655331 12345555544 43333 34444222245689999999999999999999988621 10 0 111 Q ss_pred ccccCceeeeccccc------ccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccc Q lcl|NC_015719. 156 AGLGKPSLLEVGAKA------DLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAAL 229 (344) Q Consensus 156 ~~~~~~~~i~~~~~~------~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~ 229 (344) .|.-........... ............++.|+++...|..... .+-.+|++|..|..|.+-..- +..|.=. T Consensus 260 ~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~~--~~a~~vmn~~~~~~L~~lkD~-~G~~l~~ 336 (425) T protein:vir:10 260 NGLLTYIAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVYDLPSAFT--GNARFAMNRNTQRQVRKLKDG-QGNYLWQ 336 (425) T ss_pred ceeeeccccccccccccccccccccccccccccHHHHHHHHhhhhhhhc--cCCEEEEchHHHHHHHHhhcC-CCceeec Confidence 111110000000000 0000011122236778888777766554 233568999999988653221 1222212 Q ss_pred cccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeee Q lcl|NC_015719. 230 IDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERA 309 (344) Q Consensus 230 ~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~ 309 (344) ..+.+|.-++++|.+|+.++++|.......++ .-|++. . ++..+..+.+++ . T Consensus 337 ~~~~~g~~~~l~G~PV~~~~~~p~~~~~~~~i---------------~~Gd~~--~---------~~~i~~~~~~~v--~ 388 (425) T protein:vir:10 337 PSYVAGQPATLAGYPVTEVPDMPDVAANSTPI---------------LFGDFQ--Q---------TYLIIDRIGVRV--L 388 (425) T ss_pred cCccCCCCceecceeeEEecCcCCccCCccEE---------------EEEehh--c---------cEEEEEecceEE--E Confidence 23456777899999999999998543322211 111111 1 111122222222 3 Q ss_pred ecchhhh--hhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
310 RRAEYQA--DQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 310 ~~~~~~~--d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +++-... -.+++..+++.++++|++...|.+++.= T Consensus 389 ~d~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~as~ 425 (425) T protein:vir:10 389 RDPYTAKPYVLFYTTKRVGGGLLNPEPMRAMKVAASE 425 (425) T ss_pred ecccccCCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 3322112 3466788999999999999888776655 No 115 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=99.28 E-value=4.8e-13 Score=88.10 Aligned_cols=292 Identities=10% Similarity=0.075 Sum_probs=156.5 Q ss_pred CCCccccccc-----cccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcc-eeeeee Q lcl|NC_015719. 1 MANMQGGQQL-----GTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRT-KAAYLQ 74 (344) Q Consensus 1 ma~~~~~~~~-----~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~-t~~~~~ 74 (344) |......... ..+.......++--.+..+.+..++.+..+..+.++++++..++. |+ ++||+.... .+..+. T Consensus 119 ~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~Ii~~l~~~~~i~~~~~~~~~~-g~-~~ip~~~~~~~a~~v~ 196 (425) T protein:vir:95 119 LKTGEYYKRSEVVEFYEKFRNLRAVAGGELTIPEVVVNRIMDIMGDYTTLYPLVDKIRVK-GT-TRILVDTDTSPATWIE 196 (425) T ss_pred HhhhhhhhhhHHHHHHHHHHhhcccccCceeccHHHHHHHHHHHHhhhhHHHhhceeecC-ce-eEEEEecCCccccccc Confidence 1110000000 000000011111223577899999999999999999999887764 54 467776544 334455 Q ss_pred CCCCCCCCcCCcccceEEEEeeeeeeec-eeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|NC_015719. 75 PGESLDDKRKDIKHTEKTINIDGLLTAD-VLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNE 153 (344) Q Consensus 75 ~g~~~~~~~~~~~~~~~~l~iD~~~~~~-~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~ 153 (344) .|..++.... ...+++++.. .++.. +.|.+-=-.++..++.+.+..+.++++++..|+.++.- .+.... 
T Consensus 197 E~~~~~~~~~-~~f~~i~l~~--~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~G-------~G~~~~ 266 (425) T protein:vir:95 197 QSGALPTGDV-GTIASIDFDG--FKVGKVTFVDNYLLQDSIINLDDYVTKKIARAIAKALDLAIVKG-------TGAANK 266 (425) T ss_pred cccccccccc-cccceeeeeh--eeeeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhcc-------CCCCcc Confidence 5666654321 1345555544 44443 44554222335568999999999999999999988621 111011 Q ss_pred ccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHH-HHHHHhccchhh--hhcccccc Q lcl|NC_015719. 154 NIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPD-VYSAILAALMPN--AANYAALI 230 (344) Q Consensus 154 ~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~-~~~~Ll~~~~~~--~~~~~~~~ 230 (344) .|.|...+ +......+. ......++.|.++...+.....+..+-+++++|. +|..|..-.... +..|... T Consensus 267 ~p~Gil~~----~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~l~~l~~~kd~~g~~i~~- 339 (425) T protein:vir:95 267 QPLGIIPS----LPPENQVTV--EADNNLLKNLVKQIGLIDTGDDSVGEIVAVMKRSTYYNRLVEFSIQVDSNGNVVGK- 339 (425) T ss_pred ccceeecc----ccccccccc--ccccchHHHHHHHHHhhhhhccccCceEEEEeChHHHHHHHHHHhhcCCCCceeec- Confidence 11111110 011111110 0112236677777777776665544444455555 455443321111 2223211 Q ss_pred ccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeee Q lcl|NC_015719. 231 DPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERAR 310 (344) Q Consensus 231 ~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~ 310 (344) ...+...+++|.+|+.++++|...+ .-|++. .. + .+..+.++++... T Consensus 340 -~~~~~~~~l~G~pvv~~~~~~~~~i--------------------~~Gd~~--~~--~--------~~~~~~~~i~~~~ 386 (425) T protein:vir:95 340 -LPNLRTPDLLGLRVVFNNFLDDDTV--------------------LFGEFE--QY--T--------LVERENITIDSST 386 (425) T ss_pred -cCCCCCccccceeeEEcCcCCCccE--------------------EEEecc--cE--E--------EEeecceEEEeec Confidence 2355567899999999999985421 001111 10 1 1123445566555 Q ss_pred cchhh--hhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
311 RAEYQ--ADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 311 ~~~~~--~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +..+- ...+++..++++++++|++.+.+.++... T Consensus 387 ~~~f~~~~~~~~~~~r~d~~~~~~~a~~~~~i~~~~ 422 (425) T protein:vir:95 387 HVKFTEDQTAFRGKGRFDGKPVKPEAFVLVTITDPV 422 (425) T ss_pred ccccccCceEEEEEEeeCcEeecccceEEEEecCcC Confidence 54222 23677888999999999999999998855 No 116 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=99.27 E-value=1.7e-12 Score=85.15 Aligned_cols=282 Identities=12% Similarity=0.063 Sum_probs=152.4 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecC-cceeeeeeCCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIG-RTKAAYLQPGESL 79 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG-~~t~~~~~~g~~~ 79 (344) ||+.++.. |. .+..+.++.++.+..++.+.++.+.+..++. +.+++||+.. .+.+.-+..|... T Consensus 1 ma~~t~~~------gg--------~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~-~~~~~~p~~~~~~~a~wv~E~~~~ 65 (305) T protein:vir:25 1 MADISRAE------VA--------SLIQEAYSDTLLAAAKQGSTVLSAFQNVNMG-TKTTHLPVLATLPEADWVGESATD 65 (305) T ss_pred CCCccCCc------cc--------eecCHHHHHHHHHHHHhhchhhhhcceeecc-CCcEEEEEEeCCcceEEeeccccc Confidence 88866522 11 2577899999999999999999999988875 4467787754 4456666666655 Q ss_pred CCCcC---CcccceEEEEeeeeeeec-eeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|NC_015719. 80 DDKRK---DIKHTEKTINIDGLLTAD-VLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENI 155 (344) Q Consensus 80 ~~~~~---~~~~~~~~l~iD~~~~~~-~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~ 155 (344) +.... .++..++++.. .++.. ..|.+-=-.++.+|+.+.+.++.+++|++..|+.++.-- ... 
....+ T Consensus 66 ~~~~~~~s~~~f~~i~~~~--~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~----g~~--~~~~~ 137 (305) T protein:vir:25 66 PKGVKPTSKVTWANRTLVA--EEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGT----DKP--ASWVS 137 (305) T ss_pred ccccccccccceeeEEeee--EEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhheecc----CCC--CCccc Confidence 43211 22334444443 44333 445441122356889999999999999999999987311 100 00000 Q ss_pred ccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccc Q lcl|NC_015719. 156 AGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERG 235 (344) Q Consensus 156 ~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G 235 (344) .+. .+.....+.....+........+++.+..+...+....-... -++++|..|..|.+-.. -.+.-.+.. T Consensus 138 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~v~~~~~~~~l~~lkd-----~~G~~i~~~- 208 (305) T protein:vir:25 138 PAL-IPAAVTAGQAVEVVGGVANESDIVGATNRAAKAVASAGWAPD--TLLSSLALRYEVANIRD-----ANGNPVFRD- 208 (305) T ss_pred ccc-ccccccccccccccccchhhhHHHHHHHHHHHhhhhcccccc--eeEecHHHHHHHHHhhc-----cCCceeecC- Confidence 000 000000000000111111122234545555544443332111 26779999998864211 112222222 Q ss_pred eeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecc--- Q lcl|NC_015719. 236 SIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRA--- 312 (344) Q Consensus 236 ~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~--- 312 (344) +.++|.+|+.++++|....... ..-+++. + ...+..+.++++..++. T Consensus 209 --~~l~G~Pv~~~~~~~~~~~~~~----------------~~~gd~s--~----------~~i~~~~~~~i~~~~~~~~~ 258 (305) T protein:vir:25 209 --DSFAGFRTFFNRNGAWDADAAI----------------EVIADSS--R----------VKIGVRQDITVKFLDQATLG 258 (305) T ss_pred --CcccccceEEcCccCCCCCccE----------------EEEEecc--e----------EEEEEecCeEEEEeeeeeee Confidence 3789999999999875432211 1111211 1 11122233344444321 Q ss_pred -------hhhhh--hhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
313 -------EYQAD--QIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 313 -------~~~~d--~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) .++.| .++...++|..++||++++.+.....| T Consensus 259 ~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~~ 299 (305) T protein:vir:25 259 TGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVA 299 (305) T ss_pred cCCceeeeeecCcEEEEEEEeecceeeCcccEEEEcccccc Confidence 12222 356778899999999999998886655 No 117 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=99.27 E-value=1.9e-12 Score=84.80 Aligned_cols=296 Identities=14% Similarity=0.086 Sum_probs=159.9 Q ss_pred CCCccc-----------------cccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCC-ceeeecccccEEEE Q lcl|NC_015719. 1 MANMQG-----------------GQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANR-HMQRQISSGKSAQF 62 (344) Q Consensus 1 ma~~~~-----------------~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~-~~~~~i~~G~tv~i 62 (344) |+..++ ..+ ....+....+| .+..+.+..+|.+..+..+.++.+ .+..+...| .+.+ T Consensus 105 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~gg---~lvP~~~~~~ii~~l~~~~~i~~~~~~~v~~~~~-~~~~ 179 (435) T protein:vir:80 105 LAAARGDAQLASKLAIERGFGEEVAM-SLNTLSPGAGG---VLVPENLSSEVIELLRPKSVVRKLGARTLPLSNG-NITI 179 (435) T ss_pred HHhccchhHHHHHHHHhhhhhhhhhh-hhcccCCCCCc---cccchhHHHHHHHHHhhhchhhhccceeeecCCC-ceEE Confidence 111000 000 00011111111 245688999999988888888776 344444444 4778 Q ss_pred eec-CcceeeeeeCCCCCCCCcCCcccceEEEEeeeeeeeceeccch--HHHHhChhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015719. 63 PVI-GRTKAAYLQPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDI--EDAMNHYDVRSEYTSQIGESLAMAADGAVLA 139 (344) Q Consensus 63 ~~i-G~~t~~~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~--D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~ 139 (344) |+. +.+.+.-+..|+.++.. +++.+++++.+.+.. .-+.|.+- +.....+++.+.+.++.+++|++..|+.++. 
T Consensus 180 p~~~~~~~a~~v~E~~~~~~~--~~~f~~i~~~~~k~~-~~~~is~ell~ds~~~~~l~~~i~~~l~~a~~~~~d~a~l~ 256 (435) T protein:vir:80 180 PRLKGGAIVGYIGADTDIPTT--QQQFDDLKLTAKKMA-ALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIR 256 (435) T ss_pred EEEeCCcceeeeccCcccccc--ccceeeEEEeeEEEE-EeehhhHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 776 55565556667766643 356667666665543 23455431 1222245788999999999999999998863 Q ss_pred HHHHhhhcccccccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccc Q lcl|NC_015719. 140 ELAGLINLADGVNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAAL 219 (344) Q Consensus 140 ~~~~~a~~~~~~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~ 219 (344) - .. ....|.|......... . ...........++..+.++...|..++....+-.+|++|..|..|..-. T Consensus 257 G----~G----~~~~p~Gi~~~~~~~~--~-~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lk 325 (435) T protein:vir:80 257 D----DG----TANTPKGLRFWALPGN--V-ITASDGSTLQKIETDLGKAILALENADANLTQPGWIMAPRTFRFLEGLR 325 (435) T ss_pred c----CC----CCCcccceeecccccc--e-eecccccchhhHHHHHHHHHHHhhccccccccCEEEEcHHHHHHHHhhh Confidence 1 10 1111222211110000 0 0011111223345567777777777766544556689999998885432 Q ss_pred hhhhhccccccccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhh Q lcl|NC_015719. 220 MPNAANYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTV 299 (344) Q Consensus 220 ~~~~~~~~~~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~ 299 (344) . .+..|.-. .... ++++|.+|+.++++|........ ....+-+++. 
-+..+ T Consensus 326 d-~~G~~l~~-~~~~---~~l~G~pv~~~~~~p~~~~~~~~------------~~~i~~gd~s------------~~~i~ 376 (435) T protein:vir:80 326 D-GNGNKVYP-ELAN---GMLKGYPVGKTTQVPINLGEAGK------------ESEIYFTDFG------------DVFIG 376 (435) T ss_pred c-cCCceecc-CCCC---CeEeeeeeEEeccccccccCCCC------------cceEEEEEcc------------cEEEE Confidence 1 12222110 1123 37899999999999964322110 0011112211 11122 Q ss_pred hhheeeeeeeecch-----------hh--hhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 300 KLKDLALERARRAE-----------YQ--ADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 300 ~~~~~~~e~~~~~~-----------~~--~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ....++++..++.. ++ .-.++...+|+.++.||++.+.|.--.-. T Consensus 377 ~~~~~~i~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~~~~~ 434 (435) T protein:vir:80 377 EEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSGVAWG 434 (435) T ss_pred eecceEEEEeccccccccccchhhhhhcCcceeeeeeeeCcEeecccceEEEeccCCC Confidence 34455666665542 11 24668899999999999999999866655 No 118 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=99.27 E-value=2.8e-13 Score=89.41 Aligned_cols=275 Identities=11% Similarity=0.010 Sum_probs=164.0 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcce---eeeeeCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRTK---AAYLQPGE 77 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t---~~~~~~g~ 77 (344) |-+.... .-.|.+....+| -.+..+.+..++.+..+..+.++++++..++.++ ++++++..... +.....|. 
T Consensus 104 ~~~~~~~--~~~ra~~t~~~g--g~liP~~~~~~Ii~~~~~~~~l~~l~~~~~~~~~-~~~~~~~~~~~~~~~~~~~E~~ 178 (421) T protein:vir:13 104 IRGIQLS--EEERDIMSSTNN--GAVIPQEFVNEFEKLKEGYPSLKEHCHVIPVNRN-AGKMPVRAGASVDKLANLAKDT 178 (421) T ss_pred hhccchh--HHHhhccccCCc--ceecchhhHHHHHHHHHhhhhhhhhceeeeccCC-ceEEEEeecCCccceeeccccc Confidence 1110000 002222222222 2356789999999999888999999988877544 45666543322 33444555 Q ss_pred CCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc Q lcl|NC_015719. 78 SLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAG 157 (344) Q Consensus 78 ~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~ 157 (344) .++.+ .++..++++.+.+.. .-+.|.+-=-.++.+|+.+.+.++.+++++...|..++.++.+.. T Consensus 179 ~~~~s--~~~f~~i~~~~~k~~-~~v~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~~~~g~~------------ 243 (421) T protein:vir:13 179 ELVKA--MLKTQPMAYDIDDYG-LLAPIDNSLLEDSEINFLEFVNEEFAEFAVNTENAEIVKQAKAVL------------ 243 (421) T ss_pred ccccc--ccceeEEEeeeeeeE-eehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHhhhhHhhhhhhcc------------ Confidence 55433 345566666665443 234455422234568899999999999999999988864332110 Q ss_pred ccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcccccccccccee Q lcl|NC_015719. 158 LGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSI 237 (344) Q Consensus 158 ~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V 237 (344) ...+. .-++.|+++...|..+..+. -.+|++|..|..|..-..- +..|.-. ....|.. 
T Consensus 244 -------~~~~~-----------~~~d~i~~~~~~l~~~~~~~--a~~v~n~~~~~~l~~lkd~-~G~~i~~-~~~~~~~ 301 (421) T protein:vir:13 244 -------AEETI-----------NDYAGLVKTINSLVPNARKR--AIIVTNSDGRAYLDGLMDK-QGRPLLK-ELSDGGD 301 (421) T ss_pred -------ccccc-----------cchHHHHHHHHHhhhhhcCC--CEEEEcHHHHHHHHHhhcC-CCceeec-CcCCCCC Confidence 00000 01567777888887776643 3567899999988753211 2223211 2446667 Q ss_pred EEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHH-HHhhhhhheeeeeeeecchhhh Q lcl|NC_015719. 238 RNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRS-AVGTVKLKDLALERARRAEYQA 316 (344) Q Consensus 238 g~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~-Av~~~~~~~~~~e~~~~~~~~~ 316 (344) ..++|.+|+.++++|..+.+.. ..++..-+ ++..+..+.++++..++..+.. T Consensus 302 ~tl~G~pV~~~~~~~~~~~~~~---------------------------~~~~gd~~~~~~~~~~~~~~v~~~~~~~f~~ 354 (421) T protein:vir:13 302 LVFKGRPVIELEESIFDVGDET---------------------------KFIVSDFKTLIKFMDRKQYLIDQSKEAGYTK 354 (421) T ss_pred ceecceeeEEeccccccCCCce---------------------------EEEEEeccccEEEEEecceEEEeeccccccc Confidence 7999999999999886432211 11222211 2334445667888877765444 Q ss_pred h--hhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 317 D--QIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 317 d--~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) + .+++..+++.++++|++...+.....+ T Consensus 355 ~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~ 384 (421) T protein:vir:13 355 NETIARIIERFDVNSPLDKSSDAEKIRKFG 384 (421) T ss_pred CeeEEEEEeeecceeecchhhheeeecccc Confidence 3 588899999999999998766655433 No 119 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=99.26 E-value=6.3e-13 Score=87.48 Aligned_cols=300 Identities=11% Similarity=0.018 Sum_probs=156.5 Q ss_pred CCCcccccc--ccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEee-cCcceeeeeeCCC Q lcl|NC_015719. 
1 MANMQGGQQ--LGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPV-IGRTKAAYLQPGE 77 (344) Q Consensus 1 ma~~~~~~~--~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~-iG~~t~~~~~~g~ 77 (344) |-+...... .-.|--.....++--.+..+.|..++.+..+..+.++.+.+..++.++ +.+++. .+.+.+.-...|. T Consensus 91 lr~~~~~~~~~~e~~a~~~~~~~~GG~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~a~wv~E~~ 169 (401) T protein:vir:44 91 LRKGREDGLRDLERKALQVGTDEDGGYAVPEELDRSILSLLKDEVVMRQEATVITVGGS-DYKKLVNLGGTASGWVGETD 169 (401) T ss_pred HhhhhhhhhHHHHHHHhhcCCCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCC-ceEEEEecCCccceeecccc Confidence 000000000 000000000000101246699999999999999999999888777544 445554 4544454444455 Q ss_pred CCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc Q lcl|NC_015719. 78 SLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAG 157 (344) Q Consensus 78 ~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~ 157 (344) ..+.+. ..+.+++++.+-+.. .-..|.+-=-.++.+|+.+.+.++.++++++..|+.++.- .. +. .|.| T Consensus 170 ~~~~~~-~~~~~~v~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~la~ai~~~~~~~~l~G----~G-~~----~p~G 238 (401) T protein:vir:44 170 TRSQTA-TSRLGLIEPFMGEIY-GNPQATQKMLDDAFFNVEAWINSELATEFAEQEEIAFTTG----DG-TK----KPKG 238 (401) T ss_pred ccCccc-cccceeeeeehhhee-eehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhcc----CC-CC----ccce Confidence 544321 124455555554332 2234444222235678999999999999999999988631 10 00 1111 Q ss_pred ccCc-eeeec-----ccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccc Q lcl|NC_015719. 158 LGKP-SLLEV-----GAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALID 231 (344) Q Consensus 158 ~~~~-~~i~~-----~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~ 231 (344) .-.. ..... ................|+.|+++...|..... .+-.++++|..|..|.+-..- +..+.-... 
T Consensus 239 il~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~d~i~~~~~~l~~~~~--~~a~~v~n~~~~~~L~~lkd~-~G~~l~~~~ 315 (401) T protein:vir:44 239 FLAYESTEESDKARAFGKLQHIVSGEATAVTADAIIKLIYTLRKAHR--TGAKFMMNNNSLFAIRLLKDT-EGNYLWRPG 315 (401) T ss_pred eeccccccccccccccccccccccccccccCHHHHHHHHHhcchhhh--cCCEEEEcHHHHHHHHHhhcc-CCceeecCC Confidence 0000 00000 00000000011112237778888877766544 234578999999988643221 122322223 Q ss_pred cccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeec Q lcl|NC_015719. 232 PERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARR 311 (344) Q Consensus 232 ~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~ 311 (344) +.+|..++++|.+|+.++++|........+ .-|++. .+...+..+.++++ ++ T Consensus 316 ~~~g~~~~l~G~PVv~~~~~p~~~~~~~~i---------------~~Gd~~-----------~~~~i~~~~~~~~~--~~ 367 (401) T protein:vir:44 316 LELGQPSSLAGYGIAENEQMPDIAADAKAI---------------AFGNFK-----------RGYTIVDRIGTRIL--RD 367 (401) T ss_pred cCCCCCceecceeeEEecCcCCccCCccEE---------------EEeehh-----------ccEEEEEecceEEe--ee Confidence 456777899999999999998643222211 111111 11112222333333 33 Q ss_pred chhhhh--hhhhhhhhcCceeccccEEEEEecCC Q lcl|NC_015719. 
312 AEYQAD--QIIAKYAMGHGGLRPESAGALVFKAG 343 (344) Q Consensus 312 ~~~~~d--~i~~~~~~G~~v~Rp~~~~~l~~~~~ 343 (344) +-...+ .+++..++|+++++|++.+.|+.++- T Consensus 368 ~~~~~~~v~~~a~~r~d~~~~~~~a~~~l~~~aa 401 (401) T protein:vir:44 368 PYTNKPFVGFYTTKRTGGMLVDSQAIKLLKIAAA 401 (401) T ss_pred ccccCCcEEEEEEEEeccEEecccceEEEEeecC Confidence 322223 36777899999999999999988877 No 120 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=99.26 E-value=9e-13 Score=86.61 Aligned_cols=278 Identities=10% Similarity=0.070 Sum_probs=155.3 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec--CcceeeeeeCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI--GRTKAAYLQPGES 78 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i--G~~t~~~~~~g~~ 78 (344) ....... .+......+++--.+..+.|..++.+..+..+.++++++..++.++ +.+++.. +...+.....+.. T Consensus 101 ~~~~~~~----~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~E~~~ 175 (394) T protein:vir:10 101 HSHGKVI----DNAAGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTP-KGTYPILKRATDRFSSVAELAE 175 (394) T ss_pred hccchhh----hhhhcccccccCceeccHHHHHHHHHHHHhhhhhhhhceeeeccCC-ceEEEEEecCCCcccccccccc Confidence 0000000 0000111122222356799999999999999999999998877543 4555544 4445555555555 Q ss_pred CCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccc Q lcl|NC_015719. 79 LDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGL 158 (344) Q Consensus 79 ~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~ 158 (344) .+.. ..++..++++.+-+.. .-..|.+-=-.++.+|+.+.+.++.++++++..|+.|+.... 
T Consensus 176 ~~~~-~~~~~~~v~l~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g---------------- 237 (394) T protein:vir:10 176 NPAL-AEPEFEQVDWSVSTYR-GAIPLSEEAIADSAVDLTSLVGQSINEKSVNTYNAMIAPVLQ---------------- 237 (394) T ss_pred cccc-ccccceeEEeeeeeeE-eeehhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhccc---------------- Confidence 5432 1245566666664443 224455422234678999999999999999999998863221 Q ss_pred cCceeeecccccccccchhhHHHHHHHHHHHHH-HHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcccc----ccccc Q lcl|NC_015719. 159 GKPSLLEVGAKADLTDPVKLGQAVIAQLTIARA-ALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAA----LIDPE 233 (344) Q Consensus 159 ~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~-~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~----~~~~~ 233 (344) .+.... ..+ ...++.|.++.. .++.+. .-.+|++|..|..|.+-..- +..|.- ..... T Consensus 238 -~~~~~~--~~~---------~~~~d~l~~~~~~~~~~~~----~a~~vmn~~~~~~l~~lkd~-~G~~i~~~~~~~~~~ 300 (394) T protein:vir:10 238 -SFTAKA--TTT---------DTLVDSLKHILNVDLDPAY----SRALVVTQSLFNTLDTLKDK-NGRYLLHDASDSITD 300 (394) T ss_pred -cccccc--ccc---------cccHHHHHHHHHhhhhhhc----cCEEEecHHHHHHHHHhhcc-CCCeeeecccccccc Confidence 111100 011 111444555432 233221 24688999999998753221 122211 11122 Q ss_pred cceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecch Q lcl|NC_015719. 234 RGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAE 313 (344) Q Consensus 234 ~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~ 313 (344) .|.-++++|.+|+.+++...+..++. .+..-+++ ++++..+...+++++..++ . T Consensus 301 ~~~~~~L~G~PV~~~~~~~~~~~~~~--------------~~i~~gd~-----------s~~~~~~~~~~~~v~~~~~-~ 354 (394) T protein:vir:10 301 GTAKGTVLGVPVYVVGDALLGSAAGD--------------QKAFVGDL-----------KRGVLFADRQQVTLAWEDS-K 354 (394) T ss_pred CCcccccccceeEEecccccCCCCCc--------------eEEEEeec-----------cccEEEEeecceEEEEecc-c Confidence 34446899999998776432221110 00111111 1122233345566665554 4 Q ss_pred hhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
314 YQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 314 ~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) .|...+++.+++++++++|++.+.++.+..+ T Consensus 355 ~~~~~~~~~~r~d~~~~~~~ai~~~~~~~~~ 385 (394) T protein:vir:10 355 IYGRYLGAAFRFGVKQADSNAGYFVTNTDAA 385 (394) T ss_pred ccceeEEEEEEeccEEeccccEEEEEeeccc Confidence 4556689999999999999999999998888 No 121 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=99.25 E-value=4.6e-13 Score=88.20 Aligned_cols=283 Identities=12% Similarity=0.062 Sum_probs=165.8 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhc--CCce-eeec-----ccccEEEEeecCcc--ee Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTA--NRHM-QRQI-----SSGKSAQFPVIGRT--KA 70 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~--~~~~-~~~i-----~~G~tv~i~~iG~~--t~ 70 (344) ||. |+.. | -+-.|+|...|.+.+.+.+.|. +.+. .-++ .+|+++.||..+.. .. T Consensus 1 MA~--------T~ls------d--~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~Gd~ 64 (351) T protein:vir:15 1 MAE--------THLS------D--LIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLTGDP 64 (351) T ss_pred CCc--------eeee------e--eechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCCCcc Confidence 884 2221 2 1456999999999888877663 2222 1122 36999999999864 56 Q ss_pred eeeeCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|NC_015719. 71 AYLQPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADG 150 (344) Q Consensus 71 ~~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~ 150 (344) .++..++.++. +.+...+..-+| ...-..+.+.|+...-+-.|++.++.++.+...++..+..+|..|.+....... 
T Consensus 65 ~~~~~~~~i~~--~kitt~~~~a~i-~~~~kg~~~tD~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~l~gv~~~~~~ 141 (351) T protein:vir:15 65 DNWTDSDDIDV--NNLTSGKQQGIK-FYQTKAYGYTDLGTMISGAPVQETIGNRFAAFWQRADQKTLLSVLKGVMGVTKI 141 (351) T ss_pred cccCCCcccch--heecccceeEEE-EeeccceehhhhhHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhchhh Confidence 78888888764 467777776666 344456889998888888899999999999999999999988776543221111 Q ss_pred cccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcccccc Q lcl|NC_015719. 151 VNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALI 230 (344) Q Consensus 151 ~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~ 230 (344) ..+...+....+..+ ...-++.|.+|..+|-+..-. .-..+++.|..|..|.+...+....+.. T Consensus 142 --------~~~~~~d~t~~~~~~-----~~is~~~l~~A~~~~GD~~~~-~~~~ivmhS~v~~~L~~~~li~~~~~s~-- 205 (351) T protein:vir:15 142 --------ANSKVYDQTKVSPSE-----PMFGAKGFTGAIGLMGDLQDT-AFGAIAVNSATYSLMKVQGLIETIQPQN-- 205 (351) T ss_pred --------cccceeccccccccc-----cccCHHHHHHHHHHhcccccc-ceEEEEEChHHHHHHHhhhhhhhccccc-- Confidence 111112211111111 111256788888888654221 1367889999999998764322222211 Q ss_pred ccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeee Q lcl|NC_015719. 231 DPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERAR 310 (344) Q Consensus 231 ~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~ 310 (344) .++.|+.+.|.+|+.+..+|....+.. . .....+++-+.|++..+.. +.+|..| T Consensus 206 --~~~~i~t~~G~~VivdD~~p~~~~~~~-----------~------------~~ytsyl~~~GAi~~~~~~-~~ve~~r 259 (351) T protein:vir:15 206 --GATPFEAYNGLRIVLDDDIEIDLTDKT-----------K------------PVSTSYIFAPGAVRYSTNM-RSTETKY 259 (351) T ss_pred --cCcccceecceEEEEcCCCccccCCCC-----------C------------ceeEEEEEecceeeeecCC-cCcceee Confidence 245789999999999999986432211 0 0112244555666655544 3577777 Q ss_pred cchhhh--hhhhh-----hhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
311 RAEYQA--DQIIA-----KYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 311 ~~~~~~--d~i~~-----~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) |+...+ |.+.. ++.+|.+--.+.......-+.-+ T Consensus 260 d~~~~~g~d~l~~r~~~~~hp~G~s~~~~~~~~~~~sPt~~ 300 (351) T protein:vir:15 260 DPLINGGQDVIVQKRVGTIHVAGTSIKASFSPSKASFPTID 300 (351) T ss_pred cccCCCCceEEEEeeeeeeeeeeeeecccccccCcCCcChH Confidence 765432 33322 33334333222111101111111 No 122 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=99.25 E-value=1.1e-12 Score=86.12 Aligned_cols=284 Identities=11% Similarity=0.060 Sum_probs=157.1 Q ss_pred CCCc-ccc----ccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccc-cEEEEeecCcc-eeeee Q lcl|NC_015719. 1 MANM-QGG----QQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSG-KSAQFPVIGRT-KAAYL 73 (344) Q Consensus 1 ma~~-~~~----~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G-~tv~i~~iG~~-t~~~~ 73 (344) +.+. ..+ .....+.-.....++--.+..+.|..++.+..+..+.++++++...+.++ ..+.+++.... ....+ T Consensus 97 ~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 176 (408) T protein:vir:74 97 FVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSSGSRVYEKWTDVTPLKAM 176 (408) T ss_pred HHHHHhcchhhhhhhhhhhhcccccCCCceeechhHhhHHHHHHhhhcchhhhcceeeccCCcceEEEEeecCCcccccc Confidence 0000 000 00001111111111112356799999999999999999999988887653 24556654332 22233 Q ss_pred -eCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|NC_015719. 74 -QPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVN 152 (344) Q Consensus 74 -~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~ 152 (344) ..|+.++.. 
..++..++++...+.- .-..|.+-=-.++.+|+.+.+.++.+++|++..|+.|+.- T Consensus 177 v~E~~~~~~~-~~~~~~~i~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~G------------ 242 (408) T protein:vir:74 177 DEEDGKIPDL-DNPRLTIIKYLIKRYA-GIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAA------------ 242 (408) T ss_pred cccccccccc-cccceeeEEeeeeeEE-eeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhc------------ Confidence 234454432 1245566666665543 2245554222346779999999999999999999988631 Q ss_pred cccccccCceeeecccccccccchhhHHHHHHHHHHHH-HHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccc Q lcl|NC_015719. 153 ENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIAR-AALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALID 231 (344) Q Consensus 153 ~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~-~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~ 231 (344) .|+....++.. -++.|.++. ..|+....+ +-.+|++|..|..|..-.. .+..|.-... T Consensus 243 -------~G~~~~~~~~~-----------~~~~i~~~~~~~l~~~~~~--~a~~v~n~~~~~~l~~lkd-~~G~~l~~~~ 301 (408) T protein:vir:74 243 -------MGTVPKKPTIA-----------NFDDVITMINTSVDPAIIA--TSSLLTNQSGLNKLALVKT-AEGKYLLEPD 301 (408) T ss_pred -------ccccccccccc-----------cHHHHHHHHHHhhhhhhcC--CCEEEEcHHHHHHHHHhhc-CCCceEeccC Confidence 01100000000 144555543 355555543 3357789999999875321 2233332233 Q ss_pred cccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEec-HHHHhhhhhheeeeeeee Q lcl|NC_015719. 232 PERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQH-RSAVGTVKLKDLALERAR 310 (344) Q Consensus 232 ~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~-~~Av~~~~~~~~~~e~~~ 310 (344) +..|.-+.++|.+|+.+++.+.+..+.. .. +.++.. 
++++..+..+.++++..+ T Consensus 302 ~~~~~~~~l~G~pV~~~~~~~~~~~~~~-------------~~------------~i~~gd~~~~~~~~~~~~~~i~~~~ 356 (408) T protein:vir:74 302 PTKPNSYLIKGKQVIVVADRWLPNSGST-------------VY------------PLYYGDMSQAITLFDRENMSLLPTN 356 (408) T ss_pred cCCCCCceecceeeEEecCcccccccCC-------------cc------------eEEEEehhccEEEEEecceEEEEec Confidence 4566667999999998876433221110 00 011111 222333334445565554 Q ss_pred cc----hhhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 311 RA----EYQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 311 ~~----~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) .. .+..-.+++.++++.++++|++.+.+++++.+ T Consensus 357 ~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~ 394 (408) T protein:vir:74 357 IGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFTAIA 394 (408) T ss_pred cccchhhcceeeEEEEEeeCcEEecccceEEEEeeccc Confidence 32 23334578889999999999999999998877 No 123 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=99.24 E-value=2.3e-12 Score=84.37 Aligned_cols=291 Identities=10% Similarity=0.028 Sum_probs=155.3 Q ss_pred CCC------ccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec-Ccceeeee Q lcl|NC_015719. 1 MAN------MQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI-GRTKAAYL 73 (344) Q Consensus 1 ma~------~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~ 73 (344) |+- -.-..+. +....-..++.-.+..+.+..++.+..++.+.++.+.+...+. ++..+||+. 
+.+.+..+ T Consensus 1 ~~~~~~r~~~~~~~~e--~~a~~~~~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~-~~~~~~p~~~~~~~a~~v 77 (326) T protein:vir:42 1 MAVNPDRTTPFLGVND--PKVAQTGDSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKIPMG-TTGQKIPHWTGDVSASWI 77 (326) T ss_pred CCCCccchhhhcCcch--hhheeccccCCcceechhhHHHHHHHHHhcchhhhhcceeecc-CCceEEEEEeCCcceEEe Confidence 211 0000000 0000001111112567889999999999999998888776654 556778775 44566777 Q ss_pred eCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|NC_015719. 74 QPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNE 153 (344) Q Consensus 74 ~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~ 153 (344) ..|+.++.. .++..++++...+.. .-+.|.+-=-.++.+|+.+.+.++.++++++..|+.++.- ... . T Consensus 78 ~Eg~~~~~~--~~~f~~i~~~~~k~~-~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l~G----~gs-----~ 145 (326) T protein:vir:42 78 GEGDMKPIT--KGNMTSQTIAPHKIA-TIFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAING----TDS-----P 145 (326) T ss_pred cCCcccccc--ccceeEEEEeeEEEE-EeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcc----cCC-----C Confidence 778887754 456677777765543 3455655223346789999999999999999999998631 110 0 Q ss_pred ccccccCce---eeecccccccccchhhHHHHHHHH--HHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcccc Q lcl|NC_015719. 154 NIAGLGKPS---LLEVGAKADLTDPVKLGQAVIAQL--TIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAA 228 (344) Q Consensus 154 ~~~~~~~~~---~i~~~~~~~~t~~~~~~~~i~~~l--~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~ 228 (344) .+.+..... .......+..+ +...+..+ ..+...+. 
+.....-.+|++|..+..|.+-..- +..+.- T Consensus 146 ~p~gi~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~--~~~~~~a~~v~n~~~~~~L~~lkd~-~G~~l~ 217 (326) T protein:vir:42 146 FPTFLAQTTKEVSLVDPDGTGSN-----ADLTVYDAVAVNALSLLV--NAGKKWTHTLLDDITEPILNGAKDK-SGRPLF 217 (326) T ss_pred ccccccccccccceeeccccccc-----ccchhHHHHHHHHHhhhh--hhccCccEEEEeHHHHHHHHHhhcc-CCceee Confidence 111110000 00000111111 11111211 12222222 2223344677899999998752221 112211 Q ss_pred c-----cccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhhe Q lcl|NC_015719. 229 L-----IDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKD 303 (344) Q Consensus 229 ~-----~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~ 303 (344) . ........+.+.|++|+.++.+|.+... .+.+++. .. ++ +..+. T Consensus 218 ~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~------------------~~~Gd~s--~~--~~--------~~~~~ 267 (326) T protein:vir:42 218 IESTYTEENSPFRLGRIVARPTILSDHVASGTVV------------------GYQGDFR--QL--VW--------GQVGG 267 (326) T ss_pred ccccccCccccccCceeeeeeEEEcCCCCCCceE------------------EEEeecc--eE--EE--------EEecc Confidence 1 1122223458999999999999853211 0111111 11 11 11223 Q ss_pred eeeeeeecch--------------hhhh--hhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 304 LALERARRAE--------------YQAD--QIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 304 ~~~e~~~~~~--------------~~~d--~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ++++..++.. 
+..| .++..++++.+++||++.+.|+-+..+ T Consensus 268 ~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~~d~~v~~~~a~~~l~~~~~~ 324 (326) T protein:vir:42 268 LSFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYAFHCNDKDAFVKLTNVDAT 324 (326) T ss_pred eEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEecccceEEEeecccc Confidence 3444333321 2223 357899999999999999999888877 No 124 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=99.23 E-value=1.3e-12 Score=85.69 Aligned_cols=281 Identities=8% Similarity=0.000 Sum_probs=160.5 Q ss_pred CC-Ccccc-----cc-ccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccc-cEEEEee-cCcceee Q lcl|NC_015719. 1 MA-NMQGG-----QQ-LGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSG-KSAQFPV-IGRTKAA 71 (344) Q Consensus 1 ma-~~~~~-----~~-~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G-~tv~i~~-iG~~t~~ 71 (344) |- ..... .. ...|...+...++--.+..+.|..++.+.....+.++++.+...+.++ ..+.+++ .+.+.+. T Consensus 102 ~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~ 181 (397) T protein:vir:12 102 LRGKRLTDEERDLLDSPEFRAMSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQYVTVEPVTTRSGTRLLEKNADMVPFS 181 (397) T ss_pred HhccCCcHHHHHHHhhhhhhhccccccccCcccCchhHHHHHHHhhhhhhhHHhhcceeeccCCceeEEEEEecCCccee Confidence 00 00000 00 000111111112222356799999999999999999999888877642 2344554 4556677 Q ss_pred eeeCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|NC_015719. 72 YLQPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGV 151 (344) Q Consensus 72 ~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~ 151 (344) .+..|..++... .++.+++++...+... -..|.+-=-..+.+|+.+.+.++.+++|++..|..|+... 
T Consensus 182 ~v~Eg~~~~~~~-~~~~~~v~~~~~k~~~-~~~is~e~l~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~---------- 249 (397) T protein:vir:12 182 PVEELGNLPEID-QPRFTKVSYSIIDYGG-IMTLSNSMLNDSDQAIMTYVAKWFAKKSVVTRNNLILAAI---------- 249 (397) T ss_pred eecccccccccc-cccceeEEeeheeeEe-eehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhcc---------- Confidence 777777765432 2355666666644432 2445442223356789999999999999999999886311 Q ss_pred ccccccccCceeeecccccccccchhhHHHHHHHHHHHHH-HHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcccccc Q lcl|NC_015719. 152 NENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARA-ALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALI 230 (344) Q Consensus 152 ~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~-~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~ 230 (344) +.+. +.+. ..++.|.++.. .|+...- .+-.++++|..|..|.+-..- +..|.-.. T Consensus 250 -------g~~~--~~g~------------~~~~~i~~~~~~~l~~~~~--~~a~~~~n~~~~~~L~~lkd~-~G~~l~~~ 305 (397) T protein:vir:12 250 -------ASLK--KVDI------------DGLDGIKKALNVTLDPMVA--PGSIVLTNQDGYDWLDTLKDG-TGRYLLQP 305 (397) T ss_pred -------cccc--cccc------------ccHHHHHHHHhhccchhhh--CCCEEEEcHHHHHHHHHhhcc-CCceeecc Confidence 0010 0000 01455555442 4443332 334678999999988653211 22333233 Q ss_pred ccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEec-HHHHhhhhhheeeeeee Q lcl|NC_015719. 231 DPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQH-RSAVGTVKLKDLALERA 309 (344) Q Consensus 231 ~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~-~~Av~~~~~~~~~~e~~ 309 (344) .+.+|.-+.++|.+|+.+++...+...+ +...++.. +.++..+..+.++++.. T Consensus 306 ~~~~g~~~~l~G~pv~~~~~~~~~~~~~--------------------------~~~~~~gd~~~~~~~~~~~~~~i~~~ 359 (397) T protein:vir:12 306 DPTNPTKKLLDGRPVVPFTNRVLKTQKG--------------------------KAPLIIGNLKEAIVLFDREQQSIAST 359 (397) T ss_pred cccCCCCccccceeeEEecccccccCCC--------------------------ccEEEEEehhceEEEEeecceEEEEe Confidence 4567777899999999887643221110 00011111 12222333445566655 Q ss_pred ecch--h--hhhhhhhhhhhcCceeccccEEEEEecCC Q lcl|NC_015719. 
310 RRAE--Y--QADQIIAKYAMGHGGLRPESAGALVFKAG 343 (344) Q Consensus 310 ~~~~--~--~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~ 343 (344) +... + -...+++.++++.++++|++.+.+.+++. T Consensus 360 ~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 360 DTGAGAFETNSTKVRGIEREDVRKWDEDAVVFGQITVE 397 (397) T ss_pred ccccchhhcCceEEEEEEeeccEEecccceEEEEEeeC Confidence 4332 2 23468899999999999999999999988 No 125 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=99.23 E-value=8.6e-13 Score=86.74 Aligned_cols=289 Identities=10% Similarity=0.053 Sum_probs=150.3 Q ss_pred CCCccccccccccc-cccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec-Ccceeeee---eC Q lcl|NC_015719. 1 MANMQGGQQLGTNQ-GKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI-GRTKAAYL---QP 75 (344) Q Consensus 1 ma~~~~~~~~~~~~-g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~---~~ 75 (344) |..... ..+. ..+..+++--.|..+.|+.+|.+..+..+.++.+.+..... | .+.+|.. +..+..-. .. T Consensus 131 l~~~~~----~~e~~a~~~~t~~GG~lvP~~~~~~Ii~~l~~~~~i~~~~~~~~~~-~-~~~~p~~~~~~~a~~~~~~~e 204 (434) T protein:vir:62 131 IVGNID----EKEARALGLVTGNGSVTIPDFLSKEIITYAQEENFLRRLGTGVKTK-E-NIKYPVLVKKAEAQGHKNERT 204 (434) T ss_pred hccccc----hhhhhhhcccccccceecchhhHHHHHHhhhhhhhhhhhcceeccC-C-ceEEEEEecCCcccceecccc Confidence 111000 0000 00011112123567999999999999999999888775543 3 3667665 22222222 22 Q ss_pred CCCCCCCcCCcccceEEEEeeeeeeec-eeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|NC_015719. 76 GESLDDKRKDIKHTEKTINIDGLLTAD-VLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNEN 154 (344) Q Consensus 76 g~~~~~~~~~~~~~~~~l~iD~~~~~~-~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~ 154 (344) |...+. .+++..++++.+ .++.. +.|.+-=-.++.+|+.+.+.++.+++|++..|+.++. +... +.. 
T Consensus 205 ~~~~~~--~~~~f~~v~~~~--~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~d~~~l~----G~G~----~~~ 272 (434) T protein:vir:62 205 NNEMPE--TDIEFDEIELSP--TEFDALATVTKKLLARTGLPIEQIVMDELKKAYVRKETQYMVN----GDEA----NNI 272 (434) T ss_pred cccccc--cccceeeEEeeh--eeeEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhc----cCCC----Ccc Confidence 333332 233445555554 33333 3344311223567999999999999999999998862 1110 011 Q ss_pred cccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccc--ccccc Q lcl|NC_015719. 155 IAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYA--ALIDP 232 (344) Q Consensus 155 ~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~--~~~~~ 232 (344) +.+......+.. . ......++.|+++...|+....+ .. .+|++|..|..|.+-..- +..|. ..... T Consensus 273 ~~g~~~~~~~~~----~-----~~~~~~~d~l~~l~~~l~~~~~~-~a-~~v~n~~~~~~L~~lkd~-~G~~l~~~~~~~ 340 (434) T protein:vir:62 273 NDGALAKKAVEF----K-----TDEKNLYDALVKMKNTPVKEVRK-KA-RWVLNTAALTKIETMKTD-DGFPLLRPFNQA 340 (434) T ss_pred ccceeecccccc----c-----ccccchhhHHHHHHhhcchhhhc-CC-EEEEcHHHHHHHHHhhcc-CCCEeeccCCCc Confidence 111111111111 0 11122478888888888776554 22 457899999988652211 22332 12223 Q ss_pred ccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecc Q lcl|NC_015719. 233 ERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRA 312 (344) Q Consensus 233 ~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~ 312 (344) ..|.-..++|.+|+.++.+|.+..+.... .+-|++.. ++++-+ ...++++...+. T Consensus 341 ~~g~~~tl~G~pV~~~~~~~~~~~~~~~~--------------i~~Gdfs~----~~i~~~-------~g~~~i~~~~~~ 395 (434) T protein:vir:62 341 EGGIGYTLLGFPVEEEDAIDIPDSPDTPV--------------FYFGDFSK----FYIQDV-------IGSLEVQKLVEL 395 (434) T ss_pred cCCCCceecceeeEEecCccCccCCCceE--------------EEEeeccc----eEEEEe-------eceeEEEeehhh Confidence 45666789999999999998654322110 11122211 011111 012334444332 Q ss_pred hhhhhh--hhhhhhhcCceec-cccEEEEEecCCC Q lcl|NC_015719. 
313 EYQADQ--IIAKYAMGHGGLR-PESAGALVFKAGA 344 (344) Q Consensus 313 ~~~~d~--i~~~~~~G~~v~R-p~~~~~l~~~~~a 344 (344) -+--+. +++..++.+++++ |++..++.+..++ T Consensus 396 ~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~~ 430 (434) T protein:vir:62 396 FSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYVLKA 430 (434) T ss_pred hcccCceEEEEEeeecceeecCcccceEEEEEecc Confidence 211233 6888899999775 9999988666443 No 126 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=99.22 E-value=4.8e-12 Score=82.62 Aligned_cols=295 Identities=14% Similarity=0.107 Sum_probs=153.1 Q ss_pred CCCcccc-----------------ccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCC-ceeeecccccEEEE Q lcl|NC_015719. 1 MANMQGG-----------------QQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANR-HMQRQISSGKSAQF 62 (344) Q Consensus 1 ma~~~~~-----------------~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~-~~~~~i~~G~tv~i 62 (344) ++...+. .+. ...+....+| .+..+.+..++.+..+..+.++.+ .+..+..+| .+++ T Consensus 105 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~t~~~gg---~~vP~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~-~~~~ 179 (435) T protein:vir:14 105 LAAARGDAQLASKLAIERGFGEEVAMS-LNTLSPGAGG---VLVPENLSSEVIELLRPKSVVRKLGARTLPLSNG-NITI 179 (435) T ss_pred HHhhcchhhHHHHHHHhhhhhhhhhhh-cccCCcCCCc---cccchhHHHHHHHHHhhhchhhhhcceeeecCCC-ceEE Confidence 0000000 000 0011111111 245688889998888877887765 444444445 4788 Q ss_pred eec-CcceeeeeeCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHh--ChhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015719. 63 PVI-GRTKAAYLQPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMN--HYDVRSEYTSQIGESLAMAADGAVLA 139 (344) Q Consensus 63 ~~i-G~~t~~~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~--~~d~~~~~~~~~~~aLa~~~D~~i~~ 139 (344) |++ +.+.+.-+..|..++.. +++..++++.+-+.. .-+.|.+-=-.++ ..++.+.+..+.+++|++..|+.++. 
T Consensus 180 p~~~~~~~a~~v~E~~~~~~~--~~~f~~i~~~~~k~~-~~~~iS~ell~ds~~~~~l~~~i~~~l~~ai~~~~d~a~l~ 256 (435) T protein:vir:14 180 PRLKGGAIVGYIGADTDIPTT--QQQFDDLKLTAKKMA-ALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIR 256 (435) T ss_pred EEEeCCcceeeeccCcccccc--ccceeEEEeeeEEEE-EeehhhHHHHHhhccCHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 887 55555556666666543 345566666654443 2344543111123 34588889999999999999999862 Q ss_pred HHHHhhhcccccccccccccCceeee-cccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhcc Q lcl|NC_015719. 140 ELAGLINLADGVNENIAGLGKPSLLE-VGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAA 218 (344) Q Consensus 140 ~~~~~a~~~~~~~~~~~~~~~~~~i~-~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~ 218 (344) - .. ....|.|........ +.... .......++..+.++...|...+.-.....+|++|..|..|.+- T Consensus 257 G----~G----~~~~p~Gi~~~~~~~~~~~~~----~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~v~n~~~~~~L~~l 324 (435) T protein:vir:14 257 D----DG----TANTPKGLRFWALPSNVITAS----DASTLQKIETDLGKVILALENADANLTQPGWIMAPRTFRFLEGL 324 (435) T ss_pred c----CC----CCccccceeecccccceeccc----cccchhhHHHHHHHHHHHhhhccccccCCEEEEcHHHHHHHHHh Confidence 1 10 111122221111000 00011 11122334556666666666665433445678999999988653 Q ss_pred chhhhhccccccccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhh Q lcl|NC_015719. 219 LMPNAANYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGT 298 (344) Q Consensus 219 ~~~~~~~~~~~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~ 298 (344) .. .+..|.=. .... +.++|.+|+.++.+|......... ...+-+++. . +.. 
T Consensus 325 kd-~~G~~l~~-~~~~---g~l~G~Pv~~~~~~p~~~~~~~~~------------~~i~~gd~s--~----------~~i 375 (435) T protein:vir:14 325 RD-GNGNKVYP-ELAN---GMLKGYPVGKTTQVPINLGETGKE------------SEIYFTDFG--D----------VFI 375 (435) T ss_pred hc-cCCceecc-CCCC---CeeecceeEeeccccccccCCCcc------------ceEEEeecc--c----------EEE Confidence 22 12222110 1223 378999999999999753221100 001112211 1 111 Q ss_pred hhhheeeeeeeecch-----------hh--hhhhhhhhhhcCceeccccEEEEEec-CCC Q lcl|NC_015719. 299 VKLKDLALERARRAE-----------YQ--ADQIIAKYAMGHGGLRPESAGALVFK-AGA 344 (344) Q Consensus 299 ~~~~~~~~e~~~~~~-----------~~--~d~i~~~~~~G~~v~Rp~~~~~l~~~-~~a 344 (344) +...+++++..++.. ++ .-.++..++++.+++||++.+.|.-. -|| T Consensus 376 ~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:14 376 GEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLAGVAWGA 435 (435) T ss_pred EEecccEEEEeccccccccccchhhhhhcChhheeeeeeeCceeecccceEEEecCCCCC Confidence 223344555544321 11 25678899999999999998877643 344 No 127 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=99.22 E-value=7.2e-12 Score=81.66 Aligned_cols=296 Identities=11% Similarity=0.049 Sum_probs=148.7 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCC-ceeeecccccEEEEeec-CcceeeeeeCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANR-HMQRQISSGKSAQFPVI-GRTKAAYLQPGES 78 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~-~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~ 78 (344) |+..........+.-.. .++.--.+..+.+..++.+..+..++++.+ ++.-+..+|+ ++||++ +.+++..+..|+. 
T Consensus 113 ~~~~~~~~~~~~~~~~~-~~~~gg~liP~~~~~~ii~~l~~~~~l~~~~~~~~~~~~g~-~~~p~~~~~~~a~~v~Eg~~ 190 (428) T protein:vir:10 113 FASDELNDQSVSMAIST-AAGSGGVLIPQNIHSEVIELLRDRTIVRKLGARSIPLPNGN-MSLPRLAGGATASYTGENQD 190 (428) T ss_pred HhhhhhhhhhHhhhhcc-cccCCccccchhHHHHHHHHHhhhchhhhhcceeeecCCcc-eEEEEEeCCcceeeeccCcc Confidence 21111110000111000 001111235678888888888888888877 3332333344 778876 4445666666777 Q ss_pred CCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccc Q lcl|NC_015719. 79 LDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGL 158 (344) Q Consensus 79 ~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~ 158 (344) ++.+ .+..+++++...+.. .-+.|.+-=-.++.+++.+.+.++.+++|++..|+.++. +.. .+..|.|. T Consensus 191 ~~~~--~~~f~~i~~~~~k~~-~~v~is~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~----G~G----~~~~p~Gi 259 (428) T protein:vir:10 191 AKVS--EARFDDVKLTAKTMI-AMVPISNALIGRAGFNVEQLVLQDILTAISVREDKAFMR----DDG----TGDTPIGM 259 (428) T ss_pred cccc--ccceeeEEeeeEEEE-EeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhc----cCC----CCcccccc Confidence 7654 356667676664443 335565522234678999999999999999999998862 111 11122221 Q ss_pred cCce----eeecccccccccchhhHHHHHHHHHHHHHHHhh-cCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccc Q lcl|NC_015719. 159 GKPS----LLEVGAKADLTDPVKLGQAVIAQLTIARAALTK-NYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPE 233 (344) Q Consensus 159 ~~~~----~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~-~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~ 233 (344) -... .+...... ....... .+...++...+.. .+.....-..+++|..|..|..-.. .+..|.-. ... 
T Consensus 260 ~~~~~~~~~~~~~~~~----~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd-~~G~~i~~-~~~ 332 (428) T protein:vir:10 260 KARATQWNRLLPWAAD----AAVNLDT-IDTYLDSIILMSMDGNSNMISSGWGMSNRTYMKLFGLRD-GNGNKVYP-EMA 332 (428) T ss_pred cccccccccccccccc----ccccHHH-HHHHHHHHHHhhhccccccccCEEEEcHHHHHHHHHhhc-cCCceecc-CCC Confidence 1110 11000000 0111111 1111222111111 1111223345679999988865322 12222110 122 Q ss_pred cceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecch Q lcl|NC_015719. 234 RGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAE 313 (344) Q Consensus 234 ~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~ 313 (344) . +.++|.+|+.++.+|....... + ....+-+++ +-+..+....++++..++.. T Consensus 333 ~---g~l~G~pv~~~~~~p~~~~~~~----------~--~~~i~~gd~------------s~~~i~~~~~i~i~~~~~~~ 385 (428) T protein:vir:10 333 Q---GMLKGYPIQRTSAIPANLGEGG----------K--ESEIYFADF------------NDVVIGEDGNMKVDFSKEAS 385 (428) T ss_pred C---CeeeceeeEEeccccccccCCC----------c--cceEEEEec------------ceEEEEEecceEEEeecccc Confidence 3 3699999999999997532211 0 001111121 11122233445555555432 Q ss_pred -----------hhh--hhhhhhhhhcCceeccccEEEEEecCC Q lcl|NC_015719. 314 -----------YQA--DQIIAKYAMGHGGLRPESAGALVFKAG 343 (344) Q Consensus 314 -----------~~~--d~i~~~~~~G~~v~Rp~~~~~l~~~~~ 343 (344) ++. 
-.++...+++.++.||++.++++.-.= T Consensus 386 ~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 386 YIDTDGKLVSAFSRNQSLIRVVTEHDIGFRHPEGLVLGTGVLF 428 (428) T ss_pred cccccccccchhhcchhheeeeeeeCceeeccceEEEEeccCC Confidence 121 356888999999999999998876655 No 128 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=99.21 E-value=3e-12 Score=83.75 Aligned_cols=281 Identities=12% Similarity=0.049 Sum_probs=160.1 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccc-cEEEEeecC-cceeeeeeCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSG-KSAQFPVIG-RTKAAYLQPGES 78 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G-~tv~i~~iG-~~t~~~~~~g~~ 78 (344) -....-+.+.+|- .+--.+..+.|..++.+..+..+.++++++...+.++ -++.++..+ .+.+..+..|+. T Consensus 84 ~~~~~~a~~~~t~-------~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~ 156 (371) T protein:vir:81 84 RTRFRNAMSEGSN-------QDGGYTVPQDIQTRINELRESKDALQNLITVEPVTTLSGSRVFKKRSQQTGFVEVAEGAA 156 (371) T ss_pred HHHHHHhhccCCC-------ccCceeecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCcceeeeccccc Confidence 0000000011111 1111356789999999999999999999988887643 234455543 346667777877 Q ss_pred CCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccc Q lcl|NC_015719. 79 LDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGL 158 (344) Q Consensus 79 ~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~ 158 (344) ++... .++.+++++...+.. ..+.|.+-=-..+.+|+.+.+.++.++++++..|+.++.... 
T Consensus 157 ~~~~~-~~~f~~i~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~a~~~~~~~~i~~g~g---------------- 218 (371) T protein:vir:81 157 IGEKA-TPQFTLLQYQVKKYA-GFFRVTNELLNDSTEAIVNTLVRWIGDESRVTRNGLIINVLN---------------- 218 (371) T ss_pred ccccc-ccceeeEEeeeeEEE-EeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc---------------- Confidence 65432 235566666665543 224555421223467999999999999999999988863110 Q ss_pred cCceeeecccccccccchhhHHHHHHHHHHHH-HHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcccccccccccee Q lcl|NC_015719. 159 GKPSLLEVGAKADLTDPVKLGQAVIAQLTIAR-AALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSI 237 (344) Q Consensus 159 ~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~-~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V 237 (344) .+. ..+ .. -++.+..+. ..|....- .+-.+|++|..|..|.+-..- +..|.-...+..|.. T Consensus 219 -~~~--~~~----~~--------~~~~i~~~~~~~l~~~~~--~~a~~vmn~~~~~~L~~lkd~-~g~~l~~~~~~~~~~ 280 (371) T protein:vir:81 219 -TKA--KTA----IA--------DLDGLKQIINVQLDPVFR--STSSVIVNQDAFNWLDTLKDQ-NGQYLLQPSISSPTG 280 (371) T ss_pred -ccc--ccc----cc--------cHHHHHHHHHhhcchhhh--cCCEEEEcHHHHHHHHHhhcc-CCCeeeecccCCCCC Confidence 000 000 00 133343332 23433332 234678999999988753221 233332334556777 Q ss_pred EEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecc-hhh- Q lcl|NC_015719. 238 RNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRA-EYQ- 315 (344) Q Consensus 238 g~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~-~~~- 315 (344) ++++|.+|+.++++|.+.......+ .......-|++ ++.+..+....++++..+.. .+| T Consensus 281 ~~l~G~pV~~~~~~~~~~~~~~~~~--------~~~~~i~~Gd~-----------~~~~~~~~~~~~~i~~~~~~~~~f~ 341 (371) T protein:vir:81 281 RQLLGLPVVIVSNKVLANRVDGGTG--------AQFAPIIVGDL-----------KEAVVMFDRQRTEIMSSNVAMDAFE 341 (371) T ss_pred ceecceeEEEecccccCcccccccc--------CCcceEEEEeh-----------hceEEEEeecceEEEEeccccchhh Confidence 8999999999999986543221110 00011111111 11222333444555555433 122 Q ss_pred --hhhhhhhhhhcCceeccccEEEEEecCC Q lcl|NC_015719. 
316 --ADQIIAKYAMGHGGLRPESAGALVFKAG 343 (344) Q Consensus 316 --~d~i~~~~~~G~~v~Rp~~~~~l~~~~~ 343 (344) .-.+++.+++|.++++|++.+.+.++.- T Consensus 342 ~~~v~~~~~~r~d~~~~~~~a~~~~~~~~A 371 (371) T protein:vir:81 342 TDATLWRAIERMDVKMRDDEAFVFGEVQLA 371 (371) T ss_pred cCceEEEEEEeeccEEecccceEEEEEecC Confidence 2467888999999999999999998766 No 129 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=99.21 E-value=2.6e-12 Score=84.05 Aligned_cols=284 Identities=10% Similarity=0.022 Sum_probs=157.4 Q ss_pred CCCcccc-ccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeeccc-ccEEEEeecCcc--eeeeeeCC Q lcl|NC_015719. 1 MANMQGG-QQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISS-GKSAQFPVIGRT--KAAYLQPG 76 (344) Q Consensus 1 ma~~~~~-~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~-G~tv~i~~iG~~--t~~~~~~g 76 (344) +-+.... .....|.-.....++--.+..+.++.++.+..+..+.++++++...+.+ ..++.++..... .......| T Consensus 101 ~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~ 180 (408) T protein:vir:10 101 VRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAED 180 (408) T ss_pred hhcchhhhhhhhhhhhhcccccCCceeccHhHHHHHHHHHHhhchhhhhcceeeccCCcceEEEeeccccccceeeecCc Confidence 1000000 0000111111112222235679999999999999999999998877653 223445544332 33445556 Q ss_pred CCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Q lcl|NC_015719. 77 ESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIA 156 (344) Q Consensus 77 ~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (344) +.++.+. .+...++++..-+.. .-..|.+-=-.++.+|+.+.+.++.++++++..|+.|+.... 
T Consensus 181 ~~~~~~~-~~~~~~i~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g-------------- 244 (408) T protein:vir:10 181 GKIPDLD-NPQLTIIKYLIKRYA-GIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMK-------------- 244 (408) T ss_pred ccccccc-CcceeeEEeeeeeEE-eeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccc-------------- Confidence 6665321 134566666554443 223454422223578999999999999999999998863211 Q ss_pred cccCceeeecccccccccchhhHHHHHHHHHHHH-HHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccc Q lcl|NC_015719. 157 GLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIAR-AALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERG 235 (344) Q Consensus 157 ~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~-~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G 235 (344) .++. .++. ..++.|.++. ..|+...- .+-.++++|..|..|.+-... +..|.-...+.+| T Consensus 245 ---~~~~--~~~~-----------~~~~~l~~~~~~~~~~~~~--~~a~~v~n~~~~~~l~~lkd~-~G~~i~~~~~~~~ 305 (408) T protein:vir:10 245 ---AAPK--KPTI-----------AKFDDVITMINTAVDPAII--ATSSLLTNQSGLNKLALVKTA-EGKYLLEPDPTKP 305 (408) T ss_pred ---cccc--cccc-----------ccHHHHHHHHHHhhhhhhc--cCCEEEEcHHHHHHHHHhhcc-CCceEeccCcCCC Confidence 1110 0000 0145555544 33443332 234678999999998764322 2233322335567 Q ss_pred eeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEec-HHHHhhhhhheeeeeeeecc-h Q lcl|NC_015719. 236 SIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQH-RSAVGTVKLKDLALERARRA-E 313 (344) Q Consensus 236 ~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~-~~Av~~~~~~~~~~e~~~~~-~ 313 (344) ...+++|++|+.+++.+.+..+.. ....++.. ++++..+....++++..+.. . T Consensus 306 ~~~~l~G~PV~~~~~~~~~~~~~~-------------------------~~~i~~gd~~~~~~~~~~~~~~v~~~~~~~~ 360 (408) T protein:vir:10 306 NSYLIKGKQVIVVADRWLPNTGST-------------------------VYPLYYGDMSQAITLFDRENMSLLPTNIGAG 360 (408) T ss_pred CCceecceeeEEecccccCccCCC-------------------------ceEEEEEehhccEEEEEecceEEEEcccccc Confidence 778999999999775433221110 00111111 12233333444556555432 1 Q ss_pred ---hhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
314 ---YQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 314 ---~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +..-.+++.++++.++++|++.+.+++++.+ T Consensus 361 ~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~~~~~ 394 (408) T protein:vir:10 361 AFETDTTKIRVIDRFDVKATDSEALVAGSFSAIA 394 (408) T ss_pred hhhcCceEEEEEEeeccEEeccccEEEEEeeccc Confidence 2234678889999999999999999999887 No 130 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=99.21 E-value=5.9e-12 Score=82.15 Aligned_cols=294 Identities=13% Similarity=0.053 Sum_probs=150.3 Q ss_pred CCCccccc-----cccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCC-ceeeecccccEEEEeec-Ccceeeee Q lcl|NC_015719. 1 MANMQGGQ-----QLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANR-HMQRQISSGKSAQFPVI-GRTKAAYL 73 (344) Q Consensus 1 ma~~~~~~-----~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~-~~~~~i~~G~tv~i~~i-G~~t~~~~ 73 (344) |+.-..+. ...+-.+. --.|..+++.+++.+..+..++++.+ .+.-....|+ +++|+. +.+.+.-. T Consensus 52 ~a~~~~~~~~~~~a~~~~~~~------Gg~lvP~~~~~~ii~~l~~~s~l~~lg~~~v~~~~g~-~~~p~~t~~~~a~wv 124 (366) T protein:vir:57 52 FAATELGDTGLSMAISTAAGS------GGALIPQNMQNEVIELLRDRTVVRILGARSIPLPNGN-LSMPRLSGGATAGYV 124 (366) T ss_pred HHHHhhcchhhhhhccccccC------CccccchhHHHHHHHHHhhhcchhhhceeeeecCCCc-eEEEEEeCCcceeee Confidence 11110000 00111111 11245688999999998888888766 4444444454 777776 55566667 Q ss_pred eCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|NC_015719. 74 QPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNE 153 (344) Q Consensus 74 ~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~ 153 (344) ..|+.++.+ +++.+++++..-+.. .-..|.+-=-.++.+++.+.+.++.++++++..|+.++.- .. ... 
T Consensus 125 ~E~~~~~~s--~~~f~~i~~~~~k~~-~~~~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G----~G----~~~ 193 (366) T protein:vir:57 125 GEGKDVVAT--GATFDDVKLSAKTMI-ALVPVSNQLIGRAGFNVEQLLLGDILSAIATREDKAFLRD----DG----TGD 193 (366) T ss_pred ccCcccccc--ccceeEEEEeeEEEE-EeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhcc----CC----CCc Confidence 778777654 355666666554332 3344544112356789999999999999999999988631 11 111 Q ss_pred ccccccCceeee-cccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcccccccc Q lcl|NC_015719. 154 NIAGLGKPSLLE-VGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDP 232 (344) Q Consensus 154 ~~~~~~~~~~i~-~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~ 232 (344) .+.|........ .......+.. ....+...+..+...+...+.-...-..+++|..|..|.+-..- +..+. + T Consensus 194 ~p~Gi~~~~~~~~~~~~~~~t~~--~~~~~~~~~~~~~~~~~~~~~~~~~a~~vmn~~~~~~L~~lkd~-~G~~l----~ 266 (366) T protein:vir:57 194 TPKGMKAVATAANRLVAWTGTAI--NLTTIDEYLDSLILKHMDSNSNMIRCGWGLSNRTYMTLFGLRDG-NGNKV----Y 266 (366) T ss_pred cccceeeccccccceeecccccc--chhhHHHHHHHHHHhhhccccccccCEEEecHHHHHHHHhhhcc-CCcee----c Confidence 122211000000 0000000010 01111111111112222222222333457999999988753211 11111 1 Q ss_pred ccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecc Q lcl|NC_015719. 233 ERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRA 312 (344) Q Consensus 233 ~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~ 312 (344) ....-+.++|++|+.|+++|....+.. .....+-+++ +-+..+...+++++..++. 
T Consensus 267 ~~~~~g~l~G~Pvv~s~~ip~~~~~~~------------~~~~i~~gdf------------s~~~i~~~~~i~i~~~~ea 322 (366) T protein:vir:57 267 PEMSQGILKGYPIQRTSAIPANLGDDG------------NESEIYFCDF------------NDVVIGEDGMMKVDFSTEA 322 (366) T ss_pred cCCCCCeecceeeEEccccccccccCC------------CccEEEEEec------------ceEEEEEecceEEEEeecc Confidence 111124789999999999997532211 0011111222 1111223344556655543 Q ss_pred h-----------hhh--hhhhhhhhhcCceeccccEEEEEecCC Q lcl|NC_015719. 313 E-----------YQA--DQIIAKYAMGHGGLRPESAGALVFKAG 343 (344) Q Consensus 313 ~-----------~~~--d~i~~~~~~G~~v~Rp~~~~~l~~~~~ 343 (344) . ++. -.++..++++.+++||++.+.|+-..= T Consensus 323 ~~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 323 TYKDADGQLVSAFARNQSLIRVVTEHDIGFRHPEGLVLGTGVIW 366 (366) T ss_pred ccccccccchhhhhcCceeEEeeeeeCcEeeccccEEEEecccC Confidence 2 112 257888899999999999998876555 No 131 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=99.19 E-value=2.3e-12 Score=84.44 Aligned_cols=275 Identities=13% Similarity=0.072 Sum_probs=155.4 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec--CcceeeeeeCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI--GRTKAAYLQPGES 78 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i--G~~t~~~~~~g~~ 78 (344) ......... ..+...+...++--.+..+.|..+|.+..+..+.++++++..++.+|+ .++|.. +..++..+..|.. 
T Consensus 115 ~~~~~~~~~-~~~~~~~~t~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~v~E~~~ 192 (394) T protein:vir:97 115 LMPINETTP-VEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKAS-GKYPVLQRATTKMVTVAELEK 192 (394) T ss_pred HHHHHhhhh-hhhhccccccccccccChHHHHHHHHHHhhhhhhhhhhceeeeccCcc-eEEEEEecCCCccceeccccc Confidence 000000000 000000111112223577899999999888889999999887776543 566654 4445566666666 Q ss_pred CCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccc Q lcl|NC_015719. 79 LDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGL 158 (344) Q Consensus 79 ~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~ 158 (344) .+.. ..+...++++...+.- .-..|.+-=-.++.+|+.+.+..+.+++|++..|..|+..+. T Consensus 193 ~~~~-~~~~~~~v~l~~~k~~-~~i~is~ell~ds~~~~~~~i~~~la~~~~~~~~~~i~~g~~---------------- 254 (394) T protein:vir:97 193 NPAL-AKPDFKDVAWNIDTYR-GAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLK---------------- 254 (394) T ss_pred cccc-ccccceeEEeehhhee-eehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhccc---------------- Confidence 6532 1245566666664432 334454422234567899999999999999999988863210 Q ss_pred cCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccceeE Q lcl|NC_015719. 159 GKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSIR 238 (344) Q Consensus 159 ~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg 238 (344) .++ ..... -++.|.++...+-. 
|..+-.+|++|..|..|..-..- +..|.-...+.+|.-+ T Consensus 255 -~~~------~~~~~--------~~~~~~~~~~~~~~---~~~~a~~v~n~~~~~~l~~lkd~-~G~~i~~~~~~~~~~~ 315 (394) T protein:vir:97 255 -SFT------TKTVK--------NLDEIKALLNGGFD---PAYNVSLIVSQSFYQTLDTLKDG-NGRYLLQDDITAVSGK 315 (394) T ss_pred -ccc------ccccc--------cHHHHHHHHHhhhh---hhhCCEEEEcHHHHHHHHHhhcc-CCCeeeecCcCCCCCc Confidence 000 00000 13444443322211 22234578999999988753211 2222222234566667 Q ss_pred EEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchhhhhh Q lcl|NC_015719. 239 NVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEYQADQ 318 (344) Q Consensus 239 ~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~~~d~ 318 (344) .++|++|+.+++.+.+... .+-|++. . .+..+..+.++++...+ .++... T Consensus 316 ~l~G~pv~~~~~~~~~~~~------------------~~~gd~~--~---------~~~~~~~~~~~~~~~~~-~~~~~~ 365 (394) T protein:vir:97 316 VLLGKPVFVLSDEVLGANK------------------AFIGDFK--R---------GVLFADRKDLGLRWADN-EIYGQY 365 (394) T ss_pred eeccceeEEecccccCCcc------------------EEEeecc--c---------cEEEEEecceEEEEecc-ccccee Confidence 9999999998765433211 1111111 1 11122234455665444 445667 Q ss_pred hhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 319 IIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 319 i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +++.+++|.++++|++.+.+.++.++ T Consensus 366 ~~~~~r~d~~v~~~~a~~~~~~~~~~ 391 (394) T protein:vir:97 366 LQAVLRFGVSKVDDKAGYYVTFTPEP 391 (394) T ss_pred EEEEEEEccEEecccceEEEEecccc Confidence 89999999999999999999999999 No 132 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=99.18 E-value=3.4e-12 Score=83.48 Aligned_cols=280 Identities=8% Similarity=0.065 Sum_probs=155.2 Q ss_pred CCC-ccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec--CcceeeeeeCCC Q lcl|NC_015719. 
1 MAN-MQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI--GRTKAAYLQPGE 77 (344) Q Consensus 1 ma~-~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i--G~~t~~~~~~g~ 77 (344) +.. +.+.... .+.-.+..+++--.+..+.|..++.+..+..+.++++.+..++.++ +.+++.. +......+..+. T Consensus 95 ~~~~lr~~~~~-~~~~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~E~~ 172 (389) T protein:vir:10 95 INDFIHSHGKV-IDATSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTP-KGTYPILKRATDRFSSVAELA 172 (389) T ss_pred HHHHhhcchhh-hhhhcccccCCcceeehHHHHHHHHHHHHhhhhHHhhcceeeccCC-eeEEEEEecCCCccccccccc Confidence 000 0000000 0000011112222346689999999999999999999888877543 3455544 333444555555 Q ss_pred CCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc Q lcl|NC_015719. 78 SLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAG 157 (344) Q Consensus 78 ~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~ 157 (344) ..+.. ..++..++++.+.+.. .-+.|.+-=-..+.+|+.+.+.++.+++|++..|..|+..+.. T Consensus 173 ~~~~~-~~~~~~~i~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~-------------- 236 (389) T protein:vir:10 173 ENPKL-AEPEFNKVDWSVATYR-GAIPLSEEAIADSAVDLTALVGQSIKEKSVNTYNAMIAPVLQS-------------- 236 (389) T ss_pred ccccc-ccccceeeeeeheeeE-eeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhhhcc-------------- Confidence 55432 2345566666664442 3344544222345688999999999999999999988632210 Q ss_pred ccCceeeecccccccccchhhHHHHHHHHHHHHH-HHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcccc----cccc Q lcl|NC_015719. 158 LGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARA-ALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAA----LIDP 232 (344) Q Consensus 158 ~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~-~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~----~~~~ 232 (344) +. ..+..+. ..++.|.++.. .++.. .+-.++++|..|..|..-..- +..|.- .... 
T Consensus 237 ---~~--~~~~~~~---------~~~d~l~~~~~~~~~~~----~~a~~~~n~~~~~~L~~lkd~-~G~~i~~~~~~~~~ 297 (389) T protein:vir:10 237 ---FT--AKKTTTD---------TLVDSLKHILNVDLDPA----YSRALVVTQSLFNTLDTLKDK-NGRYLLHDASDSIT 297 (389) T ss_pred ---cc--ccccccc---------ccHHHHHHHHHhhhhhh----hCcEEEecHHHHHHHHHhhcc-CCCeeeecCccccc Confidence 00 0011111 11455555433 33322 234688999999988763322 222221 1122 Q ss_pred ccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEec-HHHHhhhhhheeeeeeeec Q lcl|NC_015719. 233 ERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQH-RSAVGTVKLKDLALERARR 311 (344) Q Consensus 233 ~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~-~~Av~~~~~~~~~~e~~~~ 311 (344) ..|..++++|.+|+.+++...+..++. .+.++.. ++++..+..+.++++..++ T Consensus 298 ~~~~~~~l~G~pV~~~~~~~~~~~~~~--------------------------~~~~~gd~~~~~~~~~~~~~~i~~~~~ 351 (389) T protein:vir:10 298 DGTAKGTILGVPVYVVGDTLLGSLAGD--------------------------QKAFVGDLKRGVLFTDRQQVTLAWEDS 351 (389) T ss_pred ccccccccccceeEEecccccCCCCCc--------------------------eEEEEeeccccEEEEeecceEEEeecc Confidence 345567899999998766422211110 0111111 1122233345567776655 Q ss_pred chhhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 312 AEYQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 312 ~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ..|...+++.+++|+++++|++.+.+.++.++ T Consensus 352 -~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~~~ 383 (389) T protein:vir:10 352 -KIYGKYLGAAFRFGVQKADSKAGYFVTNTDVP 383 (389) T ss_pred -ccccceEEEEEEeccEEecccceEEEEeeccC Confidence 45556789999999999999999999888766 No 133 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=99.17 E-value=2.6e-12 Score=84.08 Aligned_cols=282 Identities=11% Similarity=0.022 Sum_probs=150.0 Q ss_pred CCCccccccc-cccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec--CcceeeeeeCCC Q lcl|NC_015719. 
1 MANMQGGQQL-GTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI--GRTKAAYLQPGE 77 (344) Q Consensus 1 ma~~~~~~~~-~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i--G~~t~~~~~~g~ 77 (344) +..+...-.. ..|..+.....+.-.+..+.+...+... ...+.++.+.+..++..+ +.++|.. +...+..+..+. T Consensus 141 ~~~~~~~~~~~e~~~~~~~~~~~~g~lvp~~~~~~i~~~-~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~e~~ 218 (437) T protein:vir:10 141 VTAFADYLKTGEVRDVTGIALKDGKVIIPETILTPEKEV-HQFPRLGSLVRTESVTTT-TGKLPIFNNSTDLLTAHTEYG 218 (437) T ss_pred hhhhHHHHHhhhhhhhhhcccccccccchHHHHHHHHHh-hhhhhhhhcceeEeeccC-ceeeEEeeccccccccccccc Confidence 0000000000 0011111122222234567888887664 455566777777665544 3455544 333445555555 Q ss_pred CCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc Q lcl|NC_015719. 78 SLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAG 157 (344) Q Consensus 78 ~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~ 157 (344) .++... .+..+++++.+-+.. .-+.|.+-=-..+.+|+.+.+..+.+++|+...|..|+.... T Consensus 219 ~~~e~~-~~~~~~v~~~~~k~~-~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~g~g--------------- 281 (437) T protein:vir:10 219 QTTKNA-TPVITPILWDLKTYT-GGYVFSQELISDSSYDWQAELQSRLIELRDNTDDSLIITALT--------------- 281 (437) T ss_pred cccccc-cccceeeeeehhhee-eehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhc--------------- Confidence 554321 234455555554432 223444311224567899999999999999999988863211 Q ss_pred ccCceeeecccccccccchhhHHHHHHHHHHHH-HHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccce Q lcl|NC_015719. 158 LGKPSLLEVGAKADLTDPVKLGQAVIAQLTIAR-AALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGS 236 (344) Q Consensus 158 ~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~-~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~ 236 (344) .+.. .+.++. .++.|.++. ..|+....+ +-.+|++|..|..|..-.. .++.|.-...+..|. 
T Consensus 282 --~~~~--~~~~~~----------~~~~~~~~~~~~l~~~~~~--~~~~~~~~~~~~~l~~lkd-~~g~~~~~~~~~~~~ 344 (437) T protein:vir:10 282 --DGIK--KTTSTY----------LLGDLKKVLNVTLKPQDSA--AASIVMSQSAYNLFDMATD-AMGRPLLQPNVTAAT 344 (437) T ss_pred --cccc--cccccc----------chhhHHHHHHhhhhhhhhc--CCEEEEcHHHHHHHHHhhc-cCCCeeeccCccCCC Confidence 1110 001110 022333332 244444432 3356999999998865322 222333223455677 Q ss_pred eEEEeCeEEEEeccc--cccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchh Q lcl|NC_015719. 237 IRNVMGFEVVEVPHL--TAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEY 314 (344) Q Consensus 237 Vg~i~G~~V~~sn~l--p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~ 314 (344) -++++|.+|+.+++. |..+.+. ....-|++ +.++..+..+.++++...+-.. T Consensus 345 ~~~l~G~pv~~~~~~~~~~~~~~~---------------~~~~~gd~-----------~~~~~~~~r~~~~~~~~~~~~~ 398 (437) T protein:vir:10 345 GYTLLGKTVVIVDDKLFPSASAGD---------------VNIVVAPL-----------KKAVINFKLTEITGQFQDTYDI 398 (437) T ss_pred CcccccceeEEecccccCCcCCCc---------------eEEEEeec-----------cccEEEEeeeceEEEEeccccc Confidence 789999999998765 3322111 11111221 1122233334556665555556 Q ss_pred hhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 315 QADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 315 ~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +...+++.++|++++++|++.+.|+.+.+| T Consensus 399 ~~~~~~~~~r~d~~~~~~~a~~~l~~~~~~ 428 (437) T protein:vir:10 399 WYKQLGIFLRQNVVQASKDLIVNLTGKLKA 428 (437) T ss_pred ccceeeEEEEEccEEecccceEEEEeeccc Confidence 667788889999999999999999877777 No 134 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=99.17 E-value=1.1e-12 Score=86.18 Aligned_cols=282 Identities=14% Similarity=0.115 Sum_probs=166.7 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhc--CCcee-eec-----ccccEEEEeecCcc--ee Q lcl|NC_015719. 
1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTA--NRHMQ-RQI-----SSGKSAQFPVIGRT--KA 70 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~--~~~~~-~~i-----~~G~tv~i~~iG~~--t~ 70 (344) ||+.+ |+... -+-.|+|...|.+...+.+.|. +.+.. -++ .+|+++.+|..+.. .. T Consensus 1 Ma~~~------T~l~d--------~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~~~~l~G~~ 66 (330) T protein:vir:10 1 MANEL------TKILD--------TITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGLLVNMPFWNDLTGDS 66 (330) T ss_pred CCCCc------eEeee--------eechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhcCCCEEEecccccCCCcc Confidence 99844 33322 1566999999999998776663 22222 122 26999999999765 34 Q ss_pred eeeeCCC-CCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|NC_015719. 71 AYLQPGE-SLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLAD 149 (344) Q Consensus 71 ~~~~~g~-~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~ 149 (344) ..+..|. .++. ..+..++..-+| ...-..+.+.|+....+-.|++.++.++.+...++..+..++..+.+.-+... T Consensus 67 ~~~~dg~~~i~~--~ki~t~~~~a~i-~~~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~l~gvf~~~~ 143 (330) T protein:vir:10 67 EVLGNGDKALET--GKITAGADIACV-LYRGRGWAANELTGVVAGSDPVRAILNRIGAYWLREDQKALIATLNGIFATGT 143 (330) T ss_pred cccCCCccccch--hhcccceeEEEE-EeecceeeehhhhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHHHHhhhhhhh Confidence 5565553 4543 356666655554 33445688999998888889999999999999999988888876665433211 Q ss_pred ccccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhh-hcccc Q lcl|NC_015719. 150 GVNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNA-ANYAA 228 (344) Q Consensus 150 ~~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~-~~~~~ 228 (344) ... ...+......+. +. ..+..-++.|.+|..+|.++. ..-..+++.|..|..|.+.. +++ ..+. 
T Consensus 144 ~~~--~~~~~~~~~~~~-~~-------~~a~~s~~~l~~A~~~~GD~~--~~~~~ivmhS~v~~~L~~~~-li~~~~~s- 209 (330) T protein:vir:10 144 AGE--KGALEETHVSDQ-SK-------ASTGIDAGMVLDAKQLLGDSA--DQVTAIAMHSAVYTKLQKDN-LIQYIQPT- 209 (330) T ss_pred ccc--chhhhhhheecc-cc-------cccccCHHHHHHHHHHhcccc--ccceEEEEcHHHHHHHHHhh-hhhhhccc- Confidence 110 000000000000 00 001111466788888887764 23468899999999998743 332 2221 Q ss_pred ccccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhh---eee Q lcl|NC_015719. 229 LIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLK---DLA 305 (344) Q Consensus 229 ~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~---~~~ 305 (344) ..++.|+.++|.+|+.+..+|..... | ..+++-+-|++.+... .+. T Consensus 210 ---~~~~~i~~~~G~~VivdD~~p~~~~~----------------y------------t~yl~~~GAi~~~~~~~~~~v~ 258 (330) T protein:vir:10 210 ---TATINIPTYLGYRVIIDDGIAPTGDI----------------Y------------TSYLFRTGSIGLNTGNPSGLTT 258 (330) T ss_pred ---ccCcccccccceEEEEeCCCCCCCCc----------------e------------eEEEEecCceeeecccCCcccc Confidence 12567899999999999999853210 0 1133345555554422 256 Q ss_pred eeeeecchhhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 306 LERARRAEYQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 306 ~e~~~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +|..|+++.-.|.+...+.|...+.=...........+. 
T Consensus 259 ~EtdRd~~~g~~~l~~r~~~~~hp~G~s~~~~~~~~~~~ 297 (330) T protein:vir:10 259 FETSREAAKGNDMIYTRRALVMHPYGVKWTGAEVDAGNI 297 (330) T ss_pred ccccCCccccceEEEEeeEEEeeeeeeeecccccccCcC Confidence 788899887777776666665443221111111111111 No 135 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=99.17 E-value=4.3e-12 Score=82.89 Aligned_cols=291 Identities=12% Similarity=0.042 Sum_probs=156.1 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcc-----eeeeeeC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRT-----KAAYLQP 75 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~-----t~~~~~~ 75 (344) +....- .....+-..+...++...+..+.|+.++.+.....+.++++++..++. +.++.+++.... .+..+.. T Consensus 105 ~~~~~~-~~~~~~~~~~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~a~~v~E 182 (413) T protein:vir:81 105 YVAPRV-KAASDPASTATLTDEFQGGYGTTWNRNIIYRRREKLVVADLMDNLTMT-NTTIKYLMEKANRVVEGGFKTVAE 182 (413) T ss_pred hhhhHH-HhhhhhhhhcccccccccccchhhHHHHHHHHhhhhhHHhhcceeecc-CCceeEEEeccccccccccceecC Confidence 000000 000001111222334445678999999999999999999999888875 445666664322 2334455 Q ss_pred CCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|NC_015719. 76 GESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENI 155 (344) Q Consensus 76 g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~ 155 (344) |+.++... ....+++++.+.+... -+.|.+ +-.+.+.++.+.+.++.++++++..|+.++.. .. 
....+ T Consensus 183 g~~~~~~~-~~~f~~i~~~~~k~~~-~~~iS~-ell~ds~~l~~~i~~~la~~~~~~~d~~~l~G----~G----~~~~~ 251 (413) T protein:vir:81 183 GGKKPYMR-FADFDIVTESLSKIAG-LTKITD-EMIEDYDFLVSYINARLLEELAIEEERQLLLG----DG----TGNNL 251 (413) T ss_pred cccccccC-cccceeeEeeeeeEEE-eehhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc----CC----CCCcc Confidence 66654321 1234556666555432 245654 22222345788888889999999999988631 11 11111 Q ss_pred ccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcccc------- Q lcl|NC_015719. 156 AGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAA------- 228 (344) Q Consensus 156 ~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~------- 228 (344) .|. +.... ..+........+++.+.++...+..+..-..+. +|++|..|..|.+-..- +..|.- T Consensus 252 ~Gi-----~~~~~--~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~-~vmn~~~~~~l~~lkd~-~G~~l~~~~~~~~ 322 (413) T protein:vir:81 252 TGL-----LKRDG--IQTLAVSNKDELADSIYKAMTNISLATPFQADA-LVINPLDYQELRLAKDA-NGQYYGGGVFQGQ 322 (413) T ss_pred ccc-----ccccc--cccccccccchhHHHHHHHHHHhhhhccCCCcE-EEEcHHHHHHHHHhhcc-CCceecccccccc Confidence 111 11100 001111112234666666766655444322333 67899999987543211 111111 Q ss_pred ccccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeee Q lcl|NC_015719. 229 LIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALER 308 (344) Q Consensus 229 ~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~ 308 (344) ......+..++++|.+|+.|+.+|.+.. +-+++. . ..+++ ....++++. T Consensus 323 ~~~~~~~~~~~l~G~pv~~s~~~~~~~~--------------------~~gd~~--~-~~~~~--------~~~~~~v~~ 371 (413) T protein:vir:81 323 YGSGGIMLDPAPWGLRTVQSQVVPVGKP--------------------VVGAFR--S-AASVL--------RKGGVRIDS 371 (413) T ss_pred ccccccccCceecceeeEEcCCCCcccE--------------------EEEecc--c-EEEEE--------EecceEEEE Confidence 1111122335899999999999985321 111111 1 11222 233455666 Q ss_pred eecc-hh-hhh--hhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
309 ARRA-EY-QAD--QIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 309 ~~~~-~~-~~d--~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) .+.. .+ ..| .+++.++++..+++|++.+.++++..+ T Consensus 372 ~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~ 411 (413) T protein:vir:81 372 TNTNVDDFENNLITVRAEERVGLMVTFPEAIVQLDVAEVV 411 (413) T ss_pred eccccchhhcCcEEEEEEEeeccEEecccceEEEEecCCC Confidence 5543 22 233 567778999999999999999998888 No 136 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=99.16 E-value=6.1e-12 Score=82.08 Aligned_cols=277 Identities=8% Similarity=0.014 Sum_probs=155.6 Q ss_pred CCCcccccc-ccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeeccc-ccEEEEeecCcc--eeeeeeCC Q lcl|NC_015719. 1 MANMQGGQQ-LGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISS-GKSAQFPVIGRT--KAAYLQPG 76 (344) Q Consensus 1 ma~~~~~~~-~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~-G~tv~i~~iG~~--t~~~~~~g 76 (344) +........ ..+-++ +--.+..+.|+.+|.+..+..+.++++.+..++.+ ...+.++..... .+.....| T Consensus 98 ~~~~~~~~~~~~~~~~------~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~ 171 (395) T protein:vir:38 98 VKDFKNLVTSGTTGTG------NAGLTIPEDIQLQIRTLTRSFTSLESLANVENVTTSHGSRVYEKLADITPLKDLDDES 171 (395) T ss_pred HHHHHHHHhhccCccC------CCceecchhHhhHHHHHHHhhcchhhhcceeeccCCcceEEEEeeccCCccccccccc Confidence 100000000 001111 11135678999999999999999999988877653 233444444332 22334446 Q ss_pred CCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Q lcl|NC_015719. 77 ESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIA 156 (344) Q Consensus 77 ~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (344) +.++... .++..++++...+...+ ..|.+-=-..+.+|+.+.+..+.+++|++..|+.|+.-.. 
T Consensus 172 ~~~~~~~-~~~f~~v~~~~~k~~~~-~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g-------------- 235 (395) T protein:vir:38 172 ALIGDND-DPELTVVKYLIHRYAGI-TTVTNTLLKDTVDNIIQWLVNWAAKKDVVTRNAKILEVMG-------------- 235 (395) T ss_pred ccccccc-ccceeeEEeeeeeeEee-hhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc-------------- Confidence 5554321 23445555555444322 3454421223568899999999999999999998863111 Q ss_pred cccCceeeecccccccccchhhHHHHHHHHHHHHH-HHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccc Q lcl|NC_015719. 157 GLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARA-ALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERG 235 (344) Q Consensus 157 ~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~-~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G 235 (344) .+. ..++.. -++.|.++.. .|+...- .+-.++++|..|..|.+-..- +..|.-...+.+| T Consensus 236 ---~~~--~~~~~~-----------~~~~i~~~~~~~l~~~~~--~~a~~v~n~~~~~~L~~lkd~-~G~~l~~~~~~~~ 296 (395) T protein:vir:38 236 ---KAP--KKPTIS-----------QFDNIKDLENNTLDPAIE--STSSFITNQSGYNILSKVKDA-DGRYLMQPDVTSP 296 (395) T ss_pred ---ccc--cccccc-----------cHHHHHHHHHHhhhhhhc--CCCEEEEcHHHHHHHHHhhcc-CCceeeccCcCCC Confidence 111 001111 1344444432 3333322 334678999999998753222 2333323345677 Q ss_pred eeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEec-HHHHhhhhhheeeeeeeecch- Q lcl|NC_015719. 236 SIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQH-RSAVGTVKLKDLALERARRAE- 313 (344) Q Consensus 236 ~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~-~~Av~~~~~~~~~~e~~~~~~- 313 (344) ....++|++|+.+++.+.+..++. .+.++.. +.++..+..+.++++..+... T Consensus 297 ~~~~l~G~pV~~~~~~~~~~~~~~--------------------------~~i~~gd~~~~~~i~~~~~~~i~~~~~~~~ 350 (395) T protein:vir:38 297 DKYLIDGKPVIRIADKWLPDVSGS--------------------------HPLYFGDLKQGITLFDRQQMQIDTTNVGAG 350 (395) T ss_pred CcceeccceeEEecccccCcCCCc--------------------------ceEEEEeccccEEEEEecceEEEEeccccc Confidence 778999999999987655432211 0111111 112333444556666665432 Q ss_pred ---hhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
314 ---YQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 314 ---~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +-...++...++|+++++|++.+.+..+..+ T Consensus 351 ~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~ 384 (395) T protein:vir:38 351 SFEHDTTKLRFIDRFDVQLIDDGAFAAASFKTVA 384 (395) T ss_pred hhhcCceEEEEEEeeccEEecccceEEEEeeccc Confidence 2234578888899999999999999999888 No 137 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=99.11 E-value=3.7e-12 Score=83.27 Aligned_cols=273 Identities=14% Similarity=0.093 Sum_probs=146.3 Q ss_pred CC---------CccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec--Ccce Q lcl|NC_015719. 1 MA---------NMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI--GRTK 69 (344) Q Consensus 1 ma---------~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i--G~~t 69 (344) |. ......+.. ..| ..++--.|..+.+..++.+..+..+.++++.++.++.+ . ++|.+ +..+ T Consensus 64 ~~~~~~~~~~~~~~~~~~al-~~~---~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~v~~~~~-~--~~p~~~~~~~~ 136 (352) T protein:vir:78 64 ILPNEFEKPSMEAQRLLHAL-PTG---NDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG-L--EIPRVSYTLDD 136 (352) T ss_pred hhhhHHHHHHhhHHHHHHHh-ccC---CCCCCceeccHhHHHHHHHHHHhhcchhhheeeEecCC-c--eEEEEecCCCc Confidence 00 000000000 000 11111225679999999999999999999998877643 3 33432 2234 Q ss_pred eeeeeCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|NC_015719. 70 AAYLQPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLAD 149 (344) Q Consensus 70 ~~~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~ 149 (344) +.-...|..++.. +++.+++++.+.++.. -+.|.+-=-.++.+|+.+.+.++.++++++.-++.++.. . 
T Consensus 137 a~~v~E~~~~~~~--~~~f~~v~~~~~k~~~-~i~is~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~--------g 205 (352) T protein:vir:78 137 DDFITDVETAKEL--KLKGDTVKFTTNKFKV-FAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAV--------S 205 (352) T ss_pred ccccccccccccc--cccceeeeecceeEEe-echhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhhhhc--------C Confidence 5555556666554 3566776766654432 245554222235789999999999999987644444310 0 Q ss_pred ccccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccc Q lcl|NC_015719. 150 GVNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAAL 229 (344) Q Consensus 150 ~~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~ 229 (344) ...+...+.....+ ....+ +...|+.|+++...|+..... . -.++++|..|..|+.-.+-.+ T Consensus 206 ----~g~~~~~g~l~~~~-~~~~t-----~~~~~d~i~~~~~~l~~~~~~-~-a~~~mn~~t~~~l~~~~~~~~------ 267 (352) T protein:vir:78 206 ----PKSGLEHMSFYNGS-VKEVE-----GANMYDAIINALADLHEDYRD-N-ATIYMRYADYVKIISVLSNGT------ 267 (352) T ss_pred ----CCCcccccceeccc-ccccc-----ccchHHHHHHHHhccChhhhc-C-CEEEEehHHHHHHHHHHhccC------ Confidence 00111111111100 01111 122367788787777666542 3 345667777776654322111 Q ss_pred cccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeee Q lcl|NC_015719. 230 IDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERA 309 (344) Q Consensus 230 ~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~ 309 (344) ..+..|.-.+++|.+|+.++..+..- -|+|.. -.+. .....++.+ T Consensus 268 ~~~~~~~~~~llG~PV~~~~~~~~~~----------------------~Gdf~~----------~~~~---~~~~~~~~~ 312 (352) T protein:vir:78 268 TNFFDTPAEKVFGKPVVFTDAAVKPI----------------------VGDFNY----------FGIN---YDGTTYDTD 312 (352) T ss_pred CcccccCCccccccceEEecCCCcee----------------------Eeehhh----------hhhh---hhhheeeee Confidence 12233444579999999987654211 011110 0000 011234444 Q ss_pred ecchhhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
310 RRAEYQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 310 ~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ++....--.+++.+++++++++|++.+.+.+++.| T Consensus 313 ~~~~~g~~~f~~~~r~Dg~~~~~eA~~~l~~~a~~ 347 (352) T protein:vir:78 313 KDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKEST 347 (352) T ss_pred ccccCCeeEEEEEeeeCceeechhheEEEEeeccc Confidence 44333234566678999999999999999999988 No 138 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=99.10 E-value=1.1e-11 Score=80.61 Aligned_cols=284 Identities=15% Similarity=0.092 Sum_probs=153.1 Q ss_pred CCCccc----c-----ccccccccccccccchhhhhHHH-HhhHHHHHHHHhhhhcCC-ceeeecccccEEEEeec-Ccc Q lcl|NC_015719. 1 MANMQG----G-----QQLGTNQGKGQSAADKLALFLKV-FGGEVLTAFARTSVTANR-HMQRQISSGKSAQFPVI-GRT 68 (344) Q Consensus 1 ma~~~~----~-----~~~~~~~g~~~~~~d~~~l~~e~-f~geV~~~f~~~s~~~~~-~~~~~i~~G~tv~i~~i-G~~ 68 (344) ++...+ + .....|.......++--.|.... ++.++.+..+..++++.+ ++..+...| .+.||+. +.+ T Consensus 334 ~a~~~G~~arg~~~~~~~l~~ra~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l~~~~~~~~~g-~~~ip~~~~~~ 412 (632) T protein:vir:96 334 IADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVG-DVDIPKKTSGA 412 (632) T ss_pred HHHhhhhhhhhhhhhHHHHHHhhhhcccccccccccccccchHHHHHHHhhcchhhhhcceEeecCCc-ceEEEEEeCCc Confidence 000000 0 00001111111112212244444 567888887778887776 343344444 4778876 555 Q ss_pred eeeeeeCCCCCCCCcCCcccceEEEEeeeeeeec-eeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|NC_015719. 69 KAAYLQPGESLDDKRKDIKHTEKTINIDGLLTAD-VLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINL 147 (344) Q Consensus 69 t~~~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~-~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~ 147 (344) ++..+..|+.++.+ .++.+++++.. .++.. +.|.+-=-.++.+|+.+.+..+.+++|++..|+.+|.- .. 
T Consensus 413 ~a~wv~E~~~~~~s--~~~f~~i~l~~--~k~~~~v~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G----~G- 483 (632) T protein:vir:96 413 NFYWIGEDEDVQDS--DFDFTTLSFSP--KTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTG----TG- 483 (632) T ss_pred eeEeecCCcccccc--ccceeeEEeee--eEEEEehhhHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhhcc----cC- Confidence 66666677777654 35566666655 33333 33433112246789999999999999999999988621 11 Q ss_pred ccccccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccc Q lcl|NC_015719. 148 ADGVNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYA 227 (344) Q Consensus 148 ~~~~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~ 227 (344) ....|.|....+.+. ..+. ..+...++.|+++...+...++....-..+++|..+..|...... +-. T Consensus 484 ---~~~~p~Gi~~~~~~~-----~~~~--~~~~~~~~~i~~~~~~i~~~~~~~~~~~~~~~~~~~~~l~~~~l~---d~~ 550 (632) T protein:vir:96 484 ---LANDPVGLLNMTGVP-----ALTY--PAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVF---DNT 550 (632) T ss_pred ---CCCccceeeeccccc-----ceec--ccccCCHHHHHHHHHHHhhcccccCccEEEEchhHHHHHHHHhcc---CCC Confidence 111122211111010 0000 001112667888888888888865566678899888777653211 111 Q ss_pred cccccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeee Q lcl|NC_015719. 228 ALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALE 307 (344) Q Consensus 228 ~~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e 307 (344) |.-.+.. +.+.|.+|+.||.+|.+.+. -+++ ... ++ +....+.++ T Consensus 551 G~~i~~~---~~l~G~pv~~s~~ip~~~~~--------------------~gd~--s~~--~i--------~~~~~~~i~ 595 (632) T protein:vir:96 551 GERIWQN---NEVNGYRAEASNQIPADTWI--------------------FGDW--SQI--VI--------AMWGVLDLK 595 (632) T ss_pred CceeecC---CeecccceEeccccccCcEE--------------------Eeec--ceE--EE--------EEecceEEE Confidence 2222333 36899999999999864310 0111 110 00 111112232 Q ss_pred ee--ecchhhhhhhhhhhhhcCceeccccEEEEEecC Q lcl|NC_015719. 
308 RA--RRAEYQADQIIAKYAMGHGGLRPESAGALVFKA 342 (344) Q Consensus 308 ~~--~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~ 342 (344) .. .....-.-.++..+.++.++++|++.++++.++ T Consensus 596 ~~~~~~~~~~~v~~~~~~~~d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 596 VDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) T ss_pred EccccccccCceEEEEEeecCceeechhhhhheeecC Confidence 22 222222336778899999999999999999998 No 139 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=99.10 E-value=2.7e-11 Score=78.53 Aligned_cols=284 Identities=10% Similarity=0.043 Sum_probs=154.7 Q ss_pred CCCccccc--------cccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeeccccc-EEEEeec-Cccee Q lcl|NC_015719. 1 MANMQGGQ--------QLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGK-SAQFPVI-GRTKA 70 (344) Q Consensus 1 ma~~~~~~--------~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~-tv~i~~i-G~~t~ 70 (344) |.+-.... ..-.+.......++--.+..+.|.+++.+..+..+.++++.+...+.++. ...++.. +.+.+ T Consensus 84 l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a 163 (392) T protein:vir:10 84 LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPF 163 (392) T ss_pred HhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccc Confidence 11000000 00000000011111112467899999999999999999999988886432 3344443 44466 Q ss_pred eeeeCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|NC_015719. 71 AYLQPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADG 150 (344) Q Consensus 71 ~~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~ 150 (344) .....|..++... .++.+++++..-+. +.-..|.+-=-.++.+|+.+.+.++.++++++..|..++.... 
T Consensus 164 ~~v~E~~~~~~~~-~~~~~~v~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g-------- 233 (392) T protein:vir:10 164 AEITEMGEIPETD-NPKFSNVQYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIE-------- 233 (392) T ss_pred eeecccccccccc-cccceeEEeeeeeE-EEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc-------- Confidence 6666676665321 23556666666444 3334555422223568999999999999999999998863110 Q ss_pred cccccccccCceeeecccccccccchhhHHHHHHHHHHHH-HHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccc Q lcl|NC_015719. 151 VNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIAR-AALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAAL 229 (344) Q Consensus 151 ~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~-~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~ 229 (344) .++. ... ..++.|.++. ..|+....+ +-.+|++|..|..|.+-.. .+..|.=. T Consensus 234 ---------~~~~------~~~--------~~~d~i~~~~~~~l~~~~~~--~a~~vm~~~~~~~L~~lkd-~~G~~l~~ 287 (392) T protein:vir:10 234 ---------KLTK------QAI--------KSLDDIKDVLNVKLDPAISP--NAILLTNQDGFNYLDKLKD-KDGKYILQ 287 (392) T ss_pred ---------cccc------cCc--------cCHHHHHHHHHHhhhhhhcc--CCEEEEcHHHHHHHHHhhc-cCCCeEee Confidence 0000 000 0145555544 345554443 3457899999999865321 12223222 Q ss_pred cccccceeEEEeCeEEEE-e-ccccccccccccccccccccccccccccccccccccceeEEEecH-HHHhhhhhheeee Q lcl|NC_015719. 230 IDPERGSIRNVMGFEVVE-V-PHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHR-SAVGTVKLKDLAL 306 (344) Q Consensus 230 ~~~~~G~Vg~i~G~~V~~-s-n~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~-~Av~~~~~~~~~~ 306 (344) ..+..|.-++++|.+++. + +++|....... ...+.++... ..+..+....+++ T Consensus 288 ~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~------------------------~~~~~~~gdfs~~~~i~~~~~~~~ 343 (392) T protein:vir:10 288 SDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTA------------------------KKAPLIIGDLKEAIVLFKREDMEL 343 (392) T ss_pred cCccCCccccccCcccEEEecccccCCCcccC------------------------CceEEEEEehhceEEEEeecceEE Confidence 234566667899997654 3 33332211100 0111122221 1222333444555 Q ss_pred eeee--cchhhhh--hhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
307 ERAR--RAEYQAD--QIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 307 e~~~--~~~~~~d--~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +..+ +..+..+ .+++..++|.++++|++.+.+.++..| T Consensus 344 ~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a 385 (392) T protein:vir:10 344 ASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSA 385 (392) T ss_pred EEeccccchhhcCceEEEEEEeeccEEecccceEEEEecccc Confidence 5443 2233333 378889999999999999999998887 No 140 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=99.10 E-value=2.7e-11 Score=78.53 Aligned_cols=284 Identities=10% Similarity=0.043 Sum_probs=154.7 Q ss_pred CCCccccc--------cccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeeccccc-EEEEeec-Cccee Q lcl|NC_015719. 1 MANMQGGQ--------QLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGK-SAQFPVI-GRTKA 70 (344) Q Consensus 1 ma~~~~~~--------~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~-tv~i~~i-G~~t~ 70 (344) |.+-.... ..-.+.......++--.+..+.|.+++.+..+..+.++++.+...+.++. ...++.. +.+.+ T Consensus 84 l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a 163 (392) T protein:vir:10 84 LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPF 163 (392) T ss_pred HhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccc Confidence 11000000 00000000011111112467899999999999999999999988886432 3344443 44466 Q ss_pred eeeeCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|NC_015719. 71 AYLQPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADG 150 (344) Q Consensus 71 ~~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~ 150 (344) .....|..++... .++.+++++..-+. +.-..|.+-=-.++.+|+.+.+.++.++++++..|..++.... 
T Consensus 164 ~~v~E~~~~~~~~-~~~~~~v~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g-------- 233 (392) T protein:vir:10 164 AEITEMGEIPETD-NPKFSNVQYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIE-------- 233 (392) T ss_pred eeecccccccccc-cccceeEEeeeeeE-EEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc-------- Confidence 6666676665321 23556666666444 3334555422223568999999999999999999998863110 Q ss_pred cccccccccCceeeecccccccccchhhHHHHHHHHHHHH-HHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccc Q lcl|NC_015719. 151 VNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIAR-AALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAAL 229 (344) Q Consensus 151 ~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~-~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~ 229 (344) .++. ... ..++.|.++. ..|+....+ +-.+|++|..|..|.+-.. .+..|.=. T Consensus 234 ---------~~~~------~~~--------~~~d~i~~~~~~~l~~~~~~--~a~~vm~~~~~~~L~~lkd-~~G~~l~~ 287 (392) T protein:vir:10 234 ---------KLTK------QAI--------KSLDDIKDVLNVKLDPAISP--NAILLTNQDGFNYLDKLKD-KDGKYILQ 287 (392) T ss_pred ---------cccc------cCc--------cCHHHHHHHHHHhhhhhhcc--CCEEEEcHHHHHHHHHhhc-cCCCeEee Confidence 0000 000 0145555544 345554443 3457899999999865321 12223222 Q ss_pred cccccceeEEEeCeEEEE-e-ccccccccccccccccccccccccccccccccccccceeEEEecH-HHHhhhhhheeee Q lcl|NC_015719. 230 IDPERGSIRNVMGFEVVE-V-PHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHR-SAVGTVKLKDLAL 306 (344) Q Consensus 230 ~~~~~G~Vg~i~G~~V~~-s-n~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~-~Av~~~~~~~~~~ 306 (344) ..+..|.-++++|.+++. + +++|....... ...+.++... ..+..+....+++ T Consensus 288 ~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~------------------------~~~~~~~gdfs~~~~i~~~~~~~~ 343 (392) T protein:vir:10 288 SDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTA------------------------KKAPLIIGDLKEAIVLFKREDMEL 343 (392) T ss_pred cCccCCccccccCcccEEEecccccCCCcccC------------------------CceEEEEEehhceEEEEeecceEE Confidence 234566667899997654 3 33332211100 0111122221 1222333444555 Q ss_pred eeee--cchhhhh--hhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
307 ERAR--RAEYQAD--QIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 307 e~~~--~~~~~~d--~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +..+ +..+..+ .+++..++|.++++|++.+.+.++..| T Consensus 344 ~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a 385 (392) T protein:vir:10 344 ASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSA 385 (392) T ss_pred EEeccccchhhcCceEEEEEEeeccEEecccceEEEEecccc Confidence 5443 2233333 378889999999999999999998887 No 141 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=99.10 E-value=2.7e-11 Score=78.53 Aligned_cols=284 Identities=10% Similarity=0.043 Sum_probs=154.7 Q ss_pred CCCccccc--------cccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeeccccc-EEEEeec-Cccee Q lcl|NC_015719. 1 MANMQGGQ--------QLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGK-SAQFPVI-GRTKA 70 (344) Q Consensus 1 ma~~~~~~--------~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~-tv~i~~i-G~~t~ 70 (344) |.+-.... ..-.+.......++--.+..+.|.+++.+..+..+.++++.+...+.++. ...++.. +.+.+ T Consensus 84 l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a 163 (392) T protein:vir:10 84 LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPF 163 (392) T ss_pred HhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccc Confidence 11000000 00000000011111112467899999999999999999999988886432 3344443 44466 Q ss_pred eeeeCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|NC_015719. 71 AYLQPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADG 150 (344) Q Consensus 71 ~~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~ 150 (344) .....|..++... .++.+++++..-+. +.-..|.+-=-.++.+|+.+.+.++.++++++..|..++.... 
T Consensus 164 ~~v~E~~~~~~~~-~~~~~~v~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g-------- 233 (392) T protein:vir:10 164 AEITEMGEIPETD-NPKFSNVQYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIE-------- 233 (392) T ss_pred eeecccccccccc-cccceeEEeeeeeE-EEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc-------- Confidence 6666676665321 23556666666444 3334555422223568999999999999999999998863110 Q ss_pred cccccccccCceeeecccccccccchhhHHHHHHHHHHHH-HHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccc Q lcl|NC_015719. 151 VNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIAR-AALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAAL 229 (344) Q Consensus 151 ~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~-~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~ 229 (344) .++. ... ..++.|.++. ..|+....+ +-.+|++|..|..|.+-.. .+..|.=. T Consensus 234 ---------~~~~------~~~--------~~~d~i~~~~~~~l~~~~~~--~a~~vm~~~~~~~L~~lkd-~~G~~l~~ 287 (392) T protein:vir:10 234 ---------KLTK------QAI--------KSLDDIKDVLNVKLDPAISP--NAILLTNQDGFNYLDKLKD-KDGKYILQ 287 (392) T ss_pred ---------cccc------cCc--------cCHHHHHHHHHHhhhhhhcc--CCEEEEcHHHHHHHHHhhc-cCCCeEee Confidence 0000 000 0145555544 345554443 3457899999999865321 12223222 Q ss_pred cccccceeEEEeCeEEEE-e-ccccccccccccccccccccccccccccccccccccceeEEEecH-HHHhhhhhheeee Q lcl|NC_015719. 230 IDPERGSIRNVMGFEVVE-V-PHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHR-SAVGTVKLKDLAL 306 (344) Q Consensus 230 ~~~~~G~Vg~i~G~~V~~-s-n~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~-~Av~~~~~~~~~~ 306 (344) ..+..|.-++++|.+++. + +++|....... ...+.++... ..+..+....+++ T Consensus 288 ~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~------------------------~~~~~~~gdfs~~~~i~~~~~~~~ 343 (392) T protein:vir:10 288 SDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTA------------------------KKAPLIIGDLKEAIVLFKREDMEL 343 (392) T ss_pred cCccCCccccccCcccEEEecccccCCCcccC------------------------CceEEEEEehhceEEEEeecceEE Confidence 234566667899997654 3 33332211100 0111122221 1222333444555 Q ss_pred eeee--cchhhhh--hhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
307 ERAR--RAEYQAD--QIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 307 e~~~--~~~~~~d--~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +..+ +..+..+ .+++..++|.++++|++.+.+.++..| T Consensus 344 ~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a 385 (392) T protein:vir:10 344 ASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSA 385 (392) T ss_pred EEeccccchhhcCceEEEEEEeeccEEecccceEEEEecccc Confidence 5443 2233333 378889999999999999999998887 No 142 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=99.10 E-value=2.7e-11 Score=78.53 Aligned_cols=284 Identities=10% Similarity=0.043 Sum_probs=154.7 Q ss_pred CCCccccc--------cccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeeccccc-EEEEeec-Cccee Q lcl|NC_015719. 1 MANMQGGQ--------QLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGK-SAQFPVI-GRTKA 70 (344) Q Consensus 1 ma~~~~~~--------~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~-tv~i~~i-G~~t~ 70 (344) |.+-.... ..-.+.......++--.+..+.|.+++.+..+..+.++++.+...+.++. ...++.. +.+.+ T Consensus 84 l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a 163 (392) T protein:vir:10 84 LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPF 163 (392) T ss_pred HhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccc Confidence 11000000 00000000011111112467899999999999999999999988886432 3344443 44466 Q ss_pred eeeeCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|NC_015719. 71 AYLQPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADG 150 (344) Q Consensus 71 ~~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~ 150 (344) .....|..++... .++.+++++..-+. +.-..|.+-=-.++.+|+.+.+.++.++++++..|..++.... 
T Consensus 164 ~~v~E~~~~~~~~-~~~~~~v~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g-------- 233 (392) T protein:vir:10 164 AEITEMGEIPETD-NPKFSNVQYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIE-------- 233 (392) T ss_pred eeecccccccccc-cccceeEEeeeeeE-EEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc-------- Confidence 6666676665321 23556666666444 3334555422223568999999999999999999998863110 Q ss_pred cccccccccCceeeecccccccccchhhHHHHHHHHHHHH-HHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccc Q lcl|NC_015719. 151 VNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIAR-AALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAAL 229 (344) Q Consensus 151 ~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~-~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~ 229 (344) .++. ... ..++.|.++. ..|+....+ +-.+|++|..|..|.+-.. .+..|.=. T Consensus 234 ---------~~~~------~~~--------~~~d~i~~~~~~~l~~~~~~--~a~~vm~~~~~~~L~~lkd-~~G~~l~~ 287 (392) T protein:vir:10 234 ---------KLTK------QAI--------KSLDDIKDVLNVKLDPAISP--NAILLTNQDGFNYLDKLKD-KDGKYILQ 287 (392) T ss_pred ---------cccc------cCc--------cCHHHHHHHHHHhhhhhhcc--CCEEEEcHHHHHHHHHhhc-cCCCeEee Confidence 0000 000 0145555544 345554443 3457899999999865321 12223222 Q ss_pred cccccceeEEEeCeEEEE-e-ccccccccccccccccccccccccccccccccccccceeEEEecH-HHHhhhhhheeee Q lcl|NC_015719. 230 IDPERGSIRNVMGFEVVE-V-PHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHR-SAVGTVKLKDLAL 306 (344) Q Consensus 230 ~~~~~G~Vg~i~G~~V~~-s-n~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~-~Av~~~~~~~~~~ 306 (344) ..+..|.-++++|.+++. + +++|....... ...+.++... ..+..+....+++ T Consensus 288 ~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~------------------------~~~~~~~gdfs~~~~i~~~~~~~~ 343 (392) T protein:vir:10 288 SDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTA------------------------KKAPLIIGDLKEAIVLFKREDMEL 343 (392) T ss_pred cCccCCccccccCcccEEEecccccCCCcccC------------------------CceEEEEEehhceEEEEeecceEE Confidence 234566667899997654 3 33332211100 0111122221 1222333444555 Q ss_pred eeee--cchhhhh--hhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
307 ERAR--RAEYQAD--QIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 307 e~~~--~~~~~~d--~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +..+ +..+..+ .+++..++|.++++|++.+.+.++..| T Consensus 344 ~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a 385 (392) T protein:vir:10 344 ASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSA 385 (392) T ss_pred EEeccccchhhcCceEEEEEEeeccEEecccceEEEEecccc Confidence 5443 2233333 378889999999999999999998887 No 143 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=99.07 E-value=3.8e-11 Score=77.68 Aligned_cols=292 Identities=11% Similarity=0.028 Sum_probs=151.7 Q ss_pred CC-Cccccc----cccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec-Ccceeeeee Q lcl|NC_015719. 1 MA-NMQGGQ----QLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI-GRTKAAYLQ 74 (344) Q Consensus 1 ma-~~~~~~----~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~ 74 (344) +. .+.... +. .+.. ++.++--.+..+.|..++.+..++.+.++++++..++.+| ..+||.. +..++.... T Consensus 68 ~~~~l~~~~r~~~~~-~~~~--~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~-~~~i~~~~~~~~a~~~~ 143 (390) T protein:vir:40 68 GANALTSDESKYYNE-VIAG--NGFAGVTALLPPTVFERVFEDLTVEHPLLSKINFVNTTAT-TEWIISVGDVATAWWGP 143 (390) T ss_pred CchhccHHHHHHHHH-HHhc--cCcccCcccccHHHHHHHHHHHHhhhhhhhhceeeecCCc-eeEEEEEcCCcceeeec Confidence 00 000000 00 0000 1112222367799999999999999999999998887554 4556664 444555555 Q ss_pred CCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|NC_015719. 75 PGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNEN 154 (344) Q Consensus 75 ~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~ 154 (344) .+..++.. ..++.++++|.+-+. +.-+.|.+-=-.++.+|+.+.+.++.++++++..|+.++.- ... .. 
T Consensus 144 E~~~~~~~-~~~~f~~i~l~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~~i~~~~~~a~l~G----~G~-----~~ 212 (390) T protein:vir:40 144 LCAEIKEV-LDNGFDKIQTGMYKL-SAYIPVCNAMLDLGPSWLDQYVRTILGEAMALGLEAGIVNG----SGK-----DQ 212 (390) T ss_pred cccccCcc-ccccceeeEeeeeeE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhcc----cCC-----Cc Confidence 55555432 234566767766444 23355654333346778999999999999999999988631 110 11 Q ss_pred cccccCce-eeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCc-CCCEEEeCHHHHHHHhccchhhhhcccccccc Q lcl|NC_015719. 155 IAGLGKPS-LLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPA-NDRTFYTTPDVYSAILAALMPNAANYAALIDP 232 (344) Q Consensus 155 ~~~~~~~~-~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~-~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~ 232 (344) |.|..... ....+.....+........+.+.+..+...+....-+. .+-+++++|..+..+++..+... +- ++.+ T Consensus 213 P~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~~~~~~~-d~--~G~~ 289 (390) T protein:vir:40 213 PIGMMRDLNNVTAGEHPVKTATPLTDLTPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIYAATSYM-TP--QGVW 289 (390) T ss_pred cceeeeccccccccccccccccccchhhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHHHHhhcc-CC--CCcc Confidence 11111100 00000000001111111112333333433443332221 23456788876554444322111 11 1111 Q ss_pred ccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecc Q lcl|NC_015719. 233 ERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRA 312 (344) Q Consensus 233 ~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~ 312 (344) ..+ ....|.+|+.++++|.+.. + -+++ ... ++ +..+.++++...+. T Consensus 290 v~~--~~~~g~pvv~~~~~p~~~i----~----------------~Gd~--s~~--~i--------~~~~~~~v~~~~~~ 335 (390) T protein:vir:40 290 VTG--ILPVPLEIVQSVAVPVGKA----V----------------AGRA--KDY--FM--------GIGSEQVIRTSTEY 335 (390) T ss_pred ccc--cCCCceeEEEcCCCCCCcE----E----------------EEee--ceE--EE--------EeecceEEEecchh Confidence 111 1246999999999985421 0 0111 110 11 22344566655433 Q ss_pred hh--hhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
313 EY--QADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 313 ~~--~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) .+ -...+++.+++++++++|++.++|.+++-+ T Consensus 336 ~f~~~~~~~r~~~r~dg~v~~~~A~~~l~~~~~~ 369 (390) T protein:vir:40 336 RLLDDETLYYAKQYANGRPKDNSSFLVFDITGLE 369 (390) T ss_pred hhhcCcEEEEEEEEeCCEEecccceEEEEeeccC Confidence 22 224578999999999999999999998887 No 144 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=99.03 E-value=6.6e-11 Score=76.39 Aligned_cols=300 Identities=11% Similarity=0.032 Sum_probs=144.9 Q ss_pred CCCccccccccccccccccccchhhhhHHH-HhhHHHHHHHHhhhhcCCceeeeccc-ccEEEEeecCcce-eee-eeCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKV-FGGEVLTAFARTSVTANRHMQRQISS-GKSAQFPVIGRTK-AAY-LQPG 76 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~-f~geV~~~f~~~s~~~~~~~~~~i~~-G~tv~i~~iG~~t-~~~-~~~g 76 (344) ...........+-.+. ++ .+.+.. ..+++.+..+..+.++++++...+.+ +.++.||++.... ..+ ...| T Consensus 148 ~~~~~~~~~~~~~~~~---gg---~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~~~~Eg 221 (477) T protein:vir:84 148 AKVGEEYRDLDRNGGT---GG---YAVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTGTSTAIQAADN 221 (477) T ss_pred HHhhhhhccccccCCC---cc---eeeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecCcceeeeeccC Confidence 0000000000000111 11 134444 57889888888888888888887764 5578999863332 222 3334 Q ss_pred CCCCCCcC---CcccceEEEEeeeeeeec-eeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|NC_015719. 77 ESLDDKRK---DIKHTEKTINIDGLLTAD-VLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVN 152 (344) Q Consensus 77 ~~~~~~~~---~~~~~~~~l~iD~~~~~~-~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~ 152 (344) ..+..... ++....++ ++-.++.. +.|.+-=-.++.+|+.+.+.++.+++|+...|+.++. +.. .. 
T Consensus 222 ~~~~~~~~~~s~~~f~~i~--~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~~l~----G~G----t~ 291 (477) T protein:vir:84 222 AALTAPSAHEVDLTDGFVQ--ANVKTIAGQQGIAIQLLDQAAVSVDEFVFRDLAADYANKLNVQVIS----GTG----SN 291 (477) T ss_pred cccccccccccccceeeEE--EeeeeEEeeeHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHhc----cCC----CC Confidence 43322211 12233333 44444443 3344322234578999999999999999999998862 111 11 Q ss_pred cccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhc-----c- Q lcl|NC_015719. 153 ENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAAN-----Y- 226 (344) Q Consensus 153 ~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~-----~- 226 (344) ..|.|......+.....+...........+++.|+++...++.... .....++++|..|..|.+-..-.... + T Consensus 292 ~~p~Gi~~~~~~~~~~~~~~~~t~~~~~~~~~~i~~~~~~~~~~~~-~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~ 370 (477) T protein:vir:84 292 NQVVGVRATAGITQVTATSAGSALEKHQIIYQKIADAIQRVHTSRF-LEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGP 370 (477) T ss_pred CccceeeeccccccccccccccchhhHHHHHHHHHHHHhhcccccc-CCccEEEEcHHHHHHHHHhhccCCCeeeecCcc Confidence 1222221111010000011011111123356667776665554433 23356788999988876532211110 0 Q ss_pred ------ccccccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhh Q lcl|NC_015719. 227 ------AALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVK 300 (344) Q Consensus 227 ------~~~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~ 300 (344) .....+.+|..++++|.+|+.|+.+|...+... +. ....-+++ .+ .+++ + T Consensus 371 ~~~~~~~~~~~~~~~~~~~l~G~pVv~s~~~p~~~~~~~----------d~--~~i~~gd~--~~--~~i~--------~ 426 (477) T protein:vir:84 371 GFNNLGVLTEVASQRVVGQMHGLPVVTDPTLPTTLGTGT----------DQ--DVIHVLRA--SD--LALF--------E 426 (477) T ss_pred cccccccccccccccccchhcccceEecCcccccccccC----------Cc--ceEEEEEe--ce--EEEE--------e Confidence 011235566678999999999999996432110 00 00011111 11 0111 1 Q ss_pred hheeeeeeeecchhhhhhhh-hhhhhc---Cceec-cccEEEEEecCCC Q lcl|NC_015719. 
301 LKDLALERARRAEYQADQII-AKYAMG---HGGLR-PESAGALVFKAGA 344 (344) Q Consensus 301 ~~~~~~e~~~~~~~~~d~i~-~~~~~G---~~v~R-p~~~~~l~~~~~a 344 (344) . .+.++ .++..+.+... .+.+|| .+.+| |++.+.++.++.+ T Consensus 427 ~-~~~~~--~~~~~~~~~~~~~~~v~~~~~~~~~r~~~afv~~t~~~~~ 472 (477) T protein:vir:84 427 S-SVRMR--ALQETRAENLSVLLQVYGYLAFTAARFPQSVVEIGGTALT 472 (477) T ss_pred e-ceeEE--eccccccccceeeeeehhhhhhhhhccccceEEeeccccc Confidence 1 12233 33333333222 222232 35666 9999998888777 No 145 >protein:vir:105610 Length: 430 # NCBI annotation: virion structural protein # Family: family:all:974 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164307;genbank:gi:56692923;genbank:GeneID:3197221 Probab=99.03 E-value=1.5e-10 Score=74.42 Aligned_cols=319 Identities=12% Similarity=0.087 Sum_probs=171.5 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhh--------------------------hcCCceeeec Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSV--------------------------TANRHMQRQI 54 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~--------------------------~~~~~~~~~i 54 (344) |.... |..+. +|+. -+++|+.-+...-.+++. -++.++..++ T Consensus 1 ~~~a~------T~~~~----~~p~--a~~~ws~~l~~~~~k~~~~~~kl~G~~~~~~~~~~~~~~~~ts~~~pI~r~~dL 68 (430) T protein:vir:10 1 MTASK------TTMRY----GDPN--AMIQQAAGLFALCQGRNSTLNRLTGKMPSGTSDAEKKTKGQSSLELPIVQAQDL 68 (430) T ss_pred Cccee------eeccc----CChh--HHHHHHHHHHHHHhhhhhhHHHhhccccccccchhhhccCCCCCCccEEEeccC Confidence 54433 44544 3444 466777766655544211 1235555555 Q ss_pred c--cccEEEEeecCcceeeeeeCCCCCCCCcCCcccceEEEEeeeeeeeceecc-chHHHHhChhHHHHHHHHHHHHHHH Q lcl|NC_015719. 55 S--SGKSAQFPVIGRTKAAYLQPGESLDDKRKDIKHTEKTINIDGLLTADVLIY-DIEDAMNHYDVRSEYTSQIGESLAM 131 (344) Q Consensus 55 ~--~G~tv~i~~iG~~t~~~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Id-d~D~~q~~~d~~~~~~~~~~~aLa~ 131 (344) . .|++|.|+-+...+-.-...++.+.+.-+.++.....|.|||.. .++.+. 
.++.-.+-+|+|.+--..++.=+++ T Consensus 69 ~K~~GD~Vtf~L~~~L~g~gv~Gd~~lEGnee~L~~~~d~l~IDq~R-~~V~~gg~msqQRt~~dlR~~ar~~L~~w~~~ 147 (430) T protein:vir:10 69 GRNKGDEVRFHFVQPANAFPIMGSEYAEGKGTGLKIGSDQLRVNQAR-FPVDLGDVMSQIRNPYDLRRLGRPKAKWFMDA 147 (430) T ss_pred CCCCccEEEEeEeeccccCceecCceeeccccceEEEeeEEEEeeec-cccccCCchhhhhhhhHHHHHHHHHHHHHHHH Confidence 3 59999999998877666666778888888888899999999986 344443 4566677899999999999999999 Q ss_pred HHHHHHHHHHHHhhh-----------------------cccccccccccccCceeeecccccc----------cccchhh Q lcl|NC_015719. 132 AADGAVLAELAGLIN-----------------------LADGVNENIAGLGKPSLLEVGAKAD----------LTDPVKL 178 (344) Q Consensus 132 ~~D~~i~~~~~~~a~-----------------------~~~~~~~~~~~~~~~~~i~~~~~~~----------~t~~~~~ 178 (344) ..||.+|.+|+++-. ...+|+.+ -....-+.++. .+... T Consensus 148 ~~Dq~~~v~laGarg~~~~~~~~~~~~~~~~~~~~~~N~v~aPt~n------rh~~~~G~at~~~~~~~~~~sl~stD-- 219 (430) T protein:vir:10 148 YLDQSMLVHLAGARGNHYNKEWCLPLETHPKLADMLVNRVKAPTKN------RHFVASADAITGVAPNAGEYNITTAD-- 219 (430) T ss_pred HHHHHHHHHHhhhhcccccccccccccCCcchhhhhccccCCCCCc------eeEeecccccccccccccccchhhhc-- Confidence 999999999976411 11111111 01111111111 11111 Q ss_pred HHHHHHHHHHHHHHHhhcCCC-------cCC-------CEEEeCHHHHHHHhccchhhh----h----ccccccccccce Q lcl|NC_015719. 179 GQAVIAQLTIARAALTKNYVP-------AND-------RTFYTTPDVYSAILAALMPNA----A----NYAALIDPERGS 236 (344) Q Consensus 179 ~~~i~~~l~~a~~~Ld~~~VP-------~~g-------R~~vv~P~~~~~Ll~~~~~~~----~----~~~~~~~~~~G~ 236 (344) ..-++.|.+++..++..+.| .+. ++++++|.+|..|+.++.+-. + ..+....+-.|. 
T Consensus 220 -~~s~~~id~a~~~a~~~~~~i~Pv~v~gd~~~g~~~~yV~~~~p~q~~~Lr~dt~~~~wq~~~~a~a~~g~~nPlF~G~ 298 (430) T protein:vir:10 220 -VLDVDVVDSIATYMDQIELPPPPVKFEGDEAAEDSPIRVLLCSPAQYNSFAKQEKFRSWQAAALARASNAKQHPIFRVD 298 (430) T ss_pred -ccCHHHHHHHHHHHHhhCCCCcceEeecccccCCccEEEEEechHHHHHHhhCcchHHHHHHHHHhhcccccCCceecc Confidence 12267777888888887643 122 567899999999999987631 1 112345688999 Q ss_pred eEEEeCeEEEEeccc-ccccc--ccccccccccccccccccccccccccccceeEEEecHHHHhhhhhhe-------eee Q lcl|NC_015719. 237 IRNVMGFEVVEVPHL-TAGGA--GDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKD-------LAL 306 (344) Q Consensus 237 Vg~i~G~~V~~sn~l-p~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~-------~~~ 306 (344) +++++|+-|++-+++ ++..+ ..+.................+.....- .-+|+.-.-|++.+.... .=. T Consensus 299 ~gm~ngvii~~~~~virf~~g~~~~~~a~~~~~~~~~~~~~a~~~~~~~v--~RalllGaQA~~~A~g~~~~~g~~f~w~ 376 (430) T protein:vir:10 299 AGLWSNTLIIKMPKPIRFYAGDTIKYCAAYNSEAESSAVVSDSFGNQYAV--DRALLLGGQALAQAWAASEHSGMPFFWS 376 (430) T ss_pred eeeecCeEEecCCceeeecCCCccccccCCcccccccccccccccccccc--hhhhhccchhheeeeeccCCCCcceeee Confidence 999999999987643 22211 111000000000000000000000000 011222222332222221 112 Q ss_pred eeeecchhhhhhhhhhhhhcCceeccc----------cEEEEEecCCC Q lcl|NC_015719. 307 ERARRAEYQADQIIAKYAMGHGGLRPE----------SAGALVFKAGA 344 (344) Q Consensus 307 e~~~~~~~~~d~i~~~~~~G~~v~Rp~----------~~~~l~~~~~a 344 (344) |-..|-.+.- .|.....+|.+=.|-. 
=-|+|.+..-| T Consensus 377 Ee~~D~g~~~-~i~~~~i~G~kK~rF~~~~~~~~~~~DfGvi~idtaa 423 (430) T protein:vir:10 377 EKDMDHGDKL-ELLIGAILGCSKIRFAVEATNGLEYTDHGVMAIDTAV 423 (430) T ss_pred eeccccCchh-hhhhhHHhccceeeecCCCCCCceeeeeEEEEhhhhh Confidence 3233322221 2344444554333332 23444444333 No 146 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=99.01 E-value=8e-11 Score=75.95 Aligned_cols=289 Identities=10% Similarity=0.031 Sum_probs=148.8 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceee--eccc-ccEEEEeec-CcceeeeeeCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQR--QISS-GKSAQFPVI-GRTKAAYLQPG 76 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~--~i~~-G~tv~i~~i-G~~t~~~~~~g 76 (344) ++.+..+. .+.++++ |. -+..+.|.+++.+..+..++++.+-... ...+ -..++||+. +.+++..+..| T Consensus 331 ~~a~~~~~--~~~~~~~---Gg--~~vp~~~~~~ii~~l~~~svv~~l~~~~~~~~~~~~~~~~ip~~t~~~~a~wv~Eg 403 (645) T protein:vir:93 331 KSAVGAGT--TTDPQWA---GS--LSEYQEYAQDFIDYLRPQTIIGRFGQGGIPALRQVPFNIRVHAQVSGGAAGWVGEG 403 (645) T ss_pred hhhhhccc--ccccccc---CC--ccCchhhHHHHHHhhhhhhhHHhhccccccccccccCceeeeeeecCcceEEeccC Confidence 00000000 0111111 21 1456889999998888888887664321 1111 124677764 55666667778 Q ss_pred CCCCCCcCCcccceEEEEeeeeeeec-eeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|NC_015719. 77 ESLDDKRKDIKHTEKTINIDGLLTAD-VLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENI 155 (344) Q Consensus 77 ~~~~~~~~~~~~~~~~l~iD~~~~~~-~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~ 155 (344) +.++.+ ..+.+++++.. .+... 
..|.+-=-.++.+|+.+.+..+.+++|++..|+.+|..- .+...+..+ T Consensus 404 ~~~~~s--~~~f~~v~l~~--~kla~~~~iS~ell~ds~~~~~~~i~~~l~~aia~~~d~a~l~g~-----g~~~~~~~p 474 (645) T protein:vir:93 404 KTKPLT--KFDFESITFSH--AKVSAIAVLTEELIRFSSPAADALVRNALAEAVVARLDTDFVDPK-----KAAVADVSP 474 (645) T ss_pred cccccc--ccceeEEEEee--EEEEEeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHhhcCC-----CcccCCccc Confidence 777654 34666666654 33333 334431112467899999999999999999999886211 001011111 Q ss_pred ccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccc Q lcl|NC_015719. 156 AGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERG 235 (344) Q Consensus 156 ~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G 235 (344) .+. ..+..+..+. ......+..+...|..+++...+-+.|++|..+..|.+-..-. ..+.-...-..| T Consensus 475 ~gi------~~~~~~~~~~-----~~~~~d~~~~~~~~~~a~~~~~~a~~vmn~~~~~~L~~lkd~~-G~~~~~~~~~~~ 542 (645) T protein:vir:93 475 ASI------THDVKGTASS-----GNPDADAEAAFGQFVAANLQPTGAVWLMSSTNALALSMRKNAL-GQKEYPDMTLLG 542 (645) T ss_pred cce------eccccccccc-----cchHHHHHHHHHHHHhcCCCccccEEEEcHHHHHHHHhccccC-CceeecCCCCCC Confidence 111 1111110111 1123456677777888888666677889999999987643221 121101111122 Q ss_pred eeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecc--- Q lcl|NC_015719. 236 SIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRA--- 312 (344) Q Consensus 236 ~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~--- 312 (344) ++++|.+|+.|+++|..-+ .+.. 
...+-+..+ -+-+.+...+ +++....+ T Consensus 543 --~tL~G~PV~~s~~vp~~~~----~gd~------s~~~ig~~~------~v~i~~s~~a---------~~~~~~~~~~~ 595 (645) T protein:vir:93 543 --GSFQGLPVIVSQYVGDQLV----LVNA------PDIYLADDG------GVAVDMSREA---------SLEMQSEPTGD 595 (645) T ss_pred --ceeeceeeEEeccCCccee----Eecc------ccEEEEEec------ceEEEeecce---------eEEEeeccccc Confidence 4899999999999985311 0000 000000000 0001111111 11111100 Q ss_pred -----------hhhh--hhhhhhhhhcCceeccccEEEEEec-CCC Q lcl|NC_015719. 313 -----------EYQA--DQIIAKYAMGHGGLRPESAGALVFK-AGA 344 (344) Q Consensus 313 -----------~~~~--d~i~~~~~~G~~v~Rp~~~~~l~~~-~~a 344 (344) .++. -.|+..++++.+++||++.++|+-- =|| T Consensus 596 ~~~~~~~~~v~lf~~d~vaira~~r~d~~~~~p~a~~~lt~~~~g~ 641 (645) T protein:vir:93 596 STTPSPVELVSMFQTGSVAIRAERWINWRRRRTAAVAVITGVNYGS 641 (645) T ss_pred ccccccccchhHhhcCceEEEEEEEEcceeeCccceEEEecccCCc Confidence 1222 2467778899999999999988721 122 No 147 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=99.01 E-value=1.5e-11 Score=79.94 Aligned_cols=275 Identities=13% Similarity=0.070 Sum_probs=148.2 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeeccccc-EEEEeecCcceeeeeeCCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGK-SAQFPVIGRTKAAYLQPGESL 79 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~-tv~i~~iG~~t~~~~~~g~~~ 79 (344) +....... .+.-.+....+...+.++.+..++.+. .....+++..+...+..++ .+.++..+...+..+..+... 
T Consensus 121 ~~~~~~~~---~~~~~~~~~~~~~~~vp~~~~~~i~~~-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~ 196 (397) T protein:vir:96 121 NAFVKSKG---AEKRDGFTSVEGGALIPQELLQPQLEP-KDIVDLSKYVRSVPVNSASGKFPVISKSGSKMATVQQLEKN 196 (397) T ss_pred HHHHHhhh---hhhhhcccccccccchhHHHHHHHHHh-hhhhhHHHhhhhccccccceeEEEEeccCCccccccccccc Confidence 00000000 011111122222335668888888764 3334445666665554322 244444454555555555555 Q ss_pred CCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccc Q lcl|NC_015719. 80 DDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGLG 159 (344) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~~ 159 (344) +.. ..+...++++.+.+.. .-..|.+---.++.+|+.+.+..+.++++++..|..|+... + T Consensus 197 ~~~-~~~~~~~i~~~~~~~~-~~~~~s~ell~ds~~~l~~~i~~~l~~~~~~~~~~~i~~g~-----------------g 257 (397) T protein:vir:96 197 PQL-ANPKMVEIDYSVATRR-GYIPISQEMIDDASYDVTGLIADEIQDQSLNTKNADIAAVL-----------------K 257 (397) T ss_pred ccc-ccccccceeecHhHhh-cchhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc-----------------c Confidence 432 1345566677665442 22344432223356789999999999999999998875211 0 Q ss_pred CceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccceeEE Q lcl|NC_015719. 160 KPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSIRN 239 (344) Q Consensus 160 ~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg~ 239 (344) .++. . +. ..++.|.++....... ..+-.+|++|..|..|..-.. 
.+..|.-...+.+|.-++ T Consensus 258 ~~~~--~---~~---------~~~d~~~~~~~~~~~~---~~~a~~v~n~~~~~~l~~lkd-~~G~~~~~~~~~~~~~~~ 319 (397) T protein:vir:96 258 TATA--K---SV---------VGVDGLKDLINKEIKK---VYDVKLFISASMYSELDKLKD-KNGRYLLQDSITAASGKQ 319 (397) T ss_pred cccc--c---cc---------cchHHHHHHHHHhhhh---hcCcEEEEcHHHHHHHHHhhc-cCCCeEeccCccCCCccc Confidence 0000 0 00 0144454443322221 123468999999999875322 223333223455666679 Q ss_pred EeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHH-HHhhhhhheeeeeeeecchhhhhh Q lcl|NC_015719. 240 VMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRS-AVGTVKLKDLALERARRAEYQADQ 318 (344) Q Consensus 240 i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~-Av~~~~~~~~~~e~~~~~~~~~d~ 318 (344) ++|.+|+.+++.+.+...+. .+.++..-+ ++..+....++++...+ .++... T Consensus 320 l~G~pv~~~~~~~~~~~~~~--------------------------~~~~~gd~~~~~~~~~~~~~~~~~~~~-~~~~~~ 372 (397) T protein:vir:96 320 LLGKEVVVLDDDVIGKSVGN--------------------------VVGFIGDAKAFASFFDRKQVSVSWVDN-NIYGQL 372 (397) T ss_pred ccccceEEecccccCCCCCc--------------------------eEEEEeehhcceEeEeecceEEEEecc-ccccee Confidence 99999999887544322110 111211111 22233344556665543 455667 Q ss_pred hhhhhhhcCceeccccEEEEEecCC Q lcl|NC_015719. 319 IIAKYAMGHGGLRPESAGALVFKAG 343 (344) Q Consensus 319 i~~~~~~G~~v~Rp~~~~~l~~~~~ 343 (344) +++.+++|+++++|++.+.+.++.. T Consensus 373 ~~~~~r~d~~~~~~~a~~~~~~~~a 397 (397) T protein:vir:96 373 LAGIIRYDVKATDKKAGFYVTFTIG 397 (397) T ss_pred EEEEEEEccEEecccceEEEEeecC Confidence 8999999999999999999999888 No 148 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=99.00 E-value=7.8e-11 Score=75.99 Aligned_cols=298 Identities=14% Similarity=0.125 Sum_probs=151.4 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec--CcceeeeeeCCCC Q lcl|NC_015719. 
1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI--GRTKAAYLQPGES 78 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i--G~~t~~~~~~g~~ 78 (344) +.+....... -+....+.+++--.+..+.|..++.+..++.+.++++.+...+.++ ++.||+. +..++..+..|+. T Consensus 138 ~~~~~~~~~~-~~~~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~-~~~~~~~~~~~~~a~wv~E~~~ 215 (497) T protein:vir:78 138 FADGETAPAA-IGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSP-NLSYLTESAAHNNAAAVAEAGT 215 (497) T ss_pred HhhhhhhHHH-HHhhhcccCcccccccchhhhHHHHHHHHhhhhHHhhccccccCCC-ceEEEEEcCCCCcceeeccCcc Confidence 0000000000 0000001112222357799999999999999999999998877655 5888874 3456677777777 Q ss_pred CCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccc Q lcl|NC_015719. 79 LDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGL 158 (344) Q Consensus 79 ~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~ 158 (344) ++.+ +++.+++++..-+.-.+ ..|.+ +-.+.+.++.+.+.++.++++++..|..+|.- ... ..|.|. T Consensus 216 ~~~s--~~~f~~i~~~~~k~a~~-~~iS~-ell~d~~~l~~~i~~~l~~~i~~~~d~~~l~G----~G~-----~~p~Gi 282 (497) T protein:vir:78 216 YPFS--SEEFARVYEQVGKVANA-LTITD-EGLRDAPELFNFVQGRLLEGIQRKEEVQLLAG----GGY-----PGVNGL 282 (497) T ss_pred cccc--cccceeeEeeeeeeEee-cHhHH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHhhcC----CCc-----cccccc Confidence 7654 35667766665444322 34443 22233456888899999999999999988631 100 011111 Q ss_pred cCce---eeeccccc-cc--------------ccchhhH------------------------------HHHHHHHHHHH Q lcl|NC_015719. 159 GKPS---LLEVGAKA-DL--------------TDPVKLG------------------------------QAVIAQLTIAR 190 (344) Q Consensus 159 ~~~~---~i~~~~~~-~~--------------t~~~~~~------------------------------~~i~~~l~~a~ 190 (344) -... ....+... .. +...... ..+...+..+. 
T Consensus 283 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 362 (497) T protein:vir:78 283 LQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAF 362 (497) T ss_pred ccccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHH Confidence 0000 00000000 00 0000000 00111122222 Q ss_pred HHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcc-----ccccccccceeEEEeCeEEEEeccccccccccccccccc Q lcl|NC_015719. 191 AALTKNYVPANDRTFYTTPDVYSAILAALMPNAANY-----AALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGT 265 (344) Q Consensus 191 ~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~-----~~~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~ 265 (344) ..+..... ...-.+|++|..|..|.+-..-....+ .+......+....++|.+|+.++.+|.+.. .. T Consensus 363 ~~~~~~~~-~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~---~~---- 434 (497) T protein:vir:78 363 VDIQLTLF-QTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTI---LV---- 434 (497) T ss_pred hhhhhhcc-cCCCeEEEchHHHHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCCce---EE---- Confidence 22222111 011147789999988754322211111 111111122234899999999999985321 10 Q ss_pred cccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeec--chhhhh--hhhhhhhhcCceeccccEEEEEec Q lcl|NC_015719. 266 DASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARR--AEYQAD--QIIAKYAMGHGGLRPESAGALVFK 341 (344) Q Consensus 266 ~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~--~~~~~d--~i~~~~~~G~~v~Rp~~~~~l~~~ 341 (344) +++ ...++..+....++++.... ..+..| .|++..+++..+++|++.+.+.++ T Consensus 435 -------------Gd~----------~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~ 491 (497) T protein:vir:78 435 -------------GHF----------APSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLK 491 (497) T ss_pred -------------eec----------ccceEEEEEecccEEEeecccchhhhcCcEEEEEEEeecceeeccccEEEEEec Confidence 111 11122223334445554432 122233 477788999999999999999999 Q ss_pred CCC Q lcl|NC_015719. 
342 AGA 344 (344) Q Consensus 342 ~~a 344 (344) +++ T Consensus 492 ~~~ 494 (497) T protein:vir:78 492 KGA 494 (497) T ss_pred CCc Confidence 999 No 149 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=99.00 E-value=7.8e-11 Score=75.99 Aligned_cols=298 Identities=14% Similarity=0.125 Sum_probs=151.4 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec--CcceeeeeeCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI--GRTKAAYLQPGES 78 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i--G~~t~~~~~~g~~ 78 (344) +.+....... -+....+.+++--.+..+.|..++.+..++.+.++++.+...+.++ ++.||+. +..++..+..|+. T Consensus 138 ~~~~~~~~~~-~~~~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~-~~~~~~~~~~~~~a~wv~E~~~ 215 (497) T protein:vir:10 138 FADGETAPAA-IGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSP-NLSYLTESAAHNNAAAVAEAGT 215 (497) T ss_pred HhhhhhhHHH-HHhhhcccCcccccccchhhhHHHHHHHHhhhhHHhhccccccCCC-ceEEEEEcCCCCcceeeccCcc Confidence 0000000000 0000001112222357799999999999999999999998877655 5888874 3456677777777 Q ss_pred CCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccc Q lcl|NC_015719. 79 LDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGL 158 (344) Q Consensus 79 ~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~ 158 (344) ++.+ +++.+++++..-+.-.+ ..|.+ +-.+.+.++.+.+.++.++++++..|..+|.- ... ..|.|. 
T Consensus 216 ~~~s--~~~f~~i~~~~~k~a~~-~~iS~-ell~d~~~l~~~i~~~l~~~i~~~~d~~~l~G----~G~-----~~p~Gi 282 (497) T protein:vir:10 216 YPFS--SEEFARVYEQVGKVANA-LTITD-EGLRDAPELFNFVQGRLLEGIQRKEEVQLLAG----GGY-----PGVNGL 282 (497) T ss_pred cccc--cccceeeEeeeeeeEee-cHhHH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHhhcC----CCc-----cccccc Confidence 7654 35667766665444322 34443 22233456888899999999999999988631 100 011111 Q ss_pred cCce---eeeccccc-cc--------------ccchhhH------------------------------HHHHHHHHHHH Q lcl|NC_015719. 159 GKPS---LLEVGAKA-DL--------------TDPVKLG------------------------------QAVIAQLTIAR 190 (344) Q Consensus 159 ~~~~---~i~~~~~~-~~--------------t~~~~~~------------------------------~~i~~~l~~a~ 190 (344) -... ....+... .. +...... ..+...+..+. T Consensus 283 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 362 (497) T protein:vir:10 283 LQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAF 362 (497) T ss_pred ccccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHH Confidence 0000 00000000 00 0000000 00111122222 Q ss_pred HHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcc-----ccccccccceeEEEeCeEEEEeccccccccccccccccc Q lcl|NC_015719. 191 AALTKNYVPANDRTFYTTPDVYSAILAALMPNAANY-----AALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGT 265 (344) Q Consensus 191 ~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~-----~~~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~ 265 (344) ..+..... ...-.+|++|..|..|.+-..-....+ .+......+....++|.+|+.++.+|.+.. .. T Consensus 363 ~~~~~~~~-~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~---~~---- 434 (497) T protein:vir:10 363 VDIQLTLF-QTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTI---LV---- 434 (497) T ss_pred hhhhhhcc-cCCCeEEEchHHHHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCCce---EE---- Confidence 22222111 011147789999988754322211111 111111122234899999999999985321 10 Q ss_pred cccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeec--chhhhh--hhhhhhhhcCceeccccEEEEEec Q lcl|NC_015719. 
266 DASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARR--AEYQAD--QIIAKYAMGHGGLRPESAGALVFK 341 (344) Q Consensus 266 ~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~--~~~~~d--~i~~~~~~G~~v~Rp~~~~~l~~~ 341 (344) +++ ...++..+....++++.... ..+..| .|++..+++..+++|++.+.+.++ T Consensus 435 -------------Gd~----------~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~ 491 (497) T protein:vir:10 435 -------------GHF----------APSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLK 491 (497) T ss_pred -------------eec----------ccceEEEEEecccEEEeecccchhhhcCcEEEEEEEeecceeeccccEEEEEec Confidence 111 11122223334445554432 122233 477788999999999999999999 Q ss_pred CCC Q lcl|NC_015719. 342 AGA 344 (344) Q Consensus 342 ~~a 344 (344) +++ T Consensus 492 ~~~ 494 (497) T protein:vir:10 492 KGA 494 (497) T ss_pred CCc Confidence 999 No 150 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=98.98 E-value=2.4e-11 Score=78.82 Aligned_cols=273 Identities=13% Similarity=0.059 Sum_probs=145.2 Q ss_pred CCCcccc---------ccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec--Ccce Q lcl|NC_015719. 1 MANMQGG---------QQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI--GRTK 69 (344) Q Consensus 1 ma~~~~~---------~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i--G~~t 69 (344) |...... .+.. ..|.. 
++--.+..+.|..++.+.....+.+++++++.++.+ .++|++ +..+ T Consensus 114 ~~~~~~~~~~~~~~~~~~a~-~~~t~---~~GG~lIP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~---~~~p~~~~~~~~ 186 (402) T protein:vir:93 114 ILPNEFEKPSMEAQRLLHAL-PTGND---SGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG---LEIPRVSYTLDD 186 (402) T ss_pred HhhhhHHHHHHhHHHHHhhh-ccCCC---cCCccccchhHHHHHHHhHHhhhhhhhhceeeecCC---ceeeeeeccCCc Confidence 0000000 0000 00110 111135679999999999998899999998877643 234443 3344 Q ss_pred eeeeeCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|NC_015719. 70 AAYLQPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLAD 149 (344) Q Consensus 70 ~~~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~ 149 (344) +.-...|...+.. +++.+++++.+.+.. .-+.|.+-=-..+.+|+.+.+.++.++++++..++.++..- T Consensus 187 a~~v~Eg~~~~~~--~~~f~~i~~~~~k~~-~~i~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g-------- 255 (402) T protein:vir:93 187 DDFITDVETAKEL--KAKGDTVKFTTNKFK-VFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVS-------- 255 (402) T ss_pred ccccccccccccc--ccccceeeecceeee-eechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcC-------- Confidence 5555666666543 345666666554442 22445532122357899999999999999987666554210 Q ss_pred ccccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccc Q lcl|NC_015719. 150 GVNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAAL 229 (344) Q Consensus 150 ~~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~ 229 (344) ...+.+.+.....+.+. ..+...++.|+++...|+..... 
...|+ +++..|..|+.-.+-.+ T Consensus 256 ----~g~g~p~g~~~~~~~~~------~~~~~~~d~l~~~~~~l~~~y~~-na~~i-mn~~t~~~~~~~~~d~~------ 317 (402) T protein:vir:93 256 ----PKSGLEHMSFYNGSVKE------VEGADMYDAIINALADLHEDYRD-NATIY-MRYADYVKIISVLSNGT------ 317 (402) T ss_pred ----CCccccceeeecccccc------ccccchHHHHHHHHhccChhhhc-CCEEE-EechHHHHHHHHHhcCC------ Confidence 11111111111111110 11222477888888888776653 44665 55555444433211111 Q ss_pred cccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeee Q lcl|NC_015719. 230 IDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERA 309 (344) Q Consensus 230 ~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~ 309 (344) ..+..|.-.+++|.+|+.++..+.-.. |+|.. +..... .+.++.+ T Consensus 318 ~~~~~~~~~~llG~PV~~t~~~~~i~~----------------------GDf~~-----------~~~~~~--~~~~~~~ 362 (402) T protein:vir:93 318 TNFFDTPAEKVFGKPVVFTDAAVKPIV----------------------GDFNY-----------FGINYD--GTTYDTD 362 (402) T ss_pred CcccccCCccccccceEEecCCCceee----------------------echhh-----------hhhhhh--hhhhhhh Confidence 122234445799999999876542111 11110 000001 1223444 Q ss_pred ecchhhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 310 RRAEYQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 310 ~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ++...---.+++..++++++++|++...|..++-+ T Consensus 363 ~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~ik~~~ 397 (402) T protein:vir:93 363 KDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENT 397 (402) T ss_pred hcccCCceEEEEEEEeCcEEechhheEEEEeecCC Confidence 44433234567788999999999999999998877 No 151 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=98.95 E-value=6.5e-11 Score=76.43 Aligned_cols=273 Identities=13% Similarity=0.065 Sum_probs=144.1 Q ss_pred CCCcccccc-cc-------ccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEee--cCccee Q lcl|NC_015719. 
1 MANMQGGQQ-LG-------TNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPV--IGRTKA 70 (344) Q Consensus 1 ma~~~~~~~-~~-------~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~--iG~~t~ 70 (344) +........ .. ...+.. ++--.+..+.|..++.+..+..+.++++.++.++.+. .+|. .+..++ T Consensus 99 ~~~~~~~~~~~~~~~~~~al~~~t~---s~gG~~IP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~~---~~p~~~~~~~~a 172 (387) T protein:vir:93 99 ILPNEFEKPSMEAQRLLHALPTGND---SGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGL---EIPRVSYTLDDD 172 (387) T ss_pred hhhhhhhhhhhhhHHHHHhhccCcC---CCCceeechhHHHHHHHHHHhhchhhhheeeeecCCc---eEEEEeecCCcc Confidence 000000000 00 000010 1111256789999999999888889999888776432 3443 233455 Q ss_pred eeeeCCCCCCCCcCCcccceEEEEeeeeeeec-eeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|NC_015719. 71 AYLQPGESLDDKRKDIKHTEKTINIDGLLTAD-VLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLAD 149 (344) Q Consensus 71 ~~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~-~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~ 149 (344) .-...|+..+.. +++.+++++.. .++.. +.|.+-=-..+.+|+.+.+.++.++++++..++.++. .+ T Consensus 173 ~~v~E~~~~~~~--~~~f~~v~~~~--~k~~~~~~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~---~g----- 240 (387) T protein:vir:93 173 DFITDVETAKEL--KLKGDTVKFTT--NKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALA---VS----- 240 (387) T ss_pred ccccCccccccc--ccccceeeeeh--eeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhh---cC----- Confidence 556666665543 35566665554 44444 4455422233578999999999999999876665541 10 Q ss_pred ccccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccc Q lcl|NC_015719. 150 GVNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAAL 229 (344) Q Consensus 150 ~~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~ 229 (344) ...+.+.+.....+.. . ..+...++.|+++...|+.+... 
...| ++++..|..|+.-.+-.+ T Consensus 241 ----~g~g~p~g~l~~~~~~-~-----v~~~~~~d~i~~~~~~l~~~~~~-~a~~-~mn~~t~~~~~~~~~d~~------ 302 (387) T protein:vir:93 241 ----PKSGLDHMSFYNGSVK-E-----VEGADMYDAIINALADLHEDYRD-NATI-YMRYADYVKIISVLSNGT------ 302 (387) T ss_pred ----CCccccceeeeccccc-c-----ccccchHHHHHHHHhccChhhhc-CCEE-EEechHHHHHHHHHhcCC------ Confidence 1111111111111100 0 11122467788887778777663 4456 556665554443211111 Q ss_pred cccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeee Q lcl|NC_015719. 230 IDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERA 309 (344) Q Consensus 230 ~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~ 309 (344) ..+..|.-.+++|.+|+.++..+... -|+|.. .. +. .. .+..+.+ T Consensus 303 ~~~~~~~~~~llG~PV~~~~~~~~~~----------------------~GDf~~--~~-~~--------~~--~~~~~~~ 347 (387) T protein:vir:93 303 TNFFDTPAEKVFGKPVVFTDAAVKPI----------------------VGDFNY--FG-IN--------YD--GTTYDTD 347 (387) T ss_pred CcccccCCccccccceEEecCCCcee----------------------eeehhh--hh-ee--------hh--hheeeec Confidence 12223444589999999987654211 111111 00 00 01 1123334 Q ss_pred ecchhhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
310 RRAEYQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 310 ~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ++...-...+++..++|+++++|++...+.+++.| T Consensus 348 ~~~~~~~~~~~~~~r~d~~v~~~eA~~~l~~k~~~ 382 (387) T protein:vir:93 348 KDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENT 382 (387) T ss_pred ccccCCceeEEEEeeeCceeechhheEEEEeecCC Confidence 43333334466778999999999999999887777 No 152 >protein:vir:2770 Length: 318 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612887;genbank:gi:20065804;genbank:GeneID:935710 Probab=98.95 E-value=3.6e-10 Score=72.37 Aligned_cols=257 Identities=11% Similarity=0.062 Sum_probs=152.0 Q ss_pred CCCccccccc-------cccccccccccchhhhhHHHHhhHHHHHHHHhhhhcC---------Cceeeecc--cccEEEE Q lcl|NC_015719. 1 MANMQGGQQL-------GTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTAN---------RHMQRQIS--SGKSAQF 62 (344) Q Consensus 1 ma~~~~~~~~-------~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~---------~~~~~~i~--~G~tv~i 62 (344) |+|+.+++-. -|..+. .|+ .++.|++.+...-++.+-+.. .++..++. .|++|.| T Consensus 1 mt~~~~~~~~~~~~~~~ft~~~~----~~~---~vk~ws~~l~~~~~~~~~~~~~~g~~~~~~I~r~~dL~K~~GD~Vtf 73 (318) T protein:vir:27 1 MTTVTSAQANKLFQVALFTAANR----NRS---MVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTF 73 (318) T ss_pred CCccCCCChHHHHHHHHHHHHhc----CCh---HHHHHHHhhhhHHHhhhhhhcccCCCCCceEEEeccCCCCCccEEEE Confidence 9998876511 112222 222 357888887655544433332 23333443 5999999 Q ss_pred eecCcceeeeeeCCCCCCCCcCCcccceEEEEeeeeeeeceec-cchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015719. 63 PVIGRTKAAYLQPGESLDDKRKDIKHTEKTINIDGLLTADVLI-YDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAEL 141 (344) Q Consensus 63 ~~iG~~t~~~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~I-dd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~ 141 (344) .-+...+-.-...++.+.+.-+.++.....|.|||..-. +.. 
..++.-.+-+|+|++--..++.-+++..||.+|.+| T Consensus 74 ~L~~~L~g~gv~Gd~~lEGnee~L~~~~d~l~IDq~r~~-V~~gg~msqqRt~~dlR~~ar~~L~~w~~~~~Dq~~~v~l 152 (318) T protein:vir:27 74 SIMHKLSKRPTMGDERVEGRGEDLSHADFSLKINQGRHL-VDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHL 152 (318) T ss_pred eEeeccccCccccCceeeccccceEEEeeEEEEeeeccc-cccccchhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 999887766666677888888888888899999988632 222 346666778999999999999999999999999999 Q ss_pred HHhhhccc-----------ccc----cc-cccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCC------ Q lcl|NC_015719. 142 AGLINLAD-----------GVN----EN-IAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVP------ 199 (344) Q Consensus 142 ~~~a~~~~-----------~~~----~~-~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP------ 199 (344) +++...-. +.. .+ +..-.....+-.+.++...+-...-..-++.|..+...+++..-| T Consensus 153 aGarg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~g~at~~~~l~stD~~s~~lid~~~~~~~~~a~pi~PV~v 232 (318) T protein:vir:27 153 AGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRL 232 (318) T ss_pred hhcccccccccceEecccCccchhhhhcccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceee Confidence 86543100 000 00 000001112222222211111110111255566777778774332 Q ss_pred -cCC-------CEEEeCHHHHHHHhccch------hh-hhcc---ccccccccceeEEEeCeEEEEeccccc--cccccc Q lcl|NC_015719. 200 -AND-------RTFYTTPDVYSAILAALM------PN-AANY---AALIDPERGSIRNVMGFEVVEVPHLTA--GGAGDD 259 (344) Q Consensus 200 -~~g-------R~~vv~P~~~~~Ll~~~~------~~-~~~~---~~~~~~~~G~Vg~i~G~~V~~sn~lp~--~~~~~~ 259 (344) .+. ++++++|.++..|+.+.. +. ++.. +....+..|.+|+++|+=|.+-+++|. .++.. 
T Consensus 233 ~g~~~~~~~~~yV~~~~p~q~~~Lrtdt~~~~w~d~q~~A~~r~~g~knPLF~G~~gm~ngvil~~~~~vpIrf~~G~~- 311 (318) T protein:vir:27 233 SGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGQR- 311 (318) T ss_pred ccccccCCcceEEEEechHHHHHHhhcCCCHHHHHHHHHHHhcccccCCCceecceeeecCEEEeecCCccEEEcCCCe- Confidence 112 567899999999998752 22 2222 234568899999999999999998753 21110 Q ss_pred ccccccccccccccccccccccccccee Q lcl|NC_015719. 260 RPEEGTDASNQKHAFPATGGKVNKENVV 287 (344) Q Consensus 260 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 287 (344) + .+..+. T Consensus 312 v---------------------~~~~~~ 318 (318) T protein:vir:27 312 F---------------------WYQRIT 318 (318) T ss_pred e---------------------eeeecC Confidence 0 000000 No 153 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=98.91 E-value=6.5e-11 Score=76.43 Aligned_cols=273 Identities=13% Similarity=0.056 Sum_probs=145.1 Q ss_pred CCCcccc---------ccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec--Ccce Q lcl|NC_015719. 1 MANMQGG---------QQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI--GRTK 69 (344) Q Consensus 1 ma~~~~~---------~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i--G~~t 69 (344) |...... .+.. ..|....+| .+.++.|..++.+..+..+.++++.++.++.+. ++|++ +..+ T Consensus 99 ~~~~~~~~~~~~~~~~~~a~-~~~~~~~gG---~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~---~~p~~~~~~~~ 171 (387) T protein:vir:26 99 ILPNEFEKPSMEAQRLLHAL-PTGNDSGGD---KLLPKTLSKEIVSEPFAKNQLREKARLTNIKGL---EIPRVSYTLDD 171 (387) T ss_pred HhhhhHHHHHHHHHHHHhhh-ccCCCCCCc---eeechhHHHHHHHHHHhhchhhhhceeeecCCc---eeeeeeccCCc Confidence 1000000 0000 001110111 356799999999999988999999888776543 33432 3344 Q ss_pred eeeeeCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|NC_015719. 
70 AAYLQPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLAD 149 (344) Q Consensus 70 ~~~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~ 149 (344) +.-...|...+.. +++.+++++...+... -+.|.+-=-..+.+|+.+.+.++.++++++..++.++..- T Consensus 172 a~~v~Eg~~~~~~--~~~f~~v~l~~~k~~~-~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g-------- 240 (387) T protein:vir:26 172 DDFITDVETAKEL--KAKGDTVKFTTNKFKV-FAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVS-------- 240 (387) T ss_pred ccccccccccccc--ccccceeeechheeee-echhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcC-------- Confidence 5555666666543 3556666666544422 2445431122356889999999999999987666554211 Q ss_pred ccccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccc Q lcl|NC_015719. 150 GVNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAAL 229 (344) Q Consensus 150 ~~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~ 229 (344) ...+.+.+.....+... . .+...++.|+++...|+.+..+ ...|+ +++..|..|+.-.+-. + T Consensus 241 ----~g~g~~~g~~~~~~~~~--~----~~~~~~d~i~~~~~~l~~~y~~-na~~i-mn~~t~~~~~~~~~~~------~ 302 (387) T protein:vir:26 241 ----PKSGLEHMSFYNGSVKE--V----EGADMYDAIINALADLHEDYRD-NATIY-MRYADYVKIISVLSNG------T 302 (387) T ss_pred ----CCccccceeeecccccc--c----cccchHHHHHHHHhccChhhhc-CCEEE-EechHHHHHHHHHhcC------C Confidence 01111111111111111 1 1122467788887777776554 34565 5555555444321111 1 Q ss_pred cccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeee Q lcl|NC_015719. 230 IDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERA 309 (344) Q Consensus 230 ~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~ 309 (344) ..+..|.-.+++|.+|+.++..+..- -|+|. . +..-.. 
....+.+ T Consensus 303 ~~~~~~~~~~llG~PV~~~~~~~~~~----------------------~GDf~----------~-~~~~~~--~~~~~~~ 347 (387) T protein:vir:26 303 TNFFDTPAEKVFGKPVVFTDAAVKPI----------------------VGDFN----------Y-FGINYD--GTTYDTD 347 (387) T ss_pred CcccccCCccccccceEEecCCCcee----------------------eechh----------h-hhhhhh--hhhheec Confidence 22333444689999999987654211 01111 0 000011 1123444 Q ss_pred ecchhhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 310 RRAEYQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 310 ~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ++...---.+++..++++++++|++.+.+..++.+ T Consensus 348 ~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~~ 382 (387) T protein:vir:26 348 KDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENT 382 (387) T ss_pred ccccCCceEEEEEEEeCcEeechhheEEEEeecCC Confidence 44332233466677899999999999999998877 No 154 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=98.91 E-value=6.5e-11 Score=76.43 Aligned_cols=273 Identities=13% Similarity=0.056 Sum_probs=145.1 Q ss_pred CCCcccc---------ccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec--Ccce Q lcl|NC_015719. 1 MANMQGG---------QQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI--GRTK 69 (344) Q Consensus 1 ma~~~~~---------~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i--G~~t 69 (344) |...... .+.. ..|....+| .+.++.|..++.+..+..+.++++.++.++.+. ++|++ +..+ T Consensus 99 ~~~~~~~~~~~~~~~~~~a~-~~~~~~~gG---~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~---~~p~~~~~~~~ 171 (387) T protein:vir:96 99 ILPNEFEKPSMEAQRLLHAL-PTGNDSGGD---KLLPKTLSKEIVSEPFAKNQLREKARLTNIKGL---EIPRVSYTLDD 171 (387) T ss_pred HhhhhHHHHHHHHHHHHhhh-ccCCCCCCc---eeechhHHHHHHHHHHhhchhhhhceeeecCCc---eeeeeeccCCc Confidence 1000000 0000 001110111 356799999999999988999999888776543 33432 3344 Q ss_pred eeeeeCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|NC_015719. 
70 AAYLQPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLAD 149 (344) Q Consensus 70 ~~~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~ 149 (344) +.-...|...+.. +++.+++++...+... -+.|.+-=-..+.+|+.+.+.++.++++++..++.++..- T Consensus 172 a~~v~Eg~~~~~~--~~~f~~v~l~~~k~~~-~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g-------- 240 (387) T protein:vir:96 172 DDFITDVETAKEL--KAKGDTVKFTTNKFKV-FAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVS-------- 240 (387) T ss_pred ccccccccccccc--ccccceeeechheeee-echhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcC-------- Confidence 5555666666543 3556666666544422 2445431122356889999999999999987666554211 Q ss_pred ccccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccc Q lcl|NC_015719. 150 GVNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAAL 229 (344) Q Consensus 150 ~~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~ 229 (344) ...+.+.+.....+... . .+...++.|+++...|+.+..+ ...|+ +++..|..|+.-.+-. + T Consensus 241 ----~g~g~~~g~~~~~~~~~--~----~~~~~~d~i~~~~~~l~~~y~~-na~~i-mn~~t~~~~~~~~~~~------~ 302 (387) T protein:vir:96 241 ----PKSGLEHMSFYNGSVKE--V----EGADMYDAIINALADLHEDYRD-NATIY-MRYADYVKIISVLSNG------T 302 (387) T ss_pred ----CCccccceeeecccccc--c----cccchHHHHHHHHhccChhhhc-CCEEE-EechHHHHHHHHHhcC------C Confidence 01111111111111111 1 1122467788887777776554 34565 5555555444321111 1 Q ss_pred cccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeee Q lcl|NC_015719. 230 IDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERA 309 (344) Q Consensus 230 ~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~ 309 (344) ..+..|.-.+++|.+|+.++..+..- -|+|. . +..-.. 
....+.+ T Consensus 303 ~~~~~~~~~~llG~PV~~~~~~~~~~----------------------~GDf~----------~-~~~~~~--~~~~~~~ 347 (387) T protein:vir:96 303 TNFFDTPAEKVFGKPVVFTDAAVKPI----------------------VGDFN----------Y-FGINYD--GTTYDTD 347 (387) T ss_pred CcccccCCccccccceEEecCCCcee----------------------eechh----------h-hhhhhh--hhhheec Confidence 22333444689999999987654211 01111 0 000011 1123444 Q ss_pred ecchhhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 310 RRAEYQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 310 ~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ++...---.+++..++++++++|++.+.+..++.+ T Consensus 348 ~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~~ 382 (387) T protein:vir:96 348 KDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENT 382 (387) T ss_pred ccccCCceEEEEEEEeCcEeechhheEEEEeecCC Confidence 44332233466677899999999999999998877 No 155 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=98.91 E-value=6.5e-11 Score=76.43 Aligned_cols=273 Identities=13% Similarity=0.056 Sum_probs=145.1 Q ss_pred CCCcccc---------ccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec--Ccce Q lcl|NC_015719. 1 MANMQGG---------QQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI--GRTK 69 (344) Q Consensus 1 ma~~~~~---------~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i--G~~t 69 (344) |...... .+.. ..|....+| .+.++.|..++.+..+..+.++++.++.++.+. ++|++ +..+ T Consensus 99 ~~~~~~~~~~~~~~~~~~a~-~~~~~~~gG---~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~---~~p~~~~~~~~ 171 (387) T protein:vir:94 99 ILPNEFEKPSMEAQRLLHAL-PTGNDSGGD---KLLPKTLSKEIVSEPFAKNQLREKARLTNIKGL---EIPRVSYTLDD 171 (387) T ss_pred HhhhhHHHHHHHHHHHHhhh-ccCCCCCCc---eeechhHHHHHHHHHHhhchhhhhceeeecCCc---eeeeeeccCCc Confidence 1000000 0000 001110111 356799999999999988999999888776543 33432 3344 Q ss_pred eeeeeCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|NC_015719. 
70 AAYLQPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLAD 149 (344) Q Consensus 70 ~~~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~ 149 (344) +.-...|...+.. +++.+++++...+... -+.|.+-=-..+.+|+.+.+.++.++++++..++.++..- T Consensus 172 a~~v~Eg~~~~~~--~~~f~~v~l~~~k~~~-~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g-------- 240 (387) T protein:vir:94 172 DDFITDVETAKEL--KAKGDTVKFTTNKFKV-FAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVS-------- 240 (387) T ss_pred ccccccccccccc--ccccceeeechheeee-echhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcC-------- Confidence 5555666666543 3556666666544422 2445431122356889999999999999987666554211 Q ss_pred ccccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccc Q lcl|NC_015719. 150 GVNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAAL 229 (344) Q Consensus 150 ~~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~ 229 (344) ...+.+.+.....+... . .+...++.|+++...|+.+..+ ...|+ +++..|..|+.-.+-. + T Consensus 241 ----~g~g~~~g~~~~~~~~~--~----~~~~~~d~i~~~~~~l~~~y~~-na~~i-mn~~t~~~~~~~~~~~------~ 302 (387) T protein:vir:94 241 ----PKSGLEHMSFYNGSVKE--V----EGADMYDAIINALADLHEDYRD-NATIY-MRYADYVKIISVLSNG------T 302 (387) T ss_pred ----CCccccceeeecccccc--c----cccchHHHHHHHHhccChhhhc-CCEEE-EechHHHHHHHHHhcC------C Confidence 01111111111111111 1 1122467788887777776554 34565 5555555444321111 1 Q ss_pred cccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeee Q lcl|NC_015719. 230 IDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERA 309 (344) Q Consensus 230 ~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~ 309 (344) ..+..|.-.+++|.+|+.++..+..- -|+|. . +..-.. 
....+.+ T Consensus 303 ~~~~~~~~~~llG~PV~~~~~~~~~~----------------------~GDf~----------~-~~~~~~--~~~~~~~ 347 (387) T protein:vir:94 303 TNFFDTPAEKVFGKPVVFTDAAVKPI----------------------VGDFN----------Y-FGINYD--GTTYDTD 347 (387) T ss_pred CcccccCCccccccceEEecCCCcee----------------------eechh----------h-hhhhhh--hhhheec Confidence 22333444689999999987654211 01111 0 000011 1123444 Q ss_pred ecchhhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 310 RRAEYQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 310 ~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ++...---.+++..++++++++|++.+.+..++.+ T Consensus 348 ~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~~ 382 (387) T protein:vir:94 348 KDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENT 382 (387) T ss_pred ccccCCceEEEEEEEeCcEeechhheEEEEeecCC Confidence 44332233466677899999999999999998877 No 156 >protein:vir:93696 Length: 364 # NCBI annotation: Bcep22gp55 # Family: family:all:974 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944284;genbank:gi:38640361;genbank:GeneID:2658350 Probab=98.90 E-value=5.9e-10 Score=71.17 Aligned_cols=310 Identities=12% Similarity=0.056 Sum_probs=170.4 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcC-Cce---------eeecc--cccEEEEeecCcc Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTAN-RHM---------QRQIS--SGKSAQFPVIGRT 68 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~-~~~---------~~~i~--~G~tv~i~~iG~~ 68 (344) ||..+ .+. +|+. -.++|+..+...-.+.|-+.+ ++- ..++. .|++|.|.-+... T Consensus 1 Ma~T~--------~~~----~~p~--a~~~ws~~l~~~~~~~s~f~~~l~G~~~~~~I~~~~dL~k~~Gd~v~f~L~~~L 66 (364) T protein:vir:93 1 MSQTV--------IPF----GDPK--AVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDRITFDLSVHL 66 (364) T ss_pred Cceec--------cCc----CCHH--HHHHHHHHHHHHHHhhCccccccccCCCCCcEEEeeecCCCCCceEEeeeeeec Confidence 87633 333 5665 468999999888877765554 322 22333 4999999999888 Q ss_pred eeeeeeCCCCCCCCcCCcccceEEEEeeeeeeeceec-cchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|NC_015719. 
69 KAAYLQPGESLDDKRKDIKHTEKTINIDGLLTADVLI-YDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINL 147 (344) Q Consensus 69 t~~~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~I-dd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~ 147 (344) +-.....++.+.+.-+.++....+|+|||..- ++.. ..+++-.+-+|+|.+--...+.=+++..|+-++.+++++... T Consensus 67 ~g~gv~Gd~~leGnee~L~~~~~~i~idq~r~-~V~~~g~ms~qRt~~dlr~~ar~~L~~w~~~~~d~~~f~~laGarg~ 145 (364) T protein:vir:93 67 RGKPTYGDARVEGKEESLRFYQDEVRIDQVRH-SVSAGGRMSRKRTVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGI 145 (364) T ss_pred ccCCcccCceeeccccceeEEeeEEEEeeccc-cccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Confidence 76667777888888888888999999999863 2222 357778889999999999999999999999999988753211 Q ss_pred cccccccc--cc--------ccCceeeeccccc---ccccchhhHHHHHHHHHHHHHHHhhcCCC--c-----------C Q lcl|NC_015719. 148 ADGVNENI--AG--------LGKPSLLEVGAKA---DLTDPVKLGQAVIAQLTIARAALTKNYVP--A-----------N 201 (344) Q Consensus 148 ~~~~~~~~--~~--------~~~~~~i~~~~~~---~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP--~-----------~ 201 (344) -.+....+ .+ -.....+-.+.++ ..++.. ..-++.|.++...++....+ + + T Consensus 146 ~~~~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~l~stD---~~sl~~id~a~~~a~~~~~~~~~~~~~~Pv~~~g~ 222 (364) T protein:vir:93 146 NLDFIETPDFTGYAGNPLDAPDVDHLLYGGVATSKASLAATD---IMAPLVIEKAVEKAAMMQAENPDVANMVPVSIDGD 222 (364) T ss_pred ccccccccCcccccccccCCCCCCcEEeccccCchhhccccc---cccHHHHHHHHHHHHHhCCCCCCCcccceeEecCc Confidence 10000000 00 0011111111111 111111 11267777888877765431 0 1 Q ss_pred CC-EEEeCHHHHHHHhccc--h---hhhhc---cccccccccceeEEEeCeEEEEecccccccccccccccccccccccc Q lcl|NC_015719. 202 DR-TFYTTPDVYSAILAAL--M---PNAAN---YAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKH 272 (344) Q Consensus 202 gR-~~vv~P~~~~~Ll~~~--~---~~~~~---~~~~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~ 272 (344) +. ++++.|.++..|..+. . +.+.- -+....+-+|.+|++.|+-|++.++++..+..+. +. 
T Consensus 223 ~~yV~~l~p~q~~~Lr~~t~~~w~d~qk~A~~~~g~~nPlF~G~~gm~ngvii~~~~~vi~~~~~~~----~~------- 291 (364) T protein:vir:93 223 DHYVCVMSEYQATDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGA----GA------- 291 (364) T ss_pred ceeEEEEcchhhhhhhhcCCHHHHHHHHHhhhcccccCCceecCeeeEcCeEEeccCCccccccccc----Cc------- Confidence 23 6779999999998543 3 32221 1233458899999999999999998875432111 00 Q ss_pred ccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchhhhhhhhhhhhhcCceecc--ccEEEEEecCCC Q lcl|NC_015719. 273 AFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEYQADQIIAKYAMGHGGLRP--ESAGALVFKAGA 344 (344) Q Consensus 273 ~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~Rp--~~~~~l~~~~~a 344 (344) +.+.-++-+==-.+++++|-+. -+ .+..-.|-.+|-.+.- .|.....+|.+=.|- .=-|+|.+..-| T Consensus 292 ~v~~~ralllGaQA~~~a~g~~---~g-~~~~w~Ee~~D~gn~~-~i~~~~i~G~kK~rF~~~DfGvi~idtaa 360 (364) T protein:vir:93 292 NVEAARALFMGRQAGVIAYGTA---NG-LRFDWEETVKDYGNEP-AIAAGFIAGMKKARFNNKDFGVISIDTAA 360 (364) T ss_pred cccchhhheecceeeEEEeecC---CC-CCceeeecccCCCCch-hhhhhhHhhhhhcccCCccceEEEecccc Confidence 0001000000011223333220 00 1111133333322211 133333344322222 222333333333 No 157 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=98.80 E-value=1e-09 Score=69.89 Aligned_cols=271 Identities=14% Similarity=0.071 Sum_probs=152.2 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcc-eeeeeeCCCCC Q lcl|NC_015719. 
1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRT-KAAYLQPGESL 79 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~-t~~~~~~g~~~ 79 (344) ||.-+.- ..+..+ ..=..+ |++.|+.-+.+-+ .+++..|...+..|+++++|...-+ ...++..|+.| T Consensus 1 mAe~nlt--~~~dL~---~~~sid--fv~~f~~~i~~L~----~~Lgi~r~~p~a~G~tIt~pK~~~tgda~dVaEGe~I 69 (295) T protein:vir:99 1 MAEKNLN--TMADLG---DIKSID--FVNKFSKNINDLL----KLLGVTRRETLTNDLKIQTYKWEVTLDQTDPGEGETI 69 (295) T ss_pred CCCcccc--cHhhcc---Cceeeh--hhHHhhhhHHHHH----HHhccccccccccCCeEEeeeeeeecccccccCCccc Confidence 8884421 112222 122344 8999987665443 2567778888889999999987533 45789999999 Q ss_pred CCCcCCcccc---eEEEEeeeeeeeceeccchHHH-H-h-ChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|NC_015719. 80 DDKRKDIKHT---EKTINIDGLLTADVLIYDIEDA-M-N-HYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNE 153 (344) Q Consensus 80 ~~~~~~~~~~---~~~l~iD~~~~~~~~Idd~D~~-q-~-~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~ 153 (344) |.+. ++.+ ..++++.++.- .+. ||+ | + ..|...+..+++..+|++.+|..++..+..+. T Consensus 70 plsk--vt~~~~~t~t~kikK~rK---~tT--dEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lktat-------- 134 (295) T protein:vir:99 70 PLSK--VTRTKDKDYTVKWFKKRR---ATT--AEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKTKP-------- 134 (295) T ss_pred chhh--heeeeeeeeEEEeeeecc---ccc--HHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhccCc-------- Confidence 8863 4433 35666655433 244 455 4 4 34599999999999999999999987653210 Q ss_pred ccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhh--hhccccccc Q lcl|NC_015719. 154 NIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPN--AANYAALID 231 (344) Q Consensus 154 ~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~--~~~~~~~~~ 231 (344) ... ....-+..++.+..+...+.|.+- ...+++|+|..++.|+.+-... 
.+...|..- T Consensus 135 ----------~t~--------tg~~lq~a~a~~~~al~~f~Ee~~--~~~V~FVnP~D~a~yl~~A~~~~~~a~~fG~~~ 194 (295) T protein:vir:99 135 ----------TKV--------KGVGLQKALSASWAKLATFNEFEG--SPLVSFVSPLDVANYLGDTKVGADASNVFGMTL 194 (295) T ss_pred ----------eee--------ehhhHHHHHHHhhhhhhhcccccC--CceEEEEehHHHHHHHhccccccchhhhhhhhh Confidence 000 011112234444444444444321 1368999999999999876553 222123333 Q ss_pred cccceeEEEeCeE-EEEeccccccccccccccc----cccccc--cccccccccccccccceeEEEecHHHHhhhhhhee Q lcl|NC_015719. 232 PERGSIRNVMGFE-VVEVPHLTAGGAGDDRPEE----GTDASN--QKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDL 304 (344) Q Consensus 232 ~~~G~Vg~i~G~~-V~~sn~lp~~~~~~~~~~~----~~~~~~--~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~ 304 (344) +. ++.|++ |+.|+.+|.+..-.++.-. -.+..+ -...|+- ..+..||+. T Consensus 195 L~-----nfLG~q~II~S~kv~~G~~~aT~~~Ni~~ay~~~~~g~l~~~f~~------~~D~tglIg------------- 250 (295) T protein:vir:99 195 LK-----NFLGMQNVIVMPSVPEGKIYSTAVENLVFASLNVKGGDLGGLFAD------FTDETGLIA------------- 250 (295) T ss_pred hh-----hhhccceEEEcccCCCceEEEeeccceEEEEecCCchhhhhhhhh------ccCcccceE------------- Confidence 33 499997 9999999987543332111 000000 0111111 123333321 Q ss_pred eeeeeecchhhhhhhhhhhhhcC--ceeccccEEEEEecCCC Q lcl|NC_015719. 305 ALERARRAEYQADQIIAKYAMGH--GGLRPESAGALVFKAGA 344 (344) Q Consensus 305 ~~e~~~~~~~~~d~i~~~~~~G~--~v~Rp~~~~~l~~~~~a 344 (344) + ..++....=-+..+...|. 
=+=|+|++++.++++.+ T Consensus 251 -~--~h~~~~~~~t~et~~~~~~~lfpE~~dgiv~~tI~~~~ 289 (295) T protein:vir:99 251 -A--ARNRQLSNLTYESVFFGANVLFAEIPEGVVEATIEAAA 289 (295) T ss_pred -E--EeccccceeeehhhhHhHHHhcccccceEEEEEEecCc Confidence 0 1111111111223333333 24578899999987777 No 158 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=98.77 E-value=1.2e-09 Score=69.44 Aligned_cols=293 Identities=12% Similarity=0.024 Sum_probs=158.1 Q ss_pred CCCccccccccccccccccccchhhhhH-HHHh-hHHHHHHHHhhhhcC-CceeeecccccEEEE----eecCcceeeee Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFL-KVFG-GEVLTAFARTSVTAN-RHMQRQISSGKSAQF----PVIGRTKAAYL 73 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~-e~f~-geV~~~f~~~s~~~~-~~~~~~i~~G~tv~i----~~iG~~t~~~~ 73 (344) |.+-++--.+. .++ .=.++.|.- ..|- ..|.+.- +...+.+ +.+.-.-+++-+|++ +........+. T Consensus 1 ~~~~~~i~s~~----~~~-~itv~~ll~~P~~I~~~i~e~~-~~~~iad~lf~~~~a~~~~~v~f~~~~p~~~~~d~e~V 74 (318) T protein:vir:10 1 MTAPTGIVSVS----DGP-AITVRELVGNPLWIPTALKKMM-VNQFISESLFRNGGANPNGVVAYNEGNPSFLEDDVADV 74 (318) T ss_pred CCCCCcceeee----cCC-ceehHHhhCCchhHHHHHHHHH-hccchhhhhhhcccccccceeEEEecccccccCcHhhc Confidence 76654422221 111 111111110 1121 2222222 2222333 223223445667887 44555567788 Q ss_pred eCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|NC_015719. 74 QPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNE 153 (344) Q Consensus 74 ~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~ 153 (344) .+|.+++... 
..+.+..+-.-+..--.+.|.|--..+...|.++...++++.+++++.|+.++..|..+.-...+ T Consensus 75 aEggEiP~~~--~~~G~~~ia~~~K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~~~--- 149 (318) T protein:vir:10 75 AEFGEIPVSA--GARGLPRTAFAVKKALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPTLA--- 149 (318) T ss_pred cCcccccccC--CCCCchhhhhhehhccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc--- Confidence 8899988653 34444444222233456889887777789999999999999999999999998766432111111 Q ss_pred ccccccCceeeecccccccccchhhHHHHHHHHHHH---HHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcccccc Q lcl|NC_015719. 154 NIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIA---RAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALI 230 (344) Q Consensus 154 ~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a---~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~ 230 (344) ...++..+.... .+..++....+.....+..+ .+.+.--..| -.+|+.|..|..|++++.+... |.+.. T Consensus 150 ~s~~w~~~~~~~----~d~~~A~e~v~~a~~~~~~a~~~~~~~~~GY~p---dtIVlhP~~~~~l~~n~~~~~~-y~~~a 221 (318) T protein:vir:10 150 VPTAWDNGGKVR----TDIAIAIEQISTAAPTAYPAGVGSSDEYFGFIP---DTIVMHYALLPILMDNENFMKV-YERNA 221 (318) T ss_pred CCcCCCCccccc----ccchhhhhhhhhhhhhhhhhhhhhhhhccCccc---eeeEECHHHHHHHhcchhhhhh-hhccc Confidence 011111111000 01111110000000111110 0011001122 3799999999999999876442 21111 Q ss_pred ------cccccee-EEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhh-hhh Q lcl|NC_015719. 231 ------DPERGSI-RNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTV-KLK 302 (344) Q Consensus 231 ------~~~~G~V-g~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~-~~~ 302 (344) .-..|.+ ++++|++|+.|+++|.+. ++++++..+|+- -.. 
T Consensus 222 ~~~~~~~~~tg~~~g~~lGl~vi~s~~~p~~~--------------------------------alvlq~g~vG~~~d~~ 269 (318) T protein:vir:10 222 NYVSTAPDWTGNFPGSVMGLNVIRSRTFPIDR--------------------------------VLIMERGTVGFYSDTR 269 (318) T ss_pred hhhhhcccccccccceeeceEEeecCccCCCe--------------------------------eEEEecCCcceeeccc Confidence 1124544 678999999999999532 133444444433 245 Q ss_pred eeeeeeeecc-------hhhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 303 DLALERARRA-------EYQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 303 ~~~~e~~~~~-------~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +++++.+|.+ ...+|.++.++.....|.+|.++.-|+==.+- T Consensus 270 pl~~t~~~~egg~~~g~~~~s~~~~~~~~~~~~V~~PkA~~~itgi~~~ 318 (318) T protein:vir:10 270 PLQFTALYPEGNGPNGGPTESYRADASHKRALAVDQPKAALWLTGIVTP 318 (318) T ss_pred cceeeecccCCCCCCCCcchhhheehheeeeeeeeCcceeEEEeeccCC Confidence 5788888876 77789999999999999999998877633333 No 159 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=98.77 E-value=2.4e-09 Score=67.80 Aligned_cols=317 Identities=10% Similarity=0.052 Sum_probs=163.5 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeec--ccccEEEEeecCcceeeeee---C Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQI--SSGKSAQFPVIGRTKAAYLQ---P 75 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i--~~G~tv~i~~iG~~t~~~~~---~ 75 (344) |-|-+-..+......+ ++....+...=|-.+++..-.+.-++..+-..+++ .+|+|+++.+--.-. .+.+ . 
T Consensus 1 ~~~~~a~~~~~~~s~~---g~~~~~~~t~y~~~k~L~~Aa~~lv~~~fA~~~piPkn~GkTIk~r~y~pl~-~~~~pl~e 76 (401) T protein:vir:95 1 MLNYNAPTDGQKSSID---GANSDQMQTFFWLKKAIITARKEQYFMPLASVTNMPKHYGKTIKVYEYVPLL-DDRNINDQ 76 (401) T ss_pred CCccCCCccccccccc---ccccceeeehhhHHHHHhhhhhhhhhhhcccccccccccCCeEEEEeccccc-ccccchhc Confidence 6665533321111111 11122234444555555555555666777777766 369999988764321 1122 2 Q ss_pred CCCCCCC----------cC----------------------CcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHH Q lcl|NC_015719. 76 GESLDDK----------RK----------------------DIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTS 123 (344) Q Consensus 76 g~~~~~~----------~~----------------------~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~ 123 (344) |.+..+. .. ..+-.++...|-|+-.|..+=|.++..-....+...++. T Consensus 77 Gv~a~G~~~~~g~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~~D~~l~~h~s~ 156 (401) T protein:vir:95 77 GIDASGATIVNGNLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFDSDDGLMEHLSR 156 (401) T ss_pred CCCcccccccCccccccccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhhcchHHHHHHHH Confidence 3322221 00 111122334455665555444444444444446655544 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhcccccccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCc--- Q lcl|NC_015719. 124 QIGESLAMAADGAVLAELAGLINLADGVNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPA--- 200 (344) Q Consensus 124 ~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~--- 200 (344) |.-..=+...-..+.+++..++ ....+ + |......+.+..+. 
+.....++.|.++...|+++..|+ T Consensus 157 ell~g~~~~t~d~i~~dll~ag--~~viy---A----g~ats~At~~~~~~--~~t~vt~~~l~rl~~~L~~nRapk~t~ 225 (401) T protein:vir:95 157 ELMNGATQITEAVLQKDLLAAA--GTVLY---A----GAATSDATITGEGS--TPSVVSYKNLMRLDQILTENRTPTQTT 225 (401) T ss_pred HHhhhhhhhHHHHHHHHHHhhc--Ceeec---C----Cccceeeecccccc--ccceechhHHHHHHHHHHhcccccchh Confidence 4433333322223334443111 00000 0 00000111111111 111223788999999999977776 Q ss_pred -------C-------CCEEEeCH------HHHHHHhccchhhhh-ccccccccccceeEEEeCeEEEEeccccccccccc Q lcl|NC_015719. 201 -------N-------DRTFYTTP------DVYSAILAALMPNAA-NYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDD 259 (344) Q Consensus 201 -------~-------gR~~vv~P------~~~~~Ll~~~~~~~~-~~~~~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~ 259 (344) - -|++++.| ....+|+.++.|+.. .|+..+...+|.||++.+|++++++.+--+...+- T Consensus 226 ~i~~s~~~dTk~i~~s~va~~h~~L~~di~a~~D~~~~~~fi~v~kYa~~~~i~~gEiG~i~~vR~i~~p~~~~w~~ag~ 305 (401) T protein:vir:95 226 IITGSRMIDTKVIGATRVMYVGSELVPELKAMKDLFGNKAFIETQHYADAGTIMNGEVGSIDKFRIIQVPEMLHWAGAGA 305 (401) T ss_pred hhhhhhccCccccccceEEEEecCchhHHHHHHHhcCCCCceehhhcCCccccccccccccCceeEEecccceeecCCcc Confidence 1 26788777 445777888999875 58888899999999999999999998765443332 Q ss_pred ccccc--------ccccccccccccccccccccceeEEEecHHHHhhhhhheee----ee-----------eeecchhhh Q lcl|NC_015719. 260 RPEEG--------TDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLA----LE-----------RARRAEYQA 316 (344) Q Consensus 260 ~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~----~e-----------~~~~~~~~~ 316 (344) ..... +.++++...|+ -|++-+.|-+++..+--- ++ ..-||.-|. T Consensus 306 ~a~~~~~~y~~~~~~~gg~~dVyp------------~lV~G~dAf~~~~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQ~ 373 (401) T protein:vir:95 306 QATGANPGYRTSMVSGQEHYDVYP------------MLVVGDDSFTSIGFQTDGKSLKFTVMTKMPGKETADRNDPYGET 373 (401) T ss_pred cccccccccccccccCCCcceeee------------eeEEccccceecccccCCccccceeEeecCCcCCCCCCCcccce Confidence 11100 01111111111 244555554443322110 01 113555555 Q ss_pred hhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
317 DQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 317 d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) =.+.=++.|++.++||+..+.|+..+.- T Consensus 374 g~vgwK~~~a~~vL~~e~m~~ies~a~~ 401 (401) T protein:vir:95 374 GFSSIKWYYGILVKRPERLALIKTVAPL 401 (401) T ss_pred ehhhhhhhhhhheeccceeEEEEeecCC Confidence 5666678899999999999999988877 No 160 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=98.76 E-value=3.9e-09 Score=66.66 Aligned_cols=279 Identities=10% Similarity=0.046 Sum_probs=156.1 Q ss_pred CCCccccccccccccccc-cccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcc--eeeeeeCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQ-SAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRT--KAAYLQPGE 77 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~-~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~--t~~~~~~g~ 77 (344) |-.-.+..-.+.-...-. ..=++| |+++|+.-+.+-++ +++.+|...+..|++++++.-... ..++...|+ T Consensus 1 ~~~~~~~~e~nlt~~~dl~~~~siD--f~~~f~~~i~~L~~----~LGv~r~~pla~GstIkt~k~~~y~gda~dVaEGe 74 (296) T protein:vir:98 1 MVTSRTYPEENLIKSTDLKYPITID--VTNKFQENISKLLE----MLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGE 74 (296) T ss_pred CCCccccCcCCCcchhhhhhhhhhh--hHHHHhhhHHHHHH----HhhhcccccccCCCEEeeccceeeeeccccccCCc Confidence 544333222111111111 122455 88889887765543 567778888888999977543222 346788899 Q ss_pred CCCCCcCCcccc---eEEEEeeeeeeeceeccchHHH-H-h-ChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|NC_015719. 78 SLDDKRKDIKHT---EKTINIDGLLTADVLIYDIEDA-M-N-HYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGV 151 (344) Q Consensus 78 ~~~~~~~~~~~~---~~~l~iD~~~~~~~~Idd~D~~-q-~-~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~ 151 (344) .||.+. ++.+ ..+++|.++.-. +. ||+ | + ..|...+..++...++++.+|..++..+..+. 
T Consensus 75 ~Iplsk--vt~~~~~t~t~~ikK~rK~---tT--dEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~LktaT------ 141 (296) T protein:vir:98 75 VIPLSK--VERKIHSEKKIELKKYRKA---TT--GEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGT------ 141 (296) T ss_pred ccchhh--heeeecceEEEEeeccccc---cC--HHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHHhccc------ Confidence 998763 4433 366777554433 43 555 5 4 34599999999999999999999987664211 Q ss_pred ccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccc Q lcl|NC_015719. 152 NENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALID 231 (344) Q Consensus 152 ~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~ 231 (344) ++. . ......-.++...+.++..+|.+.+ ....+++|+|.-.+.+|.+..+.....-|..- T Consensus 142 ---------~t~-------~-~t~~~lQ~Ala~~~~~l~~~feded--~~~~V~FVnP~D~a~ylg~a~it~qt~fG~ty 202 (296) T protein:vir:98 142 ---------GTQ-------D-ALGAGLQGALASAWGKLQVLFEDYG--SERAIVFANSLDVAEYIAKAGITTQTAFGLTY 202 (296) T ss_pred ---------cee-------e-echhhHHHHHHHHhhhhhhhccccC--CCceEEEEehHHHHHHhcCCccchhheechhh Confidence 000 0 0112333455566777777787764 23478999999999999888765433222222 Q ss_pred cccceeEEEeCeEEEEeccccccccccccccc----ccccc--ccccccccccccccccceeEEEecHHHHhhhhhheee Q lcl|NC_015719. 232 PERGSIRNVMGFEVVEVPHLTAGGAGDDRPEE----GTDAS--NQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLA 305 (344) Q Consensus 232 ~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~----~~~~~--~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~ 305 (344) +. ++.|+.|+.|+.+|.+..-..+.-. ..+.. --+..|+-+ .+..||+. T Consensus 203 l~-----nfLG~~II~S~kV~~G~~~~T~~~Ni~~ay~~~~~~~l~~~f~~~------~d~tglIG-------------- 257 (296) T protein:vir:98 203 LV-----DFTGTVIISTNDVTKGEIWATVPENIIFAYINPNNSELAKEFNLY------GDPTGYIG-------------- 257 (296) T ss_pred hh-----hccccEEEEcCcCCCceEEEeeecceEEEeecccccchhhhhccc------cccccceE-------------- Confidence 22 4888999999999987543332211 11100 001112111 12333321 Q ss_pred eeeeecchhhhhhhhhhhhhcC--ceeccccEEEEEecCCC Q lcl|NC_015719. 
306 LERARRAEYQADQIIAKYAMGH--GGLRPESAGALVFKAGA 344 (344) Q Consensus 306 ~e~~~~~~~~~d~i~~~~~~G~--~v~Rp~~~~~l~~~~~a 344 (344) + ..++....=-+..+...|. =+=|+|++++.+++++- T Consensus 258 v--~h~~~~~~~t~eT~~~~~~~lfpE~~dgiv~~tI~~~~ 296 (296) T protein:vir:98 258 M--NHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) T ss_pred E--EeccccceeeehhHhHhHHHhcccccceEEEEEecCCC Confidence 0 0111111111222333332 24578888888886666 No 161 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=98.67 E-value=5.8e-09 Score=65.75 Aligned_cols=284 Identities=15% Similarity=0.069 Sum_probs=148.7 Q ss_pred CCCcccccccccc--------ccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCc-ceee Q lcl|NC_015719. 1 MANMQGGQQLGTN--------QGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGR-TKAA 71 (344) Q Consensus 1 ma~~~~~~~~~~~--------~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~-~t~~ 71 (344) +++ ...+..|. --+....++--.|..+.+..++.+...+.|.++.++++.++. |. ++|+.... .++. T Consensus 59 ~~~--~~~~~lt~ee~~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~~-~~i~~~~~~~~a~ 134 (377) T protein:vir:96 59 DLR--DKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS-LR-LKALTAETSGTAV 134 (377) T ss_pred Hhc--cCCcccCHHHHHHHHHHHhcCCCCCCceecCHHHHHHHHHHHHhhhhhhhhceeEecC-Cc-eEEEEecCCccee Confidence 111 00000000 000111222233677999999999999999999999988864 43 55665433 3444 Q ss_pred eeeCCCCCCCCcCCcccceEEEEeeeeeeec-eeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|NC_015719. 72 YLQPGESLDDKRKDIKHTEKTINIDGLLTAD-VLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADG 150 (344) Q Consensus 72 ~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~-~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~ 150 (344) -...+..+..+. +.+..+++|.. .++.. ..|..-=-..+.+|+-+.+..+.++++++..|+.++.- ... 
T Consensus 135 wv~e~~~~~~~~-~~~f~~i~l~~--~kl~~~~~is~~ll~ds~~~le~~i~~~l~~~~~~~~~~a~i~G----~G~--- 204 (377) T protein:vir:96 135 WGDIFGEIKGQL-KQAFKEQDFSQ--FKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKG----NGL--- 204 (377) T ss_pred Eeeccccccccc-CccceeEeeee--eeEEeechhhHHHhhcchhhHHHHHHHHHHHHHHHHHhhceEec----cCC--- Confidence 344444444321 23455555554 44444 34543222236778999999999999999999988621 100 Q ss_pred cccccccccC---------------ceeeecc-cccccccchhhHHHHHHHHHHHHHHHhhcCC--C---cCCCEEEeCH Q lcl|NC_015719. 151 VNENIAGLGK---------------PSLLEVG-AKADLTDPVKLGQAVIAQLTIARAALTKNYV--P---ANDRTFYTTP 209 (344) Q Consensus 151 ~~~~~~~~~~---------------~~~i~~~-~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~V--P---~~gR~~vv~P 209 (344) ..|.|.-. +...... ..+..+. .....+++.+..+...+....- | ..+-+++++| T Consensus 205 --~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~ 280 (377) T protein:vir:96 205 --LQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSD--LDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNP 280 (377) T ss_pred --Ccceeeeeccccccccccccccccceeecccccccccc--CChhHHHHHHHHHHHhhccccccccccccCceEEEEch Confidence 11111100 0000000 0000000 1123344444455444543321 2 1223577888 Q ss_pred HHHHHHhccchhhhhccccccccccceeEEEeC--eEEEEecccccccccccccccccccccccccccccccccccccee Q lcl|NC_015719. 210 DVYSAILAALMPNAANYAALIDPERGSIRNVMG--FEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVV 287 (344) Q Consensus 210 ~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg~i~G--~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 287 (344) ..|..++......+ .+|.-.++.| ..|++|+.+|.+.+. -+++ .. T Consensus 281 ~t~~~~~~~~~~~~---------~~G~~~~~l~~p~~v~~s~~~p~~~i~--------------------fgdf--~~-- 327 (377) T protein:vir:96 281 EDRWTLEAKFTSRN---------QFGEYVTVLPHGITILESLAVETGKAI--------------------AFVA--NR-- 327 (377) T ss_pred hhHHhccccccccC---------CCCCceeccCCCceEEecCCCCcccEE--------------------EEEc--Cc-- Confidence 88776643211111 2444445654 457888888753210 0111 11 Q ss_pred EEEecHHHHhhhhhheeeeeeeecchhhh--hhhhhhhhhcCceeccccEEEEEecCC Q lcl|NC_015719. 
288 GLFQHRSAVGTVKLKDLALERARRAEYQA--DQIIAKYAMGHGGLRPESAGALVFKAG 343 (344) Q Consensus 288 gl~~~~~Av~~~~~~~~~~e~~~~~~~~~--d~i~~~~~~G~~v~Rp~~~~~l~~~~~ 343 (344) ...+....++++.+.+..... ..+++.+++++++++|++.++|.++.| T Consensus 328 --------Y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dG~~~d~~a~~vl~l~~~ 377 (377) T protein:vir:96 328 --------YDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred --------EEEEEecccEEEeehhhhhhcCCeEEEEEEEEcCEEecCCcEEEEEEecC Confidence 112334556676665432222 458899999999999999999999999 No 162 >protein:vir:10123 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859253;genbank:gi:32171009;genbank:GeneID:2653345 Probab=98.66 E-value=1.2e-08 Score=63.99 Aligned_cols=338 Identities=11% Similarity=0.073 Sum_probs=171.6 Q ss_pred CCCcccccccccccccccc-ccchhhhhHHHHhhHHHHHHHHhhhh---------cCCceeeecc--cccEEEEeecCcc Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQS-AADKLALFLKVFGGEVLTAFARTSVT---------ANRHMQRQIS--SGKSAQFPVIGRT 68 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~-~~d~~~l~~e~f~geV~~~f~~~s~~---------~~~~~~~~i~--~G~tv~i~~iG~~ 68 (344) |..+..++ .......+.- .-..+.-++++|++.+...=+..+-+ ++.++..++. .|++|.|.-+... T Consensus 1 ~~~~~~~~-a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L 79 (404) T protein:vir:10 1 MTTVTSAQ-ANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKL 79 (404) T ss_pred CCCcCCcc-hhhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeec Confidence 66655432 1111111100 00000114667776643322221111 2333344443 5999999999888 Q ss_pred eeeeeeCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc Q lcl|NC_015719. 
69 KAAYLQPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLA 148 (344) Q Consensus 69 t~~~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~ 148 (344) +-.....++.+.+.-+.++....+|.||+..-.-..=..+++-.+-+|+|++.-...+.-+++..||.+|.+|++..... T Consensus 80 ~g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~ 159 (404) T protein:vir:10 80 SKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDF 159 (404) T ss_pred ccCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc Confidence 76667777888888888999999999999863311113566777889999999999999999999999999998544210 Q ss_pred c------cc-----cc----c-cccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcC-------C--- Q lcl|NC_015719. 149 D------GV-----NE----N-IAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPAN-------D--- 202 (344) Q Consensus 149 ~------~~-----~~----~-~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~-------g--- 202 (344) . |. .. + +..-.....+-.+.++....-...-..-++.|.++.+.+++..-|-. . T Consensus 160 ~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~ 239 (404) T protein:vir:10 160 VADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHG 239 (404) T ss_pred ccccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccC Confidence 0 00 00 0 00000011111111111110000011125667788888877544421 2 Q ss_pred ----CEEEeCHHHHHHHhccch------hhh-hcc---ccccccccceeEEEeCeEEEEecccccc--cccccc-ccccc Q lcl|NC_015719. 203 ----RTFYTTPDVYSAILAALM------PNA-ANY---AALIDPERGSIRNVMGFEVVEVPHLTAG--GAGDDR-PEEGT 265 (344) Q Consensus 203 ----R~~vv~P~~~~~Ll~~~~------~~~-~~~---~~~~~~~~G~Vg~i~G~~V~~sn~lp~~--~~~~~~-~~~~~ 265 (344) ++++++|.+|..|..++. +.. +.. +....+-.|.+|++.|+-|++-++.|.- .+.... ..... 
T Consensus 240 ~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~ 319 (404) T protein:vir:10 240 EDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNL 319 (404) T ss_pred ccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCcc Confidence 567899999999999853 222 111 2345688999999999999998877631 111100 00000 Q ss_pred ccccccc--ccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchhhhhhhhhhhhhcCceec-c------ccEE Q lcl|NC_015719. 266 DASNQKH--AFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEYQADQIIAKYAMGHGGLR-P------ESAG 336 (344) Q Consensus 266 ~~~~~~~--~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~R-p------~~~~ 336 (344) ...+... .++.-++-.==-.+++++|-+.. + ...--.|-.+|-.+.- .|.....+|.+=.| | .-=| T Consensus 320 ~a~~~~~aa~~~v~RallLGaQAl~~A~g~~~---g-~~~~w~Ee~~D~g~~~-~i~~~~i~G~kK~rF~~~~g~~~DfG 394 (404) T protein:vir:10 320 TATTKEVAAATNIDRAMLLGAQALANAYGQKA---G-GHFNMVEKKTDMDNRT-EIAISWINGLKKIRFPEKSGKMQDHG 394 (404) T ss_pred ccccccccccccchhheeecceeEEEEeeccC---C-CCceeEeeccccCchh-hhhhHHHhhhhhccccCCCCceeeEE Confidence 0000000 00000100000122333443210 0 1111233333322222 24455556765555 4 2456 Q ss_pred EEEecCCC Q lcl|NC_015719. 337 ALVFKAGA 344 (344) Q Consensus 337 ~l~~~~~a 344 (344) +|.+..-| T Consensus 395 vi~idta~ 402 (404) T protein:vir:10 395 VIAVDTAV 402 (404) T ss_pred EEEecccc Confidence 66666555 No 163 >protein:vir:819 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050552;genbank:gi:9633449;genbank:GeneID:1262254 Probab=98.66 E-value=1.2e-08 Score=63.99 Aligned_cols=338 Identities=11% Similarity=0.073 Sum_probs=171.6 Q ss_pred CCCcccccccccccccccc-ccchhhhhHHHHhhHHHHHHHHhhhh---------cCCceeeecc--cccEEEEeecCcc Q lcl|NC_015719. 
1 MANMQGGQQLGTNQGKGQS-AADKLALFLKVFGGEVLTAFARTSVT---------ANRHMQRQIS--SGKSAQFPVIGRT 68 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~-~~d~~~l~~e~f~geV~~~f~~~s~~---------~~~~~~~~i~--~G~tv~i~~iG~~ 68 (344) |..+..++ .......+.- .-..+.-++++|++.+...=+..+-+ ++.++..++. .|++|.|.-+... T Consensus 1 ~~~~~~~~-a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L 79 (404) T protein:vir:81 1 MTTVTSAQ-ANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKL 79 (404) T ss_pred CCCcCCcc-hhhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeec Confidence 66655432 1111111100 00000114667776643322221111 2333344443 5999999999888 Q ss_pred eeeeeeCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc Q lcl|NC_015719. 69 KAAYLQPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLA 148 (344) Q Consensus 69 t~~~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~ 148 (344) +-.....++.+.+.-+.++....+|.||+..-.-..=..+++-.+-+|+|++.-...+.-+++..||.+|.+|++..... T Consensus 80 ~g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~ 159 (404) T protein:vir:81 80 SKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDF 159 (404) T ss_pred ccCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc Confidence 76667777888888888999999999999863311113566777889999999999999999999999999998544210 Q ss_pred c------cc-----cc----c-cccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcC-------C--- Q lcl|NC_015719. 149 D------GV-----NE----N-IAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPAN-------D--- 202 (344) Q Consensus 149 ~------~~-----~~----~-~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~-------g--- 202 (344) . |. .. + +..-.....+-.+.++....-...-..-++.|.++.+.+++..-|-. . 
T Consensus 160 ~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~ 239 (404) T protein:vir:81 160 VADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHG 239 (404) T ss_pred ccccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccC Confidence 0 00 00 0 00000011111111111110000011125667788888877544421 2 Q ss_pred ----CEEEeCHHHHHHHhccch------hhh-hcc---ccccccccceeEEEeCeEEEEecccccc--cccccc-ccccc Q lcl|NC_015719. 203 ----RTFYTTPDVYSAILAALM------PNA-ANY---AALIDPERGSIRNVMGFEVVEVPHLTAG--GAGDDR-PEEGT 265 (344) Q Consensus 203 ----R~~vv~P~~~~~Ll~~~~------~~~-~~~---~~~~~~~~G~Vg~i~G~~V~~sn~lp~~--~~~~~~-~~~~~ 265 (344) ++++++|.+|..|..++. +.. +.. +....+-.|.+|++.|+-|++-++.|.- .+.... ..... T Consensus 240 ~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~ 319 (404) T protein:vir:81 240 EDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNL 319 (404) T ss_pred ccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCcc Confidence 567899999999999853 222 111 2345688999999999999998877631 111100 00000 Q ss_pred ccccccc--ccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchhhhhhhhhhhhhcCceec-c------ccEE Q lcl|NC_015719. 266 DASNQKH--AFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEYQADQIIAKYAMGHGGLR-P------ESAG 336 (344) Q Consensus 266 ~~~~~~~--~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~R-p------~~~~ 336 (344) ...+... .++.-++-.==-.+++++|-+.. + ...--.|-.+|-.+.- .|.....+|.+=.| | .-=| T Consensus 320 ~a~~~~~aa~~~v~RallLGaQAl~~A~g~~~---g-~~~~w~Ee~~D~g~~~-~i~~~~i~G~kK~rF~~~~g~~~DfG 394 (404) T protein:vir:81 320 TATTKEVAAATNIDRAMLLGAQALANAYGQKA---G-GHFNMVEKKTDMDNRT-EIAISWINGLKKIRFPEKSGKMQDHG 394 (404) T ss_pred ccccccccccccchhheeecceeEEEEeeccC---C-CCceeEeeccccCchh-hhhhHHHhhhhhccccCCCCceeeEE Confidence 0000000 00000100000122333443210 0 1111233333322222 24455556765555 4 2456 Q ss_pred EEEecCCC Q lcl|NC_015719. 
337 ALVFKAGA 344 (344) Q Consensus 337 ~l~~~~~a 344 (344) +|.+..-| T Consensus 395 vi~idta~ 402 (404) T protein:vir:81 395 VIAVDTAV 402 (404) T ss_pred EEEecccc Confidence 66666555 No 164 >protein:vir:104439 Length: 404 # NCBI annotation: putative virion structural protein # Family: family:all:974 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794063;genbank:gi:116222008;genbank:GeneID:4397504 Probab=98.66 E-value=1.2e-08 Score=63.99 Aligned_cols=338 Identities=11% Similarity=0.073 Sum_probs=171.6 Q ss_pred CCCcccccccccccccccc-ccchhhhhHHHHhhHHHHHHHHhhhh---------cCCceeeecc--cccEEEEeecCcc Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQS-AADKLALFLKVFGGEVLTAFARTSVT---------ANRHMQRQIS--SGKSAQFPVIGRT 68 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~-~~d~~~l~~e~f~geV~~~f~~~s~~---------~~~~~~~~i~--~G~tv~i~~iG~~ 68 (344) |..+..++ .......+.- .-..+.-++++|++.+...=+..+-+ ++.++..++. .|++|.|.-+... T Consensus 1 ~~~~~~~~-a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L 79 (404) T protein:vir:10 1 MTTVTSAQ-ANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKL 79 (404) T ss_pred CCCcCCcc-hhhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeec Confidence 66655432 1111111100 00000114667776643322221111 2333344443 5999999999888 Q ss_pred eeeeeeCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc Q lcl|NC_015719. 69 KAAYLQPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLA 148 (344) Q Consensus 69 t~~~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~ 148 (344) +-.....++.+.+.-+.++....+|.||+..-.-..=..+++-.+-+|+|++.-...+.-+++..||.+|.+|++..... 
T Consensus 80 ~g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~ 159 (404) T protein:vir:10 80 SKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDF 159 (404) T ss_pred ccCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc Confidence 76667777888888888999999999999863311113566777889999999999999999999999999998544210 Q ss_pred c------cc-----cc----c-cccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcC-------C--- Q lcl|NC_015719. 149 D------GV-----NE----N-IAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPAN-------D--- 202 (344) Q Consensus 149 ~------~~-----~~----~-~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~-------g--- 202 (344) . |. .. + +..-.....+-.+.++....-...-..-++.|.++.+.+++..-|-. . T Consensus 160 ~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~ 239 (404) T protein:vir:10 160 VADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHG 239 (404) T ss_pred ccccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccC Confidence 0 00 00 0 00000011111111111110000011125667788888877544421 2 Q ss_pred ----CEEEeCHHHHHHHhccch------hhh-hcc---ccccccccceeEEEeCeEEEEecccccc--cccccc-ccccc Q lcl|NC_015719. 203 ----RTFYTTPDVYSAILAALM------PNA-ANY---AALIDPERGSIRNVMGFEVVEVPHLTAG--GAGDDR-PEEGT 265 (344) Q Consensus 203 ----R~~vv~P~~~~~Ll~~~~------~~~-~~~---~~~~~~~~G~Vg~i~G~~V~~sn~lp~~--~~~~~~-~~~~~ 265 (344) ++++++|.+|..|..++. +.. +.. +....+-.|.+|++.|+-|++-++.|.- .+.... ..... 
T Consensus 240 ~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~ 319 (404) T protein:vir:10 240 EDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNL 319 (404) T ss_pred ccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCcc Confidence 567899999999999853 222 111 2345688999999999999998877631 111100 00000 Q ss_pred ccccccc--ccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchhhhhhhhhhhhhcCceec-c------ccEE Q lcl|NC_015719. 266 DASNQKH--AFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEYQADQIIAKYAMGHGGLR-P------ESAG 336 (344) Q Consensus 266 ~~~~~~~--~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~R-p------~~~~ 336 (344) ...+... .++.-++-.==-.+++++|-+.. + ...--.|-.+|-.+.- .|.....+|.+=.| | .-=| T Consensus 320 ~a~~~~~aa~~~v~RallLGaQAl~~A~g~~~---g-~~~~w~Ee~~D~g~~~-~i~~~~i~G~kK~rF~~~~g~~~DfG 394 (404) T protein:vir:10 320 TATTKEVAAATNIDRAMLLGAQALANAYGQKA---G-GHFNMVEKKTDMDNRT-EIAISWINGLKKIRFPEKSGKMQDHG 394 (404) T ss_pred ccccccccccccchhheeecceeEEEEeeccC---C-CCceeEeeccccCchh-hhhhHHHhhhhhccccCCCCceeeEE Confidence 0000000 00000100000122333443210 0 1111233333322222 24455556765555 4 2456 Q ss_pred EEEecCCC Q lcl|NC_015719. 337 ALVFKAGA 344 (344) Q Consensus 337 ~l~~~~~a 344 (344) +|.+..-| T Consensus 395 vi~idta~ 402 (404) T protein:vir:10 395 VIAVDTAV 402 (404) T ss_pred EEEecccc Confidence 66666555 No 165 >protein:vir:3298 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049514;genbank:gi:9632520;genbank:GeneID:1262006 Probab=98.66 E-value=1.2e-08 Score=63.99 Aligned_cols=338 Identities=11% Similarity=0.073 Sum_probs=171.6 Q ss_pred CCCcccccccccccccccc-ccchhhhhHHHHhhHHHHHHHHhhhh---------cCCceeeecc--cccEEEEeecCcc Q lcl|NC_015719. 
1 MANMQGGQQLGTNQGKGQS-AADKLALFLKVFGGEVLTAFARTSVT---------ANRHMQRQIS--SGKSAQFPVIGRT 68 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~-~~d~~~l~~e~f~geV~~~f~~~s~~---------~~~~~~~~i~--~G~tv~i~~iG~~ 68 (344) |..+..++ .......+.- .-..+.-++++|++.+...=+..+-+ ++.++..++. .|++|.|.-+... T Consensus 1 ~~~~~~~~-a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L 79 (404) T protein:vir:32 1 MTTVTSAQ-ANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKL 79 (404) T ss_pred CCCcCCcc-hhhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeec Confidence 66655432 1111111100 00000114667776643322221111 2333344443 5999999999888 Q ss_pred eeeeeeCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc Q lcl|NC_015719. 69 KAAYLQPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLA 148 (344) Q Consensus 69 t~~~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~ 148 (344) +-.....++.+.+.-+.++....+|.||+..-.-..=..+++-.+-+|+|++.-...+.-+++..||.+|.+|++..... T Consensus 80 ~g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~ 159 (404) T protein:vir:32 80 SKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDF 159 (404) T ss_pred ccCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc Confidence 76667777888888888999999999999863311113566777889999999999999999999999999998544210 Q ss_pred c------cc-----cc----c-cccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcC-------C--- Q lcl|NC_015719. 149 D------GV-----NE----N-IAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPAN-------D--- 202 (344) Q Consensus 149 ~------~~-----~~----~-~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~-------g--- 202 (344) . |. .. + +..-.....+-.+.++....-...-..-++.|.++.+.+++..-|-. . 
T Consensus 160 ~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~ 239 (404) T protein:vir:32 160 VADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHG 239 (404) T ss_pred ccccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccC Confidence 0 00 00 0 00000011111111111110000011125667788888877544421 2 Q ss_pred ----CEEEeCHHHHHHHhccch------hhh-hcc---ccccccccceeEEEeCeEEEEecccccc--cccccc-ccccc Q lcl|NC_015719. 203 ----RTFYTTPDVYSAILAALM------PNA-ANY---AALIDPERGSIRNVMGFEVVEVPHLTAG--GAGDDR-PEEGT 265 (344) Q Consensus 203 ----R~~vv~P~~~~~Ll~~~~------~~~-~~~---~~~~~~~~G~Vg~i~G~~V~~sn~lp~~--~~~~~~-~~~~~ 265 (344) ++++++|.+|..|..++. +.. +.. +....+-.|.+|++.|+-|++-++.|.- .+.... ..... T Consensus 240 ~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~ 319 (404) T protein:vir:32 240 EDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNL 319 (404) T ss_pred ccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCcc Confidence 567899999999999853 222 111 2345688999999999999998877631 111100 00000 Q ss_pred ccccccc--ccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchhhhhhhhhhhhhcCceec-c------ccEE Q lcl|NC_015719. 266 DASNQKH--AFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEYQADQIIAKYAMGHGGLR-P------ESAG 336 (344) Q Consensus 266 ~~~~~~~--~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~R-p------~~~~ 336 (344) ...+... .++.-++-.==-.+++++|-+.. + ...--.|-.+|-.+.- .|.....+|.+=.| | .-=| T Consensus 320 ~a~~~~~aa~~~v~RallLGaQAl~~A~g~~~---g-~~~~w~Ee~~D~g~~~-~i~~~~i~G~kK~rF~~~~g~~~DfG 394 (404) T protein:vir:32 320 TATTKEVAAATNIDRAMLLGAQALANAYGQKA---G-GHFNMVEKKTDMDNRT-EIAISWINGLKKIRFPEKSGKMQDHG 394 (404) T ss_pred ccccccccccccchhheeecceeEEEEeeccC---C-CCceeEeeccccCchh-hhhhHHHhhhhhccccCCCCceeeEE Confidence 0000000 00000100000122333443210 0 1111233333322222 24455556765555 4 2456 Q ss_pred EEEecCCC Q lcl|NC_015719. 
337 ALVFKAGA 344 (344) Q Consensus 337 ~l~~~~~a 344 (344) +|.+..-| T Consensus 395 vi~idta~ 402 (404) T protein:vir:32 395 VIAVDTAV 402 (404) T ss_pred EEEecccc Confidence 66666555 No 166 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=98.65 E-value=7.1e-09 Score=65.27 Aligned_cols=299 Identities=14% Similarity=0.101 Sum_probs=163.2 Q ss_pred CC------CccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcce--eee Q lcl|NC_015719. 1 MA------NMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRTK--AAY 72 (344) Q Consensus 1 ma------~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t--~~~ 72 (344) |= ++...=++ +..+. | -|-.++|+ ++.+..++.|.++.+.++.+-.+..+..|+.+|... ... T Consensus 1 ~~~~~~~~~~~k~it~-~d~~g----G---~L~P~~~~-~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g~~~~~~ 71 (314) T protein:vir:41 1 MDFLNKPFQITPKIDV-PDLGK----G---ILAVQRFG-EFVREVRENSAIIKDARVLNALKSYEVDISRISLGVELEPG 71 (314) T ss_pred CchhhhHHHhhccccc-ccCCC----c---eeChHHHH-HHHHHHHhccchhhheeeecccCccceeecccccCcccccc Confidence 32 22221111 12221 1 14557885 666888899999999987543345567898887431 111 Q ss_pred eeC-CCCCCCCcCCcccceEEEEeeeeeeeceeccc-hHHHHh-ChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc- Q lcl|NC_015719. 73 LQP-GESLDDKRKDIKHTEKTINIDGLLTADVLIYD-IEDAMN-HYDVRSEYTSQIGESLAMAADGAVLAELAGLINLA- 148 (344) Q Consensus 73 ~~~-g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd-~D~~q~-~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~- 148 (344) ..- |+.-..+..+++.+...|..-+... 
.+.|.+ +=+..+ ..|+.+.++.+.++++++......+.-= .+..+ T Consensus 72 ~~~~~~~~~~~~~~~tf~~~~l~~~kl~~-~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGd--g~~~s~ 148 (314) T protein:vir:41 72 RNTSGTKVAPTADEVTVSTNTLEMKELVT-KVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHAD--SSLTTG 148 (314) T ss_pred cccccCCccCCcccccccceeeeeEEEEE-eecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccc--cCCcCc Confidence 211 1111112234566677777766554 355643 112222 2489999999999999998877665210 00001 Q ss_pred cccccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcC-CCEEEeCHHHHHHHhccchhhhhccc Q lcl|NC_015719. 149 DGVNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPAN-DRTFYTTPDVYSAILAALMPNAANYA 227 (344) Q Consensus 149 ~~~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~-gR~~vv~P~~~~~Ll~~~~~~~~~~~ 227 (344) .+-...+.|+-......+...+.. .+....+.|.++...|....--.. .-.++++++.+..+.+-.. .+..+. T Consensus 149 ~~~~~~p~G~l~~a~~~~~~~~~~-----~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~l~-~~~~~l 222 (314) T protein:vir:41 149 RELYRINDGWMKLAGNQYTDAEPE-----DENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQLL-VRETGL 222 (314) T ss_pred ccchhcchhhhhhcccceeecCcc-----ccccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHHHh-ccCCcc Confidence 110112233321111000000000 112234455566666655432111 2245679988877654110 122334 Q ss_pred cccccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeee Q lcl|NC_015719. 228 ALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALE 307 (344) Q Consensus 228 ~~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e 307 (344) ++..+..|.-..+.|++|+.++.+|..+... 
.+.++-+++-+..+-...++.| T Consensus 223 ~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~---------------------------~~i~fgd~~nlv~~~~~~ir~~ 275 (314) T protein:vir:41 223 GDSALIGATGLQYDGIPIQYVPALDALGDDK---------------------------ARALLTVPTNLVYGFWRNIRIE 275 (314) T ss_pred cchhhhCCCCceecceeeEecccccccCCCC---------------------------ceEEEechhheEEEeeceeEEe Confidence 5556667777889999999999997432111 1123334444434556667888 Q ss_pred eeecchhhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 308 RARRAEYQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 308 ~~~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) .+|+.+.-...+...+++++.+..+++++...+...+ T Consensus 276 ~~~~a~~~~~~~~~~~r~d~~~~~~~aa~~~~~~~~~ 312 (314) T protein:vir:41 276 PKRDAAMRRTEYIASLRADCNYEDENAAVAAVIDMSS 312 (314) T ss_pred ecccCcCCeEEEEEEEEeceEEEEcCcEEEEEeeccC Confidence 8888877777788888999999888888777776666 No 167 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=98.64 E-value=1.5e-09 Score=68.98 Aligned_cols=293 Identities=12% Similarity=0.106 Sum_probs=144.0 Q ss_pred CCCccccc------------cccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcc Q lcl|NC_015719. 1 MANMQGGQ------------QLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRT 68 (344) Q Consensus 1 ma~~~~~~------------~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~ 68 (344) |...+... ..........+.++-..+..+.+..++.+.....+.+++.+++..+.+ .++++.-+.. 
T Consensus 123 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~vP~~~~~~i~~~l~~~~~l~~~~~v~~~~g--~~~~~~~~~~ 200 (466) T protein:vir:80 123 MPYEQRAALIARSEVKEFLAQVRTLAQQKRAVSGAELTIPDVMLELLRDNMHRYSKLISKVRLRPLKG--TARQNIAGAI 200 (466) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhccccccccHHHHHHHHHhhhhhhhhhhheeeeecCc--eeEeeeecCC Confidence 00000000 000000000011112235778899999998888888888888877653 3455555543 Q ss_pred ee-eeeeCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|NC_015719. 69 KA-AYLQPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINL 147 (344) Q Consensus 69 t~-~~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~ 147 (344) .. .-...|..++.. +++..++++.+.+.- .-+.|.+-=-..+..|+.+.+..+.+++|+...|+.|+.- ... T Consensus 201 ~~a~wv~E~~~~~~~--~~~f~~i~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~ail~G----~G~ 273 (466) T protein:vir:80 201 PEGVWTEAVANLNEL--SLSFSQIEVDGYKVG-GFIPIPNSTLEDSDLNLADEILDAIGQAIGFALDKAILYG----TGT 273 (466) T ss_pred cceeecccccccccc--cccccceeecceeee-eehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhheeec----cCC Confidence 32 223445555432 345566666554442 2244544222235578999999999999999999988631 111 Q ss_pred ccccccccccccC----ceeeeccc--cc---ccc--------cchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHH Q lcl|NC_015719. 148 ADGVNENIAGLGK----PSLLEVGA--KA---DLT--------DPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPD 210 (344) Q Consensus 148 ~~~~~~~~~~~~~----~~~i~~~~--~~---~~t--------~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~ 210 (344) ..|.|... .+...... +. +.+ .....+...+..+..+ ..+.+.+.....-+.++++. 
T Consensus 274 -----~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~w~~~~~ 347 (466) T protein:vir:80 274 -----KMPVGIVTRLAQTTQPPNWGTKAPAWTNLSTTNLLKIDPTGKSAEEFFSELVLK-LSKARANYSNGMKFWAMSSN 347 (466) T ss_pred -----CCcceeeecccccccccccccccccccccchhhhhhhhhhccchhhHHHHHHHH-HHhhhccccCCceeEEecch Confidence 11111100 00000000 00 000 0001111112211111 11122222222234678888 Q ss_pred HHHHHhccchhhhh--ccccccccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeE Q lcl|NC_015719. 211 VYSAILAALMPNAA--NYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVG 288 (344) Q Consensus 211 ~~~~Ll~~~~~~~~--~~~~~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 288 (344) .+..|+.-....+. .+.. ...++ ..++|.+|+.|+++|.+.. . .+++ T Consensus 348 ~~~~l~~~~~~~~~~g~~~~--~~~~~--~~i~G~pvv~s~~~~~~~~----~----------------~g~~------- 396 (466) T protein:vir:80 348 THAVLMSKAITFNSAGALVA--SLNNT--MPIVGGDIVILDFIPDNDI----I----------------GGYG------- 396 (466) T ss_pred hHHHhhcccccccCCccccc--cCCCc--ccccccceeecCccCccce----e----------------eecc------- Confidence 88877643322111 1111 11122 2589999999999986431 0 0000 Q ss_pred EEecHHHHhhhhhheeeeeeeecchhhhh--hhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
289 LFQHRSAVGTVKLKDLALERARRAEYQAD--QIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 289 l~~~~~Av~~~~~~~~~~e~~~~~~~~~d--~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +....+..+.++++...+..+.-| .+++.+++++++++|++.+.+.++.-+ T Consensus 397 -----~~y~i~~r~~~~i~~~~~~~f~~d~~~~r~~~r~dg~~~~~~afv~~~~~~~~ 449 (466) T protein:vir:80 397 -----SLYLLAERADIKLAQSEHVRFIEDQTVFKGTARYDGKPVFGEGFVAVNIANAN 449 (466) T ss_pred -----ccEEEEeecceEEEechhhhhhcCcEEEEEEEEEccEEeccCceEEEEecCCC Confidence 001112234456665544433333 478999999999999999999877766 No 168 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=98.57 E-value=1.1e-08 Score=64.18 Aligned_cols=286 Identities=14% Similarity=0.064 Sum_probs=139.7 Q ss_pred CCCccccccccc----cc----cccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcceeee Q lcl|NC_015719. 1 MANMQGGQQLGT----NQ----GKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRTKAAY 72 (344) Q Consensus 1 ma~~~~~~~~~~----~~----g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~ 72 (344) +.. .+.+..+ +. .. +..++--.|..+.|..++.+...+.|.++.+++..++ +|+ .+|++........ T Consensus 64 ~~~--~g~~~lt~~e~~~~~~~~~-~~~~~gg~lvP~~~~~~I~~~l~~~s~l~~~~~v~~~-~~~-~~i~~~~~~~~a~ 138 (383) T protein:vir:78 64 SAS--RTDKNITNEEIKFFNDINK-EVGYKEETLLPQTVVDEIFEDLTTEHPFLASIGMRTT-GLR-TKFLKSETSGVAV 138 (383) T ss_pred Hhc--CChhhhhHHHHHHHHHHhc-cCCCCCccccCHHHHHHHHHHHHhhccceeeeeeEec-CCc-eEEEEEcCCcceE Confidence 000 0000000 00 00 0111112367799999999999999999999988776 455 4777775554333 Q ss_pred e-eCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|NC_015719. 73 L-QPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGV 151 (344) Q Consensus 73 ~-~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~ 151 (344) . ..+..+... .+.+..+++|..-++ +.-+.|..-=-..+.+|+.+.+.++.++++++..|+.++.- .. 
T Consensus 139 w~~e~~~~~~~-~~~~f~~i~l~~~kl-~~~i~is~ell~Ds~~~ie~~i~~~l~~~~a~~~~~a~i~G----~G----- 207 (383) T protein:vir:78 139 WGKIFGEIKGQ-LDATFSDEESIQNKL-TAFVVVPKDLEKFGPAWVKRFVVTQIEEAFAVALESAYIVG----DG----- 207 (383) T ss_pred Eeecccccccc-cCcceeeEeecceee-EeeccchHHHhhccHHHHHHHHHHHHHHHHHHHHhhheEec----cC----- Confidence 3 223334322 133556666665333 34455544222235678999999999999999999988621 11 Q ss_pred ccccccccC----ceeeecccccccccchh----hHHHHHHHHH---HHHHHHhhcCC-CcCC-CEEEeCHHHHHHHhcc Q lcl|NC_015719. 152 NENIAGLGK----PSLLEVGAKADLTDPVK----LGQAVIAQLT---IARAALTKNYV-PAND-RTFYTTPDVYSAILAA 218 (344) Q Consensus 152 ~~~~~~~~~----~~~i~~~~~~~~t~~~~----~~~~i~~~l~---~a~~~Ld~~~V-P~~g-R~~vv~P~~~~~Ll~~ 218 (344) ...|.|.-. .+....+...+.+.... .....++.+. +....+....- ...+ -.++++|.-|+.++.. T Consensus 208 ~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~ 287 (383) T protein:vir:78 208 NDKPIGLNRKVGKGSTVVDGVYAEKAATGTLTFANPKTTVNELTDVYKYHSVKENGHPLNVAGKVTLLVNPTDAWDVKKQ 287 (383) T ss_pred CCCceeeeeccCCcccccccccccccccchhhhhhhHHHHHHHHHHHhccchhcccchhhhcCceEEEEcCcchhhhccc Confidence 112222211 01111110001011100 0011111111 11111111111 1111 2356677555444321 Q ss_pred chhhhhccccccccccceeEEEeCe--EEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHH Q lcl|NC_015719. 219 LMPNAANYAALIDPERGSIRNVMGF--EVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAV 296 (344) Q Consensus 219 ~~~~~~~~~~~~~~~~G~Vg~i~G~--~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av 296 (344) .... ..+|....+.|+ .|++|+.+|.+.+. -++ |+. . T Consensus 288 ~~~~---------~~~G~~~t~l~~~~~iv~s~~~p~~~ii--------------------fgd----------fs~--Y 326 (383) T protein:vir:78 288 YTSL---------NANGVYVTALPFNLNIIESLFVPEKKAI--------------------SYV----------AER--Y 326 (383) T ss_pred hhcc---------CCCCceeeecCCCceEEecCCCCcccEE--------------------Eee----------ccc--e Confidence 1111 124544455544 58888888853210 011 111 1 Q ss_pred hhhhhheeeeeeeecchhhh---hhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
297 GTVKLKDLALERARRAEYQA---DQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 297 ~~~~~~~~~~e~~~~~~~~~---d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ..+..+.++++.+. ..+|. ..+++.+++++++++|++.++|.++-.. T Consensus 327 ~i~~r~~~~i~~~~-~~~f~~d~~~f~~~~r~dG~~~~~~A~~vl~~~~~~ 376 (383) T protein:vir:78 327 DALIGGPLDIGTYD-QTLAIEDLNLYAAKQFAYGKAKDDKAAAVWTLNINP 376 (383) T ss_pred EEEecccceEEecc-hhhhhcCceEEEEEEEEcCEEecCCeEEEEEEEecC Confidence 12334556776654 33443 4689999999999999999997776333 No 169 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=98.55 E-value=1.4e-08 Score=63.57 Aligned_cols=285 Identities=13% Similarity=0.035 Sum_probs=146.0 Q ss_pred CCCcccccccccc--------ccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcc-eee Q lcl|NC_015719. 1 MANMQGGQQLGTN--------QGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRT-KAA 71 (344) Q Consensus 1 ma~~~~~~~~~~~--------~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~-t~~ 71 (344) +.... + +..|. ... +..++--.|..+.+..++.+...+.|.++.+.+..++ +|+ .+|++.... .+. T Consensus 57 ~~~~~-~-~~lt~~e~~~~~~~~~-~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~-~~~-~~i~~~~~~~~a~ 131 (381) T protein:vir:95 57 SLPKS-A-QSLSANQRSFFMDINK-NVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA-GLR-LKFLKSETSGVAV 131 (381) T ss_pred HhccC-c-ccccHHHHHHHHHHhc-ccCCCCceecCHHHHHHHHHHHHhhccceeheeeEec-Ccc-eEEEEecCCccee Confidence 11110 0 00000 000 0111112367799999999999999999999988776 454 466666443 333 Q ss_pred eeeCCCCCCCCcCCcccceEEEEeeeeeeec-eeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|NC_015719. 72 YLQPGESLDDKRKDIKHTEKTINIDGLLTAD-VLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADG 150 (344) Q Consensus 72 ~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~-~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~ 150 (344) -...+..+..+. +.+..+++|.. .++.. ..|..-=-..+.+|+.+.+.++.++++++..|+.++.- ... 
T Consensus 132 w~~e~~~~~~~~-~~~f~~i~l~~--~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G----~G~--- 201 (381) T protein:vir:95 132 WGKIYGEIKGQL-DAAFSEETAIQ--NKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKG----TGK--- 201 (381) T ss_pred eecccccccccc-cccceeeeecc--eeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEec----cCC--- Confidence 333333443321 23445555544 44443 44543112235678999999999999999999887521 111 Q ss_pred cccccccccC----ceeeecccccccccc----hhhHHHHHHHHHHHHHHHhhc----C-CCcCCCEEEeCHHHHHHHhc Q lcl|NC_015719. 151 VNENIAGLGK----PSLLEVGAKADLTDP----VKLGQAVIAQLTIARAALTKN----Y-VPANDRTFYTTPDVYSAILA 217 (344) Q Consensus 151 ~~~~~~~~~~----~~~i~~~~~~~~t~~----~~~~~~i~~~l~~a~~~Ld~~----~-VP~~gR~~vv~P~~~~~Ll~ 217 (344) ..|.|... ......+...+.+.. .......++.|..+...|... . .+..+-+++++|..+..|+. T Consensus 202 --~qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~ 279 (381) T protein:vir:95 202 --DQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQA 279 (381) T ss_pred --CCceeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhcc Confidence 11222110 000111100000000 001112234444444444322 2 23445677899999887764 Q ss_pred cchhhhhccccccccccceeEEE--eCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHH Q lcl|NC_015719. 218 ALMPNAANYAALIDPERGSIRNV--MGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSA 295 (344) Q Consensus 218 ~~~~~~~~~~~~~~~~~G~Vg~i--~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~A 295 (344) ...... .+|..... .|.+|++|+.+|.+.+ .-+++. . T Consensus 280 ~~~~~~---------~~G~~v~~l~~g~~vv~s~~~p~~~i--------------------ifgDfs--~---------- 318 (381) T protein:vir:95 280 QYTHLN---------ANGVYVTALPFNLNVIESTVQEAGKV--------------------LTYVKG--L---------- 318 (381) T ss_pred ccccCC---------CCCceeecCCCCceEEecCCCCcCcE--------------------EEEecc--c---------- Confidence 322211 13433233 3667999998885321 011111 1 Q ss_pred Hhhhhhheeeeeeeecchhhh---hhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
296 VGTVKLKDLALERARRAEYQA---DQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 296 v~~~~~~~~~~e~~~~~~~~~---d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ...+.+..++++.+.+. +|. ..+++.+++++++++|++.+++.++--. T Consensus 319 Y~i~~r~~~~i~~~~~~-~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~ 369 (381) T protein:vir:95 319 YDGYLAGGINVQKFKET-LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKG 369 (381) T ss_pred EEEEEecccEEEeechh-HhhcCCeEEEEEEEEcCEEecCceEEEEEEEecC Confidence 11223444566555433 333 3688999999999999999997666533 No 170 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=98.55 E-value=1.4e-08 Score=63.57 Aligned_cols=285 Identities=13% Similarity=0.035 Sum_probs=146.0 Q ss_pred CCCcccccccccc--------ccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcc-eee Q lcl|NC_015719. 1 MANMQGGQQLGTN--------QGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRT-KAA 71 (344) Q Consensus 1 ma~~~~~~~~~~~--------~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~-t~~ 71 (344) +.... + +..|. ... +..++--.|..+.+..++.+...+.|.++.+.+..++ +|+ .+|++.... .+. T Consensus 57 ~~~~~-~-~~lt~~e~~~~~~~~~-~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~-~~~-~~i~~~~~~~~a~ 131 (381) T protein:vir:10 57 SLPKS-A-QSLSANQRSFFMDINK-NVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA-GLR-LKFLKSETSGVAV 131 (381) T ss_pred HhccC-c-ccccHHHHHHHHHHhc-ccCCCCceecCHHHHHHHHHHHHhhccceeheeeEec-Ccc-eEEEEecCCccee Confidence 11110 0 00000 000 0111112367799999999999999999999988776 454 466666443 333 Q ss_pred eeeCCCCCCCCcCCcccceEEEEeeeeeeec-eeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|NC_015719. 72 YLQPGESLDDKRKDIKHTEKTINIDGLLTAD-VLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADG 150 (344) Q Consensus 72 ~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~-~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~ 150 (344) -...+..+..+. +.+..+++|.. .++.. ..|..-=-..+.+|+.+.+.++.++++++..|+.++.- ... 
T Consensus 132 w~~e~~~~~~~~-~~~f~~i~l~~--~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G----~G~--- 201 (381) T protein:vir:10 132 WGKIYGEIKGQL-DAAFSEETAIQ--NKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKG----TGK--- 201 (381) T ss_pred eecccccccccc-cccceeeeecc--eeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEec----cCC--- Confidence 333333443321 23445555544 44443 44543112235678999999999999999999887521 111 Q ss_pred cccccccccC----ceeeecccccccccc----hhhHHHHHHHHHHHHHHHhhc----C-CCcCCCEEEeCHHHHHHHhc Q lcl|NC_015719. 151 VNENIAGLGK----PSLLEVGAKADLTDP----VKLGQAVIAQLTIARAALTKN----Y-VPANDRTFYTTPDVYSAILA 217 (344) Q Consensus 151 ~~~~~~~~~~----~~~i~~~~~~~~t~~----~~~~~~i~~~l~~a~~~Ld~~----~-VP~~gR~~vv~P~~~~~Ll~ 217 (344) ..|.|... ......+...+.+.. .......++.|..+...|... . .+..+-+++++|..+..|+. T Consensus 202 --~qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~ 279 (381) T protein:vir:10 202 --DQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQA 279 (381) T ss_pred --CCceeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhcc Confidence 11222110 000111100000000 001112234444444444322 2 23445677899999887764 Q ss_pred cchhhhhccccccccccceeEEE--eCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHH Q lcl|NC_015719. 218 ALMPNAANYAALIDPERGSIRNV--MGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSA 295 (344) Q Consensus 218 ~~~~~~~~~~~~~~~~~G~Vg~i--~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~A 295 (344) ...... .+|..... .|.+|++|+.+|.+.+ .-+++. . T Consensus 280 ~~~~~~---------~~G~~v~~l~~g~~vv~s~~~p~~~i--------------------ifgDfs--~---------- 318 (381) T protein:vir:10 280 QYTHLN---------ANGVYVTALPFNLNVIESTVQEAGKV--------------------LTYVKG--L---------- 318 (381) T ss_pred ccccCC---------CCCceeecCCCCceEEecCCCCcCcE--------------------EEEecc--c---------- Confidence 322211 13433233 3667999998885321 011111 1 Q ss_pred Hhhhhhheeeeeeeecchhhh---hhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
296 VGTVKLKDLALERARRAEYQA---DQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 296 v~~~~~~~~~~e~~~~~~~~~---d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ...+.+..++++.+.+. +|. ..+++.+++++++++|++.+++.++--. T Consensus 319 Y~i~~r~~~~i~~~~~~-~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~ 369 (381) T protein:vir:10 319 YDGYLAGGINVQKFKET-LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKG 369 (381) T ss_pred EEEEEecccEEEeechh-HhhcCCeEEEEEEEEcCEEecCceEEEEEEEecC Confidence 11223444566555433 333 3688999999999999999997666533 No 171 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=98.54 E-value=4.4e-09 Score=66.41 Aligned_cols=295 Identities=9% Similarity=-0.004 Sum_probs=147.6 Q ss_pred CCCccccccccccccccc--cccchh--hhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcceeeeee-- Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQ--SAADKL--ALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRTKAAYLQ-- 74 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~--~~~d~~--~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~-- 74 (344) |+.-. -.+...+..+.+ ..+|.. .+....+..++.+.-++.|.++.+.+...+.+. ..+|+.+|-....... T Consensus 1 ~~~k~-~~~~l~~~~~~~~~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~-~~~i~~~~~~~~~~~~~~ 78 (321) T protein:vir:31 1 MASRT-INNDLSRITEKNALTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGAK-KTRIPTLNIGERHRRPQD 78 (321) T ss_pred CchHH-HHHHHHHHHHhccccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccCc-ceeeeeeccCCccccccc Confidence 55422 111112221111 112222 233477888888888899999999998876543 3556666532111111 Q ss_pred CCCCCCCCcCCcccceEEEEeeeeeeeceeccc--hHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|NC_015719. 75 PGESLDDKRKDIKHTEKTINIDGLLTADVLIYD--IEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVN 152 (344) Q Consensus 75 ~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd--~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~ 152 (344) .++..... ..++.+++++.+-+... 
-..|.+ +|+.....|+.+.+....++++++..+..++. +...+.++. T Consensus 79 e~~~~~~~-~~~~~~~~~~~~~k~~~-~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~n----Gd~~~~~~~ 152 (321) T protein:vir:31 79 EGEWNENE-SDVSTGTIDISTEKATV-AWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAAN----GDEDAEDSF 152 (321) T ss_pred cccccccc-ccceeeeeeeeeEEEEe-ehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheee----ccccCCCcc Confidence 12211111 12344555666644433 334433 22222246899999999999999998877652 111111111 Q ss_pred cc-cccccC---ceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcccc Q lcl|NC_015719. 153 EN-IAGLGK---PSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAA 228 (344) Q Consensus 153 ~~-~~~~~~---~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~ 228 (344) .+ ..|+-. ........++ ....++.|.++...|+++.--..+-+++++++.+..++.-..- ...... T Consensus 153 ~~~n~G~l~~a~~~~~~~~~~~--------~~~~~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~l~~-~~~~~~ 223 (321) T protein:vir:31 153 ENQNDGFITVAEGDVETIDAAD--------DILDNDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYHYTLTD-RDTPLG 223 (321) T ss_pred cccchhhhhhhccccccccccc--------cccCHHHHHHHHHhccHhHhcCCCeEEEechHHHHHHHHHHhc-CCCccc Confidence 10 112110 0000001111 1112456667777777665422234678999987655431111 112233 Q ss_pred ccccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeee Q lcl|NC_015719. 229 LIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALER 308 (344) Q Consensus 229 ~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~ 308 (344) ...+..|...++.|++|+.++.+|...+--. ++++.+ .+..+..+.+. 
T Consensus 224 ~~~l~~~~~~tl~G~pvv~~~~mP~~~il~t----------------------~~~nl~----------~~~~~~~~~~~ 271 (321) T protein:vir:31 224 DNVIMGEADVNPFSFPIIGSGLWPDDKAMFT----------------------DPQNLI----------YALYRDLEIDV 271 (321) T ss_pred cchhhccccccccceeEEEcCCCCCCcEEEe----------------------ccccEE----------EEEeeccEEEE Confidence 4446677777899999999999996432110 112211 11223345666 Q ss_pred eecchhh---hhhhhhhh--hhcCceeccccEEEEE-ecCCC Q lcl|NC_015719. 309 ARRAEYQ---ADQIIAKY--AMGHGGLRPESAGALV-FKAGA 344 (344) Q Consensus 309 ~~~~~~~---~d~i~~~~--~~G~~v~Rp~~~~~l~-~~~~a 344 (344) .++.+.. .+.+...+ -++..+-++++++.++ ++.+- T Consensus 272 ~~~~~~~~~~~~~~~~~~~~~~~~~ve~~~a~a~~~~i~~~~ 313 (321) T protein:vir:31 272 LTESDKVSERDLHARYFMRGDDDFAIENTEAVVLAEGLGDPL 313 (321) T ss_pred eecCccccccceeeEeeeeeecceeEeccccEEEEecCCcch Confidence 6554332 23344333 2667788888888877 34433 No 172 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=98.52 E-value=9.3e-09 Score=64.62 Aligned_cols=284 Identities=12% Similarity=0.014 Sum_probs=141.2 Q ss_pred CCCcccc---------ccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec-Cccee Q lcl|NC_015719. 1 MANMQGG---------QQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI-GRTKA 70 (344) Q Consensus 1 ma~~~~~---------~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~ 70 (344) +++.... -+.....| ..++-..+..+.|..++.+...+.|.++.+++..++. 
|+ ++++.- +..++ T Consensus 59 ~~~~~~~~lt~ee~~~~~~~~~~~---~~~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~~-~~~~~~~~~~~a 133 (377) T protein:vir:98 59 DLRDKNRELTAEEIKFFNDIDKNV---GGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS-LR-LKALTAETSGTA 133 (377) T ss_pred HhccCCcccCHHHHHHHHHHHhcc---CCCCCccccCHHHHHHHHHHHHHhhhhhhheeeEecC-cc-eEEEEecCCcce Confidence 1110000 00000111 1122223577999999999999999999999888764 54 466653 44444 Q ss_pred eeeeCCCCCCCCcCCcccceEEEEeeeeeeece-eccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|NC_015719. 71 AYLQPGESLDDKRKDIKHTEKTINIDGLLTADV-LIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLAD 149 (344) Q Consensus 71 ~~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~-~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~ 149 (344) .-...+..+..+. ..+..+ +++...++..+ .|..-=-..+.+|+-+.+.++.++++++..|+.++. +.. T Consensus 134 ~w~~e~~~~~~~~-~~~f~~--i~l~~~kl~a~~~is~elL~ds~~~ie~~i~~~la~~~a~~~~~a~i~----G~G--- 203 (377) T protein:vir:98 134 VWGDIFGEIKGQL-KQAFKE--QDFSQFKLTAFVVIPKDALKFGPKWIKQFITEQLKEAIAVALELAIVK----GDG--- 203 (377) T ss_pred eEeecccccCccc-Ccccee--EeecceeEEeeecccHHhhhccHhHHHHHHHHHHHHHHHHHHhhceEe----ccC--- Confidence 4444444443221 223344 45555554443 443311223577899999999999999999998862 111 Q ss_pred ccccccccccCc---eeeeccc----ccccccchhhHHHHH----------HHHHHH--HHHHhhcCCCcCCCE-EEeCH Q lcl|NC_015719. 150 GVNENIAGLGKP---SLLEVGA----KADLTDPVKLGQAVI----------AQLTIA--RAALTKNYVPANDRT-FYTTP 209 (344) Q Consensus 150 ~~~~~~~~~~~~---~~i~~~~----~~~~t~~~~~~~~i~----------~~l~~a--~~~Ld~~~VP~~gR~-~vv~P 209 (344) ...|.|.-.. ..+.... .+..++.....+..+ ..+... 
...+++-.- ..||+ ++++| T Consensus 204 --~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~a~~~m~~~t~~~~~klkd-~~G~~i~~~n~ 280 (377) T protein:vir:98 204 --LLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLTPDNAPKKLVPVMKHLSVNDKKRPLK-IAGQVKLILNP 280 (377) T ss_pred --CCcceeeeecccccccccccccccccccchhhhHhhhhhhchhHHHHHHHHHHHHHHHHHHhhhhc-cCCceEEEecc Confidence 1112221100 0000000 000000000000000 001000 111122122 24554 55777 Q ss_pred HHHHHHhccchhhhhccccccccccceeEEEeCeE--EEEecccccccccccccccccccccccccccccccccccccee Q lcl|NC_015719. 210 DVYSAILAALMPNAANYAALIDPERGSIRNVMGFE--VVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVV 287 (344) Q Consensus 210 ~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg~i~G~~--V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 287 (344) .-|..++..... ...+|.-..++|++ |++|+.+|.+.+. -+++ T Consensus 281 ~~~~~~~p~~~~---------~~~~G~~~t~lg~p~~vv~s~~~p~~~i~--------------------fgdf------ 325 (377) T protein:vir:98 281 EDRWALEAQFTS---------RNQFGEYVTVLPHGITILESLAVETGKAI--------------------AFVA------ 325 (377) T ss_pred cchhhccccccc---------cCCCCccccccCCCceEEecCCCCcccEE--------------------EEEe------ Confidence 666555422111 11345444666554 7788888753210 0111 Q ss_pred EEEecHHHHhhhhhheeeeeeeecchhhh--hhhhhhhhhcCceeccccEEEEEecCC Q lcl|NC_015719. 288 GLFQHRSAVGTVKLKDLALERARRAEYQA--DQIIAKYAMGHGGLRPESAGALVFKAG 343 (344) Q Consensus 288 gl~~~~~Av~~~~~~~~~~e~~~~~~~~~--d~i~~~~~~G~~v~Rp~~~~~l~~~~~ 343 (344) .. 
-..+....++++.+.+....- ..+++.++++++++.|++.++|.++.| T Consensus 326 ----~~--Y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dg~~~~~~a~~vl~i~~~ 377 (377) T protein:vir:98 326 ----NR--YDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred ----cc--eeEEeecceEEEeechhhhhcCceEEEEEEEEcCEEeccCcEEEEEEecC Confidence 11 112334456666654332222 458899999999999999999999999 No 173 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=98.47 E-value=2.6e-08 Score=62.15 Aligned_cols=283 Identities=13% Similarity=0.044 Sum_probs=141.4 Q ss_pred CCCcccccccccc----------ccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCccee Q lcl|NC_015719. 1 MANMQGGQQLGTN----------QGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRTKA 70 (344) Q Consensus 1 ma~~~~~~~~~~~----------~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~ 70 (344) +.. .+.+..+. .+.+ .+|. .|..+.|..++.+...+.|.++.+.++.++ +|. .+|++...... T Consensus 57 ~~~--~~~~~l~~~e~~~~~~~~~~t~-~~Gg--~lvP~~~~~~I~~~l~~~spir~~a~v~~~-~~~-~~i~~~~~~~~ 129 (381) T protein:vir:10 57 SLP--KSAQTLSANQRNFFMDINKSVG-YKEE--KLLPEETIDRIFEDLTTNHPLLADLGIKNA-GLR-LKFLKSETSGV 129 (381) T ss_pred Hhc--ccccccCHHHHHHHHHHhhcCC-CCCc--eecCHHHHHHHHHHHHhhcceeeeeeeEec-Ccc-eEEEeecCCcc Confidence 110 00000000 1111 1111 367799999999999999999999998876 444 45555543332 Q ss_pred eee-eCCCCCCCCcCCcccceEEEEeeeeeeec-eeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc Q lcl|NC_015719. 71 AYL-QPGESLDDKRKDIKHTEKTINIDGLLTAD-VLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLA 148 (344) Q Consensus 71 ~~~-~~g~~~~~~~~~~~~~~~~l~iD~~~~~~-~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~ 148 (344) ... ..+..+..+. ..+.+++ .+...++.. ..|..-=-..+.+|+-+.+..+.++++++..|+.++. +.. . 
T Consensus 130 a~W~~e~~~~~~~~-~~~f~~i--~l~~~kl~a~i~is~elL~Ds~~~le~~i~~~la~~~a~~~~~afi~----GdG-~ 201 (381) T protein:vir:10 130 AVWGKIYGEIKGQL-DAAFSEE--TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLK----GTG-K 201 (381) T ss_pred eEEeeccccccccc-CccceeE--eecceeEEeeccccHHHHhccHHHHHHHHHHHHHHHHHHHhhceeEe----ccc-C Confidence 222 2222333221 2334444 444555444 3443311223567899999999999999999988752 111 1 Q ss_pred ccccccccccc----Cceeeecccccccccch----hhHHHHHHHHHHHHHHH----hhcCC-CcCCCEEEeCHHHHHHH Q lcl|NC_015719. 149 DGVNENIAGLG----KPSLLEVGAKADLTDPV----KLGQAVIAQLTIARAAL----TKNYV-PANDRTFYTTPDVYSAI 215 (344) Q Consensus 149 ~~~~~~~~~~~----~~~~i~~~~~~~~t~~~----~~~~~i~~~l~~a~~~L----d~~~V-P~~gR~~vv~P~~~~~L 215 (344) ..|.|.. .+.....+...+.+... ......++.+..+...+ ..+.. +..+.+++++|..+..| T Consensus 202 ----~qP~Gil~~~~~~~~~~~g~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l 277 (381) T protein:vir:10 202 ----DQPIGLNRQVQKGVSVTDGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEV 277 (381) T ss_pred ----CCceeeeecCCccccccccccccccccccccccchhhHHHHHHHHHHhhhhhhccccccccCceEEEEchhhHHhh Confidence 1222211 11111111111100000 00111122222222222 11222 34557789999998888 Q ss_pred hccchhhhhcccccccccccee-EEE-eCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecH Q lcl|NC_015719. 216 LAALMPNAANYAALIDPERGSI-RNV-MGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHR 293 (344) Q Consensus 216 l~~~~~~~~~~~~~~~~~~G~V-g~i-~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~ 293 (344) +......+. +|.. ..+ .|.+|++++.+|.+.+ .-++++. T Consensus 278 ~~~~~~~~~---------~G~~v~~lp~g~~vv~~~~~p~~~i--------------------~fGDfs~---------- 318 (381) T protein:vir:10 278 QAQYTHLNA---------NGVYVTALPFNLNVIESTVQEAGKV--------------------LTYVKGL---------- 318 (381) T ss_pred ccccccCCC---------CCceeecCCCCceeEEcCCCCcCcE--------------------EEEEccc---------- Confidence 654332221 2221 112 4788999999985321 0111111 Q ss_pred HHHhhhhhheeeeeeeecchhhh---hhhhhhhhhcCceeccccEEEEEec--CC--C Q lcl|NC_015719. 
294 SAVGTVKLKDLALERARRAEYQA---DQIIAKYAMGHGGLRPESAGALVFK--AG--A 344 (344) Q Consensus 294 ~Av~~~~~~~~~~e~~~~~~~~~---d~i~~~~~~G~~v~Rp~~~~~l~~~--~~--a 344 (344) ...+.+..++++...+. +|. ..+++.+++++++++|++.+++.++ ++ | T Consensus 319 --Y~i~~r~~~~i~~~~~~-~~~~d~~~f~a~~r~dG~~~~~~A~~v~~l~~~~~~~~ 373 (381) T protein:vir:10 319 --YDGYLAGGINVQKFKET-LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPA 373 (381) T ss_pred --EEEEEecccEEEeechh-hhhcCceEEEEEEEEcCEEecCCcEEEEEEeecCCccc Confidence 11123444566655433 333 3688999999999999999997665 21 1 No 174 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=98.47 E-value=1.8e-08 Score=63.08 Aligned_cols=288 Identities=13% Similarity=0.045 Sum_probs=141.7 Q ss_pred CCCcc----ccccccc----------cccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecC Q lcl|NC_015719. 1 MANMQ----GGQQLGT----------NQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIG 66 (344) Q Consensus 1 ma~~~----~~~~~~~----------~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG 66 (344) +.+.. .+.+..+ +.+-...+| .|..+.+..++.+..++.|.++.++++.++ +|+ ++|+... T Consensus 61 ~~~~~~~~~r~~~~l~~ee~~~~~~~~~~t~~~gG---~liP~~~~~~Ii~~l~~~s~i~~~~~v~~~-~~~-~~i~~~~ 135 (395) T protein:vir:95 61 VVDNGILAKRSQDPLTSEERKFFNDINYDVGYTDE---KILPETVVERVFDDLQKDHPLLSKINFQNA-GIK-TRVIKAD 135 (395) T ss_pred HHHHHHHhhcCccccchHHHHHHHHHhhccCCCCc---eeccHHHHHHHHHHHHhhhhhhhhceeEec-CCc-eEEEEec Confidence 00000 0000000 111111111 356799999999999999999999998776 454 5677654 Q ss_pred cceeeee-eCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_015719. 67 RTKAAYL-QPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLI 145 (344) Q Consensus 67 ~~t~~~~-~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a 145 (344) ....... ..+..+... .+.+.++++|..-+. +.-+.|.+-=-..+..|+-+.+.++.++++++..|+.++.- . 
T Consensus 136 ~~~~a~w~~e~~~~~~~-~~~~f~~i~l~~~kl-~~~~~iS~ell~ds~~~ie~~i~~~la~~ia~~~~~a~i~G----~ 209 (395) T protein:vir:95 136 PAGQAVWGKVFGEIKGQ-LDAAFREENFTQYKL-TCFVVLPDDLSTFGPAWIERFVRTQIQEAISVALESAIING----G 209 (395) T ss_pred CCcceEEeecccccCcc-ccccceeeeeceeeE-EEeecccHHHHhcchhHHHHHHHHHHHHHHHHHHhhheeec----c Confidence 4433222 222333322 134555655555332 33344544222335688999999999999999999988521 0 Q ss_pred hcccccccccccccCcee-----eecccccccccchhhHHHHHHHHHHHHHHHhh----cC-CCcCCCEEEeCHHHHHHH Q lcl|NC_015719. 146 NLADGVNENIAGLGKPSL-----LEVGAKADLTDPVKLGQAVIAQLTIARAALTK----NY-VPANDRTFYTTPDVYSAI 215 (344) Q Consensus 146 ~~~~~~~~~~~~~~~~~~-----i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~----~~-VP~~gR~~vv~P~~~~~L 215 (344) +.....|.|.-.... ...+..+.. .........+..+..+...|.- .. .......++++|..+..+ T Consensus 210 ---G~~~~qP~Gil~~~~~~~~~~~~~~~~~~-~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~~~~ 285 (395) T protein:vir:95 210 ---GAAKTQPVGLMKDVNTNSGAVTDKASSGT-LTFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDSWDV 285 (395) T ss_pred ---CCCCcCceeeeecccccccccccccccch-hhhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhhhhc Confidence 000001222111000 000000000 0011112223333333332211 11 111234567888776654 Q ss_pred hccchhhhhccccccccccceeEEEe--CeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecH Q lcl|NC_015719. 216 LAALMPNAANYAALIDPERGSIRNVM--GFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHR 293 (344) Q Consensus 216 l~~~~~~~~~~~~~~~~~~G~Vg~i~--G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~ 293 (344) ....-. . ...|...++. |.+|++++.+|.+.+ + -+++. . .+ T Consensus 286 ~g~~~~-----~----~~~G~~~~~lg~g~~v~~~~~~p~~~i----~----------------fgdfs--~--y~---- 328 (395) T protein:vir:95 286 QARYTY-----L----TANGGFVTVLPYNVTIITSEFVPEGKL----V----------------AFVTD--R--YN---- 328 (395) T ss_pred CCccee-----c----cCCCcceeccCCcceEEEcCCCCCCcE----E----------------EEecc--c--EE---- Confidence 321111 0 1245555664 667899999985321 0 01111 1 01 Q ss_pred HHHhhhhhheeeeeeeecchh--hhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
294 SAVGTVKLKDLALERARRAEY--QADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 294 ~Av~~~~~~~~~~e~~~~~~~--~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) .+....++++...+... -...+++..++|+++++|++.++|.++..- T Consensus 329 ----i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dg~~~~~~A~~~l~i~~~~ 377 (395) T protein:vir:95 329 ----AVRGGGLTVKKFDQTLALEDAVLFTAKTFAYGQPDDNKASAVYDLKVAS 377 (395) T ss_pred ----EEEecceEEEeccchhhhCCcEEEEEEEEECCEEeccccEEEEEeeccC Confidence 12234455655543221 124478899999999999999999887333 No 175 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=98.44 E-value=3.1e-08 Score=61.78 Aligned_cols=304 Identities=15% Similarity=0.091 Sum_probs=153.9 Q ss_pred CCCccccc-cccccccccccccchhh--hhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcc--eeeeeeC Q lcl|NC_015719. 1 MANMQGGQ-QLGTNQGKGQSAADKLA--LFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRT--KAAYLQP 75 (344) Q Consensus 1 ma~~~~~~-~~~~~~g~~~~~~d~~~--l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~--t~~~~~~ 75 (344) |-.+..-- +-..+.-+....+|... |-.++++ +..+.-++.|.++.+.++.+..++.+..|+.+|.. ....++. T Consensus 1 ~~~~~~~~~~~~~~~~k~~t~~d~~Gg~l~P~~~~-~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~~~~~~~g~~~ 79 (315) T protein:vir:41 1 MLTIEDIRGGKPFEIVPKIDVPDLGRGVLSVDRFG-EFVKAVRDSAVIIPEARIDNALKSYEKDISRLSLVLDVGPGRDE 79 (315) T ss_pred CcccchhhcCChhhhhhhcCCcCCCCceechHHHH-HHHHHHHhhhhhhhhceeeeccccccccccccccCccccccccc Confidence 21111000 00011111111122222 2346665 45577778899999988765445566667776532 1111111 Q ss_pred CCCC-CCCcCCcccceEEEEeeeeeeeceeccc--hHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|NC_015719. 76 GESL-DDKRKDIKHTEKTINIDGLLTADVLIYD--IEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVN 152 (344) Q Consensus 76 g~~~-~~~~~~~~~~~~~l~iD~~~~~~~~Idd--~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~ 152 (344) ++.- +.+...++..+.+|.+-+.. 
....|.+ +|+..-..|+.+.++.+.++++++..+..++. +-..+..+. T Consensus 80 ~~~~~~~~~~~~~f~~~~l~~~~l~-~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~n----Gdg~s~~p~ 154 (315) T protein:vir:41 80 TGQKLAPPESTAEVKTNTLYMREMV-TKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYLH----GDTSSSDPL 154 (315) T ss_pred ccCcCCCCCCccccceeeeceeeee-eeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhhc----cCCcCcCcc Confidence 1111 11112234455555554443 3344532 22222235899999999999999988877652 111111110 Q ss_pred -cccccccCceeeec-ccccccccchhhHHHHHHHHHHHHHHHhhcCCCc-CCCEEEeCHHHHHHHhccchhhhhccccc Q lcl|NC_015719. 153 -ENIAGLGKPSLLEV-GAKADLTDPVKLGQAVIAQLTIARAALTKNYVPA-NDRTFYTTPDVYSAILAALMPNAANYAAL 229 (344) Q Consensus 153 -~~~~~~~~~~~i~~-~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~-~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~ 229 (344) ..+.|+-....... +...+... .....+.|.++...|..+.--. .+-.+++++..+..|.+-. -.+..|..+ T Consensus 155 ~~~~~G~l~~a~~~~~~~~~~~~a----~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~rklk-~~~g~~lw~ 229 (315) T protein:vir:41 155 LRMSDGWLKLASEKLTESDVDPEA----EDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYRDAL-KGRETGLGD 229 (315) T ss_pred ccccccceeccccccccccccccc----ccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHHHHh-ccCCCcccc Confidence 11223211000000 00011111 1112344555555555543211 2335688999988775421 123456666 Q ss_pred cccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeee Q lcl|NC_015719. 230 IDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERA 309 (344) Q Consensus 230 ~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~ 309 (344) ..+..|....+.|.+|+.++.+|........ 
.++.+.+-+..+-...++.|.+ T Consensus 230 ~~~~~g~~~tl~G~PV~~~~~m~~~~~~~~~---------------------------ilf~d~~nl~~~~~~~i~i~~~ 282 (315) T protein:vir:41 230 QALTGANSILYDGRPVQYVPALEALNDGKSR---------------------------ALFVVPTQLVYGFWRNIKVVPD 282 (315) T ss_pred chhhcCCCceecccceEecccccccCCCCcc---------------------------EEEecccceEEEeccccEEEee Confidence 6778888889999999999999854321111 1122222222233456788888 Q ss_pred ecchhhhhhhhhhhhhcCceeccccEEEEEecC Q lcl|NC_015719. 310 RRAEYQADQIIAKYAMGHGGLRPESAGALVFKA 342 (344) Q Consensus 310 ~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~ 342 (344) |+.......+....+.|.++.-++++++-..+- T Consensus 283 ~~a~~~~~~~~~~~r~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 283 YDAEMRLTKYVASLRTDNHYEDEEGAVSATITV 315 (315) T ss_pred ecCCCCceEEEEEEEeceeEEeccceeEeeeeC Confidence 887766666777788898877777766655555 No 176 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=98.18 E-value=4.4e-07 Score=55.44 Aligned_cols=274 Identities=12% Similarity=0.046 Sum_probs=143.7 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecC----cceeeeeeCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIG----RTKAAYLQPG 76 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG----~~t~~~~~~g 76 (344) |+--..-. ..+..+. .=++| |.++|+.-+.+-++ .++.+|...+..|.+++++... ....++...| T Consensus 1 M~~e~nl~-~~~dL~~---a~siD--F~~~f~~~i~~L~~----~LGv~r~~pla~Gt~iktyK~~~~~y~gda~dVaEG 70 (303) T protein:vir:10 1 MSAENNLI-NVEALGK---AKSID--FANKLGVGLNKLFE----ALAIQNKIPMNVGSALKQYRFKVEDSEKPNGDVAEG 70 (303) T ss_pred CCCCcCCc-chhhccc---ceeeh--hhhhhhhhHHHHHH----HhhhhccccccCCceeeeeeeeceeeccccccccCC Confidence 54322100 1133332 23455 99999988776553 4566676677778888766542 1224577889 Q ss_pred CCCCCCcCCcc-cceEEEEeeeeeeeceeccchHHH-H-hC-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|NC_015719. 
77 ESLDDKRKDIK-HTEKTINIDGLLTADVLIYDIEDA-M-NH-YDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVN 152 (344) Q Consensus 77 ~~~~~~~~~~~-~~~~~l~iD~~~~~~~~Idd~D~~-q-~~-~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~ 152 (344) +.||.+.-..+ ....++++.++.- -+. ||+ | .- .|...+.-++...++++.+|..++..+..+.... T Consensus 71 e~Iplskvt~~~~~t~~~~~kK~rK---~tT--dEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~lktaT~t~---- 141 (303) T protein:vir:10 71 DVIPLTKVTREQVDITELQFAKYRK---STS--AEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFETLKSAIENG---- 141 (303) T ss_pred cccchhhheeeecceEEEEeecccc---ccc--HHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhhccccc---- Confidence 99987632211 2345677755443 233 455 4 43 4599999999999999999999987665321100 Q ss_pred cccccccCceeeecccccccccchhhHHHHHHHHHHHHHHH-------hhcCCCcCCCEEEeCHHHHHHHhccchhhhh- Q lcl|NC_015719. 153 ENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAAL-------TKNYVPANDRTFYTTPDVYSAILAALMPNAA- 224 (344) Q Consensus 153 ~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~L-------d~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~- 224 (344) .. +..+.. .++.|..|...+ +|.++ .-+++|+|.-.+.||.+-..... T Consensus 142 -------~~--------t~~t~~------s~~glq~Al~~~~~kl~~~~ed~~---~~V~FvNP~Daa~yl~~A~i~~~~ 197 (303) T protein:vir:10 142 -------KR--------TNKTKL------SAENLQGALSKGRANLSVLLDDEI---TPIAFVNPNDTAEYLANGFINSTG 197 (303) T ss_pred -------cc--------ccceee------cHHHHHHHHHhhhhhccccccccc---cEEEEEchHHHHHHhhcCCcchhh Confidence 00 000111 123334443333 33332 24888999999999987766532 Q ss_pred ccccccccccceeEEEeCeEEEEeccccccccccccccc----ccccccc-ccccccccccccccceeEEEecHHHHhhh Q lcl|NC_015719. 225 NYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEE----GTDASNQ-KHAFPATGGKVNKENVVGLFQHRSAVGTV 299 (344) Q Consensus 225 ~~~~~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~----~~~~~~~-~~~~~~~~~~~~~~~~~gl~~~~~Av~~~ 299 (344) ..-|..-+. ++.|+.|+.|+.+|.+..-.++.-. -.+..++ ...|+- ..+..||+. 
T Consensus 198 t~fG~n~L~-----nfLG~~II~S~kv~~G~~~~T~~~Ni~~ay~~~~g~l~~~f~~------t~D~tglIG-------- 258 (303) T protein:vir:10 198 AQFGVNLLT-----PYVGVKIVEFADVPQGEVWMTVAENLNVAYANPRGELSRAFAF------ATDATGFVG-------- 258 (303) T ss_pred hhhhhhhhh-----hhhcceEEEeccCCCceEEEeeccceEEEEecCchhhhhhhhh------ccccccceE-------- Confidence 222333333 4999999999999987543332110 0000000 011110 112223221 Q ss_pred hhheeeeeeeecchhhhhhhhhhhhhcC--ceeccccEEEEEecCCC Q lcl|NC_015719. 300 KLKDLALERARRAEYQADQIIAKYAMGH--GGLRPESAGALVFKAGA 344 (344) Q Consensus 300 ~~~~~~~e~~~~~~~~~d~i~~~~~~G~--~v~Rp~~~~~l~~~~~a 344 (344) + ..++....=-+..+...|. =+=|+|++++..+++.= T Consensus 259 ------v--~h~~~~~~~t~eT~~~~~~~lfpE~~dgiv~~ti~~~e 297 (303) T protein:vir:10 259 ------V--LHDIQPQRLTSDTIYASAISMFPENIDAVIKVTIKKDE 297 (303) T ss_pred ------E--EeccccceeeehhHhHhHHHhcccccceEEEEEEeccc Confidence 0 1111111111223333333 24577888888884433 No 177 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=98.03 E-value=2.9e-07 Score=56.42 Aligned_cols=303 Identities=15% Similarity=0.156 Sum_probs=171.9 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcceeeeeeCCCCCC Q lcl|NC_015719. 
1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRTKAAYLQPGESLD 80 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~~~ 80 (344) |+.=.++..+.+|--- .+++..=|..+.-++-|.++-.-=.+-..++..-.++.|.+..|+.+|..-..+...|.+++ T Consensus 59 m~G~~p~~eV~~~e~m--tt~~a~IliP~vis~v~~Eaaepl~~~~kl~qk~~L~~Grsm~F~~~g~~Ra~~IgEGgE~~ 136 (393) T protein:vir:79 59 MEGETPTNEVNLREFM--ATPSAQILIPRVIVGTMREAAEPLYIGTKMLQKIRLKSGQSMIFPSIGIMRAYDVAEGQEIP 136 (393) T ss_pred hcCCCchhheehhhhh--cCCCcceechhhhhhhhhhcccchhHHHHHHHHHhhhcCcceeccchheeeecccccccccc Confidence 8887777775555444 34444434678888888775432233333444445678999999999988888888888887 Q ss_pred CCcCCcccceEEEEeeeeeeeceeccchHHH--HhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccc Q lcl|NC_015719. 81 DKRKDIKHTEKTINIDGLLTADVLIYDIEDA--MNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGL 158 (344) Q Consensus 81 ~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~--q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~ 158 (344) ...-+. .+.-.+.+-+.++. ..|.=-|+. .+..|+++-..+.++.+|+|..|+.++++.-+-... .. .++ T Consensus 137 ~~sld~-~T~dsv~~~~gK~G-~~Ia~SqEmIsDSg~Dvin~~l~aA~RaMaRkKee~a~n~fk~~ght---vf---Da~ 208 (393) T protein:vir:79 137 EDSIDW-QTHESPEIRVGKSG-IRLRFTDEMISDSQWDLMSMMIKQAGRAMGRHKEQKAYHQFRSHGHT---VF---DNY 208 (393) T ss_pred ccchhh-hcCCceeEEechhh-hhhhhHHHHhhcchHHHHHHHHHHHHHHHHhhhHHHHHhhhhcccce---ee---ecc Confidence 654331 22224556566643 344322332 368999999999999999999999999887532210 00 011 Q ss_pred cCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhh---hccc-------- Q lcl|NC_015719. 159 GKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNA---ANYA-------- 227 (344) Q Consensus 159 ~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~---~~~~-------- 227 (344) ..++.....+-.. +..-++....++|.++.-.--.... .+-++++.|=.|+..-+....-. 
..|+ T Consensus 209 st~t~ahptGr~~--~~~qNGTlSleDllDm~~av~~~hy--t~svi~MHPLAWnv~AKna~me~~~~na~gN~~~~~~~ 284 (393) T protein:vir:79 209 STNKLAHTTGLDK--NGVQNDTFSAEDFLDLIIAVMANEY--TPSDLMMHPLAWTVFAKNELMGSLQANPYGNYPAKGAP 284 (393) T ss_pred ccCccceeecCCc--cccccccccHHHHHHHHHHHhcccC--CcceEEEcCchhhhhhhhhhhcceeeccccccCccccc Confidence 1222211111000 0011222334556555544443333 34678888888876655432211 1111 Q ss_pred cccc----cccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhhe Q lcl|NC_015719. 228 ALID----PERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKD 303 (344) Q Consensus 228 ~~~~----~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~ 303 (344) .+.. +.+|++ -+.|+|+.||-+|.-.... -|--|. --.+.+|++.-++ + T Consensus 285 ts~algp~~i~~~~--~~nlnv~~sPfvp~d~k~~--------------rFd~~~---Vd~NnvgvlLV~D--------~ 337 (393) T protein:vir:79 285 SSMALGPDSIQGRL--PFNFNVNLSPFIPLDKKSR--------------RFDVYA---VDRNNVGVLLVRD--------D 337 (393) T ss_pred hhhhhchhhhcccc--ccceeEEEecccccccccc--------------eeeEEE---eecCCceEEEEec--------C Confidence 1111 112211 1458999999888643210 111111 1245566655333 5 Q ss_pred eeeeeeecchhhhhhhhhhhhhcCceeccccEEEEE----ecCCC Q lcl|NC_015719. 
304 LALERARRAEYQADQIIAKYAMGHGGLRPESAGALV----FKAGA 344 (344) Q Consensus 304 ~~~e~~~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~----~~~~a 344 (344) +++|.+.|+-+--.-|+-.-+||.+|+.-..++..- ...+= T Consensus 338 i~tdq~ddk~rdiq~iKl~ERYG~gvLn~gkaiavakNI~~~k~y 382 (393) T protein:vir:79 338 LKTDQWDEKARGLQNIKMIERYGIGILNEGKAIAVAKNISMDKSY 382 (393) T ss_pred cceeccccccccceeeeeeeeeceeeeeCCceEEEEecceeeccc Confidence 789999988887788889999999999886665432 11111 No 178 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=98.00 E-value=2.3e-06 Score=51.45 Aligned_cols=297 Identities=11% Similarity=0.090 Sum_probs=156.2 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhc--CCceee-ec-----ccccEEEEeecCcceee- Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTA--NRHMQR-QI-----SSGKSAQFPVIGRTKAA- 71 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~--~~~~~~-~i-----~~G~tv~i~~iG~~t~~- 71 (344) |+.++.- |+. +| -+-.|+|...|.+...+.+-|. +.+... ++ .+|+.+.+|..+...-. T Consensus 1 M~~~~~~----T~l------~D--ii~pEvF~~Yv~~~~~e~~~l~qSGiv~~d~~l~~~~~~gG~~v~iPf~~~L~g~~ 68 (367) T protein:vir:80 1 MPDFNNQ----VRL------VD--AVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLE 68 (367) T ss_pred Ccchhhh----hhh------hh--ccchhhhhHHHhhhhhhhhhhhhcceeecCHHHHHHhhcCCCEEEeeeeccCCCCc Confidence 8876631 222 22 2455999999988887665543 333322 22 57999999999876421 Q ss_pred -eeeCCCCC-CCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|NC_015719. 72 -YLQPGESL-DDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLAD 149 (344) Q Consensus 72 -~~~~g~~~-~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~ 149 (344) .|...++. ..++..++..+..-. =...-.+|...|+-..-+--|+|..+..+-+.--.+..-+.+|..|.+.-+... 
T Consensus 69 ~n~~~d~~~~~~t~~kittg~~~a~-v~~r~kaw~~~Dla~~lsG~dpm~~Ia~qva~yW~r~~q~~Lla~L~Gvf~~~~ 147 (367) T protein:vir:80 69 PNYGSDNPNVEAPIDGLGSGEMKTT-KTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNL 147 (367) T ss_pred cccCCCCCcccccccccccchheee-eehhcccchhhhHHHHhhCchHHHHHHHHHHHHhhhhhHHHHHHHHHHhhcccc Confidence 12111110 011122333322111 123445677889998888889999999998877777766665555543322211 Q ss_pred ccc-----------cccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhcc Q lcl|NC_015719. 150 GVN-----------ENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAA 218 (344) Q Consensus 150 ~~~-----------~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~ 218 (344) ..+ ....+.....+.+..+.+...+.. .-.+.+.+|+..|-++. +.=-.++|.+.+|..|.+. T Consensus 148 a~~~~~~~~~~~~~a~~~~~~~~~~~Dis~~t~~~~~~----~s~~~~~~A~~~lGD~~--~~l~~i~mHS~V~~~L~~~ 221 (367) T protein:vir:80 148 AGNFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAV----FNREAFVDAAFTMGDHV--GSIAAIAVHSMVYKRMTNN 221 (367) T ss_pred ccchhhhhhhhccccccccccCceeeeeeccCCCccce----ecHHHHHHHHHHhcccc--ccccEEEEchHHHHHHHhc Confidence 111 011223344555554333211111 11456778888886642 2336788999999999876 Q ss_pred chhhhhccccccccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhh Q lcl|NC_015719. 219 LMPNAANYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGT 298 (344) Q Consensus 219 ~~~~~~~~~~~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~ 298 (344) ..+.-..+.. .+..|+.++|.+|++...+|....+... .| ++ .+|-.-|++. 
T Consensus 222 ~li~~i~~sd----~~~~i~ty~G~~VIvDD~~Pv~~~~a~~------------~y-----------tt-Ylfg~GAi~~ 273 (367) T protein:vir:80 222 DEIEFIPDSK----GQLTIPTYMGKVVIVDDGMPVFGTGADK------------TY-----------LS-ILFGGAAFGY 273 (367) T ss_pred cccccccCCC----CccccceecceeEEEeCCCcccccCCCc------------eE-----------EE-EEEecceeee Confidence 5332222211 1456899999999999999975432110 01 11 2334445554 Q ss_pred hhhhee-eeeeeecchhh----hhhhhh-----hhhhcCceeccccEEEE-Eec-------CCC Q lcl|NC_015719. 299 VKLKDL-ALERARRAEYQ----ADQIIA-----KYAMGHGGLRPESAGAL-VFK-------AGA 344 (344) Q Consensus 299 ~~~~~~-~~e~~~~~~~~----~d~i~~-----~~~~G~~v~Rp~~~~~l-~~~-------~~a 344 (344) ....+. -+|..||+... .|.+.. +|.+|.+-....-+.-- ..+ ..+ T Consensus 274 ~~~~~~~~~E~~Rd~~~~~~gG~d~L~~Rr~~~~hP~G~s~~~~~v~~~~~~~~~~~~~~~~~s 337 (367) T protein:vir:80 274 ADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPA 337 (367) T ss_pred cccCCccceecccchhhhcCCceEEEEeeeeEEeecceeeecccccccccccccccccccccCC Confidence 443322 25888888753 144432 33444443322111000 000 000 No 179 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=97.27 E-value=0.00011 Score=42.29 Aligned_cols=289 Identities=14% Similarity=0.080 Sum_probs=145.6 Q ss_pred CCCccccccccccccccccccchhhhhH--HHHhhHHHHHHHHhhhhc--CCceee-ec-----ccccEEEEeecCccee Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFL--KVFGGEVLTAFARTSVTA--NRHMQR-QI-----SSGKSAQFPVIGRTKA 70 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~--e~f~geV~~~f~~~s~~~--~~~~~~-~i-----~~G~tv~i~~iG~~t~ 70 (344) ||. ||. +|. +.. |+|...|.+...+.+.|. +.+... 
.+ .+|+.+.+|..+...- T Consensus 1 Ma~--------T~l------~D~--iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~L~g 64 (349) T protein:vir:78 1 MAI--------TTI------GDI--VTGNIPVLASYMTEDPVEKTAFFDSGILTSTPYAAEIANGPSNIANLPFWKAIDT 64 (349) T ss_pred CCc--------eEE------eee--eccCHHHHHHHHHHhhHHhhhhhhccceeccHHHHHHhhcCCCEEEeeeeecCCC Confidence 774 222 222 233 478888888887765553 433322 22 4699999999987542 Q ss_pred e-e--eeC-CCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_015719. 71 A-Y--LQP-GESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLIN 146 (344) Q Consensus 71 ~-~--~~~-g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~ 146 (344) . . |-. +..=+.++..++..+.. -+=..+-.+|...|+-..-+--|+|..+..+-+.--.|...+.++..|.+.-+ T Consensus 65 ~~e~nv~~D~~~~~~t~~kitt~~~~-a~~~~r~kaw~~~Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~ 143 (349) T protein:vir:78 65 SIEPNYSNDVYQDIATPRAIQTGEMM-ARVAYLNEGFGQADLTVELTSQNPLQSVASRLDNFWQRQAQRRLIATALGLYN 143 (349) T ss_pred CcccccCCCCccccccccccccccee-eeeeeeccccchhHHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHHHHhhc Confidence 1 1 110 00001112233333322 22234555678888888878779999999998888877766666655543322 Q ss_pred cccccccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcC--CCcCC-CEEEeCHHHHHHHhccchhhh Q lcl|NC_015719. 147 LADGVNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNY--VPAND-RTFYTTPDVYSAILAALMPNA 223 (344) Q Consensus 147 ~~~~~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~--VP~~g-R~~vv~P~~~~~Ll~~~~~~~ 223 (344) .... ......+....+.+.++.+..+ .+.+.+|..+|...- -..+. ..+++.+.+|..|.+...+. 
T Consensus 144 ~~~~-a~~~~~~~~~~t~d~s~~a~~~---------~~~~~dA~~~lgda~~Gd~~~~lt~i~mHS~v~~~L~~~~li~- 212 (349) T protein:vir:78 144 DNVS-ATDAYHEQNDMVVDVSATLGFD---------AGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLID- 212 (349) T ss_pred cccc-ccchhhhcccceeeeccccCCC---------hhhhhhhHHHHHHHhccccccceeEEEEchHHHHHHHhhhhhh- Confidence 1111 0111111112222222222111 233445655655541 11222 46889999999988654432 Q ss_pred hccccccccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhhe Q lcl|NC_015719. 224 ANYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKD 303 (344) Q Consensus 224 ~~~~~~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~ 303 (344) |.. ..-++..|..++|..|++...+|....++.. .| .-.+|-+-|++.....+ T Consensus 213 --~i~-~s~~~~~i~ty~G~~VivDD~~Pv~~~g~~~------------~y------------ttylfg~GAi~~~~~~~ 265 (349) T protein:vir:78 213 --FIR-DAENNTMFATYQGYRVIVDDSMTVVGQGAQR------------KF------------ISIIFGQGAIGYGEGNP 265 (349) T ss_pred --hcc-CcccCcccceecCeEEEEeCCCccccCCCCc------------eE------------EEEEeecceEEEccCCC Confidence 211 1123556889999999999999975422110 01 11333445555554332 Q ss_pred -eeeeeeecchhh----hhhhhhhhhhcC--ceeccccEEEEE-------ecCC-C Q lcl|NC_015719. 304 -LALERARRAEYQ----ADQIIAKYAMGH--GGLRPESAGALV-------FKAG-A 344 (344) Q Consensus 304 -~~~e~~~~~~~~----~d~i~~~~~~G~--~v~Rp~~~~~l~-------~~~~-a 344 (344) +.+|..||+... .|.+..+++|.. +-+..+.+.+.. 
...+ + T Consensus 266 ~~~~et~rd~~~g~~~G~d~l~~R~~~~~hp~G~s~~~a~v~~~~~~~~~~sPt~a 321 (349) T protein:vir:78 266 VMPLEYEREASRANGGGVETLWTRKTWLLHPFGYRFTSAVITGNGTETIARSASWQ 321 (349) T ss_pred ccceeeecccccCCcceeEEEEEeeEEEeeeeeeeeccccccCCccccccCCCChH Confidence 236777777543 366655444332 222222222110 0000 0 No 180 >protein:vir:98871 Length: 314 # NCBI annotation: major capsid protein # Family: family:all:3269 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164418;genbank:gi:56694908;genbank:GeneID:3197261 Probab=97.26 E-value=4.9e-05 Score=44.21 Aligned_cols=283 Identities=11% Similarity=0.089 Sum_probs=133.0 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCcee-----eecccccEEEEeecCcc--eee-e Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQ-----RQISSGKSAQFPVIGRT--KAA-Y 72 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~-----~~i~~G~tv~i~~iG~~--t~~-~ 72 (344) +-|+|-.++.-++... -++ .|-|+|.|-+.+-|+.++.|++..-- .-+...++.---....+ -++ . T Consensus 11 ~~~~~~~~~~t~N~n~-----avr-~Y~Kqf~glL~~vf~~qa~F~~~FGg~lQalDGV~~N~tafsvKtsD~pVVig~~ 84 (314) T protein:vir:98 11 LNNIQFFASGTANQNK-----AAR-SYQKEFRQLLQAVFRSQAYFRDFFGGGIEALDGVQHNDTAFYVKTSDIPVVVGNE 84 (314) T ss_pred ccceeeeeeccccCcc-----cee-eecHHHHHHHHHHHhhHhhhhhhcccceeeccCCCccceEEEEeecccceeecCc Confidence 5555544432232222 122 48899999999999999999876542 12222222111111111 111 2 Q ss_pred eeCCCCC---CCCcCCcccceE--EEEeeee-ee-eceec-cchHHHHhChh---HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015719. 73 LQPGESL---DDKRKDIKHTEK--TINIDGL-LT-ADVLI-YDIEDAMNHYD---VRSEYTSQIGESLAMAADGAVLAEL 141 (344) Q Consensus 73 ~~~g~~~---~~~~~~~~~~~~--~l~iD~~-~~-~~~~I-dd~D~~q~~~d---~~~~~~~~~~~aLa~~~D~~i~~~~ 141 (344) |..+... .++.......++ .+..|+. 
.| +.+.| .-+|..--+-| ..++..+.++.|-++.+|..+-.-| T Consensus 85 Y~TdeNvaFGtGTg~SsRFGprkEi~y~dtdVpY~~~~~iHEGiD~~TVNnd~~aaVAdRL~LQA~Akt~~~n~~~Gk~l 164 (314) T protein:vir:98 85 YNKDENVGFGEGTSRSTRFGPRREIIYQDTPVPYTWEWVYHEGIDKHTVNNDFQAAVADRLDLQANAKIKQFNAQHSKFI 164 (314) T ss_pred ccCCCCcccccCCccccccCceeEEEeecccccccccchhhhccccccccCChhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3332211 000000011111 1222221 11 11111 12333222323 3455667778888888887664333 Q ss_pred HHhhhcccccccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchh Q lcl|NC_015719. 142 AGLINLADGVNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMP 221 (344) Q Consensus 142 ~~~a~~~~~~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~ 221 (344) ...+..+. .. ++.+. ..+...+-.+.++.-...|- ..-.+.|.|++|.+|..++.. T Consensus 165 S~~As~te----~l--------------td~~~-----d~V~~LF~~as~~yvn~ev~-~~~~AyV~~evYnaiiD~~l~ 220 (314) T protein:vir:98 165 SSIAEKTE----TL--------------TDYSA-----DNVLRLFNELSKYYVNIEAI-GTKAAKVSPELYNAIVDHPLT 220 (314) T ss_pred Hhhhhhhh----hh--------------hhcch-----hhHHHHHHHHHhhhhcceee-EEEEEEEchhHHhHhhccccc Confidence 22221110 00 11111 11223333444445444442 225577999999999998877 Q ss_pred hhhccccccccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhh Q lcl|NC_015719. 222 NAANYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKL 301 (344) Q Consensus 222 ~~~~~~~~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~ 301 (344) +...-.+-+.=.+| |.+.-||-|-|.|.-...... .+..+ -+-+|..|. . 
T Consensus 221 TsaK~SsaNIDeng-i~~FkGf~i~e~P~~~~q~g~-------ia~~s--------------~dnig~aft--------G 270 (314) T protein:vir:98 221 TSAKSSSANIDQNG-IVNFKGFAIQEIPESMLQSGD-------VAYTY--------------ITNIGKAFT--------G 270 (314) T ss_pred cccccceeeeccCC-cceecceEEEecchhhcCCCc-------EEEEc--------------cccceeecc--------c Confidence 76543333333455 558899999987754332110 00000 011222211 0 Q ss_pred heeeeeeeecchhhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 302 KDLALERARRAEYQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 302 ~~~~~e~~~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) +. .......+++-|-.+-|-=-||--++.-...++++++.|- T Consensus 271 In-~aR~IesEdF~GValQgAGK~G~~I~edNk~Ai~k~t~tp 312 (314) T protein:vir:98 271 IN-TSRIIESEDFDGVALQGAGKAGEFILDDNKKAVAKVTSTP 312 (314) T ss_pred ce-eeeeeecccccceeeecccccccccccccceeeEEEecCC Confidence 11 1222233344455555555577778888888888877776 No 181 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=97.00 E-value=0.00013 Score=41.95 Aligned_cols=282 Identities=11% Similarity=0.033 Sum_probs=112.6 Q ss_pred CCCcccccccc--ccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec-CcceeeeeeCCC Q lcl|NC_015719. 1 MANMQGGQQLG--TNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI-GRTKAAYLQPGE 77 (344) Q Consensus 1 ma~~~~~~~~~--~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~ 77 (344) ++......+.. .+.......+. +-...+...+.+.+...+.+.+.++..++. 
...++.- ....+..+..|+ T Consensus 226 ~~~~~~~~~~~~~~~~~~~~~~~~---~~p~~~~~~i~~~~~~~~~i~~~~~~~~i~---~~~~~~~~~~~~a~~~~eG~ 299 (517) T protein:vir:97 226 SASLTKDPKAAWTAELKERGISGM---PAPAGILKRIQDAVNDEGSLLPFIRHENLP---TLVVGGDNALTQGTGHTTGT 299 (517) T ss_pred Hhcccccccceeeeeccccccccc---ccchHHHHHHHHhhhhhccceeeeeecccc---ceeeecccccceeeeeecCC Confidence 11111111000 00000000010 112334444555555555555555544432 2333322 112333455555 Q ss_pred CCCCCcCCcccceEEEEeeeeeeec-eeccchHHHHhChh----HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|NC_015719. 78 SLDDKRKDIKHTEKTINIDGLLTAD-VLIYDIEDAMNHYD----VRSEYTSQIGESLAMAADGAVLAELAGLINLADGVN 152 (344) Q Consensus 78 ~~~~~~~~~~~~~~~l~iD~~~~~~-~~Idd~D~~q~~~d----~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~ 152 (344) ..+.+ +++..++++.+-+ +.. +.+..---..+.+| +.+-+..+.+++|+++.++.++.- .. .. T Consensus 300 ~kp~s--~~tf~~~~~~~~~--ia~~~~~S~qll~Ds~~dd~~~l~s~i~~~l~~~l~~~ee~a~l~G----dG----tg 367 (517) T protein:vir:97 300 DKTES--NITLQTRVLTPQY--VYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMG----GV----TG 367 (517) T ss_pred ccccc--ccceeeEEeeHhh--hhhhhhhhHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHhcc----cC----CC Confidence 54432 3444555554422 222 22322111112334 777788999999999999888621 00 00 Q ss_pred cccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcccccccc Q lcl|NC_015719. 153 ENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALIDP 232 (344) Q Consensus 153 ~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~ 232 (344) .+.. + .+......... +......+.+.+......+.+ ..+-.+|++|..|..|.+-.. 
.+..|.=...+ T Consensus 368 ~~~~----g-i~~~a~~~~~~-~~~~~~~~~d~i~~l~~a~~~----a~~a~~vmn~~t~~~I~klKD-~~G~Yl~~~~~ 436 (517) T protein:vir:97 368 VSET----Q-IYPVVGDAWAT-NVTGTTNIQELLEKLSVATPK----AADSTLVIHRNDLAAIRFLKD-KNGNYVFPVGV 436 (517) T ss_pred cccc----c-ccccccccccc-cccccchHHHHHHHHHHHhhh----ccCCEEEECHHHHHHHHHhhc-CCCCeeccCcC Confidence 0110 0 11111000000 000111122222222222222 224457899999998865433 23334333344 Q ss_pred ccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecc Q lcl|NC_015719. 233 ERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRA 312 (344) Q Consensus 233 ~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~ 312 (344) ..+.+..++|+.-+.. .++.+. ...+ . ...| . ++.. +.+..-..+|- T Consensus 437 ~~~~~~~l~G~~~~~~-~~~~~~---~~~~-------~---~~~y--------~--i~~~---------~g~~~~~~fd~ 483 (517) T protein:vir:97 437 SNQTIATHFGFNRLVQ-SVAVDE---KTAV-------S---LSGY--------V--TNGS---------RGMEFEQGTIL 483 (517) T ss_pred CcccccccCCcccccc-ccccCc---eeEe-------e---cccc--------E--EEee---------cceeeeeeeec Confidence 5666667777432221 122111 0000 0 0000 0 0000 00011011111 Q ss_pred hhhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 313 EYQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 313 ~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) .+-.+.+...++.|..|+.|++++..+.+..+ T Consensus 484 ~~n~~~f~~~~~~~g~i~~~~r~a~~~~~p~~ 515 (517) T protein:vir:97 484 VENNKEYLFEMPISGSLEYKGTTAYGTYTPPV 515 (517) T ss_pred ccCceeEeeeeeeccccccccceEEEEEcCCC Confidence 12233355556788889999998887777766 No 182 >protein:vir:94528 Length: 286 # NCBI annotation: major head protein # Family: family:all:3269 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223889;genbank:gi:62327101;genbank:GeneID:5075544 Probab=96.93 E-value=0.00023 Score=40.51 Aligned_cols=269 Identities=16% Similarity=0.070 Sum_probs=131.6 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCcee----eecccccEEEEeecCc--ceeeeee Q lcl|NC_015719. 
1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQ----RQISSGKSAQFPVIGR--TKAAYLQ 74 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~----~~i~~G~tv~i~~iG~--~t~~~~~ 74 (344) |+..+. + . -++ .|-|+|.|-+.+-|+.++.|++..-- .-+.+.++.---.... +-++.|. T Consensus 1 m~t~N~--n------~-----avr-~Y~Kqf~glL~~vf~~qa~F~~~fgglQalDGV~~N~tafsvKt~D~pVVig~Y~ 66 (286) T protein:vir:94 1 MATTNN--D------L-----PVR-VYSKEFLQLLSTVYQAQSVFTPTFGALQALDGVPNNATAFSVKTNDMAVVVGEYS 66 (286) T ss_pred CCCCcc--c------c-----cee-ehhHHHHHHHHHHHhhHHHhhhhhcchhhhhCCCccceEEEEeecCcceEEeccc Confidence 665432 1 1 112 48899999999999999999866542 1222222211111111 1234454 Q ss_pred CCCCCC---CCcCCcccceE--EEEeeee-ee-eceec-cchHHHHhChh---HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015719. 75 PGESLD---DKRKDIKHTEK--TINIDGL-LT-ADVLI-YDIEDAMNHYD---VRSEYTSQIGESLAMAADGAVLAELAG 143 (344) Q Consensus 75 ~g~~~~---~~~~~~~~~~~--~l~iD~~-~~-~~~~I-dd~D~~q~~~d---~~~~~~~~~~~aLa~~~D~~i~~~~~~ 143 (344) .+...- ++.......++ .+..|+. .| +.+.| .-+|..--+-| ..++..+.++.|-.+.+|..+-.-|.. T Consensus 67 TdeNv~FGtgTg~SsRFG~rkEi~y~dtdV~Y~~~~~iHEGiD~~TVNnd~~aaVAdRL~lQA~Akt~~~n~~~Gk~ls~ 146 (286) T protein:vir:94 67 TDANTAFGTGTSNSSRFGEMKEVIYADTDVPYTAGWAIHEGLDQMTVNNDLDAAVADRLNLQAQAKTRLFNVAMGEALAT 146 (286) T ss_pred CCCccccccCCccccccCceeeEEeecccccccccchhhhccccccccCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 432211 11001111111 1222221 11 11111 12333333333 345566677888888888765433322 Q ss_pred hhhcccccccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhh Q lcl|NC_015719. 144 LINLADGVNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNA 223 (344) Q Consensus 144 ~a~~~~~~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~ 223 (344) .+.. .+.. +.+...+-.+.++.-...|-.. .-+.|.|++|.+|..++..+. 
T Consensus 147 ~A~~---------------------------t~~~-D~V~~LF~~as~~yvn~ev~~~-~~ayV~~evYnaiiD~~l~Ts 197 (286) T protein:vir:94 147 AGTD---------------------------LGAV-DDVNALFESAVEKYTDLEVIAP-VRAYVTASVYNAIIDLANVTT 197 (286) T ss_pred hhhh---------------------------hhhh-hhHHHHHHHHHHHhhhhheeee-eEEEEchhHHHHHhccccccc Confidence 1111 0000 1223334445555555555322 238899999999999887776 Q ss_pred hccccccccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhhe Q lcl|NC_015719. 224 ANYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKD 303 (344) Q Consensus 224 ~~~~~~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~ 303 (344) ..-.+-+.=.+| |.+.-||-|-|.|.-...+.. ++|+++-++.+-.-. T Consensus 198 aK~SsaNiDeng-i~~FkGf~i~e~P~~~~~g~~-------------------------------aifs~dnig~aftGI 245 (286) T protein:vir:94 198 AKNSAVNIDTNG-MLSFRGIAITKVPTQYMGGKA-------------------------------VIFAPDNVARVFTGI 245 (286) T ss_pred cccceeeeccCC-cceecceEEeecchhhccCce-------------------------------EEEccccceeeeccc Confidence 543333333455 558899999988742111100 111222111111111 Q ss_pred eeeeeeecchhhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
304 LALERARRAEYQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 304 ~~~e~~~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) -.......+++-|-.+.|-=-||--++.-...++++....| T Consensus 246 n~aR~IesEdF~GValQgAGK~G~~I~edNk~Ai~~~~~k~ 286 (286) T protein:vir:94 246 NIARTIQAIDFAGVELQGAGKYGTFILDDNKKAIFTATPKA 286 (286) T ss_pred eeeeeeeccccCceeeeccccccccccccCceeEEEeecCC Confidence 11222233444555566666688788888888888888888 No 183 >protein:vir:3969 Length: 287 # NCBI annotation: major capsid protein # Family: family:all:3269 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663677;genbank:gi:21716114;genbank:GeneID:951200 Probab=96.89 E-value=0.00018 Score=41.07 Aligned_cols=268 Identities=15% Similarity=0.124 Sum_probs=133.8 Q ss_pred ccchhhhhHHHHhhHHHHHHHHhhhhcCCcee-----eecccccEEEEeecCc--ceeeeeeCCCCCC---CCcCCcccc Q lcl|NC_015719. 20 AADKLALFLKVFGGEVLTAFARTSVTANRHMQ-----RQISSGKSAQFPVIGR--TKAAYLQPGESLD---DKRKDIKHT 89 (344) Q Consensus 20 ~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~-----~~i~~G~tv~i~~iG~--~t~~~~~~g~~~~---~~~~~~~~~ 89 (344) -+ ++ .|-|+|.|.+.+-|+.+|.|++..-- .-+...++.---.... +-++.|..+...- ++.+..... T Consensus 1 ~a-vr-~y~Kq~~glL~~vf~~qa~F~~~FGg~lQ~~DGV~~N~taf~vKtsD~pVVi~~Y~Td~Nv~FGtGTg~ssRFG 78 (287) T protein:vir:39 1 MA-IK-YFTKQYAGMLPDLFAKKSAFLRAFGGVLQVKDGVTENDTFMELKVSDTDVVIQAYSTDANVGFGSGTGNTSRFG 78 (287) T ss_pred CC-cc-cccHHHHHHHHHHHHHHHhhhhhcccceeeecCCcccceEEEEEecCcceEEecccCCCCcccccCCCcccccc Confidence 01 11 37799999999999999999866542 1233333321111111 1234454432210 000000111 Q ss_pred eE--EEEeeee-ee-eceec-cchHHHHhChh---HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccCc Q lcl|NC_015719. 90 EK--TINIDGL-LT-ADVLI-YDIEDAMNHYD---VRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGLGKP 161 (344) Q Consensus 90 ~~--~l~iD~~-~~-~~~~I-dd~D~~q~~~d---~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~~~~ 161 (344) ++ .+.+|+. .| +...| .-+|..--+-| ..++..+.++.|-++.+|..+-.-|...+.... 
T Consensus 79 ~rkEi~y~dt~V~Y~~~~~ihEGiD~~TVNnd~~aaVAdRL~Lqa~A~t~~~n~~~Gk~ls~~A~~t~------------ 146 (287) T protein:vir:39 79 QRKEVKSVNKQVSYDAPLAINEGIDDFTVNDIKDQVVAERLALHGVAWAQHVDKLLGKLLSDSASETL------------ 146 (287) T ss_pred ceeEEEEecccccceeccccccccccccccCChhHHHHHHHHhHHHHHHHHHHHHHHHHHHhhcchhe------------ Confidence 11 1222221 11 11111 11222222222 455667778889999999776443332221110 Q ss_pred eeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCE-EEeCHHHHHHHhccchhhhhccccccccccceeEEE Q lcl|NC_015719. 162 SLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRT-FYTTPDVYSAILAALMPNAANYAALIDPERGSIRNV 240 (344) Q Consensus 162 ~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~-~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg~i 240 (344) .+ ..++ ..+...+-++..++-.++|.....| +.|+|++|.+|..++..+.+--.+-+.=.+| |.+. T Consensus 147 ---~~----~~t~-----d~V~~LF~~a~~~yvNn~v~~~~~~~AyV~aevYnaiiD~~l~TsaK~SsaNiDen~-i~kF 213 (287) T protein:vir:39 147 ---TV----KLDE-----DSVTKLFSDAHKKFVNNNVSIAVPWVAYVNADIYDLLIDSKLATTAKNSSANVDEQT-LYKF 213 (287) T ss_pred ---ee----eecc-----cchHHHHHHHHHHhhccceeeEEEEEEEEChhHHhHHhccccccccccceeeeccCC-ccee Confidence 00 0111 1123445577777777777655655 6699999999999887775543333333455 5588 Q ss_pred eCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchhhhhhhh Q lcl|NC_015719. 241 MGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEYQADQII 320 (344) Q Consensus 241 ~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~ 320 (344) -||-+-|.|.-..-.+ . ...|+++-++.+-.-.-.......+++-|-.+- T Consensus 214 kGf~l~e~P~~~~q~g------~------------------------~a~fs~dnig~af~GI~vaR~i~sEdF~GvalQ 263 (287) T protein:vir:39 214 KGFILSELPDEKFQLN------E------------------------GAYFAADNVGVAGVGIQVTRAMDSEDFAGTALQ 263 (287) T ss_pred cceEEEecchHhhccC------c------------------------EEEEccccceeecccceeEEeeecccccceeee Confidence 9999998873321110 0 011112111111111112223334455556666 Q ss_pred hhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
321 AKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 321 ~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) |---||--++.....++++++.+- T Consensus 264 gAgK~G~~i~e~Nk~Ai~k~t~~k 287 (287) T protein:vir:39 264 AAAKYGKYLPEKNKKAILKATVTK 287 (287) T ss_pred cccccccccccccceEEEEEecCC Confidence 666677778888888888777666 No 184 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=96.63 E-value=0.00045 Score=38.92 Aligned_cols=289 Identities=13% Similarity=0.085 Sum_probs=145.1 Q ss_pred CCCccccccccccccccccccchhhhhH--HHHhhHHHHHHHHhhhhc--CCceee-ec-----ccccEEEEeecCccee Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFL--KVFGGEVLTAFARTSVTA--NRHMQR-QI-----SSGKSAQFPVIGRTKA 70 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~--e~f~geV~~~f~~~s~~~--~~~~~~-~i-----~~G~tv~i~~iG~~t~ 70 (344) ||.. |. +|. +.. |+|...|.+...+.+.|. +.+... ++ .+|+.+.+|..+...- T Consensus 1 Ma~T--------~l------~D~--iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~l~g 64 (349) T protein:vir:94 1 MAIT--------TI------GNI--VTGNIPVLASYMTEDPVEKTAFFNSGILTPTPYAAEIARGPSNIANLPFWKAIDT 64 (349) T ss_pred CCce--------EE------eee--eccChHHHHHHHHHhHHHhhhhhhccceeccHHHHHHHhcCCCEEEeeeeecCCC Confidence 7742 22 222 233 478888888887765553 444432 22 4699999999876432 Q ss_pred e---eeeCCCCC-CCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_015719. 71 A---YLQPGESL-DDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLIN 146 (344) Q Consensus 71 ~---~~~~g~~~-~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~ 146 (344) . .|...++. +.++..++..+.. 
-+=..+-.+|...|+-..-+--|+|..+..+-+.--.|...+.++..|.+.-+ T Consensus 65 ~~e~n~~~dt~~~~~t~~kit~~~~~-a~~~~r~kaw~~~Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~ 143 (349) T protein:vir:94 65 SIEPNYSNDVYQDIATPRAIQTGEMM-ARVAYLNEGFGQADLTVELTSQNPLQSVASRLDNFWQRQAQRRLIATALGLYN 143 (349) T ss_pred CcccccCCCCccccccccccccccee-eeeeeeccccchhHHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHHHhhhc Confidence 1 12211111 0111223333222 12234455677888888888779999999999988888766666655543322 Q ss_pred cccccccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcC--CCcCC-CEEEeCHHHHHHHhccchhhh Q lcl|NC_015719. 147 LADGVNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNY--VPAND-RTFYTTPDVYSAILAALMPNA 223 (344) Q Consensus 147 ~~~~~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~--VP~~g-R~~vv~P~~~~~Ll~~~~~~~ 223 (344) ...... ............+++.+.. .++ .+.+|..+|-.+- -..+. -.+++.+.+|..|.+...+.- T Consensus 144 ~~~~~~-~~~~~~~~~~~d~~~~a~~-----~~~----~~~~A~~~~Gdaa~Gd~~~~lt~i~mHS~v~~~L~~~~li~~ 213 (349) T protein:vir:94 144 DNVSAT-DAYHEQNDMVVDVSATSGF-----DAG----AFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLIDF 213 (349) T ss_pred cccccc-ccccccCceeEEecccCCC-----Chh----hHHHHHHHHHHHhccccccceeEEEEchHHHHHHHhcchhhh Confidence 111111 1111112222332222211 122 3344555554431 11222 468899999999887654322 Q ss_pred hccccccccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhhe Q lcl|NC_015719. 224 ANYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKD 303 (344) Q Consensus 224 ~~~~~~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~ 303 (344) .. ..-++..|..++|..|++...+|....++.. 
.| .-.+|-+-|++.....+ T Consensus 214 i~----~s~~~~~i~ty~G~~VivDD~~Pv~~~g~~~------------~y------------ttylfg~GAi~~~~~~~ 265 (349) T protein:vir:94 214 IR----DAENNTMFATYQGYRVIVDDSMTVVGQDTSR------------KF------------ISIIFGQGAIGYGEGNP 265 (349) T ss_pred cc----CcccCcccceecCcEEEEeCCCccccCCCCc------------eE------------EEEEeecceEEeecCCC Confidence 11 1113445789999999999999975422210 01 11233344555555432 Q ss_pred -eeeeeeecchhh----hhhhhhh-----hhhcCceeccccEE----EEEecCC-------C Q lcl|NC_015719. 304 -LALERARRAEYQ----ADQIIAK-----YAMGHGGLRPESAG----ALVFKAG-------A 344 (344) Q Consensus 304 -~~~e~~~~~~~~----~d~i~~~-----~~~G~~v~Rp~~~~----~l~~~~~-------a 344 (344) +.+|..||+... .|.+..+ |.+|.+-..+.... .+....+ + T Consensus 266 ~~~~E~~rd~~~g~~~G~d~L~~R~~~~~hp~G~s~~~a~v~~~~~~~~~~sPt~aeLa~~~ 327 (349) T protein:vir:94 266 EMPLEYEREASRANGGGVETLWTRKTWLLHPFGYSFTSAVITGNGTETIARSASWQDLANAA 327 (349) T ss_pred CcceeeecccccCCcceeEEEEEeeEEEeeeeeeeecccccCCCccccccCCCChHHhcCCc Confidence 246777777543 3555553 44444433221110 0000000 0 No 185 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=96.61 E-value=0.00038 Score=39.31 Aligned_cols=294 Identities=11% Similarity=0.025 Sum_probs=134.1 Q ss_pred CCCccccc-------cccccccccccccchhhhhHHHHhhHHHHHHH----HhhhhcCCceeee-cc-cccEEEEee--- Q lcl|NC_015719. 1 MANMQGGQ-------QLGTNQGKGQSAADKLALFLKVFGGEVLTAFA----RTSVTANRHMQRQ-IS-SGKSAQFPV--- 64 (344) Q Consensus 1 ma~~~~~~-------~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~----~~s~~~~~~~~~~-i~-~G~tv~i~~--- 64 (344) |-+++.-- ....+.|--.++.+..++|+.+...+|+.... ..-..+.++..++ +- +-.++.+.. 
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~da~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~ 80 (319) T protein:vir:10 1 MTTKKFDEADKSNVEMYLIQAGVKQDAAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVTTELSPTDKTFEYMTFDK 80 (319) T ss_pred CCCcchhHHhhHHHHHHHhhccchhhhhhhhhhHHHHHHHHHHHHHHhhhhcceechhhcccccCCCCceEEEEeeeecc Confidence 66666431 01112222233333444665444345554443 2334455555542 22 233444433 Q ss_pred cCcceeeeeeC-CCCCCCCcCCcccceEEEEeeee-eeeceeccchHHHH-hChhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015719. 65 IGRTKAAYLQP-GESLDDKRKDIKHTEKTINIDGL-LTADVLIYDIEDAM-NHYDVRSEYTSQIGESLAMAADGAVLAEL 141 (344) Q Consensus 65 iG~~t~~~~~~-g~~~~~~~~~~~~~~~~l~iD~~-~~~~~~Idd~D~~q-~~~d~~~~~~~~~~~aLa~~~D~~i~~~~ 141 (344) +|..+ -|.. .++++.. +..-++....|-.. .-+.+.+.+++.++ ...++-.+-...++.++++..|+.++.-. T Consensus 81 ~G~a~--~~~d~~~dip~v--~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~ 156 (319) T protein:vir:10 81 VGTAQ--IIADYTDDLPLV--DALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQLVNRLVFKGS 156 (319) T ss_pred cccee--eecCccccccce--eccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeec Confidence 34443 2322 2223321 22333444444332 23334456666665 47778888888999999999999887432 Q ss_pred HHhhhcccccccccccccCc-e--eeecccccccccchhhHHHHHHHHHHHHHHHhhc--CCCcCCCEEEeCHHHHHHHh Q lcl|NC_015719. 142 AGLINLADGVNENIAGLGKP-S--LLEVGAKADLTDPVKLGQAVIAQLTIARAALTKN--YVPANDRTFYTTPDVYSAIL 216 (344) Q Consensus 142 ~~~a~~~~~~~~~~~~~~~~-~--~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~--~VP~~gR~~vv~P~~~~~Ll 216 (344) .+ ....|+-.. + ....+.... -.++..+.|+++|..+..+|.++ .+ ..--.++|+|+.|..|. 
T Consensus 157 ~~---------~g~~GLlN~p~~~~~~~~~~~~--~~t~t~~~i~~di~~~~~~l~~~s~g~-~~p~~L~L~p~~~~~L~ 224 (319) T protein:vir:10 157 AP---------HKIVSVFNHPNITKITSGKWID--VSTMKPETAEAELTQAIETIETITRGQ-HRATNILIPPSMRKVLA 224 (319) T ss_pred cc---------ccceeEEeCCCceeeecCCCCC--ccccCHHHHHHHHHHHHHHHHHhcCce-eeceEEEecHHHHHhhh Confidence 11 111221111 1 111111111 11223456889999998888765 33 11236889999999885 Q ss_pred ccchhhhhccccccccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHH Q lcl|NC_015719. 217 AALMPNAANYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAV 296 (344) Q Consensus 217 ~~~~~~~~~~~~~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av 296 (344) . +..+.+..--.-+.+ +.-+++|...+.|...++.+. .....| ...++-+ T Consensus 225 ~--~~~~~~~t~l~~lk~----~~~~l~I~~~pel~~ag~~g~------------~~~v~y------------~~~~~~~ 274 (319) T protein:vir:10 225 I--RMPETTMSYLDYFKS----QNSGIEIDSIAELEDIDGAGT------------KGVLVY------------EKNPMNM 274 (319) T ss_pred c--ccCCCCeeHHHHHHH----hcCCceEEEeeeecccCCCcc------------eEEEEE------------ecCCceE Confidence 2 111111100011111 224667777777764321110 000111 0011111 Q ss_pred hhhhhheeeeeeeecchhhhhhhhhhhh-hcCceeccccEEEEEec Q lcl|NC_015719. 297 GTVKLKDLALERARRAEYQADQIIAKYA-MGHGGLRPESAGALVFK 341 (344) Q Consensus 297 ~~~~~~~~~~e~~~~~~~~~d~i~~~~~-~G~~v~Rp~~~~~l~~~ 341 (344) ...-.+++++... 
.++.....+....+ .|.-+.||++++.+.== T Consensus 275 ~~~v~~~~~~~~~-e~~~l~~~~~~~~r~~Gv~i~~P~ai~~~dGI 319 (319) T protein:vir:10 275 SIEIPEAFNMLPA-QPKDLHFKVPCTSKCTGLTIYRPMTIVLITGV 319 (319) T ss_pred EEecCcceeeeee-eecCceEEEeeeeeeEEEEEEccceeEeeecC Confidence 1111222222221 22334455555555 45789999988765422 No 186 >protein:vir:5942 Length: 523 # NCBI annotation: similar to major head protein # Family: family:all:364 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835728;genbank:gi:30044131 Probab=96.32 E-value=0.00075 Score=37.72 Aligned_cols=313 Identities=11% Similarity=-0.040 Sum_probs=136.5 Q ss_pred CCCcc-----------------ccccccccccccccccchhhhhHHHHhh---HHHHHHHHhhhhcCCceeeecccccEE Q lcl|NC_015719. 1 MANMQ-----------------GGQQLGTNQGKGQSAADKLALFLKVFGG---EVLTAFARTSVTANRHMQRQISSGKSA 60 (344) Q Consensus 1 ma~~~-----------------~~~~~~~~~g~~~~~~d~~~l~~e~f~g---eV~~~f~~~s~~~~~~~~~~i~~G~tv 60 (344) |+... ......+++.++. .+..-+..++.|.+ ..-++|........-.......+|..- T Consensus 162 ~s~si~k~~vTa~s~agta~~~li~A~~~q~itg~-tga~fa~s~~~an~astAss~Al~gEA~t~~sTd~at~~~Gtt~ 240 (523) T protein:vir:59 162 SSGAVYYVDVPVASLPGVADVNTVRFWQYDDASGD-PENTVAYPLPRYNRIVGAVGSALYARLFFVTGSDFATVAGGTPS 240 (523) T ss_pred cccceeeeecccccccccccccccccccccccccc-ccccccchhhccccccccccccccccccccccccccccCCCccc Confidence 22110 0000011111110 11111111222221 111111111000000000000000000 Q ss_pred -----EEeecCc--ceeeeeeCC-CCCCCCcCCcccceEEEEeeeeeeec--------eeccchHHHHh---ChhHHHHH Q lcl|NC_015719. 61 -----QFPVIGR--TKAAYLQPG-ESLDDKRKDIKHTEKTINIDGLLTAD--------VLIYDIEDAMN---HYDVRSEY 121 (344) Q Consensus 61 -----~i~~iG~--~t~~~~~~g-~~~~~~~~~~~~~~~~l~iD~~~~~~--------~~Idd~D~~q~---~~d~~~~~ 121 (344) -...++. .+..--..+ ....+. 
....-.+.-+.||+...-+ ..+.-..+.++ -.|.-.|+ T Consensus 241 t~~~~~lyt~~~g~~t~~~~~~~~~~~~~~-~~~~~~eM~FsIeK~tVtAkSRaLKAeYT~ELAQDLKAiH~GLDAE~EL 319 (523) T protein:vir:59 241 TQDLDLVYYIDARNDFEDQSTDPDYPDPGF-QSLDIPEINLELRSRPVATKTRKLRAAWTPEAMQDLAAYHKGVDLENEI 319 (523) T ss_pred ccccccccccccccchhhcccccccccccc-ccccccceeeEEEeEEEeeecccccccccHHHHHHHHHHhcCCChhHHH Confidence 0000000 000000000 000000 1123356678888764432 44544455555 38888899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccCceeeeccccccccc-chhhHHHHHHHHHHHHHHHhhc--CC Q lcl|NC_015719. 122 TSQIGESLAMAADGAVLAELAGLINLADGVNENIAGLGKPSLLEVGAKADLTD-PVKLGQAVIAQLTIARAALTKN--YV 198 (344) Q Consensus 122 ~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~~~~~~i~~~~~~~~t~-~~~~~~~i~~~l~~a~~~Ld~~--~V 198 (344) +.=.+..+..++.+-|++.|..-+..- + ..+.....+.++...++... ....+...++.+..+..++++. .+ T Consensus 320 anILStEImlEINR~ii~~~~~~a~~~----~-~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~n~i 394 (523) T protein:vir:59 320 VTLMSQYIAREIDLEILSTIMAHARRT----D-NYGFWSEVVGEYYDETSGNFVAGNFYGSKQEWLATLMIELNKVSNRI 394 (523) T ss_pred HHHHHHHHHHHhhHHHHHhHhhhheee----e-eccccccceeeecccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHH Confidence 999999999999999999887543221 1 11111111222222221110 0111111123333333333321 12 Q ss_pred C-----cCCCEEEeCHHHHHHHhccchhhhhccccccccccceeEEE-eCeEEEEecccccccccccccccccccccccc Q lcl|NC_015719. 199 P-----ANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSIRNV-MGFEVVEVPHLTAGGAGDDRPEEGTDASNQKH 272 (344) Q Consensus 199 P-----~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg~i-~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~ 272 (344) - -.+-|+|++|++.+.|-..+-+.............=.+|.+ .|++||.-++.|.. 
+.+ T Consensus 395 ~~~t~~~~~~~~~~s~~v~~~l~~~~~~~~~~~~~~~~~~~~~~g~l~~~~~vy~d~~~~~d----y~~----------- 459 (523) T protein:vir:59 395 QQKTAVAGANFLVTSPQVAALLESMPGFTPGNDNRDGGTGIFYVGMVQGRYRLYKNIYQNQP----VII----------- 459 (523) T ss_pred HHhcccccccEEEEchhHHHHHHhccccccCCccccccccceeEEEecCceEEEecCCCCcc----eEE----------- Confidence 1 14569999999999987766664332221111111134555 47799988876642 111 Q ss_pred ccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchhhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 273 AFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEYQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 273 ~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ..|.+....- -.||+|+|- ++..+ .....||+.|--.|-.+.|||-.|.+|...+.|+++--- T Consensus 460 --~g~k~~~~~~-~~~~~y~Py----~~l~~--~~~~~dp~s~qp~~~~~tRY~l~v~nP~~~~~~~~~~~~ 522 (523) T protein:vir:59 460 --MGNQDLNTPW-QTGAVYAPY----VPLLF--TPTIVDPVNFSYRRGLMTRYALEVVRPEFYGLLYVKLLQ 522 (523) T ss_pred --EEecccCCcc-cccceeccc----chhhc--ccccccCCcccceeeeeeehhheecchhHhhhhhhhhcC Confidence 1122211111 157888884 22222 223358999999999999999999999999887654322 No 187 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=96.29 E-value=0.00078 Score=37.62 Aligned_cols=274 Identities=12% Similarity=0.035 Sum_probs=128.7 Q ss_pred ccccccchhhhhH-HHHhhHHHHHHH----HhhhhcCCceeee-cc-cccEEEEeec---CcceeeeeeCC-CCCCCCcC Q lcl|NC_015719. 16 KGQSAADKLALFL-KVFGGEVLTAFA----RTSVTANRHMQRQ-IS-SGKSAQFPVI---GRTKAAYLQPG-ESLDDKRK 84 (344) Q Consensus 16 ~~~~~~d~~~l~~-e~f~geV~~~f~----~~s~~~~~~~~~~-i~-~G~tv~i~~i---G~~t~~~~~~g-~~~~~~~~ 84 (344) -+...+|....|+ +++. .|+.... ..-..+.++..++ +- +-.++.+... |..+ -|..+ .+++.. 
T Consensus 1 ~~~~~a~~~~~f~~~ql~-~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~--~~~~~~~dip~v-- 75 (296) T protein:vir:10 1 MGVDKADAAGIWTVKQLT-ASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDGVGIAQ--IVADYTDDLPLV-- 75 (296) T ss_pred CcccchhhhHHHHHHHHH-HHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeeccCcee--EeCCCcccccee-- Confidence 2223334433444 5554 4444443 2334455555543 21 2345554443 4433 33332 223321 Q ss_pred CcccceEEEEeeee-eeeceeccchHHHHh-ChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccCce Q lcl|NC_015719. 85 DIKHTEKTINIDGL-LTADVLIYDIEDAMN-HYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGLGKPS 162 (344) Q Consensus 85 ~~~~~~~~l~iD~~-~~~~~~Idd~D~~q~-~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~~~~~ 162 (344) +..-++....|-.. .-+.+.+.+++.++. ..++-......++.++++..|+.++.-... ....|+-... T Consensus 76 ~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~---------~g~~GLlN~p 146 (296) T protein:vir:10 76 DALATERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTA---------HGIPSVFDYP 146 (296) T ss_pred eccceeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeeccc---------ccceeEeecC Confidence 22333444444332 223344567766654 677888888889999999999888632110 1112221110 Q ss_pred eeec-ccccccccchhhHHHHHHHHHHHHHHHhhc--CCCcCCCEEEeCHHHHHHHhccchhhhhccccccccccceeEE Q lcl|NC_015719. 163 LLEV-GAKADLTDPVKLGQAVIAQLTIARAALTKN--YVPANDRTFYTTPDVYSAILAALMPNAANYAALIDPERGSIRN 239 (344) Q Consensus 163 ~i~~-~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~--~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg~ 239 (344) .++. .+.++=.+ ...++++|..+...|.++ .+= .--.++|+|+.|..|... .+. 
++ ..+.+=.--+ T Consensus 147 ~v~~~~~~~~W~~----~t~i~~Di~~~~~~l~~~s~g~~-~p~~l~L~p~~~~~L~~~---~~~-~~--~t~l~~ik~~ 215 (296) T protein:vir:10 147 NINNVVSGGSWSQ----PTTAVSDITSLLDIIETSTNGQH-RATHLLLPTTARRIMQNL---VPG-TS--VSYGEFFRQN 215 (296) T ss_pred CCccccccCCccC----HHHHHHHHHHHHHHHHHhhCcee-cceeEEeCHHHHHHHhhc---cCC-CC--ccHHHHHHHh Confidence 0111 11111111 235788888888877654 331 112578899999988532 111 11 1111100012 Q ss_pred EeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEe--cHHHHhhhhhheeeeeeeecchhhhh Q lcl|NC_015719. 240 VMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQ--HRSAVGTVKLKDLALERARRAEYQAD 317 (344) Q Consensus 240 i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~--~~~Av~~~~~~~~~~e~~~~~~~~~d 317 (344) ..+.+|...+.|...++.+. . ..+++ .++-+..+-.+++++.. -.++.... T Consensus 216 ~~~l~i~~~~~l~~a~~~g~------------~--------------~~v~~~~~~~~~~~~v~~~~~~~~-~e~~~l~~ 268 (296) T protein:vir:10 216 NSGVTVEFVQYLNDYNGTGT------------S--------------AAIAYEKDPNNMAIEIPEATNALP-AQPKDLHF 268 (296) T ss_pred cCCceEEEeeeeccCCCCcc------------e--------------EEEEEEcCCceEEEEcCcceeeec-ccccCceE Confidence 34667777777654322110 0 00111 11111111122333221 23344556 Q ss_pred hhhhhhhh-cCceeccccEEEE---Eec Q lcl|NC_015719. 318 QIIAKYAM-GHGGLRPESAGAL---VFK 341 (344) Q Consensus 318 ~i~~~~~~-G~~v~Rp~~~~~l---~~~ 341 (344) .+....+. |..+.||++++.+ ++. T Consensus 269 ~~~~~~~~~Gv~i~~P~ai~~~dGI~~~ 296 (296) T protein:vir:10 269 KIPVTSKATGLIVYRPLTMAVMKGITFA 296 (296) T ss_pred EEeeEeeEEEEEEECCceeEEEeeeecC Confidence 66666765 5899999999987 666 No 188 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=96.25 E-value=0.00083 Score=37.49 Aligned_cols=283 Identities=13% Similarity=0.085 Sum_probs=127.6 Q ss_pred cccchhhhhHHHH----hhHHHHHHHHhhhhcCCceeee-cc-cccEEEEeecCcc-eeeeeeCC-CCCCCCcCCcccce Q lcl|NC_015719. 
19 SAADKLALFLKVF----GGEVLTAFARTSVTANRHMQRQ-IS-SGKSAQFPVIGRT-KAAYLQPG-ESLDDKRKDIKHTE 90 (344) Q Consensus 19 ~~~d~~~l~~e~f----~geV~~~f~~~s~~~~~~~~~~-i~-~G~tv~i~~iG~~-t~~~~~~g-~~~~~~~~~~~~~~ 90 (344) --+|..+.|+.++ ...|.+.....-..+.++..++ +- +..++.++....+ .++-|..+ ++++.. +..-.+ T Consensus 1 ~~~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~dip~~--~~~~~~ 78 (301) T protein:vir:80 1 MQGKITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTRSGAAKIIANGADDLPLV--DVDMVR 78 (301) T ss_pred CCccccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeeccceeEEEecCcccccccc--ccccee Confidence 1122223455433 3445555555555667666653 22 3445555544322 23333332 223322 222234 Q ss_pred EEEEeeee-eeeceeccchHHHH-hChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccCceeee--- Q lcl|NC_015719. 91 KTINIDGL-LTADVLIYDIEDAM-NHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGLGKPSLLE--- 165 (344) Q Consensus 91 ~~l~iD~~-~~~~~~Idd~D~~q-~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~~~~~~i~--- 165 (344) ....|-.. .-|.+.+.+++.++ ...++-.+-...++.++++..|+.++.-..+ ....|+-....+. T Consensus 79 ~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~---------~g~~GLlN~p~~~~~~ 149 (301) T protein:vir:80 79 KSVPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKK---------YAIKGAFEATGIQIDV 149 (301) T ss_pred EEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeeccc---------ccceeeecCCCccccc Confidence 44444332 22344456666665 4788888889999999999999988743221 0111111111000 Q ss_pred c---ccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCC-CEEEeCHHHHHHHhccchhhhhccccccccccceeEEEe Q lcl|NC_015719. 166 V---GAKADLTDPVKLGQAVIAQLTIARAALTKNYVPAND-RTFYTTPDVYSAILAALMPNAANYAALIDPERGSIRNVM 241 (344) Q Consensus 166 ~---~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~g-R~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg~i~ 241 (344) . +..+...=..+..+.|+++|.++..+|.++.-=... -.++|+|+.|..|..- +... .. 
+ ..+.+=.-.+.- T Consensus 150 ~~~~~~~~~~~w~~~t~~ei~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~-~~~~-~~-~-~tvl~~l~~~~~ 225 (301) T protein:vir:80 150 SPTTGVGNVSKWEKKTAEQIIDEIGEAHTKITVLPGYGTASLKLCLPPKQFELINKK-RYSN-ED-S-RSVLKVLQDNAW 225 (301) T ss_pred ccCcccccccccccCCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHhhhhc-cccC-CC-C-eeHHHHHHHHcC Confidence 0 011111112234567899999999998775211112 3688999999988631 1111 11 1 011000001223 Q ss_pred CeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchhhhhhhhh Q lcl|NC_015719. 242 GFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEYQADQIIA 321 (344) Q Consensus 242 G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~ 321 (344) +.+|...+.|...+..+ .+....+... .+.+-+.+ .+++++-.. .++.....+.. T Consensus 226 ~~~I~~~p~L~~~g~~g------------~~~~v~~~~~---~d~~~~~v---------~~~~~~~~~-e~~~~~~~~~~ 280 (301) T protein:vir:80 226 FSAIVRVPDLAGMGTAG------------SDSFAVIHDS---NETAELII---------PMDITRHPE-EYSFPRTKVPF 280 (301) T ss_pred cceEEEcceeccCCCCc------------ccEEEEEecC---CcEEEEEe---------cCceeeecc-eecCceeEeee Confidence 45677777665322110 1111111100 00111111 112111110 11111233334 Q ss_pred hhhh-cCceeccccEEEEEec Q lcl|NC_015719. 322 KYAM-GHGGLRPESAGALVFK 341 (344) Q Consensus 322 ~~~~-G~~v~Rp~~~~~l~~~ 341 (344) ..+. |..+.||++++.+.== T Consensus 281 ~~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 281 EERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred eeeeEEEEEEccceEEEEecC Confidence 4444 6799999998775433 No 189 >protein:vir:103181 Length: 457 # NCBI annotation: gp135 # Family: family:all:364 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717802;genbank:gi:113200639;genbank:GeneID:4239190 Probab=93.10 E-value=0.0088 Score=31.86 Aligned_cols=310 Identities=14% Similarity=0.048 Sum_probs=142.3 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCce---eeecccccEEEEeecCcceeeeeeC-- Q lcl|NC_015719. 
1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHM---QRQISSGKSAQFPVIGRTKAAYLQP-- 75 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~---~~~i~~G~tv~i~~iG~~t~~~~~~-- 75 (344) |..=++ .=-.-|.-.++..+...+-.-|.|-.|.++.|.-..-...... .....+....-.+.-+..+...+.. T Consensus 97 mTgPTG-LIFAmRsrY~~q~~~~~a~~~EAl~nEadt~fSg~~~~~~~~~~~~~~~~~gt~~~~~~~~~~~~~~~~~~~~ 175 (457) T protein:vir:10 97 MTGPTG-LIFAMRTNYGAERNPAAAGYDEAFFNEPNAGFSGGPGAYDPGATGVTNDAEGTNPALLNDSPAGTYEQADDAT 175 (457) T ss_pred CCCcce-eeeeeeeeecCccccccccccceeeeccCcccCcccccccccccccccccccccccccCcccccccccccccc Confidence 322111 0000122222222211112234444555555542111000000 0000011000000000000001111 Q ss_pred ------CCCCCCCcCCcccceEEEEeeeeeeec--------eeccchHHHHh--ChhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015719. 76 ------GESLDDKRKDIKHTEKTINIDGLLTAD--------VLIYDIEDAMN--HYDVRSEYTSQIGESLAMAADGAVLA 139 (344) Q Consensus 76 ------g~~~~~~~~~~~~~~~~l~iD~~~~~~--------~~Idd~D~~q~--~~d~~~~~~~~~~~aLa~~~D~~i~~ 139 (344) ++.+........-.+.-+.||+...-+ ..+.-..+.++ -.|.-.|++.=.+..+..++.+-|++ T Consensus 176 gmsTA~aE~lgd~~~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~ 255 (457) T protein:vir:10 176 GMSTATVEALDDSTANTAFREMGFSIEKVTVTARARALKAEYSIEMAQDLKAIHGLDAEQELANILSTEILAEINREVVR 255 (457) T ss_pred chhhhhhhccCCCCCccchhhheeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHH Confidence 111110001112356677787764432 44554555555 47888899999999999999999999 Q ss_pred HHHHhhhcccccccccccccCceeeecccccccccchhhHHHH-HHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhcc Q lcl|NC_015719. 140 ELAGLINLADGVNENIAGLGKPSLLEVGAKADLTDPVKLGQAV-IAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAA 218 (344) Q Consensus 140 ~~~~~a~~~~~~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i-~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~ 218 (344) .|...+..- ...+.....+.++....+.......++.+ |..-.+|.....+--- -.+.|+|.+|.+.++|-.. 
T Consensus 256 ~l~~~a~~~-----~~~~~~~~gv~dl~~~~~g~~~~e~~k~L~~~i~~ean~i~~~T~r-g~gn~~i~S~~Va~~L~~s 329 (457) T protein:vir:10 256 TIYTNAVAG-----AQNNTATAGVFDLDVDSNGRWSVEKFKGLLFQIERDANAIGHQTRR-GKGNILICSADVVSALGMA 329 (457) T ss_pred hHhhhheee-----eccccccceeeeeeccccchhhHHHHHHHHHHHHHHHHHHHHhhcc-ccceEEEEchhHHHHHhhc Confidence 887544221 11222222333332222211122222322 3323444444333332 3568999999999988764 Q ss_pred ch--hhhhcc---cc--ccccccceeEEEe-CeEEEEe----ccccccccccccccccccccccccccccccccccccce Q lcl|NC_015719. 219 LM--PNAANY---AA--LIDPERGSIRNVM-GFEVVEV----PHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENV 286 (344) Q Consensus 219 ~~--~~~~~~---~~--~~~~~~G~Vg~i~-G~~V~~s----n~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 286 (344) .- +..+.. .. .++.....+|.+. |++||.- +|-|.. +.+. +|.| +.... T Consensus 330 g~l~~~p~~~~~~~~~~~d~~~~~~~G~l~~r~~vy~D~Ya~~ns~~d----y~~v-------------G~KG--~~~~~ 390 (457) T protein:vir:10 330 GVLDYTPALNGNNGLAGVDDTSSTLVGTLNGRIKVYVDPYSANVADKH----FYVA-------------GYKG--TSPYD 390 (457) T ss_pred ccccccchhhccccccccccccceeEEEecCCeEEEEecccccCCccc----eEEE-------------EEeC--Cccee Confidence 33 332211 11 1345566678874 7888876 333321 1111 1112 12234 Q ss_pred eEEEecHHHHhhhhhheeeeeeeecchhhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 287 VGLFQHRSAVGTVKLKDLALERARRAEYQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 287 ~gl~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) .||+|+|- +.+.++. 
.-||+.|--.|-.+.|||- .++|.....=-..+.- T Consensus 391 ~glfy~PY----v~l~~~~---~~dp~sfqP~~g~~tRY~l-~~NP~~~~~~~~~~~~ 440 (457) T protein:vir:10 391 AGLFYCPY----VPLQQVR---AINPDTFQPKIGFKTRYGM-VSNPFAGGLTQGSGAL 440 (457) T ss_pred cceeeccc----ccccccC---ccCCccccceeeeeeeeee-eecccccccccccccc Confidence 67888884 3333332 2399999999999999999 8889865432211111 No 190 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=91.89 E-value=0.014 Score=30.79 Aligned_cols=296 Identities=14% Similarity=0.103 Sum_probs=129.7 Q ss_pred CC------------CccccccccccccccccccchhhhhHHHHhhHHHHHHHH-hhhhcCCceeeec---ccccEEEEee Q lcl|NC_015719. 1 MA------------NMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFAR-TSVTANRHMQRQI---SSGKSAQFPV 64 (344) Q Consensus 1 ma------------~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~-~s~~~~~~~~~~i---~~G~tv~i~~ 64 (344) || .++.-. +..| .-.+.++|=-.|+...-...++..|+. .+-++.|...+++ +..+.+.+-. T Consensus 371 lAr~~L~~rg~~~~~~~~~~-~~~~-a~~htTSDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~ 448 (693) T protein:vir:95 371 LARASLVDRGIGVASLNAPQ-MVGL-AFTHTSSDFGLILLDVANKSVLAGWEEAEETFPLWTKSGILTDFKPARRVGLGE 448 (693) T ss_pred HHHHHHHhcCCccCCCCHHH-HHHH-HHhcCcchhHHHHHHHHHHHHHHHHHhhhhHHHHHhccCCCCcccccceeecCC Confidence 11 111000 0000 001234443333334444556666664 3455666665544 4445454433 Q ss_pred cCcceeeeeeCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015719. 65 IGRTKAAYLQPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGL 144 (344) Q Consensus 65 iG~~t~~~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~ 144 (344) .| ++..+..+.++.. 
..+....-++.+.++- --|.|+...-.=-+.+....+....|.+-++..++.++..|..- T Consensus 449 ~~--~L~~V~E~gEyk~--~t~~e~~e~~~l~tyG-~~~~iTRqaiINDDLga~~~ip~~~g~aA~~~~~~~vy~~L~~N 523 (693) T protein:vir:95 449 FS--SLRQVREGAEYKY--VTLGERGEQIILATYG-ELFSITRQAIINDDLQMLSDIPFKLGQAAKATIGDLVYAVLTGN 523 (693) T ss_pred CC--ChhhcCCCCceee--eecCCccceeehhhcC-CeeeecHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 33 3444444444422 2233344455554442 12333321111113446667788899999999999998776531 Q ss_pred hhcccccccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCC--------C--cCCCEEEeCHHHHHH Q lcl|NC_015719. 145 INLADGVNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYV--------P--ANDRTFYTTPDVYSA 214 (344) Q Consensus 145 a~~~~~~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~V--------P--~~gR~~vv~P~~~~~ 214 (344) ...........+.| +++++ ++++..+ ++.|-.++..|.++.- + -..+|+||||+.... T Consensus 524 p~m~DGk~LFhadH--~Nl~t-ga~sals---------~~sl~~a~~am~~qk~~~~~~~g~~L~i~P~~llvP~~le~~ 591 (693) T protein:vir:95 524 PAMSDGKTLFHADH--SNLLT-GAASALS---------IDSLSKAKTQMATQKAQVEKGKGRTLNIRPGFVLTPVALEDK 591 (693) T ss_pred ccccCCcceeeccc--ccccc-ccccccC---------hHHHHHHHHHHHHhhcchhccCCceeecccceEEecchHHHH Confidence 11111111111111 11111 1111111 3344445455544331 1 134788998887765 Q ss_pred HhccchhhhhccccccccccceeEEEeCe-EEEEeccccccccccccccccccccccccccccccccccccceeEEEecH Q lcl|NC_015719. 215 ILAALMPNAANYAALIDPERGSIRNVMGF-EVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHR 293 (344) Q Consensus 215 Ll~~~~~~~~~~~~~~~~~~G~Vg~i~G~-~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~ 293 (344) ..+ +++..+........|.|--+.|+ +|+..++|...+.+.+-+.+. + .. 
++.=+.| T Consensus 592 a~~---l~~s~~~~~a~~~~~~~NP~~~~~~vi~~prL~~~s~~~Wyl~a~-~---------------~~-dtie~~y-- 649 (693) T protein:vir:95 592 ANQ---IINSESVPGADVNSGIVNPIRAFAQVIGEPRLDDASATAWYMAAK-K---------------GS-DTIEVAY-- 649 (693) T ss_pred HHH---HhccccccccccccccccchhccccccccceecCCCCCceEEecC-C---------------CC-CeEEEEE-- Confidence 443 33333322222335555556665 788889986544333322211 0 00 1111111 Q ss_pred HHHhhhhhheeeeeeeecchhhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 294 SAVGTVKLKDLALERARRAEYQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 294 ~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) .-+ .+.+.+|....-..-+=.++-.+-||++++..-++ +-..|| T Consensus 650 ---L~G-~~~P~ie~~~gf~~dG~~~kvr~D~G~~~iD~Rg~---~kn~GA 693 (693) T protein:vir:95 650 ---LDG-VDTPYLEQQEGFTVDGVASKVRIDAGVAPLDFRGL---QKSNGA 693 (693) T ss_pred ---ecC-CCCCeEeecCCCCcceEEEEEEEeccCceeecccc---ccCCCC Confidence 011 12234554433233333445566688888876653 234455 No 191 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=91.45 E-value=0.016 Score=30.46 Aligned_cols=291 Identities=13% Similarity=0.096 Sum_probs=130.2 Q ss_pred CCCcccccccccccccc---------------ccccchhhhhHHHHhhHHHHHHHHh-hhhcCCceeeec---ccccEEE Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKG---------------QSAADKLALFLKVFGGEVLTAFART-SVTANRHMQRQI---SSGKSAQ 61 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~---------------~~~~d~~~l~~e~f~geV~~~f~~~-s~~~~~~~~~~i---~~G~tv~ 61 (344) ||-.. ..+.|.+ ++++|--.|+...-...+...|+.. 
.-++.|.+.+++ +..+.++ T Consensus 336 lAr~~-----L~~~G~~~~~~~~~~~v~~A~~hsTsDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~ 410 (652) T protein:vir:79 336 YARMS-----LTERGIGVSSYNPMQMVGAAFTHSTSDFGNILLDVANKAILQGWEDAPETYEQWTRKGQLSDFKIAHRVG 410 (652) T ss_pred HHHHH-----HHhhccCCCCCCHHHHHHHHhhcCcchHHHHHHHHHHHHHHHHHhhhHHHHHHHhccCCCccccccceee Confidence 11100 0011111 2344433333333334455566543 455666666554 4455555 Q ss_pred EeecCcceeeeeeCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015719. 62 FPVIGRTKAAYLQPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAEL 141 (344) Q Consensus 62 i~~iG~~t~~~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~ 141 (344) +-..| ++..+..|.++-. ..+....-++.+.++- --|.|...--.=-+.+....+....|.+-++..++.+...| T Consensus 411 lg~~~--~L~~V~E~gEyk~--~t~~e~~e~~~l~tyG-~~~~iTRqaiINDDL~a~~~ip~~~g~aA~~~~~~~vy~~l 485 (652) T protein:vir:79 411 MGGFS--ALRQVREGAEYKY--VTTGDKQATIALATYG-ELFSITRQAIINDDLNMLTDVPMKLGRAAKSTIADLVYAIL 485 (652) T ss_pred cCCCC--CccccCCCCccce--eeecCccceeeeeccc-CeeeeehheeeccchhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 54333 4444544554433 2344555567665542 11233220000002345667788888998999998888766 Q ss_pred HHhhhcccccccccccccC-ceeeecccccccccchhhHHHHHHHHHHHHHHHhhcC-----CCcCCCEEEeCHHHHHHH Q lcl|NC_015719. 142 AGLINLADGVNENIAGLGK-PSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNY-----VPANDRTFYTTPDVYSAI 215 (344) Q Consensus 142 ~~~a~~~~~~~~~~~~~~~-~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~-----VP~~gR~~vv~P~~~~~L 215 (344) ..-.... ..+...+.|.. ++... ++..+ .+.|-.++..|.++. +--..||++|||+..... 
T Consensus 486 ~~Np~~~-~DGk~LF~hA~H~Nl~~---~aa~~---------~~~l~~ar~aM~~Qk~g~~~l~i~P~~llvp~~le~~a 552 (652) T protein:vir:79 486 TSNPKIS-TDNVSLFDKAKHANVLE---SAAMD---------VASLDKARQLMRVQKEGERHLNIRPAFVLVPTAMESVA 552 (652) T ss_pred hcCcccc-cCCceeecccccccccc---cccCC---------HHHHHHHHHHHHHhccCCccccccccEEEecchhHHHH Confidence 5211110 01111221111 12111 11111 122333433333332 112358999999876543 Q ss_pred hccchhhhhccccccccccceeEEEeCe-EEEEeccccccccccccccccccccccccccccccccccccceeEEEecHH Q lcl|NC_015719. 216 LAALMPNAANYAALIDPERGSIRNVMGF-EVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRS 294 (344) Q Consensus 216 l~~~~~~~~~~~~~~~~~~G~Vg~i~G~-~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~ 294 (344) .+ +++...........|.+--+.|+ .|++.++|...+.+.+-+++.- . .++.=+.| T Consensus 553 ~~---ll~s~~v~~a~~~~~~~Np~~~~~~~i~eprL~~~s~~~wylaa~~----------------~-~dtiev~y--- 609 (652) T protein:vir:79 553 NQ---VIRSSSVKGADINAGIINPVKDFATVIAEPRLDDNSQTTFYLAASK----------------G-SDTIEVAY--- 609 (652) T ss_pred HH---HhccCCCcccccccccccccccccccccccccCCCCcccEEEecCC----------------C-CCeEEEEE--- Confidence 32 33222211122234555555665 8888999865443333222110 0 01111111 Q ss_pred HHhhhhhheeeeeeeecchhhhhhhhhhhhhcCceeccccEEEEEe Q lcl|NC_015719. 
295 AVGTVKLKDLALERARRAEYQADQIIAKYAMGHGGLRPESAGALVF 340 (344) Q Consensus 295 Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~ 340 (344) .-+ .+.+.+|....-...+=.++-.+=||++++..-++.-.++ T Consensus 610 --L~G-~~~P~ie~~~gf~~dG~~~kvrlD~G~~~iD~RG~~k~t~ 652 (652) T protein:vir:79 610 --LNG-VDTPYIDQMEGFSVDGVTTKVRIDAGVAPVDHRGLVKCTA 652 (652) T ss_pred --ecC-CCCCeeeecCCCCcceEEEEEEEeccCceeeccceeeecC Confidence 011 2234555543333334445666779999998887765444 No 192 >protein:vir:4786 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:3269 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150166;swissprot:trembl:q94m45;genbank:gi:15088777;uniprot:Q94M45;genbank:GeneID:955980 Probab=90.40 E-value=0.021 Score=29.78 Aligned_cols=272 Identities=17% Similarity=0.110 Sum_probs=115.7 Q ss_pred ccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCcee----eecccccEEEEeecCc--ceeeeeeCCCCCC--C Q lcl|NC_015719. 10 LGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQ----RQISSGKSAQFPVIGR--TKAAYLQPGESLD--D 81 (344) Q Consensus 10 ~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~----~~i~~G~tv~i~~iG~--~t~~~~~~g~~~~--~ 81 (344) +.++... -++ .|-|+|.|-+.+-|+.++.|++..-- .-+.+.++.---.... +-++.|..+...- + T Consensus 1 mp~N~n~-----avr-~Y~Kqf~glL~~vf~~qa~F~~~FGglQalDGV~~N~tafsvKt~D~pVVig~Y~TdeNvagFG 74 (295) T protein:vir:47 1 MPSNQNN-----AVR-RYEKQYAGILETVFGVRAAFSNALAPIQILDGVQENSKAFSVKTNNTPVVIGEYKTGENDGGFG 74 (295) T ss_pred CCCCCCc-----cch-hhhHHHHHHHHHHHhHHHHHhhhhcchhhhhCCCccceEEEEeecCcceEeecccCCCcccccc Confidence 1121111 122 48899999999999999999866542 1223333221111111 2234565555442 1 Q ss_pred --CcCCcccceE--EEEeeee-ee-eceec-cchHHHHhChh---HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|NC_015719. 82 --KRKDIKHTEK--TINIDGL-LT-ADVLI-YDIEDAMNHYD---VRSEYTSQIGESLAMAADGAVLAELAGLINLADGV 151 (344) Q Consensus 82 --~~~~~~~~~~--~l~iD~~-~~-~~~~I-dd~D~~q~~~d---~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~ 151 (344) +.......++ .+..|+. .| +.+.| .-+|..--+-| ..++..+.++.|-++.+|..+-.-|...+.... 
T Consensus 75 tGTg~SsRFG~rkEi~y~dtdV~Y~~~~~iHEGiD~~TVNnd~~aaVAdRL~LQA~Akt~~~n~~~Gk~ls~~A~~te-- 152 (295) T protein:vir:47 75 DNSGAQSRFGGVTEVKYENTDVNYDYTLTIHEGLDRYTVNNDLNAAVADRLKLQSEAQTRTVNKRIGKYLSDTATKTE-- 152 (295) T ss_pred cCCccccccCceeeEEeecccccccccchhhhccccccccCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhh-- Confidence 1111111111 1222222 11 11111 12333333333 345566777888888888766444433222111 Q ss_pred ccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhccccccc Q lcl|NC_015719. 152 NENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANYAALID 231 (344) Q Consensus 152 ~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~ 231 (344) .-++.++ ..+...+-.+.++.-...|-..-| +.|.|++|.+|..++..+.+.-.+-+. T Consensus 153 ----------------~~td~t~-----d~V~~LF~~as~~yvn~ev~~~~~-AyV~~evYnaiiD~~l~TsaK~SsaNi 210 (295) T protein:vir:47 153 ----------------ALADFTD-----DKVKALFNKLSAFYTNNEVTAPIT-VYLRSEFYNAIVDMASVTSAKGATISL 210 (295) T ss_pred ----------------hhhcccc-----hhHHHHHHHHHHHhhhhheeeeeE-EEEchhHHHHHhccccccccccceeee Confidence 0112221 123344556666777777643323 889999999999988777654333333 Q ss_pred cccceeEEEeCeEEEEecccccccc--cccccc-ccccccccccccccccccccccceeEEEecH--HHHhhhhhheeee Q lcl|NC_015719. 232 PERGSIRNVMGFEVVEVPHLTAGGA--GDDRPE-EGTDASNQKHAFPATGGKVNKENVVGLFQHR--SAVGTVKLKDLAL 306 (344) Q Consensus 232 ~~~G~Vg~i~G~~V~~sn~lp~~~~--~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~--~Av~~~~~~~~~~ 306 (344) =.+| |-+.-||.|-|.|.-....+ .-+.+- -+.++++-. ........+-.|+.++- +-+.+. .+++ T Consensus 211 Deng-i~~FkGf~i~e~P~~~~q~G~~aifs~dnig~aftGIn-----~aR~IesEdF~GValQ~~~~~~~~~---~~~~ 281 (295) T protein:vir:47 211 DENG-LPKYKGFTLEETPAQYFETGVIAIFSPNGIIIPFVGIS-----TARVIEAENFDGVNCKLLLRVVLTL---LMTI 281 (295) T ss_pred ccCC-cceecceEEEeccHhhccCCcEEEEccccceeecccce-----eeeeeecccccchHHHHHHHHHHHH---HHHH Confidence 3455 55889999998876543211 100000 000000000 00001112222222211 000000 0000 Q ss_pred eeeecchhhhhhhhhhhhhcCceec Q lcl|NC_015719. 
307 ERARRAEYQADQIIAKYAMGHGGLR 331 (344) Q Consensus 307 e~~~~~~~~~d~i~~~~~~G~~v~R 331 (344) . +.|...-.-+|+ | T Consensus 282 ~-----~~~~~~~~~~~~------~ 295 (295) T protein:vir:47 282 R-----KQFTKLQELLYR------R 295 (295) T ss_pred H-----HHHHHHHHHhhc------C Confidence 0 011111111111 1 No 193 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=90.18 E-value=0.022 Score=29.65 Aligned_cols=286 Identities=13% Similarity=0.033 Sum_probs=125.0 Q ss_pred CC--------CccccccccccccccccccchhhhhH-HHHhhHHHHHHHH----hhhhcCCceeee-cc-cccEEEEee- Q lcl|NC_015719. 1 MA--------NMQGGQQLGTNQGKGQSAADKLALFL-KVFGGEVLTAFAR----TSVTANRHMQRQ-IS-SGKSAQFPV- 64 (344) Q Consensus 1 ma--------~~~~~~~~~~~~g~~~~~~d~~~l~~-e~f~geV~~~f~~----~s~~~~~~~~~~-i~-~G~tv~i~~- 64 (344) || .++... .+.+ ....|....|+ +++. .|+....+ .-..+.++..++ +- .-.++.+.. T Consensus 1 ~~~~~~~~~~~~~~~~---~~~~--~~~~d~~~~fl~~ql~-~id~~v~e~~~~~~~~~~~i~v~~~~~~~~et~~~~~~ 74 (314) T protein:vir:10 1 MAIKFDAEQAKITTHL---EQMG--VEKADAAGIWAVSQLT-AALNRAYEKEYAENSVVNIFPVTNEIPGHAKYFEYPEF 74 (314) T ss_pred CccchHHHHHHHHHHH---Hhhc--ccchhhhHHHHHHHHH-HHHHHHhhhhccccccceeeccccCCCCceeEEEeeee Confidence 22 111100 1111 23334332444 4443 55554433 233344555442 11 123444333 Q ss_pred --cCcceeeeeeC-CCCCCCCcCCcccceEEEEeeeee-eeceeccchHHHH-hChhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015719. 65 --IGRTKAAYLQP-GESLDDKRKDIKHTEKTINIDGLL-TADVLIYDIEDAM-NHYDVRSEYTSQIGESLAMAADGAVLA 139 (344) Q Consensus 65 --iG~~t~~~~~~-g~~~~~~~~~~~~~~~~l~iD~~~-~~~~~Idd~D~~q-~~~d~~~~~~~~~~~aLa~~~D~~i~~ 139 (344) .|..+ -|.. +.+++.. +..-.+....|-... -+.+.+.++..++ ...++-.+-...+..++++..|+.++. 
T Consensus 75 e~~G~a~--~~~d~~~dip~v--d~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~ 150 (314) T protein:vir:10 75 DGVGIAQ--IIADYSDDLPLV--DAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDKLVWS 150 (314) T ss_pred cccccee--eeCCccccccee--ecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEe Confidence 34433 3332 2334332 223334444443321 2223345555554 377777777888888888888887763 Q ss_pred HHHHhhhcccccccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhc--CCCcCCCEEEeCHHHHHHHhc Q lcl|NC_015719. 140 ELAGLINLADGVNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKN--YVPANDRTFYTTPDVYSAILA 217 (344) Q Consensus 140 ~~~~~a~~~~~~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~--~VP~~gR~~vv~P~~~~~Ll~ 217 (344) -... ....|+-....++...++ ++- +..+.|+++|..+..+|.++ .+= .--.++|+|+.|..|.. T Consensus 151 G~~~---------~g~~GLlN~p~v~~~~~~--~~W-aT~~ei~~Di~~~~~~l~~~s~g~~-~p~~l~Lpp~~~~~L~~ 217 (314) T protein:vir:10 151 GSAP---------HGIVSVFDQPNINNVVAT--PNW-SVPQNAIDDVTAMIDAVESSTQGLH-HVTDILLPASARRVMQG 217 (314) T ss_pred eccc---------ccceeEeecCCCccccCC--CCc-ccHHHHHHHHHHHHHHHHHhcCccc-cceeEEecHHHHHhhcc Confidence 2110 112222111111111111 011 12356899999999999875 221 11368899999986632 Q ss_pred cchhhhh-ccccccccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHH Q lcl|NC_015719. 218 ALMPNAA-NYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAV 296 (344) Q Consensus 218 ~~~~~~~-~~~~~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av 296 (344) .... +..-..-+.+ +--+++|...+.|...++.+ ......|..+ .+.+.+.+ T Consensus 218 ---~~~~~~~tvl~~l~~----n~~~l~I~~~~el~~ag~~g------------~~~~v~y~~~---~~~~~~~v----- 270 (314) T protein:vir:10 218 ---LVPQTNLSYGELFTR----NNPGLTIRFLQFLDNYDGAG------------GKAALAFEKS---PLNMSIEI----- 270 (314) T ss_pred ---cccCCCccHHHHHHH----hCCCcEEEEcccccccCCCc------------ceEEEEEecC---CcEEEEec----- Confidence 1111 1000011111 12366777777765322111 1111111110 01111111 Q ss_pred hhhhhheeeeeeeecchhhhhhhhhhhhh-cCceeccccEE---EEEec Q lcl|NC_015719. 
297 GTVKLKDLALERARRAEYQADQIIAKYAM-GHGGLRPESAG---ALVFK 341 (344) Q Consensus 297 ~~~~~~~~~~e~~~~~~~~~d~i~~~~~~-G~~v~Rp~~~~---~l~~~ 341 (344) .++++.-. ..++.....+....+. |..+.||.+++ -|++. T Consensus 271 ----p~~~~~l~-~e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~dGI~~~ 314 (314) T protein:vir:10 271 ----PEVTNVLP-AQPKDLHFRYPVTSKATGLIVYRPLTMAVIKGITFA 314 (314) T ss_pred ----Cccceeec-ceecCceEEEcceeeeEEEEEECcceeEeeeeeecC Confidence 11222111 1223344555555565 57999999998 45665 No 194 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=89.96 E-value=0.023 Score=29.52 Aligned_cols=307 Identities=11% Similarity=0.065 Sum_probs=130.8 Q ss_pred CCCccccc-----cccc--cccccc---cccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcc-e Q lcl|NC_015719. 1 MANMQGGQ-----QLGT--NQGKGQ---SAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRT-K 69 (344) Q Consensus 1 ma~~~~~~-----~~~~--~~g~~~---~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~-t 69 (344) |--+-+.. ...+ -|..++ .-.+.-.|........|.+.|.+.+-++.+....++. |++.+.++.-.. . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~p~l~m~alTLaea~~l~~d~~~~~VIE~l~~~s~iL~~lpf~~ve-~~~~~~~r~~~lp~ 79 (330) T protein:vir:94 1 MVRICTPPLRGRWRTLTHQFPELKMPTVTLAESAKLSQDHLVSGLIETIVEVNPLYEMMPFTEIE-GNALAYNRENVLGD 79 (330) T ss_pred CceecCCccccceeehhccccccchhhhhhhHHhhcCchhhHHHHHHhhhccchHHhhccccccc-CCcceeeeeecCCc Confidence 22111110 0000 000000 0111112333456778889998776666666555554 444555554332 1 Q ss_pred eeeeeCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHh-----ChhHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015719. 70 AAYLQPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMN-----HYDVRSEYTSQIGESLAMAADGAVLAELAGL 144 (344) Q Consensus 70 ~~~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~-----~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~ 144 (344) +.-+.-+..++.. .+.+..+.+..+ .... 
.+-++|+.-+ -.|.+.+..+...++|++++...++.- T Consensus 80 a~~r~~n~~~~~~-~~~Tf~q~t~~l---~~l~-~~~~Vd~~iadl~g~~~d~~~~q~~~~ieal~~~~e~~linG---- 150 (330) T protein:vir:94 80 VQFLAVGGTITAK-NPATFTKVTSEL---TTLI-GDAEVNGLIQATRSDFMDQTSVQVASKAKSIGRQYQASMITG---- 150 (330) T ss_pred ceeeecccccccc-Ccceeeeeeech---hhhh-hhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhhcc---- Confidence 1222223333211 111122333221 1111 1224454432 346788888888889988877766531 Q ss_pred hhcccccccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcC-CCcCCCEEEeCHHHHHHHhccchhhh Q lcl|NC_015719. 145 INLADGVNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNY-VPANDRTFYTTPDVYSAILAALMPNA 223 (344) Q Consensus 145 a~~~~~~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~-VP~~gR~~vv~P~~~~~Ll~~~~~~~ 223 (344) -.....-.........++.+..++.+....+.. +|.|+ +... -|-+.-+++++..+...+..-.|-.. T Consensus 151 Ds~~~~F~GL~~~~~~~q~i~tg~~gg~~T~d~-----LDeLl------~~v~~~~g~~~~~l~n~a~~r~I~a~~R~~~ 219 (330) T protein:vir:94 151 DGTGNSFQGMMGLVAASQTISAGANGGTLTFEL-----LDQLL------DLVKDKDGQVDYLMSSFAMRRKYFSLLRALG 219 (330) T ss_pred CCCCccccchhhcCCcccEEecCCCCCCCCHHH-----HHHHH------HHhcCCCCCCcEEEechhHHHHHHHHHHhcc Confidence 000000001111223445555444333322211 23232 2221 12233588888887777765444221 Q ss_pred h-c-cccccccccceeEEEeCeEEEEecccccccccccccccccccccccccccccccccc-ccceeEEEecHHHHhhhh Q lcl|NC_015719. 224 A-N-YAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVN-KENVVGLFQHRSAVGTVK 300 (344) Q Consensus 224 ~-~-~~~~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~gl~~~~~Av~~~~ 300 (344) . . +.......--.|-.+.|++|+.++-+|.+.+... .++..+.+...-++.. .-..+||-.... 
T Consensus 220 ~~~v~~~~~~~~G~~v~~~~GvPi~~~d~ip~~~~~~~-------~~~ttsIyav~~G~~~~~qgV~Gl~~~g~------ 286 (330) T protein:vir:94 220 GAAIGEVMTLPSGRQIPTYRGVPWFVNDFIPSNMTQGT-------ATNATAIFAGTFDDGSNKYGIAGLTARGS------ 286 (330) T ss_pred CCCCCCcccccCCCEEeeeCCeEEEecccccCCCCccc-------CCCceeEEEEeecccccccceEeecCCCC------ Confidence 1 1 1112223233567889999999999987632110 0111111221111100 112344422111 Q ss_pred hheeeeeeee--cc-hhhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 301 LKDLALERAR--RA-EYQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 301 ~~~~~~e~~~--~~-~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) -.+.++... ++ .-+.+.| .+-+|..++.|+++++|+=-.-- T Consensus 287 -~glsVr~~G~~~~k~v~~~~v--~~y~~~av~~~~a~~~L~~V~~g 330 (330) T protein:vir:94 287 -AGLRVQNVGAKENADETITRV--KMYCGFANFSQLGLAAIKGLIPG 330 (330) T ss_pred -CcceeeeCCCccccceeeEEE--EEeeeeEEechhheeeeccccCC Confidence 012333222 11 1112222 23478899999999998744434 No 195 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=88.74 E-value=0.03 Score=28.90 Aligned_cols=275 Identities=12% Similarity=0.070 Sum_probs=104.7 Q ss_pred CCCccccccccccc--------cccccccchhhhhHHHHhhHHHHHHH------HhhhhcCCceeeeccc--------cc Q lcl|NC_015719. 1 MANMQGGQQLGTNQ--------GKGQSAADKLALFLKVFGGEVLTAFA------RTSVTANRHMQRQISS--------GK 58 (344) Q Consensus 1 ma~~~~~~~~~~~~--------g~~~~~~d~~~l~~e~f~geV~~~f~------~~s~~~~~~~~~~i~~--------G~ 58 (344) |-+...... .... +....+.+ ..-|+ .++.++-. ..++.+.+.+...+.. .. 
T Consensus 171 ~~~~~~~~~-~~~~~~~e~r~~~~~~~~~~-e~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 244 (480) T protein:vir:40 171 REASIPSEK-PEDAERKFMRELGSKMAEMP-EQGFL----REFANGADLNVVNSLGSITSKYARKSGIYDGAMKARFQGL 244 (480) T ss_pred hhhhccccc-hhhhhhHHHHHHHHHhccch-hhhhh----hhhhhhccccccccccccccchhhheeechhhhhhhhhcc Confidence 111110000 0000 00000000 00000 00000000 0011111111111100 00 Q ss_pred EEEEeecCcc---eeeee-eCCCCCCCCcCCcccceEEEEeeee--eeec---eeccchHHHHhChhHHHHHHHHHHHHH Q lcl|NC_015719. 59 SAQFPVIGRT---KAAYL-QPGESLDDKRKDIKHTEKTINIDGL--LTAD---VLIYDIEDAMNHYDVRSEYTSQIGESL 129 (344) Q Consensus 59 tv~i~~iG~~---t~~~~-~~g~~~~~~~~~~~~~~~~l~iD~~--~~~~---~~Idd~D~~q~~~d~~~~~~~~~~~aL 129 (344) ++. ..|.. .+... ..+.... +...+...+ .++. ++.. .....+|. ..++.+-+..+.++.| T Consensus 245 ~~~--~~g~~~~~~~~e~~~~~~~~~----~~~~~~~~~-~~~~v~~l~~~~k~t~~lLDD---a~~l~~~i~~~l~~~~ 314 (480) T protein:vir:40 245 TLA--EDGVDDTFISGTFKAGTDKNK----SQTATKRSL-RPQMAEAYLQMDKATVRGVND---SGALSEYVMSEMVNRV 314 (480) T ss_pred eee--eccccceeeeeeeeccccccc----ccccccchh-hHHHHHHHHHhHHHHHHHhhh---hHHHHHHHHHHHHHHH Confidence 111 11111 01111 1111100 001111111 0100 1110 00111121 2358888899999999 Q ss_pred HHHHHHHHHHHHHHhhhcccccccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCC-EEEeC Q lcl|NC_015719. 130 AMAADGAVLAELAGLINLADGVNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDR-TFYTT 208 (344) Q Consensus 130 a~~~D~~i~~~~~~~a~~~~~~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR-~~vv~ 208 (344) +++.++.++.-- .... + + ..+ +..... ..+.. ..+...++.|+.+ |.+..- .+. 
.+|++ T Consensus 315 ~~~ee~a~l~G~------g~g~--~--~-~~g--~~~~~~-~~~~~-~~~~d~id~L~~a---l~~~y~--~~a~~~vmn 374 (480) T protein:vir:40 315 IQKVEYNMILGS------VDGS--N--G-FYG--LKTATD-GWTKQ-IEYTDLFEGITDA---VAECSI--SDAITIVMS 374 (480) T ss_pred HHHHHHHhhccC------CCCc--c--c-ccc--ceeecc-ccccc-chhHHHHHHHHHh---hhHHhh--CCCCEEEEC Confidence 999888775210 0000 0 0 011 111111 11111 1122233334333 332221 123 57899 Q ss_pred HHHHHHHhccchhhhhccccccccccceeEEEeCeEEEEecc-cccccccccccccccccccccccccccccccccccee Q lcl|NC_015719. 209 PDVYSAILAALMPNAANYAALIDPERGSIRNVMGFEVVEVPH-LTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVV 287 (344) Q Consensus 209 P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg~i~G~~V~~sn~-lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 287 (344) |..+..|.+-..- +..|.=...+..|....++|++|++++. +|... +..+. + +.. T Consensus 375 ~~t~~~I~klKD~-~G~Yi~q~~~~~~~~~~llG~pvv~~~~~~~~~~---~~~~~----------~----------~~~ 430 (480) T protein:vir:40 375 PQTFAELRKAKGT-DGHSRFNELATKEQIAQSFGAVNLETRVWMPKDE---VAVYN----------H----------DEY 430 (480) T ss_pred HHHHHHHHHhhcC-CCCeeccCcccccCcceecccceeeeeccccCCc---ceeee----------C----------Ccc Confidence 9999987654322 3446545567788899999999887643 33211 10000 0 111 Q ss_pred EEEecHHHHhhhhhheeeeeeeecchhhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 
288 GLFQHRSAVGTVKLKDLALERARRAEYQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 288 gl~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) .+++-++ .+....++-++-...+....+.|..+.+|+++..++.++.= T Consensus 431 ~~~~d~~---------~~~~~~~~~~~~~~~~~~e~~v~g~~~~~~~~~~~~~~~~~ 478 (480) T protein:vir:40 431 VLIGDLN---------VENYNDFDLRYNVEQWLSETLVGGSIRGKNRSAYLKKKGSL 478 (480) T ss_pred EEEEecc---------cceecccccccchhhhhhhhhhceeeEccccEEEEEeccCc Confidence 2222221 11112223334456677788899999999999888888776 No 196 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=88.59 E-value=0.031 Score=28.83 Aligned_cols=291 Identities=9% Similarity=0.027 Sum_probs=129.7 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCc---ceeeee-eCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGR---TKAAYL-QPG 76 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~---~t~~~~-~~g 76 (344) |.-++ ..-++-.+.|. =...|.+.|.+.|.+..+.+..++. |++.+.++.-. ...... .+- T Consensus 1 mpalt-------Laea~k~~~d~-------l~~~ViE~~~~~s~lL~~LpF~~ve-g~~~~ynR~~~~~~~~~~~v~~~~ 65 (310) T protein:vir:97 1 MASVT-------LAESAKLAQDE-------LVAGVIENIITVNRMFDVLPFDSIE-GNSLAYNRENVLGDVIMAGVGTTF 65 (310) T ss_pred Ccccc-------hHHHhhcCcch-------HHHHHHHHHhccchHHHhCCccccc-CCcceeeEeeccCCcccccccccc Confidence 55332 22222222222 2456778888777777666666655 55677766632 221111 111 Q ss_pred CCCCCCcCCcccceEEEEeeeeeeeceeccchHHH--H---h-ChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|NC_015719. 77 ESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDA--M---N-HYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADG 150 (344) Q Consensus 77 ~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~--q---~-~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~ 150 (344) ++......+.+.++++..+ +.+. .+-++|.. + . 
-+|.+.+-.+...++|++.+...++.- -....+ T Consensus 66 ~~~g~~~~~~t~~~~~~~L---~i~~-g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lING----D~a~n~ 137 (310) T protein:vir:97 66 SGAGAGKAAATFTKVNSNL---TTIM-GDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLING----NGAGNE 137 (310) T ss_pred cCCCccccccccceeeeee---eeee-ehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhcc----ccCCCc Confidence 1111111112223333322 1121 12234421 1 2 345677778888899998887766530 000000 Q ss_pred cccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhh--hcccc Q lcl|NC_015719. 151 VNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNA--ANYAA 228 (344) Q Consensus 151 ~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~--~~~~~ 228 (344) -.........++.+..++.+....+. .+|.|++. .-+ -..+..+++.+|.++..+..--|-.. .-|.. T Consensus 138 F~GL~~~~~~~q~i~~~~~gg~~t~d-----~LDeLl~~---v~~--~~g~p~~~l~~~~~~r~i~A~~R~~~~~g~~~~ 207 (310) T protein:vir:97 138 FAGLIQLCASGQKATTGATGSAISFA-----ILDELMDL---VVD--KDGQVDYLTMHARTLRSYKALLRALGGASINEV 207 (310) T ss_pred ccchhhcCCccceeecCCCCCCCCHH-----HHHHHHHH---Hhc--CCCCCCEEEecHHHHHHHHHHHHHhcCCCCCCc Confidence 00111112334555544433332221 12322221 111 11133699999987665554333322 12322 Q ss_pred ccccccceeEEEeCeEEEEecccccccccccccccccccccccccccccccccccc----ceeEEEecHHHHhhhhhhee Q lcl|NC_015719. 229 LIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKE----NVVGLFQHRSAVGTVKLKDL 304 (344) Q Consensus 229 ~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~gl~~~~~Av~~~~~~~~ 304 (344) +....--.|-.+.|++|+.++.+|.+..... .++. ...|...+.-+ ..+||...... 
-+ T Consensus 208 ~~~~~G~~v~~~~GiPi~~~d~ip~~~~~~~-------~~gt---TsIya~r~Ge~~~~~Gv~Gl~~~~~~-------gl 270 (310) T protein:vir:97 208 VELPSGAEVPAYSGTPIFRNDYIPTNQTKGG-------TTGC---TTIFAGTLDDGSRTHGIAGLTATQAA-------GI 270 (310) T ss_pred cccCCCCEEeeeCCeEEEEeCccCCCccccc-------cCCc---eeEEEEeeCccccccceeccccCCcc-------ce Confidence 3333334567999999999999997643210 0111 11222222211 22333211111 12 Q ss_pred eeeeee---cchhhhhhhhhhhhhcCceeccccEEEEEecCC Q lcl|NC_015719. 305 ALERAR---RAEYQADQIIAKYAMGHGGLRPESAGALVFKAG 343 (344) Q Consensus 305 ~~e~~~---~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~ 343 (344) .++... ++--+.+.|. +-+|..++.|+++++|+=--- T Consensus 271 sVr~~G~~~~~~v~~~~V~--~Y~~~av~~~~A~a~L~~V~~ 310 (310) T protein:vir:97 271 QVVDVGESEDSDEHIWRVK--WYCGLALFSEKGLACADGITN 310 (310) T ss_pred eEEeCCcccCCcceeEEEE--EeeeEEEecccceeeeccccC Confidence 333322 2222333332 237889999999998864333 No 197 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=87.52 E-value=0.038 Score=28.36 Aligned_cols=295 Identities=9% Similarity=0.013 Sum_probs=122.9 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHH----hhhhcCCceeee-cc-cccEEEEeec---Ccceee Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFAR----TSVTANRHMQRQ-IS-SGKSAQFPVI---GRTKAA 71 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~----~s~~~~~~~~~~-i~-~G~tv~i~~i---G~~t~~ 71 (344) =++....++....++.-..+.+.. 
.|+......|+....+ .-..+.++..++ +- +-.++.+..+ |..+ T Consensus 14 ~~~~~~~a~~~~~~~~~~~~~~~~-~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~~~~~~~~~~t~~~~~~~G~a~-- 90 (329) T protein:vir:79 14 EFEANVIANHMQLRGAKNDASDMG-IWTSQELHKIKAQAYEKEYPAGSALRVFPVTSELSDTDKTFEYQTFDKVGHAK-- 90 (329) T ss_pred hhhhhhHhhhcccccceeccchhh-HHHHHHHHHHHHHHHhhhhcccchhhhcccccCCCCceeEEEeeeeecceeee-- Confidence 000000000000111111122223 3553333344444432 333345555442 22 2334554444 4433 Q ss_pred eeeC-CCCCCCCcCCcccceEEEEeeee-eeeceeccchHHHH-hChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc Q lcl|NC_015719. 72 YLQP-GESLDDKRKDIKHTEKTINIDGL-LTADVLIYDIEDAM-NHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLA 148 (344) Q Consensus 72 ~~~~-g~~~~~~~~~~~~~~~~l~iD~~-~~~~~~Idd~D~~q-~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~ 148 (344) -|.. .++++.. +..-.+....|-.. .-+.+.+.++..++ ...++-.+-...+..++++..|+.++.--.. T Consensus 91 ~~~d~~~dip~v--d~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~----- 163 (329) T protein:vir:79 91 IIADYTDDLSTV--DALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQLVNHLVFKGSKP----- 163 (329) T ss_pred eecCccccccee--ecccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEeeccc----- Confidence 3332 2333322 22222323333222 12234456666665 4778888888888899999999887632110 Q ss_pred cccccccccccCceeee---cccccccccchhhHHHHHHHHHHHHHHHhhc--CCCcCCCEEEeCHHHHHHHhccchhhh Q lcl|NC_015719. 
149 DGVNENIAGLGKPSLLE---VGAKADLTDPVKLGQAVIAQLTIARAALTKN--YVPANDRTFYTTPDVYSAILAALMPNA 223 (344) Q Consensus 149 ~~~~~~~~~~~~~~~i~---~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~--~VP~~gR~~vv~P~~~~~Ll~~~~~~~ 223 (344) ....|+-....++ .++.+...-..+..+.|+++|.++..+|.++ .+ ..--.++|+|+.|..|..- ..+ T Consensus 164 ----~g~~GLlN~p~v~~~~~~~~~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~-~~p~~L~Lpp~~~~~L~~~--~~~ 236 (329) T protein:vir:79 164 ----HKIISVFEHPNLTTINSAGWNNAAGTGKKPETAQDELEQAIEKIETLTNGQ-HRANMILIPPSMRKVLMVR--MPE 236 (329) T ss_pred ----ccceeeecCCCccccccCCCCCccccccCHHHHHHHHHHHHHHHHHhcCce-ecccEEEecHHHHHHhhcc--cCC Confidence 1111211111111 1111111222334567899999998888875 32 1113688999999888521 111 Q ss_pred hccccccccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhhe Q lcl|NC_015719. 224 ANYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKD 303 (344) Q Consensus 224 ~~~~~~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~ 303 (344) .+..--.-+.+ +...++|...+.|-..+.. .......|. ..++-+...-.++ T Consensus 237 ~~~tvl~~lk~----~~~~l~I~~~~el~~ag~~------------g~~~~v~y~------------~~~~~~~~~vp~~ 288 (329) T protein:vir:79 237 TTMSYLDYFKQ----QNGGITIESISELEDIDGA------------GTKAALVYE------------KDPMNMSIEIPEA 288 (329) T ss_pred CCccHHHHHHH----hCCCcEEEEcccccccCCC------------CceEEEEEe------------cCCceEEEecCcc Confidence 11000011111 1124556665555321110 011111111 1111111111222 Q ss_pred eeeeeeecchhhhhhhhhhhhh-cCceeccccEEEE---Eec Q lcl|NC_015719. 304 LALERARRAEYQADQIIAKYAM-GHGGLRPESAGAL---VFK 341 (344) Q Consensus 304 ~~~e~~~~~~~~~d~i~~~~~~-G~~v~Rp~~~~~l---~~~ 341 (344) +++.. ..++.....+....+. |.-+.||.+++.+ .+. 
T Consensus 289 ~~~l~-~q~~~~~~~v~~~~r~~Gv~i~~P~ai~~~dGI~~~ 329 (329) T protein:vir:79 289 FNMLT-AQPKDLHFKVPCTSKCTGLTIYRPLTLVLIKGLVVG 329 (329) T ss_pred eeeee-ceecCceEEEceeeeEEEEEEECcceeeeeeeeeeC Confidence 22221 1223333445555554 5799999988753 333 No 198 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=83.21 E-value=0.07 Score=26.90 Aligned_cols=267 Identities=14% Similarity=0.117 Sum_probs=114.9 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeec-Ccceeeee------ Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVI-GRTKAAYL------ 73 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~------ 73 (344) |++... .+.++|...-....|-+.+.+-...+-...++...=.. .|.|..-+.+ .++++..+ T Consensus 127 ~r~a~~----------~~~Tgd~~~~i~~~~v~d~i~li~q~r~i~slf~tLP~-~g~T~eY~v~t~~~tV~~q~~~~kq 195 (410) T protein:vir:83 127 YARAAD----------HQKTGDLQGVIPDPIVGPVIDFIDSARPLVSTLGTLPL-NNATFYRPIVSQRPAVGLQGVAGGA 195 (410) T ss_pred HHHhhc----------cCcccccccccchhHhhhHHHHHhhccchhhhhhhCCC-CCCeeEEeeeccccccccccccccc Confidence 222221 12233332212234555555444433223333222111 2666655433 22233222 Q ss_pred -eCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|NC_015719. 74 -QPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVN 152 (344) Q Consensus 74 -~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~ 152 (344) +.|..++.. .+.....+-.|+.+--..+ +....--.++....+-.++-.+.+-|+.....+=..|.. +.. 
T Consensus 196 a~EGd~L~~g--Kl~~~t~tA~ikTyGGyt~-LSRQ~IERs~v~~L~~~lraL~~AYA~atea~vra~L~~-t~t----- 266 (410) T protein:vir:83 196 SDEKTELDSQ--KMVIDRLTVNAKTLGGYVN-VSRQAIDFSSPSALDLVVNGLGQQYAIETEALVGAALAS-TST----- 266 (410) T ss_pred cccccccccc--ceeeeeccceeehhcCccc-ccceeeecCChhhHHHHHHHHHHHHHHHHHHHHHHHHHH-hhh----- Confidence 245555532 3455555566666543221 211111123444444444555555555554444332311 100 Q ss_pred cccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhc--CCCcCCCEEEeCHHHHHHHhccchhhhhc---cc Q lcl|NC_015719. 153 ENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKN--YVPANDRTFYTTPDVYSAILAALMPNAAN---YA 227 (344) Q Consensus 153 ~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~--~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~---~~ 227 (344) + ..+.+. + .++.+...+.++....+.+ ++ .=+++.|+|+++..+..--.-.+.+ .. T Consensus 267 ----~--------~~a~~~-~----Tad~~~~~i~da~~~v~da~~~~--~~~~i~vS~DVl~~~~~~f~~~~~~~~dt~ 327 (410) T protein:vir:83 267 ----G--------AVGYGN-A----TADNVASAIWQAAGAVYTAVKGM--GRLVIAIAPDVLGDFGPLFAPVNPTNAHST 327 (410) T ss_pred ----h--------hhhhhh-c----cHHHHHHHHHHHHHHHhhhhccc--eeeeEEechhhhhhccceeeccCCCCcccc Confidence 0 001111 1 1344556666788888876 44 3368899999976665432222222 11 Q ss_pred --cccccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhh--e Q lcl|NC_015719. 228 --ALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLK--D 303 (344) Q Consensus 228 --~~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~--~ 303 (344) +.+.+..|.-|.+.|++|...+.+|.+.. .++.+.||-.-+.- + T Consensus 328 Gfg~~~lg~gi~G~~~~ipVvm~~~a~AgTA--------------------------------~f~~~~Ai~~~eS~~gp 375 (410) T protein:vir:83 328 GFEAGRFGQGVMGSISGIPVVMSAALGSGDA--------------------------------YLFSTAAIECFEQRVGT 375 (410) T ss_pred cccccccccchhhhhcccceEEecCCCcCee--------------------------------eEeccceeeeeecCCce Confidence 22234477668999999999998875431 22333333111111 1 Q ss_pred eeeeeeecchhhhhhhhhhhhhcCceeccccEEEEEec Q lcl|NC_015719. 
304 LALERARRAEYQADQIIAKYAMGHGGLRPESAGALVFK 341 (344) Q Consensus 304 ~~~e~~~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~ 341 (344) +++....--+...+ +.|.+ +..+.-|++++=+.=+ T Consensus 376 ~qL~d~~i~nLt~~-ySgY~--a~a~~~~~gliPv~g~ 410 (410) T protein:vir:83 376 LQVVEPSVFGLQVA-YAGYF--STLVVNEDAIVPLVGS 410 (410) T ss_pred eEeeCCchhhhhhh-heeee--eeccccccceeeeccC Confidence 11111100011111 22332 3355566666555544 No 199 >protein:vir:10324 Length: 320 # NCBI annotation: ORF26 # Family: family:all:570 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758919;genbank:gi:27311193;genbank:GeneID:956155 Probab=75.95 E-value=0.14 Score=25.25 Aligned_cols=281 Identities=9% Similarity=0.008 Sum_probs=94.6 Q ss_pred ccccccccccccchhhhhHHHHhhHHHHHH-HHhhhhcCCceeeecccccEEEEeecCcceeeeeeCCCCCCCCcCCccc Q lcl|NC_015719. 10 LGTNQGKGQSAADKLALFLKVFGGEVLTAF-ARTSVTANRHMQRQISSGKSAQFPVIGRTKAAYLQPGESLDDKRKDIKH 88 (344) Q Consensus 10 ~~~~~g~~~~~~d~~~l~~e~f~geV~~~f-~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~~~~~~~~~~~ 88 (344) ++..|+.- . .++..| .+..+....+.... +.|.---+|.+-+.. +|... .. T Consensus 1 i~~~P~~~--------------g-~~~glff~~~~v~T~~V~ie~-~~~~l~lip~v~rg~-----~g~~~-------~~ 52 (320) T protein:vir:10 1 MNLLPVNY--------------G-DSRALFAREKKVRTRTILVEE-KNGVLTLIQSREPGS-----TENVA-------KR 52 (320) T ss_pred CCcCCchh--------------h-hhhhhccCCCCcccceEEEEE-ecCceeeeeccCCCC-----Cceee-------cC Confidence 33344432 1 111222 12222222222222 223322233332211 11111 11 Q ss_pred ceEEEEeeeeeeec--eec--cch--------HHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc-cc-- Q lcl|NC_015719. 89 TEKTINIDGLLTAD--VLI--YDI--------EDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGV-NE-- 153 (344) Q Consensus 89 ~~~~l~iD~~~~~~--~~I--dd~--------D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~-~~-- 153 (344) .++.+..=+-.|+. ..| +|+ ++.++--+++.+...++ .+.+|... ..+.-.|-. +.+ .. 
T Consensus 53 ~~~~~~~f~~p~~~~~d~i~a~eiq~~Ra~G~~~~~~~~~~v~~~l~~l----r~~~~~T~-E~m~~~AL~-G~ildadG 126 (320) T protein:vir:10 53 GKRKVRSFVIPHLPLEDVILPDEYEGLRGFGTTALAAKSELVKERXETM----KSSHDITH-EHLRMGAKK-GQILDADG 126 (320) T ss_pred CcceEEEEecceeccCCccCHHHHcCcccCCCchHHHHHHHHHHHHHHH----HHHHHHHH-HHHHHhhhc-CeEEcCCC Confidence 11111110111110 001 010 12222222333333333 33333222 112111111 111 00 Q ss_pred ----c---cccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhc- Q lcl|NC_015719. 154 ----N---IAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAAN- 225 (344) Q Consensus 154 ----~---~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~- 225 (344) + ..|... ..+...-.+ ..++....+.+.+..+...|. ..|..+-+++++|++|..|+.++.+...- T Consensus 127 tv~~d~y~~fGi~~-~~i~~~l~~---a~~dv~~~~~~~~~~i~~~l~--g~~~t~v~al~g~~f~~al~~h~~Vke~y~ 200 (320) T protein:vir:10 127 TVLYDLYAEFGITK-KTIYFGLDN---KDANVAESCRQVLRHVEDNLR--GDVMKDVSVDVSEEFFDKFIKHASVKEVFL 200 (320) T ss_pred cEEEechhhhCCcc-ceeEEecCC---CCccHHHHHHHHHHHHHHHhc--cCCCCceEEEEChHHHHHHhcCHHHHHHHH Confidence 0 011111 111111111 112223445555555555564 45666667899999999999998875432 Q ss_pred -cc-cccccccc--eeEEEeCeEEEEecc-ccccccccc-ccccc----cccccccccccccccccc---ccceeEEEec Q lcl|NC_015719. 226 -YA-ALIDPERG--SIRNVMGFEVVEVPH-LTAGGAGDD-RPEEG----TDASNQKHAFPATGGKVN---KENVVGLFQH 292 (344) Q Consensus 226 -~~-~~~~~~~G--~Vg~i~G~~V~~sn~-lp~~~~~~~-~~~~~----~~~~~~~~~~~~~~~~~~---~~~~~gl~~~ 292 (344) +. +...++.. .-..+.|+.+++-.- .+...+... 
.+..+ ++.+ ..+.|-.+.+-.+ ..++.|+ T Consensus 201 ~~~~~~~~l~~~~~~~f~~gGi~~~~Y~g~~~d~~g~~~~~I~~~~~~~~p~g-~~~~f~~~~apad~~e~vnt~g~--- 276 (320) T protein:vir:10 201 NHEAAVNRLGGDTRKGFKFGGLIFNENRARHVDEEGKETRFIKAGKGHAFPTG-TTNTFFTALAPADFNETAGTLGK--- 276 (320) T ss_pred hhhhhhhhccccccceEEecCEEEEEcccEEEcCCCCeeEeecCCeeEEEEec-CchhheeeecccCcHhhcCCccc--- Confidence 11 11112211 113677888887532 111111110 01111 1111 1112222211111 1122221 Q ss_pred HHHHhhhhhheeeeeeeecchhhhhhhhhhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 293 RSAVGTVKLKDLALERARRAEYQADQIIAKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 293 ~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) ++-...+.++.-.+..+..-..-=.-+.||++++-++..++= T Consensus 277 ----------p~y~k~~~~~~~~g~~l~~qS~PLpi~~rP~~lv~~~~~a~~ 318 (320) T protein:vir:10 277 ----------RYYAKMEPRRMGRGFDLHSQSNVLPMCCRPGVLVELDAAAQP 318 (320) T ss_pred ----------ccccccccccCCCeEEEEeeecccccccCcceEEEEEecCCC Confidence 111222222222222222221112456799999987776655 No 200 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=72.24 E-value=0.19 Score=24.60 Aligned_cols=274 Identities=13% Similarity=0.061 Sum_probs=119.1 Q ss_pred ccccchhhhhHHHHhhHHHHHHHHh-----hhhc----CC-ceeeecccccEEEEeecCcc-----eeeeeeCCCCCCCC Q lcl|NC_015719. 18 QSAADKLALFLKVFGGEVLTAFART-----SVTA----NR-HMQRQISSGKSAQFPVIGRT-----KAAYLQPGESLDDK 82 (344) Q Consensus 18 ~~~~d~~~l~~e~f~geV~~~f~~~-----s~~~----~~-~~~~~i~~G~tv~i~~iG~~-----t~~~~~~g~~~~~~ 82 (344) .+-+|.- +|..++..++.+. .+|. +. +.....-.|+-+..|..-.. +..++.....+. 
T Consensus 1 m~lsD~~-----vfN~~~~~a~~e~~~q~~~~fn~as~gai~l~~~~~~Gd~~~~pf~~~l~g~~~~~~~~~~~~~vt-- 73 (325) T protein:vir:95 1 MALSDLA-----VYSEYAYSAFSETLRQQVDLFNTATGGAIMLQSAAHQGDFSDVAFFAKVTGGLVRRRNAYGSGTVA-- 73 (325) T ss_pred Cchhhhh-----hhhhhhhhhhhhhhhhhHhhhhhcccceeEeccccccCceeeccccccccccccccccCCCCceec-- Confidence 3444443 4666666665543 1111 11 11111224777766665432 112222222221 Q ss_pred cCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccCce Q lcl|NC_015719. 83 RKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGLGKPS 162 (344) Q Consensus 83 ~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~~~~~ 162 (344) +..+...+..-++ -..-..+...|+...-...|.+++++++.|..+++...+.++..+.+....+-... ... T Consensus 74 ~~kitt~~~~av~-~~r~~g~~~~d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~~a~~~~-------~~~ 145 (325) T protein:vir:95 74 EKVLKHLVDTSVK-VAAGTPPVRLDPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLNVGLGSVYSALSQV-------SDV 145 (325) T ss_pred cceeccccceeeE-EecccCcccccHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc-------ccc Confidence 1223333322111 12222233445444445567888899999999998877777766654332211111 111 Q ss_pred eeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCC-CEEEeCHHHHHHHhccchhhhhc-cccccccccceeEEE Q lcl|NC_015719. 163 LLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPAND-RTFYTTPDVYSAILAALMPNAAN-YAALIDPERGSIRNV 240 (344) Q Consensus 163 ~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~g-R~~vv~P~~~~~Ll~~~~~~~~~-~~~~~~~~~G~Vg~i 240 (344) +.+..+..+..+ .....+.|.+|+.+|-++. +. ..+++.+.+|..|.+........ +...+.. .|... 
T Consensus 146 v~dis~~~~~~~----~~~s~~~l~~A~~klGD~~---~~l~~~~MHS~v~~~L~~~~L~~~~~~~~~~g~~---~i~t~ 215 (325) T protein:vir:95 146 VYDATANTDAAD----KLPTWNNLNNGQAKFGDQS---SQIAAWIMHSTPMHKLYGSNLTNGERLFTYGTVN---VVRDP 215 (325) T ss_pred eeeeecccCccc----ccccHHHHHHHHHHhcccc---cceeEEEEchHHHHHHHHhhccccccccccCCcc---ccccc Confidence 222222221110 0112466788888886642 22 45789999999998653322111 1111111 35678 Q ss_pred eCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeee---ecchhhhh Q lcl|NC_015719. 241 MGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERA---RRAEYQAD 317 (344) Q Consensus 241 ~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~---~~~~~~~d 317 (344) +|-+|+.+..+|....+..- .| .-+.|-+-|++.....++..... ++++. +. T Consensus 216 ~G~~VIVdD~~p~~~~g~~~------------~y------------tty~lg~GAi~~~~~~~~~~~~~~~~~~~~~-~~ 270 (325) T protein:vir:95 216 FGKLLVMTDSPNLFAAGTPN------------VY------------HILGLVPGGVLIGQNNDFDANEETKNGDENI-IR 270 (325) T ss_pred CCcEEEEeCCCCCCCccCce------------eE------------EEEEEecCeEEecCCCCccccccccCcccce-ee Confidence 89999999999875432210 00 01222333333333333222222 22211 11 Q ss_pred hhh-----hhhhhcCceeccccEEEEEecCCC Q lcl|NC_015719. 318 QII-----AKYAMGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 318 ~i~-----~~~~~G~~v~Rp~~~~~l~~~~~a 344 (344) .++ .++.+|.+- +....-.-+.-| T Consensus 271 ~~~~~~tf~lhp~G~sw---~~s~~g~sPt~a 299 (325) T protein:vir:95 271 TYQAEWSYNIGVKGFAW---DKANGGKSPTDA 299 (325) T ss_pred eeeeeeeEEeecceeee---ecccccCCcChH Confidence 112 223444333 111000000000 No 201 >protein:vir:104549 Length: 462 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214669;genbank:gi:61806310;genbank:GeneID:3294604 Probab=68.11 E-value=0.24 Score=23.96 Aligned_cols=302 Identities=16% Similarity=0.107 Sum_probs=130.9 Q ss_pred CCCcccc-----ccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhc--CCcee------eecccccEEEEeecCc Q lcl|NC_015719. 
1 MANMQGG-----QQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTA--NRHMQ------RQISSGKSAQFPVIGR 67 (344) Q Consensus 1 ma~~~~~-----~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~--~~~~~------~~i~~G~tv~i~~iG~ 67 (344) |..=++- ..++++...++.++. |-|-.|.++.|....-.. ..... ....+......+.-+. T Consensus 97 MTgPTGLIFAmRsrY~~~~~~~nq~gt------EAlfnEadt~fSg~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 170 (462) T protein:vir:10 97 MTGPTGLIFAMRSFYGSERRPANSDFR------EALFNEPNAGFSGGAGTGLSNYDPTASSSAVNDAEGANPGLLNDSPA 170 (462) T ss_pred CCcchhhhheeeeeccCCccccccccc------hhhhccCCcCccccccccccccccccccccccccccccceeecCCCc Confidence 3221110 111111222222221 333355555554211000 00000 0000011000110000 Q ss_pred ceeeeee--------CCCCCCCCcCCcccceEEEEeeeeeeec--------eeccchHHHHh--ChhHHHHHHHHHHHHH Q lcl|NC_015719. 68 TKAAYLQ--------PGESLDDKRKDIKHTEKTINIDGLLTAD--------VLIYDIEDAMN--HYDVRSEYTSQIGESL 129 (344) Q Consensus 68 ~t~~~~~--------~g~~~~~~~~~~~~~~~~l~iD~~~~~~--------~~Idd~D~~q~--~~d~~~~~~~~~~~aL 129 (344) .+...+. .++.+........-.+.-+.||+...-+ ..+.-..+.++ -.|.-.|++.=.+..+ T Consensus 171 g~~~~~~~~~GM~Ta~aE~lg~~s~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILSTEI 250 (462) T protein:vir:10 171 GTYEVTGDATGMATATAEALDDSSASTAFREMGFSIEKVTVTAKSRALKAEYSIEMAQDLKAIHGLDAESELANILSTEI 250 (462) T ss_pred cceecccccccccchhccccCCccCCcchhhceeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHH Confidence 0000000 0111110000113356678888765432 44554555555 4788889999999999 Q ss_pred HHHHHHHHHHHHHHhhhcccccccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHh-hc------CCCcCC Q lcl|NC_015719. 130 AMAADGAVLAELAGLINLADGVNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALT-KN------YVPAND 202 (344) Q Consensus 130 a~~~D~~i~~~~~~~a~~~~~~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld-~~------~VP~~g 202 (344) ..++.+-|++.|..-+.. ....+.....++++.... 
.+...++....+.-+++ |+ ----.+ T Consensus 251 mlEINReii~~l~~~a~~-----~k~~~~~~~Gv~dl~~~~-------~gr~~~e~~k~l~~qi~~ean~i~~~t~r~~~ 318 (462) T protein:vir:10 251 LAEINREVVRTIYVNAVK-----GAIANTATDGIFDLDVDS-------NGRWSVEKFKGLLFQIERDSNAIGQETRRGKG 318 (462) T ss_pred HHHhhHHHHhhhhhhhee-----eecccccccceeeecccc-------chHHHHHHHHHHHHHHHHHHHHHHHHhccccc Confidence 999999999988754322 111222222333331111 12333333333333332 11 111345 Q ss_pred CEEEeCHHHHHHHhccch--hh----hhccc-cccccccceeEEEe-CeEEEEecccccccccccccccccccccccccc Q lcl|NC_015719. 203 RTFYTTPDVYSAILAALM--PN----AANYA-ALIDPERGSIRNVM-GFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAF 274 (344) Q Consensus 203 R~~vv~P~~~~~Ll~~~~--~~----~~~~~-~~~~~~~G~Vg~i~-G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~ 274 (344) -|+|++|++.+.|-...- +. ....+ ..++.....+|.+. |++||.-+-....+...+.+ T Consensus 319 n~~i~S~~Va~~La~sG~l~~~p~~~~~~~~~~~d~~~~~~~G~l~~r~~vy~D~Y~~~ns~~dy~~------------- 385 (462) T protein:vir:10 319 NILICSADVASALGMAGVLDYAPGLQGNSALTGVDDTSSTLVGTLNGRIKVYVDPYSSNVADKHFYV------------- 385 (462) T ss_pred eEEEEchhHHHHhhhccchhccccccccccccccccccceeEEEecCceEEEEecccCCCcccceEE------------- Confidence 799999999998854432 21 11121 22344556677774 77888754221111111211 Q ss_pred ccccccccccceeEEEecHHHHhhhhhheeeeeeeecchhhhhhhhhhhhhcCceeccccEEEEEe-----cCCC Q lcl|NC_015719. 275 PATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEYQADQIIAKYAMGHGGLRPESAGALVF-----KAGA 344 (344) Q Consensus 275 ~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~-----~~~a 344 (344) .+|.+. ..-..||+|+|- +...++ ..-||+.|--.|-.+.|||-.+ .|=. ..+.- ..+. 
T Consensus 386 vG~KG~--~~~~~glfy~PY----v~l~~~---~~~dp~sfqP~~g~~tRY~l~~-NP~t-~~~~~~~~~~~~~~ 449 (462) T protein:vir:10 386 AGYKGT--SPYDAGLFYCPY----VPLQQV---RAINPNTFQPKIGFKTRYGMVS-NPFS-GGLTQGSGALTANA 449 (462) T ss_pred EEEeCC--cccccceeeccc----cccccc---cccCCccccceeeeeeeeeeee-cCCC-CCcCCccccccccC Confidence 111121 122367888885 333333 3349999999888888898643 2221 11111 1111 No 202 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=63.83 E-value=0.31 Score=23.37 Aligned_cols=299 Identities=12% Similarity=0.027 Sum_probs=123.3 Q ss_pred CC--CccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeeccc---ccEEEEee---cCcceeee Q lcl|NC_015719. 1 MA--NMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISS---GKSAQFPV---IGRTKAAY 72 (344) Q Consensus 1 ma--~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~---G~tv~i~~---iG~~t~~~ 72 (344) |. +..+.+..++-......++=+. |++-|...+.+..-.--+...++...+ ++ -+++.++. .|.. +- T Consensus 56 md~~~~~~~~~~~~~l~~~~~~g~~~--~l~~~~p~~i~~~tap~~a~~l~pv~t-~g~W~~~~~~~~v~e~~G~A--~~ 130 (379) T protein:vir:10 56 MDSNDIGPIPTPLSPLSPVSIPGLIQ--FLQNWLPGHVRILTAVREADEFLGLST-VGQWDDEQIVQRVLEGLGTA--QP 130 (379) T ss_pred hccccccccccccCccccccccchHH--HHHhhcchHHHHHhhhhhhhhhccccc-CCCceeeeEEEeeeeeeeee--EE Confidence 43 3333222211111112223344 788887554444443334455555544 21 24444444 4554 34 Q ss_pred eeCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHH---hChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|NC_015719. 73 LQPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAM---NHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLAD 149 (344) Q Consensus 73 ~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q---~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~ 149 (344) |..+++.+-..-..+-.++.+.. . -..+.+.+.+... +..|+-.+-.+.+..+|.+..|+..+.-. 
..+ T Consensus 131 ygd~~d~pl~d~~~~~~~r~v~~--~-~~g~~yg~~El~~Aa~~g~~l~~~Ka~aA~~ale~~~N~i~f~G~----~d~- 202 (379) T protein:vir:10 131 YTDGGNMALMSWTPTFETRTVVR--F-EAGLQVAPLEEARSSRVQVSSADEKRAMVGEALEVQRNRVAFYGY----NDG- 202 (379) T ss_pred eccccCCCeeeeeeeeeeeeeEE--E-EEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEee----cCC- Confidence 44343332211111111222211 1 1223444444322 47778777777788888887777654211 000 Q ss_pred ccccccccccC------ceeeecccccccccchhhHHHHHHHHHHHHHHHhhc--C--CCcCCC-EEEeCHHHHHHHhcc Q lcl|NC_015719. 150 GVNENIAGLGK------PSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKN--Y--VPANDR-TFYTTPDVYSAILAA 218 (344) Q Consensus 150 ~~~~~~~~~~~------~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~--~--VP~~gR-~~vv~P~~~~~Ll~~ 218 (344) .....|+-. ......++.+...=..+..+.|+++|..+...|-.+ . .|.+-+ .++++|..+..|..- T Consensus 203 --~~~~yGllNdP~l~a~~t~atg~~~~t~Wa~kT~~eI~~Di~~~~~~l~~qs~g~~~~~~~~~tL~LP~~~~~~L~~~ 280 (379) T protein:vir:10 203 --SGRTFGFLNDPNLPAYVAVPNGAGGSPLWAQKTTLEIIADLRNGLTALQVQSMGRIKSNKTPITIGIPNAYENYITTP 280 (379) T ss_pred --CcceEEEEeCCCCcccccccCCcccccccccCCHHHHHHHHHHHHHHHHHhhCCeecccccceeEEecHHHHHhhccc Confidence 011111111 111111111111111234566888888887776654 2 265444 688999999988643 Q ss_pred chhhhhccccccccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccc-c-ceeEEEecHHHH Q lcl|NC_015719. 219 LMPNAANYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNK-E-NVVGLFQHRSAV 296 (344) Q Consensus 219 ~~~~~~~~~~~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~-~~~gl~~~~~Av 296 (344) . .|+ ..+.+=.-.+.-+++|...+.|-..++.+.. .-.+.....+... 
+ ..+-+.++..-- T Consensus 281 n-----~~g--~Tvl~~lk~n~Pnl~i~t~pEL~~aggg~~~----------~~~~~~~~~~~~t~~~~~~~~~~p~k~~ 343 (379) T protein:vir:10 281 T-----ELG--YSVAQYMRESYPNVTFVSAPELNDANGGSSA----------IYYYADAVENNGTDDGRTWLQVVPTKMF 343 (379) T ss_pred c-----ccC--ccHHHHHHHhcCCcEEEEcccccccCCCccE----------EEEEeeccCCCccCCcceEEEecchhhh Confidence 2 121 1111100012446778887777432211100 0000000000000 0 011122222110 Q ss_pred hhhhhheeeeeeeecchhhhhhhhhhhh-hcCceeccccEEEEEecCCC Q lcl|NC_015719. 297 GTVKLKDLALERARRAEYQADQIIAKYA-MGHGGLRPESAGALVFKAGA 344 (344) Q Consensus 297 ~~~~~~~~~~e~~~~~~~~~d~i~~~~~-~G~~v~Rp~~~~~l~~~~~a 344 (344) . +.+ .++..++.+....+ .|.-+.||-+++-+. || T Consensus 344 ~------l~v----e~~~~~~~~~~~~rt~Gv~ir~P~Ai~~~~---G~ 379 (379) T protein:vir:10 344 T------LGV----EKKIKGYAEGYTNATAGAMLKRPFATYRQT---GA 379 (379) T ss_pred h------ccc----eecCceeEeccccceeeeeeecchhhheec---CC Confidence 0 000 01222233444444 567888898765443 33 No 203 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=63.57 E-value=0.31 Score=23.34 Aligned_cols=287 Identities=10% Similarity=-0.005 Sum_probs=122.4 Q ss_pred CCCccccccccccccccccccchhhhhHHH-HhhHHHHHH----HHhhhhcCCceeeecc--cccEEEEee---cCccee Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKV-FGGEVLTAF----ARTSVTANRHMQRQIS--SGKSAQFPV---IGRTKA 70 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~-f~geV~~~f----~~~s~~~~~~~~~~i~--~G~tv~i~~---iG~~t~ 70 (344) ||--. .+ ..|..+ .... .+|.. ....|+..+ ...-..+.++...+.- .-+++.+.. .|.. 
T Consensus 35 ~a~d~--~~--~~~~~~--~~~~--~~i~a~~~~~i~~~vy~~~~~~~~~~~l~pv~t~g~w~~~t~~y~~~e~~G~a-- 104 (339) T protein:vir:94 35 YAMDA--VN--LTPTLQ--TTAN--AGIPAWMTTFVDRRVIDIQLAPMAAAKIFPEVKKGDWTTTYGVFIIAEPVGQV-- 104 (339) T ss_pred hhccc--cc--cccccc--cccc--cchhhhhhhhhchhheeecccccchhhhcccccCCCCcccEEEEeeeecccce-- Confidence 11000 00 001111 1111 12322 223333222 1222334444444321 135666644 3544 Q ss_pred eeeeCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHH---hChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|NC_015719. 71 AYLQPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAM---NHYDVRSEYTSQIGESLAMAADGAVLAELAGLINL 147 (344) Q Consensus 71 ~~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q---~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~ 147 (344) +-|..+++.+-.....+-.++++.+=+ ..+.+..++... +..|+-..-.+.+..+|.+..|+..+.-- T Consensus 105 ~~ygd~ad~Pl~~~~v~~~~~~v~~~~---~g~~y~~~E~~~A~~~g~~l~~~Ka~aA~~al~~~~N~i~~~Gd------ 175 (339) T protein:vir:94 105 ATYSDWSANGMSKANVNFESRQNYRYQ---TWTEYGDLEMATYGEAGIDYVARQEISASLVMAKFANSSYLLGV------ 175 (339) T ss_pred EEcccccCCCcccccceeeEEeEEEEE---EEEeecHHHHHHHHhhCCChHHHHHHHHHHHHHHhhceEEeeee------ Confidence 345444444322222333344444333 334455555433 36778777778888888888887654211 Q ss_pred ccccccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcC----CCcCCCEEEeCHHHHHHHhccchhhh Q lcl|NC_015719. 148 ADGVNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNY----VPANDRTFYTTPDVYSAILAALMPNA 223 (344) Q Consensus 148 ~~~~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~----VP~~gR~~vv~P~~~~~Ll~~~~~~~ 223 (344) ......|+-..-.+....+...+=..+..+.|+++|..+...|.... 
-|..-..++++|..|..|-.-..+ + T Consensus 176 ---~~~~~~GLlN~P~l~~~v~~s~~Wa~kT~~eI~~Di~~~~~~l~~~s~g~~~~~~~~~L~LP~~~~~~L~~~n~~-~ 251 (339) T protein:vir:94 176 ---AGIANYGLMNDPSLPAPVAATVNWATAAPEDIANDVVAMVGRLISQSGGLITGQERMVMALAPSALNNVNRTNNF-G 251 (339) T ss_pred ---cccceEEEEeCCCccccccCCCCcccCCHHHHHHHHHHHHHHHHHhcCCeeeeccCcEEEecHHHHHhcccCCcC-C Confidence 11112222211111111111111112335678899998888886663 244446799999999988643211 0 Q ss_pred hccccccccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhhe Q lcl|NC_015719. 224 ANYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKD 303 (344) Q Consensus 224 ~~~~~~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~ 303 (344) .+ -..-+.+ +.-+++|...+.|-..++... ..+..+ ..-.+.+-+.++.. -+. T Consensus 252 ~T--vl~~lk~----n~pnl~i~~~~el~~a~g~~~------------~~~~~~---~~~~~~~~~~~p~~------~~~ 304 (339) T protein:vir:94 252 LS--AGAKIAQ----TYPNIQFVAVPEFDTASGRLV------------QLWVPE---VNGQPTGEVAFAEK------LRS 304 (339) T ss_pred cc--HHHHHHH----hcCCcEEEEccccccCCCceE------------EEEEEe---ccCCcceEEEcchh------hhc Confidence 00 0001111 234567777666632221110 011111 01112222333221 111 Q ss_pred eeeeeeecchhhhhhhhhhhh-hcCceeccccEEEEEec Q lcl|NC_015719. 
304 LALERARRAEYQADQIIAKYA-MGHGGLRPESAGALVFK 341 (344) Q Consensus 304 ~~~e~~~~~~~~~d~i~~~~~-~G~~v~Rp~~~~~l~~~ 341 (344) +.+| ++...+.+....+ .|.-+.||.+++-+.== T Consensus 305 lpvq----~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 339 (339) T protein:vir:94 305 HSIE----RYSTTTRQKHSGATFGAVIYQPWAVTQELGV 339 (339) T ss_pred cccE----EcCceEEecceeeeeeEEEEccceeeeeecC Confidence 1111 2333455666666 67899999987664333 No 204 >protein:vir:79078 Length: 307 # NCBI annotation: gp8 # Family: family:all:908 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111208;genbank:gi:134288798;genbank:GeneID:4960752 Probab=61.29 E-value=0.36 Score=23.04 Aligned_cols=285 Identities=11% Similarity=0.049 Sum_probs=104.6 Q ss_pred CCCccccccccccccccccccch-h-hhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcceee--e--ee Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADK-L-ALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRTKAA--Y--LQ 74 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~-~-~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~--~--~~ 74 (344) |.+++. .|+= |+ + ++.+.-+.. .|-... +.+.+.+ ...+.+++..|+-... + .. T Consensus 1 m~~~~~-----~~~~------dp~LT~~A~gy~n~----~~Iad~-lfP~vpV----~~~~~k~~~f~~e~f~~~~t~ra 60 (307) T protein:vir:79 1 MGRLSK-----LRIV------DPVLTNLAIGYTNA----EFIGQT-LMPVVEV----EKEGGKIPKFGKESFRLYQTERA 60 (307) T ss_pred CCCCCC-----Cccc------CHHHHHHHhhccch----hhhhhh-cCCcccc----cccccceeeeccccccccccccc Confidence 666553 2221 21 1 111111111 121122 2233332 2333444444432211 1 11 Q ss_pred CCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|NC_015719. 75 PGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNEN 154 (344) Q Consensus 75 ~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~ 154 (344) ++.... ....-..+..++.+++.. -...||+.+...+.||++...++.....+.+..+-.+. .++-. 
T Consensus 61 ~~~~~~-~v~~~~~~~~~~~~~~~~-l~~~id~r~~~~~~~~~~~~Av~~l~d~I~l~~E~~~A----~l~~~------- 127 (307) T protein:vir:79 61 LRAKSN-RMNPEDIDSVDVNLDEHD-LEYPIDYREDQESAFPLEQAAVQTATDAIQLRREKMIA----DLSQN------- 127 (307) T ss_pred cCCCcc-eeeeeccccccccccccc-hhhcccchhcCCCCCCHHHHHHHHHHHHHHhHHHHHHH----HHhcc------- Confidence 222111 111111233455555543 23568888888889998776555544444333333222 11111 Q ss_pred cccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhc-cccccccc Q lcl|NC_015719. 155 IAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAAN-YAALIDPE 233 (344) Q Consensus 155 ~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~-~~~~~~~~ 233 (344) ......++.+++.++..-+++ ....+..|.++++.+.+..- ..--.+|+++..|..|+.++++.+.- +.+.+.+. T Consensus 128 ~~~y~~~~k~tLsgt~~Wsd~---~sDPi~di~~~~~ai~~~~g-~~Pn~~vlg~~a~~~l~~h~~i~~~lk~~~~g~it 203 (307) T protein:vir:79 128 PSSYAAGNKKQLSATEKFTAA---NSDPVGVIEDGKEAIRTKIG-RRPNTMVIGASAYKTLKAHPQLIEKIKYSMKGIVT 203 (307) T ss_pred ccccCCCceEEEccCcccCCC---CCCcHHHHHHHHHHHHHhhC-CccceEEeCHHHHHHHhcCHHHHHHhcCccccccC Confidence 111223344444433222221 12235667777777766533 22368999999999999999988653 33322222 Q ss_pred cceeEEEeCeE-EEEecccccccccc-ccccccccccccccccccccccccccceeEEEecHHHHh-------------h Q lcl|NC_015719. 234 RGSIRNVMGFE-VVEVPHLTAGGAGD-DRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVG-------------T 298 (344) Q Consensus 234 ~G~Vg~i~G~~-V~~sn~lp~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~-------------~ 298 (344) .=.+..+.|+. |+.-....+..... ..+.. +.+.|.+.+.+.+ + T Consensus 204 ~~~la~l~~v~~V~vg~a~y~~~~~~~~~iw~---------------------~~~~l~y~~~~~~~~~~~~~~ps~Gyt 262 (307) T protein:vir:79 204 VDLLKEIFEVENIAVGEAIYADDKDRFTDIWG---------------------ANIVLAYVPLQRGGQQRTPYEPSYGYT 262 (307) T ss_pred HHHHHHHhCceeEEEeeeeeecccccchhcCC---------------------CceEEEecccccCCCCCccccccccee Confidence 22334566765 33322222211100 00000 0011111111100 0 Q ss_pred hhhheeeeeeeecchhhhhhhhhhhhhcCceeccccEEEEEecCC Q lcl|NC_015719. 
299 VKLKDLALERARRAEYQADQIIAKYAMGHGGLRPESAGALVFKAG 343 (344) Q Consensus 299 ~~~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~ 343 (344) +....-.....+.+...+|.|+.....=-.++=||+---|.=.-| T Consensus 263 ~~~~g~~~~d~~~~~~~~~~vrv~~~~~~~i~~~~~G~li~~~v~ 307 (307) T protein:vir:79 263 LRKKGNPVVDTRIEDGKLELVRATDIFRPYLLGADAGYLISGING 307 (307) T ss_pred EEecCceEEecccCCCceeEEeecccccceeeccccchhhccCCC Confidence 000000011111122223332222211111111111000000000 No 205 >protein:vir:107882 Length: 307 # NCBI annotation: gp34 # Family: family:all:908 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024707;genbank:gi:48696944;genbank:GeneID:2845970 Probab=51.40 E-value=0.58 Score=21.87 Aligned_cols=288 Identities=12% Similarity=0.033 Sum_probs=101.9 Q ss_pred CCCccccccccccccccccccchh--hhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcceeeee-eCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKL--ALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRTKAAYL-QPGE 77 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~--~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~-~~g~ 77 (344) |.+++- .|+= |+. ++-+--+.. .|-..++ .+.+.+.. ++++-.+|+.-.-....+. .++. T Consensus 1 m~~~~~-----~~~~------dp~LT~~A~gy~n~----~~ia~~l-~P~vpv~~-~~~k~~~f~~eaF~~~~t~r~~~~ 63 (307) T protein:vir:10 1 MGRLSK-----LRIV------DPVLTNLAIGYTNA----EFIGQSL-MPVVEVEK-EGGKIPKFGKESFRLYKTERALRA 63 (307) T ss_pred CCCCCC-----Cccc------ChhHHHHHHhhcch----hhhhhhc-CCcccccc-cccceeeECcccccchhhhcccCC Confidence 555442 2221 110 011111111 1212222 33333211 2344444432111000111 1111 Q ss_pred CCCCCcCCcccceEEEEeeeeeeeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc Q lcl|NC_015719. 78 SLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAG 157 (344) Q Consensus 78 ~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~ 157 (344) .. ...++-..+.....+-+.- -...||+-+...+.||++....+.....|.+..+-.+.. ++ - +... 
T Consensus 64 ~~-~~v~~~~~~~~~~~~~~~~-L~~~id~r~~~~~~~~~~~~av~~l~d~I~l~~E~~~A~-l~---~-------~~~~ 130 (307) T protein:vir:10 64 RS-NRMNPEDLGSIDIVLDEHD-LEYPIDYREDQESAFPLEQAAVQTATEAIQLRREKMVAD-LA---Q-------NPNS 130 (307) T ss_pred Cc-ceeeccccccccccccccc-ccccCChhhcCCCCCCHHHHHHHHHHHHHHHHHHHHHHH-Hh---c-------Cccc Confidence 11 1111111112222222221 224577777778899988777666665555444433321 11 1 1111 Q ss_pred ccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhc-cccccccccce Q lcl|NC_015719. 158 LGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAAN-YAALIDPERGS 236 (344) Q Consensus 158 ~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~-~~~~~~~~~G~ 236 (344) ...++.+.++++..-+++. ...+..|.++++.+.+..- ..-..+++++..|..|+.++++.+.- +.+.+.+..=. T Consensus 131 y~~~~k~tLsGt~~Wsd~~---sDPi~di~~~~~ai~~~~g-~~Pn~~vlg~~a~~al~~hp~i~e~lk~~~~g~it~~~ 206 (307) T protein:vir:10 131 YAGGNKKQLSATEKFTAAG---SDPVGVIEDGKEAIRTKIG-RRPNTMVIGASAYKTLKAHPQLIEKIKYSMKGIVTVDL 206 (307) T ss_pred cCCCceEEeccccccCCCC---CCcHHHHHHHHHHHHhhhC-CccceEEeCHHHHHHHhcCHHHHHHhCCccccccCHHH Confidence 2233444444433222211 2235667777777766533 22368999999999999999988643 33322222223 Q ss_pred eEEEeCeEEEEecccc-ccccccc-cccccccccccccccccccccccccceeEEEecHHHHh-------------hhhh Q lcl|NC_015719. 237 IRNVMGFEVVEVPHLT-AGGAGDD-RPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVG-------------TVKL 301 (344) Q Consensus 237 Vg~i~G~~V~~sn~lp-~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~-------------~~~~ 301 (344) +..+.|++.+....-- +...... .+. .+.+.|.+.+...+ ++.. T Consensus 207 la~ll~v~~i~vg~a~~~~~~~~~~~iw---------------------~~~~vl~yv~~~~~~~~~~~~epsfGyT~~~ 265 (307) T protein:vir:10 207 LKEIFEVENIAVGEAIYADDKDRFTDIW---------------------GANIVLAYVPLQRGGQQRTPYEPSYGYTLRK 265 (307) T ss_pred HHHHhCceeEEEeeeeeeccCCccceeC---------------------CCceEEEecccccCCCCCcccccccceeEEE Confidence 3566777665543211 1110000 000 00011111111000 0000 Q ss_pred heeeeeeeecchhhhhhhhhhhhhcCceeccccEEEEEecCC Q lcl|NC_015719. 
302 KDLALERARRAEYQADQIIAKYAMGHGGLRPESAGALVFKAG 343 (344) Q Consensus 302 ~~~~~e~~~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~ 343 (344) +.-.+...+.+...+|.|+..-..=--++=|++---|.=.-| T Consensus 266 ~g~~~~d~~~~~~~~~~~r~~~~~~~~i~~~~~G~li~~~~~ 307 (307) T protein:vir:10 266 KGNPVVDTRIEDGKLELVRSTDIFRPYLLGADAGYLISGING 307 (307) T ss_pred cCCeEeeceecCCceeEEeccccccceeecccccceeccCCC Confidence 111111112222222222211111111111111000000000 No 206 >protein:vir:270 Length: 341 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536650;genbank:gi:17975128;genbank:GeneID:929084 Probab=48.67 E-value=0.66 Score=21.56 Aligned_cols=292 Identities=8% Similarity=0.016 Sum_probs=118.6 Q ss_pred CCC-cccc-----ccccccccccccccchhhhhHHHH------hhHHHHHHHHhhhhcCCceeeecc--cccEEEEeecC Q lcl|NC_015719. 1 MAN-MQGG-----QQLGTNQGKGQSAADKLALFLKVF------GGEVLTAFARTSVTANRHMQRQIS--SGKSAQFPVIG 66 (344) Q Consensus 1 ma~-~~~~-----~~~~~~~g~~~~~~d~~~l~~e~f------~geV~~~f~~~s~~~~~~~~~~i~--~G~tv~i~~iG 66 (344) |+. |+.- .++..+..+.++-.|.. +.| ...+..+.++.|-|+..++...+. .|..|-+-.-| T Consensus 1 m~~~m~~~tr~~~~~y~~~~A~~ngv~~~~----~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g 76 (341) T protein:vir:27 1 MSQILTQSAREYMDNFAQQLAKSYGVSNVA----ELFNVSPQLETKLRAAITESAEFLKMITVTTVDQIEGQVVDVGVSG 76 (341) T ss_pred CcccccHHHHHHHHHHHHHHHHHcCccccc----ceEeecHHHHHHHHHHHHhhHHhhhcCccccccceeeeEeeccccc Confidence 553 2211 11222333333332322 344 345677888899999888876554 47777776666 Q ss_pred cceeeeeeCCCCCCCCcCCcccceEEEEeeeeeeeceec-cchHHHHhC----hhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015719. 67 RTKAAYLQPGESLDDKRKDIKHTEKTINIDGLLTADVLI-YDIEDAMNH----YDVRSEYTSQIGESLAMAADGAVLAEL 141 (344) Q Consensus 67 ~~t~~~~~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~I-dd~D~~q~~----~d~~~~~~~~~~~aLa~~~D~~i~~~~ 141 (344) ..+-..-+ +.-+- ++.-+...+.+-+.-+..+.= ..+| +|++ .|+...+.......+|. 
|...+..- T Consensus 77 ~iagrtdt--~R~~r---~~~l~~~~Y~c~qtn~dt~i~y~~lD-aWA~~g~~~dF~~r~~~~i~~~~AL--D~i~IGfn 148 (341) T protein:vir:27 77 LYTGRKAG--GRFTK---QVGVGGHKYKLAETDSCAAITWAMLC-QWANQGGRDQFMKHLTEFSNQMFAL--DIMRIGWN 148 (341) T ss_pred ceeeccCC--Cceec---ccccCCcceEEEEeeeeeeecHHHHH-HHHhcCCChHHHHHHHHHHHHHHhh--hhhhhccc Confidence 55432211 11111 112233344444443333222 2343 4553 66766666655555543 33332221 Q ss_pred HHhhhcccccccccc-------------cccCceeeecccccccccchhhHHHHHHHHHHHHHH-HhhcCCCcCCCEEEe Q lcl|NC_015719. 142 AGLINLADGVNENIA-------------GLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAA-LTKNYVPANDRTFYT 207 (344) Q Consensus 142 ~~~a~~~~~~~~~~~-------------~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~-Ld~~~VP~~gR~~vv 207 (344) -..++...-+..+|- ......++.-+.....+ ...++++=..+.++... +++..--..+.++|| T Consensus 149 Gts~A~~Td~~anPllqDVNkGWlQ~~Re~a~~rVl~~~~~~~g~--~gdy~nLDAlV~D~~~~lI~~~~~~d~dLVviv 226 (341) T protein:vir:27 149 GVSAEADTDPSANPLGQDVNEGWIAFVKNRKASQVVDVDVYFDET--NGDYRTLDAMASDIINNQIHPMFRNDPRLTVFV 226 (341) T ss_pred ceeeccCCChhhcccccccchhHHHHHHhhcccceeccceeeccC--CCccccHHHHHHHHHhcccChHHhcCCCEEEEE Confidence 111111111111110 01111222211111110 11122222223344443 455544344567777 Q ss_pred CHHHHHHHhccchh--hhhccccccccccce-eEEEeCeEEEEecccccccccccccccccccccccccccccccccccc Q lcl|NC_015719. 208 TPDVYSAILAALMP--NAANYAALIDPERGS-IRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKE 284 (344) Q Consensus 208 ~P~~~~~Ll~~~~~--~~~~~~~~~~~~~G~-Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 284 (344) . ..|+.++.| ++.....+..+.--. 
..++.|.+.|..|.+|..+.--+.+ .| T Consensus 227 G----~dLla~k~~~l~n~~~~ptE~~Aa~~i~k~iGGlpa~~~PffP~~~~lVT~L-------~N-------------- 281 (341) T protein:vir:27 227 G----SGLIGAAQAKLYDKADKPSEQIAAQKLDKTIAGRPAYVPPFLPDNAMVVTIP-------EN-------------- 281 (341) T ss_pred c----hhhhhhhhhhhhccCCCCHHHHHHHHHHHhhCCCeEEEccccCCCceEEeec-------cc-------------- Confidence 7 556665543 332211111111111 2488999999999999765432211 11 Q ss_pred ceeEEEecHHHHhhhhhheeeeeeeecchhhhhhhhhhhhhcCceeccccE-----EEEEecCCC Q lcl|NC_015719. 285 NVVGLFQHRSAVGTVKLKDLALERARRAEYQADQIIAKYAMGHGGLRPESA-----GALVFKAGA 344 (344) Q Consensus 285 ~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~Rp~~~-----~~l~~~~~a 344 (344) ..+-|+..+. + -+++...+.+++.++-.+ |+. =...|. ..++++.+| T Consensus 282 --LsIY~Q~gs~----R--R~~~d~p~r~rie~yes~-YvV----Edyg~~~~~~~~~vkl~~~~ 333 (341) T protein:vir:27 282 --LQVLTQHGTA----Q--RKAKHESDRKRSKTHTGA-WKV----TQWVCWKRSPLTTQKKSTSA 333 (341) T ss_pred --eEEEEecCcE----E--EEEEeccccccccchhhh-hee----ehhhhhhhccccccccCccc Confidence 1122333211 0 122222222334443222 111 111222 224555555 No 207 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=48.54 E-value=0.67 Score=21.55 Aligned_cols=280 Identities=14% Similarity=0.048 Sum_probs=111.4 Q ss_pred CCCccccccccccccccccccchhhhhHHHHhhHHHHHHHHh-hhhcCCceeeecccccEEEEeecCcc-eeeeeeCCCC Q lcl|NC_015719. 1 MANMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFART-SVTANRHMQRQISSGKSAQFPVIGRT-KAAYLQPGES 78 (344) Q Consensus 1 ma~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~-s~~~~~~~~~~i~~G~tv~i~~iG~~-t~~~~~~g~~ 78 (344) |.-.+ ....+|+. -|.....++|+.. +-...+.+. .-+-.++-+...+|.. .+.... 
|+ T Consensus 1 m~it~---------------~~l~~l~~-~~~~~~~~~y~~a~~~~~~~a~~-~~sdf~~~~~~~lg~~p~l~e~~-Ge- 61 (302) T protein:vir:10 1 MLINK---------------QSLNAAFV-AIKTIFNNAFAAAPTTWQKIAME-VPSNTSSNDYKWLSTFPKMRRWI-GA- 61 (302) T ss_pred CcccH---------------HHHHHHHH-HHHHHHHHHHHhhhhhhhceeee-cCCCcceeeceecCCCCCccccc-cc- Confidence 32211 11222222 4455555555542 222333221 1122333444444432 111111 21 Q ss_pred CCCCcCCcccceEEEEeeeeeeeceeccc--hHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc-- Q lcl|NC_015719. 79 LDDKRKDIKHTEKTINIDGLLTADVLIYD--IEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNEN-- 154 (344) Q Consensus 79 ~~~~~~~~~~~~~~l~iD~~~~~~~~Idd--~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~-- 154 (344) .. ...+....-++.+.++- -.+.|+. +... ++..-..+.+++|++-++..|+.++..|..+.+..-..... T Consensus 62 ~~--~~~l~~~~~~i~~~~~g-~~v~i~R~~i~nD--dlg~~~~~~~~~G~aaa~~~~~lv~~~L~~g~~~~~~DG~~fF 136 (302) T protein:vir:10 62 KV--VKNLKAYKYVVENEDFE-ATVEVDRNDIEDD--QIGIYSPQAKMAGYSAAQLPDELVYEAVNGAFTKPCFDGQYFI 136 (302) T ss_pred ee--eccccccceeEEeeccc-ceecccHHhhccc--ccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCCcccCCccee Confidence 11 11233344455544432 2233432 2222 34677888999999999999999998775422210000000 Q ss_pred cccccCcee--eecccccccccchhhHHHHHHHHHHHHHHH-hhcCCCc--CCCEEEeCHHHHHH---Hhccchhhhhcc Q lcl|NC_015719. 155 IAGLGKPSL--LEVGAKADLTDPVKLGQAVIAQLTIARAAL-TKNYVPA--NDRTFYTTPDVYSA---ILAALMPNAANY 226 (344) Q Consensus 155 ~~~~~~~~~--i~~~~~~~~t~~~~~~~~i~~~l~~a~~~L-d~~~VP~--~gR~~vv~P~~~~~---Ll~~~~~~~~~~ 226 (344) ...|..+.. -+.+.+.-...........+++.+.++.++ +..+-|. ..+++||+|..... 
|+.+.+..+ T Consensus 137 ~~dH~~g~~~~~N~g~~~~~~~~~~l~~~~~~aa~~am~~~k~~~G~~L~i~P~~LiVp~~le~~A~~ll~~~~~~~--- 213 (302) T protein:vir:10 137 DTDHPVGDASVSNKGTAPLSNASQAAAKAGYGAARTAMKKFKDEEGRSLNVSPNVLLVGPALEDVAKMLLTNPKLAD--- 213 (302) T ss_pred cccccccccccccccchhhhhcccccchHHHHHHHHHHHHHhhhcccccccCCCEEEecchhHHHHHHHhhccccCC--- Confidence 011111100 001100000000111122344333333333 2223332 24799999987653 444443321 Q ss_pred ccccccccceeEEEeCeEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeee Q lcl|NC_015719. 227 AALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLAL 306 (344) Q Consensus 227 ~~~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~ 306 (344) ++.....|. ++++.++.|..++ +||-.... ...-.++++ ..+.+.+ T Consensus 214 -g~~Np~~g~------~~~vv~p~L~s~~-------------------aWyL~a~~-~~i~~~~l~-------g~~~P~~ 259 (302) T protein:vir:10 214 -NTPNPYVGT------AELVVDGRIESDT-------------------AWFLLDTT-KPVKPFIFQ-------PRKQPEF 259 (302) T ss_pred -CCcceeccc------eEEEEeeccCCCC-------------------ceEEEecC-CccceEEEc-------CccccEE Confidence 222222232 5788888874211 12211100 001112221 1233456 Q ss_pred eeeecchhhhhhhhhhhhhcCceeccccEEEE-------EecCCC Q lcl|NC_015719. 
307 ERARRAEYQADQIIAKYAMGHGGLRPESAGAL-------VFKAGA 344 (344) Q Consensus 307 e~~~~~~~~~d~i~~~~~~G~~v~Rp~~~~~l-------~~~~~a 344 (344) |...+++.-+=.++-.+.||+ +.-+.+-+ .-+++| T Consensus 260 ~~~~~~~~dgv~~k~~~d~Gv---d~R~~~G~~~wq~a~~s~g~~ 301 (302) T protein:vir:10 260 VSQVNLDSDDVFNLRKLKFGA---EARAAAGYGFWQLAYGSTGTG 301 (302) T ss_pred EeccCCCCCceEEEEEEEEee---eeeeecchhhhhhhhccCccC Confidence 665555555555666666774 22223322 222233 No 208 >protein:vir:99888 Length: 309 # NCBI annotation: capsid protein # Family: family:all:908 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164075;genbank:gi:56692607;genbank:GeneID:3192616 Probab=43.00 E-value=0.86 Score=20.94 Aligned_cols=284 Identities=11% Similarity=-0.019 Sum_probs=102.9 Q ss_pred CCCcccccc-ccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcceee----e-ee Q lcl|NC_015719. 1 MANMQGGQQ-LGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRTKAA----Y-LQ 74 (344) Q Consensus 1 ma~~~~~~~-~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~----~-~~ 74 (344) |+|--=..+ +.|....+ ..+ + .|...++| +.+++ ...+.+++..|..+.. + .. T Consensus 1 ~~~~~~~~dp~LT~~A~g--y~n----------~----~~Ia~~l~-P~vpV----~~~~~~~~~f~~~e~F~~~~t~r~ 59 (309) T protein:vir:99 1 MSNAPFPIDPELTAIAIA--YRN----------G----RMISDEVL-PRVPV----GKQEFKFWKYDLAQGFTVPETLVG 59 (309) T ss_pred CCCCCcCcCHhHHHHHhh--ccC----------h----hhhhhhcC-Ccccc----Cccccceeeechhhcccccchhhc Confidence 555211000 11211111 011 1 12222332 44432 2333445555554321 1 11 Q ss_pred CCCCCCCCcCCcccceEEEEeeeee-eeceeccchHHHHhChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|NC_015719. 75 PGESLDDKRKDIKHTEKTINIDGLL-TADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNE 153 (344) Q Consensus 75 ~g~~~~~~~~~~~~~~~~l~iD~~~-~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~ 153 (344) ++.... .-+...++.++.+.+.- ...+...++.++...||++....+.....|....+..+.. ++ -. 
T Consensus 60 ~~~~~~--~v~~~~~~~~~~~~~~~L~~~i~~~~~~~a~~~~d~~~~Av~~l~~~i~l~rE~~~A~-lv---~~------ 127 (309) T protein:vir:99 60 RKSKPN--EVEFSATDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSK-LV---FS------ 127 (309) T ss_pred cCCCcc--eEeecccCceeeecccceeecCCchhhhhccCCCCHHHHHHHHHHHHHHHHHHHHHHH-Hh---cC------ Confidence 222211 11223334444443332 2233333444666689998887777666555554433221 21 11 Q ss_pred ccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhc-ccccc-- Q lcl|NC_015719. 154 NIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAAN-YAALI-- 230 (344) Q Consensus 154 ~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~-~~~~~-- 230 (344) ......++.++++++..-+++. ...+..|..++.++. --| -.++++...|..|+.++++.+.- +.+.+ T Consensus 128 -~a~y~~~~k~~Lsgt~~wsd~~---SDPi~~i~~~~~~~g--~~P---N~~vlg~~~~~~l~~hp~i~~~ik~~~~~~g 198 (309) T protein:vir:99 128 -PNSYAAGNKTTLSGADQWSDPT---SNPLPVITDALDSVI--LRP---NIGVLGRRTATILRRHPKIVKAYNGSLGDEG 198 (309) T ss_pred -hhhcCCCceEEecCccccCCCC---CCcHHHHHHHHHhhC--CCc---ceEEechHHHHHHhhCHHHHHHhcCCCcccc Confidence 1112233444444433222221 123444555554431 123 47999999999999999998764 43321 Q ss_pred ccccceeEEEeCe-EEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeee- Q lcl|NC_015719. 231 DPERGSIRNVMGF-EVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALER- 308 (344) Q Consensus 231 ~~~~G~Vg~i~G~-~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~- 308 (344) .+..-++..++|+ +|+......+++..+. . ..+... -++.+.|++......+.+. 
++..- T Consensus 199 ~it~~~la~l~~ve~V~vg~a~~n~a~~g~----~-------~~~~~i-----wg~~~~L~y~~~~~~~~~~--ps~G~t 260 (309) T protein:vir:99 199 MVPMAFLQELLELDAIYIGEARLNIARPGQ----N-------PNLIRA-----WGPHASFIYRDRLADTRNG--TTFGLT 260 (309) T ss_pred ccCHHHHHHHhCcceEEeecceeecccccc----c-------cccccc-----cCCcEEEEEcCCCCCCccc--ccccce Confidence 2223344577888 4665443332211000 0 000000 0111222222111111110 00000 Q ss_pred -eecchhhhhhhhhhhh-hcCceecccc----------EEEEEecCCC Q lcl|NC_015719. 309 -ARRAEYQADQIIAKYA-MGHGGLRPES----------AGALVFKAGA 344 (344) Q Consensus 309 -~~~~~~~~d~i~~~~~-~G~~v~Rp~~----------~~~l~~~~~a 344 (344) .|..+..+..++-.+- -|...+|-.- +|-|...+.| T Consensus 261 ~~~~~r~~g~~~d~~~~~~g~~~vr~~~~~k~~i~~~d~G~li~~~va 308 (309) T protein:vir:99 261 AQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAVA 308 (309) T ss_pred eecccccCCceeeeeeccCCceEEEEeccccchhcchhcchhhhhccc Confidence 1111112211111111 1112222100 1111111111 No 209 >protein:vir:78148 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:4955 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294802;genbank:gi:149882823;genbank:GeneID:5309176 Probab=41.25 E-value=0.62 Score=21.73 Aligned_cols=116 Identities=11% Similarity=0.010 Sum_probs=59.0 Q ss_pred EeCHHHHHHHhccchhhhhccc-ccccccccee-EEEeCeEEEEeccccccccccccccccccccc---ccccccccccc Q lcl|NC_015719. 206 YTTPDVYSAILAALMPNAANYA-ALIDPERGSI-RNVMGFEVVEVPHLTAGGAGDDRPEEGTDASN---QKHAFPATGGK 280 (344) Q Consensus 206 vv~P~~~~~Ll~~~~~~~~~~~-~~~~~~~G~V-g~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~---~~~~~~~~~~~ 280 (344) +|+--+|+.++.++-....--- ..+...+|.. -+++|..++.|+|+|.+. ..+... .-.++ +.--.+.|.+. 
T Consensus 1 vvsdlqfA~~~g~~v~~~aLpRE~aNp~ltG~lpV~~~GltWl~tpnlpg~~--a~vlDs-t~lGgmaDE~l~~Pgya~~ 77 (123) T protein:vir:78 1 MLSGAQFAKLIGILVDDKALPREQANIVLTGSLPVSAYGLTWVTSRHITGTD--PWLFDV-EQLGGMADEKLLSPEFAPA 77 (123) T ss_pred CcchhhHHHHhcchhcccccccccCCceEecCcceeeeceeeeecCCCCCCc--cceeeh-hhhccccccccCCCcccCC Confidence 5555567777765332211100 1133445544 479999999999999432 111110 00000 00111111110 Q ss_pred ccccceeEEEecHHHHhhhhhheeeeeeeecch--hhhhhhhhhhhhcCceeccccEEEEEecCC Q lcl|NC_015719. 281 VNKENVVGLFQHRSAVGTVKLKDLALERARRAE--YQADQIIAKYAMGHGGLRPESAGALVFKAG 343 (344) Q Consensus 281 ~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~--~~~d~i~~~~~~G~~v~Rp~~~~~l~~~~~ 343 (344) ....+++...|..+ .-++.++++.+-=.-++.|.+.+-|+=..- T Consensus 78 -------------------~~~Gvevkt~Red~~~nD~yriRaRRvTvpiv~EP~Agv~ltg~g~ 123 (123) T protein:vir:78 78 -------------------GNTGVEASTERAHQGVKDGYLVRGRRNTVAVVTEPMAGVRLTGTGL 123 (123) T ss_pred -------------------CCcceeEEeeccccCCCCceEEeeeecceeEEecCccceEEeeecC Confidence 01113455556555 667778888887778888877766554444 No 210 >protein:vir:5670 Length: 514 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899609;genbank:gi:34419596;genbank:GeneID:2546039 Probab=34.20 E-value=1.3 Score=19.95 Aligned_cols=299 Identities=14% Similarity=0.141 Sum_probs=123.7 Q ss_pred CCC------ccccccccccccccc-------------cccchhhhhHHHHhhHHHHHHHH--hhhhcC-------Cceee Q lcl|NC_015719. 1 MAN------MQGGQQLGTNQGKGQ-------------SAADKLALFLKVFGGEVLTAFAR--TSVTAN-------RHMQR 52 (344) Q Consensus 1 ma~------~~~~~~~~~~~g~~~-------------~~~d~~~l~~e~f~geV~~~f~~--~s~~~~-------~~~~~ 52 (344) ++. ++.+.- +..|++. ..++........-.|.+...+.. ..+... ..... 
T Consensus 133 ~tg~EAf~~~nEadt--~fSG~~~~~~~~~~~~~~~~~~G~~~~~~~t~~~gd~~~~~~~~~~~~~~~~~~~~~~t~~~~ 210 (514) T protein:vir:56 133 LTGAEAFHPTRQADA--SFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTD 210 (514) T ss_pred cccccccccccccCc--Ccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccc Confidence 110 000000 0011100 00011000000001111000000 000000 00000 Q ss_pred ecccccEEEEeecCc--ceeeeeeCCCC---CCCCcCCcccceEEEEeeeeeeec--------eeccchHHHHh--ChhH Q lcl|NC_015719. 53 QISSGKSAQFPVIGR--TKAAYLQPGES---LDDKRKDIKHTEKTINIDGLLTAD--------VLIYDIEDAMN--HYDV 117 (344) Q Consensus 53 ~i~~G~tv~i~~iG~--~t~~~~~~g~~---~~~~~~~~~~~~~~l~iD~~~~~~--------~~Idd~D~~q~--~~d~ 117 (344) .+.+|. +..+|. .+. .++. +.+. ....-.+.-+.||+...-+ ..|.-..+.++ -.|. T Consensus 211 ~~a~~~---~y~~~~Gm~Ta----~aEal~~lggs-~~~~f~EMaFsIdK~tVtAKSRaLKAEYTiELAQDLKAVHGLDA 282 (514) T protein:vir:56 211 GVAGGL---LVEIDAGMATS----QAELQENFNGS-SNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDA 282 (514) T ss_pred ccccch---hhhhhhhhhhh----hhhhcccCCCC-cccccceeeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCCh Confidence 111111 111111 111 0111 1111 1123356677887764432 44555555566 4888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccCceeeecccccccccchhhHHHHHHHHHHHHHHHhhc- Q lcl|NC_015719. 118 RSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKN- 196 (344) Q Consensus 118 ~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~i~~~l~~a~~~Ld~~- 196 (344) -.|++.=.+..+..++++-|++.|...+.-...- ...+.....+.++....+... +...++.+..+..++.+. 
T Consensus 283 EtELsNILSTEImlEINReii~~l~~~atv~~~~--~~~~~~~~G~~d~~~~~d~~~----~~~~~e~~~~l~~~i~~~a 356 (514) T protein:vir:56 283 DAELSGILANEVMVELNREIVNLVNSQAQIGKSG--WTQGAGAAGVFDFSDAVDVKG----ARWAGEAYKALLIQIEKEA 356 (514) T ss_pred HHHHHHHHHHHHHHHhhHHHHHHHHhheeehhcc--ccccccccccccccccccccc----chHHHHHHHHHHHHHHHHH Confidence 8999999999999999999988876544221111 111122222333322222111 122233344333333322 Q ss_pred C-C----C-cCCCEEEeCHHHHHHHhccchh--------hhhcccc--ccccccceeEEEeCeEEEEecccccccccccc Q lcl|NC_015719. 197 Y-V----P-ANDRTFYTTPDVYSAILAALMP--------NAANYAA--LIDPERGSIRNVMGFEVVEVPHLTAGGAGDDR 260 (344) Q Consensus 197 ~-V----P-~~gR~~vv~P~~~~~Ll~~~~~--------~~~~~~~--~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~ 260 (344) + + - -.+.|+|.+|.+.+.|-...-+ ..+.... ...+.-|.+. .|++||.-++.|.. +. T Consensus 357 n~i~~~T~rg~gn~~i~S~~Va~~L~~sg~l~~~~~~g~~~~~~~~d~~~~~~aG~l~--~~~~vy~D~y~~~d----y~ 430 (514) T protein:vir:56 357 NEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLG--GRFKVYIDQYAVND----YF 430 (514) T ss_pred HHHHhhcccccccEEEEchhHHHHHHhhhhhccccccCccccccccccCcceEEEEec--CceEEEecCCCCcc----eE Confidence 1 1 1 2468999999999988643322 1111111 1112224332 58899988887652 11 Q ss_pred ccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchhhhhhhhhhhhhcCceecc---ccEEE Q lcl|NC_015719. 261 PEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEYQADQIIAKYAMGHGGLRP---ESAGA 337 (344) Q Consensus 261 ~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~Rp---~~~~~ 337 (344) +. .|.| +.....||+|+|- +++.++ +..||+.|--.|-.+.|||-.+ .| +.+.. T Consensus 431 ~v-------------G~KG--~~~~~~glfyaPY----v~l~~~---~~~dp~sfqP~~g~~tRY~l~~-NPy~~~~~~~ 487 (514) T protein:vir:56 431 TV-------------GFKG--STEMDAGVFYSPY----VPLTPL---RGSDSKNFQPVIGFKTRYGVQV-NPFADPTASA 487 (514) T ss_pred EE-------------EEec--Ccceecceeeccc----cccccc---cccCCccccceeeeeeeeceee-CCCCCccccc Confidence 11 1112 1223367888885 333332 3469999999998899998654 33 11111 Q ss_pred EEecCCC Q lcl|NC_015719. 
338 LVFKAGA 344 (344) Q Consensus 338 l~~~~~a 344 (344) +.....- T Consensus 488 ~~~~~~~ 494 (514) T protein:vir:56 488 TKVGNGA 494 (514) T ss_pred cccCCcc Confidence 1110000 No 211 >protein:vir:96079 Length: 382 # NCBI annotation: hypothetical protein ORF023 # Family: family:all:1653 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294440;genbank:gi:149408337;genbank:GeneID:5237198 Probab=25.03 E-value=2.1 Score=18.81 Aligned_cols=299 Identities=11% Similarity=0.008 Sum_probs=121.9 Q ss_pred CC-CccccccccccccccccccchhhhhHHHHhhHHHHHHHHhhhhcCCceeee---cccccEEEEee---cCcceeeee Q lcl|NC_015719. 1 MA-NMQGGQQLGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQ---ISSGKSAQFPV---IGRTKAAYL 73 (344) Q Consensus 1 ma-~~~~~~~~~~~~g~~~~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~---i~~G~tv~i~~---iG~~t~~~~ 73 (344) |= +.+++ .|.++. + +.+.|++-|...+.+....--+...++...+ |. -+++.++. +|..+ -| T Consensus 63 mDa~~~~~---~t~~~~----g-~p~~~l~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~-~~t~ty~~~e~~G~A~--~y 131 (382) T protein:vir:96 63 MDSNFTAP---VTTPSI----P-TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWE-DQEIVQGIVEPAGTAV--EY 131 (382) T ss_pred cccccCCc---cccCCc----c-HHHHHHhhhhhhhhhhhhhhhhhhhhccccccCCcc-ceEEEEeeeecccceE--Ee Confidence 21 11111 122221 1 2445666676544333333333445555543 21 24555544 46654 34 Q ss_pred eCCCCCCCCcCCcccceEEEEeeeeeeeceeccchHHHH---hChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|NC_015719. 74 QPGESLDDKRKDIKHTEKTINIDGLLTADVLIYDIEDAM---NHYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADG 150 (344) Q Consensus 74 ~~g~~~~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q---~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~ 150 (344) ..+++.+-.....+-.++++..=+ ..+.+.++++.+ +.+|+-++-...+..+|.+..|+..+.-.. ... 
T Consensus 132 gd~~D~Pl~d~~~~~~~r~v~~~~---~g~~yg~lE~~rAa~~~~~l~~~Ka~aA~~ale~~~N~i~f~G~~-----~g~ 203 (382) T protein:vir:96 132 GDHTNIPLTSWNANFERRTIVRGE---LGLLVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQ-----SGL 203 (382) T ss_pred ecccCCCccccccceeEEEEEEEE---EeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHHhhceEEEEeee-----cCc Confidence 444444333223333334443222 335566777666 478888887777888888887776542110 000 Q ss_pred cccccccccCceeeeccccc-ccccchhhHHHHHHHHHHHHHHHhhcCC----CcC-CCEEEeCHHHHHHHhccchhhhh Q lcl|NC_015719. 151 VNENIAGLGKPSLLEVGAKA-DLTDPVKLGQAVIAQLTIARAALTKNYV----PAN-DRTFYTTPDVYSAILAALMPNAA 224 (344) Q Consensus 151 ~~~~~~~~~~~~~i~~~~~~-~~t~~~~~~~~i~~~l~~a~~~Ld~~~V----P~~-gR~~vv~P~~~~~Ll~~~~~~~~ 224 (344) . ....|+-..-.+....+. ...=..+..+.|+++|..+...|...-- |.. ...++|+|..|..|-... T Consensus 204 ~-~~~yGllNdP~l~a~~t~a~~~Wa~kT~~eI~~Di~~l~~~i~~qt~G~~~~~~~~~~L~LP~~~~~~Ls~~n----- 277 (382) T protein:vir:96 204 G-NRTYGFLNDPNLPPFQTPPSQGWATADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITMALATSKVDYLSVTT----- 277 (382) T ss_pred C-cceEEEEeCCCcccccccCCCCcccccHHHHHHHHHHHHHHHHhccCCeeeecccceEEeechHHHhhccccC----- Confidence 0 001111111111110000 0001223356788999999888877652 443 346889999998885321 Q ss_pred ccccccccccceeEEEeCeEEEEecccccccccccccccccccccccccccccccccc----ccceeEEEecHHHHhhhh Q lcl|NC_015719. 225 NYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVN----KENVVGLFQHRSAVGTVK 300 (344) Q Consensus 225 ~~~~~~~~~~G~Vg~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~gl~~~~~Av~~~~ 300 (344) .|+ ..+.+=.-.+.-+++|...+.|-.....+. +.......+..++. 
-...+...|...-- ++ T Consensus 278 ~~g--~Tvl~~lk~n~Pnl~i~t~peL~~a~~~g~---------g~~~~~~~~~~e~~~~~~~s~~~p~~f~q~~p--~~ 344 (382) T protein:vir:96 278 PYG--ISVSDWIEQTYPKMRIVSAPELSGVQMQGK---------TPEDALVLFVEEVDASVDGSTDGGSVFSQLVQ--SK 344 (382) T ss_pred ccC--ccHHHHHHHhcCCcEEEEccccccccCCCc---------cceeEEEEecchhhhhcccccccCcceecccc--ce Confidence 121 111110001233556666665532111000 00011111111110 00111122211000 00 Q ss_pred hheeeeeeeecchhhhhhhhhhhh-hcCceeccccEEEEEec Q lcl|NC_015719. 301 LKDLALERARRAEYQADQIIAKYA-MGHGGLRPESAGALVFK 341 (344) Q Consensus 301 ~~~~~~e~~~~~~~~~d~i~~~~~-~G~~v~Rp~~~~~l~~~ 341 (344) -+.+-+| + ..-+..+....+ .|.-+.||.+++-+.== T Consensus 345 ~~~l~ve--~--~~~~~~~~~s~~t~Gv~i~~P~ai~~~~GI 382 (382) T protein:vir:96 345 FITLGVE--K--RAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) T ss_pred eeeccce--e--ecceeEeccccceeeeEEEcchhhhhccCC Confidence 0000111 0 111122222222 55677777665543222 No 212 >protein:vir:106286 Length: 534 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944113;genbank:gi:38640157;genbank:GeneID:2658034 Probab=20.70 E-value=2.7 Score=18.20 Aligned_cols=305 Identities=15% Similarity=0.079 Sum_probs=129.3 Q ss_pred CCCcccc-----ccccccccccccccchhhhhHH-----HHhh--------------------------HHHHHHHHhhh Q lcl|NC_015719. 
1 MANMQGG-----QQLGTNQGKGQSAADKLALFLK-----VFGG--------------------------EVLTAFARTSV 44 (344) Q Consensus 1 ma~~~~~-----~~~~~~~g~~~~~~d~~~l~~e-----~f~g--------------------------eV~~~f~~~s~ 44 (344) |..=++- +.++++++.+ +-.-++|-| -|+| ++.+....++- T Consensus 125 MTgPTGLIFAMRsrY~n~~~~~---s~~EAf~ne~~adt~fSG~~~a~~~~~~~~~~a~~~g~~~~~~~~~~t~~~~Gt~ 201 (534) T protein:vir:10 125 MTSSTGQVFTLRAIYGGNSQDA---NAREAFHPTYGPDADFSGRGAAQDIAVFVRGTAVASGAFAKLHIEAATGVQAGTK 201 (534) T ss_pred CCchhhhheeeeeeecCCCCCc---ccccccccccccccccccccccccccccccccccccccccccccccccccccccc Confidence 2221110 0011111111 001112222 1111 11111111111 Q ss_pred hcCCceeeecc--------cccEE-------EEeecCcceeeeeeCCCCC---CCCcCCcccceEEEEeeeeeeec---- Q lcl|NC_015719. 45 TANRHMQRQIS--------SGKSA-------QFPVIGRTKAAYLQPGESL---DDKRKDIKHTEKTINIDGLLTAD---- 102 (344) Q Consensus 45 ~~~~~~~~~i~--------~G~tv-------~i~~iG~~t~~~~~~g~~~---~~~~~~~~~~~~~l~iD~~~~~~---- 102 (344) ....+....+- .|-.+ ....+|.... -..++.+ .++ ....-.|.-+.||+...-+ T Consensus 202 ~~~~~~~~~v~~~~~~~~~ag~~~~~~~~~~~~y~~~~gm~--Ta~AE~lg~~ggs-~~~~f~EMsFsIdKvtVtAKSRa 278 (534) T protein:vir:10 202 TVQFIKDYAVDALPADQTEAGLAYKWLLANGYAVETSSAMA--TAFAELQQGFNGS-ADNEWNEMSFRIDKQVVEAKSRQ 278 (534) T ss_pred ccccccccccccccCCccccccccccccccccceecccccc--hhhHhhhccCCCC-cccchhhcceEEEEEEEeeeccc Confidence 11100000000 00000 0000000000 0001111 011 0112345667787764432 Q ss_pred ----eeccchHHHHh--ChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccCceeeecccccccccch Q lcl|NC_015719. 103 ----VLIYDIEDAMN--HYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGLGKPSLLEVGAKADLTDPV 176 (344) Q Consensus 103 ----~~Idd~D~~q~--~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~~~~~~i~~~~~~~~t~~~ 176 (344) ..|.-..+.++ -.|.-.|++.=.+..++.++++-|++.|...+..-......+-+. ...+.++....+.. 
T Consensus 279 LKAEYTiELAQDLKAIHGLDAEtELsNILSTEImlEINReii~~l~~~a~~~k~~~~~~~~~-~~G~~d~~~~~~~~--- 354 (534) T protein:vir:10 279 LKAQYSIEMAQDLRAVHGLDADSELSSILANEIMHEINREMVLWINATAKVGKTGWTNMHGG-KAGVFDFQDTKDIR--- 354 (534) T ss_pred eeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhhhheeeccccccccc-ccceeeeecccccc--- Confidence 44555555566 478888999999999999999999998875433211110000001 11222332222211 Q ss_pred hhHHHHHHHHHHHHHHHhhcC--CC-----cCCCEEEeCHHHHHHHhccchhhhhcc-----ccc-cccccceeEEEe-C Q lcl|NC_015719. 177 KLGQAVIAQLTIARAALTKNY--VP-----ANDRTFYTTPDVYSAILAALMPNAANY-----AAL-IDPERGSIRNVM-G 242 (344) Q Consensus 177 ~~~~~i~~~l~~a~~~Ld~~~--VP-----~~gR~~vv~P~~~~~Ll~~~~~~~~~~-----~~~-~~~~~G~Vg~i~-G 242 (344) .+....+.+..+..++.+.. +- -.+-|+|++|++.+.|-....+..... ... +....=.+|.+. | T Consensus 355 -~~~~~~e~~~~L~~~i~~~an~i~~~T~rg~~n~~v~S~~Va~~L~~~g~l~~~~~~~~~~~~~~d~~~~~~~G~l~~~ 433 (534) T protein:vir:10 355 -GARWAGESYKALVVQIDKEANEIARQTGRGQGNFIICSRNVAAALGHTDMLMTPAVMGANTTMNTDTTSSLFAGVLAGK 433 (534) T ss_pred -chhHHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchhHHHHHhhccchhccccccccccccccCCCceEEEEecCc Confidence 12223344444444444331 21 135699999999999976655432111 111 111122456664 7 Q ss_pred eEEEEeccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchhhhhhhhhh Q lcl|NC_015719. 243 FEVVEVPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEYQADQIIAK 322 (344) Q Consensus 243 ~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~ 322 (344) ++||.-++.|.. +.+ ..|.| +.....||+|+|- ++..+ .+..||+.|--.|-.+ T Consensus 434 ~~vy~D~y~~~d----y~~-------------vG~KG--~~~~~~glfyaPY----v~l~~---~~~~dp~sfqP~~g~~ 487 (534) T protein:vir:10 434 YRVYIDQYAVED----YFT-------------VGYKG--ASEMDAGLYYCPY----VALTP---LRGTDPKNFQPVLGFK 487 (534) T ss_pred eEEEecCCCCcc----eEE-------------EEEeC--Ccccccceeeccc----ccccc---ccccCCccccceeeee Confidence 899988877652 111 11112 2223367888885 33333 3457999999999999 Q ss_pred hhhcCceeccccEEEEEecC------C----C Q lcl|NC_015719. 
323 YAMGHGGLRPESAGALVFKA------G----A 344 (344) Q Consensus 323 ~~~G~~v~Rp~~~~~l~~~~------~----a 344 (344) .|||-.+ .| .+....-++ + + T Consensus 488 tRY~l~~-NP-~~~~~~~~~~~~i~~g~~~~~ 517 (534) T protein:vir:10 488 TRYGVKL-HP-MADATQNKGFAKISNGMPQHT 517 (534) T ss_pred eeeceee-cC-cccccCCccccccccCCcchh Confidence 9998754 34 221111111 1 0 No 213 >protein:vir:6601 Length: 528 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891732;genbank:gi:33620668;genbank:GeneID:1725275 Probab=20.23 E-value=2.8 Score=18.12 Aligned_cols=314 Identities=11% Similarity=0.044 Sum_probs=128.4 Q ss_pred CCCcccc-----ccccccccccc------cccchhhhhHHHHhhHHHHHHHHhhhhcCCceeeecccccEEEEeecCcce Q lcl|NC_015719. 1 MANMQGG-----QQLGTNQGKGQ------SAADKLALFLKVFGGEVLTAFARTSVTANRHMQRQISSGKSAQFPVIGRTK 69 (344) Q Consensus 1 ma~~~~~-----~~~~~~~g~~~------~~~d~~~l~~e~f~geV~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t 69 (344) |..=++- ..+++++...+ ..-.++++|=|.-..+-......+.+|..+.+..+..+|+.+.++...... T Consensus 116 MTgPTGlIFAmRs~Y~~~~~~~~~~eAfh~~~g~ea~fsea~t~~a~~gGpTGliFAm~s~y~s~~~g~ea~~nea~t~f 195 (528) T protein:vir:66 116 MSTPTSQIFAIRSVYGGDPLKSGAREAFHPMYAPDAFHSSLAAKEATVGSPTGTAFAKLTLSQAITAGDIVYHTFAETGI 195 (528) T ss_pred CCchhhhheeeeeeecCCcccccccccccccccccccccccccccccccCCccceeecccccccccccceeeecccccce Confidence 2220000 00000010000 001222222221111100000112233333332233333333332111000 Q ss_pred e-------------------------------eeeeCCCC-----------CCCCcCCcccceEEEEeeeeeeec----- Q lcl|NC_015719. 70 A-------------------------------AYLQPGES-----------LDDKRKDIKHTEKTINIDGLLTAD----- 102 (344) Q Consensus 70 ~-------------------------------~~~~~g~~-----------~~~~~~~~~~~~~~l~iD~~~~~~----- 102 (344) . ..|.-+.. +.+. 
....-.+.-+.||+...-+ T Consensus 196 s~~~~~~~~~~~~~~~g~~~g~~~~~~~~a~~~~~~~~~Gm~Ta~aEale~lg~~-s~~~f~EMaFsIeK~tVtAKSRaL 274 (528) T protein:vir:66 196 AYLQNVTGDSVTPQKVGSESEDEVVMKLIEEGKLAEIAFGMATSIAEIQEGFNGS-SNNPWAEMSMRIDKQVVEAKSRQL 274 (528) T ss_pred eeeccccccccccCcccccccccccccccccccceecccccchhhhhhhcccCCC-cccchhhcceEEEeEEEEeeccce Confidence 0 00000100 0000 0112345667777764332 Q ss_pred ---eeccchHHHHh--ChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccCceeeeccccccccc--- Q lcl|NC_015719. 103 ---VLIYDIEDAMN--HYDVRSEYTSQIGESLAMAADGAVLAELAGLINLADGVNENIAGLGKPSLLEVGAKADLTD--- 174 (344) Q Consensus 103 ---~~Idd~D~~q~--~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~~~~~~~~i~~~~~~~~t~--- 174 (344) ..+.-..+.++ -.|.-.|++.=.+..+..++.+-|++.+-..+. .........-.....+.++....+... T Consensus 275 KAEYTiELAQDLKAIHGLDAEtELsNILStEImlEINREii~~i~~~a~-~~~~~~t~~~~~~aG~~dl~~~~d~~g~rw 353 (528) T protein:vir:66 275 KARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQ-VGKTGMTQTVGSKAGVFDLQDPIDTRGARW 353 (528) T ss_pred eccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhheee-eeeeeeeeccccccceeecccccccccchh Confidence 34444445555 378888888888888999999999865532211 111000000000111222211111100 Q ss_pred chhhHHHHHHHHHHHHHHHhhcCCCcCCCEEEeCHHHHHHHhccchhhhhcc------ccccccccceeEEEe-CeEEEE Q lcl|NC_015719. 175 PVKLGQAVIAQLTIARAALTKNYVPANDRTFYTTPDVYSAILAALMPNAANY------AALIDPERGSIRNVM-GFEVVE 247 (344) Q Consensus 175 ~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~------~~~~~~~~G~Vg~i~-G~~V~~ 247 (344) ....++.++-.|.++....-.+---..+-|+|++|++.+.|-.......... ...+....=.+|.+. |++||. 
T Consensus 354 ~~e~~k~L~~~i~~~an~I~~~T~r~~gn~vi~S~~Va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~ 433 (528) T protein:vir:66 354 AGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAKGLNTDTTKAVFAGVLAGKYKVFI 433 (528) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhccccccccccccccccccCCCCceeEEEecCceEEEe Confidence 0112222332233222222222222345799999999998866543221111 111222222356664 789998 Q ss_pred eccccccccccccccccccccccccccccccccccccceeEEEecHHHHhhhhhheeeeeeeecchhhhhhhhhhhhhcC Q lcl|NC_015719. 248 VPHLTAGGAGDDRPEEGTDASNQKHAFPATGGKVNKENVVGLFQHRSAVGTVKLKDLALERARRAEYQADQIIAKYAMGH 327 (344) Q Consensus 248 sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gl~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~G~ 327 (344) -++.|..- .+ ..|.|. .....||+|+|-. + +.+.+..||+.|--.|-.+.|||- T Consensus 434 D~y~~~dy----~~-------------vG~KG~--~~~~~glfyaPYv----~---l~~~~~~dp~sfqP~~g~~tRY~l 487 (528) T protein:vir:66 434 DQYARQDY----FT-------------VGYKGD--NEMDAGIYYAPYV----A---LTPLRATDPQSFHPVLGFKTRYGI 487 (528) T ss_pred cCCCCcce----EE-------------EEEeCC--cccccceeecccc----c---ceeeEeeCCccccceeeeeeeece Confidence 88766421 11 111121 2233678888852 2 345567899999999999999987 Q ss_pred ceeccccEEEEEecCCC Q lcl|NC_015719. 328 GGLRPESAGALVFKAGA 344 (344) Q Consensus 328 ~v~Rp~~~~~l~~~~~a 344 (344) .+ .| ......-...| T Consensus 488 ~v-NP-~~~~~~~~~~~ 502 (528) T protein:vir:66 488 GI-NP-FADSKSQEPSA 502 (528) T ss_pred ee-cC-cccccCccccc Confidence 54 45 22222111122 Done!