Query lcl|NC_021299.1_cdsid_YP_008050950.1 [gene=M052_gp018] [protein=capsid protein] [protein_id=YP_008050950.1] [location=9853..11016] Match_columns 387 No_of_seqs 241 out of 1836 Neff 10.3 Searched_HMMs 1612 Date Thu Nov 7 16:24:51 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_18 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_18_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:99075 Length: 392 100.0 1.2E-81 7.3E-85 464.3 34.0 384 1-386 1-392 (392) 2 protein:vir:3525 Length: 423 # 100.0 5.2E-50 3.2E-53 290.8 24.4 356 1-387 1-371 (423) 3 protein:vir:174 Length: 423 # 100.0 6.1E-50 3.8E-53 290.4 24.4 369 1-387 1-404 (423) 4 protein:vir:108303 Length: 418 100.0 1.8E-49 1.1E-52 287.9 26.5 356 1-387 1-399 (418) 5 protein:vir:105374 Length: 423 100.0 5.2E-49 3.2E-52 285.3 25.1 373 1-387 1-406 (423) 6 protein:vir:105522 Length: 423 100.0 9E-49 5.6E-52 284.0 24.6 346 1-387 1-404 (423) 7 protein:vir:102605 Length: 273 100.0 9.9E-45 6.1E-48 261.8 23.1 267 1-297 1-273 (273) 8 protein:vir:105822 Length: 273 100.0 9.9E-45 6.1E-48 261.8 23.1 267 1-297 1-273 (273) 9 protein:vir:7990 Length: 273 # 100.0 1.2E-44 7.3E-48 261.4 22.6 267 1-297 1-273 (273) 10 protein:vir:94622 Length: 341 100.0 1.2E-42 7.6E-46 250.4 21.4 288 1-318 3-341 (341) 11 protein:vir:80180 Length: 381 100.0 8.2E-39 5.1E-42 229.4 21.7 328 1-359 15-381 (381) 12 protein:vir:3136 Length: 322 # 100.0 1.3E-37 8.4E-41 222.7 12.9 281 1-318 1-322 (322) 13 protein:vir:1239 Length: 274 # 100.0 6.6E-34 4.1E-37 202.5 21.8 266 1-320 1-274 (274) 14 protein:vir:96262 Length: 274 100.0 1.7E-33 1.1E-36 200.2 21.8 266 1-320 1-274 (274) 15 protein:vir:95898 Length: 274 100.0 1.7E-33 1.1E-36 200.2 21.8 266 1-320 1-274 (274) 16 protein:vir:80930 Length: 278 100.0 1.5E-33 9.1E-37 200.6 21.0 267 1-298 1-278 (278) 17 protein:vir:94494 Length: 274 100.0 2.9E-33 1.8E-36 198.9 21.5 266 1-320 1-274 (274) 18 protein:vir:97433 Length: 274 100.0 2.9E-33 1.8E-36 198.9 21.5 266 1-320 1-274 (274) 19 protein:vir:93742 Length: 274 100.0 3.8E-33 2.4E-36 198.3 21.2 265 1-320 1-274 (274) 20 protein:vir:96123 Length: 274 100.0 1.3E-32 8.3E-36 195.3 21.6 264 1-300 1-274 (274) 21 protein:vir:96833 Length: 275 100.0 1.1E-32 6.7E-36 195.8 20.7 266 1-320 3-275 (275) 22 protein:vir:100939 Length: 430 100.0 1.5E-31 9.4E-35 189.5 18.6 287 1-324 1-430 (430) 23 protein:vir:9265 Length: 430 # 100.0 1.5E-31 9.4E-35 189.5 18.6 287 1-324 1-430 (430) 24 protein:vir:3613 Length: 272 # 100.0 4.1E-31 2.6E-34 187.2 20.7 262 1-297 1-272 (272) 25 protein:vir:78739 Length: 332 100.0 8.4E-32 5.2E-35 190.9 16.9 282 1-295 7-332 (332) 26 protein:vir:105334 Length: 276 100.0 6E-31 3.7E-34 186.3 21.0 268 1-322 1-276 (276) 27 protein:vir:1541 Length: 347 # 100.0 3E-31 1.8E-34 188.0 18.1 288 1-311 1-347 (347) 28 protein:vir:3364 Length: 347 # 100.0 2.1E-31 1.3E-34 188.8 16.6 285 1-311 1-347 (347) 29 protein:vir:10450 Length: 344 100.0 4.1E-31 2.6E-34 187.2 17.1 284 1-309 1-344 (344) 30 protein:vir:94711 Length: 347 99.9 2.6E-30 1.6E-33 182.7 13.8 284 1-310 1-347 (347) 31 protein:vir:2106 Length: 430 # 99.9 1.4E-28 9E-32 173.2 23.0 370 1-387 1-411 (430) 32 protein:vir:8885 Length: 347 # 99.9 1.7E-29 1.1E-32 178.3 16.5 286 1-315 1-347 (347) 33 protein:vir:100057 Length: 375 99.9 1.7E-28 1.1E-31 172.8 19.1 312 1-338 9-375 (375) 34 protein:vir:2201 Length: 345 # 99.9 1.3E-28 8.4E-32 173.4 17.3 279 1-307 1-345 (345) 35 protein:vir:80213 Length: 334 99.9 1.5E-28 9.2E-32 173.1 17.4 287 1-309 1-334 (334) 36 protein:vir:94576 Length: 347 99.9 1.9E-28 1.2E-31 172.6 17.3 285 1-309 1-347 (347) 37 protein:vir:3033 Length: 272 # 99.9 1.9E-27 1.2E-30 167.0 21.1 262 1-298 1-272 (272) 38 protein:vir:9820 Length: 272 # 99.9 1.9E-27 1.2E-30 167.0 21.1 262 1-298 1-272 (272) 39 protein:vir:79008 Length: 299 99.9 1.3E-26 7.9E-30 162.5 21.3 285 1-311 1-299 (299) 40 protein:vir:95107 Length: 270 99.9 1.3E-25 7.8E-29 157.1 19.2 265 1-320 1-270 (270) 41 protein:vir:103323 Length: 364 99.9 1.7E-24 1E-27 150.9 21.2 311 1-355 1-364 (364) 42 protein:vir:6324 Length: 335 # 99.9 4.3E-25 2.7E-28 154.1 17.5 292 1-330 1-335 (335) 43 protein:vir:78935 Length: 335 99.9 5.4E-25 3.3E-28 153.6 17.5 291 1-315 1-335 (335) 44 protein:vir:107120 Length: 329 99.9 4.7E-24 2.9E-27 148.5 21.3 288 1-328 36-329 (329) 45 protein:vir:99675 Length: 324 99.9 3.7E-25 2.3E-28 154.5 14.9 282 28-356 1-324 (324) 46 protein:vir:94800 Length: 319 99.9 4.7E-24 2.9E-27 148.5 20.3 289 1-320 25-319 (319) 47 protein:vir:97331 Length: 319 99.9 4.7E-24 2.9E-27 148.5 20.3 289 1-320 25-319 (319) 48 protein:vir:78920 Length: 290 99.9 6.7E-24 4.2E-27 147.6 19.8 272 1-308 1-290 (290) 49 protein:vir:97031 Length: 402 99.8 2.1E-22 1.3E-25 139.5 18.1 334 1-367 1-402 (402) 50 protein:vir:739 Length: 231 # 99.8 1.1E-21 7E-25 135.4 17.5 231 34-297 1-231 (231) 51 protein:vir:102655 Length: 322 99.8 2.4E-21 1.5E-24 133.7 17.7 289 1-313 13-322 (322) 52 protein:vir:7019 Length: 401 # 99.8 9.6E-22 6E-25 135.8 14.8 345 1-368 1-401 (401) 53 protein:vir:105464 Length: 346 99.8 6.7E-20 4.2E-23 125.7 18.5 305 1-343 1-346 (346) 54 protein:vir:102335 Length: 312 99.7 4E-19 2.5E-22 121.5 20.0 282 1-319 1-312 (312) 55 protein:vir:1781 Length: 221 # 99.7 8.6E-20 5.4E-23 125.1 13.4 202 83-322 1-221 (221) 56 protein:vir:79712 Length: 285 99.7 2.1E-18 1.3E-21 117.5 18.6 269 1-292 1-285 (285) 57 protein:vir:118 Length: 449 # 99.7 4.2E-17 2.6E-20 110.4 22.2 343 1-387 52-432 (449) 58 protein:vir:105645 Length: 400 99.6 4.1E-17 2.5E-20 110.4 18.8 328 1-341 1-400 (400) 59 protein:vir:99523 Length: 311 99.6 5.5E-16 3.4E-19 104.3 18.1 268 1-309 8-311 (311) 60 protein:vir:5974 Length: 324 # 99.5 5.3E-15 3.3E-18 98.8 20.5 297 1-350 1-324 (324) 61 protein:vir:102944 Length: 330 99.5 8.7E-15 5.4E-18 97.7 19.3 300 1-350 1-330 (330) 62 protein:vir:78090 Length: 302 99.5 7.1E-15 4.4E-18 98.1 18.5 273 1-300 1-302 (302) 63 protein:vir:1583 Length: 351 # 99.4 3.2E-14 2E-17 94.5 19.4 315 1-340 1-351 (351) 64 protein:vir:95451 Length: 313 99.3 1.2E-13 7.2E-17 91.5 12.8 278 1-298 4-313 (313) 65 protein:vir:5202 Length: 448 # 99.2 5.3E-12 3.3E-15 82.4 17.8 340 1-387 52-439 (448) 66 protein:vir:9927 Length: 295 # 98.8 1.4E-09 8.6E-13 69.1 15.6 275 1-322 1-295 (295) 67 protein:vir:80446 Length: 367 98.6 5.3E-09 3.3E-12 66.0 15.6 295 1-310 1-367 (367) 68 protein:vir:106647 Length: 303 98.5 1.4E-08 8.7E-12 63.6 14.4 275 1-316 1-303 (303) 69 protein:vir:108211 Length: 318 98.5 1.6E-08 1E-11 63.3 14.7 273 1-290 22-318 (318) 70 protein:vir:41 Length: 299 # N 98.4 1.1E-07 6.8E-11 58.7 17.8 271 1-310 1-299 (299) 71 protein:vir:9759 Length: 303 # 98.4 8.1E-08 5.1E-11 59.4 16.4 279 1-306 1-303 (303) 72 protein:vir:99749 Length: 324 98.4 2.6E-07 1.6E-10 56.7 18.2 279 1-342 30-324 (324) 73 protein:vir:96223 Length: 324 98.4 2.5E-07 1.6E-10 56.7 18.0 279 1-342 30-324 (324) 74 protein:vir:9875 Length: 296 # 98.3 4.1E-08 2.5E-11 61.1 13.4 266 1-322 1-296 (296) 75 protein:vir:9309 Length: 324 # 98.3 3.4E-07 2.1E-10 56.0 18.3 278 1-318 30-324 (324) 76 protein:vir:94142 Length: 304 98.3 5.1E-07 3.2E-10 55.1 18.4 272 1-310 1-304 (304) 77 protein:vir:105905 Length: 304 98.3 5.1E-07 3.2E-10 55.1 18.4 272 1-310 1-304 (304) 78 protein:vir:97148 Length: 324 98.3 4.8E-07 3E-10 55.2 18.3 276 1-342 31-324 (324) 79 protein:vir:78387 Length: 349 98.3 7.9E-07 4.9E-10 54.0 19.0 309 1-350 1-349 (349) 80 protein:vir:103955 Length: 324 98.3 6.6E-07 4.1E-10 54.5 18.2 274 1-342 30-324 (324) 81 protein:vir:78830 Length: 324 98.2 7.5E-07 4.7E-10 54.2 18.3 274 1-342 30-324 (324) 82 protein:vir:96392 Length: 324 98.2 7.5E-07 4.7E-10 54.2 18.3 274 1-342 30-324 (324) 83 protein:vir:100135 Length: 418 98.2 5.3E-07 3.3E-10 55.0 17.3 262 1-321 136-418 (418) 84 protein:vir:80684 Length: 315 98.2 6.8E-07 4.2E-10 54.4 17.3 291 1-322 1-315 (315) 85 protein:vir:1383 Length: 421 # 98.2 1.7E-06 1E-09 52.3 18.8 298 1-349 116-421 (421) 86 protein:vir:78223 Length: 333 98.2 1.6E-06 9.8E-10 52.4 18.6 287 1-334 20-333 (333) 87 protein:vir:94771 Length: 298 98.2 9.6E-07 5.9E-10 53.6 17.3 273 1-305 1-298 (298) 88 protein:vir:94989 Length: 349 98.1 3.1E-06 1.9E-09 50.8 19.5 309 1-350 1-349 (349) 89 protein:vir:1886 Length: 385 # 98.1 1.8E-06 1.1E-09 52.1 18.0 260 1-310 105-385 (385) 90 protein:vir:191 Length: 385 # 98.1 1.8E-06 1.1E-09 52.1 18.0 260 1-310 105-385 (385) 91 protein:vir:4339 Length: 395 # 98.1 2.1E-06 1.3E-09 51.7 17.9 259 1-309 117-395 (395) 92 protein:vir:94673 Length: 419 98.1 2.4E-06 1.5E-09 51.4 18.1 266 1-314 130-419 (419) 93 protein:vir:78523 Length: 338 98.0 2.9E-06 1.8E-09 51.0 17.9 291 1-309 10-338 (338) 94 protein:vir:95763 Length: 297 98.0 3.7E-06 2.3E-09 50.4 18.4 265 1-304 9-297 (297) 95 protein:vir:9410 Length: 415 # 98.0 4.1E-06 2.6E-09 50.1 18.4 278 1-322 127-415 (415) 96 protein:vir:79987 Length: 415 98.0 6.2E-06 3.9E-09 49.1 19.1 278 1-322 127-415 (415) 97 protein:vir:98339 Length: 415 98.0 6.2E-06 3.9E-09 49.1 19.1 278 1-322 127-415 (415) 98 protein:vir:81100 Length: 415 98.0 6.2E-06 3.9E-09 49.1 19.1 278 1-322 127-415 (415) 99 protein:vir:81227 Length: 413 98.0 4E-06 2.5E-09 50.2 17.8 270 1-315 118-413 (413) 100 protein:vir:2344 Length: 397 # 98.0 8.6E-06 5.3E-09 48.4 22.6 342 1-387 10-395 (397) 101 protein:vir:7771 Length: 330 # 98.0 5.8E-06 3.6E-09 49.3 18.3 282 1-316 1-330 (330) 102 protein:vir:1638 Length: 298 # 98.0 2.9E-06 1.8E-09 50.9 16.5 273 1-305 1-298 (298) 103 protein:vir:97053 Length: 390 98.0 4.7E-06 2.9E-09 49.8 17.5 257 1-307 113-390 (390) 104 protein:vir:4700 Length: 415 # 97.9 7.7E-06 4.8E-09 48.6 18.5 277 1-322 127-415 (415) 105 protein:vir:4600 Length: 415 # 97.9 7.7E-06 4.8E-09 48.6 18.5 277 1-322 127-415 (415) 106 protein:vir:9574 Length: 300 # 97.9 5.6E-06 3.5E-09 49.4 17.6 272 1-306 1-300 (300) 107 protein:vir:6242 Length: 390 # 97.9 5.3E-06 3.3E-09 49.5 16.4 260 1-310 116-390 (390) 108 protein:vir:81070 Length: 390 97.9 9.6E-06 6E-09 48.1 17.6 257 1-307 113-390 (390) 109 protein:vir:101607 Length: 379 97.8 1E-05 6.5E-09 47.9 17.6 267 1-309 109-379 (379) 110 protein:vir:8187 Length: 311 # 97.8 1.5E-05 9.2E-09 47.1 18.3 284 1-312 1-311 (311) 111 protein:vir:6212 Length: 434 # 97.8 5.3E-06 3.3E-09 49.5 15.8 278 1-316 143-434 (434) 112 protein:vir:104256 Length: 458 97.8 1.7E-05 1E-08 46.7 18.8 271 1-309 165-458 (458) 113 protein:vir:4511 Length: 409 # 97.8 1.7E-05 1.1E-08 46.7 18.2 266 1-315 117-409 (409) 114 protein:vir:104085 Length: 320 97.8 1.5E-05 9.6E-09 47.0 17.6 274 1-309 14-320 (320) 115 protein:vir:1328 Length: 392 # 97.8 1.5E-05 9.4E-09 47.0 17.4 261 1-308 114-392 (392) 116 protein:vir:4830 Length: 397 # 97.8 9.2E-06 5.7E-09 48.2 16.1 268 1-320 111-397 (397) 117 protein:vir:93616 Length: 645 97.7 2.8E-05 1.7E-08 45.5 19.4 288 1-324 344-645 (645) 118 protein:vir:10364 Length: 390 97.7 2.8E-05 1.8E-08 45.5 18.1 256 1-307 114-390 (390) 119 protein:vir:100172 Length: 394 97.7 3.1E-05 1.9E-08 45.3 18.6 269 1-322 111-394 (394) 120 protein:vir:2430 Length: 318 # 97.6 4.7E-05 2.9E-08 44.3 19.2 266 1-323 14-318 (318) 121 protein:vir:99920 Length: 311 97.5 4.8E-05 3E-08 44.3 17.5 284 1-311 1-311 (311) 122 protein:vir:96762 Length: 632 97.5 3.5E-05 2.2E-08 45.0 16.5 257 1-311 357-632 (632) 123 protein:vir:8102 Length: 543 # 97.5 5.1E-05 3.2E-08 44.1 17.2 273 1-310 251-543 (543) 124 protein:vir:4856 Length: 293 # 97.5 5.6E-05 3.4E-08 43.9 17.4 270 1-328 5-293 (293) 125 protein:vir:4953 Length: 397 # 97.5 5.1E-05 3.2E-08 44.1 16.7 269 1-328 109-397 (397) 126 protein:vir:4226 Length: 326 # 97.4 6.9E-05 4.3E-08 43.4 18.0 282 1-316 22-326 (326) 127 protein:vir:4997 Length: 397 # 97.4 7E-05 4.3E-08 43.4 16.8 271 1-322 109-397 (397) 128 protein:vir:95376 Length: 425 97.4 8.2E-05 5.1E-08 43.0 17.9 266 1-316 144-425 (425) 129 protein:vir:7409 Length: 408 # 97.4 8.3E-05 5.1E-08 43.0 17.2 280 1-331 116-408 (408) 130 protein:vir:100247 Length: 425 97.4 7.1E-05 4.4E-08 43.3 16.2 264 1-310 130-425 (425) 131 protein:vir:8420 Length: 477 # 97.3 7E-05 4.3E-08 43.4 15.4 283 1-319 163-477 (477) 132 protein:vir:4456 Length: 401 # 97.3 0.00011 6.9E-08 42.3 16.4 265 1-309 107-401 (401) 133 protein:vir:1433 Length: 435 # 97.3 0.00012 7.1E-08 42.2 18.1 279 1-320 130-435 (435) 134 protein:vir:5739 Length: 366 # 97.2 0.00015 9.1E-08 41.6 18.2 264 1-311 64-366 (366) 135 protein:vir:485 Length: 407 # 97.1 0.00016 9.9E-08 41.4 17.9 271 1-318 106-407 (407) 136 protein:vir:80376 Length: 435 97.1 0.00018 1.1E-07 41.1 18.5 277 1-320 138-435 (435) 137 protein:vir:3991 Length: 404 # 97.1 0.00019 1.2E-07 41.0 17.4 278 1-322 116-404 (404) 138 protein:vir:102119 Length: 404 97.0 0.00016 9.8E-08 41.4 15.1 272 1-316 110-404 (404) 139 protein:vir:1025 Length: 408 # 97.0 0.00021 1.3E-07 40.7 17.2 273 1-342 121-408 (408) 140 protein:vir:3870 Length: 400 # 97.0 0.00022 1.4E-07 40.6 16.4 259 1-310 140-400 (400) 141 protein:vir:100884 Length: 389 97.0 0.00023 1.4E-07 40.5 18.4 272 1-322 109-389 (389) 142 protein:vir:105038 Length: 428 96.7 0.00043 2.7E-07 39.0 18.1 275 1-306 125-428 (428) 143 protein:vir:3845 Length: 395 # 96.6 0.00046 2.9E-07 38.9 17.5 278 1-322 105-395 (395) 144 protein:vir:81160 Length: 371 96.5 0.00058 3.6E-07 38.3 17.4 271 1-309 91-371 (371) 145 protein:vir:1268 Length: 397 # 96.4 0.00063 3.9E-07 38.1 16.4 265 1-307 123-397 (397) 146 protein:vir:95875 Length: 401 96.0 0.0012 7.5E-07 36.6 16.3 298 1-316 19-401 (401) 147 protein:vir:2504 Length: 305 # 95.8 0.0014 8.5E-07 36.3 18.9 272 1-317 1-305 (305) 148 protein:vir:96792 Length: 315 95.8 0.0014 8.6E-07 36.2 17.3 292 1-332 1-315 (315) 149 protein:vir:3158 Length: 321 # 94.9 0.0033 2.1E-06 34.2 15.1 271 1-316 24-321 (321) 150 protein:vir:95131 Length: 325 94.8 0.0034 2.1E-06 34.1 18.3 293 1-347 1-325 (325) 151 protein:vir:9704 Length: 394 # 94.7 0.0037 2.3E-06 33.9 17.6 256 1-318 133-394 (394) 152 protein:vir:93696 Length: 364 94.1 0.0053 3.3E-06 33.1 19.1 296 1-308 1-364 (364) 153 protein:vir:107593 Length: 392 94.1 0.0053 3.3E-06 33.1 17.3 276 1-321 106-392 (392) 154 protein:vir:102873 Length: 392 94.1 0.0053 3.3E-06 33.1 17.3 276 1-321 106-392 (392) 155 protein:vir:105004 Length: 392 94.1 0.0053 3.3E-06 33.1 17.3 276 1-321 106-392 (392) 156 protein:vir:102082 Length: 392 94.1 0.0053 3.3E-06 33.1 17.3 276 1-321 106-392 (392) 157 protein:vir:94424 Length: 387 94.1 0.0053 3.3E-06 33.0 14.3 255 1-314 118-387 (387) 158 protein:vir:2685 Length: 387 # 94.1 0.0053 3.3E-06 33.0 14.3 255 1-314 118-387 (387) 159 protein:vir:96978 Length: 387 94.1 0.0053 3.3E-06 33.0 14.3 255 1-314 118-387 (387) 160 protein:vir:78640 Length: 352 93.7 0.0067 4.1E-06 32.5 17.1 256 1-320 83-352 (352) 161 protein:vir:4197 Length: 314 # 93.6 0.007 4.3E-06 32.4 18.2 267 1-319 19-314 (314) 162 protein:vir:93881 Length: 387 93.0 0.0092 5.7E-06 31.7 15.3 256 1-316 118-387 (387) 163 protein:vir:9361 Length: 402 # 92.6 0.011 6.7E-06 31.4 14.5 254 1-314 133-402 (402) 164 protein:vir:79928 Length: 393 92.5 0.011 6.9E-06 31.3 12.4 280 1-322 74-393 (393) 165 protein:vir:80128 Length: 466 91.5 0.016 9.7E-06 30.5 15.9 278 1-342 155-466 (466) 166 protein:vir:4092 Length: 390 # 91.0 0.018 1.1E-05 30.2 18.2 280 1-340 87-390 (390) 167 protein:vir:101650 Length: 497 90.0 0.023 1.4E-05 29.5 17.6 267 1-317 151-497 (497) 168 protein:vir:7855 Length: 497 # 90.0 0.023 1.4E-05 29.5 17.6 267 1-317 151-497 (497) 169 protein:vir:1084 Length: 437 # 89.1 0.029 1.8E-05 29.1 16.6 270 1-318 156-437 (437) 170 protein:vir:962 Length: 397 # 88.7 0.031 1.9E-05 28.9 15.2 258 1-309 138-397 (397) 171 protein:vir:9643 Length: 377 # 78.4 0.11 7.1E-05 25.7 15.4 252 1-307 82-377 (377) 172 protein:vir:2770 Length: 318 # 78.1 0.12 7.3E-05 25.7 17.2 226 1-251 22-318 (318) 173 protein:vir:4159 Length: 315 # 74.7 0.16 9.6E-05 25.0 17.2 262 1-304 19-315 (315) 174 protein:vir:10123 Length: 404 72.8 0.18 0.00011 24.7 17.4 299 1-318 22-404 (404) 175 protein:vir:104439 Length: 404 72.8 0.18 0.00011 24.7 17.4 299 1-318 22-404 (404) 176 protein:vir:3298 Length: 404 # 72.8 0.18 0.00011 24.7 17.4 299 1-318 22-404 (404) 177 protein:vir:819 Length: 404 # 72.8 0.18 0.00011 24.7 17.4 299 1-318 22-404 (404) 178 protein:vir:105610 Length: 430 67.5 0.25 0.00016 23.9 17.2 305 1-340 1-430 (430) 179 protein:vir:101291 Length: 381 66.1 0.27 0.00017 23.7 16.5 263 1-318 76-381 (381) 180 protein:vir:9509 Length: 381 # 66.1 0.27 0.00017 23.7 16.5 263 1-318 76-381 (381) 181 protein:vir:8324 Length: 410 # 62.7 0.33 0.00021 23.2 12.5 255 1-281 136-410 (410) 182 protein:vir:95963 Length: 395 53.2 0.54 0.00033 22.1 16.6 270 1-330 91-395 (395) 183 protein:vir:100632 Length: 381 40.7 0.96 0.00059 20.7 16.9 267 1-324 80-381 (381) 184 protein:vir:98635 Length: 377 25.1 2.1 0.0013 18.8 14.8 252 1-319 79-377 (377) No 1 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=100.00 E-value=1.2e-81 Score=464.28 Aligned_cols=384 Identities=45% Similarity=0.703 Sum_probs=314.6 Q ss_pred CccccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccccceEE Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEVTVD 80 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (387) |||++|+||+|+++++++|+++|+|++|+||||++||.+++||||+||+|+...++++...+..++.++.++++.++.++ T Consensus 1 Ma~~~~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (392) T protein:vir:99 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTEDSFP 80 (392) T ss_pred CccccccHHHHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeecccccceeeeccccccCCcccccccccceEE Confidence 99999999999999999999999999999999999999999999999999999999999888888899999999999999 Q ss_pred EEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc---CCcchhHHHHHHHHHHHh Q lcl|NC_021299. 81 IKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSL---VDEDEIWNGVVSNRRWLN 157 (387) Q Consensus 81 ~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~---~~~~~~~~~i~~a~~~l~ 157 (387) ++||++++++|.++|+|+.+.+.|+++++++|++++||+++|.++++++.+++...... .++.+.|+.|++++++|+ T Consensus 81 ~~id~~k~~~~~i~d~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~~~a~~~~~~~~~~~~~~~~~~~i~~a~~~L~ 160 (392) T protein:vir:99 81 VTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVNGARRALN 160 (392) T ss_pred EEEeeeeecceeechHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccChhhhHHHHHHHHHHHh Confidence 99999999999999999999999999999999999999999999999998876655443 345678999999999999 Q ss_pred hccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeeecccccccccccc Q lcl|NC_021299. 158 EQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTAYAMLTRSP 237 (387) Q Consensus 158 ~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a~~~~~~~~ 237 (387) ++++|. +|+++++|++++.|+++++|.+.+..++.+...+++|.+|+++||+|++++++|....+.+|.+++.+..+.+ T Consensus 161 ~~~vP~-~R~~vv~p~~~~~l~~~~~~~~~~~~g~~~~~~l~~G~vg~i~G~~v~~s~~~~~~t~~a~~~~a~~~at~a~ 239 (392) T protein:vir:99 161 ELYIPQ-GRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTAFIMATRAP 239 (392) T ss_pred hcCCCC-CCEEEEcHHHHHHHhcccceeecccccchhhhhhhcceeeeeeeeEEEeecccccccceeeeccccccccccc Confidence 999996 8999999999999999999999999998877889999999999999999999999988899999888888777 Q ss_pred ccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccce-eccccccceeeeeeeecccccc Q lcl|NC_021299. 238 GRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDD-EPRFVRGTRIHLKATDAEIEGE 316 (387) Q Consensus 238 ~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~-~~~~v~~~~v~~~~~~~~~~~~ 316 (387) ..+.+........ ........|..+++.....+........|.............. .........+.+.+........ T Consensus 240 v~~~~~~~~~s~s-~~~~v~~~~~~~~~~t~~s~~~~v~~~~g~~~v~~~~~~~~~~~~~~~~~~~~v~v~~v~~~~~~~ 318 (392) T protein:vir:99 240 APPMGAVRSTAIS-GDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPEAGANATI 318 (392) T ss_pred cccccccceeEEe-cccceecceeecccceeeccccccceeEEEEEEeeccccceeeeeeeeeecceeeeeeeeccccee Confidence 6666655433222 2223344567777766666555555544443322211111111 1111122233333444444455 Q ss_pred ccccccceeEEEeeccCCccccCcceEEEecCceEEEEcCCceEEEEecceEEEEEEE----CCEEEEEEEEEe Q lcl|NC_021299. 317 TVKAGEKLALALEDSNGDNRAGDPLVTWTSGTTAKATIDANGVVTGVAAGTSEITAVV----DGLTVKKTITVT 386 (387) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~v~w~Ssn~~VAtVd~~G~VTa~~~Gta~Itat~----~~~~~~~~vtVt 386 (387) ++..+++.++++++.+.+.....+.++|+||||+|||||++|+|||+++|+++|||++ ++++++|+|+|- T Consensus 319 ~~~~~~~~~~~~t~~~~~~~~~~~~vtw~Ssn~~vAtV~~~G~Vt~v~~G~atITa~~~~~~~~~t~t~~vtV~ 392 (392) T protein:vir:99 319 TAAAGEDHTVQLKVTDANGDDVTALCDFESSATDKATVAAGGLVTGVAAGTSTVTATLVTPSGDREDTIVITVV 392 (392) T ss_pred EeeeccceeEEEEEEecCCccccceEEEEEcCCeeEEEcCCceEEEEecceEEEEEEEEcCCCcEEEEEEEEeC Confidence 6667777777777766666666788999999999999999999999999999999997 568999999999 No 2 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=100.00 E-value=5.2e-50 Score=290.77 Aligned_cols=356 Identities=10% Similarity=0.088 Sum_probs=223.7 Q ss_pred Ccccccc--HHHHHHHHHHHHHhhccccceeeeccccccc-ccCCCEEEEEecccceeeceecccccccccccccccccc Q lcl|NC_021299. 1 MANAFIK--PPVIIASILGQLQHELVLPNFVFKNGYGDVA-HKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEV 77 (387) Q Consensus 1 Ma~~~~~--pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~-~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (387) |||++++ ||+|++++|+.|+++|+|+++|||+|++||. ++.||||+||+|+..++.++.. +....+.++++.+. T Consensus 1 MAN~llT~iP~iia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p~~~~v~d~~~---~~~~~~~~~~~~e~ 77 (423) T protein:vir:35 1 MANNLESNISQIVLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRPHQFKSERTET---GDITGKDKNGLFSA 77 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeCCcceeecccC---cCCCCccccccccc Confidence 9999965 9999999999999999999999999999996 5789999999999999888742 22455778999999 Q ss_pred eEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHH-hcccccccccCCcchhHHHHHHHHHHH Q lcl|NC_021299. 78 TVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLI-TKAPYEKVSLVDEDEIWNGVVSNRRWL 156 (387) Q Consensus 78 ~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~-~~~~~~~~~~~~~~~~~~~i~~a~~~l 156 (387) +++++||++||++|.++|+|+.+++.++ ++++++++++|++++|.++++.+ ..+++..++.++....|+.|++++++| T Consensus 78 ~v~l~id~~k~~a~~v~d~e~~l~i~~~-~~~l~~a~~ala~~vd~~l~~~l~~~a~~~vgt~~t~~~~~~~i~~a~~~L 156 (423) T protein:vir:35 78 KATGKVGKYITVAVEWTQIEEALKLNQL-DQILSPIHERMVTDLETELAHFMMNNGALSLGSPNTAIKKWADVAQTASFI 156 (423) T ss_pred eeeEEeccceeccceeCHHHHHhhHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCCcchHHHHHHHHHHH Confidence 9999999999999999999999999999 68899999999999999999754 557777777778788899999999999 Q ss_pred hhccCCcCCcEEEEchHHHHHHhcccc-hhhhhhcccccceeeeeeEE-EEeecceeeeeeccceeeeeeeccccccccc Q lcl|NC_021299. 157 NEQKVPKDGRVLLVGSAVEEALLLDDR-FIRYDSAGEAGASRLQTARI-GRLAQYDVVTVDTLPHGDAYLSHPTAYAMLT 234 (387) Q Consensus 157 ~~~~vp~~~r~~v~~~~~~~~l~~~~~-~~~~~~~g~~~~~~~~~g~i-g~~~g~~v~~s~~~~~~~~~~~~~~a~~~~~ 234 (387) ++.++|..+|++|++|+++..|++++. |...+..+ ...+++|.+ |+++||+||+|+++|......++.... .. T Consensus 157 d~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~---~~alr~g~i~G~i~GFdv~~Snnvp~~T~gt~~~~~~--v~ 231 (423) T protein:vir:35 157 KDIGIKTGENYAIMDPWSAQRLADAQSGLHAADQLV---RTAWENAQISGNFGGIRALMSNGLASRKQGDFDGAIT--VK 231 (423) T ss_pred HHhcCCcCCCEEEeCHHHHHHHhccccceeccccch---hHHHhhccceeeecceEEEEcCCCcccccccccccee--ec Confidence 999999999999999999999997654 54444333 456889876 999999999999999876665554321 11 Q ss_pred cccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeeeecccc Q lcl|NC_021299. 235 RSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKATDAEIE 314 (387) Q Consensus 235 ~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~~~~~~ 314 (387) +......... ........+....|...++.....+ ..+..|+..++............ .......+..... T Consensus 232 ~a~~v~~~a~--~~~~~~~~~~~~~~~~~~g~l~~GD---~~t~aGv~~v~~~t~~~~~~~~t-~~~~~~~V~~~~~--- 302 (423) T protein:vir:35 232 TAPNVDYLSV--KDSYQFTVALTGATPSKTGFLKAGD---QLKFTSTHWLNQQSKQTLYNGST-AMSFTATVLEETN--- 302 (423) T ss_pred cccccccccc--cccccceeeeeeeeeccCCcEEecc---eEEeeeeeeccccccceeecccC-CceeEEEEecccc--- Confidence 1111111111 0111111122223333333222222 22333433332222221110000 0000000000000 Q ss_pred ccccccccceeEEEeeccCCccccCcceEEEecCceEEEEc----CCceEEEEecceEEEEEEEC---CEE--EEEEEEE Q lcl|NC_021299. 315 GETVKAGEKLALALEDSNGDNRAGDPLVTWTSGTTAKATID----ANGVVTGVAAGTSEITAVVD---GLT--VKKTITV 385 (387) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~w~Ssn~~VAtVd----~~G~VTa~~~Gta~Itat~~---~~~--~~~~vtV 385 (387) +. .+....+++.+. +.+-..+...++|+ ++..||.+..+.++.++..- +.- ++..+-+ T Consensus 303 --~~-a~g~~~v~i~p~----------~~~~~~~~~~~~v~a~~a~~~~vt~~~~a~~~~~~nl~~~~~a~~l~~~~l~~ 369 (423) T protein:vir:35 303 --ST-ASGDVTVKLSGV----------PIYDEKNSQYNAVDAKVKAGDAVSIIGTAKQQMKPNLFYNKFFCGLGTIPLPK 369 (423) T ss_pred --cc-ccCceeEEcccc----------ccccCCCcccccccccccCCceeeeeecCCCceeEEEeecCceeEEEEEcccc Confidence 00 000111222111 11111222222222 23455555555555554321 111 1111111 Q ss_pred eC Q lcl|NC_021299. 386 TA 387 (387) Q Consensus 386 ta 387 (387) .. T Consensus 370 ~~ 371 (423) T protein:vir:35 370 LH 371 (423) T ss_pred CC Confidence 11 No 3 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=100.00 E-value=6.1e-50 Score=290.40 Aligned_cols=369 Identities=9% Similarity=0.051 Sum_probs=221.8 Q ss_pred Cccccc--cHHHHHHHHHHHHHhhccccceeeeccccccc-ccCCCEEEEEecccceeeceecccccccccccccccccc Q lcl|NC_021299. 1 MANAFI--KPPVIIASILGQLQHELVLPNFVFKNGYGDVA-HKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEV 77 (387) Q Consensus 1 Ma~~~~--~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~-~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (387) |||+++ +||+|++++|+.|+++|+|+++|||+|++||. ++.||||+||+|+...+.++.... ...+.++++.+. T Consensus 1 MaN~llT~ip~iia~~al~~l~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~~~~~~~~~~~~---~~~~~~~~l~e~ 77 (423) T protein:vir:17 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGD---ISGQNKNNLISG 77 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhhcccCCcchhhcccCCEEEEeeCCcceeecccCcc---cCCcccCccccc Confidence 999985 59999999999999999999999999999996 579999999999999998875332 234678999999 Q ss_pred eEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhc-ccccccccCCcchhHHHHHHHHHHH Q lcl|NC_021299. 78 TVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITK-APYEKVSLVDEDEIWNGVVSNRRWL 156 (387) Q Consensus 78 ~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~-~~~~~~~~~~~~~~~~~i~~a~~~l 156 (387) +++++||++||++|+++|+|+.+++.++ ++++++|+++||++||.++++++.. ++...+++++....|+++++++++| T Consensus 78 ~v~l~id~~k~va~~v~d~E~~~~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~a~~~~gt~~t~~~a~~~i~~a~~~L 156 (423) T protein:vir:17 78 KATGRVGNYITVAVEYQQLEEAIKLNQL-EEILAPVRQRIVTDLETELAHFMMNNGALSLGSPNTPITKWSDVAQTASFL 156 (423) T ss_pred eeEEEeeceeeeeeeecHHHHhcChhHH-HHHHHHHHHHHHHHHHHHHHHHHhhccccccccCCcccccHHHHHHHHHHH Confidence 9999999999999999999999999998 7999999999999999999998755 4556677777778899999999999 Q ss_pred hhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEE-EEeecceeeeeeccceeeeeeecccccccccc Q lcl|NC_021299. 157 NEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARI-GRLAQYDVVTVDTLPHGDAYLSHPTAYAMLTR 235 (387) Q Consensus 157 ~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~i-g~~~g~~v~~s~~~~~~~~~~~~~~a~~~~~~ 235 (387) ++.++|.++|++|++|+++..|++++.+......+ ....+|+|.+ |+++||+||+++++|......++.+.... . T Consensus 157 d~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~--~~~alr~g~i~G~i~GFdvy~Snnip~~T~gt~~~t~~~~--~ 232 (423) T protein:vir:17 157 KDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQL--VRTAWENAQIPTNFGGIRALMSNGLASRTQGAFGGTLTVK--T 232 (423) T ss_pred HhccCCcCCCEEEeChHHHHHHhccccceeccccc--chHHHhhccceeeecceEEEEeCCCccccccceeceeeec--c Confidence 99999999999999999999999876543332222 2356899987 89999999999999987777766543321 1 Q ss_pred ccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeecc-----ceeccccccc-------- Q lcl|NC_021299. 236 SPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANL-----DDEPRFVRGT-------- 302 (387) Q Consensus 236 ~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~-----~~~~~~v~~~-------- 302 (387) .+....... ........+....|..+++.....+ ..+..|+...+....... .....++.+. T Consensus 233 ~~~v~~~a~--~~~~~~~~~~~~~~~~~~g~l~~GD---~~t~aGv~~v~~~tk~v~~~~~t~~~~~~~v~~~~~~~a~~ 307 (423) T protein:vir:17 233 QPTVTYNAV--KDSYQFTVTLTGATTSVTGFLKAGD---QVKFTNTYWLQQQTKQALYNGATPISFTATVTADANSDSSG 307 (423) T ss_pred ccccccccc--ccccceeeeeeeeeeeccCceeecc---eEEecceeeecccccccccccccccceEEEEEecccccccC Confidence 111111111 0111111222223333333222222 223334333322222100 0111111110 Q ss_pred --eeeeeeeeccc----cccccccccceeEEEeeccCCccccCcceEEEecCceEEEEcCCceEEEEecceEEEEEEECC Q lcl|NC_021299. 303 --RIHLKATDAEI----EGETVKAGEKLALALEDSNGDNRAGDPLVTWTSGTTAKATIDANGVVTGVAAGTSEITAVVDG 376 (387) Q Consensus 303 --~v~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~w~Ssn~~VAtVd~~G~VTa~~~Gta~Itat~~~ 376 (387) .|.+.+..+-. ...++.........++.-.........++.|+-+--..+++.-. ..+-.. +- +++++| T Consensus 308 ~~tv~i~p~~i~~~~~~~~~~v~a~~a~~~~vT~~~~a~~t~~~nl~~~~~a~~l~~~pl~--~~~~~~--~~-~~~~~g 382 (423) T protein:vir:17 308 DVTVTLSGVPIYDTTNPQYNSVSRQVAAGDAVSVVGTASQTMKPNLFYNKFFCGLGSIPLP--KLHSID--SA-VATYEG 382 (423) T ss_pred ceEEEecCccccccCCcccccceecccCCceeeccccccCCeeEEEEecCcceEEEEEccc--CCCccc--ee-ecccCC Confidence 01111000000 00000000000000111111111112233344333333333211 000000 00 122222 Q ss_pred EEEE-----------EEEEEeC Q lcl|NC_021299. 377 LTVK-----------KTITVTA 387 (387) Q Consensus 377 ~~~~-----------~~vtVta 387 (387) .+-. ..+..-. T Consensus 383 ~s~r~~~~~d~~~~~~~~r~d~ 404 (423) T protein:vir:17 383 FSIRVHKYADGDANVQKMRFDL 404 (423) T ss_pred cEEEEEEecccccceeEEEEEe Confidence 2211 1111111 No 4 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=100.00 E-value=1.8e-49 Score=287.85 Aligned_cols=356 Identities=19% Similarity=0.147 Sum_probs=222.8 Q ss_pred Cc---cccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccccc Q lcl|NC_021299. 1 MA---NAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEV 77 (387) Q Consensus 1 Ma---~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (387) || |++|+||+|++++++.|+++|+|++++||||++||. +.||||+||+|+..+++++ .++.++++++. T Consensus 1 m~~~~N~~ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~-~~GDTV~I~vp~~~~v~dg--------~~~~~~~~te~ 71 (418) T protein:vir:10 1 MAVQDNNLLTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFG-KVGDTIRLKLPYRVKSASG--------RTLVKQPMVDQ 71 (418) T ss_pred CCccccccccHHHHHHHHHHHHHHhccchhhhcCCCchHHh-hCCCEEEEeeCCceeeccc--------CCccccccccc Confidence 88 788999999999999999999999999999999995 5799999999998888764 35778899999 Q ss_pred eEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHHHHHHHh Q lcl|NC_021299. 78 TVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVSNRRWLN 157 (387) Q Consensus 78 ~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~ 157 (387) .++|+||++||++|.++|+|+.+++.+++++++++++++||+++|++++.++.++++..+++++....|+++++++++|+ T Consensus 72 ~v~l~id~~k~~~~~itD~e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l~~~a~~~~gt~gt~~~~~~~i~~a~~~Ld 151 (418) T protein:vir:10 72 TIPFKIAYQEHVGLEYTVKDKTLDIMQFSERYLKSGMVQIANQIDRSLALTLKKAFHSSGTPGVRPGAFIDFANAGAKQT 151 (418) T ss_pred eEEEEEecccccceeechHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccCCcCcchHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999988888888889999999999999 Q ss_pred hccCCcC-CcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeeeccccccccccc Q lcl|NC_021299. 158 EQKVPKD-GRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTAYAMLTRS 236 (387) Q Consensus 158 ~~~vp~~-~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a~~~~~~~ 236 (387) ++++|.+ +|++|++|++|..|+++..+..... .....+|+|.+|+++||+||+++++|.......+.+.+...... T Consensus 152 ~~~VP~~G~R~lVv~P~~~~~L~~~~~~~~~~~---~~~~~lr~G~IG~i~GF~V~~S~nip~~tag~~~~t~~v~ga~~ 228 (418) T protein:vir:10 152 TYAVPQDGMRHAVLDPFTCASLSDEVTKLFKES---MVEQAYKMGYRGNVAAYEVYESQNLPKHTVGDHGGTPLVNGTVV 228 (418) T ss_pred hcCCCCCCceEEEeCHHHHHHHhhhcccccccc---ccchhhheeeeeeeeceEEEEecCCCcccccccccceeeecccc Confidence 9999987 4999999999999988876644322 23457999999999999999999999766554444333221111 Q ss_pred cccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccc----------eeee Q lcl|NC_021299. 237 PGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGT----------RIHL 306 (387) Q Consensus 237 ~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~----------~v~~ 306 (387) .....+... + +...+. ........+..|+...+............++... .|++ T Consensus 229 --~~~~~~~~~-------~----t~s~~g---~l~~Gd~~ti~gv~~v~~~t~~~~~~~~~f~V~~~~~~~~~~~~tv~i 292 (418) T protein:vir:10 229 --NGDTVGFDG-------G----TASTTG---FLKAGDVITFGGVFGVNPQNYETTGLLQEFVVLEDVDTDAGGAGSIKI 292 (418) T ss_pred --cceeEEEee-------c----ceeecc---ceeeccEEEECceeecccccccccccceEEEEEeeccccccCcceeEe Confidence 000000000 0 000000 0000001111222221111111111111111111 1111 Q ss_pred eeeec----cc---cccccccccceeEEEeec--------cCCccccCcceEEEecCceEEEEcCCceEEEEecceEEEE Q lcl|NC_021299. 307 KATDA----EI---EGETVKAGEKLALALEDS--------NGDNRAGDPLVTWTSGTTAKATIDANGVVTGVAAGTSEIT 371 (387) Q Consensus 307 ~~~~~----~~---~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~v~w~Ssn~~VAtVd~~G~VTa~~~Gta~It 371 (387) .+... .. ....+.......++..+. .........++.|+-+--..++..- .-..+.+...++ T Consensus 293 ~p~~~~~~~~~~~~~~~~~~~~~~~~v~a~~a~~~~it~~~~a~~~~~~nl~f~~~a~~l~~~~l---~~p~g~~~~~~~ 369 (418) T protein:vir:10 293 SPSLNDGTATINNENGDPVSLTAYQNVTALPADNAPITVLGAANTTYEQNYLFHRDAIALAMIDL---ELPQSAVIKSRA 369 (418) T ss_pred ccccccccccccccccccccccCCCcccccccCcceeeeecccccceeeeeeeecceEEEEEeec---cCCCCCCcceEE Confidence 11000 00 000011111111111111 1111122345666666555555542 111122222222 Q ss_pred EE-ECCEE-------------EEEEEEEeC Q lcl|NC_021299. 372 AV-VDGLT-------------VKKTITVTA 387 (387) Q Consensus 372 at-~~~~~-------------~~~~vtVta 387 (387) ++ +.|.+ ..|.+-|=- T Consensus 370 ~~~~~G~s~r~~~~~d~~~~~~~~r~d~l~ 399 (418) T protein:vir:10 370 ADPETGLSLTLTGAYDINEQSEIHRIDAVW 399 (418) T ss_pred EeccCCeEEEEEEcccccccceEEEEEeec Confidence 22 22221 122222111 No 5 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=100.00 E-value=5.2e-49 Score=285.29 Aligned_cols=373 Identities=10% Similarity=0.039 Sum_probs=221.5 Q ss_pred Cccccc--cHHHHHHHHHHHHHhhccccceeeeccccccc-ccCCCEEEEEecccceeeceecccccccccccccccccc Q lcl|NC_021299. 1 MANAFI--KPPVIIASILGQLQHELVLPNFVFKNGYGDVA-HKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEV 77 (387) Q Consensus 1 Ma~~~~--~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~-~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (387) |||+++ +||+|++++|+.|+++|+|+++|||+|++||. ++.||||+||+|+..++.++... ....++++++.+. T Consensus 1 MaN~llT~~p~iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~~~~~~d~~~~---~~~~~~~~dl~e~ 77 (423) T protein:vir:10 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTG---DISGQNKNNLISG 77 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCCceeeeccCCc---cccccccCccccc Confidence 999986 59999999999999999999999999999996 67999999999999999988643 2234678999999 Q ss_pred eEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcc-cccccccCCcchhHHHHHHHHHHH Q lcl|NC_021299. 78 TVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKA-PYEKVSLVDEDEIWNGVVSNRRWL 156 (387) Q Consensus 78 ~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~-~~~~~~~~~~~~~~~~i~~a~~~l 156 (387) +++++||++||++|+++|+|+.+++.++ ++++++|+++||++||.++++++... +...++.++....|+.+++++++| T Consensus 78 ~v~l~id~~k~va~~v~d~E~~~~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~~~~~~gt~~t~~~a~~~i~~a~~~L 156 (423) T protein:vir:10 78 KATGRVGNYITVAVEYQQLEEAIKLNQL-EEILAPVRQRIVTDLETELAHFMMNNGALSLGSPNTPITKWSDVAQTASFL 156 (423) T ss_pred eeEEEeeceeeeeeeechHHHhcChhhH-HHHHHHHHHHHHHHHHHHHHHHHhhccccccccCCcccchHHHHHHHHHHH Confidence 9999999999999999999999999998 79999999999999999999987664 455566666778899999999999 Q ss_pred hhccCCcCCcEEEEchHHHHHHhcccchh-hhhhcccccceeeeeeEE-EEeecceeeeeeccceeeeeeeccccccccc Q lcl|NC_021299. 157 NEQKVPKDGRVLLVGSAVEEALLLDDRFI-RYDSAGEAGASRLQTARI-GRLAQYDVVTVDTLPHGDAYLSHPTAYAMLT 234 (387) Q Consensus 157 ~~~~vp~~~r~~v~~~~~~~~l~~~~~~~-~~~~~g~~~~~~~~~g~i-g~~~g~~v~~s~~~~~~~~~~~~~~a~~~~~ 234 (387) +++++|.++|++|++|+++..|++++.+. ..+.. ....+|+|.+ |+++||+||+++++|......++.+.... T Consensus 157 d~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~---~~~alr~g~i~G~i~GFdv~~Snnip~~T~gt~~~t~~~~-- 231 (423) T protein:vir:10 157 KDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQL---VRTAWENAQIPTNFGGIRALMSNGLASRTQGAFGGTLTVK-- 231 (423) T ss_pred HhccCCcCCCEEEeChHHHHHHhccccceeccccc---chhhhhhccceeeecceEEEEeCCCccccccccccceeee-- Confidence 99999999999999999999999876543 33333 3456899987 89999999999999988777766554321 Q ss_pred cccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeee-----ccceeccccccc------- Q lcl|NC_021299. 235 RSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTA-----NLDDEPRFVRGT------- 302 (387) Q Consensus 235 ~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~-----~~~~~~~~v~~~------- 302 (387) ..+..+.+.... ...........|...+..-...+ ..+..|+..++..... .......++... T Consensus 232 ~~~~v~~~a~~~--a~~~~~~~~~~~~~~~~~l~~GD---~~t~aGv~~v~~~tk~~~~~~~t~~~~~~~v~a~~~~~~~ 306 (423) T protein:vir:10 232 TQPTVTYNAVKD--SYQFTVTLTGATASVTGFLKAGD---QVKFTNTYWLQQQTKQALYNGATPISFTATVTADANSDSG 306 (423) T ss_pred ecceeccccccc--cceeeeeeeeccccccCceeecc---eEEecceeeecccccccccccccCcceEEEEEeeeeeccC Confidence 111111111000 00000011111111111111111 2223333332222111 000111111111 Q ss_pred ---eeeeeeeeccc----cccccccccceeEEEeeccCCccccCcceEEEecCceEEEEcCC--ceE---EEEecceEEE Q lcl|NC_021299. 303 ---RIHLKATDAEI----EGETVKAGEKLALALEDSNGDNRAGDPLVTWTSGTTAKATIDAN--GVV---TGVAAGTSEI 370 (387) Q Consensus 303 ---~v~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~w~Ssn~~VAtVd~~--G~V---Ta~~~Gta~I 370 (387) .+.+.+..+-. ...++.........++.-.........++.|+-+--..++..-. |.. ++--.|-... T Consensus 307 g~~tv~i~p~~i~~~~~~~~~~v~a~~a~~~~vT~~~~a~~t~~~nl~~~~~a~~l~~~pl~~~~~~~~~~~~~~g~s~r 386 (423) T protein:vir:10 307 GDVTVTLSGVPIYDTTNPQYNSVSRQVEAGDAVSVVGTASQTMKPNLFYNKFFCGLGSIPLPKLHSIDSAVATYEGFSIR 386 (423) T ss_pred CceeeeccCccccccCCcccccccccccCCceeeccccccCCeeEEEEecCcceEEEEEcccCCCccceeeccccCceEE Confidence 11111100000 00011111000111111111111122344555444444444321 100 0000121111 Q ss_pred EEEE-CCEE--EEEEEEEeC Q lcl|NC_021299. 371 TAVV-DGLT--VKKTITVTA 387 (387) Q Consensus 371 tat~-~~~~--~~~~vtVta 387 (387) -+.+ +..+ ..+.+-|=- T Consensus 387 ~~~~~d~~~~~~~~r~d~l~ 406 (423) T protein:vir:10 387 VHKYADGDANVQKMRFDLLP 406 (423) T ss_pred EEEeeeccccceEEEEEeec Confidence 1111 1111 112222111 No 6 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=100.00 E-value=9e-49 Score=283.98 Aligned_cols=346 Identities=10% Similarity=0.094 Sum_probs=210.3 Q ss_pred Ccccc--ccHHHHHHHHHHHHHhhccccceeeeccccccc-ccCCCEEEEEecccceeeceecccccccccccccccccc Q lcl|NC_021299. 1 MANAF--IKPPVIIASILGQLQHELVLPNFVFKNGYGDVA-HKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEV 77 (387) Q Consensus 1 Ma~~~--~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~-~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (387) |||++ |+||+|++++|+.|+++|+|++++||+|++||. ++.||||+||+|+...+++..... ......+++.+. T Consensus 1 MANsl~~l~p~iia~~al~~l~~~lV~~~lV~r~y~~ef~~ak~GDTV~I~~P~~~~~~d~~~~~---~t~~~~~~l~e~ 77 (423) T protein:vir:10 1 MANNLDANVSQIVLKKFLPGFMSDLVLCKTVDRQLLAGEINSSTGDSVSFKRPHQFKSERTMDGD---ITGKSKNSLISA 77 (423) T ss_pred CccccccccHHHHHHHHHHHHHhhcccchhhccCCCccccccccCCEEEEeeCCceeeecccCcc---cCcccccccccc Confidence 99999 999999999999999999999999999999996 678999999999998887643222 123456789999 Q ss_pred eEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHH-hcccccccccCCcchhHHHHHHHHHHH Q lcl|NC_021299. 78 TVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLI-TKAPYEKVSLVDEDEIWNGVVSNRRWL 156 (387) Q Consensus 78 ~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~-~~~~~~~~~~~~~~~~~~~i~~a~~~l 156 (387) +++++||++||++|.++|+|+.+++.++ ++++++|+++||++||.+++..+ ..+++..+++.+....|+++++++++| T Consensus 78 ~v~l~id~~k~~a~~v~d~E~~l~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~~~~~vgt~~t~~~a~~~~a~a~~~L 156 (423) T protein:vir:10 78 KATGEVGNYITVAVEYRQIEEALKLNQL-DQILVPINERMVTDLETELALFMMKHGALSLGSPNTPIKKWSDVAQTASFL 156 (423) T ss_pred eEEEEecceeeeeeeeChHHHhcChhHH-HHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccHHHHHHHHHHH Confidence 9999999999999999999999999999 78999999999999999997544 556667777777778899999999999 Q ss_pred hhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEE-EEeecceeeeeeccceeee----eeecccccc Q lcl|NC_021299. 157 NEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARI-GRLAQYDVVTVDTLPHGDA----YLSHPTAYA 231 (387) Q Consensus 157 ~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~i-g~~~g~~v~~s~~~~~~~~----~~~~~~a~~ 231 (387) ++.++|..+|++|++|++++.|++++.+......+ +...+++|.+ |+++||+||+++++|.... +.++.++.. T Consensus 157 ~~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~--~~~alr~~~i~G~~~GFdi~~Sn~vp~~T~g~~~ga~~~~~~~ 234 (423) T protein:vir:10 157 KDLGINSGENYAVMDPWAAQRLADAQSGLHVSEQL--VRTAWENAQISGNFGGIRALMSNGLASRTQGAFGGKLTVKGTP 234 (423) T ss_pred hhccCCcCCCEEEeCHHHHHHHhhhhhhhcccccc--chHHHHhcccceeecceEEEEecCCcccccccccceeeeeeee Confidence 99999999999999999999999765544333222 2456889976 9999999999999985322 233433333 Q ss_pred ccccccccccCcee--eeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeec-----cceecccccccee Q lcl|NC_021299. 232 MLTRSPGRPMTNTV--ATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTAN-----LDDEPRFVRGTRI 304 (387) Q Consensus 232 ~~~~~~~~~~~~t~--~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~-----~~~~~~~v~~~~v 304 (387) ...+.......... ..+......+ +... .-..+..|+..++...... ......++..... T Consensus 235 ~vt~a~~~~~~~~~~~~~~~T~s~~g----~l~~---------GD~~t~aGv~~v~~~tk~~l~~~~~~~~~~~~V~~~~ 301 (423) T protein:vir:10 235 EVNYDSVKDSYAFTATLTGATASKKG----FLKV---------GDQLQFDDTHWLNQQSKQTLYNGASALSFTATVMEDA 301 (423) T ss_pred EEEecccccccccccceeeccceece----eEEe---------cceEeecceeeecccccceeecccCCcceEEEEEecc Confidence 32322211100000 0000000000 0110 1111222222222111110 0000111110000 Q ss_pred eeeeeeccccccccccccceeEEEeeccCCccccCcceEEEecCceEEEEc----CCceEEEEecceEEEEEE------- Q lcl|NC_021299. 305 HLKATDAEIEGETVKAGEKLALALEDSNGDNRAGDPLVTWTSGTTAKATID----ANGVVTGVAAGTSEITAV------- 373 (387) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~w~Ssn~~VAtVd----~~G~VTa~~~Gta~Itat------- 373 (387) .++ .+....+++.+... +...++.-.+|+ .+..||.++...++.++. T Consensus 302 -----------~~~-a~~~~tv~i~p~~~----------~~~~~~~~~~V~a~~a~~~~vT~~~~~~~t~~~nl~~~~~a 359 (423) T protein:vir:10 302 -----------NAH-SSGDVTVKISGVPI----------FDAGYPQYNAVDRLLAEGDTVSVIGTSKQAMKPNLFYNKLF 359 (423) T ss_pred -----------ccc-ccCceEEEeccccc----------cccCcccccceeccccCCceeEEeeccCCceeEEEEecCcc Confidence 000 00011111111110 111112122222 233444444444444432 Q ss_pred --------------------ECCEEEE-----------EEEEEeC Q lcl|NC_021299. 374 --------------------VDGLTVK-----------KTITVTA 387 (387) Q Consensus 374 --------------------~~~~~~~-----------~~vtVta 387 (387) ++|.+-. ..+..-. T Consensus 360 ~~l~~~pl~~~~~~~~~~~~~~g~s~r~~~~~d~~~~~~~~r~d~ 404 (423) T protein:vir:10 360 CGLGTIPLPKLHSIDSAVATYEGFSIRVHKYADGDANKQMMRFDL 404 (423) T ss_pred eEEEEEcccCCCccceeecccccceEEEEEeeeccccceEEEEEe Confidence 2221111 1111111 No 7 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=100.00 E-value=9.9e-45 Score=261.83 Aligned_cols=267 Identities=19% Similarity=0.190 Sum_probs=197.9 Q ss_pred CccccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccccceEE Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEVTVD 80 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (387) |||++|+||+|+++++++|++.++|+++++|||+.++ +.||||+||+++...+++|.. .+..+..+++.+..++ T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~--~~Gdtv~ip~~~~~~~~d~~~----~~~~~~~~~~~~~~~~ 74 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTA--SKGNVVHIAGVVAPTVKDYKA----AGRQTSADAISDTGVD 74 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchhhcccccccc--ccCceEEEeeccccccccccc----CCCccCccccccceEE Confidence 9999999999999999999999999999999998764 679999999999988888753 3455778899999999 Q ss_pred EEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc--cccCCcchhHHHHHHHHHHHhh Q lcl|NC_021299. 81 IKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEK--VSLVDEDEIWNGVVSNRRWLNE 158 (387) Q Consensus 81 ~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~--~~~~~~~~~~~~i~~a~~~l~~ 158 (387) ++||+++++++.++|.|+.+...++. .+++|++++||+++|.++++++.++.... ..+++..+.|+.|++++++|++ T Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~~~-~~~~~~~~alA~~vD~~i~~~~~~a~~~~~~~~~~~~~~~~~~i~~a~~~ld~ 153 (273) T protein:vir:10 75 LLIDQEKSIDFLVDDIDRVQVAGSLE-AYTRAGATALATDTDKFIADMLVDNGTALTGSAPTDADDAFDLIAKALKELTK 153 (273) T ss_pred EEEeeeeecceEeecHHHhhhhccHH-HHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhHHHHHHHHHHHHhhh Confidence 99999999999999999999999975 59999999999999999999988764433 4445667889999999999999 Q ss_pred ccCCcCCcEEEEchHHHHHHhcccchh-hhhhcccccceeeeeeEEEEeecceeeeeeccceee---eeeeccccccccc Q lcl|NC_021299. 159 QKVPKDGRVLLVGSAVEEALLLDDRFI-RYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGD---AYLSHPTAYAMLT 234 (387) Q Consensus 159 ~~vp~~~r~~v~~~~~~~~l~~~~~~~-~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~---~~~~~~~a~~~~~ 234 (387) +++|.++|++|++|+++..|++++.|. +....++. ..+++|.+|+++||+|++++++|.+. .+.+|+.++++.. T Consensus 154 ~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~--~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~~~~~~~A~~~a~ 231 (273) T protein:vir:10 154 ANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDA--AGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVS 231 (273) T ss_pred cCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccc--cceeeeeeeEEeceEEEEecccccCCccEEEEEeccceeeee Confidence 999999999999999999999988644 44554443 46899999999999999999998653 3455666655443 Q ss_pred cccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceecc Q lcl|NC_021299. 235 RSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPR 297 (387) Q Consensus 235 ~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 297 (387) ..... +.. .+.....+.......+|.....+.........+. T Consensus 232 q~~~~-e~~--------------------r~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 232 QIDTV-EAL--------------------RDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred eeehh-hcc--------------------cCCCcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 22110 000 0000000000011111111111111111100000 No 8 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=100.00 E-value=9.9e-45 Score=261.83 Aligned_cols=267 Identities=19% Similarity=0.190 Sum_probs=197.9 Q ss_pred CccccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccccceEE Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEVTVD 80 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (387) |||++|+||+|+++++++|++.++|+++++|||+.++ +.||||+||+++...+++|.. .+..+..+++.+..++ T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~--~~Gdtv~ip~~~~~~~~d~~~----~~~~~~~~~~~~~~~~ 74 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTA--SKGNVVHIAGVVAPTVKDYKA----AGRQTSADAISDTGVD 74 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchhhcccccccc--ccCceEEEeeccccccccccc----CCCccCccccccceEE Confidence 9999999999999999999999999999999998764 679999999999988888753 3455778899999999 Q ss_pred EEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc--cccCCcchhHHHHHHHHHHHhh Q lcl|NC_021299. 81 IKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEK--VSLVDEDEIWNGVVSNRRWLNE 158 (387) Q Consensus 81 ~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~--~~~~~~~~~~~~i~~a~~~l~~ 158 (387) ++||+++++++.++|.|+.+...++. .+++|++++||+++|.++++++.++.... ..+++..+.|+.|++++++|++ T Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~~~-~~~~~~~~alA~~vD~~i~~~~~~a~~~~~~~~~~~~~~~~~~i~~a~~~ld~ 153 (273) T protein:vir:10 75 LLIDQEKSIDFLVDDIDRVQVAGSLE-AYTRAGATALATDTDKFIADMLVDNGTALTGSAPTDADDAFDLIAKALKELTK 153 (273) T ss_pred EEEeeeeecceEeecHHHhhhhccHH-HHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhHHHHHHHHHHHHhhh Confidence 99999999999999999999999975 59999999999999999999988764433 4445667889999999999999 Q ss_pred ccCCcCCcEEEEchHHHHHHhcccchh-hhhhcccccceeeeeeEEEEeecceeeeeeccceee---eeeeccccccccc Q lcl|NC_021299. 159 QKVPKDGRVLLVGSAVEEALLLDDRFI-RYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGD---AYLSHPTAYAMLT 234 (387) Q Consensus 159 ~~vp~~~r~~v~~~~~~~~l~~~~~~~-~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~---~~~~~~~a~~~~~ 234 (387) +++|.++|++|++|+++..|++++.|. +....++. ..+++|.+|+++||+|++++++|.+. .+.+|+.++++.. T Consensus 154 ~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~--~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~~~~~~~A~~~a~ 231 (273) T protein:vir:10 154 ANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDA--AGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVS 231 (273) T ss_pred cCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccc--cceeeeeeeEEeceEEEEecccccCCccEEEEEeccceeeee Confidence 999999999999999999999988644 44554443 46899999999999999999998653 3455666655443 Q ss_pred cccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceecc Q lcl|NC_021299. 235 RSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPR 297 (387) Q Consensus 235 ~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 297 (387) ..... +.. .+.....+.......+|.....+.........+. T Consensus 232 q~~~~-e~~--------------------r~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 232 QIDTV-EAL--------------------RDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred eeehh-hcc--------------------cCCCcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 22110 000 0000000000011111111111111111100000 No 9 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=100.00 E-value=1.2e-44 Score=261.43 Aligned_cols=267 Identities=19% Similarity=0.197 Sum_probs=198.7 Q ss_pred CccccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccccceEE Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEVTVD 80 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (387) |||++|+||+|+++++++|++.++|.+++||||+. .+++||||+||.++.....+|.. .+.++..+++.+..++ T Consensus 1 MA~~~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~--~~~~GdTv~ip~~~~~~~~d~~~----~~~~~~~~~~~~~~~~ 74 (273) T protein:vir:79 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEG--IASKGNVVHIAGVVAPTVKDYKA----AGRQTSADAISDTGVD 74 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhccchhhhhccccc--cccCCcEEEEeecCccccccccc----CCCccCccccccceEE Confidence 99999999999999999999999999999999975 46789999999999888887753 3456778899999999 Q ss_pred EEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc--cccCCcchhHHHHHHHHHHHhh Q lcl|NC_021299. 81 IKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEK--VSLVDEDEIWNGVVSNRRWLNE 158 (387) Q Consensus 81 ~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~--~~~~~~~~~~~~i~~a~~~l~~ 158 (387) ++||+++++++.++|.|+.+...++. ++++|++++||+++|+++++++.++.... +.+.+..+.++.|++++.+|++ T Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~~~-~~~~~~~~ala~~vD~~i~~~~~~a~~~~~~~~~~~~~~~~~~i~~a~~~ld~ 153 (273) T protein:vir:79 75 LLIDQEKSIDFLVDDIDRVQVAGSLE-AYTRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTK 153 (273) T ss_pred EEEeeecccceeeccHHHHhhcccHH-HHHHHHHHHHHHHHHHHHHHHHhhcccccccccccchhhHHHHHHHHHHHhhh Confidence 99999999999999999999999985 68999999999999999999987765433 4455667889999999999999 Q ss_pred ccCCcCCcEEEEchHHHHHHhcccc-hhhhhhcccccceeeeeeEEEEeecceeeeeeccceeee---eeeccccccccc Q lcl|NC_021299. 159 QKVPKDGRVLLVGSAVEEALLLDDR-FIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDA---YLSHPTAYAMLT 234 (387) Q Consensus 159 ~~vp~~~r~~v~~~~~~~~l~~~~~-~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~---~~~~~~a~~~~~ 234 (387) ++||.++|++|++|+++..|++++. |.+....++. ..+++|.+|+++||+|++++.+|.+.. +.+|+.++.+.. T Consensus 154 ~~vP~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~--~~l~~G~ig~~~G~~i~~s~~lp~~~~~~~~a~~~~A~~~a~ 231 (273) T protein:vir:79 154 ANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDA--AGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVS 231 (273) T ss_pred ccCCccCcEEEECHHHHHHHhhchhhhhhhhhcccc--cceeeeEeeEEeceEEEecccccccCceEEEEEeccceeeee Confidence 9999999999999999999999875 5555555543 468999999999999999999987643 345666665543 Q ss_pred cccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceecc Q lcl|NC_021299. 235 RSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPR 297 (387) Q Consensus 235 ~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 297 (387) ........ .+.....+.......+|.....+.........+. T Consensus 232 ~~~~~e~~---------------------r~~~~~~~~v~~~~~yg~~v~~p~~vv~~~~~g~ 273 (273) T protein:vir:79 232 QIDTVEAL---------------------RDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred ehhhhhcc---------------------cCcccceeeeeeeeeeeeEEecCceEEEEeccCC Confidence 32211000 0000000011111112221111111111110000 No 10 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=100.00 E-value=1.2e-42 Score=250.36 Aligned_cols=288 Identities=15% Similarity=0.126 Sum_probs=194.8 Q ss_pred Ccccc------------ccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccc Q lcl|NC_021299. 1 MANAF------------IKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRN 68 (387) Q Consensus 1 Ma~~~------------~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~ 68 (387) |+|++ |+||+|++++++.|+++++|.+++ |||+.++ +.||||+||.++...+.++. .+.+ T Consensus 3 ~~~~~~~~~~~t~~v~~fipei~s~~i~~~l~~~~v~~~~~-~d~~~~~--~~Gdtv~ip~~g~~~~~d~~-----~~~~ 74 (341) T protein:vir:94 3 LGNTITGPSINTQRGQQFIPEQWLSEVQMFRKAKMLDTSVV-KTWGAQV--KKGDTFHVPRISELGVEDKA-----TDVP 74 (341) T ss_pred chhhhccccccchhHHHHHHHHHHHHHHHHHHhhcchhhcc-ccccccc--cCCceEEEeccCcceeeeec-----CCCc Confidence 55654 889999999999999999999987 7998775 45999999999988887764 4567 Q ss_pred ccccccccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc----------- Q lcl|NC_021299. 69 MVASDLTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKV----------- 137 (387) Q Consensus 69 ~~~~~~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~----------- 137 (387) ++++++.+..++++||+++++++.++|+|+.+...|+++++++|++++||+++|+++++++..+..... T Consensus 75 i~~~~~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~~~~~~~~~ 154 (341) T protein:vir:94 75 VGVQPVNDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNVFSSSNGAI 154 (341) T ss_pred cccccccCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCccccCccccc Confidence 888999999999999999999999999999999999999999999999999999999988765432211 Q ss_pred ccCCcchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeecc Q lcl|NC_021299. 138 SLVDEDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTL 217 (387) Q Consensus 138 ~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~ 217 (387) ++......|+.|++++++|++++||.++|++|++|+++..|+++++|.+.+..++ ..+++|.+|+++||+|++++++ T Consensus 155 t~~~~~~~~~~i~~a~~~Lde~~VP~~gR~lvv~P~~~~~Ll~~~~~~~~~~~g~---~~l~~G~ig~i~G~~V~~Sn~l 231 (341) T protein:vir:94 155 TGNGQAFSFAVFLAARRLLLEADVPEEKIVLLISPGQESALFTIPQFISKDFINN---APIAQGQIGSLMGVRVIRTSLI 231 (341) T ss_pred cCchhhhhHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhhchhhhhhhcccc---chhheeeeeeEeceEEEEeccc Confidence 1112235689999999999999999999999999999999999999999887765 3579999999999999999999 Q ss_pred ceeeeeeeccccccccccccccccCceeeeeeecccccceeeeeeeeee----------------------------ccc Q lcl|NC_021299. 218 PHGDAYLSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDA----------------------------TST 269 (387) Q Consensus 218 ~~~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~----------------------------~~~ 269 (387) |......++..........................+............. ... T Consensus 232 p~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~ 311 (341) T protein:vir:94 232 GNNSATGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVVSKAPRVTQSFENREQ 311 (341) T ss_pred cccccccccccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhhccccccccccccchhhhh Confidence 9877665544333221111111010110010000000000000000000 000 Q ss_pred eeeeeeeeeeeeccccceeeeccceeccccccceeeeeeeecccccccc Q lcl|NC_021299. 270 TERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKATDAEIEGETV 318 (387) Q Consensus 270 ~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~~~~~~~~~~ 318 (387) .+.......+|.....+... +.+.. ...++ T Consensus 312 ~~~i~~~~~~G~~~lrp~~~--------------v~~~~-----~~~~~ 341 (341) T protein:vir:94 312 VWLMVGRQAYGARLYRPLHA--------------VNIHT-----TGDTV 341 (341) T ss_pred hhhhhhhhhhcccccCccee--------------EEEec-----CcCCC Confidence 00000011111111111110 11111 11111 No 11 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=100.00 E-value=8.2e-39 Score=229.37 Aligned_cols=328 Identities=12% Similarity=0.044 Sum_probs=213.2 Q ss_pred Ccc---ccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccccc Q lcl|NC_021299. 1 MAN---AFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEV 77 (387) Q Consensus 1 Ma~---~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (387) |+. +.|+||+|++++++.|+++++|.+++++ .+|.++.||||+||.++...+.++. ++.++.++++.+. T Consensus 15 ~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~---~~~~~~~GdTV~ip~~g~~~a~d~~-----~g~~i~~~~~~~~ 86 (381) T protein:vir:80 15 VDLSNVQVFIPEVWSSEVRMFRDQKFAALEATKK---IPFEGKKGDLIHIPNISRAAVYDKQ-----PQTPVNLQARTDS 86 (381) T ss_pred cchhhHHhhhhHHHHHHHHHHHHHhhhhhhcccc---ccceeecCceEEeeccCcceeeeec-----CCCcccccccCCc Confidence 442 2488999999999999999999998864 2345667999999999888777764 4567889999999 Q ss_pred eEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccc-------------------ccc Q lcl|NC_021299. 78 TVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYE-------------------KVS 138 (387) Q Consensus 78 ~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~-------------------~~~ 138 (387) .++++||+++++++.++|.|+.+...|++.++.+++..+||+++|++++..+...... ..+ T Consensus 87 ~~~itID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~~t~~~~i~~~~~~~~~t 166 (381) T protein:vir:80 87 EFTFTVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRIYSYDTTLGDGTVNAHLT 166 (381) T ss_pred eEEEEEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccccccccccccc Confidence 9999999999999999999999999999999999999999999999998876432211 112 Q ss_pred cCCcchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccc Q lcl|NC_021299. 139 LVDEDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLP 218 (387) Q Consensus 139 ~~~~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~ 218 (387) +.+....|+.|++++++|+++++|.++|++|++|+++..|+++++|.+.+..++ ..+++|.+|+++||+|++++.+| T Consensus 167 ~~~~~~t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~~~~~~ad~~~~---~~l~~G~Ig~i~G~~Vv~Sn~lp 243 (381) T protein:vir:80 167 GTPAPLTYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLSINQFISVDFSQV---KPVTSGVVGTILGMEVIVTTQIG 243 (381) T ss_pred cchhhHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhhchhhhhhhhccc---hhhhceeeeEEcceEEEeecccc Confidence 233456789999999999999999999999999999999999999988776443 46899999999999999999999 Q ss_pred eeeeeeeccccccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccc Q lcl|NC_021299. 219 HGDAYLSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRF 298 (387) Q Consensus 219 ~~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~ 298 (387) ......++..+.......+ ...+ ..+.....+......+...||.....+.+.++...|.................. T Consensus 244 ~~~~t~~~~~agap~~~~~-~~~~--~~~~g~~s~~a~av~~~k~yd~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~ 320 (381) T protein:vir:80 244 INSLTGYVNGQGAPTQPTP-GVLG--SPYLPDQAGTANVVNTGSASDLAVSLSYFGLPVFSGAGATAADGGQTLGSFGGA 320 (381) T ss_pred cccccceeeeccccccccc-cccc--cccccccccceeeeeeeeeeceeeeeeeccceeeecceeeecCCCceeeeehhh Confidence 7654433322211111110 0011 111122223345667778888888777777776655443333222222222111 Q ss_pred cc------------cceeeeeeee-cccccccccccccee--E--EEeeccCCccccCcceEEEecCceEEEEcCCce Q lcl|NC_021299. 299 VR------------GTRIHLKATD-AEIEGETVKAGEKLA--L--ALEDSNGDNRAGDPLVTWTSGTTAKATIDANGV 359 (387) Q Consensus 299 v~------------~~~v~~~~~~-~~~~~~~~~~~~~~~--~--~~~~~~~~~~~~~~~v~w~Ssn~~VAtVd~~G~ 359 (387) .. ...+.+.... .+.+ .+-.+.-.- . -+.+ ..|. .-|.-+.| |. T Consensus 321 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~-~~~~----~~~~~~~~----------~~ 381 (381) T protein:vir:80 321 NRWATAVVCHPDWLAVGVQQNVKSESSRE--TMYLADAFVTSCVYGAKV-FRPD----HCVLLHTS----------GI 381 (381) T ss_pred hhhhhhcccccccccccceeEeecccchh--heeehhhhhhhhhhcccc-ccch----hhhhhhhc----------CC Confidence 11 1111111110 0000 000000000 0 0000 0000 00111111 11 No 12 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=100.00 E-value=1.3e-37 Score=222.72 Aligned_cols=281 Identities=14% Similarity=0.081 Sum_probs=177.7 Q ss_pred Ccc--------ccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccc Q lcl|NC_021299. 1 MAN--------AFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVAS 72 (387) Q Consensus 1 Ma~--------~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~ 72 (387) |+. .+|+||+|+++++..|++.|++.++.++.++ +.|||||||.++..++.+|. .+.++.+| T Consensus 1 ~~~~n~ts~~qafi~~EiWsa~il~~l~~~Lv~~~~~~~~d~-----g~GDtV~InsIg~~tV~dY~-----~~~~i~~d 70 (322) T protein:vir:31 1 MSTGNNTSNTQALIVSEIWADEIEDILHEKLLDVNIARVVDF-----PDGDKLTIPSVGTPVVRSRP-----EQGDFTFD 70 (322) T ss_pred CCCCCCcccceEEeehhhhHHHHHHHhhhhhhhhhhhccccc-----CCCCeEEecccccccccccc-----CCCCcccc Confidence 761 1467999999999999999999998876543 35999999999999999885 45678999 Q ss_pred ccccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccc---------c-------- Q lcl|NC_021299. 73 DLTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPY---------E-------- 135 (387) Q Consensus 73 ~~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~---------~-------- 135 (387) ++++++++|+||+.||++|.++| |+.|...+++...+++++++|+..+|+++.++++.... . T Consensus 71 ~ltt~~~~l~IDq~KYfaf~VdD-D~~Qa~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~~p~vin~~~~~i 149 (322) T protein:vir:31 71 NLDTGEISIILRDEVYAGNAISK-KLRQDSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQNDPNVINGVPHRF 149 (322) T ss_pred cCCCceEEEEEehhhhhccccch-hHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCcceecCCccce Confidence 99999999999999999999999 99999999999999999999999999999886553221 0 Q ss_pred ccccCCcchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHH---------HhcccchhhhhhcccccceeeeeeEEEEe Q lcl|NC_021299. 136 KVSLVDEDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEA---------LLLDDRFIRYDSAGEAGASRLQTARIGRL 206 (387) Q Consensus 136 ~~~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~---------l~~~~~~~~~~~~g~~~~~~~~~g~ig~~ 206 (387) +.++..+.+.|+.+++++.+||+++||.++||+||+|.++.. +++|++|.+....|.. ..++ .+|++ T Consensus 150 v~~gt~~~~ay~~lv~l~~kLdkanVP~~gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG~a--~g~~--~Vg~~ 225 (322) T protein:vir:31 150 VGTGTDQTMDVTDFSRVNYVMTQSKMPMGGMIGIIDPSVAHHLETITNISNISNNPRWEGIVESGIA--PDMQ--FVRSV 225 (322) T ss_pred eccCCCchhhHHHHHHHHHHhccccCCCCCeEEEeCchhhhhhhhhhhhhhhhccccccccccccch--hhHH--HHHHH Confidence 122334567899999999999999999999999999998764 4678888876666542 1222 38999 Q ss_pred ecceeeeeeccceeeeeee-ccccccccccc-----cccccCceeeeeeeccccccee-eeeeeeeeccceeeeeeeeee Q lcl|NC_021299. 207 AQYDVVTVDTLPHGDAYLS-HPTAYAMLTRS-----PGRPMTNTVATSTVATENGVQL-RWLGDYDATSTTERSIVDTWI 279 (387) Q Consensus 207 ~g~~v~~s~~~~~~~~~~~-~~~a~~~~~~~-----~~~~~~~t~~~~~~~~~~~~~~-~~~~~~d~~~~~~~~~~~~~~ 279 (387) +||+|+.|+.++....-.. ...+.....+. ...+.+..-..+.. ..+ ......+...-.+....-..+ T Consensus 226 ~GF~V~~SN~l~~~~~~i~aG~d~~~t~ag~~n~f~~~~~~~~~~~~~~~-----~~l~~~e~~r~~~~~~d~~~~~~~~ 300 (322) T protein:vir:31 226 YGIDLFVSNLLADANETINAGGDARSTTAGKCNMFMNVSDMGLLPFVVAW-----KEMPTTKSFIDDYNDDLNTATTARW 300 (322) T ss_pred hceeeeeeccccccccccccCcccccccceeecccccccchhhhhhhhHh-----hhhhhhhcccCccccccceeeeeee Confidence 9999999999864321100 00000000000 00000000000000 000 000000001111111111122 Q ss_pred eeccccceeeeccceeccccccceeeeeeeecccccccc Q lcl|NC_021299. 280 GVKAVLDPVTANLDDEPRFVRGTRIHLKATDAEIEGETV 318 (387) Q Consensus 280 g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~~~~~~~~~~ 318 (387) |........-..... +..+.+. T Consensus 301 g~g~~r~e~l~~~~a-----------------~~~~~~~ 322 (322) T protein:vir:31 301 GNGLVRDENLVCVLA-----------------NADKVTF 322 (322) T ss_pred cceeecccceEEEEe-----------------ccccccC Confidence 221111110000000 0000000 No 13 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=100.00 E-value=6.6e-34 Score=202.50 Aligned_cols=266 Identities=17% Similarity=0.155 Sum_probs=196.3 Q ss_pred Ccccc------ccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccccc Q lcl|NC_021299. 1 MANAF------IKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDL 74 (387) Q Consensus 1 Ma~~~------~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (387) |||.. ++||+|+++++++|++.++|.+++.+|+ ++.+++|++|+||.+.... +.+ ....+..++++++ T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~--~l~g~~G~tv~iP~~~~ig--~a~--~~~~g~~i~~~~l 74 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDS--TLQGQPGDTLTFPAFVYSG--DAQ--VVAEGEKIPTDIL 74 (274) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecc--cccCCCCCEEEEeeecCCC--ccc--cccCCCccchhhc Confidence 99975 9999999999999999999999999986 4778899999999876431 211 2345677899999 Q ss_pred ccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHHHHH Q lcl|NC_021299. 75 TEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVSNRR 154 (387) Q Consensus 75 ~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~ 154 (387) +.++..++|++ .+++|.++|++..+...|++.+.++|+.+++|+++|++++..+.++..... .....|+.|++|.. T Consensus 75 t~~~~~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~~~~---~~a~~~d~i~dA~~ 150 (274) T protein:vir:12 75 ETKKREAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVN---ADITKLNGLQSAID 150 (274) T ss_pred ccceeeEEeee-ecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccc---ccccCHHHHHHHHH Confidence 99999999966 689999999999999999999999999999999999999998887665442 34567999999999 Q ss_pred HHhhccCCcCCcEEEEchHHHHHHhccc--chhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeeeccccccc Q lcl|NC_021299. 155 WLNEQKVPKDGRVLLVGSAVEEALLLDD--RFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTAYAM 232 (387) Q Consensus 155 ~l~~~~vp~~~r~~v~~~~~~~~l~~~~--~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a~~~ 232 (387) +|++++. .+|+++++|+.+..|+++. +|.+....+. ..+++|.+|++.|++|+.++.+|.+..+.++..++.+ T Consensus 151 ~lgd~~~--~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g~---~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gA~~~ 225 (274) T protein:vir:12 151 KFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATELGD---DIIVKGAFGEALGAIIVRSNKLEAGTAILAKKGAVKL 225 (274) T ss_pred Hhccccc--cccEEEeCHHHHHHHHhhhhhhccccccccc---cceecccceeecCeeEEEeCCCCcceEEEEeccceee Confidence 9998864 7899999999999999975 6776655543 4679999999999999999999999888888777665 Q ss_pred cccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeeeecc Q lcl|NC_021299. 233 LTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKATDAE 312 (387) Q Consensus 233 ~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~~~~ 312 (387) ...... ....+.+.....+.......++............. ...-+ T Consensus 226 ~~~~~~--------------------~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t--------------~~~~~ 271 (274) T protein:vir:12 226 ILKRDF--------------------FLEVARDASTKTTALYSDKHYVAYLYDESKAVKIT--------------KGSGS 271 (274) T ss_pred eecCCc--------------------eeccccchhhcccEEEeeeEEEEEEEcCCceEEEE--------------cCCcc Confidence 433211 11122222222222223333333222221111111 00011 Q ss_pred cccccccc Q lcl|NC_021299. 313 IEGETVKA 320 (387) Q Consensus 313 ~~~~~~~~ 320 (387) + .+ T Consensus 272 ~-----~~ 274 (274) T protein:vir:12 272 L-----EM 274 (274) T ss_pred c-----cC Confidence 1 11 No 14 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=100.00 E-value=1.7e-33 Score=200.21 Aligned_cols=266 Identities=16% Similarity=0.157 Sum_probs=194.4 Q ss_pred Cccc------cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccccc Q lcl|NC_021299. 1 MANA------FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDL 74 (387) Q Consensus 1 Ma~~------~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (387) |||. +++||+|+++++++|.+.++|.+++..| ++|.+++|+||+||.+.... +.+ ....+..++++++ T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~--~~l~g~~G~tv~iP~~~~ig--~a~--~~~~g~~i~~~~l 74 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEID--NTLVGQPGDTLTFPAFIYSG--DAK--VVAEGEKIPTDIL 74 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceec--ccccCCCCCEEEeeeecCCC--ccc--cccCCCccchhhc Confidence 9984 4889999999999999999999997666 34678899999999887532 222 2345667899999 Q ss_pred ccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHHHHH Q lcl|NC_021299. 75 TEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVSNRR 154 (387) Q Consensus 75 ~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~ 154 (387) +.++.+++|++ .+++|.++|++..+...|++.+.++|+.+++|+++|++++..+.++..... .....|+.|++|.. T Consensus 75 t~~~~~~~i~~-~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~~---~~~~~~d~i~~A~~ 150 (274) T protein:vir:96 75 ETKKREAKIRK-IAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLTVE---ADITKLTGLQTAID 150 (274) T ss_pred ccceeEEEeee-eecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc---ccccCHHHHHHHHH Confidence 99999999966 689999999999999999999999999999999999999999887765542 34456999999999 Q ss_pred HHhhccCCcCCcEEEEchHHHHHHhccc--chhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeeeccccccc Q lcl|NC_021299. 155 WLNEQKVPKDGRVLLVGSAVEEALLLDD--RFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTAYAM 232 (387) Q Consensus 155 ~l~~~~vp~~~r~~v~~~~~~~~l~~~~--~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a~~~ 232 (387) +|++++. .+|+++++|+.++.|+++. +|.+....+ ...+++|.+|++.|++|+.++.+|.+..+.++..++.+ T Consensus 151 ~lgd~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g---~~~~~~G~ig~~~G~~Vi~s~~~~~~t~~l~~~gA~~~ 225 (274) T protein:vir:96 151 KFNDEDL--EPMVLFISPLDAGKLRGDATTNFTRATELG---DDVIVKGAFGEALGAVIVRSNKLEAGTAILAKKGAVKL 225 (274) T ss_pred Hhccccc--cccEEEeCHHHHHHHHhhcccccccccccc---ccceeccccceecCeEEEEeCCCCCceEEEEeccceee Confidence 9998874 6899999999999999985 566655554 35789999999999999999999999888888777765 Q ss_pred cccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeeeecc Q lcl|NC_021299. 233 LTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKATDAE 312 (387) Q Consensus 233 ~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~~~~ 312 (387) ....... ...+.+.....+.......++.....+....... +..-+ T Consensus 226 ~~~~~~~--------------------vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~t--------------k~~~~ 271 (274) T protein:vir:96 226 ITKRDFF--------------------LETDRDPSTKTTALYSDKHYVAYLYDESKAVKIT--------------KGSGS 271 (274) T ss_pred eecCCcc--------------------cccccccccccCEEEEeEEEEEEEEcCCcEEEEE--------------cCCcc Confidence 4332111 1112222222222223333333222221111110 11111 Q ss_pred cccccccc Q lcl|NC_021299. 313 IEGETVKA 320 (387) Q Consensus 313 ~~~~~~~~ 320 (387) + .+ T Consensus 272 ~-----~~ 274 (274) T protein:vir:96 272 L-----EM 274 (274) T ss_pred c-----cC Confidence 1 11 No 15 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=100.00 E-value=1.7e-33 Score=200.21 Aligned_cols=266 Identities=16% Similarity=0.157 Sum_probs=194.4 Q ss_pred Cccc------cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccccc Q lcl|NC_021299. 1 MANA------FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDL 74 (387) Q Consensus 1 Ma~~------~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (387) |||. +++||+|+++++++|.+.++|.+++..| ++|.+++|+||+||.+.... +.+ ....+..++++++ T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~--~~l~g~~G~tv~iP~~~~ig--~a~--~~~~g~~i~~~~l 74 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEID--NTLVGQPGDTLTFPAFIYSG--DAK--VVAEGEKIPTDIL 74 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceec--ccccCCCCCEEEeeeecCCC--ccc--cccCCCccchhhc Confidence 9984 4889999999999999999999997666 34678899999999887532 222 2345667899999 Q ss_pred ccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHHHHH Q lcl|NC_021299. 75 TEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVSNRR 154 (387) Q Consensus 75 ~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~ 154 (387) +.++.+++|++ .+++|.++|++..+...|++.+.++|+.+++|+++|++++..+.++..... .....|+.|++|.. T Consensus 75 t~~~~~~~i~~-~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~~---~~~~~~d~i~~A~~ 150 (274) T protein:vir:95 75 ETKKREAKIRK-IAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLTVE---ADITKLTGLQTAID 150 (274) T ss_pred ccceeEEEeee-eecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc---ccccCHHHHHHHHH Confidence 99999999966 689999999999999999999999999999999999999999887765542 34456999999999 Q ss_pred HHhhccCCcCCcEEEEchHHHHHHhccc--chhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeeeccccccc Q lcl|NC_021299. 155 WLNEQKVPKDGRVLLVGSAVEEALLLDD--RFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTAYAM 232 (387) Q Consensus 155 ~l~~~~vp~~~r~~v~~~~~~~~l~~~~--~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a~~~ 232 (387) +|++++. .+|+++++|+.++.|+++. +|.+....+ ...+++|.+|++.|++|+.++.+|.+..+.++..++.+ T Consensus 151 ~lgd~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g---~~~~~~G~ig~~~G~~Vi~s~~~~~~t~~l~~~gA~~~ 225 (274) T protein:vir:95 151 KFNDEDL--EPMVLFISPLDAGKLRGDATTNFTRATELG---DDVIVKGAFGEALGAVIVRSNKLEAGTAILAKKGAVKL 225 (274) T ss_pred Hhccccc--cccEEEeCHHHHHHHHhhcccccccccccc---ccceeccccceecCeEEEEeCCCCCceEEEEeccceee Confidence 9998874 6899999999999999985 566655554 35789999999999999999999999888888777765 Q ss_pred cccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeeeecc Q lcl|NC_021299. 233 LTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKATDAE 312 (387) Q Consensus 233 ~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~~~~ 312 (387) ....... ...+.+.....+.......++.....+....... +..-+ T Consensus 226 ~~~~~~~--------------------vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~t--------------k~~~~ 271 (274) T protein:vir:95 226 ITKRDFF--------------------LETDRDPSTKTTALYSDKHYVAYLYDESKAVKIT--------------KGSGS 271 (274) T ss_pred eecCCcc--------------------cccccccccccCEEEEeEEEEEEEEcCCcEEEEE--------------cCCcc Confidence 4332111 1112222222222223333333222221111110 11111 Q ss_pred cccccccc Q lcl|NC_021299. 313 IEGETVKA 320 (387) Q Consensus 313 ~~~~~~~~ 320 (387) + .+ T Consensus 272 ~-----~~ 274 (274) T protein:vir:95 272 L-----EM 274 (274) T ss_pred c-----cC Confidence 1 11 No 16 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=100.00 E-value=1.5e-33 Score=200.59 Aligned_cols=267 Identities=14% Similarity=0.126 Sum_probs=193.5 Q ss_pred Ccc------ccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccccc Q lcl|NC_021299. 1 MAN------AFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDL 74 (387) Q Consensus 1 Ma~------~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (387) ||| ++|+||+|+++++++|++.++|.+++.++++ +.+++|++|+||.+...... + ....+..++++++ T Consensus 1 Ma~~~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~--l~g~~G~tv~ip~~~~~g~a--~--~~~~g~~i~~~~l 74 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMISAKLPKAIKFGKIAPIDNS--LEGQPGSEITVPKYKYIGDA--Q--DVAEGAAIDYSAL 74 (278) T ss_pred CCCcceehhheecHHHHHHHHHHHHHHhhhhcccceeccc--ccCCCCCEEEEeeeccCCcc--e--eecCCCcCccccc Confidence 998 3599999999999999999999999988854 66889999999987654221 1 2345667899999 Q ss_pred ccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCC---cchhHHHHHH Q lcl|NC_021299. 75 TEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVD---EDEIWNGVVS 151 (387) Q Consensus 75 ~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~---~~~~~~~i~~ 151 (387) +.++.+++|++ .+++|.++|++..+...|++.+.++|+++++++++|++++..+.++........+ ....|+.+.+ T Consensus 75 t~~~~~~~i~~-~~~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~~~~~~~t~~~~~~~~~~~~d 153 (278) T protein:vir:80 75 ETESVKHGIKK-AGKGVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTLEVKGAINIGLIDKIENTFTD 153 (278) T ss_pred ccceeeEeeeh-hhccccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhhhHHHHHHH Confidence 99999999966 4679999999999999999999999999999999999999999887665544333 3356889999 Q ss_pred HHHHHhhccCCcCCcEEEEchHHHHHHhccc--chhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeeecccc Q lcl|NC_021299. 152 NRRWLNEQKVPKDGRVLLVGSAVEEALLLDD--RFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTA 229 (387) Q Consensus 152 a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~--~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a 229 (387) +..+|+++++|. .++++++|+++..|+++. +|.+....++ ..+++|.+|++.||+|+.++.+|.+..+.++.++ T Consensus 154 a~~~l~~~~~~~-~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~---~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gA 229 (278) T protein:vir:80 154 APDAIEDESITT-TGVLFLNYKDTAKLREEAAGSWTKASQLGD---DLLVKGAFGELLGWEIVRTKKLADGNALAVKAGA 229 (278) T ss_pred HHHhhcccCCCc-ccEEEECHHHHHHHHhhhhhhccccccccc---cceeeccceeecceeEEEcCCCCcceEEEEeccc Confidence 999999999995 578999999999998874 5665555543 4678999999999999999999999988888887 Q ss_pred ccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccc Q lcl|NC_021299. 230 YAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRF 298 (387) Q Consensus 230 ~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~ 298 (387) +.+........+ .+.+.....+.......++.....+............ T Consensus 230 i~~~~~~~~~vE--------------------~~Rd~~~~~d~i~~~~~yg~~v~~~~~~v~it~~a~~ 278 (278) T protein:vir:80 230 LKTFLKRNLLAE--------------------SGRDMDHKLTKFNADQHYAVALVDETKAVKVVPVAGN 278 (278) T ss_pred eeeeecCCcccc--------------------cccchhhccceeeeeeEEEEEEEcCcceEEEeeccCC Confidence 765432211111 1111111111111222222222211111111100000 No 17 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=100.00 E-value=2.9e-33 Score=198.94 Aligned_cols=266 Identities=17% Similarity=0.155 Sum_probs=194.0 Q ss_pred Ccccc------ccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccccc Q lcl|NC_021299. 1 MANAF------IKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDL 74 (387) Q Consensus 1 Ma~~~------~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (387) |||.. |+||+|+++++++|++.++|.+++.+|++ +.+++|++|+||.+... .+.+ ....+..++++++ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~--l~g~~G~tv~iP~~~~~--g~a~--~~~~g~~i~~~~l 74 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDST--LQGQPGDTLTFPAFVYS--GDAQ--VVAEGEKIPTDIL 74 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceeccc--ccCCCCCEEEEeeecCC--Cccc--cccCCCccccccc Confidence 99976 99999999999999999999999999864 66889999999987643 1221 2345677899999 Q ss_pred ccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHHHHH Q lcl|NC_021299. 75 TEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVSNRR 154 (387) Q Consensus 75 ~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~ 154 (387) +.++.+++|++ .+++|.++|++..+...|++.+.++|+.+++++++|++++..+.++..... .....|+.|++|.. T Consensus 75 t~~~~~~~i~~-~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~~---~~~~~~d~i~dA~~ 150 (274) T protein:vir:94 75 ETKKREAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVN---ADITKLNGLQSAID 150 (274) T ss_pred ccceeEEEeee-ecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccc---ccccCHHHHHHHHH Confidence 99999999966 678999999999999999999999999999999999999999887765542 23456899999999 Q ss_pred HHhhccCCcCCcEEEEchHHHHHHhccc--chhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeeeccccccc Q lcl|NC_021299. 155 WLNEQKVPKDGRVLLVGSAVEEALLLDD--RFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTAYAM 232 (387) Q Consensus 155 ~l~~~~vp~~~r~~v~~~~~~~~l~~~~--~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a~~~ 232 (387) +|++++. ..|+++++|+.+..|+++. +|.+....++ ..+++|.+|++.|++|+.++.+|.+..+.++..++.+ T Consensus 151 ~l~d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~---~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gA~~~ 225 (274) T protein:vir:94 151 KFNDEDL--EPMVLFVNPLDAGKLRGDASTNFTRATELGD---DIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKL 225 (274) T ss_pred HhhccCC--CceEEEeCHHHHHHHHhhhhhhccccCcccc---cceeccccceecCeeEEEcCCCCcceEEEEeCcceEe Confidence 9999875 6799999999999999875 6666655543 4678999999999999999999999888888777665 Q ss_pred cccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeeeecc Q lcl|NC_021299. 233 LTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKATDAE 312 (387) Q Consensus 233 ~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~~~~ 312 (387) ....... ...+.+.....+.......++............ ....-+ T Consensus 226 ~~~~~~~--------------------vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~--------------t~~~~~ 271 (274) T protein:vir:94 226 ILKRDFF--------------------LEVARDASTKTTALYSDKHYVAYLYDESKAVKI--------------TKGSGS 271 (274) T ss_pred eecCCce--------------------eccccchhhcccEEEEEEEEEEEEEcCCceEEE--------------ecCccc Confidence 4332111 111112222222222222333222221111100 000001 Q ss_pred cccccccc Q lcl|NC_021299. 313 IEGETVKA 320 (387) Q Consensus 313 ~~~~~~~~ 320 (387) +.+ T Consensus 272 -----~~~ 274 (274) T protein:vir:94 272 -----LEM 274 (274) T ss_pred -----ccC Confidence 111 No 18 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=100.00 E-value=2.9e-33 Score=198.94 Aligned_cols=266 Identities=17% Similarity=0.155 Sum_probs=194.0 Q ss_pred Ccccc------ccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccccc Q lcl|NC_021299. 1 MANAF------IKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDL 74 (387) Q Consensus 1 Ma~~~------~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (387) |||.. |+||+|+++++++|++.++|.+++.+|++ +.+++|++|+||.+... .+.+ ....+..++++++ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~--l~g~~G~tv~iP~~~~~--g~a~--~~~~g~~i~~~~l 74 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDST--LQGQPGDTLTFPAFVYS--GDAQ--VVAEGEKIPTDIL 74 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceeccc--ccCCCCCEEEEeeecCC--Cccc--cccCCCccccccc Confidence 99976 99999999999999999999999999864 66889999999987643 1221 2345677899999 Q ss_pred ccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHHHHH Q lcl|NC_021299. 75 TEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVSNRR 154 (387) Q Consensus 75 ~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~ 154 (387) +.++.+++|++ .+++|.++|++..+...|++.+.++|+.+++++++|++++..+.++..... .....|+.|++|.. T Consensus 75 t~~~~~~~i~~-~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~~---~~~~~~d~i~dA~~ 150 (274) T protein:vir:97 75 ETKKREAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVN---ADITKLNGLQSAID 150 (274) T ss_pred ccceeEEEeee-ecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccc---ccccCHHHHHHHHH Confidence 99999999966 678999999999999999999999999999999999999999887765542 23456899999999 Q ss_pred HHhhccCCcCCcEEEEchHHHHHHhccc--chhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeeeccccccc Q lcl|NC_021299. 155 WLNEQKVPKDGRVLLVGSAVEEALLLDD--RFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTAYAM 232 (387) Q Consensus 155 ~l~~~~vp~~~r~~v~~~~~~~~l~~~~--~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a~~~ 232 (387) +|++++. ..|+++++|+.+..|+++. +|.+....++ ..+++|.+|++.|++|+.++.+|.+..+.++..++.+ T Consensus 151 ~l~d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~---~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gA~~~ 225 (274) T protein:vir:97 151 KFNDEDL--EPMVLFVNPLDAGKLRGDASTNFTRATELGD---DIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKL 225 (274) T ss_pred HhhccCC--CceEEEeCHHHHHHHHhhhhhhccccCcccc---cceeccccceecCeeEEEcCCCCcceEEEEeCcceEe Confidence 9999875 6799999999999999875 6666655543 4678999999999999999999999888888777665 Q ss_pred cccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeeeecc Q lcl|NC_021299. 233 LTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKATDAE 312 (387) Q Consensus 233 ~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~~~~ 312 (387) ....... ...+.+.....+.......++............ ....-+ T Consensus 226 ~~~~~~~--------------------vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~--------------t~~~~~ 271 (274) T protein:vir:97 226 ILKRDFF--------------------LEVARDASTKTTALYSDKHYVAYLYDESKAVKI--------------TKGSGS 271 (274) T ss_pred eecCCce--------------------eccccchhhcccEEEEEEEEEEEEEcCCceEEE--------------ecCccc Confidence 4332111 111112222222222222333222221111100 000001 Q ss_pred cccccccc Q lcl|NC_021299. 313 IEGETVKA 320 (387) Q Consensus 313 ~~~~~~~~ 320 (387) +.+ T Consensus 272 -----~~~ 274 (274) T protein:vir:97 272 -----LEM 274 (274) T ss_pred -----ccC Confidence 111 No 19 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=100.00 E-value=3.8e-33 Score=198.29 Aligned_cols=265 Identities=17% Similarity=0.151 Sum_probs=194.4 Q ss_pred Ccccc------ccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccc-eeeceecccccccccccccc Q lcl|NC_021299. 1 MANAF------IKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPT-IAHTRGLRATGADRNMVASD 73 (387) Q Consensus 1 Ma~~~------~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~-~~~~~~~~~~~~~~~~~~~~ 73 (387) |||+. |+||+|+++++++|++.++|.+++.+|++ +.+++|++|+||.+... .+.+ ...+..+++++ T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~--l~g~~G~tv~ip~~~~~g~~~~-----~~eg~~i~~~~ 73 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDST--LQGQPGDTLTFPAFVYSGDAQV-----VAEGEKIPTDI 73 (274) T ss_pred CCccceehhheechHHHHHHHHHHHHhhhhhccccccccc--ccCCCCCEEEEEeeccCCCccc-----ccCCCcccccc Confidence 99986 89999999999999999999999998864 66889999999987643 2322 34567789999 Q ss_pred cccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHHHH Q lcl|NC_021299. 74 LTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVSNR 153 (387) Q Consensus 74 ~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~ 153 (387) ++.++.++++++ .++.|.++|++..+...|++.+..+++.+++++++|++++..+.++.... ......++.|++|. T Consensus 74 it~~~~~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~~~---~~~~~~~d~i~dA~ 149 (274) T protein:vir:93 74 LETKKREAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTV---NADITKLNGLQSAI 149 (274) T ss_pred cccceeEEEeee-ecccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccc---cccccCHHHHHHHH Confidence 999999999955 67999999999999999999999999999999999999999887776543 23445789999999 Q ss_pred HHHhhccCCcCCcEEEEchHHHHHHhccc--chhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeeecccccc Q lcl|NC_021299. 154 RWLNEQKVPKDGRVLLVGSAVEEALLLDD--RFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTAYA 231 (387) Q Consensus 154 ~~l~~~~vp~~~r~~v~~~~~~~~l~~~~--~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a~~ 231 (387) .+|++++. +.|+++++|+.+..|+++. +|.+....+ ...+++|.+|++.|++|+.++.+|.+..+.++..++. T Consensus 150 ~~l~d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g---~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gai~ 224 (274) T protein:vir:93 150 DKFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATELG---DDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVK 224 (274) T ss_pred HHhhhccC--CccEEEeCHHHHHHHHhhhhhccccccccc---ccceeecccceecCeeEEEcCCCCcceEEEEeCCeEE Confidence 99999875 6799999999999999875 555555544 3467999999999999999999999998888888776 Q ss_pred ccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeeeec Q lcl|NC_021299. 232 MLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKATDA 311 (387) Q Consensus 232 ~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~~~ 311 (387) +........ ..+.+.....+.......++.......... .+....- T Consensus 225 ~~~~~~~~v--------------------E~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v--------------~~t~~~~ 270 (274) T protein:vir:93 225 LILKRDFFL--------------------EVARDASTKTTALYSDKHYVAYLYDESKAV--------------KITKGSG 270 (274) T ss_pred EEecCCccc--------------------ccccchhhcccEEEEEEEEEEEEEcCCceE--------------EEeeCcc Confidence 654321111 111111112222222223332222221110 0111111 Q ss_pred ccccccccc Q lcl|NC_021299. 312 EIEGETVKA 320 (387) Q Consensus 312 ~~~~~~~~~ 320 (387) | +.+ T Consensus 271 s-----~~~ 274 (274) T protein:vir:93 271 S-----LEM 274 (274) T ss_pred c-----cCC Confidence 1 111 No 20 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=100.00 E-value=1.3e-32 Score=195.34 Aligned_cols=264 Identities=17% Similarity=0.171 Sum_probs=193.0 Q ss_pred Cccc------cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccc-eeeceecccccccccccccc Q lcl|NC_021299. 1 MANA------FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPT-IAHTRGLRATGADRNMVASD 73 (387) Q Consensus 1 Ma~~------~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~-~~~~~~~~~~~~~~~~~~~~ 73 (387) |||. +++||+|+++++++|++.++|.+++++|+ ++.+++|++|+||.+... ...+ ...+..+++++ T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~--~l~g~~G~tv~ip~~~~~g~~~~-----~~~g~~i~~~~ 73 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDS--TLVGQPGDTLTFPAFTYSGDAQV-----IAEGEKIPVDQ 73 (274) T ss_pred CCccccchhhhhhhHHHHHHHHHHHHhhhhhcccccccc--cccCCCCCEEEEEeeccCCCccc-----cCCCCcCchhh Confidence 9974 48899999999999999999999998885 467889999999987632 3332 34566789999 Q ss_pred cccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHHHH Q lcl|NC_021299. 74 LTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVSNR 153 (387) Q Consensus 74 ~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~ 153 (387) ++.++.+++|++ .+++|.++|++..+...|++.+..+|+.+++++++|++++..+.+++... ......|+.|++|. T Consensus 74 it~~~~~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~~~~---~~~~~~~d~i~dA~ 149 (274) T protein:vir:96 74 IGTSKREAKVRK-IGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTV---EADITKLDGLQTAI 149 (274) T ss_pred cccceeEEEEEe-eeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCc---CcccccHHHHHHHH Confidence 999999999966 68999999999999999999999999999999999999999887765433 23345689999999 Q ss_pred HHHhhccCCcCCcEEEEchHHHHHHhccc--chhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeeecccccc Q lcl|NC_021299. 154 RWLNEQKVPKDGRVLLVGSAVEEALLLDD--RFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTAYA 231 (387) Q Consensus 154 ~~l~~~~vp~~~r~~v~~~~~~~~l~~~~--~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a~~ 231 (387) .+|++++. ..|+++++|+.+..|+++. +|......| ...+++|.+|++.|++|+.++.+|.+..+.++.+++. T Consensus 150 ~~l~d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g---~~~~~~g~ig~~~G~~Vi~s~~~p~~t~~l~~~gA~~ 224 (274) T protein:vir:96 150 DKFNDEDL--EPMVLFVNPLDAGGLRTSASDNFTRPTQLG---DNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGAVK 224 (274) T ss_pred HHhcccCC--CceEEEeCHHHHHHHHhccccccccccccc---ccceeecccceecCeeEEEcCCCCcceEEEEeCccee Confidence 99999875 6799999999999999874 566655544 3467999999999999999999999998888888776 Q ss_pred ccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeecccee-ccccc Q lcl|NC_021299. 232 MLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDE-PRFVR 300 (387) Q Consensus 232 ~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~-~~~v~ 300 (387) +........+ .+.+.....+.......+|.....+......... ...+- T Consensus 225 ~~~~~~~~vE--------------------~~Rd~~~~~d~i~~~~~yg~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:96 225 LITKRDFFLE--------------------KDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) T ss_pred eeecCCcccc--------------------cccchhhcccEEEEeeEEEEEEEcCccEEEEEcCcccccC Confidence 6443221111 1111111122222222233222222111111000 00000 No 21 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=100.00 E-value=1.1e-32 Score=195.82 Aligned_cols=266 Identities=14% Similarity=0.142 Sum_probs=193.1 Q ss_pred Cccc-----cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccc Q lcl|NC_021299. 1 MANA-----FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLT 75 (387) Q Consensus 1 Ma~~-----~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 75 (387) |+|+ +++||+|+++++++|++.++|.+++..| +++.+++|++|+||.+... .+.+ ....+..+++++++ T Consensus 3 ~~~~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~--~~l~g~~G~tv~iP~~~~i--g~a~--~~~~g~~i~~~~lt 76 (275) T protein:vir:96 3 LENMTKLANMVNPEVLAPMMQAELDKKLKFAQFADID--NTLVGQPGNTITFPAFVYS--GDAK--VVPEGEEIPIDLIE 76 (275) T ss_pred CcccchhhhhhchHHHHHHHHHHHHHhhhhcccceec--ccccCCCCCEEEeeeeccC--Cccc--cccCCCCcchhhcc Confidence 5553 5889999999999999999999998777 4577889999999987653 2222 23456778999999 Q ss_pred cceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHHHHHH Q lcl|NC_021299. 76 EVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVSNRRW 155 (387) Q Consensus 76 ~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~ 155 (387) .++..++| ++.+++|.++|++..+...|++.+.++|+.+++|+++|++++..+.++.... ......|+.|++|..+ T Consensus 77 ~~~~~~~i-~~~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~~~~---~~~~~~~d~i~dA~~~ 152 (275) T protein:vir:96 77 TKKRQATI-RKIGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGATLKV---EADITKLAGLQTAIDK 152 (275) T ss_pred cceeeEEe-ehhcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccc---cccccCHHHHHHHHHH Confidence 99999999 5579999999999999999999999999999999999999999888765543 2344679999999999 Q ss_pred HhhccCCcCCcEEEEchHHHHHHhccc--chhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeeecccccccc Q lcl|NC_021299. 156 LNEQKVPKDGRVLLVGSAVEEALLLDD--RFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTAYAML 233 (387) Q Consensus 156 l~~~~vp~~~r~~v~~~~~~~~l~~~~--~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a~~~~ 233 (387) |.+++. ..|+++++|+.+..|+++. +|.+.+..++ ..+++|.+|++.|++|+.++.+|.+..+.++..++.+. T Consensus 153 lgd~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~---~~~~~G~ig~~~G~~Vi~s~~~p~~t~~i~~~gA~~~~ 227 (275) T protein:vir:96 153 FNDEDL--EPMVLFVNPLDAGKLRASATDNFTRATLLGD---NVIVKGAFGEALGAIIVRSNKIKEGEAILAKRGAVKLI 227 (275) T ss_pred hccccC--CccEEEeCHHHHHHHHhcccccccccccccc---cceeccccceecCeeEEEeCCCCcceEEEEeccceeee Confidence 988763 6899999999999998874 6776666553 46799999999999999999999998888887776654 Q ss_pred ccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeeeeccc Q lcl|NC_021299. 234 TRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKATDAEI 313 (387) Q Consensus 234 ~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~~~~~ 313 (387) ...... ...+.+.....+.......++.......... .++. T Consensus 228 ~~~~~~--------------------vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv------------~~t~------- 268 (275) T protein:vir:96 228 TKRDFF--------------------LETERHASHKSTALFSDKHYVAYLYDESKVV------------KITK------- 268 (275) T ss_pred ecCCcc--------------------cccccchhhcCcEEEEeEEEEEEEEcCccEE------------EEEe------- Confidence 332111 1111122222222222333332222221111 0111 Q ss_pred ccccccc Q lcl|NC_021299. 314 EGETVKA 320 (387) Q Consensus 314 ~~~~~~~ 320 (387) .+..++. T Consensus 269 ~~~~~~~ 275 (275) T protein:vir:96 269 SASGLGV 275 (275) T ss_pred cccccCC Confidence 1111111 No 22 >protein:vir:100939 Length: 430 # NCBI annotation: Gp5 # Family: family:all:1412 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006408;genbank:gi:46358700;genbank:GeneID:2777089 Probab=99.96 E-value=1.5e-31 Score=189.54 Aligned_cols=287 Identities=14% Similarity=0.153 Sum_probs=177.6 Q ss_pred CccccccH-HHHHHHHHHHHHhhccccce--eeecccccccccCCCEEEEEecccceeeceecccccccccccccccccc Q lcl|NC_021299. 1 MANAFIKP-PVIIASILGQLQHELVLPNF--VFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEV 77 (387) Q Consensus 1 Ma~~~~~p-e~~~~~~~~~l~~~~~~~~~--~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (387) |||++.+- +++++|+++.|+++++|+.. ++|+|+.+| .+.||||.+|.|......+.. ......+++.+. T Consensus 1 MAn~l~~~~~ii~~eal~~l~n~~v~a~~~~~~r~~d~~~-~r~Gdti~~p~~~~~~~~~G~------~~t~~~~~i~e~ 73 (430) T protein:vir:10 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASM-QRSSNTIWMPVEQESPTQEGW------DLTDKATGLLEL 73 (430) T ss_pred CccchhhHHHHHHHHHHHHHhhhhhhhhhhcccCCchhhh-hcccceEEeccccccccccCc------ccCCCCCccccc Confidence 99999884 89999999999999999996 569998887 488999999999877666522 112234578899 Q ss_pred eEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-----cccCCcchhHHHHHHH Q lcl|NC_021299. 78 TVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEK-----VSLVDEDEIWNGVVSN 152 (387) Q Consensus 78 ~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~-----~~~~~~~~~~~~i~~a 152 (387) ++++++++++...|.++++|+ ...+++++++++++++||++||.++++++...+..+ ++.......|+++..+ T Consensus 74 ~v~~~v~~~k~V~~~~~~kel--~~~~~~~~~i~~Am~~LA~~Vd~dl~~~~~~~~~~v~~~~~~t~~~~~~~~~~~A~a 151 (430) T protein:vir:10 74 NVAVNMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADA 151 (430) T ss_pred eEEEEEeeeccceEEechhHh--cChhHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccccccccCCCcCCcchhhHHHH Confidence 999999999999999999984 566777899999999999999999999876544332 3444555678999999 Q ss_pred HHHHhhccCCcC-CcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEE-eeccee-eeeeccceeeeeeec--- Q lcl|NC_021299. 153 RRWLNEQKVPKD-GRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGR-LAQYDV-VTVDTLPHGDAYLSH--- 226 (387) Q Consensus 153 ~~~l~~~~vp~~-~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~-~~g~~v-~~s~~~~~~~~~~~~--- 226 (387) ++.|++.++|.+ +|.+|++|+.+..+.. .+.+....+......+|+|.+++ +.||++ +.++.+|........ T Consensus 152 ~~~L~~~~vP~~~~R~~vldp~~~~~l~~--~l~~l~~~~~~~~~A~r~g~i~~~~~Gfd~~~~~~~~~~~t~g~~t~~t 229 (430) T protein:vir:10 152 EELMFSRELNRDMGTSYFFNPQDYKKAGY--DLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGIT 229 (430) T ss_pred HHHHHHhcCCCCCCcEEEeChHHHHHHHh--hhccccccccchhHHHhhccccccchhhhhhhhcCCcccccCccCcCce Confidence 999999999996 7999999999998863 34454555555556789999997 889975 677776653211000 Q ss_pred --------cccc------------------------cccccccccccC---------------ceeeeee---------- Q lcl|NC_021299. 227 --------PTAY------------------------AMLTRSPGRPMT---------------NTVATST---------- 249 (387) Q Consensus 227 --------~~a~------------------------~~~~~~~~~~~~---------------~t~~~~~---------- 249 (387) ..++ ++..|..-...+ ..+.... T Consensus 230 v~gA~~~~~~~~~v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftiaGV~~v~~~tkq~~~~l~~F~Vt~~~~atsv~I~ 309 (430) T protein:vir:10 230 VSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEIT 309 (430) T ss_pred eccccccccccceecccccccccccccceeeeecccceecccEEEecceeeeccccccccCCccEEEEEEecCCceeEEe Confidence 0000 000000000000 0000000 Q ss_pred --e------------c----------------cc--cc--ceeeeeee-------------------------------- Q lcl|NC_021299. 250 --V------------A----------------TE--NG--VQLRWLGD-------------------------------- 263 (387) Q Consensus 250 --~------------~----------------~~--~~--~~~~~~~~-------------------------------- 263 (387) . . .. .+ ..+.|..+ T Consensus 310 paii~~~~~~~~~~~~~y~nVsaspa~~aavTvv~~a~~~~Nl~fhr~A~aLa~~pL~~~~~~~~~~~~~~~~~~~~Gls 389 (430) T protein:vir:10 310 PKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLN 389 (430) T ss_pred ccccccccccccccccccceeccccccCceeEEeccCCcccceeEcccceEEEEecccCCCCHHHhhhhheeccccceEE Confidence 0 0 00 00 01112211 Q ss_pred ------eeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeeeeccccccccccccce Q lcl|NC_021299. 264 ------YDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKATDAEIEGETVKAGEKL 324 (387) Q Consensus 264 ------~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~~~~~~~~~~~~~~~~ 324 (387) ||.........++..+|.....+...+.. + .|.+. T Consensus 390 irv~~~yd~~~~~~~~r~DvLyG~~~v~Pe~a~v~--------------------l------~g~~~ 430 (430) T protein:vir:10 390 GIFATQGDISTLSGLCRIALWYGVNATRPEAIGVG--------------------L------PGQTA 430 (430) T ss_pred EEEEEecccccCceEEEEeeeccceecCcceEEEE--------------------c------CCCCC Confidence 11111111111111111111111111000 0 01111 No 23 >protein:vir:9265 Length: 430 # NCBI annotation: 5 # Family: family:all:1412 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720329;genbank:gi:24371587;genbank:GeneID:955820 Probab=99.96 E-value=1.5e-31 Score=189.54 Aligned_cols=287 Identities=14% Similarity=0.153 Sum_probs=177.6 Q ss_pred CccccccH-HHHHHHHHHHHHhhccccce--eeecccccccccCCCEEEEEecccceeeceecccccccccccccccccc Q lcl|NC_021299. 1 MANAFIKP-PVIIASILGQLQHELVLPNF--VFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEV 77 (387) Q Consensus 1 Ma~~~~~p-e~~~~~~~~~l~~~~~~~~~--~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (387) |||++.+- +++++|+++.|+++++|+.. ++|+|+.+| .+.||||.+|.|......+.. ......+++.+. T Consensus 1 MAn~l~~~~~ii~~eal~~l~n~~v~a~~~~~~r~~d~~~-~r~Gdti~~p~~~~~~~~~G~------~~t~~~~~i~e~ 73 (430) T protein:vir:92 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASM-QRSSNTIWMPVEQESPTQEGW------DLTDKATGLLEL 73 (430) T ss_pred CccchhhHHHHHHHHHHHHHhhhhhhhhhhcccCCchhhh-hcccceEEeccccccccccCc------ccCCCCCccccc Confidence 99999884 89999999999999999996 569998887 488999999999877666522 112234578899 Q ss_pred eEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-----cccCCcchhHHHHHHH Q lcl|NC_021299. 78 TVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEK-----VSLVDEDEIWNGVVSN 152 (387) Q Consensus 78 ~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~-----~~~~~~~~~~~~i~~a 152 (387) ++++++++++...|.++++|+ ...+++++++++++++||++||.++++++...+..+ ++.......|+++..+ T Consensus 74 ~v~~~v~~~k~V~~~~~~kel--~~~~~~~~~i~~Am~~LA~~Vd~dl~~~~~~~~~~v~~~~~~t~~~~~~~~~~~A~a 151 (430) T protein:vir:92 74 NVAVNMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADA 151 (430) T ss_pred eEEEEEeeeccceEEechhHh--cChhHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccccccccCCCcCCcchhhHHHH Confidence 999999999999999999984 566777899999999999999999999876544332 3444555678999999 Q ss_pred HHHHhhccCCcC-CcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEE-eeccee-eeeeccceeeeeeec--- Q lcl|NC_021299. 153 RRWLNEQKVPKD-GRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGR-LAQYDV-VTVDTLPHGDAYLSH--- 226 (387) Q Consensus 153 ~~~l~~~~vp~~-~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~-~~g~~v-~~s~~~~~~~~~~~~--- 226 (387) ++.|++.++|.+ +|.+|++|+.+..+.. .+.+....+......+|+|.+++ +.||++ +.++.+|........ T Consensus 152 ~~~L~~~~vP~~~~R~~vldp~~~~~l~~--~l~~l~~~~~~~~~A~r~g~i~~~~~Gfd~~~~~~~~~~~t~g~~t~~t 229 (430) T protein:vir:92 152 EELMFSRELNRDMGTSYFFNPQDYKKAGY--DLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGIT 229 (430) T ss_pred HHHHHHhcCCCCCCcEEEeChHHHHHHHh--hhccccccccchhHHHhhccccccchhhhhhhhcCCcccccCccCcCce Confidence 999999999996 7999999999998863 34454555555556789999997 889975 677776653211000 Q ss_pred --------cccc------------------------cccccccccccC---------------ceeeeee---------- Q lcl|NC_021299. 227 --------PTAY------------------------AMLTRSPGRPMT---------------NTVATST---------- 249 (387) Q Consensus 227 --------~~a~------------------------~~~~~~~~~~~~---------------~t~~~~~---------- 249 (387) ..++ ++..|..-...+ ..+.... T Consensus 230 v~gA~~~~~~~~~v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftiaGV~~v~~~tkq~~~~l~~F~Vt~~~~atsv~I~ 309 (430) T protein:vir:92 230 VSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEIT 309 (430) T ss_pred eccccccccccceecccccccccccccceeeeecccceecccEEEecceeeeccccccccCCccEEEEEEecCCceeEEe Confidence 0000 000000000000 0000000 Q ss_pred --e------------c----------------cc--cc--ceeeeeee-------------------------------- Q lcl|NC_021299. 250 --V------------A----------------TE--NG--VQLRWLGD-------------------------------- 263 (387) Q Consensus 250 --~------------~----------------~~--~~--~~~~~~~~-------------------------------- 263 (387) . . .. .+ ..+.|..+ T Consensus 310 paii~~~~~~~~~~~~~y~nVsaspa~~aavTvv~~a~~~~Nl~fhr~A~aLa~~pL~~~~~~~~~~~~~~~~~~~~Gls 389 (430) T protein:vir:92 310 PKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLN 389 (430) T ss_pred ccccccccccccccccccceeccccccCceeEEeccCCcccceeEcccceEEEEecccCCCCHHHhhhhheeccccceEE Confidence 0 0 00 00 01112211 Q ss_pred ------eeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeeeeccccccccccccce Q lcl|NC_021299. 264 ------YDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKATDAEIEGETVKAGEKL 324 (387) Q Consensus 264 ------~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~~~~~~~~~~~~~~~~ 324 (387) ||.........++..+|.....+...+.. + .|.+. T Consensus 390 irv~~~yd~~~~~~~~r~DvLyG~~~v~Pe~a~v~--------------------l------~g~~~ 430 (430) T protein:vir:92 390 GIFATQGDISTLSGLCRIALWYGVNATRPEAIGVG--------------------L------PGQTA 430 (430) T ss_pred EEEEEecccccCceEEEEeeeccceecCcceEEEE--------------------c------CCCCC Confidence 11111111111111111111111111000 0 01111 No 24 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=99.96 E-value=4.1e-31 Score=187.16 Aligned_cols=262 Identities=13% Similarity=0.124 Sum_probs=186.6 Q ss_pred Cccc------cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccccc Q lcl|NC_021299. 1 MANA------FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDL 74 (387) Q Consensus 1 Ma~~------~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (387) |||+ +++||+|+++++++|.+.++|.+++.+|+ ++.+++|+||+||.+... .+.+ ...++..++++++ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~--~l~g~~G~ti~iP~~~~~--gda~--~~~eg~~i~~~~l 74 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDT--TLQGQPGNTLKFPAFTYI--GDAA--DVAEGGEISLDKI 74 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhhhhhcccccccc--ccccCCCCEEEEeeeccC--cccc--ccCCCCccChhhc Confidence 9974 48899999999999999999999998874 467889999999987654 2222 2345677899999 Q ss_pred ccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHHHHH Q lcl|NC_021299. 75 TEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVSNRR 154 (387) Q Consensus 75 ~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~ 154 (387) +.++.++++.+ ..++|.++|++..+...|++.++.+|+++++|+++|++++..+.++.... +....++.|.+|+. T Consensus 75 t~~~~~~~i~~-~~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~~~~~----~~~~~~d~i~~A~~ 149 (272) T protein:vir:36 75 GTTTKSVTIKK-AAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTTSQTV----STKANVDGVQAALD 149 (272) T ss_pred CCcceeEeeeh-hhccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccc----cccccHHHHHHHHH Confidence 99999999955 67899999999999999999999999999999999999999887765543 44567899999999 Q ss_pred HHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeee----eeccccc Q lcl|NC_021299. 155 WLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAY----LSHPTAY 230 (387) Q Consensus 155 ~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~----~~~~~a~ 230 (387) .|.+++.+ .|+++++|+.+..|+++..|...... .+...+++|.+|++.|++|+.++.+|.+... .+...++ T Consensus 150 ~lgd~~~~--~~~ivv~p~~~~~L~k~~~~~~~~~~--~~~~~~~~G~ig~~~G~~Vv~s~~~p~~~~~~~~~~~~~gA~ 225 (272) T protein:vir:36 150 IFNDEDAQ--AYVLIVNPKDAAKIRKDANAKNIGSE--VGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPAL 225 (272) T ss_pred HhhhcCCC--ceEEEEcHHHHHHHhccccccccccc--ccccceeeeccceecCeeEEEeCCCCCCceeEEEEEecccce Confidence 99999875 68999999999999998877654332 2345789999999999999999999976553 2223333 Q ss_pred cccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceecc Q lcl|NC_021299. 231 AMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPR 297 (387) Q Consensus 231 ~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 297 (387) .+.... ......+.+.....+.......++.....+.........+- T Consensus 226 ~~~~~~--------------------~~~vE~~R~~~~~~d~i~~~~~y~~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 226 KLVLKR--------------------GVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 272 (272) T ss_pred eeeecC--------------------CcccccccchhhcCcEEEEEEEEEEEEEcCccEEEEeecCC Confidence 222111 11111112222222222222333322222221111110000 No 25 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=99.96 E-value=8.4e-32 Score=190.94 Aligned_cols=282 Identities=11% Similarity=0.147 Sum_probs=179.1 Q ss_pred Cc----------------c-ccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccc Q lcl|NC_021299. 1 MA----------------N-AFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRAT 63 (387) Q Consensus 1 Ma----------------~-~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~ 63 (387) |+ + ++|+ |+|+.++++.|.+..+|.+++++- ++ +.|++++|+..+...+.++. T Consensus 7 ~~~~~~~~~~~~~~~~d~~~al~l-e~~~geV~~~f~~~s~~~~~~~~r---~i--~~G~tv~i~~ig~~~~~~~~---- 76 (332) T protein:vir:78 7 FSLPNQANGGARNADYDVRYATAL-KLFSGEVFTAFNNASIFKGLVRSY---DL--RGGKSKQFMFTGKLSAGYHT---- 76 (332) T ss_pred ccCCccccCCccccccccchhhhh-hhhhhhHHHHHHHHhhhhhccccc---cc--cccceEEEEeccceeEeeec---- Confidence 21 1 3566 999999999999999999998732 34 35999999999999888775 Q ss_pred ccccccccc-ccccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccc------- Q lcl|NC_021299. 64 GADRNMVAS-DLTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYE------- 135 (387) Q Consensus 64 ~~~~~~~~~-~~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~------- 135 (387) .+..+..+ ++.+++++|+||+.+|+.+.++|.|+.+...|++.++.++++++||+.+|+.++.++..+... T Consensus 77 -~g~~l~~~~~~~~~~~~l~ID~~ky~~~~VddiD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~~~~~~ 155 (332) T protein:vir:78 77 -PGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGE 155 (332) T ss_pred -CCCCCCCCCCCCCceEEEEEehhhhhHHHHHhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCccccc Confidence 34455554 578899999999999999999999999999999999999999999999999999877643221 Q ss_pred ---------ccccCCcchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhc--ccchhhhhhcccccceeeeeeE-E Q lcl|NC_021299. 136 ---------KVSLVDEDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLL--DDRFIRYDSAGEAGASRLQTAR-I 203 (387) Q Consensus 136 ---------~~~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~--~~~~~~~~~~g~~~~~~~~~g~-i 203 (387) .+...++...|+.|++++++|++++||.++||+|++|++|..|++ +++|.+....++++ .+++|. + T Consensus 156 ~g~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~d~~~~n~~~~~~~~--~~~~g~~i 233 (332) T protein:vir:78 156 PGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQG--DMNSGKGL 233 (332) T ss_pred ccccccccCCccccCHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHHhhcCceeeeeecccccc--ceecceee Confidence 111234556789999999999999999999999999999999997 77787766665543 456664 8 Q ss_pred EEeecceeeeeeccceeeeeeeccccccccccccccccCc----eeeeeeecccccc--eeeeee-eeeeccceeeeeee Q lcl|NC_021299. 204 GRLAQYDVVTVDTLPHGDAYLSHPTAYAMLTRSPGRPMTN----TVATSTVATENGV--QLRWLG-DYDATSTTERSIVD 276 (387) Q Consensus 204 g~~~g~~v~~s~~~~~~~~~~~~~~a~~~~~~~~~~~~~~----t~~~~~~~~~~~~--~~~~~~-~~d~~~~~~~~~~~ 276 (387) +++.||+|++++++|..........+..-.........+. -++-.+....... ....+. .++.....+..... T Consensus 234 ~~i~G~~V~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~~~~~~~~~d~i~~~ 313 (332) T protein:vir:78 234 YSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGK 313 (332) T ss_pred eEEeeeEEEecCccccCcccccccccccccccccccccccceEEeecccceeeeeeeccchhhhhcccchhhhHhhhhhh Confidence 9999999999999986544333222111000000000000 0000000000000 000000 00000000110000 Q ss_pred eeeeeccccceeeecccee Q lcl|NC_021299. 277 TWIGVKAVLDPVTANLDDE 295 (387) Q Consensus 277 ~~~g~~~~~~~~~~~~~~~ 295 (387) ..+|.....+......... T Consensus 314 ~~~G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 314 LAMGCGSLRTSVAGSFQAA 332 (332) T ss_pred hhhcCceecccceEEEeeC Confidence 1111111111100000000 No 26 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=99.96 E-value=6e-31 Score=186.28 Aligned_cols=268 Identities=16% Similarity=0.179 Sum_probs=193.9 Q ss_pred Ccccc------ccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccccc Q lcl|NC_021299. 1 MANAF------IKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDL 74 (387) Q Consensus 1 Ma~~~------~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (387) |||.. ++||+|++++++++++.++|.+++.+|. ++.+++|++|+||.+... .+.+ ...++..++++++ T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~--~l~g~~G~ti~iP~~~~i--gda~--~~~eg~~i~~~~l 74 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDS--TLVGQPGDTLTFPAFVYS--GDAT--VVPEGQKIPVDKI 74 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecc--cccCCCCCEEEeeeecCC--Cccc--cccCCCccCcccc Confidence 99754 8899999999999999999999998874 577889999999987654 2222 2345677899999 Q ss_pred ccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHHHHH Q lcl|NC_021299. 75 TEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVSNRR 154 (387) Q Consensus 75 ~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~ 154 (387) +.++..++| ++.+++|.++|++..+...|++.+.++|+.+++|+++|++++..+.++..... .....|+.|.+|.. T Consensus 75 t~~~~~a~i-~~~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~~~~~---~~~~t~d~i~~A~~ 150 (276) T protein:vir:10 75 ETNRREAKI-HKIGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTKLTVS---ADIGTLAGLEAAID 150 (276) T ss_pred ccceeeEEe-ehccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccc---ccccCHHHHHHHHH Confidence 999999999 45799999999999999999999999999999999999999998887655432 23456899999999 Q ss_pred HHhhccCCcCCcEEEEchHHHHHHhcc--cchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeeeccccccc Q lcl|NC_021299. 155 WLNEQKVPKDGRVLLVGSAVEEALLLD--DRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTAYAM 232 (387) Q Consensus 155 ~l~~~~vp~~~r~~v~~~~~~~~l~~~--~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a~~~ 232 (387) .|++++. +.++++++|+.+..|+++ .+|.+....+. ..+++|.+|.+.|++|+.++.+|.+..+.++..++.+ T Consensus 151 ~lgd~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~---~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gAi~~ 225 (276) T protein:vir:10 151 TFDDEDL--EPMVLFINPKDAGKLRSSASDNFTRATELGD---NIIVKGAFGEALGAVIVRSKKLDEGEAILAKRGAVKL 225 (276) T ss_pred HhccccC--cccEEEEcHHHHHHHHHhccccccccccccc---cceeccccceecceeEEEcCCCCcceEEEEeccceee Confidence 9998864 679999999999999875 57777666553 4678999999999999999999999888888777765 Q ss_pred cccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeeeecc Q lcl|NC_021299. 233 LTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKATDAE 312 (387) Q Consensus 233 ~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~~~~ 312 (387) ....... ...+.+.....+.......++................. T Consensus 226 ~~~~~~~--------------------vE~dRd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~--------------- 270 (276) T protein:vir:10 226 ITKRDFF--------------------LETDRDPSTKTTALYSDKHYVAYLYDESKAVKVTKGAG--------------- 270 (276) T ss_pred eecCCce--------------------eecccchhhcccEEEEeeEEEEEEEcCcceEEEecCCc--------------- Confidence 4332111 11122222222222222333322222211111100000 Q ss_pred cccccccccc Q lcl|NC_021299. 313 IEGETVKAGE 322 (387) Q Consensus 313 ~~~~~~~~~~ 322 (387) +...+. T Consensus 271 ----~~~~~~ 276 (276) T protein:vir:10 271 ----TTDSGA 276 (276) T ss_pred ----CCcCCC Confidence 000000 No 27 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=99.95 E-value=3e-31 Score=187.95 Aligned_cols=288 Identities=15% Similarity=0.052 Sum_probs=179.6 Q ss_pred Cccccc--------------------cHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceec Q lcl|NC_021299. 1 MANAFI--------------------KPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGL 60 (387) Q Consensus 1 Ma~~~~--------------------~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~ 60 (387) |||+.- -=|+|+.+++..|++..+|.+++++. ++ +.|++++|+..+...+.++.. T Consensus 1 ma~~~~~~~~~t~~~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~---~~--~~G~sv~i~~ig~~t~~~~~~ 75 (347) T protein:vir:15 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLR---SI--ASGKSAQFPVIGRTKAAYLKP 75 (347) T ss_pred CCccccCCccccccccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhccccc---cc--cccceeEeeeccceeeeeecc Confidence 665431 12778999999999999999998742 33 359999999999998887753 Q ss_pred ccccccccccccccccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccc----- Q lcl|NC_021299. 61 RATGADRNMVASDLTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYE----- 135 (387) Q Consensus 61 ~~~~~~~~~~~~~~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~----- 135 (387) . ...+...+++....+.|+||+.+++.+.++|.|+.+...|++.++.+++.++||+.+|+.++..+..+... T Consensus 76 g---~~l~~~~~~~~~~e~~ltID~~~~~~~~VddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~ 152 (347) T protein:vir:15 76 G---ENLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDASN 152 (347) T ss_pred C---CCCCCCCCCCccceEEEEechhhhhhHHhhhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Confidence 2 12223456678899999999999999999999999999999999999999999999999998765432100 Q ss_pred ---------------ccccCC-------cchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccc Q lcl|NC_021299. 136 ---------------KVSLVD-------EDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEA 193 (387) Q Consensus 136 ---------------~~~~~~-------~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~ 193 (387) ...+.. ....++.+.+|+++|++++||.++||+|++|++|..|+++++|...+..+. T Consensus 153 ~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL~~~~~~~~d~~~~- 231 (347) T protein:vir:15 153 ENIEGLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQAL- 231 (347) T ss_pred ccccccCccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhccccccccccccc- Confidence 000111 123477888999999999999999999999999999999999887766443 Q ss_pred cceeeeeeEEEEeecceeeeeeccceeeeeeecccc-----ccccccccc-------cccCceeeeeeecccccceeeee Q lcl|NC_021299. 194 GASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTA-----YAMLTRSPG-------RPMTNTVATSTVATENGVQLRWL 261 (387) Q Consensus 194 ~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a-----~~~~~~~~~-------~~~~~t~~~~~~~~~~~~~~~~~ 261 (387) ..+++|.++++.||+|++++++|..........+ +.+...... ...+.-++..+............ T Consensus 232 --~~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e 309 (347) T protein:vir:15 232 --IDHERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALE 309 (347) T ss_pred --ccccceEEEEEeceEEEecccccccccccccccccccccccccccccceeeeccccceeeeeccceeeeeEeeceeee Confidence 4579999999999999999999865332211111 111000000 00000001111111111111111 Q ss_pred eeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeeeec Q lcl|NC_021299. 262 GDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKATDA 311 (387) Q Consensus 262 ~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~~~ 311 (387) ..++.....+....-..+|.....+... ..+.+++..- T Consensus 310 ~~~~~~~~~d~i~~~~~~G~~vlrP~~a------------v~~~~~~~~~ 347 (347) T protein:vir:15 310 RARRANYQADQIIAKYAMGHGGLRPEAA------------GAIVLPKVSE 347 (347) T ss_pred ecccchhhhhhhehhhhcCCceeccccE------------EEEecCCCCC Confidence 2222222222211111122211111111 1111111100 No 28 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=99.95 E-value=2.1e-31 Score=188.80 Aligned_cols=285 Identities=15% Similarity=0.069 Sum_probs=180.1 Q ss_pred Cccc---------------------cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeecee Q lcl|NC_021299. 1 MANA---------------------FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRG 59 (387) Q Consensus 1 Ma~~---------------------~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~ 59 (387) |||+ +|+ |+|+.+++..|.+..+|.++++.- ++ +.|++++|+..+...+.++. T Consensus 1 ~~~~~~~~~~~t~~g~~~~~~~~~al~i-e~~~g~V~~~f~~~s~~~~~v~~r---~~--~~G~sv~i~~iG~~t~~~~~ 74 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQSAADKLALFL-KVFGGEVLTAFARTSVTMPRHMLR---SI--ASGKSAQFPVIGRTKAAYLK 74 (347) T ss_pred CCCCccCcccccccccCCcccchHHHHH-HHHHHHHHHHHHHHHhhhhhhccc---cc--cccceeEeeeccceeeeeec Confidence 6543 355 999999999999999999999731 23 35999999999999998875 Q ss_pred cccccccccccccccccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccc------ Q lcl|NC_021299. 60 LRATGADRNMVASDLTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAP------ 133 (387) Q Consensus 60 ~~~~~~~~~~~~~~~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~------ 133 (387) .. ...+...+++...++.|+||+.+|+.+.|+|.|+.+...|++.++.+++.++||+.+|+.++..+..+. T Consensus 75 ~g---~~l~~~~~~~~~~e~~ltiD~~~y~~~~VddiD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~ 151 (347) T protein:vir:33 75 PG---ENLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGS 151 (347) T ss_pred CC---CCCCCCCCCCccceEEEEechhhhhhHHHhhHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccc Confidence 32 122223466788899999999999999999999999999999999999999999999999986542110 Q ss_pred --------------ccccccC-------CcchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhccc Q lcl|NC_021299. 134 --------------YEKVSLV-------DEDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGE 192 (387) Q Consensus 134 --------------~~~~~~~-------~~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~ 192 (387) .....+. .+...|+.|++++++|++++||.++||+|++|++|..|+++++|...+..+ T Consensus 152 ~~~~~~~~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~d~~~- 230 (347) T protein:vir:33 152 NENIEGLGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQA- 230 (347) T ss_pred ccccccccccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCcEEEeCHHHHHHHhcccccccccccc- Confidence 0011111 123568899999999999999999999999999999999999998776643 Q ss_pred ccceeeeeeEEEEeecceeeeeeccceeeeeeeccccccccccccccccCcee--------------eeeeeccccccee Q lcl|NC_021299. 193 AGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTAYAMLTRSPGRPMTNTV--------------ATSTVATENGVQL 258 (387) Q Consensus 193 ~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a~~~~~~~~~~~~~~t~--------------~~~~~~~~~~~~~ 258 (387) ...+++|.++++.||+|++++++|..... .+..+..... ......+.+. +..+......... T Consensus 231 --~~~~~~G~V~~i~G~~V~~Sn~lp~~~~~-~~~~~~~ag~-~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~ 306 (347) T protein:vir:33 231 --LLDPERGTIRNVMGFEVVEVPHLTAGGAG-DTREDAPADQ-KHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDL 306 (347) T ss_pred --ccccccceeEEEeceeEEEecccccCccc-cccccccccc-cccccCCcccceeccccceeeeeecchhheeeeeece Confidence 34678999999999999999999875332 2211111000 0000000000 0000000111111 Q ss_pred eeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeeeec Q lcl|NC_021299. 259 RWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKATDA 311 (387) Q Consensus 259 ~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~~~ 311 (387) .....++.....+....-..+|.....+... ..+.+++..- T Consensus 307 ~~e~~r~~~~~~d~i~~~~~~G~~vlrP~~a------------v~i~~~~~~~ 347 (347) T protein:vir:33 307 ALERARRANYQADQIIAKYAMGHGGLRPEAA------------GAIVLPKVSE 347 (347) T ss_pred eeeeccchhhhhHhhhhhhhcCCceecccce------------EEEecCCCCC Confidence 1111122111111111111112111111110 1111111100 No 29 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=99.95 E-value=4.1e-31 Score=187.17 Aligned_cols=284 Identities=15% Similarity=0.089 Sum_probs=176.1 Q ss_pred Cccc----------------------cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeece Q lcl|NC_021299. 1 MANA----------------------FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTR 58 (387) Q Consensus 1 Ma~~----------------------~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~ 58 (387) |||+ +|+ |+|+.|++..|.+..+|.+++++ .++. -|++++|+..+...+..+ T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~~~al~i-e~~~geV~~~f~~~s~~~~~~~~---r~i~--~g~s~~~~~iG~~~~~~~ 74 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGDKLALFL-KVFGGEVLTAFARTSVTTSRHMV---RSIS--SGKSAQFPVLGRTQAAYL 74 (344) T ss_pred CccccccccCCcccCCccCCccchhHHHH-HHHHHHHHHHHHHHhhhccccee---eeec--ccceEEEEeeceeEEEee Confidence 7765 244 89999999999999999999874 2454 499999999998888866 Q ss_pred ecccccccccccccccccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccc---- Q lcl|NC_021299. 59 GLRATGADRNMVASDLTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPY---- 134 (387) Q Consensus 59 ~~~~~~~~~~~~~~~~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~---- 134 (387) ... .......+++....+.|+||+.+|+.+.|+|.|+.+..+|++.++.++++++||+.+|+.++..+..+.. T Consensus 75 ~~G---~~l~~t~~~~~~~e~~l~ID~~~y~~~~VdDiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~~ 151 (344) T protein:vir:10 75 APG---ENLDDIRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESQ 151 (344) T ss_pred ecC---CCCCCCCCCcccceEEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Confidence 532 2222334678889999999999999999999999999999999999999999999999999866532110 Q ss_pred ----------c-------ccccC-----CcchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhccc Q lcl|NC_021299. 135 ----------E-------KVSLV-----DEDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGE 192 (387) Q Consensus 135 ----------~-------~~~~~-----~~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~ 192 (387) . .+... .....|+.+.++++.|++++||.++||+|++|++|..|+++++|......+ T Consensus 152 ~~~~~~g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~- 230 (344) T protein:vir:10 152 YNENITGLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPSSDRVFYCDPDSYSAILAALMPNAANYAA- 230 (344) T ss_pred cccccccccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCCEEEeChHHHHHHhhccccccccccc- Confidence 0 00000 112457889999999999999999999999999999999998887766543 Q ss_pred ccceeeeeeEEEEeecceeeeeeccceeeeeeeccccccccccccccccCceeee------------eeecccccceeee Q lcl|NC_021299. 193 AGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTAYAMLTRSPGRPMTNTVAT------------STVATENGVQLRW 260 (387) Q Consensus 193 ~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a~~~~~~~~~~~~~~t~~~------------~~~~~~~~~~~~~ 260 (387) ...+++|.++++.||+|++++++|...... +..+.+-.........+..... .+........+.. T Consensus 231 --~~~~~~G~V~~v~G~~V~~Sn~lp~~~~~~-~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~~~ 307 (344) T protein:vir:10 231 --LIDPEKGSIRNVMGFEVVEVPHLTAGGAGT-SREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLAL 307 (344) T ss_pred --ccceeeeEEEEEeceEEEeccccccccCCc-ccccccCccccccCCcccceeeecceeEEEeechhhhhhhhhcccee Confidence 345789999999999999999998642221 1111111000000000000000 0000000000000 Q ss_pred eeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeee Q lcl|NC_021299. 261 LGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKAT 309 (387) Q Consensus 261 ~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~ 309 (387) ...++.....+.......+|.....+... ..|.+... T Consensus 308 e~~r~~~~~~d~i~g~~~~G~~vlRPe~a------------~~v~~~~~ 344 (344) T protein:vir:10 308 ERARRANFQADQIIAKYAMGHGGLRPEAA------------GAVVFKTK 344 (344) T ss_pred ecccchhHHHHHHHHHhhcccceecccce------------EEEEeecC Confidence 11111111111110001111111111000 00111111 No 30 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=99.94 E-value=2.6e-30 Score=182.75 Aligned_cols=284 Identities=14% Similarity=0.067 Sum_probs=170.1 Q ss_pred Cccc--------------------cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceec Q lcl|NC_021299. 1 MANA--------------------FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGL 60 (387) Q Consensus 1 Ma~~--------------------~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~ 60 (387) |||. +|+ |.|..|+...|.+..+|.+++.+- ++ +.|++++||..+...+.++.. T Consensus 1 m~~~~~~~~~t~~g~~~~~~d~~al~i-k~f~~eV~~~f~~~s~~~~~~~~r---~i--~~G~sv~i~~iG~~tv~~~t~ 74 (347) T protein:vir:94 1 MANVPGQKIGTDQGKGKSSSDALALFL-KVFAGEVLTAFTRRSVTADKHIVR---TI--QNGKSAQFPVMGRTSGVYLAP 74 (347) T ss_pred CCCCCccccccccccCCccccHHHHHH-HHHhHHHHHHHHHHHhhhcccccc---cc--cccceEEEecccceeeeeecC Confidence 3332 122 455555556677778888887532 34 459999999999999988763 Q ss_pred ccccccccc--cccccccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccc---- Q lcl|NC_021299. 61 RATGADRNM--VASDLTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPY---- 134 (387) Q Consensus 61 ~~~~~~~~~--~~~~~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~---- 134 (387) +..+ ..+++.+.++.|+||+.+++.+.++|.|+.+...|++.++.++++++||+.+|+.++.++..... T Consensus 75 -----G~~l~~~~~~~~~~e~~itID~~~~~~~~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~ 149 (347) T protein:vir:94 75 -----GERLSDKRKGIKHTEKVITIDGLLTADVMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAA 149 (347) T ss_pred -----CCCcCCCCCCCCcceEEEEecchhhhhHHhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc Confidence 3333 45678889999999999999999999999999999999999999999999999999876532100 Q ss_pred --cc------------cccC-------CcchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccc Q lcl|NC_021299. 135 --EK------------VSLV-------DEDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEA 193 (387) Q Consensus 135 --~~------------~~~~-------~~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~ 193 (387) .. .... .....++.|.+++++|++.+||.++||+|++|++|..|+.+..+......++ T Consensus 150 ~~~~~~g~~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~~~~~~~~~~- 228 (347) T protein:vir:94 150 SNENIAGLGTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILAALMPNAANYAAL- 228 (347) T ss_pred cccccCCCcccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhccchhhhhhcccc- Confidence 00 0000 1234578899999999999999999999999999999998887777655443 Q ss_pred cceeeeeeEEEEeecceeeeeeccceeeeee-eccccccccccccccccCc---------------eeeeeeecccccce Q lcl|NC_021299. 194 GASRLQTARIGRLAQYDVVTVDTLPHGDAYL-SHPTAYAMLTRSPGRPMTN---------------TVATSTVATENGVQ 257 (387) Q Consensus 194 ~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~-~~~~a~~~~~~~~~~~~~~---------------t~~~~~~~~~~~~~ 257 (387) ..+++|.+++++||+|++++++|...... .....+....+..-...+. -++-.+........ T Consensus 229 --~~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~ 306 (347) T protein:vir:94 229 --IDPETGNIRNVMGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRD 306 (347) T ss_pred --ccccccceEEEeceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhccc Confidence 35788999999999999999999643221 1111122211110000000 00000000000000 Q ss_pred eeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeeee Q lcl|NC_021299. 258 LRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKATD 310 (387) Q Consensus 258 ~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~~ 310 (387) +.....++.....+.......+|.....+..... +.+.... T Consensus 307 ~~~e~~r~~~~~~d~i~~~~~~G~~~~rP~~a~~------------~~~~~A~ 347 (347) T protein:vir:94 307 LALERDRDVDAQGDLIVGKYAMGHGGLRPEAAGA------------LVFSPAE 347 (347) T ss_pred ccccchhchhhHHHHhhhhhhhcCcccccceeEE------------EEecCCC Confidence 0111111111111111111111111111110000 0000000 No 31 >protein:vir:2106 Length: 430 # NCBI annotation: coat protein # Family: family:all:1412 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:NP_059630;genbank:gi:9635538;genbank:GeneID:1262831 Probab=99.94 E-value=1.4e-28 Score=173.21 Aligned_cols=370 Identities=12% Similarity=0.098 Sum_probs=203.7 Q ss_pred Cccccc-cHHHHHHHHHHHHHhhccccce--eeecccccccccCCCEEEEEecccceeeceecccccccccccccccccc Q lcl|NC_021299. 1 MANAFI-KPPVIIASILGQLQHELVLPNF--VFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEV 77 (387) Q Consensus 1 Ma~~~~-~pe~~~~~~~~~l~~~~~~~~~--~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (387) |||++- .-|+..+|+++.|+++++|.++ ++|+|+.+| .+.||||.+|.|...+..+.. ......+++.+. T Consensus 1 Ma~~~~~~lti~~~eal~~~~n~lV~a~~~~~~r~~d~~~-~r~Gdti~ip~p~~~~~~~G~------~~t~~~~~~~e~ 73 (430) T protein:vir:21 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASM-QRSSNTIWMPVEQESPTQEGW------DLTDKATGLLEL 73 (430) T ss_pred CccccchhhHHHHHHHHHHhhhhhhhhhhhhccCCchhhh-hcccceEEeeccccccccccc------cccCCCccceee Confidence 999871 2345559999999999999996 679998887 488999999998777665422 112334688999 Q ss_pred eEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-----cccCCcchhHHHHHHH Q lcl|NC_021299. 78 TVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEK-----VSLVDEDEIWNGVVSN 152 (387) Q Consensus 78 ~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~-----~~~~~~~~~~~~i~~a 152 (387) ++++++++++...|.++++|+ ...+++++++++++++||++||.+++.++...+..+ ++.......|+++..+ T Consensus 74 ~v~~~~~~~~~V~~~~~~kEl--~~~~~~er~l~pAm~~LA~~Vd~dl~~~~~~~~~~v~~~~~~t~~~~~~~~~~~A~a 151 (430) T protein:vir:21 74 NVAVNMGEPDNDFFQLRADDL--RDETAYRRRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADA 151 (430) T ss_pred eEeEEEeeeccceEEeehhHh--cChhhHHHHHHHHHHHHHHHHHHHHHHHhhhhhhccccccCCCCCCCCcchhhHHHH Confidence 999999999999999999984 577888999999999999999999999886644333 3444555679999999 Q ss_pred HHHHhhccCCcC-CcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEE-eeccee-eeeeccceeeeeeecccc Q lcl|NC_021299. 153 RRWLNEQKVPKD-GRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGR-LAQYDV-VTVDTLPHGDAYLSHPTA 229 (387) Q Consensus 153 ~~~l~~~~vp~~-~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~-~~g~~v-~~s~~~~~~~~~~~~~~a 229 (387) ++.|++.++|.+ +|.++++|+.+..+.. .+.+....+......+|+|.+++ +.||++ +.++.+|........ + T Consensus 152 ~~~L~~~~vP~~~~R~~~~~p~~~~~l~~--~l~~~~~~~~~~~~A~r~g~i~r~~~Gfd~~~~s~~~~~~t~gt~t--~ 227 (430) T protein:vir:21 152 EEIMFSRELNRDMGTSYFFNPQDYKKAGY--DLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTAT--G 227 (430) T ss_pred HHHHHHhcCCCCCCcEEEeChHHHHHHhh--hhccccccccchhHHHhhcccccccchhhhhhhcCCcccccCccCc--C Confidence 999999999995 7999999999988754 24444444444556789999997 889975 778888765443321 1 Q ss_pred ccccccccc-cccCceeeeeeecc-cccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccc----- Q lcl|NC_021299. 230 YAMLTRSPG-RPMTNTVATSTVAT-ENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGT----- 302 (387) Q Consensus 230 ~~~~~~~~~-~~~~~t~~~~~~~~-~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~----- 302 (387) .+ ..++.. .....+........ ........... ..+.-...-..+..|+..++............++... T Consensus 228 ~t-v~gA~~~~~~~~tv~~~g~~~~~d~~~~~it~s--~tg~l~~GD~ftiaGV~~v~~itk~~~~~l~qf~V~a~~~~t 304 (430) T protein:vir:21 228 IT-VSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLS--ATTGMKRGDKISFAGVKFLGQMAKNVLAQDATFSVVRVVDGT 304 (430) T ss_pred ce-eccccccccccceeccccccccccccceeeeee--cccceecccEEEecceeeeccccccccCCcceEEEEEecCCc Confidence 11 111110 00000000000000 00000000000 0000111111222333333333322222222222221 Q ss_pred eeeeeeeeccccccccc--cccceeEEEeeccCCc------cccCcceEEEecCceEEEEcC---Cce----EEEE--e- Q lcl|NC_021299. 303 RIHLKATDAEIEGETVK--AGEKLALALEDSNGDN------RAGDPLVTWTSGTTAKATIDA---NGV----VTGV--A- 364 (387) Q Consensus 303 ~v~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~------~~~~~~v~w~Ssn~~VAtVd~---~G~----VTa~--~- 364 (387) .+.+.+..+-+.-.++. ......++..+.++.. .....++.|.-+-=..+++.= .|. .+.. - T Consensus 305 tv~I~Pai~~~~~~~~~~~~~~y~nVsaspa~~aavT~v~~a~~~~Nl~fh~~A~~La~~pl~~p~~~~~~~~~~~~~~~ 384 (430) T protein:vir:21 305 HVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIP 384 (430) T ss_pred eeEEeecccccccccccccccccceeccccccCceeEEeccCCcccceeEccceeEEEEecccCCCChhHhhheeeeecc Confidence 12222211111100111 1111122111111110 001234666655555555541 121 1110 0 Q ss_pred -cceEEEEEEE--CCE--EEEEEEEEeC Q lcl|NC_021299. 365 -AGTSEITAVV--DGL--TVKKTITVTA 387 (387) Q Consensus 365 -~Gta~Itat~--~~~--~~~~~vtVta 387 (387) .|-. |.+.+ +.. ...|.+-|=- T Consensus 385 ~~Gls-irv~~~yd~~~~~~~~r~Dily 411 (430) T protein:vir:21 385 DVGLN-GIFATQGDISTLSGLCRIALWY 411 (430) T ss_pred ccceE-EEEEEccccccCceEEEEEeec Confidence 1222 44432 222 3333333322 No 32 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=99.94 E-value=1.7e-29 Score=178.32 Aligned_cols=286 Identities=15% Similarity=0.080 Sum_probs=175.2 Q ss_pred Cccc---------------------cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeecee Q lcl|NC_021299. 1 MANA---------------------FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRG 59 (387) Q Consensus 1 Ma~~---------------------~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~ 59 (387) |||+ +|+ |+|+.|++..|.+..+|.++++.- ++ +.|++++||..+...+..+. T Consensus 1 ~a~~~~~~~~~~~~g~~~~~~d~~al~i-e~~~geV~~~f~~~s~~~~~~~~r---~i--~~G~sv~~~~iG~~~~~~~~ 74 (347) T protein:vir:88 1 MANATGGQQIGANQGKGQSAADKLALFL-KVFGGEVLTAFVRRSVTMDKHMVR---TI--QNGKSASFPVMGRTKGYYLA 74 (347) T ss_pred CCCcccchhhhccCCCCccccchHHHHH-HHHHHHHHHHHHHHhhhhhccccc---cc--cCcceEEEeeecceeeeeec Confidence 6632 355 899999999999999999998642 34 45999999999998887654 Q ss_pred cccccccccccccccccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccc---- Q lcl|NC_021299. 60 LRATGADRNMVASDLTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYE---- 135 (387) Q Consensus 60 ~~~~~~~~~~~~~~~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~---- 135 (387) .. ........++..+++.|+||+.+|+.+.|+|.|+.+...|++.++.++++++||+.+|+.++..+..+... T Consensus 75 ~g---~~l~~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~~ 151 (347) T protein:vir:88 75 PG---ENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAAS 151 (347) T ss_pred cc---cCCCCCCCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Confidence 32 22223345778899999999999999999999999999999999999999999999999988655322110 Q ss_pred --------------ccccC-------CcchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhccccc Q lcl|NC_021299. 136 --------------KVSLV-------DEDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAG 194 (387) Q Consensus 136 --------------~~~~~-------~~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~ 194 (387) .+++. .....|+.|++++++|++++||.++|++|++|++|..|+++.++......+. T Consensus 152 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~-- 229 (347) T protein:vir:88 152 NENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAAL-- 229 (347) T ss_pred ccccCCccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCHHHHHHHhcchhhhhhhhccc-- Confidence 01000 1123478899999999999999999999999999999999888776655433 Q ss_pred ceeeeeeEEEEeecceeeeeeccceeeeeeecccc-ccccccccccccCceeeeee--------------ecccccceee Q lcl|NC_021299. 195 ASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTA-YAMLTRSPGRPMTNTVATST--------------VATENGVQLR 259 (387) Q Consensus 195 ~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a-~~~~~~~~~~~~~~t~~~~~--------------~~~~~~~~~~ 259 (387) ..+++|.++++.||.|++++++|........... +................+.. ........+. T Consensus 230 -~~~~~G~vg~i~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~d~~ 308 (347) T protein:vir:88 230 -IDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMA 308 (347) T ss_pred -cchhcceeeeeccceEEEeecccccccccccccccccccccccccccccccccccccCcEEEEEechhhhhheecccce Confidence 3578899999999999999999865333211111 10000000000000000000 0000000000 Q ss_pred eeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeeeeccccc Q lcl|NC_021299. 260 WLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKATDAEIEG 315 (387) Q Consensus 260 ~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~~~~~~~ 315 (387) ....++.....+.......+|.....+......... + .. T Consensus 309 ~e~~r~~~~~~d~i~~~~~~G~~~~rPe~a~~~~~~------------~-----a~ 347 (347) T protein:vir:88 309 LERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFT------------P-----AA 347 (347) T ss_pred eeeeechhhHHHHhhhhhhhcCceeccceEEEEEeC------------C-----CC Confidence 111111111111111111111111111110000000 0 00 No 33 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=99.93 E-value=1.7e-28 Score=172.80 Aligned_cols=312 Identities=15% Similarity=0.116 Sum_probs=173.5 Q ss_pred Cc--c--------------ccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccc Q lcl|NC_021299. 1 MA--N--------------AFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATG 64 (387) Q Consensus 1 Ma--~--------------~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~ 64 (387) |+ | .+|+ |+|+.+++..|.+..+|.++++. +++. -|++++|+..+...+.++.. T Consensus 9 ~~~~n~~t~~~~~~~~~~~al~l-e~f~geV~~~f~~~si~~~~~~~---rti~--~Gksv~f~~iG~~t~~~~t~---- 78 (375) T protein:vir:10 9 LGRSNLSTGTGYGGATDKYALYL-KLFSGEMFKGFQHETIARDLVTK---RTLK--NGKSLQFIYTGRMTSSFHTP---- 78 (375) T ss_pred cCccccCCccccccccchHHHHH-HHHhHHHHHHHHHHHhhhccccc---cccc--cCceEEEEeeeeeEEeeecC---- Confidence 22 1 3444 89999999999999999999863 2454 49999999999999988763 Q ss_pred ccccccc---cccccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccc------ Q lcl|NC_021299. 65 ADRNMVA---SDLTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYE------ 135 (387) Q Consensus 65 ~~~~~~~---~~~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~------ 135 (387) +..+.- .++..++++|+||+.+|+.+.|+|.|+.+...|++.++.++++++||+.+|+.++..+..+... T Consensus 79 -G~~i~~~~~~d~~~te~~l~ID~~~y~~~~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~~p~~~ 157 (375) T protein:vir:10 79 -GTPILGNADKAPPVAEKTIVMDDLLISSAFVYDLDETLAHYELRGEISKKIGYALAEKYDRLIFRSITRGARSASPVSA 157 (375) T ss_pred -CcCcCCccccCCCCCceEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc Confidence 333322 2455677889999999999999999999999999999999999999999999998776432110 Q ss_pred ------------------ccccCCcchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcc---cchhhhhhccccc Q lcl|NC_021299. 136 ------------------KVSLVDEDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLD---DRFIRYDSAGEAG 194 (387) Q Consensus 136 ------------------~~~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~---~~~~~~~~~g~~~ 194 (387) .....++...|+.|++++++|++++||.++||+|++|++|..|+++ ++|.+.+..+ T Consensus 158 ~~~~~~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~d~~~~~n~d~~~--- 234 (375) T protein:vir:10 158 TNFVEPGGTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEKGVSSQGRCAVLNPRQYYALIQDIGSNGLVNRDVQG--- 234 (375) T ss_pred ccccccCcceeeeccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeChHHHHHHHhcCCccceeeecccc--- Confidence 0111235567999999999999999999999999999999999975 4566554433 Q ss_pred ceeeeeeEEEEeecceeeeeeccceeeeeeeccccccccccccccccCceeeeeeecccccceeeeeeeeeec----cce Q lcl|NC_021299. 195 ASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDAT----STT 270 (387) Q Consensus 195 ~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~----~~~ 270 (387) .....+|.++++.||.|++++++|......+...+. .....+................... .-.+|+.. .+. T Consensus 235 ~~~~~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~-~~~~a~~~~~~~~~~~~~~~~~~~g---~~~~y~~d~~~~~~~ 310 (375) T protein:vir:10 235 SALQSGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGT-TGETSPGNLGSHIGPTPENANATGG---VNNDYGTNAELGAKS 310 (375) T ss_pred cceeccceEEEEeceEEEEecccccccccccccccc-ccccchhhhhccccccCCcceeecc---ccccccccccccCce Confidence 235567888999999999999999764432221111 1100010000000000000000000 00011100 000 Q ss_pred eeeeeeee-eeec-cccceeeeccceeccccccceeeeeeeecccccccccccccee---EEEeeccCCcccc Q lcl|NC_021299. 271 ERSIVDTW-IGVK-AVLDPVTANLDDEPRFVRGTRIHLKATDAEIEGETVKAGEKLA---LALEDSNGDNRAG 338 (387) Q Consensus 271 ~~~~~~~~-~g~~-~~~~~~~~~~~~~~~~v~~~~v~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~ 338 (387) .....+.. .|+. .............. .-..-.+. +....++.+.-.. +.+......+... T Consensus 311 ~~~~~~~~A~g~v~~~~~~~~~~~~~~~---~~~q~~~i-----~~~~a~G~~~lrp~~av~l~~~~~~~~~~ 375 (375) T protein:vir:10 311 CGLIFQKEAAGVVEAIGPQVQVTNGDVS---VIYQGDVI-----LGRMAMGADYLNPAAAVELYIGATAPSAF 375 (375) T ss_pred EEEEEchhheeeeeeeccccccccchhh---heeeeeee-----eeeeeeccCccCceeEEEEecCcCccccC Confidence 00000000 0000 00000000000000 00000000 0000001000000 0000000000000 No 34 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=99.93 E-value=1.3e-28 Score=173.38 Aligned_cols=279 Identities=15% Similarity=0.101 Sum_probs=171.9 Q ss_pred Ccc----------------------ccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeece Q lcl|NC_021299. 1 MAN----------------------AFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTR 58 (387) Q Consensus 1 Ma~----------------------~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~ 58 (387) |++ .+|+ |+|+.|++..|.+..+|.++++. .++. -|++++|+..+...+..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~al~l-e~f~geV~~~f~~~s~~~~~~~~---r~i~--~gks~~~~~iG~~~~~~~ 74 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVVAAGDKLALFL-KVFGGEVLTAFARTSVTTSRHMV---RSIS--SGKSAQFPVLGRTQAAYL 74 (345) T ss_pred CcccccchhcccccccccccCCchhHHHH-HHHhHHHHHHHHHHhhhccccee---eecc--ccceEEEeeecceEEEee Confidence 332 2455 89999999999999999999863 2454 399999999999888877 Q ss_pred ecccccccccccc--cccccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccc- Q lcl|NC_021299. 59 GLRATGADRNMVA--SDLTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYE- 135 (387) Q Consensus 59 ~~~~~~~~~~~~~--~~~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~- 135 (387) .. +..+.. .++...+..|+||+.+++.+.|+|.|+.+...|++.++.++++++||+.+|+.++..+..+... T Consensus 75 ~~-----G~~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~ 149 (345) T protein:vir:22 75 AP-----GENLDDKRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVE 149 (345) T ss_pred ec-----CCCCCCCCCCcccceEEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 53 233333 3466678889999999999999999999999999999999999999999999988654321100 Q ss_pred -----------c---------cc-----cCCcchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhc Q lcl|NC_021299. 136 -----------K---------VS-----LVDEDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSA 190 (387) Q Consensus 136 -----------~---------~~-----~~~~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~ 190 (387) . +. .......|+.|.+|+++|++++||.++||+|++|++|..|+++++|...... T Consensus 150 ~~~~~~~~~~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~~~~~~~~~ 229 (345) T protein:vir:22 150 SKYNENIEGLGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAADRVFYCDPDSYSAILAALMPNAANYA 229 (345) T ss_pred ccccccccccccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCccCCEEEeChHHHHHHhccccccccccc Confidence 0 00 0122356899999999999999999999999999999999999998876665 Q ss_pred ccccceeeeeeEEEEeecceeeeeeccceeeeeeeccccccccccccccccCceeeeeeecccccc-eeeeee------- Q lcl|NC_021299. 191 GEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTAYAMLTRSPGRPMTNTVATSTVATENGV-QLRWLG------- 262 (387) Q Consensus 191 g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~-~~~~~~------- 262 (387) ++ ...++|.++++.||+|++++++|........ .+. .. .....+....... ........ ...+.. T Consensus 230 ~~---~~~~~G~V~~i~G~~V~~sn~lp~~~~~~~~-~~~-~~-~~~~~~~~~g~~~-~~~~~~~~~~l~~h~~A~~~v~ 302 (345) T protein:vir:22 230 AL---IDPEKGSIRNVMGFEVVEVPHLTAGGAGTAR-EGT-TG-QKHVFPANKGEGN-VKVAKDNVIGLFMHRSAVGTVK 302 (345) T ss_pred cc---cccccceEEEEeceEEEecccccccccCccc-cCc-cc-cccccccccccee-eeeccCceEEEEEehhheeeee Confidence 43 3467999999999999999999854222111 110 00 0000000000000 00000000 000000 Q ss_pred --------eeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeee Q lcl|NC_021299. 263 --------DYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLK 307 (387) Q Consensus 263 --------~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~ 307 (387) .++.....+.......+|... ..+....+....+. T Consensus 303 ~~~~~~e~~r~~~~~~d~I~~~~a~G~~v----------lRPeaa~~i~~~~~ 345 (345) T protein:vir:22 303 LRDLALERARRANFQADQIIAKYAMGHGG----------LRPEAAGAVVFKVE 345 (345) T ss_pred eecceeeeeechhHHHHHHHHHHhcCCcc----------cccceeEEEEEeeC Confidence 000000000000000000000 00011111111111 No 35 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=99.93 E-value=1.5e-28 Score=173.14 Aligned_cols=287 Identities=11% Similarity=0.068 Sum_probs=181.4 Q ss_pred Cccc------------------cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccc Q lcl|NC_021299. 1 MANA------------------FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRA 62 (387) Q Consensus 1 Ma~~------------------~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~ 62 (387) |+|- +|+ |+|+.+++..|.+..+|.+++.+ +++ +.|++++|+..+...+..+. T Consensus 1 m~~~~~~~~t~~~~~~~~~~~~l~l-e~~~geV~~af~~~s~~~~~~~~---r~i--~~G~s~~~~~iG~~~~~~~~--- 71 (334) T protein:vir:80 1 MTYPAANTHTRPGWGGANSDVSLHI-EEHLGLVDASFMYSSKFASWMNV---RSL--RGTNQLRVDRVGASTIAGRK--- 71 (334) T ss_pred CCCCcCCCccccccccccchheehh-hhhhhHHHHHHHHhhhhhcccee---eec--cccceEEEeeecceeeeeec--- Confidence 6643 233 99999999999999999998864 234 45999999999999888765 Q ss_pred ccccccccccccccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccc-ccc----- Q lcl|NC_021299. 63 TGADRNMVASDLTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAP-YEK----- 136 (387) Q Consensus 63 ~~~~~~~~~~~~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~-~~~----- 136 (387) .+.++..+++...++.|+||+.+++.+.|.|.|+.+..+|++.++.++++++||+..|+.++..+..+. ... T Consensus 72 --~g~~l~~~~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~ 149 (334) T protein:vir:80 72 --AGEELVVQKNVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLK 149 (334) T ss_pred --CCCCCCCCCcccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc Confidence 466788889999999999999999999999999999999999999999999999999998876543211 100 Q ss_pred -----c---------cc----CCcchhHHHHHHHHHHHhhccCCc---CCcEEEEchHHHHHHhcccchhhhhhcccccc Q lcl|NC_021299. 137 -----V---------SL----VDEDEIWNGVVSNRRWLNEQKVPK---DGRVLLVGSAVEEALLLDDRFIRYDSAGEAGA 195 (387) Q Consensus 137 -----~---------~~----~~~~~~~~~i~~a~~~l~~~~vp~---~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~ 195 (387) + ++ ..+...+..+.+|++.|++.++|+ .+|++|++|++|..|+.+++|.+.+..++++. T Consensus 150 ~~~~~G~~~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d~~~s~~~ 229 (334) T protein:vir:80 150 PAFHDGILLPSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNVEFGAKEGG 229 (334) T ss_pred ccccCCcceeecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccccceecccccc Confidence 0 00 111233567889999999999994 67999999999999999999998877666656 Q ss_pred eeeeeeEEEEeecceeeeeeccceeeeeeecccccccc--ccccccccCceeeeeeecccccceeeeeeeeeeccceeee Q lcl|NC_021299. 196 SRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTAYAML--TRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERS 273 (387) Q Consensus 196 ~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a~~~~--~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 273 (387) ..+.+|.++++.||+|++++++|.... ..+..+..+. .+......+.-.+..+........+.....++.....+.. T Consensus 230 ~~~~~g~i~~v~G~~V~~Sn~~P~~~~-t~~~~g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~~d~i 308 (334) T protein:vir:80 230 NSFVGGRIAMLNGVRVVETPRFPQSAI-TANALGADFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFWEEKKDFGHYL 308 (334) T ss_pred ccccceeEEEEeceEEEeecCCCCccc-cccccccccccccccccceEEEEEeCceEEEEEEeecceeeeechhhHHHHH Confidence 678899999999999999999986532 2221111111 0000000000000011100000001111111111111110 Q ss_pred eeeeeeeeccccceeeeccceeccccccceeeeeee Q lcl|NC_021299. 274 IVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKAT 309 (387) Q Consensus 274 ~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~ 309 (387) .....+|.... .+..+.+.+++.... T Consensus 309 ~~~~a~G~g~l----------RPeaa~vv~~~~~~~ 334 (334) T protein:vir:80 309 DTFQSYNIGQR----------RPDAVAVHDITVTNP 334 (334) T ss_pred HHHHHcCCcee----------ccceEEEEEEeeecC Confidence 00001111111 111111112111100 No 36 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=99.93 E-value=1.9e-28 Score=172.61 Aligned_cols=285 Identities=15% Similarity=0.075 Sum_probs=172.6 Q ss_pred Cccc---------------------cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeecee Q lcl|NC_021299. 1 MANA---------------------FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRG 59 (387) Q Consensus 1 Ma~~---------------------~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~ 59 (387) |||+ +|+ |+|+.|++..|.+..+|.+++++ .++ +.|++++||..+...+..+. T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d~~al~i-e~~~geV~~~f~~~s~~~~~~~~---rti--~~G~sv~~~~iG~~~~~~~~ 74 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGDKLALFL-KVFGGEVLTAFTRTSVTMNKHLV---RSI--QSGKSAQFPVLGRTKAAYLQ 74 (347) T ss_pred CCccccccccccccccCCcccchHHHHH-HHHhHHHHHHHHHHHhhhhhhhh---eec--cccceEEeeeccceeEeeee Confidence 5543 355 99999999999999999999874 234 35999999999999888765 Q ss_pred cccccccccccccccccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccc---- Q lcl|NC_021299. 60 LRATGADRNMVASDLTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYE---- 135 (387) Q Consensus 60 ~~~~~~~~~~~~~~~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~---- 135 (387) .. .......+++...++.|+||+.+|+.+.|+|.|+.+...|++.++.++++++||+.+|+.++..+..+... T Consensus 75 ~G---~~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~ 151 (347) T protein:vir:94 75 PG---ENLDDKRKDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTAN 151 (347) T ss_pred cC---cCCCCCcCCccccceEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Confidence 32 22222346788899999999999999999999999999999999999999999999999988654321110 Q ss_pred --------------ccc--------cCCcchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccc Q lcl|NC_021299. 136 --------------KVS--------LVDEDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEA 193 (387) Q Consensus 136 --------------~~~--------~~~~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~ 193 (387) .+. ...+...|+.+.+++.+|++++||.++||+|++|++|..|++...+..... . T Consensus 152 ~~~~~g~~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~R~~vv~P~~y~~LLk~~~~~~~~~---~ 228 (347) T protein:vir:94 152 NENIAGLGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSDRVFYTTPDNYSAILAALMPNAANY---Q 228 (347) T ss_pred ccccccCCcceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCCCCCEEEeChHHHHHHHHhhccccccc---c Confidence 000 012334588999999999999999999999999999999997544333322 2 Q ss_pred cceeeeeeEEEEeecceeeeeeccceeeeeeeccccccccccc-cccccCceeeee--------------eeccccccee Q lcl|NC_021299. 194 GASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTAYAMLTRS-PGRPMTNTVATS--------------TVATENGVQL 258 (387) Q Consensus 194 ~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a~~~~~~~-~~~~~~~t~~~~--------------~~~~~~~~~~ 258 (387) ....+++|.++.+.||+|++++++|..........+....... ...+.+....+. +........+ T Consensus 229 ~~~~~~~G~V~~v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~~~ 308 (347) T protein:vir:94 229 ALIDPSTGSIRNVMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLKDM 308 (347) T ss_pred cccccccceeEEeeceEEEEcCccccccCcccccccccccccccccccccccccccccccceEEEEechhhhhhhhhccc Confidence 2345788999999999999999998754322111111000000 000000000000 0000000000 Q ss_pred eeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeee Q lcl|NC_021299. 259 RWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKAT 309 (387) Q Consensus 259 ~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~ 309 (387) .....++.....+........|.....+.... .+.++.. T Consensus 309 ~~e~~~~~~~~~~~i~~~~a~G~g~~rPe~a~------------~i~~~~a 347 (347) T protein:vir:94 309 ALERARRANFQADQIIAKYAMGHGGLRPEACG------------ALVFKKA 347 (347) T ss_pred ceeeeechhhhhhhhhhhhhhcCcccccceeE------------EEEecCC Confidence 01111111111111111111111111111000 0011110 No 37 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=99.93 E-value=1.9e-27 Score=167.04 Aligned_cols=262 Identities=16% Similarity=0.205 Sum_probs=184.3 Q ss_pred Cccc------cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccccc Q lcl|NC_021299. 1 MANA------FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDL 74 (387) Q Consensus 1 Ma~~------~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (387) ||++ +|+||+|+++++++|++.+++.+++.+++ +|.+++|++|+||++..... ....+++..++.+++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~--~~~g~~G~tv~iP~~~~~~~----a~~v~eg~~i~~~~~ 74 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDT--TLEGQPGTTLTVPKWDYIGD----AEDVAEGEAIPMTQL 74 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccc--cccCCCCCEEEEEEecCCCC----cccccCCCccccccc Confidence 9975 49999999999999999999999998875 46788899999998653211 112345677889999 Q ss_pred ccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHHHHH Q lcl|NC_021299. 75 TEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVSNRR 154 (387) Q Consensus 75 ~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~ 154 (387) +.+++.+++.+ .++.+.++|++..+...|++..+.+++.+++++++|++++..+.++... .+....++.+++|.. T Consensus 75 ~~~~~~~~~~~-~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~----~~~~~t~d~i~da~~ 149 (272) T protein:vir:30 75 GFKKTTMTIKK-AGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQT----VEATATVDGVSKALD 149 (272) T ss_pred ccceEEEEeee-eeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc----cccccCHHHHHHHHH Confidence 99999999966 5688999999999999999999999999999999999999888776543 344567899999999 Q ss_pred HHhhccCCcCCcEEEEchHHHHHHhccc--chhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeeeccccccc Q lcl|NC_021299. 155 WLNEQKVPKDGRVLLVGSAVEEALLLDD--RFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTAYAM 232 (387) Q Consensus 155 ~l~~~~vp~~~r~~v~~~~~~~~l~~~~--~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a~~~ 232 (387) .|++++ ...|+++++|+.+..|+++. +|.+....+ ...+++|.+|++.|+.|+.++.+|.+..+.++..++.+ T Consensus 150 ~l~~~~--~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~---~~~~~~g~ig~i~G~~Vi~s~~~p~~t~~~~~~~a~~~ 224 (272) T protein:vir:30 150 IFNDED--DAETVIVMNPADASTLRLDAAKEWLGATEVG---ANRVVSGVYGEVLGVQIVRSRKCPKGTAYMVRKGALRI 224 (272) T ss_pred HHhccC--CCccEEEEcHHHHHHHHHhcccccccccccc---ccccccccchhhcCeeEEEcCCCCcceEEEEcCCeEEE Confidence 998876 45789999999999998764 344433333 34678999999999999999999998888888777665 Q ss_pred cccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccce--eeeccceeccc Q lcl|NC_021299. 233 LTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDP--VTANLDDEPRF 298 (387) Q Consensus 233 ~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~--~~~~~~~~~~~ 298 (387) ..+.....+ .+.+.....+.......++.....+. ........++. T Consensus 225 ~~~~~~~ve--------------------~~r~~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:30 225 MLKRNTMVE--------------------TDRDITKAINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred EecCCceee--------------------eccccccceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 543211100 01111111111111111111111110 00000000000 No 38 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=99.93 E-value=1.9e-27 Score=167.04 Aligned_cols=262 Identities=16% Similarity=0.205 Sum_probs=184.3 Q ss_pred Cccc------cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccccc Q lcl|NC_021299. 1 MANA------FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDL 74 (387) Q Consensus 1 Ma~~------~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (387) ||++ +|+||+|+++++++|++.+++.+++.+++ +|.+++|++|+||++..... ....+++..++.+++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~--~~~g~~G~tv~iP~~~~~~~----a~~v~eg~~i~~~~~ 74 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDT--TLEGQPGTTLTVPKWDYIGD----AEDVAEGEAIPMTQL 74 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccc--cccCCCCCEEEEEEecCCCC----cccccCCCccccccc Confidence 9975 49999999999999999999999998875 46788899999998653211 112345677889999 Q ss_pred ccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHHHHH Q lcl|NC_021299. 75 TEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVSNRR 154 (387) Q Consensus 75 ~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~ 154 (387) +.+++.+++.+ .++.+.++|++..+...|++..+.+++.+++++++|++++..+.++... .+....++.+++|.. T Consensus 75 ~~~~~~~~~~~-~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~----~~~~~t~d~i~da~~ 149 (272) T protein:vir:98 75 GFKKTTMTIKK-AGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQT----VEATATVDGVSKALD 149 (272) T ss_pred ccceEEEEeee-eeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc----cccccCHHHHHHHHH Confidence 99999999966 5688999999999999999999999999999999999999888776543 344567899999999 Q ss_pred HHhhccCCcCCcEEEEchHHHHHHhccc--chhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeeeccccccc Q lcl|NC_021299. 155 WLNEQKVPKDGRVLLVGSAVEEALLLDD--RFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTAYAM 232 (387) Q Consensus 155 ~l~~~~vp~~~r~~v~~~~~~~~l~~~~--~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a~~~ 232 (387) .|++++ ...|+++++|+.+..|+++. +|.+....+ ...+++|.+|++.|+.|+.++.+|.+..+.++..++.+ T Consensus 150 ~l~~~~--~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~---~~~~~~g~ig~i~G~~Vi~s~~~p~~t~~~~~~~a~~~ 224 (272) T protein:vir:98 150 IFNDED--DAETVIVMNPADASTLRLDAAKEWLGATEVG---ANRVVSGVYGEVLGVQIVRSRKCPKGTAYMVRKGALRI 224 (272) T ss_pred HHhccC--CCccEEEEcHHHHHHHHHhcccccccccccc---ccccccccchhhcCeeEEEcCCCCcceEEEEcCCeEEE Confidence 998876 45789999999999998764 344433333 34678999999999999999999998888888777665 Q ss_pred cccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccce--eeeccceeccc Q lcl|NC_021299. 233 LTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDP--VTANLDDEPRF 298 (387) Q Consensus 233 ~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~--~~~~~~~~~~~ 298 (387) ..+.....+ .+.+.....+.......++.....+. ........++. T Consensus 225 ~~~~~~~ve--------------------~~r~~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:98 225 MLKRNTMVE--------------------TDRDITKAINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred EecCCceee--------------------eccccccceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 543211100 01111111111111111111111110 00000000000 No 39 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=99.92 E-value=1.3e-26 Score=162.54 Aligned_cols=285 Identities=10% Similarity=0.019 Sum_probs=166.1 Q ss_pred CccccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccccceEE Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEVTVD 80 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (387) ||.-.+ +|+|++++++.|++.+++..|.++.+.+++....|++|+||..+.....||+....+. ...+++.+..+ T Consensus 1 MA~~n~-a~~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~~~gg~tVkI~~i~~~gl~DY~R~~~g~----~~g~~~~~~~t 75 (299) T protein:vir:79 1 MAALNY-AKEYSNVLAQAYPYTLNFGDLYATPNNGRYRWTGSKTIEIPTISTTGRVDSNRDTIAV----AQRNYDNAWEP 75 (299) T ss_pred Cccchh-HHHHHHHHHHHHHhhceeeeeccCcccceeeecCCCEEEEeccccccccccccCCCcc----cccccCcceeE Confidence 996456 5999999999999999999998887777765555899999999888888887533222 22356778889 Q ss_pred EEEEeeeecceeec--cHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccc-----ccccCCcchhHHHHHHHH Q lcl|NC_021299. 81 IKLTDVIYNRIDLT--DEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYE-----KVSLVDEDEIWNGVVSNR 153 (387) Q Consensus 81 ~~id~~~~~~~~~~--d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~-----~~~~~~~~~~~~~i~~a~ 153 (387) ++||+.+++.|.++ |.+++...........+.+...+++++|.+.++.+...... ..+..++.++|+.|.++. T Consensus 76 ~~ldqdr~~~f~vD~~Dvdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl~~~a~~~g~~~~~~~~T~~n~y~~i~~~~ 155 (299) T protein:vir:79 76 KVLTNQRKWSTLVHPADINQTNYVASIGNITKVYNEEQKFPEMDAYCISKIYADWTALGNTADTTVLTTTNVLEVFDKLM 155 (299) T ss_pred EEeeccccceeccchhhHHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHHHhhhhcCCcccccccCHHHHHHHHHHHH Confidence 99999999999999 55554332222222233345668899999988765432221 233456788999999999 Q ss_pred HHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeee--eeccceeeeeeecccccc Q lcl|NC_021299. 154 RWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVT--VDTLPHGDAYLSHPTAYA 231 (387) Q Consensus 154 ~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~--s~~~~~~~~~~~~~~a~~ 231 (387) ++|+++++|.++|+++++|+++..|.++++|.+....+.. ...++|.+|++.||.|++ ++.++.. +.+..+ .+ T Consensus 156 ~~lde~~vP~~~rvl~vtp~~~~~L~~~~~f~k~~~~~~~--~~~~~g~Vg~idG~~Ii~Vps~r~~t~--~~~~~G-~~ 230 (299) T protein:vir:79 156 EKMTEARVPENGRILYVTPVVNTLIKNAKEIQRTVNIKDA--GTSLNRQTTDIDTVKIIKVPSNLMKTA--YDFTTG-WK 230 (299) T ss_pred HHHHhcCCCCCCeEEEeCHHHHHHHhhchhhhcccccccc--cceeeeeeeeecceEEEEechhhcCcc--ceeccC-cc Confidence 9999999999999999999999999999999887776543 246899999999999986 3333321 111110 00 Q ss_pred ccccccccccCceeeeeeecccccceee-----eeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeee Q lcl|NC_021299. 232 MLTRSPGRPMTNTVATSTVATENGVQLR-----WLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHL 306 (387) Q Consensus 232 ~~~~~~~~~~~~t~~~~~~~~~~~~~~~-----~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~ 306 (387) .... +..+.+ .......... ...-+.+..+.... +..........-........+.+.. T Consensus 231 ~~~~------ak~in~--ii~~~~a~~~~~K~~~~~~~~P~~~~~~~--------~~~~~r~y~d~~v~~nk~~~i~~~~ 294 (299) T protein:vir:79 231 VGAG------AKQIFM--SLVHPSAIITPVSYQFSKLDEPTAVTEGK--------YFYFEESFEDVFILNKKADAIQFVV 294 (299) T ss_pred ccCc------ccccce--EEEcCCeeeeeEeeeeEEeecCCCCCccc--------eeeeeeeeeeeeeeccccCeEEEEe Confidence 0000 000000 0000000000 00000000000000 0000000000000000000001110 Q ss_pred eeeec Q lcl|NC_021299. 307 KATDA 311 (387) Q Consensus 307 ~~~~~ 311 (387) ....- T Consensus 295 ~~a~~ 299 (299) T protein:vir:79 295 EGAGA 299 (299) T ss_pred eecCC Confidence 00000 No 40 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=99.90 E-value=1.3e-25 Score=157.09 Aligned_cols=265 Identities=12% Similarity=0.134 Sum_probs=182.0 Q ss_pred Ccccc----ccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccccccc Q lcl|NC_021299. 1 MANAF----IKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTE 76 (387) Q Consensus 1 Ma~~~----~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (387) ||.+. ++||+|++++.+++.+.++|.+++..|+ ++.+++|++|+||.+.. ..+.+ ....+..+++++++. T Consensus 1 Ma~T~~~d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~--~L~g~~G~ti~~P~~~~--igdae--~~~eg~~i~~~~lt~ 74 (270) T protein:vir:95 1 MTQTKKANLINPEVLANVVSAQMQNAIRFTPYAVTDD--TLVGQPGDTITRPKYAY--IGAAE--DLQEGVAMDTTQMSM 74 (270) T ss_pred CCceehhhhcchHHHHHHHHHHHHhHHhhcccccccc--ccCCCCCCEEEeeeecC--CCccc--cccCCCccchhhccc Confidence 99875 5999999999999999999999998875 57789999999988763 23333 234566788999999 Q ss_pred ceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHHHHHHH Q lcl|NC_021299. 77 VTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVSNRRWL 156 (387) Q Consensus 77 ~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l 156 (387) ++...+| ++..++|.++|++......|++.+..+|+...+|+++|+++++.++++.... +....++.|.+|...| T Consensus 75 ~~~~a~i-~~~gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~~a~~~~----~~~~t~~~~~dA~~~l 149 (270) T protein:vir:95 75 TTTKVTV-KETGKAVEVTQTAIITNVNGTLQEASRQLAMSLADKVEIDYIAELNKSKQTA----TVSADATGILDAIEVF 149 (270) T ss_pred chheeee-ehhhCcceecHHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHhccccccc----ccccCHHHHHHHHHHh Confidence 9999999 5568999999999999999999999999999999999999999988776543 3345678999999999 Q ss_pred hhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeee-eeccceeeeeeecccccccccc Q lcl|NC_021299. 157 NEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVT-VDTLPHGDAYLSHPTAYAMLTR 235 (387) Q Consensus 157 ~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~-s~~~~~~~~~~~~~~a~~~~~~ 235 (387) .++. ...++++++|..+..|.++. +..... .+...+++|.+|.+.|++++. ++..+.+..+.++..++.+... T Consensus 150 gd~~--~~~~~i~vhs~~~~~Lrk~~-~~~~~~---~~~~~~~~G~ig~~~G~~Viv~s~~~~~~~~~l~~~gAi~~~~~ 223 (270) T protein:vir:95 150 NSEN--DEDYVLYVNPKDYNKLVKSL-FKVGGN---VQDRAISKGDLVEIVGVSDIVKSKRVSENTAFLQRYGAMEIVNK 223 (270) T ss_pred cccc--CCCcEEEEcHHHHHHHHhhh-cccccc---cccchhcccccceecceeEEEeCCCCCceeEEEEeccceeeeec Confidence 8764 34578999999999998865 333222 234467899999999999755 4455555666666655554332 Q ss_pred ccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeeeeccccc Q lcl|NC_021299. 236 SPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKATDAEIEG 315 (387) Q Consensus 236 ~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~~~~~~~ 315 (387) .. .....+.+.....+......+++.......... .++..+ . T Consensus 224 ~~--------------------~~vEtdRd~~~~~d~i~~~~~y~v~~~~~skvv------------~~t~~~-----a- 265 (270) T protein:vir:95 224 KK--------------------PEAYTDFDILKRTHLLSTNYHYSVNLKDETGVV------------KVTFKP-----S- 265 (270) T ss_pred CC--------------------ceeeeccchhhcccEEEeeeEEEEEEEccceEE------------EEEecC-----C- Confidence 21 111122222222222222333332222211111 111000 0 Q ss_pred ccccc Q lcl|NC_021299. 316 ETVKA 320 (387) Q Consensus 316 ~~~~~ 320 (387) .+... T Consensus 266 ~~~~~ 270 (270) T protein:vir:95 266 GSLEM 270 (270) T ss_pred CCcCC Confidence 00000 No 41 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=99.89 E-value=1.7e-24 Score=150.91 Aligned_cols=311 Identities=10% Similarity=-0.038 Sum_probs=175.8 Q ss_pred Ccccc---------------ccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccc Q lcl|NC_021299. 1 MANAF---------------IKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGA 65 (387) Q Consensus 1 Ma~~~---------------~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~ 65 (387) |++-+ |-=|+|..|++..|....+|.++++. +++ +.|++++||..+...+.++.. T Consensus 1 ms~~n~~t~~~~~~~~~~~al~le~f~geV~taf~~~s~~~~~~~~---rti--~~gkS~q~~~iG~~~~~~~~~----- 70 (364) T protein:vir:10 1 MSNPNVLTQPAVSASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDV---QEV--VGTNSVSNKYIGETELQVLSP----- 70 (364) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCccee---eee--cccceEEeeeeeeeEEeeecc----- Confidence 65322 11389999999999999999888753 345 459999999999888876653 Q ss_pred cccccccccccceEEEEEEeeeecceeeccHHHhhhhhh-HHHHHHHHHHHHHHHHHHHHHHHHHhccc-ccc------- Q lcl|NC_021299. 66 DRNMVASDLTEVTVDIKLTDVIYNRIDLTDEERELDVRS-FAVDVLPRQVRAVAEQIEDAVSYLITKAP-YEK------- 136 (387) Q Consensus 66 ~~~~~~~~~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~-~~~~~~~~~~~~la~~vd~~~~~~~~~~~-~~~------- 136 (387) +..++.+.+...+..|+||+.+++.+.|.|.|+.+...| ++.++.++++++||+..|+.++.++..+. .+. T Consensus 71 G~~ld~~~~~~~k~~itID~ll~a~~~V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~ 150 (364) T protein:vir:10 71 GKSPDASPTEFDKNRLVVDTTVIARNTVAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNP 150 (364) T ss_pred CcccCCCCcccCcEEEEecceeeechhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCC Confidence 344666778888999999999999999999999999999 79999999999999999999987664221 100 Q ss_pred ---cc------cCC-------cchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeee Q lcl|NC_021299. 137 ---VS------LVD-------EDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQT 200 (387) Q Consensus 137 ---~~------~~~-------~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~ 200 (387) +. ..+ ....++.|.++.+.|++.+||.++|+++++|++|..|+++++|...+...++ .....+ T Consensus 151 ~~~~~g~~i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~~R~~vv~P~~y~~Ll~~~~lvn~d~~~~~-~~~~~~ 229 (364) T protein:vir:10 151 RVAGHGFSIHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDTSELCGLMPWTAFNCLRDADRIVDKSYTIAA-SDNTVD 229 (364) T ss_pred cccCCcceeeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCccccEEEeChHHHHHHhcCCccccccccccC-CCcccc Confidence 00 011 1123567889999999999999999999999999999999998876654332 345789 Q ss_pred eEEEEeecceeeeeeccceeeeee-------eccccccc-cccc-----cccccCceeeeeeecccccceeeeeeeeeec Q lcl|NC_021299. 201 ARIGRLAQYDVVTVDTLPHGDAYL-------SHPTAYAM-LTRS-----PGRPMTNTVATSTVATENGVQLRWLGDYDAT 267 (387) Q Consensus 201 g~ig~~~g~~v~~s~~~~~~~~~~-------~~~~a~~~-~~~~-----~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~ 267 (387) |.++.+.|+.|++++++|...... .|+-+... ...- .....+..++-.+.....-..+.....++.. T Consensus 230 G~v~~v~Gv~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e~~~~~~ 309 (364) T protein:vir:10 230 GFVLKSWNTPIVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISITGDIFYEKK 309 (364) T ss_pred ceeEEEeceEEEeccccccccccccccccccccccccccCCcccccccccceeEEEEEecceEEEEEEecceeeeeeccc Confidence 999999999999999998643221 11100000 0000 0000000000000000000000000000000 Q ss_pred cceeeeeeeeeeeeccccceeeeccceeccccccceeeeeeeeccccccccccccceeEEEeeccCCccccCcceEEEec Q lcl|NC_021299. 268 STTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKATDAEIEGETVKAGEKLALALEDSNGDNRAGDPLVTWTSG 347 (387) Q Consensus 268 ~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~w~Ss 347 (387) ...+.......+|.....+............- .--.-.++-.-. ..+++|.-+ T Consensus 310 ~~~~~ida~~a~G~g~lRPeaa~~i~~~~~~~-----------~~~~~~~~~~~~----------------~~~~~~~~~ 362 (364) T protein:vir:10 310 EKTWYIDTFLAEGAIPDRWEAVAVVTAADTAE-----------LATDHNAILARA----------------NRKVTLTKS 362 (364) T ss_pred eeeeeeeeehcccCcccCccceEEEEecCCCC-----------Cccchhhhhhhc----------------cccEEEEEe Confidence 00000000111111111110000000000000 000000000001 122222211 Q ss_pred CceEEEEc Q lcl|NC_021299. 348 TTAKATID 355 (387) Q Consensus 348 n~~VAtVd 355 (387) |+ T Consensus 363 ------~~ 364 (364) T protein:vir:10 363 ------VN 364 (364) T ss_pred ------cC Confidence 11 No 42 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=99.88 E-value=4.3e-25 Score=154.15 Aligned_cols=292 Identities=12% Similarity=0.078 Sum_probs=176.3 Q ss_pred Cccc----------------cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccc Q lcl|NC_021299. 1 MANA----------------FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATG 64 (387) Q Consensus 1 Ma~~----------------~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~ 64 (387) |+|- +|+ |+|..|++..|....+|.+++.. .++ +.|++++||..+...+..+. T Consensus 1 ms~~~~~tr~~~~~s~~d~al~l-e~f~geV~~af~~~s~~~~~~~~---rti--~~g~s~~~~~iG~~~~~~~~----- 69 (335) T protein:vir:63 1 MSFLNDLTRPNYAGKNADVDIHL-EEHLGIVDKHFAYTSKFAPLMNI---RDL--RGSNVVRLDRLGNVEAKGRR----- 69 (335) T ss_pred CCCcccchhhhcccccchhheeh-hhhhhhHHHHHHhhhhhccccce---eee--ccceeEEEeeeeeeeeeccc----- Confidence 6642 344 89999999999999999998863 234 45999999999988888765 Q ss_pred ccccccccccccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccc-c------- Q lcl|NC_021299. 65 ADRNMVASDLTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYE-K------- 136 (387) Q Consensus 65 ~~~~~~~~~~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~-~------- 136 (387) .+.++..+.+......|+||..++..+.|.|.|+.+..+|++.++.++++++||+..|+.++..+..+... . T Consensus 70 pG~~l~~~~~~~~k~~itVD~ll~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~ 149 (335) T protein:vir:63 70 AGEELERSRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDA 149 (335) T ss_pred CCcCcCCCCccccceEEEecceeechhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCCC Confidence 35566667777888999999999999999999999999999999999999999999999987544322111 0 Q ss_pred ---c-------ccC----CcchhHHHHHHHHHHHhhccCCcCC---cEEEEchHHHHHHhcccchhhhhhcccccceeee Q lcl|NC_021299. 137 ---V-------SLV----DEDEIWNGVVSNRRWLNEQKVPKDG---RVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQ 199 (387) Q Consensus 137 ---~-------~~~----~~~~~~~~i~~a~~~l~~~~vp~~~---r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~ 199 (387) + ++. .....+..+.++..+|++++||+++ |+++++|++|..|+.+++|.+.+....++..... T Consensus 150 ~~~G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~ 229 (335) T protein:vir:63 150 FSPGVLEKLDLTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQATGATNDYV 229 (335) T ss_pred cCCCcceeeeeccCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhcccccccccccccccccccc Confidence 0 011 1122345677899999999999754 9999999999999999999887766555556678 Q ss_pred eeEEEEeecceeeeeeccceeeeeeecc--ccccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeee Q lcl|NC_021299. 200 TARIGRLAQYDVVTVDTLPHGDAYLSHP--TAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDT 277 (387) Q Consensus 200 ~g~ig~~~g~~v~~s~~~~~~~~~~~~~--~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~ 277 (387) +|.+.++.|+.|++++++|.... ..|. .++....+......+.-++..+.....-..+.....++.....+...... T Consensus 230 ~g~v~~v~Gv~V~~sn~lP~~~~-t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~~~~~~i~~~~ 308 (335) T protein:vir:63 230 KSRVAILNGVKVLETPRFATKAI-AAHPLGRHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNEKFSWVLDTFQ 308 (335) T ss_pred CceeEEeeceEEEeeccCCCCCc-ccccccccCCccccccceeEEEEEecceEEEEEEeecccceeeccchhhHHhHHHH Confidence 99999999999999999986532 2221 11110000000000000000000000000000000000000000000000 Q ss_pred eeeeccccceeeeccceeccccccceeeeeeeeccccccccccccceeEEEee Q lcl|NC_021299. 278 WIGVKAVLDPVTANLDDEPRFVRGTRIHLKATDAEIEGETVKAGEKLALALED 330 (387) Q Consensus 278 ~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~ 330 (387) .+|.....+. .... +.+.. .| .+..+. T Consensus 309 a~G~g~lRPe----------~a~~--i~~tg-----------~~---~~~~~~ 335 (335) T protein:vir:63 309 MYNIGARRPD----------TAGA--IELKG-----------IG---AFDITA 335 (335) T ss_pred HcCCcccccc----------eEEE--EEEcC-----------CC---ceeecC Confidence 0111111110 0000 00000 00 000000 No 43 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=99.88 E-value=5.4e-25 Score=153.63 Aligned_cols=291 Identities=11% Similarity=0.065 Sum_probs=177.1 Q ss_pred Cccc----------------cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccc Q lcl|NC_021299. 1 MANA----------------FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATG 64 (387) Q Consensus 1 Ma~~----------------~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~ 64 (387) |+|- +|+ |+|+.|++..|....+|.+++.+- ++ +.|++++||..+...+..+. T Consensus 1 ms~~~~~t~~~~~~s~~d~al~l-e~f~geV~~af~~~s~~~~~~~~r---ti--~~g~s~~~~~iG~~~~~~~~----- 69 (335) T protein:vir:78 1 MSFLNDLTRPNYAGKNADVDIHL-EEHLGIVDKHFAYTSKFAPLMNIR---DL--RGSNVVRLDRLGNVEAKGRR----- 69 (335) T ss_pred CCccccccccccccccchhhhhh-hhhhhHHHHHHHHhhhhcccccee---ee--ccceeEEEeeeeeeeecccc----- Confidence 6642 355 899999999999999999988642 34 55999999999888887654 Q ss_pred ccccccccccccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccc-c------- Q lcl|NC_021299. 65 ADRNMVASDLTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYE-K------- 136 (387) Q Consensus 65 ~~~~~~~~~~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~-~------- 136 (387) .+.++..+.+......|+||+.++..+.|.|.|+.+..+|++.++.++++++||+..|+.++..+..+... . T Consensus 70 pG~~l~~~~~~~~k~~itID~ll~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~ 149 (335) T protein:vir:78 70 AGEELERSRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDA 149 (335) T ss_pred cCcccCCCCcccCCeEEEecceeechhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCC Confidence 45566777788889999999999999999999999999999999999999999999999887544322110 0 Q ss_pred ---c-------ccC----CcchhHHHHHHHHHHHhhccCCcC---CcEEEEchHHHHHHhcccchhhhhhcccccceeee Q lcl|NC_021299. 137 ---V-------SLV----DEDEIWNGVVSNRRWLNEQKVPKD---GRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQ 199 (387) Q Consensus 137 ---~-------~~~----~~~~~~~~i~~a~~~l~~~~vp~~---~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~ 199 (387) + ++. ......+.+.++...|++.++|+. +|+++++|++|..|+.+++|.+.+...+++..... T Consensus 150 ~~~G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~ 229 (335) T protein:vir:78 150 FSPGVLEKLDLTGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMSVEYQATGATNDYV 229 (335) T ss_pred cCCCcceeeeeccccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhcccccccccccccccccccc Confidence 0 001 112235677889999999999965 69999999999999999999887766565556778 Q ss_pred eeEEEEeecceeeeeeccceeeeeeecc--ccccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeee Q lcl|NC_021299. 200 TARIGRLAQYDVVTVDTLPHGDAYLSHP--TAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDT 277 (387) Q Consensus 200 ~g~ig~~~g~~v~~s~~~~~~~~~~~~~--~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~ 277 (387) +|.++++.|+.|++++++|... +..|. .++....+........-++..+.....-..+.....++.....+...... T Consensus 230 ~g~v~~v~Gv~V~~Sn~lP~~~-~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~~~~i~~~~ 308 (335) T protein:vir:78 230 KSRVAILNGVKVLETPRFATKA-ISAHPLGRHFNVSAEEAERQIALFLPSKTLITAQVAPVQAKLWEDHDQFSWVLDTFQ 308 (335) T ss_pred cceeEEeeceEEEeeccCCCCC-CccccccccCCcccccccceEEEEEecceEEEEEEEecccceeeccchhhHhhhHHH Confidence 9999999999999999999653 22221 11110000000000000000000000000000000000000000000000 Q ss_pred eeeeccccceeeeccceeccccccceeeeee-eeccccc Q lcl|NC_021299. 278 WIGVKAVLDPVTANLDDEPRFVRGTRIHLKA-TDAEIEG 315 (387) Q Consensus 278 ~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~-~~~~~~~ 315 (387) .+|.....+. ...... +.. ..++++. T Consensus 309 a~G~g~lRPe----------~a~~i~--~tg~~~~~~~~ 335 (335) T protein:vir:78 309 MYNIGARRPD----------TAGAIE--LKGIEAFDITA 335 (335) T ss_pred HcCCcccCcc----------eEEEEE--ecCCCcccccC Confidence 0111111110 000000 000 0000000 No 44 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=99.88 E-value=4.7e-24 Score=148.46 Aligned_cols=288 Identities=10% Similarity=-0.025 Sum_probs=173.8 Q ss_pred CccccccHHHHHHHHHHHHHhh-ccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccccceE Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHE-LVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEVTV 79 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~-~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (387) -+|++.-.|++.+.+.+.|... +..+.++|++|+ + ..|++|+||..+.....||+. +.....++++.+.. T Consensus 36 ~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e--~--~~g~tVkIp~i~~~gl~DY~R-----~~g~~~g~vt~~~~ 106 (329) T protein:vir:10 36 EPGDTLLKNKHVGILEKVTAANSYSAPAVISNDAI--F--MQGRSFTVIKGDVTELKDYKR-----NATNEFDHPQIQET 106 (329) T ss_pred CCchhHHHHHHHHHHHHHHHhhceeeeeeccccee--e--ccCcEEEEeeecccccccccC-----CCCcccccccccee Confidence 5677766789999999888765 556778999886 3 359999999998888888753 34566778899999 Q ss_pred EEEEEeeeecceeeccHHHhhhhhhH--HHHHHHHHHHHHHHHHHHHHHHHHhcc-cccccccCCcchhHHHHHHHHHHH Q lcl|NC_021299. 80 DIKLTDVIYNRIDLTDEERELDVRSF--AVDVLPRQVRAVAEQIEDAVSYLITKA-PYEKVSLVDEDEIWNGVVSNRRWL 156 (387) Q Consensus 80 ~~~id~~~~~~~~~~d~~~~~~~~~~--~~~~~~~~~~~la~~vd~~~~~~~~~~-~~~~~~~~~~~~~~~~i~~a~~~l 156 (387) +++||+.+++.|.+++.|..+....+ .....+++...+++++|.+.++.+... ......+.++.+.|+.|.++..+| T Consensus 107 t~tidqdR~~~F~VD~~D~dEtn~~l~a~~i~~~~~~~~v~pEiDay~~skla~~a~~~~~~~~t~~nay~~i~~a~~~L 186 (329) T protein:vir:10 107 TYFLDQEKYWGRFVDALDRRDTEGNIDINYVVAKQASEVVAPYLDNLRFATLARNKAKHLTVGSGADAQYDAVLDVSVEL 186 (329) T ss_pred EEEeecccceeeecchhhHhhhhhhhhHHHHHHHHHHHHhhhHHHHHHHHHHHhhcccccccccCHHHHHHHHHHHHHHH Confidence 99999999999999998887765544 233345577889999999998776543 333344567788999999999999 Q ss_pred hhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeecc--ceeeeeeeccccccccc Q lcl|NC_021299. 157 NEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTL--PHGDAYLSHPTAYAMLT 234 (387) Q Consensus 157 ~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~--~~~~~~~~~~~a~~~~~ 234 (387) +++++| ++|+++++|+++..|.++++|....... ....++|.++++.||.|++.+.. ...+.+..|+.+..+.. T Consensus 187 de~~vp-~~Rvl~VtP~~~~~Lk~~~~f~~~~~~~---~~~~~~g~Vg~idG~~Ii~vps~~~k~in~ii~~~~A~~~~~ 262 (329) T protein:vir:10 187 DEIGAG-ASRILFVTPKFYKGIKKFVIELPQGDNR---QQVLGKGVQGELDGFTIVKVPSKMLQGVEAMAVIGEVMASPI 262 (329) T ss_pred HhcCCC-CCcEEEeCHHHHHHHHhhhhhhcccccc---ccceeeeeeeeecCeEEEEecCCcccceeEEEEcCCceeeee Confidence 999999 5899999999999999988887643332 34678999999999999876432 22233445555544322 Q ss_pred cccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeeeecccc Q lcl|NC_021299. 235 RSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKATDAEIE 314 (387) Q Consensus 235 ~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~~~~~~ 314 (387) ....... +.......+..+.....++.-.... ...+.+. ......... . -... T Consensus 263 K~~~~~~-----~~p~~~~~a~~v~gr~yyd~~V~~~-----k~~~I~~----------~~~~a~~~~-----~--~~~~ 315 (329) T protein:vir:10 263 QANEAKL-----NSNVPGMFGTLAEQMLYTGAFVPEH-----LQKYIFT----------IGGKEVETN-----R--DGVD 315 (329) T ss_pred eeeeeee-----eCCCCccchheeeeeeeeeeEEEcc-----ccCEEEE----------ecccCcccC-----C--CCCC Confidence 2110000 0000000011111111111100000 0000000 000000000 0 0000 Q ss_pred ccccccccceeEEE Q lcl|NC_021299. 315 GETVKAGEKLALAL 328 (387) Q Consensus 315 ~~~~~~~~~~~~~~ 328 (387) +.++..+......+ T Consensus 316 ~~~~~~~~~~~~~~ 329 (329) T protein:vir:10 316 AHADETNASADTGA 329 (329) T ss_pred ccccccccccccCC Confidence 00000000000000 No 45 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=99.88 E-value=3.7e-25 Score=154.50 Aligned_cols=282 Identities=11% Similarity=0.045 Sum_probs=152.6 Q ss_pred eeeecccccccccCCCEEEEEecccceeeceecccccccccc--cccccccceEEEEEEeeeecceeeccHHHhhhhhhH Q lcl|NC_021299. 28 FVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNM--VASDLTEVTVDIKLTDVIYNRIDLTDEERELDVRSF 105 (387) Q Consensus 28 ~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~ 105 (387) ++ +.+. .|++++|+..+...+..+.. +..+ ..+++......|+||+.+++.+.++|.|+.+..+|+ T Consensus 1 ~v-----r~i~--~g~s~~~~~iG~~~~~~~~~-----G~~l~~~~~~~~~~e~~itID~~l~~~~~VdDiD~~qa~~Dl 68 (324) T protein:vir:99 1 MT-----RTIT--SGKSAQFPVMGRTKARYLKQ-----GQSLDDGREDIKHTEKVITIDGLLTTDVLIYDIEDAMNHYDV 68 (324) T ss_pred Ce-----eeee--cCceEEEeeeeeeEeccccC-----CCCcCCCcCCcCcccEEEEecchhhhhhhhhhHHHHhcCccc Confidence 22 3354 49999999999888887753 3334 346788899999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcccc---------ccc---------c------cCCcchhHHHHHHHHHHHhhccC Q lcl|NC_021299. 106 AVDVLPRQVRAVAEQIEDAVSYLITKAPY---------EKV---------S------LVDEDEIWNGVVSNRRWLNEQKV 161 (387) Q Consensus 106 ~~~~~~~~~~~la~~vd~~~~~~~~~~~~---------~~~---------~------~~~~~~~~~~i~~a~~~l~~~~v 161 (387) +.++.++++++||+.+|+.++..+..... ..+ . .......|+.+++++++|++++| T Consensus 69 r~e~s~~~G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~Lde~~V 148 (324) T protein:vir:99 69 RSEYSTQMGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAFAKKYI 148 (324) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHHHHHhhcCC Confidence 99999999999999999998866532110 000 0 01122458899999999999999 Q ss_pred CcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeeecccccccccccccccc Q lcl|NC_021299. 162 PKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTAYAMLTRSPGRPM 241 (387) Q Consensus 162 p~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a~~~~~~~~~~~~ 241 (387) |.++||+|++|++|..|+.+..+......+ ...+++|.++++.||+|++++++|........ .++......+.... T Consensus 149 P~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~---~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~-~a~~~~~~~~~~~~ 224 (324) T protein:vir:99 149 PAGDRTFYTDPDTYSAILAALMPNAANYAA---LIDPETGNIRNVMGFEVVETPHMTAQMVTNPT-DAFDGTGHIFPATG 224 (324) T ss_pred CCCCCEEEeChHHHHHHhhccccccccccc---ccceecceEEEEeceEEEecCCcccccccccc-cccccccccccccc Confidence 999999999999999887655554433322 34588999999999999999999975332211 11111111111000 Q ss_pred Cceeeeeeeccc-ccceeee---------------eeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceee Q lcl|NC_021299. 242 TNTVATSTVATE-NGVQLRW---------------LGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIH 305 (387) Q Consensus 242 ~~t~~~~~~~~~-~~~~~~~---------------~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~ 305 (387) ............ ....+.+ ...++.....+.......+|.....+..... ++ T Consensus 225 ~~~~~~ky~~d~~~~~gl~~~~~a~~tv~~~~~~~e~~~~~~~~~d~i~~~~a~G~~~lRPe~a~~------------v~ 292 (324) T protein:vir:99 225 DSTTTGKMTVGADNVVGLFVHRSAVATLKLKDMALERARRPEYQADQIIAKYAMGHGGLRPEAVGA------------II 292 (324) T ss_pred ccccccccccccCceeEEEEehhheEEEeeecceecceechhhHHHhhhhhhhhcCcccccceEEE------------EE Confidence 000000000000 0000000 0001111111111111111111111110000 01 Q ss_pred eeeeeccccccccccccceeEEEeeccCCccccCcceEEEecCceEEEEcC Q lcl|NC_021299. 306 LKATDAEIEGETVKAGEKLALALEDSNGDNRAGDPLVTWTSGTTAKATIDA 356 (387) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~w~Ssn~~VAtVd~ 356 (387) ++ .+.+-.+++.... ......-.-..+|+.++ T Consensus 293 l~--------------~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~ 324 (324) T protein:vir:99 293 FE--------------DGETPAVAPDVIT-----GVASFAAPASTRAKSSA 324 (324) T ss_pred Ec--------------cCccccccchhhh-----hhccccCcccceeeecC Confidence 10 0000000000000 00000001112222222 No 46 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=99.87 E-value=4.7e-24 Score=148.50 Aligned_cols=289 Identities=11% Similarity=-0.010 Sum_probs=172.5 Q ss_pred CccccccHHHHHHHHHHHHHhhccc-cceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccccceE Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHELVL-PNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEVTV 79 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~~~~-~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (387) =+|++.--|.|++.+.+.+...+.- +.++|++|+.. .|++|+||..+.....||+. +.....++++.+.. T Consensus 25 ~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~----gg~tVkIp~i~~~gl~DY~R-----~~g~~~g~vt~~~~ 95 (319) T protein:vir:94 25 EPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFM----EGRSFTVMKGDTTELKDYKR-----NATNEFDHPKIEET 95 (319) T ss_pred CcchHHHHHHHHHHHHHHHHHhhhhhhcccCcceEec----cCcEEEEeeecccccccccC-----CCCcccCCccccee Confidence 3455555678998766666655543 34678887643 59999999988888887753 34566778899999 Q ss_pred EEEEEeeeecceeeccHHHhhhhhhH--HHHHHHHHHHHHHHHHHHHHHHHHhcc-cccccccCCcchhHHHHHHHHHHH Q lcl|NC_021299. 80 DIKLTDVIYNRIDLTDEERELDVRSF--AVDVLPRQVRAVAEQIEDAVSYLITKA-PYEKVSLVDEDEIWNGVVSNRRWL 156 (387) Q Consensus 80 ~~~id~~~~~~~~~~d~~~~~~~~~~--~~~~~~~~~~~la~~vd~~~~~~~~~~-~~~~~~~~~~~~~~~~i~~a~~~l 156 (387) +++||+.+++.|.+++.|..+....+ .....+++...+++++|.+.++.+... ......+.++.++|+.|.++..+| T Consensus 96 t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~~~~~~~t~~n~y~~i~~a~~~L 175 (319) T protein:vir:94 96 TYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTVGTGSDAQYDAVLDVSVEL 175 (319) T ss_pred EEEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcccccccccCHHHHHHHHHHHHHHH Confidence 99999999999999998888776554 333346667788999999988776543 333344567788999999999999 Q ss_pred hhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeec--cceeeeeeeccccccccc Q lcl|NC_021299. 157 NEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDT--LPHGDAYLSHPTAYAMLT 234 (387) Q Consensus 157 ~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~--~~~~~~~~~~~~a~~~~~ 234 (387) ++.+|| ++|+++++|+++..|.++++|.+....++ ..+++|.++++.||.|++.+. +...+.+..|+.+..+.. T Consensus 176 de~~VP-~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~---~~~~~g~Vg~idG~~Vi~vps~~~k~in~i~~h~~A~~~~~ 251 (319) T protein:vir:94 176 DEIKAP-ENRVLFVSPTFYKGIKKFVIALPQGDTRQ---QVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASPI 251 (319) T ss_pred HhcCCC-CCcEEEeCHHHHHHHHhhhhhhccccccc---cceeeeeceeecCeEEEEecccccccceEEEEcCCeeeeee Confidence 999999 69999999999999999999988776654 356899999999999987643 223334455555544322 Q ss_pred cccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeeeecccc Q lcl|NC_021299. 235 RSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKATDAEIE 314 (387) Q Consensus 235 ~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~~~~~~ 314 (387) ....... ...... ..+..+.....++.-... ....+.+................ ....+--+ T Consensus 252 k~~~~~~----~~p~~~-~~a~~v~gr~y~d~~V~~-----~k~~~Iy~~~~~~~~~~~~~~~~-~~~~~~~~------- 313 (319) T protein:vir:94 252 QADLAKT----NSNIPG-MFGTLAEQLLYTGAFVPE-----HLQKYIFTIGGTEVATKRDGVDA-HADNVAKP------- 313 (319) T ss_pred eeeeeec----cCCCcc-ccceeeeeeeeeeeEEec-----cccceEEEeecCCcccCCCcccc-ccccccCC------- Confidence 2110000 000000 001111111111111100 00001110000000000000000 00000000 Q ss_pred cccccc Q lcl|NC_021299. 315 GETVKA 320 (387) Q Consensus 315 ~~~~~~ 320 (387) ..++.+ T Consensus 314 ~~~~~~ 319 (319) T protein:vir:94 314 SGSLEM 319 (319) T ss_pred cccccC Confidence 001111 No 47 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=99.87 E-value=4.7e-24 Score=148.50 Aligned_cols=289 Identities=11% Similarity=-0.010 Sum_probs=172.5 Q ss_pred CccccccHHHHHHHHHHHHHhhccc-cceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccccceE Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHELVL-PNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEVTV 79 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~~~~-~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (387) =+|++.--|.|++.+.+.+...+.- +.++|++|+.. .|++|+||..+.....||+. +.....++++.+.. T Consensus 25 ~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~----gg~tVkIp~i~~~gl~DY~R-----~~g~~~g~vt~~~~ 95 (319) T protein:vir:97 25 EPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFM----EGRSFTVMKGDTTELKDYKR-----NATNEFDHPKIEET 95 (319) T ss_pred CcchHHHHHHHHHHHHHHHHHhhhhhhcccCcceEec----cCcEEEEeeecccccccccC-----CCCcccCCccccee Confidence 3455555678998766666655543 34678887643 59999999988888887753 34566778899999 Q ss_pred EEEEEeeeecceeeccHHHhhhhhhH--HHHHHHHHHHHHHHHHHHHHHHHHhcc-cccccccCCcchhHHHHHHHHHHH Q lcl|NC_021299. 80 DIKLTDVIYNRIDLTDEERELDVRSF--AVDVLPRQVRAVAEQIEDAVSYLITKA-PYEKVSLVDEDEIWNGVVSNRRWL 156 (387) Q Consensus 80 ~~~id~~~~~~~~~~d~~~~~~~~~~--~~~~~~~~~~~la~~vd~~~~~~~~~~-~~~~~~~~~~~~~~~~i~~a~~~l 156 (387) +++||+.+++.|.+++.|..+....+ .....+++...+++++|.+.++.+... ......+.++.++|+.|.++..+| T Consensus 96 t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~~~~~~~t~~n~y~~i~~a~~~L 175 (319) T protein:vir:97 96 TYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTVGTGSDAQYDAVLDVSVEL 175 (319) T ss_pred EEEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcccccccccCHHHHHHHHHHHHHHH Confidence 99999999999999998888776554 333346667788999999988776543 333344567788999999999999 Q ss_pred hhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeec--cceeeeeeeccccccccc Q lcl|NC_021299. 157 NEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDT--LPHGDAYLSHPTAYAMLT 234 (387) Q Consensus 157 ~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~--~~~~~~~~~~~~a~~~~~ 234 (387) ++.+|| ++|+++++|+++..|.++++|.+....++ ..+++|.++++.||.|++.+. +...+.+..|+.+..+.. T Consensus 176 de~~VP-~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~---~~~~~g~Vg~idG~~Vi~vps~~~k~in~i~~h~~A~~~~~ 251 (319) T protein:vir:97 176 DEIKAP-ENRVLFVSPTFYKGIKKFVIALPQGDTRQ---QVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASPI 251 (319) T ss_pred HhcCCC-CCcEEEeCHHHHHHHHhhhhhhccccccc---cceeeeeceeecCeEEEEecccccccceEEEEcCCeeeeee Confidence 999999 69999999999999999999988776654 356899999999999987643 223334455555544322 Q ss_pred cccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeeeecccc Q lcl|NC_021299. 235 RSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKATDAEIE 314 (387) Q Consensus 235 ~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~~~~~~ 314 (387) ....... ...... ..+..+.....++.-... ....+.+................ ....+--+ T Consensus 252 k~~~~~~----~~p~~~-~~a~~v~gr~y~d~~V~~-----~k~~~Iy~~~~~~~~~~~~~~~~-~~~~~~~~------- 313 (319) T protein:vir:97 252 QADLAKT----NSNIPG-MFGTLAEQLLYTGAFVPE-----HLQKYIFTIGGTEVATKRDGVDA-HADNVAKP------- 313 (319) T ss_pred eeeeeec----cCCCcc-ccceeeeeeeeeeeEEec-----cccceEEEeecCCcccCCCcccc-ccccccCC------- Confidence 2110000 000000 001111111111111100 00001110000000000000000 00000000 Q ss_pred cccccc Q lcl|NC_021299. 315 GETVKA 320 (387) Q Consensus 315 ~~~~~~ 320 (387) ..++.+ T Consensus 314 ~~~~~~ 319 (319) T protein:vir:97 314 SGSLEM 319 (319) T ss_pred cccccC Confidence 001111 No 48 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=99.87 E-value=6.7e-24 Score=147.63 Aligned_cols=272 Identities=14% Similarity=0.066 Sum_probs=166.9 Q ss_pred CccccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccccceEE Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEVTVD 80 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (387) ||.+++ ++|++++++.|.+.+++..+.+++++ +. .|++|+||..+.....||+.. ......+++.+..+ T Consensus 1 Main~a--~~~~~~Ld~~~~~~~~t~~l~~~~~~--~~--ggktVkI~~i~~~gl~DY~R~-----~g~~~g~v~~~~et 69 (290) T protein:vir:78 1 MAINYV--DKYGKELDQKLVFGTYTNELETPNLL--WL--DAKTFKIQTITTTGLKAHTRN-----KGYNEGSASNTNKS 69 (290) T ss_pred CchhHH--HHHHHHHHHHHHhhheeeecccccee--ec--cCCEEEEeeeccCcccccccC-----CCcccCccccceee Confidence 999884 79999999999999999999988865 33 489999999888888887643 33444566778889 Q ss_pred EEEEeeeecceeec--cHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccc-c---cccccCCcchhHHHHHHHHH Q lcl|NC_021299. 81 IKLTDVIYNRIDLT--DEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAP-Y---EKVSLVDEDEIWNGVVSNRR 154 (387) Q Consensus 81 ~~id~~~~~~~~~~--d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~-~---~~~~~~~~~~~~~~i~~a~~ 154 (387) ++|++.+++.|.++ |.|++.....+.....+++...+++++|.+.++.+.... . ....+.++.++|+.+.++.. T Consensus 70 ~tl~qdR~~~F~vD~~DvDEt~~~~~~~nv~~ef~~~~v~PEiDayr~skla~~a~~~~~~~~~t~t~~n~~~~i~~~~~ 149 (290) T protein:vir:78 70 YTIDFDRDVEFFVDVMDVDETGQALSAANVTKEFNSRHAGPEMDAYRFSKLATAAKTNSNSVAEEITKDNVFTKLKAAIR 149 (290) T ss_pred EEeeccccceeeccccchhHHhhhhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhhhccCcccccccCHHHHHHHHHHHHH Confidence 99999999999999 888876666665555667777889999999887554322 1 22234567899999999999 Q ss_pred HHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeecc-ceeeeeeecccc---- Q lcl|NC_021299. 155 WLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTL-PHGDAYLSHPTA---- 229 (387) Q Consensus 155 ~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~-~~~~~~~~~~~a---- 229 (387) +|++ +|.++|+++++|+++..|.++++|.+....+..+.. ..+|.++++.||.+++.... --...+.+..+. T Consensus 150 ~lde--vp~~~rvl~vtp~~~~lL~~~~~f~r~~~~~~~~~~-~i~~~V~~idG~~ii~vps~~r~~t~~~f~~G~~~~~ 226 (290) T protein:vir:78 150 KVKK--YGTQNLVMYVSPDVMAALELSDDFVRAINVQNIGPS-SIETRITAIDGTRIVEVEAEDRFYDTFDFTDGYKPAA 226 (290) T ss_pred HHHh--cCCCCeEEEECHHHHHHHhhChhhhccccccccccc-cccceeeeecCcEEEEecccchhhhhhhhcccccccC Confidence 9986 899999999999999999999999887666554433 34889999999999874321 000011111100 Q ss_pred ----cccc---ccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccc Q lcl|NC_021299. 230 ----YAML---TRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGT 302 (387) Q Consensus 230 ----~~~~---~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~ 302 (387) +.+. ..+...+.-.....-.........-.|..++..- .+....+... ..+ T Consensus 227 ~ak~in~ii~~~~a~i~~~K~~~~~~~~P~~~~~~d~~~~~~r~y--~d~~v~~nk~--------------------~~i 284 (290) T protein:vir:78 227 GAKKLNFLLVNKGSVVGGAKHASIYLHAPGSVGQGDGWLYQYRVY--HDIFVLDQQK--------------------DGV 284 (290) T ss_pred CccceeEEEEcCCceeeeeeeeEEEeeCCCCCcCcceeeeeeeee--eeeeeecccc--------------------Cee Confidence 0000 0000000000000000000000001122111110 0001000000 000 Q ss_pred eeeeee Q lcl|NC_021299. 303 RIHLKA 308 (387) Q Consensus 303 ~v~~~~ 308 (387) -+.... T Consensus 285 ~~~~~~ 290 (290) T protein:vir:78 285 IASTEV 290 (290) T ss_pred EEEeeC Confidence 000000 No 49 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=99.83 E-value=2.1e-22 Score=139.46 Aligned_cols=334 Identities=9% Similarity=-0.031 Sum_probs=181.3 Q ss_pred Ccccc---------------ccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccc Q lcl|NC_021299. 1 MANAF---------------IKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGA 65 (387) Q Consensus 1 Ma~~~---------------~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~ 65 (387) |++-+ |-=|+|..|++..|....+|.++++. +++ +.|++++|+..+...+..+.. T Consensus 1 Ms~~n~~t~~~~~~s~~~~al~le~f~geV~taF~~~si~~~~~~v---rti--~~GkS~qf~~iG~~~a~y~~~----- 70 (402) T protein:vir:97 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDV---QTV--TGTNTVSNKYLGETELQVLAP----- 70 (402) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCccee---eee--cccceEEEEEEeeeEEeeecc----- Confidence 65332 11389999999999999999888753 345 459999999998888876642 Q ss_pred cccccccccccceEEEEEEeeeecceeeccHHHhhhhhh-HHHHHHHHHHHHHHHHHHHHHHHHHhcccc--c------- Q lcl|NC_021299. 66 DRNMVASDLTEVTVDIKLTDVIYNRIDLTDEERELDVRS-FAVDVLPRQVRAVAEQIEDAVSYLITKAPY--E------- 135 (387) Q Consensus 66 ~~~~~~~~~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~-~~~~~~~~~~~~la~~vd~~~~~~~~~~~~--~------- 135 (387) +..++.+++......|+||...+..+.|.|.|+.+...| ++.++.++++++||+..|+.++.++..+.. . T Consensus 71 G~~ldg~~~~~~k~~ItID~lL~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~ 150 (402) T protein:vir:97 71 GQSPNATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKP 150 (402) T ss_pred ccccCCCCcccccEEEEeCceeechhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccC Confidence 334556677778889999999999999999999999999 799999999999999999999876643211 0 Q ss_pred --c------ccc-------CCcchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeee Q lcl|NC_021299. 136 --K------VSL-------VDEDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQT 200 (387) Q Consensus 136 --~------~~~-------~~~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~ 200 (387) . ... .++....+.|.++..+|++.+||.++|+++++|++|..|+++++|.+.+.... +...+.+ T Consensus 151 ~~~~~g~s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~dRv~vv~P~~y~~Ll~~~rl~n~d~~~~-~~g~~~~ 229 (402) T protein:vir:97 151 RVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTIS-QSGATIN 229 (402) T ss_pred cccccccccccccccchhhcCHHHHHHHHHHHHHHHHhcCCCccccEEEeChHHHHHHhhcccccchhhccc-cCCcccc Confidence 0 001 12233457788999999999999999999999999999999999887766433 2345789 Q ss_pred eEEEEeecceeeeeeccceeee-eeeccccccccccc-------cccccCceeeeeeecccccceeeeeeeeeeccceee Q lcl|NC_021299. 201 ARIGRLAQYDVVTVDTLPHGDA-YLSHPTAYAMLTRS-------PGRPMTNTVATSTVATENGVQLRWLGDYDATSTTER 272 (387) Q Consensus 201 g~ig~~~g~~v~~s~~~~~~~~-~~~~~~a~~~~~~~-------~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~ 272 (387) |.++.+.|+.|++++++|.... +..|.-. ....+. .....+..+.-.+.....-..+.-...++.....+. T Consensus 230 G~v~~v~Gv~Vv~SnnlP~~a~~it~~~ls-~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~vT~~~~~d~r~~~~~ 308 (402) T protein:vir:97 230 GFVLSSYNCPVIPSNRFPTFAQDQAHHLLS-NEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYY 308 (402) T ss_pred ceeEEEeceEEEecCccccccccccccccc-cCCCCccCCcCcccceeEEEEEecceEEEEEeeccccchhhchhHHHHH Confidence 9999999999999999996421 1111100 000000 000000000000000000000000001111111111 Q ss_pred eeeeeeeeeccccceeeeccceeccccccceeeeeeeeccccccccccccceeEEEeeccCCccccCcceEEEecCceEE Q lcl|NC_021299. 273 SIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKATDAEIEGETVKAGEKLALALEDSNGDNRAGDPLVTWTSGTTAKA 352 (387) Q Consensus 273 ~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~w~Ssn~~VA 352 (387) ......+|.....+...... ++....-. ..+-+.+... .++ ......++++.++-..-| T Consensus 309 id~~~a~G~g~~RPeaa~vv------------~~~~~~t~--~~~~~~~~~~-~~~------~~~~~~~~~~~~~~~~~~ 367 (402) T protein:vir:97 309 IDTFMAEGAIPDRWEAVSVV------------TTKRDATT--GDAGGPGDDH-ATV------LARAQRKAVYVKTEGAAA 367 (402) T ss_pred HHHHHHhCCcccCccceEEE------------EEeccccc--ccCCccccch-hhh------hcccccceEEEeccccch Confidence 00011111111111000000 00000000 0000000000 000 000011122222222111 Q ss_pred EEcC--------------------CceEEEEecce Q lcl|NC_021299. 353 TIDA--------------------NGVVTGVAAGT 367 (387) Q Consensus 353 tVd~--------------------~G~VTa~~~Gt 367 (387) ..++ +=+-|+.++-+ T Consensus 368 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 402 (402) T protein:vir:97 368 AFSAAPAGIQAEDLVAAVRAVMANDIKPTAMKPTE 402 (402) T ss_pred hccccccccchHHHHHHHHHHHhccccccccCCCC Confidence 1111 01223333333 No 50 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=99.81 E-value=1.1e-21 Score=135.42 Aligned_cols=231 Identities=12% Similarity=0.069 Sum_probs=159.1 Q ss_pred cccccccCCCEEEEEecccceeeceecccccccccccccccccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHH Q lcl|NC_021299. 34 YGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQ 113 (387) Q Consensus 34 ~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 113 (387) +.-. ..||||+||.+ ..+.+ ...++..++++.++.++.+++| ++.+++|.++|++....++|++++..+|+ T Consensus 1 ~~~~--~~Gdtit~P~~----iGda~--~v~eG~~i~~~~l~~t~~~atI-k~~gk~~~itD~a~l~~~gDp~~ea~~Q~ 71 (231) T protein:vir:73 1 ENGI--NLANLCEYPND----IGDAA--DVAEGGEISLDKIGTTTKSVTI-KKAAKGTEITDEAALSGYGDPIGESNKQL 71 (231) T ss_pred Cccc--cCCceEEeccc----ccchh--hhcCCCcCChhhccccceeeeE-eeeccceeeeHHHHhhccCchHHHHHHHH Confidence 2222 46999999864 22322 2356777999999999999999 66799999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccc Q lcl|NC_021299. 114 VRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEA 193 (387) Q Consensus 114 ~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~ 193 (387) ..+||+++|+++++.+..+..... ....++.|.+|...|.+++ ...++++++|..+..|+++..+..... .. T Consensus 72 ~~~iA~kvD~di~~~~~~a~l~~~----~~~t~d~i~~A~~~fgde~--~~~~vivv~p~~~~~Lrk~~~~~~~~~--~~ 143 (231) T protein:vir:73 72 GLSLANKVDDDLLKAAKTTSQTVS----TKANVDGVQAALDIFNDED--AQAYVLIVNPKDAAKIRKDANAKNIGS--EV 143 (231) T ss_pred HHHHHHhhhHHHHHhhcccccccc----ccccHHHHHHHHHHhcccc--ccceEEEEcchHHHhhhhccchhhhhh--hh Confidence 999999999999988877665443 3467899999999999886 456899999999999998776544322 23 Q ss_pred cceeeeeeEEEEeecceeeeeeccceeeeeeeccccccccccccccccCceeeeeeecccccceeeeeeeeeeccceeee Q lcl|NC_021299. 194 GASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERS 273 (387) Q Consensus 194 ~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 273 (387) +...+++|.+|++.|++|+.++.+|.+...... +.. ...+.............+++.....+.. T Consensus 144 g~~i~~~G~iG~i~G~~Vi~S~~~~~~~~~~~~---~i~-------------~~gAl~~~~k~~~~vEtdRd~~~k~~~i 207 (231) T protein:vir:73 144 GANALINGTYADVLGAQIVRSKKLAEGSALMFK---IVS-------------NSPALKLVLKRGVQVETDRDIVTKTTVI 207 (231) T ss_pred ccceeeecccceEcceEEEEcCCCCCCceeeee---EEe-------------eccceeeeecccceeeccccccccccEE Confidence 456789999999999999999999976554321 000 0112222222333333444444444455 Q ss_pred eeeeeeeeccccceeeeccceecc Q lcl|NC_021299. 274 IVDTWIGVKAVLDPVTANLDDEPR 297 (387) Q Consensus 274 ~~~~~~g~~~~~~~~~~~~~~~~~ 297 (387) ..+.+++...............+. T Consensus 208 ~~~~~y~v~l~~~~~vv~~t~~g~ 231 (231) T protein:vir:73 208 TADEHYAAYLYDLTKVVNITFTGV 231 (231) T ss_pred EEeEEEEEEEEcCccEEEEEeecC Confidence 555555544433332222111111 No 51 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=99.80 E-value=2.4e-21 Score=133.66 Aligned_cols=289 Identities=10% Similarity=0.014 Sum_probs=154.5 Q ss_pred Ccccc---ccHHHHHHHHHHHHH-hhccccceeeecccccccccCCCEEEEEeccccee-eceeccccc--ccccccccc Q lcl|NC_021299. 1 MANAF---IKPPVIIASILGQLQ-HELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIA-HTRGLRATG--ADRNMVASD 73 (387) Q Consensus 1 Ma~~~---~~pe~~~~~~~~~l~-~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~-~~~~~~~~~--~~~~~~~~~ 73 (387) |+-++ |+ +.|+.++...+. +..+|.+.|- .. . ..+.+++++.+....... ......... ...+.++.+ T Consensus 13 Ms~~i~~~fv-~qy~~~v~~~~qq~~s~L~~tV~-~~-~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~dtp~~~ 87 (322) T protein:vir:10 13 IAGDIDQAFV-QTYETTLRILSQQKSAKLKQYCQ-HK-N--ESSESHNWETLASMDPDAVKRKRSRQQSADGTYPTPVNN 87 (322) T ss_pred eechhhhHHH-HHHHHHHHHHHHHhhhhhhcccc-cc-c--ccccccceeecccccccccccccccccccCcccCCCccc Confidence 77765 44 668887777764 4455655542 21 1 234567776654322111 000000000 011222333 Q ss_pred cccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-------------cccC Q lcl|NC_021299. 74 LTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEK-------------VSLV 140 (387) Q Consensus 74 ~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~-------------~~~~ 140 (387) .......+.+ ..+++++.++|.|+.+...|+...+++++.++|+++.|..++..+.+..... ...+ T Consensus 88 ~~~~~r~~~~-~d~~~~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~~~~~gt~v~~~ss~~i~~g 166 (322) T protein:vir:10 88 KPFAKRRTNV-DTYDTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPASIKGTGQPVEFLATQEIGDG 166 (322) T ss_pred cccceEEEee-cccccceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhccccccccccccccCCCcccccC Confidence 3445555555 5568899999999999999999999999999999999999886555432211 1122 Q ss_pred CcchhHHHHHHHHHHHhhccCCcC-CcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccce Q lcl|NC_021299. 141 DEDEIWNGVVSNRRWLNEQKVPKD-GRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPH 219 (387) Q Consensus 141 ~~~~~~~~i~~a~~~l~~~~vp~~-~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~ 219 (387) +....++.+++|++.|+++++|++ +||+|++|+++..||.+++|...+..+.+ ...++|.++++.||.|+.++.+|. T Consensus 167 ~~g~t~~kl~~a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~d~~~ts~D~~~~~--~l~~~G~ig~~lGf~~i~s~~lp~ 244 (322) T protein:vir:10 167 TKPISFDYVTEITERFLENEIEPEVSKVIVIGPTQARKLLQITEATSADYTSAM--DLQSKGIITNWMGYTWIVSTRLDK 244 (322) T ss_pred ccchhHHHHHHHHHHHHhcCCCCCCCeEEEeCHHHHHHHhcchhhhhhhcccch--hhhhcCeeeeeeeEEEEEeccCCc Confidence 346678999999999999999976 49999999999999999999988887643 233679999999999999999985 Q ss_pred eeeeeeccccccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceecccc Q lcl|NC_021299. 220 GDAYLSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFV 299 (387) Q Consensus 220 ~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v 299 (387) ................. ......++.++.....+.......++++...-. ..+......+...... - T Consensus 245 ~~~t~~~~~~~~~~~~~--~~~~~a~~k~Av~~a~~~dv~~~i~~~~~~~~a----------~~I~~~~~~Ga~ri~~-~ 311 (322) T protein:vir:10 245 FDPTQWGMAAEDGPQGD--EIWCIAMTDMALGYHSCKDIWTKVAEDPSASFA----------WRIYSAFTADCVRVED-E 311 (322) T ss_pred cccccccccccCCCCcc--ceeEEEEecCceeEEEeeeeeEEeeccCCcchh----------hhhhhhhhhCceEecc-C Confidence 43221111110000000 000000111111111111111110111100000 0000000000000000 0 Q ss_pred ccceeeeeeeeccc Q lcl|NC_021299. 300 RGTRIHLKATDAEI 313 (387) Q Consensus 300 ~~~~v~~~~~~~~~ 313 (387) .++.+... -++ T Consensus 312 gVv~i~~~---e~~ 322 (322) T protein:vir:10 312 HIFKLRLK---NSL 322 (322) T ss_pred cEEEEEEe---ccC Confidence 00011110 001 No 52 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=99.79 E-value=9.6e-22 Score=135.80 Aligned_cols=345 Identities=8% Similarity=-0.034 Sum_probs=195.7 Q ss_pred Cccc----------------cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccc Q lcl|NC_021299. 1 MANA----------------FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATG 64 (387) Q Consensus 1 Ma~~----------------~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~ 64 (387) |++- +| =|+|..|++..|....+|..++.. ..+ +.|++++|+..+...+..+.. T Consensus 1 Ms~~n~~t~~~~~~sg~~~al~-Le~f~GeV~taF~~~si~~~~~~v---Rti--~~gkS~qf~~~G~s~~~~~~p---- 70 (401) T protein:vir:70 1 MSTPNNLTNVAVSASGEVDSLL-IEKFNGKVNEQYLKGENIMSYFDV---QTV--TGTNTVSNKYLGETELQVLAP---- 70 (401) T ss_pred CCCCccccccccccccchhHhH-HhHhcchHHHHHHHHhhhccccee---eee--cccceEEEEEeeeeEeeeecC---- Confidence 6532 23 389999999999999888887752 234 459999999998888887653 Q ss_pred ccccccccccccceEEEEEEeeeecceeeccHHHhhhhhh-HHHHHHHHHHHHHHHHHHHHHHHHHhcccc-------cc Q lcl|NC_021299. 65 ADRNMVASDLTEVTVDIKLTDVIYNRIDLTDEERELDVRS-FAVDVLPRQVRAVAEQIEDAVSYLITKAPY-------EK 136 (387) Q Consensus 65 ~~~~~~~~~~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~-~~~~~~~~~~~~la~~vd~~~~~~~~~~~~-------~~ 136 (387) +..++.+++......|+||.-++..+.|.|.|+.+...| ++.++.++++++||+..|+.++.++..+.. .. T Consensus 71 -G~~ld~~~~~~dK~~ItID~lL~a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~~ 149 (401) T protein:vir:70 71 -GQSPAATSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRTN 149 (401) T ss_pred -CCCcCCCCcccccEEEEeCceeehhhhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccC Confidence 344666777788889999999999999999999999999 799999999999999999999877643211 00 Q ss_pred c-----------------ccCCcchhHHHHHHHHHHHhhccCCcCCcEEEE-chHHHHHHhcccchhhhhhcccccceee Q lcl|NC_021299. 137 V-----------------SLVDEDEIWNGVVSNRRWLNEQKVPKDGRVLLV-GSAVEEALLLDDRFIRYDSAGEAGASRL 198 (387) Q Consensus 137 ~-----------------~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~~v~-~~~~~~~l~~~~~~~~~~~~g~~~~~~~ 198 (387) . ...++......|.++...|++.+||.+ |++++ +|.+|..|+..+++...+...++ .... T Consensus 150 p~~~~~G~~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~~-r~vvl~pp~~Ys~Ll~~d~L~nrd~~~s~-~g~~ 227 (401) T protein:vir:70 150 PRVKGHGFSINVEVAEGEALVNPQYVMAAVEFALEQQLEQEVDIS-DVAILMPWRYFNVLRDADRIVDKTYTISQ-SGAT 227 (401) T ss_pred CCcCCCceEEeccccccccccCHHHHHHHHHHHHHHHHhcCCCcc-ceEEEcCHHHHHHHHhcCcccchhhcccc-CCcc Confidence 0 001112345678899999999999965 66665 67777777776777766654333 3457 Q ss_pred eeeEEEEeecceeeeeeccceeeeeeecccccccccccccc-------ccCceeeeeeecccccceeeeeeeeeecccee Q lcl|NC_021299. 199 QTARIGRLAQYDVVTVDTLPHGDAYLSHPTAYAMLTRSPGR-------PMTNTVATSTVATENGVQLRWLGDYDATSTTE 271 (387) Q Consensus 199 ~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a~~~~~~~~~~-------~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~ 271 (387) .+|.+..++|+.|++++++|.......+........+..-. ..+..+.-.+.....-..+.-...++.....+ T Consensus 228 ~~G~v~~vaGv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~~lt~~~~~d~r~~~~ 307 (401) T protein:vir:70 228 IQGFTLSSYNCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSIDVTGDIFYEKKEKTY 307 (401) T ss_pred ccceEEEEeceEEEeeccccccccccccccccccCCCccCCCCccccceeEEEEehhheEEEEeeccccchhhhhhhhHH Confidence 89999999999999999999743221111100000000000 00000000000000000000000111111111 Q ss_pred eeeeeeeeeeccccceeeeccceeccccccceeeeeeeecccccccc--ccccceeEEEeeccCCccccCcceEEEecCc Q lcl|NC_021299. 272 RSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKATDAEIEGETV--KAGEKLALALEDSNGDNRAGDPLVTWTSGTT 349 (387) Q Consensus 272 ~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~v~w~Ssn~ 349 (387) .......+|.....+............+ +........+.-++ ..+...-+.+.++. ......|+||-+ T Consensus 308 ~id~~~a~g~g~~RPeaa~vv~~k~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~ 377 (401) T protein:vir:70 308 YIDTFMAEGAIPDRWEAVSVVTTKRNTT-----TGAVEGTDGAQHTIVKNRAQRKAVYVKNAA-----PVAAAAASLSAE 377 (401) T ss_pred HHHHHHHhCCcccchhheEEEeecCccc-----ccccccCCcchhhhhhhhccceeEEecccc-----chhhhccccchH Confidence 1111112222222222111111111111 11111112222222 22222223333322 356789999998 Q ss_pred eEEE-E---c-CCceEEEEecceE Q lcl|NC_021299. 350 AKAT-I---D-ANGVVTGVAAGTS 368 (387) Q Consensus 350 ~VAt-V---d-~~G~VTa~~~Gta 368 (387) .+.- | = .+=+-|++++-+- T Consensus 378 ~~~~~~~~~~~~~~~~~~~~~~~~ 401 (401) T protein:vir:70 378 DLVAAVRAVMANDIKPTALKPTEE 401 (401) T ss_pred HHHHHHHHHHhccccccccCcCCC Confidence 7632 1 1 1235567766544 No 53 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=99.75 E-value=6.7e-20 Score=125.69 Aligned_cols=305 Identities=11% Similarity=0.071 Sum_probs=160.8 Q ss_pred CccccccHHHHHHHHHHHHHhhccc-cceeeecccccccccCCCEEEEEeccc-ceeeceecccccccccccccccccce Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHELVL-PNFVFKNGYGDVAHKFNDTITIRIPVP-TIAHTRGLRATGADRNMVASDLTEVT 78 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~~~~-~~~~~~d~~~~~~~~~gdtv~i~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (387) |+.++ -++|.+++.+.|...+.. +.+.+....+++..-.|++|+||.... ....+|+... +.+ ...+++... T Consensus 1 Mainy--a~~~~~~Ld~~~~~~~lts~~l~~~~~~~~v~~~ggktVkIp~is~tsGl~DY~R~~-g~~---~~g~v~~~~ 74 (346) T protein:vir:10 1 MTINY--AEKYQAAVQQAFYDGHLYSAELWNSPSNSIIKFDGAKHIKVPRLEITSGRKDRQRRT-ITT---PVANYSNDW 74 (346) T ss_pred Ccchh--HHHHHHHHHHHHHhhhccchhhcccccccceEecCCCEEEEEEeeeecccccccccC-Ccc---cccccccce Confidence 99988 469999999999877543 444443333433334589999998753 3456665321 111 124567788 Q ss_pred EEEEEEeeeecceeec--cHHHhh---hhhhHHHHHHHHHHHHHHHHHHHHHHHHHhc-c-----cccccccCCcchhHH Q lcl|NC_021299. 79 VDIKLTDVIYNRIDLT--DEEREL---DVRSFAVDVLPRQVRAVAEQIEDAVSYLITK-A-----PYEKVSLVDEDEIWN 147 (387) Q Consensus 79 ~~~~id~~~~~~~~~~--d~~~~~---~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~-~-----~~~~~~~~~~~~~~~ 147 (387) .+++|++.+++.|.++ |.|++. .+.+.+.++ +...+++++|.+.++.+.. + ......+.++.++|+ T Consensus 75 et~tl~qDR~~~F~vD~mDvDETn~~~~~anv~~ef---~r~~vvPEiDayrfskLa~~a~~~~~~~~~~~a~T~~ni~~ 151 (346) T protein:vir:10 75 DSYELKNERYWSTLVDPSDIDETNMVVSLANITKQF---NLDSKMPEKDRYMFSHLYSGKEAAHDGGITTNTLDEKNILP 151 (346) T ss_pred eEEEeeccccceecccccchHHHHHHhHHHHHHHHH---HHHhhcchhhHHHHHHHHHhhhhhccccccccccCHHHHHH Confidence 8999999999999999 777654 455555554 4445679999997765432 2 112234467889999 Q ss_pred HHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeee--eccceeee--- Q lcl|NC_021299. 148 GVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTV--DTLPHGDA--- 222 (387) Q Consensus 148 ~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s--~~~~~~~~--- 222 (387) .+.++..+|++.++|.++|+|+++|+++..|.++++|.+...+++.+ ..++.++++.|+.|++. +.+...-. T Consensus 152 ~i~~~~~~lde~~vp~~~rvl~vTp~~~~lLk~s~~f~k~~~v~~~~---~i~~~V~siDGv~Ii~VPs~r~~t~~~f~~ 228 (346) T protein:vir:10 152 AFDNMMLDFDEARIPSTNRILYVTPKTNAILKRAEAMNRALTLKDPN---NIQRTVYSLDDVTIRVVPSDLMQTAYDFSD 228 (346) T ss_pred HHHHHHHHHHHccCCCCCeEEEECHHHHHHHhhchhheecccccccc---ccceeeeeecCeEEEEcchhhcccchhhcc Confidence 99999999999999999999999999999888889998877776533 35899999999999763 33321100 Q ss_pred -------------eeeccccccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeee-eeecccccee Q lcl|NC_021299. 223 -------------YLSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTW-IGVKAVLDPV 288 (387) Q Consensus 223 -------------~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~-~g~~~~~~~~ 288 (387) +.+|+.+.. .+.-.....-... .+...-.|..++..- .+....+.. .|.+...... T Consensus 229 G~~~~t~ak~INfiiv~~~A~i-------a~~K~~~~~if~P-~~~~~g~~l~~~R~Y--~D~fv~~nk~~~Iyv~~~~a 298 (346) T protein:vir:10 229 GSKIIDTAKQIEMFLIYNGVQI-------APEKYSFVGFDQP-SAATSGNYLYYEQSY--DDVLLLNTKTKGIQFVVSDK 298 (346) T ss_pred CccccCCccceeEEEECCceee-------eeeeeeeeEeeCC-CCCcccceeeeeeee--eeeeeeccccceEEEeeecc Confidence 111111110 0000000000000 011111122222111 011111100 0000000000 Q ss_pred e--eccce-------eccccccceeeeeeeeccccccccccccceeEEEeeccCCccccCcceE Q lcl|NC_021299. 289 T--ANLDD-------EPRFVRGTRIHLKATDAEIEGETVKAGEKLALALEDSNGDNRAGDPLVT 343 (387) Q Consensus 289 ~--~~~~~-------~~~~v~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 343 (387) . ..... ....+.-.+--+...++.-++.+..... +.+ |. T Consensus 299 ~~~~~~~~~~~~kpt~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~-------------~~ 346 (346) T protein:vir:10 299 PKKDQEQSGQDAKPTAESTLEEIKAYLDKNHIDYTGKTKKDEL---LAL-------------VK 346 (346) T ss_pred cccCccCcccccCcccccchHHHHHHhcccccccccccchhhH---Hhh-------------cC Confidence 0 00000 0001111111122222222222111000 000 00 No 54 >protein:vir:102335 Length: 312 # NCBI annotation: putative capsid protein # Family: family:all:701 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529560;genbank:gi:90592716;genbank:GeneID:3974467 Probab=99.73 E-value=4e-19 Score=121.46 Aligned_cols=282 Identities=13% Similarity=0.016 Sum_probs=151.9 Q ss_pred CccccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccccceEE Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEVTVD 80 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (387) |||++--.++|.+++.+.+...+....|..-+..-+|. -|++|+||........+|+...... -...+++....+ T Consensus 1 Mantl~ya~~~~~~LD~~~~~~~~s~~l~~~~~~v~~~--ggktVkIp~i~~~gl~DY~R~~g~~---~~~g~v~~~~et 75 (312) T protein:vir:10 1 MANTLAYGQVLQQGLDKQATQELLTGWMDSNAKQIKYE--GGKEVKIGKLSTDGLGDYSRGSANA---YVGGDVKFEYET 75 (312) T ss_pred CCcchhHHHHHHHHHHHHHHhhhccccccCCCceEEEe--cCcEEEEEeeecccccccccccCCc---ccccccccccee Confidence 99988778999999999999999888774222112343 4899999998888888876432211 111246667788 Q ss_pred EEEEeeeecceeec--cHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------cccccCCcchhHHHHH Q lcl|NC_021299. 81 IKLTDVIYNRIDLT--DEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPY--------EKVSLVDEDEIWNGVV 150 (387) Q Consensus 81 ~~id~~~~~~~~~~--d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~--------~~~~~~~~~~~~~~i~ 150 (387) ++|++.+++.|.++ |.|++........-..+.+...+++++|++.++.+..... ....+.++.++|+.+. T Consensus 76 ~tl~qDR~~~F~vD~mDvDETn~~~s~anv~~ef~r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~~T~~ni~~~i~ 155 (312) T protein:vir:10 76 KTMTQDRGRKFTLDAMDVDETNFLVTATTVMGEFQRLKVIPEIDAYRLSRLATIAIGIKGDTNVEYSYSVNSSTIINKIK 155 (312) T ss_pred EEeeecccceeeccccchhhHhhHHHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhccccccccccccccCHHHHHHHHH Confidence 99999999999999 8887654444333334446667789999998866542211 1223457889999999 Q ss_pred HHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeeecccc- Q lcl|NC_021299. 151 SNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTA- 229 (387) Q Consensus 151 ~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a- 229 (387) ++..+|++.++| ++|+++++|+++..|.++..+ ..... .......++.++.+.|+.|++...---...+.+..+. T Consensus 156 ~~~~~lde~~vp-~~rvl~vTp~~~~lLk~~~~~-~~~~~--~~~~~~i~~~V~~iDgv~Ii~VPs~r~~t~~~f~dG~t 231 (312) T protein:vir:10 156 TGIKIIRENGYN-GPLVCHLTYDSMFAIEEKVLE-KLTAV--TFAQGGIQTQVPSIDGCALIKTPQNRMYSSILLNDGTT 231 (312) T ss_pred HHHHHHHHccCC-CceEEEeChHHHHHHhhhhhc-eeccc--ccccceeeeeeeeecccEEEEchhhhccceeeeccCcc Confidence 999999999999 699999999988655543222 21211 1223346888999999999763321111112211110 Q ss_pred -----ccccccccc-------cccCcee---eeee----ecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeee Q lcl|NC_021299. 230 -----YAMLTRSPG-------RPMTNTV---ATST----VATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTA 290 (387) Q Consensus 230 -----~~~~~~~~~-------~~~~~t~---~~~~----~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~ 290 (387) ..+...+.. ....+.. .... ....+...-.|..++..- .+....+ T Consensus 232 ~~~~~gg~~~~~~ak~INfiiv~~~a~i~~~K~~~~~if~P~~~~~~d~~~~~~R~Y--~D~fv~~-------------- 295 (312) T protein:vir:10 232 SNQTAGGYLKGTKALDTNFIIAPVDVPLAITKQDKMRIFDPETNQTANAWSMDYRRY--HDLWVTD-------------- 295 (312) T ss_pred cccccCceeecCcccccceEEeCCceeeceeeeeeeeeeCCCCCCCcceeeeeeeee--eeeeeec-------------- Confidence 000000000 0000000 0000 000000000111111100 0000000 Q ss_pred ccceeccccccceeeeeeeeccccccccc Q lcl|NC_021299. 291 NLDDEPRFVRGTRIHLKATDAEIEGETVK 319 (387) Q Consensus 291 ~~~~~~~~v~~~~v~~~~~~~~~~~~~~~ 319 (387) .....+.+.... . ...+ T Consensus 296 ------nk~~~Iyv~~k~--a----~~~~ 312 (312) T protein:vir:10 296 ------NKANSVYANFKD--A----KPVG 312 (312) T ss_pred ------cccCeEEEEeec--c----cCCC Confidence 000000000000 0 0000 No 55 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=99.71 E-value=8.6e-20 Score=125.10 Aligned_cols=202 Identities=14% Similarity=0.125 Sum_probs=118.9 Q ss_pred EEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccc----------------ccccCCcchhH Q lcl|NC_021299. 83 LTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYE----------------KVSLVDEDEIW 146 (387) Q Consensus 83 id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~----------------~~~~~~~~~~~ 146 (387) ||.-....+.|.|.|+.+.++|++.++.+|++++||+.+|+.++.++..+... .+.++++...| T Consensus 1 iD~lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~~g~~~~~~a~~t~~~~~l~ 80 (221) T protein:vir:17 1 MDDLLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQDGGFSVNIGAGNTNNAQAIV 80 (221) T ss_pred CCcchhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccccccCcceeccccccCCHHHHH Confidence 89999999999999999999999999999999999999999999877643211 11123344567 Q ss_pred HHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhc--ccchhhhhhcccccceeeeee-EEEEeecceeeeeeccceeeee Q lcl|NC_021299. 147 NGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLL--DDRFIRYDSAGEAGASRLQTA-RIGRLAQYDVVTVDTLPHGDAY 223 (387) Q Consensus 147 ~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~--~~~~~~~~~~g~~~~~~~~~g-~ig~~~g~~v~~s~~~~~~~~~ 223 (387) +.|++++++|++++||.++||+|++|++|..|++ ++.+.+.+..++++ .+++| .++++.||+|++++++|...+. T Consensus 81 dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g--~~~~g~~i~~v~G~~V~~SnnlP~~~gt 158 (221) T protein:vir:17 81 DGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNREIGNTQG--DMNTGKGLYVNAGIRIYKSNVLASLYGT 158 (221) T ss_pred HHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeeeecccccc--cccccceeeeecCcEEEEeccCCccccc Confidence 8999999999999999999999999998888886 45555555554443 36677 5899999999999999975443 Q ss_pred eeccccccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccce Q lcl|NC_021299. 224 LSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTR 303 (387) Q Consensus 224 ~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~ 303 (387) ..+..+..+....... . .+ .....+...+.|... ...+++ -....... + T Consensus 159 ~~~~~ag~~~~~~~~~---~--~y-r~~fs~~~glv~~~~----------------Avgtvk-----l~~~~~~~----~ 207 (221) T protein:vir:17 159 NLVTDPGDATTSGENN---G--SY-RPAITDRAGLVFHKE----------------AADTVE-----VLLPPSRP----P 207 (221) T ss_pred ccccCCcccccccccc---c--cc-cccccceEEEEEcch----------------heeeee-----eecCCCCC----c Confidence 3322221111000000 0 00 000000001111110 000000 00000000 0 Q ss_pred eeeeeeecccccccccccc Q lcl|NC_021299. 304 IHLKATDAEIEGETVKAGE 322 (387) Q Consensus 304 v~~~~~~~~~~~~~~~~~~ 322 (387) +.+ ..+++.. ..-. T Consensus 208 ~~~--~~~~~~~---~~~~ 221 (221) T protein:vir:17 208 LVI--SMFSIRR---PDRR 221 (221) T ss_pred eee--eeeeccC---CCCC Confidence 000 0001100 0000 No 56 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=99.69 E-value=2.1e-18 Score=117.45 Aligned_cols=269 Identities=15% Similarity=0.106 Sum_probs=153.7 Q ss_pred CccccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEeccc-ceeeceecccccccccccccccccceE Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVP-TIAHTRGLRATGADRNMVASDLTEVTV 79 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (387) ||+++ -++|.+.+.++|...+....+.......++....|++|+||.... ....+|.. +......+++.... T Consensus 1 Main~--~~k~~~~ld~~~~~~~~~~~l~~~~n~~~~~~~gak~VkIp~ist~~gl~dY~R-----~~g~~~g~v~~~~e 73 (285) T protein:vir:79 1 MTVVL--DSKDLARIDEEYKADSQVWSYLTGGNGVTQRFRGHNEVRINKLSGFVDATAYKR-----GQDNARKTISVGKE 73 (285) T ss_pred Ccchh--hHHHHHHHHHHHHHhhhhhhhcccCCcceeEecCCCEEEEeeeccccccccccc-----ccCccccccceeee Confidence 99987 579999999999998888777654433333334489999998753 45666643 34455567777888 Q ss_pred EEEEEeeeecceeec--cHHHh--hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHh-cccccccccCCcchhHHHHHHHHH Q lcl|NC_021299. 80 DIKLTDVIYNRIDLT--DEERE--LDVRSFAVDVLPRQVRAVAEQIEDAVSYLIT-KAPYEKVSLVDEDEIWNGVVSNRR 154 (387) Q Consensus 80 ~~~id~~~~~~~~~~--d~~~~--~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~-~~~~~~~~~~~~~~~~~~i~~a~~ 154 (387) +++|++.+++.|.++ |.|+. ..+.+.+.++ +...+++++|.+.++.+. .+......+.++.++|+.+.++.. T Consensus 74 t~tl~~DR~~~f~iD~mDvdEn~~~~~~ni~~ef---~~~~vvPEiDayrfskla~~a~~~~~~~~T~~nv~~~i~~~~~ 150 (285) T protein:vir:79 74 TVKLTHEDWFGYDLDQFDMDENGAYTVENVVREH---NKMITIPHRDKVAVQKLFDSAAKKATDSITKDNALDAYDTAEA 150 (285) T ss_pred EEEeeccccceecccccchhhhhhhhHHHHHHHH---HhhhhcchhhHHHHHHHHhhcccccccccCHHHHHHHHHHHHH Confidence 999999999999998 55543 3334444443 334567999999876554 344444556778899999999999 Q ss_pred HHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeec-ceeeee--eccceee------eeee Q lcl|NC_021299. 155 WLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQ-YDVVTV--DTLPHGD------AYLS 225 (387) Q Consensus 155 ~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g-~~v~~s--~~~~~~~------~~~~ 225 (387) +|++.++| ++|+++++|+++..|.++++|.+...........-.++.++.+.| +.+++. ..+.... -+.+ T Consensus 151 ~lde~~vp-~~rvl~vTp~~~~~Lk~s~~~~r~~~~~~~~~~~~i~~~V~~lDg~v~ii~Vps~r~kt~~~~k~Infiiv 229 (285) T protein:vir:79 151 YMFDNEVP-GGFVMFVSSAYYTALKQSAAVTRTFSTDGTMVINGIDRRVAQLDGGVPIVRVSSDRLKGLGITNHVNFILT 229 (285) T ss_pred HHHHcCCC-CceEEEEChHHHHHHHhhhhhheecccccceeccceeeeeccccceeEEEEcchhhccCcCcchhccEEEe Confidence 99999999 699999999999999988888876655332111123456777776 566542 2222110 1122 Q ss_pred ccccccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeee-eeeccccceeeecc Q lcl|NC_021299. 226 HPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTW-IGVKAVLDPVTANL 292 (387) Q Consensus 226 ~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~-~g~~~~~~~~~~~~ 292 (387) |+.+..- +.-.....-.....+...-.|..++..-. +....+.. .|.+.. ..... T Consensus 230 ~~~a~i~-------~~K~~~~~~f~P~~~~~~d~~~~~~R~Y~--d~fv~~nk~~~Iy~~---~~a~~ 285 (285) T protein:vir:79 230 PLSAIAP-------IVKYDSVSVIDPSTDRSGNRWTIKGLSYY--DAIVLDNAKKGIYVA---ATAGV 285 (285) T ss_pred cCceecc-------ceeeeeeEeECCCCCCCcceeeeeeeeee--eeeehhhccceeeee---ecccC Confidence 2221100 00000000000000001111222211100 01110000 000000 00000 No 57 >protein:vir:118 Length: 449 # NCBI annotation: major head protein # Family: family:all:4054 # MgeID: mge:4 # MgeName: B103 # Cross-refs: genbank:acc:NP_690641;swissprot:sw:q37888;genbank:gi:22855155;interpro:IPR003343;uniprot:Q37888;genbank:GeneID:955370 Probab=99.66 E-value=4.2e-17 Score=110.36 Aligned_cols=343 Identities=10% Similarity=0.065 Sum_probs=146.4 Q ss_pred Ccccccc----HHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccccccc Q lcl|NC_021299. 1 MANAFIK----PPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTE 76 (387) Q Consensus 1 Ma~~~~~----pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (387) |+.+.+. ..+|-+..+...+..+-.-+|. .|..+ .-..|++|.=..-.......|++.. .+-.+-...++.- T Consensus 52 ~~~~~~~n~~~~sl~~ri~~~~~~~~~~~NPL~--~F~~~-~~~~g~~i~~~~~d~~~~~~~~~~~-~e~~~f~~~~p~i 127 (449) T protein:vir:11 52 LVNQTVQNEFLTSLVDRIGLVIVKSISLRNPLA--KFKKG-ALPMGRTIEEIFTDITKEKLYDAEE-AEQKVFEREIPNV 127 (449) T ss_pred hhhHHHHHHHHHHHHHhhhhhhhhhhhhcChhH--HHhcC-Cccccceeeeheecccceeeechhh-hcccccccCCCce Confidence 6655433 3344444333333333222221 22111 0135777755444444444444211 1222233333444 Q ss_pred ceEEEEEEeeeecceeeccHHHhh--hhhhHHHHHHHHHHHHHH--HHHHHHHH-H-HHhcc----cccc---cccCCcc Q lcl|NC_021299. 77 VTVDIKLTDVIYNRIDLTDEEREL--DVRSFAVDVLPRQVRAVA--EQIEDAVS-Y-LITKA----PYEK---VSLVDED 143 (387) Q Consensus 77 ~~~~~~id~~~~~~~~~~d~~~~~--~~~~~~~~~~~~~~~~la--~~vd~~~~-~-~~~~~----~~~~---~~~~~~~ 143 (387) ...-.+.+++.++-+.+.+..... ....-..+++.+.+.+|. ..+|++.. . ++..+ ...+ .-..+.. T Consensus 128 ~a~~h~~~r~~~~~~ti~~~~~~~af~s~~~~~~~~~~~~~~~~~s~~~~ey~~~~~l~~~~~~~~~~~~~~i~d~~t~~ 207 (449) T protein:vir:11 128 KTLFHERNRQSFYHQTIQDDSLKTAFISWGNFESFIASIINAIYNSAEVDEYEYMKLIIDNYYSKGLFKVVKVDDPMTST 207 (449) T ss_pred eEEEeeccccceeeEeeeHHHHHhhhcChhHHHHHHHHHHHHHhccCchHHHHHHHHHHHHhhccCceEEeeCCccccch Confidence 445566777777888888765432 333335556666655554 34454422 1 11111 1111 1112333 Q ss_pred hhHHHHHH-HHHHHhhccCCc----------------CCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEe Q lcl|NC_021299. 144 EIWNGVVS-NRRWLNEQKVPK----------------DGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRL 206 (387) Q Consensus 144 ~~~~~i~~-a~~~l~~~~vp~----------------~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~ 206 (387) ..++.+++ ++..-.+...|. ++.+++++|+....+-.+ -|.+..+.... -..+....+ T Consensus 208 ~~~~~~~k~~~~~~~~m~~P~~t~~~N~~~v~~~ad~~dl~~i~~~d~~~~ld~t-~ls~afN~tav----Da~~~~tvV 282 (449) T protein:vir:11 208 GALTNFIKKARATALKMTLPQGTRDYNAMAVRTRSDIRDVHLFIDADLNAELDVD-VLAKAFNMDRT----TFLGNVTVI 282 (449) T ss_pred HHHHHHHHHHHHHHHhhcCCCCCCCCCceeeccccCccceEEEEccCcceecccc-cchhhhcccee----eeeeeeeec Confidence 44555554 333334556773 334566676655444221 12222221110 001111111 Q ss_pred ecceeeeeeccceeeeee-eccccccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeecccc Q lcl|NC_021299. 207 AQYDVVTVDTLPHGDAYL-SHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVL 285 (387) Q Consensus 207 ~g~~v~~s~~~~~~~~~~-~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~ 285 (387) .+|.- .+.... .....+. .. ......... .......+ .+.. .......+ T Consensus 283 ddfAs-------t~~~a~~~sk~~~~-~~-------d~~~~~~~~--~~~~G~y~--n~~~------tvt~t~~~----- 332 (449) T protein:vir:11 283 DGFAS-------TGLKAVMVDKDWFM-VY-------DTLQKMETI--RNPRGLYW--NYYY------HVWQVLSA----- 332 (449) T ss_pred CccCC-------ccceeeeeccceeE-Ee-------eeeeEEEEE--EcCcceee--ccce------EEEEEEec----- Confidence 12100 000000 0000000 00 000000000 00000000 0000 00000000 Q ss_pred ceeeeccceeccccccceeeeeeeeccccccccccccceeEEEeeccCCccccCcceEEEecCce-EEEEcCCceEEEEe Q lcl|NC_021299. 286 DPVTANLDDEPRFVRGTRIHLKATDAEIEGETVKAGEKLALALEDSNGDNRAGDPLVTWTSGTTA-KATIDANGVVTGVA 364 (387) Q Consensus 286 ~~~~~~~~~~~~~v~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~w~Ssn~~-VAtVd~~G~VTa~~ 364 (387) ..........+....+.+..+.++++..+|..|.+.+++++.. +....++.|+|+|||+. +|+||++|+|||++ T Consensus 333 ---~~~~~~~a~~~~~~~~~VTsVsVtPss~tL~~G~T~qLTATV~--psnatnk~VTWSsSd~s~~ATVda~G~VTAva 407 (449) T protein:vir:11 333 ---SRFANAVAFVTGDDVPAVTQVIVSPAIASVKQGKSQAFTAYVR--ATDDKEHEVVWSVDGGSTGTSISSDGVLTVAA 407 (449) T ss_pred ---ccccceeeeeeeeccceeeEEEeeccceeeecCceEEEEEEEe--cCCCCCceEEEEEeCCceEEEEcCCceEEEec Confidence 0000000001111112233444455566677777777766554 33455688999988886 69999999999999 Q ss_pred cceEEEEEEEC--CEEEEEEEEEeC Q lcl|NC_021299. 365 AGTSEITAVVD--GLTVKKTITVTA 387 (387) Q Consensus 365 ~Gta~Itat~~--~~~~~~~vtVta 387 (387) .|+++|||+++ +.+++|+++|+. T Consensus 408 ~GTAtITAta~~~s~TaT~tvtV~~ 432 (449) T protein:vir:11 408 NETNQLTVKATVDIGTADEPKPVVG 432 (449) T ss_pred CccEEEEEEEecCcEEEEEEeeecc Confidence 99999999874 467777777766 No 58 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=99.63 E-value=4.1e-17 Score=110.44 Aligned_cols=328 Identities=9% Similarity=-0.020 Sum_probs=169.8 Q ss_pred Cccc----------------cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccc Q lcl|NC_021299. 1 MANA----------------FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATG 64 (387) Q Consensus 1 Ma~~----------------~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~ 64 (387) |++- +|+ |+|..|++..|....+|..++.. ..+ +.|++++|+..+...+..+.. T Consensus 1 Ms~~n~~t~p~~~gsg~~~aL~L-e~f~GeV~taF~~~si~~~~~~v---RtI--~~gkS~qf~~lG~s~a~y~~p---- 70 (400) T protein:vir:10 1 MSTPNNLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENIMSYFDV---QTV--TGTNTVSNKYLGETELQVLAP---- 70 (400) T ss_pred CCCCccccccccccccchhhhHH-hHhcchHHHHHHHHhhhccccee---eee--cccceEEEEEeeeeEEeeecC---- Confidence 6532 333 89999999999999888887752 234 459999999998888877653 Q ss_pred ccccccccccccceEEEEEEeeeecceeeccHHHhhhhhh-HHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------- Q lcl|NC_021299. 65 ADRNMVASDLTEVTVDIKLTDVIYNRIDLTDEERELDVRS-FAVDVLPRQVRAVAEQIEDAVSYLITKAPY--------- 134 (387) Q Consensus 65 ~~~~~~~~~~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~-~~~~~~~~~~~~la~~vd~~~~~~~~~~~~--------- 134 (387) +..+..+++......|+||.-.+....|.|.|+.+..+| ++.++.++++++||+..|+.++.++..+.. T Consensus 71 -G~~ldg~~~~~dk~~ItIDtLL~a~~~V~dlDd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~ 149 (400) T protein:vir:10 71 -GQSPAATSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTN 149 (400) T ss_pred -CCCcCCCCcccCcEEEEeCceeeecchhhhHHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc Confidence 444566677778889999999999999999999999999 899999999999999999998876533211 Q ss_pred --cccc-------cCC------cchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeee Q lcl|NC_021299. 135 --EKVS-------LVD------EDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQ 199 (387) Q Consensus 135 --~~~~-------~~~------~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~ 199 (387) .... +.. +......+.+|...|++.+||.+.+.++++|++|..|+..+++...+...++ ..... T Consensus 150 ~~g~~~g~s~~v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP~~d~vvl~pp~~Ys~Ll~~dkLvnrdf~~s~-~g~~~ 228 (400) T protein:vir:10 150 PRVKGHGFSVNVEVNEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKSYTISQ-SGATI 228 (400) T ss_pred CCccccccceeecccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhCCcccchhccccC-CCccc Confidence 0000 001 1112345778999999999997655555677877788777777766654333 23467 Q ss_pred eeEEEEeecceeeeeeccceeeee-eeccc-----cccccc-cccccccCceeeeeeecccccceeeeeeeeeeccceee Q lcl|NC_021299. 200 TARIGRLAQYDVVTVDTLPHGDAY-LSHPT-----AYAMLT-RSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTER 272 (387) Q Consensus 200 ~g~ig~~~g~~v~~s~~~~~~~~~-~~~~~-----a~~~~~-~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~ 272 (387) .|.+..++|+.|++++++|..... ..|.- +..+.. +......+..+.-.+.....-..+.-...++.....+. T Consensus 229 ~g~v~~v~Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~tvk~~~lt~~~~~d~r~~~~~ 308 (400) T protein:vir:10 229 QGFVLSSYNCPVIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALLVGRSIDVIGDIFYEKKEKTYY 308 (400) T ss_pred cceEEEEeceEEEeeCcCCcccCcccccccccCCCCccCCccccccceeEEEEehhheEEEEeeccccccccchhhHHHH Confidence 888999999999999999864211 11111 000000 00001111111111111111111111111111111111 Q ss_pred eeeeeeeeecccccee--------------eeccceecccc---ccceeeeeee---eccccccccccccceeEEE---- Q lcl|NC_021299. 273 SIVDTWIGVKAVLDPV--------------TANLDDEPRFV---RGTRIHLKAT---DAEIEGETVKAGEKLALAL---- 328 (387) Q Consensus 273 ~~~~~~~g~~~~~~~~--------------~~~~~~~~~~v---~~~~v~~~~~---~~~~~~~~~~~~~~~~~~~---- 328 (387) ......+|.....+.. ..+.......+ ...++.+.+. ..+..+..+.... +...+ T Consensus 309 id~~~a~G~g~~RPeaa~vv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 387 (400) T protein:vir:10 309 IDTFMSEGAIPDRWEAVSVVTTKRQSTGAVDSGNAAQHTQVLNRAQRKAVYVKNAAPAGAFAAASLSAED-LVAAVRAVM 387 (400) T ss_pred HHHHHHhCCcccchhheEEEEecCCcccccccCcchhHHHHHhhcccceEEEecccccccccccccchHH-HHHHHHHHH Confidence 1111111111111100 00000000000 0001111000 0000000000000 00000 Q ss_pred eeccCCccccCcc Q lcl|NC_021299. 329 EDSNGDNRAGDPL 341 (387) Q Consensus 329 ~~~~~~~~~~~~~ 341 (387) .-...|..+...+ T Consensus 388 ~~~~~~~~~~~~~ 400 (400) T protein:vir:10 388 ANDIKPTAMKPTE 400 (400) T ss_pred hccccccccCCCC Confidence 0000111111111 No 59 >protein:vir:99523 Length: 311 # NCBI annotation: putative protein # Family: family:all:701 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958538;genbank:gi:41179320;genbank:GeneID:2717161 Probab=99.55 E-value=5.5e-16 Score=104.26 Aligned_cols=268 Identities=10% Similarity=0.013 Sum_probs=150.4 Q ss_pred CccccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccccceEE Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEVTVD 80 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (387) ||-+. -++|.+++.++|.+.+....+.+.++. |- ..|++|+||........+|+... .....+++....+ T Consensus 8 mAlny--a~~~~~~Ld~~~~~~~~t~~l~~~~~~--~~-~Gak~VkIp~i~~~gl~dY~R~~-----g~~~g~v~~~~et 77 (311) T protein:vir:99 8 RGFNY--VTKDGNLLDQKITAGLFTAALGTPEVD--LV-NGGRSFTLKTISTSGLKDHTRGK-----GFNSGTISDEKTI 77 (311) T ss_pred hHHHH--HHHHHHHHHHHHHhhhcccceecCchh--ee-ecCCEEEEEeeeecccccccccc-----CccccceeeeeeE Confidence 55333 689999999999999988888887754 42 23899999999888888887543 2334566677788 Q ss_pred EEEEeeeecceeec--cHHHh---hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccc----------------ccccc Q lcl|NC_021299. 81 IKLTDVIYNRIDLT--DEERE---LDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPY----------------EKVSL 139 (387) Q Consensus 81 ~~id~~~~~~~~~~--d~~~~---~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~----------------~~~~~ 139 (387) .+|++.+++.|.++ |.|++ ..+.+.+.++ +...+++++|.+.++.+..... ..... T Consensus 78 ~tl~~DR~~~f~vD~mDvdETn~~~~~ani~~~f---~r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~~~~~~~~~ 154 (311) T protein:vir:99 78 YTMGQDRDVEFYLDRQDVDETDNELAMANISNVF---ITEHVQPELDSYRFSKIATSFDNLDGTDTEGTLLAKTHKTEET 154 (311) T ss_pred EEeeeccceeeecchhchhhhhhhhHHHHHHHHH---HHhhhcchhhHHHHHHHHhhhhcccccccchhhhccccccccc Confidence 89999999999998 77764 3445555554 4445669999998876542211 12234 Q ss_pred CCcchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeee---ec Q lcl|NC_021299. 140 VDEDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTV---DT 216 (387) Q Consensus 140 ~~~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s---~~ 216 (387) .+..++++.+..+...|++ +|.++|+|+++|+.+..|...++|.+.......+... .++.++.+.|+.+++. .. T Consensus 155 lt~~nvl~~l~~~~~~~~~--v~~~~rvl~vTp~~~~lLk~~~~~~r~~~~~~~~~~~-i~~~V~~lDgv~Ii~V~ps~r 231 (311) T protein:vir:99 155 LDETNAYSQLKTGIGKVRK--YGTQNLVGYVSSEVMDALERSKEFTRNITNQNVGTTA-LESRITSIDGVQLIEVYESNR 231 (311) T ss_pred cCHHHHHHHHHHHHHHHHh--cCCCCeEEEEChHHHHHHhhchhhheeeecccccccc-cccccceecCeEEEEecCchh Confidence 5667789999999999987 6889999999999999877777787654444333332 4677899999988754 22 Q ss_pred cceeeeeeecccccccccc------------ccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccc Q lcl|NC_021299. 217 LPHGDAYLSHPTAYAMLTR------------SPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAV 284 (387) Q Consensus 217 ~~~~~~~~~~~~a~~~~~~------------~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~ 284 (387) +.. .+.+..+ ++.... +...+.-.....-.....+...-.|..++..- .+....+. T Consensus 232 ~~t--~~~ft~G-~~~~~~ak~INfiiv~~~a~i~~~K~~~v~~f~P~~~~~gd~~l~~~R~Y--~D~fv~~n------- 299 (311) T protein:vir:99 232 FMT--KYDFTDG-AKPTEDAKAINFLVVAKPAVISIVKENAVFLFAPGQHTDGDGYLYQNRLY--HDLFIKKH------- 299 (311) T ss_pred hcc--hhhhcCC-ccccCcccccceEEeCCCeeeeeeeeeeeeeeCCCCCCCcceeeeeeeee--eeeeeecc------- Confidence 211 1111111 000000 00000000000000000000000122111110 00010000 Q ss_pred cceeeeccceeccccccceeeeeee Q lcl|NC_021299. 285 LDPVTANLDDEPRFVRGTRIHLKAT 309 (387) Q Consensus 285 ~~~~~~~~~~~~~~v~~~~v~~~~~ 309 (387) ....+.+..... T Consensus 300 -------------k~~~Iyv~~k~A 311 (311) T protein:vir:99 300 -------------KRDGIFVSVKKA 311 (311) T ss_pred -------------ccCeEEEeeecC Confidence 000000100000 No 60 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=99.51 E-value=5.3e-15 Score=98.83 Aligned_cols=297 Identities=12% Similarity=0.053 Sum_probs=163.8 Q ss_pred Cccc----cccHHHHHHHHHHHHHhhccccc--eeeec--cccccc-ccCCCEEEEEecccceeeceecccccccccccc Q lcl|NC_021299. 1 MANA----FIKPPVIIASILGQLQHELVLPN--FVFKN--GYGDVA-HKFNDTITIRIPVPTIAHTRGLRATGADRNMVA 71 (387) Q Consensus 1 Ma~~----~~~pe~~~~~~~~~l~~~~~~~~--~~~~d--~~~~~~-~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~ 71 (387) ||.+ +|+||+|.+++.+++.+.+.|.. ++-++ ...-|. +.+|++|++|.++..... ......+..+.+ T Consensus 1 MA~T~lsd~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd---~~~v~~~~~i~~ 77 (324) T protein:vir:59 1 MAYTKISDVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGD---SQVLNDTDDLVP 77 (324) T ss_pred CCceeeeceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCCCc---ccccCCCcccch Confidence 9955 48999999999999999988722 22121 112232 358999999988765322 223345677888 Q ss_pred cccccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccc---------cccccCCc Q lcl|NC_021299. 72 SDLTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPY---------EKVSLVDE 142 (387) Q Consensus 72 ~~~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~---------~~~~~~~~ 142 (387) +.++..+...++ ++..+++..+|+.......|++.+..+|....++++.++++++.+.+.-. ....+... T Consensus 78 ~~l~t~~~~a~i-~~~~k~~~~tD~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l~g~~~~~~~~~~~~dvsa~~~~ 156 (324) T protein:vir:59 78 QKINAGQDKAVL-ILRGNAWSSHDLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAELAGVFSNDDMKDNKLDISGTADG 156 (324) T ss_pred hhcccceeeEEE-EeecCceeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccceeeeeccccc Confidence 999998888877 57889999999999999999999999999999999999999988765321 11122223 Q ss_pred chhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeecccee-- Q lcl|NC_021299. 143 DEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHG-- 220 (387) Q Consensus 143 ~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~-- 220 (387) ...++.+.+|..+|.++. ..-..+++.|..+..|.++. +.......+ .++.++.+.|..|+.+..+|.. T Consensus 157 ~~s~~~l~~A~~~~GD~~--~~~~~ivmhS~v~~~L~~~~-li~~~~~s~------~~~~i~~~~G~~VivdD~~p~~~~ 227 (324) T protein:vir:59 157 IYSAETFVDASYKLGDHE--SLLTAIGMHSATMASAVKQD-LIEFVKDSQ------SGIRFPTYMNKRVIVDDSMPVETL 227 (324) T ss_pred eecHHHHHHHHHHhCCcc--cCcEEEEEchHHHHHHHHhh-hhhhccccc------cCceeeeecccEEEEeCCCCcccc Confidence 345788999999998864 34467889999999998764 333322211 2456788999999999988753 Q ss_pred -------eeeeeccccccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccc Q lcl|NC_021299. 221 -------DAYLSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLD 293 (387) Q Consensus 221 -------~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 293 (387) ..+.+..+++.+..+....+ ....-....+...-+. ++. ..+ ...|......... T Consensus 228 ~~~~~~y~s~l~~~GAi~~~~~~~~v~-----vE~dRd~~~g~~~l~~-r~~-------~~~-~p~G~s~~~~~~~---- 289 (324) T protein:vir:59 228 EDGTKVFTSYLFGAGALGYAEGQPEVP-----TETARNALGSQDILIN-RKH-------FVL-HPRGVKFTENAMA---- 289 (324) T ss_pred CCCCceEEEEEEecCeEEEeecCCCcc-----eecccCccccceEEEE-eeE-------EEe-EeeeEEecccccC---- Confidence 23334444433322111100 0000000011110000 000 000 0011000000000 Q ss_pred eeccccccceeeeeeeeccccccccccccceeEEEeeccCCccccCcceEEEecCce Q lcl|NC_021299. 294 DEPRFVRGTRIHLKATDAEIEGETVKAGEKLALALEDSNGDNRAGDPLVTWTSGTTA 350 (387) Q Consensus 294 ~~~~~v~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~w~Ssn~~ 350 (387) ..+++-..|..+..+.....+.. -+=+.+.+.-++ T Consensus 290 ----------------~~sPt~~~L~~~~NW~~v~~~k~------i~i~~~~~~~~~ 324 (324) T protein:vir:59 290 ----------------GTTPTDEELANGANWQRVYDPKK------IRIVQFKHRLQA 324 (324) T ss_pred ----------------CCCCChhhhcCCcccccccCccc------cceEEEEeeccC Confidence 00000011111111111000000 000111111111 No 61 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=99.48 E-value=8.7e-15 Score=97.66 Aligned_cols=300 Identities=13% Similarity=0.042 Sum_probs=159.6 Q ss_pred Cccc------cccHHHHHHHHHHHHHhhccccc---eeeec-ccccccccCCCEEEEEecccceeeceeccccccc-ccc Q lcl|NC_021299. 1 MANA------FIKPPVIIASILGQLQHELVLPN---FVFKN-GYGDVAHKFNDTITIRIPVPTIAHTRGLRATGAD-RNM 69 (387) Q Consensus 1 Ma~~------~~~pe~~~~~~~~~l~~~~~~~~---~~~~d-~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~-~~~ 69 (387) ||++ +|+||+|++++.+++.+.+.|-. ++... ....+. .+|+++++|.++...... + ....+ ..+ T Consensus 1 Ma~~~T~l~d~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~-~~G~~i~~P~~~~l~G~~-~--~~~dg~~~i 76 (330) T protein:vir:10 1 MANELTKILDTITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNIT-SGGLLVNMPFWNDLTGDS-E--VLGNGDKAL 76 (330) T ss_pred CCCCceEeeeeechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhh-cCCCEEEecccccCCCcc-c--ccCCCcccc Confidence 9974 38899999999999988877722 33322 122233 389999999987553222 1 12222 458 Q ss_pred cccccccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc------------- Q lcl|NC_021299. 70 VASDLTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEK------------- 136 (387) Q Consensus 70 ~~~~~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~------------- 136 (387) +++.++..+....+ +...+++..+|+...++..|++.+..+|.....++..+..+++.+.+.-... T Consensus 77 ~~~ki~t~~~~a~i-~~~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~l~gvf~~~~~~~~~~~~~~~~ 155 (330) T protein:vir:10 77 ETGKITAGADIACV-LYRGRGWAANELTGVVAGSDPVRAILNRIGAYWLREDQKALIATLNGIFATGTAGEKGALEETHV 155 (330) T ss_pred chhhcccceeEEEE-EeecceeeehhhhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHHHHhhhhhhhcccchhhhhhhe Confidence 88888888888777 5567899999999999999999999999999999999999888776432211 Q ss_pred --cccCCcchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeee Q lcl|NC_021299. 137 --VSLVDEDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTV 214 (387) Q Consensus 137 --~~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s 214 (387) .........++.+.+|..+|.++. ..-..+++.|..+..|.++ .+.......+ .++.++.+.|..|+.+ T Consensus 156 ~~~~~~~a~~s~~~l~~A~~~~GD~~--~~~~~ivmhS~v~~~L~~~-~li~~~~~s~------~~~~i~~~~G~~Vivd 226 (330) T protein:vir:10 156 SDQSKASTGIDAGMVLDAKQLLGDSA--DQVTAIAMHSAVYTKLQKD-NLIQYIQPTT------ATINIPTYLGYRVIID 226 (330) T ss_pred ecccccccccCHHHHHHHHHHhcccc--ccceEEEEcHHHHHHHHHh-hhhhhhcccc------cCcccccccceEEEEe Confidence 011122345688999999998875 3456888999999998874 3444333222 2456788999999999 Q ss_pred eccceee----eeeeccccccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeee Q lcl|NC_021299. 215 DTLPHGD----AYLSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTA 290 (387) Q Consensus 215 ~~~~~~~----~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~ 290 (387) ..+|... .+.+..+++.+..+.+... ......-....+...-+ ..+. ..++ ..|.......... T Consensus 227 D~~p~~~~~yt~yl~~~GAi~~~~~~~~~~---v~~EtdRd~~~g~~~l~-~r~~-------~~~h-p~G~s~~~~~~~~ 294 (330) T protein:vir:10 227 DGIAPTGDIYTSYLFRTGSIGLNTGNPSGL---TTFETSREAAKGNDMIY-TRRA-------LVMH-PYGVKWTGAEVDA 294 (330) T ss_pred CCCCCCCCceeEEEEecCceeeecccCCcc---ccccccCCccccceEEE-EeeE-------EEee-eeeeeeccccccc Confidence 9987542 3344444444332211100 00000000001111100 0000 0000 0111111000000 Q ss_pred ccceeccccccceeeeeeeeccccccccccccceeEEEeeccCCccccCcceEEEecCce Q lcl|NC_021299. 291 NLDDEPRFVRGTRIHLKATDAEIEGETVKAGEKLALALEDSNGDNRAGDPLVTWTSGTTA 350 (387) Q Consensus 291 ~~~~~~~~v~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~w~Ssn~~ 350 (387) .. .+++...|..+..+.....+..- +=+.+.+-=-+ T Consensus 295 ~~------------------~sPt~~~L~~~~NW~~v~~~k~i------~iv~~~~~~~~ 330 (330) T protein:vir:10 295 GN------------------ITPSNADLAKFKNWKRVYEPKNI------GIIALKHKIGK 330 (330) T ss_pred Cc------------------CCcChHHhcCCcCcccccChhhc------ceEEEEEecCC Confidence 00 00000001111111100000000 00000000000 No 62 >protein:vir:78090 Length: 302 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468790;genbank:gi:157325371;genbank:GeneID:5601852 Probab=99.48 E-value=7.1e-15 Score=98.15 Aligned_cols=273 Identities=9% Similarity=-0.007 Sum_probs=140.7 Q ss_pred CccccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccc-----eeeceecccccccccccccccc Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPT-----IAHTRGLRATGADRNMVASDLT 75 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~ 75 (387) |||++=-.++|.+++.+.|...+....|......-+|. -|++|.||..... ...+|+.. ......+++ T Consensus 1 Mantl~ya~~~~~~Ld~~~~~~~~t~~l~~~~~~v~~~--Gak~vkIp~is~~~~~TsGl~dy~R~-----~g~~~g~v~ 73 (302) T protein:vir:78 1 MANSLALAQIYQDNIDKAIAVNSKSAFLEANPNNVQYN--GGNTIKIADISFGSGTTGDLKAYNRS-----TGFTQGSVT 73 (302) T ss_pred CCchhHHHHHHHHHHHHHHHhhhceeecccCCceEEEe--cCcEEEEEEEEeeccccccccccccc-----cCcccccee Confidence 99987446899999999999999888774322111344 4899999987643 34455433 222233445 Q ss_pred cceEEEEEEeeeecceeec--cHHHhh---hhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-------cccccCCcc Q lcl|NC_021299. 76 EVTVDIKLTDVIYNRIDLT--DEEREL---DVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPY-------EKVSLVDED 143 (387) Q Consensus 76 ~~~~~~~id~~~~~~~~~~--d~~~~~---~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~-------~~~~~~~~~ 143 (387) ....+.+|++..++.|.++ |.|++. .+.+.+.++ +...+++++|.+.++.+..... ......+.. T Consensus 74 ~~~et~tlt~DR~~~f~vD~mDvdETn~~~~~ani~~ef---~r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~t~~ 150 (302) T protein:vir:78 74 LAWSDYTLDYDLAQSFQIDAMDVDETKNLATVGNVLSEY---QRTKIVPAIDKYRFTKLANDGTGVGGVIDLSKPDASAQ 150 (302) T ss_pred eeeeeEEeeeccceeeeccccchhhhhhhhHHHHHHHHH---HHhhhcchhhHHHHHHHHHhhhccCccccccccchhHH Confidence 5666677777777888877 776653 344455544 4455679999998866543221 112234677 Q ss_pred hhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeee Q lcl|NC_021299. 144 EIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAY 223 (387) Q Consensus 144 ~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~ 223 (387) +.++.+..+...|+++ ++|+++++|..+..|...+.|.+.......+. ...++.++.+.|+.+++...---...+ T Consensus 151 nvl~~i~~~~~~~~e~----~~~vl~vtp~~~~~Lk~a~~~~~~~~~~~~~~-~~i~~~V~~lDgv~Ii~VPs~r~~t~~ 225 (302) T protein:vir:78 151 ALMGDIATAMELVDDS----NQLILVTSPTTLAGLLNTALIRESKNTQVLRR-GEVDTKITFIQDVEVLQVPSEYLYDKV 225 (302) T ss_pred HHHHHHHHHHHHhhcc----CCeEEEEChHHHHHHhcchhhccceecccccc-ccccceeeeecccEEEEchhhhcccce Confidence 8899999999999985 58999999999888877777765544332222 224677889999988754321111111 Q ss_pred eecccccccccc------------ccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeec Q lcl|NC_021299. 224 LSHPTAYAMLTR------------SPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTAN 291 (387) Q Consensus 224 ~~~~~a~~~~~~------------~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~ 291 (387) .+..+ +..... +...+.-.....-.....+...-.|..++..- .+....+.. .... T Consensus 226 ~f~~G-~~~~~~ak~INfiiv~~~a~ia~~K~~~~~if~P~~~~~gd~~l~~~R~Y--~D~fV~~nk---------~~gI 293 (302) T protein:vir:78 226 APKVG-VPDYTGAKKIPYMIFKRDAPTGIVKTDKVRVFEPDTNQSADAYKVDLRLY--HDLIVPKNQ---------RPGI 293 (302) T ss_pred eccCC-ccccCCccceeEEEECCCeeeeeeeeeeeEeeCCCCCCCcceeeeeeeeE--eeeeeeccc---------cCeE Confidence 11110 000000 00000000000000000000011111111100 000000000 0000 Q ss_pred cceeccccc Q lcl|NC_021299. 292 LDDEPRFVR 300 (387) Q Consensus 292 ~~~~~~~v~ 300 (387) ..+....+. T Consensus 294 ~~~~~~~~~ 302 (302) T protein:vir:78 294 IKASFGTIA 302 (302) T ss_pred EEeeccccC Confidence 000000000 No 63 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=99.44 E-value=3.2e-14 Score=94.53 Aligned_cols=315 Identities=11% Similarity=0.032 Sum_probs=165.1 Q ss_pred Ccccc----ccHHHHHHHHHHHHHhhccccc---eeeecccccccccCCCEEEEEecccceeeceecccccccccccccc Q lcl|NC_021299. 1 MANAF----IKPPVIIASILGQLQHELVLPN---FVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASD 73 (387) Q Consensus 1 Ma~~~----~~pe~~~~~~~~~l~~~~~~~~---~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 (387) ||.+. |+||+|++++.+++.+.+.|-. ++.+.....+...+|+++++|.++.... + ......+..+.++. T Consensus 1 MA~T~lsd~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~G-d--~~~~~~~~~i~~~k 77 (351) T protein:vir:15 1 MAETHLSDLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLTG-D--PDNWTDSDDIDVNN 77 (351) T ss_pred CCceeeeeeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCCC-c--ccccCCCcccchhe Confidence 99775 8999999999999988887732 4433222222224899999998875422 1 22334567788899 Q ss_pred cccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc------------cccCC Q lcl|NC_021299. 74 LTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEK------------VSLVD 141 (387) Q Consensus 74 ~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~------------~~~~~ 141 (387) ++..+...++ ++..+++..+|+.......|++.++..|.....++..++.+++.+++..... .+... T Consensus 78 itt~~~~a~i-~~~~kg~~~tD~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~l~gv~~~~~~~~~~~~d~t~~~~~~ 156 (351) T protein:vir:15 78 LTSGKQQGIK-FYQTKAYGYTDLGTMISGAPVQETIGNRFAAFWQRADQKTLLSVLKGVMGVTKIANSKVYDQTKVSPSE 156 (351) T ss_pred ecccceeEEE-EeeccceehhhhhHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhchhhcccceeccccccccc Confidence 9988888877 6677899999999999999999999999999999999999998876531111 01122 Q ss_pred cchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeecccee- Q lcl|NC_021299. 142 EDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHG- 220 (387) Q Consensus 142 ~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~- 220 (387) ....++.+.+|..+|.+..- ..-..+++.|..+..|.++. +.......+ .++.++.+.|..|+.+..+|.. T Consensus 157 ~~is~~~l~~A~~~~GD~~~-~~~~~ivmhS~v~~~L~~~~-li~~~~~s~------~~~~i~t~~G~~VivdD~~p~~~ 228 (351) T protein:vir:15 157 PMFGAKGFTGAIGLMGDLQD-TAFGAIAVNSATYSLMKVQG-LIETIQPQN------GATPFEAYNGLRIVLDDDIEIDL 228 (351) T ss_pred cccCHHHHHHHHHHhccccc-cceEEEEEChHHHHHHHhhh-hhhhccccc------cCcccceecceEEEEcCCCcccc Confidence 33456889999999976531 11356778999999988754 333332221 2345789999999999998863 Q ss_pred --------eeeeeccccccccccccccccCceeeeeeecccccceeeeeee---eeeccceeeeeeeeeeeeccccceee Q lcl|NC_021299. 221 --------DAYLSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGD---YDATSTTERSIVDTWIGVKAVLDPVT 289 (387) Q Consensus 221 --------~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~---~d~~~~~~~~~~~~~~g~~~~~~~~~ 289 (387) ..+.+..+++.+..+.+... .........+....+.+. ..+-+..+........+.. + .. T Consensus 229 ~~~~~~~ytsyl~~~GAi~~~~~~~~ve-----~~rd~~~~~g~d~l~~r~~~~~hp~G~s~~~~~~~~~~~s---P-t~ 299 (351) T protein:vir:15 229 TDKTKPVSTSYIFAPGAVRYSTNMRSTE-----TKYDPLINGGQDVIVQKRVGTIHVAGTSIKASFSPSKASF---P-TI 299 (351) T ss_pred CCCCCceeEEEEEecceeeeecCCcCcc-----eeecccCCCCceEEEEeeeeeeeeeeeeecccccccCcCC---c-Ch Confidence 23444455544433322110 000011111111111110 0000000000000000000 0 00 Q ss_pred eccceecc--c---cccceeeeeeeeccccccccccccceeEEEeeccCCccccCc Q lcl|NC_021299. 290 ANLDDEPR--F---VRGTRIHLKATDAEIEGETVKAGEKLALALEDSNGDNRAGDP 340 (387) Q Consensus 290 ~~~~~~~~--~---v~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 340 (387) ........ . .....|.+......+.+ .+..+.+....-+..+ ....+ T Consensus 300 ~~L~~~~NW~~v~~~d~k~I~iv~~~~~~~~-~~~~~~~~~~~~~~~~---~~~~~ 351 (351) T protein:vir:15 300 DELAKSSTWEVVDGIDVRSIGVVAYTAQLDP-ALTPGAQMPAADTSTD---TGTTK 351 (351) T ss_pred HHhcCCcccccccCCCccccceEEEEEecCc-ccccCCcCcCCCCccc---cCCCC Confidence 00000000 0 01111111111111110 0111100000000000 01111 No 64 >protein:vir:95451 Length: 313 # NCBI annotation: hypothetical protein ORF044 # Family: family:all:11728 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294637;genbank:gi:149408203;genbank:GeneID:5237018 Probab=99.27 E-value=1.2e-13 Score=91.49 Aligned_cols=278 Identities=16% Similarity=0.174 Sum_probs=161.0 Q ss_pred Cccc--cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccccce Q lcl|NC_021299. 1 MANA--FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEVT 78 (387) Q Consensus 1 Ma~~--~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (387) -+|+ ++..|+|+++++..|++.| ++.-++|+.. +|. .|++.|||..+..+.+.. .+.++..++++.++. T Consensus 4 TSNT~A~I~SE~~s~~I~~~LH~~L-L~~~~~R~V~-DF~--~G~~L~I~tiGs~~~~~~-----~E~~~~~~~~i~TGE 74 (313) T protein:vir:95 4 TSNTRAFIESEQYSKFILLNLHDGL-LPETFYRNVS-DFG--SGETLHIKTIGSVTLQEA-----EEDTPLIYNPIETGE 74 (313) T ss_pred cccchheehhhhHHHHHHHHhhccc-cchhhhhhhc-cCC--CCCEEEecccCceeeecc-----ccCCCeeecccccce Confidence 2343 4779999999999999998 6665667654 463 499999988777766653 467789999999999 Q ss_pred EEEEEEeeeecceeeccHH--HhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhc-------------cccc-ccccCCc Q lcl|NC_021299. 79 VDIKLTDVIYNRIDLTDEE--RELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITK-------------APYE-KVSLVDE 142 (387) Q Consensus 79 ~~~~id~~~~~~~~~~d~~--~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~-------------~~~~-~~~~~~~ 142 (387) +++.|..++.-++.++++- ..-.+.+++.+..+++.+++-+..+.|++++-.. .|+. +++.... T Consensus 75 It~~i~~Y~G~A~~vt~~LR~D~~~I~~~~A~~~AE~~RAI~E~~~TD~L~~G~~~FA~~~~P~~vNG~PH~~V~~~T~~ 154 (313) T protein:vir:95 75 ITFQITEYKGDAWYVTDDLREDGTDIDRLMAERAAESTRAIQETFETDFLKTGAEYFAANPGPHNVNGFPHVIVSAETNG 154 (313) T ss_pred EEEEEEeecCChhhhhhhhhhcchhHHHHhhhcchhhHHHHHHHHhhHHHhhchhhhccCCCCcccccccceEEeccCCc Confidence 9999999999999999864 3456777888888999999999999999876432 1111 2334455 Q ss_pred chhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeE------EEEeecceeeeeec Q lcl|NC_021299. 143 DEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTAR------IGRLAQYDVVTVDT 216 (387) Q Consensus 143 ~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~------ig~~~g~~v~~s~~ 216 (387) ......+.+++-.+++.++|.+||+.+++|..+.-|...-.+.+ .+.+.+...+..|. +-++||++++.|+. T Consensus 155 ~~~~~~~~~~~~~~~~a~~P~~G~v~IvDP~~~~~L~~l~~It~--~vt~~~k~I~ESG~A~~~~Fi~~~YG~Di~~SN~ 232 (313) T protein:vir:95 155 VFALKHLIAMRLAFDKANVPAEGRVFIVDPVAEATLNGLVTITH--DVTDFGKMILESGMARGQRFIMNLYGWDILTSNR 232 (313) T ss_pred eehhhHHHHhhhhhhhccCCccceEEEEcchhhhhhhhhheeec--ccccccceeeeccCCchhHHHHHHhhhhhhhhhh Confidence 67778999999999999999999999999998877755332222 12222223334443 45678999998887 Q ss_pred cceeeeeeeccccccccccccccccCceeeeeeec-ccccceeeeeeeeeec-----cce-eeeeeeeeeeeccccce-e Q lcl|NC_021299. 217 LPHGDAYLSHPTAYAMLTRSPGRPMTNTVATSTVA-TENGVQLRWLGDYDAT-----STT-ERSIVDTWIGVKAVLDP-V 288 (387) Q Consensus 217 ~~~~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~-~~~~~~~~~~~~~d~~-----~~~-~~~~~~~~~g~~~~~~~-~ 288 (387) +...+......+ ..+.-.. -+. .... ........|..--... .+. ....+..-+|....... . T Consensus 233 L~~AN~~D~~tT----~~G~~~N----lFM-~i~D~~~~P~~~AWr~MP~s~~~~~~~~~~~~~~~~~R~G~Gi~R~~~L 303 (313) T protein:vir:95 233 LHVANYNDGTTT----GNGYVGN----LFM-CILDDQTKPIMGAWRRMPKSEGERNKDRARDEHVVRCRYGFGIQRLDTL 303 (313) T ss_pred hhhccccccccc----cCceeee----eee-eeecccccceeeeeccccccccccccccccccceeeeeecccceeecce Confidence 654322211111 0000000 000 0000 0000000111000000 000 00001111111110000 0 Q ss_pred eeccceeccc Q lcl|NC_021299. 289 TANLDDEPRF 298 (387) Q Consensus 289 ~~~~~~~~~~ 298 (387) .+...+...+ T Consensus 304 ~~~~~~A~~~ 313 (313) T protein:vir:95 304 GLLATSATAY 313 (313) T ss_pred eEEEeccccC Confidence 0000000000 No 65 >protein:vir:5202 Length: 448 # NCBI annotation: major head protein # Family: family:all:4054 # MgeID: mge:116 # MgeName: PZA # Cross-refs: genbank:acc:NP_040725;genbank:gi:9626396;genbank:GeneID:1260967 Probab=99.18 E-value=5.3e-12 Score=82.40 Aligned_cols=340 Identities=12% Similarity=0.103 Sum_probs=138.2 Q ss_pred CccccccHHHHHHHHHHH---HHhhccccceeeecccccccc---cCCCEEEEEecccceeeceeccccccccccccccc Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQ---LQHELVLPNFVFKNGYGDVAH---KFNDTITIRIPVPTIAHTRGLRATGADRNMVASDL 74 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~---l~~~~~~~~~~~~d~~~~~~~---~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (387) |.++.+.-|.+...+.|. |-+.+.+ ++.-..|.. ..|+++.=..-....-..|+... ++-.+-...++ T Consensus 52 ~~~~~~~nef~~sLi~rIg~~~~~~~s~-----~NPL~~Fk~~~~~~g~~ieei~~d~~~~~~yd~~~-~e~~~F~~~~p 125 (448) T protein:vir:52 52 LINQTVQNDFITSLVDRIGLVVIRQVSL-----NNPLKKFKKGQIPLGRTIEEIYTDITKEKQYDAEE-AEHKVFEREMP 125 (448) T ss_pred hhhHHHHHHHHHHHHHhhhhheeccccc-----cchHHHHhhccccchhhhhhheeccccceeechhh-hcccccccCCC Confidence 665555555554432221 1111111 111122221 23566532222222222222211 11222223333 Q ss_pred ccceEEEEEEeeeecceeeccHHH--hhhhhhHHHHHHHHHHHHHH--HHHHHHHH-HH-Hhcccc----c---ccccCC Q lcl|NC_021299. 75 TEVTVDIKLTDVIYNRIDLTDEER--ELDVRSFAVDVLPRQVRAVA--EQIEDAVS-YL-ITKAPY----E---KVSLVD 141 (387) Q Consensus 75 ~~~~~~~~id~~~~~~~~~~d~~~--~~~~~~~~~~~~~~~~~~la--~~vd~~~~-~~-~~~~~~----~---~~~~~~ 141 (387) .-...-.+.+++.++-+.+.|..+ +.....-..+++.+...++. ..+|.+.. .+ +..+-. . .....+ T Consensus 126 ~vka~~h~~~r~~~y~~ti~~~~~~~aF~s~~~~d~~~~~i~~s~~~s~~~~ey~~~~~li~~~~~k~l~~~~~i~d~~t 205 (448) T protein:vir:52 126 NVKTLFHERNRQGFYHQTIQDDSLKTAFVSWGNFESFVSSIINAIYNSAEVDEYEYMKLLVDNYYSKGLFTTVKIDEPTS 205 (448) T ss_pred cceeeeeeccCcceeEEEEehhHHHHHHhhhcchHHHHHHHHHHHhcccchHHHHHHHHHHHHhhhccCeEEeeCCCccc Confidence 444455667777788888887543 33333345666666665555 34444422 11 111111 0 111112 Q ss_pred cchhHHHHHH-HHHHHhhccCCc----------------CCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEE Q lcl|NC_021299. 142 EDEIWNGVVS-NRRWLNEQKVPK----------------DGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIG 204 (387) Q Consensus 142 ~~~~~~~i~~-a~~~l~~~~vp~----------------~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig 204 (387) ....+..+++ ++..-.+...|. ++.++++++++...|- .+.|..+.+.... . .-+.+- T Consensus 206 ~~~~~~~~~k~~r~~~~~~~lp~~~~~~N~~~v~~~~~~~dl~li~~~~~~~~ld-v~~la~afn~~~~--~--~~~~~~ 280 (448) T protein:vir:52 206 STGALTEFVKKMRATARKLTLPQGSRDWNSMAVRTRSYMEDLHLIIDADLEAELD-VDVLAKAFNMNRT--D--FLGNVT 280 (448) T ss_pred chhHHHHHHHHHhhhhhheeCCCCCcccccccccccccceeeEEEECCCceEeec-HHHHHHHhccccc--c--cCcceE Confidence 2233443333 222222223332 3346777777654431 2234444433221 1 111222 Q ss_pred EeecceeeeeeccceeeeeeeccccccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccc Q lcl|NC_021299. 205 RLAQYDVVTVDTLPHGDAYLSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAV 284 (387) Q Consensus 205 ~~~g~~v~~s~~~~~~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~ 284 (387) .+.||.- .++. .+.+...++.. .-. ...-... .+...+.+..++.. T Consensus 281 ~vd~F~~---~g~~---~i~vskk~~~~-~d~-------~~kg~t~--~na~GL~~N~~~TI------------------ 326 (448) T protein:vir:52 281 VIDGFAS---TGLE---AVLVDKDWFMV-YDN-------LHKMETV--RNPRGLYWNYYYHV------------------ 326 (448) T ss_pred EecCccc---cCce---eeeeeeeeeee-eec-------cceeeee--eccccceeeeeeEE------------------ Confidence 3344421 1110 11111111000 000 0000000 00000101000000 Q ss_pred cceeeecc-ceeccccccceeeeeeeeccccccccccccceeEEEeeccCCccccCcceEEEecCceE-EEEcCCceEEE Q lcl|NC_021299. 285 LDPVTANL-DDEPRFVRGTRIHLKATDAEIEGETVKAGEKLALALEDSNGDNRAGDPLVTWTSGTTAK-ATIDANGVVTG 362 (387) Q Consensus 285 ~~~~~~~~-~~~~~~v~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~w~Ssn~~V-AtVd~~G~VTa 362 (387) ........ ......+....+.+....+++...++..|.+++++++.+ +....++.|+|++||.++ +|||++|++|+ T Consensus 327 tatss~~~~t~atA~V~~t~paVtsVsVsPttasL~~G~TqqlTATVs--g~na~~~~VTWSvS~ns~~aTVsssG~vTv 404 (448) T protein:vir:52 327 WQTLSVSRSANAVAFVSGDVPAVTQVIVSPNIAAVKQGGKQQFTAYVR--ATDGKDHKVVWSVEGGSTGTAITGDGLLSV 404 (448) T ss_pred EEEEccCccccceEEEEecccccceEEEcccceeecCCCeEEEEEEEe--cCCCCCCceEEEEcCCceeeEEeCCccEEe Confidence 00000000 000011111112233344555666778888888777665 333445779999998787 89999999999 Q ss_pred EecceEEEEEEEC--CEEEEE--------EEEEeC Q lcl|NC_021299. 363 VAAGTSEITAVVD--GLTVKK--------TITVTA 387 (387) Q Consensus 363 ~~~Gta~Itat~~--~~~~~~--------~vtVta 387 (387) .+.|+++|||++. ..++.+ .|+|+. T Consensus 405 ~a~gTatITVtATvdts~a~~~~~vv~ea~VsvtP 439 (448) T protein:vir:52 405 SGNEENQLTVKATVDIGTEDKPNLVVGEAVVSIRP 439 (448) T ss_pred ccCCcceEEEEEEecCcccCCceeeeeeEEEEecC Confidence 9999999999753 222222 222222 No 66 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=98.75 E-value=1.4e-09 Score=69.15 Aligned_cols=275 Identities=13% Similarity=0.053 Sum_probs=132.9 Q ss_pred Ccccccc-------HHHHHH-----HHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccc Q lcl|NC_021299. 1 MANAFIK-------PPVIIA-----SILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRN 68 (387) Q Consensus 1 Ma~~~~~-------pe~~~~-----~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~ 68 (387) ||-+.++ |+.+.- .-+..|.+.| .+.. ......|+||++|++... .+.. ..+++.. T Consensus 1 mAe~nlt~~~dL~~~~sidfv~~f~~~i~~L~~~L---gi~r-----~~p~a~G~tIt~pK~~~t--gda~--dVaEGe~ 68 (295) T protein:vir:99 1 MAEKNLNTMADLGDIKSIDFVNKFSKNINDLLKLL---GVTR-----RETLTNDLKIQTYKWEVT--LDQT--DPGEGET 68 (295) T ss_pred CCCcccccHhhccCceeehhhHHhhhhHHHHHHHh---cccc-----ccccccCCeEEeeeeeee--cccc--cccCCcc Confidence 8877543 332210 1122233333 1111 112245999999886533 2222 3456777 Q ss_pred ccccccccc---eEEEEEEeeeecceeeccHHH-hhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcch Q lcl|NC_021299. 69 MVASDLTEV---TVDIKLTDVIYNRIDLTDEER-ELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDE 144 (387) Q Consensus 69 ~~~~~~~~~---~~~~~id~~~~~~~~~~d~~~-~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~ 144 (387) ++.+.++.+ ..++++.|+. + .++||.. ...+.+...+.-+|..++|+++||++++..++.++.... ... T Consensus 69 Iplskvt~~~~~t~t~kikK~r-K--~tTdEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lktat~t~t----g~~ 141 (295) T protein:vir:99 69 IPLSKVTRTKDKDYTVKWFKKR-R--ATTAEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKTKPTKVK----GVG 141 (295) T ss_pred cchhhheeeeeeeeEEEeeeec-c--cccHHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhccCceeee----hhh Confidence 888888865 4667774432 3 3589985 789999999999999999999999999988876655432 122 Q ss_pred hHHHHHHHHHHHhhccCCc-CCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecce-eeeeeccceeee Q lcl|NC_021299. 145 IWNGVVSNRRWLNEQKVPK-DGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYD-VVTVDTLPHGDA 222 (387) Q Consensus 145 ~~~~i~~a~~~l~~~~vp~-~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~-v~~s~~~~~~~~ 222 (387) .-..+..+..+|+.++-.. ...+++++|...+.++++.... ++....-+...+ -++.|++ ++.+..+|.+.. T Consensus 142 lq~a~a~~~~al~~f~Ee~~~~~V~FVnP~D~a~yl~~A~~~-~~~a~~fG~~~L-----~nfLG~q~II~S~kv~~G~~ 215 (295) T protein:vir:99 142 LQKALSASWAKLATFNEFEGSPLVSFVSPLDVANYLGDTKVG-ADASNVFGMTLL-----KNFLGMQNVIVMPSVPEGKI 215 (295) T ss_pred HHHHHHHhhhhhhhcccccCCceEEEEehHHHHHHHhccccc-cchhhhhhhhhh-----hhhhccceEEEcccCCCceE Confidence 2223333333444433222 2358889999999999876542 222211122222 2588997 999999999888 Q ss_pred eeecccccccccccccc-ccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccc- Q lcl|NC_021299. 223 YLSHPTAYAMLTRSPGR-PMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVR- 300 (387) Q Consensus 223 ~~~~~~a~~~~~~~~~~-~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~- 300 (387) +......+.+....... ..+..+.... ...|.. ...-+.+... ..+......+..-.+.... T Consensus 216 ~aT~~~Ni~~ay~~~~~g~l~~~f~~~~--D~tglI-g~~h~~~~~~-------------~t~et~~~~~~~lfpE~~dg 279 (295) T protein:vir:99 216 YSTAVENLVFASLNVKGGDLGGLFADFT--DETGLI-AAARNRQLSN-------------LTYESVFFGANVLFAEIPEG 279 (295) T ss_pred EEeeccceEEEEecCCchhhhhhhhhcc--Ccccce-EEEeccccce-------------eeehhhhHhHHHhcccccce Confidence 76665555443332110 0111111000 000000 0000000000 0000000000000000000 Q ss_pred cceeeeeeeecccccccccccc Q lcl|NC_021299. 301 GTRIHLKATDAEIEGETVKAGE 322 (387) Q Consensus 301 ~~~v~~~~~~~~~~~~~~~~~~ 322 (387) ++..++.. +..-+.|. T Consensus 280 iv~~tI~~------~~~~~~~~ 295 (295) T protein:vir:99 280 VVEATIEA------AAVPGIGG 295 (295) T ss_pred EEEEEEec------CcCCCCCC Confidence 00001100 00011111 No 67 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=98.64 E-value=5.3e-09 Score=65.96 Aligned_cols=295 Identities=12% Similarity=0.013 Sum_probs=147.1 Q ss_pred Cccc--------cccHHHHHHHHHHHHHhhccc--cceeeeccccccc---ccCCCEEEEEecccceeeceecccccccc Q lcl|NC_021299. 1 MANA--------FIKPPVIIASILGQLQHELVL--PNFVFKNGYGDVA---HKFNDTITIRIPVPTIAHTRGLRATGADR 67 (387) Q Consensus 1 Ma~~--------~~~pe~~~~~~~~~l~~~~~~--~~~~~~d~~~~~~---~~~gdtv~i~~~~~~~~~~~~~~~~~~~~ 67 (387) ||-- +|+||+|.+.+.++-.+...| ..++-+| .+|. ...|++|+||.++.....+..+....... T Consensus 1 M~~~~~~T~l~Dii~pEvF~~Yv~~~~~e~~~l~qSGiv~~d--~~l~~~~~~gG~~v~iPf~~~L~g~~~n~~~d~~~~ 78 (367) T protein:vir:80 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASN--DFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNV 78 (367) T ss_pred CcchhhhhhhhhccchhhhhHHHhhhhhhhhhhhhcceeecC--HHHHHHhhcCCCEEEeeeeccCCCCccccCCCCCcc Confidence 9922 399999999999988766665 3344333 2333 36799999999877654332221111122 Q ss_pred cccccccccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccc------------ Q lcl|NC_021299. 68 NMVASDLTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYE------------ 135 (387) Q Consensus 68 ~~~~~~~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~------------ 135 (387) .+.+..++..+..-.+ .+..++|..+|....+.-.|++..+..|-..--.+.-.+.+++.+++.-.. T Consensus 79 ~~t~~kittg~~~a~v-~~r~kaw~~~Dla~~lsG~dpm~~Ia~qva~yW~r~~q~~Lla~L~Gvf~~~~a~~~~~~~~~ 157 (367) T protein:vir:80 79 EAPIDGLGSGEMKTTK-TWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTR 157 (367) T ss_pred cccccccccchheeee-ehhcccchhhhHHHHhhCchHHHHHHHHHHHHhhhhhHHHHHHHHHHhhccccccchhhhhhh Confidence 3555666666554433 455678888888777777788888877766555555555566655432111 Q ss_pred -----------------cc-ccC--CcchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccc Q lcl|NC_021299. 136 -----------------KV-SLV--DEDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGA 195 (387) Q Consensus 136 -----------------~~-~~~--~~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~ 195 (387) .. .+. ......+.+.+|+..|.++. ..=..+++.|..+..|.+.. +.......+ T Consensus 158 ~~~~a~~~~~~~~~~~Dis~~t~~~~~~~s~~~~~~A~~~lGD~~--~~l~~i~mHS~V~~~L~~~~-li~~i~~sd--- 231 (367) T protein:vir:80 158 GRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHV--GSIAAIAVHSMVYKRMTNND-EIEFIPDSK--- 231 (367) T ss_pred hccccccccccCceeeeeeccCCCccceecHHHHHHHHHHhcccc--ccccEEEEchHHHHHHHhcc-ccccccCCC--- Confidence 00 111 12244678999999998864 34467889999999988764 333222221 Q ss_pred eeeeeeEEEEeecceeeeeecccee--------eeeeeccccccccccccccccCceeeeeeecc--cccceeeeeee-- Q lcl|NC_021299. 196 SRLQTARIGRLAQYDVVTVDTLPHG--------DAYLSHPTAYAMLTRSPGRPMTNTVATSTVAT--ENGVQLRWLGD-- 263 (387) Q Consensus 196 ~~~~~g~ig~~~g~~v~~s~~~~~~--------~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~--~~~~~~~~~~~-- 263 (387) ....++...|..|+.+..+|+. ..+.+..+++.+..+.+..+. ........ ..+...-+.+- T Consensus 232 ---~~~~i~ty~G~~VIvDD~~Pv~~~~a~~~yttYlfg~GAi~~~~~~~~~~~---E~~Rd~~~~~~gG~d~L~~Rr~~ 305 (367) T protein:vir:80 232 ---GQLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPV---AVGRRELRGNGSGLEYILERKEW 305 (367) T ss_pred ---CccccceecceeEEEeCCCcccccCCCceEEEEEEecceeeecccCCccce---ecccchhhhcCCceEEEEeeeeE Confidence 1345788889999999999863 234455555555444322211 11111111 11222222221 Q ss_pred -eeeccceeee--e---eeee--eeecc-ccceeeeccceec---ccccccee---eeeeee Q lcl|NC_021299. 264 -YDATSTTERS--I---VDTW--IGVKA-VLDPVTANLDDEP---RFVRGTRI---HLKATD 310 (387) Q Consensus 264 -~d~~~~~~~~--~---~~~~--~g~~~-~~~~~~~~~~~~~---~~v~~~~v---~~~~~~ 310 (387) .-+.+..... . .++. .|... ............. .......| .+.... T Consensus 306 ~~hP~G~s~~~~~v~~~~~~~~~~~~~~~~~sPt~~eLa~~~NW~~v~d~K~I~iv~~it~g 367 (367) T protein:vir:80 306 IVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLANPDNWERVTYRKNVPMAFLVTKG 367 (367) T ss_pred EeecceeeecccccccccccccccccccccCCCChHHhcCCcccccccchhhcceEEEEecC Confidence 0011111000 0 0000 00000 0000000000000 00011111 111111 No 68 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=98.50 E-value=1.4e-08 Score=63.65 Aligned_cols=275 Identities=9% Similarity=0.030 Sum_probs=132.5 Q ss_pred Ccc--ccccHHHH------------HHHHHHHHHhhc---cccceeeecccccccccCCCEEEEEecc-cceeeceeccc Q lcl|NC_021299. 1 MAN--AFIKPPVI------------IASILGQLQHEL---VLPNFVFKNGYGDVAHKFNDTITIRIPV-PTIAHTRGLRA 62 (387) Q Consensus 1 Ma~--~~~~pe~~------------~~~~~~~l~~~~---~~~~~~~~d~~~~~~~~~gdtv~i~~~~-~~~~~~~~~~~ 62 (387) |+. ++..++-+ ++ -+..|.+.| ++.++ ..|.+++++++. +....+. .. T Consensus 1 M~~e~nl~~~~dL~~a~siDF~~~f~~-~i~~L~~~LGv~r~~pl-----------a~Gt~iktyK~~~~~y~gda--~d 66 (303) T protein:vir:10 1 MSAENNLINVEALGKAKSIDFANKLGV-GLNKLFEALAIQNKIPM-----------NVGSALKQYRFKVEDSEKPN--GD 66 (303) T ss_pred CCCCcCCcchhhcccceeehhhhhhhh-hHHHHHHHhhhhccccc-----------cCCceeeeeeeeceeecccc--cc Confidence 662 23333332 22 233333444 22222 247788776532 3222222 23 Q ss_pred ccccccccccccccc---eEEEEEEeeeecceeeccHHH-hhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc Q lcl|NC_021299. 63 TGADRNMVASDLTEV---TVDIKLTDVIYNRIDLTDEER-ELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVS 138 (387) Q Consensus 63 ~~~~~~~~~~~~~~~---~~~~~id~~~~~~~~~~d~~~-~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~ 138 (387) .+++..++.+.++.. ..++++.|+. + .++||.. ..++.+...+.-+|..++|+++|+++++..++.+...... T Consensus 67 VaEGe~Iplskvt~~~~~t~~~~~kK~r-K--~tTdEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~lktaT~t~~~ 143 (303) T protein:vir:10 67 VAEGDVIPLTKVTREQVDITELQFAKYR-K--STSAEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFETLKSAIENGKR 143 (303) T ss_pred ccCCcccchhhheeeecceEEEEeeccc-c--cccHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhhccccccc Confidence 346677878888754 5677786543 3 3399985 7899999999999999999999999999888776554433 Q ss_pred cCCcchhHHHHHHHHH----HHhhccCCcCCcEEEEchHHHHHHhcccchhhh-hhcccccceeeeeeEEEEeecceeee Q lcl|NC_021299. 139 LVDEDEIWNGVVSNRR----WLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRY-DSAGEAGASRLQTARIGRLAQYDVVT 213 (387) Q Consensus 139 ~~~~~~~~~~i~~a~~----~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~-~~~g~~~~~~~~~g~ig~~~g~~v~~ 213 (387) +.+....++.+..|-. +|+...--...-+++++|...+.++++...... ...|. ..+ -++.|+.++. T Consensus 144 t~~t~~s~~glq~Al~~~~~kl~~~~ed~~~~V~FvNP~Daa~yl~~A~i~~~~t~fG~---n~L-----~nfLG~~II~ 215 (303) T protein:vir:10 144 TNKTKLSAENLQGALSKGRANLSVLLDDEITPIAFVNPNDTAEYLANGFINSTGAQFGV---NLL-----TPYVGVKIVE 215 (303) T ss_pred ccceeecHHHHHHHHHhhhhhccccccccccEEEEEchHHHHHHhhcCCcchhhhhhhh---hhh-----hhhhcceEEE Confidence 3333333444444433 332222112235788999999999987654321 22222 222 2488999999 Q ss_pred eeccceeeeeeeccccccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccc Q lcl|NC_021299. 214 VDTLPHGDAYLSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLD 293 (387) Q Consensus 214 s~~~~~~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 293 (387) +..+|.+..+......+.+......-..+..+..+.. ..|... ..-+.+. ....+......+.. T Consensus 216 S~kv~~G~~~~T~~~Ni~~ay~~~~g~l~~~f~~t~D--~tglIG-v~h~~~~-------------~~~t~eT~~~~~~~ 279 (303) T protein:vir:10 216 FADVPQGEVWMTVAENLNVAYANPRGELSRAFAFATD--ATGFVG-VLHDIQP-------------QRLTSDTIYASAIS 279 (303) T ss_pred eccCCCceEEEeeccceEEEEecCchhhhhhhhhccc--cccceE-EEecccc-------------ceeeehhHhHhHHH Confidence 9999998877665555544433221111111111100 011100 0000000 00000000000000 Q ss_pred eeccccc-cceeeeeeeecccccc Q lcl|NC_021299. 294 DEPRFVR-GTRIHLKATDAEIEGE 316 (387) Q Consensus 294 ~~~~~v~-~~~v~~~~~~~~~~~~ 316 (387) -.+.... ++..++.....+-.+. T Consensus 280 lfpE~~dgiv~~ti~~~e~~~~~~ 303 (303) T protein:vir:10 280 MFPENIDAVIKVTIKKDEAGELPS 303 (303) T ss_pred hcccccceEEEEEEeccccCCCCC Confidence 0000000 0011111100000000 No 69 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=98.49 E-value=1.6e-08 Score=63.30 Aligned_cols=273 Identities=11% Similarity=0.069 Sum_probs=137.6 Q ss_pred CccccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccce-eeceecccccccccccccccccceE Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTI-AHTRGLRATGADRNMVASDLTEVTV 79 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (387) |++ |+++..++++.+++.. +.....+.++ .+.+..+....-.+.. ..++... .++..+...+...+.. T Consensus 22 l~~----P~~I~~~i~e~~~~~~-iad~lf~~~~----a~~~~~v~f~~~~p~~~~~d~e~V--aEggEiP~~~~~~G~~ 90 (318) T protein:vir:10 22 VGN----PLWIPTALKKMMVNQF-ISESLFRNGG----ANPNGVVAYNEGNPSFLEDDVADV--AEFGEIPVSAGARGLP 90 (318) T ss_pred hCC----chhHHHHHHHHHhccc-hhhhhhhccc----ccccceeEEEecccccccCcHhhc--cCcccccccCCCCCch Confidence 444 7777777666664444 4454445533 2335566665422221 2233322 3444555566666666 Q ss_pred EEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc----cccCCcchhHHHHHHHH-- Q lcl|NC_021299. 80 DIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEK----VSLVDEDEIWNGVVSNR-- 153 (387) Q Consensus 80 ~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~----~~~~~~~~~~~~i~~a~-- 153 (387) .+-.-+.....+.++||.......+.+.+.++++..++++.+|+.++..+..+.... +...+......++++|. T Consensus 91 ~ia~~~K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~~~~s~~w~~~~~~~~d~~~A~e~ 170 (318) T protein:vir:10 91 RTAFAVKKALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPTLAVPTAWDNGGKVRTDIAIAIEQ 170 (318) T ss_pred hhhhhehhccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCCcCCCCcccccccchhhhhh Confidence 665545667899999999999999999999999999999999999998775542211 11111122222322222 Q ss_pred -----HHHhhccC-------CcCCcEEEEchHHHHHHhcccchhhhhhcccccceee---eeeE-EEEeecceeeeeecc Q lcl|NC_021299. 154 -----RWLNEQKV-------PKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRL---QTAR-IGRLAQYDVVTVDTL 217 (387) Q Consensus 154 -----~~l~~~~v-------p~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~---~~g~-ig~~~g~~v~~s~~~ 217 (387) ..+..+.. .-.-..+|+.|..+..|++++.+.+.-. +.+..... ..|- -+++.|++|..+..+ T Consensus 171 v~~a~~~~~~a~~~~~~~~~GY~pdtIVlhP~~~~~l~~n~~~~~~y~-~~a~~~~~~~~~tg~~~g~~lGl~vi~s~~~ 249 (318) T protein:vir:10 171 ISTAAPTAYPAGVGSSDEYFGFIPDTIVMHYALLPILMDNENFMKVYE-RNANYVSTAPDWTGNFPGSVMGLNVIRSRTF 249 (318) T ss_pred hhhhhhhhhhhhhhhhhhccCccceeeEECHHHHHHHhcchhhhhhhh-ccchhhhhcccccccccceeeceEEeecCcc Confidence 22211111 1112479999999999998877655321 11110110 1222 356799999999999 Q ss_pred ceeeeeeeccccccccccccccccCceeeeee-ecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeee Q lcl|NC_021299. 218 PHGDAYLSHPTAYAMLTRSPGRPMTNTVATST-VATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTA 290 (387) Q Consensus 218 ~~~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~-~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~ 290 (387) |.+..+.+..+..++..... +...+..+.. .....+..-.|..+.. - .....+........++..... T Consensus 250 p~~~alvlq~g~vG~~~d~~--pl~~t~~~~egg~~~g~~~~s~~~~~~--~-~~~~~V~~PkA~~~itgi~~~ 318 (318) T protein:vir:10 250 PIDRVLIMERGTVGFYSDTR--PLQFTALYPEGNGPNGGPTESYRADAS--H-KRALAVDQPKAALWLTGIVTP 318 (318) T ss_pred CCCeeEEEecCCcceeeccc--cceeeecccCCCCCCCCcchhhheehh--e-eeeeeeeCcceeEEEeeccCC Confidence 99888777766555433211 1110000000 0001111111211100 0 000011111110000000000 No 70 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=98.44 E-value=1.1e-07 Score=58.73 Aligned_cols=271 Identities=10% Similarity=0.006 Sum_probs=130.5 Q ss_pred Cccc-----------cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccc Q lcl|NC_021299. 1 MANA-----------FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNM 69 (387) Q Consensus 1 Ma~~-----------~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~ 69 (387) |..+ .++|+.++.++++.+++..++..++..- . -.+.+.++|+.....+.- .+++..+ T Consensus 1 ~g~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~-----~-~~~~~~~~~~~~~~~a~~-----v~E~~~~ 69 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKAV-----P-MTKPEEEFTFMSGVGAFW-----VDEAERI 69 (299) T ss_pred CCcCCCcccccCCCceecchhHHHHHHHHHHhcchhhhhceee-----e-cCCCcEEEEEEcCCceee-----eecCccc Confidence 3322 2679999999999999999988876421 1 135666777654322221 2344555 Q ss_pred cccccccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHH---------HHhcccccccccC Q lcl|NC_021299. 70 VASDLTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSY---------LITKAPYEKVSLV 140 (387) Q Consensus 70 ~~~~~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~---------~~~~~~~~~~~~~ 140 (387) +..+++-+.+++...+ .+.-+.++++-+.....++...+.++..+++++++|+.++. .+..+........ T Consensus 70 ~~~~~~f~~v~l~~~k-~~~~~~is~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~~~gil~~~~~~~~~~~ 148 (299) T protein:vir:41 70 QTSKPTFTKAKMRSKK-MGVIIPTTKENLNYSVTNFFSLMQAEIVEAFYKKFDQAVFTGVESPYNWNILKSATDASNLVE 148 (299) T ss_pred cccccceeEEEEeeEE-EEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccccccccccceeec Confidence 5566666677776643 44556677765555667888888899999999999998873 1111111111222 Q ss_pred CcchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccc-cceeeeeeEEEEeecceeeeeeccce Q lcl|NC_021299. 141 DEDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEA-GASRLQTARIGRLAQYDVVTVDTLPH 219 (387) Q Consensus 141 ~~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~-~~~~~~~g~ig~~~g~~v~~s~~~~~ 219 (387) .....|++++++...|..++.+ +...+++|..+..|.+...- .|.- ...... +..+.+.|+.++.++.+|. T Consensus 149 ~~~~~~~~l~~~~~~l~~~~~~--~~~~v~n~~~~~~L~~lkd~-----~G~~l~~~~~~-~~~~~l~G~PV~~~~~~~~ 220 (299) T protein:vir:41 149 ETANKYDDLNEAIGLIEAEDLE--PNGIATIRKQRVKYRSTKDG-----NGMPIFNTATS-NGVDDVLGLPIAYTPKYTF 220 (299) T ss_pred cccccHHHHHHHHHhhhcccCC--cCEEEEcHHHHHHHHHhhcc-----CCceeecCCcC-CCCceecceeeEEecccCC Confidence 3446689999998888877654 34678999998888753211 1110 001111 2235788999999888875 Q ss_pred eee---eeec-cccccccccccccccCceeeeeeecccccceeeeeeeeeecccee--eeeeeee-eeeccccceeeecc Q lcl|NC_021299. 220 GDA---YLSH-PTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTE--RSIVDTW-IGVKAVLDPVTANL 292 (387) Q Consensus 220 ~~~---~~~~-~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~--~~~~~~~-~g~~~~~~~~~~~~ 292 (387) +.. +.+. ...+.+ ....+..+....+.......+ ....+.. .+...+........ T Consensus 221 ~~~~~~~~~gdfs~~~i------------------~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~ 282 (299) T protein:vir:41 221 GDKDISELVGDWNQAYY------------------GILRGVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFEVGF 282 (299) T ss_pred CCCceEEEEEecccEEE------------------EEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEecc Confidence 421 0000 000000 000001111000000000000 0000000 00000000000000 Q ss_pred ceeccccccceeeeeeee Q lcl|NC_021299. 293 DDEPRFVRGTRIHLKATD 310 (387) Q Consensus 293 ~~~~~~v~~~~v~~~~~~ 310 (387) .... .-....+...... T Consensus 283 ~v~~-~~A~~~l~~~aa~ 299 (299) T protein:vir:41 283 MVVK-DEAFSAVQPKAGN 299 (299) T ss_pred EEec-ccceEEEEeccCC Confidence 0000 0000000000000 No 71 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=98.41 E-value=8.1e-08 Score=59.45 Aligned_cols=279 Identities=9% Similarity=-0.045 Sum_probs=129.0 Q ss_pred Cccc----cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccccccc Q lcl|NC_021299. 1 MANA----FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTE 76 (387) Q Consensus 1 Ma~~----~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (387) |+.. .++|+.+++++++.+++...+..++.+.. -.+.+++||+......... .+++..++..+++- T Consensus 1 m~t~t~gg~liP~~~~~~ii~~l~~~s~i~~l~~~~~------~~~~~~~ip~~~~~~~a~w----v~E~~~~~~s~~~f 70 (303) T protein:vir:97 1 MGTETSKASLFDKHLVSDLINKVKGHSSLAKLSSQKP------IPFNGSKEFTFTLDSDIDV----VAENGKKTHGGLSL 70 (303) T ss_pred CcccCCCCeEcchhHHHHHHHHHHhhchhhhhcceee------cCCCceEEEEEecCcceEE----eecCccccccccce Confidence 8854 48899999999999999999888875321 1245677876432222211 23445555556666 Q ss_pred ceEEEEEEeeeecceeeccHHHh---hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHh---c-------------cccccc Q lcl|NC_021299. 77 VTVDIKLTDVIYNRIDLTDEERE---LDVRSFAVDVLPRQVRAVAEQIEDAVSYLIT---K-------------APYEKV 137 (387) Q Consensus 77 ~~~~~~id~~~~~~~~~~d~~~~---~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~---~-------------~~~~~~ 137 (387) +++++.. +..+.-+.++++-+. .+..++...+.++..++++.++|..++.-.. + ...... T Consensus 71 ~~v~l~~-~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~~~~~~~ 149 (303) T protein:vir:97 71 EPVTIVP-IKVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNHFDSKVTQVV 149 (303) T ss_pred eeEEeee-EEEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCcccccccccccccccccccc Confidence 6666665 333455566665432 3445677778888899999999998874211 0 011111 Q ss_pred ccCCcchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeecc Q lcl|NC_021299. 138 SLVDEDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTL 217 (387) Q Consensus 138 ~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~ 217 (387) ........|+++.++...+...+.. ....+++|..+..|.+...-...... ....-..+..+++.|+.++.++.+ T Consensus 150 ~~~~~~~~~~~i~~~~~~~~~~~~~--~~~~vmn~~~~~~L~~lkd~~g~~~~---~~~~~~~~~~~~l~G~Pv~~s~~v 224 (303) T protein:vir:97 150 KFTESEDADANIEAAVNLIQGAEGV--VTGLAMDTEFSTALAKVTNGEMGPKM---YPELAWGANPDSINGLKSSVNTTV 224 (303) T ss_pred ccccccchHHHHHHHHHHHhhcCCC--ccEEEEcHHHHHHHHHhhccCCCeEE---ecCccCCCCCceecceeeEEeccc Confidence 2223445688999888888665532 24588999998888642111000000 000011223457889999998888 Q ss_pred ceeeeeeeccccccccccccccccCceeeeeeecccccceeeeeeeeeeccce-eeeeeeeeeeeccccceeeeccceec Q lcl|NC_021299. 218 PHGDAYLSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTT-ERSIVDTWIGVKAVLDPVTANLDDEP 296 (387) Q Consensus 218 ~~~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~-~~~~~~~~~g~~~~~~~~~~~~~~~~ 296 (387) |...........+.+ +. .... -......+..+.+....+..... +....+... .-............. T Consensus 225 ~~~~~~~~~~~~~~~--Gd----f~~~---~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~~--~r~~~r~~~~v~~p~ 293 (303) T protein:vir:97 225 GAGADEAESKDLVII--GD----FESM---FKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQIY--LRAEAYIGWGILDAK 293 (303) T ss_pred CCccccCCCccEEEE--ee----cccc---EEEEEecCcEEEEeeccCCCCcchhhhhcCcEE--EEEEEEeccEeeccc Confidence 753211110000000 00 0000 00000111111111110000000 000000000 000000000111111 Q ss_pred cccccceeee Q lcl|NC_021299. 297 RFVRGTRIHL 306 (387) Q Consensus 297 ~~v~~~~v~~ 306 (387) .++...+..+ T Consensus 294 af~~l~~~~~ 303 (303) T protein:vir:97 294 SFARVTKGEV 303 (303) T ss_pred ceEEeeCCCC Confidence 1111111111 No 72 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=98.36 E-value=2.6e-07 Score=56.69 Aligned_cols=279 Identities=10% Similarity=0.022 Sum_probs=127.6 Q ss_pred Cccc---cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccccc Q lcl|NC_021299. 1 MANA---FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEV 77 (387) Q Consensus 1 Ma~~---~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (387) |+.+ .+.|+.|+.++++.+++..++..++..- . -.+.+++||+....... . -.+++..++..+++-. T Consensus 30 ~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~-----~-~~~~~~~~p~~~~~~~a--~--~v~Eg~~~~~~~~~~~ 99 (324) T protein:vir:99 30 MMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYE-----P-MEGTEKKFTFWADKPGA--Y--WVGEGQKIETSKATWV 99 (324) T ss_pred eccCCCcceechhHHHHHHHHHHhhchhhhhccee-----e-ccCCceEEEEEecCcce--e--EeccCcccccccccee Confidence 3322 2779999999999999999988876422 1 12556788764322111 1 1234555666667777 Q ss_pred eEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccc---------ccccccCCcchhHHH Q lcl|NC_021299. 78 TVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAP---------YEKVSLVDEDEIWNG 148 (387) Q Consensus 78 ~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~---------~~~~~~~~~~~~~~~ 148 (387) .+++...+ ...-+.++++-+.....++...+.++..++++.++|+.++..-...+ ............|++ T Consensus 100 ~v~~~~~k-~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 178 (324) T protein:vir:99 100 NATMRAFK-LGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDN 178 (324) T ss_pred EEEEeeEE-EEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCccCccccccccccceeccccCCHHH Confidence 77776644 34556677665555567888888889999999999998873211110 001111223456899 Q ss_pred HHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeee--c Q lcl|NC_021299. 149 VVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLS--H 226 (387) Q Consensus 149 i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~--~ 226 (387) ++++...|...+.. .-.++++|..+..|.+.. . .. +...+..+..+.+.|+.++.+..++......+ . T Consensus 179 i~~~~~~l~~~~~~--~~~~v~n~~~~~~L~~l~---d--~~---g~~~~~~~~~~~l~G~PVv~~~~~~~~~~~~i~gd 248 (324) T protein:vir:99 179 IIDLEALLEDDELE--ANAFISKTQNRSLLRKIV---D--PE---TKERIYDRNSDTLDGLPVVNLKSSNLKRGELITGD 248 (324) T ss_pred HHHHHHhhhhccCC--CCEEEEcHHHHHHHHHhh---c--CC---CceeecCCCCccccceeEEeecCCCCCcceEEEEe Confidence 99999988876543 235789999988776421 1 11 11223333445678888876655443322111 0 Q ss_pred cccccccccccccccCceeeeeeecccccce--eeeeeeeeeccceeeeeeeeeeeeccccceeeeccceecccccccee Q lcl|NC_021299. 227 PTAYAMLTRSPGRPMTNTVATSTVATENGVQ--LRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRI 304 (387) Q Consensus 227 ~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~--~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v 304 (387) ...+.+. ...+..+............ .....+. ..............+. .......++ .+ T Consensus 249 ~~~~~~~-----~~~~~~i~~~~~~~~~~~~~~~~~~~~~-f~~~~~~~r~~~r~d~---------~v~~~~a~~---~l 310 (324) T protein:vir:99 249 FDKLIYG-----IPQLIEYKIDETAQLSTVKNEDGTPVNL-FEQDMVALRATMHVAL---------HIADDKAFA---KL 310 (324) T ss_pred cccEEEE-----EecCcEEEEeecccccccccccccchhh-hhcCcEEEEEEEEEcc---------EEecccceE---EE Confidence 0000000 0001111000000000000 0000000 0000000000000000 000000000 00 Q ss_pred eeeeeeccccccccccccceeEEEeeccCCccccCcce Q lcl|NC_021299. 305 HLKATDAEIEGETVKAGEKLALALEDSNGDNRAGDPLV 342 (387) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 342 (387) +.........+ .+| T Consensus 311 t~a~~~~~~~~------------------------~~~ 324 (324) T protein:vir:99 311 VPADKKTDSVP------------------------GEV 324 (324) T ss_pred EeccCCCCCCC------------------------CCC Confidence 00000000000 001 No 73 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=98.35 E-value=2.5e-07 Score=56.75 Aligned_cols=279 Identities=10% Similarity=0.025 Sum_probs=127.4 Q ss_pred Cccc---cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccccc Q lcl|NC_021299. 1 MANA---FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEV 77 (387) Q Consensus 1 Ma~~---~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (387) |+.+ .++|+-|+.++++.+++..++..++.+- . -.|.++++|+....... . -.+++...+..+++-. T Consensus 30 ~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~~-----~-~~~~~~~~p~~~~~~~a--~--~v~Eg~~~~~~~~~f~ 99 (324) T protein:vir:96 30 MMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYE-----P-MEGTEKKFTFWADKPGA--Y--WVGEGQKIETSKATWV 99 (324) T ss_pred cccCCCcceechhHHHHHHHHHHhhchhhhhccee-----e-ccCCceEEEEEecCcce--e--eecCCcccccccccee Confidence 3322 2778889999999999999988876432 1 12556788764322111 1 1234555666677777 Q ss_pred eEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcc---------cccccccCCcchhHHH Q lcl|NC_021299. 78 TVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKA---------PYEKVSLVDEDEIWNG 148 (387) Q Consensus 78 ~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~---------~~~~~~~~~~~~~~~~ 148 (387) .+++...+ ...-+.++++-+.....++...+.++..++++.++|..++.--... .............|++ T Consensus 100 ~v~~~~~k-~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 178 (324) T protein:vir:96 100 NATMRAFK-LGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIKKTNKVIKGDFTQDN 178 (324) T ss_pred EEEEEeEE-EEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCCcCccccccccccceecccccchHH Confidence 77777744 3455777776655566788888889999999999999877321100 0011111223456899 Q ss_pred HHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeee--ec Q lcl|NC_021299. 149 VVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYL--SH 226 (387) Q Consensus 149 i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~--~~ 226 (387) ++++...|...+.. ...++++|..+..|.+... .. +...+..+..+.+.|+.++.+...+...... .. T Consensus 179 i~~~~~~i~~~~~~--~~~~i~n~~~~~~L~~lkd-----~~---G~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~gd 248 (324) T protein:vir:96 179 IIDLEALLEDDELE--ANAFISKTQNRSLLRKIVD-----PE---TKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGD 248 (324) T ss_pred HHHHHHhhhhccCC--CCEEEEcHHHHHHHHHhhC-----CC---CCeeecCCCCCcccceeeEeecCCCCCcceEEEEe Confidence 99988888776543 3457899998888764311 11 1222334445667888876654433222111 00 Q ss_pred cccccccccccccccCceeeeeeeccc-ccceee-eeeeeeeccceeeeeeeeeeeeccccceeeeccceecccccccee Q lcl|NC_021299. 227 PTAYAMLTRSPGRPMTNTVATSTVATE-NGVQLR-WLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRI 304 (387) Q Consensus 227 ~~a~~~~~~~~~~~~~~t~~~~~~~~~-~~~~~~-~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v 304 (387) ...+.+. ...+........... ...... ...++ ..............+. .......++ .+ T Consensus 249 ~s~~~~~-----~~~~~~i~~~~~~~~~~~~~~~~~~~~~-~~~n~v~~r~~~r~d~---------~v~~~~a~~---~l 310 (324) T protein:vir:96 249 FDKLIYG-----IPQLIEYKIDETAQLSTVKNEDGTPVNL-FEQDMVALRATMHVAL---------HIADDKAFA---KL 310 (324) T ss_pred cceEEEE-----EecCcEEEEeecccccccccccccchhh-hhcCcEEEEEEEEecc---------EEecccceE---EE Confidence 0000000 000000000000000 000000 00000 0000000000000000 000000000 00 Q ss_pred eeeeeeccccccccccccceeEEEeeccCCccccCcce Q lcl|NC_021299. 305 HLKATDAEIEGETVKAGEKLALALEDSNGDNRAGDPLV 342 (387) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 342 (387) +.....-+. ..|+ + T Consensus 311 ~~a~~~~~~-----~~~~-------------------~ 324 (324) T protein:vir:96 311 VPADKRTDS-----VPGE-------------------V 324 (324) T ss_pred ecccccCCC-----CCCC-------------------C Confidence 000000000 0000 0 No 74 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=98.34 E-value=4.1e-08 Score=61.09 Aligned_cols=266 Identities=10% Similarity=0.078 Sum_probs=127.8 Q ss_pred Ccccc------------cc-------HHHHHHHHHHHHHhhc---cccceeeecccccccccCCCEEEEEecccceeece Q lcl|NC_021299. 1 MANAF------------IK-------PPVIIASILGQLQHEL---VLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTR 58 (387) Q Consensus 1 Ma~~~------------~~-------pe~~~~~~~~~l~~~~---~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~ 58 (387) |-..- |. -+.+++ -+..|.+.| ++.++ ..|++|++. |.+....+. T Consensus 1 ~~~~~~~~e~nlt~~~dl~~~~siDf~~~f~~-~i~~L~~~LGv~r~~pl-----------a~GstIkt~-k~~~y~gda 67 (296) T protein:vir:98 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQE-NISKLLEMLGVTRKISV-----------SEGMTLKTY-AGYDVTLAE 67 (296) T ss_pred CCCccccCcCCCcchhhhhhhhhhhhHHHHhh-hHHHHHHHhhhcccccc-----------cCCCEEeec-cceeeeecc Confidence 21110 10 122332 223333333 22222 248999653 334444433 Q ss_pred ecccccccccccccccccc---eEEEEEEeeeecceeeccHHH-hhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccc Q lcl|NC_021299. 59 GLRATGADRNMVASDLTEV---TVDIKLTDVIYNRIDLTDEER-ELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPY 134 (387) Q Consensus 59 ~~~~~~~~~~~~~~~~~~~---~~~~~id~~~~~~~~~~d~~~-~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~ 134 (387) .. .+++..++.+.++.. ..++++.|+. +. ++||.. ..++.+...+.-+|..++|+++||++++..++.+.. T Consensus 68 ~d--VaEGe~Iplskvt~~~~~t~t~~ikK~r-K~--tTdEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~LktaT~ 142 (296) T protein:vir:98 68 GN--VPEGEVIPLSKVERKIHSEKKIELKKYR-KA--TTGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTG 142 (296) T ss_pred cc--ccCCcccchhhheeeecceEEEEeeccc-cc--cCHHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHHhcccc Confidence 32 346667777888765 4677775533 33 489985 789999999999999999999999999988876644 Q ss_pred cccccCC--cchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceee Q lcl|NC_021299. 135 EKVSLVD--EDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVV 212 (387) Q Consensus 135 ~~~~~~~--~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~ 212 (387) .....+. .......+.++...|++.+ ....+++++|...+.++++..+......|- -++-++.|..++ T Consensus 143 t~~~t~~~lQ~Ala~~~~~l~~~feded--~~~~V~FVnP~D~a~ylg~a~it~qt~fG~--------tyl~nfLG~~II 212 (296) T protein:vir:98 143 TQDALGAGLQGALASAWGKLQVLFEDYG--SERAIVFANSLDVAEYIAKAGITTQTAFGL--------TYLVDFTGTVII 212 (296) T ss_pred eeeechhhHHHHHHHHhhhhhhhccccC--CCceEEEEehHHHHHHhcCCccchhheech--------hhhhhccccEEE Confidence 3221111 0111234445556666653 245788999999999999876532111111 112247788899 Q ss_pred eeeccceeeeeeeccccccccccccc-cccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeec Q lcl|NC_021299. 213 TVDTLPHGDAYLSHPTAYAMLTRSPG-RPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTAN 291 (387) Q Consensus 213 ~s~~~~~~~~~~~~~~a~~~~~~~~~-~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~ 291 (387) .+..+|.+..+......+.+...... -..+..+.... ...|... ..-+.+.. ...+......+ T Consensus 213 ~S~kV~~G~~~~T~~~Ni~~ay~~~~~~~l~~~f~~~~--d~tglIG-v~h~~~~~-------------~~t~eT~~~~~ 276 (296) T protein:vir:98 213 STNDVTKGEIWATVPENIIFAYINPNNSELAKEFNLYG--DPTGYIG-MNHFQENT-------------TLTIQTLLVSG 276 (296) T ss_pred EcCcCCCceEEEeeecceEEEeecccccchhhhhcccc--ccccceE-EEeccccc-------------eeeehhHhHhH Confidence 99999988777665555444433211 00111111100 0001100 00000000 00000000000 Q ss_pred cceecccccc-ceeeeeeeecccccccccccc Q lcl|NC_021299. 292 LDDEPRFVRG-TRIHLKATDAEIEGETVKAGE 322 (387) Q Consensus 292 ~~~~~~~v~~-~~v~~~~~~~~~~~~~~~~~~ 322 (387) ..-.+..... +..++.+ +. T Consensus 277 ~~lfpE~~dgiv~~tI~~------------~~ 296 (296) T protein:vir:98 277 MLMYPERIDGIVKVTLTP------------GV 296 (296) T ss_pred HHhcccccceEEEEEecC------------CC Confidence 0000000000 0000000 00 No 75 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=98.33 E-value=3.4e-07 Score=56.01 Aligned_cols=278 Identities=10% Similarity=0.042 Sum_probs=126.8 Q ss_pred Cccc---cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccccc Q lcl|NC_021299. 1 MANA---FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEV 77 (387) Q Consensus 1 Ma~~---~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (387) |+.+ .++|+.|..++++.+++..++..++..- . -.+.+++||+....... . -.+++..+...+++-. T Consensus 30 ~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~-----~-~~~~~~~ip~~~~~~~a--~--~v~Eg~~~~~~~~~f~ 99 (324) T protein:vir:93 30 MMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYE-----P-MEGTEKKFTFWADKPGA--Y--WVGEGQKIETSKATWV 99 (324) T ss_pred cccCCCcceechhHHHHHHHHHHhhchhhhhccee-----e-ccCCceEEEEEecCcce--e--eecCCcccccccccee Confidence 2212 2779999999999999999988876422 1 12556778764322111 1 1234555666667777 Q ss_pred eEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccc---------cccccCCcchhHHH Q lcl|NC_021299. 78 TVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPY---------EKVSLVDEDEIWNG 148 (387) Q Consensus 78 ~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~---------~~~~~~~~~~~~~~ 148 (387) .++++..+ .+.-+.++++-+.....++...+.++..+++++++|+.++.--..... ...........|++ T Consensus 100 ~i~~~~~k-~~~~~~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 178 (324) T protein:vir:93 100 NATMRAFK-LGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDN 178 (324) T ss_pred EEEEEeEE-EEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCccccccccccceeccccccHHH Confidence 77777744 345567777665556678888888888999999999988732110000 00111123356899 Q ss_pred HHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeee--c Q lcl|NC_021299. 149 VVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLS--H 226 (387) Q Consensus 149 i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~--~ 226 (387) +.++...|...+. ....++++|..+..|.+... .. +...+..+..+.+.|+.++.+...+......+ . T Consensus 179 i~~~~~~l~~~~~--~~~~~v~n~~~~~~L~~l~d-----~~---G~~~~~~~~~~~l~G~PVv~~~~~~~~~~~i~~gd 248 (324) T protein:vir:93 179 IIDLEALLEDDEL--EANAFISKTQNRSLLRKIVD-----PE---TKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGD 248 (324) T ss_pred HHHHHHhhhhccC--CCCEEEEcHHHHHHHHHhhC-----CC---CCeeecCCCCCcccceeeEeecCCCCCcceEEEEe Confidence 9999888887654 33468899999888764311 11 12223344456677887766544332211110 0 Q ss_pred cccccccccccccccCceeeeeeeccc-cc--ceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccce Q lcl|NC_021299. 227 PTAYAMLTRSPGRPMTNTVATSTVATE-NG--VQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTR 303 (387) Q Consensus 227 ~~a~~~~~~~~~~~~~~t~~~~~~~~~-~~--~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~ 303 (387) ...+.+. ...+........... .. ..-.....+. ............+. .......++. T Consensus 249 fs~~~~~-----~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~--~n~~~~r~~~r~d~---------~v~~~~a~~~--- 309 (324) T protein:vir:93 249 FDKLIYG-----IPQLIEYKIDETAQLSTVKNEDGTPVNLFE--QDMVALRATMHVAL---------HIADDKAFAK--- 309 (324) T ss_pred cceEEEE-----EecCcEEEEeecccccccccccccchhhhh--cCcEEEEEEEEecc---------EEecccceEE--- Confidence 0000000 000000000000000 00 0000000000 00000000000000 0000000000 Q ss_pred eeeeeeecccccccc Q lcl|NC_021299. 304 IHLKATDAEIEGETV 318 (387) Q Consensus 304 v~~~~~~~~~~~~~~ 318 (387) ++.....-+.++..+ T Consensus 310 l~~a~~~~~~~~~~~ 324 (324) T protein:vir:93 310 LVPADKRTDSVPGEV 324 (324) T ss_pred EecccccCCCCCCCC Confidence 000000000000001 No 76 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=98.29 E-value=5.1e-07 Score=55.09 Aligned_cols=272 Identities=12% Similarity=0.051 Sum_probs=127.6 Q ss_pred Ccc--------------ccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccc Q lcl|NC_021299. 1 MAN--------------AFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGAD 66 (387) Q Consensus 1 Ma~--------------~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~ 66 (387) ||. ..++|+.+..++++.+++..++..++.+-. . .+..++||+........ -.+++ T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~---~---~~~~~~ip~~~~~~~a~----~v~E~ 70 (304) T protein:vir:94 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEP---M---TAQKKKFTYLAKGVGAY----WVSET 70 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhcceee---c---cCCceEEEEEeCCcceE----EeecC Confidence 442 236799999999999999999888764321 1 24556777653221111 12344 Q ss_pred ccccccccccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHh--------------cc Q lcl|NC_021299. 67 RNMVASDLTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLIT--------------KA 132 (387) Q Consensus 67 ~~~~~~~~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~--------------~~ 132 (387) ...+..+++-..+++...+. +.-+.++.+-+.....++...+.++..++++.++|..++.--. .+ T Consensus 71 ~~~~~~~~~~~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~ 149 (304) T protein:vir:94 71 ERIQTSKPEYAQAEMEAKKI-GVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGA 149 (304) T ss_pred cccccccceeeEEEEEEEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccccccccc Confidence 45555566666677766443 3445666665555667788888888899999999998873210 00 Q ss_pred cccccccCCcchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceee Q lcl|NC_021299. 133 PYEKVSLVDEDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVV 212 (387) Q Consensus 133 ~~~~~~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~ 212 (387) .............|++++++...|...+.. ....+++|..+..|.+... ..| ..+-....+++.|+.++ T Consensus 150 ~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~--~~~~v~~~~~~~~L~~lkd-----~~G----~~l~~~~~~~l~G~PV~ 218 (304) T protein:vir:94 150 EEKGNVVTDTNNLYVDLSALMATIEDEELD--PNGVLTTRSFRSKMRNALD-----AND----RPLFDANGNEIMGLPLS 218 (304) T ss_pred cccccccccccchHHHHHHHHHHhhhccCC--cCEEEEcHHHHHHHHHhhc-----cCC----cEeecCCCccccceeeE Confidence 111111123345689999998888776643 3457899999988865211 111 12333445678899999 Q ss_pred eeeccceeeee-eeccccccccccccccccCceeeeeeecccccceeeeeeeeeeccce-eeeeeeeeeeeccccceeee Q lcl|NC_021299. 213 TVDTLPHGDAY-LSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTT-ERSIVDTWIGVKAVLDPVTA 290 (387) Q Consensus 213 ~s~~~~~~~~~-~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~-~~~~~~~~~g~~~~~~~~~~ 290 (387) .++.+|....- .+-...+. ........+....... ........+.+..+.. .....+. ......... T Consensus 219 ~~~~~~~~~~~~~~~~gd~~--~~~~~~~~~~~i~~~~-----e~~~~~~~~~~~~g~~~~~f~~~~----~~~r~~~r~ 287 (304) T protein:vir:94 219 YTGADVYDKKKSLALMGDWD--YARYGILQGIEYAISE-----DATLTTLQASDASGQPVSLFERDM----FALRATMHI 287 (304) T ss_pred EecccccCCCCcEEEEEehh--hEEEEEecceEEEEee-----cceeeeecccccCccchhhhhcCc----EEEEEEEEe Confidence 88887643210 00000000 0000000000000000 0000000000000000 0000000 000000000 Q ss_pred --ccceeccccccceeeeeeee Q lcl|NC_021299. 291 --NLDDEPRFVRGTRIHLKATD 310 (387) Q Consensus 291 --~~~~~~~~v~~~~v~~~~~~ 310 (387) .......++ .+.... T Consensus 288 ~~~v~~~~a~~-----~l~~a~ 304 (304) T protein:vir:94 288 AYMNVKPEAFA-----TLKPTE 304 (304) T ss_pred ccEeecccceE-----EEEecC Confidence 000000000 000000 No 77 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=98.29 E-value=5.1e-07 Score=55.09 Aligned_cols=272 Identities=12% Similarity=0.051 Sum_probs=127.6 Q ss_pred Ccc--------------ccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccc Q lcl|NC_021299. 1 MAN--------------AFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGAD 66 (387) Q Consensus 1 Ma~--------------~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~ 66 (387) ||. ..++|+.+..++++.+++..++..++.+-. . .+..++||+........ -.+++ T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~---~---~~~~~~ip~~~~~~~a~----~v~E~ 70 (304) T protein:vir:10 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEP---M---TAQKKKFTYLAKGVGAY----WVSET 70 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhcceee---c---cCCceEEEEEeCCcceE----EeecC Confidence 442 236799999999999999999888764321 1 24556777653221111 12344 Q ss_pred ccccccccccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHh--------------cc Q lcl|NC_021299. 67 RNMVASDLTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLIT--------------KA 132 (387) Q Consensus 67 ~~~~~~~~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~--------------~~ 132 (387) ...+..+++-..+++...+. +.-+.++.+-+.....++...+.++..++++.++|..++.--. .+ T Consensus 71 ~~~~~~~~~~~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~ 149 (304) T protein:vir:10 71 ERIQTSKPEYAQAEMEAKKI-GVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGA 149 (304) T ss_pred cccccccceeeEEEEEEEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccccccccc Confidence 45555566666677766443 3445666665555667788888888899999999998873210 00 Q ss_pred cccccccCCcchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceee Q lcl|NC_021299. 133 PYEKVSLVDEDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVV 212 (387) Q Consensus 133 ~~~~~~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~ 212 (387) .............|++++++...|...+.. ....+++|..+..|.+... ..| ..+-....+++.|+.++ T Consensus 150 ~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~--~~~~v~~~~~~~~L~~lkd-----~~G----~~l~~~~~~~l~G~PV~ 218 (304) T protein:vir:10 150 EEKGNVVTDTNNLYVDLSALMATIEDEELD--PNGVLTTRSFRSKMRNALD-----AND----RPLFDANGNEIMGLPLS 218 (304) T ss_pred cccccccccccchHHHHHHHHHHhhhccCC--cCEEEEcHHHHHHHHHhhc-----cCC----cEeecCCCccccceeeE Confidence 111111123345689999998888776643 3457899999988865211 111 12333445678899999 Q ss_pred eeeccceeeee-eeccccccccccccccccCceeeeeeecccccceeeeeeeeeeccce-eeeeeeeeeeeccccceeee Q lcl|NC_021299. 213 TVDTLPHGDAY-LSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTT-ERSIVDTWIGVKAVLDPVTA 290 (387) Q Consensus 213 ~s~~~~~~~~~-~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~-~~~~~~~~~g~~~~~~~~~~ 290 (387) .++.+|....- .+-...+. ........+....... ........+.+..+.. .....+. ......... T Consensus 219 ~~~~~~~~~~~~~~~~gd~~--~~~~~~~~~~~i~~~~-----e~~~~~~~~~~~~g~~~~~f~~~~----~~~r~~~r~ 287 (304) T protein:vir:10 219 YTGADVYDKKKSLALMGDWD--YARYGILQGIEYAISE-----DATLTTLQASDASGQPVSLFERDM----FALRATMHI 287 (304) T ss_pred EecccccCCCCcEEEEEehh--hEEEEEecceEEEEee-----cceeeeecccccCccchhhhhcCc----EEEEEEEEe Confidence 88887643210 00000000 0000000000000000 0000000000000000 0000000 000000000 Q ss_pred --ccceeccccccceeeeeeee Q lcl|NC_021299. 291 --NLDDEPRFVRGTRIHLKATD 310 (387) Q Consensus 291 --~~~~~~~~v~~~~v~~~~~~ 310 (387) .......++ .+.... T Consensus 288 ~~~v~~~~a~~-----~l~~a~ 304 (304) T protein:vir:10 288 AYMNVKPEAFA-----TLKPTE 304 (304) T ss_pred ccEeecccceE-----EEEecC Confidence 000000000 000000 No 78 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=98.29 E-value=4.8e-07 Score=55.20 Aligned_cols=276 Identities=10% Similarity=0.036 Sum_probs=127.0 Q ss_pred Cc--cccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccccce Q lcl|NC_021299. 1 MA--NAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEVT 78 (387) Q Consensus 1 Ma--~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (387) ++ ...++|+.|..++++.+++..++..++.+- . -.+.+++||+....... . -.+++..++..+++-.. T Consensus 31 ~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~-----~-~~~~~~~ip~~~~~~~a--~--~v~Eg~~~~~~~~~f~~ 100 (324) T protein:vir:97 31 MHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYE-----P-MEGTEKKFTFWADKPGA--Y--WVGEGQKIETSKATWVN 100 (324) T ss_pred ccCCCcceechhHHHHHHHHHHhhcchhhhccee-----e-ccCCceEEEEEecCcce--e--EeccCccccccccceeE Confidence 22 223789999999999999999988876432 1 13566788765322111 1 12345556666677777 Q ss_pred EEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcc---------cccccccCCcchhHHHH Q lcl|NC_021299. 79 VDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKA---------PYEKVSLVDEDEIWNGV 149 (387) Q Consensus 79 ~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~---------~~~~~~~~~~~~~~~~i 149 (387) +++...+ ...-+.++++-+.....++...+.++..++++.++|+.++.--... .............|+++ T Consensus 101 v~~~~~k-~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~~~~~~~~~~i 179 (324) T protein:vir:97 101 ATMRAFK-LGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNI 179 (324) T ss_pred EEEeeEE-EEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccCccccccccccceeccccCCHHHH Confidence 7776633 3455667765555556778888888889999999999887321110 00011112244568999 Q ss_pred HHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeee--cc Q lcl|NC_021299. 150 VSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLS--HP 227 (387) Q Consensus 150 ~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~--~~ 227 (387) +++...|...+.. ....+++|..+..|.+... ..| ...+..+..+.+.|+.++.+...+......+ .. T Consensus 180 ~~~~~~l~~~~~~--~~~~v~n~~~~~~L~~lkd-----~~g---~~~~~~~~~~tl~G~PV~~~~~~~~~~~~~~~gd~ 249 (324) T protein:vir:97 180 IDLEALLEDDELE--ANAFISKTQNRSLLRKIVD-----PET---KERIYDRNSDTLDGLPVVNLKSSNLKRGELITGDF 249 (324) T ss_pred HHHHHhhhhccCC--CCEEEEcHHHHHHHHHhhc-----CCC---ceeecCCCCccccceeeEeecCCCCCcceEEEEec Confidence 9999888876643 3467899999887764211 111 1222333445678888776654433221111 00 Q ss_pred ccccccccccccccCceeeeeeecccccceeeeeeeeeeccc--eeeeeeeee-eeeccccceeee--ccceeccccccc Q lcl|NC_021299. 228 TAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATST--TERSIVDTW-IGVKAVLDPVTA--NLDDEPRFVRGT 302 (387) Q Consensus 228 ~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~--~~~~~~~~~-~g~~~~~~~~~~--~~~~~~~~v~~~ 302 (387) ..+.+. ...+.. +....+...... .+....+.. .....+...... .......++ T Consensus 250 ~~~~i~-----~~~~~~-------------i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~a~~--- 308 (324) T protein:vir:97 250 DKLIYG-----IPQLIE-------------YKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFA--- 308 (324) T ss_pred ccEEEE-----EecCcE-------------EEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceE--- Confidence 000000 000100 000000000000 000000000 000000000000 000000000 Q ss_pred eeeeeeeeccccccccccccceeEEEeeccCCccccCcce Q lcl|NC_021299. 303 RIHLKATDAEIEGETVKAGEKLALALEDSNGDNRAGDPLV 342 (387) Q Consensus 303 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 342 (387) .+......-...+ .+| T Consensus 309 ~l~~~~~~~~~~~------------------------~~~ 324 (324) T protein:vir:97 309 KLVPADKKTDSVP------------------------GEV 324 (324) T ss_pred EEEeccCCCCCCC------------------------CCC Confidence 0000000000000 000 No 79 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=98.27 E-value=7.9e-07 Score=54.04 Aligned_cols=309 Identities=9% Similarity=0.013 Sum_probs=142.5 Q ss_pred Ccccc----ccHH--HHHHHHHHHHHhhccc--cceeeeccccccc---ccCCCEEEEEecccceee-ceeccccccccc Q lcl|NC_021299. 1 MANAF----IKPP--VIIASILGQLQHELVL--PNFVFKNGYGDVA---HKFNDTITIRIPVPTIAH-TRGLRATGADRN 68 (387) Q Consensus 1 Ma~~~----~~pe--~~~~~~~~~l~~~~~~--~~~~~~d~~~~~~---~~~gdtv~i~~~~~~~~~-~~~~~~~~~~~~ 68 (387) ||-+- ++|| +|.+.+.++-.+...| ..++-+| .++. ...|+.+++|.++..... +..+...+.... T Consensus 1 Ma~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d--~~l~~~~~~gG~~~~iPf~~~L~g~~e~nv~~D~~~~~ 78 (349) T protein:vir:78 1 MAITTIGDIVTGNIPVLASYMTEDPVEKTAFFDSGILTST--PYAAEIANGPSNIANLPFWKAIDTSIEPNYSNDVYQDI 78 (349) T ss_pred CCceEEeeeeccCHHHHHHHHHHhhHHhhhhhhccceecc--HHHHHHhhcCCCEEEeeeeecCCCCcccccCCCCcccc Confidence 99654 6787 7999988877666655 3344444 2333 257999999988765432 211111122234 Q ss_pred ccccccccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc------------ Q lcl|NC_021299. 69 MVASDLTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEK------------ 136 (387) Q Consensus 69 ~~~~~~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~------------ 136 (387) +.+..++..+..-.+ ....++|..+|....+.-.|++..+.+|-..--.+.-.+.+++.+++.-... T Consensus 79 ~t~~kitt~~~~a~~-~~r~kaw~~~Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~a~~~~~~~~~ 157 (349) T protein:vir:78 79 ATPRAIQTGEMMARV-AYLNEGFGQADLTVELTSQNPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSATDAYHEQND 157 (349) T ss_pred cccccccccceeeee-eeeccccchhHHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcccccccchhhhccc Confidence 455566666554433 4556777778776666667888888777766666666666666665432111 Q ss_pred ---cccCCcchhHHHHHHHHHHHhhccC---CcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecce Q lcl|NC_021299. 137 ---VSLVDEDEIWNGVVSNRRWLNEQKV---PKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYD 210 (387) Q Consensus 137 ---~~~~~~~~~~~~i~~a~~~l~~~~v---p~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~ 210 (387) ..........+.++++...|.+.-. ...-..+++.+..+..|.+...+..... .-+...++.+.|.. T Consensus 158 ~t~d~s~~a~~~~~~~~dA~~~lgda~~Gd~~~~lt~i~mHS~v~~~L~~~~li~~i~~-------s~~~~~i~ty~G~~ 230 (349) T protein:vir:78 158 MVVDVSATLGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLIDFIRD-------AENNTMFATYQGYR 230 (349) T ss_pred ceeeeccccCCChhhhhhhHHHHHHHhccccccceeEEEEchHHHHHHHhhhhhhhccC-------cccCcccceecCeE Confidence 0112223456788888888877621 1222567799999999887544322111 11344578888999 Q ss_pred eeeeeccceeee--------eeeccccccccccccccccCceeeeeeecc--cccceeeeeeeeeeccceeeeeeeeeee Q lcl|NC_021299. 211 VVTVDTLPHGDA--------YLSHPTAYAMLTRSPGRPMTNTVATSTVAT--ENGVQLRWLGDYDATSTTERSIVDTWIG 280 (387) Q Consensus 211 v~~s~~~~~~~~--------~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~--~~~~~~~~~~~~d~~~~~~~~~~~~~~g 280 (387) |+.+..+|.... +.+..+++.+..+.+..+. ........ ..+...-+.+. .... ...| T Consensus 231 VivDD~~Pv~~~g~~~~yttylfg~GAi~~~~~~~~~~~---et~rd~~~g~~~G~d~l~~R~--------~~~~-hp~G 298 (349) T protein:vir:78 231 VIVDDSMTVVGQGAQRKFISIIFGQGAIGYGEGNPVMPL---EYEREASRANGGGVETLWTRK--------TWLL-HPFG 298 (349) T ss_pred EEEeCCCccccCCCCceEEEEEeecceEEEccCCCccce---eeecccccCCcceeEEEEEee--------EEEe-eeee Confidence 999999986532 2333344333322211100 00000000 01111111110 0001 1111 Q ss_pred eccccceeeeccceeccccccceeeeeeeeccccccccccccceeEEEeeccCCccccCcceEEEecCce Q lcl|NC_021299. 281 VKAVLDPVTANLDDEPRFVRGTRIHLKATDAEIEGETVKAGEKLALALEDSNGDNRAGDPLVTWTSGTTA 350 (387) Q Consensus 281 ~~~~~~~~~~~~~~~~~~v~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~w~Ssn~~ 350 (387) ............... ...+++-..|..+..+.....+.. -+=+.+.+...+ T Consensus 299 ~s~~~a~v~~~~~~~-------------~~~sPt~aeLa~~~NW~~v~~~K~------I~iv~~~~~~~a 349 (349) T protein:vir:78 299 YRFTSAVITGNGTET-------------IARSASWQDLANATNWNRVVDRKH------VPIAFLVTGVGA 349 (349) T ss_pred eeeccccccCCcccc-------------ccCCCChHHhcCCcCcccccChhh------cceEEEEeccCC Confidence 111111000000000 000000000111111111000000 000111111111 No 80 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=98.25 E-value=6.6e-07 Score=54.47 Aligned_cols=274 Identities=10% Similarity=0.048 Sum_probs=125.6 Q ss_pred Cccc---cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccccc Q lcl|NC_021299. 1 MANA---FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEV 77 (387) Q Consensus 1 Ma~~---~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (387) |+-+ .+.|+.|..++++.+++...+..++..- . -.+.+++||+....... . -.+++..++..+++-. T Consensus 30 ~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~~~-----~-~~~~~~~~p~~~~~~~a--~--~v~Eg~~~~~~~~~~~ 99 (324) T protein:vir:10 30 MMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYE-----P-MEGTEKKFTFWADKPGA--Y--WVGEGQKIETSKATWV 99 (324) T ss_pred eccCCCcceechhHHHHHHHHHHhhchhhhhccee-----e-ccCCceEEEEEeCCcce--e--EeccCcccccccccee Confidence 3322 3779999999999999999988876432 1 12456788765322111 1 1234555556666666 Q ss_pred eEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccc---------ccccccCCcchhHHH Q lcl|NC_021299. 78 TVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAP---------YEKVSLVDEDEIWNG 148 (387) Q Consensus 78 ~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~---------~~~~~~~~~~~~~~~ 148 (387) .+++...+ ...-+.++.+-+.....++...+.++..++++.++|..++..-.... ............|++ T Consensus 100 ~v~~~~~k-~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~a~l~G~g~~~~~~~i~~~~~~~~~~~~~~~t~~~ 178 (324) T protein:vir:10 100 NATMRAFK-LGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDN 178 (324) T ss_pred EEEEeeEE-EEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCccCccccccccccceeccccCCHHH Confidence 77776633 34556676665555567788888888999999999998873211110 001111223456899 Q ss_pred HHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeee--c Q lcl|NC_021299. 149 VVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLS--H 226 (387) Q Consensus 149 i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~--~ 226 (387) +.++...|...+.. .-.++++|..+..|.+... .. +...+..+..+.+.|+.++.+...+......+ . T Consensus 179 i~~~~~~l~~~~~~--~~~~v~n~~~~~~L~~l~d-----~~---g~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~gd 248 (324) T protein:vir:10 179 IIDLEALLEDDELE--ANAFISKTQNRSLLRKIVD-----PE---TKERIYDRNSDTLDGLPVVNLKSSNLKRGELITGD 248 (324) T ss_pred HHHHHHhhhhccCC--CCEEEEcHHHHHHHHHhhc-----cC---CceeecCCCCccccceeEEeecCCCCCcceEEEEe Confidence 99998888776543 2357899999888764211 11 11223334445678888776554433221111 0 Q ss_pred cccccccccccccccCceeeeeeeccc------ccceee-eeeeeeeccceeeeeeeeeeeeccccceeeeccceecccc Q lcl|NC_021299. 227 PTAYAMLTRSPGRPMTNTVATSTVATE------NGVQLR-WLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFV 299 (387) Q Consensus 227 ~~a~~~~~~~~~~~~~~t~~~~~~~~~------~~~~~~-~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v 299 (387) ...+.+. ...+..+........ .+..+. +.. ...........+. .......++ T Consensus 249 ~~~~~~~-----~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~r~~~r~d~---------~v~~~~A~~ 308 (324) T protein:vir:10 249 FDKLIYG-----IPQLIEYKIDETAQLSTVKNEDGTPVNLFEQ------DMVALRATMHVAL---------HIADDKAFA 308 (324) T ss_pred cccEEEE-----EecCcEEEEeecccccccccccccchhhhhc------CcEEEEEEEEEcc---------EEecccceE Confidence 0000000 000111000000000 000000 000 0000000000000 000000000 Q ss_pred ccceeeeeeeeccccccccccccceeEEEeeccCCccccCcce Q lcl|NC_021299. 300 RGTRIHLKATDAEIEGETVKAGEKLALALEDSNGDNRAGDPLV 342 (387) Q Consensus 300 ~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 342 (387) . ++.....-..++. +| T Consensus 309 ~---l~~a~~~~~~~~~------------------------~~ 324 (324) T protein:vir:10 309 K---LVPADKKTDSVPG------------------------EV 324 (324) T ss_pred E---EEeccCCCCCCCC------------------------CC Confidence 0 0000000000000 00 No 81 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=98.24 E-value=7.5e-07 Score=54.16 Aligned_cols=274 Identities=11% Similarity=0.048 Sum_probs=127.3 Q ss_pred Cccc---cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccccc Q lcl|NC_021299. 1 MANA---FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEV 77 (387) Q Consensus 1 Ma~~---~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (387) |... .++|+-|..++++.+++..++..++.+- . -.|.+++||+....... . -.+++..++..+++-. T Consensus 30 ~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~-----~-~~~~~~~~p~~~~~~~a--~--~v~Eg~~~~~~~~~~~ 99 (324) T protein:vir:78 30 MMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYE-----P-MEGTEKKFTFWADKPGA--Y--WVGEGQKIETSKATWV 99 (324) T ss_pred cccCcCccccchhHHHHHHHHHHhhchhhhhccee-----e-ccCCceEEEEEecCcce--e--EecCCcccccccccee Confidence 3222 3789999999999999999998887532 1 23566778765322211 1 1234555666666667 Q ss_pred eEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcc---------cccccccCCcchhHHH Q lcl|NC_021299. 78 TVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKA---------PYEKVSLVDEDEIWNG 148 (387) Q Consensus 78 ~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~---------~~~~~~~~~~~~~~~~ 148 (387) .+++...+ ...-+.++++-+.....++...+.++..++++.++|..++.--... .............|++ T Consensus 100 ~v~~~~~k-~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~~~~~~t~~~ 178 (324) T protein:vir:78 100 NATMRAFK-LGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDN 178 (324) T ss_pred EEEEeeEE-EEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcCccccccccccceeccccccHHH Confidence 77776643 3455666666555556788888888899999999999887321100 0001111223456899 Q ss_pred HHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeee--c Q lcl|NC_021299. 149 VVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLS--H 226 (387) Q Consensus 149 i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~--~ 226 (387) +.++...|...+.. ...++++|..+..|.+... .. +...+..+..+.+.|+.++.+...+......+ . T Consensus 179 i~~~~~~l~~~~~~--~~~~vmn~~~~~~L~~l~d-----~~---G~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~gd 248 (324) T protein:vir:78 179 IIDLEALLEDDELE--ANAFISKTQNRSLLRKIVD-----PE---TKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGD 248 (324) T ss_pred HHHHHHhhhhccCC--CCEEEEcHHHHHHHHHhhc-----cC---CCeeecCCCCCcccceeeEeeCCCCCCcceEEEEe Confidence 99998888776643 3467899999888764321 11 12223344556678888776544332211110 0 Q ss_pred cccccccccccccccCceeeeeeecc------ccccee-eeeeeeeeccceeeeeeeeeeeeccccceeeeccceecccc Q lcl|NC_021299. 227 PTAYAMLTRSPGRPMTNTVATSTVAT------ENGVQL-RWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFV 299 (387) Q Consensus 227 ~~a~~~~~~~~~~~~~~t~~~~~~~~------~~~~~~-~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v 299 (387) ...+.+. ...+.......... ..+..+ .+.. ...........+ ........++ T Consensus 249 ~~~~~~g-----~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~------d~~~~r~~~r~d---------~~v~~~~A~~ 308 (324) T protein:vir:78 249 FDKLIYG-----IPQLIEYKIDETAQLSTVKNEDGTPVNLFEQ------DMVALRATMHVA---------LHIADDKAFA 308 (324) T ss_pred cceEEEE-----EecCcEEEEeecccccccccccccchhhhhc------CcEEEEEEEEEc---------cEEecccceE Confidence 0000000 00000000000000 000000 0000 000000000000 0000000000 Q ss_pred ccceeeeeeeeccccccccccccceeEEEeeccCCccccCcce Q lcl|NC_021299. 300 RGTRIHLKATDAEIEGETVKAGEKLALALEDSNGDNRAGDPLV 342 (387) Q Consensus 300 ~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 342 (387) ..... ...-+.++ |+ + T Consensus 309 ~l~~a---~~~~~~~~-----~~-------------------~ 324 (324) T protein:vir:78 309 KLVPA---DKRTDSVP-----GE-------------------V 324 (324) T ss_pred EEecc---cccCCCCC-----CC-------------------C Confidence 00000 00000000 00 0 No 82 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=98.24 E-value=7.5e-07 Score=54.16 Aligned_cols=274 Identities=11% Similarity=0.048 Sum_probs=127.3 Q ss_pred Cccc---cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccccc Q lcl|NC_021299. 1 MANA---FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEV 77 (387) Q Consensus 1 Ma~~---~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (387) |... .++|+-|..++++.+++..++..++.+- . -.|.+++||+....... . -.+++..++..+++-. T Consensus 30 ~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~-----~-~~~~~~~~p~~~~~~~a--~--~v~Eg~~~~~~~~~~~ 99 (324) T protein:vir:96 30 MMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYE-----P-MEGTEKKFTFWADKPGA--Y--WVGEGQKIETSKATWV 99 (324) T ss_pred cccCcCccccchhHHHHHHHHHHhhchhhhhccee-----e-ccCCceEEEEEecCcce--e--EecCCcccccccccee Confidence 3222 3789999999999999999998887532 1 23566778765322211 1 1234555666666667 Q ss_pred eEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcc---------cccccccCCcchhHHH Q lcl|NC_021299. 78 TVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKA---------PYEKVSLVDEDEIWNG 148 (387) Q Consensus 78 ~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~---------~~~~~~~~~~~~~~~~ 148 (387) .+++...+ ...-+.++++-+.....++...+.++..++++.++|..++.--... .............|++ T Consensus 100 ~v~~~~~k-~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~~~~~~t~~~ 178 (324) T protein:vir:96 100 NATMRAFK-LGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDN 178 (324) T ss_pred EEEEeeEE-EEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcCccccccccccceeccccccHHH Confidence 77776643 3455666666555556788888888899999999999887321100 0001111223456899 Q ss_pred HHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeee--c Q lcl|NC_021299. 149 VVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLS--H 226 (387) Q Consensus 149 i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~--~ 226 (387) +.++...|...+.. ...++++|..+..|.+... .. +...+..+..+.+.|+.++.+...+......+ . T Consensus 179 i~~~~~~l~~~~~~--~~~~vmn~~~~~~L~~l~d-----~~---G~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~gd 248 (324) T protein:vir:96 179 IIDLEALLEDDELE--ANAFISKTQNRSLLRKIVD-----PE---TKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGD 248 (324) T ss_pred HHHHHHhhhhccCC--CCEEEEcHHHHHHHHHhhc-----cC---CCeeecCCCCCcccceeeEeeCCCCCCcceEEEEe Confidence 99998888776643 3467899999888764321 11 12223344556678888776544332211110 0 Q ss_pred cccccccccccccccCceeeeeeecc------ccccee-eeeeeeeeccceeeeeeeeeeeeccccceeeeccceecccc Q lcl|NC_021299. 227 PTAYAMLTRSPGRPMTNTVATSTVAT------ENGVQL-RWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFV 299 (387) Q Consensus 227 ~~a~~~~~~~~~~~~~~t~~~~~~~~------~~~~~~-~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v 299 (387) ...+.+. ...+.......... ..+..+ .+.. ...........+ ........++ T Consensus 249 ~~~~~~g-----~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~------d~~~~r~~~r~d---------~~v~~~~A~~ 308 (324) T protein:vir:96 249 FDKLIYG-----IPQLIEYKIDETAQLSTVKNEDGTPVNLFEQ------DMVALRATMHVA---------LHIADDKAFA 308 (324) T ss_pred cceEEEE-----EecCcEEEEeecccccccccccccchhhhhc------CcEEEEEEEEEc---------cEEecccceE Confidence 0000000 00000000000000 000000 0000 000000000000 0000000000 Q ss_pred ccceeeeeeeeccccccccccccceeEEEeeccCCccccCcce Q lcl|NC_021299. 300 RGTRIHLKATDAEIEGETVKAGEKLALALEDSNGDNRAGDPLV 342 (387) Q Consensus 300 ~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 342 (387) ..... ...-+.++ |+ + T Consensus 309 ~l~~a---~~~~~~~~-----~~-------------------~ 324 (324) T protein:vir:96 309 KLVPA---DKRTDSVP-----GE-------------------V 324 (324) T ss_pred EEecc---cccCCCCC-----CC-------------------C Confidence 00000 00000000 00 0 No 83 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=98.24 E-value=5.3e-07 Score=54.97 Aligned_cols=262 Identities=13% Similarity=0.055 Sum_probs=124.3 Q ss_pred Cc-----cccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccc Q lcl|NC_021299. 1 MA-----NAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLT 75 (387) Q Consensus 1 Ma-----~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 75 (387) |. -..++|+.|+.++++.+++...+.++++.- . . .+.++.+|........ ..-.+++......+++ T Consensus 136 ~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~--~-~---~~~~~~~~~~~~~~~~---a~~v~E~~~~~~~~~~ 206 (418) T protein:vir:10 136 VGSGVSGSNSLVVADRQAGIIAPPQRKMTIRDLLMPG--Q-T---SSSSIEYTVETGFTNN---AAAVAEGAQKPTSDLK 206 (418) T ss_pred ccCCCCCCccccchhHHHHHHHHHhhhhhHHhhccee--e-c---cCCceeEEEEecCCCc---eeeeccCccccccccc Confidence 11 123789999999999999999998887532 1 1 2455666653221111 1112344455555566 Q ss_pred cceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHH-h---------ccc-ccccccCCcch Q lcl|NC_021299. 76 EVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLI-T---------KAP-YEKVSLVDEDE 144 (387) Q Consensus 76 ~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~-~---------~~~-~~~~~~~~~~~ 144 (387) -..+.+...+.. .-+.++++ ...+..++...+.++..++++.++|..++.-- . .+. ........... T Consensus 207 f~~v~~~~~k~~-~~~~is~e-ll~ds~~l~~~i~~~l~~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~~~~ 284 (418) T protein:vir:10 207 FNLKNQPVRTIA-HLFKASRQ-ILDDAPALQSYIDGRARYGLQLTEEGQILKGDGTGANILGILPQASAFMPSITLANAT 284 (418) T ss_pred eeeEEEeeeeEE-EeehhhHH-HHHhHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccccc Confidence 666666664433 33456655 55566677666667788999999999887311 0 000 01111222334 Q ss_pred hHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeee Q lcl|NC_021299. 145 IWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYL 224 (387) Q Consensus 145 ~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~ 224 (387) .+++++++...+...+.+ .-.++++|..+..|.+... ..|.-.-.....+..+.+.|+.++.++.+|.+..+. T Consensus 285 ~~~~i~~~~~~~~~~~~~--~~~~v~n~~~~~~L~~lkd-----~~G~~i~~~~~~~~~~~l~G~pV~~~~~~p~~~~~~ 357 (418) T protein:vir:10 285 PIDKIRLALLQAVLAEFP--ATGIVLNPIDWASIELTKD-----SQGRYIVGNPVNGTTPRLWNLPVVETQAMTANEFLV 357 (418) T ss_pred cHHHHHHHHHhhccccCC--CCEEEEcHHHHHHHHHhhc-----CCCceeccccccCCCceecceeeEEcCCCCCCcEEE Confidence 578888887777665543 2357899999887764221 111110011234455788999999999998765332 Q ss_pred eccc-cccccccccccccCceeeeeeec----ccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceecccc Q lcl|NC_021299. 225 SHPT-AYAMLTRSPGRPMTNTVATSTVA----TENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFV 299 (387) Q Consensus 225 ~~~~-a~~~~~~~~~~~~~~t~~~~~~~----~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v 299 (387) .-.+ .+.+..+ .+......... ..+...+......+ ....+..+ + T Consensus 358 gd~s~~~~~~~~-----~~~~i~~~~~~~~~f~~~~~~~r~~~~~d------~~~~~~~a------------------~- 407 (418) T protein:vir:10 358 GAFSMAAQIFDR-----MEIEVLLSTENVDDFEKNMVSIRAEERLA------LAVYRPES------------------F- 407 (418) T ss_pred eeccceEEEEEe-----cceEEEEecccchhhhcCceEEEEEEeec------cEEecccc------------------e- Confidence 2111 1111100 01111100000 00000011000000 00000000 0 Q ss_pred ccceeeeeeeeccccccccccc Q lcl|NC_021299. 300 RGTRIHLKATDAEIEGETVKAG 321 (387) Q Consensus 300 ~~~~v~~~~~~~~~~~~~~~~~ 321 (387) ..+++. ....| T Consensus 408 --~~~~~~---------~~~~g 418 (418) T protein:vir:10 408 --VTGALV---------EQAGG 418 (418) T ss_pred --EEEEec---------cCCCC Confidence 000000 00000 No 84 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=98.20 E-value=6.8e-07 Score=54.39 Aligned_cols=291 Identities=11% Similarity=0.018 Sum_probs=123.7 Q ss_pred Ccccc------ccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccccc Q lcl|NC_021299. 1 MANAF------IKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDL 74 (387) Q Consensus 1 Ma~~~------~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (387) ||... +.|+.+++++++.|++..++..++.+- . -.+..++||+....... +. .+++..+...++ T Consensus 1 Ma~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i-----~-~~~~~~~ip~~~~~~~a-~w---v~Eg~~~~~s~~ 70 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQ-----P-TIFGPVKGAVFSGVPRA-KI---VGEGEVKPSASV 70 (315) T ss_pred CCCCcCCcCceEcchHHHHHHHHHHHhhchhhhhccee-----e-cCCCceEEEEEeCCcce-EE---eeCCcccccccc Confidence 99653 789999999999999999988876432 1 12445677764322211 11 234555555666 Q ss_pred ccceEEEEEEeeeecceeeccHHHhhhhhh----HHHHHHHHHHHHHHHHHHHHHHHHHh---cccc-----ccccc--- Q lcl|NC_021299. 75 TEVTVDIKLTDVIYNRIDLTDEERELDVRS----FAVDVLPRQVRAVAEQIEDAVSYLIT---KAPY-----EKVSL--- 139 (387) Q Consensus 75 ~~~~~~~~id~~~~~~~~~~d~~~~~~~~~----~~~~~~~~~~~~la~~vd~~~~~~~~---~~~~-----~~~~~--- 139 (387) +-+++++...| ...-+.++++-+.....+ +...+.++..+++++++|..++.--. +... ..... T Consensus 71 ~f~~v~l~~~k-l~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~~~~~~ 149 (315) T protein:vir:80 71 DVSAFTAQPIK-VVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSLNKTKNI 149 (315) T ss_pred ceeeeEeeeee-EEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCccccccccccccccce Confidence 66666665533 334456666644333333 33444566688899998887773211 0000 00000 Q ss_pred -CCcchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccc Q lcl|NC_021299. 140 -VDEDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLP 218 (387) Q Consensus 140 -~~~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~ 218 (387) ......|.+++++...+...+.... ...+++|.....|.+..........+.-....+..|..+++.|+.++.++.+| T Consensus 150 ~~~~~~~~~d~~~~~~~~~~~~~~~~-~~~imn~~~~~~L~~l~~~~g~~~~g~~~~~~~~~g~~~tl~G~PV~~~~~~~ 228 (315) T protein:vir:80 150 VDATDSATADLVKAVGLIAGAGLQVP-NGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVS 228 (315) T ss_pred eeccccchHHHHHHHHHHhhccCccc-eEEEEcHHHHHHHHHHhhccCCcccccccccccccCCCceecceeeEecCcCC Confidence 1123457788888777755543332 34678999988887542211111111111112234445688999999888887 Q ss_pred eeeeeeeccccccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeecc--ceec Q lcl|NC_021299. 219 HGDAYLSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANL--DDEP 296 (387) Q Consensus 219 ~~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~--~~~~ 296 (387) ......-......+ .+ .+..-......+..+....+.+..... ..... .+...+........ .... T Consensus 229 ~~~~~~~~~~~~~~-~G--------Dfs~~~~g~~~~~~i~i~~~~~~~~~~-~~~~~--~~~v~~r~~~r~~~~v~~~~ 296 (315) T protein:vir:80 229 GAPEMSPASGVKAI-VG--------DFSRVHWGFQRNFPIELIEYGDPDQTG-RDLKG--HNEVMVRAEAVLYVAIESLD 296 (315) T ss_pred cccccccccccEEE-Ee--------ecccEEEEEecCeeEEEeccccccCcc-cchhh--cCcEEEEEEEEecceeeccc Confidence 54322100000000 00 000000000011111111110000000 00000 00000000000000 0000 Q ss_pred cccccceeeeeeeecccccccccccc Q lcl|NC_021299. 297 RFVRGTRIHLKATDAEIEGETVKAGE 322 (387) Q Consensus 297 ~~v~~~~v~~~~~~~~~~~~~~~~~~ 322 (387) .++. +. . .... ..+...+. T Consensus 297 a~~~---l~--~-~~a~-~~~~~~~~ 315 (315) T protein:vir:80 297 SFAV---VK--E-KAAP-KPNPPAEN 315 (315) T ss_pred ceEE---Ee--e-ccCC-CCCCCCCC Confidence 0000 00 0 0000 00000000 No 85 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=98.17 E-value=1.7e-06 Score=52.28 Aligned_cols=298 Identities=9% Similarity=-0.029 Sum_probs=130.7 Q ss_pred Cc----cccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccccccc Q lcl|NC_021299. 1 MA----NAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTE 76 (387) Q Consensus 1 Ma----~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (387) |. ...++|+-|...++..+++...+.+++..- . -.+.++.+|++......... ..+++..+...+++- T Consensus 116 ~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~l~~~~-----~-~~~~~~~~~~~~~~~~~~~~--~~~E~~~~~~s~~~f 187 (421) T protein:vir:13 116 IMSSTNNGAVIPQEFVNEFEKLKEGYPSLKEHCHVI-----P-VNRNAGKMPVRAGASVDKLA--NLAKDTELVKAMLKT 187 (421) T ss_pred ccccCCcceecchhhHHHHHHHHHhhhhhhhhceee-----e-ccCCceEEEEeecCCcccee--eccccccccccccce Confidence 11 123789999999999999999888877532 1 12345667655443332221 123344454455566 Q ss_pred ceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHHHHHHH Q lcl|NC_021299. 77 VTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVSNRRWL 156 (387) Q Consensus 77 ~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l 156 (387) ..+++.+.+. +.-+.++++-+.....++...+.++..++++..+|..++....+... ......|++|+++...| T Consensus 188 ~~i~~~~~k~-~~~v~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~~~~g~~~-----~~~~~~~d~i~~~~~~l 261 (421) T protein:vir:13 188 QPMAYDIDDY-GLLAPIDNSLLEDSEINFLEFVNEEFAEFAVNTENAEIVKQAKAVLA-----EETINDYAGLVKTINSL 261 (421) T ss_pred eEEEeeeeee-EeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHhhhhHhhhhhhccc-----cccccchHHHHHHHHHh Confidence 6666666443 34455666655445566777777778888999999888876554432 22335688899888888 Q ss_pred hhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeee--ccccccccc Q lcl|NC_021299. 157 NEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLS--HPTAYAMLT 234 (387) Q Consensus 157 ~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~--~~~a~~~~~ 234 (387) ..+..+ +-..|++|..+..|.+... ..|.-.-.....|..+.+.|+.++.+..+|....... --..+.-. T Consensus 262 ~~~~~~--~a~~v~n~~~~~~l~~lkd-----~~G~~i~~~~~~~~~~tl~G~pV~~~~~~~~~~~~~~~~~~gd~~~~- 333 (421) T protein:vir:13 262 VPNARK--RAIIVTNSDGRAYLDGLMD-----KQGRPLLKELSDGGDLVFKGRPVIELEESIFDVGDETKFIVSDFKTL- 333 (421) T ss_pred hhhhcC--CCEEEEcHHHHHHHHHhhc-----CCCceeecCcCCCCCceecceeeEEeccccccCCCceEEEEEecccc- Confidence 766543 3456889999888764211 1111000112334456788999988877664321100 00000000 Q ss_pred cccccccCceeeeeeecccccc--eeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeeeecc Q lcl|NC_021299. 235 RSPGRPMTNTVATSTVATENGV--QLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKATDAE 312 (387) Q Consensus 235 ~~~~~~~~~t~~~~~~~~~~~~--~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~~~~ 312 (387) .......+..........+... .+....-+ +....+.. ............++..... ++ .-. T Consensus 334 ~~~~~~~~~~v~~~~~~~f~~~~~~~r~~~r~------d~~~~~~~-------a~~~~~~~~~~a~v~~~~~--~~-~~~ 397 (421) T protein:vir:13 334 IKFMDRKQYLIDQSKEAGYTKNETIARIIERF------DVNSPLDK-------SSDAEKIRKFGVIVKLQEV--LK-SSP 397 (421) T ss_pred EEEEEecceEEEeecccccccCeeEEEEEeee------cceeecch-------hhheeeecccceeeccccc--cC-CCC Confidence 0000001111111111000000 00000000 00000000 0000000000000000000 00 000 Q ss_pred ccccccccccceeEEEeeccCCccccCcceEEEecCc Q lcl|NC_021299. 313 IEGETVKAGEKLALALEDSNGDNRAGDPLVTWTSGTT 349 (387) Q Consensus 313 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~w~Ssn~ 349 (387) .+..+...| ..+++..+.-.... + T Consensus 398 ~~~~~~~~~-~~~~~~~~~~~~~~------------~ 421 (421) T protein:vir:13 398 RSGKNKNES-KEEIKEEGEATQQN------------E 421 (421) T ss_pred cCCCCcccc-chheeeccccccCC------------C Confidence 011111112 11222222211111 1 No 86 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=98.16 E-value=1.6e-06 Score=52.38 Aligned_cols=287 Identities=10% Similarity=0.018 Sum_probs=123.9 Q ss_pred Ccc--ccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeece----eccccccccccccccc Q lcl|NC_021299. 1 MAN--AFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTR----GLRATGADRNMVASDL 74 (387) Q Consensus 1 Ma~--~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~----~~~~~~~~~~~~~~~~ 74 (387) |.. .-+.|+.+..++++.+++..++..++..- .. .+..+.+|+......... ......++...+..++ T Consensus 20 ~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~---~~---~~~~~~~p~~~~~~~a~~v~eg~~~~~~e~~~~~~~~~ 93 (333) T protein:vir:78 20 LAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQI---PI---SYGETIIPTTVKRPEVGQVGVGTSNEQREGGLKPLSGT 93 (333) T ss_pred eecCCccccchhHHHHHHHHHHhhchhhhhccee---ec---cCCceEEEEEeCCceeEeecCccccccccccccccccc Confidence 111 11679999999999999999888776432 11 245667766433222111 1111112222333344 Q ss_pred ccceEEEEEEeee-ecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHh-ccc-------c---------cc Q lcl|NC_021299. 75 TEVTVDIKLTDVI-YNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLIT-KAP-------Y---------EK 136 (387) Q Consensus 75 ~~~~~~~~id~~~-~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~-~~~-------~---------~~ 136 (387) +-..+++.. +| +.-+.++++-+..+..++...+.++..++++.++|..++.--. ..+ . .. T Consensus 94 ~f~~i~l~~--~kl~~~~~is~ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~~~~~g~~~~~~~~~~~~~~ 171 (333) T protein:vir:78 94 AWDTRSVSP--IKLATIVTVSEEFARMNPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTGSALQGIDTDNVIANTTNVD 171 (333) T ss_pred ceeEEEEee--EEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCCccccccccccccccccccc Confidence 444445444 33 3445566654555677788888888899999999998873111 000 0 01 Q ss_pred cccCCcchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeec Q lcl|NC_021299. 137 VSLVDEDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDT 216 (387) Q Consensus 137 ~~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~ 216 (387) .........+++++++...+..+. .......+++|..+..|++...+...+... -.......+..+.+.|+.++.++. T Consensus 172 ~~~~~~~~~~~~i~~~~~~~~~~~-~~~~~~~vmn~~~~~~L~~~~~~~d~~G~~-i~~~~~~~~~~~~l~G~Pv~~~~~ 249 (333) T protein:vir:78 172 YLQETGDPLLDRLLDGYDLVSANT-DVEFNGWAVDPRFRAHLLRAQAYRDANGNV-DPSRINLAAQTGDVLGLPAQFGRA 249 (333) T ss_pred ccccccchhHHHHHHHHHhhcccc-ccCceEEEEcchHHHHHHHHhhhcCCCCce-eecCccccCCCceeeceeeEEccc Confidence 111223345788888877765443 223346788999888886644332211110 001123345567899999999988 Q ss_pred cceeeeeeeccccccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeee-eeeeeccccceeeec--cc Q lcl|NC_021299. 217 LPHGDAYLSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVD-TWIGVKAVLDPVTAN--LD 293 (387) Q Consensus 217 ~~~~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~-~~~g~~~~~~~~~~~--~~ 293 (387) +|.+...........+ .+. +..-......+..+....+..... ......+ ...+...+....... .. T Consensus 250 i~~~~~~~~~~~~~~~-~gD--------~~~~~~g~~~~~~i~~~~~~~~~~-~~~~~~~~~~~~~v~~r~~~r~d~~v~ 319 (333) T protein:vir:78 250 VGGDLGAAVDSKTRII-GGD--------FSQLKFGFADEIRIKMSDTATLTD-SGSATVSMWQTNQIAILIEVTFGWLLG 319 (333) T ss_pred cCCCccccCCCccEEE-EEe--------cccEEEEEeeccEEEEeccccccc-cccceeehhhcCcEEEEEEEEEccEEe Confidence 8765321111000000 000 000000000011111100000000 0000000 000000000000000 00 Q ss_pred eeccccccceeeeeeeeccccccccccccceeEEEeeccCC Q lcl|NC_021299. 294 DEPRFVRGTRIHLKATDAEIEGETVKAGEKLALALEDSNGD 334 (387) Q Consensus 294 ~~~~~v~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 334 (387) ....++.....+ .| T Consensus 320 ~~~a~~~l~~~~---------------------------a~ 333 (333) T protein:vir:78 320 DKQAFVKFVDDE---------------------------QP 333 (333) T ss_pred cccceEEEeccC---------------------------CC Confidence 000010000000 00 No 87 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=98.16 E-value=9.6e-07 Score=53.58 Aligned_cols=273 Identities=9% Similarity=-0.015 Sum_probs=127.4 Q ss_pred Cccc--cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccccce Q lcl|NC_021299. 1 MANA--FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEVT 78 (387) Q Consensus 1 Ma~~--~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (387) |+.+ .++|+.+..++++.+++..++..++..- . -.+..++||+........ . .+++......+++-.+ T Consensus 1 ma~~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~-----~-~~~~~~~~p~~~~~~~a~--~--v~Eg~~~~~~~~~f~~ 70 (298) T protein:vir:94 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQK-----P-IPFNGEKVFTFTMDSEID--V--VAESGKKTHGGVTLAP 70 (298) T ss_pred CeeccccccChhHHHHHHHHHHhhchhhhhccee-----e-ccCCceEEEEEecCcceE--E--eeCCccccccccceeE Confidence 9976 3788888999999999999887776422 1 123456776643222111 1 2344455555666666 Q ss_pred EEEEEEeeeecceeeccHHHh---hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcc-----------------cccccc Q lcl|NC_021299. 79 VDIKLTDVIYNRIDLTDEERE---LDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKA-----------------PYEKVS 138 (387) Q Consensus 79 ~~~~id~~~~~~~~~~d~~~~---~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~-----------------~~~~~~ 138 (387) +++...|. ..-+.++++-+. .+..++...+.++..+++++++|..++...... ...... T Consensus 71 v~l~~~k~-~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~ 149 (298) T protein:vir:94 71 QTMVPIKV-EYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEA 149 (298) T ss_pred EEEeeeEE-EEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccccccccccccc Confidence 77666333 345566666432 234567777778889999999999887431100 000011 Q ss_pred cCCcchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccc-cceeeeeeEEEEeecceeeeeecc Q lcl|NC_021299. 139 LVDEDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEA-GASRLQTARIGRLAQYDVVTVDTL 217 (387) Q Consensus 139 ~~~~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~-~~~~~~~g~ig~~~g~~v~~s~~~ 217 (387) .......++++.++...|..++.. ....+++|..+..|.+...-. |.- .......|..+.+.|+.++.++.+ T Consensus 150 ~~~~~~~~~~i~~~~~~~~~~~~~--~~~~vmn~~~~~~l~~lkd~~-----G~~l~~~~~~~~~~~tl~G~PV~~~~~v 222 (298) T protein:vir:94 150 PRGIADPNGAIENAVELLTGVDAD--VTGIAINPSFRSALAKQKDLQ-----GNALFPELKWGATPDTINGLPVDVNKTV 222 (298) T ss_pred ccccccHHHHHHHHHHhhhhcCCC--ccEEEEcHHHHHHHHHhhccC-----CCeeecCcccCCCCceecceeeEEeccc Confidence 122334577888888888877643 346899999998886532111 110 011223445567889999988877 Q ss_pred ceeeeeeeccccccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccce--eeecccee Q lcl|NC_021299. 218 PHGDAYLSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDP--VTANLDDE 295 (387) Q Consensus 218 ~~~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~--~~~~~~~~ 295 (387) |...... .. .+..+.. .. .-......+..+.+..+.+..... ...... +...+... ........ T Consensus 223 ~~~~~~~---~~-~~~~Gdf----s~---~~~~~~~~~~~~~~~~~~~~d~~~-~~~f~~--~~v~~r~~~r~~~~~~~~ 288 (298) T protein:vir:94 223 SDMSLTQ---RD-RAIIGDF----AN---GFKWGYAKEVPLEVIQYGDPDNSG-LDLKGY--NQVYIRAELFLGWGILDA 288 (298) T ss_pred ccccCCC---cc-EEEEeec----cc---eEEEEEecCceEEEeecCCCcCcc-hhhhhc--CcEEEEEEEEeccEeecc Confidence 6431100 00 0000000 00 000000111111111111100000 000000 00000000 00000001 Q ss_pred ccccccceee Q lcl|NC_021299. 296 PRFVRGTRIH 305 (387) Q Consensus 296 ~~~v~~~~v~ 305 (387) ..++....++ T Consensus 289 ~a~~~l~~~t 298 (298) T protein:vir:94 289 TKFARVTEAN 298 (298) T ss_pred cceEEEEecC Confidence 1111111111 No 88 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=98.13 E-value=3.1e-06 Score=50.80 Aligned_cols=309 Identities=9% Similarity=0.016 Sum_probs=140.9 Q ss_pred Ccccc----ccHH--HHHHHHHHHHHhhccc--cceeeeccccccc---ccCCCEEEEEecccceee-ceeccccccccc Q lcl|NC_021299. 1 MANAF----IKPP--VIIASILGQLQHELVL--PNFVFKNGYGDVA---HKFNDTITIRIPVPTIAH-TRGLRATGADRN 68 (387) Q Consensus 1 Ma~~~----~~pe--~~~~~~~~~l~~~~~~--~~~~~~d~~~~~~---~~~gdtv~i~~~~~~~~~-~~~~~~~~~~~~ 68 (387) ||-+- ++|| +|.+.+.++-.+...| ..++-+| .++. ...|+.+++|.++..... +........... T Consensus 1 Ma~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d--~~l~~~~~~gG~~~~iPf~~~l~g~~e~n~~~dt~~~~ 78 (349) T protein:vir:94 1 MAITTIGNIVTGNIPVLASYMTEDPVEKTAFFNSGILTPT--PYAAEIARGPSNIANLPFWKAIDTSIEPNYSNDVYQDI 78 (349) T ss_pred CCceEEeeeeccChHHHHHHHHHhHHHhhhhhhccceecc--HHHHHHHhcCCCEEEeeeeecCCCCcccccCCCCcccc Confidence 99654 6777 7999988877665555 3344444 2343 256999999988764322 111111111123 Q ss_pred ccccccccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc------------ Q lcl|NC_021299. 69 MVASDLTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEK------------ 136 (387) Q Consensus 69 ~~~~~~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~------------ 136 (387) +.+..++..+..-.+ ....++|..+|.-..+.-.|++..+.++-..--.+.-.+.+++.+++.-... T Consensus 79 ~t~~kit~~~~~a~~-~~r~kaw~~~Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~~~~~~~~~~~ 157 (349) T protein:vir:94 79 ATPRAIQTGEMMARV-AYLNEGFGQADLTVELTSQNPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSATDAYHEQND 157 (349) T ss_pred cccccccccceeeee-eeeccccchhHHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHHHhhhcccccccccccccCc Confidence 555555555543333 4455677777776666666888888777776666666666777665432211 Q ss_pred ---cccCCcchhHHHHHHHHHHHhhccC---CcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecce Q lcl|NC_021299. 137 ---VSLVDEDEIWNGVVSNRRWLNEQKV---PKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYD 210 (387) Q Consensus 137 ---~~~~~~~~~~~~i~~a~~~l~~~~v---p~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~ 210 (387) ..........+.+++|...|.+... ...-..+++.+..+..|.+...+..... .-+...++.+.|.. T Consensus 158 ~~~d~~~~a~~~~~~~~~A~~~~Gdaa~Gd~~~~lt~i~mHS~v~~~L~~~~li~~i~~-------s~~~~~i~ty~G~~ 230 (349) T protein:vir:94 158 MVVDVSATSGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLIDFIRD-------AENNTMFATYQGYR 230 (349) T ss_pred eeEEecccCCCChhhHHHHHHHHHHHhccccccceeEEEEchHHHHHHHhcchhhhccC-------cccCcccceecCcE Confidence 1112233456778888888877521 1222567799999999887654322211 11344568889999 Q ss_pred eeeeeccceee--------eeeeccccccccccccccccCceeeeeeecc--cccceeeeeeeeeeccceeeeeeeeeee Q lcl|NC_021299. 211 VVTVDTLPHGD--------AYLSHPTAYAMLTRSPGRPMTNTVATSTVAT--ENGVQLRWLGDYDATSTTERSIVDTWIG 280 (387) Q Consensus 211 v~~s~~~~~~~--------~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~--~~~~~~~~~~~~d~~~~~~~~~~~~~~g 280 (387) |+.+..+|... .+.+..+++.+..+.+..+. ........ ..+...-+.+. .... ...| T Consensus 231 VivDD~~Pv~~~g~~~~yttylfg~GAi~~~~~~~~~~~---E~~rd~~~g~~~G~d~L~~R~--------~~~~-hp~G 298 (349) T protein:vir:94 231 VIVDDSMTVVGQDTSRKFISIIFGQGAIGYGEGNPEMPL---EYEREASRANGGGVETLWTRK--------TWLL-HPFG 298 (349) T ss_pred EEEeCCCccccCCCCceEEEEEeecceEEeecCCCCcce---eeecccccCCcceeEEEEEee--------EEEe-eeee Confidence 99999998642 22334444443333221110 00000000 01111111110 0001 1111 Q ss_pred eccccceeeeccceeccccccceeeeeeeeccccccccccccceeEEEeeccCCccccCcceEEEecCce Q lcl|NC_021299. 281 VKAVLDPVTANLDDEPRFVRGTRIHLKATDAEIEGETVKAGEKLALALEDSNGDNRAGDPLVTWTSGTTA 350 (387) Q Consensus 281 ~~~~~~~~~~~~~~~~~~v~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~w~Ssn~~ 350 (387) ............... ...+++-..|..+..+.....+.. -+=+.+.+...+ T Consensus 299 ~s~~~a~v~~~~~~~-------------~~~sPt~aeLa~~~NW~~v~~~K~------I~iv~~~~~~~a 349 (349) T protein:vir:94 299 YSFTSAVITGNGTET-------------IARSASWQDLANAANWNRVVDRKH------VPIAFLVTGVGA 349 (349) T ss_pred eeecccccCCCcccc-------------ccCCCChHHhcCCcCcccccChhh------cceEEEEeccCC Confidence 111111000000000 000000000111111111000000 000111111111 No 89 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=98.12 E-value=1.8e-06 Score=52.11 Aligned_cols=260 Identities=11% Similarity=0.003 Sum_probs=123.8 Q ss_pred Cccc-----cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccc Q lcl|NC_021299. 1 MANA-----FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLT 75 (387) Q Consensus 1 Ma~~-----~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 75 (387) |... .++|+.+...+++.+++...+..++..- . . .+..+++|........ .. ..+++..++..+++ T Consensus 105 ~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~--~-~---~~~~~~~~~~~~~~~~-a~--~v~E~~~~~~~~~~ 175 (385) T protein:vir:18 105 LGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQG--R-T---SSNALEYVREEVFTNN-AD--VVAEKALKPESDIT 175 (385) T ss_pred hccccccCCceecchhhhHHHHHhhhccchhhhccee--c-c---cCcceEEEEEecCCcc-ee--eeccCccccccccc Confidence 2211 2456667888999999999888876532 1 1 2445667653221111 11 12344455555666 Q ss_pred cceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHH-h---------cccc-cccccCCcch Q lcl|NC_021299. 76 EVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLI-T---------KAPY-EKVSLVDEDE 144 (387) Q Consensus 76 ~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~-~---------~~~~-~~~~~~~~~~ 144 (387) -..+++.+.+.. .-+.++++ ...+..++...+.++..++++.++|..++.-- . .+.. .......... T Consensus 176 ~~~~~~~~~k~~-~~~~is~e-ll~d~~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~ 253 (385) T protein:vir:18 176 FSKQTANVKTIA-HWVQASRQ-VMDDAPMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLNATGDT 253 (385) T ss_pred eeEEEEeeeeEE-EeehhhHH-HHhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccccccc Confidence 666676664433 34556654 55566666666677778999999998877321 0 0000 1111123345 Q ss_pred hHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeee Q lcl|NC_021299. 145 IWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYL 224 (387) Q Consensus 145 ~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~ 224 (387) .++.+.++...|..... ..-.++++|..+..|.+.... .|...-.....+..+.+.|+.|+.+..+|.+..+. T Consensus 254 ~~d~i~~~~~~l~~~~~--~~~~~~~~~~~~~~l~~lkd~-----~G~~l~~~~~~~~~~~l~G~pV~~~~~~p~~~~~~ 326 (385) T protein:vir:18 254 RADIIAHAIYQVTESEF--SASGIVLNPRDWHNIALLKDN-----EGRYIFGGPQAFTSNIMWGLPVVPTKAQAAGTFTV 326 (385) T ss_pred hHHHHHHHHHhhccccC--CCCEEEEcHHHHHHHHHhhcC-----CCceeccCcccCCCceecceeeEEcCcCCCCcEEE Confidence 68889998888876653 334678999998887653211 11110011234555778999999999988654332 Q ss_pred eccc-cccccccccccccCceeeeeeec--cc--ccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceecccc Q lcl|NC_021299. 225 SHPT-AYAMLTRSPGRPMTNTVATSTVA--TE--NGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFV 299 (387) Q Consensus 225 ~~~~-a~~~~~~~~~~~~~~t~~~~~~~--~~--~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v 299 (387) .... ++.+..+ .+......... .+ +...+......+. ...+ .. T Consensus 327 gd~~~~~~~~~~-----~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~------~v~~------------------~~--- 374 (385) T protein:vir:18 327 GGFDMASQVWDR-----MDATVEVSREDRDNFVKNMLTILCEERLAL------AHYR------------------PT--- 374 (385) T ss_pred eecccEEEEEEe-----cceEEEEeccccchhhcCcEEEEEEEeecc------EEec------------------cc--- Confidence 2111 1111100 01110000000 00 0000000000000 0000 00 Q ss_pred ccceeeeeeee Q lcl|NC_021299. 300 RGTRIHLKATD 310 (387) Q Consensus 300 ~~~~v~~~~~~ 310 (387) ....+++.... T Consensus 375 a~~~~~~~aa~ 385 (385) T protein:vir:18 375 AIIKGTFSSGS 385 (385) T ss_pred ceEEEEeccCC Confidence 00000100000 No 90 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=98.12 E-value=1.8e-06 Score=52.11 Aligned_cols=260 Identities=11% Similarity=0.003 Sum_probs=123.8 Q ss_pred Cccc-----cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccc Q lcl|NC_021299. 1 MANA-----FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLT 75 (387) Q Consensus 1 Ma~~-----~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 75 (387) |... .++|+.+...+++.+++...+..++..- . . .+..+++|........ .. ..+++..++..+++ T Consensus 105 ~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~--~-~---~~~~~~~~~~~~~~~~-a~--~v~E~~~~~~~~~~ 175 (385) T protein:vir:19 105 LGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQG--R-T---SSNALEYVREEVFTNN-AD--VVAEKALKPESDIT 175 (385) T ss_pred hccccccCCceecchhhhHHHHHhhhccchhhhccee--c-c---cCcceEEEEEecCCcc-ee--eeccCccccccccc Confidence 2211 2456667888999999999888876532 1 1 2445667653221111 11 12344455555666 Q ss_pred cceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHH-h---------cccc-cccccCCcch Q lcl|NC_021299. 76 EVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLI-T---------KAPY-EKVSLVDEDE 144 (387) Q Consensus 76 ~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~-~---------~~~~-~~~~~~~~~~ 144 (387) -..+++.+.+.. .-+.++++ ...+..++...+.++..++++.++|..++.-- . .+.. .......... T Consensus 176 ~~~~~~~~~k~~-~~~~is~e-ll~d~~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~ 253 (385) T protein:vir:19 176 FSKQTANVKTIA-HWVQASRQ-VMDDAPMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLNATGDT 253 (385) T ss_pred eeEEEEeeeeEE-EeehhhHH-HHhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccccccc Confidence 666676664433 34556654 55566666666677778999999998877321 0 0000 1111123345 Q ss_pred hHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeee Q lcl|NC_021299. 145 IWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYL 224 (387) Q Consensus 145 ~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~ 224 (387) .++.+.++...|..... ..-.++++|..+..|.+.... .|...-.....+..+.+.|+.|+.+..+|.+..+. T Consensus 254 ~~d~i~~~~~~l~~~~~--~~~~~~~~~~~~~~l~~lkd~-----~G~~l~~~~~~~~~~~l~G~pV~~~~~~p~~~~~~ 326 (385) T protein:vir:19 254 RADIIAHAIYQVTESEF--SASGIVLNPRDWHNIALLKDN-----EGRYIFGGPQAFTSNIMWGLPVVPTKAQAAGTFTV 326 (385) T ss_pred hHHHHHHHHHhhccccC--CCCEEEEcHHHHHHHHHhhcC-----CCceeccCcccCCCceecceeeEEcCcCCCCcEEE Confidence 68889998888876653 334678999998887653211 11110011234555778999999999988654332 Q ss_pred eccc-cccccccccccccCceeeeeeec--cc--ccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceecccc Q lcl|NC_021299. 225 SHPT-AYAMLTRSPGRPMTNTVATSTVA--TE--NGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFV 299 (387) Q Consensus 225 ~~~~-a~~~~~~~~~~~~~~t~~~~~~~--~~--~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v 299 (387) .... ++.+..+ .+......... .+ +...+......+. ...+ .. T Consensus 327 gd~~~~~~~~~~-----~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~------~v~~------------------~~--- 374 (385) T protein:vir:19 327 GGFDMASQVWDR-----MDATVEVSREDRDNFVKNMLTILCEERLAL------AHYR------------------PT--- 374 (385) T ss_pred eecccEEEEEEe-----cceEEEEeccccchhhcCcEEEEEEEeecc------EEec------------------cc--- Confidence 2111 1111100 01110000000 00 0000000000000 0000 00 Q ss_pred ccceeeeeeee Q lcl|NC_021299. 300 RGTRIHLKATD 310 (387) Q Consensus 300 ~~~~v~~~~~~ 310 (387) ....+++.... T Consensus 375 a~~~~~~~aa~ 385 (385) T protein:vir:19 375 AIIKGTFSSGS 385 (385) T ss_pred ceEEEEeccCC Confidence 00000100000 No 91 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=98.09 E-value=2.1e-06 Score=51.69 Aligned_cols=259 Identities=12% Similarity=0.051 Sum_probs=123.5 Q ss_pred Cccc--cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccccce Q lcl|NC_021299. 1 MANA--FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEVT 78 (387) Q Consensus 1 Ma~~--~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (387) ++.+ .+.|+-|+.++++.+++...+.+++++.. . .|.++++|+.......-.. .+++...+-.+++-.. T Consensus 117 ~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~---~---~~~~~~~~~~~~~~~~a~~---v~E~~~~~~~~~~~~~ 187 (395) T protein:vir:43 117 IDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGT---T---ESNSVEYVRETGFVNNAAP---VSEGTQKPYSDLTFEL 187 (395) T ss_pred cCCCCccccchhhHHHHHHHHHhhhhHHhhcccee---c---CCCceEEEEEecCCCceee---ecCCccccccccceeE Confidence 1111 25566688899999999999988876431 1 2455677654322111111 2334445555566666 Q ss_pred EEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH----------Hhcccc---cccccCCcchh Q lcl|NC_021299. 79 VDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYL----------ITKAPY---EKVSLVDEDEI 145 (387) Q Consensus 79 ~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~----------~~~~~~---~~~~~~~~~~~ 145 (387) +.+.+.+.. .-+.++++ +..+..++...+.++..++++..+|..++.- +..... ........... T Consensus 188 i~~~~~k~~-~~~~is~e-ll~d~~~l~~~v~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~ 265 (395) T protein:vir:43 188 ENAPVRTIA-HLFKASRQ-ILDDASALQSYIDARARYGLMLVEECQLLYGNGTGANLHGIIPQAQAYAPPSGVVVTAEQR 265 (395) T ss_pred EEEeeeeEE-EeehhhHH-HHHhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccccchh Confidence 666664433 34456654 5556656555556667889999999987731 110000 01112223355 Q ss_pred HHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeee Q lcl|NC_021299. 146 WNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLS 225 (387) Q Consensus 146 ~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~ 225 (387) ++.+.++...+...+.+ .-.++++|..+..|.+... ..|.-.......+..+.+.|+.|+.++.+|.+..+.. T Consensus 266 ~~~i~~~~~~~~~~~~~--~~~~vmn~~~~~~l~~lkd-----~~G~~i~~~~~~~~~~~l~G~pVv~~~~~~~~~~~~g 338 (395) T protein:vir:43 266 IDRIRLAILQAQLAEFP--ASGIVLNPIDWALIELNKD-----AENRYIIGSPQNGTTPTLWRLPVVETQAITQDEFLTG 338 (395) T ss_pred HHHHHHHHHhhccccCC--CcEEEEcHHHHHHHHHhhc-----cCCceeccccccCCCceecceeeEEcCCCCCCcEEEE Confidence 88888888888766543 3467899999888754221 1111111112344556789999999998886653321 Q ss_pred cccc-ccccccccccccCceeeeeeec----ccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccc Q lcl|NC_021299. 226 HPTA-YAMLTRSPGRPMTNTVATSTVA----TENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVR 300 (387) Q Consensus 226 ~~~a-~~~~~~~~~~~~~~t~~~~~~~----~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~ 300 (387) -... +.+..+ .+......... ..+...+......+. ...+ ... T Consensus 339 d~~~~~~~~~~-----~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~------~v~~------------------~~a--- 386 (395) T protein:vir:43 339 AFSLGAQIFDR-----MDIEVLVSTENDKDFENNMVTIRAEERLAF------AVYR------------------PEA--- 386 (395) T ss_pred eccceEEEEEe-----cceEEEEeccccchhhcCcEEEEEEEeecc------EEec------------------ccc--- Confidence 1111 111000 01111000000 000000000000000 0000 000 Q ss_pred cceeeeeee Q lcl|NC_021299. 301 GTRIHLKAT 309 (387) Q Consensus 301 ~~~v~~~~~ 309 (387) ...+++... T Consensus 387 ~~~~~~taa 395 (395) T protein:vir:43 387 FVTGSLTAS 395 (395) T ss_pred eEEEEeccC Confidence 000000000 No 92 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=98.08 E-value=2.4e-06 Score=51.39 Aligned_cols=266 Identities=11% Similarity=0.019 Sum_probs=119.3 Q ss_pred CccccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceee----ceeccccccccccccccccc Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAH----TRGLRATGADRNMVASDLTE 76 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~ 76 (387) -+...+.|+.+...+....+..+.+..+++.- . ..+..++++........ .....-.+++...+..+++- T Consensus 130 ~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~-----~-~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~ 203 (419) T protein:vir:94 130 NPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQ-----N-ADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSF 203 (419) T ss_pred CCcccccchhhhHHHHHHHhhhhhhhhcceee-----e-ccCCceeeeeeccccccccccCcccceecCCccccccccce Confidence 22334678999988888877777776665421 1 12445555442111110 00001112344444455555 Q ss_pred ceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH-Hhcccc--------------cccccCC Q lcl|NC_021299. 77 VTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYL-ITKAPY--------------EKVSLVD 141 (387) Q Consensus 77 ~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~-~~~~~~--------------~~~~~~~ 141 (387) ..+++.+.+. +.-+.++. ++..+..++...+.++..++++.++|..++.- -.+.+. ......+ T Consensus 204 ~~i~~~~~k~-~~~~~is~-ell~d~~~l~~~i~~~la~a~~~~~d~aii~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~t 281 (419) T protein:vir:94 204 DTITTTLKTV-AHWLPITR-QAADDNSQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPAT 281 (419) T ss_pred eeEEeeeeeE-EEeehhhH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceecccccccccccccccccc Confidence 5666666333 23345554 45556666666666668899999999988731 111000 0011123 Q ss_pred cchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceee Q lcl|NC_021299. 142 EDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGD 221 (387) Q Consensus 142 ~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~ 221 (387) ....|+++.++...+.....+ ...++++|..+..|.+...-...... .......+..+.+.|+.++.+..+|.+. T Consensus 282 ~~~~~~~l~~~~~~~~~~~~~--~~~~v~n~~~~~~l~~~k~~~~~~~~---~~~~~~~~~~~~l~G~pV~~~~~~~~~~ 356 (419) T protein:vir:94 282 DEPPLVDIRRAKTVAEIAGFP--PDGVVVHPQDWESIELDQAPGSGVFR---VIANVQGEATPRIWGLNVVSTVAIAQGT 356 (419) T ss_pred cchhHHHHHHHHHhhhhccCC--CCEEEEcHHHHHHHHHHhhcCCCcee---ecCCcccCCCccccceeeEEcCCCCCcc Confidence 345588899988888776643 33678999998887642110000000 0111234455688999999999888654 Q ss_pred eeeeccc-cccccccccccccCceeeeeeec--c--cccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceec Q lcl|NC_021299. 222 AYLSHPT-AYAMLTRSPGRPMTNTVATSTVA--T--ENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEP 296 (387) Q Consensus 222 ~~~~~~~-a~~~~~~~~~~~~~~t~~~~~~~--~--~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 296 (387) .+..... .+....+ .+.+....... . .+...+.+...++. ...+ . . T Consensus 357 ~~~gd~~~~~~~~~~-----~~~~v~~~~~~~~~~~~~~~~~r~~~r~d~------~v~~---------~---------~ 407 (419) T protein:vir:94 357 ALVGGFRQGATLWSR-----QGITVLMTDSHADFFTANTLVILAEFRANL------AVYQ---------P---------K 407 (419) T ss_pred EEEeeccceEEEEEe-----cceEEEEeccccchhhcCcEEEEEEEeecc------EEec---------c---------c Confidence 3221111 1110000 01100000000 0 00000111100000 0000 0 0 Q ss_pred cccccceeeeeeeecccc Q lcl|NC_021299. 297 RFVRGTRIHLKATDAEIE 314 (387) Q Consensus 297 ~~v~~~~v~~~~~~~~~~ 314 (387) .+ ..+++.... + T Consensus 408 a~---~~~~~~aa~---~ 419 (419) T protein:vir:94 408 AF---VRVTFAAAT---T 419 (419) T ss_pred cE---EEEEeccCC---C Confidence 00 000000000 0 No 93 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=98.05 E-value=2.9e-06 Score=50.98 Aligned_cols=291 Identities=11% Similarity=0.018 Sum_probs=127.9 Q ss_pred Ccc------------ccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceee----ceeccccc Q lcl|NC_021299. 1 MAN------------AFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAH----TRGLRATG 64 (387) Q Consensus 1 Ma~------------~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~----~~~~~~~~ 64 (387) |+. .-++|+-|+.++++.+++...+..++.+- .-.+..+.||+....... .....-.+ T Consensus 10 ~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~------~~~~~~~~ip~~~~~~~a~~v~~~~~~~~~ 83 (338) T protein:vir:78 10 NTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENI------PISYGETIIPTTVKRPEVGQVGVGTSNEQR 83 (338) T ss_pred hhcccccccceecccccccchHHHHHHHHHHHhhchhhhhccee------eccCCceEEEEEecCccceeeccccccccc Confidence 111 11789999999999999999998887431 123567777764322111 11111123 Q ss_pred ccccccccccccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhc--------c---- Q lcl|NC_021299. 65 ADRNMVASDLTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITK--------A---- 132 (387) Q Consensus 65 ~~~~~~~~~~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~--------~---- 132 (387) ++...+..+++-..+++...+ .+.-+.++++-+.....++...+.++..++++.++|..++.--.. . T Consensus 84 Eg~~~~~~~~~f~~v~l~~~k-~~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~~~gi~~~~ 162 (338) T protein:vir:78 84 EGGTKPLSGTAWDTRSVAPIK-LATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTGSALQGIDTNN 162 (338) T ss_pred ccccccccccceeEEEEEEEE-EEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccc Confidence 344455556666666666633 345566777655556678888888889999999999988742110 0 Q ss_pred ccc-----ccccCCcchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEee Q lcl|NC_021299. 133 PYE-----KVSLVDEDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLA 207 (387) Q Consensus 133 ~~~-----~~~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~ 207 (387) ... ..........|+.+.++...+..+ ........+++|..+..|.+...+.....-. -.......+..+.+. T Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~m~~~~~~~L~~~~~l~d~~g~~-l~~~~~~~~~~~~l~ 240 (338) T protein:vir:78 163 VIVNTTNVDYLQTGTTPLLDRFLDGYDLVSAN-TDVDFNGWAADPRYRARLLRSQAYRDANGNV-DPTRINLAASAGDLL 240 (338) T ss_pred ccccccccccccccchhhHHHHHHHHHHhhhh-ccccceEEEEchHHHHHHHHHhhhccCCCce-eecccccCCCCceee Confidence 000 001112234567777776666433 2234456889999888876543322211000 001122345557889 Q ss_pred cceeeeeeccceeeeeeeccccccccccccccccCceeeeeeecccccceeeeeeeeeeccce--eeeeeeee-eeeccc Q lcl|NC_021299. 208 QYDVVTVDTLPHGDAYLSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTT--ERSIVDTW-IGVKAV 284 (387) Q Consensus 208 g~~v~~s~~~~~~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~--~~~~~~~~-~g~~~~ 284 (387) |+.++.++.+|.............+ .+. +..-......+..+....+....... .....+-. ...... T Consensus 241 G~PV~~~~~ip~~~~~~~~~~~~~~-~gd--------fs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 311 (338) T protein:vir:78 241 GLPVQFGKAVGGDLGAATDSKVRVV-GGD--------FSQLKYGFADEIRVKMSDTATLTDNTSPTPQTVSMWQTNQIAI 311 (338) T ss_pred eeeEEEccccCccccccCCcccEEE-EEe--------cceEEEEeecccEEEEeecccccccccccccchhhhhcCcEEE Confidence 9999998888753221100000000 000 00000000001111110000000000 00000000 000000 Q ss_pred ccee--eeccceeccccccceeeeeee Q lcl|NC_021299. 285 LDPV--TANLDDEPRFVRGTRIHLKAT 309 (387) Q Consensus 285 ~~~~--~~~~~~~~~~v~~~~v~~~~~ 309 (387) .... .........++...+.+-+.. T Consensus 312 r~~~r~d~~v~~~~a~~~l~~~~~~~~ 338 (338) T protein:vir:78 312 LIEVTFGWLLGDKQAFVKFVDDEDPDA 338 (338) T ss_pred EEEEEeccEeecccceEEEecccCCCC Confidence 0000 000000001111100000000 No 94 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=98.04 E-value=3.7e-06 Score=50.37 Aligned_cols=265 Identities=10% Similarity=0.071 Sum_probs=121.2 Q ss_pred Ccc------ccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccccc Q lcl|NC_021299. 1 MAN------AFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDL 74 (387) Q Consensus 1 Ma~------~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (387) |.. ..++|+.|..++++.+++...+..++.+-. ..+ +..+.+|+....... .. .+++..+...++ T Consensus 9 ~~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~---~~~--~~~~~~~~~~~~~~a-~~---v~Eg~~~~~~~~ 79 (297) T protein:vir:95 9 ENVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQE---MEG--EQEKTVYVQTDGISA-YW---VNETEKIKTDKP 79 (297) T ss_pred ccccccCCCcceechhHHHHHHHHHHhhchhhhhcceee---cCC--CccEEEEEEcCCcee-EE---eecCcccccccc Confidence 211 127799999999999999999888775431 111 223445433222111 11 233444555555 Q ss_pred ccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHH-hc-------ccccccccCCcchhH Q lcl|NC_021299. 75 TEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLI-TK-------APYEKVSLVDEDEIW 146 (387) Q Consensus 75 ~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~-~~-------~~~~~~~~~~~~~~~ 146 (387) +-..+++...+ .+.-+.++++-+.....++...+.++..++++.++|..++.-- .. ..............| T Consensus 80 ~f~~v~l~~~k-~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~gi~~~~~~~~~~~~~~~t~ 158 (297) T protein:vir:95 80 EVVPVTLKAHK-LGIILVTSREALNYTWKKFFEDMKPQIVEAFYKKIDEAGLLGHDTPFANSVAKAAKDANKVIGGPINY 158 (297) T ss_pred ceeEEEEeeEE-EEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCcccccccccccccceecccccCH Confidence 66666666633 3445566665555566788888888899999999999987311 00 011111112234568 Q ss_pred HHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeee-- Q lcl|NC_021299. 147 NGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYL-- 224 (387) Q Consensus 147 ~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~-- 224 (387) ++++++...|...+.+. -..+++|+.+..|.+.. . ..| ..+..+..+.+.|+.++.+...+...... T Consensus 159 ~~i~~~~~~l~~~~~~~--~~~v~~~~~~~~L~~l~---d--~~G----~~i~~~~~~~l~G~Pv~~~~~~~~~~~~~~~ 227 (297) T protein:vir:95 159 DNILKLQDALYDADVEP--NAFVSKIQNRSALREAR---D--GNK----VSIYDKAANTIDGITTVDLKSARFEKGDLLA 227 (297) T ss_pred HHHHHHHHHhhhccCCc--CEEEEcHHHHHHHHHhh---c--cCC----ceeecCCCCcccceeeEeecCCCCCCceEEE Confidence 99999999888776543 35788999988876421 1 111 22334445667788776554433221111 Q ss_pred eccccccccccccccccCceeeeeeecccccceeeeeeeeeeccceee--eeeeee-eeeccccceeeec--cceeccc- Q lcl|NC_021299. 225 SHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTER--SIVDTW-IGVKAVLDPVTAN--LDDEPRF- 298 (387) Q Consensus 225 ~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~--~~~~~~-~g~~~~~~~~~~~--~~~~~~~- 298 (387) .-...+.+. ...+..+....+.......+. ...+-. .+...+....... ......+ T Consensus 228 gd~s~~~~~------------------~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~ 289 (297) T protein:vir:95 228 GDFDNLIYG------------------VPYNITYKISEEGQISTITNADGTPINLFEQEMIAIRATMDIAVMITKTDAFA 289 (297) T ss_pred EecccEEEE------------------EecCeEEEEeeccccccccccCccchhhhhcCcEEEEEEEEeccEeecccceE Confidence 000000000 000000000000000000000 000000 0000000000000 0000000 Q ss_pred --ccccee Q lcl|NC_021299. 299 --VRGTRI 304 (387) Q Consensus 299 --v~~~~v 304 (387) ....+| T Consensus 290 ~l~~at~~ 297 (297) T protein:vir:95 290 KLTPAERV 297 (297) T ss_pred EEeecCCC Confidence 001111 No 95 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=98.03 E-value=4.1e-06 Score=50.10 Aligned_cols=278 Identities=12% Similarity=0.026 Sum_probs=125.7 Q ss_pred CccccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccc-cccccceE Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVA-SDLTEVTV 79 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 79 (387) -....++|+.|..++++.+++..++..+++.-. . .+...++++|......... ..+++..+.- +.+.-..+ T Consensus 127 ~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~---~---~~~~~~~~~~~~~~~~~~~--~v~Eg~~~~~~~~~~~~~i 198 (415) T protein:vir:94 127 DSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR---V---TNGSGKYPVVRQSEVAALE--KVEELEENPELAVKPFFQL 198 (415) T ss_pred ccccccCcHHHHHHHHHHHHhhhhhhhhcceee---c---cCCceeEEEEeecCCccce--eccccccccccccccceee Confidence 223357899999999999999999988765321 1 1222334333221111111 1122333331 23344556 Q ss_pred EEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcc-c---------ccccccCCcchhHHHH Q lcl|NC_021299. 80 DIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKA-P---------YEKVSLVDEDEIWNGV 149 (387) Q Consensus 80 ~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~-~---------~~~~~~~~~~~~~~~i 149 (387) ++.+.+. +.-+.++++-+.....++...+.++..++++..+|..++...... + .......+....|++| T Consensus 199 ~~~~~k~-~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i 277 (415) T protein:vir:94 199 AYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDI 277 (415) T ss_pred Eeeheee-eeechhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchHHH Confidence 6655333 344566666555556677777888888999999998887532211 1 0112223345668999 Q ss_pred HHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeeecccc Q lcl|NC_021299. 150 VSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTA 229 (387) Q Consensus 150 ~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a 229 (387) .++...+...... +-..|++|..+..|.+...-... .. ......+|..+.+.|+.|+.++.+|....... . T Consensus 278 ~~~~~~~~~~~~~--~~~~vmn~~~~~~l~~lkd~~G~-~l---~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~---~ 348 (415) T protein:vir:94 278 KDAINLNVKPNYE--HNVAIVSQTMFAKLDKMKDKLGN-YL---IQPDVKEKTQQRLLGAKIEILPDEVLGQKGNN---T 348 (415) T ss_pred HHHHHhhhhhccC--CCEEEEcHHHHHHHHHhhccCCC-ee---eccCcCCCCCceecceeeEEecccccCCCCcc---E Confidence 9988888766643 44578999998888652111100 00 01123455567889999988877764432110 0 Q ss_pred ccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeee Q lcl|NC_021299. 230 YAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKAT 309 (387) Q Consensus 230 ~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~ 309 (387) +.+. .. ... .......+..+.+.. + .. ........ ........... ....+++... T Consensus 349 i~~g--d~---~~~----~~~~~~~~~~v~~~~-~-~~-~~~~~r~~---------~r~d~~~~~~~---a~~~~~~~~~ 404 (415) T protein:vir:94 349 LIIG--NL---KDA----IVLFDRSQYQASWTD-Y-MH-FGECLMIA---------VRQDCRILDYK---SAIVIEYDDS 404 (415) T ss_pred EEEE--eh---hcc----EEEEeecceEEEEec-c-cc-CceEEEEE---------EEeccEEeccc---cEEEEEEecc Confidence 0000 00 000 000000111111110 0 00 00000000 00000000000 0111111110 Q ss_pred ecccccccccccc Q lcl|NC_021299. 310 DAEIEGETVKAGE 322 (387) Q Consensus 310 ~~~~~~~~~~~~~ 322 (387) .-.+..++... T Consensus 405 --~~~~~~~~~~~ 415 (415) T protein:vir:94 405 --ERGEGDLGLEA 415 (415) T ss_pred --CCCCCccccCC Confidence 00111222221 No 96 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=98.01 E-value=6.2e-06 Score=49.13 Aligned_cols=278 Identities=11% Similarity=0.023 Sum_probs=125.4 Q ss_pred CccccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccc-cccccceE Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVA-SDLTEVTV 79 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 79 (387) -....++|+.|..++++.+++..++..+++.-. . .+...++++|......... ..+++..+.- +.+.-..+ T Consensus 127 ~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~---~---~~~~~~~~~~~~~~~~~~~--~v~E~~~~~~~~~~~~~~v 198 (415) T protein:vir:79 127 DSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR---V---TNGSGKYPVVRQSEVAALE--KVEELEENPELAVKPFFQL 198 (415) T ss_pred cccccccchHHHHHHHHHHHhhhhhhhheeeee---c---cCCceeEEEEeecCCccce--eeccccccCcccccceeeE Confidence 122348899999999999999998888775321 1 1223344443322111111 1122333332 22344556 Q ss_pred EEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhc-cc---------ccccccCCcchhHHHH Q lcl|NC_021299. 80 DIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITK-AP---------YEKVSLVDEDEIWNGV 149 (387) Q Consensus 80 ~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~-~~---------~~~~~~~~~~~~~~~i 149 (387) ++.+.+. +.-+.++++-+.....++...+.++..++++..+|..++..... .+ ............|++| T Consensus 199 ~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i 277 (415) T protein:vir:79 199 AYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDI 277 (415) T ss_pred Eeeeeee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchhHH Confidence 6666443 33455666654445667777778888899999999988753321 11 1112233455678999 Q ss_pred HHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeeecccc Q lcl|NC_021299. 150 VSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTA 229 (387) Q Consensus 150 ~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a 229 (387) +++...|...... +-..+++|+.+..|.+...-.. .... ......|..+.+.|+.|+.+..+|....... . T Consensus 278 ~~~~~~~~~~~~~--~~~~v~n~~~~~~l~~lkd~~G-~~l~---~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~---~ 348 (415) T protein:vir:79 278 KDAINLNVKPNYE--HNVAIVSQTMFAKLDKMKDKLG-NYLI---QPDVKEKTQQRLLGAKIEILPDEVLGQKGNN---T 348 (415) T ss_pred HHHHHhhhhhccC--CCEEEEcHHHHHHHHHhhccCC-ceee---ccCcCCCCCceecceeeEEecccccCCCCcc---E Confidence 9988888776643 3457899999888864211000 0000 1123445556889999888777664321110 0 Q ss_pred ccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeee Q lcl|NC_021299. 230 YAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKAT 309 (387) Q Consensus 230 ~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~ 309 (387) +.+. .. ... + ......+..+.+.. +. .... .... ............ ....+++.. T Consensus 349 ~~~G--d~---~~~---~-~~~~~~~~~v~~~~-~~-~~~~-~~~~---------~~r~d~~v~~~~---a~~~~~~~~- 403 (415) T protein:vir:79 349 LIIG--NL---KDA---I-VLFDRSQYQASWTD-YM-HFGE-CLMI---------AVRQDCRILDYK---SAIVIEYDD- 403 (415) T ss_pred EEEE--eh---hcc---E-EEEeecceEEEEec-cc-cCce-EEEE---------EEEeccEEeccc---cEEEEEEec- Confidence 0000 00 000 0 00000111111110 00 0000 0000 000000000000 011111111 Q ss_pred ecccccccccccc Q lcl|NC_021299. 310 DAEIEGETVKAGE 322 (387) Q Consensus 310 ~~~~~~~~~~~~~ 322 (387) ..-.+..++... T Consensus 404 -~~~~~~~~~~~~ 415 (415) T protein:vir:79 404 -SERGEGDLGLEA 415 (415) T ss_pred -cCCCCCccccCC Confidence 001111222222 No 97 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=98.01 E-value=6.2e-06 Score=49.13 Aligned_cols=278 Identities=11% Similarity=0.023 Sum_probs=125.4 Q ss_pred CccccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccc-cccccceE Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVA-SDLTEVTV 79 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 79 (387) -....++|+.|..++++.+++..++..+++.-. . .+...++++|......... ..+++..+.- +.+.-..+ T Consensus 127 ~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~---~---~~~~~~~~~~~~~~~~~~~--~v~E~~~~~~~~~~~~~~v 198 (415) T protein:vir:98 127 DSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR---V---TNGSGKYPVVRQSEVAALE--KVEELEENPELAVKPFFQL 198 (415) T ss_pred cccccccchHHHHHHHHHHHhhhhhhhheeeee---c---cCCceeEEEEeecCCccce--eeccccccCcccccceeeE Confidence 122348899999999999999998888775321 1 1223344443322111111 1122333332 22344556 Q ss_pred EEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhc-cc---------ccccccCCcchhHHHH Q lcl|NC_021299. 80 DIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITK-AP---------YEKVSLVDEDEIWNGV 149 (387) Q Consensus 80 ~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~-~~---------~~~~~~~~~~~~~~~i 149 (387) ++.+.+. +.-+.++++-+.....++...+.++..++++..+|..++..... .+ ............|++| T Consensus 199 ~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i 277 (415) T protein:vir:98 199 AYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDI 277 (415) T ss_pred Eeeeeee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchhHH Confidence 6666443 33455666654445667777778888899999999988753321 11 1112233455678999 Q ss_pred HHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeeecccc Q lcl|NC_021299. 150 VSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTA 229 (387) Q Consensus 150 ~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a 229 (387) +++...|...... +-..+++|+.+..|.+...-.. .... ......|..+.+.|+.|+.+..+|....... . T Consensus 278 ~~~~~~~~~~~~~--~~~~v~n~~~~~~l~~lkd~~G-~~l~---~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~---~ 348 (415) T protein:vir:98 278 KDAINLNVKPNYE--HNVAIVSQTMFAKLDKMKDKLG-NYLI---QPDVKEKTQQRLLGAKIEILPDEVLGQKGNN---T 348 (415) T ss_pred HHHHHhhhhhccC--CCEEEEcHHHHHHHHHhhccCC-ceee---ccCcCCCCCceecceeeEEecccccCCCCcc---E Confidence 9988888776643 3457899999888864211000 0000 1123445556889999888777664321110 0 Q ss_pred ccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeee Q lcl|NC_021299. 230 YAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKAT 309 (387) Q Consensus 230 ~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~ 309 (387) +.+. .. ... + ......+..+.+.. +. .... .... ............ ....+++.. T Consensus 349 ~~~G--d~---~~~---~-~~~~~~~~~v~~~~-~~-~~~~-~~~~---------~~r~d~~v~~~~---a~~~~~~~~- 403 (415) T protein:vir:98 349 LIIG--NL---KDA---I-VLFDRSQYQASWTD-YM-HFGE-CLMI---------AVRQDCRILDYK---SAIVIEYDD- 403 (415) T ss_pred EEEE--eh---hcc---E-EEEeecceEEEEec-cc-cCce-EEEE---------EEEeccEEeccc---cEEEEEEec- Confidence 0000 00 000 0 00000111111110 00 0000 0000 000000000000 011111111 Q ss_pred ecccccccccccc Q lcl|NC_021299. 310 DAEIEGETVKAGE 322 (387) Q Consensus 310 ~~~~~~~~~~~~~ 322 (387) ..-.+..++... T Consensus 404 -~~~~~~~~~~~~ 415 (415) T protein:vir:98 404 -SERGEGDLGLEA 415 (415) T ss_pred -cCCCCCccccCC Confidence 001111222222 No 98 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=98.01 E-value=6.2e-06 Score=49.13 Aligned_cols=278 Identities=11% Similarity=0.023 Sum_probs=125.4 Q ss_pred CccccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccc-cccccceE Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVA-SDLTEVTV 79 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 79 (387) -....++|+.|..++++.+++..++..+++.-. . .+...++++|......... ..+++..+.- +.+.-..+ T Consensus 127 ~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~---~---~~~~~~~~~~~~~~~~~~~--~v~E~~~~~~~~~~~~~~v 198 (415) T protein:vir:81 127 DSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR---V---TNGSGKYPVVRQSEVAALE--KVEELEENPELAVKPFFQL 198 (415) T ss_pred cccccccchHHHHHHHHHHHhhhhhhhheeeee---c---cCCceeEEEEeecCCccce--eeccccccCcccccceeeE Confidence 122348899999999999999998888775321 1 1223344443322111111 1122333332 22344556 Q ss_pred EEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhc-cc---------ccccccCCcchhHHHH Q lcl|NC_021299. 80 DIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITK-AP---------YEKVSLVDEDEIWNGV 149 (387) Q Consensus 80 ~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~-~~---------~~~~~~~~~~~~~~~i 149 (387) ++.+.+. +.-+.++++-+.....++...+.++..++++..+|..++..... .+ ............|++| T Consensus 199 ~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i 277 (415) T protein:vir:81 199 AYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDI 277 (415) T ss_pred Eeeeeee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchhHH Confidence 6666443 33455666654445667777778888899999999988753321 11 1112233455678999 Q ss_pred HHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeeecccc Q lcl|NC_021299. 150 VSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTA 229 (387) Q Consensus 150 ~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a 229 (387) +++...|...... +-..+++|+.+..|.+...-.. .... ......|..+.+.|+.|+.+..+|....... . T Consensus 278 ~~~~~~~~~~~~~--~~~~v~n~~~~~~l~~lkd~~G-~~l~---~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~---~ 348 (415) T protein:vir:81 278 KDAINLNVKPNYE--HNVAIVSQTMFAKLDKMKDKLG-NYLI---QPDVKEKTQQRLLGAKIEILPDEVLGQKGNN---T 348 (415) T ss_pred HHHHHhhhhhccC--CCEEEEcHHHHHHHHHhhccCC-ceee---ccCcCCCCCceecceeeEEecccccCCCCcc---E Confidence 9988888776643 3457899999888864211000 0000 1123445556889999888777664321110 0 Q ss_pred ccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeee Q lcl|NC_021299. 230 YAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKAT 309 (387) Q Consensus 230 ~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~ 309 (387) +.+. .. ... + ......+..+.+.. +. .... .... ............ ....+++.. T Consensus 349 ~~~G--d~---~~~---~-~~~~~~~~~v~~~~-~~-~~~~-~~~~---------~~r~d~~v~~~~---a~~~~~~~~- 403 (415) T protein:vir:81 349 LIIG--NL---KDA---I-VLFDRSQYQASWTD-YM-HFGE-CLMI---------AVRQDCRILDYK---SAIVIEYDD- 403 (415) T ss_pred EEEE--eh---hcc---E-EEEeecceEEEEec-cc-cCce-EEEE---------EEEeccEEeccc---cEEEEEEec- Confidence 0000 00 000 0 00000111111110 00 0000 0000 000000000000 011111111 Q ss_pred ecccccccccccc Q lcl|NC_021299. 310 DAEIEGETVKAGE 322 (387) Q Consensus 310 ~~~~~~~~~~~~~ 322 (387) ..-.+..++... T Consensus 404 -~~~~~~~~~~~~ 415 (415) T protein:vir:81 404 -SERGEGDLGLEA 415 (415) T ss_pred -cCCCCCccccCC Confidence 001111222222 No 99 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=97.99 E-value=4e-06 Score=50.15 Aligned_cols=270 Identities=10% Similarity=0.028 Sum_probs=111.2 Q ss_pred Cc------cccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccccc Q lcl|NC_021299. 1 MA------NAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDL 74 (387) Q Consensus 1 Ma------~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (387) ++ ...+.|+.|...+++.+++...+..++..- . -.|.++.+++.......+....-.+++....-.+. T Consensus 118 ~~~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~~---~---~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~ 191 (413) T protein:vir:81 118 STATLTDEFQGGYGTTWNRNIIYRRREKLVVADLMDNL---T---MTNTTIKYLMEKANRVVEGGFKTVAEGGKKPYMRF 191 (413) T ss_pred hhcccccccccccchhhHHHHHHHHhhhhhHHhhccee---e---ccCCceeEEEeccccccccccceecCcccccccCc Confidence 11 123568999999999999999887776421 1 12445566543322111111111122333322222 Q ss_pred -ccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH----------HhcccccccccCCcc Q lcl|NC_021299. 75 -TEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYL----------ITKAPYEKVSLVDED 143 (387) Q Consensus 75 -~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~----------~~~~~~~~~~~~~~~ 143 (387) .-..+++.+.+.. .-+.++++ +..+...+...+.+...++++.++|..++.- +...........+.. T Consensus 192 ~~f~~i~~~~~k~~-~~~~iS~e-ll~ds~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~Gi~~~~~~~~~~~~~~~ 269 (413) T protein:vir:81 192 ADFDIVTESLSKIA-GLTKITDE-MIEDYDFLVSYINARLLEELAIEEERQLLLGDGTGNNLTGLLKRDGIQTLAVSNKD 269 (413) T ss_pred ccceeeEeeeeeEE-EeehhhHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcccccccccccccccccccc Confidence 2344555553332 33566665 4445545444444556889999999987731 101111111122334 Q ss_pred hhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhccc--c--hhhhhhcccccceeeeeeEEEEeecceeeeeeccce Q lcl|NC_021299. 144 EIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDD--R--FIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPH 219 (387) Q Consensus 144 ~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~--~--~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~ 219 (387) ..++.+..+...+........+. ++++|..+..|.+.. . +........ .......+..+.+.|+.++.+..+|. T Consensus 270 ~~~~~i~~~~~~~~~~~~~~~~~-~vmn~~~~~~l~~lkd~~G~~l~~~~~~~-~~~~~~~~~~~~l~G~pv~~s~~~~~ 347 (413) T protein:vir:81 270 ELADSIYKAMTNISLATPFQADA-LVINPLDYQELRLAKDANGQYYGGGVFQG-QYGSGGIMLDPAPWGLRTVQSQVVPV 347 (413) T ss_pred hhHHHHHHHHHHhhhhccCCCcE-EEEcHHHHHHHHHhhccCCceeccccccc-cccccccccCceecceeeEEcCCCCc Confidence 45666766665554332222233 688999988875421 1 110000000 00001111235688999999988886 Q ss_pred eeeeeeccc-cccccccccccccCceeeeeeec--cc--ccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccce Q lcl|NC_021299. 220 GDAYLSHPT-AYAMLTRSPGRPMTNTVATSTVA--TE--NGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDD 294 (387) Q Consensus 220 ~~~~~~~~~-a~~~~~~~~~~~~~~t~~~~~~~--~~--~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 294 (387) +..+..-.. ++.+..+ .+......... .+ +...+.....++ ....+. T Consensus 348 ~~~~~gd~~~~~~~~~~-----~~~~v~~~~~~~~~~~~~~~~~r~~~r~d------~~~~~~----------------- 399 (413) T protein:vir:81 348 GKPVVGAFRSAASVLRK-----GGVRIDSTNTNVDDFENNLITVRAEERVG------LMVTFP----------------- 399 (413) T ss_pred ccEEEEecccEEEEEEe-----cceEEEEeccccchhhcCcEEEEEEEeec------cEEecc----------------- Confidence 543321111 1111000 01111100000 00 000010000000 000000 Q ss_pred eccccccceeeeeeeeccccc Q lcl|NC_021299. 295 EPRFVRGTRIHLKATDAEIEG 315 (387) Q Consensus 295 ~~~~v~~~~v~~~~~~~~~~~ 315 (387) ..+ . .+++. -..++ T Consensus 400 -~a~-~--~l~~~---~~~~p 413 (413) T protein:vir:81 400 -EAI-V--QLDVA---EVVTP 413 (413) T ss_pred -cce-E--EEEec---CCCCC Confidence 000 0 00000 00000 No 100 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=97.97 E-value=8.6e-06 Score=48.36 Aligned_cols=342 Identities=15% Similarity=0.089 Sum_probs=142.1 Q ss_pred Cc-------cccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccc Q lcl|NC_021299. 1 MA-------NAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASD 73 (387) Q Consensus 1 Ma-------~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 (387) |+ -.++.|++ ..++++.+++...+..++.+- . -.+.+++||+........ . .+++..+...+ T Consensus 10 ~~~~~t~~~~g~l~~~~-~~~ii~~l~~~s~i~~l~~~~-----~-~~~~~~~ip~~~~~~~a~-w---v~Eg~~~~~s~ 78 (397) T protein:vir:23 10 IAQTKDTMFTGYLDPVQ-AKDYFAEAEKTSIVQRVAQKI-----P-MGATGIVIPHWTGDVSAQ-W---IGEGDMKPITK 78 (397) T ss_pred HhhccCCCCccccchhH-HHHHHHHHHhccchhhhccee-----e-ccCCceEEEEEcCCcceE-E---ecCCccccccc Confidence 32 22466665 567888888888887776432 1 124567887654332221 1 23455566666 Q ss_pred cccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccc---------cccccCCcch Q lcl|NC_021299. 74 LTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPY---------EKVSLVDEDE 144 (387) Q Consensus 74 ~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~---------~~~~~~~~~~ 144 (387) ++-.++++.+.+ ...-+.++++-+.....++...+.++..++++.++|+.++.-- +.+. .......... T Consensus 79 ~~f~~v~l~~~k-~~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~-gt~~~~~~~~~~~~~~~~~~~~~ 156 (397) T protein:vir:23 79 GNMTKRDVHPAK-IATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAALHGT-NAPSAFQGYLDQSNKTQSISPNA 156 (397) T ss_pred cceeEEEEeeEE-EEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcc-cCCcccccccccccceeeecccc Confidence 777777777733 3455667777666667788888889999999999999887311 1100 0111122334 Q ss_pred hHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhh-hcccccceeeeeeEEEEeecceeeeeeccceeeee Q lcl|NC_021299. 145 IWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYD-SAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAY 223 (387) Q Consensus 145 ~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~-~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~ 223 (387) .++.++++...|.....+ ....+++|..+..|.+...-.... ............+..+++.|+.++.++.+|.+... T Consensus 157 ~~~~~~~~~~~l~~~~~~--~a~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~tl~G~Pv~~s~~~~~g~~~ 234 (397) T protein:vir:23 157 YQGLGVSGLTKLVTDGKK--WTHTLLDDTVEPVLNGSVDANGRPLFVESTYESLTTPFREGRILGRPTILSDHVAEGDVV 234 (397) T ss_pred hhHHHHHHHHhhhhcccC--CCEEEEcHHHHHHHHHhhccCCceeecccccccccccccCceeeeeeEEEeCCCCCCceE Confidence 466777777777766543 356789999988877532110000 00000001111223357889999998888765432 Q ss_pred eecc--ccccccccccccccCceeeeeeecc-c-----ccceee-eeeeeeeccceeeeeeeeeeeeccccceeeeccce Q lcl|NC_021299. 224 LSHP--TAYAMLTRSPGRPMTNTVATSTVAT-E-----NGVQLR-WLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDD 294 (387) Q Consensus 224 ~~~~--~a~~~~~~~~~~~~~~t~~~~~~~~-~-----~~~~~~-~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 294 (387) .+-. ..+.+. ...+.......... . .+..+. +.. + ..........+..... T Consensus 235 ~~~gDfs~~~i~-----~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~--d----~v~~ra~~r~d~~v~~--------- 294 (397) T protein:vir:23 235 GYAGDFSQIIWG-----QVGGLSFDVTDQATLNLGSQESPNFVSLWQH--N----LVAVRVEAEYGLLIND--------- 294 (397) T ss_pred EEEeecceEEEE-----EEeceEEEEeeeeeeeeccccccceeeeeec--c----ceeEEEEeeeccceec--------- Confidence 1110 000000 00011110000000 0 000000 000 0 0000000000000000 Q ss_pred eccccccceeeeeeeeccccccccccccceeEEEeeccCCccccCcceEEEecCceEEE----EcC---Cc--eEEEEec Q lcl|NC_021299. 295 EPRFVRGTRIHLKATDAEIEGETVKAGEKLALALEDSNGDNRAGDPLVTWTSGTTAKAT----IDA---NG--VVTGVAA 365 (387) Q Consensus 295 ~~~~v~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~w~Ssn~~VAt----Vd~---~G--~VTa~~~ 365 (387) ...+ ..+.......+.. ..++.....+++++..... +..+.|.-+...|.+ +|. .| .||+ .. T Consensus 295 ~~a~---~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~----~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~-~~ 365 (397) T protein:vir:23 295 VNAF---VKLTFDPVLTTYA-LDLDGASAGNFTLSLDGKT----SANIAYNASTATVKSAIVAIDDGVSADDVTVTG-SA 365 (397) T ss_pred ccce---EEEeeccccceee-ecccccCcceEEEEecCcc----ccCcccccchhhhHHHhhhcccccccceeeeec-CC Confidence 0000 0000000000000 0011111222222222111 122333333322211 110 01 2233 23 Q ss_pred ceEEEEEEECCEEE---------EEEEEEeC Q lcl|NC_021299. 366 GTSEITAVVDGLTV---------KKTITVTA 387 (387) Q Consensus 366 Gta~Itat~~~~~~---------~~~vtVta 387 (387) |-.+|+.. +.+.+ ...|+|-+ T Consensus 366 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 395 (397) T protein:vir:23 366 GDYTITVP-GTLTADFSGLTDGEGASISVVS 395 (397) T ss_pred ceeEEEec-cccccCccccccCccccceeee Confidence 33444442 11111 12233333 No 101 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=97.97 E-value=5.8e-06 Score=49.29 Aligned_cols=282 Identities=11% Similarity=0.006 Sum_probs=124.8 Q ss_pred Cccc--------------cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccc Q lcl|NC_021299. 1 MANA--------------FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGAD 66 (387) Q Consensus 1 Ma~~--------------~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~ 66 (387) |+-. -+.|+.+.+++++.+++..++.+++..- . -.+..+.+|+........ . .+++ T Consensus 1 m~~~~~~a~~~~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~-----~-~~~~~~~~p~~~~~~~a~-~---v~Eg 70 (330) T protein:vir:77 1 MAGSTVPSTQVALTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARKV-----P-MGPTGISIPHWTGAVSAS-W---TGEA 70 (330) T ss_pred CcccccchhhccccCCCcceechhHHHHHHHHHHhccchhhhccee-----e-ccCCceEEEEEcCCccee-E---ecCC Confidence 3311 1334445678999999999988876532 1 124557777654322211 1 2345 Q ss_pred ccccccccccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH----------Hhcccc-- Q lcl|NC_021299. 67 RNMVASDLTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYL----------ITKAPY-- 134 (387) Q Consensus 67 ~~~~~~~~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~----------~~~~~~-- 134 (387) ..++..+++-.++++...+ .+.-+.++++-+.....++...+.++..++++.++|+.++.- +..... T Consensus 71 ~~~~~~~~~f~~i~~~~~k-~~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~~~~ 149 (330) T protein:vir:77 71 ERKPITKGSFGKQELEPVK-ITTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETTKVV 149 (330) T ss_pred CccccccceeeEEEEeEEE-EEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccccccccccccc Confidence 5566666666667766633 345556776655556678888888889999999999988721 111100 Q ss_pred ------cccccCCcchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhh-cccccceeeeeeEEEEee Q lcl|NC_021299. 135 ------EKVSLVDEDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDS-AGEAGASRLQTARIGRLA 207 (387) Q Consensus 135 ------~~~~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~-~g~~~~~~~~~g~ig~~~ 207 (387) ...........|+++.++...|..++.+ ....+++|..+..|.+...-...-. .............-+.+. T Consensus 150 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~--~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~ 227 (330) T protein:vir:77 150 SLADTNLTTASGPQGNAYLAVNNALSLLVNSGKK--WTGTLLDNVTEPILNTAVDGNGRPLFVESTYTEQVGAIREGRIL 227 (330) T ss_pred eeecccccccccccchhHHHHHHHHHhhhhcCCC--ccEEEEcHHHHHHHHHHhccCCceeecCccccccccccCCceec Confidence 0111122335578888888888776643 3457899999888765211000000 000000011122335788 Q ss_pred cceeeeeeccceeeee-----ee-ccccccccccccccccCceeeeeeecccccceeeeeeeeeeccc---------eee Q lcl|NC_021299. 208 QYDVVTVDTLPHGDAY-----LS-HPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATST---------TER 272 (387) Q Consensus 208 g~~v~~s~~~~~~~~~-----~~-~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~---------~~~ 272 (387) |+.++.++.+|..... .+ ....+.+ ....+..+....+...... ... T Consensus 228 G~PV~~~~~~p~~~~~~~~~~~~gd~s~~~i------------------~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~ 289 (330) T protein:vir:77 228 GRPTYVADNVVNGTVGNRVVGVMGDFSQVIW------------------GQIGGLSFDVTDQATLDFGEEQGGVWVPKLI 289 (330) T ss_pred ceeeEEeccccCCCCCCccEEEEEecceEEE------------------EEecCcEEEEeecceeeeccccccccccccc Confidence 9999999888753211 00 0000000 0001111111111000000 000 Q ss_pred eeeeeeeeeccccceeeeccceeccccccceeeeeeeecccccc Q lcl|NC_021299. 273 SIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKATDAEIEGE 316 (387) Q Consensus 273 ~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~~~~~~~~ 316 (387) ..... ....+............ .-....+......-.+++. T Consensus 290 ~~f~~--~~~~~r~~~r~d~~v~~-~~a~~~i~~~~~~~~~~~~ 330 (330) T protein:vir:77 290 SLWQH--NMVAVRCEAEFAFMVND-KDAFVKLTDQVAGTDPEEE 330 (330) T ss_pred chhhc--CcEEEEEEEEeccEEec-ccceEEEEeccCCcCCCCC Confidence 00000 00000000000000000 0000011110000111110 No 102 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=97.96 E-value=2.9e-06 Score=50.92 Aligned_cols=273 Identities=8% Similarity=-0.015 Sum_probs=125.1 Q ss_pred Cccc--cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccccce Q lcl|NC_021299. 1 MANA--FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEVT 78 (387) Q Consensus 1 Ma~~--~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (387) ||-+ .++|+-+..++++.+++...+..++.+- . -.+..+.||+....... +. .+++..+...+++-.+ T Consensus 1 ma~~gG~lvp~~~~~~ii~~~~~~s~i~~l~~~~-----~-~~~~~~~ip~~~~~~~a-~~---v~E~~~~~~~~~~f~~ 70 (298) T protein:vir:16 1 MVLNKGTLFDPTLVTDLISKVAGKSSIARLSAQK-----P-IPFNGEKVFTFTMDSEI-DV---VAESGKKTHGGVTLAP 70 (298) T ss_pred CcccCcceechhHHHHHHHHHHhhhhhhhhccee-----e-ccCCceEEEEEecCcce-EE---ecCCccccccccceeE Confidence 9965 3677667888999999998888776422 1 12345677664322221 11 2345555555666666 Q ss_pred EEEEEEeeeecceeeccHHHh---hhhhhHHHHHHHHHHHHHHHHHHHHHHHHH---hcccc--------------cccc Q lcl|NC_021299. 79 VDIKLTDVIYNRIDLTDEERE---LDVRSFAVDVLPRQVRAVAEQIEDAVSYLI---TKAPY--------------EKVS 138 (387) Q Consensus 79 ~~~~id~~~~~~~~~~d~~~~---~~~~~~~~~~~~~~~~~la~~vd~~~~~~~---~~~~~--------------~~~~ 138 (387) +++...| ...-+.++++-+. .+..++...+.++..+++++++|..++... .+.+. .... T Consensus 71 v~l~~~k-~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~ 149 (298) T protein:vir:16 71 QTMVPIK-VEYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEA 149 (298) T ss_pred EEEeeee-EEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCccccccccccccccccccccc Confidence 6666533 2344556665442 234567777788889999999999987431 11100 0011 Q ss_pred cCCcchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccc-cceeeeeeEEEEeecceeeeeecc Q lcl|NC_021299. 139 LVDEDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEA-GASRLQTARIGRLAQYDVVTVDTL 217 (387) Q Consensus 139 ~~~~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~-~~~~~~~g~ig~~~g~~v~~s~~~ 217 (387) .......+.++.++...+..++.+. ...+++|..+..|.+...-. |.- -......+..+++.|+.++.++.+ T Consensus 150 ~~~~~~~~~~i~~~~~~~~~~~~~~--~~~vmn~~~~~~l~~lkd~~-----G~~i~~~~~~~~~~~~l~G~PV~~~~~v 222 (298) T protein:vir:16 150 PRGIADPNGAIENAVELLTGVDADV--TGIAINPSFRSALAKQKDLQ-----DNALFPELKWGATPDTINGLPVDVNKTV 222 (298) T ss_pred ccccccHHHHHHHHHHHhhhcCCCc--cEEEEcHHHHHHHHHhhccC-----CCeeecCcccCCCCceecceeeEEeccc Confidence 1122334678888888887766542 35788999998886532111 110 011223455578899999988887 Q ss_pred ceeeeeeeccccccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccce--eeecccee Q lcl|NC_021299. 218 PHGDAYLSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDP--VTANLDDE 295 (387) Q Consensus 218 ~~~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~--~~~~~~~~ 295 (387) |...... ... +..+. ..... ......+..+.+....+.... ..+.... +-...... ........ T Consensus 223 ~~~~~~~---~~~-~~~GD----fs~~~---~~~~~~~~~~~~~~~~~~~~~-~~~~f~~--~~v~~ra~~r~d~~v~~~ 288 (298) T protein:vir:16 223 SDMSLTQ---RDR-AIIGD----FANGF---KWGYAKEVPLEVIQYGDPDNS-GLDLKGY--NQVYIRAELFLGWGILDA 288 (298) T ss_pred ccccCCC---ccE-EEEee----ccceE---EEEEecCceEEEeeccCCcCc-chhhhhc--CcEEEEEEEEEccEeecc Confidence 7431100 000 00000 00000 000011111111111110000 0000000 00000000 00000000 Q ss_pred ccccccceee Q lcl|NC_021299. 296 PRFVRGTRIH 305 (387) Q Consensus 296 ~~~v~~~~v~ 305 (387) ..++.....+ T Consensus 289 ~a~~~l~~at 298 (298) T protein:vir:16 289 TKFARVTEAN 298 (298) T ss_pred cceEEEeecC Confidence 0010000000 No 103 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=97.96 E-value=4.7e-06 Score=49.81 Aligned_cols=257 Identities=9% Similarity=0.006 Sum_probs=124.6 Q ss_pred Cc-c-----ccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccccc Q lcl|NC_021299. 1 MA-N-----AFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDL 74 (387) Q Consensus 1 Ma-~-----~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (387) |. . -.+.|+-+.+.+++.+++...+.+++..- . -.+.++.+|........ . ...+++......++ T Consensus 113 ~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~-----~-~~~~~~~~~~~~~~~~~-a--~~v~Eg~~~~~~~~ 183 (390) T protein:vir:97 113 ASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSG-----R-TDSALIEYVQETGFVNN-A--AIVAEGALKPESSL 183 (390) T ss_pred hhcccccccccccchhhhHHHHHHHhhhhhhHhhccee-----e-ccCCceEEEEEecCCcc-e--eeecCCcccccccc Confidence 21 1 12556667788999999999888776432 1 12455666654321111 1 11234555555666 Q ss_pred ccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHH----------hccc-ccccccCCcc Q lcl|NC_021299. 75 TEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLI----------TKAP-YEKVSLVDED 143 (387) Q Consensus 75 ~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~----------~~~~-~~~~~~~~~~ 143 (387) +-..+++.+.+. +.-+.++++ ...+..++...+.++..++++.++|..++.-- ..+. .......+.. T Consensus 184 ~~~~i~~~~~k~-~~~~~is~e-ll~ds~~l~~~i~~~la~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~~~ 261 (390) T protein:vir:97 184 KFAKKTDTTHVI-AHTMKATRQ-ILSDAPQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGA 261 (390) T ss_pred ceeEEEEeeeeE-EEeehhhHH-HHHhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCccccceeecccccccccccccc Confidence 667777777543 344566665 44555566666677789999999999877320 0000 0111122344 Q ss_pred hhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeee Q lcl|NC_021299. 144 EIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAY 223 (387) Q Consensus 144 ~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~ 223 (387) ..++.+.++...|.....+. -.++++|..+..|.+... ..|.-.-.....+..+.+.|+.++.++.+|.+..+ T Consensus 262 ~~~d~~~~~~~~~~~~~~~~--~~~v~n~~~~~~L~~lkd-----~~G~~l~~~~~~~~~~~l~G~pV~~~~~~~~~~~~ 334 (390) T protein:vir:97 262 TRVDQLRLAMLQASLAEYPA--SGIVINPIDWAAIELAKD-----ANNQYLIGNARGTLTPTLWGLPVVATQAMAPGEFL 334 (390) T ss_pred chHHHHHHHHHhhccccCCC--CEEEEcHHHHHHHHHhhc-----CCCceeecCccCCCCceecceeeEEcCCCCCCcEE Confidence 55788888888887777643 357889999888764321 11110000112334467899999999888865433 Q ss_pred eeccc-cccccccccccccCceeeeeee-ccc--ccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceecccc Q lcl|NC_021299. 224 LSHPT-AYAMLTRSPGRPMTNTVATSTV-ATE--NGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFV 299 (387) Q Consensus 224 ~~~~~-a~~~~~~~~~~~~~~t~~~~~~-~~~--~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v 299 (387) ..-.. ++.+.. ..+........ ..+ +...+.....++. ...+.. . T Consensus 335 ~gd~~~~~~~~~-----~~~~~i~~~~~~~~f~~~~~~~r~~~r~d~------~v~~~~------------------a-- 383 (390) T protein:vir:97 335 VGAFDLAAQIFD-----QWDARVEIGYVNDDFQRNMVTVLAEERLAL------VVYRPE------------------A-- 383 (390) T ss_pred EEeccceEEEEE-----ecceEEEEeecccccccCcEEEEEEEeecc------EEeccc------------------c-- Confidence 21111 111100 01111111100 000 0000111111100 000000 0 Q ss_pred ccceeeee Q lcl|NC_021299. 300 RGTRIHLK 307 (387) Q Consensus 300 ~~~~v~~~ 307 (387) ...+++. T Consensus 384 -~v~~~~a 390 (390) T protein:vir:97 384 -LITGSFA 390 (390) T ss_pred -EEEEEeC Confidence 0001100 No 104 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=97.95 E-value=7.7e-06 Score=48.62 Aligned_cols=277 Identities=11% Similarity=0.021 Sum_probs=124.0 Q ss_pred CccccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccc-cccccceE Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVA-SDLTEVTV 79 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 79 (387) -....++|+.|..++++.+++..++..+++.-. . .+.+..+|++......... ..+++....- +.++-..+ T Consensus 127 ~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~---~---~~~~~~~~~~~~~~~~~~~--~v~Eg~~~~~~~~~~~~~v 198 (415) T protein:vir:47 127 DSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR---V---TNGSGKYPVVRQSEVAALE--KVEELEENPELAVKPFFQL 198 (415) T ss_pred cCCcccccHHHHHHHHHHHHhhhhhhhhcceee---c---cCCceeEEEEEecCCccee--ecccccccccccccceeeE Confidence 123347899999999999999999888764321 1 1223344443211111111 1122333321 23344555 Q ss_pred EEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHH-hcccc---------cccccCCcchhHHHH Q lcl|NC_021299. 80 DIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLI-TKAPY---------EKVSLVDEDEIWNGV 149 (387) Q Consensus 80 ~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~-~~~~~---------~~~~~~~~~~~~~~i 149 (387) ++...+. +.-+.++++-+.....++...+.++..++++.++|..++... .+.+. ...........|+++ T Consensus 199 ~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i 277 (415) T protein:vir:47 199 AYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDI 277 (415) T ss_pred Eeeeeee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccccccceeccccccchHHH Confidence 6655332 344566665544455677778888889999999999887432 11111 111223344668889 Q ss_pred HHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccc-cceeeeeeEEEEeecceeeeeeccceeeeeeeccc Q lcl|NC_021299. 150 VSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEA-GASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPT 228 (387) Q Consensus 150 ~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~-~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~ 228 (387) +++...+...... +-..|++|+.+..|.+... ..|.- ....+.+|..+.+.|+.|+.+..+|........ T Consensus 278 ~~~~~~~~~~~~~--~~~~v~n~~~~~~L~~lkd-----~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~-- 348 (415) T protein:vir:47 278 KDAINLNVKPNYE--HNVAIVSQTMFAKLDKMKD-----KLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNT-- 348 (415) T ss_pred HHHHHhhhhhccC--CCEEEEcHHHHHHHHHhhc-----cCCCeeeccCcCCCCCccccceeeEEeccccccCCCccE-- Confidence 9888887766543 3467899999888754211 01110 011234555678899999888777643221100 Q ss_pred cccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeee Q lcl|NC_021299. 229 AYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKA 308 (387) Q Consensus 229 a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~ 308 (387) +.+. .. ... -......+....+.. +.. . ....... ............+ ..+++.. T Consensus 349 -~~~g--d~---~~~----~~~~~~~~~~v~~~~-~~~-~-~~~~~~~---------~r~d~~v~~~~a~---~~~~~~~ 403 (415) T protein:vir:47 349 -LIIG--NL---KDA----IVLFDRSQYQASWTD-YMH-F-GECLMIA---------VRQDCRILDYKSA---IVIEYDD 403 (415) T ss_pred -EEEE--eh---hcc----EEEEeecceEEEeec-ccc-C-ceEEEEE---------EEeccEEeccccE---EEEEeec Confidence 0000 00 000 000000111111110 000 0 0000000 0000000000000 0111110 Q ss_pred eecccccccccccc Q lcl|NC_021299. 309 TDAEIEGETVKAGE 322 (387) Q Consensus 309 ~~~~~~~~~~~~~~ 322 (387) ..--+..++... T Consensus 404 --~~~~~~~~~~~~ 415 (415) T protein:vir:47 404 --SERGEGDLGLEA 415 (415) T ss_pred --cCCCCCCccCCC Confidence 000011222221 No 105 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=97.95 E-value=7.7e-06 Score=48.62 Aligned_cols=277 Identities=11% Similarity=0.021 Sum_probs=124.0 Q ss_pred CccccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccc-cccccceE Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVA-SDLTEVTV 79 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 79 (387) -....++|+.|..++++.+++..++..+++.-. . .+.+..+|++......... ..+++....- +.++-..+ T Consensus 127 ~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~---~---~~~~~~~~~~~~~~~~~~~--~v~Eg~~~~~~~~~~~~~v 198 (415) T protein:vir:46 127 DSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR---V---TNGSGKYPVVRQSEVAALE--KVEELEENPELAVKPFFQL 198 (415) T ss_pred cCCcccccHHHHHHHHHHHHhhhhhhhhcceee---c---cCCceeEEEEEecCCccee--ecccccccccccccceeeE Confidence 123347899999999999999999888764321 1 1223344443211111111 1122333321 23344555 Q ss_pred EEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHH-hcccc---------cccccCCcchhHHHH Q lcl|NC_021299. 80 DIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLI-TKAPY---------EKVSLVDEDEIWNGV 149 (387) Q Consensus 80 ~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~-~~~~~---------~~~~~~~~~~~~~~i 149 (387) ++...+. +.-+.++++-+.....++...+.++..++++.++|..++... .+.+. ...........|+++ T Consensus 199 ~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i 277 (415) T protein:vir:46 199 AYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDI 277 (415) T ss_pred Eeeeeee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccccccceeccccccchHHH Confidence 6655332 344566665544455677778888889999999999887432 11111 111223344668889 Q ss_pred HHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccc-cceeeeeeEEEEeecceeeeeeccceeeeeeeccc Q lcl|NC_021299. 150 VSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEA-GASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPT 228 (387) Q Consensus 150 ~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~-~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~ 228 (387) +++...+...... +-..|++|+.+..|.+... ..|.- ....+.+|..+.+.|+.|+.+..+|........ T Consensus 278 ~~~~~~~~~~~~~--~~~~v~n~~~~~~L~~lkd-----~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~-- 348 (415) T protein:vir:46 278 KDAINLNVKPNYE--HNVAIVSQTMFAKLDKMKD-----KLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNT-- 348 (415) T ss_pred HHHHHhhhhhccC--CCEEEEcHHHHHHHHHhhc-----cCCCeeeccCcCCCCCccccceeeEEeccccccCCCccE-- Confidence 9888887766543 3467899999888754211 01110 011234555678899999888777643221100 Q ss_pred cccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeee Q lcl|NC_021299. 229 AYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKA 308 (387) Q Consensus 229 a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~ 308 (387) +.+. .. ... -......+....+.. +.. . ....... ............+ ..+++.. T Consensus 349 -~~~g--d~---~~~----~~~~~~~~~~v~~~~-~~~-~-~~~~~~~---------~r~d~~v~~~~a~---~~~~~~~ 403 (415) T protein:vir:46 349 -LIIG--NL---KDA----IVLFDRSQYQASWTD-YMH-F-GECLMIA---------VRQDCRILDYKSA---IVIEYDD 403 (415) T ss_pred -EEEE--eh---hcc----EEEEeecceEEEeec-ccc-C-ceEEEEE---------EEeccEEeccccE---EEEEeec Confidence 0000 00 000 000000111111110 000 0 0000000 0000000000000 0111110 Q ss_pred eecccccccccccc Q lcl|NC_021299. 309 TDAEIEGETVKAGE 322 (387) Q Consensus 309 ~~~~~~~~~~~~~~ 322 (387) ..--+..++... T Consensus 404 --~~~~~~~~~~~~ 415 (415) T protein:vir:46 404 --SERGEGDLGLEA 415 (415) T ss_pred --cCCCCCCccCCC Confidence 000011222221 No 106 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=97.93 E-value=5.6e-06 Score=49.37 Aligned_cols=272 Identities=8% Similarity=-0.031 Sum_probs=126.8 Q ss_pred Ccccc-----ccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccc Q lcl|NC_021299. 1 MANAF-----IKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLT 75 (387) Q Consensus 1 Ma~~~-----~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 75 (387) ||.+. ++|+-++.++++.+++...+..++..- --++..+++|+....... +. .+++......+++ T Consensus 1 ma~~t~~~G~lip~~~~~~ii~~l~~~s~i~~l~~~~------~~~~~~~~~p~~~~~~~a-~w---v~Eg~~~~~s~~~ 70 (300) T protein:vir:95 1 MSEAQLSKGNLFNPELVTKVINKVKGHSSIAKLSPQK------PIPFNGQREFVFDFDSDI-DI---VAENGKKTHGGVS 70 (300) T ss_pred CcccccCCcceechhhHHHHHHHHHhhhhhhhhccee------eccCCceEEEEEecCcce-EE---eeCCccccccccc Confidence 98664 667778999999999998887765321 112445677653322111 11 2344555555666 Q ss_pred cceEEEEEEeeeecceeeccHHHh---hhhhhHHHHHHHHHHHHHHHHHHHHHHHHH---hcccc------------ccc Q lcl|NC_021299. 76 EVTVDIKLTDVIYNRIDLTDEERE---LDVRSFAVDVLPRQVRAVAEQIEDAVSYLI---TKAPY------------EKV 137 (387) Q Consensus 76 ~~~~~~~id~~~~~~~~~~d~~~~---~~~~~~~~~~~~~~~~~la~~vd~~~~~~~---~~~~~------------~~~ 137 (387) -++++++..+ .+.-+.++++-+. -+..++...+.++..++++.++|..++.-. .+.+. ... T Consensus 71 f~~v~l~~~k-~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~ 149 (300) T protein:vir:95 71 LDPVTIVPLK-VEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKKVTQT 149 (300) T ss_pred ceeeEeeeEE-EEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCccccccccccccccee Confidence 6666666633 3455566665432 245677778888899999999999988431 11110 011 Q ss_pred ccCCcchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhccccc-ceeeeeeEEEEeecceeeeeec Q lcl|NC_021299. 138 SLVDEDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAG-ASRLQTARIGRLAQYDVVTVDT 216 (387) Q Consensus 138 ~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~-~~~~~~g~ig~~~g~~v~~s~~ 216 (387) ...+....|+.+.++...+...+.. ....+++|..+..|.+...- .|.-. ......+..+++.|+.++.++. T Consensus 150 ~~~~~~~~~~~i~~~~~~~~~~~~~--~~~~vmn~~~~~~L~~lkd~-----~G~~i~~~~~~~~~~~~l~G~Pv~~s~~ 222 (300) T protein:vir:95 150 VPFKDTNPDESMEDAVGMIDGSERD--ITGAILDPIFTTALSKMKNA-----EGGKLYPELAWGGVPDAINGLAVDKNRT 222 (300) T ss_pred ecccccchHHHHHHHHHHhhhcCCC--ccEEEECHHHHHHHHHhhcc-----CCCeeccCccccCCCceecceeeEEecC Confidence 1223445688888888888766532 33578999998888653211 11100 1122344567899999999888 Q ss_pred cceeeeee---eccccccccccccccccCceeeeeeecccccceee-eeeeeeeccceeeeeeeeeeeeccccceeeecc Q lcl|NC_021299. 217 LPHGDAYL---SHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLR-WLGDYDATSTTERSIVDTWIGVKAVLDPVTANL 292 (387) Q Consensus 217 ~~~~~~~~---~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~-~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~ 292 (387) +|...... .-...+.... ......+............+.... |.. + ..........+ ... T Consensus 223 v~~~~~~~~~~~~~GDf~~~~-~~~~~~~~~~~v~~~~~~d~~~~~~f~~--~----~v~~r~~~r~d---------~~v 286 (300) T protein:vir:95 223 VSYSQTDPKNTAIVGDFETMF-KWGYAKEVPMEIIKYGDPDNSGRDLKGY--N----QIYIRCEAYIG---------WGI 286 (300) T ss_pred CCCCCCCCccEEEEeeccceE-EEEEecccEEEEeeccCCCCcchhhhhc--C----cEEEEEEEeec---------cee Confidence 86532110 0000110000 000000111100000000000000 000 0 00000000000 000 Q ss_pred ceeccccccceeee Q lcl|NC_021299. 293 DDEPRFVRGTRIHL 306 (387) Q Consensus 293 ~~~~~~v~~~~v~~ 306 (387) .....++....+-- T Consensus 287 ~~~~a~~~l~~~~g 300 (300) T protein:vir:95 287 MDAASFARIVKTGG 300 (300) T ss_pred ecccceEEEecCCC Confidence 00000000000000 No 107 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=97.87 E-value=5.3e-06 Score=49.50 Aligned_cols=260 Identities=10% Similarity=0.000 Sum_probs=122.5 Q ss_pred Cc-cccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccccceE Q lcl|NC_021299. 1 MA-NAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEVTV 79 (387) Q Consensus 1 Ma-~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (387) -+ ..++.|+++...+...+++...+..++.+- .-..|+.+.||+-........ .+++..++..++.-..+ T Consensus 116 ~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~-----~~~~~~~~~~p~~~~~~~a~w----v~E~~~~~~~~~~f~~i 186 (390) T protein:vir:62 116 AGNPNVLSRTLYGQLIAQAVERSAIMRGGATTF-----TTSDANPLDFTVITGRSSASI----VGETAEIPESYPATAQR 186 (390) T ss_pred cCCCccccccchHHHHHHHHhhhhhhhhcceee-----ecCCCceeEEEEEcCCcceee----ecccccccccccceeee Confidence 11 124677888877777777777776665431 112345577765432222111 23455555566666677 Q ss_pred EEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH-------Hhccc---ccccccCCcchhHHHH Q lcl|NC_021299. 80 DIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYL-------ITKAP---YEKVSLVDEDEIWNGV 149 (387) Q Consensus 80 ~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~-------~~~~~---~~~~~~~~~~~~~~~i 149 (387) ++...+. +.-+.++++-+.....++...+.++..++++.++|..++.- +.... .....+......|+++ T Consensus 187 ~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~p~Gi~~~~~~~~~~~~~~~~~~~~~~~l 265 (390) T protein:vir:62 187 SMGGFKY-GFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFITGTGQPRGILTDASPATATFLATDTDSKVSDAL 265 (390) T ss_pred EeeeeeE-EeehHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhhhccCCccccccccccccccceecccccccchHHH Confidence 7766433 34455666555555557777777888899999999987731 11110 1111122234568889 Q ss_pred HHHHHHHhhccCCcCCcEEEEchHHHHHHhc--ccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeeecc Q lcl|NC_021299. 150 VSNRRWLNEQKVPKDGRVLLVGSAVEEALLL--DDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHP 227 (387) Q Consensus 150 ~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~--~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~ 227 (387) +++...|+.... .+-..+++|..+..|.+ |.... .. -...+..|..+.+.|+.++.++.+|....+..-. T Consensus 266 ~~~~~~l~~~~~--~~a~~vmn~~~~~~L~~lkd~~g~---~l---~~~~~~~g~~~~l~G~Pv~~~~~~p~~~i~~gd~ 337 (390) T protein:vir:62 266 IDLFHEVPSAYR--ANAKYVVNDLRAAQMRKLKDANGQ---YL---WQSGLTVGAPSLFNGKVVETDDGMPADKILFADL 337 (390) T ss_pred HHHHHhhhhhhh--cCCEEEEchHHHHHHHHhhccCCC---ee---ecCCcCCCccceecccceEEecCCCCccEEEeec Confidence 888888865532 34456889998888744 22110 00 0112334555678999999988887643221111 Q ss_pred ccccccccccccccCceeeeeeeccc--ccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceee Q lcl|NC_021299. 228 TAYAMLTRSPGRPMTNTVATSTVATE--NGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIH 305 (387) Q Consensus 228 ~a~~~~~~~~~~~~~~t~~~~~~~~~--~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~ 305 (387) +.+.+.. ..+..........+ +...+....-.+ ....+ ...+....+ T Consensus 338 s~~~i~~-----~~~~~v~~~~~~~~~~~~~~~~~~~r~d------~~~~~-------------------~~A~~~l~~- 386 (390) T protein:vir:62 338 SKYRVRF-----AGSLRVDRSVDAKFSTDQIVYRFLQRAD------GLLVD-------------------ARGAKVLTV- 386 (390) T ss_pred cceeEEe-----ecceEEEeeccccccCCcEEEEEEEEeC------cEeec-------------------hhheEEEEe- Confidence 1110000 00000000000000 000000000000 00000 000011111 Q ss_pred eeeee Q lcl|NC_021299. 306 LKATD 310 (387) Q Consensus 306 ~~~~~ 310 (387) .... T Consensus 387 -~~~a 390 (390) T protein:vir:62 387 -TPGA 390 (390) T ss_pred -ecCC Confidence 0000 No 108 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=97.85 E-value=9.6e-06 Score=48.08 Aligned_cols=257 Identities=9% Similarity=0.001 Sum_probs=120.9 Q ss_pred Cc-cc-----cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccccc Q lcl|NC_021299. 1 MA-NA-----FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDL 74 (387) Q Consensus 1 Ma-~~-----~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (387) |. .. .+.|+-+...+++.+++...+..++..-. -.+.++++|........ .. ..+++...+..++ T Consensus 113 ~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~------~~~~~~~~~~~~~~~~~-a~--~v~Eg~~~~~~~~ 183 (390) T protein:vir:81 113 ASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGR------TDSALIEYVQETGFVNN-AA--IVAEGALKPESSL 183 (390) T ss_pred hccccccCCcceechhhhHHHHHHHhhhhhhhhhcceee------ccCCceEEEEEecCCcc-ee--eecCCcccccccc Confidence 11 11 13444466779999999998888775321 12455666553322111 11 1234444555566 Q ss_pred ccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHH-h-----c----cc-ccccccCCcc Q lcl|NC_021299. 75 TEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLI-T-----K----AP-YEKVSLVDED 143 (387) Q Consensus 75 ~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~-~-----~----~~-~~~~~~~~~~ 143 (387) +-..+++.+.+. +.-+.++++ ...+..++...+.++..++++.++|..++.-- . + +. .......+.. T Consensus 184 ~~~~i~~~~~k~-~~~~~is~e-ll~d~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~ 261 (390) T protein:vir:81 184 KFAKKTDTTHVI-AHTMKATRQ-ILSDAPQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGA 261 (390) T ss_pred eeeEEEEeeeEE-EEeehhhHH-HHHhHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccceeecccccccccccccc Confidence 666667766443 344556664 55555566666667788999999999877320 0 0 00 0111122344 Q ss_pred hhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeee Q lcl|NC_021299. 144 EIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAY 223 (387) Q Consensus 144 ~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~ 223 (387) ..++++.++...|...+.+. -.++++|..+..|.+...- .|.-.-.....+..+.+.|+.++.++.+|.+..+ T Consensus 262 ~~~~~~~~~~~~~~~~~~~~--~~~v~~~~~~~~l~~lkd~-----~G~~l~~~~~~~~~~~l~G~pv~~~~~~p~~~~~ 334 (390) T protein:vir:81 262 TRVDQLRLAMLQASLAEYNP--SGIVINPIDWAAIELAKDA-----NNQYLIGNARGTLTPTLWGLPVVATQAMAPGEFL 334 (390) T ss_pred hhHHHHHHHHHhhccccCCC--CEEEEcHHHHHHHHHhhcC-----CCceeecCcccccCceecceeeEEcCCCCCCcEE Confidence 56788888888887776543 3578899998877642210 1110000112333457889999999988866433 Q ss_pred eeccc-cccccccccccccCceeeeeee-cc--cccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceecccc Q lcl|NC_021299. 224 LSHPT-AYAMLTRSPGRPMTNTVATSTV-AT--ENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFV 299 (387) Q Consensus 224 ~~~~~-a~~~~~~~~~~~~~~t~~~~~~-~~--~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v 299 (387) ..... ++.+.. ..+........ .. .+...+......+. ...+. .. T Consensus 335 ~gd~~~~~~~~~-----~~~~~v~~~~~~~~~~~~~v~~r~~~r~d~------~v~~~------------------~a-- 383 (390) T protein:vir:81 335 VGAFDLAAQIFD-----QWDARVEIGYVGEDFQRNMITVLAEERLAL------VVYRP------------------EA-- 383 (390) T ss_pred EEehhceEEEEE-----ecceEEEEecccchhhcCcEEEEEEEeecc------EEecc------------------cc-- Confidence 22111 111110 00111111000 00 00000111100000 00000 00 Q ss_pred ccceeeee Q lcl|NC_021299. 300 RGTRIHLK 307 (387) Q Consensus 300 ~~~~v~~~ 307 (387) ...+++. T Consensus 384 -~v~~t~a 390 (390) T protein:vir:81 384 -LISGSFA 390 (390) T ss_pred -eEEEEeC Confidence 0000100 No 109 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=97.84 E-value=1e-05 Score=47.88 Aligned_cols=267 Identities=9% Similarity=0.017 Sum_probs=124.0 Q ss_pred Ccc----ccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccccccc Q lcl|NC_021299. 1 MAN----AFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTE 76 (387) Q Consensus 1 Ma~----~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (387) |.. ..++|+-|...+++.+++.+.+..++.. +.. .+.++.+|......... ....+++...+..+++- T Consensus 109 ~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~-----~~~-~~~~~~~~~~~~~~~~~--~~~v~Eg~~~~~~~~~f 180 (379) T protein:vir:10 109 MTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGA-----VSI-SGGTYTFVRENGAGEGA--IGAQVEGATKGQKDYDI 180 (379) T ss_pred cccCCCCccccchhhhhHHHHhHHhhhhHHhhcee-----eec-cCCceEEEEeecCCCcc--cccccCCccccccccce Confidence 211 1267899999999999998888777642 111 24567776542211111 11123344445556666 Q ss_pred ceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHHHHHHH Q lcl|NC_021299. 77 VTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVSNRRWL 156 (387) Q Consensus 77 ~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l 156 (387) ..+++.+.+... -+.++++ +..+...+...+.++..++++.++|..++............+.+....++++.++...+ T Consensus 181 ~~i~~~~~k~~~-~~~iS~e-ll~D~~~l~~~i~~~la~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~d~i~~~~~~~ 258 (379) T protein:vir:10 181 SMIDVNTDFIAG-FTRYSKK-MANNLPFLTSFIPNALRRDYAKAENAAFNAVLAANATASTEIITNKNKVEMLINEIAKQ 258 (379) T ss_pred eeeEeeeeeEEe-eehhhHH-HHhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccCcccHHHHHHHHHhh Confidence 677776644433 3456654 55565555555556677889999998887654433333333445556678888877777 Q ss_pred hhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeeeccccccccccc Q lcl|NC_021299. 157 NEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTAYAMLTRS 236 (387) Q Consensus 157 ~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a~~~~~~~ 236 (387) ...+.+ ....|++|..+..|.+...-.. ...... ......|....+.|+.++.++.+|.+..+..-...+.+ T Consensus 259 ~~~~~~--~~~~vmn~~~~~~l~~lkd~~G-~~l~~~-~~~~~~~~~~~l~G~pvv~s~~~~ag~~~~gdf~~~~~---- 330 (379) T protein:vir:10 259 ENLDFP--VTAIVLRPTDYYDILVTQKSVG-AGYGLP-GVVTQDNGVLRINGIPLFRATWLAANKYYVGDWTRVTK---- 330 (379) T ss_pred hhccCC--CCEEEEcHHHHHHHHHhhccCC-ceeccC-CccCCCCCcceecceeeEecCCCCCCceEEeecccEEE---- Confidence 666543 3357889998888754211111 001000 00112333347889999988888754322111110000 Q ss_pred cccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeee Q lcl|NC_021299. 237 PGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKAT 309 (387) Q Consensus 237 ~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~ 309 (387) ....+......... .+ +............-.+.... ... ..+.+++... T Consensus 331 -~~~~~~~i~~~~~~----------~~-~f~~~~~~~r~~~R~~~~v~---------~p~---a~v~~~~~~~ 379 (379) T protein:vir:10 331 -VTTEGLSLEFSEVE----------GT-NFVKNNITARIEAQVALAVE---------QPA---ALIFGDFTAV 379 (379) T ss_pred -EEEeceEEEEeecc----------cc-cccCCcEEEEEEEEeccEEe---------cCc---cEEEEEecCC Confidence 00001111000000 00 00000000000000000000 000 0111111111 No 110 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=97.83 E-value=1.5e-05 Score=47.06 Aligned_cols=284 Identities=10% Similarity=-0.018 Sum_probs=124.5 Q ss_pred Cccc----cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccccccc Q lcl|NC_021299. 1 MANA----FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTE 76 (387) Q Consensus 1 Ma~~----~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (387) ||-. +++|+-+.+++++.+++..++..++..-. . .+..+++|+........ . .+++..++..+++- T Consensus 1 mat~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i~---~---~~~~~~~p~~~~~~~a~-w---v~Eg~~~~~~~~~f 70 (311) T protein:vir:81 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEP---Q---EFGEQQYMTLTAPPRGE-V---VGEGAQKSESTATF 70 (311) T ss_pred CceecCCceEcchhHHHHHHHHHHhcchhhhhcceee---c---CCCceEEEEEeCCceeE-E---eecCccccccccee Confidence 7743 58999999999999999999888775321 1 23467787643222211 1 23455555566666 Q ss_pred ceEEEEEEeeeecceeeccHHHh---hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHh---cccc------------cccc Q lcl|NC_021299. 77 VTVDIKLTDVIYNRIDLTDEERE---LDVRSFAVDVLPRQVRAVAEQIEDAVSYLIT---KAPY------------EKVS 138 (387) Q Consensus 77 ~~~~~~id~~~~~~~~~~d~~~~---~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~---~~~~------------~~~~ 138 (387) .++++...|. +.-+.++++-+. .+..++...+.++..++++.++|..++.--. +... .... T Consensus 71 ~~v~l~~~kl-~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~~ 149 (311) T protein:vir:81 71 APVTAIPRKV-QVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVEL 149 (311) T ss_pred eEEEEeeEEE-EEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccccceeeee Confidence 6677666333 344556665332 2344567777788899999999998874311 0000 0001 Q ss_pred c-CCcchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhccccc-ceeeeeeEEEEeecceeeeeec Q lcl|NC_021299. 139 L-VDEDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAG-ASRLQTARIGRLAQYDVVTVDT 216 (387) Q Consensus 139 ~-~~~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~-~~~~~~g~ig~~~g~~v~~s~~ 216 (387) + ......+..+..+...+...+.. ....+++|..+..|.+...- .|.-. ......+..+.+.|+.++.++. T Consensus 150 ~~~~~~~~~~~i~~~~~~~~~~~~~--~~~~vmn~~~~~~l~~lkd~-----~G~~l~~~~~~~~~~~tl~G~Pv~~~~~ 222 (311) T protein:vir:81 150 TTGTSATPDLAVEAAVGLVLGDNLS--PDGVALDNTFSFMLATQRDS-----QGRKLYPELGFGTDVASFAGLNAAVSDT 222 (311) T ss_pred cccccchHHHHHHHHHHHhhhcCCC--ceEEEEcHHHHHHHHhhhcc-----CCCeeecCccccCCCceecceeEEeccc Confidence 1 11223345566665565554432 23478999998888653211 11100 0112334457789999998887 Q ss_pred cceeeeeeecccccc-ccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccc-- Q lcl|NC_021299. 217 LPHGDAYLSHPTAYA-MLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLD-- 293 (387) Q Consensus 217 ~~~~~~~~~~~~a~~-~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~-- 293 (387) +|............. ...... ...-+.+..-......+..+....+.+.......... +...+......... T Consensus 223 i~~~~~~~~~~~~~~~~~~~~~-~~~~gDfs~~~i~~~~~~~~~~~~~~~~~~~~~~~~~----~~v~~r~~~r~d~~v~ 297 (311) T protein:vir:81 223 VRGGPEAVTASTGVYRTTNPNV-KAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQ----NQIAIRAEVVYGIGIM 297 (311) T ss_pred ccccccccccccchhcccCCcc-EEEEEecccEEEEEeccceEEEeccCCCCcchhhhhc----CcEEEEEEEEeccEee Confidence 775432221111000 000000 0000000000000011111111111110000000000 00000000000000 Q ss_pred eeccccccceeeeeeeecc Q lcl|NC_021299. 294 DEPRFVRGTRIHLKATDAE 312 (387) Q Consensus 294 ~~~~~v~~~~v~~~~~~~~ 312 (387) ....++...... .. T Consensus 298 ~~~a~~~l~~a~-----~~ 311 (311) T protein:vir:81 298 STDAFAVVRDAD-----ES 311 (311) T ss_pred cccceEEEEeec-----cC Confidence 000000000000 00 No 111 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=97.83 E-value=5.3e-06 Score=49.51 Aligned_cols=278 Identities=12% Similarity=0.017 Sum_probs=120.6 Q ss_pred Ccc-----ccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccc Q lcl|NC_021299. 1 MAN-----AFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLT 75 (387) Q Consensus 1 Ma~-----~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 75 (387) +.. -+++|+-|..++++.+++..++..++++- . .+..+.+|+........ .......+...+..+++ T Consensus 143 ~~~~t~~GG~lvP~~~~~~Ii~~l~~~~~i~~~~~~~-----~--~~~~~~~p~~~~~~~a~-~~~~~~e~~~~~~~~~~ 214 (434) T protein:vir:62 143 LGLVTGNGSVTIPDFLSKEIITYAQEENFLRRLGTGV-----K--TKENIKYPVLVKKAEAQ-GHKNERTNNEMPETDIE 214 (434) T ss_pred hcccccccceecchhhHHHHHHhhhhhhhhhhhccee-----c--cCCceEEEEEecCCccc-ceecccccccccccccc Confidence 111 14789999999999999999888777532 1 12345666542221111 11111223344444555 Q ss_pred cceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH-Hhc-------ccccccccCCcchhHH Q lcl|NC_021299. 76 EVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYL-ITK-------APYEKVSLVDEDEIWN 147 (387) Q Consensus 76 ~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~-~~~-------~~~~~~~~~~~~~~~~ 147 (387) -..+++...+. +.-+.++++-+.....++...+.++..++++.++|..++.- -.+ +.............|+ T Consensus 215 f~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~g~~~~~~~~~~~~~~~~~d 293 (434) T protein:vir:62 215 FDEIELSPTEF-DALATVTKKLLARTGLPIEQIVMDELKKAYVRKETQYMVNGDEANNINDGALAKKAVEFKTDEKNLYD 293 (434) T ss_pred eeeEEeeheee-EeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccceeecccccccccccchhh Confidence 55566655332 23344555544445567777778888999999999988731 111 0111112223445689 Q ss_pred HHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeeecc Q lcl|NC_021299. 148 GVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHP 227 (387) Q Consensus 148 ~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~ 227 (387) +++++...|.....+ +-..+++|..+..|.+...-.. ...-.. ......|....+.|+.|+.++.+|......... T Consensus 294 ~l~~l~~~l~~~~~~--~a~~v~n~~~~~~L~~lkd~~G-~~l~~~-~~~~~~g~~~tl~G~pV~~~~~~~~~~~~~~~~ 369 (434) T protein:vir:62 294 ALVKMKNTPVKEVRK--KARWVLNTAALTKIETMKTDDG-FPLLRP-FNQAEGGIGYTLLGFPVEEEDAIDIPDSPDTPV 369 (434) T ss_pred HHHHHHhhcchhhhc--CCEEEEcHHHHHHHHHhhccCC-CEeecc-CCCccCCCCceecceeeEEecCccCccCCCceE Confidence 999988888765432 3345789999888754211100 000000 001123444568899998888776432211100 Q ss_pred ccccccccccccccCceeeeeeec-ccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeee Q lcl|NC_021299. 228 TAYAMLTRSPGRPMTNTVATSTVA-TENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHL 306 (387) Q Consensus 228 ~a~~~~~~~~~~~~~~t~~~~~~~-~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~ 306 (387) +..+.. ...... ......+....+..............-. .+-..-.+..+.+..+.+ T Consensus 370 ----i~~Gdf--------s~~~i~~~~g~~~i~~~~~~~~~~~~v~~~~~~r~---------Dgk~i~~~~~~~~~~~~~ 428 (434) T protein:vir:62 370 ----FYFGDF--------SKFYIQDVIGSLEVQKLVELFSRTNRVGFRIWNLL---------DAQLIHSPFEVPVYKYVL 428 (434) T ss_pred ----EEEeec--------cceEEEEeeceeEEEeehhhhcccCceEEEEEeee---------cceeecCcccceEEEEEe Confidence 000000 000000 0000000000000000000000000000 000000011111111111 Q ss_pred eeeecccccc Q lcl|NC_021299. 307 KATDAEIEGE 316 (387) Q Consensus 307 ~~~~~~~~~~ 316 (387) ... +.. T Consensus 429 ~~~----~~~ 434 (434) T protein:vir:62 429 KAP----TGA 434 (434) T ss_pred ccC----CCC Confidence 000 000 No 112 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=97.82 E-value=1.7e-05 Score=46.75 Aligned_cols=271 Identities=13% Similarity=0.038 Sum_probs=116.6 Q ss_pred Ccc---ccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeece-ecc-cccccccccccccc Q lcl|NC_021299. 1 MAN---AFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTR-GLR-ATGADRNMVASDLT 75 (387) Q Consensus 1 Ma~---~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~-~~~-~~~~~~~~~~~~~~ 75 (387) +.+ ..++|+.|...+++.+++..++..++..-. . .|....+++-........ .-. ...........+++ T Consensus 165 ~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~---~---~~~~~~~~~~~~~~~a~~v~e~~~~~~~~~~~~~~~~ 238 (458) T protein:vir:10 165 SSVEVSSESYETIFSQRIIRDLQKELVVGALFEELP---M---SSKILTMLVEPDAGKATWVAASTYGTDTTTGEEVKGA 238 (458) T ss_pred ccCccccceehhhHhHHHHHHHHhhhhHHhhcceee---c---CCcceEEEEecCCcceeeccccccccccccccccccc Confidence 221 137899999999999999998877765321 1 233444443221111110 000 00011111112223 Q ss_pred cceEEEEEEeeeecc-eeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH-Hhcccc--------c-------ccc Q lcl|NC_021299. 76 EVTVDIKLTDVIYNR-IDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYL-ITKAPY--------E-------KVS 138 (387) Q Consensus 76 ~~~~~~~id~~~~~~-~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~-~~~~~~--------~-------~~~ 138 (387) -..+++.. ++... +.++++-+.....++...+.++..++++.++|..++.- -.+.+. . ... T Consensus 239 ~~~i~~~~--~k~~~~v~is~ell~ds~~~~~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~ 316 (458) T protein:vir:10 239 LKEIHFST--YKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMTGDGSGKPKGLLTLASEDSAKVVTEAKA 316 (458) T ss_pred ceeeEeee--eeEEeeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceeeecccccccceeecccc Confidence 33444433 44433 45666544445567888888888999999999988731 000000 0 001 Q ss_pred cCCcchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcc-cccceeeeeeEEEEeecceeeeeecc Q lcl|NC_021299. 139 LVDEDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAG-EAGASRLQTARIGRLAQYDVVTVDTL 217 (387) Q Consensus 139 ~~~~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g-~~~~~~~~~g~ig~~~g~~v~~s~~~ 217 (387) .......|++++++...|..... .+-..+++|..+..|.+....... ... .........|..+.+.|+.|+.+..+ T Consensus 317 ~~~~~~~~~~i~~~~~~l~~~~~--~~~~~v~~~~~~~~l~~lkd~~G~-~i~~~~~~~~~~~~~~~~l~G~pv~~~~~~ 393 (458) T protein:vir:10 317 DGSVLVTAKTISKLRRKLGRHGL--KLSKLVLIVSMDAYYDLLEDEEWQ-DVAQVGNDSVKLQGQVGRIYGLPVVVSEYF 393 (458) T ss_pred cccccccHHHHHHHHHhhhhhhc--CCCEEEEcHHHHHHHHhhcccCCc-eeeccccccccccCcCceecceeeEEcccc Confidence 11123458899999888876653 344578999988877542211110 000 00111223445567889999999888 Q ss_pred ceeeeeeeccccccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceecc Q lcl|NC_021299. 218 PHGDAYLSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPR 297 (387) Q Consensus 218 ~~~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 297 (387) |...... .+.+. . .. ..........+.+..+..............-.|.....+ .. T Consensus 394 p~~~~~~----~~~~~--~----f~-----~~~~~~~~~~~~v~~d~~~~~~~~~~~~~~r~~~~v~~~---------~a 449 (458) T protein:vir:10 394 PAKANSA----EFAVI--V----YK-----DNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFA---------NG 449 (458) T ss_pred ccccCCc----ceEEE--E----ec-----ccEEEEEeeceEEEeecccCCCceEEEEEEEecceEecc---------cc Confidence 7532100 00000 0 00 000000001111111110000000000000001000000 00 Q ss_pred ccccceeeeeee Q lcl|NC_021299. 298 FVRGTRIHLKAT 309 (387) Q Consensus 298 ~v~~~~v~~~~~ 309 (387) ++. .++... T Consensus 450 ~v~---~~~aa~ 458 (458) T protein:vir:10 450 VVS---GTYAAS 458 (458) T ss_pred eEE---EeeccC Confidence 000 000000 No 113 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=97.80 E-value=1.7e-05 Score=46.69 Aligned_cols=266 Identities=8% Similarity=-0.003 Sum_probs=117.5 Q ss_pred Ccc------ccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccccc Q lcl|NC_021299. 1 MAN------AFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDL 74 (387) Q Consensus 1 Ma~------~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (387) |.. -+++|+.|..++++.+++...+.++++.- .-..+..+.++........-. -.+++......++ T Consensus 117 ~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~---~v~E~~~~~~~~~ 188 (409) T protein:vir:45 117 QGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQIL-----TTSDGRTMEWATADGTSEVGV---LLGENEEAGEEDT 188 (409) T ss_pred ccCccCcCCceeccHhHHHHHHHHHHhhhhhhhhceee-----ecCCCceEEEEeeccCccccc---ccccccccccccc Confidence 321 13789999999999999998887766432 111233444443221111001 1123344444444 Q ss_pred ccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH-Hhc-----------ccccccccCCc Q lcl|NC_021299. 75 TEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYL-ITK-----------APYEKVSLVDE 142 (387) Q Consensus 75 ~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~-~~~-----------~~~~~~~~~~~ 142 (387) .-..+.+...+....-+.++++-+.....++...+..+..++++.++|..++.- -.+ ........... T Consensus 189 ~f~~~~l~~~k~~~~~i~is~ell~ds~~~l~~~i~~~la~a~~~~~~~a~l~G~G~~~~~~p~Gil~~~~~~~~~~~~~ 268 (409) T protein:vir:45 189 DFGMGSLGALKMTSKIIRVSNELLQDSAIDMEAYLARRIAERIGRGEARYLIQGTGAGTPKQPKGLAASVTGTTQTAAAN 268 (409) T ss_pred ccceeeeeeeeeeeeehhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCccccceeeecccccccccccc Confidence 444444433222222345776655555567777788888999999999987731 111 11111222234 Q ss_pred chhHHHHHHHHHHHhhccCCcCCcE-EEEchHHHHHHhc--ccchhhhhhcccccceeeeeeEEEEeecceeeeeeccce Q lcl|NC_021299. 143 DEIWNGVVSNRRWLNEQKVPKDGRV-LLVGSAVEEALLL--DDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPH 219 (387) Q Consensus 143 ~~~~~~i~~a~~~l~~~~vp~~~r~-~v~~~~~~~~l~~--~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~ 219 (387) ...|++++++...|..... ....+ +++++..+..|.+ |..-. .. .......|....+.|+.|+.++.+|. T Consensus 269 ~~~~d~i~~l~~~l~~~~~-~~a~~~~~~n~~~~~~l~~lkd~~G~---~i---~~~~~~~~~~~~l~G~PV~~~~~~p~ 341 (409) T protein:vir:45 269 AVKWQEILALKHSIDPAYR-RGPKFRLAFNDNTLKLISEMEDGQGR---PL---WLPDIVGVAPASVLNVPYVIDQEIDD 341 (409) T ss_pred ccchHHHHHHHHhhhhhhc-cCCeEEEEECHHHHHHHHHhhcCCCc---ee---eccCcCCCCCceecceeeEEecCcCC Confidence 4568889988888865542 23345 4578888877643 22110 00 01122344446789999999988874 Q ss_pred eee----eeeccccccccccccccccCceeeeeeecccc--cceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccc Q lcl|NC_021299. 220 GDA----YLSHPTAYAMLTRSPGRPMTNTVATSTVATEN--GVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLD 293 (387) Q Consensus 220 ~~~----~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~--~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 293 (387) ... +.+. .+. ........+............ ...+....-++ ....+. T Consensus 342 ~~~~~~~i~~G--d~~--~~~i~~~~~~~~~~~~d~~~~~~~~~~~~~~r~d------~~~~~~---------------- 395 (409) T protein:vir:45 342 IGAGKKFMFCG--DFD--RFIIRRVRYMILKRLVERYAEYDQTGFLAFHRFD------CILEDT---------------- 395 (409) T ss_pred ccCCccEEEEe--ehh--hhheeeccceEEEEeecccccCCcEEEEEEEEec------cEeech---------------- Confidence 211 1110 000 000000000000000000000 00010000000 000000 Q ss_pred eeccccccceeeeeeeeccccc Q lcl|NC_021299. 294 DEPRFVRGTRIHLKATDAEIEG 315 (387) Q Consensus 294 ~~~~~v~~~~v~~~~~~~~~~~ 315 (387) . .+. .+++.. +... T Consensus 396 --~-A~~--~l~~k~---s~~~ 409 (409) T protein:vir:45 396 --S-AIK--ALVGKG---SVGG 409 (409) T ss_pred --h-heE--EEEecc---CCCC Confidence 0 000 000000 0000 No 114 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=97.78 E-value=1.5e-05 Score=46.96 Aligned_cols=274 Identities=12% Similarity=0.024 Sum_probs=121.3 Q ss_pred Cccc------cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccccc Q lcl|NC_021299. 1 MANA------FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDL 74 (387) Q Consensus 1 Ma~~------~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (387) |+.+ -++|+.|+.++++.+++..++..++.+- . -.+++++||+........ -.+++..++..++ T Consensus 14 ~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~-----~-~~~~~~~~p~~~~~~~a~----~v~E~~~~~~~~~ 83 (320) T protein:vir:10 14 IAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKV-----P-MGTTGQKIPHWIGDVSAQ----WIGEGDMKPITKG 83 (320) T ss_pred hhccccccccccccHHHHHHHHHHHHhccchhhhccee-----e-ccCCceEEEEEeCCcceE----EecCCcccccccc Confidence 3332 1567888999999999999888876532 1 125667887654322221 1234555656666 Q ss_pred ccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH-Hhcccc-----------cccccCCc Q lcl|NC_021299. 75 TEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYL-ITKAPY-----------EKVSLVDE 142 (387) Q Consensus 75 ~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~-~~~~~~-----------~~~~~~~~ 142 (387) +-.++++...| ...-+.++++-+.....++...+.++..++++.++|+.++.- -.+.+. ......+. T Consensus 84 ~f~~v~~~~~k-~~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~ 162 (320) T protein:vir:10 84 NMTSQNIAPHK-IATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALNGTDSPFPTYLAQTTKSVSLADPGGATA 162 (320) T ss_pred ceeEEEEeeEE-EEEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCcccccccccccceecccccc Confidence 66666666633 345566776665556778888888889999999999998731 110000 00001111 Q ss_pred ch--hH-HHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhh----hhhcccccceeeeeeEEEEeecceeeeee Q lcl|NC_021299. 143 DE--IW-NGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIR----YDSAGEAGASRLQTARIGRLAQYDVVTVD 215 (387) Q Consensus 143 ~~--~~-~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~----~~~~g~~~~~~~~~g~ig~~~g~~v~~s~ 215 (387) .. .+ ..+.++...+.... ......+++|..+..|.+...-.. ......... ....-+++.|+.++.++ T Consensus 163 ~~~~~~~~~~~~~~~~~~~~~--~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~---~~~~~~~i~g~pv~~~~ 237 (320) T protein:vir:10 163 SDLTAYDAVAVNGLSLLVNAK--KKWTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDEN---SPFRAGRIVSRPTILSD 237 (320) T ss_pred cccccHHHHHHHHHhhhhccc--CCCcEEEEcHHHHHHHHHhhccCCceeeccccccCcc---ccccCceeeeeeeEecC Confidence 11 12 23445555555444 335578899999988865221100 000000000 11112467888999888 Q ss_pred ccceeeeeee--ccccccccccccccccCceeeeeeecccccceeeeeeeeee------ccceeeeeeeeeeeeccccce Q lcl|NC_021299. 216 TLPHGDAYLS--HPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDA------TSTTERSIVDTWIGVKAVLDP 287 (387) Q Consensus 216 ~~~~~~~~~~--~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~------~~~~~~~~~~~~~g~~~~~~~ 287 (387) .+|.+....+ ....+.+..+ .+......... ...+..+... .............+ T Consensus 238 ~~~~~~~~~~~gd~~~~~~~~~-----~~~~i~~~~~~-----~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d------- 300 (320) T protein:vir:10 238 HVADGTTVGYMGDFRNVIWGQV-----GGLSFDVTDQA-----TLNLGTPTEPNFVSLWQHNLVAVRVEAEYA------- 300 (320) T ss_pred CCCCCceEEEEeecceEEEEEe-----cCeEEEEeecc-----eeeeccccccccchhhhcCcEEEEEEEeec------- Confidence 8876543221 1111111000 01000000000 0000000000 00000000000000 Q ss_pred eeeccceeccccccceeeeeee Q lcl|NC_021299. 288 VTANLDDEPRFVRGTRIHLKAT 309 (387) Q Consensus 288 ~~~~~~~~~~~v~~~~v~~~~~ 309 (387) ........++....+.-++. T Consensus 301 --~~v~~~~a~~~l~~~~ap~~ 320 (320) T protein:vir:10 301 --FHNNDKDAFVKLTNVVTPDA 320 (320) T ss_pred --cEEecccceEEEEeccCCCC Confidence 00000000000000000000 No 115 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=97.76 E-value=1.5e-05 Score=47.00 Aligned_cols=261 Identities=10% Similarity=0.010 Sum_probs=119.5 Q ss_pred Cc---cccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccccc Q lcl|NC_021299. 1 MA---NAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEV 77 (387) Q Consensus 1 Ma---~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (387) ++ ..++.|++|...+.+.+++..++..++.. +....+..+.+|.-...... .. .+++..++..++.-. T Consensus 114 t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~-----~~~~~~~~~~~~~~~~~~~a-~~---v~E~~~~~~~~~~f~ 184 (392) T protein:vir:13 114 TKAGNPNVLSRTLYGQLIAQAVERSAIMRGGAST-----FTTSDANPMDFTVITGRATA-GI---VGETAEIPESYPATT 184 (392) T ss_pred cccCCCccccccchHHHHHHHHhhhhhhhhccee-----eecCCCceeEEEEEcCCcce-ee---eccccccccccccee Confidence 11 12466778877766777777666555432 11123455666643322111 11 234455555566666 Q ss_pred eEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHH-H--------Hhcccc---cccccCCcchh Q lcl|NC_021299. 78 TVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSY-L--------ITKAPY---EKVSLVDEDEI 145 (387) Q Consensus 78 ~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~-~--------~~~~~~---~~~~~~~~~~~ 145 (387) .+.+...+ .+.-+.++++-+.....++...+.++..++++..+|..++. . +..... ....+...... T Consensus 185 ~v~~~~~k-~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~Gt~~p~Gil~~~~~~~~~~~~~~~~~~~ 263 (392) T protein:vir:13 185 QRSMGGFK-YGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFLTGTGTGQPRGILTDATGANAAFGEADADSKV 263 (392) T ss_pred eEEeeeee-EEeeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcccCCcccccccccccccccccccccccccc Confidence 66766643 23444566665555555777777788889999999998873 1 111110 11112223455 Q ss_pred HHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccc-cceeeeeeEEEEeecceeeeeeccceeeeee Q lcl|NC_021299. 146 WNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEA-GASRLQTARIGRLAQYDVVTVDTLPHGDAYL 224 (387) Q Consensus 146 ~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~-~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~ 224 (387) |++++++...|.... ..+-..|++|..+..|.+... ..|.- .......|..+.+.|+.++.++.+|.+..+. T Consensus 264 ~d~l~~~~~~l~~~~--~~~a~~v~n~~~~~~l~~lkd-----~~G~~l~~~~~~~g~~~~l~G~Pv~~~~~~~~~~i~~ 336 (392) T protein:vir:13 264 SDALIDLFHEVPSAY--RKNAKFVVNDLRAAQMRKLKD-----ANGQYLWQSALTVGAPDTFNGKVVETDDGMPADKVLF 336 (392) T ss_pred HHHHHHHHHhhhhhh--hcCCEEEEcHHHHHHHHHhhc-----cCCceeecCCcCCCCCceecceeeEEcCCCCCCcEEE Confidence 888888877776543 223346889998887754211 01110 0112334545678999999998888654322 Q ss_pred eccccccccccccccccCceeeeeeeccccc--ceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccc Q lcl|NC_021299. 225 SHPTAYAMLTRSPGRPMTNTVATSTVATENG--VQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGT 302 (387) Q Consensus 225 ~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~--~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~ 302 (387) .....+.+..+ .+..........+.. ..+....-. +....+ . ..+... T Consensus 337 Gdf~~~~i~~~-----~~~~i~~~~~~~~~~~~~~~r~~~r~------d~~~~~------------------~-~A~~~~ 386 (392) T protein:vir:13 337 ADLSKYRVRFA-----GSLRVDRSVDAKFSTDQIVYRFLQRA------DGLLVD------------------A-RGAKVL 386 (392) T ss_pred eeccceeEEee-----cceEEEeeccccccCCcEEEEEEEEe------ccEEec------------------c-cceEEE Confidence 11111111100 000000000000000 000000000 000000 0 000011 Q ss_pred eeeeee Q lcl|NC_021299. 303 RIHLKA 308 (387) Q Consensus 303 ~v~~~~ 308 (387) .++... T Consensus 387 ~~~~aa 392 (392) T protein:vir:13 387 TVTPAA 392 (392) T ss_pred EeeccC Confidence 110000 No 116 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=97.76 E-value=9.2e-06 Score=48.20 Aligned_cols=268 Identities=10% Similarity=0.036 Sum_probs=120.5 Q ss_pred Ccc----ccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccc-ccccc Q lcl|NC_021299. 1 MAN----AFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMV-ASDLT 75 (387) Q Consensus 1 Ma~----~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 75 (387) ++. -.++|+-|..++++.+++..++..++++-. .. +.+..++++....... ...-.+++..+. .+.++ T Consensus 111 ~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~---~~---~~~~~~~~~~~~~~~~-~a~~v~E~~~~~~~~~~~ 183 (397) T protein:vir:48 111 DASGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVEN---VT---TLTGSRVYEKWADITG-LAKLDDEAGSIGTNDDPK 183 (397) T ss_pred ccCCccccccccHHHHHHHHHHHHHHHHHHhhhceee---cc---CCcceEEEEeecCCCc-ceeeeccccccccccccc Confidence 221 247899999999999999999888775431 11 2333333332211110 011122233332 22345 Q ss_pred cceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHHHHHH Q lcl|NC_021299. 76 EVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVSNRRW 155 (387) Q Consensus 76 ~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~ 155 (387) -..+++.+.+. +.-+.++++-+.....++...+.++..++++.++|..++..- ..+........|++++++... T Consensus 184 ~~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~v~~~l~~~~~~~~d~~il~G~-----g~~~~~~~~~~~d~i~~~~~~ 257 (397) T protein:vir:48 184 LYPIRYAIKRY-AGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAI-----ATLPTKPTLTKWDDIIDLQAK 257 (397) T ss_pred eeeEEeeheee-eeehhhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcc-----cccccccccccHHHHHHHHHH Confidence 56666666333 344567766555566778888888889999999999887421 112223445678999999888 Q ss_pred HhhccCCcCCcEEEEchHHHHHHhcccchhhhhhccccc-ceeeeeeEEEEeecceeeeeec--cceee----eeeec-- Q lcl|NC_021299. 156 LNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAG-ASRLQTARIGRLAQYDVVTVDT--LPHGD----AYLSH-- 226 (387) Q Consensus 156 l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~-~~~~~~g~ig~~~g~~v~~s~~--~~~~~----~~~~~-- 226 (387) |..... .+-..+++|..+..|.+...-. |.-. ...+..|..+.+.|+.|+.... ++... .+.+. T Consensus 258 l~~~~~--~~a~~v~n~~~~~~L~~lkd~~-----G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~~gd~ 330 (397) T protein:vir:48 258 VDPAIK--QTSFFLTNTSGFTALKKVKNAF-----GDYLMERDVKSPTGYSIDGFAVKEVADRWLANASSGAMPLYFGDL 330 (397) T ss_pred hhhhhc--CCCEEEECHHHHHHHHHhhcCC-----CceeeccCcCCCCCceeccceeEEecccccCCcCCCceEEEEEec Confidence 877654 3456789999998886532111 1100 1123445556788988876432 21110 00000 Q ss_pred cccccccccccccccCceeeeeeec----ccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccc Q lcl|NC_021299. 227 PTAYAMLTRSPGRPMTNTVATSTVA----TENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGT 302 (387) Q Consensus 227 ~~a~~~~~~~~~~~~~~t~~~~~~~----~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~ 302 (387) ...+.+..+ .+......... ..+...+..... .+....+. .. .. T Consensus 331 ~~~~~~~~~-----~~~~i~~~~~~~~~~~~~~~~~r~~~r------~d~~~~~~------------------~a---~~ 378 (397) T protein:vir:48 331 KQAVTLFDR-----QQMSLLSTNIGGGAFETDTTKIRVIDR------FDVVATDT------------------ES---FV 378 (397) T ss_pred cceEEEEee-----cceEEEEeccchhhhhcCceeEEEEee------eccEEecc------------------cc---eE Confidence 000000000 00000000000 000000000000 00000000 00 00 Q ss_pred eeeeeeeec-ccccccccc Q lcl|NC_021299. 303 RIHLKATDA-EIEGETVKA 320 (387) Q Consensus 303 ~v~~~~~~~-~~~~~~~~~ 320 (387) .+++....- .....++.. T Consensus 379 ~~~~~~~~~~~~~~~~~~~ 397 (397) T protein:vir:48 379 PASFKAIADQKGNLGSTAV 397 (397) T ss_pred EEEecccccCCCCccccCC Confidence 000000000 000000111 No 117 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=97.69 E-value=2.8e-05 Score=45.54 Aligned_cols=288 Identities=12% Similarity=0.039 Sum_probs=122.3 Q ss_pred CccccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccccceEE Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEVTVD 80 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (387) -+-.++.|+.+.+++++.|++..++..+..+-.. -+...++ .++||........ +. .+++...+..+++-..++ T Consensus 344 ~~Gg~~vp~~~~~~ii~~l~~~svv~~l~~~~~~-~~~~~~~-~~~ip~~t~~~~a-~w---v~Eg~~~~~s~~~f~~v~ 417 (645) T protein:vir:93 344 WAGSLSEYQEYAQDFIDYLRPQTIIGRFGQGGIP-ALRQVPF-NIRVHAQVSGGAA-GW---VGEGKTKPLTKFDFESIT 417 (645) T ss_pred ccCCccCchhhHHHHHHhhhhhhhHHhhcccccc-ccccccC-ceeeeeeecCcce-EE---eccCccccccccceeEEE Confidence 1234688999999999999998888766432211 1122122 3455542211111 11 234455555666666666 Q ss_pred EEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcc-----cccc----cccCCcchhHHHHHH Q lcl|NC_021299. 81 IKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKA-----PYEK----VSLVDEDEIWNGVVS 151 (387) Q Consensus 81 ~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~-----~~~~----~~~~~~~~~~~~i~~ 151 (387) +...| .+.-+.++++-+.....++...+.++..++++.++|..++..-... +... .........+.++.. T Consensus 418 l~~~k-la~~~~iS~ell~ds~~~~~~~i~~~l~~aia~~~d~a~l~g~g~~~~~~~p~gi~~~~~~~~~~~~~~~d~~~ 496 (645) T protein:vir:93 418 FSHAK-VSAIAVLTEELIRFSSPAADALVRNALAEAVVARLDTDFVDPKKAAVADVSPASITHDVKGTASSGNPDADAEA 496 (645) T ss_pred EeeEE-EEEeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccCCccccceeccccccccccchHHHHHH Confidence 65522 3334445544444455566666677889999999999887321111 1110 011122344566777 Q ss_pred HHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeeecccccc Q lcl|NC_021299. 152 NRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTAYA 231 (387) Q Consensus 152 a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a~~ 231 (387) +...|..+++...+-..+++|.....|.+...-. |...-.. ....-+.+.|+.++.++.+|....+. ..+.+. T Consensus 497 ~~~~~~~a~~~~~~a~~vmn~~~~~~L~~lkd~~-----G~~~~~~-~~~~~~tL~G~PV~~s~~vp~~~~~g-d~s~~~ 569 (645) T protein:vir:93 497 AFGQFVAANLQPTGAVWLMSSTNALALSMRKNAL-----GQKEYPD-MTLLGGSFQGLPVIVSQYVGDQLVLV-NAPDIY 569 (645) T ss_pred HHHHHHhcCCCccccEEEEcHHHHHHHHhccccC-----CceeecC-CCCCCceeeceeeEEeccCCcceeEe-ccccEE Confidence 7777777776666667789999988886542211 1100000 01112578999999999887542211 111111 Q ss_pred ccccccccccCceeeeeeecccccceeeeeeeeeecc--ceeeeeeeee-eeeccccceeeeccc--eeccccccceeee Q lcl|NC_021299. 232 MLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATS--TTERSIVDTW-IGVKAVLDPVTANLD--DEPRFVRGTRIHL 306 (387) Q Consensus 232 ~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~--~~~~~~~~~~-~g~~~~~~~~~~~~~--~~~~~v~~~~v~~ 306 (387) +... .+..+.... ...+.+........ ......++-. ..-..+......... ....++....++ T Consensus 570 ig~~-----~~v~i~~s~-----~a~~~~~~~~~~~~~~~~~~~~v~lf~~d~vaira~~r~d~~~~~p~a~~~lt~~~- 638 (645) T protein:vir:93 570 LADD-----GGVAVDMSR-----EASLEMQSEPTGDSTTPSPVELVSMFQTGSVAIRAERWINWRRRRTAAVAVITGVN- 638 (645) T ss_pred EEEe-----cceEEEeec-----ceeEEEeecccccccccccccchhHhhcCceEEEEEEEEcceeeCccceEEEeccc- Confidence 1100 000000000 00000000000000 0000000000 000000000000000 000000000000 Q ss_pred eeeeccccccccccccce Q lcl|NC_021299. 307 KATDAEIEGETVKAGEKL 324 (387) Q Consensus 307 ~~~~~~~~~~~~~~~~~~ 324 (387) .+..-+. T Consensus 639 -----------~g~~~~~ 645 (645) T protein:vir:93 639 -----------YGSASGG 645 (645) T ss_pred -----------CCcccCC Confidence 0000000 No 118 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=97.69 E-value=2.8e-05 Score=45.51 Aligned_cols=256 Identities=10% Similarity=0.006 Sum_probs=119.6 Q ss_pred Ccc------ccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccccc Q lcl|NC_021299. 1 MAN------AFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDL 74 (387) Q Consensus 1 Ma~------~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (387) |.. .++.|+ +...+++.+++...+.+++..- . -.+..+++|........-.. .+++....-.++ T Consensus 114 ~~~~~~~~g~~~~~~-~~~~ii~~~~~~~~l~~~~~~~-----~-~~~~~~~~~~~~~~~~~a~~---v~Eg~~~~~~~~ 183 (390) T protein:vir:10 114 STDAAGSAGALTTPN-RLPGFITQPDARLTVRDLIGSG-----R-TDSALIEYVQETGFVNNAAI---VAEGALKPESSL 183 (390) T ss_pred hcccccccccccchh-HHHHHHHHHHhhchhhhhccee-----e-ccCCceEEEEEecCCcceee---ecCCcccccccc Confidence 111 134455 5567888999888887776421 1 12445667654322111111 123444555556 Q ss_pred ccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHH----------hccc-ccccccCCcc Q lcl|NC_021299. 75 TEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLI----------TKAP-YEKVSLVDED 143 (387) Q Consensus 75 ~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~----------~~~~-~~~~~~~~~~ 143 (387) +-..+.+.+.+. +.-+.++++ +..+..++...+.++..++++.++|..++.-- ..+. .......... T Consensus 184 ~~~~i~~~~~k~-~~~~~is~e-ll~d~~~l~~~i~~~l~~~~~~~~~~~il~G~G~~~~p~Gi~~~~~~~~~~~~~~~~ 261 (390) T protein:vir:10 184 KFAKKTDTTHVI-AHTMKATRQ-ILSDAPQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGA 261 (390) T ss_pred ceeEEEEeeEEE-EEeehhhHH-HHHhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCcccccccccccccccccccccc Confidence 666677766443 334556654 55555666666667778899999999887320 0000 1111222344 Q ss_pred hhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeee Q lcl|NC_021299. 144 EIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAY 223 (387) Q Consensus 144 ~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~ 223 (387) ..++.+.++...|.....+ .-.++++|..+..|.+...-.. ..... . ...+..+.+.|+.++.++.+|.+..+ T Consensus 262 ~~~~~~~~~~~~l~~~~~~--~~~~v~n~~~~~~L~~lkd~~g-~~l~~---~-~~~~~~~~l~G~pv~~~~~~p~~~~~ 334 (390) T protein:vir:10 262 TRVDQLRLAMLQASLAEYP--ASGIVINPIDWAAIELAKDANN-QYLIG---N-ARGTLTPTLWGLPVVATQAMAPGEFL 334 (390) T ss_pred chHHHHHHHHHhhccccCC--CCEEEEcHHHHHHHHHhhcCCC-ceeec---C-CcCcCCceecceeeEEcCCCCCCcEE Confidence 5678888888888877654 3457899999888764321110 00000 0 11223356889999999988865433 Q ss_pred eeccc-cccccccccccccCceeeeeeec-cc--ccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceecccc Q lcl|NC_021299. 224 LSHPT-AYAMLTRSPGRPMTNTVATSTVA-TE--NGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFV 299 (387) Q Consensus 224 ~~~~~-a~~~~~~~~~~~~~~t~~~~~~~-~~--~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v 299 (387) ..-.. ++.+.. ..+......... .. +...+......+. ...+. .. T Consensus 335 ~gdf~~~~~~~~-----~~~~~i~~~~~~~~~~~~~~~~r~~~r~d~------~v~~~------------------~a-- 383 (390) T protein:vir:10 335 VGAFDLAAQIFD-----QWDARVEIGYVNDDFQRNMVTVLAEERLAL------VVYRP------------------EA-- 383 (390) T ss_pred EEeccceEEEEE-----ecceEEEEeecccccccCcEEEEEEEeecc------EEecc------------------cc-- Confidence 21111 111100 001111100000 00 0000000000000 00000 00 Q ss_pred ccceeeee Q lcl|NC_021299. 300 RGTRIHLK 307 (387) Q Consensus 300 ~~~~v~~~ 307 (387) ...+++. T Consensus 384 -~~~~~~a 390 (390) T protein:vir:10 384 -LISGSFA 390 (390) T ss_pred -EEEEEeC Confidence 0000000 No 119 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=97.66 E-value=3.1e-05 Score=45.28 Aligned_cols=269 Identities=13% Similarity=0.035 Sum_probs=117.8 Q ss_pred Cc------cccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccc-ccc Q lcl|NC_021299. 1 MA------NAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMV-ASD 73 (387) Q Consensus 1 Ma------~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~-~~~ 73 (387) |. -.+++|+-|..++++.+++..++..+++.- .-.+.+.++|++......-.. .+++.... .++ T Consensus 111 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~------~~~~~~~~~~~~~~~~~~~~~---~~E~~~~~~~~~ 181 (394) T protein:vir:10 111 AGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKT------PVTTPKGTYPILKRATDRFSS---VAELAENPALAE 181 (394) T ss_pred hcccccccCceeccHHHHHHHHHHHHhhhhhhhhceee------eccCCceEEEEEecCCCcccc---cccccccccccc Confidence 11 225789999999999999999998877532 112445666655432211111 12222222 234 Q ss_pred cccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHHHH Q lcl|NC_021299. 74 LTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVSNR 153 (387) Q Consensus 74 ~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~ 153 (387) +.-..+++.+.+.. .-+.++++-+.....++...+.+...++++..+|..++...... ..........++.+.++. T Consensus 182 ~~~~~v~l~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~---~~~~~~~~~~~d~l~~~~ 257 (394) T protein:vir:10 182 PEFEQVDWSVSTYR-GAIPLSEEAIADSAVDLTSLVGQSINEKSVNTYNAMIAPVLQSF---TAKATTTDTLVDSLKHIL 257 (394) T ss_pred ccceeEEeeeeeeE-eeehhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhccccc---ccccccccccHHHHHHHH Confidence 55566676664433 33556666555555678777888888999999999887543221 222233445567777654 Q ss_pred H-HHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeee-----eeecc Q lcl|NC_021299. 154 R-WLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDA-----YLSHP 227 (387) Q Consensus 154 ~-~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~-----~~~~~ 227 (387) . .++.. .+-..|++|..+..|.+...-...--...........+.-+.+.|+.|+.......... +.+.. T Consensus 258 ~~~~~~~----~~a~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~L~G~PV~~~~~~~~~~~~~~~~i~~gd 333 (394) T protein:vir:10 258 NVDLDPA----YSRALVVTQSLFNTLDTLKDKNGRYLLHDASDSITDGTAKGTVLGVPVYVVGDALLGSAAGDQKAFVGD 333 (394) T ss_pred Hhhhhhh----ccCEEEecHHHHHHHHHhhccCCCeeeeccccccccCCcccccccceeEEecccccCCCCCceEEEEee Confidence 3 33322 23468899999888875321110000000000111123335688888876543211110 11000 Q ss_pred -c-cccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceee Q lcl|NC_021299. 228 -T-AYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIH 305 (387) Q Consensus 228 -~-a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~ 305 (387) + .+.+. ...+....+..+ ...... .... ............ ...++ T Consensus 334 ~s~~~~~~------------------~~~~~~v~~~~~--~~~~~~-~~~~---------~r~d~~~~~~~a---i~~~~ 380 (394) T protein:vir:10 334 LKRGVLFA------------------DRQQVTLAWEDS--KIYGRY-LGAA---------FRFGVKQADSNA---GYFVT 380 (394) T ss_pred ccccEEEE------------------eecceEEEEecc--ccccee-EEEE---------EEeccEEecccc---EEEEE Confidence 0 00000 000111111100 000000 0000 000000000000 00111 Q ss_pred eeeeecccccccccccc Q lcl|NC_021299. 306 LKATDAEIEGETVKAGE 322 (387) Q Consensus 306 ~~~~~~~~~~~~~~~~~ 322 (387) +.. ...+.+-+.|. T Consensus 381 ~~~---~~~~~~~~~~~ 394 (394) T protein:vir:10 381 NTD---AASGSTSGTGK 394 (394) T ss_pred eec---ccCCCCCCCCC Confidence 111 11111122222 No 120 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=97.55 E-value=4.7e-05 Score=44.33 Aligned_cols=266 Identities=13% Similarity=0.019 Sum_probs=118.6 Q ss_pred Cccc------cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccccc Q lcl|NC_021299. 1 MANA------FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDL 74 (387) Q Consensus 1 Ma~~------~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (387) |++. -+.|+.+..++++.+++..++.+++.+- . -.+.+++||+....... .-.+++..+...++ T Consensus 14 ~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~-----~-~~~~~~~ip~~~~~~~a----~~v~Eg~~~~~~~~ 83 (318) T protein:vir:24 14 IAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKV-----P-MGTTGQKIPHWVGDVSA----QWIGEGDMKPITKG 83 (318) T ss_pred hhcccCcccceeechhHHHHHHHHHHhhchhhhhccee-----e-ccCCceEEEEEeCCcce----EEecCCcccccccc Confidence 4332 2578889999999999999988887532 1 12456777654322111 11234555555666 Q ss_pred ccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHH-hcccc-------c--c-cccCCcc Q lcl|NC_021299. 75 TEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLI-TKAPY-------E--K-VSLVDED 143 (387) Q Consensus 75 ~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~-~~~~~-------~--~-~~~~~~~ 143 (387) +-.+++++..+ ...-+.++++-+.....++...+.++..++++.++|..++.-- .+.+. . . ....... T Consensus 84 ~f~~i~~~~~k-~~~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~ 162 (318) T protein:vir:24 84 NMTSQTIAPHK-IATIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAMHGTDSPFPTYIGQTTKAISIADTTGATT 162 (318) T ss_pred ceeEEEEeeEE-EEEeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhhcccCCCCCcccccccccccccccccccc Confidence 66666666633 3345566666555566778888888889999999999887321 10000 0 0 0011111 Q ss_pred hhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhh----hhhcccccceeeeeeEEEEeecceeeeeeccce Q lcl|NC_021299. 144 EIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIR----YDSAGEAGASRLQTARIGRLAQYDVVTVDTLPH 219 (387) Q Consensus 144 ~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~----~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~ 219 (387) ...+.++++...+.... ......+++|..+..|.+...-.. ....... .......+.+.|+.++.++.++. T Consensus 163 ~~~~~~~~~~~~~~~~~--~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~---~~~~~~~~~i~g~pv~~~~~~~~ 237 (318) T protein:vir:24 163 VYDQVAVNGLSLLVNDG--KKWTHTLLDDITEPILNGAKDQNGRPLFIESTYGE---AASPFRSGRIVARPTILSDHVVE 237 (318) T ss_pred hHHHHHHHHHHhhcccc--CCCCEEEEcHHHHHHHHHhhccCCceeecCccccC---ccccccCceEEEEeeEEeCCCCC Confidence 22234455555554433 344567999999888864211000 0000000 01111124677888888777765 Q ss_pred eeeeee--ccccccccccccccccCceeeeeeec---------------c-cccceeeeeeeeeeccceeeeeeeeeeee Q lcl|NC_021299. 220 GDAYLS--HPTAYAMLTRSPGRPMTNTVATSTVA---------------T-ENGVQLRWLGDYDATSTTERSIVDTWIGV 281 (387) Q Consensus 220 ~~~~~~--~~~a~~~~~~~~~~~~~~t~~~~~~~---------------~-~~~~~~~~~~~~d~~~~~~~~~~~~~~g~ 281 (387) +....+ ....+.+.. ..+......... . .+...+......+..... T Consensus 238 ~~~~~~~gdfs~~~~~~-----~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~----------- 301 (318) T protein:vir:24 238 GTTVGFMGDFSQLIWGQ-----IGGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCND----------- 301 (318) T ss_pred CccEEEEeecceEEEEE-----ecCeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEEec----------- Confidence 543211 111111110 000000000000 0 000001111111100000 Q ss_pred ccccceeeeccceeccccccceeeeeeeeccccccccccccc Q lcl|NC_021299. 282 KAVLDPVTANLDDEPRFVRGTRIHLKATDAEIEGETVKAGEK 323 (387) Q Consensus 282 ~~~~~~~~~~~~~~~~~v~~~~v~~~~~~~~~~~~~~~~~~~ 323 (387) ...++. + .. .+-+-+++ T Consensus 302 -------------~~a~~~---i--~~-------~~a~~~~~ 318 (318) T protein:vir:24 302 -------------AEAFVA---L--TN-------VVSGGGEG 318 (318) T ss_pred -------------ccceEE---E--Ee-------eccCCCCC Confidence 000000 0 00 00000000 No 121 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=97.54 E-value=4.8e-05 Score=44.28 Aligned_cols=284 Identities=11% Similarity=-0.034 Sum_probs=119.8 Q ss_pred Cccc-----cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccc Q lcl|NC_021299. 1 MANA-----FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLT 75 (387) Q Consensus 1 Ma~~-----~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 75 (387) ||.. .++|+.++.++++.+++..++..++.+-. . .+..++||+........ . .+++..++..+++ T Consensus 1 Mat~tt~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~----~--~~~~~~~p~~~~~~~a~-w---v~Eg~~~~~~~~~ 70 (311) T protein:vir:99 1 MATFGTGNLKNLPRNIADGMVKDVVQGSTVAVLSARKP----Q--RFGNEDIITFNGRPKAE-F---VGEGQQKSSTTGE 70 (311) T ss_pred CceecCCCceeccHHHHHHHHHHHHhhchhhhhcceee----c--cCCceEEEEEeCCceeE-E---eecCcccccccce Confidence 8853 37799999999999999999887764321 1 12456776643222111 1 2344555555666 Q ss_pred cceEEEEEEeeeecceeeccHHHh---hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhc-cc--------------cccc Q lcl|NC_021299. 76 EVTVDIKLTDVIYNRIDLTDEERE---LDVRSFAVDVLPRQVRAVAEQIEDAVSYLITK-AP--------------YEKV 137 (387) Q Consensus 76 ~~~~~~~id~~~~~~~~~~d~~~~---~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~-~~--------------~~~~ 137 (387) -..+++...| .+.-+.++++-+. .+..++...+.++..++++.++|+.++..-.. .+ .... T Consensus 71 f~~v~l~~~k-~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~~~ 149 (311) T protein:vir:99 71 FDFVTSTPKK-AQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASKRVE 149 (311) T ss_pred eeEEEEeeEE-EEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccceee Confidence 6666666532 3344556655432 34566777788888999999999988843110 00 0000 Q ss_pred c-cCCcchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhccccc-ceeeeeeEEEEeecceeeeee Q lcl|NC_021299. 138 S-LVDEDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAG-ASRLQTARIGRLAQYDVVTVD 215 (387) Q Consensus 138 ~-~~~~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~-~~~~~~g~ig~~~g~~v~~s~ 215 (387) . .......+.++..+...+...+.....-..+++|..+..|.+...- .|.-. ......+..+.+.|+.++.++ T Consensus 150 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~-----~G~~l~~~~~~~~~~~~l~G~Pv~~s~ 224 (311) T protein:vir:99 150 LTADTIANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTARYT-----DGRKKFPELGLGIGVSSFEGIDASVSD 224 (311) T ss_pred ccccccchhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhhcc-----CCCeeecCcccCCCCceecceeeEeec Confidence 0 1111223445555555554443322222378999998888653211 01100 011123334678899999888 Q ss_pred ccceeeeeeeccccccccccccccccCceeee-eeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccce Q lcl|NC_021299. 216 TLPHGDAYLSHPTAYAMLTRSPGRPMTNTVAT-STVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDD 294 (387) Q Consensus 216 ~~~~~~~~~~~~~a~~~~~~~~~~~~~~t~~~-~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 294 (387) .+|.................... .-+.+.. -......+..+......+..........+ ...+.......... T Consensus 225 ~i~~~~~~~~~~~~~~~~~~~~~--~~Gdf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d----~~~~r~~~r~d~~v 298 (311) T protein:vir:99 225 TVNGGDEADPDDEDLDAARAVRG--IVGDFANGIHWGVQRDIPVELIKYGDPDGQGDLKRHN----QIALRLEIVYGWYV 298 (311) T ss_pred ccccccccccccchhhccCcceE--EEeeccccEEEEEecCceEEEeecCCCCcchhhhhcC----cEEEEEEEeeccee Confidence 77644322111111101000000 0000000 00000000000000000000000000000 00000000000000 Q ss_pred e-ccccccceeeeeeeec Q lcl|NC_021299. 295 E-PRFVRGTRIHLKATDA 311 (387) Q Consensus 295 ~-~~~v~~~~v~~~~~~~ 311 (387) . ..++. +. ...- T Consensus 299 ~~~~~v~---~~--~~~A 311 (311) T protein:vir:99 299 FTDRFVV---IE--NAVA 311 (311) T ss_pred cChhHee---ee--cccC Confidence 0 00000 00 0000 No 122 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=97.54 E-value=3.5e-05 Score=45.03 Aligned_cols=257 Identities=12% Similarity=0.123 Sum_probs=118.9 Q ss_pred Ccc------ccccH-HHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccc Q lcl|NC_021299. 1 MAN------AFIKP-PVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASD 73 (387) Q Consensus 1 Ma~------~~~~p-e~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 (387) |.. ..++| +++.+++++.|++.+++..+-.+- +....| .++||.-...... +. .+++..+...+ T Consensus 357 ~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l~~~~----~~~~~g-~~~ip~~~~~~~a-~w---v~E~~~~~~s~ 427 (632) T protein:vir:96 357 LEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARM----LPGLVG-DVDIPKKTSGANF-YW---IGEDEDVQDSD 427 (632) T ss_pred hhcccccccccccccccchHHHHHHHhhcchhhhhcceE----eecCCc-ceEEEEEeCCcee-Ee---ecCCccccccc Confidence 111 12444 667889999999888776652221 222222 4666653222111 11 13444555556 Q ss_pred cccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHh--ccccc--------ccccCCcc Q lcl|NC_021299. 74 LTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLIT--KAPYE--------KVSLVDED 143 (387) Q Consensus 74 ~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~--~~~~~--------~~~~~~~~ 143 (387) ++-..+++...+ .+.-+.++.+-+..+..++...+......+++.++|..++.--. ..+.. ........ T Consensus 428 ~~f~~i~l~~~k-~~~~v~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~ 506 (632) T protein:vir:96 428 FDFTTLSFSPKT-IAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGG 506 (632) T ss_pred cceeeEEeeeeE-EEEehhhHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccceeeecccccceeccccc Confidence 666666666633 23344555554445556666666777889999999998874211 11110 01112233 Q ss_pred hhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeee Q lcl|NC_021299. 144 EIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAY 223 (387) Q Consensus 144 ~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~ 223 (387) ..|..++++...+...++...+-..+++|.....+.+.. +. +.. +...... +.+.|+.++.++.+|.+..+ T Consensus 507 ~~~~~i~~~~~~i~~~~~~~~~~~~~~~~~~~~~l~~~~-l~--d~~---G~~i~~~---~~l~G~pv~~s~~ip~~~~~ 577 (632) T protein:vir:96 507 VDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQ-VF--DNT---GERIWQN---NEVNGYRAEASNQIPADTWI 577 (632) T ss_pred CCHHHHHHHHHHHhhcccccCccEEEEchhHHHHHHHHh-cc--CCC---CceeecC---CeecccceEeccccccCcEE Confidence 568889999888888776555556678887766654321 11 111 1222222 46789999999888865433 Q ss_pred eeccccccccccccccccCceeeeeeec--ccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceecccccc Q lcl|NC_021299. 224 LSHPTAYAMLTRSPGRPMTNTVATSTVA--TENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRG 301 (387) Q Consensus 224 ~~~~~a~~~~~~~~~~~~~~t~~~~~~~--~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~ 301 (387) ......+.+.. ..+......... ..+...+.....++..... ...++ . T Consensus 578 ~gd~s~~~i~~-----~~~~~i~~~~~~~~~~~~v~~~~~~~~d~~v~~------------------------~~af~-~ 627 (632) T protein:vir:96 578 FGDWSQIVIAM-----WGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRR------------------------KEAFC-I 627 (632) T ss_pred EeecceEEEEE-----ecceEEEEccccccccCceEEEEEeecCceeec------------------------hhhhh-h Confidence 21111111100 000000000000 0000011111111100000 00000 0 Q ss_pred ceeeeeeeec Q lcl|NC_021299. 302 TRIHLKATDA 311 (387) Q Consensus 302 ~~v~~~~~~~ 311 (387) .+.. - T Consensus 628 ~k~~-----A 632 (632) T protein:vir:96 628 AKKG-----A 632 (632) T ss_pred eeec-----C Confidence 0000 0 No 123 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=97.52 E-value=5.1e-05 Score=44.12 Aligned_cols=273 Identities=10% Similarity=0.023 Sum_probs=120.9 Q ss_pred Ccc-----ccccHHHHHHHHH-HHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccccc Q lcl|NC_021299. 1 MAN-----AFIKPPVIIASIL-GQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDL 74 (387) Q Consensus 1 Ma~-----~~~~pe~~~~~~~-~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (387) +.. ..++|+-+...++ ..++....+..++.. +.. .| .+.+|+-..... ... .+++..+...++ T Consensus 251 ~~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~~-----~~~-~g-~~~~~~~~~~~~--a~~--v~Eg~~~~~~~~ 319 (543) T protein:vir:81 251 MGLTKADGGYLVPFQLDPTVIITSNGSLNDIRRFARQ-----VVA-TG-DVWHGVSSAAVQ--WSW--DAEFEEVSDDSP 319 (543) T ss_pred cccccccCcccCchhhhhHHHHHHHhhhchhhhhccc-----ccC-Cc-ceEEEEecCCcc--eee--cccCcccccccc Confidence 111 1367776666655 556676667666532 111 23 344544222111 111 234555666667 Q ss_pred ccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH----------Hhc---ccccccccCC Q lcl|NC_021299. 75 TEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYL----------ITK---APYEKVSLVD 141 (387) Q Consensus 75 ~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~----------~~~---~~~~~~~~~~ 141 (387) +-..+++...+. +.-+.++.+ +..+..++...+.+...++++.++|..++.- +.. ......+... T Consensus 320 ~~~~i~~~~~k~-~~~~~is~e-ll~d~~~~~~~i~~~l~~~~~~~~d~ail~G~Gt~~~p~Gi~~~~~~~~~~~~~~~~ 397 (543) T protein:vir:81 320 EFGQPEIPVKKA-QGFVPISIE-ALQDEANVTETVALLFAEGKDELEAVTLTTGTGQGNQPTGIVTALAGTAAEIAPVTA 397 (543) T ss_pred ccceeeeeeeee-EeeehhhHH-HHhccHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccchhhccccccccccccc Confidence 777777776443 344566664 4455678888888888999999999987631 100 0111122233 Q ss_pred cchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceee Q lcl|NC_021299. 142 EDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGD 221 (387) Q Consensus 142 ~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~ 221 (387) ....|++++++...|..... .+-.++++|..+..|.+...-. |.-.-..+..|..+.+.|+.++.+..+|... T Consensus 398 ~~~~~~~~~~~~~~l~~~~~--~~~~~v~n~~~~~~l~~lkd~~-----G~~l~~~~~~g~~~~l~G~pv~~~~~~~~~~ 470 (543) T protein:vir:81 398 ETFALADVYAVYEQLAARHR--RQGAWLANNLIYNKIRQFDTQG-----GAGLWTTIGNGEPSQLLGRPVGEAEAMDANW 470 (543) T ss_pred ccccHHHHHHHHHhhhcccc--CCcEEEEcHHHHHHHHHhhcCC-----CceeccCcCCCCCccccceeeEEeccccccc Confidence 44668889888888765543 2345789999988886532111 1100011234445678999999999887654 Q ss_pred eeeeccccccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeecccee-ccccc Q lcl|NC_021299. 222 AYLSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDE-PRFVR 300 (387) Q Consensus 222 ~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~-~~~v~ 300 (387) ......+...+..+.. .. .......+..+.+..+....... ..+...+......+.... +.. T Consensus 471 ~~~~~~~~~~i~~gd~---~~-----~~i~~~~~~~i~~~~~~~~~~~~-------~~~~~~~~~~~r~d~~v~~~~A-- 533 (543) T protein:vir:81 471 NTSASADNFVLLYGNF---QN-----YVIADRIGMTVEFIPHLFGTNRR-------PNGSRGWFAYYRMGADVVNPNA-- 533 (543) T ss_pred cccccCCcceEEEeec---cc-----eeEEeecccEEEEeccccccchh-------hcCceEEEEEEeeccEeecccc-- Confidence 3222211111111100 00 00000011111111000000000 000000000000000000 000 Q ss_pred cceeeeeeee Q lcl|NC_021299. 301 GTRIHLKATD 310 (387) Q Consensus 301 ~~~v~~~~~~ 310 (387) ...+.+.... T Consensus 534 ~~~l~~~~~a 543 (543) T protein:vir:81 534 FRLLNVETAS 543 (543) T ss_pred eEEEEecccC Confidence 0000000000 No 124 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=97.50 E-value=5.6e-05 Score=43.91 Aligned_cols=270 Identities=10% Similarity=0.059 Sum_probs=121.0 Q ss_pred Cccc------cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccc-cc Q lcl|NC_021299. 1 MANA------FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVA-SD 73 (387) Q Consensus 1 Ma~~------~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~-~~ 73 (387) |+-. .++|+.|+.++++.+++...+..+++.-. ... ...+..|+....... .... .+++..+.- +. T Consensus 5 ~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~---~~~-~~g~~~~~~~~~~~~-~a~~--v~Eg~~~~~~~~ 77 (293) T protein:vir:48 5 KTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVEN---VTT-LTGSRVYEKWTDITG-LANI--DDEAGKIADIDD 77 (293) T ss_pred ecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeee---ccC-CcceEEEEeecCCCc-ceee--ecCCcccccccc Confidence 4422 47899999999999999999877764321 111 112333433221111 1111 123333332 33 Q ss_pred cccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHHHH Q lcl|NC_021299. 74 LTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVSNR 153 (387) Q Consensus 74 ~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~ 153 (387) ++-..+++...+. +.-+.++++-+.....++...+.++..++++.+.|+.++...... .+......|++++++. T Consensus 78 ~~~~~i~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~-----~~~~~~~~~d~i~~~~ 151 (293) T protein:vir:48 78 PKLSLIKYTIKRY-AGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGVVDKL-----PTKPTLTKWDDIIDLE 151 (293) T ss_pred cceeEEEEeeeEE-EEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhccccc-----cccccccCHHHHHHHH Confidence 4556666666443 344667776665566777777888889999999998888543221 2233456789999998 Q ss_pred HHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeecccee--ee----eeec- Q lcl|NC_021299. 154 RWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHG--DA----YLSH- 226 (387) Q Consensus 154 ~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~--~~----~~~~- 226 (387) ..|..... .+-..+++|..+..|.+...-... .. ....+..|..+++.|+.++.+...+.. .. +.+. T Consensus 152 ~~l~~~~~--~~a~~vmn~~~~~~L~~lkd~~g~-~l---~~~~~~~~~~~~l~G~Pv~~~~~~~~~~~~~~~~~~~~gd 225 (293) T protein:vir:48 152 AKVDPAIK--QTSFFLTNTSGFTALKKVKNALGD-YL---MERDVKSPTGYSIAGFAVKEISDRWLPNASSGVMPLYFGD 225 (293) T ss_pred Hhhhhhhc--CCCEEEEcHHHHHHHHHhhccCCc-eE---eecCcCCCCCceecceeeEEecccccCCccCCceEEEEEe Confidence 88876543 345678999998887542211100 00 011234455568889888764432211 10 0000 Q ss_pred -cccccccccccccccCceeeeeee---c-ccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceecccccc Q lcl|NC_021299. 227 -PTAYAMLTRSPGRPMTNTVATSTV---A-TENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRG 301 (387) Q Consensus 227 -~~a~~~~~~~~~~~~~~t~~~~~~---~-~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~ 301 (387) ..++.+..+ .+........ . ..+...+.+...++. ...+. .. . T Consensus 226 ~~~~~~~~~~-----~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~------~~~~~------------------~a---~ 273 (293) T protein:vir:48 226 LKQAVTLFDR-----QQMSLLSTNIGGGAFETDTTKVRVIDRFDV------VATDT------------------EA---F 273 (293) T ss_pred ccceEEEEEe-----cceEEEEecccchhhhcCeEEEEEEEeeCc------EEecc------------------cc---e Confidence 000000000 0000000000 0 000000000000000 00000 00 0 Q ss_pred ceeeeeeeeccccccccccccceeEEE Q lcl|NC_021299. 302 TRIHLKATDAEIEGETVKAGEKLALAL 328 (387) Q Consensus 302 ~~v~~~~~~~~~~~~~~~~~~~~~~~~ 328 (387) ..+++... .-.+.+++.. .+ T Consensus 274 ~~l~~~~~--~~~~~~~~~~-----~~ 293 (293) T protein:vir:48 274 VPASFKAI--ADQKGNIGST-----AV 293 (293) T ss_pred EEEEeecc--ccCCcccccc-----CC Confidence 00000000 0000000000 00 No 125 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=97.47 E-value=5.1e-05 Score=44.10 Aligned_cols=269 Identities=11% Similarity=0.063 Sum_probs=118.2 Q ss_pred Cc------cccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccc-cc Q lcl|NC_021299. 1 MA------NAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVA-SD 73 (387) Q Consensus 1 Ma------~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~-~~ 73 (387) |+ ...++|+-|...+++.+++..++..++.... .....|. +.++....... ... -.+++..+.. .. T Consensus 109 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~---~~~~~~~-~~~~~~~~~~~-~a~--~v~E~~~~~~~~~ 181 (397) T protein:vir:49 109 KTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVEN---VTTLTGS-RVYEKWTDITG-LAN--IDDEAGKIADVDD 181 (397) T ss_pred hhccccccCcccccHhHHHHHHHHHHhhhhHHhhhceee---cccCccc-eEEEeeccCCc-cee--eecCccccccccc Confidence 32 2247899999999999999999888775331 1111222 22322111111 111 1123333332 34 Q ss_pred cccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHHHH Q lcl|NC_021299. 74 LTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVSNR 153 (387) Q Consensus 74 ~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~ 153 (387) +.-..+++.+.+ .+.-+.++++-+.....++...+.++..++++..+|..++..... +........|+++.++. T Consensus 182 ~~~~~i~~~~~k-~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ai~~G~g~-----~~~~~~~~~~d~i~~~~ 255 (397) T protein:vir:49 182 PKLSLIKYTIKR-YAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIAA-----LPTKPTLTKWDDIIDLE 255 (397) T ss_pred cceeeEEeeeee-EEeeehhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhccc-----cccccccccHHHHHHHH Confidence 455666666633 334455666654445567777788888999999999988743211 12223345688999988 Q ss_pred HHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhccccc-ceeeeeeEEEEeecceeeeeec--cceeee----eeec Q lcl|NC_021299. 154 RWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAG-ASRLQTARIGRLAQYDVVTVDT--LPHGDA----YLSH 226 (387) Q Consensus 154 ~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~-~~~~~~g~ig~~~g~~v~~s~~--~~~~~~----~~~~ 226 (387) ..|..+..+ +-..+++|..+..|.+...- .|.-. ...+..|..+.+.|+.|+.... +|.... +.+. T Consensus 256 ~~l~~~~~~--~a~~vmn~~~~~~l~~lkd~-----~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~g 328 (397) T protein:vir:49 256 AKVDPAIKQ--TSFFLTNTSGFTALKKVKNA-----LGDYLMERDVKSPTGYSIDGFAVKEVADRWLANGTGGAMPLYFG 328 (397) T ss_pred HhhhhhhcC--CCEEEEcHHHHHHHHHhhcC-----CCceeeccCcCCCCCceecceeeEEecccccccccCCceeEEEe Confidence 888776543 45678999999888653211 11100 0113345556788888865432 222110 1110 Q ss_pred c-c-cccccccccccccCceeeeeeec--cc--ccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccc Q lcl|NC_021299. 227 P-T-AYAMLTRSPGRPMTNTVATSTVA--TE--NGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVR 300 (387) Q Consensus 227 ~-~-a~~~~~~~~~~~~~~t~~~~~~~--~~--~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~ 300 (387) . . .+.+.. ..+......... .+ +...+..... .+....+. ..+ T Consensus 329 d~~~~~~~~~-----~~~~~i~~~~~~~~~~~~~~~~~r~~~r------~d~~~~~~------------------~a~-- 377 (397) T protein:vir:49 329 DLKQAVTLFD-----RQHMSLLSTNIGGGAFETDTTKVRVIDR------FDVVATDT------------------EAF-- 377 (397) T ss_pred eccceEEEEe-----ecceEEEEeccccchhhcCceeEEEEee------eCcEEecc------------------cce-- Confidence 0 0 000000 000000000000 00 0000000000 00000000 000 Q ss_pred cceeeeeeeeccccccccccccceeEEE Q lcl|NC_021299. 301 GTRIHLKATDAEIEGETVKAGEKLALAL 328 (387) Q Consensus 301 ~~~v~~~~~~~~~~~~~~~~~~~~~~~~ 328 (387) ..+++.. ..-..+......+ T Consensus 378 -~~~~~~~-------~~~~~~~~~~~~~ 397 (397) T protein:vir:49 378 -VPASFKA-------IADQKGNLGSTAV 397 (397) T ss_pred -EEEEeec-------ccCCCCCcccccC Confidence 0000000 0000000000000 No 126 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=97.43 E-value=6.9e-05 Score=43.38 Aligned_cols=282 Identities=13% Similarity=0.021 Sum_probs=117.4 Q ss_pred Cccc---cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccccc Q lcl|NC_021299. 1 MANA---FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEV 77 (387) Q Consensus 1 Ma~~---~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (387) ++.. -+.|+-+++++++.+++...+..++.+-. -.+.+.++|+....... .. .+++..++-.+++-. T Consensus 22 ~~~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~~------~~~~~~~~p~~~~~~~a--~~--v~Eg~~~~~~~~~f~ 91 (326) T protein:vir:42 22 TGDSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKIP------MGTTGQKIPHWTGDVSA--SW--IGEGDMKPITKGNMT 91 (326) T ss_pred ccccCCcceechhhHHHHHHHHHhcchhhhhcceee------ccCCceEEEEEeCCcce--EE--ecCCcccccccccee Confidence 1111 15677788999999999998877765321 12456677653322111 11 234555666666667 Q ss_pred eEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHH-hccc-------------ccccccCCcc Q lcl|NC_021299. 78 TVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLI-TKAP-------------YEKVSLVDED 143 (387) Q Consensus 78 ~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~-~~~~-------------~~~~~~~~~~ 143 (387) .+++...+ ...-+.++++-+.....++...+.++..++++.++|+.++.-- .+.+ ....+..... T Consensus 92 ~i~~~~~k-~~~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~~p~gi~~~~~~~~~~~~~~~~~~~~ 170 (326) T protein:vir:42 92 SQTIAPHK-IATIFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAINGTDSPFPTFLAQTTKEVSLVDPDGTGSNAD 170 (326) T ss_pred EEEEeeEE-EEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccccccccceeeccccccccc Confidence 77777633 4566777776666667788888888888999999999887310 0000 0011111122 Q ss_pred hhHHHHH--HHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcc--cccceeeeeeEEEEeecceeeeeeccce Q lcl|NC_021299. 144 EIWNGVV--SNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAG--EAGASRLQTARIGRLAQYDVVTVDTLPH 219 (387) Q Consensus 144 ~~~~~i~--~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g--~~~~~~~~~g~ig~~~g~~v~~s~~~~~ 219 (387) ..+.++. .+...+. .....+...+++|..+..|.+...-.. .... ............+.+.|+.++.++.+|. T Consensus 171 ~~~~~~~~~~~~~~~~--~~~~~~a~~v~n~~~~~~L~~lkd~~G-~~l~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~ 247 (326) T protein:vir:42 171 LTVYDAVAVNALSLLV--NAGKKWTHTLLDDITEPILNGAKDKSG-RPLFIESTYTEENSPFRLGRIVARPTILSDHVAS 247 (326) T ss_pred chhHHHHHHHHHhhhh--hhccCccEEEEeHHHHHHHHHhhccCC-ceeeccccccCccccccCceeeeeeEEEcCCCCC Confidence 2233322 2222222 222344567899999888864211000 0000 0000011112235688999988888876 Q ss_pred eeeeeeccccccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeecc--ceecc Q lcl|NC_021299. 220 GDAYLSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANL--DDEPR 297 (387) Q Consensus 220 ~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~--~~~~~ 297 (387) +....+-. .+.. .......+.......... .....+.+... ......+ ...+........ ..... T Consensus 248 ~~~~~~~G-d~s~--~~~~~~~~~~v~~~~e~~-----~~~~~~~~~~~-~~~~~~d----~~~~r~~~~~d~~v~~~~a 314 (326) T protein:vir:42 248 GTVVGYQG-DFRQ--LVWGQVGGLSFDVTDQAT-----LNLGTPQAPNF-VSLWQHN----LVAVRVEAEYAFHCNDKDA 314 (326) T ss_pred CceEEEEe-ecce--EEEEEecceEEEEeecce-----eeecccccccc-hhhhhcC----cEEEEEEEEeccEEecccc Confidence 54332110 0000 000000000000000000 00000000000 0000000 000000000000 00000 Q ss_pred ccccceeeeeeeecccccc Q lcl|NC_021299. 298 FVRGTRIHLKATDAEIEGE 316 (387) Q Consensus 298 ~v~~~~v~~~~~~~~~~~~ 316 (387) ++. +...... +. T Consensus 315 ~~~-----l~~~~~~--~~ 326 (326) T protein:vir:42 315 FVK-----LTNVDAT--EA 326 (326) T ss_pred eEE-----Eeecccc--CC Confidence 000 0000000 00 No 127 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=97.42 E-value=7e-05 Score=43.38 Aligned_cols=271 Identities=11% Similarity=0.039 Sum_probs=117.5 Q ss_pred Cc------cccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccc-c Q lcl|NC_021299. 1 MA------NAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVAS-D 73 (387) Q Consensus 1 Ma------~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~-~ 73 (387) |+ ..+++|+.|...+++.+++..++..++..-. +... ..++.++...... ..... .+++..+.-. . T Consensus 109 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~---~~~~-~~~~~~~~~~~~~-~~a~~--v~E~~~~~~~~~ 181 (397) T protein:vir:49 109 KTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVEN---VTTL-TGSRVYEKWADIT-GLAKL--DDEGGQIGQNDD 181 (397) T ss_pred hhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcceee---ccCC-cceEEEEeeccCC-cceee--eccccccccccc Confidence 32 1257899999999999999998877764321 1111 1123333221111 11111 1223333222 2 Q ss_pred cccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHHHH Q lcl|NC_021299. 74 LTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVSNR 153 (387) Q Consensus 74 ~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~ 153 (387) ++-..+++.+.+. +.-+.++.+-+.....++...+.++..++++..+|..++.-- + .+.+......|+++.++. T Consensus 182 ~~~~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ail~G~-g----~~~~~~~~~~~d~i~~~~ 255 (397) T protein:vir:49 182 PKLSLIRYAIKRY-AGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAI-G----TLPNKPTLAKWDDIIDLQ 255 (397) T ss_pred cceeeeEeeeeee-EeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcc-c----cccccccccCHHHHHHHH Confidence 3445666666443 344556665454556677888888889999999999877421 1 122334456788999988 Q ss_pred HHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhccccc-ceeeeeeEEEEeecceeeeeec--cceeee----eeec Q lcl|NC_021299. 154 RWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAG-ASRLQTARIGRLAQYDVVTVDT--LPHGDA----YLSH 226 (387) Q Consensus 154 ~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~-~~~~~~g~ig~~~g~~v~~s~~--~~~~~~----~~~~ 226 (387) ..|.....+ ....+++|..+..|.+...- .|.-. ...+..|.-+.+.|+.|+.+.. +|.... +.+. T Consensus 256 ~~l~~~~~~--~a~~v~n~~~~~~l~~lkd~-----~g~~l~~~~~~~g~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~g 328 (397) T protein:vir:49 256 AKVDPAIKQ--TSLFLTNTSGFTALKKVKNA-----MGDYLMERDVKSPTGYSIDGFVVKEISDRFLPNGTGGAMPLYFG 328 (397) T ss_pred HhhhhhhcC--CCEEEEcHHHHHHHHHhhcc-----CCceeecccccCCCCceecceeeEEecccccccccCCceeEEEe Confidence 888776643 45778999998887653211 11100 0112345456788888776443 222111 0000 Q ss_pred --cccccccccccccccCceeeeeeecccccceeeeeeeeeecccee--eeeeeeeeeeccccceeeeccceeccccccc Q lcl|NC_021299. 227 --PTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTE--RSIVDTWIGVKAVLDPVTANLDDEPRFVRGT 302 (387) Q Consensus 227 --~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~--~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~ 302 (387) ...+.+.. ..+..+............+ ........+. ....... .. T Consensus 329 d~~~~~~~~~------------------~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~---------~~~~~~a---~~ 378 (397) T protein:vir:49 329 DLKQAVTLFD------------------RQHLSLLSTNIGGGAFETDTTKVRVIDRFDV---------VSTDTEA---FV 378 (397) T ss_pred eccceEEEEe------------------ecccEEEEeccccchhhcCeeeEEEEEeecc---------EEecccc---eE Confidence 00000000 0000000000000000000 0000000000 0000000 00 Q ss_pred eeeeeeeecccccccccccc Q lcl|NC_021299. 303 RIHLKATDAEIEGETVKAGE 322 (387) Q Consensus 303 ~v~~~~~~~~~~~~~~~~~~ 322 (387) .+++..... ..+.+-..+. T Consensus 379 ~~~~~~~~~-~~~~~~~~~~ 397 (397) T protein:vir:49 379 PASFKAIAD-QKAKLSTAGA 397 (397) T ss_pred EEEeccccc-ccCcccccCC Confidence 000000000 0000000000 No 128 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=97.37 E-value=8.2e-05 Score=42.97 Aligned_cols=266 Identities=17% Similarity=0.132 Sum_probs=114.9 Q ss_pred CccccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccc-cccceE Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASD-LTEVTV 79 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ 79 (387) =+...++|+.+..++++.+++...+.+++..- . . .| ...||+........ -.+++..+...+ .+-..+ T Consensus 144 ~~gg~~vP~~~~~~Ii~~l~~~~~i~~~~~~~--~-~---~g-~~~ip~~~~~~~a~----~v~E~~~~~~~~~~~f~~i 212 (425) T protein:vir:95 144 AGGELTIPEVVVNRIMDIMGDYTTLYPLVDKI--R-V---KG-TTRILVDTDTSPAT----WIEQSGALPTGDVGTIASI 212 (425) T ss_pred ccCceeccHHHHHHHHHHHHhhhhHHHhhcee--e-c---Cc-eeEEEEecCCcccc----cccccccccccccccccee Confidence 01224789999999999999998887776421 1 1 23 34666533221111 112333333333 233455 Q ss_pred EEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH-----------Hhccccc-ccccCCcchhHH Q lcl|NC_021299. 80 DIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYL-----------ITKAPYE-KVSLVDEDEIWN 147 (387) Q Consensus 80 ~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~-----------~~~~~~~-~~~~~~~~~~~~ 147 (387) ++...+ .+.-+.++++-+.....++...+..+..++++.++|..++.- +...+.. ..........|+ T Consensus 213 ~l~~~k-~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~G~G~~~~~p~Gil~~~~~~~~~~~~~~~~~~~ 291 (425) T protein:vir:95 213 DFDGFK-VGKVTFVDNYLLQDSIINLDDYVTKKIARAIAKALDLAIVKGTGAANKQPLGIIPSLPPENQVTVEADNNLLK 291 (425) T ss_pred eeehee-eeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccccccccccccchHH Confidence 554422 234445666555555667777777888999999999988842 1110000 111122345678 Q ss_pred HHHHHHHHHhhccCCcCCcEEEEchHHHHH-HhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeeec Q lcl|NC_021299. 148 GVVSNRRWLNEQKVPKDGRVLLVGSAVEEA-LLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSH 226 (387) Q Consensus 148 ~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~-l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~ 226 (387) .++++...+..+..+..+-..++++..+.. +.....+.. ..|. .....-.+..+.+.|+.++.++.+|....+... T Consensus 292 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~l~~l~~~kd--~~g~-~i~~~~~~~~~~l~G~pvv~~~~~~~~~i~~Gd 368 (425) T protein:vir:95 292 NLVKQIGLIDTGDDSVGEIVAVMKRSTYYNRLVEFSIQVD--SNGN-VVGKLPNLRTPDLLGLRVVFNNFLDDDTVLFGE 368 (425) T ss_pred HHHHHHHhhhhhccccCceEEEEeChHHHHHHHHHHhhcC--CCCc-eeeccCCCCCccccceeeEEcCcCCCccEEEEe Confidence 888887777665544444455677664432 221111000 0010 000011233456889999988888765322111 Q ss_pred cccccccccccccccCceeeeeeeccccc--ceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceecccccccee Q lcl|NC_021299. 227 PTAYAMLTRSPGRPMTNTVATSTVATENG--VQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRI 304 (387) Q Consensus 227 ~~a~~~~~~~~~~~~~~t~~~~~~~~~~~--~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v 304 (387) ...+.+.. ..+..........+.. ..+......| ....+. ..+....+ T Consensus 369 ~~~~~~~~-----~~~~~i~~~~~~~f~~~~~~~~~~~r~d------~~~~~~-------------------~a~~~~~i 418 (425) T protein:vir:95 369 FEQYTLVE-----RENITIDSSTHVKFTEDQTAFRGKGRFD------GKPVKP-------------------EAFVLVTI 418 (425) T ss_pred cccEEEEe-----ecceEEEeecccccccCceEEEEEEeeC------cEeecc-------------------cceEEEEe Confidence 11010000 0000110000000000 0000000000 000000 00000011 Q ss_pred eeeeeecccccc Q lcl|NC_021299. 305 HLKATDAEIEGE 316 (387) Q Consensus 305 ~~~~~~~~~~~~ 316 (387) +-+ ..+. T Consensus 419 ~~~-----~~g~ 425 (425) T protein:vir:95 419 TDP-----VQGA 425 (425) T ss_pred cCc-----CCCC Confidence 000 0000 No 129 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=97.37 E-value=8.3e-05 Score=42.97 Aligned_cols=280 Identities=11% Similarity=-0.009 Sum_probs=114.7 Q ss_pred Ccc------ccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccc-ccc Q lcl|NC_021299. 1 MAN------AFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMV-ASD 73 (387) Q Consensus 1 Ma~------~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~-~~~ 73 (387) |.. -+++|+-|..++++.+++...+..++..-. . .+....++++......... .-.+++.... .++ T Consensus 116 ~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~---~---~~~~~~~~~~~~~~~~~~~-~~v~E~~~~~~~~~ 188 (408) T protein:vir:74 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVES---V---STSSGSRVYEKWTDVTPLK-AMDEEDGKIPDLDN 188 (408) T ss_pred hcccccCCCceeechhHhhHHHHHHhhhcchhhhcceee---c---cCCcceEEEEeecCCcccc-cccccccccccccc Confidence 211 137799999999999999999888775321 1 1222233333221111000 0112222332 233 Q ss_pred cccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHHHH Q lcl|NC_021299. 74 LTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVSNR 153 (387) Q Consensus 74 ~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~ 153 (387) ++-..+++++.+ .+.-+.++++-+.....++...+.++..++++.++|..++.- ... ..+......|++++++. T Consensus 189 ~~~~~i~~~~~k-~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~G---~G~--~~~~~~~~~~~~i~~~~ 262 (408) T protein:vir:74 189 PRLTIIKYLIKR-YAGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAA---MGT--VPKKPTIANFDDVITMI 262 (408) T ss_pred cceeeEEeeeee-EEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhc---ccc--cccccccccHHHHHHHH Confidence 455666666633 334456666655555667888888888999999999987742 111 12223334577777754 Q ss_pred -HHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhccccc-ceeeeeeEEEEeecceeeeeec--cceeeeeeecccc Q lcl|NC_021299. 154 -RWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAG-ASRLQTARIGRLAQYDVVTVDT--LPHGDAYLSHPTA 229 (387) Q Consensus 154 -~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~-~~~~~~g~ig~~~g~~v~~s~~--~~~~~~~~~~~~a 229 (387) ..|..... .+-..+++|..+..|.+... ..|.-. ...+..|..+.+.|+.|+.+.. +|..... ... T Consensus 263 ~~~l~~~~~--~~a~~v~n~~~~~~l~~lkd-----~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~---~~~ 332 (408) T protein:vir:74 263 NTSVDPAII--ATSSLLTNQSGLNKLALVKT-----AEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNSGST---VYP 332 (408) T ss_pred HHhhhhhhc--CCCEEEEcHHHHHHHHHhhc-----CCCceEeccCcCCCCCceecceeeEEecCcccccccCC---cce Confidence 45554443 24467889999888865321 111110 0112344456788988876543 2221100 000 Q ss_pred ccccccccccccCceeeeeeecccccceeeeeeeeee--ccceeeeeeeeeeeeccccceeeeccceeccccccceeeee Q lcl|NC_021299. 230 YAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDA--TSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLK 307 (387) Q Consensus 230 ~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~--~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~ 307 (387) +.+ +... .. -......+..+.+...... .............+ ........+ ..+++. T Consensus 333 i~~--gd~~---~~----~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d---------~~~~~~~a~---~~~~~~ 391 (408) T protein:vir:74 333 LYY--GDMS---QA----ITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFD---------VKATDSEAL---VAGSFT 391 (408) T ss_pred EEE--Eehh---cc----EEEEEecceEEEEeccccchhhcceeeEEEEEeeC---------cEEecccce---EEEEee Confidence 000 0000 00 0000000111111100000 00000000000000 000000000 000000 Q ss_pred eeeccccccccccccceeEEEeec Q lcl|NC_021299. 308 ATDAEIEGETVKAGEKLALALEDS 331 (387) Q Consensus 308 ~~~~~~~~~~~~~~~~~~~~~~~~ 331 (387) . .. -..+.+.+.+++.. T Consensus 392 ~--~~-----~~~~~~~~~~~~~~ 408 (408) T protein:vir:74 392 A--IA-----DQVGNFKTTTSTAV 408 (408) T ss_pred c--cc-----CCCCCCCCCccccC Confidence 0 00 00000000000000 No 130 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=97.37 E-value=7.1e-05 Score=43.34 Aligned_cols=264 Identities=11% Similarity=-0.010 Sum_probs=117.8 Q ss_pred Ccc------ccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccc- Q lcl|NC_021299. 1 MAN------AFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASD- 73 (387) Q Consensus 1 Ma~------~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~- 73 (387) |.. -+++|+-|..++++.+++..++..+++.- . -.+...++|+....... .. .+++......+ T Consensus 130 l~~~t~~~gG~lvP~~~~~~ii~~~~~~s~l~~l~~~~-----~-~~~~~~~~~~~~~~~~a-~w---v~E~~~~~~~~~ 199 (425) T protein:vir:10 130 LNKGEDSEGGYLTPIEWDRTITNKLVLISPMRQLCRVQ-----P-VSKAGFSKLFNMGGTTS-GW---VGEASQRPQTNA 199 (425) T ss_pred hhcCcCCCCceeccHhHHHHHHHHHHhhhhhhhhceee-----e-ccCCceEEEEEcCCcce-ee---eccccccccccc Confidence 321 13789999999999999999998877421 1 11234556543222211 11 12333332222 Q ss_pred cccceEEEEEEeeee-cceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH---------Hhccccc-------- Q lcl|NC_021299. 74 LTEVTVDIKLTDVIY-NRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYL---------ITKAPYE-------- 135 (387) Q Consensus 74 ~~~~~~~~~id~~~~-~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~---------~~~~~~~-------- 135 (387) .+-..+++.. ++. .-+.++++-+.....++...+.++..++++.++|..++.- +...... T Consensus 200 ~~f~~v~~~~--~k~~~~i~iS~ell~ds~~~l~~~i~~~la~ai~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~ 277 (425) T protein:vir:10 200 ATFQPLSFAS--GEIYANPAATQQILDDAEIDLESWLATEVQTEFAKQEGKAFLAGDGTNKPNGLLTYIAGGANAAKHPF 277 (425) T ss_pred cccceeeeeh--eeeEeehHhHHHHHhcchhHHHHHHHHHHHHHHHHHHHhhhhcccCCCCcceeeeccccccccccccc Confidence 2334445444 333 3344555444444568888888999999999999987731 1100000 Q ss_pred -----ccccCCcchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccc-cceeeeeeEEEEeecc Q lcl|NC_021299. 136 -----KVSLVDEDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEA-GASRLQTARIGRLAQY 209 (387) Q Consensus 136 -----~~~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~-~~~~~~~g~ig~~~g~ 209 (387) ..+.......|++++++...|.... ..+-..+++|..+..|.+...- .|.- -...+..|..+.+.|+ T Consensus 278 ~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~--~~~a~~vmn~~~~~~L~~lkD~-----~G~~l~~~~~~~g~~~~l~G~ 350 (425) T protein:vir:10 278 GAIEVVNSGAAADITSDGIIDLVYDLPSAF--TGNARFAMNRNTQRQVRKLKDG-----QGNYLWQPSYVAGQPATLAGY 350 (425) T ss_pred cccccccccccccccHHHHHHHHhhhhhhh--ccCCEEEEchHHHHHHHHhhcC-----CCceeeccCccCCCCceecce Confidence 0011223356788888877776543 2344678999998887642211 1110 0112344555678999 Q ss_pred eeeeeeccceeeeeeeccccccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceee Q lcl|NC_021299. 210 DVVTVDTLPHGDAYLSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVT 289 (387) Q Consensus 210 ~v~~s~~~~~~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~ 289 (387) .|+.++.+|...... ..+.+ +... .. ........+.+..+..... +...+..... T Consensus 351 PV~~~~~~p~~~~~~---~~i~~--Gd~~---~~------~~i~~~~~~~v~~d~~~~~-----------~~~~~~~~~r 405 (425) T protein:vir:10 351 PVTEVPDMPDVAANS---TPILF--GDFQ---QT------YLIIDRIGVRVLRDPYTAK-----------PYVLFYTTKR 405 (425) T ss_pred eeEEecCcCCccCCc---cEEEE--Eehh---cc------EEEEEecceEEEecccccC-----------CcEEEEEEEE Confidence 999988887421110 00000 0000 00 0000000111111100000 0000000000 Q ss_pred ecccee-ccccccceeeeeeee Q lcl|NC_021299. 290 ANLDDE-PRFVRGTRIHLKATD 310 (387) Q Consensus 290 ~~~~~~-~~~v~~~~v~~~~~~ 310 (387) ...... +..+.. +.+.... T Consensus 406 ~d~~v~~~~A~~~--l~~~as~ 425 (425) T protein:vir:10 406 VGGGLLNPEPMRA--MKVAASE 425 (425) T ss_pred eccEeecccceEE--EEeeccC Confidence 000000 000000 1110000 No 131 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=97.29 E-value=7e-05 Score=43.38 Aligned_cols=283 Identities=12% Similarity=0.043 Sum_probs=112.2 Q ss_pred CccccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccc-----ccccccc Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRN-----MVASDLT 75 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~ 75 (387) -+..++.||.+..++++.+++..++..++..-. +.+ .+..+.||+.......-+.. +++.. ....+++ T Consensus 163 ~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~---~~~-~~~~~~ip~~~~~~~~a~~~---~Eg~~~~~~~~~~s~~~ 235 (477) T protein:vir:84 163 TGGYAVPPLWMMNRFIELARAGRTYANLCPTEP---LPG-GTSSINIPKILTGTSTAIQA---ADNAALTAPSAHEVDLT 235 (477) T ss_pred CcceeeccchhHHHHHHHhhhcchHHHhhceee---ecC-CcceeEEEEEecCcceeeee---ccCcccccccccccccc Confidence 112245678888899999999888877664321 111 23456776532222111111 12221 2223334 Q ss_pred cceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcc---cc----c----ccccCCcc- Q lcl|NC_021299. 76 EVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKA---PY----E----KVSLVDED- 143 (387) Q Consensus 76 ~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~---~~----~----~~~~~~~~- 143 (387) -..+++...+. +.-+.++++-+.....++...+.++..++++.++|..++.- .+. +. . ........ T Consensus 236 f~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~~l~G-~Gt~~~p~Gi~~~~~~~~~~~~~~~~ 313 (477) T protein:vir:84 236 DGFVQANVKTI-AGQQGIAIQLLDQAAVSVDEFVFRDLAADYANKLNVQVISG-TGSNNQVVGVRATAGITQVTATSAGS 313 (477) T ss_pred eeeEEEeeeeE-EeeeHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHhcc-CCCCCccceeeecccccccccccccc Confidence 44455544332 22334554444444557777778888999999999987731 111 00 0 00111111 Q ss_pred ------hhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhh-----hhcccc----cceeeeeeEEEEeec Q lcl|NC_021299. 144 ------EIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRY-----DSAGEA----GASRLQTARIGRLAQ 208 (387) Q Consensus 144 ------~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~-----~~~g~~----~~~~~~~g~ig~~~g 208 (387) ..++.++++...++.... ......+++|..+..|.+...-... ...+.. ....+..+..+.+.| T Consensus 314 t~~~~~~~~~~i~~~~~~~~~~~~-~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G 392 (477) T protein:vir:84 314 ALEKHQIIYQKIADAIQRVHTSRF-LEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHG 392 (477) T ss_pred chhhHHHHHHHHHHHHhhcccccc-CCccEEEEcHHHHHHHHHhhccCCCeeeecCcccccccccccccccccccchhcc Confidence 224445555544443321 2234678899888877542211110 000000 011233445578899 Q ss_pred ceeeeeeccceeeeeeeccccccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeecccccee Q lcl|NC_021299. 209 YDVVTVDTLPHGDAYLSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPV 288 (387) Q Consensus 209 ~~v~~s~~~~~~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~ 288 (387) +.++.++.+|.+.+.......+.+..-. .. .. ...+... ..+.+.........+ ..++.... T Consensus 393 ~pVv~s~~~p~~~~~~~d~~~i~~gd~~-----~~-----~i-~~~~~~~--~~~~~~~~~~~~~~~-~v~~~~~~---- 454 (477) T protein:vir:84 393 LPVVTDPTLPTTLGTGTDQDVIHVLRAS-----DL-----AL-FESSVRM--RALQETRAENLSVLL-QVYGYLAF---- 454 (477) T ss_pred cceEecCcccccccccCCcceEEEEEec-----eE-----EE-EeeceeE--Eeccccccccceeee-eehhhhhh---- Confidence 9999999888653221111111111000 00 00 0000000 000000000000000 00000000 Q ss_pred eeccceeccccccceeeeeeeeccccccccc Q lcl|NC_021299. 289 TANLDDEPRFVRGTRIHLKATDAEIEGETVK 319 (387) Q Consensus 289 ~~~~~~~~~~v~~~~v~~~~~~~~~~~~~~~ 319 (387) ........++... ....+ .-+.. T Consensus 455 -~~~r~~~afv~~t-----~~~~~--~~~~~ 477 (477) T protein:vir:84 455 -TAARFPQSVVEIG-----GTALT--APTFA 477 (477) T ss_pred -hhhccccceEEee-----ccccc--ccccC Confidence 0000000010000 00000 00000 No 132 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=97.27 E-value=0.00011 Score=42.27 Aligned_cols=265 Identities=16% Similarity=0.053 Sum_probs=116.2 Q ss_pred Ccc------ccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccc-cc Q lcl|NC_021299. 1 MAN------AFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVA-SD 73 (387) Q Consensus 1 Ma~------~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~-~~ 73 (387) |.. -+++|+-|..++++.+++..++..++..- . . .|....+|+-...... .. .+++..... +. T Consensus 107 ~~~~~~~~GG~~iP~~~~~~ii~~~~~~~~l~~~~~~~--~-~---~~~~~~~~~~~~~~~a--~w--v~E~~~~~~~~~ 176 (401) T protein:vir:44 107 LQVGTDEDGGYAVPEELDRSILSLLKDEVVMRQEATVI--T-V---GGSDYKKLVNLGGTAS--GW--VGETDTRSQTAT 176 (401) T ss_pred hhcCCCCCCceeccHhHHHHHHHHHHhhhhhhhhceee--e-c---CCCceEEEEecCCccc--ee--eccccccCcccc Confidence 332 24889999999999999999887776431 1 1 1334445432111111 11 122222221 22 Q ss_pred cccceEEEEEEeeeec-ceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH-Hhcccc----------------- Q lcl|NC_021299. 74 LTEVTVDIKLTDVIYN-RIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYL-ITKAPY----------------- 134 (387) Q Consensus 74 ~~~~~~~~~id~~~~~-~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~-~~~~~~----------------- 134 (387) .+-..+++.. ++.. -+.++.+-+.....++...+.++..++++..+|..++.- -.+.+. T Consensus 177 ~~~~~v~~~~--~k~~~~~~iS~ell~ds~~~l~~~i~~~la~ai~~~~~~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~ 254 (401) T protein:vir:44 177 SRLGLIEPFM--GEIYGNPQATQKMLDDAFFNVEAWINSELATEFAEQEEIAFTTGDGTKKPKGFLAYESTEESDKARAF 254 (401) T ss_pred ccceeeeeeh--hheeeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeecccccccccccccc Confidence 2344444444 3333 344555444444567778888888999999999887731 111000 Q ss_pred ----cccccCCcchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecce Q lcl|NC_021299. 135 ----EKVSLVDEDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYD 210 (387) Q Consensus 135 ----~~~~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~ 210 (387) ...+.......|++++++...|..... .+-..++++..+..|.+...-... .. ....+..|..+.+.|+. T Consensus 255 ~~~~~~~t~~~~~~~~d~i~~~~~~l~~~~~--~~a~~v~n~~~~~~L~~lkd~~G~-~l---~~~~~~~g~~~~l~G~P 328 (401) T protein:vir:44 255 GKLQHIVSGEATAVTADAIIKLIYTLRKAHR--TGAKFMMNNNSLFAIRLLKDTEGN-YL---WRPGLELGQPSSLAGYG 328 (401) T ss_pred ccccccccccccccCHHHHHHHHHhcchhhh--cCCEEEEcHHHHHHHHHhhccCCc-ee---ecCCcCCCCCceeccee Confidence 001112233458899988888765432 344578999998888542111100 00 01123455566789999 Q ss_pred eeeeeccceeeeeeeccccccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeee Q lcl|NC_021299. 211 VVTVDTLPHGDAYLSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTA 290 (387) Q Consensus 211 v~~s~~~~~~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~ 290 (387) |+.++.+|...... ..+.+ +.... .+ .......+.+..+................+ . T Consensus 329 Vv~~~~~p~~~~~~---~~i~~--Gd~~~------~~---~i~~~~~~~~~~~~~~~~~~v~~~a~~r~d---------~ 385 (401) T protein:vir:44 329 IAENEQMPDIAADA---KAIAF--GNFKR------GY---TIVDRIGTRILRDPYTNKPFVGFYTTKRTG---------G 385 (401) T ss_pred eEEecCcCCccCCc---cEEEE--eehhc------cE---EEEEecceEEeeeccccCCcEEEEEEEEec---------c Confidence 99988887421100 00000 00000 00 000000111111100000000000000000 0 Q ss_pred ccceeccccccceeeeeee Q lcl|NC_021299. 291 NLDDEPRFVRGTRIHLKAT 309 (387) Q Consensus 291 ~~~~~~~~v~~~~v~~~~~ 309 (387) .......+ ..+.+... T Consensus 386 ~~~~~~a~---~~l~~~aa 401 (401) T protein:vir:44 386 MLVDSQAI---KLLKIAAA 401 (401) T ss_pred EEecccce---EEEEeecC Confidence 00000000 00111000 No 133 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=97.25 E-value=0.00012 Score=42.18 Aligned_cols=279 Identities=11% Similarity=0.018 Sum_probs=112.0 Q ss_pred Cc-c-------ccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccc Q lcl|NC_021299. 1 MA-N-------AFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVAS 72 (387) Q Consensus 1 Ma-~-------~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~ 72 (387) ++ + -+++|+-|..++++.+++..++..+..+. +....| .+.+|........ +. .+++...+-. T Consensus 130 ~~~~~~t~~~gg~~vP~~~~~~ii~~l~~~~~i~~~~~~~----~~~~~~-~~~~p~~~~~~~a-~~---v~E~~~~~~~ 200 (435) T protein:vir:14 130 MSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGART----LPLSNG-NITIPRLKGGAIV-GY---IGADTDIPTT 200 (435) T ss_pred hhcccCCcCCCccccchhHHHHHHHHHhhhchhhhhccee----eecCCC-ceEEEEEeCCcce-ee---eccCcccccc Confidence 11 1 13789999999999999888776652222 111122 4566554222111 11 1334444445 Q ss_pred ccccceEEEEEEeeeecceeeccHHHhhhhhh--HHHHHHHHHHHHHHHHHHHHHHHHHhcc---ccc---------c-- Q lcl|NC_021299. 73 DLTEVTVDIKLTDVIYNRIDLTDEERELDVRS--FAVDVLPRQVRAVAEQIEDAVSYLITKA---PYE---------K-- 136 (387) Q Consensus 73 ~~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~--~~~~~~~~~~~~la~~vd~~~~~~~~~~---~~~---------~-- 136 (387) +++-..+++...+. +.-+.++++-+.....+ +...+..+..++++.++|..++.- .+. +.. . T Consensus 201 ~~~f~~i~~~~~k~-~~~~~iS~ell~ds~~~~~l~~~i~~~l~~ai~~~~d~a~l~G-~G~~~~p~Gi~~~~~~~~~~~ 278 (435) T protein:vir:14 201 QQQFDDLKLTAKKM-AALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRD-DGTANTPKGLRFWALPSNVIT 278 (435) T ss_pred ccceeEEEeeeEEE-EEeehhhHHHHHhhccCHHHHHHHHHHHHHHHHHHHHHHhhcc-CCCCccccceeecccccceec Confidence 55555566555332 33455555533333223 445556677899999999988731 010 000 0 Q ss_pred -cccCCcchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeee Q lcl|NC_021299. 137 -VSLVDEDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVD 215 (387) Q Consensus 137 -~~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~ 215 (387) ....+....+.++.++...|.....-..+...+++|..+..|.+... ..| ...+....-+.+.|+.++.++ T Consensus 279 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd-----~~G---~~l~~~~~~g~l~G~Pv~~~~ 350 (435) T protein:vir:14 279 ASDASTLQKIETDLGKVILALENADANLTQPGWIMAPRTFRFLEGLRD-----GNG---NKVYPELANGMLKGYPVGKTT 350 (435) T ss_pred cccccchhhHHHHHHHHHHHhhhccccccCCEEEEcHHHHHHHHHhhc-----cCC---ceeccCCCCCeeecceeEeec Confidence 01122223345566666666655433345567899999887754321 111 111111122467899999998 Q ss_pred ccceeeeeeeccccccccccccccccCceeeeeeecccccceeeeeeeeeeccce-eeeeee-eeeeeccccceeeeccc Q lcl|NC_021299. 216 TLPHGDAYLSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTT-ERSIVD-TWIGVKAVLDPVTANLD 293 (387) Q Consensus 216 ~~~~~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~-~~~~~~-~~~g~~~~~~~~~~~~~ 293 (387) .+|...........+.+. .. +.........+.+..+.+..... ...... ...+...+......... T Consensus 351 ~~p~~~~~~~~~~~i~~g--d~----------s~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~ 418 (435) T protein:vir:14 351 QVPINLGETGKESEIYFT--DF----------GDVFIGEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFG 418 (435) T ss_pred cccccccCCCccceEEEe--ec----------ccEEEEEecccEEEEeccccccccccchhhhhhcChhheeeeeeeCce Confidence 887642211111000000 00 00000000000000000000000 000000 00000000000000000 Q ss_pred eeccccccceeeeeeeecccccccccc Q lcl|NC_021299. 294 DEPRFVRGTRIHLKATDAEIEGETVKA 320 (387) Q Consensus 294 ~~~~~v~~~~v~~~~~~~~~~~~~~~~ 320 (387) +..+....-++...-+. T Consensus 419 ----------~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:14 419 ----------PRHVESIAVLAGVAWGA 435 (435) T ss_pred ----------eecccceEEEecCCCCC Confidence 00000000011111111 No 134 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=97.16 E-value=0.00015 Score=41.60 Aligned_cols=264 Identities=13% Similarity=0.029 Sum_probs=111.6 Q ss_pred Cccc-------cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccc Q lcl|NC_021299. 1 MANA-------FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASD 73 (387) Q Consensus 1 Ma~~-------~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 (387) |+.. .++|+.+..++++.+++..++..+-.|- +....| .+++|+....... +. .+++..++..+ T Consensus 64 ~a~~~~~~~Gg~lvP~~~~~~ii~~l~~~s~l~~lg~~~----v~~~~g-~~~~p~~t~~~~a-~w---v~E~~~~~~s~ 134 (366) T protein:vir:57 64 MAISTAAGSGGALIPQNMQNEVIELLRDRTVVRILGARS----IPLPNG-NLSMPRLSGGATA-GY---VGEGKDVVATG 134 (366) T ss_pred hhccccccCCccccchhHHHHHHHHHhhhcchhhhceee----eecCCC-ceEEEEEeCCcce-ee---eccCccccccc Confidence 2211 3679999999999999988886652222 222223 4667654322111 11 23445555556 Q ss_pred cccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHh-c-cccc----------ccccCC Q lcl|NC_021299. 74 LTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLIT-K-APYE----------KVSLVD 141 (387) Q Consensus 74 ~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~-~-~~~~----------~~~~~~ 141 (387) ++-..+++...+ .+.-+.++++-+.....++...+.++..++++.++|..++.--. + .+.. ...... T Consensus 135 ~~f~~i~~~~~k-~~~~~~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~ 213 (366) T protein:vir:57 135 ATFDDVKLSAKT-MIALVPVSNQLIGRAGFNVEQLLLGDILSAIATREDKAFLRDDGTGDTPKGMKAVATAANRLVAWTG 213 (366) T ss_pred cceeEEEEeeEE-EEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeeccccccceeeccc Confidence 666667766633 33455666665445556777777788899999999998773210 0 0000 000001 Q ss_pred cchhHHHHHHHH----HHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeecc Q lcl|NC_021299. 142 EDEIWNGVVSNR----RWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTL 217 (387) Q Consensus 142 ~~~~~~~i~~a~----~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~ 217 (387) ....+..+.... ..+...+....+-..+++|..+..|.+... ..| ...+....-+.+.|+.++.++.+ T Consensus 214 t~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~vmn~~~~~~L~~lkd-----~~G---~~l~~~~~~g~l~G~Pvv~s~~i 285 (366) T protein:vir:57 214 TAINLTTIDEYLDSLILKHMDSNSNMIRCGWGLSNRTYMTLFGLRD-----GNG---NKVYPEMSQGILKGYPIQRTSAI 285 (366) T ss_pred cccchhhHHHHHHHHHHhhhccccccccCEEEecHHHHHHHHhhhc-----cCC---ceeccCCCCCeecceeeEEcccc Confidence 111222222111 111122222233446799998888765321 111 11111222356889999999988 Q ss_pred ceeeeeeeccccccccccc---cccccCceeeeeeecc-------------cccceeeeeeeeeeccceeeeeeeeeeee Q lcl|NC_021299. 218 PHGDAYLSHPTAYAMLTRS---PGRPMTNTVATSTVAT-------------ENGVQLRWLGDYDATSTTERSIVDTWIGV 281 (387) Q Consensus 218 ~~~~~~~~~~~a~~~~~~~---~~~~~~~t~~~~~~~~-------------~~~~~~~~~~~~d~~~~~~~~~~~~~~g~ 281 (387) |...........+.+..-. .....+.......... .+...+......+..... T Consensus 286 p~~~~~~~~~~~i~~gdfs~~~i~~~~~i~i~~~~ea~~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~----------- 354 (366) T protein:vir:57 286 PANLGDDGNESEIYFCDFNDVVIGEDGMMKVDFSTEATYKDADGQLVSAFARNQSLIRVVTEHDIGFRH----------- 354 (366) T ss_pred ccccccCCCccEEEEEecceEEEEEecceEEEEeeccccccccccchhhhhcCceeEEeeeeeCcEeec----------- Confidence 8643211111111110000 0000000000000000 000011111111100000 Q ss_pred ccccceeeeccceeccccccceeeeeeeec Q lcl|NC_021299. 282 KAVLDPVTANLDDEPRFVRGTRIHLKATDA 311 (387) Q Consensus 282 ~~~~~~~~~~~~~~~~~v~~~~v~~~~~~~ 311 (387) ...++....+ .. T Consensus 355 -------------~~a~~~lt~~-----~~ 366 (366) T protein:vir:57 355 -------------PEGLVLGTGV-----IW 366 (366) T ss_pred -------------cccEEEEecc-----cC Confidence 0000000000 00 No 135 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=97.13 E-value=0.00016 Score=41.41 Aligned_cols=271 Identities=13% Similarity=0.036 Sum_probs=118.6 Q ss_pred Cc------cccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccc-c Q lcl|NC_021299. 1 MA------NAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVAS-D 73 (387) Q Consensus 1 Ma------~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~-~ 73 (387) |. .-.++|+-|..++++.+++..++..++..- .- .+..+.+|+-....... -.+++...+.. . T Consensus 106 ~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~l~~~~~~~-----~~-~~~~~~~~~~~~~~~a~----~v~E~~~~~~~~~ 175 (407) T protein:vir:48 106 LQVGNDEDGGYAIPEELDRTILTLLKDEVVMRQEATVI-----TL-GGSDYKKLVNLGGTTSG----WVGETDARPETAT 175 (407) T ss_pred hhcccCCCCcccccHhHHHHHHHHHHhhhhhhhhceee-----ec-CCCceEEEEecCCccee----eeccccccccccc Confidence 22 124789999999999999999887776421 11 13345554322211111 11223333222 2 Q ss_pred cccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH-Hhcccc------------------ Q lcl|NC_021299. 74 LTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYL-ITKAPY------------------ 134 (387) Q Consensus 74 ~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~-~~~~~~------------------ 134 (387) ..-..+++.+.+. ..-+.++++-+.....++...+.++..++++.++|..++.- -.+.+. T Consensus 176 ~~f~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~~~a~l~G~G~~~p~Gil~~~~~~~~~~~~~~~ 254 (407) T protein:vir:48 176 SKLGLIEPFMGEI-YGNPQATQKMLDDAFFNVEDWINSELALEFAEQEEIAFTSGDGSKKPKGFLAYESTDEDDKTRAFG 254 (407) T ss_pred ccceeEEeeeeee-EeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeeecccccccccccccc Confidence 3344555555332 22345665554555667888888888999999999887631 000000 Q ss_pred ---cccccCCcchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeeccee Q lcl|NC_021299. 135 ---EKVSLVDEDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDV 211 (387) Q Consensus 135 ---~~~~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v 211 (387) ...+.......|++++++...|..... .+-..++++..+..|.+...-... .. ....+..|..+.+.|+.+ T Consensus 255 ~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~--~~a~~v~n~~~~~~L~~lkD~~Gr-~l---~~~~~~~g~~~~l~G~PV 328 (407) T protein:vir:48 255 KLQHIASGAASGVTADAIIKLIYTLRKAHR--SGAKFMMNNSSLFAIRLLKDNDGN-YL---WRPGIELGQPSSLAGYGI 328 (407) T ss_pred cccccccccccccChHHHHHHHHhhchhhh--cCCEEEEcHHHHHHHHHhhccCCc-ee---eccCcCCCCCceecceee Confidence 001111233458889888888876543 233568999988877542111100 00 011234556678899999 Q ss_pred eeeeccceeeeeeeccccccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeec Q lcl|NC_021299. 212 VTVDTLPHGDAYLSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTAN 291 (387) Q Consensus 212 ~~s~~~~~~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~ 291 (387) +.++.+|...... ..+.+ +.. .. .........+.+..+..... +...+....... T Consensus 329 ~~~~~~p~~~~~~---~~i~~--Gd~----~~-----~~~i~~~~~~~i~~d~~~~~-----------~~~~~~~~~r~d 383 (407) T protein:vir:48 329 VENEQMPDIAADA---KAIAF--GNF----KR-----GYTIVDRIGTRILRDPYTNK-----------PFVGFYTTKRTG 383 (407) T ss_pred EEecCcCCccCCc---cEEEE--Eec----cc-----cEEEEEeeceEEEeeccccC-----------CcEEEEEEEEec Confidence 9998887521100 00000 000 00 00000000111111100000 000000000000 Q ss_pred c--ceeccccccceeeeeeeecccccccc Q lcl|NC_021299. 292 L--DDEPRFVRGTRIHLKATDAEIEGETV 318 (387) Q Consensus 292 ~--~~~~~~v~~~~v~~~~~~~~~~~~~~ 318 (387) . .....+ ..+.+... +.+...- T Consensus 384 ~~v~~~~a~---~~l~~~aa--~~~~~~~ 407 (407) T protein:vir:48 384 GMLVDSQAI---KLMKIGAA--TRQKAAA 407 (407) T ss_pred cEEecccce---EEEEeecc--CCCCCCC Confidence 0 000000 00111000 0000000 No 136 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=97.07 E-value=0.00018 Score=41.08 Aligned_cols=277 Identities=10% Similarity=0.034 Sum_probs=114.2 Q ss_pred CccccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccccceEE Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEVTVD 80 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (387) ..--+++|+-|..++++.+++..++..+..+- +.... ..+.+|+....... .. .+++...+-.+++-..++ T Consensus 138 ~~gg~lvP~~~~~~ii~~l~~~~~i~~~~~~~----v~~~~-~~~~~p~~~~~~~a--~~--v~E~~~~~~~~~~f~~i~ 208 (435) T protein:vir:80 138 GAGGVLVPENLSSEVIELLRPKSVVRKLGART----LPLSN-GNITIPRLKGGAIV--GY--IGADTDIPTTQQQFDDLK 208 (435) T ss_pred CCCccccchhHHHHHHHHHhhhchhhhcccee----eecCC-CceEEEEEeCCcce--ee--eccCccccccccceeeEE Confidence 11123789999999999999888776652221 11111 24566543222111 11 123444444555555666 Q ss_pred EEEEeeeecceeeccHHHhhhh--hhHHHHHHHHHHHHHHHHHHHHHHHHHhcc---ccc------------ccccCCcc Q lcl|NC_021299. 81 IKLTDVIYNRIDLTDEERELDV--RSFAVDVLPRQVRAVAEQIEDAVSYLITKA---PYE------------KVSLVDED 143 (387) Q Consensus 81 ~~id~~~~~~~~~~d~~~~~~~--~~~~~~~~~~~~~~la~~vd~~~~~~~~~~---~~~------------~~~~~~~~ 143 (387) +...+. +.-+.++++-+.... .++...+.++..++++.++|..++.- .+. +.. .....+.. T Consensus 209 ~~~~k~-~~~~~is~ell~ds~~~~~l~~~i~~~l~~a~~~~~d~a~l~G-~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~ 286 (435) T protein:vir:80 209 LTAKKM-AALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRD-DGTANTPKGLRFWALPGNVITASDGSTLQ 286 (435) T ss_pred EeeEEE-EEeehhhHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHHhhcc-CCCCCcccceeecccccceeecccccchh Confidence 655333 345556655433332 24556666777899999999988732 111 000 00111222 Q ss_pred hhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeee Q lcl|NC_021299. 144 EIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAY 223 (387) Q Consensus 144 ~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~ 223 (387) ..+.++.++...|........+-..+++|..+..|.+... ..|. ..+....-+.+.|+.++.++.+|..... T Consensus 287 ~~~~d~~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd-----~~G~---~l~~~~~~~~l~G~pv~~~~~~p~~~~~ 358 (435) T protein:vir:80 287 KIETDLGKAILALENADANLTQPGWIMAPRTFRFLEGLRD-----GNGN---KVYPELANGMLKGYPVGKTTQVPINLGE 358 (435) T ss_pred hHHHHHHHHHHHhhccccccccCEEEEcHHHHHHHHhhhc-----cCCc---eeccCCCCCeEeeeeeEEeccccccccC Confidence 3345666766667666544445566899998887754221 1111 1111111246889999999988764322 Q ss_pred eeccccccccccccccccCceeeeeeecccccceeeeeeeeeeccc-eeeeeeeee-eeeccccc--eeeeccceecccc Q lcl|NC_021299. 224 LSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATST-TERSIVDTW-IGVKAVLD--PVTANLDDEPRFV 299 (387) Q Consensus 224 ~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~-~~~~~~~~~-~g~~~~~~--~~~~~~~~~~~~v 299 (387) ......+.+.. +.........+..+ ..+...... ......... .....+.. ...........++ T Consensus 359 ~~~~~~i~~gd----------~s~~~i~~~~~~~i--~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~ 426 (435) T protein:vir:80 359 AGKESEIYFTD----------FGDVFIGEEETLEI--DYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIA 426 (435) T ss_pred CCCcceEEEEE----------cccEEEEeecceEE--EEeccccccccccchhhhhhcCcceeeeeeeeCcEeecccceE Confidence 11111010000 00000000000000 000000000 000000000 00000000 0000000001111 Q ss_pred ccceeeeeeeecccccccccc Q lcl|NC_021299. 300 RGTRIHLKATDAEIEGETVKA 320 (387) Q Consensus 300 ~~~~v~~~~~~~~~~~~~~~~ 320 (387) ....+ ..+. T Consensus 427 ~l~~~------------~~~~ 435 (435) T protein:vir:80 427 VLSGV------------AWGA 435 (435) T ss_pred EEecc------------CCCC Confidence 11111 1111 No 137 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=97.06 E-value=0.00019 Score=41.03 Aligned_cols=278 Identities=11% Similarity=-0.023 Sum_probs=116.3 Q ss_pred Cc------cccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccc-ccc Q lcl|NC_021299. 1 MA------NAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMV-ASD 73 (387) Q Consensus 1 Ma------~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~-~~~ 73 (387) |. -..++|+.+..++++.+++..++..++..-. .. +...+++++...... ....-.+++.... .+. T Consensus 116 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~---~~---~~~~~~~~~~~~~~~-~~a~~v~Eg~~~~~~~~ 188 (404) T protein:vir:39 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVES---VS---TSNGSRVYEKWTDVT-PLTVMDAEDGKIPDLDN 188 (404) T ss_pred hhcccccCCceeccHHHHHHHHHHHHhhhhHHhhcceee---cc---CCcceEEEEeecCCc-cceeeecCccccccccc Confidence 21 1246899999999999999999888764321 11 222333333211110 0000112233332 234 Q ss_pred cccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHHHH Q lcl|NC_021299. 74 LTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVSNR 153 (387) Q Consensus 74 ~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~ 153 (387) +.-..+++.+.+. +.-+.++++-+.....++...+.++..++++.++|..++..- . ...+......++++.++. T Consensus 189 ~~f~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~g~---g--~~~~~~~~~~~~~i~~~~ 262 (404) T protein:vir:39 189 PRLTIIKYLIKRY-AGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAM---G--TVPKKPTIAKFDDVITMI 262 (404) T ss_pred cceeeEEeeeeeE-EeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhcc---c--ccccccccccHHHHHHHH Confidence 5556677777443 344567766555556777777888889999999999887421 1 112223334577777664 Q ss_pred H-HHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhccccc-ceeeeeeEEEEeecceeeeeeccceeeeeeecccccc Q lcl|NC_021299. 154 R-WLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAG-ASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTAYA 231 (387) Q Consensus 154 ~-~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~-~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a~~ 231 (387) . .++.... .+-..+++|..+..|.+...- .|... ...+..+..+.+.|+.++.+...+...... .. .. T Consensus 263 ~~~~~~~~~--~~a~~v~n~~~~~~L~~lkd~-----~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~-~~--~~ 332 (404) T protein:vir:39 263 NTSVDPAII--ATSSLLTNQSGLNKLALVKTA-----EGKYLLEPDPTKPNSYLIKGKKVIVVADRWLPNSGS-TV--YP 332 (404) T ss_pred HHhhhhhhc--cCCEEEEcHHHHHHHHHhhcc-----CCceeeccCcCCCCcceecceeEEEecccccCccCC-Cc--cE Confidence 3 3433322 234678999998888753111 11100 011233444678888887654322111000 00 00 Q ss_pred ccccccccccCceeeeeeecccccceeeeeeeeee--ccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeee Q lcl|NC_021299. 232 MLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDA--TSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKAT 309 (387) Q Consensus 232 ~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~--~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~ 309 (387) +..+... .. -......+..+.+...... .............+ ........ ...+++ . T Consensus 333 ~~~gd~~---~~----~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d---------~~~~~~~a---~~~~~~--~ 391 (404) T protein:vir:39 333 LYYGDMS---QA----ITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFD---------VKTTDSEA---LVAGSF--T 391 (404) T ss_pred EEEEecc---cc----EEEEeecceEEEEeccchhhhhhceeeEEEEeeec---------cEEecccc---eEEEEe--e Confidence 0000000 00 0000000111111100000 00000000000000 00000000 001111 1 Q ss_pred ecccccccccccc Q lcl|NC_021299. 310 DAEIEGETVKAGE 322 (387) Q Consensus 310 ~~~~~~~~~~~~~ 322 (387) .......+.+.|. T Consensus 392 ~~a~~~~~~~~~~ 404 (404) T protein:vir:39 392 AIADQVGNFTAGK 404 (404) T ss_pred ccccCCCCCCCCC Confidence 1111122223332 No 138 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=97.03 E-value=0.00016 Score=41.42 Aligned_cols=272 Identities=10% Similarity=-0.005 Sum_probs=107.9 Q ss_pred Ccc------ccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccc--c Q lcl|NC_021299. 1 MAN------AFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVA--S 72 (387) Q Consensus 1 Ma~------~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~--~ 72 (387) |.. ..++|+-|..++++.+++..++..++.... ... ...++.++.-.... ..... +++..... . T Consensus 110 ~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~---~~~-~~g~~~~~~~~~~~--~~~~v--~e~~~~~~~~~ 181 (404) T protein:vir:10 110 ISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEP---VFT-RSGSRTYEKRSKQK--PMKPL--SENQQIPTNGD 181 (404) T ss_pred hccccCCCCceeechhHHHHHHHHHhhhhhHhhhhceee---ccC-CccceEEEEecCCc--ceeec--ccccccccccc Confidence 321 236799999999999999998888764321 111 12234443321111 11111 12222222 2 Q ss_pred ccccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhc-c--------cccccccCCcc Q lcl|NC_021299. 73 DLTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITK-A--------PYEKVSLVDED 143 (387) Q Consensus 73 ~~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~-~--------~~~~~~~~~~~ 143 (387) +++-..++++..+. +.-+.++++-+.....++...+.+...++++..+|..++.--.. . ........... T Consensus 182 ~~~f~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~G~g~~~~~~gi~~~~~~~~~~~~~~ 260 (404) T protein:vir:10 182 NGKLERFNFKLKDL-ADFMSIPNDLLKFADKSLEDWIINWFVDKVRITRNAEILYGAGGDEHATGIMTANKFKKITLPKS 260 (404) T ss_pred ccceeeeEeeheee-EeeehhhHHHHhhcHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcccceeeccccceeecccc Confidence 23344555555332 33455666544444567777788888999999999988732110 0 00111122334 Q ss_pred hhHHHHHHHHH-HHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeec-cceee Q lcl|NC_021299. 144 EIWNGVVSNRR-WLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDT-LPHGD 221 (387) Q Consensus 144 ~~~~~i~~a~~-~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~-~~~~~ 221 (387) ..++++..+.. .|....- .+-.++++|..+..|.+....... .. ....+..+..+.+.|+.|+.... ++... T Consensus 261 ~~~~~~~~~~~~~l~~~~~--~~~~~v~n~~~~~~L~~lkd~~G~-~l---~~~~~~~~~~~~l~G~PV~~~~~~~~~~~ 334 (404) T protein:vir:10 261 PALKDFKKCKNVELLNVFK--ATSSWIVNQDGFNYLDSLEDKTGR-PY---LQPDPKDPTQYRFLGLPVIELPNDLLLST 334 (404) T ss_pred ccHHHHHHHHHhhhhcccc--CCCEEEEcHHHHHHHHHhhccCCc-ee---eccCcCCCCCccccceeeEEecccccCCC Confidence 55777766543 4443332 233568999998887653211110 00 00112344556778888764332 22211 Q ss_pred eeeeccccccccccccccccCceeeeeeecccccceeeeeeeeeec----cceeeeeeeeeeeeccccceeeeccceecc Q lcl|NC_021299. 222 AYLSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDAT----STTERSIVDTWIGVKAVLDPVTANLDDEPR 297 (387) Q Consensus 222 ~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~----~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 297 (387) .. ...+.+ +.. .. .........+.+..+.+.. ............+ ........ T Consensus 335 ~~---~~~~~~--gd~---s~------~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d---------~~v~~~~a 391 (404) T protein:vir:10 335 ES---AIPVLL--GDT---KE------AYKYVSDGAYELATTNIGAGAFETNTTKARIIMRID---------GNVKDSEA 391 (404) T ss_pred CC---ccEEEE--Eec---cc------cEEEEEecceEEEEeccccchhhcCceEEEEEEeec---------cEEecccc Confidence 00 000000 000 00 0000000001111000000 0000000000000 00000000 Q ss_pred ccccceeeeeeeecccccc Q lcl|NC_021299. 298 FVRGTRIHLKATDAEIEGE 316 (387) Q Consensus 298 ~v~~~~v~~~~~~~~~~~~ 316 (387) + ..+++.. ...+. T Consensus 392 ~---~~~~~~~---aa~~~ 404 (404) T protein:vir:10 392 L---LIAEIPV---ESVQA 404 (404) T ss_pred e---EEEEeec---ccCCC Confidence 0 0000000 00000 No 139 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=97.00 E-value=0.00021 Score=40.70 Aligned_cols=273 Identities=10% Similarity=0.018 Sum_probs=113.9 Q ss_pred Ccc-ccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccc-cccccce Q lcl|NC_021299. 1 MAN-AFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVA-SDLTEVT 78 (387) Q Consensus 1 Ma~-~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 78 (387) .++ -+++|+.|+.++++.+++...+..++..-. . .+...+++++....... ...-.+++....- +.++-.. T Consensus 121 ~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~---~---~~~~~~~~~~~~~~~~~-~a~~v~E~~~~~~~~~~~~~~ 193 (408) T protein:vir:10 121 DSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVES---V---STSNGSRVYEKWTDVTP-LTVMDAEDGKIPDLDNPQLTI 193 (408) T ss_pred ccCCceeccHhHHHHHHHHHHhhchhhhhcceee---c---cCCcceEEEeecccccc-ceeeecCccccccccCcceee Confidence 111 247799999999999999999888765321 1 12223344332211100 0111123333332 2344455 Q ss_pred EEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHHHH-HHHh Q lcl|NC_021299. 79 VDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVSNR-RWLN 157 (387) Q Consensus 79 ~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~-~~l~ 157 (387) +++...+. +.-+.++++-+.....++...+.+...++++.++|..++..... +........|++++++. ..|+ T Consensus 194 i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~-----~~~~~~~~~~~~l~~~~~~~~~ 267 (408) T protein:vir:10 194 IKYLIKRY-AGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKA-----APKKPTIAKFDDVITMINTAVD 267 (408) T ss_pred EEeeeeeE-EeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccc-----cccccccccHHHHHHHHHHhhh Confidence 66655333 34455665544445667877778888899999999988753221 12223345677777754 4454 Q ss_pred hccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeec--cceeee----eeecc-c-c Q lcl|NC_021299. 158 EQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDT--LPHGDA----YLSHP-T-A 229 (387) Q Consensus 158 ~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~--~~~~~~----~~~~~-~-a 229 (387) ... ..+-..+++|..+..|.+....... ..- ......|..+.+.|+.++.+.+ +|.... +.+.. . . T Consensus 268 ~~~--~~~a~~v~n~~~~~~l~~lkd~~G~-~i~---~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~ 341 (408) T protein:vir:10 268 PAI--IATSSLLTNQSGLNKLALVKTAEGK-YLL---EPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQA 341 (408) T ss_pred hhh--ccCCEEEEcHHHHHHHHHhhccCCc-eEe---ccCcCCCCCceecceeeEEecccccCccCCCceEEEEEehhcc Confidence 433 2344678999999888653211110 000 0112344556788988876543 232111 00000 0 0 Q ss_pred ccccccccccccCceeeeeeecc----cccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceee Q lcl|NC_021299. 230 YAMLTRSPGRPMTNTVATSTVAT----ENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIH 305 (387) Q Consensus 230 ~~~~~~~~~~~~~~t~~~~~~~~----~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~ 305 (387) +.+.. ..+.......... .+...+..... .+....+ ...++ .++ T Consensus 342 ~~~~~-----~~~~~v~~~~~~~~~f~~~~~~~r~~~r------~d~~v~~------------------~~a~~---~~~ 389 (408) T protein:vir:10 342 ITLFD-----RENMSLLPTNIGAGAFETDTTKIRVIDR------FDVKATD------------------SEALV---AGS 389 (408) T ss_pred EEEEE-----ecceEEEEcccccchhhcCceEEEEEEe------eccEEec------------------cccEE---EEE Confidence 00000 0000000000000 00000000000 0000000 00000 000 Q ss_pred eeeeeccccccccccccceeEEEeeccCCccccCcce Q lcl|NC_021299. 306 LKATDAEIEGETVKAGEKLALALEDSNGDNRAGDPLV 342 (387) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 342 (387) +... +-..+.....+.+. | T Consensus 390 ~~~~-------~~~~~~~~~~~~~~-----------~ 408 (408) T protein:vir:10 390 FSAI-------ADQVGNFKTTTSTA-----------V 408 (408) T ss_pred eecc-------ccCCCCCCCCCccc-----------C Confidence 0000 00000000000000 0 No 140 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=96.99 E-value=0.00022 Score=40.63 Aligned_cols=259 Identities=11% Similarity=0.018 Sum_probs=113.7 Q ss_pred CccccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccc-ccccccceE Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMV-ASDLTEVTV 79 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 79 (387) -....++|+-|..++++.+++...+.+++..- . -.+.++.+|++......-.. .+++.... ..++.-..+ T Consensus 140 ~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~-----~-~~~~~~~~~~~~~~~~~~~~---~~E~~~~~~~~~~~f~~i 210 (400) T protein:vir:38 140 ADAASTIPETISNTPQRELQTVVDLKPFTNVF-----Q-ASTQKGTYPTVANATTKMVT---VAELEKNPAMAKPEFKPV 210 (400) T ss_pred cCCcccccHHHHHHHHHHHHhhhhhhhcceeE-----e-ccCcceEEEEEecCCCcccc---ccccccccccccccceee Confidence 11235889999999999999988887776421 1 12345666665422211111 12222222 233444555 Q ss_pred EEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHHHHHHHhhc Q lcl|NC_021299. 80 DIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVSNRRWLNEQ 159 (387) Q Consensus 80 ~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~~~ 159 (387) ++...+ .+.-+.++++-+.....++...+.+...++++...|..++....+. .......++++.++....-+ T Consensus 211 ~~~~~k-~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~~------~~~~~~~~~~~~~~~~~~~~- 282 (400) T protein:vir:38 211 NWSVET-YRQALPVSQESIDDSAIDLVGLIAQNGQQIKVNTTNGAVATLLKGF------TAKTISSVDDLKHINNVDLD- 282 (400) T ss_pred Eeehhh-eeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhhhhccccc------cccccccHHHHHHHHHhhhh- Confidence 555523 2344556655444445667777777788888888888776432211 12233456666665432211 Q ss_pred cCCcCCcEEEEchHHHHHHhcccchhhhhhcccc-cceeeeeeEEEEeecceeeeeeccceeeeeeeccccccccccccc Q lcl|NC_021299. 160 KVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEA-GASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTAYAMLTRSPG 238 (387) Q Consensus 160 ~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~-~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a~~~~~~~~~ 238 (387) +..+...+++|..+..|.+...- .|.- ....+..|..+.+.|+.++.+..+|....... .+..+... T Consensus 283 --~~~~a~~v~~~~~~~~l~~lkd~-----~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~~~g~~-----~~~~gd~s 350 (400) T protein:vir:38 283 --PAYSRVIIASQSFYNFLDTVKDG-----NGRYLLQDSILTPSGKSVLGMPIAVVSDDTLGAAGEA-----HAFLGDIK 350 (400) T ss_pred --hhhCcEEEEcHHHHHHHHHhhcc-----CCCeeeecCcCCCCccccccceeEEecccccCCCCce-----EEEEEecc Confidence 22345678999998887653111 0110 01123345556789999988877664321100 00000000 Q ss_pred cccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeeee Q lcl|NC_021299. 239 RPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKATD 310 (387) Q Consensus 239 ~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~~ 310 (387) . ........+..+.+..+ + .. ......... ......... ....+++.+.. T Consensus 351 ---~----~~~~~~~~~~~~~~~~~-~-~~-~~~~~~~~r---------~d~~~~~~~---a~~~l~~~~~a 400 (400) T protein:vir:38 351 ---R----AILFANRADFMVRWVDD-Q-IY-GQFLQAGMR---------FGVSVADEK---AGYFLTYTPKA 400 (400) T ss_pred ---c----cEEEEeecceEEEEecc-c-cc-ceeEEEEEE---------eccEEeccc---ceEEEEeecCC Confidence 0 00000001111111110 0 00 000000000 000000000 01111111111 No 141 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=96.98 E-value=0.00023 Score=40.54 Aligned_cols=272 Identities=13% Similarity=0.023 Sum_probs=114.3 Q ss_pred Ccc------ccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccc-ccc Q lcl|NC_021299. 1 MAN------AFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMV-ASD 73 (387) Q Consensus 1 Ma~------~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~-~~~ 73 (387) |+- .+++|+-|...+++.+++...+..+++.- . . .+.+.++|+.......... .+++.... ..+ T Consensus 109 ~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~--~-~---~~~~~~~~~~~~~~~~~~~---~~E~~~~~~~~~ 179 (389) T protein:vir:10 109 TSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKT--P-V---TTPKGTYPILKRATDRFSS---VAELAENPKLAE 179 (389) T ss_pred hcccccCCcceeehHHHHHHHHHHHHhhhhHHhhccee--e-c---cCCeeEEEEEecCCCcccc---cccccccccccc Confidence 332 14789999999999999999887776421 1 1 1334555544321111111 12222222 234 Q ss_pred cccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHHHH Q lcl|NC_021299. 74 LTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVSNR 153 (387) Q Consensus 74 ~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~ 153 (387) +.-..+++.+.+. +.-+.++++-+.....++...+.+...++++...|..++...... ..........|+++.++. T Consensus 180 ~~~~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~---~~~~~~~~~~~d~l~~~~ 255 (389) T protein:vir:10 180 PEFNKVDWSVATY-RGAIPLSEEAIADSAVDLTALVGQSIKEKSVNTYNAMIAPVLQSF---TAKKTTTDTLVDSLKHIL 255 (389) T ss_pred ccceeeeeeheee-EeeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhhhccc---ccccccccccHHHHHHHH Confidence 4555666666333 344556655444445677777777788899999998887654332 222334556677777654 Q ss_pred H-HHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeecc-ceeeeeeecccccc Q lcl|NC_021299. 154 R-WLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTL-PHGDAYLSHPTAYA 231 (387) Q Consensus 154 ~-~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~-~~~~~~~~~~~a~~ 231 (387) . .++.. .+...+++|..+..|.+...-...--...........+..+.+.|+.|+..+.. +....... .+. T Consensus 256 ~~~~~~~----~~a~~~~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~---~~~ 328 (389) T protein:vir:10 256 NVDLDPA----YSRALVVTQSLFNTLDTLKDKNGRYLLHDASDSITDGTAKGTILGVPVYVVGDTLLGSLAGDQ---KAF 328 (389) T ss_pred Hhhhhhh----hCcEEEecHHHHHHHHHhhccCCCeeeecCcccccccccccccccceeEEecccccCCCCCce---EEE Confidence 3 33322 235678999998888753211110000000011112233457889998765432 22111000 000 Q ss_pred ccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeeeec Q lcl|NC_021299. 232 MLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKATDA 311 (387) Q Consensus 232 ~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~~~ 311 (387) ++ .. .. .-......+..+.+.. +..... ......-.+ ........ ...+++.... T Consensus 329 ~g--d~----~~---~~~~~~~~~~~i~~~~--~~~~~~-~~~~~~r~d---------~~~~~~~a---~~~~~~~~~~- 383 (389) T protein:vir:10 329 VG--DL----KR---GVLFTDRQQVTLAWED--SKIYGK-YLGAAFRFG---------VQKADSKA---GYFVTNTDVP- 383 (389) T ss_pred Ee--ec----cc---cEEEEeecceEEEeec--cccccc-eEEEEEEec---------cEEecccc---eEEEEeeccC- Confidence 00 00 00 0000000111111110 000000 000000000 00000000 0011111000 Q ss_pred ccccccccccc Q lcl|NC_021299. 312 EIEGETVKAGE 322 (387) Q Consensus 312 ~~~~~~~~~~~ 322 (387) .. ..++ T Consensus 384 ---~~--~~~~ 389 (389) T protein:vir:10 384 ---GS--ALGK 389 (389) T ss_pred ---CC--CCCC Confidence 00 0011 No 142 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=96.66 E-value=0.00043 Score=39.04 Aligned_cols=275 Identities=11% Similarity=0.070 Sum_probs=111.9 Q ss_pred Cc-------cccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccc Q lcl|NC_021299. 1 MA-------NAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASD 73 (387) Q Consensus 1 Ma-------~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 (387) ++ --+++|+-|..++++.+++..++..+..+- +... ...+.+|+....... .. .+++...+..+ T Consensus 125 ~~~~~~~~~gg~liP~~~~~~ii~~l~~~~~l~~~~~~~----~~~~-~g~~~~p~~~~~~~a--~~--v~Eg~~~~~~~ 195 (428) T protein:vir:10 125 MAISTAAGSGGVLIPQNIHSEVIELLRDRTIVRKLGARS----IPLP-NGNMSLPRLAGGATA--SY--TGENQDAKVSE 195 (428) T ss_pred hhhcccccCCccccchhHHHHHHHHHhhhchhhhhccee----eecC-CcceEEEEEeCCcce--ee--eccCccccccc Confidence 11 124789999999999999988876663221 1111 123566654322111 11 23445555566 Q ss_pred cccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcc---cc--------cc---ccc Q lcl|NC_021299. 74 LTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKA---PY--------EK---VSL 139 (387) Q Consensus 74 ~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~---~~--------~~---~~~ 139 (387) ++-..+++...+. +.-+.++++-+.....++...+.+...++++.++|..++.- .+. +. .. ... T Consensus 196 ~~f~~i~~~~~k~-~~~v~is~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G-~G~~~~p~Gi~~~~~~~~~~~~~~ 273 (428) T protein:vir:10 196 ARFDDVKLTAKTM-IAMVPISNALIGRAGFNVEQLVLQDILTAISVREDKAFMRD-DGTGDTPIGMKARATQWNRLLPWA 273 (428) T ss_pred cceeeEEeeeEEE-EEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhcc-CCCCcccccccccccccccccccc Confidence 6666667666333 34566666655555667777777888999999999987731 010 00 00 000 Q ss_pred CCcchhHHHH---HHHHHHHh-hccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeee Q lcl|NC_021299. 140 VDEDEIWNGV---VSNRRWLN-EQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVD 215 (387) Q Consensus 140 ~~~~~~~~~i---~~a~~~l~-~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~ 215 (387) ......++.+ .++...+. .......+-..+++|..+..|.+... ..|. ..+....-|.+.|+.++.++ T Consensus 274 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd-----~~G~---~i~~~~~~g~l~G~pv~~~~ 345 (428) T protein:vir:10 274 ADAAVNLDTIDTYLDSIILMSMDGNSNMISSGWGMSNRTYMKLFGLRD-----GNGN---KVYPEMAQGMLKGYPIQRTS 345 (428) T ss_pred ccccccHHHHHHHHHHHHHhhhccccccccCEEEEcHHHHHHHHHhhc-----cCCc---eeccCCCCCeeeceeeEEec Confidence 1112222222 22221111 11122223345789998887754221 1111 11111222468899999988 Q ss_pred ccceeeeeeeccccccccccccccccCceeeeeeecccccceeeeeeeeeeccc-eeeeeeee-eeeeccccceeee--c Q lcl|NC_021299. 216 TLPHGDAYLSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATST-TERSIVDT-WIGVKAVLDPVTA--N 291 (387) Q Consensus 216 ~~~~~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~-~~~~~~~~-~~g~~~~~~~~~~--~ 291 (387) .+|...........+.+.. ++.........+.+..+.+.... ........ ......+...... . T Consensus 346 ~~p~~~~~~~~~~~i~~gd------------~s~~~i~~~~~i~i~~~~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~ 413 (428) T protein:vir:10 346 AIPANLGEGGKESEIYFAD------------FNDVVIGEDGNMKVDFSKEASYIDTDGKLVSAFSRNQSLIRVVTEHDIG 413 (428) T ss_pred cccccccCCCccceEEEEe------------cceEEEEEecceEEEeecccccccccccccchhhcchhheeeeeeeCce Confidence 8876532211111111000 00000000000011000000000 00000000 0000000000000 0 Q ss_pred cceeccccccceeee Q lcl|NC_021299. 292 LDDEPRFVRGTRIHL 306 (387) Q Consensus 292 ~~~~~~~v~~~~v~~ 306 (387) ......++....+.- T Consensus 414 v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 414 FRHPEGLVLGTGVLF 428 (428) T ss_pred eeccceEEEEeccCC Confidence 000111111111110 No 143 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=96.62 E-value=0.00046 Score=38.86 Aligned_cols=278 Identities=9% Similarity=-0.048 Sum_probs=112.7 Q ss_pred Cc--------cccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccc-c Q lcl|NC_021299. 1 MA--------NAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMV-A 71 (387) Q Consensus 1 Ma--------~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~-~ 71 (387) |+ ...++|+-|+.++++.+++..++..+++.-. .. ++...+++|........ ..-.+++.... . T Consensus 105 ~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~---~~---~~~~~~~~~~~~~~~~~-a~~v~E~~~~~~~ 177 (395) T protein:vir:38 105 VTSGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVEN---VT---TSHGSRVYEKLADITPL-KDLDDESALIGDN 177 (395) T ss_pred HhhccCccCCCceecchhHhhHHHHHHHhhcchhhhcceee---cc---CCcceEEEEeeccCCcc-ccccccccccccc Confidence 11 1237899999999999999999888764311 11 11222222211110000 00011222222 1 Q ss_pred cccccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHH Q lcl|NC_021299. 72 SDLTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVS 151 (387) Q Consensus 72 ~~~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~ 151 (387) +.++-..++++..+.. .-+.++++-+.....++...+.++..++++..+|..++.-.. . +.+......|+++.+ T Consensus 178 ~~~~f~~v~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g-~----~~~~~~~~~~~~i~~ 251 (395) T protein:vir:38 178 DDPELTVVKYLIHRYA-GITTVTNTLLKDTVDNIIQWLVNWAAKKDVVTRNAKILEVMG-K----APKKPTISQFDNIKD 251 (395) T ss_pred cccceeeEEeeeeeeE-eehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc-c----cccccccccHHHHHH Confidence 2233445555553332 334555554444456777777788889999999988774211 1 112233345777776 Q ss_pred HHH-HHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccc-cceeeeeeEEEEeecceeeeeeccceeeeeeecccc Q lcl|NC_021299. 152 NRR-WLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEA-GASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTA 229 (387) Q Consensus 152 a~~-~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~-~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a 229 (387) +.. .|.... ..+-..+++|..+..|.+...- .|.- ....+..|..+.+.|+.++.+..++...... ... T Consensus 252 ~~~~~l~~~~--~~~a~~v~n~~~~~~L~~lkd~-----~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~--~~~ 322 (395) T protein:vir:38 252 LENNTLDPAI--ESTSSFITNQSGYNILSKVKDA-----DGRYLMQPDVTSPDKYLIDGKPVIRIADKWLPDVSG--SHP 322 (395) T ss_pred HHHHhhhhhh--cCCCEEEEcHHHHHHHHHhhcc-----CCceeeccCcCCCCcceeccceeEEecccccCcCCC--cce Confidence 543 443332 2345678999998888653211 1110 0112344555678899988876543321110 000 Q ss_pred ccccccccccccCceeeeeeecccccceeeeeeeeee--ccceeeeeeeeeeeeccccceeeeccceeccccccceeeee Q lcl|NC_021299. 230 YAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDA--TSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLK 307 (387) Q Consensus 230 ~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~--~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~ 307 (387) +.++ .. .. .-......+..+.+...... .............+. ....... ...+++. T Consensus 323 i~~g--d~---~~----~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~---------~~~~~~a---~~~~~~~ 381 (395) T protein:vir:38 323 LYFG--DL---KQ----GITLFDRQQMQIDTTNVGAGSFEHDTTKLRFIDRFDV---------QLIDDGA---FAAASFK 381 (395) T ss_pred EEEE--ec---cc----cEEEEEecceEEEEeccccchhhcCceEEEEEEeecc---------EEecccc---eEEEEee Confidence 0000 00 00 00000000111111100000 000000000000000 0000000 0011111 Q ss_pred eeecccccccccccc Q lcl|NC_021299. 308 ATDAEIEGETVKAGE 322 (387) Q Consensus 308 ~~~~~~~~~~~~~~~ 322 (387) .. .+-.+.+...|+ T Consensus 382 ~~-~~~~~~~~~~~~ 395 (395) T protein:vir:38 382 TV-ANQAQGTAGTGK 395 (395) T ss_pred cc-cCCCCCccCCCC Confidence 10 011112222222 No 144 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=96.49 E-value=0.00058 Score=38.33 Aligned_cols=271 Identities=7% Similarity=-0.085 Sum_probs=116.9 Q ss_pred Cc------cccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccc-ccc Q lcl|NC_021299. 1 MA------NAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMV-ASD 73 (387) Q Consensus 1 Ma------~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~-~~~ 73 (387) |. ...++|+-+..++++.+++..++..++.... . .+...+++++.......... .+++.... ... T Consensus 91 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~---~---~~~~~~~~~~~~~~~~~a~~--v~Eg~~~~~~~~ 162 (371) T protein:vir:81 91 MSEGSNQDGGYTVPQDIQTRINELRESKDALQNLITVEP---V---TTLSGSRVFKKRSQQTGFVE--VAEGAAIGEKAT 162 (371) T ss_pred hccCCCccCceeecHhHHHHHHHHHHhhhhhhhhceeee---c---cCCceeEEEEeecCCcceee--eccccccccccc Confidence 32 2247899999999999999999888764321 1 12233333332211111111 12233332 233 Q ss_pred cccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHHHH Q lcl|NC_021299. 74 LTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVSNR 153 (387) Q Consensus 74 ~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~ 153 (387) ++-..+++...+.. .-+.++++-+.....++...+.++..++++..+|..++..... ...+....++++..+. T Consensus 163 ~~f~~i~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~a~~~~~~~~i~~g~g~------~~~~~~~~~~~i~~~~ 235 (371) T protein:vir:81 163 PQFTLLQYQVKKYA-GFFRVTNELLNDSTEAIVNTLVRWIGDESRVTRNGLIINVLNT------KAKTAIADLDGLKQII 235 (371) T ss_pred cceeeEEeeeeEEE-EeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc------ccccccccHHHHHHHH Confidence 45566666664432 3456666654444567777778888899999999887753211 1112234466666543 Q ss_pred -HHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeeeccc--cc Q lcl|NC_021299. 154 -RWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPT--AY 230 (387) Q Consensus 154 -~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~--a~ 230 (387) ..|.... ..+-..+++|..+..|.+...-.. ... ....+..|..+.+.|+.++.+..+|.+........ .. T Consensus 236 ~~~l~~~~--~~~a~~vmn~~~~~~L~~lkd~~g-~~l---~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~~ 309 (371) T protein:vir:81 236 NVQLDPVF--RSTSSVIVNQDAFNWLDTLKDQNG-QYL---LQPSISSPTGRQLLGLPVVIVSNKVLANRVDGGTGAQFA 309 (371) T ss_pred Hhhcchhh--hcCCEEEEcHHHHHHHHHhhccCC-Cee---eecccCCCCCceecceeEEEecccccCccccccccCCcc Confidence 3443332 234567899999888765321110 000 01123445567889999999888775432211100 00 Q ss_pred cccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeee Q lcl|NC_021299. 231 AMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKAT 309 (387) Q Consensus 231 ~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~ 309 (387) .+..+.. .. ...........+..+..... .... +...+............ .-....+++... T Consensus 310 ~i~~Gd~--------~~-~~~~~~~~~~~i~~~~~~~~---~f~~----~~v~~~~~~r~d~~~~~-~~a~~~~~~~~A 371 (371) T protein:vir:81 310 PIIVGDL--------KE-AVVMFDRQRTEIMSSNVAMD---AFET----DATLWRAIERMDVKMRD-DEAFVFGEVQLA 371 (371) T ss_pred eEEEEeh--------hc-eEEEEeecceEEEEeccccc---hhhc----CceEEEEEEeeccEEec-ccceEEEEEecC Confidence 0000000 00 00000000111111000000 0000 00000000000000000 000000000000 No 145 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=96.43 E-value=0.00063 Score=38.14 Aligned_cols=265 Identities=11% Similarity=-0.026 Sum_probs=110.3 Q ss_pred Ccc------ccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccc-cc Q lcl|NC_021299. 1 MAN------AFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVA-SD 73 (387) Q Consensus 1 Ma~------~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~-~~ 73 (387) |+. ..++|+.|..++++.+++..++..+++.-. ..... ..+.++.-.... .... .+++..... .. T Consensus 123 ~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~~~~~---~~~~~-~~~~~~~~~~~~--~a~~--v~Eg~~~~~~~~ 194 (397) T protein:vir:12 123 MSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQYVTVEP---VTTRS-GTRLLEKNADMV--PFSP--VEELGNLPEIDQ 194 (397) T ss_pred ccccccccCcccCchhHHHHHHHhhhhhhhHHhhcceee---ccCCc-eeEEEEEecCCc--ceee--eccccccccccc Confidence 322 247899999999999999998877764321 11111 233343211111 1111 122333322 23 Q ss_pred cccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHHHH Q lcl|NC_021299. 74 LTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVSNR 153 (387) Q Consensus 74 ~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~ 153 (387) ++-..+++...+. +.-+.++++-+.....++...+.++..++++.++|..++..-. ... ......|+++.++. T Consensus 195 ~~~~~v~~~~~k~-~~~~~is~e~l~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~g---~~~---~~g~~~~~~i~~~~ 267 (397) T protein:vir:12 195 PRFTKVSYSIIDY-GGIMTLSNSMLNDSDQAIMTYVAKWFAKKSVVTRNNLILAAIA---SLK---KVDIDGLDGIKKAL 267 (397) T ss_pred ccceeEEeeheee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhccc---ccc---ccccccHHHHHHHH Confidence 3445555555333 2344555554444456777778888899999999988774211 111 12234567777654 Q ss_pred -HHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccc-cceeeeeeEEEEeecceeeeeeccceeeeeeecccccc Q lcl|NC_021299. 154 -RWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEA-GASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTAYA 231 (387) Q Consensus 154 -~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~-~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a~~ 231 (387) ..|+... ..+-..+++|..+..|.+...- .|.- ....+..|..+.+.|+.++.++......... ...+. T Consensus 268 ~~~l~~~~--~~~a~~~~n~~~~~~L~~lkd~-----~G~~l~~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~--~~~~~ 338 (397) T protein:vir:12 268 NVTLDPMV--APGSIVLTNQDGYDWLDTLKDG-----TGRYLLQPDPTNPTKKLLDGRPVVPFTNRVLKTQKG--KAPLI 338 (397) T ss_pred hhccchhh--hCCCEEEEcHHHHHHHHHhhcc-----CCceeecccccCCCCccccceeeEEecccccccCCC--ccEEE Confidence 3454332 2345678999998888642110 1110 0012335555678898887665432211100 00000 Q ss_pred ccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccce-eccccccceeeee Q lcl|NC_021299. 232 MLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDD-EPRFVRGTRIHLK 307 (387) Q Consensus 232 ~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~-~~~~v~~~~v~~~ 307 (387) ++. . ... -......+..+.+....+.....+.. .+.......... .+..+....++.. T Consensus 339 ~gd--~---~~~----~~~~~~~~~~i~~~~~~~~~f~~~~~---------~~r~~~r~d~~~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 339 IGN--L---KEA----IVLFDREQQSIASTDTGAGAFETNST---------KVRGIEREDVRKWDEDAVVFGQITVE 397 (397) T ss_pred EEe--h---hce----EEEEeecceEEEEeccccchhhcCce---------EEEEEEeeccEEecccceEEEEEeeC Confidence 000 0 000 00000000011110000000000000 000000000000 0000111111111 No 146 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=95.95 E-value=0.0012 Score=36.58 Aligned_cols=298 Identities=14% Similarity=0.138 Sum_probs=121.1 Q ss_pred CccccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccc----------- Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNM----------- 69 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~----------- 69 (387) |+.++=+ --|.+..|...++.++|.++...- . +--..|.||.++.+.+..... .+...+-.... T Consensus 19 ~~~~~~t-~y~~~k~L~~Aa~~lv~~~fA~~~--p-iPkn~GkTIk~r~y~pl~~~~-~pl~eGv~a~G~~~~~g~~y~~ 93 (401) T protein:vir:95 19 NSDQMQT-FFWLKKAIITARKEQYFMPLASVT--N-MPKHYGKTIKVYEYVPLLDDR-NINDQGIDASGATIVNGNLYGS 93 (401) T ss_pred ccceeee-hhhHHHHHhhhhhhhhhhhccccc--c-cccccCCeEEEEecccccccc-cchhcCCCcccccccCcccccc Confidence 4443311 247777888888889998887432 2 223569999998765543321 11111111000 Q ss_pred --ccc-------------------ccccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHH-- Q lcl|NC_021299. 70 --VAS-------------------DLTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVS-- 126 (387) Q Consensus 70 --~~~-------------------~~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~-- 126 (387) ++. ..+-..+..+|.| -.+=..++|+-...+..+.+.+++.+-+..-+..+..+++ T Consensus 94 ~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~q-yG~~~e~Td~~~dt~~D~~l~~h~s~ell~g~~~~t~d~i~~ 172 (401) T protein:vir:95 94 SKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHK-FGFFYEFTQESIDFDSDDGLMEHLSRELMNGATQITEAVLQK 172 (401) T ss_pred ccccceeecccccccccccccccccceeeeeeeeeee-ccCccchhhhhhhhhcchHHHHHHHHHHhhhhhhhHHHHHHH Confidence 111 1111223344422 2344567776655555555555433333333333333322 Q ss_pred HHHhcc--------cc--cc---cccCCcchhHHHHHHHHHHHhhccCCc----------C-------CcEEEEch---- Q lcl|NC_021299. 127 YLITKA--------PY--EK---VSLVDEDEIWNGVVSNRRWLNEQKVPK----------D-------GRVLLVGS---- 172 (387) Q Consensus 127 ~~~~~~--------~~--~~---~~~~~~~~~~~~i~~a~~~l~~~~vp~----------~-------~r~~v~~~---- 172 (387) .+++.+ .. .. ....+....++.+.++.+.|+++..|. - -|++++.| T Consensus 173 dll~ag~~viyAg~ats~At~~~~~~~~t~vt~~~l~rl~~~L~~nRapk~t~~i~~s~~~dTk~i~~s~va~~h~~L~~ 252 (401) T protein:vir:95 173 DLLAAAGTVLYAGAATSDATITGEGSTPSVVSYKNLMRLDQILTENRTPTQTTIITGSRMIDTKVIGATRVMYVGSELVP 252 (401) T ss_pred HHHhhcCeeecCCccceeeeccccccccceechhHHHHHHHHHHhcccccchhhhhhhhccCccccccceEEEEecCchh Confidence 112111 11 11 122344466899999999999988776 1 24577666 Q ss_pred HH--HHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeeeccccccccccccccccCcee-eeee Q lcl|NC_021299. 173 AV--EEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTAYAMLTRSPGRPMTNTV-ATST 249 (387) Q Consensus 173 ~~--~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a~~~~~~~~~~~~~~t~-~~~~ 249 (387) +. ...+.+++.|....+.++. ..+.+|.+|.+.++.++.+..+..-.......++..-....+....+... .+.. T Consensus 253 di~a~~D~~~~~~fi~v~kYa~~--~~i~~gEiG~i~~vR~i~~p~~~~w~~ag~~a~~~~~~y~~~~~~~gg~~dVyp~ 330 (401) T protein:vir:95 253 ELKAMKDLFGNKAFIETQHYADA--GTIMNGEVGSIDKFRIIQVPEMLHWAGAGAQATGANPGYRTSMVSGQEHYDVYPM 330 (401) T ss_pred HHHHHHHhcCCCCceehhhcCCc--cccccccccccCceeEEecccceeecCCcccccccccccccccccCCCcceeeee Confidence 33 3566678899988887764 56789999999999988765532111111000000000000000000000 0000 Q ss_pred ecccccceeeeeeeeeeccce---eeeeeeeeeeecc--ccceeee---------ccceeccccccceeeeeeeeccccc Q lcl|NC_021299. 250 VATENGVQLRWLGDYDATSTT---ERSIVDTWIGVKA--VLDPVTA---------NLDDEPRFVRGTRIHLKATDAEIEG 315 (387) Q Consensus 250 ~~~~~~~~~~~~~~~d~~~~~---~~~~~~~~~g~~~--~~~~~~~---------~~~~~~~~v~~~~v~~~~~~~~~~~ 315 (387) .. .....+..+ ........ .........++.. ....... ...-..... +.+. ++.+ T Consensus 331 lV-~G~dAf~~~-~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQ~g~vgwK~~~a~~vL~~e~m----~~ie----s~a~ 400 (401) T protein:vir:95 331 LV-VGDDSFTSI-GFQTDGKSLKFTVMTKMPGKETADRNDPYGETGFSSIKWYYGILVKRPERL----ALIK----TVAP 400 (401) T ss_pred eE-Eccccceec-ccccCCccccceeEeecCCcCCCCCCCcccceehhhhhhhhhhheecccee----EEEE----eecC Confidence 00 000000000 00000000 0000000000000 0000000 000000000 0000 0000 Q ss_pred c Q lcl|NC_021299. 316 E 316 (387) Q Consensus 316 ~ 316 (387) . T Consensus 401 ~ 401 (401) T protein:vir:95 401 L 401 (401) T ss_pred C Confidence 0 No 147 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=95.84 E-value=0.0014 Score=36.27 Aligned_cols=272 Identities=10% Similarity=0.038 Sum_probs=109.0 Q ss_pred Cccc------cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccc-----c Q lcl|NC_021299. 1 MANA------FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRN-----M 69 (387) Q Consensus 1 Ma~~------~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~-----~ 69 (387) ||.. .++|+.+.+++++.+++..++..++..- . -.+.++++|+........ . .+++.. . T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~-----~-~~~~~~~~p~~~~~~~a~--w--v~E~~~~~~~~~ 70 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNV-----N-MGTKTTHLPVLATLPEAD--W--VGESATDPKGVK 70 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhccee-----e-ccCCcEEEEEEeCCcceE--E--eecccccccccc Confidence 8865 3789999999999999999988887532 1 124567776543222111 1 111221 2 Q ss_pred cccccccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------------- Q lcl|NC_021299. 70 VASDLTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPY--------------- 134 (387) Q Consensus 70 ~~~~~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~--------------- 134 (387) ...+++-..+++...| .+.-+.++++-+.....++...+.++..++++.++|..++.-- +.+. T Consensus 71 ~~s~~~f~~i~~~~~k-~~~~~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~-g~~~~~~~~~~~~~~~~~~ 148 (305) T protein:vir:25 71 PTSKVTWANRTLVAEE-IAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGT-DKPASWVSPALIPAAVTAG 148 (305) T ss_pred cccccceeeEEeeeEE-EEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhheecc-CCCCCcccccccccccccc Confidence 2233444455555433 2344566665555556777777788888999999999887311 1100 Q ss_pred cccccCCcchhHHHH----HHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecce Q lcl|NC_021299. 135 EKVSLVDEDEIWNGV----VSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYD 210 (387) Q Consensus 135 ~~~~~~~~~~~~~~i----~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~ 210 (387) ...........+.++ ..+...+.+.... ..-.+++|..+..|.+.. . .. +...++. +.+.|+. T Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~v~~~~~~~~l~~lk---d--~~---G~~i~~~---~~l~G~P 215 (305) T protein:vir:25 149 QAVEVVGGVANESDIVGATNRAAKAVASAGWA--PDTLLSSLALRYEVANIR---D--AN---GNPVFRD---DSFAGFR 215 (305) T ss_pred ccccccccchhhhHHHHHHHHHHHhhhhcccc--cceeEecHHHHHHHHHhh---c--cC---CceeecC---Ccccccc Confidence 001111112222333 3333333322211 123678999888875421 1 11 1122222 3577888 Q ss_pred eeeeeccceeeeeeeccccccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeee-eeeccccceee Q lcl|NC_021299. 211 VVTVDTLPHGDAYLSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTW-IGVKAVLDPVT 289 (387) Q Consensus 211 v~~s~~~~~~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~-~g~~~~~~~~~ 289 (387) ++.++.++....-. .+..+. +.........+..+.... +............. ........... T Consensus 216 v~~~~~~~~~~~~~------~~~~gd--------~s~~~i~~~~~~~i~~~~--~~~~~~~~~~~~~~~~~~~~~R~~~r 279 (305) T protein:vir:25 216 TFFNRNGAWDADAA------IEVIAD--------SSRVKIGVRQDITVKFLD--QATLGTGENQINLAERDMVALRLKAR 279 (305) T ss_pred eEEcCccCCCCCcc------EEEEEe--------cceEEEEEecCeEEEEee--eeeeecCCceeeeeecCcEEEEEEEe Confidence 87776655321100 000000 000000000011110000 00000000000000 00000000000 Q ss_pred eccc--eeccccccceeeeeeeeccccccc Q lcl|NC_021299. 290 ANLD--DEPRFVRGTRIHLKATDAEIEGET 317 (387) Q Consensus 290 ~~~~--~~~~~v~~~~v~~~~~~~~~~~~~ 317 (387) .+.. .....+ .++..+.. .+++.+ T Consensus 280 ~~~~v~~p~a~v---~~~~~~~~-~~~pa~ 305 (305) T protein:vir:25 280 FAYVLGVSATAQ---GANKTPVA-VVAPAA 305 (305) T ss_pred ecceeeCcccEE---EEcccccc-ccCCCC Confidence 0000 000000 00000000 000001 No 148 >protein:vir:96792 Length: 315 # NCBI annotation: major capsid protein # Family: family:all:47 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224246;genbank:gi:62362381;genbank:GeneID:3345731 Probab=95.83 E-value=0.0014 Score=36.25 Aligned_cols=292 Identities=12% Similarity=0.008 Sum_probs=117.6 Q ss_pred Ccccc------ccHHHHHHHHHHHHHhhcc-ccc-------eeeecccccccccCCCEEEEEeccc-ceeeceecccccc Q lcl|NC_021299. 1 MANAF------IKPPVIIASILGQLQHELV-LPN-------FVFKNGYGDVAHKFNDTITIRIPVP-TIAHTRGLRATGA 65 (387) Q Consensus 1 Ma~~~------~~pe~~~~~~~~~l~~~~~-~~~-------~~~~d~~~~~~~~~gdtv~i~~~~~-~~~~~~~~~~~~~ 65 (387) ||-+. |.|.+..+ .++.+.+.+. |-. |.|.-++ ||=...+.... .-..+.. ... T Consensus 1 ~~~t~~sdl~vfn~~~~~a-~~e~~~~~~~~Fnaas~Gai~l~~~~~~-------GDf~~~~ff~i~~~~~~rn---v~~ 69 (315) T protein:vir:96 1 MATTVNSDLVIYNDTAQTA-YLERNMDNLAVFNENSRAAIGLNSELIE-------GDLKLRSFYKVGGAIADRD---VNS 69 (315) T ss_pred Cceeeecceeeehhhhhhh-HHhhhHHHHHHhhhhcCCcccccccccc-------cccccccccccccchhhcc---cCC Confidence 88653 66776665 4555555443 211 1122222 33333322220 0001111 112 Q ss_pred cccccccccccc-eEEEEEEeeeecceeeccHHHhhhhhhHH---HHHHHHHHHHHHHHHHHHHHHHHhcccc----ccc Q lcl|NC_021299. 66 DRNMVASDLTEV-TVDIKLTDVIYNRIDLTDEERELDVRSFA---VDVLPRQVRAVAEQIEDAVSYLITKAPY----EKV 137 (387) Q Consensus 66 ~~~~~~~~~~~~-~~~~~id~~~~~~~~~~d~~~~~~~~~~~---~~~~~~~~~~la~~vd~~~~~~~~~~~~----~~~ 137 (387) ..++....++.. ++.+.+ .+.+-++.++..++...-.+++ .++.++...++-+.+-...++.+.++.. ... T Consensus 70 ~~~~t~~kit~~~dvaVk~-~~~~~~~~~~~~~~a~~g~dp~~~~~~i~~~~~~~~l~~~l~~~l~~~~aai~~~t~~~~ 148 (315) T protein:vir:96 70 TATVAGTKIAADEMVSVKV-PWKYGPYETTEEAFKRRARSPEEFSMLIGQDMADATMAGWIGYALNALQGAIGSNAGMNV 148 (315) T ss_pred CccccceecccccceeEEE-eecCCchhccHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcccccccc Confidence 334555555444 344445 5566667777776664434433 3344444444334443444333332211 122 Q ss_pred ccCCcchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeecc Q lcl|NC_021299. 138 SLVDEDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTL 217 (387) Q Consensus 138 ~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~ 217 (387) .........+.+.+|..+|.++. ..=.-+++.+..+..|.+ ..+..... +......+.+..+.+ |..|..+..+ T Consensus 149 ~~~~a~~~~~~l~dA~~klGD~~--~~l~~~vMHS~v~~~L~~-q~L~~~~~--~~~~~~~~~~~~~~l-GkrViVdD~~ 222 (315) T protein:vir:96 149 SGELATEGKKVLTKGLRTMGDKA--SSIAIWVMDSTSYFDIVD-EAIDNKLY--EEAGVVVYGGTPGTL-GKPVLVTDQC 222 (315) T ss_pred cccccccCHHHHHHHHHHhcccc--cCeeEEEEchHHHHHHHH-hhhhhhcc--cccceeEecCcCccc-ccEEEEECCC Confidence 22334456788999999997764 122346688999999988 44443222 222222333333333 8899999999 Q ss_pred ceeeeeeeccccccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceecc Q lcl|NC_021299. 218 PHGDAYLSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPR 297 (387) Q Consensus 218 ~~~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 297 (387) |.+..+.+..+++.+..+.......... .............|.....+.+.. .....+. .+ ... . T Consensus 223 P~~~~~gl~~GAi~~~~~~~~~~~~~~~-~g~e~l~~~~r~e~tf~l~p~G~s----w~~~~~~---sP----t~a---e 287 (315) T protein:vir:96 223 PATKIFGLVAGAVMITESQAPGMRSYQI-DDQENLAIGFRAEGTANVEVLGYK----WKTKTNV---NP----ASA---T 287 (315) T ss_pred CcceeeeeecceeeecCCCccccccccC-CCcceeEEEEeeeeEeeeeeeeEE----eecCCCc---CC----ChH---H Confidence 9987777667766555433210000000 000000000000011000000000 0000000 00 000 0 Q ss_pred ccccceeeeeeeeccccccccccccceeEEEeecc Q lcl|NC_021299. 298 FVRGTRIHLKATDAEIEGETVKAGEKLALALEDSN 332 (387) Q Consensus 298 ~v~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 332 (387) .-.. ...+.-....+...+..+.++..+ T Consensus 288 Lat~-------~NWekV~~~~K~tagv~~~~~~~~ 315 (315) T protein:vir:96 288 LATT-------TNWEKYATDDKATAGFIITLTTTP 315 (315) T ss_pred hcCC-------cCcccccCCCcccceEEEEecCCC Confidence 0000 000000001111112222221111 No 149 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=94.86 E-value=0.0033 Score=34.18 Aligned_cols=271 Identities=14% Similarity=0.073 Sum_probs=107.6 Q ss_pred CccccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccccceEE Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEVTVD 80 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (387) ....++.|.-+.+++++.+.+...|..+++.-.-.. ..| .|+..+...... . ............+++-+.++ T Consensus 24 ~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~---~~~---~i~~~~~~~~~~-~-~~~e~~~~~~~~~~~~~~~~ 95 (321) T protein:vir:31 24 LDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGA---KKT---RIPTLNIGERHR-R-PQDEGEWNENESDVSTGTID 95 (321) T ss_pred cCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccC---cce---eeeeeccCCccc-c-cccccccccccccceeeeee Confidence 222234444456678888888877877765432111 122 333222111000 0 00011122223344455556 Q ss_pred EEEEeeeecceeeccHHHhhhh--hhHHHHHHHHHHHHHHHHHHHHHH-HHHhcccc--------------cc--cccCC Q lcl|NC_021299. 81 IKLTDVIYNRIDLTDEERELDV--RSFAVDVLPRQVRAVAEQIEDAVS-YLITKAPY--------------EK--VSLVD 141 (387) Q Consensus 81 ~~id~~~~~~~~~~d~~~~~~~--~~~~~~~~~~~~~~la~~vd~~~~-~~~~~~~~--------------~~--~~~~~ 141 (387) +.+.+ ......++.+-+.... .++...+.....++++..++...+ +.-.+.+. .. ..... T Consensus 96 ~~~~k-~~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~~~~~~n~G~l~~a~~~~~~~~~~~ 174 (321) T protein:vir:31 96 ISTEK-ATVAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAANGDEDAEDSFENQNDGFITVAEGDVETIDAAD 174 (321) T ss_pred eeeEE-EEeehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheeeccccCCCcccccchhhhhhhccccccccccc Confidence 65533 3345556655443322 356666666667777777776544 21111100 00 01112 Q ss_pred cchhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhc---ccchhhhhhcccccceeeeeeEEEEeecceeeeeeccc Q lcl|NC_021299. 142 EDEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLL---DDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLP 218 (387) Q Consensus 142 ~~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~---~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~ 218 (387) ....++.+.++...|+...--..+-..+++++....++. +.. ...+ ...+..+....+.|+.++..+.+| T Consensus 175 ~~~~~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~l~~~~----~~~~---~~~l~~~~~~tl~G~pvv~~~~mP 247 (321) T protein:vir:31 175 DILDNDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYHYTLTDRD----TPLG---DNVIMGEADVNPFSFPIIGSGLWP 247 (321) T ss_pred cccCHHHHHHHHHhccHhHhcCCCeEEEechHHHHHHHHHHhcCC----Cccc---cchhhccccccccceeEEEcCCCC Confidence 234467777777777654321122345688887655432 211 1111 223444555578899999999988 Q ss_pred eeeeeeeccccccccccccccccCceeeeeeecccccceeeeeeeeeecc-cee----eeeeeeeeeeccccceeeeccc Q lcl|NC_021299. 219 HGDAYLSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATS-TTE----RSIVDTWIGVKAVLDPVTANLD 293 (387) Q Consensus 219 ~~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~-~~~----~~~~~~~~g~~~~~~~~~~~~~ 293 (387) ....+......+.+... .+.......+.+... ... ....+..+ ... T Consensus 248 ~~~il~t~~~nl~~~~~------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~ve 298 (321) T protein:vir:31 248 DDKAMFTDPQNLIYALY------------------RDLEIDVLTESDKVSERDLHARYFMRGDDDF-----------AIE 298 (321) T ss_pred CCcEEEeccccEEEEEe------------------eccEEEEeecCccccccceeeEeeeeeecce-----------eEe Confidence 76544433333221110 000010000000000 000 00000000 000 Q ss_pred eeccccccceeeeeeeecccccc Q lcl|NC_021299. 294 DEPRFVRGTRIHLKATDAEIEGE 316 (387) Q Consensus 294 ~~~~~v~~~~v~~~~~~~~~~~~ 316 (387) ..........+..+...+..+.. T Consensus 299 ~~~a~a~~~~i~~~~~~~~~~~~ 321 (321) T protein:vir:31 299 NTEAVVLAEGLGDPLEHLEEETS 321 (321) T ss_pred ccccEEEEecCCcchhcccCCCC Confidence 11111111111111111111110 No 150 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=94.82 E-value=0.0034 Score=34.10 Aligned_cols=293 Identities=10% Similarity=-0.032 Sum_probs=122.3 Q ss_pred Cccc---cccHHHHHHHHHHHHHhhccccc------eeeecccccccccCCCEEEEEecccceeeceecccccccccccc Q lcl|NC_021299. 1 MANA---FIKPPVIIASILGQLQHELVLPN------FVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVA 71 (387) Q Consensus 1 Ma~~---~~~pe~~~~~~~~~l~~~~~~~~------~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~ 71 (387) ||-. .|.|+++.+. ++.+.+++-.-+ ++..+ ++ -.||.++.|.+..................+.+ T Consensus 1 m~lsD~~vfN~~~~~a~-~e~~~q~~~~fn~as~gai~l~~---~~--~~Gd~~~~pf~~~l~g~~~~~~~~~~~~~vt~ 74 (325) T protein:vir:95 1 MALSDLAVYSEYAYSAF-SETLRQQVDLFNTATGGAIMLQS---AA--HQGDFSDVAFFAKVTGGLVRRRNAYGSGTVAE 74 (325) T ss_pred Cchhhhhhhhhhhhhhh-hhhhhhhHhhhhhcccceeEecc---cc--ccCceeeccccccccccccccccCCCCceecc Confidence 7743 3778888764 444554433222 22211 11 23899999887755443333333334445666 Q ss_pred cccccceE-EEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHH----HHHHhcccc-------ccccc Q lcl|NC_021299. 72 SDLTEVTV-DIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAV----SYLITKAPY-------EKVSL 139 (387) Q Consensus 72 ~~~~~~~~-~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~----~~~~~~~~~-------~~~~~ 139 (387) ..++..+. .+.+ ...+++...|+.......+.+.+++.+....+++...++. ++.+.++.. ..... T Consensus 75 ~kitt~~~~av~~--~r~~g~~~~d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~~a~~~~~~~v~dis~~ 152 (325) T protein:vir:95 75 KVLKHLVDTSVKV--AAGTPPVRLDPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLNVGLGSVYSALSQVSDVVYDATAN 152 (325) T ss_pred ceeccccceeeEE--ecccCcccccHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccceeeeecc Confidence 66555443 3333 3345566666666555556666666555555555544443 333322211 11111 Q ss_pred CCc---chhHHHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeec Q lcl|NC_021299. 140 VDE---DEIWNGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDT 216 (387) Q Consensus 140 ~~~---~~~~~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~ 216 (387) ... -..++.+.+|..+|.++. ..=..+++.+..|..|.++. +............ ..+....|..|+.+.. T Consensus 153 ~~~~~~~~s~~~l~~A~~klGD~~--~~l~~~~MHS~v~~~L~~~~-L~~~~~~~~~~g~----~~i~t~~G~~VIVdD~ 225 (325) T protein:vir:95 153 TDAADKLPTWNNLNNGQAKFGDQS--SQIAAWIMHSTPMHKLYGSN-LTNGERLFTYGTV----NVVRDPFGKLLVMTDS 225 (325) T ss_pred cCcccccccHHHHHHHHHHhcccc--cceeEEEEchHHHHHHHHhh-ccccccccccCCc----ccccccCCcEEEEeCC Confidence 111 134688999999997764 22246778999999998743 3322222111111 1235567889998888 Q ss_pred cceeee--------eeeccccccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeecccccee Q lcl|NC_021299. 217 LPHGDA--------YLSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPV 288 (387) Q Consensus 217 ~~~~~~--------~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~ 288 (387) +|.... +.+..+++.+........... . ..+. -.....++... ...++ ..|...... T Consensus 226 ~p~~~~g~~~~ytty~lg~GAi~~~~~~~~~~~~~------~--~~~~-~~~~~~~~~~~---tf~lh-p~G~sw~~s-- 290 (325) T protein:vir:95 226 PNLFAAGTPNVYHILGLVPGGVLIGQNNDFDANEE------T--KNGD-ENIIRTYQAEW---SYNIG-VKGFAWDKA-- 290 (325) T ss_pred CCCCCccCceeEEEEEEecCeEEecCCCCcccccc------c--cCcc-cceeeeeeeee---eEEee-cceeeeecc-- Confidence 876432 222233332222111000000 0 0000 00000000000 00000 011111000 Q ss_pred eeccceeccccccceeeeeeeeccccccccccccceeEEEeeccCCccccCcceEEEec Q lcl|NC_021299. 289 TANLDDEPRFVRGTRIHLKATDAEIEGETVKAGEKLALALEDSNGDNRAGDPLVTWTSG 347 (387) Q Consensus 289 ~~~~~~~~~~v~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~w~Ss 347 (387) ....+++...|..+..+...+.+.-. ...|.-+.. T Consensus 291 -------------------~~g~sPt~aeL~~~~NW~rv~~~~K~-----tagv~~~~~ 325 (325) T protein:vir:95 291 -------------------NGGKSPTDAALFTSTNWDKYATSHKD-----LAGVVVKTN 325 (325) T ss_pred -------------------cccCCcChHhhcCCcCcceecCCCcc-----ccceeEeeC Confidence 00001111122222222222111000 001111111 No 151 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=94.70 E-value=0.0037 Score=33.90 Aligned_cols=256 Identities=9% Similarity=-0.005 Sum_probs=110.0 Q ss_pred Cc-cccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccc-ccccccce Q lcl|NC_021299. 1 MA-NAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMV-ASDLTEVT 78 (387) Q Consensus 1 Ma-~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 78 (387) -+ ...++|+-|...+++.+++..++.+++..- . -.+.+.++|++......-.. .+++.... .+.+.-.. T Consensus 133 ~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~-----~-~~~~~~~~~~~~~~~~~~~~---v~E~~~~~~~~~~~~~~ 203 (394) T protein:vir:97 133 KENAKPVSSEEILYTPAREVKTVVDLKPFTTVY-----Q-AKKASGKYPVLQRATTKMVT---VAELEKNPALAKPDFKD 203 (394) T ss_pred cccccccChHHHHHHHHHHhhhhhhhhhhceee-----e-ccCcceEEEEEecCCCccce---eccccccccccccccee Confidence 11 114789999999999999999888776431 1 11334566554322111111 12233332 23344455 Q ss_pred EEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHHHHHHHhh Q lcl|NC_021299. 79 VDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVSNRRWLNE 158 (387) Q Consensus 79 ~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~~ 158 (387) +++...+ .+.-+.++.+-+.....++...+.++..++++...|..++...... .......|+++.++...+-+ T Consensus 204 v~l~~~k-~~~~i~is~ell~ds~~~~~~~i~~~la~~~~~~~~~~i~~g~~~~------~~~~~~~~~~~~~~~~~~~~ 276 (394) T protein:vir:97 204 VAWNIDT-YRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSF------TTKTVKNLDEIKALLNGGFD 276 (394) T ss_pred EEeehhh-eeeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhccccc------cccccccHHHHHHHHHhhhh Confidence 6665533 2344556655444455567777777788889998888777432211 11223456766665433221 Q ss_pred ccCCcCCcEEEEchHHHHHHhcccchhhhhhcccc-cceeeeeeEEEEeecceeeeeeccceeeee-eecc-c-cccccc Q lcl|NC_021299. 159 QKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEA-GASRLQTARIGRLAQYDVVTVDTLPHGDAY-LSHP-T-AYAMLT 234 (387) Q Consensus 159 ~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~-~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~-~~~~-~-a~~~~~ 234 (387) |..+-..|++|..+..|.+...-. |.- ....+..|..+.+.|+.|+.+.....+... .+.. . .+.+. T Consensus 277 ---~~~~a~~v~n~~~~~~l~~lkd~~-----G~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~gd~~~~~~~~- 347 (394) T protein:vir:97 277 ---PAYNVSLIVSQSFYQTLDTLKDGN-----GRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFA- 347 (394) T ss_pred ---hhhCCEEEEcHHHHHHHHHhhccC-----CCeeeecCcCCCCCceeccceeEEecccccCCccEEEeeccccEEEE- Confidence 222345789999988876432111 110 001223444567889888765433222111 1000 0 00000 Q ss_pred cccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeeeecccc Q lcl|NC_021299. 235 RSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKATDAEIE 314 (387) Q Consensus 235 ~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~~~~~~ 314 (387) ...+....+..+ +. . ........-.+ ..... +.. ...+++.+. .. T Consensus 348 -----------------~~~~~~~~~~~~-~~-~-~~~~~~~~r~d---------~~v~~-~~a--~~~~~~~~~---~~ 392 (394) T protein:vir:97 348 -----------------DRKDLGLRWADN-EI-Y-GQYLQAVLRFG---------VSKVD-DKA--GYYVTFTPE---PL 392 (394) T ss_pred -----------------EecceEEEEecc-cc-c-ceeEEEEEEEc---------cEEec-ccc--eEEEEeccc---cc Confidence 000111111100 00 0 00000000000 00000 000 011111110 00 Q ss_pred cccc Q lcl|NC_021299. 315 GETV 318 (387) Q Consensus 315 ~~~~ 318 (387) + + T Consensus 393 p--~ 394 (394) T protein:vir:97 393 P--L 394 (394) T ss_pred C--C Confidence 1 1 No 152 >protein:vir:93696 Length: 364 # NCBI annotation: Bcep22gp55 # Family: family:all:974 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944284;genbank:gi:38640361;genbank:GeneID:2658350 Probab=94.13 E-value=0.0053 Score=33.06 Aligned_cols=296 Identities=11% Similarity=0.055 Sum_probs=132.7 Q ss_pred Ccccccc---HH---HHHHHHHHHHHhhccccc-eeeeccc------ccccccCCCEEEEEecccceeeceecccccccc Q lcl|NC_021299. 1 MANAFIK---PP---VIIASILGQLQHELVLPN-FVFKNGY------GDVAHKFNDTITIRIPVPTIAHTRGLRATGADR 67 (387) Q Consensus 1 Ma~~~~~---pe---~~~~~~~~~l~~~~~~~~-~~~~d~~------~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~ 67 (387) ||.+.+- |+ +|+..+-..-.+...|.+ ++-+... .|+.-..||+|+++...........- .... T Consensus 1 Ma~T~~~~~~p~a~~~ws~~l~~~~~~~s~f~~~l~G~~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~g~gv~G---d~~l 77 (364) T protein:vir:93 1 MSQTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDRITFDLSVHLRGKPTYG---DARV 77 (364) T ss_pred CceeccCcCCHHHHHHHHHHHHHHHHhhCccccccccCCCCCcEEEeeecCCCCCceEEeeeeeecccCCccc---Ccee Confidence 9987632 44 688766655555555544 4422111 23444569999997654443221110 0111 Q ss_pred cccccccccceEEEEEEeeeecceeeccH-HHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccc----------- Q lcl|NC_021299. 68 NMVASDLTEVTVDIKLTDVIYNRIDLTDE-ERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYE----------- 135 (387) Q Consensus 68 ~~~~~~~~~~~~~~~id~~~~~~~~~~d~-~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~----------- 135 (387) ...-+.+.-.+..|.||+.- .++...++ ..--...|++.+..+....=++...|+..+-.+.++... T Consensus 78 eGnee~L~~~~~~i~idq~r-~~V~~~g~ms~qRt~~dlr~~ar~~L~~w~~~~~d~~~f~~laGarg~~~~~~~~~~~~ 156 (364) T protein:vir:93 78 EGKEESLRFYQDEVRIDQVR-HSVSAGGRMSRKRTVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFIETPDFT 156 (364) T ss_pred eccccceeEEeeEEEEeecc-ccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccCcc Confidence 12234556677888887754 45655432 222345566666555555666666666655444332100 Q ss_pred ------c----------------cccC--CcchhHHHHHHHHHHHhhccCC--------------cCCcEEEEchHHHHH Q lcl|NC_021299. 136 ------K----------------VSLV--DEDEIWNGVVSNRRWLNEQKVP--------------KDGRVLLVGSAVEEA 177 (387) Q Consensus 136 ------~----------------~~~~--~~~~~~~~i~~a~~~l~~~~vp--------------~~~r~~v~~~~~~~~ 177 (387) + .... +-...++.|-.+...+...+.+ ++--++++.|.++.. T Consensus 157 ~~~~N~v~aPt~~r~~~~~~at~~~~l~stD~~sl~~id~a~~~a~~~~~~~~~~~~~~Pv~~~g~~~yV~~l~p~q~~~ 236 (364) T protein:vir:93 157 GYAGNPLDAPDVDHLLYGGVATSKASLAATDIMAPLVIEKAVEKAAMMQAENPDVANMVPVSIDGDDHYVCVMSEYQATD 236 (364) T ss_pred cccccccCCCCCCcEEeccccCchhhccccccccHHHHHHHHHHHHHhCCCCCCCcccceeEecCcceeEEEEcchhhhh Confidence 0 0000 1113456677777776554321 111256689998888 Q ss_pred Hhc--ccchhhhhhc---ccccceeeeeeEEEEeecceeeeeeccceeeeeeeccccccccccccccccCceeeeeeecc Q lcl|NC_021299. 178 LLL--DDRFIRYDSA---GEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTAYAMLTRSPGRPMTNTVATSTVAT 252 (387) Q Consensus 178 l~~--~~~~~~~~~~---g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~ 252 (387) |.. +++|...++. .......+-.|.+|.+.|+-+++...++-........ .+ ...-....|+.....+... T Consensus 237 Lr~~t~~~w~d~qk~A~~~~g~~nPlF~G~~gm~ngvii~~~~~vi~~~~~~~~~---~v-~~~ralllGaQA~~~a~g~ 312 (364) T protein:vir:93 237 MRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGA---NV-EAARALFMGRQAGVIAYGT 312 (364) T ss_pred hhhcCCHHHHHHHHHhhhcccccCCceecCeeeEcCeEEeccCCcccccccccCc---cc-cchhhheecceeeEEEeec Confidence 874 4555555543 3333456778999999998887776654332111110 00 0111122222222222222 Q ss_pred cccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeee Q lcl|NC_021299. 253 ENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKA 308 (387) Q Consensus 253 ~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~ 308 (387) .++..+.|....-.-.+.....+....|..-.... ..+.+..+.-..+.+-. T Consensus 313 ~~g~~~~w~Ee~~D~gn~~~i~~~~i~G~kK~rF~----~~DfGvi~idtaa~~~~ 364 (364) T protein:vir:93 313 ANGLRFDWEETVKDYGNEPAIAAGFIAGMKKARFN----NKDFGVISIDTAAKKHS 364 (364) T ss_pred CCCCCceeeecccCCCCchhhhhhhHhhhhhcccC----CccceEEEecccccccC Confidence 34455555443211111111111111111111000 01111000000000000 No 153 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=94.13 E-value=0.0053 Score=33.06 Aligned_cols=276 Identities=9% Similarity=-0.067 Sum_probs=108.8 Q ss_pred Cc------cccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccc-cc Q lcl|NC_021299. 1 MA------NAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVA-SD 73 (387) Q Consensus 1 Ma------~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~-~~ 73 (387) |. -.+++|+.+..++++.+++..++..++..- .. .++....++|......... -.+++....- +. T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~---~~---~~~~~~~~~~~~~~~~~a~--~v~E~~~~~~~~~ 177 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVE---PV---RTRSGSRVLEKNSDMIPFA--EITEMGEIPETDN 177 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceee---ec---cCCceeEEEEeecCCccce--eeccccccccccc Confidence 32 224789999999999999999887776421 11 1222223222211111111 1122233321 22 Q ss_pred cccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHHHH Q lcl|NC_021299. 74 LTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVSNR 153 (387) Q Consensus 74 ~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~ 153 (387) +.-..+++...+ .+.-+.++++-+.....++...+.+...++++..+|..++...... .......|+++.++. T Consensus 178 ~~~~~v~l~~~k-~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~------~~~~~~~~d~i~~~~ 250 (392) T protein:vir:10 178 PKFSNVQYAVKD-RAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKL------TKQAIKSLDDIKDVL 250 (392) T ss_pred ccceeEEeeeee-EEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc------cccCccCHHHHHHHH Confidence 344555555533 2445556665444445678888888888999999998887533211 122345678887754 Q ss_pred -HHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccc-cceeeeeeEEEEeecceeeee-ecc-ceeeeeeecccc Q lcl|NC_021299. 154 -RWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEA-GASRLQTARIGRLAQYDVVTV-DTL-PHGDAYLSHPTA 229 (387) Q Consensus 154 -~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~-~~~~~~~g~ig~~~g~~v~~s-~~~-~~~~~~~~~~~a 229 (387) ..|..... .+-..+++|..+..|.+...-. |.- ....+..|..+.+.|+.++.. ... +........... T Consensus 251 ~~~l~~~~~--~~a~~vm~~~~~~~L~~lkd~~-----G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~ 323 (392) T protein:vir:10 251 NVKLDPAIS--PNAILLTNQDGFNYLDKLKDKD-----GKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAP 323 (392) T ss_pred HHhhhhhhc--cCCEEEEcHHHHHHHHHhhccC-----CCeEeecCccCCccccccCcccEEEecccccCCCcccCCceE Confidence 45555443 3456789999988886531100 100 001122344456677654332 111 111111000000 Q ss_pred ccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeee Q lcl|NC_021299. 230 YAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKAT 309 (387) Q Consensus 230 ~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~ 309 (387) +.+ +.. . ..-......+..+.+....+.....+.. .+............ .-....+++... T Consensus 324 ~~~--gdf----s---~~~~i~~~~~~~~~~~~~~~~~f~~~~~---------~~r~~~r~d~~v~~-~~a~~~l~~~~~ 384 (392) T protein:vir:10 324 LII--GDL----K---EAIVLFKREDMELASTDVGGKAFTRNTL---------DLRAIQRDDVQMWD-NEAAVYGEIDLS 384 (392) T ss_pred EEE--Eeh----h---ceEEEEeecceEEEEeccccchhhcCce---------EEEEEEeeccEEec-ccceEEEEeccc Confidence 000 000 0 0000000001111111000000000000 00000000000000 000001111000 Q ss_pred eccccccccccc Q lcl|NC_021299. 310 DAEIEGETVKAG 321 (387) Q Consensus 310 ~~~~~~~~~~~~ 321 (387) .... -..| T Consensus 385 a~~~----~~~~ 392 (392) T protein:vir:10 385 APVE----QPQG 392 (392) T ss_pred cccc----CCCC Confidence 0000 0111 No 154 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=94.13 E-value=0.0053 Score=33.06 Aligned_cols=276 Identities=9% Similarity=-0.067 Sum_probs=108.8 Q ss_pred Cc------cccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccc-cc Q lcl|NC_021299. 1 MA------NAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVA-SD 73 (387) Q Consensus 1 Ma------~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~-~~ 73 (387) |. -.+++|+.+..++++.+++..++..++..- .. .++....++|......... -.+++....- +. T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~---~~---~~~~~~~~~~~~~~~~~a~--~v~E~~~~~~~~~ 177 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVE---PV---RTRSGSRVLEKNSDMIPFA--EITEMGEIPETDN 177 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceee---ec---cCCceeEEEEeecCCccce--eeccccccccccc Confidence 32 224789999999999999999887776421 11 1222223222211111111 1122233321 22 Q ss_pred cccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHHHH Q lcl|NC_021299. 74 LTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVSNR 153 (387) Q Consensus 74 ~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~ 153 (387) +.-..+++...+ .+.-+.++++-+.....++...+.+...++++..+|..++...... .......|+++.++. T Consensus 178 ~~~~~v~l~~~k-~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~------~~~~~~~~d~i~~~~ 250 (392) T protein:vir:10 178 PKFSNVQYAVKD-RAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKL------TKQAIKSLDDIKDVL 250 (392) T ss_pred ccceeEEeeeee-EEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc------cccCccCHHHHHHHH Confidence 344555555533 2445556665444445678888888888999999998887533211 122345678887754 Q ss_pred -HHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccc-cceeeeeeEEEEeecceeeee-ecc-ceeeeeeecccc Q lcl|NC_021299. 154 -RWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEA-GASRLQTARIGRLAQYDVVTV-DTL-PHGDAYLSHPTA 229 (387) Q Consensus 154 -~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~-~~~~~~~g~ig~~~g~~v~~s-~~~-~~~~~~~~~~~a 229 (387) ..|..... .+-..+++|..+..|.+...-. |.- ....+..|..+.+.|+.++.. ... +........... T Consensus 251 ~~~l~~~~~--~~a~~vm~~~~~~~L~~lkd~~-----G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~ 323 (392) T protein:vir:10 251 NVKLDPAIS--PNAILLTNQDGFNYLDKLKDKD-----GKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAP 323 (392) T ss_pred HHhhhhhhc--cCCEEEEcHHHHHHHHHhhccC-----CCeEeecCccCCccccccCcccEEEecccccCCCcccCCceE Confidence 45555443 3456789999988886531100 100 001122344456677654332 111 111111000000 Q ss_pred ccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeee Q lcl|NC_021299. 230 YAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKAT 309 (387) Q Consensus 230 ~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~ 309 (387) +.+ +.. . ..-......+..+.+....+.....+.. .+............ .-....+++... T Consensus 324 ~~~--gdf----s---~~~~i~~~~~~~~~~~~~~~~~f~~~~~---------~~r~~~r~d~~v~~-~~a~~~l~~~~~ 384 (392) T protein:vir:10 324 LII--GDL----K---EAIVLFKREDMELASTDVGGKAFTRNTL---------DLRAIQRDDVQMWD-NEAAVYGEIDLS 384 (392) T ss_pred EEE--Eeh----h---ceEEEEeecceEEEEeccccchhhcCce---------EEEEEEeeccEEec-ccceEEEEeccc Confidence 000 000 0 0000000001111111000000000000 00000000000000 000001111000 Q ss_pred eccccccccccc Q lcl|NC_021299. 310 DAEIEGETVKAG 321 (387) Q Consensus 310 ~~~~~~~~~~~~ 321 (387) .... -..| T Consensus 385 a~~~----~~~~ 392 (392) T protein:vir:10 385 APVE----QPQG 392 (392) T ss_pred cccc----CCCC Confidence 0000 0111 No 155 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=94.13 E-value=0.0053 Score=33.06 Aligned_cols=276 Identities=9% Similarity=-0.067 Sum_probs=108.8 Q ss_pred Cc------cccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccc-cc Q lcl|NC_021299. 1 MA------NAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVA-SD 73 (387) Q Consensus 1 Ma------~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~-~~ 73 (387) |. -.+++|+.+..++++.+++..++..++..- .. .++....++|......... -.+++....- +. T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~---~~---~~~~~~~~~~~~~~~~~a~--~v~E~~~~~~~~~ 177 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVE---PV---RTRSGSRVLEKNSDMIPFA--EITEMGEIPETDN 177 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceee---ec---cCCceeEEEEeecCCccce--eeccccccccccc Confidence 32 224789999999999999999887776421 11 1222223222211111111 1122233321 22 Q ss_pred cccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHHHH Q lcl|NC_021299. 74 LTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVSNR 153 (387) Q Consensus 74 ~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~ 153 (387) +.-..+++...+ .+.-+.++++-+.....++...+.+...++++..+|..++...... .......|+++.++. T Consensus 178 ~~~~~v~l~~~k-~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~------~~~~~~~~d~i~~~~ 250 (392) T protein:vir:10 178 PKFSNVQYAVKD-RAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKL------TKQAIKSLDDIKDVL 250 (392) T ss_pred ccceeEEeeeee-EEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc------cccCccCHHHHHHHH Confidence 344555555533 2445556665444445678888888888999999998887533211 122345678887754 Q ss_pred -HHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccc-cceeeeeeEEEEeecceeeee-ecc-ceeeeeeecccc Q lcl|NC_021299. 154 -RWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEA-GASRLQTARIGRLAQYDVVTV-DTL-PHGDAYLSHPTA 229 (387) Q Consensus 154 -~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~-~~~~~~~g~ig~~~g~~v~~s-~~~-~~~~~~~~~~~a 229 (387) ..|..... .+-..+++|..+..|.+...-. |.- ....+..|..+.+.|+.++.. ... +........... T Consensus 251 ~~~l~~~~~--~~a~~vm~~~~~~~L~~lkd~~-----G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~ 323 (392) T protein:vir:10 251 NVKLDPAIS--PNAILLTNQDGFNYLDKLKDKD-----GKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAP 323 (392) T ss_pred HHhhhhhhc--cCCEEEEcHHHHHHHHHhhccC-----CCeEeecCccCCccccccCcccEEEecccccCCCcccCCceE Confidence 45555443 3456789999988886531100 100 001122344456677654332 111 111111000000 Q ss_pred ccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeee Q lcl|NC_021299. 230 YAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKAT 309 (387) Q Consensus 230 ~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~ 309 (387) +.+ +.. . ..-......+..+.+....+.....+.. .+............ .-....+++... T Consensus 324 ~~~--gdf----s---~~~~i~~~~~~~~~~~~~~~~~f~~~~~---------~~r~~~r~d~~v~~-~~a~~~l~~~~~ 384 (392) T protein:vir:10 324 LII--GDL----K---EAIVLFKREDMELASTDVGGKAFTRNTL---------DLRAIQRDDVQMWD-NEAAVYGEIDLS 384 (392) T ss_pred EEE--Eeh----h---ceEEEEeecceEEEEeccccchhhcCce---------EEEEEEeeccEEec-ccceEEEEeccc Confidence 000 000 0 0000000001111111000000000000 00000000000000 000001111000 Q ss_pred eccccccccccc Q lcl|NC_021299. 310 DAEIEGETVKAG 321 (387) Q Consensus 310 ~~~~~~~~~~~~ 321 (387) .... -..| T Consensus 385 a~~~----~~~~ 392 (392) T protein:vir:10 385 APVE----QPQG 392 (392) T ss_pred cccc----CCCC Confidence 0000 0111 No 156 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=94.13 E-value=0.0053 Score=33.06 Aligned_cols=276 Identities=9% Similarity=-0.067 Sum_probs=108.8 Q ss_pred Cc------cccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccc-cc Q lcl|NC_021299. 1 MA------NAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVA-SD 73 (387) Q Consensus 1 Ma------~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~-~~ 73 (387) |. -.+++|+.+..++++.+++..++..++..- .. .++....++|......... -.+++....- +. T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~---~~---~~~~~~~~~~~~~~~~~a~--~v~E~~~~~~~~~ 177 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVE---PV---RTRSGSRVLEKNSDMIPFA--EITEMGEIPETDN 177 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceee---ec---cCCceeEEEEeecCCccce--eeccccccccccc Confidence 32 224789999999999999999887776421 11 1222223222211111111 1122233321 22 Q ss_pred cccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHHHH Q lcl|NC_021299. 74 LTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVSNR 153 (387) Q Consensus 74 ~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~ 153 (387) +.-..+++...+ .+.-+.++++-+.....++...+.+...++++..+|..++...... .......|+++.++. T Consensus 178 ~~~~~v~l~~~k-~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~------~~~~~~~~d~i~~~~ 250 (392) T protein:vir:10 178 PKFSNVQYAVKD-RAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKL------TKQAIKSLDDIKDVL 250 (392) T ss_pred ccceeEEeeeee-EEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc------cccCccCHHHHHHHH Confidence 344555555533 2445556665444445678888888888999999998887533211 122345678887754 Q ss_pred -HHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccc-cceeeeeeEEEEeecceeeee-ecc-ceeeeeeecccc Q lcl|NC_021299. 154 -RWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEA-GASRLQTARIGRLAQYDVVTV-DTL-PHGDAYLSHPTA 229 (387) Q Consensus 154 -~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~-~~~~~~~g~ig~~~g~~v~~s-~~~-~~~~~~~~~~~a 229 (387) ..|..... .+-..+++|..+..|.+...-. |.- ....+..|..+.+.|+.++.. ... +........... T Consensus 251 ~~~l~~~~~--~~a~~vm~~~~~~~L~~lkd~~-----G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~ 323 (392) T protein:vir:10 251 NVKLDPAIS--PNAILLTNQDGFNYLDKLKDKD-----GKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAP 323 (392) T ss_pred HHhhhhhhc--cCCEEEEcHHHHHHHHHhhccC-----CCeEeecCccCCccccccCcccEEEecccccCCCcccCCceE Confidence 45555443 3456789999988886531100 100 001122344456677654332 111 111111000000 Q ss_pred ccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeee Q lcl|NC_021299. 230 YAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKAT 309 (387) Q Consensus 230 ~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~ 309 (387) +.+ +.. . ..-......+..+.+....+.....+.. .+............ .-....+++... T Consensus 324 ~~~--gdf----s---~~~~i~~~~~~~~~~~~~~~~~f~~~~~---------~~r~~~r~d~~v~~-~~a~~~l~~~~~ 384 (392) T protein:vir:10 324 LII--GDL----K---EAIVLFKREDMELASTDVGGKAFTRNTL---------DLRAIQRDDVQMWD-NEAAVYGEIDLS 384 (392) T ss_pred EEE--Eeh----h---ceEEEEeecceEEEEeccccchhhcCce---------EEEEEEeeccEEec-ccceEEEEeccc Confidence 000 000 0 0000000001111111000000000000 00000000000000 000001111000 Q ss_pred eccccccccccc Q lcl|NC_021299. 310 DAEIEGETVKAG 321 (387) Q Consensus 310 ~~~~~~~~~~~~ 321 (387) .... -..| T Consensus 385 a~~~----~~~~ 392 (392) T protein:vir:10 385 APVE----QPQG 392 (392) T ss_pred cccc----CCCC Confidence 0000 0111 No 157 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=94.12 E-value=0.0053 Score=33.05 Aligned_cols=255 Identities=9% Similarity=-0.036 Sum_probs=108.4 Q ss_pred Ccc------ccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccccc Q lcl|NC_021299. 1 MAN------AFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDL 74 (387) Q Consensus 1 Ma~------~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (387) |.. -+++|+-+..++++.+++...+..+++.- .. .| ..+|...... .+. .-.+++...+-.++ T Consensus 118 ~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~---~~---~~--~~~p~~~~~~-~~a--~~v~Eg~~~~~~~~ 186 (387) T protein:vir:94 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLT---NI---KG--LEIPRVSYTL-DDD--DFITDVETAKELKA 186 (387) T ss_pred hccCCCCCCceeechhHHHHHHHHHHhhchhhhhceee---ec---CC--ceeeeeeccC-Ccc--cccccccccccccc Confidence 221 24789999999999998888776665421 11 11 2333221111 111 11233444444455 Q ss_pred ccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-------cccccCCcchhHH Q lcl|NC_021299. 75 TEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPY-------EKVSLVDEDEIWN 147 (387) Q Consensus 75 ~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~-------~~~~~~~~~~~~~ 147 (387) .-..+++...+. +.-+.++.+-+.....++...+.++..++++...+..++....+... ......+....|+ T Consensus 187 ~f~~v~l~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~~~~d 265 (387) T protein:vir:94 187 KGDTVKFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYD 265 (387) T ss_pred ccceeeechhee-eeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccccchHH Confidence 555566555333 22345555544444567777777777888887766666543222111 1112234455688 Q ss_pred HHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeeecc Q lcl|NC_021299. 148 GVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHP 227 (387) Q Consensus 148 ~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~ 227 (387) +++++-..|.....+ +-..++++..+..+++. .+ +. ...+..|.-..+.|+.|+.++..+.-....+.. T Consensus 266 ~i~~~~~~l~~~y~~--na~~imn~~t~~~~~~~---~~--~~----~~~~~~~~~~~llG~PV~~~~~~~~~~~GDf~~ 334 (387) T protein:vir:94 266 AIINALADLHEDYRD--NATIYMRYADYVKIISV---LS--NG----TTNFFDTPAEKVFGKPVVFTDAAVKPIVGDFNY 334 (387) T ss_pred HHHHHHhccChhhhc--CCEEEEechHHHHHHHH---Hh--cC----CCcccccCCccccccceEEecCCCceeeechhh Confidence 998887777665432 22346776666555431 00 00 112333444578899988877554321111100 Q ss_pred ccccccccccccccCceeeeeeecccc-cceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeee Q lcl|NC_021299. 228 TAYAMLTRSPGRPMTNTVATSTVATEN-GVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHL 306 (387) Q Consensus 228 ~a~~~~~~~~~~~~~~t~~~~~~~~~~-~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~ 306 (387) .+ ... .+.... ....... ...+.... +.+....+ . ..+.. +.+ T Consensus 335 -~~--~~~-----~~~~~~-~~~~~~~~~~~~~~~~------r~Dg~v~~------------------~-~A~~~--l~~ 378 (387) T protein:vir:94 335 -FG--INY-----DGTTYD-TDKDVKKGEYLFVLTA------WYDQQRTL------------------D-SAFRI--AKA 378 (387) T ss_pred -hh--hhh-----hhhhhe-ecccccCCceEEEEEE------EeCcEeec------------------h-hheEE--EEe Confidence 00 000 000000 0000000 00000000 00000000 0 00000 111 Q ss_pred eeee-cccc Q lcl|NC_021299. 307 KATD-AEIE 314 (387) Q Consensus 307 ~~~~-~~~~ 314 (387) +... ..++ T Consensus 379 ka~~~~~~~ 387 (387) T protein:vir:94 379 KENTGPLPS 387 (387) T ss_pred ecCCCCCCC Confidence 0000 0000 No 158 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=94.12 E-value=0.0053 Score=33.05 Aligned_cols=255 Identities=9% Similarity=-0.036 Sum_probs=108.4 Q ss_pred Ccc------ccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccccc Q lcl|NC_021299. 1 MAN------AFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDL 74 (387) Q Consensus 1 Ma~------~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (387) |.. -+++|+-+..++++.+++...+..+++.- .. .| ..+|...... .+. .-.+++...+-.++ T Consensus 118 ~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~---~~---~~--~~~p~~~~~~-~~a--~~v~Eg~~~~~~~~ 186 (387) T protein:vir:26 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLT---NI---KG--LEIPRVSYTL-DDD--DFITDVETAKELKA 186 (387) T ss_pred hccCCCCCCceeechhHHHHHHHHHHhhchhhhhceee---ec---CC--ceeeeeeccC-Ccc--cccccccccccccc Confidence 221 24789999999999998888776665421 11 11 2333221111 111 11233444444455 Q ss_pred ccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-------cccccCCcchhHH Q lcl|NC_021299. 75 TEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPY-------EKVSLVDEDEIWN 147 (387) Q Consensus 75 ~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~-------~~~~~~~~~~~~~ 147 (387) .-..+++...+. +.-+.++.+-+.....++...+.++..++++...+..++....+... ......+....|+ T Consensus 187 ~f~~v~l~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~~~~d 265 (387) T protein:vir:26 187 KGDTVKFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYD 265 (387) T ss_pred ccceeeechhee-eeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccccchHH Confidence 555566555333 22345555544444567777777777888887766666543222111 1112234455688 Q ss_pred HHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeeecc Q lcl|NC_021299. 148 GVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHP 227 (387) Q Consensus 148 ~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~ 227 (387) +++++-..|.....+ +-..++++..+..+++. .+ +. ...+..|.-..+.|+.|+.++..+.-....+.. T Consensus 266 ~i~~~~~~l~~~y~~--na~~imn~~t~~~~~~~---~~--~~----~~~~~~~~~~~llG~PV~~~~~~~~~~~GDf~~ 334 (387) T protein:vir:26 266 AIINALADLHEDYRD--NATIYMRYADYVKIISV---LS--NG----TTNFFDTPAEKVFGKPVVFTDAAVKPIVGDFNY 334 (387) T ss_pred HHHHHHhccChhhhc--CCEEEEechHHHHHHHH---Hh--cC----CCcccccCCccccccceEEecCCCceeeechhh Confidence 998887777665432 22346776666555431 00 00 112333444578899988877554321111100 Q ss_pred ccccccccccccccCceeeeeeecccc-cceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeee Q lcl|NC_021299. 228 TAYAMLTRSPGRPMTNTVATSTVATEN-GVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHL 306 (387) Q Consensus 228 ~a~~~~~~~~~~~~~~t~~~~~~~~~~-~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~ 306 (387) .+ ... .+.... ....... ...+.... +.+....+ . ..+.. +.+ T Consensus 335 -~~--~~~-----~~~~~~-~~~~~~~~~~~~~~~~------r~Dg~v~~------------------~-~A~~~--l~~ 378 (387) T protein:vir:26 335 -FG--INY-----DGTTYD-TDKDVKKGEYLFVLTA------WYDQQRTL------------------D-SAFRI--AKA 378 (387) T ss_pred -hh--hhh-----hhhhhe-ecccccCCceEEEEEE------EeCcEeec------------------h-hheEE--EEe Confidence 00 000 000000 0000000 00000000 00000000 0 00000 111 Q ss_pred eeee-cccc Q lcl|NC_021299. 307 KATD-AEIE 314 (387) Q Consensus 307 ~~~~-~~~~ 314 (387) +... ..++ T Consensus 379 ka~~~~~~~ 387 (387) T protein:vir:26 379 KENTGPLPS 387 (387) T ss_pred ecCCCCCCC Confidence 0000 0000 No 159 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=94.12 E-value=0.0053 Score=33.05 Aligned_cols=255 Identities=9% Similarity=-0.036 Sum_probs=108.4 Q ss_pred Ccc------ccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccccc Q lcl|NC_021299. 1 MAN------AFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDL 74 (387) Q Consensus 1 Ma~------~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (387) |.. -+++|+-+..++++.+++...+..+++.- .. .| ..+|...... .+. .-.+++...+-.++ T Consensus 118 ~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~---~~---~~--~~~p~~~~~~-~~a--~~v~Eg~~~~~~~~ 186 (387) T protein:vir:96 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLT---NI---KG--LEIPRVSYTL-DDD--DFITDVETAKELKA 186 (387) T ss_pred hccCCCCCCceeechhHHHHHHHHHHhhchhhhhceee---ec---CC--ceeeeeeccC-Ccc--cccccccccccccc Confidence 221 24789999999999998888776665421 11 11 2333221111 111 11233444444455 Q ss_pred ccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-------cccccCCcchhHH Q lcl|NC_021299. 75 TEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPY-------EKVSLVDEDEIWN 147 (387) Q Consensus 75 ~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~-------~~~~~~~~~~~~~ 147 (387) .-..+++...+. +.-+.++.+-+.....++...+.++..++++...+..++....+... ......+....|+ T Consensus 187 ~f~~v~l~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~~~~d 265 (387) T protein:vir:96 187 KGDTVKFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYD 265 (387) T ss_pred ccceeeechhee-eeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccccchHH Confidence 555566555333 22345555544444567777777777888887766666543222111 1112234455688 Q ss_pred HHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeeecc Q lcl|NC_021299. 148 GVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHP 227 (387) Q Consensus 148 ~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~ 227 (387) +++++-..|.....+ +-..++++..+..+++. .+ +. ...+..|.-..+.|+.|+.++..+.-....+.. T Consensus 266 ~i~~~~~~l~~~y~~--na~~imn~~t~~~~~~~---~~--~~----~~~~~~~~~~~llG~PV~~~~~~~~~~~GDf~~ 334 (387) T protein:vir:96 266 AIINALADLHEDYRD--NATIYMRYADYVKIISV---LS--NG----TTNFFDTPAEKVFGKPVVFTDAAVKPIVGDFNY 334 (387) T ss_pred HHHHHHhccChhhhc--CCEEEEechHHHHHHHH---Hh--cC----CCcccccCCccccccceEEecCCCceeeechhh Confidence 998887777665432 22346776666555431 00 00 112333444578899988877554321111100 Q ss_pred ccccccccccccccCceeeeeeecccc-cceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeee Q lcl|NC_021299. 228 TAYAMLTRSPGRPMTNTVATSTVATEN-GVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHL 306 (387) Q Consensus 228 ~a~~~~~~~~~~~~~~t~~~~~~~~~~-~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~ 306 (387) .+ ... .+.... ....... ...+.... +.+....+ . ..+.. +.+ T Consensus 335 -~~--~~~-----~~~~~~-~~~~~~~~~~~~~~~~------r~Dg~v~~------------------~-~A~~~--l~~ 378 (387) T protein:vir:96 335 -FG--INY-----DGTTYD-TDKDVKKGEYLFVLTA------WYDQQRTL------------------D-SAFRI--AKA 378 (387) T ss_pred -hh--hhh-----hhhhhe-ecccccCCceEEEEEE------EeCcEeec------------------h-hheEE--EEe Confidence 00 000 000000 0000000 00000000 00000000 0 00000 111 Q ss_pred eeee-cccc Q lcl|NC_021299. 307 KATD-AEIE 314 (387) Q Consensus 307 ~~~~-~~~~ 314 (387) +... ..++ T Consensus 379 ka~~~~~~~ 387 (387) T protein:vir:96 379 KENTGPLPS 387 (387) T ss_pred ecCCCCCCC Confidence 0000 0000 No 160 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=93.70 E-value=0.0067 Score=32.51 Aligned_cols=256 Identities=10% Similarity=0.001 Sum_probs=109.3 Q ss_pred Cc------cccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccccc Q lcl|NC_021299. 1 MA------NAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDL 74 (387) Q Consensus 1 Ma------~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (387) |. .-+++|+-+..++++.+++...+..+++.- .. .|. ++|..... ..+.. -.+++..+.-.++ T Consensus 83 l~~~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~v~---~~---~~~--~~p~~~~~-~~~a~--~v~E~~~~~~~~~ 151 (352) T protein:vir:78 83 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLT---NI---KGL--EIPRVSYT-LDDDD--FITDVETAKELKL 151 (352) T ss_pred hccCCCCCCceeccHhHHHHHHHHHHhhcchhhheeeE---ec---CCc--eEEEEecC-CCccc--ccccccccccccc Confidence 22 235889999999999999998887776431 11 121 23321111 01111 1123444444555 Q ss_pred ccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-------cccccCCcchhHH Q lcl|NC_021299. 75 TEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPY-------EKVSLVDEDEIWN 147 (387) Q Consensus 75 ~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~-------~~~~~~~~~~~~~ 147 (387) +-..+++...+.. .-+.++.+-+.....++...+.+..+++++...+..++..-.+... ......+....|+ T Consensus 152 ~f~~v~~~~~k~~-~~i~is~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~~~~~g~l~~~~~~~~t~~~~~d 230 (352) T protein:vir:78 152 KGDTVKFTTNKFK-VFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGANMYD 230 (352) T ss_pred cceeeeecceeEE-eechhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCcccccceeccccccccccchHH Confidence 6666666664433 2355665544444667777777777788876644545532221111 1112234445688 Q ss_pred HHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeeecc Q lcl|NC_021299. 148 GVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHP 227 (387) Q Consensus 148 ~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~ 227 (387) .++++...|..... .+-..++++..+..+++.- + +. ...+..|.-..+.|+.|+.++..+.-. + T Consensus 231 ~i~~~~~~l~~~~~--~~a~~~mn~~t~~~l~~~~---~--~~----~~~~~~~~~~~llG~PV~~~~~~~~~~---~-- 294 (352) T protein:vir:78 231 AIINALADLHEDYR--DNATIYMRYADYVKIISVL---S--NG----TTNFFDTPAEKVFGKPVVFTDAAVKPI---V-- 294 (352) T ss_pred HHHHHHhccChhhh--cCCEEEEehHHHHHHHHHH---h--cc----CCcccccCCccccccceEEecCCCcee---E-- Confidence 89888877765542 2445577887776665421 1 00 112233444567888888766543211 1 Q ss_pred ccccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeecccee-ccccccceeee Q lcl|NC_021299. 228 TAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDE-PRFVRGTRIHL 306 (387) Q Consensus 228 ~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~-~~~v~~~~v~~ 306 (387) +.|..... . ..+.. .....+... +...+........... +.. ...+.+ T Consensus 295 Gdf~~~~~---~-------------~~~~~--~~~~~~~~~-----------g~~~f~~~~r~Dg~~~~~eA--~~~l~~ 343 (352) T protein:vir:78 295 GDFNYFGI---N-------------YDGTT--YDTDKDVKK-----------GEYLFVLTAWYDQQRTLDSA--FRIAKA 343 (352) T ss_pred eehhhhhh---h-------------hhhhe--eeeeccccC-----------CeeEEEEEeeeCceeechhh--eEEEEe Confidence 01100000 0 00000 000000000 0000000000000000 000 011111 Q ss_pred eeeecccccccccc Q lcl|NC_021299. 307 KATDAEIEGETVKA 320 (387) Q Consensus 307 ~~~~~~~~~~~~~~ 320 (387) ...... ++. T Consensus 344 ~a~~~~-----~~~ 352 (352) T protein:vir:78 344 KESTGS-----LPS 352 (352) T ss_pred ecccCC-----CCC Confidence 110000 000 No 161 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=93.61 E-value=0.007 Score=32.41 Aligned_cols=267 Identities=14% Similarity=0.066 Sum_probs=111.5 Q ss_pred CccccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccccceEE Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEVTVD 80 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (387) .+--.|.|+.+. ++++.+.+...+.+++++.- .. ..++..|+..+....-.......+.....+-.+++-++++ T Consensus 19 ~~gG~L~P~~~~-~~i~~l~e~s~i~~~a~vi~--t~---~s~~~~i~~i~~g~~~~~~~~~~~~~~~~~~~~~tf~~~~ 92 (314) T protein:vir:41 19 LGKGILAVQRFG-EFVREVRENSAIIKDARVLN--AL---KSYEVDISRISLGVELEPGRNTSGTKVAPTADEVTVSTNT 92 (314) T ss_pred CCCceeChHHHH-HHHHHHHhccchhhheeeec--cc---CccceeecccccCcccccccccccCCccCCccccccccee Confidence 333358899875 68899999999988876431 11 1234556543321100000001112223344566677777 Q ss_pred EEEEeeeecceeeccHHHhhhhh--hHHHHHHHHHHHHHHHHHHHHHHHH-----------------Hhccccccc--cc Q lcl|NC_021299. 81 IKLTDVIYNRIDLTDEERELDVR--SFAVDVLPRQVRAVAEQIEDAVSYL-----------------ITKAPYEKV--SL 139 (387) Q Consensus 81 ~~id~~~~~~~~~~d~~~~~~~~--~~~~~~~~~~~~~la~~vd~~~~~~-----------------~~~~~~~~~--~~ 139 (387) +...+.. ..+.++++.+...+. ++...+..+.+++++...+...+.- ++.+...+. .+ T Consensus 93 l~~~kl~-~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~~~~~~~p~G~l~~a~~~~~~~~~ 171 (314) T protein:vir:41 93 LEMKELV-TKVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSLTTGRELYRINDGWMKLAGNQYTDAEP 171 (314) T ss_pred eeeEEEE-EeecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCCcCcccchhcchhhhhhcccceeecCc Confidence 7774444 457777776555543 6666666667777877776655421 111111111 11 Q ss_pred CCcchhHHHHHHHHHHHhhccCCcC-CcEEEEchHHHHHHhc--ccchhhhhhcccccceeeeeeEEEEeecceeeeeec Q lcl|NC_021299. 140 VDEDEIWNGVVSNRRWLNEQKVPKD-GRVLLVGSAVEEALLL--DDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDT 216 (387) Q Consensus 140 ~~~~~~~~~i~~a~~~l~~~~vp~~-~r~~v~~~~~~~~l~~--~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~ 216 (387) .+..+..+.+.++...|....--.. +-..+++++....+.+ +++- ...+ ...+..|....+.|+.|+..+. T Consensus 172 ~~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~l~~~~---~~l~---~~~~~~~~~~~l~G~PV~~~~~ 245 (314) T protein:vir:41 172 EDENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQLLVRE---TGLG---DSALIGATGLQYDGIPIQYVPA 245 (314) T ss_pred cccccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHHHhccC---Cccc---chhhhCCCCceecceeeEeccc Confidence 1222334556666666654321111 1234568887777653 1110 1111 1233445555678999988777 Q ss_pred cceee-----eeeeccccccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeec Q lcl|NC_021299. 217 LPHGD-----AYLSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTAN 291 (387) Q Consensus 217 ~~~~~-----~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~ 291 (387) ++.-. .+......+.+.. ........+++.............. .+. T Consensus 246 ~~~~~~~~~~i~fgd~~nlv~~~--------------------~~~ir~~~~~~a~~~~~~~~~~~r~---------d~~ 296 (314) T protein:vir:41 246 LDALGDDKARALLTVPTNLVYGF--------------------WRNIRIEPKRDAAMRRTEYIASLRA---------DCN 296 (314) T ss_pred ccccCCCCceEEEechhheEEEe--------------------eceeEEeecccCcCCeEEEEEEEEe---------ceE Confidence 65311 1111111111100 0001111111111000000000000 000 Q ss_pred cceeccccccceeeeeeeeccccccccc Q lcl|NC_021299. 292 LDDEPRFVRGTRIHLKATDAEIEGETVK 319 (387) Q Consensus 292 ~~~~~~~v~~~~v~~~~~~~~~~~~~~~ 319 (387) ...... ++...+.... -+ T Consensus 297 ~~~~~a---a~~~~~~~~~-------~~ 314 (314) T protein:vir:41 297 YEDENA---AVAAVIDMSS-------GG 314 (314) T ss_pred EEEcCc---EEEEEeeccC-------CC Confidence 000000 0000000000 00 No 162 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=92.98 E-value=0.0092 Score=31.74 Aligned_cols=256 Identities=10% Similarity=0.026 Sum_probs=106.6 Q ss_pred Ccc------ccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccccc Q lcl|NC_021299. 1 MAN------AFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDL 74 (387) Q Consensus 1 Ma~------~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (387) |.. -+++|+-+..++++.+++...+..++..- .. .| ..+|..... .... .-.+++......++ T Consensus 118 l~~~t~s~gG~~IP~~~~~~Ii~~~~~~~~l~~~~~v~---~~---~~--~~~p~~~~~-~~~a--~~v~E~~~~~~~~~ 186 (387) T protein:vir:93 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLT---NI---KG--LEIPRVSYT-LDDD--DFITDVETAKELKL 186 (387) T ss_pred hccCcCCCCceeechhHHHHHHHHHHhhchhhhheeee---ec---CC--ceEEEEeec-CCcc--ccccCccccccccc Confidence 221 24789999999999999888776665421 11 12 223321111 0111 11223444444455 Q ss_pred ccceEEEEEEeeeecc-eeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-------cccccCCcchhH Q lcl|NC_021299. 75 TEVTVDIKLTDVIYNR-IDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPY-------EKVSLVDEDEIW 146 (387) Q Consensus 75 ~~~~~~~~id~~~~~~-~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~-------~~~~~~~~~~~~ 146 (387) .-..+++.. +++.. +.++.+-+.....++...+.++..++++.+.+..++..-.+... ......+....| T Consensus 187 ~f~~v~~~~--~k~~~~~~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~l~~~~~~~v~~~~~~ 264 (387) T protein:vir:93 187 KGDTVKFTT--NKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLDHMSFYNGSVKEVEGADMY 264 (387) T ss_pred ccceeeeeh--eeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccccchH Confidence 555555554 44433 45555534334567777777777888888766666543222111 111223444568 Q ss_pred HHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeeec Q lcl|NC_021299. 147 NGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSH 226 (387) Q Consensus 147 ~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~ 226 (387) ++++++-..|+..... ... .++++..+..+++. + + + +...+..|.-..+.|+.|+.+...+...... T Consensus 265 d~i~~~~~~l~~~~~~-~a~-~~mn~~t~~~~~~~--~-~--d----~~~~~~~~~~~~llG~PV~~~~~~~~~~~GD-- 331 (387) T protein:vir:93 265 DAIINALADLHEDYRD-NAT-IYMRYADYVKIISV--L-S--N----GTTNFFDTPAEKVFGKPVVFTDAAVKPIVGD-- 331 (387) T ss_pred HHHHHHHhccChhhhc-CCE-EEEechHHHHHHHH--H-h--c----CCCcccccCCccccccceEEecCCCceeeee-- Confidence 8898887777665432 233 46776655544321 0 0 0 0112223444567898888776544211111 Q ss_pred cccccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeee Q lcl|NC_021299. 227 PTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHL 306 (387) Q Consensus 227 ~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~ 306 (387) |..... . +.+.. +..+.+........... ....+....... +.. +.+ T Consensus 332 ---f~~~~~---~-------------~~~~~--~~~~~~~~~~~~~~~~~---------~r~d~~v~~~eA-~~~--l~~ 378 (387) T protein:vir:93 332 ---FNYFGI---N-------------YDGTT--YDTDKDVKKGEYLFVLT---------AWYDQQRTLDSA-FRI--AKA 378 (387) T ss_pred ---hhhhhe---e-------------hhhhe--eeecccccCCceeEEEE---------eeeCceeechhh-eEE--EEe Confidence 110000 0 00000 00000000000000000 000000000000 000 111 Q ss_pred eeeecccccc Q lcl|NC_021299. 307 KATDAEIEGE 316 (387) Q Consensus 307 ~~~~~~~~~~ 316 (387) .....+. +. T Consensus 379 k~~~~~~-~~ 387 (387) T protein:vir:93 379 KENTGSL-PS 387 (387) T ss_pred ecCCCCC-CC Confidence 0000000 00 No 163 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=92.57 E-value=0.011 Score=31.36 Aligned_cols=254 Identities=9% Similarity=-0.027 Sum_probs=107.2 Q ss_pred Ccc------ccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccccc Q lcl|NC_021299. 1 MAN------AFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDL 74 (387) Q Consensus 1 Ma~------~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (387) |.. -+++|+-+..++++.+++...+..+++.- .. .| ..+|...... .+. .-.+++....-.++ T Consensus 133 ~~~~t~~~GG~lIP~~~~~~Ii~~~~~~~~l~~~~~v~---~~---~~--~~~p~~~~~~-~~a--~~v~Eg~~~~~~~~ 201 (402) T protein:vir:93 133 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLT---NI---KG--LEIPRVSYTL-DDD--DFITDVETAKELKA 201 (402) T ss_pred hccCCCcCCccccchhHHHHHHHhHHhhhhhhhhceee---ec---CC--ceeeeeeccC-Ccc--cccccccccccccc Confidence 221 24789999999999999888887666421 11 12 2233221111 111 11123334444445 Q ss_pred ccceEEEEEEeeeecc-eeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-------cccccCCcchhH Q lcl|NC_021299. 75 TEVTVDIKLTDVIYNR-IDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPY-------EKVSLVDEDEIW 146 (387) Q Consensus 75 ~~~~~~~~id~~~~~~-~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~-------~~~~~~~~~~~~ 146 (387) +-..+++.. ++... +.++.+-+.....++...+.++..++++.+.+..++..-.+... ......+....| T Consensus 202 ~f~~i~~~~--~k~~~~i~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~~~~~~~~~~~~~~~~ 279 (402) T protein:vir:93 202 KGDTVKFTT--NKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMY 279 (402) T ss_pred ccceeeecc--eeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccccchH Confidence 555555554 33333 44555433444567777777777888888766655543222110 111223445668 Q ss_pred HHHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeeec Q lcl|NC_021299. 147 NGVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSH 226 (387) Q Consensus 147 ~~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~ 226 (387) ++++++...|+..... ... .++++..+..+++. .+ +. ...+..|.-..+.|+.|+.++..+.-....+. T Consensus 280 d~l~~~~~~l~~~y~~-na~-~imn~~t~~~~~~~---~~--d~----~~~~~~~~~~~llG~PV~~t~~~~~i~~GDf~ 348 (402) T protein:vir:93 280 DAIINALADLHEDYRD-NAT-IYMRYADYVKIISV---LS--NG----TTNFFDTPAEKVFGKPVVFTDAAVKPIVGDFN 348 (402) T ss_pred HHHHHHHhccChhhhc-CCE-EEEechHHHHHHHH---Hh--cC----CCcccccCCccccccceEEecCCCceeeechh Confidence 8898888777665432 233 46776655554431 01 00 11223344456889998887755432111111 Q ss_pred cccccccccccccccCcee-eeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceee Q lcl|NC_021299. 227 PTAYAMLTRSPGRPMTNTV-ATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIH 305 (387) Q Consensus 227 ~~a~~~~~~~~~~~~~~t~-~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~ 305 (387) . .+. .. .+... .+.... .....+.... +.+....+. ..+.. +. T Consensus 349 ~-~~~--~~-----~~~~~~~~~~~~-~~~~~~~~~~------r~Dg~v~~~-------------------~A~~~--l~ 392 (402) T protein:vir:93 349 Y-FGI--NY-----DGTTYDTDKDVK-KGEYLFVLTA------WYDQQRTLD-------------------SAFRI--AK 392 (402) T ss_pred h-hhh--hh-----hhhhhhhhhccc-CCceEEEEEE------EeCcEEech-------------------hheEE--EE Confidence 0 000 00 00000 000000 0000000000 000111000 00000 00 Q ss_pred eeee-ecccc Q lcl|NC_021299. 306 LKAT-DAEIE 314 (387) Q Consensus 306 ~~~~-~~~~~ 314 (387) ++.. ..+++ T Consensus 393 ik~~~~~~~~ 402 (402) T protein:vir:93 393 AKENTGPLPS 402 (402) T ss_pred eecCCCCCCC Confidence 0000 00000 No 164 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=92.49 E-value=0.011 Score=31.28 Aligned_cols=280 Identities=11% Similarity=0.053 Sum_probs=115.5 Q ss_pred Cc---cccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccccc Q lcl|NC_021299. 1 MA---NAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEV 77 (387) Q Consensus 1 Ma---~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (387) |+ -++++|.+++.-+++.-+.-....+++ .+...+.|....++-.+ ..+.+++...++-.....+.-+.. T Consensus 74 mtt~~a~IliP~vis~v~~Eaaepl~~~~kl~-----qk~~L~~Grsm~F~~~g--~~Ra~~IgEGgE~~~~sld~~T~d 146 (393) T protein:vir:79 74 MATPSAQILIPRVIVGTMREAAEPLYIGTKML-----QKIRLKSGQSMIFPSIG--IMRAYDVAEGQEIPEDSIDWQTHE 146 (393) T ss_pred hcCCCcceechhhhhhhhhhcccchhHHHHHH-----HHHhhhcCcceeccchh--eeeeccccccccccccchhhhcCC Confidence 54 457999999988777332222122222 12223345555553333 444454443333344444555566 Q ss_pred eEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc---------c--cc------C Q lcl|NC_021299. 78 TVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEK---------V--SL------V 140 (387) Q Consensus 78 ~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~---------~--~~------~ 140 (387) .+++...| ....+.++|+-..-.--|++...++++.++|+++.+..++...+...+.+ + ++ - T Consensus 147 sv~~~~gK-~G~~Ia~SqEmIsDSg~Dvin~~l~aA~RaMaRkKee~a~n~fk~~ghtvfDa~st~t~ahptGr~~~~~q 225 (393) T protein:vir:79 147 SPEIRVGK-SGIRLRFTDEMISDSQWDLMSMMIKQAGRAMGRHKEQKAYHQFRSHGHTVFDNYSTNKLAHTTGLDKNGVQ 225 (393) T ss_pred ceeEEech-hhhhhhhHHHHhhcchHHHHHHHHHHHHHHHHhhhHHHHHhhhhcccceeeeccccCccceeecCCccccc Confidence 66666633 34778889988888888999999999999999999999998876543321 0 11 0 Q ss_pred CcchhHHHHHHHH-HHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeec----------- Q lcl|NC_021299. 141 DEDEIWNGVVSNR-RWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQ----------- 208 (387) Q Consensus 141 ~~~~~~~~i~~a~-~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g----------- 208 (387) +.....++++++. ....+.. ..-.+++.|-.+..+-+....-...... -++..-+.-...+..| T Consensus 226 NGTlSleDllDm~~av~~~hy---t~svi~MHPLAWnv~AKna~me~~~~na-~gN~~~~~~~ts~algp~~i~~~~~~n 301 (393) T protein:vir:79 226 NDTFSAEDFLDLIIAVMANEY---TPSDLMMHPLAWTVFAKNELMGSLQANP-YGNYPAKGAPSSMALGPDSIQGRLPFN 301 (393) T ss_pred cccccHHHHHHHHHHHhcccC---CcceEEEcCchhhhhhhhhhhcceeecc-ccccCccccchhhhhchhhhccccccc Confidence 1223356777754 3334443 3456888887766655532111000000 0000001001111112 Q ss_pred ceeeeeeccceeeeeeeccccccccccccccccCceeeeeeecccccceeeeeee-------eeeccceeeeeeeeeeee Q lcl|NC_021299. 209 YDVVTVDTLPHGDAYLSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGD-------YDATSTTERSIVDTWIGV 281 (387) Q Consensus 209 ~~v~~s~~~~~~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~-------~d~~~~~~~~~~~~~~g~ 281 (387) ++|..+.-+|.+... ..+.+.... .+......+.| .+............-+|. T Consensus 302 lnv~~sPfvp~d~k~-------------------~rFd~~~Vd-~NnvgvlLV~D~i~tdq~ddk~rdiq~iKl~ERYG~ 361 (393) T protein:vir:79 302 FNVNLSPFIPLDKKS-------------------RRFDVYAVD-RNNVGVLLVRDDLKTDQWDEKARGLQNIKMIERYGI 361 (393) T ss_pred eeEEEeccccccccc-------------------ceeeEEEee-cCCceEEEEecCcceeccccccccceeeeeeeeece Confidence 333333333332220 000111110 11111111111 000000000011111111 Q ss_pred ccccceeeeccceeccccccceeeeeeeecccccccc-cccc Q lcl|NC_021299. 282 KAVLDPVTANLDDEPRFVRGTRIHLKATDAEIEGETV-KAGE 322 (387) Q Consensus 282 ~~~~~~~~~~~~~~~~~v~~~~v~~~~~~~~~~~~~~-~~~~ 322 (387) ...... ........+++...... +.-+ ..|. T Consensus 362 gvLn~g--------kaiavakNI~~~k~y~~--P~~~~~~~~ 393 (393) T protein:vir:79 362 GILNEG--------KAIAVAKNISMDKSYAE--PMLIKNVGN 393 (393) T ss_pred eeeeCC--------ceEEEEecceeeccccc--chhhhccCC Confidence 110000 00000001111000000 0000 0000 No 165 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=91.48 E-value=0.016 Score=30.49 Aligned_cols=278 Identities=15% Similarity=0.087 Sum_probs=104.1 Q ss_pred CccccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccccccccceEE Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDLTEVTVD 80 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (387) -.-..++|+-+..+++..+++...+.+++... . . +.++.+++-....... . .+++..++-.+++-..++ T Consensus 155 ~g~~~~vP~~~~~~i~~~l~~~~~l~~~~~v~--~-~----~g~~~~~~~~~~~~a~-w---v~E~~~~~~~~~~f~~i~ 223 (466) T protein:vir:80 155 SGAELTIPDVMLELLRDNMHRYSKLISKVRLR--P-L----KGTARQNIAGAIPEGV-W---TEAVANLNELSLSFSQIE 223 (466) T ss_pred ccccccccHHHHHHHHHhhhhhhhhhhheeee--e-c----CceeEeeeecCCccee-e---ccccccccccccccccee Confidence 11125789999999999998888877766421 1 1 2234454332211110 1 123334444455555566 Q ss_pred EEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHH-HHhcccccc----------cccCCcch----- Q lcl|NC_021299. 81 IKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSY-LITKAPYEK----------VSLVDEDE----- 144 (387) Q Consensus 81 ~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~-~~~~~~~~~----------~~~~~~~~----- 144 (387) +.+.+. +.-+.++++-+.....++...+.+...++++..+|..++. .-.+.|... ........ T Consensus 224 ~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~ail~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 302 (466) T protein:vir:80 224 VDGYKV-GGFIPIPNSTLEDSDLNLADEILDAIGQAIGFALDKAILYGTGTKMPVGIVTRLAQTTQPPNWGTKAPAWTNL 302 (466) T ss_pred ecceee-eeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhheeeccCCCCcceeeeccccccccccccccccccccc Confidence 555333 2334455554444445677777777889999999988763 111111000 00000000 Q ss_pred h--------------HHHHHHHHHHHh--hccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeec Q lcl|NC_021299. 145 I--------------WNGVVSNRRWLN--EQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQ 208 (387) Q Consensus 145 ~--------------~~~i~~a~~~l~--~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g 208 (387) . +..+.++...+. +........+.++++..+..+.+..-.. ...+......+....+.| T Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~l~~~~~~~-----~~~g~~~~~~~~~~~i~G 377 (466) T protein:vir:80 303 STTNLLKIDPTGKSAEEFFSELVLKLSKARANYSNGMKFWAMSSNTHAVLMSKAITF-----NSAGALVASLNNTMPIVG 377 (466) T ss_pred chhhhhhhhhhccchhhHHHHHHHHHHhhhccccCCceeEEecchhHHHhhcccccc-----cCCccccccCCCcccccc Confidence 0 001111111111 1111112233456777766665432110 111111111111124678 Q ss_pred ceeeeeeccceeeeeeeccccccccccccccccCceeeeeeeccc--ccceeeeeeeeeeccceeeeeeeeeeeeccccc Q lcl|NC_021299. 209 YDVVTVDTLPHGDAYLSHPTAYAMLTRSPGRPMTNTVATSTVATE--NGVQLRWLGDYDATSTTERSIVDTWIGVKAVLD 286 (387) Q Consensus 209 ~~v~~s~~~~~~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~--~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~ 286 (387) .+++.++.+|.+..+......+.+..+ .+..........+ +...+....- T Consensus 378 ~pvv~s~~~~~~~~~~g~~~~y~i~~r-----~~~~i~~~~~~~f~~d~~~~r~~~r----------------------- 429 (466) T protein:vir:80 378 GDIVILDFIPDNDIIGGYGSLYLLAER-----ADIKLAQSEHVRFIEDQTVFKGTAR----------------------- 429 (466) T ss_pred cceeecCccCccceeeeccccEEEEee-----cceEEEechhhhhhcCcEEEEEEEE----------------------- Confidence 888888888765433222222211111 0111110000000 0000000000 Q ss_pred eeeeccceeccccccceeeeeeeeccccccccccccceeEEEeeccCCccccCcce Q lcl|NC_021299. 287 PVTANLDDEPRFVRGTRIHLKATDAEIEGETVKAGEKLALALEDSNGDNRAGDPLV 342 (387) Q Consensus 287 ~~~~~~~~~~~~v~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 342 (387) ..+.......++. +. ++ ++ +......+.++.+.. ..| T Consensus 430 -~dg~~~~~~afv~---~~-------~~--~~--~~~~~~~~~~~~~~~----~~~ 466 (466) T protein:vir:80 430 -YDGKPVFGEGFVA---VN-------IA--NA--NPTTSITFAPDEANV----PEV 466 (466) T ss_pred -EccEEeccCceEE---EE-------ec--CC--CcccceeeecCcCcC----CCC Confidence 0000000011100 00 00 01 111122222221111 111 No 166 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=91.03 E-value=0.018 Score=30.18 Aligned_cols=280 Identities=11% Similarity=-0.011 Sum_probs=108.2 Q ss_pred Cc---cccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccc-cccccc Q lcl|NC_021299. 1 MA---NAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMV-ASDLTE 76 (387) Q Consensus 1 Ma---~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ 76 (387) .+ .-.++|+-+..++++.+++...+..+++.-. -.+....||+......... .+++..+. ..+++- T Consensus 87 ~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~~------~~~~~~~i~~~~~~~~a~~----~~E~~~~~~~~~~~f 156 (390) T protein:vir:40 87 NGFAGVTALLPPTVFERVFEDLTVEHPLLSKINFVN------TTATTEWIISVGDVATAWW----GPLCAEIKEVLDNGF 156 (390) T ss_pred cCcccCcccccHHHHHHHHHHHHhhhhhhhhceeee------cCCceeEEEEEcCCcceee----eccccccCccccccc Confidence 11 1247899999999999999988877775321 1234556655332211111 12222332 234555 Q ss_pred ceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH-Hhcccc--------cc--------ccc Q lcl|NC_021299. 77 VTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYL-ITKAPY--------EK--------VSL 139 (387) Q Consensus 77 ~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~-~~~~~~--------~~--------~~~ 139 (387) +.+++...+. +.-+.++.+-+.....++...+.++..++++.++|..++.- -.+.|. .. ... T Consensus 157 ~~i~l~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~~i~~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~ 235 (390) T protein:vir:40 157 DKIQTGMYKL-SAYIPVCNAMLDLGPSWLDQYVRTILGEAMALGLEAGIVNGSGKDQPIGMMRDLNNVTAGEHPVKTATP 235 (390) T ss_pred eeeEeeeeeE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhcccCCCccceeeeccccccccccccccccc Confidence 6666666433 34456666655556667777788888999999999988731 111110 00 000 Q ss_pred CCcchhHHHHHHHHHHHhhccCC-cCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeecceeeeeeccc Q lcl|NC_021299. 140 VDEDEIWNGVVSNRRWLNEQKVP-KDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLP 218 (387) Q Consensus 140 ~~~~~~~~~i~~a~~~l~~~~vp-~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~ 218 (387) .+.....+.+..+...|.+.... ..+-..+++|..+..+++.-...+ +..| . .+. -....|..++.++.+| T Consensus 236 ~t~~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~~~~~~~-d~~G---~-~v~---~~~~~g~pvv~~~~~p 307 (390) T protein:vir:40 236 LTDLTPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIYAATSYM-TPQG---V-WVT---GILPVPLEIVQSVAVP 307 (390) T ss_pred cchhhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHHHHhhcc-CCCC---c-ccc---ccCCCceeEEEcCCCC Confidence 11111122222333333332211 123456788876544443111100 1111 1 111 1123577888888887 Q ss_pred eeeeeeeccccccccccccccccCceeeeeeeccc--ccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceec Q lcl|NC_021299. 219 HGDAYLSHPTAYAMLTRSPGRPMTNTVATSTVATE--NGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEP 296 (387) Q Consensus 219 ~~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~--~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 296 (387) .+..+..-.+.+.+. ...+..+.......+ ....+......| ....+. . T Consensus 308 ~~~i~~Gd~s~~~i~-----~~~~~~v~~~~~~~f~~~~~~~r~~~r~d------g~v~~~------------------~ 358 (390) T protein:vir:40 308 VGKAVAGRAKDYFMG-----IGSEQVIRTSTEYRLLDDETLYYAKQYAN------GRPKDN------------------S 358 (390) T ss_pred CCcEEEEeeceEEEE-----eecceEEEecchhhhhcCcEEEEEEEEeC------CEEecc------------------c Confidence 653221111111100 000111100000000 000011111000 000000 0 Q ss_pred cccccceeeeeeeeccccccccccccceeEEEeeccCCccccCc Q lcl|NC_021299. 297 RFVRGTRIHLKATDAEIEGETVKAGEKLALALEDSNGDNRAGDP 340 (387) Q Consensus 297 ~~v~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 340 (387) .+ .. +.+....-+ ......++....+...... T Consensus 359 A~-~~--l~~~~~~~~---------~~~~~~~~~~~~~~~~~~~ 390 (390) T protein:vir:40 359 SF-LV--FDITGLEGS---------PAIDVNVVNNATPSETPAE 390 (390) T ss_pred ce-EE--EEeeccCCC---------CCCCcceeeCCCCCCCCCC Confidence 00 00 000000000 0000000000000000000 No 167 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=89.97 E-value=0.023 Score=29.53 Aligned_cols=267 Identities=15% Similarity=0.027 Sum_probs=103.8 Q ss_pred Cccc------cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccccc Q lcl|NC_021299. 1 MANA------FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDL 74 (387) Q Consensus 1 Ma~~------~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (387) |... .++|+-|...+++.+++...+..++++- .- .+..+.||........-. -.+++...+..++ T Consensus 151 ~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~-----~~-~~~~~~~~~~~~~~~~a~---wv~E~~~~~~s~~ 221 (497) T protein:vir:10 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSR-----PV-TSPNLSYLTESAAHNNAA---AVAEAGTYPFSSE 221 (497) T ss_pred hhcccCcccccccchhhhHHHHHHHHhhhhHHhhcccc-----cc-CCCceEEEEEcCCCCcce---eeccCcccccccc Confidence 2211 3678889999999999999888877531 11 234577765322111111 1234455555566 Q ss_pred ccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH-Hhcccc---------cccccC-Cc- Q lcl|NC_021299. 75 TEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYL-ITKAPY---------EKVSLV-DE- 142 (387) Q Consensus 75 ~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~-~~~~~~---------~~~~~~-~~- 142 (387) +-..+++...+... -+.++. ++..+..++...+.++..++++.++|..++.- -.+.+. ...... .. T Consensus 222 ~f~~i~~~~~k~a~-~~~iS~-ell~d~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~ 299 (497) T protein:vir:10 222 EFARVYEQVGKVAN-ALTITD-EGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFG 299 (497) T ss_pred cceeeEeeeeeeEe-ecHhHH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchh Confidence 66666666644332 334544 45556555555556677899999999887731 000000 000000 00 Q ss_pred ----------------------------------------------------chhHHHHHHHHHHHhhccCCcCCcEEEE Q lcl|NC_021299. 143 ----------------------------------------------------DEIWNGVVSNRRWLNEQKVPKDGRVLLV 170 (387) Q Consensus 143 ----------------------------------------------------~~~~~~i~~a~~~l~~~~vp~~~r~~v~ 170 (387) ......+..+...+..... ...-..++ T Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~vm 378 (497) T protein:vir:10 300 ATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF-QTPNAVVM 378 (497) T ss_pred hhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcc-cCCCeEEE Confidence 0000011111111111100 01114678 Q ss_pred chHHHHHHhcc--cc--hhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeee--ccccccccccccccccCce Q lcl|NC_021299. 171 GSAVEEALLLD--DR--FIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLS--HPTAYAMLTRSPGRPMTNT 244 (387) Q Consensus 171 ~~~~~~~l~~~--~~--~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~--~~~a~~~~~~~~~~~~~~t 244 (387) +|..+..|.+. .. +......+... ....+....+.|+.|+.++.+|.+..+.. ....+.+.-+ .+.. T Consensus 379 n~~~~~~l~~lkd~~G~~i~~~~~~~~~--~~~~~~~~~l~G~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r-----~~~~ 451 (497) T protein:vir:10 379 NPRDWELLRLTKDANGQYMGGNFFGNAY--GNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARR-----EGVT 451 (497) T ss_pred chHHHHHHHHhhcCCCceeccCcccccc--cccccCCceeeceeeEecCCCCCCceEEeecccceEEEEEe-----cccE Confidence 88888776542 11 11101100000 00111123677899999988886543210 0000000000 0000 Q ss_pred eeeeee--ccc--ccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeeeeccccccc Q lcl|NC_021299. 245 VATSTV--ATE--NGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKATDAEIEGET 317 (387) Q Consensus 245 ~~~~~~--~~~--~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~~~~~~~~~ 317 (387) ...+.. ..+ +...+......+ . . ...... ...+++... .. .+ T Consensus 452 v~~~~~~~~~f~~n~v~~r~~~r~~------~-------~-----------v~~p~A---~~~l~~~~~---~~-~~ 497 (497) T protein:vir:10 452 MQMTNSNGTDFVDGKVTVRAEERLG------L-------L-----------VYRPSA---FQLIQLKKG---AT-GS 497 (497) T ss_pred EEeecccchhhhcCcEEEEEEEeec------c-------e-----------eecccc---EEEEEecCC---cc-CC Confidence 000000 000 000000000000 0 0 000000 000000000 00 00 No 168 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=89.97 E-value=0.023 Score=29.53 Aligned_cols=267 Identities=15% Similarity=0.027 Sum_probs=103.8 Q ss_pred Cccc------cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccccc Q lcl|NC_021299. 1 MANA------FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDL 74 (387) Q Consensus 1 Ma~~------~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (387) |... .++|+-|...+++.+++...+..++++- .- .+..+.||........-. -.+++...+..++ T Consensus 151 ~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~-----~~-~~~~~~~~~~~~~~~~a~---wv~E~~~~~~s~~ 221 (497) T protein:vir:78 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSR-----PV-TSPNLSYLTESAAHNNAA---AVAEAGTYPFSSE 221 (497) T ss_pred hhcccCcccccccchhhhHHHHHHHHhhhhHHhhcccc-----cc-CCCceEEEEEcCCCCcce---eeccCcccccccc Confidence 2211 3678889999999999999888877531 11 234577765322111111 1234455555566 Q ss_pred ccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH-Hhcccc---------cccccC-Cc- Q lcl|NC_021299. 75 TEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYL-ITKAPY---------EKVSLV-DE- 142 (387) Q Consensus 75 ~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~-~~~~~~---------~~~~~~-~~- 142 (387) +-..+++...+... -+.++. ++..+..++...+.++..++++.++|..++.- -.+.+. ...... .. T Consensus 222 ~f~~i~~~~~k~a~-~~~iS~-ell~d~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~ 299 (497) T protein:vir:78 222 EFARVYEQVGKVAN-ALTITD-EGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFG 299 (497) T ss_pred cceeeEeeeeeeEe-ecHhHH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchh Confidence 66666666644332 334544 45556555555556677899999999887731 000000 000000 00 Q ss_pred ----------------------------------------------------chhHHHHHHHHHHHhhccCCcCCcEEEE Q lcl|NC_021299. 143 ----------------------------------------------------DEIWNGVVSNRRWLNEQKVPKDGRVLLV 170 (387) Q Consensus 143 ----------------------------------------------------~~~~~~i~~a~~~l~~~~vp~~~r~~v~ 170 (387) ......+..+...+..... ...-..++ T Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~vm 378 (497) T protein:vir:78 300 ATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF-QTPNAVVM 378 (497) T ss_pred hhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcc-cCCCeEEE Confidence 0000011111111111100 01114678 Q ss_pred chHHHHHHhcc--cc--hhhhhhcccccceeeeeeEEEEeecceeeeeeccceeeeeee--ccccccccccccccccCce Q lcl|NC_021299. 171 GSAVEEALLLD--DR--FIRYDSAGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLS--HPTAYAMLTRSPGRPMTNT 244 (387) Q Consensus 171 ~~~~~~~l~~~--~~--~~~~~~~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~--~~~a~~~~~~~~~~~~~~t 244 (387) +|..+..|.+. .. +......+... ....+....+.|+.|+.++.+|.+..+.. ....+.+.-+ .+.. T Consensus 379 n~~~~~~l~~lkd~~G~~i~~~~~~~~~--~~~~~~~~~l~G~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r-----~~~~ 451 (497) T protein:vir:78 379 NPRDWELLRLTKDANGQYMGGNFFGNAY--GNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARR-----EGVT 451 (497) T ss_pred chHHHHHHHHhhcCCCceeccCcccccc--cccccCCceeeceeeEecCCCCCCceEEeecccceEEEEEe-----cccE Confidence 88888776542 11 11101100000 00111123677899999988886543210 0000000000 0000 Q ss_pred eeeeee--ccc--ccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeeeeccccccc Q lcl|NC_021299. 245 VATSTV--ATE--NGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKATDAEIEGET 317 (387) Q Consensus 245 ~~~~~~--~~~--~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~~~~~~~~~ 317 (387) ...+.. ..+ +...+......+ . . ...... ...+++... .. .+ T Consensus 452 v~~~~~~~~~f~~n~v~~r~~~r~~------~-------~-----------v~~p~A---~~~l~~~~~---~~-~~ 497 (497) T protein:vir:78 452 MQMTNSNGTDFVDGKVTVRAEERLG------L-------L-----------VYRPSA---FQLIQLKKG---AT-GS 497 (497) T ss_pred EEeecccchhhhcCcEEEEEEEeec------c-------e-----------eecccc---EEEEEecCC---cc-CC Confidence 000000 000 000000000000 0 0 000000 000000000 00 00 No 169 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=89.07 E-value=0.029 Score=29.05 Aligned_cols=270 Identities=11% Similarity=0.027 Sum_probs=103.3 Q ss_pred Ccc------ccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccc-ccc Q lcl|NC_021299. 1 MAN------AFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMV-ASD 73 (387) Q Consensus 1 Ma~------~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~-~~~ 73 (387) |+. .+++|+-+...+. .++....+..+++.- .. ......+|++......... .+.+.... .++ T Consensus 156 ~~~~~~~~~g~lvp~~~~~~i~-~~~~~~~l~~~~~~~---~~---~~~~~~~~~~~~~~~~~~~---~~e~~~~~e~~~ 225 (437) T protein:vir:10 156 VTGIALKDGKVIIPETILTPEK-EVHQFPRLGSLVRTE---SV---TTTTGKLPIFNNSTDLLTA---HTEYGQTTKNAT 225 (437) T ss_pred hhhcccccccccchHHHHHHHH-HhhhhhhhhhcceeE---ee---ccCceeeEEeecccccccc---cccccccccccc Confidence 111 1367888877554 445554554444321 01 1223455544222111111 11222222 233 Q ss_pred cccceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHHHH Q lcl|NC_021299. 74 LTEVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVSNR 153 (387) Q Consensus 74 ~~~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~ 153 (387) +.-..+++...+. +.-+.++.+-+.....++...+.+...++++..+|..++........ .......++++.++. T Consensus 226 ~~~~~v~~~~~k~-~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~g~g~~~~----~~~~~~~~~~~~~~~ 300 (437) T protein:vir:10 226 PVITPILWDLKTY-TGGYVFSQELISDSSYDWQAELQSRLIELRDNTDDSLIITALTDGIK----KTTSTYLLGDLKKVL 300 (437) T ss_pred ccceeeeeehhhe-eeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccc----ccccccchhhHHHHH Confidence 3444555554332 34455666544455567777777888899999999888754322111 122233345555533 Q ss_pred -HHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccc-cceeeeeeEEEEeecceeeeeecc--ceeeeeeecccc Q lcl|NC_021299. 154 -RWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEA-GASRLQTARIGRLAQYDVVTVDTL--PHGDAYLSHPTA 229 (387) Q Consensus 154 -~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~-~~~~~~~g~ig~~~g~~v~~s~~~--~~~~~~~~~~~a 229 (387) ..|+.... .+-..+++|..+..|.+... ..|.- ....+..|..+.+.|+.|+.+..+ |...... T Consensus 301 ~~~l~~~~~--~~~~~~~~~~~~~~l~~lkd-----~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~----- 368 (437) T protein:vir:10 301 NVTLKPQDS--AAASIVMSQSAYNLFDMATD-----AMGRPLLQPNVTAATGYTLLGKTVVIVDDKLFPSASAGD----- 368 (437) T ss_pred Hhhhhhhhh--cCCEEEEcHHHHHHHHHhhc-----cCCCeeeccCccCCCCcccccceeEEecccccCCcCCCc----- Confidence 24443322 23456899999888765311 01110 011233455567899998876543 2211100 Q ss_pred ccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeee Q lcl|NC_021299. 230 YAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKAT 309 (387) Q Consensus 230 ~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~ 309 (387) ..+..+... ..+ ......+..+.+..+++.......... -.+ ........++ .++.... T Consensus 369 ~~~~~gd~~------~~~-~~~~r~~~~~~~~~~~~~~~~~~~~~~--r~d---------~~~~~~~a~~---~l~~~~~ 427 (437) T protein:vir:10 369 VNIVVAPLK------KAV-INFKLTEITGQFQDTYDIWYKQLGIFL--RQN---------VVQASKDLIV---NLTGKLK 427 (437) T ss_pred eEEEEeecc------ccE-EEEeeeceEEEEecccccccceeeEEE--EEc---------cEEecccceE---EEEeecc Confidence 000000000 000 000001111111111111110000000 000 0000000000 0000000 Q ss_pred ecccc-cccc Q lcl|NC_021299. 310 DAEIE-GETV 318 (387) Q Consensus 310 ~~~~~-~~~~ 318 (387) ..+.. ..++ T Consensus 428 ~~~~~~~~~~ 437 (437) T protein:vir:10 428 AVTVVQSTAV 437 (437) T ss_pred ccccCCCCCC Confidence 00000 0011 No 170 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=88.69 E-value=0.031 Score=28.87 Aligned_cols=258 Identities=9% Similarity=0.012 Sum_probs=100.4 Q ss_pred CccccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccc-ccccccceE Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMV-ASDLTEVTV 79 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 79 (387) +....++|+-+...+.+. .+...+..++..- .. .+....+|++......... .+++.... ..++.-..+ T Consensus 138 ~~~~~~vp~~~~~~i~~~-~~~~~l~~~~~~~---~~---~~~~~~~~~~~~~~~~~~~---~~E~~~~~~~~~~~~~~i 207 (397) T protein:vir:96 138 VEGGALIPQELLQPQLEP-KDIVDLSKYVRSV---PV---NSASGKFPVISKSGSKMAT---VQQLEKNPQLANPKMVEI 207 (397) T ss_pred cccccchhHHHHHHHHHh-hhhhhHHHhhhhc---cc---cccceeEEEEeccCCcccc---ccccccccccccccccce Confidence 444457788888777764 3333333333211 01 1223444443322111111 11122221 234445556 Q ss_pred EEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccCCcchhHHHHHHHHHHHhhc Q lcl|NC_021299. 80 DIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYEKVSLVDEDEIWNGVVSNRRWLNEQ 159 (387) Q Consensus 80 ~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~~~ 159 (387) ++.+.+. +.-+.++.+-+.....++...+.+...++++...+..++...... .......|+++.++....... T Consensus 208 ~~~~~~~-~~~~~~s~ell~ds~~~l~~~i~~~l~~~~~~~~~~~i~~g~g~~------~~~~~~~~d~~~~~~~~~~~~ 280 (397) T protein:vir:96 208 DYSVATR-RGYIPISQEMIDDASYDVTGLIADEIQDQSLNTKNADIAAVLKTA------TAKSVVGVDGLKDLINKEIKK 280 (397) T ss_pred eecHhHh-hcchhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc------ccccccchHHHHHHHHHhhhh Confidence 6655332 334445544333345566666667777888888888777432211 112334577777664432222 Q ss_pred cCCcCCcEEEEchHHHHHHhcccchhhhhhccccc-ceeeeeeEEEEeecceeeeeeccceeeeeeeccccccccccccc Q lcl|NC_021299. 160 KVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAG-ASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTAYAMLTRSPG 238 (387) Q Consensus 160 ~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~-~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a~~~~~~~~~ 238 (387) ..+-..|++|..+..|.+... ..|.-. ...+..+..+.+.|+.|+.++......... ...+..+.. T Consensus 281 ---~~~a~~v~n~~~~~~l~~lkd-----~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~----~~~~~~gd~- 347 (397) T protein:vir:96 281 ---VYDVKLFISASMYSELDKLKD-----KNGRYLLQDSITAASGKQLLGKEVVVLDDDVIGKSVG----NVVGFIGDA- 347 (397) T ss_pred ---hcCcEEEEcHHHHHHHHHhhc-----cCCCeEeccCccCCCcccccccceEEecccccCCCCC----ceEEEEeeh- Confidence 224567899999988865311 111100 112334455678899887665432221100 000000000 Q ss_pred cccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceeeeccceeccccccceeeeeee Q lcl|NC_021299. 239 RPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKAT 309 (387) Q Consensus 239 ~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~ 309 (387) ... + ......+....+.. ....... .... ........... . ...+.++.. T Consensus 348 --~~~---~-~~~~~~~~~~~~~~--~~~~~~~-~~~~---------~r~d~~~~~~~-a--~~~~~~~~a 397 (397) T protein:vir:96 348 --KAF---A-SFFDRKQVSVSWVD--NNIYGQL-LAGI---------IRYDVKATDKK-A--GFYVTFTIG 397 (397) T ss_pred --hcc---e-EeEeecceEEEEec--cccccee-EEEE---------EEEccEEeccc-c--eEEEEeecC Confidence 000 0 00000011111100 0000000 0000 00000000000 0 011111110 No 171 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=78.39 E-value=0.11 Score=25.74 Aligned_cols=252 Identities=9% Similarity=-0.036 Sum_probs=101.1 Q ss_pred Ccc---ccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccc-ccccc Q lcl|NC_021299. 1 MAN---AFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVA-SDLTE 76 (387) Q Consensus 1 Ma~---~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 76 (387) .+. -.++|+-+..++++.|.+...+.+++++- . . +....|++....... ... +....+.. .+++- T Consensus 82 ~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~--~-~----~~~~~i~~~~~~~~a-~wv---~e~~~~~~~~~~~f 150 (377) T protein:vir:96 82 VGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFK--N-T----SLRLKALTAETSGTA-VWG---DIFGEIKGQLKQAF 150 (377) T ss_pred CCCCCCceecCHHHHHHHHHHHHhhhhhhhhceeE--e-c----CCceEEEEecCCcce-eEe---ecccccccccCccc Confidence 111 24889999999999999998888887642 1 1 234566654332211 111 11122221 23333 Q ss_pred ceEEEEEEeeeecceeeccHHHhh-hhhhHHHHHHHHHHHHHHHHHHHHHHH-HHhcccccc-----------cc----- Q lcl|NC_021299. 77 VTVDIKLTDVIYNRIDLTDEEREL-DVRSFAVDVLPRQVRAVAEQIEDAVSY-LITKAPYEK-----------VS----- 138 (387) Q Consensus 77 ~~~~~~id~~~~~~~~~~d~~~~~-~~~~~~~~~~~~~~~~la~~vd~~~~~-~~~~~~~~~-----------~~----- 138 (387) ..+. +..++...+..-..++.. ...++..-+.++..++++..++..++. .-.+.|... .. T Consensus 151 ~~i~--l~~~kl~~~~~is~~ll~ds~~~le~~i~~~l~~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~ 228 (377) T protein:vir:96 151 KEQD--FSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDIT 228 (377) T ss_pred eeEe--eeeeeEEeechhhHHHhhcchhhHHHHHHHHHHHHHHHHHhhceEeccCCCcceeeeecccccccccccccccc Confidence 4444 444555444444444444 555666667777888999999988763 111100000 00 Q ss_pred -------------cCCcchhHHHHHHHHHHHhhcc--CC---cCCcEEEEchHHHHHHhcccchhhhhhcccccceeeee Q lcl|NC_021299. 139 -------------LVDEDEIWNGVVSNRRWLNEQK--VP---KDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQT 200 (387) Q Consensus 139 -------------~~~~~~~~~~i~~a~~~l~~~~--vp---~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~ 200 (387) ..+.+..++.+..+...+...+ -| ..+-..+++|..+..+.....+. .. + T Consensus 229 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~--~~----------~ 296 (377) T protein:vir:96 229 TYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSR--NQ----------F 296 (377) T ss_pred ceeeccccccccccCChhHHHHHHHHHHHhhccccccccccccCceEEEEchhhHHhcccccccc--CC----------C Confidence 0111122233333444443321 11 12335778888766553221111 11 1 Q ss_pred eEEEEee--cceeeeeeccceeeeeeeccccccccccccccccCceeeeeeeccc--ccceeeeeeeeeeccceeeeeee Q lcl|NC_021299. 201 ARIGRLA--QYDVVTVDTLPHGDAYLSHPTAYAMLTRSPGRPMTNTVATSTVATE--NGVQLRWLGDYDATSTTERSIVD 276 (387) Q Consensus 201 g~ig~~~--g~~v~~s~~~~~~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~--~~~~~~~~~~~d~~~~~~~~~~~ 276 (387) |....+. +..+..+..+|.+..+..-.+.+.+.-+ .+........... ....+....-.| ...++ T Consensus 297 G~~~~~l~~p~~v~~s~~~p~~~i~fgdf~~Y~i~~r-----~~~~i~~~~~~~~~~d~~~f~~~~r~d------G~~~d 365 (377) T protein:vir:96 297 GEYVTVLPHGITILESLAVETGKAIAFVANRYDAFMA-----TASTIEEYDQTFAMEDLQLYLTKNYFY------GKAKD 365 (377) T ss_pred CCceeccCCCceEEecCCCCcccEEEEEcCcEEEEEe-----cccEEEeehhhhhhcCCeEEEEEEEEc------CEEec Confidence 2222333 3345666666654332221122222111 1111111111000 111111111111 11110 Q ss_pred eeeeeccccceeeeccceeccccccceeeee Q lcl|NC_021299. 277 TWIGVKAVLDPVTANLDDEPRFVRGTRIHLK 307 (387) Q Consensus 277 ~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~ 307 (387) ..+ ..+..++.. T Consensus 366 ~~a-------------------~~vl~l~~~ 377 (377) T protein:vir:96 366 NHT-------------------AALLTLAGG 377 (377) T ss_pred CCc-------------------EEEEEEecC Confidence 000 000011000 No 172 >protein:vir:2770 Length: 318 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612887;genbank:gi:20065804;genbank:GeneID:935710 Probab=78.14 E-value=0.12 Score=25.68 Aligned_cols=226 Identities=10% Similarity=-0.019 Sum_probs=105.2 Q ss_pred CccccccHHHHHHHHHHHHHhhccccce--------eeecccccccccCCCEEEEEecccceeeceeccccccccccccc Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHELVLPNF--------VFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVAS 72 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~~~~~~~--------~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~ 72 (387) +.|+-. --+|+..+-..-.+..-+..+ +.|- .|+.-..||+|++.........-..-. ......-+ T Consensus 22 ~~~~~~-vk~ws~~l~~~~~~~~~~~~~~g~~~~~~I~r~--~dL~K~~GD~Vtf~L~~~L~g~gv~Gd---~~lEGnee 95 (318) T protein:vir:27 22 NRNRSM-VNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRI--TDLNKQAGDEVTFSIMHKLSKRPTMGD---ERVEGRGE 95 (318) T ss_pred hcCChH-HHHHHHhhhhHHHhhhhhhcccCCCCCceEEEe--ccCCCCCccEEEEeEeeccccCccccC---ceeecccc Confidence 333321 135665433322222222222 2222 234445799999976543322211100 11112234 Q ss_pred ccccceEEEEEEeeeecceeeccH-HHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccc----------------- Q lcl|NC_021299. 73 DLTEVTVDIKLTDVIYNRIDLTDE-ERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPY----------------- 134 (387) Q Consensus 73 ~~~~~~~~~~id~~~~~~~~~~d~-~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~----------------- 134 (387) .+.-.+..|.||+.. .++...+. +.--...|++.+.......-+++..|+-.+-.+.++.. T Consensus 96 ~L~~~~d~l~IDq~r-~~V~~gg~msqqRt~~dlR~~ar~~L~~w~~~~~Dq~~~v~laGarg~~~n~~~~~p~~~~~~~ 174 (318) T protein:vir:27 96 DLSHADFSLKINQGR-HLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEF 174 (318) T ss_pred ceEEEeeEEEEeeec-cccccccchhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccceEecccCccc Confidence 555567788887654 55554432 11224455555555555555666666655433332221 Q ss_pred ------cccc----------------cCCcc--hhHHHHHHHHHHHhhccCC-------cCC-------cEEEEchHHHH Q lcl|NC_021299. 135 ------EKVS----------------LVDED--EIWNGVVSNRRWLNEQKVP-------KDG-------RVLLVGSAVEE 176 (387) Q Consensus 135 ------~~~~----------------~~~~~--~~~~~i~~a~~~l~~~~vp-------~~~-------r~~v~~~~~~~ 176 (387) .... .++.. ..++.|-.++..++...-| .+. +++++.|.++. T Consensus 175 ~~~~~N~v~aPt~~r~~~~g~at~~~~l~stD~~s~~lid~~~~~~~~~a~pi~PV~v~g~~~~~~~~~yV~~~~p~q~~ 254 (318) T protein:vir:27 175 KKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWN 254 (318) T ss_pred hhhhhcccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceeeccccccCCcceEEEEechHHHH Confidence 0000 01111 1233455666777653333 111 56679999999 Q ss_pred HHhcccc---hhhhhh----cccccceeeeeeEEEEeecceeeeeeccceeeeeeeccccccccccccccccCceeeeee Q lcl|NC_021299. 177 ALLLDDR---FIRYDS----AGEAGASRLQTARIGRLAQYDVVTVDTLPHGDAYLSHPTAYAMLTRSPGRPMTNTVATST 249 (387) Q Consensus 177 ~l~~~~~---~~~~~~----~g~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~ 249 (387) .|..+.. |.+.++ -+......+-.|.+|.+.|+-+.+...+|-=.. .+... .+.- T Consensus 255 ~Lrtdt~~~~w~d~q~~A~~r~~g~knPLF~G~~gm~ngvil~~~~~vpIrf~-----------~G~~v-------~~~~ 316 (318) T protein:vir:27 255 DWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFY-----------QGQRF-------WYQR 316 (318) T ss_pred HHhhcCCCHHHHHHHHHHHhcccccCCCceecceeeecCEEEeecCCccEEEc-----------CCCee-------eeee Confidence 9998753 444332 111234567889999999988777665543211 11000 0000 Q ss_pred ec Q lcl|NC_021299. 250 VA 251 (387) Q Consensus 250 ~~ 251 (387) .. T Consensus 317 ~~ 318 (318) T protein:vir:27 317 IT 318 (318) T ss_pred cC Confidence 00 No 173 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=74.68 E-value=0.16 Score=25.02 Aligned_cols=262 Identities=13% Similarity=0.019 Sum_probs=104.9 Q ss_pred Cc-----cccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccc--eeeceecccccccccccccc Q lcl|NC_021299. 1 MA-----NAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPT--IAHTRGLRATGADRNMVASD 73 (387) Q Consensus 1 Ma-----~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~--~~~~~~~~~~~~~~~~~~~~ 73 (387) |. --.+.||.+. ++++.+.+...|..+++.. ... .+++..|+..+.. ...... ..+......-.+ T Consensus 19 ~t~~d~~Gg~l~P~~~~-~~i~~~~e~s~~l~~~~vi--~~~---~~~~~~i~~~g~~~~~~~g~~--~~~~~~~~~~~~ 90 (315) T protein:vir:41 19 IDVPDLGRGVLSVDRFG-EFVKAVRDSAVIIPEARID--NAL---KSYEKDISRLSLVLDVGPGRD--ETGQKLAPPEST 90 (315) T ss_pred cCCcCCCCceechHHHH-HHHHHHHhhhhhhhhceee--ecc---ccccccccccccCcccccccc--cccCcCCCCCCc Confidence 22 2247899875 5778888888887776531 000 0122223222111 000000 011112222234 Q ss_pred cccceEEEEEEeeeecceeeccHHHhhhhh--hHHHHHHHHHHHHHHHHHHHHHHHHHh-----------c----ccccc Q lcl|NC_021299. 74 LTEVTVDIKLTDVIYNRIDLTDEERELDVR--SFAVDVLPRQVRAVAEQIEDAVSYLIT-----------K----APYEK 136 (387) Q Consensus 74 ~~~~~~~~~id~~~~~~~~~~d~~~~~~~~--~~~~~~~~~~~~~la~~vd~~~~~~~~-----------~----~~~~~ 136 (387) ++-.++++...+. +..+.++++.+...+. ++...+....+.+++.+.+...+.-=. + +.... T Consensus 91 ~~f~~~~l~~~~l-~~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nGdg~s~~p~~~~~~G~l~~a~~~~ 169 (315) T protein:vir:41 91 AEVKTNTLYMREM-VTKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYLHGDTSSSDPLLRMSDGWLKLASEKL 169 (315) T ss_pred cccceeeeceeee-eeeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhhccCCcCcCccccccccceecccccc Confidence 4455555555333 2345677765554433 677777777788888877766553200 0 00000 Q ss_pred -cc---cCCcchhHHHHHHHHHHHhhccCCc-CCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEeeccee Q lcl|NC_021299. 137 -VS---LVDEDEIWNGVVSNRRWLNEQKVPK-DGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLAQYDV 211 (387) Q Consensus 137 -~~---~~~~~~~~~~i~~a~~~l~~~~vp~-~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~g~~v 211 (387) .. ........+.+.++...|...--.. .+-..+++.+....+.+...- +.... ....+..|....+.|+.| T Consensus 170 ~~~~~~~~a~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~rklk~~-~g~~l---w~~~~~~g~~~tl~G~PV 245 (315) T protein:vir:41 170 TESDVDPEAEDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYRDALKG-RETGL---GDQALTGANSILYDGRPV 245 (315) T ss_pred cccccccccccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHHHHhcc-CCCcc---ccchhhcCCCceecccce Confidence 00 0111223455666666554432111 123457888888776542110 01111 123345555567889888 Q ss_pred eeeeccceeee----eeec-cccccccccccccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccc Q lcl|NC_021299. 212 VTVDTLPHGDA----YLSH-PTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLD 286 (387) Q Consensus 212 ~~s~~~~~~~~----~~~~-~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~ 286 (387) +..+.+|.... +.+. ...+.+. .........+.+............ T Consensus 246 ~~~~~m~~~~~~~~~ilf~d~~nl~~~--------------------~~~~i~i~~~~~a~~~~~~~~~~~--------- 296 (315) T protein:vir:41 246 QYVPALEALNDGKSRALFVVPTQLVYG--------------------FWRNIKVVPDYDAEMRLTKYVASL--------- 296 (315) T ss_pred EecccccccCCCCccEEEecccceEEE--------------------eccccEEEeeecCCCCceEEEEEE--------- Confidence 87776654211 1111 0001000 001111111111111110000000 Q ss_pred eeeeccc-eecccccccee Q lcl|NC_021299. 287 PVTANLD-DEPRFVRGTRI 304 (387) Q Consensus 287 ~~~~~~~-~~~~~v~~~~v 304 (387) ...+... .....+...+| T Consensus 297 r~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 297 RTDNHYEDEEGAVSATITV 315 (315) T ss_pred EeceeEEeccceeEeeeeC Confidence 0000000 00111111111 No 174 >protein:vir:10123 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859253;genbank:gi:32171009;genbank:GeneID:2653345 Probab=72.81 E-value=0.18 Score=24.69 Aligned_cols=299 Identities=10% Similarity=0.013 Sum_probs=119.3 Q ss_pred CccccccHHHHHHHHHHHHHhhcccc--------ceeeecccccccccCCCEEEEEecccceeeceeccccccccccccc Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHELVLP--------NFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVAS 72 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~~~~~--------~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~ 72 (387) +.|+-.. -+|+..+...-....-+. ..+-|- .|+.-..||+|+++...........- .......-+ T Consensus 22 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~--~dL~K~aGd~vtf~L~~~L~g~gv~G---d~~lEGnee 95 (404) T protein:vir:10 22 NRNRSMV-NILTEQQEAPKAVSPDKKSTKQTSAGAPVVRI--TDLNKQAGDEVTFSIMHKLSKRPTMG---DERVEGRGE 95 (404) T ss_pred hcCChhH-hhhhhhhhhhhhhccchhhccCCCCCccEEEe--ecCCCCCCcEEEEeEeeecccCCccc---Cceeecccc Confidence 4443322 234332221111111111 111121 23444579999997654433221110 011122335 Q ss_pred ccccceEEEEEEeeeecceeeccH-HHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccc---------------- Q lcl|NC_021299. 73 DLTEVTVDIKLTDVIYNRIDLTDE-ERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYE---------------- 135 (387) Q Consensus 73 ~~~~~~~~~~id~~~~~~~~~~d~-~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~---------------- 135 (387) .+.-.+..|.||+.- .+++..++ ..--...|++.+.......-+++..|+-.+-.+.++... T Consensus 96 ~L~~~s~~i~Idq~r-~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~n~~~~vp~~~~~~~ 174 (404) T protein:vir:10 96 DLSHADFSLKINQGR-HLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEF 174 (404) T ss_pred ceeEEeeEEEEeeec-ccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeeccccccc Confidence 566677888887754 55554432 122345566666656666667777777766444433210 Q ss_pred -------ccc-----------c-----CCc--chhHHHHHHHHHHHhhccCCcC--------------CcEEEEchHHHH Q lcl|NC_021299. 136 -------KVS-----------L-----VDE--DEIWNGVVSNRRWLNEQKVPKD--------------GRVLLVGSAVEE 176 (387) Q Consensus 136 -------~~~-----------~-----~~~--~~~~~~i~~a~~~l~~~~vp~~--------------~r~~v~~~~~~~ 176 (387) +.. + ++. ...++.|-.+++.+++..-|.. -+++++.|.++. T Consensus 175 ~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~~~~yV~~~~p~q~~ 254 (404) T protein:vir:10 175 KKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWN 254 (404) T ss_pred cceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccCccceEEEEechHHHH Confidence 000 0 000 1123445567777766444421 156779999999 Q ss_pred HHhcccc---hhhhhhc---c-cccceeeeeeEEEEeecceeeeeeccceee----eeeecccccccccc--------cc Q lcl|NC_021299. 177 ALLLDDR---FIRYDSA---G-EAGASRLQTARIGRLAQYDVVTVDTLPHGD----AYLSHPTAYAMLTR--------SP 237 (387) Q Consensus 177 ~l~~~~~---~~~~~~~---g-~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~----~~~~~~~a~~~~~~--------~~ 237 (387) .|..|.. |.+.+.. + ......+-.|..|.+.|+-+.+....|.-. .+............ .- T Consensus 255 ~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~v~R 334 (404) T protein:vir:10 255 DWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDR 334 (404) T ss_pred HHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCccccccccccccccchh Confidence 9999864 3333331 1 123456778889999998777655443211 11110000000000 00 Q ss_pred ccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceee-eccceeccccccceeeeeeeecccccc Q lcl|NC_021299. 238 GRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVT-ANLDDEPRFVRGTRIHLKATDAEIEGE 316 (387) Q Consensus 238 ~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~-~~~~~~~~~v~~~~v~~~~~~~~~~~~ 316 (387) ....|+.....+.....+..+.|.....-..+.....+....|..-...... .....-+..+.-..+. T Consensus 335 allLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idta~~----------- 403 (404) T protein:vir:10 335 AMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVK----------- 403 (404) T ss_pred heeecceeEEEEeeccCCCCceeEeeccccCchhhhhhHHHhhhhhccccCCCCceeeEEEEEeccccc----------- Confidence 1112221111111122333444433211111111111111111111110000 0000000000000000 Q ss_pred cc Q lcl|NC_021299. 317 TV 318 (387) Q Consensus 317 ~~ 318 (387) | T Consensus 404 -~ 404 (404) T protein:vir:10 404 -L 404 (404) T ss_pred -C Confidence 0 No 175 >protein:vir:104439 Length: 404 # NCBI annotation: putative virion structural protein # Family: family:all:974 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794063;genbank:gi:116222008;genbank:GeneID:4397504 Probab=72.81 E-value=0.18 Score=24.69 Aligned_cols=299 Identities=10% Similarity=0.013 Sum_probs=119.3 Q ss_pred CccccccHHHHHHHHHHHHHhhcccc--------ceeeecccccccccCCCEEEEEecccceeeceeccccccccccccc Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHELVLP--------NFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVAS 72 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~~~~~--------~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~ 72 (387) +.|+-.. -+|+..+...-....-+. ..+-|- .|+.-..||+|+++...........- .......-+ T Consensus 22 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~--~dL~K~aGd~vtf~L~~~L~g~gv~G---d~~lEGnee 95 (404) T protein:vir:10 22 NRNRSMV-NILTEQQEAPKAVSPDKKSTKQTSAGAPVVRI--TDLNKQAGDEVTFSIMHKLSKRPTMG---DERVEGRGE 95 (404) T ss_pred hcCChhH-hhhhhhhhhhhhhccchhhccCCCCCccEEEe--ecCCCCCCcEEEEeEeeecccCCccc---Cceeecccc Confidence 4443322 234332221111111111 111121 23444579999997654433221110 011122335 Q ss_pred ccccceEEEEEEeeeecceeeccH-HHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccc---------------- Q lcl|NC_021299. 73 DLTEVTVDIKLTDVIYNRIDLTDE-ERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYE---------------- 135 (387) Q Consensus 73 ~~~~~~~~~~id~~~~~~~~~~d~-~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~---------------- 135 (387) .+.-.+..|.||+.- .+++..++ ..--...|++.+.......-+++..|+-.+-.+.++... T Consensus 96 ~L~~~s~~i~Idq~r-~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~n~~~~vp~~~~~~~ 174 (404) T protein:vir:10 96 DLSHADFSLKINQGR-HLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEF 174 (404) T ss_pred ceeEEeeEEEEeeec-ccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeeccccccc Confidence 566677888887754 55554432 122345566666656666667777777766444433210 Q ss_pred -------ccc-----------c-----CCc--chhHHHHHHHHHHHhhccCCcC--------------CcEEEEchHHHH Q lcl|NC_021299. 136 -------KVS-----------L-----VDE--DEIWNGVVSNRRWLNEQKVPKD--------------GRVLLVGSAVEE 176 (387) Q Consensus 136 -------~~~-----------~-----~~~--~~~~~~i~~a~~~l~~~~vp~~--------------~r~~v~~~~~~~ 176 (387) +.. + ++. ...++.|-.+++.+++..-|.. -+++++.|.++. T Consensus 175 ~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~~~~yV~~~~p~q~~ 254 (404) T protein:vir:10 175 KKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWN 254 (404) T ss_pred cceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccCccceEEEEechHHHH Confidence 000 0 000 1123445567777766444421 156779999999 Q ss_pred HHhcccc---hhhhhhc---c-cccceeeeeeEEEEeecceeeeeeccceee----eeeecccccccccc--------cc Q lcl|NC_021299. 177 ALLLDDR---FIRYDSA---G-EAGASRLQTARIGRLAQYDVVTVDTLPHGD----AYLSHPTAYAMLTR--------SP 237 (387) Q Consensus 177 ~l~~~~~---~~~~~~~---g-~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~----~~~~~~~a~~~~~~--------~~ 237 (387) .|..|.. |.+.+.. + ......+-.|..|.+.|+-+.+....|.-. .+............ .- T Consensus 255 ~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~v~R 334 (404) T protein:vir:10 255 DWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDR 334 (404) T ss_pred HHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCccccccccccccccchh Confidence 9999864 3333331 1 123456778889999998777655443211 11110000000000 00 Q ss_pred ccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceee-eccceeccccccceeeeeeeecccccc Q lcl|NC_021299. 238 GRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVT-ANLDDEPRFVRGTRIHLKATDAEIEGE 316 (387) Q Consensus 238 ~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~-~~~~~~~~~v~~~~v~~~~~~~~~~~~ 316 (387) ....|+.....+.....+..+.|.....-..+.....+....|..-...... .....-+..+.-..+. T Consensus 335 allLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idta~~----------- 403 (404) T protein:vir:10 335 AMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVK----------- 403 (404) T ss_pred heeecceeEEEEeeccCCCCceeEeeccccCchhhhhhHHHhhhhhccccCCCCceeeEEEEEeccccc----------- Confidence 1112221111111122333444433211111111111111111111110000 0000000000000000 Q ss_pred cc Q lcl|NC_021299. 317 TV 318 (387) Q Consensus 317 ~~ 318 (387) | T Consensus 404 -~ 404 (404) T protein:vir:10 404 -L 404 (404) T ss_pred -C Confidence 0 No 176 >protein:vir:3298 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049514;genbank:gi:9632520;genbank:GeneID:1262006 Probab=72.81 E-value=0.18 Score=24.69 Aligned_cols=299 Identities=10% Similarity=0.013 Sum_probs=119.3 Q ss_pred CccccccHHHHHHHHHHHHHhhcccc--------ceeeecccccccccCCCEEEEEecccceeeceeccccccccccccc Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHELVLP--------NFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVAS 72 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~~~~~--------~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~ 72 (387) +.|+-.. -+|+..+...-....-+. ..+-|- .|+.-..||+|+++...........- .......-+ T Consensus 22 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~--~dL~K~aGd~vtf~L~~~L~g~gv~G---d~~lEGnee 95 (404) T protein:vir:32 22 NRNRSMV-NILTEQQEAPKAVSPDKKSTKQTSAGAPVVRI--TDLNKQAGDEVTFSIMHKLSKRPTMG---DERVEGRGE 95 (404) T ss_pred hcCChhH-hhhhhhhhhhhhhccchhhccCCCCCccEEEe--ecCCCCCCcEEEEeEeeecccCCccc---Cceeecccc Confidence 4443322 234332221111111111 111121 23444579999997654433221110 011122335 Q ss_pred ccccceEEEEEEeeeecceeeccH-HHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccc---------------- Q lcl|NC_021299. 73 DLTEVTVDIKLTDVIYNRIDLTDE-ERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYE---------------- 135 (387) Q Consensus 73 ~~~~~~~~~~id~~~~~~~~~~d~-~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~---------------- 135 (387) .+.-.+..|.||+.- .+++..++ ..--...|++.+.......-+++..|+-.+-.+.++... T Consensus 96 ~L~~~s~~i~Idq~r-~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~n~~~~vp~~~~~~~ 174 (404) T protein:vir:32 96 DLSHADFSLKINQGR-HLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEF 174 (404) T ss_pred ceeEEeeEEEEeeec-ccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeeccccccc Confidence 566677888887754 55554432 122345566666656666667777777766444433210 Q ss_pred -------ccc-----------c-----CCc--chhHHHHHHHHHHHhhccCCcC--------------CcEEEEchHHHH Q lcl|NC_021299. 136 -------KVS-----------L-----VDE--DEIWNGVVSNRRWLNEQKVPKD--------------GRVLLVGSAVEE 176 (387) Q Consensus 136 -------~~~-----------~-----~~~--~~~~~~i~~a~~~l~~~~vp~~--------------~r~~v~~~~~~~ 176 (387) +.. + ++. ...++.|-.+++.+++..-|.. -+++++.|.++. T Consensus 175 ~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~~~~yV~~~~p~q~~ 254 (404) T protein:vir:32 175 KKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWN 254 (404) T ss_pred cceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccCccceEEEEechHHHH Confidence 000 0 000 1123445567777766444421 156779999999 Q ss_pred HHhcccc---hhhhhhc---c-cccceeeeeeEEEEeecceeeeeeccceee----eeeecccccccccc--------cc Q lcl|NC_021299. 177 ALLLDDR---FIRYDSA---G-EAGASRLQTARIGRLAQYDVVTVDTLPHGD----AYLSHPTAYAMLTR--------SP 237 (387) Q Consensus 177 ~l~~~~~---~~~~~~~---g-~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~----~~~~~~~a~~~~~~--------~~ 237 (387) .|..|.. |.+.+.. + ......+-.|..|.+.|+-+.+....|.-. .+............ .- T Consensus 255 ~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~v~R 334 (404) T protein:vir:32 255 DWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDR 334 (404) T ss_pred HHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCccccccccccccccchh Confidence 9999864 3333331 1 123456778889999998777655443211 11110000000000 00 Q ss_pred ccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceee-eccceeccccccceeeeeeeecccccc Q lcl|NC_021299. 238 GRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVT-ANLDDEPRFVRGTRIHLKATDAEIEGE 316 (387) Q Consensus 238 ~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~-~~~~~~~~~v~~~~v~~~~~~~~~~~~ 316 (387) ....|+.....+.....+..+.|.....-..+.....+....|..-...... .....-+..+.-..+. T Consensus 335 allLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idta~~----------- 403 (404) T protein:vir:32 335 AMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVK----------- 403 (404) T ss_pred heeecceeEEEEeeccCCCCceeEeeccccCchhhhhhHHHhhhhhccccCCCCceeeEEEEEeccccc----------- Confidence 1112221111111122333444433211111111111111111111110000 0000000000000000 Q ss_pred cc Q lcl|NC_021299. 317 TV 318 (387) Q Consensus 317 ~~ 318 (387) | T Consensus 404 -~ 404 (404) T protein:vir:32 404 -L 404 (404) T ss_pred -C Confidence 0 No 177 >protein:vir:819 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050552;genbank:gi:9633449;genbank:GeneID:1262254 Probab=72.81 E-value=0.18 Score=24.69 Aligned_cols=299 Identities=10% Similarity=0.013 Sum_probs=119.3 Q ss_pred CccccccHHHHHHHHHHHHHhhcccc--------ceeeecccccccccCCCEEEEEecccceeeceeccccccccccccc Q lcl|NC_021299. 1 MANAFIKPPVIIASILGQLQHELVLP--------NFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVAS 72 (387) Q Consensus 1 Ma~~~~~pe~~~~~~~~~l~~~~~~~--------~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~ 72 (387) +.|+-.. -+|+..+...-....-+. ..+-|- .|+.-..||+|+++...........- .......-+ T Consensus 22 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~--~dL~K~aGd~vtf~L~~~L~g~gv~G---d~~lEGnee 95 (404) T protein:vir:81 22 NRNRSMV-NILTEQQEAPKAVSPDKKSTKQTSAGAPVVRI--TDLNKQAGDEVTFSIMHKLSKRPTMG---DERVEGRGE 95 (404) T ss_pred hcCChhH-hhhhhhhhhhhhhccchhhccCCCCCccEEEe--ecCCCCCCcEEEEeEeeecccCCccc---Cceeecccc Confidence 4443322 234332221111111111 111121 23444579999997654433221110 011122335 Q ss_pred ccccceEEEEEEeeeecceeeccH-HHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccc---------------- Q lcl|NC_021299. 73 DLTEVTVDIKLTDVIYNRIDLTDE-ERELDVRSFAVDVLPRQVRAVAEQIEDAVSYLITKAPYE---------------- 135 (387) Q Consensus 73 ~~~~~~~~~~id~~~~~~~~~~d~-~~~~~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~~~~---------------- 135 (387) .+.-.+..|.||+.- .+++..++ ..--...|++.+.......-+++..|+-.+-.+.++... T Consensus 96 ~L~~~s~~i~Idq~r-~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~n~~~~vp~~~~~~~ 174 (404) T protein:vir:81 96 DLSHADFSLKINQGR-HLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEF 174 (404) T ss_pred ceeEEeeEEEEeeec-ccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeeccccccc Confidence 566677888887754 55554432 122345566666656666667777777766444433210 Q ss_pred -------ccc-----------c-----CCc--chhHHHHHHHHHHHhhccCCcC--------------CcEEEEchHHHH Q lcl|NC_021299. 136 -------KVS-----------L-----VDE--DEIWNGVVSNRRWLNEQKVPKD--------------GRVLLVGSAVEE 176 (387) Q Consensus 136 -------~~~-----------~-----~~~--~~~~~~i~~a~~~l~~~~vp~~--------------~r~~v~~~~~~~ 176 (387) +.. + ++. ...++.|-.+++.+++..-|.. -+++++.|.++. T Consensus 175 ~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~~~~yV~~~~p~q~~ 254 (404) T protein:vir:81 175 KKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWN 254 (404) T ss_pred cceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccCccceEEEEechHHHH Confidence 000 0 000 1123445567777766444421 156779999999 Q ss_pred HHhcccc---hhhhhhc---c-cccceeeeeeEEEEeecceeeeeeccceee----eeeecccccccccc--------cc Q lcl|NC_021299. 177 ALLLDDR---FIRYDSA---G-EAGASRLQTARIGRLAQYDVVTVDTLPHGD----AYLSHPTAYAMLTR--------SP 237 (387) Q Consensus 177 ~l~~~~~---~~~~~~~---g-~~~~~~~~~g~ig~~~g~~v~~s~~~~~~~----~~~~~~~a~~~~~~--------~~ 237 (387) .|..|.. |.+.+.. + ......+-.|..|.+.|+-+.+....|.-. .+............ .- T Consensus 255 ~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~v~R 334 (404) T protein:vir:81 255 DWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDR 334 (404) T ss_pred HHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCccccccccccccccchh Confidence 9999864 3333331 1 123456778889999998777655443211 11110000000000 00 Q ss_pred ccccCceeeeeeecccccceeeeeeeeeeccceeeeeeeeeeeeccccceee-eccceeccccccceeeeeeeecccccc Q lcl|NC_021299. 238 GRPMTNTVATSTVATENGVQLRWLGDYDATSTTERSIVDTWIGVKAVLDPVT-ANLDDEPRFVRGTRIHLKATDAEIEGE 316 (387) Q Consensus 238 ~~~~~~t~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~-~~~~~~~~~v~~~~v~~~~~~~~~~~~ 316 (387) ....|+.....+.....+..+.|.....-..+.....+....|..-...... .....-+..+.-..+. T Consensus 335 allLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idta~~----------- 403 (404) T protein:vir:81 335 AMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVK----------- 403 (404) T ss_pred heeecceeEEEEeeccCCCCceeEeeccccCchhhhhhHHHhhhhhccccCCCCceeeEEEEEeccccc----------- Confidence 1112221111111122333444433211111111111111111111110000 0000000000000000 Q ss_pred cc Q lcl|NC_021299. 317 TV 318 (387) Q Consensus 317 ~~ 318 (387) | T Consensus 404 -~ 404 (404) T protein:vir:81 404 -L 404 (404) T ss_pred -C Confidence 0 No 178 >protein:vir:105610 Length: 430 # NCBI annotation: virion structural protein # Family: family:all:974 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164307;genbank:gi:56692923;genbank:GeneID:3197221 Probab=67.48 E-value=0.25 Score=23.87 Aligned_cols=305 Identities=12% Similarity=0.118 Sum_probs=116.0 Q ss_pred Cccc--------cccHHHHHHHHHHHHHhhccc-cce------------------------eeecccccccccCCCEEEE Q lcl|NC_021299. 1 MANA--------FIKPPVIIASILGQLQHELVL-PNF------------------------VFKNGYGDVAHKFNDTITI 47 (387) Q Consensus 1 Ma~~--------~~~pe~~~~~~~~~l~~~~~~-~~~------------------------~~~d~~~~~~~~~gdtv~i 47 (387) |--. -....+|++.+-..-.+...| ..+ +.|- .|+.-..||+|++ T Consensus 1 ~~~a~T~~~~~~p~a~~~ws~~l~~~~~k~~~~~~kl~G~~~~~~~~~~~~~~~~ts~~~pI~r~--~dL~K~~GD~Vtf 78 (430) T protein:vir:10 1 MTASKTTMRYGDPNAMIQQAAGLFALCQGRNSTLNRLTGKMPSGTSDAEKKTKGQSSLELPIVQA--QDLGRNKGDEVRF 78 (430) T ss_pred CcceeeecccCChhHHHHHHHHHHHHHhhhhhhHHHhhccccccccchhhhccCCCCCCccEEEe--ccCCCCCccEEEE Confidence 4321 123457876553322221111 001 2222 2344347999999 Q ss_pred EecccceeeceecccccccccccccccccceEEEEEEeeeecceeeccH-HHhhhhhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021299. 48 RIPVPTIAHTRGLRATGADRNMVASDLTEVTVDIKLTDVIYNRIDLTDE-ERELDVRSFAVDVLPRQVRAVAEQIEDAVS 126 (387) Q Consensus 48 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~id~~~~~~~~~~d~-~~~~~~~~~~~~~~~~~~~~la~~vd~~~~ 126 (387) .........-..-. ......-+.+.-.+..|+||+.- .++...+. ..--...|++.+.......=+++..|+-.+ T Consensus 79 ~L~~~L~g~gv~Gd---~~lEGnee~L~~~~d~l~IDq~R-~~V~~gg~msqQRt~~dlR~~ar~~L~~w~~~~~Dq~~~ 154 (430) T protein:vir:10 79 HFVQPANAFPIMGS---EYAEGKGTGLKIGSDQLRVNQAR-FPVDLGDVMSQIRNPYDLRRLGRPKAKWFMDAYLDQSML 154 (430) T ss_pred eEeeccccCceecC---ceeeccccceEEEeeEEEEeeec-cccccCCchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHH Confidence 76443322111100 01112224455567788888764 46665542 112233444444444444444444444333 Q ss_pred HHHhcc-----------------------------cccc------ccc------------CC--cchhHHHHHHHHHHHh Q lcl|NC_021299. 127 YLITKA-----------------------------PYEK------VSL------------VD--EDEIWNGVVSNRRWLN 157 (387) Q Consensus 127 ~~~~~~-----------------------------~~~~------~~~------------~~--~~~~~~~i~~a~~~l~ 157 (387) --+.++ |... +.+ ++ -...++.|-.++..++ T Consensus 155 v~laGarg~~~~~~~~~~~~~~~~~~~~~~N~v~aPt~nrh~~~~G~at~~~~~~~~~~sl~stD~~s~~~id~a~~~a~ 234 (430) T protein:vir:10 155 VHLAGARGNHYNKEWCLPLETHPKLADMLVNRVKAPTKNRHFVASADAITGVAPNAGEYNITTADVLDVDVVDSIATYMD 234 (430) T ss_pred HHHhhhhcccccccccccccCCcchhhhhccccCCCCCceeEeecccccccccccccccchhhhcccCHHHHHHHHHHHH Confidence 222211 1110 100 01 1123455666777777 Q ss_pred hccCC-------cCC-------cEEEEchHHHHHHhcccchhhhhh-----cccccceeeeeeEEEEeecceeeeeeccc Q lcl|NC_021299. 158 EQKVP-------KDG-------RVLLVGSAVEEALLLDDRFIRYDS-----AGEAGASRLQTARIGRLAQYDVVTVDTLP 218 (387) Q Consensus 158 ~~~vp-------~~~-------r~~v~~~~~~~~l~~~~~~~~~~~-----~g~~~~~~~~~g~ig~~~g~~v~~s~~~~ 218 (387) ....| .+. +++++.|.++..|..+..|..++. ........+-.|.+|.+.|+-+++....- T Consensus 235 ~~~~~i~Pv~v~gd~~~g~~~~yV~~~~p~q~~~Lr~dt~~~~wq~~~~a~a~~g~~nPlF~G~~gm~ngvii~~~~~vi 314 (430) T protein:vir:10 235 QIELPPPPVKFEGDEAAEDSPIRVLLCSPAQYNSFAKQEKFRSWQAAALARASNAKQHPIFRVDAGLWSNTLIIKMPKPI 314 (430) T ss_pred hhCCCCcceEeecccccCCccEEEEEechHHHHHHhhCcchHHHHHHHHHhhcccccCCceecceeeecCeEEecCCcee Confidence 65433 112 567799999999999988765432 22223467778999999998777653220 Q ss_pred ---eeeeeeec-------------ccccccc-ccccccccCceeeeeeec--ccccceeeeeeeeeeccceeeeeeeeee Q lcl|NC_021299. 219 ---HGDAYLSH-------------PTAYAML-TRSPGRPMTNTVATSTVA--TENGVQLRWLGDYDATSTTERSIVDTWI 279 (387) Q Consensus 219 ---~~~~~~~~-------------~~a~~~~-~~~~~~~~~~t~~~~~~~--~~~~~~~~~~~~~d~~~~~~~~~~~~~~ 279 (387) .+....+. +..+... ...-....++.....+.. ..++..+.|.....-..+.....+.... T Consensus 315 rf~~g~~~~~~a~~~~~~~~~~~~~a~~~~~~~v~RalllGaQA~~~A~g~~~~~g~~f~w~Ee~~D~g~~~~i~~~~i~ 394 (430) T protein:vir:10 315 RFYAGDTIKYCAAYNSEAESSAVVSDSFGNQYAVDRALLLGGQALAQAWAASEHSGMPFFWSEKDMDHGDKLELLIGAIL 394 (430) T ss_pred eecCCCccccccCCcccccccccccccccccccchhhhhccchhheeeeeccCCCCcceeeeeeccccCchhhhhhhHHh Confidence 00000000 0000000 000011122211111111 1244445554421111111111111111 Q ss_pred eeccccceeeecc----ceeccccccceeeeeeeeccccccccccccceeEEEeeccCCccccCc Q lcl|NC_021299. 280 GVKAVLDPVTANL----DDEPRFVRGTRIHLKATDAEIEGETVKAGEKLALALEDSNGDNRAGDP 340 (387) Q Consensus 280 g~~~~~~~~~~~~----~~~~~~v~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 340 (387) |..-.......+. .+-+..+.-..+. ..++. + T Consensus 395 G~kK~rF~~~~~~~~~~~DfGvi~idtaa~-------------------------~~~~~----~ 430 (430) T protein:vir:10 395 GCSKIRFAVEATNGLEYTDHGVMAIDTAVK-------------------------IIGPR----K 430 (430) T ss_pred ccceeeecCCCCCCceeeeeEEEEhhhhhh-------------------------hhcCC----C Confidence 1111100000000 0000000000000 00000 0 No 179 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=66.10 E-value=0.27 Score=23.68 Aligned_cols=263 Identities=10% Similarity=-0.045 Sum_probs=104.9 Q ss_pred Cc------cccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccc-cc Q lcl|NC_021299. 1 MA------NAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVA-SD 73 (387) Q Consensus 1 Ma------~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~-~~ 73 (387) |. --.+.|+-+..++++.|++...+.++++.- ... | ...|++........ .. +....+.- .+ T Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~---~~~---~-~~~i~~~~~~~~a~-w~---~e~~~~~~~~~ 144 (381) T protein:vir:10 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIK---NAG---L-RLKFLKSETSGVAV-WG---KIYGEIKGQLD 144 (381) T ss_pred HhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeE---ecC---c-ceEEEEecCCccee-ee---ccccccccccc Confidence 11 124789999999999999999998887532 111 3 34565543222111 11 11111211 12 Q ss_pred cccceEEEEEEeeeecceeeccHHHhh-hhhhHHHHHHHHHHHHHHHHHHHHHHH-HHhcccccc---------cc---- Q lcl|NC_021299. 74 LTEVTVDIKLTDVIYNRIDLTDEEREL-DVRSFAVDVLPRQVRAVAEQIEDAVSY-LITKAPYEK---------VS---- 138 (387) Q Consensus 74 ~~~~~~~~~id~~~~~~~~~~d~~~~~-~~~~~~~~~~~~~~~~la~~vd~~~~~-~~~~~~~~~---------~~---- 138 (387) ++-.+ +++.-++...+..-..++.. ...++...+.++..++++..+|..++. .-.+.|... .+ T Consensus 145 ~~f~~--i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~ 222 (381) T protein:vir:10 145 AAFSE--ETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAY 222 (381) T ss_pred cccee--eeecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEeccCCCCceeeeeccCcccccccccc Confidence 23334 44444554444444444444 455677777777889999999887652 111111000 00 Q ss_pred ----------cCCcchhHHHHHHHHHHHhhc-----cCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEE Q lcl|NC_021299. 139 ----------LVDEDEIWNGVVSNRRWLNEQ-----KVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARI 203 (387) Q Consensus 139 ----------~~~~~~~~~~i~~a~~~l~~~-----~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~i 203 (387) .......++.+.++...|... ..+..+..++++|..+..+.....+.. . +|.. T Consensus 223 ~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~~-----~-------~G~~ 290 (381) T protein:vir:10 223 PEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN-----A-------NGVY 290 (381) T ss_pred cccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccCC-----C-------CCce Confidence 001112244455444444322 123444567889988777654322111 0 1111 Q ss_pred EEe--ecceeeeeeccceeeeeeeccccccccccccccccCceeeeeeecccc--cceeeeeeeeeeccceeeeeeeeee Q lcl|NC_021299. 204 GRL--AQYDVVTVDTLPHGDAYLSHPTAYAMLTRSPGRPMTNTVATSTVATEN--GVQLRWLGDYDATSTTERSIVDTWI 279 (387) Q Consensus 204 g~~--~g~~v~~s~~~~~~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~--~~~~~~~~~~d~~~~~~~~~~~~~~ 279 (387) ... +|..++.++.+|.+..+..-.+.+.+.-+ .+............ ...+.... +.+...++. T Consensus 291 v~~l~~g~~vv~s~~~p~~~iifgDfs~Y~i~~r-----~~~~i~~~~~~~~~~d~~~f~a~~------r~dg~~~~~-- 357 (381) T protein:vir:10 291 VTALPFNLNVIESTVQEAGKVLTYVKGLYDGYLA-----GGINVQKFKETLALDDMDLYTAKQ------FAYGKAKDN-- 357 (381) T ss_pred eecCCCCceEEecCCCCcCcEEEEecccEEEEEe-----cccEEEeechhHhhcCCeEEEEEE------EEcCEEecC-- Confidence 111 34556777767654322111111111111 11111110000000 00011100 000111000 Q ss_pred eeccccceeeeccceeccccccceeeee--eeecccccccc Q lcl|NC_021299. 280 GVKAVLDPVTANLDDEPRFVRGTRIHLK--ATDAEIEGETV 318 (387) Q Consensus 280 g~~~~~~~~~~~~~~~~~~v~~~~v~~~--~~~~~~~~~~~ 318 (387) ....+..++.. +......+.++ T Consensus 358 -----------------~A~~v~~l~~~~~~~~~~~~~~~~ 381 (381) T protein:vir:10 358 -----------------KVAAVWKLDLKGHKPALEGTEETL 381 (381) T ss_pred -----------------ceEEEEEEEecCCCcCcccccccC Confidence 00111111110 00011111111 No 180 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=66.10 E-value=0.27 Score=23.68 Aligned_cols=263 Identities=10% Similarity=-0.045 Sum_probs=104.9 Q ss_pred Cc------cccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceecccccccccccc-cc Q lcl|NC_021299. 1 MA------NAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVA-SD 73 (387) Q Consensus 1 Ma------~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~-~~ 73 (387) |. --.+.|+-+..++++.|++...+.++++.- ... | ...|++........ .. +....+.- .+ T Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~---~~~---~-~~~i~~~~~~~~a~-w~---~e~~~~~~~~~ 144 (381) T protein:vir:95 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIK---NAG---L-RLKFLKSETSGVAV-WG---KIYGEIKGQLD 144 (381) T ss_pred HhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeE---ecC---c-ceEEEEecCCccee-ee---ccccccccccc Confidence 11 124789999999999999999998887532 111 3 34565543222111 11 11111211 12 Q ss_pred cccceEEEEEEeeeecceeeccHHHhh-hhhhHHHHHHHHHHHHHHHHHHHHHHH-HHhcccccc---------cc---- Q lcl|NC_021299. 74 LTEVTVDIKLTDVIYNRIDLTDEEREL-DVRSFAVDVLPRQVRAVAEQIEDAVSY-LITKAPYEK---------VS---- 138 (387) Q Consensus 74 ~~~~~~~~~id~~~~~~~~~~d~~~~~-~~~~~~~~~~~~~~~~la~~vd~~~~~-~~~~~~~~~---------~~---- 138 (387) ++-.+ +++.-++...+..-..++.. ...++...+.++..++++..+|..++. .-.+.|... .+ T Consensus 145 ~~f~~--i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~ 222 (381) T protein:vir:95 145 AAFSE--ETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAY 222 (381) T ss_pred cccee--eeecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEeccCCCCceeeeeccCcccccccccc Confidence 23334 44444554444444444444 455677777777889999999887652 111111000 00 Q ss_pred ----------cCCcchhHHHHHHHHHHHhhc-----cCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEE Q lcl|NC_021299. 139 ----------LVDEDEIWNGVVSNRRWLNEQ-----KVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARI 203 (387) Q Consensus 139 ----------~~~~~~~~~~i~~a~~~l~~~-----~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~i 203 (387) .......++.+.++...|... ..+..+..++++|..+..+.....+.. . +|.. T Consensus 223 ~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~~-----~-------~G~~ 290 (381) T protein:vir:95 223 PEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN-----A-------NGVY 290 (381) T ss_pred cccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccCC-----C-------CCce Confidence 001112244455444444322 123444567889988777654322111 0 1111 Q ss_pred EEe--ecceeeeeeccceeeeeeeccccccccccccccccCceeeeeeecccc--cceeeeeeeeeeccceeeeeeeeee Q lcl|NC_021299. 204 GRL--AQYDVVTVDTLPHGDAYLSHPTAYAMLTRSPGRPMTNTVATSTVATEN--GVQLRWLGDYDATSTTERSIVDTWI 279 (387) Q Consensus 204 g~~--~g~~v~~s~~~~~~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~--~~~~~~~~~~d~~~~~~~~~~~~~~ 279 (387) ... +|..++.++.+|.+..+..-.+.+.+.-+ .+............ ...+.... +.+...++. T Consensus 291 v~~l~~g~~vv~s~~~p~~~iifgDfs~Y~i~~r-----~~~~i~~~~~~~~~~d~~~f~a~~------r~dg~~~~~-- 357 (381) T protein:vir:95 291 VTALPFNLNVIESTVQEAGKVLTYVKGLYDGYLA-----GGINVQKFKETLALDDMDLYTAKQ------FAYGKAKDN-- 357 (381) T ss_pred eecCCCCceEEecCCCCcCcEEEEecccEEEEEe-----cccEEEeechhHhhcCCeEEEEEE------EEcCEEecC-- Confidence 111 34556777767654322111111111111 11111110000000 00011100 000111000 Q ss_pred eeccccceeeeccceeccccccceeeee--eeecccccccc Q lcl|NC_021299. 280 GVKAVLDPVTANLDDEPRFVRGTRIHLK--ATDAEIEGETV 318 (387) Q Consensus 280 g~~~~~~~~~~~~~~~~~~v~~~~v~~~--~~~~~~~~~~~ 318 (387) ....+..++.. +......+.++ T Consensus 358 -----------------~A~~v~~l~~~~~~~~~~~~~~~~ 381 (381) T protein:vir:95 358 -----------------KVAAVWKLDLKGHKPALEGTEETL 381 (381) T ss_pred -----------------ceEEEEEEEecCCCcCcccccccC Confidence 00111111110 00011111111 No 181 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=62.65 E-value=0.33 Score=23.22 Aligned_cols=255 Identities=14% Similarity=0.114 Sum_probs=113.4 Q ss_pred Cccc--cccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEec-ccceeecee--cccccccccccccccc Q lcl|NC_021299. 1 MANA--FIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIP-VPTIAHTRG--LRATGADRNMVASDLT 75 (387) Q Consensus 1 Ma~~--~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~-~~~~~~~~~--~~~~~~~~~~~~~~~~ 75 (387) =++. .+.|+ |-+-.++.+++.-.+.++..+ + -.+|.|+.-|+- +..++..+. -....++..+.+..++ T Consensus 136 Tgd~~~~i~~~-~v~d~i~li~q~r~i~slf~t-----L-P~~g~T~eY~v~t~~~tV~~q~~~~kqa~EGd~L~~gKl~ 208 (410) T protein:vir:83 136 TGDLQGVIPDP-IVGPVIDFIDSARPLVSTLGT-----L-PLNNATFYRPIVSQRPAVGLQGVAGGASDEKTELDSQKMV 208 (410) T ss_pred ccccccccchh-HhhhHHHHHhhccchhhhhhh-----C-CCCCCeeEEeeeccccccccccccccccccccccccccee Confidence 1222 34566 777788888877777666532 2 234777655432 222222121 1223356678888888 Q ss_pred cceEEEEEEeeeecceeeccHHHhhhhhhHHHHHHHHHHHHH----HHHHHH----HHHHHHhcccccccccCCcchhHH Q lcl|NC_021299. 76 EVTVDIKLTDVIYNRIDLTDEERELDVRSFAVDVLPRQVRAV----AEQIED----AVSYLITKAPYEKVSLVDEDEIWN 147 (387) Q Consensus 76 ~~~~~~~id~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~l----a~~vd~----~~~~~~~~~~~~~~~~~~~~~~~~ 147 (387) -++.+..|+.+-.... ++-. .+..-.-..+.-++++| |+..++ .+...+.++. .....+.++-.. T Consensus 209 ~~t~tA~ikTyGGyt~-LSRQ----~IERs~v~~L~~~lraL~~AYA~atea~vra~L~~t~t~~~--a~~~~Tad~~~~ 281 (410) T protein:vir:83 209 IDRLTVNAKTLGGYVN-VSRQ----AIDFSSPSALDLVVNGLGQQYAIETEALVGAALASTSTGAV--GYGNATADNVAS 281 (410) T ss_pred eeeccceeehhcCccc-ccce----eeecCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhh--hhhhccHHHHHH Confidence 8888888865543221 1111 11111111112222222 222222 2222222221 111123334445 Q ss_pred HHHHHHHHHhhccCCcCCcEEEEchHHHHHHhcccchhhhhhcccc----cceeeeeeEEEEeecceeeeeeccceeeee Q lcl|NC_021299. 148 GVVSNRRWLNEQKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEA----GASRLQTARIGRLAQYDVVTVDTLPHGDAY 223 (387) Q Consensus 148 ~i~~a~~~l~~~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~----~~~~~~~g~ig~~~g~~v~~s~~~~~~~~~ 223 (387) .|.++....+++.--..-+++.++|+....+. +.|....-.+.. +...+..|..|.+.+..|.+....+.+... T Consensus 282 ~i~da~~~v~da~~~~~~~~i~vS~DVl~~~~--~~f~~~~~~~~dt~Gfg~~~lg~gi~G~~~~ipVvm~~~a~AgTA~ 359 (410) T protein:vir:83 282 AIWQAAGAVYTAVKGMGRLVIAIAPDVLGDFG--PLFAPVNPTNAHSTGFEAGRFGQGVMGSISGIPVVMSAALGSGDAY 359 (410) T ss_pred HHHHHHHHHhhhhccceeeeEEechhhhhhcc--ceeeccCCCCcccccccccccccchhhhhcccceEEecCCCcCeee Confidence 56677777777632234578899999854443 233322221111 222333666788899999999888888777 Q ss_pred eeccccccccccccccccCceeeeeeecccccceeeeeeeee---eccceeeeeeeeeeee Q lcl|NC_021299. 224 LSHPTAYAMLTRSPGRPMTNTVATSTVATENGVQLRWLGDYD---ATSTTERSIVDTWIGV 281 (387) Q Consensus 224 ~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~d---~~~~~~~~~~~~~~g~ 281 (387) .+.+.++...-... +.+.... +.......+|. .........+-...|. T Consensus 360 f~~~~Ai~~~eS~~-----gp~qL~d-----~~i~nLt~~ySgY~a~a~~~~~gliPv~g~ 410 (410) T protein:vir:83 360 LFSTAAIECFEQRV-----GTLQVVE-----PSVFGLQVAYAGYFSTLVVNEDAIVPLVGS 410 (410) T ss_pred EeccceeeeeecCC-----ceeEeeC-----CchhhhhhhheeeeeeccccccceeeeccC Confidence 76665554322110 0111111 11111111110 0000000000000000 No 182 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=53.16 E-value=0.54 Score=22.07 Aligned_cols=270 Identities=10% Similarity=-0.013 Sum_probs=97.9 Q ss_pred Cc-cccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccc-ccccccce Q lcl|NC_021299. 1 MA-NAFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMV-ASDLTEVT 78 (387) Q Consensus 1 Ma-~~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 78 (387) -+ .-.++|+-+..++++.|++..++.+++++- .. +..+.|++......... . .....+. -.+++-.. T Consensus 91 ~~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v~---~~----~~~~~i~~~~~~~~a~w-~---~e~~~~~~~~~~~f~~ 159 (395) T protein:vir:95 91 GYTDEKILPETVVERVFDDLQKDHPLLSKINFQ---NA----GIKTRVIKADPAGQAVW-G---KVFGEIKGQLDAAFRE 159 (395) T ss_pred CCCCceeccHHHHHHHHHHHHhhhhhhhhceeE---ec----CCceEEEEecCCcceEE-e---ecccccCcccccccee Confidence 11 113789999999999999999988887642 11 23456665433222111 0 0111111 12333344 Q ss_pred EEEEEEeeeecceeeccHHHhh-hhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcc----cccc--------c--ccC--C Q lcl|NC_021299. 79 VDIKLTDVIYNRIDLTDEEREL-DVRSFAVDVLPRQVRAVAEQIEDAVSYLITKA----PYEK--------V--SLV--D 141 (387) Q Consensus 79 ~~~~id~~~~~~~~~~d~~~~~-~~~~~~~~~~~~~~~~la~~vd~~~~~~~~~~----~~~~--------~--~~~--~ 141 (387) +++.. ++...+..-..++.. ...++...+.+...++++.++|+.++.- .+. |... . ... . T Consensus 160 i~l~~--~kl~~~~~iS~ell~ds~~~ie~~i~~~la~~ia~~~~~a~i~G-~G~~~~qP~Gil~~~~~~~~~~~~~~~~ 236 (395) T protein:vir:95 160 ENFTQ--YKLTCFVVLPDDLSTFGPAWIERFVRTQIQEAISVALESAIING-GGAAKTQPVGLMKDVNTNSGAVTDKASS 236 (395) T ss_pred eeece--eeEEEeecccHHHHhcchhHHHHHHHHHHHHHHHHHHhhheeec-cCCCCcCceeeeeccccccccccccccc Confidence 44444 444444433444444 4566666777778899999999877621 111 1100 0 000 0 Q ss_pred cchhHHH-------HHHHHHHHhh-----ccCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEe--e Q lcl|NC_021299. 142 EDEIWNG-------VVSNRRWLNE-----QKVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRL--A 207 (387) Q Consensus 142 ~~~~~~~-------i~~a~~~l~~-----~~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~--~ 207 (387) ....++. +.++...|.. ......+...+++|..+..+....-+.. ..|..... + T Consensus 237 ~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~~~~~g~~~~~~------------~~G~~~~~lg~ 304 (395) T protein:vir:95 237 GTLTFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDSWDVQARYTYLT------------ANGGFVTVLPY 304 (395) T ss_pred chhhhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhhhhcCCcceecc------------CCCcceeccCC Confidence 0011222 2222222210 0011223345677766554433221111 11222233 3 Q ss_pred cceeeeeeccceeeeeeeccccccccccccccccCceeeeeeecccc--cceeeeeeeeeeccceeeeeeeeeeeecccc Q lcl|NC_021299. 208 QYDVVTVDTLPHGDAYLSHPTAYAMLTRSPGRPMTNTVATSTVATEN--GVQLRWLGDYDATSTTERSIVDTWIGVKAVL 285 (387) Q Consensus 208 g~~v~~s~~~~~~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~--~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~ 285 (387) |..++.++.+|.+..+..-.+.+.+.. ..+............ ...+.... +.+....+ T Consensus 305 g~~v~~~~~~p~~~i~fgdfs~y~i~~-----r~~~~i~~~~~~~~~~d~~~f~~~~------r~dg~~~~--------- 364 (395) T protein:vir:95 305 NVTIITSEFVPEGKLVAFVTDRYNAVR-----GGGLTVKKFDQTLALEDAVLFTAKT------FAYGQPDD--------- 364 (395) T ss_pred cceEEEcCCCCCCcEEEEecccEEEEE-----ecceEEEeccchhhhCCcEEEEEEE------EECCEEec--------- Confidence 555777777775432211111111100 000000000000000 00000000 00000000 Q ss_pred ceeeeccceeccccccceeeeeeeeccccccccccccceeEEEee Q lcl|NC_021299. 286 DPVTANLDDEPRFVRGTRIHLKATDAEIEGETVKAGEKLALALED 330 (387) Q Consensus 286 ~~~~~~~~~~~~~v~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~ 330 (387) ........++....... +...+.+..-.+.+ T Consensus 365 ----------~~A~~~l~i~~~~~~~~----~~~~~~~~~~~~~~ 395 (395) T protein:vir:95 365 ----------NKASAVYDLKVASAPRR----QTSAGGTTDGIAEA 395 (395) T ss_pred ----------cccEEEEEeeccCCCCC----CCCCCCCCCccccC Confidence 00011111111100000 00000000000000 No 183 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=40.72 E-value=0.96 Score=20.68 Aligned_cols=267 Identities=10% Similarity=-0.037 Sum_probs=101.4 Q ss_pred Ccc--ccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccc-ccccccc Q lcl|NC_021299. 1 MAN--AFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMV-ASDLTEV 77 (387) Q Consensus 1 Ma~--~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 77 (387) +.- -.+.|+-+..++++.|.+...+..+++.- . . +....|++......... . .....+. -.+++-+ T Consensus 80 t~~~Gg~lvP~~~~~~I~~~l~~~spir~~a~v~--~-~----~~~~~i~~~~~~~~a~W-~---~e~~~~~~~~~~~f~ 148 (381) T protein:vir:10 80 VGYKEEKLLPEETIDRIFEDLTTNHPLLADLGIK--N-A----GLRLKFLKSETSGVAVW-G---KIYGEIKGQLDAAFS 148 (381) T ss_pred CCCCCceecCHHHHHHHHHHHHhhcceeeeeeeE--e-c----CcceEEEeecCCcceEE-e---ecccccccccCccce Confidence 221 15789999999999999999888887532 1 1 23345665443322211 0 1111111 1122333 Q ss_pred eEEEEEEeeeecceeeccHHHhhh-hhhHHHHHHHHHHHHHHHHHHHHHH-HHHhcccccc----------cccC----- Q lcl|NC_021299. 78 TVDIKLTDVIYNRIDLTDEERELD-VRSFAVDVLPRQVRAVAEQIEDAVS-YLITKAPYEK----------VSLV----- 140 (387) Q Consensus 78 ~~~~~id~~~~~~~~~~d~~~~~~-~~~~~~~~~~~~~~~la~~vd~~~~-~~~~~~~~~~----------~~~~----- 140 (387) . +++..++...+..-..++..+ ..++...+..+..++++..++..++ +.-.+.|... .... T Consensus 149 ~--i~l~~~kl~a~i~is~elL~Ds~~~le~~i~~~la~~~a~~~~~afi~GdG~~qP~Gil~~~~~~~~~~~g~~~~~~ 226 (381) T protein:vir:10 149 E--ETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTDGAYPEKE 226 (381) T ss_pred e--EeecceeEEeeccccHHHHhccHHHHHHHHHHHHHHHHHHHhhceeEecccCCCceeeeecCCcccccccccccccc Confidence 4 444445544444444455444 5566666667778899999888765 2111111100 0000 Q ss_pred --------CcchhHHHHHHHHHHHhhc-----cCCcCCcEEEEchHHHHHHhcccchhhhhhcccccceeeeeeEEEEee Q lcl|NC_021299. 141 --------DEDEIWNGVVSNRRWLNEQ-----KVPKDGRVLLVGSAVEEALLLDDRFIRYDSAGEAGASRLQTARIGRLA 207 (387) Q Consensus 141 --------~~~~~~~~i~~a~~~l~~~-----~vp~~~r~~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~~~~g~ig~~~ 207 (387) .....++.+......+... ..+..+.+++++|..+..+.+...+.. ..| . .+ . .--+ T Consensus 227 ~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~~~~~~~--~~G---~-~v-~---~lp~ 296 (381) T protein:vir:10 227 EQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN--ANG---V-YV-T---ALPF 296 (381) T ss_pred ccccccccchhhHHHHHHHHHHhhhhhhccccccccCceEEEEchhhHHhhccccccCC--CCC---c-ee-e---cCCC Confidence 0001122222221122111 122345677899988877754322211 111 1 00 0 0114 Q ss_pred cceeeeeeccceeeeeeeccccccccccccccccCceeeeeeeccccc--ceeeeeeeeeeccceeeeeeeeeeeecccc Q lcl|NC_021299. 208 QYDVVTVDTLPHGDAYLSHPTAYAMLTRSPGRPMTNTVATSTVATENG--VQLRWLGDYDATSTTERSIVDTWIGVKAVL 285 (387) Q Consensus 208 g~~v~~s~~~~~~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~~~--~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~ 285 (387) |..++.++.+|.+..+..-.+.+.+. ...+............. ..+.... ..+...++. T Consensus 297 g~~vv~~~~~p~~~i~fGDfs~Y~i~-----~r~~~~i~~~~~~~~~~d~~~f~a~~------r~dG~~~~~-------- 357 (381) T protein:vir:10 297 NLNVIESTVQEAGKVLTYVKGLYDGY-----LAGGINVQKFKETLALDDMDLYTAKQ------FAYGKAKDN-------- 357 (381) T ss_pred CceeEEcCCCCcCcEEEEEcccEEEE-----EecccEEEeechhhhhcCceEEEEEE------EEcCEEecC-------- Confidence 66677777777543221111111111 11111111110000000 0011000 000111100 Q ss_pred ceeeeccceeccccccceeeeeeeeccccccccccccce Q lcl|NC_021299. 286 DPVTANLDDEPRFVRGTRIHLKATDAEIEGETVKAGEKL 324 (387) Q Consensus 286 ~~~~~~~~~~~~~v~~~~v~~~~~~~~~~~~~~~~~~~~ 324 (387) ....+..+.+......++ +..+++ T Consensus 358 -----------~A~~v~~l~~~~~~~~~~----~~~~~~ 381 (381) T protein:vir:10 358 -----------KVAAVWKLDLKGHKPALE----DTEETL 381 (381) T ss_pred -----------CcEEEEEEeecCCccccc----cccccC Confidence 000111111100000000 001111 No 184 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=25.08 E-value=2.1 Score=18.82 Aligned_cols=252 Identities=9% Similarity=-0.038 Sum_probs=92.8 Q ss_pred Cc-----c-ccccHHHHHHHHHHHHHhhccccceeeecccccccccCCCEEEEEecccceeeceeccccccccccccccc Q lcl|NC_021299. 1 MA-----N-AFIKPPVIIASILGQLQHELVLPNFVFKNGYGDVAHKFNDTITIRIPVPTIAHTRGLRATGADRNMVASDL 74 (387) Q Consensus 1 Ma-----~-~~~~pe~~~~~~~~~l~~~~~~~~~~~~d~~~~~~~~~gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (387) |. + -.++|+-+..++++.|.+...+..+++.- . . .| .+++|+........ .. ++...+. +.. T Consensus 79 ~~~~~~~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~v~--~-~---~~-~~~~~~~~~~~~a~-w~---~e~~~~~-~~~ 146 (377) T protein:vir:98 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFK--N-T---SL-RLKALTAETSGTAV-WG---DIFGEIK-GQL 146 (377) T ss_pred HhccCCCCCccccCHHHHHHHHHHHHHhhhhhhheeeE--e-c---Cc-ceEEEEecCCccee-Ee---ecccccC-ccc Confidence 21 1 24779999999999999998888877532 1 1 13 34666533222111 11 1111111 111 Q ss_pred ccceEEEEEEeeeecceeeccHHHhh-hhhhHHHHHHHHHHHHHHHHHHHHHHH-HHhcccccc-----------ccc-- Q lcl|NC_021299. 75 TEVTVDIKLTDVIYNRIDLTDEEREL-DVRSFAVDVLPRQVRAVAEQIEDAVSY-LITKAPYEK-----------VSL-- 139 (387) Q Consensus 75 ~~~~~~~~id~~~~~~~~~~d~~~~~-~~~~~~~~~~~~~~~~la~~vd~~~~~-~~~~~~~~~-----------~~~-- 139 (387) ...-..+++..++..++..-..++.. +..++..-+.++..++++..+|..++. .-.+.|... ... T Consensus 147 ~~~f~~i~l~~~kl~a~~~is~elL~ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~~ 226 (377) T protein:vir:98 147 KQAFKEQDFSQFKLTAFVVIPKDALKFGPKWIKQFITEQLKEAIAVALELAIVKGDGLLQPVGLLKDLSQPTVDQSTGRD 226 (377) T ss_pred CccceeEeecceeEEeeecccHHhhhccHhHHHHHHHHHHHHHHHHHHhhceEeccCCCcceeeeecccccccccccccc Confidence 12223455555555555444455544 455666666777889999999987763 111111100 000 Q ss_pred -CCcchhHHHHHHH--------------------HHHHhhccCCcCCcE-EEEchHHHHHHhcccchhhhhhccccccee Q lcl|NC_021299. 140 -VDEDEIWNGVVSN--------------------RRWLNEQKVPKDGRV-LLVGSAVEEALLLDDRFIRYDSAGEAGASR 197 (387) Q Consensus 140 -~~~~~~~~~i~~a--------------------~~~l~~~~vp~~~r~-~v~~~~~~~~l~~~~~~~~~~~~g~~~~~~ 197 (387) .+....++.+.++ ...+.+..- .++|+ ++++|..+..+. +.... . T Consensus 227 ~~~~~~~~~~~~~l~~~~~~~~~~~a~~~m~~~t~~~~~klkd-~~G~~i~~~n~~~~~~~~--p~~~~----------~ 293 (377) T protein:vir:98 227 ITTYKTDKEAIADLSDLTPDNAPKKLVPVMKHLSVNDKKRPLK-IAGQVKLILNPEDRWALE--AQFTS----------R 293 (377) T ss_pred cccccchhhhHhhhhhhchhHHHHHHHHHHHHHHHHHHhhhhc-cCCceEEEecccchhhcc--ccccc----------c Confidence 0000001111110 001111111 13443 445665443332 11100 0 Q ss_pred eeeeEEEEeecc--eeeeeeccceeeeeeeccccccccccccccccCceeeeeeeccc--ccceeeeeeeeeeccceeee Q lcl|NC_021299. 198 LQTARIGRLAQY--DVVTVDTLPHGDAYLSHPTAYAMLTRSPGRPMTNTVATSTVATE--NGVQLRWLGDYDATSTTERS 273 (387) Q Consensus 198 ~~~g~ig~~~g~--~v~~s~~~~~~~~~~~~~~a~~~~~~~~~~~~~~t~~~~~~~~~--~~~~~~~~~~~d~~~~~~~~ 273 (387) ...|....+.|+ .+..+..+|....+..-.+.+.+.-+ .+........... ....+....-.+ .. T Consensus 294 ~~~G~~~t~lg~p~~vv~s~~~p~~~i~fgdf~~Y~i~~r-----~~~~i~~~~~~~~~~d~~~f~~~~r~d------g~ 362 (377) T protein:vir:98 294 NQFGEYVTVLPHGITILESLAVETGKAIAFVANRYDAFMA-----TASTIEEYDQTFAMEDLQLYLTKNYFY------GK 362 (377) T ss_pred CCCCccccccCCCceEEecCCCCcccEEEEEecceeEEee-----cceEEEeechhhhhcCceEEEEEEEEc------CE Confidence 112222334443 35556666654332221111211111 1111111100000 001111111111 00 Q ss_pred eeeeeeeeccccceeeeccceeccccccceeeeeeeeccccccccc Q lcl|NC_021299. 274 IVDTWIGVKAVLDPVTANLDDEPRFVRGTRIHLKATDAEIEGETVK 319 (387) Q Consensus 274 ~~~~~~g~~~~~~~~~~~~~~~~~~v~~~~v~~~~~~~~~~~~~~~ 319 (387) ..+.. ...+..++. + T Consensus 363 ~~~~~-------------------a~~vl~i~~------------~ 377 (377) T protein:vir:98 363 AKDNH-------------------TAALLTLAG------------G 377 (377) T ss_pred EeccC-------------------cEEEEEEec------------C Confidence 00000 000000000 0 Done!