Query lcl|Aclame:protein:vir:8187|NCBI_annot:gp7|genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Match_columns 311 No_of_seqs 110 out of 1032 Neff 9.6 Searched_HMMs 1612 Date Sat Nov 30 14:23:26 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_7 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_7_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:8187 Length: 311 # 100.0 5.7E-76 3.5E-79 433.1 32.5 311 1-311 1-311 (311) 2 protein:vir:9574 Length: 300 # 100.0 3.3E-68 2E-71 390.5 31.4 297 1-310 1-300 (300) 3 protein:vir:9759 Length: 303 # 100.0 1.1E-67 6.5E-71 387.8 32.1 300 1-310 1-303 (303) 4 protein:vir:99920 Length: 311 100.0 1.6E-67 9.7E-71 386.8 31.5 307 1-310 1-311 (311) 5 protein:vir:1638 Length: 298 # 100.0 8.5E-67 5.3E-70 382.8 31.6 295 1-309 1-298 (298) 6 protein:vir:94771 Length: 298 100.0 1.3E-66 7.9E-70 381.9 31.6 295 1-309 1-298 (298) 7 protein:vir:80684 Length: 315 100.0 1.7E-64 1E-67 370.3 30.6 297 1-311 1-307 (315) 8 protein:vir:94142 Length: 304 100.0 4E-60 2.5E-63 346.2 30.8 285 1-309 11-304 (304) 9 protein:vir:105905 Length: 304 100.0 4E-60 2.5E-63 346.2 30.8 285 1-309 11-304 (304) 10 protein:vir:41 Length: 299 # N 100.0 1.2E-59 7.2E-63 343.7 30.7 285 1-311 6-299 (299) 11 protein:vir:7771 Length: 330 # 100.0 3.3E-59 2E-62 341.2 31.3 294 1-311 10-324 (330) 12 protein:vir:78523 Length: 338 100.0 5.1E-59 3.2E-62 340.2 30.4 299 1-311 10-336 (338) 13 protein:vir:100247 Length: 425 100.0 3.8E-59 2.3E-62 340.9 27.8 283 1-311 132-425 (425) 14 protein:vir:485 Length: 407 # 100.0 8.4E-59 5.2E-62 339.0 28.2 283 1-311 106-401 (407) 15 protein:vir:2504 Length: 305 # 100.0 1.2E-58 7.2E-62 338.2 28.6 287 1-311 1-299 (305) 16 protein:vir:78223 Length: 333 100.0 2.4E-58 1.5E-61 336.4 30.3 298 1-311 10-333 (333) 17 protein:vir:4226 Length: 326 # 100.0 4.2E-58 2.6E-61 335.1 30.5 289 1-311 20-324 (326) 18 protein:vir:5739 Length: 366 # 100.0 2.3E-58 1.4E-61 336.5 28.9 291 1-310 64-366 (366) 19 protein:vir:4456 Length: 401 # 100.0 1.5E-58 9.3E-62 337.6 27.4 282 1-310 107-401 (401) 20 protein:vir:104085 Length: 320 100.0 8.8E-58 5.5E-61 333.4 30.5 288 1-311 14-318 (320) 21 protein:vir:78830 Length: 324 100.0 1.3E-57 7.9E-61 332.5 30.1 281 1-311 29-316 (324) 22 protein:vir:96392 Length: 324 100.0 1.3E-57 7.9E-61 332.5 30.1 281 1-311 29-316 (324) 23 protein:vir:2430 Length: 318 # 100.0 1.8E-57 1.1E-60 331.6 30.8 285 1-311 14-314 (318) 24 protein:vir:97148 Length: 324 100.0 1.6E-57 1E-60 331.9 30.4 281 1-311 29-316 (324) 25 protein:vir:80376 Length: 435 100.0 9.1E-58 5.6E-61 333.3 29.0 294 1-311 134-434 (435) 26 protein:vir:105038 Length: 428 100.0 1.1E-57 7.1E-61 332.8 28.5 290 1-310 127-428 (428) 27 protein:vir:6242 Length: 390 # 100.0 1E-57 6.3E-61 333.0 27.7 277 1-311 112-390 (390) 28 protein:vir:1433 Length: 435 # 100.0 1.5E-57 9.6E-61 332.0 28.6 294 1-311 134-434 (435) 29 protein:vir:9309 Length: 324 # 100.0 3.3E-57 2E-60 330.3 30.4 281 1-311 29-316 (324) 30 protein:vir:2344 Length: 397 # 100.0 2E-57 1.2E-60 331.4 29.0 284 1-311 10-307 (397) 31 protein:vir:103955 Length: 324 100.0 6.3E-57 3.9E-60 328.7 30.4 281 1-311 29-316 (324) 32 protein:vir:99749 Length: 324 100.0 6.4E-57 4E-60 328.6 30.4 281 1-311 29-316 (324) 33 protein:vir:96223 Length: 324 100.0 7E-57 4.4E-60 328.4 30.4 281 1-311 29-316 (324) 34 protein:vir:8102 Length: 543 # 100.0 7.3E-57 4.5E-60 328.4 28.3 291 1-311 252-543 (543) 35 protein:vir:1328 Length: 392 # 100.0 3.1E-56 1.9E-59 324.9 28.5 279 1-311 111-392 (392) 36 protein:vir:101650 Length: 497 100.0 3E-56 1.9E-59 324.9 27.9 281 1-311 151-494 (497) 37 protein:vir:7855 Length: 497 # 100.0 3E-56 1.9E-59 324.9 27.9 281 1-311 151-494 (497) 38 protein:vir:95763 Length: 297 100.0 1.9E-55 1.2E-58 320.6 30.5 279 1-311 9-297 (297) 39 protein:vir:100135 Length: 418 100.0 1.5E-55 9.2E-59 321.2 29.9 277 1-311 136-416 (418) 40 protein:vir:93616 Length: 645 100.0 1.3E-55 8E-59 321.5 29.0 288 1-311 338-640 (645) 41 protein:vir:4339 Length: 395 # 100.0 2.8E-55 1.7E-58 319.7 29.9 279 1-310 115-395 (395) 42 protein:vir:4511 Length: 409 # 100.0 1.3E-55 8E-59 321.5 26.9 284 1-311 119-407 (409) 43 protein:vir:191 Length: 385 # 100.0 4.8E-55 3E-58 318.4 29.5 277 1-311 105-385 (385) 44 protein:vir:1886 Length: 385 # 100.0 4.8E-55 3E-58 318.4 29.5 277 1-311 105-385 (385) 45 protein:vir:81160 Length: 371 100.0 7.3E-55 4.6E-58 317.4 29.0 274 1-310 93-371 (371) 46 protein:vir:97053 Length: 390 100.0 1E-54 6.4E-58 316.6 28.6 273 1-308 113-390 (390) 47 protein:vir:104256 Length: 458 100.0 2.2E-54 1.4E-57 314.7 28.6 281 1-310 164-458 (458) 48 protein:vir:4700 Length: 415 # 100.0 2.8E-54 1.7E-57 314.2 29.0 279 1-311 123-405 (415) 49 protein:vir:4600 Length: 415 # 100.0 2.8E-54 1.7E-57 314.2 29.0 279 1-311 123-405 (415) 50 protein:vir:102119 Length: 404 100.0 3.4E-54 2.1E-57 313.7 29.3 283 1-311 110-401 (404) 51 protein:vir:4856 Length: 293 # 100.0 3.3E-54 2.1E-57 313.8 29.1 271 1-311 5-282 (293) 52 protein:vir:4997 Length: 397 # 100.0 3E-54 1.8E-57 314.0 28.5 271 1-311 111-386 (397) 53 protein:vir:81070 Length: 390 100.0 4.4E-54 2.8E-57 313.1 29.4 273 1-308 113-390 (390) 54 protein:vir:81227 Length: 413 100.0 3.9E-54 2.4E-57 313.4 29.0 278 1-311 120-411 (413) 55 protein:vir:10364 Length: 390 100.0 8.1E-54 5E-57 311.7 29.7 273 1-308 113-390 (390) 56 protein:vir:4953 Length: 397 # 100.0 4.2E-54 2.6E-57 313.2 28.1 271 1-311 109-386 (397) 57 protein:vir:4830 Length: 397 # 100.0 4.7E-54 2.9E-57 313.0 27.8 271 1-311 111-386 (397) 58 protein:vir:79987 Length: 415 100.0 8.8E-54 5.4E-57 311.5 28.9 279 1-311 123-405 (415) 59 protein:vir:98339 Length: 415 100.0 8.8E-54 5.4E-57 311.5 28.9 279 1-311 123-405 (415) 60 protein:vir:81100 Length: 415 100.0 8.8E-54 5.4E-57 311.5 28.9 279 1-311 123-405 (415) 61 protein:vir:95376 Length: 425 100.0 3.6E-54 2.2E-57 313.6 26.8 276 1-311 140-422 (425) 62 protein:vir:96762 Length: 632 100.0 2.2E-54 1.3E-57 314.8 25.4 268 1-309 357-632 (632) 63 protein:vir:105004 Length: 392 100.0 1E-53 6.5E-57 311.0 27.9 271 1-311 108-385 (392) 64 protein:vir:102082 Length: 392 100.0 1E-53 6.5E-57 311.0 27.9 271 1-311 108-385 (392) 65 protein:vir:102873 Length: 392 100.0 1E-53 6.5E-57 311.0 27.9 271 1-311 108-385 (392) 66 protein:vir:107593 Length: 392 100.0 1E-53 6.5E-57 311.0 27.9 271 1-311 108-385 (392) 67 protein:vir:9410 Length: 415 # 100.0 2.4E-53 1.5E-56 309.1 28.5 279 1-311 123-405 (415) 68 protein:vir:1268 Length: 397 # 100.0 2E-53 1.2E-56 309.5 27.9 267 1-310 123-397 (397) 69 protein:vir:6212 Length: 434 # 100.0 1.7E-53 1E-56 309.9 26.6 280 1-311 144-434 (434) 70 protein:vir:101607 Length: 379 100.0 3E-53 1.9E-56 308.5 27.9 267 1-310 109-379 (379) 71 protein:vir:1025 Length: 408 # 100.0 1E-52 6.3E-56 305.6 28.8 269 1-311 116-394 (408) 72 protein:vir:3845 Length: 395 # 100.0 1E-52 6.3E-56 305.6 28.0 270 1-311 109-384 (395) 73 protein:vir:3991 Length: 404 # 100.0 2.2E-52 1.4E-55 303.8 28.8 271 1-311 118-394 (404) 74 protein:vir:7409 Length: 408 # 100.0 2E-52 1.3E-55 304.0 28.3 269 1-311 118-394 (408) 75 protein:vir:1383 Length: 421 # 100.0 1.5E-52 9.4E-56 304.7 26.4 266 1-311 116-384 (421) 76 protein:vir:4092 Length: 390 # 100.0 6E-52 3.7E-55 301.4 26.7 272 1-311 86-369 (390) 77 protein:vir:3870 Length: 400 # 100.0 3.1E-51 1.9E-54 297.5 25.9 262 1-311 136-400 (400) 78 protein:vir:9704 Length: 394 # 100.0 1.1E-50 6.7E-54 294.5 26.2 259 1-311 130-391 (394) 79 protein:vir:94673 Length: 419 100.0 8.8E-50 5.4E-53 289.5 28.9 281 1-311 123-418 (419) 80 protein:vir:98635 Length: 377 100.0 6.9E-51 4.3E-54 295.6 22.4 278 1-310 79-377 (377) 81 protein:vir:100172 Length: 394 100.0 7.9E-50 4.9E-53 289.8 27.5 266 1-311 113-385 (394) 82 protein:vir:95963 Length: 395 100.0 3E-49 1.9E-52 286.6 26.3 273 1-311 88-377 (395) 83 protein:vir:1084 Length: 437 # 100.0 3.2E-49 2E-52 286.4 24.5 267 1-311 158-428 (437) 84 protein:vir:100884 Length: 389 100.0 1E-48 6.2E-52 283.7 27.1 265 1-311 109-383 (389) 85 protein:vir:78640 Length: 352 100.0 3.3E-49 2E-52 286.4 22.7 262 1-311 83-347 (352) 86 protein:vir:8420 Length: 477 # 100.0 9.3E-49 5.7E-52 283.9 24.9 284 1-311 157-472 (477) 87 protein:vir:100632 Length: 381 100.0 1.4E-48 9E-52 282.9 24.7 270 1-311 78-369 (381) 88 protein:vir:101291 Length: 381 100.0 2.3E-48 1.4E-51 281.8 24.9 270 1-311 76-369 (381) 89 protein:vir:9509 Length: 381 # 100.0 2.3E-48 1.4E-51 281.8 24.9 270 1-311 76-369 (381) 90 protein:vir:2685 Length: 387 # 100.0 1.4E-48 8.7E-52 282.9 21.0 262 1-311 120-382 (387) 91 protein:vir:96978 Length: 387 100.0 1.4E-48 8.7E-52 282.9 21.0 262 1-311 120-382 (387) 92 protein:vir:94424 Length: 387 100.0 1.4E-48 8.7E-52 282.9 21.0 262 1-311 120-382 (387) 93 protein:vir:93881 Length: 387 100.0 3.7E-48 2.3E-51 280.6 22.5 261 1-311 120-382 (387) 94 protein:vir:9361 Length: 402 # 100.0 2.5E-48 1.5E-51 281.6 20.5 261 1-311 135-397 (402) 95 protein:vir:962 Length: 397 # 100.0 1.3E-47 8.3E-51 277.6 23.1 261 1-310 134-397 (397) 96 protein:vir:80128 Length: 466 100.0 2E-47 1.2E-50 276.6 22.8 275 1-311 151-449 (466) 97 protein:vir:9643 Length: 377 # 100.0 3.2E-46 2E-49 270.0 25.6 270 1-310 81-377 (377) 98 protein:vir:78350 Length: 383 100.0 5.8E-46 3.6E-49 268.6 21.8 268 1-311 83-376 (383) 99 protein:vir:4197 Length: 314 # 100.0 5E-42 3.1E-45 247.0 25.4 287 1-311 14-313 (314) 100 protein:vir:4159 Length: 315 # 100.0 6.7E-40 4.2E-43 235.3 23.6 282 1-307 19-315 (315) 101 protein:vir:3158 Length: 321 # 100.0 8.8E-36 5.5E-39 212.8 24.8 287 1-311 18-312 (321) 102 protein:vir:97397 Length: 517 100.0 3.8E-36 2.3E-39 214.8 21.2 273 1-311 241-515 (517) 103 protein:vir:3033 Length: 272 # 100.0 1.2E-34 7.5E-38 206.5 25.0 261 1-311 1-270 (272) 104 protein:vir:9820 Length: 272 # 100.0 1.2E-34 7.5E-38 206.5 25.0 261 1-311 1-270 (272) 105 protein:vir:4074 Length: 480 # 100.0 2.4E-32 1.5E-35 193.9 13.1 259 1-311 212-478 (480) 106 protein:vir:93742 Length: 274 99.9 2.6E-26 1.6E-29 160.8 22.9 260 1-311 1-271 (274) 107 protein:vir:79928 Length: 393 99.9 2.6E-26 1.6E-29 160.8 16.0 299 1-311 73-379 (393) 108 protein:vir:94933 Length: 330 99.9 6.5E-25 4E-28 153.2 22.0 291 1-311 25-330 (330) 109 protein:vir:3613 Length: 272 # 99.9 1.4E-24 8.4E-28 151.4 20.7 264 1-310 1-272 (272) 110 protein:vir:96123 Length: 274 99.9 6.5E-24 4E-27 147.7 22.7 260 1-311 1-271 (274) 111 protein:vir:80930 Length: 278 99.9 6.2E-24 3.8E-27 147.8 22.1 268 1-311 1-278 (278) 112 protein:vir:105334 Length: 276 99.9 8.6E-24 5.3E-27 147.0 21.5 262 1-311 1-271 (276) 113 protein:vir:94494 Length: 274 99.9 2.6E-23 1.6E-26 144.4 23.2 258 1-311 1-271 (274) 114 protein:vir:97433 Length: 274 99.9 2.6E-23 1.6E-26 144.4 23.2 258 1-311 1-271 (274) 115 protein:vir:96833 Length: 275 99.9 1.2E-23 7.3E-27 146.3 21.2 262 1-311 3-272 (275) 116 protein:vir:1239 Length: 274 # 99.8 4.5E-22 2.8E-25 137.6 22.1 259 1-311 1-271 (274) 117 protein:vir:96262 Length: 274 99.8 1.5E-21 9.3E-25 134.8 22.8 259 1-311 1-271 (274) 118 protein:vir:95898 Length: 274 99.8 1.5E-21 9.3E-25 134.8 22.8 259 1-311 1-271 (274) 119 protein:vir:95107 Length: 270 99.8 2.4E-21 1.5E-24 133.6 20.6 262 1-311 1-266 (270) 120 protein:vir:97255 Length: 310 99.8 1.3E-19 7.8E-23 124.2 23.1 291 1-310 1-310 (310) 121 protein:vir:739 Length: 231 # 99.6 4.2E-17 2.6E-20 110.3 17.4 229 33-310 1-231 (231) 122 protein:vir:7990 Length: 273 # 99.6 1.9E-16 1.2E-19 106.8 19.8 263 1-310 1-273 (273) 123 protein:vir:105822 Length: 273 99.6 2.8E-16 1.7E-19 105.8 20.2 263 1-310 1-273 (273) 124 protein:vir:102605 Length: 273 99.6 2.8E-16 1.7E-19 105.8 20.2 263 1-310 1-273 (273) 125 protein:vir:108211 Length: 318 99.6 3.6E-16 2.2E-19 105.3 16.9 282 1-311 7-318 (318) 126 protein:vir:94622 Length: 341 99.5 3E-15 1.9E-18 100.2 17.7 300 1-311 1-340 (341) 127 protein:vir:94576 Length: 347 99.5 4.4E-15 2.7E-18 99.3 17.0 296 1-310 1-347 (347) 128 protein:vir:99424 Length: 360 99.4 3.1E-14 1.9E-17 94.7 20.0 287 1-311 23-358 (360) 129 protein:vir:8885 Length: 347 # 99.4 1.5E-14 9.6E-18 96.3 17.0 298 1-311 1-347 (347) 130 protein:vir:2201 Length: 345 # 99.4 5.2E-14 3.2E-17 93.4 18.4 296 1-310 1-345 (345) 131 protein:vir:10450 Length: 344 99.4 4.2E-14 2.6E-17 93.9 15.5 296 1-310 1-344 (344) 132 protein:vir:5974 Length: 324 # 99.4 3.5E-13 2.2E-16 88.9 20.4 276 1-311 1-292 (324) 133 protein:vir:80180 Length: 381 99.4 4.7E-13 2.9E-16 88.2 21.0 279 1-311 1-306 (381) 134 protein:vir:94711 Length: 347 99.3 4.8E-14 3E-17 93.6 14.8 293 1-311 1-347 (347) 135 protein:vir:3364 Length: 347 # 99.3 1.4E-13 8.7E-17 91.1 17.0 293 1-311 1-346 (347) 136 protein:vir:1541 Length: 347 # 99.3 3.2E-13 2E-16 89.1 18.6 297 1-311 1-346 (347) 137 protein:vir:78739 Length: 332 99.3 1.8E-13 1.1E-16 90.5 16.8 294 1-308 1-332 (332) 138 protein:vir:103323 Length: 364 99.3 2.5E-12 1.5E-15 84.2 21.8 297 1-311 1-340 (364) 139 protein:vir:80213 Length: 334 99.3 3.9E-13 2.4E-16 88.6 17.4 297 1-311 1-333 (334) 140 protein:vir:95318 Length: 328 99.3 1.4E-12 8.7E-16 85.6 20.1 230 1-231 1-328 (328) 141 protein:vir:78935 Length: 335 99.3 2.5E-12 1.5E-15 84.2 19.6 295 1-311 1-330 (335) 142 protein:vir:6324 Length: 335 # 99.2 4E-12 2.5E-15 83.1 20.2 296 1-311 1-330 (335) 143 protein:vir:100057 Length: 375 99.2 2.3E-11 1.4E-14 78.9 20.4 300 1-311 1-371 (375) 144 protein:vir:102655 Length: 322 99.2 1.4E-11 8.5E-15 80.1 18.8 292 1-311 13-322 (322) 145 protein:vir:97031 Length: 402 99.1 1E-11 6.3E-15 80.8 17.2 297 1-311 1-337 (402) 146 protein:vir:107826 Length: 331 99.1 2.2E-11 1.4E-14 79.0 19.0 230 1-231 1-331 (331) 147 protein:vir:107388 Length: 331 99.1 2.2E-11 1.4E-14 79.0 19.0 230 1-231 1-331 (331) 148 protein:vir:98525 Length: 331 99.1 2.2E-11 1.4E-14 79.0 19.0 230 1-231 1-331 (331) 149 protein:vir:1583 Length: 351 # 99.1 3.2E-11 2E-14 78.1 18.6 281 1-311 1-296 (351) 150 protein:vir:8324 Length: 410 # 99.1 4.3E-12 2.7E-15 82.9 13.8 259 1-308 131-410 (410) 151 protein:vir:9927 Length: 295 # 99.1 3.5E-12 2.2E-15 83.4 12.9 267 1-311 1-289 (295) 152 protein:vir:103759 Length: 330 99.1 2.7E-11 1.7E-14 78.5 17.1 230 1-231 1-330 (330) 153 protein:vir:93858 Length: 400 99.1 6.7E-12 4.1E-15 81.8 13.2 274 1-308 121-400 (400) 154 protein:vir:3136 Length: 322 # 99.0 2E-11 1.2E-14 79.2 15.0 293 1-311 1-319 (322) 155 protein:vir:102944 Length: 330 99.0 2.5E-10 1.5E-13 73.3 20.3 278 1-311 1-298 (330) 156 protein:vir:80068 Length: 301 99.0 3.2E-10 2E-13 72.6 19.8 273 1-308 1-301 (301) 157 protein:vir:103285 Length: 296 99.0 2.3E-10 1.4E-13 73.4 18.5 277 1-311 3-296 (296) 158 protein:vir:99675 Length: 324 99.0 5.3E-11 3.3E-14 76.9 14.9 265 32-311 1-297 (324) 159 protein:vir:105645 Length: 400 99.0 1.2E-10 7.6E-14 74.9 16.9 298 1-311 1-334 (400) 160 protein:vir:7324 Length: 335 # 99.0 1.8E-10 1.1E-13 74.0 17.5 231 1-232 1-335 (335) 161 protein:vir:7019 Length: 401 # 98.9 1.8E-10 1.1E-13 74.0 15.0 297 1-311 1-334 (401) 162 protein:vir:104342 Length: 314 98.9 5.1E-10 3.1E-13 71.5 16.3 277 1-311 21-314 (314) 163 protein:vir:8843 Length: 317 # 98.8 3.6E-09 2.2E-12 66.9 20.4 288 1-311 1-316 (317) 164 protein:vir:9875 Length: 296 # 98.8 2.7E-10 1.7E-13 73.0 12.8 271 1-311 1-296 (296) 165 protein:vir:107687 Length: 319 98.8 3.2E-09 2E-12 67.1 18.5 274 1-308 21-319 (319) 166 protein:vir:106647 Length: 303 98.7 1.1E-09 7.1E-13 69.6 12.9 267 1-311 1-298 (303) 167 protein:vir:79642 Length: 329 98.6 2.9E-08 1.8E-11 61.9 18.2 277 1-311 28-329 (329) 168 protein:vir:99075 Length: 392 98.4 2.7E-07 1.7E-10 56.6 19.8 289 1-311 1-317 (392) 169 protein:vir:108303 Length: 418 98.4 5.4E-07 3.4E-10 54.9 20.2 265 1-311 1-283 (418) 170 protein:vir:94070 Length: 339 98.2 1.2E-07 7.5E-11 58.5 13.3 275 1-308 46-339 (339) 171 protein:vir:95512 Length: 693 98.1 2.1E-06 1.3E-09 51.7 17.9 276 1-308 394-693 (693) 172 protein:vir:3643 Length: 336 # 98.0 4.8E-07 3E-10 55.2 12.4 276 1-308 42-336 (336) 173 protein:vir:101557 Length: 336 97.9 8.9E-07 5.5E-10 53.7 12.7 276 1-308 42-336 (336) 174 protein:vir:79548 Length: 652 97.9 1.2E-05 7.7E-09 47.5 18.8 278 1-307 359-652 (652) 175 protein:vir:78558 Length: 336 97.7 2.4E-06 1.5E-09 51.3 12.6 277 1-308 42-336 (336) 176 protein:vir:107732 Length: 379 97.7 5.7E-06 3.6E-09 49.3 14.4 279 1-308 56-379 (379) 177 protein:vir:3525 Length: 423 # 97.7 3E-05 1.8E-08 45.4 18.4 273 1-311 1-324 (423) 178 protein:vir:94800 Length: 319 97.7 3E-05 1.9E-08 45.4 20.4 266 1-311 19-295 (319) 179 protein:vir:97331 Length: 319 97.7 3E-05 1.9E-08 45.4 20.4 266 1-311 19-295 (319) 180 protein:vir:107120 Length: 329 97.7 3E-05 1.9E-08 45.4 20.6 269 1-311 30-306 (329) 181 protein:vir:5255 Length: 304 # 97.5 1.3E-05 8.1E-09 47.4 13.6 278 4-307 1-304 (304) 182 protein:vir:79008 Length: 299 97.5 6.1E-05 3.8E-08 43.7 21.2 281 1-311 1-299 (299) 183 protein:vir:95451 Length: 313 97.4 4.8E-05 3E-08 44.3 16.0 292 1-311 1-312 (313) 184 protein:vir:174 Length: 423 # 97.3 9.5E-05 5.9E-08 42.6 18.7 281 1-311 1-337 (423) 185 protein:vir:105374 Length: 423 97.3 9.9E-05 6.1E-08 42.5 19.5 281 1-311 1-337 (423) 186 protein:vir:103886 Length: 302 97.3 0.0001 6.3E-08 42.5 16.2 273 1-311 1-302 (302) 187 protein:vir:95131 Length: 325 97.3 0.00011 6.9E-08 42.3 18.8 280 2-311 1-295 (325) 188 protein:vir:106734 Length: 336 97.1 2.5E-05 1.6E-08 45.8 11.0 277 1-308 42-336 (336) 189 protein:vir:94989 Length: 349 97.0 0.00019 1.2E-07 40.9 20.1 280 1-311 1-317 (349) 190 protein:vir:96079 Length: 382 97.0 0.00011 7E-08 42.2 13.8 281 1-308 70-382 (382) 191 protein:vir:99576 Length: 388 96.9 6.6E-05 4.1E-08 43.5 12.2 282 1-308 65-388 (388) 192 protein:vir:105522 Length: 423 96.9 0.00025 1.6E-07 40.3 20.3 275 1-311 1-337 (423) 193 protein:vir:78387 Length: 349 96.9 0.00026 1.6E-07 40.2 19.5 280 1-311 1-317 (349) 194 protein:vir:80446 Length: 367 96.8 0.00032 2E-07 39.8 18.1 278 1-311 1-337 (367) 195 protein:vir:1781 Length: 221 # 96.7 0.00012 7.5E-08 42.1 12.3 191 80-311 1-204 (221) 196 protein:vir:95875 Length: 401 96.6 0.00048 3E-07 38.8 16.3 301 1-311 1-401 (401) 197 protein:vir:96792 Length: 315 96.3 0.00078 4.8E-07 37.6 18.0 264 1-311 1-282 (315) 198 protein:vir:99888 Length: 309 96.3 0.00054 3.3E-07 38.5 13.1 286 4-311 1-309 (309) 199 protein:vir:270 Length: 341 # 95.6 0.0018 1.1E-06 35.6 14.5 283 1-311 20-333 (341) 200 protein:vir:100331 Length: 342 95.5 0.0018 1.1E-06 35.6 12.9 283 1-311 16-339 (342) 201 protein:vir:1153 Length: 338 # 94.2 0.005 3.1E-06 33.2 15.3 282 1-311 16-337 (338) 202 protein:vir:6061 Length: 357 # 92.6 0.011 6.7E-06 31.4 14.3 290 1-311 16-351 (357) 203 protein:vir:98566 Length: 355 92.5 0.011 7E-06 31.3 16.4 290 1-311 16-349 (355) 204 protein:vir:1829 Length: 355 # 92.5 0.011 7E-06 31.3 16.3 288 1-311 16-349 (355) 205 protein:vir:5694 Length: 357 # 92.2 0.013 7.8E-06 31.0 14.0 290 1-311 16-351 (357) 206 protein:vir:104011 Length: 337 91.2 0.017 1.1E-05 30.3 17.2 281 1-310 16-337 (337) 207 protein:vir:96666 Length: 462 90.5 0.02 1.3E-05 29.9 16.8 292 1-311 26-340 (462) 208 protein:vir:79171 Length: 337 90.3 0.021 1.3E-05 29.7 17.2 281 1-310 16-337 (337) 209 protein:vir:93966 Length: 400 90.3 0.0032 2E-06 34.3 5.8 269 1-308 121-400 (400) 210 protein:vir:861 Length: 318 # 90.1 0.0054 3.3E-06 33.0 6.9 271 1-308 39-318 (318) 211 protein:vir:5942 Length: 523 # 90.0 0.023 1.4E-05 29.6 12.4 285 1-311 188-522 (523) 212 protein:vir:78186 Length: 337 89.8 0.024 1.5E-05 29.4 15.8 281 1-310 16-337 (337) 213 protein:vir:79157 Length: 339 89.8 0.024 1.5E-05 29.4 15.9 284 1-311 16-339 (339) 214 protein:vir:2016 Length: 357 # 89.3 0.027 1.7E-05 29.2 14.3 288 1-311 16-349 (357) 215 protein:vir:102823 Length: 470 89.2 0.028 1.7E-05 29.1 14.4 289 1-311 18-367 (470) 216 protein:vir:1663 Length: 393 # 88.4 0.0059 3.7E-06 32.8 5.8 269 1-308 114-393 (393) 217 protein:vir:348 Length: 321 # 87.7 0.037 2.3E-05 28.4 19.0 281 9-308 1-321 (321) 218 protein:vir:98856 Length: 343 87.1 0.041 2.6E-05 28.2 15.2 284 1-311 16-341 (343) 219 protein:vir:96442 Length: 418 86.0 0.049 3E-05 27.8 18.2 285 1-311 69-407 (418) 220 protein:vir:99311 Length: 463 85.9 0.05 3.1E-05 27.7 16.2 295 1-311 26-381 (463) 221 protein:vir:95603 Length: 463 85.9 0.05 3.1E-05 27.7 16.2 295 1-311 26-381 (463) 222 protein:vir:78777 Length: 358 85.0 0.056 3.5E-05 27.4 14.7 289 1-311 20-355 (358) 223 protein:vir:78920 Length: 290 84.5 0.06 3.7E-05 27.3 20.8 280 1-310 1-290 (290) 224 protein:vir:103463 Length: 521 82.2 0.079 4.9E-05 26.6 16.9 280 1-311 79-493 (521) 225 protein:vir:106286 Length: 534 76.1 0.14 8.7E-05 25.3 16.6 280 1-311 87-515 (534) 226 protein:vir:3746 Length: 336 # 75.6 0.15 9E-05 25.2 15.7 281 1-311 21-336 (336) 227 protein:vir:80835 Length: 464 74.9 0.15 9.5E-05 25.1 14.3 286 1-311 19-337 (464) 228 protein:vir:6901 Length: 522 # 74.5 0.16 9.8E-05 25.0 14.0 279 1-311 164-508 (522) 229 protein:vir:79078 Length: 307 73.6 0.17 0.0001 24.8 13.6 276 1-310 1-307 (307) 230 protein:vir:79712 Length: 285 72.2 0.19 0.00012 24.6 17.7 266 1-311 1-284 (285) 231 protein:vir:94870 Length: 318 70.4 0.18 0.00011 24.7 7.4 266 1-308 39-318 (318) 232 protein:vir:98143 Length: 524 70.3 0.21 0.00013 24.3 15.5 280 1-311 79-497 (524) 233 protein:vir:3783 Length: 336 # 70.1 0.21 0.00013 24.3 16.0 281 1-311 13-331 (336) 234 protein:vir:100851 Length: 514 60.2 0.38 0.00023 22.9 14.1 289 1-311 45-383 (514) 235 protein:vir:80986 Length: 528 56.0 0.47 0.00029 22.4 16.9 284 1-311 79-502 (528) 236 protein:vir:93696 Length: 364 55.4 0.48 0.0003 22.3 18.3 289 1-311 1-362 (364) 237 protein:vir:107882 Length: 307 55.1 0.49 0.0003 22.3 14.5 278 1-310 1-307 (307) 238 protein:vir:1991 Length: 305 # 52.9 0.54 0.00034 22.0 9.9 231 1-311 1-235 (305) 239 protein:vir:5670 Length: 514 # 52.1 0.56 0.00035 21.9 12.6 279 1-311 148-487 (514) 240 protein:vir:78148 Length: 123 51.2 0.5 0.00031 22.2 6.2 116 179-310 1-123 (123) 241 protein:vir:103370 Length: 418 49.2 0.65 0.0004 21.6 16.0 287 1-311 71-407 (418) 242 protein:vir:63741 Length: 468 46.9 0.72 0.00045 21.4 14.3 289 1-311 23-340 (468) 243 protein:vir:80491 Length: 467 46.7 0.72 0.00045 21.4 14.5 289 1-311 25-339 (467) 244 protein:vir:100603 Length: 529 44.7 0.8 0.00049 21.1 15.8 280 1-311 63-503 (529) 245 protein:vir:2736 Length: 348 # 44.4 0.81 0.0005 21.1 20.7 293 1-311 1-348 (348) 246 protein:vir:7214 Length: 521 # 43.2 0.85 0.00053 21.0 14.1 282 1-311 166-502 (521) 247 protein:vir:105464 Length: 346 43.2 0.86 0.00053 21.0 20.1 281 1-311 1-301 (346) 248 protein:vir:98480 Length: 348 42.1 0.9 0.00056 20.8 19.3 290 1-309 1-348 (348) 249 protein:vir:96490 Length: 348 31.5 1.5 0.00093 19.6 21.0 301 1-311 1-348 (348) 250 protein:vir:101811 Length: 529 30.5 1.6 0.00098 19.5 12.5 279 1-311 171-503 (529) 251 protein:vir:107947 Length: 519 29.7 1.6 0.001 19.4 13.4 280 1-311 161-492 (519) 252 protein:vir:99523 Length: 311 29.1 1.7 0.001 19.3 19.4 292 1-308 1-311 (311) 253 protein:vir:101039 Length: 529 27.6 1.8 0.0011 19.2 12.8 279 1-311 176-503 (529) 254 protein:vir:104915 Length: 470 24.9 2.1 0.0013 18.8 17.0 281 1-311 69-459 (470) 255 protein:vir:106998 Length: 468 23.3 2.3 0.0014 18.6 18.8 283 1-311 63-450 (468) No 1 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=100.00 E-value=5.7e-76 Score=433.09 Aligned_cols=311 Identities=100% Similarity=1.398 Sum_probs=298.0 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEeeeeE Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPRKV 80 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~kl 80 (311) |||+++||++||+++.++|++.++++|+++++|++++++++++++|+.++++.++|++||+++|+++++|+++++++||+ T Consensus 1 mat~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f~~v~l~~~kl 80 (311) T protein:vir:81 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPRKV 80 (311) T ss_pred CceecCCceEcchhHHHHHHHHHHhcchhhhhcceeecCCCceEEEEEeCCceeEEeecCcccccccceeeEEEEeeEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchHHH Q lcl|Aclame:pro 81 QVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPDLA 160 (311) Q Consensus 81 ~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (311) ++++++|+||++++.++..+++++|+++++++|++++|+++|+|+++++++.+.++++.+.++++....+..+....+.+ T Consensus 81 ~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~~~~~~~~~~~~~ 160 (311) T protein:vir:81 81 QVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPDLA 160 (311) T ss_pred EEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccccceeeeecccccchHHHH Confidence 99999999999999999999999999999999999999999999998899999999998888888888777777778888 Q ss_pred HHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccccccccccccccc Q lcl|Aclame:pro 161 VEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTT 240 (311) Q Consensus 161 i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~ 240 (311) +.+++.++..+++++++|+|||.++.+|++|||++|+|+|++...++.+++|+|+||++++++|.++....+........ T Consensus 161 i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~G~Pv~~~~~i~~~~~~~~~~~~~~~~~ 240 (311) T protein:vir:81 161 VEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTT 240 (311) T ss_pred HHHHHHHhhhcCCCceEEEEcHHHHHHHHhhhccCCCeeecCccccCCCceecceeEEecccccccccccccccchhccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999888888877777 Q ss_pred cccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 241 NPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 241 ~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ..+..+++|||+++.++.+++++++++++.+.++++++|++|++++|++.|+|++++||+||++|+.++.| T Consensus 241 ~~~~~~~~gDfs~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~a~~~ 311 (311) T protein:vir:81 241 NPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) T ss_pred CCccEEEEEecccEEEEEeccceEEEeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 88889999999999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=100.00 E-value=3.3e-68 Score=390.54 Aligned_cols=297 Identities=33% Similarity=0.511 Sum_probs=262.1 Q ss_pred Cc-ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEeeee Q lcl|Aclame:pro 1 MV-ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPRK 79 (311) Q Consensus 1 ma-t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k 79 (311) || +++++|++||++++.+||+.+++.|+++++|++++++++.+++|+.++++.++|++|++++|+++++|+++++++|| T Consensus 1 ma~~t~~~G~lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~s~~~f~~v~l~~~k 80 (300) T protein:vir:95 1 MSEAQLSKGNLFNPELVTKVINKVKGHSSIAKLSPQKPIPFNGQREFVFDFDSDIDIVAENGKKTHGGVSLDPVTIVPLK 80 (300) T ss_pred CcccccCCcceechhhHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEecCcceEEeeCCcccccccccceeeEeeeEE Confidence 99 56678899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchHH Q lcl|Aclame:pro 80 VQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPDL 159 (311) Q Consensus 80 l~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (311) ++++++||+||+++++++.++++++|.+++++++++++|+++|+|++++.|++..........+. .......+....++ T Consensus 81 ~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 159 (300) T protein:vir:95 81 VEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKK-VTQTVPFKDTNPDE 159 (300) T ss_pred EEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccc-cceeecccccchHH Confidence 99999999999999989999999999999999999999999999987766666544333222222 22233344557788 Q ss_pred HHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccccccccccccccccccc Q lcl|Aclame:pro 160 AVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRT 239 (311) Q Consensus 160 ~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~ 239 (311) ++.+++..+...++++++|+|||+++.+|+++||++|+|+|++...++.+++|+|+||++++.+|... T Consensus 160 ~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~Pv~~s~~v~~~~------------ 227 (300) T protein:vir:95 160 SMEDAVGMIDGSERDITGAILDPIFTTALSKMKNAEGGKLYPELAWGGVPDAINGLAVDKNRTVSYSQ------------ 227 (300) T ss_pred HHHHHHHHhhhcCCCccEEEECHHHHHHHHHhhccCCCeeccCccccCCCceecceeeEEecCCCCCC------------ Confidence 99999999999999999999999999999999999999999999988899999999999999997542 Q ss_pred ccccceEEEeecceE-EEEeecCceEEEeccCCcccc-hhhhhcCcEEEEEEEEeccEEecccceEEEEeccc Q lcl|Aclame:pro 240 TNPNVKAIAGDFSAF-RWGVQVSIPLELIEFGDPDGL-GDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADE 310 (311) Q Consensus 240 ~~~~~~~~~gd~~~~-~~~~~~~~~i~~~~~~~~~~~-~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~ 310 (311) ......+++|||+.+ .++.|++++++++++.+.++. +++|++|++.+|+++|+|++++||+||++||+++. T Consensus 228 ~~~~~~~~~GDf~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~g 300 (300) T protein:vir:95 228 TDPKNTAIVGDFETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYIGWGIMDAASFARIVKTGG 300 (300) T ss_pred CCCccEEEEeeccceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEeecceeecccceEEEecCCC Confidence 233446889999874 489999999999998876644 57899999999999999999999999999999999 No 3 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=100.00 E-value=1.1e-67 Score=387.78 Aligned_cols=300 Identities=32% Similarity=0.489 Sum_probs=264.1 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEeeeeE Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPRKV 80 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~kl 80 (311) |+|.++||++||++++++||+.+++.|+++++|++++++++.+++|+.++++.++|++|++++|+++++|+++++++||+ T Consensus 1 m~t~t~gg~liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~E~~~~~~s~~~f~~v~l~~~kl 80 (303) T protein:vir:97 1 MGTETSKASLFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNGSKEFTFTLDSDIDVVAENGKKTHGGLSLEPVTIVPIKV 80 (303) T ss_pred CcccCCCCeEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEecCcceEEeecCccccccccceeeEEeeeEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchHHH Q lcl|Aclame:pro 81 QVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPDLA 160 (311) Q Consensus 81 ~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (311) ++++++|+||+++++++.++++++|++++++++++++|+++|+|+++.++.+....+...............+....+++ T Consensus 81 ~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (303) T protein:vir:97 81 EYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNHFDSKVTQVVKFTESEDADAN 160 (303) T ss_pred EEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCccccccccccccccccccccccccccchHHH Confidence 99999999999999899999999999999999999999999999877666665555544443333333333344567899 Q ss_pred HHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccc-cccCCCceecceeEEeeccccccccccccccccccc Q lcl|Aclame:pro 161 VEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPEL-GFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRT 239 (311) Q Consensus 161 i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~-~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~ 239 (311) +.+++.++...++.+++|+|||+++..|+++||++|+|+|.+. ..+..+++|+|+||++++.+|.... . T Consensus 161 i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~g~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~----------~ 230 (303) T protein:vir:97 161 IEAAVNLIQGAEGVVTGLAMDTEFSTALAKVTNGEMGPKMYPELAWGANPDSINGLKSSVNTTVGAGAD----------E 230 (303) T ss_pred HHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCCeEEecCccCCCCCceecceeeEEecccCCccc----------c Confidence 9999999998889999999999999999999999999999765 4456678999999999999986532 2 Q ss_pred ccccceEEEeecc-eEEEEeecCceEEEeccCCcccc-hhhhhcCcEEEEEEEEeccEEecccceEEEEeccc Q lcl|Aclame:pro 240 TNPNVKAIAGDFS-AFRWGVQVSIPLELIEFGDPDGL-GDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADE 310 (311) Q Consensus 240 ~~~~~~~~~gd~~-~~~~~~~~~~~i~~~~~~~~~~~-~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~ 310 (311) ..+...+++|||+ .+.++.+++++++++++.+.|+. +++|++|++++|+++|+|++++||+||++||++.= T Consensus 231 ~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~~~~ 303 (303) T protein:vir:97 231 AESKDLVIIGDFESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTKGEV 303 (303) T ss_pred CCCccEEEEeeccccEEEEEecCcEEEEeeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEeeCCCC Confidence 2344578999995 67899999999999998876654 68999999999999999999999999999998887 No 4 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=100.00 E-value=1.6e-67 Score=386.83 Aligned_cols=307 Identities=46% Similarity=0.745 Sum_probs=273.1 Q ss_pred Cccc-CCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEeeee Q lcl|Aclame:pro 1 MVAL-ATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPRK 79 (311) Q Consensus 1 mat~-~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k 79 (311) |||. ++||++||++++++|++.++++++++++|++++++++..++|+.++++.++|++|++++|+++++|+++++++|| T Consensus 1 Mat~tt~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f~~v~l~~~k 80 (311) T protein:vir:99 1 MATFGTGNLKNLPRNIADGMVKDVVQGSTVAVLSARKPQRFGNEDIITFNGRPKAEFVGEGQQKSSTTGEFDFVTSTPKK 80 (311) T ss_pred CceecCCCceeccHHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCceeEEeecCcccccccceeeEEEEeeEE Confidence 9965 456899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchHH Q lcl|Aclame:pro 80 VQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPDL 159 (311) Q Consensus 80 l~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (311) ++++++||+||+++++|+.++++++|+++++++|++++|+++|+|++++.++++.+....+...++.++.+..+....+. T Consensus 81 ~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 160 (311) T protein:vir:99 81 AQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASKRVELTADTIANPDL 160 (311) T ss_pred EEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccceeeccccccchhHH Confidence 99999999999999989999999999999999999999999999998877877777666555666666666666666778 Q ss_pred HHHHHHHHHhhcC--CCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccccccccccccccccc Q lcl|Aclame:pro 160 AVEAAVGLVLGDN--LSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVY 237 (311) Q Consensus 160 ~i~~~~~~~~~~~--~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~ 237 (311) ++.+++.++...+ +.+++|+|||.++..|+++||++|||+|++...+..+++|+|+||++++.+|+.+....+.... T Consensus 161 ~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~Pv~~s~~i~~~~~~~~~~~~~- 239 (311) T protein:vir:99 161 AIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTARYTDGRKKFPELGLGIGVSSFEGIDASVSDTVNGGDEADPDDEDL- 239 (311) T ss_pred HHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhhccCCCeeecCcccCCCCceecceeeEeecccccccccccccchh- Confidence 8888888877654 4566799999999999999999999999999999999999999999999999888777665432 Q ss_pred ccccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccc Q lcl|Aclame:pro 238 RTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADE 310 (311) Q Consensus 238 ~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~ 310 (311) ...+...+++|||++ +.++++++++++++++.+.++++++|++|++++|+++|+|++++|| +|++++++++ T Consensus 240 -~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~r~~~r~d~~v~~~-~~v~~~~~~A 311 (311) T protein:vir:99 240 -DAARAVRGIVGDFANGIHWGVQRDIPVELIKYGDPDGQGDLKRHNQIALRLEIVYGWYVFTD-RFVVIENAVA 311 (311) T ss_pred -hccCcceEEEeeccccEEEEEecCceEEEeecCCCCcchhhhhcCcEEEEEEEeecceecCh-hHeeeecccC Confidence 345666889999987 5699999999999999989999999999999999999999999996 5667777776 No 5 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=100.00 E-value=8.5e-67 Score=382.81 Aligned_cols=295 Identities=34% Similarity=0.514 Sum_probs=258.2 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEeeeeE Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPRKV 80 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~kl 80 (311) || .+||++||++++++|++.++++++++++|++++++++.+++|+.++++.++|++|++++|+++++|+++++++||+ T Consensus 1 ma--~~gG~lvp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~f~~v~l~~~k~ 78 (298) T protein:vir:16 1 MV--LNKGTLFDPTLVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKV 78 (298) T ss_pred Cc--ccCcceechhHHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEecCcceEEecCCccccccccceeEEEEeeeeE Confidence 66 3568899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCcccccccccccccccc-ceeeccccccchHH Q lcl|Aclame:pro 81 QVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTN-IVELTTGTSATPDL 159 (311) Q Consensus 81 ~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 159 (311) ++++++|+||+++++++.++++++|++++++++++++|+++|+|+++++|++...+......... ............++ T Consensus 79 a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (298) T protein:vir:16 79 EYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNG 158 (298) T ss_pred EEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcccccccccccccccccccccccccccHHH Confidence 99999999999999999999999999999999999999999999887777665544332222211 11122233445678 Q ss_pred HHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccccccccccccccccccc Q lcl|Aclame:pro 160 AVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRT 239 (311) Q Consensus 160 ~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~ 239 (311) ++.+++.++..+++++++|+|||+++..|+++||++|||+|++.+..+.+++|+|+||++++.+|... T Consensus 159 ~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~v~~~~------------ 226 (298) T protein:vir:16 159 AIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQDNALFPELKWGATPDTINGLPVDVNKTVSDMS------------ 226 (298) T ss_pred HHHHHHHHhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCCceecceeeEEeccccccc------------ Confidence 89999999999999999999999999999999999999999999999999999999999999998431 Q ss_pred ccccceEEEeecceE-EEEeecCceEEEeccCCccc-chhhhhcCcEEEEEEEEeccEEecccceEEEEecc Q lcl|Aclame:pro 240 TNPNVKAIAGDFSAF-RWGVQVSIPLELIEFGDPDG-LGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDAD 309 (311) Q Consensus 240 ~~~~~~~~~gd~~~~-~~~~~~~~~i~~~~~~~~~~-~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa 309 (311) ..++..+++|||+++ .++.+++++++++++.+.++ .+++|++|++++|++.|+|++++||+||++||.++ T Consensus 227 ~~~~~~~~~GDfs~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 227 LTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred CCCccEEEEeeccceEEEEEecCceEEEeeccCCcCcchhhhhcCcEEEEEEEEEccEeecccceEEEeecC Confidence 234558999999874 58999999999999877665 46889999999999999999999999999999999 No 6 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=100.00 E-value=1.3e-66 Score=381.86 Aligned_cols=295 Identities=34% Similarity=0.516 Sum_probs=259.3 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEeeeeE Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPRKV 80 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~kl 80 (311) |++ +||++||+++.++|++.++++++++++|++++++++++++|+.+++++++|++|++++|+++++|++++++++|+ T Consensus 1 ma~--~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~ 78 (298) T protein:vir:94 1 MVL--NKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKV 78 (298) T ss_pred Cee--ccccccChhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceEEeeCCccccccccceeEEEEeeeEE Confidence 655 568899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccc-cccccceeeccccccchHH Q lcl|Aclame:pro 81 QVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKI-LDTTNIVELTTGTSATPDL 159 (311) Q Consensus 81 ~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~ 159 (311) ++++++|+||++++.++..+++++|+++++++|++++|.++|+|+++++|+...++.... ...+.............++ T Consensus 79 ~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (298) T protein:vir:94 79 EYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNG 158 (298) T ss_pred EEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccccccccccccccccccccccccccHHH Confidence 999999999999999999999999999999999999999999998776666655443222 2222212222333445688 Q ss_pred HHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccccccccccccccccccc Q lcl|Aclame:pro 160 AVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRT 239 (311) Q Consensus 160 ~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~ 239 (311) ++.+++.++..++.++++|+|||+++.+|+++||++|+|+|++...++.+++|+|+||++++.+|.+. T Consensus 159 ~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~G~PV~~~~~v~~~~------------ 226 (298) T protein:vir:94 159 AIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMS------------ 226 (298) T ss_pred HHHHHHHhhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCCceecceeeEEeccccccc------------ Confidence 99999999999999999999999999999999999999999999999999999999999999998542 Q ss_pred ccccceEEEeecceE-EEEeecCceEEEeccCCcccc-hhhhhcCcEEEEEEEEeccEEecccceEEEEecc Q lcl|Aclame:pro 240 TNPNVKAIAGDFSAF-RWGVQVSIPLELIEFGDPDGL-GDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDAD 309 (311) Q Consensus 240 ~~~~~~~~~gd~~~~-~~~~~~~~~i~~~~~~~~~~~-~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa 309 (311) ..+...+++|||+.. .++.+++++++++++.+.++. +++|++|++++|++.|+|++++||+||++||.++ T Consensus 227 ~~~~~~~~~Gdfs~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 227 LTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred CCCccEEEEeeccceEEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 234557899999874 589999999999999877654 5789999999999999999999999999999999 No 7 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=100.00 E-value=1.7e-64 Score=370.26 Aligned_cols=297 Identities=40% Similarity=0.647 Sum_probs=253.3 Q ss_pred Cc--ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEeee Q lcl|Aclame:pro 1 MV--ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPR 78 (311) Q Consensus 1 ma--t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~ 78 (311) |+ ++++||++||++++++||+.+++.|+++++|++++++++.+++|+.++++.++|++|++++|+++++|+++++++| T Consensus 1 Ma~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~f~~v~l~~~ 80 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQPI 80 (315) T ss_pred CCCCcCCcCceEcchHHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeCCccccccccceeeeEeeee Confidence 88 4567999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEEeecHHHhhcCchhhH-HHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccch Q lcl|Aclame:pro 79 KVQVTQRFSQEVKWADESRQL-GVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATP 157 (311) Q Consensus 79 kl~~~i~iS~ell~~s~~~~~-~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (311) |++++++||+||++++.++.. .++++|.++++++|++++|.++|+|+++.++.++.++...+..+++.+.. .... T Consensus 81 kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~ 156 (315) T protein:vir:80 81 KVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSLNKTKNIVDA----TDSA 156 (315) T ss_pred eEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCccccccccccccccceeec----cccc Confidence 999999999999998876544 47899999999999999999999999887777777766655444443332 2234 Q ss_pred HHHHHHHHHHHhhcCCC-ccEEEEcHHHHHHHHHhhccCCc-----eeeccccccCCCceecceeEEeeccccccccccc Q lcl|Aclame:pro 158 DLAVEAAVGLVLGDNLS-PDGVALDNTFSFMLATQRDSQGR-----KLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVT 231 (311) Q Consensus 158 ~~~i~~~~~~~~~~~~~-~~~~v~n~~~~~~l~~lkd~~g~-----~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~ 231 (311) ++++.+++.++...++. .++|+|||.++..|+++||.+|+ |+|++ ...+.+++|+|+||+++++||.+.... T Consensus 157 ~~d~~~~~~~~~~~~~~~~~~~imn~~~~~~L~~l~~~~g~~~~g~~~~~~-~~~g~~~tl~G~PV~~~~~~~~~~~~~- 234 (315) T protein:vir:80 157 TADLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPA-AGFAGLDNWRGLNVGASSTVSGAPEMS- 234 (315) T ss_pred hHHHHHHHHHHhhccCccceEEEEcHHHHHHHHHHhhccCCcccccccccc-cccCCCceecceeeEecCcCCcccccc- Confidence 67888888888666554 46799999999999999877665 55643 334456899999999999998764332 Q ss_pred ccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccc-hhhhhcCcEEEEEEEEeccEEecccceEEEEeccc Q lcl|Aclame:pro 232 ASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGL-GDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADE 310 (311) Q Consensus 232 ~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~-~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~ 310 (311) ......+++|||+++.++.+++++++++++.+.++. +++|++|+++||+++|+|++++||+||++|+.+++ T Consensus 235 --------~~~~~~~~~GDfs~~~~g~~~~~~i~i~~~~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~a 306 (315) T protein:vir:80 235 --------PASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAA 306 (315) T ss_pred --------cccccEEEEeecccEEEEEecCeeEEEeccccccCcccchhhcCcEEEEEEEEecceeecccceEEEeeccC Confidence 234557899999999999999999999998876644 57899999999999999999999999999999888 Q ss_pred C Q lcl|Aclame:pro 311 S 311 (311) Q Consensus 311 ~ 311 (311) . T Consensus 307 ~ 307 (315) T protein:vir:80 307 P 307 (315) T ss_pred C Confidence 7 No 8 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=100.00 E-value=4e-60 Score=346.23 Aligned_cols=285 Identities=16% Similarity=0.192 Sum_probs=246.0 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEeeeeE Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPRKV 80 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~kl 80 (311) ..+++.||++||+++.++|++.+++.++++++|++++++++.+++|+.++++.+.|++|++++|+++++|++++++++|+ T Consensus 11 ~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~i~~~~~k~ 90 (304) T protein:vir:94 11 VILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAKGVGAYWVSETERIQTSKPEYAQAEMEAKKI 90 (304) T ss_pred ccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEeecCcccccccceeeEEEEEEEEE Confidence 23456788999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchHHH Q lcl|Aclame:pro 81 QVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPDLA 160 (311) Q Consensus 81 ~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (311) +++++||+|+++++ .++++++|++++++++++++|+++|+|++... +....+.++.................+++ T Consensus 91 ~~~~~iS~ell~ds---~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 165 (304) T protein:vir:94 91 GVIIPLSKEFLKWT---AKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPY--NTSTSGKPLVEGAEEKGNVVTDTNNLYVD 165 (304) T ss_pred EEeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHhhheeccCCCc--ccccccccccccccccccccccccchHHH Confidence 99999999999654 57899999999999999999999999976433 23333444444444444444455567899 Q ss_pred HHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccccccccccccccc Q lcl|Aclame:pro 161 VEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTT 240 (311) Q Consensus 161 i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~ 240 (311) +.+++.++...+..+++|+|||+++..|+++||++|+|+|.+ .+++|+|+||++++.+|.. T Consensus 166 i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~-----~~~~l~G~PV~~~~~~~~~-------------- 226 (304) T protein:vir:94 166 LSALMATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDA-----NGNEIMGLPLSYTGADVYD-------------- 226 (304) T ss_pred HHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecC-----CCccccceeeEEecccccC-------------- Confidence 999999999988999999999999999999999999999965 3478999999999998743 Q ss_pred cccceEEEeecceEEEEeecCceEEEeccCC--------cc-cchhhhhcCcEEEEEEEEeccEEecccceEEEEecc Q lcl|Aclame:pro 241 NPNVKAIAGDFSAFRWGVQVSIPLELIEFGD--------PD-GLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDAD 309 (311) Q Consensus 241 ~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~--------~~-~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa 309 (311) .++..+++|||+++.++++++++++++++.. .+ +.+++|++|++++|+++|+|++++||+||++||.+. T Consensus 227 ~~~~~~~~gd~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 227 KKKSLALMGDWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred CCCcEEEEEehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 2344789999999999999999999877642 22 356789999999999999999999999999999999 No 9 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=100.00 E-value=4e-60 Score=346.23 Aligned_cols=285 Identities=16% Similarity=0.192 Sum_probs=246.0 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEeeeeE Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPRKV 80 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~kl 80 (311) ..+++.||++||+++.++|++.+++.++++++|++++++++.+++|+.++++.+.|++|++++|+++++|++++++++|+ T Consensus 11 ~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~i~~~~~k~ 90 (304) T protein:vir:10 11 VILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAKGVGAYWVSETERIQTSKPEYAQAEMEAKKI 90 (304) T ss_pred ccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEeecCcccccccceeeEEEEEEEEE Confidence 23456788999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchHHH Q lcl|Aclame:pro 81 QVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPDLA 160 (311) Q Consensus 81 ~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (311) +++++||+|+++++ .++++++|++++++++++++|+++|+|++... +....+.++.................+++ T Consensus 91 ~~~~~iS~ell~ds---~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 165 (304) T protein:vir:10 91 GVIIPLSKEFLKWT---AKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPY--NTSTSGKPLVEGAEEKGNVVTDTNNLYVD 165 (304) T ss_pred EEeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHhhheeccCCCc--ccccccccccccccccccccccccchHHH Confidence 99999999999654 57899999999999999999999999976433 23333444444444444444455567899 Q ss_pred HHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccccccccccccccc Q lcl|Aclame:pro 161 VEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTT 240 (311) Q Consensus 161 i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~ 240 (311) +.+++.++...+..+++|+|||+++..|+++||++|+|+|.+ .+++|+|+||++++.+|.. T Consensus 166 i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~-----~~~~l~G~PV~~~~~~~~~-------------- 226 (304) T protein:vir:10 166 LSALMATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDA-----NGNEIMGLPLSYTGADVYD-------------- 226 (304) T ss_pred HHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecC-----CCccccceeeEEecccccC-------------- Confidence 999999999988999999999999999999999999999965 3478999999999998743 Q ss_pred cccceEEEeecceEEEEeecCceEEEeccCC--------cc-cchhhhhcCcEEEEEEEEeccEEecccceEEEEecc Q lcl|Aclame:pro 241 NPNVKAIAGDFSAFRWGVQVSIPLELIEFGD--------PD-GLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDAD 309 (311) Q Consensus 241 ~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~--------~~-~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa 309 (311) .++..+++|||+++.++++++++++++++.. .+ +.+++|++|++++|+++|+|++++||+||++||.+. T Consensus 227 ~~~~~~~~gd~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 227 KKKSLALMGDWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred CCCcEEEEEehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 2344789999999999999999999877642 22 356789999999999999999999999999999999 No 10 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=100.00 E-value=1.2e-59 Score=343.68 Aligned_cols=285 Identities=16% Similarity=0.201 Sum_probs=244.1 Q ss_pred Cc--ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEeee Q lcl|Aclame:pro 1 MV--ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPR 78 (311) Q Consensus 1 ma--t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~ 78 (311) |. ++++|+.+||++++++|++.+++.++++++|++++++++..++|+.+ ++.+.|++|++++|+++++|+++++.++ T Consensus 6 ~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~~~f~~v~l~~~ 84 (299) T protein:vir:41 6 DTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKAVPMTKPEEEFTFMS-GVGAFWVDEAERIQTSKPTFTKAKMRSK 84 (299) T ss_pred CcccccCCCceecchhHHHHHHHHHHhcchhhhhceeeecCCCcEEEEEEc-CCceeeeecCccccccccceeEEEEeeE Confidence 33 55678899999999999999999999999999999999989999876 5789999999999999999999999999 Q ss_pred eEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchH Q lcl|Aclame:pro 79 KVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPD 158 (311) Q Consensus 79 kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (311) |++++++||+|+++++ ..+++++|.+++++++++++|+++|+|++.+. +.++ .................+ T Consensus 85 k~~~~~~is~ell~ds---~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~---~~gi----l~~~~~~~~~~~~~~~~~ 154 (299) T protein:vir:41 85 KMGVIIPTTKENLNYS---VTNFFSLMQAEIVEAFYKKFDQAVFTGVESPY---NWNI----LKSATDASNLVEETANKY 154 (299) T ss_pred EEEEeehhhHHHHhcC---HHHHHHHHHHHHHHHHHHHHHHHHhhcccCcc---cccc----cccccccceeeccccccH Confidence 9999999999999644 57899999999999999999999999975332 2233 322222222223344668 Q ss_pred HHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccccccccccccc Q lcl|Aclame:pro 159 LAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYR 238 (311) Q Consensus 159 ~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~ 238 (311) +++.+++.++...++.+++|+|||+++.+|+++||++|+|+|.+....+ .++|+|+||++++.+|.+ T Consensus 155 ~~l~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~-~~~l~G~PV~~~~~~~~~------------ 221 (299) T protein:vir:41 155 DDLNEAIGLIEAEDLEPNGIATIRKQRVKYRSTKDGNGMPIFNTATSNG-VDDVLGLPIAYTPKYTFG------------ 221 (299) T ss_pred HHHHHHHHhhhcccCCcCEEEEcHHHHHHHHHhhccCCceeecCCcCCC-CceecceeeEEecccCCC------------ Confidence 8999999999999999999999999999999999999999998877654 468999999999999843 Q ss_pred cccccceEEEeecceEEEEeecCceEEEeccCC------c-ccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 239 TTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGD------P-DGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 239 ~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~------~-~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .+...+++|||+.+.++.+++++++++++.. . ...+++|++|++++|+++|+|++++||+||++|+.+++. T Consensus 222 --~~~~~~~~gdfs~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~~l~~~aa~ 299 (299) T protein:vir:41 222 --DKDISELVGDWNQAYYGILRGVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFEVGFMVVKDEAFSAVQPKAGN 299 (299) T ss_pred --CCceEEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccCC Confidence 2445799999999999999999999987653 2 244678999999999999999999999999999999999 No 11 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=100.00 E-value=3.3e-59 Score=341.23 Aligned_cols=294 Identities=23% Similarity=0.294 Sum_probs=245.5 Q ss_pred Cc-ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEeeee Q lcl|Aclame:pro 1 MV-ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPRK 79 (311) Q Consensus 1 ma-t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k 79 (311) ++ ++.++|.++|+++.++|++.+++.++++++++++++.++.+++|+.++++.+.|++|++++|+++++|++++++++| T Consensus 10 ~~~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k 89 (330) T protein:vir:77 10 QVALTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARKVPMGPTGISIPHWTGAVSASWTGEAERKPITKGSFGKQELEPVK 89 (330) T ss_pred hccccCCCcceechhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcCCcceeEecCCCccccccceeeEEEEeEEE Confidence 33 23456667888899999999999999999999999998899999999999999999999999999999999999999 Q ss_pred EEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccce----eecccccc Q lcl|Aclame:pro 80 VQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIV----ELTTGTSA 155 (311) Q Consensus 80 l~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~----~~~~~~~~ 155 (311) ++++++||+||++++ .++++++|.+++++++++++|+++|+|++ .+.++.++.+......... ...+.... T Consensus 90 ~~~~~~is~ell~ds---~~~~~~~i~~~l~~ai~~~~~~~~l~G~g--~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 164 (330) T protein:vir:77 90 ITTIFAESAEVVRLN---PLNYLNTMRTKIAEAIALKFDAAAIHGID--KPSAFKGYLAETTKVVSLADTNLTTASGPQG 164 (330) T ss_pred EEEeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHhhcccC--CCCccccccccccccceeecccccccccccc Confidence 999999999999654 57899999999999999999999999975 3444445443322221111 11223344 Q ss_pred chHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccC-----CCceecceeEEeecccccccccc Q lcl|Aclame:pro 156 TPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGT-----DVASFAGLNAAVSDTVRGGPEAV 230 (311) Q Consensus 156 ~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~-----~~~~l~G~pv~~~~~~~~~~~~~ 230 (311) ..++++..++..+...+.++++|+|||+++..|+++||++|||+|++....+ ..++|+|+||++++.+|.+ T Consensus 165 ~~~~~l~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~PV~~~~~~p~~---- 240 (330) T protein:vir:77 165 NAYLAVNNALSLLVNSGKKWTGTLLDNVTEPILNTAVDGNGRPLFVESTYTEQVGAIREGRILGRPTYVADNVVNG---- 240 (330) T ss_pred hhHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHHhccCCceeecCccccccccccCCceecceeeEEeccccCC---- Confidence 5678888899999888888899999999999999999999999998765544 4568999999999999843 Q ss_pred cccccccccccccceEEEeecceEEEEeecCceEEEeccCC-----------cccchhhhhcCcEEEEEEEEeccEEecc Q lcl|Aclame:pro 231 TASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGD-----------PDGLGDLKRQNQIAIRAEVVYGIGIMST 299 (311) Q Consensus 231 ~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~-----------~~~~~~~f~~~~v~~ra~~r~~~~v~~~ 299 (311) ...++..+++|||+.+.++.+++++++++++.. ....+++|++|++++|++.|+|++++|| T Consensus 241 --------~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~ 312 (330) T protein:vir:77 241 --------TVGNRVVGVMGDFSQVIWGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLWQHNMVAVRCEAEFAFMVNDK 312 (330) T ss_pred --------CCCCccEEEEEecceEEEEEecCcEEEEeecceeeecccccccccccccchhhcCcEEEEEEEEeccEEecc Confidence 224556899999999999999999999887643 1234678999999999999999999999 Q ss_pred cceEEEEecccC Q lcl|Aclame:pro 300 DAFAVVRDADES 311 (311) Q Consensus 300 ~a~~~l~~aa~~ 311 (311) +||++|+.++.. T Consensus 313 ~a~~~i~~~~~~ 324 (330) T protein:vir:77 313 DAFVKLTDQVAG 324 (330) T ss_pred cceEEEEeccCC Confidence 999999988866 No 12 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=100.00 E-value=5.1e-59 Score=340.15 Aligned_cols=299 Identities=25% Similarity=0.339 Sum_probs=244.6 Q ss_pred Ccc--------cCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCce--------eEEeecCcccc Q lcl|Aclame:pro 1 MVA--------LATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPR--------GEVVGEGAQKS 64 (311) Q Consensus 1 mat--------~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~--------a~~v~Eg~~~~ 64 (311) |.+ +++++.+||++++++|++.+++.++|+++|++++++++.+++|+.+..+. +.|++|+++++ T Consensus 10 ~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~~~~~~~~~Eg~~~~ 89 (338) T protein:vir:78 10 NTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENIPISYGETIIPTTVKRPEVGQVGVGTSNEQREGGTKP 89 (338) T ss_pred hhcccccccceecccccccchHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCccceeeccccccccccccccc Confidence 222 12334489999999999999999999999999999999999999876544 55667999999 Q ss_pred ccccceeEEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccc Q lcl|Aclame:pro 65 ESTATFAPVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTT 144 (311) Q Consensus 65 ~~~~~~~~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~ 144 (311) +++++|++++++++|++++++||+||++++ .++++++|++++++++++++|+++|+|+++..++++.++.+...... T Consensus 90 ~~~~~f~~v~l~~~k~~~~~~is~ell~ds---~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~~~gi~~~~~~~~ 166 (338) T protein:vir:78 90 LSGTAWDTRSVAPIKLATIVTVSEEFARMN---PSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTGSALQGIDTNNVIVN 166 (338) T ss_pred ccccceeEEEEEEEEEEEeehhhHHHHhcC---HHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccc Confidence 999999999999999999999999999754 47799999999999999999999999998877777776654322211 Q ss_pred c-ceeeccccccchHHHHHHHHHHHh-hcCCCccEEEEcHHHHHHHH---HhhccCCceeeccccccCCCceecceeEEe Q lcl|Aclame:pro 145 N-IVELTTGTSATPDLAVEAAVGLVL-GDNLSPDGVALDNTFSFMLA---TQRDSQGRKLYPELGFGTDVASFAGLNAAV 219 (311) Q Consensus 145 ~-~~~~~~~~~~~~~~~i~~~~~~~~-~~~~~~~~~v~n~~~~~~l~---~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~ 219 (311) . ............++.+.+++..+. +..+.+++|+|||.++..|+ ++||++|+|+|++...++.+++|+|+||++ T Consensus 167 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~~~~l~d~~g~~l~~~~~~~~~~~~l~G~PV~~ 246 (338) T protein:vir:78 167 TTNVDYLQTGTTPLLDRFLDGYDLVSANTDVDFNGWAADPRYRARLLRSQAYRDANGNVDPTRINLAASAGDLLGLPVQF 246 (338) T ss_pred ccccccccccchhhHHHHHHHHHHhhhhccccceEEEEchHHHHHHHHHhhhccCCCceeecccccCCCCceeeeeeEEE Confidence 1 111112223345677777777664 35566778999999987774 578999999999999999999999999999 Q ss_pred ecccccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCC------c-ccchhhhhcCcEEEEEEEEe Q lcl|Aclame:pro 220 SDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGD------P-DGLGDLKRQNQIAIRAEVVY 292 (311) Q Consensus 220 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~------~-~~~~~~f~~~~v~~ra~~r~ 292 (311) +++||...... ...+..+++|||+.+.++++++++++++++.. + ...+++|++|++++|+++|+ T Consensus 247 ~~~ip~~~~~~---------~~~~~~~~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~ 317 (338) T protein:vir:78 247 GKAVGGDLGAA---------TDSKVRVVGGDFSQLKYGFADEIRVKMSDTATLTDNTSPTPQTVSMWQTNQIAILIEVTF 317 (338) T ss_pred ccccCcccccc---------CCcccEEEEEecceEEEEeecccEEEEeecccccccccccccchhhhhcCcEEEEEEEEe Confidence 99999764332 23445799999999999999999999998753 1 23468899999999999999 Q ss_pred ccEEecccceEEEEecccC Q lcl|Aclame:pro 293 GIGIMSTDAFAVVRDADES 311 (311) Q Consensus 293 ~~~v~~~~a~~~l~~aa~~ 311 (311) |++++||+||++|++++++ T Consensus 318 d~~v~~~~a~~~l~~~~~~ 336 (338) T protein:vir:78 318 GWLLGDKQAFVKFVDDEDP 336 (338) T ss_pred ccEeecccceEEEecccCC Confidence 9999999999999999988 No 13 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=100.00 E-value=3.8e-59 Score=340.90 Aligned_cols=283 Identities=12% Similarity=0.106 Sum_probs=238.7 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCccccccc-cceeEEEEeeee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSEST-ATFAPVTAIPRK 79 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~-~~~~~v~l~~~k 79 (311) ..+.+.||++||++++++|++.+++.++++++|++++++++..++|+..+++.+.|++|++.+|+++ ++|++++++++| T Consensus 132 ~~t~~~gG~lvP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~~~~~~f~~v~~~~~k 211 (425) T protein:vir:10 132 KGEDSEGGYLTPIEWDRTITNKLVLISPMRQLCRVQPVSKAGFSKLFNMGGTTSGWVGEASQRPQTNAATFQPLSFASGE 211 (425) T ss_pred cCcCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEcCCcceeeeccccccccccccccceeeeehee Confidence 3466778999999999999999999999999999999999999999999999999999999999876 799999999999 Q ss_pred EEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCcccccccccccccccccee---------ec Q lcl|Aclame:pro 80 VQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVE---------LT 150 (311) Q Consensus 80 l~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~---------~~ 150 (311) ++++++||+|++++ +.++++++|.+++++++++++|.+||+|+|. ..|.|+.+.....++... .+ T Consensus 212 ~~~~i~iS~ell~d---s~~~l~~~i~~~la~ai~~~~d~~~l~G~G~---~~p~Gil~~~~~~~~~~~~~~~~~~~~~~ 285 (425) T protein:vir:10 212 IYANPAATQQILDD---AEIDLESWLATEVQTEFAKQEGKAFLAGDGT---NKPNGLLTYIAGGANAAKHPFGAIEVVNS 285 (425) T ss_pred eEeehHhHHHHHhc---chhHHHHHHHHHHHHHHHHHHHhhhhcccCC---CCcceeeeccccccccccccccccccccc Confidence 99999999999964 4578999999999999999999999999753 245555443332222111 11 Q ss_pred cccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccccc Q lcl|Aclame:pro 151 TGTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAV 230 (311) Q Consensus 151 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~ 230 (311) .......++++.+++..+........+|+|||+++.+|+++||++|+|+|.+....+.+++|+|+||++++.||.. T Consensus 286 ~~~~~~~~d~l~~l~~~l~~~~~~~a~~vmn~~~~~~L~~lkD~~G~~l~~~~~~~g~~~~l~G~PV~~~~~~p~~---- 361 (425) T protein:vir:10 286 GAAADITSDGIIDLVYDLPSAFTGNARFAMNRNTQRQVRKLKDGQGNYLWQPSYVAGQPATLAGYPVTEVPDMPDV---- 361 (425) T ss_pred cccccccHHHHHHHHhhhhhhhccCCEEEEchHHHHHHHHhhcCCCceeeccCccCCCCceecceeeEEecCcCCc---- Confidence 1233456788888888888777777789999999999999999999999998888888899999999999998842 Q ss_pred cccccccccccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecc Q lcl|Aclame:pro 231 TASTGVYRTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDAD 309 (311) Q Consensus 231 ~~~~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa 309 (311) ..+...++||||+. |.+..+.++++..+++ |.+|++.||++.|+|+++++|+||++|+.+| T Consensus 362 ---------~~~~~~i~~Gd~~~~~~i~~~~~~~v~~d~~---------~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~a 423 (425) T protein:vir:10 362 ---------AANSTPILFGDFQQTYLIIDRIGVRVLRDPY---------TAKPYVLFYTTKRVGGGLLNPEPMRAMKVAA 423 (425) T ss_pred ---------cCCccEEEEEehhccEEEEEecceEEEeccc---------ccCCcEEEEEEEEeccEeecccceEEEEeec Confidence 23445789999987 6678888877665443 6789999999999999999999999999988 Q ss_pred cC Q lcl|Aclame:pro 310 ES 311 (311) Q Consensus 310 ~~ 311 (311) |- T Consensus 424 s~ 425 (425) T protein:vir:10 424 SE 425 (425) T ss_pred cC Confidence 88 No 14 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=100.00 E-value=8.4e-59 Score=338.97 Aligned_cols=283 Identities=10% Similarity=0.069 Sum_probs=237.3 Q ss_pred Cc--ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCcccccc-ccceeEEEEee Q lcl|Aclame:pro 1 MV--ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSES-TATFAPVTAIP 77 (311) Q Consensus 1 ma--t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~l~~ 77 (311) |. +.+.||++||+++.++|++.+++.++++++|+++++.++.+.+|+..+++.+.|++|++.+|++ .++|+++++.+ T Consensus 106 ~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~i~~~~ 185 (407) T protein:vir:48 106 LQVGNDEDGGYAIPEELDRTILTLLKDEVVMRQEATVITLGGSDYKKLVNLGGTTSGWVGETDARPETATSKLGLIEPFM 185 (407) T ss_pred hhcccCCCCcccccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEecCCcceeeecccccccccccccceeEEeee Confidence 33 5567899999999999999999999999999999999999999999999999999999999976 48999999999 Q ss_pred eeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccc--------ee- Q lcl|Aclame:pro 78 RKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNI--------VE- 148 (311) Q Consensus 78 ~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~--------~~- 148 (311) +|++++++||+|++++ +.++++++|.+++++++++++|.++++|+|.+ .|.|+.+........ .. T Consensus 186 ~k~~~~~~iS~ell~d---s~~~l~~~i~~~l~~~i~~~~~~a~l~G~G~~---~p~Gil~~~~~~~~~~~~~~~~~~~~ 259 (407) T protein:vir:48 186 GEIYGNPQATQKMLDD---AFFNVEDWINSELALEFAEQEEIAFTSGDGSK---KPKGFLAYESTDEDDKTRAFGKLQHI 259 (407) T ss_pred eeeEeehhhHHHHHhc---chHHHHHHHHHHHHHHHHHHHHhhhhccCCCC---ccceeeeccccccccccccccccccc Confidence 9999999999999964 45789999999999999999999999997642 344443322111110 00 Q ss_pred eccccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccc Q lcl|Aclame:pro 149 LTTGTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPE 228 (311) Q Consensus 149 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~ 228 (311) .........++++.+++..+...+...++|+||+.++..|+++||++|||+|.+....+.+++|+|+||++++.||.. T Consensus 260 ~~~~~~~~~~d~i~~l~~~l~~~~~~~a~~v~n~~~~~~L~~lkD~~Gr~l~~~~~~~g~~~~l~G~PV~~~~~~p~~-- 337 (407) T protein:vir:48 260 ASGAASGVTADAIIKLIYTLRKAHRSGAKFMMNNSSLFAIRLLKDNDGNYLWRPGIELGQPSSLAGYGIVENEQMPDI-- 337 (407) T ss_pred ccccccccChHHHHHHHHhhchhhhcCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecCcCCc-- Confidence 111223345788888888888877777789999999999999999999999998888888999999999999999842 Q ss_pred cccccccccccccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEe Q lcl|Aclame:pro 229 AVTASTGVYRTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRD 307 (311) Q Consensus 229 ~~~~~~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~ 307 (311) ..+...++||||+. |.+..+.++++..+++ +++|++.||++.|+|+++++|+||++|+. T Consensus 338 -----------~~~~~~i~~Gd~~~~~~i~~~~~~~i~~d~~---------~~~~~~~~~~~~r~d~~v~~~~a~~~l~~ 397 (407) T protein:vir:48 338 -----------AADAKAIAFGNFKRGYTIVDRIGTRILRDPY---------TNKPFVGFYTTKRTGGMLVDSQAIKLMKI 397 (407) T ss_pred -----------cCCccEEEEEeccccEEEEEeeceEEEeecc---------ccCCcEEEEEEEEeccEEecccceEEEEe Confidence 23445788999985 7788888888876543 67899999999999999999999999999 Q ss_pred cccC Q lcl|Aclame:pro 308 ADES 311 (311) Q Consensus 308 aa~~ 311 (311) ++++ T Consensus 398 ~aa~ 401 (407) T protein:vir:48 398 GAAT 401 (407) T ss_pred eccC Confidence 9998 No 15 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=100.00 E-value=1.2e-58 Score=338.22 Aligned_cols=287 Identities=18% Similarity=0.219 Sum_probs=235.5 Q ss_pred Cccc--CCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCcc-----ccccccceeEE Q lcl|Aclame:pro 1 MVAL--ATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQ-----KSESTATFAPV 73 (311) Q Consensus 1 mat~--~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~-----~~~~~~~~~~v 73 (311) |+++ +.||++||++++++|++.+++.++|+++++++++.++++++|+.+.++.+.|++|++. +|.++++|+++ T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~wv~E~~~~~~~~~~~s~~~f~~i 80 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccCCcEEEEEEeCCcceEEeecccccccccccccccceeeE Confidence 9855 4478999999999999999999999999999999999999999999999999999985 55678999999 Q ss_pred EEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeecccc Q lcl|Aclame:pro 74 TAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGT 153 (311) Q Consensus 74 ~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~ 153 (311) ++++||++++++||+||++++ .++++++|++++++++++++|+++|+|++.+.+....++................. T Consensus 81 ~~~~~k~~~~~~is~ell~ds---~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (305) T protein:vir:25 81 TLVAEEIAVIIPVHENVIDDA---TVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGV 157 (305) T ss_pred EeeeEEEEEeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHhhhheeccCCCCCccccccccccccccccccccccc Confidence 999999999999999999654 57899999999999999999999999987655544443332222222222211111 Q ss_pred --ccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccccccccccc Q lcl|Aclame:pro 154 --SATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVT 231 (311) Q Consensus 154 --~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~ 231 (311) ..+..+.+..+...+...++..+.|+|||.++..|+++||++|+|+|++ ++|+|+||++++.+|.. T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~-------~~l~G~Pv~~~~~~~~~----- 225 (305) T protein:vir:25 158 ANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFNRNGAWD----- 225 (305) T ss_pred hhhhHHHHHHHHHHHhhhhcccccceeEecHHHHHHHHHhhccCCceeecC-------CcccccceEEcCccCCC----- Confidence 1123344555566666677778889999999999999999999999965 47999999999887632 Q ss_pred ccccccccccccceEEEeecceEEEEeecCceEEEeccC---CcccchhhhhcCcEEEEEEEEeccEEecccceEEEEec Q lcl|Aclame:pro 232 ASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFG---DPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDA 308 (311) Q Consensus 232 ~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~---~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~a 308 (311) .++..+++|||+++.++.+++++++++++. ..+..+++|++|++++|++.|+|+.++||+||++++.. T Consensus 226 ---------~~~~~~~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~ 296 (305) T protein:vir:25 226 ---------ADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKT 296 (305) T ss_pred ---------CCccEEEEEecceEEEEEecCeEEEEeeeeeeecCCceeeeeecCcEEEEEEEeecceeeCcccEEEEccc Confidence 234578999999999999999999998875 33455678999999999999999999999999999987 Q ss_pred ccC Q lcl|Aclame:pro 309 DES 311 (311) Q Consensus 309 a~~ 311 (311) ..+ T Consensus 297 ~~~ 299 (305) T protein:vir:25 297 PVA 299 (305) T ss_pred ccc Confidence 665 No 16 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=100.00 E-value=2.4e-58 Score=336.43 Aligned_cols=298 Identities=26% Similarity=0.341 Sum_probs=245.9 Q ss_pred Ccc--cC------CCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecC--------cccc Q lcl|Aclame:pro 1 MVA--LA------TGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEG--------AQKS 64 (311) Q Consensus 1 mat--~~------~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg--------~~~~ 64 (311) |.+ .. .++.++|+++.++|++.+++.++++++|++++++++..++|+.++.+.+.|++|+ +.++ T Consensus 10 ~~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~eg~~~~~~e~~~~~ 89 (333) T protein:vir:78 10 NSAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIPISYGETIIPTTVKRPEVGQVGVGTSNEQREGGLKP 89 (333) T ss_pred hcccccccCceecCCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCceeEeecCccccccccccccc Confidence 221 11 2233899999999999999999999999999999999999999999999998776 5678 Q ss_pred ccccceeEEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccc--cc Q lcl|Aclame:pro 65 ESTATFAPVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKI--LD 142 (311) Q Consensus 65 ~~~~~~~~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~--~~ 142 (311) +++++|+++++++||+++++++|+|+++++ ..+++++|+++++++|++++|.++|+|++...+..+.++.+.. .. T Consensus 90 ~~~~~f~~i~l~~~kl~~~~~is~ell~~s---~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~~~~~g~~~~~~~~~ 166 (333) T protein:vir:78 90 LSGTAWDTRSVSPIKLATIVTVSEEFARMN---PSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTGSALQGIDTDNVIAN 166 (333) T ss_pred ccccceeEEEEeeEEEEEeehhhHHHHhcC---HHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCCcccccccccccccc Confidence 899999999999999999999999999654 4689999999999999999999999999877777766654422 22 Q ss_pred cccceeeccccccchHHHHHHHHHHHhh-cCCCccEEEEcHHHHHHHHH---hhccCCceeeccccccCCCceecceeEE Q lcl|Aclame:pro 143 TTNIVELTTGTSATPDLAVEAAVGLVLG-DNLSPDGVALDNTFSFMLAT---QRDSQGRKLYPELGFGTDVASFAGLNAA 218 (311) Q Consensus 143 ~~~~~~~~~~~~~~~~~~i~~~~~~~~~-~~~~~~~~v~n~~~~~~l~~---lkd~~g~~~~~~~~~~~~~~~l~G~pv~ 218 (311) .+..... .......++++.+++..+.. ..+.+++|+|||.++..|++ ++|++|+|+|++....+.+++|+|+||+ T Consensus 167 ~~~~~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i~~~~~~~~~~~~l~G~Pv~ 245 (333) T protein:vir:78 167 TTNVDYL-QETGDPLLDRLLDGYDLVSANTDVEFNGWAVDPRFRAHLLRAQAYRDANGNVDPSRINLAAQTGDVLGLPAQ 245 (333) T ss_pred ccccccc-ccccchhHHHHHHHHHhhccccccCceEEEEcchHHHHHHHHhhhcCCCCceeecCccccCCCceeeceeeE Confidence 2222222 22233457778888877654 45566789999999987765 7899999999999999999999999999 Q ss_pred eecccccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCc----ccchhhhhcCcEEEEEEEEecc Q lcl|Aclame:pro 219 VSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDP----DGLGDLKRQNQIAIRAEVVYGI 294 (311) Q Consensus 219 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~----~~~~~~f~~~~v~~ra~~r~~~ 294 (311) +++++|.+.... ...+..+++|||+.+.++++++++++++++... ...+++|++|++.+|+++|+|+ T Consensus 246 ~~~~i~~~~~~~---------~~~~~~~~~gD~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~ 316 (333) T protein:vir:78 246 FGRAVGGDLGAA---------VDSKTRIIGGDFSQLKFGFADEIRIKMSDTATLTDSGSATVSMWQTNQIAILIEVTFGW 316 (333) T ss_pred EccccCCCcccc---------CCCccEEEEEecccEEEEEeeccEEEEeccccccccccceeehhhcCcEEEEEEEEEcc Confidence 999999664332 344558999999999999999999999988642 2345789999999999999999 Q ss_pred EEecccceEEEEecccC Q lcl|Aclame:pro 295 GIMSTDAFAVVRDADES 311 (311) Q Consensus 295 ~v~~~~a~~~l~~aa~~ 311 (311) +++||+||++|+.+++- T Consensus 317 ~v~~~~a~~~l~~~~a~ 333 (333) T protein:vir:78 317 LLGDKQAFVKFVDDEQP 333 (333) T ss_pred EEecccceEEEeccCCC Confidence 99999999999988888 No 17 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=100.00 E-value=4.2e-58 Score=335.14 Aligned_cols=289 Identities=20% Similarity=0.238 Sum_probs=235.1 Q ss_pred Cc-ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEeeee Q lcl|Aclame:pro 1 MV-ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPRK 79 (311) Q Consensus 1 ma-t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k 79 (311) |. +++++|.++|++++++|++.+++.++++++|++++++++++++|+.++++.+.|++|++++|+++++|++++++++| T Consensus 20 ~~~~~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k 99 (326) T protein:vir:42 20 AQTGDSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKIPMGTTGQKIPHWTGDVSASWIGEGDMKPITKGNMTSQTIAPHK 99 (326) T ss_pred eeccccCCcceechhhHHHHHHHHHhcchhhhhcceeeccCCceEEEEEeCCcceEEecCCccccccccceeEEEEeeEE Confidence 32 22334557899999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccc--cccch Q lcl|Aclame:pro 80 VQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTG--TSATP 157 (311) Q Consensus 80 l~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~--~~~~~ 157 (311) +++++++|+||+++| .++++++|.+++++++++++|+++|+|++.+. +.++..............+. ..... T Consensus 100 ~~~~v~iS~ell~~s---~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~~---p~gi~~~~~~~~~~~~~~~~~~~~~~~ 173 (326) T protein:vir:42 100 IATIFVASAETVRAN---PANYLGTMRTKVATAFAMAFDNAAINGTDSPF---PTFLAQTTKEVSLVDPDGTGSNADLTV 173 (326) T ss_pred EEEeehhhHHHHhcC---HHHHHHHHHHHHHHHHHHHHHHHhhcccCCCc---cccccccccccceeecccccccccchh Confidence 999999999999755 46899999999999999999999999976432 23333222222222111111 11122 Q ss_pred -HHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCC-----ceecceeEEeeccccccccccc Q lcl|Aclame:pro 158 -DLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDV-----ASFAGLNAAVSDTVRGGPEAVT 231 (311) Q Consensus 158 -~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~-----~~l~G~pv~~~~~~~~~~~~~~ 231 (311) +..+..++..+.......++|+|||.++..|+++||++|+|+|.+....+.+ ++++|+||++++++|.+ T Consensus 174 ~~~~~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~----- 248 (326) T protein:vir:42 174 YDAVAVNALSLLVNAGKKWTHTLLDDITEPILNGAKDKSGRPLFIESTYTEENSPFRLGRIVARPTILSDHVASG----- 248 (326) T ss_pred HHHHHHHHHhhhhhhccCccEEEEeHHHHHHHHHhhccCCceeeccccccCccccccCceeeeeeEEEcCCCCCC----- Confidence 2234556666777777888999999999999999999999999876655443 47999999999998743 Q ss_pred ccccccccccccceEEEeecceEEEEeecCceEEEeccCC-------cccchhhhhcCcEEEEEEEEeccEEecccceEE Q lcl|Aclame:pro 232 ASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGD-------PDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAV 304 (311) Q Consensus 232 ~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~-------~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~ 304 (311) ...+++|||+.+.++.+++++++++++.. .+..+++|++|++.+|+++|+|+++.||+||++ T Consensus 249 -----------~~~~~~Gd~s~~~~~~~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~~d~~v~~~~a~~~ 317 (326) T protein:vir:42 249 -----------TVVGYQGDFRQLVWGQVGGLSFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYAFHCNDKDAFVK 317 (326) T ss_pred -----------ceEEEEeecceEEEEEecceEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEecccceEE Confidence 33578999999999999999999877643 234568899999999999999999999999999 Q ss_pred EEecccC Q lcl|Aclame:pro 305 VRDADES 311 (311) Q Consensus 305 l~~aa~~ 311 (311) |+.++++ T Consensus 318 l~~~~~~ 324 (326) T protein:vir:42 318 LTNVDAT 324 (326) T ss_pred Eeecccc Confidence 9988888 No 18 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=100.00 E-value=2.3e-58 Score=336.55 Aligned_cols=291 Identities=19% Similarity=0.188 Sum_probs=232.3 Q ss_pred Cc---ccCCCceEcchhHHHHHHHHHHhhchhhhh-cceeecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEe Q lcl|Aclame:pro 1 MV---ALATGTFQLPKHLVPGVWQKAQGQSVLARL-SMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAI 76 (311) Q Consensus 1 ma---t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l-~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~ 76 (311) |+ +.++||++||+++.++|++.+++.++++++ ++.+++.++++++|+.++++.++|++|++++|+++++|++++++ T Consensus 64 ~a~~~~~~~Gg~lvP~~~~~~ii~~l~~~s~l~~lg~~~v~~~~g~~~~p~~t~~~~a~wv~E~~~~~~s~~~f~~i~~~ 143 (366) T protein:vir:57 64 MAISTAAGSGGALIPQNMQNEVIELLRDRTVVRILGARSIPLPNGNLSMPRLSGGATAGYVGEGKDVVATGATFDDVKLS 143 (366) T ss_pred hhccccccCCccccchhHHHHHHHHHhhhcchhhhceeeeecCCCceEEEEEeCCcceeeeccCccccccccceeEEEEe Confidence 22 345789999999999999999999999999 78899999999999999999999999999999999999999999 Q ss_pred eeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeec-cc-cc Q lcl|Aclame:pro 77 PRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELT-TG-TS 154 (311) Q Consensus 77 ~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~-~~-~~ 154 (311) ++|++++++||+||++++ .++++++|++++++++++++|++||+|++ .+..+.|+.+............ +. +. T Consensus 144 ~~k~~~~~~iS~ell~ds---~~~~~~~i~~~l~~a~~~~~d~a~l~G~G--~~~~p~Gi~~~~~~~~~~~~~~~t~~~~ 218 (366) T protein:vir:57 144 AKTMIALVPVSNQLIGRA---GFNVEQLLLGDILSAIATREDKAFLRDDG--TGDTPKGMKAVATAANRLVAWTGTAINL 218 (366) T ss_pred eEEEEEeehhhHHHHhhh---hHHHHHHHHHHHHHHHHHHHHHHhhccCC--CCccccceeeccccccceeeccccccch Confidence 999999999999999644 56899999999999999999999999964 3445555543322222222111 11 11 Q ss_pred cchHHHHHHHHH--HHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccccccc Q lcl|Aclame:pro 155 ATPDLAVEAAVG--LVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTA 232 (311) Q Consensus 155 ~~~~~~i~~~~~--~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~ 232 (311) ...+..+..+.. ...........|+|||.++..|+++||++|+|+|++. ..++|+|+||++++++|.+... T Consensus 219 ~~~~~~~~~~~~~~~~~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~l~~~~----~~g~l~G~Pvv~s~~ip~~~~~--- 291 (366) T protein:vir:57 219 TTIDEYLDSLILKHMDSNSNMIRCGWGLSNRTYMTLFGLRDGNGNKVYPEM----SQGILKGYPIQRTSAIPANLGD--- 291 (366) T ss_pred hhHHHHHHHHHHhhhccccccccCEEEecHHHHHHHHhhhccCCceeccCC----CCCeecceeeEEcccccccccc--- Confidence 112222222222 2223344566899999999999999999999999643 3468999999999999976432 Q ss_pred cccccccccccceEEEeecceEEEEeecCceEEEeccCC---cc-cchhhhhcCcEEEEEEEEeccEEecccceEEEEec Q lcl|Aclame:pro 233 STGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGD---PD-GLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDA 308 (311) Q Consensus 233 ~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~---~~-~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~a 308 (311) ..+...++||||+.+.++.+.+++++++++.. .+ ..+++|++|++++|+++|+||+++||+||++|+.. T Consensus 292 -------~~~~~~i~~gdfs~~~i~~~~~i~i~~~~ea~~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~lt~~ 364 (366) T protein:vir:57 292 -------DGNESEIYFCDFNDVVIGEDGMMKVDFSTEATYKDADGQLVSAFARNQSLIRVVTEHDIGFRHPEGLVLGTGV 364 (366) T ss_pred -------CCCccEEEEEecceEEEEEecceEEEEeeccccccccccchhhhhcCceeEEeeeeeCcEeeccccEEEEecc Confidence 23345789999999999999999999988742 22 33578999999999999999999999999999977 Q ss_pred cc Q lcl|Aclame:pro 309 DE 310 (311) Q Consensus 309 a~ 310 (311) .= T Consensus 365 ~~ 366 (366) T protein:vir:57 365 IW 366 (366) T ss_pred cC Confidence 77 No 19 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=100.00 E-value=1.5e-58 Score=337.58 Aligned_cols=282 Identities=11% Similarity=0.081 Sum_probs=235.8 Q ss_pred Cc--ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCcccccc-ccceeEEEEee Q lcl|Aclame:pro 1 MV--ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSES-TATFAPVTAIP 77 (311) Q Consensus 1 ma--t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~l~~ 77 (311) |. +.+.||++||+++.++|++.+++.++|+++|+++++.++...+|+..+++.+.|++|++.+|.+ .++|+++++.+ T Consensus 107 ~~~~~~~~GG~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~~~~~~~~~v~~~~ 186 (401) T protein:vir:44 107 LQVGTDEDGGYAVPEELDRSILSLLKDEVVMRQEATVITVGGSDYKKLVNLGGTASGWVGETDTRSQTATSRLGLIEPFM 186 (401) T ss_pred hhcCCCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEecCCccceeeccccccCccccccceeeeeeh Confidence 44 3367899999999999999999999999999999999999999999999999999999999965 58999999999 Q ss_pred eeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCcccccccccccccccc--------ceee Q lcl|Aclame:pro 78 RKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTN--------IVEL 149 (311) Q Consensus 78 ~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~--------~~~~ 149 (311) +|+++++++|+|++++ +.++++++|.+++++++++++|.++|+|+|.+ .|.|+.+....... .... T Consensus 187 ~k~~~~~~iS~ell~d---s~~~l~~~i~~~la~ai~~~~~~~~l~G~G~~---~p~Gil~~~~~~~~~~~~~~~~~~~~ 260 (401) T protein:vir:44 187 GEIYGNPQATQKMLDD---AFFNVEAWINSELATEFAEQEEIAFTTGDGTK---KPKGFLAYESTEESDKARAFGKLQHI 260 (401) T ss_pred hheeeehhhhHHHHhc---chHHHHHHHHHHHHHHHHHHHHhhhhccCCCC---ccceeecccccccccccccccccccc Confidence 9999999999999964 45789999999999999999999999997542 34444322111110 0111 Q ss_pred -ccccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccc Q lcl|Aclame:pro 150 -TTGTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPE 228 (311) Q Consensus 150 -~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~ 228 (311) +.......++++.+++..+...+...++|+||++++..|+++||++|||+|.+....+.+++|+|+||++++.+|.. T Consensus 261 ~t~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~g~~~~l~G~PVv~~~~~p~~-- 338 (401) T protein:vir:44 261 VSGEATAVTADAIIKLIYTLRKAHRTGAKFMMNNNSLFAIRLLKDTEGNYLWRPGLELGQPSSLAGYGIAENEQMPDI-- 338 (401) T ss_pred ccccccccCHHHHHHHHHhcchhhhcCCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCCceecceeeEEecCcCCc-- Confidence 11223345888889998888777777789999999999999999999999998888888899999999999998842 Q ss_pred cccccccccccccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEe Q lcl|Aclame:pro 229 AVTASTGVYRTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRD 307 (311) Q Consensus 229 ~~~~~~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~ 307 (311) ..+...++||||+. |.+..+.++++..+++ +++|++.||+++|+|+++++|+||++|+. T Consensus 339 -----------~~~~~~i~~Gd~~~~~~i~~~~~~~~~~~~~---------~~~~~v~~~a~~r~d~~~~~~~a~~~l~~ 398 (401) T protein:vir:44 339 -----------AADAKAIAFGNFKRGYTIVDRIGTRILRDPY---------TNKPFVGFYTTKRTGGMLVDSQAIKLLKI 398 (401) T ss_pred -----------cCCccEEEEeehhccEEEEEecceEEeeecc---------ccCCcEEEEEEEEeccEEecccceEEEEe Confidence 23345688999975 7788899888876543 67899999999999999999999999999 Q ss_pred ccc Q lcl|Aclame:pro 308 ADE 310 (311) Q Consensus 308 aa~ 310 (311) +|+ T Consensus 399 ~aa 401 (401) T protein:vir:44 399 AAA 401 (401) T ss_pred ecC Confidence 999 No 20 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=100.00 E-value=8.8e-58 Score=333.37 Aligned_cols=288 Identities=17% Similarity=0.235 Sum_probs=237.8 Q ss_pred Cccc--CCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEeee Q lcl|Aclame:pro 1 MVAL--ATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPR 78 (311) Q Consensus 1 mat~--~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~ 78 (311) |+.+ ++++.+||+++.++|++.+++.++++++|++++++++++++|+.++++++.|++|++++|+++++|++++++++ T Consensus 14 ~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~ 93 (320) T protein:vir:10 14 IAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWIGDVSAQWIGEGDMKPITKGNMTSQNIAPH 93 (320) T ss_pred hhccccccccccccHHHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEecCCccccccccceeEEEEeeE Confidence 5533 33556899999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccc---ccc Q lcl|Aclame:pro 79 KVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTG---TSA 155 (311) Q Consensus 79 kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~---~~~ 155 (311) |++++++||+|+++++ .++++++|.+++++++++++|+++|+|++.+....+.++ ....+....... ... T Consensus 94 k~~~~~~is~ell~ds---~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~----~~~~~~~~~~~~~~~~~~ 166 (320) T protein:vir:10 94 KIATIFVASAETVRAN---PANYLGTMRTKVATAFAMAFDSAALNGTDSPFPTYLAQT----TKSVSLADPGGATASDLT 166 (320) T ss_pred EEEEeehhhHHHHhcC---hHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCcccccc----cccccceecccccccccc Confidence 9999999999999754 478999999999999999999999999775444333322 222222222211 112 Q ss_pred chHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCC-----CceecceeEEeecccccccccc Q lcl|Aclame:pro 156 TPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTD-----VASFAGLNAAVSDTVRGGPEAV 230 (311) Q Consensus 156 ~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~-----~~~l~G~pv~~~~~~~~~~~~~ 230 (311) ..++.+.+++..+.+.+..+++|+|||+++.+|+++||++|+|+|.+...... .++++|+||++++.+|.+ T Consensus 167 ~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~~~~~---- 242 (320) T protein:vir:10 167 AYDAVAVNGLSLLVNAKKKWTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDENSPFRAGRIVSRPTILSDHVADG---- 242 (320) T ss_pred cHHHHHHHHHhhhhcccCCCcEEEEcHHHHHHHHHhhccCCceeeccccccCccccccCceeeeeeeEecCCCCCC---- Confidence 23345677778888888888899999999999999999999999987655444 357899999999888743 Q ss_pred cccccccccccccceEEEeecceEEEEeecCceEEEeccCC-------cccchhhhhcCcEEEEEEEEeccEEecccceE Q lcl|Aclame:pro 231 TASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGD-------PDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFA 303 (311) Q Consensus 231 ~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~-------~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~ 303 (311) ...+++|||+.+.++.+++++++++++.. ....+++|++|++++|+++|+|++++||+||+ T Consensus 243 ------------~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~~~~a~~ 310 (320) T protein:vir:10 243 ------------TTVGYMGDFRNVIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFHNNDKDAFV 310 (320) T ss_pred ------------ceEEEEeecceEEEEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEEEeeccEEecccceE Confidence 23578999999999999999999987753 22345789999999999999999999999999 Q ss_pred EEEecccC Q lcl|Aclame:pro 304 VVRDADES 311 (311) Q Consensus 304 ~l~~aa~~ 311 (311) +|+.+++- T Consensus 311 ~l~~~~ap 318 (320) T protein:vir:10 311 KLTNVVTP 318 (320) T ss_pred EEEeccCC Confidence 99977755 No 21 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=100.00 E-value=1.3e-57 Score=332.49 Aligned_cols=281 Identities=19% Similarity=0.185 Sum_probs=239.8 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEeeeeE Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPRKV 80 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~kl 80 (311) .++.++|+++||+++.++|++.+++.+++++++++++++++++++|+.++++.++|++|++++|+++++|++++++++|+ T Consensus 29 ~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~ 108 (324) T protein:vir:78 29 VMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKL 108 (324) T ss_pred ccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceeEecCCccccccccceeEEEEeeEEE Confidence 44567788999999999999999999999999999999988899999999999999999999999999999999999999 Q ss_pred EEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchHHH Q lcl|Aclame:pro 81 QVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPDLA 160 (311) Q Consensus 81 ~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (311) +++++||+|+++++ .++++++|.+++++++++++|.++|+|++. +..+ .++........... .....+++ T Consensus 109 ~~~~~is~ell~ds---~~~l~~~i~~~la~ai~~~~d~a~l~G~g~--~~~~----~gi~~~~~~~~~~~-~~~~t~~~ 178 (324) T protein:vir:78 109 GVILPVTKEFLNYT---YSQFFEEMKPMIAEAFYKKFDEAGILNQGN--NPFG----KSIAQSIEKTNKVI-KGDFTQDN 178 (324) T ss_pred EEeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHHhccCCC--CCcC----ccccccccccceec-cccccHHH Confidence 99999999999655 468999999999999999999999999652 2222 33333333322222 23456889 Q ss_pred HHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccccccccccccccc Q lcl|Aclame:pro 161 VEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTT 240 (311) Q Consensus 161 i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~ 240 (311) +.+++.++...++.+++|+|||+++..|+++||++|+|+|.. ..+++|+|+||++++..+ T Consensus 179 i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~d~~G~~~~~~----~~~~~l~G~PV~~~~~~~---------------- 238 (324) T protein:vir:78 179 IIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLPVVNLKSSN---------------- 238 (324) T ss_pred HHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCCeeecC----CCCCcccceeeEeeCCCC---------------- Confidence 999999999999999999999999999999999999999853 346789999998765543 Q ss_pred cccceEEEeecceEEEEeecCceEEEeccCC------c-ccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 241 NPNVKAIAGDFSAFRWGVQVSIPLELIEFGD------P-DGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 241 ~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~------~-~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .+...+++|||+.+.++++++++++++++.. . ...+++|++|++++|+++|+|++++||+||++|+.+... T Consensus 239 ~~~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~ 316 (324) T protein:vir:78 239 LKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKR 316 (324) T ss_pred CCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeccccc Confidence 3455789999999999999999999988753 1 234678999999999999999999999999999986665 No 22 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=100.00 E-value=1.3e-57 Score=332.49 Aligned_cols=281 Identities=19% Similarity=0.185 Sum_probs=239.8 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEeeeeE Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPRKV 80 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~kl 80 (311) .++.++|+++||+++.++|++.+++.+++++++++++++++++++|+.++++.++|++|++++|+++++|++++++++|+ T Consensus 29 ~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~ 108 (324) T protein:vir:96 29 VMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKL 108 (324) T ss_pred ccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceeEecCCccccccccceeEEEEeeEEE Confidence 44567788999999999999999999999999999999988899999999999999999999999999999999999999 Q ss_pred EEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchHHH Q lcl|Aclame:pro 81 QVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPDLA 160 (311) Q Consensus 81 ~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (311) +++++||+|+++++ .++++++|.+++++++++++|.++|+|++. +..+ .++........... .....+++ T Consensus 109 ~~~~~is~ell~ds---~~~l~~~i~~~la~ai~~~~d~a~l~G~g~--~~~~----~gi~~~~~~~~~~~-~~~~t~~~ 178 (324) T protein:vir:96 109 GVILPVTKEFLNYT---YSQFFEEMKPMIAEAFYKKFDEAGILNQGN--NPFG----KSIAQSIEKTNKVI-KGDFTQDN 178 (324) T ss_pred EEeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHHhccCCC--CCcC----ccccccccccceec-cccccHHH Confidence 99999999999655 468999999999999999999999999652 2222 33333333322222 23456889 Q ss_pred HHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccccccccccccccc Q lcl|Aclame:pro 161 VEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTT 240 (311) Q Consensus 161 i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~ 240 (311) +.+++.++...++.+++|+|||+++..|+++||++|+|+|.. ..+++|+|+||++++..+ T Consensus 179 i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~d~~G~~~~~~----~~~~~l~G~PV~~~~~~~---------------- 238 (324) T protein:vir:96 179 IIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLPVVNLKSSN---------------- 238 (324) T ss_pred HHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCCeeecC----CCCCcccceeeEeeCCCC---------------- Confidence 999999999999999999999999999999999999999853 346789999998765543 Q ss_pred cccceEEEeecceEEEEeecCceEEEeccCC------c-ccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 241 NPNVKAIAGDFSAFRWGVQVSIPLELIEFGD------P-DGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 241 ~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~------~-~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .+...+++|||+.+.++++++++++++++.. . ...+++|++|++++|+++|+|++++||+||++|+.+... T Consensus 239 ~~~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~ 316 (324) T protein:vir:96 239 LKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKR 316 (324) T ss_pred CCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeccccc Confidence 3455789999999999999999999988753 1 234678999999999999999999999999999986665 No 23 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=100.00 E-value=1.8e-57 Score=331.64 Aligned_cols=285 Identities=20% Similarity=0.269 Sum_probs=238.7 Q ss_pred Ccc--cCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEeee Q lcl|Aclame:pro 1 MVA--LATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPR 78 (311) Q Consensus 1 mat--~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~ 78 (311) |++ .+.++.+||+++.++|++.+++.++++++|++++++++++++|+.++++.++|++|++++++++++|+++++++| T Consensus 14 ~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~ 93 (318) T protein:vir:24 14 IAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWVGDVSAQWIGEGDMKPITKGNMTSQTIAPH 93 (318) T ss_pred hhcccCcccceeechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCcceEEecCCccccccccceeEEEEeeE Confidence 442 344677899999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceee--ccccccc Q lcl|Aclame:pro 79 KVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVEL--TTGTSAT 156 (311) Q Consensus 79 kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~--~~~~~~~ 156 (311) |+++++++|+|+++++ .++++++|.+++++++++++|+++|+|++.+.. .++ ......... ....... T Consensus 94 k~~~~~~iS~e~l~ds---~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~---~~~----~~~~~~~~~~~~~~~~~~ 163 (318) T protein:vir:24 94 KIATIFVASAETVRAN---PANYLGTMRTKVATAFAMAFDGAAMHGTDSPFP---TYI----GQTTKAISIADTTGATTV 163 (318) T ss_pred EEEEeehhhHHHhhcC---hHHHHHHHHHHHHHHHHHHHHHhhhcccCCCCC---ccc----ccccccccccccccccch Confidence 9999999999999755 467999999999999999999999999764322 222 222222111 1222233 Q ss_pred hHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCC-----ceecceeEEeeccccccccccc Q lcl|Aclame:pro 157 PDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDV-----ASFAGLNAAVSDTVRGGPEAVT 231 (311) Q Consensus 157 ~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~-----~~l~G~pv~~~~~~~~~~~~~~ 231 (311) .++.+.+++..+.+.+..+.+|+|||+++..|+++||++|+|+|.+...+..+ +.++|+|+++++.++.+ T Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~~~~~----- 238 (318) T protein:vir:24 164 YDQVAVNGLSLLVNDGKKWTHTLLDDITEPILNGAKDQNGRPLFIESTYGEAASPFRSGRIVARPTILSDHVVEG----- 238 (318) T ss_pred HHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhccCCceeecCccccCccccccCceEEEEeeEEeCCCCCC----- Confidence 44566778888888888888999999999999999999999999887766554 46889999988887633 Q ss_pred ccccccccccccceEEEeecceEEEEeecCceEEEeccCC-------cccchhhhhcCcEEEEEEEEeccEEecccceEE Q lcl|Aclame:pro 232 ASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGD-------PDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAV 304 (311) Q Consensus 232 ~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~-------~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~ 304 (311) ...+++|||+.+.++++++++++++++.. ....+++|++|++++|+++|+|+++.+|+||++ T Consensus 239 -----------~~~~~~gdfs~~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~ 307 (318) T protein:vir:24 239 -----------TTVGFMGDFSQLIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAEAFVA 307 (318) T ss_pred -----------ccEEEEeecceEEEEEecCeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEEecccceEE Confidence 34678999999999999999999887643 234568899999999999999999999999999 Q ss_pred EEecccC Q lcl|Aclame:pro 305 VRDADES 311 (311) Q Consensus 305 l~~aa~~ 311 (311) |+.++++ T Consensus 308 i~~~~a~ 314 (318) T protein:vir:24 308 LTNVVSG 314 (318) T ss_pred EEeeccC Confidence 9998888 No 24 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=100.00 E-value=1.6e-57 Score=331.94 Aligned_cols=281 Identities=18% Similarity=0.180 Sum_probs=239.5 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEeeeeE Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPRKV 80 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~kl 80 (311) ..++++|+++||++++++|++.+++.+++++++++++++++++++|+.++.+.+.|++|++++|+++++|++++++++|+ T Consensus 29 ~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~ 108 (324) T protein:vir:97 29 VMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKL 108 (324) T ss_pred ccccCCCcceechhHHHHHHHHHHhhcchhhhcceeeccCCceEEEEEecCcceeEeccCccccccccceeEEEEeeEEE Confidence 23445688999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchHHH Q lcl|Aclame:pro 81 QVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPDLA 160 (311) Q Consensus 81 ~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (311) +++++||+|+++++ .++++++|.+++++++++++|+++|+|++. +..+ .++.......+.. ......+++ T Consensus 109 ~~~~~is~ell~ds---~~~l~~~i~~~l~~aia~~~d~a~l~G~g~--~~~~----~gi~~~~~~~~~~-~~~~~~~~~ 178 (324) T protein:vir:97 109 GVILPVTKEFLNYT---YSQFFEEMKPMIAEAFYKKFDEAGILNQGN--NPFG----KSIAQSIEKTNKV-IKGDFTQDN 178 (324) T ss_pred EEeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHhhccCCC--CccC----cccccccccccee-ccccCCHHH Confidence 99999999999654 478999999999999999999999999652 2222 3333333332222 223456889 Q ss_pred HHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccccccccccccccc Q lcl|Aclame:pro 161 VEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTT 240 (311) Q Consensus 161 i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~ 240 (311) +.+++.++...++.+++|+|||.++..|+++||++|+|+|.+ ...++|+|+||++++..+ T Consensus 179 i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~g~~~~~~----~~~~tl~G~PV~~~~~~~---------------- 238 (324) T protein:vir:97 179 IIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDTLDGLPVVNLKSSN---------------- 238 (324) T ss_pred HHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhcCCCceeecC----CCCccccceeeEeecCCC---------------- Confidence 999999999999999999999999999999999999999864 345789999998876543 Q ss_pred cccceEEEeecceEEEEeecCceEEEeccCC-------cccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 241 NPNVKAIAGDFSAFRWGVQVSIPLELIEFGD-------PDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 241 ~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~-------~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .....+++|||+.+.++++++++++++++.. ....+++|++|++++|+++|+|+++.+|+||++|+.+... T Consensus 239 ~~~~~~~~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~~~~ 316 (324) T protein:vir:97 239 LKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKK 316 (324) T ss_pred CCcceEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccCC Confidence 2344789999999999999999999988753 1234688999999999999999999999999999988776 No 25 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=100.00 E-value=9.1e-58 Score=333.30 Aligned_cols=294 Identities=16% Similarity=0.151 Sum_probs=238.4 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhh-cceeecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEeeee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARL-SMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPRK 79 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l-~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k 79 (311) -.+.+.||++||+++.++|++.+++.++++++ ++.+++.++.+++|+.++++.+.|++|++.+|+++++|+++++.++| T Consensus 134 ~~~~~~gg~lvP~~~~~~ii~~l~~~~~i~~~~~~~v~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~~i~~~~~k 213 (435) T protein:vir:80 134 TLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNGNITIPRLKGGAIVGYIGADTDIPTTQQQFDDLKLTAKK 213 (435) T ss_pred ccCCCCCccccchhHHHHHHHHHhhhchhhhccceeeecCCCceEEEEEeCCcceeeeccCccccccccceeeEEEeeEE Confidence 12445689999999999999999999999998 67899999999999999999999999999999999999999999999 Q ss_pred EEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchHH Q lcl|Aclame:pro 80 VQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPDL 159 (311) Q Consensus 80 l~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (311) ++++++||+|+++++.. .++++++|.+++++++++++|++||+|++ .+..|.|+................+....+. T Consensus 214 ~~~~~~is~ell~ds~~-~~~l~~~i~~~l~~a~~~~~d~a~l~G~G--~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~ 290 (435) T protein:vir:80 214 MAALVPIANDLIKYAGV-NPNVDQIVVGDLTAAIGAREDKAFIRDDG--TANTPKGLRFWALPGNVITASDGSTLQKIET 290 (435) T ss_pred EEEeehhhHHHHHhhcc-cHHHHHHHHHHHHHHHHHHHHHHhhccCC--CCCcccceeecccccceeecccccchhhHHH Confidence 99999999999976532 24689999999999999999999999854 3344555543322222222222223334455 Q ss_pred HHHHHHHHHhhc--CCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccccccccccccccccc Q lcl|Aclame:pro 160 AVEAAVGLVLGD--NLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVY 237 (311) Q Consensus 160 ~i~~~~~~~~~~--~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~ 237 (311) ++.+++..+... ++.+++|+|||.++..|+++||++|+|+|++. ..++|+|+||++++.+|..... T Consensus 291 d~~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~----~~~~l~G~pv~~~~~~p~~~~~-------- 358 (435) T protein:vir:80 291 DLGKAILALENADANLTQPGWIMAPRTFRFLEGLRDGNGNKVYPEL----ANGMLKGYPVGKTTQVPINLGE-------- 358 (435) T ss_pred HHHHHHHHhhccccccccCEEEEcHHHHHHHHhhhccCCceeccCC----CCCeEeeeeeEEeccccccccC-------- Confidence 677777666543 34567899999999999999999999999643 3468999999999999975432 Q ss_pred ccccccceEEEeecceEEEEeecCceEEEeccCC---c-ccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 238 RTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGD---P-DGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 238 ~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~---~-~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ..+...+++|||+.+.++++.+++++++++.. . ...+++|++|+++||++.|+|++++||+||++|+..+=. T Consensus 359 --~~~~~~i~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~~~~~ 434 (435) T protein:vir:80 359 --AGKESEIYFTDFGDVFIGEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSGVAWG 434 (435) T ss_pred --CCCcceEEEEEcccEEEEeecceEEEEeccccccccccchhhhhhcCcceeeeeeeeCcEeecccceEEEeccCCC Confidence 23455799999999999999999999998752 1 133578999999999999999999999999999976655 No 26 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=100.00 E-value=1.1e-57 Score=332.77 Aligned_cols=290 Identities=16% Similarity=0.177 Sum_probs=231.5 Q ss_pred Cc-ccCCCceEcchhHHHHHHHHHHhhchhhhh-cceeecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEeee Q lcl|Aclame:pro 1 MV-ALATGTFQLPKHLVPGVWQKAQGQSVLARL-SMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPR 78 (311) Q Consensus 1 ma-t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l-~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~ 78 (311) +. +.++||++||+++.++||+.+++.++++++ ++++++.++.+++|+.++++.+.|++||+.+|+++++|++++++++ T Consensus 127 ~~~~~~~gg~liP~~~~~~ii~~l~~~~~l~~~~~~~~~~~~g~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~ 206 (428) T protein:vir:10 127 ISTAAGSGGVLIPQNIHSEVIELLRDRTIVRKLGARSIPLPNGNMSLPRLAGGATASYTGENQDAKVSEARFDDVKLTAK 206 (428) T ss_pred hcccccCCccccchhHHHHHHHHHhhhchhhhhcceeeecCCcceEEEEEeCCcceeeeccCccccccccceeeEEeeeE Confidence 22 234688999999999999999999999999 6788998899999999999999999999999999999999999999 Q ss_pred eEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchH Q lcl|Aclame:pro 79 KVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPD 158 (311) Q Consensus 79 kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (311) |++++++||+|+++++ .++++++|.+.+++++++++|++||+|++ ++..|.|+.+.............. ..... T Consensus 207 k~~~~v~is~ell~ds---~~~l~~~i~~~l~~ai~~~~d~~~l~G~G--~~~~p~Gi~~~~~~~~~~~~~~~~-~~~~~ 280 (428) T protein:vir:10 207 TMIAMVPISNALIGRA---GFNVEQLVLQDILTAISVREDKAFMRDDG--TGDTPIGMKARATQWNRLLPWAAD-AAVNL 280 (428) T ss_pred EEEEeehhhHHHHhhh---hHHHHHHHHHHHHHHHHHHHHHHHhccCC--CCcccccccccccccccccccccc-ccccH Confidence 9999999999999644 56899999999999999999999999864 444555554432222222222111 11112 Q ss_pred HHHHHH------HHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccccccc Q lcl|Aclame:pro 159 LAVEAA------VGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTA 232 (311) Q Consensus 159 ~~i~~~------~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~ 232 (311) +.+... ...........++|+||+.++..|+++||++|+|+|++. ..++|+|+||++++++|.+... T Consensus 281 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~----~~g~l~G~pv~~~~~~p~~~~~--- 353 (428) T protein:vir:10 281 DTIDTYLDSIILMSMDGNSNMISSGWGMSNRTYMKLFGLRDGNGNKVYPEM----AQGMLKGYPIQRTSAIPANLGE--- 353 (428) T ss_pred HHHHHHHHHHHHhhhccccccccCEEEEcHHHHHHHHHhhccCCceeccCC----CCCeeeceeeEEeccccccccC--- Confidence 222211 222233344456899999999999999999999999654 3458999999999999976432 Q ss_pred cccccccccccceEEEeecceEEEEeecCceEEEeccCC---c-ccchhhhhcCcEEEEEEEEeccEEecccceEEEEec Q lcl|Aclame:pro 233 STGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGD---P-DGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDA 308 (311) Q Consensus 233 ~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~---~-~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~a 308 (311) ..+...++||||+.+.++.+.+++++++++.. . ...+++|++|++++|++.|||+++.||+||++++.. T Consensus 354 -------~~~~~~i~~gd~s~~~i~~~~~i~i~~~~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~t~~ 426 (428) T protein:vir:10 354 -------GGKESEIYFADFNDVVIGEDGNMKVDFSKEASYIDTDGKLVSAFSRNQSLIRVVTEHDIGFRHPEGLVLGTGV 426 (428) T ss_pred -------CCccceEEEEecceEEEEEecceEEEeecccccccccccccchhhcchhheeeeeeeCceeeccceEEEEecc Confidence 23455799999999999999999999988753 1 123578999999999999999999999999999988 Q ss_pred cc Q lcl|Aclame:pro 309 DE 310 (311) Q Consensus 309 a~ 310 (311) .- T Consensus 427 ~~ 428 (428) T protein:vir:10 427 LF 428 (428) T ss_pred CC Confidence 88 No 27 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=100.00 E-value=1e-57 Score=333.03 Aligned_cols=277 Identities=15% Similarity=0.053 Sum_probs=229.8 Q ss_pred CcccC-CCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEeecCccccccccceeEEEEeee Q lcl|Aclame:pro 1 MVALA-TGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPR 78 (311) Q Consensus 1 mat~~-~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~ 78 (311) -.+.+ .|++++|+.+...|++.++..++++++|+++++.+++ +.+|+.++.+.+.|++|++.+|+++++|++++++++ T Consensus 112 ~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~p~~~~~~~a~wv~E~~~~~~~~~~f~~i~~~~~ 191 (390) T protein:vir:62 112 DGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGATTFTTSDANPLDFTVITGRSSASIVGETAEIPESYPATAQRSMGGF 191 (390) T ss_pred cccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCcceeeecccccccccccceeeeEeeee Confidence 11333 3445555555555666677888899999999987654 899999999999999999999999999999999999 Q ss_pred eEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchH Q lcl|Aclame:pro 79 KVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPD 158 (311) Q Consensus 79 kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (311) |++++++||+||++ |+.++++++|+.+++++|++++|.+||+|+| .|.|+.+...........+. .....+ T Consensus 192 k~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~G-----~p~Gi~~~~~~~~~~~~~~~-~~~~~~ 262 (390) T protein:vir:62 192 KYGFASVVSYEFAT---DQVLDLVGFLVSDAGPAIGDAMGRHFITGTG-----QPRGILTDASPATATFLATD-TDSKVS 262 (390) T ss_pred eEEeehHHHHHHHh---hhhHHHHHHHHHHHHHHHHHHHHhhhhccCC-----ccccccccccccccceeccc-ccccch Confidence 99999999999996 4567899999999999999999999999964 23455444333333333322 234567 Q ss_pred HHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccccccccccccc Q lcl|Aclame:pro 159 LAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYR 238 (311) Q Consensus 159 ~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~ 238 (311) +++.+++..+........+|+||++++..|++|||++|+|+|++....+.+++|+|+||++++.+|.+ T Consensus 263 ~~l~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~g~~~~l~G~Pv~~~~~~p~~------------ 330 (390) T protein:vir:62 263 DALIDLFHEVPSAYRANAKYVVNDLRAAQMRKLKDANGQYLWQSGLTVGAPSLFNGKVVETDDGMPAD------------ 330 (390) T ss_pred HHHHHHHHhhhhhhhcCCEEEEchHHHHHHHHhhccCCCeeecCCcCCCccceecccceEEecCCCCc------------ Confidence 78888888887766566679999999999999999999999998888888899999999999988743 Q ss_pred cccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 239 TTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 239 ~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .++||||+.|.++.+.+++++.+.+. +|.+|++.||++.|+|+++++|+||++|+.+++| T Consensus 331 ------~i~~gd~s~~~i~~~~~~~v~~~~~~-------~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~~~~a 390 (390) T protein:vir:62 331 ------KILFADLSKYRVRFAGSLRVDRSVDA-------KFSTDQIVYRFLQRADGLLVDARGAKVLTVTPGA 390 (390) T ss_pred ------cEEEeeccceeEEeecceEEEeeccc-------cccCCcEEEEEEEEeCcEeechhheEEEEeecCC Confidence 57899999999999999999887653 5899999999999999999999999999999999 No 28 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=100.00 E-value=1.5e-57 Score=332.04 Aligned_cols=294 Identities=16% Similarity=0.148 Sum_probs=235.8 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhh-cceeecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEeeee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARL-SMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPRK 79 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l-~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k 79 (311) -.+...||++||+++.++|++.+++.++++++ ++.+++.++.+++|+.++++.+.|++|++.+|+++++|+++++.++| T Consensus 134 ~~t~~~gg~~vP~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~~i~~~~~k 213 (435) T protein:vir:14 134 TLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNGNITIPRLKGGAIVGYIGADTDIPTTQQQFDDLKLTAKK 213 (435) T ss_pred cCCcCCCccccchhHHHHHHHHHhhhchhhhhcceeeecCCCceEEEEEeCCcceeeeccCccccccccceeEEEeeeEE Confidence 12445688999999999999999999999998 67889988999999999999999999999999999999999999999 Q ss_pred EEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchHH Q lcl|Aclame:pro 80 VQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPDL 159 (311) Q Consensus 80 l~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (311) ++++++||+||++++.. ..+++++|..+++++|++++|++|++|+| .+..+.|+................+....+. T Consensus 214 ~~~~~~iS~ell~ds~~-~~~l~~~i~~~l~~ai~~~~d~a~l~G~G--~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~ 290 (435) T protein:vir:14 214 MAALVPIANDLIKYAGV-NPNVDQIVVGDLTAAIGAREDKAFIRDDG--TANTPKGLRFWALPSNVITASDASTLQKIET 290 (435) T ss_pred EEEeehhhHHHHHhhcc-CHHHHHHHHHHHHHHHHHHHHHHhhccCC--CCccccceeecccccceeccccccchhhHHH Confidence 99999999999976532 23589999999999999999999999854 3334555432211111111111222233445 Q ss_pred HHHHHHHHHhhc--CCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccccccccccccccccc Q lcl|Aclame:pro 160 AVEAAVGLVLGD--NLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVY 237 (311) Q Consensus 160 ~i~~~~~~~~~~--~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~ 237 (311) ++.+++..+... ++.+++|+|||.++..|+++||++|+|+|++. ..++|+|+||++++.+|.+.... T Consensus 291 ~~~~l~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~----~~g~l~G~Pv~~~~~~p~~~~~~------- 359 (435) T protein:vir:14 291 DLGKVILALENADANLTQPGWIMAPRTFRFLEGLRDGNGNKVYPEL----ANGMLKGYPVGKTTQVPINLGET------- 359 (435) T ss_pred HHHHHHHHhhhccccccCCEEEEcHHHHHHHHHhhccCCceeccCC----CCCeeecceeEeeccccccccCC------- Confidence 667777666544 44567899999999999999999999999643 34689999999999998764332 Q ss_pred ccccccceEEEeecceEEEEeecCceEEEeccCCc----ccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 238 RTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDP----DGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 238 ~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~----~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .....+++|||+.+.++++.+++++++++... ...+.+|++|++++|+++|+|+++++|+||++|+.++-- T Consensus 360 ---~~~~~i~~gd~s~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~ 434 (435) T protein:vir:14 360 ---GKESEIYFTDFGDVFIGEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLAGVAWG 434 (435) T ss_pred ---CccceEEEeecccEEEEEecccEEEEeccccccccccchhhhhhcChhheeeeeeeCceeecccceEEEecCCCC Confidence 33447999999999999999999999987531 133578999999999999999999999999999977655 No 29 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=100.00 E-value=3.3e-57 Score=330.26 Aligned_cols=281 Identities=19% Similarity=0.185 Sum_probs=237.8 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEeeeeE Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPRKV 80 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~kl 80 (311) ..+.++++.+||++++++|++.+++.++++++|++++++++.++||+.++.+.+.|++|++++|+++++|++++++++|+ T Consensus 29 ~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~ 108 (324) T protein:vir:93 29 VMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKL 108 (324) T ss_pred ccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceeeecCCccccccccceeEEEEEeEEE Confidence 23445667789999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchHHH Q lcl|Aclame:pro 81 QVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPDLA 160 (311) Q Consensus 81 ~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (311) +++++||+||++++ .++++++|++++++++++++|+++|+|++. +..+ .++........... .....+++ T Consensus 109 ~~~~~iS~ell~ds---~~~l~~~i~~~l~~aia~~~d~a~l~G~g~--~~~~----~~~~~~~~~~~~~~-~~~~~~~~ 178 (324) T protein:vir:93 109 GVILPVTKEFLNYT---YSQFFEEMKPMIAEAFYKKFDEAGILNQGN--NPFG----KSIAQSIEKTNKVI-KGDFTQDN 178 (324) T ss_pred EEeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHHhcCCCC--CCcC----ccccccccccceec-cccccHHH Confidence 99999999999755 478999999999999999999999998642 2222 23333322222222 23456889 Q ss_pred HHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccccccccccccccc Q lcl|Aclame:pro 161 VEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTT 240 (311) Q Consensus 161 i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~ 240 (311) +.+++.++..+++.+++|+|||+++..|+++||++|+|++.+ ..+++|+|+||++++..+ T Consensus 179 i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~G~~~~~~----~~~~~l~G~PVv~~~~~~---------------- 238 (324) T protein:vir:93 179 IIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLPVVNLKSSN---------------- 238 (324) T ss_pred HHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhCCCCCeeecC----CCCCcccceeeEeecCCC---------------- Confidence 999999999999999999999999999999999999999864 346789999998765433 Q ss_pred cccceEEEeecceEEEEeecCceEEEeccCC------c-ccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 241 NPNVKAIAGDFSAFRWGVQVSIPLELIEFGD------P-DGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 241 ~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~------~-~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .....+++|||+.+.++.+++++++++++.. . ...+++|++|++++|+++|+|+++.||+||++|+.|+.- T Consensus 239 ~~~~~i~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~v~~~~a~~~l~~a~~~ 316 (324) T protein:vir:93 239 LKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKR 316 (324) T ss_pred CCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEeccccc Confidence 3455799999999999999999999988753 1 234578999999999999999999999999999977666 No 30 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=100.00 E-value=2e-57 Score=331.42 Aligned_cols=284 Identities=22% Similarity=0.255 Sum_probs=235.9 Q ss_pred Cccc--CCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEeee Q lcl|Aclame:pro 1 MVAL--ATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPR 78 (311) Q Consensus 1 mat~--~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~ 78 (311) |+.. +.++.++|+++.++|++.+++.+++++++++++++++++++|+.+.++.+.|++|++++++++++|+++++++| T Consensus 10 ~~~~~t~~~~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~f~~v~l~~~ 89 (397) T protein:vir:23 10 IAQTKDTMFTGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMGATGIVIPHWTGDVSAQWIGEGDMKPITKGNMTKRDVHPA 89 (397) T ss_pred HhhccCCCCccccchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcCCcceEEecCCccccccccceeEEEEeeE Confidence 5522 33445677778999999999999999999999999989999999999999999999999999999999999999 Q ss_pred eEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchH Q lcl|Aclame:pro 79 KVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPD 158 (311) Q Consensus 79 kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (311) |++++++||+||++++ .++++++|++++++++++++|+++|+|++.+ .+..+ +......... ......+ T Consensus 90 k~~~~v~iS~ell~ds---~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~--~~~~~----~~~~~~~~~~--~~~~~~~ 158 (397) T protein:vir:23 90 KIATIFVASAETVRAN---PANYLGTMRTKVATAIAMAFDNAALHGTNAP--SAFQG----YLDQSNKTQS--ISPNAYQ 158 (397) T ss_pred EEEEeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHHhhcccCC--ccccc----ccccccceee--ecccchh Confidence 9999999999999755 4789999999999999999999999996542 22222 2222222111 1223345 Q ss_pred HHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCC-----ceecceeEEeeccccccccccccc Q lcl|Aclame:pro 159 LAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDV-----ASFAGLNAAVSDTVRGGPEAVTAS 233 (311) Q Consensus 159 ~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~-----~~l~G~pv~~~~~~~~~~~~~~~~ 233 (311) +++.++...+...++..++|+||++++..|+++||++|||+|.+....+.+ ++++|+||++++.+|.+ T Consensus 159 ~~~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~tl~G~Pv~~s~~~~~g------- 231 (397) T protein:vir:23 159 GLGVSGLTKLVTDGKKWTHTLLDDTVEPVLNGSVDANGRPLFVESTYESLTTPFREGRILGRPTILSDHVAEG------- 231 (397) T ss_pred HHHHHHHHhhhhcccCCCEEEEcHHHHHHHHHhhccCCceeecccccccccccccCceeeeeeEEEeCCCCCC------- Confidence 666777778888888888999999999999999999999999877665543 58999999999998743 Q ss_pred ccccccccccceEEEeecceEEEEeecCceEEEeccCC-------cccchhhhhcCcEEEEEEEEeccEEecccceEEEE Q lcl|Aclame:pro 234 TGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGD-------PDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVR 306 (311) Q Consensus 234 ~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~-------~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~ 306 (311) ...+++|||+.+.++++++++++++++.. ....+++|++|++++|+++|+|++++||+||++++ T Consensus 232 ---------~~~~~~gDfs~~~i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~ 302 (397) T protein:vir:23 232 ---------DVVGYAGDFSQIIWGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLT 302 (397) T ss_pred ---------ceEEEEeecceEEEEEEeceEEEEeeeeeeeeccccccceeeeeeccceeEEEEeeeccceecccceEEEe Confidence 33578999999999999999999887653 23456889999999999999999999999999999 Q ss_pred ecccC Q lcl|Aclame:pro 307 DADES 311 (311) Q Consensus 307 ~aa~~ 311 (311) .++.+ T Consensus 303 ~~~~~ 307 (397) T protein:vir:23 303 FDPVL 307 (397) T ss_pred ecccc Confidence 87766 No 31 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=100.00 E-value=6.3e-57 Score=328.69 Aligned_cols=281 Identities=18% Similarity=0.181 Sum_probs=238.7 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEeeeeE Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPRKV 80 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~kl 80 (311) .++.+.++.+||++++++|++.+++.++++++|++++++++++++|+.++++.+.|++|++++|+++++|++++++++|+ T Consensus 29 ~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~ 108 (324) T protein:vir:10 29 VMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKL 108 (324) T ss_pred eeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCcceeEeccCccccccccceeEEEEeeEEE Confidence 33445566799999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchHHH Q lcl|Aclame:pro 81 QVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPDLA 160 (311) Q Consensus 81 ~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (311) +++++||+|+++++ .++++++|.+++++++++++|+++|+|++.+ ..+. ++........... .....+++ T Consensus 109 ~~~~~iS~ell~ds---~~~l~~~i~~~l~~ai~~~~d~a~l~G~g~~--~~~~----~i~~~~~~~~~~~-~~~~t~~~ 178 (324) T protein:vir:10 109 GVILPVTKEFLNYT---YSQFFEEMKPMIAEAFYKKFDEAGILNQGNN--PFGK----SIAQSIEKTNKVI-KGDFTQDN 178 (324) T ss_pred EEeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHhhhcCCCC--ccCc----cccccccccceec-cccCCHHH Confidence 99999999999655 4689999999999999999999999996532 2222 3333333222222 23456889 Q ss_pred HHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccccccccccccccc Q lcl|Aclame:pro 161 VEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTT 240 (311) Q Consensus 161 i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~ 240 (311) +.+++..+...++.+++|+|||.++..|+++||++|+|+|.+ ..+++|+|+||++++..+ T Consensus 179 i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~g~~~~~~----~~~~~l~G~PV~~~~~~~---------------- 238 (324) T protein:vir:10 179 IIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDTLDGLPVVNLKSSN---------------- 238 (324) T ss_pred HHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCceeecC----CCCccccceeEEeecCCC---------------- Confidence 999999999999999999999999999999999999999854 346789999998775543 Q ss_pred cccceEEEeecceEEEEeecCceEEEeccCC------c-ccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 241 NPNVKAIAGDFSAFRWGVQVSIPLELIEFGD------P-DGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 241 ~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~------~-~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .+...+++|||+.+.++.+++++++++++.. . ...+++|++|++++|+++|+|+++.+|+||++|+.++.. T Consensus 239 ~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a~~~ 316 (324) T protein:vir:10 239 LKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKK 316 (324) T ss_pred CCcceEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccCC Confidence 3455799999999999999999999988753 1 233578999999999999999999999999999988777 No 32 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=100.00 E-value=6.4e-57 Score=328.65 Aligned_cols=281 Identities=19% Similarity=0.179 Sum_probs=238.6 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEeeeeE Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPRKV 80 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~kl 80 (311) .++.+.++.+||++++++|++.+++.++|+++|++++++++++++|+.++++.+.|++|++++|+++++|++++++++|+ T Consensus 29 ~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~ 108 (324) T protein:vir:99 29 VMMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKL 108 (324) T ss_pred eeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceeEeccCccccccccceeEEEEeeEEE Confidence 33445566789999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchHHH Q lcl|Aclame:pro 81 QVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPDLA 160 (311) Q Consensus 81 ~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (311) ++++++|+|+++++ .++++++|.+++++++++++|+++|+|++.+ ..+ .++.......... ......+++ T Consensus 109 ~~~~~iS~ell~ds---~~~l~~~i~~~l~~ai~~~~d~~~l~G~g~~--~~~----~~~~~~~~~~~~~-~~~~~~~~~ 178 (324) T protein:vir:99 109 GVILPVTKEFLNYT---YSQFFEEMKPMIAEAFYKKFDEAGILNQGNN--PFG----KSIAQSIEKTNKV-IKGDFTQDN 178 (324) T ss_pred EEeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHhhhcCCCC--ccC----cccccccccccee-ccccCCHHH Confidence 99999999999755 4679999999999999999999999986532 222 2333333222222 223456889 Q ss_pred HHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccccccccccccccc Q lcl|Aclame:pro 161 VEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTT 240 (311) Q Consensus 161 i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~ 240 (311) +.+++..+...++.+++|+|||.+|..|+++||++|+|+|.+ ..+++|+|+||++++.++ T Consensus 179 i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~g~~~~~~----~~~~~l~G~PVv~~~~~~---------------- 238 (324) T protein:vir:99 179 IIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDTLDGLPVVNLKSSN---------------- 238 (324) T ss_pred HHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhcCCCceeecC----CCCccccceeEEeecCCC---------------- Confidence 999999999999999999999999999999999999999853 346789999999876654 Q ss_pred cccceEEEeecceEEEEeecCceEEEeccCC-------cccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 241 NPNVKAIAGDFSAFRWGVQVSIPLELIEFGD-------PDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 241 ~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~-------~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .+...+++|||+.+.++.+++++++++++.. ....+++|++|++++|+++|+|+++.||+||++|+.++.. T Consensus 239 ~~~~~~i~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a~~~ 316 (324) T protein:vir:99 239 LKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKK 316 (324) T ss_pred CCcceEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccCC Confidence 2345789999999999999999999988753 1233578999999999999999999999999999988877 No 33 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=100.00 E-value=7e-57 Score=328.43 Aligned_cols=281 Identities=19% Similarity=0.190 Sum_probs=237.6 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEeeeeE Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPRKV 80 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~kl 80 (311) .++.++++.+||++++++|++.++++++++++++++++++++++||+.++.+.+.|++|++++|+++++|++++++++|+ T Consensus 29 ~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~ 108 (324) T protein:vir:96 29 VMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKL 108 (324) T ss_pred ccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceeeecCCccccccccceeEEEEEeEEE Confidence 22334566789999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchHHH Q lcl|Aclame:pro 81 QVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPDLA 160 (311) Q Consensus 81 ~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (311) +++++||+||++++ ..+++++|.+++++++++++|+++|+|++. +..+ .++........... .....+++ T Consensus 109 ~~~~~is~ell~ds---~~~l~~~i~~~l~~aia~~~d~~~l~G~g~--~~~~----~~~~~~~~~~~~~~-~~~~~~~~ 178 (324) T protein:vir:96 109 GVILPVTKEFLNYT---YSQFFEEMKPMIAEAFYKKFDEAGILNQGN--NPFG----KSIAQSIKKTNKVI-KGDFTQDN 178 (324) T ss_pred EEeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHhhhcCCC--CCcC----ccccccccccceec-ccccchHH Confidence 99999999999755 468999999999999999999999999642 2222 23333333222222 23456889 Q ss_pred HHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccccccccccccccc Q lcl|Aclame:pro 161 VEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTT 240 (311) Q Consensus 161 i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~ 240 (311) +.+++.++...++.+++|+|||+++..|+++||++|+|++.+ ..+++|+|+||++++..+ T Consensus 179 i~~~~~~i~~~~~~~~~~i~n~~~~~~L~~lkd~~G~~~~~~----~~~~~l~G~PV~~~~~~~---------------- 238 (324) T protein:vir:96 179 IIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLPVVNLKSSN---------------- 238 (324) T ss_pred HHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhCCCCCeeecC----CCCCcccceeeEeecCCC---------------- Confidence 999999999999999999999999999999999999999853 346789999998765443 Q ss_pred cccceEEEeecceEEEEeecCceEEEeccCC------cc-cchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 241 NPNVKAIAGDFSAFRWGVQVSIPLELIEFGD------PD-GLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 241 ~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~------~~-~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .+...+++|||+.+.++.+++++++++++.. .+ ..+++|++|++++|+++|+|+++++|+||++|+.|... T Consensus 239 ~~~~~~~~gd~s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~n~v~~r~~~r~d~~v~~~~a~~~l~~a~~~ 316 (324) T protein:vir:96 239 LKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKR 316 (324) T ss_pred CCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEeccccc Confidence 3345799999999999999999999988753 22 34678999999999999999999999999999987766 No 34 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=100.00 E-value=7.3e-57 Score=328.35 Aligned_cols=291 Identities=14% Similarity=0.109 Sum_probs=241.9 Q ss_pred CcccCCCceEcchhHHHHHH-HHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEeeee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVW-QKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPRK 79 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii-~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k 79 (311) =.+.++||++||+++..+|| +.+++.++++++++++++ ++.+.+|+.++++.+.|++||+.+|+++++|++++++++| T Consensus 252 ~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~~~~~-~g~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k 330 (543) T protein:vir:81 252 GLTKADGGYLVPFQLDPTVIITSNGSLNDIRRFARQVVA-TGDVWHGVSSAAVQWSWDAEFEEVSDDSPEFGQPEIPVKK 330 (543) T ss_pred ccccccCcccCchhhhhHHHHHHHhhhchhhhhcccccC-CcceEEEEecCCcceeecccCccccccccccceeeeeeee Confidence 12567889999999998876 557788999999997765 5789999999999999999999999999999999999999 Q ss_pred EEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchHH Q lcl|Aclame:pro 80 VQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPDL 159 (311) Q Consensus 80 l~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (311) ++++++||+|+++++ +++.++|.+.|++++++++|.+||+|+| ++..+.|+.+........+ .+.......++ T Consensus 331 ~~~~~~is~ell~d~----~~~~~~i~~~l~~~~~~~~d~ail~G~G--t~~~p~Gi~~~~~~~~~~~-~~~~~~~~~~~ 403 (543) T protein:vir:81 331 AQGFVPISIEALQDE----ANVTETVALLFAEGKDELEAVTLTTGTG--QGNQPTGIVTALAGTAAEI-APVTAETFALA 403 (543) T ss_pred eEeeehhhHHHHhcc----HHHHHHHHHHHHHHHHHHHHHHHhccCC--CCcccccchhhcccccccc-cccccccccHH Confidence 999999999999643 4799999999999999999999999964 3445556544322222222 22333446788 Q ss_pred HHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccccccccccccccccccc Q lcl|Aclame:pro 160 AVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRT 239 (311) Q Consensus 160 ~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~ 239 (311) ++.+++..+...+...++|+|||.++..|+++||++|+|+|.+... +.+++|+|+||++++.||.+.... . T Consensus 404 ~~~~~~~~l~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~-g~~~~l~G~pv~~~~~~~~~~~~~--------~ 474 (543) T protein:vir:81 404 DVYAVYEQLAARHRRQGAWLANNLIYNKIRQFDTQGGAGLWTTIGN-GEPSQLLGRPVGEAEAMDANWNTS--------A 474 (543) T ss_pred HHHHHHHhhhccccCCcEEEEcHHHHHHHHHhhcCCCceeccCcCC-CCCccccceeeEEecccccccccc--------c Confidence 8999999888777777789999999999999999999999987554 457899999999999998764322 2 Q ss_pred ccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 240 TNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 240 ~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ..+...++||||+.|.++.+.+++++++++...+ ..|.+|++.||++.|+|+++++|+||++|+.+++| T Consensus 475 ~~~~~~i~~gd~~~~~i~~~~~~~i~~~~~~~~~---~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~~~a 543 (543) T protein:vir:81 475 SADNFVLLYGNFQNYVIADRIGMTVEFIPHLFGT---NRRPNGSRGWFAYYRMGADVVNPNAFRLLNVETAS 543 (543) T ss_pred cCCcceEEEeeccceeEEeecccEEEEecccccc---chhhcCceEEEEEEeeccEeecccceEEEEecccC Confidence 3345579999999999999999999998876533 24789999999999999999999999999999999 No 35 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=100.00 E-value=3.1e-56 Score=324.90 Aligned_cols=279 Identities=14% Similarity=0.080 Sum_probs=229.3 Q ss_pred Cc-ccCCCceEcchhHHHHHHHHHH-hhchhhhhcceeecCCC-ceEEEEEeCCceeEEeecCccccccccceeEEEEee Q lcl|Aclame:pro 1 MV-ALATGTFQLPKHLVPGVWQKAQ-GQSVLARLSMAEPQEFG-EQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIP 77 (311) Q Consensus 1 ma-t~~~g~~~vP~~~~~~ii~~~~-~~s~l~~l~~~~~~~~~-~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~ 77 (311) .. +.+.+|.++|+++..++|..++ ..++++++++++++.++ .+.+|+.++.+.+.|++|++++|+++++|+++++++ T Consensus 111 ~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~ 190 (392) T protein:vir:13 111 RDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGASTFTTSDANPMDFTVITGRATAGIVGETAEIPESYPATTQRSMGG 190 (392) T ss_pred hcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCcceeeecccccccccccceeeEEeee Confidence 22 3344455666666677666554 55678888898888655 489999999999999999999999999999999999 Q ss_pred eeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccch Q lcl|Aclame:pro 78 RKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATP 157 (311) Q Consensus 78 ~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (311) +|++++++||+|||+++ .++++++|..++++++++++|.+||+|+|. + .|.|+.+.......... ........ T Consensus 191 ~k~~~~~~iS~ell~ds---~~~l~~~i~~~l~~~i~~~~d~~~l~G~Gt--~-~p~Gil~~~~~~~~~~~-~~~~~~~~ 263 (392) T protein:vir:13 191 FKYGFASVVSYEFATDQ---VLDLVGFLVSDAGPAIGDAMGRHFLTGTGT--G-QPRGILTDATGANAAFG-EADADSKV 263 (392) T ss_pred eeEEeeehhHHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHHhcccCC--c-ccccccccccccccccc-cccccccc Confidence 99999999999999644 578999999999999999999999999653 2 34454433222221111 22234455 Q ss_pred HHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccccccccccccccccc Q lcl|Aclame:pro 158 DLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVY 237 (311) Q Consensus 158 ~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~ 237 (311) ++++.+++..+.......++|+||+.++..|+++||++|+|+|.+....+.+++|+|+||++++.+|.+ T Consensus 264 ~d~l~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~g~~~~l~G~Pv~~~~~~~~~----------- 332 (392) T protein:vir:13 264 SDALIDLFHEVPSAYRKNAKFVVNDLRAAQMRKLKDANGQYLWQSALTVGAPDTFNGKVVETDDGMPAD----------- 332 (392) T ss_pred HHHHHHHHHhhhhhhhcCCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCCceecceeeEEcCCCCCC----------- Confidence 788888888887766667789999999999999999999999999888888999999999999998743 Q ss_pred ccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 238 RTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 238 ~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .++||||++|.++.+.+++++.+.+. +|.+|++.||++.|+|++++||+||++++.+++| T Consensus 333 -------~i~~Gdf~~~~i~~~~~~~i~~~~~~-------~~~~~~~~~r~~~r~d~~~~~~~A~~~~~~~~aa 392 (392) T protein:vir:13 333 -------KVLFADLSKYRVRFAGSLRVDRSVDA-------KFSTDQIVYRFLQRADGLLVDARGAKVLTVTPAA 392 (392) T ss_pred -------cEEEeeccceeEEeecceEEEeeccc-------cccCCcEEEEEEEEeccEEecccceEEEEeeccC Confidence 58899999999999999999877553 4899999999999999999999999999999999 No 36 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=100.00 E-value=3e-56 Score=324.95 Aligned_cols=281 Identities=15% Similarity=0.108 Sum_probs=220.8 Q ss_pred Cc--ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC-CceeEEeecCccccccccceeEEEEee Q lcl|Aclame:pro 1 MV--ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTA-PPRGEVVGEGAQKSESTATFAPVTAIP 77 (311) Q Consensus 1 ma--t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~~~~~~~~~v~l~~ 77 (311) |. ++++||++||+++..+||+.+++.+++++++++++++++.++||+.++ .+.+.||+|++.+|+++++|+++++.+ T Consensus 151 ~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~s~~~f~~i~~~~ 230 (497) T protein:vir:10 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQV 230 (497) T ss_pred hhcccCcccccccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCcceeeccCcccccccccceeeEeee Confidence 33 556788999999999999999999999999999999999999999876 468999999999999999999999999 Q ss_pred eeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeecc------ Q lcl|Aclame:pro 78 RKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTT------ 151 (311) Q Consensus 78 ~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~------ 151 (311) ||++++++||+|||++ + .+++++|.++++++|++++|.+||+|+|.+ .+.|+.+.....+....... T Consensus 231 ~k~a~~~~iS~ell~d---~-~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~---~p~Gil~~~~~~~~~~~~~~~~~~~~ 303 (497) T protein:vir:10 231 GKVANALTITDEGLRD---A-PELFNFVQGRLLEGIQRKEEVQLLAGGGYP---GVNGLLQRSTGFTASSASSLFGATSA 303 (497) T ss_pred eeeEeecHhHHHHHHh---H-HHHHHHHHHHHHHHHHHHHHHHhhcCCCcc---cccccccccccccccccccchhhhhh Confidence 9999999999999964 3 358999999999999999999999996532 23443321111000000000 Q ss_pred ---------------------------------------------ccccchHHHHHHHHHHHh-hcCCCccEEEEcHHHH Q lcl|Aclame:pro 152 ---------------------------------------------GTSATPDLAVEAAVGLVL-GDNLSPDGVALDNTFS 185 (311) Q Consensus 152 ---------------------------------------------~~~~~~~~~i~~~~~~~~-~~~~~~~~~v~n~~~~ 185 (311) .+.......+..++..+. ...+.+++|+|||.+| T Consensus 304 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~ 383 (497) T protein:vir:10 304 TVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDW 383 (497) T ss_pred hhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHH Confidence 000111122333333333 3445567899999999 Q ss_pred HHHHHhhccCCceeecccccc------CCCceecceeEEeecccccccccccccccccccccccceEEEeecce--EEEE Q lcl|Aclame:pro 186 FMLATQRDSQGRKLYPELGFG------TDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSA--FRWG 257 (311) Q Consensus 186 ~~l~~lkd~~g~~~~~~~~~~------~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~--~~~~ 257 (311) ..|+++||++|+|+|++.... ..+++|+|+||++++.||.+ .++||||+. +.+. T Consensus 384 ~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~------------------~~~~Gd~~~~~~~i~ 445 (497) T protein:vir:10 384 ELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG------------------TILVGHFAPSVIQTA 445 (497) T ss_pred HHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCC------------------ceEEeecccceEEEE Confidence 999999999999999765432 23458999999999999743 468999986 4567 Q ss_pred eecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 258 VQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 258 ~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ++.+++++++++.. ++|++|+++||++.|+|+.|++|+||++|+.++.+ T Consensus 446 ~r~~~~v~~~~~~~-----~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~~ 494 (497) T protein:vir:10 446 RREGVTMQMTNSNG-----TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA 494 (497) T ss_pred EecccEEEeecccc-----hhhhcCcEEEEEEEeecceeeccccEEEEEecCCc Confidence 89999999987642 35999999999999999999999999999988777 No 37 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=100.00 E-value=3e-56 Score=324.95 Aligned_cols=281 Identities=15% Similarity=0.108 Sum_probs=220.8 Q ss_pred Cc--ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC-CceeEEeecCccccccccceeEEEEee Q lcl|Aclame:pro 1 MV--ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTA-PPRGEVVGEGAQKSESTATFAPVTAIP 77 (311) Q Consensus 1 ma--t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~~~~~~~~~v~l~~ 77 (311) |. ++++||++||+++..+||+.+++.+++++++++++++++.++||+.++ .+.+.||+|++.+|+++++|+++++.+ T Consensus 151 ~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~s~~~f~~i~~~~ 230 (497) T protein:vir:78 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQV 230 (497) T ss_pred hhcccCcccccccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCcceeeccCcccccccccceeeEeee Confidence 33 556788999999999999999999999999999999999999999876 468999999999999999999999999 Q ss_pred eeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeecc------ Q lcl|Aclame:pro 78 RKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTT------ 151 (311) Q Consensus 78 ~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~------ 151 (311) ||++++++||+|||++ + .+++++|.++++++|++++|.+||+|+|.+ .+.|+.+.....+....... T Consensus 231 ~k~a~~~~iS~ell~d---~-~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~---~p~Gil~~~~~~~~~~~~~~~~~~~~ 303 (497) T protein:vir:78 231 GKVANALTITDEGLRD---A-PELFNFVQGRLLEGIQRKEEVQLLAGGGYP---GVNGLLQRSTGFTASSASSLFGATSA 303 (497) T ss_pred eeeEeecHhHHHHHHh---H-HHHHHHHHHHHHHHHHHHHHHHhhcCCCcc---cccccccccccccccccccchhhhhh Confidence 9999999999999964 3 358999999999999999999999996532 23443321111000000000 Q ss_pred ---------------------------------------------ccccchHHHHHHHHHHHh-hcCCCccEEEEcHHHH Q lcl|Aclame:pro 152 ---------------------------------------------GTSATPDLAVEAAVGLVL-GDNLSPDGVALDNTFS 185 (311) Q Consensus 152 ---------------------------------------------~~~~~~~~~i~~~~~~~~-~~~~~~~~~v~n~~~~ 185 (311) .+.......+..++..+. ...+.+++|+|||.+| T Consensus 304 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~ 383 (497) T protein:vir:78 304 TVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDW 383 (497) T ss_pred hhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHH Confidence 000111122333333333 3445567899999999 Q ss_pred HHHHHhhccCCceeecccccc------CCCceecceeEEeecccccccccccccccccccccccceEEEeecce--EEEE Q lcl|Aclame:pro 186 FMLATQRDSQGRKLYPELGFG------TDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSA--FRWG 257 (311) Q Consensus 186 ~~l~~lkd~~g~~~~~~~~~~------~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~--~~~~ 257 (311) ..|+++||++|+|+|++.... ..+++|+|+||++++.||.+ .++||||+. +.+. T Consensus 384 ~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~------------------~~~~Gd~~~~~~~i~ 445 (497) T protein:vir:78 384 ELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG------------------TILVGHFAPSVIQTA 445 (497) T ss_pred HHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCC------------------ceEEeecccceEEEE Confidence 999999999999999765432 23458999999999999743 468999986 4567 Q ss_pred eecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 258 VQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 258 ~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ++.+++++++++.. ++|++|+++||++.|+|+.|++|+||++|+.++.+ T Consensus 446 ~r~~~~v~~~~~~~-----~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~~ 494 (497) T protein:vir:78 446 RREGVTMQMTNSNG-----TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA 494 (497) T ss_pred EecccEEEeecccc-----hhhhcCcEEEEEEEeecceeeccccEEEEEecCCc Confidence 89999999987642 35999999999999999999999999999988777 No 38 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=100.00 E-value=1.9e-55 Score=320.61 Aligned_cols=279 Identities=18% Similarity=0.155 Sum_probs=235.2 Q ss_pred Cc--ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEeecCccccccccceeEEEEee Q lcl|Aclame:pro 1 MV--ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIP 77 (311) Q Consensus 1 ma--t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~ 77 (311) |. ++++++.+||++++++|++.+++.++++++|++++++++. ..+|+..+++.+.|++|++++|+++++|+++++++ T Consensus 9 ~~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~ 88 (297) T protein:vir:95 9 ENVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTDGISAYWVNETEKIKTDKPEVVPVTLKA 88 (297) T ss_pred ccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcCCceeEEeecCccccccccceeEEEEee Confidence 22 3456788999999999999999999999999999997654 67888888999999999999999999999999999 Q ss_pred eeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccch Q lcl|Aclame:pro 78 RKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATP 157 (311) Q Consensus 78 ~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (311) +|++++++||+|+++++ .++++++|++++++++++++|+++|+|++... +.++ .......... ...... T Consensus 89 ~k~~~~~~is~ell~ds---~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~---~~gi----~~~~~~~~~~-~~~~~t 157 (297) T protein:vir:95 89 HKLGIILVTSREALNYT---WKKFFEDMKPQIVEAFYKKIDEAGLLGHDTPF---ANSV----AKAAKDANKV-IGGPIN 157 (297) T ss_pred EEEEEeehhhHHHHhcC---HHHHHHHHHHHHHHHHHHHHHHHHhcccCCcc---cccc----ccccccccee-cccccC Confidence 99999999999999655 47899999999999999999999999976432 2333 2222222222 223456 Q ss_pred HHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccccccccccccccccc Q lcl|Aclame:pro 158 DLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVY 237 (311) Q Consensus 158 ~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~ 237 (311) ++++.+++.++...++.+++|+|||.++.+|++++|++|+|+|.+ .+++++|+||+.+...+ T Consensus 158 ~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~G~~i~~~-----~~~~l~G~Pv~~~~~~~------------- 219 (297) T protein:vir:95 158 YDNILKLQDALYDADVEPNAFVSKIQNRSALREARDGNKVSIYDK-----AANTIDGITTVDLKSAR------------- 219 (297) T ss_pred HHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCceeecC-----CCCcccceeeEeecCCC------------- Confidence 889999999999999999999999999999999999999999964 34789999998654433 Q ss_pred ccccccceEEEeecceEEEEeecCceEEEeccCC------cc-cchhhhhcCcEEEEEEEEeccEEecccceEEEEeccc Q lcl|Aclame:pro 238 RTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGD------PD-GLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADE 310 (311) Q Consensus 238 ~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~------~~-~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~ 310 (311) .++..+++|||+.+.++.+++++++++++.. .+ ..+++|++|++++|+++|+|+++++|+||++||.|+- T Consensus 220 ---~~~~~~~~gd~s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~at~ 296 (297) T protein:vir:95 220 ---FEKGDLLAGDFDNLIYGVPYNITYKISEEGQISTITNADGTPINLFEQEMIAIRATMDIAVMITKTDAFAKLTPAER 296 (297) T ss_pred ---CCCceEEEEecccEEEEEecCeEEEEeeccccccccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEeecCC Confidence 3455799999999999999999999988753 12 3357899999999999999999999999999998887 Q ss_pred C Q lcl|Aclame:pro 311 S 311 (311) Q Consensus 311 ~ 311 (311) - T Consensus 297 ~ 297 (297) T protein:vir:95 297 V 297 (297) T ss_pred C Confidence 7 No 39 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=100.00 E-value=1.5e-55 Score=321.16 Aligned_cols=277 Identities=17% Similarity=0.150 Sum_probs=235.9 Q ss_pred Cc-ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC-CceeEEeecCccccccccceeEEEEeee Q lcl|Aclame:pro 1 MV-ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTA-PPRGEVVGEGAQKSESTATFAPVTAIPR 78 (311) Q Consensus 1 ma-t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~~~~~~~~~v~l~~~ 78 (311) +- +++.||++||++++++|++.+++.++|++++++++++++++.+|+..+ ++.+.|++|++++|+++++|+++++.++ T Consensus 136 ~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~ 215 (418) T protein:vir:10 136 VGSGVSGSNSLVVADRQAGIIAPPQRKMTIRDLLMPGQTSSSSIEYTVETGFTNNAAAVAEGAQKPTSDLKFNLKNQPVR 215 (418) T ss_pred ccCCCCCCccccchhHHHHHHHHHhhhhhHHhhcceeeccCCceeEEEEecCCCceeeeccCccccccccceeeEEEeee Confidence 22 456678999999999999999999999999999999988899999876 6789999999999999999999999999 Q ss_pred eEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCcccccccccccccccccee-eccccccch Q lcl|Aclame:pro 79 KVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVE-LTTGTSATP 157 (311) Q Consensus 79 kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 157 (311) |++++++||+||++++ .+++++|.+++++++++++|.++|+|++. +..+.|+ ...+.... ..+.+.... T Consensus 216 k~~~~~~is~ell~ds----~~l~~~i~~~l~~a~~~~~d~a~l~G~g~--~~~p~Gi----~~~~~~~~~~~~~~~~~~ 285 (418) T protein:vir:10 216 TIAHLFKASRQILDDA----PALQSYIDGRARYGLQLTEEGQILKGDGT--GANILGI----LPQASAFMPSITLANATP 285 (418) T ss_pred eEEEeehhhHHHHHhH----HHHHHHHHHHHHHHHHHHHHHHHhccCCC--Ccccccc----cccccccccccccccccc Confidence 9999999999999643 36999999999999999999999999653 3334443 33333222 223334456 Q ss_pred HHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccccccccccccccccc Q lcl|Aclame:pro 158 DLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVY 237 (311) Q Consensus 158 ~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~ 237 (311) ++++..++..+...++.+++|+|||.++..|+++||++|+|+|++ +..+.+++|+|+||++++.||.+ T Consensus 286 ~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~-~~~~~~~~l~G~pV~~~~~~p~~----------- 353 (418) T protein:vir:10 286 IDKIRLALLQAVLAEFPATGIVLNPIDWASIELTKDSQGRYIVGN-PVNGTTPRLWNLPVVETQAMTAN----------- 353 (418) T ss_pred HHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccc-cccCCCceecceeeEEcCCCCCC----------- Confidence 778888888888888888899999999999999999999999965 45667889999999999998854 Q ss_pred ccccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 238 RTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 238 ~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .+++|||++ +.+..+.+++++++++.. .+|++|++.||++.|+|+++++|+||++++.++.+ T Consensus 354 -------~~~~gd~s~~~~~~~~~~~~i~~~~~~~-----~~f~~~~~~~r~~~~~d~~~~~~~a~~~~~~~~~~ 416 (418) T protein:vir:10 354 -------EFLVGAFSMAAQIFDRMEIEVLLSTENV-----DDFEKNMVSIRAEERLALAVYRPESFVTGALVEQA 416 (418) T ss_pred -------cEEEeeccceEEEEEecceEEEEecccc-----hhhhcCceEEEEEEeeccEEecccceEEEEeccCC Confidence 578999996 667889999999887643 45999999999999999999999999999988888 No 40 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=100.00 E-value=1.3e-55 Score=321.51 Aligned_cols=288 Identities=19% Similarity=0.159 Sum_probs=227.7 Q ss_pred Cc--ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecC----CCceEEEEEeCCceeEEeecCccccccccceeEEE Q lcl|Aclame:pro 1 MV--ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQE----FGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVT 74 (311) Q Consensus 1 ma--t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~----~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~ 74 (311) |. +.+.|++++|+++.++||+.+++.+++++++....+. .+++++|+.++++.++||+|++.+|+++++|++++ T Consensus 338 ~~~~~~~~Gg~~vp~~~~~~ii~~l~~~svv~~l~~~~~~~~~~~~~~~~ip~~t~~~~a~wv~Eg~~~~~s~~~f~~v~ 417 (645) T protein:vir:93 338 TTTDPQWAGSLSEYQEYAQDFIDYLRPQTIIGRFGQGGIPALRQVPFNIRVHAQVSGGAAGWVGEGKTKPLTKFDFESIT 417 (645) T ss_pred ccccccccCCccCchhhHHHHHHhhhhhhhHHhhccccccccccccCceeeeeeecCcceEEeccCccccccccceeEEE Confidence 22 2234889999999999999999999999998654322 24689999999999999999999999999999999 Q ss_pred EeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccc Q lcl|Aclame:pro 75 AIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTS 154 (311) Q Consensus 75 l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 154 (311) +++||+++++++|+|||+++ .++++++|+++++++|++++|.+||+|++.+. ....|.++..+.... ... T Consensus 418 l~~~kla~~~~iS~ell~ds---~~~~~~~i~~~l~~aia~~~d~a~l~g~g~~~---~~~~p~gi~~~~~~~----~~~ 487 (645) T protein:vir:93 418 FSHAKVSAIAVLTEELIRFS---SPAADALVRNALAEAVVARLDTDFVDPKKAAV---ADVSPASITHDVKGT----ASS 487 (645) T ss_pred EeeEEEEEeehhHHHHHhhc---hHHHHHHHHHHHHHHHHHHHHHHhhcCCCccc---CCccccceecccccc----ccc Confidence 99999999999999999655 46799999999999999999999999865321 122344444332222 122 Q ss_pred cchHHHHHHHHHHHhhcCCC--ccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccccccc Q lcl|Aclame:pro 155 ATPDLAVEAAVGLVLGDNLS--PDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTA 232 (311) Q Consensus 155 ~~~~~~i~~~~~~~~~~~~~--~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~ 232 (311) ...+.++..++..+..++.. .++|+|||.++..|+++||++|+|+|++. ....++|+|+||++++++|.+. T Consensus 488 ~~~~~d~~~~~~~~~~a~~~~~~a~~vmn~~~~~~L~~lkd~~G~~~~~~~--~~~~~tL~G~PV~~s~~vp~~~----- 560 (645) T protein:vir:93 488 GNPDADAEAAFGQFVAANLQPTGAVWLMSSTNALALSMRKNALGQKEYPDM--TLLGGSFQGLPVIVSQYVGDQL----- 560 (645) T ss_pred cchHHHHHHHHHHHHhcCCCccccEEEEcHHHHHHHHhccccCCceeecCC--CCCCceeeceeeEEeccCCcce----- Confidence 34556777888777666554 35799999999999999999999999653 3345799999999999998642 Q ss_pred cccccccccccceEEEeecceEEEEeecCceEEEeccCCcc-------cchhhhhcCcEEEEEEEEeccEEecccceEEE Q lcl|Aclame:pro 233 STGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPD-------GLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVV 305 (311) Q Consensus 233 ~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~-------~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l 305 (311) ..++...+++|++..+.+..+++.++++..+.+.+ ..+++|++|++++|+++|+||+++||+||++| T Consensus 561 ------~~gd~s~~~ig~~~~v~i~~s~~a~~~~~~~~~~~~~~~~~~~~v~lf~~d~vaira~~r~d~~~~~p~a~~~l 634 (645) T protein:vir:93 561 ------VLVNAPDIYLADDGGVAVDMSREASLEMQSEPTGDSTTPSPVELVSMFQTGSVAIRAERWINWRRRRTAAVAVI 634 (645) T ss_pred ------eEeccccEEEEEecceEEEeecceeEEEeecccccccccccccchhHhhcCceEEEEEEEEcceeeCccceEEE Confidence 12334467788888888888888888876543322 45789999999999999999999999999999 Q ss_pred EecccC Q lcl|Aclame:pro 306 RDADES 311 (311) Q Consensus 306 ~~aa~~ 311 (311) +.+.=- T Consensus 635 t~~~~g 640 (645) T protein:vir:93 635 TGVNYG 640 (645) T ss_pred ecccCC Confidence 965432 No 41 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=100.00 E-value=2.8e-55 Score=319.68 Aligned_cols=279 Identities=16% Similarity=0.117 Sum_probs=235.5 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC-CceeEEeecCccccccccceeEEEEeeee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTA-PPRGEVVGEGAQKSESTATFAPVTAIPRK 79 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k 79 (311) -.+.+++|.++|++++++|++.+++.++|+++|++++++++.+++|+.++ .+.+.|++|++.+|+++++|++++++++| T Consensus 115 ~~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~i~~~~~k 194 (395) T protein:vir:43 115 TSIDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGTTESNSVEYVRETGFVNNAAPVSEGTQKPYSDLTFELENAPVRT 194 (395) T ss_pred cccCCCCccccchhhHHHHHHHHHhhhhHHhhccceecCCCceEEEEEecCCCceeeecCCccccccccceeEEEEeeee Confidence 12456678899999999999999999999999999999988899999876 46899999999999999999999999999 Q ss_pred EEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchHH Q lcl|Aclame:pro 80 VQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPDL 159 (311) Q Consensus 80 l~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (311) ++++++||+||++++ .+++++|.+++++++++++|.++|+|++ .+..+.|+.+........ ..........++ T Consensus 195 ~~~~~~is~ell~d~----~~l~~~v~~~la~a~~~~~d~~~l~G~g--~~~~~~Gi~~~~~~~~~~-~~~~~~~~~~~~ 267 (395) T protein:vir:43 195 IAHLFKASRQILDDA----SALQSYIDARARYGLMLVEECQLLYGNG--TGANLHGIIPQAQAYAPP-SGVVVTAEQRID 267 (395) T ss_pred EEEeehhhHHHHHhH----HHHHHHHHHHHHHHHHHHHHHHHHhccC--CCCccccccccccccccc-cccccccchhHH Confidence 999999999999643 3689999999999999999999999964 444555554332222111 112223345678 Q ss_pred HHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccccccccccccccccccc Q lcl|Aclame:pro 160 AVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRT 239 (311) Q Consensus 160 ~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~ 239 (311) ++.+++..+...+..+++|+|||.++..|+++||++|+|+|++ +..+.+++|+|+||++++.+|.+ T Consensus 268 ~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~-~~~~~~~~l~G~pVv~~~~~~~~------------- 333 (395) T protein:vir:43 268 RIRLAILQAQLAEFPASGIVLNPIDWALIELNKDAENRYIIGS-PQNGTTPTLWRLPVVETQAITQD------------- 333 (395) T ss_pred HHHHHHHhhccccCCCcEEEEcHHHHHHHHHhhccCCceeccc-cccCCCceecceeeEEcCCCCCC------------- Confidence 8889999998888888899999999999999999999999976 45666789999999999998744 Q ss_pred ccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccc Q lcl|Aclame:pro 240 TNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADE 310 (311) Q Consensus 240 ~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~ 310 (311) .+++|||+. +.+.++.+++++++++.. .+|++|++.||++.|+|+++++|+||++++.+++ T Consensus 334 -----~~~~gd~~~~~~~~~~~~~~i~~~~~~~-----~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~taa 395 (395) T protein:vir:43 334 -----EFLTGAFSLGAQIFDRMDIEVLVSTEND-----KDFENNMVTIRAEERLAFAVYRPEAFVTGSLTAS 395 (395) T ss_pred -----cEEEEeccceEEEEEecceEEEEecccc-----chhhcCcEEEEEEEeeccEEecccceEEEEeccC Confidence 578999987 667888999999887543 3599999999999999999999999999999988 No 42 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=100.00 E-value=1.3e-55 Score=321.50 Aligned_cols=284 Identities=16% Similarity=0.128 Sum_probs=235.1 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeC-CceeEEeecCccccccccceeEEEEeee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTA-PPRGEVVGEGAQKSESTATFAPVTAIPR 78 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~-~~~a~~v~Eg~~~~~~~~~~~~v~l~~~ 78 (311) -.+.+.||++||+++.++|++.+++.++|+++|+++++.++. ..+|+..+ ...+.|++|++++|+++++|+++++.++ T Consensus 119 ~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~f~~~~l~~~ 198 (409) T protein:vir:45 119 VAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQILTTSDGRTMEWATADGTSEVGVLLGENEEAGEEDTDFGMGSLGAL 198 (409) T ss_pred CccCcCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEeeccCccccccccccccccccccccceeeeeee Confidence 235567899999999999999999999999999999998775 44555544 3457899999999999999999999999 Q ss_pred eEE-EEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccch Q lcl|Aclame:pro 79 KVQ-VTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATP 157 (311) Q Consensus 79 kl~-~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (311) |++ ++++||+||+++ +.++++++|..++++++++++|++||+|+|.+....+.|+...... ........... T Consensus 199 k~~~~~i~is~ell~d---s~~~l~~~i~~~la~a~~~~~~~a~l~G~G~~~~~~p~Gil~~~~~----~~~~~~~~~~~ 271 (409) T protein:vir:45 199 KMTSKIIRVSNELLQD---SAIDMEAYLARRIAERIGRGEARYLIQGTGAGTPKQPKGLAASVTG----TTQTAAANAVK 271 (409) T ss_pred eeeeeehhhhHHHHhc---cHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCccccceeeecccc----ccccccccccc Confidence 985 578999999964 4578999999999999999999999999876555555555433222 22222334456 Q ss_pred HHHHHHHHHHHhhcCCCccE--EEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccccccccccccccc Q lcl|Aclame:pro 158 DLAVEAAVGLVLGDNLSPDG--VALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTG 235 (311) Q Consensus 158 ~~~i~~~~~~~~~~~~~~~~--~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~ 235 (311) ++++.+++..+......... |+||+.++..|++|||++|||+|.+....+.+.+|+|+||++++.+|.. T Consensus 272 ~d~i~~l~~~l~~~~~~~a~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~p~~--------- 342 (409) T protein:vir:45 272 WQEILALKHSIDPAYRRGPKFRLAFNDNTLKLISEMEDGQGRPLWLPDIVGVAPASVLNVPYVIDQEIDDI--------- 342 (409) T ss_pred hHHHHHHHHhhhhhhccCCeEEEEECHHHHHHHHHhhcCCCceeeccCcCCCCCceecceeeEEecCcCCc--------- Confidence 77888888888776655554 5779999999999999999999999888888999999999999999842 Q ss_pred ccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 236 VYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 236 ~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ..+...++||||+.|.+..+.+++++.+.+. +|++|++.||++.|+|+++++|+||++|+.++++ T Consensus 343 ----~~~~~~i~~Gd~~~~~i~~~~~~~~~~~~d~-------~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~k~s~ 407 (409) T protein:vir:45 343 ----GAGKKFMFCGDFDRFIIRRVRYMILKRLVER-------YAEYDQTGFLAFHRFDCILEDTSAIKALVGKGSV 407 (409) T ss_pred ----cCCccEEEEeehhhhheeeccceEEEEeecc-------cccCCcEEEEEEEEeccEeechhheEEEEeccCC Confidence 1234468899999999999999999877543 3788999999999999999999999999999999 No 43 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=100.00 E-value=4.8e-55 Score=318.38 Aligned_cols=277 Identities=17% Similarity=0.122 Sum_probs=235.1 Q ss_pred Cc-ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC-CceeEEeecCccccccccceeEEEEeee Q lcl|Aclame:pro 1 MV-ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTA-PPRGEVVGEGAQKSESTATFAPVTAIPR 78 (311) Q Consensus 1 ma-t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~~~~~~~~~v~l~~~ 78 (311) |. +.+++|.++|+++...|++.+++.++|+++|+++++.++.+++|+..+ .+.+.|++|++++|+++++|+++++.++ T Consensus 105 ~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~ 184 (385) T protein:vir:19 105 LGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYVREEVFTNNADVVAEKALKPESDITFSKQTANVK 184 (385) T ss_pred hccccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceEEEEEecCCcceeeeccCccccccccceeEEEEeee Confidence 44 334457788999999999999999999999999999988899999875 5789999999999999999999999999 Q ss_pred eEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceee-ccccccch Q lcl|Aclame:pro 79 KVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVEL-TTGTSATP 157 (311) Q Consensus 79 kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 157 (311) |++++++||+|+++++ .+++++|..++++++++++|.++|+|++ .+..+.|+.. ....... ...+.... T Consensus 185 k~~~~~~is~ell~d~----~~l~~~i~~~la~a~~~~~d~~~l~G~g--~~~~~~Gi~~----~~~~~~~~~~~~~~~~ 254 (385) T protein:vir:19 185 TIAHWVQASRQVMDDA----PMLQSYINNRLMYGLALKEEGQLLNGDG--TGDNLEGLNK----VATAYDTSLNATGDTR 254 (385) T ss_pred eEEEeehhhHHHHhhH----HHHHHHHHHHHHHHHHHHHHHHHHhccC--CCCccccccc----ccccccccccccccch Confidence 9999999999999633 3689999999999999999999999954 3444444433 2222221 22234456 Q ss_pred HHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccccccccccccccccc Q lcl|Aclame:pro 158 DLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVY 237 (311) Q Consensus 158 ~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~ 237 (311) ++++.+++..+...++.+++|+|||.++..|+++||++|+|+|++. ..+.+++|+|+||++++.+|.+ T Consensus 255 ~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~l~~~~-~~~~~~~l~G~pV~~~~~~p~~----------- 322 (385) T protein:vir:19 255 ADIIAHAIYQVTESEFSASGIVLNPRDWHNIALLKDNEGRYIFGGP-QAFTSNIMWGLPVVPTKAQAAG----------- 322 (385) T ss_pred HHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccCc-ccCCCceecceeeEEcCcCCCC----------- Confidence 7889999999988888999999999999999999999999999764 5667899999999999998843 Q ss_pred ccccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 238 RTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 238 ~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .+++|||+. |.+..+.++++++.++.. ++|++|++.||+++|+|+++.+|+||++++.+++| T Consensus 323 -------~~~~gd~~~~~~~~~~~~~~v~~~~~~~-----~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:19 323 -------TFTVGGFDMASQVWDRMDATVEVSREDR-----DNFVKNMLTILCEERLALAHYRPTAIIKGTFSSGS 385 (385) T ss_pred -------cEEEeecccEEEEEEecceEEEEecccc-----chhhcCcEEEEEEEeeccEEecccceEEEEeccCC Confidence 588999986 778889999998876532 35999999999999999999999999999999999 No 44 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=100.00 E-value=4.8e-55 Score=318.38 Aligned_cols=277 Identities=17% Similarity=0.122 Sum_probs=235.1 Q ss_pred Cc-ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC-CceeEEeecCccccccccceeEEEEeee Q lcl|Aclame:pro 1 MV-ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTA-PPRGEVVGEGAQKSESTATFAPVTAIPR 78 (311) Q Consensus 1 ma-t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~~~~~~~~~v~l~~~ 78 (311) |. +.+++|.++|+++...|++.+++.++|+++|+++++.++.+++|+..+ .+.+.|++|++++|+++++|+++++.++ T Consensus 105 ~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~ 184 (385) T protein:vir:18 105 LGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYVREEVFTNNADVVAEKALKPESDITFSKQTANVK 184 (385) T ss_pred hccccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceEEEEEecCCcceeeeccCccccccccceeEEEEeee Confidence 44 334457788999999999999999999999999999988899999875 5789999999999999999999999999 Q ss_pred eEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceee-ccccccch Q lcl|Aclame:pro 79 KVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVEL-TTGTSATP 157 (311) Q Consensus 79 kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 157 (311) |++++++||+|+++++ .+++++|..++++++++++|.++|+|++ .+..+.|+.. ....... ...+.... T Consensus 185 k~~~~~~is~ell~d~----~~l~~~i~~~la~a~~~~~d~~~l~G~g--~~~~~~Gi~~----~~~~~~~~~~~~~~~~ 254 (385) T protein:vir:18 185 TIAHWVQASRQVMDDA----PMLQSYINNRLMYGLALKEEGQLLNGDG--TGDNLEGLNK----VATAYDTSLNATGDTR 254 (385) T ss_pred eEEEeehhhHHHHhhH----HHHHHHHHHHHHHHHHHHHHHHHHhccC--CCCccccccc----ccccccccccccccch Confidence 9999999999999633 3689999999999999999999999954 3444444433 2222221 22234456 Q ss_pred HHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccccccccccccccccc Q lcl|Aclame:pro 158 DLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVY 237 (311) Q Consensus 158 ~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~ 237 (311) ++++.+++..+...++.+++|+|||.++..|+++||++|+|+|++. ..+.+++|+|+||++++.+|.+ T Consensus 255 ~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~l~~~~-~~~~~~~l~G~pV~~~~~~p~~----------- 322 (385) T protein:vir:18 255 ADIIAHAIYQVTESEFSASGIVLNPRDWHNIALLKDNEGRYIFGGP-QAFTSNIMWGLPVVPTKAQAAG----------- 322 (385) T ss_pred HHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccCc-ccCCCceecceeeEEcCcCCCC----------- Confidence 7889999999988888999999999999999999999999999764 5667899999999999998843 Q ss_pred ccccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 238 RTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 238 ~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .+++|||+. |.+..+.++++++.++.. ++|++|++.||+++|+|+++.+|+||++++.+++| T Consensus 323 -------~~~~gd~~~~~~~~~~~~~~v~~~~~~~-----~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:18 323 -------TFTVGGFDMASQVWDRMDATVEVSREDR-----DNFVKNMLTILCEERLALAHYRPTAIIKGTFSSGS 385 (385) T ss_pred -------cEEEeecccEEEEEEecceEEEEecccc-----chhhcCcEEEEEEEeeccEEecccceEEEEeccCC Confidence 588999986 778889999998876532 35999999999999999999999999999999999 No 45 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=100.00 E-value=7.3e-55 Score=317.36 Aligned_cols=274 Identities=14% Similarity=0.055 Sum_probs=229.1 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCce--EEEEEeCCceeEEeecCccccc-cccceeEEEEee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQ--QYMTLTAPPRGEVVGEGAQKSE-STATFAPVTAIP 77 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~--~~p~~~~~~~a~~v~Eg~~~~~-~~~~~~~v~l~~ 77 (311) -.+.+.||++||+++.++|++.+++.++|++++++++++++.. .+++..+.+.+.|++||+++|+ ++++|+++++++ T Consensus 93 ~~t~~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~ 172 (371) T protein:vir:81 93 EGSNQDGGYTVPQDIQTRINELRESKDALQNLITVEPVTTLSGSRVFKKRSQQTGFVEVAEGAAIGEKATPQFTLLQYQV 172 (371) T ss_pred cCCCccCceeecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCcceeeeccccccccccccceeeEEeee Confidence 3356678999999999999999999999999999999987654 4566677789999999999996 679999999999 Q ss_pred eeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccch Q lcl|Aclame:pro 78 RKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATP 157 (311) Q Consensus 78 ~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (311) +|++++++||+|++++ +.++++++|.+++++++++++|.++++|++.+.. . .... T Consensus 173 ~k~~~~~~iS~ell~d---s~~~l~~~i~~~l~~a~~~~~~~~i~~g~g~~~~-------~---------------~~~~ 227 (371) T protein:vir:81 173 KKYAGFFRVTNELLND---STEAIVNTLVRWIGDESRVTRNGLIINVLNTKAK-------T---------------AIAD 227 (371) T ss_pred eEEEEeehhhHHHHhh---hhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-------c---------------cccc Confidence 9999999999999964 4578999999999999999999999999653211 1 1122 Q ss_pred HHHHHHHH-HHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccccccccccc Q lcl|Aclame:pro 158 DLAVEAAV-GLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGV 236 (311) Q Consensus 158 ~~~i~~~~-~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~ 236 (311) ++++..++ ..+........+|+|||.+|..|+++||++|+|+|.+....+.+++|+|+||++++.+|.+..... T Consensus 228 ~~~i~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~----- 302 (371) T protein:vir:81 228 LDGLKQIINVQLDPVFRSTSSVIVNQDAFNWLDTLKDQNGQYLLQPSISSPTGRQLLGLPVVIVSNKVLANRVDG----- 302 (371) T ss_pred HHHHHHHHHhhcchhhhcCCEEEEcHHHHHHHHHhhccCCCeeeecccCCCCCceecceeEEEecccccCccccc----- Confidence 44555444 345455555668999999999999999999999999888888999999999999999985543322 Q ss_pred cccccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccc Q lcl|Aclame:pro 237 YRTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADE 310 (311) Q Consensus 237 ~~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~ 310 (311) ....+...+++|||+. +.+..+.+++++++++.+ ++|++|++.||++.|+|+++++|+||++++.+++ T Consensus 303 -~~~~~~~~i~~Gd~~~~~~~~~~~~~~i~~~~~~~-----~~f~~~~v~~~~~~r~d~~~~~~~a~~~~~~~~A 371 (371) T protein:vir:81 303 -GTGAQFAPIIVGDLKEAVVMFDRQRTEIMSSNVAM-----DAFETDATLWRAIERMDVKMRDDEAFVFGEVQLA 371 (371) T ss_pred -cccCCcceEEEEehhceEEEEeecceEEEEecccc-----chhhcCceEEEEEEeeccEEecccceEEEEEecC Confidence 2345667899999996 677889999999887643 4699999999999999999999999999999999 No 46 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=100.00 E-value=1e-54 Score=316.55 Aligned_cols=273 Identities=17% Similarity=0.127 Sum_probs=232.4 Q ss_pred Cc--ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCC-ceeEEeecCccccccccceeEEEEee Q lcl|Aclame:pro 1 MV--ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAP-PRGEVVGEGAQKSESTATFAPVTAIP 77 (311) Q Consensus 1 ma--t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~-~~a~~v~Eg~~~~~~~~~~~~v~l~~ 77 (311) +. +++++|+++|+++.+.|++.+++.++|++++++++++++.+++|+.++. +.+.|++||+++|+++++|+++++.+ T Consensus 113 ~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~ 192 (390) T protein:vir:97 113 ASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTT 192 (390) T ss_pred hhcccccccccccchhhhHHHHHHHhhhhhhHhhcceeeccCCceEEEEEecCCcceeeecCCccccccccceeEEEEee Confidence 33 4456788999999999999999999999999999999999999998764 68999999999999999999999999 Q ss_pred eeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceee-ccccccc Q lcl|Aclame:pro 78 RKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVEL-TTGTSAT 156 (311) Q Consensus 78 ~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~-~~~~~~~ 156 (311) +|+++++++|+|+++++ .+++++|.+++++++++++|.++|+|++ ++..+.|+ ......... ...+... T Consensus 193 ~k~~~~~~is~ell~ds----~~l~~~i~~~la~a~~~~~d~a~l~G~g--~~~~p~Gi----~~~~~~~~~~~~~~~~~ 262 (390) T protein:vir:97 193 HVIAHTMKATRQILSDA----PQLASYMNNRLIRGLKVKEDAEILRGTG--ANDGLLGL----IPQATTYAAPTTIAGAT 262 (390) T ss_pred eeEEEeehhhHHHHHhH----HHHHHHHHHHHHHHHHHHHHHHHhhcCC--CCccccce----eeccccccccccccccc Confidence 99999999999999754 3689999999999999999999999854 33344444 333222222 2223445 Q ss_pred hHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccccccccccc Q lcl|Aclame:pro 157 PDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGV 236 (311) Q Consensus 157 ~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~ 236 (311) .++++.+++..+...++.+++|+|||++|..|+++||++|+|+|++.. ...+++|+|+||++++.+|.+ T Consensus 263 ~~d~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~-~~~~~~l~G~pV~~~~~~~~~---------- 331 (390) T protein:vir:97 263 RVDQLRLAMLQASLAEYPASGIVINPIDWAAIELAKDANNQYLIGNAR-GTLTPTLWGLPVVATQAMAPG---------- 331 (390) T ss_pred hHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecCcc-CCCCceecceeeEEcCCCCCC---------- Confidence 677888899999999999999999999999999999999999998754 456789999999999998743 Q ss_pred cccccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEec Q lcl|Aclame:pro 237 YRTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDA 308 (311) Q Consensus 237 ~~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~a 308 (311) .+++|||+. |.+..+.+++++++++. .+|++|++++|++.|+|+++++|+||++++.| T Consensus 332 --------~~~~gd~~~~~~~~~~~~~~i~~~~~~------~~f~~~~~~~r~~~r~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 332 --------EFLVGAFDLAAQIFDQWDARVEIGYVN------DDFQRNMVTVLAEERLALVVYRPEALITGSFA 390 (390) T ss_pred --------cEEEEeccceEEEEEecceEEEEeecc------cccccCcEEEEEEEeeccEEeccccEEEEEeC Confidence 578999986 77888999999887543 24999999999999999999999999999999 No 47 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=100.00 E-value=2.2e-54 Score=314.72 Aligned_cols=281 Identities=10% Similarity=0.027 Sum_probs=230.9 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCcccccc------ccceeEEE Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSES------TATFAPVT 74 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~------~~~~~~v~ 74 (311) -.+.+.|+.+||++++++|++.+++.++++++|++++++++...+|+.++++.+.|++|++.++++ +++|++++ T Consensus 164 ~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~v~e~~~~~~~~~~~~~~~~~~~i~ 243 (458) T protein:vir:10 164 SSSVEVSSESYETIFSQRIIRDLQKELVVGALFEELPMSSKILTMLVEPDAGKATWVAASTYGTDTTTGEEVKGALKEIH 243 (458) T ss_pred cccCccccceehhhHhHHHHHHHHhhhhHHhhcceeecCCcceEEEEecCCcceeecccccccccccccccccccceeeE Confidence 113456889999999999999999999999999999999998999999999999999999888854 57899999 Q ss_pred EeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCcccccccccccccccc-cee--ecc Q lcl|Aclame:pro 75 AIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTN-IVE--LTT 151 (311) Q Consensus 75 l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~-~~~--~~~ 151 (311) +.++|++++++||+|+++++ .++++++|.++++++|++++|.+||+|+|. + .+.|+.+....... ... .+. T Consensus 244 ~~~~k~~~~v~is~ell~ds---~~~~~~~i~~~l~~~i~~~~d~~~l~G~G~--~-~p~Gi~~~~~~~~~~~~~~~~~~ 317 (458) T protein:vir:10 244 FSTYKLAAKSFITDETEEDA---IFSLLPLLRKRLIEAHAVSIEEAFMTGDGS--G-KPKGLLTLASEDSAKVVTEAKAD 317 (458) T ss_pred eeeeeEEeeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHhhcCCCC--C-ccceeeecccccccceeeccccc Confidence 99999999999999999644 478999999999999999999999999653 2 33444332211111 111 111 Q ss_pred ccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccc----cccCCCceecceeEEeeccccccc Q lcl|Aclame:pro 152 GTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPEL----GFGTDVASFAGLNAAVSDTVRGGP 227 (311) Q Consensus 152 ~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~----~~~~~~~~l~G~pv~~~~~~~~~~ 227 (311) ......++++.+++..+...+..+++|+|||.+|..|+++||++|+|+|.+. ...+.+++|+|+||+++++||.+ T Consensus 318 ~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pv~~~~~~p~~- 396 (458) T protein:vir:10 318 GSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDSVKLQGQVGRIYGLPVVVSEYFPAK- 396 (458) T ss_pred ccccccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHhhcccCCceeeccccccccccCcCceecceeeEEccccccc- Confidence 2233467889999999988888888999999999999999999999998643 33455678999999999999853 Q ss_pred ccccccccccccccccceEEEeecc-eEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEE Q lcl|Aclame:pro 228 EAVTASTGVYRTTNPNVKAIAGDFS-AFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVR 306 (311) Q Consensus 228 ~~~~~~~~~~~~~~~~~~~~~gd~~-~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~ 306 (311) .+...+++|||. .|.+.++.+++++++++ +.+|++.||++.|+|+.+.+|+||++.+ T Consensus 397 -------------~~~~~~~~~~f~~~~~~~~~~~~~v~~d~~---------~~~~~~~~~~~~r~~~~v~~~~a~v~~~ 454 (458) T protein:vir:10 397 -------------ANSAEFAVIVYKDNFVMPRQRAVTVERERQ---------AGKQRDAYYVTQRVNLQRYFANGVVSGT 454 (458) T ss_pred -------------cCCcceEEEEecccEEEEEeeceEEEeecc---------cCCCceEEEEEEEecceEecccceEEEe Confidence 233467889995 57889999999987655 5689999999999999999999999999 Q ss_pred eccc Q lcl|Aclame:pro 307 DADE 310 (311) Q Consensus 307 ~aa~ 310 (311) .||| T Consensus 455 ~aa~ 458 (458) T protein:vir:10 455 YAAS 458 (458) T ss_pred eccC Confidence 9999 No 48 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=100.00 E-value=2.8e-54 Score=314.21 Aligned_cols=279 Identities=14% Similarity=0.061 Sum_probs=231.8 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEE--eCCceeEEeecCccccc-cccceeEEEEee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTL--TAPPRGEVVGEGAQKSE-STATFAPVTAIP 77 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~--~~~~~a~~v~Eg~~~~~-~~~~~~~v~l~~ 77 (311) ..+++.|+++||+++.++|++.+++.++|++++++++++++..++|+. .+...+.|++|++++|+ +.++|+++++.+ T Consensus 123 ~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~ 202 (415) T protein:vir:47 123 SLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDI 202 (415) T ss_pred cccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceeecccccccccccccceeeEEeee Confidence 224566889999999999999999999999999999999888777765 56678999999999997 568999999999 Q ss_pred eeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccch Q lcl|Aclame:pro 78 RKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATP 157 (311) Q Consensus 78 ~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (311) +|++++++||+|+++++ .++++++|++++++++++++|.++|+|++.+.. ..+.. ...........+.... T Consensus 203 ~k~~~~~~iS~ell~ds---~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~--~~~~~----~~~~~~~~~~~~~~~~ 273 (415) T protein:vir:47 203 NTHRGYFRISREAIEDA---KVNVLQELKLWMARTIAATRNKAIIDVITKGST--GSTSS----GFEKEGKKLEVKKAKS 273 (415) T ss_pred eeeEeeehhhHHHHhhc---hHHHHHHHHHHHHHHHHHHHHHHHhhccccCCc--ccccc----ccccccceeccccccc Confidence 99999999999999644 578999999999999999999999999653222 22111 1111222233344566 Q ss_pred HHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccccccccccccccccc Q lcl|Aclame:pro 158 DLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVY 237 (311) Q Consensus 158 ~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~ 237 (311) ++++.+++..+...++.+++|+|||++|..|+++||++|+|+|.+....+.+++|+|+||++++.+|.. T Consensus 274 ~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~----------- 342 (415) T protein:vir:47 274 LDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLG----------- 342 (415) T ss_pred hHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCccccceeeEEecccccc----------- Confidence 888999999988888888999999999999999999999999998888888999999999999888732 Q ss_pred ccccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 238 RTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 238 ~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ..+...++||||+. +.+..+.+++++.++ |.++++.+|+++|+|+++.+|+||++++..+.+ T Consensus 343 --~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~----------~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~ 405 (415) T protein:vir:47 343 --QKGNNTLIIGNLKDAIVLFDRSQYQASWTD----------YMHFGECLMIAVRQDCRILDYKSAIVIEYDDSE 405 (415) T ss_pred --CCCccEEEEEehhccEEEEeecceEEEeec----------cccCceEEEEEEEeccEEeccccEEEEEeeccC Confidence 12344689999997 667888999888764 456778899999999999999999999988887 No 49 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=100.00 E-value=2.8e-54 Score=314.21 Aligned_cols=279 Identities=14% Similarity=0.061 Sum_probs=231.8 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEE--eCCceeEEeecCccccc-cccceeEEEEee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTL--TAPPRGEVVGEGAQKSE-STATFAPVTAIP 77 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~--~~~~~a~~v~Eg~~~~~-~~~~~~~v~l~~ 77 (311) ..+++.|+++||+++.++|++.+++.++|++++++++++++..++|+. .+...+.|++|++++|+ +.++|+++++.+ T Consensus 123 ~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~ 202 (415) T protein:vir:46 123 SLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDI 202 (415) T ss_pred cccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceeecccccccccccccceeeEEeee Confidence 224566889999999999999999999999999999999888777765 56678999999999997 568999999999 Q ss_pred eeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccch Q lcl|Aclame:pro 78 RKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATP 157 (311) Q Consensus 78 ~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (311) +|++++++||+|+++++ .++++++|++++++++++++|.++|+|++.+.. ..+.. ...........+.... T Consensus 203 ~k~~~~~~iS~ell~ds---~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~--~~~~~----~~~~~~~~~~~~~~~~ 273 (415) T protein:vir:46 203 NTHRGYFRISREAIEDA---KVNVLQELKLWMARTIAATRNKAIIDVITKGST--GSTSS----GFEKEGKKLEVKKAKS 273 (415) T ss_pred eeeEeeehhhHHHHhhc---hHHHHHHHHHHHHHHHHHHHHHHHhhccccCCc--ccccc----ccccccceeccccccc Confidence 99999999999999644 578999999999999999999999999653222 22111 1111222233344566 Q ss_pred HHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccccccccccccccccc Q lcl|Aclame:pro 158 DLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVY 237 (311) Q Consensus 158 ~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~ 237 (311) ++++.+++..+...++.+++|+|||++|..|+++||++|+|+|.+....+.+++|+|+||++++.+|.. T Consensus 274 ~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~----------- 342 (415) T protein:vir:46 274 LDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLG----------- 342 (415) T ss_pred hHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCccccceeeEEecccccc----------- Confidence 888999999988888888999999999999999999999999998888888999999999999888732 Q ss_pred ccccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 238 RTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 238 ~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ..+...++||||+. +.+..+.+++++.++ |.++++.+|+++|+|+++.+|+||++++..+.+ T Consensus 343 --~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~----------~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~ 405 (415) T protein:vir:46 343 --QKGNNTLIIGNLKDAIVLFDRSQYQASWTD----------YMHFGECLMIAVRQDCRILDYKSAIVIEYDDSE 405 (415) T ss_pred --CCCccEEEEEehhccEEEEeecceEEEeec----------cccCceEEEEEEEeccEEeccccEEEEEeeccC Confidence 12344689999997 667888999888764 456778899999999999999999999988887 No 50 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=100.00 E-value=3.4e-54 Score=313.74 Aligned_cols=283 Identities=11% Similarity=0.084 Sum_probs=231.0 Q ss_pred Cc--ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCC--CceEEEEEeCCceeEEeecCcccccc--ccceeEEE Q lcl|Aclame:pro 1 MV--ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEF--GEQQYMTLTAPPRGEVVGEGAQKSES--TATFAPVT 74 (311) Q Consensus 1 ma--t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~--~~~~~p~~~~~~~a~~v~Eg~~~~~~--~~~~~~v~ 74 (311) |. +.++||++||+++.++|++.+++.++|++++++++++. +.+.+|+..+.+.+.|++|++.++.+ +++|++++ T Consensus 110 ~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g~~~~~~~~~~~~~~~v~e~~~~~~~~~~~~f~~i~ 189 (404) T protein:vir:10 110 ISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEPVFTRSGSRTYEKRSKQKPMKPLSENQQIPTNGDNGKLERFN 189 (404) T ss_pred hccccCCCCceeechhHHHHHHHHHhhhhhHhhhhceeeccCCccceEEEEecCCcceeeccccccccccccccceeeeE Confidence 33 55778999999999999999999999999999988864 56789999999999999999999875 58999999 Q ss_pred EeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccc Q lcl|Aclame:pro 75 AIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTS 154 (311) Q Consensus 75 l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 154 (311) ++++|++++++||+|++++ +.++++++|.+++++++++++|.++|+|++ ++..+.|+ ......... ..+. T Consensus 190 ~~~~k~~~~~~iS~ell~d---s~~~l~~~i~~~la~~~~~~~~~~il~G~g--~~~~~~gi----~~~~~~~~~-~~~~ 259 (404) T protein:vir:10 190 FKLKDLADFMSIPNDLLKF---ADKSLEDWIINWFVDKVRITRNAEILYGAG--GDEHATGI----MTANKFKKI-TLPK 259 (404) T ss_pred eeheeeEeeehhhHHHHhh---cHHHHHHHHHHHHHHHHHHHHHHHHhhcCC--CCCcccce----eecccccee-eccc Confidence 9999999999999999964 456899999999999999999999999964 34444443 333322222 2233 Q ss_pred cchHHHHHHHHHH-HhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEe-ecccccccccccc Q lcl|Aclame:pro 155 ATPDLAVEAAVGL-VLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAV-SDTVRGGPEAVTA 232 (311) Q Consensus 155 ~~~~~~i~~~~~~-~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~-~~~~~~~~~~~~~ 232 (311) ...++++..++.. +.+......+|+|||.+|..|+++||++|+|+|.+...++.+++|+|+||++ ++.++.. T Consensus 260 ~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~------ 333 (404) T protein:vir:10 260 SPALKDFKKCKNVELLNVFKATSSWIVNQDGFNYLDSLEDKTGRPYLQPDPKDPTQYRFLGLPVIELPNDLLLS------ 333 (404) T ss_pred cccHHHHHHHHHhhhhccccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCccccceeeEEecccccCC------ Confidence 4556777777664 4444444457999999999999999999999999888888899999999974 3444321 Q ss_pred cccccccccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 233 STGVYRTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 233 ~~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ..+...+++|||+. +.+..+.+++++++++.. ..|++|++.||+++|+|+++++|+||++++.+++| T Consensus 334 -------~~~~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~-----~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~aa 401 (404) T protein:vir:10 334 -------TESAIPVLLGDTKEAYKYVSDGAYELATTNIGA-----GAFETNTTKARIIMRIDGNVKDSEALLIAEIPVES 401 (404) T ss_pred -------CCCccEEEEEeccccEEEEEecceEEEEecccc-----chhhcCceEEEEEEeeccEEecccceEEEEeeccc Confidence 23455789999996 678889999999876542 34899999999999999999999999999998888 No 51 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=100.00 E-value=3.3e-54 Score=313.78 Aligned_cols=271 Identities=9% Similarity=-0.004 Sum_probs=227.7 Q ss_pred Cc--ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCc--eEEEEEe-CCceeEEeecCccccc-cccceeEEE Q lcl|Aclame:pro 1 MV--ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE--QQYMTLT-APPRGEVVGEGAQKSE-STATFAPVT 74 (311) Q Consensus 1 ma--t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~--~~~p~~~-~~~~a~~v~Eg~~~~~-~~~~~~~v~ 74 (311) |. +++.||++||+++.++|++.++++++++++|++++++++. +.+|+.. ..+.+.|++|++++++ ++++|++++ T Consensus 5 ~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~~i~ 84 (293) T protein:vir:48 5 KTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADIDDPKLSLIK 84 (293) T ss_pred ecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCcceeeecCCcccccccccceeEEE Confidence 55 5566889999999999999999999999999999987654 5566665 4678999999999997 579999999 Q ss_pred EeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccc Q lcl|Aclame:pro 75 AIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTS 154 (311) Q Consensus 75 l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 154 (311) ++++|+++++++|+|+++ |+.++++++|++++++++++++|+++++|.+.... ... T Consensus 85 l~~~k~~~~~~iS~ell~---ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~---------------------~~~ 140 (293) T protein:vir:48 85 YTIKRYAGISTVTNSLLA---DSAENILAWLSGWIAKKVVVTRNKAILGVVDKLPT---------------------KPT 140 (293) T ss_pred EeeeEEEEeehhhHHHHh---hhhHHHHHHHHHHHHHHHHHHHHhHHhhccccccc---------------------ccc Confidence 999999999999999996 44578999999999999999999999988542110 112 Q ss_pred cchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccccccccc Q lcl|Aclame:pro 155 ATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTAST 234 (311) Q Consensus 155 ~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~ 234 (311) ...++++.+++.++..++...++|+||++++..|+++||++|||+|.+....+.+++|+|+||++.+..+-. T Consensus 141 ~~~~d~i~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~l~G~Pv~~~~~~~~~-------- 212 (293) T protein:vir:48 141 LTKWDDIIDLEAKVDPAIKQTSFFLTNTSGFTALKKVKNALGDYLMERDVKSPTGYSIAGFAVKEISDRWLP-------- 212 (293) T ss_pred ccCHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceEeecCcCCCCCceecceeeEEecccccC-------- Confidence 345788999999998887777889999999999999999999999999888888999999999865432211 Q ss_pred cccccccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 235 GVYRTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 235 ~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ....+...+++|||+. +.+..+++++++++++.. ++|++|++.+|+++|+|+++++|+||++++.++.+ T Consensus 213 ---~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~-----~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~ 282 (293) T protein:vir:48 213 ---NASSGVMPLYFGDLKQAVTLFDRQQMSLLSTNIGG-----GAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIA 282 (293) T ss_pred ---CccCCceEEEEEeccceEEEEEecceEEEEecccc-----hhhhcCeEEEEEEEeeCcEEecccceEEEEeeccc Confidence 1123455789999997 668889999999887532 46999999999999999999999999999976665 No 52 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=100.00 E-value=3e-54 Score=314.03 Aligned_cols=271 Identities=9% Similarity=-0.008 Sum_probs=228.5 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCc--eEEEEEeC-CceeEEeecCccccccc-cceeEEEEe Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE--QQYMTLTA-PPRGEVVGEGAQKSEST-ATFAPVTAI 76 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~--~~~p~~~~-~~~a~~v~Eg~~~~~~~-~~~~~v~l~ 76 (311) ..+.+.||++||+++..+|++.+++.++|++++++++++++. +.+|+... .+.+.|++|++++|+++ ++|++++++ T Consensus 111 ~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~ 190 (397) T protein:vir:49 111 DGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEGGQIGQNDDPKLSLIRYA 190 (397) T ss_pred ccCCccCcceecHHHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEEeeccCCcceeeeccccccccccccceeeeEee Confidence 346677899999999999999999999999999999888765 45565544 46789999999999875 899999999 Q ss_pred eeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccc Q lcl|Aclame:pro 77 PRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSAT 156 (311) Q Consensus 77 ~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (311) ++|++++++||+|+++ ++.++++++|.+++++++++++|.++|+|+|.+. + ..... T Consensus 191 ~~k~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~~~~~~d~ail~G~g~~~--~-------------------~~~~~ 246 (397) T protein:vir:49 191 IKRYAGISTVTNSLLA---DSAENILAWLSGWIAKKVVVTRNKAILEAIGTLP--N-------------------KPTLA 246 (397) T ss_pred eeeeEeehhhHHHHHh---hhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccc--c-------------------ccccc Confidence 9999999999999996 4457899999999999999999999999965321 1 01123 Q ss_pred hHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccccccccccc Q lcl|Aclame:pro 157 PDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGV 236 (311) Q Consensus 157 ~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~ 236 (311) .++++.+++..+...+..+++|+|||.+|..|+++||++|+|+|.+....+.+++|+|+||++++..+.. T Consensus 247 ~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~g~~l~~~~~~~g~~~~l~G~pV~~~~~~~~~---------- 316 (397) T protein:vir:49 247 KWDDIIDLQAKVDPAIKQTSLFLTNTSGFTALKKVKNAMGDYLMERDVKSPTGYSIDGFVVKEISDRFLP---------- 316 (397) T ss_pred CHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceeecccccCCCCceecceeeEEecccccc---------- Confidence 4678888999999888888899999999999999999999999998888888899999999875432211 Q ss_pred cccccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 237 YRTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 237 ~~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ....+...++||||+. |.+.++.+++++++++.+ .+|++|++.+|++.|+|+++++|+||++++.++.+ T Consensus 317 -~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~-----~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~~~ 386 (397) T protein:vir:49 317 -NGTGGAMPLYFGDLKQAVTLFDRQHLSLLSTNIGG-----GAFETDTTKVRVIDRFDVVSTDTEAFVPASFKAIA 386 (397) T ss_pred -cccCCceeEEEeeccceEEEEeecccEEEEecccc-----chhhcCeeeEEEEEeeccEEecccceEEEEecccc Confidence 1234556799999996 778999999999987643 46999999999999999999999999999976666 No 53 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=100.00 E-value=4.4e-54 Score=313.08 Aligned_cols=273 Identities=18% Similarity=0.135 Sum_probs=231.6 Q ss_pred Cc--ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCC-ceeEEeecCccccccccceeEEEEee Q lcl|Aclame:pro 1 MV--ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAP-PRGEVVGEGAQKSESTATFAPVTAIP 77 (311) Q Consensus 1 ma--t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~-~~a~~v~Eg~~~~~~~~~~~~v~l~~ 77 (311) +. +++.+|.++|+++...|++.+++.++|++++++++++++.+++|+.++. +.+.|++||+++|+++++|+++++.+ T Consensus 113 ~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~ 192 (390) T protein:vir:81 113 ASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTT 192 (390) T ss_pred hccccccCCcceechhhhHHHHHHHhhhhhhhhhcceeeccCCceEEEEEecCCcceeeecCCcccccccceeeEEEEee Confidence 22 4456778899999999999999999999999999999999999998765 58999999999999999999999999 Q ss_pred eeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceee-ccccccc Q lcl|Aclame:pro 78 RKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVEL-TTGTSAT 156 (311) Q Consensus 78 ~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~-~~~~~~~ 156 (311) +|++++++||+|+++++ .+++++|.+++++++++++|.+||+|++ .+..+.|+. ........ ....... T Consensus 193 ~k~~~~~~is~ell~d~----~~~~~~i~~~l~~~~~~~~d~a~l~G~g--~~~~~~Gi~----~~~~~~~~~~~~~~~~ 262 (390) T protein:vir:81 193 HVIAHTMKATRQILSDA----PQLASYMNNRLIRGLKVKEDAEILRGTG--ANDGLLGLI----PQATTYAAPTTIAGAT 262 (390) T ss_pred eEEEEeehhhHHHHHhH----HHHHHHHHHHHHHHHHHHHHHHHHhcCC--CCCccccee----ecccccccccccccch Confidence 99999999999999643 3699999999999999999999999964 334444443 33222221 2233445 Q ss_pred hHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccccccccccc Q lcl|Aclame:pro 157 PDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGV 236 (311) Q Consensus 157 ~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~ 236 (311) .++++.+++..+...++.+++|+|||++|..|+++||++|+|+|.+.. ...+++|+|+||++++.+|.+ T Consensus 263 ~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~-~~~~~~l~G~pv~~~~~~p~~---------- 331 (390) T protein:vir:81 263 RVDQLRLAMLQASLAEYNPSGIVINPIDWAAIELAKDANNQYLIGNAR-GTLTPTLWGLPVVATQAMAPG---------- 331 (390) T ss_pred hHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecCcc-cccCceecceeeEEcCCCCCC---------- Confidence 678888999999999999999999999999999999999999998754 455679999999999998743 Q ss_pred cccccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEec Q lcl|Aclame:pro 237 YRTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDA 308 (311) Q Consensus 237 ~~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~a 308 (311) .+++|||+. |.+..+.+++++.+++. .+|++|++.+|++.|+|+++++|+||++++.| T Consensus 332 --------~~~~gd~~~~~~~~~~~~~~v~~~~~~------~~~~~~~v~~r~~~r~d~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 332 --------EFLVGAFDLAAQIFDQWDARVEIGYVG------EDFQRNMITVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred --------cEEEEehhceEEEEEecceEEEEeccc------chhhcCcEEEEEEEeeccEEecccceEEEEeC Confidence 578999986 67788899999877543 25999999999999999999999999999999 No 54 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=100.00 E-value=3.9e-54 Score=313.41 Aligned_cols=278 Identities=17% Similarity=0.087 Sum_probs=231.3 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCC----ceeEEeecCccccccc-cceeEEEE Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAP----PRGEVVGEGAQKSEST-ATFAPVTA 75 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~----~~a~~v~Eg~~~~~~~-~~~~~v~l 75 (311) .-+.+.++++||+++.++|++.+++.++|++++++++++++.+++|+.... ..+.|++||+.+|+++ ++|+++++ T Consensus 120 ~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~ 199 (413) T protein:vir:81 120 ATLTDEFQGGYGTTWNRNIIYRRREKLVVADLMDNLTMTNTTIKYLMEKANRVVEGGFKTVAEGGKKPYMRFADFDIVTE 199 (413) T ss_pred cccccccccccchhhHHHHHHHHhhhhhHHhhcceeeccCCceeEEEeccccccccccceecCcccccccCcccceeeEe Confidence 224456889999999999999999999999999999999888999997653 4679999999999987 68999999 Q ss_pred eeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeecccccc Q lcl|Aclame:pro 76 IPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSA 155 (311) Q Consensus 76 ~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (311) .++|++++++||+|||+++ . .++++|+..+++++++++|+++|+|++ .+.++ .|+....+..+....+.. T Consensus 200 ~~~k~~~~~~iS~ell~ds---~-~l~~~i~~~la~~~~~~~d~~~l~G~G--~~~~~----~Gi~~~~~~~~~~~~~~~ 269 (413) T protein:vir:81 200 SLSKIAGLTKITDEMIEDY---D-FLVSYINARLLEELAIEEERQLLLGDG--TGNNL----TGLLKRDGIQTLAVSNKD 269 (413) T ss_pred eeeeEEEeehhhHHHHHHH---H-HHHHHHHHHHHHHHHHHHHHHHhccCC--CCCcc----cccccccccccccccccc Confidence 9999999999999999644 2 489999999999999999999999954 33333 445555555555444555 Q ss_pred chHHHHHHHHHHHh-hcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccC-------CCceecceeEEeeccccccc Q lcl|Aclame:pro 156 TPDLAVEAAVGLVL-GDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGT-------DVASFAGLNAAVSDTVRGGP 227 (311) Q Consensus 156 ~~~~~i~~~~~~~~-~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~-------~~~~l~G~pv~~~~~~~~~~ 227 (311) ..++.+..++..+. ..++.+++|+|||.+|..|+++||++|+|+|.+...+. ..++|+|+||++++.+|.+ T Consensus 270 ~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~l~G~pv~~s~~~~~~- 348 (413) T protein:vir:81 270 ELADSIYKAMTNISLATPFQADALVINPLDYQELRLAKDANGQYYGGGVFQGQYGSGGIMLDPAPWGLRTVQSQVVPVG- 348 (413) T ss_pred hhHHHHHHHHHHhhhhccCCCcEEEEcHHHHHHHHHhhccCCceeccccccccccccccccCceecceeeEEcCCCCcc- Confidence 56777777776654 44667788999999999999999999999997654432 3458999999999998743 Q ss_pred ccccccccccccccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEE Q lcl|Aclame:pro 228 EAVTASTGVYRTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVR 306 (311) Q Consensus 228 ~~~~~~~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~ 306 (311) .++||||+. |.+..+.+++++++++.. .+|++|++.||+++|+|+.+++|+||++++ T Consensus 349 -----------------~~~~gd~~~~~~~~~~~~~~v~~~~~~~-----~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~ 406 (413) T protein:vir:81 349 -----------------KPVVGAFRSAASVLRKGGVRIDSTNTNV-----DDFENNLITVRAEERVGLMVTFPEAIVQLD 406 (413) T ss_pred -----------------cEEEEecccEEEEEEecceEEEEecccc-----chhhcCcEEEEEEEeeccEEecccceEEEE Confidence 588999986 677888999999987653 359999999999999999999999999999 Q ss_pred ecccC Q lcl|Aclame:pro 307 DADES 311 (311) Q Consensus 307 ~aa~~ 311 (311) .++.+ T Consensus 407 ~~~~~ 411 (413) T protein:vir:81 407 VAEVV 411 (413) T ss_pred ecCCC Confidence 88888 No 55 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=100.00 E-value=8.1e-54 Score=311.66 Aligned_cols=273 Identities=17% Similarity=0.128 Sum_probs=229.4 Q ss_pred Cc--ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCC-ceeEEeecCccccccccceeEEEEee Q lcl|Aclame:pro 1 MV--ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAP-PRGEVVGEGAQKSESTATFAPVTAIP 77 (311) Q Consensus 1 ma--t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~-~~a~~v~Eg~~~~~~~~~~~~v~l~~ 77 (311) +. +++++|.++|+++...||+.+++.++|+++|++++++++.+++|+.++. +.+.|++|++++|+++++|+++++++ T Consensus 113 ~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~ 192 (390) T protein:vir:10 113 ASTDAAGSAGALTTPNRLPGFITQPDARLTVRDLIGSGRTDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTT 192 (390) T ss_pred hhcccccccccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCCcceeeecCCccccccccceeEEEEee Confidence 22 3344566778888899999999999999999999999888999998865 68999999999999999999999999 Q ss_pred eeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceee-ccccccc Q lcl|Aclame:pro 78 RKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVEL-TTGTSAT 156 (311) Q Consensus 78 ~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~-~~~~~~~ 156 (311) +|++++++||+||++++ .+++++|.+++++++++++|+++|+|++ .+..+.|+ ......... ....... T Consensus 193 ~k~~~~~~is~ell~d~----~~l~~~i~~~l~~~~~~~~~~~il~G~G--~~~~p~Gi----~~~~~~~~~~~~~~~~~ 262 (390) T protein:vir:10 193 HVIAHTMKATRQILSDA----PQLASYMNNRLIRGLKVKEDAEILRGTG--ANDGLLGL----IPQATTYAAPTTIAGAT 262 (390) T ss_pred EEEEEeehhhHHHHHhH----HHHHHHHHHHHHHHHHHHHHHHHhhcCC--CCcccccc----ccccccccccccccccc Confidence 99999999999999643 3689999999999999999999999964 33344444 333332222 2223345 Q ss_pred hHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccccccccccc Q lcl|Aclame:pro 157 PDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGV 236 (311) Q Consensus 157 ~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~ 236 (311) .++.+..++..+...++.+++|+|||++|..|+++||++|+|+|++.. ...+++|+|+||++++.+|.+ T Consensus 263 ~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~g~~l~~~~~-~~~~~~l~G~pv~~~~~~p~~---------- 331 (390) T protein:vir:10 263 RVDQLRLAMLQASLAEYPASGIVINPIDWAAIELAKDANNQYLIGNAR-GTLTPTLWGLPVVATQAMAPG---------- 331 (390) T ss_pred hHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecCCc-CcCCceecceeeEEcCCCCCC---------- Confidence 677888999999999999999999999999999999999999998765 445679999999999998743 Q ss_pred cccccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEec Q lcl|Aclame:pro 237 YRTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDA 308 (311) Q Consensus 237 ~~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~a 308 (311) .+++|||+. |.+..+.+++++++++. .+|++|++.||++.|+|+++++|+||++++.| T Consensus 332 --------~~~~gdf~~~~~~~~~~~~~i~~~~~~------~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 332 --------EFLVGAFDLAAQIFDQWDARVEIGYVN------DDFQRNMVTVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred --------cEEEEeccceEEEEEecceEEEEeecc------cccccCcEEEEEEEeeccEEeccccEEEEEeC Confidence 578999986 66788999999886542 24999999999999999999999999999999 No 56 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=100.00 E-value=4.2e-54 Score=313.21 Aligned_cols=271 Identities=9% Similarity=0.005 Sum_probs=227.7 Q ss_pred Cc--ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCc--eEEEEEeC-CceeEEeecCccccc-cccceeEEE Q lcl|Aclame:pro 1 MV--ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE--QQYMTLTA-PPRGEVVGEGAQKSE-STATFAPVT 74 (311) Q Consensus 1 ma--t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~--~~~p~~~~-~~~a~~v~Eg~~~~~-~~~~~~~v~ 74 (311) |. +.+.||++||+++.+.|++.+++.++|+++|++++++++. +.+|+... .+.+.|++|++++|+ ++++|++++ T Consensus 109 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~ 188 (397) T protein:vir:49 109 KTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADVDDPKLSLIK 188 (397) T ss_pred hhccccccCcccccHhHHHHHHHHHHhhhhHHhhhceeecccCccceEEEeeccCCcceeeecCccccccccccceeeEE Confidence 43 5566899999999999999999999999999999887544 55666554 467999999999996 689999999 Q ss_pred EeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccc Q lcl|Aclame:pro 75 AIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTS 154 (311) Q Consensus 75 l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 154 (311) ++++|++++++||+||+++ +.++++++|.+++++++++++|.++++|++... .. .. T Consensus 189 ~~~~k~~~~~~iS~ell~d---s~~~l~~~i~~~l~~~~~~~~d~ai~~G~g~~~--~~-------------------~~ 244 (397) T protein:vir:49 189 YTIKRYAGISTVTNSLLAD---SAENILAWLSGWIAKKVVVTRNKAILEAIAALP--TK-------------------PT 244 (397) T ss_pred eeeeeEEeeehhHHHHHhh---hHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc--cc-------------------cc Confidence 9999999999999999964 457899999999999999999999999964321 11 01 Q ss_pred cchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccccccccc Q lcl|Aclame:pro 155 ATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTAST 234 (311) Q Consensus 155 ~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~ 234 (311) ...++++.+++..+.......++|+|||.++..|+++||++|+|+|.+...++.+++|+|+||++.+..+-. T Consensus 245 ~~~~d~i~~~~~~l~~~~~~~a~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~-------- 316 (397) T protein:vir:49 245 LTKWDDIIDLEAKVDPAIKQTSFFLTNTSGFTALKKVKNALGDYLMERDVKSPTGYSIDGFAVKEVADRWLA-------- 316 (397) T ss_pred cccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceeeccCcCCCCCceecceeeEEecccccc-------- Confidence 234678888899998888888899999999999999999999999998888888999999999875432110 Q ss_pred cccccccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 235 GVYRTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 235 ~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ....+...+++|||+. |.+..+.+++++++++.+ ++|++|++.+|++.|+|+++++|+||++++.++.+ T Consensus 317 ---~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~-----~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~ 386 (397) T protein:vir:49 317 ---NGTGGAMPLYFGDLKQAVTLFDRQHMSLLSTNIGG-----GAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIA 386 (397) T ss_pred ---cccCCceeEEEeeccceEEEEeecceEEEEecccc-----chhhcCceeEEEEeeeCcEEecccceEEEEeeccc Confidence 1233455799999996 678889999999887642 45999999999999999999999999999977766 No 57 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=100.00 E-value=4.7e-54 Score=312.97 Aligned_cols=271 Identities=8% Similarity=-0.009 Sum_probs=229.0 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEE---eCCceeEEeecCcccccc-ccceeEEEEe Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTL---TAPPRGEVVGEGAQKSES-TATFAPVTAI 76 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~---~~~~~a~~v~Eg~~~~~~-~~~~~~v~l~ 76 (311) +.+++.||++||++++++|++.+++.++|+++|++++++++...+|+. +..+.++|++|+++++++ +++|++++++ T Consensus 111 ~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~ 190 (397) T protein:vir:48 111 DASGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEAGSIGTNDDPKLYPIRYA 190 (397) T ss_pred ccCCccccccccHHHHHHHHHHHHHHHHHHhhhceeeccCCcceEEEEeecCCCcceeeeccccccccccccceeeEEee Confidence 456677999999999999999999999999999999998776665543 345678999999999986 5899999999 Q ss_pred eeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccc Q lcl|Aclame:pro 77 PRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSAT 156 (311) Q Consensus 77 ~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (311) ++|++++++||+|+++++ .+++++++++++++++++++|.++|+|++... . ..... T Consensus 191 ~~k~~~~~~iS~ell~ds---~~~l~~~v~~~l~~~~~~~~d~~il~G~g~~~--~-------------------~~~~~ 246 (397) T protein:vir:48 191 IKRYAGISTVTNSLLADS---AENILAWLSGWIAKKVVVTRNKAILEAIATLP--T-------------------KPTLT 246 (397) T ss_pred heeeeeehhhHHHHHhhc---hHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--c-------------------ccccc Confidence 999999999999999654 57899999999999999999999999964321 1 01223 Q ss_pred hHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccccccccccc Q lcl|Aclame:pro 157 PDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGV 236 (311) Q Consensus 157 ~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~ 236 (311) .++++.+++..+...+...++|+|||.++..|+++||++|+|+|.+....+.+++|+|+||++.+..+-. T Consensus 247 ~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~---------- 316 (397) T protein:vir:48 247 KWDDIIDLQAKVDPAIKQTSFFLTNTSGFTALKKVKNAFGDYLMERDVKSPTGYSIDGFAVKEVADRWLA---------- 316 (397) T ss_pred cHHHHHHHHHHhhhhhcCCCEEEECHHHHHHHHHhhcCCCceeeccCcCCCCCceeccceeEEecccccC---------- Confidence 4678888888998888888899999999999999999999999998888888999999999875432211 Q ss_pred cccccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 237 YRTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 237 ~~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ....+...+++|||+. +.+..+.+++++++++.+ .+|.+|++.+|+++|+|+++++|+||++++.++.+ T Consensus 317 -~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~-----~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~ 386 (397) T protein:vir:48 317 -NASSGAMPLYFGDLKQAVTLFDRQQMSLLSTNIGG-----GAFETDTTKIRVIDRFDVVATDTESFVPASFKAIA 386 (397) T ss_pred -CcCCCceEEEEEeccceEEEEeecceEEEEeccch-----hhhhcCceeEEEEeeeccEEecccceEEEEecccc Confidence 1234556899999996 568889999999887643 46999999999999999999999999999987776 No 58 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=100.00 E-value=8.8e-54 Score=311.46 Aligned_cols=279 Identities=14% Similarity=0.048 Sum_probs=231.2 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceE--EEEEeCCceeEEeecCcccccc-ccceeEEEEee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQ--YMTLTAPPRGEVVGEGAQKSES-TATFAPVTAIP 77 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~--~p~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~l~~ 77 (311) ..+++.||++||+++.+.|++.+++.++|++++++++|+++... +|+.++...+.|++|++++|+. .++|+++++++ T Consensus 123 ~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~ 202 (415) T protein:vir:79 123 SLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDI 202 (415) T ss_pred cccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeee Confidence 23556688999999999999999999999999999999877655 4556677889999999999975 68999999999 Q ss_pred eeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccch Q lcl|Aclame:pro 78 RKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATP 157 (311) Q Consensus 78 ~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (311) +|++++++||+||++++ .++++++|.+++++++++++|.++++|++.+.. ..+... ........+...... T Consensus 203 ~k~~~~~~iS~ell~ds---~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~--~~~~~~----~~~~~~~~~~~~~~~ 273 (415) T protein:vir:79 203 NTHRGYFRISREAIEDA---KVNVLQELKLWMARTIAATRNKAIIDVITKGST--GSTSSG----FEKEGKKLEVKKAKS 273 (415) T ss_pred eeeEeeehhhHHHHhhc---hHHHHHHHHHHHHHHHHHHHHHHHhhccccCcc--cccccc----ccccccccccccccc Confidence 99999999999999644 568999999999999999999999999654332 221111 111122233445567 Q ss_pred HHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccccccccccccccccc Q lcl|Aclame:pro 158 DLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVY 237 (311) Q Consensus 158 ~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~ 237 (311) |+++.+++..+...++.+++|+|||++|..|+++||++|+|+|.+....+.+++|+|+||++++.+|.. T Consensus 274 ~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~----------- 342 (415) T protein:vir:79 274 LDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLG----------- 342 (415) T ss_pred hhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecccccC----------- Confidence 888999999998888888999999999999999999999999998888888999999999998888743 Q ss_pred ccccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 238 RTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 238 ~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ..+...++||||+. |.+..+.+++++.++ |.++...+|+.+|+|+++.||+||++++..+++ T Consensus 343 --~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~----------~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~ 405 (415) T protein:vir:79 343 --QKGNNTLIIGNLKDAIVLFDRSQYQASWTD----------YMHFGECLMIAVRQDCRILDYKSAIVIEYDDSE 405 (415) T ss_pred --CCCccEEEEEehhccEEEEeecceEEEEec----------cccCceEEEEEEEeccEEeccccEEEEEEeccC Confidence 12344689999997 557888999988764 345667899999999999999999999988888 No 59 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=100.00 E-value=8.8e-54 Score=311.46 Aligned_cols=279 Identities=14% Similarity=0.048 Sum_probs=231.2 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceE--EEEEeCCceeEEeecCcccccc-ccceeEEEEee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQ--YMTLTAPPRGEVVGEGAQKSES-TATFAPVTAIP 77 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~--~p~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~l~~ 77 (311) ..+++.||++||+++.+.|++.+++.++|++++++++|+++... +|+.++...+.|++|++++|+. .++|+++++++ T Consensus 123 ~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~ 202 (415) T protein:vir:98 123 SLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDI 202 (415) T ss_pred cccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeee Confidence 23556688999999999999999999999999999999877655 4556677889999999999975 68999999999 Q ss_pred eeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccch Q lcl|Aclame:pro 78 RKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATP 157 (311) Q Consensus 78 ~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (311) +|++++++||+||++++ .++++++|.+++++++++++|.++++|++.+.. ..+... ........+...... T Consensus 203 ~k~~~~~~iS~ell~ds---~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~--~~~~~~----~~~~~~~~~~~~~~~ 273 (415) T protein:vir:98 203 NTHRGYFRISREAIEDA---KVNVLQELKLWMARTIAATRNKAIIDVITKGST--GSTSSG----FEKEGKKLEVKKAKS 273 (415) T ss_pred eeeEeeehhhHHHHhhc---hHHHHHHHHHHHHHHHHHHHHHHHhhccccCcc--cccccc----ccccccccccccccc Confidence 99999999999999644 568999999999999999999999999654332 221111 111122233445567 Q ss_pred HHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccccccccccccccccc Q lcl|Aclame:pro 158 DLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVY 237 (311) Q Consensus 158 ~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~ 237 (311) |+++.+++..+...++.+++|+|||++|..|+++||++|+|+|.+....+.+++|+|+||++++.+|.. T Consensus 274 ~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~----------- 342 (415) T protein:vir:98 274 LDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLG----------- 342 (415) T ss_pred hhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecccccC----------- Confidence 888999999998888888999999999999999999999999998888888999999999998888743 Q ss_pred ccccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 238 RTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 238 ~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ..+...++||||+. |.+..+.+++++.++ |.++...+|+.+|+|+++.||+||++++..+++ T Consensus 343 --~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~----------~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~ 405 (415) T protein:vir:98 343 --QKGNNTLIIGNLKDAIVLFDRSQYQASWTD----------YMHFGECLMIAVRQDCRILDYKSAIVIEYDDSE 405 (415) T ss_pred --CCCccEEEEEehhccEEEEeecceEEEEec----------cccCceEEEEEEEeccEEeccccEEEEEEeccC Confidence 12344689999997 557888999988764 345667899999999999999999999988888 No 60 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=100.00 E-value=8.8e-54 Score=311.46 Aligned_cols=279 Identities=14% Similarity=0.048 Sum_probs=231.2 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceE--EEEEeCCceeEEeecCcccccc-ccceeEEEEee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQ--YMTLTAPPRGEVVGEGAQKSES-TATFAPVTAIP 77 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~--~p~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~l~~ 77 (311) ..+++.||++||+++.+.|++.+++.++|++++++++|+++... +|+.++...+.|++|++++|+. .++|+++++++ T Consensus 123 ~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~ 202 (415) T protein:vir:81 123 SLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDI 202 (415) T ss_pred cccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeee Confidence 23556688999999999999999999999999999999877655 4556677889999999999975 68999999999 Q ss_pred eeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccch Q lcl|Aclame:pro 78 RKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATP 157 (311) Q Consensus 78 ~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (311) +|++++++||+||++++ .++++++|.+++++++++++|.++++|++.+.. ..+... ........+...... T Consensus 203 ~k~~~~~~iS~ell~ds---~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~--~~~~~~----~~~~~~~~~~~~~~~ 273 (415) T protein:vir:81 203 NTHRGYFRISREAIEDA---KVNVLQELKLWMARTIAATRNKAIIDVITKGST--GSTSSG----FEKEGKKLEVKKAKS 273 (415) T ss_pred eeeEeeehhhHHHHhhc---hHHHHHHHHHHHHHHHHHHHHHHHhhccccCcc--cccccc----ccccccccccccccc Confidence 99999999999999644 568999999999999999999999999654332 221111 111122233445567 Q ss_pred HHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccccccccccccccccc Q lcl|Aclame:pro 158 DLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVY 237 (311) Q Consensus 158 ~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~ 237 (311) |+++.+++..+...++.+++|+|||++|..|+++||++|+|+|.+....+.+++|+|+||++++.+|.. T Consensus 274 ~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~----------- 342 (415) T protein:vir:81 274 LDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLG----------- 342 (415) T ss_pred hhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecccccC----------- Confidence 888999999998888888999999999999999999999999998888888999999999998888743 Q ss_pred ccccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 238 RTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 238 ~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ..+...++||||+. |.+..+.+++++.++ |.++...+|+.+|+|+++.||+||++++..+++ T Consensus 343 --~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~----------~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~ 405 (415) T protein:vir:81 343 --QKGNNTLIIGNLKDAIVLFDRSQYQASWTD----------YMHFGECLMIAVRQDCRILDYKSAIVIEYDDSE 405 (415) T ss_pred --CCCccEEEEEehhccEEEEeecceEEEEec----------cccCceEEEEEEEeccEEeccccEEEEEEeccC Confidence 12344689999997 557888999988764 345667899999999999999999999988888 No 61 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=100.00 E-value=3.6e-54 Score=313.60 Aligned_cols=276 Identities=14% Similarity=0.134 Sum_probs=228.3 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCccccccc-cceeEEEEeeee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSEST-ATFAPVTAIPRK 79 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~-~~~~~v~l~~~k 79 (311) +.+++.||++||+++.++|++.+++.++++++|+++++. ++.++|+..+.+.+.|++|++++|+++ ++|++|+++++| T Consensus 140 ~~~~~~gg~~vP~~~~~~Ii~~l~~~~~i~~~~~~~~~~-g~~~ip~~~~~~~a~~v~E~~~~~~~~~~~f~~i~l~~~k 218 (425) T protein:vir:95 140 LRAVAGGELTIPEVVVNRIMDIMGDYTTLYPLVDKIRVK-GTTRILVDTDTSPATWIEQSGALPTGDVGTIASIDFDGFK 218 (425) T ss_pred hcccccCceeccHHHHHHHHHHHHhhhhHHHhhceeecC-ceeEEEEecCCccccccccccccccccccccceeeeehee Confidence 445677899999999999999999999999999999986 678999999999999999999999877 789999999999 Q ss_pred EEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchHH Q lcl|Aclame:pro 80 VQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPDL 159 (311) Q Consensus 80 l~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (311) ++++++||+|+++++ .++++++|..+++++|++++|.++|+|+|.+.. .|.|+.+.+.... ..........++ T Consensus 219 ~~~~~~iS~ell~ds---~~~l~~~i~~~l~~~i~~~~d~~il~G~G~~~~-~p~Gil~~~~~~~---~~~~~~~~~~~~ 291 (425) T protein:vir:95 219 VGKVTFVDNYLLQDS---IINLDDYVTKKIARAIAKALDLAIVKGTGAANK-QPLGIIPSLPPEN---QVTVEADNNLLK 291 (425) T ss_pred eeeeehhhHHHHhcc---HHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcc-ccceeeccccccc---ccccccccchHH Confidence 999999999999644 567999999999999999999999999754322 3344443322222 222334456778 Q ss_pred HHHHHHHHHhhcCC--CccEEEEcHHHH----HHHHHhhccCCceeeccccccCCCceecceeEEeeccccccccccccc Q lcl|Aclame:pro 160 AVEAAVGLVLGDNL--SPDGVALDNTFS----FMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTAS 233 (311) Q Consensus 160 ~i~~~~~~~~~~~~--~~~~~v~n~~~~----~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~ 233 (311) ++.+++..+..... ...+|+||+.++ ..|+++||++|||+|... ....++|+|+||++++.+|.+ T Consensus 292 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~l~~l~~~kd~~g~~i~~~~--~~~~~~l~G~pvv~~~~~~~~------- 362 (425) T protein:vir:95 292 NLVKQIGLIDTGDDSVGEIVAVMKRSTYYNRLVEFSIQVDSNGNVVGKLP--NLRTPDLLGLRVVFNNFLDDD------- 362 (425) T ss_pred HHHHHHHhhhhhccccCceEEEEeChHHHHHHHHHHhhcCCCCceeeccC--CCCCccccceeeEEcCcCCCc------- Confidence 88888877655433 344699999875 346788999999999743 334578999999999998743 Q ss_pred ccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 234 TGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 234 ~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .++||||+.|.++.+.+++++++++. .|.+|++.||++.|+|+++++|+||++++.+++. T Consensus 363 -----------~i~~Gd~~~~~~~~~~~~~i~~~~~~-------~f~~~~~~~~~~~r~d~~~~~~~a~~~~~i~~~~ 422 (425) T protein:vir:95 363 -----------TVLFGEFEQYTLVERENITIDSSTHV-------KFTEDQTAFRGKGRFDGKPVKPEAFVLVTITDPV 422 (425) T ss_pred -----------cEEEEecccEEEEeecceEEEeeccc-------ccccCceEEEEEEeeCcEeecccceEEEEecCcC Confidence 57899999999999999999988753 4899999999999999999999999999999976 No 62 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=100.00 E-value=2.2e-54 Score=314.79 Aligned_cols=268 Identities=15% Similarity=0.173 Sum_probs=228.4 Q ss_pred Cc--ccCCCceEcchhH-HHHHHHHHHhhchhhhh-cceeecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEe Q lcl|Aclame:pro 1 MV--ALATGTFQLPKHL-VPGVWQKAQGQSVLARL-SMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAI 76 (311) Q Consensus 1 ma--t~~~g~~~vP~~~-~~~ii~~~~~~s~l~~l-~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~ 76 (311) |. +.++||++||+++ .++||+.+++.++++++ ++.+++.++++++|+.++++.++|++|++++++++++|++++++ T Consensus 357 ~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l~~~~~~~~~g~~~ip~~~~~~~a~wv~E~~~~~~s~~~f~~i~l~ 436 (632) T protein:vir:96 357 LEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFS 436 (632) T ss_pred hhcccccccccccccccchHHHHHHHhhcchhhhhcceEeecCCcceEEEEEeCCceeEeecCCccccccccceeeEEee Confidence 22 5567899999886 68999999999999999 57789888999999999999999999999999999999999999 Q ss_pred eeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccc Q lcl|Aclame:pro 77 PRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSAT 156 (311) Q Consensus 77 ~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (311) ++|++++++||+|||.++ .++++++|++.|++++++++|.++|+|++. +.. |.|+++.+.+.......... T Consensus 437 ~~k~~~~v~iS~ell~ds---~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~--~~~----p~Gi~~~~~~~~~~~~~~~~ 507 (632) T protein:vir:96 437 PKTIAGAVPVTRKLRKQS---SIHVENLIREDLIEGIGVALDLAMLTGTGL--AND----PVGLLNMTGVPALTYPAGGV 507 (632) T ss_pred eeEEEEehhhHHHHHhcc---chHHHHHHHHHHHHHHHHHHHHHhhcccCC--CCc----cceeeecccccceecccccC Confidence 999999999999999654 467999999999999999999999998542 222 44555555444444444556 Q ss_pred hHHHHHHHHHHHhhcCC--CccEEEEcHHHHHHHHH--hhccCCceeeccccccCCCceecceeEEeecccccccccccc Q lcl|Aclame:pro 157 PDLAVEAAVGLVLGDNL--SPDGVALDNTFSFMLAT--QRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTA 232 (311) Q Consensus 157 ~~~~i~~~~~~~~~~~~--~~~~~v~n~~~~~~l~~--lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~ 232 (311) .|+++.+++.++...+. ...+|+||+..+..+++ ++|++|+|+|.+ ++++|+||++++++|.+ T Consensus 508 ~~~~i~~~~~~i~~~~~~~~~~~~~~~~~~~~~l~~~~l~d~~G~~i~~~-------~~l~G~pv~~s~~ip~~------ 574 (632) T protein:vir:96 508 DWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYRAEASNQIPAD------ 574 (632) T ss_pred CHHHHHHHHHHHhhcccccCccEEEEchhHHHHHHHHhccCCCCceeecC-------CeecccceEeccccccC------ Confidence 78888888888876654 34579999988777765 789999999963 58999999999999854 Q ss_pred cccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecc Q lcl|Aclame:pro 233 STGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDAD 309 (311) Q Consensus 233 ~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa 309 (311) .+++|||+.+.++++.+++|+++++. .|.+|++.||+++|+|++++||++|+++|.+| T Consensus 575 ------------~~~~gd~s~~~i~~~~~~~i~~~~~~-------~~~~~~v~~~~~~~~d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 575 ------------TWIFGDWSQIVIAMWGVLDLKVDPYT-------KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) T ss_pred ------------cEEEeecceEEEEEecceEEEEcccc-------ccccCceEEEEEeecCceeechhhhhheeecC Confidence 47899999999999999999998874 37899999999999999999999999999999 No 63 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=100.00 E-value=1e-53 Score=311.04 Aligned_cols=271 Identities=13% Similarity=0.079 Sum_probs=224.8 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCc--eEEEEEeCCceeEEeecCcccccc-ccceeEEEEee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE--QQYMTLTAPPRGEVVGEGAQKSES-TATFAPVTAIP 77 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~--~~~p~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~l~~ 77 (311) ..+++.||++||+++.++|++.+++.++|+++|++++++++. ..+|+..+++.+.|++|+++++++ .++|+++++.+ T Consensus 108 ~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~ 187 (392) T protein:vir:10 108 GLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAV 187 (392) T ss_pred ccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeecccccccccccccceeEEeee Confidence 335567899999999999999999999999999999988655 567777888899999999999976 59999999999 Q ss_pred eeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccch Q lcl|Aclame:pro 78 RKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATP 157 (311) Q Consensus 78 ~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (311) +|++++++||+|++++ +.++++++|.+.+++++++++|.++++|++.... ..... T Consensus 188 ~k~~~~~~iS~ell~d---s~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~----------------------~~~~~ 242 (392) T protein:vir:10 188 KDRAGILPLSRSLLQD---SDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK----------------------QAIKS 242 (392) T ss_pred eeEEEeehhhHHHHhh---hHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc----------------------cCccC Confidence 9999999999999964 4578999999999999999999999988542210 11233 Q ss_pred HHHHHHHHH-HHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEee--cccccccccccccc Q lcl|Aclame:pro 158 DLAVEAAVG-LVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVS--DTVRGGPEAVTAST 234 (311) Q Consensus 158 ~~~i~~~~~-~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~--~~~~~~~~~~~~~~ 234 (311) ++++.+++. .+.+.....+.|+|||.+|..|+++||++|+|+|.+....+.+++|+|+|+++. +..+... T Consensus 243 ~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~------- 315 (392) T protein:vir:10 243 LDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSK------- 315 (392) T ss_pred HHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEecccccCCC------- Confidence 566766654 555655566679999999999999999999999998888888999999876542 2223221 Q ss_pred cccccccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 235 GVYRTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 235 ~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ....+...+++|||+. |.+..+.+++++++++.+ .+|++|++.+|++.|+|+++++|+||++++.++++ T Consensus 316 ---~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~-----~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a 385 (392) T protein:vir:10 316 ---GTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGG-----KAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSA 385 (392) T ss_pred ---cccCCceEEEEEehhceEEEEeecceEEEEecccc-----chhhcCceEEEEEEeeccEEecccceEEEEecccc Confidence 2344566899999997 678999999999987643 46999999999999999999999999999988877 No 64 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=100.00 E-value=1e-53 Score=311.04 Aligned_cols=271 Identities=13% Similarity=0.079 Sum_probs=224.8 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCc--eEEEEEeCCceeEEeecCcccccc-ccceeEEEEee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE--QQYMTLTAPPRGEVVGEGAQKSES-TATFAPVTAIP 77 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~--~~~p~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~l~~ 77 (311) ..+++.||++||+++.++|++.+++.++|+++|++++++++. ..+|+..+++.+.|++|+++++++ .++|+++++.+ T Consensus 108 ~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~ 187 (392) T protein:vir:10 108 GLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAV 187 (392) T ss_pred ccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeecccccccccccccceeEEeee Confidence 335567899999999999999999999999999999988655 567777888899999999999976 59999999999 Q ss_pred eeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccch Q lcl|Aclame:pro 78 RKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATP 157 (311) Q Consensus 78 ~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (311) +|++++++||+|++++ +.++++++|.+.+++++++++|.++++|++.... ..... T Consensus 188 ~k~~~~~~iS~ell~d---s~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~----------------------~~~~~ 242 (392) T protein:vir:10 188 KDRAGILPLSRSLLQD---SDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK----------------------QAIKS 242 (392) T ss_pred eeEEEeehhhHHHHhh---hHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc----------------------cCccC Confidence 9999999999999964 4578999999999999999999999988542210 11233 Q ss_pred HHHHHHHHH-HHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEee--cccccccccccccc Q lcl|Aclame:pro 158 DLAVEAAVG-LVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVS--DTVRGGPEAVTAST 234 (311) Q Consensus 158 ~~~i~~~~~-~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~--~~~~~~~~~~~~~~ 234 (311) ++++.+++. .+.+.....+.|+|||.+|..|+++||++|+|+|.+....+.+++|+|+|+++. +..+... T Consensus 243 ~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~------- 315 (392) T protein:vir:10 243 LDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSK------- 315 (392) T ss_pred HHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEecccccCCC------- Confidence 566766654 555655566679999999999999999999999998888888999999876542 2223221 Q ss_pred cccccccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 235 GVYRTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 235 ~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ....+...+++|||+. |.+..+.+++++++++.+ .+|++|++.+|++.|+|+++++|+||++++.++++ T Consensus 316 ---~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~-----~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a 385 (392) T protein:vir:10 316 ---GTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGG-----KAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSA 385 (392) T ss_pred ---cccCCceEEEEEehhceEEEEeecceEEEEecccc-----chhhcCceEEEEEEeeccEEecccceEEEEecccc Confidence 2344566899999997 678999999999987643 46999999999999999999999999999988877 No 65 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=100.00 E-value=1e-53 Score=311.04 Aligned_cols=271 Identities=13% Similarity=0.079 Sum_probs=224.8 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCc--eEEEEEeCCceeEEeecCcccccc-ccceeEEEEee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE--QQYMTLTAPPRGEVVGEGAQKSES-TATFAPVTAIP 77 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~--~~~p~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~l~~ 77 (311) ..+++.||++||+++.++|++.+++.++|+++|++++++++. ..+|+..+++.+.|++|+++++++ .++|+++++.+ T Consensus 108 ~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~ 187 (392) T protein:vir:10 108 GLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAV 187 (392) T ss_pred ccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeecccccccccccccceeEEeee Confidence 335567899999999999999999999999999999988655 567777888899999999999976 59999999999 Q ss_pred eeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccch Q lcl|Aclame:pro 78 RKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATP 157 (311) Q Consensus 78 ~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (311) +|++++++||+|++++ +.++++++|.+.+++++++++|.++++|++.... ..... T Consensus 188 ~k~~~~~~iS~ell~d---s~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~----------------------~~~~~ 242 (392) T protein:vir:10 188 KDRAGILPLSRSLLQD---SDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK----------------------QAIKS 242 (392) T ss_pred eeEEEeehhhHHHHhh---hHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc----------------------cCccC Confidence 9999999999999964 4578999999999999999999999988542210 11233 Q ss_pred HHHHHHHHH-HHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEee--cccccccccccccc Q lcl|Aclame:pro 158 DLAVEAAVG-LVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVS--DTVRGGPEAVTAST 234 (311) Q Consensus 158 ~~~i~~~~~-~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~--~~~~~~~~~~~~~~ 234 (311) ++++.+++. .+.+.....+.|+|||.+|..|+++||++|+|+|.+....+.+++|+|+|+++. +..+... T Consensus 243 ~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~------- 315 (392) T protein:vir:10 243 LDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSK------- 315 (392) T ss_pred HHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEecccccCCC------- Confidence 566766654 555655566679999999999999999999999998888888999999876542 2223221 Q ss_pred cccccccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 235 GVYRTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 235 ~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ....+...+++|||+. |.+..+.+++++++++.+ .+|++|++.+|++.|+|+++++|+||++++.++++ T Consensus 316 ---~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~-----~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a 385 (392) T protein:vir:10 316 ---GTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGG-----KAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSA 385 (392) T ss_pred ---cccCCceEEEEEehhceEEEEeecceEEEEecccc-----chhhcCceEEEEEEeeccEEecccceEEEEecccc Confidence 2344566899999997 678999999999987643 46999999999999999999999999999988877 No 66 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=100.00 E-value=1e-53 Score=311.04 Aligned_cols=271 Identities=13% Similarity=0.079 Sum_probs=224.8 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCc--eEEEEEeCCceeEEeecCcccccc-ccceeEEEEee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE--QQYMTLTAPPRGEVVGEGAQKSES-TATFAPVTAIP 77 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~--~~~p~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~l~~ 77 (311) ..+++.||++||+++.++|++.+++.++|+++|++++++++. ..+|+..+++.+.|++|+++++++ .++|+++++.+ T Consensus 108 ~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~ 187 (392) T protein:vir:10 108 GLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAV 187 (392) T ss_pred ccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeecccccccccccccceeEEeee Confidence 335567899999999999999999999999999999988655 567777888899999999999976 59999999999 Q ss_pred eeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccch Q lcl|Aclame:pro 78 RKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATP 157 (311) Q Consensus 78 ~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (311) +|++++++||+|++++ +.++++++|.+.+++++++++|.++++|++.... ..... T Consensus 188 ~k~~~~~~iS~ell~d---s~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~----------------------~~~~~ 242 (392) T protein:vir:10 188 KDRAGILPLSRSLLQD---SDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK----------------------QAIKS 242 (392) T ss_pred eeEEEeehhhHHHHhh---hHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc----------------------cCccC Confidence 9999999999999964 4578999999999999999999999988542210 11233 Q ss_pred HHHHHHHHH-HHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEee--cccccccccccccc Q lcl|Aclame:pro 158 DLAVEAAVG-LVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVS--DTVRGGPEAVTAST 234 (311) Q Consensus 158 ~~~i~~~~~-~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~--~~~~~~~~~~~~~~ 234 (311) ++++.+++. .+.+.....+.|+|||.+|..|+++||++|+|+|.+....+.+++|+|+|+++. +..+... T Consensus 243 ~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~------- 315 (392) T protein:vir:10 243 LDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSK------- 315 (392) T ss_pred HHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEecccccCCC------- Confidence 566766654 555655566679999999999999999999999998888888999999876542 2223221 Q ss_pred cccccccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 235 GVYRTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 235 ~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ....+...+++|||+. |.+..+.+++++++++.+ .+|++|++.+|++.|+|+++++|+||++++.++++ T Consensus 316 ---~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~-----~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a 385 (392) T protein:vir:10 316 ---GTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGG-----KAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSA 385 (392) T ss_pred ---cccCCceEEEEEehhceEEEEeecceEEEEecccc-----chhhcCceEEEEEEeeccEEecccceEEEEecccc Confidence 2344566899999997 678999999999987643 46999999999999999999999999999988877 No 67 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=100.00 E-value=2.4e-53 Score=309.10 Aligned_cols=279 Identities=13% Similarity=0.055 Sum_probs=231.0 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceE--EEEEeCCceeEEeecCccccc-cccceeEEEEee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQ--YMTLTAPPRGEVVGEGAQKSE-STATFAPVTAIP 77 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~--~p~~~~~~~a~~v~Eg~~~~~-~~~~~~~v~l~~ 77 (311) ..++++|+++||+++.++|++.+++.++|++++++++++++... +|+.++.+.+.|++|++++|+ +.++|+++++++ T Consensus 123 ~~~~~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~i~~~~ 202 (415) T protein:vir:94 123 SLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDI 202 (415) T ss_pred ccccccccccCcHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEeecCCccceeccccccccccccccceeeEeeh Confidence 23456789999999999999999999999999999999877655 455567788999999999996 468999999999 Q ss_pred eeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccch Q lcl|Aclame:pro 78 RKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATP 157 (311) Q Consensus 78 ~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (311) +|++++++||+|+++++ .++++++|.+++++++++++|.++++|++.+.... +. ..... .....+.+.... T Consensus 203 ~k~~~~~~is~ell~ds---~~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~--~~-~~~~~---~~~~~~~~~~~~ 273 (415) T protein:vir:94 203 NTHRGYFRISREAIEDA---KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGS--TS-SGFEK---EGKKLEVKKAKS 273 (415) T ss_pred eeeeeechhhHHHHhhc---hHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccc--cc-ccccc---cccccccccccc Confidence 99999999999999644 57899999999999999999999999965433221 11 11111 112223334466 Q ss_pred HHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccccccccccccccccc Q lcl|Aclame:pro 158 DLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVY 237 (311) Q Consensus 158 ~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~ 237 (311) ++++.+++..+...++.+++|+|||++|.+|+++||++|+|+|.+.+..+.+++|+|+||++++.+|.+. T Consensus 274 ~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~---------- 343 (415) T protein:vir:94 274 LDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQ---------- 343 (415) T ss_pred hHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCceecceeeEEecccccCC---------- Confidence 8889999999988888889999999999999999999999999988888889999999999999887431 Q ss_pred ccccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 238 RTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 238 ~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .+...+++|||+. |.+..+.+++++.++ |.++++.+|++.|+|+++.+|+||++++..+.+ T Consensus 344 ---~~~~~i~~gd~~~~~~~~~~~~~~v~~~~----------~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~ 405 (415) T protein:vir:94 344 ---KGNNTLIIGNLKDAIVLFDRSQYQASWTD----------YMHFGECLMIAVRQDCRILDYKSAIVIEYDDSE 405 (415) T ss_pred ---CCccEEEEEehhccEEEEeecceEEEEec----------cccCceEEEEEEEeccEEeccccEEEEEEeccC Confidence 2334689999997 567788889888654 456778899999999999999999999987777 No 68 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=100.00 E-value=2e-53 Score=309.51 Aligned_cols=267 Identities=13% Similarity=0.017 Sum_probs=224.2 Q ss_pred Cc--ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCC--CceEEEEEeCCceeEEeecCccccc-cccceeEEEE Q lcl|Aclame:pro 1 MV--ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEF--GEQQYMTLTAPPRGEVVGEGAQKSE-STATFAPVTA 75 (311) Q Consensus 1 ma--t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~--~~~~~p~~~~~~~a~~v~Eg~~~~~-~~~~~~~v~l 75 (311) |. +.+.||++||+++++.|++.+++.++|+++|+++++++ +.+.+|+.++.+.++|++|++++|+ +.++|+++++ T Consensus 123 ~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~~v~~ 202 (397) T protein:vir:12 123 MSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQYVTVEPVTTRSGTRLLEKNADMVPFSPVEELGNLPEIDQPRFTKVSY 202 (397) T ss_pred ccccccccCcccCchhHHHHHHHhhhhhhhHHhhcceeeccCCceeEEEEEecCCcceeeecccccccccccccceeEEe Confidence 44 44668999999999999999999999999999999875 4567778888899999999999997 5699999999 Q ss_pred eeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeecccccc Q lcl|Aclame:pro 76 IPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSA 155 (311) Q Consensus 76 ~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (311) +++|++++++||+|+++++ .++++++|...+++++++++|.++++|++.+. +.+ . T Consensus 203 ~~~k~~~~~~is~e~l~ds---~~~l~~~i~~~l~~~~~~~~d~~il~G~g~~~-------~~g---------------~ 257 (397) T protein:vir:12 203 SIIDYGGIMTLSNSMLNDS---DQAIMTYVAKWFAKKSVVTRNNLILAAIASLK-------KVD---------------I 257 (397) T ss_pred eheeeEeeehhhHHHHhhc---hHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-------ccc---------------c Confidence 9999999999999999644 57899999999999999999999999965321 111 1 Q ss_pred chHHHHHHHH-HHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecc-ccccccccccc Q lcl|Aclame:pro 156 TPDLAVEAAV-GLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDT-VRGGPEAVTAS 233 (311) Q Consensus 156 ~~~~~i~~~~-~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~-~~~~~~~~~~~ 233 (311) ..++++..++ ..+.+.....++|+|||.+|.+|+++||++|+|+|.+....+.+++|+|+||++++. +|. T Consensus 258 ~~~~~i~~~~~~~l~~~~~~~a~~~~n~~~~~~L~~lkd~~G~~l~~~~~~~g~~~~l~G~pv~~~~~~~~~-------- 329 (397) T protein:vir:12 258 DGLDGIKKALNVTLDPMVAPGSIVLTNQDGYDWLDTLKDGTGRYLLQPDPTNPTKKLLDGRPVVPFTNRVLK-------- 329 (397) T ss_pred ccHHHHHHHHhhccchhhhCCCEEEEcHHHHHHHHHhhccCCceeecccccCCCCccccceeeEEecccccc-------- Confidence 2355666555 466666666678999999999999999999999999888888899999999986554 331 Q ss_pred ccccccccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccc Q lcl|Aclame:pro 234 TGVYRTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADE 310 (311) Q Consensus 234 ~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~ 310 (311) ...+...+++|||+. +.+..+++++++++++.+ ..|++|++.+|+++|+|+++++|+||++++.+++ T Consensus 330 -----~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~-----~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 330 -----TQKGKAPLIIGNLKEAIVLFDREQQSIASTDTGA-----GAFETNSTKVRGIEREDVRKWDEDAVVFGQITVE 397 (397) T ss_pred -----cCCCccEEEEEehhceEEEEeecceEEEEecccc-----chhhcCceEEEEEEeeccEEecccceEEEEEeeC Confidence 224455799999997 568889999999876543 3599999999999999999999999999999999 No 69 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=100.00 E-value=1.7e-53 Score=309.93 Aligned_cols=280 Identities=14% Similarity=0.128 Sum_probs=222.7 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEe---ecCccccccccceeEEEEee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVV---GEGAQKSESTATFAPVTAIP 77 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v---~Eg~~~~~~~~~~~~v~l~~ 77 (311) ..+++.||++||+++.++|++.++++++++++|+++++. +++++|+...++.+.|. +|++.+|+++++|+++++++ T Consensus 144 ~~~t~~GG~lvP~~~~~~Ii~~l~~~~~i~~~~~~~~~~-~~~~~p~~~~~~~a~~~~~~~e~~~~~~~~~~f~~v~~~~ 222 (434) T protein:vir:62 144 GLVTGNGSVTIPDFLSKEIITYAQEENFLRRLGTGVKTK-ENIKYPVLVKKAEAQGHKNERTNNEMPETDIEFDEIELSP 222 (434) T ss_pred cccccccceecchhhHHHHHHhhhhhhhhhhhcceeccC-CceEEEEEecCCcccceecccccccccccccceeeEEeeh Confidence 124456899999999999999999999999999998876 56899999888777775 56788999999999999999 Q ss_pred eeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccch Q lcl|Aclame:pro 78 RKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATP 157 (311) Q Consensus 78 ~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (311) ||++++++||+||+++ +.++++++|.+++++++++++|.+||+|+|. +..+.++ ...... + ...+.... T Consensus 223 ~k~~~~~~iS~ell~d---s~~~l~~~i~~~la~~~~~~~d~~~l~G~G~--~~~~~g~----~~~~~~-~-~~~~~~~~ 291 (434) T protein:vir:62 223 TEFDALATVTKKLLAR---TGLPIEQIVMDELKKAYVRKETQYMVNGDEA--NNINDGA----LAKKAV-E-FKTDEKNL 291 (434) T ss_pred eeeEeehhhHHHHHhc---chHHHHHHHHHHHHHHHHHHHHHHHhccCCC--Cccccce----eecccc-c-ccccccch Confidence 9999999999999964 4578999999999999999999999999653 2333332 222211 1 22334466 Q ss_pred HHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeecccc--ccCCCceecceeEEeeccccccccccccccc Q lcl|Aclame:pro 158 DLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELG--FGTDVASFAGLNAAVSDTVRGGPEAVTASTG 235 (311) Q Consensus 158 ~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~--~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~ 235 (311) ++++.+++..+.......+.|+|||.++..|+++||++|||+|++.. .++.+++|+|+||++++.+|... T Consensus 292 ~d~l~~l~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~g~~~tl~G~pV~~~~~~~~~~-------- 363 (434) T protein:vir:62 292 YDALVKMKNTPVKEVRKKARWVLNTAALTKIETMKTDDGFPLLRPFNQAEGGIGYTLLGFPVEEEDAIDIPD-------- 363 (434) T ss_pred hhHHHHHHhhcchhhhcCCEEEEcHHHHHHHHHhhccCCCEeeccCCCccCCCCceecceeeEEecCccCcc-------- Confidence 88898999988877666778999999999999999999999998644 45667899999999999987431 Q ss_pred ccccccccceEEEeecceEEEEeecC-ceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEec-ccceEEEE----ecc Q lcl|Aclame:pro 236 VYRTTNPNVKAIAGDFSAFRWGVQVS-IPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMS-TDAFAVVR----DAD 309 (311) Q Consensus 236 ~~~~~~~~~~~~~gd~~~~~~~~~~~-~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~-~~a~~~l~----~aa 309 (311) ..+...++||||+.|.+..+.+ ++++++.+. +|.+|+|.||++.|+|+++++ |.++++++ .++ T Consensus 364 ----~~~~~~i~~Gdfs~~~i~~~~g~~~i~~~~~~-------~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~~~~ 432 (434) T protein:vir:62 364 ----SPDTPVFYFGDFSKFYIQDVIGSLEVQKLVEL-------FSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYVLKAPT 432 (434) T ss_pred ----CCCceEEEEeeccceEEEEeeceeEEEeehhh-------hcccCceEEEEEeeecceeecCcccceEEEEEeccCC Confidence 1233458899999998877754 667765442 478999999999999999886 88876663 333 Q ss_pred cC Q lcl|Aclame:pro 310 ES 311 (311) Q Consensus 310 ~~ 311 (311) ++ T Consensus 433 ~~ 434 (434) T protein:vir:62 433 GA 434 (434) T ss_pred CC Confidence 33 No 70 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=100.00 E-value=3e-53 Score=308.51 Aligned_cols=267 Identities=12% Similarity=0.003 Sum_probs=227.6 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC--CceeEEeecCccccccccceeEEEEeee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTA--PPRGEVVGEGAQKSESTATFAPVTAIPR 78 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~--~~~a~~v~Eg~~~~~~~~~~~~v~l~~~ 78 (311) |.+.+.++.+||+++...|++.++..++++++|+++++.++.+.||+.++ ...+.|++||+.+|+++++|++++++++ T Consensus 109 ~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~f~~i~~~~~ 188 (379) T protein:vir:10 109 MTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGAVSISGGTYTFVRENGAGEGAIGAQVEGATKGQKDYDISMIDVNTD 188 (379) T ss_pred cccCCCCccccchhhhhHHHHhHHhhhhHHhhceeeeccCCceEEEEeecCCCcccccccCCccccccccceeeeEeeee Confidence 66667777789999999999999999999999999999999999999874 3467899999999999999999999999 Q ss_pred eEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchH Q lcl|Aclame:pro 79 KVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPD 158 (311) Q Consensus 79 kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (311) |++++++||+|||+++ .+++++|.+++++++++++|.+++.|.+..+. .+ ....+..... T Consensus 189 k~~~~~~iS~ell~D~----~~l~~~i~~~la~~~~~~~~~~~~~g~~~~~~---~~-------------~~~~~~~~~~ 248 (379) T protein:vir:10 189 FIAGFTRYSKKMANNL----PFLTSFIPNALRRDYAKAENAAFNAVLAANAT---AS-------------TEIITNKNKV 248 (379) T ss_pred eEEeeehhhHHHHhhH----HHHHHHHHHHHHHHHHHHHHHHHhcccccccc---cc-------------cccccCcccH Confidence 9999999999999643 35999999999999999999999988542211 00 0111222345 Q ss_pred HHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccc--cCCCceecceeEEeecccccccccccccccc Q lcl|Aclame:pro 159 LAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGF--GTDVASFAGLNAAVSDTVRGGPEAVTASTGV 236 (311) Q Consensus 159 ~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~--~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~ 236 (311) +.+.+++..+...++.+++|+|||.+|..|+++||++|+|+|++... .+.+.+|+|+||++++.+|.+ T Consensus 249 d~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~l~G~pvv~s~~~~ag---------- 318 (379) T protein:vir:10 249 EMLINEIAKQENLDFPVTAIVLRPTDYYDILVTQKSVGAGYGLPGVVTQDNGVLRINGIPLFRATWLAAN---------- 318 (379) T ss_pred HHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCceeccCCccCCCCCcceecceeeEecCCCCCC---------- Confidence 67888888888899999999999999999999999999999986554 455579999999999888743 Q ss_pred cccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccc Q lcl|Aclame:pro 237 YRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADE 310 (311) Q Consensus 237 ~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~ 310 (311) .+++|||+.+.+..+++++++++++.. ++|++|++.||+++|+|+.++||+||++++.++= T Consensus 319 --------~~~~gdf~~~~~~~~~~~~i~~~~~~~-----~~f~~~~~~~r~~~R~~~~v~~p~a~v~~~~~~~ 379 (379) T protein:vir:10 319 --------KYYVGDWTRVTKVTTEGLSLEFSEVEG-----TNFVKNNITARIEAQVALAVEQPAALIFGDFTAV 379 (379) T ss_pred --------ceEEeecccEEEEEEeceEEEEeeccc-----ccccCCcEEEEEEEEeccEEecCccEEEEEecCC Confidence 578999999999999999999876532 3599999999999999999999999999998877 No 71 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=100.00 E-value=1e-52 Score=305.61 Aligned_cols=269 Identities=9% Similarity=0.024 Sum_probs=220.7 Q ss_pred Cc--ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEE--EEe-CCceeEEeecCcccccc-ccceeEEE Q lcl|Aclame:pro 1 MV--ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYM--TLT-APPRGEVVGEGAQKSES-TATFAPVT 74 (311) Q Consensus 1 ma--t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p--~~~-~~~~a~~v~Eg~~~~~~-~~~~~~v~ 74 (311) |. +.+.||++||++++++||+.+++.++|+++|++++++++...+| +.. ..+.+.|++|++++|++ .++|++|+ T Consensus 116 ~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~ 195 (408) T protein:vir:10 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIK 195 (408) T ss_pred hhcccccCCceeccHhHHHHHHHHHHhhchhhhhcceeeccCCcceEEEeeccccccceeeecCccccccccCcceeeEE Confidence 32 45668999999999999999999999999999999987665555 443 34678999999999975 58999999 Q ss_pred EeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccc Q lcl|Aclame:pro 75 AIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTS 154 (311) Q Consensus 75 l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 154 (311) ++++|++++++||+||+++ +.+++.++|++.+++++++++|.+|++|++.+. .. .. T Consensus 196 ~~~~k~~~~~~iS~ell~d---s~~~l~~~i~~~l~~~~~~~~~~~il~g~g~~~--~~-------------------~~ 251 (408) T protein:vir:10 196 YLIKRYAGIITATNTSLKD---TAENILAWLSSWIAKKVVVTRNQAIIEVMKAAP--KK-------------------PT 251 (408) T ss_pred eeeeeEEeeehhHHHHHhh---chHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--cc-------------------cc Confidence 9999999999999999964 457899999999999999999999999864321 10 01 Q ss_pred cchHHHHHHHH-HHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecc--ccccccccc Q lcl|Aclame:pro 155 ATPDLAVEAAV-GLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDT--VRGGPEAVT 231 (311) Q Consensus 155 ~~~~~~i~~~~-~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~--~~~~~~~~~ 231 (311) ...++++..++ ..+.........|+|||.+|..|+++||++|+|+|.+....+.+++|+|+||++.+. +|. T Consensus 252 ~~~~~~l~~~~~~~~~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~------ 325 (408) T protein:vir:10 252 IAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPN------ 325 (408) T ss_pred cccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCceEeccCcCCCCCceecceeeEEecccccCc------ Confidence 12355665554 456555555567999999999999999999999999888888899999999988553 331 Q ss_pred ccccccccccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccc Q lcl|Aclame:pro 232 ASTGVYRTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADE 310 (311) Q Consensus 232 ~~~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~ 310 (311) ...+...+++|||+. +.+..+.+++++++++.. ..|++|++.+|++.|+|+++++|+||++++.++. T Consensus 326 -------~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~~~~-----~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~~~~ 393 (408) T protein:vir:10 326 -------TGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGA-----GAFETDTTKIRVIDRFDVKATDSEALVAGSFSAI 393 (408) T ss_pred -------cCCCceEEEEEehhccEEEEEecceEEEEccccc-----chhhcCceEEEEEEeeccEEeccccEEEEEeecc Confidence 223455799999996 678899999999887643 3599999999999999999999999999998776 Q ss_pred C Q lcl|Aclame:pro 311 S 311 (311) Q Consensus 311 ~ 311 (311) + T Consensus 394 ~ 394 (408) T protein:vir:10 394 A 394 (408) T ss_pred c Confidence 6 No 72 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=100.00 E-value=1e-52 Score=305.61 Aligned_cols=270 Identities=10% Similarity=0.048 Sum_probs=221.0 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceE--EEEEeC-CceeEEeecCcccccc-ccceeEEEEe Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQ--YMTLTA-PPRGEVVGEGAQKSES-TATFAPVTAI 76 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~--~p~~~~-~~~a~~v~Eg~~~~~~-~~~~~~v~l~ 76 (311) -.+.++||++||+++.++|++.+++.++|+++|++++++++... +++... .+.+.|++|++.+|++ +++|++++++ T Consensus 109 ~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~v~~~ 188 (395) T protein:vir:38 109 TTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVENVTTSHGSRVYEKLADITPLKDLDDESALIGDNDDPELTVVKYL 188 (395) T ss_pred cCccCCCceecchhHhhHHHHHHHhhcchhhhcceeeccCCcceEEEEeeccCCccccccccccccccccccceeeEEee Confidence 22445689999999999999999999999999999998766544 444443 5678899999999976 5999999999 Q ss_pred eeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccc Q lcl|Aclame:pro 77 PRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSAT 156 (311) Q Consensus 77 ~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (311) ++|++++++||+||+++ +.++++++|.++|++++++++|.+||+|++.+.. .. ... T Consensus 189 ~~k~~~~~~iS~ell~d---s~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~~--~~-------------------~~~ 244 (395) T protein:vir:38 189 IHRYAGITTVTNTLLKD---TVDNIIQWLVNWAAKKDVVTRNAKILEVMGKAPK--KP-------------------TIS 244 (395) T ss_pred eeeeEeehhhHHHHHhh---hHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc--cc-------------------ccc Confidence 99999999999999964 4578999999999999999999999999653221 10 112 Q ss_pred hHHHHHHHHH-HHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccccccccccccccc Q lcl|Aclame:pro 157 PDLAVEAAVG-LVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTG 235 (311) Q Consensus 157 ~~~~i~~~~~-~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~ 235 (311) .++++.+++. .+........+|+|||.+|..|+++||++|+|+|.+....+.+++|+|+||++++.++.. T Consensus 245 ~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~--------- 315 (395) T protein:vir:38 245 QFDNIKDLENNTLDPAIESTSSFITNQSGYNILSKVKDADGRYLMQPDVTSPDKYLIDGKPVIRIADKWLP--------- 315 (395) T ss_pred cHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCcceeccceeEEecccccC--------- Confidence 2455555554 454544555679999999999999999999999998888888999999999988765422 Q ss_pred ccccccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 236 VYRTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 236 ~~~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ...+...++||||+. +.+..+.+++++++++.+ .+|++|++.+|++.|+|+++.+|+||++++.++.+ T Consensus 316 ---~~~~~~~i~~gd~~~~~~i~~~~~~~i~~~~~~~-----~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~ 384 (395) T protein:vir:38 316 ---DVSGSHPLYFGDLKQGITLFDRQQMQIDTTNVGA-----GSFEHDTTKLRFIDRFDVQLIDDGAFAAASFKTVA 384 (395) T ss_pred ---cCCCcceEEEEeccccEEEEEecceEEEEecccc-----chhhcCceEEEEEEeeccEEecccceEEEEeeccc Confidence 123455789999996 778999999999887643 35999999999999999999999999999988777 No 73 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=100.00 E-value=2.2e-52 Score=303.76 Aligned_cols=271 Identities=9% Similarity=0.020 Sum_probs=221.5 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEE--EEe-CCceeEEeecCccccc-cccceeEEEEe Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYM--TLT-APPRGEVVGEGAQKSE-STATFAPVTAI 76 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p--~~~-~~~~a~~v~Eg~~~~~-~~~~~~~v~l~ 76 (311) -.+.++||++||+++.++|++.+++.++|+++|++++++++...+| +.. ..+.+.|++|++++|+ ++++|++++++ T Consensus 118 ~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~ 197 (404) T protein:vir:39 118 SGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPRLTIIKYL 197 (404) T ss_pred cccccCCceeccHHHHHHHHHHHHhhhhHHhhcceeeccCCcceEEEEeecCCccceeeecCccccccccccceeeEEee Confidence 2355778999999999999999999999999999999987665554 443 3467899999999997 67999999999 Q ss_pred eeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccc Q lcl|Aclame:pro 77 PRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSAT 156 (311) Q Consensus 77 ~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (311) ++|++++++||+|+++++ .++++++|.+++++++++++|+++|+|++.+. +. .... T Consensus 198 ~~k~~~~~~iS~ell~ds---~~~l~~~i~~~l~~~~~~~~d~~il~g~g~~~--~~-------------------~~~~ 253 (404) T protein:vir:39 198 IKRYAGIITATNTLLKDT---AENILAWLSSWIAKKVVVTRNQAIIAAMGTVP--KK-------------------PTIA 253 (404) T ss_pred eeeEEeeehhHHHHHhhc---hHHHHHHHHHHHHHHHHHHHHHHHHhcccccc--cc-------------------cccc Confidence 999999999999999654 57899999999999999999999999965321 10 0112 Q ss_pred hHHHHHHHHH-HHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccccccccccccccc Q lcl|Aclame:pro 157 PDLAVEAAVG-LVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTG 235 (311) Q Consensus 157 ~~~~i~~~~~-~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~ 235 (311) .++++..++. .+.......++|+|||.+|..|+++||++|+|+|.+....+.+++|+|+||++++..+-. T Consensus 254 ~~~~i~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~--------- 324 (404) T protein:vir:39 254 KFDDVITMINTSVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKKVIVVADRWLP--------- 324 (404) T ss_pred cHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCcceecceeEEEecccccC--------- Confidence 2455555544 454554555679999999999999999999999998888888999999999986543211 Q ss_pred ccccccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 236 VYRTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 236 ~~~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ....+...+++|||+. +.+..+++++++++++.. .+|++|++.+|+++|+|+.+.+|+||++++.++.+ T Consensus 325 --~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~-----~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~a 394 (404) T protein:vir:39 325 --NSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGA-----GAFETDTTKIRVIDRFDVKTTDSEALVAGSFTAIA 394 (404) T ss_pred --ccCCCccEEEEEeccccEEEEeecceEEEEeccch-----hhhhhceeeEEEEeeeccEEecccceEEEEeeccc Confidence 1233455799999996 667889999999887643 45999999999999999999999999999977777 No 74 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=100.00 E-value=2e-52 Score=304.00 Aligned_cols=269 Identities=9% Similarity=0.038 Sum_probs=221.5 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceE--EEEEeC-CceeEEeecCccccc-cccceeEEEEe Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQ--YMTLTA-PPRGEVVGEGAQKSE-STATFAPVTAI 76 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~--~p~~~~-~~~a~~v~Eg~~~~~-~~~~~~~v~l~ 76 (311) ..+.+.||++||+++.++|++.+++.++|+++|++++++++... +++..+ +..+.|++|++++++ ++++|++++++ T Consensus 118 ~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~i~~~ 197 (408) T protein:vir:74 118 SGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSSGSRVYEKWTDVTPLKAMDEEDGKIPDLDNPRLTIIKYL 197 (408) T ss_pred ccccCCCceeechhHhhHHHHHHhhhcchhhhcceeeccCCcceEEEEeecCCcccccccccccccccccccceeeEEee Confidence 23556689999999999999999999999999999999876544 555544 456789999999997 67999999999 Q ss_pred eeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccc Q lcl|Aclame:pro 77 PRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSAT 156 (311) Q Consensus 77 ~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (311) ++|++++++||+|+++ |+.++++++|.+++++++++++|+++|+|++.+. .. .... T Consensus 198 ~~k~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~~~~~~d~~il~G~G~~~--~~-------------------~~~~ 253 (408) T protein:vir:74 198 IKRYAGIITATNTLLK---DTAENILAWLSSWIAKKVVVTRNQAIIAAMGTVP--KK-------------------PTIA 253 (408) T ss_pred eeeEEeeehhHHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--cc-------------------cccc Confidence 9999999999999996 4457899999999999999999999999964321 10 0112 Q ss_pred hHHHHHHHH-HHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecc--ccccccccccc Q lcl|Aclame:pro 157 PDLAVEAAV-GLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDT--VRGGPEAVTAS 233 (311) Q Consensus 157 ~~~~i~~~~-~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~--~~~~~~~~~~~ 233 (311) .++++..++ ..+........+|+|||.++..|+++||++|+|+|.+....+.+++|+|+||++++. +| T Consensus 254 ~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~--------- 324 (408) T protein:vir:74 254 NFDDVITMINTSVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLP--------- 324 (408) T ss_pred cHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceEeccCcCCCCCceecceeeEEecCcccc--------- Confidence 245555544 566666666678999999999999999999999999888888899999999987653 33 Q ss_pred ccccccccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 234 TGVYRTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 234 ~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ....+...+++|||+. |.+..+.+++++++++.. ..|++|++.+|+++|+|+++++|+||++++.++.+ T Consensus 325 ----~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~-----~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~ 394 (408) T protein:vir:74 325 ----NSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGA-----GAFETDTTKIRVIDRFDVKATDSEALVAGSFTAIA 394 (408) T ss_pred ----cccCCcceEEEEehhccEEEEEecceEEEEecccc-----chhhcceeeEEEEEeeCcEEecccceEEEEeeccc Confidence 1234556799999996 678889999999887643 34899999999999999999999999999987666 No 75 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=100.00 E-value=1.5e-52 Score=304.67 Aligned_cols=266 Identities=9% Similarity=0.019 Sum_probs=226.8 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCce--eEEeecCccccccccceeEEEEeee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPR--GEVVGEGAQKSESTATFAPVTAIPR 78 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~--a~~v~Eg~~~~~~~~~~~~v~l~~~ 78 (311) +.+.+.||++||+++..+|++.+++.++|+++|+++++.+++.++|+...... +.|++|++++++++++|++++++++ T Consensus 116 ~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~s~~~f~~i~~~~~ 195 (421) T protein:vir:13 116 IMSSTNNGAVIPQEFVNEFEKLKEGYPSLKEHCHVIPVNRNAGKMPVRAGASVDKLANLAKDTELVKAMLKTQPMAYDID 195 (421) T ss_pred ccccCCcceecchhhHHHHHHHHHhhhhhhhhceeeeccCCceEEEEeecCCccceeeccccccccccccceeEEEeeee Confidence 67888899999999999999999999999999999999999999998776544 6679999999999999999999999 Q ss_pred eEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchH Q lcl|Aclame:pro 79 KVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPD 158 (311) Q Consensus 79 kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (311) |++++++||+|++++ +.++++++|++++++++++++|.++++. +.++...+ ....+ T Consensus 196 k~~~~v~iS~ell~d---s~~~l~~~i~~~la~~~~~~~~~~i~~~------------~~g~~~~~---------~~~~~ 251 (421) T protein:vir:13 196 DYGLLAPIDNSLLED---SEINFLEFVNEEFAEFAVNTENAEIVKQ------------AKAVLAEE---------TINDY 251 (421) T ss_pred eeEeehhhhHHHHhh---hHHHHHHHHHHHHHHHHHHHhhhhHhhh------------hhhccccc---------cccch Confidence 999999999999964 4568999999999999999999888743 22222111 12347 Q ss_pred HHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccccccccccccc Q lcl|Aclame:pro 159 LAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYR 238 (311) Q Consensus 159 ~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~ 238 (311) +++.+++..+...++..++|+|||.+|..|+++||++|+|+|++ +..+.+++|+|+||++++.+|.. T Consensus 252 d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~-~~~~~~~tl~G~pV~~~~~~~~~------------ 318 (421) T protein:vir:13 252 AGLVKTINSLVPNARKRAIIVTNSDGRAYLDGLMDKQGRPLLKE-LSDGGDLVFKGRPVIELEESIFD------------ 318 (421) T ss_pred HHHHHHHHHhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceeecC-cCCCCCceecceeeEEecccccc------------ Confidence 78888999998888888899999999999999999999999976 45666889999999999888743 Q ss_pred cccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 239 TTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 239 ~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ..+...+++|||+. |.++++.+++++++++. +|++|++.+|++.|+|+++++++||+.++...-+ T Consensus 319 -~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~-------~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~ 384 (421) T protein:vir:13 319 -VGDETKFIVSDFKTLIKFMDRKQYLIDQSKEA-------GYTKNETIARIIERFDVNSPLDKSSDAEKIRKFG 384 (421) T ss_pred -CCCceEEEEEeccccEEEEEecceEEEeeccc-------ccccCeeEEEEEeeecceeecchhhheeeecccc Confidence 12345789999997 77899999999988763 4999999999999999999999998777655433 No 76 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=100.00 E-value=6e-52 Score=301.40 Aligned_cols=272 Identities=13% Similarity=0.001 Sum_probs=215.9 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCccccc-cccceeEEEEeeee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSE-STATFAPVTAIPRK 79 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~-~~~~~~~v~l~~~k 79 (311) ..+.+.||++||+++.++|++.+++.++++++|+++++.++...+|+.++.+.+.|++|++++++ ++++|+++++++|| T Consensus 86 ~~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~~~~i~~~~~~~~a~~~~E~~~~~~~~~~~f~~i~l~~~k 165 (390) T protein:vir:40 86 GNGFAGVTALLPPTVFERVFEDLTVEHPLLSKINFVNTTATTEWIISVGDVATAWWGPLCAEIKEVLDNGFDKIQTGMYK 165 (390) T ss_pred ccCcccCcccccHHHHHHHHHHHHhhhhhhhhceeeecCCceeEEEEEcCCcceeeeccccccCccccccceeeEeeeee Confidence 44667899999999999999999999999999999999999899999999999999999998875 68999999999999 Q ss_pred EEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchHH Q lcl|Aclame:pro 80 VQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPDL 159 (311) Q Consensus 80 l~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (311) ++++++||+||++++ .++++++|.++++++|++++|++||+|+|. + .|.|+.+.....+.............+. T Consensus 166 ~~~~i~iS~ell~ds---~~~l~~~i~~~la~~i~~~~~~a~l~G~G~--~-~P~Gil~~~~~~~~~~~~~~~~~~~t~~ 239 (390) T protein:vir:40 166 LSAYIPVCNAMLDLG---PSWLDQYVRTILGEAMALGLEAGIVNGSGK--D-QPIGMMRDLNNVTAGEHPVKTATPLTDL 239 (390) T ss_pred EEEeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHhhhhcccCC--C-ccceeeeccccccccccccccccccchh Confidence 999999999999644 568999999999999999999999999753 2 3445543322222211111111222233 Q ss_pred HHHHHHHHHhh-------cCCCccEEEEcHHHH----HHHHHhhccCCceeeccccccCCCceecceeEEeecccccccc Q lcl|Aclame:pro 160 AVEAAVGLVLG-------DNLSPDGVALDNTFS----FMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPE 228 (311) Q Consensus 160 ~i~~~~~~~~~-------~~~~~~~~v~n~~~~----~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~ 228 (311) +...++..+.. ......+|+|||.++ ..+++++|.+|+|+|... ++|+||+++++||.+ T Consensus 240 ~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~~~~~~~d~~G~~v~~~~--------~~g~pvv~~~~~p~~-- 309 (390) T protein:vir:40 240 TPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIYAATSYMTPQGVWVTGIL--------PVPLEIVQSVAVPVG-- 309 (390) T ss_pred hHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHHHHhhccCCCCccccccC--------CCceeEEEcCCCCCC-- Confidence 33332222221 123345799999874 345579999999998543 479999999998743 Q ss_pred cccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEec Q lcl|Aclame:pro 229 AVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDA 308 (311) Q Consensus 229 ~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~a 308 (311) .++||||+.|.++.+.+++++++++. +|.+|++.||++.|+|+++++++||++|+.+ T Consensus 310 ----------------~i~~Gd~s~~~i~~~~~~~v~~~~~~-------~f~~~~~~~r~~~r~dg~v~~~~A~~~l~~~ 366 (390) T protein:vir:40 310 ----------------KAVAGRAKDYFMGIGSEQVIRTSTEY-------RLLDDETLYYAKQYANGRPKDNSSFLVFDIT 366 (390) T ss_pred ----------------cEEEEeeceEEEEeecceEEEecchh-------hhhcCcEEEEEEEEeCCEEecccceEEEEee Confidence 48899999999999999999987653 4899999999999999999999999999977 Q ss_pred ccC Q lcl|Aclame:pro 309 DES 311 (311) Q Consensus 309 a~~ 311 (311) +.+ T Consensus 367 ~~~ 369 (390) T protein:vir:40 367 GLE 369 (390) T ss_pred ccC Confidence 775 No 77 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=100.00 E-value=3.1e-51 Score=297.47 Aligned_cols=262 Identities=14% Similarity=0.109 Sum_probs=219.1 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEe-CCceeEEeecCccccc-cccceeEEEEeee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLT-APPRGEVVGEGAQKSE-STATFAPVTAIPR 78 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~-~~~~a~~v~Eg~~~~~-~~~~~~~v~l~~~ 78 (311) -.+.+.||++||+++.++|++.+++.++++++++++++++++.++|+.. ..+.+.|++|++.+++ ++++|+++++.++ T Consensus 136 ~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~i~~~~~ 215 (400) T protein:vir:38 136 GVKAADAASTIPETISNTPQRELQTVVDLKPFTNVFQASTQKGTYPTVANATTKMVTVAELEKNPAMAKPEFKPVNWSVE 215 (400) T ss_pred cccccCCcccccHHHHHHHHHHHHhhhhhhhcceeEeccCcceEEEEEecCCCccccccccccccccccccceeeEeehh Confidence 2356678999999999999999999999999999999999999999976 4567899999999986 6899999999999 Q ss_pred eEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchH Q lcl|Aclame:pro 79 KVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPD 158 (311) Q Consensus 79 kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (311) |++++++||+||++ |+.++++++|.+.++++++.++|.++++|++.... .....+ T Consensus 216 k~~~~~~is~ell~---ds~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~~~~----------------------~~~~~~ 270 (400) T protein:vir:38 216 TYRQALPVSQESID---DSAIDLVGLIAQNGQQIKVNTTNGAVATLLKGFTA----------------------KTISSV 270 (400) T ss_pred heeeehhhHHHHHh---hhHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc----------------------cccccH Confidence 99999999999996 44678999999999999999999999988542211 011234 Q ss_pred HHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccccccccccccc Q lcl|Aclame:pro 159 LAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYR 238 (311) Q Consensus 159 ~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~ 238 (311) +++.+++.......+ .++|+|||.+|..|+++||++|+|+|.+...++.+++|+|+||++++.+|.. T Consensus 271 ~~~~~~~~~~~~~~~-~a~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~------------ 337 (400) T protein:vir:38 271 DDLKHINNVDLDPAY-SRVIIASQSFYNFLDTVKDGNGRYLLQDSILTPSGKSVLGMPIAVVSDDTLG------------ 337 (400) T ss_pred HHHHHHHHhhhhhhh-CcEEEEcHHHHHHHHHhhccCCCeeeecCcCCCCccccccceeEEecccccC------------ Confidence 556666554444333 4579999999999999999999999998888888999999999999888732 Q ss_pred cccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 239 TTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 239 ~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ..+...++||||+. |.+..+.+++++++++. .+...+|+.+|+|+++++|+||++|+.++.| T Consensus 338 -~~g~~~~~~gd~s~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~r~d~~~~~~~a~~~l~~~~~a 400 (400) T protein:vir:38 338 -AAGEAHAFLGDIKRAILFANRADFMVRWVDDQ----------IYGQFLQAGMRFGVSVADEKAGYFLTYTPKA 400 (400) T ss_pred -CCCceEEEEEeccccEEEEeecceEEEEeccc----------ccceeEEEEEEeccEEecccceEEEEeecCC Confidence 12345789999997 66777999999887642 2235799999999999999999999999999 No 78 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=100.00 E-value=1.1e-50 Score=294.53 Aligned_cols=259 Identities=13% Similarity=0.051 Sum_probs=215.2 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEe-CCceeEEeecCccccc-cccceeEEEEeee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLT-APPRGEVVGEGAQKSE-STATFAPVTAIPR 78 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~-~~~~a~~v~Eg~~~~~-~~~~~~~v~l~~~ 78 (311) -.+.+.||++||+++.++|++.+++.++++++|+++++++++..+|+.. ++..+.|++|++++|+ ++++|+++++.++ T Consensus 130 ~~t~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~l~~~ 209 (394) T protein:vir:97 130 GIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNID 209 (394) T ss_pred ccccccccccChHHHHHHHHHHhhhhhhhhhhceeeeccCcceEEEEEecCCCccceecccccccccccccceeEEeehh Confidence 2256678999999999999999999999999999999999999999876 4567899999999997 5799999999999 Q ss_pred eEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchH Q lcl|Aclame:pro 79 KVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPD 158 (311) Q Consensus 79 kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (311) |++++++||+||++++ .++++++|.+++++++++++|.++++|.+.... .....+ T Consensus 210 k~~~~i~is~ell~ds---~~~~~~~i~~~la~~~~~~~~~~i~~g~~~~~~----------------------~~~~~~ 264 (394) T protein:vir:97 210 TYRGAIPLSQESIDDA---DVDLVGIVSESISQIKVNTTNDAIAKVLKSFTT----------------------KTVKNL 264 (394) T ss_pred heeeehhhHHHHHhhh---hHHHHHHHHHHHHHHHHHHHHHHHhhccccccc----------------------cccccH Confidence 9999999999999644 578999999999999999999999988432110 112335 Q ss_pred HHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccccccccccccc Q lcl|Aclame:pro 159 LAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYR 238 (311) Q Consensus 159 ~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~ 238 (311) +++..++.......+ .+.|+|||.+|..|+++||++|+|+|.+.+.++.+++|+|+||++++..+ T Consensus 265 ~~~~~~~~~~~~~~~-~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~-------------- 329 (394) T protein:vir:97 265 DEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEV-------------- 329 (394) T ss_pred HHHHHHHHhhhhhhh-CCEEEEcHHHHHHHHHhhccCCCeeeecCcCCCCCceeccceeEEecccc-------------- Confidence 666666655444433 35699999999999999999999999988888888999999998854432 Q ss_pred cccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 239 TTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 239 ~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .+...++||||++ |.+..+.+++++.+++ ..+...+|++.|+|+++.+|+||++|+..+.+ T Consensus 330 --~~~~~~~~gd~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~ 391 (394) T protein:vir:97 330 --LGANKAFIGDFKRGVLFADRKDLGLRWADN----------EIYGQYLQAVLRFGVSKVDDKAGYYVTFTPEP 391 (394) T ss_pred --cCCccEEEeeccccEEEEEecceEEEEecc----------cccceeEEEEEEEccEEecccceEEEEecccc Confidence 2344689999987 6788899999987654 22345789999999999999999999987777 No 79 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=100.00 E-value=8.8e-50 Score=289.54 Aligned_cols=281 Identities=16% Similarity=0.123 Sum_probs=227.9 Q ss_pred Cc---ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC--------CceeEEeecCccccccccc Q lcl|Aclame:pro 1 MV---ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTA--------PPRGEVVGEGAQKSESTAT 69 (311) Q Consensus 1 ma---t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~--------~~~a~~v~Eg~~~~~~~~~ 69 (311) +. +...++.++|+.+.+.|+...+..+.++++++++++.++.+.+|+.++ .+.++|++||+.+|+++++ T Consensus 123 ~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~ 202 (419) T protein:vir:94 123 APAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLS 202 (419) T ss_pred cccccccCCcccccchhhhHHHHHHHhhhhhhhhcceeeeccCCceeeeeeccccccccccCcccceecCCccccccccc Confidence 21 223344677787777778788888999999999999998899988643 4568899999999999999 Q ss_pred eeEEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccc--cccce Q lcl|Aclame:pro 70 FAPVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILD--TTNIV 147 (311) Q Consensus 70 ~~~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~--~~~~~ 147 (311) |++++++++|++++++||+|+++++ .+++++|..++++++++++|.++|+|++.+ .+.|+.+.... ..... T Consensus 203 ~~~i~~~~~k~~~~~~is~ell~d~----~~l~~~i~~~la~a~~~~~d~aii~G~G~~---~p~Gi~~~~~~~~~~~~~ 275 (419) T protein:vir:94 203 FDTITTTLKTVAHWLPITRQAADDN----SQLMGYIQGRLTYGLRFLRDRQLLNGNGST---EMQGILTTPGIGTYQQPK 275 (419) T ss_pred eeeEEeeeeeEEEeehhhHHHHHhH----HHHHHHHHHHHHHHHHHHHHHHHHhccCcc---cccceecccccccccccc Confidence 9999999999999999999999643 368999999999999999999999996532 33343221111 11111 Q ss_pred eeccccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCc-eeeccccccCCCceecceeEEeecccccc Q lcl|Aclame:pro 148 ELTTGTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGR-KLYPELGFGTDVASFAGLNAAVSDTVRGG 226 (311) Q Consensus 148 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~-~~~~~~~~~~~~~~l~G~pv~~~~~~~~~ 226 (311) .....+....++++.+++..+...++.+++|+|||.+|..|+++||++|+ +++.+...++.+++|+|+||++++.+|.+ T Consensus 276 ~~~~~t~~~~~~~l~~~~~~~~~~~~~~~~~v~n~~~~~~l~~~k~~~~~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~ 355 (419) T protein:vir:94 276 PTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQG 355 (419) T ss_pred cccccccchhHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHHhhcCCCceeecCCcccCCCccccceeeEEcCCCCCc Confidence 12223344568889999999998888999999999999999999998776 45677778888999999999999988743 Q ss_pred cccccccccccccccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEE Q lcl|Aclame:pro 227 PEAVTASTGVYRTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVV 305 (311) Q Consensus 227 ~~~~~~~~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l 305 (311) .+++|||+. +.+..+.+++++++++.. .+|++|++.||++.|+|+++++|+||+++ T Consensus 356 ------------------~~~~gd~~~~~~~~~~~~~~v~~~~~~~-----~~~~~~~~~~r~~~r~d~~v~~~~a~~~~ 412 (419) T protein:vir:94 356 ------------------TALVGGFRQGATLWSRQGITVLMTDSHA-----DFFTANTLVILAEFRANLAVYQPKAFVRV 412 (419) T ss_pred ------------------cEEEeeccceEEEEEecceEEEEecccc-----chhhcCcEEEEEEEeeccEEeccccEEEE Confidence 588999986 567888999999877643 35999999999999999999999999999 Q ss_pred EecccC Q lcl|Aclame:pro 306 RDADES 311 (311) Q Consensus 306 ~~aa~~ 311 (311) +.+++- T Consensus 413 ~~~aa~ 418 (419) T protein:vir:94 413 TFAAAT 418 (419) T ss_pred EeccCC Confidence 988888 No 80 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=100.00 E-value=6.9e-51 Score=295.58 Aligned_cols=278 Identities=10% Similarity=-0.082 Sum_probs=219.8 Q ss_pred Cc--ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCcccc-ccccceeEEEEee Q lcl|Aclame:pro 1 MV--ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKS-ESTATFAPVTAIP 77 (311) Q Consensus 1 ma--t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~-~~~~~~~~v~l~~ 77 (311) +. +.+.||++||+++.++|++.+++.++++++|++++++ +..++|+.++.+.+.|++|+++++ +++++|+++++.+ T Consensus 79 ~~~~~~~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~~~~~~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~ 157 (377) T protein:vir:98 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS-LRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQ 157 (377) T ss_pred HhccCCCCCccccCHHHHHHHHHHHHHhhhhhhheeeEecC-cceEEEEecCCcceeEeecccccCcccCccceeEeecc Confidence 22 4567799999999999999999999999999999986 568999999999999999988776 5789999999999 Q ss_pred eeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeecc--cccc Q lcl|Aclame:pro 78 RKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTT--GTSA 155 (311) Q Consensus 78 ~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~--~~~~ 155 (311) ||++++++||+|||. |+.++++++|+++++++|++++|.+|++|+|. ..|.|+.+.+........... .+.. T Consensus 158 ~kl~a~~~is~elL~---ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~---~qP~Gil~~~~~~~~~~~~~~~~~~~~ 231 (377) T protein:vir:98 158 FKLTAFVVIPKDALK---FGPKWIKQFITEQLKEAIAVALELAIVKGDGL---LQPVGLLKDLSQPTVDQSTGRDITTYK 231 (377) T ss_pred eeEEeeecccHHhhh---ccHhHHHHHHHHHHHHHHHHHHhhceEeccCC---Ccceeeeeccccccccccccccccccc Confidence 999999999999995 55688999999999999999999999999753 245555443222222111111 1111 Q ss_pred chHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeecccc--------------ccCCCceecceeE--Ee Q lcl|Aclame:pro 156 TPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELG--------------FGTDVASFAGLNA--AV 219 (311) Q Consensus 156 ~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~--------------~~~~~~~l~G~pv--~~ 219 (311) ...+.+.++...+........+|+||+.++..++++||.+|+|+|...+ ..+.+.+++|+|+ +. T Consensus 232 ~~~~~~~~l~~~~~~~~~~~a~~~m~~~t~~~~~klkd~~G~~i~~~n~~~~~~~~p~~~~~~~~G~~~t~lg~p~~vv~ 311 (377) T protein:vir:98 232 TDKEAIADLSDLTPDNAPKKLVPVMKHLSVNDKKRPLKIAGQVKLILNPEDRWALEAQFTSRNQFGEYVTVLPHGITILE 311 (377) T ss_pred chhhhHhhhhhhchhHHHHHHHHHHHHHHHHHHhhhhccCCceEEEecccchhhccccccccCCCCccccccCCCceEEe Confidence 2233455555555444444557999999999999999999999993211 2344557888884 45 Q ss_pred ecccccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecc Q lcl|Aclame:pro 220 SDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMST 299 (311) Q Consensus 220 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~ 299 (311) ++++|. ..++||||+.|.+.++.+++++++++. +|.+|++.||++.|+|++++++ T Consensus 312 s~~~p~------------------~~i~fgdf~~Y~i~~r~~~~i~~~~~~-------~~~~d~~~f~~~~r~dg~~~~~ 366 (377) T protein:vir:98 312 SLAVET------------------GKAIAFVANRYDAFMATASTIEEYDQT-------FAMEDLQLYLTKNYFYGKAKDN 366 (377) T ss_pred cCCCCc------------------ccEEEEEecceeEEeecceEEEeechh-------hhhcCceEEEEEEEEcCEEecc Confidence 555552 358899999999999999999987653 4899999999999999999999 Q ss_pred cceEEEEeccc Q lcl|Aclame:pro 300 DAFAVVRDADE 310 (311) Q Consensus 300 ~a~~~l~~aa~ 310 (311) +||++|+.+-. T Consensus 367 ~a~~vl~i~~~ 377 (377) T protein:vir:98 367 HTAALLTLAGG 377 (377) T ss_pred CcEEEEEEecC Confidence 99999998888 No 81 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=100.00 E-value=7.9e-50 Score=289.79 Aligned_cols=266 Identities=15% Similarity=0.125 Sum_probs=213.1 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC-CceeEEeecCccccc-cccceeEEEEeee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTA-PPRGEVVGEGAQKSE-STATFAPVTAIPR 78 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~~-~~~~~~~v~l~~~ 78 (311) ..+.+.||++||+++.++|++.+++.++|+++|+++++++++.++|+... ...+.|++|++++++ ++++|++|+++++ T Consensus 113 ~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~l~~~ 192 (394) T protein:vir:10 113 HVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKRATDRFSSVAELAENPALAEPEFEQVDWSVS 192 (394) T ss_pred ccccccCceeccHHHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEecCCCccccccccccccccccccceeEEeeee Confidence 34667789999999999999999999999999999999999999998764 467899999999996 6799999999999 Q ss_pred eEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchH Q lcl|Aclame:pro 79 KVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPD 158 (311) Q Consensus 79 kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (311) |++++++||+|||+++ .++++++|.++|++++++++|.++++|++.+. +.+ ......+ T Consensus 193 k~~~~~~iS~ell~ds---~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~-------~~~------------~~~~~~~ 250 (394) T protein:vir:10 193 TYRGAIPLSEEAIADS---AVDLTSLVGQSINEKSVNTYNAMIAPVLQSFT-------AKA------------TTTDTLV 250 (394) T ss_pred eeEeeehhHHHHHhhh---hHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-------ccc------------ccccccH Confidence 9999999999999644 57899999999999999999999998854211 110 0112334 Q ss_pred HHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeecccccc----CCCceecceeEEeecccccccccccccc Q lcl|Aclame:pro 159 LAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFG----TDVASFAGLNAAVSDTVRGGPEAVTAST 234 (311) Q Consensus 159 ~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~----~~~~~l~G~pv~~~~~~~~~~~~~~~~~ 234 (311) +++.+++.......++ ++|+|||++|..|+++||++|||+|.+.... ..+++|+|+||++++.... T Consensus 251 d~l~~~~~~~~~~~~~-a~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~L~G~PV~~~~~~~~--------- 320 (394) T protein:vir:10 251 DSLKHILNVDLDPAYS-RALVVTQSLFNTLDTLKDKNGRYLLHDASDSITDGTAKGTVLGVPVYVVGDALL--------- 320 (394) T ss_pred HHHHHHHHhhhhhhcc-CEEEecHHHHHHHHHhhccCCCeeeeccccccccCCcccccccceeEEeccccc--------- Confidence 5666666544444444 5799999999999999999999999766543 4457899999987553211 Q ss_pred cccccccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 235 GVYRTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 235 ~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ....+...+++|||+. +.+..+.+++++++++.. |.+ .+|+.+|+|+++++|+||++++.++.+ T Consensus 321 ---~~~~~~~~i~~gd~s~~~~~~~~~~~~v~~~~~~~-------~~~---~~~~~~r~d~~~~~~~ai~~~~~~~~~ 385 (394) T protein:vir:10 321 ---GSAAGDQKAFVGDLKRGVLFADRQQVTLAWEDSKI-------YGR---YLGAAFRFGVKQADSNAGYFVTNTDAA 385 (394) T ss_pred ---CCCCCceEEEEeeccccEEEEeecceEEEEecccc-------cce---eEEEEEEeccEEeccccEEEEEeeccc Confidence 1123455789999997 667778999998776532 433 589999999999999999999987777 No 82 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=100.00 E-value=3e-49 Score=286.58 Aligned_cols=273 Identities=12% Similarity=0.007 Sum_probs=210.4 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCccc-cccccceeEEEEeeee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQK-SESTATFAPVTAIPRK 79 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~-~~~~~~~~~v~l~~~k 79 (311) -.+.+.||++||+++.++|++.+++.++++++|++++++ +...+|+.++.+.+.|++|++++ ++++++|+++++.+|| T Consensus 88 ~~t~~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v~~~~-~~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k 166 (395) T protein:vir:95 88 YDVGYTDEKILPETVVERVFDDLQKDHPLLSKINFQNAG-IKTRVIKADPAGQAVWGKVFGEIKGQLDAAFREENFTQYK 166 (395) T ss_pred hccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC-CceEEEEecCCcceEEeecccccCccccccceeeeeceee Confidence 226778999999999999999999999999999999987 56899999999999999987666 4689999999999999 Q ss_pred EEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchHH Q lcl|Aclame:pro 80 VQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPDL 159 (311) Q Consensus 80 l~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (311) ++++++||+|||+ |+.++++++|.+.++++|++++|++||+|+|.+. +.|.|+.+.+.................++ T Consensus 167 l~~~~~iS~ell~---ds~~~ie~~i~~~la~~ia~~~~~a~i~G~G~~~-~qP~Gil~~~~~~~~~~~~~~~~~~~t~~ 242 (395) T protein:vir:95 167 LTCFVVLPDDLST---FGPAWIERFVRTQIQEAISVALESAIINGGGAAK-TQPVGLMKDVNTNSGAVTDKASSGTLTFA 242 (395) T ss_pred EEEeecccHHHHh---cchhHHHHHHHHHHHHHHHHHHhhheeeccCCCC-cCceeeeecccccccccccccccchhhhh Confidence 9999999999995 5567899999999999999999999999965321 24555544332222221111122222333 Q ss_pred HHHHHHHHHhh-------------cCC-CccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceec--ceeEEeeccc Q lcl|Aclame:pro 160 AVEAAVGLVLG-------------DNL-SPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFA--GLNAAVSDTV 223 (311) Q Consensus 160 ~i~~~~~~~~~-------------~~~-~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~--G~pv~~~~~~ 223 (311) ++...+..+.. ..+ ....|+|||.++. |.+|+|+|.+. .+.+.+++ |+||+.+++| T Consensus 243 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~~------~~~g~~~~~~~--~G~~~~~lg~g~~v~~~~~~ 314 (395) T protein:vir:95 243 DADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDSW------DVQARYTYLTA--NGGFVTVLPYNVTIITSEFV 314 (395) T ss_pred hhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhhh------hcCCcceeccC--CCcceeccCCcceEEEcCCC Confidence 33333222211 111 2235999998765 66899999863 45567775 5557888888 Q ss_pred ccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceE Q lcl|Aclame:pro 224 RGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFA 303 (311) Q Consensus 224 ~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~ 303 (311) |.+ .++||||+.|.++++.+++++++++. +|.+|++.||++.|+|+++++++||+ T Consensus 315 p~~------------------~i~fgdfs~y~i~~r~~~~i~~~~~~-------~~~~d~~~f~~~~r~dg~~~~~~A~~ 369 (395) T protein:vir:95 315 PEG------------------KLVAFVTDRYNAVRGGGLTVKKFDQT-------LALEDAVLFTAKTFAYGQPDDNKASA 369 (395) T ss_pred CCC------------------cEEEEecccEEEEEecceEEEeccch-------hhhCCcEEEEEEEEECCEEeccccEE Confidence 743 48899999999999999999988753 48999999999999999999999999 Q ss_pred EEEecccC Q lcl|Aclame:pro 304 VVRDADES 311 (311) Q Consensus 304 ~l~~aa~~ 311 (311) +|+...+. T Consensus 370 ~l~i~~~~ 377 (395) T protein:vir:95 370 VYDLKVAS 377 (395) T ss_pred EEEeeccC Confidence 99976555 No 83 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=100.00 E-value=3.2e-49 Score=286.45 Aligned_cols=267 Identities=11% Similarity=0.010 Sum_probs=212.7 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEe-CCceeEEeecCccccc-cccceeEEEEeee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLT-APPRGEVVGEGAQKSE-STATFAPVTAIPR 78 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~-~~~~a~~v~Eg~~~~~-~~~~~~~v~l~~~ 78 (311) ..+.+.||++||+++...|. .+++.+.++++++++++.++...+|+.. ..+.+.|++|++.+++ ++++|+++++.++ T Consensus 158 ~~~~~~~g~lvp~~~~~~i~-~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~e~~~~~~~~v~~~~~ 236 (437) T protein:vir:10 158 GIALKDGKVIIPETILTPEK-EVHQFPRLGSLVRTESVTTTTGKLPIFNNSTDLLTAHTEYGQTTKNATPVITPILWDLK 236 (437) T ss_pred hcccccccccchHHHHHHHH-HhhhhhhhhhcceeEeeccCceeeEEeeccccccccccccccccccccccceeeeeehh Confidence 34677899999999977665 5688899999999999999999999885 4567899999999996 5699999999999 Q ss_pred eEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchH Q lcl|Aclame:pro 79 KVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPD 158 (311) Q Consensus 79 kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (311) |++++++||+|+++ |+.+++.++|.+.+++++++++|.++++|++.+. +. ......+ T Consensus 237 k~~~~~~is~ell~---ds~~~~~~~i~~~l~~~~~~~~~~~i~~g~g~~~-------~~-------------~~~~~~~ 293 (437) T protein:vir:10 237 TYTGGYVFSQELIS---DSSYDWQAELQSRLIELRDNTDDSLIITALTDGI-------KK-------------TTSTYLL 293 (437) T ss_pred heeeehhhhHHHHh---hhHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc-------cc-------------cccccch Confidence 99999999999996 4467899999999999999999999999964211 11 0011223 Q ss_pred HHHHHHHH-HHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccccccccccccccccc Q lcl|Aclame:pro 159 LAVEAAVG-LVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVY 237 (311) Q Consensus 159 ~~i~~~~~-~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~ 237 (311) +++.+++. .+.......++|+|||.++..|+++||++|+|+|.+....+.+++|+|+||++++.+... T Consensus 294 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~----------- 362 (437) T protein:vir:10 294 GDLKKVLNVTLKPQDSAAASIVMSQSAYNLFDMATDAMGRPLLQPNVTAATGYTLLGKTVVIVDDKLFP----------- 362 (437) T ss_pred hhHHHHHHhhhhhhhhcCCEEEEcHHHHHHHHHhhccCCCeeeccCccCCCCcccccceeEEecccccC----------- Confidence 34444443 444444445579999999999999999999999998888888899999999987654211 Q ss_pred ccccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 238 RTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 238 ~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ....+...++||||+. |.+..+.+++++..+. |..+...+|+.+|+|++++||+||++|+.+..+ T Consensus 363 ~~~~~~~~~~~gd~~~~~~~~~r~~~~~~~~~~---------~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~~~ 428 (437) T protein:vir:10 363 SASAGDVNIVVAPLKKAVINFKLTEITGQFQDT---------YDIWYKQLGIFLRQNVVQASKDLIVNLTGKLKA 428 (437) T ss_pred CcCCCceEEEEeeccccEEEEeeeceEEEEecc---------cccccceeeEEEEEccEEecccceEEEEeeccc Confidence 1234456789999996 5678899999987643 344556889999999999999999999966555 No 84 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=100.00 E-value=1e-48 Score=283.74 Aligned_cols=265 Identities=14% Similarity=0.120 Sum_probs=210.4 Q ss_pred Cc--ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC-CceeEEeecCccccc-cccceeEEEEe Q lcl|Aclame:pro 1 MV--ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTA-PPRGEVVGEGAQKSE-STATFAPVTAI 76 (311) Q Consensus 1 ma--t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~~-~~~~~~~v~l~ 76 (311) |. +.+.||++||+++..+|++.+++.++++++|+++++.+++.++|+... ...+.|++|++++++ ++++|+++++. T Consensus 109 ~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~i~~~ 188 (389) T protein:vir:10 109 TSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKRATDRFSSVAELAENPKLAEPEFNKVDWS 188 (389) T ss_pred hcccccCCcceeehHHHHHHHHHHHHhhhhHHhhcceeeccCCeeEEEEEecCCCccccccccccccccccccceeeeee Confidence 33 557789999999999999999999999999999999999999998864 456689999999985 78999999999 Q ss_pred eeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccc Q lcl|Aclame:pro 77 PRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSAT 156 (311) Q Consensus 77 ~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (311) ++|+++++++|+|++++ +.++++++|.+.+++++++.+|.+|++|.+... +. ...... T Consensus 189 ~~k~~~~~~iS~ell~d---s~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~-------~~------------~~~~~~ 246 (389) T protein:vir:10 189 VATYRGAIPLSEEAIAD---SAVDLTALVGQSIKEKSVNTYNAMIAPVLQSFT-------AK------------KTTTDT 246 (389) T ss_pred heeeEeeehhhHHHHhh---hhHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc-------cc------------cccccc Confidence 99999999999999964 457899999999999999999999998853211 00 111223 Q ss_pred hHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeecccccc----CCCceecceeEEeecc-ccccccccc Q lcl|Aclame:pro 157 PDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFG----TDVASFAGLNAAVSDT-VRGGPEAVT 231 (311) Q Consensus 157 ~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~----~~~~~l~G~pv~~~~~-~~~~~~~~~ 231 (311) .++++.+++.......+ .++|+|||.+|..|+++||++|+|+|.+.... +.+++|+|+||++.+. ++. T Consensus 247 ~~d~l~~~~~~~~~~~~-~a~~~~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~------ 319 (389) T protein:vir:10 247 LVDSLKHILNVDLDPAY-SRALVVTQSLFNTLDTLKDKNGRYLLHDASDSITDGTAKGTILGVPVYVVGDTLLG------ 319 (389) T ss_pred cHHHHHHHHHhhhhhhh-CcEEEecHHHHHHHHHhhccCCCeeeecCcccccccccccccccceeEEecccccC------ Confidence 45666666654333334 35799999999999999999999999766533 4457899999976543 321 Q ss_pred ccccccccccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccc Q lcl|Aclame:pro 232 ASTGVYRTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADE 310 (311) Q Consensus 232 ~~~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~ 310 (311) ...+...++||||+. |.+..+++++++++++.. |. ..+|+..|+|+++.+|+||++++.++. T Consensus 320 -------~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~-------~~---~~~~~~~r~d~~~~~~~a~~~~~~~~~ 382 (389) T protein:vir:10 320 -------SLAGDQKAFVGDLKRGVLFTDRQQVTLAWEDSKI-------YG---KYLGAAFRFGVQKADSKAGYFVTNTDV 382 (389) T ss_pred -------CCCCceEEEEeeccccEEEEeecceEEEeecccc-------cc---ceEEEEEEeccEEecccceEEEEeecc Confidence 123345789999997 778899999999876532 33 357999999999999999999985544 Q ss_pred C Q lcl|Aclame:pro 311 S 311 (311) Q Consensus 311 ~ 311 (311) + T Consensus 383 ~ 383 (389) T protein:vir:10 383 P 383 (389) T ss_pred C Confidence 4 No 85 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=100.00 E-value=3.3e-49 Score=286.39 Aligned_cols=262 Identities=12% Similarity=0.097 Sum_probs=208.2 Q ss_pred Cc--ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC-CceeEEeecCccccccccceeEEEEee Q lcl|Aclame:pro 1 MV--ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTA-PPRGEVVGEGAQKSESTATFAPVTAIP 77 (311) Q Consensus 1 ma--t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~~~~~~~~~v~l~~ 77 (311) |- +.+.||++||+++.++|++.++.+++|+++++++++. +..+|+... .+.+.|++|++.+++++++|+++++++ T Consensus 83 l~~~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~v~~~~--~~~~p~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~ 160 (352) T protein:vir:78 83 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIK--GLEIPRVSYTLDDDDFITDVETAKELKLKGDTVKFTT 160 (352) T ss_pred hccCCCCCCceeccHhHHHHHHHHHHhhcchhhheeeEecC--CceEEEEecCCCcccccccccccccccccceeeeecc Confidence 33 5567899999999999999999999999999998875 356777654 468999999999999999999999999 Q ss_pred eeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccch Q lcl|Aclame:pro 78 RKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATP 157 (311) Q Consensus 78 ~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (311) ||++++++||+|||+ |+.++++++|.+.++++++++++..+| |++.+.+. +.++......... +.... T Consensus 161 ~k~~~~i~is~ell~---Ds~~~l~~~i~~~la~~~~~~e~~~~~-~~g~g~~~-----~~g~l~~~~~~~~---t~~~~ 228 (352) T protein:vir:78 161 NKFKVFAAISDTVIH---GSDVDLVNWVENALQSGLAAKERKDAL-AVSPKSGL-----EHMSFYNGSVKEV---EGANM 228 (352) T ss_pred eeEEeechhhHHHHh---hhhHHHHHHHHHHHHHHHHHHHHHhhh-hcCCCCcc-----cccceeccccccc---cccch Confidence 999999999999995 456789999999999999988555433 33333322 2233322222222 22345 Q ss_pred HHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccccccccccccccccc Q lcl|Aclame:pro 158 DLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVY 237 (311) Q Consensus 158 ~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~ 237 (311) ++++.+++..+.....+.++|+||+.++..|++++|.+|+|+|.. .+.+|+|+||++++.++ T Consensus 229 ~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~~~~~-----~~~~llG~PV~~~~~~~------------- 290 (352) T protein:vir:78 229 YDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFDT-----PAEKVFGKPVVFTDAAV------------- 290 (352) T ss_pred HHHHHHHHhccChhhhcCCEEEEehHHHHHHHHHHhccCCccccc-----CCccccccceEEecCCC------------- Confidence 788888888888777777789999999999999999999999853 35689999999876543 Q ss_pred ccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 238 RTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 238 ~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .++||||+.+++. ++++.++..++ ..++++.|++..|+|+++++|+||++++.+|+| T Consensus 291 -------~~~~Gdf~~~~~~-~~~~~~~~~~~---------~~~g~~~f~~~~r~Dg~~~~~eA~~~l~~~a~~ 347 (352) T protein:vir:78 291 -------KPIVGDFNYFGIN-YDGTTYDTDKD---------VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKEST 347 (352) T ss_pred -------ceeEeehhhhhhh-hhhheeeeecc---------ccCCeeEEEEEeeeCceeechhheEEEEeeccc Confidence 4689999988664 55565554432 346899999999999999999999999999988 No 86 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=100.00 E-value=9.3e-49 Score=283.92 Aligned_cols=284 Identities=12% Similarity=0.038 Sum_probs=212.4 Q ss_pred Cc-ccCCCceEcchhH-HHHHHHHHHhhchhhhhcceeecCC--CceEEEEEeCCc-eeEEeecCc-----cccccccce Q lcl|Aclame:pro 1 MV-ALATGTFQLPKHL-VPGVWQKAQGQSVLARLSMAEPQEF--GEQQYMTLTAPP-RGEVVGEGA-----QKSESTATF 70 (311) Q Consensus 1 ma-t~~~g~~~vP~~~-~~~ii~~~~~~s~l~~l~~~~~~~~--~~~~~p~~~~~~-~a~~v~Eg~-----~~~~~~~~~ 70 (311) +. +.++||++||+++ .++|++.+++.++++++++.+++++ +++.||+..+++ .+.|++||+ .+|+++++| T Consensus 157 ~~~~~~~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~~~f 236 (477) T protein:vir:84 157 LDRNGGTGGYAVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTGTSTAIQAADNAALTAPSAHEVDLTD 236 (477) T ss_pred ccccCCCcceeeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecCcceeeeeccCcccccccccccccce Confidence 22 3456788888875 6789999999999999999888765 458999976655 567999985 457888999 Q ss_pred eEEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeec Q lcl|Aclame:pro 71 APVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELT 150 (311) Q Consensus 71 ~~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~ 150 (311) ++++++++|++++++||+|||+++ .++++++|.+++++++++++|.+||+|+|. +..|.|+ .+.+...... T Consensus 237 ~~i~~~~~k~~~~~~iS~ell~ds---~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt--~~~p~Gi----~~~~~~~~~~ 307 (477) T protein:vir:84 237 GFVQANVKTIAGQQGIAIQLLDQA---AVSVDEFVFRDLAADYANKLNVQVISGTGS--NNQVVGV----RATAGITQVT 307 (477) T ss_pred eeEEEeeeeEEeeeHHHHHHHhcc---chhHHHHHHHHHHHHHHHHHHHHHhccCCC--CCcccee----eecccccccc Confidence 999999999999999999999644 578999999999999999999999999642 2234444 3332221111 Q ss_pred cc----ccc---chHHHHHHHHHHHhhcCC-CccEEEEcHHHHHHHHHhhccCCceeeccc-------------cccCCC Q lcl|Aclame:pro 151 TG----TSA---TPDLAVEAAVGLVLGDNL-SPDGVALDNTFSFMLATQRDSQGRKLYPEL-------------GFGTDV 209 (311) Q Consensus 151 ~~----~~~---~~~~~i~~~~~~~~~~~~-~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~-------------~~~~~~ 209 (311) .. +.. ..++.+.+++..+..... .+++|+|||.+|..|+++||++|||+|.+. ...+.+ T Consensus 308 ~~~~~~t~~~~~~~~~~i~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~ 387 (477) T protein:vir:84 308 ATSAGSALEKHQIIYQKIADAIQRVHTSRFLEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVV 387 (477) T ss_pred ccccccchhhHHHHHHHHHHHHhhccccccCCccEEEEcHHHHHHHHHhhccCCCeeeecCccccccccccccccccccc Confidence 11 111 123334445544443333 455799999999999999999999999765 233445 Q ss_pred ceecceeEEeecccccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEE Q lcl|Aclame:pro 210 ASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAE 289 (311) Q Consensus 210 ~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~ 289 (311) ++|+|+||++++.+|.+.... .+...++||||+.+.++. .++.++++++.. +..+.+.||.. T Consensus 388 ~~l~G~pVv~s~~~p~~~~~~----------~d~~~i~~gd~~~~~i~~-~~~~~~~~~~~~-------~~~~~~~~~v~ 449 (477) T protein:vir:84 388 GQMHGLPVVTDPTLPTTLGTG----------TDQDVIHVLRASDLALFE-SSVRMRALQETR-------AENLSVLLQVY 449 (477) T ss_pred chhcccceEecCccccccccc----------CCcceEEEEEeceEEEEe-eceeEEeccccc-------cccceeeeeeh Confidence 789999999999999764332 334478999999998876 577888877643 44567778877 Q ss_pred EEeccEE-ecccceEEEEecccC Q lcl|Aclame:pro 290 VVYGIGI-MSTDAFAVVRDADES 311 (311) Q Consensus 290 ~r~~~~v-~~~~a~~~l~~aa~~ 311 (311) .++++.. +||+||++++.++-+ T Consensus 450 ~~~~~~~~r~~~afv~~t~~~~~ 472 (477) T protein:vir:84 450 GYLAFTAARFPQSVVEIGGTALT 472 (477) T ss_pred hhhhhhhhccccceEEeeccccc Confidence 7777654 569999999988777 No 87 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=100.00 E-value=1.4e-48 Score=282.85 Aligned_cols=270 Identities=10% Similarity=-0.015 Sum_probs=208.2 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCcccc-ccccceeEEEEeeee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKS-ESTATFAPVTAIPRK 79 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~-~~~~~~~~v~l~~~k 79 (311) -.+.+.||++||+++.++|++.+++.|++|++|++++++ +..++|+.+..+.+.|++|+++++ +++++|+++++.+|| T Consensus 78 ~~t~~~Gg~lvP~~~~~~I~~~l~~~spir~~a~v~~~~-~~~~i~~~~~~~~a~W~~e~~~~~~~~~~~f~~i~l~~~k 156 (381) T protein:vir:10 78 KSVGYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNK 156 (381) T ss_pred hcCCCCCceecCHHHHHHHHHHHHhhcceeeeeeeEecC-cceEEEeecCCcceEEeecccccccccCccceeEeeccee Confidence 225677899999999999999999999999999999986 568999999999999999988765 678999999999999 Q ss_pred EEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeec------c-- Q lcl|Aclame:pro 80 VQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELT------T-- 151 (311) Q Consensus 80 l~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~------~-- 151 (311) +++++++|+|||+ |+.++++++|+.+++++|++++|++|++|+|. ..|.|+.+.+.........+ . T Consensus 157 l~a~i~is~elL~---Ds~~~le~~i~~~la~~~a~~~~~afi~GdG~---~qP~Gil~~~~~~~~~~~g~~~~~~~~~~ 230 (381) T protein:vir:10 157 LTAFVVLPKDLND---FGPAWIERFVRVQIEEAFAVALETAFLKGTGK---DQPIGLNRQVQKGVSVTDGAYPEKEEQGT 230 (381) T ss_pred EEeeccccHHHHh---ccHHHHHHHHHHHHHHHHHHHhhceeEecccC---CCceeeeecCCcccccccccccccccccc Confidence 9999999999994 55788999999999999999999999999753 23455543222111111100 0 Q ss_pred ---ccccchHHHHHHHHHHHhh------cCCC-ccEEEEcHHHHHHHHHhh---ccCCceeeccccccCCCceecceeEE Q lcl|Aclame:pro 152 ---GTSATPDLAVEAAVGLVLG------DNLS-PDGVALDNTFSFMLATQR---DSQGRKLYPELGFGTDVASFAGLNAA 218 (311) Q Consensus 152 ---~~~~~~~~~i~~~~~~~~~------~~~~-~~~~v~n~~~~~~l~~lk---d~~g~~~~~~~~~~~~~~~l~G~pv~ 218 (311) .+....+..+...+..+.. ..+. ...|+|||.++..|++++ +++|+|+|... .|+||+ T Consensus 231 ~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~~~~~~~~~G~~v~~lp---------~g~~vv 301 (381) T protein:vir:10 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALP---------FNLNVI 301 (381) T ss_pred ccccchhhHHHHHHHHHHhhhhhhccccccccCceEEEEchhhHHhhccccccCCCCCceeecCC---------CCceeE Confidence 0001112222222211111 1122 235999999999988655 78999998521 477898 Q ss_pred eecccccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEec Q lcl|Aclame:pro 219 VSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMS 298 (311) Q Consensus 219 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~ 298 (311) .++.||.+ .++||||++|.+.+|.+++++++++. +|.+|++.||+..|+|++++| T Consensus 302 ~~~~~p~~------------------~i~fGDfs~Y~i~~r~~~~i~~~~~~-------~~~~d~~~f~a~~r~dG~~~~ 356 (381) T protein:vir:10 302 ESTVQEAG------------------KVLTYVKGLYDGYLAGGINVQKFKET-------LALDDMDLYTAKQFAYGKAKD 356 (381) T ss_pred EcCCCCcC------------------cEEEEEcccEEEEEecccEEEeechh-------hhhcCceEEEEEEEEcCEEec Confidence 88888743 58999999999999999999988753 599999999999999999999 Q ss_pred ccceEEEEecccC Q lcl|Aclame:pro 299 TDAFAVVRDADES 311 (311) Q Consensus 299 ~~a~~~l~~aa~~ 311 (311) ++||++++.+.+- T Consensus 357 ~~A~~v~~l~~~~ 369 (381) T protein:vir:10 357 NKVAAVWKLDLKG 369 (381) T ss_pred CCcEEEEEEeecC Confidence 9999998877554 No 88 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=100.00 E-value=2.3e-48 Score=281.76 Aligned_cols=270 Identities=11% Similarity=0.005 Sum_probs=209.3 Q ss_pred Cc--ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCcccc-ccccceeEEEEee Q lcl|Aclame:pro 1 MV--ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKS-ESTATFAPVTAIP 77 (311) Q Consensus 1 ma--t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~-~~~~~~~~v~l~~ 77 (311) |. +.+.||++||+++.++|++.+++.|+++++|++++++ +..++|+.++.+.+.|++|+++++ +++++|+++++.+ T Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~ 154 (381) T protein:vir:10 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQ 154 (381) T ss_pred HhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecC-cceEEEEecCCcceeeecccccccccccccceeeeecc Confidence 33 5567899999999999999999999999999999987 568999999999999999998876 5689999999999 Q ss_pred eeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccce--------e- Q lcl|Aclame:pro 78 RKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIV--------E- 148 (311) Q Consensus 78 ~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~--------~- 148 (311) ||++++++||+|||+ |+.++++++|+.+++++|++++|++|++|+|. ..|.|+.+.+....... . T Consensus 155 ~kl~~~~~is~elL~---Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~---~qP~Gil~~~~~~~~~~~g~~~~~~~~ 228 (381) T protein:vir:10 155 NKLTAFVVLPKDLND---FGPAWIERFVRVQIEEAFAVALETAFLKGTGK---DQPIGLNRQVQKGVSVTEGAYPEKEEQ 228 (381) T ss_pred eeEEeechhhHHHhh---cCHHHHHHHHHHHHHHHHHHHhhheeEeccCC---CCceeeeeccCcccccccccccccccc Confidence 999999999999995 55678999999999999999999999999753 23445543222111110 0 Q ss_pred --eccccccchHHHHHHHHHHHhhc------CCCc-cEEEEcHHHHHHHHHhh---ccCCceeeccccccCCCceeccee Q lcl|Aclame:pro 149 --LTTGTSATPDLAVEAAVGLVLGD------NLSP-DGVALDNTFSFMLATQR---DSQGRKLYPELGFGTDVASFAGLN 216 (311) Q Consensus 149 --~~~~~~~~~~~~i~~~~~~~~~~------~~~~-~~~v~n~~~~~~l~~lk---d~~g~~~~~~~~~~~~~~~l~G~p 216 (311) .+..+....++.+..++..+... .+.. ..|+|||.++..|++++ +++|+|+|.. ..|.+ T Consensus 229 ~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~G~~v~~l---------~~g~~ 299 (381) T protein:vir:10 229 GTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTAL---------PFNLN 299 (381) T ss_pred cccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccCCCCCceeecC---------CCCce Confidence 01111122233444444333221 1232 35999999999998766 6779988742 14667 Q ss_pred EEeecccccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEE Q lcl|Aclame:pro 217 AAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGI 296 (311) Q Consensus 217 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v 296 (311) |+.++.||.+ .++||||+.|.+.+|.+++++++++. +|.+|++.||+..|+|+++ T Consensus 300 vv~s~~~p~~------------------~iifgDfs~Y~i~~r~~~~i~~~~~~-------~~~~d~~~f~a~~r~dg~~ 354 (381) T protein:vir:10 300 VIESTVQEAG------------------KVLTYVKGLYDGYLAGGINVQKFKET-------LALDDMDLYTAKQFAYGKA 354 (381) T ss_pred EEecCCCCcC------------------cEEEEecccEEEEEecccEEEeechh-------HhhcCCeEEEEEEEEcCEE Confidence 8888887733 58999999999999999999988763 5999999999999999999 Q ss_pred ecccceEEEEecccC Q lcl|Aclame:pro 297 MSTDAFAVVRDADES 311 (311) Q Consensus 297 ~~~~a~~~l~~aa~~ 311 (311) ++++||++++.+.+. T Consensus 355 ~~~~A~~v~~l~~~~ 369 (381) T protein:vir:10 355 KDNKVAAVWKLDLKG 369 (381) T ss_pred ecCceEEEEEEEecC Confidence 999999998765544 No 89 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=100.00 E-value=2.3e-48 Score=281.76 Aligned_cols=270 Identities=11% Similarity=0.005 Sum_probs=209.3 Q ss_pred Cc--ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCcccc-ccccceeEEEEee Q lcl|Aclame:pro 1 MV--ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKS-ESTATFAPVTAIP 77 (311) Q Consensus 1 ma--t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~-~~~~~~~~v~l~~ 77 (311) |. +.+.||++||+++.++|++.+++.|+++++|++++++ +..++|+.++.+.+.|++|+++++ +++++|+++++.+ T Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~ 154 (381) T protein:vir:95 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQ 154 (381) T ss_pred HhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecC-cceEEEEecCCcceeeecccccccccccccceeeeecc Confidence 33 5567899999999999999999999999999999987 568999999999999999998876 5689999999999 Q ss_pred eeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccce--------e- Q lcl|Aclame:pro 78 RKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIV--------E- 148 (311) Q Consensus 78 ~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~--------~- 148 (311) ||++++++||+|||+ |+.++++++|+.+++++|++++|++|++|+|. ..|.|+.+.+....... . T Consensus 155 ~kl~~~~~is~elL~---Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~---~qP~Gil~~~~~~~~~~~g~~~~~~~~ 228 (381) T protein:vir:95 155 NKLTAFVVLPKDLND---FGPAWIERFVRVQIEEAFAVALETAFLKGTGK---DQPIGLNRQVQKGVSVTEGAYPEKEEQ 228 (381) T ss_pred eeEEeechhhHHHhh---cCHHHHHHHHHHHHHHHHHHHhhheeEeccCC---CCceeeeeccCcccccccccccccccc Confidence 999999999999995 55678999999999999999999999999753 23445543222111110 0 Q ss_pred --eccccccchHHHHHHHHHHHhhc------CCCc-cEEEEcHHHHHHHHHhh---ccCCceeeccccccCCCceeccee Q lcl|Aclame:pro 149 --LTTGTSATPDLAVEAAVGLVLGD------NLSP-DGVALDNTFSFMLATQR---DSQGRKLYPELGFGTDVASFAGLN 216 (311) Q Consensus 149 --~~~~~~~~~~~~i~~~~~~~~~~------~~~~-~~~v~n~~~~~~l~~lk---d~~g~~~~~~~~~~~~~~~l~G~p 216 (311) .+..+....++.+..++..+... .+.. ..|+|||.++..|++++ +++|+|+|.. ..|.+ T Consensus 229 ~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~G~~v~~l---------~~g~~ 299 (381) T protein:vir:95 229 GTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTAL---------PFNLN 299 (381) T ss_pred cccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccCCCCCceeecC---------CCCce Confidence 01111122233444444333221 1232 35999999999998766 6779988742 14667 Q ss_pred EEeecccccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEE Q lcl|Aclame:pro 217 AAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGI 296 (311) Q Consensus 217 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v 296 (311) |+.++.||.+ .++||||+.|.+.+|.+++++++++. +|.+|++.||+..|+|+++ T Consensus 300 vv~s~~~p~~------------------~iifgDfs~Y~i~~r~~~~i~~~~~~-------~~~~d~~~f~a~~r~dg~~ 354 (381) T protein:vir:95 300 VIESTVQEAG------------------KVLTYVKGLYDGYLAGGINVQKFKET-------LALDDMDLYTAKQFAYGKA 354 (381) T ss_pred EEecCCCCcC------------------cEEEEecccEEEEEecccEEEeechh-------HhhcCCeEEEEEEEEcCEE Confidence 8888887733 58999999999999999999988763 5999999999999999999 Q ss_pred ecccceEEEEecccC Q lcl|Aclame:pro 297 MSTDAFAVVRDADES 311 (311) Q Consensus 297 ~~~~a~~~l~~aa~~ 311 (311) ++++||++++.+.+. T Consensus 355 ~~~~A~~v~~l~~~~ 369 (381) T protein:vir:95 355 KDNKVAAVWKLDLKG 369 (381) T ss_pred ecCceEEEEEEEecC Confidence 999999998765544 No 90 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=100.00 E-value=1.4e-48 Score=282.94 Aligned_cols=262 Identities=13% Similarity=0.100 Sum_probs=207.8 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEe-CCceeEEeecCccccccccceeEEEEeeee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLT-APPRGEVVGEGAQKSESTATFAPVTAIPRK 79 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~-~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k 79 (311) ..+.+.||++||+++.++|++.++++++|++++++++++ +..+|+.. ....+.|++|++.+++++++|+++++.++| T Consensus 120 ~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~--~~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k 197 (387) T protein:vir:26 120 TGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIK--GLEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNK 197 (387) T ss_pred cCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecC--CceeeeeeccCCccccccccccccccccccceeeechhe Confidence 335677899999999999999999999999999998886 35678765 457899999999999999999999999999 Q ss_pred EEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchHH Q lcl|Aclame:pro 80 VQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPDL 159 (311) Q Consensus 80 l~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (311) ++++++||+|||+ |+.++++++|.++++++++++++..+|. ++.+. +.+.++........+ +....++ T Consensus 198 ~~~~i~iS~ell~---ds~~~l~~~i~~~la~~~~~~e~~~~~~-~g~g~-----g~~~g~~~~~~~~~~---~~~~~~d 265 (387) T protein:vir:26 198 FKVFAAISDTVIH---GSDVDLVNWVENALQSGLAAKERKDALA-VSPKS-----GLEHMSFYNGSVKEV---EGADMYD 265 (387) T ss_pred eeeechhhHHHHh---hhHHHHHHHHHHHHHHHHHHHHHHhHhh-cCCCc-----cccceeeeccccccc---cccchHH Confidence 9999999999995 5567899999999999999987766542 22222 223333333222221 2334578 Q ss_pred HHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccccccccccccccccccc Q lcl|Aclame:pro 160 AVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRT 239 (311) Q Consensus 160 ~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~ 239 (311) ++.+++..+.........|+||+.++..+.++++.+|+|+|. +.+.+|+|+||++++.++ T Consensus 266 ~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~-----~~~~~llG~PV~~~~~~~--------------- 325 (387) T protein:vir:26 266 AIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD-----TPAEKVFGKPVVFTDAAV--------------- 325 (387) T ss_pred HHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc-----cCCccccccceEEecCCC--------------- Confidence 888888888877667778999999998888888888888884 345789999999876543 Q ss_pred ccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 240 TNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 240 ~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .++||||+.+++. ++++.+...++ ...|++.||++.|+|+++++|+||++|+.++++ T Consensus 326 -----~~~~GDf~~~~~~-~~~~~~~~~~~---------~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~~ 382 (387) T protein:vir:26 326 -----KPIVGDFNYFGIN-YDGTTYDTDKD---------VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENT 382 (387) T ss_pred -----ceeeechhhhhhh-hhhhhheeccc---------ccCCceEEEEEEEeCcEeechhheEEEEeecCC Confidence 4789999987664 45555554433 236899999999999999999999999998888 No 91 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=100.00 E-value=1.4e-48 Score=282.94 Aligned_cols=262 Identities=13% Similarity=0.100 Sum_probs=207.8 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEe-CCceeEEeecCccccccccceeEEEEeeee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLT-APPRGEVVGEGAQKSESTATFAPVTAIPRK 79 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~-~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k 79 (311) ..+.+.||++||+++.++|++.++++++|++++++++++ +..+|+.. ....+.|++|++.+++++++|+++++.++| T Consensus 120 ~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~--~~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k 197 (387) T protein:vir:96 120 TGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIK--GLEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNK 197 (387) T ss_pred cCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecC--CceeeeeeccCCccccccccccccccccccceeeechhe Confidence 335677899999999999999999999999999998886 35678765 457899999999999999999999999999 Q ss_pred EEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchHH Q lcl|Aclame:pro 80 VQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPDL 159 (311) Q Consensus 80 l~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (311) ++++++||+|||+ |+.++++++|.++++++++++++..+|. ++.+. +.+.++........+ +....++ T Consensus 198 ~~~~i~iS~ell~---ds~~~l~~~i~~~la~~~~~~e~~~~~~-~g~g~-----g~~~g~~~~~~~~~~---~~~~~~d 265 (387) T protein:vir:96 198 FKVFAAISDTVIH---GSDVDLVNWVENALQSGLAAKERKDALA-VSPKS-----GLEHMSFYNGSVKEV---EGADMYD 265 (387) T ss_pred eeeechhhHHHHh---hhHHHHHHHHHHHHHHHHHHHHHHhHhh-cCCCc-----cccceeeeccccccc---cccchHH Confidence 9999999999995 5567899999999999999987766542 22222 223333333222221 2334578 Q ss_pred HHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccccccccccccccccccc Q lcl|Aclame:pro 160 AVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRT 239 (311) Q Consensus 160 ~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~ 239 (311) ++.+++..+.........|+||+.++..+.++++.+|+|+|. +.+.+|+|+||++++.++ T Consensus 266 ~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~-----~~~~~llG~PV~~~~~~~--------------- 325 (387) T protein:vir:96 266 AIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD-----TPAEKVFGKPVVFTDAAV--------------- 325 (387) T ss_pred HHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc-----cCCccccccceEEecCCC--------------- Confidence 888888888877667778999999998888888888888884 345789999999876543 Q ss_pred ccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 240 TNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 240 ~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .++||||+.+++. ++++.+...++ ...|++.||++.|+|+++++|+||++|+.++++ T Consensus 326 -----~~~~GDf~~~~~~-~~~~~~~~~~~---------~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~~ 382 (387) T protein:vir:96 326 -----KPIVGDFNYFGIN-YDGTTYDTDKD---------VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENT 382 (387) T ss_pred -----ceeeechhhhhhh-hhhhhheeccc---------ccCCceEEEEEEEeCcEeechhheEEEEeecCC Confidence 4789999987664 45555554433 236899999999999999999999999998888 No 92 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=100.00 E-value=1.4e-48 Score=282.94 Aligned_cols=262 Identities=13% Similarity=0.100 Sum_probs=207.8 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEe-CCceeEEeecCccccccccceeEEEEeeee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLT-APPRGEVVGEGAQKSESTATFAPVTAIPRK 79 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~-~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k 79 (311) ..+.+.||++||+++.++|++.++++++|++++++++++ +..+|+.. ....+.|++|++.+++++++|+++++.++| T Consensus 120 ~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~--~~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k 197 (387) T protein:vir:94 120 TGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIK--GLEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNK 197 (387) T ss_pred cCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecC--CceeeeeeccCCccccccccccccccccccceeeechhe Confidence 335677899999999999999999999999999998886 35678765 457899999999999999999999999999 Q ss_pred EEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchHH Q lcl|Aclame:pro 80 VQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPDL 159 (311) Q Consensus 80 l~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (311) ++++++||+|||+ |+.++++++|.++++++++++++..+|. ++.+. +.+.++........+ +....++ T Consensus 198 ~~~~i~iS~ell~---ds~~~l~~~i~~~la~~~~~~e~~~~~~-~g~g~-----g~~~g~~~~~~~~~~---~~~~~~d 265 (387) T protein:vir:94 198 FKVFAAISDTVIH---GSDVDLVNWVENALQSGLAAKERKDALA-VSPKS-----GLEHMSFYNGSVKEV---EGADMYD 265 (387) T ss_pred eeeechhhHHHHh---hhHHHHHHHHHHHHHHHHHHHHHHhHhh-cCCCc-----cccceeeeccccccc---cccchHH Confidence 9999999999995 5567899999999999999987766542 22222 223333333222221 2334578 Q ss_pred HHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccccccccccccccccccc Q lcl|Aclame:pro 160 AVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRT 239 (311) Q Consensus 160 ~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~ 239 (311) ++.+++..+.........|+||+.++..+.++++.+|+|+|. +.+.+|+|+||++++.++ T Consensus 266 ~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~-----~~~~~llG~PV~~~~~~~--------------- 325 (387) T protein:vir:94 266 AIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD-----TPAEKVFGKPVVFTDAAV--------------- 325 (387) T ss_pred HHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc-----cCCccccccceEEecCCC--------------- Confidence 888888888877667778999999998888888888888884 345789999999876543 Q ss_pred ccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 240 TNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 240 ~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .++||||+.+++. ++++.+...++ ...|++.||++.|+|+++++|+||++|+.++++ T Consensus 326 -----~~~~GDf~~~~~~-~~~~~~~~~~~---------~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~~ 382 (387) T protein:vir:94 326 -----KPIVGDFNYFGIN-YDGTTYDTDKD---------VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENT 382 (387) T ss_pred -----ceeeechhhhhhh-hhhhhheeccc---------ccCCceEEEEEEEeCcEeechhheEEEEeecCC Confidence 4789999987664 45555554433 236899999999999999999999999998888 No 93 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=100.00 E-value=3.7e-48 Score=280.60 Aligned_cols=261 Identities=12% Similarity=0.070 Sum_probs=203.8 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEe-CCceeEEeecCccccccccceeEEEEeeee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLT-APPRGEVVGEGAQKSESTATFAPVTAIPRK 79 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~-~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k 79 (311) ..+.+.||++||+++.++|++.++++++|+++|+++++. +..+|+.. +...+.|++|++.+++++++|++++++++| T Consensus 120 ~~t~s~gG~~IP~~~~~~Ii~~~~~~~~l~~~~~v~~~~--~~~~p~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k 197 (387) T protein:vir:93 120 TGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIK--GLEIPRVSYTLDDDDFITDVETAKELKLKGDTVKFTTNK 197 (387) T ss_pred cCcCCCCceeechhHHHHHHHHHHhhchhhhheeeeecC--CceEEEEeecCCccccccCcccccccccccceeeeehee Confidence 335677899999999999999999999999999998886 35678765 457799999999999999999999999999 Q ss_pred EEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhh-hcccCCCccccccccccccccccceeeccccccchH Q lcl|Aclame:pro 80 VQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGI-HGINPLTGAALSGSPAKILDTTNIVELTTGTSATPD 158 (311) Q Consensus 80 l~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l-~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (311) ++++++||+|||+ |+.++++++|.++++++++++++..+| +|+ +.+ .+.+++....... .+....+ T Consensus 198 ~~~~~~iS~ell~---Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~--g~g-----~p~g~l~~~~~~~---v~~~~~~ 264 (387) T protein:vir:93 198 FKVFAAISDTVIH---GSDVDLVNWVENALQSGLAAKERKDALAVSP--KSG-----LDHMSFYNGSVKE---VEGADMY 264 (387) T ss_pred eeeechhhHHHHh---hhHHHHHHHHHHHHHHHHHHHHHHhHhhcCC--Ccc-----ccceeeecccccc---ccccchH Confidence 9999999999995 556789999999999999999877655 332 222 2333333322222 1233457 Q ss_pred HHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccccccccccccc Q lcl|Aclame:pro 159 LAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYR 238 (311) Q Consensus 159 ~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~ 238 (311) +.+.+++..+.......+.|+||+.++..+.++++.+|+++|. +.+.+|+|+||++++.++ T Consensus 265 d~i~~~~~~l~~~~~~~a~~~mn~~t~~~~~~~~~d~~~~~~~-----~~~~~llG~PV~~~~~~~-------------- 325 (387) T protein:vir:93 265 DAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD-----TPAEKVFGKPVVFTDAAV-------------- 325 (387) T ss_pred HHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc-----cCCccccccceEEecCCC-------------- Confidence 8888888888877777778999999987765555445555553 345689999999876542 Q ss_pred cccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 239 TTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 239 ~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .++||||+.+++. +.++.+...++ +.++++.|+++.|+|+++++|+||++++.++++ T Consensus 326 ------~~~~GDf~~~~~~-~~~~~~~~~~~---------~~~~~~~~~~~~r~d~~v~~~eA~~~l~~k~~~ 382 (387) T protein:vir:93 326 ------KPIVGDFNYFGIN-YDGTTYDTDKD---------VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENT 382 (387) T ss_pred ------ceeeeehhhhhee-hhhheeeeccc---------ccCCceeEEEEeeeCceeechhheEEEEeecCC Confidence 5789999998765 55565554433 457899999999999999999999999988877 No 94 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=100.00 E-value=2.5e-48 Score=281.60 Aligned_cols=261 Identities=12% Similarity=0.081 Sum_probs=205.6 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEe-CCceeEEeecCccccccccceeEEEEeeee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLT-APPRGEVVGEGAQKSESTATFAPVTAIPRK 79 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~-~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k 79 (311) ..+.+.||++||+++..+|++.++++++++++|++++++ +..+|+.. ....+.|++|++.+++++++|+++++.+|| T Consensus 135 ~~t~~~GG~lIP~~~~~~Ii~~~~~~~~l~~~~~v~~~~--~~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k 212 (402) T protein:vir:93 135 TGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIK--GLEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNK 212 (402) T ss_pred cCCCcCCccccchhHHHHHHHhHHhhhhhhhhceeeecC--CceeeeeeccCCccccccccccccccccccceeeeccee Confidence 335567899999999999999999999999999998886 35678765 457789999999999999999999999999 Q ss_pred EEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhh-hcccCCCccccccccccccccccceeeccccccchH Q lcl|Aclame:pro 80 VQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGI-HGINPLTGAALSGSPAKILDTTNIVELTTGTSATPD 158 (311) Q Consensus 80 l~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l-~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (311) ++++++||+|||+ |+.++++++|.++++++++++++..+| .|+ +. +.+.++........+ +....+ T Consensus 213 ~~~~i~iS~ell~---Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~--g~-----g~p~g~~~~~~~~~~---~~~~~~ 279 (402) T protein:vir:93 213 FKVFAAISDTVIH---GSDVDLVNWVENALQSGLAAKERKDALAVSP--KS-----GLEHMSFYNGSVKEV---EGADMY 279 (402) T ss_pred eeeechhhHHHHh---hhHHHHHHHHHHHHHHHHHHHHHHhHhhcCC--Cc-----cccceeeeccccccc---cccchH Confidence 9999999999995 556789999999999999998776554 332 22 233444333322221 223457 Q ss_pred HHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccccccccccccc Q lcl|Aclame:pro 159 LAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYR 238 (311) Q Consensus 159 ~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~ 238 (311) +++.+++..+.........|+||+.++..++++++.+|+++|. +.+.+|+|+||++++.++ T Consensus 280 d~l~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~d~~~~~~~-----~~~~~llG~PV~~t~~~~-------------- 340 (402) T protein:vir:93 280 DAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD-----TPAEKVFGKPVVFTDAAV-------------- 340 (402) T ss_pred HHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc-----cCCccccccceEEecCCC-------------- Confidence 7888888888777666678999999998887777778888874 345789999999876543 Q ss_pred cccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 239 TTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 239 ~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .++||||+.+++. ++++.++..++ ...+++.||+..|+|+++++|+||++|+.++.+ T Consensus 341 ------~i~~GDf~~~~~~-~~~~~~~~~~~---------~~~~~~~~~~~~r~Dg~v~~~~A~~~l~ik~~~ 397 (402) T protein:vir:93 341 ------KPIVGDFNYFGIN-YDGTTYDTDKD---------VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENT 397 (402) T ss_pred ------ceeeechhhhhhh-hhhhhhhhhhc---------ccCCceEEEEEEEeCcEEechhheEEEEeecCC Confidence 5789999987654 34454443332 225899999999999999999999999988877 No 95 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=100.00 E-value=1.3e-47 Score=277.57 Aligned_cols=261 Identities=11% Similarity=0.032 Sum_probs=213.7 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC-CceeEEeecCccccc-cccceeEEEEeee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTA-PPRGEVVGEGAQKSE-STATFAPVTAIPR 78 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~~-~~~~~~~v~l~~~ 78 (311) ..+...|++++|+++.+.|++ +++...++++|+.+++++++..+|+... ...+.|++|++..++ ++++|++++++++ T Consensus 134 ~~~~~~~~~~vp~~~~~~i~~-~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~ 212 (397) T protein:vir:96 134 GFTSVEGGALIPQELLQPQLE-PKDIVDLSKYVRSVPVNSASGKFPVISKSGSKMATVQQLEKNPQLANPKMVEIDYSVA 212 (397) T ss_pred cccccccccchhHHHHHHHHH-hhhhhhHHHhhhhccccccceeEEEEeccCCccccccccccccccccccccceeecHh Confidence 336677899999999999997 5778889999999999988888888654 567889999999996 6899999999999 Q ss_pred eEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchH Q lcl|Aclame:pro 79 KVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPD 158 (311) Q Consensus 79 kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (311) |+++++++|+|++++ +.++++++|.+.+++++++++|.++++|++.... .....+ T Consensus 213 ~~~~~~~~s~ell~d---s~~~l~~~i~~~l~~~~~~~~~~~i~~g~g~~~~----------------------~~~~~~ 267 (397) T protein:vir:96 213 TRRGYIPISQEMIDD---ASYDVTGLIADEIQDQSLNTKNADIAAVLKTATA----------------------KSVVGV 267 (397) T ss_pred HhhcchhhHHHHHhh---hHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc----------------------ccccch Confidence 999999999999964 4568999999999999999999999998643211 112335 Q ss_pred HHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccccccccccccc Q lcl|Aclame:pro 159 LAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYR 238 (311) Q Consensus 159 ~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~ 238 (311) +++.+++.......+ .++|+|||++|..|+++||++|+|+|.+....+.+++|+|+||++++.... . T Consensus 268 d~~~~~~~~~~~~~~-~a~~v~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~------------~ 334 (397) T protein:vir:96 268 DGLKDLINKEIKKVY-DVKLFISASMYSELDKLKDKNGRYLLQDSITAASGKQLLGKEVVVLDDDVI------------G 334 (397) T ss_pred HHHHHHHHHhhhhhc-CcEEEEcHHHHHHHHHhhccCCCeEeccCccCCCcccccccceEEeccccc------------C Confidence 667666665444433 467999999999999999999999999888888899999999986554321 1 Q ss_pred cccccceEEEeecce-EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccc Q lcl|Aclame:pro 239 TTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADE 310 (311) Q Consensus 239 ~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~ 310 (311) ...+...++||||+. |.+..+.+++++.+++. .....+|+++|+|++++||+||++++.+++ T Consensus 335 ~~~~~~~~~~gd~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~r~d~~~~~~~a~~~~~~~~a 397 (397) T protein:vir:96 335 KSVGNVVGFIGDAKAFASFFDRKQVSVSWVDNN----------IYGQLLAGIIRYDVKATDKKAGFYVTFTIG 397 (397) T ss_pred CCCCceEEEEeehhcceEeEeecceEEEEeccc----------ccceeEEEEEEEccEEecccceEEEEeecC Confidence 234456799999997 66788999999876542 234578999999999999999999997777 No 96 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=100.00 E-value=2e-47 Score=276.63 Aligned_cols=275 Identities=16% Similarity=0.108 Sum_probs=213.4 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEeeeeE Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPRKV 80 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~kl 80 (311) ..+.+.|+++||+++.+.|++.+++.+++++++++.+++ +..++|+....+.+.|++|++++++++++|+++++.+||+ T Consensus 151 ~~~~~g~~~~vP~~~~~~i~~~l~~~~~l~~~~~v~~~~-g~~~~~~~~~~~~a~wv~E~~~~~~~~~~f~~i~~~~~k~ 229 (466) T protein:vir:80 151 KRAVSGAELTIPDVMLELLRDNMHRYSKLISKVRLRPLK-GTARQNIAGAIPEGVWTEAVANLNELSLSFSQIEVDGYKV 229 (466) T ss_pred hhhhccccccccHHHHHHHHHhhhhhhhhhhheeeeecC-ceeEeeeecCCcceeecccccccccccccccceeecceee Confidence 223445568999999999999999999999999999987 5678999888899999999999999999999999999999 Q ss_pred EEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeec----cccccc Q lcl|Aclame:pro 81 QVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELT----TGTSAT 156 (311) Q Consensus 81 ~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~----~~~~~~ 156 (311) +++++||+|||. |+.++++++|...+++++++++|.+||+|+|. +. |.|+.+.+...+...... ...... T Consensus 230 ~~~~~iS~ell~---ds~~~l~~~i~~~la~~~~~~~~~ail~G~G~--~~-P~Gil~~~~~~~~~~~~~~~~~~~~~~~ 303 (466) T protein:vir:80 230 GGFIPIPNSTLE---DSDLNLADEILDAIGQAIGFALDKAILYGTGT--KM-PVGIVTRLAQTTQPPNWGTKAPAWTNLS 303 (466) T ss_pred eeehhhhHHHHh---cchHHHHHHHHHHHHHHHHHHHhhheeeccCC--CC-cceeeecccccccccccccccccccccc Confidence 999999999995 45578999999999999999999999999653 32 445543322211111100 000000 Q ss_pred ----------------hHHHHHHHHHHHhhcCCCcc-EEEEcHHHHHHHHHhh---ccCCceeeccccccCCCceeccee Q lcl|Aclame:pro 157 ----------------PDLAVEAAVGLVLGDNLSPD-GVALDNTFSFMLATQR---DSQGRKLYPELGFGTDVASFAGLN 216 (311) Q Consensus 157 ----------------~~~~i~~~~~~~~~~~~~~~-~~v~n~~~~~~l~~lk---d~~g~~~~~~~~~~~~~~~l~G~p 216 (311) .+.++...+........++. .|+||+.++..|..++ +.+|.+++.+. +...++|+| T Consensus 304 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~l~~~~~~~~~~g~~~~~~~----~~~~i~G~p 379 (466) T protein:vir:80 304 TTNLLKIDPTGKSAEEFFSELVLKLSKARANYSNGMKFWAMSSNTHAVLMSKAITFNSAGALVASLN----NTMPIVGGD 379 (466) T ss_pred hhhhhhhhhhccchhhHHHHHHHHHHhhhccccCCceeEEecchhHHHhhcccccccCCccccccCC----Ccccccccc Confidence 11111112222233334444 5999999999999887 66777776542 223589999 Q ss_pred EEeecccccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEE Q lcl|Aclame:pro 217 AAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGI 296 (311) Q Consensus 217 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v 296 (311) |++++++|.+ .+++|||+.|.+.+|.+++++++++. .|.+|++.||+++|+|+++ T Consensus 380 vv~s~~~~~~------------------~~~~g~~~~y~i~~r~~~~i~~~~~~-------~f~~d~~~~r~~~r~dg~~ 434 (466) T protein:vir:80 380 IVILDFIPDN------------------DIIGGYGSLYLLAERADIKLAQSEHV-------RFIEDQTVFKGTARYDGKP 434 (466) T ss_pred eeecCccCcc------------------ceeeeccccEEEEeecceEEEechhh-------hhhcCcEEEEEEEEEccEE Confidence 9999998743 58999999999999999999987653 4899999999999999999 Q ss_pred ecccceEEEEecccC Q lcl|Aclame:pro 297 MSTDAFAVVRDADES 311 (311) Q Consensus 297 ~~~~a~~~l~~aa~~ 311 (311) ++|+||++++.+..+ T Consensus 435 ~~~~afv~~~~~~~~ 449 (466) T protein:vir:80 435 VFGEGFVAVNIANAN 449 (466) T ss_pred eccCceEEEEecCCC Confidence 999999999988877 No 97 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=100.00 E-value=3.2e-46 Score=270.03 Aligned_cols=270 Identities=11% Similarity=-0.035 Sum_probs=202.8 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCcccc-ccccceeEEEEeeee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKS-ESTATFAPVTAIPRK 79 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~-~~~~~~~~v~l~~~k 79 (311) -.+.+.||++||+++.++|++.+++.|+++++|++++++ +..++|+.++.+.+.|++|+++++ +++++|+++++.+|| T Consensus 81 ~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~~~~i~~~~~~~~a~wv~e~~~~~~~~~~~f~~i~l~~~k 159 (377) T protein:vir:96 81 NVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS-LRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFK 159 (377) T ss_pred cCCCCCCceecCHHHHHHHHHHHHhhhhhhhhceeEecC-CceEEEEecCCcceeEeecccccccccCccceeEeeeeee Confidence 225567899999999999999999999999999999986 568999999999999999998876 578999999999999 Q ss_pred EEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeec--------- Q lcl|Aclame:pro 80 VQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELT--------- 150 (311) Q Consensus 80 l~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~--------- 150 (311) ++++++||+|||+ |+.++++++|+++++++|++++|++|++|+|. + .|.|+.+.....+...... T Consensus 160 l~~~~~is~~ll~---ds~~~le~~i~~~l~~~~~~~~~~a~i~G~G~--~-~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 233 (377) T protein:vir:96 160 LTAFVVIPKDALK---FGPKWLKQFITEQLKEAIAVALELAIVKGNGL--L-QPVGLLKDLSQPTVDQSTGRDITTYKTD 233 (377) T ss_pred EEeechhhHHHhh---cchhhHHHHHHHHHHHHHHHHHhhceEeccCC--C-cceeeeeccccccccccccccccceeec Confidence 9999999999995 55688999999999999999999999999753 2 3455543221111111100 Q ss_pred -----cccccchHHHH---HHHHHHHhhcC-------CCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecce Q lcl|Aclame:pro 151 -----TGTSATPDLAV---EAAVGLVLGDN-------LSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGL 215 (311) Q Consensus 151 -----~~~~~~~~~~i---~~~~~~~~~~~-------~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~ 215 (311) ..+..+.+..+ ..+...+...+ .....|+|||.++..+ .|++.|.+ ..+.+.+++|+ T Consensus 234 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~~------~~~~~~~~--~~G~~~~~l~~ 305 (377) T protein:vir:96 234 KEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTL------EAKFTSRN--QFGEYVTVLPH 305 (377) T ss_pred cccccccccCChhHHHHHHHHHHHhhccccccccccccCceEEEEchhhHHhc------cccccccC--CCCCceeccCC Confidence 00111111112 12222221111 1123599999987754 45666654 23345677787 Q ss_pred eE--EeecccccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEec Q lcl|Aclame:pro 216 NA--AVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYG 293 (311) Q Consensus 216 pv--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~ 293 (311) |+ +.++.+|. ..++||||+.|.+.++.+++++.+++. +|.+|++.||+..|+| T Consensus 306 p~~v~~s~~~p~------------------~~i~fgdf~~Y~i~~r~~~~i~~~~~~-------~~~~d~~~f~~~~r~d 360 (377) T protein:vir:96 306 GITILESLAVET------------------GKAIAFVANRYDAFMATASTIEEYDQT-------FAMEDLQLYLTKNYFY 360 (377) T ss_pred CceEEecCCCCc------------------ccEEEEEcCcEEEEEecccEEEeehhh-------hhhcCCeEEEEEEEEc Confidence 75 45555552 258999999999999999999988653 5999999999999999 Q ss_pred cEEecccceEEEEeccc Q lcl|Aclame:pro 294 IGIMSTDAFAVVRDADE 310 (311) Q Consensus 294 ~~v~~~~a~~~l~~aa~ 310 (311) +++++++||++|+.+-. T Consensus 361 G~~~d~~a~~vl~l~~~ 377 (377) T protein:vir:96 361 GKAKDNHTAALLTLAGG 377 (377) T ss_pred CEEecCCcEEEEEEecC Confidence 99999999999998887 No 98 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=100.00 E-value=5.8e-46 Score=268.59 Aligned_cols=268 Identities=12% Similarity=-0.007 Sum_probs=196.8 Q ss_pred Cc--ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCcccc-ccccceeEEEEee Q lcl|Aclame:pro 1 MV--ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKS-ESTATFAPVTAIP 77 (311) Q Consensus 1 ma--t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~-~~~~~~~~v~l~~ 77 (311) |. +.+.||++||+++.++|++.+++.|+++++|+++++++ ..++|+.++.+.+.|++|+++++ +++++|+++++.+ T Consensus 83 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~l~~~~~v~~~~~-~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~ 161 (383) T protein:vir:78 83 INKEVGYKEETLLPQTVVDEIFEDLTTEHPFLASIGMRTTGL-RTKFLKSETSGVAVWGKIFGEIKGQLDATFSDEESIQ 161 (383) T ss_pred HhccCCCCCccccCHHHHHHHHHHHHhhccceeeeeeEecCC-ceEEEEEcCCcceEEeecccccccccCcceeeEeecc Confidence 33 56778999999999999999999999999999999874 57999999999999999988775 6789999999999 Q ss_pred eeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeec----ccc Q lcl|Aclame:pro 78 RKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELT----TGT 153 (311) Q Consensus 78 ~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~----~~~ 153 (311) ||++++++||+|||+ |+.++++++|++.++++|++++|++|++|+|. ..|.|+.+.+.......... ... T Consensus 162 ~kl~~~i~is~ell~---Ds~~~ie~~i~~~l~~~~a~~~~~a~i~G~G~---~qP~Gil~~~~~~~~~~~~~~~~~~~~ 235 (383) T protein:vir:78 162 NKLTAFVVVPKDLEK---FGPAWVKRFVVTQIEEAFAVALESAYIVGDGN---DKPIGLNRKVGKGSTVVDGVYAEKAAT 235 (383) T ss_pred eeeEeeccchHHHhh---ccHHHHHHHHHHHHHHHHHHHHhhheEeccCC---CCceeeeeccCCccccccccccccccc Confidence 999999999999995 55688999999999999999999999999752 23455543222222111110 111 Q ss_pred ccchHHHHHHHHHHHhhcCCC--------------ccEEEEcHHHHHHHH---HhhccCCceeeccccccCCCceeccee Q lcl|Aclame:pro 154 SATPDLAVEAAVGLVLGDNLS--------------PDGVALDNTFSFMLA---TQRDSQGRKLYPELGFGTDVASFAGLN 216 (311) Q Consensus 154 ~~~~~~~i~~~~~~~~~~~~~--------------~~~~v~n~~~~~~l~---~lkd~~g~~~~~~~~~~~~~~~l~G~p 216 (311) ....++++..+...+...... ...|+|||.++..+. ..++.+|+| .+++|+| T Consensus 236 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~-----------~t~l~~~ 304 (383) T protein:vir:78 236 GTLTFANPKTTVNELTDVYKYHSVKENGHPLNVAGKVTLLVNPTDAWDVKKQYTSLNANGVY-----------VTALPFN 304 (383) T ss_pred chhhhhhhHHHHHHHHHHHhccchhcccchhhhcCceEEEEcCcchhhhccchhccCCCCce-----------eeecCCC Confidence 122334444444443322111 112555654432221 122333333 3556666 Q ss_pred --EEeecccccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEecc Q lcl|Aclame:pro 217 --AAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGI 294 (311) Q Consensus 217 --v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~ 294 (311) ++.++.+|.+ .++||||+.|.+.++.+++++++++. +|.+|++.||+..|+|+ T Consensus 305 ~~iv~s~~~p~~------------------~iifgdfs~Y~i~~r~~~~i~~~~~~-------~f~~d~~~f~~~~r~dG 359 (383) T protein:vir:78 305 LNIIESLFVPEK------------------KAISYVAERYDALIGGPLDIGTYDQT-------LAIEDLNLYAAKQFAYG 359 (383) T ss_pred ceEEecCCCCcc------------------cEEEeeccceEEEecccceEEecchh-------hhhcCceEEEEEEEEcC Confidence 4556666532 58899999999999999999887653 59999999999999999 Q ss_pred EEecccceEEEEecccC Q lcl|Aclame:pro 295 GIMSTDAFAVVRDADES 311 (311) Q Consensus 295 ~v~~~~a~~~l~~aa~~ 311 (311) ++++++||++|+.+.+. T Consensus 360 ~~~~~~A~~vl~~~~~~ 376 (383) T protein:vir:78 360 KAKDDKAAAVWTLNINP 376 (383) T ss_pred EEecCCeEEEEEEEecC Confidence 99999999998877655 No 99 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=100.00 E-value=5e-42 Score=247.01 Aligned_cols=287 Identities=11% Similarity=0.030 Sum_probs=226.1 Q ss_pred Cc-ccCCCceEcchhHHHHHHHHHHhhchhhhhcceee-cCCCceEEEEEeCC----ceeEEeecCccccccccceeEEE Q lcl|Aclame:pro 1 MV-ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEP-QEFGEQQYMTLTAP----PRGEVVGEGAQKSESTATFAPVT 74 (311) Q Consensus 1 ma-t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~-~~~~~~~~p~~~~~----~~a~~v~Eg~~~~~~~~~~~~v~ 74 (311) |- +...||+++|+++ +++++.+++.+++++++++++ +.+....+|+...+ +...|.+|....++++++|++++ T Consensus 14 it~~d~~gG~L~P~~~-~~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g~~~~~~~~~~~~~~~~~~~~~tf~~~~ 92 (314) T protein:vir:41 14 IDVPDLGKGILAVQRF-GEFVREVRENSAIIKDARVLNALKSYEVDISRISLGVELEPGRNTSGTKVAPTADEVTVSTNT 92 (314) T ss_pred cccccCCCceeChHHH-HHHHHHHHhccchhhheeeecccCccceeecccccCcccccccccccCCccCCccccccccee Confidence 32 3445899999987 689999999999999999885 57778899987643 23456777788899999999999 Q ss_pred EeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCC-Ccccccccccccccccc--ceeecc Q lcl|Aclame:pro 75 AIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPL-TGAALSGSPAKILDTTN--IVELTT 151 (311) Q Consensus 75 l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~-~g~~~~~~~~~~~~~~~--~~~~~~ 151 (311) +.+||+...++||+|+|+++.. ..+++++|..++++++++.++.++++|+++. .+.+....+.|+...+. +..... T Consensus 93 l~~~kl~~~v~is~e~L~D~a~-~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~~~~~~~p~G~l~~a~~~~~~~~~ 171 (314) T protein:vir:41 93 LEMKELVTKVVLEDEALEDNIE-QSAFEQTITSLLASGVTYDLECFFLHADSSLTTGRELYRINDGWMKLAGNQYTDAEP 171 (314) T ss_pred eeeEEEEEeecccHHHHHhhhc-hhhHHHHHHHHHHHHHHHHHHHHhhccccCCcCcccchhcchhhhhhcccceeecCc Confidence 9999999999999999975531 3589999999999999999999999997643 22222335556554322 222233 Q ss_pred ccccchHHHHHHHHHHHhhcCCC---ccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccc Q lcl|Aclame:pro 152 GTSATPDLAVEAAVGLVLGDNLS---PDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPE 228 (311) Q Consensus 152 ~~~~~~~~~i~~~~~~~~~~~~~---~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~ 228 (311) .+.....+.+.+++..+.+.+.+ ..+|+||+.+...++++++.+|+++|.+...++.+.+++|+||+..+.||. T Consensus 172 ~~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~l~~~~~~l~~~~~~~~~~~~l~G~PV~~~~~~~~--- 248 (314) T protein:vir:41 172 EDENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQLLVRETGLGDSALIGATGLQYDGIPIQYVPALDA--- 248 (314) T ss_pred cccccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHHHhccCCcccchhhhCCCCceecceeeEecccccc--- Confidence 34445666778888888665443 347999999999999999999999999999999999999999999888873 Q ss_pred cccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEE-EEe Q lcl|Aclame:pro 229 AVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAV-VRD 307 (311) Q Consensus 229 ~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~-l~~ 307 (311) ...++..++||||+.+.+..+.+++++..++ ..++++.+.+..|+|+.+..++|.++ +-. T Consensus 249 ----------~~~~~~~i~fgd~~nlv~~~~~~ir~~~~~~---------a~~~~~~~~~~~r~d~~~~~~~aa~~~~~~ 309 (314) T protein:vir:41 249 ----------LGDDKARALLTVPTNLVYGFWRNIRIEPKRD---------AAMRRTEYIASLRADCNYEDENAAVAAVID 309 (314) T ss_pred ----------cCCCCceEEEechhheEEEeeceeEEeeccc---------CcCCeEEEEEEEEeceEEEEcCcEEEEEee Confidence 3456779999999999998888887776554 35788999999999999998866444 445 Q ss_pred cccC Q lcl|Aclame:pro 308 ADES 311 (311) Q Consensus 308 aa~~ 311 (311) .++| T Consensus 310 ~~~~ 313 (314) T protein:vir:41 310 MSSG 313 (314) T ss_pred ccCC Confidence 5666 No 100 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=100.00 E-value=6.7e-40 Score=235.34 Aligned_cols=282 Identities=10% Similarity=-0.007 Sum_probs=211.0 Q ss_pred Cc-ccCCCceEcchhHHHHHHHHHHhhchhhhhcceee-cCCCceEEEEEeCC----ceeEEeecCccccccccceeEEE Q lcl|Aclame:pro 1 MV-ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEP-QEFGEQQYMTLTAP----PRGEVVGEGAQKSESTATFAPVT 74 (311) Q Consensus 1 ma-t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~-~~~~~~~~p~~~~~----~~a~~v~Eg~~~~~~~~~~~~v~ 74 (311) |- +...||+++|++. +++|+.+++.|+++++|++++ +.+....+++...+ ....|.+|.++.++++++|++++ T Consensus 19 ~t~~d~~Gg~l~P~~~-~~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~~~~~~~g~~~~~~~~~~~~~~~~f~~~~ 97 (315) T protein:vir:41 19 IDVPDLGRGVLSVDRF-GEFVKAVRDSAVIIPEARIDNALKSYEKDISRLSLVLDVGPGRDETGQKLAPPESTAEVKTNT 97 (315) T ss_pred cCCcCCCCceechHHH-HHHHHHHHhhhhhhhhceeeeccccccccccccccCcccccccccccCcCCCCCCccccceee Confidence 22 2335788888876 679999999999999999864 54444556554322 23568889899999999999999 Q ss_pred EeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCcccccccccccccccccee----ec Q lcl|Aclame:pro 75 AIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVE----LT 150 (311) Q Consensus 75 l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~----~~ 150 (311) +.++|+.+.+.||+|+|.++.. .++++++|..++++++++.++.++++|++.. +.+.-..+.|++....... .. T Consensus 98 l~~~~l~~~~~it~elL~D~~~-~~~~e~~l~~~~a~~~a~~~~~~~~nGdg~s-~~p~~~~~~G~l~~a~~~~~~~~~~ 175 (315) T protein:vir:41 98 LYMREMVTKVVIHEDAIEDNIE-GKAFEQKIVTLLGEGISYVLEKYYLHGDTSS-SDPLLRMSDGWLKLASEKLTESDVD 175 (315) T ss_pred eceeeeeeeccccHHHHHhhhc-cccHHHHHHHHHHHHHHHHHHHHhhccCCcC-cCccccccccceecccccccccccc Confidence 9999999999999999965431 2589999999999999999999999996532 2222223444443322111 11 Q ss_pred cccccchHHHHHHHHHHHhhcCCC---ccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccccccc Q lcl|Aclame:pro 151 TGTSATPDLAVEAAVGLVLGDNLS---PDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGP 227 (311) Q Consensus 151 ~~~~~~~~~~i~~~~~~~~~~~~~---~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~ 227 (311) ........+.+.+++..+.....+ ..+|+||+.++..++++||++|+|+|.+....+.+.+|+|+||+..+.||.. T Consensus 176 ~~a~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~rklk~~~g~~lw~~~~~~g~~~tl~G~PV~~~~~m~~~- 254 (315) T protein:vir:41 176 PEAEDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYRDALKGRETGLGDQALTGANSILYDGRPVQYVPALEAL- 254 (315) T ss_pred cccccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHHHHhccCCCccccchhhcCCCceecccceEeccccccc- Confidence 111222345566677666554432 3479999999999999999999999999999999999999999988888732 Q ss_pred ccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccc--eEEE Q lcl|Aclame:pro 228 EAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDA--FAVV 305 (311) Q Consensus 228 ~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a--~~~l 305 (311) ..++..++||||+.+.++++++++++..++. ..+.+.|.+..|+|+.+..+++ ++.+ T Consensus 255 ------------~~~~~~ilf~d~~nl~~~~~~~i~i~~~~~a---------~~~~~~~~~~~r~d~~~~~~~~~a~~~~ 313 (315) T protein:vir:41 255 ------------NDGKSRALFVVPTQLVYGFWRNIKVVPDYDA---------EMRLTKYVASLRTDNHYEDEEGAVSATI 313 (315) T ss_pred ------------CCCCccEEEecccceEEEeccccEEEeeecC---------CCCceEEEEEEEeceeEEeccceeEeee Confidence 3456689999999999999999988876652 3466788889999998776665 6666 Q ss_pred Ee Q lcl|Aclame:pro 306 RD 307 (311) Q Consensus 306 ~~ 307 (311) |. T Consensus 314 ~v 315 (315) T protein:vir:41 314 TV 315 (315) T ss_pred eC Confidence 66 No 101 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=100.00 E-value=8.8e-36 Score=212.76 Aligned_cols=287 Identities=11% Similarity=0.073 Sum_probs=217.1 Q ss_pred Cc--ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeec-C-ccccccccceeEEEEe Q lcl|Aclame:pro 1 MV--ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGE-G-AQKSESTATFAPVTAI 76 (311) Q Consensus 1 ma--t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E-g-~~~~~~~~~~~~v~l~ 76 (311) ++ ...++|++||+++.+++++.+++.++++++++++++.+....+|....++...|+++ + .+.+.++++|+++++. T Consensus 18 ~~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~~~~i~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~ 97 (321) T protein:vir:31 18 ALTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGAKKTRIPTLNIGERHRRPQDEGEWNENESDVSTGTIDIS 97 (321) T ss_pred cccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccCcceeeeeeccCCcccccccccccccccccceeeeeeee Confidence 33 234578899999999999999999999999999999988899999887777788763 3 4566788999999999 Q ss_pred eeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccc--cceeeccccc Q lcl|Aclame:pro 77 PRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTT--NIVELTTGTS 154 (311) Q Consensus 77 ~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~--~~~~~~~~~~ 154 (311) .+|+.+.+.||+|+|.++. ...+++++|...+++++++.++.++++|++..... ....+.|+.... +......... T Consensus 98 ~~k~~~~~~it~e~L~d~a-~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~~-~~~~n~G~l~~a~~~~~~~~~~~~ 175 (321) T protein:vir:31 98 TEKATVAWDLPREVVQENP-EGEALADRILNLMTDAWSADVEDLAANGDEDAEDS-FENQNDGFITVAEGDVETIDAADD 175 (321) T ss_pred eEEEEeehhccHHHHHhhh-cchhHHHHHHHHHHHHHHHHHHhheeeccccCCCc-ccccchhhhhhhcccccccccccc Confidence 9999999999999996543 24689999999999999999999999997532211 111223443321 1222222333 Q ss_pred cchHHHHHHHHHHHhhcCCC-c-cEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccccccc Q lcl|Aclame:pro 155 ATPDLAVEAAVGLVLGDNLS-P-DGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTA 232 (311) Q Consensus 155 ~~~~~~i~~~~~~~~~~~~~-~-~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~ 232 (311) ....+.+.+++..+.....+ + .+|+||+.+...+++.....+.++|.+...++.+.+|.|+||+.+++||.+ T Consensus 176 ~~~~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~l~~~~~~~~~~~l~~~~~~tl~G~pvv~~~~mP~~------ 249 (321) T protein:vir:31 176 ILDNDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYHYTLTDRDTPLGDNVIMGEADVNPFSFPIIGSGLWPDD------ 249 (321) T ss_pred ccCHHHHHHHHHhccHhHhcCCCeEEEechHHHHHHHHHHhcCCCccccchhhccccccccceeEEEcCCCCCC------ Confidence 34456777888887665443 2 369999999988776444455688988888888889999999999998843 Q ss_pred cccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 233 STGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 233 ~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .++++||+.+.+..++++++++.++.... ...++.+......++|+.|.+++|++.++.-.+. T Consensus 250 ------------~il~t~~~nl~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~ve~~~a~a~~~~i~~~ 312 (321) T protein:vir:31 250 ------------KAMFTDPQNLIYALYRDLEIDVLTESDKV----SERDLHARYFMRGDDDFAIENTEAVVLAEGLGDP 312 (321) T ss_pred ------------cEEEeccccEEEEEeeccEEEEeecCccc----cccceeeEeeeeeecceeEeccccEEEEecCCcc Confidence 68999999999999999888876553211 1233445444566899999999999999976555 No 102 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=100.00 E-value=3.8e-36 Score=214.79 Aligned_cols=273 Identities=12% Similarity=0.019 Sum_probs=189.1 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEeeeeE Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPRKV 80 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~kl 80 (311) ......+++++|..+...+...+...++++++++..+.+ ...+|..+....+.|+.||+.+|+++++|+++++.++++ T Consensus 241 ~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~i~--~~~~~~~~~~~~a~~~~eG~~kp~s~~tf~~~~~~~~~i 318 (517) T protein:vir:97 241 LKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLP--TLVVGGDNALTQGTGHTTGTDKTESNITLQTRVLTPQYV 318 (517) T ss_pred cccccccccccchHHHHHHHHhhhhhccceeeeeecccc--ceeeecccccceeeeeecCCcccccccceeeEEeeHhhh Confidence 223445788999999999999999999998887765543 467777777788899999999999999999999999999 Q ss_pred EEEEeecHHHhhcCc-hhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccc-eeeccccccchH Q lcl|Aclame:pro 81 QVTQRFSQEVKWADE-SRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNI-VELTTGTSATPD 158 (311) Q Consensus 81 ~~~i~iS~ell~~s~-~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 158 (311) ++++++|+|||+++. ++...+++||..+|+++++++++.+||+|+|. +....++ ....+. ............ T Consensus 319 a~~~~~S~qll~Ds~~dd~~~l~s~i~~~l~~~l~~~ee~a~l~GdGt--g~~~~gi----~~~a~~~~~~~~~~~~~~~ 392 (517) T protein:vir:97 319 YKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVT--GVSETQI----YPVVGDAWATNVTGTTNIQ 392 (517) T ss_pred hhhhhhhHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHhcccCC--Ccccccc----cccccccccccccccchHH Confidence 999999999997543 22344999999999999999999999999653 2222222 111111 111111111122 Q ss_pred HHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccccccccccccc Q lcl|Aclame:pro 159 LAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYR 238 (311) Q Consensus 159 ~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~ 238 (311) +.+..+...+.. +..+.|+|||.+|..|+++||++|||+|++...++.+.+++|..-... .++.+ T Consensus 393 d~i~~l~~a~~~--a~~a~~vmn~~t~~~I~klKD~~G~Yl~~~~~~~~~~~~l~G~~~~~~-~~~~~------------ 457 (517) T protein:vir:97 393 ELLEKLSVATPK--AADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQ-SVAVD------------ 457 (517) T ss_pred HHHHHHHHHhhh--ccCCEEEECHHHHHHHHHhhcCCCCeeccCcCCcccccccCCcccccc-ccccC------------ Confidence 222222222222 234579999999999999999999999999888888899988422211 11111 Q ss_pred cccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 239 TTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 239 ~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ...++.++.|.+..+.++.+..+. + +.+|+..|+.++|+++.|+.|++|++.....-. T Consensus 458 ------~~~~~~~~~y~i~~~~g~~~~~~f--d-------~~~n~~~f~~~~~~~g~i~~~~r~a~~~~~p~~ 515 (517) T protein:vir:97 458 ------EKTAVSLSGYVTNGSRGMEFEQGT--I-------LVENNKEYLFEMPISGSLEYKGTTAYGTYTPPV 515 (517) T ss_pred ------ceeEeeccccEEEeecceeeeeee--e-------cccCceeEeeeeeeccccccccceEEEEEcCCC Confidence 112233445555445444332111 1 346888899999999999999998887744444 No 103 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=100.00 E-value=1.2e-34 Score=206.54 Aligned_cols=261 Identities=13% Similarity=0.043 Sum_probs=206.9 Q ss_pred Cc--ccCCCceEcchhHHHHHHHHHHhhchhhhhccee----ecCCCceEEEEEeCCceeEEeecCccccccccceeEEE Q lcl|Aclame:pro 1 MV--ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAE----PQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVT 74 (311) Q Consensus 1 ma--t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~----~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~ 74 (311) || +++.+..++|+.|++.|++.+++.+++.+++... ..++..+++|+....+.+.|++||+.++.+++++++++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~~~~~~~~ 80 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQLGFKKTT 80 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCcccccccccceEE Confidence 99 4667789999999999999999999998887652 23445699999988899999999999999999999999 Q ss_pred EeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccc Q lcl|Aclame:pro 75 AIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTS 154 (311) Q Consensus 75 l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 154 (311) +.+++++..+++|+|++.++ ..++.+++.+++++++++++|+.++... .+ .+.. .+. T Consensus 81 ~~~~~~~~~~~itd~~~~~s---~~d~~~~~~~~~~~~~a~~~d~~i~~~~---~~------------a~~~-----~~~ 137 (272) T protein:vir:30 81 MTIKKAGKGVEITDEAILSG---YGDPVGQAAKQIVEAIDHKVDADVLDAL---SK------------STQT-----VEA 137 (272) T ss_pred EEeeeeeeeeeecHHHHhhc---cccHHHHHHHHHHHHHHHHHHHHHHHHh---cc------------cccc-----ccc Confidence 99999999999999998544 4678999999999999999999998531 00 0000 112 Q ss_pred cchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccC--C-ceeeccccccCCCceecceeEEeeccccccccccc Q lcl|Aclame:pro 155 ATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQ--G-RKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVT 231 (311) Q Consensus 155 ~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~--g-~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~ 231 (311) ...++.+.++...+...+.....|+|||.++..|++.+..+ + .....+....+..++++|+||++++.+|.+ T Consensus 138 ~~t~d~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s~~~p~~----- 212 (272) T protein:vir:30 138 TATVDGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRSRKCPKG----- 212 (272) T ss_pred ccCHHHHHHHHHHHhccCCCccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEcCCCCcc----- Confidence 23477888898999888888889999999999998764221 1 112223334455689999999999999844 Q ss_pred ccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 232 ASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 232 ~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .+++.+.+.+.+..+++++++..|+. .++...++...||++++.+|+++++++.++++ T Consensus 213 -------------t~~~~~~~a~~~~~~~~~~ve~~r~~---------~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~ 270 (272) T protein:vir:30 213 -------------TAYMVRKGALRIMLKRNTMVETDRDI---------TKAINQIVANKHYGVYLYKAEKAVKITLKDAA 270 (272) T ss_pred -------------eEEEEcCCeEEEEecCCceeeecccc---------ccceeEEEEEEEEEEEEEcCCceEEEEecccc Confidence 34555667777778888888877653 23557788889999999999999999999999 No 104 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=100.00 E-value=1.2e-34 Score=206.54 Aligned_cols=261 Identities=13% Similarity=0.043 Sum_probs=206.9 Q ss_pred Cc--ccCCCceEcchhHHHHHHHHHHhhchhhhhccee----ecCCCceEEEEEeCCceeEEeecCccccccccceeEEE Q lcl|Aclame:pro 1 MV--ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAE----PQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVT 74 (311) Q Consensus 1 ma--t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~----~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~ 74 (311) || +++.+..++|+.|++.|++.+++.+++.+++... ..++..+++|+....+.+.|++||+.++.+++++++++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~~~~~~~~ 80 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQLGFKKTT 80 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCcccccccccceEE Confidence 99 4667789999999999999999999998887652 23445699999988899999999999999999999999 Q ss_pred EeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccc Q lcl|Aclame:pro 75 AIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTS 154 (311) Q Consensus 75 l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 154 (311) +.+++++..+++|+|++.++ ..++.+++.+++++++++++|+.++... .+ .+.. .+. T Consensus 81 ~~~~~~~~~~~itd~~~~~s---~~d~~~~~~~~~~~~~a~~~d~~i~~~~---~~------------a~~~-----~~~ 137 (272) T protein:vir:98 81 MTIKKAGKGVEITDEAILSG---YGDPVGQAAKQIVEAIDHKVDADVLDAL---SK------------STQT-----VEA 137 (272) T ss_pred EEeeeeeeeeeecHHHHhhc---cccHHHHHHHHHHHHHHHHHHHHHHHHh---cc------------cccc-----ccc Confidence 99999999999999998544 4678999999999999999999998531 00 0000 112 Q ss_pred cchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccC--C-ceeeccccccCCCceecceeEEeeccccccccccc Q lcl|Aclame:pro 155 ATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQ--G-RKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVT 231 (311) Q Consensus 155 ~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~--g-~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~ 231 (311) ...++.+.++...+...+.....|+|||.++..|++.+..+ + .....+....+..++++|+||++++.+|.+ T Consensus 138 ~~t~d~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s~~~p~~----- 212 (272) T protein:vir:98 138 TATVDGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRSRKCPKG----- 212 (272) T ss_pred ccCHHHHHHHHHHHhccCCCccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEcCCCCcc----- Confidence 23477888898999888888889999999999998764221 1 112223334455689999999999999844 Q ss_pred ccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 232 ASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 232 ~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .+++.+.+.+.+..+++++++..|+. .++...++...||++++.+|+++++++.++++ T Consensus 213 -------------t~~~~~~~a~~~~~~~~~~ve~~r~~---------~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~ 270 (272) T protein:vir:98 213 -------------TAYMVRKGALRIMLKRNTMVETDRDI---------TKAINQIVANKHYGVYLYKAEKAVKITLKDAA 270 (272) T ss_pred -------------eEEEEcCCeEEEEecCCceeeecccc---------ccceeEEEEEEEEEEEEEcCCceEEEEecccc Confidence 34555667777778888888877653 23557788889999999999999999999999 No 105 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=99.96 E-value=2.4e-32 Score=193.90 Aligned_cols=259 Identities=10% Similarity=-0.037 Sum_probs=165.2 Q ss_pred Cc-ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCcccccc--ccceeEEEEe- Q lcl|Aclame:pro 1 MV-ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSES--TATFAPVTAI- 76 (311) Q Consensus 1 ma-t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~--~~~~~~v~l~- 76 (311) +. ...++++ +|+.+...+.......+++...++.. ..++....|++|+...++. ..++.+.++. T Consensus 212 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~g~~~~~~~~e~~~~~~~~~~~~~~~~~~~~ 279 (480) T protein:vir:40 212 DLNVVNSLGS-ITSKYARKSGIYDGAMKARFQGLTLA-----------EDGVDDTFISGTFKAGTDKNKSQTATKRSLRP 279 (480) T ss_pred cccccccccc-cccchhhheeechhhhhhhhhcceee-----------eccccceeeeeeeecccccccccccccchhhH Confidence 11 2222333 44444444444444444443333221 1234456677776544432 2233444444 Q ss_pred --eeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccc Q lcl|Aclame:pro 77 --PRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTS 154 (311) Q Consensus 77 --~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 154 (311) .++++.....|.+++ +|+ .+|++||..++++.++++++.+|++|++++. +.+.+++. ..+.. ... T Consensus 280 ~~v~~l~~~~k~t~~lL---DDa-~~l~~~i~~~l~~~~~~~ee~a~l~G~g~g~-~~~~g~~~----~~~~~----~~~ 346 (480) T protein:vir:40 280 QMAEAYLQMDKATVRGV---NDS-GALSEYVMSEMVNRVIQKVEYNMILGSVDGS-NGFYGLKT----ATDGW----TKQ 346 (480) T ss_pred HHHHHHHHhHHHHHHHh---hhh-HHHHHHHHHHHHHHHHHHHHHHhhccCCCCc-ccccccee----ecccc----ccc Confidence 467888888888887 344 4799999999999999999999999954332 23333322 11111 111 Q ss_pred cchHHHHHHHHHHHhhcCCCcc-EEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEee-cccccccccccc Q lcl|Aclame:pro 155 ATPDLAVEAAVGLVLGDNLSPD-GVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVS-DTVRGGPEAVTA 232 (311) Q Consensus 155 ~~~~~~i~~~~~~~~~~~~~~~-~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~-~~~~~~~~~~~~ 232 (311) ....+.+..++.++......++ .|+|||.+|..|++|||++|||||++....+.+.+|+|+||+++ .++|.+.+. T Consensus 347 ~~~~d~id~L~~al~~~y~~~a~~~vmn~~t~~~I~klKD~~G~Yi~q~~~~~~~~~~llG~pvv~~~~~~~~~~~~--- 423 (480) T protein:vir:40 347 IEYTDLFEGITDAVAECSISDAITIVMSPQTFAELRKAKGTDGHSRFNELATKEQIAQSFGAVNLETRVWMPKDEVA--- 423 (480) T ss_pred chhHHHHHHHHHhhhHHhhCCCCEEEECHHHHHHHHHhhcCCCCeeccCcccccCcceecccceeeeeccccCCcce--- Confidence 2223455556656655544444 69999999999999999999999999999999999999998754 455543221 Q ss_pred cccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 233 STGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 233 ~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) +.+....+++||++. +..+.- .++.++..|+++.|+++.+.+|+|+.++|.+.+= T Consensus 424 ------~~~~~~~~~~~d~~~-----------~~~~~~-------~~~~~~~~~~~e~~v~g~~~~~~~~~~~~~~~~~ 478 (480) T protein:vir:40 424 ------VYNHDEYVLIGDLNV-----------ENYNDF-------DLRYNVEQWLSETLVGGSIRGKNRSAYLKKKGSL 478 (480) T ss_pred ------eeeCCccEEEEeccc-----------ceeccc-------ccccchhhhhhhhhhceeeEccccEEEEEeccCc Confidence 223345677787641 111110 1457788899999999999999999999999988 No 106 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=99.92 E-value=2.6e-26 Score=160.85 Aligned_cols=260 Identities=13% Similarity=0.088 Sum_probs=197.0 Q ss_pred Cccc--CCCceEcchhHHHHHHHHHHhhchhhhhcceeec----CCCceEEEEEeCCceeEEeecCccccccccceeEEE Q lcl|Aclame:pro 1 MVAL--ATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQ----EFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVT 74 (311) Q Consensus 1 mat~--~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~----~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~ 74 (311) ||.. .-+..++|+.|.+.+.+.+.+...+.+++..... ++..+++|++...+++.++.||+.++.++.++++.+ T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~eg~~i~~~~it~~~~~ 80 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCccceehhheechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCcccccCCCcccccccccceeE Confidence 9844 4578899999999999999999888888865321 233689999987788999999999999999999999 Q ss_pred EeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccc Q lcl|Aclame:pro 75 AIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTS 154 (311) Q Consensus 75 l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 154 (311) +..++.+..+.++++...++ ..++.+.+.+++++++++++|+.++..-. + .+. ..... T Consensus 81 ~~i~~~~~~~~i~D~~~~~~---~~d~~~~~~~~~~~~~a~~~d~~~~~~~~---~------------a~~----~~~~~ 138 (274) T protein:vir:93 81 AKIRKIAKGTSITDEALLSG---YGDPQGEQVRQHGLAHANKVDNDVLEALM---G------------AKL----TVNAD 138 (274) T ss_pred EEeeeecccccccHHHHHhh---ccchHHHHHHHHHHHHHHHHHHHHHHHHh---c------------ccc----ccccc Confidence 99999998999999977544 35678899999999999999999985421 0 000 01122 Q ss_pred cchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCcee----e-ccccccCCCceecceeEEeeccccccccc Q lcl|Aclame:pro 155 ATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKL----Y-PELGFGTDVASFAGLNAAVSDTVRGGPEA 229 (311) Q Consensus 155 ~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~----~-~~~~~~~~~~~l~G~pv~~~~~~~~~~~~ 229 (311) ...++.+.++..++...+.....++|||..+..|++.. .-+++ . .+....+..++++|++|++++.+|.+ T Consensus 139 ~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~--~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~--- 213 (274) T protein:vir:93 139 ITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDA--STNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAG--- 213 (274) T ss_pred ccCHHHHHHHHHHhhhccCCccEEEeCHHHHHHHHhhh--hhcccccccccccceeecccceecCeeEEEcCCCCcc--- Confidence 34577888899998887778888999999999997631 11111 0 11123345678999999999998843 Q ss_pred ccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecc Q lcl|Aclame:pro 230 VTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDAD 309 (311) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa 309 (311) ..++.....+.+...+++.++..|+.. +..-.+++..+|++++++|+++++++.++ T Consensus 214 ---------------t~~l~~~gai~~~~~~~~~vE~~Rd~~---------~~~d~i~~~~~y~~~~~~~~~~v~~t~~~ 269 (274) T protein:vir:93 214 ---------------TAILAKKGAVKLILKRDFFLEVARDAS---------TKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) T ss_pred ---------------eEEEEeCCeEEEEecCCcccccccchh---------hcccEEEEEEEEEEEEEcCCceEEEeeCc Confidence 234445566666667777777666532 23357888899999999999999999999 Q ss_pred cC Q lcl|Aclame:pro 310 ES 311 (311) Q Consensus 310 ~~ 311 (311) .| T Consensus 270 ~s 271 (274) T protein:vir:93 270 GS 271 (274) T ss_pred cc Confidence 99 No 107 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=99.90 E-value=2.6e-26 Score=160.82 Aligned_cols=299 Identities=14% Similarity=0.137 Sum_probs=207.7 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCccccccc---cceeEEEEee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSEST---ATFAPVTAIP 77 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~---~~~~~v~l~~ 77 (311) .||+.++..+||..+++.|+|.+++.....+|...+...+|...+..--+.-.++-++||+++|+.. .+++.++++. T Consensus 73 ~mtt~~a~IliP~vis~v~~Eaaepl~~~~kl~qk~~L~~Grsm~F~~~g~~Ra~~IgEGgE~~~~sld~~T~dsv~~~~ 152 (393) T protein:vir:79 73 FMATPSAQILIPRVIVGTMREAAEPLYIGTKMLQKIRLKSGQSMIFPSIGIMRAYDVAEGQEIPEDSIDWQTHESPEIRV 152 (393) T ss_pred hhcCCCcceechhhhhhhhhhcccchhHHHHHHHHHhhhcCcceeccchheeeeccccccccccccchhhhcCCceeEEe Confidence 5677899999999999999999999999999998888876654444334466788899999999865 5578999999 Q ss_pred eeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCcccccccccccccc-ccceeeccccccc Q lcl|Aclame:pro 78 RKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDT-TNIVELTTGTSAT 156 (311) Q Consensus 78 ~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~-~~~~~~~~~~~~~ 156 (311) +|.+..+.+|+|++ +|+++|+..++...+.++|+|+.|+.++++.-..+.+...+++++-..- ++-.--+.-.... T Consensus 153 gK~G~~Ia~SqEmI---sDSg~Dvin~~l~aA~RaMaRkKee~a~n~fk~~ghtvfDa~st~t~ahptGr~~~~~qNGTl 229 (393) T protein:vir:79 153 GKSGIRLRFTDEMI---SDSQWDLMSMMIKQAGRAMGRHKEQKAYHQFRSHGHTVFDNYSTNKLAHTTGLDKNGVQNDTF 229 (393) T ss_pred chhhhhhhhHHHHh---hcchHHHHHHHHHHHHHHHHhhhHHHHHhhhhcccceeeeccccCccceeecCCccccccccc Confidence 99999999999999 6789999999999999999999999999996555554444443321111 1100001122334 Q ss_pred hHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHh---hccCCceeeccccccCCCceecceeEEeeccccccccccccc Q lcl|Aclame:pro 157 PDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQ---RDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTAS 233 (311) Q Consensus 157 ~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~l---kd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~ 233 (311) ..+++.++..++++++++++.++|||-.|..+.|- .....++.-+-++.+....+..| |-.+-+.+|.+..+...+ T Consensus 230 SleDllDm~~av~~~hyt~svi~MHPLAWnv~AKna~me~~~~na~gN~~~~~~~ts~alg-p~~i~~~~~~nlnv~~sP 308 (393) T protein:vir:79 230 SAEDFLDLIIAVMANEYTPSDLMMHPLAWTVFAKNELMGSLQANPYGNYPAKGAPSSMALG-PDSIQGRLPFNFNVNLSP 308 (393) T ss_pred cHHHHHHHHHHHhcccCCcceEEEcCchhhhhhhhhhhcceeeccccccCccccchhhhhc-hhhhccccccceeEEEec Confidence 56788888889999999999999999999999762 22222222221111111122222 223333344444444444 Q ss_pred ccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecc-cceEEEEecccC Q lcl|Aclame:pro 234 TGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMST-DAFAVVRDADES 311 (311) Q Consensus 234 ~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~-~a~~~l~~aa~~ 311 (311) ...+.......+++..|.....+...++ .|+++++.+ ..+|...++..+|||++|++. +|+++.++-.-+ T Consensus 309 fvp~d~k~~rFd~~~Vd~NnvgvlLV~D-~i~tdq~dd-------k~rdiq~iKl~ERYG~gvLn~gkaiavakNI~~~ 379 (393) T protein:vir:79 309 FIPLDKKSRRFDVYAVDRNNVGVLLVRD-DLKTDQWDE-------KARGLQNIKMIERYGIGILNEGKAIAVAKNISMD 379 (393) T ss_pred ccccccccceeeEEEeecCCceEEEEec-Ccceecccc-------ccccceeeeeeeeeceeeeeCCceEEEEecceee Confidence 4444444555577777777666554444 555555533 457888999999999999997 678877765555 No 108 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=99.89 E-value=6.5e-25 Score=153.20 Aligned_cols=291 Identities=15% Similarity=0.132 Sum_probs=217.1 Q ss_pred Cc--ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCccccccc-cceeEEEEee Q lcl|Aclame:pro 1 MV--ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSEST-ATFAPVTAIP 77 (311) Q Consensus 1 ma--t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~-~~~~~v~l~~ 77 (311) |. |...++.+.|..+...|||.+.+.+.++++.+...+.++.+++++...-+.+.|+..++.++++. .+|.+++... T Consensus 25 m~alTLaea~~l~~d~~~~~VIE~l~~~s~iL~~lpf~~ve~~~~~~~r~~~lp~a~~r~~n~~~~~~~~~Tf~q~t~~l 104 (330) T protein:vir:94 25 MPTVTLAESAKLSQDHLVSGLIETIVEVNPLYEMMPFTEIEGNALAYNRENVLGDVQFLAVGGTITAKNPATFTKVTSEL 104 (330) T ss_pred hhhhhhhHHhhcCchhhHHHHHHhhhccchHHhhcccccccCCcceeeeeecCCcceeeeccccccccCcceeeeeeech Confidence 66 45557889999999999999999999999999888888999999999999999999999988765 5799999999 Q ss_pred eeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccch Q lcl|Aclame:pro 78 RKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATP 157 (311) Q Consensus 78 ~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (311) +.+++.+.|.+++..-. .+..+...+..+...+++.++.+.++|||+.. ...+.|+... .+..+.......+...+ T Consensus 105 ~~l~~~~~Vd~~iadl~-g~~~d~~~~q~~~~ieal~~~~e~~linGDs~--~~~F~GL~~~-~~~~q~i~tg~~gg~~T 180 (330) T protein:vir:94 105 TTLIGDAEVNGLIQATR-SDFMDQTSVQVASKAKSIGRQYQASMITGDGT--GNSFQGMMGL-VAASQTISAGANGGTLT 180 (330) T ss_pred hhhhhhHHHHHHHHHhc-CCHHHHHHHHHHHHHHHHHHHHHHHhhccCCC--Cccccchhhc-CCcccEEecCCCCCCCC Confidence 99999999999986322 34567788888899999999999999999543 3455555443 34445555444445566 Q ss_pred HHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeecc---ccccCCCceecceeEEeecccccccccccccc Q lcl|Aclame:pro 158 DLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPE---LGFGTDVASFAGLNAAVSDTVRGGPEAVTAST 234 (311) Q Consensus 158 ~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~---~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~ 234 (311) .++++.++.++...+..+..|+||+....+|+.+....|++-..+ ...+....++.|+|+...+.+|.+.... T Consensus 181 ~d~LDeLl~~v~~~~g~~~~~l~n~a~~r~I~a~~R~~~~~~v~~~~~~~~G~~v~~~~GvPi~~~d~ip~~~~~~---- 256 (330) T protein:vir:94 181 FELLDQLLDLVKDKDGQVDYLMSSFAMRRKYFSLLRALGGAAIGEVMTLPSGRQIPTYRGVPWFVNDFIPSNMTQG---- 256 (330) T ss_pred HHHHHHHHHHhcCCCCCCcEEEechhHHHHHHHHHHhccCCCCCCcccccCCCEEeeeCCeEEEecccccCCCCcc---- Confidence 788888888887777788899999999999999988777654322 2335555678999999999998764321 Q ss_pred cccccccccceEEEeecc-----eEEEEee----cCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEE Q lcl|Aclame:pro 235 GVYRTTNPNVKAIAGDFS-----AFRWGVQ----VSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVV 305 (311) Q Consensus 235 ~~~~~~~~~~~~~~gd~~-----~~~~~~~----~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l 305 (311) ...+...|++..|. +-..+.. .++.+... +..-..+...+|.+.||+.++.+|+|+.+| T Consensus 257 ----~~~~ttsIyav~~G~~~~~qgV~Gl~~~g~~glsVr~~--------G~~~~k~v~~~~v~~y~~~av~~~~a~~~L 324 (330) T protein:vir:94 257 ----TATNATAIFAGTFDDGSNKYGIAGLTARGSAGLRVQNV--------GAKENADETITRVKMYCGFANFSQLGLAAI 324 (330) T ss_pred ----cCCCceeEEEEeecccccccceEeecCCCCCcceeeeC--------CCccccceeeEEEEEeeeeEEechhheeee Confidence 12333455555543 1222221 23333211 112245677899999999999999999999 Q ss_pred EecccC Q lcl|Aclame:pro 306 RDADES 311 (311) Q Consensus 306 ~~aa~~ 311 (311) ++-.=- T Consensus 325 ~~V~~g 330 (330) T protein:vir:94 325 KGLIPG 330 (330) T ss_pred ccccCC Confidence 855444 No 109 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=99.89 E-value=1.4e-24 Score=151.45 Aligned_cols=264 Identities=12% Similarity=0.071 Sum_probs=194.9 Q ss_pred Cc--ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeec----CCCceEEEEEeCCceeEEeecCccccccccceeEEE Q lcl|Aclame:pro 1 MV--ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQ----EFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVT 74 (311) Q Consensus 1 ma--t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~----~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~ 74 (311) || .+.-...++|+.|.+.|.+.+.+...+.+++..-.. ++..+++|.+.....+.++.||++++.++.+.++.+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~~~~eg~~i~~~~lt~~~~~ 80 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAEGGEISLDKIGTTTKS 80 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeeccCccccccCCCCccChhhcCCccee Confidence 99 555678899999999999999999988898865432 334589999988888899999999999999999999 Q ss_pred EeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccc Q lcl|Aclame:pro 75 AIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTS 154 (311) Q Consensus 75 l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 154 (311) +..++.+..+.++++...++ ..|+.+.+.++++.++++++|+.++..- .+. .. ..+. T Consensus 81 ~~i~~~~k~~~vtD~~~~~~---~~d~~~~~~~~~a~~~a~~~d~~i~~~l---~~~------------~~-----~~~~ 137 (272) T protein:vir:36 81 VTIKKAAKGTEITDEAALSG---YGDPIGESNKQLGLSLANKVDDDLLSAA---KTT------------SQ-----TVST 137 (272) T ss_pred EeeehhhccccccHHHHhhc---cchHHHHHHHHHHHHHHHHHHHHHHHHh---ccc------------cc-----cccc Confidence 99999999999999876543 3567889999999999999999988431 110 00 0112 Q ss_pred cchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCc--eeeccccccCCCceecceeEEeecccccccccccc Q lcl|Aclame:pro 155 ATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGR--KLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTA 232 (311) Q Consensus 155 ~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~--~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~ 232 (311) ...++.+.++...+...+..+..++|||..+..|++....... ....+....+..++++|++|++++.+|.+..... T Consensus 138 ~~~~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~p~~~~~~~- 216 (272) T protein:vir:36 138 KANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMF- 216 (272) T ss_pred cccHHHHHHHHHHhhhcCCCceEEEEcHHHHHHHhcccccccccccccccceeeeccceecCeeEEEeCCCCCCceeEE- Confidence 3456788899999988888888999999999999875432211 1111122234457899999999999996543211 Q ss_pred cccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccc Q lcl|Aclame:pro 233 STGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADE 310 (311) Q Consensus 233 ~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~ 310 (311) .++++. ..+.+...+++++|..|+.. +..-.+++..+|+.++.+|+++++++.+-= T Consensus 217 ------------~~~~~~-gA~~~~~~~~~~vE~~R~~~---------~~~d~i~~~~~y~~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 217 ------------KIVSNS-PALKLVLKRGVQVETDRDIV---------TKTTVITADEHYAAYLYDLTKVVNITFTGV 272 (272) T ss_pred ------------EEEecc-cceeeeecCCcccccccchh---------hcCcEEEEEEEEEEEEEcCccEEEEeecCC Confidence 233332 33444556677777666532 122367888999999999999999997777 No 110 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=99.88 E-value=6.5e-24 Score=147.71 Aligned_cols=260 Identities=13% Similarity=0.097 Sum_probs=192.9 Q ss_pred Ccc--cCCCceEcchhHHHHHHHHHHhhchhhhhcceee----cCCCceEEEEEeCCceeEEeecCccccccccceeEEE Q lcl|Aclame:pro 1 MVA--LATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEP----QEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVT 74 (311) Q Consensus 1 mat--~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~----~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~ 74 (311) ||+ +..+..++|+.|+..+.+.+.+...+.+++.... .++..+++|++...+.+....||+.++..+.++++.+ T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~~g~~i~~~~it~~~~~ 80 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) T ss_pred CCccccchhhhhhhHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCccccCCCCcCchhhcccceeE Confidence 994 4557899999999999999998888888875532 1244589999987778888899999999999999999 Q ss_pred EeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccc Q lcl|Aclame:pro 75 AIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTS 154 (311) Q Consensus 75 l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 154 (311) +..++.+..+.++++...++ ..|+.+.+.+++++++++++|+.++.--. + ++. ..... T Consensus 81 ~~i~~~~~~~~i~D~~~~~~---~~d~~~~~~~~~~~~~a~~~d~~i~~~l~---~------------a~~----~~~~~ 138 (274) T protein:vir:96 81 AKVRKIGKGTELTDEAVLSG---FGDPQGEAVRQHGLAIANKVDNDVLEALK---G------------ATL----TVEAD 138 (274) T ss_pred EEEEeeeceeeecHHHHHhh---cchHHHHHHHHHHHHHHHHHHHHHHHHHh---c------------CCC----CcCcc Confidence 99999888899999976443 45678899999999999999999885411 0 000 01112 Q ss_pred cchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceee-----ccccccCCCceecceeEEeeccccccccc Q lcl|Aclame:pro 155 ATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLY-----PELGFGTDVASFAGLNAAVSDTVRGGPEA 229 (311) Q Consensus 155 ~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~-----~~~~~~~~~~~l~G~pv~~~~~~~~~~~~ 229 (311) ...++.+.++..++...+.....++|||..+..|++... .+++- ......+..++++|++|++++.+|.+. T Consensus 139 ~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~--~~f~~~~~~g~~~~~~g~ig~~~G~~Vi~s~~~p~~t-- 214 (274) T protein:vir:96 139 ITKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSAS--DNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGE-- 214 (274) T ss_pred cccHHHHHHHHHHhcccCCCceEEEeCHHHHHHHHhccc--ccccccccccccceeecccceecCeeEEEcCCCCcce-- Confidence 345778889988888777778889999999999987531 11110 111223456889999999999998542 Q ss_pred ccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecc Q lcl|Aclame:pro 230 VTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDAD 309 (311) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa 309 (311) .++.....+.+....+++++..|... +..-.+++..+||.++++|++++++++++ T Consensus 215 ----------------~~l~~~gA~~~~~~~~~~vE~~Rd~~---------~~~d~i~~~~~yg~~~~~~~~vv~~t~~~ 269 (274) T protein:vir:96 215 ----------------ALLAKKGAVKLITKRDFFLEKDRDAS---------RKSTALYSDKHYVAYLYDESKVVKITKGA 269 (274) T ss_pred ----------------EEEEeCcceeeeecCCcccccccchh---------hcccEEEEeeEEEEEEEcCccEEEEEcCc Confidence 23333455666667777777655432 23346777899999999999999998877 Q ss_pred cC Q lcl|Aclame:pro 310 ES 311 (311) Q Consensus 310 ~~ 311 (311) +- T Consensus 270 ~~ 271 (274) T protein:vir:96 270 GD 271 (274) T ss_pred cc Confidence 76 No 111 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=99.88 E-value=6.2e-24 Score=147.82 Aligned_cols=268 Identities=11% Similarity=0.045 Sum_probs=189.1 Q ss_pred Cc--ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeec----CCCceEEEEEeCCceeEEeecCccccccccceeEEE Q lcl|Aclame:pro 1 MV--ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQ----EFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVT 74 (311) Q Consensus 1 ma--t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~----~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~ 74 (311) || ++..+..++|+.|++.+.+.+++..++.+++..... ++..+++|++...+.+.++.|++.++..+.++++.+ T Consensus 1 Ma~~~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~~~~~g~~i~~~~lt~~~~~ 80 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQDVAEGAAIDYSALETESVK 80 (278) T ss_pred CCCcceehhheecHHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeeccCCcceeecCCCcCcccccccceee Confidence 88 477789999999999999999999888888754322 234589999987778889999999999999999999 Q ss_pred EeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccc Q lcl|Aclame:pro 75 AIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTS 154 (311) Q Consensus 75 l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 154 (311) +..++.+..+.++++...++ ..|+.+.+.+++++++++++|+.++..-. +.. .... + ..+.... T Consensus 81 ~~i~~~~~a~~v~D~~~~~~---~~d~~~~~~~~~a~~~a~~~d~~l~~~l~---~a~-----~~~~-~----~~t~~~~ 144 (278) T protein:vir:80 81 HGIKKAGKGVKLTDESVLSG---YGDPVEEAQKQIRMAIASKVDNDILEEAL---TTT-----LEVK-G----AINIGLI 144 (278) T ss_pred EeeehhhccccccHHHHhhc---cccHHHHHHHHHHHHHHHHHHHHHHHHHh---ccc-----cccc-c----ccccchh Confidence 99999888899999876443 45688999999999999999999885421 100 0000 0 0111112 Q ss_pred cchHHHHHHHHHHHhhcCCC-ccEEEEcHHHHHHHHHhhccCC--c-eeeccccccCCCceecceeEEeecccccccccc Q lcl|Aclame:pro 155 ATPDLAVEAAVGLVLGDNLS-PDGVALDNTFSFMLATQRDSQG--R-KLYPELGFGTDVASFAGLNAAVSDTVRGGPEAV 230 (311) Q Consensus 155 ~~~~~~i~~~~~~~~~~~~~-~~~~v~n~~~~~~l~~lkd~~g--~-~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~ 230 (311) ...++.+.++..++...+.. ...++|||..+..|++....+. . .+-.+....+..+++.|++|++++.+|.+. T Consensus 145 ~~~~~~~~da~~~l~~~~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t--- 221 (278) T protein:vir:80 145 DKIENTFTDAPDAIEDESITTTGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVKGAFGELLGWEIVRTKKLADGN--- 221 (278) T ss_pred hhHHHHHHHHHHhhcccCCCcccEEEECHHHHHHHHhhhhhhccccccccccceeeccceeecceeEEEcCCCCcce--- Confidence 23355666666666554444 3358899999999987532111 0 111122234556899999999999998432 Q ss_pred cccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccc Q lcl|Aclame:pro 231 TASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADE 310 (311) Q Consensus 231 ~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~ 310 (311) .+++ .-..+.+...+.++++..|+.. +..-.+++..+|+.++++|++++++++.|. T Consensus 222 --------------~~l~-~~gAi~~~~~~~~~vE~~Rd~~---------~~~d~i~~~~~yg~~v~~~~~~v~it~~a~ 277 (278) T protein:vir:80 222 --------------ALAV-KAGALKTFLKRNLLAESGRDMD---------HKLTKFNADQHYAVALVDETKAVKVVPVAG 277 (278) T ss_pred --------------EEEE-eccceeeeecCCcccccccchh---------hccceeeeeeEEEEEEEcCcceEEEeeccC Confidence 1222 2344555556677777665432 223467778999999999999999998888 Q ss_pred C Q lcl|Aclame:pro 311 S 311 (311) Q Consensus 311 ~ 311 (311) - T Consensus 278 ~ 278 (278) T protein:vir:80 278 N 278 (278) T ss_pred C Confidence 8 No 112 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=99.87 E-value=8.6e-24 Score=147.04 Aligned_cols=262 Identities=15% Similarity=0.087 Sum_probs=197.6 Q ss_pred Cc--ccCCCceEcchhHHHHHHHHHHhhchhhhhcceee----cCCCceEEEEEeCCceeEEeecCccccccccceeEEE Q lcl|Aclame:pro 1 MV--ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEP----QEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVT 74 (311) Q Consensus 1 ma--t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~----~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~ 74 (311) || ++.-...++|+.|.+-|.+.+.+...+.+++..-. .++..+++|.++....+.++.||++++..+.+.++.+ T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg~~i~~~~lt~~~~~ 80 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEGQKIPVDKIETNRRE 80 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCCCccccccCCCccCccccccceee Confidence 99 66668899999999999999999999999986532 3455689999988888999999999999999999999 Q ss_pred EeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccc Q lcl|Aclame:pro 75 AIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTS 154 (311) Q Consensus 75 l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 154 (311) ...++.+..+.++++....+ ..|..+.+.++++.++++++|+.++.-- .+ ++... ... T Consensus 81 a~i~~~~k~~~~tD~a~~~~---~~dp~~~~~~~~~~~~a~~~d~~~~~~l---~~------------~~~~~----~~~ 138 (276) T protein:vir:10 81 AKIHKIGKGTDITDEALLSG---YGDPQGEAVRQHGLAIANKVDNDVLEAL---RG------------TKLTV----SAD 138 (276) T ss_pred EEeehccccccccHHHHHhh---ccchHHHHHHHHHHHHHHHHHHHHHHHH---hc------------ccccc----ccc Confidence 99999999999999977544 3457788999999999999999987421 00 01001 112 Q ss_pred cchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCce---eeccccccCCCceecceeEEeeccccccccccc Q lcl|Aclame:pro 155 ATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRK---LYPELGFGTDVASFAGLNAAVSDTVRGGPEAVT 231 (311) Q Consensus 155 ~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~---~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~ 231 (311) ...++.+.++..++...+....+++|||..+..|+++.+.+-.. .-.+....+..++++|++|++++.+|.+. T Consensus 139 ~~t~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t---- 214 (276) T protein:vir:10 139 IGTLAGLEAAIDTFDDEDLEPMVLFINPKDAGKLRSSASDNFTRATELGDNIIVKGAFGEALGAVIVRSKKLDEGE---- 214 (276) T ss_pred ccCHHHHHHHHHHhccccCcccEEEEcHHHHHHHHHhccccccccccccccceeccccceecceeEEEcCCCCcce---- Confidence 24578888999998877778888999999999998754222100 00111233446789999999999987432 Q ss_pred ccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 232 ASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 232 ~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .++..-..+.+...++++++..|... +..-.+++..+|+.++.+|+.++++++++-| T Consensus 215 --------------~~l~~~gAi~~~~~~~~~vE~dRd~~---------~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~ 271 (276) T protein:vir:10 215 --------------AILAKRGAVKLITKRDFFLETDRDPS---------TKTTALYSDKHYVAYLYDESKAVKVTKGAGT 271 (276) T ss_pred --------------EEEEeccceeeeecCCceeecccchh---------hcccEEEEeeEEEEEEEcCcceEEEecCCcC Confidence 22333445556667788888776542 2234667788999999999999999998888 No 113 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=99.87 E-value=2.6e-23 Score=144.45 Aligned_cols=258 Identities=14% Similarity=0.107 Sum_probs=193.3 Q ss_pred Ccc--cCCCceEcchhHHHHHHHHHHhhchhhhhcceee----cCCCceEEEEEeCCceeEEeecCccccccccceeEEE Q lcl|Aclame:pro 1 MVA--LATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEP----QEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVT 74 (311) Q Consensus 1 mat--~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~----~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~ 74 (311) |+. +.-+..++|+.|...+.+.+++.....+++..-. .++..+++|++...+.+..+.||+.++..+.+.++.+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCcccccccccceeE Confidence 884 4457899999999999999988888888876532 2345689999887778888999999999999999999 Q ss_pred EeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccc Q lcl|Aclame:pro 75 AIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTS 154 (311) Q Consensus 75 l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 154 (311) +..++.+....++++...++ ..|..+.+.+++++++++++|+.++.--. + .+.. .... T Consensus 81 ~~i~~~~~~~~i~D~~~~~~---~~dp~~~~~~~~a~a~a~~vd~~~~~~l~---~------------a~~~----~~~~ 138 (274) T protein:vir:94 81 AKIRKIAKGTSITDEALLSG---YGDPQGEQVRQHGLAHANKVDNDVLEALM---G------------AKLT----VNAD 138 (274) T ss_pred EEeeeecceecccHHHHHhc---cchHHHHHHHHHHHHHHHHHHHHHHHHHh---c------------cCcc----cccc Confidence 99999988899999976443 34577889999999999999999885311 0 0000 1112 Q ss_pred cchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhh------ccC-CceeeccccccCCCceecceeEEeeccccccc Q lcl|Aclame:pro 155 ATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQR------DSQ-GRKLYPELGFGTDVASFAGLNAAVSDTVRGGP 227 (311) Q Consensus 155 ~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lk------d~~-g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~ 227 (311) ...++.+.++..++...+.....++|||..+..|++.. ++. |.. ....+..++++|++|++++.+|.+ T Consensus 139 ~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~----~~~~G~ig~~~G~~Vi~s~~~p~~- 213 (274) T protein:vir:94 139 ITKLNGLQSAIDKFNDEDLEPMVLFVNPLDAGKLRGDASTNFTRATELGDD----IIVKGAFGEALGAIIVRTNKLEAG- 213 (274) T ss_pred ccCHHHHHHHHHHhhccCCCceEEEeCHHHHHHHHhhhhhhccccCccccc----ceeccccceecCeeEEEcCCCCcc- Confidence 34578888999998887777888999999999997631 111 111 123445688999999999999843 Q ss_pred ccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEe Q lcl|Aclame:pro 228 EAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRD 307 (311) Q Consensus 228 ~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~ 307 (311) ..++.....+.+...+++.++..|... . ..-.+++..+|++++.+|++++++++ T Consensus 214 -----------------t~~l~~~gA~~~~~~~~~~vE~~Rd~~-------~--~~d~i~~~~~y~~~~~~~~~vv~~t~ 267 (274) T protein:vir:94 214 -----------------TAILAKKGAVKLILKRDFFLEVARDAS-------T--KTTALYSDKHYVAYLYDESKAVKITK 267 (274) T ss_pred -----------------eEEEEeCcceEeeecCCceeccccchh-------h--cccEEEEEEEEEEEEEcCCceEEEec Confidence 223334455666667777777666532 2 22366777899999999999999998 Q ss_pred cccC Q lcl|Aclame:pro 308 ADES 311 (311) Q Consensus 308 aa~~ 311 (311) +..| T Consensus 268 ~~~~ 271 (274) T protein:vir:94 268 GSGS 271 (274) T ss_pred Cccc Confidence 8888 No 114 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=99.87 E-value=2.6e-23 Score=144.45 Aligned_cols=258 Identities=14% Similarity=0.107 Sum_probs=193.3 Q ss_pred Ccc--cCCCceEcchhHHHHHHHHHHhhchhhhhcceee----cCCCceEEEEEeCCceeEEeecCccccccccceeEEE Q lcl|Aclame:pro 1 MVA--LATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEP----QEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVT 74 (311) Q Consensus 1 mat--~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~----~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~ 74 (311) |+. +.-+..++|+.|...+.+.+++.....+++..-. .++..+++|++...+.+..+.||+.++..+.+.++.+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCcccccccccceeE Confidence 884 4457899999999999999988888888876532 2345689999887778888999999999999999999 Q ss_pred EeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccc Q lcl|Aclame:pro 75 AIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTS 154 (311) Q Consensus 75 l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 154 (311) +..++.+....++++...++ ..|..+.+.+++++++++++|+.++.--. + .+.. .... T Consensus 81 ~~i~~~~~~~~i~D~~~~~~---~~dp~~~~~~~~a~a~a~~vd~~~~~~l~---~------------a~~~----~~~~ 138 (274) T protein:vir:97 81 AKIRKIAKGTSITDEALLSG---YGDPQGEQVRQHGLAHANKVDNDVLEALM---G------------AKLT----VNAD 138 (274) T ss_pred EEeeeecceecccHHHHHhc---cchHHHHHHHHHHHHHHHHHHHHHHHHHh---c------------cCcc----cccc Confidence 99999988899999976443 34577889999999999999999885311 0 0000 1112 Q ss_pred cchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhh------ccC-CceeeccccccCCCceecceeEEeeccccccc Q lcl|Aclame:pro 155 ATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQR------DSQ-GRKLYPELGFGTDVASFAGLNAAVSDTVRGGP 227 (311) Q Consensus 155 ~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lk------d~~-g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~ 227 (311) ...++.+.++..++...+.....++|||..+..|++.. ++. |.. ....+..++++|++|++++.+|.+ T Consensus 139 ~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~----~~~~G~ig~~~G~~Vi~s~~~p~~- 213 (274) T protein:vir:97 139 ITKLNGLQSAIDKFNDEDLEPMVLFVNPLDAGKLRGDASTNFTRATELGDD----IIVKGAFGEALGAIIVRTNKLEAG- 213 (274) T ss_pred ccCHHHHHHHHHHhhccCCCceEEEeCHHHHHHHHhhhhhhccccCccccc----ceeccccceecCeeEEEcCCCCcc- Confidence 34578888999998887777888999999999997631 111 111 123445688999999999999843 Q ss_pred ccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEe Q lcl|Aclame:pro 228 EAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRD 307 (311) Q Consensus 228 ~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~ 307 (311) ..++.....+.+...+++.++..|... . ..-.+++..+|++++.+|++++++++ T Consensus 214 -----------------t~~l~~~gA~~~~~~~~~~vE~~Rd~~-------~--~~d~i~~~~~y~~~~~~~~~vv~~t~ 267 (274) T protein:vir:97 214 -----------------TAILAKKGAVKLILKRDFFLEVARDAS-------T--KTTALYSDKHYVAYLYDESKAVKITK 267 (274) T ss_pred -----------------eEEEEeCcceEeeecCCceeccccchh-------h--cccEEEEEEEEEEEEEcCCceEEEec Confidence 223334455666667777777666532 2 22366777899999999999999998 Q ss_pred cccC Q lcl|Aclame:pro 308 ADES 311 (311) Q Consensus 308 aa~~ 311 (311) +..| T Consensus 268 ~~~~ 271 (274) T protein:vir:97 268 GSGS 271 (274) T ss_pred Cccc Confidence 8888 No 115 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=99.87 E-value=1.2e-23 Score=146.31 Aligned_cols=262 Identities=12% Similarity=0.068 Sum_probs=192.8 Q ss_pred Ccc-cCCCceEcchhHHHHHHHHHHhhchhhhhcceeec----CCCceEEEEEeCCceeEEeecCccccccccceeEEEE Q lcl|Aclame:pro 1 MVA-LATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQ----EFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTA 75 (311) Q Consensus 1 mat-~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~----~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l 75 (311) |++ +.-...++|+.|...+.+.+++...+.+++..-+. ++..+++|.++..+++.++.||+.++..+.+.++.++ T Consensus 3 ~~~~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~~~ 82 (275) T protein:vir:96 3 LENMTKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAKVVPEGEEIPIDLIETKKRQA 82 (275) T ss_pred CcccchhhhhhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeeccCCccccccCCCCcchhhcccceeeE Confidence 443 44567899999999999999999999998865432 3446899999887888999999999999999999999 Q ss_pred eeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeecccccc Q lcl|Aclame:pro 76 IPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSA 155 (311) Q Consensus 76 ~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (311) ..++.+..+.++++...++. .|..+.+.++++.++++++|+.++.--+. ++.. ..... T Consensus 83 ~i~~~~~~~~i~D~~~~~~~---~d~~~~~~~~~a~~~a~~~d~~ll~~l~~---------------a~~~----~~~~~ 140 (275) T protein:vir:96 83 TIRKIGKGTVLTDEALLSGY---GDPKGEAVRQHGLAIANKVDNDVLEALQG---------------ATLK----VEADI 140 (275) T ss_pred EeehhcccccccHHHHHhhc---cchHHHHHHHHHHHHHHHHHHHHHHHHhc---------------cccc----ccccc Confidence 99999999999999764442 35778899999999999999998843110 0000 11123 Q ss_pred chHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccC---CceeeccccccCCCceecceeEEeecccccccccccc Q lcl|Aclame:pro 156 TPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQ---GRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTA 232 (311) Q Consensus 156 ~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~---g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~ 232 (311) ..++.+.++..++...+.....++|||..+..|+++...+ ....-.+....+..++++|++|++++.+|.+. T Consensus 141 ~~~d~i~dA~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t----- 215 (275) T protein:vir:96 141 TKLAGLQTAIDKFNDEDLEPMVLFVNPLDAGKLRASATDNFTRATLLGDNVIVKGAFGEALGAIIVRSNKIKEGE----- 215 (275) T ss_pred cCHHHHHHHHHHhccccCCccEEEeCHHHHHHHHhcccccccccccccccceeccccceecCeeEEEeCCCCcce----- Confidence 4578888999999777777888999999999998753111 00000112234456889999999999988542 Q ss_pred cccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 233 STGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 233 ~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .++++ ...+.+...+++.+|..|+.. +..-.+++..+|+.++++|+++++++..++- T Consensus 216 ------------~~i~~-~gA~~~~~~~~~~vE~~Rd~~---------~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~ 272 (275) T protein:vir:96 216 ------------AILAK-RGAVKLITKRDFFLETERHAS---------HKSTALFSDKHYVAYLYDESKVVKITKSASG 272 (275) T ss_pred ------------EEEEe-ccceeeeecCCcccccccchh---------hcCcEEEEeEEEEEEEEcCccEEEEEecccc Confidence 13333 345556666777777766532 2335677789999999999999999887777 No 116 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=99.84 E-value=4.5e-22 Score=137.60 Aligned_cols=259 Identities=13% Similarity=0.082 Sum_probs=191.7 Q ss_pred Ccc--cCCCceEcchhHHHHHHHHHHhhchhhhhccee----ecCCCceEEEEEeCCceeEEeecCccccccccceeEEE Q lcl|Aclame:pro 1 MVA--LATGTFQLPKHLVPGVWQKAQGQSVLARLSMAE----PQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVT 74 (311) Q Consensus 1 mat--~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~----~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~ 74 (311) ||. +.-...++|+.|...+.+.+.+...+.+++..- ..++..+++|.....+++..+.||+.++..+.+.++.+ T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccchhhcccceee Confidence 884 344788999999999999998888877877652 22445689999987778888999999999999999999 Q ss_pred EeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccc Q lcl|Aclame:pro 75 AIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTS 154 (311) Q Consensus 75 l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 154 (311) +..++.+..+.++++....+ ..|..+.+.++++.++++++|+.++.--.. ++.. .... T Consensus 81 ~~i~~~~~~~~i~D~~~~~~---~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~---------------a~~~----~~~~ 138 (274) T protein:vir:12 81 AKIRKIAKGTSITDEALLSG---YGDPQGEQVRQHGLAHANKVDNDVLEALMG---------------AKLT----VNAD 138 (274) T ss_pred EEeeeecceeeecHHHHHhc---ccchHHHHHHHHHHHHHHHHHHHHHHHHhc---------------cccc----cccc Confidence 99999888999999865433 345778899999999999999998854210 0000 1122 Q ss_pred cchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhh------ccCCceeeccccccCCCceecceeEEeecccccccc Q lcl|Aclame:pro 155 ATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQR------DSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPE 228 (311) Q Consensus 155 ~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lk------d~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~ 228 (311) ...++.+.++..++...+......+|||..+..|++.. ++++. .+....+..++++|++|++++.+|... T Consensus 139 a~~~d~i~dA~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g---~~~~~~G~ig~~~G~~Vi~s~~~p~~t- 214 (274) T protein:vir:12 139 ITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELG---DDIIVKGAFGEALGAIIVRSNKLEAGT- 214 (274) T ss_pred ccCHHHHHHHHHHhccccccccEEEeCHHHHHHHHhhhhhhcccccccc---ccceecccceeecCeeEEEeCCCCcce- Confidence 34578888999988777777778999999999998731 12211 112234456789999999999998532 Q ss_pred cccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEec Q lcl|Aclame:pro 229 AVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDA 308 (311) Q Consensus 229 ~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~a 308 (311) .+++| ...+.+...+++++|..|... . ..-.++...+|++++++|+.+++++++ T Consensus 215 ----------------~~l~~-~gA~~~~~~~~~~vE~~Rd~~-------~--~~d~i~~~~~y~~~~~~~~~vv~~t~~ 268 (274) T protein:vir:12 215 ----------------AILAK-KGAVKLILKRDFFLEVARDAS-------T--KTTALYSDKHYVAYLYDESKAVKITKG 268 (274) T ss_pred ----------------EEEEe-ccceeeeecCCceeccccchh-------h--cccEEEeeeEEEEEEEcCCceEEEEcC Confidence 23333 344555566777777766532 1 233677889999999999999999988 Q ss_pred ccC Q lcl|Aclame:pro 309 DES 311 (311) Q Consensus 309 a~~ 311 (311) ..| T Consensus 269 ~~~ 271 (274) T protein:vir:12 269 SGS 271 (274) T ss_pred Ccc Confidence 888 No 117 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=99.83 E-value=1.5e-21 Score=134.76 Aligned_cols=259 Identities=13% Similarity=0.068 Sum_probs=191.1 Q ss_pred Ccc--cCCCceEcchhHHHHHHHHHHhhchhhhhcceee----cCCCceEEEEEeCCceeEEeecCccccccccceeEEE Q lcl|Aclame:pro 1 MVA--LATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEP----QEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVT 74 (311) Q Consensus 1 mat--~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~----~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~ 74 (311) |+. +.-...++|+.|...+.+.+.+...+.+++..-. .++..+++|..+..+.+..+.||+.++..+.+.++.+ T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccchhhcccceeE Confidence 884 3446889999999999999998888888875432 2345689999987778888999999999999999999 Q ss_pred EeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccc Q lcl|Aclame:pro 75 AIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTS 154 (311) Q Consensus 75 l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 154 (311) +..++.+..+.++++...++ ..|..+.+.++++.++++++|+.++.--. + +.... ... T Consensus 81 ~~i~~~~~a~~i~D~~~~~~---~~d~~~~~~~~~~~~~a~~vd~~i~~~l~--~-------------a~~~~----~~~ 138 (274) T protein:vir:96 81 AKIRKIAKGTSISDEALLSG---YGDPQGEQVRQHGLAHANKVDDDVLEALK--S-------------AKLTV----EAD 138 (274) T ss_pred EEeeeeecceeehHHHHhhc---cchHHHHHHHHHHHHHHHHHHHHHHHHHh--c-------------ccccc----ccc Confidence 99999888899999865433 34677889999999999999999884311 0 00111 112 Q ss_pred cchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhh------ccCCceeeccccccCCCceecceeEEeecccccccc Q lcl|Aclame:pro 155 ATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQR------DSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPE 228 (311) Q Consensus 155 ~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lk------d~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~ 228 (311) ...++.+.++..++...+......+|||..+..|++.. ++++. .+....+..+++.|++|++++.+|... T Consensus 139 ~~~~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g---~~~~~~G~ig~~~G~~Vi~s~~~~~~t- 214 (274) T protein:vir:96 139 ITKLTGLQTAIDKFNDEDLEPMVLFISPLDAGKLRGDATTNFTRATELG---DDVIVKGAFGEALGAVIVRSNKLEAGT- 214 (274) T ss_pred ccCHHHHHHHHHHhccccccccEEEeCHHHHHHHHhhcccccccccccc---ccceeccccceecCeEEEEeCCCCCce- Confidence 34577888899888777667778999999999998631 11110 112234456889999999999887431 Q ss_pred cccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEec Q lcl|Aclame:pro 229 AVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDA 308 (311) Q Consensus 229 ~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~a 308 (311) .+++| ...+.+...+++.+|..|... +..-.+++..+|++++++|++++++++. T Consensus 215 ----------------~~l~~-~gA~~~~~~~~~~vE~~Rd~~---------~~~d~i~~~~~y~~~~~~~~~~v~~tk~ 268 (274) T protein:vir:96 215 ----------------AILAK-KGAVKLITKRDFFLETDRDPS---------TKTTALYSDKHYVAYLYDESKAVKITKG 268 (274) T ss_pred ----------------EEEEe-ccceeeeecCCcccccccccc---------cccCEEEEeEEEEEEEEcCCcEEEEEcC Confidence 23344 344555567777777766532 2345677789999999999999999998 Q ss_pred ccC Q lcl|Aclame:pro 309 DES 311 (311) Q Consensus 309 a~~ 311 (311) .-| T Consensus 269 ~~~ 271 (274) T protein:vir:96 269 SGS 271 (274) T ss_pred Ccc Confidence 888 No 118 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=99.83 E-value=1.5e-21 Score=134.76 Aligned_cols=259 Identities=13% Similarity=0.068 Sum_probs=191.1 Q ss_pred Ccc--cCCCceEcchhHHHHHHHHHHhhchhhhhcceee----cCCCceEEEEEeCCceeEEeecCccccccccceeEEE Q lcl|Aclame:pro 1 MVA--LATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEP----QEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVT 74 (311) Q Consensus 1 mat--~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~----~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~ 74 (311) |+. +.-...++|+.|...+.+.+.+...+.+++..-. .++..+++|..+..+.+..+.||+.++..+.+.++.+ T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccchhhcccceeE Confidence 884 3446889999999999999998888888875432 2345689999987778888999999999999999999 Q ss_pred EeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccc Q lcl|Aclame:pro 75 AIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTS 154 (311) Q Consensus 75 l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 154 (311) +..++.+..+.++++...++ ..|..+.+.++++.++++++|+.++.--. + +.... ... T Consensus 81 ~~i~~~~~a~~i~D~~~~~~---~~d~~~~~~~~~~~~~a~~vd~~i~~~l~--~-------------a~~~~----~~~ 138 (274) T protein:vir:95 81 AKIRKIAKGTSISDEALLSG---YGDPQGEQVRQHGLAHANKVDDDVLEALK--S-------------AKLTV----EAD 138 (274) T ss_pred EEeeeeecceeehHHHHhhc---cchHHHHHHHHHHHHHHHHHHHHHHHHHh--c-------------ccccc----ccc Confidence 99999888899999865433 34677889999999999999999884311 0 00111 112 Q ss_pred cchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhh------ccCCceeeccccccCCCceecceeEEeecccccccc Q lcl|Aclame:pro 155 ATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQR------DSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPE 228 (311) Q Consensus 155 ~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lk------d~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~ 228 (311) ...++.+.++..++...+......+|||..+..|++.. ++++. .+....+..+++.|++|++++.+|... T Consensus 139 ~~~~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g---~~~~~~G~ig~~~G~~Vi~s~~~~~~t- 214 (274) T protein:vir:95 139 ITKLTGLQTAIDKFNDEDLEPMVLFISPLDAGKLRGDATTNFTRATELG---DDVIVKGAFGEALGAVIVRSNKLEAGT- 214 (274) T ss_pred ccCHHHHHHHHHHhccccccccEEEeCHHHHHHHHhhcccccccccccc---ccceeccccceecCeEEEEeCCCCCce- Confidence 34577888899888777667778999999999998631 11110 112234456889999999999887431 Q ss_pred cccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEec Q lcl|Aclame:pro 229 AVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDA 308 (311) Q Consensus 229 ~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~a 308 (311) .+++| ...+.+...+++.+|..|... +..-.+++..+|++++++|++++++++. T Consensus 215 ----------------~~l~~-~gA~~~~~~~~~~vE~~Rd~~---------~~~d~i~~~~~y~~~~~~~~~~v~~tk~ 268 (274) T protein:vir:95 215 ----------------AILAK-KGAVKLITKRDFFLETDRDPS---------TKTTALYSDKHYVAYLYDESKAVKITKG 268 (274) T ss_pred ----------------EEEEe-ccceeeeecCCcccccccccc---------cccCEEEEeEEEEEEEEcCCcEEEEEcC Confidence 23344 344555567777777766532 2345677789999999999999999998 Q ss_pred ccC Q lcl|Aclame:pro 309 DES 311 (311) Q Consensus 309 a~~ 311 (311) .-| T Consensus 269 ~~~ 271 (274) T protein:vir:95 269 SGS 271 (274) T ss_pred Ccc Confidence 888 No 119 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=99.81 E-value=2.4e-21 Score=133.58 Aligned_cols=262 Identities=11% Similarity=0.021 Sum_probs=192.4 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeec----CCCceEEEEEeCCceeEEeecCccccccccceeEEEEe Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQ----EFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAI 76 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~----~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~ 76 (311) ||.+.-...++|+.|.+-|.+.+.+...+.+++..-+. ++..+++|.++..+++.-+.||++++..+.+.++.+.. T Consensus 1 Ma~T~~~d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~igdae~~~eg~~i~~~~lt~~~~~a~ 80 (270) T protein:vir:95 1 MTQTKKANLINPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYAYIGAAEDLQEGVAMDTTQMSMTTTKVT 80 (270) T ss_pred CCceehhhhcchHHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeecCCCccccccCCCccchhhcccchheee Confidence 99888899999999999999999998888888865322 44568999998888888899999999999999999999 Q ss_pred eeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccc Q lcl|Aclame:pro 77 PRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSAT 156 (311) Q Consensus 77 ~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (311) .++.+.-+.++++....+ .-|....+.++++..+++++|+.++.-- .+ .... .+... T Consensus 81 i~~~gk~~~itD~a~~~~---~~dp~~~~~~q~a~~~a~~~d~~li~~l---~~---------a~~~--------~~~~~ 137 (270) T protein:vir:95 81 VKETGKAVEVTQTAIITN---VNGTLQEASRQLAMSLADKVEIDYIAEL---NK---------SKQT--------ATVSA 137 (270) T ss_pred eehhhCcceecHHHHhhh---ccchHHHHHHHHHHHHHHHHHHHHHHHh---cc---------cccc--------ccccc Confidence 999999999999966433 2356788999999999999999887320 11 0000 11224 Q ss_pred hHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccccccccccc Q lcl|Aclame:pro 157 PDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGV 236 (311) Q Consensus 157 ~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~ 236 (311) .++.+.++..++.+....+.+++|||.++..|++...-.+...-......+..+.+.|++|++++..+.. T Consensus 138 t~~~~~dA~~~lgd~~~~~~~i~vhs~~~~~Lrk~~~~~~~~~~~~~~~~G~ig~~~G~~Viv~s~~~~~---------- 207 (270) T protein:vir:95 138 DATGILDAIEVFNSENDEDYVLYVNPKDYNKLVKSLFKVGGNVQDRAISKGDLVEIVGVSDIVKSKRVSE---------- 207 (270) T ss_pred CHHHHHHHHHHhccccCCCcEEEEcHHHHHHHHhhhcccccccccchhcccccceecceeEEEeCCCCCc---------- Confidence 4677888889998888888899999999999987432111111112223445788999999887765522 Q ss_pred cccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 237 YRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 237 ~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ...++.....+.+...+++.+|..|+.. . ..-.+.+..+|++++.+|..+++++.+.+. T Consensus 208 -------~~~~l~~~gAi~~~~~~~~~vEtdRd~~-------~--~~d~i~~~~~y~v~~~~~skvv~~t~~~a~ 266 (270) T protein:vir:95 208 -------NTAFLQRYGAMEIVNKKKPEAYTDFDIL-------K--RTHLLSTNYHYSVNLKDETGVVKVTFKPSG 266 (270) T ss_pred -------eeEEEEeccceeeeecCCceeeeccchh-------h--cccEEEeeeEEEEEEEccceEEEEEecCCC Confidence 1222333455667777788888777542 2 233667779999999999999999955444 No 120 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=99.77 E-value=1.3e-19 Score=124.21 Aligned_cols=291 Identities=15% Similarity=0.122 Sum_probs=203.6 Q ss_pred Cc--ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeec-----CccccccccceeEE Q lcl|Aclame:pro 1 MV--ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGE-----GAQKSESTATFAPV 73 (311) Q Consensus 1 ma--t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E-----g~~~~~~~~~~~~v 73 (311) |. |....+.+.+..+...|||.+.+.|.++++.+..++.++.+.+.+...-+.+.+.+. ++..+++..+|++. T Consensus 1 mpaltLaea~k~~~d~l~~~ViE~~~~~s~lL~~LpF~~veg~~~~ynR~~~~~~~~~~~v~~~~~~~g~~~~~~t~~~~ 80 (310) T protein:vir:97 1 MASVTLAESAKLAQDELVAGVIENIITVNRMFDVLPFDSIEGNSLAYNRENVLGDVIMAGVGTTFSGAGAGKAAATFTKV 80 (310) T ss_pred CcccchHHHhhcCcchHHHHHHHHHhccchHHHhCCcccccCCcceeeEeeccCCcccccccccccCCCcccccccccee Confidence 76 444556788999999999999999999999999888888899998876655554433 35556788999999 Q ss_pred EEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeecccc Q lcl|Aclame:pro 74 TAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGT 153 (311) Q Consensus 74 ~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~ 153 (311) +...+-+++.+.|.+.+..-......+...+-.++..+++.++.+..+|||+.. .++..|+... .+.++.+...+.. T Consensus 81 ~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a--~n~F~GL~~~-~~~~q~i~~~~~g 157 (310) T protein:vir:97 81 NSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGA--GNEFAGLIQL-CASGQKATTGATG 157 (310) T ss_pred eeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhccccC--CCcccchhhc-CCccceeecCCCC Confidence 999999999999987654211112334555556777899999999999999653 3445565443 3444555544444 Q ss_pred ccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHh-hccCCceeec--cccccCCCceecceeEEeecccccccccc Q lcl|Aclame:pro 154 SATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQ-RDSQGRKLYP--ELGFGTDVASFAGLNAAVSDTVRGGPEAV 230 (311) Q Consensus 154 ~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~l-kd~~g~~~~~--~~~~~~~~~~l~G~pv~~~~~~~~~~~~~ 230 (311) ...+.++++.++..+...+..+..++|||++..+|+.+ +..+++.+++ ....+...-++.|+|+...+.+|.+.... T Consensus 158 g~~t~d~LDeLl~~v~~~~g~p~~~l~~~~~~r~i~A~~R~~~~~g~~~~~~~~~G~~v~~~~GiPi~~~d~ip~~~~~~ 237 (310) T protein:vir:97 158 SAISFAILDELMDLVVDKDGQVDYLTMHARTLRSYKALLRALGGASINEVVELPSGAEVPAYSGTPIFRNDYIPTNQTKG 237 (310) T ss_pred CCCCHHHHHHHHHHHhcCCCCCCEEEecHHHHHHHHHHHHHhcCCCCCCccccCCCCEEeeeCCeEEEEeCccCCCcccc Confidence 55667889999988877777889999999988777653 4455555554 33445555789999999999999764322 Q ss_pred cccccccccccccceEEEeecce-----EEEEe----ecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccc Q lcl|Aclame:pro 231 TASTGVYRTTNPNVKAIAGDFSA-----FRWGV----QVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDA 301 (311) Q Consensus 231 ~~~~~~~~~~~~~~~~~~gd~~~-----~~~~~----~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a 301 (311) ...+...|++..|.. -.++. ..++.+.... ..-..+...+|.+.+|+.++.+|+| T Consensus 238 --------~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~~~~glsVr~~G--------~~~~~~v~~~~V~~Y~~~av~~~~A 301 (310) T protein:vir:97 238 --------GTTGCTTIFAGTLDDGSRTHGIAGLTATQAAGIQVVDVG--------ESEDSDEHIWRVKWYCGLALFSEKG 301 (310) T ss_pred --------ccCCceeEEEEeeCccccccceeccccCCccceeEEeCC--------cccCCcceeEEEEEeeeEEEecccc Confidence 122333455444431 11111 1223333221 1123566788999999999999999 Q ss_pred eEEEEeccc Q lcl|Aclame:pro 302 FAVVRDADE 310 (311) Q Consensus 302 ~~~l~~aa~ 310 (311) +++|.+-.- T Consensus 302 ~a~L~~V~~ 310 (310) T protein:vir:97 302 LACADGITN 310 (310) T ss_pred eeeeccccC Confidence 999997777 No 121 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=99.61 E-value=4.2e-17 Score=110.35 Aligned_cols=229 Identities=11% Similarity=0.045 Sum_probs=167.3 Q ss_pred cceeecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHH Q lcl|Aclame:pro 33 SMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVA 112 (311) Q Consensus 33 ~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ 112 (311) -+-++++ ..+++|.+ .+.+.-++||+.++..+.++++.+...++.+..++|++|..... .-|...+..++++.+ T Consensus 1 ~~~~~~G-dtit~P~~--iGda~~v~eG~~i~~~~l~~t~~~atIk~~gk~~~itD~a~l~~---~gDp~~ea~~Q~~~~ 74 (231) T protein:vir:73 1 ENGINLA-NLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSG---YGDPIGESNKQLGLS 74 (231) T ss_pred CccccCC-ceEEeccc--ccchhhhcCCCcCChhhccccceeeeEeeeccceeeeHHHHhhc---cCchHHHHHHHHHHH Confidence 2233333 45888875 56788999999999999999999999999999999999976543 335678899999999 Q ss_pred HHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhh Q lcl|Aclame:pro 113 LGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQR 192 (311) Q Consensus 113 ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lk 192 (311) |++++|..++.-- .+ ++. + .+....++.+.++...+...+..+.+.+|||..+..||+.. T Consensus 75 iA~kvD~di~~~~---~~------------a~l--~---~~~~~t~d~i~~A~~~fgde~~~~~vivv~p~~~~~Lrk~~ 134 (231) T protein:vir:73 75 LANKVDDDLLKAA---KT------------TSQ--T---VSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDA 134 (231) T ss_pred HHHhhhHHHHHhh---cc------------ccc--c---ccccccHHHHHHHHHHhccccccceEEEEcchHHHhhhhcc Confidence 9999999988421 00 000 0 11235678899999999888777888999999999999855 Q ss_pred ccCCc--eeeccccccCCCceecceeEEeecccccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccC Q lcl|Aclame:pro 193 DSQGR--KLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFG 270 (311) Q Consensus 193 d~~g~--~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~ 270 (311) +.+.. ..-.+....+..+.+.|++|++|+.+|.+..... . ++.-...+.+...+++.++..|+. T Consensus 135 ~~~~~~~~~g~~i~~~G~iG~i~G~~Vi~S~~~~~~~~~~~-------------~-~i~~~gAl~~~~k~~~~vEtdRd~ 200 (231) T protein:vir:73 135 NAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMF-------------K-IVSNSPALKLVLKRGVQVETDRDI 200 (231) T ss_pred chhhhhhhhccceeeecccceEcceEEEEcCCCCCCceeee-------------e-EEeeccceeeeecccceeeccccc Confidence 43221 1112334456678999999999999986543321 1 122234566677888888877764 Q ss_pred CcccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccc Q lcl|Aclame:pro 271 DPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADE 310 (311) Q Consensus 271 ~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~ 310 (311) . +....+.+.+.|+.++.+|..+++++.+-- T Consensus 201 ~---------~k~~~i~~~~~y~v~l~~~~~vv~~t~~g~ 231 (231) T protein:vir:73 201 V---------TKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) T ss_pred c---------ccccEEEEeEEEEEEEEcCccEEEEEeecC Confidence 3 234567788999999999999999987666 No 122 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=99.60 E-value=1.9e-16 Score=106.82 Aligned_cols=263 Identities=13% Similarity=0.054 Sum_probs=164.5 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcce----eecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEe Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMA----EPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAI 76 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~----~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~ 76 (311) ||.. .++|+.|+.++++.+++.+++.++++. +...+.++++|+......+.++.++..++..+.+...+++. T Consensus 1 MA~~----~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~t 76 (273) T protein:vir:79 1 MAFN----NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLL 76 (273) T ss_pred Ccch----hhhHHHHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecCcccccccccCCCccCccccccceEEEE Confidence 7763 468999999999999999998888743 22234569999987666677888998888878787777777 Q ss_pred eeeE-EEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeecccccc Q lcl|Aclame:pro 77 PRKV-QVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSA 155 (311) Q Consensus 77 ~~kl-~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (311) ..+. ..-+.|++.-..++ ..++.+ +.+++++++++++|+.++.=- .+.. . ... .....+.. T Consensus 77 id~~~~~~~~i~d~d~~~~---~~~~~~-~~~~~~~ala~~vD~~i~~~~---~~a~-----~-----~~~-~~~~~~~~ 138 (273) T protein:vir:79 77 IDQEKSIDFLVDDIDRVQV---AGSLEA-YTRAGATALATDTDKFIADML---VDNG-----T-----ALT-GSAPSDAD 138 (273) T ss_pred EeeecccceeeccHHHHhh---cccHHH-HHHHHHHHHHHHHHHHHHHHH---hhcc-----c-----ccc-cccccchh Confidence 7553 44566766322222 235665 566788999999998765210 0000 0 000 00111222 Q ss_pred chHHHHHHHHHHHhhcCCC--ccEEEEcHHHHHHHHHhhcc-CCceeec--cccccCCCceecceeEEeecccccccccc Q lcl|Aclame:pro 156 TPDLAVEAAVGLVLGDNLS--PDGVALDNTFSFMLATQRDS-QGRKLYP--ELGFGTDVASFAGLNAAVSDTVRGGPEAV 230 (311) Q Consensus 156 ~~~~~i~~~~~~~~~~~~~--~~~~v~n~~~~~~l~~lkd~-~g~~~~~--~~~~~~~~~~l~G~pv~~~~~~~~~~~~~ 230 (311) ..++.+..+...+...+.. .-.++++|..+..|.+..+. ....... .....+..+++.|++|+.++.+|..... T Consensus 139 ~~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~i~~s~~lp~~~~~- 217 (273) T protein:vir:79 139 DAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDE- 217 (273) T ss_pred hHHHHHHHHHHHhhhccCCccCcEEEECHHHHHHHhhchhhhhhhhhcccccceeeeEeeEEeceEEEecccccccCce- Confidence 4567788888888777663 23588999999988764321 1111111 1223455789999999999999854221 Q ss_pred cccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccc Q lcl|Aclame:pro 231 TASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADE 310 (311) Q Consensus 231 ~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~ 310 (311) ..+.+--+.+... .+...++..+.. ..| ...+++.+.+|++++||+++++|+...+ T Consensus 218 --------------~~~a~~~~A~~~a-~~~~~~e~~r~~------~~~---~~~v~~~~~yg~~v~~p~~vv~~~~~g~ 273 (273) T protein:vir:79 218 --------------QFVAFHPSAAAYV-SQIDTVEALRDQ------DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred --------------EEEEEeccceeee-eehhhhhcccCc------ccc---eeeeeeeeeeeeEEecCceEEEEeccCC Confidence 1222222222222 122233333321 113 3457788999999999999999987777 No 123 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=99.60 E-value=2.8e-16 Score=105.83 Aligned_cols=263 Identities=14% Similarity=0.054 Sum_probs=162.2 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhccee----ecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEe Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAE----PQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAI 76 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~----~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~ 76 (311) ||. ..++|+.|+.++++.+++.+++..+++.- ...+.++++|+......+.+..++..++..+.+.+.+++. T Consensus 1 MA~----~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~t 76 (273) T protein:vir:10 1 MAF----NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLL 76 (273) T ss_pred Ccc----hhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccCccccccceEEEE Confidence 776 35689999999999999999988887431 2233468999987666677788887777767666666666 Q ss_pred eeeE-EEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeecccccc Q lcl|Aclame:pro 77 PRKV-QVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSA 155 (311) Q Consensus 77 ~~kl-~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (311) ..+. ..-+.|++.-..+. ..++++ +.+++.+++++++|..++.=- .+... .. ......+.. T Consensus 77 id~~~~~~~~i~d~d~~~~---~~~~~~-~~~~~~~alA~~vD~~i~~~~---~~a~~----------~~-~~~~~~~~~ 138 (273) T protein:vir:10 77 IDQEKSIDFLVDDIDRVQV---AGSLEA-YTRAGATALATDTDKFIADML---VDNGT----------AL-TGSAPTDAD 138 (273) T ss_pred EeeeeecceEeecHHHhhh---hccHHH-HHHHHHHHHHHHHHHHHHHHH---hcccc----------cc-ccccccchh Confidence 5443 34456665322122 235655 567788999999998876321 00000 00 001111223 Q ss_pred chHHHHHHHHHHHhhcCCCc--cEEEEcHHHHHHHHHhhccCC-ceeec--cccccCCCceecceeEEeecccccccccc Q lcl|Aclame:pro 156 TPDLAVEAAVGLVLGDNLSP--DGVALDNTFSFMLATQRDSQG-RKLYP--ELGFGTDVASFAGLNAAVSDTVRGGPEAV 230 (311) Q Consensus 156 ~~~~~i~~~~~~~~~~~~~~--~~~v~n~~~~~~l~~lkd~~g-~~~~~--~~~~~~~~~~l~G~pv~~~~~~~~~~~~~ 230 (311) ..++.+..+...+...+... -.++++|..+..|.+..+--. ..... .....+..+++.|++|+.++.+|.+.. T Consensus 139 ~~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~-- 216 (273) T protein:vir:10 139 DAFDLIAKALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD-- 216 (273) T ss_pred HHHHHHHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEecccccCCc-- Confidence 45778888888887777642 358899999998876432111 11111 112245568999999999999985421 Q ss_pred cccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccc Q lcl|Aclame:pro 231 TASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADE 310 (311) Q Consensus 231 ~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~ 310 (311) ..++.+--+.+.... +...++..+.. ..| ...+++.+.+|++++||+++++|+...+ T Consensus 217 -------------~~~~~~~~~A~~~a~-q~~~~e~~r~~------~~~---~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 217 -------------EQFVAFHPSAAAYVS-QIDTVEALRDQ------DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred -------------cEEEEEeccceeeee-eeehhhcccCC------Ccc---eeeeeeeeeeeeeEeccceEEEEeccCC Confidence 122333333332221 11233333221 123 2357788999999999999999987777 No 124 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=99.60 E-value=2.8e-16 Score=105.83 Aligned_cols=263 Identities=14% Similarity=0.054 Sum_probs=162.2 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhccee----ecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEe Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAE----PQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAI 76 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~----~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~ 76 (311) ||. ..++|+.|+.++++.+++.+++..+++.- ...+.++++|+......+.+..++..++..+.+.+.+++. T Consensus 1 MA~----~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~t 76 (273) T protein:vir:10 1 MAF----NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLL 76 (273) T ss_pred Ccc----hhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccCccccccceEEEE Confidence 776 35689999999999999999988887431 2233468999987666677788887777767666666666 Q ss_pred eeeE-EEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeecccccc Q lcl|Aclame:pro 77 PRKV-QVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSA 155 (311) Q Consensus 77 ~~kl-~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (311) ..+. ..-+.|++.-..+. ..++++ +.+++.+++++++|..++.=- .+... .. ......+.. T Consensus 77 id~~~~~~~~i~d~d~~~~---~~~~~~-~~~~~~~alA~~vD~~i~~~~---~~a~~----------~~-~~~~~~~~~ 138 (273) T protein:vir:10 77 IDQEKSIDFLVDDIDRVQV---AGSLEA-YTRAGATALATDTDKFIADML---VDNGT----------AL-TGSAPTDAD 138 (273) T ss_pred EeeeeecceEeecHHHhhh---hccHHH-HHHHHHHHHHHHHHHHHHHHH---hcccc----------cc-ccccccchh Confidence 5443 34456665322122 235655 567788999999998876321 00000 00 001111223 Q ss_pred chHHHHHHHHHHHhhcCCCc--cEEEEcHHHHHHHHHhhccCC-ceeec--cccccCCCceecceeEEeecccccccccc Q lcl|Aclame:pro 156 TPDLAVEAAVGLVLGDNLSP--DGVALDNTFSFMLATQRDSQG-RKLYP--ELGFGTDVASFAGLNAAVSDTVRGGPEAV 230 (311) Q Consensus 156 ~~~~~i~~~~~~~~~~~~~~--~~~v~n~~~~~~l~~lkd~~g-~~~~~--~~~~~~~~~~l~G~pv~~~~~~~~~~~~~ 230 (311) ..++.+..+...+...+... -.++++|..+..|.+..+--. ..... .....+..+++.|++|+.++.+|.+.. T Consensus 139 ~~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~-- 216 (273) T protein:vir:10 139 DAFDLIAKALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD-- 216 (273) T ss_pred HHHHHHHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEecccccCCc-- Confidence 45778888888887777642 358899999998876432111 11111 112245568999999999999985421 Q ss_pred cccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccc Q lcl|Aclame:pro 231 TASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADE 310 (311) Q Consensus 231 ~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~ 310 (311) ..++.+--+.+.... +...++..+.. ..| ...+++.+.+|++++||+++++|+...+ T Consensus 217 -------------~~~~~~~~~A~~~a~-q~~~~e~~r~~------~~~---~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 217 -------------EQFVAFHPSAAAYVS-QIDTVEALRDQ------DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred -------------cEEEEEeccceeeee-eeehhhcccCC------Ccc---eeeeeeeeeeeeeEeccceEEEEeccCC Confidence 122333333332221 11233333221 123 2357788999999999999999987777 No 125 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=99.55 E-value=3.6e-16 Score=105.27 Aligned_cols=282 Identities=12% Similarity=0.036 Sum_probs=167.2 Q ss_pred CcccCCCc-eEc------chhHHHHHHHHHHhhchhhhhcceeec-CCCceEEEEEeC---CceeEEeecCccccccccc Q lcl|Aclame:pro 1 MVALATGT-FQL------PKHLVPGVWQKAQGQSVLARLSMAEPQ-EFGEQQYMTLTA---PPRGEVVGEGAQKSESTAT 69 (311) Q Consensus 1 mat~~~g~-~~v------P~~~~~~ii~~~~~~s~l~~l~~~~~~-~~~~~~~p~~~~---~~~a~~v~Eg~~~~~~~~~ 69 (311) ...+..|+ +++ |+-+-..|.+.+++.-+.-.+.+.... .++.+.+-+... ..++.-|+|++++|.+.+. T Consensus 7 i~s~~~~~~itv~~ll~~P~~I~~~i~e~~~~~~iad~lf~~~~a~~~~~v~f~~~~p~~~~~d~e~VaEggEiP~~~~~ 86 (318) T protein:vir:10 7 IVSVSDGPAITVRELVGNPLWIPTALKKMMVNQFISESLFRNGGANPNGVVAYNEGNPSFLEDDVADVAEFGEIPVSAGA 86 (318) T ss_pred ceeeecCCceehHHhhCCchhHHHHHHHHHhccchhhhhhhcccccccceeEEEecccccccCcHhhccCcccccccCCC Confidence 22222233 222 666667777777777766666665544 344455544332 3567789999999999999 Q ss_pred eeEEEE-eeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCcccccccccccccccccee Q lcl|Aclame:pro 70 FAPVTA-IPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVE 148 (311) Q Consensus 70 ~~~v~l-~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~ 148 (311) ++...+ ..+|.+.-++||+|++. ....+..+....+++++|+++.|+.++.--.+... +.........+..+... T Consensus 87 ~G~~~ia~~~K~G~~~~vS~Em~~---~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t-~~~~~s~~w~~~~~~~~ 162 (318) T protein:vir:10 87 RGLPRTAFAVKKALGVRVSKEMID---ENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIV-PTLAVPTAWDNGGKVRT 162 (318) T ss_pred CCchhhhhhehhccceeccHHHHh---hcChhHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-ccccCCcCCCCcccccc Confidence 877666 55899999999999884 45678888999999999999999998754211110 00000001111110000 Q ss_pred eccccccchHHHHHH---------HHHHHhhcCCCccEEEEcHHHHHHHHHhh------ccCCceeecccc-ccCCCcee Q lcl|Aclame:pro 149 LTTGTSATPDLAVEA---------AVGLVLGDNLSPDGVALDNTFSFMLATQR------DSQGRKLYPELG-FGTDVASF 212 (311) Q Consensus 149 ~~~~~~~~~~~~i~~---------~~~~~~~~~~~~~~~v~n~~~~~~l~~lk------d~~g~~~~~~~~-~~~~~~~l 212 (311) +.....+.+.. ....-.+.+|.++.++|||.+|..|++-+ ..++.+++.... ++.-++++ T Consensus 163 ----d~~~A~e~v~~a~~~~~~a~~~~~~~~~GY~pdtIVlhP~~~~~l~~n~~~~~~y~~~a~~~~~~~~~tg~~~g~~ 238 (318) T protein:vir:10 163 ----DIAIAIEQISTAAPTAYPAGVGSSDEYFGFIPDTIVMHYALLPILMDNENFMKVYERNANYVSTAPDWTGNFPGSV 238 (318) T ss_pred ----cchhhhhhhhhhhhhhhhhhhhhhhhccCccceeeEECHHHHHHHhcchhhhhhhhccchhhhhccccccccccee Confidence 00000111110 11111367899999999999999995433 334555543222 34447889 Q ss_pred cceeEEeecccccccccccccccccccccccceEEEeecceEEEEeecCceEEEecc--CCcccchhhhhcCcEEEEEEE Q lcl|Aclame:pro 213 AGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEF--GDPDGLGDLKRQNQIAIRAEV 290 (311) Q Consensus 213 ~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~--~~~~~~~~~f~~~~v~~ra~~ 290 (311) +|+.|+.+..+|.+..+.+... .+|++ .+.+.++....+. .++++. .+....+|+.. T Consensus 239 lGl~vi~s~~~p~~~alvlq~g------------~vG~~-----~d~~pl~~t~~~~egg~~~g~----~~~s~~~~~~~ 297 (318) T protein:vir:10 239 MGLNVIRSRTFPIDRVLIMERG------------TVGFY-----SDTRPLQFTALYPEGNGPNGG----PTESYRADASH 297 (318) T ss_pred eceEEeecCccCCCeeEEEecC------------Cccee-----eccccceeeecccCCCCCCCC----cchhhheehhe Confidence 9999999999998765544321 12221 1333333333221 112111 11223567777 Q ss_pred EeccEEecccceEEEEecccC Q lcl|Aclame:pro 291 VYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 291 r~~~~v~~~~a~~~l~~aa~~ 311 (311) +-...|.+|+|+++||.=-+- T Consensus 298 ~~~~~V~~PkA~~~itgi~~~ 318 (318) T protein:vir:10 298 KRALAVDQPKAALWLTGIVTP 318 (318) T ss_pred eeeeeeeCcceeEEEeeccCC Confidence 788999999999999966655 No 126 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=99.49 E-value=3e-15 Score=100.18 Aligned_cols=300 Identities=8% Similarity=-0.007 Sum_probs=164.9 Q ss_pred CcccC--CC--------ceEcchhHHHHHHHHHHhhchhhhhcceee---cCCCceEEEEEeCCceeEEeecCccccccc Q lcl|Aclame:pro 1 MVALA--TG--------TFQLPKHLVPGVWQKAQGQSVLARLSMAEP---QEFGEQQYMTLTAPPRGEVVGEGAQKSEST 67 (311) Q Consensus 1 mat~~--~g--------~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~---~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~ 67 (311) |+-.- +| ..+||+.|+.+|++.+++..++.++++..+ ..+.++++|+.. .+.+.-..++..++..+ T Consensus 1 ~~~~~~~~~~~~~t~~v~~fipei~s~~i~~~l~~~~v~~~~~~d~~~~~~~Gdtv~ip~~g-~~~~~d~~~~~~i~~~~ 79 (341) T protein:vir:94 1 MALGNTITGPSINTQRGQQFIPEQWLSEVQMFRKAKMLDTSVVKTWGAQVKKGDTFHVPRIS-ELGVEDKATDVPVGVQP 79 (341) T ss_pred CcchhhhccccccchhHHHHHHHHHHHHHHHHHHhhcchhhccccccccccCCceEEEeccC-cceeeeecCCCcccccc Confidence 55221 22 237899999999999999998888876432 224468999865 55677777888888777 Q ss_pred cceeEEEEeeee-EEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccc Q lcl|Aclame:pro 68 ATFAPVTAIPRK-VQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNI 146 (311) Q Consensus 68 ~~~~~v~l~~~k-l~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~ 146 (311) .+-.++++...+ ...-+.|+++-.. .+..|+...+.++.++++++++|+.++.--....+.. ......... T Consensus 80 ~~~~~~~itiD~~~~~~~~i~d~d~~---~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~-----~~~~~~~~~ 151 (341) T protein:vir:94 80 VNDTDFVITVDTDRTTAVALDDLLEI---QASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTA-----SQNVFSSSN 151 (341) T ss_pred ccCceEEEEEeeeeecceeechHHHH---hhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccc-----cCccccCcc Confidence 776666666633 3455777775332 2345788899999999999999998774311111111 000001111 Q ss_pred eeeccccccchHHHHHHHHHHHhhcCCCcc--EEEEcHHHHHHHHHhhccCCc-eeeccccccCCCceecceeEEeeccc Q lcl|Aclame:pro 147 VELTTGTSATPDLAVEAAVGLVLGDNLSPD--GVALDNTFSFMLATQRDSQGR-KLYPELGFGTDVASFAGLNAAVSDTV 223 (311) Q Consensus 147 ~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~v~n~~~~~~l~~lkd~~g~-~~~~~~~~~~~~~~l~G~pv~~~~~~ 223 (311) ...........++.+..+...+...+.... .++++|..+..|.+...-..+ +.-......+..+++.|++|+.++.+ T Consensus 152 ~~~t~~~~~~~~~~i~~a~~~Lde~~VP~~gR~lvv~P~~~~~Ll~~~~~~~~~~~g~~~l~~G~ig~i~G~~V~~Sn~l 231 (341) T protein:vir:94 152 GAITGNGQAFSFAVFLAARRLLLEADVPEEKIVLLISPGQESALFTIPQFISKDFINNAPIAQGQIGSLMGVRVIRTSLI 231 (341) T ss_pred ccccCchhhhhHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhhchhhhhhhccccchhheeeeeeEeceEEEEeccc Confidence 111222233456778888888877665432 477899999999763211111 11112233455689999999999999 Q ss_pred ccccccccccccccc-------------------cccccceEEEeecceEEEEeecCceEEEeccC--Ccccchhhh--h Q lcl|Aclame:pro 224 RGGPEAVTASTGVYR-------------------TTNPNVKAIAGDFSAFRWGVQVSIPLELIEFG--DPDGLGDLK--R 280 (311) Q Consensus 224 ~~~~~~~~~~~~~~~-------------------~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~--~~~~~~~~f--~ 280 (311) |.............. ........+++..+.... .+-+..+..... ..-.+...| . T Consensus 232 p~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~--~k~~~~~~~~~~~~~~~~~~~~~~~~ 309 (341) T protein:vir:94 232 GNNSATGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHT--AVMCHMDWAAAVVSKAPRVTQSFENR 309 (341) T ss_pred cccccccccccccceecccccccccccccccccccccccEEEEEEecccccc--eeeecchhhhccccccccccccchhh Confidence 976433211100000 000011111111111100 000000000000 000000001 1 Q ss_pred cCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 281 QNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 281 ~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) +-.-.+++..=||.+++||++.+.|+..+.. T Consensus 310 ~~~~~i~~~~~~G~~~lrp~~~v~~~~~~~~ 340 (341) T protein:vir:94 310 EQVWLMVGRQAYGARLYRPLHAVNIHTTGDT 340 (341) T ss_pred hhhhhhhhhhhhcccccCcceeEEEecCcCC Confidence 1122345566789999999999888888777 No 127 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=99.47 E-value=4.4e-15 Score=99.30 Aligned_cols=296 Identities=13% Similarity=0.052 Sum_probs=168.8 Q ss_pred CcccCCCc-----------------eEcchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEeecCcc Q lcl|Aclame:pro 1 MVALATGT-----------------FQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVGEGAQ 62 (311) Q Consensus 1 mat~~~g~-----------------~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~Eg~~ 62 (311) ||-..+|+ ..| +.|+.+|.+.++..+.++.+.++..+.+++ +.+|+. +..++..+..|.+ T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d~~al~i-e~~~geV~~~f~~~s~~~~~~~~rti~~G~sv~~~~i-G~~~~~~~~~G~~ 78 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGDKLALFL-KVFGGEVLTAFTRTSVTMNKHLVRSIQSGKSAQFPVL-GRTKAAYLQPGEN 78 (347) T ss_pred CCccccccccccccccCCcccchHHHHH-HHHhHHHHHHHHHHHhhhhhhhheeccccceEEeeec-cceeEeeeecCcC Confidence 66443333 133 778999999999999999999887766554 788874 4566777788877 Q ss_pred ccc--cccceeEEEEeeeeE-EEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhh----ccc--CCCcccc Q lcl|Aclame:pro 63 KSE--STATFAPVTAIPRKV-QVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIH----GIN--PLTGAAL 133 (311) Q Consensus 63 ~~~--~~~~~~~v~l~~~kl-~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~----G~~--~~~g~~~ 133 (311) +.. .++...+.++..-++ .....|-+- +..-+..|+.+.+.++.+++++++.|+.++. +.+ +.....+ T Consensus 79 l~~~~~~~~~~e~~ltID~~~y~~~~Vddi---D~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~~~~~ 155 (347) T protein:vir:94 79 LDDKRKDMKHTEKTINIDGLLTADVLIYDI---EDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTANNENI 155 (347) T ss_pred CCCCcCCccccceEEEEcchhhhhhhhhhH---HHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Confidence 754 356666655544332 111222111 1122334788899999999999999998863 211 1111111 Q ss_pred cccccc----ccccccceeeccccccchHHHHHHHHHHHhhcCCCcc-E-EEEcHHHHHHHHHhhc-cCCceeecccccc Q lcl|Aclame:pro 134 SGSPAK----ILDTTNIVELTTGTSATPDLAVEAAVGLVLGDNLSPD-G-VALDNTFSFMLATQRD-SQGRKLYPELGFG 206 (311) Q Consensus 134 ~~~~~~----~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~-~-~v~n~~~~~~l~~lkd-~~g~~~~~~~~~~ 206 (311) .+.+.+ +.................++.+.++...+...+.... . ++..|+.+..|.+..+ ..+.+-....... T Consensus 156 ~g~~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~R~~vv~P~~y~~LLk~~~~~~~~~~~~~~~~~ 235 (347) T protein:vir:94 156 AGLGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSDRVFYTTPDNYSAILAALMPNAANYQALIDPST 235 (347) T ss_pred ccCCcceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCCCCCEEEeChHHHHHHHHhhccccccccccccccc Confidence 111110 1111111111111223446778888888877766432 3 5557999988876433 3333332233345 Q ss_pred CCCceecceeEEeecccccccccccccc-----------------cccccccccceEEEeecceEEEEeecCceEEEecc Q lcl|Aclame:pro 207 TDVASFAGLNAAVSDTVRGGPEAVTAST-----------------GVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEF 269 (311) Q Consensus 207 ~~~~~l~G~pv~~~~~~~~~~~~~~~~~-----------------~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~ 269 (311) +..+++.|++|+.++.+|.......... ..+.....+...++...+.+......++++++.++ T Consensus 236 G~V~~v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~~~~~e~~~~ 315 (347) T protein:vir:94 236 GSIRNVMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLKDMALERARR 315 (347) T ss_pred ceeEEeeceEEEEcCccccccCcccccccccccccccccccccccccccccccceEEEEechhhhhhhhhcccceeeeec Confidence 5678999999999999996432111111 01111222223344444444445556666666543 Q ss_pred CCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccc Q lcl|Aclame:pro 270 GDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADE 310 (311) Q Consensus 270 ~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~ 310 (311) .. ++.+ .+.+..-+|.+++||++.+.++.+.+ T Consensus 316 ~~-------~~~~--~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 316 AN-------FQAD--QIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred hh-------hhhh--hhhhhhhhcCcccccceeEEEEecCC Confidence 21 2222 45666789999999999876664444 No 128 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=99.45 E-value=3.1e-14 Score=94.68 Aligned_cols=287 Identities=11% Similarity=0.036 Sum_probs=168.9 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEe-ecCcccc-ccccceeEEEEe-e Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVV-GEGAQKS-ESTATFAPVTAI-P 77 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v-~Eg~~~~-~~~~~~~~v~l~-~ 77 (311) |-+..-|+.+++++...++++.+++.++++++++.+++.+.+..++++.-+..-.-. .|+...+ ..+.+...+.+. . T Consensus 23 it~~~l~~g~L~p~~a~~Fl~~v~~~t~iL~~~r~~~~~s~~~ei~kig~G~r~~r~~~e~~~~~~~~~~~~~~v~~~~~ 102 (360) T protein:vir:99 23 IGLAELDGFQLPVDVTEEFLERMQKGVQILGMADTMTLARLEMEVPQFGVPRLSGHTRDEEGSRTENSEAESGSVKFNAT 102 (360) T ss_pred ccccccCceeecHHHHHHHHHHHhhccchhhhcceeecccccccccccccceeeccccccCCCCCcCCcCccccCccccc Confidence 332333456677778899999999999999999999999988888887654433221 2332222 244444455552 3 Q ss_pred eeEEEEEeecHHHhhcCc-hhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCC-------cccccccccccccccc-cee Q lcl|Aclame:pro 78 RKVQVTQRFSQEVKWADE-SRQLGVLQTMADLSGVALGRALDLIGIHGINPLT-------GAALSGSPAKILDTTN-IVE 148 (311) Q Consensus 78 ~kl~~~i~iS~ell~~s~-~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~-------g~~~~~~~~~~~~~~~-~~~ 148 (311) ++......+..+-+++.. -....+++.|...+++++++.++.-.++|+.+.. +.....+..|+..... -++ T Consensus 103 ~~~~~~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l~~~g~~ds~d~~~~~~~d~fl~~~dGwlKka~~~~~ 182 (360) T protein:vir:99 103 DKSYYILVEPKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLMGIRAGASSGNLQSIGGAAELDNTFKGWIARAEGDAQ 182 (360) T ss_pred cceeeEeechHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHHHhhccchhcccccCcccchhhhhhHHHHHHhhcccc Confidence 455555566666554321 1123466899999999999999999999964421 1112222222221110 000 Q ss_pred e------------------------------cccc-ccchHHHHHHHHHHHhhcCCCc---c-EEEEcHHHHHHHHHhhc Q lcl|Aclame:pro 149 L------------------------------TTGT-SATPDLAVEAAVGLVLGDNLSP---D-GVALDNTFSFMLATQRD 193 (311) Q Consensus 149 ~------------------------------~~~~-~~~~~~~i~~~~~~~~~~~~~~---~-~~v~n~~~~~~l~~lkd 193 (311) . +.+. .......+.+++..+.....+. + +|+|++.+....+.... T Consensus 183 ~id~a~d~t~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~lf~~~~~~Lp~kyr~~~~~~~~~~~s~~~~~~yr~~L~ 262 (360) T protein:vir:99 183 SVDDAGDSTRIGLEDTATADADSMPSIANTDGSGNPQPVDTSLFNETIQTLDSRYRESDAYSPVLMTSPNQVQSYTMSLT 262 (360) T ss_pred hhhccccccccccccccccccccchhhhccccccccccchHHHHHHHHHhcchhhhcCcccceEEEccCchHHHHHHHHh Confidence 0 0000 0012223456666665553322 2 79999988766655433 Q ss_pred cCCceeeccccccCCCceecceeEEeecccccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcc Q lcl|Aclame:pro 194 SQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPD 273 (311) Q Consensus 194 ~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~ 273 (311) .-.-++--....+...-.+.|+|++..+.+|.+ .+++=+...+.++..+++++....+.+ T Consensus 263 ~R~t~LGd~~l~g~~~~~~~Gipi~~v~~~pd~------------------~~mlT~p~NLi~g~~~~iri~~~~e~~-- 322 (360) T protein:vir:99 263 EREDPLGSAVIFGDSDITPFSYDLVGVNGFPDE------------------YMMFTDPNNLAFGLYEEMELDQSTDTD-- 322 (360) T ss_pred ccCcccchhheecccccccceeeeEEcCCCCCC------------------ceEEeccCceeEEeeeeeEEeecccch-- Confidence 222233222233444456789999988887754 456667777777788887775433211 Q ss_pred cchhhhhcCc--EEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 274 GLGDLKRQNQ--IAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 274 ~~~~~f~~~~--v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) . ..... +..-...++|+.+..++|+++++.--.. T Consensus 323 ~----~~~~~~~~~~~~~~~~D~~iee~~Av~~vt~~~~~ 358 (360) T protein:vir:99 323 K----VHEQRLHSRNWLEGQFDFQIKEQQAGVLVTDLETP 358 (360) T ss_pred h----hhhhceeeeEEEEEEeeEEEEecccEEEEecCCCC Confidence 1 11222 2233456799999999999999965544 No 129 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=99.43 E-value=1.5e-14 Score=96.30 Aligned_cols=298 Identities=10% Similarity=-0.014 Sum_probs=162.1 Q ss_pred CcccCCCce--Ec--------------chhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEeecCccc Q lcl|Aclame:pro 1 MVALATGTF--QL--------------PKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVGEGAQK 63 (311) Q Consensus 1 mat~~~g~~--~v--------------P~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~Eg~~~ 63 (311) ||-...|+- +. =+.|..+|....+..|.++.+.++.++.+|+ +.+|+.. ...+.....|.++ T Consensus 1 ~a~~~~~~~~~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~G~sv~~~~iG-~~~~~~~~~g~~l 79 (347) T protein:vir:88 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMG-RTKGYYLAPGENL 79 (347) T ss_pred CCCcccchhhhccCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhhccccccccCcceEEEeeec-ceeeeeeccccCC Confidence 663322221 11 1778999999999999999999887766554 7888754 4556666666665 Q ss_pred cc--cccceeEEEEeeeeE-EEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCC--ccccccccc Q lcl|Aclame:pro 64 SE--STATFAPVTAIPRKV-QVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLT--GAALSGSPA 138 (311) Q Consensus 64 ~~--~~~~~~~v~l~~~kl-~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~--g~~~~~~~~ 138 (311) .. .++...++++..-++ .....|.+- +......|+...+.++.++++++..|+.++.--.... .....+.+. T Consensus 80 ~~~~~~~~~~~~~i~ID~~~y~~~~Vdd~---D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~~~~~~~ 156 (347) T protein:vir:88 80 DDKRKDIKHSEKVIQIDGLLTSDVLIYDI---EDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIA 156 (347) T ss_pred CCCCCCCccceEEEEEechhhhhhhhhhH---HHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccC Confidence 43 345666665554332 112222221 0112234677889999999999999998873210000 000111112 Q ss_pred cccccccce-------eeccccccchHHHHHHHHHHHhhcCCCcc--EEEEcHHHHHHHHHhhc-cCCceeeccccccCC Q lcl|Aclame:pro 139 KILDTTNIV-------ELTTGTSATPDLAVEAAVGLVLGDNLSPD--GVALDNTFSFMLATQRD-SQGRKLYPELGFGTD 208 (311) Q Consensus 139 ~~~~~~~~~-------~~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~v~n~~~~~~l~~lkd-~~g~~~~~~~~~~~~ 208 (311) ++..+.... ..........++.+.++...+...+.... .++++|..+..|.+.+. ....+.-......+. T Consensus 157 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~ 236 (347) T protein:vir:88 157 GLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGN 236 (347) T ss_pred CccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCHHHHHHHhcchhhhhhhhccccchhcce Confidence 211111111 00111112236667777777777666432 58889999988865332 222333222333456 Q ss_pred CceecceeEEeecccccccccccccccccccc-------cccceEEEeecce----------EEEEeecCceEEEeccCC Q lcl|Aclame:pro 209 VASFAGLNAAVSDTVRGGPEAVTASTGVYRTT-------NPNVKAIAGDFSA----------FRWGVQVSIPLELIEFGD 271 (311) Q Consensus 209 ~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~gd~~~----------~~~~~~~~~~i~~~~~~~ 271 (311) .+++.|++|+.++.+|.+..........+... .....-+-+|++. +......+++++..+.. T Consensus 237 vg~i~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~d~~~e~~r~~- 315 (347) T protein:vir:88 237 IRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRP- 315 (347) T ss_pred eeeeccceEEEeecccccccccccccccccccccccccccccccccccccCcEEEEEechhhhhheecccceeeeeech- Confidence 68899999999999985322211111100000 0001112233322 22222344445544332 Q ss_pred cccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 272 PDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 272 ~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .+|. + .+++...+|.+++||++.+.|+...+| T Consensus 316 -----~~~~-d--~i~~~~~~G~~~~rPe~a~~~~~~~a~ 347 (347) T protein:vir:88 316 -----EFQA-D--QIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) T ss_pred -----hhHH-H--HhhhhhhhcCceeccceEEEEEeCCCC Confidence 1222 2 467778899999999999888877777 No 130 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=99.40 E-value=5.2e-14 Score=93.40 Aligned_cols=296 Identities=13% Similarity=0.031 Sum_probs=170.5 Q ss_pred CcccCCC---------c--------eEcchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEeecCcc Q lcl|Aclame:pro 1 MVALATG---------T--------FQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVGEGAQ 62 (311) Q Consensus 1 mat~~~g---------~--------~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~Eg~~ 62 (311) |+....| + .+-=+.+..+|.+.....+.++++.++.++.+++ +++|+. +..++.....|++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~r~i~~gks~~~~~i-G~~~~~~~~~G~~ 79 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVVAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVL-GRTQAAYLAPGEN 79 (345) T ss_pred CcccccchhcccccccccccCCchhHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEEeee-cceEEEeeecCCC Confidence 5533221 1 1112678999999999999999999988888655 788876 6677888888877 Q ss_pred cccc--ccceeEEEE--eeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhccc--CCCccccccc Q lcl|Aclame:pro 63 KSES--TATFAPVTA--IPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGIN--PLTGAALSGS 136 (311) Q Consensus 63 ~~~~--~~~~~~v~l--~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~--~~~g~~~~~~ 136 (311) ...+ +++..+.++ .-.++.. ..|-+ + +..-+..|+...+.+++++++++..|+.++.-.. .....+..+. T Consensus 80 l~~~~~~~~~~e~~ltID~~~y~~-~~Vdd-i--D~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~~~ 155 (345) T protein:vir:22 80 LDDKRKDIKHTEKVITIDGLLTAD-VLIYD-I--EDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESKYNEN 155 (345) T ss_pred CCCCCCCcccceEEEEecchhhhh-hhHhh-H--HHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Confidence 6543 355555333 3332222 11211 1 1122345788999999999999999998873110 0000111111 Q ss_pred ccccccc--------ccceeeccccccchHHHHHHHHHHHhhcCCCcc--EEEEcHHHHHHHHHhhccC-Cceeeccccc Q lcl|Aclame:pro 137 PAKILDT--------TNIVELTTGTSATPDLAVEAAVGLVLGDNLSPD--GVALDNTFSFMLATQRDSQ-GRKLYPELGF 205 (311) Q Consensus 137 ~~~~~~~--------~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~v~n~~~~~~l~~lkd~~-g~~~~~~~~~ 205 (311) +.+...+ ..............++.+.++...+...+.... ..+++|..+..|.+-+.-+ ..+.-..... T Consensus 156 ~~~~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~ 235 (345) T protein:vir:22 156 IEGLGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAADRVFYCDPDSYSAILAALMPNAANYAALIDPE 235 (345) T ss_pred ccccccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCccCCEEEeChHHHHHHhccccccccccccccccc Confidence 1111111 111111111223457778888888877666644 4788999999886544322 2232222233 Q ss_pred cCCCceecceeEEeeccccccccccccc--------------ccccccccccceEEEeecceEEEEeecCceEEEeccCC Q lcl|Aclame:pro 206 GTDVASFAGLNAAVSDTVRGGPEAVTAS--------------TGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGD 271 (311) Q Consensus 206 ~~~~~~l~G~pv~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~ 271 (311) .+..+++.|++|+.++.+|......... .............++...+.+......+++++..+... T Consensus 236 ~G~V~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~~~ 315 (345) T protein:vir:22 236 KGSIRNVMGFEVVEVPHLTAGGAGTAREGTTGQKHVFPANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERARRAN 315 (345) T ss_pred cceEEEEeceEEEecccccccccCccccCcccccccccccccceeeeeccCceEEEEEehhheeeeeeecceeeeeechh Confidence 4456889999999999998532111000 00011112223444445555555555556666655421 Q ss_pred cccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccc Q lcl|Aclame:pro 272 PDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADE 310 (311) Q Consensus 272 ~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~ 310 (311) +|. + .+++..-+|.+++||++.+.|+-+-+ T Consensus 316 ------~~~-d--~I~~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 316 ------FQA-D--QIIAKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred ------HHH-H--HHHHHHhcCCcccccceeEEEEEeeC Confidence 222 2 35666779999999999999998888 No 131 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=99.36 E-value=4.2e-14 Score=93.93 Aligned_cols=296 Identities=12% Similarity=0.018 Sum_probs=161.1 Q ss_pred CcccCCC---c-eEcc-------------hhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEeecCcc Q lcl|Aclame:pro 1 MVALATG---T-FQLP-------------KHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVGEGAQ 62 (311) Q Consensus 1 mat~~~g---~-~~vP-------------~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~Eg~~ 62 (311) ||...+| + ...| +.|+.+|.+..+..+.++++.++.++.+++ +++|+. +..++..+..|++ T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~g~s~~~~~i-G~~~~~~~~~G~~ 79 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVL-GRTQAAYLAPGEN 79 (344) T ss_pred CccccccccCCcccCCccCCccchhHHHHHHHHHHHHHHHHHHhhhcccceeeeecccceEEEEee-ceeEEEeeecCCC Confidence 7744333 1 1122 678999999999999999999988888655 788886 5667777777877 Q ss_pred cccc--ccceeEEEEeeeeEEEEEeecHHHhhc--CchhhHHHHHHHHHHHHHHHHHHHHHHhhhcc----c--CCCccc Q lcl|Aclame:pro 63 KSES--TATFAPVTAIPRKVQVTQRFSQEVKWA--DESRQLGVLQTMADLSGVALGRALDLIGIHGI----N--PLTGAA 132 (311) Q Consensus 63 ~~~~--~~~~~~v~l~~~kl~~~i~iS~ell~~--s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~----~--~~~g~~ 132 (311) .+.+ ++.-.+.++..-++ .+++.++.+ ..-+..|+.+.+.+++++++++..|+.++.-. . +..... T Consensus 80 l~~t~~~~~~~e~~l~ID~~----~y~~~~VdDiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~~~~~~ 155 (344) T protein:vir:10 80 LDDIRKDIKHTEKVITIDGL----LTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESQYNEN 155 (344) T ss_pred CCCCCCCcccceEEEEEcch----hhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Confidence 7643 34445544443221 112222211 12233578899999999999999999886321 0 001111 Q ss_pred ccccccccccccccee----eccccccchHHHHHHHHHHHhhcCCCcc--EEEEcHHHHHHHHHhhccC-Cceeeccccc Q lcl|Aclame:pro 133 LSGSPAKILDTTNIVE----LTTGTSATPDLAVEAAVGLVLGDNLSPD--GVALDNTFSFMLATQRDSQ-GRKLYPELGF 205 (311) Q Consensus 133 ~~~~~~~~~~~~~~~~----~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~v~n~~~~~~l~~lkd~~-g~~~~~~~~~ 205 (311) +.+.+.+.....+... .........++.+.++...+...+.... ..+++|..+..|.+-+.-+ ..+.-..... T Consensus 156 ~~g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~ 235 (344) T protein:vir:10 156 ITGLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPSSDRVFYCDPDSYSAILAALMPNAANYAALIDPE 235 (344) T ss_pred cccccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCCEEEeChHHHHHHhhccccccccccccccee Confidence 1111111111111111 1111112346667778888877766533 3667999999886543222 2222122233 Q ss_pred cCCCceecceeEEeecccccccccccccc---cccccccccceEEEeecce----------EEEEeecCceEEEeccCCc Q lcl|Aclame:pro 206 GTDVASFAGLNAAVSDTVRGGPEAVTAST---GVYRTTNPNVKAIAGDFSA----------FRWGVQVSIPLELIEFGDP 272 (311) Q Consensus 206 ~~~~~~l~G~pv~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~gd~~~----------~~~~~~~~~~i~~~~~~~~ 272 (311) .+..+++.|++|+.++.+|.+........ ..............++|+. +......+++++..+.. T Consensus 236 ~G~V~~v~G~~V~~Sn~lp~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~~~e~~r~~-- 313 (344) T protein:vir:10 236 KGSIRNVMGFEVVEVPHLTAGGAGTSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLALERARRA-- 313 (344) T ss_pred eeEEEEEeceEEEeccccccccCCcccccccCccccccCCcccceeeecceeEEEeechhhhhhhhhccceeecccch-- Confidence 44567899999999999986421110000 0000001111111223322 22333344455544332 Q ss_pred ccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccc Q lcl|Aclame:pro 273 DGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADE 310 (311) Q Consensus 273 ~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~ 310 (311) .+|.. .+++..-+|.+++||++.+.++.++- T Consensus 314 ----~~~~d---~i~g~~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 314 ----NFQAD---QIIAKYAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred ----hHHHH---HHHHHhhcccceecccceEEEEeecC Confidence 22322 45566789999999998855555554 No 132 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=99.36 E-value=3.5e-13 Score=88.86 Aligned_cols=276 Identities=9% Similarity=0.017 Sum_probs=162.3 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhh---------ccee--ecCCCceEEEEEeCC-ceeEEeecCcccccccc Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARL---------SMAE--PQEFGEQQYMTLTAP-PRGEVVGEGAQKSESTA 68 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l---------~~~~--~~~~~~~~~p~~~~~-~~a~~v~Eg~~~~~~~~ 68 (311) ||++.-...++|+.+..-+.+...+.+.+.+- .... ..++..+++|....- .++.-+.|+.+++..+. T Consensus 1 MA~T~lsd~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~~v~~~~~i~~~~l 80 (324) T protein:vir:59 1 MAYTKISDVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQVLNDTDDLVPQKI 80 (324) T ss_pred CCceeeeceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCCCcccccCCCcccchhhc Confidence 99887789999999977777766666555332 1222 234456899998764 67778899999999888 Q ss_pred ceeEEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCcccccccccccccccccee Q lcl|Aclame:pro 69 TFAPVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVE 148 (311) Q Consensus 69 ~~~~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~ 148 (311) +.++-....++.+.-..++++....+ .-|..+.+.+++++.++++.++.+|.-- .|+........+... T Consensus 81 ~t~~~~a~i~~~~k~~~~tD~a~~~s---g~dp~~~i~~q~a~~~~~~~~~~lia~l--------~g~~~~~~~~~~~~d 149 (324) T protein:vir:59 81 NAGQDKAVLILRGNAWSSHDLAATLS---GSDPMQAIGSRVAAYWAREMQKIVFAEL--------AGVFSNDDMKDNKLD 149 (324) T ss_pred ccceeeEEEEeecCceeehhhhhhhc---cchHHHHHHHHHHHHHHHHHHHHHHHHH--------HHhhhccccccceee Confidence 87776666667777778887744322 3356788999999999999998876321 011100000011111 Q ss_pred e-ccccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccccccc Q lcl|Aclame:pro 149 L-TTGTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGP 227 (311) Q Consensus 149 ~-~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~ 227 (311) . +..+.....+.+.++..++.+....-.+|+||+..+..|++..-.+ ++ .....+...+.++|++|++++.+|... T Consensus 150 vsa~~~~~~s~~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~li~--~~-~~s~~~~~i~~~~G~~VivdD~~p~~~ 226 (324) T protein:vir:59 150 ISGTADGIYSAETFVDASYKLGDHESLLTAIGMHSATMASAVKQDLIE--FV-KDSQSGIRFPTYMNKRVIVDDSMPVET 226 (324) T ss_pred eeccccceecHHHHHHHHHHhCCcccCcEEEEEchHHHHHHHHhhhhh--hc-cccccCceeeeecccEEEEeCCCCccc Confidence 1 1122223467788899888777666778999999999998753111 11 111122345789999999999998643 Q ss_pred ccccccccccccccccceEEEeecceEEEEe-ecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEE Q lcl|Aclame:pro 228 EAVTASTGVYRTTNPNVKAIAGDFSAFRWGV-QVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVR 306 (311) Q Consensus 228 ~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~-~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~ 306 (311) ...... .+ ..++++. ..+.+.. +..+.+++.|... .+.-.+.++.++ ++||..+..-+ T Consensus 227 ~~~~~~--~y------~s~l~~~-GAi~~~~~~~~v~vE~dRd~~---------~g~~~l~~r~~~---~~~p~G~s~~~ 285 (324) T protein:vir:59 227 LEDGTK--VF------TSYLFGA-GALGYAEGQPEVPTETARNAL---------GSQDILINRKHF---VLHPRGVKFTE 285 (324) T ss_pred cCCCCc--eE------EEEEEec-CeEEEeecCCCcceecccCcc---------ccceEEEEeeEE---EeEeeeEEecc Confidence 321111 00 1223332 2222322 3445566655532 233334444443 35555544432 Q ss_pred ecc--cC Q lcl|Aclame:pro 307 DAD--ES 311 (311) Q Consensus 307 ~aa--~~ 311 (311) .+. .+ T Consensus 286 ~~~~~~s 292 (324) T protein:vir:59 286 NAMAGTT 292 (324) T ss_pred cccCCCC Confidence 211 11 No 133 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=99.36 E-value=4.7e-13 Score=88.18 Aligned_cols=279 Identities=10% Similarity=-0.021 Sum_probs=156.6 Q ss_pred CcccCCCc-------------eEcchhHHHHHHHHHHhhchhhhhcceeec---CCCceEEEEEeCCceeEEeecCcccc Q lcl|Aclame:pro 1 MVALATGT-------------FQLPKHLVPGVWQKAQGQSVLARLSMAEPQ---EFGEQQYMTLTAPPRGEVVGEGAQKS 64 (311) Q Consensus 1 mat~~~g~-------------~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~---~~~~~~~p~~~~~~~a~~v~Eg~~~~ 64 (311) ||+...+| .++|+.|..++++.+++.+++..++..... .+.++++|+.. .+.+..+.++.+++ T Consensus 1 ~~~~~~~~~~~~~~~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~~~~~~~~GdTV~ip~~g-~~~a~d~~~g~~i~ 79 (381) T protein:vir:80 1 MATIQGTGGYKGSAVDLSNVQVFIPEVWSSEVRMFRDQKFAALEATKKIPFEGKKGDLIHIPNIS-RAAVYDKQPQTPVN 79 (381) T ss_pred CceecccccccCcccchhhHHhhhhHHHHHHHHHHHHHhhhhhhccccccceeecCceEEeeccC-cceeeeecCCCccc Confidence 66654433 478999999999999999998888765332 23458899865 56788888998888 Q ss_pred ccccceeEEEEeeeeE-EEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccc---cccc Q lcl|Aclame:pro 65 ESTATFAPVTAIPRKV-QVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGS---PAKI 140 (311) Q Consensus 65 ~~~~~~~~v~l~~~kl-~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~---~~~~ 140 (311) ..+.+..++++...+. ..-+.|+++-. ..+..|+.+.+.+++.+++++++|+.++.-............ ...+ T Consensus 80 ~~~~~~~~~~itID~~~~~~~~Idd~D~---~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~~t~~~~i 156 (381) T protein:vir:80 80 LQARTDSEFTFTVTKYKESSFMIEDIVN---TQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRIYSYDTTL 156 (381) T ss_pred ccccCCceEEEEEeeeeecceeechHHH---HhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccc Confidence 7777776666655332 33356665422 223457888999999999999999998743110000000000 1111 Q ss_pred cccccceeeccccccchHHHHHHHHHHHhhcCCCc--cEEEEcHHHHHHHHHhhc-cCCceeeccccccCCCceecceeE Q lcl|Aclame:pro 141 LDTTNIVELTTGTSATPDLAVEAAVGLVLGDNLSP--DGVALDNTFSFMLATQRD-SQGRKLYPELGFGTDVASFAGLNA 217 (311) Q Consensus 141 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~--~~~v~n~~~~~~l~~lkd-~~g~~~~~~~~~~~~~~~l~G~pv 217 (311) .........+.......++.+.++...+...+... -.++++|..+..|.+... .+-.+.-......+..+++.|++| T Consensus 157 ~~~~~~~~~t~~~~~~t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~~~~~~ad~~~~~~l~~G~Ig~i~G~~V 236 (381) T protein:vir:80 157 GDGTVNAHLTGTPAPLTYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLSINQFISVDFSQVKPVTSGVVGTILGMEV 236 (381) T ss_pred cccccccccccchhhHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhhchhhhhhhhccchhhhceeeeEEcceEE Confidence 11111111122233446778888888887776642 258899999999876432 111222233344556789999999 Q ss_pred Eeeccccccccccccccc---ccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEecc Q lcl|Aclame:pro 218 AVSDTVRGGPEAVTASTG---VYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGI 294 (311) Q Consensus 218 ~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~ 294 (311) +.++.+|........... ...........+-|+ |..+..+++....+|. T Consensus 237 v~Sn~lp~~~~t~~~~~agap~~~~~~~~~~~~~g~----------------------------~s~~a~av~~~k~yd~ 288 (381) T protein:vir:80 237 IVTTQIGINSLTGYVNGQGAPTQPTPGVLGSPYLPD----------------------------QAGTANVVNTGSASDL 288 (381) T ss_pred Eeecccccccccceeeeccccccccccccccccccc----------------------------cccceeeeeeeeeece Confidence 999999864221111000 000000000111122 2333445555556666 Q ss_pred EEeccc-ceEEEEecccC Q lcl|Aclame:pro 295 GIMSTD-AFAVVRDADES 311 (311) Q Consensus 295 ~v~~~~-a~~~l~~aa~~ 311 (311) ++...- .+.....+.+- T Consensus 289 ~~~~~~~~~~~~~g~~~~ 306 (381) T protein:vir:80 289 AVSLSYFGLPVFSGAGAT 306 (381) T ss_pred eeeeeeccceeeecceee Confidence 664332 22222211111 No 134 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=99.34 E-value=4.8e-14 Score=93.60 Aligned_cols=293 Identities=13% Similarity=0.034 Sum_probs=156.5 Q ss_pred CcccCCCce-Ecc--------------hhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEeecCcccc Q lcl|Aclame:pro 1 MVALATGTF-QLP--------------KHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVGEGAQKS 64 (311) Q Consensus 1 mat~~~g~~-~vP--------------~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~Eg~~~~ 64 (311) |+-+.-... +.| +.+..+|+...+..+.++++.+..++.+|+ +.+|+. +..++.....|++++ T Consensus 1 m~~~~~~~~~t~~g~~~~~~d~~al~ik~f~~eV~~~f~~~s~~~~~~~~r~i~~G~sv~i~~i-G~~tv~~~t~G~~l~ 79 (347) T protein:vir:94 1 MANVPGQKIGTDQGKGKSSSDALALFLKVFAGEVLTAFTRRSVTADKHIVRTIQNGKSAQFPVM-GRTSGVYLAPGERLS 79 (347) T ss_pred CCCCCccccccccccCCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccccceEEEecc-cceeeeeecCCCCcC Confidence 553211111 111 578899999999999999999888776554 788886 566677777777664 Q ss_pred cc--ccceeE--EEEeeeeEEEEEeecHHHhhc--CchhhHHHHHHHHHHHHHHHHHHHHHHhhhcc---cCCCcccccc Q lcl|Aclame:pro 65 ES--TATFAP--VTAIPRKVQVTQRFSQEVKWA--DESRQLGVLQTMADLSGVALGRALDLIGIHGI---NPLTGAALSG 135 (311) Q Consensus 65 ~~--~~~~~~--v~l~~~kl~~~i~iS~ell~~--s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~---~~~~g~~~~~ 135 (311) .+ +.+-.+ +++...++ ++.++-+ ..-+..|+.+.+.++.++++++..|+.++.-- ....+ .... T Consensus 80 ~~~~~~~~~e~~itID~~~~------~~~~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~-~~~~ 152 (347) T protein:vir:94 80 DKRKGIKHTEKVITIDGLLT------ADVMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPA-ASNE 152 (347) T ss_pred CCCCCCCcceEEEEecchhh------hhHHhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-cccc Confidence 33 223333 33333322 2222211 12234578888999999999999999886310 00001 0111 Q ss_pred ccccccccccceeeccc--------cccchHHHHHHHHHHHhhcCCCcc--EEEEcHHHHHHHHHhhccCC-ceeecccc Q lcl|Aclame:pro 136 SPAKILDTTNIVELTTG--------TSATPDLAVEAAVGLVLGDNLSPD--GVALDNTFSFMLATQRDSQG-RKLYPELG 204 (311) Q Consensus 136 ~~~~~~~~~~~~~~~~~--------~~~~~~~~i~~~~~~~~~~~~~~~--~~v~n~~~~~~l~~lkd~~g-~~~~~~~~ 204 (311) .+.+...++ ....... .....++.|.++...+...+.... ..+++|..+..|.+-++-+. .+.-.... T Consensus 153 ~~~g~~~~s-~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~ 231 (347) T protein:vir:94 153 NIAGLGTAS-VLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILAALMPNAANYAALIDP 231 (347) T ss_pred ccCCCcccc-eeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhccchhhhhhccccccc Confidence 111111111 1111100 112235556677777776665432 57889999987755443222 22222233 Q ss_pred ccCCCceecceeEEeecccccccccccccccccccccccceEEE--------eec----------ceEEEEeecCceEEE Q lcl|Aclame:pro 205 FGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIA--------GDF----------SAFRWGVQVSIPLEL 266 (311) Q Consensus 205 ~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------gd~----------~~~~~~~~~~~~i~~ 266 (311) ..+..++++|++|+.++.+|.................+....+. +|| +.+......+++++. T Consensus 232 ~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~ 311 (347) T protein:vir:94 232 ETGNIRNVMGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDLALER 311 (347) T ss_pred cccceEEEeceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhcccccccc Confidence 34566899999999999999643222111111111122222222 222 222222233334443 Q ss_pred eccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 267 IEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 267 ~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .+.. .+|. + .+++..-+|.+++||++.+.|+.+++- T Consensus 312 ~r~~------~~~~-d--~i~~~~~~G~~~~rP~~a~~~~~~~A~ 347 (347) T protein:vir:94 312 DRDV------DAQG-D--LIVGKYAMGHGGLRPEAAGALVFSPAE 347 (347) T ss_pred hhch------hhHH-H--HhhhhhhhcCcccccceeEEEEecCCC Confidence 3321 1232 2 567778899999999999888755322 No 135 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=99.34 E-value=1.4e-13 Score=91.05 Aligned_cols=293 Identities=12% Similarity=0.016 Sum_probs=158.1 Q ss_pred CcccCCCc-----------------eEcchhHHHHHHHHHHhhchhhhhcceeecCCC-ceEEEEEeCCceeEEeecCcc Q lcl|Aclame:pro 1 MVALATGT-----------------FQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFG-EQQYMTLTAPPRGEVVGEGAQ 62 (311) Q Consensus 1 mat~~~g~-----------------~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~-~~~~p~~~~~~~a~~v~Eg~~ 62 (311) ||-+.+|+ ..| +.|+.+|.+.++..|.++.+.+..+..+| ++.+|+.. ..++.....|++ T Consensus 1 ~~~~~~~~~~~t~~g~~~~~~~~~al~i-e~~~g~V~~~f~~~s~~~~~v~~r~~~~G~sv~i~~iG-~~t~~~~~~g~~ 78 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQSAADKLALFL-KVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIG-RTKAAYLKPGEN 78 (347) T ss_pred CCCCccCcccccccccCCcccchHHHHH-HHHHHHHHHHHHHHHhhhhhhccccccccceeEeeecc-ceeeeeecCCCC Confidence 77554444 234 77899999999999999999987766654 47888854 455566666666 Q ss_pred ccc--cccceeEEEEe--eeeEEE-EEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcc-----cC----C Q lcl|Aclame:pro 63 KSE--STATFAPVTAI--PRKVQV-TQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGI-----NP----L 128 (311) Q Consensus 63 ~~~--~~~~~~~v~l~--~~kl~~-~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~-----~~----~ 128 (311) ++. .+....+.++. -.+... .+.==+| ..+..|+...+.++.+++++++.|+.++.-. .. . T Consensus 79 l~~~~~~~~~~e~~ltiD~~~y~~~~VddiD~-----~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~ 153 (347) T protein:vir:33 79 LDDKRKDIKHTEKVIHIDGLLTADVLIYDIED-----AMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGSNE 153 (347) T ss_pred CCCCCCCCccceEEEEechhhhhhHHHhhHHH-----HhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccc Confidence 543 23444443333 222221 1111122 1233467888999999999999999987210 00 0 Q ss_pred Cccccccc--cccccccccceeeccccccchHHHHHHHHHHHhhcCCCcc--EEEEcHHHHHHHHHhhcc-CCceeeccc Q lcl|Aclame:pro 129 TGAALSGS--PAKILDTTNIVELTTGTSATPDLAVEAAVGLVLGDNLSPD--GVALDNTFSFMLATQRDS-QGRKLYPEL 203 (311) Q Consensus 129 ~g~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~v~n~~~~~~l~~lkd~-~g~~~~~~~ 203 (311) ...++.+. ......++.............++.+.++...|...+.... ..+++|..+..|.+-..- +..+.-... T Consensus 154 ~~~~~~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~d~~~~~~ 233 (347) T protein:vir:33 154 NIEGLGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQALLD 233 (347) T ss_pred ccccccccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCcEEEeCHHHHHHHhccccccccccccccc Confidence 00000000 0000111111110111122456777888888877777432 478899999988764332 223322233 Q ss_pred cccCCCceecceeEEeeccccccccccccc------ccccccccccceEEEeec----------ceEEEEeecCceEEEe Q lcl|Aclame:pro 204 GFGTDVASFAGLNAAVSDTVRGGPEAVTAS------TGVYRTTNPNVKAIAGDF----------SAFRWGVQVSIPLELI 267 (311) Q Consensus 204 ~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~gd~----------~~~~~~~~~~~~i~~~ 267 (311) ...+..++++|++|+.++.+|......... ...+..... ...-++| +.+......+++++.. T Consensus 234 ~~~G~V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ag~~~~~~~~~~--~~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e~~ 311 (347) T protein:vir:33 234 PERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSS--TTVKVALDNVVGLFQHRSAVGTVKLKDLALERA 311 (347) T ss_pred cccceeEEEeceeEEEecccccCccccccccccccccccccCCcc--cceeccccceeeeeecchhheeeeeeceeeeec Confidence 344556899999999999999753321110 000000000 1111222 2221222333344443 Q ss_pred ccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 268 EFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 268 ~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) +.. ++|- -.+++...+|.+++||++.+.|+.+-=+ T Consensus 312 r~~------~~~~---d~i~~~~~~G~~vlrP~~av~i~~~~~~ 346 (347) T protein:vir:33 312 RRA------NYQA---DQIIAKYAMGHGGLRPEAAGAIVLPKVS 346 (347) T ss_pred cch------hhhh---HhhhhhhhcCCceecccceEEEecCCCC Confidence 321 1222 2356667889999999998888755444 No 136 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=99.33 E-value=3.2e-13 Score=89.11 Aligned_cols=297 Identities=12% Similarity=0.014 Sum_probs=157.9 Q ss_pred CcccCCCceE---------cc-------hhHHHHHHHHHHhhchhhhhcceeecCCC-ceEEEEEeCCceeEEeecCccc Q lcl|Aclame:pro 1 MVALATGTFQ---------LP-------KHLVPGVWQKAQGQSVLARLSMAEPQEFG-EQQYMTLTAPPRGEVVGEGAQK 63 (311) Q Consensus 1 mat~~~g~~~---------vP-------~~~~~~ii~~~~~~s~l~~l~~~~~~~~~-~~~~p~~~~~~~a~~v~Eg~~~ 63 (311) ||.+.+|+.+ .+ +.++.+|+...+..|.++.+.++.+..++ ++.+|+.. ..++.....|.++ T Consensus 1 ma~~~~~~~~~t~~~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~~~~~G~sv~i~~ig-~~t~~~~~~g~~l 79 (347) T protein:vir:15 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIG-RTKAAYLKPGENL 79 (347) T ss_pred CCccccCCccccccccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhccccccccccceeEeeecc-ceeeeeeccCCCC Confidence 8877666632 12 45788899989999999999987776654 47888865 4566666667666 Q ss_pred cc--cccceeEEEE--eeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhccc----C--CCcccc Q lcl|Aclame:pro 64 SE--STATFAPVTA--IPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGIN----P--LTGAAL 133 (311) Q Consensus 64 ~~--~~~~~~~v~l--~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~----~--~~g~~~ 133 (311) +. .+.+..+.++ .-.|... ..| +.+ +...+..|+...+.++.+++++++.|+.++.-.. . ...... T Consensus 80 ~~~~~~~~~~e~~ltID~~~~~~-~~V-ddl--D~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~ 155 (347) T protein:vir:15 80 DDKRKDIKHTEKVIHIDGLLTAD-VLI-YDI--EDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDASNENI 155 (347) T ss_pred CCCCCCCccceEEEEechhhhhh-HHh-hhH--HHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Confidence 43 3344455433 3333322 112 121 1123345788889999999999999999873210 0 000000 Q ss_pred -----ccccccccccccceeeccccccchHHHHHHHHHHHhhcCCCcc--EEEEcHHHHHHHHHhhccCC-ceeeccccc Q lcl|Aclame:pro 134 -----SGSPAKILDTTNIVELTTGTSATPDLAVEAAVGLVLGDNLSPD--GVALDNTFSFMLATQRDSQG-RKLYPELGF 205 (311) Q Consensus 134 -----~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~v~n~~~~~~l~~lkd~~g-~~~~~~~~~ 205 (311) .++.......+.............++.+.++...+...+.... ..+++|..+..|.+-.+... .+.-..... T Consensus 156 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL~~~~~~~~d~~~~~~~~ 235 (347) T protein:vir:15 156 EGLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQALIDHE 235 (347) T ss_pred cccCccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhccccccccccccccccc Confidence 0000000000000000000112235556666667766666422 36668999998876443222 222122233 Q ss_pred cCCCceecceeEEeecccccccccccccc------ccccccc--------ccceEEEeecceEEEEeecCceEEEeccCC Q lcl|Aclame:pro 206 GTDVASFAGLNAAVSDTVRGGPEAVTAST------GVYRTTN--------PNVKAIAGDFSAFRWGVQVSIPLELIEFGD 271 (311) Q Consensus 206 ~~~~~~l~G~pv~~~~~~~~~~~~~~~~~------~~~~~~~--------~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~ 271 (311) .+..++++|++|+.++.+|.......... ....... .....++...+.+......+++++..+.. T Consensus 236 ~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e~~~~~- 314 (347) T protein:vir:15 236 RGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRA- 314 (347) T ss_pred ceEEEEEeceEEEecccccccccccccccccccccccccccccceeeeccccceeeeeccceeeeeEeeceeeeecccc- Confidence 45568899999999999996543211100 0000000 00111222222222333344445444321 Q ss_pred cccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 272 PDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 272 ~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .+|- -.+++...+|.+++||++.+.|+.+-=+ T Consensus 315 -----~~~~---d~i~~~~~~G~~vlrP~~av~~~~~~~~ 346 (347) T protein:vir:15 315 -----NYQA---DQIIAKYAMGHGGLRPEAAGAIVLPKVS 346 (347) T ss_pred -----hhhh---hhhehhhhcCCceeccccEEEEecCCCC Confidence 1222 2456667889999999998888754444 No 137 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=99.33 E-value=1.8e-13 Score=90.49 Aligned_cols=294 Identities=15% Similarity=0.087 Sum_probs=158.0 Q ss_pred Ccc---------------cCCCc----eEcchhHHHHHHHHHHhhchhhhhcceeecCCC-ceEEEEEeCCceeEEeecC Q lcl|Aclame:pro 1 MVA---------------LATGT----FQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFG-EQQYMTLTAPPRGEVVGEG 60 (311) Q Consensus 1 mat---------------~~~g~----~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~-~~~~p~~~~~~~a~~v~Eg 60 (311) |-+ .+++. .+| +.|..+|++.++..|.++.+.+..+..+| ++++|+. +..+++....| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~d~~~al~l-e~~~geV~~~f~~~s~~~~~~~~r~i~~G~tv~i~~i-g~~~~~~~~~g 78 (332) T protein:vir:78 1 MTTLSNFSLPNQANGGARNADYDVRYATAL-KLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-GKLSAGYHTPG 78 (332) T ss_pred CcccccccCCccccCCccccccccchhhhh-hhhhhhHHHHHHHHhhhhhccccccccccceEEEEec-cceeEeeecCC Confidence 221 12232 334 78999999999999999999987776654 4888887 45556665556 Q ss_pred cccccc-ccceeEEEEee--eeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcc--cCCCcccccc Q lcl|Aclame:pro 61 AQKSES-TATFAPVTAIP--RKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGI--NPLTGAALSG 135 (311) Q Consensus 61 ~~~~~~-~~~~~~v~l~~--~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~--~~~~g~~~~~ 135 (311) .++... ++.-.++++.. .|+.. ..| +.+ +...+..|+.+.+.++.++++++..|+.++.-- ......+.++ T Consensus 79 ~~l~~~~~~~~~~~~l~ID~~ky~~-~~V-ddi--D~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~~~~~ 154 (332) T protein:vir:78 79 TPIVGDAGIKANEKTLVMDDLLVSS-QFV-YSL--DEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTG 154 (332) T ss_pred CCCCCCCCCCCceEEEEEehhhhhH-HHH-HhH--HHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCcccc Confidence 554322 34434443333 22222 112 121 112234578899999999999999999887321 0011111111 Q ss_pred ccccccccccceeeccccccchHHHHHHHHHHHhhcCCCcc-E-EEEcHHHHHHHHHhhccC--Cceee--cc-ccccCC Q lcl|Aclame:pro 136 SPAKILDTTNIVELTTGTSATPDLAVEAAVGLVLGDNLSPD-G-VALDNTFSFMLATQRDSQ--GRKLY--PE-LGFGTD 208 (311) Q Consensus 136 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~-~-~v~n~~~~~~l~~lkd~~--g~~~~--~~-~~~~~~ 208 (311) .+.+... ........+....++.|.++...+...+.... . ++++|..+..|.+.+|.. .+... .. ...+.. T Consensus 155 ~~g~~~~--~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~d~~~~n~~~~~~~~~~~~g~~ 232 (332) T protein:vir:78 155 EPGGFHV--NIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKG 232 (332) T ss_pred ccccccc--ccCCccccCHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHHhhcCceeeeeeccccccceeccee Confidence 1111100 00111112233467778888888888777543 3 556999999887644321 01000 00 111223 Q ss_pred CceecceeEEeeccccccccccccccc------ccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcC Q lcl|Aclame:pro 209 VASFAGLNAAVSDTVRGGPEAVTASTG------VYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQN 282 (311) Q Consensus 209 ~~~l~G~pv~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~ 282 (311) .++++|++|+.++.+|........... .+.........++.-.+.+......++++++.+. ....++|- + T Consensus 233 i~~i~G~~V~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~---~~~~~~~~-d 308 (332) T protein:vir:78 233 LYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSG---DFNVQYQG-D 308 (332) T ss_pred eeEEeeeEEEecCccccCcccccccccccccccccccccccceEEeecccceeeeeeeccchhhhhc---ccchhhhH-h Confidence 578999999999999965432221110 1111111112233333333333333444443221 01112222 2 Q ss_pred cEEEEEEEEeccEEecccceEEEEec Q lcl|Aclame:pro 283 QIAIRAEVVYGIGIMSTDAFAVVRDA 308 (311) Q Consensus 283 ~v~~ra~~r~~~~v~~~~a~~~l~~a 308 (311) .++....+|.+++||++++.|+.| T Consensus 309 --~i~~~~~~G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 309 --LIVGKLAMGCGSLRTSVAGSFQAA 332 (332) T ss_pred --hhhhhhhhcCceecccceEEEeeC Confidence 456667899999999999999888 No 138 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=99.30 E-value=2.5e-12 Score=84.21 Aligned_cols=297 Identities=12% Similarity=-0.009 Sum_probs=160.1 Q ss_pred Cccc----------CCCceEcc-hhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEeecCcccccccc Q lcl|Aclame:pro 1 MVAL----------ATGTFQLP-KHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVGEGAQKSESTA 68 (311) Q Consensus 1 mat~----------~~g~~~vP-~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~Eg~~~~~~~~ 68 (311) |... +.....+. +.+..+|.+.....+..+++..+.++.+++ +++|+. +..+++...-|++...+.+ T Consensus 1 ms~~n~~t~~~~~~~~~~~al~le~f~geV~taf~~~s~~~~~~~~rti~~gkS~q~~~i-G~~~~~~~~~G~~ld~~~~ 79 (364) T protein:vir:10 1 MSNPNVLTQPAVSASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDVQEVVGTNSVSNKYI-GETELQVLSPGKSPDASPT 79 (364) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEeeee-eeeEEeeeccCcccCCCCc Confidence 4411 11112333 778999999999999999999988888765 788887 4555566555555544445 Q ss_pred ceeEEEEeeeeEEEEEeecHHHhhcCc--hhhHH-HHHHHHHHHHHHHHHHHHHHhhhcccCC--Ccccc-ccccccccc Q lcl|Aclame:pro 69 TFAPVTAIPRKVQVTQRFSQEVKWADE--SRQLG-VLQTMADLSGVALGRALDLIGIHGINPL--TGAAL-SGSPAKILD 142 (311) Q Consensus 69 ~~~~v~l~~~kl~~~i~iS~ell~~s~--~~~~~-~~~~i~~~la~~ia~~~d~~~l~G~~~~--~g~~~-~~~~~~~~~ 142 (311) .-++.++..-.+- +++.++-+-+ -+.+| +.+++.+++.+++++..|+.++.-.-.. ....+ ...+.+... T Consensus 80 ~~~k~~itID~ll----~a~~~V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~~~~~~ 155 (364) T protein:vir:10 80 EFDKNRLVVDTTV----IARNTVAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNPRVAGH 155 (364) T ss_pred ccCcEEEEeccee----eechhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCCcccCC Confidence 5555444443221 2222221111 22345 5678999999999999999986310000 00000 000011111 Q ss_pred cccceeeccc-----cccchHHHHHHHHHHHhhcCCCcc--EEEEcHHHHHHHHHhhccCC-ceee--ccccccCCCcee Q lcl|Aclame:pro 143 TTNIVELTTG-----TSATPDLAVEAAVGLVLGDNLSPD--GVALDNTFSFMLATQRDSQG-RKLY--PELGFGTDVASF 212 (311) Q Consensus 143 ~~~~~~~~~~-----~~~~~~~~i~~~~~~~~~~~~~~~--~~v~n~~~~~~l~~lkd~~g-~~~~--~~~~~~~~~~~l 212 (311) +....-.... ......+.+..+...+...+.... ..+++|..+..|.+-.+--. .+.. ......+...++ T Consensus 156 g~~i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~~R~~vv~P~~y~~Ll~~~~lvn~d~~~~~~~~~~~G~v~~v 235 (364) T protein:vir:10 156 GFSIHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDTSELCGLMPWTAFNCLRDADRIVDKSYTIAASDNTVDGFVLKS 235 (364) T ss_pred cceeeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCccccEEEeChHHHHHHhcCCccccccccccCCCccccceeEEE Confidence 1111000111 111223345566666766666544 58889999988876321000 1111 122334556789 Q ss_pred cceeEEeeccccccccccccc-------------ccccc--cccccceEEEeecceEEEEeecCceEEEeccCCcccchh Q lcl|Aclame:pro 213 AGLNAAVSDTVRGGPEAVTAS-------------TGVYR--TTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGD 277 (311) Q Consensus 213 ~G~pv~~~~~~~~~~~~~~~~-------------~~~~~--~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~ 277 (311) .|+||+.|+.+|......... ...+. .......+++.-.+.+......+++.++.++... T Consensus 236 ~Gv~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e~~~~~~~----- 310 (364) T protein:vir:10 236 WNTPIVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISITGDIFYEKKE----- 310 (364) T ss_pred eceEEEeccccccccccccccccccccccccccCCcccccccccceeEEEEEecceEEEEEEecceeeeeeccce----- Confidence 999999999999543321110 00111 1111233444444455555556666665543211 Q ss_pred hhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 278 LKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 278 ~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) |. -.+.+.+-+|.+++||++.+.|+.++.. T Consensus 311 -~~---~~ida~~a~G~g~lRPeaa~~i~~~~~~ 340 (364) T protein:vir:10 311 -KT---WYIDTFLAEGAIPDRWEAVAVVTAADTA 340 (364) T ss_pred -ee---eeeeeehcccCcccCccceEEEEecCCC Confidence 11 1233456699999999999999888877 No 139 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=99.30 E-value=3.9e-13 Score=88.61 Aligned_cols=297 Identities=14% Similarity=-0.011 Sum_probs=162.3 Q ss_pred CcccCCCc------------eEcc-hhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEeecCcccccc Q lcl|Aclame:pro 1 MVALATGT------------FQLP-KHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVGEGAQKSES 66 (311) Q Consensus 1 mat~~~g~------------~~vP-~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~Eg~~~~~~ 66 (311) |++...+. ..++ +.|+.+|....+..+.++++.++.++.+|+ +.+|+. +..+++...-|+++... T Consensus 1 m~~~~~~~~t~~~~~~~~~~~~l~le~~~geV~~af~~~s~~~~~~~~r~i~~G~s~~~~~i-G~~~~~~~~~g~~l~~~ 79 (334) T protein:vir:80 1 MTYPAANTHTRPGWGGANSDVSLHIEEHLGLVDASFMYSSKFASWMNVRSLRGTNQLRVDRV-GASTIAGRKAGEELVVQ 79 (334) T ss_pred CCCCcCCCccccccccccchheehhhhhhhHHHHHHHHhhhhhccceeeeccccceEEEeee-cceeeeeecCCCCCCCC Confidence 76653322 2344 789999999999999999999988888655 788876 56777777778877777 Q ss_pred ccceeEEEEeeeeE-EEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhc----cc--CCCcccccccccc Q lcl|Aclame:pro 67 TATFAPVTAIPRKV-QVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHG----IN--PLTGAALSGSPAK 139 (311) Q Consensus 67 ~~~~~~v~l~~~kl-~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G----~~--~~~g~~~~~~~~~ 139 (311) ..+-++.++..-.+ .....|-+- +..-+..|+.+.+.++++++++++.|++++.. .. +.....+...+ | T Consensus 80 ~~~~~~~~l~ID~~l~~~~~Vddi---D~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~~~~~~-G 155 (334) T protein:vir:80 80 KNVSDKLNLTVDTVLYARHFFDKF---DEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLKPAFHD-G 155 (334) T ss_pred CcccCceEEEEeeeeehhhhHhhH---HHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccC-C Confidence 66666665555432 222222211 11223457889999999999999999987632 11 11110000000 1 Q ss_pred ccccccc---eeeccccccchHHHHHHHHHHHhhcCCC-----ccEEEEcHHHHHHHHHhhccCCc-eee---ccccccC Q lcl|Aclame:pro 140 ILDTTNI---VELTTGTSATPDLAVEAAVGLVLGDNLS-----PDGVALDNTFSFMLATQRDSQGR-KLY---PELGFGT 207 (311) Q Consensus 140 ~~~~~~~---~~~~~~~~~~~~~~i~~~~~~~~~~~~~-----~~~~v~n~~~~~~l~~lkd~~g~-~~~---~~~~~~~ 207 (311) ....... ......+.......+..+...+...+.. .-..+++|..+..|.+-+.--.+ +.- .....++ T Consensus 156 ~~~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d~~~s~~~~~~~~g 235 (334) T protein:vir:80 156 ILLPSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNVEFGAKEGGNSFVGG 235 (334) T ss_pred cceeecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccccceeccccccccccce Confidence 0000000 0000111111233455666666555554 23588899999988764321111 100 0112344 Q ss_pred CCceecceeEEeeccccccccccccccc---ccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcE Q lcl|Aclame:pro 208 DVASFAGLNAAVSDTVRGGPEAVTASTG---VYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQI 284 (311) Q Consensus 208 ~~~~l~G~pv~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v 284 (311) ..++++|++|+.++.+|........... .+........++|.-.+.+......+++.++.++.. .|.. T Consensus 236 ~i~~v~G~~V~~Sn~~P~~~~t~~~~g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~~~------~~~d--- 306 (334) T protein:vir:80 236 RIAMLNGVRVVETPRFPQSAITANALGADFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFWEEKK------DFGH--- 306 (334) T ss_pred eEEEEeceEEEeecCCCCccccccccccccccccccccceEEEEEeCceEEEEEEeecceeeeechh------hHHH--- Confidence 4688999999999999966433221111 111111222333444444444444444444433321 1111 Q ss_pred EEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 285 AIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 285 ~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .+.+.+-+|.+++||+++++++..--- T Consensus 307 ~i~~~~a~G~g~lRPeaa~vv~~~~~~ 333 (334) T protein:vir:80 307 YLDTFQSYNIGQRRPDAVAVHDITVTN 333 (334) T ss_pred HHHHHHHcCCceeccceEEEEEEeeec Confidence 122234589999999998887744333 No 140 >protein:vir:95318 Length: 328 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512264;genbank:gi:89152431;genbank:GeneID:3952987 Probab=99.29 E-value=1.4e-12 Score=85.56 Aligned_cols=230 Identities=11% Similarity=0.018 Sum_probs=158.5 Q ss_pred CcccCCCceEc--------chhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEeecCcccccccccee Q lcl|Aclame:pro 1 MVALATGTFQL--------PKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVGEGAQKSESTATFA 71 (311) Q Consensus 1 mat~~~g~~~v--------P~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~ 71 (311) |++.+.+..++ |......|||.+.+.++|+...+.+...+++ ..+.+.++-|.+.|..=|+.+++++.++. T Consensus 1 m~~~~~~~~TL~e~Akr~~~d~~~~~VIE~l~~~n~IL~~lpf~e~n~gt~~~~~v~~~LP~~~fR~lN~g~~~s~~tt~ 80 (328) T protein:vir:95 1 MAVKGLTALTLADWGKRVDPNGKVDKIIELLGQTNPILQDMPFVEGNLPTGHRTTIRSGLPSATWRLLNYGVQPSKSTTV 80 (328) T ss_pred CCccccccccHHHHHhhhCcchhHHHHHHHHhccchhHhhcceeecccCCcceeeEeeccCCceeeecCCccCcccceeE Confidence 87776666543 4456778999999999999999998886443 77889999999999999999999999999 Q ss_pred EEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccc------------- Q lcl|Aclame:pro 72 PVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPA------------- 138 (311) Q Consensus 72 ~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~------------- 138 (311) +++...+-+++.+.+.+.+.+... ...++...-.....+++.+++..+||+|+.+.....+.|+.. T Consensus 81 q~t~~l~ilgg~~eVDr~la~~~G-n~~~~ra~q~~~~~ka~~~~~~~~~iyGdsa~~p~~F~GL~~R~~~~s~~~a~qi 159 (328) T protein:vir:95 81 QVTDSVGMLETYAEVDKSLADLNG-NTAEFRLSEDRAFIEAMNQQMAQTLFYGDSSVNPQQFMGLSSRYSSLSAGNAQNI 159 (328) T ss_pred EEEEEEEEEecceeechHHHhhcC-CHHHHHHHHHHHHHHHHHHHHHHHHhcCCccCChhhhcchhhhcCccccccccce Confidence 999999999999999998875543 334555556667889999999999999954322222211100 Q ss_pred ---------------------------------c---------------------------------ccc------cccc Q lcl|Aclame:pro 139 ---------------------------------K---------------------------------ILD------TTNI 146 (311) Q Consensus 139 ---------------------------------~---------------------------------~~~------~~~~ 146 (311) | +.+ -.|+ T Consensus 160 idaGgtg~~~TSi~~v~~g~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~g~~y~~y~~~~~w~~Gl~i~d~r~vvrI~NI 239 (328) T protein:vir:95 160 IDAGGTGTDNTSIWLVVWGENTVHGIFPKGKKAGIQMEDKGQVTLEDANGGKYEGYRTHYKWDNGLALRDWRYVVRIANI 239 (328) T ss_pred eecccCCCCceEEEEEEEcCCeEEEecccccccCceeeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEecC Confidence 0 000 0000 Q ss_pred eee---ccccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHh-hccCCceeeccccccCCCceecceeEEeecc Q lcl|Aclame:pro 147 VEL---TTGTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQ-RDSQGRKLYPELGFGTDVASFAGLNAAVSDT 222 (311) Q Consensus 147 ~~~---~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~l-kd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~ 222 (311) ... ......+..+.+..++.++.+......+|.||......|++. .+.....+-.....+.-+-.++|+||...++ T Consensus 240 d~~~l~~~~~~~~l~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~~~~~~~~g~~~t~~~gipir~~da 319 (328) T protein:vir:95 240 DVSNLSEPSSAANIAKLMVKALHRIPNRGMGRPVFYMNRTVGQALDLQSLEKTSLAISVKETEGEWWTSFRGVPIRETDA 319 (328) T ss_pred cccccccccChhhHHHHHHHHHHHhccCCCCcceeehhHHHHHHHHHHHhcCcceeeeeeccCCcceeEECCeEEEEEee Confidence 000 111222344455566666655555566799999999999875 4455544544445556677899999998888 Q ss_pred ccccccccc Q lcl|Aclame:pro 223 VRGGPEAVT 231 (311) Q Consensus 223 ~~~~~~~~~ 231 (311) +......+. T Consensus 320 i~~tE~~vv 328 (328) T protein:vir:95 320 LLETEARVV 328 (328) T ss_pred eecCccccC Confidence 864432221 No 141 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=99.26 E-value=2.5e-12 Score=84.20 Aligned_cols=295 Identities=12% Similarity=0.033 Sum_probs=162.8 Q ss_pred CcccC---------CC---ceEcchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEeecCccccccc Q lcl|Aclame:pro 1 MVALA---------TG---TFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVGEGAQKSEST 67 (311) Q Consensus 1 mat~~---------~g---~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~Eg~~~~~~~ 67 (311) |-+.. ++ ...| +.|+.+|.+.+...+.++++.++.++.+++ +.+|+. +..+++...-|++...+- T Consensus 1 ms~~~~~t~~~~~~s~~d~al~l-e~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~i-G~~~~~~~~pG~~l~~~~ 78 (335) T protein:vir:78 1 MSFLNDLTRPNYAGKNADVDIHL-EEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRL-GNVEAKGRRAGEELERSR 78 (335) T ss_pred CCccccccccccccccchhhhhh-hhhhhHHHHHHHHhhhhccccceeeeccceeEEEeee-eeeeecccccCcccCCCC Confidence 43322 11 1233 779999999999999999999988888765 788876 566677777676666555 Q ss_pred cceeEEEEeeeeEEEEEeecHHHhhc--CchhhHHHHHHHHHHHHHHHHHHHHHHhhh----ccc--CCCcccccccccc Q lcl|Aclame:pro 68 ATFAPVTAIPRKVQVTQRFSQEVKWA--DESRQLGVLQTMADLSGVALGRALDLIGIH----GIN--PLTGAALSGSPAK 139 (311) Q Consensus 68 ~~~~~v~l~~~kl~~~i~iS~ell~~--s~~~~~~~~~~i~~~la~~ia~~~d~~~l~----G~~--~~~g~~~~~~~~~ 139 (311) +.-++.++..-.+- +++.++-+ ..-+..|+.+.+.+++++++++..|++++. +.. +.....+ +...| T Consensus 79 ~~~~k~~itID~ll----~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~-~~~~G 153 (335) T protein:vir:78 79 VVNDKWNLTVDTLL----YLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLED-AFSPG 153 (335) T ss_pred cccCCeEEEeccee----echhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCC-CcCCC Confidence 55555444443221 33333322 123445788999999999999999998762 211 1111000 00011 Q ss_pred ccccccceeecc-ccccchHHHHHHHHHHHhhcCCC-----ccEEEEcHHHHHHHHHhhccCCc-eee---ccccccCCC Q lcl|Aclame:pro 140 ILDTTNIVELTT-GTSATPDLAVEAAVGLVLGDNLS-----PDGVALDNTFSFMLATQRDSQGR-KLY---PELGFGTDV 209 (311) Q Consensus 140 ~~~~~~~~~~~~-~~~~~~~~~i~~~~~~~~~~~~~-----~~~~v~n~~~~~~l~~lkd~~g~-~~~---~~~~~~~~~ 209 (311) ............ .......+.+..+...+...+.. .-..+++|..+..|.+-+.--.+ +.- ..+...+.. T Consensus 154 ~~~~~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g~v 233 (335) T protein:vir:78 154 VLEKLDLTGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMSVEYQATGATNDYVKSRV 233 (335) T ss_pred cceeeeeccccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhccccccccccccccccccccccee Confidence 111111110000 11112233445555555544442 23589999999998764322222 110 122344567 Q ss_pred ceecceeEEeecccccccccccccc---cccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEE Q lcl|Aclame:pro 210 ASFAGLNAAVSDTVRGGPEAVTAST---GVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAI 286 (311) Q Consensus 210 ~~l~G~pv~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ 286 (311) +.++|+||+.++.+|.+........ ..+.-......++|+.-+.+......+++.++.++.. .|.. .+ T Consensus 234 ~~v~Gv~V~~Sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~~------~~~~---~i 304 (335) T protein:vir:78 234 AILNGVKVLETPRFATKAISAHPLGRHFNVSAEEAERQIALFLPSKTLITAQVAPVQAKLWEDHD------QFSW---VL 304 (335) T ss_pred EEeeceEEEeeccCCCCCCccccccccCCcccccccceEEEEEecceEEEEEEEecccceeeccc------hhhH---hh Confidence 8999999999999997643322111 1111112223445555455555555555555543321 1221 23 Q ss_pred EEEEEeccEEecccceEEEE-ecccC Q lcl|Aclame:pro 287 RAEVVYGIGIMSTDAFAVVR-DADES 311 (311) Q Consensus 287 ra~~r~~~~v~~~~a~~~l~-~aa~~ 311 (311) .+.+-+|.+++||++.+.++ ...+| T Consensus 305 ~~~~a~G~g~lRPe~a~~i~~tg~~~ 330 (335) T protein:vir:78 305 DTFQMYNIGARRPDTAGAIELKGIEA 330 (335) T ss_pred hHHHHcCCcccCcceEEEEEecCCCc Confidence 33445999999999988887 34444 No 142 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=99.25 E-value=4e-12 Score=83.06 Aligned_cols=296 Identities=12% Similarity=0.008 Sum_probs=162.5 Q ss_pred CcccC---------CCc---eEcchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEeecCccccccc Q lcl|Aclame:pro 1 MVALA---------TGT---FQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVGEGAQKSEST 67 (311) Q Consensus 1 mat~~---------~g~---~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~Eg~~~~~~~ 67 (311) |-+.+ +++ ..| +.+..+|.+.+...+.++++.++.++.+++ +.+|+. +..+++...-|+++..+. T Consensus 1 ms~~~~~tr~~~~~s~~d~al~l-e~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~i-G~~~~~~~~pG~~l~~~~ 78 (335) T protein:vir:63 1 MSFLNDLTRPNYAGKNADVDIHL-EEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRL-GNVEAKGRRAGEELERSR 78 (335) T ss_pred CCCcccchhhhcccccchhheeh-hhhhhhHHHHHHhhhhhccccceeeeccceeEEEeee-eeeeeecccCCcCcCCCC Confidence 43332 111 233 789999999999999999999988888765 788886 566777777777666665 Q ss_pred cceeEEEEeeeeEEEEEeecHHHhhc--CchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccC-CCccccccccccccccc Q lcl|Aclame:pro 68 ATFAPVTAIPRKVQVTQRFSQEVKWA--DESRQLGVLQTMADLSGVALGRALDLIGIHGINP-LTGAALSGSPAKILDTT 144 (311) Q Consensus 68 ~~~~~v~l~~~kl~~~i~iS~ell~~--s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~-~~g~~~~~~~~~~~~~~ 144 (311) +..++.++..-.+- +++.++-+ ..-+..|+.+++.+++.+++++..|++++.-.-. ..-..+...+.+..++. T Consensus 79 ~~~~k~~itVD~ll----~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~~~~G~ 154 (335) T protein:vir:63 79 VVNDKWNLTVDTLL----YLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGV 154 (335) T ss_pred ccccceEEEeccee----echhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCCCcCCCc Confidence 55555555443321 23333321 2234467889999999999999999987622000 01111111111111111 Q ss_pred -cceeecccc----ccchHHHHHHHHHHHhhcCCC-----ccEEEEcHHHHHHHHHhhccCCc-eee---ccccccCCCc Q lcl|Aclame:pro 145 -NIVELTTGT----SATPDLAVEAAVGLVLGDNLS-----PDGVALDNTFSFMLATQRDSQGR-KLY---PELGFGTDVA 210 (311) Q Consensus 145 -~~~~~~~~~----~~~~~~~i~~~~~~~~~~~~~-----~~~~v~n~~~~~~l~~lkd~~g~-~~~---~~~~~~~~~~ 210 (311) .....+..+ .....+.+..+...+...+.. .-..+++|..+..|.+-+.--.+ +.- ..+...+... T Consensus 155 ~~~~~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g~v~ 234 (335) T protein:vir:63 155 LEKLDLTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQATGATNDYVKSRVA 234 (335) T ss_pred ceeeeeccCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhccccccccccccccccccccCceeE Confidence 111111111 111233455666677665554 23588999999988764322222 111 1223445678 Q ss_pred eecceeEEeecccccccccccccc---cccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEE Q lcl|Aclame:pro 211 SFAGLNAAVSDTVRGGPEAVTAST---GVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIR 287 (311) Q Consensus 211 ~l~G~pv~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~r 287 (311) +++|+||+.++.+|.......... ..+........++|.--+.+......+++.++.++.. .|.. .+. T Consensus 235 ~v~Gv~V~~sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~------~~~~---~i~ 305 (335) T protein:vir:63 235 ILNGVKVLETPRFATKAIAAHPLGRHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNE------KFSW---VLD 305 (335) T ss_pred EeeceEEEeeccCCCCCcccccccccCCccccccceeEEEEEecceEEEEEEeecccceeeccc------hhhH---HhH Confidence 899999999999997654332111 1111111122333333333334444444444433321 1221 223 Q ss_pred EEEEeccEEecccceEEEEec-ccC Q lcl|Aclame:pro 288 AEVVYGIGIMSTDAFAVVRDA-DES 311 (311) Q Consensus 288 a~~r~~~~v~~~~a~~~l~~a-a~~ 311 (311) +..-+|.+++||++.+.++.+ ..| T Consensus 306 ~~~a~G~g~lRPe~a~~i~~tg~~~ 330 (335) T protein:vir:63 306 TFQMYNIGARRPDTAGAIELKGIGA 330 (335) T ss_pred HHHHcCCcccccceEEEEEEcCCCc Confidence 334599999999999888853 222 No 143 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=99.16 E-value=2.3e-11 Score=78.92 Aligned_cols=300 Identities=13% Similarity=0.049 Sum_probs=162.2 Q ss_pred Cccc----------CC----Cce-----EcchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEeecC Q lcl|Aclame:pro 1 MVAL----------AT----GTF-----QLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVGEG 60 (311) Q Consensus 1 mat~----------~~----g~~-----~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~Eg 60 (311) |+.. ++ |+. +-=+.+..+|...++..+.++.+.++.++.+++ +++|+. +..++....-| T Consensus 1 ~~~~~~~~~~~~n~~t~~~~~~~~~~~al~le~f~geV~~~f~~~si~~~~~~~rti~~Gksv~f~~i-G~~t~~~~t~G 79 (375) T protein:vir:10 1 MANANQVALGRSNLSTGTGYGGATDKYALYLKLFSGEMFKGFQHETIARDLVTKRTLKNGKSLQFIYT-GRMTSSFHTPG 79 (375) T ss_pred CccccccccCccccCCccccccccchHHHHHHHHhHHHHHHHHHHHhhhccccccccccCceEEEEee-eeeEEeeecCC Confidence 2211 00 111 112678999999999999999999988887654 788887 45556555545 Q ss_pred ccccc---cccceeE--EEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhccc--CCCcccc Q lcl|Aclame:pro 61 AQKSE---STATFAP--VTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGIN--PLTGAAL 133 (311) Q Consensus 61 ~~~~~---~~~~~~~--v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~--~~~g~~~ 133 (311) +++.. .+....+ +++...|+.. ..|.+ + +..-+..|+...+.++.++++++..|+.++.--- .....+. T Consensus 80 ~~i~~~~~~d~~~te~~l~ID~~~y~~-~~VdD-i--D~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~~p~ 155 (375) T protein:vir:10 80 TPILGNADKAPPVAEKTIVMDDLLISS-AFVYD-L--DETLAHYELRGEISKKIGYALAEKYDRLIFRSITRGARSASPV 155 (375) T ss_pred cCcCCccccCCCCCceEEEecchhhhh-hhHhh-H--HHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc Confidence 44422 2222232 3333333222 11211 1 1122345788999999999999999998873210 0000000 Q ss_pred cccc------ccccccccceeeccccccchHHHHHHHHHHHhhcCCCcc--EEEEcHHHHHHHHHhhccCC----ceeec Q lcl|Aclame:pro 134 SGSP------AKILDTTNIVELTTGTSATPDLAVEAAVGLVLGDNLSPD--GVALDNTFSFMLATQRDSQG----RKLYP 201 (311) Q Consensus 134 ~~~~------~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~v~n~~~~~~l~~lkd~~g----~~~~~ 201 (311) .+.+ ..+...+........+....++.+.++...+...+.... ..+++|..+..|.+-+|.+. .+.-. T Consensus 156 ~~~~~~~~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~d~~~~~n~d~~~~ 235 (375) T protein:vir:10 156 SATNFVEPGGTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEKGVSSQGRCAVLNPRQYYALIQDIGSNGLVNRDVQGS 235 (375) T ss_pred ccccccccCcceeeeccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeChHHHHHHHhcCCccceeeeccccc Confidence 0000 001111111111112233457778888888877776532 47889999988876555331 11111 Q ss_pred cccccCCCceecceeEEeeccccccccccccc-----------------------------cccccccc---ccceEEEe Q lcl|Aclame:pro 202 ELGFGTDVASFAGLNAAVSDTVRGGPEAVTAS-----------------------------TGVYRTTN---PNVKAIAG 249 (311) Q Consensus 202 ~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~-----------------------------~~~~~~~~---~~~~~~~g 249 (311) .....+..+++.|++|+.++.+|......... +..+.... .+...++. T Consensus 236 ~~~~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~~~~~~~~ 315 (375) T protein:vir:10 236 ALQSGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAKSCGLIF 315 (375) T ss_pred ceeccceEEEEeceEEEEeccccccccccccccccccccchhhhhccccccCCcceeeccccccccccccccCceEEEEE Confidence 11223345789999999999999654311000 00000000 22333444 Q ss_pred ecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 250 DFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 250 d~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ..+........++++++++.. .+ -.+-.-.+.+.+=+|..++||++.+.|+..+.+ T Consensus 316 ~~~A~g~v~~~~~~~~~~~~~-~~-----~~~q~~~i~~~~a~G~~~lrp~~av~l~~~~~~ 371 (375) T protein:vir:10 316 QKEAAGVVEAIGPQVQVTNGD-VS-----VIYQGDVILGRMAMGADYLNPAAAVELYIGATA 371 (375) T ss_pred chhheeeeeeeccccccccch-hh-----heeeeeeeeeeeeeccCccCceeEEEEecCcCc Confidence 444444445555566554310 01 112223466778899999999999999876555 No 144 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=99.15 E-value=1.4e-11 Score=80.14 Aligned_cols=292 Identities=12% Similarity=0.008 Sum_probs=159.4 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhh-chhhhhcceeecCCCceEEEEEeCCceeEEe---------ecCc-cccccc-- Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQ-SVLARLSMAEPQEFGEQQYMTLTAPPRGEVV---------GEGA-QKSEST-- 67 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~-s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v---------~Eg~-~~~~~~-- 67 (311) |++ +-....| +++..++....++. +.|++-++... ..+....+..-+...+.-+ +.+. +.|... T Consensus 13 Ms~-~i~~~fv-~qy~~~v~~~~qq~~s~L~~tV~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~dtp~~~~~ 89 (322) T protein:vir:10 13 IAG-DIDQAFV-QTYETTLRILSQQKSAKLKQYCQHKN-ESSESHNWETLASMDPDAVKRKRSRQQSADGTYPTPVNNKP 89 (322) T ss_pred eec-hhhhHHH-HHHHHHHHHHHHHhhhhhhccccccc-ccccccceeecccccccccccccccccccCcccCCCccccc Confidence 333 3333444 56666776666544 44555444222 2222111111111111111 1221 233322 Q ss_pred cceeEEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccce Q lcl|Aclame:pro 68 ATFAPVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIV 147 (311) Q Consensus 68 ~~~~~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~ 147 (311) -....+.+..+..+ ..|.+.-. .....|..+...+..+.+++|+.|+.++.+-- +....+.+.......... T Consensus 90 ~~~r~~~~~d~~~~--~~VDd~D~---~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~---g~a~~~~~gt~v~~~ss~ 161 (322) T protein:vir:10 90 FAKRRTNVDTYDTG--HVVEQEDI---SQMLLDPNSALITSQAYAMARKTDDLIIAGAW---KPASIKGTGQPVEFLATQ 161 (322) T ss_pred cceEEEeecccccc--eecchHHH---HHhhcCchHHHHHHHHHHhhhHHHHHHHhhhh---ccccccccccccccCCCc Confidence 33445555555443 45544422 12334567778889999999999998886521 111111111111111122 Q ss_pred eeccccccchHHHHHHHHHHHhhcCCCcc--E-EEEcHHHHHHHHHhhcc-CCceeecccc-ccCCCceecceeEEeecc Q lcl|Aclame:pro 148 ELTTGTSATPDLAVEAAVGLVLGDNLSPD--G-VALDNTFSFMLATQRDS-QGRKLYPELG-FGTDVASFAGLNAAVSDT 222 (311) Q Consensus 148 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~-~v~n~~~~~~l~~lkd~-~g~~~~~~~~-~~~~~~~l~G~pv~~~~~ 222 (311) ....+.....++.+..+...+..++..+. . ++.+|..+..|.+...- +..+.-.... ..+..++++|+.+..++. T Consensus 162 ~i~~g~~g~t~~kl~~a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~d~~~ts~D~~~~~~l~~~G~ig~~lGf~~i~s~~ 241 (322) T protein:vir:10 162 EIGDGTKPISFDYVTEITERFLENEIEPEVSKVIVIGPTQARKLLQITEATSADYTSAMDLQSKGIITNWMGYTWIVSTR 241 (322) T ss_pred ccccCccchhHHHHHHHHHHHHhcCCCCCCCeEEEeCHHHHHHHhcchhhhhhhcccchhhhhcCeeeeeeeEEEEEecc Confidence 23334445667788888888888777753 3 77789999888664432 2233322222 345678999999999999 Q ss_pred cccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccce Q lcl|Aclame:pro 223 VRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAF 302 (311) Q Consensus 223 ~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~ 302 (311) +|.....-................+++-.+.+.+....+++.++....+. .+...+++.+-+|.++++|+.+ T Consensus 242 lp~~~~t~~~~~~~~~~~~~~~~~~a~~k~Av~~a~~~dv~~~i~~~~~~--------~~a~~I~~~~~~Ga~ri~~~gV 313 (322) T protein:vir:10 242 LDKFDPTQWGMAAEDGPQGDEIWCIAMTDMALGYHSCKDIWTKVAEDPSA--------SFAWRIYSAFTADCVRVEDEHI 313 (322) T ss_pred CCccccccccccccCCCCccceeEEEEecCceeEEEeeeeeEEeeccCCc--------chhhhhhhhhhhCceEeccCcE Confidence 99543322222112222233335667777888888888877776543321 1123355667899999999999 Q ss_pred EEEEecccC Q lcl|Aclame:pro 303 AVVRDADES 311 (311) Q Consensus 303 ~~l~~aa~~ 311 (311) +.|.-..|= T Consensus 314 v~i~~~e~~ 322 (322) T protein:vir:10 314 FKLRLKNSL 322 (322) T ss_pred EEEEEeccC Confidence 999874433 No 145 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=99.13 E-value=1e-11 Score=80.84 Aligned_cols=297 Identities=12% Similarity=-0.021 Sum_probs=152.5 Q ss_pred Cccc----------CCCceEcc-hhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEeecCcccccccc Q lcl|Aclame:pro 1 MVAL----------ATGTFQLP-KHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVGEGAQKSESTA 68 (311) Q Consensus 1 mat~----------~~g~~~vP-~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~Eg~~~~~~~~ 68 (311) |... +....-+. +.+..+|.+.....+..+++.++.++.+++ +++|+. +..+++...-|++...+.+ T Consensus 1 Ms~~n~~t~~~~~~s~~~~al~le~f~geV~taF~~~si~~~~~~vrti~~GkS~qf~~i-G~~~a~y~~~G~~ldg~~~ 79 (402) T protein:vir:97 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPNATPT 79 (402) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEEEEE-eeeEEeeeccccccCCCCc Confidence 4411 11112233 778999999999999999999988888665 788887 4556666555555444445 Q ss_pred ceeEEEEeeeeEEEEEeecHHHhhcCc--hhhHH-HHHHHHHHHHHHHHHHHHHHhhhcc---cCCCccccccccccccc Q lcl|Aclame:pro 69 TFAPVTAIPRKVQVTQRFSQEVKWADE--SRQLG-VLQTMADLSGVALGRALDLIGIHGI---NPLTGAALSGSPAKILD 142 (311) Q Consensus 69 ~~~~v~l~~~kl~~~i~iS~ell~~s~--~~~~~-~~~~i~~~la~~ia~~~d~~~l~G~---~~~~g~~~~~~~~~~~~ 142 (311) .-++.++..-.+- +++.++-+-+ -+.+| +.+.+.+++.+++++..|+.+|.-. +......+...+.+... T Consensus 80 ~~~k~~ItID~lL----~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~~~~~~ 155 (402) T protein:vir:97 80 QADKNQLVIDTTV----IARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGH 155 (402) T ss_pred ccccEEEEeCcee----echhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCccccc Confidence 5455444432221 3333332212 22344 5678999999999999999886311 00000111111111111 Q ss_pred cccceeecc-----ccccchHHHHHHHHHHHhhcCCCcc--EEEEcHHHHHHHHHhhccCC-ceee--ccccccCCCcee Q lcl|Aclame:pro 143 TTNIVELTT-----GTSATPDLAVEAAVGLVLGDNLSPD--GVALDNTFSFMLATQRDSQG-RKLY--PELGFGTDVASF 212 (311) Q Consensus 143 ~~~~~~~~~-----~~~~~~~~~i~~~~~~~~~~~~~~~--~~v~n~~~~~~l~~lkd~~g-~~~~--~~~~~~~~~~~l 212 (311) ++......+ .+.....+.+.++...+...+.... ..+++|..+..|.+-.+--. .+.. ......+..+++ T Consensus 156 g~s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~dRv~vv~P~~y~~Ll~~~rl~n~d~~~~~~g~~~~G~v~~v 235 (402) T protein:vir:97 156 GFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSS 235 (402) T ss_pred ccccccccccchhhcCHHHHHHHHHHHHHHHHhcCCCccccEEEeChHHHHHHhhcccccchhhccccCCccccceeEEE Confidence 111111111 1112223445566666665555543 58889999998876322111 1111 112335556889 Q ss_pred cceeEEeecccccccccccccc-------cccc--cccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCc Q lcl|Aclame:pro 213 AGLNAAVSDTVRGGPEAVTAST-------GVYR--TTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQ 283 (311) Q Consensus 213 ~G~pv~~~~~~~~~~~~~~~~~-------~~~~--~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~ 283 (311) .|+||+.++.+|.......... ..+. ........++.-.+.+......+++.++.++... |..= T Consensus 236 ~Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~vT~~~~~d~r~------~~~~- 308 (402) T protein:vir:97 236 YNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKE------KTYY- 308 (402) T ss_pred eceEEEecCccccccccccccccccCCCCccCCcCcccceeEEEEEecceEEEEEeeccccchhhchhH------HHHH- Confidence 9999999999996432221111 1111 1112223333333333334444444443322211 1111 Q ss_pred EEEEEEEEeccEEecccceEEEEecc---cC Q lcl|Aclame:pro 284 IAIRAEVVYGIGIMSTDAFAVVRDAD---ES 311 (311) Q Consensus 284 v~~ra~~r~~~~v~~~~a~~~l~~aa---~~ 311 (311) +-+.+-+|..++||++..++.-+- .+ T Consensus 309 --id~~~a~G~g~~RPeaa~vv~~~~~~t~~ 337 (402) T protein:vir:97 309 --IDTFMAEGAIPDRWEAVSVVTTKRDATTG 337 (402) T ss_pred --HHHHHHhCCcccCccceEEEEEecccccc Confidence 122345899999999988774322 22 No 146 >protein:vir:107826 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996627;genbank:gi:45580761;genbank:GeneID:2767902 Probab=99.13 E-value=2.2e-11 Score=79.03 Aligned_cols=230 Identities=11% Similarity=0.030 Sum_probs=153.7 Q ss_pred CcccCCCceEcchh---------HHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEeecCccccccccce Q lcl|Aclame:pro 1 MVALATGTFQLPKH---------LVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVGEGAQKSESTATF 70 (311) Q Consensus 1 mat~~~g~~~vP~~---------~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~Eg~~~~~~~~~~ 70 (311) |.+.+.+-.++.+. +...|+|.+.+.++|+...+.+...++. ..+.+.++-|.+.|..=|+.+++++.++ T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~lN~g~~~s~~tt 80 (331) T protein:vir:10 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKSRT 80 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhccCCccCccccee Confidence 88887666544433 3457999999999999999987655443 3456778899999999999999999999 Q ss_pred eEEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccc------------ Q lcl|Aclame:pro 71 APVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPA------------ 138 (311) Q Consensus 71 ~~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~------------ 138 (311) .+++...+-+++.+.|.+.+.+... ...++.........+++.+++..++|+|+.+.....+.|+.. T Consensus 81 ~q~t~~l~ilgg~~eVDk~la~~~G-n~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~~~q 159 (331) T protein:vir:10 81 VQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQN 159 (331) T ss_pred EEEEEEEEEeccceeechHHHhhcC-CHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhccccccccccc Confidence 9999999999999999998876543 344555666677889999999999999964321111111100 Q ss_pred ----------------------------------c---------------------------------ccc------ccc Q lcl|Aclame:pro 139 ----------------------------------K---------------------------------ILD------TTN 145 (311) Q Consensus 139 ----------------------------------~---------------------------------~~~------~~~ 145 (311) | +.+ -.| T Consensus 160 ~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~N 239 (331) T protein:vir:10 160 IIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIAN 239 (331) T ss_pred eeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEec Confidence 0 000 000 Q ss_pred cee----eccccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHh-hccCC-ceeeccccccCCCceecceeEEe Q lcl|Aclame:pro 146 IVE----LTTGTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQ-RDSQG-RKLYPELGFGTDVASFAGLNAAV 219 (311) Q Consensus 146 ~~~----~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~l-kd~~g-~~~~~~~~~~~~~~~l~G~pv~~ 219 (311) +.. ..+.+..+..+.+..+..++.+.+....+|.||......|++. .+... +.+-.+...+...-.+.|+||.. T Consensus 240 Idvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipir~ 319 (331) T protein:vir:10 240 VDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRR 319 (331) T ss_pred cchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCeeEEE Confidence 000 0011112233445555555555555556799999999999875 44433 33545555666678899999998 Q ss_pred eccccccccccc Q lcl|Aclame:pro 220 SDTVRGGPEAVT 231 (311) Q Consensus 220 ~~~~~~~~~~~~ 231 (311) .+++......+. T Consensus 320 ~dai~~tE~~Vv 331 (331) T protein:vir:10 320 TDALLLTEARVV 331 (331) T ss_pred eeeeecCccccC Confidence 888765322211 No 147 >protein:vir:107388 Length: 331 # NCBI annotation: Bbp17 # Family: family:all:1903 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958686;genbank:gi:41179378;genbank:GeneID:2717182 Probab=99.13 E-value=2.2e-11 Score=79.03 Aligned_cols=230 Identities=11% Similarity=0.030 Sum_probs=153.7 Q ss_pred CcccCCCceEcchh---------HHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEeecCccccccccce Q lcl|Aclame:pro 1 MVALATGTFQLPKH---------LVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVGEGAQKSESTATF 70 (311) Q Consensus 1 mat~~~g~~~vP~~---------~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~Eg~~~~~~~~~~ 70 (311) |.+.+.+-.++.+. +...|+|.+.+.++|+...+.+...++. ..+.+.++-|.+.|..=|+.+++++.++ T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~lN~g~~~s~~tt 80 (331) T protein:vir:10 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKSRT 80 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhccCCccCccccee Confidence 88887666544433 3457999999999999999987655443 3456778899999999999999999999 Q ss_pred eEEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccc------------ Q lcl|Aclame:pro 71 APVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPA------------ 138 (311) Q Consensus 71 ~~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~------------ 138 (311) .+++...+-+++.+.|.+.+.+... ...++.........+++.+++..++|+|+.+.....+.|+.. T Consensus 81 ~q~t~~l~ilgg~~eVDk~la~~~G-n~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~~~q 159 (331) T protein:vir:10 81 VQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQN 159 (331) T ss_pred EEEEEEEEEeccceeechHHHhhcC-CHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhccccccccccc Confidence 9999999999999999998876543 344555666677889999999999999964321111111100 Q ss_pred ----------------------------------c---------------------------------ccc------ccc Q lcl|Aclame:pro 139 ----------------------------------K---------------------------------ILD------TTN 145 (311) Q Consensus 139 ----------------------------------~---------------------------------~~~------~~~ 145 (311) | +.+ -.| T Consensus 160 ~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~N 239 (331) T protein:vir:10 160 IIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIAN 239 (331) T ss_pred eeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEec Confidence 0 000 000 Q ss_pred cee----eccccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHh-hccCC-ceeeccccccCCCceecceeEEe Q lcl|Aclame:pro 146 IVE----LTTGTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQ-RDSQG-RKLYPELGFGTDVASFAGLNAAV 219 (311) Q Consensus 146 ~~~----~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~l-kd~~g-~~~~~~~~~~~~~~~l~G~pv~~ 219 (311) +.. ..+.+..+..+.+..+..++.+.+....+|.||......|++. .+... +.+-.+...+...-.+.|+||.. T Consensus 240 Idvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipir~ 319 (331) T protein:vir:10 240 VDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRR 319 (331) T ss_pred cchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCeeEEE Confidence 000 0011112233445555555555555556799999999999875 44433 33545555666678899999998 Q ss_pred eccccccccccc Q lcl|Aclame:pro 220 SDTVRGGPEAVT 231 (311) Q Consensus 220 ~~~~~~~~~~~~ 231 (311) .+++......+. T Consensus 320 ~dai~~tE~~Vv 331 (331) T protein:vir:10 320 TDALLLTEARVV 331 (331) T ss_pred eeeeecCccccC Confidence 888765322211 No 148 >protein:vir:98525 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996579;genbank:gi:45569510;genbank:GeneID:2767853 Probab=99.13 E-value=2.2e-11 Score=79.03 Aligned_cols=230 Identities=11% Similarity=0.030 Sum_probs=153.7 Q ss_pred CcccCCCceEcchh---------HHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEeecCccccccccce Q lcl|Aclame:pro 1 MVALATGTFQLPKH---------LVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVGEGAQKSESTATF 70 (311) Q Consensus 1 mat~~~g~~~vP~~---------~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~Eg~~~~~~~~~~ 70 (311) |.+.+.+-.++.+. +...|+|.+.+.++|+...+.+...++. ..+.+.++-|.+.|..=|+.+++++.++ T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~lN~g~~~s~~tt 80 (331) T protein:vir:98 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKSRT 80 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhccCCccCccccee Confidence 88887666544433 3457999999999999999987655443 3456778899999999999999999999 Q ss_pred eEEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccc------------ Q lcl|Aclame:pro 71 APVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPA------------ 138 (311) Q Consensus 71 ~~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~------------ 138 (311) .+++...+-+++.+.|.+.+.+... ...++.........+++.+++..++|+|+.+.....+.|+.. T Consensus 81 ~q~t~~l~ilgg~~eVDk~la~~~G-n~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~~~q 159 (331) T protein:vir:98 81 VQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQN 159 (331) T ss_pred EEEEEEEEEeccceeechHHHhhcC-CHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhccccccccccc Confidence 9999999999999999998876543 344555666677889999999999999964321111111100 Q ss_pred ----------------------------------c---------------------------------ccc------ccc Q lcl|Aclame:pro 139 ----------------------------------K---------------------------------ILD------TTN 145 (311) Q Consensus 139 ----------------------------------~---------------------------------~~~------~~~ 145 (311) | +.+ -.| T Consensus 160 ~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~N 239 (331) T protein:vir:98 160 IIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIAN 239 (331) T ss_pred eeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEec Confidence 0 000 000 Q ss_pred cee----eccccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHh-hccCC-ceeeccccccCCCceecceeEEe Q lcl|Aclame:pro 146 IVE----LTTGTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQ-RDSQG-RKLYPELGFGTDVASFAGLNAAV 219 (311) Q Consensus 146 ~~~----~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~l-kd~~g-~~~~~~~~~~~~~~~l~G~pv~~ 219 (311) +.. ..+.+..+..+.+..+..++.+.+....+|.||......|++. .+... +.+-.+...+...-.+.|+||.. T Consensus 240 Idvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipir~ 319 (331) T protein:vir:98 240 VDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRR 319 (331) T ss_pred cchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCeeEEE Confidence 000 0011112233445555555555555556799999999999875 44433 33545555666678899999998 Q ss_pred eccccccccccc Q lcl|Aclame:pro 220 SDTVRGGPEAVT 231 (311) Q Consensus 220 ~~~~~~~~~~~~ 231 (311) .+++......+. T Consensus 320 ~dai~~tE~~Vv 331 (331) T protein:vir:98 320 TDALLLTEARVV 331 (331) T ss_pred eeeeecCccccC Confidence 888765322211 No 149 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=99.10 E-value=3.2e-11 Score=78.15 Aligned_cols=281 Identities=12% Similarity=0.035 Sum_probs=154.0 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhh---------hcceeecCCCceEEEEEeCC-ceeEEeecCccccccccce Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLAR---------LSMAEPQEFGEQQYMTLTAP-PRGEVVGEGAQKSESTATF 70 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~---------l~~~~~~~~~~~~~p~~~~~-~~a~~v~Eg~~~~~~~~~~ 70 (311) ||++.-...++|+.+..-+.+...+.+.+.+ +.....-++..+++|..... .++.-+.|+..++..+.+- T Consensus 1 MA~T~lsd~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~Gd~~~~~~~~~i~~~kitt 80 (351) T protein:vir:15 1 MAETHLSDLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLTGDPDNWTDSDDIDVNNLTS 80 (351) T ss_pred CCceeeeeeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCcccchheecc Confidence 9988778999999997777676655555433 11222234556899998753 5778889999999888877 Q ss_pred eEEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeec Q lcl|Aclame:pro 71 APVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELT 150 (311) Q Consensus 71 ~~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~ 150 (311) ++-....++.+.-..++++....+ .-|..+.+.++++...+++.++.+|.--..-.+.... ......+.+ ..+ T Consensus 81 ~~~~a~i~~~~kg~~~tD~a~~~s---g~dp~~~i~~q~a~~w~~~~q~~lla~l~gv~~~~~~-~~~~~~d~t---~~~ 153 (351) T protein:vir:15 81 GKQQGIKFYQTKAYGYTDLGTMIS---GAPVQETIGNRFAAFWQRADQKTLLSVLKGVMGVTKI-ANSKVYDQT---KVS 153 (351) T ss_pred cceeEEEEeeccceehhhhhHhhc---cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhchhh-cccceeccc---ccc Confidence 666666666666677877643222 3367788999999999999988877421000000000 000011111 111 Q ss_pred cccccchHHHHHHHHHHHhhcCCC-ccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccccccccc Q lcl|Aclame:pro 151 TGTSATPDLAVEAAVGLVLGDNLS-PDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEA 229 (311) Q Consensus 151 ~~~~~~~~~~i~~~~~~~~~~~~~-~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~ 229 (311) .......++.+.++..++-+.... -.+|+||+..+..|++..--+ ++ .....+...++++|++|++++.+|..... T Consensus 154 ~~~~~is~~~l~~A~~~~GD~~~~~~~~ivmhS~v~~~L~~~~li~--~~-~~s~~~~~i~t~~G~~VivdD~~p~~~~~ 230 (351) T protein:vir:15 154 PSEPMFGAKGFTGAIGLMGDLQDTAFGAIAVNSATYSLMKVQGLIE--TI-QPQNGATPFEAYNGLRIVLDDDIEIDLTD 230 (351) T ss_pred ccccccCHHHHHHHHHHhccccccceEEEEEChHHHHHHHhhhhhh--hc-cccccCcccceecceEEEEcCCCccccCC Confidence 122234467788888888665443 478999999999998643100 00 01111233588999999999999854322 Q ss_pred ccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEe-- Q lcl|Aclame:pro 230 VTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRD-- 307 (311) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~-- 307 (311) .... .+ ..+++|. ..+.+... ...+++.|+.... .++-.+..+.+ .++||..+..-+. T Consensus 231 ~~~~--~y------tsyl~~~-GAi~~~~~-~~~ve~~rd~~~~-------~g~d~l~~r~~---~~~hp~G~s~~~~~~ 290 (351) T protein:vir:15 231 KTKP--VS------TSYIFAP-GAVRYSTN-MRSTETKYDPLIN-------GGQDVIVQKRV---GTIHVAGTSIKASFS 290 (351) T ss_pred CCCc--ee------EEEEEec-ceeeeecC-CcCcceeecccCC-------CCceEEEEeee---eeeeeeeeeeccccc Confidence 1110 00 0122222 11222222 2234444443211 11111111222 3466666554321 Q ss_pred --cccC Q lcl|Aclame:pro 308 --ADES 311 (311) Q Consensus 308 --aa~~ 311 (311) +..+ T Consensus 291 ~~~~~s 296 (351) T protein:vir:15 291 PSKASF 296 (351) T ss_pred ccCcCC Confidence 1111 No 150 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=99.10 E-value=4.3e-12 Score=82.89 Aligned_cols=259 Identities=13% Similarity=0.036 Sum_probs=156.2 Q ss_pred CcccCCCce--EcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcee-------EEeecCcccccccccee Q lcl|Aclame:pro 1 MVALATGTF--QLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRG-------EVVGEGAQKSESTATFA 71 (311) Q Consensus 1 mat~~~g~~--~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a-------~~v~Eg~~~~~~~~~~~ 71 (311) ....++|.. .||++|....|+.+.+..++..+....|.++.++.||+.+..+.. +.-.||...+..+.+|+ T Consensus 131 ~~~~~Tgd~~~~i~~~~v~d~i~li~q~r~i~slf~tLP~~g~T~eY~v~t~~~tV~~q~~~~kqa~EGd~L~~gKl~~~ 210 (410) T protein:vir:83 131 ADHQKTGDLQGVIPDPIVGPVIDFIDSARPLVSTLGTLPLNNATFYRPIVSQRPAVGLQGVAGGASDEKTELDSQKMVID 210 (410) T ss_pred hccCcccccccccchhHhhhHHHHHhhccchhhhhhhCCCCCCeeEEeeecccccccccccccccccccccccccceeee Confidence 223344443 578889999999999999999999999999888999887665543 23458999999999999 Q ss_pred EEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHH---HhhhcccCCCcccccccccccccccccee Q lcl|Aclame:pro 72 PVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDL---IGIHGINPLTGAALSGSPAKILDTTNIVE 148 (311) Q Consensus 72 ~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~---~~l~G~~~~~g~~~~~~~~~~~~~~~~~~ 148 (311) ..+...++++++..+|++.+..|....++ ...+.|..+.+++-+. ++|.++- +...+..+ T Consensus 211 t~tA~ikTyGGyt~LSRQ~IERs~v~~L~---~~lraL~~AYA~atea~vra~L~~t~-----------t~~~a~~~--- 273 (410) T protein:vir:83 211 RLTVNAKTLGGYVNVSRQAIDFSSPSALD---LVVNGLGQQYAIETEALVGAALASTS-----------TGAVGYGN--- 273 (410) T ss_pred eccceeehhcCcccccceeeecCChhhHH---HHHHHHHHHHHHHHHHHHHHHHHHhh-----------hhhhhhhh--- Confidence 99999999999999999999777655444 3444444444443333 3443321 00111110 Q ss_pred eccccccchHHHHHHHHHHHhhc--CCCccEEEEcHHHHHHHHHhhccCCceeeccc------c-ccCCCceecceeEEe Q lcl|Aclame:pro 149 LTTGTSATPDLAVEAAVGLVLGD--NLSPDGVALDNTFSFMLATQRDSQGRKLYPEL------G-FGTDVASFAGLNAAV 219 (311) Q Consensus 149 ~~~~~~~~~~~~i~~~~~~~~~~--~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~------~-~~~~~~~l~G~pv~~ 219 (311) . +.+ .....+.++..++.++ +..-..+.++|..+..+.++- ..+++.|.+. + ..+-.+.+.++||++ T Consensus 274 ~-Tad--~~~~~i~da~~~v~da~~~~~~~~i~vS~DVl~~~~~~f-~~~~~~~~dt~Gfg~~~lg~gi~G~~~~ipVvm 349 (410) T protein:vir:83 274 A-TAD--NVASAIWQAAGAVYTAVKGMGRLVIAIAPDVLGDFGPLF-APVNPTNAHSTGFEAGRFGQGVMGSISGIPVVM 349 (410) T ss_pred c-cHH--HHHHHHHHHHHHHhhhhccceeeeEEechhhhhhcccee-eccCCCCcccccccccccccchhhhhcccceEE Confidence 0 111 1111233455555554 444446888998876665432 2222222111 1 022457899999998 Q ss_pred ecccccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecc Q lcl|Aclame:pro 220 SDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMST 299 (311) Q Consensus 220 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~ 299 (311) ....+.+ .+.|-|-..+..-.+..-.++... .+..++.+-|. -|+.+.+..+ T Consensus 350 ~~~a~Ag------------------TA~f~~~~Ai~~~eS~~gp~qL~d-~~i~nLt~~yS---------gY~a~a~~~~ 401 (410) T protein:vir:83 350 SAALGSG------------------DAYLFSTAAIECFEQRVGTLQVVE-PSVFGLQVAYA---------GYFSTLVVNE 401 (410) T ss_pred ecCCCcC------------------eeeEeccceeeeeecCCceeEeeC-Cchhhhhhhhe---------eeeeeccccc Confidence 7776655 344445444443333332333332 22333333232 4667888888 Q ss_pred cceEEEEec Q lcl|Aclame:pro 300 DAFAVVRDA 308 (311) Q Consensus 300 ~a~~~l~~a 308 (311) +++.=|... T Consensus 402 ~gliPv~g~ 410 (410) T protein:vir:83 402 DAIVPLVGS 410 (410) T ss_pred cceeeeccC Confidence 888877766 No 151 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=99.09 E-value=3.5e-12 Score=83.37 Aligned_cols=267 Identities=11% Similarity=-0.044 Sum_probs=150.2 Q ss_pred Cccc--CCCceEcchhH---HHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEeecCcccccccccee--- Q lcl|Aclame:pro 1 MVAL--ATGTFQLPKHL---VPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVGEGAQKSESTATFA--- 71 (311) Q Consensus 1 mat~--~~g~~~vP~~~---~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~--- 71 (311) ||.. +...-++|.+. .+.+-.-+.+...++...+.+|+..|+ +++|+......+.-|+||++||-++.+.+ T Consensus 1 mAe~nlt~~~dL~~~~sidfv~~f~~~i~~L~~~Lgi~r~~p~a~G~tIt~pK~~~tgda~dVaEGe~Iplskvt~~~~~ 80 (295) T protein:vir:99 1 MAEKNLNTMADLGDIKSIDFVNKFSKNINDLLKLLGVTRRETLTNDLKIQTYKWEVTLDQTDPGEGETIPLSKVTRTKDK 80 (295) T ss_pred CCCcccccHhhccCceeehhhHHhhhhHHHHHHHhccccccccccCCeEEeeeeeeecccccccCCcccchhhheeeeee Confidence 8843 22334444332 223322222333344444778888765 89999998899999999999999998875 Q ss_pred EEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeecc Q lcl|Aclame:pro 72 PVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTT 151 (311) Q Consensus 72 ~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~ 151 (311) ..+++.+|.+.- +|.|.++.+- .-+-..+-.++|..+|++++|..|+.-- ..++. +... T Consensus 81 t~t~kikK~rK~--tTdEAIqlsG--ygdpvgead~qL~~~ia~kId~D~~~~l---------------ktat~--t~tg 139 (295) T protein:vir:99 81 DYTVKWFKKRRA--TTAEAIARHG--AARAITEADKRIMRELQNGIKDAFFTFL---------------KTKPT--KVKG 139 (295) T ss_pred eeEEEeeeeccc--ccHHHHHhcC--CCchhHHHHHHHHHHHHHhhhHHHHHHh---------------ccCce--eeeh Confidence 477777887774 4999986552 2234577889999999999999998531 00111 1111 Q ss_pred ccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccC--CceeeccccccCC-Cceeccee-EEeeccccccc Q lcl|Aclame:pro 152 GTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQ--GRKLYPELGFGTD-VASFAGLN-AAVSDTVRGGP 227 (311) Q Consensus 152 ~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~--g~~~~~~~~~~~~-~~~l~G~p-v~~~~~~~~~~ 227 (311) ......++.+......+...+..+.+.++||.+...+++-..-+ ....+ +.+ --.++|.. ++.+..+|.+. T Consensus 140 ~~lq~a~a~~~~al~~f~Ee~~~~~V~FVnP~D~a~yl~~A~~~~~~a~~f-----G~~~L~nfLG~q~II~S~kv~~G~ 214 (295) T protein:vir:99 140 VGLQKALSASWAKLATFNEFEGSPLVSFVSPLDVANYLGDTKVGADASNVF-----GMTLLKNFLGMQNVIVMPSVPEGK 214 (295) T ss_pred hhHHHHHHHhhhhhhhcccccCCceEEEEehHHHHHHHhccccccchhhhh-----hhhhhhhhhccceEEEcccCCCce Confidence 11122344444445554444445668999999999887632211 11111 111 12489996 99999999999 Q ss_pred ccccccccccccccccceEEEeecceEEE---------EeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEec Q lcl|Aclame:pro 228 EAVTASTGVYRTTNPNVKAIAGDFSAFRW---------GVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMS 298 (311) Q Consensus 228 ~~~~~~~~~~~~~~~~~~~~~gd~~~~~~---------~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~ 298 (311) .+.+....+.....+ +-.||+++... ++......+...+. .++-.+.+.| .-+ T Consensus 215 ~~aT~~~Ni~~ay~~---~~~g~l~~~f~~~~D~tglIg~~h~~~~~~~t~e------t~~~~~~~lf---------pE~ 276 (295) T protein:vir:99 215 IYSTAVENLVFASLN---VKGGDLGGLFADFTDETGLIAAARNRQLSNLTYE------SVFFGANVLF---------AEI 276 (295) T ss_pred EEEeeccceEEEEec---CCchhhhhhhhhccCcccceEEEeccccceeeeh------hhhHhHHHhc---------ccc Confidence 888776554432221 11233332211 11111111110000 0111111111 334 Q ss_pred ccceEEEEecccC Q lcl|Aclame:pro 299 TDAFAVVRDADES 311 (311) Q Consensus 299 ~~a~~~l~~aa~~ 311 (311) ++++++.+..++. T Consensus 277 ~dgiv~~tI~~~~ 289 (295) T protein:vir:99 277 PEGVVEATIEAAA 289 (295) T ss_pred cceEEEEEEecCc Confidence 6788888876655 No 152 >protein:vir:103759 Length: 330 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024928;genbank:gi:48697198;genbank:GeneID:2846083 Probab=99.07 E-value=2.7e-11 Score=78.52 Aligned_cols=230 Identities=13% Similarity=0.031 Sum_probs=150.7 Q ss_pred CcccCCCce--------EcchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEeecCcccccccccee Q lcl|Aclame:pro 1 MVALATGTF--------QLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVGEGAQKSESTATFA 71 (311) Q Consensus 1 mat~~~g~~--------~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~ 71 (311) |++.+.+.. +-|......|+|.+.+.++|++..+.....+.. -...+.++-|.+.|..=|+.+++++.++. T Consensus 1 m~~~~~~a~TL~e~AKr~~~d~~~~~IIE~l~~tn~IL~~lpf~e~N~~tg~~t~vrt~LP~~~fR~lN~g~~~s~~tt~ 80 (330) T protein:vir:10 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKSSTA 80 (330) T ss_pred CCcCCCCcccHHHHHhhcCcchhHHHHHHHHhcCchHHhhcchhhccCCcccceeEEeecCCchhhhcCCccccccceEE Confidence 777766554 334556678999999999999988876443322 12345577889999999999999999999 Q ss_pred EEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccc------------- Q lcl|Aclame:pro 72 PVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPA------------- 138 (311) Q Consensus 72 ~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~------------- 138 (311) +++...+-+++.+.|-+.+.+.. ....++.....+...+++.+++...+|+|+.+.....+.|+.. T Consensus 81 qvt~~l~ilgg~~eVDr~la~~~-Gn~a~~ra~e~~~~ikam~q~~~~~~iyGD~a~~p~~F~GL~kR~~~~ta~~~~qv 159 (330) T protein:vir:10 81 QVTDNCGMLEAYAEVDKALADLN-GNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDNV 159 (330) T ss_pred EEEEEeEEecchhhhhhHHHhhc-CCHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCChhhccchhhhcCCCCCCchhhe Confidence 99999999999999988776543 3345566666777889999999999999954432222222210 Q ss_pred -----------------------------c----c----------c--ccc----------------------------- Q lcl|Aclame:pro 139 -----------------------------K----I----------L--DTT----------------------------- 144 (311) Q Consensus 139 -----------------------------~----~----------~--~~~----------------------------- 144 (311) | + . ++. T Consensus 160 IdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~~~dg~gg~y~~~~~~~~w~~Gl~i~d~r~vvRI~ 239 (330) T protein:vir:10 160 IDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVC 239 (330) T ss_pred eeccccccCceEEEEEEEcCCeEEEEcccCccccceeeeccceeeecccCCCCceeEEeeeeeeeeeeEEeCcccEEEEe Confidence 0 0 0 000 Q ss_pred cceeeccccccchH---HHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHh-hccCCceeeccccccCCCceecceeEEee Q lcl|Aclame:pro 145 NIVELTTGTSATPD---LAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQ-RDSQGRKLYPELGFGTDVASFAGLNAAVS 220 (311) Q Consensus 145 ~~~~~~~~~~~~~~---~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~l-kd~~g~~~~~~~~~~~~~~~l~G~pv~~~ 220 (311) |+....-.+..... +.+..+..++.+.+....+|.||......|++. .+.+...+-.+...+...-.++|+||... T Consensus 240 NIdvs~l~~~~~~~~li~lm~~A~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~k~n~~l~~~~~~g~~~t~~~gipir~~ 319 (330) T protein:vir:10 240 NIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQRT 319 (330) T ss_pred ecccccCCCCccHHHHHHHHHHHHHhccCCCCCcceeeechHHHHHHHHHHhhcccceeeeeecCCeeeEEECCeEEEEE Confidence 00000001111222 333444445544555556799999999999874 55555444444455555678999999998 Q ss_pred ccccccccccc Q lcl|Aclame:pro 221 DTVRGGPEAVT 231 (311) Q Consensus 221 ~~~~~~~~~~~ 231 (311) +++......+. T Consensus 320 Dail~tE~~vv 330 (330) T protein:vir:10 320 DALLNTESRVV 330 (330) T ss_pred eeeecCccccC Confidence 88875433221 No 153 >protein:vir:93858 Length: 400 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764266;genbank:gi:115315579;genbank:GeneID:5141552 Probab=99.06 E-value=6.7e-12 Score=81.85 Aligned_cols=274 Identities=17% Similarity=0.075 Sum_probs=168.8 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEE-eecCccccccccceeEEEEeeee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEV-VGEGAQKSESTATFAPVTAIPRK 79 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~-v~Eg~~~~~~~~~~~~v~l~~~k 79 (311) =.|.......+|..+...|-+.++...++.++.++...| .+-+-+......-+| +--|+++.++..+|..-++.|.- T Consensus 121 gvt~td~n~iLP~~il~aIq~al~~~~~~~~f~~v~n~p--~l~V~~~~dt~~qa~gHk~G~~K~eq~~tl~~rtL~P~~ 198 (400) T protein:vir:93 121 GVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVG--ALLVSRSFDSANEAQVHKDGQTKTEQAATLTIDTLEPVM 198 (400) T ss_pred ccccCCchhhcchHHHHHHHHhhhccCCcccceeeecCC--ceeeecchhhhcccceeccCCcccceeeeeeeeccCHHH Confidence 112222234779999999999999999999988887774 222222222223445 67889999999999999999977 Q ss_pred EEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHH-HHHHhhhcccCCCcccccccc---ccccccccceeecccccc Q lcl|Aclame:pro 80 VQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRA-LDLIGIHGINPLTGAALSGSP---AKILDTTNIVELTTGTSA 155 (311) Q Consensus 80 l~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~-~d~~~l~G~~~~~g~~~~~~~---~~~~~~~~~~~~~~~~~~ 155 (311) +..+..+.+-.+ ++..+.-.+..|+..+|...+.++ .+++++-|+|..+-....... ....++. .... .... T Consensus 199 VYk~~~la~~~~-~~~~tygaL~nYVm~EL~q~vI~k~Ve~Aii~GdG~Ngf~~~dk~t~Ik~I~~dt~-kt~~--a~~~ 274 (400) T protein:vir:93 199 VYKLQSLAERVK-RLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITT-KAKS--AGKT 274 (400) T ss_pred HHHHhhhhhhhh-hccccHHHHHHHHHHHHHHHHHHHHhhhheeecccccccCCCcchhhhhhhhhhhh-hhhh--cCCc Confidence 777777744333 333344568899999999999975 699999885432211111100 0001110 0000 1111 Q ss_pred chHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceeccee-EEeecccccccccccccc Q lcl|Aclame:pro 156 TPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLN-AAVSDTVRGGPEAVTAST 234 (311) Q Consensus 156 ~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~p-v~~~~~~~~~~~~~~~~~ 234 (311) ..-+.+..+.....+...+.-.++++|..|..|+.|||++|++.|+.........+=+|+- .++...+|...+. T Consensus 275 ~~qdl~E~~~d~~~~~aad~~~Iv~s~d~~A~L~~lk~a~~~a~f~~~n~d~~IA~~fGv~~Lv~~Tr~~~~kp~----- 349 (400) T protein:vir:93 275 PFADAIEEAVDFVRPTAGRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSKALKPT----- 349 (400) T ss_pred cHHHHHHHHHhhhhhccCCceeEEeccchHHHHHHhcCCcceeeeeeccccchhhhhcccceeeeeccCCCCCce----- Confidence 2233455555555555555556999999999999999999999997766666666666763 3334455433322 Q ss_pred cccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEec Q lcl|Aclame:pro 235 GVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDA 308 (311) Q Consensus 235 ~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~a 308 (311) +++ |-. +.+.+. .++....- -+.+|.-.+..+...++-+.-+++-++++.+ T Consensus 350 -----------V~V-Dek-~~i~~~---~~~t~~sf-------~~~tNs~~ilvetlv~Gsi~~~N~~ay~~v~ 400 (400) T protein:vir:93 350 -----------VLV-DQK-YHIDMQ---DLTKVDAF-------EWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred -----------eee-ehh-hhcccc---Cceeccce-------eeeeccceEEeeeeeccceecccceeeEeeC Confidence 222 211 111111 11111100 0345556677788999999999999999888 No 154 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=99.04 E-value=2e-11 Score=79.23 Aligned_cols=293 Identities=10% Similarity=0.009 Sum_probs=153.8 Q ss_pred Cccc---CCCceEc-chhHHHHHHHHHHhhchhhhhcceeecCC-CceEEEEEeCCceeEEeecCccccccccceeEEE- Q lcl|Aclame:pro 1 MVAL---ATGTFQL-PKHLVPGVWQKAQGQSVLARLSMAEPQEF-GEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVT- 74 (311) Q Consensus 1 mat~---~~g~~~v-P~~~~~~ii~~~~~~s~l~~l~~~~~~~~-~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~- 74 (311) |+++ +.+..+| |+.|+.+|+.-+++..+...+.+....+. .+++||.+. .+...--.++..+.-.+.+-.+++ T Consensus 1 ~~~~n~ts~~qafi~~EiWsa~il~~l~~~Lv~~~~~~~~d~g~GDtV~InsIg-~~tV~dY~~~~~i~~d~ltt~~~~l 79 (322) T protein:vir:31 1 MSTGNNTSNTQALIVSEIWADEIEDILHEKLLDVNIARVVDFPDGDKLTIPSVG-TPVVRSRPEQGDFTFDNLDTGEISI 79 (322) T ss_pred CCCCCCcccceEEeehhhhHHHHHHHhhhhhhhhhhhcccccCCCCeEEecccc-ccccccccCCCCcccccCCCceEEE Confidence 8854 4444555 99999999988888877666666544343 458888865 333333344555444444444333 Q ss_pred -EeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhc--ccCCCccccccccccccccccceeecc Q lcl|Aclame:pro 75 -AIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHG--INPLTGAALSGSPAKILDTTNIVELTT 151 (311) Q Consensus 75 -l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G--~~~~~g~~~~~~~~~~~~~~~~~~~~~ 151 (311) +...|.-++ .++++..| +..++.....++.+++++...|+.+..= ++....+. .+-+..+........... T Consensus 80 ~IDq~KYfaf-~VdDD~~Q----a~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~-~~~p~vin~~~~~iv~~g 153 (322) T protein:vir:31 80 ILRDEVYAGN-AISKKLRQ----DSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAG-QNDPNVINGVPHRFVGTG 153 (322) T ss_pred EEehhhhhcc-ccchhHHH----hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhc-cCCcceecCCccceeccC Confidence 344444443 47776653 3467888999999999999998876320 11100000 011111111112222233 Q ss_pred ccccchHHHHHHHHHHHhhcCCCc-cEE-EEcHHHHHHHHH-------hhccCCceee-ccccccC---CCceecceeEE Q lcl|Aclame:pro 152 GTSATPDLAVEAAVGLVLGDNLSP-DGV-ALDNTFSFMLAT-------QRDSQGRKLY-PELGFGT---DVASFAGLNAA 218 (311) Q Consensus 152 ~~~~~~~~~i~~~~~~~~~~~~~~-~~~-v~n~~~~~~l~~-------lkd~~g~~~~-~~~~~~~---~~~~l~G~pv~ 218 (311) ......|+.+.++..++...+... ..| |.+|.....|.. ++| +|..- ....... ..+++.|+.|+ T Consensus 154 t~~~~ay~~lv~l~~kLdkanVP~~gR~vVV~P~~~~~L~~i~~~~~l~~D--~rf~~i~~sG~a~g~~~Vg~~~GF~V~ 231 (322) T protein:vir:31 154 TDQTMDVTDFSRVNYVMTQSKMPMGGMIGIIDPSVAHHLETITNISNISNN--PRWEGIVESGIAPDMQFVRSVYGIDLF 231 (322) T ss_pred CCchhhHHHHHHHHHHhccccCCCCCeEEEeCchhhhhhhhhhhhhhhhcc--ccccccccccchhhHHHHHHHhceeee Confidence 344567889999988887776664 344 556888776644 333 23211 1111111 15889999999 Q ss_pred eeccccccccccccccccccccccc--ceEEEeecce-EEEEeecCc-eEEEeccCCcccchhhhhcCcEEEEEEEEecc Q lcl|Aclame:pro 219 VSDTVRGGPEAVTASTGVYRTTNPN--VKAIAGDFSA-FRWGVQVSI-PLELIEFGDPDGLGDLKRQNQIAIRAEVVYGI 294 (311) Q Consensus 219 ~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~gd~~~-~~~~~~~~~-~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~ 294 (311) .|+.++..............+.++. ..+.+-|+.. -.+..+++| +-|-++. -.+..-.+|...|+|. T Consensus 232 ~SN~l~~~~~~i~aG~d~~~t~ag~~n~f~~~~~~~~~~~~~~~~~l~~~e~~r~---------~~~~~d~~~~~~~~g~ 302 (322) T protein:vir:31 232 VSNLLADANETINAGGDARSTTAGKCNMFMNVSDMGLLPFVVAWKEMPTTKSFID---------DYNDDLNTATTARWGN 302 (322) T ss_pred eeccccccccccccCcccccccceeecccccccchhhhhhhhHhhhhhhhhcccC---------ccccccceeeeeeecc Confidence 9999874321111000000000000 0111112211 001112222 1111111 0122335688899999 Q ss_pred EEecccceEEEEecccC Q lcl|Aclame:pro 295 GIMSTDAFAVVRDADES 311 (311) Q Consensus 295 ~v~~~~a~~~l~~aa~~ 311 (311) +++||+.++.|..-+.- T Consensus 303 g~~r~e~l~~~~a~~~~ 319 (322) T protein:vir:31 303 GLVRDENLVCVLANADK 319 (322) T ss_pred eeecccceEEEEecccc Confidence 99999998888533333 No 155 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=99.02 E-value=2.5e-10 Score=73.26 Aligned_cols=278 Identities=11% Similarity=0.005 Sum_probs=155.0 Q ss_pred Cc--ccCCCceEcchhHHHHHHHHHHhhchhhhh---------cceeecCCCceEEEEEeCC-ceeEEeecCc-cccccc Q lcl|Aclame:pro 1 MV--ALATGTFQLPKHLVPGVWQKAQGQSVLARL---------SMAEPQEFGEQQYMTLTAP-PRGEVVGEGA-QKSEST 67 (311) Q Consensus 1 ma--t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l---------~~~~~~~~~~~~~p~~~~~-~~a~~v~Eg~-~~~~~~ 67 (311) || ++.-...++|+.+..-+.+...+.+.+.+= .....-++..+++|..... ..+.-+.|++ .++..+ T Consensus 1 Ma~~~T~l~d~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~~~~l~G~~~~~~dg~~~i~~~k 80 (330) T protein:vir:10 1 MANELTKILDTITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGLLVNMPFWNDLTGDSEVLGNGDKALETGK 80 (330) T ss_pred CCCCceEeeeeechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCccccchhh Confidence 99 466678999999877666666555544332 1222235566899998754 5677778885 688888 Q ss_pred cceeEEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccce Q lcl|Aclame:pro 68 ATFAPVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIV 147 (311) Q Consensus 68 ~~~~~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~ 147 (311) .+-++-....++.+.-..++++.... +.-|..+.+.+++++..+++.+..+|.--.......... ..+........ T Consensus 81 i~t~~~~a~i~~~~k~~~~tD~a~~~---~g~dp~~~i~~q~a~~w~~~~q~~lla~l~gvf~~~~~~-~~~~~~~~~~~ 156 (330) T protein:vir:10 81 ITAGADIACVLYRGRGWAANELTGVV---AGSDPVRAILNRIGAYWLREDQKALIATLNGIFATGTAG-EKGALEETHVS 156 (330) T ss_pred cccceeEEEEEeecceeeehhhhhhh---cchhHHHHHHHHHHHHhhhhHHHHHHHHHHhhhhhhhcc-cchhhhhhhee Confidence 77776666667777777887775322 344677889999999888888777663210000000000 00000011111 Q ss_pred eeccccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccccccc Q lcl|Aclame:pro 148 ELTTGTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGP 227 (311) Q Consensus 148 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~ 227 (311) ...........+.+.++..++.+....-.+|+||+..+..|++..--+ ++ .+...+...+.++|++|++++.+|... T Consensus 157 ~~~~~~a~~s~~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~li~--~~-~~s~~~~~i~~~~G~~VivdD~~p~~~ 233 (330) T protein:vir:10 157 DQSKASTGIDAGMVLDAKQLLGDSADQVTAIAMHSAVYTKLQKDNLIQ--YI-QPTTATINIPTYLGYRVIIDDGIAPTG 233 (330) T ss_pred cccccccccCHHHHHHHHHHhccccccceEEEEcHHHHHHHHHhhhhh--hh-cccccCcccccccceEEEEeCCCCCCC Confidence 111122223456788888888776666778999999999998743111 01 111123345789999999999998442 Q ss_pred ccccccccccccccccceEEEeecceEEEEe---ecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEE Q lcl|Aclame:pro 228 EAVTASTGVYRTTNPNVKAIAGDFSAFRWGV---QVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAV 304 (311) Q Consensus 228 ~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~---~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~ 304 (311) .... .++++. ..+.+.. .....+++.|+.. .+...+-...+ .++||..+.. T Consensus 234 ~~yt-------------~yl~~~-GAi~~~~~~~~~~v~~EtdRd~~---------~g~~~l~~r~~---~~~hp~G~s~ 287 (330) T protein:vir:10 234 DIYT-------------SYLFRT-GSIGLNTGNPSGLTTFETSREAA---------KGNDMIYTRRA---LVMHPYGVKW 287 (330) T ss_pred Ccee-------------EEEEec-CceeeecccCCccccccccCCcc---------ccceEEEEeeE---EEeeeeeeee Confidence 2111 122321 1222221 1223445555432 12223333333 4566777665 Q ss_pred EEec----ccC Q lcl|Aclame:pro 305 VRDA----DES 311 (311) Q Consensus 305 l~~a----a~~ 311 (311) -+.. ..+ T Consensus 288 ~~~~~~~~~~s 298 (330) T protein:vir:10 288 TGAEVDAGNIT 298 (330) T ss_pred cccccccCcCC Confidence 5332 122 No 156 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=98.99 E-value=3.2e-10 Score=72.65 Aligned_cols=273 Identities=11% Similarity=0.023 Sum_probs=160.7 Q ss_pred CcccCCCceEcc--hhHHHHHHHHHHhhchhhhhccee-ecCCC--ceEEEEEeCCceeEEeecCc-cccccccceeEEE Q lcl|Aclame:pro 1 MVALATGTFQLP--KHLVPGVWQKAQGQSVLARLSMAE-PQEFG--EQQYMTLTAPPRGEVVGEGA-QKSESTATFAPVT 74 (311) Q Consensus 1 mat~~~g~~~vP--~~~~~~ii~~~~~~s~l~~l~~~~-~~~~~--~~~~p~~~~~~~a~~v~Eg~-~~~~~~~~~~~v~ 74 (311) |.+.++|.+++- +.+.+.|++.+.+.-..+++..+. +.+-+ .+.++.......+.|.+.++ ++|..+..++... T Consensus 1 ~~~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~dip~~~~~~~~~~ 80 (301) T protein:vir:80 1 MQGKITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTRSGAAKIIANGADDLPLVDVDMVRKS 80 (301) T ss_pred CCccccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeeccceeEEEecCcccccccccccceeEE Confidence 888877765433 446778888888888888887653 33322 24566666677888988764 5788888888888 Q ss_pred EeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCcccccccccccccccccee------ Q lcl|Aclame:pro 75 AIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVE------ 148 (311) Q Consensus 75 l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~------ 148 (311) ...+.++.-+.++.+=+........++...-+...++++++++|+.+|+|....+-. | +++..+... T Consensus 81 ~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~~g~~---G----LlN~p~~~~~~~~~~ 153 (301) T protein:vir:80 81 VPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKKYAIK---G----AFEATGIQIDVSPTT 153 (301) T ss_pred EEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeecccccce---e----eecCCCcccccccCc Confidence 888888888888776554444445678888899999999999999999995433222 2 222222111 Q ss_pred -------eccccccchHHHHHHHHHHHhh---cCCCccEEEEcHHHHHHHHHhh--ccCCceeeccccccCCCceeccee Q lcl|Aclame:pro 149 -------LTTGTSATPDLAVEAAVGLVLG---DNLSPDGVALDNTFSFMLATQR--DSQGRKLYPELGFGTDVASFAGLN 216 (311) Q Consensus 149 -------~~~~~~~~~~~~i~~~~~~~~~---~~~~~~~~v~n~~~~~~l~~lk--d~~g~~~~~~~~~~~~~~~l~G~p 216 (311) ..+.+....++++..++.++.. ....+..++++|+.+..|.+.+ +..|..++.-.........|...| T Consensus 154 ~~~~~~~w~~~t~~ei~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~~~tvl~~l~~~~~~~~I~~~p 233 (301) T protein:vir:80 154 GVGNVSKWEKKTAEQIIDEIGEAHTKITVLPGYGTASLKLCLPPKQFELINKKRYSNEDSRSVLKVLQDNAWFSAIVRVP 233 (301) T ss_pred ccccccccccCCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHhhhhccccCCCCeeHHHHHHHHcCcceEEEcc Confidence 1111222346778888887743 2335678999999999997544 444544432221111122333333 Q ss_pred EEeecccccccccccccccccccccccceEEEee--cceEEEEeecCceEEEeccCCcccchhhhhcCc-EEEEEEEEe- Q lcl|Aclame:pro 217 AAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGD--FSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQ-IAIRAEVVY- 292 (311) Q Consensus 217 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd--~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~-v~~ra~~r~- 292 (311) -... . ...+...+++-. -..+.+....+++ +.+ . -.++. .....+.|+ T Consensus 234 ~L~~-----~------------g~~g~~~~v~~~~~~d~~~~~v~~~~~--~~~-~--------e~~~~~~~~~~~~r~~ 285 (301) T protein:vir:80 234 DLAG-----M------------GTAGSDSFAVIHDSNETAELIIPMDIT--RHP-E--------EYSFPRTKVPFEERTA 285 (301) T ss_pred eecc-----C------------CCCcccEEEEEecCCcEEEEEecCcee--eec-c--------eecCceeEeeeeeeeE Confidence 2211 0 001122222221 1222222222222 111 1 11221 223456777 Q ss_pred ccEEecccceEEEEec Q lcl|Aclame:pro 293 GIGIMSTDAFAVVRDA 308 (311) Q Consensus 293 ~~~v~~~~a~~~l~~a 308 (311) |..+.+|.||++++.= T Consensus 286 Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 286 GVVVRFPAAIVRVDGI 301 (301) T ss_pred EEEEEccceEEEEecC Confidence 5789999999999977 No 157 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=98.97 E-value=2.3e-10 Score=73.43 Aligned_cols=277 Identities=9% Similarity=-0.014 Sum_probs=165.5 Q ss_pred CcccCCCceEcc---hhHHHHHHHHHHhhchhhhhcceee-cCCC--ceEEEEEeCCceeEEeecC-ccccccccceeEE Q lcl|Aclame:pro 1 MVALATGTFQLP---KHLVPGVWQKAQGQSVLARLSMAEP-QEFG--EQQYMTLTAPPRGEVVGEG-AQKSESTATFAPV 73 (311) Q Consensus 1 mat~~~g~~~vP---~~~~~~ii~~~~~~s~l~~l~~~~~-~~~~--~~~~p~~~~~~~a~~v~Eg-~~~~~~~~~~~~v 73 (311) |-...++|.+.- +.+.+.|+|...+.-..+++.++.. .+-+ .+.+++....+.+.|.+.+ .++|..+..++.. T Consensus 3 ~~~a~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~ 82 (296) T protein:vir:10 3 VDKADAAGIWTVKQLTASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDGVGIAQIVADYTDDLPLVDALATER 82 (296) T ss_pred ccchhhhHHHHHHHHHHHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeeccCceeEeCCCccccceeeccceeE Confidence 333344454444 3456778887777777777766543 2222 3556666677788898765 5588888888888 Q ss_pred EEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeec-c- Q lcl|Aclame:pro 74 TAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELT-T- 151 (311) Q Consensus 74 ~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~-~- 151 (311) ....+.++.-+.++.+=+..+.....++...-+...++++++++|+.+|+|....+- .|+++..+..... . T Consensus 83 ~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~~g~-------~GLlN~p~v~~~~~~~ 155 (296) T protein:vir:10 83 QGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTAHGI-------PSVFDYPNINNVVSGG 155 (296) T ss_pred EEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccc-------eeEeecCCCccccccC Confidence 888888889888887655545455567888888999999999999999999543221 2233322221111 1 Q ss_pred --ccccchHHHHHHHHHHHhh---cCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccc Q lcl|Aclame:pro 152 --GTSATPDLAVEAAVGLVLG---DNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGG 226 (311) Q Consensus 152 --~~~~~~~~~i~~~~~~~~~---~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~ 226 (311) .+....++++..++..+.. ....+..++++|+.+..|.+.....|.-++.-......+..+...|...... T Consensus 156 ~W~~~t~i~~Di~~~~~~l~~~s~g~~~p~~l~L~p~~~~~L~~~~~~~~~t~l~~ik~~~~~l~i~~~~~l~~a~---- 231 (296) T protein:vir:10 156 SWSQPTTAVSDITSLLDIIETSTNGQHRATHLLLPTTARRIMQNLVPGTSVSYGEFFRQNNSGVTVEFVQYLNDYN---- 231 (296) T ss_pred CccCHHHHHHHHHHHHHHHHHhhCceecceeEEeCHHHHHHHhhccCCCCccHHHHHHHhcCCceEEEeeeeccCC---- Confidence 1222457788888876653 3456778999999999997665555544433222222223343333321100 Q ss_pred cccccccccccccccccceEEEeec--ceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEec-cEEecccceE Q lcl|Aclame:pro 227 PEAVTASTGVYRTTNPNVKAIAGDF--SAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYG-IGIMSTDAFA 303 (311) Q Consensus 227 ~~~~~~~~~~~~~~~~~~~~~~gd~--~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~-~~v~~~~a~~ 303 (311) ..++..+++-+. ..+.+....+++. .+ .+. ..-...+++..|++ ..+.+|.||+ T Consensus 232 -------------~~g~~~~v~~~~~~~~~~~~v~~~~~~--~~-~e~-------~~l~~~~~~~~~~~Gv~i~~P~ai~ 288 (296) T protein:vir:10 232 -------------GTGTSAAIAYEKDPNNMAIEIPEATNA--LP-AQP-------KDLHFKIPVTSKATGLIVYRPLTMA 288 (296) T ss_pred -------------CCcceEEEEEEcCCceEEEEcCcceee--ec-ccc-------cCceEEEeeEeeEEEEEEECCceeE Confidence 111222333222 2233333333222 21 111 11123567788885 8999999999 Q ss_pred EEEecccC Q lcl|Aclame:pro 304 VVRDADES 311 (311) Q Consensus 304 ~l~~aa~~ 311 (311) +++.=+.| T Consensus 289 ~~dGI~~~ 296 (296) T protein:vir:10 289 VMKGITFA 296 (296) T ss_pred EEeeeecC Confidence 99988888 No 158 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=98.97 E-value=5.3e-11 Score=76.90 Aligned_cols=265 Identities=12% Similarity=0.041 Sum_probs=138.5 Q ss_pred hcceeecCCCceEEEEEeCCceeEEeecCccccc--cccceeE--EEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHH Q lcl|Aclame:pro 32 LSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSE--STATFAP--VTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMAD 107 (311) Q Consensus 32 l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~--~~~~~~~--v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~ 107 (311) |.+.+.- +.++++|+. +..++....-|+++.. .++.-.+ +++.-.++..+ .|-+- +..-+..|+.....+ T Consensus 1 ~vr~i~~-g~s~~~~~i-G~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~~-~VdDi---D~~qa~~Dlr~e~s~ 74 (324) T protein:vir:99 1 MTRTITS-GKSAQFPVM-GRTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDV-LIYDI---EDAMNHYDVRSEYST 74 (324) T ss_pred Ceeeeec-CceEEEeee-eeeEeccccCCCCcCCCcCCcCcccEEEEecchhhhhh-hhhhH---HHHhcCccchhHHHH Confidence 4444432 355889987 5666766666665532 2233333 33333333221 11110 112234578899999 Q ss_pred HHHHHHHHHHHHHhhhc----cc--CCC-cccccccccc-ccccccceeeccccccchHHHHHHHHHHHhhcCCCcc--E Q lcl|Aclame:pro 108 LSGVALGRALDLIGIHG----IN--PLT-GAALSGSPAK-ILDTTNIVELTTGTSATPDLAVEAAVGLVLGDNLSPD--G 177 (311) Q Consensus 108 ~la~~ia~~~d~~~l~G----~~--~~~-g~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~ 177 (311) ++++++++..|+.++.- .. ... .....+...+ +...+.............++.+.++...+...+.... . T Consensus 75 ~~G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~Lde~~VP~~gR~ 154 (324) T protein:vir:99 75 QMGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAFAKKYIPAGDRT 154 (324) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCE Confidence 99999999999887622 00 000 0000000000 0001111111111122346677778788877666433 4 Q ss_pred EEEcHHHHHHHHHhhc-cCCceeeccccccCCCceecceeEEeecccccccccccccc-----------------ccccc Q lcl|Aclame:pro 178 VALDNTFSFMLATQRD-SQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTAST-----------------GVYRT 239 (311) Q Consensus 178 ~v~n~~~~~~l~~lkd-~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~-----------------~~~~~ 239 (311) .+++|..+..|..-+. .++.+.-......+..++++|++|+.|+.+|.......... ..+.. T Consensus 155 ~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ky~~ 234 (324) T protein:vir:99 155 FYTDPDTYSAILAALMPNAANYAALIDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTGKMTV 234 (324) T ss_pred EEeChHHHHHHhhcccccccccccccceecceEEEEeceEEEecCCcccccccccccccccccccccccccccccccccc Confidence 7889999987754332 23333333444556678999999999999996533211000 00111 Q ss_pred ccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 240 TNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 240 ~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .......++.-.+.+......+++++..++. .+|.. .+++..-+|.+++||++.+.++..+.+ T Consensus 235 d~~~~~gl~~~~~a~~tv~~~~~~~e~~~~~------~~~~d---~i~~~~a~G~~~lRPe~a~~v~l~~~~ 297 (324) T protein:vir:99 235 GADNVVGLFVHRSAVATLKLKDMALERARRP------EYQAD---QIIAKYAMGHGGLRPEAVGAIIFEDGE 297 (324) T ss_pred ccCceeEEEEehhheEEEeeecceecceech------hhHHH---hhhhhhhhcCcccccceEEEEEEccCc Confidence 1222233343444444444444455554432 12322 355667789999999999888866665 No 159 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=98.97 E-value=1.2e-10 Score=74.92 Aligned_cols=298 Identities=12% Similarity=-0.025 Sum_probs=155.8 Q ss_pred CcccCC---------Cc--eEcchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEeecCcccccccc Q lcl|Aclame:pro 1 MVALAT---------GT--FQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVGEGAQKSESTA 68 (311) Q Consensus 1 mat~~~---------g~--~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~Eg~~~~~~~~ 68 (311) |.+... |. .+-=+.+..+|.......+..+++..+.++.+++ +.+|+. +..+++...-|+++-.+.+ T Consensus 1 Ms~~n~~t~p~~~gsg~~~aL~Le~f~GeV~taF~~~si~~~~~~vRtI~~gkS~qf~~l-G~s~a~y~~pG~~ldg~~~ 79 (400) T protein:vir:10 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPAATST 79 (400) T ss_pred CCCCccccccccccccchhhhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEe-eeeEEeeecCCCCcCCCCc Confidence 443211 11 1222668899999999999999999999888766 788886 6777888877777655555 Q ss_pred ceeEEEEeeeeE-EEEEeecHHHhhcCchhhHH-HHHHHHHHHHHHHHHHHHHHhhhcccCCC---cccccccccccccc Q lcl|Aclame:pro 69 TFAPVTAIPRKV-QVTQRFSQEVKWADESRQLG-VLQTMADLSGVALGRALDLIGIHGINPLT---GAALSGSPAKILDT 143 (311) Q Consensus 69 ~~~~v~l~~~kl-~~~i~iS~ell~~s~~~~~~-~~~~i~~~la~~ia~~~d~~~l~G~~~~~---g~~~~~~~~~~~~~ 143 (311) .-++..+..-.+ .....|-+- +..-+.+| +...+.+++.+++++..|+.+|.-.-... ...+.+.+.+...+ T Consensus 80 ~~dk~~ItIDtLL~a~~~V~dl---Dd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~~~g~~~g 156 (400) T protein:vir:10 80 QADKNQLVIDATVIARNTVAHL---HDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTNPRVKGHG 156 (400) T ss_pred ccCcEEEEeCceeeecchhhhH---HHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCccccc Confidence 555554444222 222222110 11223455 67889999999999999998873210000 00111112222221 Q ss_pred ccc-e-eeccccccc---hHHHHHHHHHHHhhcCCCcc--EEEEcHHHHHHHHHhh-ccCCceeec--cccccCCCceec Q lcl|Aclame:pro 144 TNI-V-ELTTGTSAT---PDLAVEAAVGLVLGDNLSPD--GVALDNTFSFMLATQR-DSQGRKLYP--ELGFGTDVASFA 213 (311) Q Consensus 144 ~~~-~-~~~~~~~~~---~~~~i~~~~~~~~~~~~~~~--~~v~n~~~~~~l~~lk-d~~g~~~~~--~~~~~~~~~~l~ 213 (311) ... + ........+ .-..+..+...+...+.... ++++.|..+..|..-. -=|-.+... .+...+...++. T Consensus 157 ~s~~v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP~~d~vvl~pp~~Ys~Ll~~dkLvnrdf~~s~~g~~~~g~v~~v~ 236 (400) T protein:vir:10 157 FSVNVEVNEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKSYTISQSGATIQGFVLSSY 236 (400) T ss_pred cceeecccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhCCcccchhccccCCCccccceEEEEe Confidence 111 1 111111111 12234455555555444433 3555666666664311 001111111 222344457899 Q ss_pred ceeEEeeccccccccccccc-------cccc--ccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcE Q lcl|Aclame:pro 214 GLNAAVSDTVRGGPEAVTAS-------TGVY--RTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQI 284 (311) Q Consensus 214 G~pv~~~~~~~~~~~~~~~~-------~~~~--~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v 284 (311) |+||+.++.+|......... ...+ .........++.-.+.+......+++.++.++.. .|.. T Consensus 237 Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~tvk~~~lt~~~~~d~r------~~~~--- 307 (400) T protein:vir:10 237 NCPVIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALLVGRSIDVIGDIFYEKK------EKTY--- 307 (400) T ss_pred ceEEEeeCcCCcccCcccccccccCCCCccCCccccccceeEEEEehhheEEEEeeccccccccchh------hHHH--- Confidence 99999999999542211100 1111 1222333444444444444444454444433321 1111 Q ss_pred EEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 285 AIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 285 ~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .+-+.+-+|..++||++..+++.+.-+ T Consensus 308 ~id~~~a~G~g~~RPeaa~vv~~~~~~ 334 (400) T protein:vir:10 308 YIDTFMSEGAIPDRWEAVSVVTTKRQS 334 (400) T ss_pred HHHHHHHhCCcccchhheEEEEecCCc Confidence 122335689999999999999988777 No 160 >protein:vir:7324 Length: 335 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848215;genbank:gi:30387386;genbank:GeneID:2641870 Probab=98.96 E-value=1.8e-10 Score=74.04 Aligned_cols=231 Identities=15% Similarity=0.027 Sum_probs=147.9 Q ss_pred CcccCCCceEc--------chhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEeecCcccccccccee Q lcl|Aclame:pro 1 MVALATGTFQL--------PKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVGEGAQKSESTATFA 71 (311) Q Consensus 1 mat~~~g~~~v--------P~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~ 71 (311) |++...+..++ |......|||.+.+.++|++..+.....+.. -...+.++-|.+.|..=|+.+++++.++. T Consensus 1 m~~~~~~a~TL~E~Akr~~~d~~~~~IIE~l~~tneIL~~lpf~e~N~~tg~~~~vrt~LP~~~fR~lN~g~~~s~~tt~ 80 (335) T protein:vir:73 1 MALIGQTLPSLLDIYNRTDKNGRIARIVEQLAKTNDILTDAIYVPCNDGSKHKTTIRAGIPEPVWRRYNQGVQPTKTQTV 80 (335) T ss_pred CCcCCCCchhHHHHHhhcCcchhHHHHHHHHhcCchHHhhcchhcccCCcccceeEEEecCCchhhhcCCccccccceEE Confidence 88777665433 3445667999999999999988876443322 12345577889999999999999999999 Q ss_pred EEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccc------------- Q lcl|Aclame:pro 72 PVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPA------------- 138 (311) Q Consensus 72 ~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~------------- 138 (311) +++...+-+++.+.|-+.|.+... +..++.........+++.+++...+|+|+.+.....+.|+.. T Consensus 81 qvt~~l~ilgg~~eVDr~La~~~G-n~a~~ra~e~~~~ikam~q~~~~~~iyGDsa~~p~~FdGL~kR~~~~st~~a~~a 159 (335) T protein:vir:73 81 PVTDTTGMLYDLGFVDKALADRSN-NAAAFRVSENMGKLQGFNNKVARYSIYGNTDAEPEAFMGLAPRFNTLSTSKAASA 159 (335) T ss_pred EEEEEEEEecchhhhhHHHHhhcC-CHHHHHHHHHHHHHHHHHHHHHHHhccCCcCCChhhccchhhhhcCccccccCcc Confidence 999999999999999887765443 345566666777889999999999999954433333332210 Q ss_pred ------c---------------------------------------------------------------------cccc Q lcl|Aclame:pro 139 ------K---------------------------------------------------------------------ILDT 143 (311) Q Consensus 139 ------~---------------------------------------------------------------------~~~~ 143 (311) | +.-- T Consensus 160 ~~iIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~d~~G~~y~~~~~~~~w~~Gl~i~d~r~vvRI 239 (335) T protein:vir:73 160 ENVFSAGGSGSTNTSIWFMSWGENTAHMIYPEGMVAGFQHEDLGDDLVSDGNGGQFRAYRDEFKWDIGLSVRDWRSISRI 239 (335) T ss_pred cceeeccccccCceEEEEEEEcCCeeEEEcccCccccceeeeccceeeecCCCCEEeEEEeeeeeeeeeEEeCcccEEEE Confidence 0 0000 Q ss_pred ccceeec----cccccchHHHHHHHHH--HHhhcCCCccEEEEcHHHHHHHHHh-hccCCceeeccccccCCCceeccee Q lcl|Aclame:pro 144 TNIVELT----TGTSATPDLAVEAAVG--LVLGDNLSPDGVALDNTFSFMLATQ-RDSQGRKLYPELGFGTDVASFAGLN 216 (311) Q Consensus 144 ~~~~~~~----~~~~~~~~~~i~~~~~--~~~~~~~~~~~~v~n~~~~~~l~~l-kd~~g~~~~~~~~~~~~~~~l~G~p 216 (311) .|+.... .....+..+.+..++. .+.+.+....+|.||......|++. ++.....+-.+...+...-.++|+| T Consensus 240 ~NIdvs~l~~d~~~~~~l~~lmi~a~~~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~l~~~~~~g~~~t~~~gip 319 (335) T protein:vir:73 240 CNIDVTTLTKDASTGADLISMMVDAYYARDVAMLGDGKEVIYANKTIHAWLHKQAMNAKNVNLTIEEYGGKKIVSFLGIP 319 (335) T ss_pred eecccccccccccchhhHHhhHHHHHHHHhccCCCCCceEEEechHHHHHHHHHHhccCceeeeeeccCCceeEEECCeE Confidence 0000000 0011122223333332 1222222235799999999999874 4444444434445555567789999 Q ss_pred EEeecccccccccccc Q lcl|Aclame:pro 217 AAVSDTVRGGPEAVTA 232 (311) Q Consensus 217 v~~~~~~~~~~~~~~~ 232 (311) |...+++......+.. T Consensus 320 ir~~Dail~tE~~v~~ 335 (335) T protein:vir:73 320 IRRVDAILNTESAVTA 335 (335) T ss_pred EEEEeeeecCcccccC Confidence 9999888755433322 No 161 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=98.89 E-value=1.8e-10 Score=74.05 Aligned_cols=297 Identities=11% Similarity=-0.027 Sum_probs=151.7 Q ss_pred CcccCC---------Cc--eEcchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEeecCcccccccc Q lcl|Aclame:pro 1 MVALAT---------GT--FQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVGEGAQKSESTA 68 (311) Q Consensus 1 mat~~~---------g~--~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~Eg~~~~~~~~ 68 (311) |.+... |. .+-=+.+..+|.......+..+++..+.++.+++ +.+|+. +..+++...-|++...+.+ T Consensus 1 Ms~~n~~t~~~~~~sg~~~al~Le~f~GeV~taF~~~si~~~~~~vRti~~gkS~qf~~~-G~s~~~~~~pG~~ld~~~~ 79 (401) T protein:vir:70 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPAATST 79 (401) T ss_pred CCCCccccccccccccchhHhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEe-eeeEeeeecCCCCcCCCCc Confidence 543211 11 1222668899999999999999999999888766 788886 5667777766666555555 Q ss_pred ceeEEEEeeeeEEEEEeecHHHhhcCc--hhhHH-HHHHHHHHHHHHHHHHHHHHhhhcccCCC---ccccccccccccc Q lcl|Aclame:pro 69 TFAPVTAIPRKVQVTQRFSQEVKWADE--SRQLG-VLQTMADLSGVALGRALDLIGIHGINPLT---GAALSGSPAKILD 142 (311) Q Consensus 69 ~~~~v~l~~~kl~~~i~iS~ell~~s~--~~~~~-~~~~i~~~la~~ia~~~d~~~l~G~~~~~---g~~~~~~~~~~~~ 142 (311) .-++..+..-.+ .+++-++.+-+ -+.+| +...+.+++.+++++.+|+.++.-.-... ..+....+.+... T Consensus 80 ~~dK~~ItID~l----L~a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~~p~~~~~ 155 (401) T protein:vir:70 80 QADKNQLVIDAT----VIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRTNPRVKGH 155 (401) T ss_pred ccccEEEEeCce----eehhhhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCcCCC Confidence 555544444222 12222221111 22345 56789999999999999998753210000 0011111111111 Q ss_pred cccc--eeeccc---cccchHHHHHHHHHHHhhcCCCcc--EEEEcHHHHHHHHHhh-ccCCceeec--cccccCCCcee Q lcl|Aclame:pro 143 TTNI--VELTTG---TSATPDLAVEAAVGLVLGDNLSPD--GVALDNTFSFMLATQR-DSQGRKLYP--ELGFGTDVASF 212 (311) Q Consensus 143 ~~~~--~~~~~~---~~~~~~~~i~~~~~~~~~~~~~~~--~~v~n~~~~~~l~~lk-d~~g~~~~~--~~~~~~~~~~l 212 (311) +... ...... +.....+.+.++...+...+.... ++++.|..+..|..-. --|-.+-.. .....+...++ T Consensus 156 G~~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~~r~vvl~pp~~Ys~Ll~~d~L~nrd~~~s~~g~~~~G~v~~v 235 (401) T protein:vir:70 156 GFSINVEVAEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKTYTISQSGATIQGFTLSS 235 (401) T ss_pred ceEEeccccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhcCcccchhhccccCCccccceEEEE Confidence 1110 000011 111133445666666666555543 3444556655554321 001111111 22234455789 Q ss_pred cceeEEeeccccccccccccccc-------ccc--cccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCc Q lcl|Aclame:pro 213 AGLNAAVSDTVRGGPEAVTASTG-------VYR--TTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQ 283 (311) Q Consensus 213 ~G~pv~~~~~~~~~~~~~~~~~~-------~~~--~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~ 283 (311) .|+||+.++.+|........... .+. .......+++.-.+.+......+++.++.++... |.. T Consensus 236 aGv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~~lt~~~~~d~r~------~~~-- 307 (401) T protein:vir:70 236 YNCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSIDVTGDIFYEKKE------KTY-- 307 (401) T ss_pred eceEEEeeccccccccccccccccccCCCccCCCCccccceeEEEEehhheEEEEeeccccchhhhhhh------hHH-- Confidence 99999999999965332221111 111 1222223333344444444444444443332211 111 Q ss_pred EEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 284 IAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 284 v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .+-+.+-+|..++||+|..+++.+--. T Consensus 308 -~id~~~a~g~g~~RPeaa~vv~~k~~~ 334 (401) T protein:vir:70 308 -YIDTFMAEGAIPDRWEAVSVVTTKRNT 334 (401) T ss_pred -HHHHHHHhCCcccchhheEEEeecCcc Confidence 112335689999999999988654432 No 162 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=98.85 E-value=5.1e-10 Score=71.54 Aligned_cols=277 Identities=9% Similarity=-0.007 Sum_probs=157.4 Q ss_pred CcccCCCc-eEcc--hhHHHHHHHHHHhhchhhhhcceeecCC---CceEEEEEeCCceeEEeecC-ccccccccceeEE Q lcl|Aclame:pro 1 MVALATGT-FQLP--KHLVPGVWQKAQGQSVLARLSMAEPQEF---GEQQYMTLTAPPRGEVVGEG-AQKSESTATFAPV 73 (311) Q Consensus 1 mat~~~g~-~~vP--~~~~~~ii~~~~~~s~l~~l~~~~~~~~---~~~~~p~~~~~~~a~~v~Eg-~~~~~~~~~~~~v 73 (311) .++..++| +++. +.+.+.|++...+.-..+++.++..-.+ ..+.++.....+.+.|.+.+ ..+|..+..++.. T Consensus 21 ~~~~d~~~~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~~~~~~~~et~~~~~~e~~G~a~~~~d~~~dip~vd~~~~~~ 100 (314) T protein:vir:10 21 VEKADAAGIWAVSQLTAALNRAYEKEYAENSVVNIFPVTNEIPGHAKYFEYPEFDGVGIAQIIADYSDDLPLVDAFMTEK 100 (314) T ss_pred ccchhhhHHHHHHHHHHHHHHHhhhhccccccceeeccccCCCCceeEEEeeeeccccceeeeCCcccccceeeccccee Confidence 33334444 4443 3456677777777766666665432211 13566666777788898876 4588888888888 Q ss_pred EEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeec--- Q lcl|Aclame:pro 74 TAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELT--- 150 (311) Q Consensus 74 ~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~--- 150 (311) ....+.++..+.++.+=+........++...-+...++++++.+|+.+++|....+- . |+++..+..... T Consensus 101 ~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~g~---~----GLlN~p~v~~~~~~~ 173 (314) T protein:vir:10 101 QGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDKLVWSGSAPHGI---V----SVFDQPNINNVVATP 173 (314) T ss_pred EEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccc---e----eEeecCCCccccCCC Confidence 888888889888876544444444567888888999999999999999999543322 2 222222211111 Q ss_pred -cccccchHHHHHHHHHHHhh---cCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccc Q lcl|Aclame:pro 151 -TGTSATPDLAVEAAVGLVLG---DNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGG 226 (311) Q Consensus 151 -~~~~~~~~~~i~~~~~~~~~---~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~ 226 (311) ..+....++++..++.++.. ....|+.++++|+.+..|...-+..|..++.-......+-.|.+.|-... T Consensus 174 ~WaT~~ei~~Di~~~~~~l~~~s~g~~~p~~l~Lpp~~~~~L~~~~~~~~~tvl~~l~~n~~~l~I~~~~el~~------ 247 (314) T protein:vir:10 174 NWSVPQNAIDDVTAMIDAVESSTQGLHHVTDILLPASARRVMQGLVPQTNLSYGELFTRNNPGLTIRFLQFLDN------ 247 (314) T ss_pred CcccHHHHHHHHHHHHHHHHHhcCccccceeEEecHHHHHhhcccccCCCccHHHHHHHhCCCcEEEEcccccc------ Confidence 11222346778888877764 33457789999999988865444445444332222111223333332211 Q ss_pred cccccccccccccccccceEEEeec--ceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEe-ccEEecccceE Q lcl|Aclame:pro 227 PEAVTASTGVYRTTNPNVKAIAGDF--SAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVY-GIGIMSTDAFA 303 (311) Q Consensus 227 ~~~~~~~~~~~~~~~~~~~~~~gd~--~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~-~~~v~~~~a~~ 303 (311) . ...+...+++-+- ..+.+.....++ ..+. +. . .=...+.+..|+ |..+.+|.||+ T Consensus 248 --a---------g~~g~~~~v~y~~~~~~~~~~vp~~~~--~l~~-e~---~----~~~~~~~~~~r~~Gv~i~~P~ai~ 306 (314) T protein:vir:10 248 --Y---------DGAGGKAALAFEKSPLNMSIEIPEVTN--VLPA-QP---K----DLHFRYPVTSKATGLIVYRPLTMA 306 (314) T ss_pred --c---------CCCcceEEEEEecCCcEEEEecCccce--eecc-ee---c----CceEEEcceeeeEEEEEECcceeE Confidence 0 0111222222221 222222222222 2211 10 0 011233446677 58899999999 Q ss_pred EEEecccC Q lcl|Aclame:pro 304 VVRDADES 311 (311) Q Consensus 304 ~l~~aa~~ 311 (311) +++.=+.| T Consensus 307 ~~dGI~~~ 314 (314) T protein:vir:10 307 VIKGITFA 314 (314) T ss_pred eeeeeecC Confidence 99999999 No 163 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=98.83 E-value=3.6e-09 Score=66.87 Aligned_cols=288 Identities=8% Similarity=-0.020 Sum_probs=159.2 Q ss_pred CcccCC-----CceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCc-eeEEeecCccccccccceeEEE Q lcl|Aclame:pro 1 MVALAT-----GTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPP-RGEVVGEGAQKSESTATFAPVT 74 (311) Q Consensus 1 mat~~~-----g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~-~a~~v~Eg~~~~~~~~~~~~v~ 74 (311) ||+.+. -.+..-+.+.++|...-....|+.++....+..+...+|....-.. ...-..||++.+.....-.... T Consensus 1 ma~~~~~~~t~~~~g~~~dl~~~I~~isp~dTPf~S~i~~~~a~~~~~~W~~d~l~~~~~~~~~EG~da~~~~~~~r~~~ 80 (317) T protein:vir:88 1 MATPTNAVSTVEINGKREDLIDIIYNIAPYDTPFMSAIGKGVATAITHEWQTDELRQPGKNTRVEGEDATIKAGSFTTML 80 (317) T ss_pred CCccccceEeeeeeeeeechhhhheecCCccCcceeeecCceecccEEEEEeeecCCccccccccCcccccccccCCEEe Confidence 665432 2234456678888877777888888877666655556666544332 2234458887776543322221 Q ss_pred Eee-eeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCC-Cccc-cccccccccccc------- Q lcl|Aclame:pro 75 AIP-RKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPL-TGAA-LSGSPAKILDTT------- 144 (311) Q Consensus 75 l~~-~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~-~g~~-~~~~~~~~~~~~------- 144 (311) -.. .-+...+.||.-+..-...-.-+..++-..+-..+|.+.+|.++|+|.... ++.. ......|+.... T Consensus 81 ~N~tQIf~k~v~VSgTa~av~~~G~~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~~Gl~~~i~t~~~~~ 160 (317) T protein:vir:88 81 NNYCQISDETLQVTGTADRVKKAGRKNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQMANIFAYYKTNGSLG 160 (317) T ss_pred ccEEEEEEeEEEEeehhhhhhhcCccchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhhhhHHHHhccCceec Confidence 111 223334555554332111111233333334445678999999999996431 1111 112223322111 Q ss_pred --cce--------eeccccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecc Q lcl|Aclame:pro 145 --NIV--------ELTTGTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAG 214 (311) Q Consensus 145 --~~~--------~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G 214 (311) +.. ...........+++.+++.++...+..++.+++++.....|.++...++.++..+.. ....| T Consensus 161 ~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~~Gg~~~~i~v~a~~k~~i~~~~~~~~~~i~~~~~-----~~~~g 235 (317) T protein:vir:88 161 ANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIWRNGGQANSIQTSSSIKKAISKNMKGRATEITLDAS-----DNRIA 235 (317) T ss_pred cCccccccCCCccccccccccccHHHHHHHHHHHHhcCCCCCEEEeChHHHHHHHHHhcCCceeEEEccc-----CeEEE Confidence 000 000111123566788999999999999999999999999998875445544432111 11223 Q ss_pred eeE--EeecccccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEe Q lcl|Aclame:pro 215 LNA--AVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVY 292 (311) Q Consensus 215 ~pv--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~ 292 (311) .-| +++++-. -....+ -.-+...+++.|.+.+.+..-+++..+... .+ -+.........+ T Consensus 236 ~~v~~~~tdfG~--v~ii~~------r~lp~~~~~~~D~~~~~l~~Lr~~~~e~la-----Kt-----Gd~~k~~i~~E~ 297 (317) T protein:vir:88 236 QTVDVYESDFGK--YTIRAN------RWFHENTLFVFDPKMHSLCYLRPFFQHELA-----KT-----GDSEKRQLLVEY 297 (317) T ss_pred EEEEEEEeCCeE--EEEEeC------CCCCCCeEEEEcccccceeecccceeeccC-----CC-----cccceeEEEEEE Confidence 322 2322211 011111 112345788889888877766665444322 11 133345667889 Q ss_pred ccEEecccceEEEEecccC Q lcl|Aclame:pro 293 GIGIMSTDAFAVVRDADES 311 (311) Q Consensus 293 ~~~v~~~~a~~~l~~aa~~ 311 (311) ++++.+|+|.+++..-+++ T Consensus 298 tLe~~N~~a~a~i~~l~~~ 316 (317) T protein:vir:88 298 TFRVNNEKSGALIRDVVAQ 316 (317) T ss_pred EEEEcCccceeEEEEeccc Confidence 9999999999999998888 No 164 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=98.79 E-value=2.7e-10 Score=73.03 Aligned_cols=271 Identities=11% Similarity=-0.022 Sum_probs=145.0 Q ss_pred CcccCC---Cce-----Ecc---hhHHHHHHHHHHhhchhhhhcceeecCCCc-e-EEEEEeCCceeEEeecCccccccc Q lcl|Aclame:pro 1 MVALAT---GTF-----QLP---KHLVPGVWQKAQGQSVLARLSMAEPQEFGE-Q-QYMTLTAPPRGEVVGEGAQKSEST 67 (311) Q Consensus 1 mat~~~---g~~-----~vP---~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~-~~p~~~~~~~a~~v~Eg~~~~~~~ 67 (311) |++.-+ -+. +-| .++.+.+-.-+.+..-++...+.+||..|+ + .+|.+.....+.-|+||++||-++ T Consensus 1 ~~~~~~~~e~nlt~~~dl~~~~siDf~~~f~~~i~~L~~~LGv~r~~pla~GstIkt~k~~~y~gda~dVaEGe~Iplsk 80 (296) T protein:vir:98 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) T ss_pred CCCccccCcCCCcchhhhhhhhhhhhHHHHhhhHHHHHHHhhhcccccccCCCEEeeccceeeeeccccccCCcccchhh Confidence 554321 111 111 223334433333334444445778998776 5 456678888999999999999999 Q ss_pred ccee---EEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccc Q lcl|Aclame:pro 68 ATFA---PVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTT 144 (311) Q Consensus 68 ~~~~---~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~ 144 (311) .+.+ ..+++.+|.+.-+ |.|.++.+- .-+-..+-.++|..+|++++|..|+.-- ..++ T Consensus 81 vt~~~~~t~t~~ikK~rK~t--TdEAIqlsG--yg~aVgetd~qL~~~iq~kId~d~~t~L---------------ktaT 141 (296) T protein:vir:98 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYG--SNEAVTNTDNALVRQLQKKIRTDFVTAL---------------KTGT 141 (296) T ss_pred heeeecceEEEEeecccccc--CHHHHHhhc--CCchhHHHHHHHHHHHHHhhhHHHHHHH---------------hccc Confidence 8875 3777778877764 999986443 2234577888999999999999998431 1111 Q ss_pred cceee-ccccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCc-eecceeEEeecc Q lcl|Aclame:pro 145 NIVEL-TTGTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVA-SFAGLNAAVSDT 222 (311) Q Consensus 145 ~~~~~-~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~-~l~G~pv~~~~~ 222 (311) ..... +.+-.......+.++...++..+..+.+.++||.+...+++-..-.-+.. .+.+-. .++|..++.|.. T Consensus 142 ~t~~~t~~~lQ~Ala~~~~~l~~~feded~~~~V~FVnP~D~a~ylg~a~it~qt~-----fG~tyl~nfLG~~II~S~k 216 (296) T protein:vir:98 142 GTQDALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTA-----FGLTYLVDFTGTVIISTND 216 (296) T ss_pred ceeeechhhHHHHHHHHhhhhhhhccccCCCceEEEEehHHHHHHhcCCccchhhe-----echhhhhhccccEEEEcCc Confidence 11111 10111111223444445666655556689999999988764221111111 122222 388999999999 Q ss_pred cccccccccccccccccccccc-------eEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccE Q lcl|Aclame:pro 223 VRGGPEAVTASTGVYRTTNPNV-------KAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIG 295 (311) Q Consensus 223 ~~~~~~~~~~~~~~~~~~~~~~-------~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~ 295 (311) +|.+..+.+....+.....+.. ..+..|-.++ |++..+.......+. .++-.+.+.| T Consensus 217 V~~G~~~~T~~~Ni~~ay~~~~~~~l~~~f~~~~d~tgl-IGv~h~~~~~~~t~e------T~~~~~~~lf--------- 280 (296) T protein:vir:98 217 VTKGEIWATVPENIIFAYINPNNSELAKEFNLYGDPTGY-IGMNHFQENTTLTIQ------TLLVSGMLMY--------- 280 (296) T ss_pred CCCceEEEeeecceEEEeecccccchhhhhccccccccc-eEEEeccccceeeeh------hHhHhHHHhc--------- Confidence 9999988877655443322211 1111121111 111111111110000 0111111111 Q ss_pred EecccceEEEEecccC Q lcl|Aclame:pro 296 IMSTDAFAVVRDADES 311 (311) Q Consensus 296 v~~~~a~~~l~~aa~~ 311 (311) .-+++++++.+..++- T Consensus 281 pE~~dgiv~~tI~~~~ 296 (296) T protein:vir:98 281 PERIDGIVKVTLTPGV 296 (296) T ss_pred ccccceEEEEEecCCC Confidence 2345677777775544 No 165 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=98.78 E-value=3.2e-09 Score=67.15 Aligned_cols=274 Identities=9% Similarity=0.051 Sum_probs=156.9 Q ss_pred Cc----ccCCCceEcc---hhHHHHHHHHHHhhchhhhhccee-ecCCC--ceEEEEEeCCceeEEeecC-ccccccccc Q lcl|Aclame:pro 1 MV----ALATGTFQLP---KHLVPGVWQKAQGQSVLARLSMAE-PQEFG--EQQYMTLTAPPRGEVVGEG-AQKSESTAT 69 (311) Q Consensus 1 ma----t~~~g~~~vP---~~~~~~ii~~~~~~s~l~~l~~~~-~~~~~--~~~~p~~~~~~~a~~v~Eg-~~~~~~~~~ 69 (311) |. +..+.|++.- +.+.+.|++...+.-..+++..+. +.+-+ .+.+......+.+.|.+.+ ..+|..+.. T Consensus 21 ~~~~~da~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~~~~d~~~dip~v~~~ 100 (319) T protein:vir:10 21 AGVKQDAAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVTTELSPTDKTFEYMTFDKVGTAQIIADYTDDLPLVDAL 100 (319) T ss_pred ccchhhhhhhhhhHHHHHHHHHHHHHHhhhhcceechhhcccccCCCCceEEEEeeeeccccceeeecCccccccceecc Confidence 21 2223344444 335567888888887888877654 23322 2456666667788898765 457888888 Q ss_pred eeEEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceee Q lcl|Aclame:pro 70 FAPVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVEL 149 (311) Q Consensus 70 ~~~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~ 149 (311) ++......+.++..+.++.+=+........++...-+...++++++++|+.+|+|....+- .|+++..+.... T Consensus 101 ~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~g~-------~GLlN~p~~~~~ 173 (319) T protein:vir:10 101 GTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQLVNRLVFKGSAPHKI-------VSVFNHPNITKI 173 (319) T ss_pred ceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccc-------eeEEeCCCceee Confidence 8888888888888888876644444445567888888999999999999999999543222 223333332222 Q ss_pred ccc--------cccchHHHHHHHHHHHhh---cCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEE Q lcl|Aclame:pro 150 TTG--------TSATPDLAVEAAVGLVLG---DNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAA 218 (311) Q Consensus 150 ~~~--------~~~~~~~~i~~~~~~~~~---~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~ 218 (311) ..+ +....++++..++.++.. ....|..++++|+.+..|.......|..++.-......+..|.+.|.. T Consensus 174 ~~~~~~~~~t~t~~~i~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~t~l~~lk~~~~~l~I~~~pel 253 (319) T protein:vir:10 174 TSGKWIDVSTMKPETAEAELTQAIETIETITRGQHRATNILIPPSMRKVLAIRMPETTMSYLDYFKSQNSGIEIDSIAEL 253 (319) T ss_pred ecCCCCCccccCHHHHHHHHHHHHHHHHHhcCceeeceEEEecHHHHHhhhcccCCCCeeHHHHHHHhcCCceEEEeeee Confidence 111 112345677777777653 344577899999999999765555555444332222122234333332 Q ss_pred eecccccccccccccccccccccccceEEEeec--ceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEec-cE Q lcl|Aclame:pro 219 VSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDF--SAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYG-IG 295 (311) Q Consensus 219 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~--~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~-~~ 295 (311) .. . ...+...+++-.. ..+.+.....++ +.+- .. .. =...+.+..|++ .. T Consensus 254 ~~-----a------------g~~g~~~~v~y~~~~~~~~~~v~~~~~--~~~~-e~---~~----l~~~~~~~~r~~Gv~ 306 (319) T protein:vir:10 254 ED-----I------------DGAGTKGVLVYEKNPMNMSIEIPEAFN--MLPA-QP---KD----LHFKVPCTSKCTGLT 306 (319) T ss_pred cc-----c------------CCCcceEEEEEecCCceEEEecCccee--eeee-ee---cC----ceEEEeeeeeeEEEE Confidence 11 0 0011122222222 122222222222 1111 10 00 112344566664 77 Q ss_pred EecccceEEEEec Q lcl|Aclame:pro 296 IMSTDAFAVVRDA 308 (311) Q Consensus 296 v~~~~a~~~l~~a 308 (311) +.+|.||++++.= T Consensus 307 i~~P~ai~~~dGI 319 (319) T protein:vir:10 307 IYRPMTIVLITGV 319 (319) T ss_pred EEccceeEeeecC Confidence 8999999999977 No 166 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=98.67 E-value=1.1e-09 Score=69.60 Aligned_cols=267 Identities=11% Similarity=-0.014 Sum_probs=140.5 Q ss_pred CcccCC---CceE---cchhHHHHHHHHHHhhchhhhhcceeecCCCc-e---EEEEEeCCceeEEeecCccccccccce Q lcl|Aclame:pro 1 MVALAT---GTFQ---LPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-Q---QYMTLTAPPRGEVVGEGAQKSESTATF 70 (311) Q Consensus 1 mat~~~---g~~~---vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~---~~p~~~~~~~a~~v~Eg~~~~~~~~~~ 70 (311) |+.... ..-+ .--++.+.+-.-+.+..-++...+.+||..|+ + ++|..+....+.-|+||+.||-++.+. T Consensus 1 M~~e~nl~~~~dL~~a~siDF~~~f~~~i~~L~~~LGv~r~~pla~Gt~iktyK~~~~~y~gda~dVaEGe~Iplskvt~ 80 (303) T protein:vir:10 1 MSAENNLINVEALGKAKSIDFANKLGVGLNKLFEALAIQNKIPMNVGSALKQYRFKVEDSEKPNGDVAEGDVIPLTKVTR 80 (303) T ss_pred CCCCcCCcchhhcccceeehhhhhhhhhHHHHHHHhhhhccccccCCceeeeeeeeceeeccccccccCCcccchhhhee Confidence 653321 1112 22334444444444444445445677888765 4 345556678889999999999999886 Q ss_pred e---EEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccce Q lcl|Aclame:pro 71 A---PVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIV 147 (311) Q Consensus 71 ~---~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~ 147 (311) . ..+++.+|.+.-+ |.|.++.+- .-+-..+-.++|.++|++++|..|+.--. + ++... T Consensus 81 ~~~~t~~~~~kK~rK~t--TdEAIqlsG--yg~aVgetd~qL~~~Iq~kIdnd~~~~lk--t-------------aT~t~ 141 (303) T protein:vir:10 81 EQVDITELQFAKYRKST--SAEAIQAHG--YDLAINQTDNEMIKYVQKKFRAKFFETLK--S-------------AIENG 141 (303) T ss_pred eecceEEEEeecccccc--cHHHHHhhc--CCchhHHHHHHHHHHHHhhhhHHHHHHHh--h-------------ccccc Confidence 4 5788888888744 999986442 12345677888999999999999884310 0 11000 Q ss_pred eeccccccchHHHHHHHHHHHh------hcCCCccEEEEcHHHHHHHHHhhccCCc-eeeccccccCC-CceecceeEEe Q lcl|Aclame:pro 148 ELTTGTSATPDLAVEAAVGLVL------GDNLSPDGVALDNTFSFMLATQRDSQGR-KLYPELGFGTD-VASFAGLNAAV 219 (311) Q Consensus 148 ~~~~~~~~~~~~~i~~~~~~~~------~~~~~~~~~v~n~~~~~~l~~lkd~~g~-~~~~~~~~~~~-~~~l~G~pv~~ 219 (311) .. +.......+.+..++.... ..+....+.++||.+...+++-..-+.+ ..| +.+ --.++|..++. T Consensus 142 ~~-t~~t~~s~~glq~Al~~~~~kl~~~~ed~~~~V~FvNP~Daa~yl~~A~i~~~~t~f-----G~n~L~nfLG~~II~ 215 (303) T protein:vir:10 142 KR-TNKTKLSAENLQGALSKGRANLSVLLDDEITPIAFVNPNDTAEYLANGFINSTGAQF-----GVNLLTPYVGVKIVE 215 (303) T ss_pred cc-ccceeecHHHHHHHHHhhhhhccccccccccEEEEEchHHHHHHhhcCCcchhhhhh-----hhhhhhhhhcceEEE Confidence 00 0111122344555544332 1112233799999999988752211111 111 000 12488999999 Q ss_pred ecccccccccccccccccccccccceEEEeecceEE---------EEeecCceEEEeccCCcccchhhhhcCcEEEEEEE Q lcl|Aclame:pro 220 SDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFR---------WGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEV 290 (311) Q Consensus 220 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~---------~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~ 290 (311) +..+|.+..+.+....+.....+. -||.++.. |++..+...+...+. .++-.+.+. T Consensus 216 S~kv~~G~~~~T~~~Ni~~ay~~~----~g~l~~~f~~t~D~tglIGv~h~~~~~~~t~e------T~~~~~~~l----- 280 (303) T protein:vir:10 216 FADVPQGEVWMTVAENLNVAYANP----RGELSRAFAFATDATGFVGVLHDIQPQRLTSD------TIYASAISM----- 280 (303) T ss_pred eccCCCceEEEeeccceEEEEecC----chhhhhhhhhccccccceEEEeccccceeeeh------hHhHhHHHh----- Confidence 999999998887665544322211 12222110 111111111100000 011111111 Q ss_pred EeccEEecccceEEEEe-cccC Q lcl|Aclame:pro 291 VYGIGIMSTDAFAVVRD-ADES 311 (311) Q Consensus 291 r~~~~v~~~~a~~~l~~-aa~~ 311 (311) -.-+++++++.+. +.++ T Consensus 281 ----fpE~~dgiv~~ti~~~e~ 298 (303) T protein:vir:10 281 ----FPENIDAVIKVTIKKDEA 298 (303) T ss_pred ----cccccceEEEEEEecccc Confidence 1234577888886 4554 No 167 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=98.58 E-value=2.9e-08 Score=61.89 Aligned_cols=277 Identities=7% Similarity=0.048 Sum_probs=153.5 Q ss_pred Cccc--CCCceEcc---hhHHHHHHHHHHhhchhhhhcceee-cCCC--ceEEEEEeCCceeEEeecC-cccccccccee Q lcl|Aclame:pro 1 MVAL--ATGTFQLP---KHLVPGVWQKAQGQSVLARLSMAEP-QEFG--EQQYMTLTAPPRGEVVGEG-AQKSESTATFA 71 (311) Q Consensus 1 mat~--~~g~~~vP---~~~~~~ii~~~~~~s~l~~l~~~~~-~~~~--~~~~p~~~~~~~a~~v~Eg-~~~~~~~~~~~ 71 (311) +++. +..+.+.- +.+.+.|+|...+.-..+++..+.. .+-+ .+.+......+.+.|.+.+ ..+|..+..++ T Consensus 28 ~~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~~~~~~~~~~t~~~~~~~G~a~~~~d~~~dip~vd~~~~ 107 (329) T protein:vir:79 28 GAKNDASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPVTSELSDTDKTFEYQTFDKVGHAKIIADYTDDLSTVDALMT 107 (329) T ss_pred cceeccchhhHHHHHHHHHHHHHHHhhhhcccchhhhcccccCCCCceeEEEeeeeecceeeeeecCcccccceeecccc Confidence 1111 11222333 3356778888888888888776542 2222 3566677777788898765 57887777777 Q ss_pred EEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeecc Q lcl|Aclame:pro 72 PVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTT 151 (311) Q Consensus 72 ~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~ 151 (311) ......+.++..+.++.+=+........++...-+...++++++++|+-+|+|....+..+ +++..+...... T Consensus 108 ~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~g~~G-------LlN~p~v~~~~~ 180 (329) T protein:vir:79 108 SEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQLVNHLVFKGSKPHKIIS-------VFEHPNLTTINS 180 (329) T ss_pred eeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEeeccccccee-------eecCCCcccccc Confidence 7777778888888887654444444456788888999999999999999999954332222 222222221111 Q ss_pred ----------ccccchHHHHHHHHHHHhh---cCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEE Q lcl|Aclame:pro 152 ----------GTSATPDLAVEAAVGLVLG---DNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAA 218 (311) Q Consensus 152 ----------~~~~~~~~~i~~~~~~~~~---~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~ 218 (311) .+....++++..++.++.. ....|..++++|+.+..|.......|.-++.-......+-+|.+.|-. T Consensus 181 ~~~~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~~~p~~L~Lpp~~~~~L~~~~~~~~~tvl~~lk~~~~~l~I~~~~el 260 (329) T protein:vir:79 181 AGWNNAAGTGKKPETAQDELEQAIEKIETLTNGQHRANMILIPPSMRKVLMVRMPETTMSYLDYFKQQNGGITIESISEL 260 (329) T ss_pred CCCCCccccccCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHHhhcccCCCCccHHHHHHHhCCCcEEEEcccc Confidence 1222346778888877754 223467899999999988655555555444322211111122222211 Q ss_pred eecccccccccccccccccccccccceEEEeecc--eEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEec-cE Q lcl|Aclame:pro 219 VSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFS--AFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYG-IG 295 (311) Q Consensus 219 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~--~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~-~~ 295 (311) ... ...+...+++-+.+ .+.+.....+ .+.+- +.. .=...+....|++ .. T Consensus 261 --------~~a---------g~~g~~~~v~y~~~~~~~~~~vp~~~--~~l~~-q~~-------~~~~~v~~~~r~~Gv~ 313 (329) T protein:vir:79 261 --------EDI---------DGAGTKAALVYEKDPMNMSIEIPEAF--NMLTA-QPK-------DLHFKVPCTSKCTGLT 313 (329) T ss_pred --------ccc---------CCCCceEEEEEecCCceEEEecCcce--eeeec-eec-------CceEEEceeeeEEEEE Confidence 100 01122333333332 2222222222 22211 110 0112334456664 78 Q ss_pred EecccceEEEEecccC Q lcl|Aclame:pro 296 IMSTDAFAVVRDADES 311 (311) Q Consensus 296 v~~~~a~~~l~~aa~~ 311 (311) +.+|.||+++..=--- T Consensus 314 i~~P~ai~~~dGI~~~ 329 (329) T protein:vir:79 314 IYRPLTLVLIKGLVVG 329 (329) T ss_pred EECcceeeeeeeeeeC Confidence 8899999999843333 No 168 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=98.43 E-value=2.7e-07 Score=56.62 Aligned_cols=289 Identities=12% Similarity=0.003 Sum_probs=130.1 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhccee---ec---CCCceEEEEEeCCceeEE-----eecCccccccccc Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAE---PQ---EFGEQQYMTLTAPPRGEV-----VGEGAQKSESTAT 69 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~---~~---~~~~~~~p~~~~~~~a~~-----v~Eg~~~~~~~~~ 69 (311) |+. ..++|+.|..++++.+++..++..++..- .. .+..++||+... ..+.+ .+++.++...+.+ T Consensus 1 Ma~----~~~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 75 (392) T protein:vir:99 1 MAN----AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLTVSDFT 75 (392) T ss_pred Ccc----ccccHHHHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeeccc-ccceeeeccccccCCcccccccc Confidence 773 45899999999999999999988887431 21 234588887543 22222 2344555555555 Q ss_pred eeEEEEee-eeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCcccccccccccccccccee Q lcl|Aclame:pro 70 FAPVTAIP-RKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVE 148 (311) Q Consensus 70 ~~~v~l~~-~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~ 148 (311) -..+++.. +..+.-+.|+++-..+ +..++...+.++..+++++++|..++.-- .+. ..... . . T Consensus 76 ~~~~~~~id~~k~~~~~i~d~e~~~---~~~~~~~~~~~~a~~ala~~vd~~i~~~~---~~a-----~~~~~--~---~ 139 (392) T protein:vir:99 76 EDSFPVTLTDVAYHLGVLTDEELTF---DLESFATQILPRQVRGVADILEEGVRDMI---VGA-----PYEAA--G---A 139 (392) T ss_pred cceEEEEEeeeeecceeechHHHhh---hhhhhHHHHHHHHHHHHHHHHHHHHHHHH---hcc-----ccccc--c---c Confidence 55555554 2233445666663322 23456667778888999999998876321 000 00000 0 0 Q ss_pred eccccccchHHHHHHHHHHHhhcCCCcc-EEEEcHHHHHHHHHhhc-cCCceee---ccccccCCCceecceeEEeeccc Q lcl|Aclame:pro 149 LTTGTSATPDLAVEAAVGLVLGDNLSPD-GVALDNTFSFMLATQRD-SQGRKLY---PELGFGTDVASFAGLNAAVSDTV 223 (311) Q Consensus 149 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~-~~v~n~~~~~~l~~lkd-~~g~~~~---~~~~~~~~~~~l~G~pv~~~~~~ 223 (311) ....+....|+.+.++...|...+.... .+++.|..+..|.+... .+-.+.- ......+..+++.|++|+.++.+ T Consensus 140 ~~~~~~~~~~~~i~~a~~~L~~~~vP~~R~~vv~p~~~~~l~~~~~~~~~~~~g~~~~~~l~~G~vg~i~G~~v~~s~~~ 219 (392) T protein:vir:99 140 VHEVAPDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLI 219 (392) T ss_pred ccccChhhhHHHHHHHHHHHhhcCCCCCCEEEEcHHHHHHHhcccceeecccccchhhhhhhcceeeeeeeeEEEeeccc Confidence 1111223457778888888776665433 47889998888764310 0000110 01122455689999999999998 Q ss_pred ccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEE--EEEEEEeccEEecccc Q lcl|Aclame:pro 224 RGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIA--IRAEVVYGIGIMSTDA 301 (311) Q Consensus 224 ~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~--~ra~~r~~~~v~~~~a 301 (311) |................. ......+.-..........+........+.....+-+.-+.+. .......+........ T Consensus 220 ~~~t~~a~~~~a~~~at~-a~v~~~~~~~~~s~s~~~~v~~~~~~~~~~t~~s~~~~v~~~~g~~~v~~~~~~~~~~~~~ 298 (392) T protein:vir:99 220 PHGDAYLYHPTAFIMATR-APAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARK 298 (392) T ss_pred ccccceeeeccccccccc-cccccccccceeEEecccceecceeecccceeeccccccceeEEEEEEeeccccceeeeee Confidence 866543322111100000 0000000000000000000111000000000000000000000 0000001111111101 Q ss_pred eE------EEEe---cccC Q lcl|Aclame:pro 302 FA------VVRD---ADES 311 (311) Q Consensus 302 ~~------~l~~---aa~~ 311 (311) +. .+.. ...+ T Consensus 299 ~~~~~~~v~v~~v~~~~~~ 317 (392) T protein:vir:99 299 IHLIPGSIEVAPEAGANAT 317 (392) T ss_pred eeeecceeeeeeeecccce Confidence 10 0110 0000 No 169 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=98.37 E-value=5.4e-07 Score=54.93 Aligned_cols=265 Identities=11% Similarity=-0.005 Sum_probs=129.3 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceee----cC-CCceEEEEEeCCceeEEeecCccccccccceeEEEE Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEP----QE-FGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTA 75 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~----~~-~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l 75 (311) |++ ..+..+-|+-|..++++.+++..++.+++..-. .. +.++++|+... .-+.++..+...+.+-..+++ T Consensus 1 m~~-~~N~~ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp~~----~~v~dg~~~~~~~~te~~v~l 75 (418) T protein:vir:10 1 MAV-QDNNLLTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKLPYR----VKSASGRTLVKQPMVDQTIPF 75 (418) T ss_pred CCc-cccccccHHHHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEeeCCc----eeecccCCccccccccceEEE Confidence 555 467777899999999999999999988876421 12 24688887332 223344445544444444444 Q ss_pred ee-eeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccc Q lcl|Aclame:pro 76 IP-RKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTS 154 (311) Q Consensus 76 ~~-~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 154 (311) .. +....-+.++++=..++ ..++.+.+.+...++++.++|..++.-- . + ..+.... .+.. T Consensus 76 ~id~~k~~~~~itD~e~a~~---~~d~~~~~l~~A~~aLA~~vD~~ia~l~---~-----~-------a~~~~gt-~gt~ 136 (418) T protein:vir:10 76 KIAYQEHVGLEYTVKDKTLD---IMQFSERYLKSGMVQIANQIDRSLALTL---K-----K-------AFHSSGT-PGVR 136 (418) T ss_pred EEecccccceeechHHHhhh---hhHHHHHHHHHHHHHHHHHHHHHHHHHH---h-----h-------ccccccc-CCcC Confidence 43 22233455655532222 2356667777788999999998876320 0 0 0111111 1122 Q ss_pred cchHHHHHHHHHHHhhcCCCc--cE-EEEcHHHHHHHHHhhccCCceeecc-----ccccCCCceecceeEEeecccccc Q lcl|Aclame:pro 155 ATPDLAVEAAVGLVLGDNLSP--DG-VALDNTFSFMLATQRDSQGRKLYPE-----LGFGTDVASFAGLNAAVSDTVRGG 226 (311) Q Consensus 155 ~~~~~~i~~~~~~~~~~~~~~--~~-~v~n~~~~~~l~~lkd~~g~~~~~~-----~~~~~~~~~l~G~pv~~~~~~~~~ 226 (311) ...|+++.++...|...+... .+ .+++|..+..|.+ +.. ..+.. ....+..+++.|+.|+.++.+|.. T Consensus 137 ~~~~~~i~~a~~~Ld~~~VP~~G~R~lVv~P~~~~~L~~--~~~--~~~~~~~~~~~lr~G~IG~i~GF~V~~S~nip~~ 212 (418) T protein:vir:10 137 PGAFIDFANAGAKQTTYAVPQDGMRHAVLDPFTCASLSD--EVT--KLFKESMVEQAYKMGYRGNVAAYEVYESQNLPKH 212 (418) T ss_pred cchHHHHHHHHHHHHhcCCCCCCceEEEeCHHHHHHHhh--hcc--ccccccccchhhheeeeeeeeceEEEEecCCCcc Confidence 345788888888887777652 24 5789988877643 221 12221 122445688999999999999954 Q ss_pred ccccccccc-ccccc-cccceEEEeecceE-EEEeecC-ceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccce Q lcl|Aclame:pro 227 PEAVTASTG-VYRTT-NPNVKAIAGDFSAF-RWGVQVS-IPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAF 302 (311) Q Consensus 227 ~~~~~~~~~-~~~~~-~~~~~~~~gd~~~~-~~~~~~~-~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~ 302 (311) ......... ..+.. ......+.++.... ......+ +++. .. +.-+.+... ...++.-| T Consensus 213 tag~~~~t~~v~ga~~~~~~~~~~~~t~s~~g~l~~Gd~~ti~---gv--------~~v~~~t~~-------~~~~~~~f 274 (418) T protein:vir:10 213 TVGDHGGTPLVNGTVVNGDTVGFDGGTASTTGFLKAGDVITFG---GV--------FGVNPQNYE-------TTGLLQEF 274 (418) T ss_pred cccccccceeeecccccceeEEEeecceeeccceeeccEEEEC---ce--------eeccccccc-------ccccceEE Confidence 332111100 10110 00111112222110 0000000 0000 00 000000000 01122333 Q ss_pred EEEEecccC Q lcl|Aclame:pro 303 AVVRDADES 311 (311) Q Consensus 303 ~~l~~aa~~ 311 (311) ++...++++ T Consensus 275 ~V~~~~~~~ 283 (418) T protein:vir:10 275 VVLEDVDTD 283 (418) T ss_pred EEEeecccc Confidence 333322110 No 170 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=98.21 E-value=1.2e-07 Score=58.49 Aligned_cols=275 Identities=9% Similarity=0.019 Sum_probs=147.1 Q ss_pred CcccCCCceEcc----hhHHHHHHHHHHhhchhhhhcceeecCC---CceEEEEEeCCceeEEeecCccccccc--ccee Q lcl|Aclame:pro 1 MVALATGTFQLP----KHLVPGVWQKAQGQSVLARLSMAEPQEF---GEQQYMTLTAPPRGEVVGEGAQKSEST--ATFA 71 (311) Q Consensus 1 mat~~~g~~~vP----~~~~~~ii~~~~~~s~l~~l~~~~~~~~---~~~~~p~~~~~~~a~~v~Eg~~~~~~~--~~~~ 71 (311) |-+....| || +.+.+.|++...+....+.+.++.+.+. ..+.++..+..+.+.+.+.+++.|..+ ..+. T Consensus 46 ~~~~~~~~--i~a~~~~~i~~~vy~~~~~~~~~~~l~pv~t~g~w~~~t~~y~~~e~~G~a~~ygd~ad~Pl~~~~v~~~ 123 (339) T protein:vir:94 46 LQTTANAG--IPAWMTTFVDRRVIDIQLAPMAAAKIFPEVKKGDWTTTYGVFIIAEPVGQVATYSDWSANGMSKANVNFE 123 (339) T ss_pred cccccccc--hhhhhhhhhchhheeecccccchhhhcccccCCCCcccEEEEeeeecccceEEcccccCCCcccccceee Confidence 22222222 44 3344677777888888888888776653 246788888888999999998888665 5566 Q ss_pred EEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeecc Q lcl|Aclame:pro 72 PVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTT 151 (311) Q Consensus 72 ~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~ 151 (311) +.++..+.++-... ..|+-+ ......++.+.-+....+++.+++|+..++|....+-.+.-+-|+.....+......+ T Consensus 124 ~~~v~~~~~g~~y~-~~E~~~-A~~~g~~l~~~Ka~aA~~al~~~~N~i~~~Gd~~~~~~GLlN~P~l~~~v~~s~~Wa~ 201 (339) T protein:vir:94 124 SRQNYRYQTWTEYG-DLEMAT-YGEAGIDYVARQEISASLVMAKFANSSYLLGVAGIANYGLMNDPSLPAPVAATVNWAT 201 (339) T ss_pred EEeEEEEEEEEeec-HHHHHH-HHhhCCChHHHHHHHHHHHHHHhhceEEeeeecccceEEEEeCCCccccccCCCCccc Confidence 66666655555444 244432 3334567888888999999999999999999654332222222221111111111222 Q ss_pred ccccchHHHHHHHHHHHhhcCC------CccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccccc Q lcl|Aclame:pro 152 GTSATPDLAVEAAVGLVLGDNL------SPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRG 225 (311) Q Consensus 152 ~~~~~~~~~i~~~~~~~~~~~~------~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~ 225 (311) .+....++++..++.++...-. .+..++|.|+.+..|.+. +..|..++.-.... +.++.++. +|+ T Consensus 202 kT~~eI~~Di~~~~~~l~~~s~g~~~~~~~~~L~LP~~~~~~L~~~-n~~~~Tvl~~lk~n-----~pnl~i~~---~~e 272 (339) T protein:vir:94 202 AAPEDIANDVVAMVGRLISQSGGLITGQERMVMALAPSALNNVNRT-NNFGLSAGAKIAQT-----YPNIQFVA---VPE 272 (339) T ss_pred CCHHHHHHHHHHHHHHHHHhcCCeeeeccCcEEEecHHHHHhcccC-CcCCccHHHHHHHh-----cCCcEEEE---ccc Confidence 3333456778888887754322 244699999999988643 33344333211111 11222322 222 Q ss_pred ccccccccccccccccccceEEEeec---ceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEE-eccEEecccc Q lcl|Aclame:pro 226 GPEAVTASTGVYRTTNPNVKAIAGDF---SAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVV-YGIGIMSTDA 301 (311) Q Consensus 226 ~~~~~~~~~~~~~~~~~~~~~~~gd~---~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r-~~~~v~~~~a 301 (311) ... ...+...+++-.. ....+.....++ ..+- . ...-....-+..| .|..+++|.| T Consensus 273 l~~----------a~g~~~~~~~~~~~~~~~~~~~~p~~~~--~lpv-q-------~~~~~~~v~~~~rt~Gv~i~~P~a 332 (339) T protein:vir:94 273 FDT----------ASGRLVQLWVPEVNGQPTGEVAFAEKLR--SHSI-E-------RYSTTTRQKHSGATFGAVIYQPWA 332 (339) T ss_pred ccc----------CCCceEEEEEEeccCCcceEEEcchhhh--cccc-E-------EcCceEEecceeeeeeEEEEccce Confidence 111 0111111221111 111111111111 1110 0 0111234455667 5677888999 Q ss_pred eEEEEec Q lcl|Aclame:pro 302 FAVVRDA 308 (311) Q Consensus 302 ~~~l~~a 308 (311) |++++.= T Consensus 333 i~~~~GI 339 (339) T protein:vir:94 333 VTQELGV 339 (339) T ss_pred eeeeecC Confidence 9999977 No 171 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=98.09 E-value=2.1e-06 Score=51.73 Aligned_cols=276 Identities=11% Similarity=-0.029 Sum_probs=137.4 Q ss_pred Cc-ccCCCceEcchhHHHHHHHHHHh-----hchhhhhcceeecCC-CceEEEEEeCCceeEEeecCccccccccceeEE Q lcl|Aclame:pro 1 MV-ALATGTFQLPKHLVPGVWQKAQG-----QSVLARLSMAEPQEF-GEQQYMTLTAPPRGEVVGEGAQKSESTATFAPV 73 (311) Q Consensus 1 ma-t~~~g~~~vP~~~~~~ii~~~~~-----~s~l~~l~~~~~~~~-~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v 73 (311) || +-+++. -|.-+.+-+-..+++ ....++.|....++. ...+..+...-++..-|.|++++.-....=+.- T Consensus 394 ~a~~htTSD--Fp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~~~~L~~V~E~gEyk~~t~~e~~e 471 (693) T protein:vir:95 394 LAFTHTSSD--FGLILLDVANKSVLAGWEEAEETFPLWTKSGILTDFKPARRVGLGEFSSLRQVREGAEYKYVTLGERGE 471 (693) T ss_pred HHHhcCcch--hHHHHHHHHHHHHHHHHHhhhhHHHHHhccCCCCcccccceeecCCCCChhhcCCCCceeeeecCCccc Confidence 22 122222 243332222222222 233555555443332 223344455667778899999987655543445 Q ss_pred EEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCcccccccccccccc--ccceeecc Q lcl|Aclame:pro 74 TAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDT--TNIVELTT 151 (311) Q Consensus 74 ~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~--~~~~~~~~ 151 (311) ++...+++.++.||+|.+- .+.++....+-..++++.++.+++.++.=-. .......|. .+.+. .|..+.+ T Consensus 472 ~~~l~tyG~~~~iTRqaiI---NDDLga~~~ip~~~g~aA~~~~~~~vy~~L~-~Np~m~DGk--~LFhadH~Nl~tga- 544 (693) T protein:vir:95 472 QIILATYGELFSITRQAII---NDDLQMLSDIPFKLGQAAKATIGDLVYAVLT-GNPAMSDGK--TLFHADHSNLLTGA- 544 (693) T ss_pred eeehhhcCCeeeecHHhhh---ccchHHHHHHHHHHHHHHHHHHHHHHHHHHh-cCccccCCc--ceeecccccccccc- Confidence 6678888999999999873 3346777888888998888888776552100 011111221 12222 2222111 Q ss_pred ccccchHHHHHHHHHHHh------------hcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecce-eEE Q lcl|Aclame:pro 152 GTSATPDLAVEAAVGLVL------------GDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGL-NAA 218 (311) Q Consensus 152 ~~~~~~~~~i~~~~~~~~------------~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~-pv~ 218 (311) ......+.+..+..++. .-+..|..|+..+......+++..+...|-- ....+...-+.|+ .++ T Consensus 545 -~sals~~sl~~a~~am~~qk~~~~~~~g~~L~i~P~~llvP~~le~~a~~l~~s~~~~~a--~~~~~~~NP~~~~~~vi 621 (693) T protein:vir:95 545 -ASALSIDSLSKAKTQMATQKAQVEKGKGRTLNIRPGFVLTPVALEDKANQIINSESVPGA--DVNSGIVNPIRAFAQVI 621 (693) T ss_pred -ccccChHHHHHHHHHHHHhhcchhccCCceeecccceEEecchHHHHHHHHhcccccccc--ccccccccchhcccccc Confidence 11122233333322221 1234566688877777777776544322210 0011111113343 344 Q ss_pred eecccccc--cccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEE Q lcl|Aclame:pro 219 VSDTVRGG--PEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGI 296 (311) Q Consensus 219 ~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v 296 (311) ....+... ..|..... -....+.++-+ .-.+...++... -|..+-+.+|++..+|.++ T Consensus 622 ~~prL~~~s~~~Wyl~a~------~~~dtie~~yL-----~G~~~P~ie~~~---------gf~~dG~~~kvr~D~G~~~ 681 (693) T protein:vir:95 622 GEPRLDDASATAWYMAAK------KGSDTIEVAYL-----DGVDTPYLEQQE---------GFTVDGVASKVRIDAGVAP 681 (693) T ss_pred ccceecCCCCCceEEecC------CCCCeEEEEEe-----cCCCCCeEeecC---------CCCcceEEEEEEEeccCce Confidence 44444211 11111000 00012222221 122333333221 2888999999999999999 Q ss_pred ecccceEEEEec Q lcl|Aclame:pro 297 MSTDAFAVVRDA 308 (311) Q Consensus 297 ~~~~a~~~l~~a 308 (311) +|=..+++-..| T Consensus 682 iD~Rg~~kn~GA 693 (693) T protein:vir:95 682 LDFRGLQKSNGA 693 (693) T ss_pred eeccccccCCCC Confidence 998888887777 No 172 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=97.98 E-value=4.8e-07 Score=55.23 Aligned_cols=276 Identities=13% Similarity=0.047 Sum_probs=143.4 Q ss_pred CcccCCCceEcchhHHH----HHHHHHHhhchhhhhcceeecCCC---ceEEEEEeCCceeEEeecCccccccccceeEE Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVP----GVWQKAQGQSVLARLSMAEPQEFG---EQQYMTLTAPPRGEVVGEGAQKSESTATFAPV 73 (311) Q Consensus 1 mat~~~g~~~vP~~~~~----~ii~~~~~~s~l~~l~~~~~~~~~---~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v 73 (311) |++++.+| ||..+.+ .+++.+.+......+..+.+.+.- ...+++....+.+.+.+.+.+.|..+...+.. T Consensus 42 ~~~~~~~~--~~~~l~~~i~p~~~~~~~~~~~~~~l~pv~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~~d~~~~~~ 119 (336) T protein:vir:36 42 LSSTGSSG--IPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYP 119 (336) T ss_pred cccCCCcc--hHHHHHHhhccceEeeecchhhhhhhccccccCCccceeEEEeeeeceeeEEEeeccCCCceeeccccee Confidence 33333333 5654432 455556666666666666554432 24556666677888889888999888666666 Q ss_pred EEeeeeEEEEEeec-HHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccc-cceeecc Q lcl|Aclame:pro 74 TAIPRKVQVTQRFS-QEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTT-NIVELTT 151 (311) Q Consensus 74 ~l~~~kl~~~i~iS-~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~-~~~~~~~ 151 (311) +-..+.++..+.++ .|+.+ ......++.+.-+...++++.+++++-.++|.....-.+.-+-|+.....+ ....... T Consensus 120 ~~~v~~~~~g~~yg~~E~~~-Aa~~~~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~~~~yGllNdP~l~a~~t~~t~~~~~ 198 (336) T protein:vir:36 120 QRQSYFFQTWTRWGERELEM-AGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGS 198 (336) T ss_pred eeeEEEEEeeeeeCHHHHHH-HHHhCCCcHHHHHHHHHHHHHHhhCcEEEEeccccceEEEEecCCCccccccCCCcccc Confidence 66678888888998 55543 334567788888888999999999998888854332222222121110010 0000111 Q ss_pred ccccchHHHHHHHHHHHhhcC------CCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccccc Q lcl|Aclame:pro 152 GTSATPDLAVEAAVGLVLGDN------LSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRG 225 (311) Q Consensus 152 ~~~~~~~~~i~~~~~~~~~~~------~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~ 225 (311) .+....++++..++..+.... ..+..++|.++.+..|.+ ++..|..++.-... .+-++.++ ..|. T Consensus 199 ~t~~ei~~Di~~~~~~l~~qt~G~i~~~~~~tL~LP~~~~~~Ls~-~n~~g~Tvl~~lk~-----n~Pnl~i~---t~pE 269 (336) T protein:vir:36 199 PAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAKLKD-----IFPKLEFV---TIPE 269 (336) T ss_pred cCHHHHHHHHHHHHHHHHHhcCCeeeeccccEEEechHHHHhccC-CCccCccHHHHHHH-----hcCccEEE---Eccc Confidence 112335778888888776522 246679999998888854 23333333211111 01112222 1222 Q ss_pred ccccccccccccccccccceEEEeecce---EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEec-cEEecccc Q lcl|Aclame:pro 226 GPEAVTASTGVYRTTNPNVKAIAGDFSA---FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYG-IGIMSTDA 301 (311) Q Consensus 226 ~~~~~~~~~~~~~~~~~~~~~~~gd~~~---~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~-~~v~~~~a 301 (311) ... ..+....+++-+... ..+...+.++. .+- . ...-.....+..|++ ..+.+|.| T Consensus 270 l~~----------a~g~~~~l~~~~~~~~~t~~~~~p~~~~~--l~v----q----~~~~~~~v~~~~rt~Gv~i~~P~a 329 (336) T protein:vir:36 270 YDT----------ASGRLVQLWAPRVEGKDTATCGFTEKMRA--HSI----E----RYSSYFRQKKSAGTWGAVIFRPFA 329 (336) T ss_pred ccc----------CCCceEEEEEEecCCCcceeeecchhhhc--cce----e----ecCceeEeccccceeeeeeeccch Confidence 111 111111222211111 11111111110 000 0 011123344556654 45677999 Q ss_pred eEEEEec Q lcl|Aclame:pro 302 FAVVRDA 308 (311) Q Consensus 302 ~~~l~~a 308 (311) |++++.= T Consensus 330 i~~~~GI 336 (336) T protein:vir:36 330 VAQMIGV 336 (336) T ss_pred heeeecC Confidence 9999977 No 173 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=97.90 E-value=8.9e-07 Score=53.75 Aligned_cols=276 Identities=13% Similarity=0.057 Sum_probs=143.3 Q ss_pred CcccCCCceEcchhH---H-HHHHHHHHhhchhhhhcceeecCCC---ceEEEEEeCCceeEEeecCccccccccceeEE Q lcl|Aclame:pro 1 MVALATGTFQLPKHL---V-PGVWQKAQGQSVLARLSMAEPQEFG---EQQYMTLTAPPRGEVVGEGAQKSESTATFAPV 73 (311) Q Consensus 1 mat~~~g~~~vP~~~---~-~~ii~~~~~~s~l~~l~~~~~~~~~---~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v 73 (311) |.+++.+| ||..+ . +.+++.+.+......+..+.+.+.- ...+++....+.+.+.+.+.+.|..+...+.. T Consensus 42 ~~~~~~~~--i~~~l~~~i~p~~~~~~~~p~~a~~l~pv~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~~d~~~~~~ 119 (336) T protein:vir:10 42 LSSTGSSG--IPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYP 119 (336) T ss_pred cccCCCch--hHHHHHhhcccceeeehhhhhhhhhhccccccCCccceeEEEeeeeceeeEEEeeccCCCceeeccccee Confidence 33333333 55432 2 4455656666666667666554432 24556666677888889888999888666666 Q ss_pred EEeeeeEEEEEeec-HHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCcccccccccccccccc-ceeecc Q lcl|Aclame:pro 74 TAIPRKVQVTQRFS-QEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTN-IVELTT 151 (311) Q Consensus 74 ~l~~~kl~~~i~iS-~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~-~~~~~~ 151 (311) +-..+.++..+.++ .|+- .......++.+.-+...++++.+++++-.++|.....-.+.-+-|+.....+. ...... T Consensus 120 ~~~v~~~~~g~~yg~~El~-~A~~~g~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~~~~yGllN~P~l~a~~t~~t~~~~~ 198 (336) T protein:vir:10 120 QRQSYFFQTWTRWGERELE-MAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGS 198 (336) T ss_pred eeeEEEEEeeeeeCHHHHH-HHHHhCCCcHHHHHHHHHHHHHHhhCcEEEEeccccceEEEEeCCCCccccccCCCcccc Confidence 66678888888998 4554 33445677888889999999999999988888543322222222221100110 000111 Q ss_pred ccccchHHHHHHHHHHHhhcC------CCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccccc Q lcl|Aclame:pro 152 GTSATPDLAVEAAVGLVLGDN------LSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRG 225 (311) Q Consensus 152 ~~~~~~~~~i~~~~~~~~~~~------~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~ 225 (311) .+....++++..++..+...- ..+..++|.++.+..|.+ ++..|..++.-... .+-++.++. .|. T Consensus 199 ~t~eei~~Di~~~~~~l~~qs~G~i~~~~~~tL~LP~~~~~~Ls~-~n~~g~Tvl~~lk~-----n~Pnl~i~t---~pE 269 (336) T protein:vir:10 199 PAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAKLKD-----IFPKLEFVT---IPE 269 (336) T ss_pred cCHHHHHHHHHHHHHHHHHhcCCeecccCcceEEecHHHHHhccC-CCccCccHHHHHHH-----hcCccEEEE---ccc Confidence 112335777888888776522 246789999998888854 23333333211111 111122221 222 Q ss_pred ccccccccccccccccccceEEEeecce---EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEec-cEEecccc Q lcl|Aclame:pro 226 GPEAVTASTGVYRTTNPNVKAIAGDFSA---FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYG-IGIMSTDA 301 (311) Q Consensus 226 ~~~~~~~~~~~~~~~~~~~~~~~gd~~~---~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~-~~v~~~~a 301 (311) ... ..+....+++-+... ..+...+.++. .+- . ...-.....++.|++ ..+.+|.| T Consensus 270 l~~----------a~G~~~~l~~~~~~~~~t~~~~~p~~~~~--l~v----q----~~~~~~~v~~~~rt~Gv~i~~P~a 329 (336) T protein:vir:10 270 YDT----------ASGRLVQLWAPRVEGKDTATCGFTEKMRA--HSI----E----RYSSYFRQKKSAGTWGAVIFRPFA 329 (336) T ss_pred ccc----------CCCceEEEEEEecCCCcceeeecchhhhc--cce----e----ecCceeEeccccceeeeeeeccch Confidence 110 111111222221111 11111111110 000 0 011123344556654 45677999 Q ss_pred eEEEEec Q lcl|Aclame:pro 302 FAVVRDA 308 (311) Q Consensus 302 ~~~l~~a 308 (311) |++++.= T Consensus 330 i~~~~GI 336 (336) T protein:vir:10 330 VAQMIGV 336 (336) T ss_pred heeeecC Confidence 9999977 No 174 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=97.89 E-value=1.2e-05 Score=47.47 Aligned_cols=278 Identities=9% Similarity=-0.006 Sum_probs=140.7 Q ss_pred Cc-ccCCCceEcchhHHHHHHHHHHh-----hchhhhhcceeecCC-CceEEEEEeCCceeEEeecCccccccccceeEE Q lcl|Aclame:pro 1 MV-ALATGTFQLPKHLVPGVWQKAQG-----QSVLARLSMAEPQEF-GEQQYMTLTAPPRGEVVGEGAQKSESTATFAPV 73 (311) Q Consensus 1 ma-t~~~g~~~vP~~~~~~ii~~~~~-----~s~l~~l~~~~~~~~-~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v 73 (311) +| +-+|+. .|.-+.+-+-..+++ ....++.|...+++- ...+..+..+-++..-|.|++++......=+.. T Consensus 359 ~A~~hsTsD--Fp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~~~~L~~V~E~gEyk~~t~~e~~e 436 (652) T protein:vir:79 359 AAFTHSTSD--FGNILLDVANKAILQGWEDAPETYEQWTRKGQLSDFKIAHRVGMGGFSALRQVREGAEYKYVTTGDKQA 436 (652) T ss_pred HHhhcCcch--HHHHHHHHHHHHHHHHHhhhHHHHHHHhccCCCccccccceeecCCCCCccccCCCCccceeeecCccc Confidence 22 223333 354333333222221 234666666555442 234555666778888999999998876655667 Q ss_pred EEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhh---hcccCCCccccccccccccccccceeec Q lcl|Aclame:pro 74 TAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGI---HGINPLTGAALSGSPAKILDTTNIVELT 150 (311) Q Consensus 74 ~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l---~G~~~~~g~~~~~~~~~~~~~~~~~~~~ 150 (311) ++...+++.++.||+|.+- -+.++....|-..++++-++.+++.++ .+ |+.-....+.++.. .+..|....+ T Consensus 437 ~~~l~tyG~~~~iTRqaiI---NDDL~a~~~ip~~~g~aA~~~~~~~vy~~l~~-Np~~~~DGk~LF~h-A~H~Nl~~~a 511 (652) T protein:vir:79 437 TIALATYGELFSITRQAII---NDDLNMLTDVPMKLGRAAKSTIADLVYAILTS-NPKISTDNVSLFDK-AKHANVLESA 511 (652) T ss_pred eeeeecccCeeeeehheee---ccchhHHHHHHHHHHHHHHHHHHHHHHHHHhc-CcccccCCceeecc-cccccccccc Confidence 8889999999999999773 334677788888888888888876654 22 11110011111100 1122322221 Q ss_pred cccccchHHHHHHHHHHHhh----cCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecce-eEEeeccccc Q lcl|Aclame:pro 151 TGTSATPDLAVEAAVGLVLG----DNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGL-NAAVSDTVRG 225 (311) Q Consensus 151 ~~~~~~~~~~i~~~~~~~~~----~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~-pv~~~~~~~~ 225 (311) .-+.........++..-.. -+..|..|+..+.....-+++..+...+- .....+...-+.|+ .++++..+.. T Consensus 512 -a~~~~~l~~ar~aM~~Qk~g~~~l~i~P~~llvp~~le~~a~~ll~s~~v~~--a~~~~~~~Np~~~~~~~i~eprL~~ 588 (652) T protein:vir:79 512 -AMDVASLDKARQLMRVQKEGERHLNIRPAFVLVPTAMESVANQVIRSSSVKG--ADINAGIINPVKDFATVIAEPRLDD 588 (652) T ss_pred -cCCHHHHHHHHHHHHHhccCCccccccccEEEecchhHHHHHHHhccCCCcc--cccccccccccccccccccccccCC Confidence 1111122222222222221 22345667777777766666653321111 00011111123333 4444444432 Q ss_pred ccccccccccccccccc-cceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEE Q lcl|Aclame:pro 226 GPEAVTASTGVYRTTNP-NVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAV 304 (311) Q Consensus 226 ~~~~~~~~~~~~~~~~~-~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~ 304 (311) ..... -+..... ...+-++-+ .-.+...++... -|..+-+.+|++..+|.+++|=-.+++ T Consensus 589 ~s~~~-----wylaa~~~~dtiev~yL-----~G~~~P~ie~~~---------gf~~dG~~~kvrlD~G~~~iD~RG~~k 649 (652) T protein:vir:79 589 NSQTT-----FYLAASKGSDTIEVAYL-----NGVDTPYIDQME---------GFSVDGVTTKVRIDAGVAPVDHRGLVK 649 (652) T ss_pred CCccc-----EEEecCCCCCeEEEEEe-----cCCCCCeeeecC---------CCCcceEEEEEEEeccCceeeccceee Confidence 21100 0000000 111222211 112333333211 288899999999999999999999888 Q ss_pred EEe Q lcl|Aclame:pro 305 VRD 307 (311) Q Consensus 305 l~~ 307 (311) .+- T Consensus 650 ~t~ 652 (652) T protein:vir:79 650 CTA 652 (652) T ss_pred ecC Confidence 866 No 175 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=97.73 E-value=2.4e-06 Score=51.35 Aligned_cols=277 Identities=13% Similarity=0.036 Sum_probs=146.0 Q ss_pred CcccCCCceEcchhH---H-HHHHHHHHhhchhhhhcceeecCCC---ceEEEEEeCCceeEEeecCccccccccceeEE Q lcl|Aclame:pro 1 MVALATGTFQLPKHL---V-PGVWQKAQGQSVLARLSMAEPQEFG---EQQYMTLTAPPRGEVVGEGAQKSESTATFAPV 73 (311) Q Consensus 1 mat~~~g~~~vP~~~---~-~~ii~~~~~~s~l~~l~~~~~~~~~---~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v 73 (311) |.|.+.+| ||..+ . ..+++.+.+......+..+.+.+.- .+.++.....+.+.+.+.+.+.|..+...+.. T Consensus 42 ~~t~~~~g--~~~~l~~~i~p~~~~~~~~~~~~~~l~~v~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~vd~~~~~~ 119 (336) T protein:vir:78 42 LSSTGSSG--IPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYP 119 (336) T ss_pred cccCCCcc--hHHHHHHhcccceeeehhhhhhhhhhcccccCCCccccEEEEeeeecceeeEEeecccCCCeeecceeeE Confidence 33333333 45433 2 3555566666666666666555432 34666767778888889999999988888888 Q ss_pred EEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccc-cceeeccc Q lcl|Aclame:pro 74 TAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTT-NIVELTTG 152 (311) Q Consensus 74 ~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~-~~~~~~~~ 152 (311) .-..+.++..+.++.+=+........++.+.-+...++++.+++++-.++|.....-.+.-+-|......+ ........ T Consensus 120 ~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~~~~~~GllN~P~l~a~~t~~~~~w~~~ 199 (336) T protein:vir:78 120 QRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSP 199 (336) T ss_pred EEEEEEEEeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCeEEEEeccccceEEEEeCCCCCcccccCcCccccc Confidence 88888889999998554444445567788888888999999999998888854332222222121110011 00001111 Q ss_pred cccchHHHHHHHHHHHhhcCC------CccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccc Q lcl|Aclame:pro 153 TSATPDLAVEAAVGLVLGDNL------SPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGG 226 (311) Q Consensus 153 ~~~~~~~~i~~~~~~~~~~~~------~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~ 226 (311) +....++++..++..+...-. .+..++|.+..+..|.+. +..|..++.-.... +-++.++ .+|+. T Consensus 200 T~~~I~~Di~~~~~~l~~qt~g~~~~~~~~tL~Lp~~~~~~L~~~-n~~g~tv~~~lk~n-----~Pnl~i~---t~pel 270 (336) T protein:vir:78 200 AVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKT-NQYGLSAAAKLKEI-----FPKLEFV---TIPEY 270 (336) T ss_pred CHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHhccCC-CccCccHHHHHHHh-----cCccEEE---Ecccc Confidence 223356778877777644321 244699999999988642 33333232111100 1112222 12222 Q ss_pred cccccccccccccccccceEEEeecc---eEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEec-cEEecccce Q lcl|Aclame:pro 227 PEAVTASTGVYRTTNPNVKAIAGDFS---AFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYG-IGIMSTDAF 302 (311) Q Consensus 227 ~~~~~~~~~~~~~~~~~~~~~~gd~~---~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~-~~v~~~~a~ 302 (311) .. .......+++-+.. ...+...+.++. ++- . ...-.....++.|++ ..+.+|.|| T Consensus 271 ~~----------Agg~~~~~~~~~~~~~~t~~~~~p~~f~~--lpv----q----~~~~~~~v~~~~rt~Gv~i~~P~ai 330 (336) T protein:vir:78 271 DT----------ASGRLVQLWAPRVEGKDTATCGFTEKMRA--HSI----E----RYSSYFRQKKSAGTWGAVIFRPFAV 330 (336) T ss_pred cc----------cCcceEEEEEeeccCCcceeeecchhhhc--cce----e----ecCceeEeccccceeeeeeeccchh Confidence 11 11112222222221 111222211111 000 0 011123344555654 456679999 Q ss_pred EEEEec Q lcl|Aclame:pro 303 AVVRDA 308 (311) Q Consensus 303 ~~l~~a 308 (311) ++++.= T Consensus 331 ~~~~GI 336 (336) T protein:vir:78 331 AQMIGV 336 (336) T ss_pred eeeccC Confidence 999977 No 176 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=97.72 E-value=5.7e-06 Score=49.31 Aligned_cols=279 Identities=11% Similarity=0.021 Sum_probs=139.4 Q ss_pred CcccCCCc------e-------Ecch---hHHHHHHHHHHhhchhhhhcceeecCCC---ceEEEEEeCCceeEEeecCc Q lcl|Aclame:pro 1 MVALATGT------F-------QLPK---HLVPGVWQKAQGQSVLARLSMAEPQEFG---EQQYMTLTAPPRGEVVGEGA 61 (311) Q Consensus 1 mat~~~g~------~-------~vP~---~~~~~ii~~~~~~s~l~~l~~~~~~~~~---~~~~p~~~~~~~a~~v~Eg~ 61 (311) |-....++ . -+|. .|...+++.+-....+.++..+.+.+.- ...+++....+.+.+.+.+. T Consensus 56 md~~~~~~~~~~~~~l~~~~~~g~~~~l~~~~p~~i~~~tap~~a~~l~pv~t~g~W~~~~~~~~v~e~~G~A~~ygd~~ 135 (379) T protein:vir:10 56 MDSNDIGPIPTPLSPLSPVSIPGLIQFLQNWLPGHVRILTAVREADEFLGLSTVGQWDDEQIVQRVLEGLGTAQPYTDGG 135 (379) T ss_pred hccccccccccccCccccccccchHHHHHhhcchHHHHHhhhhhhhhhcccccCCCceeeeEEEeeeeeeeeeEEecccc Confidence 22221111 1 1232 3456777777777777777776665432 24566666677888888888 Q ss_pred cccccccceeEEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccc---cc Q lcl|Aclame:pro 62 QKSESTATFAPVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGS---PA 138 (311) Q Consensus 62 ~~~~~~~~~~~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~---~~ 138 (311) +.|..+...+...-..+.++..+.++.+=+........++.+.-+....+++.+++|+-.|+|.++.+ ....|+ |+ T Consensus 136 d~pl~d~~~~~~~r~v~~~~~g~~yg~~El~~Aa~~g~~l~~~Ka~aA~~ale~~~N~i~f~G~~d~~-~~~yGllNdP~ 214 (379) T protein:vir:10 136 NMALMSWTPTFETRTVVRFEAGLQVAPLEEARSSRVQVSSADEKRAMVGEALEVQRNRVAFYGYNDGS-GRTFGFLNDPN 214 (379) T ss_pred CCCeeeeeeeeeeeeeEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCCC-cceEEEEeCCC Confidence 88877766555555567777777777653344445567888999999999999999999999954321 121121 11 Q ss_pred c---cccccc---ceeeccccccchHHHHHHHHHHHhhc---CC----CccEEEEcHHHHHHHHHhhccCCceeeccccc Q lcl|Aclame:pro 139 K---ILDTTN---IVELTTGTSATPDLAVEAAVGLVLGD---NL----SPDGVALDNTFSFMLATQRDSQGRKLYPELGF 205 (311) Q Consensus 139 ~---~~~~~~---~~~~~~~~~~~~~~~i~~~~~~~~~~---~~----~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~ 205 (311) . ....+. .......+....++++..++..+... .. .+..+++.|..+..|.+. +..|..++.-... T Consensus 215 l~a~~t~atg~~~~t~Wa~kT~~eI~~Di~~~~~~l~~qs~g~~~~~~~~~tL~LP~~~~~~L~~~-n~~g~Tvl~~lk~ 293 (379) T protein:vir:10 215 LPAYVAVPNGAGGSPLWAQKTTLEIIADLRNGLTALQVQSMGRIKSNKTPITIGIPNAYENYITTP-TELGYSVAQYMRE 293 (379) T ss_pred CcccccccCCcccccccccCCHHHHHHHHHHHHHHHHHhhCCeecccccceeEEecHHHHHhhccc-cccCccHHHHHHH Confidence 0 110000 01111222333566777777765432 11 233688999999888643 2223333211111 Q ss_pred cCCCceecceeEEeecccccccccccccccccccccccceEEEeec-ce--------EEEEeecCceEEEeccCCcccch Q lcl|Aclame:pro 206 GTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDF-SA--------FRWGVQVSIPLELIEFGDPDGLG 276 (311) Q Consensus 206 ~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~-~~--------~~~~~~~~~~i~~~~~~~~~~~~ 276 (311) .+-++.++....+. .. ........++.|- .. +.....+.++. .+- . T Consensus 294 -----n~Pnl~i~t~pEL~---~a---------ggg~~~~~~~~~~~~~~~t~~~~~~~~~~p~k~~~--l~v-e----- 348 (379) T protein:vir:10 294 -----SYPNVTFVSAPELN---DA---------NGGSSAIYYYADAVENNGTDDGRTWLQVVPTKMFT--LGV-E----- 348 (379) T ss_pred -----hcCCcEEEEccccc---cc---------CCCccEEEEEeeccCCCccCCcceEEEecchhhhh--ccc-e----- Confidence 11122232211111 10 0011112222221 10 00001111100 000 0 Q ss_pred hhhhcCcEEEEEEEE-eccEEecccceEEEEec Q lcl|Aclame:pro 277 DLKRQNQIAIRAEVV-YGIGIMSTDAFAVVRDA 308 (311) Q Consensus 277 ~~f~~~~v~~ra~~r-~~~~v~~~~a~~~l~~a 308 (311) ...-.....+..| .|..+.+|.||+++..+ T Consensus 349 --~~~~~~~~~~~~rt~Gv~ir~P~Ai~~~~G~ 379 (379) T protein:vir:10 349 --KKIKGYAEGYTNATAGAMLKRPFATYRQTGA 379 (379) T ss_pred --ecCceeEeccccceeeeeeecchhhheecCC Confidence 0001112233444 45667789999999999 No 177 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=97.68 E-value=3e-05 Score=45.41 Aligned_cols=273 Identities=11% Similarity=0.008 Sum_probs=120.0 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceee-----cC--CCceEEEEEeCCceeEEee-cCccccccccceeE Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEP-----QE--FGEQQYMTLTAPPRGEVVG-EGAQKSESTATFAP 72 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~-----~~--~~~~~~p~~~~~~~a~~v~-Eg~~~~~~~~~~~~ 72 (311) ||..=. ..||+.|..+.++.+++..++.++++.-. .. +.+++||+........+.. .+..+..++..-.+ T Consensus 1 MAN~ll--T~iP~iia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p~~~~v~d~~~~~~~~~~~~~~~e~~ 78 (423) T protein:vir:35 1 MANNLE--SNISQIVLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRPHQFKSERTETGDITGKDKNGLFSAK 78 (423) T ss_pred Cccchh--hhhHHHHHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeCCcceeecccCcCCCCccccccccce Confidence 872211 23799999999999999999999876521 11 3457888754322222211 12223333333233 Q ss_pred --EEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeec Q lcl|Aclame:pro 73 --VTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELT 150 (311) Q Consensus 73 --v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~ 150 (311) +++..+|... +.++++=..++. .+++++++.+ .+++++++|..++..-- . +.+ +.+. . T Consensus 79 v~l~id~~k~~a-~~v~d~e~~l~i---~~~~~~l~~a-~~ala~~vd~~l~~~l~--~-----~a~-------~~vg-t 138 (423) T protein:vir:35 79 ATGKVGKYITVA-VEWTQIEEALKL---NQLDQILSPI-HERMVTDLETELAHFMM--N-----NGA-------LSLG-S 138 (423) T ss_pred eeEEeccceecc-ceeCHHHHHhhH---HHHHHHHHHH-HHHHHHHHHHHHHHHHh--h-----ccc-------cccc-c Confidence 4444444433 445544222222 2455666655 47889999988874210 0 001 1000 0 Q ss_pred cccccchHHHHHHHHHHHhhcCCCc-cE-EEEcHHHHHHHHHh----hccCCceeeccccccCCCceecceeEEeecccc Q lcl|Aclame:pro 151 TGTSATPDLAVEAAVGLVLGDNLSP-DG-VALDNTFSFMLATQ----RDSQGRKLYPELGFGTDVASFAGLNAAVSDTVR 224 (311) Q Consensus 151 ~~~~~~~~~~i~~~~~~~~~~~~~~-~~-~v~n~~~~~~l~~l----kd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~ 224 (311) .++....|+.+.++-..|...+... .+ .+++|.....|.+- ...++. .-.....++-.+++.|+.++.|+.+| T Consensus 139 ~~t~~~~~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~-~~~alr~g~i~G~i~GFdv~~Snnvp 217 (423) T protein:vir:35 139 PNTAIKKWADVAQTASFIKDIGIKTGENYAIMDPWSAQRLADAQSGLHAADQL-VRTAWENAQISGNFGGIRALMSNGLA 217 (423) T ss_pred ccCCcchHHHHHHHHHHHHHhcCCcCCCEEEeCHHHHHHHhccccceeccccc-hhHHHhhccceeeecceEEEEcCCCc Confidence 1112245778888888776665553 23 58899988777531 111110 00112223334899999999999999 Q ss_pred cccccccccccccccccccc-eEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEe------ Q lcl|Aclame:pro 225 GGPEAVTASTGVYRTTNPNV-KAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIM------ 297 (311) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~-~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~------ 297 (311) ............... .... ..-..+.++..+.... .++ ..+ +. +...|.+. ..|...+ T Consensus 218 ~~T~gt~~~~~~v~~-a~~v~~~a~~~~~~~~~~~~~-~~~--~~~-g~-----l~~GD~~t-----~aGv~~v~~~t~~ 282 (423) T protein:vir:35 218 SRKQGDFDGAITVKT-APNVDYLSVKDSYQFTVALTG-ATP--SKT-GF-----LKAGDQLK-----FTSTHWLNQQSKQ 282 (423) T ss_pred cccccccccceeecc-ccccccccccccccceeeeee-eee--ccC-Cc-----EEecceEE-----eeeeeeccccccc Confidence 643322111100000 0000 0001111111111000 000 000 00 00011111 1111111 Q ss_pred --------cccceEEEEec--------------------ccC Q lcl|Aclame:pro 298 --------STDAFAVVRDA--------------------DES 311 (311) Q Consensus 298 --------~~~a~~~l~~a--------------------a~~ 311 (311) ++.-|+++... ..+ T Consensus 283 ~~~~~~t~~~~~~~V~~~~~~~a~g~~~v~i~p~~~~~~~~~ 324 (423) T protein:vir:35 283 TLYNGSTAMSFTATVLEETNSTASGDVTVKLSGVPIYDEKNS 324 (423) T ss_pred eeecccCCceeEEEEeccccccccCceeEEccccccccCCCc Confidence 11112222111 000 No 178 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=97.67 E-value=3e-05 Score=45.38 Aligned_cols=266 Identities=8% Similarity=-0.092 Sum_probs=127.5 Q ss_pred Cc--ccCCCceEcchhHHHHHHHHHHhhchhhh-h-cc--eeecCCCceEEEEEeCCceeEE-eecCccccccccceeEE Q lcl|Aclame:pro 1 MV--ALATGTFQLPKHLVPGVWQKAQGQSVLAR-L-SM--AEPQEFGEQQYMTLTAPPRGEV-VGEGAQKSESTATFAPV 73 (311) Q Consensus 1 ma--t~~~g~~~vP~~~~~~ii~~~~~~s~l~~-l-~~--~~~~~~~~~~~p~~~~~~~a~~-v~Eg~~~~~~~~~~~~v 73 (311) .| ....+....-+-+ ..+++.+.....+.+ + ++ .....+++++||+.....-..+ ...+-....-+.++... T Consensus 19 ~~~~~~~~nt~~l~~k~-~~~LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~DY~R~~g~~~g~vt~~~~t~ 97 (319) T protein:vir:94 19 FANKSVEPGQTLLKNKH-VGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKRNATNEFDHPKIEETTY 97 (319) T ss_pred hhccCCCcchHHHHHHH-HHHHHHHHHHhhhhhhcccCcceEeccCcEEEEeeecccccccccCCCCcccCCcccceeEE Confidence 11 2223333333334 344444544444332 1 22 3445566799999876332222 22222222333444555 Q ss_pred EEeeeeEEEEEeecHHHhhcCchhh--HHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeecc Q lcl|Aclame:pro 74 TAIPRKVQVTQRFSQEVKWADESRQ--LGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTT 151 (311) Q Consensus 74 ~l~~~kl~~~i~iS~ell~~s~~~~--~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~ 151 (311) ++...|.-.+. | +.+=. .++. +.+...+.+...+.++-.+|...+.-.-...+ .....+ T Consensus 98 tidqdR~~~F~-V-D~~D~--~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~---------------~~~~~~ 158 (319) T protein:vir:94 98 FLDQEKYWGRF-V-DALDR--KDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKA---------------KHLTVG 158 (319) T ss_pred Eeecccccccc-c-chhhH--hhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcc---------------cccccc Confidence 55444332211 1 11100 1111 12233455566666677777665532110000 001112 Q ss_pred ccccchHHHHHHHHHHHhhcCCCccE-EEEcHHHHHHHHHhhccCCce-eeccccccCCCceecceeEEeeccccccccc Q lcl|Aclame:pro 152 GTSATPDLAVEAAVGLVLGDNLSPDG-VALDNTFSFMLATQRDSQGRK-LYPELGFGTDVASFAGLNAAVSDTVRGGPEA 229 (311) Q Consensus 152 ~~~~~~~~~i~~~~~~~~~~~~~~~~-~v~n~~~~~~l~~lkd~~g~~-~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~ 229 (311) .+....|+.+.++...+...+...+. ++++|..+..|.+-..-.... +.......+..++|.|++|+.+ |.... T Consensus 159 ~t~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~Vi~v---ps~~~- 234 (319) T protein:vir:94 159 TGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKV---PTKLL- 234 (319) T ss_pred cCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceeeeeceeecCeEEEEe---ccccc- Confidence 23446788999999999887765444 677899998886543222111 1122334556789999999753 22111 Q ss_pred ccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecc Q lcl|Aclame:pro 230 VTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDAD 309 (311) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa 309 (311) ....+++|.-+... ...+--.+++++.. + + .| .-.++...++|..|++|++..++..+. T Consensus 235 ------------k~in~i~~h~~A~~-~~~k~~~~~~~~p~-~-~---~~---a~~v~gr~y~d~~V~~~k~~~Iy~~~~ 293 (319) T protein:vir:94 235 ------------QGLQAIAVVGEVLA-SPIQADLAKTNSNI-P-G---MF---GTLAEQLLYTGAFVPEHLQKYIFTIGG 293 (319) T ss_pred ------------ccceEEEEcCCeee-eeeeeeeeeccCCC-c-c---cc---ceeeeeeeeeeeEEeccccceEEEeec Confidence 11234455443332 22222233433211 0 1 11 235677889999999998655555433 Q ss_pred cC Q lcl|Aclame:pro 310 ES 311 (311) Q Consensus 310 ~~ 311 (311) .. T Consensus 294 ~~ 295 (319) T protein:vir:94 294 TE 295 (319) T ss_pred CC Confidence 33 No 179 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=97.67 E-value=3e-05 Score=45.38 Aligned_cols=266 Identities=8% Similarity=-0.092 Sum_probs=127.5 Q ss_pred Cc--ccCCCceEcchhHHHHHHHHHHhhchhhh-h-cc--eeecCCCceEEEEEeCCceeEE-eecCccccccccceeEE Q lcl|Aclame:pro 1 MV--ALATGTFQLPKHLVPGVWQKAQGQSVLAR-L-SM--AEPQEFGEQQYMTLTAPPRGEV-VGEGAQKSESTATFAPV 73 (311) Q Consensus 1 ma--t~~~g~~~vP~~~~~~ii~~~~~~s~l~~-l-~~--~~~~~~~~~~~p~~~~~~~a~~-v~Eg~~~~~~~~~~~~v 73 (311) .| ....+....-+-+ ..+++.+.....+.+ + ++ .....+++++||+.....-..+ ...+-....-+.++... T Consensus 19 ~~~~~~~~nt~~l~~k~-~~~LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~DY~R~~g~~~g~vt~~~~t~ 97 (319) T protein:vir:97 19 FANKSVEPGQTLLKNKH-VGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKRNATNEFDHPKIEETTY 97 (319) T ss_pred hhccCCCcchHHHHHHH-HHHHHHHHHHhhhhhhcccCcceEeccCcEEEEeeecccccccccCCCCcccCCcccceeEE Confidence 11 2223333333334 344444544444332 1 22 3445566799999876332222 22222222333444555 Q ss_pred EEeeeeEEEEEeecHHHhhcCchhh--HHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeecc Q lcl|Aclame:pro 74 TAIPRKVQVTQRFSQEVKWADESRQ--LGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTT 151 (311) Q Consensus 74 ~l~~~kl~~~i~iS~ell~~s~~~~--~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~ 151 (311) ++...|.-.+. | +.+=. .++. +.+...+.+...+.++-.+|...+.-.-...+ .....+ T Consensus 98 tidqdR~~~F~-V-D~~D~--~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~---------------~~~~~~ 158 (319) T protein:vir:97 98 FLDQEKYWGRF-V-DALDR--KDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKA---------------KHLTVG 158 (319) T ss_pred Eeecccccccc-c-chhhH--hhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcc---------------cccccc Confidence 55444332211 1 11100 1111 12233455566666677777665532110000 001112 Q ss_pred ccccchHHHHHHHHHHHhhcCCCccE-EEEcHHHHHHHHHhhccCCce-eeccccccCCCceecceeEEeeccccccccc Q lcl|Aclame:pro 152 GTSATPDLAVEAAVGLVLGDNLSPDG-VALDNTFSFMLATQRDSQGRK-LYPELGFGTDVASFAGLNAAVSDTVRGGPEA 229 (311) Q Consensus 152 ~~~~~~~~~i~~~~~~~~~~~~~~~~-~v~n~~~~~~l~~lkd~~g~~-~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~ 229 (311) .+....|+.+.++...+...+...+. ++++|..+..|.+-..-.... +.......+..++|.|++|+.+ |.... T Consensus 159 ~t~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~Vi~v---ps~~~- 234 (319) T protein:vir:97 159 TGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKV---PTKLL- 234 (319) T ss_pred cCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceeeeeceeecCeEEEEe---ccccc- Confidence 23446788999999999887765444 677899998886543222111 1122334556789999999753 22111 Q ss_pred ccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecc Q lcl|Aclame:pro 230 VTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDAD 309 (311) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa 309 (311) ....+++|.-+... ...+--.+++++.. + + .| .-.++...++|..|++|++..++..+. T Consensus 235 ------------k~in~i~~h~~A~~-~~~k~~~~~~~~p~-~-~---~~---a~~v~gr~y~d~~V~~~k~~~Iy~~~~ 293 (319) T protein:vir:97 235 ------------QGLQAIAVVGEVLA-SPIQADLAKTNSNI-P-G---MF---GTLAEQLLYTGAFVPEHLQKYIFTIGG 293 (319) T ss_pred ------------ccceEEEEcCCeee-eeeeeeeeeccCCC-c-c---cc---ceeeeeeeeeeeEEeccccceEEEeec Confidence 11234455443332 22222233433211 0 1 11 235677889999999998655555433 Q ss_pred cC Q lcl|Aclame:pro 310 ES 311 (311) Q Consensus 310 ~~ 311 (311) .. T Consensus 294 ~~ 295 (319) T protein:vir:97 294 TE 295 (319) T ss_pred CC Confidence 33 No 180 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=97.67 E-value=3e-05 Score=45.36 Aligned_cols=269 Identities=8% Similarity=-0.130 Sum_probs=123.5 Q ss_pred Cccc--CCCceEcchhHHHHHHHHHHhhchhhh-hcc--eeecCCCceEEEEEeCCceeEEe-ecCccccccccceeEEE Q lcl|Aclame:pro 1 MVAL--ATGTFQLPKHLVPGVWQKAQGQSVLAR-LSM--AEPQEFGEQQYMTLTAPPRGEVV-GEGAQKSESTATFAPVT 74 (311) Q Consensus 1 mat~--~~g~~~vP~~~~~~ii~~~~~~s~l~~-l~~--~~~~~~~~~~~p~~~~~~~a~~v-~Eg~~~~~~~~~~~~v~ 74 (311) .+-. .=+....-+-+...+-+.+...+.-.. +++ .....+++++||+.....-..+. ..+-....-+.++...+ T Consensus 30 ~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~~g~tVkIp~i~~~gl~DY~R~~g~~~g~vt~~~~t~t 109 (329) T protein:vir:10 30 FANKSVEPGDTLLKNKHVGILEKVTAANSYSAPAVISNDAIFMQGRSFTVIKGDVTELKDYKRNATNEFDHPQIQETTYF 109 (329) T ss_pred hcCCccCCchhHHHHHHHHHHHHHHHhhceeeeeecccceeeccCcEEEEeeecccccccccCCCCccccccccceeEEE Confidence 1111 011111222233333333332221111 122 34556677999998754322232 22222222334445555 Q ss_pred EeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccc Q lcl|Aclame:pro 75 AIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTS 154 (311) Q Consensus 75 l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 154 (311) +...|.-.+. | +.+=.+...-.+.+...+.+...+.++..+|...+.-.-...+ .......+. T Consensus 110 idqdR~~~F~-V-D~~D~dEtn~~l~a~~i~~~~~~~~v~pEiDay~~skla~~a~---------------~~~~~~~t~ 172 (329) T protein:vir:10 110 LDQEKYWGRF-V-DALDRRDTEGNIDINYVVAKQASEVVAPYLDNLRFATLARNKA---------------KHLTVGSGA 172 (329) T ss_pred eecccceeee-c-chhhHhhhhhhhhHHHHHHHHHHHHhhhHHHHHHHHHHHhhcc---------------cccccccCH Confidence 5554333221 1 1110000000112334455666777777888766532100000 001111234 Q ss_pred cchHHHHHHHHHHHhhcCCCcc-EEEEcHHHHHHHHHhhccCCcee-eccccccCCCceecceeEEeecccccccccccc Q lcl|Aclame:pro 155 ATPDLAVEAAVGLVLGDNLSPD-GVALDNTFSFMLATQRDSQGRKL-YPELGFGTDVASFAGLNAAVSDTVRGGPEAVTA 232 (311) Q Consensus 155 ~~~~~~i~~~~~~~~~~~~~~~-~~v~n~~~~~~l~~lkd~~g~~~-~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~ 232 (311) ...|+.+.++...+...+.... .++++|..+..|.+...-....- .......+..++|.|++|+.++. ... T Consensus 173 ~nay~~i~~a~~~Lde~~vp~~Rvl~VtP~~~~~Lk~~~~f~~~~~~~~~~~~~g~Vg~idG~~Ii~vps---~~~---- 245 (329) T protein:vir:10 173 DAQYDAVLDVSVELDEIGAGASRILFVTPKFYKGIKKFVIELPQGDNRQQVLGKGVQGELDGFTIVKVPS---KML---- 245 (329) T ss_pred HHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceeeeeeeeecCeEEEEecC---Ccc---- Confidence 4678889999999988766544 46778999888865221111111 11222345568899999985422 111 Q ss_pred cccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 233 STGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 233 ~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ....++++.-+.... ..+--.++.++... + ++.-.++...++|..|++|++..++.....+ T Consensus 246 ---------k~in~ii~~~~A~~~-~~K~~~~~~~~p~~--~------~~a~~v~gr~yyd~~V~~~k~~~I~~~~~~a 306 (329) T protein:vir:10 246 ---------QGVEAMAVIGEVMAS-PIQANEAKLNSNVP--G------MFGTLAEQMLYTGAFVPEHLQKYIFTIGGKE 306 (329) T ss_pred ---------cceeEEEEcCCceee-eeeeeeeeeeCCCC--c------cchheeeeeeeeeeEEEccccCEEEEecccC Confidence 111344444433322 22222344432211 0 1123567788999999999875555544333 No 181 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=97.49 E-value=1.3e-05 Score=47.37 Aligned_cols=278 Identities=11% Similarity=0.061 Sum_probs=142.6 Q ss_pred cCCCceEcc--hhHHHHHHHHHHhhchhhhhcceeec---CCCceEEEEEeCCceeE--EeecC-ccccccccceeEEEE Q lcl|Aclame:pro 4 LATGTFQLP--KHLVPGVWQKAQGQSVLARLSMAEPQ---EFGEQQYMTLTAPPRGE--VVGEG-AQKSESTATFAPVTA 75 (311) Q Consensus 4 ~~~g~~~vP--~~~~~~ii~~~~~~s~l~~l~~~~~~---~~~~~~~p~~~~~~~a~--~v~Eg-~~~~~~~~~~~~v~l 75 (311) .+...+++. +.+.+.|.|.-.+.-..+++.++.+. .-..+.+...+..+.+. |.+.+ .++|..+..+++... T Consensus 1 ~~~lafl~~qL~~id~~vye~~~~~~~~~~lipv~t~~~~~~~~~~~~~~d~~G~a~~~~i~~~a~dip~vd~~~~~~~~ 80 (304) T protein:vir:52 1 MSLLAYVKNGLTAVSKDIAETKYPEIVFPQFVYVDQQTAVGITEKLHYGADEHGSLDDGLITVGTSTLDQVEVGFTPTRS 80 (304) T ss_pred CchHHHHHHHHHHHhhhhhccccccchhhhhccccCCCCcccceEEEeeeeccCcccccccCCcCCccceeecccceeEE Confidence 233345444 12344555444444445555544221 11234555555556666 87654 778888888888888 Q ss_pred eeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCcccccccccccccccccee------- Q lcl|Aclame:pro 76 IPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVE------- 148 (311) Q Consensus 76 ~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~------- 148 (311) ..+.++.-+.+|.+=++.......++...-+....+++.+.+++..+.|.....+ . .|+++..++.. T Consensus 81 ~i~~~~~~~~y~~~El~~a~~~g~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~~~g--~----~GllN~p~v~~~~~~~~~ 154 (304) T protein:vir:52 81 YIVPWAKSVTWTKPELEQGKLLGLALNTAKIMALNKNAQQTLQKVAFLGHAKDSR--L----TGLLNNKSVEVYAIKGAA 154 (304) T ss_pred EEEEEeeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHhhhceEEEEeeccccc--e----EEEEeCCCcceeeecCCc Confidence 7888888777776544444344456777777778889999999999999542111 1 12222222211 Q ss_pred ----eccccccchHHHHHHHHHHHhhc---CCCccEEEEcHHHHHHHHHhhcc-CCceeeccccccCCCceecceeEEee Q lcl|Aclame:pro 149 ----LTTGTSATPDLAVEAAVGLVLGD---NLSPDGVALDNTFSFMLATQRDS-QGRKLYPELGFGTDVASFAGLNAAVS 220 (311) Q Consensus 149 ----~~~~~~~~~~~~i~~~~~~~~~~---~~~~~~~v~n~~~~~~l~~lkd~-~g~~~~~~~~~~~~~~~l~G~pv~~~ 220 (311) ..+.+.....+++..++.++... ...++.++|.|+.+..|....-+ .+..++.-.. ...+ ...|.|+.+ T Consensus 155 a~~~w~~~T~~eI~~di~~~~~~i~~~s~~~~~p~tl~Lpp~~~~~l~~~~~~~~~~Tvl~~l~-~n~~-~~~g~~l~I- 231 (304) T protein:vir:52 155 QNTKVQAMDFDKAVAFFKEIFLKGMEKTKRIEAPNTFAIDSLDLAHLALVQRANTDTTALEFLT-KHLS-AAAGRQVAI- 231 (304) T ss_pred cCCccccCCHHHHHHHHHHHHHHHHhccCceecCceEEeCHHHHHHHhhccCCCCCchHHHHHH-Hhcc-cccCCcceE- Confidence 11112223455667777776432 23467899999999988654322 2222321111 1011 123444432 Q ss_pred cccccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEE--EEEEEEec-cEEe Q lcl|Aclame:pro 221 DTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIA--IRAEVVYG-IGIM 297 (311) Q Consensus 221 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~--~ra~~r~~-~~v~ 297 (311) ..++.... ....+++.++++-+.+.-.+...-.+.+.+.+ ...+|... +=++.|+| ..++ T Consensus 232 ~~v~~~~~--------~~g~~g~~r~vvY~~d~~~~~~~vP~p~~~l~---------~q~~~~~~~~vp~~~r~gGv~v~ 294 (304) T protein:vir:52 232 KALPSNYG--------TRVTDGKTRAMVYVNSKEHVIFDVPMSPTVLD---------AQPKGLLAFESGLRMAFGGVTFM 294 (304) T ss_pred EEeccccc--------ccCCCCceEEEEEecChhheEEecCccccccc---------hhhcCCceEEecceeeeeeEEEE Confidence 11111100 01123344455544433222222112222221 13344332 33566665 5677 Q ss_pred cccceEEEEe Q lcl|Aclame:pro 298 STDAFAVVRD 307 (311) Q Consensus 298 ~~~a~~~l~~ 307 (311) +|.+++++.- T Consensus 295 ~P~a~~y~D~ 304 (304) T protein:vir:52 295 EPDSALYVDY 304 (304) T ss_pred ccceeeeecC Confidence 7999999988 No 182 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=97.47 E-value=6.1e-05 Score=43.71 Aligned_cols=281 Identities=10% Similarity=-0.049 Sum_probs=136.9 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcc-----ee-ecCCCceEEEEEeCCceeEEe-ecCcccc-ccccceeE Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSM-----AE-PQEFGEQQYMTLTAPPRGEVV-GEGAQKS-ESTATFAP 72 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~-----~~-~~~~~~~~~p~~~~~~~a~~v-~Eg~~~~-~~~~~~~~ 72 (311) ||+.. .++.|+..+.+.++..+....|+. .+ ..++.+++||+.+...-..+. +-....+ ..+.++.. T Consensus 1 MA~~n-----~a~~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~~~gg~tVkI~~i~~~gl~DY~R~~~g~~~g~~~~~~~t 75 (299) T protein:vir:79 1 MAALN-----YAKEYSNVLAQAYPYTLNFGDLYATPNNGRYRWTGSKTIEIPTISTTGRVDSNRDTIAVAQRNYDNAWEP 75 (299) T ss_pred Cccch-----hHHHHHHHHHHHHHhhceeeeeccCcccceeeecCCCEEEEeccccccccccccCCCcccccccCcceeE Confidence 88553 247798999988888877665542 12 233456999998754433332 2212222 34556666 Q ss_pred EEEeeeeEEEEEeecHHHhhcCchhh--HHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeec Q lcl|Aclame:pro 73 VTAIPRKVQVTQRFSQEVKWADESRQ--LGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELT 150 (311) Q Consensus 73 v~l~~~kl~~~i~iS~ell~~s~~~~--~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~ 150 (311) .++...|--.+ .|- .+ +-+++. ..+...+.+...+.++-.+|.-.+...-.. ...+. +..... T Consensus 76 ~~ldqdr~~~f-~vD-~~--Dvdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl~~~----a~~~g-------~~~~~~ 140 (299) T protein:vir:79 76 KVLTNQRKWST-LVH-PA--DINQTNYVASIGNITKVYNEEQKFPEMDAYCISKIYAD----WTALG-------NTADTT 140 (299) T ss_pred EEeecccccee-ccc-hh--hHHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHHHh----hhhcC-------Cccccc Confidence 66666543332 111 10 001111 112233444444555566676555321000 00000 011111 Q ss_pred cccccchHHHHHHHHHHHhhcCCCc-c-EEEEcHHHHHHHHHhhcc--CCceeeccccccCCCceecceeEEe--ecccc Q lcl|Aclame:pro 151 TGTSATPDLAVEAAVGLVLGDNLSP-D-GVALDNTFSFMLATQRDS--QGRKLYPELGFGTDVASFAGLNAAV--SDTVR 224 (311) Q Consensus 151 ~~~~~~~~~~i~~~~~~~~~~~~~~-~-~~v~n~~~~~~l~~lkd~--~g~~~~~~~~~~~~~~~l~G~pv~~--~~~~~ 224 (311) +.+....++.+.++...+...++.. . .++++|.....|.+...- +...........+..++|.|+||+. ++.|+ T Consensus 141 ~~T~~n~y~~i~~~~~~lde~~vP~~~rvl~vtp~~~~~L~~~~~f~k~~~~~~~~~~~~g~Vg~idG~~Ii~Vps~r~~ 220 (299) T protein:vir:79 141 VLTTTNVLEVFDKLMEKMTEARVPENGRILYVTPVVNTLIKNAKEIQRTVNIKDAGTSLNRQTTDIDTVKIIKVPSNLMK 220 (299) T ss_pred ccCHHHHHHHHHHHHHHHHhcCCCCCCeEEEeCHHHHHHHhhchhhhcccccccccceeeeeeeeecceEEEEechhhcC Confidence 2234467889999999998887754 3 477789999888753321 1111112223455568999999964 55555 Q ss_pred cccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecc--cce Q lcl|Aclame:pro 225 GGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMST--DAF 302 (311) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~--~a~ 302 (311) ..-...... .....+-+-.+++... ...+...+.-.+++++- +.....++ .+.-+.++|.=|.+. +++ T Consensus 221 t~~~~~~G~--~~~~~ak~in~ii~~~-~a~~~~~K~~~~~~~~P-~~~~~~~~------~~~~r~y~d~~v~~nk~~~i 290 (299) T protein:vir:79 221 TAYDFTTGW--KVGAGAKQIFMSLVHP-SAIITPVSYQFSKLDEP-TAVTEGKY------FYFEESFEDVFILNKKADAI 290 (299) T ss_pred ccceeccCc--cccCcccccceEEEcC-CeeeeeEeeeeEEeecC-CCCCccce------eeeeeeeeeeeeeccccCeE Confidence 321111110 0011111123344433 33344455555555532 11111111 122356777777775 445 Q ss_pred EEEEecccC Q lcl|Aclame:pro 303 AVVRDADES 311 (311) Q Consensus 303 ~~l~~aa~~ 311 (311) .+-.++|-+ T Consensus 291 ~~~~~~a~~ 299 (299) T protein:vir:79 291 QFVVEGAGA 299 (299) T ss_pred EEEeeecCC Confidence 454455555 No 183 >protein:vir:95451 Length: 313 # NCBI annotation: hypothetical protein ORF044 # Family: family:all:11728 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294637;genbank:gi:149408203;genbank:GeneID:5237018 Probab=97.43 E-value=4.8e-05 Score=44.26 Aligned_cols=292 Identities=11% Similarity=0.010 Sum_probs=150.0 Q ss_pred CcccCCCce-EcchhHHHHHHHHHHhhchhhhhcc-eeecCCCc-eEEEEEeCCceeEEeecCccccccccceeEEEEee Q lcl|Aclame:pro 1 MVALATGTF-QLPKHLVPGVWQKAQGQSVLARLSM-AEPQEFGE-QQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIP 77 (311) Q Consensus 1 mat~~~g~~-~vP~~~~~~ii~~~~~~s~l~~l~~-~~~~~~~~-~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~ 77 (311) |.-++.... ++-+.|+.+|..-+.+.-.--.+.+ +...++|. +.||.. +.+...-..|..+..-...+.++|++.. T Consensus 1 ~~~TSNT~A~I~SE~~s~~I~~~LH~~LL~~~~~R~V~DF~~G~~L~I~ti-Gs~~~~~~~E~~~~~~~~i~TGEIt~~i 79 (313) T protein:vir:95 1 MQLTSNTRAFIESEQYSKFILLNLHDGLLPETFYRNVSDFGSGETLHIKTI-GSVTLQEAEEDTPLIYNPIETGEITFQI 79 (313) T ss_pred CcccccchheehhhhHHHHHHHHhhccccchhhhhhhccCCCCCEEEeccc-CceeeeccccCCCeeecccccceEEEEE Confidence 776666555 5556677777766665543333333 56666654 677653 3344444456566666667778888888 Q ss_pred eeEEEE-EeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhh-cccCCCccccccccccccccccceeecccccc Q lcl|Aclame:pro 78 RKVQVT-QRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIH-GINPLTGAALSGSPAKILDTTNIVELTTGTSA 155 (311) Q Consensus 78 ~kl~~~-i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~-G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (311) ..+++- -.||++|.+++.+ --++.+++..+-+|+|....+.-+|. |.....+.+ . |..+-.-......+..... T Consensus 80 ~~Y~G~A~~vt~~LR~D~~~-I~~~~A~~~AE~~RAI~E~~~TD~L~~G~~~FA~~~-~--P~~vNG~PH~~V~~~T~~~ 155 (313) T protein:vir:95 80 TEYKGDAWYVTDDLREDGTD-IDRLMAERAAESTRAIQETFETDFLKTGAEYFAANP-G--PHNVNGFPHVIVSAETNGV 155 (313) T ss_pred EeecCChhhhhhhhhhcchh-HHHHhhhcchhhHHHHHHHHhhHHHhhchhhhccCC-C--CcccccccceEEeccCCce Confidence 887764 4689999887653 34567777778888888888777773 211111111 1 1111111222222222222 Q ss_pred chHHHHHHHHHHHhhcCCCc--cEEEEcHHHHHHHHHhhc------cCCceeecccccc--CCCceecceeEEeeccccc Q lcl|Aclame:pro 156 TPDLAVEAAVGLVLGDNLSP--DGVALDNTFSFMLATQRD------SQGRKLYPELGFG--TDVASFAGLNAAVSDTVRG 225 (311) Q Consensus 156 ~~~~~i~~~~~~~~~~~~~~--~~~v~n~~~~~~l~~lkd------~~g~~~~~~~~~~--~~~~~l~G~pv~~~~~~~~ 225 (311) -...++..+-..+..+.... ..++..|.....|..+.. .+|+.+......- +-.-++.|..+.+|+.+.. T Consensus 156 ~~~~~~~~~~~~~~~a~~P~~G~v~IvDP~~~~~L~~l~~It~~vt~~~k~I~ESG~A~~~~Fi~~~YG~Di~~SN~L~~ 235 (313) T protein:vir:95 156 FALKHLIAMRLAFDKANVPAEGRVFIVDPVAEATLNGLVTITHDVTDFGKMILESGMARGQRFIMNLYGWDILTSNRLHV 235 (313) T ss_pred ehhhHHHHhhhhhhhccCCccceEEEEcchhhhhhhhhheeecccccccceeeeccCCchhHHHHHHhhhhhhhhhhhhh Confidence 22344544444444444433 359999999988887652 3455543211111 1124578888888877653 Q ss_pred ccccccccccccccccccc---eEEEeecceE-EEEeecCce-EEEeccCCcccchhhhhcCcEEEEEEEEeccEEeccc Q lcl|Aclame:pro 226 GPEAVTASTGVYRTTNPNV---KAIAGDFSAF-RWGVQVSIP-LELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTD 300 (311) Q Consensus 226 ~~~~~~~~~~~~~~~~~~~---~~~~gd~~~~-~~~~~~~~~-i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~ 300 (311) .-.. ....+..+.. .+-+-|.... ...-|+.|. -|-.+. ++-.++..+.| .|+|.+++|.+ T Consensus 236 AN~~-----D~~tT~~G~~~NlFM~i~D~~~~P~~~AWr~MP~s~~~~~-------~~~~~~~~~~~--~R~G~Gi~R~~ 301 (313) T protein:vir:95 236 ANYN-----DGTTTGNGYVGNLFMCILDDQTKPIMGAWRRMPKSEGERN-------KDRARDEHVVR--CRYGFGIQRLD 301 (313) T ss_pred cccc-----ccccccCceeeeeeeeeecccccceeeeeccccccccccc-------cccccccceee--eeecccceeec Confidence 2111 1111111111 1112222110 112222221 111111 11123444555 49999999988 Q ss_pred ceEEEEecccC Q lcl|Aclame:pro 301 AFAVVRDADES 311 (311) Q Consensus 301 a~~~l~~aa~~ 311 (311) -...+-..|-+ T Consensus 302 ~L~~~~~~A~~ 312 (313) T protein:vir:95 302 TLGLLATSATA 312 (313) T ss_pred ceeEEEecccc Confidence 87666544444 No 184 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=97.32 E-value=9.5e-05 Score=42.64 Aligned_cols=281 Identities=10% Similarity=-0.052 Sum_probs=118.6 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceee-----c--CCCceEEEEEeCCceeEEe-ecCcccccccccee- Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEP-----Q--EFGEQQYMTLTAPPRGEVV-GEGAQKSESTATFA- 71 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~-----~--~~~~~~~p~~~~~~~a~~v-~Eg~~~~~~~~~~~- 71 (311) |+..= -..+|+.|..+.++.+++..++.+++..-. . .+.+++||+...-....+. ..+..+..++..-. T Consensus 1 MaN~l--lT~ip~iia~~al~~l~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~~~~~~~~~~~~~~~~~~~~l~e~~ 78 (423) T protein:vir:17 1 MPNNL--DSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLISGK 78 (423) T ss_pred Cccch--hhhhHHHHHHHHHHHHHhhcccchhhcccCCcchhhcccCCEEEEeeCCcceeecccCcccCCcccCccccce Confidence 77221 123799999999999999999988876421 1 2345788863322211221 12222333333322 Q ss_pred -EEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeec Q lcl|Aclame:pro 72 -PVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELT 150 (311) Q Consensus 72 -~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~ 150 (311) ++++..+|...+--=..|.. ... -+++++++.+ .++++..+|..++.-- ....+. ... . T Consensus 79 v~l~id~~k~va~~v~d~E~~-~~i---~~~~~~l~~A-~~aLA~~vd~~ia~~~---~~~a~~-----~~g-------t 138 (423) T protein:vir:17 79 ATGRVGNYITVAVEYQQLEEA-IKL---NQLEEILAPV-RQRIVTDLETELAHFM---MNNGAL-----SLG-------S 138 (423) T ss_pred eEEEeeceeeeeeeecHHHHh-cCh---hHHHHHHHHH-HHHHHHHHHHHHHHHH---hhcccc-----ccc-------c Confidence 45555555544433344443 222 2356666555 5889999998876321 000000 000 0 Q ss_pred cccccchHHHHHHHHHHHhhcCCCc--cEEEEcHHHHHHHHHhhc--cCCceee-ccccccCCCceecceeEEeeccccc Q lcl|Aclame:pro 151 TGTSATPDLAVEAAVGLVLGDNLSP--DGVALDNTFSFMLATQRD--SQGRKLY-PELGFGTDVASFAGLNAAVSDTVRG 225 (311) Q Consensus 151 ~~~~~~~~~~i~~~~~~~~~~~~~~--~~~v~n~~~~~~l~~lkd--~~g~~~~-~~~~~~~~~~~l~G~pv~~~~~~~~ 225 (311) ..+....|+++.++-..|...+... -..+++|.....|.+-.. ......- .....++-.+.+.|+.++.++.+|. T Consensus 139 ~~t~~~a~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdvy~Snnip~ 218 (423) T protein:vir:17 139 PNTPITKWSDVAQTASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIRALMSNGLAS 218 (423) T ss_pred CCcccccHHHHHHHHHHHHhccCCcCCCEEEeChHHHHHHhccccceecccccchHHHhhccceeeecceEEEEeCCCcc Confidence 1111234777878877776665553 247889998877754210 0000010 1112222347899999999999996 Q ss_pred cccccccccccc-------cccc----------------ccceEEEeecceEE-EEeecCceEEEeccCCcccchhhhhc Q lcl|Aclame:pro 226 GPEAVTASTGVY-------RTTN----------------PNVKAIAGDFSAFR-WGVQVSIPLELIEFGDPDGLGDLKRQ 281 (311) Q Consensus 226 ~~~~~~~~~~~~-------~~~~----------------~~~~~~~gd~~~~~-~~~~~~~~i~~~~~~~~~~~~~~f~~ 281 (311) ............ .... ....+..||.-.+. +.-...++-++... +.+..++ T Consensus 219 ~T~gt~~~t~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~t~aGv~~v~~~tk~v~~~---~~t~~~~-- 293 (423) T protein:vir:17 219 RTQGAFGGTLTVKTQPTVTYNAVKDSYQFTVTLTGATTSVTGFLKAGDQVKFTNTYWLQQQTKQALYN---GATPISF-- 293 (423) T ss_pred ccccceeceeeecccccccccccccccceeeeeeeeeeeccCceeecceEEecceeeecccccccccc---cccccce-- Confidence 533221110000 0000 00011111100000 00000000000000 0000000 Q ss_pred CcEEEEEEE------------EeccEEecccce---EEEE--ecccC Q lcl|Aclame:pro 282 NQIAIRAEV------------VYGIGIMSTDAF---AVVR--DADES 311 (311) Q Consensus 282 ~~v~~ra~~------------r~~~~v~~~~a~---~~l~--~aa~~ 311 (311) .|++.. ....++..+.+. .-++ .++++ T Consensus 294 ---~~~v~~~~~~~a~~~~tv~i~p~~i~~~~~~~~~~v~a~~a~~~ 337 (423) T protein:vir:17 294 ---TATVTADANSDSSGDVTVTLSGVPIYDTTNPQYNSVSRQVAAGD 337 (423) T ss_pred ---EEEEEecccccccCceEEEecCccccccCCcccccceecccCCc Confidence 111110 000011111111 1111 11111 No 185 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=97.31 E-value=9.9e-05 Score=42.55 Aligned_cols=281 Identities=9% Similarity=-0.051 Sum_probs=120.9 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhccee----e-c--CCCceEEEEEeCCceeEEee-cCcccccccccee- Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAE----P-Q--EFGEQQYMTLTAPPRGEVVG-EGAQKSESTATFA- 71 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~----~-~--~~~~~~~p~~~~~~~a~~v~-Eg~~~~~~~~~~~- 71 (311) |+..= -..+|+.|..+.++.+++..++.+++..- . . .+.+++|++........+.. ++..+...+..-. T Consensus 1 MaN~l--lT~~p~iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~~~~~~d~~~~~~~~~~~~dl~e~~ 78 (423) T protein:vir:10 1 MPNNL--DSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLISGK 78 (423) T ss_pred Cccch--hhhhHHHHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCCceeeeccCCccccccccCccccce Confidence 77221 11279999999999999999998887652 1 1 23457787654322222221 2222333333333 Q ss_pred -EEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeec Q lcl|Aclame:pro 72 -PVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELT 150 (311) Q Consensus 72 -~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~ 150 (311) ++++..+|...+--=..|+. ... -+++++++.+ .++++.++|..++.-. ...... .. + . T Consensus 79 v~l~id~~k~va~~v~d~E~~-~~i---~~~~~~l~~A-~~aLA~~vd~~ia~~~---~~~~~~-----~~-g------t 138 (423) T protein:vir:10 79 ATGRVGNYITVAVEYQQLEEA-IKL---NQLEEILAPV-RQRIVTDLETELAHFM---MNNGAL-----SL-G------S 138 (423) T ss_pred eEEEeeceeeeeeeechHHHh-cCh---hhHHHHHHHH-HHHHHHHHHHHHHHHH---hhcccc-----cc-c------c Confidence 45555555544433344443 222 2356666655 5889999999887421 000000 00 0 1 Q ss_pred cccccchHHHHHHHHHHHhhcCCCc--cEEEEcHHHHHHHHHhhc--cCCceee-ccccccCCCceecceeEEeeccccc Q lcl|Aclame:pro 151 TGTSATPDLAVEAAVGLVLGDNLSP--DGVALDNTFSFMLATQRD--SQGRKLY-PELGFGTDVASFAGLNAAVSDTVRG 225 (311) Q Consensus 151 ~~~~~~~~~~i~~~~~~~~~~~~~~--~~~v~n~~~~~~l~~lkd--~~g~~~~-~~~~~~~~~~~l~G~pv~~~~~~~~ 225 (311) .++....|+++.++-..|...+... -..+++|.....|.+... ......- .....++-.+++.|+.++.++.+|. T Consensus 139 ~~t~~~a~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnip~ 218 (423) T protein:vir:10 139 PNTPITKWSDVAQTASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIRALMSNGLAS 218 (423) T ss_pred CCcccchHHHHHHHHHHHHhccCCcCCCEEEeChHHHHHHhccccceecccccchhhhhhccceeeecceEEEEeCCCcc Confidence 1112234777777777776655543 247889998877754211 1111111 1112222347899999999999996 Q ss_pred cccccccccc-----cc---cc--c-------------cccceEEEeecceEE-EEeecCceEEEeccCCcccchhhhhc Q lcl|Aclame:pro 226 GPEAVTASTG-----VY---RT--T-------------NPNVKAIAGDFSAFR-WGVQVSIPLELIEFGDPDGLGDLKRQ 281 (311) Q Consensus 226 ~~~~~~~~~~-----~~---~~--~-------------~~~~~~~~gd~~~~~-~~~~~~~~i~~~~~~~~~~~~~~f~~ 281 (311) .......... .. .. . .....+..||.-.+. +.....++-++.. .+.+..+ T Consensus 219 ~T~gt~~~t~~~~~~~~v~~~a~~~a~~~~~~~~~~~~~~~~~l~~GD~~t~aGv~~v~~~tk~~~~---~~~t~~~--- 292 (423) T protein:vir:10 219 RTQGAFGGTLTVKTQPTVTYNAVKDSYQFTVTLTGATASVTGFLKAGDQVKFTNTYWLQQQTKQALY---NGATPIS--- 292 (423) T ss_pred ccccccccceeeeecceeccccccccceeeeeeeeccccccCceeecceEEecceeeeccccccccc---ccccCcc--- Confidence 5332211100 00 00 0 000111112210000 0000000000000 0000011 Q ss_pred CcEEEEEEEEe------cc------EEecccce---EEEE--ecccC Q lcl|Aclame:pro 282 NQIAIRAEVVY------GI------GIMSTDAF---AVVR--DADES 311 (311) Q Consensus 282 ~~v~~ra~~r~------~~------~v~~~~a~---~~l~--~aa~~ 311 (311) ..|++..-. +. ++..+.++ .-++ .++++ T Consensus 293 --~~~~v~a~~~~~~~g~~tv~i~p~~i~~~~~~~~~~v~a~~a~~~ 337 (423) T protein:vir:10 293 --FTATVTADANSDSGGDVTVTLSGVPIYDTTNPQYNSVSRQVEAGD 337 (423) T ss_pred --eEEEEEeeeeeccCCceeeeccCccccccCCcccccccccccCCc Confidence 112222111 00 11111111 1111 11111 No 186 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=97.27 E-value=0.0001 Score=42.46 Aligned_cols=273 Identities=12% Similarity=0.107 Sum_probs=131.5 Q ss_pred CcccCCCceEcchhHHHHHHHHHH-hhchhhhhcceeecCCCceEEEEEeCCcee-EEeecCccccccccceeEEEEeee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQ-GQSVLARLSMAEPQEFGEQQYMTLTAPPRG-EVVGEGAQKSESTATFAPVTAIPR 78 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~-~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a-~~v~Eg~~~~~~~~~~~~v~l~~~ 78 (311) |..++..=-.+-+.+...+.+-.. ......++|+..+......++..+..-+.. .|.+| ++...+.=..-+++.+ T Consensus 1 m~it~~~l~~l~~~~~~~~~~~y~~a~~~~~~~a~~~~sdf~~~~~~~lg~~p~l~e~~Ge---~~~~~l~~~~~~i~~~ 77 (302) T protein:vir:10 1 MLINKQSLNAAFVAIKTIFNNAFAAAPTTWQKIAMEVPSNTSSNDYKWLSTFPKMRRWIGA---KVVKNLKAYKYVVENE 77 (302) T ss_pred CcccHHHHHHHHHHHHHHHHHHHHhhhhhhhceeeecCCCcceeeceecCCCCCccccccc---eeeccccccceeEEee Confidence 776653311222222222222222 123466777766655555566666665654 46544 4444444455667889 Q ss_pred eEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccC-CCcccccccccccccc---------ccc-- Q lcl|Aclame:pro 79 KVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINP-LTGAALSGSPAKILDT---------TNI-- 146 (311) Q Consensus 79 kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~-~~g~~~~~~~~~~~~~---------~~~-- 146 (311) +++..+.||++.| .++.+++..-+...+.++-++..|+.++.=-.+ .+.....|.+ +.++ .+. T Consensus 78 ~~g~~v~i~R~~i---~nDdlg~~~~~~~~~G~aaa~~~~~lv~~~L~~g~~~~~~DG~~--fF~~dH~~g~~~~~N~g~ 152 (302) T protein:vir:10 78 DFEATVEVDRNDI---EDDQIGIYSPQAKMAGYSAAQLPDELVYEAVNGAFTKPCFDGQY--FIDTDHPVGDASVSNKGT 152 (302) T ss_pred cccceecccHHhh---cccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCCcccCCcc--eecccccccccccccccc Confidence 9999999999988 455678888999999999999998887632110 0111111111 1111 000 Q ss_pred --ee-eccccccchHHHHHHHHHHHhhc-----CCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecc-eeE Q lcl|Aclame:pro 147 --VE-LTTGTSATPDLAVEAAVGLVLGD-----NLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAG-LNA 217 (311) Q Consensus 147 --~~-~~~~~~~~~~~~i~~~~~~~~~~-----~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G-~pv 217 (311) .+ .........+.....++...... +..|..++..|.....-+++-.+ ++.. .. ..++ +.| +.+ T Consensus 153 ~~~~~~~~~l~~~~~~aa~~am~~~k~~~G~~L~i~P~~LiVp~~le~~A~~ll~~-~~~~--~g--~~Np--~~g~~~~ 225 (302) T protein:vir:10 153 APLSNASQAAAKAGYGAARTAMKKFKDEEGRSLNVSPNVLLVGPALEDVAKMLLTN-PKLA--DN--TPNP--YVGTAEL 225 (302) T ss_pred hhhhhcccccchHHHHHHHHHHHHHhhhcccccccCCCEEEecchhHHHHHHHhhc-cccC--CC--Ccce--eccceEE Confidence 00 00111122233334444444333 33456677777766665554211 1110 00 0011 112 355 Q ss_pred EeecccccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEecc--- Q lcl|Aclame:pro 218 AVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGI--- 294 (311) Q Consensus 218 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~--- 294 (311) +++..+.+...+..-.. .. ....+++.-++...++.... |..+.+.+|.+..+|+ T Consensus 226 vv~p~L~s~~aWyL~a~-------~~------~i~~~~l~g~~~P~~~~~~~---------~~~dgv~~k~~~d~Gvd~R 283 (302) T protein:vir:10 226 VVDGRIESDTAWFLLDT-------TK------PVKPFIFQPRKQPEFVSQVN---------LDSDDVFNLRKLKFGAEAR 283 (302) T ss_pred EEeeccCCCCceEEEec-------CC------ccceEEEcCccccEEEeccC---------CCCCceEEEEEEEEeeeee Confidence 66566544333322111 00 01223445566666654432 4556666776666664 Q ss_pred ---EEecccceEEEEecccC Q lcl|Aclame:pro 295 ---GIMSTDAFAVVRDADES 311 (311) Q Consensus 295 ---~v~~~~a~~~l~~aa~~ 311 (311) +...+..-..- +.++| T Consensus 284 ~~~G~~~wq~a~~s-~g~~~ 302 (302) T protein:vir:10 284 AAAGYGFWQLAYGS-TGTGA 302 (302) T ss_pred eecchhhhhhhhcc-CccCC Confidence 33333332333 33333 No 187 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=97.27 E-value=0.00011 Score=42.26 Aligned_cols=280 Identities=10% Similarity=-0.052 Sum_probs=123.8 Q ss_pred cccCCCceEcchhHHHHHHHHHHhhchhhhhcc---e----eecCCCceEEEEEeCCc----eeEEeecCccccccccc- Q lcl|Aclame:pro 2 VALATGTFQLPKHLVPGVWQKAQGQSVLARLSM---A----EPQEFGEQQYMTLTAPP----RGEVVGEGAQKSESTAT- 69 (311) Q Consensus 2 at~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~---~----~~~~~~~~~~p~~~~~~----~a~~v~Eg~~~~~~~~~- 69 (311) |+.+.--+.-| ......+|.+.+.......+. . .+..+.-+.+|-...-. +..-+.+....+..+.+ T Consensus 1 m~lsD~~vfN~-~~~~a~~e~~~q~~~~fn~as~gai~l~~~~~~Gd~~~~pf~~~l~g~~~~~~~~~~~~~vt~~kitt 79 (325) T protein:vir:95 1 MALSDLAVYSE-YAYSAFSETLRQQVDLFNTATGGAIMLQSAAHQGDFSDVAFFAKVTGGLVRRRNAYGSGTVAEKVLKH 79 (325) T ss_pred Cchhhhhhhhh-hhhhhhhhhhhhhHhhhhhcccceeEeccccccCceeeccccccccccccccccCCCCceeccceecc Confidence 55544333333 344666666555433333221 1 12222224566554211 22223344444444433 Q ss_pred eeEEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceee Q lcl|Aclame:pro 70 FAPVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVEL 149 (311) Q Consensus 70 ~~~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~ 149 (311) ..++..+.+.-.++.....+.+....+....+...|.+++++...+.+-+.+|.+... ...+. .+-....+. T Consensus 80 ~~~~av~~~r~~g~~~~d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~~----a~~~~----~~~v~dis~ 151 (325) T protein:vir:95 80 LVDTSVKVAAGTPPVRLDPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLNVGLGSVYS----ALSQV----SDVVYDATA 151 (325) T ss_pred ccceeeEEecccCcccccHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----hhccc----ccceeeeec Confidence 4445444443333333333332222233334555666666666555554444433110 00110 011111111 Q ss_pred ccc--cccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccccccc Q lcl|Aclame:pro 150 TTG--TSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGP 227 (311) Q Consensus 150 ~~~--~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~ 227 (311) ..+ +.......+.++..++-+....-..|+||...+..|.+..-.+...++...... ..+.++|++|++++.+|... T Consensus 152 ~~~~~~~~~s~~~l~~A~~klGD~~~~l~~~~MHS~v~~~L~~~~L~~~~~~~~~~g~~-~i~t~~G~~VIVdD~~p~~~ 230 (325) T protein:vir:95 152 NTDAADKLPTWNNLNNGQAKFGDQSSQIAAWIMHSTPMHKLYGSNLTNGERLFTYGTVN-VVRDPFGKLLVMTDSPNLFA 230 (325) T ss_pred ccCcccccccHHHHHHHHHHhcccccceeEEEEchHHHHHHHHhhccccccccccCCcc-cccccCCcEEEEeCCCCCCC Confidence 111 122345678888888876666667899999999999886655544443222211 23578999999999887442 Q ss_pred ccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEe Q lcl|Aclame:pro 228 EAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRD 307 (311) Q Consensus 228 ~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~ 307 (311) ..... .+. ..++|. ..+.+....+......+.. .-.+-...+|+ |+ --++||..+..-+. T Consensus 231 ~g~~~---~yt------ty~lg~-GAi~~~~~~~~~~~~~~~~-------~~~~~~~~~~~--~~-tf~lhp~G~sw~~s 290 (325) T protein:vir:95 231 AGTPN---VYH------ILGLVP-GGVLIGQNNDFDANEETKN-------GDENIIRTYQA--EW-SYNIGVKGFAWDKA 290 (325) T ss_pred ccCce---eEE------EEEEec-CeEEecCCCCccccccccC-------cccceeeeeee--ee-eEEeecceeeeecc Confidence 21110 000 112221 1222333333222211110 01122223332 22 14678888777332 Q ss_pred cc-cC Q lcl|Aclame:pro 308 AD-ES 311 (311) Q Consensus 308 aa-~~ 311 (311) .. .+ T Consensus 291 ~~g~s 295 (325) T protein:vir:95 291 NGGKS 295 (325) T ss_pred cccCC Confidence 11 12 No 188 >protein:vir:106734 Length: 336 # NCBI annotation: gp13 # Family: family:all:1653 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944321;genbank:gi:38638620;genbank:GeneID:2657363 Probab=97.07 E-value=2.5e-05 Score=45.78 Aligned_cols=277 Identities=12% Similarity=0.037 Sum_probs=139.0 Q ss_pred CcccCCCceEcchhHH----HHHHHHHHhhchhhhhcceeecCCC---ceEEEEEeCCceeEEeecCccccccccceeEE Q lcl|Aclame:pro 1 MVALATGTFQLPKHLV----PGVWQKAQGQSVLARLSMAEPQEFG---EQQYMTLTAPPRGEVVGEGAQKSESTATFAPV 73 (311) Q Consensus 1 mat~~~g~~~vP~~~~----~~ii~~~~~~s~l~~l~~~~~~~~~---~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v 73 (311) |.|.+.+| ||..+. ..+++.+.+...+..+..+.+.+.- ...++.....+.+.+.+...+.|..+...+.. T Consensus 42 ~~t~~~~g--~~~~l~~~i~p~~~~~~~~~~~~~~l~~v~t~g~w~~~~~~~~~~e~~G~a~~ygd~~d~P~~d~~~~~~ 119 (336) T protein:vir:10 42 LSSTGSSG--IPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGTNINYP 119 (336) T ss_pred cccCCCcc--hHHHHHhhcCcceeeeeechhchhhhcccccCCCcceeeEEEEeeeeeeeEEEccccCCCcceeeeeeee Confidence 23333333 454332 2334444444445555554443321 23455656666777778888999888776666 Q ss_pred EEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccc-cceeeccc Q lcl|Aclame:pro 74 TAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTT-NIVELTTG 152 (311) Q Consensus 74 ~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~-~~~~~~~~ 152 (311) .-+.+.++..+.++.+=+........++.+.-+....+++.+++++-.++|....+-.+.-+-|......+ ........ T Consensus 120 ~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~~~~~~GllN~P~l~a~~t~~~~~w~~~ 199 (336) T protein:vir:10 120 QRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSP 199 (336) T ss_pred eeeEEEEEEEEeeCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCeEEEEeecccceEEEeecCCCCcccccCcCccccc Confidence 66788888889998554444445566788888888899999999998888854332222222221110010 00001111 Q ss_pred cccchHHHHHHHHHHHhhcC---C---CccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccc Q lcl|Aclame:pro 153 TSATPDLAVEAAVGLVLGDN---L---SPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGG 226 (311) Q Consensus 153 ~~~~~~~~i~~~~~~~~~~~---~---~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~ 226 (311) +....++++..++..+...- . .+..+++.++.+..|.+ ++..|..++.-.... +-++.++. +|+. T Consensus 200 T~~eI~~Di~~~~~~l~~qt~g~i~~~~~~tL~Lp~~~~~~L~~-~n~~g~tv~~~lk~n-----~Pnl~i~t---~pel 270 (336) T protein:vir:10 200 AVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAKLKEI-----FPKLEFVT---IPEY 270 (336) T ss_pred CHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHhccC-CCccCccHHHHHHHh-----CCccEEEE---cccc Confidence 22346777888877775432 1 24469999999998864 233333332111110 11122221 2222 Q ss_pred cccccccccccccccccceEEEeecc---eEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEec-cEEecccce Q lcl|Aclame:pro 227 PEAVTASTGVYRTTNPNVKAIAGDFS---AFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYG-IGIMSTDAF 302 (311) Q Consensus 227 ~~~~~~~~~~~~~~~~~~~~~~gd~~---~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~-~~v~~~~a~ 302 (311) .. ..+....++.-+.. ...+...+.++. ++- . ...-.....++.|++ ..+.+|-|| T Consensus 271 ~~----------Agg~~~~~~~~~~~~~~t~~~~~P~~f~~--lpv----q----~~~~~~~v~~~~rt~Gv~i~rP~ai 330 (336) T protein:vir:10 271 DT----------ASGRLVQLWAPRVEGKDTATCGFTEKMRA--HSI----E----RYSSYFRQKKSAGTWGAVIFRPFAV 330 (336) T ss_pred cc----------cCCceEEEEEecccCCcceeeecChhhhc--cce----e----ecCceeEeccccceeeeeeeccchh Confidence 11 11112222222221 111221211111 000 0 011122334555654 456679999 Q ss_pred EEEEec Q lcl|Aclame:pro 303 AVVRDA 308 (311) Q Consensus 303 ~~l~~a 308 (311) ++++.= T Consensus 331 ~~~~GI 336 (336) T protein:vir:10 331 AQMLGV 336 (336) T ss_pred eeeccC Confidence 999977 No 189 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=97.05 E-value=0.00019 Score=40.95 Aligned_cols=280 Identities=11% Similarity=0.045 Sum_probs=125.1 Q ss_pred CcccCCCceEcch--hHHHHHHHHHHhhchhhhhcce---------eecCCCceEEEEEeCC-ce--eEEeecC--cccc Q lcl|Aclame:pro 1 MVALATGTFQLPK--HLVPGVWQKAQGQSVLARLSMA---------EPQEFGEQQYMTLTAP-PR--GEVVGEG--AQKS 64 (311) Q Consensus 1 mat~~~g~~~vP~--~~~~~ii~~~~~~s~l~~l~~~---------~~~~~~~~~~p~~~~~-~~--a~~v~Eg--~~~~ 64 (311) ||++.=...++|+ .+..-+.+.-.+.+.+.+=+-+ ...++..+++|....- .. ..+-+.. +..+ T Consensus 1 Ma~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~l~g~~e~n~~~dt~~~~~t 80 (349) T protein:vir:94 1 MAITTIGNIVTGNIPVLASYMTEDPVEKTAFFNSGILTPTPYAAEIARGPSNIANLPFWKAIDTSIEPNYSNDVYQDIAT 80 (349) T ss_pred CCceEEeeeeccChHHHHHHHHHhHHHhhhhhhccceeccHHHHHHHhcCCCEEEeeeeecCCCCcccccCCCCcccccc Confidence 8888778999998 4655555554455555442211 1233444788876542 22 1121211 1233 Q ss_pred ccccc-eeEEEEeeeeEEE--EEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhh---hcccCCCccccccccc Q lcl|Aclame:pro 65 ESTAT-FAPVTAIPRKVQV--TQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGI---HGINPLTGAALSGSPA 138 (311) Q Consensus 65 ~~~~~-~~~v~l~~~kl~~--~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l---~G~~~~~g~~~~~~~~ 138 (311) ..+.+ ..++-...+.-.+ .-.++.++- ..|..+.|++++++...+...+.+| .|. .+.....--. T Consensus 81 ~~kit~~~~~a~~~~r~kaw~~~Dla~~ls------G~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gv---f~~~~~~~~~ 151 (349) T protein:vir:94 81 PRAIQTGEMMARVAYLNEGFGQADLTVELT------SQNPLQSVASRLDNFWQRQAQRRLIATALGL---YNDNVSATDA 151 (349) T ss_pred cccccccceeeeeeeeccccchhHHHHHhh------CchHHHHHHHHHHHHHhhHHHHHHHHHHHhh---hccccccccc Confidence 33322 3333333332222 223344331 1256677888888777666555544 331 0000000000 Q ss_pred cccccccceeeccccccchHHHHHHHHHHHhhc-----CCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceec Q lcl|Aclame:pro 139 KILDTTNIVELTTGTSATPDLAVEAAVGLVLGD-----NLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFA 213 (311) Q Consensus 139 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~-----~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~ 213 (311) ..........+. .........+..+..++-.. .-.-+.++||...+..|++++--. + +++.......++++ T Consensus 152 ~~~~~~~~~d~~-~~a~~~~~~~~~A~~~~Gdaa~Gd~~~~lt~i~mHS~v~~~L~~~~li~--~-i~~s~~~~~i~ty~ 227 (349) T protein:vir:94 152 YHEQNDMVVDVS-ATSGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLID--F-IRDAENNTMFATYQ 227 (349) T ss_pred ccccCceeEEec-ccCCCChhhHHHHHHHHHHHhccccccceeEEEEchHHHHHHHhcchhh--h-ccCcccCcccceec Confidence 000011111111 11122333455555554432 223357999999999998753200 0 01111122347899 Q ss_pred ceeEEeecccccccccccccccccccccccceEEEeecceEEEEee-cCceEEEeccCCcccchhhhhcCcEEEEEEEEe Q lcl|Aclame:pro 214 GLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQ-VSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVY 292 (311) Q Consensus 214 G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~-~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~ 292 (311) |++|++++.||....- ....+ ..++||. ..+.++.- ....+++.|+....+ ..++-.+-.+.| T Consensus 228 G~~VivDD~~Pv~~~g---~~~~y------ttylfg~-GAi~~~~~~~~~~~E~~rd~~~g~-----~~G~d~L~~R~~- 291 (349) T protein:vir:94 228 GYRVIVDDSMTVVGQD---TSRKF------ISIIFGQ-GAIGYGEGNPEMPLEYEREASRAN-----GGGVETLWTRKT- 291 (349) T ss_pred CcEEEEeCCCccccCC---CCceE------EEEEeec-ceEEeecCCCCcceeeecccccCC-----cceeEEEEEeeE- Confidence 9999999999953211 01011 1344443 22222222 123355555432110 012222222233 Q ss_pred ccEEecccceEEEEeccc---------C Q lcl|Aclame:pro 293 GIGIMSTDAFAVVRDADE---------S 311 (311) Q Consensus 293 ~~~v~~~~a~~~l~~aa~---------~ 311 (311) .++||..+...+.... + T Consensus 292 --~~~hp~G~s~~~a~v~~~~~~~~~~s 317 (349) T protein:vir:94 292 --WLLHPFGYSFTSAVITGNGTETIARS 317 (349) T ss_pred --EEeeeeeeeecccccCCCccccccCC Confidence 3678888776653211 1 No 190 >protein:vir:96079 Length: 382 # NCBI annotation: hypothetical protein ORF023 # Family: family:all:1653 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294440;genbank:gi:149408337;genbank:GeneID:5237198 Probab=96.97 E-value=0.00011 Score=42.23 Aligned_cols=281 Identities=10% Similarity=-0.033 Sum_probs=132.1 Q ss_pred CcccCCCceEcchh----HHHHHHHHHHhhchhhhhcceeecCC---CceEEEEEeCCceeEEeecCccccccccceeEE Q lcl|Aclame:pro 1 MVALATGTFQLPKH----LVPGVWQKAQGQSVLARLSMAEPQEF---GEQQYMTLTAPPRGEVVGEGAQKSESTATFAPV 73 (311) Q Consensus 1 mat~~~g~~~vP~~----~~~~ii~~~~~~s~l~~l~~~~~~~~---~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v 73 (311) ..|+.+.| ||.. +...+++-+.+......+..+.+.+. ..+.++.....+.+.+.+.+.+.|..+...+.. T Consensus 70 ~~t~~~~g--~p~~~l~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~t~ty~~~e~~G~A~~ygd~~D~Pl~d~~~~~~ 147 (382) T protein:vir:96 70 PVTTPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTSWNANFE 147 (382) T ss_pred ccccCCcc--HHHHHHhhhhhhhhhhhhhhhhhhhhccccccCCccceEEEEeeeecccceEEeecccCCCcccccccee Confidence 22222222 4644 45566677777777777777665443 234677777778888889888888766443333 Q ss_pred EEeeeeEEEEEeec-HHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCcccccccccc--cc-ccc-ccee Q lcl|Aclame:pro 74 TAIPRKVQVTQRFS-QEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAK--IL-DTT-NIVE 148 (311) Q Consensus 74 ~l~~~kl~~~i~iS-~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~--~~-~~~-~~~~ 148 (311) +-..+.++....++ .|+.+.. ....++.+.-+....+++.+++|+-.|+|.+++...+..|+.+. +. ..+ .... T Consensus 148 ~r~v~~~~~g~~yg~lE~~rAa-~~~~~l~~~Ka~aA~~ale~~~N~i~f~G~~~g~~~~~yGllNdP~l~a~~t~a~~~ 226 (382) T protein:vir:96 148 RRTIVRGELGLLVGTLEEGRAS-AIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPPFQTPPSQG 226 (382) T ss_pred EEEEEEEEEeeeecHHHHHHHH-hhCCCcHHHHHHHHHHHHHHhhceEEEEeeecCcCcceEEEEeCCCcccccccCCCC Confidence 33344455556664 5555433 23466777788888999999999999999654333322232221 10 000 0011 Q ss_pred eccccccchHHHHHHHHHHHhhcCC---C----ccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeec Q lcl|Aclame:pro 149 LTTGTSATPDLAVEAAVGLVLGDNL---S----PDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSD 221 (311) Q Consensus 149 ~~~~~~~~~~~~i~~~~~~~~~~~~---~----~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~ 221 (311) ....+....++++..++..+...-. . +..+++.|+.+..|.+. +..|..++.-.... +-++.++. T Consensus 227 Wa~kT~~eI~~Di~~l~~~i~~qt~G~~~~~~~~~~L~LP~~~~~~Ls~~-n~~g~Tvl~~lk~n-----~Pnl~i~t-- 298 (382) T protein:vir:96 227 WATADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITMALATSKVDYLSVT-TPYGISVSDWIEQT-----YPKMRIVS-- 298 (382) T ss_pred cccccHHHHHHHHHHHHHHHHhccCCeeeecccceEEeechHHHhhcccc-CccCccHHHHHHHh-----cCCcEEEE-- Confidence 1222333456777777777754322 1 22477888888777432 22232222111100 11122221 Q ss_pred ccccccccccccccccccccccc-eEEEeecc----------eEEEEeecCceEEEeccCCcccchhhhhcCc-EEEEEE Q lcl|Aclame:pro 222 TVRGGPEAVTASTGVYRTTNPNV-KAIAGDFS----------AFRWGVQVSIPLELIEFGDPDGLGDLKRQNQ-IAIRAE 289 (311) Q Consensus 222 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~gd~~----------~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~-v~~ra~ 289 (311) +|.......+.. +... .+++.+-- ...+..+-.+.+.+.+ . ..+.. ....+. T Consensus 299 -~peL~~a~~~g~------g~~~~~~~~~~e~~~~~~~s~~~p~~f~q~~p~~~~~l~---v------e~~~~~~~~~~s 362 (382) T protein:vir:96 299 -APELSGVQMQGK------TPEDALVLFVEEVDASVDGSTDGGSVFSQLVQSKFITLG---V------EKRAKSYVEDFS 362 (382) T ss_pred -ccccccccCCCc------cceeEEEEecchhhhhcccccccCcceeccccceeeecc---c------eeecceeEeccc Confidence 111111100000 0000 00111100 0000000000000000 0 00000 001111 Q ss_pred -EEeccEEecccceEEEEec Q lcl|Aclame:pro 290 -VVYGIGIMSTDAFAVVRDA 308 (311) Q Consensus 290 -~r~~~~v~~~~a~~~l~~a 308 (311) ...|..+.+|.||++++.= T Consensus 363 ~~t~Gv~i~~P~ai~~~~GI 382 (382) T protein:vir:96 363 NGTAGALCKRPWAVVRYLGI 382 (382) T ss_pred cceeeeEEEcchhhhhccCC Confidence 2367788899999999977 No 191 >protein:vir:99576 Length: 388 # NCBI annotation: hypothetical protein # Family: family:all:1653 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039801;genbank:gi:126011051;genbank:GeneID:4818271 Probab=96.93 E-value=6.6e-05 Score=43.50 Aligned_cols=282 Identities=10% Similarity=-0.017 Sum_probs=134.3 Q ss_pred Cc-------ccCCCceEcchhHH----HHHHHHHHhhchhhhhcceeecCCC---ceEEEEEeCCceeEEeecCcccccc Q lcl|Aclame:pro 1 MV-------ALATGTFQLPKHLV----PGVWQKAQGQSVLARLSMAEPQEFG---EQQYMTLTAPPRGEVVGEGAQKSES 66 (311) Q Consensus 1 ma-------t~~~g~~~vP~~~~----~~ii~~~~~~s~l~~l~~~~~~~~~---~~~~p~~~~~~~a~~v~Eg~~~~~~ 66 (311) +| -.++++.=||-.+. +.|++.+.......++..+.+.+.- ...+++....+.+.+.+.+.+.|.. T Consensus 65 ~a~da~~~~~~t~~~~gip~~~~~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~~~~f~v~e~~G~A~~ygd~~D~Pl~ 144 (388) T protein:vir:99 65 QAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLS 144 (388) T ss_pred cccCcccccccccCcccHHHHHhhhhccceeeeeechhhhhhhccccccCCccceeEEEeeeecceeEEEeecccCCCce Confidence 11 11223333665543 4555555555556666666554322 3456666667788888888888877 Q ss_pred ccceeEEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCcccccccccc--ccccc Q lcl|Aclame:pro 67 TATFAPVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAK--ILDTT 144 (311) Q Consensus 67 ~~~~~~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~--~~~~~ 144 (311) +...+...-..+.++..+.++.+=++.......++.+.-+....+++.+++++-.|+|.+........|+.+. +.... T Consensus 145 d~~~~~~~r~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~AA~~ale~~~N~i~f~G~~g~~~~~~yGllNdP~l~a~v 224 (388) T protein:vir:99 145 SWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAI 224 (388) T ss_pred eccceeeeeeEEEEEeeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHhhhceEEEEeecCCCccceEEEeeCCCccccc Confidence 6555444444555666677775534444445667888889999999999999999999643322222222110 00000 Q ss_pred cc------eeeccccccchHHHHHHHHHHHhhcCC---Cc----cEEEEcHHHHHHHHHhhccCCceeeccccccCCCce Q lcl|Aclame:pro 145 NI------VELTTGTSATPDLAVEAAVGLVLGDNL---SP----DGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVAS 211 (311) Q Consensus 145 ~~------~~~~~~~~~~~~~~i~~~~~~~~~~~~---~~----~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~ 211 (311) .. ......+....++++..++..+...-. .+ ..+++-++.+..|.+. +..|..++.-... . T Consensus 225 ~at~~~~~~~Wa~kT~~eI~~Di~~~~~~i~~qs~g~~~~~~~~~tL~LP~~~~~~Ls~~-n~~g~Tvl~~lk~-----n 298 (388) T protein:vir:99 225 ASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWLKQ-----T 298 (388) T ss_pred ccccCCcCcccccCCHHHHHHHHHHHHHHHHHhcCCeeeecccceEEEechHHHHhcccc-CcCCccHHHHHHH-----h Confidence 00 011111233346677777777644322 12 2578888888888532 2233322211110 0 Q ss_pred ecceeEEeecccccccccccccccccccccccceEEEeec-c-----------eEEEEeecCceEEEeccCCcccchhhh Q lcl|Aclame:pro 212 FAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDF-S-----------AFRWGVQVSIPLELIEFGDPDGLGDLK 279 (311) Q Consensus 212 l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~-~-----------~~~~~~~~~~~i~~~~~~~~~~~~~~f 279 (311) +.++-++. +|...... ........+++.+. . .+.....+.++ ..+- +. T Consensus 299 ~Pnl~i~t---~pEl~~a~-------~tgg~~~~~~~~~~~~~~~~~~~~~~~t~~~~~p~~~~--~l~v-q~------- 358 (388) T protein:vir:99 299 YPRVRVMS---APELQGGN-------PDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFV--TLGV-EK------- 358 (388) T ss_pred cCCcEEEE---eccccccc-------ccCCceeEEEEecccccccccCccCcceeEEecccccc--cccc-ee------- Confidence 11222221 11111000 00011111111110 0 00000111110 0000 00 Q ss_pred hcCcEEEEEEEE-eccEEecccceEEEEec Q lcl|Aclame:pro 280 RQNQIAIRAEVV-YGIGIMSTDAFAVVRDA 308 (311) Q Consensus 280 ~~~~v~~ra~~r-~~~~v~~~~a~~~l~~a 308 (311) ..-.....+..| .|..+.+|.||++++.= T Consensus 359 ~~~~~~~~~~~rt~Gv~ir~P~Ai~~~~GI 388 (388) T protein:vir:99 359 RVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) T ss_pred cCceeEeccccceeeeEEeccchhheeccC Confidence 000111222333 46677889999999977 No 192 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=96.93 E-value=0.00025 Score=40.29 Aligned_cols=275 Identities=9% Similarity=-0.055 Sum_probs=116.6 Q ss_pred CcccCCCce--EcchhHHHHHHHHHHhhchhhhhcceee-----c--CCCceEEEEEeCCce---eEEeecCcccccccc Q lcl|Aclame:pro 1 MVALATGTF--QLPKHLVPGVWQKAQGQSVLARLSMAEP-----Q--EFGEQQYMTLTAPPR---GEVVGEGAQKSESTA 68 (311) Q Consensus 1 mat~~~g~~--~vP~~~~~~ii~~~~~~s~l~~l~~~~~-----~--~~~~~~~p~~~~~~~---a~~v~Eg~~~~~~~~ 68 (311) || ..+ ++|+-|..++++.+++..++.+++..-. . .+.++++|+...... ..+-..+.. ..+. T Consensus 1 MA----Nsl~~l~p~iia~~al~~l~~~lV~~~lV~r~y~~ef~~ak~GDTV~I~~P~~~~~~d~~~~~~t~~~--~~~l 74 (423) T protein:vir:10 1 MA----NNLDANVSQIVLKKFLPGFMSDLVLCKTVDRQLLAGEINSSTGDSVSFKRPHQFKSERTMDGDITGKS--KNSL 74 (423) T ss_pred Cc----cccccccHHHHHHHHHHHHHhhcccchhhccCCCccccccccCCEEEEeeCCceeeecccCcccCccc--cccc Confidence 77 334 7899999999999999999999886521 1 134577776432211 111111111 1112 Q ss_pred ce--eEEEEeeeeEEEEEeec-HHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCcccccccccccccccc Q lcl|Aclame:pro 69 TF--APVTAIPRKVQVTQRFS-QEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTN 145 (311) Q Consensus 69 ~~--~~v~l~~~kl~~~i~iS-~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~ 145 (311) .- -++++..+|... +.++ .|+. .+. .+++++++.+ .++++..+|..+..... ...+ + T Consensus 75 ~e~~v~l~id~~k~~a-~~v~d~E~~-l~i---~~~~~~l~~A-~~aLA~~vd~~ia~~~~---~~~~-----------~ 134 (423) T protein:vir:10 75 ISAKATGEVGNYITVA-VEYRQIEEA-LKL---NQLDQILVPI-NERMVTDLETELALFMM---KHGA-----------L 134 (423) T ss_pred ccceEEEEecceeeee-eeeChHHHh-cCh---hHHHHHHHHH-HHHHHHHHHHHHHHHhh---hccc-----------c Confidence 11 244555555444 4454 4443 232 3456655555 68999999998863211 0010 0 Q ss_pred ceeeccccccchHHHHHHHHHHHhhcCCCc--cEEEEcHHHHHHHHH----hhccCCceeeccccccCCCceecceeEEe Q lcl|Aclame:pro 146 IVELTTGTSATPDLAVEAAVGLVLGDNLSP--DGVALDNTFSFMLAT----QRDSQGRKLYPELGFGTDVASFAGLNAAV 219 (311) Q Consensus 146 ~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~--~~~v~n~~~~~~l~~----lkd~~g~~~~~~~~~~~~~~~l~G~pv~~ 219 (311) .... .+.....|+++.++-..|...+... -..+++|.....|.+ +...++. .-.....++-.+.+.|+.++. T Consensus 135 ~vgt-~~t~~~a~~~~a~a~~~L~~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~-~~~alr~~~i~G~~~GFdi~~ 212 (423) T protein:vir:10 135 SLGS-PNTPIKKWSDVAQTASFLKDLGINSGENYAVMDPWAAQRLADAQSGLHVSEQL-VRTAWENAQISGNFGGIRALM 212 (423) T ss_pred cccc-cccccccHHHHHHHHHHHhhccCCcCCCEEEeCHHHHHHHhhhhhhhcccccc-chHHHHhcccceeecceEEEE Confidence 0000 1111234677777776676555543 247889998887753 2221111 111122233347999999999 Q ss_pred ecccccccccccc-----cccccc------------------cccccceEEEeecceEEE-EeecCceEEEeccCCcccc Q lcl|Aclame:pro 220 SDTVRGGPEAVTA-----STGVYR------------------TTNPNVKAIAGDFSAFRW-GVQVSIPLELIEFGDPDGL 275 (311) Q Consensus 220 ~~~~~~~~~~~~~-----~~~~~~------------------~~~~~~~~~~gd~~~~~~-~~~~~~~i~~~~~~~~~~~ 275 (311) |+.+|........ ...... +......+-.||.-.+.- .....++-++.- .+.+ T Consensus 213 Sn~vp~~T~g~~~ga~~~~~~~~vt~a~~~~~~~~~~~~~~~T~s~~g~l~~GD~~t~aGv~~v~~~tk~~l~---~~~~ 289 (423) T protein:vir:10 213 SNGLASRTQGAFGGKLTVKGTPEVNYDSVKDSYAFTATLTGATASKKGFLKVGDQLQFDDTHWLNQQSKQTLY---NGAS 289 (423) T ss_pred ecCCcccccccccceeeeeeeeEEEecccccccccccceeeccceeceeEEecceEeecceeeecccccceee---cccC Confidence 9999853221100 000000 000011122232111100 000000000000 0000 Q ss_pred hhhhhcCcEEEEEEEE------eccEE------ecccce---EEE--EecccC Q lcl|Aclame:pro 276 GDLKRQNQIAIRAEVV------YGIGI------MSTDAF---AVV--RDADES 311 (311) Q Consensus 276 ~~~f~~~~v~~ra~~r------~~~~v------~~~~a~---~~l--~~aa~~ 311 (311) ..++ .+++..- -+..| ..+.++ .-+ ..|+++ T Consensus 290 ~~~~-----~~~V~~~~~~~a~~~~tv~i~p~~~~~~~~~~~~~V~a~~a~~~ 337 (423) T protein:vir:10 290 ALSF-----TATVMEDANAHSSGDVTVKISGVPIFDAGYPQYNAVDRLLAEGD 337 (423) T ss_pred Ccce-----EEEEEecccccccCceEEEeccccccccCcccccceeccccCCc Confidence 0000 1111110 01111 000000 000 011111 No 193 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=96.91 E-value=0.00026 Score=40.20 Aligned_cols=280 Identities=11% Similarity=0.016 Sum_probs=124.4 Q ss_pred CcccCCCceEcch--hHHHHHHHHHHhhchhhhhcce---------eecCCCceEEEEEeCC-c--eeEEeecC--cccc Q lcl|Aclame:pro 1 MVALATGTFQLPK--HLVPGVWQKAQGQSVLARLSMA---------EPQEFGEQQYMTLTAP-P--RGEVVGEG--AQKS 64 (311) Q Consensus 1 mat~~~g~~~vP~--~~~~~ii~~~~~~s~l~~l~~~---------~~~~~~~~~~p~~~~~-~--~a~~v~Eg--~~~~ 64 (311) ||++.=....+|+ .+..-+.+.-.+.+.+.+=+-+ ...++..+++|....- . +..+-..+ +..+ T Consensus 1 Ma~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~L~g~~e~nv~~D~~~~~~t 80 (349) T protein:vir:78 1 MAITTIGDIVTGNIPVLASYMTEDPVEKTAFFDSGILTSTPYAAEIANGPSNIANLPFWKAIDTSIEPNYSNDVYQDIAT 80 (349) T ss_pred CCceEEeeeeccCHHHHHHHHHHhhHHhhhhhhccceeccHHHHHHhhcCCCEEEeeeeecCCCCcccccCCCCcccccc Confidence 8888778999998 4655555544444544442211 1233445788887542 2 22221222 2333 Q ss_pred ccccc-eeEEEEeeeeEEEE--EeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhh---hcccCCCccccccccc Q lcl|Aclame:pro 65 ESTAT-FAPVTAIPRKVQVT--QRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGI---HGINPLTGAALSGSPA 138 (311) Q Consensus 65 ~~~~~-~~~v~l~~~kl~~~--i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l---~G~~~~~g~~~~~~~~ 138 (311) ..+.+ ..++-...+.-.++ -.++.++- ..|..+.|.+++++...+...+.+| .|.=...-.+.. + T Consensus 81 ~~kitt~~~~a~~~~r~kaw~~~Dla~~ls------G~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~a~~---~ 151 (349) T protein:vir:78 81 PRAIQTGEMMARVAYLNEGFGQADLTVELT------SQNPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSATD---A 151 (349) T ss_pred cccccccceeeeeeeeccccchhHHHHHhh------CchHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcccccccc---h Confidence 33322 33443333332222 22333331 1256777888888766655444443 331000000000 0 Q ss_pred cccccccceeeccccccchHHHHHHHHHHHhhc-----CCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceec Q lcl|Aclame:pro 139 KILDTTNIVELTTGTSATPDLAVEAAVGLVLGD-----NLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFA 213 (311) Q Consensus 139 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~-----~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~ 213 (311) ....+..+..+.+. .......+.++..++-.. .-.-++++||+..+..|++.+--. + +++........+++ T Consensus 152 ~~~~~~~t~d~s~~-a~~~~~~~~dA~~~lgda~~Gd~~~~lt~i~mHS~v~~~L~~~~li~--~-i~~s~~~~~i~ty~ 227 (349) T protein:vir:78 152 YHEQNDMVVDVSAT-LGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLID--F-IRDAENNTMFATYQ 227 (349) T ss_pred hhhcccceeeeccc-cCCChhhhhhhHHHHHHHhccccccceeEEEEchHHHHHHHhhhhhh--h-ccCcccCcccceec Confidence 00000111111111 112334455555444332 223357999999999998753200 0 01111222347899 Q ss_pred ceeEEeecccccccccccccccccccccccceEEEeecceEEEEee-cCceEEEeccCCcccchhhhhcCcEEEEEEEEe Q lcl|Aclame:pro 214 GLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQ-VSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVY 292 (311) Q Consensus 214 G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~-~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~ 292 (311) |++|++++.+|....- ....+ ..++||. ..+.++.- ....+++.|+....+ ..++-.+-.+.| T Consensus 228 G~~VivDD~~Pv~~~g---~~~~y------ttylfg~-GAi~~~~~~~~~~~et~rd~~~g~-----~~G~d~l~~R~~- 291 (349) T protein:vir:78 228 GYRVIVDDSMTVVGQG---AQRKF------ISIIFGQ-GAIGYGEGNPVMPLEYEREASRAN-----GGGVETLWTRKT- 291 (349) T ss_pred CeEEEEeCCCccccCC---CCceE------EEEEeec-ceEEEccCCCccceeeecccccCC-----cceeEEEEEeeE- Confidence 9999999999843210 00001 1344553 22223221 112355555432110 112223333333 Q ss_pred ccEEecccceEEEEeccc---------C Q lcl|Aclame:pro 293 GIGIMSTDAFAVVRDADE---------S 311 (311) Q Consensus 293 ~~~v~~~~a~~~l~~aa~---------~ 311 (311) .++||..+...+.... + T Consensus 292 --~~~hp~G~s~~~a~v~~~~~~~~~~s 317 (349) T protein:vir:78 292 --WLLHPFGYRFTSAVITGNGTETIARS 317 (349) T ss_pred --EEeeeeeeeeccccccCCccccccCC Confidence 3677777766643211 1 No 194 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=96.82 E-value=0.00032 Score=39.76 Aligned_cols=278 Identities=11% Similarity=0.007 Sum_probs=128.0 Q ss_pred CcccCC----CceEcchhHHHHHHHHHHhhchhhhhcce---------eecCCCceEEEEEeCC-ceeEEeecCc---cc Q lcl|Aclame:pro 1 MVALAT----GTFQLPKHLVPGVWQKAQGQSVLARLSMA---------EPQEFGEQQYMTLTAP-PRGEVVGEGA---QK 63 (311) Q Consensus 1 mat~~~----g~~~vP~~~~~~ii~~~~~~s~l~~l~~~---------~~~~~~~~~~p~~~~~-~~a~~v~Eg~---~~ 63 (311) |+.... ..+++|+.+..-+.+.-.+.+.+.+=+-+ ...++..+++|....- ....-+.+.. ++ T Consensus 1 M~~~~~~T~l~Dii~pEvF~~Yv~~~~~e~~~l~qSGiv~~d~~l~~~~~~gG~~v~iPf~~~L~g~~~n~~~d~~~~~~ 80 (367) T protein:vir:80 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) T ss_pred CcchhhhhhhhhccchhhhhHHHhhhhhhhhhhhhcceeecCHHHHHHhhcCCCEEEeeeeccCCCCccccCCCCCcccc Confidence 874432 45799998866666655555554432221 2234445788887543 2222233322 23 Q ss_pred ccccccee-EEE--EeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHH---hhhcccCCCccccccc- Q lcl|Aclame:pro 64 SESTATFA-PVT--AIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLI---GIHGINPLTGAALSGS- 136 (311) Q Consensus 64 ~~~~~~~~-~v~--l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~---~l~G~~~~~g~~~~~~- 136 (311) +..+.+-+ ++- +...|--..-.++.++- ..|..+.|..++++--.+...+. +|.|.=.......... T Consensus 81 t~~kittg~~~a~v~~r~kaw~~~Dla~~ls------G~dpm~~Ia~qva~yW~r~~q~~Lla~L~Gvf~~~~a~~~~~~ 154 (367) T protein:vir:80 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELA------GSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATI 154 (367) T ss_pred cccccccchheeeeehhcccchhhhHHHHhh------CchHHHHHHHHHHHHhhhhhHHHHHHHHHHhhccccccchhhh Confidence 33333322 222 22222222233444432 23566777777775554444333 3333110000000000 Q ss_pred ------ccc--ccccccceeec--c--ccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhh------ccCCce Q lcl|Aclame:pro 137 ------PAK--ILDTTNIVELT--T--GTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQR------DSQGRK 198 (311) Q Consensus 137 ------~~~--~~~~~~~~~~~--~--~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lk------d~~g~~ 198 (311) +.. -....++..++ + .......+.+.++..++-+....-++++||+..+..|++++ +++| T Consensus 155 ~~~~~~~a~~~~~~~~~~~Dis~~t~~~~~~~s~~~~~~A~~~lGD~~~~l~~i~mHS~V~~~L~~~~li~~i~~sd~-- 232 (367) T protein:vir:80 155 KTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKG-- 232 (367) T ss_pred hhhhccccccccccCceeeeeeccCCCccceecHHHHHHHHHHhccccccccEEEEchHHHHHHHhccccccccCCCC-- Confidence 000 00011111111 1 11223455677787777666556678999999999998754 3333 Q ss_pred eeccccccCCCceecceeEEeecccccccccccccccccccccccceEEEeecceEEEEee-cCceEEEeccCCcccchh Q lcl|Aclame:pro 199 LYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQ-VSIPLELIEFGDPDGLGD 277 (311) Q Consensus 199 ~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~-~~~~i~~~~~~~~~~~~~ 277 (311) ....++++|++|++++.||....-.. ..+. .++||.=. +.++.. ....+++.|+....+ T Consensus 233 -------~~~i~ty~G~~VIvDD~~Pv~~~~a~---~~yt------tYlfg~GA-i~~~~~~~~~~~E~~Rd~~~~~--- 292 (367) T protein:vir:80 233 -------QLTIPTYMGKVVIVDDGMPVFGTGAD---KTYL------SILFGGAA-FGYADGAPQVPVAVGRRELRGN--- 292 (367) T ss_pred -------ccccceecceeEEEeCCCcccccCCC---ceEE------EEEEecce-eeecccCCccceecccchhhhc--- Confidence 22357899999999999995432111 1111 34444321 222211 112234444432100 Q ss_pred hhhcCcEEEEEEEEeccEEecccceEEEEecc----------------cC Q lcl|Aclame:pro 278 LKRQNQIAIRAEVVYGIGIMSTDAFAVVRDAD----------------ES 311 (311) Q Consensus 278 ~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa----------------~~ 311 (311) ..++-.+.-+.| .++||..|...+..- .+ T Consensus 293 --~gG~d~L~~Rr~---~~~hP~G~s~~~~~v~~~~~~~~~~~~~~~~~s 337 (367) T protein:vir:80 293 --GSGLEYILERKE---WIVHPGGFNWLDADVTIPDNTGSPSGITSGPPA 337 (367) T ss_pred --CCceEEEEeeee---EEeecceeeecccccccccccccccccccccCC Confidence 012212222222 588998877764321 11 No 195 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=96.75 E-value=0.00012 Score=42.06 Aligned_cols=191 Identities=13% Similarity=0.048 Sum_probs=87.1 Q ss_pred EEEEEeecHHHhhcCc--hhhHHHHHHHHHHHHHHHHHHHHHHhhhcc--cCCCccccccccccccccccceeecccccc Q lcl|Aclame:pro 80 VQVTQRFSQEVKWADE--SRQLGVLQTMADLSGVALGRALDLIGIHGI--NPLTGAALSGSPAKILDTTNIVELTTGTSA 155 (311) Q Consensus 80 l~~~i~iS~ell~~s~--~~~~~~~~~i~~~la~~ia~~~d~~~l~G~--~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (311) +- -.-+|+-++.+-+ .+..|+.....+++.+++++..|+.++.-- ......+..+.+.+ ... ......+.+.. T Consensus 1 iD-~lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~~g-~~~-~~~a~~t~~~~ 77 (221) T protein:vir:17 1 MD-DLLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQDGG-FSV-NIGAGNTNNAQ 77 (221) T ss_pred CC-cchhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccccccC-cce-eccccccCCHH Confidence 11 1123343332222 234577888999999999999999987421 01111111111110 000 01111112223 Q ss_pred chHHHHHHHHHHHhhcCCCcc-E-EEEcHHHHHHHHHhhcc-CCceeecc----ccccCCCceecceeEEeecccccccc Q lcl|Aclame:pro 156 TPDLAVEAAVGLVLGDNLSPD-G-VALDNTFSFMLATQRDS-QGRKLYPE----LGFGTDVASFAGLNAAVSDTVRGGPE 228 (311) Q Consensus 156 ~~~~~i~~~~~~~~~~~~~~~-~-~v~n~~~~~~l~~lkd~-~g~~~~~~----~~~~~~~~~l~G~pv~~~~~~~~~~~ 228 (311) ..++.+.++...+...+.... . ++++|+.+..|.+-.|. --+.-+.. ...+...+++.|++|+.|+.+|.... T Consensus 78 ~l~dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~~~~g~~i~~v~G~~V~~SnnlP~~~g 157 (221) T protein:vir:17 78 AIVDGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNREIGNTQGDMNTGKGLYVNAGIRIYKSNVLASLYG 157 (221) T ss_pred HHHHHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeeeecccccccccccceeeeecCcEEEEeccCCcccc Confidence 346778888888877776644 3 55589877777542221 11111110 11222467899999999999996432 Q ss_pred cccccc-cccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEe Q lcl|Aclame:pro 229 AVTAST-GVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRD 307 (311) Q Consensus 229 ~~~~~~-~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~ 307 (311) ...... ............+=|||+... +.+.||+|+..+|. T Consensus 158 t~~~~~ag~~~~~~~~~~~yr~~fs~~~--------------------------------------glv~~~~Avgtvkl 199 (221) T protein:vir:17 158 TNLVTDPGDATTSGENNGSYRPAITDRA--------------------------------------GLVFHKEAADTVEV 199 (221) T ss_pred cccccCCccccccccccccccccccceE--------------------------------------EEEEcchheeeeee Confidence 211100 000001111111222222211 23455555544442 Q ss_pred ccc-C Q lcl|Aclame:pro 308 ADE-S 311 (311) Q Consensus 308 aa~-~ 311 (311) -.- | T Consensus 200 ~~~~~ 204 (221) T protein:vir:17 200 LLPPS 204 (221) T ss_pred ecCCC Confidence 211 1 No 196 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=96.60 E-value=0.00048 Score=38.77 Aligned_cols=301 Identities=14% Similarity=0.054 Sum_probs=148.5 Q ss_pred Cc---ccCCC------ceEcc---h-hHHHHHHHHHHhhchhhhhcceeecCCCc---eEEEEEeCCceeE-EeecC--- Q lcl|Aclame:pro 1 MV---ALATG------TFQLP---K-HLVPGVWQKAQGQSVLARLSMAEPQEFGE---QQYMTLTAPPRGE-VVGEG--- 60 (311) Q Consensus 1 ma---t~~~g------~~~vP---~-~~~~~ii~~~~~~s~l~~l~~~~~~~~~~---~~~p~~~~~~~a~-~v~Eg--- 60 (311) |+ +...| |..-| + .|....+..+++.-++.+++...++|.+. ++..+...-+.+. --.|| T Consensus 1 ~~~~~a~~~~~~~s~~g~~~~~~~t~y~~~k~L~~Aa~~lv~~~fA~~~piPkn~GkTIk~r~y~pl~~~~~pl~eGv~a 80 (401) T protein:vir:95 1 MLNYNAPTDGQKSSIDGANSDQMQTFFWLKKAIITARKEQYFMPLASVTNMPKHYGKTIKVYEYVPLLDDRNINDQGIDA 80 (401) T ss_pred CCccCCCcccccccccccccceeeehhhHHHHHhhhhhhhhhhhcccccccccccCCeEEEEecccccccccchhcCCCc Confidence 54 11111 11123 2 34566677777789999999999998653 3333322222211 11222 Q ss_pred --cc-----------------------------ccccccceeEEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHH-HHH Q lcl|Aclame:pro 61 --AQ-----------------------------KSESTATFAPVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTM-ADL 108 (311) Q Consensus 61 --~~-----------------------------~~~~~~~~~~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i-~~~ 108 (311) ++ ......+-..++.+.++++.++.+|+++.....|. .+.+.+ ++. T Consensus 81 ~G~~~~~g~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~~D~--~l~~h~s~el 158 (401) T protein:vir:95 81 SGATIVNGNLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFDSDD--GLMEHLSREL 158 (401) T ss_pred ccccccCccccccccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhhcch--HHHHHHHHHH Confidence 21 11223334556778899999999999987544443 344433 222 Q ss_pred HHHH---HHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchHHHHHHHHHHHhhc-------------- Q lcl|Aclame:pro 109 SGVA---LGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPDLAVEAAVGLVLGD-------------- 171 (311) Q Consensus 109 la~~---ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~-------------- 171 (311) +.-+ ....+-..+|++-+..-..+... ..++.... ........++++..+...|..+ T Consensus 159 l~g~~~~t~d~i~~dll~ag~~viyAg~at-----s~At~~~~-~~~~t~vt~~~l~rl~~~L~~nRapk~t~~i~~s~~ 232 (401) T protein:vir:95 159 MNGATQITEAVLQKDLLAAAGTVLYAGAAT-----SDATITGE-GSTPSVVSYKNLMRLDQILTENRTPTQTTIITGSRM 232 (401) T ss_pred hhhhhhhHHHHHHHHHHhhcCeeecCCccc-----eeeecccc-ccccceechhHHHHHHHHHHhcccccchhhhhhhhc Confidence 2222 23333344564411000000000 00111111 1112223455666665555431 Q ss_pred -CC---Ccc-EEEEcHHHHHHHHHhhccCCceeec--------cccccCCCceecceeEEeeccccccccccccccc--- Q lcl|Aclame:pro 172 -NL---SPD-GVALDNTFSFMLATQRDSQGRKLYP--------ELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTG--- 235 (311) Q Consensus 172 -~~---~~~-~~v~n~~~~~~l~~lkd~~g~~~~~--------~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~--- 235 (311) +- .++ .-++|+.....|+.++|-.|.|-|. .....+..|++.++-+++++.+.-=....++... T Consensus 233 ~dTk~i~~s~va~~h~~L~~di~a~~D~~~~~~fi~v~kYa~~~~i~~gEiG~i~~vR~i~~p~~~~w~~ag~~a~~~~~ 312 (401) T protein:vir:95 233 IDTKVIGATRVMYVGSELVPELKAMKDLFGNKAFIETQHYADAGTIMNGEVGSIDKFRIIQVPEMLHWAGAGAQATGANP 312 (401) T ss_pred cCccccccceEEEEecCchhHHHHHHHhcCCCCceehhhcCCccccccccccccCceeEEecccceeecCCccccccccc Confidence 11 112 2567999999999999988876663 3344556788888888877664410001100000 Q ss_pred --------ccccccccceEEEeecceEEEEeecCc-----eEEE--eccCCcccchhhhhcCcEEEEEEEEeccEEeccc Q lcl|Aclame:pro 236 --------VYRTTNPNVKAIAGDFSAFRWGVQVSI-----PLEL--IEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTD 300 (311) Q Consensus 236 --------~~~~~~~~~~~~~gd~~~~~~~~~~~~-----~i~~--~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~ 300 (311) ..+...--..+++|....-.+....+- .+-+ ..+..++..-.|-|++-+.+++ .+++.+++++ T Consensus 313 ~y~~~~~~~gg~~dVyp~lV~G~dAf~~~~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQ~g~vgwK~--~~a~~vL~~e 390 (401) T protein:vir:95 313 GYRTSMVSGQEHYDVYPMLVVGDDSFTSIGFQTDGKSLKFTVMTKMPGKETADRNDPYGETGFSSIKW--YYGILVKRPE 390 (401) T ss_pred ccccccccCCCcceeeeeeEEccccceecccccCCccccceeEeecCCcCCCCCCCcccceehhhhhh--hhhhheeccc Confidence 000001111344565544333222211 2222 2222122222455677777764 7889999999 Q ss_pred ceEEEEecccC Q lcl|Aclame:pro 301 AFAVVRDADES 311 (311) Q Consensus 301 a~~~l~~aa~~ 311 (311) -+++|+.++-- T Consensus 391 ~m~~ies~a~~ 401 (401) T protein:vir:95 391 RLALIKTVAPL 401 (401) T ss_pred eeEEEEeecCC Confidence 99999866655 No 197 >protein:vir:96792 Length: 315 # NCBI annotation: major capsid protein # Family: family:all:47 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224246;genbank:gi:62362381;genbank:GeneID:3345731 Probab=96.29 E-value=0.00078 Score=37.62 Aligned_cols=264 Identities=9% Similarity=-0.012 Sum_probs=105.8 Q ss_pred CcccCCCceEc-chhHHHHHHHHHHhhchhhhhcce--eecCC----Cce-EEEEEe-CCcee-EEeecCccccccccc- Q lcl|Aclame:pro 1 MVALATGTFQL-PKHLVPGVWQKAQGQSVLARLSMA--EPQEF----GEQ-QYMTLT-APPRG-EVVGEGAQKSESTAT- 69 (311) Q Consensus 1 mat~~~g~~~v-P~~~~~~ii~~~~~~s~l~~l~~~--~~~~~----~~~-~~p~~~-~~~~a-~~v~Eg~~~~~~~~~- 69 (311) |+|+=-....| -+.+....+|.+++.......+.- +...+ |.. +.+-.. ++... .-+.........+.+ T Consensus 1 ~~~t~~sdl~vfn~~~~~a~~e~~~~~~~~Fnaas~Gai~l~~~~~~GDf~~~~ff~i~~~~~~rnv~~~~~~t~~kit~ 80 (315) T protein:vir:96 1 MATTVNSDLVIYNDTAQTAYLERNMDNLAVFNENSRAAIGLNSELIEGDLKLRSFYKVGGAIADRDVNSTATVAGTKIAA 80 (315) T ss_pred CceeeecceeeehhhhhhhHHhhhHHHHHHhhhhcCCcccccccccccccccccccccccchhhcccCCCccccceeccc Confidence 88775555333 334456677776665444333211 11110 111 111111 11100 011111222222211 Q ss_pred eeEEEEeeeeEE-EEEee--cHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccc Q lcl|Aclame:pro 70 FAPVTAIPRKVQ-VTQRF--SQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNI 146 (311) Q Consensus 70 ~~~v~l~~~kl~-~~i~i--S~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~ 146 (311) ..++. .|++ +.-++ +.+.+.-...+.......|..++..++.+.+-...+.|.-. ...+. +.. T Consensus 81 ~~dva---Vk~~~~~~~~~~~~~~~a~~g~dp~~~~~~i~~~~~~~~l~~~l~~~l~~~~a----ai~~~-------t~~ 146 (315) T protein:vir:96 81 DEMVS---VKVPWKYGPYETTEEAFKRRARSPEEFSMLIGQDMADATMAGWIGYALNALQG----AIGSN-------AGM 146 (315) T ss_pred cccee---EEEeecCCchhccHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh----hhccc-------ccc Confidence 12222 2222 22222 33333211222233334455555555555554444433110 00010 000 Q ss_pred eeeccccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccc---cCCCceecceeEEeeccc Q lcl|Aclame:pro 147 VELTTGTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGF---GTDVASFAGLNAAVSDTV 223 (311) Q Consensus 147 ~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~---~~~~~~l~G~pv~~~~~~ 223 (311) .. +..........+.++..++-+....-..|+||...+..|.+ +. --..++.+... +..++ .+|++|++++.| T Consensus 147 ~~-~~~~a~~~~~~l~dA~~klGD~~~~l~~~vMHS~v~~~L~~-q~-L~~~~~~~~~~~~~~~~~~-~lGkrViVdD~~ 222 (315) T protein:vir:96 147 NV-SGELATEGKKVLTKGLRTMGDKASSIAIWVMDSTSYFDIVD-EA-IDNKLYEEAGVVVYGGTPG-TLGKPVLVTDQC 222 (315) T ss_pred cc-cccccccCHHHHHHHHHHhcccccCeeEEEEchHHHHHHHH-hh-hhhhcccccceeEecCcCc-ccccEEEEECCC Confidence 11 11223344566788888886666666789999999999976 21 11222221111 11233 449999999999 Q ss_pred ccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEecc-EEecccce Q lcl|Aclame:pro 224 RGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGI-GIMSTDAF 302 (311) Q Consensus 224 ~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~-~v~~~~a~ 302 (311) |...... ++. ..+.+.....+.. .++.. .++=.+-...|..+ -.+||..| T Consensus 223 P~~~~~g-----------------l~~-GAi~~~~~~~~~~--~~~~~---------~g~e~l~~~~r~e~tf~l~p~G~ 273 (315) T protein:vir:96 223 PATKIFG-----------------LVA-GAVMITESQAPGM--RSYQI---------DDQENLAIGFRAEGTANVEVLGY 273 (315) T ss_pred Ccceeee-----------------eec-ceeeecCCCcccc--ccccC---------CCcceeEEEEeeeeEeeeeeeeE Confidence 8531111 001 1111221222111 11100 01111112233333 36778777 Q ss_pred EEEEecccC Q lcl|Aclame:pro 303 AVVRDADES 311 (311) Q Consensus 303 ~~l~~aa~~ 311 (311) ..-+.+-.+ T Consensus 274 sw~~~~~~s 282 (315) T protein:vir:96 274 KWKTKTNVN 282 (315) T ss_pred EeecCCCcC Confidence 664332222 No 198 >protein:vir:99888 Length: 309 # NCBI annotation: capsid protein # Family: family:all:908 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164075;genbank:gi:56692607;genbank:GeneID:3192616 Probab=96.26 E-value=0.00054 Score=38.52 Aligned_cols=286 Identities=12% Similarity=0.058 Sum_probs=131.3 Q ss_pred cCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcee----EEeecCccccccccceeEEEEeeee Q lcl|Aclame:pro 4 LATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRG----EVVGEGAQKSESTATFAPVTAIPRK 79 (311) Q Consensus 4 ~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a----~~v~Eg~~~~~~~~~~~~v~l~~~k 79 (311) .+.+-..+-+.+.+--+..-.+..+-..+++.++++.-..+||+....-.. .-++-++....-+++....+..... T Consensus 1 ~~~~~~~~dp~LT~~A~gy~n~~~Ia~~l~P~vpV~~~~~~~~~f~~~e~F~~~~t~r~~~~~~~~v~~~~~~~~~~~~~ 80 (309) T protein:vir:99 1 MSNAPFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDETGSTED 80 (309) T ss_pred CCCCCcCcCHhHHHHHhhccChhhhhhhcCCccccCccccceeeechhhcccccchhhccCCCcceEeecccCceeeecc Confidence 333333333344333333333444555677888888777788886532111 1123333333334444444444444 Q ss_pred EEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccchHH Q lcl|Aclame:pro 80 VQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPDL 159 (311) Q Consensus 80 l~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (311) -+-..+|..+-+++. ...+|.++...+.+.+.|....|..+-.-.......+ .+ ....+.++ ... +...++... T Consensus 81 ~~L~~~i~~~~~~~a-~~~~d~~~~Av~~l~~~i~l~rE~~~A~lv~~~a~y~-~~-~k~~Lsgt--~~w-sd~~SDPi~ 154 (309) T protein:vir:99 81 HGLDAPVPQADIDNA-PTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYA-AG-NKTTLSGA--DQW-SDPTSNPLP 154 (309) T ss_pred cceeecCCchhhhhc-cCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcChhhcC-CC-ceEEecCc--ccc-CCCCCCcHH Confidence 444556666655433 2345666666667777666655544332211000000 00 00011111 111 123456666 Q ss_pred HHHHHHHHHhhcCCCccEEEEcHHHHHHHHH-------hhccCCceeeccccccCCCceecce-eEEeeccccccccccc Q lcl|Aclame:pro 160 AVEAAVGLVLGDNLSPDGVALDNTFSFMLAT-------QRDSQGRKLYPELGFGTDVASFAGL-NAAVSDTVRGGPEAVT 231 (311) Q Consensus 160 ~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~-------lkd~~g~~~~~~~~~~~~~~~l~G~-pv~~~~~~~~~~~~~~ 231 (311) +|......+ +..|+..+|....|..|++ +|-..+..- ..+...-..++|+ .|++....-....... T Consensus 155 ~i~~~~~~~---g~~PN~~vlg~~~~~~l~~hp~i~~~ik~~~~~~g---~it~~~la~l~~ve~V~vg~a~~n~a~~g~ 228 (309) T protein:vir:99 155 VITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEG---MVPMAFLQELLELDAIYIGEARLNIARPGQ 228 (309) T ss_pred HHHHHHHhh---CCCcceEEechHHHHHHhhCHHHHHHhcCCCcccc---ccCHHHHHHHhCcceEEeecceeecccccc Confidence 777776554 7899999999999988765 222222111 0111112345565 3444333221000000 Q ss_pred ccccccccccccce-EEEeecce----------EEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEeccc Q lcl|Aclame:pro 232 ASTGVYRTTNPNVK-AIAGDFSA----------FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTD 300 (311) Q Consensus 232 ~~~~~~~~~~~~~~-~~~gd~~~----------~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~ 300 (311) + ......+++... ++++.... +.++.+..-++. .++ +=..+.-.+|+..++.-.++-++ T Consensus 229 ~-~~~~~iwg~~~~L~y~~~~~~~~~~ps~G~t~~~~~r~~g~~~-d~~--------~~~~g~~~vr~~~~~k~~i~~~d 298 (309) T protein:vir:99 229 N-PNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIA-DPN--------IGLRGGQRVRVGESVKELVTAPD 298 (309) T ss_pred c-cccccccCCcEEEEEcCCCCCCcccccccceeecccccCCcee-eee--------eccCCceEEEEeccccchhcchh Confidence 0 000011111111 11111110 111112221111 111 11233456888888888899999 Q ss_pred ceEEEEecccC Q lcl|Aclame:pro 301 AFAVVRDADES 311 (311) Q Consensus 301 a~~~l~~aa~~ 311 (311) +=..|+.+.++ T Consensus 299 ~G~li~~~va~ 309 (309) T protein:vir:99 299 LGFFFENAVAA 309 (309) T ss_pred cchhhhhcccC Confidence 99999999888 No 199 >protein:vir:270 Length: 341 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536650;genbank:gi:17975128;genbank:GeneID:929084 Probab=95.57 E-value=0.0018 Score=35.61 Aligned_cols=283 Identities=8% Similarity=0.032 Sum_probs=135.1 Q ss_pred Cc-----ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEeecCccccccccceeEEE Q lcl|Aclame:pro 1 MV-----ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVGEGAQKSESTATFAPVT 74 (311) Q Consensus 1 ma-----t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~ 74 (311) +| ...+-.+.|-+.+.+.+.+.+++.|-++++.+.+++..-. ..+-.-.+++-+.-+.-+ ..|. ++.++.-. T Consensus 20 ~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrtdt~-R~~r-~~~l~~~~ 97 (341) T protein:vir:27 20 LAKSYGVSNVAELFNVSPQLETKLRAAITESAEFLKMITVTTVDQIEGQVVDVGVSGLYTGRKAGG-RFTK-QVGVGGHK 97 (341) T ss_pred HHHHcCcccccceEeecHHHHHHHHHHHHhhHHhhhcCccccccceeeeEeecccccceeeccCCC-ceec-ccccCCcc Confidence 22 1123346677778899999999999999999998887533 233332333433333221 1111 12344444 Q ss_pred EeeeeEEEEEeecHHHhhcCchh---hHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccc------cccccccc- Q lcl|Aclame:pro 75 AIPRKVQVTQRFSQEVKWADESR---QLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGS------PAKILDTT- 144 (311) Q Consensus 75 l~~~kl~~~i~iS~ell~~s~~~---~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~------~~~~~~~~- 144 (311) +..++.---..|+-+.| +.+.. +.++...+++.+.++++...-.--+||+.....+.+.-. ..|+++.. T Consensus 98 Y~c~qtn~dt~i~y~~l-DaWA~~g~~~dF~~r~~~~i~~~~ALD~i~IGfnGts~A~~Td~~anPllqDVNkGWlQ~~R 176 (341) T protein:vir:27 98 YKLAETDSCAAITWAML-CQWANQGGRDQFMKHLTEFSNQMFALDIMRIGWNGVSAEADTDPSANPLGQDVNEGWIAFVK 176 (341) T ss_pred eEEEEeeeeeeecHHHH-HHHHhcCCChHHHHHHHHHHHHHHhhhhhhhcccceeeccCCChhhcccccccchhHHHHHH Confidence 44444434455677766 44443 678999999999999888777777888652222221111 12211110 Q ss_pred -----ccee--eccccccchHHHHH----HHHHH-HhhcCCC-cc-EEEEcHHHHH-HHHHhhccCCceeeccccccCCC Q lcl|Aclame:pro 145 -----NIVE--LTTGTSATPDLAVE----AAVGL-VLGDNLS-PD-GVALDNTFSF-MLATQRDSQGRKLYPELGFGTDV 209 (311) Q Consensus 145 -----~~~~--~~~~~~~~~~~~i~----~~~~~-~~~~~~~-~~-~~v~n~~~~~-~l~~lkd~~g~~~~~~~~~~~~~ 209 (311) .+.+ .........|..++ ++... +.+...+ +. +.++.+.... .-..|-.....|- .-.....-. T Consensus 177 e~a~~rVl~~~~~~~g~~gdy~nLDAlV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~pt-E~~Aa~~i~ 255 (341) T protein:vir:27 177 NRKASQVVDVDVYFDETNGDYRTLDAMASDIINNQIHPMFRNDPRLTVFVGSGLIGAAQAKLYDKADKPS-EQIAAQKLD 255 (341) T ss_pred hhcccceeccceeeccCCCccccHHHHHHHHHhcccChHHhcCCCEEEEEchhhhhhhhhhhhccCCCCH-HHHHHHHHH Confidence 0000 01111122244444 33332 2222222 22 5666655543 2222222111111 000001112 Q ss_pred ceecceeEEeecccccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEE Q lcl|Aclame:pro 210 ASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAE 289 (311) Q Consensus 210 ~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~ 289 (311) .++-|+|++..+++|.+...++... ...+++-.=+ .|+ .++-.+. .+ .++-|++ +|.++ T Consensus 256 k~iGGlpa~~~PffP~~~~lVT~L~--------NLsIY~Q~gs-----~RR--~~~d~p~--r~-rie~yes---~YvVE 314 (341) T protein:vir:27 256 KTIAGRPAYVPPFLPDNAMVVTIPE--------NLQVLTQHGT-----AQR--KAKHESD--RK-RSKTHTG---AWKVT 314 (341) T ss_pred HhhCCCeEEEccccCCCceEEeecc--------ceEEEEecCc-----EEE--EEEeccc--cc-cccchhh---hheee Confidence 5789999999999998866555432 1123322111 111 1111111 11 1111222 34443 Q ss_pred EEeccEEecccceEEEEecccC Q lcl|Aclame:pro 290 VVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 290 ~r~~~~v~~~~a~~~l~~aa~~ 311 (311) . +|+ ...-.|..+|..+++ T Consensus 315 d-yg~--~~~~~~~~vkl~~~~ 333 (341) T protein:vir:27 315 Q-WVC--WKRSPLTTQKKSTSA 333 (341) T ss_pred h-hhh--hhhccccccccCccc Confidence 3 332 334457788888888 No 200 >protein:vir:100331 Length: 342 # NCBI annotation: major capsid protein N # Family: family:all:201 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655472;genbank:gi:109289940;genbank:GeneID:4157374 Probab=95.50 E-value=0.0018 Score=35.60 Aligned_cols=283 Identities=15% Similarity=0.102 Sum_probs=134.1 Q ss_pred Cc--------ccCCC-ceEcchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEeec---Cccccccc Q lcl|Aclame:pro 1 MV--------ALATG-TFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVGE---GAQKSEST 67 (311) Q Consensus 1 ma--------t~~~g-~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~E---g~~~~~~~ 67 (311) +| ..+.+ .+.|-+...+.+.+.+++.|-++++.+.+++..-. ..+-...+++-+.-+.- ++..|..- T Consensus 16 ~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrtdT~~~~~R~~~~~ 95 (342) T protein:vir:10 16 QAELNNLPFNALATGIKFTVQPSVQQKLYEKVRESSDFLKSISFVFVDEQTGETLGLDSAHTVASTTDTSGDGERKTTSI 95 (342) T ss_pred HHHHhCCChhHccccceeecChHHHHHHHHHHHHHHHHhccCcccccccceeeEEecccCcccccccccCCCCCcccccc Confidence 22 11222 47788889999999999999999999998887533 23433334444443321 11122222 Q ss_pred cceeEEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCcccccccc------cccc Q lcl|Aclame:pro 68 ATFAPVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSP------AKIL 141 (311) Q Consensus 68 ~~~~~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~------~~~~ 141 (311) ..++.-.+..++.-.-..|+-+.| +.+..+.++.+.+++.+.++++...-.--+||+.....+.+.-.| .|++ T Consensus 96 ~~l~~~~Y~c~qTn~dt~i~Y~~l-D~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWl 174 (342) T protein:vir:10 96 AKLVKQTYHCQQINFDTHINYKQL-DMWAKFPDFQQKVANVAAKQRKRDLIMIGFNGTSRAATSDRNSNPLLQDVAKGWL 174 (342) T ss_pred cccCCCccEEEEeeecccccHHHH-HHHhcChhHHHHHHHHHHHHHhhccceecccceeeccCCChhhCcCccccchHHH Confidence 233333344444434445777776 566777889999999999998877777777886532222221111 1111 Q ss_pred -------------c--cccceeeccc-cccchHHHHHHHHHH-HhhcCCC-cc-EEEEcHHHHH--HHHHhhccCCceee Q lcl|Aclame:pro 142 -------------D--TTNIVELTTG-TSATPDLAVEAAVGL-VLGDNLS-PD-GVALDNTFSF--MLATQRDSQGRKLY 200 (311) Q Consensus 142 -------------~--~~~~~~~~~~-~~~~~~~~i~~~~~~-~~~~~~~-~~-~~v~n~~~~~--~l~~lkd~~g~~~~ 200 (311) . ..+.+.++.+ +-...+....++... +.....+ +. +.++.+.... ++..+.. .+.|-= T Consensus 175 Q~~Re~ap~rv~~~~~~~~~i~iG~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLladk~~~l~n~-~~~ptE 253 (342) T protein:vir:10 175 QKMREDAKERVMNGESTDNQVLVGKGQEYANLDALVMDATEELIDEWHRDDTDLVVITGRKLLADKYFPIVNQ-QNAPTE 253 (342) T ss_pred HHHHhhhhhhhcccceeccceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHHHHHHhc-CCChHH Confidence 1 1122222222 233334444444432 3333222 22 5666665553 2222221 112110 Q ss_pred c-cccccCCCceecceeEEeecccccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhh Q lcl|Aclame:pro 201 P-ELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLK 279 (311) Q Consensus 201 ~-~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f 279 (311) . -...-....++-|+|++..+++|.+...++... | .++++=.= ..|+. ++-.+ +.+.--++. T Consensus 254 ~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~L~-------N-LsIY~Q~g-----s~RR~--~~d~p--~r~rie~y~ 316 (342) T protein:vir:10 254 ELAADIVISQKRIGGLKAVRVPFFPANAILITKLE-------N-LAIYVQEG-----TTRKH--IENVP--KKDRIETYE 316 (342) T ss_pred HHHHHHHHhhhhhcCceeEEccccCCCceEEeecc-------c-cEEEEecC-----cEEEE--EEecc--ccccccchh Confidence 0 000011235789999999999998866554422 1 12221110 11111 11111 111111222 Q ss_pred hcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 280 RQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 280 ~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .+| -|+.|-+..+++.+..-.-+ T Consensus 317 s~N---------e~YvVEd~~~~a~iE~i~i~ 339 (342) T protein:vir:10 317 SEN---------IDYVVEDYGCAALIENITLK 339 (342) T ss_pred hhc---------cceeeeccccEEEeecceec Confidence 222 22233334444433322222 No 201 >protein:vir:1153 Length: 338 # NCBI annotation: predicted major capsid protein # Family: family:all:201 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490602;genbank:gi:17313222;genbank:GeneID:927319 Probab=94.23 E-value=0.005 Score=33.20 Aligned_cols=282 Identities=11% Similarity=-0.014 Sum_probs=137.3 Q ss_pred Cc-----ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEee--cCcccccccc-cee Q lcl|Aclame:pro 1 MV-----ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVG--EGAQKSESTA-TFA 71 (311) Q Consensus 1 ma-----t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~--Eg~~~~~~~~-~~~ 71 (311) +| ...+-.+.|.+...+.+.+.+++.|-++++.+.+++..-. ..+-.-.+++-+.-+. ...+....++ .++ T Consensus 16 ~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrtdT~~~~~R~~~~~~~l~ 95 (338) T protein:vir:11 16 LAKLNGVNSAVQTFAVEPSVQQKLEQRIQESSEFLKQINVYGVDELQGEKIGIGVSGTIASRTDTTGDGVRKPRDVSALD 95 (338) T ss_pred HHHHhCCCcccceeeeCHHHHHHHHHHHHHHHHhhccCceecccceeeeEeeeccCccccccccCCCCCccccccccccC Confidence 22 2234457788889999999999999999999999887533 2333333444444332 1111111111 233 Q ss_pred EEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCcccccccc------ccc----- Q lcl|Aclame:pro 72 PVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSP------AKI----- 140 (311) Q Consensus 72 ~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~------~~~----- 140 (311) .-.+..++.---..|+-+.| +.+..+.++...+++.+.++++...-.--+||+.....+.+.-.| .|+ T Consensus 96 ~~~Y~c~qtn~dt~i~y~~L-D~WA~~~dF~~r~~~~i~k~~ALD~i~IGfnG~s~A~~Td~~~nPllqDVNkGWlQ~~R 174 (338) T protein:vir:11 96 NQRYECKHTDFDTAITYAML-DAWAKFPEFQALLRDAILKRQALDRLMIGFNGTSAAATTNRAANPLLQDVNIGWFQQYR 174 (338) T ss_pred CCccEEEEeeeeeeecHHHH-HHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhCcCccccchhHHHHHH Confidence 33344444444456777777 566677899999999999998877777777886522222211111 111 Q ss_pred --------c--ccccceeecc---ccccchHHHHHHHHHH-HhhcCCC-cc-EEEEcHHHHHH-HHHhhccCCceeeccc Q lcl|Aclame:pro 141 --------L--DTTNIVELTT---GTSATPDLAVEAAVGL-VLGDNLS-PD-GVALDNTFSFM-LATQRDSQGRKLYPEL 203 (311) Q Consensus 141 --------~--~~~~~~~~~~---~~~~~~~~~i~~~~~~-~~~~~~~-~~-~~v~n~~~~~~-l~~lkd~~g~~~~~~~ 203 (311) . ..+..+..+. ++-...++...++... +.+...+ +. +.++.+..... -..+-.....|- +- T Consensus 175 e~ap~rv~~~~~~~~~i~i~~g~~gdy~nLDalV~d~~~~lI~~~~~~d~dLVvivG~dLladk~~~l~n~~~~pt--E~ 252 (338) T protein:vir:11 175 NNAPARVLKEGKTTGKVVVGNGADADYKNLDALVFDVVSSLIDPWHRRDPGLVVILGRELVHDKYFPMVNKDQPAT--EK 252 (338) T ss_pred hhhhhhhhhcccccceeeecCCCCCccccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHHhHHHhcCCChH--HH Confidence 1 1122222221 2233334444444432 2333222 22 57777664431 112222221211 00 Q ss_pred cc---cCCCceecceeEEeecccccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhh Q lcl|Aclame:pro 204 GF---GTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKR 280 (311) Q Consensus 204 ~~---~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~ 280 (311) .. -....++-|+|++..+++|.+...++... ..++++-.=+ .|+. ++-.+ +.+.--++.. T Consensus 253 ~Aa~~~~s~k~iGGlpa~~~PffP~~~~lVT~L~--------NLsIY~Q~gs-----~RR~--~~d~p--~r~rie~y~s 315 (338) T protein:vir:11 253 IATDLILSQKRMGGLPPVEVPYVPEKGLMVTTLK--------NLSLYWQIGG-----RRRY--LKEVP--EKNRIENYES 315 (338) T ss_pred HHHHHHHHhhhhCCceeEEccccCCCceEEeecc--------ccEEEEecCc-----EEEE--EEecc--ccccccchhh Confidence 00 11235799999999999998866554432 1122221111 1111 11111 1111112222 Q ss_pred cCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 281 QNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 281 ~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) +| -|+.|-+..+++.+..-+-+ T Consensus 316 ~N---------e~YvVEd~~~~a~ieni~~~ 337 (338) T protein:vir:11 316 SN---------DAYVVEDYGLGCLVENIEVA 337 (338) T ss_pred hc---------cceeeeccccEEEeecceec Confidence 22 23334444555544432222 No 202 >protein:vir:6061 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878202;genbank:gi:33438901;genbank:GeneID:1457736 Probab=92.58 E-value=0.011 Score=31.36 Aligned_cols=290 Identities=11% Similarity=0.008 Sum_probs=133.9 Q ss_pred Ccc-------cCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEee--cCcc-ccccccc Q lcl|Aclame:pro 1 MVA-------LATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVG--EGAQ-KSESTAT 69 (311) Q Consensus 1 mat-------~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~--Eg~~-~~~~~~~ 69 (311) +|- ..+-.+.|-+.+.+.+.+.+++.|-++++.+++++..-. ..+-...+++-+.-+. -+.+ .|..-.. T Consensus 16 ~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrtdT~~~~~R~~~~~~~ 95 (357) T protein:vir:60 16 VAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIASTTDTAGGTERQPKDFSK 95 (357) T ss_pred HHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCcccccccccCCCCCcccccccc Confidence 221 113357778888999999999999999999998887533 2333333444443321 1111 1111122 Q ss_pred eeEEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccc------cccccc- Q lcl|Aclame:pro 70 FAPVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGS------PAKILD- 142 (311) Q Consensus 70 ~~~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~------~~~~~~- 142 (311) ++.-....++.-.-..|+-+.| +.+..+.++...+++.+.++++...-.--+||+.....+.+.-. ..|+++ T Consensus 96 l~~~~Y~c~qTn~dt~i~Y~~l-D~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~ 174 (357) T protein:vir:60 96 LASNKYECDQINFDFYIRYKTL-DLWARYQDFQLRVRNAIIKRQSLDLIMAGFNGVRRAETSDRSSNQMLQDVAVGWLQK 174 (357) T ss_pred cCCCccEEEEeeeeccccHHHH-HHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCChhhCcCccccchhHHHH Confidence 3333333343333445777776 55667788999999999998887777777788653222222111 112110 Q ss_pred -------------------c-ccceeecc-ccccchHHHHHHHHHH-HhhcCCC-cc-EEEEcHHHHH-HHHHhhccCCc Q lcl|Aclame:pro 143 -------------------T-TNIVELTT-GTSATPDLAVEAAVGL-VLGDNLS-PD-GVALDNTFSF-MLATQRDSQGR 197 (311) Q Consensus 143 -------------------~-~~~~~~~~-~~~~~~~~~i~~~~~~-~~~~~~~-~~-~~v~n~~~~~-~l~~lkd~~g~ 197 (311) . +..+..+. ++-...+....++... +.....+ +. +.++.+.... .-..|-...+. T Consensus 175 ~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~ 254 (357) T protein:vir:60 175 YRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVGRQLLADKYFPIVNREQD 254 (357) T ss_pred HHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhhHhhcCCC Confidence 0 00122222 2233333344444432 3332222 23 5666665543 11222222222 Q ss_pred eeeccccc---cCCCceecceeEEeecccccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCccc Q lcl|Aclame:pro 198 KLYPELGF---GTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDG 274 (311) Q Consensus 198 ~~~~~~~~---~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~ 274 (311) |- +-.. -....++-|+|++..+++|.+...++... | .++++=.= ..|+. ++-.+ +.+. T Consensus 255 pT--E~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~-------N-LsIY~Q~g-----s~RR~--~~d~p--~r~r 315 (357) T protein:vir:60 255 NS--EMLAADVIISQKRIGNLPAVRVPYFPADAMLITKLE-------N-LSIYYMDD-----SHRRV--IEENP--KLDR 315 (357) T ss_pred hH--HHHHHHHHHHhhhhcCcceEEccccCCCceEEeecc-------c-cEEEEecC-----cEEEE--EEecc--cccc Confidence 21 0000 11135789999999999998866554321 1 12221110 11111 11111 1111 Q ss_pred chhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 275 LGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 275 ~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) --++..+| -+|..+.+--+..+..-.|...+..+++ T Consensus 316 iE~y~s~N-e~YvVEd~~~~a~iE~i~~~~~~~pa~~ 351 (357) T protein:vir:60 316 VENYESMN-IDYVVEDYAAGCLVEKIKVGDFSTPAKA 351 (357) T ss_pred ccchhhhc-ceeeeeccccEEEeeeeeeccCcccccC Confidence 11222233 2333333333333332112212212222 No 203 >protein:vir:98566 Length: 355 # NCBI annotation: gp5 # Family: family:all:201 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958060;genbank:gi:41057357;genbank:GeneID:2744237 Probab=92.48 E-value=0.011 Score=31.27 Aligned_cols=290 Identities=10% Similarity=0.026 Sum_probs=135.5 Q ss_pred Cc------c-cCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEeec--C-ccccccccc Q lcl|Aclame:pro 1 MV------A-LATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVGE--G-AQKSESTAT 69 (311) Q Consensus 1 ma------t-~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~E--g-~~~~~~~~~ 69 (311) +| + ..+-.+.|-+...+.+.+.+++.|-++++.+++++..-. ..+-.-.+++-+.-+.- + +..|..-.. T Consensus 16 ~A~~ngv~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lgv~g~iagrtdT~~~~~R~~~~~~~ 95 (355) T protein:vir:98 16 VAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTTDTSGDKERQTADFTA 95 (355) T ss_pred HHHHhCCChhHccceeecCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEeeeccCccccccccCCCCCCcccccccc Confidence 22 1 122356677778899999999999999999999887533 23333334444433321 1 112222223 Q ss_pred eeEEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccc------cccccc- Q lcl|Aclame:pro 70 FAPVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGS------PAKILD- 142 (311) Q Consensus 70 ~~~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~------~~~~~~- 142 (311) ++.-.+..++.---..|+-+.| +.+..+.++...+++.+.++++...-.--+||+.....+.+.-. ..|+++ T Consensus 96 l~~~~Y~c~qtn~dt~i~y~~L-D~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s~A~~Td~~~nPllqDVNkGWlQ~ 174 (355) T protein:vir:98 96 LESSKYECNQINFDFHLKYKTL-DLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTKNTLLQDVAVGWLQK 174 (355) T ss_pred cCCCccEEEEeeeeeeecHHHH-HHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeeccCChhhCcCccccchhHHHH Confidence 3334444444444456777777 55667788999999999999887777777788652222221111 112111 Q ss_pred --------------------cccceeecc-ccccchHHHHHHHHHH-HhhcCCC-cc-EEEEcHHHHH-HHHHhhccCCc Q lcl|Aclame:pro 143 --------------------TTNIVELTT-GTSATPDLAVEAAVGL-VLGDNLS-PD-GVALDNTFSF-MLATQRDSQGR 197 (311) Q Consensus 143 --------------------~~~~~~~~~-~~~~~~~~~i~~~~~~-~~~~~~~-~~-~~v~n~~~~~-~l~~lkd~~g~ 197 (311) .+..+..+. ++-...+....++... +.....+ +. +.++.+.... +-.++-..... T Consensus 175 ~Re~ap~~v~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~ 254 (355) T protein:vir:98 175 YRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKLLADKYFPLVNKQQE 254 (355) T ss_pred HHhcchhhhhhhhcccCccccccceeeCCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHhhhHhhccCC Confidence 011112222 2223333333444432 2332222 22 5777766443 22223222222 Q ss_pred eeec-cccccCCCceecceeEEeecccccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccch Q lcl|Aclame:pro 198 KLYP-ELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLG 276 (311) Q Consensus 198 ~~~~-~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~ 276 (311) |-=. -...-....++-|+|++..+++|.+...++... ..++++-.=+ .|+. ++-.+ +.+.-- T Consensus 255 ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~--------NLsIY~Q~gs-----~RR~--~~d~p--~r~rie 317 (355) T protein:vir:98 255 NSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLE--------NLSIYFMDES-----HRRS--IDENP--KKDRVE 317 (355) T ss_pred cHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeecc--------ccEEEEecCc-----EEEE--EEecc--cccccc Confidence 2100 000011235789999999999998866555432 1122221111 1111 11111 111111 Q ss_pred hhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 277 DLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 277 ~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ++..+| -+|..+.+--+..+. .+.+.+.++.+ T Consensus 318 ~y~s~N-e~YvVEd~~~~a~ie--nI~~~~~~~~~ 349 (355) T protein:vir:98 318 NYESMN-IDYVVEVYAAGCLLE--NITLGDFTAPA 349 (355) T ss_pred chhhhc-ceeeeeccccEEEee--ceeeeCCCCCc Confidence 222233 233333333333332 33332211111 No 204 >protein:vir:1829 Length: 355 # NCBI annotation: major capsid protein # Family: family:all:201 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052253;genbank:gi:9634060;genbank:GeneID:1262428 Probab=92.45 E-value=0.011 Score=31.25 Aligned_cols=288 Identities=9% Similarity=0.011 Sum_probs=137.3 Q ss_pred Ccc-------cCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEeec--C-ccccccccc Q lcl|Aclame:pro 1 MVA-------LATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVGE--G-AQKSESTAT 69 (311) Q Consensus 1 mat-------~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~E--g-~~~~~~~~~ 69 (311) +|- ..+-.+.|-+.+.+.+.+.+++.|-++++.+.+++..-. ..+-.-.+++-+.-+.- + +..|..... T Consensus 16 ~A~~ngv~~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lgv~g~iagrtdT~~~~~R~~~~~~~ 95 (355) T protein:vir:18 16 LAKLNGISVDDVSKKFTVEPSVTQTLMNTVQASSAFLQMINILPVAEMKGEKIGVGVTGTIASTTDTSGDKERQTADFTA 95 (355) T ss_pred HHHHhCCChhHccceeccCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEEeeccCcceeeccccCCCCCcccccccc Confidence 220 113356777788899999999999999999999887533 23333334444443321 1 122222233 Q ss_pred eeEEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccc------cccccc- Q lcl|Aclame:pro 70 FAPVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGS------PAKILD- 142 (311) Q Consensus 70 ~~~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~------~~~~~~- 142 (311) ++.-.+..++.-.-..|+-+.| +.+..+.++...+++.+.++++...-.--+||+.....+.+.-. ..|+++ T Consensus 96 l~~~~Y~c~qtn~dt~i~y~~L-D~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s~A~~Td~~~nPllqDVNkGWlQ~ 174 (355) T protein:vir:18 96 LESNKYECNQINFDFHLTYKRL-DLWARFQDFQRRIRDAIVQRQALDFIMAGFNGTTRADTSDRVKNPMLQDVAVGWLQK 174 (355) T ss_pred cCCCccEEEEeeeeeeecHHHH-HHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeeccCChhhCcCccccchhHHHH Confidence 3444444444444456777777 55667788999999999999887777777788652222221111 122211 Q ss_pred --------------------cccceeeccc-cccchHHHHHHHHHH-HhhcCCC-cc-EEEEcHHHHH-HHHHhhccCCc Q lcl|Aclame:pro 143 --------------------TTNIVELTTG-TSATPDLAVEAAVGL-VLGDNLS-PD-GVALDNTFSF-MLATQRDSQGR 197 (311) Q Consensus 143 --------------------~~~~~~~~~~-~~~~~~~~i~~~~~~-~~~~~~~-~~-~~v~n~~~~~-~l~~lkd~~g~ 197 (311) .+..+..+.+ +-...+....++... +.....+ +. +.++.+.... +-.++-...+. T Consensus 175 ~Re~ap~rV~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~d~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~ 254 (355) T protein:vir:18 175 YRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENLDALVMDGTNTLIDEIYQDDPKLVAIVGRKLLADKYFPLVNKQQE 254 (355) T ss_pred HHhcchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHHhHHhhccCC Confidence 1111222222 223333334444432 2332222 22 5777766443 22223222222 Q ss_pred eeecccccc---CCCceecceeEEeecccccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCccc Q lcl|Aclame:pro 198 KLYPELGFG---TDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDG 274 (311) Q Consensus 198 ~~~~~~~~~---~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~ 274 (311) |- +.... ....++-|+|++..+++|.+...++... ..++++-.=+ .|+. ++-.+ +.+. T Consensus 255 pt--E~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~--------NLsIY~Q~gs-----~RR~--~~d~p--~r~r 315 (355) T protein:vir:18 255 NT--ESLAADIIISQKRIGNLPAVRVPYFPANAVFVTTLE--------NLSIYFMDES-----HRRS--IDENP--KKDR 315 (355) T ss_pred hH--HHHHHHHHHHHHhhCCceeEEccccCCCceEEeecc--------ccEEEEecCc-----EEEE--EEecc--cccc Confidence 22 11111 1135789999999999998866554432 1122221111 1111 11111 1111 Q ss_pred chhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 275 LGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 275 ~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) --++..+| -+|..+.+--+..+. .+.+.+.++.+ T Consensus 316 ie~y~s~N-e~YvVEd~~~~a~ie--ni~~~~~~~~~ 349 (355) T protein:vir:18 316 VENYESMN-IDYVVEAYAAGCLLE--NITLGDFTAPA 349 (355) T ss_pred ccchhhhc-ceeeeeccccEEEEe--eeeecCCCCcc Confidence 11222333 233333333333332 33333222111 No 205 >protein:vir:5694 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839853;genbank:gi:30065708;genbank:GeneID:1260602 Probab=92.15 E-value=0.013 Score=31.00 Aligned_cols=290 Identities=10% Similarity=0.008 Sum_probs=134.1 Q ss_pred Ccc-------cCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEee--cCccccccc-cc Q lcl|Aclame:pro 1 MVA-------LATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVG--EGAQKSEST-AT 69 (311) Q Consensus 1 mat-------~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~--Eg~~~~~~~-~~ 69 (311) +|- ..+-.+.|-+...+.+.+.+++.|-++++.+++++..-. ..+-.-.+++-+.-+. -+.+....+ .. T Consensus 16 ~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrtdT~~~~~R~~~~~~~ 95 (357) T protein:vir:56 16 VAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIASTTDTAGGTERQPKDFSK 95 (357) T ss_pred HHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccccccccCCCCCCcccccccc Confidence 221 113357778888999999999999999999998887533 2333333444443321 111111111 22 Q ss_pred eeEEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccc------cccccc- Q lcl|Aclame:pro 70 FAPVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGS------PAKILD- 142 (311) Q Consensus 70 ~~~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~------~~~~~~- 142 (311) ++.-....++.-.-..|+-+.| +.+..+.++...+++.+.++++...-.--+||+.....+.+.-. ..|+++ T Consensus 96 l~~~~Y~c~qTn~dt~i~Y~~l-D~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~ 174 (357) T protein:vir:56 96 LASNKYECDQINFDFYIRYKTL-DLWARYQDFQLRVRNAIIKRQSLDFIMAGFNGVKRAETSDRSSNPMLQDVAVGWLQK 174 (357) T ss_pred cCCCccEEEEeeecccccHHHH-HHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCChhhCcCccccchhHHHH Confidence 3333333333333445777776 55667788999999999998887777777788653222222111 112110 Q ss_pred -------------------c-ccceeeccc-cccchHHHHHHHHHH-HhhcCCC-cc-EEEEcHHHHH-HHHHhhccCCc Q lcl|Aclame:pro 143 -------------------T-TNIVELTTG-TSATPDLAVEAAVGL-VLGDNLS-PD-GVALDNTFSF-MLATQRDSQGR 197 (311) Q Consensus 143 -------------------~-~~~~~~~~~-~~~~~~~~i~~~~~~-~~~~~~~-~~-~~v~n~~~~~-~l~~lkd~~g~ 197 (311) . +..+..+.+ +-...+....++... +.....+ +. +.++.+.... +-..|-...+. T Consensus 175 ~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~ 254 (357) T protein:vir:56 175 YRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVGRQLLADKYFPIVNKEQD 254 (357) T ss_pred HHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhhhhhhHhhccCC Confidence 0 001212222 233333344444432 3332222 22 4666665543 22223222222 Q ss_pred eeeccccc---cCCCceecceeEEeecccccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCccc Q lcl|Aclame:pro 198 KLYPELGF---GTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDG 274 (311) Q Consensus 198 ~~~~~~~~---~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~ 274 (311) |- +-.. -....++-|+|++..+++|.+...++... | .++++=.= ..|+. ++-.+ +.+. T Consensus 255 pT--E~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~-------N-LsIY~Q~g-----s~RR~--~~d~p--~r~r 315 (357) T protein:vir:56 255 NS--EMLAADVIISQKRIGNLPAVRVPYFPADAMLITKLE-------N-LSIYYMDD-----SHRRV--IEENP--KLDR 315 (357) T ss_pred hH--HHHHHHHHHHhhhhCCceeEEccccCCCceEEeecc-------c-cEEEEecC-----cEEEE--EEecc--cccc Confidence 21 1101 11135789999999999998866554321 1 12221110 11111 11111 1111 Q ss_pred chhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 275 LGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 275 ~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) --++..+| -+|..+.+--+..+..-.++....++++ T Consensus 316 iE~y~s~N-e~YvVEd~~~~a~iE~i~i~~~~~~~~~ 351 (357) T protein:vir:56 316 VENYESMN-IDYVVEDYAAGCLVEKIKVGDFSTPAKA 351 (357) T ss_pred ccchhhhc-ceeeeeccccEEEeeeeeeccCCCCccc Confidence 11222233 2333333333333332222222222222 No 206 >protein:vir:104011 Length: 337 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293748;genbank:gi:72537718;genbank:GeneID:3608142 Probab=91.17 E-value=0.017 Score=30.27 Aligned_cols=281 Identities=11% Similarity=-0.021 Sum_probs=135.4 Q ss_pred Cc-----ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEeec--CccccccccceeE Q lcl|Aclame:pro 1 MV-----ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVGE--GAQKSESTATFAP 72 (311) Q Consensus 1 ma-----t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~E--g~~~~~~~~~~~~ 72 (311) +| ...+-.+.|-+...+.+.+.+++.|-++++.+++++..-. ..+-.-.+++-+.-+.- +...|..-..++. T Consensus 16 ~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrt~t~~~~R~~~~~~~l~~ 95 (337) T protein:vir:10 16 IAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDTTKAARQPIDPTALDS 95 (337) T ss_pred HHHhcChhhhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeeccCcceeeeecCCCCccccccccccCC Confidence 22 1123345677778899999999999999999999887533 23333334444433322 2222222233444 Q ss_pred EEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCcccccccc------ccc------ Q lcl|Aclame:pro 73 VTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSP------AKI------ 140 (311) Q Consensus 73 v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~------~~~------ 140 (311) -.+..++.---..|+-+.| +.+..+.++...+++.+.++++...-.--+||+.....+.+.-.| .|+ T Consensus 96 ~~Y~c~qtn~dt~i~y~~L-D~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A~~Td~~~nPllqDVNkGWlQ~~Re 174 (337) T protein:vir:10 96 NRYRCEKTDYDTAIPYRKL-DMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQYRE 174 (337) T ss_pred CccEEEEeeeeeeccHHHH-HHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhCcCccccchhHHHHHHh Confidence 4444444444456777777 566677899999999999998877777777886522222221111 111 Q ss_pred -------c---ccccceeeccc-cccchHHHHHHHHHH-HhhcCCC-cc-EEEEcHHHHHH-HHHhhccCCceeeccccc Q lcl|Aclame:pro 141 -------L---DTTNIVELTTG-TSATPDLAVEAAVGL-VLGDNLS-PD-GVALDNTFSFM-LATQRDSQGRKLYPELGF 205 (311) Q Consensus 141 -------~---~~~~~~~~~~~-~~~~~~~~i~~~~~~-~~~~~~~-~~-~~v~n~~~~~~-l~~lkd~~g~~~~~~~~~ 205 (311) . .++..+.++.+ +-...+....++... +.....+ +. +.++.+..... -..+-...+.|- +-.. T Consensus 175 ~ap~rV~~~~~~~~~~i~iG~~gdy~nLDalV~D~~~~lI~~~~~~d~~LVvivG~dLladk~~~l~n~~~~pt--E~~A 252 (337) T protein:vir:10 175 RAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVNATQAPT--ERLA 252 (337) T ss_pred cchhhhhccccccCcceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhHHhccCCCcH--HHHH Confidence 1 11112222222 223333333444432 2332222 22 46666655531 112222222221 0000 Q ss_pred ---cCCCceecceeEEeecccccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcC Q lcl|Aclame:pro 206 ---GTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQN 282 (311) Q Consensus 206 ---~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~ 282 (311) -....++-|+|++..+++|.+...++... ...+++-.=+ .|+ .++-.+ +.+.--++..+| T Consensus 253 a~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~--------NLsIY~Q~gs-----~RR--~~~d~p--~r~rie~y~s~N 315 (337) T protein:vir:10 253 ADLIVSQKRIGNLPAVRVPFFPKRALMVTKLS--------NLSIYYQEGA-----RRR--TLKEVP--ERDRIENYESSN 315 (337) T ss_pred HHHHHHhhhhCCceeEEccccCCCceEEeech--------hcEEEEecCc-----EEE--EEEEcc--ccccccchhhcc Confidence 11125789999999999998866554432 1122221111 111 111111 111111222222 Q ss_pred cEEEEEEEEeccEEecccceEEEE---eccc Q lcl|Aclame:pro 283 QIAIRAEVVYGIGIMSTDAFAVVR---DADE 310 (311) Q Consensus 283 ~v~~ra~~r~~~~v~~~~a~~~l~---~aa~ 310 (311) -|+.|-+..+++.+. .+.+ T Consensus 316 ---------e~YvVEd~~~~a~ienI~~~~a 337 (337) T protein:vir:10 316 ---------DAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred ---------ceeeeeccccEEEEeceeecCC Confidence 223344444444433 2222 No 207 >protein:vir:96666 Length: 462 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238545;genbank:gi:66391271;genbank:GeneID:5130448 Probab=90.54 E-value=0.02 Score=29.86 Aligned_cols=292 Identities=11% Similarity=0.004 Sum_probs=133.3 Q ss_pred Cccc--------CCCceEcchhHHHHHHHHHHhhchh--hhhcceeecCCCceEEEEEeC---CceeEEeecCccccccc Q lcl|Aclame:pro 1 MVAL--------ATGTFQLPKHLVPGVWQKAQGQSVL--ARLSMAEPQEFGEQQYMTLTA---PPRGEVVGEGAQKSEST 67 (311) Q Consensus 1 mat~--------~~g~~~vP~~~~~~ii~~~~~~s~l--~~l~~~~~~~~~~~~~p~~~~---~~~a~~v~Eg~~~~~~~ 67 (311) |.|. ..+|.+--+.+.++|..+......+ .+-....+..+-.-+|-.... -..+.+++|++.++.++ T Consensus 26 ~~tg~g~~p~~q~~~gAlR~esL~~~i~~Lt~~~~~~~~~~~i~k~~a~sTv~~y~~~~~~G~~g~~~f~~E~g~~~~~d 105 (462) T protein:vir:96 26 YQTGYGITPDTQVDAGALRREILDDQITMLTWTQDDLIFYREISRRPAQSTVQKYDVYLRHGNVGHSRFVREVGVAPVSD 105 (462) T ss_pred HhcCCCcCCccccccchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhheeeeccCccccccccccccccccCC Confidence 3321 1233443444544444333322222 111222344433233333332 24578999999999999 Q ss_pred cceeEEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCc------ccccccccccc Q lcl|Aclame:pro 68 ATFAPVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTG------AALSGSPAKIL 141 (311) Q Consensus 68 ~~~~~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g------~~~~~~~~~~~ 141 (311) +.+.+.....+=++....+|...=.... ..|.++.+.+.....+++.++.++|+|+...+. ....|+.+.+ T Consensus 106 ~~~~R~~~~~k~l~~t~~vsi~~tl~n~--~~d~~~~~~~dai~~~a~tiE~a~Fygds~l~~~~~~~gleFDGl~~lI- 182 (462) T protein:vir:96 106 PNIRQKTVEMKYVSDTKNLSIASTLVNN--IQDPMQILTEDAIAVVAKTIEWASFYGDASLTADPTGQGLEFDGLAKLI- 182 (462) T ss_pred CceEEEEEEEEEEeeeeeechhhhhccc--hhhHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCccccccchhhhhhhc- Confidence 9999999998888887777665322222 345667888888889999999999999664433 3334443322 Q ss_pred ccccceeeccccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeec Q lcl|Aclame:pro 142 DTTNIVELTTGTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSD 221 (311) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~ 221 (311) +.-++...- +.....+.+..+-..+..+..+++-+.|+....+.|.+-.-...|-+.++.+. ....|++|-- T Consensus 183 ~~~NViDar--G~~Ls~~~ln~aa~~i~~~fGt~TD~~~p~~v~a~f~~~~l~~qrv~~~~n~g----~~~~G~~v~~-- 254 (462) T protein:vir:96 183 DKDNVIDAK--GESLTETLLNRSAVLIGKSFGTATDAYMPIGVHADFVNSVLGRQMQLMQDNSG----NVNAGYNVQG-- 254 (462) T ss_pred CCCceeecC--CCCccHHHHhhhhhhcccccCChhheecchHHHHHHHHhhcCceEEEEcCCCC----ceeeeeeccc-- Confidence 333333221 23334456666666677777788889999999998875443333444322211 1244555520 Q ss_pred ccccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhh--hc--CcEEEEEEEEeccEEe Q lcl|Aclame:pro 222 TVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLK--RQ--NQIAIRAEVVYGIGIM 297 (311) Q Consensus 222 ~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f--~~--~~v~~ra~~r~~~~v~ 297 (311) .+-....+-... ....+. ..+++.-.+..-......++... ...+. ...| ++ ....|++...-...=- T Consensus 255 f~s~~G~I~L~~----s~~m~~-~~i~~~~~~~~p~ap~~~~vsaT--v~t~~-~g~f~~~~d~~~y~Y~V~avs~dgeS 326 (462) T protein:vir:96 255 FYSSRGFIKLHG----STVMEN-ELILDESLQPLPNAPQPATVKAT--VETGK-KGLFTDEHDRAELTYKVVVNSDDAQS 326 (462) T ss_pred eeeeeeeeeeCC----ceecCc-ccccccccccCCCCCCCCceeEE--EEeCC-CCCCCCccCceeEEEEEEEECCCCcc Confidence 000000000000 000000 11111000000000001111110 00000 0011 11 1222222222222211 Q ss_pred cccceEEEEecccC Q lcl|Aclame:pro 298 STDAFAVVRDADES 311 (311) Q Consensus 298 ~~~a~~~l~~aa~~ 311 (311) -|+.++-.+.++.. T Consensus 327 ~PS~~VtaTva~~~ 340 (462) T protein:vir:96 327 APSEAVTATVNNAT 340 (462) T ss_pred ccceeeEeeeeccc Confidence 23334333333222 No 208 >protein:vir:79171 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111033;genbank:gi:134288740;genbank:GeneID:4960690 Probab=90.35 E-value=0.021 Score=29.75 Aligned_cols=281 Identities=11% Similarity=-0.020 Sum_probs=134.9 Q ss_pred Cc-----ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEeec--CccccccccceeE Q lcl|Aclame:pro 1 MV-----ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVGE--GAQKSESTATFAP 72 (311) Q Consensus 1 ma-----t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~E--g~~~~~~~~~~~~ 72 (311) +| ...+-.+.|-+...+.+.+.+++.|-++++.+++++..-. ..+-...+++-+.-+.- +...|..-..++. T Consensus 16 ~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrt~t~~~~R~~~~~~~l~~ 95 (337) T protein:vir:79 16 IAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDTTKAARQPIDPTALDS 95 (337) T ss_pred HHHhcChhhhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeeccCcceeeeecCCCCccccccccccCC Confidence 22 1112345677778899999999999999999999887533 23333334444433322 2222222233444 Q ss_pred EEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCcccccccc------ccc------ Q lcl|Aclame:pro 73 VTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSP------AKI------ 140 (311) Q Consensus 73 v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~------~~~------ 140 (311) -.+..++.---..|+-+.| +.+..+.++...+++.+.++++...-.--+||+.....+.+.-.| .|+ T Consensus 96 ~~Y~c~qtn~dt~i~y~~L-D~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A~~Td~~~nPllqDVNkGWlQ~~Re 174 (337) T protein:vir:79 96 NRYRCEKTDYDTAIPYRKL-DAWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQYRE 174 (337) T ss_pred CccEEEEeeeeeeccHHHH-HHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhCcCccccchhHHHHHHh Confidence 4444444444456777777 566777899999999999998877777777886522222221111 111 Q ss_pred -------c---ccccceeeccc-cccchHHHHHHHHHH-HhhcCCC-cc-EEEEcHHHHHH-HHHhhccCCceeeccccc Q lcl|Aclame:pro 141 -------L---DTTNIVELTTG-TSATPDLAVEAAVGL-VLGDNLS-PD-GVALDNTFSFM-LATQRDSQGRKLYPELGF 205 (311) Q Consensus 141 -------~---~~~~~~~~~~~-~~~~~~~~i~~~~~~-~~~~~~~-~~-~~v~n~~~~~~-l~~lkd~~g~~~~~~~~~ 205 (311) . .++....++.+ +-...+....++... +.....+ +. +.++.+..... -..+-...+.|- +-.. T Consensus 175 ~ap~rV~~~~~~~~~~i~iG~~gdy~nLDalV~D~~~~lI~~~~~~d~~LVvivG~dLladk~~~l~n~~~~pt--E~~A 252 (337) T protein:vir:79 175 RAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVAICGRELLHDKYFPIVNATQAPT--ERLA 252 (337) T ss_pred cchhhhhccccccCcceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhHHhccCCCcH--HHHH Confidence 1 11111222222 222333333444432 2332222 22 46666655531 112222222221 0000 Q ss_pred ---cCCCceecceeEEeecccccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcC Q lcl|Aclame:pro 206 ---GTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQN 282 (311) Q Consensus 206 ---~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~ 282 (311) -....++-|+|++..+++|.+...++... ...+++-.=+ .|+. ++-.+ +.+.--++..+| T Consensus 253 a~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~--------NLsIY~Q~gs-----~RR~--~~d~p--~r~rie~y~s~N 315 (337) T protein:vir:79 253 ADLIVSQKRIGNLPAVRVPFFPKRALMVTKLS--------NLSIYYQEGA-----RRRT--LKEVP--ERDRIENYESSN 315 (337) T ss_pred HHHHHHhhhhCCceeEEccccCCCceEEeech--------hcEEEEecCc-----EEEE--EEEcc--ccccccchhhcc Confidence 11125789999999999998866554432 1122221111 1111 11111 111111222222 Q ss_pred cEEEEEEEEeccEEecccceEEEE---eccc Q lcl|Aclame:pro 283 QIAIRAEVVYGIGIMSTDAFAVVR---DADE 310 (311) Q Consensus 283 ~v~~ra~~r~~~~v~~~~a~~~l~---~aa~ 310 (311) -|+.|-+..+++.+. .+.+ T Consensus 316 ---------e~YvVEd~~~~a~ienI~~~~a 337 (337) T protein:vir:79 316 ---------DAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred ---------ceeeeeccccEEEEeceeecCC Confidence 223344444444433 2222 No 209 >protein:vir:93966 Length: 400 # NCBI annotation: structural protein # Family: family:all:2417 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764320;genbank:gi:115315634;genbank:GeneID:5176553 Probab=90.34 E-value=0.0032 Score=34.28 Aligned_cols=269 Identities=16% Similarity=0.112 Sum_probs=120.2 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEeeeeE Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPRKV 80 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~kl 80 (311) =+|.+.-...+|..+...|-..+..+.++...+.+...+.--++.. ..+...|...-.|+.+.+...+|..-++.+.-+ T Consensus 121 GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~V~~s-~~s~~~Aq~HkdGqTK~eqa~~~~~~Tl~~~~V 199 (400) T protein:vir:93 121 GVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRS-FDSANEAQVHKDGQTKTEQAATLTIDTLEPVMV 199 (400) T ss_pred CcceeccchhccHHHHHHHHHhhhccCcceeeeeeccchhhhHHhh-hhhhhhhhhhccCCccccceeeeeeechhHHHH Confidence 2244444567899998888888888888877665544431111111 123336666778888888888887777766433 Q ss_pred EEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHH-HHHHHHhhhcccCCCccccccccccccccccceee---ccccccc Q lcl|Aclame:pro 81 QVTQRFSQEVKWADESRQLGVLQTMADLSGVALG-RALDLIGIHGINPLTGAALSGSPAKILDTTNIVEL---TTGTSAT 156 (311) Q Consensus 81 ~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia-~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~---~~~~~~~ 156 (311) ...-.+ -|+.++...+...+..++..+++.+|. +..|.++.-|+|..+-....- ..+-..+... +...... T Consensus 200 Y~~~S~-Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~AlV~GDG~N~f~~~DK----~advK~I~~~Ttkaksagkt 274 (400) T protein:vir:93 200 YKLQSL-AERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDK----EADVKKIKKITTKAKSAGKT 274 (400) T ss_pred HHHHHH-HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhhheecCCCCccchhh----HHHHHHHHHHhhhhhhcCCC Confidence 333233 445555556666778899999999999 889999998865433211111 1111111111 1111222 Q ss_pred -hHHHHHHHHHHHhhcCCCccEEEEcHHHH-HHHHHhhccCCce---eeccccccCCCceecceeE--Eeeccccccccc Q lcl|Aclame:pro 157 -PDLAVEAAVGLVLGDNLSPDGVALDNTFS-FMLATQRDSQGRK---LYPELGFGTDVASFAGLNA--AVSDTVRGGPEA 229 (311) Q Consensus 157 -~~~~i~~~~~~~~~~~~~~~~~v~n~~~~-~~l~~lkd~~g~~---~~~~~~~~~~~~~l~G~pv--~~~~~~~~~~~~ 229 (311) ..+.+..+..-+.+...+.- .+....+. ..|..++.+..+. +-+++..- .+--|+.- +.+.. T Consensus 275 pfadaieeavdfvrptagrry-livktedrkalldelrqatanahvriknddaei---asevgvdeiivytgs------- 343 (400) T protein:vir:93 275 PFADAIEEAVDFVRPTAGRRY-LIVKTEDRKALLDELRQATANAHVRIKNDDAEI---ASEVGVDEIIVYTGS------- 343 (400) T ss_pred chhHHHHHHHhhhccCCCceE-EEEeccchHHHHHHHHhhccccceEeecchhhh---hhhcCcceeeeeecc------- Confidence 23345555554443322211 33333333 3445555444332 22222211 11122211 11100 Q ss_pred ccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEec Q lcl|Aclame:pro 230 VTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDA 308 (311) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~a 308 (311) .+-+..+++ | ..|.+.+..--.+..+.|-. ..||+.+....-.-....+ |-++++.. T Consensus 344 ----------kalkptvlv-d-qkyhidmqdltkvdafewkt--------nsnmilvetltsghvetyn--agavitvs 400 (400) T protein:vir:93 344 ----------KALKPTVLV-D-QKYHIDMQDLTKVDAFEWKT--------NSNMILVETLTSGHVETYN--AGAVITVS 400 (400) T ss_pred ----------ccccceeee-c-cccccchhhhhhhhhheecc--------CCceEEEeecccCcceeec--cceeEeeC Confidence 011111221 1 12222221111111111110 1233333221111112222 22333322 No 210 >protein:vir:861 Length: 318 # NCBI annotation: putative minor structural protein # Family: family:all:2417 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047120;genbank:gi:9630573;genbank:GeneID:1261764 Probab=90.15 E-value=0.0054 Score=33.03 Aligned_cols=271 Identities=16% Similarity=0.100 Sum_probs=120.1 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEeeeeE Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPRKV 80 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~kl 80 (311) =+|.+.-...+|..+...|-..+..+.++.+.+.+...+.--++..- .+...+.-.-.|+.+.+...+|..-++.+.-+ T Consensus 39 GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~V~~s~-~s~AeAq~HkdGqTK~eqa~~~~~~Tl~~~~V 117 (318) T protein:vir:86 39 GVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSF-DSSAEAQVHKDGQTKTEQAATLTIDTLEPVMV 117 (318) T ss_pred CceeeccchhccHHHHHHHHHhhhccCcceeeeeeccchhhhhhhhh-hhhhhhhhhccCCccccceeeeeeechhHHHH Confidence 22344445678999988888888888888876655544321111111 22355666778888888888887777766433 Q ss_pred EEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHH-HHHHHHhhhcccCCCcccccccc--ccccccccceeeccccccch Q lcl|Aclame:pro 81 QVTQRFSQEVKWADESRQLGVLQTMADLSGVALG-RALDLIGIHGINPLTGAALSGSP--AKILDTTNIVELTTGTSATP 157 (311) Q Consensus 81 ~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia-~~~d~~~l~G~~~~~g~~~~~~~--~~~~~~~~~~~~~~~~~~~~ 157 (311) ...-.+ -|+.++...+...+..++..+|+.+|. +..|.++.-|+|..+-....-+. ..+..-+.....+ +.... T Consensus 118 Y~~~S~-Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~AlV~GDG~N~f~~~DK~advK~I~k~Ttkaksa--gttpf 194 (318) T protein:vir:86 118 YKLQSL-AERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGSNGFKSIDKEADVKKIKKITTKAKSA--GTTPF 194 (318) T ss_pred HHHHHH-HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhheeecCCCCccchhhHHHHHHHHHHhhhhhcc--CCCch Confidence 333233 455555556666778899999999999 88999999886644322211110 1111111111111 11112 Q ss_pred HHHHHHHHHHHhhcCCCccEEEEcHHHH-HHHHHhhccCCce---eeccccccCCCceecceeE--Eeeccccccccccc Q lcl|Aclame:pro 158 DLAVEAAVGLVLGDNLSPDGVALDNTFS-FMLATQRDSQGRK---LYPELGFGTDVASFAGLNA--AVSDTVRGGPEAVT 231 (311) Q Consensus 158 ~~~i~~~~~~~~~~~~~~~~~v~n~~~~-~~l~~lkd~~g~~---~~~~~~~~~~~~~l~G~pv--~~~~~~~~~~~~~~ 231 (311) ...+..+..-+.+...+. -.+....+. ..|..++.+..+. +-+++..- .+--|+.- +.+. T Consensus 195 anaieeavdfvrptagrr-ylivkaedrkalldelrqatanahvriknddtei---asevgvdeiivytg---------- 260 (318) T protein:vir:86 195 ANAIEEAVDFVRPTAGRR-YLIVKAEDRKALLDELRQATANAHVRIKNDDTEI---ASEVGVDEIIVYTG---------- 260 (318) T ss_pred hhHHHHHHhhhccCCCce-EEEEeecchHHHHHHHHhhcccceeEEeccchhh---hhhcCcceeeeeec---------- Confidence 223444444443322221 134444433 3344555443322 22222111 11112211 1110 Q ss_pred ccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEec Q lcl|Aclame:pro 232 ASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDA 308 (311) Q Consensus 232 ~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~a 308 (311) +..-+..+++ | ..|.+.+..--.+..++|-. ..||+.+....-.-.... +|-++++.. T Consensus 261 -------skalkptvlv-d-qkyhidmqdltkvdafewkt--------nsnmilvetltsghvety--nagavitvs 318 (318) T protein:vir:86 261 -------SKALKPTVLV-D-QKYHIDMQDLTKVDAFEWKT--------NSNMILVETLTSGHVETY--NAGAVITVS 318 (318) T ss_pred -------cccccceeee-c-cceecchhhhhhhhcceecc--------CCceEEEeecccCcceee--cCceeEEeC Confidence 0011111221 1 12222222111111111110 123333322111111222 222333322 No 211 >protein:vir:5942 Length: 523 # NCBI annotation: similar to major head protein # Family: family:all:364 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835728;genbank:gi:30044131 Probab=90.01 E-value=0.023 Score=29.55 Aligned_cols=285 Identities=7% Similarity=-0.065 Sum_probs=113.8 Q ss_pred CcccCCCc---eEcchh--HHHHH----HHHHHhhchhhhhccee-ec-------CCCceEEEEEeCC----ce-eE--- Q lcl|Aclame:pro 1 MVALATGT---FQLPKH--LVPGV----WQKAQGQSVLARLSMAE-PQ-------EFGEQQYMTLTAP----PR-GE--- 55 (311) Q Consensus 1 mat~~~g~---~~vP~~--~~~~i----i~~~~~~s~l~~l~~~~-~~-------~~~~~~~p~~~~~----~~-a~--- 55 (311) |......+ ...+.. ..... -+.+... ......+.. .. ......+....+. .+ .. T Consensus 188 ~~~q~itg~tga~fa~s~~~an~astAss~Al~gE-A~t~~sTd~at~~~Gtt~t~~~~~lyt~~~g~~t~~~~~~~~~~ 266 (523) T protein:vir:59 188 WQYDDASGDPENTVAYPLPRYNRIVGAVGSALYAR-LFFVTGSDFATVAGGTPSTQDLDLVYYIDARNDFEDQSTDPDYP 266 (523) T ss_pred cccccccccccccccchhhcccccccccccccccc-ccccccccccccCCCcccccccccccccccccchhhcccccccc Confidence 22111111 111110 00000 0000000 000000000 00 0000001000000 00 00 Q ss_pred -EeecCccccccccceeEEEEeeeeEEEEEeecHHHhhcCch--hhHHHHHHHHHHHHHHHHHHHHHHhhhccc--CCCc Q lcl|Aclame:pro 56 -VVGEGAQKSESTATFAPVTAIPRKVQVTQRFSQEVKWADES--RQLGVLQTMADLSGVALGRALDLIGIHGIN--PLTG 130 (311) Q Consensus 56 -~v~Eg~~~~~~~~~~~~v~l~~~kl~~~i~iS~ell~~s~~--~~~~~~~~i~~~la~~ia~~~d~~~l~G~~--~~~g 130 (311) .-.++...++-..+++.+++.++.-+-...+|-||.||--. ..+|.+++|..-|+..|...|++.+|+-.. +..+ T Consensus 267 ~~~~~~~~~~eM~FsIeK~tVtAkSRaLKAeYT~ELAQDLKAiH~GLDAE~ELanILStEImlEINR~ii~~~~~~a~~~ 346 (523) T protein:vir:59 267 DPGFQSLDIPEINLELRSRPVATKTRKLRAAWTPEAMQDLAAYHKGVDLENEIVTLMSQYIAREIDLEILSTIMAHARRT 346 (523) T ss_pred ccccccccccceeeEEEeEEEeeecccccccccHHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhhhheee Confidence 11234456666677777777666666667899999987654 358899999999999999999999986522 1111 Q ss_pred cccccccccccccccceeecccc---ccchHHHHHHH-------HHHHh--hcCCCccEEEEcHHHHHHHHHhhccCCce Q lcl|Aclame:pro 131 AALSGSPAKILDTTNIVELTTGT---SATPDLAVEAA-------VGLVL--GDNLSPDGVALDNTFSFMLATQRDSQGRK 198 (311) Q Consensus 131 ~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~i~~~-------~~~~~--~~~~~~~~~v~n~~~~~~l~~lkd~~g~~ 198 (311) ........++.+........... .....+-+..+ ...+. ......+.++++++....|...-.-+++. T Consensus 347 ~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~n~i~~~t~~~~~~~~~~s~~v~~~l~~~~~~~~~~ 426 (523) T protein:vir:59 347 DNYGFWSEVVGEYYDETSGNFVAGNFYGSKQEWLATLMIELNKVSNRIQQKTAVAGANFLVTSPQVAALLESMPGFTPGN 426 (523) T ss_pred eeccccccceeeecccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEchhHHHHHHhccccccCC Confidence 11111122222221110000000 00001112222 22221 12234667999999988886422111111 Q ss_pred eeccccccC-CCcee-cceeEEeecccccccccccccccccccccccceEEEeecceE-----EEEeecCceEEEecc-C Q lcl|Aclame:pro 199 LYPELGFGT-DVASF-AGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAF-----RWGVQVSIPLELIEF-G 270 (311) Q Consensus 199 ~~~~~~~~~-~~~~l-~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~-----~~~~~~~~~i~~~~~-~ 270 (311) -.....++. ..|.| .|++|++..+.+.+ .+++|-.... .+.+..-..+...+- . T Consensus 427 ~~~~~~~~~~~~g~l~~~~~vy~d~~~~~d------------------y~~~g~k~~~~~~~~~~~y~Py~~l~~~~~~~ 488 (523) T protein:vir:59 427 DNRDGGTGIFYVGMVQGRYRLYKNIYQNQP------------------VIIMGNQDLNTPWQTGAVYAPYVPLLFTPTIV 488 (523) T ss_pred ccccccccceeEEEecCceEEEecCCCCcc------------------eEEEEecccCCcccccceecccchhhcccccc Confidence 111111111 12444 45588888776543 2222221100 011111111211111 0 Q ss_pred CcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 271 DPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 271 ~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) |++ .|| -.+.+ ..|++..|.+|-+...|-.+--- T Consensus 489 dp~----s~q-p~~~~--~tRY~l~v~nP~~~~~~~~~~~~ 522 (523) T protein:vir:59 489 DPV----NFS-YRRGL--MTRYALEVVRPEFYGLLYVKLLQ 522 (523) T ss_pred cCC----ccc-ceeee--eeehhheecchhHhhhhhhhhcC Confidence 222 143 33444 46999999888664333211111 No 212 >protein:vir:78186 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111152;genbank:gi:134288735;genbank:GeneID:4960646 Probab=89.83 E-value=0.024 Score=29.45 Aligned_cols=281 Identities=11% Similarity=-0.020 Sum_probs=134.5 Q ss_pred Cc-----ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEeecC--ccccccccceeE Q lcl|Aclame:pro 1 MV-----ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVGEG--AQKSESTATFAP 72 (311) Q Consensus 1 ma-----t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~Eg--~~~~~~~~~~~~ 72 (311) +| ...+-.+.|-+...+.+.+.+++.|-++++.+++++..-. ..+-...+++-+.-+.-+ ...|..-..++. T Consensus 16 ~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtdt~~~~R~~~~~~~l~~ 95 (337) T protein:vir:78 16 IAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDTTKAARQPIDPTALDS 95 (337) T ss_pred HHHhcChhhhcceeecChHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCcceeeeecCCCcccccccccccCC Confidence 22 1223456777888899999999999999999998887533 233333344444333222 222222223333 Q ss_pred EEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCcccccccc------cc------- Q lcl|Aclame:pro 73 VTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSP------AK------- 139 (311) Q Consensus 73 v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~------~~------- 139 (311) -....++.---..|+-+.| +.+..+.++.+.+++.+.++++...-.--+||+.....+.+.-.| .| T Consensus 96 ~~Y~c~qTn~dt~i~Y~~l-D~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~~Re 174 (337) T protein:vir:78 96 NRYRCEKTDYDTAIPYRKL-DMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQYRE 174 (337) T ss_pred CccEEEEeceecccCHHHH-HHHhcChhHHHHHHHHHHHHHhhccceecccceeeccCCChhhCcCccccchHHHHHHHh Confidence 3333333333445777776 566777889999999999988877777777886532222222111 11 Q ss_pred ------ccc---cccceeeccc-cccchHHHHHHHHHH-HhhcCCC-cc-EEEEcHHHHHHH-HHhhccCCceeeccccc Q lcl|Aclame:pro 140 ------ILD---TTNIVELTTG-TSATPDLAVEAAVGL-VLGDNLS-PD-GVALDNTFSFML-ATQRDSQGRKLYPELGF 205 (311) Q Consensus 140 ------~~~---~~~~~~~~~~-~~~~~~~~i~~~~~~-~~~~~~~-~~-~~v~n~~~~~~l-~~lkd~~g~~~~~~~~~ 205 (311) +.. +...+.++.+ +-...+....++... +.....+ +. +.++.+.....- ..+-...+.|- +-.. T Consensus 175 ~ap~rVl~~~~~~~~~i~iG~~gdy~NLDalV~d~~~~lI~~~~~~d~dLVvivG~dLladk~~~l~n~~~~pt--E~~A 252 (337) T protein:vir:78 175 RAAQRVLHEGAKQAGKVLIGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVNATQAPT--ERLA 252 (337) T ss_pred cchhhhhccccccCCceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHHHHHHhcCCCcH--HHHH Confidence 111 1112222222 233334444454542 3333232 22 566666555321 12222222221 0000 Q ss_pred ---cCCCceecceeEEeecccccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcC Q lcl|Aclame:pro 206 ---GTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQN 282 (311) Q Consensus 206 ---~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~ 282 (311) -....++-|+|++..+++|.+...++... | .++++=.= ..|+. ++-.+ +.+.--++..+| T Consensus 253 a~~i~s~k~iGGl~a~~~PfFP~~~ilVT~L~-------N-LsIY~Q~g-----s~RR~--~~d~p--~r~rie~y~s~N 315 (337) T protein:vir:78 253 ADLIVSQKRIGNLPAVRVPFFPKRALMVTKLS-------N-LSIYYQEG-----ARRRT--LKEVP--ERDRIENYESSN 315 (337) T ss_pred HHHHHHhhhhcCcceEEccccCCCceEEeech-------h-cEEEEecC-----cEEEE--EEecc--ccccccchhhcc Confidence 11235789999999999998866554321 1 12221110 11111 11111 111111222222 Q ss_pred cEEEEEEEEeccEEecccceEEEE---eccc Q lcl|Aclame:pro 283 QIAIRAEVVYGIGIMSTDAFAVVR---DADE 310 (311) Q Consensus 283 ~v~~ra~~r~~~~v~~~~a~~~l~---~aa~ 310 (311) -|+.|-+..+++.+. .+.+ T Consensus 316 ---------e~YvVEd~~~~a~iEnI~~~~a 337 (337) T protein:vir:78 316 ---------DAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred ---------ceeeeeccccEEEEeceeecCC Confidence 223344444444433 2222 No 213 >protein:vir:79157 Length: 339 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165257;genbank:gi:145708082;genbank:GeneID:5247168 Probab=89.82 E-value=0.024 Score=29.44 Aligned_cols=284 Identities=11% Similarity=0.000 Sum_probs=133.8 Q ss_pred Cc-----ccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEee--cCccccccccceeE Q lcl|Aclame:pro 1 MV-----ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVG--EGAQKSESTATFAP 72 (311) Q Consensus 1 ma-----t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~--Eg~~~~~~~~~~~~ 72 (311) +| ...+-.+.|-+...+.+.+.+++.|-++++.+++++..-. ..+-...+++-+.-+. -++..|..-..++. T Consensus 16 ~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtdt~~~~R~~~~~~~l~~ 95 (339) T protein:vir:79 16 IAKLNGVERVDEKFSVAPSVQQKLETKVQESSDFLKSINFYGVPEQEGEKIGLGVSGPVASTTDTTQQDRETSDISTMDG 95 (339) T ss_pred HHHHhCcccccceeeecHHHHHHHHHHHHHHHHHhccCcccccccceeeEEeeccCcceeecccCCCCCcccccccccCC Confidence 22 1223456777888999999999999999999998887533 2333333444443321 11122211123333 Q ss_pred EEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCcccccccc------ccc------ Q lcl|Aclame:pro 73 VTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSP------AKI------ 140 (311) Q Consensus 73 v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~------~~~------ 140 (311) -....++.-.-..|+-+.| +.+..+.++.+.+++.+.++++...-.--+||+.....+.+.-.| .|+ T Consensus 96 ~~Y~c~qTn~dt~i~Y~~l-D~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~~Re 174 (339) T protein:vir:79 96 RRYRCEQTNSDTHITYQKL-DAWAKFADFQTRIRDAIIKRQALDRIMIGFNGVSRAATSDRVANPMLQDVNKGWLQNLRE 174 (339) T ss_pred CccEEEEeeeeceecHHHH-HHHhcChhHHHHHHHHHHHHHhhccceecccceeeecCCChhhCcCccccchhHHHHHHh Confidence 3333444333445777776 566777889999999999988877767777886532222221111 111 Q ss_pred -------c---ccccceee-c-cccccchHHHHHHHHH-HHhhcCCC-cc-EEEEcHHHHH-HHHHhhccCCceeec-cc Q lcl|Aclame:pro 141 -------L---DTTNIVEL-T-TGTSATPDLAVEAAVG-LVLGDNLS-PD-GVALDNTFSF-MLATQRDSQGRKLYP-EL 203 (311) Q Consensus 141 -------~---~~~~~~~~-~-~~~~~~~~~~i~~~~~-~~~~~~~~-~~-~~v~n~~~~~-~l~~lkd~~g~~~~~-~~ 203 (311) . .+++..-+ + .++-...+....++.. ++.+...+ +. +.++.+.... +-..|-.....|-=. -. T Consensus 175 ~ap~rV~~~g~~~s~~i~~~G~ggdy~NLDalV~d~~~~lId~~~~~d~dLVvivG~dLla~k~~~l~n~~~~ptE~~Aa 254 (339) T protein:vir:79 175 QAPQRVMKEGKAAAGKITVGGAGADYGNLDALVYDITNHLVEPWYAEDPDLVVVCGRNLLSDKYFPLVNRDRDPVQQIAA 254 (339) T ss_pred hhhhhhhccceeccceeEeccCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhhHhhhHhhcCCChHHHHHH Confidence 1 11111111 1 1223334444445553 33333332 22 4666665553 212222222222100 00 Q ss_pred cccCCCceecceeEEeecccccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCc Q lcl|Aclame:pro 204 GFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQ 283 (311) Q Consensus 204 ~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~ 283 (311) ..-....++-|+|++..+++|.+...++... | .++++=.= ..|+ .++-.+ +.+.--++..+|. T Consensus 255 ~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~-------N-LsIY~Q~g-----s~RR--~~~d~p--~r~rie~y~s~Ne 317 (339) T protein:vir:79 255 DLIISQKRIGNLPAIRVPYFPANGLLVTRLD-------N-LSIYYQEG-----GRRR--TILDNA--KRDRIENYESSND 317 (339) T ss_pred HHHHHhhhhCCceeEEccccCCCceEEeech-------h-cEEEEecC-----cEEE--EEEecc--ccccccchhhccc Confidence 0011125789999999999998866554321 1 12221100 1111 111111 1111112222222 Q ss_pred EEEEEEEEeccEEecccceEEE---EecccC Q lcl|Aclame:pro 284 IAIRAEVVYGIGIMSTDAFAVV---RDADES 311 (311) Q Consensus 284 v~~ra~~r~~~~v~~~~a~~~l---~~aa~~ 311 (311) +| .|-+..+++.+ +.+.+| T Consensus 318 -~Y--------vVEd~~~~a~iEni~~~~aa 339 (339) T protein:vir:79 318 -AY--------VIEDLACAAMAENIALAAAA 339 (339) T ss_pred -ee--------eeeccccEEEeeeeecccCC Confidence 22 23333333333 233333 No 214 >protein:vir:2016 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046760;genbank:gi:9630331;genbank:GeneID:1261541 Probab=89.31 E-value=0.027 Score=29.18 Aligned_cols=288 Identities=10% Similarity=0.007 Sum_probs=132.6 Q ss_pred Ccc-------cCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEee--cCccccccc-cc Q lcl|Aclame:pro 1 MVA-------LATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVG--EGAQKSEST-AT 69 (311) Q Consensus 1 mat-------~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~--Eg~~~~~~~-~~ 69 (311) +|- ..+-.+.|-+...+.+.+.+++.|-++++.+++++..-. ..+-...+++-+.-+. -+.+....+ .. T Consensus 16 ~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrtdT~~~~~R~~~~~~~ 95 (357) T protein:vir:20 16 VAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIASTTDTAGGTERQPKDFSK 95 (357) T ss_pred HHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccccccccCCCCCCcccccccc Confidence 221 113357778888999999999999999999998887533 2333333444443322 111111111 22 Q ss_pred eeEEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccc------cccccc- Q lcl|Aclame:pro 70 FAPVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGS------PAKILD- 142 (311) Q Consensus 70 ~~~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~------~~~~~~- 142 (311) ++.-....++.-.-..|+-+.| +.+..+.++...+++.+.++++...-.--+||+.....+.+.-. ..|+++ T Consensus 96 l~~~~Y~c~qTn~dt~i~Y~~l-D~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~ 174 (357) T protein:vir:20 96 LASNKYECDQINFDFYIRYKTL-DLWARYQDFQLRIRNAIIKRQSLDFIMAGFNGVKRAETSDRSSNPMLQDVAVGWLQK 174 (357) T ss_pred cCCCccEEEEeeecccccHHHH-HHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCChhhCcCccccchhHHHH Confidence 3333333343333445777776 55667788999999999998887777777788653222222111 112110 Q ss_pred -----------------c---ccceeecc-ccccchHHHHHHHHHH-HhhcCCC-cc-EEEEcHHHHH-HHHHhhccCCc Q lcl|Aclame:pro 143 -----------------T---TNIVELTT-GTSATPDLAVEAAVGL-VLGDNLS-PD-GVALDNTFSF-MLATQRDSQGR 197 (311) Q Consensus 143 -----------------~---~~~~~~~~-~~~~~~~~~i~~~~~~-~~~~~~~-~~-~~v~n~~~~~-~l~~lkd~~g~ 197 (311) + ...+..+. ++-...+....++... +.....+ +. +.++.+.... +-..|-...+. T Consensus 175 ~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~ 254 (357) T protein:vir:20 175 YRNEAPARVMSKVTDEEGRTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVGRQLLADKYFPIVNKEQD 254 (357) T ss_pred HHhhchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhhhhhhHhhccCC Confidence 0 00121222 2233333334444432 3332222 23 4666665543 22223222222 Q ss_pred eeeccccc---cCCCceecceeEEeecccccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCccc Q lcl|Aclame:pro 198 KLYPELGF---GTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDG 274 (311) Q Consensus 198 ~~~~~~~~---~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~ 274 (311) |- +-.. -....++-|+|++..+++|.+...++... | .++++=.= ..|+. ++-.+ +.+. T Consensus 255 pt--E~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~L~-------N-LsIY~Q~g-----s~RR~--~~d~p--~r~r 315 (357) T protein:vir:20 255 NS--EMLAADVIISQKRIGNLPAVRVPYFPADAMLITKLE-------N-LSIYYMDD-----SHRRV--IEENP--KLDR 315 (357) T ss_pred hH--HHHHHHHHHHhhhhCCceeEEccccCCCceEEeecc-------c-cEEEEecC-----cEEEE--EEecc--cccc Confidence 21 1101 11135789999999999998866554321 1 12221110 11111 11111 1111 Q ss_pred chhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 275 LGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 275 ~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) --++..+| -+|..+.+--+..+.. +.+...++.+ T Consensus 316 iE~y~s~N-e~YvVEd~~~~a~iE~--i~~~~~~~p~ 349 (357) T protein:vir:20 316 VENYESMN-IDYVVEDYAAGCLVEK--IKVGDFSTPA 349 (357) T ss_pred ccchhhhc-ceeeeeccccEEEeee--eeeccccCCc Confidence 11222233 2233333322333321 1111111111 No 215 >protein:vir:102823 Length: 470 # NCBI annotation: major structural protein # Family: family:all:2450 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874086;genbank:gi:118197693;genbank:GeneID:4496015 Probab=89.23 E-value=0.028 Score=29.13 Aligned_cols=289 Identities=17% Similarity=0.121 Sum_probs=122.2 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhh--cceeecCCCceEEEEEeC-C--ceeEEeecCccccccccceeEEEE Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARL--SMAEPQEFGEQQYMTLTA-P--PRGEVVGEGAQKSESTATFAPVTA 75 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l--~~~~~~~~~~~~~p~~~~-~--~~a~~v~Eg~~~~~~~~~~~~v~l 75 (311) =+++.+|+.+--+.+.+++..+......+.-+ ....+..+-.-+|-.... + .....+.|++-.+.+++.+.+... T Consensus 18 ~~a~~~g~AlR~EsLd~~l~~lt~~~~~ftf~~~i~k~~a~STV~ey~~~~~rhG~~g~s~~~E~~l~~~~d~~~~Rr~v 97 (470) T protein:vir:10 18 NAAGQVAESLEREDLEPEVTQLNVLDTPLTDLLSKNAVKAKAYEHEYNVVTARHDKIGYAAFREGGLPRTVEVNVVRRRI 97 (470) T ss_pred HHhhhcchhhhhhhhccceeEeeecCccchhhhhcCCchhhhHhhhhhhhccccccccceeecccccCccCCCceEEEEE Confidence 23444445443333333332222222221111 122333322223322222 2 223356899999999999999999 Q ss_pred eeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCc--------cccccccccccc--ccc Q lcl|Aclame:pro 76 IPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTG--------AALSGSPAKILD--TTN 145 (311) Q Consensus 76 ~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g--------~~~~~~~~~~~~--~~~ 145 (311) ..+=++....+|.-.++--.-...++++.+.+..--.+++.++.++|+|+...+. ....|+.+.+.. ..+ T Consensus 98 ~~K~l~~~~~VT~~a~~~~~n~v~d~~~~~~~dai~~ia~tiE~a~FyGDs~l~s~~~g~~~gleFDGl~~lId~~~~~N 177 (470) T protein:vir:10 98 RPMLVGHRITVTELATRTTQNGVMQIDELVKREKMIAVANEFEYLAFYGDNLLGDDVPGSPNNLQQDGIINIIKRGAPQN 177 (470) T ss_pred EEEEEeecchhhhhhhhhhhccccchHHHHHHHHHHHHHHHHHhhhhhhccccccccCcccCceeccchhhhccCCCCcc Confidence 8888888888886532111112346777777777888999999999999664331 123343332221 112 Q ss_pred ceeeccccccchHHHHHHHHHHH--hhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeeccc Q lcl|Aclame:pro 146 IVELTTGTSATPDLAVEAAVGLV--LGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTV 223 (311) Q Consensus 146 ~~~~~~~~~~~~~~~i~~~~~~~--~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~ 223 (311) +.. ..+.....+.+..+...+ ..+..+++-+.|+..+.+.|..-.....|-+.++.+.. ...|+||- ..+ T Consensus 178 ViD--arG~~Ls~~~L~~aa~~I~~~~~fGt~TD~~lp~~vka~f~~~~~~~qRv~~~~N~~~----~~~G~~v~--~f~ 249 (470) T protein:vir:10 178 VLD--AGGRPLSIDLLWEAESRVVSTQAFANPTAVFISYVDKLNLQASFYQISRVMTTADRRA----GLLGADAQ--SYI 249 (470) T ss_pred ccc--cCCCCccHHHHHHHHhhhcccccccChhhhccchhHHHHHHHhhcCceEEEEecCCCc----eeeeeecc--cee Confidence 221 112223345566666655 34666777899999999999876655556554422211 12333331 000 Q ss_pred ccccccccccccccccccccceEEEeecceEE---EEe------ecCceEEEeccC----C--cccchhhhh-cC--cEE Q lcl|Aclame:pro 224 RGGPEAVTASTGVYRTTNPNVKAIAGDFSAFR---WGV------QVSIPLELIEFG----D--PDGLGDLKR-QN--QIA 285 (311) Q Consensus 224 ~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~---~~~------~~~~~i~~~~~~----~--~~~~~~~f~-~~--~v~ 285 (311) -....+.... ..+..++.... +.. ...++.-++... . .++... |. .+ ... T Consensus 250 sa~G~I~L~~-----------s~~m~~~~k~~p~~l~~~v~~~aAP~~~~tv~~t~~~~a~~~~sk~g~-~~~~~v~sy~ 317 (470) T protein:vir:10 250 GVRGEHSLYP-----------SQFLGDFHKFNPARFGAEVGDFAAPSNSWTVSTTDNFVTLPYNSGLGD-PANTTVYSYA 317 (470) T ss_pred eeeeeeeecc-----------cccccchhhcCcccCCcccCCcccCceeEEeecCCCceeecccCCCCc-ccCcceeEEE Confidence 0000000000 00000000000 000 000000000000 0 000000 00 00 112 Q ss_pred EEEEEEeccEEecccce--------------------------EEEEecccC Q lcl|Aclame:pro 286 IRAEVVYGIGIMSTDAF--------------------------AVVRDADES 311 (311) Q Consensus 286 ~ra~~r~~~~v~~~~a~--------------------------~~l~~aa~~ 311 (311) +.+..+.|-. ++.++ .+..+-+++ T Consensus 318 y~v~~~~gds--~s~~v~vt~t~~~v~kgv~ltI~~~~~v~yv~IYRk~~~s 367 (470) T protein:vir:10 318 FKAANFYGES--AAKYIDVYIDSTEAGKGVRFQFHGLVNVKWLDVYRKDPGS 367 (470) T ss_pred EEEEEecCCC--CcceEEEEEeeehhcceeEEEEecCCCCcEEEEEeecCCC Confidence 2222222222 12222 111111111 No 216 >protein:vir:1663 Length: 393 # NCBI annotation: unknown # Family: family:all:2417 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044952;genbank:gi:9629659;genbank:GeneID:1261309 Probab=88.40 E-value=0.0059 Score=32.80 Aligned_cols=269 Identities=16% Similarity=0.109 Sum_probs=118.9 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCccccccccceeEEEEeeeeE Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPRKV 80 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~kl 80 (311) =+|.+.-...+|..+...|-..+..+.++...+.+...+.--++.. ..+...|...-.|+.+.+...+|..-++.+.-+ T Consensus 114 GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~V~~s-~~s~~eAq~HkdGqTK~eqa~~~~~~Tl~~~~V 192 (393) T protein:vir:16 114 GVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRS-FDSANEAQVHKDGQTKTEQAATLTIDTLEPVMV 192 (393) T ss_pred CcceeccchhccHHHHHHHHHhhhccCcceeeeeeccchhhhHHhh-hhhhhhhhhhccCCccccceeeeeeechhHHHH Confidence 2234444567899998888888888888877665544431111111 122335666778888888888887777766433 Q ss_pred EEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHH-HHHHHHhhhcccCCCccccccccccccccccceee---ccccccc Q lcl|Aclame:pro 81 QVTQRFSQEVKWADESRQLGVLQTMADLSGVALG-RALDLIGIHGINPLTGAALSGSPAKILDTTNIVEL---TTGTSAT 156 (311) Q Consensus 81 ~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia-~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~---~~~~~~~ 156 (311) ...-.+ -|+.++...+...+..++..+|+.+|. +..|.++.-|+|..+-....- ..+-..+... +...... T Consensus 193 Y~~~S~-Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~AlV~GDG~N~f~~~DK----~advK~I~k~Ttkaksagkt 267 (393) T protein:vir:16 193 YKLQSL-AERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDK----EADVKKIKKITTKAKSAGKT 267 (393) T ss_pred HHHHHH-HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhhheecCCCCccchhh----HHHHHHHHHHhhhhhhcCCC Confidence 333233 455555556666778899999999999 889999998865433211111 1111111111 1111222 Q ss_pred -hHHHHHHHHHHHhhcCCCccEEEEcHHHH-HHHHHhhccCC---ceeeccccccCCCceecceeE--Eeeccccccccc Q lcl|Aclame:pro 157 -PDLAVEAAVGLVLGDNLSPDGVALDNTFS-FMLATQRDSQG---RKLYPELGFGTDVASFAGLNA--AVSDTVRGGPEA 229 (311) Q Consensus 157 -~~~~i~~~~~~~~~~~~~~~~~v~n~~~~-~~l~~lkd~~g---~~~~~~~~~~~~~~~l~G~pv--~~~~~~~~~~~~ 229 (311) ..+.+..+..-+.+...+.- .+....+. ..|..++.+.. ..+-+++..-. +--|+.- +.+.. T Consensus 268 pfadaieeavdfvrptagrry-livktedrkalldelrqatananvriknddteia---sevgvdeiivytgs------- 336 (393) T protein:vir:16 268 PFADAIEEAVDFVRPTAGRRY-LIVKTEDRKALLDELRQATANANVRIKNDDTEIA---SEVGVDEIIVYTGS------- 336 (393) T ss_pred chhHHHHHHHhhhccCCCceE-EEEeccchHHHHHHHHhhhccCceeeeccchhhh---hhcCcceeeeeecc------- Confidence 23345555554443322211 33333333 33344443322 22222222111 1112211 11100 Q ss_pred ccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEec Q lcl|Aclame:pro 230 VTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDA 308 (311) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~a 308 (311) .+-+..+++ | ..|.+.+..--.+..+.|-. ..||+.+....-.-....+ |-++++.. T Consensus 337 ----------kalkptvlv-d-qkyhidmqdltkvdafewkt--------nsnmilvetltsghvetyn--agavitvs 393 (393) T protein:vir:16 337 ----------KALKPTVLV-D-QKYHIDMQDLTKVDAFEWKT--------NSNMILVETLTSGHVETYN--AGAVITVS 393 (393) T ss_pred ----------ccccceeee-c-cccccchhhhhhhhhheecc--------CCceEEEeecccCcceeec--cceeEeeC Confidence 011111221 1 12222221111111111110 1233333221111112222 22333322 No 217 >protein:vir:348 Length: 321 # NCBI annotation: major virion structural protein # Family: family:all:3198 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203462;genbank:gi:15320618;genbank:GeneID:921734 Probab=87.70 E-value=0.037 Score=28.43 Aligned_cols=281 Identities=9% Similarity=-0.008 Sum_probs=137.9 Q ss_pred eEcc----------hhHHHHHHHHHHhhchhhhh----cceeecCC-CceEEEEEeC-CceeEEe-ecCcccccccccee Q lcl|Aclame:pro 9 FQLP----------KHLVPGVWQKAQGQSVLARL----SMAEPQEF-GEQQYMTLTA-PPRGEVV-GEGAQKSESTATFA 71 (311) Q Consensus 9 ~~vP----------~~~~~~ii~~~~~~s~l~~l----~~~~~~~~-~~~~~p~~~~-~~~a~~v-~Eg~~~~~~~~~~~ 71 (311) .-+| .+.+.++.+.+-..++|+.. +...+..+ .++..|..-. ..++.|- +|..-...-.-.|. T Consensus 1 mp~~~lsel~t~tl~~rs~~~~D~v~~~n~LL~~L~~kG~~~~~~gg~~I~~~l~y~~~s~~~wy~Gyd~l~~~p~d~~~ 80 (321) T protein:vir:34 1 MPFPNISDIITTTIESRSGVIADNVTKNNAILARLAKRGKPRLVSGGYTILEELSFSGNSNGGWYSGYDVLPTAPQDVIS 80 (321) T ss_pred CCCchHHHHHHHHHHhhcchhhhhhhcccHHHHHHHhcCcccccCCCeeEEEEEeeccCcceeEEEeeeeeccchhhhcc Confidence 1111 11223344444444444433 33344433 3466676655 7788885 44444444456789 Q ss_pred EEEEeeeeEEEEEeecHH-Hhhc-CchhhHHHHHHHHHHHHHHHHHHHHHHhhh-cccCCCcccccccccccc--ccccc Q lcl|Aclame:pro 72 PVTAIPRKVQVTQRFSQE-VKWA-DESRQLGVLQTMADLSGVALGRALDLIGIH-GINPLTGAALSGSPAKIL--DTTNI 146 (311) Q Consensus 72 ~v~l~~~kl~~~i~iS~e-ll~~-s~~~~~~~~~~i~~~la~~ia~~~d~~~l~-G~~~~~g~~~~~~~~~~~--~~~~~ 146 (311) +.++.++..++-+.||-. +++. .....+|+...=.+...+.++.++|..+.. |++ .++....|+...+. .++++ T Consensus 81 ~Aef~wk~aa~~~~isg~e~l~n~g~~~~idll~~~~~~ae~t~~n~l~~~l~sdGTa-~g~~~i~GL~~lv~~~p~tGt 159 (321) T protein:vir:34 81 SAEYALKQYAVPVVISGLEMLQNSGKEAQLDLLEARMNVAEATMANDISAALYGDGTA-FGGRAINGLDGAVPVDPTVGT 159 (321) T ss_pred ccccchhheeEeeEEehhHHhhccchHHHHHHHHHHHHHHHHHHHhhhhHhhhccccc-cccchhhhhhhhcccCCCCce Confidence 999999999998888754 4433 345667777666677778888888888774 542 12333333221111 01111 Q ss_pred ee-------------eccccccchHHHHHHHHH----HHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccc-cCC Q lcl|Aclame:pro 147 VE-------------LTTGTSATPDLAVEAAVG----LVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGF-GTD 208 (311) Q Consensus 147 ~~-------------~~~~~~~~~~~~i~~~~~----~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~-~~~ 208 (311) +. ........+-..+..++. ++.-.+..|+.|++....+...++-.-...|+.-.+... +.. T Consensus 160 vGGIdra~~~~WRn~~~d~~~~~t~~tl~~~m~~~w~~~~Rg~~~PDlii~~~~~y~~y~~s~q~~qR~~~~~~a~~Gf~ 239 (321) T protein:vir:34 160 YGGINRALWPFWRSQVEDMAAVATINTIQPAMTKLWSRCVRGADMPDLIMSGNDAWTTYSNSLQVLQRFTSAEEANLGFR 239 (321) T ss_pred eccccccchhhhhhhhhhhhhcccHHHHHHHHHHHHHhhccCCCCccEEEechHHHHHHHHhhheeeeecccccccccce Confidence 10 000000111122333333 333344468889999999988877555555554332211 111 Q ss_pred CceecceeEEeecccccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEE Q lcl|Aclame:pro 209 VASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRA 288 (311) Q Consensus 209 ~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra 288 (311) .-.+.|..|+..+.+.+. .+..+.+|=|-+.+.+..-++-.+........ .-..++.+.-.. T Consensus 240 ~Lky~~~div~D~~~g~~--------------~pan~~yfiNT~yl~~r~h~~~~~~pi~p~r~----~~~NqdA~~q~I 301 (321) T protein:vir:34 240 SLKFLSTDVVLDGGIGGF--------------AGANTMYFLNTKYLHFRPHKDRNMVPLSPSRR----AAFNQDAEAQIL 301 (321) T ss_pred eeeeeeEEEEEeCCCCCC--------------ccccceeeeecceEEEEEcCCCceeecCcccc----cccchhHHhhhh Confidence 233455555554433221 22335666666655554333333322211110 001222222223 Q ss_pred EEEeccEEecccceEEEEec Q lcl|Aclame:pro 289 EVVYGIGIMSTDAFAVVRDA 308 (311) Q Consensus 289 ~~r~~~~v~~~~a~~~l~~a 308 (311) ..+....+-++.+=.+|+.- T Consensus 302 ~~~GnL~~sn~~~~~vL~~~ 321 (321) T protein:vir:34 302 AWAGNLTCSGAQFQGRLIAE 321 (321) T ss_pred hhhheeeeecccceeEEeeC Confidence 34555566677776666555 No 218 >protein:vir:98856 Length: 343 # NCBI annotation: hypothetical protein # Family: family:all:201 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654732;genbank:gi:109302917;genbank:GeneID:4156061 Probab=87.07 E-value=0.041 Score=28.18 Aligned_cols=284 Identities=10% Similarity=-0.028 Sum_probs=127.1 Q ss_pred Cc--------c-cCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEeec-C---ccccc- Q lcl|Aclame:pro 1 MV--------A-LATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVGE-G---AQKSE- 65 (311) Q Consensus 1 ma--------t-~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~E-g---~~~~~- 65 (311) +| . ..+..+.|.+...+.+.+.+++.|-++++.+.+++..-. .......++..+.-... + +..+. T Consensus 16 ~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~q~~g~v~~~~~sg~~t~r~~t~~~~~~~~~~~ 95 (343) T protein:vir:98 16 AAEYYGANPALALAGKQFSIEAPKESVLLGAIQQRSNFLEKINCVFSERYQRAIDLRSNRKRHYGAHDRRTPIQQRWTRQ 95 (343) T ss_pred HHHHhCCccchhccCceeeecHHHHHHHHHHHHHHHHHhhcCceecchhhcceEEEeecCccccCccccCCCccccccCC Confidence 22 1 122357888889999999999999999999998886422 22222222322222111 1 11111 Q ss_pred -cccceeEEEEeeeeEEEEEeecHHHhhcCchhhHH-HHHHHHHHHHHHHHHHHHHHhhhcccCCCc-cccccc--ccc- Q lcl|Aclame:pro 66 -STATFAPVTAIPRKVQVTQRFSQEVKWADESRQLG-VLQTMADLSGVALGRALDLIGIHGINPLTG-AALSGS--PAK- 139 (311) Q Consensus 66 -~~~~~~~v~l~~~kl~~~i~iS~ell~~s~~~~~~-~~~~i~~~la~~ia~~~d~~~l~G~~~~~g-~~~~~~--~~~- 139 (311) .+....+..+. ..|+-+.| +....+.| +.+.+++.+.++++...-.--+||+..... +.|.+. ..| T Consensus 96 ~~~Y~c~qTn~d-------t~i~Y~~l-D~WA~~~deF~~r~~~~i~~~~ALD~i~IGfNGts~A~~T~nPllqDVN~GW 167 (343) T protein:vir:98 96 VMSMNVSRQIQA-------CLIPWAKL-DQWGHLKDKFASLYAEFVQNQIALDMIKIGFYGTSVGTDTSDPNLADVNKGW 167 (343) T ss_pred CCccEEEEeeee-------eeccHHHH-HHhhcChhHHHHHHHHHHHHHHhhccceecccceeeccCCCCcchhhcchHH Confidence 12222333332 34677766 44555676 888899999998887776667788652211 111111 111 Q ss_pred ------------ccccc---cceeeccc-cccchHHHHHHHHHHHhhcCCC-cc-EEEEcHHHHHHH-HHhhccCCceee Q lcl|Aclame:pro 140 ------------ILDTT---NIVELTTG-TSATPDLAVEAAVGLVLGDNLS-PD-GVALDNTFSFML-ATQRDSQGRKLY 200 (311) Q Consensus 140 ------------~~~~~---~~~~~~~~-~~~~~~~~i~~~~~~~~~~~~~-~~-~~v~n~~~~~~l-~~lkd~~g~~~~ 200 (311) +..++ ...-.+.+ +-...+....++...+.....+ +. +.++.+.....- ..+-..++++-- T Consensus 168 LQ~~Re~ap~rVm~~~~~~~~~~~~G~ggdy~NLDalV~D~~~~I~~~~~~d~dLVvivG~dLla~~~~~l~n~~~~~pt 247 (343) T protein:vir:98 168 IQFVRENKATQILTQGATSGEIRLFGEGADYVNLDELAYDLKQGLDARHRDAGDLVFLVGADLVAKEASLVYKGNGLIAT 247 (343) T ss_pred HHHHHhcchhhhhccceeccceeEecCCCCcccHHHHHHHHHhcCchHHhcCCCEEEEEchhhhhhhhhhhhhhcCCChH Confidence 11111 11111222 2333444444444444333222 22 466666554321 223333333221 Q ss_pred ccccc--cCCCceecceeEEeecccccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhh Q lcl|Aclame:pro 201 PELGF--GTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDL 278 (311) Q Consensus 201 ~~~~~--~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~ 278 (311) ..... -....++-|+|++..+++|.+...++... | .++++=.= ..|+. ++-.+ +.+.--++ T Consensus 248 Ek~Aa~~~~~~k~iGGl~a~~~PfFP~~~llVT~L~-------N-LsIY~Q~g-----s~RR~--~~d~p--~r~rie~y 310 (343) T protein:vir:98 248 EKAALNTHDLMKSFGGMPAMIVPNMPPRAAIVTSLS-------N-LSIYTQEG-----SMRRG--MKDDD--DKKAVRDS 310 (343) T ss_pred HHHHHHHHHHHHhhCCCeeEEccccCCCceEEeecc-------c-cEEEEecC-----cEEEE--EEecc--ccccccch Confidence 11000 11235789999999999998866554321 1 12221110 11111 11111 11111122 Q ss_pred hhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 279 KRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 279 f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ..+| -+|..+.+--+..+..-.|+.-+.+. + T Consensus 311 ~s~N-e~YvVEd~~~~a~iE~i~v~~~~~~g-~ 341 (343) T protein:vir:98 311 YYRN-EAYAVEDCGKFMAVDFTKVKLSSGKG-T 341 (343) T ss_pred hhhc-ceeeeeccccEEEeeeeeeeecCCCC-C Confidence 2232 23333333333333333232222111 2 No 219 >protein:vir:96442 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:11266 # MgeID: mge:1616 # MgeName: 119X # Cross-refs: genbank:acc:YP_001218814;genbank:gi:147917331;genbank:GeneID:5142645 Probab=85.97 E-value=0.049 Score=27.77 Aligned_cols=285 Identities=11% Similarity=0.008 Sum_probs=123.9 Q ss_pred Cc-ccCC-CceEcchhHHHHHHHHHHhhchh-----hhhcceeecCCCceEEEEEeCCceeEEeecCc-------ccccc Q lcl|Aclame:pro 1 MV-ALAT-GTFQLPKHLVPGVWQKAQGQSVL-----ARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGA-------QKSES 66 (311) Q Consensus 1 ma-t~~~-g~~~vP~~~~~~ii~~~~~~s~l-----~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~-------~~~~~ 66 (311) -+ .+.. +...|++.-. +++...+ .....+..+.+..+++-|..++..++-++.|. .++|. T Consensus 69 ta~~~a~~T~i~V~~~~~------f~~~~l~~~~~~~EvirVtsVng~~lTV~RG~~~t~aa~iaag~~~~~ig~~~eEG 142 (418) T protein:vir:96 69 TAEALADATVLTVENSDG------LTKGMIFYNEATGENMRLELVNGLNLTVKRQTGRIAAAIIAANTKLIVIGTAFEEG 142 (418) T ss_pred EEEEecCceEEEecCCcc------cccccEEEEecCCeEEEEEEEeCCEEEEEEccCCeeeeeeecCceEEEeecCcccc Confidence 11 1111 2245554332 2223322 11223445555667777766555444443332 23332 Q ss_pred ccceeEEEEeeeeEEEEEeecHHHhhcCchhhH--------HHHHHHHHHHHHHHHHHHHHHhhhcccCC---Cccc--- Q lcl|Aclame:pro 67 TATFAPVTAIPRKVQVTQRFSQEVKWADESRQL--------GVLQTMADLSGVALGRALDLIGIHGINPL---TGAA--- 132 (311) Q Consensus 67 ~~~~~~v~l~~~kl~~~i~iS~ell~~s~~~~~--------~~~~~i~~~la~~ia~~~d~~~l~G~~~~---~g~~--- 132 (311) .-..+.-..++..+..+..|-+|-++-|.-++. ++....++.|.+. ..+++.++++|..-- ++.. T Consensus 143 sd~~ta~~~k~~~vsN~tQIf~e~vsVSgTAqA~v~qaGvsn~~~~e~d~l~~~-kv~iE~ali~g~~~~~~~ng~p~~~ 221 (418) T protein:vir:96 143 SQRPTARSIQPVYVPNFTQIFRNAWALTDTARASYAEAGYSNITESRRDCMDFH-ATEQETAIFFGQAFMGTYNGQPLHT 221 (418) T ss_pred cccCCcceecceeccchhheehhhhhhhhhhhhhhhhcCcchhHHHHHHHHHHH-HHHHHHhhhccccccCCCCCccccc Confidence 222222223333333444444444433322111 1222224445544 457788888885311 1111 Q ss_pred ccccccccccc--ccceeeccccccchHHHHHHHHHHHhh----cCCCcc----EEEEcHHHHHHHHHhhccCCceeecc Q lcl|Aclame:pro 133 LSGSPAKILDT--TNIVELTTGTSATPDLAVEAAVGLVLG----DNLSPD----GVALDNTFSFMLATQRDSQGRKLYPE 202 (311) Q Consensus 133 ~~~~~~~~~~~--~~~~~~~~~~~~~~~~~i~~~~~~~~~----~~~~~~----~~v~n~~~~~~l~~lkd~~g~~~~~~ 202 (311) ..++..++..- .++.... ......++.+.++...... .+.+.. .+..+++...+|.++-. +-++.-.+ T Consensus 222 t~R~m~gI~~f~~~Nvi~ag-~~~~~t~d~L~~~~~~a~~~g~n~G~~~~~~~y~~~V~a~~k~~I~k~~~-~I~~~~~e 299 (418) T protein:vir:96 222 TQGIVDAIRQYAPDNVNAMP-NPTAVTYDDVVDATIDAFKWSVNVGDNTQRVMFCDTVGMRTMQDIGRFFG-EVTVTQRE 299 (418) T ss_pred ccchhHHHHhhccccccccC-CCCcCCHHHHHHHHHHHHhhcCCCCCcccceEEEEEeChHHHHHHhhhhc-eeEecccc Confidence 12333344333 3333322 2223445555555544322 233332 25678899999987642 22222111 Q ss_pred ccccCCC---ceecce-eEEeecccccccccccccccccccccccceEEEeecceEEEEee--cCceEEEeccCC----- Q lcl|Aclame:pro 203 LGFGTDV---ASFAGL-NAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQ--VSIPLELIEFGD----- 271 (311) Q Consensus 203 ~~~~~~~---~~l~G~-pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~--~~~~i~~~~~~~----- 271 (311) -..+... -+-+|. +++++..+|... .....+++-|...+.+.+- +++.-+...... T Consensus 300 n~~G~vv~~~~Td~G~v~ii~n~~~pad~-------------I~~g~mlVvD~~~vkL~yL~~R~~~~E~l~k~G~~~~~ 366 (418) T protein:vir:96 300 TSYGMVFTEWKFFKGRLIIKEHPLFSAIG-------------ISPGFAVVVDVPAVKLAYMDGRNAKVENYGQGGGENKS 366 (418) T ss_pred ceeceEEEEEEeeccEEEEEecCCCCccc-------------cCcceEEEEecCceEEEEecCCCccchhcccCCCcccc Confidence 1111111 111232 455555555331 2344577888877766555 444433321111 Q ss_pred c-----ccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 272 P-----DGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 272 ~-----~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) . +....-.+.+++ .-.+.+++++|++.++++.---| T Consensus 367 ~~~~~~~~~~~D~~~G~l----~~Eltle~~N~~a~a~itgl~~~ 407 (418) T protein:vir:96 367 GATDYSYGHGVDAQGGSL----TSEWALELLNPQGCAVITGLQKA 407 (418) T ss_pred cccccccccccccccCEE----EEEEEEEeecccccEEeeccccc Confidence 0 000000223333 34677788999999999854444 No 220 >protein:vir:99311 Length: 463 # NCBI annotation: putative capsid protein # Family: family:all:2450 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024474;genbank:gi:48696433;genbank:GeneID:2948039 Probab=85.86 E-value=0.05 Score=27.73 Aligned_cols=295 Identities=10% Similarity=-0.008 Sum_probs=125.8 Q ss_pred Cccc-------CC-CceEcchhHHHHHHHHHHhhchh--hhhcceeecCCCceEEEEEeC---CceeEEeecCccccccc Q lcl|Aclame:pro 1 MVAL-------AT-GTFQLPKHLVPGVWQKAQGQSVL--ARLSMAEPQEFGEQQYMTLTA---PPRGEVVGEGAQKSEST 67 (311) Q Consensus 1 mat~-------~~-g~~~vP~~~~~~ii~~~~~~s~l--~~l~~~~~~~~~~~~~p~~~~---~~~a~~v~Eg~~~~~~~ 67 (311) |.|. .+ |+.+--+.+.++|..+......+ ..-....+..+-.-+|-.... -..+.+++|++.++.++ T Consensus 26 ~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~k~~a~STV~~y~~~~~~G~~g~~~f~~E~g~~~~~d 105 (463) T protein:vir:99 26 FQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSD 105 (463) T ss_pred hhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhheeeeccCccccccccccccccccCC Confidence 3321 12 33444444544444433322222 111122344433233333332 24678999999999999 Q ss_pred cceeEEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCcc------cccccccccc Q lcl|Aclame:pro 68 ATFAPVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGA------ALSGSPAKIL 141 (311) Q Consensus 68 ~~~~~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~------~~~~~~~~~~ 141 (311) +.+.......+=++....+|.-+=.. ....|.++.+.+.....+++.++.++|+|+...+.. ...|+.+.+ T Consensus 106 ~~~~Rr~~~~K~l~~~~~VS~~~~l~--n~~~d~~~~~~~dai~~ia~tiE~a~FyGds~l~~~~~~~gleFDGl~~lI- 182 (463) T protein:vir:99 106 PNIRQKTVSMKYVSDTKNMSIASGLV--NNIADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLI- 182 (463) T ss_pred CceEEEEEEeeeeehhhhhhhHHHhh--cccccHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCcCccccchhhhhhhc- Confidence 99999988888887776666543222 223467788888888999999999999996643332 233333222 Q ss_pred ccccceeeccccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccc-------------cC- Q lcl|Aclame:pro 142 DTTNIVELTTGTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGF-------------GT- 207 (311) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~-------------~~- 207 (311) +.-++... .+....-+++..+-..+..+..+++-+.|+....+.|.+-.-...|-+.++.+. .. T Consensus 183 d~enviDa--rG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~vka~f~~~~l~~qrv~~~~N~~~~~~G~~v~~f~s~~G 260 (463) T protein:vir:99 183 DKNNVINA--KGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLMQDNSGNVNTGYSVNGFYSSRG 260 (463) T ss_pred CCCCeeec--CCCcccHHHHhhhhhhhhcccCChhheecchHHHHHHHHHhcCceEEEEcCCCCceeeeeeccceeeeee Confidence 22222221 122334455667777777777788889999999988875332222222211111 00 Q ss_pred ----CCceecceeEEeecccccccccccccccccccccccc-----eEEEeecceEEEEeec------------------ Q lcl|Aclame:pro 208 ----DVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNV-----KAIAGDFSAFRWGVQV------------------ 260 (311) Q Consensus 208 ----~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~gd~~~~~~~~~~------------------ 260 (311) .+..+++-|-.........+....++........... ..-.+..+......+. T Consensus 261 ~I~L~~s~~m~~~~il~~~~~~~p~ap~~~~~tatv~~~~~~~~~~~~~~a~~~Y~vv~~s~~geS~pS~ivtaT~a~~~ 340 (463) T protein:vir:99 261 FIKLHGSTVMENELILDESLQPLPNAPQPAKVTATVETKQKGAFENEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNVD 340 (463) T ss_pred eeeeCCceecCCcccccchhhcCCCCccCceeEEEEeeccCCCCCCcccccceEEEEEEECCCCCcccchheeeeeeecc Confidence 0111222222211111100110000000000000000 0001111111111000 Q ss_pred -CceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 261 -SIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 261 -~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ++++.+..... ......+-.+.|-+ .++..|..++.-.-+ T Consensus 341 ~gv~l~It~~a~--------~~~~~~~v~IYR~~---~~~g~~~~i~rv~v~ 381 (463) T protein:vir:99 341 DGVKLSINVNAM--------YQQQPQFVSIYRQG---KETGMYFLIKRVPVK 381 (463) T ss_pred ceEEEEEEecCC--------cccceeEEEEEeec---CCCCcceeEEEEEec Confidence 11111110000 00001111111111 011222222211111 No 221 >protein:vir:95603 Length: 463 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240903;genbank:gi:66394965;genbank:GeneID:5132544 Probab=85.86 E-value=0.05 Score=27.73 Aligned_cols=295 Identities=10% Similarity=-0.008 Sum_probs=125.8 Q ss_pred Cccc-------CC-CceEcchhHHHHHHHHHHhhchh--hhhcceeecCCCceEEEEEeC---CceeEEeecCccccccc Q lcl|Aclame:pro 1 MVAL-------AT-GTFQLPKHLVPGVWQKAQGQSVL--ARLSMAEPQEFGEQQYMTLTA---PPRGEVVGEGAQKSEST 67 (311) Q Consensus 1 mat~-------~~-g~~~vP~~~~~~ii~~~~~~s~l--~~l~~~~~~~~~~~~~p~~~~---~~~a~~v~Eg~~~~~~~ 67 (311) |.|. .+ |+.+--+.+.++|..+......+ ..-....+..+-.-+|-.... -..+.+++|++.++.++ T Consensus 26 ~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~k~~a~STV~~y~~~~~~G~~g~~~f~~E~g~~~~~d 105 (463) T protein:vir:95 26 FQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSD 105 (463) T ss_pred hhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhheeeeccCccccccccccccccccCC Confidence 3321 12 33444444544444433322222 111122344433233333332 24678999999999999 Q ss_pred cceeEEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCcc------cccccccccc Q lcl|Aclame:pro 68 ATFAPVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGA------ALSGSPAKIL 141 (311) Q Consensus 68 ~~~~~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~------~~~~~~~~~~ 141 (311) +.+.......+=++....+|.-+=.. ....|.++.+.+.....+++.++.++|+|+...+.. ...|+.+.+ T Consensus 106 ~~~~Rr~~~~K~l~~~~~VS~~~~l~--n~~~d~~~~~~~dai~~ia~tiE~a~FyGds~l~~~~~~~gleFDGl~~lI- 182 (463) T protein:vir:95 106 PNIRQKTVSMKYVSDTKNMSIASGLV--NNIADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLI- 182 (463) T ss_pred CceEEEEEEeeeeehhhhhhhHHHhh--cccccHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCcCccccchhhhhhhc- Confidence 99999988888887776666543222 223467788888888999999999999996643332 233333222 Q ss_pred ccccceeeccccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccc-------------cC- Q lcl|Aclame:pro 142 DTTNIVELTTGTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGF-------------GT- 207 (311) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~-------------~~- 207 (311) +.-++... .+....-+++..+-..+..+..+++-+.|+....+.|.+-.-...|-+.++.+. .. T Consensus 183 d~enviDa--rG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~vka~f~~~~l~~qrv~~~~N~~~~~~G~~v~~f~s~~G 260 (463) T protein:vir:95 183 DKNNVINA--KGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLMQDNSGNVNTGYSVNGFYSSRG 260 (463) T ss_pred CCCCeeec--CCCcccHHHHhhhhhhhhcccCChhheecchHHHHHHHHHhcCceEEEEcCCCCceeeeeeccceeeeee Confidence 22222221 122334455667777777777788889999999988875332222222211111 00 Q ss_pred ----CCceecceeEEeecccccccccccccccccccccccc-----eEEEeecceEEEEeec------------------ Q lcl|Aclame:pro 208 ----DVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNV-----KAIAGDFSAFRWGVQV------------------ 260 (311) Q Consensus 208 ----~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~gd~~~~~~~~~~------------------ 260 (311) .+..+++-|-.........+....++........... ..-.+..+......+. T Consensus 261 ~I~L~~s~~m~~~~il~~~~~~~p~ap~~~~~tatv~~~~~~~~~~~~~~a~~~Y~vv~~s~~geS~pS~ivtaT~a~~~ 340 (463) T protein:vir:95 261 FIKLHGSTVMENELILDESLQPLPNAPQPAKVTATVETKQKGAFENEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNVD 340 (463) T ss_pred eeeeCCceecCCcccccchhhcCCCCccCceeEEEEeeccCCCCCCcccccceEEEEEEECCCCCcccchheeeeeeecc Confidence 0111222222211111100110000000000000000 0001111111111000 Q ss_pred -CceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 261 -SIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 261 -~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ++++.+..... ......+-.+.|-+ .++..|..++.-.-+ T Consensus 341 ~gv~l~It~~a~--------~~~~~~~v~IYR~~---~~~g~~~~i~rv~v~ 381 (463) T protein:vir:95 341 DGVKLSINVNAM--------YQQQPQFVSIYRQG---KETGMYFLIKRVPVK 381 (463) T ss_pred ceEEEEEEecCC--------cccceeEEEEEeec---CCCCcceeEEEEEec Confidence 11111110000 00001111111111 011222222211111 No 222 >protein:vir:78777 Length: 358 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285647;genbank:gi:148727153;genbank:GeneID:5220125 Probab=84.96 E-value=0.056 Score=27.43 Aligned_cols=289 Identities=9% Similarity=0.004 Sum_probs=130.3 Q ss_pred Ccc-------cCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEeecCccccccccceeE Q lcl|Aclame:pro 1 MVA-------LATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVGEGAQKSESTATFAP 72 (311) Q Consensus 1 mat-------~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~ 72 (311) +|- ..+-.+.|.+.+.+.+.+.+++.|-++++.+++++..-. ..+-.-.+++-+.-+..+... ....++. T Consensus 20 ~A~~ngv~~~~~~~~Fsv~p~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrt~tr~~~--~~~~l~~ 97 (358) T protein:vir:78 20 LAKAYGIDISKLDKQFSVTGPVETTLRSALLASVEFLGLITCLDVDQIKGQVVQVGVGQLYTGRKKGGRFK--GKVGVDG 97 (358) T ss_pred HHHHhCCChhHccceeeeChHHHHHHHHHHHHHHHHhhcCcccccccceeeEEeecCCcccceecCCCccc--cccccCC Confidence 221 123467888889999999999999999999998887533 233333344444443332222 2222333 Q ss_pred EEEeeeeEEEEEeecHHHhhcCchhh---HHHHHHHHHHHHHHHHHHHHHHhhhcccCCCcccccccc------ccc--- Q lcl|Aclame:pro 73 VTAIPRKVQVTQRFSQEVKWADESRQ---LGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSP------AKI--- 140 (311) Q Consensus 73 v~l~~~kl~~~i~iS~ell~~s~~~~---~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~------~~~--- 140 (311) -.+..++.---..|+-+.| +.+..+ .++.+.+++.+.++++...-.--+||+.....+.+.-.| .|+ T Consensus 98 ~~Y~c~qTn~dt~i~Y~~l-D~WA~f~~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~ 176 (358) T protein:vir:78 98 NTYELTETDSCASLDWATL-CTWANAGSEGEFIKLVGEFVNKAFALDMLRVGWNGVSAADDTDPTANPLGQDVNKGWHQL 176 (358) T ss_pred CccEEEEeceeeeccHHHH-HHHHhCCChhHHHHHHHHHHHHHHhhccceecccceeeccCCChhhCcCccccchHHHHH Confidence 3333333333345666666 333322 268889999999988877766777886532222221111 111 Q ss_pred ----------cc--cccceeecc---ccccchHHHHHHHHH-HHhhcCCC-cc-EEEEcHHHHH-HHHHhhccCCceeec Q lcl|Aclame:pro 141 ----------LD--TTNIVELTT---GTSATPDLAVEAAVG-LVLGDNLS-PD-GVALDNTFSF-MLATQRDSQGRKLYP 201 (311) Q Consensus 141 ----------~~--~~~~~~~~~---~~~~~~~~~i~~~~~-~~~~~~~~-~~-~~v~n~~~~~-~l~~lkd~~g~~~~~ 201 (311) .. .+..+..+. ++....+....+++. .+.....+ +. +.++.+.... .-.+|-...+.|- . T Consensus 177 ~Re~a~~~v~~~~~~~~~i~ig~g~~Gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~pT-E 255 (358) T protein:vir:78 177 AREWKGGSQIIKAAAGEKIYFDPDGKGEYKTLDEMASDLINTTIDPLFQQDPRLVVLVGTDLVAAAQAKLYSEATKPS-E 255 (358) T ss_pred HHhhchhhhhccccccCceeecCCCCCccccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhhHhhcCCCcH-H Confidence 11 111121221 223333444444432 33332222 22 5666665553 2222322222221 0 Q ss_pred cccccCCCceecceeEEeecccccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhc Q lcl|Aclame:pro 202 ELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQ 281 (311) Q Consensus 202 ~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~ 281 (311) -.....-..++-|+|++..+++|.+...++... | .++++=.= ..|+. ++-.+ +.+.--++..+ T Consensus 256 ~~Aa~~i~k~iGGlpa~~~PfFP~~~ilVT~L~-------N-LsIY~Q~g-----s~RR~--~~d~p--~r~riE~y~s~ 318 (358) T protein:vir:78 256 QIAAQQLAKSIAGRKAYIPPFFPGKRMVVTTLD-------N-LHCYTQRG-----TRKRK--ADDNQ--DSKSFDNQYWR 318 (358) T ss_pred HHHHHHHHHHhCCCeEEEccccCCCceEEeecc-------c-cEEEEecC-----cEEEE--EEecc--ccccccchhhh Confidence 001111125789999999999998866554321 1 12221110 11111 11111 11111122222 Q ss_pred CcEEEEEEEEeccEEecccceEEEE--------ecccC Q lcl|Aclame:pro 282 NQIAIRAEVVYGIGIMSTDAFAVVR--------DADES 311 (311) Q Consensus 282 ~~v~~ra~~r~~~~v~~~~a~~~l~--------~aa~~ 311 (311) | -+|..+.+--+..+..-.|..-. ..+++ T Consensus 319 N-e~YvVEd~~~~a~iE~i~v~~~~~pa~~~~~~~~~~ 355 (358) T protein:vir:78 319 M-EGYALGEHKAYGGFEEADIEIGADPAVLAVEAAAQA 355 (358) T ss_pred c-ceeeeeccccEEEEeeeeeeeCCCCCccccCCcccc Confidence 2 22333333333333322222111 01111 No 223 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=84.47 E-value=0.06 Score=27.27 Aligned_cols=280 Identities=10% Similarity=-0.036 Sum_probs=131.0 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcc--eeecCCCceEEEEEeCCcee-EEeecCccccccccceeEEEEee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSM--AEPQEFGEQQYMTLTAPPRG-EVVGEGAQKSESTATFAPVTAIP 77 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~--~~~~~~~~~~~p~~~~~~~a-~~v~Eg~~~~~~~~~~~~v~l~~ 77 (311) |+--- -+.++..+.+.++..+.-..|.. ..-.++.+++||+.+...-. +-.+.|-..+.-+.++.+.++.. T Consensus 1 Main~------a~~~~~~Ld~~~~~~~~t~~l~~~~~~~~ggktVkI~~i~~~gl~DY~R~~g~~~g~v~~~~et~tl~q 74 (290) T protein:vir:78 1 MAINY------VDKYGKELDQKLVFGTYTNELETPNLLWLDAKTFKIQTITTTGLKAHTRNKGYNEGSASNTNKSYTIDF 74 (290) T ss_pred CchhH------HHHHHHHHHHHHHhhheeeeccccceeeccCCEEEEeeeccCcccccccCCCcccCccccceeeEEeec Confidence 55322 15688888888877766555543 33444456999998753322 22333333333345556666655 Q ss_pred eeEEEE-EeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccccccc Q lcl|Aclame:pro 78 RKVQVT-QRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSAT 156 (311) Q Consensus 78 ~kl~~~-i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (311) .+--.+ +.-.+. +.......+...+.+...+.++-.+|...+.---...+ ..+.....+.+... T Consensus 75 dR~~~F~vD~~Dv---DEt~~~~~~~nv~~ef~~~~v~PEiDayr~skla~~a~------------~~~~~~~~t~t~~n 139 (290) T protein:vir:78 75 DRDVEFFVDVMDV---DETGQALSAANVTKEFNSRHAGPEMDAYRFSKLATAAK------------TNSNSVAEEITKDN 139 (290) T ss_pred cccceeeccccch---hHHhhhhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhhh------------ccCcccccccCHHH Confidence 443322 111110 00011234455666677777777888765521000000 00011111223446 Q ss_pred hHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCcee---eccccccCCCceecceeEEeec---ccccccccc Q lcl|Aclame:pro 157 PDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKL---YPELGFGTDVASFAGLNAAVSD---TVRGGPEAV 230 (311) Q Consensus 157 ~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~---~~~~~~~~~~~~l~G~pv~~~~---~~~~~~~~~ 230 (311) .++.++.++.++...+..+-.++++|.....|.+.+.-+...- +......+..+++.|++|+... .+... -.. T Consensus 140 ~~~~i~~~~~~ldevp~~~rvl~vtp~~~~lL~~~~~f~r~~~~~~~~~~~i~~~V~~idG~~ii~vps~~r~~t~-~~f 218 (290) T protein:vir:78 140 VFTKLKAAIRKVKKYGTQNLVMYVSPDVMAALELSDDFVRAINVQNIGPSSIETRITAIDGTRIVEVEAEDRFYDT-FDF 218 (290) T ss_pred HHHHHHHHHHHHHhcCCCCeEEEECHHHHHHHhhChhhhccccccccccccccceeeeecCcEEEEecccchhhhh-hhh Confidence 7888888888887765444457889999988865432221110 0112224456899999986421 21100 000 Q ss_pred cccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccc Q lcl|Aclame:pro 231 TASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADE 310 (311) Q Consensus 231 ~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~ 310 (311) .+.. .....+-+...++...+ ..+...+.-.+++++- +..... +.-.+.-+.++|.=|.+.+.=.+....+= T Consensus 219 ~~G~-~~~~~ak~in~ii~~~~-a~i~~~K~~~~~~~~P-~~~~~~-----d~~~~~~r~y~d~~v~~nk~~~i~~~~~~ 290 (290) T protein:vir:78 219 TDGY-KPAAGAKKLNFLLVNKG-SVVGGAKHASIYLHAP-GSVGQG-----DGWLYQYRVYHDIFVLDQQKDGVIASTEV 290 (290) T ss_pred cccc-cccCCccceeEEEEcCC-ceeeeeeeeEEEeeCC-CCCcCc-----ceeeeeeeeeeeeeeeccccCeeEEEeeC Confidence 0000 00111111223333332 2333444445555432 111111 22233445678877777655333333333 No 224 >protein:vir:103463 Length: 521 # NCBI annotation: major head subunit precursor # Family: family:all:364 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803115;genbank:gi:116326395;genbank:GeneID:4405492 Probab=82.25 E-value=0.079 Score=26.64 Aligned_cols=280 Identities=13% Similarity=0.109 Sum_probs=119.7 Q ss_pred Ccc-cCCCceEcchhHHHHHHHH---HHhhchhhhhcceeecCCCceEEE-------EEeCC------------ceeEE- Q lcl|Aclame:pro 1 MVA-LATGTFQLPKHLVPGVWQK---AQGQSVLARLSMAEPQEFGEQQYM-------TLTAP------------PRGEV- 56 (311) Q Consensus 1 mat-~~~g~~~vP~~~~~~ii~~---~~~~s~l~~l~~~~~~~~~~~~~p-------~~~~~------------~~a~~- 56 (311) ++. .++|.+. .+.+.++.. +-+..+..+++.+.||.+++.-|. ..... +++.| T Consensus 79 i~es~~t~~v~---~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~g~eaf~~~~~ada~fS 155 (521) T protein:vir:10 79 IAAGQTSGAVT---QIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPIAAGAKEAFHPMYGPDAMFS 155 (521) T ss_pred ccccccccccc---cCCchhhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeccCCccccccccccchhcccccccc Confidence 321 1222221 233333434 345666777788878776542211 10000 00000 Q ss_pred --------------------------------------------------------------------eecC-------- Q lcl|Aclame:pro 57 --------------------------------------------------------------------VGEG-------- 60 (311) Q Consensus 57 --------------------------------------------------------------------v~Eg-------- 60 (311) ++++ T Consensus 156 G~~~at~~s~~~~~~~~~~Gd~~~~~~~~~g~~~~~~~~~~t~~~t~~d~~~~~~~~~~~~~~~~~y~~~~GmsTa~aEa 235 (521) T protein:vir:10 156 GQGAAKKFAALAASTQTTVGDIYTHFFQDTGTVYLQASAQVTISSTADDAAKLDAEIKKQMEAGALVEIAEGMATSIAEL 235 (521) T ss_pred ccccccccccccccccccccccccccccccccceecccccccCCCcccccccccccccccccccceeecccccchhhHhh Confidence 0010 Q ss_pred ---------ccccccccceeEEEEeeeeEEEEEeecHHHhhcCch-hhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCc Q lcl|Aclame:pro 61 ---------AQKSESTATFAPVTAIPRKVQVTQRFSQEVKWADES-RQLGVLQTMADLSGVALGRALDLIGIHGINPLTG 130 (311) Q Consensus 61 ---------~~~~~~~~~~~~v~l~~~kl~~~i~iS~ell~~s~~-~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g 130 (311) .+.++-..+++.+++.++.-+-...+|-||.||--. -.+|.+++|..-|+..|...|++.+|.=.+-..- T Consensus 236 l~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~ 315 (521) T protein:vir:10 236 QESFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQ 315 (521) T ss_pred hccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhheee Confidence 112223333444444444444455789999987543 2578899999999999999999999843211100 Q ss_pred cccccc------cccccccccceeeccccccc-----hHHHHHHHHHHHh--hcCCCccEEEEcHHHHHHHHHhh----- Q lcl|Aclame:pro 131 AALSGS------PAKILDTTNIVELTTGTSAT-----PDLAVEAAVGLVL--GDNLSPDGVALDNTFSFMLATQR----- 192 (311) Q Consensus 131 ~~~~~~------~~~~~~~~~~~~~~~~~~~~-----~~~~i~~~~~~~~--~~~~~~~~~v~n~~~~~~l~~lk----- 192 (311) -+..+. ..|+.+..........-... .+--+......+. ...+..+.++++++....|...- T Consensus 316 ~~~~g~t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L~~~~~~~~~ 395 (521) T protein:vir:10 316 VGKSGMTLTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISY 395 (521) T ss_pred eeeeeeeeccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEchHHHHHHhhccccccc Confidence 111111 12322222111111111000 1111222222222 23356677999999988887521 Q ss_pred ccCC-ceeeccccccCC-Ccee-cceeEEeecccccccccccccccccccccccceEEEeecceE----EEEeecCceEE Q lcl|Aclame:pro 193 DSQG-RKLYPELGFGTD-VASF-AGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAF----RWGVQVSIPLE 265 (311) Q Consensus 193 d~~g-~~~~~~~~~~~~-~~~l-~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~----~~~~~~~~~i~ 265 (311) .++| ..-|..+.++.. .|.| .|++|++..+.+.+ .+++|-.... .+.+..-+.+. T Consensus 396 ~~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~d------------------y~~vG~KG~~~~~~glfyaPYv~l~ 457 (521) T protein:vir:10 396 AAQGLATGFNTDTTKSVFAGVLGGKYRVYIDQYAKQD------------------YFTVGYKGPNEMDAGIYYAPYVALT 457 (521) T ss_pred ccccccccccccCCCceEEEEecCceEEEecCCCCcc------------------eEEEEEeCCcccccceeeccccccc Confidence 1111 122433333322 2444 45688888776543 2222221100 01111112222 Q ss_pred EeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 266 LIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 266 ~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ..+-.+++. || -.+.++ .|++..+ +| |+.-..-+++ T Consensus 458 ~~~~~dp~s----fq-P~~g~~--tRY~l~~-NP--~~~~~~~~~~ 493 (521) T protein:vir:10 458 PLRGSDPKN----FQ-PVMGFK--TRYGIGI-NP--FAESAAQAPA 493 (521) T ss_pred cccccCCcc----cc-ceeeee--eeeceee-cC--cccccCCccc Confidence 222233332 43 234443 4666543 44 2222211111 No 225 >protein:vir:106286 Length: 534 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944113;genbank:gi:38640157;genbank:GeneID:2658034 Probab=76.07 E-value=0.14 Score=25.27 Aligned_cols=280 Identities=12% Similarity=0.097 Sum_probs=115.1 Q ss_pred Ccc-cCCCceEcchhHHHHHHHH---HHhhchhhhhcceeecCCCceEEE--E--E---eC---C---------ceeEEe Q lcl|Aclame:pro 1 MVA-LATGTFQLPKHLVPGVWQK---AQGQSVLARLSMAEPQEFGEQQYM--T--L---TA---P---------PRGEVV 57 (311) Q Consensus 1 mat-~~~g~~~vP~~~~~~ii~~---~~~~s~l~~l~~~~~~~~~~~~~p--~--~---~~---~---------~~a~~v 57 (311) .+. .+++.+ +.+.+.++.. +-+..+..+++.+.||.+++--|. | . +. + +++.|- T Consensus 87 ia~s~~s~~v---~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~n~~~~~s~~EAf~ne~~adt~fS 163 (534) T protein:vir:10 87 IASGETSGSI---TNVGPAVMGLVRRAIPQLIAFDICGVQPMTSSTGQVFTLRAIYGGNSQDANAREAFHPTYGPDADFS 163 (534) T ss_pred cccccccccc---ccccchhhhHHHHHHHhhhhhhhheeccCCchhhhheeeeeeecCCCCCcccccccccccccccccc Confidence 221 111111 1122333333 345666777777777776542211 0 0 00 0 000010 Q ss_pred -------------------------------------------------------------------------------- Q lcl|Aclame:pro 58 -------------------------------------------------------------------------------- 57 (311) Q Consensus 58 -------------------------------------------------------------------------------- 57 (311) T Consensus 164 G~~~a~~~~~~~~~~a~~~g~~~~~~~~~~t~~~~Gt~~~~~~~~~~v~~~~~~~~~ag~~~~~~~~~~~~y~~~~gm~T 243 (534) T protein:vir:10 164 GRGAAQDIAVFVRGTAVASGAFAKLHIEAATGVQAGTKTVQFIKDYAVDALPADQTEAGLAYKWLLANGYAVETSSAMAT 243 (534) T ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccccccCCccccccccccccccccceecccccch Confidence Q ss_pred --ec-----C----ccccccccceeEEEEeeeeEEEEEeecHHHhhcCch-hhHHHHHHHHHHHHHHHHHHHHHHhhhcc Q lcl|Aclame:pro 58 --GE-----G----AQKSESTATFAPVTAIPRKVQVTQRFSQEVKWADES-RQLGVLQTMADLSGVALGRALDLIGIHGI 125 (311) Q Consensus 58 --~E-----g----~~~~~~~~~~~~v~l~~~kl~~~i~iS~ell~~s~~-~~~~~~~~i~~~la~~ia~~~d~~~l~G~ 125 (311) +| + .++++-..+++.+++.++.-+-...+|-||.||--. -.+|.+++|..-|+..|..+|++.+|.-. T Consensus 244 a~AE~lg~~ggs~~~~f~EMsFsIdKvtVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELsNILSTEImlEINReii~~l 323 (534) T protein:vir:10 244 AFAELQQGFNGSADNEWNEMSFRIDKQVVEAKSRQLKAQYSIEMAQDLRAVHGLDADSELSSILANEIMHEINREMVLWI 323 (534) T ss_pred hhHhhhccCCCCcccchhhcceEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHH Confidence 01 0 112233344455555444444455789999987543 24788899999999999999999988642 Q ss_pred cC--CCcccc----ccccccccccccceeeccccccchHHHHHHHHH-------HHh--hcCCCccEEEEcHHHHHHHHH Q lcl|Aclame:pro 126 NP--LTGAAL----SGSPAKILDTTNIVELTTGTSATPDLAVEAAVG-------LVL--GDNLSPDGVALDNTFSFMLAT 190 (311) Q Consensus 126 ~~--~~g~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~-------~~~--~~~~~~~~~v~n~~~~~~l~~ 190 (311) .. ..+... .+...|+.+.........+-. ..+-+..++. .+. ......+.++++++....|.. T Consensus 324 ~~~a~~~k~~~~~~~~~~~G~~d~~~~~~~~~~~~--~~e~~~~L~~~i~~~an~i~~~T~rg~~n~~v~S~~Va~~L~~ 401 (534) T protein:vir:10 324 NATAKVGKTGWTNMHGGKAGVFDFQDTKDIRGARW--AGESYKALVVQIDKEANEIARQTGRGQGNFIICSRNVAAALGH 401 (534) T ss_pred hhhhheeecccccccccccceeeeeccccccchhH--HHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchhHHHHHhh Confidence 21 111110 001112222211111000000 1111222222 221 122356679999999988854 Q ss_pred h--hc---cCCcee-eccccccC-CCcee-cceeEEeecccccccccccccccccccccccceEEEeecceE----EEEe Q lcl|Aclame:pro 191 Q--RD---SQGRKL-YPELGFGT-DVASF-AGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAF----RWGV 258 (311) Q Consensus 191 l--kd---~~g~~~-~~~~~~~~-~~~~l-~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~----~~~~ 258 (311) . .+ ..|... ...+.++. ..|.| .|++|++..+.+.. .+++|-.... .+.+ T Consensus 402 ~g~l~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d------------------y~~vG~KG~~~~~~glfy 463 (534) T protein:vir:10 402 TDMLMTPAVMGANTTMNTDTTSSLFAGVLAGKYRVYIDQYAVED------------------YFTVGYKGASEMDAGLYY 463 (534) T ss_pred ccchhccccccccccccccCCCceEEEEecCceEEEecCCCCcc------------------eEEEEEeCCcccccceee Confidence 1 11 011100 11111111 12444 45688888776643 2222221100 0111 Q ss_pred ecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecc-------cceEEEEecccC Q lcl|Aclame:pro 259 QVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMST-------DAFAVVRDADES 311 (311) Q Consensus 259 ~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~-------~a~~~l~~aa~~ 311 (311) ..-+.+...+..+++. || -.+.++ .|++..+ +| +-+.++...+.- T Consensus 464 aPYv~l~~~~~~dp~s----fq-P~~g~~--tRY~l~~-NP~~~~~~~~~~~~i~~g~~~ 515 (534) T protein:vir:10 464 CPYVALTPLRGTDPKN----FQ-PVLGFK--TRYGVKL-HPMADATQNKGFAKISNGMPQ 515 (534) T ss_pred ccccccccccccCCcc----cc-ceeeee--eeeceee-cCcccccCCccccccccCCcc Confidence 1122222222233332 43 234443 4666543 33 112233322111 No 226 >protein:vir:3746 Length: 336 # NCBI annotation: orf15 # Family: family:all:201 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043487;genbank:gi:9628622;genbank:GeneID:1261135 Probab=75.57 E-value=0.15 Score=25.18 Aligned_cols=281 Identities=13% Similarity=0.018 Sum_probs=123.5 Q ss_pred CcccCC-CceEcchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEeecCccccccccceeEEEEeee Q lcl|Aclame:pro 1 MVALAT-GTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPR 78 (311) Q Consensus 1 mat~~~-g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~ 78 (311) -+..+. -.+.|.+...+.+.+.+++.|-++++.+.+++..-. ..+-.-.+++-+.-+.-+ ..| .+..++.-.+..+ T Consensus 21 ~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtdt~-R~~-~~~~l~~~~Y~c~ 98 (336) T protein:vir:37 21 LDSVLRGESFALKAPEAALLGENIQQRSDFLKQINMIQVAHTKGQKLFGATEKGVTGRKQTG-RNL-ANLDHTQNGFELA 98 (336) T ss_pred hhhhccCceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEeeeccCcccccccCCC-ccc-cccCcCCcccEEE Confidence 122222 258888899999999999999999999999887533 233333333333322211 111 1123333444444 Q ss_pred eEEEEEeecHHHhhcCchhhHHHH-HHHHHHHHHHHHHHHHHHhhhcccCCCc-cccccc--cccc-------------- Q lcl|Aclame:pro 79 KVQVTQRFSQEVKWADESRQLGVL-QTMADLSGVALGRALDLIGIHGINPLTG-AALSGS--PAKI-------------- 140 (311) Q Consensus 79 kl~~~i~iS~ell~~s~~~~~~~~-~~i~~~la~~ia~~~d~~~l~G~~~~~g-~~~~~~--~~~~-------------- 140 (311) +.---..|+-+.| +.+..+.|.. ..+...+.++|+...-.--+||+..... ..|.+. ..|+ T Consensus 99 qTn~dt~i~y~~L-D~WA~~~df~~~~~~~~~~r~iALD~i~IGfnG~s~A~~TdnPllqDVNkGWlQ~~Re~a~~~v~~ 177 (336) T protein:vir:37 99 ETDSGIIVPWALF-DSFAIFKDRLVELYSEYFQNQVALDILQIGWNGQSVADNTTKADLSDVNKGWLKLLQEQRAANFMT 177 (336) T ss_pred EeeeeeeecHHHH-HHHhcChhHHHHHHHHHHHHHHhhchhhhcccceeeccCCCCCcccccchhHHHHHHhccchhhcc Confidence 4444456777776 3444445543 3334445555555555556688542211 111110 1111 Q ss_pred ---cccccceeecc-ccccchHHHHHHHHHHHhhcCCC-cc-EEEEcHHHHHH-HHHhhccCC-ceeeccccc--cCCCc Q lcl|Aclame:pro 141 ---LDTTNIVELTT-GTSATPDLAVEAAVGLVLGDNLS-PD-GVALDNTFSFM-LATQRDSQG-RKLYPELGF--GTDVA 210 (311) Q Consensus 141 ---~~~~~~~~~~~-~~~~~~~~~i~~~~~~~~~~~~~-~~-~~v~n~~~~~~-l~~lkd~~g-~~~~~~~~~--~~~~~ 210 (311) .....+...+. ++-...++...++...+.....+ +. +.++.+..... ...+-..++ +|- ..... -.... T Consensus 178 ~~~~~~g~i~~~G~~gdy~NLDalV~D~~~~I~~~~~~d~dLVvivG~dLla~~~~~l~~~~~~~Pt-E~~Aa~~~~~~k 256 (336) T protein:vir:37 178 ESTKSSGKITIFGDNADYANLDDLAFDLKQGLDFRHQNRNDLVFLVGADLVSKETKLIQQKHGLTPT-EKAALGSHNLMG 256 (336) T ss_pred cccccCCceEEecCCCCcccHHHHHHHHHhcCchHHhcCCCeEEEEchhhhhhhhhhhhhhcCCCHH-HHHHHHHHHHHH Confidence 11111112122 22333344444455444332222 22 46666644421 122333322 221 00000 11236 Q ss_pred eecceeEEeecccccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEE Q lcl|Aclame:pro 211 SFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEV 290 (311) Q Consensus 211 ~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~ 290 (311) ++-|+|++..+++|.+...++... ..++++-.=+ .|+. ++-.+ +.+.--++..+| T Consensus 257 ~iGGlpa~~~PffP~~~~lVT~L~--------NLsIY~Q~gs-----~RR~--~~d~p--~r~rie~y~s~N-------- 311 (336) T protein:vir:37 257 SFGGMNAITPPNFPARAAAVTTLK--------NLSVYTEAES-----VRRS--LRNDE--DKKGLVTSYYRQ-------- 311 (336) T ss_pred hhCCceeEEccccCCCceEEeech--------hcEEEEecCc-----EEEE--EEEcc--ccccccchhhhc-------- Confidence 789999999999998866554432 1122221111 1111 11111 111111222222 Q ss_pred EeccEEecccceEEEEecc-----cC Q lcl|Aclame:pro 291 VYGIGIMSTDAFAVVRDAD-----ES 311 (311) Q Consensus 291 r~~~~v~~~~a~~~l~~aa-----~~ 311 (311) -|+.|=+..+++.+...+ +- T Consensus 312 -e~YvVEd~~~~a~iE~i~v~~~~e~ 336 (336) T protein:vir:37 312 -EGYVVEDLGLMTAIDHTKVKLNGEV 336 (336) T ss_pred -ceeeeeccccEEEeeeeeeeecCcC Confidence 222333444444333221 11 No 227 >protein:vir:80835 Length: 464 # NCBI annotation: putative major capsid protein # Family: family:all:2450 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504125;genbank:gi:158079312;genbank:GeneID:5666484 Probab=74.88 E-value=0.15 Score=25.05 Aligned_cols=286 Identities=12% Similarity=0.040 Sum_probs=121.3 Q ss_pred CcccCC-----------CceEcchhHHHHHHHHHHhhchh--hhhcceeecCCCceEEEEEeC---CceeEEeecCcccc Q lcl|Aclame:pro 1 MVALAT-----------GTFQLPKHLVPGVWQKAQGQSVL--ARLSMAEPQEFGEQQYMTLTA---PPRGEVVGEGAQKS 64 (311) Q Consensus 1 mat~~~-----------g~~~vP~~~~~~ii~~~~~~s~l--~~l~~~~~~~~~~~~~p~~~~---~~~a~~v~Eg~~~~ 64 (311) |=+..+ |+.+--+.+.++|-.+......+ .+-....+..+-.-+|-.... -..+.+++|++.++ T Consensus 19 ~Ks~ttgy~~~p~~q~~~~AlRrEsL~~~i~~Lt~~~~~f~f~~di~k~~a~STV~~y~~~~~~G~~g~~~f~~E~g~~~ 98 (464) T protein:vir:80 19 IKGFTTGYGITPESQTDAAALRREFLDDQITMLTWADGDLSFYRDITKRPATSTVAKYDVYLAHGRVGHTRFTREIGVAP 98 (464) T ss_pred HHHHHhCCccCcccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhhheeeccCccccccccccccccc Confidence 112222 22333333433443332222211 111222333333233333332 24577999999999 Q ss_pred ccccceeEEEEeeeeEEE--EEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCcc-------cccc Q lcl|Aclame:pro 65 ESTATFAPVTAIPRKVQV--TQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGA-------ALSG 135 (311) Q Consensus 65 ~~~~~~~~v~l~~~kl~~--~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~-------~~~~ 135 (311) .+++.+.+.....+=+.. .+.+-.+|.+. ..|-++.+.+.....+++.++.++|+|+...+.. ...| T Consensus 99 ~~d~~~~Rr~~~~Kfl~~~r~vsia~~lvn~----~~d~~~~~~~dai~~va~tiE~a~FyGds~l~~~~~~~~gleFDG 174 (464) T protein:vir:80 99 ISDPNLRQKTVNMKYVSDTKNMSIATGLVNN----IEDPMRILTDDAISVVAKTIEWASFYGDSDLSENPDAGSGLEFDG 174 (464) T ss_pred cCCCceEEEEEEeeeeecceeeeeehhhhcc----hhhHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCCCccccchhh Confidence 999999987776554433 35555555532 2345566677777889999999999997654432 2333 Q ss_pred ccccccccccceeeccccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHH-HHhhccCCceeeccccccCCCceecc Q lcl|Aclame:pro 136 SPAKILDTTNIVELTTGTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFML-ATQRDSQGRKLYPELGFGTDVASFAG 214 (311) Q Consensus 136 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l-~~lkd~~g~~~~~~~~~~~~~~~l~G 214 (311) +.+.+ +.-++... .+.....+.+..+-..+..+..+++-+.|+......+ ....+.+-+-+. ........| T Consensus 175 l~~lI-~~~NViDa--rG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~v~a~f~n~~l~~q~~~~~-----~n~~~~~~G 246 (464) T protein:vir:80 175 LAKLI-DKHNVLDA--KGASLTEALLNQASVLVGKGYGTPTDAYMPIGVQADFVNQQLDRQVQVIS-----DNGQNATMG 246 (464) T ss_pred hHhhc-CCCceeec--CCCCcCHHHHhhhhhhhhcccCChhhcccchhHHHHHHhhhcCceeEEEc-----CCCCcceee Confidence 33222 23333222 1222345667777777777777888889998888775 444444433331 112223456 Q ss_pred eeEEeecccccccccccccc---cccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhh-cC---cEEEE Q lcl|Aclame:pro 215 LNAAVSDTVRGGPEAVTAST---GVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKR-QN---QIAIR 287 (311) Q Consensus 215 ~pv~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~-~~---~v~~r 287 (311) ++|- ..+-....+....+ ..............+.+. ...++..+.+. +. ..|. ++ ...|+ T Consensus 247 ~~v~--~f~sa~G~i~L~~s~~m~~~~~ld~~~~~~~~apa------apsvt~tv~~~----~~-g~f~~~~~~~~~~Yk 313 (464) T protein:vir:80 247 FNVK--GFNSARGFIRLHGSTVMELEQILDENRMQLPNAPQ------KATVKATLEAG----TK-GKFRDEDLTIDTEYK 313 (464) T ss_pred eecc--cccccccceeccCccccCcccccccccccCCCCcC------CceeEEEecCC----cc-cCCccccccceeEEE Confidence 6652 01110000000000 000000000000111110 01111112111 11 1121 11 12233 Q ss_pred EEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 288 AEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 288 a~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) +...-+.+=--|..++-.+.++-. T Consensus 314 v~~vn~~GeS~ps~~~~~ti~~~~ 337 (464) T protein:vir:80 314 VVVVSDDAESAPSDVASVVIDDKK 337 (464) T ss_pred EEEECCCCccccceeeeeeecCcc Confidence 222211111112111111111100 No 228 >protein:vir:6901 Length: 522 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861877;genbank:gi:32453668;genbank:GeneID:1494303 Probab=74.50 E-value=0.16 Score=24.98 Aligned_cols=279 Identities=13% Similarity=0.070 Sum_probs=110.4 Q ss_pred CcccCCCceEcchhHHHHHHHHHHh-hchhhhhcceee------------------cCCCceEEEEEeCCceeEEeec-- Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQG-QSVLARLSMAEP------------------QEFGEQQYMTLTAPPRGEVVGE-- 59 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~-~s~l~~l~~~~~------------------~~~~~~~~p~~~~~~~a~~v~E-- 59 (311) ..+...+....+-.... ..+.. .....+...... +..+. . +..+..-..-.+| T Consensus 164 ~~~~~~~~~t~~G~~~~---~~~~~~gt~~~~~~a~~t~~~t~~~~~~~~~ai~s~~~~~~--~-y~~g~GmsTa~aEal 237 (522) T protein:vir:69 164 FPALAASTQTKVGDIYT---HFFQETGTVYLQASAQVTISSSADDAAKLDAEIIKQMEAGA--L-VEIAEGMATSIAELQ 237 (522) T ss_pred ccccccccccccccccc---cccccccceeeecccCCcCCCCCcccccccchhcccccccc--c-eeeccccchhhhhhc Confidence 00000000000000000 00000 000000000000 00000 0 0011111111223 Q ss_pred -------CccccccccceeEEEEeeeeEEEEEeecHHHhhcCch-hhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCcc Q lcl|Aclame:pro 60 -------GAQKSESTATFAPVTAIPRKVQVTQRFSQEVKWADES-RQLGVLQTMADLSGVALGRALDLIGIHGINPLTGA 131 (311) Q Consensus 60 -------g~~~~~~~~~~~~v~l~~~kl~~~i~iS~ell~~s~~-~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~ 131 (311) +.++++-..+++.++..++.-+-...+|-||.||--. -.+|.+++|..-|+..|...|++.+|.=.+-..-- T Consensus 238 ~~lggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILSTEImlEINReii~~i~~sa~~ 317 (522) T protein:vir:69 238 EGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQV 317 (522) T ss_pred ccCCCCcccchhhhcceEeeEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhhee Confidence 2345666677777776666666666899999987543 25788999999999999999999998432111101 Q ss_pred ccccc------cccccccccceeecccccc-----chHHHHHHHHHHHhh--cCCCccEEEEcHHHHHHHHHhh-----c Q lcl|Aclame:pro 132 ALSGS------PAKILDTTNIVELTTGTSA-----TPDLAVEAAVGLVLG--DNLSPDGVALDNTFSFMLATQR-----D 193 (311) Q Consensus 132 ~~~~~------~~~~~~~~~~~~~~~~~~~-----~~~~~i~~~~~~~~~--~~~~~~~~v~n~~~~~~l~~lk-----d 193 (311) +..++ ..|+.+.......-++-.. ..+--+......+.. .....+.++++++....|...- . T Consensus 318 ~~~g~t~~~~~~~Gv~Dl~~~~~~~~~rw~~e~~k~L~~~i~~~an~i~~~T~rg~~n~~i~S~~Va~~L~~~~~~~~~~ 397 (522) T protein:vir:69 318 GKSGMTNIVGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYA 397 (522) T ss_pred eccccccccccccceeecccccccccchhHHHHHHHHHHHHHHHHHHHHHhcccccccEEEEchhHHHHHhhcccccccc Confidence 11111 1222222211111000000 011112222222222 2235667999999998886521 1 Q ss_pred cCC-ceeeccccccCC-Ccee-cceeEEeecccccccccccccccccccccccceEEEeecceE----EEEeecCceEEE Q lcl|Aclame:pro 194 SQG-RKLYPELGFGTD-VASF-AGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAF----RWGVQVSIPLEL 266 (311) Q Consensus 194 ~~g-~~~~~~~~~~~~-~~~l-~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~----~~~~~~~~~i~~ 266 (311) ++| ..-|..+.++.. .|.| .|++|++..+.+.+ .+++|-.... .+.+..-..+.. T Consensus 398 ~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~d------------------y~~vG~KG~~~~~~glfyaPYv~l~~ 459 (522) T protein:vir:69 398 AQGLASGFNTDTTKSVFAGVLGGKYRVYIDQYAKQD------------------YFTVGYKGANEMDAGIYYAPYVALTP 459 (522) T ss_pred cccccccccccCCCceEEEEecCceEEEecCCCCcc------------------eEEEEEeCCcccccceeecccccccc Confidence 111 222433333322 2444 45688887776543 2222221110 011122222222 Q ss_pred eccCCcccchhhhhcCcEEEEEEEEeccEEecccc-------eEEEEecc-----cC Q lcl|Aclame:pro 267 IEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDA-------FAVVRDAD-----ES 311 (311) Q Consensus 267 ~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a-------~~~l~~aa-----~~ 311 (311) .+-.+++. || -.+.++ .|++..+ +|=+ -++|.+.+ ++ T Consensus 460 ~~~~dp~s----fq-P~~g~~--tRY~l~v-NP~~~~~~~~~~~ri~~g~p~~~~~~ 508 (522) T protein:vir:69 460 LRGSDPKN----FQ-PVMGFK--TRYGIGV-NPFAESSLQAPGARIQSGMPSILNSL 508 (522) T ss_pred ccccCCcc----cc-ceeeee--eeeceee-cCcccccCCcccceeecccchhhccc Confidence 23233332 43 334443 4666543 3311 12333333 11 No 229 >protein:vir:79078 Length: 307 # NCBI annotation: gp8 # Family: family:all:908 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111208;genbank:gi:134288798;genbank:GeneID:4960752 Probab=73.57 E-value=0.17 Score=24.82 Aligned_cols=276 Identities=10% Similarity=0.013 Sum_probs=117.3 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecCccccc-cccce---eE--EE Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSE-STATF---AP--VT 74 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~-~~~~~---~~--v~ 74 (311) ||+... -..+-+.+.+--+..-.+..+--.+++.++++....+|++.....-..|-.+-+.... ...+| +. .. T Consensus 1 m~~~~~-~~~~dp~LT~~A~gy~n~~~Iad~lfP~vpV~~~~~k~~~f~~e~f~~~~t~ra~~~~~~~v~~~~~~~~~~~ 79 (307) T protein:vir:79 1 MGRLSK-LRIVDPVLTNLAIGYTNAEFIGQTLMPVVEVEKEGGKIPKFGKESFRLYQTERALRAKSNRMNPEDIDSVDVN 79 (307) T ss_pred CCCCCC-CcccCHHHHHHHhhccchhhhhhhcCCcccccccccceeeeccccccccccccccCCCcceeeeecccccccc Confidence 887753 3334444544444443344444556788888877777777632110011111111111 11122 22 22 Q ss_pred EeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHh----hhcccCCCccccccccccccccccceeec Q lcl|Aclame:pro 75 AIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIG----IHGINPLTGAALSGSPAKILDTTNIVELT 150 (311) Q Consensus 75 l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~----l~G~~~~~g~~~~~~~~~~~~~~~~~~~~ 150 (311) +..+-+.. ++-.+ ....+..+.++...+.+.+.|.+..|..+ ++..+- +.. ....+.+ .... T Consensus 80 ~~~~~l~~--~id~r---~~~~~~~~~~~~Av~~l~d~I~l~~E~~~A~l~~~~~~y----~~~--~k~tLsg--t~~W- 145 (307) T protein:vir:79 80 LDEHDLEY--PIDYR---EDQESAFPLEQAAVQTATDAIQLRREKMIADLSQNPSSY----AAG--NKKQLSA--TEKF- 145 (307) T ss_pred ccccchhh--cccch---hcCCCCCCHHHHHHHHHHHHHHhHHHHHHHHHhcccccc----CCC--ceEEEcc--Cccc- Confidence 23333332 33222 12233445555555555555544444433 322110 000 0111111 1112 Q ss_pred cccccchHHHHHHHHHHHh-hcCCCccEEEEcHHHHHHHHH----hhc--cCCceeeccccccCCCceeccee-EEeecc Q lcl|Aclame:pro 151 TGTSATPDLAVEAAVGLVL-GDNLSPDGVALDNTFSFMLAT----QRD--SQGRKLYPELGFGTDVASFAGLN-AAVSDT 222 (311) Q Consensus 151 ~~~~~~~~~~i~~~~~~~~-~~~~~~~~~v~n~~~~~~l~~----lkd--~~g~~~~~~~~~~~~~~~l~G~p-v~~~~~ 222 (311) +....+...+|.....++. ..+.+|+.++|.+..|..|++ ++. ..+..+.. ...-..++|+. |.+-+. T Consensus 146 sd~~sDPi~di~~~~~ai~~~~g~~Pn~~vlg~~a~~~l~~h~~i~~~lk~~~~g~it----~~~la~l~~v~~V~vg~a 221 (307) T protein:vir:79 146 TAANSDPVGVIEDGKEAIRTKIGRRPNTMVIGASAYKTLKAHPQLIEKIKYSMKGIVT----VDLLKEIFEVENIAVGEA 221 (307) T ss_pred CCCCCCcHHHHHHHHHHHHHhhCCccceEEeCHHHHHHHhcCHHHHHHhcCccccccC----HHHHHHHhCceeEEEeee Confidence 2345677788888877765 567889999999999988865 121 22222221 11123455654 333333 Q ss_pred cccccccccccccccccccccceEEE-------e-----ecc-eEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEE Q lcl|Aclame:pro 223 VRGGPEAVTASTGVYRTTNPNVKAIA-------G-----DFS-AFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAE 289 (311) Q Consensus 223 ~~~~~~~~~~~~~~~~~~~~~~~~~~-------g-----d~~-~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~ 289 (311) .-.. ........+.+...... + ..+ +|.. .+++..+ +..+. ...+--.+|+. T Consensus 222 ~y~~-----~~~~~~~iw~~~~~l~y~~~~~~~~~~~~~~ps~Gyt~-~~~g~~~-~d~~~--------~~~~~~~vrv~ 286 (307) T protein:vir:79 222 IYAD-----DKDRFTDIWGANIVLAYVPLQRGGQQRTPYEPSYGYTL-RKKGNPV-VDTRI--------EDGKLELVRAT 286 (307) T ss_pred eeec-----ccccchhcCCCceEEEecccccCCCCCcccccccceeE-EecCceE-Eeccc--------CCCceeEEeec Confidence 2110 00001111111111110 0 001 1111 1112111 11111 11222346677 Q ss_pred EEeccEEecccceEEEEeccc Q lcl|Aclame:pro 290 VVYGIGIMSTDAFAVVRDADE 310 (311) Q Consensus 290 ~r~~~~v~~~~a~~~l~~aa~ 310 (311) ....-.+.-+++=..|+++-. T Consensus 287 ~~~~~~i~~~~~G~li~~~v~ 307 (307) T protein:vir:79 287 DIFRPYLLGADAGYLISGING 307 (307) T ss_pred ccccceeeccccchhhccCCC Confidence 777777777777777776666 No 230 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=72.17 E-value=0.19 Score=24.59 Aligned_cols=266 Identities=11% Similarity=0.068 Sum_probs=116.9 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcc------eeecCCCceEEEEEeC--CceeEEeecCccccccccceeE Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSM------AEPQEFGEQQYMTLTA--PPRGEVVGEGAQKSESTATFAP 72 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~------~~~~~~~~~~~p~~~~--~~~a~~v~Eg~~~~~~~~~~~~ 72 (311) |+-- .-+.+...+.+..+..+....+.+ +...++.+++||+.++ +...+-..-|-....-+.+++. T Consensus 1 Main------~~~k~~~~ld~~~~~~~~~~~l~~~~n~~~~~~~gak~VkIp~ist~~gl~dY~R~~g~~~g~v~~~~et 74 (285) T protein:vir:79 1 MTVV------LDSKDLARIDEEYKADSQVWSYLTGGNGVTQRFRGHNEVRINKLSGFVDATAYKRGQDNARKTISVGKET 74 (285) T ss_pred Ccch------hhHHHHHHHHHHHHHhhhhhhhcccCCcceeEecCCCEEEEeeecccccccccccccCccccccceeeeE Confidence 5432 134566777777776665555532 2344456799999853 3333333333333333445555 Q ss_pred EEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHH-HHHHHHHHHHHHhhhcccCCCccccccccccccccccceeecc Q lcl|Aclame:pro 73 VTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADL-SGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTT 151 (311) Q Consensus 73 v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~-la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~ 151 (311) .++..-+--.+ .| +.+ +.++++.-..+.+..+ ..+..+=.+|.-.|.-- ..........+ T Consensus 75 ~tl~~DR~~~f-~i-D~m--DvdEn~~~~~~ni~~ef~~~~vvPEiDayrfskl---------------a~~a~~~~~~~ 135 (285) T protein:vir:79 75 VKLTHEDWFGY-DL-DQF--DMDENGAYTVENVVREHNKMITIPHRDKVAVQKL---------------FDSAAKKATDS 135 (285) T ss_pred EEeecccccee-cc-ccc--chhhhhhhhHHHHHHHHHhhhhcchhhHHHHHHH---------------Hhhcccccccc Confidence 55544322111 01 000 0011111112222222 23333345554444210 00000001112 Q ss_pred ccccchHHHHHHHHHHHhhcCCCcc-EEEEcHHHHHHHHHhhccCCceeec-c---ccccCCCceecc-eeEEe--eccc Q lcl|Aclame:pro 152 GTSATPDLAVEAAVGLVLGDNLSPD-GVALDNTFSFMLATQRDSQGRKLYP-E---LGFGTDVASFAG-LNAAV--SDTV 223 (311) Q Consensus 152 ~~~~~~~~~i~~~~~~~~~~~~~~~-~~v~n~~~~~~l~~lkd~~g~~~~~-~---~~~~~~~~~l~G-~pv~~--~~~~ 223 (311) .+....++.++.++.++...+...+ .++++|..+..|.+-+.-+...-.. . .......++|.| +|++. ++++ T Consensus 136 ~T~~nv~~~i~~~~~~lde~~vp~~rvl~vTp~~~~~Lk~s~~~~r~~~~~~~~~~~~i~~~V~~lDg~v~ii~Vps~r~ 215 (285) T protein:vir:79 136 ITKDNALDAYDTAEAYMFDNEVPGGFVMFVSSAYYTALKQSAAVTRTFSTDGTMVINGIDRRVAQLDGGVPIVRVSSDRL 215 (285) T ss_pred cCHHHHHHHHHHHHHHHHHcCCCCceEEEEChHHHHHHHhhhhhheecccccceeccceeeeeccccceeEEEEcchhhc Confidence 2344678889999999988877544 4778899998887654322111110 0 112234678898 88863 3444 Q ss_pred ccccccccccccccccccccceEEEeecceEEEEeecCceEEEe-ccCCcccchhhhhcCcEEEEEEEEeccEEecccce Q lcl|Aclame:pro 224 RGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELI-EFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAF 302 (311) Q Consensus 224 ~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~-~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~ 302 (311) +.. .. ...-..++...+ ..+...+.-.+.++ |... ... |.-.+.-+.++|.=|.+.+.= T Consensus 216 kt~----~~--------~k~Infiiv~~~-a~i~~~K~~~~~~f~P~~~--~~~-----d~~~~~~R~Y~d~fv~~nk~~ 275 (285) T protein:vir:79 216 KGL----GI--------TNHVNFILTPLS-AIAPIVKYDSVSVIDPSTD--RSG-----NRWTIKGLSYYDAIVLDNAKK 275 (285) T ss_pred cCc----Cc--------chhccEEEecCc-eeccceeeeeeEeECCCCC--CCc-----ceeeeeeeeeeeeeehhhccc Confidence 321 00 011122222222 12333333333333 2211 111 112233346777777776443 Q ss_pred EEEEecccC Q lcl|Aclame:pro 303 AVVRDADES 311 (311) Q Consensus 303 ~~l~~aa~~ 311 (311) .+.-.+.++ T Consensus 276 ~Iy~~~~a~ 284 (285) T protein:vir:79 276 GIYVAATAG 284 (285) T ss_pred eeeeeeccc Confidence 333333333 No 231 >protein:vir:94870 Length: 318 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762518;genbank:gi:115304217;genbank:GeneID:5141183 Probab=70.44 E-value=0.18 Score=24.73 Aligned_cols=266 Identities=17% Similarity=0.147 Sum_probs=110.3 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEE-eCCceeEEeecCccccccccceeEEEEeeee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTL-TAPPRGEVVGEGAQKSESTATFAPVTAIPRK 79 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~-~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k 79 (311) =++.+...+.+|..++..|-..+....|+.+.+.+..+ |.+-..+. .+..++.....|+.+++...++.--++.|.- T Consensus 39 gvtitdttfqlprklvesintallntnpvfkvfhvtnv--gallvsrsfdssneaqvhkdgqtkteqaatltidtlepvm 116 (318) T protein:vir:94 39 GVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNV--GALLVSRSFDSSNEAQVHKDGQTKTEQAATLTIDTLEPVM 116 (318) T ss_pred CceeecchhhhHHHHHHhhhhhhccCCcceeeeeehhh--hheeeeccccccchhhhhcccccccccceeeeecccchhH Confidence 22344445567777778887777777887776655544 34444443 3455677778899999888777766666644 Q ss_pred EEEEEeecHHH--hhcCchhhHHHHHHHHHHHHHHHHHHH-HHHhhhcccCCCccccccccccccccccceee---cccc Q lcl|Aclame:pro 80 VQVTQRFSQEV--KWADESRQLGVLQTMADLSGVALGRAL-DLIGIHGINPLTGAALSGSPAKILDTTNIVEL---TTGT 153 (311) Q Consensus 80 l~~~i~iS~el--l~~s~~~~~~~~~~i~~~la~~ia~~~-d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~---~~~~ 153 (311) +...-.+-+.+ +++| ...+-..|..++..+|..++ |.++..|+|.. +.+.+..- .+......+ +... T Consensus 117 vyklqslaervkrlqms---yselynlivaeltqaivnkivdlalvegdgtn---gfksidke-advkkikkittkaksa 189 (318) T protein:vir:94 117 VYKLQSLAERVKRLQMS---YSELYNLIVAELTQAIVNKIVDLALVEGDGTN---GFKSIDKE-ADVKKIKKITTKAKSA 189 (318) T ss_pred HHHHHHHHHHHHHHhhh---HHHHHHHHHHHHHHHHHhhhhheeeeecCCcc---hhhhhchh-hhHHHHHHhhhhhhhc Confidence 43333333322 2233 33455666777777777765 55566664332 22222211 011111111 1111 Q ss_pred ccc-hHHHHHHHHHHHhhcCCCccEEEEcHHHH-HHHHHhhccCC---ceeeccccccCCCceecceeE--Eeecccccc Q lcl|Aclame:pro 154 SAT-PDLAVEAAVGLVLGDNLSPDGVALDNTFS-FMLATQRDSQG---RKLYPELGFGTDVASFAGLNA--AVSDTVRGG 226 (311) Q Consensus 154 ~~~-~~~~i~~~~~~~~~~~~~~~~~v~n~~~~-~~l~~lkd~~g---~~~~~~~~~~~~~~~l~G~pv--~~~~~~~~~ 226 (311) ... ..+.+..+..-+.+...+.- .+....+. ..|..++.+.. ..+-+++..-. +--|+.- +.+.. T Consensus 190 gktpfadaieeavdfvrptagrry-livktedrkalldelrqatananvriknddteia---sevgvdeiivytgs---- 261 (318) T protein:vir:94 190 GKTPFADAIEEAVDFVRPTAGRRY-LIVKTEDRKALLDELRQATANANVRIKNDDTEIA---SEVGVDEIIVYTGS---- 261 (318) T ss_pred CCCchhHHHHHHHhhhccCCCceE-EEEeccchHHHHHHHHhhhcccceEEeccchhhh---hhcCcceeEEeecc---- Confidence 222 23345555544443322211 33443333 33344543322 22222222111 1112211 11111 Q ss_pred cccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEE Q lcl|Aclame:pro 227 PEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVR 306 (311) Q Consensus 227 ~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~ 306 (311) .+.+..+++ | ..|.+.+..--.+..++|-. ..||+.+....-.-.... +|-++++ T Consensus 262 -------------kavkptvlv-d-qkyhidmqdltkvdafewkt--------nsnmilvetltsghvety--nagavit 316 (318) T protein:vir:94 262 -------------KAVKPTVLV-D-QKYHIDMQDLTKVDAFEWKT--------NSNMILVETLTSGHVETY--NAGAVIT 316 (318) T ss_pred -------------ccccceeEe-c-cceecchhhhhhhhceeecc--------CCceEEEEecccCcceee--cCceeEE Confidence 111112221 1 12333222111111111110 123333322111111222 2223333 Q ss_pred ec Q lcl|Aclame:pro 307 DA 308 (311) Q Consensus 307 ~a 308 (311) .. T Consensus 317 vs 318 (318) T protein:vir:94 317 VS 318 (318) T ss_pred eC Confidence 22 No 232 >protein:vir:98143 Length: 524 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239203;genbank:gi:66391678;genbank:GeneID:3416245 Probab=70.31 E-value=0.21 Score=24.29 Aligned_cols=280 Identities=14% Similarity=0.096 Sum_probs=113.3 Q ss_pred Cc-ccCCCceEcchhHHHHHHHH---HHhhchhhhhcceeecCCCceE-------EEEEe--CC--------------ce Q lcl|Aclame:pro 1 MV-ALATGTFQLPKHLVPGVWQK---AQGQSVLARLSMAEPQEFGEQQ-------YMTLT--AP--------------PR 53 (311) Q Consensus 1 ma-t~~~g~~~vP~~~~~~ii~~---~~~~s~l~~l~~~~~~~~~~~~-------~p~~~--~~--------------~~ 53 (311) ++ .+++|.+. .+.+.++.. +-+..+..+++.+.||.+++.- ++... .+ ++ T Consensus 79 i~~s~~t~~v~---~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~~~~~gteA~~nEAf~~~ye~d 155 (524) T protein:vir:98 79 IASGKSSGAIT---NIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPD 155 (524) T ss_pred ccccccccccc---cccchhhhHHHHHHHhhhhhhhheeccCCchhhhhhhhheeecCCCCCcccccccccccccccccc Confidence 23 12222221 122333333 3355666677777776654321 11110 00 00 Q ss_pred eEE---------------------------------------------------------------------ee------ Q lcl|Aclame:pro 54 GEV---------------------------------------------------------------------VG------ 58 (311) Q Consensus 54 a~~---------------------------------------------------------------------v~------ 58 (311) +.| ++ T Consensus 156 t~fSG~g~~t~~s~~~~g~~~~~g~~~~~~~~~~g~~~~~~~~~g~~~~tgt~p~~~~~a~~~~~~~g~~~~~~~GmsTA 235 (524) T protein:vir:98 156 TMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATS 235 (524) T ss_pred cccCCccccccccccccccccccccccccccccccceeccccccCcccccccccccccccccccccccceeecccccchh Confidence 000 00 Q ss_pred --c---------CccccccccceeEEEEeeeeEEEEEeecHHHhhcCch-hhHHHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|Aclame:pro 59 --E---------GAQKSESTATFAPVTAIPRKVQVTQRFSQEVKWADES-RQLGVLQTMADLSGVALGRALDLIGIHGIN 126 (311) Q Consensus 59 --E---------g~~~~~~~~~~~~v~l~~~kl~~~i~iS~ell~~s~~-~~~~~~~~i~~~la~~ia~~~d~~~l~G~~ 126 (311) | +.++++-..+++.+++.++.-+-...+|-||.||--. -.+|.+++|..-|+..|...|++.+|.=.+ T Consensus 236 ~aEaL~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILSTEImlEINReii~~i~ 315 (524) T protein:vir:98 236 VAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLIN 315 (524) T ss_pred hhhhhccCCCCccccccceeeEEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHh Confidence 1 0112223333444444444444445789999987543 257889999999999999999999884322 Q ss_pred CCCcccccccc------ccccccccceeeccccc-----cchHHHHHHHHHHHhh--cCCCccEEEEcHHHHHHHHHh-- Q lcl|Aclame:pro 127 PLTGAALSGSP------AKILDTTNIVELTTGTS-----ATPDLAVEAAVGLVLG--DNLSPDGVALDNTFSFMLATQ-- 191 (311) Q Consensus 127 ~~~g~~~~~~~------~~~~~~~~~~~~~~~~~-----~~~~~~i~~~~~~~~~--~~~~~~~~v~n~~~~~~l~~l-- 191 (311) ...-.+..++. .|+.+.........+-. ...+--+.+....+.. .....+.++++++....|..+ T Consensus 316 ~~a~~~~~g~t~~~~~~~G~~dl~~~~d~~~~r~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~~i~S~~Va~~L~~~~~ 395 (524) T protein:vir:98 316 YTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDS 395 (524) T ss_pred hhheeceeecccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhhhc Confidence 11111111111 12222211110000000 0111122222222222 223466789999988888752 Q ss_pred ---hccCCc-eeeccccccC-CCcee-cceeEEeecccccccccccccccccccccccceEEEeecceE----EEEeecC Q lcl|Aclame:pro 192 ---RDSQGR-KLYPELGFGT-DVASF-AGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAF----RWGVQVS 261 (311) Q Consensus 192 ---kd~~g~-~~~~~~~~~~-~~~~l-~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~----~~~~~~~ 261 (311) ..+.+- -....+.++. ..|.| .|++|++..+.+.. .+++|-.... .+.+..- T Consensus 396 g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d------------------y~~vG~KG~~~~~~glfyaPY 457 (524) T protein:vir:98 396 GITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQD------------------YFTVGFKGDNEMDAGIYYAPY 457 (524) T ss_pred ccccccchhhcccccCCccceEEEEecCceEEEecCCCCcc------------------eEEEEeeCCcccccceeeccc Confidence 111110 0011111111 11333 45688887776543 2222221100 0111111 Q ss_pred ceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 262 IPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 262 ~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ..+...+-.+++. || -.+.++ .|++.. ++| |+.-...+.+ T Consensus 458 v~l~~~~~~dp~s----fq-P~~g~~--tRY~l~-~NP--~~~~~~~~~~ 497 (524) T protein:vir:98 458 VALTPLRGSDPKN----FQ-PVMGFK--TRYGIG-INP--FANSRSQAPA 497 (524) T ss_pred cccccccccCCcc----cc-ceeeee--eeecee-ecC--cccccCCccc Confidence 2222222223332 33 234443 466654 344 3322222221 No 233 >protein:vir:3783 Length: 336 # NCBI annotation: capsid # Family: family:all:201 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536823;genbank:gi:17981832;genbank:GeneID:929211 Probab=70.07 E-value=0.21 Score=24.25 Aligned_cols=281 Identities=12% Similarity=0.016 Sum_probs=122.3 Q ss_pred Cc--------ccCC-CceEcchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCceeEEeecCccccccccce Q lcl|Aclame:pro 1 MV--------ALAT-GTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVGEGAQKSESTATF 70 (311) Q Consensus 1 ma--------t~~~-g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~Eg~~~~~~~~~~ 70 (311) +| +.+. ..+.|.+...+.+.+.+++.|-++++.+.+++..-. ..+-.-.+++-+.-+.-+..... ... T Consensus 13 ~A~~ngv~~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtdt~r~r~~--~~l 90 (336) T protein:vir:37 13 LAKHFNQPLDSVLRGESFALKAPEAALLGENIQQRSDFLKGINMVQVAHTKGTKLFGATEKGVTGRKQTGRNLAT--LDH 90 (336) T ss_pred HHHHhCCChhhhcccceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEEeeccCcccccccCCCCCccc--cCC Confidence 22 1222 358888899999999999999999999999887533 23333333433333222211111 122 Q ss_pred eEEEEeeeeEEEEEeecHHHhhcCchhhHHHH-HHHHHHHHHHHHHHHHHHhhhcccCCCcc-ccccc--cccc------ Q lcl|Aclame:pro 71 APVTAIPRKVQVTQRFSQEVKWADESRQLGVL-QTMADLSGVALGRALDLIGIHGINPLTGA-ALSGS--PAKI------ 140 (311) Q Consensus 71 ~~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~-~~i~~~la~~ia~~~d~~~l~G~~~~~g~-~~~~~--~~~~------ 140 (311) +.-.+..++.---..|+-+.| +.+..+.|.. ..+...+.++|+...-.--+||+.....+ .|.+. ..|+ T Consensus 91 ~~~~Y~c~qTn~dt~i~y~~L-D~WA~~~d~~~~~~~~~~~r~iALD~i~IGfnG~s~A~~TdnPllqDVNkGWlQ~~Re 169 (336) T protein:vir:37 91 SQNGYELSETDSGILVNWSLF-DSFAIFKDRLVELYSEYFQNQVALDILQIGWNGQSVATNTTKTDLSDVNKGWLKLLQE 169 (336) T ss_pred CCCccEEEEeeeeeeccHHHH-HHHhcChhHHHHHHHHHHHHHHhcchhhhcccceeeccCCCCccccccchhHHHHHHh Confidence 233333333333445777766 3444445533 33344455555555555566885421111 11110 1111 Q ss_pred ----------cccc-cceeecc-ccccchHHHHHHHHHHHhhcCCC-cc-EEEEcHHHHHH-HHHhhccCC-ceeecccc Q lcl|Aclame:pro 141 ----------LDTT-NIVELTT-GTSATPDLAVEAAVGLVLGDNLS-PD-GVALDNTFSFM-LATQRDSQG-RKLYPELG 204 (311) Q Consensus 141 ----------~~~~-~~~~~~~-~~~~~~~~~i~~~~~~~~~~~~~-~~-~~v~n~~~~~~-l~~lkd~~g-~~~~~~~~ 204 (311) ..++ .+...+. ++-...++...++...+.....+ +. +.++.+..... ...+-..++ +|- .... T Consensus 170 ~a~~~v~~~~~~~~g~i~~~G~~gdy~NLDalV~D~~~~I~~~~~~d~dLVvivG~dLla~~~~~l~~~~~~~Pt-E~~A 248 (336) T protein:vir:37 170 QRAANFMTESTKSSGKITIFGDNADYANLDDLAFDLKQGLDFRHQNRNDLVFLVGADLVSKETKLIQQKHGLTPT-EKAA 248 (336) T ss_pred ccchhhcccccccCCceEEecCCCCcccHHHHHHHHHhccchHHhcCCCeEEEEchhhhhhhhhhhhhhcCCCHH-HHHH Confidence 1111 1112122 22333344444455444332222 22 46666644421 112322222 221 0000 Q ss_pred c--cCCCceecceeEEeecccccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcC Q lcl|Aclame:pro 205 F--GTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQN 282 (311) Q Consensus 205 ~--~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~ 282 (311) . -....++-|+|++..+++|.+...++... ..++++-.=+ .|+. ++-.+ +.+.--++..+| T Consensus 249 a~~~~~~k~iGGlpa~~~PffP~~~~lVT~L~--------NLsIY~Q~gs-----~RR~--~~d~p--~r~rie~y~s~N 311 (336) T protein:vir:37 249 LGSHNLMGSFGGMNAITPPNFPARAAAVTTLK--------NLSVYTEAES-----VRRS--LRNDE--DKKGLVTSYYRQ 311 (336) T ss_pred HHHHHHHHhhCCceEEEccccCCCceEEeecc--------ccEEEEecCc-----EEEE--EEEcc--ccccccchhhhc Confidence 0 11236789999999999998866554432 1122221111 1111 11111 111111222222 Q ss_pred cEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 283 QIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 283 ~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) -|+.|=+..+++.+...+-. T Consensus 312 ---------e~YvVEd~~~~a~iE~i~v~ 331 (336) T protein:vir:37 312 ---------EGYVVEDLGLMTAIDHTKVK 331 (336) T ss_pred ---------ceeeeeccccEEEeeeeeee Confidence 22334444444444322222 No 234 >protein:vir:100851 Length: 514 # NCBI annotation: hypothetical protein # Family: family:all:2450 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164744;genbank:gi:56693157;genbank:GeneID:3197484 Probab=60.16 E-value=0.38 Score=22.90 Aligned_cols=289 Identities=11% Similarity=0.018 Sum_probs=117.0 Q ss_pred Cc--------ccCCCceEcchhHHHHHHHHHHhhchhh--hhcceeecCCCceEEEEEe---CCceeEEeecCccccccc Q lcl|Aclame:pro 1 MV--------ALATGTFQLPKHLVPGVWQKAQGQSVLA--RLSMAEPQEFGEQQYMTLT---APPRGEVVGEGAQKSEST 67 (311) Q Consensus 1 ma--------t~~~g~~~vP~~~~~~ii~~~~~~s~l~--~l~~~~~~~~~~~~~p~~~---~~~~a~~v~Eg~~~~~~~ 67 (311) |- +...|+.+--+.+.+++..+......+. .-....+..+-.-+|-... ....+.+++|++-.+.++ T Consensus 45 ~t~gy~~~~~~~t~gaAlR~EsLd~~l~~Lt~~~~~ftf~~~i~k~~a~STV~ey~~~~~~G~~G~~~f~~E~gi~~~~d 124 (514) T protein:vir:10 45 FTAGHSITPDTQTDGAANRIESLNRDLKVTTWGERDFTLYNDIAKQPVDNTVLKYTQYYSHGRTGHSLFQPEIGIGDVNN 124 (514) T ss_pred hccccccCCccccCccchhhhhhccceeEeeecCcchhhhhhcCCchhhHHHhhhhhhcccCcccccccccccccCcCCC Confidence 11 1111222222222222222221111111 1112223332222222222 233577899999999999 Q ss_pred cceeEEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCc------ccccccccccc Q lcl|Aclame:pro 68 ATFAPVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTG------AALSGSPAKIL 141 (311) Q Consensus 68 ~~~~~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g------~~~~~~~~~~~ 141 (311) +.+....+..+=++....+|.-+=.+. ...+.++...+.....+++.++.++|+|+..... ....|+.+-+ T Consensus 125 ~~~~rk~~~~k~l~~~~~vS~~~~l~n--~i~d~~~~~~~dai~~ia~tiE~a~FyGDs~L~s~~~~~gleFDGl~~lI- 201 (514) T protein:vir:10 125 PNERQRTINIKYIVDTHVTSIALQRAN--TIVDSLKVQEYAAISTVIKTDEWAMFYGDADLTSGQKGEGLQFDGLFKLI- 201 (514) T ss_pred cceEEEEEeeeeeeeeeeeeehhhhcc--chhhHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccCcchhhhHHHhh- Confidence 999988888877766544444322211 3346777778888889999999999999664432 3344544433 Q ss_pred ccccceeeccccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeec Q lcl|Aclame:pro 142 DTTNIVELTTGTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSD 221 (311) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~ 221 (311) +..++.. ..+....-+.+..+-..+.....+++-+.|+....+.|..-.....|-+.+..+. +...|+|+- . T Consensus 202 ~~~NvID--arG~~Ls~~~ln~aA~~i~~gfGt~TD~ylp~~vka~f~~~~~~~qRV~~~~n~~----~~~~G~~v~--~ 273 (514) T protein:vir:10 202 APENHID--LRGGRLSPAALNMAARKIGEGFGTPTDAYMPIGIKADFVNQHLNGQRVMLPGQTG----GMTTGLDID--K 273 (514) T ss_pred cCCCeEe--cCCCCccHHHHhhhhhhhhcccCChhheeCchHHHHHHhhcccCcceEEeecCcc----ceeeeeecc--c Confidence 2333332 1222333455655555565666778889999999888876544444443322111 112233321 0 Q ss_pred ccccccccccccccccccccccceEEEeecceEEEE--------eecCceEEEeccCC-------cc-cchhhhhcC--- Q lcl|Aclame:pro 222 TVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWG--------VQVSIPLELIEFGD-------PD-GLGDLKRQN--- 282 (311) Q Consensus 222 ~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~--------~~~~~~i~~~~~~~-------~~-~~~~~f~~~--- 282 (311) ++-....+.... ..+.+....+... ....+++.+.++.. .. ..++.|-.. T Consensus 274 f~s~~G~I~L~g-----------s~im~~~n~L~~~~~~~~~Ap~~~~va~svT~~~~g~~~~ad~t~~~g~~~~~~~~g 342 (514) T protein:vir:10 274 FLSAHGSIRIQG-----------STIMDSDNKLDFDRPVSPTAPTAPQLSATVTPDGGGLWHEADKTDSKGEVILNKEVG 342 (514) T ss_pred eeEeccceeecC-----------CeeecccccCccCCccCCcCCCCCcceEEEecCcccccCcccccccccccccccccc Confidence 000000000000 0000000000000 00001111111100 00 000000000 Q ss_pred -cEEEEEEEEeccEEecccceEEE-----------EecccC Q lcl|Aclame:pro 283 -QIAIRAEVVYGIGIMSTDAFAVV-----------RDADES 311 (311) Q Consensus 283 -~v~~ra~~r~~~~v~~~~a~~~l-----------~~aa~~ 311 (311) ...|++...-+.+=-.|+.++-. ++..++ T Consensus 343 ~~~sYaVv~~n~~GeS~ps~~vtaT~a~~~~~i~ltItp~~ 383 (514) T protein:vir:10 343 VEQSYVAVMVSRHGDSRPSLVQTATPTKKDDAITLTITPNA 383 (514) T ss_pred eeEEEEEEEECCCCcccccceeeeeeeccCceEEEEEEecc Confidence 00111111111111122222111 111111 No 235 >protein:vir:80986 Length: 528 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469506;genbank:gi:157311463;genbank:GeneID:5602119 Probab=56.05 E-value=0.47 Score=22.41 Aligned_cols=284 Identities=14% Similarity=0.115 Sum_probs=118.9 Q ss_pred CcccCCCceEcchhHHHHH---HHHHHhhchhhhhcceeecCCCceEEE-------EEeC------------Cce----- Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGV---WQKAQGQSVLARLSMAEPQEFGEQQYM-------TLTA------------PPR----- 53 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~i---i~~~~~~s~l~~l~~~~~~~~~~~~~p-------~~~~------------~~~----- 53 (311) +-.++++.+. .+.+.+ ++++-+..+..+++.+.||.+++--|. .... .++ T Consensus 79 ~es~~t~~v~---~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~ea~~~~~~~da~fS~ 155 (528) T protein:vir:80 79 AAGQTTGAIT---NVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGPNPLASQAKEAFHPMYAPDAFHSS 155 (528) T ss_pred cccccccccc---cCCchhhhHHHHHHhhhhhhhhheeccCCchhhhheeeeeeecCCcccccccccccccccccccccc Confidence 2234444432 233333 333445677778888888865531111 0000 000 Q ss_pred -------------------------------------------------------------------------eEEe--- Q lcl|Aclame:pro 54 -------------------------------------------------------------------------GEVV--- 57 (311) Q Consensus 54 -------------------------------------------------------------------------a~~v--- 57 (311) .+-+ T Consensus 156 ~~t~~~a~~~ea~t~fs~~~~~~~~~~G~~~~~t~~~tg~~~~~~~~~~~~~~~~~gt~~~~~~~~~~~~~~~~~~~~~G 235 (528) T protein:vir:80 156 LAAKGAAVGSPTGTPFAKLAIGTQIEAGDIVHHTFAETGIAYLQNVTAEQVTPTKAGSESEDEVVMKLMEEGKLAEIAFG 235 (528) T ss_pred ccccccccccccccccccccccccccccceeccccccccccccccccccccCccccCCcccccccccccccccccccccc Confidence 0001 Q ss_pred -----ec---------CccccccccceeEEEEeeeeEEEEEeecHHHhhcCch-hhHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 58 -----GE---------GAQKSESTATFAPVTAIPRKVQVTQRFSQEVKWADES-RQLGVLQTMADLSGVALGRALDLIGI 122 (311) Q Consensus 58 -----~E---------g~~~~~~~~~~~~v~l~~~kl~~~i~iS~ell~~s~~-~~~~~~~~i~~~la~~ia~~~d~~~l 122 (311) +| +.+.++-..+++.+++.++.-+-...+|-||.||--. -.+|.+++|..-|+..|...|++.+| T Consensus 236 m~Ta~AE~le~lg~ss~~~f~EMaFsIEKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILStEImlEINReii 315 (528) T protein:vir:80 236 MATSIAEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIV 315 (528) T ss_pred cchhhhhhhcccCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHH Confidence 11 0112233334455555444444455789999886533 25788999999999999999999996 Q ss_pred hcccCCCccccccc------cccccccccceeecccccc-c----hHHHHHHHHHHHhh--cCCCccEEEEcHHHHHHHH Q lcl|Aclame:pro 123 HGINPLTGAALSGS------PAKILDTTNIVELTTGTSA-T----PDLAVEAAVGLVLG--DNLSPDGVALDNTFSFMLA 189 (311) Q Consensus 123 ~G~~~~~g~~~~~~------~~~~~~~~~~~~~~~~~~~-~----~~~~i~~~~~~~~~--~~~~~~~~v~n~~~~~~l~ 189 (311) .=.+...--+-.+. ..|+.+..........-.. . .+--+......+.. .....+.++++++....|. T Consensus 316 ~~i~~~a~~~~~~~t~~~~~~~G~~dl~~~~d~~g~r~~~e~~k~L~~~i~~~an~I~~~T~~~~gn~vi~S~~Va~~L~ 395 (528) T protein:vir:80 316 DVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILA 395 (528) T ss_pred hhhhheeeeeeeeeeeccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHh Confidence 42211100001111 1222222211110000000 0 11112222222222 2223467889999988886 Q ss_pred Hh-----hccC-CceeeccccccCC-Ccee-cceeEEeecccccccccccccccccccccccceEEEeecceEEEEeecC Q lcl|Aclame:pro 190 TQ-----RDSQ-GRKLYPELGFGTD-VASF-AGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVS 261 (311) Q Consensus 190 ~l-----kd~~-g~~~~~~~~~~~~-~~~l-~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~ 261 (311) .. .... ....+..+.++.. .|.| .|++|++..+.+..-.++.-- +.. .... .+. +..- T Consensus 396 ~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~K-------G~~-~~~~----glf--y~PY 461 (528) T protein:vir:80 396 SADQGISLAMQGAAKGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYK-------GDN-EMDA----GIY--YAPY 461 (528) T ss_pred hccccccccccccccccccCCCCceEEEEecCceEEEecCCCCcceEEEEEe-------CCc-cccc----cee--eccc Confidence 52 1111 2233333333332 3455 456888887765442221111 000 0000 011 1111 Q ss_pred ceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEE-ecccC Q lcl|Aclame:pro 262 IPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVR-DADES 311 (311) Q Consensus 262 ~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~-~aa~~ 311 (311) ..+...+-.+++. || -.+.++ .|++..+ +| |+.-+ .+.++ T Consensus 462 v~l~~~~~~dp~s----fq-P~~g~~--tRY~l~~-NP--~~~~~~~~~~~ 502 (528) T protein:vir:80 462 VALTPLRATDPQS----FH-PVLGFK--TRYGIGI-NP--FADSKSQAPSA 502 (528) T ss_pred ccceeeEeeCCcc----cc-ceeeee--eeeceee-cC--cccccCCcccc Confidence 1111112223322 33 234443 4666543 44 33222 22222 No 236 >protein:vir:93696 Length: 364 # NCBI annotation: Bcep22gp55 # Family: family:all:974 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944284;genbank:gi:38640361;genbank:GeneID:2658350 Probab=55.45 E-value=0.48 Score=22.33 Aligned_cols=289 Identities=12% Similarity=0.045 Sum_probs=118.4 Q ss_pred CcccCCCc--eEcchhHHHHHHHHHHhhchhhh-hcce-----e----ecC---CCceEEEEEeCCceeEEeecC--ccc Q lcl|Aclame:pro 1 MVALATGT--FQLPKHLVPGVWQKAQGQSVLAR-LSMA-----E----PQE---FGEQQYMTLTAPPRGEVVGEG--AQK 63 (311) Q Consensus 1 mat~~~g~--~~vP~~~~~~ii~~~~~~s~l~~-l~~~-----~----~~~---~~~~~~p~~~~~~~a~~v~Eg--~~~ 63 (311) |+.+..+- -.....|+..+.......+.... +... | ... +..+++.... .-...+|-++ -+- T Consensus 1 Ma~T~~~~~~p~a~~~ws~~l~~~~~~~s~f~~~l~G~~~~~~I~~~~dL~k~~Gd~v~f~L~~-~L~g~gv~Gd~~leG 79 (364) T protein:vir:93 1 MSQTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDRITFDLSV-HLRGKPTYGDARVEG 79 (364) T ss_pred CceeccCcCCHHHHHHHHHHHHHHHHhhCccccccccCCCCCcEEEeeecCCCCCceEEeeeee-ecccCCcccCceeec Confidence 88654433 23345577777777766665554 3210 0 000 0112222211 1122333222 233 Q ss_pred cccccceeEEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhc-ccCCCcc-ccccccc--- Q lcl|Aclame:pro 64 SESTATFAPVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHG-INPLTGA-ALSGSPA--- 138 (311) Q Consensus 64 ~~~~~~~~~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G-~~~~~g~-~~~~~~~--- 138 (311) .+...+|.+-++..-.+..-+.....+-+ --+..|+...-++.|+.-+.+..|+.+|.= .+..+-. ....-+. T Consensus 80 nee~L~~~~~~i~idq~r~~V~~~g~ms~--qRt~~dlr~~ar~~L~~w~~~~~d~~~f~~laGarg~~~~~~~~~~~~~ 157 (364) T protein:vir:93 80 KEESLRFYQDEVRIDQVRHSVSAGGRMSR--KRTVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFIETPDFTG 157 (364) T ss_pred cccceeEEeeEEEEeeccccccccCchhh--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccCccc Confidence 45556666555555444444432222211 124578989999999999999999987622 1110000 0000000 Q ss_pred -------------cccc--cccceeeccccccchHHHHHHHHHHHhhcCCC--------c------c--EEEEcHHHHHH Q lcl|Aclame:pro 139 -------------KILD--TTNIVELTTGTSATPDLAVEAAVGLVLGDNLS--------P------D--GVALDNTFSFM 187 (311) Q Consensus 139 -------------~~~~--~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~--------~------~--~~v~n~~~~~~ 187 (311) .+.. .+...+. +.++....+.++.+...+...+.. | . +++|||..+.. T Consensus 158 ~~~N~v~aPt~~r~~~~~~at~~~~l-~stD~~sl~~id~a~~~a~~~~~~~~~~~~~~Pv~~~g~~~yV~~l~p~q~~~ 236 (364) T protein:vir:93 158 YAGNPLDAPDVDHLLYGGVATSKASL-AATDIMAPLVIEKAVEKAAMMQAENPDVANMVPVSIDGDDHYVCVMSEYQATD 236 (364) T ss_pred ccccccCCCCCCcEEeccccCchhhc-cccccccHHHHHHHHHHHHHhCCCCCCCcccceeEecCcceeEEEEcchhhhh Confidence 0000 0111111 223445566677776655433221 1 1 58889998888 Q ss_pred HHHhhc--------------cCCceeeccccccCCCceecceeEEeecccccccccccccccccccccccceEEEeecc- Q lcl|Aclame:pro 188 LATQRD--------------SQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFS- 252 (311) Q Consensus 188 l~~lkd--------------~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~- 252 (311) |+...| ...+|||. +..++++|+.++--..++..-........ ...--.++|--. T Consensus 237 Lr~~t~~~w~d~qk~A~~~~g~~nPlF~-----G~~gm~ngvii~~~~~vi~~~~~~~~~~v-----~~~ralllGaQA~ 306 (364) T protein:vir:93 237 MRTAAGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVVLHKHRNVIRFNDYGAGANV-----EAARALFMGRQAG 306 (364) T ss_pred hhhcCCHHHHHHHHHhhhcccccCCcee-----cCeeeEcCeEEeccCCcccccccccCccc-----cchhhheecceee Confidence 874321 12356653 45688999988755444322111111000 000012222211 Q ss_pred eEEEEeecCceEEEecc-CCcccchhhhhcCcEEEEEEEEeccEEec----ccceEEEEecccC Q lcl|Aclame:pro 253 AFRWGVQVSIPLELIEF-GDPDGLGDLKRQNQIAIRAEVVYGIGIMS----TDAFAVVRDADES 311 (311) Q Consensus 253 ~~~~~~~~~~~i~~~~~-~~~~~~~~~f~~~~v~~ra~~r~~~~v~~----~~a~~~l~~aa~~ 311 (311) .+.++--.+....-.++ .|.+ |.+.+-+...+|++-.+ .-.+..|..++.+ T Consensus 307 ~~a~g~~~g~~~~w~Ee~~D~g--------n~~~i~~~~i~G~kK~rF~~~DfGvi~idtaa~~ 362 (364) T protein:vir:93 307 VIAYGTANGLRFDWEETVKDYG--------NEPAIAAGFIAGMKKARFNNKDFGVISIDTAAKK 362 (364) T ss_pred EEEeecCCCCCceeeecccCCC--------CchhhhhhhHhhhhhcccCCccceEEEecccccc Confidence 11112112222211111 0111 11222221222222111 1112222222222 No 237 >protein:vir:107882 Length: 307 # NCBI annotation: gp34 # Family: family:all:908 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024707;genbank:gi:48696944;genbank:GeneID:2845970 Probab=55.11 E-value=0.49 Score=22.30 Aligned_cols=278 Identities=9% Similarity=0.002 Sum_probs=122.6 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCceeEEeecC--ccccc-ccccee-----E Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEG--AQKSE-STATFA-----P 72 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg--~~~~~-~~~~~~-----~ 72 (311) ||+... -.++-+.+.+--+..-.+..+--.+++.+|++.-..+|++... ++--+.+- +.... ...+|. . T Consensus 1 m~~~~~-~~~~dp~LT~~A~gy~n~~~ia~~l~P~vpv~~~~~k~~~f~~--eaF~~~~t~r~~~~~~~~v~~~~~~~~~ 77 (307) T protein:vir:10 1 MGRLSK-LRIVDPVLTNLAIGYTNAEFIGQSLMPVVEVEKEGGKIPKFGK--ESFRLYKTERALRARSNRMNPEDLGSID 77 (307) T ss_pred CCCCCC-CcccChhHHHHHHhhcchhhhhhhcCCcccccccccceeeECc--ccccchhhhcccCCCcceeecccccccc Confidence 887753 3334444555455454455566677888888877788888742 22112111 11111 111221 2 Q ss_pred EEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccccccccccccceeeccc Q lcl|Aclame:pro 73 VTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTG 152 (311) Q Consensus 73 v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~ 152 (311) ..+..|-+.. ++..+ ....+..+.++...+.+.+.|.+..|..+-.-.-....-+.. ....+.++ ... +. T Consensus 78 ~~~~~~~L~~--~id~r---~~~~~~~~~~~~av~~l~d~I~l~~E~~~A~l~~~~~~y~~~--~k~tLsGt--~~W-sd 147 (307) T protein:vir:10 78 IVLDEHDLEY--PIDYR---EDQESAFPLEQAAVQTATEAIQLRREKMVADLAQNPNSYAGG--NKKQLSAT--EKF-TA 147 (307) T ss_pred cccccccccc--cCChh---hcCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCccccCCC--ceEEeccc--ccc-CC Confidence 2233333332 33322 223344556666666666655555544332110000000000 11111111 122 22 Q ss_pred cccchHHHHHHHHHHHh-hcCCCccEEEEcHHHHHHHHH---h-h--ccCCceeeccccccCCCceeccee-EEeecccc Q lcl|Aclame:pro 153 TSATPDLAVEAAVGLVL-GDNLSPDGVALDNTFSFMLAT---Q-R--DSQGRKLYPELGFGTDVASFAGLN-AAVSDTVR 224 (311) Q Consensus 153 ~~~~~~~~i~~~~~~~~-~~~~~~~~~v~n~~~~~~l~~---l-k--d~~g~~~~~~~~~~~~~~~l~G~p-v~~~~~~~ 224 (311) ...+...+|.....++. ..+.+|+..+|.+..|.+|++ + + +..+..+.. ...-..++|+. |.+....- T Consensus 148 ~~sDPi~di~~~~~ai~~~~g~~Pn~~vlg~~a~~al~~hp~i~e~lk~~~~g~it----~~~la~ll~v~~i~vg~a~~ 223 (307) T protein:vir:10 148 AGSDPVGVIEDGKEAIRTKIGRRPNTMVIGASAYKTLKAHPQLIEKIKYSMKGIVT----VDLLKEIFEVENIAVGEAIY 223 (307) T ss_pred CCCCcHHHHHHHHHHHHhhhCCccceEEeCHHHHHHHhcCHHHHHHhCCccccccC----HHHHHHHhCceeEEEeeeee Confidence 45677788888877764 568889999999999988865 1 1 122221111 11123344532 22322221 Q ss_pred cccccccccccccccccccceEEE-------eec-----c-eEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEE Q lcl|Aclame:pro 225 GGPEAVTASTGVYRTTNPNVKAIA-------GDF-----S-AFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVV 291 (311) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~-------gd~-----~-~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r 291 (311) . .........+.+...... +.. + +|.. .+++..+. ..+. ...+--.+|+..+ T Consensus 224 ~-----~~~~~~~~iw~~~~vl~yv~~~~~~~~~~~~epsfGyT~-~~~g~~~~-d~~~--------~~~~~~~~r~~~~ 288 (307) T protein:vir:10 224 A-----DDKDRFTDIWGANIVLAYVPLQRGGQQRTPYEPSYGYTL-RKKGNPVV-DTRI--------EDGKLELVRSTDI 288 (307) T ss_pred e-----ccCCccceeCCCceEEEecccccCCCCCcccccccceeE-EEcCCeEe-ecee--------cCCceeEEecccc Confidence 0 000001111111110000 000 0 1111 12222211 1110 1123334677777 Q ss_pred eccEEecccceEEEEeccc Q lcl|Aclame:pro 292 YGIGIMSTDAFAVVRDADE 310 (311) Q Consensus 292 ~~~~v~~~~a~~~l~~aa~ 310 (311) +.-.+.-+++-..|+++-. T Consensus 289 ~~~~i~~~~~G~li~~~~~ 307 (307) T protein:vir:10 289 FRPYLLGADAGYLISGING 307 (307) T ss_pred ccceeecccccceeccCCC Confidence 7777777888778877777 No 238 >protein:vir:1991 Length: 305 # NCBI annotation: major head subunit # Family: family:all:776 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050638;genbank:gi:9633525;genbank:GeneID:2636267 Probab=52.89 E-value=0.54 Score=22.04 Aligned_cols=231 Identities=14% Similarity=0.044 Sum_probs=107.0 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhh--chhhhhcceeecCCCceEEEEEeCCcee-EEeecCccccccccceeEEEEee Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQ--SVLARLSMAEPQEFGEQQYMTLTAPPRG-EVVGEGAQKSESTATFAPVTAIP 77 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~--s~l~~l~~~~~~~~~~~~~p~~~~~~~a-~~v~Eg~~~~~~~~~~~~v~l~~ 77 (311) |..+...=-.+=+.+ +.+.+..... +...+++..++..+..-+|..+..-|.. .|+||- ....++-..-+++- T Consensus 1 M~i~~~~l~~l~~~~-~~~f~~~~~~a~~~~~~iA~~vpSt~~~~tY~wLg~fP~lrewiGer---~i~~l~~~~y~i~N 76 (305) T protein:vir:19 1 MIVTPASIKALMTSW-RKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWVGKR---TIQQMEAHGYSIAN 76 (305) T ss_pred CccCHHHHHHHHHHH-HHHHHHHHhhcCcccceEEeEecCCCCcccccccccCCccchhhcce---eeeeccccceeEee Confidence 554432211111223 2222222222 3356666667755555677777777765 588654 33333334445566 Q ss_pred eeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCC-Cccccccccccccccccceeeccccccc Q lcl|Aclame:pro 78 RKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPL-TGAALSGSPAKILDTTNIVELTTGTSAT 156 (311) Q Consensus 78 ~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (311) +++...+.|.++-+ +|+.+++-.-+.+++.++.+..-|..++.=-..+ +..-..|. ...++-..+.. T Consensus 77 k~fe~tV~V~R~dI---eDD~lG~y~p~~~~~G~~aa~~pd~lv~~lL~~Gf~~~cyDGq--~FFdtDHpv~~------- 144 (305) T protein:vir:19 77 KTFEGTVGISRDDF---EDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQ--NFFDKEHPVYP------- 144 (305) T ss_pred ccccceeccchhhc---cccccCchHHHHHHHHHHHhhchhhHHHHHHHhcCCccCCCCC--cccCCCCCccc------- Confidence 77888999999977 6778899999999999999888887766211000 00111111 11122111100 Q ss_pred hHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceeeccccccCCCceecceeEEeecccccccccccccccc Q lcl|Aclame:pro 157 PDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGV 236 (311) Q Consensus 157 ~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~ 236 (311) +.+....... ...++..++..|.+.|--+. .+ . T Consensus 145 -------------~~~~tg~~~~-----vsn~~~~~~~~g~~w~Lld~----------~~-----~-------------- 177 (305) T protein:vir:19 145 -------------NVDGTGSAVN-----TSNIVEQDSFSGLPFYLLDC----------SR-----A-------------- 177 (305) T ss_pred -------------CCcccccccc-----hhhhhcCCCCCCceeeeeec----------CC-----c-------------- Confidence 0000000000 01122233444443321000 00 0 Q ss_pred cccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 237 YRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 237 ~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .+.+++..|+..++......+ .-+.|.+++..+-+..|+..+.--..--..-+.+-++ T Consensus 178 --------------ikP~I~Q~Rk~~~~~~~~~~~---d~~vf~~~e~~ygvd~R~n~Gygfwq~a~gS~~~Ls~ 235 (305) T protein:vir:19 178 --------------VKPLIFQERRKPELVARTRID---DDHVFMDNEFLFGASTRRAAGYGFWQMAVAVKGDLTL 235 (305) T ss_pred --------------ceeEEEecccccceeeccCCC---chhhhhhceeeeeeeeeeeccccchhheecCCCCCCH Confidence 022345566666554333222 1234778888887777776555433211111111111 No 239 >protein:vir:5670 Length: 514 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899609;genbank:gi:34419596;genbank:GeneID:2546039 Probab=52.09 E-value=0.56 Score=21.95 Aligned_cols=279 Identities=10% Similarity=0.020 Sum_probs=102.7 Q ss_pred CcccCCCceEcch------hHHHHHHH---------HHHhhchhhhhcceeecCCCc-eEE--EE------EeCCceeEE Q lcl|Aclame:pro 1 MVALATGTFQLPK------HLVPGVWQ---------KAQGQSVLARLSMAEPQEFGE-QQY--MT------LTAPPRGEV 56 (311) Q Consensus 1 mat~~~g~~~vP~------~~~~~ii~---------~~~~~s~l~~l~~~~~~~~~~-~~~--p~------~~~~~~a~~ 56 (311) +-....++..++. ........ ..............-+..... ..+ .. ..+..-..- T Consensus 148 ~fSG~~~~~~~~~~~~~~~~~~G~~~~~~~t~~~gd~~~~~~~~~~~~~~~~~~~~~~t~~~~~~a~~~~y~~~~Gm~Ta 227 (514) T protein:vir:56 148 SFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATS 227 (514) T ss_pred Cccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccchhhhhhhhhhhh Confidence 1011001100000 00000000 000000000000000000000 000 00 000011111 Q ss_pred eec---------CccccccccceeEEEEeeeeEEEEEeecHHHhhcCch-hhHHHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|Aclame:pro 57 VGE---------GAQKSESTATFAPVTAIPRKVQVTQRFSQEVKWADES-RQLGVLQTMADLSGVALGRALDLIGIHGIN 126 (311) Q Consensus 57 v~E---------g~~~~~~~~~~~~v~l~~~kl~~~i~iS~ell~~s~~-~~~~~~~~i~~~la~~ia~~~d~~~l~G~~ 126 (311) .+| +.++++-..+++.+++.++.-+-...+|-||.||--. -.+|.+++|..-|+..|..+|++.+|+=.+ T Consensus 228 ~aEal~~lggs~~~~f~EMaFsIdK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILSTEImlEINReii~~l~ 307 (514) T protein:vir:56 228 QAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVN 307 (514) T ss_pred hhhhcccCCCCcccccceeeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHH Confidence 222 2335555566666666665555556889999987543 257889999999999999999999952211 Q ss_pred C-----CCccccccccccccccccceeeccccccchHHHHHHHHHHHh---------hcCCCccEEEEcHHHHHHHHHh- Q lcl|Aclame:pro 127 P-----LTGAALSGSPAKILDTTNIVELTTGTSATPDLAVEAAVGLVL---------GDNLSPDGVALDNTFSFMLATQ- 191 (311) Q Consensus 127 ~-----~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~---------~~~~~~~~~v~n~~~~~~l~~l- 191 (311) . ..+.....-..|+.+-.....+..+-. ..+-+..++.++. ...+..+.++++++....|... T Consensus 308 ~~atv~~~~~~~~~~~~G~~d~~~~~d~~~~~~--~~e~~~~l~~~i~~~an~i~~~T~rg~gn~~i~S~~Va~~L~~sg 385 (514) T protein:vir:56 308 SQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARW--AGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTD 385 (514) T ss_pred hheeehhcccccccccccccccccccccccchH--HHHHHHHHHHHHHHHHHHHHhhcccccccEEEEchhHHHHHHhhh Confidence 0 111111111122222222111111100 1222222222221 1234567799999999888641 Q ss_pred -hc---cCC--ceeecccccc-CCCcee-cceeEEeecccccccccccccccccccccccceEEEeecceEE----EEee Q lcl|Aclame:pro 192 -RD---SQG--RKLYPELGFG-TDVASF-AGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFR----WGVQ 259 (311) Q Consensus 192 -kd---~~g--~~~~~~~~~~-~~~~~l-~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~----~~~~ 259 (311) .+ ..| +--+..+..+ -..|.| .|++|++..+.+.. .+++|-..... +.+. T Consensus 386 ~l~~~~~~g~~~~~~~~d~~~~~~aG~l~~~~~vy~D~y~~~d------------------y~~vG~KG~~~~~~glfya 447 (514) T protein:vir:56 386 TLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVND------------------YFTVGFKGSTEMDAGVFYS 447 (514) T ss_pred hhccccccCccccccccccCcceEEEEecCceEEEecCCCCcc------------------eEEEEEecCcceecceeec Confidence 11 111 1011111111 112344 56688888776643 22222211000 1111 Q ss_pred cCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 260 VSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 260 ~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .-..+...+..+++. || -.+.++ .|++..+ +| |. ...+.. T Consensus 448 PYv~l~~~~~~dp~s----fq-P~~g~~--tRY~l~~-NP--y~--~~~~~~ 487 (514) T protein:vir:56 448 PYVPLTPLRGSDSKN----FQ-PVIGFK--TRYGVQV-NP--FA--DPTASA 487 (514) T ss_pred cccccccccccCCcc----cc-ceeeee--eeeceee-CC--CC--Cccccc Confidence 111122122122222 33 234443 4666543 33 21 100000 No 240 >protein:vir:78148 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:4955 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294802;genbank:gi:149882823;genbank:GeneID:5309176 Probab=51.19 E-value=0.5 Score=22.23 Aligned_cols=116 Identities=10% Similarity=-0.024 Sum_probs=65.1 Q ss_pred EEcHHHHHHHHH-------hhccCCceeeccccccCCCceecceeEEeecccccccccccccccccccccccceEEEeec Q lcl|Aclame:pro 179 ALDNTFSFMLAT-------QRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDF 251 (311) Q Consensus 179 v~n~~~~~~l~~-------lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~ 251 (311) +++...|..+.. |--.|.+|++. +.-+-+++|..-+.+..+|+++.++.+......+.+.+.. + T Consensus 1 vvsdlqfA~~~g~~v~~~aLpRE~aNp~lt----G~lpV~~~GltWl~tpnlpg~~a~vlDst~lGgmaDE~l~---~-- 71 (123) T protein:vir:78 1 MLSGAQFAKLIGILVDDKALPREQANIVLT----GSLPVSAYGLTWVTSRHITGTDPWLFDVEQLGGMADEKLL---S-- 71 (123) T ss_pred CcchhhHHHHhcchhcccccccccCCceEe----cCcceeeeceeeeecCCCCCCccceeehhhhccccccccC---C-- Confidence 222222222211 11233456553 4456678899989999999998888776654443322110 0 Q ss_pred ceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccc Q lcl|Aclame:pro 252 SAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADE 310 (311) Q Consensus 252 ~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~ 310 (311) -.|.-....+++.++.|... -.+|+..+|+..-----+.-|.|.++|+.--- T Consensus 72 Pgya~~~~~Gvevkt~Red~-------~~nD~yriRaRRvTvpiv~EP~Agv~ltg~g~ 123 (123) T protein:vir:78 72 PEFAPAGNTGVEASTERAHQ-------GVKDGYLVRGRRNTVAVVTEPMAGVRLTGTGL 123 (123) T ss_pred CcccCCCCcceeEEeecccc-------CCCCceEEeeeecceeEEecCccceEEeeecC Confidence 11222334455556555421 13677788886555566788999999985544 No 241 >protein:vir:103370 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:11266 # MgeID: mge:1621 # MgeName: PaP2 # Cross-refs: genbank:acc:YP_024741;genbank:gi:48697083;genbank:GeneID:2846038 Probab=49.16 E-value=0.65 Score=21.62 Aligned_cols=287 Identities=8% Similarity=-0.058 Sum_probs=119.9 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhh-----hhcceeecCCCceEEEEEeCCceeEEeecCc-------ccccccc Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLA-----RLSMAEPQEFGEQQYMTLTAPPRGEVVGEGA-------QKSESTA 68 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~-----~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~-------~~~~~~~ 68 (311) -++.+.+...++..-. +.+...+. ...++..+...++++-|...+..++-+++|. .++|..- T Consensus 71 ~a~a~~T~l~ve~~~~------f~~~~l~~~~~~~Evirv~sVng~~lTV~Rg~~~t~aaaia~n~~~~~Ig~~~eEGsd 144 (418) T protein:vir:10 71 EAAADATVLTVENSDG------LTKGMIFYNEATGENMRLELVNGLNLTVKRQTGRISAAIIAANTKLIVIGTAFEEGSQ 144 (418) T ss_pred EEecCceEEEEcCcce------eccccEEEEccCCeEEEEEEEeCCEEEEEEecCCeeEEEEecCceEEEeccccccccc Confidence 1122223344444322 22222210 1223445555667777776665554433322 2222221 Q ss_pred ceeEEEEeeeeE-------EEEEeecHHHhhcCch-hhHH-HHHHHHHHHHHHHHHHHHHHhhhccc-C-CCcccccccc Q lcl|Aclame:pro 69 TFAPVTAIPRKV-------QVTQRFSQEVKWADES-RQLG-VLQTMADLSGVALGRALDLIGIHGIN-P-LTGAALSGSP 137 (311) Q Consensus 69 ~~~~v~l~~~kl-------~~~i~iS~ell~~s~~-~~~~-~~~~i~~~la~~ia~~~d~~~l~G~~-~-~~g~~~~~~~ 137 (311) ..+.-..+...+ .-.+.||.-....... -..| ++.+..+.+-+ +..+|+++|+|.. . ....++.-.. T Consensus 145 ~~ta~~~k~~~vsNvtQIF~~avsvSgTaqAs~~q~Gvsn~~ese~drk~~~--av~iEkalI~G~~~~~~~~~g~~R~m 222 (418) T protein:vir:10 145 RPTARSIQPVYVPNFTQIFRNAWALTDTARASYAEAGYSNITESRRDCMDFH--ATEQETAIFFGQAFMGTYNGQPLHTT 222 (418) T ss_pred cCCcceecceeccchhhhhhhhhhhhhhhhhccccccCchHHHHHHHHHHHH--HHHHHHHHhcccccCCCcCCcchhhH Confidence 111111111211 1222333332110000 0001 23333333333 3488999999942 1 1122222223 Q ss_pred cccccc------ccceeeccccccchHHHHHHHHHHHhh----cCCCcc----EEEEcHHHHHHHHHhhccCCceeeccc Q lcl|Aclame:pro 138 AKILDT------TNIVELTTGTSATPDLAVEAAVGLVLG----DNLSPD----GVALDNTFSFMLATQRDSQGRKLYPEL 203 (311) Q Consensus 138 ~~~~~~------~~~~~~~~~~~~~~~~~i~~~~~~~~~----~~~~~~----~~v~n~~~~~~l~~lkd~~g~~~~~~~ 203 (311) .|+.+. .+++... ......++.+.+++..... .+.+.. ...++++....|.++- +..-+. T Consensus 223 ~GIl~~vr~~~~gnVv~a~-~~t~~s~d~l~~a~~~af~~g~~~G~~~q~~~f~~~V~~~~k~~I~k~~---~~I~~~-- 296 (418) T protein:vir:10 223 QGIVDAVRQYAPDNVNAMP-NPTAVTYDDVVDATIDAFKWSVNVGDNTQRVMFCDTVGMRTMQDIGRFF---GEVTVT-- 296 (418) T ss_pred HHHHHHHhhhcccceeccC-CCCccCHHHHHHHHHHHhhccCCCcccccceeEEEEeChHHHHHhhhhh---hheeec-- Confidence 444322 2333332 2234456666666655422 222221 2566888888887763 221111 Q ss_pred cccCCCceecceeEEeecccccccccccccc-cccccccccceEEEeecceEEEEee--cCceEEEeccCC-----cccc Q lcl|Aclame:pro 204 GFGTDVASFAGLNAAVSDTVRGGPEAVTAST-GVYRTTNPNVKAIAGDFSAFRWGVQ--VSIPLELIEFGD-----PDGL 275 (311) Q Consensus 204 ~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~gd~~~~~~~~~--~~~~i~~~~~~~-----~~~~ 275 (311) ..-+-.|+-|.-.++ +...+....+ .......+.+.+++-|...+.+.+- +++..+...... .... T Consensus 297 ----~~e~~~G~vv~~~~~--~~G~I~L~~~p~~~~~~lp~g~mlVvD~~~vkL~~L~~R~~~~E~l~k~G~~~~~~~~~ 370 (418) T protein:vir:10 297 ----QRETSYGMVFTEWKF--FKGRLILKEHPLFSAIGISPGFAVVVDVPAVKLAYMDGRNAKVENYGQGGGENKSGATD 370 (418) T ss_pred ----ccceeeeEEEEEEEc--ceEEEEeecccccccccCCCceEEEEccccceEEEeccccccchhcccCCCcccccccc Confidence 111222333321111 1111111111 1112234556888889888777666 665555442211 0000 Q ss_pred -----hhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 276 -----GDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 276 -----~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ..-.+.+++ .-.+..++++|++.++++.---+ T Consensus 371 ~~~~~~~D~~kG~i----v~E~tLe~~N~~a~avitgl~~~ 407 (418) T protein:vir:10 371 YSYGHGVDAQGGSL----TSEWALELLNPQGCAVITGLQKA 407 (418) T ss_pred cccccccccccceE----EEEeeeeeecccceEEeecccee Confidence 000122333 45677889999999999843222 No 242 >protein:vir:63741 Length: 468 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547622;genbank:GeneID:3783474 Probab=46.89 E-value=0.72 Score=21.37 Aligned_cols=289 Identities=10% Similarity=0.026 Sum_probs=116.6 Q ss_pred CcccCCC-----------ceEcchhHHHHHHHHHHhhchhhhhcc---eeecCCCceEEEEEeC---CceeEEeecCccc Q lcl|Aclame:pro 1 MVALATG-----------TFQLPKHLVPGVWQKAQGQSVLARLSM---AEPQEFGEQQYMTLTA---PPRGEVVGEGAQK 63 (311) Q Consensus 1 mat~~~g-----------~~~vP~~~~~~ii~~~~~~s~l~~l~~---~~~~~~~~~~~p~~~~---~~~a~~v~Eg~~~ 63 (311) |-+..+| +.+--+.+..+|..+......+. +.+ ..+..+-.-+|-.... -..+.+++|++.+ T Consensus 23 ~Ks~~agy~~~p~~q~~~~AlR~EsL~~~i~~L~~~~~~f~-~~~di~k~~a~stv~~y~~~~~~G~~g~~~f~~E~g~~ 101 (468) T protein:vir:63 23 LKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLT-FYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVA 101 (468) T ss_pred HHHHHcCcccCCccccCcchhhhhhhhhhhheeeecccchh-hhhhcccchhhhhhhhheeeeccCcccccccccccccc Confidence 2222222 33333444444443333222221 111 2233322223333332 2457799999999 Q ss_pred cccccceeEEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccc--c-ccccc Q lcl|Aclame:pro 64 SESTATFAPVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALS--G-SPAKI 140 (311) Q Consensus 64 ~~~~~~~~~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~--~-~~~~~ 140 (311) +.+++.+.......+=++....+|.-.=+... ..+.++...+.....+++.++.++|+|+......+.+ + ...|+ T Consensus 102 ~~~~~~~~r~~~~~k~l~~~~~vs~~~~l~n~--i~d~~~~~~~~ai~~~a~tiE~a~FyGds~l~~s~~~~~glqfDGi 179 (468) T protein:vir:63 102 PVSDPNIRQKTVNMKFASDTKNISIAAGLVNN--IQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGL 179 (468) T ss_pred ccCCCceEEEEEEeeeeeeeeeehhhhhhhcc--hhhHHHHHHHHHHHHHHHHHHHHhhhcccccccCCCccccccccce Confidence 99999999999998888887666655332222 3456677888888899999999999997654222111 1 12222 Q ss_pred cccccceeec-cccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHH-HhhccCCceeeccccccCCCceecceeEE Q lcl|Aclame:pro 141 LDTTNIVELT-TGTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLA-TQRDSQGRKLYPELGFGTDVASFAGLNAA 218 (311) Q Consensus 141 ~~~~~~~~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~-~lkd~~g~~~~~~~~~~~~~~~l~G~pv~ 218 (311) ..-.+...+- ..+.....+++..+..........++-+.|+......|. .....+=+ +. .+.......|+||- T Consensus 180 ~~li~~enviDa~G~~ls~~~lneaa~~i~~gfG~~td~~~~~~v~a~~~~~~L~~q~~-v~----~~n~~~~~~G~~v~ 254 (468) T protein:vir:63 180 AKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQTQ-LV----RDNGNNVSVGFNIQ 254 (468) T ss_pred eEEecCCceeccCCCccCHHHHHHHhhhccccccChhhhhcchhHHhhhhhhhcCceEE-EE----cCCCCceeeeeccc Confidence 2222211111 112223344555555555455556666888888887773 21111111 10 01111222333331 Q ss_pred eecccccccccccccccccccccccceEEEeecceEEEEeecCceEEEec-cCCc---ccchhhhhc---CcEEEEEEEE Q lcl|Aclame:pro 219 VSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIE-FGDP---DGLGDLKRQ---NQIAIRAEVV 291 (311) Q Consensus 219 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~-~~~~---~~~~~~f~~---~~v~~ra~~r 291 (311) ..+.....+.. ....+.++...+.-.. ........+ .... ..+...|.. ....||+... T Consensus 255 --g~~sa~G~I~l-----------~gs~il~~~~~l~~~~-~~~~~Apsp~~vsaT~~~~~~g~~~~~~~a~y~Y~v~~v 320 (468) T protein:vir:63 255 --GFHSARGFIKL-----------HGSTVMENEQILDERI-LALPTAPQPAKVTATQEAGKKGQFRAEDLAAHEYKVVVS 320 (468) T ss_pred --ceecceeeeee-----------cCceeeccccCCCccc-ccccccccCCccceeeecccCCcccCCCcceEEEEEEEE Confidence 11110000000 0011122211110000 000000000 0000 000000000 0011222111 Q ss_pred eccEEecccceEEEEecccC Q lcl|Aclame:pro 292 YGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 292 ~~~~v~~~~a~~~l~~aa~~ 311 (311) -+.+=.-|...+-++.++.. T Consensus 321 s~~GES~pS~~vtvTVaa~~ 340 (468) T protein:vir:63 321 SDDAESIASEVATATVTAKD 340 (468) T ss_pred CCCCccccccceEEEecCcc Confidence 11111112222222222211 No 243 >protein:vir:80491 Length: 467 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468466;genbank:gi:157325041;genbank:GeneID:5601449 Probab=46.74 E-value=0.72 Score=21.35 Aligned_cols=289 Identities=10% Similarity=0.013 Sum_probs=117.5 Q ss_pred Cc--------ccCCCceEcchhHHHHHHHHHHhhchhhhhcc---eeecCCCceEEEEEeC---CceeEEeecCcccccc Q lcl|Aclame:pro 1 MV--------ALATGTFQLPKHLVPGVWQKAQGQSVLARLSM---AEPQEFGEQQYMTLTA---PPRGEVVGEGAQKSES 66 (311) Q Consensus 1 ma--------t~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~~---~~~~~~~~~~~p~~~~---~~~a~~v~Eg~~~~~~ 66 (311) |- +-..|+.+--+.+..+|..+......+. +.+ ..+..+-.-+|-.... -..+.+++|++.++.+ T Consensus 25 ~~agy~~~p~tq~~~~AlR~EsL~~~i~~Lt~~~~~f~-~~~di~k~~a~stv~~y~~~~~~G~~g~~~f~~E~g~~~~~ 103 (467) T protein:vir:80 25 FTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLT-FYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAPVS 103 (467) T ss_pred HHcccccCCccccCcchhhhhhhhhhhheeeccccchh-hhhhcccchhhhhhhhheeeeccCccccccccccccccccC Confidence 21 1112333444445555544443333321 222 2233322223333332 2457899999999999 Q ss_pred ccceeEEEEeeeeEEEEEeecHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCcccc--cc-cccccccc Q lcl|Aclame:pro 67 TATFAPVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAAL--SG-SPAKILDT 143 (311) Q Consensus 67 ~~~~~~v~l~~~kl~~~i~iS~ell~~s~~~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~--~~-~~~~~~~~ 143 (311) ++.+.......+=++....+|.-.=+... ..|.++...+.....+++.++.++|+|+......+. .+ ...|+..- T Consensus 104 ~~~~~r~~~~~k~l~~~~~vs~~~~l~n~--i~d~~~~~~~~ai~~~a~tiE~a~FyGds~l~~s~~~~~glqfDGi~~l 181 (467) T protein:vir:80 104 DPNIRQKTVNMKFASDTKNISIAAGLVNN--IQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKL 181 (467) T ss_pred CCceEEEEEEeeeeeeeeeehhhhhhhcc--hhhHHHHHHHHHHHHHHHHHHHHhhhcccccccCCCccccccccceeEE Confidence 99999999998888887666655332222 345667788888889999999999999765422211 11 12222222 Q ss_pred ccceeec-cccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHH-HhhccCCceeeccccccCCCceecceeEEeec Q lcl|Aclame:pro 144 TNIVELT-TGTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLA-TQRDSQGRKLYPELGFGTDVASFAGLNAAVSD 221 (311) Q Consensus 144 ~~~~~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~-~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~ 221 (311) .+...+- ..+.....+++..+..........++-+.|+......|. .....+=+ +. .+.......|+||- . T Consensus 182 i~~enviDa~G~~ls~~~lneaa~~i~~gfG~~td~~~p~~v~a~~~~~~L~~q~~-v~----~~n~~~~~~G~~v~--g 254 (467) T protein:vir:80 182 INQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQTQ-LV----RDNGNNVSVGFNIQ--G 254 (467) T ss_pred ecCCceeccCCCccCHHHHHHHhhhccccccChhhhhcchhHHhhhhhhhcCceEE-EE----cCCCCceeeeeccc--c Confidence 2211111 112223344555555555455556666888888887773 21111111 10 01111222333331 1 Q ss_pred ccccccccccccccccccccccceEEEeecceEEEEeecCceEEEec-cCCc---ccchhhhhc---CcEEEEEEEEecc Q lcl|Aclame:pro 222 TVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIE-FGDP---DGLGDLKRQ---NQIAIRAEVVYGI 294 (311) Q Consensus 222 ~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~-~~~~---~~~~~~f~~---~~v~~ra~~r~~~ 294 (311) .+.....+.. ....+.++...+.-.. ........+ .... ..+...|.. ....||+...-+. T Consensus 255 ~~sa~G~I~l-----------~gs~il~~~~~l~~~~-~~~~~Apsp~~vsaT~~~~~~g~~~~~~~a~y~Y~v~~vs~~ 322 (467) T protein:vir:80 255 FHSARGFIKL-----------HGSTVMENEQILDERI-LALPTAPQPAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDD 322 (467) T ss_pred eecceeeeee-----------cCceeeccccCCCccc-ccccccccCCccceeeecccCCcccCCCcceEEEEEEEECCC Confidence 1110000000 0011122221110000 000000000 0000 000000000 0011222111111 Q ss_pred EEecccceEEEEecccC Q lcl|Aclame:pro 295 GIMSTDAFAVVRDADES 311 (311) Q Consensus 295 ~v~~~~a~~~l~~aa~~ 311 (311) +=.-|...+-++.++.. T Consensus 323 GES~pS~~vtvTVaa~~ 339 (467) T protein:vir:80 323 AESIASEVATATVTAKD 339 (467) T ss_pred CccccccceEEEecCcc Confidence 11112222222222211 No 244 >protein:vir:100603 Length: 529 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656387;genbank:gi:109290138;genbank:GeneID:4156581 Probab=44.71 E-value=0.8 Score=21.13 Aligned_cols=280 Identities=13% Similarity=0.064 Sum_probs=115.8 Q ss_pred Cc-cc----------------CCCceEcchhHHHHHHHHH---HhhchhhhhcceeecCCCceEE-------EEEeC--- Q lcl|Aclame:pro 1 MV-AL----------------ATGTFQLPKHLVPGVWQKA---QGQSVLARLSMAEPQEFGEQQY-------MTLTA--- 50 (311) Q Consensus 1 ma-t~----------------~~g~~~vP~~~~~~ii~~~---~~~s~l~~l~~~~~~~~~~~~~-------p~~~~--- 50 (311) |+ +. +++.+ ..+.+.++.++ -+..+..+++.+.||.+++--| +.... T Consensus 63 l~e~~~~~~~~~~~~~ia~s~~t~~v---~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~ 139 (529) T protein:vir:10 63 LMEAEVAGDHGYDPTNIAAGQSSGAI---TNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAG 139 (529) T ss_pred cchhhccccccccccccccccccccc---ccccchhhhhHHHHHHhHHhhhhheeccCCchhhhhhhheeeecCCcCCCc Confidence 22 11 11111 11223333333 3555666666666666543211 00000 Q ss_pred C--------------------------------------------ceeEE------------------------------ Q lcl|Aclame:pro 51 P--------------------------------------------PRGEV------------------------------ 56 (311) Q Consensus 51 ~--------------------------------------------~~a~~------------------------------ 56 (311) + ....| T Consensus 140 g~eaf~~~~e~dt~~SG~~~~~~~~~~~~~~~~~~t~~~a~~~~~~~~~~~nea~t~~s~~~tg~~~~~g~~~tg~~~~~ 219 (529) T protein:vir:10 140 AKEAFHPMYAPDAWHSGLAAKGATTSSDGTPFAALTAGQAVATGDIVYHFFYESGSAYLQNVTGGNVTVGTNETGAALDA 219 (529) T ss_pred ccccccccccccccccccccccccccccccccccccccceeeccccceeeecccccccccccccccccccccccCCcccc Confidence 0 00000 Q ss_pred ---------------------eec---------CccccccccceeEEEEeeeeEEEEEeecHHHhhcCch-hhHHHHHHH Q lcl|Aclame:pro 57 ---------------------VGE---------GAQKSESTATFAPVTAIPRKVQVTQRFSQEVKWADES-RQLGVLQTM 105 (311) Q Consensus 57 ---------------------v~E---------g~~~~~~~~~~~~v~l~~~kl~~~i~iS~ell~~s~~-~~~~~~~~i 105 (311) .+| +.++++-..+++.+++.++.-+-...+|-||.||=-. -.+|.+++| T Consensus 220 ~~~~~~a~~~~~~~~~gmsTa~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAvHGLDAEtEL 299 (529) T protein:vir:10 220 LVSAKIAAGELAEIAEGMATSIAELRQGFNGTTDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSEL 299 (529) T ss_pred ccccccccccccccccccchhhhhccccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHH Confidence 011 0112333344455555444444455789999987533 247889999 Q ss_pred HHHHHHHHHHHHHHHhhhcccCCCc------cccccccccccccccceeecccccc-----chHHHHHHHHHHHhh--cC Q lcl|Aclame:pro 106 ADLSGVALGRALDLIGIHGINPLTG------AALSGSPAKILDTTNIVELTTGTSA-----TPDLAVEAAVGLVLG--DN 172 (311) Q Consensus 106 ~~~la~~ia~~~d~~~l~G~~~~~g------~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~i~~~~~~~~~--~~ 172 (311) ..-|+..|..+|++.+|.=.+...- +...+...++.+..........-.. ..+--+......+.. .. T Consensus 300 sNILStEImlEINReii~~i~~~a~~~~~g~~~~~~~~~gv~d~~~~~d~~~~~~~~e~~~~L~~~i~~~an~I~~~T~r 379 (529) T protein:vir:10 300 NGILANEVMLEINREVIDWINYTAQVGKSGWTQTVGSAAGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGR 379 (529) T ss_pred HHHHHHHHHHHhhHHHHHHhhhhceeeeeeeeccccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhcc Confidence 9999999999999999862111000 0111112233333222111000000 111122222222222 22 Q ss_pred CCccEEEEcHHHHHHHHHh--hccCC----ceeeccccccCC-Ccee-cceeEEeecccccccccccccccccccccccc Q lcl|Aclame:pro 173 LSPDGVALDNTFSFMLATQ--RDSQG----RKLYPELGFGTD-VASF-AGLNAAVSDTVRGGPEAVTASTGVYRTTNPNV 244 (311) Q Consensus 173 ~~~~~~v~n~~~~~~l~~l--kd~~g----~~~~~~~~~~~~-~~~l-~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~ 244 (311) ...+.++++++....|... ++.-+ .--|..+.+... .|.| .|++|++..+.+.. T Consensus 380 g~~n~vi~S~~Va~~L~~~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~d------------------ 441 (529) T protein:vir:10 380 GAGNFIIASRNVVSALALVDAGITPAAQGMASGLNADTTKGVFAGVLGGRYKVYIDQYARQD------------------ 441 (529) T ss_pred ccceEEEEchHHHHHHhhhccccccccccccccceeecCCceEEEEecCceEEEecCCCCcc------------------ Confidence 3456789999998888642 21111 111222222222 3444 45688887776543 Q ss_pred eEEEeecceE----EEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEE-ecccC Q lcl|Aclame:pro 245 KAIAGDFSAF----RWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVR-DADES 311 (311) Q Consensus 245 ~~~~gd~~~~----~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~-~aa~~ 311 (311) .+++|-.... .+.+..-+.+...+-.+++. || -.+.++ .|++.. ++| |+.-+ .+.++ T Consensus 442 y~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~s----fq-P~~g~~--tRY~l~-~NP--~~~~~~~~~~~ 503 (529) T protein:vir:10 442 YFTMGYRGANNLDAGIYYCPYVALTPLRGSDPKN----FQ-PVMGFK--TRYAIG-VNP--FAESRTQAPTS 503 (529) T ss_pred eEEEEEeCCcccccceeeccccccccccccCCCc----cc-ceeeee--eeecee-ecC--ccccccccccc Confidence 2222221100 01111222222222223332 43 234443 466654 344 33322 22122 No 245 >protein:vir:2736 Length: 348 # NCBI annotation: putative structural protein # Family: family:all:1083 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695109;genbank:gi:23455878;genbank:GeneID:955608 Probab=44.37 E-value=0.81 Score=21.09 Aligned_cols=293 Identities=10% Similarity=0.016 Sum_probs=121.9 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhch-hh-hhcceeecCCCceEEEEEeC-Cc-eeEEeecCcccccc-ccceeEEEE Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSV-LA-RLSMAEPQEFGEQQYMTLTA-PP-RGEVVGEGAQKSES-TATFAPVTA 75 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~-l~-~l~~~~~~~~~~~~~p~~~~-~~-~a~~v~Eg~~~~~~-~~~~~~v~l 75 (311) |+... .++-|.++..-|.+.-.+... +. .+++..++..-...+..... .. .+.++..+.+.+.. .-.++..++ T Consensus 1 M~~i~--d~f~~~~l~~~v~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~r~~~~~~~~ 78 (348) T protein:vir:27 1 MGLIY--DKVTASNIAGYFNALQENVSSTLGESIFPARKQLGTKLSYIKGASGQSVALKAAAFDTNVTIRDRVSAEMHDE 78 (348) T ss_pred Ccchh--hhcCHHHHHHHHHhccchhhhhhHhhcCCCccccceeEEEEeeccCceeEeeeecCCCCcceecccceeeeee Confidence 88764 556666664444443333333 32 34555544433333333222 22 35678777665543 344666666 Q ss_pred eeeeEEEEEeecHHHhh------cCc-hhh-HHHHHHH---HHHHHHHHHHHHH----HHhhhcccCCCccccc-----c Q lcl|Aclame:pro 76 IPRKVQVTQRFSQEVKW------ADE-SRQ-LGVLQTM---ADLSGVALGRALD----LIGIHGINPLTGAALS-----G 135 (311) Q Consensus 76 ~~~kl~~~i~iS~ell~------~s~-~~~-~~~~~~i---~~~la~~ia~~~d----~~~l~G~~~~~g~~~~-----~ 135 (311) .+-.++-...++.+=++ ... .+. -.+...+ ...+.+++.+.+| +++.+|.-...+.+.. + T Consensus 79 ~~p~i~~~~~i~~~d~~~~~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~~al~~Gki~i~~~~~~~~vdfg 158 (348) T protein:vir:27 79 QMPFFKEAMLVKENDRQQLNLVKDSGNAVLVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFTSDGVNKDIDYG 158 (348) T ss_pred ecCccccccccCHHHHHHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEEecCCeeEEEeec Confidence 66666655566543221 110 000 0111111 1222333333333 3444441111111111 1 Q ss_pred ccccccccccceeeccccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHH---hhccC----Cc-eeeccccccC Q lcl|Aclame:pro 136 SPAKILDTTNIVELTTGTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLAT---QRDSQ----GR-KLYPELGFGT 207 (311) Q Consensus 136 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~---lkd~~----g~-~~~~~~~~~~ 207 (311) .+....-+ .....+....+...+|.+....+...+..++.++|++..|..|++ .++.- +. ....+..... T Consensus 159 ~~~~~~~t--~~~~W~~~~adp~~di~~~~~~~~~~G~~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~~~i~~~~~~~ 236 (348) T protein:vir:27 159 VKPDHKKQ--VSKSWAEPGATPLADLEDAIETARELGLNPERAVMNAKTFGLIRKAASTVKVIKPLAGDGSAVTKAELEN 236 (348) T ss_pred CCccccee--eeeccCCCCCCHHHHHHHHHHHHHhcCCcccEEEECHHHHHHHhcCHHHHHHhcccCccccccCHHHHHH Confidence 11111000 001122345567788888887787788899999999999999865 33211 11 0111111111 Q ss_pred CCceecceeEEeec-ccccccccccccccccccccccceEEEeecc---eEEEEe-----------ecCce-------EE Q lcl|Aclame:pro 208 DVASFAGLNAAVSD-TVRGGPEAVTASTGVYRTTNPNVKAIAGDFS---AFRWGV-----------QVSIP-------LE 265 (311) Q Consensus 208 ~~~~l~G~pv~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~---~~~~~~-----------~~~~~-------i~ 265 (311) --+++.|.++.+-+ .+... .+......+...+++.--. ...++- ..... +. T Consensus 237 ~~~~~~g~~i~~yd~~y~d~-------~G~~~~~~p~~~vvl~~~~~~G~~~yG~~~e~~~~~~~~~~~~~~~~~~~~~~ 309 (348) T protein:vir:27 237 YIADNFGVSIVLENGTYRND-------KGEVSKFYPDGHLTLIPNGPLGNTVFGTTPEESDLFADNTVNAEVEIVDNGIA 309 (348) T ss_pred HHHhhcCceEEEEeeEEEcC-------CCcCcccccCCeEEEEcCCcceeEEeccCcchhhhhhccccccceeeeCCeeE Confidence 11344566664322 22111 0000011111222211111 111110 00000 00 Q ss_pred EeccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 266 LIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 266 ~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) +..+.+.|. -...+++..+.=-.+.+|+++.++|.-++- T Consensus 310 ~~~~~~~dP-------~~~~~~~~s~~lPv~~~~~~~~~a~Vl~~~ 348 (348) T protein:vir:27 310 VTTTKTTDP-------VNVQTKVSMVALPSFERLDDVYMLTVIPAV 348 (348) T ss_pred EEeeecCCC-------ceEEEEEeeeeeccccCCCcEEEEEEecCC Confidence 111111000 023444555555667778888887755555 No 246 >protein:vir:7214 Length: 521 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049787;genbank:gi:9632597;genbank:GeneID:1258751 Probab=43.24 E-value=0.85 Score=20.96 Aligned_cols=282 Identities=12% Similarity=0.061 Sum_probs=105.8 Q ss_pred Ccc---cCCCceEcchhH-HH-HHHHHHHhhchhhhhcceee--------cCCCceEEEEEeCCceeEEeec-------- Q lcl|Aclame:pro 1 MVA---LATGTFQLPKHL-VP-GVWQKAQGQSVLARLSMAEP--------QEFGEQQYMTLTAPPRGEVVGE-------- 59 (311) Q Consensus 1 mat---~~~g~~~vP~~~-~~-~ii~~~~~~s~l~~l~~~~~--------~~~~~~~~p~~~~~~~a~~v~E-------- 59 (311) ++. ...|...-.... .. ..........+...-..... +..+. .+..+..-..-.+| T Consensus 166 ~~~~~~~a~Gd~~~~~~~~~gt~~~~~~~~~~~~~g~t~~~~t~~~v~~~~~a~~---~y~~g~gm~Ta~aEal~~~g~s 242 (521) T protein:vir:72 166 LAASTQTTVGDIYTHFFQETGTVYLQASVQVTIDAGATDAAKLDAEIKKQMEAGA---LVEIAEGMATSIAELQEGFNGS 242 (521) T ss_pred cccccccccccccccccccccccccccccccccCCCCCCccccccccccccccCc---eeeeecccchhhhhhhcccCCc Confidence 110 001111000000 00 00000000000000000000 00000 00011111111222 Q ss_pred -CccccccccceeEEEEeeeeEEEEEeecHHHhhcCch-hhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccccc- Q lcl|Aclame:pro 60 -GAQKSESTATFAPVTAIPRKVQVTQRFSQEVKWADES-RQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGS- 136 (311) Q Consensus 60 -g~~~~~~~~~~~~v~l~~~kl~~~i~iS~ell~~s~~-~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~- 136 (311) +...++-..+++.+++.++.-+-...+|-||.||--. -.+|.+++|..-|+..|...|++.+|.=.+-..--+..+. T Consensus 243 s~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~g~~g~t 322 (521) T protein:vir:72 243 TDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKSGMT 322 (521) T ss_pred ccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhheeeeeeeeee Confidence 1234444455566666555555556889999987543 2578899999999999999999999843211100011111 Q ss_pred -----cccccccccceeeccccccc-----hHHHHHHHHHHHh--hcCCCccEEEEcHHHHHHHHHhh--c---cCC-ce Q lcl|Aclame:pro 137 -----PAKILDTTNIVELTTGTSAT-----PDLAVEAAVGLVL--GDNLSPDGVALDNTFSFMLATQR--D---SQG-RK 198 (311) Q Consensus 137 -----~~~~~~~~~~~~~~~~~~~~-----~~~~i~~~~~~~~--~~~~~~~~~v~n~~~~~~l~~lk--d---~~g-~~ 198 (311) ..|+.+..........-... .+--+......+. ...+..+.++++++....|...- | ++| .- T Consensus 323 ~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~ 402 (521) T protein:vir:72 323 LTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQGLAT 402 (521) T ss_pred eccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEchHHHHHHhhcccccccccccccc Confidence 12322222111111111000 1111222222222 23356677999999988887521 1 111 11 Q ss_pred eeccccccCC-Ccee-cceeEEeecccccccccccccccccccccccceEEEeecceE----EEEeecCceEEEeccCCc Q lcl|Aclame:pro 199 LYPELGFGTD-VASF-AGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAF----RWGVQVSIPLELIEFGDP 272 (311) Q Consensus 199 ~~~~~~~~~~-~~~l-~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~----~~~~~~~~~i~~~~~~~~ 272 (311) -|..+.++.. .|.| .|++|++..+.+.+ .+++|-.... .+.+..-+.+...+-.++ T Consensus 403 g~~~d~~~~~~~G~l~~~~~vy~D~y~~~d------------------y~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp 464 (521) T protein:vir:72 403 GFSTDTTKSVFAGVLGGKYRVYIDQYAKQD------------------YFTVGYKGPNEMDAGIYYAPYVALTPLRGSDP 464 (521) T ss_pred cccccCCCceEEEEccCceEEEecCCCCcc------------------eEEEEEeCCcccccceeeccccccccccccCC Confidence 1322222221 2343 45688887776543 2222221100 011111122222222333 Q ss_pred ccchhhhhcCcEEEEEEEEeccEEecccc-------eEEEEecccC Q lcl|Aclame:pro 273 DGLGDLKRQNQIAIRAEVVYGIGIMSTDA-------FAVVRDADES 311 (311) Q Consensus 273 ~~~~~~f~~~~v~~ra~~r~~~~v~~~~a-------~~~l~~aa~~ 311 (311) +. || -.+.++ .|++..+ +|=+ .++|+...=. T Consensus 465 ~s----fq-P~~g~~--tRY~l~~-NP~~~~~~~~~a~~i~~~~~~ 502 (521) T protein:vir:72 465 KN----FQ-PVMGFK--TRYGIGI-NPFAESAAQAPASRIQSGMPS 502 (521) T ss_pred cc----cc-ceeeee--eeeceee-cCcccccCcccceeecCcChh Confidence 32 43 234443 4666543 3311 1223211111 No 247 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=43.15 E-value=0.86 Score=20.95 Aligned_cols=281 Identities=9% Similarity=-0.043 Sum_probs=121.0 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhh-------hcceeecCCCceEEEEEeC--CceeEEeecCccc-cccccce Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLAR-------LSMAEPQEFGEQQYMTLTA--PPRGEVVGEGAQK-SESTATF 70 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~-------l~~~~~~~~~~~~~p~~~~--~~~a~~v~Eg~~~-~~~~~~~ 70 (311) |+ ..- -+.+...+.+.+...+.-.. ...+.-.++.+++||+.+. +..-+-..-|-.. ..-+.++ T Consensus 1 Ma-iny-----a~~~~~~Ld~~~~~~~lts~~l~~~~~~~~v~~~ggktVkIp~is~tsGl~DY~R~~g~~~~g~v~~~~ 74 (346) T protein:vir:10 1 MT-INY-----AEKYQAAVQQAFYDGHLYSAELWNSPSNSIIKFDGAKHIKVPRLEITSGRKDRQRRTITTPVANYSNDW 74 (346) T ss_pred Cc-chh-----HHHHHHHHHHHHHhhhccchhhcccccccceEecCCCEEEEEEeeeecccccccccCCcccccccccce Confidence 44 221 24566677666655432111 1122334556799999862 3222222222211 2234455 Q ss_pred eEEEEeeeeEEEEEeecHHHhhcCchh--hHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCcccccccccccccccccee Q lcl|Aclame:pro 71 APVTAIPRKVQVTQRFSQEVKWADESR--QLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVE 148 (311) Q Consensus 71 ~~v~l~~~kl~~~i~iS~ell~~s~~~--~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~ 148 (311) ...+|..-+--.+ .| +.+ +.+++ ...+...+.+...+..+=.+|.-.|.-.-...+.. ...... T Consensus 75 et~tl~qDR~~~F-~v-D~m--DvDETn~~~~~anv~~ef~r~~vvPEiDayrfskLa~~a~~~---------~~~~~~- 140 (346) T protein:vir:10 75 DSYELKNERYWST-LV-DPS--DIDETNMVVSLANITKQFNLDSKMPEKDRYMFSHLYSGKEAA---------HDGGIT- 140 (346) T ss_pred eEEEeecccccee-cc-ccc--chHHHHHHhHHHHHHHHHHHHhhcchhhHHHHHHHHHhhhhh---------cccccc- Confidence 5555554332221 11 000 00111 11222223333333444456655442100000000 000001 Q ss_pred eccccccchHHHHHHHHHHHhhcCCCcc--EEEEcHHHHHHHHHhhccCCce-eeccccccCCCceecceeEEe--eccc Q lcl|Aclame:pro 149 LTTGTSATPDLAVEAAVGLVLGDNLSPD--GVALDNTFSFMLATQRDSQGRK-LYPELGFGTDVASFAGLNAAV--SDTV 223 (311) Q Consensus 149 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~v~n~~~~~~l~~lkd~~g~~-~~~~~~~~~~~~~l~G~pv~~--~~~~ 223 (311) ..+.+....++.++.++..+...++... .++++|.....|.+.+.-+... +.......+..++|.|+||+. ++.+ T Consensus 141 ~~a~T~~ni~~~i~~~~~~lde~~vp~~~rvl~vTp~~~~lLk~s~~f~k~~~v~~~~~i~~~V~siDGv~Ii~VPs~r~ 220 (346) T protein:vir:10 141 TNTLDEKNILPAFDNMMLDFDEARIPSTNRILYVTPKTNAILKRAEAMNRALTLKDPNNIQRTVYSLDDVTIRVVPSDLM 220 (346) T ss_pred ccccCHHHHHHHHHHHHHHHHHccCCCCCeEEEECHHHHHHHhhchhheeccccccccccceeeeeecCeEEEEcchhhc Confidence 1122345678889999999987776433 4778899998776543222111 111222355568999999963 4444 Q ss_pred ccccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccc-- Q lcl|Aclame:pro 224 RGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDA-- 301 (311) Q Consensus 224 ~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a-- 301 (311) +..-..... -.....+-+-..++... ...+...+.-.++++.-. .. ..|.-.+.-+.++|.=|.+.+. T Consensus 221 ~t~~~f~~G--~~~~t~ak~INfiiv~~-~A~ia~~K~~~~~if~P~-~~------~~g~~l~~~R~Y~D~fv~~nk~~~ 290 (346) T protein:vir:10 221 QTAYDFSDG--SKIIDTAKQIEMFLIYN-GVQIAPEKYSFVGFDQPS-AA------TSGNYLYYEQSYDDVLLLNTKTKG 290 (346) T ss_pred ccchhhccC--ccccCCccceeEEEECC-ceeeeeeeeeeeEeeCCC-CC------cccceeeeeeeeeeeeeeccccce Confidence 421100000 00011111122333332 334445555555555332 11 1222233345678888887654 Q ss_pred -eEEEEecccC Q lcl|Aclame:pro 302 -FAVVRDADES 311 (311) Q Consensus 302 -~~~l~~aa~~ 311 (311) ++-++.|.+. T Consensus 291 Iyv~~~~a~~~ 301 (346) T protein:vir:10 291 IQFVVSDKPKK 301 (346) T ss_pred EEEeeeccccc Confidence 2333333333 No 248 >protein:vir:98480 Length: 348 # NCBI annotation: ORFp38 # Family: family:all:1083 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958280;genbank:gi:41057254;uniprot:Q38595;genbank:GeneID:2732864 Probab=42.13 E-value=0.9 Score=20.84 Aligned_cols=290 Identities=12% Similarity=0.021 Sum_probs=120.6 Q ss_pred CcccCCCceEcchhHHHHHHHHHH-hh----chhhhhcceeecCCCceEEEEEeC-C-ceeEEeecCccccccc-cceeE Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQ-GQ----SVLARLSMAEPQEFGEQQYMTLTA-P-PRGEVVGEGAQKSEST-ATFAP 72 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~-~~----s~l~~l~~~~~~~~~~~~~p~~~~-~-~~a~~v~Eg~~~~~~~-~~~~~ 72 (311) |..+-.-.++-|.++ ..++..+. .. -.+-.+++.+++..-...+-.... . ..+.+++.+.+.+..+ ..++. T Consensus 1 M~~~~~~d~~~~~~l-~~~i~~~~~~~~~~~~l~~~~fp~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~r~g~~~ 79 (348) T protein:vir:98 1 MSWTLDTEFIEPTQL-TGLIREALRDLQVNRFRLARWLPNVDVDDITFEFLRGGGGLAETASYRSWDTESKIGRREGLAK 79 (348) T ss_pred CcchhhhhccCHHHH-HHHHHHHhhccCcchhhHHhcCCCccccceEEEEEeccCCceeeeeeecCCCccceeeccccee Confidence 775444455566666 44444332 11 123344555554433332222222 1 2356788777666544 34777 Q ss_pred EEEeeeeEEEEEeecHH-HhhcCchhhHHHHHHHH---HHHHHHHHHHHH----HHhhhcccCCCccccc---ccccccc Q lcl|Aclame:pro 73 VTAIPRKVQVTQRFSQE-VKWADESRQLGVLQTMA---DLSGVALGRALD----LIGIHGINPLTGAALS---GSPAKIL 141 (311) Q Consensus 73 v~l~~~kl~~~i~iS~e-ll~~s~~~~~~~~~~i~---~~la~~ia~~~d----~~~l~G~~~~~g~~~~---~~~~~~~ 141 (311) .++.+-.++-...++.+ +++......-.+...+. ..+.+++.+.+| +++.+|.-...+.+-. +.+.... T Consensus 80 ~~~~~~~i~~~~~i~~~d~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~~g~~~~vDyg~~~~~~ 159 (348) T protein:vir:98 80 VMGELPPISEKIPLNEYDRLRLRKLSRDEALPFIARDAQRLARNIGARFEVARGSALVNATVPVTELQQTVDFGRIGSHS 159 (348) T ss_pred eeeeccccccccccCHHHHHHhcCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEecCceEEccccCcccc Confidence 77777777666666654 22111111111222222 223334444443 4555552111111110 1111110 Q ss_pred ccccceeeccccccchHHHHHHHHHHHhh-cCCCccEEEEcHHHHHHHHH---hhcc-------CCceeeccccccCCCc Q lcl|Aclame:pro 142 DTTNIVELTTGTSATPDLAVEAAVGLVLG-DNLSPDGVALDNTFSFMLAT---QRDS-------QGRKLYPELGFGTDVA 210 (311) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~-~~~~~~~~v~n~~~~~~l~~---lkd~-------~g~~~~~~~~~~~~~~ 210 (311) - +.....+.....+...+|.+....+.. .+..++.++|++..|..|++ +++. +..++..+.....--. T Consensus 160 ~-t~~~~Ws~~~~adp~~di~~~~~~~~~~~G~~p~~~vm~~~~~~~l~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 238 (348) T protein:vir:98 160 V-VAAVLWSVHATATPISDLESWVATYEDTNGQSPGVILMPKAAVSHMRQCEEVIRQVFPLAPSGTAPMVSVEQLNTVLS 238 (348) T ss_pred c-ccccccCCCCCCCHHHHHHHHHHHHHHccCCcceEEEeCHHHHHHHhcCHHHHHHHhccCccccccccCHHHHHHHHH Confidence 0 011111122345677888888877765 57788999999999998863 3321 1112221111110011 Q ss_pred eeccee-EEeecccccccccccccccccccccccceEEEe-e-----------cceEEEEe-------------ecCceE Q lcl|Aclame:pro 211 SFAGLN-AAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAG-D-----------FSAFRWGV-------------QVSIPL 264 (311) Q Consensus 211 ~l~G~p-v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g-d-----------~~~~~~~~-------------~~~~~i 264 (311) . +|.| +.+-+..-...... ....+...+++. + +....++- ..+..+ T Consensus 239 ~-~g~~~i~~~d~~~~~~g~~-------~~~~p~~~i~l~p~~~~~~~~~~~~~G~t~~G~~~e~~~~~~~~~~~~~~~i 310 (348) T protein:vir:98 239 S-MGLPPIEVYDAKVAVDGVS-------TRITPANAIALLPEPGATDAAQPTELGATLLGTTAESLEDDYALAPGEQPGI 310 (348) T ss_pred h-hCCeEEEEeeeEEEcCCce-------eceecCCeEEEEecCCcccccccccccceecccchhhhccccccceeccCce Confidence 1 3443 33322111110000 000111112111 0 00000000 000000 Q ss_pred EEeccCCcccchhhhhcC--cEEEEEEEEeccEEecccceEEEEecc Q lcl|Aclame:pro 265 ELIEFGDPDGLGDLKRQN--QIAIRAEVVYGIGIMSTDAFAVVRDAD 309 (311) Q Consensus 265 ~~~~~~~~~~~~~~f~~~--~v~~ra~~r~~~~v~~~~a~~~l~~aa 309 (311) -+..+. +.| ...+++..+.=-.+.+|+++.+++.-+ T Consensus 311 ~~~~~~---------~~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl~ 348 (348) T protein:vir:98 311 VAATWK---------TKDPVRLWTHAAAVGIPVLREPNLTFKAQVLA 348 (348) T ss_pred eeeeee---------ecCCcEEEEEEeeeeeccccCCCcEEEEEEeC Confidence 000010 111 344555566556667889988888766 No 249 >protein:vir:96490 Length: 348 # NCBI annotation: head protein # Family: family:all:1083 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238492;genbank:gi:66391768;genbank:GeneID:5176912 Probab=31.45 E-value=1.5 Score=19.62 Aligned_cols=301 Identities=9% Similarity=-0.034 Sum_probs=122.2 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhch-h-hhhcceeecCCCceEEEEEe-CCc-eeEEeecCcccccc-ccceeEEEE Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSV-L-ARLSMAEPQEFGEQQYMTLT-APP-RGEVVGEGAQKSES-TATFAPVTA 75 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~-l-~~l~~~~~~~~~~~~~p~~~-~~~-~a~~v~Eg~~~~~~-~~~~~~v~l 75 (311) |+... .++-+.++..-|-+.-.+... + ..+++..++..-...+.... ... .+.++..+.+.+.. ...++...+ T Consensus 1 M~~i~--d~f~~~~l~~~i~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~r~~~~~~~~ 78 (348) T protein:vir:96 1 MGLIY--DKVTASNIAGYFNTLQENVDSTLGESIFPARKQLGTKLSYIKGASGQSVALKAAAFDTNVTIRDRVSAEIHDE 78 (348) T ss_pred Ccchh--hccCHHHHHHHHHhcccchhhhhhhhcCCCccccceeEEEEeecCCceeEeeeecCCCCcceecccceeeeee Confidence 87664 355556554433332222322 3 24556555554333333322 223 36688887666643 345677777 Q ss_pred eeeeEEEEEeecHHHh------hcC-chhh-HHHHHHHHH---HHHHHHHHHHH----HHhhhcccCCCccccc-----c Q lcl|Aclame:pro 76 IPRKVQVTQRFSQEVK------WAD-ESRQ-LGVLQTMAD---LSGVALGRALD----LIGIHGINPLTGAALS-----G 135 (311) Q Consensus 76 ~~~kl~~~i~iS~ell------~~s-~~~~-~~~~~~i~~---~la~~ia~~~d----~~~l~G~~~~~g~~~~-----~ 135 (311) .+-.++-...++.+=+ ..+ .++. -.+...+.+ .+.+.+.+.+| +++.+|.-...+.+.. + T Consensus 79 ~~p~i~~~~~i~~~d~~~l~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~~~~~~~~~vdfg 158 (348) T protein:vir:96 79 QMPFFKEALLVKENDRQQLNLVKDTGNEALINTIVAGIFNDDVTLINGARARLEAMRMQVLATGKIAFTSDGVNKDIDYG 158 (348) T ss_pred ecCccccccccCHHHHHHHHhhhccCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEeecCCeeEEEecc Confidence 6666666555543211 111 1110 112222221 22233433343 3344442111111111 1 Q ss_pred ccccccccccceeeccccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHH---hhc----cCCce-eeccccccC Q lcl|Aclame:pro 136 SPAKILDTTNIVELTTGTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLAT---QRD----SQGRK-LYPELGFGT 207 (311) Q Consensus 136 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~---lkd----~~g~~-~~~~~~~~~ 207 (311) .+....-+ ... ..+....+...+|......+...+..++.++|++..|..|++ +++ .++.. ...+..... T Consensus 159 ~~~~~~~t-~~~-~W~~~~adp~~di~~~~~~~~~~G~~~~~~i~~~~~~~~l~~~~~v~~~~~~~~~~~~~~~~~~~~~ 236 (348) T protein:vir:96 159 VKADHKKQ-VSK-SWAEPGATPLADLEDAIETARELGLNPERAIMNAKTFGLIRKAASTVKAIKPLAGDGSSVTKAELQN 236 (348) T ss_pred CCccccee-ecc-ccCCCCCCHHHHHHHHHHHHHhcCCcccEEEeCHHHHHHHhcCHHHHHHHhccCCccccccHHHHHH Confidence 11110000 001 122335567788888887787788889999999999998864 332 11111 111111111 Q ss_pred CCceecceeEEeecccccccccccccccccccccccceEEEeecc---eEEEEee-cCceEEEeccCCcc-------cch Q lcl|Aclame:pro 208 DVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFS---AFRWGVQ-VSIPLELIEFGDPD-------GLG 276 (311) Q Consensus 208 ~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~---~~~~~~~-~~~~i~~~~~~~~~-------~~~ 276 (311) .-+...|+++.+-+..- .+..+......+...+++.--. ...++-. ++...........+ -.. T Consensus 237 ~~~~~~g~~i~~y~~~y------~d~~G~~~~~~p~~~v~l~~~~~~G~~~yg~~~e~~~~~~~~~~~~~~~~~~~~~~~ 310 (348) T protein:vir:96 237 YVADNYGVEIVLENGTY------RNEKGEVSKFFPDGHLTLIPNGPLGNTVFGTTPEESDLFADNTVNADVEIVDSGIAV 310 (348) T ss_pred HHhhhcCceEEEEccEE------EecCCcEeccccCCeEEEEcCCCceeEEeccChhhhhhhhcccccccceecCCeeEE Confidence 12344566665322111 0011111111122222221111 1111100 00000000000000 000 Q ss_pred hhh-hcC--cEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 277 DLK-RQN--QIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 277 ~~f-~~~--~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ..| +.| ...+++..+.=-.+.+|+++.++|.-+.- T Consensus 311 ~~~~~~dP~~~~~~~~s~plPv~~~~~~~~~a~Vl~~~ 348 (348) T protein:vir:96 311 TTTKTTDPVNVQTKVSMVALPSFERLGDVYMLTVIPGV 348 (348) T ss_pred EeeecCCCceEEEEEeeeeeccccCCCcEEEEEEecCC Confidence 001 111 23445555555566778888888755555 No 250 >protein:vir:101811 Length: 529 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238888;genbank:gi:66391963;genbank:GeneID:3416638 Probab=30.47 E-value=1.6 Score=19.50 Aligned_cols=279 Identities=11% Similarity=0.045 Sum_probs=104.2 Q ss_pred CcccCCCceEcchhHHHHHHHHHHhhchhhhhc--ceeecCC------CceEEEE--------EeCCceeEEeec----- Q lcl|Aclame:pro 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLS--MAEPQEF------GEQQYMT--------LTAPPRGEVVGE----- 59 (311) Q Consensus 1 mat~~~g~~~vP~~~~~~ii~~~~~~s~l~~l~--~~~~~~~------~~~~~p~--------~~~~~~a~~v~E----- 59 (311) +...+.+..........-+... ........+ ....... ....+.. ..+..-..-.+| T Consensus 171 ~~~~ta~~~~a~g~g~ea~f~e--a~t~fs~~~~g~~~~~g~~~t~~~~~~~~~~~~a~~~~~~~~~GmsTa~aEaL~~~ 248 (529) T protein:vir:10 171 FAKLTAGQAIAEGDIVGHFFYE--SGTAFLQNVSGASVTVGTNETGEALDKLINAAIGEGKLAEIAEGMATSIAELRQGF 248 (529) T ss_pred cccccccccccccccceeeecc--cCceeeccccccccccCccccCcccccccccccccccccccccchhhhhhhccccC Confidence 2111111110000000000000 000000000 0000000 0000000 001111111223 Q ss_pred ----CccccccccceeEEEEeeeeEEEEEeecHHHhhcCch-hhHHHHHHHHHHHHHHHHHHHHHHhhhccc--CC---- Q lcl|Aclame:pro 60 ----GAQKSESTATFAPVTAIPRKVQVTQRFSQEVKWADES-RQLGVLQTMADLSGVALGRALDLIGIHGIN--PL---- 128 (311) Q Consensus 60 ----g~~~~~~~~~~~~v~l~~~kl~~~i~iS~ell~~s~~-~~~~~~~~i~~~la~~ia~~~d~~~l~G~~--~~---- 128 (311) +.++++-..+++.+++.++.-+-...+|-||.||--. -.+|.+++|..-|+..|...|++.+|+-.. +. T Consensus 249 ggss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILStEImlEINReii~~l~~~a~~~~~ 328 (529) T protein:vir:10 249 NGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWINYTAQVGKS 328 (529) T ss_pred CCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhhhhhhcc Confidence 1234555566666666665555566889999987533 247888999999999999999998885422 11 Q ss_pred Cccccccccccccccccceeeccccc-----cchHHHHHHHHHHHhh--cCCCccEEEEcHHHHHHHHHh--h------- Q lcl|Aclame:pro 129 TGAALSGSPAKILDTTNIVELTTGTS-----ATPDLAVEAAVGLVLG--DNLSPDGVALDNTFSFMLATQ--R------- 192 (311) Q Consensus 129 ~g~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~i~~~~~~~~~--~~~~~~~~v~n~~~~~~l~~l--k------- 192 (311) .+....+...|+.+..........-. ...+--+......+.. .....+.++++++....|... + T Consensus 329 ~~~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~~~~~~~~~~ 408 (529) T protein:vir:10 329 GWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALALIDTNISPAAQG 408 (529) T ss_pred ccccccccccceeecccCccccccchHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHHhhcccccccccc Confidence 11111112223333322111110100 0111122222222222 223456788999888888642 1 Q ss_pred ccCCceeeccccc-cCCCcee-cceeEEeecccccccccccccccccccccccceEEEeecceE----EEEeecCceEEE Q lcl|Aclame:pro 193 DSQGRKLYPELGF-GTDVASF-AGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAF----RWGVQVSIPLEL 266 (311) Q Consensus 193 d~~g~~~~~~~~~-~~~~~~l-~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~----~~~~~~~~~i~~ 266 (311) ...| |..+.+ ....|.| .|++|++..+.+.. .+++|-.... .+.+..-..+.. T Consensus 409 ~~sg---~~~d~~~~~~~G~l~~~~~vy~D~y~~~d------------------y~~vG~KG~~~~~~glfy~PYv~l~~ 467 (529) T protein:vir:10 409 MASG---LNADTTKGVFAGILGGRYKVYIDQYARQD------------------YFTMGYRGANNLDAGIYYCPYVALTP 467 (529) T ss_pred cccc---cccccCCceEEEEecCceEEEecCCCCcc------------------eEEEEEeCCcccccceeecccccccc Confidence 1111 211111 1123444 45688887776543 2222221100 011111111222 Q ss_pred eccCCcccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 267 IEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 267 ~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) .+-.+++. || -.+.++ .|++..+ +|=+. -.+.+.++ T Consensus 468 ~~~~dp~s----fq-P~~g~~--tRY~l~~-NP~~~-~~~~~~~~ 503 (529) T protein:vir:10 468 LRGFDPKN----FQ-PVMGFK--TRYAIGV-NPFAE-SRTQAPQG 503 (529) T ss_pred ccccCCCc----cc-ceeeee--eeeceee-cCccc-cccccccc Confidence 22223322 33 234443 4666543 33111 00111111 No 251 >protein:vir:107947 Length: 519 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595301;genbank:gi:161622607;genbank:GeneID:5783666 Probab=29.66 E-value=1.6 Score=19.41 Aligned_cols=280 Identities=13% Similarity=0.052 Sum_probs=107.9 Q ss_pred CcccCCCceEcchhH-HHHHHHHHHhhchhhhhcceeecCCCc-----e--EEE--------EEeCCceeEEeec----- Q lcl|Aclame:pro 1 MVALATGTFQLPKHL-VPGVWQKAQGQSVLARLSMAEPQEFGE-----Q--QYM--------TLTAPPRGEVVGE----- 59 (311) Q Consensus 1 mat~~~g~~~vP~~~-~~~ii~~~~~~s~l~~l~~~~~~~~~~-----~--~~p--------~~~~~~~a~~v~E----- 59 (311) -.+...+.......+ ...... ................+. . .+. +..+..-..-.+| T Consensus 161 ~~~~~~~~~~~~g~~~~~~~~~---s~~~~~~~~~~~t~~ag~t~~~~~~~a~~~~~~~~~~~~~~~gmsTa~aEal~~l 237 (519) T protein:vir:10 161 FEALAASKVLEVGKIYSHFFEA---TGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGF 237 (519) T ss_pred cccccccccccccccccccccc---cccceeccccccccCCCCcCccccccccccccccccccccccccccchhhccccC Confidence 001111111111000 000000 000000000000000000 0 000 0001111111223 Q ss_pred ----CccccccccceeEEEEeeeeEEEEEeecHHHhhcCch-hhHHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccc Q lcl|Aclame:pro 60 ----GAQKSESTATFAPVTAIPRKVQVTQRFSQEVKWADES-RQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALS 134 (311) Q Consensus 60 ----g~~~~~~~~~~~~v~l~~~kl~~~i~iS~ell~~s~~-~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~~~g~~~~ 134 (311) +.++++-..+++.++..++.-+-...+|-||.||--. -.+|.+++|..-|+..|...|++.+|.=.+-..--+.. T Consensus 238 ggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~~ 317 (519) T protein:vir:10 238 NGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKS 317 (519) T ss_pred CCccccchhhhceeEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhhhccee Confidence 2345566666777666666555566889999987543 25788999999999999999999998521110000111 Q ss_pred ccc------ccccccccceeeccccc-----cchHHHHHHHHHHHhh--cCCCccEEEEcHHHHHHHHHhh-----ccC- Q lcl|Aclame:pro 135 GSP------AKILDTTNIVELTTGTS-----ATPDLAVEAAVGLVLG--DNLSPDGVALDNTFSFMLATQR-----DSQ- 195 (311) Q Consensus 135 ~~~------~~~~~~~~~~~~~~~~~-----~~~~~~i~~~~~~~~~--~~~~~~~~v~n~~~~~~l~~lk-----d~~- 195 (311) ++. .|+.+......+..+-. ...+--+......+.. .....+.++++++....|...- .+. T Consensus 318 g~t~~~~~~aGv~d~~~~~d~~~~rw~~e~~k~L~~~i~~~an~I~~~T~r~~gn~ii~S~~Va~~L~~~g~~~~~~~~~ 397 (519) T protein:vir:10 318 GMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQG 397 (519) T ss_pred ecccCcccccceeecccccccccchHHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhccchhcccccc Confidence 111 12222211111110000 0112222222223322 2233467899999888886532 011 Q ss_pred CceeeccccccCC-Ccee-cceeEEeecccccccccccccccccccccccceEEEeecceE----EEEeecCceEEEecc Q lcl|Aclame:pro 196 GRKLYPELGFGTD-VASF-AGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAF----RWGVQVSIPLELIEF 269 (311) Q Consensus 196 g~~~~~~~~~~~~-~~~l-~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~----~~~~~~~~~i~~~~~ 269 (311) .+..+..+..... .|.| .|++|++..+.+.+ .+++|-.... .+.+..-..+...+- T Consensus 398 ~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d------------------y~~vG~KG~~~~~~glfyaPYv~l~~~~~ 459 (519) T protein:vir:10 398 LGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSD------------------YFTIGYKGSNEMDAGIYYAPYVALTPLRG 459 (519) T ss_pred ccccccccCCCceEEEEecCceEEEecCCCCcc------------------eEEEEEecCcccccceeeccccccccccc Confidence 1222332322221 2444 45688888776643 2222221100 011111122222222 Q ss_pred CCcccchhhhhcCcEEEEEEEEeccEEecccceE-EEEecccC Q lcl|Aclame:pro 270 GDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFA-VVRDADES 311 (311) Q Consensus 270 ~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~-~l~~aa~~ 311 (311) .+++. || -.+.++ .|++..+ +| |+ .++.+-.+ T Consensus 460 ~dp~s----fq-P~~g~~--tRY~l~~-NP--~~~~~~~~~~~ 492 (519) T protein:vir:10 460 SDPKN----FQ-PVMGFK--TRYGIGI-NP--FADPAAQAPTK 492 (519) T ss_pred cCCcc----cc-ceeeee--eeeceee-cC--cccccccCccc Confidence 23332 43 234443 4666543 34 22 11111111 No 252 >protein:vir:99523 Length: 311 # NCBI annotation: putative protein # Family: family:all:701 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958538;genbank:gi:41179320;genbank:GeneID:2717161 Probab=29.06 E-value=1.7 Score=19.33 Aligned_cols=292 Identities=10% Similarity=-0.018 Sum_probs=111.3 Q ss_pred CcccCCCceEc--chhHHHHHHHHHHhhchhhhhcc---eeecCCCceEEEEEeCCceeE-EeecCccccccccceeEEE Q lcl|Aclame:pro 1 MVALATGTFQL--PKHLVPGVWQKAQGQSVLARLSM---AEPQEFGEQQYMTLTAPPRGE-VVGEGAQKSESTATFAPVT 74 (311) Q Consensus 1 mat~~~g~~~v--P~~~~~~ii~~~~~~s~l~~l~~---~~~~~~~~~~~p~~~~~~~a~-~v~Eg~~~~~~~~~~~~v~ 74 (311) |-+. +....+ -+.+.+.+-+.+...+.-..|.. .+-.++.+++||+.+..+-.. -...|-....-+.+++..+ T Consensus 1 ~~~~-an~mAlnya~~~~~~Ld~~~~~~~~t~~l~~~~~~~~~Gak~VkIp~i~~~gl~dY~R~~g~~~g~v~~~~et~t 79 (311) T protein:vir:99 1 MPTD-AETRGFNYVTKDGNLLDQKITAGLFTAALGTPEVDLVNGGRSFTLKTISTSGLKDHTRGKGFNSGTISDEKTIYT 79 (311) T ss_pred CCCc-chhhHHHHHHHHHHHHHHHHHhhhcccceecCchheeecCCEEEEEeeeeccccccccccCccccceeeeeeEEE Confidence 3332 222212 23354554444444332111211 122345678999987533222 2232222222234445555 Q ss_pred EeeeeEEEEEeecHHHhhcCchh--hHHHHHHHHHHHHHHHHHHHHHHhhhcccC-CCccccccccccccccccceeecc Q lcl|Aclame:pro 75 AIPRKVQVTQRFSQEVKWADESR--QLGVLQTMADLSGVALGRALDLIGIHGINP-LTGAALSGSPAKILDTTNIVELTT 151 (311) Q Consensus 75 l~~~kl~~~i~iS~ell~~s~~~--~~~~~~~i~~~la~~ia~~~d~~~l~G~~~-~~g~~~~~~~~~~~~~~~~~~~~~ 151 (311) |..-+--.+ .| +.+ +.+++ ...+...+.+...+..+=.+|.-.+.=--. ..+.... .........+.....+ T Consensus 80 l~~DR~~~f-~v-D~m--DvdETn~~~~~ani~~~f~r~~vvPEiDayrfskla~~a~~~~~~-~~~~~~~~~~~~~~~~ 154 (311) T protein:vir:99 80 MGQDRDVEF-YL-DRQ--DVDETDNELAMANISNVFITEHVQPELDSYRFSKIATSFDNLDGT-DTEGTLLAKTHKTEET 154 (311) T ss_pred eeeccceee-ec-chh--chhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhccccc-ccchhhhccccccccc Confidence 544322111 11 110 00111 111222233333333444455443311000 0000000 0000111111111122 Q ss_pred ccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCceee----ccccccCCCceecceeEE---eecccc Q lcl|Aclame:pro 152 GTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLY----PELGFGTDVASFAGLNAA---VSDTVR 224 (311) Q Consensus 152 ~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~g~~~~----~~~~~~~~~~~l~G~pv~---~~~~~~ 224 (311) -+....++.+..++..+...+..+-.+.++|.....|...+.-+ |.+- .........++|.|+|++ .+++|. T Consensus 155 lt~~nvl~~l~~~~~~~~~v~~~~rvl~vTp~~~~lLk~~~~~~-r~~~~~~~~~~~i~~~V~~lDgv~Ii~V~ps~r~~ 233 (311) T protein:vir:99 155 LDETNAYSQLKTGIGKVRKYGTQNLVGYVSSEVMDALERSKEFT-RNITNQNVGTTALESRITSIDGVQLIEVYESNRFM 233 (311) T ss_pred cCHHHHHHHHHHHHHHHHhcCCCCeEEEEChHHHHHHhhchhhh-eeeecccccccccccccceecCeEEEEecCchhhc Confidence 23445567777888777665554446888998888776432211 1110 111224457899999975 334443 Q ss_pred cccccccccccccccccccceEEEeecceEEEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecccc--e Q lcl|Aclame:pro 225 GGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDA--F 302 (311) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a--~ 302 (311) ..-.. .+. ......+-+-..++-..+ ..+...+.-.++++.-. ..... +.-.+.-+.++|.=|.+.+. + T Consensus 234 t~~~f-t~G-~~~~~~ak~INfiiv~~~-a~i~~~K~~~v~~f~P~-~~~~g-----d~~l~~~R~Y~D~fv~~nk~~~I 304 (311) T protein:vir:99 234 TKYDF-TDG-AKPTEDAKAINFLVVAKP-AVISIVKENAVFLFAPG-QHTDG-----DGYLYQNRLYHDLFIKKHKRDGI 304 (311) T ss_pred chhhh-cCC-ccccCcccccceEEeCCC-eeeeeeeeeeeeeeCCC-CCCCc-----ceeeeeeeeeeeeeeeccccCeE Confidence 11000 000 000011111122222222 33344444445544211 11111 11233345677877777643 4 Q ss_pred -EEEEec Q lcl|Aclame:pro 303 -AVVRDA 308 (311) Q Consensus 303 -~~l~~a 308 (311) +-++.| T Consensus 305 yv~~k~A 311 (311) T protein:vir:99 305 FVSVKKA 311 (311) T ss_pred EEeeecC Confidence 223333 No 253 >protein:vir:101039 Length: 529 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932516;genbank:gi:37651642;genbank:GeneID:2610532 Probab=27.63 E-value=1.8 Score=19.15 Aligned_cols=279 Identities=12% Similarity=0.035 Sum_probs=105.9 Q ss_pred Cc-ccCCCc--eEcchhHHHHHHHHHHhhchhhhhcce-----------eecCCCceEEEEEeCCceeEEeec------- Q lcl|Aclame:pro 1 MV-ALATGT--FQLPKHLVPGVWQKAQGQSVLARLSMA-----------EPQEFGEQQYMTLTAPPRGEVVGE------- 59 (311) Q Consensus 1 ma-t~~~g~--~~vP~~~~~~ii~~~~~~s~l~~l~~~-----------~~~~~~~~~~p~~~~~~~a~~v~E------- 59 (311) +. +.+.+. -..-.+-...+....- .... ..+.. ..+..+.. . ..+..-..-.+| T Consensus 176 ~~~~~a~~~g~ea~f~ea~t~fs~~~~-g~~~-~~g~~~~~~~~~~~~~~~~a~~~~--~-~~~~Gm~Ta~aEaL~~~g~ 250 (529) T protein:vir:10 176 AGQAIAEGDIVGHFFYESGTAFLQNVS-GASV-TVGTNETGEALDKLINAAIGEGKL--A-EIAEGMATSIAELRQGFNG 250 (529) T ss_pred ccccccccCcceeeeecccceeccccc-cccc-ccCccccCcccccccccccccccc--c-ccccccchhhhhccccCCC Confidence 11 000000 0000000000000000 0000 00000 00000000 0 001111111223 Q ss_pred --CccccccccceeEEEEeeeeEEEEEeecHHHhhcCch-hhHHHHHHHHHHHHHHHHHHHHHHhhhcccC--C----Cc Q lcl|Aclame:pro 60 --GAQKSESTATFAPVTAIPRKVQVTQRFSQEVKWADES-RQLGVLQTMADLSGVALGRALDLIGIHGINP--L----TG 130 (311) Q Consensus 60 --g~~~~~~~~~~~~v~l~~~kl~~~i~iS~ell~~s~~-~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~--~----~g 130 (311) +.++++-..+++.+++.++.-+-...+|-||.||--. -.+|.+++|..-|+..|...|++.+|+-... . .+ T Consensus 251 ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILStEImlEINReii~~l~~~a~~~k~~g 330 (529) T protein:vir:10 251 SNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWINYTAQVGKSGW 330 (529) T ss_pred cccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHhHhhhhhhhhccc Confidence 2345555666677666666555566889999987543 2478889999999999999999998854221 1 11 Q ss_pred cccccccccccccccceeeccccc-----cchHHHHHHHHHHHhh--cCCCccEEEEcHHHHHHHHHh--hccCC----c Q lcl|Aclame:pro 131 AALSGSPAKILDTTNIVELTTGTS-----ATPDLAVEAAVGLVLG--DNLSPDGVALDNTFSFMLATQ--RDSQG----R 197 (311) Q Consensus 131 ~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~i~~~~~~~~~--~~~~~~~~v~n~~~~~~l~~l--kd~~g----~ 197 (311) ....+..+|+.+..........-. ...+--+......+.. .....+.++++++....|... ++.-+ . T Consensus 331 ~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~k~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~~~~~~~~~~~~ 410 (529) T protein:vir:10 331 TKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALALIDTNISPAAQGMA 410 (529) T ss_pred ccccccccceeecccCccccccchHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHHhhhhhccccccccc Confidence 111122233333322111110100 0111122222222222 223456788999888888641 11100 0 Q ss_pred eeeccccc-cCCCcee-cceeEEeecccccccccccccccccccccccceEEEeecceE----EEEeecCceEEEeccCC Q lcl|Aclame:pro 198 KLYPELGF-GTDVASF-AGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAF----RWGVQVSIPLELIEFGD 271 (311) Q Consensus 198 ~~~~~~~~-~~~~~~l-~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~----~~~~~~~~~i~~~~~~~ 271 (311) .-|..+.+ ....|.| .|++|++..+.+.. .+++|-.... .+.+..-+.+...+-.+ T Consensus 411 sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~d------------------y~~vG~KG~~~~~~glfy~PYv~l~~~~~~d 472 (529) T protein:vir:10 411 SGLNADTTKGVFAGILGGRYKVYIDQYARQD------------------YFTMGYRGANNLDAGIYYCPYVALTPLRGSD 472 (529) T ss_pred cccccccCCceEEEEecCceEEEecCCCCcc------------------eEEEEEeCCcccccceeeccccccccccccC Confidence 11211111 1123444 45688887776543 2222221100 01111222222222233 Q ss_pred cccchhhhhcCcEEEEEEEEeccEEecccceEEEEecccC Q lcl|Aclame:pro 272 PDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) Q Consensus 272 ~~~~~~~f~~~~v~~ra~~r~~~~v~~~~a~~~l~~aa~~ 311 (311) ++. || -.+.++ .|++..+ +|=+. -.+.+.++ T Consensus 473 p~s----fq-P~~g~~--tRY~l~~-NP~~~-~~~~~~~~ 503 (529) T protein:vir:10 473 PKN----FQ-PVMGFK--TRYAIGV-NPFAE-SRTQAPQG 503 (529) T ss_pred CCc----cc-ceeeee--eeeceee-cCccc-cccccccc Confidence 332 43 334443 4666543 33111 01111111 No 254 >protein:vir:104915 Length: 470 # NCBI annotation: T4-like major capsid protein # Family: family:all:364 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214367;genbank:gi:61806007;genbank:GeneID:3294435 Probab=24.93 E-value=2.1 Score=18.80 Aligned_cols=281 Identities=14% Similarity=0.108 Sum_probs=117.0 Q ss_pred Cc-ccCCCceEcchhHHHH---HHHHHHhhchhhhhcceeecCCCceEEEEE-----e-CCcee-------EEee----- Q lcl|Aclame:pro 1 MV-ALATGTFQLPKHLVPG---VWQKAQGQSVLARLSMAEPQEFGEQQYMTL-----T-APPRG-------EVVG----- 58 (311) Q Consensus 1 ma-t~~~g~~~vP~~~~~~---ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~-----~-~~~~a-------~~v~----- 58 (311) .+ .++++.+. .+... +++.+.+..+..+++.+.||.+++.-|.-. + .+.++ .|-+ T Consensus 69 i~~st~t~~v~---~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~sG~EaffnEA~T~fSG~~~~~ 145 (470) T protein:vir:10 69 SADATAAGPVA---GFDPVLISLIRRSMPNLVAYDLAGVQPMNGPTGLIFAMRSRYKTQSGTEALFNEADTAFSGQPDGL 145 (470) T ss_pred ccccccccccc---ccCchhhhhHHHHHhhhhhhhhheeecCCccceeeeEEEEEecCCCccceeeecCCcccCcccccc Confidence 22 12222221 12222 334444667778888888888766433211 0 00000 0100 Q ss_pred ----------------------------------------------------c------CccccccccceeEEEEeeeeE Q lcl|Aclame:pro 59 ----------------------------------------------------E------GAQKSESTATFAPVTAIPRKV 80 (311) Q Consensus 59 ----------------------------------------------------E------g~~~~~~~~~~~~v~l~~~kl 80 (311) | +.+.++-..+++.+++.++.- T Consensus 146 ~~~~~~~~~~a~~~g~~~~~~~gt~~~~~~~~~~~a~~~~y~~~~GMsTa~aE~lg~s~~~~f~EMaFsIeK~tVtAKSR 225 (470) T protein:vir:10 146 DDTSGFTATGANNVGLGTTAQQGSNPGLLNSTAAQTNATDYNVGQGMRTDSAEDLGDGTGDQFNQMAFSIEKVTVTAKSR 225 (470) T ss_pred cccccccccccccccccccccccccccccccccccccccccccccccchHHhhhcCCCCCcccceeeeEEEEEEEEeecc Confidence 0 011222223334444444433 Q ss_pred EEEEeecHHHhhcCch-hhHHHHHHHHHHHHHHHHHHHHHHhhhcccC--CCccccccccccccccccceeeccccccc- Q lcl|Aclame:pro 81 QVTQRFSQEVKWADES-RQLGVLQTMADLSGVALGRALDLIGIHGINP--LTGAALSGSPAKILDTTNIVELTTGTSAT- 156 (311) Q Consensus 81 ~~~i~iS~ell~~s~~-~~~~~~~~i~~~la~~ia~~~d~~~l~G~~~--~~g~~~~~~~~~~~~~~~~~~~~~~~~~~- 156 (311) +-...+|-||.||-.. -.+|.+++|..-|+..|...|++.+|.-... ..+........++.+. ....+.. T Consensus 226 aLKAeYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~~a~~~k~~~~~~~Gv~Dl------~~~~~gr~ 299 (470) T protein:vir:10 226 ALKAEYSLELAQDLKAIHGLNAEAELANILSTEILAEINREVIRTIYNVAEPGAQANVAAAGTFDL------DTDSNGRW 299 (470) T ss_pred ceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhhhhceeccccccceEEe------ecccchhH Confidence 4445789999887533 3578889999999999999999998864221 1111111111222211 1111100 Q ss_pred hHHHHHHHHHH---------HhhcCCCccEEEEcHHHHHHHHHh--hccC-C--ceeeccccccCC-Ccee-cceeEEee Q lcl|Aclame:pro 157 PDLAVEAAVGL---------VLGDNLSPDGVALDNTFSFMLATQ--RDSQ-G--RKLYPELGFGTD-VASF-AGLNAAVS 220 (311) Q Consensus 157 ~~~~i~~~~~~---------~~~~~~~~~~~v~n~~~~~~l~~l--kd~~-g--~~~~~~~~~~~~-~~~l-~G~pv~~~ 220 (311) ..+.+..++.+ .....+..+.++++++....|... .+.. | ..+ ..+.++.. .|.| .|++|++. T Consensus 300 ~~e~~~~l~~~i~~ean~i~~~t~r~~~n~~i~S~~Va~~La~sG~l~~~~~~~~~~-~~D~t~~~~~G~l~~~~~vy~d 378 (470) T protein:vir:10 300 SVEKFKGLIFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANL-NVDDTGNTFAGILQGKYRVYID 378 (470) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchhHHhHhhhcccccccccccccc-ccCCCCceEEEEecCceEEEee Confidence 01111111111 223455666789999888877431 1100 0 011 11111111 2444 45688887 Q ss_pred cccccccccccccccccccccccceEEEeecceE----EEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEE Q lcl|Aclame:pro 221 DTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAF----RWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGI 296 (311) Q Consensus 221 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~----~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v 296 (311) .++..+.... ...+++|-.... .+.+..-..+...+..+++. || -.+.++ .|++..+ T Consensus 379 ~y~~~~~~a~------------~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~s----fq-P~~g~~--tRY~l~~ 439 (470) T protein:vir:10 379 PFSASGGAAA------------TQYYVVGYKGSSPYDAGLFYCPYVPLQMVRAVGQDT----FQ-PKIGFK--TRYGLVE 439 (470) T ss_pred ccccccCccc------------ccEEEEEEecCcceecceeeccccccccCCCCCCcc----cc-ceeeee--eeeceee Confidence 6654321111 112333322110 01122222233333334332 43 234443 4666543 Q ss_pred ecccc------eEEEEecccC Q lcl|Aclame:pro 297 MSTDA------FAVVRDADES 311 (311) Q Consensus 297 ~~~~a------~~~l~~aa~~ 311 (311) +|=. ...+...+-. T Consensus 440 -NP~~~~~~~~~~~i~~~~n~ 459 (470) T protein:vir:10 440 -NPFSQGTTQGLGTLTRNSNR 459 (470) T ss_pred -cCcccCCCcccccccCCCCc Confidence 3322 1111111111 No 255 >protein:vir:106998 Length: 468 # NCBI annotation: major capsid protein gp23 # Family: family:all:364 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195142;genbank:gi:58532919;uniprot:Q5GQN0;genbank:GeneID:3260495 Probab=23.32 E-value=2.3 Score=18.58 Aligned_cols=283 Identities=12% Similarity=0.051 Sum_probs=117.6 Q ss_pred Cc-------ccCCCceEcchhHHHHHHHHH---HhhchhhhhcceeecCCCceEEEEE-----e-CCc-------eeEEe Q lcl|Aclame:pro 1 MV-------ALATGTFQLPKHLVPGVWQKA---QGQSVLARLSMAEPQEFGEQQYMTL-----T-APP-------RGEVV 57 (311) Q Consensus 1 ma-------t~~~g~~~vP~~~~~~ii~~~---~~~s~l~~l~~~~~~~~~~~~~p~~-----~-~~~-------~a~~v 57 (311) |. ..+++.+ ..+.+.++.+. .+..+..+++.+.||.+++.-|.-. + .+. +..|- T Consensus 63 ~~~~n~~~~~~~t~~v---~~~~P~Li~l~RRa~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~~g~EAf~nEadt~fS 139 (468) T protein:vir:10 63 IAPAGSALGSANTGGL---AGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFT 139 (468) T ss_pred cchhhhhhhhcccccc---cccCchhhhhHHHHHhhhhhhhceeeecCCccceeeeEEEEEecCCCCccceecccccccc Confidence 22 1112221 11233334444 4566777888888888765332211 0 000 00000 Q ss_pred ------------------------------------------------ec-----CccccccccceeEEEEeeeeEEEEE Q lcl|Aclame:pro 58 ------------------------------------------------GE-----GAQKSESTATFAPVTAIPRKVQVTQ 84 (311) Q Consensus 58 ------------------------------------------------~E-----g~~~~~~~~~~~~v~l~~~kl~~~i 84 (311) +| +.++++-..+++.++..++.-+-.. T Consensus 140 g~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~~~~~~g~gMsTa~aE~lG~~~~~f~EMaFsIeK~tVtAKSRaLKA 219 (468) T protein:vir:10 140 GGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSVTAQSRALKA 219 (468) T ss_pred ccccccccccccccccccccCCCCCcccccccccccccccccccchHHHhhcCCCCcccceeeeEEEEEEEeeeccceec Confidence 01 0112223333444444444444455 Q ss_pred eecHHHhhcCch-hhHHHHHHHHHHHHHHHHHHHHHHhhhccc--CCCccccccccccccccccceeeccccc-cchHHH Q lcl|Aclame:pro 85 RFSQEVKWADES-RQLGVLQTMADLSGVALGRALDLIGIHGIN--PLTGAALSGSPAKILDTTNIVELTTGTS-ATPDLA 160 (311) Q Consensus 85 ~iS~ell~~s~~-~~~~~~~~i~~~la~~ia~~~d~~~l~G~~--~~~g~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ 160 (311) .+|-||.||-.. -.+|.+++|..-|+..|..+|++.+|+-.. +..+........|+.+...... +.+ ...+.. T Consensus 220 eYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~va~~~k~~g~~~~Gv~d~~~~~~---~rw~~e~~k~ 296 (468) T protein:vir:10 220 EYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLDVDSN---GRWSVEKFKG 296 (468) T ss_pred cccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhcHHHHHhHhhhhhheeccccccccccccccccc---chhHHHHHHH Confidence 789999987543 357888999999999999999999885422 1121111111222222211111 000 001111 Q ss_pred ----HHHHHHH--HhhcCCCccEEEEcHHHHHHHHH---hhcc---CCceee---ccccccC-CCcee-cceeEEeeccc Q lcl|Aclame:pro 161 ----VEAAVGL--VLGDNLSPDGVALDNTFSFMLAT---QRDS---QGRKLY---PELGFGT-DVASF-AGLNAAVSDTV 223 (311) Q Consensus 161 ----i~~~~~~--~~~~~~~~~~~v~n~~~~~~l~~---lkd~---~g~~~~---~~~~~~~-~~~~l-~G~pv~~~~~~ 223 (311) +..-... .....+..+.++++++....|.. ++.. +++.-+ .-+.++. ..|.| .|++|++..+. T Consensus 297 L~~~i~~ean~i~~~T~rg~gn~ii~S~~Va~~L~~sG~l~~~~~~~~~~~~~~~~~D~tg~~~~G~l~~r~~vy~D~Ya 376 (468) T protein:vir:10 297 LLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYA 376 (468) T ss_pred HHHHHHHHHHHHHHhhccccccEEEechhHHHHHhhcCcceecccccccccccccccccCcceEEEEecCceEEEEcccc Confidence 1111122 22345566779999999998875 3311 111111 1111111 12444 35678776554 Q ss_pred ccccccccccccccccccccceEEEeecceE----EEEeecCceEEEeccCCcccchhhhhcCcEEEEEEEEeccEEecc Q lcl|Aclame:pro 224 RGGPEAVTASTGVYRTTNPNVKAIAGDFSAF----RWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMST 299 (311) Q Consensus 224 ~~~~~~~~~~~~~~~~~~~~~~~~~gd~~~~----~~~~~~~~~i~~~~~~~~~~~~~~f~~~~v~~ra~~r~~~~v~~~ 299 (311) ..+ .+...+++|-.... .+.+..-..+...+..+++. || -.+.++ .|++..+ +| T Consensus 377 ~~~--------------s~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s----fq-P~~g~~--tRY~l~~-NP 434 (468) T protein:vir:10 377 ANL--------------SDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNT----FQ-PKIGFK--TRYGMVS-NP 434 (468) T ss_pred ccC--------------CccceEEEEEecCcceeceeeeccccccccccccCCCc----cc-ceeeee--eeeceee-cc Confidence 321 11122333322110 01112222222333333332 33 234443 4666543 33 Q ss_pred cce-EEEEecc---cC Q lcl|Aclame:pro 300 DAF-AVVRDAD---ES 311 (311) Q Consensus 300 ~a~-~~l~~aa---~~ 311 (311) =+. ..++... ++ T Consensus 435 ~~~~~~~~~g~~~~~~ 450 (468) T protein:vir:10 435 FVTTNGLYNGTPDGEA 450 (468) T ss_pred cceeccccCCCccccc Confidence 110 1111111 01 Done!