Query lcl|Aclame:protein:vir:78739|NCBI_annot:major capsid protein|genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Match_columns 332 No_of_seqs 151 out of 179 Neff 7.7 Searched_HMMs 1612 Date Mon Dec 2 15:45:27 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_58 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_58_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:78739 Length: 332 100.0 6E-107 4E-110 603.0 26.9 332 1-332 1-332 (332) 2 protein:vir:100057 Length: 375 100.0 2E-97 1E-100 550.7 24.0 329 1-332 1-368 (375) 3 protein:vir:6324 Length: 335 # 100.0 5.9E-93 3.6E-96 526.2 25.1 315 1-332 1-326 (335) 4 protein:vir:10450 Length: 344 100.0 3.2E-92 2E-95 522.2 28.6 323 1-332 1-342 (344) 5 protein:vir:94576 Length: 347 100.0 2.9E-91 1.8E-94 517.0 28.5 322 1-332 1-347 (347) 6 protein:vir:80213 Length: 334 100.0 2.8E-91 1.7E-94 517.0 26.7 315 1-332 1-330 (334) 7 protein:vir:78935 Length: 335 100.0 1.9E-91 1.2E-94 517.9 25.4 315 1-332 1-326 (335) 8 protein:vir:2201 Length: 345 # 100.0 4.6E-91 2.8E-94 515.9 27.1 323 1-332 1-343 (345) 9 protein:vir:103323 Length: 364 100.0 1.3E-91 8E-95 518.9 23.6 318 1-332 1-337 (364) 10 protein:vir:94711 Length: 347 100.0 9.4E-90 5.8E-93 508.7 27.7 321 1-332 1-344 (347) 11 protein:vir:8885 Length: 347 # 100.0 2.7E-89 1.7E-92 506.1 28.1 322 1-332 1-344 (347) 12 protein:vir:3364 Length: 347 # 100.0 4E-89 2.5E-92 505.2 27.3 322 1-332 1-343 (347) 13 protein:vir:97031 Length: 402 100.0 1.9E-88 1.2E-91 501.5 25.9 318 1-332 1-337 (402) 14 protein:vir:1541 Length: 347 # 100.0 2.7E-87 1.7E-90 495.2 26.0 319 1-332 1-343 (347) 15 protein:vir:7019 Length: 401 # 100.0 9E-87 5.6E-90 492.3 25.8 317 1-332 1-331 (401) 16 protein:vir:105645 Length: 400 100.0 3.3E-86 2E-89 489.2 26.4 318 1-332 1-331 (400) 17 protein:vir:99675 Length: 324 100.0 9.8E-77 6.1E-80 437.3 23.0 274 52-332 1-294 (324) 18 protein:vir:94622 Length: 341 100.0 7.6E-70 4.7E-73 399.5 24.3 312 1-332 1-337 (341) 19 protein:vir:80180 Length: 381 100.0 5.7E-63 3.5E-66 361.8 23.8 321 1-332 1-379 (381) 20 protein:vir:102605 Length: 273 100.0 1E-58 6.2E-62 338.6 18.8 267 1-332 1-271 (273) 21 protein:vir:105822 Length: 273 100.0 1E-58 6.2E-62 338.6 18.8 267 1-332 1-271 (273) 22 protein:vir:7990 Length: 273 # 100.0 4.2E-57 2.6E-60 329.7 18.0 266 1-332 1-271 (273) 23 protein:vir:1781 Length: 221 # 100.0 2.3E-56 1.4E-59 325.7 16.4 216 96-326 1-221 (221) 24 protein:vir:3136 Length: 322 # 100.0 2E-56 1.2E-59 325.9 15.4 298 1-332 1-316 (322) 25 protein:vir:102655 Length: 322 100.0 1.7E-52 1.1E-55 304.4 21.5 306 1-332 1-319 (322) 26 protein:vir:107120 Length: 329 100.0 9.1E-42 5.7E-45 245.6 19.3 284 1-332 16-303 (329) 27 protein:vir:97331 Length: 319 100.0 9.1E-42 5.7E-45 245.6 18.9 284 1-332 5-292 (319) 28 protein:vir:94800 Length: 319 100.0 9.1E-42 5.7E-45 245.6 18.9 284 1-332 5-292 (319) 29 protein:vir:80930 Length: 278 100.0 1.2E-41 7.2E-45 245.0 17.0 270 1-332 1-275 (278) 30 protein:vir:96123 Length: 274 100.0 2E-40 1.2E-43 238.2 16.5 263 1-332 1-268 (274) 31 protein:vir:99075 Length: 392 100.0 5.1E-40 3.2E-43 236.0 17.3 281 1-332 1-296 (392) 32 protein:vir:108303 Length: 418 100.0 8.1E-40 5E-43 234.9 16.9 287 1-332 1-415 (418) 33 protein:vir:3525 Length: 423 # 100.0 2.1E-38 1.3E-41 227.2 17.9 291 1-332 1-422 (423) 34 protein:vir:3613 Length: 272 # 100.0 7.2E-39 4.4E-42 229.7 15.3 265 1-332 1-270 (272) 35 protein:vir:93742 Length: 274 100.0 1.1E-38 7E-42 228.6 15.4 263 1-332 1-268 (274) 36 protein:vir:95898 Length: 274 100.0 1.5E-38 9.4E-42 227.9 15.6 263 1-332 1-268 (274) 37 protein:vir:96262 Length: 274 100.0 1.5E-38 9.4E-42 227.9 15.6 263 1-332 1-268 (274) 38 protein:vir:1239 Length: 274 # 100.0 4.5E-38 2.8E-41 225.3 16.4 263 1-332 1-268 (274) 39 protein:vir:94494 Length: 274 100.0 8.1E-38 5E-41 223.9 16.0 263 1-332 1-268 (274) 40 protein:vir:97433 Length: 274 100.0 8.1E-38 5E-41 223.9 16.0 263 1-332 1-268 (274) 41 protein:vir:174 Length: 423 # 100.0 2.8E-37 1.7E-40 221.0 17.3 291 1-332 1-422 (423) 42 protein:vir:96833 Length: 275 100.0 1.7E-37 1.1E-40 222.1 16.0 265 1-332 1-269 (275) 43 protein:vir:105374 Length: 423 100.0 5.6E-37 3.5E-40 219.3 17.8 291 1-332 1-422 (423) 44 protein:vir:79008 Length: 299 100.0 3.4E-36 2.1E-39 215.0 19.5 283 1-332 1-297 (299) 45 protein:vir:105522 Length: 423 100.0 4.5E-35 2.8E-38 208.9 17.2 292 1-332 1-422 (423) 46 protein:vir:105334 Length: 276 100.0 1E-34 6.3E-38 207.0 16.2 264 1-332 1-268 (276) 47 protein:vir:78920 Length: 290 100.0 6.8E-33 4.2E-36 197.0 17.7 281 1-332 1-288 (290) 48 protein:vir:3033 Length: 272 # 100.0 3.2E-33 2E-36 198.7 14.8 262 1-332 1-267 (272) 49 protein:vir:9820 Length: 272 # 100.0 3.2E-33 2E-36 198.7 14.8 262 1-332 1-267 (272) 50 protein:vir:105464 Length: 346 99.9 3.4E-30 2.1E-33 182.2 18.5 283 1-332 1-298 (346) 51 protein:vir:102335 Length: 312 99.9 4.4E-29 2.7E-32 176.0 18.2 294 1-332 1-308 (312) 52 protein:vir:739 Length: 231 # 99.9 3.4E-28 2.1E-31 171.2 11.3 229 53-332 1-229 (231) 53 protein:vir:79712 Length: 285 99.9 3.9E-26 2.4E-29 159.9 15.8 269 1-332 1-281 (285) 54 protein:vir:95107 Length: 270 99.9 1.1E-25 6.7E-29 157.5 14.8 259 1-332 1-263 (270) 55 protein:vir:99523 Length: 311 99.9 6.6E-25 4.1E-28 153.1 17.2 298 1-332 1-311 (311) 56 protein:vir:95451 Length: 313 99.8 2.9E-22 1.8E-25 138.7 11.4 297 1-332 1-309 (313) 57 protein:vir:78090 Length: 302 99.8 3.6E-21 2.2E-24 132.7 16.1 280 1-332 1-300 (302) 58 protein:vir:100939 Length: 430 99.7 4.1E-19 2.6E-22 121.4 16.1 291 1-332 1-427 (430) 59 protein:vir:9265 Length: 430 # 99.7 4.1E-19 2.6E-22 121.4 16.1 291 1-332 1-427 (430) 60 protein:vir:2106 Length: 430 # 99.7 2.4E-19 1.5E-22 122.6 13.7 291 1-332 1-427 (430) 61 protein:vir:78523 Length: 338 99.5 6.8E-16 4.2E-19 103.7 15.4 306 1-332 1-333 (338) 62 protein:vir:7771 Length: 330 # 99.4 6.3E-15 3.9E-18 98.4 16.2 295 1-332 1-321 (330) 63 protein:vir:105905 Length: 304 99.4 8.1E-15 5E-18 97.8 15.8 285 1-332 1-303 (304) 64 protein:vir:94142 Length: 304 99.4 8.1E-15 5E-18 97.8 15.8 285 1-332 1-303 (304) 65 protein:vir:9759 Length: 303 # 99.4 1.5E-14 9.3E-18 96.4 15.8 284 1-332 1-301 (303) 66 protein:vir:1638 Length: 298 # 99.4 2.4E-14 1.5E-17 95.2 16.6 285 1-332 1-297 (298) 67 protein:vir:41 Length: 299 # N 99.4 2.8E-14 1.7E-17 94.9 16.7 283 1-332 1-296 (299) 68 protein:vir:94771 Length: 298 99.4 3.3E-14 2.1E-17 94.5 17.0 284 1-332 1-297 (298) 69 protein:vir:78223 Length: 333 99.4 1.9E-14 1.2E-17 95.8 15.4 306 1-332 1-330 (333) 70 protein:vir:95763 Length: 297 99.3 1.1E-13 6.9E-17 91.6 16.9 280 1-332 1-294 (297) 71 protein:vir:8187 Length: 311 # 99.3 1.1E-13 6.7E-17 91.7 15.9 291 1-332 1-308 (311) 72 protein:vir:9574 Length: 300 # 99.3 2.3E-13 1.4E-16 89.9 17.6 285 1-332 1-298 (300) 73 protein:vir:9309 Length: 324 # 99.3 5.5E-14 3.4E-17 93.3 14.0 286 1-332 14-313 (324) 74 protein:vir:80684 Length: 315 99.3 2.2E-13 1.3E-16 90.0 17.0 290 1-332 1-304 (315) 75 protein:vir:10364 Length: 390 99.3 7.3E-14 4.6E-17 92.6 14.2 283 1-332 104-390 (390) 76 protein:vir:1886 Length: 385 # 99.3 1.3E-13 8.1E-17 91.2 15.3 286 1-332 93-382 (385) 77 protein:vir:191 Length: 385 # 99.3 1.3E-13 8.1E-17 91.2 15.3 286 1-332 93-382 (385) 78 protein:vir:78830 Length: 324 99.3 1.2E-13 7.3E-17 91.5 14.9 284 1-332 1-313 (324) 79 protein:vir:96392 Length: 324 99.3 1.2E-13 7.3E-17 91.5 14.9 284 1-332 1-313 (324) 80 protein:vir:97053 Length: 390 99.3 9.8E-14 6.1E-17 91.9 14.4 284 1-332 102-390 (390) 81 protein:vir:4339 Length: 395 # 99.3 2.4E-13 1.5E-16 89.7 15.8 289 1-332 98-393 (395) 82 protein:vir:2344 Length: 397 # 99.3 2.2E-13 1.4E-16 89.9 15.4 287 1-332 1-304 (397) 83 protein:vir:96223 Length: 324 99.3 1.7E-13 1.1E-16 90.6 14.7 286 1-332 1-313 (324) 84 protein:vir:104085 Length: 320 99.3 5E-13 3.1E-16 88.0 16.8 295 1-332 1-315 (320) 85 protein:vir:99749 Length: 324 99.3 3.3E-13 2.1E-16 89.0 15.3 281 1-332 18-313 (324) 86 protein:vir:98339 Length: 415 99.2 6.1E-13 3.8E-16 87.6 15.8 291 1-332 109-402 (415) 87 protein:vir:79987 Length: 415 99.2 6.1E-13 3.8E-16 87.6 15.8 291 1-332 109-402 (415) 88 protein:vir:81100 Length: 415 99.2 6.1E-13 3.8E-16 87.6 15.8 291 1-332 109-402 (415) 89 protein:vir:81070 Length: 390 99.2 3.5E-13 2.2E-16 88.9 14.5 284 1-332 101-390 (390) 90 protein:vir:4511 Length: 409 # 99.2 2.4E-13 1.5E-16 89.7 13.6 296 1-332 96-404 (409) 91 protein:vir:103955 Length: 324 99.2 5.6E-13 3.5E-16 87.7 15.2 281 1-332 18-313 (324) 92 protein:vir:4700 Length: 415 # 99.2 1E-12 6.3E-16 86.3 16.1 291 1-332 109-402 (415) 93 protein:vir:4600 Length: 415 # 99.2 1E-12 6.3E-16 86.3 16.1 291 1-332 109-402 (415) 94 protein:vir:4226 Length: 326 # 99.2 9.5E-13 5.9E-16 86.5 15.9 293 1-332 1-321 (326) 95 protein:vir:9410 Length: 415 # 99.2 5.4E-13 3.3E-16 87.9 14.4 293 1-332 107-402 (415) 96 protein:vir:8102 Length: 543 # 99.2 7.5E-13 4.7E-16 87.1 14.3 294 1-332 237-540 (543) 97 protein:vir:99920 Length: 311 99.2 2.9E-12 1.8E-15 83.8 17.3 291 1-332 1-310 (311) 98 protein:vir:80376 Length: 435 99.2 5.3E-12 3.3E-15 82.4 18.3 295 1-332 105-431 (435) 99 protein:vir:2430 Length: 318 # 99.2 2.6E-12 1.6E-15 84.1 16.6 289 1-332 6-311 (318) 100 protein:vir:100247 Length: 425 99.2 2.3E-12 1.4E-15 84.4 16.1 295 1-332 108-422 (425) 101 protein:vir:97148 Length: 324 99.2 1.5E-12 9.2E-16 85.4 15.0 285 1-332 1-313 (324) 102 protein:vir:485 Length: 407 # 99.2 2.1E-12 1.3E-15 84.6 15.7 294 1-332 90-398 (407) 103 protein:vir:1328 Length: 392 # 99.2 1E-12 6.3E-16 86.3 14.0 290 1-332 97-391 (392) 104 protein:vir:4856 Length: 293 # 99.2 1.9E-12 1.2E-15 84.8 15.3 274 7-332 1-279 (293) 105 protein:vir:100135 Length: 418 99.2 1.4E-12 8.7E-16 85.6 14.0 284 1-332 121-413 (418) 106 protein:vir:5739 Length: 366 # 99.2 4.2E-12 2.6E-15 83.0 16.3 292 1-332 52-364 (366) 107 protein:vir:1433 Length: 435 # 99.2 1.1E-11 6.6E-15 80.8 18.1 295 1-332 101-431 (435) 108 protein:vir:6242 Length: 390 # 99.1 1.8E-12 1.1E-15 85.0 13.7 288 1-332 93-387 (390) 109 protein:vir:104256 Length: 458 99.1 1.9E-12 1.2E-15 84.8 12.7 295 1-332 143-456 (458) 110 protein:vir:4456 Length: 401 # 99.1 5.4E-12 3.4E-15 82.4 14.3 293 1-332 91-399 (401) 111 protein:vir:105038 Length: 428 99.1 4.5E-11 2.8E-14 77.3 18.7 292 1-332 113-426 (428) 112 protein:vir:101607 Length: 379 99.1 2.1E-11 1.3E-14 79.1 16.8 281 1-332 89-377 (379) 113 protein:vir:4830 Length: 397 # 99.1 1.6E-11 1E-14 79.7 16.0 282 1-332 98-383 (397) 114 protein:vir:4997 Length: 397 # 99.1 2.9E-11 1.8E-14 78.4 17.2 282 1-332 91-383 (397) 115 protein:vir:94673 Length: 419 99.1 6.1E-12 3.8E-15 82.1 13.4 292 1-332 110-415 (419) 116 protein:vir:81227 Length: 413 99.0 1.5E-11 9.6E-15 79.9 14.1 292 1-332 107-408 (413) 117 protein:vir:93616 Length: 645 99.0 4.7E-11 2.9E-14 77.2 16.4 293 1-332 321-637 (645) 118 protein:vir:4953 Length: 397 # 99.0 5.2E-11 3.2E-14 77.0 16.6 282 1-332 91-383 (397) 119 protein:vir:6212 Length: 434 # 99.0 3.3E-11 2E-14 78.1 15.3 289 1-332 131-431 (434) 120 protein:vir:1383 Length: 421 # 99.0 1.8E-11 1.1E-14 79.5 13.6 273 1-332 104-381 (421) 121 protein:vir:3870 Length: 400 # 99.0 1.3E-11 8.2E-15 80.2 12.5 274 1-332 120-397 (400) 122 protein:vir:81160 Length: 371 99.0 6.9E-11 4.3E-14 76.3 16.5 287 1-332 76-369 (371) 123 protein:vir:3991 Length: 404 # 99.0 8.8E-11 5.4E-14 75.7 16.4 282 1-332 105-391 (404) 124 protein:vir:1268 Length: 397 # 99.0 6.9E-11 4.3E-14 76.3 15.7 280 1-332 102-395 (397) 125 protein:vir:102944 Length: 330 99.0 1.5E-11 9.5E-15 79.9 12.0 280 1-332 1-307 (330) 126 protein:vir:100172 Length: 394 99.0 8.5E-11 5.3E-14 75.8 16.1 278 1-332 101-382 (394) 127 protein:vir:1583 Length: 351 # 98.9 6.7E-11 4.1E-14 76.4 14.1 282 1-332 1-305 (351) 128 protein:vir:7409 Length: 408 # 98.9 7.5E-11 4.6E-14 76.1 14.3 280 1-332 105-391 (408) 129 protein:vir:96762 Length: 632 98.9 4.5E-11 2.8E-14 77.3 12.8 279 1-332 347-631 (632) 130 protein:vir:100884 Length: 389 98.9 2E-10 1.2E-13 73.8 15.9 276 1-332 99-380 (389) 131 protein:vir:102119 Length: 404 98.9 9.2E-11 5.7E-14 75.6 13.9 296 1-332 92-398 (404) 132 protein:vir:1025 Length: 408 # 98.9 2.1E-10 1.3E-13 73.7 15.2 282 1-332 101-391 (408) 133 protein:vir:8420 Length: 477 # 98.9 8.2E-10 5.1E-13 70.4 17.4 298 1-332 137-469 (477) 134 protein:vir:9704 Length: 394 # 98.8 1.7E-10 1.1E-13 74.1 13.6 272 1-332 115-388 (394) 135 protein:vir:101650 Length: 497 98.8 2.3E-10 1.4E-13 73.5 13.9 293 1-332 138-491 (497) 136 protein:vir:7855 Length: 497 # 98.8 2.3E-10 1.4E-13 73.5 13.9 293 1-332 138-491 (497) 137 protein:vir:2504 Length: 305 # 98.8 4E-10 2.5E-13 72.1 15.3 278 1-332 1-296 (305) 138 protein:vir:5974 Length: 324 # 98.8 1.7E-09 1E-12 68.7 17.7 279 1-332 1-301 (324) 139 protein:vir:93696 Length: 364 98.8 1.1E-09 6.6E-13 69.8 15.8 300 1-332 1-359 (364) 140 protein:vir:105004 Length: 392 98.8 1.3E-09 8.2E-13 69.3 16.3 281 1-332 84-382 (392) 141 protein:vir:107593 Length: 392 98.8 1.3E-09 8.2E-13 69.3 16.3 281 1-332 84-382 (392) 142 protein:vir:102873 Length: 392 98.8 1.3E-09 8.2E-13 69.3 16.3 281 1-332 84-382 (392) 143 protein:vir:102082 Length: 392 98.8 1.3E-09 8.2E-13 69.3 16.3 281 1-332 84-382 (392) 144 protein:vir:95875 Length: 401 98.8 4.7E-09 2.9E-12 66.2 19.2 318 1-332 1-398 (401) 145 protein:vir:95376 Length: 425 98.8 2.3E-10 1.4E-13 73.4 11.8 292 1-332 119-419 (425) 146 protein:vir:3845 Length: 395 # 98.8 4.9E-10 3E-13 71.6 13.5 279 1-332 98-381 (395) 147 protein:vir:4092 Length: 390 # 98.7 5.9E-10 3.7E-13 71.2 12.7 289 1-332 69-366 (390) 148 protein:vir:105610 Length: 430 98.7 1.2E-08 7.3E-12 64.0 19.4 312 1-332 1-422 (430) 149 protein:vir:2770 Length: 318 # 98.7 2.2E-09 1.4E-12 68.0 15.2 251 1-272 1-318 (318) 150 protein:vir:1084 Length: 437 # 98.7 2.3E-09 1.4E-12 67.9 15.3 278 1-332 141-428 (437) 151 protein:vir:108211 Length: 318 98.7 3.9E-09 2.4E-12 66.7 15.4 291 1-332 1-315 (318) 152 protein:vir:9361 Length: 402 # 98.7 1.9E-09 1.2E-12 68.4 13.6 273 1-332 114-396 (402) 153 protein:vir:78640 Length: 352 98.6 6.3E-09 3.9E-12 65.6 15.7 273 1-332 64-344 (352) 154 protein:vir:93881 Length: 387 98.6 3.4E-09 2.1E-12 67.0 14.0 272 1-332 100-381 (387) 155 protein:vir:96978 Length: 387 98.6 4.1E-09 2.6E-12 66.5 13.6 273 1-332 99-379 (387) 156 protein:vir:2685 Length: 387 # 98.6 4.1E-09 2.6E-12 66.5 13.6 273 1-332 99-379 (387) 157 protein:vir:94424 Length: 387 98.6 4.1E-09 2.6E-12 66.5 13.6 273 1-332 99-379 (387) 158 protein:vir:962 Length: 397 # 98.6 2.4E-09 1.5E-12 67.8 12.0 273 1-332 121-397 (397) 159 protein:vir:9927 Length: 295 # 98.5 3.2E-08 2E-11 61.6 16.1 261 1-332 1-288 (295) 160 protein:vir:819 Length: 404 # 98.4 6.3E-08 3.9E-11 60.1 16.5 321 1-332 1-401 (404) 161 protein:vir:3298 Length: 404 # 98.4 6.3E-08 3.9E-11 60.1 16.5 321 1-332 1-401 (404) 162 protein:vir:10123 Length: 404 98.4 6.3E-08 3.9E-11 60.1 16.5 321 1-332 1-401 (404) 163 protein:vir:104439 Length: 404 98.4 6.3E-08 3.9E-11 60.1 16.5 321 1-332 1-401 (404) 164 protein:vir:106647 Length: 303 98.4 6.2E-08 3.8E-11 60.1 14.9 264 1-332 1-296 (303) 165 protein:vir:101291 Length: 381 98.3 2.2E-08 1.4E-11 62.6 11.8 281 1-332 57-368 (381) 166 protein:vir:9509 Length: 381 # 98.3 2.2E-08 1.4E-11 62.6 11.8 281 1-332 57-368 (381) 167 protein:vir:3158 Length: 321 # 98.3 7.8E-08 4.8E-11 59.5 13.5 292 1-332 1-309 (321) 168 protein:vir:9643 Length: 377 # 98.3 4.8E-08 3E-11 60.7 12.2 284 1-332 59-375 (377) 169 protein:vir:100632 Length: 381 98.2 5.5E-08 3.4E-11 60.4 12.1 284 1-332 57-366 (381) 170 protein:vir:9875 Length: 296 # 98.1 4.2E-07 2.6E-10 55.5 13.7 262 1-332 1-295 (296) 171 protein:vir:4197 Length: 314 # 98.1 4.8E-07 3E-10 55.2 13.9 294 1-332 1-311 (314) 172 protein:vir:78350 Length: 383 98.0 1.1E-07 6.6E-11 58.8 9.0 283 1-332 64-373 (383) 173 protein:vir:98635 Length: 377 97.8 6E-07 3.7E-10 54.7 9.9 279 1-332 59-375 (377) 174 protein:vir:80128 Length: 466 97.8 4E-07 2.5E-10 55.7 8.6 281 1-332 141-446 (466) 175 protein:vir:95963 Length: 395 97.7 1.2E-06 7.5E-10 53.0 10.5 285 1-332 62-374 (395) 176 protein:vir:4159 Length: 315 # 97.6 3.7E-06 2.3E-09 50.3 12.1 296 1-332 1-315 (315) 177 protein:vir:80446 Length: 367 97.6 2.7E-05 1.7E-08 45.6 16.8 289 1-332 1-346 (367) 178 protein:vir:79928 Length: 393 97.4 7.4E-06 4.6E-09 48.7 11.5 300 1-332 28-375 (393) 179 protein:vir:79548 Length: 652 96.1 0.00021 1.3E-07 40.8 9.9 296 1-331 331-652 (652) 180 protein:vir:94933 Length: 330 96.1 0.0011 6.6E-07 36.9 14.9 281 1-332 25-327 (330) 181 protein:vir:97255 Length: 310 95.4 0.0021 1.3E-06 35.2 16.3 287 1-332 1-308 (310) 182 protein:vir:97397 Length: 517 95.1 0.002 1.2E-06 35.4 11.9 275 1-327 229-517 (517) 183 protein:vir:80068 Length: 301 94.7 0.0036 2.3E-06 34.0 17.2 280 14-332 1-301 (301) 184 protein:vir:95512 Length: 693 94.2 0.0015 9.1E-07 36.1 9.1 297 1-332 366-691 (693) 185 protein:vir:103285 Length: 296 90.8 0.019 1.2E-05 30.0 16.3 274 1-332 1-296 (296) 186 protein:vir:3969 Length: 287 # 90.6 0.02 1.2E-05 29.9 14.4 263 24-332 1-283 (287) 187 protein:vir:78387 Length: 349 90.3 0.022 1.3E-05 29.7 19.8 286 1-332 1-326 (349) 188 protein:vir:94989 Length: 349 86.3 0.046 2.9E-05 27.9 20.9 285 1-332 1-326 (349) 189 protein:vir:107687 Length: 319 85.4 0.053 3.3E-05 27.6 14.9 288 1-332 1-319 (319) 190 protein:vir:8324 Length: 410 # 84.5 0.06 3.7E-05 27.3 12.6 279 1-332 85-410 (410) 191 protein:vir:4074 Length: 480 # 83.4 0.069 4.3E-05 27.0 14.7 268 1-332 171-475 (480) 192 protein:vir:104342 Length: 314 80.9 0.091 5.6E-05 26.3 13.7 289 1-332 3-314 (314) 193 protein:vir:4786 Length: 295 # 80.2 0.097 6E-05 26.1 11.0 270 1-322 1-295 (295) 194 protein:vir:79078 Length: 307 79.3 0.11 6.6E-05 25.9 9.8 287 1-332 1-305 (307) 195 protein:vir:94528 Length: 286 77.3 0.13 7.8E-05 25.5 15.1 260 1-332 1-282 (286) 196 protein:vir:5942 Length: 523 # 70.6 0.21 0.00013 24.3 12.9 307 1-332 162-519 (523) 197 protein:vir:99424 Length: 360 64.9 0.29 0.00018 23.5 14.3 292 1-332 1-354 (360) 198 protein:vir:95131 Length: 325 61.6 0.35 0.00022 23.1 17.0 276 1-332 1-304 (325) 199 protein:vir:79642 Length: 329 57.9 0.42 0.00026 22.6 15.9 284 1-332 17-326 (329) 200 protein:vir:98871 Length: 314 57.1 0.44 0.00027 22.5 14.2 282 1-332 1-310 (314) 201 protein:vir:107732 Length: 379 49.0 0.65 0.00041 21.6 14.1 292 1-332 56-379 (379) 202 protein:vir:99888 Length: 309 38.7 1.1 0.00065 20.5 11.8 280 1-332 1-296 (309) 203 protein:vir:107882 Length: 307 35.9 1.2 0.00075 20.1 10.4 285 1-332 1-305 (307) 204 protein:vir:78558 Length: 336 30.6 1.6 0.00097 19.5 10.7 285 1-332 31-336 (336) 205 protein:vir:78148 Length: 123 28.6 0.67 0.00041 21.5 3.1 110 198-332 1-121 (123) 206 protein:vir:103181 Length: 457 21.8 2.5 0.0016 18.4 13.5 301 1-332 97-436 (457) 207 protein:vir:106734 Length: 336 21.6 2.6 0.0016 18.3 9.4 284 1-332 31-336 (336) 208 protein:vir:103886 Length: 302 20.1 2.8 0.0018 18.1 15.3 267 1-326 1-302 (302) No 1 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=100.00 E-value=5.8e-107 Score=603.02 Aligned_cols=332 Identities=100% Similarity=1.391 Sum_probs=323.8 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecccceeeeeecCCCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLSAGYHTPGTP 80 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG~~t~~~~~~g~~ 80 (332) ||.++|+|||||.+.|..++++|.++|||||+|+|||+++|++.|+++++++.|++++|+|||||++|++++++|++|++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~~~r~i~~G~tv~i~~ig~~~~~~~~~g~~ 80 (332) T protein:vir:78 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLSAGYHTPGTP 80 (332) T ss_pred CcccccccCCccccCCccccccccchhhhhhhhhhhHHHHHHHHhhhhhccccccccccceEEEEeccceeEeeecCCCC Confidence 99999999999998888888888889999999999999999999999999999999999999999999999999999999 Q ss_pred CCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccce Q lcl|Aclame:pro 81 IVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGGFH 160 (332) Q Consensus 81 ~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~~~~ 160 (332) +.+++++++++++|+||+.+|++|.|||+|++|+++|+|+++++++|++||+++|++|++++++++++.+++.+.+++.. T Consensus 81 l~~~~~~~~~~~~l~ID~~ky~~~~VddiD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~~~~~~~g~~~ 160 (332) T protein:vir:78 81 IVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGGFH 160 (332) T ss_pred CCCCCCCCCceEEEEEehhhhhHHHHHhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCcccccccccc Confidence 98877799999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeeeeceE Q lcl|Aclame:pro 161 VNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIAGIR 240 (332) Q Consensus 161 i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i~G~~ 240 (332) +.++++.++++.++|++|++|+++|+|++||.+|||+||+|++|+.||+++|++|+|++++++++.+++|+.|++++||+ T Consensus 161 ~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~d~~~~n~~~~~~~~~~~~g~~i~~i~G~~ 240 (332) T protein:vir:78 161 VNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIAGIR 240 (332) T ss_pred cccCCccccCHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHHhhcCceeeeeeccccccceecceeeeEEeeeE Confidence 99999999999999999999999999999999999999999999999998899999999999999999998799999999 Q ss_pred EEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHHhCCce Q lcl|Aclame:pro 241 ILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAMGCGS 320 (332) Q Consensus 241 V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~~G~~v 320 (332) ||+|||||..+++.+..++.+|.++.|+++|++++|++||++|+++++++++++|+++.++++++|+|+|+|+++||+++ T Consensus 241 V~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~~~~~~~~~d~i~~~~~~G~~v 320 (332) T protein:vir:78 241 ILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAMGCGS 320 (332) T ss_pred EEecCccccCcccccccccccccccccccccccceEEeecccceeeeeeeccchhhhhcccchhhhHhhhhhhhhhcCce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred echhheeeeecC Q lcl|Aclame:pro 321 LRTSVAGSFQAA 332 (332) Q Consensus 321 lrpe~~v~i~~A 332 (332) +||||+++|++| T Consensus 321 ~rPe~~v~l~~a 332 (332) T protein:vir:78 321 LRTSVAGSFQAA 332 (332) T ss_pred ecccceEEEeeC Confidence 999999999999 No 2 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=100.00 E-value=2e-97 Score=550.67 Aligned_cols=329 Identities=42% Similarity=0.722 Sum_probs=289.4 Q ss_pred CCCcccccccccccccccc--cccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecccceeeeeecCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARN--ADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLSAGYHTPG 78 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~--~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG~~t~~~~~~g 78 (332) ||+ +|++|+++.|.+... +..++++|||||+|+|||+++|++.|+++++++.|++++|||++||++|++++++|+|| T Consensus 1 ~~~-~~~~~~~~~n~~t~~~~~~~~~~~al~le~f~geV~~~f~~~si~~~~~~~rti~~Gksv~f~~iG~~t~~~~t~G 79 (375) T protein:vir:10 1 MAN-ANQVALGRSNLSTGTGYGGATDKYALYLKLFSGEMFKGFQHETIARDLVTKRTLKNGKSLQFIYTGRMTSSFHTPG 79 (375) T ss_pred Ccc-ccccccCccccCCccccccccchHHHHHHHHhHHHHHHHHHHHhhhccccccccccCceEEEEeeeeeEEeeecCC Confidence 888 677777754422211 12234789999999999999999999999999999999999999999999999999999 Q ss_pred CCCCccC--CCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc- Q lcl|Aclame:pro 79 TPIVGDA--GIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGE- 155 (332) Q Consensus 79 ~~~~~~~--~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~- 155 (332) ++|.++. ++++++++|+||+.+|++|.|||+|++|+++|+|+++++|+|++||+++|++|++++++++++..++.+. T Consensus 80 ~~i~~~~~~d~~~te~~l~ID~~~y~~~~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~~p~~~~~ 159 (375) T protein:vir:10 80 TPILGNADKAPPVAEKTIVMDDLLISSAFVYDLDETLAHYELRGEISKKIGYALAEKYDRLIFRSITRGARSASPVSATN 159 (375) T ss_pred cCcCCccccCCCCCceEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Confidence 9998764 6778999999999999999999999999999999999999999999999999999999999998876653 Q ss_pred ---cccceeccccc----cccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcC-chhhccccccccccc Q lcl|Aclame:pro 156 ---PGGFHVNIGAG----NTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVD-TNILNREIGNSQGDM 227 (332) Q Consensus 156 ---~~~~~i~~~~~----~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d-~~~~~~d~~~~~~~~ 227 (332) +++..+..+++ .+++++++|++|++++++|+|++||.+|||+||+|++|++||+++| ++|+|+++.+ ++.. T Consensus 160 ~~~~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~d~~~~~n~d~~~-~~~~ 238 (375) T protein:vir:10 160 FVEPGGTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEKGVSSQGRCAVLNPRQYYALIQDIGSNGLVNRDVQG-SALQ 238 (375) T ss_pred ccccCcceeeeccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeChHHHHHHHhcCCccceeeecccc-ccee Confidence 45556554333 3467999999999999999999999999999999999999998766 4799999854 4555 Q ss_pred cccceeeeeeceEEEeeCccccccccccccc-----------------------ccccccccccccc---cceEEEeech Q lcl|Aclame:pro 228 NSGKGLYSIAGIRILKSNNLAGLYGQDLSSA-----------------------AVTGENNDYQVDA---SALAGLIFHR 281 (332) Q Consensus 228 ~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~-----------------------~~~g~~~~y~~~~---~~~~~l~~h~ 281 (332) .+| .+++++||+||+|||+|..+++++..+ .++|++|+|.+++ ++++|++||| T Consensus 239 ~~g-~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~~~~~~~~~~ 317 (375) T protein:vir:10 239 SGN-GVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAKSCGLIFQK 317 (375) T ss_pred ccc-eEEEEeceEEEEeccccccccccccccccccccchhhhhccccccCCcceeeccccccccccccccCceEEEEEch Confidence 555 489999999999999999888766543 3456789999999 9999999999 Q ss_pred hhhhhhhhccceeeeeecccchhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 282 EAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 282 ~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) +|++++|++++++|+++.+++++||+|+|+++|+|||+++||||||+|++. T Consensus 318 ~A~g~v~~~~~~~~~~~~~~~~~~q~~~i~~~~a~G~~~lrp~~av~l~~~ 368 (375) T protein:vir:10 318 EAAGVVEAIGPQVQVTNGDVSVIYQGDVILGRMAMGADYLNPAAAVELYIG 368 (375) T ss_pred hheeeeeeeccccccccchhhheeeeeeeeeeeeeccCccCceeEEEEecC Confidence 999999999999999999999999999999999999999999999999988 No 3 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=100.00 E-value=5.9e-93 Score=526.23 Aligned_cols=315 Identities=20% Similarity=0.229 Sum_probs=281.2 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecccceeeeeecCCCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLSAGYHTPGTP 80 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG~~t~~~~~~g~~ 80 (332) |||++++|||+ |||..++ ++||||+|+|||+++|++.++|++++++|++++|||+|||++|+.++.+|+||++ T Consensus 1 ms~~~~~tr~~---~~~s~~d----~al~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~iG~~~~~~~~pG~~ 73 (335) T protein:vir:63 1 MSFLNDLTRPN---YAGKNAD----VDIHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRLGNVEAKGRRAGEE 73 (335) T ss_pred CCCcccchhhh---cccccch----hheehhhhhhhHHHHHHhhhhhccccceeeeccceeEEEeeeeeeeeecccCCcC Confidence 99999999996 6666554 3699999999999999999999999999999999999999999999999999999 Q ss_pred CCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc--- Q lcl|Aclame:pro 81 IVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPG--- 157 (332) Q Consensus 81 ~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~--- 157 (332) |.++ ++.+++++|+||+++|++++|||+|++|++||+|+|+++|+|++||+++||++++++++++++.+++...++ T Consensus 74 l~~~-~~~~~k~~itVD~ll~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~~~~ 152 (335) T protein:vir:63 74 LERS-RVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSP 152 (335) T ss_pred cCCC-CccccceEEEecceeechhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCCCcCC Confidence 9986 578899999999999999999999999999999999999999999999999999999999999887765443 Q ss_pred c-c-eecc-ccccccCHHHHHHHHHHHHHHHHhcCCCcCC---CEEEEChHHHHHHHhhcCchhhccccccccc--cccc Q lcl|Aclame:pro 158 G-F-HVNI-GAGNTNDAQAIVDGFFEAAAVLDERSAPQEG---RVAVLSPRQYYSLISSVDTNILNREIGNSQG--DMNS 229 (332) Q Consensus 158 ~-~-~i~~-~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~g---R~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~--~~~~ 229 (332) + . .+.+ +.+.+++++.++++|++|.++|+|++||+++ ||++|+|++|++||+ +++|+|++|+++++ .+.+ T Consensus 153 G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~--~~~l~n~~~~~s~~~~~~~~ 230 (335) T protein:vir:63 153 GVLEKLDLTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLE--HDKLMNVEYQATGATNDYVK 230 (335) T ss_pred CcceeeeeccCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhc--cccccccccccccccccccC Confidence 2 1 1222 3445567999999999999999999999765 999999999999997 48999999987665 3566 Q ss_pred cceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHH Q lcl|Aclame:pro 230 GKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDL 309 (332) Q Consensus 230 g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~ 309 (332) |+ |.+++||+|++|||||..+++.+.. ...+|.|+++++++++++||++|++++|++++++|.|++++ +|+|+ T Consensus 231 g~-v~~v~Gv~V~~sn~lP~~~~t~~~l---g~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~---~~~~~ 303 (335) T protein:vir:63 231 SR-VAILNGVKVLETPRFATKAIAAHPL---GRHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNE---KFSWV 303 (335) T ss_pred ce-eEEeeceEEEeeccCCCCCcccccc---cccCCccccccceeEEEEEecceEEEEEEeecccceeeccc---hhhHH Confidence 64 9999999999999999887766643 34567899999999999999999999999999999998754 58999 Q ss_pred HHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 310 IVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 310 i~~~~~~G~~vlrpe~~v~i~~A 332 (332) |+++++|||+++|||||++|++= T Consensus 304 i~~~~a~G~g~lRPe~a~~i~~t 326 (335) T protein:vir:63 304 LDTFQMYNIGARRPDTAGAIELK 326 (335) T ss_pred hHHHHHcCCcccccceEEEEEEc Confidence 99999999999999999999886 No 4 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=100.00 E-value=3.2e-92 Score=522.21 Aligned_cols=323 Identities=24% Similarity=0.371 Sum_probs=276.0 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecccceeeeeecCCCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLSAGYHTPGTP 80 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG~~t~~~~~~g~~ 80 (332) |+|..+.+.||....++.++.++ ++|||||+|+|||+++|++.|+|+++|+.|++++|||++||++|++++.+|+||++ T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~-~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~g~s~~~~~iG~~~~~~~~~G~~ 79 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGD-KLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVLGRTQAAYLAPGEN 79 (344) T ss_pred CccccccccCCcccCCccCCccc-hhHHHHHHHHHHHHHHHHHHhhhcccceeeeecccceEEEEeeceeEEEeeecCCC Confidence 99987777777654444444444 78999999999999999999999999999999999999999999999999999999 Q ss_pred CCcc-CCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc- Q lcl|Aclame:pro 81 IVGD-AGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGG- 158 (332) Q Consensus 81 ~~~~-~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~~- 158 (332) |.+. +++++++++|+||+.+|++|.|||+|++|+++|+|+++++|+|++||+++|++|++++++++....+....+++ T Consensus 80 l~~t~~~~~~~e~~l~ID~~~y~~~~VdDiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~~~~~~~~g~ 159 (344) T protein:vir:10 80 LDDIRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESQYNENITGL 159 (344) T ss_pred CCCCCCCcccceEEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccc Confidence 9875 67899999999999999999999999999999999999999999999999999999999998887777665443 Q ss_pred ---ceeccc------cccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccc Q lcl|Aclame:pro 159 ---FHVNIG------AGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNS 229 (332) Q Consensus 159 ---~~i~~~------~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~ 229 (332) ..+... +..++.++.+|++|++|+++|+|++||.+|||+||+|++|++||+ +++|++.++++ ++.+++ T Consensus 160 ~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~--~~~~~~~~~~~-~~~~~~ 236 (344) T protein:vir:10 160 GTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPSSDRVFYCDPDSYSAILA--ALMPNAANYAA-LIDPEK 236 (344) T ss_pred cccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCCEEEeChHHHHHHhh--ccccccccccc-ccceee Confidence 222211 122344678999999999999999999999999999999999986 47888888764 566888 Q ss_pred cceeeeeeceEEEeeCccccccccccccccccccc--------ccccccccceEEEeechhhhhhhhhccceeeeeeccc Q lcl|Aclame:pro 230 GKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGEN--------NDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDF 301 (332) Q Consensus 230 g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~--------~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~ 301 (332) |. |++++||+||+|||||......+ ..+.+|.. ..|.++|++++||+|||+|+++++++++++|.+|+ T Consensus 237 G~-V~~v~G~~V~~Sn~lp~~~~~~~-~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~~~e~~r~-- 312 (344) T protein:vir:10 237 GS-IRNVMGFEVVEVPHLTAGGAGTS-REGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLALERARR-- 312 (344) T ss_pred eE-EEEEeceEEEeccccccccCCcc-cccccCccccccCCcccceeeecceeEEEeechhhhhhhhhccceeecccc-- Confidence 85 99999999999999997543332 22233333 34556899999999999999999999999999875 Q ss_pred chhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 302 NVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 302 ~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) +++|+|+|+|+|+||||++||||+++|.-+ T Consensus 313 -~~~~~d~i~g~~~~G~~vlRPe~a~~v~~~ 342 (344) T protein:vir:10 313 -ANFQADQIIAKYAMGHGGLRPEAAGAVVFK 342 (344) T ss_pred -hhHHHHHHHHHhhcccceecccceEEEEee Confidence 678999999999999999999999777666 No 5 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=100.00 E-value=2.9e-91 Score=516.96 Aligned_cols=322 Identities=26% Similarity=0.418 Sum_probs=272.4 Q ss_pred CCCcccccccc-cccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecccceeeeeecCCC Q lcl|Aclame:pro 1 MTTLSNFSLPN-QANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLSAGYHTPGT 79 (332) Q Consensus 1 m~~~~~~~r~~-~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG~~t~~~~~~g~ 79 (332) |+|.....++| +.+|||++++ .+|||||+|+|||+++|++.|+|+++|+.|++++|||++||++|++++.+|+||+ T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d---~~al~ie~~~geV~~~f~~~s~~~~~~~~rti~~G~sv~~~~iG~~~~~~~~~G~ 77 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGD---KLALFLKVFGGEVLTAFTRTSVTMNKHLVRSIQSGKSAQFPVLGRTKAAYLQPGE 77 (347) T ss_pred CCccccccccccccccCCcccc---hHHHHHHHHhHHHHHHHHHHHhhhhhhhheeccccceEEeeeccceeEeeeecCc Confidence 99877544443 1235555554 4589999999999999999999999999999999999999999999999999999 Q ss_pred CCCcc-CCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc----ccc Q lcl|Aclame:pro 80 PIVGD-AGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASP----VTG 154 (332) Q Consensus 80 ~~~~~-~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~----~~~ 154 (332) ++.++ +++++++++|+||+++|++|+|||+|++|+++|+|+++++++|++||+++|++|++++.+++++..+ ..+ T Consensus 78 ~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~~~~~~g 157 (347) T protein:vir:94 78 NLDDKRKDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTANNENIAG 157 (347) T ss_pred CCCCCcCCccccceEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Confidence 99875 5789999999999999999999999999999999999999999999999999999999998876543 445 Q ss_pred ccccceecccc------ccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhcccccccccccc Q lcl|Aclame:pro 155 EPGGFHVNIGA------GNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMN 228 (332) Q Consensus 155 ~~~~~~i~~~~------~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~ 228 (332) .+++..+.++. +.+.++.++|++|++|.++|+|++||++|||+||+|++|+.||+..+..+.+ + ++...+. T Consensus 158 ~~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~R~~vv~P~~y~~LLk~~~~~~~~--~-~~~~~~~ 234 (347) T protein:vir:94 158 LGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSDRVFYTTPDNYSAILAALMPNAAN--Y-QALIDPS 234 (347) T ss_pred CCcceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCCCCCEEEeChHHHHHHHHhhcccccc--c-ccccccc Confidence 55555554432 2345688999999999999999999999999999999999999764444444 3 2234577 Q ss_pred ccceeeeeeceEEEeeCcccccccccccccc-----------cccccccccccccceEEEeechhhhhhhhhccceeeee Q lcl|Aclame:pro 229 SGKGLYSIAGIRILKSNNLAGLYGQDLSSAA-----------VTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTT 297 (332) Q Consensus 229 ~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~-----------~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~ 297 (332) +|+ |++++||+||+|||+|...++.+...+ ..+.+++|.++|+++++|+||++|++++|++++++|.+ T Consensus 235 ~G~-V~~v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~~~~~e~~ 313 (347) T protein:vir:94 235 TGS-IRNVMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLKDMALERA 313 (347) T ss_pred cce-eEEeeceEEEEcCccccccCcccccccccccccccccccccccccccccccceEEEEechhhhhhhhhcccceeee Confidence 785 999999999999999976544444332 13456789999999999999999999999999999998 Q ss_pred ecccchhHHHHHHHHHHHhCCceechhheee--eecC Q lcl|Aclame:pro 298 SGDFNVQYQGDLIVGKLAMGCGSLRTSVAGS--FQAA 332 (332) Q Consensus 298 ~~~~~~~~~~d~i~~~~~~G~~vlrpe~~v~--i~~A 332 (332) |+ +++|+|+|+++++|||+++||||+|+ +.+| T Consensus 314 ~~---~~~~~~~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 314 RR---ANFQADQIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred ec---hhhhhhhhhhhhhhcCcccccceeEEEEecCC Confidence 64 67899999999999999999999984 4555 No 6 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=100.00 E-value=2.8e-91 Score=517.04 Aligned_cols=315 Identities=19% Similarity=0.219 Sum_probs=277.2 Q ss_pred CCCc--ccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecccceeeeeecCC Q lcl|Aclame:pro 1 MTTL--SNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLSAGYHTPG 78 (332) Q Consensus 1 m~~~--~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG~~t~~~~~~g 78 (332) |+|| +.++||+ |+|.+++ ++||||+|+|||+++|++.++|+++++.|++++|||+|||++|++++++|+|| T Consensus 1 m~~~~~~~~t~~~---~~~~~~~----~~l~le~~~geV~~af~~~s~~~~~~~~r~i~~G~s~~~~~iG~~~~~~~~~g 73 (334) T protein:vir:80 1 MTYPAANTHTRPG---WGGANSD----VSLHIEEHLGLVDASFMYSSKFASWMNVRSLRGTNQLRVDRVGASTIAGRKAG 73 (334) T ss_pred CCCCcCCCccccc---cccccch----heehhhhhhhHHHHHHHHhhhhhccceeeeccccceEEEeeecceeeeeecCC Confidence 9999 5678885 6666554 46999999999999999999999999999999999999999999999999999 Q ss_pred CCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc-- Q lcl|Aclame:pro 79 TPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEP-- 156 (332) Q Consensus 79 ~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~-- 156 (332) ++|++. ++++++++|+||+.+|++++|||+|++|+++|+|+++++|+|++||+++||+|++++++++++..+....+ T Consensus 74 ~~l~~~-~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~~~~ 152 (334) T protein:vir:80 74 EELVVQ-KNVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLKPAF 152 (334) T ss_pred CCCCCC-CcccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Confidence 999874 78999999999999999999999999999999999999999999999999999999999999887664332 Q ss_pred --c-cceecc---ccccccCHHHHHHHHHHHHHHHHhcCCCc---CCCEEEEChHHHHHHHhhcCchhhccccccccc-- Q lcl|Aclame:pro 157 --G-GFHVNI---GAGNTNDAQAIVDGFFEAAAVLDERSAPQ---EGRVAVLSPRQYYSLISSVDTNILNREIGNSQG-- 225 (332) Q Consensus 157 --~-~~~i~~---~~~~~~~~~~~~d~i~~a~~~Lde~~VP~---~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~-- 225 (332) | ...+.+ ++..+++++.+++++++|++.|+|++||+ .|||+||+|++|++||+ +++|+|+||+++++ T Consensus 153 ~~G~~~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~--~~r~~n~d~~~s~~~~ 230 (334) T protein:vir:80 153 HDGILLPSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLE--HDRLMNVEFGAKEGGN 230 (334) T ss_pred cCCcceeecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhc--ccccccceeccccccc Confidence 2 222221 23455789999999999999999999994 67999999999999997 58999999977543 Q ss_pred cccccceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhH Q lcl|Aclame:pro 226 DMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQY 305 (332) Q Consensus 226 ~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~ 305 (332) .+.+| .|++++||+||+|||+|..+++.+.. .+..+.|+++|+++++++||++|++++|++++++|.||+ +++ T Consensus 231 ~~~~g-~i~~v~G~~V~~Sn~~P~~~~t~~~~---g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~---~~~ 303 (334) T protein:vir:80 231 SFVGG-RIAMLNGVRVVETPRFPQSAITANAL---GADFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFWEE---KKD 303 (334) T ss_pred cccce-eEEEEeceEEEeecCCCCcccccccc---ccccccccccccceEEEEEeCceEEEEEEeecceeeeec---hhh Confidence 44555 49999999999999999877665543 467789999999999999999999999999999999876 457 Q ss_pred HHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 306 QGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 306 ~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) |+|+|+++++|||+++||||++++.== T Consensus 304 ~~d~i~~~~a~G~g~lRPeaa~vv~~~ 330 (334) T protein:vir:80 304 FGHYLDTFQSYNIGQRRPDAVAVHDIT 330 (334) T ss_pred HHHHHHHHHHcCCceeccceEEEEEEe Confidence 999999999999999999999987755 No 7 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=100.00 E-value=1.9e-91 Score=517.92 Aligned_cols=315 Identities=20% Similarity=0.219 Sum_probs=280.4 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecccceeeeeecCCCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLSAGYHTPGTP 80 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG~~t~~~~~~g~~ 80 (332) ||||+++|||+ |||+.++ ++||||+|+|||+++|++.++|++++++|++++|||+|||++|+.++.||+||++ T Consensus 1 ms~~~~~t~~~---~~~s~~d----~al~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~iG~~~~~~~~pG~~ 73 (335) T protein:vir:78 1 MSFLNDLTRPN---YAGKNAD----VDIHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRLGNVEAKGRRAGEE 73 (335) T ss_pred CCccccccccc---cccccch----hhhhhhhhhhHHHHHHHHhhhhccccceeeeccceeEEEeeeeeeeecccccCcc Confidence 99999999996 6666554 4699999999999999999999999999999999999999999999999999999 Q ss_pred CCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc--- Q lcl|Aclame:pro 81 IVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPG--- 157 (332) Q Consensus 81 ~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~--- 157 (332) |++. .+.+++++|+||+++|++++|||+|++|+|||+|+++++|+|++||+++||++++++++++++..++...++ T Consensus 74 l~~~-~~~~~k~~itID~ll~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~~~~ 152 (335) T protein:vir:78 74 LERS-RVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSP 152 (335) T ss_pred cCCC-CcccCCeEEEecceeechhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCcCC Confidence 9975 688899999999999999999999999999999999999999999999999999999999998877764433 Q ss_pred -cce-ecc-ccccccCHHHHHHHHHHHHHHHHhcCCCcC---CCEEEEChHHHHHHHhhcCchhhccccccccc--cccc Q lcl|Aclame:pro 158 -GFH-VNI-GAGNTNDAQAIVDGFFEAAAVLDERSAPQE---GRVAVLSPRQYYSLISSVDTNILNREIGNSQG--DMNS 229 (332) Q Consensus 158 -~~~-i~~-~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~---gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~--~~~~ 229 (332) +.. +.+ +.+.+++++.++++++++.+.|+|++||+. ||+++|+|++|++||+ +++|+|++|+++++ .+.+ T Consensus 153 G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~--~~~l~n~~~~~s~~~~~~~~ 230 (335) T protein:vir:78 153 GVLEKLDLTGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLE--HDKLMSVEYQATGATNDYVK 230 (335) T ss_pred CcceeeeeccccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhc--cccccccccccccccccccc Confidence 222 122 344557899999999999999999999965 6999999999999997 48999999987665 3566 Q ss_pred cceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHH Q lcl|Aclame:pro 230 GKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDL 309 (332) Q Consensus 230 g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~ 309 (332) |+ |++++||+|++|||||..+++.+.. ...+|.|+.++++++|++||++|+++++++++++|.+++++ +|+|+ T Consensus 231 g~-v~~v~Gv~V~~Sn~lP~~~~t~~~l---g~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~~---~~~~~ 303 (335) T protein:vir:78 231 SR-VAILNGVKVLETPRFATKAISAHPL---GRHFNVSAEEAERQIALFLPSKTLITAQVAPVQAKLWEDHD---QFSWV 303 (335) T ss_pred ce-eEEeeceEEEeeccCCCCCCccccc---cccCCcccccccceEEEEEecceEEEEEEEecccceeeccc---hhhHh Confidence 65 9999999999999999887766644 23467888899999999999999999999999999988754 58999 Q ss_pred HHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 310 IVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 310 i~~~~~~G~~vlrpe~~v~i~~A 332 (332) |+++++|||+++|||||++|++= T Consensus 304 i~~~~a~G~g~lRPe~a~~i~~t 326 (335) T protein:vir:78 304 LDTFQMYNIGARRPDTAGAIELK 326 (335) T ss_pred hhHHHHcCCcccCcceEEEEEec Confidence 99999999999999999999876 No 8 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=100.00 E-value=4.6e-91 Score=515.86 Aligned_cols=323 Identities=25% Similarity=0.378 Sum_probs=277.1 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecccceeeeeecCCCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLSAGYHTPGTP 80 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG~~t~~~~~~g~~ 80 (332) |+++.+.+|||..++.|-++ .++++|||||+|+|||+++|++.|+++++|+.|++++|||++||++|++++.+|+||++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~-~~~~~al~le~f~geV~~~f~~~s~~~~~~~~r~i~~gks~~~~~iG~~~~~~~~~G~~ 79 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVVA-AGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVLGRTQAAYLAPGEN 79 (345) T ss_pred Ccccccchhccccccccccc-CCchhHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEEeeecceEEEeeecCCC Confidence 99999999999755555443 44578999999999999999999999999999999999999999999999999999999 Q ss_pred CCcc-CCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc Q lcl|Aclame:pro 81 IVGD-AGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGGF 159 (332) Q Consensus 81 ~~~~-~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~~~ 159 (332) |++. .++++++++|+||+.+|++|+|||+|++|+++|+|+++++|+|++||+++|++|+++++++++..+++.+.+++. T Consensus 80 l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~~~~~~~ 159 (345) T protein:vir:22 80 LDDKRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESKYNENIEGL 159 (345) T ss_pred CCCCCCCcccceEEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccc Confidence 9874 468899999999999999999999999999999999999999999999999999999999999888887666543 Q ss_pred e----ecccc------ccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccc Q lcl|Aclame:pro 160 H----VNIGA------GNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNS 229 (332) Q Consensus 160 ~----i~~~~------~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~ 229 (332) . +..++ ...+.+.++|++|++|+++|+|++||.+|||+||+|++|++||+ +++|++.++++. +..++ T Consensus 160 ~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~--~~~~~~~~~~~~-~~~~~ 236 (345) T protein:vir:22 160 GTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAADRVFYCDPDSYSAILA--ALMPNAANYAAL-IDPEK 236 (345) T ss_pred ccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCccCCEEEeChHHHHHHhc--cccccccccccc-ccccc Confidence 2 11111 12245678999999999999999999999999999999999985 478988888654 55788 Q ss_pred cceeeeeeceEEEeeCccccccccccccc---------ccccccccccccccceEEEeechhhhhhhhhccceeeeeecc Q lcl|Aclame:pro 230 GKGLYSIAGIRILKSNNLAGLYGQDLSSA---------AVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGD 300 (332) Q Consensus 230 g~~v~~i~G~~V~~sn~lp~~~g~~~~~~---------~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~ 300 (332) |. |++++||+||+|||||...++....+ ...|..+ |...+++++|++|||+|+++++++++++|.+|+ T Consensus 237 G~-V~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~g~~~-~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~- 313 (345) T protein:vir:22 237 GS-IRNVMGFEVVEVPHLTAGGAGTAREGTTGQKHVFPANKGEGN-VKVAKDNVIGLFMHRSAVGTVKLRDLALERARR- 313 (345) T ss_pred ce-EEEEeceEEEecccccccccCccccCccccccccccccccee-eeeccCceEEEEEehhheeeeeeecceeeeeec- Confidence 85 99999999999999997544433322 1122222 445568899999999999999999999999885 Q ss_pred cchhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 301 FNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 301 ~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) +++|+|+|+|+++|||+++||||+++|..= T Consensus 314 --~~~~~d~I~~~~a~G~~vlRPeaa~~i~~~ 343 (345) T protein:vir:22 314 --ANFQADQIIAKYAMGHGGLRPEAAGAVVFK 343 (345) T ss_pred --hhHHHHHHHHHHhcCCcccccceeEEEEEe Confidence 568999999999999999999999998777 No 9 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=100.00 E-value=1.3e-91 Score=518.87 Aligned_cols=318 Identities=14% Similarity=0.157 Sum_probs=274.1 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecccceeeeeecCCCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLSAGYHTPGTP 80 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG~~t~~~~~~g~~ 80 (332) ||+++.++||+ |+++. + ++|||||+|+|||+++|++.|++++++++|++++|||++||++|++++++|+||+. T Consensus 1 ms~~n~~t~~~---~~~~~---~-~~al~le~f~geV~taf~~~s~~~~~~~~rti~~gkS~q~~~iG~~~~~~~~~G~~ 73 (364) T protein:vir:10 1 MSNPNVLTQPA---VSASG---E-VDSLLIEKFNNRVHEQYLKGENLLQWFDVQEVVGTNSVSNKYIGETELQVLSPGKS 73 (364) T ss_pred CCCcccccccc---ccccc---c-hhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEeeeeeeeEEeeeccCcc Confidence 99999999996 44432 2 46899999999999999999999999999999999999999999999999999999 Q ss_pred CCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc-----ccc Q lcl|Aclame:pro 81 IVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYS-TRAEVSKQIGEALATHYDERIARVLAKASAEASP-----VTG 154 (332) Q Consensus 81 ~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d-~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~-----~~~ 154 (332) |++ +++.+++++|+||+++|++++|+|+|++|+||| +|+++++|+|++||+++|++|++++.+++..... ..+ T Consensus 74 ld~-~~~~~~k~~itID~ll~a~~~V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~~~ 152 (364) T protein:vir:10 74 PDA-SPTEFDKNRLVVDTTVIARNTVAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNPRV 152 (364) T ss_pred cCC-CCcccCcEEEEecceeeechhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCCcc Confidence 986 578899999999999999999999999999999 8999999999999999999999888766533211 123 Q ss_pred ccccceecc---ccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccc-ccccccc Q lcl|Aclame:pro 155 EPGGFHVNI---GAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNS-QGDMNSG 230 (332) Q Consensus 155 ~~~~~~i~~---~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~-~~~~~~g 230 (332) .+++..+.+ ....++++..++++|++|.++|+|++||.+|||+||+|++||.||+ +++|+|++|+.+ .+.+.+| T Consensus 153 ~~~g~~i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~~R~~vv~P~~y~~Ll~--~~~lvn~d~~~~~~~~~~~G 230 (364) T protein:vir:10 153 AGHGFSIHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDTSELCGLMPWTAFNCLRD--ADRIVDKSYTIAASDNTVDG 230 (364) T ss_pred cCCcceeeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCccccEEEeChHHHHHHhc--CCccccccccccCCCccccc Confidence 334444433 2345678899999999999999999999999999999999999996 589999999754 3556777 Q ss_pred ceeeeeeceEEEeeCccccccccccccc-------cccccccccc--ccccceEEEeechhhhhhhhhccceeeeeeccc Q lcl|Aclame:pro 231 KGLYSIAGIRILKSNNLAGLYGQDLSSA-------AVTGENNDYQ--VDASALAGLIFHREAAGCIQSVAPTIQTTSGDF 301 (332) Q Consensus 231 ~~v~~i~G~~V~~sn~lp~~~g~~~~~~-------~~~g~~~~y~--~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~ 301 (332) + |++++||+|++|||||..++....++ ..+|.++.|. ++++++++++|||+|++++|++++++|.|++++ T Consensus 231 ~-v~~v~Gv~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e~~~~~~ 309 (364) T protein:vir:10 231 F-VLKSWNTPIVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISITGDIFYEKK 309 (364) T ss_pred e-eEEEeceEEEeccccccccccccccccccccccccccCCcccccccccceeEEEEEecceEEEEEEecceeeeeeccc Confidence 5 89999999999999998655433322 3456678887 677799999999999999999999999998755 Q ss_pred chhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 302 NVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 302 ~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) + |+|+|+++++|||+++|||||++|.++ T Consensus 310 ~---~~~~ida~~a~G~g~lRPeaa~~i~~~ 337 (364) T protein:vir:10 310 E---KTWYIDTFLAEGAIPDRWEAVAVVTAA 337 (364) T ss_pred e---eeeeeeeehcccCcccCccceEEEEec Confidence 4 778899999999999999999999999 No 10 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=100.00 E-value=9.4e-90 Score=508.66 Aligned_cols=321 Identities=26% Similarity=0.392 Sum_probs=268.0 Q ss_pred CCCccccccccc-ccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecccceeeeeecCCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQ-ANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLSAGYHTPGT 79 (332) Q Consensus 1 m~~~~~~~r~~~-~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG~~t~~~~~~g~ 79 (332) |+|. +.+|+|- .+||++++ | +.|||||+|.||||++|+++|+|+++|+.|++++|||+|||++|++++++|+||+ T Consensus 1 m~~~-~~~~~~t~~g~~~~~~--d-~~al~ik~f~~eV~~~f~~~s~~~~~~~~r~i~~G~sv~i~~iG~~tv~~~t~G~ 76 (347) T protein:vir:94 1 MANV-PGQKIGTDQGKGKSSS--D-ALALFLKVFAGEVLTAFTRRSVTADKHIVRTIQNGKSAQFPVMGRTSGVYLAPGE 76 (347) T ss_pred CCCC-CccccccccccCCccc--c-HHHHHHHHHhHHHHHHHHHHHhhhcccccccccccceEEEecccceeeeeecCCC Confidence 8874 4455542 13555544 4 5789999999999999999999999999999999999999999999999999999 Q ss_pred CCCcc-CCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc----cc Q lcl|Aclame:pro 80 PIVGD-AGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPV----TG 154 (332) Q Consensus 80 ~~~~~-~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~----~~ 154 (332) +|++. +++++++++|+||+++|++|.|||+|++|+++|+|+++++|+|++||+++|++|++++.+.+....+. .+ T Consensus 77 ~l~~~~~~~~~~e~~itID~~~~~~~~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~~~~~g 156 (347) T protein:vir:94 77 RLSDKRKGIKHTEKVITIDGLLTADVMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAASNENIAG 156 (347) T ss_pred CcCCCCCCCCcceEEEEecchhhhhHHhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCC Confidence 99763 56899999999999999999999999999999999999999999999999999999998766543322 22 Q ss_pred ccccceecccc-----ccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccc Q lcl|Aclame:pro 155 EPGGFHVNIGA-----GNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNS 229 (332) Q Consensus 155 ~~~~~~i~~~~-----~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~ 229 (332) ..+++.+..+. ..+..+++++++|++|+++|+|++||.+|||+||+|++|+.||+ +.+|.+.++.+ ++.+.+ T Consensus 157 ~~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~--~~~~~~~~~~~-~~~~~~ 233 (347) T protein:vir:94 157 LGTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILA--ALMPNAANYAA-LIDPET 233 (347) T ss_pred CcccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhc--cchhhhhhccc-cccccc Confidence 22333333221 12244688999999999999999999999999999999999985 46788877654 567888 Q ss_pred cceeeeeeceEEEeeCccccccccccccc----cccccc--------ccccccccceEEEeechhhhhhhhhccceeeee Q lcl|Aclame:pro 230 GKGLYSIAGIRILKSNNLAGLYGQDLSSA----AVTGEN--------NDYQVDASALAGLIFHREAAGCIQSVAPTIQTT 297 (332) Q Consensus 230 g~~v~~i~G~~V~~sn~lp~~~g~~~~~~----~~~g~~--------~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~ 297 (332) |+ |++++||+||+|||||..+.+.+..+ ..+|+. ..|.++|+++++|+|||+|++++|++++++|.+ T Consensus 234 G~-Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~ 312 (347) T protein:vir:94 234 GN-IRNVMGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDLALERD 312 (347) T ss_pred cc-eEEEeceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhcccccccch Confidence 95 99999999999999997555444332 223332 368899999999999999999999999999976 Q ss_pred ecccchhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 298 SGDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 298 ~~~~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) | ++++|+|+|+|+++||||++||||+|+|... T Consensus 313 r---~~~~~~d~i~~~~~~G~~~~rP~~a~~~~~~ 344 (347) T protein:vir:94 313 R---DVDAQGDLIVGKYAMGHGGLRPEAAGALVFS 344 (347) T ss_pred h---chhhHHHHhhhhhhhcCcccccceeEEEEec Confidence 6 4678999999999999999999999998877 No 11 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=100.00 E-value=2.7e-89 Score=506.14 Aligned_cols=322 Identities=27% Similarity=0.386 Sum_probs=274.1 Q ss_pred CCCccccccccc-ccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecccceeeeeecCCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQ-ANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLSAGYHTPGT 79 (332) Q Consensus 1 m~~~~~~~r~~~-~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG~~t~~~~~~g~ 79 (332) |+|.....|++- .+||++ .+| ++|||||+|+|||+++|++.|+|+++|+.|++++|||+|||++|++++.+|++|+ T Consensus 1 ~a~~~~~~~~~~~~g~~~~--~~d-~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~G~sv~~~~iG~~~~~~~~~g~ 77 (347) T protein:vir:88 1 MANATGGQQIGANQGKGQS--AAD-KLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTKGYYLAPGE 77 (347) T ss_pred CCCcccchhhhccCCCCcc--ccc-hHHHHHHHHHHHHHHHHHHHhhhhhccccccccCcceEEEeeecceeeeeecccc Confidence 998776555541 134544 444 5789999999999999999999999999999999999999999999999999999 Q ss_pred CCCcc-CCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc----ccc Q lcl|Aclame:pro 80 PIVGD-AGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASP----VTG 154 (332) Q Consensus 80 ~~~~~-~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~----~~~ 154 (332) +++++ +++++++++|+||+++|++|+|||+|++|+++|+|+++++++|++||+++|++|++++.++++.... +.+ T Consensus 78 ~l~~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~~~~~~~g 157 (347) T protein:vir:88 78 NLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAG 157 (347) T ss_pred CCCCCCCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCC Confidence 98875 5789999999999999999999999999999999999999999999999999999999998775433 333 Q ss_pred ccccceeccccc-----cccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccc Q lcl|Aclame:pro 155 EPGGFHVNIGAG-----NTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNS 229 (332) Q Consensus 155 ~~~~~~i~~~~~-----~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~ 229 (332) ..++..+.++++ ...+++.+|++|++|+++|+|++||.+|||+||+|++|++||+ ++++.+.++. +++.+++ T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~--~~~~~~~~~~-~~~~~~~ 234 (347) T protein:vir:88 158 LGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILS--ALMPNAANYA-ALIDPET 234 (347) T ss_pred ccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCHHHHHHHhc--chhhhhhhhc-cccchhc Confidence 334444444332 2345677899999999999999999999999999999999996 4678777774 4567888 Q ss_pred cceeeeeeceEEEeeCccccccccccc-----------ccccccccccccccccceEEEeechhhhhhhhhccceeeeee Q lcl|Aclame:pro 230 GKGLYSIAGIRILKSNNLAGLYGQDLS-----------SAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTS 298 (332) Q Consensus 230 g~~v~~i~G~~V~~sn~lp~~~g~~~~-----------~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~ 298 (332) |+ |++++||+|++|||+|......++ .+...+....|..+++++++|+||++|+++++++++++|.+| T Consensus 235 G~-vg~i~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~d~~~e~~r 313 (347) T protein:vir:88 235 GN-IRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERAR 313 (347) T ss_pred ce-eeeeccceEEEeecccccccccccccccccccccccccccccccccccccCcEEEEEechhhhhheecccceeeeee Confidence 85 999999999999999964433322 222345677899999999999999999999999999999886 Q ss_pred cccchhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 299 GDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 299 ~~~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) + +++|+|+|+|+++||++++||||+|+|.+- T Consensus 314 ~---~~~~~d~i~~~~~~G~~~~rPe~a~~~~~~ 344 (347) T protein:vir:88 314 R---PEFQADQIIGKYAMGHGGLRPEAAGALVFT 344 (347) T ss_pred c---hhhHHHHhhhhhhhcCceeccceEEEEEeC Confidence 5 668999999999999999999999998876 No 12 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=100.00 E-value=4e-89 Score=505.23 Aligned_cols=322 Identities=26% Similarity=0.381 Sum_probs=273.7 Q ss_pred CCCcccccccc-cccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecccceeeeeecCCC Q lcl|Aclame:pro 1 MTTLSNFSLPN-QANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLSAGYHTPGT 79 (332) Q Consensus 1 m~~~~~~~r~~-~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG~~t~~~~~~g~ 79 (332) |+|+....++| +.+|||++++. +|||||+|+|||+++|+++|+++++++.|++++|||+|||++|++++.+|++|+ T Consensus 1 ~~~~~~~~~~~t~~g~~~~~~~~---~al~ie~~~g~V~~~f~~~s~~~~~v~~r~~~~G~sv~i~~iG~~t~~~~~~g~ 77 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQSAADK---LALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIGRTKAAYLKPGE 77 (347) T ss_pred CCCCccCcccccccccCCcccch---HHHHHHHHHHHHHHHHHHHHhhhhhhccccccccceeEeeeccceeeeeecCCC Confidence 99876544332 12466665554 579999999999999999999999999999999999999999999999999999 Q ss_pred CCCcc-CCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc-cccc---- Q lcl|Aclame:pro 80 PIVGD-AGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEA-SPVT---- 153 (332) Q Consensus 80 ~~~~~-~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~-~~~~---- 153 (332) +++++ +++++++++|+||+++|++|.|||+|++|+++|+|+++++++|++||+++|++|++++.++.... .++. T Consensus 78 ~l~~~~~~~~~~e~~ltiD~~~y~~~~VddiD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~ 157 (347) T protein:vir:33 78 NLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGSNENIEG 157 (347) T ss_pred CCCCCCCCCccceEEEEechhhhhhHHHhhHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccc Confidence 98764 56899999999999999999999999999999999999999999999999999999987765432 1221 Q ss_pred -ccccccee-ccccccc----cCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccc Q lcl|Aclame:pro 154 -GEPGGFHV-NIGAGNT----NDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDM 227 (332) Q Consensus 154 -~~~~~~~i-~~~~~~~----~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~ 227 (332) +.+.+..+ ..+++.+ .++.++|++|++|+++|+|++||.+|||+||+|++|+.||+ +++|+++++.+ .+.+ T Consensus 158 ~~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~--~~~~~~~d~~~-~~~~ 234 (347) T protein:vir:33 158 LGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILA--ALMPNAANYQA-LLDP 234 (347) T ss_pred ccccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCcEEEeCHHHHHHHhc--ccccccccccc-cccc Confidence 12222222 2233333 34678999999999999999999999999999999999996 47999999964 5678 Q ss_pred cccceeeeeeceEEEeeCccccccccccccccccccccccc--------ccccceEEEeechhhhhhhhhccceeeeeec Q lcl|Aclame:pro 228 NSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQ--------VDASALAGLIFHREAAGCIQSVAPTIQTTSG 299 (332) Q Consensus 228 ~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~--------~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~ 299 (332) .+|. |++++||+||+|||||..+++.+..++.+|..+.|+ +.|+..+||+||++|+|+++++++++|..|+ T Consensus 235 ~~G~-V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e~~r~ 313 (347) T protein:vir:33 235 ERGT-IRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARR 313 (347) T ss_pred ccce-eEEEeceeEEEecccccCccccccccccccccccccCCcccceeccccceeeeeecchhheeeeeeceeeeeccc Confidence 8885 999999999999999999888888888888877754 5566678999999999999999999999875 Q ss_pred ccchhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 300 DFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 300 ~~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) ++||+|+|+|+++||+|++||||+|+|.-= T Consensus 314 ---~~~~~d~i~~~~~~G~~vlrP~~av~i~~~ 343 (347) T protein:vir:33 314 ---ANYQADQIIAKYAMGHGGLRPEAAGAIVLP 343 (347) T ss_pred ---hhhhhHhhhhhhhcCCceecccceEEEecC Confidence 678999999999999999999999999544 No 13 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=100.00 E-value=1.9e-88 Score=501.55 Aligned_cols=318 Identities=13% Similarity=0.135 Sum_probs=268.2 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecccceeeeeecCCCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLSAGYHTPGTP 80 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG~~t~~~~~~g~~ 80 (332) ||+++.++||+ |+++. + +++||||+|+|||+++|++.|++++++++|++++|||++||++|++++++|+||+. T Consensus 1 Ms~~n~~t~~~---~~~s~---~-~~al~le~f~geV~taF~~~si~~~~~~vrti~~GkS~qf~~iG~~~a~y~~~G~~ 73 (402) T protein:vir:97 1 MSTPNTLTNVA---VSASG---E-VDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGETELQVLAPGQS 73 (402) T ss_pred CCCcccccccc---ccccc---c-hhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEEEEEeeeEEeeeccccc Confidence 99999999996 44432 2 46899999999999999999999999999999999999999999999999999999 Q ss_pred CCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc-cccccccc- Q lcl|Aclame:pro 81 IVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYS-TRAEVSKQIGEALATHYDERIARVLAKASAEA-SPVTGEPG- 157 (332) Q Consensus 81 ~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d-~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~-~~~~~~~~- 157 (332) |++ +++.+++++|+||+++|++++|+|+|++|+||| +|+++++|+|++||+.+||+|++++..+++.. .++...++ T Consensus 74 ldg-~~~~~~k~~ItID~lL~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~~~ 152 (402) T protein:vir:97 74 PNA-TPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRV 152 (402) T ss_pred cCC-CCcccccEEEEeCceeechhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCcc Confidence 986 578899999999999999999999999999999 89999999999999999999998887666532 12221111 Q ss_pred ---cceecc---ccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccc-ccccccc Q lcl|Aclame:pro 158 ---GFHVNI---GAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNS-QGDMNSG 230 (332) Q Consensus 158 ---~~~i~~---~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~-~~~~~~g 230 (332) +....+ ....++++..++++|+++.++|+|++||.+||+++|+|++|+.||+ +++|+|++|+.+ .+.+.+| T Consensus 153 ~~~g~s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~dRv~vv~P~~y~~Ll~--~~rl~n~d~~~~~~g~~~~G 230 (402) T protein:vir:97 153 KGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRD--ADRIVDKTYTISQSGATING 230 (402) T ss_pred cccccccccccccchhhcCHHHHHHHHHHHHHHHHhcCCCccccEEEeChHHHHHHhh--cccccchhhccccCCccccc Confidence 112222 2334688999999999999999999999999999999999999996 589999999643 3456666 Q ss_pred ceeeeeeceEEEeeCcccccc--cccccc-cccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHH Q lcl|Aclame:pro 231 KGLYSIAGIRILKSNNLAGLY--GQDLSS-AAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQG 307 (332) Q Consensus 231 ~~v~~i~G~~V~~sn~lp~~~--g~~~~~-~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~ 307 (332) + |++++||+||+|||||+.+ .+.+.. .+.+|...+|.+++++++|++|||+|++++|+++++.+.|++ +++|+ T Consensus 231 ~-v~~v~Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~vT~~~~~d---~r~~~ 306 (402) T protein:vir:97 231 F-VLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYE---KKEKT 306 (402) T ss_pred e-eEEEeceEEEecCccccccccccccccccCCCCccCCcCcccceeEEEEEecceEEEEEeeccccchhhc---hhHHH Confidence 5 9999999999999999753 222222 223344445559999999999999999999999999999886 45688 Q ss_pred HHHHHHHHhCCceechhheeeee------cC Q lcl|Aclame:pro 308 DLIVGKLAMGCGSLRTSVAGSFQ------AA 332 (332) Q Consensus 308 d~i~~~~~~G~~vlrpe~~v~i~------~A 332 (332) |+|+++++|||+++||||++++. +| T Consensus 307 ~~id~~~a~G~g~~RPeaa~vv~~~~~~t~~ 337 (402) T protein:vir:97 307 YYIDTFMAEGAIPDRWEAVSVVTTKRDATTG 337 (402) T ss_pred HHHHHHHHhCCcccCccceEEEEEecccccc Confidence 89999999999999999999983 22 No 14 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=100.00 E-value=2.7e-87 Score=495.16 Aligned_cols=319 Identities=26% Similarity=0.386 Sum_probs=269.7 Q ss_pred CCCcccc----cccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecccceeeeeec Q lcl|Aclame:pro 1 MTTLSNF----SLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLSAGYHT 76 (332) Q Consensus 1 m~~~~~~----~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG~~t~~~~~ 76 (332) |+|+... |||+ ||+.+++ .+|||||+|+|+|+++|++.|+++++++.|++++|||+|||++|++++++|+ T Consensus 1 ma~~~~~~~~~t~~~---~~~~~~~---~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~~~~~G~sv~i~~ig~~t~~~~~ 74 (347) T protein:vir:15 1 MANIQGGQQIGTNQG---KGQSAAD---KLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIGRTKAAYLK 74 (347) T ss_pred CCccccCCccccccc---cCCCcch---HHHHHHHHHHHHHHHHHHHhhhhhhccccccccccceeEeeeccceeeeeec Confidence 7776643 5554 5555444 4689999999999999999999999999999999999999999999999999 Q ss_pred CCCCCCcc-CCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc---cc Q lcl|Aclame:pro 77 PGTPIVGD-AGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEAS---PV 152 (332) Q Consensus 77 ~g~~~~~~-~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~---~~ 152 (332) +|++++++ +++++++++|+||+++|++|+|||+|++|+++|+|+++++++|++||+++|++|++++.+++.+.. .. T Consensus 75 ~g~~l~~~~~~~~~~e~~ltID~~~~~~~~VddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~ 154 (347) T protein:vir:15 75 PGENLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDASNEN 154 (347) T ss_pred cCCCCCCCCCCCccceEEEEechhhhhhHHhhhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Confidence 99998764 468999999999999999999999999999999999999999999999999999999987754432 11 Q ss_pred ccccccce----eccccccccC----HHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhcccccccc Q lcl|Aclame:pro 153 TGEPGGFH----VNIGAGNTND----AQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQ 224 (332) Q Consensus 153 ~~~~~~~~----i~~~~~~~~~----~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~ 224 (332) ...+++.. ...+++..++ +.+++++|++|+++|+|++||.+|||+||+|++|+.||+ +++|++.++.+. T Consensus 155 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL~--~~~~~~~d~~~~- 231 (347) T protein:vir:15 155 IEGLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILA--ALMPNAANYQAL- 231 (347) T ss_pred ccccCccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhc--cccccccccccc- Confidence 11222222 2233344444 567899999999999999999999999999999999996 478999998654 Q ss_pred ccccccceeeeeeceEEEeeCccccccccccccccccccccccc--------ccccceEEEeechhhhhhhhhccceeee Q lcl|Aclame:pro 225 GDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQ--------VDASALAGLIFHREAAGCIQSVAPTIQT 296 (332) Q Consensus 225 ~~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~--------~~~~~~~~l~~h~~a~~~~~~~~~~~e~ 296 (332) +.+++|. |++++||+||+|||||..+++.+...+.+|..+.|. +.|+..++|+||++|++++|++++++|. T Consensus 232 ~~~~~G~-Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e~ 310 (347) T protein:vir:15 232 IDHERGT-IRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALER 310 (347) T ss_pred ccccceE-EEEEeceEEEecccccccccccccccccccccccccccccceeeeccccceeeeeccceeeeeEeeceeeee Confidence 5688885 999999999999999988877776666777766654 4566778999999999999999999998 Q ss_pred eecccchhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 297 TSGDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 297 ~~~~~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) .|+ ++||+|+|+++|+||||++||||+|+|.-= T Consensus 311 ~~~---~~~~~d~i~~~~~~G~~vlrP~~av~~~~~ 343 (347) T protein:vir:15 311 ARR---ANYQADQIIAKYAMGHGGLRPEAAGAIVLP 343 (347) T ss_pred ccc---chhhhhhhehhhhcCCceeccccEEEEecC Confidence 875 678999999999999999999999999443 No 15 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=100.00 E-value=9e-87 Score=492.32 Aligned_cols=317 Identities=15% Similarity=0.123 Sum_probs=266.0 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecccceeeeeecCCCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLSAGYHTPGTP 80 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG~~t~~~~~~g~~ 80 (332) ||++++++||+ |+|.. +++|||||+|+|||+++|++.+++++++++|++++|||++||++|+.++++|+||+. T Consensus 1 Ms~~n~~t~~~---~~~sg----~~~al~Le~f~GeV~taF~~~si~~~~~~vRti~~gkS~qf~~~G~s~~~~~~pG~~ 73 (401) T protein:vir:70 1 MSTPNNLTNVA---VSASG----EVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYLGETELQVLAPGQS 73 (401) T ss_pred CCCCccccccc---ccccc----chhHhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEeeeeEeeeecCCCC Confidence 99999999997 44432 257899999999999999999999999999999999999999999999999999999 Q ss_pred CCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh-----cccccc Q lcl|Aclame:pro 81 IVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYS-TRAEVSKQIGEALATHYDERIARVLAKASAE-----ASPVTG 154 (332) Q Consensus 81 ~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d-~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~-----~~~~~~ 154 (332) |++ +++.+++++|+||+.+|++++|+|+|++|+||| +|+|+++|+|++||+++||+|++++..++.. .....+ T Consensus 74 ld~-~~~~~dK~~ItID~lL~a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~~p~~ 152 (401) T protein:vir:70 74 PAA-TSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRTNPRV 152 (401) T ss_pred cCC-CCcccccEEEEeCceeehhhhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCc Confidence 987 478899999999999999999999999999999 8999999999999999999999988766542 123344 Q ss_pred ccccceeccccc---cccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEC-hHHHHHHHhhcCchhhccccccc-cccccc Q lcl|Aclame:pro 155 EPGGFHVNIGAG---NTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLS-PRQYYSLISSVDTNILNREIGNS-QGDMNS 229 (332) Q Consensus 155 ~~~~~~i~~~~~---~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~-P~~~~~Ll~~~d~~~~~~d~~~~-~~~~~~ 229 (332) .+++..+.++.. ..++++.++++|++|+..|+|++||.. |++++. |.+|+.||+ .++++|++|+.+ .+.+.+ T Consensus 153 ~~~G~~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~~-r~vvl~pp~~Ys~Ll~--~d~L~nrd~~~s~~g~~~~ 229 (401) T protein:vir:70 153 KGHGFSINVEVAEGEALVNPQYVMAAVEFALEQQLEQEVDIS-DVAILMPWRYFNVLRD--ADRIVDKTYTISQSGATIQ 229 (401) T ss_pred CCCceEEeccccccccccCHHHHHHHHHHHHHHHHhcCCCcc-ceEEEcCHHHHHHHHh--cCcccchhhccccCCcccc Confidence 555666665433 346889999999999999999999955 677764 555556654 379999999744 355677 Q ss_pred cceeeeeeceEEEeeCcccccccc-ccccccccccccc--ccccccceEEEeechhhhhhhhhccceeeeeecccchhHH Q lcl|Aclame:pro 230 GKGLYSIAGIRILKSNNLAGLYGQ-DLSSAAVTGENND--YQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQ 306 (332) Q Consensus 230 g~~v~~i~G~~V~~sn~lp~~~g~-~~~~~~~~g~~~~--y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~ 306 (332) |+ |.+++||+||+|||||+.++. .+....++|.++. |.++++++++++|||+|++++|+++++.|.|++ +++| T Consensus 230 G~-v~~vaGv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~~lt~~~~~d---~r~~ 305 (401) T protein:vir:70 230 GF-TLSSYNCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSIDVTGDIFYE---KKEK 305 (401) T ss_pred ce-EEEEeceEEEeeccccccccccccccccccCCCccCCCCccccceeEEEEehhheEEEEeeccccchhhh---hhhh Confidence 75 889999999999999975422 1112223444444 459999999999999999999999999999876 4568 Q ss_pred HHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 307 GDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 307 ~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) +|+|+++++|||+++||||++++.++ T Consensus 306 ~~~id~~~a~g~g~~RPeaa~vv~~k 331 (401) T protein:vir:70 306 TYYIDTFMAEGAIPDRWEAVSVVTTK 331 (401) T ss_pred HHHHHHHHHhCCcccchhheEEEeec Confidence 88999999999999999999999888 No 16 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=100.00 E-value=3.3e-86 Score=489.23 Aligned_cols=318 Identities=13% Similarity=0.099 Sum_probs=262.6 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecccceeeeeecCCCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLSAGYHTPGTP 80 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG~~t~~~~~~g~~ 80 (332) ||++++++||+ |+|+. +++|||||+|+|||+++|++.+++++++++|++++|||++||++|+.++++|+||++ T Consensus 1 Ms~~n~~t~p~---~~gsg----~~~aL~Le~f~GeV~taF~~~si~~~~~~vRtI~~gkS~qf~~lG~s~a~y~~pG~~ 73 (400) T protein:vir:10 1 MSTPNNLTNVA---VSASG----EVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYLGETELQVLAPGQS 73 (400) T ss_pred CCCCccccccc---ccccc----chhhhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEeeeeEEeeecCCCC Confidence 99999999997 44432 256899999999999999999999999999999999999999999999999999999 Q ss_pred CCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhh--hccccccc-- Q lcl|Aclame:pro 81 IVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYS-TRAEVSKQIGEALATHYDERIARVLAKASA--EASPVTGE-- 155 (332) Q Consensus 81 ~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d-~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~--~~~~~~~~-- 155 (332) |++. ++.+++++|+||+++|++++|+|+|++|+||| +|+|+++|+|++||+++||++++++..+.. +..+.... T Consensus 74 ldg~-~~~~dk~~ItIDtLL~a~~~V~dlDd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~~~g 152 (400) T protein:vir:10 74 PAAT-STQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTNPRV 152 (400) T ss_pred cCCC-CcccCcEEEEeCceeeecchhhhHHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCc Confidence 9875 68899999999999999999999999999999 999999999999999999999988876643 22222211 Q ss_pred ---cccceec-cccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhcccccccc-cccccc Q lcl|Aclame:pro 156 ---PGGFHVN-IGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQ-GDMNSG 230 (332) Q Consensus 156 ---~~~~~i~-~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~-~~~~~g 230 (332) +++..+. ......++++.+.++|++|..+|+|++||.++++++++|.+|++||. .++++|++|+.++ +.+..| T Consensus 153 ~~~g~s~~v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP~~d~vvl~pp~~Ys~Ll~--~dkLvnrdf~~s~~g~~~~g 230 (400) T protein:vir:10 153 KGHGFSVNVEVNEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRD--ADRIVDKSYTISQSGATIQG 230 (400) T ss_pred cccccceeecccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHh--CCcccchhccccCCCccccc Confidence 1111221 23344478999999999999999999999664444455555556654 3699999997554 556667 Q ss_pred ceeeeeeceEEEeeCccccccccc-cccccccccccc--ccccccceEEEeechhhhhhhhhccceeeeeecccchhHHH Q lcl|Aclame:pro 231 KGLYSIAGIRILKSNNLAGLYGQD-LSSAAVTGENND--YQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQG 307 (332) Q Consensus 231 ~~v~~i~G~~V~~sn~lp~~~g~~-~~~~~~~g~~~~--y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~ 307 (332) + |.+++|++||+|||||+..+.. +.....+|.++. |.++++++++++||++|++++|+++++.|.|++ +++|+ T Consensus 231 ~-v~~v~Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~tvk~~~lt~~~~~d---~r~~~ 306 (400) T protein:vir:10 231 F-VLSSYNCPVIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALLVGRSIDVIGDIFYE---KKEKT 306 (400) T ss_pred e-EEEEeceEEEeeCcCCcccCcccccccccCCCCccCCccccccceeEEEEehhheEEEEeeccccccccc---hhhHH Confidence 5 8899999999999999754221 112223344444 459999999999999999999999999999876 55688 Q ss_pred HHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 308 DLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 308 d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) |+|+++++||++++||||++++.++ T Consensus 307 ~~id~~~a~G~g~~RPeaa~vv~~~ 331 (400) T protein:vir:10 307 YYIDTFMSEGAIPDRWEAVSVVTTK 331 (400) T ss_pred HHHHHHHHhCCcccchhheEEEEec Confidence 9999999999999999999999999 No 17 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=100.00 E-value=9.8e-77 Score=437.30 Aligned_cols=274 Identities=23% Similarity=0.376 Sum_probs=233.3 Q ss_pred ccccccccceEEEecccceeeeeecCCCCCCc-cCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHH Q lcl|Aclame:pro 52 RSYDLRGGKSKQFMFTGKLSAGYHTPGTPIVG-DAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEAL 130 (332) Q Consensus 52 ~~r~~~~G~tv~i~~iG~~t~~~~~~g~~~~~-~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aL 130 (332) ++|+|++|||++||++|++++.+|+||++|++ ++++++++++|+||+++|++|.|||+|++|++||+|+++++|+|++| T Consensus 1 ~vr~i~~g~s~~~~~iG~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~~~VdDiD~~qa~~Dlr~e~s~~~G~aL 80 (324) T protein:vir:99 1 MTRTITSGKSAQFPVMGRTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDVLIYDIEDAMNHYDVRSEYSTQMGEAL 80 (324) T ss_pred CeeeeecCceEEEeeeeeeEeccccCCCCcCCCcCCcCcccEEEEecchhhhhhhhhhHHHHhcCccchhHHHHHHHHHH Confidence 78999999999999999999999999999976 46799999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhhhcc-----ccccccccceeccccc---cccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChH Q lcl|Aclame:pro 131 ATHYDERIARVLAKASAEAS-----PVTGEPGGFHVNIGAG---NTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPR 202 (332) Q Consensus 131 a~~~D~~i~~~~~~aa~~~~-----~~~~~~~~~~i~~~~~---~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~ 202 (332) |+.+|++|++++++.+.+.. +..+.++...+.++.+ .+.+++++|++|++|+++|||++||.+|||+||+|+ T Consensus 81 A~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~Lde~~VP~~gR~~vv~P~ 160 (324) T protein:vir:99 81 AMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAFAKKYIPAGDRTFYTDPD 160 (324) T ss_pred HHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeChH Confidence 99999999999987765443 3344444444444333 345678999999999999999999999999999999 Q ss_pred HHHHHHhhcCchhhccccccccccccccceeeeeeceEEEeeCcccccccccccccc--------ccccc---ccccccc Q lcl|Aclame:pro 203 QYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAA--------VTGEN---NDYQVDA 271 (332) Q Consensus 203 ~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~--------~~g~~---~~y~~~~ 271 (332) +|++||+ +.++.+.++. +.+.+++|+ |++++||+||+|||||...++....+. .+|.. ..|+++| T Consensus 161 ~y~~Ll~--~~~~~~~~~~-~~~~~~~G~-V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ky~~d~ 236 (324) T protein:vir:99 161 TYSAILA--ALMPNAANYA-ALIDPETGN-IRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTGKMTVGA 236 (324) T ss_pred HHHHHhh--cccccccccc-cccceecce-EEEEeceEEEecCCcccccccccccccccccccccccccccccccccccc Confidence 9998875 3566766664 557788885 999999999999999987665443321 12222 2699999 Q ss_pred cceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 272 SALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 272 ~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) ++++||+||++|++++|++++++|.+|+ +++|+|+|+|+|+|||+++||||++++.-- T Consensus 237 ~~~~gl~~~~~a~~tv~~~~~~~e~~~~---~~~~~d~i~~~~a~G~~~lRPe~a~~v~l~ 294 (324) T protein:vir:99 237 DNVVGLFVHRSAVATLKLKDMALERARR---PEYQADQIIAKYAMGHGGLRPEAVGAIIFE 294 (324) T ss_pred CceeEEEEehhheEEEeeecceecceec---hhhHHHhhhhhhhhcCcccccceEEEEEEc Confidence 9999999999999999999999999985 567999999999999999999999877522 No 18 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=100.00 E-value=7.6e-70 Score=399.52 Aligned_cols=312 Identities=17% Similarity=0.166 Sum_probs=257.2 Q ss_pred CCCcccccccccccccccccccCchhhHHH-HHHhHHHHHHHHHhhhhcccccccc--ccccceEEEecccceeeeeecC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATAL-KLFSGEVFTAFNNASIFKGLVRSYD--LRGGKSKQFMFTGKLSAGYHTP 77 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~-e~f~g~V~~~f~~~s~~~~~v~~r~--~~~G~tv~i~~iG~~t~~~~~~ 77 (332) |+.-+.+|+|.- +.+.+. -|| |+|+++|++.|++.+++.++++.++ +..|+|||||++|+++++||++ T Consensus 1 ~~~~~~~~~~~~-----~t~~v~----~fipei~s~~i~~~l~~~~v~~~~~~d~~~~~~~Gdtv~ip~~g~~~~~d~~~ 71 (341) T protein:vir:94 1 MALGNTITGPSI-----NTQRGQ----QFIPEQWLSEVQMFRKAKMLDTSVVKTWGAQVKKGDTFHVPRISELGVEDKAT 71 (341) T ss_pred Ccchhhhccccc-----cchhHH----HHHHHHHHHHHHHHHHhhcchhhccccccccccCCceEEEeccCcceeeeecC Confidence 888889999874 222232 466 9999999999999999999998664 4679999999999999999999 Q ss_pred CCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 78 GTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPG 157 (332) Q Consensus 78 g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~ 157 (332) |.++.. +++++++++|+||+++|+++.|+|+|+.|+++|+|++++++++++||+++|+.|+..+..++....+...... T Consensus 72 ~~~i~~-~~~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~~~~~ 150 (341) T protein:vir:94 72 DVPVGV-QPVNDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNVFSSS 150 (341) T ss_pred CCcccc-ccccCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCccccCc Confidence 998875 5789999999999999999999999999999999999999999999999999999987665433322221111 Q ss_pred cceeccccccccC-HHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeee Q lcl|Aclame:pro 158 GFHVNIGAGNTND-AQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSI 236 (332) Q Consensus 158 ~~~i~~~~~~~~~-~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i 236 (332) ......+ ....++.|++++++|||++||.+|||+||+|++|+.||+ +++|++.++.+. +.+++|. |+++ T Consensus 151 ------~~~~t~~~~~~~~~~i~~a~~~Lde~~VP~~gR~lvv~P~~~~~Ll~--~~~~~~~~~~g~-~~l~~G~-ig~i 220 (341) T protein:vir:94 151 ------NGAITGNGQAFSFAVFLAARRLLLEADVPEEKIVLLISPGQESALFT--IPQFISKDFINN-APIAQGQ-IGSL 220 (341) T ss_pred ------cccccCchhhhhHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhh--chhhhhhhcccc-chhheee-eeeE Confidence 1111122 234589999999999999999999999999999999985 589999999765 5688886 9999 Q ss_pred eceEEEeeCccccccccccccccc-------------ccccccccccccceEEEeechhhhhhhhhccc--------eee Q lcl|Aclame:pro 237 AGIRILKSNNLAGLYGQDLSSAAV-------------TGENNDYQVDASALAGLIFHREAAGCIQSVAP--------TIQ 295 (332) Q Consensus 237 ~G~~V~~sn~lp~~~g~~~~~~~~-------------~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~--------~~e 295 (332) +||+|++||+||..+++.+..... ......|..+++..++|+||++|++.+|.+++ +.. T Consensus 221 ~G~~V~~Sn~lp~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~~~~ 300 (341) T protein:vir:94 221 MGVRVIRTSLIGNNSATGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVVSKAP 300 (341) T ss_pred eceEEEEeccccccccccccccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhhccccccc Confidence 999999999999876655433211 12234678899999999999999999997774 334 Q ss_pred eeecccchhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 296 TTSGDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 296 ~~~~~~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) ..+..++++||+|+|+|+++||||+|||||+|+|+++ T Consensus 301 ~~~~~~~~~~~~~~i~~~~~~G~~~lrp~~~v~~~~~ 337 (341) T protein:vir:94 301 RVTQSFENREQVWLMVGRQAYGARLYRPLHAVNIHTT 337 (341) T ss_pred cccccchhhhhhhhhhhhhhhcccccCcceeEEEecC Confidence 4455678899999999999999999999999999999 No 19 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=100.00 E-value=5.7e-63 Score=361.82 Aligned_cols=321 Identities=18% Similarity=0.168 Sum_probs=245.1 Q ss_pred CCCcccccccccccccccccccCchhhHHH-HHHhHHHHHHHHHhhhhccccccccc--cccceEEEecccceeeeeecC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATAL-KLFSGEVFTAFNNASIFKGLVRSYDL--RGGKSKQFMFTGKLSAGYHTP 77 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~-e~f~g~V~~~f~~~s~~~~~v~~r~~--~~G~tv~i~~iG~~t~~~~~~ 77 (332) |+++... -++...|-+.+. ...|+ |+|+++|++.|++.+++.++++.+++ +.|+|||||++|++++.+|++ T Consensus 1 ~~~~~~~--~~~~~~~~~~t~----~~~fiPev~s~~v~~~l~~~lv~~~l~~~~~~~~~~GdTV~ip~~g~~~a~d~~~ 74 (381) T protein:vir:80 1 MATIQGT--GGYKGSAVDLSN----VQVFIPEVWSSEVRMFRDQKFAALEATKKIPFEGKKGDLIHIPNISRAAVYDKQP 74 (381) T ss_pred Cceeccc--ccccCcccchhh----HHhhhhHHHHHHHHHHHHHhhhhhhccccccceeecCceEEeeccCcceeeeecC Confidence 8887622 111122222222 23455 99999999999999999999887654 679999999999999999999 Q ss_pred CCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 78 GTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPG 157 (332) Q Consensus 78 g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~ 157 (332) |.++.. +++++++++++||+++|+++.|+|+|+.|+++|+|++++++++++||+++|+.|+..+.+......+...... T Consensus 75 g~~i~~-~~~~~~~~~itID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~~t~~ 153 (381) T protein:vir:80 75 QTPVNL-QARTDSEFTFTVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRIYSYD 153 (381) T ss_pred CCcccc-cccCCceEEEEEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccc Confidence 998876 5788999999999999999999999999999999999999999999999999999988665544333221111 Q ss_pred c--ceecccc-ccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceee Q lcl|Aclame:pro 158 G--FHVNIGA-GNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLY 234 (332) Q Consensus 158 ~--~~i~~~~-~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~ 234 (332) . ....... ........+++.|++|+++|||++||.+|||+||+|++|+.||+ +++|++.++++. ..+++|. |+ T Consensus 154 ~~i~~~~~~~~~t~~~~~~t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~--~~~~~~ad~~~~-~~l~~G~-Ig 229 (381) T protein:vir:80 154 TTLGDGTVNAHLTGTPAPLTYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLS--INQFISVDFSQV-KPVTSGV-VG 229 (381) T ss_pred ccccccccccccccchhhHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhh--chhhhhhhhccc-hhhhcee-ee Confidence 0 1111111 12234567899999999999999999999999999999999995 589999998654 5689886 99 Q ss_pred eeeceEEEeeCcccccccccccccccc--c-----ccccccccccc---------------------------------- Q lcl|Aclame:pro 235 SIAGIRILKSNNLAGLYGQDLSSAAVT--G-----ENNDYQVDASA---------------------------------- 273 (332) Q Consensus 235 ~i~G~~V~~sn~lp~~~g~~~~~~~~~--g-----~~~~y~~~~~~---------------------------------- 273 (332) +++||+|++||+||...++.+...+.. + .+..|.++++. T Consensus 230 ~i~G~~Vv~Sn~lp~~~~t~~~~~agap~~~~~~~~~~~~~g~~s~~a~av~~~k~yd~~~~~~~~~~~~~~g~~~~~~~ 309 (381) T protein:vir:80 230 TILGMEVIVTTQIGINSLTGYVNGQGAPTQPTPGVLGSPYLPDQAGTANVVNTGSASDLAVSLSYFGLPVFSGAGATAAD 309 (381) T ss_pred EEcceEEEeecccccccccceeeeccccccccccccccccccccccceeeeeeeeeeceeeeeeeccceeeecceeeecC Confidence 999999999999998666555433211 1 12344444422 Q ss_pred ---eEEEe--echhhhhhh-----hhccceeeeee-cccchhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 274 ---LAGLI--FHREAAGCI-----QSVAPTIQTTS-GDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 274 ---~~~l~--~h~~a~~~~-----~~~~~~~e~~~-~~~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) ++|.+ +.+.+.+++ ++..+|++.+. ..++..|+||.|+|+++||++.+||++||+|++- T Consensus 310 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 379 (381) T protein:vir:80 310 GGQTLGSFGGANRWATAVVCHPDWLAVGVQQNVKSESSRETMYLADAFVTSCVYGAKVFRPDHCVLLHTS 379 (381) T ss_pred CCceeeeehhhhhhhhhcccccccccccceeEeecccchhheeehhhhhhhhhhccccccchhhhhhhhc Confidence 22333 333444444 45566666666 6778899999999999999999999999999999 No 20 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=100.00 E-value=1e-58 Score=338.56 Aligned_cols=267 Identities=20% Similarity=0.230 Sum_probs=222.8 Q ss_pred CCCcccccccccccccccccccCchhhHHH-HHHhHHHHHHHHHhhhhccccccc---cccccceEEEecccceeeeeec Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATAL-KLFSGEVFTAFNNASIFKGLVRSY---DLRGGKSKQFMFTGKLSAGYHT 76 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~-e~f~g~V~~~f~~~s~~~~~v~~r---~~~~G~tv~i~~iG~~t~~~~~ 76 (332) |++ .+|+ |+|+++|++.|++.+++.++++.+ +++.|+|+|||++|++++.+|+ T Consensus 1 MA~-----------------------~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~ 57 (273) T protein:vir:10 1 MAF-----------------------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYK 57 (273) T ss_pred Ccc-----------------------hhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccc Confidence 444 2465 999999999999999999998753 5778999999999999999998 Q ss_pred CCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|Aclame:pro 77 PGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEP 156 (332) Q Consensus 77 ~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~ 156 (332) +...+...+++++++++++||+.+++++.|+|+|+.|+++|+++ ++++++++||+++|++++.++..++... T Consensus 58 ~~~~~~~~~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~~~------- 129 (273) T protein:vir:10 58 AAGRQTSADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTAL------- 129 (273) T ss_pred cCCCccCccccccceEEEEEeeeeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhcccccc------- Confidence 64444445678999999999999999999999999999999865 9999999999999999998876543211 Q ss_pred ccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeee Q lcl|Aclame:pro 157 GGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSI 236 (332) Q Consensus 157 ~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i 236 (332) +.+...++.++++.|++|+++||+++||.+|||+||+|++|+.|++. +.++.+.+..+.++.+++|. |+++ T Consensus 130 -------~~~~~~~~~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~-~~~~~~~~~~~~~~~l~~G~-ig~i 200 (273) T protein:vir:10 130 -------TGSAPTDADDAFDLIAKALKELTKANVPNVGRVVVVNAEMAFWLRSS-GSKLTSADTSGDAAGLRAGT-IGNL 200 (273) T ss_pred -------ccccccchhHHHHHHHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcc-hhhhhhhhccccccceeeee-eeEE Confidence 11223456778999999999999999999999999999999999974 34566777777777788886 9999 Q ss_pred eceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHHh Q lcl|Aclame:pro 237 AGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAM 316 (332) Q Consensus 237 ~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~~ 316 (332) +||+|++||+||...+. .+++||+.|++.++.+. ++|. ++++.+|+|.|+|+++| T Consensus 201 ~G~~v~~s~~lp~~~~~---------------------~~~~~~~~A~~~a~q~~-~~e~---~r~~~~~~~~v~~~~~y 255 (273) T protein:vir:10 201 LGARIVESNNLRDTDDE---------------------QFVAFHPSAAAYVSQID-TVEA---LRDQDSFSDRIRALHVY 255 (273) T ss_pred eceEEEEecccccCCcc---------------------EEEEEeccceeeeeeee-hhhc---ccCCCcceeeeeeeeee Confidence 99999999999953211 14789999999888654 5554 45567799999999999 Q ss_pred CCceechhheeeeecC Q lcl|Aclame:pro 317 GCGSLRTSVAGSFQAA 332 (332) Q Consensus 317 G~~vlrpe~~v~i~~A 332 (332) |++++|||++++|++. T Consensus 256 g~~v~~~~~~~~l~~~ 271 (273) T protein:vir:10 256 GGKVVRPTGVVVFNKT 271 (273) T ss_pred eeeEeccceEEEEecc Confidence 9999999999999988 No 21 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=100.00 E-value=1e-58 Score=338.56 Aligned_cols=267 Identities=20% Similarity=0.230 Sum_probs=222.8 Q ss_pred CCCcccccccccccccccccccCchhhHHH-HHHhHHHHHHHHHhhhhccccccc---cccccceEEEecccceeeeeec Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATAL-KLFSGEVFTAFNNASIFKGLVRSY---DLRGGKSKQFMFTGKLSAGYHT 76 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~-e~f~g~V~~~f~~~s~~~~~v~~r---~~~~G~tv~i~~iG~~t~~~~~ 76 (332) |++ .+|+ |+|+++|++.|++.+++.++++.+ +++.|+|+|||++|++++.+|+ T Consensus 1 MA~-----------------------~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~ 57 (273) T protein:vir:10 1 MAF-----------------------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYK 57 (273) T ss_pred Ccc-----------------------hhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccc Confidence 444 2465 999999999999999999998753 5778999999999999999998 Q ss_pred CCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|Aclame:pro 77 PGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEP 156 (332) Q Consensus 77 ~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~ 156 (332) +...+...+++++++++++||+.+++++.|+|+|+.|+++|+++ ++++++++||+++|++++.++..++... T Consensus 58 ~~~~~~~~~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~~~------- 129 (273) T protein:vir:10 58 AAGRQTSADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTAL------- 129 (273) T ss_pred cCCCccCccccccceEEEEEeeeeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhcccccc------- Confidence 64444445678999999999999999999999999999999865 9999999999999999998876543211 Q ss_pred ccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeee Q lcl|Aclame:pro 157 GGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSI 236 (332) Q Consensus 157 ~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i 236 (332) +.+...++.++++.|++|+++||+++||.+|||+||+|++|+.|++. +.++.+.+..+.++.+++|. |+++ T Consensus 130 -------~~~~~~~~~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~-~~~~~~~~~~~~~~~l~~G~-ig~i 200 (273) T protein:vir:10 130 -------TGSAPTDADDAFDLIAKALKELTKANVPNVGRVVVVNAEMAFWLRSS-GSKLTSADTSGDAAGLRAGT-IGNL 200 (273) T ss_pred -------ccccccchhHHHHHHHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcc-hhhhhhhhccccccceeeee-eeEE Confidence 11223456778999999999999999999999999999999999974 34566777777777788886 9999 Q ss_pred eceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHHh Q lcl|Aclame:pro 237 AGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAM 316 (332) Q Consensus 237 ~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~~ 316 (332) +||+|++||+||...+. .+++||+.|++.++.+. ++|. ++++.+|+|.|+|+++| T Consensus 201 ~G~~v~~s~~lp~~~~~---------------------~~~~~~~~A~~~a~q~~-~~e~---~r~~~~~~~~v~~~~~y 255 (273) T protein:vir:10 201 LGARIVESNNLRDTDDE---------------------QFVAFHPSAAAYVSQID-TVEA---LRDQDSFSDRIRALHVY 255 (273) T ss_pred eceEEEEecccccCCcc---------------------EEEEEeccceeeeeeee-hhhc---ccCCCcceeeeeeeeee Confidence 99999999999953211 14789999999888654 5554 45567799999999999 Q ss_pred CCceechhheeeeecC Q lcl|Aclame:pro 317 GCGSLRTSVAGSFQAA 332 (332) Q Consensus 317 G~~vlrpe~~v~i~~A 332 (332) |++++|||++++|++. T Consensus 256 g~~v~~~~~~~~l~~~ 271 (273) T protein:vir:10 256 GGKVVRPTGVVVFNKT 271 (273) T ss_pred eeeEeccceEEEEecc Confidence 9999999999999988 No 22 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=100.00 E-value=4.2e-57 Score=329.68 Aligned_cols=266 Identities=20% Similarity=0.229 Sum_probs=221.6 Q ss_pred CCCcccccccccccccccccccCchhhHHH-HHHhHHHHHHHHHhhhhccccccc---cccccceEEEecccceeeeeec Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATAL-KLFSGEVFTAFNNASIFKGLVRSY---DLRGGKSKQFMFTGKLSAGYHT 76 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~-e~f~g~V~~~f~~~s~~~~~v~~r---~~~~G~tv~i~~iG~~t~~~~~ 76 (332) |++ .+|+ |+|+++|++.|++.+++.++++.. ....|+||+||++|.+++.+|+ T Consensus 1 MA~-----------------------~~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~~~~~d~~ 57 (273) T protein:vir:79 1 MAF-----------------------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYK 57 (273) T ss_pred Ccc-----------------------hhhhHHHHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecCcccccccc Confidence 544 2465 999999999999999999998654 3346999999999999999987 Q ss_pred C-CCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 77 P-GTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGE 155 (332) Q Consensus 77 ~-g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~ 155 (332) + |.++. .+++++++++++||+++++++.|+|+|+.|+++|++ +++++++++||+++|++++..+..+.... T Consensus 58 ~~~~~~~-~~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~~~-~~~~~~~~ala~~vD~~i~~~~~~a~~~~------ 129 (273) T protein:vir:79 58 AAGRQTS-ADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSLE-AYTRAGATALATDTDKFIADMLVDNGTAL------ 129 (273) T ss_pred cCCCccC-ccccccceEEEEEeeecccceeeccHHHHhhcccHH-HHHHHHHHHHHHHHHHHHHHHHhhccccc------ Confidence 5 55554 457899999999999999999999999999999987 58999999999999999998875543211 Q ss_pred cccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeee Q lcl|Aclame:pro 156 PGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYS 235 (332) Q Consensus 156 ~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~ 235 (332) +.+...++..+++.|.+++++||+++||.+|||+||+|++|+.||+. +.+|.+.++.+.++.+++|. ||+ T Consensus 130 --------~~~~~~~~~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~Ll~~-~~~~~~~~~~~~~~~l~~G~-ig~ 199 (273) T protein:vir:79 130 --------TGSAPSDADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSS-GSKLTSADTSGDAAGLRAGT-IGN 199 (273) T ss_pred --------ccccccchhhHHHHHHHHHHHhhhccCCccCcEEEECHHHHHHHhhc-hhhhhhhhhcccccceeeeE-eeE Confidence 11223446678999999999999999999999999999999999974 34688888877777888886 999 Q ss_pred eeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHH Q lcl|Aclame:pro 236 IAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLA 315 (332) Q Consensus 236 i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~ 315 (332) ++||+|++||+||...+ ..++++|++|++.++.+. ++|.. +++++|+|.|.|+++ T Consensus 200 ~~G~~i~~s~~lp~~~~---------------------~~~~a~~~~A~~~a~~~~-~~e~~---r~~~~~~~~v~~~~~ 254 (273) T protein:vir:79 200 LLGARIVESNNLRDTDD---------------------EQFVAFHPSAAAYVSQID-TVEAL---RDQDSFSDRIRALHV 254 (273) T ss_pred EeceEEEecccccccCc---------------------eEEEEEeccceeeeeehh-hhhcc---cCcccceeeeeeeee Confidence 99999999999996422 113678999998888654 56654 456779999999999 Q ss_pred hCCceechhheeeeecC Q lcl|Aclame:pro 316 MGCGSLRTSVAGSFQAA 332 (332) Q Consensus 316 ~G~~vlrpe~~v~i~~A 332 (332) ||++++|||++++|++. T Consensus 255 yg~~v~~p~~vv~~~~~ 271 (273) T protein:vir:79 255 YGGKVVRPTGVVVFNKT 271 (273) T ss_pred eeeEEecCceEEEEecc Confidence 99999999999999988 No 23 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=100.00 E-value=2.3e-56 Score=325.65 Aligned_cols=216 Identities=67% Similarity=0.983 Sum_probs=190.2 Q ss_pred EeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccceeccccccccCHHHHH Q lcl|Aclame:pro 96 MDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGGFHVNIGAGNTNDAQAIV 175 (332) Q Consensus 96 ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~~~~i~~~~~~~~~~~~~~ 175 (332) ||++++++|+|||+|++|++||+|+++++|+|++||+++|++|++++++++++..+..+.+++....++++.+++++++| T Consensus 1 iD~lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~~g~~~~~~a~~t~~~~~l~ 80 (221) T protein:vir:17 1 MDDLLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQDGGFSVNIGAGNTNNAQAIV 80 (221) T ss_pred CCcchhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccccccCcceeccccccCCHHHHH Confidence 99999999999999999999999999999999999999999999999999999999988888888888889999999999 Q ss_pred HHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeeeeceEEEeeCcccccccccc Q lcl|Aclame:pro 176 DGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDL 255 (332) Q Consensus 176 d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~ 255 (332) ++|++|+++|||++||.+|||+||+|++|+.||+..|++++|++++++++.+++|+.|++++||+||+|||+|..+|+.+ T Consensus 81 dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~~~~g~~i~~v~G~~V~~SnnlP~~~gt~~ 160 (221) T protein:vir:17 81 DGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNREIGNTQGDMNTGKGLYVNAGIRIYKSNVLASLYGTNL 160 (221) T ss_pred HHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeeeecccccccccccceeeeecCcEEEEeccCCccccccc Confidence 99999999999999999999999999999999987789999999999999999997799999999999999999888765 Q ss_pred ccc-----ccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHHhCCceechhhe Q lcl|Aclame:pro 256 SSA-----AVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAMGCGSLRTSVA 326 (332) Q Consensus 256 ~~~-----~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~~G~~vlrpe~~ 326 (332) ... ...+..++|.++|++++||+|||+|+||+|++.+..+- + ++.+ =..+.||+.- T Consensus 161 ~~~ag~~~~~~~~~~~yr~~fs~~~glv~~~~Avgtvkl~~~~~~~------~-----~~~~----~~~~~~~~~~ 221 (221) T protein:vir:17 161 VTDPGDATTSGENNGSYRPAITDRAGLVFHKEAADTVEVLLPPSRP------P-----LVIS----MFSIRRPDRR 221 (221) T ss_pred ccCCccccccccccccccccccceEEEEEcchheeeeeeecCCCCC------c-----eeee----eeeccCCCCC Confidence 422 12244569999999999999999999999999875321 1 1111 1245666665 No 24 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=100.00 E-value=2e-56 Score=325.94 Aligned_cols=298 Identities=13% Similarity=0.054 Sum_probs=221.7 Q ss_pred CCCcccccccccccccccccccCchhhHH-HHHHhHHHHHHHHHhhhhccccccccccccceEEEecccceeeeeecCCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATA-LKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLSAGYHTPGT 79 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~-~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG~~t~~~~~~g~ 79 (332) |+.=+|-++ + .|+| -|+|+.+++..++++.+...+.+......|+|||||+||++++++|++++ T Consensus 1 ~~~~n~ts~------------~---qafi~~EiWsa~il~~l~~~Lv~~~~~~~~d~g~GDtV~InsIg~~tV~dY~~~~ 65 (322) T protein:vir:31 1 MSTGNNTSN------------T---QALIVSEIWADEIEDILHEKLLDVNIARVVDFPDGDKLTIPSVGTPVVRSRPEQG 65 (322) T ss_pred CCCCCCccc------------c---eEEeehhhhHHHHHHHhhhhhhhhhhhcccccCCCCeEEeccccccccccccCCC Confidence 665221111 1 1456 49999999999999999999888667788999999999999999999999 Q ss_pred CCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc Q lcl|Aclame:pro 80 PIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGGF 159 (332) Q Consensus 80 ~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~~~ 159 (332) +++. +++++++++|+|||.|||+|.||| |++|...|+++.++++++++|++.+|++++..+..++...+.+.. .. T Consensus 66 ~i~~-d~ltt~~~~l~IDq~KYfaf~VdD-D~~Qa~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~~---p~ 140 (322) T protein:vir:31 66 DFTF-DNLDTGEISIILRDEVYAGNAISK-KLRQDSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQND---PN 140 (322) T ss_pred Cccc-ccCCCceEEEEEehhhhhccccch-hHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCC---cc Confidence 9976 479999999999999999999999 999999999999999999999999999999988776643322211 11 Q ss_pred eecc----ccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHH-------hhcCchhhcccccccccccc Q lcl|Aclame:pro 160 HVNI----GAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLI-------SSVDTNILNREIGNSQGDMN 228 (332) Q Consensus 160 ~i~~----~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll-------~~~d~~~~~~d~~~~~~~~~ 228 (332) .+.. -....+++...|+.|++++.+|||++||.+|||+||+|+++..|. -.+|+||+..+-.+. . T Consensus 141 vin~~~~~iv~~gt~~~~ay~~lv~l~~kLdkanVP~~gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG~----a 216 (322) T protein:vir:31 141 VINGVPHRFVGTGTDQTMDVTDFSRVNYVMTQSKMPMGGMIGIIDPSVAHHLETITNISNISNNPRWEGIVESGI----A 216 (322) T ss_pred eecCCccceeccCCCchhhHHHHHHHHHHhccccCCCCCeEEEeCchhhhhhhhhhhhhhhhccccccccccccc----h Confidence 1110 011234566779999999999999999999999999999987661 136889886544333 3 Q ss_pred cc-ceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEee-----chhhhhhhhhccceeeeeecccc Q lcl|Aclame:pro 229 SG-KGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIF-----HREAAGCIQSVAPTIQTTSGDFN 302 (332) Q Consensus 229 ~g-~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~-----h~~a~~~~~~~~~~~e~~~~~~~ 302 (332) +| +.|++++||+||+||+||. +...+.++..|... .++ ++.+++. +...++..+.++ +.|.||+ T Consensus 217 ~g~~~Vg~~~GF~V~~SN~l~~--~~~~i~aG~d~~~t-~ag---~~n~f~~~~~~~~~~~~~~~~~l~-~~e~~r~--- 286 (322) T protein:vir:31 217 PDMQFVRSVYGIDLFVSNLLAD--ANETINAGGDARST-TAG---KCNMFMNVSDMGLLPFVVAWKEMP-TTKSFID--- 286 (322) T ss_pred hhHHHHHHHhceeeeeeccccc--cccccccCcccccc-cce---eecccccccchhhhhhhhHhhhhh-hhhcccC--- Confidence 34 2489999999999999984 23333333222111 111 1122222 444555555433 4455554 Q ss_pred hhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 303 VQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 303 ~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) +.+|+|.++++++||+|++|||.++.|.+- T Consensus 287 ~~~~~d~~~~~~~~g~g~~r~e~l~~~~a~ 316 (322) T protein:vir:31 287 DYNDDLNTATTARWGNGLVRDENLVCVLAN 316 (322) T ss_pred ccccccceeeeeeecceeecccceEEEEec Confidence 566999999999999999999999888765 No 25 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=100.00 E-value=1.7e-52 Score=304.40 Aligned_cols=306 Identities=12% Similarity=0.023 Sum_probs=226.0 Q ss_pred CCCcccccc-cccccccccccccCchhhHHHHHHhHHHHHHHHHh-hhhcccccccccccc-ce------EEEeccccee Q lcl|Aclame:pro 1 MTTLSNFSL-PNQANGGARNADYDVRYATALKLFSGEVFTAFNNA-SIFKGLVRSYDLRGG-KS------KQFMFTGKLS 71 (332) Q Consensus 1 m~~~~~~~r-~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~-s~~~~~v~~r~~~~G-~t------v~i~~iG~~t 71 (332) |.=-+-+|- |-.. .+.+ +.|+|+|..+|+..||.. ++|++.|+.++-.+| .+ +.++.+++.. T Consensus 1 ~~~~~~~~~~~~Ms------~~i~---~~fv~qy~~~v~~~~qq~~s~L~~tV~~~~~~~~~~~~~~~~~~~~~~~~~~~ 71 (322) T protein:vir:10 1 MKLNAIMSMLPLIA------GDID---QAFVQTYETTLRILSQQKSAKLKQYCQHKNESSESHNWETLASMDPDAVKRKR 71 (322) T ss_pred Ccccceeeeeeeee------chhh---hHHHHHHHHHHHHHHHHhhhhhhcccccccccccccceeeccccccccccccc Confidence 554333443 2111 1222 479999999999999855 999999998865444 33 4444455555 Q ss_pred eeeecCCCC-CCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc Q lcl|Aclame:pro 72 AGYHTPGTP-IVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEAS 150 (332) Q Consensus 72 ~~~~~~g~~-~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~ 150 (332) +..+.+.+. .+..++.+++.+.+.++++ |+.+.|||+|+.|+++|++++|++++++||+|++|+.|+..+...+.. T Consensus 72 ~~~~~~d~~~dtp~~~~~~~~r~~~~~d~-~~~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~~-- 148 (322) T protein:vir:10 72 SRQQSADGTYPTPVNNKPFAKRRTNVDTY-DTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPASI-- 148 (322) T ss_pred ccccccCcccCCCccccccceEEEeeccc-ccceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhccccc-- Confidence 555544332 2333566788888777776 788999999999999999999999999999999999998766544422 Q ss_pred ccccccccceecccccccc--CHHHHHHHHHHHHHHHHhcCCCcCC-CEEEEChHHHHHHHhhcCchhhccccccccccc Q lcl|Aclame:pro 151 PVTGEPGGFHVNIGAGNTN--DAQAIVDGFFEAAAVLDERSAPQEG-RVAVLSPRQYYSLISSVDTNILNREIGNSQGDM 227 (332) Q Consensus 151 ~~~~~~~~~~i~~~~~~~~--~~~~~~d~i~~a~~~Lde~~VP~~g-R~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~ 227 (332) +.++............ .....+++|++|++.|+|++||.++ ||+||+|++|+.||+ +++|++.||.+.+... T Consensus 149 ---~~~gt~v~~~ss~~i~~g~~g~t~~kl~~a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~--d~~~ts~D~~~~~~l~ 223 (322) T protein:vir:10 149 ---KGTGQPVEFLATQEIGDGTKPISFDYVTEITERFLENEIEPEVSKVIVIGPTQARKLLQ--ITEATSADYTSAMDLQ 223 (322) T ss_pred ---cccccccccCCCcccccCccchhHHHHHHHHHHHHhcCCCCCCCeEEEeCHHHHHHHhc--chhhhhhhcccchhhh Confidence 1111111110000000 1123378899999999999999875 999999999999995 6999999999877666 Q ss_pred cccceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHH Q lcl|Aclame:pro 228 NSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQG 307 (332) Q Consensus 228 ~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~ 307 (332) ++|. +++|+||+|++||+||..+++....+... ..+. ..+.|++||++|+++++.+++++++.+ +..++|+ T Consensus 224 ~~G~-ig~~lGf~~i~s~~lp~~~~t~~~~~~~~-----~~~~-~~~~~~a~~k~Av~~a~~~dv~~~i~~--~~~~~~a 294 (322) T protein:vir:10 224 SKGI-ITNWMGYTWIVSTRLDKFDPTQWGMAAED-----GPQG-DEIWCIAMTDMALGYHSCKDIWTKVAE--DPSASFA 294 (322) T ss_pred hcCe-eeeeeeEEEEEeccCCccccccccccccC-----CCCc-cceeEEEEecCceeEEEeeeeeEEeec--cCCcchh Confidence 7786 99999999999999998776665544322 2222 234478999999999999999988854 4566899 Q ss_pred HHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 308 DLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 308 d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) +.|.++++||+++++|+++++|.-- T Consensus 295 ~~I~~~~~~Ga~ri~~~gVv~i~~~ 319 (322) T protein:vir:10 295 WRIYSAFTADCVRVEDEHIFKLRLK 319 (322) T ss_pred hhhhhhhhhCceEeccCcEEEEEEe Confidence 9999999999999999999999877 No 26 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=100.00 E-value=9.1e-42 Score=245.57 Aligned_cols=284 Identities=16% Similarity=0.054 Sum_probs=217.1 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccc--cccccceEEEecccceeeeeecCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSY--DLRGGKSKQFMFTGKLSAGYHTPG 78 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r--~~~~G~tv~i~~iG~~t~~~~~~g 78 (332) .-|-.+--..|.+|....+-.-.+ ..+.|+|.+.+++.|...++...+...+ .+.+|++|+||+++.+.++||+++ T Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~nt--~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~~g~tVkIp~i~~~gl~DY~R~ 93 (329) T protein:vir:10 16 IKNATGKLKLNLQHFANKSVEPGD--TLLKNKHVGILEKVTAANSYSAPAVISNDAIFMQGRSFTVIKGDVTELKDYKRN 93 (329) T ss_pred hhcccceeEEehhhhcCCccCCch--hHHHHHHHHHHHHHHHhhceeeeeecccceeeccCcEEEEeeecccccccccCC Confidence 222222335566676665543221 3455999999999999987766554334 567899999999999999999998 Q ss_pred CCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhH--HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|Aclame:pro 79 TPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYST--RAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEP 156 (332) Q Consensus 79 ~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~--~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~ 156 (332) +.... .+++.+..+++||+.+|+.|.||++|..|++..+ ...+.+.+...++.++|.+.+..++..+... T Consensus 94 ~g~~~-g~vt~~~~t~tidqdR~~~F~VD~~D~dEtn~~l~a~~i~~~~~~~~v~pEiDay~~skla~~a~~~------- 165 (329) T protein:vir:10 94 ATNEF-DHPQIQETTYFLDQEKYWGRFVDALDRRDTEGNIDINYVVAKQASEVVAPYLDNLRFATLARNKAKH------- 165 (329) T ss_pred CCccc-cccccceeEEEeecccceeeecchhhHhhhhhhhhHHHHHHHHHHHHhhhHHHHHHHHHHHhhcccc------- Confidence 87654 5799999999999999999999999999998776 4556778999999999999998886543211 Q ss_pred ccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeee Q lcl|Aclame:pro 157 GGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSI 236 (332) Q Consensus 157 ~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i 236 (332) ......+.++|++|.++.++|+|++|| +|||++|+|++|.+|++ +++|+..... .+...++|. |+++ T Consensus 166 --------~~~~~t~~nay~~i~~a~~~Lde~~vp-~~Rvl~VtP~~~~~Lk~--~~~f~~~~~~-~~~~~~~g~-Vg~i 232 (329) T protein:vir:10 166 --------LTVGSGADAQYDAVLDVSVELDEIGAG-ASRILFVTPKFYKGIKK--FVIELPQGDN-RQQVLGKGV-QGEL 232 (329) T ss_pred --------cccccCHHHHHHHHHHHHHHHHhcCCC-CCcEEEeCHHHHHHHHh--hhhhhccccc-cccceeeee-eeee Confidence 111234678899999999999999999 59999999999998875 4778765433 345667885 9999 Q ss_pred eceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHHh Q lcl|Aclame:pro 237 AGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAM 316 (332) Q Consensus 237 ~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~~ 316 (332) .||+|+++|+.... +.-.+++|++|+..+...+ .++.++. .+..+||.|.+++.| T Consensus 233 dG~~Ii~vps~~~k----------------------~in~ii~~~~A~~~~~K~~-~~~~~~p--~~~~~a~~v~gr~yy 287 (329) T protein:vir:10 233 DGFTIVKVPSKMLQ----------------------GVEAMAVIGEVMASPIQAN-EAKLNSN--VPGMFGTLAEQMLYT 287 (329) T ss_pred cCeEEEEecCCccc----------------------ceeEEEEcCCceeeeeeee-eeeeeCC--CCccchheeeeeeee Confidence 99999998654321 1124789999998777665 5666653 255689999999999 Q ss_pred CCceechhheeeeecC Q lcl|Aclame:pro 317 GCGSLRTSVAGSFQAA 332 (332) Q Consensus 317 G~~vlrpe~~v~i~~A 332 (332) |++|+||++.+.+..+ T Consensus 288 d~~V~~~k~~~I~~~~ 303 (329) T protein:vir:10 288 GAFVPEHLQKYIFTIG 303 (329) T ss_pred eeEEEccccCEEEEec Confidence 9999999987766544 No 27 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=100.00 E-value=9.1e-42 Score=245.58 Aligned_cols=284 Identities=18% Similarity=0.075 Sum_probs=216.9 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccc--cccccceEEEecccceeeeeecCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSY--DLRGGKSKQFMFTGKLSAGYHTPG 78 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r--~~~~G~tv~i~~iG~~t~~~~~~g 78 (332) .-|-.+.-..|.+|...-+ +++-...+.|+|++.+++.+...++...+...+ .+.+|++|+||+++.+.++||+++ T Consensus 5 ~~~~~~~~~~~~~~~~~~~--~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~DY~R~ 82 (319) T protein:vir:97 5 IKNATGMLKLNLQHFANKS--VEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKRN 82 (319) T ss_pred cccccceeEeehhhhhccC--CCcchHHHHHHHHHHHHHHHHHhhhhhhcccCcceEeccCcEEEEeeecccccccccCC Confidence 2333344566666766554 333223556999999999888888776554333 567899999999999999999998 Q ss_pred CCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhH--HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|Aclame:pro 79 TPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYST--RAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEP 156 (332) Q Consensus 79 ~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~--~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~ 156 (332) +.... .+++.+..+++||+.+|+.|.||++|.+|++.++ ...+.+++...++.++|.+.+..++..+... T Consensus 83 ~g~~~-g~vt~~~~t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~~------- 154 (319) T protein:vir:97 83 ATNEF-DHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKH------- 154 (319) T ss_pred CCccc-CCcccceeEEEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcccc------- Confidence 77664 5799999999999999999999999999998776 4556788899999999999988886543211 Q ss_pred ccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeee Q lcl|Aclame:pro 157 GGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSI 236 (332) Q Consensus 157 ~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i 236 (332) ...+..+.++|++|.++.++|+|++|| +|||++|+|++|.+|+++ ++|+.....+ +..+++|. |+++ T Consensus 155 --------~~~~~t~~n~y~~i~~a~~~Lde~~VP-~~Rvl~Vtp~~~~~L~~~--~~f~~~~~~~-~~~~~~g~-Vg~i 221 (319) T protein:vir:97 155 --------LTVGTGSDAQYDAVLDVSVELDEIKAP-ENRVLFVSPTFYKGIKKF--VIALPQGDTR-QQVLGKGV-QGEL 221 (319) T ss_pred --------cccccCHHHHHHHHHHHHHHHHhcCCC-CCcEEEeCHHHHHHHHhh--hhhhcccccc-ccceeeee-ceee Confidence 111234678899999999999999999 699999999999988764 6787655443 45567885 9999 Q ss_pred eceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHHh Q lcl|Aclame:pro 237 AGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAM 316 (332) Q Consensus 237 ~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~~ 316 (332) .||+|+++++-.. . +.-.++.|+.|+..+...+ .++.++. .+..|||.|.+++.| T Consensus 222 dG~~Vi~vps~~~---k-------------------~in~i~~h~~A~~~~~k~~-~~~~~~p--~~~~~a~~v~gr~y~ 276 (319) T protein:vir:97 222 DGFVIVKVPTKLL---Q-------------------GLQAIAVVGEVLASPIQAD-LAKTNSN--IPGMFGTLAEQLLYT 276 (319) T ss_pred cCeEEEEeccccc---c-------------------cceEEEEcCCeeeeeeeee-eeeccCC--Cccccceeeeeeeee Confidence 9999998754321 0 1124788999987766544 4555543 355689999999999 Q ss_pred CCceechhheeeeecC Q lcl|Aclame:pro 317 GCGSLRTSVAGSFQAA 332 (332) Q Consensus 317 G~~vlrpe~~v~i~~A 332 (332) |++|+||+..+.+..+ T Consensus 277 d~~V~~~k~~~Iy~~~ 292 (319) T protein:vir:97 277 GAFVPEHLQKYIFTIG 292 (319) T ss_pred eeEEeccccceEEEee Confidence 9999999987777644 No 28 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=100.00 E-value=9.1e-42 Score=245.58 Aligned_cols=284 Identities=18% Similarity=0.075 Sum_probs=216.9 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccc--cccccceEEEecccceeeeeecCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSY--DLRGGKSKQFMFTGKLSAGYHTPG 78 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r--~~~~G~tv~i~~iG~~t~~~~~~g 78 (332) .-|-.+.-..|.+|...-+ +++-...+.|+|++.+++.+...++...+...+ .+.+|++|+||+++.+.++||+++ T Consensus 5 ~~~~~~~~~~~~~~~~~~~--~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~DY~R~ 82 (319) T protein:vir:94 5 IKNATGMLKLNLQHFANKS--VEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKRN 82 (319) T ss_pred cccccceeEeehhhhhccC--CCcchHHHHHHHHHHHHHHHHHhhhhhhcccCcceEeccCcEEEEeeecccccccccCC Confidence 2333344566666766554 333223556999999999888888776554333 567899999999999999999998 Q ss_pred CCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhH--HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|Aclame:pro 79 TPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYST--RAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEP 156 (332) Q Consensus 79 ~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~--~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~ 156 (332) +.... .+++.+..+++||+.+|+.|.||++|.+|++.++ ...+.+++...++.++|.+.+..++..+... T Consensus 83 ~g~~~-g~vt~~~~t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~~------- 154 (319) T protein:vir:94 83 ATNEF-DHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKH------- 154 (319) T ss_pred CCccc-CCcccceeEEEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcccc------- Confidence 77664 5799999999999999999999999999998776 4556788899999999999988886543211 Q ss_pred ccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeee Q lcl|Aclame:pro 157 GGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSI 236 (332) Q Consensus 157 ~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i 236 (332) ...+..+.++|++|.++.++|+|++|| +|||++|+|++|.+|+++ ++|+.....+ +..+++|. |+++ T Consensus 155 --------~~~~~t~~n~y~~i~~a~~~Lde~~VP-~~Rvl~Vtp~~~~~L~~~--~~f~~~~~~~-~~~~~~g~-Vg~i 221 (319) T protein:vir:94 155 --------LTVGTGSDAQYDAVLDVSVELDEIKAP-ENRVLFVSPTFYKGIKKF--VIALPQGDTR-QQVLGKGV-QGEL 221 (319) T ss_pred --------cccccCHHHHHHHHHHHHHHHHhcCCC-CCcEEEeCHHHHHHHHhh--hhhhcccccc-ccceeeee-ceee Confidence 111234678899999999999999999 699999999999988764 6787655443 45567885 9999 Q ss_pred eceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHHh Q lcl|Aclame:pro 237 AGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAM 316 (332) Q Consensus 237 ~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~~ 316 (332) .||+|+++++-.. . +.-.++.|+.|+..+...+ .++.++. .+..|||.|.+++.| T Consensus 222 dG~~Vi~vps~~~---k-------------------~in~i~~h~~A~~~~~k~~-~~~~~~p--~~~~~a~~v~gr~y~ 276 (319) T protein:vir:94 222 DGFVIVKVPTKLL---Q-------------------GLQAIAVVGEVLASPIQAD-LAKTNSN--IPGMFGTLAEQLLYT 276 (319) T ss_pred cCeEEEEeccccc---c-------------------cceEEEEcCCeeeeeeeee-eeeccCC--Cccccceeeeeeeee Confidence 9999998754321 0 1124788999987766544 4555543 355689999999999 Q ss_pred CCceechhheeeeecC Q lcl|Aclame:pro 317 GCGSLRTSVAGSFQAA 332 (332) Q Consensus 317 G~~vlrpe~~v~i~~A 332 (332) |++|+||+..+.+..+ T Consensus 277 d~~V~~~k~~~Iy~~~ 292 (319) T protein:vir:94 277 GAFVPEHLQKYIFTIG 292 (319) T ss_pred eeEEeccccceEEEee Confidence 9999999987777644 No 29 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=100.00 E-value=1.2e-41 Score=245.00 Aligned_cols=270 Identities=14% Similarity=0.071 Sum_probs=216.0 Q ss_pred CCCcccccccccccccccccccCchhhHHH-HHHhHHHHHHHHHhhhhcccccc-ccc--cccceEEEecccce-eeeee Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATAL-KLFSGEVFTAFNNASIFKGLVRS-YDL--RGGKSKQFMFTGKL-SAGYH 75 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~-e~f~g~V~~~f~~~s~~~~~v~~-r~~--~~G~tv~i~~iG~~-t~~~~ 75 (332) |+|+ .|+.. .+|+ |+|+.+|.++|.+..++.++... +++ ..|++|+||+++.. .+.+| T Consensus 1 Ma~~--~T~~~---------------~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~~~ 63 (278) T protein:vir:80 1 MADL--TTKLA---------------NLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQDV 63 (278) T ss_pred CCCc--ceehh---------------heecHHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeeccCCcceee Confidence 7772 34442 1566 89999999999999999988754 344 45999999998754 46789 Q ss_pred cCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 76 TPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGE 155 (332) Q Consensus 76 ~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~ 155 (332) ..|+.++. ++++.++.+++|++.. ..|.|+|++..++..|++++++++++++|++++|+.++..+..+... +. T Consensus 64 ~~g~~i~~-~~lt~~~~~~~i~~~~-~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~~---~~-- 136 (278) T protein:vir:80 64 AEGAAIDY-SALETESVKHGIKKAG-KGVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTLE---VK-- 136 (278) T ss_pred cCCCcCcc-cccccceeeEeeehhh-ccccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhccccc---cc-- Confidence 99999875 5799999999999964 58999999999999999999999999999999999999887543211 11 Q ss_pred cccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeee Q lcl|Aclame:pro 156 PGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYS 235 (332) Q Consensus 156 ~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~ 235 (332) ++.........++.|.++..+|+++++|. .++++|.|++|+.|++....+|+..... .++.+++|. +++ T Consensus 137 --------~~~t~~~~~~~~~~~~da~~~l~~~~~~~-~~~ivv~p~~~~~L~k~~~~~~~~~~~~-g~~~~~~G~-ig~ 205 (278) T protein:vir:80 137 --------GAINIGLIDKIENTFTDAPDAIEDESITT-TGVLFLNYKDTAKLREEAAGSWTKASQL-GDDLLVKGA-FGE 205 (278) T ss_pred --------cccccchhhhHHHHHHHHHHhhcccCCCc-ccEEEECHHHHHHHHhhhhhhccccccc-cccceeecc-cee Confidence 01111224456889999999999999996 5678999999999986433467654332 345678885 999 Q ss_pred eeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHH Q lcl|Aclame:pro 236 IAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLA 315 (332) Q Consensus 236 i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~ 315 (332) ++||+|++||++|.. .+++||+.|+++...+++++|..| ++.++.|.|++++. T Consensus 206 ~~G~~Vi~s~~~p~~------------------------t~~l~~~gAi~~~~~~~~~vE~~R---d~~~~~d~i~~~~~ 258 (278) T protein:vir:80 206 LLGWEIVRTKKLADG------------------------NALAVKAGALKTFLKRNLLAESGR---DMDHKLTKFNADQH 258 (278) T ss_pred ecceeEEEcCCCCcc------------------------eEEEEeccceeeeecCCccccccc---chhhccceeeeeeE Confidence 999999999999831 246889999999888888887655 56679999999999 Q ss_pred hCCceechhheeeeecC Q lcl|Aclame:pro 316 MGCGSLRTSVAGSFQAA 332 (332) Q Consensus 316 ~G~~vlrpe~~v~i~~A 332 (332) ||++++||++++.|.++ T Consensus 259 yg~~v~~~~~~v~it~~ 275 (278) T protein:vir:80 259 YAVALVDETKAVKVVPV 275 (278) T ss_pred EEEEEEcCcceEEEeec Confidence 99999999999999998 No 30 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=100.00 E-value=2e-40 Score=238.23 Aligned_cols=263 Identities=14% Similarity=0.100 Sum_probs=214.4 Q ss_pred CCCcccccccccccccccccccCchhhHHH-HHHhHHHHHHHHHhhhhccccccc-cc--cccceEEEecccc-eeeeee Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATAL-KLFSGEVFTAFNNASIFKGLVRSY-DL--RGGKSKQFMFTGK-LSAGYH 75 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~-e~f~g~V~~~f~~~s~~~~~v~~r-~~--~~G~tv~i~~iG~-~t~~~~ 75 (332) |+| ..|+.. .+++ |+|+..|.++|.+..++.+++... ++ +.|++++||+.+. ..+.+| T Consensus 1 ma~--~~T~~~---------------d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~ 63 (274) T protein:vir:96 1 MAQ--GTTKVS---------------NLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVI 63 (274) T ss_pred CCc--cccchh---------------hhhhhHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCcccc Confidence 887 335532 1555 999999999999999999988654 33 3599999999874 478899 Q ss_pred cCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 76 TPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGE 155 (332) Q Consensus 76 ~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~ 155 (332) ..|+.++. ++++.++.+++|++ .++.|.|+|++..++..|++++++++++++|++++|+.++..+..+... T Consensus 64 ~~g~~i~~-~~it~~~~~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~~~------- 134 (274) T protein:vir:96 64 AEGEKIPV-DQIGTSKREAKVRK-IGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLT------- 134 (274) T ss_pred CCCCcCch-hhcccceeEEEEEe-eeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC------- Confidence 99999875 57999999999998 5889999999999999999999999999999999999999877432111 Q ss_pred cccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeee Q lcl|Aclame:pro 156 PGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYS 235 (332) Q Consensus 156 ~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~ 235 (332) ..+.. ..++.|.+|..+|+++++ ++||++|.|++|+.|++....+|+.... ..++.+++|. +++ T Consensus 135 --------~~~~~----~~~d~i~dA~~~l~d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~~~-~g~~~~~~g~-ig~ 198 (274) T protein:vir:96 135 --------VEADI----TKLDGLQTAIDKFNDEDL--EPMVLFVNPLDAGGLRTSASDNFTRPTQ-LGDNIIVKGA-FGE 198 (274) T ss_pred --------cCccc----ccHHHHHHHHHHhcccCC--CceEEEeCHHHHHHHHhccccccccccc-ccccceeecc-cce Confidence 00111 127889999999999886 6899999999999998643346665433 3356778885 999 Q ss_pred eeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHH Q lcl|Aclame:pro 236 IAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLA 315 (332) Q Consensus 236 i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~ 315 (332) ++||+|++||++|.. .+++|++.|++++..+++++|..| ++.++.|.|.+++. T Consensus 199 ~~G~~Vi~s~~~p~~------------------------t~~l~~~gA~~~~~~~~~~vE~~R---d~~~~~d~i~~~~~ 251 (274) T protein:vir:96 199 ALGAVIVRSNKLNKG------------------------EALLAKKGAVKLITKRDFFLEKDR---DASRKSTALYSDKH 251 (274) T ss_pred ecCeeEEEcCCCCcc------------------------eEEEEeCcceeeeecCCccccccc---chhhcccEEEEeeE Confidence 999999999999942 147889999999999998887655 56679999999999 Q ss_pred hCCceechhheeeeecC Q lcl|Aclame:pro 316 MGCGSLRTSVAGSFQAA 332 (332) Q Consensus 316 ~G~~vlrpe~~v~i~~A 332 (332) ||++++||++++.|.+| T Consensus 252 yg~~~~~~~~vv~~t~~ 268 (274) T protein:vir:96 252 YVAYLYDESKVVKITKG 268 (274) T ss_pred EEEEEEcCccEEEEEcC Confidence 99999999999999999 No 31 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=100.00 E-value=5.1e-40 Score=236.00 Aligned_cols=281 Identities=14% Similarity=0.134 Sum_probs=185.7 Q ss_pred CCCcccccccccccccccccccCchhhHHH-HHHhHHHHHHHHHhhhhccccccc---cc--cccceEEEecccceeeee Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATAL-KLFSGEVFTAFNNASIFKGLVRSY---DL--RGGKSKQFMFTGKLSAGY 74 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~-e~f~g~V~~~f~~~s~~~~~v~~r---~~--~~G~tv~i~~iG~~t~~~ 74 (332) |+| .+|+ |+|+.++++.|++..+|..+++.. ++ +.|+||+||..+.+++.+ T Consensus 1 Ma~-----------------------~~~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~~~~~~ 57 (392) T protein:vir:99 1 MAN-----------------------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHT 57 (392) T ss_pred Ccc-----------------------ccccHHHHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeeccccccee Confidence 554 2444 889999999999999999998643 44 359999999999999999 Q ss_pred ecC-----CCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|Aclame:pro 75 HTP-----GTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEA 149 (332) Q Consensus 75 ~~~-----g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~ 149 (332) |++ +.++. .+++.+++++++||+.+|++|.|+|.|+.|...|+++++.++++++||+.+|++|+..+..+.... T Consensus 58 ~~~~~~~~~~~~~-~~~~~~~~~~~~id~~k~~~~~i~d~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~~~a~~~~ 136 (392) T protein:vir:99 58 RKLRGAGAERNLT-VSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEA 136 (392) T ss_pred eeccccccCCccc-ccccccceEEEEEeeeeecceeechHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccc Confidence 864 33444 357889999999999999999999999999999999999999999999999999998775432211 Q ss_pred cccccccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhcccccccc--ccc Q lcl|Aclame:pro 150 SPVTGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQ--GDM 227 (332) Q Consensus 150 ~~~~~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~--~~~ 227 (332) . +.....++...|+.|++++++|+|++||. |||+||+|++|+.|++ +++|++.++.+.. ..+ T Consensus 137 ~-------------~~~~~~~~~~~~~~i~~a~~~L~~~~vP~-~R~~vv~p~~~~~l~~--~~~~~~~~~~g~~~~~~l 200 (392) T protein:vir:99 137 A-------------GAVHEVAPDEFFKGVNGARRALNELYIPQ-GRVLVVGTAVTEQILN--DDRFIKYESQGQSAVSAL 200 (392) T ss_pred c-------------ccccccChhhhHHHHHHHHHHHhhcCCCC-CCEEEEcHHHHHHHhc--ccceeecccccchhhhhh Confidence 0 12233457778999999999999999996 8999999999999985 5889988877654 347 Q ss_pred cccceeeeeeceEEEeeCccccccccccccc-ccccc-cccccccccceEEEeechhhhhhhhhccceeeeeecccchhH Q lcl|Aclame:pro 228 NSGKGLYSIAGIRILKSNNLAGLYGQDLSSA-AVTGE-NNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQY 305 (332) Q Consensus 228 ~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~-~~~g~-~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~ 305 (332) ++|. |++++||+||+|+++|...+...... ..... .+.......... ... + .........++ ++..+ T Consensus 201 ~~G~-vg~i~G~~v~~s~~~~~~t~~a~~~~a~~~at~a~v~~~~~~~~~--s~s----~---~~~v~~~~~~~-~~~t~ 269 (392) T protein:vir:99 201 QEAR-LGRIYGYEIVESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVRST--AIS----G---DQRIAMRWLVD-YDSTI 269 (392) T ss_pred hcce-eeeeeeeEEEeecccccccceeeecccccccccccccccccccee--EEe----c---ccceecceeec-cccee Confidence 7885 99999999999999997544222111 00000 000000000000 000 0 00001111111 11112 Q ss_pred HHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 306 QGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 306 ~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) ..|...-....|.+.+.-.+...+..+ T Consensus 270 ~s~~~~v~~~~g~~~v~~~~~~~~~~~ 296 (392) T protein:vir:99 270 TSNRSLIDTYFGLKVVEDPNGVGFVRA 296 (392) T ss_pred eccccccceeEEEEEEeeccccceeee Confidence 222222222233333322221111111 No 32 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=100.00 E-value=8.1e-40 Score=234.90 Aligned_cols=287 Identities=17% Similarity=0.163 Sum_probs=197.7 Q ss_pred CCCcccccccccccccccccccCchhhHH-HHHHhHHHHHHHHHhhhhccccccc---cc-cccceEEEecccceeeeee Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATA-LKLFSGEVFTAFNNASIFKGLVRSY---DL-RGGKSKQFMFTGKLSAGYH 75 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~-~e~f~g~V~~~f~~~s~~~~~v~~r---~~-~~G~tv~i~~iG~~t~~~~ 75 (332) |+-. .| .+. -|+|+.++++.|++++++.++|+.. ++ ..|+|||||+.+..+++++ T Consensus 1 m~~~-------------~N-------~~ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp~~~~v~dg 60 (418) T protein:vir:10 1 MAVQ-------------DN-------NLLTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKLPYRVKSASG 60 (418) T ss_pred CCcc-------------cc-------ccccHHHHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEeeCCceeeccc Confidence 4431 11 122 3699999999999999999999742 22 3489999999999999986 Q ss_pred cCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 76 TPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGE 155 (332) Q Consensus 76 ~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~ 155 (332) . .+. .+++..++++|+||+.+|++|.|+|.|++|...|++++++++++++||+.+|++|+.++..+....+ . T Consensus 61 ~---~~~-~~~~te~~v~l~id~~k~~~~~itD~e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l~~~a~~~~g----t 132 (418) T protein:vir:10 61 R---TLV-KQPMVDQTIPFKIAYQEHVGLEYTVKDKTLDIMQFSERYLKSGMVQIANQIDRSLALTLKKAFHSSG----T 132 (418) T ss_pred C---Ccc-ccccccceEEEEEecccccceeechHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc----c Confidence 5 344 3578889999999999999999999999999999999999999999999999999987755432211 0 Q ss_pred cccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCC-CEEEEChHHHHHHHhhcCchhhccccccccccccccceee Q lcl|Aclame:pro 156 PGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEG-RVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLY 234 (332) Q Consensus 156 ~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~g-R~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~ 234 (332) ++ + ....|+.|.+++++|++++||.+| ||+||+|++|+.|++. .++.-. ..+....+++|. |+ T Consensus 133 ---------~g--t-~~~~~~~i~~a~~~Ld~~~VP~~G~R~lVv~P~~~~~L~~~--~~~~~~-~~~~~~~lr~G~-IG 196 (418) T protein:vir:10 133 ---------PG--V-RPGAFIDFANAGAKQTTYAVPQDGMRHAVLDPFTCASLSDE--VTKLFK-ESMVEQAYKMGY-RG 196 (418) T ss_pred ---------CC--c-CcchHHHHHHHHHHHHhcCCCCCCceEEEeCHHHHHHHhhh--cccccc-ccccchhhheee-ee Confidence 01 0 112388999999999999999986 9999999999988753 444432 234456789996 89 Q ss_pred eeeceEEEeeCcccc-cccccccccccccc------------------------c--------------------ccccc Q lcl|Aclame:pro 235 SIAGIRILKSNNLAG-LYGQDLSSAAVTGE------------------------N--------------------NDYQV 269 (332) Q Consensus 235 ~i~G~~V~~sn~lp~-~~g~~~~~~~~~g~------------------------~--------------------~~y~~ 269 (332) +++||+||+||+||. +.++......++|. . ..|.+ T Consensus 197 ~i~GF~V~~S~nip~~tag~~~~t~~v~ga~~~~~~~~~~~~t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~~~~~~~f~V 276 (418) T protein:vir:10 197 NVAAYEVYESQNLPKHTVGDHGGTPLVNGTVVNGDTVGFDGGTASTTGFLKAGDVITFGGVFGVNPQNYETTGLLQEFVV 276 (418) T ss_pred eeeceEEEEecCCCcccccccccceeeecccccceeEEEeecceeeccceeeccEEEECceeecccccccccccceEEEE Confidence 999999999999995 33332221111111 0 01100 Q ss_pred cc-----------------------------------------------------------cceEEEeechhhhhhhhh- Q lcl|Aclame:pro 270 DA-----------------------------------------------------------SALAGLIFHREAAGCIQS- 289 (332) Q Consensus 270 ~~-----------------------------------------------------------~~~~~l~~h~~a~~~~~~- 289 (332) .. +.+.-|+||++|+..+-. T Consensus 277 ~~~~~~~~~~~~tv~i~p~~~~~~~~~~~~~~~~~~~~~~~~v~a~~a~~~~it~~~~a~~~~~~nl~f~~~a~~l~~~~ 356 (418) T protein:vir:10 277 LEDVDTDAGGAGSIKISPSLNDGTATINNENGDPVSLTAYQNVTALPADNAPITVLGAANTTYEQNYLFHRDAIALAMID 356 (418) T ss_pred EeeccccccCcceeEeccccccccccccccccccccccCCCcccccccCcceeeeecccccceeeeeeeecceEEEEEee Confidence 00 001128899988644321 Q ss_pred -----------------ccceeeeeecccchhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 290 -----------------VAPTIQTTSGDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 290 -----------------~~~~~e~~~~~~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) .+++++..+. ++.+..-+.++--..||.+.+|||.++.|.=. T Consensus 357 l~~p~g~~~~~~~~~~~~G~s~r~~~~-~d~~~~~~~~r~d~l~g~~~~~p~~~~~~~g~ 415 (418) T protein:vir:10 357 LELPQSAVIKSRAADPETGLSLTLTGA-YDINEQSEIHRIDAVWGADMIYGELALRLWGA 415 (418) T ss_pred ccCCCCCCcceEEEeccCCeEEEEEEc-ccccccceEEEEEeecCceeecccceEEEEee Confidence 1122222221 11222222333444899999999998766544 No 33 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=100.00 E-value=2.1e-38 Score=227.18 Aligned_cols=291 Identities=14% Similarity=0.153 Sum_probs=200.6 Q ss_pred CCCcccccccccccccccccccCchhhHHH-HHHhHHHHHHHHHhhhhccccccc---cc---cccceEEEecccceeee Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATAL-KLFSGEVFTAFNNASIFKGLVRSY---DL---RGGKSKQFMFTGKLSAG 73 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~-e~f~g~V~~~f~~~s~~~~~v~~r---~~---~~G~tv~i~~iG~~t~~ 73 (332) |+| + +..|| |+|+.+.++.|+++.++.++|+.. .+ +.|+||+|++.+.++++ T Consensus 1 MAN--~-------------------llT~iP~iia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p~~~~v~ 59 (423) T protein:vir:35 1 MAN--N-------------------LESNISQIVLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRPHQFKSE 59 (423) T ss_pred Ccc--c-------------------hhhhhHHHHHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeCCcceee Confidence 665 1 12354 899999999999999999999743 23 34999999999999999 Q ss_pred eecCC--CCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|Aclame:pro 74 YHTPG--TPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASP 151 (332) Q Consensus 74 ~~~~g--~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~ 151 (332) +|.++ +.+.. +++...++.|+||+.+|++|.++|.|++|...|+ +.+.++++++|++.+|+.++..+...+. + T Consensus 60 d~~~~~~~~~~~-~~~~e~~v~l~id~~k~~a~~v~d~e~~l~i~~~-~~~l~~a~~ala~~vd~~l~~~l~~~a~---~ 134 (423) T protein:vir:35 60 RTETGDITGKDK-NGLFSAKATGKVGKYITVAVEWTQIEEALKLNQL-DQILSPIHERMVTDLETELAHFMMNNGA---L 134 (423) T ss_pred cccCcCCCCccc-cccccceeeEEeccceeccceeCHHHHHhhHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccc---c Confidence 99764 44553 5677788999999999999999999999988888 4677888999999999999987755431 1 Q ss_pred cccccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccc Q lcl|Aclame:pro 152 VTGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGK 231 (332) Q Consensus 152 ~~~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~ 231 (332) ..+.+ + .+...|+.|.+++++|++++||+.|||+||+|++|..||++ +.+|.+.+..+ ...+++|+ T Consensus 135 ~vgt~---------~---t~~~~~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~-~~~~~~~~~~~-~~alr~g~ 200 (423) T protein:vir:35 135 SLGSP---------N---TAIKKWADVAQTASFIKDIGIKTGENYAIMDPWSAQRLADA-QSGLHAADQLV-RTAWENAQ 200 (423) T ss_pred ccccc---------c---CCcchHHHHHHHHHHHHHhcCCcCCCEEEeCHHHHHHHhcc-ccceeccccch-hHHHhhcc Confidence 11111 0 11123788999999999999999999999999999999864 56677665444 45589987 Q ss_pred eeeeeeceEEEeeCcccc-ccccccccccc----------------------------cc-----ccccccc-------- Q lcl|Aclame:pro 232 GLYSIAGIRILKSNNLAG-LYGQDLSSAAV----------------------------TG-----ENNDYQV-------- 269 (332) Q Consensus 232 ~v~~i~G~~V~~sn~lp~-~~g~~~~~~~~----------------------------~g-----~~~~y~~-------- 269 (332) .+|+++||+||+||++|. +.++.+....+ .| -...|++ T Consensus 201 i~G~i~GFdv~~Snnvp~~T~gt~~~~~~v~~a~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~t~aGv~~v~~~t 280 (423) T protein:vir:35 201 ISGNFGGIRALMSNGLASRKQGDFDGAITVKTAPNVDYLSVKDSYQFTVALTGATPSKTGFLKAGDQLKFTSTHWLNQQS 280 (423) T ss_pred ceeeecceEEEEcCCCccccccccccceeeccccccccccccccccceeeeeeeeeccCCcEEecceEEeeeeeeccccc Confidence 669999999999999995 33322211000 00 0000111 Q ss_pred -----------------cc------------------------------------------------cceEEEeechhhh Q lcl|Aclame:pro 270 -----------------DA------------------------------------------------SALAGLIFHREAA 284 (332) Q Consensus 270 -----------------~~------------------------------------------------~~~~~l~~h~~a~ 284 (332) .. .-+.-|+||++|+ T Consensus 281 ~~~~~~~~t~~~~~~~V~~~~~~~a~g~~~v~i~p~~~~~~~~~~~~~v~a~~a~~~~vt~~~~a~~~~~~nl~~~~~a~ 360 (423) T protein:vir:35 281 KQTLYNGSTAMSFTATVLEETNSTASGDVTVKLSGVPIYDEKNSQYNAVDAKVKAGDAVSIIGTAKQQMKPNLFYNKFFC 360 (423) T ss_pred cceeecccCCceeEEEEeccccccccCceeEEccccccccCCCcccccccccccCCceeeeeecCCCceeEEEeecCcee Confidence 00 0113479999987 Q ss_pred hhhhh---------------ccceeeeeecccchhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 285 GCIQS---------------VAPTIQTTSGDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 285 ~~~~~---------------~~~~~e~~~~~~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) ..+.. .+++++.... ++.+-.-+.++--..||.+.+|||.++-|.-= T Consensus 361 ~l~~~~l~~~~~~~~~~~~~~g~s~r~~~~-~d~~~~~~~~r~d~l~g~~~~~p~~~~~~~g~ 422 (423) T protein:vir:35 361 GLGTIPLPKLHSLDSAVATYEGFSIRVHKY-ADGDANKQMMRFDLLPAYVCFNPHMGGQFFGN 422 (423) T ss_pred EEEEEccccCCccceeeccccCceEEEEEe-eccccCceEEEEEeecceeeecccceEEEEec Confidence 65432 1222222211 11111112233335699999999998766655 No 34 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=100.00 E-value=7.2e-39 Score=229.71 Aligned_cols=265 Identities=16% Similarity=0.161 Sum_probs=214.5 Q ss_pred CCCcccccccccccccccccccCchhhHH-HHHHhHHHHHHHHHhhhhccccccc-ccc--ccceEEEecccce-eeeee Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATA-LKLFSGEVFTAFNNASIFKGLVRSY-DLR--GGKSKQFMFTGKL-SAGYH 75 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~-~e~f~g~V~~~f~~~s~~~~~v~~r-~~~--~G~tv~i~~iG~~-t~~~~ 75 (332) |+| ..|+.. .++ =|+|+.+|.++|.+..++.++.... ++. .|++|+||..+.. ...++ T Consensus 1 ma~--~~T~~~---------------d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~~~ 63 (272) T protein:vir:36 1 MSK--QKTTLA---------------DLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADV 63 (272) T ss_pred CCC--cceehh---------------hhhchHHHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeeccCcccccc Confidence 887 456642 144 4999999999999999999988654 343 4999999997665 35678 Q ss_pred cCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 76 TPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGE 155 (332) Q Consensus 76 ~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~ 155 (332) ..|..++. ++++.++.+++|.+. ...|.|+|++..++..|++++++++++++||+++|+.++..+..+... T Consensus 64 ~eg~~i~~-~~lt~~~~~~~i~~~-~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~~~~------- 134 (272) T protein:vir:36 64 AEGGEISL-DKIGTTTKSVTIKKA-AKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTTSQT------- 134 (272) T ss_pred CCCCccCh-hhcCCcceeEeeehh-hccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccc------- Confidence 88999875 579999999999885 678999999999999999999999999999999999998766432111 Q ss_pred cccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeee Q lcl|Aclame:pro 156 PGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYS 235 (332) Q Consensus 156 ~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~ 235 (332) .++...+|.|.+|..+|.++++| .|+++|.|+.|+.|++ +.+|.......+++.+++|. +++ T Consensus 135 -------------~~~~~~~d~i~~A~~~lgd~~~~--~~~ivv~p~~~~~L~k--~~~~~~~~~~~~~~~~~~G~-ig~ 196 (272) T protein:vir:36 135 -------------VSTKANVDGVQAALDIFNDEDAQ--AYVLIVNPKDAAKIRK--DANAKNIGSEVGANALINGT-YAD 196 (272) T ss_pred -------------ccccccHHHHHHHHHHhhhcCCC--ceEEEEcHHHHHHHhc--ccccccccccccccceeeec-cce Confidence 11222378899999999999986 5899999999999974 56777665455566788886 999 Q ss_pred eeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHH Q lcl|Aclame:pro 236 IAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLA 315 (332) Q Consensus 236 i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~ 315 (332) ++|++|++|+++|..++. ...++|++.|+++...+++++|..| ++..|.|.|++++. T Consensus 197 ~~G~~Vv~s~~~p~~~~~--------------------~~~~~~~~gA~~~~~~~~~~vE~~R---~~~~~~d~i~~~~~ 253 (272) T protein:vir:36 197 VLGAQIVRSKKLAEGSAL--------------------MFKIVSNSPALKLVLKRGVQVETDR---DIVTKTTVITADEH 253 (272) T ss_pred ecCeeEEEeCCCCCCcee--------------------EEEEEecccceeeeecCCccccccc---chhhcCcEEEEEEE Confidence 999999999999953221 1246788899998888898888655 56679999999999 Q ss_pred hCCceechhheeeeecC Q lcl|Aclame:pro 316 MGCGSLRTSVAGSFQAA 332 (332) Q Consensus 316 ~G~~vlrpe~~v~i~~A 332 (332) ||++++||++++.|..+ T Consensus 254 y~~~v~~~~~vv~~t~~ 270 (272) T protein:vir:36 254 YAAYLYDLTKVVNITFT 270 (272) T ss_pred EEEEEEcCccEEEEeec Confidence 99999999999999999 No 35 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=100.00 E-value=1.1e-38 Score=228.61 Aligned_cols=263 Identities=13% Similarity=0.108 Sum_probs=213.4 Q ss_pred CCCcccccccccccccccccccCchhhHHH-HHHhHHHHHHHHHhhhhccccccc-ccc--ccceEEEecccc-eeeeee Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATAL-KLFSGEVFTAFNNASIFKGLVRSY-DLR--GGKSKQFMFTGK-LSAGYH 75 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~-e~f~g~V~~~f~~~s~~~~~v~~r-~~~--~G~tv~i~~iG~-~t~~~~ 75 (332) |+| +.|+... +++ |+|+.+|.+++.+..++.+++... ++. .|++|+||+... ..+++| T Consensus 1 ma~--~~T~~~~---------------~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~ 63 (274) T protein:vir:93 1 MPQ--GITKTSN---------------QIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVV 63 (274) T ss_pred CCc--cceehhh---------------eechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCcccc Confidence 888 6666531 444 999999999999999999998754 343 499999999765 477899 Q ss_pred cCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 76 TPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGE 155 (332) Q Consensus 76 ~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~ 155 (332) ..|+.++. ++++.++.+++|++ .++.|.|+|++..++..|++++.+++++++|++.+|+.++..+.++.... T Consensus 64 ~eg~~i~~-~~it~~~~~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~~~------ 135 (274) T protein:vir:93 64 AEGEKIPT-DILETKKREAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTV------ 135 (274) T ss_pred cCCCcccc-cccccceeEEEeee-ecccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccc------ Confidence 99999875 57999999999988 57899999999999999999999999999999999999998774432110 Q ss_pred cccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeee Q lcl|Aclame:pro 156 PGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYS 235 (332) Q Consensus 156 ~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~ 235 (332) .+..++ ++.|.+|..+|+++++ ++||++|.|++|+.|++....+|+.... ..++.+++|. +++ T Consensus 136 ---------~~~~~~----~d~i~dA~~~l~d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~-~g~~~~~~G~-ig~ 198 (274) T protein:vir:93 136 ---------NADITK----LNGLQSAIDKFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATE-LGDDIIVKGA-FGE 198 (274) T ss_pred ---------cccccC----HHHHHHHHHHhhhccC--CccEEEeCHHHHHHHHhhhhhccccccc-ccccceeecc-cce Confidence 111122 6888999999999876 6899999999999998643235665432 3345678885 999 Q ss_pred eeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHH Q lcl|Aclame:pro 236 IAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLA 315 (332) Q Consensus 236 i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~ 315 (332) ++||+|++||++|.. .+++||+.|++++..+++.+|..|+ +..+.|.|++++. T Consensus 199 ~~G~~Vi~s~~~p~~------------------------t~~l~~~gai~~~~~~~~~vE~~Rd---~~~~~d~i~~~~~ 251 (274) T protein:vir:93 199 ALGAIIVRTNKLEAG------------------------TAILAKKGAVKLILKRDFFLEVARD---ASTKTTALYSDKH 251 (274) T ss_pred ecCeeEEEcCCCCcc------------------------eEEEEeCCeEEEEecCCcccccccc---hhhcccEEEEEEE Confidence 999999999999831 1468899999999888888886654 5668999999999 Q ss_pred hCCceechhheeeeecC Q lcl|Aclame:pro 316 MGCGSLRTSVAGSFQAA 332 (332) Q Consensus 316 ~G~~vlrpe~~v~i~~A 332 (332) ||++++||++++.+..| T Consensus 252 y~~~~~~~~~~v~~t~~ 268 (274) T protein:vir:93 252 YVAYLYDESKAVKITKG 268 (274) T ss_pred EEEEEEcCCceEEEeeC Confidence 99999999999999999 No 36 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=100.00 E-value=1.5e-38 Score=227.93 Aligned_cols=263 Identities=14% Similarity=0.106 Sum_probs=212.1 Q ss_pred CCCcccccccccccccccccccCchhhHHH-HHHhHHHHHHHHHhhhhcccccc-cccc--ccceEEEecccc-eeeeee Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATAL-KLFSGEVFTAFNNASIFKGLVRS-YDLR--GGKSKQFMFTGK-LSAGYH 75 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~-e~f~g~V~~~f~~~s~~~~~v~~-r~~~--~G~tv~i~~iG~-~t~~~~ 75 (332) |+| ..|+.. .+++ |+|+.+|.+++.+..++.++... +++. .|++|+||.... ..+.+| T Consensus 1 m~~--~~T~l~---------------d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~ 63 (274) T protein:vir:95 1 MAQ--GMTKLT---------------NQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVV 63 (274) T ss_pred CCc--ceeehh---------------heechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccc Confidence 888 456653 1454 99999999999999999998754 4454 499999998664 356789 Q ss_pred cCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 76 TPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGE 155 (332) Q Consensus 76 ~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~ 155 (332) ..|+.++. ++++.++.+++|++ .++.|.|+|++..++..|++++++++++++||+++|+.++..+.++... T Consensus 64 ~~g~~i~~-~~lt~~~~~~~i~~-~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~------- 134 (274) T protein:vir:95 64 AEGEKIPT-DILETKKREAKIRK-IAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLT------- 134 (274) T ss_pred cCCCccch-hhcccceeEEEeee-eecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccccc------- Confidence 99999875 57999999999998 5899999999999999999999999999999999999998777543211 Q ss_pred cccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeee Q lcl|Aclame:pro 156 PGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYS 235 (332) Q Consensus 156 ~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~ 235 (332) +. .... .++.|.+|..+|++++. .+||++|.|++|+.|++..-.+|+...-. .++.+++|. |++ T Consensus 135 -----~~---~~~~----~~d~i~~A~~~lgd~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~-g~~~~~~G~-ig~ 198 (274) T protein:vir:95 135 -----VE---ADIT----KLTGLQTAIDKFNDEDL--EPMVLFISPLDAGKLRGDATTNFTRATEL-GDDVIVKGA-FGE 198 (274) T ss_pred -----cc---cccc----CHHHHHHHHHHhccccc--cccEEEeCHHHHHHHHhhccccccccccc-cccceeccc-cce Confidence 10 1111 27889999999998775 78999999999999986422356654332 346778885 999 Q ss_pred eeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHH Q lcl|Aclame:pro 236 IAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLA 315 (332) Q Consensus 236 i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~ 315 (332) ++||+|++||++|.. .+++|++-|+++...+++.+|..|+ +..+.|.|.+++. T Consensus 199 ~~G~~Vi~s~~~~~~------------------------t~~l~~~gA~~~~~~~~~~vE~~Rd---~~~~~d~i~~~~~ 251 (274) T protein:vir:95 199 ALGAVIVRSNKLEAG------------------------TAILAKKGAVKLITKRDFFLETDRD---PSTKTTALYSDKH 251 (274) T ss_pred ecCeEEEEeCCCCCc------------------------eEEEEeccceeeeecCCcccccccc---cccccCEEEEeEE Confidence 999999999999831 1468889999998888888887665 5568999999999 Q ss_pred hCCceechhheeeeecC Q lcl|Aclame:pro 316 MGCGSLRTSVAGSFQAA 332 (332) Q Consensus 316 ~G~~vlrpe~~v~i~~A 332 (332) ||++++||++++.+.++ T Consensus 252 y~~~~~~~~~~v~~tk~ 268 (274) T protein:vir:95 252 YVAYLYDESKAVKITKG 268 (274) T ss_pred EEEEEEcCCcEEEEEcC Confidence 99999999999999999 No 37 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=100.00 E-value=1.5e-38 Score=227.93 Aligned_cols=263 Identities=14% Similarity=0.106 Sum_probs=212.1 Q ss_pred CCCcccccccccccccccccccCchhhHHH-HHHhHHHHHHHHHhhhhcccccc-cccc--ccceEEEecccc-eeeeee Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATAL-KLFSGEVFTAFNNASIFKGLVRS-YDLR--GGKSKQFMFTGK-LSAGYH 75 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~-e~f~g~V~~~f~~~s~~~~~v~~-r~~~--~G~tv~i~~iG~-~t~~~~ 75 (332) |+| ..|+.. .+++ |+|+.+|.+++.+..++.++... +++. .|++|+||.... ..+.+| T Consensus 1 m~~--~~T~l~---------------d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~ 63 (274) T protein:vir:96 1 MAQ--GMTKLT---------------NQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVV 63 (274) T ss_pred CCc--ceeehh---------------heechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccc Confidence 888 456653 1454 99999999999999999998754 4454 499999998664 356789 Q ss_pred cCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 76 TPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGE 155 (332) Q Consensus 76 ~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~ 155 (332) ..|+.++. ++++.++.+++|++ .++.|.|+|++..++..|++++++++++++||+++|+.++..+.++... T Consensus 64 ~~g~~i~~-~~lt~~~~~~~i~~-~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~------- 134 (274) T protein:vir:96 64 AEGEKIPT-DILETKKREAKIRK-IAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLT------- 134 (274) T ss_pred cCCCccch-hhcccceeEEEeee-eecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccccc------- Confidence 99999875 57999999999998 5899999999999999999999999999999999999998777543211 Q ss_pred cccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeee Q lcl|Aclame:pro 156 PGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYS 235 (332) Q Consensus 156 ~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~ 235 (332) +. .... .++.|.+|..+|++++. .+||++|.|++|+.|++..-.+|+...-. .++.+++|. |++ T Consensus 135 -----~~---~~~~----~~d~i~~A~~~lgd~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~-g~~~~~~G~-ig~ 198 (274) T protein:vir:96 135 -----VE---ADIT----KLTGLQTAIDKFNDEDL--EPMVLFISPLDAGKLRGDATTNFTRATEL-GDDVIVKGA-FGE 198 (274) T ss_pred -----cc---cccc----CHHHHHHHHHHhccccc--cccEEEeCHHHHHHHHhhccccccccccc-cccceeccc-cce Confidence 10 1111 27889999999998775 78999999999999986422356654332 346778885 999 Q ss_pred eeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHH Q lcl|Aclame:pro 236 IAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLA 315 (332) Q Consensus 236 i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~ 315 (332) ++||+|++||++|.. .+++|++-|+++...+++.+|..|+ +..+.|.|.+++. T Consensus 199 ~~G~~Vi~s~~~~~~------------------------t~~l~~~gA~~~~~~~~~~vE~~Rd---~~~~~d~i~~~~~ 251 (274) T protein:vir:96 199 ALGAVIVRSNKLEAG------------------------TAILAKKGAVKLITKRDFFLETDRD---PSTKTTALYSDKH 251 (274) T ss_pred ecCeEEEEeCCCCCc------------------------eEEEEeccceeeeecCCcccccccc---cccccCEEEEeEE Confidence 999999999999831 1468889999998888888887665 5568999999999 Q ss_pred hCCceechhheeeeecC Q lcl|Aclame:pro 316 MGCGSLRTSVAGSFQAA 332 (332) Q Consensus 316 ~G~~vlrpe~~v~i~~A 332 (332) ||++++||++++.+.++ T Consensus 252 y~~~~~~~~~~v~~tk~ 268 (274) T protein:vir:96 252 YVAYLYDESKAVKITKG 268 (274) T ss_pred EEEEEEcCCcEEEEEcC Confidence 99999999999999999 No 38 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=100.00 E-value=4.5e-38 Score=225.34 Aligned_cols=263 Identities=13% Similarity=0.108 Sum_probs=213.2 Q ss_pred CCCcccccccccccccccccccCchhhHHH-HHHhHHHHHHHHHhhhhccccccc-cc--cccceEEEecccce-eeeee Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATAL-KLFSGEVFTAFNNASIFKGLVRSY-DL--RGGKSKQFMFTGKL-SAGYH 75 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~-e~f~g~V~~~f~~~s~~~~~v~~r-~~--~~G~tv~i~~iG~~-t~~~~ 75 (332) |+| ..|+.. .+++ |+|+.+|.++|.+..++.+++... ++ ..|++|+||..+.. .+.+| T Consensus 1 ma~--~~T~l~---------------d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~~~ 63 (274) T protein:vir:12 1 MAQ--GLTKTS---------------NQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVV 63 (274) T ss_pred CCc--ceeehh---------------hhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecCCCccccc Confidence 888 456552 1444 999999999999999999998764 33 45999999986542 57789 Q ss_pred cCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 76 TPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGE 155 (332) Q Consensus 76 ~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~ 155 (332) ..|+.++. ++++.++.+++|++ .++.|.|+|++..++..|++++++++++++|++++|+.++..+.++..+. T Consensus 64 ~~g~~i~~-~~lt~~~~~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~~~------ 135 (274) T protein:vir:12 64 AEGEKIPT-DILETKKREAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTV------ 135 (274) T ss_pred cCCCccch-hhcccceeeEEeee-ecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccc------ Confidence 99999875 57999999999999 58999999999999999999999999999999999999988775432110 Q ss_pred cccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeee Q lcl|Aclame:pro 156 PGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYS 235 (332) Q Consensus 156 ~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~ 235 (332) ..... .++.|.+|..+|++++. .+||++|.|++|+.|++....+|++..-. .++.+++|. +++ T Consensus 136 ---------~~~a~----~~d~i~dA~~~lgd~~~--~~~~ivv~p~~~~~L~k~~~~~fv~~s~~-g~~~~~~G~-ig~ 198 (274) T protein:vir:12 136 ---------NADIT----KLNGLQSAIDKFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATEL-GDDIIVKGA-FGE 198 (274) T ss_pred ---------ccccc----CHHHHHHHHHHhccccc--cccEEEeCHHHHHHHHhhhhhhccccccc-cccceeccc-cee Confidence 01111 27889999999998764 78999999999999986422367764332 346678885 999 Q ss_pred eeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHH Q lcl|Aclame:pro 236 IAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLA 315 (332) Q Consensus 236 i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~ 315 (332) ++||+|++|+++|.. .+++|++-|+++...+++++|..|+ +..+.|.|.+++. T Consensus 199 ~~G~~Vi~s~~~p~~------------------------t~~l~~~gA~~~~~~~~~~vE~~Rd---~~~~~d~i~~~~~ 251 (274) T protein:vir:12 199 ALGAIIVRSNKLEAG------------------------TAILAKKGAVKLILKRDFFLEVARD---ASTKTTALYSDKH 251 (274) T ss_pred ecCeeEEEeCCCCcc------------------------eEEEEeccceeeeecCCceeccccc---hhhcccEEEeeeE Confidence 999999999999941 1468889999998888888887665 5568999999999 Q ss_pred hCCceechhheeeeecC Q lcl|Aclame:pro 316 MGCGSLRTSVAGSFQAA 332 (332) Q Consensus 316 ~G~~vlrpe~~v~i~~A 332 (332) ||++++||+.++.+..| T Consensus 252 y~~~~~~~~~vv~~t~~ 268 (274) T protein:vir:12 252 YVAYLYDESKAVKITKG 268 (274) T ss_pred EEEEEEcCCceEEEEcC Confidence 99999999999999999 No 39 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=100.00 E-value=8.1e-38 Score=223.93 Aligned_cols=263 Identities=13% Similarity=0.105 Sum_probs=212.8 Q ss_pred CCCcccccccccccccccccccCchhhHHH-HHHhHHHHHHHHHhhhhccccccc-cc--cccceEEEecccc-eeeeee Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATAL-KLFSGEVFTAFNNASIFKGLVRSY-DL--RGGKSKQFMFTGK-LSAGYH 75 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~-e~f~g~V~~~f~~~s~~~~~v~~r-~~--~~G~tv~i~~iG~-~t~~~~ 75 (332) |+| ..|+.. .+++ |+|+.+|.+++.+..++.+++... ++ +.|++|+||..+. ..+.+| T Consensus 1 ma~--~~T~~~---------------d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~ 63 (274) T protein:vir:94 1 MPQ--GLTKTS---------------DQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVV 63 (274) T ss_pred CCc--cceehh---------------heechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccc Confidence 888 466652 1444 999999999999999999998764 34 3499999999764 367789 Q ss_pred cCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 76 TPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGE 155 (332) Q Consensus 76 ~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~ 155 (332) ..|+.++. ++++.++.+++|++ ..+.|.|+|++..++..|++++++++++++|++.+|+.++..+.++.... T Consensus 64 ~~g~~i~~-~~lt~~~~~~~i~~-~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~------ 135 (274) T protein:vir:94 64 AEGEKIPT-DILETKKREAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTV------ 135 (274) T ss_pred cCCCcccc-cccccceeEEEeee-ecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCccc------ Confidence 99999875 57999999999998 46899999999999999999999999999999999999998775432110 Q ss_pred cccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeee Q lcl|Aclame:pro 156 PGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYS 235 (332) Q Consensus 156 ~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~ 235 (332) .+..++ ++.|.+|..+|++++. .+||++|.|++|..|++....+|+...-. .++.+++|. +++ T Consensus 136 ---------~~~~~~----~d~i~dA~~~l~d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~-g~~~~~~G~-ig~ 198 (274) T protein:vir:94 136 ---------NADITK----LNGLQSAIDKFNDEDL--EPMVLFVNPLDAGKLRGDASTNFTRATEL-GDDIIVKGA-FGE 198 (274) T ss_pred ---------cccccC----HHHHHHHHHHhhccCC--CceEEEeCHHHHHHHHhhhhhhccccCcc-cccceeccc-cce Confidence 011112 7889999999998876 67999999999999986432467654332 345677885 999 Q ss_pred eeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHH Q lcl|Aclame:pro 236 IAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLA 315 (332) Q Consensus 236 i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~ 315 (332) ++||+|++||++|.. .+++|++.|+++++.+++.+|..|+ +..+.|.|.+++. T Consensus 199 ~~G~~Vi~s~~~p~~------------------------t~~l~~~gA~~~~~~~~~~vE~~Rd---~~~~~d~i~~~~~ 251 (274) T protein:vir:94 199 ALGAIIVRTNKLEAG------------------------TAILAKKGAVKLILKRDFFLEVARD---ASTKTTALYSDKH 251 (274) T ss_pred ecCeeEEEcCCCCcc------------------------eEEEEeCcceEeeecCCceeccccc---hhhcccEEEEEEE Confidence 999999999999831 1468899999999888888886654 5668999999999 Q ss_pred hCCceechhheeeeecC Q lcl|Aclame:pro 316 MGCGSLRTSVAGSFQAA 332 (332) Q Consensus 316 ~G~~vlrpe~~v~i~~A 332 (332) ||+++++|+.++.+..+ T Consensus 252 y~~~~~~~~~vv~~t~~ 268 (274) T protein:vir:94 252 YVAYLYDESKAVKITKG 268 (274) T ss_pred EEEEEEcCCceEEEecC Confidence 99999999999999999 No 40 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=100.00 E-value=8.1e-38 Score=223.93 Aligned_cols=263 Identities=13% Similarity=0.105 Sum_probs=212.8 Q ss_pred CCCcccccccccccccccccccCchhhHHH-HHHhHHHHHHHHHhhhhccccccc-cc--cccceEEEecccc-eeeeee Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATAL-KLFSGEVFTAFNNASIFKGLVRSY-DL--RGGKSKQFMFTGK-LSAGYH 75 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~-e~f~g~V~~~f~~~s~~~~~v~~r-~~--~~G~tv~i~~iG~-~t~~~~ 75 (332) |+| ..|+.. .+++ |+|+.+|.+++.+..++.+++... ++ +.|++|+||..+. ..+.+| T Consensus 1 ma~--~~T~~~---------------d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~ 63 (274) T protein:vir:97 1 MPQ--GLTKTS---------------DQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVV 63 (274) T ss_pred CCc--cceehh---------------heechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccc Confidence 888 466652 1444 999999999999999999998764 34 3499999999764 367789 Q ss_pred cCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 76 TPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGE 155 (332) Q Consensus 76 ~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~ 155 (332) ..|+.++. ++++.++.+++|++ ..+.|.|+|++..++..|++++++++++++|++.+|+.++..+.++.... T Consensus 64 ~~g~~i~~-~~lt~~~~~~~i~~-~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~------ 135 (274) T protein:vir:97 64 AEGEKIPT-DILETKKREAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTV------ 135 (274) T ss_pred cCCCcccc-cccccceeEEEeee-ecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCccc------ Confidence 99999875 57999999999998 46899999999999999999999999999999999999998775432110 Q ss_pred cccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeee Q lcl|Aclame:pro 156 PGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYS 235 (332) Q Consensus 156 ~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~ 235 (332) .+..++ ++.|.+|..+|++++. .+||++|.|++|..|++....+|+...-. .++.+++|. +++ T Consensus 136 ---------~~~~~~----~d~i~dA~~~l~d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~-g~~~~~~G~-ig~ 198 (274) T protein:vir:97 136 ---------NADITK----LNGLQSAIDKFNDEDL--EPMVLFVNPLDAGKLRGDASTNFTRATEL-GDDIIVKGA-FGE 198 (274) T ss_pred ---------cccccC----HHHHHHHHHHhhccCC--CceEEEeCHHHHHHHHhhhhhhccccCcc-cccceeccc-cce Confidence 011112 7889999999998876 67999999999999986432467654332 345677885 999 Q ss_pred eeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHH Q lcl|Aclame:pro 236 IAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLA 315 (332) Q Consensus 236 i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~ 315 (332) ++||+|++||++|.. .+++|++.|+++++.+++.+|..|+ +..+.|.|.+++. T Consensus 199 ~~G~~Vi~s~~~p~~------------------------t~~l~~~gA~~~~~~~~~~vE~~Rd---~~~~~d~i~~~~~ 251 (274) T protein:vir:97 199 ALGAIIVRTNKLEAG------------------------TAILAKKGAVKLILKRDFFLEVARD---ASTKTTALYSDKH 251 (274) T ss_pred ecCeeEEEcCCCCcc------------------------eEEEEeCcceEeeecCCceeccccc---hhhcccEEEEEEE Confidence 999999999999831 1468899999999888888886654 5668999999999 Q ss_pred hCCceechhheeeeecC Q lcl|Aclame:pro 316 MGCGSLRTSVAGSFQAA 332 (332) Q Consensus 316 ~G~~vlrpe~~v~i~~A 332 (332) ||+++++|+.++.+..+ T Consensus 252 y~~~~~~~~~vv~~t~~ 268 (274) T protein:vir:97 252 YVAYLYDESKAVKITKG 268 (274) T ss_pred EEEEEEcCCceEEEecC Confidence 99999999999999999 No 41 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=100.00 E-value=2.8e-37 Score=220.97 Aligned_cols=291 Identities=15% Similarity=0.139 Sum_probs=198.7 Q ss_pred CCCcccccccccccccccccccCchhhHH-HHHHhHHHHHHHHHhhhhccccccc---cc---cccceEEEecccceeee Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATA-LKLFSGEVFTAFNNASIFKGLVRSY---DL---RGGKSKQFMFTGKLSAG 73 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~-~e~f~g~V~~~f~~~s~~~~~v~~r---~~---~~G~tv~i~~iG~~t~~ 73 (332) |+| + +..| .++|+.+.++.|+++.++.++|+.+ .+ +.|+||+|++.+.+++. T Consensus 1 MaN--~-------------------llT~ip~iia~~al~~l~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~~~~~~ 59 (423) T protein:vir:17 1 MPN--N-------------------LDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSL 59 (423) T ss_pred Ccc--c-------------------hhhhhHHHHHHHHHHHHHhhcccchhhcccCCcchhhcccCCEEEEeeCCcceee Confidence 665 1 1124 4899999999999999999999753 22 35999999999999999 Q ss_pred eecCCC--CCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|Aclame:pro 74 YHTPGT--PIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASP 151 (332) Q Consensus 74 ~~~~g~--~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~ 151 (332) +|+... .+. .+++...++.|+||+.||++|.++|.|+.+...|+ +++.++++++||+.+|+.|+..+.+.+.. T Consensus 60 ~~~~~~~~~~~-~~~l~e~~v~l~id~~k~va~~v~d~E~~~~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~a~~--- 134 (423) T protein:vir:17 60 RTPTGDISGQN-KNNLISGKATGRVGNYITVAVEYQQLEEAIKLNQL-EEILAPVRQRIVTDLETELAHFMMNNGAL--- 134 (423) T ss_pred cccCcccCCcc-cCccccceeEEEeeceeeeeeeecHHHHhcChhHH-HHHHHHHHHHHHHHHHHHHHHHHhhcccc--- Confidence 997532 333 35788888999999999999999999998776666 78999999999999999999887554221 Q ss_pred cccccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccc Q lcl|Aclame:pro 152 VTGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGK 231 (332) Q Consensus 152 ~~~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~ 231 (332) ..+.+ +... ..|+.+.+++.+|++++||..|||+||+|++|..||++ +.+|.+.+.+ ....+++|+ T Consensus 135 ~~gt~---------~t~~---~a~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~-~~~~~~~~~~-~~~alr~g~ 200 (423) T protein:vir:17 135 SLGSP---------NTPI---TKWSDVAQTASFLKDLGVNEGENYAVMDPWSAQRLADA-QTGLHASDQL-VRTAWENAQ 200 (423) T ss_pred ccccC---------Cccc---ccHHHHHHHHHHHHhccCCcCCCEEEeChHHHHHHhcc-ccceeccccc-chHHHhhcc Confidence 11111 1111 13788999999999999999999999999999999864 4566655544 445689997 Q ss_pred eeeeeeceEEEeeCccccc-ccccccc-----------ccc-----------------------------cc-------- Q lcl|Aclame:pro 232 GLYSIAGIRILKSNNLAGL-YGQDLSS-----------AAV-----------------------------TG-------- 262 (332) Q Consensus 232 ~v~~i~G~~V~~sn~lp~~-~g~~~~~-----------~~~-----------------------------~g-------- 262 (332) .+|+++||+||+||+||.. .++.+.. ++. +| T Consensus 201 i~G~i~GFdvy~Snnip~~T~gt~~~t~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~t~aGv~~v~~~t 280 (423) T protein:vir:17 201 IPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQPTVTYNAVKDSYQFTVTLTGATTSVTGFLKAGDQVKFTNTYWLQQQT 280 (423) T ss_pred ceeeecceEEEEeCCCccccccceeceeeecccccccccccccccceeeeeeeeeeeccCceeecceEEecceeeecccc Confidence 5599999999999999942 2222110 000 01 Q ss_pred ----------ccccccccc------------------------------------------------cceEEEeechhhh Q lcl|Aclame:pro 263 ----------ENNDYQVDA------------------------------------------------SALAGLIFHREAA 284 (332) Q Consensus 263 ----------~~~~y~~~~------------------------------------------------~~~~~l~~h~~a~ 284 (332) ....|.+.. .-+.-|+||++|+ T Consensus 281 k~v~~~~~t~~~~~~~v~~~~~~~a~~~~tv~i~p~~i~~~~~~~~~~v~a~~a~~~~vT~~~~a~~t~~~nl~~~~~a~ 360 (423) T protein:vir:17 281 KQALYNGATPISFTATVTADANSDSSGDVTVTLSGVPIYDTTNPQYNSVSRQVAAGDAVSVVGTASQTMKPNLFYNKFFC 360 (423) T ss_pred cccccccccccceEEEEEecccccccCceEEEecCccccccCCcccccceecccCCceeeccccccCCeeEEEEecCcce Confidence 000111110 0112379999987 Q ss_pred hhhhh---------------ccceeeeeecccchhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 285 GCIQS---------------VAPTIQTTSGDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 285 ~~~~~---------------~~~~~e~~~~~~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) ..+.. .+++++.... ++.+---+.++--..||.+.+|||.++-|.-= T Consensus 361 ~l~~~pl~~~~~~~~~~~~~~g~s~r~~~~-~d~~~~~~~~r~d~l~g~~~~~p~~~~~~~g~ 422 (423) T protein:vir:17 361 GLGSIPLPKLHSIDSAVATYEGFSIRVHKY-ADGDANVQKMRFDLLPAYVCFNPHMGGQFFGN 422 (423) T ss_pred EEEEEcccCCCccceeecccCCcEEEEEEe-cccccceeEEEEEeecceeeeccceEEEEEec Confidence 65432 2222222211 00000011133334699999999998766655 No 42 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=100.00 E-value=1.7e-37 Score=222.13 Aligned_cols=265 Identities=14% Similarity=0.110 Sum_probs=212.2 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccc-ccc--ccceEEEecccce-eeeeec Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSY-DLR--GGKSKQFMFTGKL-SAGYHT 76 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r-~~~--~G~tv~i~~iG~~-t~~~~~ 76 (332) |+. ++.|+.. +-+.=|+|+.+|.+.+.+..++.+++... ++. .|++|+||..... .+.++. T Consensus 1 ~~~-~~~T~l~--------------d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~ 65 (275) T protein:vir:96 1 MAL-ENMTKLA--------------NMVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAKVVP 65 (275) T ss_pred CCC-cccchhh--------------hhhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeeccCCcccccc Confidence 777 3456652 11334999999999999999999998654 443 4999999987653 567899 Q ss_pred CCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|Aclame:pro 77 PGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEP 156 (332) Q Consensus 77 ~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~ 156 (332) .|+.++. ++++.++.+++|.+ .++.|.|+|++..++..|++.+++++++++||+++|+.++..+.++... T Consensus 66 ~g~~i~~-~~lt~~~~~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~~~-------- 135 (275) T protein:vir:96 66 EGEEIPI-DLIETKKRQATIRK-IGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGATLK-------- 135 (275) T ss_pred CCCCcch-hhcccceeeEEeeh-hcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc-------- Confidence 9999885 57999999999977 5999999999999999999999999999999999999998776443211 Q ss_pred ccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeee Q lcl|Aclame:pro 157 GGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSI 236 (332) Q Consensus 157 ~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i 236 (332) + ..... .+|.|.+|..+|.+.+. ++||++|.|++|..|++....+|+.....+ ++.+++|. ++++ T Consensus 136 ----~---~~~~~----~~d~i~dA~~~lgd~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g-~~~~~~G~-ig~~ 200 (275) T protein:vir:96 136 ----V---EADIT----KLAGLQTAIDKFNDEDL--EPMVLFVNPLDAGKLRASATDNFTRATLLG-DNVIVKGA-FGEA 200 (275) T ss_pred ----c---ccccc----CHHHHHHHHHHhccccC--CccEEEeCHHHHHHHHhccccccccccccc-ccceeccc-ccee Confidence 0 01111 27889999999987764 789999999999999875445787665433 45678885 9999 Q ss_pred eceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHHh Q lcl|Aclame:pro 237 AGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAM 316 (332) Q Consensus 237 ~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~~ 316 (332) +|++|++||++|.. .+++|++.|++++...++++|..|+ +..+.|.|.+++.| T Consensus 201 ~G~~Vi~s~~~p~~------------------------t~~i~~~gA~~~~~~~~~~vE~~Rd---~~~~~d~i~~~~~y 253 (275) T protein:vir:96 201 LGAIIVRSNKIKEG------------------------EAILAKRGAVKLITKRDFFLETERH---ASHKSTALFSDKHY 253 (275) T ss_pred cCeeEEEeCCCCcc------------------------eEEEEeccceeeeecCCcccccccc---hhhcCcEEEEeEEE Confidence 99999999999842 2467899999999888888886654 56689999999999 Q ss_pred CCceechhheeeeecC Q lcl|Aclame:pro 317 GCGSLRTSVAGSFQAA 332 (332) Q Consensus 317 G~~vlrpe~~v~i~~A 332 (332) |++++||++++.+..- T Consensus 254 ~~~~~~~~~vv~~t~~ 269 (275) T protein:vir:96 254 VAYLYDESKVVKITKS 269 (275) T ss_pred EEEEEcCccEEEEEec Confidence 9999999998887655 No 43 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=100.00 E-value=5.6e-37 Score=219.33 Aligned_cols=291 Identities=15% Similarity=0.145 Sum_probs=199.8 Q ss_pred CCCcccccccccccccccccccCchhhHH-HHHHhHHHHHHHHHhhhhccccccc---cc---cccceEEEecccceeee Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATA-LKLFSGEVFTAFNNASIFKGLVRSY---DL---RGGKSKQFMFTGKLSAG 73 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~-~e~f~g~V~~~f~~~s~~~~~v~~r---~~---~~G~tv~i~~iG~~t~~ 73 (332) |+| + +..| .|+|+.++++.|+++.++.++|+.. .+ +.|+||+|++.+..++. T Consensus 1 MaN--~-------------------llT~~p~iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~~~~~~ 59 (423) T protein:vir:10 1 MPN--N-------------------LDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSL 59 (423) T ss_pred Ccc--c-------------------hhhhhHHHHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCCceeee Confidence 665 1 1124 3899999999999999999999753 23 35999999999999999 Q ss_pred eecCC--CCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|Aclame:pro 74 YHTPG--TPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASP 151 (332) Q Consensus 74 ~~~~g--~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~ 151 (332) +|+.+ +.+. .+++...++.|+||+.||++|.++|.|+.+...++ +++.+++.++||+.+|+.++..+...+.. T Consensus 60 d~~~~~~~~~~-~~dl~e~~v~l~id~~k~va~~v~d~E~~~~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~~~~--- 134 (423) T protein:vir:10 60 RTPTGDISGQN-KNNLISGKATGRVGNYITVAVEYQQLEEAIKLNQL-EEILAPVRQRIVTDLETELAHFMMNNGAL--- 134 (423) T ss_pred ccCCccccccc-cCccccceeEEEeeceeeeeeeechHHHhcChhhH-HHHHHHHHHHHHHHHHHHHHHHHhhcccc--- Confidence 99864 3344 35788899999999999999999999998766565 88999999999999999999876543221 Q ss_pred cccccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccc Q lcl|Aclame:pro 152 VTGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGK 231 (332) Q Consensus 152 ~~~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~ 231 (332) ..+. ++... ..|+.+.+++.+|++++||..|||+||+|++|..||+. +.+|.+.+.++ ...+++|+ T Consensus 135 ~~gt---------~~t~~---~a~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~-~~~~~~~~~~~-~~alr~g~ 200 (423) T protein:vir:10 135 SLGS---------PNTPI---TKWSDVAQTASFLKDLGVNEGENYAVMDPWSAQRLADA-QTGLHASDQLV-RTAWENAQ 200 (423) T ss_pred cccc---------CCccc---chHHHHHHHHHHHHhccCCcCCCEEEeChHHHHHHhcc-ccceecccccc-hhhhhhcc Confidence 1111 11111 23788999999999999999999999999999999864 55666655444 45589998 Q ss_pred eeeeeeceEEEeeCcccc-cccccccc-----------cccc----------------------cccccccc-------- Q lcl|Aclame:pro 232 GLYSIAGIRILKSNNLAG-LYGQDLSS-----------AAVT----------------------GENNDYQV-------- 269 (332) Q Consensus 232 ~v~~i~G~~V~~sn~lp~-~~g~~~~~-----------~~~~----------------------g~~~~y~~-------- 269 (332) .+|+++||+||+||+||. +.++.+.. ++.. |....|++ T Consensus 201 i~G~i~GFdv~~Snnip~~T~gt~~~t~~~~~~~~v~~~a~~~a~~~~~~~~~~~~~~~~~l~~GD~~t~aGv~~v~~~t 280 (423) T protein:vir:10 201 IPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQPTVTYNAVKDSYQFTVTLTGATASVTGFLKAGDQVKFTNTYWLQQQT 280 (423) T ss_pred ceeeecceEEEEeCCCccccccccccceeeeecceeccccccccceeeeeeeeccccccCceeecceEEecceeeecccc Confidence 569999999999999995 22322110 0000 10111111 Q ss_pred -----------------cc------------------------------------------------cceEEEeechhhh Q lcl|Aclame:pro 270 -----------------DA------------------------------------------------SALAGLIFHREAA 284 (332) Q Consensus 270 -----------------~~------------------------------------------------~~~~~l~~h~~a~ 284 (332) .. .-+.-|+||++|+ T Consensus 281 k~~~~~~~t~~~~~~~v~a~~~~~~~g~~tv~i~p~~i~~~~~~~~~~v~a~~a~~~~vT~~~~a~~t~~~nl~~~~~a~ 360 (423) T protein:vir:10 281 KQALYNGATPISFTATVTADANSDSGGDVTVTLSGVPIYDTTNPQYNSVSRQVEAGDAVSVVGTASQTMKPNLFYNKFFC 360 (423) T ss_pred cccccccccCcceEEEEEeeeeeccCCceeeeccCccccccCCcccccccccccCCceeeccccccCCeeEEEEecCcce Confidence 00 0112379999987 Q ss_pred hhhhh---------------ccceeeeeecccchhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 285 GCIQS---------------VAPTIQTTSGDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 285 ~~~~~---------------~~~~~e~~~~~~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) ..+.. .+++++.... ++.+-.-+.++--..||.+.+|||.++-|.-= T Consensus 361 ~l~~~pl~~~~~~~~~~~~~~g~s~r~~~~-~d~~~~~~~~r~d~l~g~~~~~p~~~~~~~g~ 422 (423) T protein:vir:10 361 GLGSIPLPKLHSIDSAVATYEGFSIRVHKY-ADGDANVQKMRFDLLPAYVCFNPHMGGQFFGN 422 (423) T ss_pred EEEEEcccCCCccceeeccccCceEEEEEe-eeccccceEEEEEeecceeeeccceEEEEEec Confidence 65432 2222222211 11111111233334699999999998766655 No 44 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=100.00 E-value=3.4e-36 Score=215.04 Aligned_cols=283 Identities=14% Similarity=0.073 Sum_probs=194.9 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccc-----cccccceEEEecccceeeeee Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSY-----DLRGGKSKQFMFTGKLSAGYH 75 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r-----~~~~G~tv~i~~iG~~t~~~~ 75 (332) |+.+ -|.|+|+.++++.|...+++..|.+.. .+.+|++||||+++.+.++|| T Consensus 1 MA~~-----------------------n~a~~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~~~gg~tVkI~~i~~~gl~DY 57 (299) T protein:vir:79 1 MAAL-----------------------NYAKEYSNVLAQAYPYTLNFGDLYATPNNGRYRWTGSKTIEIPTISTTGRVDS 57 (299) T ss_pred Cccc-----------------------hhHHHHHHHHHHHHHhhceeeeeccCcccceeeecCCCEEEEecccccccccc Confidence 4431 267999999999999999988776643 235789999999999999999 Q ss_pred cCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhH--HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|Aclame:pro 76 TPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYST--RAEVSKQIGEALATHYDERIARVLAKASAEASPVT 153 (332) Q Consensus 76 ~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~--~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~ 153 (332) ++++....+.+++.+..+++||+.+++.|.||++|..|++..+ .....+.+.+.++.++|.+.+..++..+...+. T Consensus 58 ~R~~~g~~~g~~~~~~~t~~ldqdr~~~f~vD~~Dvdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl~~~a~~~g~-- 135 (299) T protein:vir:79 58 NRDTIAVAQRNYDNAWEPKVLTNQRKWSTLVHPADINQTNYVASIGNITKVYNEEQKFPEMDAYCISKIYADWTALGN-- 135 (299) T ss_pred ccCCCcccccccCcceeEEEeeccccceeccchhhHHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHHHhhhhcCC-- Confidence 9976544445788899999999999999999977777776554 334456677888999999999888655432110 Q ss_pred cccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhcccccccccccccccee Q lcl|Aclame:pro 154 GEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGL 233 (332) Q Consensus 154 ~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v 233 (332) .+...+.+++++|++|.++.++|+|++||.+|||++|+|++|.+|+++ ++|....-....+..++|. | T Consensus 136 ---------~~~~~~~T~~n~y~~i~~~~~~lde~~vP~~~rvl~vtp~~~~~L~~~--~~f~k~~~~~~~~~~~~g~-V 203 (299) T protein:vir:79 136 ---------TADTTVLTTTNVLEVFDKLMEKMTEARVPENGRILYVTPVVNTLIKNA--KEIQRTVNIKDAGTSLNRQ-T 203 (299) T ss_pred ---------cccccccCHHHHHHHHHHHHHHHHhcCCCCCCeEEEeCHHHHHHHhhc--hhhhcccccccccceeeee-e Confidence 112233457889999999999999999999999999999999988754 6777654444445567775 9 Q ss_pred eeeeceEEEe--eCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHH Q lcl|Aclame:pro 234 YSIAGIRILK--SNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIV 311 (332) Q Consensus 234 ~~i~G~~V~~--sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~ 311 (332) +++.||+|++ |++++..- ....+..++ .++-+.--++.|+.|+....-.+ .+.+|...- ...+|... T Consensus 204 g~idG~~Ii~Vps~r~~t~~--~~~~G~~~~------~~ak~in~ii~~~~a~~~~~K~~-~~~~~~P~~--~~~~~~~~ 272 (299) T protein:vir:79 204 TDIDTVKIIKVPSNLMKTAY--DFTTGWKVG------AGAKQIFMSLVHPSAIITPVSYQ-FSKLDEPTA--VTEGKYFY 272 (299) T ss_pred eeecceEEEEechhhcCccc--eeccCcccc------CcccccceEEEcCCeeeeeEeee-eEEeecCCC--CCccceee Confidence 9999999987 77887421 111111111 11222335788999876555444 455554322 22344332 Q ss_pred HHHHhCCceec-----hhheeeeecC Q lcl|Aclame:pro 312 GKLAMGCGSLR-----TSVAGSFQAA 332 (332) Q Consensus 312 ~~~~~G~~vlr-----pe~~v~i~~A 332 (332) . ++.-|.+.- +-+-+.+++| T Consensus 273 ~-~r~y~d~~v~~nk~~~i~~~~~~a 297 (299) T protein:vir:79 273 F-EESFEDVFILNKKADAIQFVVEGA 297 (299) T ss_pred e-eeeeeeeeeeccccCeEEEEeeec Confidence 2 232233322 2223566666 No 45 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=100.00 E-value=4.5e-35 Score=208.87 Aligned_cols=292 Identities=14% Similarity=0.117 Sum_probs=195.0 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccc---cc---cccceEEEecccceeeee Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSY---DL---RGGKSKQFMFTGKLSAGY 74 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r---~~---~~G~tv~i~~iG~~t~~~ 74 (332) |+| +++- |--++|+.+.++.|+++.++.++|+.. .+ +.|+||+||+.+..++.+ T Consensus 1 MAN--sl~~------------------l~p~iia~~al~~l~~~lV~~~lV~r~y~~ef~~ak~GDTV~I~~P~~~~~~d 60 (423) T protein:vir:10 1 MAN--NLDA------------------NVSQIVLKKFLPGFMSDLVLCKTVDRQLLAGEINSSTGDSVSFKRPHQFKSER 60 (423) T ss_pred Ccc--cccc------------------ccHHHHHHHHHHHHHhhcccchhhccCCCccccccccCCEEEEeeCCceeeec Confidence 775 2221 445899999999999999999999843 22 258999999999999988 Q ss_pred ecCCCCCC--ccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 75 HTPGTPIV--GDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPV 152 (332) Q Consensus 75 ~~~g~~~~--~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~ 152 (332) ... ..+. ..+++...++.++||+.+|++|.++|.|+.+...++ +.+.+++.++||+.+|+.|+..+...+.. . T Consensus 61 ~~~-~~~t~~~~~~l~e~~v~l~id~~k~~a~~v~d~E~~l~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~~~~---~ 135 (423) T protein:vir:10 61 TMD-GDITGKSKNSLISAKATGEVGNYITVAVEYRQIEEALKLNQL-DQILVPINERMVTDLETELALFMMKHGAL---S 135 (423) T ss_pred ccC-cccCcccccccccceEEEEecceeeeeeeeChHHHhcChhHH-HHHHHHHHHHHHHHHHHHHHHHhhhcccc---c Confidence 543 3232 234566678999999999999999999998666666 78999999999999999998666443211 1 Q ss_pred ccccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccce Q lcl|Aclame:pro 153 TGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKG 232 (332) Q Consensus 153 ~~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~ 232 (332) .+.+ +...+ .|+.+.+++.+|++++||..|||+||+|++|..||.. +.++...+..+ ...+++|+. T Consensus 136 vgt~---------~t~~~---a~~~~a~a~~~L~~~~vP~~~R~~Vv~p~~~a~Ll~~-~~~~~~~~~~~-~~alr~~~i 201 (423) T protein:vir:10 136 LGSP---------NTPIK---KWSDVAQTASFLKDLGINSGENYAVMDPWAAQRLADA-QSGLHVSEQLV-RTAWENAQI 201 (423) T ss_pred cccc---------ccccc---cHHHHHHHHHHHhhccCCcCCCEEEeCHHHHHHHhhh-hhhhccccccc-hHHHHhccc Confidence 1111 11111 2788999999999999999999999999999999863 45555554444 456899975 Q ss_pred eeeeeceEEEeeCccccc-cccccc----ccc------c------------------------------cc--------- Q lcl|Aclame:pro 233 LYSIAGIRILKSNNLAGL-YGQDLS----SAA------V------------------------------TG--------- 262 (332) Q Consensus 233 v~~i~G~~V~~sn~lp~~-~g~~~~----~~~------~------------------------------~g--------- 262 (332) +|+++||+||+||++|.. .++... .++ . +| T Consensus 202 ~G~~~GFdi~~Sn~vp~~T~g~~~ga~~~~~~~~vt~a~~~~~~~~~~~~~~~T~s~~g~l~~GD~~t~aGv~~v~~~tk 281 (423) T protein:vir:10 202 SGNFGGIRALMSNGLASRTQGAFGGKLTVKGTPEVNYDSVKDSYAFTATLTGATASKKGFLKVGDQLQFDDTHWLNQQSK 281 (423) T ss_pred ceeecceEEEEecCCcccccccccceeeeeeeeEEEecccccccccccceeeccceeceeEEecceEeecceeeeccccc Confidence 699999999999999942 222110 000 0 01 Q ss_pred ---------cccccccccc------------------------------------------------ceEEEeechhhhh Q lcl|Aclame:pro 263 ---------ENNDYQVDAS------------------------------------------------ALAGLIFHREAAG 285 (332) Q Consensus 263 ---------~~~~y~~~~~------------------------------------------------~~~~l~~h~~a~~ 285 (332) .-..|.+..+ -+.-|+||++|+. T Consensus 282 ~~l~~~~~~~~~~~~V~~~~~~~a~~~~tv~i~p~~~~~~~~~~~~~V~a~~a~~~~vT~~~~~~~t~~~nl~~~~~a~~ 361 (423) T protein:vir:10 282 QTLYNGASALSFTATVMEDANAHSSGDVTVKISGVPIFDAGYPQYNAVDRLLAEGDTVSVIGTSKQAMKPNLFYNKLFCG 361 (423) T ss_pred ceeecccCCcceEEEEEecccccccCceEEEeccccccccCcccccceeccccCCceeEEeeccCCceeEEEEecCcceE Confidence 0011111100 1123799999875 Q ss_pred hhhh---------------ccceeeeeecccchhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 286 CIQS---------------VAPTIQTTSGDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 286 ~~~~---------------~~~~~e~~~~~~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) .+.. .+++++.... ++.+-.-+.++--..||.+.+|||.++-|.-= T Consensus 362 l~~~pl~~~~~~~~~~~~~~g~s~r~~~~-~d~~~~~~~~r~d~l~g~~~~~p~~~~~~~g~ 422 (423) T protein:vir:10 362 LGTIPLPKLHSIDSAVATYEGFSIRVHKY-ADGDANKQMMRFDLLPAYVCYNPHMGGQFFGN 422 (423) T ss_pred EEEEcccCCCccceeecccccceEEEEEe-eeccccceEEEEEeecceeeeccceEEEEEec Confidence 5432 2222222211 11111112233334699999999998766655 No 46 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=100.00 E-value=1e-34 Score=206.96 Aligned_cols=264 Identities=15% Similarity=0.110 Sum_probs=213.3 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccc-cc--cccceEEEecccce-eeeeec Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSY-DL--RGGKSKQFMFTGKL-SAGYHT 76 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r-~~--~~G~tv~i~~iG~~-t~~~~~ 76 (332) |+| ..|+.. +-+.=|+|+..|.+++.+.+++.++.... ++ ..|++++||..... ...++. T Consensus 1 Ma~--~~T~l~--------------d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~ 64 (276) T protein:vir:10 1 MAQ--GTTTKS--------------TQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVP 64 (276) T ss_pred CCc--ceeehh--------------hhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCCCcccccc Confidence 877 345542 11344999999999999999999998764 34 45999999987653 556788 Q ss_pred CCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|Aclame:pro 77 PGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEP 156 (332) Q Consensus 77 ~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~ 156 (332) .|+.++. ++++.++.+.+|.+ .+..|.++|++..++..|++++++++++++||+++|+.++..+..+... T Consensus 65 eg~~i~~-~~lt~~~~~a~i~~-~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~~~-------- 134 (276) T protein:vir:10 65 EGQKIPV-DKIETNRREAKIHK-IGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTKLT-------- 134 (276) T ss_pred CCCccCc-cccccceeeEEeeh-ccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc-------- Confidence 8998874 57999999999976 6899999999999999999999999999999999999998876432211 Q ss_pred ccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeee Q lcl|Aclame:pro 157 GGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSI 236 (332) Q Consensus 157 ~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i 236 (332) + . +... .++.|.+|..+|+++++ +.++++|.|++|..|++....+|+.....+ ++.+++|. ++++ T Consensus 135 ----~--~-~~~~----t~d~i~~A~~~lgd~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g-~~~~~~G~-ig~~ 199 (276) T protein:vir:10 135 ----V--S-ADIG----TLAGLEAAIDTFDDEDL--EPMVLFINPKDAGKLRSSASDNFTRATELG-DNIIVKGA-FGEA 199 (276) T ss_pred ----c--c-cccc----CHHHHHHHHHHhccccC--cccEEEEcHHHHHHHHHhcccccccccccc-ccceeccc-ccee Confidence 0 0 1111 27889999999998876 679999999999999876567788654333 45678885 9999 Q ss_pred eceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHHh Q lcl|Aclame:pro 237 AGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAM 316 (332) Q Consensus 237 ~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~~ 316 (332) +|++|++|+++|.. .+++|++-|++++...++++|..|+ +..+.|.|.+++.| T Consensus 200 ~G~~Vi~s~~~p~~------------------------t~~l~~~gAi~~~~~~~~~vE~dRd---~~~~~d~i~~~~~y 252 (276) T protein:vir:10 200 LGAVIVRSKKLDEG------------------------EAILAKRGAVKLITKRDFFLETDRD---PSTKTTALYSDKHY 252 (276) T ss_pred cceeEEEcCCCCcc------------------------eEEEEeccceeeeecCCceeecccc---hhhcccEEEEeeEE Confidence 99999999999831 2368899999999888988886554 56689999999999 Q ss_pred CCceechhheeeeecC Q lcl|Aclame:pro 317 GCGSLRTSVAGSFQAA 332 (332) Q Consensus 317 G~~vlrpe~~v~i~~A 332 (332) |+++++|+.++.+..| T Consensus 253 ~~~~~~~~~vv~~t~~ 268 (276) T protein:vir:10 253 VAYLYDESKAVKVTKG 268 (276) T ss_pred EEEEEcCcceEEEecC Confidence 9999999999999999 No 47 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=100.00 E-value=6.8e-33 Score=196.95 Aligned_cols=281 Identities=14% Similarity=0.036 Sum_probs=196.3 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccc-cccccceEEEecccceeeeeecCCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSY-DLRGGKSKQFMFTGKLSAGYHTPGT 79 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r-~~~~G~tv~i~~iG~~t~~~~~~g~ 79 (332) |+- . +.++|++.+++.|...+++..+.+.+ .+.+|++|+||+++.+.++||++++ T Consensus 1 Mai-----------------------n-~a~~~~~~Ld~~~~~~~~t~~l~~~~~~~~ggktVkI~~i~~~gl~DY~R~~ 56 (290) T protein:vir:78 1 MAI-----------------------N-YVDKYGKELDQKLVFGTYTNELETPNLLWLDAKTFKIQTITTTGLKAHTRNK 56 (290) T ss_pred Cch-----------------------h-HHHHHHHHHHHHHHhhheeeeccccceeeccCCEEEEeeeccCcccccccCC Confidence 322 2 34899999999999999988877543 5678999999999999999999988 Q ss_pred CCCccCCCCCceEEEEEeeeeecchhhh--hHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 80 PIVGDAGIKANEKTLVMDDLLVSSQFVY--SLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPG 157 (332) Q Consensus 80 ~~~~~~~~~~~~~~l~ID~~~~~~~~Id--d~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~ 157 (332) .... .+++.+..+++||+.+++.|.|| |+||.+....+.....+.+.+.++.++|.+.+..|+..+...+.. T Consensus 57 g~~~-g~v~~~~et~tl~qdR~~~F~vD~~DvDEt~~~~~~~nv~~ef~~~~v~PEiDayr~skla~~a~~~~~~----- 130 (290) T protein:vir:78 57 GYNE-GSASNTNKSYTIDFDRDVEFFVDVMDVDETGQALSAANVTKEFNSRHAGPEMDAYRFSKLATAAKTNSNS----- 130 (290) T ss_pred Cccc-CccccceeeEEeeccccceeeccccchhHHhhhhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhhhccCcc----- Confidence 7764 36888999999999999999999 889988888889999999999999999999998887665332111 Q ss_pred cceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhcccc-ccccccccccceeeee Q lcl|Aclame:pro 158 GFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREI-GNSQGDMNSGKGLYSI 236 (332) Q Consensus 158 ~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~-~~~~~~~~~g~~v~~i 236 (332) ...+.+++++|++|.++.++|+| ||.+|||++|+|++|.+|.+ +++|...-- ........+|. |+++ T Consensus 131 -------~~~t~t~~n~~~~i~~~~~~lde--vp~~~rvl~vtp~~~~lL~~--~~~f~r~~~~~~~~~~~i~~~-V~~i 198 (290) T protein:vir:78 131 -------VAEEITKDNVFTKLKAAIRKVKK--YGTQNLVMYVSPDVMAALEL--SDDFVRAINVQNIGPSSIETR-ITAI 198 (290) T ss_pred -------cccccCHHHHHHHHHHHHHHHHh--cCCCCeEEEECHHHHHHHhh--Chhhhccccccccccccccce-eeee Confidence 11224678899999999999997 89999999999999987764 467764321 22122334674 9999 Q ss_pred eceEEEeeC---cccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHH Q lcl|Aclame:pro 237 AGIRILKSN---NLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGK 313 (332) Q Consensus 237 ~G~~V~~sn---~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~ 313 (332) .||+|++.+ ++-. + .....|-.. ...+.+.--++.|+.|+....-.+ .+.+|...-...--++++..+ T Consensus 199 dG~~ii~vps~~r~~t----~--~~f~~G~~~--~~~ak~in~ii~~~~a~i~~~K~~-~~~~~~P~~~~~~d~~~~~~r 269 (290) T protein:vir:78 199 DGTRIVEVEAEDRFYD----T--FDFTDGYKP--AAGAKKLNFLLVNKGSVVGGAKHA-SIYLHAPGSVGQGDGWLYQYR 269 (290) T ss_pred cCcEEEEecccchhhh----h--hhhcccccc--cCCccceeEEEEcCCceeeeeeee-EEEeeCCCCCcCcceeeeeee Confidence 999999855 2211 0 011111100 112233445788999875554444 566665333222112345444 Q ss_pred HHhCCceechhheeeeecC Q lcl|Aclame:pro 314 LAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 314 ~~~G~~vlrpe~~v~i~~A 332 (332) .-+.+=|+.-...+.+..+ T Consensus 270 ~y~d~~v~~nk~~~i~~~~ 288 (290) T protein:vir:78 270 VYHDIFVLDQQKDGVIAST 288 (290) T ss_pred eeeeeeeeccccCeeEEEe Confidence 4445555555555554444 No 48 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=99.96 E-value=3.2e-33 Score=198.73 Aligned_cols=262 Identities=15% Similarity=0.128 Sum_probs=206.5 Q ss_pred CCCcccccccccccccccccccCchhhHHH-HHHhHHHHHHHHHhhhhccccccc-cc--cccceEEEeccc-ceeeeee Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATAL-KLFSGEVFTAFNNASIFKGLVRSY-DL--RGGKSKQFMFTG-KLSAGYH 75 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~-e~f~g~V~~~f~~~s~~~~~v~~r-~~--~~G~tv~i~~iG-~~t~~~~ 75 (332) |++- -|+.+ ++++ |+|+.+|.+.+.+.+++.+++... ++ ..|++++||+.+ ...+.++ T Consensus 1 MA~~--~T~~~---------------~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v 63 (272) T protein:vir:30 1 MAVG--TTKMA---------------QMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDV 63 (272) T ss_pred CCCc--cccch---------------heechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccc Confidence 8872 23321 1444 999999999999999999888754 33 358999999986 4678889 Q ss_pred cCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 76 TPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGE 155 (332) Q Consensus 76 ~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~ 155 (332) ..|..++. .+++.+++++++.+. ...+.|+|.+..++..|+++++.+++++++++++|+.++..+.++... T Consensus 64 ~eg~~i~~-~~~~~~~~~~~~~~~-~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~------- 134 (272) T protein:vir:30 64 AEGEAIPM-TQLGFKKTTMTIKKA-GKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQT------- 134 (272) T ss_pred cCCCcccc-cccccceEEEEeeee-eeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc------- Confidence 99998875 578999999999985 677999999999999999999999999999999999998766432211 Q ss_pred cccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeee Q lcl|Aclame:pro 156 PGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYS 235 (332) Q Consensus 156 ~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~ 235 (332) .+ +...++.|.+|..+|++.+ ...|+++|+|++|..|++....++++... ...+.+++|. +++ T Consensus 135 -------~~------~~~t~d~i~da~~~l~~~~--~~~~~~vv~p~~~~~L~k~~~~~~~~~~~-~~~~~~~~g~-ig~ 197 (272) T protein:vir:30 135 -------VE------ATATVDGVSKALDIFNDED--DAETVIVMNPADASTLRLDAAKEWLGATE-VGANRVVSGV-YGE 197 (272) T ss_pred -------cc------cccCHHHHHHHHHHHhccC--CCccEEEEcHHHHHHHHHhcccccccccc-cccccccccc-chh Confidence 00 1112788999999998775 45689999999999998654345554322 2234567775 899 Q ss_pred eeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHH Q lcl|Aclame:pro 236 IAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLA 315 (332) Q Consensus 236 i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~ 315 (332) ++|++|++|+++|.. .+++|++.+++++...++.+|..|+ +.++.|.|.++++ T Consensus 198 i~G~~Vi~s~~~p~~------------------------t~~~~~~~a~~~~~~~~~~ve~~r~---~~~~~~~i~~~~~ 250 (272) T protein:vir:30 198 VLGVQIVRSRKCPKG------------------------TAYMVRKGALRIMLKRNTMVETDRD---ITKAINQIVANKH 250 (272) T ss_pred hcCeeEEEcCCCCcc------------------------eEEEEcCCeEEEEecCCceeeeccc---cccceeEEEEEEE Confidence 999999999999831 1367888899999888888887654 4568899999999 Q ss_pred hCCceechhheeeeecC Q lcl|Aclame:pro 316 MGCGSLRTSVAGSFQAA 332 (332) Q Consensus 316 ~G~~vlrpe~~v~i~~A 332 (332) ||.++++|++++.+..+ T Consensus 251 ~~~~v~~~~~vv~~t~~ 267 (272) T protein:vir:30 251 YGVYLYKAEKAVKITLK 267 (272) T ss_pred EEEEEEcCCceEEEEec Confidence 99999999999988887 No 49 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=99.96 E-value=3.2e-33 Score=198.73 Aligned_cols=262 Identities=15% Similarity=0.128 Sum_probs=206.5 Q ss_pred CCCcccccccccccccccccccCchhhHHH-HHHhHHHHHHHHHhhhhccccccc-cc--cccceEEEeccc-ceeeeee Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATAL-KLFSGEVFTAFNNASIFKGLVRSY-DL--RGGKSKQFMFTG-KLSAGYH 75 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~-e~f~g~V~~~f~~~s~~~~~v~~r-~~--~~G~tv~i~~iG-~~t~~~~ 75 (332) |++- -|+.+ ++++ |+|+.+|.+.+.+.+++.+++... ++ ..|++++||+.+ ...+.++ T Consensus 1 MA~~--~T~~~---------------~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v 63 (272) T protein:vir:98 1 MAVG--TTKMA---------------QMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDV 63 (272) T ss_pred CCCc--cccch---------------heechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccc Confidence 8872 23321 1444 999999999999999999888754 33 358999999986 4678889 Q ss_pred cCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 76 TPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGE 155 (332) Q Consensus 76 ~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~ 155 (332) ..|..++. .+++.+++++++.+. ...+.|+|.+..++..|+++++.+++++++++++|+.++..+.++... T Consensus 64 ~eg~~i~~-~~~~~~~~~~~~~~~-~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~------- 134 (272) T protein:vir:98 64 AEGEAIPM-TQLGFKKTTMTIKKA-GKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQT------- 134 (272) T ss_pred cCCCcccc-cccccceEEEEeeee-eeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc------- Confidence 99998875 578999999999985 677999999999999999999999999999999999998766432211 Q ss_pred cccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeee Q lcl|Aclame:pro 156 PGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYS 235 (332) Q Consensus 156 ~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~ 235 (332) .+ +...++.|.+|..+|++.+ ...|+++|+|++|..|++....++++... ...+.+++|. +++ T Consensus 135 -------~~------~~~t~d~i~da~~~l~~~~--~~~~~~vv~p~~~~~L~k~~~~~~~~~~~-~~~~~~~~g~-ig~ 197 (272) T protein:vir:98 135 -------VE------ATATVDGVSKALDIFNDED--DAETVIVMNPADASTLRLDAAKEWLGATE-VGANRVVSGV-YGE 197 (272) T ss_pred -------cc------cccCHHHHHHHHHHHhccC--CCccEEEEcHHHHHHHHHhcccccccccc-cccccccccc-chh Confidence 00 1112788999999998775 45689999999999998654345554322 2234567775 899 Q ss_pred eeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHH Q lcl|Aclame:pro 236 IAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLA 315 (332) Q Consensus 236 i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~ 315 (332) ++|++|++|+++|.. .+++|++.+++++...++.+|..|+ +.++.|.|.++++ T Consensus 198 i~G~~Vi~s~~~p~~------------------------t~~~~~~~a~~~~~~~~~~ve~~r~---~~~~~~~i~~~~~ 250 (272) T protein:vir:98 198 VLGVQIVRSRKCPKG------------------------TAYMVRKGALRIMLKRNTMVETDRD---ITKAINQIVANKH 250 (272) T ss_pred hcCeeEEEcCCCCcc------------------------eEEEEcCCeEEEEecCCceeeeccc---cccceeEEEEEEE Confidence 999999999999831 1367888899999888888887654 4568899999999 Q ss_pred hCCceechhheeeeecC Q lcl|Aclame:pro 316 MGCGSLRTSVAGSFQAA 332 (332) Q Consensus 316 ~G~~vlrpe~~v~i~~A 332 (332) ||.++++|++++.+..+ T Consensus 251 ~~~~v~~~~~vv~~t~~ 267 (272) T protein:vir:98 251 YGVYLYKAEKAVKITLK 267 (272) T ss_pred EEEEEEcCCceEEEEec Confidence 99999999999988887 No 50 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=99.95 E-value=3.4e-30 Score=182.16 Aligned_cols=283 Identities=15% Similarity=0.091 Sum_probs=186.3 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhcccc-cc-----ccccccceEEEeccc-ceeee Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLV-RS-----YDLRGGKSKQFMFTG-KLSAG 73 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v-~~-----r~~~~G~tv~i~~iG-~~t~~ 73 (332) |+- -+.++|+.+++++|...++..... +. -.+.+|++|+||++. ++.++ T Consensus 1 Mai------------------------nya~~~~~~Ld~~~~~~~lts~~l~~~~~~~~v~~~ggktVkIp~is~tsGl~ 56 (346) T protein:vir:10 1 MTI------------------------NYAEKYQAAVQQAFYDGHLYSAELWNSPSNSIIKFDGAKHIKVPRLEITSGRK 56 (346) T ss_pred Ccc------------------------hhHHHHHHHHHHHHHhhhccchhhcccccccceEecCCCEEEEEEeeeecccc Confidence 322 145899999999998887664332 11 135689999999996 67899 Q ss_pred eecCCCCCCccCCCCCceEEEEEeeeeecchhhh--hHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|Aclame:pro 74 YHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVY--SLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASP 151 (332) Q Consensus 74 ~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Id--d~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~ 151 (332) ||++........+++.+..+++|++.+++.|.|| |+||.+........+.+.+....+.++|.+.+..++..+..... T Consensus 57 DY~R~~g~~~~g~v~~~~et~tl~qDR~~~F~vD~mDvDETn~~~~~anv~~ef~r~~vvPEiDayrfskLa~~a~~~~~ 136 (346) T protein:vir:10 57 DRQRRTITTPVANYSNDWDSYELKNERYWSTLVDPSDIDETNMVVSLANITKQFNLDSKMPEKDRYMFSHLYSGKEAAHD 136 (346) T ss_pred cccccCCcccccccccceeEEEeeccccceecccccchHHHHHHhHHHHHHHHHHHHhhcchhhHHHHHHHHHhhhhhcc Confidence 9998665543457889999999999999999999 66666555555555566677778889999988888755433211 Q ss_pred cccccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccc Q lcl|Aclame:pro 152 VTGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGK 231 (332) Q Consensus 152 ~~~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~ 231 (332) +. ..+.+.+++++|++|.++.++|+|+.||.+|||++|+|++|.+|.++ ++|...--....+ ..+|. T Consensus 137 ------~~----~~~~a~T~~ni~~~i~~~~~~lde~~vp~~~rvl~vTp~~~~lLk~s--~~f~k~~~v~~~~-~i~~~ 203 (346) T protein:vir:10 137 ------GG----ITTNTLDEKNILPAFDNMMLDFDEARIPSTNRILYVTPKTNAILKRA--EAMNRALTLKDPN-NIQRT 203 (346) T ss_pred ------cc----ccccccCHHHHHHHHHHHHHHHHHccCCCCCeEEEECHHHHHHHhhc--hhheecccccccc-cccee Confidence 00 01123467889999999999999999999999999999999877543 5665433333333 45775 Q ss_pred eeeeeeceEEEe--eCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHH- Q lcl|Aclame:pro 232 GLYSIAGIRILK--SNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGD- 308 (332) Q Consensus 232 ~v~~i~G~~V~~--sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d- 308 (332) |+++.||+|++ |++|+..- ....|... ...+-+.--++.|+.|.....-.+ .+.+|... ....++ T Consensus 204 -V~siDGv~Ii~VPs~r~~t~~------~f~~G~~~--~t~ak~INfiiv~~~A~ia~~K~~-~~~if~P~--~~~~g~~ 271 (346) T protein:vir:10 204 -VYSLDDVTIRVVPSDLMQTAY------DFSDGSKI--IDTAKQIEMFLIYNGVQIAPEKYS-FVGFDQPS--AATSGNY 271 (346) T ss_pred -eeeecCeEEEEcchhhcccch------hhccCccc--cCCccceeEEEECCceeeeeeeee-eeEeeCCC--CCcccce Confidence 99999999987 67787311 11111110 112223445788999875444333 45555442 233443 Q ss_pred HHHHHHHhCCceechhheee---eecC Q lcl|Aclame:pro 309 LIVGKLAMGCGSLRTSVAGS---FQAA 332 (332) Q Consensus 309 ~i~~~~~~G~~vlrpe~~v~---i~~A 332 (332) ++..+.-+.+=|+.....+. +..| T Consensus 272 l~~~R~Y~D~fv~~nk~~~Iyv~~~~a 298 (346) T protein:vir:10 272 LYYEQSYDDVLLLNTKTKGIQFVVSDK 298 (346) T ss_pred eeeeeeeeeeeeeccccceEEEeeecc Confidence 44444444455554443333 4444 No 51 >protein:vir:102335 Length: 312 # NCBI annotation: putative capsid protein # Family: family:all:701 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529560;genbank:gi:90592716;genbank:GeneID:3974467 Probab=99.94 E-value=4.4e-29 Score=176.04 Aligned_cols=294 Identities=12% Similarity=0.021 Sum_probs=193.4 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccc---cccccceEEEecccceeeeeecC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSY---DLRGGKSKQFMFTGKLSAGYHTP 77 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r---~~~~G~tv~i~~iG~~t~~~~~~ 77 (332) |+| + --+.++|.+++++.|...+++..+.... .+.+|++|+||++....++||+| T Consensus 1 Man--t--------------------l~ya~~~~~~LD~~~~~~~~s~~l~~~~~~v~~~ggktVkIp~i~~~gl~DY~R 58 (312) T protein:vir:10 1 MAN--T--------------------LAYGQVLQQGLDKQATQELLTGWMDSNAKQIKYEGGKEVKIGKLSTDGLGDYSR 58 (312) T ss_pred CCc--c--------------------hhHHHHHHHHHHHHHHhhhccccccCCCceEEEecCcEEEEEeeeccccccccc Confidence 554 1 1367999999999999999887775332 36789999999999999999999 Q ss_pred CCCCCc-cCCCCCceEEEEEeeeeecchhhh--hHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 78 GTPIVG-DAGIKANEKTLVMDDLLVSSQFVY--SLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTG 154 (332) Q Consensus 78 g~~~~~-~~~~~~~~~~l~ID~~~~~~~~Id--d~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~ 154 (332) ++.... ..+++.+..+++|++.+++.|.|| |+||.+....+...+.+.+......++|.+.+..|+..+....... T Consensus 59 ~~g~~~~~g~v~~~~et~tl~qDR~~~F~vD~mDvDETn~~~s~anv~~ef~r~~vvPEiDayrfskla~~a~~~~~~~- 137 (312) T protein:vir:10 59 GSANAYVGGDVKFEYETKTMTQDRGRKFTLDAMDVDETNFLVTATTVMGEFQRLKVIPEIDAYRLSRLATIAIGIKGDT- 137 (312) T ss_pred ccCCccccccccccceeEEeeecccceeeccccchhhHhhHHHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhcccccc- Confidence 765332 246889999999999999999999 8888877777777778889999999999999988876654332111 Q ss_pred ccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceee Q lcl|Aclame:pro 155 EPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLY 234 (332) Q Consensus 155 ~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~ 234 (332) ......+.++.++|++|.++.++|+|+.|| ++|+++|+|+.+. ||+. +..+.-. .........+++ |+ T Consensus 138 -------~~~~~~~~T~~ni~~~i~~~~~~lde~~vp-~~rvl~vTp~~~~-lLk~-~~~~~~~-~~~~~~~~i~~~-V~ 205 (312) T protein:vir:10 138 -------NVEYSYSVNSSTIINKIKTGIKIIRENGYN-GPLVCHLTYDSMF-AIEE-KVLEKLT-AVTFAQGGIQTQ-VP 205 (312) T ss_pred -------ccccccccCHHHHHHHHHHHHHHHHHccCC-CceEEEeChHHHH-HHhh-hhhceec-ccccccceeeee-ee Confidence 112223346888999999999999999999 6999999999874 4442 2221111 112223345665 99 Q ss_pred eeeceEEEee--Ccccccccccccccccccc-cccc--cccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHH Q lcl|Aclame:pro 235 SIAGIRILKS--NNLAGLYGQDLSSAAVTGE-NNDY--QVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDL 309 (332) Q Consensus 235 ~i~G~~V~~s--n~lp~~~g~~~~~~~~~g~-~~~y--~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~ 309 (332) ++.|++|++. ++|.. .-....+.++|. .+.| +..+-+.--++.|+.|.....-.+ .+.+|...-+..-.+++ T Consensus 206 ~iDgv~Ii~VPs~r~~t--~~~f~dG~t~~~~~gg~~~~~~ak~INfiiv~~~a~i~~~K~~-~~~if~P~~~~~~d~~~ 282 (312) T protein:vir:10 206 SIDGCALIKTPQNRMYS--SILLNDGTTSNQTAGGYLKGTKALDTNFIIAPVDVPLAITKQD-KMRIFDPETNQTANAWS 282 (312) T ss_pred eecccEEEEchhhhccc--eeeeccCcccccccCceeecCcccccceEEeCCceeeceeeee-eeeeeCCCCCCCcceee Confidence 9999999974 44532 111111111111 0112 222233446788999875444333 55665433333223345 Q ss_pred HHHHHHhCCceechhh---eeeeecC Q lcl|Aclame:pro 310 IVGKLAMGCGSLRTSV---AGSFQAA 332 (332) Q Consensus 310 i~~~~~~G~~vlrpe~---~v~i~~A 332 (332) +..+.-+.+=|+.... -+.+++| T Consensus 283 ~~~R~Y~D~fv~~nk~~~Iyv~~k~a 308 (312) T protein:vir:10 283 MDYRRYHDLWVTDNKANSVYANFKDA 308 (312) T ss_pred eeeeeeeeeeeeccccCeEEEEeecc Confidence 5444444444443322 2566777 No 52 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=99.91 E-value=3.4e-28 Score=171.21 Aligned_cols=229 Identities=14% Similarity=0.133 Sum_probs=182.2 Q ss_pred cccccccceEEEecccceeeeeecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 53 SYDLRGGKSKQFMFTGKLSAGYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALAT 132 (332) Q Consensus 53 ~r~~~~G~tv~i~~iG~~t~~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~ 132 (332) ..-+..|+|++||.- -..+.++..|..++. +.++.++.+.+|.+. ...|.|+|++..+...|++.+.++|++.+||+ T Consensus 1 ~~~~~~Gdtit~P~~-iGda~~v~eG~~i~~-~~l~~t~~~atIk~~-gk~~~itD~a~l~~~gDp~~ea~~Q~~~~iA~ 77 (231) T protein:vir:73 1 ENGINLANLCEYPND-IGDAADVAEGGEISL-DKIGTTTKSVTIKKA-AKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) T ss_pred CccccCCceEEeccc-ccchhhhcCCCcCCh-hhccccceeeeEeee-ccceeeeHHHHhhccCchHHHHHHHHHHHHHH Confidence 334667999999843 335588999999984 579999999999884 88999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhhhccccccccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcC Q lcl|Aclame:pro 133 HYDERIARVLAKASAEASPVTGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVD 212 (332) Q Consensus 133 ~~D~~i~~~~~~aa~~~~~~~~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d 212 (332) ++|..++..+.++..+. ... ..++.|.+|..+|.+.+ ...++++|.|..|+.|.+ + T Consensus 78 kvD~di~~~~~~a~l~~----------------~~~----~t~d~i~~A~~~fgde~--~~~~vivv~p~~~~~Lrk--~ 133 (231) T protein:vir:73 78 KVDDDLLKAAKTTSQTV----------------STK----ANVDGVQAALDIFNDED--AQAYVLIVNPKDAAKIRK--D 133 (231) T ss_pred hhhHHHHHhhccccccc----------------ccc----ccHHHHHHHHHHhcccc--ccceEEEEcchHHHhhhh--c Confidence 99999987665432111 111 12788999999998876 356899999999999975 4 Q ss_pred chhhccccccccccccccceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccc Q lcl|Aclame:pro 213 TNILNREIGNSQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAP 292 (332) Q Consensus 213 ~~~~~~d~~~~~~~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~ 292 (332) .++........++.+++|. +|.++|++|+.|+++|..++.. +-+++.+.|++....+++ T Consensus 134 ~~~~~~~~~~g~~i~~~G~-iG~i~G~~Vi~S~~~~~~~~~~--------------------~~~i~~~gAl~~~~k~~~ 192 (231) T protein:vir:73 134 ANAKNIGSEVGANALINGT-YADVLGAQIVRSKKLAEGSALM--------------------FKIVSNSPALKLVLKRGV 192 (231) T ss_pred cchhhhhhhhccceeeecc-cceEcceEEEEcCCCCCCceee--------------------eeEEeeccceeeeecccc Confidence 5555543334567788885 9999999999999999532211 113456788999999999 Q ss_pred eeeeeecccchhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 293 TIQTTSGDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 293 ~~e~~~~~~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) ++|..|+ +..+.|.|.+.+.||+++.+|+.++.|.-+ T Consensus 193 ~vEtdRd---~~~k~~~i~~~~~y~v~l~~~~~vv~~t~~ 229 (231) T protein:vir:73 193 QVETDRD---IVTKTTVITADEHYAAYLYDLTKVVNITFT 229 (231) T ss_pred eeecccc---ccccccEEEEeEEEEEEEEcCccEEEEEee Confidence 9887654 566899999999999999999999999888 No 53 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=99.90 E-value=3.9e-26 Score=159.88 Aligned_cols=269 Identities=15% Similarity=0.076 Sum_probs=172.4 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccc-----cccccceEEEecccc-eeeee Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSY-----DLRGGKSKQFMFTGK-LSAGY 74 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r-----~~~~G~tv~i~~iG~-~t~~~ 74 (332) |+. -+.++|...+++.|...+++..+.... .+.+|++|+||++.+ ..+.+ T Consensus 1 Mai------------------------n~~~k~~~~ld~~~~~~~~~~~l~~~~n~~~~~~~gak~VkIp~ist~~gl~d 56 (285) T protein:vir:79 1 MTV------------------------VLDSKDLARIDEEYKADSQVWSYLTGGNGVTQRFRGHNEVRINKLSGFVDATA 56 (285) T ss_pred Ccc------------------------hhhHHHHHHHHHHHHHhhhhhhhcccCCcceeEecCCCEEEEeeecccccccc Confidence 332 245899999999999998887776542 467899999999975 68999 Q ss_pred ecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|Aclame:pro 75 HTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQ-IGEALATHYDERIARVLAKASAEASPVT 153 (332) Q Consensus 75 ~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~-~~~aLa~~~D~~i~~~~~~aa~~~~~~~ 153 (332) |+|+.... ..+++.+..+++|++.+++.|.||.+|..++..=..+.++.+ .......++|.+-+..++..+.. T Consensus 57 Y~R~~g~~-~g~v~~~~et~tl~~DR~~~f~iD~mDvdEn~~~~~~ni~~ef~~~~vvPEiDayrfskla~~a~~----- 130 (285) T protein:vir:79 57 YKRGQDNA-RKTISVGKETVKLTHEDWFGYDLDQFDMDENGAYTVENVVREHNKMITIPHRDKVAVQKLFDSAAK----- 130 (285) T ss_pred cccccCcc-ccccceeeeEEEeeccccceecccccchhhhhhhhHHHHHHHHHhhhhcchhhHHHHHHHHhhccc----- Confidence 99977654 357889999999999999999999666655322223333333 45555778898888777643311 Q ss_pred cccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhcccccccc--ccccccc Q lcl|Aclame:pro 154 GEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQ--GDMNSGK 231 (332) Q Consensus 154 ~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~--~~~~~g~ 231 (332) . . ..+.+++++|++|.++.++|+|+.|| ++||++|+|++|.+|.++ +.|....-.+.+ ..-.++ T Consensus 131 ------~---~-~~~~T~~nv~~~i~~~~~~lde~~vp-~~rvl~vTp~~~~~Lk~s--~~~~r~~~~~~~~~~~~i~~- 196 (285) T protein:vir:79 131 ------K---A-TDSITKDNALDAYDTAEAYMFDNEVP-GGFVMFVSSAYYTALKQS--AAVTRTFSTDGTMVINGIDR- 196 (285) T ss_pred ------c---c-ccccCHHHHHHHHHHHHHHHHHcCCC-CceEEEEChHHHHHHHhh--hhhheecccccceeccceee- Confidence 0 1 11235778999999999999999999 699999999999877654 556543211111 111333 Q ss_pred eeeeeec-eEEEe--eCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHH Q lcl|Aclame:pro 232 GLYSIAG-IRILK--SNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGD 308 (332) Q Consensus 232 ~v~~i~G-~~V~~--sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d 308 (332) .|+++.| ++|++ |+++...+++ .+.-.++.|+.|+....-.+ .+..|..+-...--++ T Consensus 197 ~V~~lDg~v~ii~Vps~r~kt~~~~------------------k~Infiiv~~~a~i~~~K~~-~~~~f~P~~~~~~d~~ 257 (285) T protein:vir:79 197 RVAQLDGGVPIVRVSSDRLKGLGIT------------------NHVNFILTPLSAIAPIVKYD-SVSVIDPSTDRSGNRW 257 (285) T ss_pred eeccccceeEEEEcchhhccCcCcc------------------hhccEEEecCceeccceeee-eeEeECCCCCCCccee Confidence 3899998 89987 5666532111 12334788998865444333 4555543322111123 Q ss_pred HHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 309 LIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 309 ~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) ++..+.-+.+=|+.-...+....+ T Consensus 258 ~~~~R~Y~d~fv~~nk~~~Iy~~~ 281 (285) T protein:vir:79 258 TIKGLSYYDAIVLDNAKKGIYVAA 281 (285) T ss_pred eeeeeeeeeeeehhhccceeeeee Confidence 444443334444433333332222 No 54 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=99.89 E-value=1.1e-25 Score=157.47 Aligned_cols=259 Identities=15% Similarity=0.139 Sum_probs=196.3 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhcccccccc-c--cccceEEEecccc-eeeeeec Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYD-L--RGGKSKQFMFTGK-LSAGYHT 76 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~-~--~~G~tv~i~~iG~-~t~~~~~ 76 (332) |+. |+.. +-+.=|+|+.+|.+++.+.++|.++....+ + ..|++|+||.... ....++. T Consensus 1 Ma~----T~~~--------------d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~igdae~~~ 62 (270) T protein:vir:95 1 MTQ----TKKA--------------NLINPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYAYIGAAEDLQ 62 (270) T ss_pred CCc----eehh--------------hhcchHHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeecCCCcccccc Confidence 553 2210 113449999999999999999999887653 2 5699999998753 2567788 Q ss_pred CCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|Aclame:pro 77 PGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEP 156 (332) Q Consensus 77 ~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~ 156 (332) .|+.++. ++++.++.+.+|-+ ....|.|+|++...+..|++.+.+++++.++++++|+.++..+..+..+. T Consensus 63 eg~~i~~-~~lt~~~~~a~i~~-~gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~~a~~~~------- 133 (270) T protein:vir:95 63 EGVAMDT-TQMSMTTTKVTVKE-TGKAVEVTQTAIITNVNGTLQEASRQLAMSLADKVEIDYIAELNKSKQTA------- 133 (270) T ss_pred CCCccch-hhcccchheeeeeh-hhCcceecHHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHhccccccc------- Confidence 8999874 58999999999976 47899999999888888999999999999999999999987765432111 Q ss_pred ccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeee Q lcl|Aclame:pro 157 GGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSI 236 (332) Q Consensus 157 ~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i 236 (332) +...+ ++.|.+|..+|.+.. ....+++|.|..|+.|.+. ..+.. ....++.+++|. ++.+ T Consensus 134 ---------~~~~t----~~~~~dA~~~lgd~~--~~~~~i~vhs~~~~~Lrk~--~~~~~--~~~~~~~~~~G~-ig~~ 193 (270) T protein:vir:95 134 ---------TVSAD----ATGILDAIEVFNSEN--DEDYVLYVNPKDYNKLVKS--LFKVG--GNVQDRAISKGD-LVEI 193 (270) T ss_pred ---------ccccC----HHHHHHHHHHhcccc--CCCcEEEEcHHHHHHHHhh--hcccc--cccccchhcccc-ccee Confidence 11111 566778888885542 3346899999999998753 33332 233455677775 9999 Q ss_pred eceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHHh Q lcl|Aclame:pro 237 AGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAM 316 (332) Q Consensus 237 ~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~~ 316 (332) +|++|+.+++.|.. ..+.+|++-|++.+..+++++|..| ++..+.|.+.+.+.| T Consensus 194 ~G~~Viv~s~~~~~-----------------------~~~~l~~~gAi~~~~~~~~~vEtdR---d~~~~~d~i~~~~~y 247 (270) T protein:vir:95 194 VGVSDIVKSKRVSE-----------------------NTAFLQRYGAMEIVNKKKPEAYTDF---DILKRTHLLSTNYHY 247 (270) T ss_pred cceeEEEeCCCCCc-----------------------eeEEEEeccceeeeecCCceeeecc---chhhcccEEEeeeEE Confidence 99999887766531 1246889999999999998888655 456688999999999 Q ss_pred CCceechhheeeeecC Q lcl|Aclame:pro 317 GCGSLRTSVAGSFQAA 332 (332) Q Consensus 317 G~~vlrpe~~v~i~~A 332 (332) |.++.+|+.++.+.-+ T Consensus 248 ~v~~~~~skvv~~t~~ 263 (270) T protein:vir:95 248 SVNLKDETGVVKVTFK 263 (270) T ss_pred EEEEEccceEEEEEec Confidence 9999999988876555 No 55 >protein:vir:99523 Length: 311 # NCBI annotation: putative protein # Family: family:all:701 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958538;genbank:gi:41179320;genbank:GeneID:2717161 Probab=99.88 E-value=6.6e-25 Score=153.15 Aligned_cols=298 Identities=15% Similarity=0.064 Sum_probs=182.8 Q ss_pred CCCcccccccccccccccccccCchhhH-HHHHHhHHHHHHHHHhhhhccccccc-cc-cccceEEEecccceeeeeecC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYAT-ALKLFSGEVFTAFNNASIFKGLVRSY-DL-RGGKSKQFMFTGKLSAGYHTP 77 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al-~~e~f~g~V~~~f~~~s~~~~~v~~r-~~-~~G~tv~i~~iG~~t~~~~~~ 77 (332) |.. .+| -.|| +.++|.+++++.|...++...+.+.. .+ .+|++|+||.+....+.||+| T Consensus 1 ~~~-------------~an-----~mAlnya~~~~~~Ld~~~~~~~~t~~l~~~~~~~~~Gak~VkIp~i~~~gl~dY~R 62 (311) T protein:vir:99 1 MPT-------------DAE-----TRGFNYVTKDGNLLDQKITAGLFTAALGTPEVDLVNGGRSFTLKTISTSGLKDHTR 62 (311) T ss_pred CCC-------------cch-----hhHHHHHHHHHHHHHHHHHhhhcccceecCchheeecCCEEEEEeeeecccccccc Confidence 222 121 1344 78999999999999998877766543 24 579999999999999999999 Q ss_pred CCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhc--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 78 GTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQ--YSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGE 155 (332) Q Consensus 78 g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~--~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~ 155 (332) ++.-. ..+++.+..+++|++.+++.|.||-+|..+++ ...-....+.......-++|.+-+..++..+......... T Consensus 63 ~~g~~-~g~v~~~~et~tl~~DR~~~f~vD~mDvdETn~~~~~ani~~~f~r~~vvPEiDayrfskla~~a~~~~~~~~~ 141 (311) T protein:vir:99 63 GKGFN-SGTISDEKTIYTMGQDRDVEFYLDRQDVDETDNELAMANISNVFITEHVQPELDSYRFSKIATSFDNLDGTDTE 141 (311) T ss_pred ccCcc-ccceeeeeeEEEeeeccceeeecchhchhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhcccccccc Confidence 77544 46789999999999999999999944444443 4334444555666678889999988887554332211110 Q ss_pred cccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhcccc-cc-cccccccccee Q lcl|Aclame:pro 156 PGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREI-GN-SQGDMNSGKGL 233 (332) Q Consensus 156 ~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~-~~-~~~~~~~g~~v 233 (332) .+...-......+.+++++++.|..+..+|++ ||.+||+++|+|+.|.+|.++ +.|...-- .. +++ ..+++ | T Consensus 142 ~~~~~~~~~~~~~lt~~nvl~~l~~~~~~~~~--v~~~~rvl~vTp~~~~lLk~~--~~~~r~~~~~~~~~~-~i~~~-V 215 (311) T protein:vir:99 142 GTLLAKTHKTEETLDETNAYSQLKTGIGKVRK--YGTQNLVGYVSSEVMDALERS--KEFTRNITNQNVGTT-ALESR-I 215 (311) T ss_pred hhhhccccccccccCHHHHHHHHHHHHHHHHh--cCCCCeEEEEChHHHHHHhhc--hhhheeeeccccccc-ccccc-c Confidence 00000111223346788999999999999987 799999999999998766432 34432110 11 122 24554 9 Q ss_pred eeeeceEEEe---eCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHH Q lcl|Aclame:pro 234 YSIAGIRILK---SNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLI 310 (332) Q Consensus 234 ~~i~G~~V~~---sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i 310 (332) +++.|++|++ |+++... -....+..++ .++-+.--++.|+.|.....-.+ .+.+|...-+..-.++++ T Consensus 216 ~~lDgv~Ii~V~ps~r~~t~--~~ft~G~~~~------~~ak~INfiiv~~~a~i~~~K~~-~v~~f~P~~~~~gd~~l~ 286 (311) T protein:vir:99 216 TSIDGVQLIEVYESNRFMTK--YDFTDGAKPT------EDAKAINFLVVAKPAVISIVKEN-AVFLFAPGQHTDGDGYLY 286 (311) T ss_pred ceecCeEEEEecCchhhcch--hhhcCCcccc------CcccccceEEeCCCeeeeeeeee-eeeeeCCCCCCCcceeee Confidence 9999999984 5666532 1111111111 11223445788998875443332 455554322221112333 Q ss_pred HHHHHhCCceechh---heeeeecC Q lcl|Aclame:pro 311 VGKLAMGCGSLRTS---VAGSFQAA 332 (332) Q Consensus 311 ~~~~~~G~~vlrpe---~~v~i~~A 332 (332) ..+.-+.+=|+... .-+.+++| T Consensus 287 ~~R~Y~D~fv~~nk~~~Iyv~~k~A 311 (311) T protein:vir:99 287 QNRLYHDLFIKKHKRDGIFVSVKKA 311 (311) T ss_pred eeeeeeeeeeeccccCeEEEeeecC Confidence 33333333344222 23566777 No 56 >protein:vir:95451 Length: 313 # NCBI annotation: hypothetical protein ORF044 # Family: family:all:11728 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294637;genbank:gi:149408203;genbank:GeneID:5237018 Probab=99.79 E-value=2.9e-22 Score=138.70 Aligned_cols=297 Identities=18% Similarity=0.207 Sum_probs=188.1 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccc-cccccccceEEEecccceeeeeecCCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVR-SYDLRGGKSKQFMFTGKLSAGYHTPGT 79 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~-~r~~~~G~tv~i~~iG~~t~~~~~~g~ 79 (332) |- ..+ | .+--+--|+|+.+++-.+++..+-..+-+ +..+-.|++.|||.+|.++++.....+ T Consensus 1 ~~------------~TS-N----T~A~I~SE~~s~~I~~~LH~~LL~~~~~R~V~DF~~G~~L~I~tiGs~~~~~~~E~~ 63 (313) T protein:vir:95 1 MQ------------LTS-N----TRAFIESEQYSKFILLNLHDGLLPETFYRNVSDFGSGETLHIKTIGSVTLQEAEEDT 63 (313) T ss_pred Cc------------ccc-c----chheehhhhHHHHHHHHhhccccchhhhhhhccCCCCCEEEecccCceeeeccccCC Confidence 11 111 1 11124459999999999888755444444 446778999999999999999999999 Q ss_pred CCCccCCCCCceEEEEEeeeeecchhhh-hHHHHHhchh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 80 PIVGDAGIKANEKTLVMDDLLVSSQFVY-SLDEIFSQYS-TRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPG 157 (332) Q Consensus 80 ~~~~~~~~~~~~~~l~ID~~~~~~~~Id-d~D~~q~~~d-~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~ 157 (332) ++.++ ++++.+.++.|.+++-.+++|. |+.+.-..+| ++.+...|.++|+.+.+...++..-..--...+....-.| T Consensus 64 ~~~~~-~i~TGEIt~~i~~Y~G~A~~vt~~LR~D~~~I~~~~A~~~AE~~RAI~E~~~TD~L~~G~~~FA~~~~P~~vNG 142 (313) T protein:vir:95 64 PLIYN-PIETGEITFQITEYKGDAWYVTDDLREDGTDIDRLMAERAAESTRAIQETFETDFLKTGAEYFAANPGPHNVNG 142 (313) T ss_pred Ceeec-ccccceEEEEEEeecCChhhhhhhhhhcchhHHHHhhhcchhhHHHHHHHHhhHHHhhchhhhccCCCCccccc Confidence 88875 7999999999999998888884 5555555555 8999999999999999988887542211111111111122 Q ss_pred cceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhc-----ccccccccccccc-c Q lcl|Aclame:pro 158 GFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILN-----REIGNSQGDMNSG-K 231 (332) Q Consensus 158 ~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~-----~d~~~~~~~~~~g-~ 231 (332) ..++-+ +..++....+..+...+-.|++++||.+||+.||+|.....|-.. ..+.+ ..+....| +..| + T Consensus 143 ~PH~~V--~~~T~~~~~~~~~~~~~~~~~~a~~P~~G~v~IvDP~~~~~L~~l--~~It~~vt~~~k~I~ESG-~A~~~~ 217 (313) T protein:vir:95 143 FPHVIV--SAETNGVFALKHLIAMRLAFDKANVPAEGRVFIVDPVAEATLNGL--VTITHDVTDFGKMILESG-MARGQR 217 (313) T ss_pred ccceEE--eccCCceehhhHHHHhhhhhhhccCCccceEEEEcchhhhhhhhh--heeecccccccceeeecc-CCchhH Confidence 223332 234555555677889999999999999999999999987666321 11211 11233333 3444 4 Q ss_pred eeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeech---hhhhhhhhccceeeeeecccchhHHHH Q lcl|Aclame:pro 232 GLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHR---EAAGCIQSVAPTIQTTSGDFNVQYQGD 308 (332) Q Consensus 232 ~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~---~a~~~~~~~~~~~e~~~~~~~~~~~~d 308 (332) .|.+++|++++.||.|....-+. +..+|+ .|-++. -+++... --++.=+.++ +.+.++++++.+ | T Consensus 218 Fi~~~YG~Di~~SN~L~~AN~~D---~~tT~~--G~~~Nl---FM~i~D~~~~P~~~AWr~MP-~s~~~~~~~~~~---~ 285 (313) T protein:vir:95 218 FIMNLYGWDILTSNRLHVANYND---GTTTGN--GYVGNL---FMCILDDQTKPIMGAWRRMP-KSEGERNKDRAR---D 285 (313) T ss_pred HHHHHhhhhhhhhhhhhhccccc---cccccC--ceeeee---eeeeecccccceeeeecccc-cccccccccccc---c Confidence 58899999999999997533221 111111 121111 0111100 0111112222 556666655433 2 Q ss_pred HHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 309 LIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 309 ~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) .-.-..+||.++.|-|.+|.+.+- T Consensus 286 ~~~~~~R~G~Gi~R~~~L~~~~~~ 309 (313) T protein:vir:95 286 EHVVRCRYGFGIQRLDTLGLLATS 309 (313) T ss_pred cceeeeeecccceeecceeEEEec Confidence 333456899999999999988775 No 57 >protein:vir:78090 Length: 302 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468790;genbank:gi:157325371;genbank:GeneID:5601852 Probab=99.78 E-value=3.6e-21 Score=132.65 Aligned_cols=280 Identities=15% Similarity=0.110 Sum_probs=177.8 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccc---cccccceEEEeccc-----ceee Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSY---DLRGGKSKQFMFTG-----KLSA 72 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r---~~~~G~tv~i~~iG-----~~t~ 72 (332) |+| + --+.++|.+++++.|...+++..|.... .+.+|++|+||.|. +..+ T Consensus 1 Man--t--------------------l~ya~~~~~~Ld~~~~~~~~t~~l~~~~~~v~~~Gak~vkIp~is~~~~~TsGl 58 (302) T protein:vir:78 1 MAN--S--------------------LALAQIYQDNIDKAIAVNSKSAFLEANPNNVQYNGGNTIKIADISFGSGTTGDL 58 (302) T ss_pred CCc--h--------------------hHHHHHHHHHHHHHHHhhhceeecccCCceEEEecCcEEEEEEEEeeccccccc Confidence 554 0 1377999999999999999888775432 47889999999994 6788 Q ss_pred eeecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhc--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc Q lcl|Aclame:pro 73 GYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQ--YSTRAEVSKQIGEALATHYDERIARVLAKASAEAS 150 (332) Q Consensus 73 ~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~--~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~ 150 (332) ++|+|++.-. ...++.+..++++++.+++.|.||-+|..+++ ..+-....+........++|.+-+..|+..+.... T Consensus 59 ~dy~R~~g~~-~g~v~~~~et~tlt~DR~~~f~vD~mDvdETn~~~~~ani~~ef~r~~vvPEiDayrfskla~~a~~~~ 137 (302) T protein:vir:78 59 KAYNRSTGFT-QGSVTLAWSDYTLDYDLAQSFQIDAMDVDETKNLATVGNVLSEYQRTKIVPAIDKYRFTKLANDGTGVG 137 (302) T ss_pred cccccccCcc-ccceeeeeeeEEeeeccceeeeccccchhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHHhhhccC Confidence 9999977544 34688899999999999999999955555554 33344445557777888999998888765443221 Q ss_pred ccccccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchh---hccccccccccc Q lcl|Aclame:pro 151 PVTGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNI---LNREIGNSQGDM 227 (332) Q Consensus 151 ~~~~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~---~~~d~~~~~~~~ 227 (332) .. ........++++++++|..+.++|+|+ ++|+++|+|+.+.+|.++ +.| ++.... ..+ . T Consensus 138 ~~---------~~~~~~~~t~~nvl~~i~~~~~~~~e~----~~~vl~vtp~~~~~Lk~a--~~~~~~~~~~~~-~~~-~ 200 (302) T protein:vir:78 138 GV---------IDLSKPDASAQALMGDIATAMELVDDS----NQLILVTSPTTLAGLLNT--ALIRESKNTQVL-RRG-E 200 (302) T ss_pred cc---------ccccccchhHHHHHHHHHHHHHHhhcc----CCeEEEEChHHHHHHhcc--hhhccceecccc-ccc-c Confidence 11 001122346788999999999999996 599999999998877543 223 222111 112 2 Q ss_pred cccceeeeeeceEEEe--eCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhH Q lcl|Aclame:pro 228 NSGKGLYSIAGIRILK--SNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQY 305 (332) Q Consensus 228 ~~g~~v~~i~G~~V~~--sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~ 305 (332) .+++ |+++.|++|++ |++|...- . ...|-.. +..+-+.-.++.|+.|.....-.+ .+.+|.. .... T Consensus 201 i~~~-V~~lDgv~Ii~VPs~r~~t~~--~----f~~G~~~--~~~ak~INfiiv~~~a~ia~~K~~-~~~if~P--~~~~ 268 (302) T protein:vir:78 201 VDTK-ITFIQDVEVLQVPSEYLYDKV--A----PKVGVPD--YTGAKKIPYMIFKRDAPTGIVKTD-KVRVFEP--DTNQ 268 (302) T ss_pred ccce-eeeecccEEEEchhhhcccce--e----ccCCccc--cCCccceeEEEECCCeeeeeeeee-eeEeeCC--CCCC Confidence 3554 99999999987 44554311 0 1111110 122233446788999875444333 4566533 2344 Q ss_pred HHH--HHHHHHHhCCceechhhe---eeeecC Q lcl|Aclame:pro 306 QGD--LIVGKLAMGCGSLRTSVA---GSFQAA 332 (332) Q Consensus 306 ~~d--~i~~~~~~G~~vlrpe~~---v~i~~A 332 (332) .|| ++..+.-+.+=|+..... +.+.+| T Consensus 269 ~gd~~l~~~R~Y~D~fV~~nk~~gI~~~~~~~ 300 (302) T protein:vir:78 269 SADAYKVDLRLYHDLIVPKNQRPGIIKASFGT 300 (302) T ss_pred CcceeeeeeeeEeeeeeeccccCeEEEeeccc Confidence 565 444443334444433322 333444 No 58 >protein:vir:100939 Length: 430 # NCBI annotation: Gp5 # Family: family:all:1412 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006408;genbank:gi:46358700;genbank:GeneID:2777089 Probab=99.70 E-value=4.1e-19 Score=121.38 Aligned_cols=291 Identities=13% Similarity=0.105 Sum_probs=177.0 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhcccccc-cc-----ccccceEEEecccceeeee Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRS-YD-----LRGGKSKQFMFTGKLSAGY 74 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~-r~-----~~~G~tv~i~~iG~~t~~~ 74 (332) |+| +++ ..+++=..|+++.|....++..++.+ |. .+.|+|+.+|.--.....+ T Consensus 1 MAn--~l~-------------------~~~~ii~~eal~~l~n~~v~a~~~~~~r~~d~~~~r~Gdti~~p~~~~~~~~~ 59 (430) T protein:vir:10 1 MAL--NEG-------------------QIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQE 59 (430) T ss_pred Ccc--chh-------------------hHHHHHHHHHHHHHhhhhhhhhhhcccCCchhhhhcccceEEecccccccccc Confidence 777 111 34566778899999999998876442 32 2569999888765555544 Q ss_pred ecCCCCCCcc-CCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|Aclame:pro 75 HTPGTPIVGD-AGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVT 153 (332) Q Consensus 75 ~~~g~~~~~~-~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~ 153 (332) |..+.+. .++....+.++||+.+-..|.+.+-+ +...+....+.+.+.++||.++|..++.++..-....... T Consensus 60 ---G~~~t~~~~~i~e~~v~~~v~~~k~V~~~~~~ke--l~~~~~~~~~i~~Am~~LA~~Vd~dl~~~~~~~~~~v~~~- 133 (430) T protein:vir:10 60 ---GWDLTDKATGLLELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITS- 133 (430) T ss_pred ---CcccCCCCCccccceEEEEEeeeccceEEechhH--hcChhHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccccc- Confidence 4433322 23445789999999999999998744 5677777888899999999999999998764332211100 Q ss_pred cccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcC-CCEEEEChHHHHHHHhhcCchhhccccccccccccccce Q lcl|Aclame:pro 154 GEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQE-GRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKG 232 (332) Q Consensus 154 ~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~-gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~ 232 (332) ..+ ..+..+ ..+..+.++.+.|++..||.+ +|.++++|+.++.|.... .++.+.+-..+ ..+++|. T Consensus 134 --~~~-----t~~~~~---~~~~~~A~a~~~L~~~~vP~~~~R~~vldp~~~~~l~~~l-~~l~~~~~~~~-~A~r~g~- 200 (430) T protein:vir:10 134 --PDA-----IGTNTA---DAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDL-TKRDIFGRIPE-EAYRDGT- 200 (430) T ss_pred --ccc-----CCCcCC---cchhhHHHHHHHHHHhcCCCCCCcEEEeChHHHHHHHhhh-ccccccccchh-HHHhhcc- Confidence 000 011111 135668889999999999995 899999999999986432 34444333233 4578886 Q ss_pred eee-eeceE-EEeeCcccccccccccccccc---------------------------------------------ccc- Q lcl|Aclame:pro 233 LYS-IAGIR-ILKSNNLAGLYGQDLSSAAVT---------------------------------------------GEN- 264 (332) Q Consensus 233 v~~-i~G~~-V~~sn~lp~~~g~~~~~~~~~---------------------------------------------g~~- 264 (332) |++ +.||+ +|+++++|..++.......++ |.. T Consensus 201 i~~~~~Gfd~~~~~~~~~~~t~g~~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftiaGV~~ 280 (430) T protein:vir:10 201 IQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKF 280 (430) T ss_pred ccccchhhhhhhhcCCcccccCccCcCceeccccccccccceecccccccccccccceeeeecccceecccEEEecceee Confidence 775 89997 588999996322221111111 100 Q ss_pred ------------ccccccc-----------------------------------------------cceEEEeechhhhh Q lcl|Aclame:pro 265 ------------NDYQVDA-----------------------------------------------SALAGLIFHREAAG 285 (332) Q Consensus 265 ------------~~y~~~~-----------------------------------------------~~~~~l~~h~~a~~ 285 (332) ..|.+.. ..+.-|+||++|++ T Consensus 281 v~~~tkq~~~~l~~F~Vt~~~~atsv~I~paii~~~~~~~~~~~~~y~nVsaspa~~aavTvv~~a~~~~Nl~fhr~A~a 360 (430) T protein:vir:10 281 LGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIR 360 (430) T ss_pred eccccccccCCccEEEEEEecCCceeEEeccccccccccccccccccceeccccccCceeEEeccCCcccceeEcccceE Confidence 0011100 00224899999864 Q ss_pred hhhhc---------------------cceeeeeecccchhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 286 CIQSV---------------------APTIQTTSGDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 286 ~~~~~---------------------~~~~e~~~~~~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) .+-.. ++.+...+..+ .+---..++--..||.+.+|||.++.++.- T Consensus 361 La~~pL~~~~~~~~~~~~~~~~~~~~Glsirv~~~yd-~~~~~~~~r~DvLyG~~~v~Pe~a~v~l~g 427 (430) T protein:vir:10 361 IVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGD-ISTLSGLCRIALWYGVNATRPEAIGVGLPG 427 (430) T ss_pred EEEecccCCCCHHHhhhhheeccccceEEEEEEEecc-cccCceEEEEeeeccceecCcceEEEEcCC Confidence 43211 12222221111 000011222335789999999998777655 No 59 >protein:vir:9265 Length: 430 # NCBI annotation: 5 # Family: family:all:1412 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720329;genbank:gi:24371587;genbank:GeneID:955820 Probab=99.70 E-value=4.1e-19 Score=121.38 Aligned_cols=291 Identities=13% Similarity=0.105 Sum_probs=177.0 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhcccccc-cc-----ccccceEEEecccceeeee Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRS-YD-----LRGGKSKQFMFTGKLSAGY 74 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~-r~-----~~~G~tv~i~~iG~~t~~~ 74 (332) |+| +++ ..+++=..|+++.|....++..++.+ |. .+.|+|+.+|.--.....+ T Consensus 1 MAn--~l~-------------------~~~~ii~~eal~~l~n~~v~a~~~~~~r~~d~~~~r~Gdti~~p~~~~~~~~~ 59 (430) T protein:vir:92 1 MAL--NEG-------------------QIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQE 59 (430) T ss_pred Ccc--chh-------------------hHHHHHHHHHHHHHhhhhhhhhhhcccCCchhhhhcccceEEecccccccccc Confidence 777 111 34566778899999999998876442 32 2569999888765555544 Q ss_pred ecCCCCCCcc-CCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|Aclame:pro 75 HTPGTPIVGD-AGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVT 153 (332) Q Consensus 75 ~~~g~~~~~~-~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~ 153 (332) |..+.+. .++....+.++||+.+-..|.+.+-+ +...+....+.+.+.++||.++|..++.++..-....... T Consensus 60 ---G~~~t~~~~~i~e~~v~~~v~~~k~V~~~~~~ke--l~~~~~~~~~i~~Am~~LA~~Vd~dl~~~~~~~~~~v~~~- 133 (430) T protein:vir:92 60 ---GWDLTDKATGLLELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITS- 133 (430) T ss_pred ---CcccCCCCCccccceEEEEEeeeccceEEechhH--hcChhHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccccc- Confidence 4433322 23445789999999999999998744 5677777888899999999999999998764332211100 Q ss_pred cccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcC-CCEEEEChHHHHHHHhhcCchhhccccccccccccccce Q lcl|Aclame:pro 154 GEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQE-GRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKG 232 (332) Q Consensus 154 ~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~-gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~ 232 (332) ..+ ..+..+ ..+..+.++.+.|++..||.+ +|.++++|+.++.|.... .++.+.+-..+ ..+++|. T Consensus 134 --~~~-----t~~~~~---~~~~~~A~a~~~L~~~~vP~~~~R~~vldp~~~~~l~~~l-~~l~~~~~~~~-~A~r~g~- 200 (430) T protein:vir:92 134 --PDA-----IGTNTA---DAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDL-TKRDIFGRIPE-EAYRDGT- 200 (430) T ss_pred --ccc-----CCCcCC---cchhhHHHHHHHHHHhcCCCCCCcEEEeChHHHHHHHhhh-ccccccccchh-HHHhhcc- Confidence 000 011111 135668889999999999995 899999999999986432 34444333233 4578886 Q ss_pred eee-eeceE-EEeeCcccccccccccccccc---------------------------------------------ccc- Q lcl|Aclame:pro 233 LYS-IAGIR-ILKSNNLAGLYGQDLSSAAVT---------------------------------------------GEN- 264 (332) Q Consensus 233 v~~-i~G~~-V~~sn~lp~~~g~~~~~~~~~---------------------------------------------g~~- 264 (332) |++ +.||+ +|+++++|..++.......++ |.. T Consensus 201 i~~~~~Gfd~~~~~~~~~~~t~g~~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftiaGV~~ 280 (430) T protein:vir:92 201 IQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKF 280 (430) T ss_pred ccccchhhhhhhhcCCcccccCccCcCceeccccccccccceecccccccccccccceeeeecccceecccEEEecceee Confidence 775 89997 588999996322221111111 100 Q ss_pred ------------ccccccc-----------------------------------------------cceEEEeechhhhh Q lcl|Aclame:pro 265 ------------NDYQVDA-----------------------------------------------SALAGLIFHREAAG 285 (332) Q Consensus 265 ------------~~y~~~~-----------------------------------------------~~~~~l~~h~~a~~ 285 (332) ..|.+.. ..+.-|+||++|++ T Consensus 281 v~~~tkq~~~~l~~F~Vt~~~~atsv~I~paii~~~~~~~~~~~~~y~nVsaspa~~aavTvv~~a~~~~Nl~fhr~A~a 360 (430) T protein:vir:92 281 LGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIR 360 (430) T ss_pred eccccccccCCccEEEEEEecCCceeEEeccccccccccccccccccceeccccccCceeEEeccCCcccceeEcccceE Confidence 0011100 00224899999864 Q ss_pred hhhhc---------------------cceeeeeecccchhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 286 CIQSV---------------------APTIQTTSGDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 286 ~~~~~---------------------~~~~e~~~~~~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) .+-.. ++.+...+..+ .+---..++--..||.+.+|||.++.++.- T Consensus 361 La~~pL~~~~~~~~~~~~~~~~~~~~Glsirv~~~yd-~~~~~~~~r~DvLyG~~~v~Pe~a~v~l~g 427 (430) T protein:vir:92 361 IVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGD-ISTLSGLCRIALWYGVNATRPEAIGVGLPG 427 (430) T ss_pred EEEecccCCCCHHHhhhhheeccccceEEEEEEEecc-cccCceEEEEeeeccceecCcceEEEEcCC Confidence 43211 12222221111 000011222335789999999998777655 No 60 >protein:vir:2106 Length: 430 # NCBI annotation: coat protein # Family: family:all:1412 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:NP_059630;genbank:gi:9635538;genbank:GeneID:1262831 Probab=99.69 E-value=2.4e-19 Score=122.62 Aligned_cols=291 Identities=13% Similarity=0.094 Sum_probs=175.0 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccc-ccc-----ccccceEEEecccceeeee Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVR-SYD-----LRGGKSKQFMFTGKLSAGY 74 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~-~r~-----~~~G~tv~i~~iG~~t~~~ 74 (332) |++. +. -++++=-.|+++.|....++..++. .|. .+.|+|+.+|.--.....+ T Consensus 1 Ma~~--~~-------------------~~lti~~~eal~~~~n~lV~a~~~~~~r~~d~~~~r~Gdti~ip~p~~~~~~~ 59 (430) T protein:vir:21 1 MALN--EG-------------------QIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQE 59 (430) T ss_pred Cccc--cc-------------------hhhHHHHHHHHHHhhhhhhhhhhhhccCCchhhhhcccceEEeeccccccccc Confidence 7761 11 1333333899999999999988643 232 2669999988554433332 Q ss_pred ecCCCCCCc-cCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|Aclame:pro 75 HTPGTPIVG-DAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVT 153 (332) Q Consensus 75 ~~~g~~~~~-~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~ 153 (332) |.++.+ ..++....+.++||+.+-..|.+.+ +| +...+....+.+.+.++||..+|..++.++........ T Consensus 60 ---G~~~t~~~~~~~e~~v~~~~~~~~~V~~~~~~-kE-l~~~~~~er~l~pAm~~LA~~Vd~dl~~~~~~~~~~v~--- 131 (430) T protein:vir:21 60 ---GWDLTDKATGLLELNVAVNMGEPDNDFFQLRA-DD-LRDETAYRRRIQSAARKLANNVELKVANMAAEMGSLVI--- 131 (430) T ss_pred ---cccccCCCccceeeeEeEEEeeeccceEEeeh-hH-hcChhhHHHHHHHHHHHHHHHHHHHHHHHhhhhhhccc--- Confidence 222221 1245567889999999988888864 33 56777788999999999999999999988754322211 Q ss_pred cccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcC-CCEEEEChHHHHHHHhhcCchhhccccccccccccccce Q lcl|Aclame:pro 154 GEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQE-GRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKG 232 (332) Q Consensus 154 ~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~-gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~ 232 (332) +...+ ..+..++ .+..+.++.+.|++..||.+ +|.++++|+.+..|... ..++.+.+-..+ ..+++|. T Consensus 132 ~~~~~-----t~~~~~~---~~~~~A~a~~~L~~~~vP~~~~R~~~~~p~~~~~l~~~-l~~~~~~~~~~~-~A~r~g~- 200 (430) T protein:vir:21 132 TSPDA-----IGTNTAD---AWNFVADAEEIMFSRELNRDMGTSYFFNPQDYKKAGYD-LTKRDIFGRIPE-EAYRDGT- 200 (430) T ss_pred cccCC-----CCCCCCc---chhhHHHHHHHHHHhcCCCCCCcEEEeChHHHHHHhhh-hccccccccchh-HHHhhcc- Confidence 00000 0111112 25678889999999999995 79999999999888653 244544443333 4578886 Q ss_pred eee-eeceE-EEeeCcccccccccccccccccc--------------------------------------cccccc--- Q lcl|Aclame:pro 233 LYS-IAGIR-ILKSNNLAGLYGQDLSSAAVTGE--------------------------------------NNDYQV--- 269 (332) Q Consensus 233 v~~-i~G~~-V~~sn~lp~~~g~~~~~~~~~g~--------------------------------------~~~y~~--- 269 (332) |++ +.||+ +|+|+++|..++.......++|. ...+++ T Consensus 201 i~r~~~Gfd~~~~s~~~~~~t~gt~t~~tv~gA~~~~~~~~tv~~~g~~~~~d~~~~~it~s~tg~l~~GD~ftiaGV~~ 280 (430) T protein:vir:21 201 IQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGMKRGDKISFAGVKF 280 (430) T ss_pred cccccchhhhhhhcCCcccccCccCcCceeccccccccccceeccccccccccccceeeeeecccceecccEEEecceee Confidence 665 99997 58899999632221111110000 000011 Q ss_pred -----------------cc-----------------------------------------------cceEEEeechhhhh Q lcl|Aclame:pro 270 -----------------DA-----------------------------------------------SALAGLIFHREAAG 285 (332) Q Consensus 270 -----------------~~-----------------------------------------------~~~~~l~~h~~a~~ 285 (332) .. ..+.-|+||++|++ T Consensus 281 v~~itk~~~~~l~qf~V~a~~~~ttv~I~Pai~~~~~~~~~~~~~~y~nVsaspa~~aavT~v~~a~~~~Nl~fh~~A~~ 360 (430) T protein:vir:21 281 LGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIR 360 (430) T ss_pred eccccccccCCcceEEEEEecCCceeEEeecccccccccccccccccceeccccccCceeEEeccCCcccceeEccceeE Confidence 00 00123899999864 Q ss_pred hhhhc---------------------cceeeeeecccchhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 286 CIQSV---------------------APTIQTTSGDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 286 ~~~~~---------------------~~~~e~~~~~~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) .+-.. ++++..++..+ .+--.+.++--..||.+.+|||.++.++.- T Consensus 361 La~~pl~~p~~~~~~~~~~~~~~~~~Glsirv~~~yd-~~~~~~~~r~DilyG~~~l~Pe~a~v~l~g 427 (430) T protein:vir:21 361 IVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGD-ISTLSGLCRIALWYGVNATRPEAIGVGLPG 427 (430) T ss_pred EEEecccCCCChhHhhheeeeeccccceEEEEEEccc-cccCceEEEEEeecCccccCcceEEEEcCC Confidence 43211 23334333222 111112233345799999999998777655 No 61 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=99.51 E-value=6.8e-16 Score=103.73 Aligned_cols=306 Identities=12% Similarity=0.051 Sum_probs=170.1 Q ss_pred CCCccccccc--ccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc---------cc Q lcl|Aclame:pro 1 MTTLSNFSLP--NQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT---------GK 69 (332) Q Consensus 1 m~~~~~~~r~--~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i---------G~ 69 (332) |++++-+.-. +..+.|+..+... +|.-+.|..++.+..++.+.++.+.+..+.. +.+++||+. |. T Consensus 1 ~~~~~e~~~~~~~~~~~~~~~~~~~---~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~-~~~~~ip~~~~~~~a~~v~~ 76 (338) T protein:vir:78 1 MATLNELAPNTAGSNHQGRLAHVPS---DLLPKEIVGPIFDKAQESSLVLRLGENIPIS-YGETIIPTTVKRPEVGQVGV 76 (338) T ss_pred CcchHHhhhhhcccccccceecccc---cccchHHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEecCccceeecc Confidence 7775433211 1122333322222 4788999999999999999999999877665 557778765 22 Q ss_pred eeeeeecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|Aclame:pro 70 LSAGYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEA 149 (332) Q Consensus 70 ~t~~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~ 149 (332) .++.....|..+.. .+++..++++..-+. +.-..|.+-=-.++.+|+.+.+.++.++++++..|+.++.-- ..... T Consensus 77 ~~~~~~~Eg~~~~~-~~~~f~~v~l~~~k~-~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~l~G~--g~~~~ 152 (338) T protein:vir:78 77 GTSNEQREGGTKPL-SGTAWDTRSVAPIKL-ATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAVFHGK--SPLTG 152 (338) T ss_pred cccccccccccccc-cccceeEEEEEEEEE-EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhccc--CCCcc Confidence 33444455655543 345666676666553 333456552223456899999999999999999999887321 11010 Q ss_pred ccccccccccee---ccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhcccccccccc Q lcl|Aclame:pro 150 SPVTGEPGGFHV---NIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGD 226 (332) Q Consensus 150 ~~~~~~~~~~~i---~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~ 226 (332) ....+....... .......++....++.|.++...+.. +.......++++|..|..|++...-+-.+..+.- ... T Consensus 153 ~~~~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~m~~~~~~~L~~~~~l~d~~g~~l~-~~~ 230 (338) T protein:vir:78 153 SALQGIDTNNVIVNTTNVDYLQTGTTPLLDRFLDGYDLVSA-NTDVDFNGWAADPRYRARLLRSQAYRDANGNVDP-TRI 230 (338) T ss_pred ccccccccccccccccccccccccchhhHHHHHHHHHHhhh-hccccceEEEEchHHHHHHHHHhhhccCCCceee-ccc Confidence 001110000000 00111223345568888888766643 3344445678899999988653111111111111 122 Q ss_pred ccccceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeeccc----- Q lcl|Aclame:pro 227 MNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDF----- 301 (332) Q Consensus 227 ~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~----- 301 (332) ...|. -+.++|++|+.++++|...+.. .......|-++|+... +.. ..+++++..++-. T Consensus 231 ~~~~~-~~~l~G~PV~~~~~ip~~~~~~-----~~~~~~~~~gdfs~~~--~~~--------~~~~~i~~~~~~~~~~~~ 294 (338) T protein:vir:78 231 NLAAS-AGDLLGLPVQFGKAVGGDLGAA-----TDSKVRVVGGDFSQLK--YGF--------ADEIRVKMSDTATLTDNT 294 (338) T ss_pred ccCCC-CceeeeeeEEEccccCcccccc-----CCcccEEEEEecceEE--EEe--------ecccEEEEeecccccccc Confidence 23333 5789999999999999532211 1111234455665432 221 2233444433210 Q ss_pred ---c--h-hHHHHH--HHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 302 ---N--V-QYQGDL--IVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 302 ---~--~-~~~~d~--i~~~~~~G~~vlrpe~~v~i~~A 332 (332) . . .|+-|+ ++..+++|.+++||++.+.|..| T Consensus 295 ~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~l~~~ 333 (338) T protein:vir:78 295 SPTPQTVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVDD 333 (338) T ss_pred cccccchhhhhcCcEEEEEEEEeccEeecccceEEEecc Confidence 0 0 011222 34566789999999999999999 No 62 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=99.45 E-value=6.3e-15 Score=98.43 Aligned_cols=295 Identities=12% Similarity=-0.025 Sum_probs=169.0 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc-cceeeeeecCCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-GKLSAGYHTPGT 79 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i-G~~t~~~~~~g~ 79 (332) |+-. ..|+.+....++. + .+..+.+..++.+..++.++++++++..+..+ ..++||+. +.+.+..+..|. T Consensus 1 m~~~--~~~a~~~~~t~~~--g----~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~-~~~~~p~~~~~~~a~~v~Eg~ 71 (330) T protein:vir:77 1 MAGS--TVPSTQVALTGDF--S----AFLTPEQSQDYFAEIEKTSIVQRIARKVPMGP-TGISIPHWTGAVSASWTGEAE 71 (330) T ss_pred Cccc--ccchhhccccCCC--c----ceechhHHHHHHHHHHhccchhhhcceeeccC-CceEEEEEcCCcceeEecCCC Confidence 7662 3444332222221 1 14557788899999999999999988766554 45788876 667777778888 Q ss_pred CCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc--- Q lcl|Aclame:pro 80 PIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEP--- 156 (332) Q Consensus 80 ~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~--- 156 (332) .+... .++..++++..-+. +.-..|.+-=-.++.+++.+.+.++.++++++.+|+.++. +.....+..+.. T Consensus 72 ~~~~~-~~~f~~i~~~~~k~-~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~----G~g~~~~~~g~~~~~ 145 (330) T protein:vir:77 72 RKPIT-KGSFGKQELEPVKI-TTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIH----GIDKPSAFKGYLAET 145 (330) T ss_pred ccccc-cceeeEEEEeEEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhc----ccCCCCccccccccc Confidence 77654 56777777776553 3334565522223568899999999999999999998872 111111111100 Q ss_pred -ccceec--cccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCch--hhccccccccccccccc Q lcl|Aclame:pro 157 -GGFHVN--IGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTN--ILNREIGNSQGDMNSGK 231 (332) Q Consensus 157 -~~~~i~--~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~--~~~~d~~~~~~~~~~g~ 231 (332) ...... ............++.|.++...+..++.+.. .++++|..|..|...+|.. .+-.+ ....+..... T Consensus 146 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~--~~vmn~~~~~~l~~lkd~~G~~l~~~-~~~~~~~~~~- 221 (330) T protein:vir:77 146 TKVVSLADTNLTTASGPQGNAYLAVNNALSLLVNSGKKWT--GTLLDNVTEPILNTAVDGNGRPLFVE-STYTEQVGAI- 221 (330) T ss_pred cccceeecccccccccccchhHHHHHHHHHhhhhcCCCcc--EEEEcHHHHHHHHHHhccCCceeecC-cccccccccc- Confidence 000000 0111122234457888888888888776443 4679999999887654432 11000 0001111111 Q ss_pred eeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccch-------- Q lcl|Aclame:pro 232 GLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNV-------- 303 (332) Q Consensus 232 ~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~-------- 303 (332) .-++++|++|+.++++|..+.. +...-|-+++++.. .+...++++++.++-+-. T Consensus 222 ~~~~l~G~PV~~~~~~p~~~~~--------~~~~~~~gd~s~~~----------i~~~~~~~i~~~~e~~~~~~~~~~~~ 283 (330) T protein:vir:77 222 REGRILGRPTYVADNVVNGTVG--------NRVVGVMGDFSQVI----------WGQIGGLSFDVTDQATLDFGEEQGGV 283 (330) T ss_pred CCceecceeeEEeccccCCCCC--------CccEEEEEecceEE----------EEEecCcEEEEeecceeeeccccccc Confidence 1357899999999999952211 11112334444332 122234444443321100 Q ss_pred -------hHHHH--HHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 304 -------QYQGD--LIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 304 -------~~~~d--~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) .|+-| .+++..++|.+++||++.+.|..+ T Consensus 284 ~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~ 321 (330) T protein:vir:77 284 WVPKLISLWQHNMVAVRCEAEFAFMVNDKDAFVKLTDQ 321 (330) T ss_pred ccccccchhhcCcEEEEEEEEeccEEecccceEEEEec Confidence 01111 245667889999999998888776 No 63 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=99.43 E-value=8.1e-15 Score=97.85 Aligned_cols=285 Identities=8% Similarity=0.014 Sum_probs=170.7 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc-cceeeeeecCCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-GKLSAGYHTPGT 79 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i-G~~t~~~~~~g~ 79 (332) |+- ..+.+.+.. .. +++. ++.-+.+..++.+..++.+.++++++..+.. +.+++||+. +...+.-+..+. T Consensus 1 ma~-~~~~~~~~~---~t-~~gg---~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~E~~ 71 (304) T protein:vir:10 1 MAT-PTYTPGNVI---LS-DFKN---GVIPAEQGTLIMKDIMANSAIMKLAKNEPMT-AQKKKFTYLAKGVGAYWVSETE 71 (304) T ss_pred Ccc-ccccccccc---cc-CCCc---eecchhHHHHHHHHHHhccchhhhcceeecc-CCceEEEEEeCCcceEEeecCc Confidence 765 233443321 11 1111 3677899999999999999999998776654 456788877 566777777777 Q ss_pred CCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc Q lcl|Aclame:pro 80 PIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGGF 159 (332) Q Consensus 80 ~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~~~ 159 (332) .+... +++.+++++...+.- .-..|.+-=..++.+|+.+.+.++.++++++..|+.++.-- .+..+....+.+. T Consensus 72 ~~~~~-~~~~~~i~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~----g~~~~~~~~~~~~ 145 (304) T protein:vir:10 72 RIQTS-KPEYAQAEMEAKKIG-VIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGT----KSPYNTSTSGKPL 145 (304) T ss_pred ccccc-cceeeEEEEEEEEEE-EeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheecc----CCCcccccccccc Confidence 76643 577777777777643 33456552223356889999999999999999999886321 1111111111110 Q ss_pred e--eccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeeee Q lcl|Aclame:pro 160 H--VNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIA 237 (332) Q Consensus 160 ~--i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i~ 237 (332) . .........+....|+.|.++..++..++.... .++++|..|..|.+.+|.. + ..+.... .++++ T Consensus 146 ~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~--~~v~~~~~~~~L~~lkd~~-------G--~~l~~~~-~~~l~ 213 (304) T protein:vir:10 146 VEGAEEKGNVVTDTNNLYVDLSALMATIEDEELDPN--GVLTTRSFRSKMRNALDAN-------D--RPLFDAN-GNEIM 213 (304) T ss_pred cccccccccccccccchHHHHHHHHHHhhhccCCcC--EEEEcHHHHHHHHHhhccC-------C--cEeecCC-Ccccc Confidence 0 011111122344568889999988888776544 5678999999987543321 1 1122222 56899 Q ss_pred ceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeec--------ccch----h- Q lcl|Aclame:pro 238 GIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSG--------DFNV----Q- 304 (332) Q Consensus 238 G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~--------~~~~----~- 304 (332) |.+|+.++++|...+.. .-+-++|++.. ++-+ .+++++..++ ++.. . T Consensus 214 G~PV~~~~~~~~~~~~~----------~~~~gd~~~~~--~~~~--------~~~~i~~~~e~~~~~~~~~~~~g~~~~~ 273 (304) T protein:vir:10 214 GLPLSYTGADVYDKKKS----------LALMGDWDYAR--YGIL--------QGIEYAISEDATLTTLQASDASGQPVSL 273 (304) T ss_pred ceeeEEecccccCCCCc----------EEEEEehhhEE--EEEe--------cceEEEEeecceeeeecccccCccchhh Confidence 99999999999543222 12334555432 2211 2222322221 1110 0 Q ss_pred HHHH--HHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 305 YQGD--LIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 305 ~~~d--~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) |.-| .+++.+++|..+++|++.+.|+.| T Consensus 274 f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a 303 (304) T protein:vir:10 274 FERDMFALRATMHIAYMNVKPEAFATLKPT 303 (304) T ss_pred hhcCcEEEEEEEEeccEeecccceEEEEec Confidence 1111 134556899999999999999999 No 64 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=99.43 E-value=8.1e-15 Score=97.85 Aligned_cols=285 Identities=8% Similarity=0.014 Sum_probs=170.7 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc-cceeeeeecCCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-GKLSAGYHTPGT 79 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i-G~~t~~~~~~g~ 79 (332) |+- ..+.+.+.. .. +++. ++.-+.+..++.+..++.+.++++++..+.. +.+++||+. +...+.-+..+. T Consensus 1 ma~-~~~~~~~~~---~t-~~gg---~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~E~~ 71 (304) T protein:vir:94 1 MAT-PTYTPGNVI---LS-DFKN---GVIPAEQGTLIMKDIMANSAIMKLAKNEPMT-AQKKKFTYLAKGVGAYWVSETE 71 (304) T ss_pred Ccc-ccccccccc---cc-CCCc---eecchhHHHHHHHHHHhccchhhhcceeecc-CCceEEEEEeCCcceEEeecCc Confidence 765 233443321 11 1111 3677899999999999999999998776654 456788877 566777777777 Q ss_pred CCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc Q lcl|Aclame:pro 80 PIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGGF 159 (332) Q Consensus 80 ~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~~~ 159 (332) .+... +++.+++++...+.- .-..|.+-=..++.+|+.+.+.++.++++++..|+.++.-- .+..+....+.+. T Consensus 72 ~~~~~-~~~~~~i~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~----g~~~~~~~~~~~~ 145 (304) T protein:vir:94 72 RIQTS-KPEYAQAEMEAKKIG-VIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGT----KSPYNTSTSGKPL 145 (304) T ss_pred ccccc-cceeeEEEEEEEEEE-EeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheecc----CCCcccccccccc Confidence 76643 577777777777643 33456552223356889999999999999999999886321 1111111111110 Q ss_pred e--eccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeeee Q lcl|Aclame:pro 160 H--VNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIA 237 (332) Q Consensus 160 ~--i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i~ 237 (332) . .........+....|+.|.++..++..++.... .++++|..|..|.+.+|.. + ..+.... .++++ T Consensus 146 ~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~--~~v~~~~~~~~L~~lkd~~-------G--~~l~~~~-~~~l~ 213 (304) T protein:vir:94 146 VEGAEEKGNVVTDTNNLYVDLSALMATIEDEELDPN--GVLTTRSFRSKMRNALDAN-------D--RPLFDAN-GNEIM 213 (304) T ss_pred cccccccccccccccchHHHHHHHHHHhhhccCCcC--EEEEcHHHHHHHHHhhccC-------C--cEeecCC-Ccccc Confidence 0 011111122344568889999988888776544 5678999999987543321 1 1122222 56899 Q ss_pred ceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeec--------ccch----h- Q lcl|Aclame:pro 238 GIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSG--------DFNV----Q- 304 (332) Q Consensus 238 G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~--------~~~~----~- 304 (332) |.+|+.++++|...+.. .-+-++|++.. ++-+ .+++++..++ ++.. . T Consensus 214 G~PV~~~~~~~~~~~~~----------~~~~gd~~~~~--~~~~--------~~~~i~~~~e~~~~~~~~~~~~g~~~~~ 273 (304) T protein:vir:94 214 GLPLSYTGADVYDKKKS----------LALMGDWDYAR--YGIL--------QGIEYAISEDATLTTLQASDASGQPVSL 273 (304) T ss_pred ceeeEEecccccCCCCc----------EEEEEehhhEE--EEEe--------cceEEEEeecceeeeecccccCccchhh Confidence 99999999999543222 12334555432 2211 2222322221 1110 0 Q ss_pred HHHH--HHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 305 YQGD--LIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 305 ~~~d--~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) |.-| .+++.+++|..+++|++.+.|+.| T Consensus 274 f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a 303 (304) T protein:vir:94 274 FERDMFALRATMHIAYMNVKPEAFATLKPT 303 (304) T ss_pred hhcCcEEEEEEEEeccEeecccceEEEEec Confidence 1111 134556899999999999999999 No 65 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=99.41 E-value=1.5e-14 Score=96.38 Aligned_cols=284 Identities=8% Similarity=-0.024 Sum_probs=165.7 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc-cceeeeeecCCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-GKLSAGYHTPGT 79 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i-G~~t~~~~~~g~ 79 (332) |.. ..+ ++ .+.=+.|+.++++..+..|.++.+.+..+..+ .+++||+. +.+.+..+..|. T Consensus 1 m~t-------------~t~--gg---~liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~-~~~~ip~~~~~~~a~wv~E~~ 61 (303) T protein:vir:97 1 MGT-------------ETS--KA---SLFDKHLVSDLINKVKGHSSLAKLSSQKPIPF-NGSKEFTFTLDSDIDVVAENG 61 (303) T ss_pred Ccc-------------cCC--CC---eEcchhHHHHHHHHHHhhchhhhhcceeecCC-CceEEEEEecCcceEEeecCc Confidence 333 111 11 14558899999999999999999987766654 46788874 667777787887 Q ss_pred CCCccCCCCCceEEEEEeeeeecchhhhhHHHH-----HhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 80 PIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEI-----FSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTG 154 (332) Q Consensus 80 ~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~-----q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~ 154 (332) .+... +++.+++++..-+. ..-..|.+ |. ....++.+.+.++.++++++..|+.++.-.-.+. ...+ T Consensus 62 ~~~~s-~~~f~~v~l~~~kl-~~~~~iS~--ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~----g~~~ 133 (303) T protein:vir:97 62 KKTHG-GLSLEPVTIVPIKV-EYGARLSD--EFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRT----KKAS 133 (303) T ss_pred ccccc-ccceeeEEeeeEEE-EEeehhhH--HHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCC----cccc Confidence 76643 56666777765443 33344543 21 2346788999999999999999998874321000 1111 Q ss_pred cccccee----ccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhcccccccccccccc Q lcl|Aclame:pro 155 EPGGFHV----NIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSG 230 (332) Q Consensus 155 ~~~~~~i----~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g 230 (332) .+.+... .......+.....++.|.++..++...+.... .++++|..+..|++.+|..- + +.- ...+..+ T Consensus 134 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~vmn~~~~~~L~~lkd~~g--~-~~~-~~~~~~~ 207 (303) T protein:vir:97 134 DVIGTNHFDSKVTQVVKFTESEDADANIEAAVNLIQGAEGVVT--GLAMDTEFSTALAKVTNGEM--G-PKM-YPELAWG 207 (303) T ss_pred ccccccccccccccccccccccchHHHHHHHHHHHhhcCCCcc--EEEEcHHHHHHHHHhhccCC--C-eEE-ecCccCC Confidence 1111100 00111112233457888888888877766443 36789999999976544320 1 110 1112223 Q ss_pred ceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccc----h-hH Q lcl|Aclame:pro 231 KGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFN----V-QY 305 (332) Q Consensus 231 ~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~----~-~~ 305 (332) ...++++|.+|+.|+++|...... .+....|.++|...+.+ +.+ .++++++....+. . .| T Consensus 208 ~~~~~l~G~Pv~~s~~v~~~~~~~------~~~~~~~~Gdf~~~~~~-~~~--------~~~~~~~~~~~~~d~~~~~~~ 272 (303) T protein:vir:97 208 ANPDSINGLKSSVNTTVGAGADEA------ESKDLVIIGDFESMFKW-GYA--------KQIPMEIIKYGDPDNSGKDLK 272 (303) T ss_pred CCCceecceeeEEecccCCccccC------CCccEEEEeeccccEEE-EEe--------cCcEEEEeeccCCCCcchhhh Confidence 235689999999999999532111 11222455666544322 222 2334444321110 0 02 Q ss_pred HHH--HHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 306 QGD--LIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 306 ~~d--~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) ..| .+++..+++.++++|++.+.|+.| T Consensus 273 ~~n~~~~r~~~r~~~~v~~p~af~~l~~~ 301 (303) T protein:vir:97 273 GYNQIYLRAEAYIGWGILDAKSFARVTKG 301 (303) T ss_pred hcCcEEEEEEEEeccEeecccceEEeeCC Confidence 222 345566889999999999999999 No 66 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=99.40 E-value=2.4e-14 Score=95.22 Aligned_cols=285 Identities=11% Similarity=-0.023 Sum_probs=167.6 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc-cceeeeeecCCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-GKLSAGYHTPGT 79 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i-G~~t~~~~~~g~ 79 (332) |+. .||. |..+.+..++.+..+..++++++.+..+..+|+ +.||+. +.+.+..+..|. T Consensus 1 ma~-----------~gG~---------lvp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~-~~ip~~~~~~~a~~v~E~~ 59 (298) T protein:vir:16 1 MVL-----------NKGT---------LFDPTLVTDLISKVAGKSSIARLSAQKPIPFNG-EKVFTFTMDSEIDVVAESG 59 (298) T ss_pred Ccc-----------cCcc---------eechhHHHHHHHHHHhhhhhhhhcceeeccCCc-eEEEEEecCcceEEecCCc Confidence 332 2332 666888999999999999999999877666544 678864 677888888888 Q ss_pred CCCccCCCCCceEEEEEeeeeecchhhhhHHH---HHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|Aclame:pro 80 PIVGDAGIKANEKTLVMDDLLVSSQFVYSLDE---IFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEP 156 (332) Q Consensus 80 ~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~---~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~ 156 (332) .+... +++..++++..-+.- .-..|.+-=- .....++.+.+.++.++++++..|+.++.-......+.....+.. T Consensus 60 ~~~~~-~~~f~~v~l~~~k~a-~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~ 137 (298) T protein:vir:16 60 KKTHG-GVTLAPQTMVPIKVE-YGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTN 137 (298) T ss_pred ccccc-ccceeEEEEeeeeEE-EeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCccccccccc Confidence 77643 456666666655532 2244544111 123467888999999999999999988742111111111111100 Q ss_pred c-cceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeee Q lcl|Aclame:pro 157 G-GFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYS 235 (332) Q Consensus 157 ~-~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~ 235 (332) . ................+++.|.++..++..++.+.. .++++|..+..|.+.+|.+ ++ +.- ......|. .++ T Consensus 138 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~vmn~~~~~~l~~lkd~~--G~-~i~-~~~~~~~~-~~~ 210 (298) T protein:vir:16 138 HFDSKVTQKVEAPRGIADPNGAIENAVELLTGVDADVT--GIAINPSFRSALAKQKDLQ--DN-ALF-PELKWGAT-PDT 210 (298) T ss_pred ccccccccccccccccccHHHHHHHHHHHhhhcCCCcc--EEEEcHHHHHHHHHhhccC--CC-eee-cCcccCCC-Cce Confidence 0 000001111112233456778888888888887644 3678999999887655432 11 111 12223332 578 Q ss_pred eeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeeccc--ch--h-HHHHH- Q lcl|Aclame:pro 236 IAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDF--NV--Q-YQGDL- 309 (332) Q Consensus 236 i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~--~~--~-~~~d~- 309 (332) ++|.+|+.++++|..... +...-+.++|++.+.+ +.+ .++++++.+... .. . |+.|+ T Consensus 211 l~G~PV~~~~~v~~~~~~--------~~~~~~~GDfs~~~~~-~~~--------~~~~~~~~~~~~~~~~~~~~f~~~~v 273 (298) T protein:vir:16 211 INGLPVDVNKTVSDMSLT--------QRDRAIIGDFANGFKW-GYA--------KEVPLEVIQYGDPDNSGLDLKGYNQV 273 (298) T ss_pred ecceeeEEecccccccCC--------CccEEEEeeccceEEE-EEe--------cCceEEEeeccCCcCcchhhhhcCcE Confidence 999999999999953221 1122445666554322 222 223444443211 10 1 22222 Q ss_pred -HHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 310 -IVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 310 -i~~~~~~G~~vlrpe~~v~i~~A 332 (332) +++.+++|.+++||++.+.|+-| T Consensus 274 ~~ra~~r~d~~v~~~~a~~~l~~a 297 (298) T protein:vir:16 274 YIRAELFLGWGILDATKFARVTEA 297 (298) T ss_pred EEEEEEEEccEeecccceEEEeec Confidence 45566789999999999999999 No 67 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=99.40 E-value=2.8e-14 Score=94.91 Aligned_cols=283 Identities=10% Similarity=0.014 Sum_probs=170.8 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecccceeeeeecCCCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLSAGYHTPGTP 80 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG~~t~~~~~~g~~ 80 (332) |-+-+.. +...+++. .+.-+.++.++.+..++.++++++++..++ .+.+.++|....+....+..|.. T Consensus 1 ~g~~a~~--------~~~~~~~~---~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~-~~~~~~~~~~~~~~a~~v~E~~~ 68 (299) T protein:vir:41 1 MGFNPDT--------TTMQSAKT---GSIPINISEQIITGVKNGSAAMKLAKAVPM-TKPEEEFTFMSGVGAFWVDEAER 68 (299) T ss_pred CCcCCCc--------ccccCCCc---eecchhHHHHHHHHHHhcchhhhhceeeec-CCCcEEEEEEcCCceeeeecCcc Confidence 4432111 11111222 256689999999999999999999987665 45678889888788888888888 Q ss_pred CCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccce Q lcl|Aclame:pro 81 IVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGGFH 160 (332) Q Consensus 81 ~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~~~~ 160 (332) +... +++.+++++...+. +....|.+-=-..+..++.+.+.++.++++++..|+.++.- ..+..+..... . T Consensus 69 ~~~~-~~~f~~v~l~~~k~-~~~~~is~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G----~g~~~~~gil~---~ 139 (299) T protein:vir:41 69 IQTS-KPTFTKAKMRSKKM-GVIIPTTKENLNYSVTNFFSLMQAEIVEAFYKKFDQAVFTG----VESPYNWNILK---S 139 (299) T ss_pred cccc-ccceeEEEEeeEEE-EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhhc----ccCcccccccc---c Confidence 7653 57778888888764 44456655222235688999999999999999999988731 11111110000 0 Q ss_pred eccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeeeeceE Q lcl|Aclame:pro 161 VNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIAGIR 240 (332) Q Consensus 161 i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i~G~~ 240 (332) ...+..........++.|.++..+|..++.+.. .++++|..|..|.+-+|.. .+ +.. +..+..| .++++|.+ T Consensus 140 ~~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~~~--~~v~n~~~~~~L~~lkd~~--G~-~l~-~~~~~~~--~~~l~G~P 211 (299) T protein:vir:41 140 ATDASNLVEETANKYDDLNEAIGLIEAEDLEPN--GIATIRKQRVKYRSTKDGN--GM-PIF-NTATSNG--VDDVLGLP 211 (299) T ss_pred ccccceeeccccccHHHHHHHHHhhhcccCCcC--EEEEcHHHHHHHHHhhccC--Cc-eee-cCCcCCC--Cceeccee Confidence 000001111122237888888888888877543 4689999999997644321 11 111 1112222 46899999 Q ss_pred EEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccch-----------hHHHHH Q lcl|Aclame:pro 241 ILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNV-----------QYQGDL 309 (332) Q Consensus 241 V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~-----------~~~~d~ 309 (332) |+.++++|..+ +...-|-++|+... + +...+++++..++-... .|+.|. T Consensus 212 V~~~~~~~~~~----------~~~~~~~gdfs~~~--i--------~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 271 (299) T protein:vir:41 212 IAYTPKYTFGD----------KDISELVGDWNQAY--Y--------GILRGVEYEILTEATLTTVADETGKPLNLAERDM 271 (299) T ss_pred eEEecccCCCC----------CceEEEEEecccEE--E--------EEecCcEEEEeecccccccccccccchhhhhcCc Confidence 99999999421 11123445555432 1 12233455544432100 122233 Q ss_pred H--HHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 310 I--VGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 310 i--~~~~~~G~~vlrpe~~v~i~~A 332 (332) + +....+|.++++|++.+.|+.+ T Consensus 272 ~~~r~~~~~d~~v~~~~A~~~l~~~ 296 (299) T protein:vir:41 272 AAIKATFEVGFMVVKDEAFSAVQPK 296 (299) T ss_pred EEEEEEEEeccEEecccceEEEEec Confidence 2 4456789999999999988877 No 68 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=99.40 E-value=3.3e-14 Score=94.47 Aligned_cols=284 Identities=10% Similarity=-0.007 Sum_probs=168.9 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc-cceeeeeecCCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-GKLSAGYHTPGT 79 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i-G~~t~~~~~~g~ 79 (332) |+- .||. |.-+.|..++.+..++.|+++++.+..+..+| .++||++ +.+.+.-+..|. T Consensus 1 ma~-----------~gG~---------lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~-~~~~p~~~~~~~a~~v~Eg~ 59 (298) T protein:vir:94 1 MVL-----------NKGT---------LFDPELVTDLISKVAGKSSIARLSAQKPIPFN-GEKVFTFTMDSEIDVVAESG 59 (298) T ss_pred Cee-----------cccc---------ccChhHHHHHHHHHHhhchhhhhcceeeccCC-ceEEEEEecCcceEEeeCCc Confidence 443 2222 55588999999999999999999877666554 5788886 677788888888 Q ss_pred CCCccCCCCCceEEEEEeeeeecchhhhhHHHHH----hchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 80 PIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIF----SQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGE 155 (332) Q Consensus 80 ~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q----~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~ 155 (332) .+... +++.+++++..-+. .....|.+- -.+ ...++.+.+.++.++++++.+|+.++.-............+. T Consensus 60 ~~~~~-~~~f~~v~l~~~k~-~~~~~iS~e-ll~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~ 136 (298) T protein:vir:94 60 KKTHG-GVTLAPQTMVPIKV-EYGARISDE-FMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGT 136 (298) T ss_pred ccccc-ccceeEEEEeeeEE-EEeeehhHH-HhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccccccc Confidence 77643 56667777776654 233445441 111 235688899999999999999998874211111011111000 Q ss_pred cc-cceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceee Q lcl|Aclame:pro 156 PG-GFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLY 234 (332) Q Consensus 156 ~~-~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~ 234 (332) .+ ...........+....+++.|.++..+|..++.... .++++|..|..|.+.+|.+ ++ +.- ......| ..+ T Consensus 137 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~vmn~~~~~~l~~lkd~~--G~-~l~-~~~~~~~-~~~ 209 (298) T protein:vir:94 137 NHFDSKVTQKVEAPRGIADPNGAIENAVELLTGVDADVT--GIAINPSFRSALAKQKDLQ--GN-ALF-PELKWGA-TPD 209 (298) T ss_pred cccccccccccccccccccHHHHHHHHHHhhhhcCCCcc--EEEEcHHHHHHHHHhhccC--CC-eee-cCcccCC-CCc Confidence 00 000000111122334457789999999988887544 4789999999987654432 11 111 1222233 357 Q ss_pred eeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeeccc--ch---hHHHHH Q lcl|Aclame:pro 235 SIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDF--NV---QYQGDL 309 (332) Q Consensus 235 ~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~--~~---~~~~d~ 309 (332) +++|++|+.++.+|...++. ....+.++|++...+ + ...++++++.+... .. .|+.|. T Consensus 210 tl~G~PV~~~~~v~~~~~~~--------~~~~~~Gdfs~~~~~-~--------~~~~~~~~~~~~~~~d~~~~~~f~~~~ 272 (298) T protein:vir:94 210 TINGLPVDVNKTVSDMSLTQ--------RDRAIIGDFANGFKW-G--------YAKEVPLEVIQYGDPDNSGLDLKGYNQ 272 (298) T ss_pred eecceeeEEecccccccCCC--------ccEEEEeeccceEEE-E--------EecCceEEEeecCCCcCcchhhhhcCc Confidence 89999999999999532211 112344555543211 1 12333444433211 10 122222 Q ss_pred --HHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 310 --IVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 310 --i~~~~~~G~~vlrpe~~v~i~~A 332 (332) +++.+++|.+++||++.+.|+.| T Consensus 273 v~~r~~~r~~~~~~~~~a~~~l~~~ 297 (298) T protein:vir:94 273 VYIRAELFLGWGILDATKFARVTEA 297 (298) T ss_pred EEEEEEEEeccEeecccceEEEEec Confidence 45567789999999999999999 No 69 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=99.39 E-value=1.9e-14 Score=95.76 Aligned_cols=306 Identities=12% Similarity=0.051 Sum_probs=164.0 Q ss_pred CCCccccccc--ccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc-cceeeeeecC Q lcl|Aclame:pro 1 MTTLSNFSLP--NQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-GKLSAGYHTP 77 (332) Q Consensus 1 m~~~~~~~r~--~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i-G~~t~~~~~~ 77 (332) |+-++-+..- +....|+..+.+. +|.-+.+..++.+..++.+.++.+.+..++.+ .+.+||+. +.+++..... T Consensus 1 ~a~l~el~~~~~~~~~~g~~~~~~~---~liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~-~~~~~p~~~~~~~a~~v~e 76 (333) T protein:vir:78 1 MATLNELLPNSAGSNHQGRLAHVPS---DLLPKEIVGPIFDKAQESSLVLRMGEQIPISY-GETIIPTTVKRPEVGQVGV 76 (333) T ss_pred CchhHHhhhhcccccccCceecCCc---cccchhHHHHHHHHHHhhchhhhhcceeeccC-CceEEEEEeCCceeEeecC Confidence 5554433211 1112222222222 37789999999999999999999988777664 55577766 4555554444 Q ss_pred CCCCCc-------cCCCCCceEEEEEeeeeecc-hhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|Aclame:pro 78 GTPIVG-------DAGIKANEKTLVMDDLLVSS-QFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEA 149 (332) Q Consensus 78 g~~~~~-------~~~~~~~~~~l~ID~~~~~~-~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~ 149 (332) |..... ...++..++++. ..++.. ..|.+-=-.++..++.+.+.+++++++++..|+.++.-- ..... T Consensus 77 g~~~~~~e~~~~~~~~~~f~~i~l~--~~kl~~~~~is~ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~G~--g~~~~ 152 (333) T protein:vir:78 77 GTSNEQREGGLKPLSGTAWDTRSVS--PIKLATIVTVSEEFARMNPSGLYTKLQGDLAYAIGRGIDLAVFHGK--SPLTG 152 (333) T ss_pred cccccccccccccccccceeEEEEe--eEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhccc--CCCCC Confidence 432211 122344444444 444444 345442122467789999999999999999999887311 11010 Q ss_pred ccccccccc---ceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhcccccccccc Q lcl|Aclame:pro 150 SPVTGEPGG---FHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGD 226 (332) Q Consensus 150 ~~~~~~~~~---~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~ 226 (332) ....+.... ............+...++.|+++...+..+. ......+++.|..|..|++....+-.+..+.- ... T Consensus 153 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~-~~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i~-~~~ 230 (333) T protein:vir:78 153 SALQGIDTDNVIANTTNVDYLQETGDPLLDRLLDGYDLVSANT-DVEFNGWAVDPRFRAHLLRAQAYRDANGNVDP-SRI 230 (333) T ss_pred cccccccccccccccccccccccccchhHHHHHHHHHhhcccc-ccCceEEEEcchHHHHHHHHhhhcCCCCceee-cCc Confidence 001110000 0111011111223445788888877765543 23334567899999988753111111111111 122 Q ss_pred ccccceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeeccc----c Q lcl|Aclame:pro 227 MNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDF----N 302 (332) Q Consensus 227 ~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~----~ 302 (332) ...+. -++++|++|+.|+++|...+.. ..+...-|-++|+... + +...+++++..+.-. . T Consensus 231 ~~~~~-~~~l~G~Pv~~~~~i~~~~~~~-----~~~~~~~~~gD~~~~~--~--------g~~~~~~i~~~~~~~~~~~~ 294 (333) T protein:vir:78 231 NLAAQ-TGDVLGLPAQFGRAVGGDLGAA-----VDSKTRIIGGDFSQLK--F--------GFADEIRIKMSDTATLTDSG 294 (333) T ss_pred cccCC-CceeeceeeEEccccCCCcccc-----CCCccEEEEEecccEE--E--------EEeeccEEEEeccccccccc Confidence 23333 5799999999999999542211 1222234455555432 1 122334444433210 0 Q ss_pred ----hhHHHHH--HHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 303 ----VQYQGDL--IVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 303 ----~~~~~d~--i~~~~~~G~~vlrpe~~v~i~~A 332 (332) -.|+-|. +++.+.+|.++++|++.+.|..| T Consensus 295 ~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~~ 330 (333) T protein:vir:78 295 SATVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVDD 330 (333) T ss_pred cceeehhhcCcEEEEEEEEEccEEecccceEEEecc Confidence 0111122 45567889999999999999988 No 70 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=99.35 E-value=1.1e-13 Score=91.62 Aligned_cols=280 Identities=15% Similarity=0.066 Sum_probs=166.4 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc-cceeeeeecCCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-GKLSAGYHTPGT 79 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i-G~~t~~~~~~g~ 79 (332) |+- .+.+..+-... ++++ +|.-+.|..++.+..+..+.++++.+.....++....+|.. +.+.+..+..|. T Consensus 1 m~~----~~~~~~~~~~t-~~~~---~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~ 72 (297) T protein:vir:95 1 MTV----QTFNPENVLVS-QKKD---GTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTDGISAYWVNETE 72 (297) T ss_pred CCc----ccccccccccc-CCCc---ceechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcCCceeEEeecCc Confidence 544 22222111111 1111 26679999999999999999999988777666555667744 567778888888 Q ss_pred CCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc Q lcl|Aclame:pro 80 PIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGGF 159 (332) Q Consensus 80 ~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~~~ 159 (332) .+... +++.+++++...+. .....|.+-=-.++..++.+.+.++.++++++..|+.++.- ..+..+........ T Consensus 73 ~~~~~-~~~f~~v~l~~~k~-~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G----~g~~~~~gi~~~~~ 146 (297) T protein:vir:95 73 KIKTD-KPEVVPVTLKAHKL-GIILVTSREALNYTWKKFFEDMKPQIVEAFYKKIDEAGLLG----HDTPFANSVAKAAK 146 (297) T ss_pred ccccc-ccceeEEEEeeEEE-EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcc----cCCccccccccccc Confidence 77654 56777777777663 34455655222235688999999999999999999988731 11111111110000 Q ss_pred eeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeeeece Q lcl|Aclame:pro 160 HVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIAGI 239 (332) Q Consensus 160 ~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i~G~ 239 (332) ......+ ....++.|+++..+|..++.+.. .++++|..|..|.+-+|.. + ..+.++. .+.++|. T Consensus 147 ~~~~~~~----~~~t~~~i~~~~~~l~~~~~~~~--~~v~~~~~~~~L~~l~d~~-------G--~~i~~~~-~~~l~G~ 210 (297) T protein:vir:95 147 DANKVIG----GPINYDNILKLQDALYDADVEPN--AFVSKIQNRSALREARDGN-------K--VSIYDKA-ANTIDGI 210 (297) T ss_pred ccceecc----cccCHHHHHHHHHHhhhccCCcC--EEEEcHHHHHHHHHhhccC-------C--ceeecCC-CCcccce Confidence 1111111 11236778888888888876543 4678999999987543321 1 1123332 4678999 Q ss_pred EEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccc--------h---hHHHH Q lcl|Aclame:pro 240 RILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFN--------V---QYQGD 308 (332) Q Consensus 240 ~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~--------~---~~~~d 308 (332) +|+.+++.+...+ ..+.++|+... + +...+++++..++-.. . .++-| T Consensus 211 Pv~~~~~~~~~~~------------~~~~gd~s~~~--~--------~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 268 (297) T protein:vir:95 211 TTVDLKSARFEKG------------DLLAGDFDNLI--Y--------GVPYNITYKISEEGQISTITNADGTPINLFEQE 268 (297) T ss_pred eeEeecCCCCCCc------------eEEEEecccEE--E--------EEecCeEEEEeeccccccccccCccchhhhhcC Confidence 9998876653222 12334544432 1 1223344444433210 0 01222 Q ss_pred H--HHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 309 L--IVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 309 ~--i~~~~~~G~~vlrpe~~v~i~~A 332 (332) . ++...++|.++++|++.+.|+.| T Consensus 269 ~~~~r~~~~~d~~v~~~~a~~~l~~a 294 (297) T protein:vir:95 269 MIAIRATMDIAVMITKTDAFAKLTPA 294 (297) T ss_pred cEEEEEEEEeccEeecccceEEEeec Confidence 2 34446889999999999999999 No 71 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=99.33 E-value=1.1e-13 Score=91.68 Aligned_cols=291 Identities=14% Similarity=0.054 Sum_probs=164.5 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc-cceeeeeecCCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-GKLSAGYHTPGT 79 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i-G~~t~~~~~~g~ 79 (332) |+..+. |+ .+-=+.|..++.+..+..|+++.+.+..+..+| .++||+. +.+.+.-+..|+ T Consensus 1 mat~~~----------gg--------~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~-~~~~p~~~~~~~a~wv~Eg~ 61 (311) T protein:vir:81 1 MVALAT----------GT--------FQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFG-EQQYMTLTAPPRGEVVGEGA 61 (311) T ss_pred CceecC----------Cc--------eEcchhHHHHHHHHHHhcchhhhhcceeecCCC-ceEEEEEeCCceeEEeecCc Confidence 544211 11 133488999999999999999999887666555 5888876 677888788888 Q ss_pred CCCccCCCCCceEEEEEeeeeecchhhhhHHHH----HhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 80 PIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEI----FSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGE 155 (332) Q Consensus 80 ~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~----q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~ 155 (332) .+... +++.+++++...+. ..-..|.+ +-. ....++.+.+.++.+++|++.+|+.++.--... ...+..+. T Consensus 62 ~~~~~-~~~f~~v~l~~~kl-~~~~~iS~-ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~--~~~~~~gi 136 (311) T protein:vir:81 62 QKSES-TATFAPVTAIPRKV-QVTQRFSQ-EVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPL--TGAALSGS 136 (311) T ss_pred ccccc-cceeeEEEEeeEEE-EEeehhhH-HHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCC--CCcccccc Confidence 77753 56777777777654 33344544 111 134568899999999999999999887421100 11111111 Q ss_pred cccc--eeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhcccccccccccccccee Q lcl|Aclame:pro 156 PGGF--HVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGL 233 (332) Q Consensus 156 ~~~~--~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v 233 (332) +... .........+.....+..|..+..++...+... ..+++.|..+..|.+-+|.+ ..+.- .... .++.. T Consensus 137 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~--~~~vmn~~~~~~l~~lkd~~---G~~l~-~~~~-~~~~~ 209 (311) T protein:vir:81 137 PAKILDTTNIVELTTGTSATPDLAVEAAVGLVLGDNLSP--DGVALDNTFSFMLATQRDSQ---GRKLY-PELG-FGTDV 209 (311) T ss_pred cccccccceeeeecccccchHHHHHHHHHHHhhhcCCCc--eEEEEcHHHHHHHHhhhccC---CCeee-cCcc-ccCCC Confidence 1110 000111111222233444555666665555533 34688999999887644321 11110 1111 22236 Q ss_pred eeeeceEEEeeCcccccccccccc----cccccccccccccccceEEEeechhhhhhhhhccceeeeeecccch----hH Q lcl|Aclame:pro 234 YSIAGIRILKSNNLAGLYGQDLSS----AAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNV----QY 305 (332) Q Consensus 234 ~~i~G~~V~~sn~lp~~~g~~~~~----~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~----~~ 305 (332) +.++|.+|+.++++|......... ....+....+.+||++.. .+...++++++.++-+.. .| T Consensus 210 ~tl~G~Pv~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~gDfs~~~----------i~~~~~~~~~~~~~~~~~~~~~~~ 279 (311) T protein:vir:81 210 ASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFR----------WGVQVSIPLELIEFGDPDGLGDLK 279 (311) T ss_pred ceecceeEEecccccccccccccccchhcccCCccEEEEEecccEE----------EEEeccceEEEeccCCCCcchhhh Confidence 789999999999998532211111 111112223445555432 112234455555432111 12 Q ss_pred HHHH--HHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 306 QGDL--IVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 306 ~~d~--i~~~~~~G~~vlrpe~~v~i~~A 332 (332) +-|+ +++..++|.++++|++.+.|+-| T Consensus 280 ~~~~v~~r~~~r~d~~v~~~~a~~~l~~a 308 (311) T protein:vir:81 280 RQNQIAIRAEVVYGIGIMSTDAFAVVRDA 308 (311) T ss_pred hcCcEEEEEEEEeccEeecccceEEEEee Confidence 2232 34557899999999999999999 No 72 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=99.33 E-value=2.3e-13 Score=89.88 Aligned_cols=285 Identities=11% Similarity=0.007 Sum_probs=165.0 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc-cceeeeeecCCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-GKLSAGYHTPGT 79 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i-G~~t~~~~~~g~ 79 (332) |+.- ..+.+. +.-+.+..++.+..++.|.++.+.+......| .+.||+. +.+.+.-...|. T Consensus 1 ma~~-------------t~~~G~----lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~-~~~~p~~~~~~~a~wv~Eg~ 62 (300) T protein:vir:95 1 MSEA-------------QLSKGN----LFNPELVTKVINKVKGHSSIAKLSPQKPIPFN-GQREFVFDFDSDIDIVAENG 62 (300) T ss_pred Cccc-------------ccCCcc----eechhhHHHHHHHHHhhhhhhhhcceeeccCC-ceEEEEEecCcceEEeeCCc Confidence 6661 111111 66788999999999999999888776665554 4677764 566777777787 Q ss_pred CCCccCCCCCceEEEEEeeeeecchhhhhHHHHH-----hchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 80 PIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIF-----SQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTG 154 (332) Q Consensus 80 ~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q-----~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~ 154 (332) .+... .++.+++++..-+. +.-..|.+ |.. ...++.+.+.++.++++++..|+.++.-..........+.+ T Consensus 63 ~~~~s-~~~f~~v~l~~~k~-~~~~~iS~--ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~ 138 (300) T protein:vir:95 63 KKTHG-GVSLDPVTIVPLKV-EYGARVSD--EFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIG 138 (300) T ss_pred ccccc-cccceeeEeeeEEE-EEeehhhH--HHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCccccc Confidence 76643 56777777776653 33344544 222 24678899999999999999999887321100000000100 Q ss_pred ccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceee Q lcl|Aclame:pro 155 EPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLY 234 (332) Q Consensus 155 ~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~ 234 (332) ........ ...........++.|.++..++...+.... .++++|..+..|.+.+|.+ ..+.- ......| ..+ T Consensus 139 ~~~~~~~~-~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~vmn~~~~~~L~~lkd~~---G~~i~-~~~~~~~-~~~ 210 (300) T protein:vir:95 139 DNCFDKKV-TQTVPFKDTNPDESMEDAVGMIDGSERDIT--GAILDPIFTTALSKMKNAE---GGKLY-PELAWGG-VPD 210 (300) T ss_pred cccccccc-ceeecccccchHHHHHHHHHHhhhcCCCcc--EEEECHHHHHHHHHhhccC---CCeec-cCccccC-CCc Confidence 00000000 001111233447788888888887765433 4678999999887654321 11111 1112223 367 Q ss_pred eeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecc--cc--hh-HHHHH Q lcl|Aclame:pro 235 SIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGD--FN--VQ-YQGDL 309 (332) Q Consensus 235 ~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~--~~--~~-~~~d~ 309 (332) +++|++|+.|+.+|...... ....+.+||++..-+.. + .++++++.... .. .. |+-|. T Consensus 211 ~l~G~Pv~~s~~v~~~~~~~--------~~~~~~GDf~~~~~~~~-~--------~~~~~~v~~~~~~d~~~~~~f~~~~ 273 (300) T protein:vir:95 211 AINGLAVDKNRTVSYSQTDP--------KNTAIVGDFETMFKWGY-A--------KEVPMEIIKYGDPDNSGRDLKGYNQ 273 (300) T ss_pred eecceeeEEecCCCCCCCCC--------ccEEEEeeccceEEEEE-e--------cccEEEEeeccCCCCcchhhhhcCc Confidence 89999999999998532211 11234456654331111 1 22233333211 10 01 22222 Q ss_pred --HHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 310 --IVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 310 --i~~~~~~G~~vlrpe~~v~i~~A 332 (332) +++.+++|.++++|++.+.|+.+ T Consensus 274 v~~r~~~r~d~~v~~~~a~~~l~~~ 298 (300) T protein:vir:95 274 IYIRCEAYIGWGIMDAASFARIVKT 298 (300) T ss_pred EEEEEEEeecceeecccceEEEecC Confidence 35667899999999999999998 No 73 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=99.32 E-value=5.5e-14 Score=93.28 Aligned_cols=286 Identities=13% Similarity=0.045 Sum_probs=165.2 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc-cceeeeeecCCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-GKLSAGYHTPGT 79 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i-G~~t~~~~~~g~ 79 (332) +++ +.-++...+.....+..+ .-.+.-+.+..++.+..+..++++++.+..++.+ .+++||+. +.+.+.-...|. T Consensus 14 f~~--~~~~~~~~~a~~~~~~~~-~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~-~~~~ip~~~~~~~a~~v~Eg~ 89 (324) T protein:vir:93 14 FAS--NNVKPQVFNPDNVMMHEK-KDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEG-TEKKFTFWADKPGAYWVGEGQ 89 (324) T ss_pred HHH--hhhhhhhcccccccccCC-CcceechhHHHHHHHHHHhhchhhhhcceeeccC-CceEEEEEecCcceeeecCCc Confidence 111 111111000000000000 0126678999999999999999999987766554 55788876 667777778888 Q ss_pred CCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc Q lcl|Aclame:pro 80 PIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGGF 159 (332) Q Consensus 80 ~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~~~ 159 (332) .+... .++.+++++..-+. ..-..|.+-=-.++..++.+.+.++.++++++.+|+.++.-- + ......+.. . T Consensus 90 ~~~~~-~~~f~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~--g--~~~~~~~~~--~ 161 (324) T protein:vir:93 90 KIETS-KATWVNATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ--G--NNPFGKSIA--Q 161 (324) T ss_pred ccccc-ccceeEEEEEeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcCC--C--CCCcCcccc--c Confidence 87653 57777777777664 344566652222456899999999999999999999876321 1 111000000 0 Q ss_pred eeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeeeece Q lcl|Aclame:pro 160 HVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIAGI 239 (332) Q Consensus 160 ~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i~G~ 239 (332) .+..+ .........++.|.++...|..++.... .++++|..|..|.+.+|.. ....+..+ .-++++|. T Consensus 162 ~~~~~-~~~~~~~~~~~~i~~~~~~l~~~~~~~~--~~v~n~~~~~~L~~l~d~~--------G~~~~~~~-~~~~l~G~ 229 (324) T protein:vir:93 162 SIEKT-NKVIKGDFTQDNIIDLEALLEDDELEAN--AFISKTQNRSLLRKIVDPE--------TKERIYDR-NSDSLDGL 229 (324) T ss_pred ccccc-ceeccccccHHHHHHHHHhhhhccCCCC--EEEEcHHHHHHHHHhhCCC--------CCeeecCC-CCCcccce Confidence 00000 1111122347888888888888776433 5689999999886544321 11112223 35689999 Q ss_pred EEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeeccc-------ch----hHHH- Q lcl|Aclame:pro 240 RILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDF-------NV----QYQG- 307 (332) Q Consensus 240 ~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~-------~~----~~~~- 307 (332) +|+.++..+...+ ..+.++|+... ++ ...+++++..++-. +. .|+. T Consensus 230 PVv~~~~~~~~~~------------~i~~gdfs~~~--~~--------~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~n 287 (324) T protein:vir:93 230 PVVNLKSSNLKRG------------ELITGDFDKLI--YG--------IPQLIEYKIDETAQLSTVKNEDGTPVNLFEQD 287 (324) T ss_pred eeEeecCCCCCcc------------eEEEEecceEE--EE--------EecCcEEEEeecccccccccccccchhhhhcC Confidence 9998876653211 23445555422 11 12333444443311 00 0111 Q ss_pred -HHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 308 -DLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 308 -d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) -.+++.+.||.++++|++.+.|..| T Consensus 288 ~~~~r~~~r~d~~v~~~~a~~~l~~a 313 (324) T protein:vir:93 288 MVALRATMHVALHIADDKAFAKLVPA 313 (324) T ss_pred cEEEEEEEEeccEEecccceEEEecc Confidence 2345667889999999999999988 No 74 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=99.32 E-value=2.2e-13 Score=90.01 Aligned_cols=290 Identities=10% Similarity=-0.022 Sum_probs=159.9 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc-cceeeeeecCCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-GKLSAGYHTPGT 79 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i-G~~t~~~~~~g~ 79 (332) |++- ..++++ .+.-+++++++.+..++.|+++.+.+..+.. +..++||+. |.+.+.-+..|. T Consensus 1 Ma~~-------------~~~~gg---~~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~-~~~~~ip~~~~~~~a~wv~Eg~ 63 (315) T protein:vir:80 1 MADD-------------FLSAGK---LELPGSMIGAVRDRAIDSGVLAKLSPEQPTI-FGPVKGAVFSGVPRAKIVGEGE 63 (315) T ss_pred CCCC-------------cCCcCc---eEcchHHHHHHHHHHHhhchhhhhcceeecC-CCceEEEEEeCCcceEEeeCCc Confidence 7761 111111 2566899999999999999999988765554 456788875 667888888888 Q ss_pred CCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchh----HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 80 PIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYS----TRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGE 155 (332) Q Consensus 80 ~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d----~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~ 155 (332) .+... +++.+++++..-+. ..-..|.+-=-.++..| +++.+.++.+++|++.+|+.++.-- ......+..+. T Consensus 64 ~~~~s-~~~f~~v~l~~~kl-~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~--~~~~~~~~~~~ 139 (315) T protein:vir:80 64 VKPSA-SVDVSAFTAQPIKV-VTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGI--DPATGKAASAV 139 (315) T ss_pred ccccc-ccceeeeEeeeeeE-EeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeecc--CCCCCcccccc Confidence 77653 56667666665543 22234544211122333 6788899999999999998776310 00011111110 Q ss_pred cccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCch--hhcccccccccccccccee Q lcl|Aclame:pro 156 PGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTN--ILNREIGNSQGDMNSGKGL 233 (332) Q Consensus 156 ~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~--~~~~d~~~~~~~~~~g~~v 233 (332) .. .+..++.........++.|.++..++..++.-.... .++.|..+..|.+.++.. -.+..+.- ..+..|+ . T Consensus 140 ~~--~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~-~imn~~~~~~L~~l~~~~g~~~~g~~~~--~~~~~g~-~ 213 (315) T protein:vir:80 140 HT--SLNKTKNIVDATDSATADLVKAVGLIAGAGLQVPNG-VALDPAFSFALSTEVYPKGSPLAGQPMY--PAAGFAG-L 213 (315) T ss_pred cc--ccccccceeeccccchHHHHHHHHHHhhccCccceE-EEEcHHHHHHHHHHhhccCCcccccccc--cccccCC-C Confidence 00 011111111112223566777777776555533333 568999999987644321 11111110 1122332 5 Q ss_pred eeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccch-----hHHHH Q lcl|Aclame:pro 234 YSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNV-----QYQGD 308 (332) Q Consensus 234 ~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~-----~~~~d 308 (332) ++++|.+|+.++++|...... .......+.+||++.. +.. ...+++++.++.... .|+-| T Consensus 214 ~tl~G~PV~~~~~~~~~~~~~-----~~~~~~~~~GDfs~~~-~g~---------~~~~~i~i~~~~~~~~~~~~~~~~~ 278 (315) T protein:vir:80 214 DNWRGLNVGASSTVSGAPEMS-----PASGVKAIVGDFSRVH-WGF---------QRNFPIELIEYGDPDQTGRDLKGHN 278 (315) T ss_pred ceecceeeEecCcCCcccccc-----cccccEEEEeecccEE-EEE---------ecCeeEEEeccccccCcccchhhcC Confidence 789999999999999532211 1111123456666532 111 112233333221100 02222 Q ss_pred --HHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 309 --LIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 309 --~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) .+++..++|.++++|++.+.|+.+ T Consensus 279 ~v~~r~~~r~~~~v~~~~a~~~l~~~ 304 (315) T protein:vir:80 279 EVMVRAEAVLYVAIESLDSFAVVKEK 304 (315) T ss_pred cEEEEEEEEecceeecccceEEEeec Confidence 234557899999999999998855 No 75 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=99.32 E-value=7.3e-14 Score=92.59 Aligned_cols=283 Identities=13% Similarity=0.010 Sum_probs=163.8 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEeccc--ceeeeeecCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTG--KLSAGYHTPG 78 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG--~~t~~~~~~g 78 (332) +.-. ...+.... ++..+++ .+....+...+.+..+..+.++++++..+..+ .++++|+.. ..++.....| T Consensus 104 ~~~~-~~~~~~~~--~~~~~~g----~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~a~~v~Eg 175 (390) T protein:vir:10 104 MNIK-AALNTAST--DAAGSAG----ALTTPNRLPGFITQPDARLTVRDLIGSGRTDS-ALIEYVQETGFVNNAAIVAEG 175 (390) T ss_pred hHHH-HHHHhhhc--ccccccc----cccchhHHHHHHHHHHhhchhhhhcceeeccC-CceEEEEEecCCcceeeecCC Confidence 0000 00011111 1111111 26777888888888888888889888766544 467888653 3556666777 Q ss_pred CCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Q lcl|Aclame:pro 79 TPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGG 158 (332) Q Consensus 79 ~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~~ 158 (332) +.... .+++..++++.+.+.. .-+.|.+ +-.+...++.+.+.++.++++++..|+.|+.- ..++....|.-.. T Consensus 176 ~~~~~-~~~~~~~i~~~~~k~~-~~~~is~-ell~d~~~l~~~i~~~l~~~~~~~~~~~il~G----~G~~~~p~Gi~~~ 248 (390) T protein:vir:10 176 ALKPE-SSLKFAKKTDTTHVIA-HTMKATR-QILSDAPQLASYMNNRLIRGLKVKEDAEILRG----TGANDGLLGLIPQ 248 (390) T ss_pred ccccc-cccceeEEEEeeEEEE-EeehhhH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhc----CCCCccccccccc Confidence 77654 3567788888887753 3445655 22334467888999999999999999987631 1111111111000 Q ss_pred ceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeeeec Q lcl|Aclame:pro 159 FHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIAG 238 (332) Q Consensus 159 ~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i~G 238 (332) .... +..........++.+.++...|..+..+.. .+|++|..|..|.+-+|.. .+ +.... .. .+ ..+.++| T Consensus 249 ~~~~-~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~--~~v~n~~~~~~L~~lkd~~--g~-~l~~~-~~-~~-~~~~l~G 319 (390) T protein:vir:10 249 ATTY-AAPTTIAGATRVDQLRLAMLQASLAEYPAS--GIVINPIDWAAIELAKDAN--NQ-YLIGN-AR-GT-LTPTLWG 319 (390) T ss_pred cccc-cccccccccchHHHHHHHHHhhccccCCCC--EEEEcHHHHHHHHHhhcCC--Cc-eeecC-Cc-Cc-CCceecc Confidence 0000 111111223346788888889988887654 4678999999987654432 11 11111 11 22 2457899 Q ss_pred eEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHH--HHHHHHHh Q lcl|Aclame:pro 239 IRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGD--LIVGKLAM 316 (332) Q Consensus 239 ~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d--~i~~~~~~ 316 (332) .+|+.++.+|... .+-++|+...- ++. ..+++++..+... .+..| .+++.+++ T Consensus 320 ~pv~~~~~~p~~~--------------~~~gdf~~~~~-~~~--------~~~~~i~~~~~~~--~~~~~~~~~r~~~r~ 374 (390) T protein:vir:10 320 LPVVATQAMAPGE--------------FLVGAFDLAAQ-IFD--------QWDARVEIGYVND--DFQRNMVTVLAEERL 374 (390) T ss_pred eeeEEcCCCCCCc--------------EEEEeccceEE-EEE--------ecceEEEEeeccc--ccccCcEEEEEEEee Confidence 9999999999421 23355544221 222 2334555443211 12233 34456789 Q ss_pred CCceechhheeeeecC Q lcl|Aclame:pro 317 GCGSLRTSVAGSFQAA 332 (332) Q Consensus 317 G~~vlrpe~~v~i~~A 332 (332) +.++++|++.+.+.-| T Consensus 375 d~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 375 ALVVYRPEALISGSFA 390 (390) T ss_pred ccEEeccccEEEEEeC Confidence 9999999999999999 No 76 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=99.31 E-value=1.3e-13 Score=91.23 Aligned_cols=286 Identities=9% Similarity=0.017 Sum_probs=163.2 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc-c-ceeeeeecCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-G-KLSAGYHTPG 78 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i-G-~~t~~~~~~g 78 (332) +.......+ ......+..+.+ .+..+.+..++.+..+..+.++++++..+.. +.++++|+. + ..+..-...| T Consensus 93 ~~~~~~~~~-~~~~~~~~~~~g----~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~a~~v~E~ 166 (385) T protein:vir:18 93 QGTFGAKTF-NKSLGSDADSAG----SLIQPMQIPGIIMPGLRRLTIRDLLAQGRTS-SNALEYVREEVFTNNADVVAEK 166 (385) T ss_pred hccchhhHH-HhhhccccccCC----ceecchhhhHHHHHhhhccchhhhcceeccc-CcceEEEEEecCCcceeeeccC Confidence 111000000 001111211111 2567888999999999999999998877654 457889976 3 3455566677 Q ss_pred CCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Q lcl|Aclame:pro 79 TPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGG 158 (332) Q Consensus 79 ~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~~ 158 (332) +.+... +++..++++.+.+.- ..+.|++ +-.+...++.+.+.++.++++++.+|+.++.- ..++.+..|.... T Consensus 167 ~~~~~~-~~~~~~~~~~~~k~~-~~~~is~-ell~d~~~l~~~i~~~la~a~~~~~d~~~l~G----~g~~~~~~Gi~~~ 239 (385) T protein:vir:18 167 ALKPES-DITFSKQTANVKTIA-HWVQASR-QVMDDAPMLQSYINNRLMYGLALKEEGQLLNG----DGTGDNLEGLNKV 239 (385) T ss_pred cccccc-ccceeEEEEeeeeEE-EeehhhH-HHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhc----cCCCCcccccccc Confidence 766543 567778888877753 3345654 23334456888889999999999999887731 1112121111000 Q ss_pred ceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeeeec Q lcl|Aclame:pro 159 FHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIAG 238 (332) Q Consensus 159 ~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i~G 238 (332) .... ...........++.|.++..+|.....+.. .++++|..|..|...+|.. .+ +... ....|. .+.++| T Consensus 240 ~~~~-~~~~~~~~~~~~d~i~~~~~~l~~~~~~~~--~~~~~~~~~~~l~~lkd~~--G~-~l~~--~~~~~~-~~~l~G 310 (385) T protein:vir:18 240 ATAY-DTSLNATGDTRADIIAHAIYQVTESEFSAS--GIVLNPRDWHNIALLKDNE--GR-YIFG--GPQAFT-SNIMWG 310 (385) T ss_pred cccc-cccccccccchHHHHHHHHHhhccccCCCC--EEEEcHHHHHHHHHhhcCC--Cc-eecc--CcccCC-Cceecc Confidence 0001 011112233457888899888877765433 5688999999987654421 11 1111 122333 568999 Q ss_pred eEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHH--HHHHHHHh Q lcl|Aclame:pro 239 IRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGD--LIVGKLAM 316 (332) Q Consensus 239 ~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d--~i~~~~~~ 316 (332) .+|+.|+.+|... .+-++|+... +++ ...+++++......+ .|.-+ .+++.+++ T Consensus 311 ~pV~~~~~~p~~~--------------~~~gd~~~~~-~~~--------~~~~~~v~~~~~~~~-~~~~~~~~~~~~~r~ 366 (385) T protein:vir:18 311 LPVVPTKAQAAGT--------------FTVGGFDMAS-QVW--------DRMDATVEVSREDRD-NFVKNMLTILCEERL 366 (385) T ss_pred eeeEEcCcCCCCc--------------EEEeecccEE-EEE--------EecceEEEEeccccc-hhhcCcEEEEEEEee Confidence 9999999999421 1223333321 122 223445554432211 12222 23556678 Q ss_pred CCceechhheeeeecC Q lcl|Aclame:pro 317 GCGSLRTSVAGSFQAA 332 (332) Q Consensus 317 G~~vlrpe~~v~i~~A 332 (332) |.++++|++.+.+.-+ T Consensus 367 ~~~v~~~~a~~~~~~~ 382 (385) T protein:vir:18 367 ALAHYRPTAIIKGTFS 382 (385) T ss_pred ccEEecccceEEEEec Confidence 9999999999877766 No 77 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=99.31 E-value=1.3e-13 Score=91.23 Aligned_cols=286 Identities=9% Similarity=0.017 Sum_probs=163.2 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc-c-ceeeeeecCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-G-KLSAGYHTPG 78 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i-G-~~t~~~~~~g 78 (332) +.......+ ......+..+.+ .+..+.+..++.+..+..+.++++++..+.. +.++++|+. + ..+..-...| T Consensus 93 ~~~~~~~~~-~~~~~~~~~~~g----~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~a~~v~E~ 166 (385) T protein:vir:19 93 QGTFGAKTF-NKSLGSDADSAG----SLIQPMQIPGIIMPGLRRLTIRDLLAQGRTS-SNALEYVREEVFTNNADVVAEK 166 (385) T ss_pred hccchhhHH-HhhhccccccCC----ceecchhhhHHHHHhhhccchhhhcceeccc-CcceEEEEEecCCcceeeeccC Confidence 111000000 001111211111 2567888999999999999999998877654 457889976 3 3455566677 Q ss_pred CCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Q lcl|Aclame:pro 79 TPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGG 158 (332) Q Consensus 79 ~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~~ 158 (332) +.+... +++..++++.+.+.- ..+.|++ +-.+...++.+.+.++.++++++.+|+.++.- ..++.+..|.... T Consensus 167 ~~~~~~-~~~~~~~~~~~~k~~-~~~~is~-ell~d~~~l~~~i~~~la~a~~~~~d~~~l~G----~g~~~~~~Gi~~~ 239 (385) T protein:vir:19 167 ALKPES-DITFSKQTANVKTIA-HWVQASR-QVMDDAPMLQSYINNRLMYGLALKEEGQLLNG----DGTGDNLEGLNKV 239 (385) T ss_pred cccccc-ccceeEEEEeeeeEE-EeehhhH-HHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhc----cCCCCcccccccc Confidence 766543 567778888877753 3345654 23334456888889999999999999887731 1112121111000 Q ss_pred ceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeeeec Q lcl|Aclame:pro 159 FHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIAG 238 (332) Q Consensus 159 ~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i~G 238 (332) .... ...........++.|.++..+|.....+.. .++++|..|..|...+|.. .+ +... ....|. .+.++| T Consensus 240 ~~~~-~~~~~~~~~~~~d~i~~~~~~l~~~~~~~~--~~~~~~~~~~~l~~lkd~~--G~-~l~~--~~~~~~-~~~l~G 310 (385) T protein:vir:19 240 ATAY-DTSLNATGDTRADIIAHAIYQVTESEFSAS--GIVLNPRDWHNIALLKDNE--GR-YIFG--GPQAFT-SNIMWG 310 (385) T ss_pred cccc-cccccccccchHHHHHHHHHhhccccCCCC--EEEEcHHHHHHHHHhhcCC--Cc-eecc--CcccCC-Cceecc Confidence 0001 011112233457888899888877765433 5688999999987654421 11 1111 122333 568999 Q ss_pred eEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHH--HHHHHHHh Q lcl|Aclame:pro 239 IRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGD--LIVGKLAM 316 (332) Q Consensus 239 ~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d--~i~~~~~~ 316 (332) .+|+.|+.+|... .+-++|+... +++ ...+++++......+ .|.-+ .+++.+++ T Consensus 311 ~pV~~~~~~p~~~--------------~~~gd~~~~~-~~~--------~~~~~~v~~~~~~~~-~~~~~~~~~~~~~r~ 366 (385) T protein:vir:19 311 LPVVPTKAQAAGT--------------FTVGGFDMAS-QVW--------DRMDATVEVSREDRD-NFVKNMLTILCEERL 366 (385) T ss_pred eeeEEcCcCCCCc--------------EEEeecccEE-EEE--------EecceEEEEeccccc-hhhcCcEEEEEEEee Confidence 9999999999421 1223333321 122 223445554432211 12222 23556678 Q ss_pred CCceechhheeeeecC Q lcl|Aclame:pro 317 GCGSLRTSVAGSFQAA 332 (332) Q Consensus 317 G~~vlrpe~~v~i~~A 332 (332) |.++++|++.+.+.-+ T Consensus 367 ~~~v~~~~a~~~~~~~ 382 (385) T protein:vir:19 367 ALAHYRPTAIIKGTFS 382 (385) T ss_pred ccEEecccceEEEEec Confidence 9999999999877766 No 78 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=99.31 E-value=1.2e-13 Score=91.46 Aligned_cols=284 Identities=14% Similarity=0.056 Sum_probs=164.6 Q ss_pred CC--------------CcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEec Q lcl|Aclame:pro 1 MT--------------TLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMF 66 (332) Q Consensus 1 m~--------------~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~ 66 (332) |= +......-+..+... .++++ .+.=+.|..++++..+..|.++++++.-++. |.+++||+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~-~~~~~---~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~-~~~~~~p~ 75 (324) T protein:vir:78 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMM-HEKKD---GTLMNEFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTF 75 (324) T ss_pred CCcchhhhHHHHHHHHHhhhhhhhccccccc-cCcCc---cccchhHHHHHHHHHHhhchhhhhcceeecc-CCceEEEE Confidence 11 111111101001111 11111 2555889999999999999999998776654 55688887 Q ss_pred c-cceeeeeecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 67 T-GKLSAGYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKA 145 (332) Q Consensus 67 i-G~~t~~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~a 145 (332) . +.+.+.-+..|..+... +++.+++++..-+. ..-..|.+-=-.++..|+.+.+.++.++++++.+|+.++.-- T Consensus 76 ~~~~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~-~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~--- 150 (324) T protein:vir:78 76 WADKPGAYWVGEGQKIETS-KATWVNATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ--- 150 (324) T ss_pred EecCcceeEecCCcccccc-ccceeEEEEeeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccC--- Confidence 6 56677777788887753 57777887777664 344456552222456789999999999999999999886321 Q ss_pred hhhccccccccccceecc-ccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhcccccccc Q lcl|Aclame:pro 146 SAEASPVTGEPGGFHVNI-GAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQ 224 (332) Q Consensus 146 a~~~~~~~~~~~~~~i~~-~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~ 224 (332) .... .+.+..... .......+...++.|.++...|..++.... .++++|..|..|.+.+|.. + . T Consensus 151 -g~~~----~~~gi~~~~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~--~~vmn~~~~~~L~~l~d~~-------G-~ 215 (324) T protein:vir:78 151 -GNNP----FGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEAN--AFISKTQNRSLLRKIVDPE-------T-K 215 (324) T ss_pred -CCCC----cCccccccccccceeccccccHHHHHHHHHhhhhccCCCC--EEEEcHHHHHHHHHhhccC-------C-C Confidence 1111 111111000 011111222347888888888888776433 5689999999987654432 1 1 Q ss_pred ccccccceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccc-- Q lcl|Aclame:pro 225 GDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFN-- 302 (332) Q Consensus 225 ~~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~-- 302 (332) ..+..+ ....++|.+|+.++..+...+ ..+.++|+... + +...+++++..++-.. T Consensus 216 ~~~~~~-~~~~l~G~PV~~~~~~~~~~~------------~~~~gd~~~~~-~---------g~~~~~~i~~~~~~~~~~ 272 (324) T protein:vir:78 216 ERIYDR-NSDSLDGLPVVNLKSSNLKRG------------ELITGDFDKLI-Y---------GIPQLIEYKIDETAQLST 272 (324) T ss_pred eeecCC-CCCcccceeeEeeCCCCCCcc------------eEEEEecceEE-E---------EEecCcEEEEeecccccc Confidence 122223 356899999998876553211 22344554422 1 1223344544432110 Q ss_pred -----h----hHHHH--HHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 303 -----V----QYQGD--LIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 303 -----~----~~~~d--~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) . .|+-| .+++.+.+|.+++||++.+.|..| T Consensus 273 ~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a 313 (324) T protein:vir:78 273 VKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPA 313 (324) T ss_pred cccccccchhhhhcCcEEEEEEEEEccEEecccceEEEecc Confidence 0 01112 234556789999999999999988 No 79 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=99.31 E-value=1.2e-13 Score=91.46 Aligned_cols=284 Identities=14% Similarity=0.056 Sum_probs=164.6 Q ss_pred CC--------------CcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEec Q lcl|Aclame:pro 1 MT--------------TLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMF 66 (332) Q Consensus 1 m~--------------~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~ 66 (332) |= +......-+..+... .++++ .+.=+.|..++++..+..|.++++++.-++. |.+++||+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~-~~~~~---~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~-~~~~~~p~ 75 (324) T protein:vir:96 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMM-HEKKD---GTLMNEFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTF 75 (324) T ss_pred CCcchhhhHHHHHHHHHhhhhhhhccccccc-cCcCc---cccchhHHHHHHHHHHhhchhhhhcceeecc-CCceEEEE Confidence 11 111111101001111 11111 2555889999999999999999998776654 55688887 Q ss_pred c-cceeeeeecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 67 T-GKLSAGYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKA 145 (332) Q Consensus 67 i-G~~t~~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~a 145 (332) . +.+.+.-+..|..+... +++.+++++..-+. ..-..|.+-=-.++..|+.+.+.++.++++++.+|+.++.-- T Consensus 76 ~~~~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~-~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~--- 150 (324) T protein:vir:96 76 WADKPGAYWVGEGQKIETS-KATWVNATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ--- 150 (324) T ss_pred EecCcceeEecCCcccccc-ccceeEEEEeeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccC--- Confidence 6 56677777788887753 57777887777664 344456552222456789999999999999999999886321 Q ss_pred hhhccccccccccceecc-ccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhcccccccc Q lcl|Aclame:pro 146 SAEASPVTGEPGGFHVNI-GAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQ 224 (332) Q Consensus 146 a~~~~~~~~~~~~~~i~~-~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~ 224 (332) .... .+.+..... .......+...++.|.++...|..++.... .++++|..|..|.+.+|.. + . T Consensus 151 -g~~~----~~~gi~~~~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~--~~vmn~~~~~~L~~l~d~~-------G-~ 215 (324) T protein:vir:96 151 -GNNP----FGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEAN--AFISKTQNRSLLRKIVDPE-------T-K 215 (324) T ss_pred -CCCC----cCccccccccccceeccccccHHHHHHHHHhhhhccCCCC--EEEEcHHHHHHHHHhhccC-------C-C Confidence 1111 111111000 011111222347888888888888776433 5689999999987654432 1 1 Q ss_pred ccccccceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccc-- Q lcl|Aclame:pro 225 GDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFN-- 302 (332) Q Consensus 225 ~~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~-- 302 (332) ..+..+ ....++|.+|+.++..+...+ ..+.++|+... + +...+++++..++-.. T Consensus 216 ~~~~~~-~~~~l~G~PV~~~~~~~~~~~------------~~~~gd~~~~~-~---------g~~~~~~i~~~~~~~~~~ 272 (324) T protein:vir:96 216 ERIYDR-NSDSLDGLPVVNLKSSNLKRG------------ELITGDFDKLI-Y---------GIPQLIEYKIDETAQLST 272 (324) T ss_pred eeecCC-CCCcccceeeEeeCCCCCCcc------------eEEEEecceEE-E---------EEecCcEEEEeecccccc Confidence 122223 356899999998876553211 22344554422 1 1223344544432110 Q ss_pred -----h----hHHHH--HHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 303 -----V----QYQGD--LIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 303 -----~----~~~~d--~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) . .|+-| .+++.+.+|.+++||++.+.|..| T Consensus 273 ~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a 313 (324) T protein:vir:96 273 VKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPA 313 (324) T ss_pred cccccccchhhhhcCcEEEEEEEEEccEEecccceEEEecc Confidence 0 01112 234556789999999999999988 No 80 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=99.31 E-value=9.8e-14 Score=91.89 Aligned_cols=284 Identities=12% Similarity=0.014 Sum_probs=168.3 Q ss_pred CC-CcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEeccc--ceeeeeecC Q lcl|Aclame:pro 1 MT-TLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTG--KLSAGYHTP 77 (332) Q Consensus 1 m~-~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG--~~t~~~~~~ 77 (332) +. ......+.+. .+...+++ .+..+.+...+.+..+..+.++++++..++. +.++++|+.. ..++..... T Consensus 102 ~~~~~~~~~~~~~---~~~~~~~g---~lip~~~~~~ii~~~~~~~~i~~~~~~~~~~-~~~~~~~~~~~~~~~a~~v~E 174 (390) T protein:vir:97 102 ATMNIKAALNTAS---TDAAGSAG---ALTTPNRLPGFITPPDARLTVRDLIGSGRTD-SALIEYVQETGFVNNAAIVAE 174 (390) T ss_pred hhhHHHHHHHhhh---cccccccc---cccchhhhHHHHHHHhhhhhhHhhcceeecc-CCceEEEEEecCCcceeeecC Confidence 00 0001111111 11111122 2677889999999999999999998877665 4467788763 356667777 Q ss_pred CCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 78 GTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPG 157 (332) Q Consensus 78 g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~ 157 (332) |+.+... +++..++++.+.+.- .-..|.+ +-.+...++.+.+.++.++++++.+|+.++.- ..++....|.-. T Consensus 175 g~~~~~~-~~~~~~i~~~~~k~~-~~~~is~-ell~ds~~l~~~i~~~la~a~~~~~d~a~l~G----~g~~~~p~Gi~~ 247 (390) T protein:vir:97 175 GALKPES-SLKFAKKTDTTHVIA-HTMKATR-QILSDAPQLASYMNNRLIRGLKVKEDAEILRG----TGANDGLLGLIP 247 (390) T ss_pred Ccccccc-ccceeEEEEeeeeEE-EeehhhH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhc----CCCCccccceee Confidence 8877643 567788888888753 3445655 22333467889999999999999999987631 111111111100 Q ss_pred cceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeeee Q lcl|Aclame:pro 158 GFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIA 237 (332) Q Consensus 158 ~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i~ 237 (332) ..... +......+...++.|.++...+.....+.. .+|++|..|..|.+-+|.. ..+.-.. ...+. .++++ T Consensus 248 ~~~~~-~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~--~~v~n~~~~~~L~~lkd~~---G~~l~~~--~~~~~-~~~l~ 318 (390) T protein:vir:97 248 QATTY-AAPTTIAGATRVDQLRLAMLQASLAEYPAS--GIVINPIDWAAIELAKDAN---NQYLIGN--ARGTL-TPTLW 318 (390) T ss_pred ccccc-cccccccccchHHHHHHHHHhhccccCCCC--EEEEcHHHHHHHHHhhcCC---CceeecC--ccCCC-Cceec Confidence 00000 011111233447888888889988888655 4578999999987544321 1111111 12232 46899 Q ss_pred ceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHH--HHHHHH Q lcl|Aclame:pro 238 GIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDL--IVGKLA 315 (332) Q Consensus 238 G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~--i~~~~~ 315 (332) |.+|+.|+.+|... .+-++|+... +++. ..+++++..++.. .+..+. +++... T Consensus 319 G~pV~~~~~~~~~~--------------~~~gd~~~~~-~~~~--------~~~~~i~~~~~~~--~f~~~~~~~r~~~r 373 (390) T protein:vir:97 319 GLPVVATQAMAPGE--------------FLVGAFDLAA-QIFD--------QWDARVEIGYVND--DFQRNMVTVLAEER 373 (390) T ss_pred ceeeEEcCCCCCCc--------------EEEEeccceE-EEEE--------ecceEEEEeeccc--ccccCcEEEEEEEe Confidence 99999999998421 1234443321 2232 3444566544321 122333 556678 Q ss_pred hCCceechhheeeeecC Q lcl|Aclame:pro 316 MGCGSLRTSVAGSFQAA 332 (332) Q Consensus 316 ~G~~vlrpe~~v~i~~A 332 (332) ||.++++|++.+.+.-| T Consensus 374 ~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 374 LALVVYRPEALITGSFA 390 (390) T ss_pred eccEEeccccEEEEEeC Confidence 99999999999999999 No 81 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=99.29 E-value=2.4e-13 Score=89.74 Aligned_cols=289 Identities=11% Similarity=0.015 Sum_probs=166.8 Q ss_pred CCCcccccccc--cccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc-c-ceeeeeec Q lcl|Aclame:pro 1 MTTLSNFSLPN--QANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-G-KLSAGYHT 76 (332) Q Consensus 1 m~~~~~~~r~~--~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i-G-~~t~~~~~ 76 (332) |.....-.+.. ..-..+..+.+. .+..+.|+.++.+..+..+.++++++..++.+ .++.+|+. + ..+...+. T Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~~g---~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~~-~~~~~~~~~~~~~~a~~v~ 173 (395) T protein:vir:43 98 TSSLRGSHRVSMPRSAITSIDGSGG---ALVAPDRRPGVVAAPQRRLTIRDLVAPGTTES-NSVEYVRETGFVNNAAPVS 173 (395) T ss_pred HHHhhhhhhhhhhhhhhcccCCCCc---cccchhhHHHHHHHHHhhhhHHhhccceecCC-CceEEEEEecCCCceeeec Confidence 11100000000 000011111111 36788999999999999999999998777654 46888875 3 45666667 Q ss_pred CCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|Aclame:pro 77 PGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEP 156 (332) Q Consensus 77 ~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~ 156 (332) .|+..... .++.+++++.+.+... -..|++ +-.+...++.+.+.++.++++++..|..|+.- ..++.+..|.. T Consensus 174 E~~~~~~~-~~~~~~i~~~~~k~~~-~~~is~-ell~d~~~l~~~v~~~la~a~~~~~d~~~l~G----~g~~~~~~Gi~ 246 (395) T protein:vir:43 174 EGTQKPYS-DLTFELENAPVRTIAH-LFKASR-QILDDASALQSYIDARARYGLMLVEECQLLYG----NGTGANLHGII 246 (395) T ss_pred CCcccccc-ccceeEEEEeeeeEEE-eehhhH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhc----cCCCCcccccc Confidence 77766543 5677888888887543 345654 23344456888889999999999999987631 11222222211 Q ss_pred ccc-eeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeee Q lcl|Aclame:pro 157 GGF-HVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYS 235 (332) Q Consensus 157 ~~~-~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~ 235 (332) ... ......+........++.|.++...+..++.+.. .+|++|..|..|...+|.. .+ +... ...++. .+. T Consensus 247 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~vmn~~~~~~l~~lkd~~--G~-~i~~--~~~~~~-~~~ 318 (395) T protein:vir:43 247 PQAQAYAPPSGVVVTAEQRIDRIRLAILQAQLAEFPAS--GIVLNPIDWALIELNKDAE--NR-YIIG--SPQNGT-TPT 318 (395) T ss_pred ccccccccccccccccchhHHHHHHHHHhhccccCCCc--EEEEcHHHHHHHHHhhccC--Cc-eecc--ccccCC-Cce Confidence 111 1111222223344568889899888888776533 5689999999886544321 11 2111 223443 568 Q ss_pred eeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHH--HHHHH Q lcl|Aclame:pro 236 IAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGD--LIVGK 313 (332) Q Consensus 236 i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d--~i~~~ 313 (332) ++|.+|+.++.+|... .+-++|+... +++-+ .+++++..+... ..+.-| .+++. T Consensus 319 l~G~pVv~~~~~~~~~--------------~~~gd~~~~~-~~~~~--------~~~~i~~~~~~~-~~f~~~~~~~r~~ 374 (395) T protein:vir:43 319 LWRLPVVETQAITQDE--------------FLTGAFSLGA-QIFDR--------MDIEVLVSTEND-KDFENNMVTIRAE 374 (395) T ss_pred ecceeeEEcCCCCCCc--------------EEEEeccceE-EEEEe--------cceEEEEecccc-chhhcCcEEEEEE Confidence 9999999999998421 1234444322 12221 233455443221 112222 34555 Q ss_pred HHhCCceechhheeeeecC Q lcl|Aclame:pro 314 LAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 314 ~~~G~~vlrpe~~v~i~~A 332 (332) .++|.++++|++.+.+.-+ T Consensus 375 ~r~d~~v~~~~a~~~~~~t 393 (395) T protein:vir:43 375 ERLAFAVYRPEAFVTGSLT 393 (395) T ss_pred EeeccEEecccceEEEEec Confidence 6889999999999877554 No 82 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=99.29 E-value=2.2e-13 Score=89.93 Aligned_cols=287 Identities=11% Similarity=-0.013 Sum_probs=163.6 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc-cceeeeeecCCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-GKLSAGYHTPGT 79 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i-G~~t~~~~~~g~ 79 (332) |..-+. .|..+.. +..+.+ .+..+++..++.+..++.+.++++++..+.. +.+.+||+. +.+.+.-...|. T Consensus 1 ~g~~~e-~~~~~~~--~t~~~~----g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~-~~~~~ip~~~~~~~a~wv~Eg~ 72 (397) T protein:vir:23 1 MGFSAD-HSQIAQT--KDTMFT----GYLDPVQAKDYFAEAEKTSIVQRVAQKIPMG-ATGIVIPHWTGDVSAQWIGEGD 72 (397) T ss_pred CCcCHH-HHHHhhc--cCCCCc----cccchhHHHHHHHHHHhccchhhhcceeecc-CCceEEEEEcCCcceEEecCCc Confidence 777333 2222211 111111 1456777888889989999999988766654 456888876 456666667777 Q ss_pred CCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc Q lcl|Aclame:pro 80 PIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGGF 159 (332) Q Consensus 80 ~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~~~ 159 (332) .+... +++..++++.+-+. ..-..|.+-=-.++.+|+.+.+.++.++++++.+|+.++.-- .+..+..+... T Consensus 73 ~~~~s-~~~f~~v~l~~~k~-~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~----gt~~~~~~~~~-- 144 (397) T protein:vir:23 73 MKPIT-KGNMTKRDVHPAKI-ATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAALHGT----NAPSAFQGYLD-- 144 (397) T ss_pred ccccc-ccceeEEEEeeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcc----cCCcccccccc-- Confidence 76643 56777777777653 333556552223456899999999999999999999887311 11111111000 Q ss_pred eeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccc-cc--cccccccccceeeee Q lcl|Aclame:pro 160 HVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNRE-IG--NSQGDMNSGKGLYSI 236 (332) Q Consensus 160 ~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d-~~--~~~~~~~~g~~v~~i 236 (332) .............++.+.++...|.++..+. -.++++|..|..|.+.+|.+ .+. |. ...+....+ ..+++ T Consensus 145 --~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~--a~~vmn~~~~~~L~~lkd~~--G~~i~~~~~~~~~~~~~-~~~tl 217 (397) T protein:vir:23 145 --QSNKTQSISPNAYQGLGVSGLTKLVTDGKKW--THTLLDDTVEPVLNGSVDAN--GRPLFVESTYESLTTPF-REGRI 217 (397) T ss_pred --cccceeeecccchhHHHHHHHHhhhhcccCC--CEEEEcHHHHHHHHHhhccC--Cceeecccccccccccc-cCcee Confidence 0001111122233566677777777776543 34678999999998755432 110 10 111111111 24589 Q ss_pred eceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeeccc--------c---hhH Q lcl|Aclame:pro 237 AGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDF--------N---VQY 305 (332) Q Consensus 237 ~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~--------~---~~~ 305 (332) +|++|+.++++|... ...+.++|++.. +..+. ++.++..++-. . -.| T Consensus 218 ~G~Pv~~s~~~~~g~------------~~~~~gDfs~~~--i~~~~--------~i~i~~~~e~~~~~~~~~~~~~~~lf 275 (397) T protein:vir:23 218 LGRPTILSDHVAEGD------------VVGYAGDFSQII--WGQVG--------GLSFDVTDQATLNLGSQESPNFVSLW 275 (397) T ss_pred eeeeEEEeCCCCCCc------------eEEEEeecceEE--EEEEe--------ceEEEEeeeeeeeeccccccceeeee Confidence 999999999998421 112345555432 22221 22233222110 0 002 Q ss_pred HHH--HHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 306 QGD--LIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 306 ~~d--~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) +-| .++...+++.++++|++.+.+..+ T Consensus 276 ~~d~v~~ra~~r~d~~v~~~~a~~~~~~~ 304 (397) T protein:vir:23 276 QHNLVAVRVEAEYGLLINDVNAFVKLTFD 304 (397) T ss_pred eccceeEEEEeeeccceecccceEEEeec Confidence 222 345667899999999999888876 No 83 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=99.29 E-value=1.7e-13 Score=90.60 Aligned_cols=286 Identities=14% Similarity=0.068 Sum_probs=163.3 Q ss_pred CCCccc-----------cccccccccccc--ccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc Q lcl|Aclame:pro 1 MTTLSN-----------FSLPNQANGGAR--NADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT 67 (332) Q Consensus 1 m~~~~~-----------~~r~~~~~~~~~--~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i 67 (332) |=.+.. +-+..-.+.... ..+.+ .+.-+.+..++++..+..+.++++++..+.. +.+++||+. T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~---~lip~~~~~~ii~~~~~~s~l~~l~~~~~~~-~~~~~~p~~ 76 (324) T protein:vir:96 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKD---GTLLNDFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFW 76 (324) T ss_pred CCcchhhhHHHHHHHHhhhhhhhcccccccccCCCc---ceechhHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEE Confidence 111111 001100000000 01111 2566889999999999999999998877755 456889976 Q ss_pred -cceeeeeecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 68 -GKLSAGYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKAS 146 (332) Q Consensus 68 -G~~t~~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa 146 (332) +.+.+.-+..|..+.. .+++..++++..-+. ..-..|.+-=-.++..++.+.+.++.++++++.+|+.++.-- + T Consensus 77 ~~~~~a~~v~Eg~~~~~-~~~~f~~v~~~~~k~-~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~~~l~G~--g- 151 (324) T protein:vir:96 77 ADKPGAYWVGEGQKIET-SKATWVNATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ--G- 151 (324) T ss_pred ecCcceeeecCCccccc-cccceeEEEEEeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcC--C- Confidence 5566666777877764 357777777777664 333556552222356889999999999999999999887321 1 Q ss_pred hhccccccccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhcccccccccc Q lcl|Aclame:pro 147 AEASPVTGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGD 226 (332) Q Consensus 147 ~~~~~~~~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~ 226 (332) ......+.. ..+..+ .....+...++.|.++..++..++.... .++++|..|..|...+|.. .... T Consensus 152 -~~~~~~~~~--~~~~~~-~~~~~~~~~~~~i~~~~~~i~~~~~~~~--~~i~n~~~~~~L~~lkd~~--------G~~~ 217 (324) T protein:vir:96 152 -NNPFGKSIA--QSIKKT-NKVIKGDFTQDNIIDLEALLEDDELEAN--AFISKTQNRSLLRKIVDPE--------TKER 217 (324) T ss_pred -CCCcCcccc--cccccc-ceecccccchHHHHHHHHhhhhccCCCC--EEEEcHHHHHHHHHhhCCC--------CCee Confidence 111000000 000000 1111122347888888888887766433 4689999999887654432 1111 Q ss_pred ccccceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeeccc----- Q lcl|Aclame:pro 227 MNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDF----- 301 (332) Q Consensus 227 ~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~----- 301 (332) +..+ ..+.++|++|+.++..+...+ ..+.+++++.. + +...+++++..++-. T Consensus 218 ~~~~-~~~~l~G~PV~~~~~~~~~~~------------~~~~gd~s~~~--~--------~~~~~~~i~~~~~~~~~~~~ 274 (324) T protein:vir:96 218 IYDR-NSDSLDGLPVVNLKSSNLKRG------------ELITGDFDKLI--Y--------GIPQLIEYKIDETAQLSTVK 274 (324) T ss_pred ecCC-CCCcccceeeEeecCCCCCcc------------eEEEEecceEE--E--------EEecCcEEEEeecccccccc Confidence 2222 356789999998876653221 12334444321 1 112233444433211 Q ss_pred ---ch---hHHHHH--HHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 302 ---NV---QYQGDL--IVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 302 ---~~---~~~~d~--i~~~~~~G~~vlrpe~~v~i~~A 332 (332) .. .|+.|. +++.+++|.+++||++.+.|..| T Consensus 275 ~~~~~~~~~~~~n~v~~r~~~r~d~~v~~~~a~~~l~~a 313 (324) T protein:vir:96 275 NEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPA 313 (324) T ss_pred cccccchhhhhcCcEEEEEEEEeccEEecccceEEEecc Confidence 00 012222 45667889999999999999988 No 84 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=99.28 E-value=5e-13 Score=88.02 Aligned_cols=295 Identities=11% Similarity=-0.002 Sum_probs=159.7 Q ss_pred CCCcccc--cccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc-cceeeeeecC Q lcl|Aclame:pro 1 MTTLSNF--SLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-GKLSAGYHTP 77 (332) Q Consensus 1 m~~~~~~--~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i-G~~t~~~~~~ 77 (332) |+--... .+-..... ++...+ .+.-+.+..+|.+..++.+.++++.+..+.. +.+.+||+. +.+.+.-... T Consensus 1 ~~~~~~~~~~~~~~~~t-~~~~~~----~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~~p~~~~~~~a~~v~E 74 (320) T protein:vir:10 1 MAAGTAFQVDHAQIAQT-GDTMFK----GYLEPEQAKDYFAEAEKTSIVQQFAQKVPMG-TTGQKIPHWIGDVSAQWIGE 74 (320) T ss_pred CCCCccCCHHHHHhhcc-cccccc----ccccHHHHHHHHHHHHhccchhhhcceeecc-CCceEEEEEeCCcceEEecC Confidence 5442221 11111111 111111 1566889999999999999999998766654 456788876 5667777778 Q ss_pred CCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 78 GTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPG 157 (332) Q Consensus 78 g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~ 157 (332) |..+... +++.+++++..-+. ..-+.|.+-=-.++..|+.+.+.+++++++++.+|+.++.-- +........+... T Consensus 75 ~~~~~~~-~~~f~~v~~~~~k~-~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~--g~~~~~~~~~~~~ 150 (320) T protein:vir:10 75 GDMKPIT-KGNMTSQNIAPHKI-ATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALNGT--DSPFPTYLAQTTK 150 (320) T ss_pred Ccccccc-ccceeEEEEeeEEE-EEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhccc--CCCCCcccccccc Confidence 8877653 56777777777663 333556552222457889999999999999999999886311 1000000000000 Q ss_pred c-ceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCch--hhccc-ccccccccccccee Q lcl|Aclame:pro 158 G-FHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTN--ILNRE-IGNSQGDMNSGKGL 233 (332) Q Consensus 158 ~-~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~--~~~~d-~~~~~~~~~~g~~v 233 (332) + .....+.....+-...-+.+.++...+.....+ .-+++++|..|..|.+-+|.. .+-.+ .......... - T Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~---~ 225 (320) T protein:vir:10 151 SVSLADPGGATASDLTAYDAVAVNGLSLLVNAKKK--WTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDENSPFR---A 225 (320) T ss_pred cccceecccccccccccHHHHHHHHHhhhhcccCC--CcEEEEcHHHHHHHHHhhccCCceeeccccccCcccccc---C Confidence 0 000101111111111223455566666655543 346688999999997654431 11110 0111111111 2 Q ss_pred eeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccch---------- Q lcl|Aclame:pro 234 YSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNV---------- 303 (332) Q Consensus 234 ~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~---------- 303 (332) ++++|++|+.++++|... . ..+-++|++.. + +...+++++..++-.-+ T Consensus 226 ~~i~g~pv~~~~~~~~~~--~----------~~~~gd~~~~~--~--------~~~~~~~i~~~~~~~~~~~~~~~~~~~ 283 (320) T protein:vir:10 226 GRIVSRPTILSDHVADGT--T----------VGYMGDFRNVI--W--------GQVGGLSFDVTDQATLNLGTPTEPNFV 283 (320) T ss_pred ceeeeeeeEecCCCCCCc--e----------EEEEeecceEE--E--------EEecCeEEEEeecceeeeccccccccc Confidence 478999999999998421 0 11234444322 1 12233444443321100 Q ss_pred -hHHHH--HHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 304 -QYQGD--LIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 304 -~~~~d--~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) .|+-| .+++.+.+|.+++||++.+.|..+ T Consensus 284 ~~f~~~~~~~r~~~~~d~~v~~~~a~~~l~~~ 315 (320) T protein:vir:10 284 SLWQHNLVAVRVEAEYAFHNNDKDAFVKLTNV 315 (320) T ss_pred hhhhcCcEEEEEEEeeccEEecccceEEEEec Confidence 01112 245667889999999999888855 No 85 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=99.27 E-value=3.3e-13 Score=88.99 Aligned_cols=281 Identities=13% Similarity=0.052 Sum_probs=164.0 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc-cceeeeeecCCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-GKLSAGYHTPGT 79 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i-G~~t~~~~~~g~ 79 (332) |-....+.-.+ -...+ +. -.+.-+.|..++++..++.+.++++.++-+.. +.+++||+. +.+.+.-...|. T Consensus 18 ~~~~~~~~a~~---~~~~~-~~---~~lip~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~~p~~~~~~~a~~v~Eg~ 89 (324) T protein:vir:99 18 NVKPQVFNPDN---VMMHE-KK---DGTLLNDFTTPILQEVMENSKIMRLGKYEPME-GTEKKFTFWADKPGAYWVGEGQ 89 (324) T ss_pred hhhhhhccccc---eeccC-CC---cceechhHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEecCcceeEeccCc Confidence 21111111110 00010 11 12566889999999999999999998877755 456889876 566777777888 Q ss_pred CCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc Q lcl|Aclame:pro 80 PIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGGF 159 (332) Q Consensus 80 ~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~~~ 159 (332) .+.. .+++..++++..-+. ..-..|.+-=-.++..++.+.+.++.++++++..|+.++.-- + .+. .+.+. T Consensus 90 ~~~~-~~~~~~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G~--g--~~~----~~~~~ 159 (324) T protein:vir:99 90 KIET-SKATWVNATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ--G--NNP----FGKSI 159 (324) T ss_pred cccc-cccceeEEEEeeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcC--C--CCc----cCccc Confidence 7764 356777777777664 333456552222346789999999999999999999887311 1 110 11111 Q ss_pred eeccc-cccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeeeec Q lcl|Aclame:pro 160 HVNIG-AGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIAG 238 (332) Q Consensus 160 ~i~~~-~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i~G 238 (332) ..... ......+...++.|.++...|..++.... .++++|..|..|.+.+|.. ....+..+ .-+.++| T Consensus 160 ~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~--~~v~n~~~~~~L~~l~d~~--------g~~~~~~~-~~~~l~G 228 (324) T protein:vir:99 160 AQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEAN--AFISKTQNRSLLRKIVDPE--------TKERIYDR-NSDTLDG 228 (324) T ss_pred cccccccceeccccCCHHHHHHHHHhhhhccCCCC--EEEEcHHHHHHHHHhhcCC--------CceeecCC-CCccccc Confidence 00000 11111122347888889888988776433 4678999999887644421 11222222 2457899 Q ss_pred eEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccch-----------hHHH Q lcl|Aclame:pro 239 IRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNV-----------QYQG 307 (332) Q Consensus 239 ~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~-----------~~~~ 307 (332) .+|+.++.++...+ .-+.++|+... + +...+++++..++-.-. .|+. T Consensus 229 ~PVv~~~~~~~~~~------------~~i~gd~~~~~-~---------~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~ 286 (324) T protein:vir:99 229 LPVVNLKSSNLKRG------------ELITGDFDKLI-Y---------GIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQ 286 (324) T ss_pred eeEEeecCCCCCcc------------eEEEEecccEE-E---------EEecCcEEEEeecccccccccccccchhhhhc Confidence 99999887764221 12334444422 1 12233445544331100 0111 Q ss_pred H--HHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 308 D--LIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 308 d--~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) | .+++.+.+|.+++||++.+.|.-| T Consensus 287 ~~~~~r~~~r~d~~v~~~~a~~~lt~a 313 (324) T protein:vir:99 287 DMVALRATMHVALHIADDKAFAKLVPA 313 (324) T ss_pred CcEEEEEEEEEccEEecccceEEEEec Confidence 2 234457789999999999999888 No 86 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=99.25 E-value=6.1e-13 Score=87.57 Aligned_cols=291 Identities=11% Similarity=-0.018 Sum_probs=164.1 Q ss_pred CCCccccccccccccc-ccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccc-eEEEec-ccceeeeeecC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGG-ARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGK-SKQFMF-TGKLSAGYHTP 77 (332) Q Consensus 1 m~~~~~~~r~~~~~~~-~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~-tv~i~~-iG~~t~~~~~~ 77 (332) +.+ .+........+ ...+++. .+.=+.|..++++..+..+.++++++..++.++. ++.++. .+......... T Consensus 109 ~~~--~~~~~~~~~~~~~~~~~gg---~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E 183 (415) T protein:vir:98 109 FTE--YLETRNDIQGGSLKTDSGF---VVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEE 183 (415) T ss_pred HHH--HHhhhhhhhhccccccccc---cccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeecc Confidence 000 00000000000 0001111 2455789999999999999999999887776543 444443 45566666677 Q ss_pred CCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 78 GTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPG 157 (332) Q Consensus 78 g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~ 157 (332) |..+.....++.+++++.+.+.- .-+.|.+-=-.++.+|+.+.+.++.++++++..|+.|+.-...+... +...... T Consensus 184 ~~~~~~~~~~~~~~v~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~--~~~~~~~ 260 (415) T protein:vir:98 184 LEENPELAVKPFFQLAYDINTHR-GYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTG--STSSGFE 260 (415) T ss_pred ccccCcccccceeeEEeeeeeeE-eeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccc--ccccccc Confidence 77665333345667777777653 33456543223467889999999999999999999887543221111 1100000 Q ss_pred cceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeeee Q lcl|Aclame:pro 158 GFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIA 237 (332) Q Consensus 158 ~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i~ 237 (332) ..+...+.++...|+.|.++..++........ .+|++|..|..|...+|.. ++ +... ..+..| ..++++ T Consensus 261 ----~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~v~n~~~~~~l~~lkd~~--G~-~l~~-~~~~~~-~~~~l~ 329 (415) T protein:vir:98 261 ----KEGKKLEVKKAKSLDDIKDAINLNVKPNYEHN--VAIVSQTMFAKLDKMKDKL--GN-YLIQ-PDVKEK-TQQRLL 329 (415) T ss_pred ----ccccccccccccchhHHHHHHHhhhhhccCCC--EEEEcHHHHHHHHHhhccC--Cc-eeec-cCcCCC-CCceec Confidence 00111112222347888888888877776432 4578999999987544432 11 2111 123334 256899 Q ss_pred ceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHHhC Q lcl|Aclame:pro 238 GIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAMG 317 (332) Q Consensus 238 G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~~G 317 (332) |++|+.++++|..+. |...-+-++|++..- + ....+++++..+.. .+...+++.++++ T Consensus 330 G~pV~~~~~~~~~~~---------~~~~~~~Gd~~~~~~-~--------~~~~~~~v~~~~~~----~~~~~~~~~~r~d 387 (415) T protein:vir:98 330 GAKIEILPDEVLGQK---------GNNTLIIGNLKDAIV-L--------FDRSQYQASWTDYM----HFGECLMIAVRQD 387 (415) T ss_pred ceeeEEecccccCCC---------CccEEEEEehhccEE-E--------EeecceEEEEeccc----cCceEEEEEEEec Confidence 999999999985322 111223444443221 1 22234455543321 2334567788999 Q ss_pred CceechhheeeeecC Q lcl|Aclame:pro 318 CGSLRTSVAGSFQAA 332 (332) Q Consensus 318 ~~vlrpe~~v~i~~A 332 (332) .++++|++.+.+.-. T Consensus 388 ~~v~~~~a~~~~~~~ 402 (415) T protein:vir:98 388 CRILDYKSAIVIEYD 402 (415) T ss_pred cEEeccccEEEEEEe Confidence 999999998877544 No 87 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=99.25 E-value=6.1e-13 Score=87.57 Aligned_cols=291 Identities=11% Similarity=-0.018 Sum_probs=164.1 Q ss_pred CCCccccccccccccc-ccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccc-eEEEec-ccceeeeeecC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGG-ARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGK-SKQFMF-TGKLSAGYHTP 77 (332) Q Consensus 1 m~~~~~~~r~~~~~~~-~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~-tv~i~~-iG~~t~~~~~~ 77 (332) +.+ .+........+ ...+++. .+.=+.|..++++..+..+.++++++..++.++. ++.++. .+......... T Consensus 109 ~~~--~~~~~~~~~~~~~~~~~gg---~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E 183 (415) T protein:vir:79 109 FTE--YLETRNDIQGGSLKTDSGF---VVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEE 183 (415) T ss_pred HHH--HHhhhhhhhhccccccccc---cccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeecc Confidence 000 00000000000 0001111 2455789999999999999999999887776543 444443 45566666677 Q ss_pred CCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 78 GTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPG 157 (332) Q Consensus 78 g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~ 157 (332) |..+.....++.+++++.+.+.- .-+.|.+-=-.++.+|+.+.+.++.++++++..|+.|+.-...+... +...... T Consensus 184 ~~~~~~~~~~~~~~v~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~--~~~~~~~ 260 (415) T protein:vir:79 184 LEENPELAVKPFFQLAYDINTHR-GYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTG--STSSGFE 260 (415) T ss_pred ccccCcccccceeeEEeeeeeeE-eeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccc--ccccccc Confidence 77665333345667777777653 33456543223467889999999999999999999887543221111 1100000 Q ss_pred cceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeeee Q lcl|Aclame:pro 158 GFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIA 237 (332) Q Consensus 158 ~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i~ 237 (332) ..+...+.++...|+.|.++..++........ .+|++|..|..|...+|.. ++ +... ..+..| ..++++ T Consensus 261 ----~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~v~n~~~~~~l~~lkd~~--G~-~l~~-~~~~~~-~~~~l~ 329 (415) T protein:vir:79 261 ----KEGKKLEVKKAKSLDDIKDAINLNVKPNYEHN--VAIVSQTMFAKLDKMKDKL--GN-YLIQ-PDVKEK-TQQRLL 329 (415) T ss_pred ----ccccccccccccchhHHHHHHHhhhhhccCCC--EEEEcHHHHHHHHHhhccC--Cc-eeec-cCcCCC-CCceec Confidence 00111112222347888888888877776432 4578999999987544432 11 2111 123334 256899 Q ss_pred ceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHHhC Q lcl|Aclame:pro 238 GIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAMG 317 (332) Q Consensus 238 G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~~G 317 (332) |++|+.++++|..+. |...-+-++|++..- + ....+++++..+.. .+...+++.++++ T Consensus 330 G~pV~~~~~~~~~~~---------~~~~~~~Gd~~~~~~-~--------~~~~~~~v~~~~~~----~~~~~~~~~~r~d 387 (415) T protein:vir:79 330 GAKIEILPDEVLGQK---------GNNTLIIGNLKDAIV-L--------FDRSQYQASWTDYM----HFGECLMIAVRQD 387 (415) T ss_pred ceeeEEecccccCCC---------CccEEEEEehhccEE-E--------EeecceEEEEeccc----cCceEEEEEEEec Confidence 999999999985322 111223444443221 1 22234455543321 2334567788999 Q ss_pred CceechhheeeeecC Q lcl|Aclame:pro 318 CGSLRTSVAGSFQAA 332 (332) Q Consensus 318 ~~vlrpe~~v~i~~A 332 (332) .++++|++.+.+.-. T Consensus 388 ~~v~~~~a~~~~~~~ 402 (415) T protein:vir:79 388 CRILDYKSAIVIEYD 402 (415) T ss_pred cEEeccccEEEEEEe Confidence 999999998877544 No 88 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=99.25 E-value=6.1e-13 Score=87.57 Aligned_cols=291 Identities=11% Similarity=-0.018 Sum_probs=164.1 Q ss_pred CCCccccccccccccc-ccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccc-eEEEec-ccceeeeeecC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGG-ARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGK-SKQFMF-TGKLSAGYHTP 77 (332) Q Consensus 1 m~~~~~~~r~~~~~~~-~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~-tv~i~~-iG~~t~~~~~~ 77 (332) +.+ .+........+ ...+++. .+.=+.|..++++..+..+.++++++..++.++. ++.++. .+......... T Consensus 109 ~~~--~~~~~~~~~~~~~~~~~gg---~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E 183 (415) T protein:vir:81 109 FTE--YLETRNDIQGGSLKTDSGF---VVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEE 183 (415) T ss_pred HHH--HHhhhhhhhhccccccccc---cccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeecc Confidence 000 00000000000 0001111 2455789999999999999999999887776543 444443 45566666677 Q ss_pred CCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 78 GTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPG 157 (332) Q Consensus 78 g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~ 157 (332) |..+.....++.+++++.+.+.- .-+.|.+-=-.++.+|+.+.+.++.++++++..|+.|+.-...+... +...... T Consensus 184 ~~~~~~~~~~~~~~v~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~--~~~~~~~ 260 (415) T protein:vir:81 184 LEENPELAVKPFFQLAYDINTHR-GYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTG--STSSGFE 260 (415) T ss_pred ccccCcccccceeeEEeeeeeeE-eeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccc--ccccccc Confidence 77665333345667777777653 33456543223467889999999999999999999887543221111 1100000 Q ss_pred cceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeeee Q lcl|Aclame:pro 158 GFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIA 237 (332) Q Consensus 158 ~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i~ 237 (332) ..+...+.++...|+.|.++..++........ .+|++|..|..|...+|.. ++ +... ..+..| ..++++ T Consensus 261 ----~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~v~n~~~~~~l~~lkd~~--G~-~l~~-~~~~~~-~~~~l~ 329 (415) T protein:vir:81 261 ----KEGKKLEVKKAKSLDDIKDAINLNVKPNYEHN--VAIVSQTMFAKLDKMKDKL--GN-YLIQ-PDVKEK-TQQRLL 329 (415) T ss_pred ----ccccccccccccchhHHHHHHHhhhhhccCCC--EEEEcHHHHHHHHHhhccC--Cc-eeec-cCcCCC-CCceec Confidence 00111112222347888888888877776432 4578999999987544432 11 2111 123334 256899 Q ss_pred ceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHHhC Q lcl|Aclame:pro 238 GIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAMG 317 (332) Q Consensus 238 G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~~G 317 (332) |++|+.++++|..+. |...-+-++|++..- + ....+++++..+.. .+...+++.++++ T Consensus 330 G~pV~~~~~~~~~~~---------~~~~~~~Gd~~~~~~-~--------~~~~~~~v~~~~~~----~~~~~~~~~~r~d 387 (415) T protein:vir:81 330 GAKIEILPDEVLGQK---------GNNTLIIGNLKDAIV-L--------FDRSQYQASWTDYM----HFGECLMIAVRQD 387 (415) T ss_pred ceeeEEecccccCCC---------CccEEEEEehhccEE-E--------EeecceEEEEeccc----cCceEEEEEEEec Confidence 999999999985322 111223444443221 1 22234455543321 2334567788999 Q ss_pred CceechhheeeeecC Q lcl|Aclame:pro 318 CGSLRTSVAGSFQAA 332 (332) Q Consensus 318 ~~vlrpe~~v~i~~A 332 (332) .++++|++.+.+.-. T Consensus 388 ~~v~~~~a~~~~~~~ 402 (415) T protein:vir:81 388 CRILDYKSAIVIEYD 402 (415) T ss_pred cEEeccccEEEEEEe Confidence 999999998877544 No 89 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=99.25 E-value=3.5e-13 Score=88.86 Aligned_cols=284 Identities=12% Similarity=0.002 Sum_probs=164.7 Q ss_pred CCC--cccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEeccc--ceeeeeec Q lcl|Aclame:pro 1 MTT--LSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTG--KLSAGYHT 76 (332) Q Consensus 1 m~~--~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG--~~t~~~~~ 76 (332) +.. .....+.. ..+..+.++ .+..++|...+.+.....+.++++++..+.. +.++++|+.. ..++.-.. T Consensus 101 ~~~~~~~~~~~~~---~~~~~~~~g---~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~a~~v~ 173 (390) T protein:vir:81 101 RATMNIKAALNTA---STDAAGSAG---ALTTPNRLPGFITPPDARLTVRDLIGSGRTD-SALIEYVQETGFVNNAAIVA 173 (390) T ss_pred hhhhHHHHHHHhh---ccccccCCc---ceechhhhHHHHHHHhhhhhhhhhcceeecc-CCceEEEEEecCCcceeeec Confidence 000 00000000 111111111 2677889999999999999999998766654 4567888763 34566667 Q ss_pred CCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|Aclame:pro 77 PGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEP 156 (332) Q Consensus 77 ~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~ 156 (332) .|..+... +++.+++++.+.+.- .-..|.+ +-.+...++.+.+.++.++++++..|+.|+.- ..++....|.- T Consensus 174 Eg~~~~~~-~~~~~~i~~~~~k~~-~~~~is~-ell~d~~~~~~~i~~~l~~~~~~~~d~a~l~G----~g~~~~~~Gi~ 246 (390) T protein:vir:81 174 EGALKPES-SLKFAKKTDTTHVIA-HTMKATR-QILSDAPQLASYMNNRLIRGLKVKEDAEILRG----TGANDGLLGLI 246 (390) T ss_pred CCcccccc-cceeeEEEEeeeEEE-EeehhhH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhc----CCCCCccccee Confidence 77776543 567778888877653 3345654 23334467888899999999999999987631 11111111110 Q ss_pred ccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeee Q lcl|Aclame:pro 157 GGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSI 236 (332) Q Consensus 157 ~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i 236 (332) ...... +..........++.|.++..++...+.+.. .+|++|..|..|...+|.. .+ +.-.. ...+. ...+ T Consensus 247 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~v~~~~~~~~l~~lkd~~--G~-~l~~~--~~~~~-~~~l 317 (390) T protein:vir:81 247 PQATTY-AAPTTIAGATRVDQLRLAMLQASLAEYNPS--GIVINPIDWAAIELAKDAN--NQ-YLIGN--ARGTL-TPTL 317 (390) T ss_pred eccccc-ccccccccchhHHHHHHHHHhhccccCCCC--EEEEcHHHHHHHHHhhcCC--Cc-eeecC--ccccc-Ccee Confidence 000000 111111223347888888888888887544 4578999999887654432 11 11111 12222 4689 Q ss_pred eceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHH--HHHHHH Q lcl|Aclame:pro 237 AGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGD--LIVGKL 314 (332) Q Consensus 237 ~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d--~i~~~~ 314 (332) +|.+|+.++.+|... .+-++|+... +++. ..+++++..+... .+..| .+++.. T Consensus 318 ~G~pv~~~~~~p~~~--------------~~~gd~~~~~-~~~~--------~~~~~v~~~~~~~--~~~~~~v~~r~~~ 372 (390) T protein:vir:81 318 WGLPVVATQAMAPGE--------------FLVGAFDLAA-QIFD--------QWDARVEIGYVGE--DFQRNMITVLAEE 372 (390) T ss_pred cceeeEEcCCCCCCc--------------EEEEehhceE-EEEE--------ecceEEEEecccc--hhhcCcEEEEEEE Confidence 999999999999421 2334444322 2222 2344555443211 12223 345677 Q ss_pred HhCCceechhheeeeecC Q lcl|Aclame:pro 315 AMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 315 ~~G~~vlrpe~~v~i~~A 332 (332) .+|.++++|++.+.+.-| T Consensus 373 r~d~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 373 RLALVVYRPEALISGSFA 390 (390) T ss_pred eeccEEecccceEEEEeC Confidence 899999999999999999 No 90 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=99.25 E-value=2.4e-13 Score=89.75 Aligned_cols=296 Identities=10% Similarity=0.019 Sum_probs=157.9 Q ss_pred CCCc-cccc-------ccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecccce-e Q lcl|Aclame:pro 1 MTTL-SNFS-------LPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKL-S 71 (332) Q Consensus 1 m~~~-~~~~-------r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG~~-t 71 (332) |... ..++ +-......+...++. .+.-+.|.+++.+..+..+.+++++++.+..++..+.++..+.. . T Consensus 96 l~~~~~~~~~~e~~~~~~~~a~~~~~~~~gg---~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 172 (409) T protein:vir:45 96 MRHGASELTSEERKALRELRAQGVAQDEKGG---YTVPETFLAKVVEKMKSYGGIASVAQILTTSDGRTMEWATADGTSE 172 (409) T ss_pred HHhhhhhccHHHHHHHHHHhhccCccCcCCc---eeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEeeccCcc Confidence 1000 0010 000011112222222 24558999999999999999999998888888888888877543 2 Q ss_pred -eeeecCCCCCCccCCCCCceEEEEEeeeeec-c-hhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|Aclame:pro 72 -AGYHTPGTPIVGDAGIKANEKTLVMDDLLVS-S-QFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAE 148 (332) Q Consensus 72 -~~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~-~-~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~ 148 (332) ......|..... .+++...+++. ..+.. . +.|.+-=-..+.+|+.+.+.++.++++++..|+.|+.- .++.. T Consensus 173 ~~~~v~E~~~~~~-~~~~f~~~~l~--~~k~~~~~i~is~ell~ds~~~l~~~i~~~la~a~~~~~~~a~l~G--~G~~~ 247 (409) T protein:vir:45 173 VGVLLGENEEAGE-EDTDFGMGSLG--ALKMTSKIIRVSNELLQDSAIDMEAYLARRIAERIGRGEARYLIQG--TGAGT 247 (409) T ss_pred ccccccccccccc-cccccceeeee--eeeeeeeehhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhcc--CCCCC Confidence 234445555443 34555555554 43433 3 34655322335689999999999999999999987631 11111 Q ss_pred ccccccccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCE-EEEChHHHHHHHhhcCchhhccccccccccc Q lcl|Aclame:pro 149 ASPVTGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRV-AVLSPRQYYSLISSVDTNILNREIGNSQGDM 227 (332) Q Consensus 149 ~~~~~~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~-~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~ 227 (332) .....|.... ..............++.|.++...|..+.. ....| +++.|..|..|..-+|.. ++ +.- +..+ T Consensus 248 ~~~p~Gil~~--~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~-~~a~~~~~~n~~~~~~l~~lkd~~--G~-~i~-~~~~ 320 (409) T protein:vir:45 248 PKQPKGLAAS--VTGTTQTAAANAVKWQEILALKHSIDPAYR-RGPKFRLAFNDNTLKLISEMEDGQ--GR-PLW-LPDI 320 (409) T ss_pred ccccceeeec--cccccccccccccchHHHHHHHHhhhhhhc-cCCeEEEEECHHHHHHHHHhhcCC--Cc-eee-ccCc Confidence 1011110000 000000011111225677777777766553 33456 467999988875433332 11 111 1223 Q ss_pred cccceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHH Q lcl|Aclame:pro 228 NSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQG 307 (332) Q Consensus 228 ~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~ 307 (332) ..|. -.+++|.+|+.++++|..+.... .-+-++|++.. ++. ..++.++...+.+-. +-- T Consensus 321 ~~~~-~~~l~G~PV~~~~~~p~~~~~~~---------~i~~Gd~~~~~--i~~--------~~~~~~~~~~d~~~~-~~~ 379 (409) T protein:vir:45 321 VGVA-PASVLNVPYVIDQEIDDIGAGKK---------FMFCGDFDRFI--IRR--------VRYMILKRLVERYAE-YDQ 379 (409) T ss_pred CCCC-CceecceeeEEecCcCCccCCcc---------EEEEeehhhhh--eee--------ccceEEEEeeccccc-CCc Confidence 3443 46899999999999995322111 11223444321 222 223334444332211 001 Q ss_pred HHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 308 DLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 308 d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) -.|++.++||.++++|++.+.|.-+ T Consensus 380 ~~~~~~~r~d~~~~~~~A~~~l~~k 404 (409) T protein:vir:45 380 TGFLAFHRFDCILEDTSAIKALVGK 404 (409) T ss_pred EEEEEEEEeccEeechhheEEEEec Confidence 1267778999999999988876665 No 91 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=99.24 E-value=5.6e-13 Score=87.73 Aligned_cols=281 Identities=14% Similarity=0.075 Sum_probs=161.9 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc-cceeeeeecCCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-GKLSAGYHTPGT 79 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i-G~~t~~~~~~g~ 79 (332) |-....+ ++. +-...+ +.+ .+.-+.+..++++..++.+.++++.++-++. +.+++||+. +.+.+.-...|. T Consensus 18 ~~~~~~~-~a~--~~~~~~-~~~---~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~~p~~~~~~~a~~v~Eg~ 89 (324) T protein:vir:10 18 NVKPQVF-NPD--NVMMHE-KKD---GTLLNDFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEGQ 89 (324) T ss_pred hhcccee-ccc--ceeccC-CCc---ceechhHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEeCCcceeEeccCc Confidence 2221111 111 001110 111 2566889999999999999999998876655 456888876 566777778888 Q ss_pred CCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc Q lcl|Aclame:pro 80 PIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGGF 159 (332) Q Consensus 80 ~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~~~ 159 (332) .+... +++.+++++..-+. ..-..|.+-=-.++..++.+.+.++.++++++..|+.++.-- + .+. .+.+. T Consensus 90 ~~~~~-~~~~~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~a~l~G~--g--~~~----~~~~i 159 (324) T protein:vir:10 90 KIETS-KATWVNATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ--G--NNP----FGKSI 159 (324) T ss_pred ccccc-ccceeEEEEeeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcC--C--CCc----cCccc Confidence 77643 56777777776653 333456542222346789999999999999999999886421 1 110 11110 Q ss_pred eeccc-cccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeeeec Q lcl|Aclame:pro 160 HVNIG-AGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIAG 238 (332) Q Consensus 160 ~i~~~-~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i~G 238 (332) ..... ......+...++.|.++...|..++.... .++++|..|..|.+.+|.. + ...+..+ .-+.++| T Consensus 160 ~~~~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~--~~v~n~~~~~~L~~l~d~~-------g-~~~~~~~-~~~~l~G 228 (324) T protein:vir:10 160 AQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEAN--AFISKTQNRSLLRKIVDPE-------T-KERIYDR-NSDTLDG 228 (324) T ss_pred cccccccceeccccCCHHHHHHHHHhhhhccCCCC--EEEEcHHHHHHHHHhhccC-------C-ceeecCC-CCccccc Confidence 00000 01111122347888888888888775433 4678999999887644321 1 1112222 2457899 Q ss_pred eEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeeccc--------ch---hHHH Q lcl|Aclame:pro 239 IRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDF--------NV---QYQG 307 (332) Q Consensus 239 ~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~--------~~---~~~~ 307 (332) .+|+.++..+...+ .-+.++|++.. + +...+++++..++-. .. .|+- T Consensus 229 ~PV~~~~~~~~~~~------------~~~~gd~~~~~-~---------~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 286 (324) T protein:vir:10 229 LPVVNLKSSNLKRG------------ELITGDFDKLI-Y---------GIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQ 286 (324) T ss_pred eeEEeecCCCCCcc------------eEEEEecccEE-E---------EEecCcEEEEeecccccccccccccchhhhhc Confidence 99998876653221 12334444432 1 112334444443311 00 0111 Q ss_pred H--HHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 308 D--LIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 308 d--~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) | .+++.+++|.++++|++.+.|.-| T Consensus 287 ~~~~~r~~~r~d~~v~~~~A~~~l~~a 313 (324) T protein:vir:10 287 DMVALRATMHVALHIADDKAFAKLVPA 313 (324) T ss_pred CcEEEEEEEEEccEEecccceEEEEec Confidence 2 234456789999999999988888 No 92 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=99.23 E-value=1e-12 Score=86.33 Aligned_cols=291 Identities=12% Similarity=-0.005 Sum_probs=160.7 Q ss_pred CCCcccccccccccccc-cccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccc-eEEEec-ccceeeeeecC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGA-RNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGK-SKQFMF-TGKLSAGYHTP 77 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~-~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~-tv~i~~-iG~~t~~~~~~ 77 (332) +.. .+...+....++ ..+++. .+.=+.|.+++.+..+..+.++++++..+..++. ++.++. .+......... T Consensus 109 ~~~--~~~~~~~~~~~~~~t~~g~---~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E 183 (415) T protein:vir:47 109 FTE--YLETRNDIQGGSLKTDSGF---VVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEE 183 (415) T ss_pred HHH--HHhhhhhhhhccccccCCc---ccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceeeccc Confidence 000 000000000111 111111 2555899999999999999999999887776653 333333 34455666667 Q ss_pred CCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 78 GTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPG 157 (332) Q Consensus 78 g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~ 157 (332) |..+.....++.+++++..-+. +.-+.|.+-=-.++.+|+.+.+.+++++++++..|+.|+.-...+........... T Consensus 184 g~~~~~~~~~~~~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~- 261 (415) T protein:vir:47 184 LEENPELAVKPFFQLAYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEK- 261 (415) T ss_pred ccccccccccceeeEEeeeeee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCcccccccccc- Confidence 7666532234556666666654 23345655222345688999999999999999999988754321111100000000 Q ss_pred cceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeeee Q lcl|Aclame:pro 158 GFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIA 237 (332) Q Consensus 158 ~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i~ 237 (332) ............++.|.++...+....... =.+|++|..|..|...+|.. ..+.. ...+.+|. -++++ T Consensus 262 -----~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~--~~~v~n~~~~~~L~~lkd~~---G~~i~-~~~~~~~~-~~~l~ 329 (415) T protein:vir:47 262 -----EGKKLEVKKAKSLDDIKDAINLNVKPNYEH--NVAIVSQTMFAKLDKMKDKL---GNYLI-QPDVKEKT-QQRLL 329 (415) T ss_pred -----ccceeccccccchHHHHHHHHhhhhhccCC--CEEEEcHHHHHHHHHhhccC---CCeee-ccCcCCCC-Ccccc Confidence 001111112223677888887777766542 24579999999886544321 11211 11233443 56899 Q ss_pred ceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHHhC Q lcl|Aclame:pro 238 GIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAMG 317 (332) Q Consensus 238 G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~~G 317 (332) |++|+.++++|..+. |...-+-++|++.+. ++ ...+++++..... .+...+++.+++| T Consensus 330 G~pV~~~~~~~~~~~---------~~~~~~~gd~~~~~~-~~--------~~~~~~v~~~~~~----~~~~~~~~~~r~d 387 (415) T protein:vir:47 330 GAKIEILPDEVLGQK---------GNNTLIIGNLKDAIV-LF--------DRSQYQASWTDYM----HFGECLMIAVRQD 387 (415) T ss_pred ceeeEEeccccccCC---------CccEEEEEehhccEE-EE--------eecceEEEeeccc----cCceEEEEEEEec Confidence 999999999985322 112234445554321 12 2233444443221 1223467788999 Q ss_pred CceechhheeeeecC Q lcl|Aclame:pro 318 CGSLRTSVAGSFQAA 332 (332) Q Consensus 318 ~~vlrpe~~v~i~~A 332 (332) .++++|++.+.+.-. T Consensus 388 ~~v~~~~a~~~~~~~ 402 (415) T protein:vir:47 388 CRILDYKSAIVIEYD 402 (415) T ss_pred cEEeccccEEEEEee Confidence 999999998877533 No 93 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=99.23 E-value=1e-12 Score=86.33 Aligned_cols=291 Identities=12% Similarity=-0.005 Sum_probs=160.7 Q ss_pred CCCcccccccccccccc-cccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccc-eEEEec-ccceeeeeecC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGA-RNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGK-SKQFMF-TGKLSAGYHTP 77 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~-~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~-tv~i~~-iG~~t~~~~~~ 77 (332) +.. .+...+....++ ..+++. .+.=+.|.+++.+..+..+.++++++..+..++. ++.++. .+......... T Consensus 109 ~~~--~~~~~~~~~~~~~~t~~g~---~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E 183 (415) T protein:vir:46 109 FTE--YLETRNDIQGGSLKTDSGF---VVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEE 183 (415) T ss_pred HHH--HHhhhhhhhhccccccCCc---ccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceeeccc Confidence 000 000000000111 111111 2555899999999999999999999887776653 333333 34455666667 Q ss_pred CCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 78 GTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPG 157 (332) Q Consensus 78 g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~ 157 (332) |..+.....++.+++++..-+. +.-+.|.+-=-.++.+|+.+.+.+++++++++..|+.|+.-...+........... T Consensus 184 g~~~~~~~~~~~~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~- 261 (415) T protein:vir:46 184 LEENPELAVKPFFQLAYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEK- 261 (415) T ss_pred ccccccccccceeeEEeeeeee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCcccccccccc- Confidence 7666532234556666666654 23345655222345688999999999999999999988754321111100000000 Q ss_pred cceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeeee Q lcl|Aclame:pro 158 GFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIA 237 (332) Q Consensus 158 ~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i~ 237 (332) ............++.|.++...+....... =.+|++|..|..|...+|.. ..+.. ...+.+|. -++++ T Consensus 262 -----~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~--~~~v~n~~~~~~L~~lkd~~---G~~i~-~~~~~~~~-~~~l~ 329 (415) T protein:vir:46 262 -----EGKKLEVKKAKSLDDIKDAINLNVKPNYEH--NVAIVSQTMFAKLDKMKDKL---GNYLI-QPDVKEKT-QQRLL 329 (415) T ss_pred -----ccceeccccccchHHHHHHHHhhhhhccCC--CEEEEcHHHHHHHHHhhccC---CCeee-ccCcCCCC-Ccccc Confidence 001111112223677888887777766542 24579999999886544321 11211 11233443 56899 Q ss_pred ceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHHhC Q lcl|Aclame:pro 238 GIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAMG 317 (332) Q Consensus 238 G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~~G 317 (332) |++|+.++++|..+. |...-+-++|++.+. ++ ...+++++..... .+...+++.+++| T Consensus 330 G~pV~~~~~~~~~~~---------~~~~~~~gd~~~~~~-~~--------~~~~~~v~~~~~~----~~~~~~~~~~r~d 387 (415) T protein:vir:46 330 GAKIEILPDEVLGQK---------GNNTLIIGNLKDAIV-LF--------DRSQYQASWTDYM----HFGECLMIAVRQD 387 (415) T ss_pred ceeeEEeccccccCC---------CccEEEEEehhccEE-EE--------eecceEEEeeccc----cCceEEEEEEEec Confidence 999999999985322 112234445554321 12 2233444443221 1223467788999 Q ss_pred CceechhheeeeecC Q lcl|Aclame:pro 318 CGSLRTSVAGSFQAA 332 (332) Q Consensus 318 ~~vlrpe~~v~i~~A 332 (332) .++++|++.+.+.-. T Consensus 388 ~~v~~~~a~~~~~~~ 402 (415) T protein:vir:46 388 CRILDYKSAIVIEYD 402 (415) T ss_pred cEEeccccEEEEEee Confidence 999999998877533 No 94 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=99.23 E-value=9.5e-13 Score=86.48 Aligned_cols=293 Identities=11% Similarity=0.025 Sum_probs=157.2 Q ss_pred CCCc--cccccccc----ccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc-cceeee Q lcl|Aclame:pro 1 MTTL--SNFSLPNQ----ANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-GKLSAG 73 (332) Q Consensus 1 m~~~--~~~~r~~~----~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i-G~~t~~ 73 (332) |.-- -..-|.+. .-..+..+.+ .+.-+.+..++.+..++.+.++++.+..+.. +++.+||+. +.+.+. T Consensus 1 ~~~~~~r~~~~~~~~e~~a~~~~~~~~g----~~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~-~~~~~~p~~~~~~~a~ 75 (326) T protein:vir:42 1 MAVNPDRTTPFLGVNDPKVAQTGDSMFE----GYLEPEQAQDYFAEAEKISIVQQFAQKIPMG-TTGQKIPHWTGDVSAS 75 (326) T ss_pred CCCCccchhhhcCcchhhheeccccCCc----ceechhhHHHHHHHHHhcchhhhhcceeecc-CCceEEEEEeCCcceE Confidence 3220 01122211 0011111122 2567889999999999999988887765544 556888865 556777 Q ss_pred eecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|Aclame:pro 74 YHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVT 153 (332) Q Consensus 74 ~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~ 153 (332) .+..|..+... +++..++++...+. ..-+.|.+-=-.++.+|+.+.+.++.++++++.+|+.++.- ..+..+.. T Consensus 76 ~v~Eg~~~~~~-~~~f~~i~~~~~k~-~~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l~G----~gs~~p~g 149 (326) T protein:vir:42 76 WIGEGDMKPIT-KGNMTSQTIAPHKI-ATIFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAING----TDSPFPTF 149 (326) T ss_pred EecCCcccccc-ccceeEEEEeeEEE-EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcc----cCCCcccc Confidence 77888887754 57777877777764 44556765333346789999999999999999999988631 11111110 Q ss_pred c--ccccceeccccccccCHHHHHHH--HHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCch----hhccccccccc Q lcl|Aclame:pro 154 G--EPGGFHVNIGAGNTNDAQAIVDG--FFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTN----ILNREIGNSQG 225 (332) Q Consensus 154 ~--~~~~~~i~~~~~~~~~~~~~~d~--i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~----~~~~d~~~~~~ 225 (332) . ...........+....+...+.. +..+...+ ++....+-..+++|..|..|.+-+|.. |..... ++ T Consensus 150 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~---~~ 224 (326) T protein:vir:42 150 LAQTTKEVSLVDPDGTGSNADLTVYDAVAVNALSLL--VNAGKKWTHTLLDDITEPILNGAKDKSGRPLFIESTY---TE 224 (326) T ss_pred ccccccccceeecccccccccchhHHHHHHHHHhhh--hhhccCccEEEEeHHHHHHHHHhhccCCceeeccccc---cC Confidence 0 00000000011111111111211 12222222 222333445678999999997644422 111111 11 Q ss_pred cccccceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeeccc---- Q lcl|Aclame:pro 226 DMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDF---- 301 (332) Q Consensus 226 ~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~---- 301 (332) ..... ..+.++|++|+.++.+|... ...+.++|+... ++.+..+ +++..++-. T Consensus 225 ~~~~~-~~~~l~G~pv~~~~~~~~~~------------~~~~~Gd~s~~~--~~~~~~~--------~v~~~~e~~~~~~ 281 (326) T protein:vir:42 225 ENSPF-RLGRIVARPTILSDHVASGT------------VVGYQGDFRQLV--WGQVGGL--------SFDVTDQATLNLG 281 (326) T ss_pred ccccc-cCceeeeeeEEEcCCCCCCc------------eEEEEeecceEE--EEEecce--------EEEEeecceeeec Confidence 11111 13478999999999998421 112334555432 2232222 233222111 Q ss_pred ----ch---hHHHHH--HHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 302 ----NV---QYQGDL--IVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 302 ----~~---~~~~d~--i~~~~~~G~~vlrpe~~v~i~~A 332 (332) .. .|+-|. +++.+.++.+++||++.+.|..+ T Consensus 282 ~~~~~~~~~~~~~d~~~~r~~~~~d~~v~~~~a~~~l~~~ 321 (326) T protein:vir:42 282 TPQAPNFVSLWQHNLVAVRVEAEYAFHCNDKDAFVKLTNV 321 (326) T ss_pred ccccccchhhhhcCcEEEEEEEEeccEEecccceEEEeec Confidence 00 122233 36778899999999998888776 No 95 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=99.23 E-value=5.4e-13 Score=87.86 Aligned_cols=293 Identities=10% Similarity=-0.049 Sum_probs=163.5 Q ss_pred CCCccccccccccc-ccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccc-eEEEecc-cceeeeeecC Q lcl|Aclame:pro 1 MTTLSNFSLPNQAN-GGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGK-SKQFMFT-GKLSAGYHTP 77 (332) Q Consensus 1 m~~~~~~~r~~~~~-~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~-tv~i~~i-G~~t~~~~~~ 77 (332) ......+....... .+...+++. .+.=+.+.+++++..+..+.++++++..++.++. ++.++.. +......... T Consensus 107 ~~~~~~~~~~~~~~~~~~~~~~g~---~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E 183 (415) T protein:vir:94 107 RDFTEYLETRNDIQGGSLKTDSGF---VVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEE 183 (415) T ss_pred HHHHHHhhhhhhhhhhcccccccc---ccCcHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEeecCCccceeccc Confidence 00000000000000 001111111 1344789999999999999999999888876543 4555543 5556666777 Q ss_pred CCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 78 GTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPG 157 (332) Q Consensus 78 g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~ 157 (332) |..+.....++..++++.+-+. +.-+.|.+-=-.++.+|+.+.+.++.++++++..|+.|+.-...+... +...... T Consensus 184 g~~~~~~~~~~~~~i~~~~~k~-~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~--~~~~~~~ 260 (415) T protein:vir:94 184 LEENPELAVKPFFQLAYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTG--STSSGFE 260 (415) T ss_pred cccccccccccceeeEeeheee-eeechhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccc--ccccccc Confidence 7766533334566676666664 233456542222356889999999999999999999887543221111 1100000 Q ss_pred cceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeeee Q lcl|Aclame:pro 158 GFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIA 237 (332) Q Consensus 158 ~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i~ 237 (332) ..... ....+...|+.|+++...+...+... -.+|++|..|..|...+|.. .+ +.. ...+.+| ..++++ T Consensus 261 ~~~~~----~~~~~~~~~~~i~~~~~~~~~~~~~~--~~~vmn~~~~~~l~~lkd~~--G~-~l~-~~~~~~~-~~~~l~ 329 (415) T protein:vir:94 261 KEGKK----LEVKKAKSLDDIKDAINLNVKPNYEH--NVAIVSQTMFAKLDKMKDKL--GN-YLI-QPDVKEK-TQQRLL 329 (415) T ss_pred ccccc----cccccccchHHHHHHHHhhhhhccCC--CEEEEcHHHHHHHHHhhccC--CC-eee-ccCcCCC-CCceec Confidence 00011 11112234788888888887777642 24578999999997644422 11 111 1123334 357899 Q ss_pred ceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHHhC Q lcl|Aclame:pro 238 GIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAMG 317 (332) Q Consensus 238 G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~~G 317 (332) |.+|+.++.+|..+. |...-+-++|++... .....+++++..+. ..+...+++.+.++ T Consensus 330 G~pV~~~~~~~~~~~---------~~~~i~~gd~~~~~~---------~~~~~~~~v~~~~~----~~~~~~~r~~~r~d 387 (415) T protein:vir:94 330 GAKIEILPDEVLGQK---------GNNTLIIGNLKDAIV---------LFDRSQYQASWTDY----MHFGECLMIAVRQD 387 (415) T ss_pred ceeeEEecccccCCC---------CccEEEEEehhccEE---------EEeecceEEEEecc----ccCceEEEEEEEec Confidence 999999999985322 111223344443221 12223345544332 12334567788999 Q ss_pred CceechhheeeeecC Q lcl|Aclame:pro 318 CGSLRTSVAGSFQAA 332 (332) Q Consensus 318 ~~vlrpe~~v~i~~A 332 (332) .++++|++.+.+.-. T Consensus 388 ~~~~~~~a~~~~~~~ 402 (415) T protein:vir:94 388 CRILDYKSAIVIEYD 402 (415) T ss_pred cEEeccccEEEEEEe Confidence 999999998877543 No 96 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=99.21 E-value=7.5e-13 Score=87.05 Aligned_cols=294 Identities=9% Similarity=0.025 Sum_probs=157.3 Q ss_pred CCCcc--cccccccccccccccccCchhhHHHHHHhHHHH-HHHHHhhhhccccccccccccceEEEec-ccceeeeeec Q lcl|Aclame:pro 1 MTTLS--NFSLPNQANGGARNADYDVRYATALKLFSGEVF-TAFNNASIFKGLVRSYDLRGGKSKQFMF-TGKLSAGYHT 76 (332) Q Consensus 1 m~~~~--~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~-~~f~~~s~~~~~v~~r~~~~G~tv~i~~-iG~~t~~~~~ 76 (332) +.... .+. .....+...+++. .|.-+.|..++. ..+...+.+..+.++... .| .+.+|+ .+.+.+.... T Consensus 237 l~~~e~~~~~--~~~~~~~t~~~gg---~lip~~~~~~ii~~~~~~~~~l~~~~~~~~~-~g-~~~~~~~~~~~~a~~v~ 309 (543) T protein:vir:81 237 LTEEEKRAIN--EVRAMGLTKADGG---YLVPFQLDPTVIITSNGSLNDIRRFARQVVA-TG-DVWHGVSSAAVQWSWDA 309 (543) T ss_pred hhhhhhhhhh--hhhhcccccccCc---ccCchhhhhHHHHHHHhhhchhhhhcccccC-Cc-ceEEEEecCCcceeecc Confidence 00000 000 0000111111222 155578887765 556666888877765433 34 455664 4666777777 Q ss_pred CCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc-- Q lcl|Aclame:pro 77 PGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTG-- 154 (332) Q Consensus 77 ~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~-- 154 (332) .|..+... .++..++++...+.- .-+.|++ +-.+...|+.+.+.+++++++++..|+.|+.- ..+.....| T Consensus 310 Eg~~~~~~-~~~~~~i~~~~~k~~-~~~~is~-ell~d~~~~~~~i~~~l~~~~~~~~d~ail~G----~Gt~~~p~Gi~ 382 (543) T protein:vir:81 310 EFEEVSDD-SPEFGQPEIPVKKAQ-GFVPISI-EALQDEANVTETVALLFAEGKDELEAVTLTTG----TGQGNQPTGIV 382 (543) T ss_pred cCcccccc-ccccceeeeeeeeeE-eeehhhH-HHHhccHHHHHHHHHHHHHHHHHHHHHHHhcc----CCCCcccccch Confidence 78777653 577778777777653 3345654 33445679999999999999999999988621 111100111 Q ss_pred -ccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhcccccccccccccccee Q lcl|Aclame:pro 155 -EPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGL 233 (332) Q Consensus 155 -~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v 233 (332) ..++.... ....+.....++.+.++...|...+-+. -.+|++|..|..|...+|.. ++ +.- ..+..|. - T Consensus 383 ~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~l~~~~~~~--~~~v~n~~~~~~l~~lkd~~--G~-~l~--~~~~~g~-~ 452 (543) T protein:vir:81 383 TALAGTAAE--IAPVTAETFALADVYAVYEQLAARHRRQ--GAWLANNLIYNKIRQFDTQG--GA-GLW--TTIGNGE-P 452 (543) T ss_pred hhccccccc--ccccccccccHHHHHHHHHhhhccccCC--cEEEEcHHHHHHHHHhhcCC--Cc-eec--cCcCCCC-C Confidence 00000111 1111222334677888887776665432 35678999999997654432 11 111 1123342 4 Q ss_pred eeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeeccc---chhHHHHHH Q lcl|Aclame:pro 234 YSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDF---NVQYQGDLI 310 (332) Q Consensus 234 ~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~---~~~~~~d~i 310 (332) ++++|.+|+.++++|...... ...|...-|.++|+... ++ ...+++++....-. +.....-.+ T Consensus 453 ~~l~G~pv~~~~~~~~~~~~~----~~~~~~~i~~gd~~~~~--i~--------~~~~~~i~~~~~~~~~~~~~~~~~~~ 518 (543) T protein:vir:81 453 SQLLGRPVGEAEAMDANWNTS----ASADNFVLLYGNFQNYV--IA--------DRIGMTVEFIPHLFGTNRRPNGSRGW 518 (543) T ss_pred ccccceeeEEecccccccccc----ccCCcceEEEeecccee--EE--------eecccEEEEeccccccchhhcCceEE Confidence 689999999999999643221 12233334455555322 11 12233443321110 000001124 Q ss_pred HHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 311 VGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 311 ~~~~~~G~~vlrpe~~v~i~~A 332 (332) ++...+|.++++|++.+.+.-+ T Consensus 519 ~~~~r~d~~v~~~~A~~~l~~~ 540 (543) T protein:vir:81 519 FAYYRMGADVVNPNAFRLLNVE 540 (543) T ss_pred EEEEeeccEeecccceEEEEec Confidence 4556679999999998877666 No 97 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=99.20 E-value=2.9e-12 Score=83.85 Aligned_cols=291 Identities=14% Similarity=0.041 Sum_probs=154.5 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc-cceeeeeecCCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-GKLSAGYHTPGT 79 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i-G~~t~~~~~~g~ 79 (332) |+.. +- .|| .+.=++|+.++++..+..++++.+.+..+... ..++||+. +.+++.-...|. T Consensus 1 Mat~---tt-----~~g---------~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~-~~~~~p~~~~~~~a~wv~Eg~ 62 (311) T protein:vir:99 1 MATF---GT-----GNL---------KNLPRNIADGMVKDVVQGSTVAVLSARKPQRF-GNEDIITFNGRPKAEFVGEGQ 62 (311) T ss_pred Ccee---cC-----CCc---------eeccHHHHHHHHHHHHhhchhhhhcceeeccC-CceEEEEEeCCceeEEeecCc Confidence 5542 21 111 14458899999999999999999887655554 44688876 677777777888 Q ss_pred CCCccCCCCCceEEEEEeeeeecchhhhhHHH---HHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|Aclame:pro 80 PIVGDAGIKANEKTLVMDDLLVSSQFVYSLDE---IFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEP 156 (332) Q Consensus 80 ~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~---~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~ 156 (332) .+... +++..++++..-+. ..-+.|.+-=- ..+..++.+.+.+++++++++.+|+.++.-.. ........+.. T Consensus 63 ~~~~~-~~~f~~v~l~~~k~-~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g--~~~g~~~~g~~ 138 (311) T protein:vir:99 63 QKSST-TGEFDFVTSTPKKA-QVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRIN--PLTGTVIPGWS 138 (311) T ss_pred ccccc-cceeeEEEEeeEEE-EEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccC--cccCccccccc Confidence 77643 46666676665443 23344544111 13457899999999999999999998874211 10111111110 Q ss_pred c-----cceeccccccccCHHHHHHHHHHHHHHHHhcCC--CcCCCEEEEChHHHHHHHhhcCchhhccccccccccccc Q lcl|Aclame:pro 157 G-----GFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSA--PQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNS 229 (332) Q Consensus 157 ~-----~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~V--P~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~ 229 (332) . ...+..+.. ....+++.|..+...+..... +..+ +++.|..+..|.+.+|.+ .+ +.- .. ... T Consensus 139 ~~~~~~~~~~~~~~~---~~~~~~~~i~~~~~~~~~~~~~~~~~~--~vmn~~~~~~L~~lkd~~--G~-~l~-~~-~~~ 208 (311) T protein:vir:99 139 NYLGAASKRVELTAD---TIANPDLAIEAAVGLLVANGHPTPVNG--LALHPSIAWGLSTARYTD--GR-KKF-PE-LGL 208 (311) T ss_pred cccccccceeecccc---ccchhHHHHHHHHHHHhhhccCCCccE--EEEcHHHHHHHHhhhccC--CC-eee-cC-ccc Confidence 0 011111111 122223334444444443332 2232 578999999987654432 11 111 11 112 Q ss_pred cceeeeeeceEEEeeCcccccccccccccc--cccccccccccccceEEEeechhhhhhhhhccceeeeeeccc-c---h Q lcl|Aclame:pro 230 GKGLYSIAGIRILKSNNLAGLYGQDLSSAA--VTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDF-N---V 303 (332) Q Consensus 230 g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~--~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~-~---~ 303 (332) +...++++|++|+.|+++|...+....... .......|-++|++.+-+ ...++++++..+.-. + . T Consensus 209 ~~~~~~l~G~Pv~~s~~i~~~~~~~~~~~~~~~~~~~~~~~Gdf~~~~~~---------~~~~~~~~~~~~~~~~~~~~~ 279 (311) T protein:vir:99 209 GIGVSSFEGIDASVSDTVNGGDEADPDDEDLDAARAVRGIVGDFANGIHW---------GVQRDIPVELIKYGDPDGQGD 279 (311) T ss_pred CCCCceecceeeEeecccccccccccccchhhccCcceEEEeeccccEEE---------EEecCceEEEeecCCCCcchh Confidence 223578999999999999854332211111 111111233444432211 112233444432111 0 0 Q ss_pred hHHHHHH--HHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 304 QYQGDLI--VGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 304 ~~~~d~i--~~~~~~G~~vlrpe~~v~i~~A 332 (332) .|+.|++ ++..++|..+++|++++...+| T Consensus 280 ~~~~d~~~~r~~~r~d~~v~~~~~v~~~~~~ 310 (311) T protein:vir:99 280 LKRHNQIALRLEIVYGWYVFTDRFVVIENAV 310 (311) T ss_pred hhhcCcEEEEEEEeecceecChhHeeeeccc Confidence 1233333 5667889999998765544444 No 98 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=99.19 E-value=5.3e-12 Score=82.42 Aligned_cols=295 Identities=15% Similarity=0.100 Sum_probs=158.8 Q ss_pred CCCccc---------------ccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccc-cccccccccceEEE Q lcl|Aclame:pro 1 MTTLSN---------------FSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGL-VRSYDLRGGKSKQF 64 (332) Q Consensus 1 m~~~~~---------------~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~-v~~r~~~~G~tv~i 64 (332) |..-.. -.+.+..+ ++..+.++ .+.=+.+..+|++..+..+.++.+ .+.-+...| .+.+ T Consensus 105 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~gg---~lvP~~~~~~ii~~l~~~~~i~~~~~~~v~~~~~-~~~~ 179 (435) T protein:vir:80 105 LAAARGDAQLASKLAIERGFGEEVAMSLN-TLSPGAGG---VLVPENLSSEVIELLRPKSVVRKLGARTLPLSNG-NITI 179 (435) T ss_pred HHhccchhHHHHHHHHhhhhhhhhhhhhc-ccCCCCCc---cccchhHHHHHHHHHhhhchhhhccceeeecCCC-ceEE Confidence 100000 00000001 11112222 134477889999998888888776 333333334 5788 Q ss_pred ecc-cceeeeeecCCCCCCccCCCCCceEEEEEeeeeecchhhhh--HHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 65 MFT-GKLSAGYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYS--LDEIFSQYSTRAEVSKQIGEALATHYDERIARV 141 (332) Q Consensus 65 ~~i-G~~t~~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd--~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~ 141 (332) |+. +.+.+.-...|..... .+++.+++++...+. +.-+.|.+ ++.....+++.+.+.++.++++++..|+.++.- T Consensus 180 p~~~~~~~a~~v~E~~~~~~-~~~~f~~i~~~~~k~-~~~~~is~ell~ds~~~~~l~~~i~~~l~~a~~~~~d~a~l~G 257 (435) T protein:vir:80 180 PRLKGGAIVGYIGADTDIPT-TQQQFDDLKLTAKKM-AALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRD 257 (435) T ss_pred EEEeCCcceeeeccCccccc-cccceeeEEEeeEEE-EEeehhhHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 866 5566666666776654 356677777777664 33445644 122222457889999999999999999988631 Q ss_pred HHHHhhhccccccc---cccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhcc Q lcl|Aclame:pro 142 LAKASAEASPVTGE---PGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNR 218 (332) Q Consensus 142 ~~~aa~~~~~~~~~---~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~ 218 (332) .-+.....|. .....+. .+........++..+.++...|..++....+-..|++|..|..|...+|.+ . T Consensus 258 ----~G~~~~p~Gi~~~~~~~~~~-~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~---G 329 (435) T protein:vir:80 258 ----DGTANTPKGLRFWALPGNVI-TASDGSTLQKIETDLGKAILALENADANLTQPGWIMAPRTFRFLEGLRDGN---G 329 (435) T ss_pred ----CCCCCcccceeeccccccee-ecccccchhhHHHHHHHHHHHhhccccccccCEEEEcHHHHHHHHhhhccC---C Confidence 1111101110 0000111 111122233444556777777777766544445578999998886543321 1 Q ss_pred ccccccccccccceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeee Q lcl|Aclame:pro 219 EIGNSQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTS 298 (332) Q Consensus 219 d~~~~~~~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~ 298 (332) .+.-. .+. -+.++|.+|+.++.+|...+.. .....-|.++|+..+ ++- ..+++++..+ T Consensus 330 ~~l~~--~~~----~~~l~G~pv~~~~~~p~~~~~~------~~~~~i~~gd~s~~~--i~~--------~~~~~i~~~~ 387 (435) T protein:vir:80 330 NKVYP--ELA----NGMLKGYPVGKTTQVPINLGEA------GKESEIYFTDFGDVF--IGE--------EETLEIDYSK 387 (435) T ss_pred ceecc--CCC----CCeEeeeeeEEeccccccccCC------CCcceEEEEEcccEE--EEe--------ecceEEEEec Confidence 11110 011 2368999999999999643221 111223456666532 222 2233444433 Q ss_pred cccch--------hHH--HHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 299 GDFNV--------QYQ--GDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 299 ~~~~~--------~~~--~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) +-.-. .|+ .-.++....|+.++.||++.+.|..+ T Consensus 388 ~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~~ 431 (435) T protein:vir:80 388 EATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSGV 431 (435) T ss_pred cccccccccchhhhhhcCcceeeeeeeeCcEeecccceEEEecc Confidence 21100 011 13456788999999999998888888 No 99 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=99.19 E-value=2.6e-12 Score=84.05 Aligned_cols=289 Identities=9% Similarity=-0.013 Sum_probs=156.6 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc-cceeeeeecCCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-GKLSAGYHTPGT 79 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i-G~~t~~~~~~g~ 79 (332) ..++.+ |.. .. .+. .++. .+.-+.+..+|.+..++.++++++.+..+.. +.+.+||+. +.+.+.-...|. T Consensus 6 ~~~~e~--~~~-~~-~~~-~~~~---~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~ 76 (318) T protein:vir:24 6 AFAVDH--AQI-AQ-TGD-TMFK---GYLEPEQAKDYFAEAEKTSIVQQFAQKVPMG-TTGQKIPHWVGDVSAQWIGEGD 76 (318) T ss_pred CCCHHH--HHh-hc-ccC-cccc---eeechhHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEeCCcceEEecCCc Confidence 111111 000 00 011 1111 2556889999999999999999998766654 556888865 567777788888 Q ss_pred CCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc Q lcl|Aclame:pro 80 PIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGGF 159 (332) Q Consensus 80 ~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~~~ 159 (332) .+... +++.+++++..-+. ..-..|.+-=-.++..|+.+.+.+++++++++.+|+.++.-- .+..+........ T Consensus 77 ~~~~~-~~~f~~i~~~~~k~-~~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G~----g~~~~~~~~~~~~ 150 (318) T protein:vir:24 77 MKPIT-KGNMTSQTIAPHKI-ATIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAMHGT----DSPFPTYIGQTTK 150 (318) T ss_pred ccccc-ccceeEEEEeeEEE-EEeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhhccc----CCCCCcccccccc Confidence 87754 56667766666653 233445542222356789999999999999999999886311 1111110000000 Q ss_pred eeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchh---hccccccccccccccceeeee Q lcl|Aclame:pro 160 HVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNI---LNREIGNSQGDMNSGKGLYSI 236 (332) Q Consensus 160 ~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~---~~~d~~~~~~~~~~g~~v~~i 236 (332) .+..+.. ........+.+.++...+...+. ..-.++++|..|..|.+.+|.+- ...+..+....... -+.+ T Consensus 151 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~--~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~---~~~i 224 (318) T protein:vir:24 151 AISIADT-TGATTVYDQVAVNGLSLLVNDGK--KWTHTLLDDITEPILNGAKDQNGRPLFIESTYGEAASPFR---SGRI 224 (318) T ss_pred ccccccc-ccccchHHHHHHHHHHhhccccC--CCCEEEEcHHHHHHHHHhhccCCceeecCccccCcccccc---CceE Confidence 1111111 11111223344555555554443 33456899999999976554320 00011111111111 2478 Q ss_pred eceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccc--------h---hH Q lcl|Aclame:pro 237 AGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFN--------V---QY 305 (332) Q Consensus 237 ~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~--------~---~~ 305 (332) +|++++.++++|... ...+.++|+... ++. ..++.++..++-.- . .| T Consensus 225 ~g~pv~~~~~~~~~~------------~~~~~gdfs~~~--~~~--------~~~l~i~~~~~~~~~~~~~~~~~~~~~f 282 (318) T protein:vir:24 225 VARPTILSDHVVEGT------------TVGFMGDFSQLI--WGQ--------IGGLSFDVTDQATLNLGTVESPNFVSLW 282 (318) T ss_pred EEEeeEEeCCCCCCc------------cEEEEeecceEE--EEE--------ecCeEEEEeeccceeccccccccchhhh Confidence 999999999988421 112334554322 222 22334443332110 0 01 Q ss_pred HHH--HHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 306 QGD--LIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 306 ~~d--~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) +-| .+++.+.+|.+++||++.+.|+.+ T Consensus 283 ~~~~~~~r~~~r~d~~v~~~~a~~~i~~~ 311 (318) T protein:vir:24 283 QHNLVAVRVEAEYAFHCNDAEAFVALTNV 311 (318) T ss_pred hcCcEEEEEEEEEccEEecccceEEEEee Confidence 112 246677899999999999888887 No 100 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=99.19 E-value=2.3e-12 Score=84.42 Aligned_cols=295 Identities=10% Similarity=0.057 Sum_probs=157.1 Q ss_pred CCC------ccccccccc---ccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEec-ccce Q lcl|Aclame:pro 1 MTT------LSNFSLPNQ---ANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMF-TGKL 70 (332) Q Consensus 1 m~~------~~~~~r~~~---~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~-iG~~ 70 (332) +.. +.+.-|-+. .-..+..+++. .+.=+.|..++.+..+..+.+++++++.+..+++ .++|. .+.+ T Consensus 108 ~~~~~~~~af~~~l~~~e~~~al~~~t~~~gG---~lvP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~~~~~~~~ 183 (425) T protein:vir:10 108 LRDPEYTEAFKAHVKRGDVQAALNKGEDSEGG---YLTPIEWDRTITNKLVLISPMRQLCRVQPVSKAG-FSKLFNMGGT 183 (425) T ss_pred cccHHHHHHHHHHhhhhhhHHHhhcCcCCCCc---eeccHhHHHHHHHHHHhhhhhhhhceeeeccCCc-eEEEEEcCCc Confidence 000 000000000 00111112222 2555999999999999999999999877776554 55554 4666 Q ss_pred eeeeecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc Q lcl|Aclame:pro 71 SAGYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEAS 150 (332) Q Consensus 71 t~~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~ 150 (332) ++.-...|........++..++++..-+. ..-..|.+-=-.++.+|+.+.+.++.++++++..|+.++.= .+ .+. T Consensus 184 ~a~wv~E~~~~~~~~~~~f~~v~~~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~ai~~~~d~~~l~G--~G--~~~ 258 (425) T protein:vir:10 184 TSGWVGEASQRPQTNAATFQPLSFASGEI-YANPAATQQILDDAEIDLESWLATEVQTEFAKQEGKAFLAG--DG--TNK 258 (425) T ss_pred ceeeeccccccccccccccceeeeeheee-EeehHhHHHHHhcchhHHHHHHHHHHHHHHHHHHHhhhhcc--cC--CCC Confidence 66666666655433223455566655543 22334554222345689999999999999999999987631 01 111 Q ss_pred ccc---ccccccee------c-cccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhcccc Q lcl|Aclame:pro 151 PVT---GEPGGFHV------N-IGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREI 220 (332) Q Consensus 151 ~~~---~~~~~~~i------~-~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~ 220 (332) |.+ ...++... . ............++.|+++...|...... +-..|++|..|..|...+|.+ ++ + T Consensus 259 p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~~~--~a~~vmn~~~~~~L~~lkD~~--G~-~ 333 (425) T protein:vir:10 259 PNGLLTYIAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVYDLPSAFTG--NARFAMNRNTQRQVRKLKDGQ--GN-Y 333 (425) T ss_pred cceeeeccccccccccccccccccccccccccccHHHHHHHHhhhhhhhcc--CCEEEEchHHHHHHHHhhcCC--Cc-e Confidence 100 00000000 0 00001112223467788887777665442 334579999999887654432 11 1 Q ss_pred ccccccccccceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecc Q lcl|Aclame:pro 221 GNSQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGD 300 (332) Q Consensus 221 ~~~~~~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~ 300 (332) . -...+..|. -++++|.+|+.++++|....... .-+-++|+... .++.+.. +++.++. T Consensus 334 l-~~~~~~~g~-~~~l~G~PV~~~~~~p~~~~~~~---------~i~~Gd~~~~~-~i~~~~~----------~~v~~d~ 391 (425) T protein:vir:10 334 L-WQPSYVAGQ-PATLAGYPVTEVPDMPDVAANST---------PILFGDFQQTY-LIIDRIG----------VRVLRDP 391 (425) T ss_pred e-eccCccCCC-CceecceeeEEecCcCCccCCcc---------EEEEEehhccE-EEEEecc----------eEEEecc Confidence 1 112234453 46899999999999995422111 11234554422 1222221 2222222 Q ss_pred cchhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 301 FNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 301 ~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) +... --..+++..++|.++++|++...|.-+ T Consensus 392 ~~~~-~~~~~~~~~r~d~~v~~~~A~~~l~~~ 422 (425) T protein:vir:10 392 YTAK-PYVLFYTTKRVGGGLLNPEPMRAMKVA 422 (425) T ss_pred cccC-CcEEEEEEEEeccEeecccceEEEEee Confidence 2111 112345667899999999998877766 No 101 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=99.19 E-value=1.5e-12 Score=85.45 Aligned_cols=285 Identities=14% Similarity=0.078 Sum_probs=163.9 Q ss_pred CCCcc-----------cccccccccccc--cccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc Q lcl|Aclame:pro 1 MTTLS-----------NFSLPNQANGGA--RNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT 67 (332) Q Consensus 1 m~~~~-----------~~~r~~~~~~~~--~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i 67 (332) |=.+. .+-++...+... ...+++ .+.=+.|..++++..++.+.++++++.-+.. +.+++||+. T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~---~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~ip~~ 76 (324) T protein:vir:97 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKD---GTLMNEFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFW 76 (324) T ss_pred CccchhHHHHHHHHHHhhhhhhhhccccccccCCCc---ceechhHHHHHHHHHHhhcchhhhcceeecc-CCceEEEEE Confidence 11100 011111000000 001111 2555889999999999999999998766644 556889976 Q ss_pred -cceeeeeecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 68 -GKLSAGYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKAS 146 (332) Q Consensus 68 -G~~t~~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa 146 (332) +.+.+.-...|..+.. .+++.+++++..-+. ..-..|.+---.++..++.+.+.++.++++++..|+.++.-- + T Consensus 77 ~~~~~a~~v~Eg~~~~~-~~~~f~~v~~~~~k~-~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~--g- 151 (324) T protein:vir:97 77 ADKPGAYWVGEGQKIET-SKATWVNATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ--G- 151 (324) T ss_pred ecCcceeEeccCccccc-cccceeEEEEeeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhccC--C- Confidence 5667777777887764 356777777777664 344456552222346789999999999999999999887421 1 Q ss_pred hhccccccccccceeccc-cccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccc Q lcl|Aclame:pro 147 AEASPVTGEPGGFHVNIG-AGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQG 225 (332) Q Consensus 147 ~~~~~~~~~~~~~~i~~~-~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~ 225 (332) .. ..+.+...... ......+...++.|.++...|..++.... .++++|..|..|.+.+|.. ... T Consensus 152 -~~----~~~~gi~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~--~~v~n~~~~~~L~~lkd~~--------g~~ 216 (324) T protein:vir:97 152 -NN----PFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEAN--AFISKTQNRSLLRKIVDPE--------TKE 216 (324) T ss_pred -CC----ccCccccccccccceeccccCCHHHHHHHHHhhhhccCCCC--EEEEcHHHHHHHHHhhcCC--------Cce Confidence 11 11111100000 11111222347888888888888776433 5688999999887654432 112 Q ss_pred cccccceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeeccc---- Q lcl|Aclame:pro 226 DMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDF---- 301 (332) Q Consensus 226 ~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~---- 301 (332) .+. +..-+.++|.+|+.++..+...+ .-+.++|++.. + +...+++++..++-. T Consensus 217 ~~~-~~~~~tl~G~PV~~~~~~~~~~~------------~~~~gd~~~~~--i--------~~~~~~~i~~~~~~~~~~~ 273 (324) T protein:vir:97 217 RIY-DRNSDTLDGLPVVNLKSSNLKRG------------ELITGDFDKLI--Y--------GIPQLIEYKIDETAQLSTV 273 (324) T ss_pred eec-CCCCccccceeeEeecCCCCCcc------------eEEEEecccEE--E--------EEecCcEEEEeeccccccc Confidence 222 22346789999999887664221 12334444322 1 122344555443311 Q ss_pred ----ch---hHHHH--HHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 302 ----NV---QYQGD--LIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 302 ----~~---~~~~d--~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) .. .++-| .+++.+.+|.++++|++.+.|+-| T Consensus 274 ~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~ 313 (324) T protein:vir:97 274 KNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPA 313 (324) T ss_pred ccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEec Confidence 00 01112 234456789999999999988888 No 102 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=99.18 E-value=2.1e-12 Score=84.65 Aligned_cols=294 Identities=9% Similarity=0.024 Sum_probs=158.0 Q ss_pred CCCcc--ccccccc-ccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEec-ccceeeeeec Q lcl|Aclame:pro 1 MTTLS--NFSLPNQ-ANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMF-TGKLSAGYHT 76 (332) Q Consensus 1 m~~~~--~~~r~~~-~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~-iG~~t~~~~~ 76 (332) |-... .++..-. ....+..+++. .+.=+.|..++++..+..+.++++++..+..++ +..+|. .+.+++.-.. T Consensus 90 l~~g~~~~~~~~e~~a~~~~t~~~gG---~~iP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~a~~v~ 165 (407) T protein:vir:48 90 MRKGREDGLRELERKALQVGNDEDGG---YAIPEELDRTILTLLKDEVVMRQEATVITLGGS-DYKKLVNLGGTTSGWVG 165 (407) T ss_pred HhccchhhhhHHHHHhhhcccCCCCc---ccccHhHHHHHHHHHHhhhhhhhhceeeecCCC-ceEEEEecCCcceeeec Confidence 11100 1110000 01112222222 144488999999999999999998887666655 555654 4556666666 Q ss_pred CCCCCCccCCCCCceEEEEEeeeeecc-hhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc- Q lcl|Aclame:pro 77 PGTPIVGDAGIKANEKTLVMDDLLVSS-QFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTG- 154 (332) Q Consensus 77 ~g~~~~~~~~~~~~~~~l~ID~~~~~~-~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~- 154 (332) .|.........+..++++.+-+ +.. ..|.+-=-.++.+++.+.+.++.++++++..|+.++.- .+ +..|.+. T Consensus 166 E~~~~~~~~~~~f~~i~~~~~k--~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~~~a~l~G--~G--~~~p~Gil 239 (407) T protein:vir:48 166 ETDARPETATSKLGLIEPFMGE--IYGNPQATQKMLDDAFFNVEDWINSELALEFAEQEEIAFTSG--DG--SKKPKGFL 239 (407) T ss_pred ccccccccccccceeEEeeeee--eEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhcc--CC--CCccceee Confidence 6766543333455666666654 333 34554322345678999999999999999999987631 11 1111000 Q ss_pred -ccccceecc--------ccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccc Q lcl|Aclame:pro 155 -EPGGFHVNI--------GAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQG 225 (332) Q Consensus 155 -~~~~~~i~~--------~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~ 225 (332) ......... .......+...++.|.++...|..+..+.. .+|++|..|..|...+|.+ .+ +. -.. T Consensus 240 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~~~a--~~v~n~~~~~~L~~lkD~~--Gr-~l-~~~ 313 (407) T protein:vir:48 240 AYESTDEDDKTRAFGKLQHIASGAASGVTADAIIKLIYTLRKAHRSGA--KFMMNNSSLFAIRLLKDND--GN-YL-WRP 313 (407) T ss_pred ecccccccccccccccccccccccccccChHHHHHHHHhhchhhhcCC--EEEEcHHHHHHHHHhhccC--Cc-ee-ecc Confidence 000000000 000011112236778888888877765432 3578999998886544432 11 11 111 Q ss_pred cccccceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhH Q lcl|Aclame:pro 226 DMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQY 305 (332) Q Consensus 226 ~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~ 305 (332) .+..|. .++++|.+|+.++++|..+.... .-+-++|+... .++.+ .++++ .++++-.+ T Consensus 314 ~~~~g~-~~~l~G~PV~~~~~~p~~~~~~~---------~i~~Gd~~~~~-~i~~~--------~~~~i--~~d~~~~~- 371 (407) T protein:vir:48 314 GIELGQ-PSSLAGYGIVENEQMPDIAADAK---------AIAFGNFKRGY-TIVDR--------IGTRI--LRDPYTNK- 371 (407) T ss_pred CcCCCC-CceecceeeEEecCcCCccCCcc---------EEEEEeccccE-EEEEe--------eceEE--EeeccccC- Confidence 233443 56899999999999996422111 11224444321 12211 22222 22322111 Q ss_pred HHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 306 QGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 306 ~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) --..+++.+++|+++++|++.+.|.-+ T Consensus 372 ~~~~~~~~~r~d~~v~~~~a~~~l~~~ 398 (407) T protein:vir:48 372 PFVGFYTTKRTGGMLVDSQAIKLMKIG 398 (407) T ss_pred CcEEEEEEEEeccEEecccceEEEEee Confidence 112356677899999999999888766 No 103 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=99.18 E-value=1e-12 Score=86.34 Aligned_cols=290 Identities=10% Similarity=0.052 Sum_probs=157.5 Q ss_pred CCCc-cccc-ccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc-cceeeeeecC Q lcl|Aclame:pro 1 MTTL-SNFS-LPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-GKLSAGYHTP 77 (332) Q Consensus 1 m~~~-~~~~-r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i-G~~t~~~~~~ 77 (332) |..- .... ++.. ..+-..+++. .+--+++...+.+...+.++++.+.+..+..++..+.||.. |.+++.-... T Consensus 97 ~~~~~~~~~~~~~~-~~~t~~~~g~---~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E 172 (392) T protein:vir:13 97 NLGEARSFEFAPEK-RDGTKAGNPN---VLSRTLYGQLIAQAVERSAIMRGGASTFTTSDANPMDFTVITGRATAGIVGE 172 (392) T ss_pred chhhhHHHHhhhhh-hcccccCCCc---cccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCcceeeecc Confidence 1000 0000 0111 1111111111 13335666777777778888888888777777888888866 5567777777 Q ss_pred CCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 78 GTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPG 157 (332) Q Consensus 78 g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~ 157 (332) |..+... +++.+++++..-+. ..-..|++-=-.++.+|+.+.+.++.++++++..|..++.= .+ ++.|.+.... T Consensus 173 ~~~~~~~-~~~f~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G--~G--t~~p~Gil~~ 246 (392) T protein:vir:13 173 TAEIPES-YPATTQRSMGGFKY-GFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFLTG--TG--TGQPRGILTD 246 (392) T ss_pred ccccccc-ccceeeEEeeeeeE-EeeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcc--cC--Cccccccccc Confidence 8777653 56777777777653 33345655322346778999999999999999999988731 11 1111110000 Q ss_pred cceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeeee Q lcl|Aclame:pro 158 GFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIA 237 (332) Q Consensus 158 ~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i~ 237 (332) ........+........|+.|+++...|...... ...| |++|..+..|..-+|.. .+ +.- ...+..|. -..++ T Consensus 247 ~~~~~~~~~~~~~~~~~~d~l~~~~~~l~~~~~~-~a~~-v~n~~~~~~l~~lkd~~--G~-~l~-~~~~~~g~-~~~l~ 319 (392) T protein:vir:13 247 ATGANAAFGEADADSKVSDALIDLFHEVPSAYRK-NAKF-VVNDLRAAQMRKLKDAN--GQ-YLW-QSALTVGA-PDTFN 319 (392) T ss_pred cccccccccccccccccHHHHHHHHHhhhhhhhc-CCEE-EEcHHHHHHHHHhhccC--Cc-eee-cCCcCCCC-Cceec Confidence 0000000111111223377788877777655332 2344 77999998886544321 11 111 11223343 46899 Q ss_pred ceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHHhC Q lcl|Aclame:pro 238 GIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAMG 317 (332) Q Consensus 238 G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~~G 317 (332) |.+|+.++++|... -+-++|+.. +++.+. +++++...+.+-.+ =-..+++.+++| T Consensus 320 G~Pv~~~~~~~~~~--------------i~~Gdf~~~--~i~~~~--------~~~i~~~~~~~~~~-~~~~~r~~~r~d 374 (392) T protein:vir:13 320 GKVVETDDGMPADK--------------VLFADLSKY--RVRFAG--------SLRVDRSVDAKFST-DQIVYRFLQRAD 374 (392) T ss_pred ceeeEEcCCCCCCc--------------EEEeeccce--eEEeec--------ceEEEeeccccccC-CcEEEEEEEEec Confidence 99999999998421 123455432 222222 23333332211000 012356778899 Q ss_pred Cceechhheeeeec--C Q lcl|Aclame:pro 318 CGSLRTSVAGSFQA--A 332 (332) Q Consensus 318 ~~vlrpe~~v~i~~--A 332 (332) +++++|++.+.++- | T Consensus 375 ~~~~~~~A~~~~~~~~a 391 (392) T protein:vir:13 375 GLLVDARGAKVLTVTPA 391 (392) T ss_pred cEEecccceEEEEeecc Confidence 99999999885543 3 No 104 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=99.18 E-value=1.9e-12 Score=84.80 Aligned_cols=274 Identities=12% Similarity=-0.024 Sum_probs=159.3 Q ss_pred ccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhcccccccccccc-ceEEEeccc--ceeeeeecCCCCCCc Q lcl|Aclame:pro 7 FSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGG-KSKQFMFTG--KLSAGYHTPGTPIVG 83 (332) Q Consensus 7 ~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G-~tv~i~~iG--~~t~~~~~~g~~~~~ 83 (332) +-|- ..++..+++. ++.=+.|..++++..+..+.++++++..++..+ .+..|+... .........|..+.. T Consensus 1 ~l~~---~~~~t~~~gg---~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~~ 74 (293) T protein:vir:48 1 MLDS---KTDHSGSDAG---LTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIAD 74 (293) T ss_pred Ccee---ecccccCcCc---eEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCcceeeecCCccccc Confidence 1111 1112222222 255689999999999999999999887666543 466676553 345566677777654 Q ss_pred cCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccceecc Q lcl|Aclame:pro 84 DAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGGFHVNI 163 (332) Q Consensus 84 ~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~~~~i~~ 163 (332) ...++..++++...+.. ....|.+-=-.++.+|+.+.+.++.++++++..|+.|+.-+.+.+ T Consensus 75 ~~~~~~~~i~l~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~----------------- 136 (293) T protein:vir:48 75 IDDPKLSLIKYTIKRYA-GISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGVVDKLP----------------- 136 (293) T ss_pred ccccceeEEEEeeeEEE-EeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhcccccc----------------- Confidence 33466777788777653 345665532334578899999999999999999998875432110 Q ss_pred ccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeeeeceEEEe Q lcl|Aclame:pro 164 GAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIAGIRILK 243 (332) Q Consensus 164 ~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i~G~~V~~ 243 (332) +..... -++.|.++..+|..+..+. -..+++|..|..|.+.+|.. ++ +.- ...+.+|. -++++|.+|+. T Consensus 137 ~~~~~~----~~d~i~~~~~~l~~~~~~~--a~~vmn~~~~~~L~~lkd~~--g~-~l~-~~~~~~~~-~~~l~G~Pv~~ 205 (293) T protein:vir:48 137 TKPTLT----KWDDIIDLEAKVDPAIKQT--SFFLTNTSGFTALKKVKNAL--GD-YLM-ERDVKSPT-GYSIAGFAVKE 205 (293) T ss_pred cccccc----CHHHHHHHHHhhhhhhcCC--CEEEEcHHHHHHHHHhhccC--Cc-eEe-ecCcCCCC-CceecceeeEE Confidence 011111 2677778888887665532 34578999999886644431 11 111 11233443 56899999987 Q ss_pred eCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHH--HHHHHHHhCCcee Q lcl|Aclame:pro 244 SNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGD--LIVGKLAMGCGSL 321 (332) Q Consensus 244 sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d--~i~~~~~~G~~vl 321 (332) +.+.+.... ..|...-+-++|++.. ..+...+++++..+... ..+..| .+++..++|.+++ T Consensus 206 ~~~~~~~~~-------~~~~~~~~~gd~~~~~---------~~~~~~~~~i~~~~~~~-~~~~~~~~~~r~~~r~d~~~~ 268 (293) T protein:vir:48 206 ISDRWLPNA-------SSGVMPLYFGDLKQAV---------TLFDRQQMSLLSTNIGG-GAFETDTTKVRVIDRFDVVAT 268 (293) T ss_pred ecccccCCc-------cCCceEEEEEeccceE---------EEEEecceEEEEecccc-hhhhcCeEEEEEEEeeCcEEe Confidence 655442111 1111122333443322 22222344555443211 111222 3566778999999 Q ss_pred chhheeeeecC Q lcl|Aclame:pro 322 RTSVAGSFQAA 332 (332) Q Consensus 322 rpe~~v~i~~A 332 (332) +|++.+.++-+ T Consensus 269 ~~~a~~~l~~~ 279 (293) T protein:vir:48 269 DTEAFVPASFK 279 (293) T ss_pred cccceEEEEee Confidence 99998877633 No 105 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=99.17 E-value=1.4e-12 Score=85.55 Aligned_cols=284 Identities=11% Similarity=0.037 Sum_probs=159.3 Q ss_pred CCC-----cccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEeccc--ceeee Q lcl|Aclame:pro 1 MTT-----LSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTG--KLSAG 73 (332) Q Consensus 1 m~~-----~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG--~~t~~ 73 (332) |.. ..+..+ ..+...++++ .+.-+.|+.++.+..+..+.++++++..+.. +.++.+|+.. ..++. T Consensus 121 ~~~~~~~~~~~~~~----~~~~~~~~~g---~lvp~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~a~ 192 (418) T protein:vir:10 121 RVRVDRKSIMNVPA----TVGSGVSGSN---SLVVADRQAGIIAPPQRKMTIRDLLMPGQTS-SSSIEYTVETGFTNNAA 192 (418) T ss_pred hhhhHHHHHHHhhh----hccCCCCCCc---cccchhHHHHHHHHHhhhhhHHhhcceeecc-CCceeEEEEecCCCcee Confidence 110 001111 1111222222 3777999999999999999999999877655 5567788753 34555 Q ss_pred eecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|Aclame:pro 74 YHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVT 153 (332) Q Consensus 74 ~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~ 153 (332) ....|+..... +++.+++++...+.- .-..|++ +-.+...++.+.+.++.++++++..|..++.- ..++.... T Consensus 193 ~v~E~~~~~~~-~~~f~~v~~~~~k~~-~~~~is~-ell~ds~~l~~~i~~~l~~a~~~~~d~a~l~G----~g~~~~p~ 265 (418) T protein:vir:10 193 AVAEGAQKPTS-DLKFNLKNQPVRTIA-HLFKASR-QILDDAPALQSYIDGRARYGLQLTEEGQILKG----DGTGANIL 265 (418) T ss_pred eeccCcccccc-ccceeeEEEeeeeEE-EeehhhH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHHhcc----CCCCcccc Confidence 66677766543 567777777777643 2344654 23344568889999999999999999988731 11111111 Q ss_pred cccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhcccccccccccccccee Q lcl|Aclame:pro 154 GEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGL 233 (332) Q Consensus 154 ~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v 233 (332) |.-....... ..........++.|+++...+...+.+.. .+|++|..|..|..-+|.. .+ +... ....+. - T Consensus 266 Gi~~~~~~~~-~~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~v~n~~~~~~L~~lkd~~--G~-~i~~--~~~~~~-~ 336 (418) T protein:vir:10 266 GILPQASAFM-PSITLANATPIDKIRLALLQAVLAEFPAT--GIVLNPIDWASIELTKDSQ--GR-YIVG--NPVNGT-T 336 (418) T ss_pred cccccccccc-ccccccccccHHHHHHHHHhhccccCCCC--EEEEcHHHHHHHHHhhcCC--Cc-eecc--ccccCC-C Confidence 1100000000 01111122236777788777776665443 3678999998886544432 11 2111 123332 5 Q ss_pred eeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHH--HHH Q lcl|Aclame:pro 234 YSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGD--LIV 311 (332) Q Consensus 234 ~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d--~i~ 311 (332) +.++|++|+.|+++|... .+-++|+... +++.+ .++++++.+... ..+.-| .++ T Consensus 337 ~~l~G~pV~~~~~~p~~~--------------~~~gd~s~~~-~~~~~--------~~~~i~~~~~~~-~~f~~~~~~~r 392 (418) T protein:vir:10 337 PRLWNLPVVETQAMTANE--------------FLVGAFSMAA-QIFDR--------MEIEVLLSTENV-DDFEKNMVSIR 392 (418) T ss_pred ceecceeeEEcCCCCCCc--------------EEEeeccceE-EEEEe--------cceEEEEecccc-hhhhcCceEEE Confidence 689999999999999421 1234444322 22222 233444432211 111112 234 Q ss_pred HHHHhCCceechhheeeeecC Q lcl|Aclame:pro 312 GKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 312 ~~~~~G~~vlrpe~~v~i~~A 332 (332) +.+.++.++++|++.+.+.-. T Consensus 393 ~~~~~d~~~~~~~a~~~~~~~ 413 (418) T protein:vir:10 393 AEERLALAVYRPESFVTGALV 413 (418) T ss_pred EEEeeccEEecccceEEEEec Confidence 556789999999998766554 No 106 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=99.16 E-value=4.2e-12 Score=82.96 Aligned_cols=292 Identities=16% Similarity=0.142 Sum_probs=151.0 Q ss_pred CCC--c--ccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccc-cccccccccceEEEecc-cceeeee Q lcl|Aclame:pro 1 MTT--L--SNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGL-VRSYDLRGGKSKQFMFT-GKLSAGY 74 (332) Q Consensus 1 m~~--~--~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~-v~~r~~~~G~tv~i~~i-G~~t~~~ 74 (332) |.. . ..+.| .. +...++++ .|.=+.+.+++.+..+..++++.+ .+.-+...| .+++|+. +.+.+.- T Consensus 52 ~a~~~~~~~~~~~--a~--~~~~~~Gg---~lvP~~~~~~ii~~l~~~s~l~~lg~~~v~~~~g-~~~~p~~t~~~~a~w 123 (366) T protein:vir:57 52 FAATELGDTGLSM--AI--STAAGSGG---ALIPQNMQNEVIELLRDRTVVRILGARSIPLPNG-NLSMPRLSGGATAGY 123 (366) T ss_pred HHHHhhcchhhhh--hc--cccccCCc---cccchhHHHHHHHHHhhhcchhhhceeeeecCCC-ceEEEEEeCCcceee Confidence 100 0 00111 00 11111122 144578899999999888888766 333233444 4788866 5667777 Q ss_pred ecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 75 HTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTG 154 (332) Q Consensus 75 ~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~ 154 (332) ...|..+... +++.+++++..-+. +.-..|.+-=-.++.+++.+.+.+++++++++..|+.++.- ..++....| T Consensus 124 v~E~~~~~~s-~~~f~~i~~~~~k~-~~~~~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G----~G~~~~p~G 197 (366) T protein:vir:57 124 VGEGKDVVAT-GATFDDVKLSAKTM-IALVPVSNQLIGRAGFNVEQLLLGDILSAIATREDKAFLRD----DGTGDTPKG 197 (366) T ss_pred eccCcccccc-ccceeEEEEeeEEE-EEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhcc----CCCCccccc Confidence 7778777654 56677777766654 33445654222356789999999999999999999987731 111111111 Q ss_pred c---cc-cceeccccccccCHHHHHHHHHH-HHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccc Q lcl|Aclame:pro 155 E---PG-GFHVNIGAGNTNDAQAIVDGFFE-AAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNS 229 (332) Q Consensus 155 ~---~~-~~~i~~~~~~~~~~~~~~d~i~~-a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~ 229 (332) . .+ .......++...+... ++.+.+ +.......+....+-..+++|..|..|.+-+|.. ..+ +.. T Consensus 198 i~~~~~~~~~~~~~~~t~~~~~~-~~~~~~~~~~~~~~~~~~~~~a~~vmn~~~~~~L~~lkd~~---G~~------l~~ 267 (366) T protein:vir:57 198 MKAVATAANRLVAWTGTAINLTT-IDEYLDSLILKHMDSNSNMIRCGWGLSNRTYMTLFGLRDGN---GNK------VYP 267 (366) T ss_pred eeeccccccceeeccccccchhh-HHHHHHHHHHhhhccccccccCEEEecHHHHHHHHhhhccC---Cce------ecc Confidence 0 00 0000001111112222 222222 2222222222222333468999999886543321 111 111 Q ss_pred cceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecc----cc--- Q lcl|Aclame:pro 230 GKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGD----FN--- 302 (332) Q Consensus 230 g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~----~~--- 302 (332) ...-+.++|++|+.|+.+|...+.. .+...-|-++|+... +..+ .+++++..++- .+ T Consensus 268 ~~~~g~l~G~Pvv~s~~ip~~~~~~------~~~~~i~~gdfs~~~--i~~~--------~~i~i~~~~ea~~~~~~g~~ 331 (366) T protein:vir:57 268 EMSQGILKGYPIQRTSAIPANLGDD------GNESEIYFCDFNDVV--IGED--------GMMKVDFSTEATYKDADGQL 331 (366) T ss_pred CCCCCeecceeeEEccccccccccC------CCccEEEEEecceEE--EEEe--------cceEEEEeeccccccccccc Confidence 1112478999999999999643221 111223456666532 2222 22333333221 00 Q ss_pred -hhHHHH--HHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 303 -VQYQGD--LIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 303 -~~~~~d--~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) -.|+.| .|+..+.++.+++||++.+.|..+ T Consensus 332 ~~~f~~~~~~iR~~~~~d~~v~~~~a~~~lt~~ 364 (366) T protein:vir:57 332 VSAFARNQSLIRVVTEHDIGFRHPEGLVLGTGV 364 (366) T ss_pred hhhhhcCceeEEeeeeeCcEeeccccEEEEecc Confidence 012222 466777899999999998888777 No 107 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=99.15 E-value=1.1e-11 Score=80.76 Aligned_cols=295 Identities=16% Similarity=0.110 Sum_probs=156.8 Q ss_pred CC-----------------Cc--ccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccc-cccccccccc Q lcl|Aclame:pro 1 MT-----------------TL--SNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGL-VRSYDLRGGK 60 (332) Q Consensus 1 m~-----------------~~--~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~-v~~r~~~~G~ 60 (332) |. .. ....+.+..+.+. .+++. .+.=+.+..++++..+..++++.+ ++..+..+| T Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t-~~~gg---~~vP~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~- 175 (435) T protein:vir:14 101 MVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLS-PGAGG---VLVPENLSSEVIELLRPKSVVRKLGARTLPLSNG- 175 (435) T ss_pred HHHHHHhhcchhhHHHHHHHhhhhhhhhhhhcccCC-cCCCc---cccchhHHHHHHHHHhhhchhhhhcceeeecCCC- Confidence 00 00 0000111111111 11121 133478888999988888887776 333343444 Q ss_pred eEEEecc-cceeeeeecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHh--chhHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 61 SKQFMFT-GKLSAGYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFS--QYSTRAEVSKQIGEALATHYDER 137 (332) Q Consensus 61 tv~i~~i-G~~t~~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~--~~d~~~~~~~~~~~aLa~~~D~~ 137 (332) .+++|+. +.+...-...|..+.. .+++..++++...+. +.-+.|.+-=-.++ ..++.+.+..+.++++++..|+. T Consensus 176 ~~~~p~~~~~~~a~~v~E~~~~~~-~~~~f~~i~~~~~k~-~~~~~iS~ell~ds~~~~~l~~~i~~~l~~ai~~~~d~a 253 (435) T protein:vir:14 176 NITIPRLKGGAIVGYIGADTDIPT-TQQQFDDLKLTAKKM-AALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKA 253 (435) T ss_pred ceEEEEEeCCcceeeeccCccccc-cccceeEEEeeeEEE-EEeehhhHHHHHhhccCHHHHHHHHHHHHHHHHHHHHHH Confidence 5788876 5566666666766654 346667777777654 33345654111123 34588889999999999999998 Q ss_pred HHHHHHHHhhhcccccccc---ccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCch Q lcl|Aclame:pro 138 IARVLAKASAEASPVTGEP---GGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTN 214 (332) Q Consensus 138 i~~~~~~aa~~~~~~~~~~---~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~ 214 (332) |+.- ..++....|.. ....+ ...........+++.+.++...+...+.-.....++++|..|..|...+|.. T Consensus 254 ~l~G----~G~~~~p~Gi~~~~~~~~~-~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~ 328 (435) T protein:vir:14 254 FIRD----DGTANTPKGLRFWALPSNV-ITASDASTLQKIETDLGKVILALENADANLTQPGWIMAPRTFRFLEGLRDGN 328 (435) T ss_pred hhcc----CCCCccccceeecccccce-eccccccchhhHHHHHHHHHHHhhhccccccCCEEEEcHHHHHHHHHhhccC Confidence 8621 11111011100 00001 1112223344455666677666666654334456689999998886544321 Q ss_pred hhccccccccccccccceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhcccee Q lcl|Aclame:pro 215 ILNREIGNSQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTI 294 (332) Q Consensus 215 ~~~~d~~~~~~~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~ 294 (332) ..+.-. ... -+.++|++|+.++.+|...+... ....-+-++|+... ++.+.. +++ T Consensus 329 ---G~~l~~--~~~----~g~l~G~Pv~~~~~~p~~~~~~~------~~~~i~~gd~s~~~--i~~~~~--------~~~ 383 (435) T protein:vir:14 329 ---GNKVYP--ELA----NGMLKGYPVGKTTQVPINLGETG------KESEIYFTDFGDVF--IGEEET--------LEI 383 (435) T ss_pred ---Cceecc--CCC----CCeeecceeEeeccccccccCCC------ccceEEEeecccEE--EEEecc--------cEE Confidence 111110 011 24789999999999996432211 11123445665532 333322 233 Q ss_pred eeeecccc-------hh-HH--HHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 295 QTTSGDFN-------VQ-YQ--GDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 295 e~~~~~~~-------~~-~~--~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) +..++-.- .. |+ --.++..++++.++.||++.+.|.-| T Consensus 384 ~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~~ 431 (435) T protein:vir:14 384 DYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLAGV 431 (435) T ss_pred EEeccccccccccchhhhhhcChhheeeeeeeCceeecccceEEEecC Confidence 33221100 00 11 14567888999999999998888888 No 108 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=99.15 E-value=1.8e-12 Score=85.01 Aligned_cols=288 Identities=9% Similarity=0.018 Sum_probs=157.5 Q ss_pred CCCc-cccccc----ccccccccccccCchhhHHHH-HHhHHHHHHHHHhhhhccccccccccccceEEEecc-cceeee Q lcl|Aclame:pro 1 MTTL-SNFSLP----NQANGGARNADYDVRYATALK-LFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-GKLSAG 73 (332) Q Consensus 1 m~~~-~~~~r~----~~~~~~~~~~~~d~~~al~~e-~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i-G~~t~~ 73 (332) |-.- ....|. .-...+...++++ +..+ ++...+.+..+..++++.+.++.+..++..+.||+. |...+. T Consensus 93 ~r~~~~~~~r~~~~~~~~~~~t~~~~g~----~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~p~~~~~~~a~ 168 (390) T protein:vir:62 93 LRAGNLGEARSFEFAPEKRDGTKAGNPN----VLSRTLYGQLIAQAVERSAIMRGGATTFTTSDANPLDFTVITGRSSAS 168 (390) T ss_pred HhhhhhhhhHHHHhhhhhhcccccCCCc----cccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCccee Confidence 0000 000010 0001111111222 3444 444555556667788888887777777788999966 556777 Q ss_pred eecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|Aclame:pro 74 YHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVT 153 (332) Q Consensus 74 ~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~ 153 (332) ....|..+... +++..++++.+-+. +.-..|.+-=-.++.+|+.+.+.++.++++++..|+.++. +......+. T Consensus 169 wv~E~~~~~~~-~~~f~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~----G~G~p~Gi~ 242 (390) T protein:vir:62 169 IVGETAEIPES-YPATAQRSMGGFKY-GFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFIT----GTGQPRGIL 242 (390) T ss_pred eeccccccccc-ccceeeeEeeeeeE-EeehHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhhhc----cCCcccccc Confidence 77778877654 56778888888765 3444565532334677899999999999999999998763 110100010 Q ss_pred cccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhcccccccccccccccee Q lcl|Aclame:pro 154 GEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGL 233 (332) Q Consensus 154 ~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v 233 (332) ............ .......++.|.++...|+.+... +-..|++|..|..|.+-+|.+ ..+.- +..+..|. - T Consensus 243 ~~~~~~~~~~~~--~~~~~~~~~~l~~~~~~l~~~~~~--~a~~vmn~~~~~~L~~lkd~~---g~~l~-~~~~~~g~-~ 313 (390) T protein:vir:62 243 TDASPATATFLA--TDTDSKVSDALIDLFHEVPSAYRA--NAKYVVNDLRAAQMRKLKDAN---GQYLW-QSGLTVGA-P 313 (390) T ss_pred ccccccccceec--ccccccchHHHHHHHHhhhhhhhc--CCEEEEchHHHHHHHHhhccC---CCeee-cCCcCCCc-c Confidence 010000001111 111122367777887778766543 334578999999886533321 11211 12234443 4 Q ss_pred eeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHH Q lcl|Aclame:pro 234 YSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGK 313 (332) Q Consensus 234 ~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~ 313 (332) ..++|++|+.++++|... -+-++|+.. +++.+. ++.++...+.+ ..+=...+++. T Consensus 314 ~~l~G~Pv~~~~~~p~~~--------------i~~gd~s~~--~i~~~~--------~~~v~~~~~~~-~~~~~~~~~~~ 368 (390) T protein:vir:62 314 SLFNGKVVETDDGMPADK--------------ILFADLSKY--RVRFAG--------SLRVDRSVDAK-FSTDQIVYRFL 368 (390) T ss_pred ceecccceEEecCCCCcc--------------EEEeeccce--eEEeec--------ceEEEeecccc-ccCCcEEEEEE Confidence 579999999999998421 122455432 222222 22333322211 00001234677 Q ss_pred HHhCCceechhheeeeecC Q lcl|Aclame:pro 314 LAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 314 ~~~G~~vlrpe~~v~i~~A 332 (332) +++|+++++|++...|.-+ T Consensus 369 ~r~d~~~~~~~A~~~l~~~ 387 (390) T protein:vir:62 369 QRADGLLVDARGAKVLTVT 387 (390) T ss_pred EEeCcEeechhheEEEEee Confidence 8899999999998766644 No 109 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=99.12 E-value=1.9e-12 Score=84.78 Aligned_cols=295 Identities=11% Similarity=-0.017 Sum_probs=152.3 Q ss_pred CCCcc------cccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc-cceeee Q lcl|Aclame:pro 1 MTTLS------NFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-GKLSAG 73 (332) Q Consensus 1 m~~~~------~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i-G~~t~~ 73 (332) |..-. .+......+.+...++++ .+.-+.+..++.+..+..+.++++.++.++.++ ..++|+. +.+.+. T Consensus 143 ~~~~~~~~~~~~~~~~~a~~~~~~~~~g~---~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~a~ 218 (458) T protein:vir:10 143 VMEKGVFETEHGQRHLKAVNQSSSVEVSS---ESYETIFSQRIIRDLQKELVVGALFEELPMSSK-ILTMLVEPDAGKAT 218 (458) T ss_pred HHhhccchhhhhhhhhhhhhhcccCcccc---ceehhhHhHHHHHHHHhhhhHHhhcceeecCCc-ceEEEEecCCccee Confidence 00000 000000011112222222 367789999999999999999988877666554 4555543 445554 Q ss_pred eecCCCCCCcc-----CCCCCceEEEEEeeeeecc-hhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 74 YHTPGTPIVGD-----AGIKANEKTLVMDDLLVSS-QFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASA 147 (332) Q Consensus 74 ~~~~g~~~~~~-----~~~~~~~~~l~ID~~~~~~-~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~ 147 (332) ....|...... ..++..++++.. .++.. ..|.+-=-..+.+++.+.+.++++++|++..|+.++.- .+ T Consensus 219 ~v~e~~~~~~~~~~~~~~~~~~~i~~~~--~k~~~~v~is~ell~ds~~~~~~~i~~~l~~~i~~~~d~~~l~G--~G-- 292 (458) T protein:vir:10 219 WVAASTYGTDTTTGEEVKGALKEIHFST--YKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMTG--DG-- 292 (458) T ss_pred ecccccccccccccccccccceeeEeee--eeEEeeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcC--CC-- Confidence 44444433321 123344444444 44444 34544222234588999999999999999999988631 11 Q ss_pred hccccc--cccccceecc--ccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCc--hhhccccc Q lcl|Aclame:pro 148 EASPVT--GEPGGFHVNI--GAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDT--NILNREIG 221 (332) Q Consensus 148 ~~~~~~--~~~~~~~i~~--~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~--~~~~~d~~ 221 (332) +..|.+ ..++...... ..+........|+.|+++...|..+... +-.+|++|..|..|...+|. +.+-... T Consensus 293 ~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~--~~~~v~~~~~~~~l~~lkd~~G~~i~~~~- 369 (458) T protein:vir:10 293 SGKPKGLLTLASEDSAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLK--LSKLVLIVSMDAYYDLLEDEEWQDVAQVG- 369 (458) T ss_pred CCccceeeecccccccceeecccccccccccHHHHHHHHHhhhhhhcC--CCEEEEcHHHHHHHHhhcccCCceeeccc- Confidence 111110 0000000000 0111111122378888888888777653 23357899999887654332 1111100 Q ss_pred cccccccccceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeeccc Q lcl|Aclame:pro 222 NSQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDF 301 (332) Q Consensus 222 ~~~~~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~ 301 (332) .......|. ..+++|.+|+.++.+|..++... -+-++|.... ++ +...+++++ ++++ T Consensus 370 -~~~~~~~~~-~~~l~G~pv~~~~~~p~~~~~~~----------~~~~~f~~~~-~~--------~~~~~~~v~--~d~~ 426 (458) T protein:vir:10 370 -NDSVKLQGQ-VGRIYGLPVVVSEYFPAKANSAE----------FAVIVYKDNF-VM--------PRQRAVTVE--RERQ 426 (458) T ss_pred -cccccccCc-CceecceeeEEccccccccCCcc----------eEEEEecccE-EE--------EEeeceEEE--eecc Confidence 112223332 46899999999999996432211 1122332211 11 122333333 2222 Q ss_pred chhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 302 NVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 302 ~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) ... --..++...++|..+.+|++.|....| T Consensus 427 ~~~-~~~~~~~~~r~~~~v~~~~a~v~~~~a 456 (458) T protein:vir:10 427 AGK-QRDAYYVTQRVNLQRYFANGVVSGTYA 456 (458) T ss_pred cCC-CceEEEEEEEecceEecccceEEEeec Confidence 111 012245567889999999999988877 No 110 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=99.10 E-value=5.4e-12 Score=82.35 Aligned_cols=293 Identities=9% Similarity=0.061 Sum_probs=153.7 Q ss_pred CC-----CcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEec-ccceeeee Q lcl|Aclame:pro 1 MT-----TLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMF-TGKLSAGY 74 (332) Q Consensus 1 m~-----~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~-iG~~t~~~ 74 (332) |- ....+-+- ....+..++++ .+.=++|..++++..+..+.++++.+..+..++ +.++|. .+.+.+.- T Consensus 91 lr~~~~~~~~~~e~~--a~~~~~~~~GG---~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~a~w 164 (401) T protein:vir:44 91 LRKGREDGLRDLERK--ALQVGTDEDGG---YAVPEELDRSILSLLKDEVVMRQEATVITVGGS-DYKKLVNLGGTASGW 164 (401) T ss_pred HhhhhhhhhHHHHHH--HhhcCCCCCCc---eeccHhHHHHHHHHHHhhhhhhhhceeeecCCC-ceEEEEecCCcccee Confidence 10 00000000 01122222232 134489999999999999999998887766544 455554 45555555 Q ss_pred ecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc- Q lcl|Aclame:pro 75 HTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVT- 153 (332) Q Consensus 75 ~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~- 153 (332) ...|.........+.+++++.+-+. ..-..|.+---..+..|+.+.+.++.++++++..|+.++.- .++ ..|.+ T Consensus 165 v~E~~~~~~~~~~~~~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~ai~~~~~~~~l~G--~G~--~~p~Gi 239 (401) T protein:vir:44 165 VGETDTRSQTATSRLGLIEPFMGEI-YGNPQATQKMLDDAFFNVEAWINSELATEFAEQEEIAFTTG--DGT--KKPKGF 239 (401) T ss_pred eccccccCccccccceeeeeehhhe-eeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhcc--CCC--Ccccee Confidence 5556544433234556666666553 22234554222345678999999999999999999888731 111 11100 Q ss_pred -cccccceec-------ccc-ccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhcccccccc Q lcl|Aclame:pro 154 -GEPGGFHVN-------IGA-GNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQ 224 (332) Q Consensus 154 -~~~~~~~i~-------~~~-~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~ 224 (332) ......... ... .........|+.|+++...|...... +-+++++|..|..|...+|.. ++... + T Consensus 240 l~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~d~i~~~~~~l~~~~~~--~a~~v~n~~~~~~L~~lkd~~--G~~l~--~ 313 (401) T protein:vir:44 240 LAYESTEESDKARAFGKLQHIVSGEATAVTADAIIKLIYTLRKAHRT--GAKFMMNNNSLFAIRLLKDTE--GNYLW--R 313 (401) T ss_pred eccccccccccccccccccccccccccccCHHHHHHHHHhcchhhhc--CCEEEEcHHHHHHHHHhhccC--Cceee--c Confidence 000000000 000 00011112367788887777655432 334578999998886544432 11111 1 Q ss_pred ccccccceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchh Q lcl|Aclame:pro 225 GDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQ 304 (332) Q Consensus 225 ~~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~ 304 (332) ..+..|. -++++|.+|+.++++|..+.... .-+-++|+... .++.+ +++++. ++++-.+ T Consensus 314 ~~~~~g~-~~~l~G~PVv~~~~~p~~~~~~~---------~i~~Gd~~~~~-~i~~~--------~~~~~~--~~~~~~~ 372 (401) T protein:vir:44 314 PGLELGQ-PSSLAGYGIAENEQMPDIAADAK---------AIAFGNFKRGY-TIVDR--------IGTRIL--RDPYTNK 372 (401) T ss_pred CCcCCCC-CceecceeeEEecCcCCccCCcc---------EEEEeehhccE-EEEEe--------cceEEe--eeccccC Confidence 1233443 46899999999999995432211 11224443321 12222 222322 2222111 Q ss_pred HHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 305 YQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 305 ~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) - --.+++..++|+++++|++.+.|+-+ T Consensus 373 ~-~v~~~a~~r~d~~~~~~~a~~~l~~~ 399 (401) T protein:vir:44 373 P-FVGFYTTKRTGGMLVDSQAIKLLKIA 399 (401) T ss_pred C-cEEEEEEEEeccEEecccceEEEEee Confidence 0 01245566899999999999988777 No 111 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=99.08 E-value=4.5e-11 Score=77.30 Aligned_cols=292 Identities=13% Similarity=0.102 Sum_probs=147.1 Q ss_pred CCCc----ccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccc-cccccccccceEEEecc-cceeeee Q lcl|Aclame:pro 1 MTTL----SNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGL-VRSYDLRGGKSKQFMFT-GKLSAGY 74 (332) Q Consensus 1 m~~~----~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~-v~~r~~~~G~tv~i~~i-G~~t~~~ 74 (332) |... ....|. .. ...+.++ .+-=+.+..++.+..+..++++.+ ++.-+...| .++||+. +.+++.. T Consensus 113 ~~~~~~~~~~~~~~---~~-~~~~~gg---~liP~~~~~~ii~~l~~~~~l~~~~~~~~~~~~g-~~~~p~~~~~~~a~~ 184 (428) T protein:vir:10 113 FASDELNDQSVSMA---IS-TAAGSGG---VLIPQNIHSEVIELLRDRTIVRKLGARSIPLPNG-NMSLPRLAGGATASY 184 (428) T ss_pred HhhhhhhhhhHhhh---hc-ccccCCc---cccchhHHHHHHHHHhhhchhhhhcceeeecCCc-ceEEEEEeCCcceee Confidence 1110 011111 11 1111222 122377788898888888888877 322122233 4788876 4566666 Q ss_pred ecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 75 HTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTG 154 (332) Q Consensus 75 ~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~ 154 (332) ...|...... +++.+++++...+. +.-+.|.+-=-.++..++.+.+.++++++|++..|+.++. +..++....| T Consensus 185 v~Eg~~~~~~-~~~f~~i~~~~~k~-~~~v~is~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~----G~G~~~~p~G 258 (428) T protein:vir:10 185 TGENQDAKVS-EARFDDVKLTAKTM-IAMVPISNALIGRAGFNVEQLVLQDILTAISVREDKAFMR----DDGTGDTPIG 258 (428) T ss_pred eccCcccccc-ccceeeEEeeeEEE-EEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhc----cCCCCccccc Confidence 7777777643 56777777777654 3345565532234678899999999999999999998863 1111111111 Q ss_pred cc----ccce-eccccccccCHHHHHHHHHHHHHHHHh-cCCCcCCCEEEEChHHHHHHHhhcCchhhcccccccccccc Q lcl|Aclame:pro 155 EP----GGFH-VNIGAGNTNDAQAIVDGFFEAAAVLDE-RSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMN 228 (332) Q Consensus 155 ~~----~~~~-i~~~~~~~~~~~~~~d~i~~a~~~Lde-~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~ 228 (332) .- .... +..++....+... .+...++...+.. .+.....-..+++|..|..|...+|. +..+.-. ... T Consensus 259 i~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~---~G~~i~~--~~~ 332 (428) T protein:vir:10 259 MKARATQWNRLLPWAADAAVNLDT-IDTYLDSIILMSMDGNSNMISSGWGMSNRTYMKLFGLRDG---NGNKVYP--EMA 332 (428) T ss_pred cccccccccccccccccccccHHH-HHHHHHHHHHhhhccccccccCEEEEcHHHHHHHHHhhcc---CCceecc--CCC Confidence 10 0001 1111111222211 1222222221111 11212222346799999888654332 1111110 111 Q ss_pred ccceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccc------ Q lcl|Aclame:pro 229 SGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFN------ 302 (332) Q Consensus 229 ~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~------ 302 (332) -+.++|.+|+.++.+|...+.. .....-|-++|+... ++- ..+++++..++-.. T Consensus 333 ----~g~l~G~pv~~~~~~p~~~~~~------~~~~~i~~gd~s~~~--i~~--------~~~i~i~~~~~~~~~~~~~~ 392 (428) T protein:vir:10 333 ----QGMLKGYPIQRTSAIPANLGEG------GKESEIYFADFNDVV--IGE--------DGNMKVDFSKEASYIDTDGK 392 (428) T ss_pred ----CCeeeceeeEEeccccccccCC------CccceEEEEecceEE--EEE--------ecceEEEeeccccccccccc Confidence 2368999999999999643221 111223445555432 111 12223333221100 Q ss_pred --hhHHH--HHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 303 --VQYQG--DLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 303 --~~~~~--d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) ..|+- -.+++..+++.++.||++.+.+... T Consensus 393 ~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~t~~ 426 (428) T protein:vir:10 393 LVSAFSRNQSLIRVVTEHDIGFRHPEGLVLGTGV 426 (428) T ss_pred ccchhhcchhheeeeeeeCceeeccceEEEEecc Confidence 01121 2457788899999999987777776 No 112 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=99.08 E-value=2.1e-11 Score=79.08 Aligned_cols=281 Identities=12% Similarity=0.074 Sum_probs=153.4 Q ss_pred CCCc---ccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc-c--ceeeee Q lcl|Aclame:pro 1 MTTL---SNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-G--KLSAGY 74 (332) Q Consensus 1 m~~~---~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i-G--~~t~~~ 74 (332) +-.. ....+....-.++.....+. -.+..+.|..++.+...+.+.++++++..+.. +.++.||+. | ...... T Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~ 166 (379) T protein:vir:10 89 FNDIKEVRNGKSIQVKAVGDMTLPVNL-TGAQPKDYNFDVVLNPSQMLNVSDIVGAVSIS-GGTYTFVRENGAGEGAIGA 166 (379) T ss_pred HHhHHHHHhhhhhhhhhhcccccCCCC-ccccchhhhhHHHHhHHhhhhHHhhceeeecc-CCceEEEEeecCCCccccc Confidence 0000 00000000000110111110 12456889999999999999999998877665 446788864 2 233444 Q ss_pred ecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 75 HTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTG 154 (332) Q Consensus 75 ~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~ 154 (332) ...|+.... .+++.+++++.+.++- .-..|++ +-.+....+.+.+.++.++++++..|+.++.-+. ..++ T Consensus 167 v~Eg~~~~~-~~~~f~~i~~~~~k~~-~~~~iS~-ell~D~~~l~~~i~~~la~~~~~~~~~~~~~g~~----~~~~--- 236 (379) T protein:vir:10 167 QVEGATKGQ-KDYDISMIDVNTDFIA-GFTRYSK-KMANNLPFLTSFIPNALRRDYAKAENAAFNAVLA----ANAT--- 236 (379) T ss_pred ccCCccccc-cccceeeeEeeeeeEE-eeehhhH-HHHhhHHHHHHHHHHHHHHHHHHHHHHHHhcccc----cccc--- Confidence 556666553 3567777777777653 2334544 2233334577888888999999999987764221 0000 Q ss_pred ccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceee Q lcl|Aclame:pro 155 EPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLY 234 (332) Q Consensus 155 ~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~ 234 (332) ... .+....+ .++.|.++...+.....+.. .+|++|..|..|...+|.. ++ +....+...+++... T Consensus 237 ---~~~--~~~~~~~----~~d~i~~~~~~~~~~~~~~~--~~vmn~~~~~~l~~lkd~~--G~-~l~~~~~~~~~~~~~ 302 (379) T protein:vir:10 237 ---AST--EIITNKN----KVEMLINEIAKQENLDFPVT--AIVLRPTDYYDILVTQKSV--GA-GYGLPGVVTQDNGVL 302 (379) T ss_pred ---ccc--ccccCcc----cHHHHHHHHHhhhhccCCCC--EEEEcHHHHHHHHHhhccC--Cc-eeccCCccCCCCCcc Confidence 000 1111111 25667777777777766543 3568999999987654432 12 211111111222245 Q ss_pred eeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHH--HHHH Q lcl|Aclame:pro 235 SIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGD--LIVG 312 (332) Q Consensus 235 ~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d--~i~~ 312 (332) +++|++|+.|+.+|.. .-+-++|+... +++- ++++++..++..+ .|.-| .+++ T Consensus 303 ~l~G~pvv~s~~~~ag--------------~~~~gdf~~~~-~~~~---------~~~~i~~~~~~~~-~f~~~~~~~r~ 357 (379) T protein:vir:10 303 RINGIPLFRATWLAAN--------------KYYVGDWTRVT-KVTT---------EGLSLEFSEVEGT-NFVKNNITARI 357 (379) T ss_pred eecceeeEecCCCCCC--------------ceEEeecccEE-EEEE---------eceEEEEeecccc-cccCCcEEEEE Confidence 8999999999999831 12345555432 2221 2334555443221 11112 3445 Q ss_pred HHHhCCceechhheeeeecC Q lcl|Aclame:pro 313 KLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 313 ~~~~G~~vlrpe~~v~i~~A 332 (332) ..++|..+++|++.+-+.=+ T Consensus 358 ~~R~~~~v~~p~a~v~~~~~ 377 (379) T protein:vir:10 358 EAQVALAVEQPAALIFGDFT 377 (379) T ss_pred EEEeccEEecCccEEEEEec Confidence 56899999999998887766 No 113 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=99.07 E-value=1.6e-11 Score=79.73 Aligned_cols=282 Identities=11% Similarity=-0.031 Sum_probs=156.5 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhcccccccccccc--ceEEEecc-cceeeeeecC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGG--KSKQFMFT-GKLSAGYHTP 77 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G--~tv~i~~i-G~~t~~~~~~ 77 (332) +.. ....... ....+..++++ .+.=+.|..++++..+..+.++++++..++.++ +....+.. +...+..... T Consensus 98 ~~~-~~~~~~~-~~~~~t~~~gg---~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E 172 (397) T protein:vir:48 98 VRG-RYQNLLD-SKTDASGSDAG---LTIPQDIQTAIHTLVRQYDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDE 172 (397) T ss_pred Hhh-hhhHHHH-HhhccCCcccc---ccccHHHHHHHHHHHHHHHHHHhhhceeeccCCcceEEEEeecCCCcceeeecc Confidence 111 0000000 11111112222 255589999999999999999999887766543 32222322 2234555566 Q ss_pred CCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 78 GTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPG 157 (332) Q Consensus 78 g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~ 157 (332) |..+.....++..++++.+.+. ..-..|.+-=-.++.+|+.+.+.++.++++++..|+.|+.-. +. T Consensus 173 ~~~~~~~~~~~~~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~v~~~l~~~~~~~~d~~il~G~-------------g~ 238 (397) T protein:vir:48 173 AGSIGTNDDPKLYPIRYAIKRY-AGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAI-------------AT 238 (397) T ss_pred ccccccccccceeeEEeeheee-eeehhhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcc-------------cc Confidence 7666543346677888888764 344556653223467899999999999999999999887421 00 Q ss_pred cceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeeee Q lcl|Aclame:pro 158 GFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIA 237 (332) Q Consensus 158 ~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i~ 237 (332) +.. .+..+ -++.|.++...|.....+. =.++++|..|..|...+|.+ ++ +.- ...+..|. -+.++ T Consensus 239 ~~~----~~~~~----~~d~i~~~~~~l~~~~~~~--a~~v~n~~~~~~L~~lkd~~--G~-~i~-~~~~~~~~-~~~l~ 303 (397) T protein:vir:48 239 LPT----KPTLT----KWDDIIDLQAKVDPAIKQT--SFFLTNTSGFTALKKVKNAF--GD-YLM-ERDVKSPT-GYSID 303 (397) T ss_pred ccc----ccccc----cHHHHHHHHHHhhhhhcCC--CEEEECHHHHHHHHHhhcCC--Cc-eee-ccCcCCCC-Cceec Confidence 000 01111 2677888888888776653 35578999999987654432 11 111 12233443 56899 Q ss_pred ceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeeccc-chhHHHHHHHHHHHh Q lcl|Aclame:pro 238 GIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDF-NVQYQGDLIVGKLAM 316 (332) Q Consensus 238 G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~-~~~~~~d~i~~~~~~ 316 (332) |++|+.+.+.+...+ ..+...-+-++|+. ++..+...+++++..+... ...+-...+++.+++ T Consensus 304 G~PV~~~~~~~~~~~-------~~~~~~~~~gd~~~---------~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~ 367 (397) T protein:vir:48 304 GFAVKEVADRWLANA-------SSGAMPLYFGDLKQ---------AVTLFDRQQMSLLSTNIGGGAFETDTTKIRVIDRF 367 (397) T ss_pred cceeEEecccccCCc-------CCCceEEEEEeccc---------eEEEEeecceEEEEeccchhhhhcCceeEEEEeee Confidence 999987654332111 11111122233332 2212222334455433211 111112356677889 Q ss_pred CCceechhheeeeecC Q lcl|Aclame:pro 317 GCGSLRTSVAGSFQAA 332 (332) Q Consensus 317 G~~vlrpe~~v~i~~A 332 (332) +.++++|++.+.+.-+ T Consensus 368 d~~~~~~~a~~~~~~~ 383 (397) T protein:vir:48 368 DVVATDTESFVPASFK 383 (397) T ss_pred ccEEecccceEEEEec Confidence 9999999998777744 No 114 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=99.07 E-value=2.9e-11 Score=78.37 Aligned_cols=282 Identities=13% Similarity=0.016 Sum_probs=156.2 Q ss_pred CCCccccccccc-----ccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccc-eEEEecccc--eee Q lcl|Aclame:pro 1 MTTLSNFSLPNQ-----ANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGK-SKQFMFTGK--LSA 72 (332) Q Consensus 1 m~~~~~~~r~~~-----~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~-tv~i~~iG~--~t~ 72 (332) ...+...-|.+. ....+..+++. .+.=+.|..++.+..+..+.+++++++..+..+. ++.++.... ... T Consensus 91 ~~~~~~~l~~~~~~~~~~~~~~t~~~gg---~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a 167 (397) T protein:vir:49 91 VKDFKNLVRGRYQNLLDSKTDGSGSDAG---LTIPQDIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYEKWADITGLA 167 (397) T ss_pred HHHHHHHhhcchhhHHHhhhccCCccCc---ceecHHHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEEeeccCCcce Confidence 000000001000 01111112222 2445899999999999999999998887776542 455554432 334 Q ss_pred eeecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 73 GYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPV 152 (332) Q Consensus 73 ~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~ 152 (332) .....|..+.....++.+++++...+. +.-..|.+-=-.++.+|+.+.+.+++++++++..|+.|+.-.. T Consensus 168 ~~v~E~~~~~~~~~~~~~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ail~G~g--------- 237 (397) T protein:vir:49 168 KLDDEGGQIGQNDDPKLSLIRYAIKRY-AGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIG--------- 237 (397) T ss_pred eeeccccccccccccceeeeEeeeeee-EeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccc--------- Confidence 444556655433334567777777765 3334565522234678899999999999999999998863210 Q ss_pred ccccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccce Q lcl|Aclame:pro 153 TGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKG 232 (332) Q Consensus 153 ~~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~ 232 (332) .+.+. +...+ +|.|.++...|+.+..+.. .++++|..|..|..-+|.. .+ +.- ...+..|. T Consensus 238 ~~~~~--------~~~~~----~d~i~~~~~~l~~~~~~~a--~~v~n~~~~~~l~~lkd~~--g~-~l~-~~~~~~g~- 298 (397) T protein:vir:49 238 TLPNK--------PTLAK----WDDIIDLQAKVDPAIKQTS--LFLTNTSGFTALKKVKNAM--GD-YLM-ERDVKSPT- 298 (397) T ss_pred ccccc--------ccccC----HHHHHHHHHhhhhhhcCCC--EEEEcHHHHHHHHHhhccC--Cc-eee-cccccCCC- Confidence 01110 11111 6778888888888776543 5678999999886544332 11 111 11233343 Q ss_pred eeeeeceEEEeeCc--ccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeeccc-chhHHHHH Q lcl|Aclame:pro 233 LYSIAGIRILKSNN--LAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDF-NVQYQGDL 309 (332) Q Consensus 233 v~~i~G~~V~~sn~--lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~-~~~~~~d~ 309 (332) -.+++|++|+.+.+ +|..++ +...-+-++|++ ++..+...+++++..+... ...+.... T Consensus 299 ~~~l~G~pV~~~~~~~~~~~~~---------~~~~~~~gd~~~---------~~~~~~~~~~~i~~~~~~~~~~~~~~~~ 360 (397) T protein:vir:49 299 GYSIDGFVVKEISDRFLPNGTG---------GAMPLYFGDLKQ---------AVTLFDRQHLSLLSTNIGGGAFETDTTK 360 (397) T ss_pred CceecceeeEEecccccccccC---------CceeEEEeeccc---------eEEEEeecccEEEEeccccchhhcCeee Confidence 46899999987654 442211 111112233332 2222333445555433211 11111224 Q ss_pred HHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 310 IVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 310 i~~~~~~G~~vlrpe~~v~i~~A 332 (332) +++..++|.++++|++.+.+.-+ T Consensus 361 ~~~~~r~d~~~~~~~a~~~~~~~ 383 (397) T protein:vir:49 361 VRVIDRFDVVSTDTEAFVPASFK 383 (397) T ss_pred EEEEEeeccEEecccceEEEEec Confidence 66778899999999998888744 No 115 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=99.07 E-value=6.1e-12 Score=82.05 Aligned_cols=292 Identities=10% Similarity=0.046 Sum_probs=155.0 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecccce---------e Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKL---------S 71 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG~~---------t 71 (332) +.....-.+-+-...++...... ..+-=+.+.+.+.......+.++++++..+.. +.+++||+.... . T Consensus 110 ~~~~~~~~~~~~~~~~~~~~~~~--~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 186 (419) T protein:vir:94 110 MRDIDPNRLLSRDAPAGTITNPN--VPHLPQLVPGIVPTTPDLPLLVADLLDQQNAD-YNVLEYIRDTSGTAGAGSTWNK 186 (419) T ss_pred HHHHHHHHhhccccccccccCCc--ccccchhhhHHHHHHHhhhhhhhhcceeeecc-CCceeeeeeccccccccccCcc Confidence 00000000000001122111111 11223566677777766667777887765543 556777764322 2 Q ss_pred eeeecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|Aclame:pro 72 AGYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASP 151 (332) Q Consensus 72 ~~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~ 151 (332) ......|+.... .+++..++++.+.+.- .-..|++ +-.+...++.+.+.++.++++++..|+.|+. +..+..| T Consensus 187 a~~v~Eg~~~~~-~~~~~~~i~~~~~k~~-~~~~is~-ell~d~~~l~~~i~~~la~a~~~~~d~aii~----G~G~~~p 259 (419) T protein:vir:94 187 AAVVPEGTAKPQ-STLSFDTITTTLKTVA-HWLPITR-QAADDNSQLMGYIQGRLTYGLRFLRDRQLLN----GNGSTEM 259 (419) T ss_pred cceecCCccccc-cccceeeEEeeeeeEE-EeehhhH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHHHh----ccCcccc Confidence 233344555443 3466677777777653 3345654 2233345688888999999999999998873 1111111 Q ss_pred ccc--ccccceecc-ccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhcccccccccccc Q lcl|Aclame:pro 152 VTG--EPGGFHVNI-GAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMN 228 (332) Q Consensus 152 ~~~--~~~~~~i~~-~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~ 228 (332) .+. .++...+.. ...........++.|.++...+.....+.. .++++|..|..|+..+++. +..+. ...... T Consensus 260 ~Gi~~~~~~~~~~~~~~~~~~t~~~~~~~l~~~~~~~~~~~~~~~--~~v~n~~~~~~l~~~k~~~--~~~~~-~~~~~~ 334 (419) T protein:vir:94 260 QGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPD--GVVVHPQDWESIELDQAPG--SGVFR-VIANVQ 334 (419) T ss_pred cceecccccccccccccccccccchhHHHHHHHHHhhhhccCCCC--EEEEcHHHHHHHHHHhhcC--CCcee-ecCCcc Confidence 110 011111111 111122344568889999989888877543 5689999999997655432 11111 112223 Q ss_pred ccceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHH- Q lcl|Aclame:pro 229 SGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQG- 307 (332) Q Consensus 229 ~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~- 307 (332) .+ ..+.++|++|+.++.+|... .+-++|+... +++.+ .+++++......+ .+.- T Consensus 335 ~~-~~~~l~G~pV~~~~~~~~~~--------------~~~gd~~~~~-~~~~~--------~~~~v~~~~~~~~-~~~~~ 389 (419) T protein:vir:94 335 GE-ATPRIWGLNVVSTVAIAQGT--------------ALVGGFRQGA-TLWSR--------QGITVLMTDSHAD-FFTAN 389 (419) T ss_pred cC-CCccccceeeEEcCCCCCcc--------------EEEeeccceE-EEEEe--------cceEEEEeccccc-hhhcC Confidence 33 36789999999999998421 1234444322 22222 3345544332110 1111 Q ss_pred -HHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 308 -DLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 308 -d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) ..++....+|.++++|++.+.+.-+ T Consensus 390 ~~~~r~~~r~d~~v~~~~a~~~~~~~ 415 (419) T protein:vir:94 390 TLVILAEFRANLAVYQPKAFVRVTFA 415 (419) T ss_pred cEEEEEEEeeccEEeccccEEEEEec Confidence 2345677899999999998876655 No 116 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=99.03 E-value=1.5e-11 Score=79.85 Aligned_cols=292 Identities=10% Similarity=0.058 Sum_probs=154.1 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecccc-----eeeeee Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGK-----LSAGYH 75 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG~-----~t~~~~ 75 (332) ......+.-+. ..++...++. .+.-+.|+.++.+..+..+.++++++..+..+ .++.+|+... ...... T Consensus 107 ~~~~~~~~~~~--~~~~~~~~~~---~~vp~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~~a~~v 180 (413) T protein:vir:81 107 APRVKAASDPA--STATLTDEFQ---GGYGTTWNRNIIYRRREKLVVADLMDNLTMTN-TTIKYLMEKANRVVEGGFKTV 180 (413) T ss_pred hhHHHhhhhhh--hhcccccccc---cccchhhHHHHHHHHhhhhhHHhhcceeeccC-CceeEEEecccccccccccee Confidence 00000000000 1112112222 36678899999999999999999988777654 4566665432 233445 Q ss_pred cCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 76 TPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGE 155 (332) Q Consensus 76 ~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~ 155 (332) ..|..+......+..++++.+.+.. .-+.|.+- -.+....+-+.+.++.++++++..|+.|+.- ..++.+..|. T Consensus 181 ~Eg~~~~~~~~~~f~~i~~~~~k~~-~~~~iS~e-ll~ds~~l~~~i~~~la~~~~~~~d~~~l~G----~G~~~~~~Gi 254 (413) T protein:vir:81 181 AEGGKKPYMRFADFDIVTESLSKIA-GLTKITDE-MIEDYDFLVSYINARLLEELAIEEERQLLLG----DGTGNNLTGL 254 (413) T ss_pred cCcccccccCcccceeeEeeeeeEE-EeehhhHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc----CCCCCccccc Confidence 5666654322234566777776642 23456542 2222334777888889999999999987631 1111111111 Q ss_pred cccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCch--hhc-cccccccccccccce Q lcl|Aclame:pro 156 PGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTN--ILN-REIGNSQGDMNSGKG 232 (332) Q Consensus 156 ~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~--~~~-~d~~~~~~~~~~g~~ 232 (332) .. .....+....+...+++.+.++...+..+..-..+. +|++|..|..|.+-+|.. .+- .......+..... . T Consensus 255 ~~--~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~-~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~-~ 330 (413) T protein:vir:81 255 LK--RDGIQTLAVSNKDELADSIYKAMTNISLATPFQADA-LVINPLDYQELRLAKDANGQYYGGGVFQGQYGSGGIM-L 330 (413) T ss_pred cc--ccccccccccccchhHHHHHHHHHHhhhhccCCCcE-EEEcHHHHHHHHHhhccCCceeccccccccccccccc-c Confidence 00 000011111223445777777776665554433334 578999999886544432 111 1111111111111 2 Q ss_pred eeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHH--HH Q lcl|Aclame:pro 233 LYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGD--LI 310 (332) Q Consensus 233 v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d--~i 310 (332) ..+++|.+|+.|+.+|.. ..+-++|+... +++.+ .+++++..+...+ .+.-| .+ T Consensus 331 ~~~l~G~pv~~s~~~~~~--------------~~~~gd~~~~~-~~~~~--------~~~~v~~~~~~~~-~~~~~~~~~ 386 (413) T protein:vir:81 331 DPAPWGLRTVQSQVVPVG--------------KPVVGAFRSAA-SVLRK--------GGVRIDSTNTNVD-DFENNLITV 386 (413) T ss_pred CceecceeeEEcCCCCcc--------------cEEEEecccEE-EEEEe--------cceEEEEeccccc-hhhcCcEEE Confidence 357999999999999842 12334544322 23332 2345555433211 11122 35 Q ss_pred HHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 311 VGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 311 ~~~~~~G~~vlrpe~~v~i~~A 332 (332) ++.++++..+++|++.+.+.-+ T Consensus 387 r~~~r~d~~~~~~~a~~~l~~~ 408 (413) T protein:vir:81 387 RAEERVGLMVTFPEAIVQLDVA 408 (413) T ss_pred EEEEeeccEEecccceEEEEec Confidence 5666799999999998877766 No 117 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=99.02 E-value=4.7e-11 Score=77.21 Aligned_cols=293 Identities=12% Similarity=0.060 Sum_probs=151.0 Q ss_pred CCCc-------ccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccc--cccc-cceEEEecc-cc Q lcl|Aclame:pro 1 MTTL-------SNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSY--DLRG-GKSKQFMFT-GK 69 (332) Q Consensus 1 m~~~-------~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r--~~~~-G~tv~i~~i-G~ 69 (332) .... ..+.+.. .++..+.+. -+.-+.|.+++.+..+..++++.+-... ...+ -..++||+. +. T Consensus 321 ~~~~~~~~~~~~a~~~~~---~~~~~~~Gg---~~vp~~~~~~ii~~l~~~svv~~l~~~~~~~~~~~~~~~~ip~~t~~ 394 (645) T protein:vir:93 321 PDDSRLHHVLKSAVGAGT---TTDPQWAGS---LSEYQEYAQDFIDYLRPQTIIGRFGQGGIPALRQVPFNIRVHAQVSG 394 (645) T ss_pred ccchhhhhhhhhhhhccc---cccccccCC---ccCchhhHHHHHHhhhhhhhHHhhccccccccccccCceeeeeeecC Confidence 0000 0000000 001111111 1344788999999998888887765321 2221 125678864 66 Q ss_pred eeeeeecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|Aclame:pro 70 LSAGYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEA 149 (332) Q Consensus 70 ~t~~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~ 149 (332) +++.....|..+... +++.+++++..-+. +.-..|.+-=-.++.+++.+.+.++.++++++..|+.++.--. . T Consensus 395 ~~a~wv~Eg~~~~~s-~~~f~~v~l~~~kl-a~~~~iS~ell~ds~~~~~~~i~~~l~~aia~~~d~a~l~g~g-~---- 467 (645) T protein:vir:93 395 GAAGWVGEGKTKPLT-KFDFESITFSHAKV-SAIAVLTEELIRFSSPAADALVRNALAEAVVARLDTDFVDPKK-A---- 467 (645) T ss_pred cceEEeccCcccccc-ccceeEEEEeeEEE-EEeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHhhcCCC-c---- Confidence 777777778877654 56777776666542 3334454422225668899999999999999999998873110 0 Q ss_pred cccccccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccc Q lcl|Aclame:pro 150 SPVTGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNS 229 (332) Q Consensus 150 ~~~~~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~ 229 (332) ......|.+.... ..+..+.+ ..+..+..+...|..+++...+-+.|++|..+..|...+|.. ..+.-.+ .... T Consensus 468 ~~~~~~p~gi~~~-~~~~~~~~-~~~~d~~~~~~~~~~a~~~~~~a~~vmn~~~~~~L~~lkd~~---G~~~~~~-~~~~ 541 (645) T protein:vir:93 468 AVADVSPASITHD-VKGTASSG-NPDADAEAAFGQFVAANLQPTGAVWLMSSTNALALSMRKNAL---GQKEYPD-MTLL 541 (645) T ss_pred ccCCccccceecc-cccccccc-chHHHHHHHHHHHHhcCCCccccEEEEcHHHHHHHHhccccC---CceeecC-CCCC Confidence 0111112211100 01111111 123446677777888888666666788999999987654431 1111011 0111 Q ss_pred cceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccc------- Q lcl|Aclame:pro 230 GKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFN------- 302 (332) Q Consensus 230 g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~------- 302 (332) | +.++|.+|+.|+++|.. . . -++++.. ++.....+........++++...... T Consensus 542 ~---~tL~G~PV~~s~~vp~~----~-~----------~gd~s~~--~ig~~~~v~i~~s~~a~~~~~~~~~~~~~~~~~ 601 (645) T protein:vir:93 542 G---GSFQGLPVIVSQYVGDQ----L-V----------LVNAPDI--YLADDGGVAVDMSREASLEMQSEPTGDSTTPSP 601 (645) T ss_pred C---ceeeceeeEEeccCCcc----e-e----------EeccccE--EEEEecceEEEeecceeEEEeeccccccccccc Confidence 2 47899999999999841 0 0 1122211 11111111000000111111111000 Q ss_pred ----hhHHHH--HHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 303 ----VQYQGD--LIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 303 ----~~~~~d--~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) -.|+-| .|+..+.++.+++||++.+.|.-+ T Consensus 602 ~~~v~lf~~d~vaira~~r~d~~~~~p~a~~~lt~~ 637 (645) T protein:vir:93 602 VELVSMFQTGSVAIRAERWINWRRRRTAAVAVITGV 637 (645) T ss_pred ccchhHhhcCceEEEEEEEEcceeeCccceEEEecc Confidence 012322 245556789999999998877777 No 118 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=99.02 E-value=5.2e-11 Score=76.98 Aligned_cols=282 Identities=12% Similarity=-0.005 Sum_probs=159.0 Q ss_pred CCCccccccccc-----ccccccccccCchhhHHHHHHhHHHHHHHHHhhhhcccccccccccc-ceEEEecc--cceee Q lcl|Aclame:pro 1 MTTLSNFSLPNQ-----ANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGG-KSKQFMFT--GKLSA 72 (332) Q Consensus 1 m~~~~~~~r~~~-----~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G-~tv~i~~i--G~~t~ 72 (332) +..+...-|.+. ...++..++++ .+.=+.|..++.+..+..+.++++++..++.++ .+..++.. +...+ T Consensus 91 ~~~~~~~l~~~~~~~~~~~~~~t~~~gg---~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a 167 (397) T protein:vir:49 91 VKDFKNLVRGRYQNLLDSKTDASGSDAG---LTIPQDIQTAIHTLVSQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLA 167 (397) T ss_pred HHHHHHHHhcchhHHHHHhhccccccCc---ccccHhHHHHHHHHHHhhhhHHhhhceeecccCccceEEEeeccCCcce Confidence 000000001110 01112222222 244588999999999999999999887776532 23445544 34456 Q ss_pred eeecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 73 GYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPV 152 (332) Q Consensus 73 ~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~ 152 (332) .....|..+.....++.+++++.+.+. +.-..|.+-=-.++.+|+.+.+.++.+++|++..|+.|+.-... T Consensus 168 ~~v~E~~~~~~~~~~~~~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ai~~G~g~-------- 238 (397) T protein:vir:49 168 NIDDEAGKIADVDDPKLSLIKYTIKRY-AGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIAA-------- 238 (397) T ss_pred eeecCccccccccccceeeEEeeeeeE-EeeehhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhccc-------- Confidence 667777776532346677888887764 34445655222345689999999999999999999988742110 Q ss_pred ccccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccce Q lcl|Aclame:pro 153 TGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKG 232 (332) Q Consensus 153 ~~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~ 232 (332) +.+ .+..++ ++.|.++...|..+..+. -.+|++|..|..|...+|.. ++ +.- ...+..|. T Consensus 239 -~~~--------~~~~~~----~d~i~~~~~~l~~~~~~~--a~~vmn~~~~~~l~~lkd~~--G~-~l~-~~~~~~~~- 298 (397) T protein:vir:49 239 -LPT--------KPTLTK----WDDIIDLEAKVDPAIKQT--SFFLTNTSGFTALKKVKNAL--GD-YLM-ERDVKSPT- 298 (397) T ss_pred -ccc--------cccccc----HHHHHHHHHhhhhhhcCC--CEEEEcHHHHHHHHHhhcCC--Cc-eee-ccCcCCCC- Confidence 000 011111 577888888888777654 35678999999997654432 11 211 11233333 Q ss_pred eeeeeceEEEeeCc--ccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecc-cchhHHHHH Q lcl|Aclame:pro 233 LYSIAGIRILKSNN--LAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGD-FNVQYQGDL 309 (332) Q Consensus 233 v~~i~G~~V~~sn~--lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~-~~~~~~~d~ 309 (332) -+.++|++|+.+.+ +|..+ .+...-+-++|+.. +..+...+++++..+.. ......... T Consensus 299 ~~~l~G~PV~~~~~~~~~~~~---------~~~~~i~~gd~~~~---------~~~~~~~~~~i~~~~~~~~~~~~~~~~ 360 (397) T protein:vir:49 299 GYSIDGFAVKEVADRWLANGT---------GGAMPLYFGDLKQA---------VTLFDRQHMSLLSTNIGGGAFETDTTK 360 (397) T ss_pred CceecceeeEEeccccccccc---------CCceeEEEeeccce---------EEEEeecceEEEEeccccchhhcCcee Confidence 56899999987554 44211 11111222333332 22222234444443211 111111234 Q ss_pred HHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 310 IVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 310 i~~~~~~G~~vlrpe~~v~i~~A 332 (332) +++..++|.++++|++.+.+.-+ T Consensus 361 ~r~~~r~d~~~~~~~a~~~~~~~ 383 (397) T protein:vir:49 361 VRVIDRFDVVATDTEAFVPASFK 383 (397) T ss_pred EEEEeeeCcEEecccceEEEEee Confidence 66778899999999998887744 No 119 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=99.01 E-value=3.3e-11 Score=78.06 Aligned_cols=289 Identities=11% Similarity=0.032 Sum_probs=148.8 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc-cceeeeee---c Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-GKLSAGYH---T 76 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i-G~~t~~~~---~ 76 (332) |.......+.... +...+++. .|.=+.|..+|++..+..+.++.+.+..... | .+.+|.. +..+..-. . T Consensus 131 l~~~~~~~e~~a~--~~~t~~GG---~lvP~~~~~~Ii~~l~~~~~i~~~~~~~~~~-~-~~~~p~~~~~~~a~~~~~~~ 203 (434) T protein:vir:62 131 IVGNIDEKEARAL--GLVTGNGS---VTIPDFLSKEIITYAQEENFLRRLGTGVKTK-E-NIKYPVLVKKAEAQGHKNER 203 (434) T ss_pred hccccchhhhhhh--cccccccc---eecchhhHHHHHHhhhhhhhhhhhcceeccC-C-ceEEEEEecCCcccceeccc Confidence 1110000111111 11111222 1444889999999999999999888764433 3 4677765 23333222 2 Q ss_pred CCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|Aclame:pro 77 PGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEP 156 (332) Q Consensus 77 ~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~ 156 (332) .|..... .+++..++++.+-+. +.-+.|.+-=-.++.+|+.+.+.++.+++|++..|+.|+. +..+..+..+. T Consensus 204 e~~~~~~-~~~~f~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~d~~~l~----G~G~~~~~~g~- 276 (434) T protein:vir:62 204 TNNEMPE-TDIEFDEIELSPTEF-DALATVTKKLLARTGLPIEQIVMDELKKAYVRKETQYMVN----GDEANNINDGA- 276 (434) T ss_pred ccccccc-cccceeeEEeeheee-EeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhc----cCCCCccccce- Confidence 3433332 245556666666553 2234454422224567999999999999999999998873 11111111111 Q ss_pred ccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeee Q lcl|Aclame:pro 157 GGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSI 236 (332) Q Consensus 157 ~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i 236 (332) ....+....+.....++.|+++...|+....+ ...| |++|..|..|..-+|.+ .+ +.-.......++.-..+ T Consensus 277 ---~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~~~-~a~~-v~n~~~~~~L~~lkd~~--G~-~l~~~~~~~~~g~~~tl 348 (434) T protein:vir:62 277 ---LAKKAVEFKTDEKNLYDALVKMKNTPVKEVRK-KARW-VLNTAALTKIETMKTDD--GF-PLLRPFNQAEGGIGYTL 348 (434) T ss_pred ---eecccccccccccchhhHHHHHHhhcchhhhc-CCEE-EEcHHHHHHHHHhhccC--CC-EeeccCCCccCCCCcee Confidence 11111112223334588888888888766554 3345 68999999886544432 11 11100011122224579 Q ss_pred eceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHH-H--HHHHH Q lcl|Aclame:pro 237 AGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQG-D--LIVGK 313 (332) Q Consensus 237 ~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~-d--~i~~~ 313 (332) +|.+|+.++.+|...+... ..-|-+||+... ++-+.. .+.++... ..++. + .+++. T Consensus 349 ~G~pV~~~~~~~~~~~~~~--------~~i~~Gdfs~~~--i~~~~g-------~~~i~~~~----~~~~~~~~v~~~~~ 407 (434) T protein:vir:62 349 LGFPVEEEDAIDIPDSPDT--------PVFYFGDFSKFY--IQDVIG-------SLEVQKLV----ELFSRTNRVGFRIW 407 (434) T ss_pred cceeeEEecCccCccCCCc--------eEEEEeeccceE--EEEeec-------eeEEEeeh----hhhcccCceEEEEE Confidence 9999999999985322111 112335555432 221111 11222111 11221 1 25667 Q ss_pred HHhCCceec-hhheeee----ecC Q lcl|Aclame:pro 314 LAMGCGSLR-TSVAGSF----QAA 332 (332) Q Consensus 314 ~~~G~~vlr-pe~~v~i----~~A 332 (332) .++.+++++ |++..++ +.| T Consensus 408 ~r~Dgk~i~~~~~~~~~~~~~~~~ 431 (434) T protein:vir:62 408 NLLDAQLIHSPFEVPVYKYVLKAP 431 (434) T ss_pred eeecceeecCcccceEEEEEeccC Confidence 788888775 8876555 445 No 120 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=99.01 E-value=1.8e-11 Score=79.54 Aligned_cols=273 Identities=11% Similarity=-0.003 Sum_probs=158.1 Q ss_pred CCCc--ccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecccc---eeeeee Q lcl|Aclame:pro 1 MTTL--SNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGK---LSAGYH 75 (332) Q Consensus 1 m~~~--~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG~---~t~~~~ 75 (332) |-.. ..-.|.+ ...++++ .|.=+.+..++++..+..+.++++++..++.++ +.++|.... ...... T Consensus 104 ~~~~~~~~~~ra~-----~t~~~gg---~liP~~~~~~Ii~~~~~~~~l~~l~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 174 (421) T protein:vir:13 104 IRGIQLSEEERDI-----MSSTNNG---AVIPQEFVNEFEKLKEGYPSLKEHCHVIPVNRN-AGKMPVRAGASVDKLANL 174 (421) T ss_pred hhccchhHHHhhc-----cccCCcc---eecchhhHHHHHHHHHhhhhhhhhceeeeccCC-ceEEEEeecCCccceeec Confidence 1000 0011221 1111122 144488899999999999999999887776544 456664422 234445 Q ss_pred cCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 76 TPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGE 155 (332) Q Consensus 76 ~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~ 155 (332) ..|..+... .++..++++.+.+. +.-..|.+-=-..+.+++.+.+.++++++++...|..++....... T Consensus 175 ~E~~~~~~s-~~~f~~i~~~~~k~-~~~v~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~~~~g~~--------- 243 (421) T protein:vir:13 175 AKDTELVKA-MLKTQPMAYDIDDY-GLLAPIDNSLLEDSEINFLEFVNEEFAEFAVNTENAEIVKQAKAVL--------- 243 (421) T ss_pred ccccccccc-ccceeEEEeeeeee-EeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHhhhhHhhhhhhcc--------- Confidence 566665543 56677777777764 3334565522234567899999999999999999988875432110 Q ss_pred cccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeee Q lcl|Aclame:pro 156 PGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYS 235 (332) Q Consensus 156 ~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~ 235 (332) +.....+ ++.|.++...|..+..+.. .+|++|..|..|...+|. +..+.-. ....|. -.. T Consensus 244 --------~~~~~~~----~d~i~~~~~~l~~~~~~~a--~~v~n~~~~~~l~~lkd~---~G~~i~~--~~~~~~-~~t 303 (421) T protein:vir:13 244 --------AEETIND----YAGLVKTINSLVPNARKRA--IIVTNSDGRAYLDGLMDK---QGRPLLK--ELSDGG-DLV 303 (421) T ss_pred --------ccccccc----hHHHHHHHHHhhhhhcCCC--EEEEcHHHHHHHHHhhcC---CCceeec--CcCCCC-Cce Confidence 0011111 5677778778877666532 457899999998764443 1112211 123343 468 Q ss_pred eeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHH Q lcl|Aclame:pro 236 IAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLA 315 (332) Q Consensus 236 i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~ 315 (332) ++|.+|+.++++|..++ +...-+-++|++..- ++ ...+++++..++.+ ..+---.+++..+ T Consensus 304 l~G~pV~~~~~~~~~~~---------~~~~~~~gd~~~~~~-~~--------~~~~~~v~~~~~~~-f~~~~~~~r~~~r 364 (421) T protein:vir:13 304 FKGRPVIELEESIFDVG---------DETKFIVSDFKTLIK-FM--------DRKQYLIDQSKEAG-YTKNETIARIIER 364 (421) T ss_pred ecceeeEEeccccccCC---------CceEEEEEeccccEE-EE--------EecceEEEeecccc-cccCeeEEEEEee Confidence 99999999999985322 112233444444221 12 22344555544321 0111124667789 Q ss_pred hCCceechhheeeeecC Q lcl|Aclame:pro 316 MGCGSLRTSVAGSFQAA 332 (332) Q Consensus 316 ~G~~vlrpe~~v~i~~A 332 (332) ++.++++|+++..+..+ T Consensus 365 ~d~~~~~~~a~~~~~~~ 381 (421) T protein:vir:13 365 FDVNSPLDKSSDAEKIR 381 (421) T ss_pred ecceeecchhhheeeec Confidence 99999999998777666 No 121 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=99.00 E-value=1.3e-11 Score=80.22 Aligned_cols=274 Identities=12% Similarity=0.017 Sum_probs=151.4 Q ss_pred CCCccc-ccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc--cceeeeeecC Q lcl|Aclame:pro 1 MTTLSN-FSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT--GKLSAGYHTP 77 (332) Q Consensus 1 m~~~~~-~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i--G~~t~~~~~~ 77 (332) +..... .......+.+...+++. .+-=+.|..++++..+..+.++++++..+..++ +..+|.. +......+.. T Consensus 120 ~~~~~~~~~~~~~~~~~~~~~~gg---~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~E 195 (400) T protein:vir:38 120 AVLRAVPTDASDAVNAGVKAADAA---STIPETISNTPQRELQTVVDLKPFTNVFQASTQ-KGTYPTVANATTKMVTVAE 195 (400) T ss_pred hhhhhhhHHHHHHHhhcccccCCc---ccccHHHHHHHHHHHHhhhhhhhcceeEeccCc-ceEEEEEecCCCccccccc Confidence 000000 00000011111222222 144588999999999999999999887766544 3455543 4555566666 Q ss_pred CCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 78 GTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPG 157 (332) Q Consensus 78 g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~ 157 (332) |........+..+++++.+.+. +.-..|.+-=-..+.+|+.+.+.++.+++|+...|+.|+..... T Consensus 196 ~~~~~~~~~~~f~~i~~~~~k~-~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~------------- 261 (400) T protein:vir:38 196 LEKNPAMAKPEFKPVNWSVETY-RQALPVSQESIDDSAIDLVGLIAQNGQQIKVNTTNGAVATLLKG------------- 261 (400) T ss_pred cccccccccccceeeEeehhhe-eeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhhhhcccc------------- Confidence 6555433345666666666543 23344554222245678999999999999999999887643210 Q ss_pred cceeccccccccCHHHHHHHHHHHHH-HHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeee Q lcl|Aclame:pro 158 GFHVNIGAGNTNDAQAIVDGFFEAAA-VLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSI 236 (332) Q Consensus 158 ~~~i~~~~~~~~~~~~~~d~i~~a~~-~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i 236 (332) +. +..... ++.|.++.. .++. ...-.+|++|..|..|.+-+|.+ ..+.- ...+..|. -+++ T Consensus 262 ~~-----~~~~~~----~~~~~~~~~~~~~~----~~~a~~v~~~~~~~~l~~lkd~~---G~~i~-~~~~~~~~-~~~l 323 (400) T protein:vir:38 262 FT-----AKTISS----VDDLKHINNVDLDP----AYSRVIIASQSFYNFLDTVKDGN---GRYLL-QDSILTPS-GKSV 323 (400) T ss_pred cc-----cccccc----HHHHHHHHHhhhhh----hhCcEEEEcHHHHHHHHHhhccC---CCeee-ecCcCCCC-cccc Confidence 00 011111 344444332 2221 22345678999999986544321 11211 11233332 4689 Q ss_pred eceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHHh Q lcl|Aclame:pro 237 AGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAM 316 (332) Q Consensus 237 ~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~~ 316 (332) +|++|+.+++.|..+. |...-+-++|++..- ++ ...+++++..++ .+|...+++.+++ T Consensus 324 ~G~pv~~~~~~~~~~~---------g~~~~~~gd~s~~~~-~~--------~~~~~~~~~~~~----~~~~~~~~~~~r~ 381 (400) T protein:vir:38 324 LGMPIAVVSDDTLGAA---------GEAHAFLGDIKRAIL-FA--------NRADFMVRWVDD----QIYGQFLQAGMRF 381 (400) T ss_pred ccceeEEecccccCCC---------CceEEEEEeccccEE-EE--------eecceEEEEecc----cccceeEEEEEEe Confidence 9999999999985321 212223445444321 12 223344554433 2345678889999 Q ss_pred CCceechhheeeeecC Q lcl|Aclame:pro 317 GCGSLRTSVAGSFQAA 332 (332) Q Consensus 317 G~~vlrpe~~v~i~~A 332 (332) |+++++|++.+.|.-+ T Consensus 382 d~~~~~~~a~~~l~~~ 397 (400) T protein:vir:38 382 GVSVADEKAGYFLTYT 397 (400) T ss_pred ccEEecccceEEEEee Confidence 9999999997766554 No 122 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=99.00 E-value=6.9e-11 Score=76.29 Aligned_cols=287 Identities=12% Similarity=-0.009 Sum_probs=155.9 Q ss_pred CCCccccccccc--ccccccccccCchhhHHHHHHhHHHHHHHHHhhhhcccccccccccc-ceEEEeccc-ceeeeeec Q lcl|Aclame:pro 1 MTTLSNFSLPNQ--ANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGG-KSKQFMFTG-KLSAGYHT 76 (332) Q Consensus 1 m~~~~~~~r~~~--~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G-~tv~i~~iG-~~t~~~~~ 76 (332) +..+....|-+. ....+..++++ .+.=+.|.+++.+..+..+.++++++..++.++ -+..++..+ .+.+.... T Consensus 76 ~~~~~~~l~~~~~~a~~~~t~~~gg---~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~ 152 (371) T protein:vir:81 76 VEAFVNHIRTRFRNAMSEGSNQDGG---YTVPQDIQTRINELRESKDALQNLITVEPVTTLSGSRVFKKRSQQTGFVEVA 152 (371) T ss_pred HHHHHHHHHHHHHHhhccCCCccCc---eeecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCcceeeec Confidence 111111111000 01111112222 255588999999999999999999987777543 344555544 45777778 Q ss_pred CCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|Aclame:pro 77 PGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEP 156 (332) Q Consensus 77 ~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~ 156 (332) .|+.......++.+++++...+.- ....|.+-=-..+.+|+.+.+.++.++++++..|+.|+.-.. + +.+ T Consensus 153 Eg~~~~~~~~~~f~~i~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~a~~~~~~~~i~~g~g----~-----~~~ 222 (371) T protein:vir:81 153 EGAAIGEKATPQFTLLQYQVKKYA-GFFRVTNELLNDSTEAIVNTLVRWIGDESRVTRNGLIINVLN----T-----KAK 222 (371) T ss_pred cccccccccccceeeEEeeeeEEE-EeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc----c-----ccc Confidence 887764333456677777777642 334565522223467899999999999999999988765211 0 000 Q ss_pred ccceeccccccccCHHHHHHHHHHHH-HHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeee Q lcl|Aclame:pro 157 GGFHVNIGAGNTNDAQAIVDGFFEAA-AVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYS 235 (332) Q Consensus 157 ~~~~i~~~~~~~~~~~~~~d~i~~a~-~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~ 235 (332) ....+ ++.+..+. ..|....- .+-.+|++|..|..|...+|.. .+ +.- ...+..|. -+. T Consensus 223 ---------~~~~~----~~~i~~~~~~~l~~~~~--~~a~~vmn~~~~~~L~~lkd~~--g~-~l~-~~~~~~~~-~~~ 282 (371) T protein:vir:81 223 ---------TAIAD----LDGLKQIINVQLDPVFR--STSSVIVNQDAFNWLDTLKDQN--GQ-YLL-QPSISSPT-GRQ 282 (371) T ss_pred ---------ccccc----HHHHHHHHHhhcchhhh--cCCEEEEcHHHHHHHHHhhccC--CC-eee-ecccCCCC-Cce Confidence 01111 33444433 23433332 2235678999999987644321 11 111 11233342 578 Q ss_pred eeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHH--HHHHHH Q lcl|Aclame:pro 236 IAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQG--DLIVGK 313 (332) Q Consensus 236 i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~--d~i~~~ 313 (332) ++|.+|+.++++|...... .+...+...-+-++|+..+- .+...+++++..+... ..|.- -.+++. T Consensus 283 l~G~pV~~~~~~~~~~~~~--~~~~~~~~~i~~Gd~~~~~~---------~~~~~~~~i~~~~~~~-~~f~~~~v~~~~~ 350 (371) T protein:vir:81 283 LLGLPVVIVSNKVLANRVD--GGTGAQFAPIIVGDLKEAVV---------MFDRQRTEIMSSNVAM-DAFETDATLWRAI 350 (371) T ss_pred ecceeEEEecccccCcccc--ccccCCcceEEEEehhceEE---------EEeecceEEEEecccc-chhhcCceEEEEE Confidence 9999999999998532211 11122222233444443221 1122333444432211 11111 245667 Q ss_pred HHhCCceechhheeeeecC Q lcl|Aclame:pro 314 LAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 314 ~~~G~~vlrpe~~v~i~~A 332 (332) +++|.++++|++.+.+.-+ T Consensus 351 ~r~d~~~~~~~a~~~~~~~ 369 (371) T protein:vir:81 351 ERMDVKMRDDEAFVFGEVQ 369 (371) T ss_pred EeeccEEecccceEEEEEe Confidence 7889999999998766555 No 123 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=98.98 E-value=8.8e-11 Score=75.72 Aligned_cols=282 Identities=9% Similarity=-0.046 Sum_probs=154.3 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhcccccccccccc-ceEEEeccc--ceeeeeecC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGG-KSKQFMFTG--KLSAGYHTP 77 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G-~tv~i~~iG--~~t~~~~~~ 77 (332) +.......+- ....+..++++ .+.=+.|..++++..+..+.++++++..++.++ .+..++... ......... T Consensus 105 ~~~~~~~e~~--a~~~~t~~~gg---~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E 179 (404) T protein:vir:39 105 MAFLNTVSSK--TETSGSDSAAG---LTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAE 179 (404) T ss_pred hhhhhhhhhh--hhhcccccCCc---eeccHHHHHHHHHHHHhhhhHHhhcceeeccCCcceEEEEeecCCccceeeecC Confidence 1111011110 01122223333 255689999999999999999999988777654 344455443 344555666 Q ss_pred CCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 78 GTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPG 157 (332) Q Consensus 78 g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~ 157 (332) |..+.....++..++++.+.+.- ..+.|.+-=-..+.+|+.+.+.++.++++++..|+.|+.-. | . T Consensus 180 g~~~~~~~~~~f~~i~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~g~-----------g--~ 245 (404) T protein:vir:39 180 DGKIPDLDNPRLTIIKYLIKRYA-GIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAM-----------G--T 245 (404) T ss_pred ccccccccccceeeEEeeeeeEE-eeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhcc-----------c--c Confidence 76654323456778888888753 44456653223457889999999999999999999887421 0 0 Q ss_pred cceeccccccccCHHHHHHHHHHHHH-HHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeee Q lcl|Aclame:pro 158 GFHVNIGAGNTNDAQAIVDGFFEAAA-VLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSI 236 (332) Q Consensus 158 ~~~i~~~~~~~~~~~~~~d~i~~a~~-~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i 236 (332) +.. .+...+ ++.+.++.. .++....+ +-.+|++|..|..|...+|.. ++ +.. ...+..+. ..++ T Consensus 246 ~~~----~~~~~~----~~~i~~~~~~~~~~~~~~--~a~~v~n~~~~~~L~~lkd~~--G~-~l~-~~~~~~~~-~~~l 310 (404) T protein:vir:39 246 VPK----KPTIAK----FDDVITMINTSVDPAIIA--TSSLLTNQSGLNKLALVKTAE--GK-YLL-EPDPTKPN-SYLI 310 (404) T ss_pred ccc----cccccc----HHHHHHHHHHhhhhhhcc--CCEEEEcHHHHHHHHHhhccC--Cc-eee-ccCcCCCC-ccee Confidence 000 011112 344444432 33333222 235688999999997654431 11 111 11233343 5689 Q ss_pred eceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccc-hhHHHHHHHHHHH Q lcl|Aclame:pro 237 AGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFN-VQYQGDLIVGKLA 315 (332) Q Consensus 237 ~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~-~~~~~d~i~~~~~ 315 (332) +|++|+.+.+.+..+. ..+...-|-++|.... ..+...+++++..+...+ .......+++.++ T Consensus 311 ~G~pV~~~~~~~~~~~-------~~~~~~~~~gd~~~~~---------~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r 374 (404) T protein:vir:39 311 KGKKVIVVADRWLPNS-------GSTVYPLYYGDMSQAI---------TLFDRENMSLLPTNIGAGAFETDTTKIRVIDR 374 (404) T ss_pred cceeEEEecccccCcc-------CCCccEEEEEeccccE---------EEEeecceEEEEeccchhhhhhceeeEEEEee Confidence 9999998765332111 1111112334444322 112223445554332111 1111234667788 Q ss_pred hCCceechhheeeeecC Q lcl|Aclame:pro 316 MGCGSLRTSVAGSFQAA 332 (332) Q Consensus 316 ~G~~vlrpe~~v~i~~A 332 (332) ||.++++|++.+.+.-. T Consensus 375 ~d~~~~~~~a~~~~~~~ 391 (404) T protein:vir:39 375 FDVKTTDSEALVAGSFT 391 (404) T ss_pred eccEEecccceEEEEee Confidence 99999999998887733 No 124 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=98.98 E-value=6.9e-11 Score=76.28 Aligned_cols=280 Identities=11% Similarity=-0.018 Sum_probs=158.2 Q ss_pred CCCcc--cc------cccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhcccccccccccc-ceEEEec-ccce Q lcl|Aclame:pro 1 MTTLS--NF------SLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGG-KSKQFMF-TGKL 70 (332) Q Consensus 1 m~~~~--~~------~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G-~tv~i~~-iG~~ 70 (332) |..-. +- .+......++..+++. .+.=+.|.+++.+..+..+.+.++++..++.++ ..+.+++ .+.+ T Consensus 102 ~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg---~lvP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 178 (397) T protein:vir:12 102 LRGKRLTDEERDLLDSPEFRAMSGINDEDGG---ILIPEDIGRQIHEFKRQFEPLEQYVTVEPVTTRSGTRLLEKNADMV 178 (397) T ss_pred HhccCCcHHHHHHHhhhhhhhccccccccCc---ccCchhHHHHHHHhhhhhhhHHhhcceeeccCCceeEEEEEecCCc Confidence 11000 00 0010111222222332 244599999999999999999999887777643 3455554 4667 Q ss_pred eeeeecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc Q lcl|Aclame:pro 71 SAGYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEAS 150 (332) Q Consensus 71 t~~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~ 150 (332) .+..+..|..+.....++.+++++...+.- .-..|++-=-..+.+|+.+.+.++.+++|++..|..|+.-.. T Consensus 179 ~a~~v~Eg~~~~~~~~~~~~~v~~~~~k~~-~~~~is~e~l~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~g------- 250 (397) T protein:vir:12 179 PFSPVEELGNLPEIDQPRFTKVSYSIIDYG-GIMTLSNSMLNDSDQAIMTYVAKWFAKKSVVTRNNLILAAIA------- 250 (397) T ss_pred ceeeecccccccccccccceeEEeeheeeE-eeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhccc------- Confidence 777888887765433456677777776643 334565532234567899999999999999999998874210 Q ss_pred ccccccccceeccccccccCHHHHHHHHHHHH-HHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccc Q lcl|Aclame:pro 151 PVTGEPGGFHVNIGAGNTNDAQAIVDGFFEAA-AVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNS 229 (332) Q Consensus 151 ~~~~~~~~~~i~~~~~~~~~~~~~~d~i~~a~-~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~ 229 (332) .+.+. + .+. ++.|.++. ..|+...- .+-.++++|..|..|...+|.+ ++ +.. ...+.+ T Consensus 251 --~~~~~------g---~~~----~~~i~~~~~~~l~~~~~--~~a~~~~n~~~~~~L~~lkd~~--G~-~l~-~~~~~~ 309 (397) T protein:vir:12 251 --SLKKV------D---IDG----LDGIKKALNVTLDPMVA--PGSIVLTNQDGYDWLDTLKDGT--GR-YLL-QPDPTN 309 (397) T ss_pred --ccccc------c---ccc----HHHHHHHHhhccchhhh--CCCEEEEcHHHHHHHHHhhccC--Cc-eee-cccccC Confidence 01110 1 111 45555544 24443332 2344678999999987654432 11 211 112344 Q ss_pred cceeeeeeceEEEeeCcc-cccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHH-- Q lcl|Aclame:pro 230 GKGLYSIAGIRILKSNNL-AGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQ-- 306 (332) Q Consensus 230 g~~v~~i~G~~V~~sn~l-p~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~-- 306 (332) |. -+.++|.+|+.+++. |..+ .|...-+-++|++.+ ..+...+++++..+..+. .|. T Consensus 310 g~-~~~l~G~pv~~~~~~~~~~~---------~~~~~~~~gd~~~~~---------~~~~~~~~~i~~~~~~~~-~f~~~ 369 (397) T protein:vir:12 310 PT-KKLLDGRPVVPFTNRVLKTQ---------KGKAPLIIGNLKEAI---------VLFDREQQSIASTDTGAG-AFETN 369 (397) T ss_pred CC-CccccceeeEEecccccccC---------CCccEEEEEehhceE---------EEEeecceEEEEeccccc-hhhcC Confidence 43 568999999988763 3221 111222334444322 112223445554433221 111 Q ss_pred HHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 307 GDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 307 ~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) ...+++.+.++.++++|++.+.+.-+ T Consensus 370 ~~~~r~~~r~d~~~~~~~a~~~~~~t 395 (397) T protein:vir:12 370 STKVRGIEREDVRKWDEDAVVFGQIT 395 (397) T ss_pred ceEEEEEEeeccEEecccceEEEEEe Confidence 23566778899999999999888777 No 125 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=98.97 E-value=1.5e-11 Score=79.88 Aligned_cols=280 Identities=13% Similarity=0.081 Sum_probs=158.7 Q ss_pred CCCcccccccccccccccccccCchhhHHH-HHHhHHHHHHHHHhhhhcc--c-cccccc-----cccceEEEecccce- Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATAL-KLFSGEVFTAFNNASIFKG--L-VRSYDL-----RGGKSKQFMFTGKL- 70 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~-e~f~g~V~~~f~~~s~~~~--~-v~~r~~-----~~G~tv~i~~iG~~- 70 (332) |++ ..|+.. + +++ |+|..+|.+...+.+.|.. . ++...+ .+|+++.||..+.. T Consensus 1 Ma~--~~T~l~------d---------~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~~~~l~ 63 (330) T protein:vir:10 1 MAN--ELTKIL------D---------TITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGLLVNMPFWNDLT 63 (330) T ss_pred CCC--CceEee------e---------eechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhcCCCEEEecccccCC Confidence 887 556652 1 454 8999999999887766532 1 222122 36999999988754 Q ss_pred -eeeeecCCC-CCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|Aclame:pro 71 -SAGYHTPGT-PIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAE 148 (332) Q Consensus 71 -t~~~~~~g~-~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~ 148 (332) ....+..|. .++ +..++.++..-+|=. .-..|.+.|+-...+-.|.+.++.+|.+...++..+..++..+.+.-.. T Consensus 64 G~~~~~~dg~~~i~-~~ki~t~~~~a~i~~-~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~l~gvf~~ 141 (330) T protein:vir:10 64 GDSEVLGNGDKALE-TGKITAGADIACVLY-RGRGWAANELTGVVAGSDPVRAILNRIGAYWLREDQKALIATLNGIFAT 141 (330) T ss_pred CcccccCCCccccc-hhhcccceeEEEEEe-ecceeeehhhhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHHHHhhhhh Confidence 455666664 565 456776766666554 4567889998888888899999999999999999999888877543322 Q ss_pred ccccc-cccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccc Q lcl|Aclame:pro 149 ASPVT-GEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDM 227 (332) Q Consensus 149 ~~~~~-~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~ 227 (332) ..... ...........++. .+..-++.|.+|..+|.++. ..-..+++.|..|..|.+. ++++.-. .+++ T Consensus 142 ~~~~~~~~~~~~~~~~~~~~--~a~~s~~~l~~A~~~~GD~~--~~~~~ivmhS~v~~~L~~~---~li~~~~-~s~~-- 211 (330) T protein:vir:10 142 GTAGEKGALEETHVSDQSKA--STGIDAGMVLDAKQLLGDSA--DQVTAIAMHSAVYTKLQKD---NLIQYIQ-PTTA-- 211 (330) T ss_pred hhcccchhhhhhheeccccc--ccccCHHHHHHHHHHhcccc--ccceEEEEcHHHHHHHHHh---hhhhhhc-cccc-- Confidence 11000 00000011000000 01111466888888886664 3446889999999999753 4444211 1111 Q ss_pred cccceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhcc---ceeeeeecccch- Q lcl|Aclame:pro 228 NSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVA---PTIQTTSGDFNV- 303 (332) Q Consensus 228 ~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~---~~~e~~~~~~~~- 303 (332) .+ .++.++|.+|+.+..+|...+...+. +|-+-|++..+..+ +.+|.-|+.... T Consensus 212 -~~-~i~~~~G~~VivdD~~p~~~~~yt~y--------------------l~~~GAi~~~~~~~~~~v~~EtdRd~~~g~ 269 (330) T protein:vir:10 212 -TI-NIPTYLGYRVIIDDGIAPTGDIYTSY--------------------LFRTGSIGLNTGNPSGLTTFETSREAAKGN 269 (330) T ss_pred -Cc-ccccccceEEEEeCCCCCCCCceeEE--------------------EEecCceeeecccCCccccccccCCccccc Confidence 22 48899999999999999654333222 23333333322111 233333332211 Q ss_pred -hHHHHHHHHHHHhCCceech---------hheeeeecC Q lcl|Aclame:pro 304 -QYQGDLIVGKLAMGCGSLRT---------SVAGSFQAA 332 (332) Q Consensus 304 -~~~~d~i~~~~~~G~~vlrp---------e~~v~i~~A 332 (332) ..+.+.-..+|.+|.+-.-+ .- -+|.++ T Consensus 270 ~~l~~r~~~~~hp~G~s~~~~~~~~~~~sPt~-~~L~~~ 307 (330) T protein:vir:10 270 DMIYTRRALVMHPYGVKWTGAEVDAGNITPSN-ADLAKF 307 (330) T ss_pred eEEEEeeEEEeeeeeeeecccccccCcCCcCh-HHhcCC Confidence 01111122334455544322 11 122222 No 126 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=98.97 E-value=8.5e-11 Score=75.80 Aligned_cols=278 Identities=12% Similarity=0.058 Sum_probs=149.8 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc--cceeeeeecCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT--GKLSAGYHTPG 78 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i--G~~t~~~~~~g 78 (332) ...... +-+ ...++..+++. .+.=+.|..++++..+..+.++++++..+..++ +.++|.. +.........| T Consensus 101 ~~~~~~--~~~-~~~~~t~~~gg---~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~E~ 173 (394) T protein:vir:10 101 HSHGKV--IDN-AAGHVTSTEAG---VLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTP-KGTYPILKRATDRFSSVAEL 173 (394) T ss_pred hccchh--hhh-hhcccccccCc---eeccHHHHHHHHHHHHhhhhhhhhceeeeccCC-ceEEEEEecCCCcccccccc Confidence 000000 000 01112222222 244588999999999999999999887766544 4555544 44555555555 Q ss_pred CCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Q lcl|Aclame:pro 79 TPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGG 158 (332) Q Consensus 79 ~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~~ 158 (332) ........++..++++.+-+.- .-..|.+-=-.++.+++.+.+.+++++++++..|+.|+..... +.+ T Consensus 174 ~~~~~~~~~~~~~v~l~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~---------~~~-- 241 (394) T protein:vir:10 174 AENPALAEPEFEQVDWSVSTYR-GAIPLSEEAIADSAVDLTSLVGQSINEKSVNTYNAMIAPVLQS---------FTA-- 241 (394) T ss_pred ccccccccccceeEEeeeeeeE-eeehhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcccc---------ccc-- Confidence 5544323456677777776643 3345655333346789999999999999999999988653211 000 Q ss_pred ceeccccccccCHHHHHHHHHHHHH-HHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccc-cccccccccccceeeee Q lcl|Aclame:pro 159 FHVNIGAGNTNDAQAIVDGFFEAAA-VLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNRE-IGNSQGDMNSGKGLYSI 236 (332) Q Consensus 159 ~~i~~~~~~~~~~~~~~d~i~~a~~-~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d-~~~~~~~~~~g~~v~~i 236 (332) .+..... .++.|.++.. .++.+. .-.+|++|..|..|..-+|.. .+. +...-.....++.-+++ T Consensus 242 ----~~~~~~~----~~d~l~~~~~~~~~~~~----~a~~vmn~~~~~~l~~lkd~~--G~~i~~~~~~~~~~~~~~~~L 307 (394) T protein:vir:10 242 ----KATTTDT----LVDSLKHILNVDLDPAY----SRALVVTQSLFNTLDTLKDKN--GRYLLHDASDSITDGTAKGTV 307 (394) T ss_pred ----ccccccc----cHHHHHHHHHhhhhhhc----cCEEEecHHHHHHHHHhhccC--CCeeeeccccccccCCccccc Confidence 0111111 2455555432 333322 236789999999987644331 010 10000111122234689 Q ss_pred eceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHHh Q lcl|Aclame:pro 237 AGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAM 316 (332) Q Consensus 237 ~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~~ 316 (332) +|.+|+.+++..... ..|...-+-++|++..- ++ ...+++++..++ .+|...+++.+++ T Consensus 308 ~G~PV~~~~~~~~~~--------~~~~~~i~~gd~s~~~~-~~--------~~~~~~v~~~~~----~~~~~~~~~~~r~ 366 (394) T protein:vir:10 308 LGVPVYVVGDALLGS--------AAGDQKAFVGDLKRGVL-FA--------DRQQVTLAWEDS----KIYGRYLGAAFRF 366 (394) T ss_pred ccceeEEecccccCC--------CCCceEEEEeeccccEE-EE--------eecceEEEEecc----cccceeEEEEEEe Confidence 999999876532111 11112223445554321 22 123334443322 2244557778889 Q ss_pred CCceechhheeeeecC Q lcl|Aclame:pro 317 GCGSLRTSVAGSFQAA 332 (332) Q Consensus 317 G~~vlrpe~~v~i~~A 332 (332) ++++++|++.+.|.-. T Consensus 367 d~~~~~~~ai~~~~~~ 382 (394) T protein:vir:10 367 GVKQADSNAGYFVTNT 382 (394) T ss_pred ccEEeccccEEEEEee Confidence 9999999997665433 No 127 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=98.93 E-value=6.7e-11 Score=76.37 Aligned_cols=282 Identities=10% Similarity=0.083 Sum_probs=164.3 Q ss_pred CCCcccccccccccccccccccCchhhHHH-HHHhHHHHHHHHHhhhhcc---ccccccc-----cccceEEEecccce- Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATAL-KLFSGEVFTAFNNASIFKG---LVRSYDL-----RGGKSKQFMFTGKL- 70 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~-e~f~g~V~~~f~~~s~~~~---~v~~r~~-----~~G~tv~i~~iG~~- 70 (332) |+. |+.. .+++ |+|..+|.+.+.+.+.|.. +++...+ .+|+++.||..+.. T Consensus 1 MA~----T~ls---------------d~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~ 61 (351) T protein:vir:15 1 MAE----THLS---------------DLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLT 61 (351) T ss_pred CCc----eeee---------------eeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCC Confidence 663 4431 1455 8999999988887776532 2222122 35999999988754 Q ss_pred -eeeeecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|Aclame:pro 71 -SAGYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEA 149 (332) Q Consensus 71 -t~~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~ 149 (332) ...++..++.++. +.+...+..-+|=. .-..+.+.|+...-+-.|++.++.++.+...++..+..++..+....... T Consensus 62 Gd~~~~~~~~~i~~-~kitt~~~~a~i~~-~~kg~~~tD~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~l~gv~~~~ 139 (351) T protein:vir:15 62 GDPDNWTDSDDIDV-NNLTSGKQQGIKFY-QTKAYGYTDLGTMISGAPVQETIGNRFAAFWQRADQKTLLSVLKGVMGVT 139 (351) T ss_pred CcccccCCCcccch-heecccceeEEEEe-eccceehhhhhHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhch Confidence 5778888888874 57887877777744 45668899988888888999999999999999999999998775432221 Q ss_pred cccccccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccc Q lcl|Aclame:pro 150 SPVTGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNS 229 (332) Q Consensus 150 ~~~~~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~ 229 (332) .... .....++.....++.--++.|.+|..+|.+..= ..=..+++.|..|+.|.+. ++++.- ..+++ . T Consensus 140 ~~~~----~~~~d~t~~~~~~~~is~~~l~~A~~~~GD~~~-~~~~~ivmhS~v~~~L~~~---~li~~~-~~s~~---~ 207 (351) T protein:vir:15 140 KIAN----SKVYDQTKVSPSEPMFGAKGFTGAIGLMGDLQD-TAFGAIAVNSATYSLMKVQ---GLIETI-QPQNG---A 207 (351) T ss_pred hhcc----cceeccccccccccccCHHHHHHHHHHhccccc-cceEEEEEChHHHHHHHhh---hhhhhc-ccccc---C Confidence 1111 011111111111111125778889999855311 1136778899999999753 344321 11221 2 Q ss_pred cceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchh----H Q lcl|Aclame:pro 230 GKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQ----Y 305 (332) Q Consensus 230 g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~----~ 305 (332) . .++.++|.+|+.+..+|...... .. .....++|-+-|++..+.. +.+|..|+..... - T Consensus 208 ~-~i~t~~G~~VivdD~~p~~~~~~--------~~-------~~ytsyl~~~GAi~~~~~~-~~ve~~rd~~~~~g~d~l 270 (351) T protein:vir:15 208 T-PFEAYNGLRIVLDDDIEIDLTDK--------TK-------PVSTSYIFAPGAVRYSTNM-RSTETKYDPLINGGQDVI 270 (351) T ss_pred c-ccceecceEEEEcCCCccccCCC--------CC-------ceeEEEEEecceeeeecCC-cCcceeecccCCCCceEE Confidence 2 48999999999999999642111 00 1122345555555544432 3455555432100 0 Q ss_pred HHHHHHHHHHhCCceechhhe--------eeeecC Q lcl|Aclame:pro 306 QGDLIVGKLAMGCGSLRTSVA--------GSFQAA 332 (332) Q Consensus 306 ~~d~i~~~~~~G~~vlrpe~~--------v~i~~A 332 (332) +.+.-..+|.+|.+--.+... -+|.++ T Consensus 271 ~~r~~~~~hp~G~s~~~~~~~~~~~sPt~~~L~~~ 305 (351) T protein:vir:15 271 VQKRVGTIHVAGTSIKASFSPSKASFPTIDELAKS 305 (351) T ss_pred EEeeeeeeeeeeeeecccccccCcCCcChHHhcCC Confidence 111222355666655433211 112222 No 128 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=98.93 E-value=7.5e-11 Score=76.10 Aligned_cols=280 Identities=10% Similarity=-0.015 Sum_probs=152.9 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhcccccccccccc-ceEEEecccce-eeeee-cC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGG-KSKQFMFTGKL-SAGYH-TP 77 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G-~tv~i~~iG~~-t~~~~-~~ 77 (332) +.......+. ....+..++++ .+.=+.|..++++..+..+.++++++..++.++ .++.++..... ....+ .. T Consensus 105 ~~~~~~~~~~--a~~~~~~~~gg---~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E 179 (408) T protein:vir:74 105 MAFLNTVSSK--TETSGSDSAAG---LTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSSGSRVYEKWTDVTPLKAMDEE 179 (408) T ss_pred hhhhhhhhhh--hhcccccCCCc---eeechhHhhHHHHHHhhhcchhhhcceeeccCCcceEEEEeecCCccccccccc Confidence 0000000000 11122222222 145689999999999999999999988777654 35566655432 23333 34 Q ss_pred CCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 78 GTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPG 157 (332) Q Consensus 78 g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~ 157 (332) |+.+.....++.++++++..+. +.-..|.+-=-..+.+|+.+.+.++.+++|++..|+.|+.-. | . T Consensus 180 ~~~~~~~~~~~~~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~-----------G--~ 245 (408) T protein:vir:74 180 DGKIPDLDNPRLTIIKYLIKRY-AGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAM-----------G--T 245 (408) T ss_pred ccccccccccceeeEEeeeeeE-EeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcc-----------c--c Confidence 5554432345667777777764 333456553333467789999999999999999999876311 0 0 Q ss_pred cceeccccccccCHHHHHHHHHHHH-HHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeee Q lcl|Aclame:pro 158 GFHVNIGAGNTNDAQAIVDGFFEAA-AVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSI 236 (332) Q Consensus 158 ~~~i~~~~~~~~~~~~~~d~i~~a~-~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i 236 (332) +.. .+...+ ++.|.++. ..|+.+..+. -.+|++|..|..|...+|.. ..+.- ...+..|. -+.+ T Consensus 246 ~~~----~~~~~~----~~~i~~~~~~~l~~~~~~~--a~~v~n~~~~~~l~~lkd~~---G~~l~-~~~~~~~~-~~~l 310 (408) T protein:vir:74 246 VPK----KPTIAN----FDDVITMINTSVDPAIIAT--SSLLTNQSGLNKLALVKTAE---GKYLL-EPDPTKPN-SYLI 310 (408) T ss_pred ccc----cccccc----HHHHHHHHHHhhhhhhcCC--CEEEEcHHHHHHHHHhhcCC---CceEe-ccCcCCCC-Ccee Confidence 000 011112 44455543 4565555442 34578999999887644321 11211 11233343 4689 Q ss_pred eceEEEeeCc--ccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeeccc-chhHHHHHHHHH Q lcl|Aclame:pro 237 AGIRILKSNN--LAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDF-NVQYQGDLIVGK 313 (332) Q Consensus 237 ~G~~V~~sn~--lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~-~~~~~~d~i~~~ 313 (332) +|.+|+.+++ +|..+. +...-+-++|+... +++ ...+++++..+... ...+....+++. T Consensus 311 ~G~pV~~~~~~~~~~~~~---------~~~~i~~gd~~~~~-~~~--------~~~~~~i~~~~~~~~~f~~~~~~~r~~ 372 (408) T protein:vir:74 311 KGKQVIVVADRWLPNSGS---------TVYPLYYGDMSQAI-TLF--------DRENMSLLPTNIGAGAFETDTTKIRVI 372 (408) T ss_pred cceeeEEecCcccccccC---------CcceEEEEehhccE-EEE--------EecceEEEEeccccchhhcceeeEEEE Confidence 9999998765 443221 11112233333322 122 22344554433211 111222346677 Q ss_pred HHhCCceechhheeeeecC Q lcl|Aclame:pro 314 LAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 314 ~~~G~~vlrpe~~v~i~~A 332 (332) +++|.++++|++.+.+.-+ T Consensus 373 ~r~d~~~~~~~a~~~~~~~ 391 (408) T protein:vir:74 373 DRFDVKATDSEALVAGSFT 391 (408) T ss_pred EeeCcEEecccceEEEEee Confidence 8899999999998888754 No 129 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=98.92 E-value=4.5e-11 Score=77.31 Aligned_cols=279 Identities=10% Similarity=0.002 Sum_probs=152.0 Q ss_pred CCCcccccccccccccccccccCchhhHHHHH-HhHHHHHHHHHhhhhccc-cccccccccceEEEecc-cceeeeeecC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKL-FSGEVFTAFNNASIFKGL-VRSYDLRGGKSKQFMFT-GKLSAGYHTP 77 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~-f~g~V~~~f~~~s~~~~~-v~~r~~~~G~tv~i~~i-G~~t~~~~~~ 77 (332) |+...-..|-. ..+..+++. .|-... ++.++.+..+..++++.+ ++.-+...| .+.||+. +.+++..... T Consensus 347 ~~~~~l~~ra~---~~~t~~~gg---~lvp~~~~~~~iie~lr~~s~i~~l~~~~~~~~~g-~~~ip~~~~~~~a~wv~E 419 (632) T protein:vir:96 347 MPHEVLVQRQL---EKKTAGKGG---ELVATELLSEEFIDILRNKAIIGQMGARMLPGLVG-DVDIPKKTSGANFYWIGE 419 (632) T ss_pred hhHHHHHHhhh---hcccccccc---cccccccchHHHHHHHhhcchhhhhcceEeecCCc-ceEEEEEeCCceeEeecC Confidence 11111111211 111112222 144444 467888888878887776 333333344 5788876 5667777777 Q ss_pred CCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 78 GTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPG 157 (332) Q Consensus 78 g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~ 157 (332) |+.+... +++.+++++..-++ +.-+.|.+-=-.++.+++.+.+..++++++++..|+.++.- ..++ +.|. T Consensus 420 ~~~~~~s-~~~f~~i~l~~~k~-~~~v~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G----~G~~----~~p~ 489 (632) T protein:vir:96 420 DEDVQDS-DFDFTTLSFSPKTI-AGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTG----TGLA----NDPV 489 (632) T ss_pred Ccccccc-ccceeeEEeeeeEE-EEehhhHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhhcc----cCCC----Cccc Confidence 8777653 56666676666543 33344544222356788999999999999999999987631 1111 1121 Q ss_pred cceeccc--cccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeee Q lcl|Aclame:pro 158 GFHVNIG--AGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYS 235 (332) Q Consensus 158 ~~~i~~~--~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~ 235 (332) |.....+ +.....+...++.|.++..++...++....-..+++|..+..|... ++.+ . .+ .-+..+ +. T Consensus 490 Gi~~~~~~~~~~~~~~~~~~~~i~~~~~~i~~~~~~~~~~~~~~~~~~~~~l~~~---~l~d--~-~G-~~i~~~---~~ 559 (632) T protein:vir:96 490 GLLNMTGVPALTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKA---QVFD--N-TG-ERIWQN---NE 559 (632) T ss_pred eeeecccccceecccccCCHHHHHHHHHHHhhcccccCccEEEEchhHHHHHHHH---hccC--C-CC-ceeecC---Ce Confidence 1110000 0001111123677888888888888866655667899887666432 1111 1 11 112222 36 Q ss_pred eeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccc-hhHHHHHHHHHH Q lcl|Aclame:pro 236 IAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFN-VQYQGDLIVGKL 314 (332) Q Consensus 236 i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~-~~~~~d~i~~~~ 314 (332) +.|.+|+.||.+|... .+.++|+... +... .+++++. +++. ...-.-.++..+ T Consensus 560 l~G~pv~~s~~ip~~~--------------~~~gd~s~~~--i~~~--------~~~~i~~--~~~~~~~~~~v~~~~~~ 613 (632) T protein:vir:96 560 VNGYRAEASNQIPADT--------------WIFGDWSQIV--IAMW--------GVLDLKV--DPYTKAASDGLVLRVFQ 613 (632) T ss_pred ecccceEeccccccCc--------------EEEeecceEE--EEEe--------cceEEEE--ccccccccCceEEEEEe Confidence 7899999999999421 1223444321 1111 1222222 1111 001112455677 Q ss_pred HhCCceechhheeeeecC Q lcl|Aclame:pro 315 AMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 315 ~~G~~vlrpe~~v~i~~A 332 (332) .++.++++|++.+.++.+ T Consensus 614 ~~d~~v~~~~af~~~k~~ 631 (632) T protein:vir:96 614 DVDAGVRRKEAFCIAKKG 631 (632) T ss_pred ecCceeechhhhhheeec Confidence 889999999999999988 No 130 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=98.91 E-value=2e-10 Score=73.80 Aligned_cols=276 Identities=12% Similarity=0.056 Sum_probs=150.4 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc--cceeeeeecCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT--GKLSAGYHTPG 78 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i--G~~t~~~~~~g 78 (332) |-......+. ..++..+++. .+-=+.|..++++..+..+.++++++..++.++ +.++|.. +.........| T Consensus 99 lr~~~~~~~~---~~~~t~~~gg---~~vP~~~~~~i~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~E~ 171 (389) T protein:vir:10 99 IHSHGKVIDA---TSKVTSTEAG---VLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTP-KGTYPILKRATDRFSSVAEL 171 (389) T ss_pred hhcchhhhhh---hcccccCCcc---eeehHHHHHHHHHHHHhhhhHHhhcceeeccCC-eeEEEEEecCCCcccccccc Confidence 1110000010 1112222222 134488899999999999999999887776544 3455544 34444555555 Q ss_pred CCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Q lcl|Aclame:pro 79 TPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGG 158 (332) Q Consensus 79 ~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~~ 158 (332) ........++..++++.+.+. +.-+.|.+-=-..+.+|+.+.+.++.+++|++..|..|+.-+..+ .+ T Consensus 172 ~~~~~~~~~~~~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~---------~~-- 239 (389) T protein:vir:10 172 AENPKLAEPEFNKVDWSVATY-RGAIPLSEEAIADSAVDLTALVGQSIKEKSVNTYNAMIAPVLQSF---------TA-- 239 (389) T ss_pred ccccccccccceeeeeeheee-EeeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhhhccc---------cc-- Confidence 555433356667777777664 333455542223456789999999999999999999887543211 00 Q ss_pred ceeccccccccCHHHHHHHHHHHHH-HHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccc--cccccccceeee Q lcl|Aclame:pro 159 FHVNIGAGNTNDAQAIVDGFFEAAA-VLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNS--QGDMNSGKGLYS 235 (332) Q Consensus 159 ~~i~~~~~~~~~~~~~~d~i~~a~~-~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~--~~~~~~g~~v~~ 235 (332) .+..... .++.|.++.. .++.. .+-.++++|..|..|...+|.+ .+ +.-. -.....++.-.+ T Consensus 240 ----~~~~~~~----~~d~l~~~~~~~~~~~----~~a~~~~n~~~~~~L~~lkd~~--G~-~i~~~~~~~~~~~~~~~~ 304 (389) T protein:vir:10 240 ----KKTTTDT----LVDSLKHILNVDLDPA----YSRALVVTQSLFNTLDTLKDKN--GR-YLLHDASDSITDGTAKGT 304 (389) T ss_pred ----ccccccc----cHHHHHHHHHhhhhhh----hCcEEEecHHHHHHHHHhhccC--CC-eeeecCcccccccccccc Confidence 0111111 2455555432 33332 2346789999999997655432 11 1110 011112223568 Q ss_pred eeceEEEeeCcc-cccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHH Q lcl|Aclame:pro 236 IAGIRILKSNNL-AGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKL 314 (332) Q Consensus 236 i~G~~V~~sn~l-p~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~ 314 (332) ++|.+|+.+++. |... .|...-+-++|++...+ + ...+++++..++ .+|...+++.+ T Consensus 305 l~G~pV~~~~~~~~~~~---------~~~~~~~~gd~~~~~~~-~--------~~~~~~i~~~~~----~~~~~~~~~~~ 362 (389) T protein:vir:10 305 ILGVPVYVVGDTLLGSL---------AGDQKAFVGDLKRGVLF-T--------DRQQVTLAWEDS----KIYGKYLGAAF 362 (389) T ss_pred cccceeEEecccccCCC---------CCceEEEEeeccccEEE-E--------eecceEEEeecc----ccccceEEEEE Confidence 999999876543 3111 11112234455543211 2 223445554432 23455678888 Q ss_pred HhCCceechhheeeeecC Q lcl|Aclame:pro 315 AMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 315 ~~G~~vlrpe~~v~i~~A 332 (332) ++|+++++|++.+.+.-+ T Consensus 363 r~d~~~~~~~a~~~~~~~ 380 (389) T protein:vir:10 363 RFGVQKADSKAGYFVTNT 380 (389) T ss_pred EeccEEecccceEEEEee Confidence 999999999997766533 No 131 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=98.90 E-value=9.2e-11 Score=75.60 Aligned_cols=296 Identities=10% Similarity=-0.027 Sum_probs=153.5 Q ss_pred CCCcc-----cccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhcccccccccccc-ceEEEec-ccceeee Q lcl|Aclame:pro 1 MTTLS-----NFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGG-KSKQFMF-TGKLSAG 73 (332) Q Consensus 1 m~~~~-----~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G-~tv~i~~-iG~~t~~ 73 (332) +.... ...+-......+..+++. .+.=+.|.+++++..+..+.+.++++..++..+ .++.+|+ .+..... T Consensus 92 ~~~~~~~~~~~~~~e~~a~~~~~~~~gg---~~vP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g~~~~~~~~~~~~~~ 168 (404) T protein:vir:10 92 LKQKNQRGLNLSEKEINAISENIDEDGG---YAVPEDIQTKINTRLKDTTDLYNMVDYEPVFTRSGSRTYEKRSKQKPMK 168 (404) T ss_pred HHHHHhhhhcchhhHHhhhccccCCCCc---eeechhHHHHHHHHHhhhhhHhhhhceeeccCCccceEEEEecCCccee Confidence 11100 000000001111112222 144488899999999999999999988877643 3555665 4667777 Q ss_pred eecCCCCCCcc-CCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 74 YHTPGTPIVGD-AGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPV 152 (332) Q Consensus 74 ~~~~g~~~~~~-~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~ 152 (332) ....|...... ..++..++++...+. ..-..|.+-=-.++..++.+.+.++.++++++..|+.|+.- ..+..+. T Consensus 169 ~v~e~~~~~~~~~~~~f~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~G----~g~~~~~ 243 (404) T protein:vir:10 169 PLSENQQIPTNGDNGKLERFNFKLKDL-ADFMSIPNDLLKFADKSLEDWIINWFVDKVRITRNAEILYG----AGGDEHA 243 (404) T ss_pred eccccccccccccccceeeeEeeheee-EeeehhhHHHHhhcHHHHHHHHHHHHHHHHHHHHHHHHhhc----CCCCCcc Confidence 77777665432 234556666666654 23345554222235678999999999999999999988632 1111111 Q ss_pred ccccccceeccccccccCHHHHHHHHHHHHH-HHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccc Q lcl|Aclame:pro 153 TGEPGGFHVNIGAGNTNDAQAIVDGFFEAAA-VLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGK 231 (332) Q Consensus 153 ~~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~-~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~ 231 (332) .|......+. ..+......++.+..+.. .|....-+ .. .++++|..|..|...+|.. ++ +... ..+..| T Consensus 244 ~gi~~~~~~~---~~~~~~~~~~~~~~~~~~~~l~~~~~~-~~-~~v~n~~~~~~L~~lkd~~--G~-~l~~-~~~~~~- 313 (404) T protein:vir:10 244 TGIMTANKFK---KITLPKSPALKDFKKCKNVELLNVFKA-TS-SWIVNQDGFNYLDSLEDKT--GR-PYLQ-PDPKDP- 313 (404) T ss_pred cceeeccccc---eeeccccccHHHHHHHHHhhhhccccC-CC-EEEEcHHHHHHHHHhhccC--Cc-eeec-cCcCCC- Confidence 1111000000 111111122555555443 34433322 23 4578999999887654422 11 2111 122333 Q ss_pred eeeeeeceEEEee-CcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccc-hhHHHHH Q lcl|Aclame:pro 232 GLYSIAGIRILKS-NNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFN-VQYQGDL 309 (332) Q Consensus 232 ~v~~i~G~~V~~s-n~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~-~~~~~d~ 309 (332) ...+++|.+|+.+ +.+|..+.. ...-+-++|+. ++......+++++...+... ....... T Consensus 314 ~~~~l~G~PV~~~~~~~~~~~~~---------~~~~~~gd~s~---------~~~~~~~~~~~i~~~~~~~~~~~~~~~~ 375 (404) T protein:vir:10 314 TQYRFLGLPVIELPNDLLLSTES---------AIPVLLGDTKE---------AYKYVSDGAYELATTNIGAGAFETNTTK 375 (404) T ss_pred CCccccceeeEEecccccCCCCC---------ccEEEEEeccc---------cEEEEEecceEEEEeccccchhhcCceE Confidence 2568999999854 444432211 11122233333 22222223445554432211 0111224 Q ss_pred HHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 310 IVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 310 i~~~~~~G~~vlrpe~~v~i~~A 332 (332) +++.+++|.++++|++.+.+.-+ T Consensus 376 ~~~~~r~d~~v~~~~a~~~~~~~ 398 (404) T protein:vir:10 376 ARIIMRIDGNVKDSEALLIAEIP 398 (404) T ss_pred EEEEEeeccEEecccceEEEEee Confidence 67788999999999999877655 No 132 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=98.88 E-value=2.1e-10 Score=73.65 Aligned_cols=282 Identities=9% Similarity=-0.030 Sum_probs=153.6 Q ss_pred CCCccc-c-cccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhcccccccccccc-ceEEEecccc--eeeeee Q lcl|Aclame:pro 1 MTTLSN-F-SLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGG-KSKQFMFTGK--LSAGYH 75 (332) Q Consensus 1 m~~~~~-~-~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G-~tv~i~~iG~--~t~~~~ 75 (332) +-+... + .+-......+..++++ .+-=+.|+.++++..+..+.++++++..++..+ .++.++.... ...... T Consensus 101 ~~~~~~~~~~~~~~a~~~~t~~~gg---~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v 177 (408) T protein:vir:10 101 VRNPMAFMNTVSSKTETSGSDSAAG---LTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMD 177 (408) T ss_pred hhcchhhhhhhhhhhhhcccccCCc---eeccHhHHHHHHHHHHhhchhhhhcceeeccCCcceEEEeeccccccceeee Confidence 111000 0 0000112223333333 245589999999999999999999987766542 3455555433 444555 Q ss_pred cCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 76 TPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGE 155 (332) Q Consensus 76 ~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~ 155 (332) ..|..+.....++..++++...+.. .-..|.+-=-.++.+|+.+.+.++.++++++..|+.|+.-... T Consensus 178 ~E~~~~~~~~~~~~~~i~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~----------- 245 (408) T protein:vir:10 178 AEDGKIPDLDNPQLTIIKYLIKRYA-GIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKA----------- 245 (408) T ss_pred cCccccccccCcceeeEEeeeeeEE-eeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccc----------- Confidence 6676665333355677777776643 3345655322345789999999999999999999988743211 Q ss_pred cccceeccccccccCHHHHHHHHHHHH-HHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceee Q lcl|Aclame:pro 156 PGGFHVNIGAGNTNDAQAIVDGFFEAA-AVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLY 234 (332) Q Consensus 156 ~~~~~i~~~~~~~~~~~~~~d~i~~a~-~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~ 234 (332) +... ...++ ++.|.++. ..|+...- .. -.++++|..|..|...+|.+ ..+... ..+..|. .. T Consensus 246 --~~~~----~~~~~----~~~l~~~~~~~~~~~~~-~~-a~~v~n~~~~~~l~~lkd~~---G~~i~~-~~~~~~~-~~ 308 (408) T protein:vir:10 246 --APKK----PTIAK----FDDVITMINTAVDPAII-AT-SSLLTNQSGLNKLALVKTAE---GKYLLE-PDPTKPN-SY 308 (408) T ss_pred --cccc----ccccc----HHHHHHHHHHhhhhhhc-cC-CEEEEcHHHHHHHHHhhccC---CceEec-cCcCCCC-Cc Confidence 0000 01111 45555544 34443332 22 34578999999997655432 112211 1233443 56 Q ss_pred eeeceEEEeeCc--ccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccc-hhHHHHHHH Q lcl|Aclame:pro 235 SIAGIRILKSNN--LAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFN-VQYQGDLIV 311 (332) Q Consensus 235 ~i~G~~V~~sn~--lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~-~~~~~d~i~ 311 (332) +++|++|+.+++ +|..+. +...-|-++++... ..+...+++++..+.... ..+..-.++ T Consensus 309 ~l~G~PV~~~~~~~~~~~~~---------~~~~i~~gd~~~~~---------~~~~~~~~~v~~~~~~~~~f~~~~~~~r 370 (408) T protein:vir:10 309 LIKGKQVIVVADRWLPNTGS---------TVYPLYYGDMSQAI---------TLFDRENMSLLPTNIGAGAFETDTTKIR 370 (408) T ss_pred eecceeeEEecccccCccCC---------CceEEEEEehhccE---------EEEEecceEEEEcccccchhhcCceEEE Confidence 899999998664 453211 11112334444322 112223445544332211 011112455 Q ss_pred HHHHhCCceechhheeeeecC Q lcl|Aclame:pro 312 GKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 312 ~~~~~G~~vlrpe~~v~i~~A 332 (332) +.++++.++++|++.+.+.-+ T Consensus 371 ~~~r~d~~v~~~~a~~~~~~~ 391 (408) T protein:vir:10 371 VIDRFDVKATDSEALVAGSFS 391 (408) T ss_pred EEEeeccEEeccccEEEEEee Confidence 677899999999999877744 No 133 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=98.85 E-value=8.2e-10 Score=70.40 Aligned_cols=298 Identities=11% Similarity=0.058 Sum_probs=143.6 Q ss_pred CCC--------ccc-ccccccccccccccccCchhhHHHHHH-hHHHHHHHHHhhhhccccccccccc-cceEEEecccc Q lcl|Aclame:pro 1 MTT--------LSN-FSLPNQANGGARNADYDVRYATALKLF-SGEVFTAFNNASIFKGLVRSYDLRG-GKSKQFMFTGK 69 (332) Q Consensus 1 m~~--------~~~-~~r~~~~~~~~~~~~~d~~~al~~e~f-~g~V~~~f~~~s~~~~~v~~r~~~~-G~tv~i~~iG~ 69 (332) ... ... ..+... -+.+. .+. .+....| .+++.+..+..+++.++++..++.+ +.++.||++.. T Consensus 137 ~~~~~~~~~~~~~~~~~~~~~--~~~~~-~gg---~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~ 210 (477) T protein:vir:84 137 DVESDKEIRKIAKVGEEYRDL--DRNGG-TGG---YAVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILT 210 (477) T ss_pred hhhhhhhHHHHHHhhhhhccc--cccCC-Ccc---eeeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEec Confidence 000 000 000000 01111 111 1455554 6889999888888889988888765 45799998633 Q ss_pred --eeeeeecCCCCCCcc----CCCCCceEEEEEeeeeecc-hhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 70 --LSAGYHTPGTPIVGD----AGIKANEKTLVMDDLLVSS-QFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVL 142 (332) Q Consensus 70 --~t~~~~~~g~~~~~~----~~~~~~~~~l~ID~~~~~~-~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~ 142 (332) ....-...|..+... .++....+++...+ +.. ..|.+-=-.++.+++.+.+.++.++++++..|+.++. T Consensus 211 ~~~~a~~~~Eg~~~~~~~~~~s~~~f~~i~~~~~k--~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~~l~-- 286 (477) T protein:vir:84 211 GTSTAIQAADNAALTAPSAHEVDLTDGFVQANVKT--IAGQQGIAIQLLDQAAVSVDEFVFRDLAADYANKLNVQVIS-- 286 (477) T ss_pred CcceeeeeccCcccccccccccccceeeEEEeeee--EEeeeHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHhc-- Confidence 223334445443321 12334444444444 444 3455422234578999999999999999999987762 Q ss_pred HHHhhhcccccc---ccccceeccccc--cccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchh-- Q lcl|Aclame:pro 143 AKASAEASPVTG---EPGGFHVNIGAG--NTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNI-- 215 (332) Q Consensus 143 ~~aa~~~~~~~~---~~~~~~i~~~~~--~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~-- 215 (332) +..++....| .++...+..+.. ...+...+++.|+++...++.... ......++.|..|..|.+.+|.+- T Consensus 287 --G~Gt~~~p~Gi~~~~~~~~~~~~~~~~t~~~~~~~~~~i~~~~~~~~~~~~-~~~~~~v~~~~~~~~l~~lkd~~G~~ 363 (477) T protein:vir:84 287 --GTGSNNQVVGVRATAGITQVTATSAGSALEKHQIIYQKIADAIQRVHTSRF-LEPEVIVMHPRRWASFHAIFAGDDRP 363 (477) T ss_pred --cCCCCCccceeeeccccccccccccccchhhHHHHHHHHHHHHhhcccccc-CCccEEEEcHHHHHHHHHhhccCCCe Confidence 1111111111 111111111111 112233456667777666554433 223456789999988876554321 Q ss_pred -hcccccc------ccccccccceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhh Q lcl|Aclame:pro 216 -LNREIGN------SQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQ 288 (332) Q Consensus 216 -~~~d~~~------~~~~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~ 288 (332) ...++.. ..+.+..+ ..++++|.+|+.|+.+|...+... ....-+-++|+.. +++. . T Consensus 364 l~~~~~~~~~~~~~~~~~~~~~-~~~~l~G~pVv~s~~~p~~~~~~~------d~~~i~~gd~~~~--~i~~-~------ 427 (477) T protein:vir:84 364 LIVPSGPGFNNLGVLTEVASQR-VVGQMHGLPVVTDPTLPTTLGTGT------DQDVIHVLRASDL--ALFE-S------ 427 (477) T ss_pred eeecCccccccccccccccccc-ccchhcccceEecCcccccccccC------CcceEEEEEeceE--EEEe-e------ Confidence 0001000 01122333 357899999999999995322110 0111233444432 1221 1 Q ss_pred hccceeeeeecccchhHHHH--HHHHHHHhCCceec-hhheeeeecC Q lcl|Aclame:pro 289 SVAPTIQTTSGDFNVQYQGD--LIVGKLAMGCGSLR-TSVAGSFQAA 332 (332) Q Consensus 289 ~~~~~~e~~~~~~~~~~~~d--~i~~~~~~G~~vlr-pe~~v~i~~A 332 (332) .+.++.....+.. +... .+.+.+. .+.+| |++.+.|.-+ T Consensus 428 --~~~~~~~~~~~~~-~~~~~~~v~~~~~--~~~~r~~~afv~~t~~ 469 (477) T protein:vir:84 428 --SVRMRALQETRAE-NLSVLLQVYGYLA--FTAARFPQSVVEIGGT 469 (477) T ss_pred --ceeEEeccccccc-cceeeeeehhhhh--hhhhccccceEEeecc Confidence 1223322211111 0001 1122222 35666 9988776554 No 134 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=98.85 E-value=1.7e-10 Score=74.08 Aligned_cols=272 Identities=10% Similarity=0.020 Sum_probs=149.8 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc--cceeeeeecCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT--GKLSAGYHTPG 78 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i--G~~t~~~~~~g 78 (332) ...............|...+++. .+.=+.|..++.+..+..+.++++++..++..|+ .++|.. +..+......| T Consensus 115 ~~~~~~~~~~~~~~~~~t~~~gg---~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~v~E~ 190 (394) T protein:vir:97 115 LMPINETTPVEPQKDGIKKENAK---PVSSEEILYTPAREVKTVVDLKPFTTVYQAKKAS-GKYPVLQRATTKMVTVAEL 190 (394) T ss_pred HHHHHhhhhhhhhcccccccccc---ccChHHHHHHHHHHhhhhhhhhhhceeeeccCcc-eEEEEEecCCCccceeccc Confidence 00000000000001111111222 1445889999999999889999998877765554 566654 44566667667 Q ss_pred CCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Q lcl|Aclame:pro 79 TPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGG 158 (332) Q Consensus 79 ~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~~ 158 (332) ........++.+++++...+. +.-..|.+-=-.++.+|+.+.+.++.+++|++..|..|+.-...+ T Consensus 191 ~~~~~~~~~~~~~v~l~~~k~-~~~i~is~ell~ds~~~~~~~i~~~la~~~~~~~~~~i~~g~~~~------------- 256 (394) T protein:vir:97 191 EKNPALAKPDFKDVAWNIDTY-RGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSF------------- 256 (394) T ss_pred ccccccccccceeEEeehhhe-eeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhccccc------------- Confidence 655433345667777777653 333455552223456789999999999999999998877532100 Q ss_pred ceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeeeec Q lcl|Aclame:pro 159 FHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIAG 238 (332) Q Consensus 159 ~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i~G 238 (332) ++..... ++.|.++...+-. |..+-.+|++|..|..|..-+|.+ ++.... ..+..|. -+.++| T Consensus 257 -----~~~~~~~----~~~~~~~~~~~~~---~~~~a~~v~n~~~~~~l~~lkd~~--G~~i~~--~~~~~~~-~~~l~G 319 (394) T protein:vir:97 257 -----TTKTVKN----LDEIKALLNGGFD---PAYNVSLIVSQSFYQTLDTLKDGN--GRYLLQ--DDITAVS-GKVLLG 319 (394) T ss_pred -----ccccccc----HHHHHHHHHhhhh---hhhCCEEEEcHHHHHHHHHhhccC--CCeeee--cCcCCCC-Cceecc Confidence 0111112 3434443322111 222334679999999987654432 111111 1233332 468999 Q ss_pred eEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHHhCC Q lcl|Aclame:pro 239 IRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAMGC 318 (332) Q Consensus 239 ~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~~G~ 318 (332) ++|+.+++.+... ..-|-++|+.... ++ ...+++++...++ ++...+++.+++|+ T Consensus 320 ~pv~~~~~~~~~~------------~~~~~gd~~~~~~-~~--------~~~~~~~~~~~~~----~~~~~~~~~~r~d~ 374 (394) T protein:vir:97 320 KPVFVLSDEVLGA------------NKAFIGDFKRGVL-FA--------DRKDLGLRWADNE----IYGQYLQAVLRFGV 374 (394) T ss_pred ceeEEecccccCC------------ccEEEeeccccEE-EE--------EecceEEEEeccc----ccceeEEEEEEEcc Confidence 9999876543211 1123455544321 22 1223444433322 34456788899999 Q ss_pred ceechhheeeeecC Q lcl|Aclame:pro 319 GSLRTSVAGSFQAA 332 (332) Q Consensus 319 ~vlrpe~~v~i~~A 332 (332) ++++|++.+.|.-. T Consensus 375 ~v~~~~a~~~~~~~ 388 (394) T protein:vir:97 375 SKVDDKAGYYVTFT 388 (394) T ss_pred EEecccceEEEEec Confidence 99999998866544 No 135 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=98.84 E-value=2.3e-10 Score=73.46 Aligned_cols=293 Identities=12% Similarity=0.064 Sum_probs=150.1 Q ss_pred CCCccccccccc--ccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc--cceeeeeec Q lcl|Aclame:pro 1 MTTLSNFSLPNQ--ANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT--GKLSAGYHT 76 (332) Q Consensus 1 m~~~~~~~r~~~--~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i--G~~t~~~~~ 76 (332) +.+ ....+.. ....+..+++. .+..+.|..++.+..++.+.++++++..+..++ ++.||+. +..++.... T Consensus 138 ~~~--~~~~~~~~~~~~~~~~~~gg---~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~-~~~~~~~~~~~~~a~wv~ 211 (497) T protein:vir:10 138 FAD--GETAPAAIGQNPFGSTGTFA---PGILPTFLPGIVEQLFYELSLADLISSRPVTSP-NLSYLTESAAHNNAAAVA 211 (497) T ss_pred Hhh--hhhhHHHHHhhhcccCcccc---cccchhhhHHHHHHHHhhhhHHhhccccccCCC-ceEEEEEcCCCCcceeec Confidence 111 0000000 01112222232 267799999999999999999999987777665 5888864 345677777 Q ss_pred CCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc-- Q lcl|Aclame:pro 77 PGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTG-- 154 (332) Q Consensus 77 ~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~-- 154 (332) .|+.+... +++.+++++...+.-. -..|++ +-.+...++.+.+.++.++++++..|..++.= .+ +..|.+. T Consensus 212 E~~~~~~s-~~~f~~i~~~~~k~a~-~~~iS~-ell~d~~~l~~~i~~~l~~~i~~~~d~~~l~G--~G--~~~p~Gil~ 284 (497) T protein:vir:10 212 EAGTYPFS-SEEFARVYEQVGKVAN-ALTITD-EGLRDAPELFNFVQGRLLEGIQRKEEVQLLAG--GG--YPGVNGLLQ 284 (497) T ss_pred cCcccccc-cccceeeEeeeeeeEe-ecHhHH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHhhcC--CC--ccccccccc Confidence 77776653 5677777777766422 234543 22233456888889999999999999987631 00 0000000 Q ss_pred ccccceecccccc---------------------ccC-----------------------------HHHHHHHHHHHHHH Q lcl|Aclame:pro 155 EPGGFHVNIGAGN---------------------TND-----------------------------AQAIVDGFFEAAAV 184 (332) Q Consensus 155 ~~~~~~i~~~~~~---------------------~~~-----------------------------~~~~~d~i~~a~~~ 184 (332) .++...+..+... ... .....+.++.+... T Consensus 285 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 364 (497) T protein:vir:10 285 RSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVD 364 (497) T ss_pred ccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhh Confidence 0000000000000 000 00011112222222 Q ss_pred HHhcCCCcCCCEEEEChHHHHHHHhhcCch--hhcc-ccccccccccccceeeeeeceEEEeeCcccccccccccccccc Q lcl|Aclame:pro 185 LDERSAPQEGRVAVLSPRQYYSLISSVDTN--ILNR-EIGNSQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVT 261 (332) Q Consensus 185 Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~--~~~~-d~~~~~~~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~ 261 (332) +.....= ..-.+++.|..|..|.+.+|.. .+-. .+.+..+....+ ...++|.+|+.++.+|... T Consensus 365 ~~~~~~~-~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~--~~~l~G~pV~~t~~~~~~~---------- 431 (497) T protein:vir:10 365 IQLTLFQ-TPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNG--GKNIWGVPVVTTPLIPLGT---------- 431 (497) T ss_pred hhhhccc-CCCeEEEchHHHHHHHHhhcCCCceeccCcccccccccccC--CceeeceeeEecCCCCCCc---------- Confidence 2222110 1114678999999886655432 1111 111111111111 3479999999999998421 Q ss_pred cccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHH--HHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 262 GENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDL--IVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 262 g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~--i~~~~~~G~~vlrpe~~v~i~~A 332 (332) .+-++|+...-+++.+. +++++.... ....|.-|. |++..++|..+++|++.+.|.=. T Consensus 432 ----~~~Gd~~~~~~~i~~r~--------~~~v~~~~~-~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~ 491 (497) T protein:vir:10 432 ----ILVGHFAPSVIQTARRE--------GVTMQMTNS-NGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLK 491 (497) T ss_pred ----eEEeecccceEEEEEec--------ccEEEeecc-cchhhhcCcEEEEEEEeecceeeccccEEEEEec Confidence 12344443222233333 334444321 111122233 55667899999999998877654 No 136 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=98.84 E-value=2.3e-10 Score=73.46 Aligned_cols=293 Identities=12% Similarity=0.064 Sum_probs=150.1 Q ss_pred CCCccccccccc--ccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc--cceeeeeec Q lcl|Aclame:pro 1 MTTLSNFSLPNQ--ANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT--GKLSAGYHT 76 (332) Q Consensus 1 m~~~~~~~r~~~--~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i--G~~t~~~~~ 76 (332) +.+ ....+.. ....+..+++. .+..+.|..++.+..++.+.++++++..+..++ ++.||+. +..++.... T Consensus 138 ~~~--~~~~~~~~~~~~~~~~~~gg---~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~-~~~~~~~~~~~~~a~wv~ 211 (497) T protein:vir:78 138 FAD--GETAPAAIGQNPFGSTGTFA---PGILPTFLPGIVEQLFYELSLADLISSRPVTSP-NLSYLTESAAHNNAAAVA 211 (497) T ss_pred Hhh--hhhhHHHHHhhhcccCcccc---cccchhhhHHHHHHHHhhhhHHhhccccccCCC-ceEEEEEcCCCCcceeec Confidence 111 0000000 01112222232 267799999999999999999999987777665 5888864 345677777 Q ss_pred CCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc-- Q lcl|Aclame:pro 77 PGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTG-- 154 (332) Q Consensus 77 ~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~-- 154 (332) .|+.+... +++.+++++...+.-. -..|++ +-.+...++.+.+.++.++++++..|..++.= .+ +..|.+. T Consensus 212 E~~~~~~s-~~~f~~i~~~~~k~a~-~~~iS~-ell~d~~~l~~~i~~~l~~~i~~~~d~~~l~G--~G--~~~p~Gil~ 284 (497) T protein:vir:78 212 EAGTYPFS-SEEFARVYEQVGKVAN-ALTITD-EGLRDAPELFNFVQGRLLEGIQRKEEVQLLAG--GG--YPGVNGLLQ 284 (497) T ss_pred cCcccccc-cccceeeEeeeeeeEe-ecHhHH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHhhcC--CC--ccccccccc Confidence 77776653 5677777777766422 234543 22233456888889999999999999987631 00 0000000 Q ss_pred ccccceecccccc---------------------ccC-----------------------------HHHHHHHHHHHHHH Q lcl|Aclame:pro 155 EPGGFHVNIGAGN---------------------TND-----------------------------AQAIVDGFFEAAAV 184 (332) Q Consensus 155 ~~~~~~i~~~~~~---------------------~~~-----------------------------~~~~~d~i~~a~~~ 184 (332) .++...+..+... ... .....+.++.+... T Consensus 285 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 364 (497) T protein:vir:78 285 RSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVD 364 (497) T ss_pred ccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhh Confidence 0000000000000 000 00011112222222 Q ss_pred HHhcCCCcCCCEEEEChHHHHHHHhhcCch--hhcc-ccccccccccccceeeeeeceEEEeeCcccccccccccccccc Q lcl|Aclame:pro 185 LDERSAPQEGRVAVLSPRQYYSLISSVDTN--ILNR-EIGNSQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVT 261 (332) Q Consensus 185 Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~--~~~~-d~~~~~~~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~ 261 (332) +.....= ..-.+++.|..|..|.+.+|.. .+-. .+.+..+....+ ...++|.+|+.++.+|... T Consensus 365 ~~~~~~~-~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~--~~~l~G~pV~~t~~~~~~~---------- 431 (497) T protein:vir:78 365 IQLTLFQ-TPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNG--GKNIWGVPVVTTPLIPLGT---------- 431 (497) T ss_pred hhhhccc-CCCeEEEchHHHHHHHHhhcCCCceeccCcccccccccccC--CceeeceeeEecCCCCCCc---------- Confidence 2222110 1114678999999886655432 1111 111111111111 3479999999999998421 Q ss_pred cccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHH--HHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 262 GENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDL--IVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 262 g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~--i~~~~~~G~~vlrpe~~v~i~~A 332 (332) .+-++|+...-+++.+. +++++.... ....|.-|. |++..++|..+++|++.+.|.=. T Consensus 432 ----~~~Gd~~~~~~~i~~r~--------~~~v~~~~~-~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~ 491 (497) T protein:vir:78 432 ----ILVGHFAPSVIQTARRE--------GVTMQMTNS-NGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLK 491 (497) T ss_pred ----eEEeecccceEEEEEec--------ccEEEeecc-cchhhhcCcEEEEEEEeecceeeccccEEEEEec Confidence 12344443222233333 334444321 111122233 55667899999999998877654 No 137 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=98.84 E-value=4e-10 Score=72.10 Aligned_cols=278 Identities=15% Similarity=0.061 Sum_probs=148.6 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc-cceeeeeecCCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-GKLSAGYHTPGT 79 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i-G~~t~~~~~~g~ 79 (332) |++. +-. .|| .|.-+.+..++++..++.+.++++++..+.. +.+.+||+. +.+.+.-+..|. T Consensus 1 ma~~---t~~----~gg---------~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~-~~~~~~p~~~~~~~a~wv~E~~ 63 (305) T protein:vir:25 1 MADI---SRA----EVA---------SLIQEAYSDTLLAAAKQGSTVLSAFQNVNMG-TKTTHLPVLATLPEADWVGESA 63 (305) T ss_pred CCCc---cCC----ccc---------eecCHHHHHHHHHHHHhhchhhhhcceeecc-CCcEEEEEEeCCcceEEeeccc Confidence 5552 111 111 2566889999999999999999999877765 446888865 455666666665 Q ss_pred CCCcc----CCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc-ccccc Q lcl|Aclame:pro 80 PIVGD----AGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEA-SPVTG 154 (332) Q Consensus 80 ~~~~~----~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~-~~~~~ 154 (332) ..... .+++..++++..-+. +.-..|.+-=-.++..++.+.+.++.++++++..|+.++.-- ..... .+... T Consensus 64 ~~~~~~~~~s~~~f~~i~~~~~k~-~~~~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~--g~~~~~~~~~~ 140 (305) T protein:vir:25 64 TDPKGVKPTSKVTWANRTLVAEEI-AVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGT--DKPASWVSPAL 140 (305) T ss_pred ccccccccccccceeeEEeeeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhheecc--CCCCCcccccc Confidence 54321 123444555554442 233445542222456889999999999999999999887311 00000 00000 Q ss_pred cccccee---ccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccc Q lcl|Aclame:pro 155 EPGGFHV---NIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGK 231 (332) Q Consensus 155 ~~~~~~i---~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~ 231 (332) .+..... ..++........+++.+..+...+........ . ++++|..|..|.+.+|. +..- +... T Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~-~v~~~~~~~~l~~lkd~--------~G~~-i~~~- 208 (305) T protein:vir:25 141 IPAAVTAGQAVEVVGGVANESDIVGATNRAAKAVASAGWAPD-T-LLSSLALRYEVANIRDA--------NGNP-VFRD- 208 (305) T ss_pred ccccccccccccccccchhhhHHHHHHHHHHHhhhhcccccc-e-eEecHHHHHHHHHhhcc--------CCce-eecC- Confidence 0000000 01111112223345555555554443332211 2 57799999888654332 1111 1111 Q ss_pred eeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecc----c--ch-h Q lcl|Aclame:pro 232 GLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGD----F--NV-Q 304 (332) Q Consensus 232 ~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~----~--~~-~ 304 (332) ..++|.+|+.++++|...+.. ..+-++|++.. ++-+ .++++++.++- . .. . T Consensus 209 --~~l~G~Pv~~~~~~~~~~~~~----------~~~~gd~s~~~--i~~~--------~~~~i~~~~~~~~~~~~~~~~~ 266 (305) T protein:vir:25 209 --DSFAGFRTFFNRNGAWDADAA----------IEVIADSSRVK--IGVR--------QDITVKFLDQATLGTGENQINL 266 (305) T ss_pred --CcccccceEEcCccCCCCCcc----------EEEEEecceEE--EEEe--------cCeEEEEeeeeeeecCCceeee Confidence 268999999999988533221 23445555432 1111 12233322210 0 00 1 Q ss_pred HHHH--HHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 305 YQGD--LIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 305 ~~~d--~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) |+-| .+++..++|..++||++++.+.-. T Consensus 267 ~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~ 296 (305) T protein:vir:25 267 AERDMVALRLKARFAYVLGVSATAQGANKT 296 (305) T ss_pred eecCcEEEEEEEeecceeeCcccEEEEccc Confidence 1222 234556789999999998877765 No 138 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=98.81 E-value=1.7e-09 Score=68.69 Aligned_cols=279 Identities=13% Similarity=0.112 Sum_probs=168.4 Q ss_pred CCCcccccccccccccccccccCchhhHHH-HHHhHHHHHHHHHhhhhcc--cc-c---ccc----ccccceEEEecccc Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATAL-KLFSGEVFTAFNNASIFKG--LV-R---SYD----LRGGKSKQFMFTGK 69 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~-e~f~g~V~~~f~~~s~~~~--~v-~---~r~----~~~G~tv~i~~iG~ 69 (332) |+. |+.. .+++ |+|..+|.++..+.+.|.. .+ + ..+ -.+|+++.+|..+. T Consensus 1 MA~----T~ls---------------d~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~ 61 (324) T protein:vir:59 1 MAY----TKIS---------------DVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWND 61 (324) T ss_pred CCc----eeee---------------ceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEeccccc Confidence 663 4431 1555 9999999998888876632 21 1 111 24699999998876 Q ss_pred e--eeeeecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 70 L--SAGYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASA 147 (332) Q Consensus 70 ~--t~~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~ 147 (332) . ...++..++.++. +.++.++..-+|= .+...+.+.|+-...+-.|.+.++.++.+..+++..+..++..+..... T Consensus 62 l~Gd~~~v~~~~~i~~-~~l~t~~~~a~i~-~~~k~~~~tD~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l~g~~~ 139 (324) T protein:vir:59 62 LDGDSQVLNDTDDLVP-QKINAGQDKAVLI-LRGNAWSSHDLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAELAGVFS 139 (324) T ss_pred CCCcccccCCCcccch-hhcccceeeEEEE-eecCceeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 4 5678888888774 5787777776666 4788899999888888889999999999999999999999987754332 Q ss_pred hccccccccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccc Q lcl|Aclame:pro 148 EASPVTGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDM 227 (332) Q Consensus 148 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~ 227 (332) ... . ......+.++. +...-++.|.+|..+|.++. ..-..++|.|..|..|.+. ++++.- ..+++ T Consensus 140 ~~~-~----~~~~~dvsa~~--~~~~s~~~l~~A~~~~GD~~--~~~~~ivmhS~v~~~L~~~---~li~~~-~~s~~-- 204 (324) T protein:vir:59 140 NDD-M----KDNKLDISGTA--DGIYSAETFVDASYKLGDHE--SLLTAIGMHSATMASAVKQ---DLIEFV-KDSQS-- 204 (324) T ss_pred ccc-c----ccceeeeeccc--cceecHHHHHHHHHHhCCcc--cCcEEEEEchHHHHHHHHh---hhhhhc-ccccc-- Confidence 211 1 11111111111 11112467888888887753 2336888999999999753 444321 11221 Q ss_pred cccceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhcc-ceeeeeecccchh-- Q lcl|Aclame:pro 228 NSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVA-PTIQTTSGDFNVQ-- 304 (332) Q Consensus 228 ~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~-~~~e~~~~~~~~~-- 304 (332) .+ .++.++|.+|+.+..+|....+ |.. ....+++|-+-|++....++ +.+|..|+..... T Consensus 205 -~~-~i~~~~G~~VivdD~~p~~~~~--------~~~-------~~y~s~l~~~GAi~~~~~~~~v~vE~dRd~~~g~~~ 267 (324) T protein:vir:59 205 -GI-RFPTYMNKRVIVDDSMPVETLE--------DGT-------KVFTSYLFGAGALGYAEGQPEVPTETARNALGSQDI 267 (324) T ss_pred -Cc-eeeeecccEEEEeCCCCccccC--------CCC-------ceEEEEEEecCeEEEeecCCCcceecccCccccceE Confidence 22 4889999999999999953221 111 12234566666666655443 4566665543211 Q ss_pred HHHHHHHHHHHhCCceechhhe------eeeecC Q lcl|Aclame:pro 305 YQGDLIVGKLAMGCGSLRTSVA------GSFQAA 332 (332) Q Consensus 305 ~~~d~i~~~~~~G~~vlrpe~~------v~i~~A 332 (332) .+.+....+|.+|.+-..+... .+|.++ T Consensus 268 l~~r~~~~~~p~G~s~~~~~~~~~sPt~~~L~~~ 301 (324) T protein:vir:59 268 LINRKHFVLHPRGVKFTENAMAGTTPTDEELANG 301 (324) T ss_pred EEEeeEEEeEeeeEEecccccCCCCCChhhhcCC Confidence 1222233344444444322110 122222 No 139 >protein:vir:93696 Length: 364 # NCBI annotation: Bcep22gp55 # Family: family:all:974 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944284;genbank:gi:38640361;genbank:GeneID:2658350 Probab=98.78 E-value=1.1e-09 Score=69.76 Aligned_cols=300 Identities=15% Similarity=0.138 Sum_probs=160.5 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhcc-ccc---------cccc--cccceEEEeccc Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKG-LVR---------SYDL--RGGKSKQFMFTG 68 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~-~v~---------~r~~--~~G~tv~i~~iG 68 (332) |+. ..+ +-+ | .+-.++|+..+...-.+.+-|.+ ++= ...+ ..|++|.|+-+. T Consensus 1 Ma~--T~~-------~~~----~---p~a~~~ws~~l~~~~~~~s~f~~~l~G~~~~~~I~~~~dL~k~~Gd~v~f~L~~ 64 (364) T protein:vir:93 1 MSQ--TVI-------PFG----D---PKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDRITFDLSV 64 (364) T ss_pred Cce--ecc-------CcC----C---HHHHHHHHHHHHHHHHhhCccccccccCCCCCcEEEeeecCCCCCceEEeeeee Confidence 664 111 111 1 25679999999988887776655 331 1122 238999998876 Q ss_pred ceeeeeecCCCCCCcc-CCCCCceEEEEEeeeeecchhh-hhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 69 KLSAGYHTPGTPIVGD-AGIKANEKTLVMDDLLVSSQFV-YSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKAS 146 (332) Q Consensus 69 ~~t~~~~~~g~~~~~~-~~~~~~~~~l~ID~~~~~~~~I-dd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa 146 (332) ..+-....-++.+.+. +.++....+|+||+..- ++.. ..+++-.+.+|+|.+--..++.=+++..|+.++..+..+. T Consensus 65 ~L~g~gv~Gd~~leGnee~L~~~~~~i~idq~r~-~V~~~g~ms~qRt~~dlr~~ar~~L~~w~~~~~d~~~f~~laGar 143 (364) T protein:vir:93 65 HLRGKPTYGDARVEGKEESLRFYQDEVRIDQVRH-SVSAGGRMSRKRTVHNIRRIARDRLGDYFYKFTDELLFIYLSGAR 143 (364) T ss_pred ecccCCcccCceeeccccceeEEeeEEEEeeccc-cccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 6655444444555554 35777888999998643 2221 3577788899999999999999999999999888876532 Q ss_pred hhccccccc-------------cccceeccc--ccc--c-cCHH-HHHHHHHHHHHHHHhcCCC--c------------C Q lcl|Aclame:pro 147 AEASPVTGE-------------PGGFHVNIG--AGN--T-NDAQ-AIVDGFFEAAAVLDERSAP--Q------------E 193 (332) Q Consensus 147 ~~~~~~~~~-------------~~~~~i~~~--~~~--~-~~~~-~~~d~i~~a~~~Lde~~VP--~------------~ 193 (332) ....+.... |....+-.+ ++. . +..+ .-++.|..+..+++....+ + + T Consensus 144 g~~~~~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~l~stD~~sl~~id~a~~~a~~~~~~~~~~~~~~Pv~~~g~~ 223 (364) T protein:vir:93 144 GINLDFIETPDFTGYAGNPLDAPDVDHLLYGGVATSKASLAATDIMAPLVIEKAVEKAAMMQAENPDVANMVPVSIDGDD 223 (364) T ss_pred ccccccccccCcccccccccCCCCCCcEEeccccCchhhccccccccHHHHHHHHHHHHHhCCCCCCCcccceeEecCcc Confidence 211111110 111111111 000 0 1111 1256777788777665321 1 1 Q ss_pred CCEEEEChHHHHHHHhhcCchhhccc-----cccccccccccceeeeeeceEEEeeCccccccccccccccccccccccc Q lcl|Aclame:pro 194 GRVAVLSPRQYYSLISSVDTNILNRE-----IGNSQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQ 268 (332) Q Consensus 194 gR~~vv~P~~~~~Ll~~~d~~~~~~d-----~~~~~~~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~ 268 (332) -=++++.|.+++.|-.+.++++.+-. ..+.+..+-.|. ++.|.|+-|++.++++...... . .. T Consensus 224 ~yV~~l~p~q~~~Lr~~t~~~w~d~qk~A~~~~g~~nPlF~G~-~gm~ngvii~~~~~vi~~~~~~--~------~~--- 291 (364) T protein:vir:93 224 HYVCVMSEYQATDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGG-LGMINNVVLHKHRNVIRFNDYG--A------GA--- 291 (364) T ss_pred eeEEEEcchhhhhhhhcCCHHHHHHHHHhhhcccccCCceecC-eeeEcCeEEeccCCcccccccc--c------Cc--- Confidence 12778999999999754455543321 122334567775 9999999999999998642111 0 00 Q ss_pred ccccceEEEeechhhhhhh--hhccceeeeeecccchhHHHH-HHHHHHHhCCceechh----heeeeecC Q lcl|Aclame:pro 269 VDASALAGLIFHREAAGCI--QSVAPTIQTTSGDFNVQYQGD-LIVGKLAMGCGSLRTS----VAGSFQAA 332 (332) Q Consensus 269 ~~~~~~~~l~~h~~a~~~~--~~~~~~~e~~~~~~~~~~~~d-~i~~~~~~G~~vlrpe----~~v~i~~A 332 (332) +.....+|++-.+|++.+ +..+ +..++.|....|.-. .|.....+|.+=.|=+ ++.+|-+| T Consensus 292 -~v~~~ralllGaQA~~~a~g~~~g--~~~~w~Ee~~D~gn~~~i~~~~i~G~kK~rF~~~DfGvi~idta 359 (364) T protein:vir:93 292 -NVEAARALFMGRQAGVIAYGTANG--LRFDWEETVKDYGNEPAIAAGFIAGMKKARFNNKDFGVISIDTA 359 (364) T ss_pred -cccchhhheecceeeEEEeecCCC--CCceeeecccCCCCchhhhhhhHhhhhhcccCCccceEEEeccc Confidence 111122344434443222 2211 122222221111111 1333333343333221 34455555 No 140 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=98.78 E-value=1.3e-09 Score=69.25 Aligned_cols=281 Identities=11% Similarity=-0.001 Sum_probs=152.5 Q ss_pred CCCcc-----------cccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccc-eEEEecc- Q lcl|Aclame:pro 1 MTTLS-----------NFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGK-SKQFMFT- 67 (332) Q Consensus 1 m~~~~-----------~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~-tv~i~~i- 67 (332) |.+-. ..-+... + .+..+++. .+.=+.|.+++.+..+..+.++++++..++.++. ...++.. T Consensus 84 l~~~~~~~~~~~~~~~~~~~~~~-~-~~t~~~gg---~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~ 158 (392) T protein:vir:10 84 LRNKPLNAEEREFLEDDLEQRAM-S-GLTGEDGG---LVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNS 158 (392) T ss_pred HhcccccHHHHHHHhhhhhhhhc-c-ccccCCCc---eecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeec Confidence 11000 0000000 1 11111222 1445889999999999999999999988876543 3445544 Q ss_pred cceeeeeecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 68 GKLSAGYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASA 147 (332) Q Consensus 68 G~~t~~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~ 147 (332) +.+.+.....|..+.....++.+++++..-+. +.-..|.+-=-.++.+|+.+.+.++.++++++..|..|+.-... T Consensus 159 ~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~--- 234 (392) T protein:vir:10 159 DMIPFAEITEMGEIPETDNPKFSNVQYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEK--- 234 (392) T ss_pred CCccceeecccccccccccccceeEEeeeeeE-EEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc--- Confidence 45667777777776544345667777777664 34445655222335689999999999999999999988642210 Q ss_pred hccccccccccceeccccccccCHHHHHHHHHHHH-HHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhcccccccccc Q lcl|Aclame:pro 148 EASPVTGEPGGFHVNIGAGNTNDAQAIVDGFFEAA-AVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGD 226 (332) Q Consensus 148 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~d~i~~a~-~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~ 226 (332) . ++...++ ++.|.++. ..|+....+ +-.+|++|..|..|.+.+|.+ ..+.- ... T Consensus 235 -~--------------~~~~~~~----~d~i~~~~~~~l~~~~~~--~a~~vm~~~~~~~L~~lkd~~---G~~l~-~~~ 289 (392) T protein:vir:10 235 -L--------------TKQAIKS----LDDIKDVLNVKLDPAISP--NAILLTNQDGFNYLDKLKDKD---GKYIL-QSD 289 (392) T ss_pred -c--------------cccCccC----HHHHHHHHHHhhhhhhcc--CCEEEEcHHHHHHHHHhhccC---CCeEe-ecC Confidence 0 0111122 44555544 345554443 234578999999997654432 11111 112 Q ss_pred ccccceeeeeeceEEEe--eCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchh Q lcl|Aclame:pro 227 MNSGKGLYSIAGIRILK--SNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQ 304 (332) Q Consensus 227 ~~~g~~v~~i~G~~V~~--sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~ 304 (332) +..|. -+.++|.+++. ++.+|...+ ...+...-+-++|+..+ ..+...+++++..+. .... T Consensus 290 ~~~~~-~~tllG~~~v~~~~~~~~~~~~------~~~~~~~~~~gdfs~~~---------~i~~~~~~~~~~~~~-~~~~ 352 (392) T protein:vir:10 290 PTQKN-KKLFAGTNPVVVVSNRFLKSKG------TTAKKAPLIIGDLKEAI---------VLFKREDMELASTDV-GGKA 352 (392) T ss_pred ccCCc-cccccCcccEEEecccccCCCc------ccCCceEEEEEehhceE---------EEEeecceEEEEecc-ccch Confidence 23332 46789987654 344443211 11122222233444322 122233444444321 1112 Q ss_pred HHHHH--HHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 305 YQGDL--IVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 305 ~~~d~--i~~~~~~G~~vlrpe~~v~i~~A 332 (332) |.-+. +++..++|.++++|++.+.+.-. T Consensus 353 f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~ 382 (392) T protein:vir:10 353 FTRNTLDLRAIQRDDVQMWDNEAAVYGEID 382 (392) T ss_pred hhcCceEEEEEEeeccEEecccceEEEEec Confidence 22222 66777889999999998886544 No 141 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=98.78 E-value=1.3e-09 Score=69.25 Aligned_cols=281 Identities=11% Similarity=-0.001 Sum_probs=152.5 Q ss_pred CCCcc-----------cccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccc-eEEEecc- Q lcl|Aclame:pro 1 MTTLS-----------NFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGK-SKQFMFT- 67 (332) Q Consensus 1 m~~~~-----------~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~-tv~i~~i- 67 (332) |.+-. ..-+... + .+..+++. .+.=+.|.+++.+..+..+.++++++..++.++. ...++.. T Consensus 84 l~~~~~~~~~~~~~~~~~~~~~~-~-~~t~~~gg---~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~ 158 (392) T protein:vir:10 84 LRNKPLNAEEREFLEDDLEQRAM-S-GLTGEDGG---LVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNS 158 (392) T ss_pred HhcccccHHHHHHHhhhhhhhhc-c-ccccCCCc---eecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeec Confidence 11000 0000000 1 11111222 1445889999999999999999999988876543 3445544 Q ss_pred cceeeeeecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 68 GKLSAGYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASA 147 (332) Q Consensus 68 G~~t~~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~ 147 (332) +.+.+.....|..+.....++.+++++..-+. +.-..|.+-=-.++.+|+.+.+.++.++++++..|..|+.-... T Consensus 159 ~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~--- 234 (392) T protein:vir:10 159 DMIPFAEITEMGEIPETDNPKFSNVQYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEK--- 234 (392) T ss_pred CCccceeecccccccccccccceeEEeeeeeE-EEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc--- Confidence 45667777777776544345667777777664 34445655222335689999999999999999999988642210 Q ss_pred hccccccccccceeccccccccCHHHHHHHHHHHH-HHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhcccccccccc Q lcl|Aclame:pro 148 EASPVTGEPGGFHVNIGAGNTNDAQAIVDGFFEAA-AVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGD 226 (332) Q Consensus 148 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~d~i~~a~-~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~ 226 (332) . ++...++ ++.|.++. ..|+....+ +-.+|++|..|..|.+.+|.+ ..+.- ... T Consensus 235 -~--------------~~~~~~~----~d~i~~~~~~~l~~~~~~--~a~~vm~~~~~~~L~~lkd~~---G~~l~-~~~ 289 (392) T protein:vir:10 235 -L--------------TKQAIKS----LDDIKDVLNVKLDPAISP--NAILLTNQDGFNYLDKLKDKD---GKYIL-QSD 289 (392) T ss_pred -c--------------cccCccC----HHHHHHHHHHhhhhhhcc--CCEEEEcHHHHHHHHHhhccC---CCeEe-ecC Confidence 0 0111122 44555544 345554443 234578999999997654432 11111 112 Q ss_pred ccccceeeeeeceEEEe--eCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchh Q lcl|Aclame:pro 227 MNSGKGLYSIAGIRILK--SNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQ 304 (332) Q Consensus 227 ~~~g~~v~~i~G~~V~~--sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~ 304 (332) +..|. -+.++|.+++. ++.+|...+ ...+...-+-++|+..+ ..+...+++++..+. .... T Consensus 290 ~~~~~-~~tllG~~~v~~~~~~~~~~~~------~~~~~~~~~~gdfs~~~---------~i~~~~~~~~~~~~~-~~~~ 352 (392) T protein:vir:10 290 PTQKN-KKLFAGTNPVVVVSNRFLKSKG------TTAKKAPLIIGDLKEAI---------VLFKREDMELASTDV-GGKA 352 (392) T ss_pred ccCCc-cccccCcccEEEecccccCCCc------ccCCceEEEEEehhceE---------EEEeecceEEEEecc-ccch Confidence 23332 46789987654 344443211 11122222233444322 122233444444321 1112 Q ss_pred HHHHH--HHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 305 YQGDL--IVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 305 ~~~d~--i~~~~~~G~~vlrpe~~v~i~~A 332 (332) |.-+. +++..++|.++++|++.+.+.-. T Consensus 353 f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~ 382 (392) T protein:vir:10 353 FTRNTLDLRAIQRDDVQMWDNEAAVYGEID 382 (392) T ss_pred hhcCceEEEEEEeeccEEecccceEEEEec Confidence 22222 66777889999999998886544 No 142 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=98.78 E-value=1.3e-09 Score=69.25 Aligned_cols=281 Identities=11% Similarity=-0.001 Sum_probs=152.5 Q ss_pred CCCcc-----------cccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccc-eEEEecc- Q lcl|Aclame:pro 1 MTTLS-----------NFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGK-SKQFMFT- 67 (332) Q Consensus 1 m~~~~-----------~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~-tv~i~~i- 67 (332) |.+-. ..-+... + .+..+++. .+.=+.|.+++.+..+..+.++++++..++.++. ...++.. T Consensus 84 l~~~~~~~~~~~~~~~~~~~~~~-~-~~t~~~gg---~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~ 158 (392) T protein:vir:10 84 LRNKPLNAEEREFLEDDLEQRAM-S-GLTGEDGG---LVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNS 158 (392) T ss_pred HhcccccHHHHHHHhhhhhhhhc-c-ccccCCCc---eecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeec Confidence 11000 0000000 1 11111222 1445889999999999999999999988876543 3445544 Q ss_pred cceeeeeecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 68 GKLSAGYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASA 147 (332) Q Consensus 68 G~~t~~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~ 147 (332) +.+.+.....|..+.....++.+++++..-+. +.-..|.+-=-.++.+|+.+.+.++.++++++..|..|+.-... T Consensus 159 ~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~--- 234 (392) T protein:vir:10 159 DMIPFAEITEMGEIPETDNPKFSNVQYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEK--- 234 (392) T ss_pred CCccceeecccccccccccccceeEEeeeeeE-EEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc--- Confidence 45667777777776544345667777777664 34445655222335689999999999999999999988642210 Q ss_pred hccccccccccceeccccccccCHHHHHHHHHHHH-HHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhcccccccccc Q lcl|Aclame:pro 148 EASPVTGEPGGFHVNIGAGNTNDAQAIVDGFFEAA-AVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGD 226 (332) Q Consensus 148 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~d~i~~a~-~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~ 226 (332) . ++...++ ++.|.++. ..|+....+ +-.+|++|..|..|.+.+|.+ ..+.- ... T Consensus 235 -~--------------~~~~~~~----~d~i~~~~~~~l~~~~~~--~a~~vm~~~~~~~L~~lkd~~---G~~l~-~~~ 289 (392) T protein:vir:10 235 -L--------------TKQAIKS----LDDIKDVLNVKLDPAISP--NAILLTNQDGFNYLDKLKDKD---GKYIL-QSD 289 (392) T ss_pred -c--------------cccCccC----HHHHHHHHHHhhhhhhcc--CCEEEEcHHHHHHHHHhhccC---CCeEe-ecC Confidence 0 0111122 44555544 345554443 234578999999997654432 11111 112 Q ss_pred ccccceeeeeeceEEEe--eCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchh Q lcl|Aclame:pro 227 MNSGKGLYSIAGIRILK--SNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQ 304 (332) Q Consensus 227 ~~~g~~v~~i~G~~V~~--sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~ 304 (332) +..|. -+.++|.+++. ++.+|...+ ...+...-+-++|+..+ ..+...+++++..+. .... T Consensus 290 ~~~~~-~~tllG~~~v~~~~~~~~~~~~------~~~~~~~~~~gdfs~~~---------~i~~~~~~~~~~~~~-~~~~ 352 (392) T protein:vir:10 290 PTQKN-KKLFAGTNPVVVVSNRFLKSKG------TTAKKAPLIIGDLKEAI---------VLFKREDMELASTDV-GGKA 352 (392) T ss_pred ccCCc-cccccCcccEEEecccccCCCc------ccCCceEEEEEehhceE---------EEEeecceEEEEecc-ccch Confidence 23332 46789987654 344443211 11122222233444322 122233444444321 1112 Q ss_pred HHHHH--HHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 305 YQGDL--IVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 305 ~~~d~--i~~~~~~G~~vlrpe~~v~i~~A 332 (332) |.-+. +++..++|.++++|++.+.+.-. T Consensus 353 f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~ 382 (392) T protein:vir:10 353 FTRNTLDLRAIQRDDVQMWDNEAAVYGEID 382 (392) T ss_pred hhcCceEEEEEEeeccEEecccceEEEEec Confidence 22222 66777889999999998886544 No 143 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=98.78 E-value=1.3e-09 Score=69.25 Aligned_cols=281 Identities=11% Similarity=-0.001 Sum_probs=152.5 Q ss_pred CCCcc-----------cccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccc-eEEEecc- Q lcl|Aclame:pro 1 MTTLS-----------NFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGK-SKQFMFT- 67 (332) Q Consensus 1 m~~~~-----------~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~-tv~i~~i- 67 (332) |.+-. ..-+... + .+..+++. .+.=+.|.+++.+..+..+.++++++..++.++. ...++.. T Consensus 84 l~~~~~~~~~~~~~~~~~~~~~~-~-~~t~~~gg---~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~ 158 (392) T protein:vir:10 84 LRNKPLNAEEREFLEDDLEQRAM-S-GLTGEDGG---LVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNS 158 (392) T ss_pred HhcccccHHHHHHHhhhhhhhhc-c-ccccCCCc---eecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeec Confidence 11000 0000000 1 11111222 1445889999999999999999999988876543 3445544 Q ss_pred cceeeeeecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 68 GKLSAGYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASA 147 (332) Q Consensus 68 G~~t~~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~ 147 (332) +.+.+.....|..+.....++.+++++..-+. +.-..|.+-=-.++.+|+.+.+.++.++++++..|..|+.-... T Consensus 159 ~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~--- 234 (392) T protein:vir:10 159 DMIPFAEITEMGEIPETDNPKFSNVQYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEK--- 234 (392) T ss_pred CCccceeecccccccccccccceeEEeeeeeE-EEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc--- Confidence 45667777777776544345667777777664 34445655222335689999999999999999999988642210 Q ss_pred hccccccccccceeccccccccCHHHHHHHHHHHH-HHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhcccccccccc Q lcl|Aclame:pro 148 EASPVTGEPGGFHVNIGAGNTNDAQAIVDGFFEAA-AVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGD 226 (332) Q Consensus 148 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~d~i~~a~-~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~ 226 (332) . ++...++ ++.|.++. ..|+....+ +-.+|++|..|..|.+.+|.+ ..+.- ... T Consensus 235 -~--------------~~~~~~~----~d~i~~~~~~~l~~~~~~--~a~~vm~~~~~~~L~~lkd~~---G~~l~-~~~ 289 (392) T protein:vir:10 235 -L--------------TKQAIKS----LDDIKDVLNVKLDPAISP--NAILLTNQDGFNYLDKLKDKD---GKYIL-QSD 289 (392) T ss_pred -c--------------cccCccC----HHHHHHHHHHhhhhhhcc--CCEEEEcHHHHHHHHHhhccC---CCeEe-ecC Confidence 0 0111122 44555544 345554443 234578999999997654432 11111 112 Q ss_pred ccccceeeeeeceEEEe--eCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchh Q lcl|Aclame:pro 227 MNSGKGLYSIAGIRILK--SNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQ 304 (332) Q Consensus 227 ~~~g~~v~~i~G~~V~~--sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~ 304 (332) +..|. -+.++|.+++. ++.+|...+ ...+...-+-++|+..+ ..+...+++++..+. .... T Consensus 290 ~~~~~-~~tllG~~~v~~~~~~~~~~~~------~~~~~~~~~~gdfs~~~---------~i~~~~~~~~~~~~~-~~~~ 352 (392) T protein:vir:10 290 PTQKN-KKLFAGTNPVVVVSNRFLKSKG------TTAKKAPLIIGDLKEAI---------VLFKREDMELASTDV-GGKA 352 (392) T ss_pred ccCCc-cccccCcccEEEecccccCCCc------ccCCceEEEEEehhceE---------EEEeecceEEEEecc-ccch Confidence 23332 46789987654 344443211 11122222233444322 122233444444321 1112 Q ss_pred HHHHH--HHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 305 YQGDL--IVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 305 ~~~d~--i~~~~~~G~~vlrpe~~v~i~~A 332 (332) |.-+. +++..++|.++++|++.+.+.-. T Consensus 353 f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~ 382 (392) T protein:vir:10 353 FTRNTLDLRAIQRDDVQMWDNEAAVYGEID 382 (392) T ss_pred hhcCceEEEEEEeeccEEecccceEEEEec Confidence 22222 66777889999999998886544 No 144 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=98.78 E-value=4.7e-09 Score=66.25 Aligned_cols=318 Identities=13% Similarity=0.102 Sum_probs=161.0 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccc--cccceEEEecccce-ee-eeec Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDL--RGGKSKQFMFTGKL-SA-GYHT 76 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~--~~G~tv~i~~iG~~-t~-~~~~ 76 (332) |-+-+...--.+++--|+++ + -+...=|-..++..-.+..++.++-..+++ .+|+|++|.+--.. .. .-.+ T Consensus 1 ~~~~~a~~~~~~~s~~g~~~--~---~~~t~y~~~k~L~~Aa~~lv~~~fA~~~piPkn~GkTIk~r~y~pl~~~~~pl~ 75 (401) T protein:vir:95 1 MLNYNAPTDGQKSSIDGANS--D---QMQTFFWLKKAIITARKEQYFMPLASVTNMPKHYGKTIKVYEYVPLLDDRNIND 75 (401) T ss_pred CCccCCCccccccccccccc--c---eeeehhhHHHHHhhhhhhhhhhhcccccccccccCCeEEEEecccccccccchh Confidence 55433322211112222222 1 122333444555544555677777666665 67999998754321 11 1122 Q ss_pred CCCCCCcc-----------CCC----------------------CCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHH Q lcl|Aclame:pro 77 PGTPIVGD-----------AGI----------------------KANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVS 123 (332) Q Consensus 77 ~g~~~~~~-----------~~~----------------------~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~ 123 (332) .|.++.+. .++ +-.++...|-|+=.|..+=|.++..-....+...++ T Consensus 76 eGv~a~G~~~~~g~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~~D~~l~~h~s 155 (401) T protein:vir:95 76 QGIDASGATIVNGNLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFDSDDGLMEHLS 155 (401) T ss_pred cCCCcccccccCccccccccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhhcchHHHHHHH Confidence 23333221 111 111233345554444433344443333344555555 Q ss_pred HHHHHHHHHH-HHHHHHHHHHHHhhhccccccccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCc---------- Q lcl|Aclame:pro 124 KQIGEALATH-YDERIARVLAKASAEASPVTGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQ---------- 192 (332) Q Consensus 124 ~~~~~aLa~~-~D~~i~~~~~~aa~~~~~~~~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~---------- 192 (332) .++...=+.. .|.. .+.+..++... ...+.. .+..+++....++....++.|+.+...|+++..|+ T Consensus 156 ~ell~g~~~~t~d~i-~~dll~ag~~v-iyAg~a-ts~At~~~~~~~~t~vt~~~l~rl~~~L~~nRapk~t~~i~~s~~ 232 (401) T protein:vir:95 156 RELMNGATQITEAVL-QKDLLAAAGTV-LYAGAA-TSDATITGEGSTPSVVSYKNLMRLDQILTENRTPTQTTIITGSRM 232 (401) T ss_pred HHHhhhhhhhHHHHH-HHHHHhhcCee-ecCCcc-ceeeeccccccccceechhHHHHHHHHHHhcccccchhhhhhhhc Confidence 5544433333 3433 33333222110 000100 01111122222333344788999999999987776 Q ss_pred C-------CCEEEEChHHH------HHHHhhcCchhhccccccccccccccceeeeeeceEEEeeCcccccccccccccc Q lcl|Aclame:pro 193 E-------GRVAVLSPRQY------YSLISSVDTNILNREIGNSQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAA 259 (332) Q Consensus 193 ~-------gR~~vv~P~~~------~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~ 259 (332) - -|++++.|+.- ++|+ .++.|+.....+..+.+.+|+ ||++-+|++++++.+-.-.+.+.. T Consensus 233 ~dTk~i~~s~va~~h~~L~~di~a~~D~~--~~~~fi~v~kYa~~~~i~~gE-iG~i~~vR~i~~p~~~~w~~ag~~--- 306 (401) T protein:vir:95 233 IDTKVIGATRVMYVGSELVPELKAMKDLF--GNKAFIETQHYADAGTIMNGE-VGSIDKFRIIQVPEMLHWAGAGAQ--- 306 (401) T ss_pred cCccccccceEEEEecCchhHHHHHHHhc--CCCCceehhhcCCcccccccc-ccccCceeEEecccceeecCCccc--- Confidence 2 26888888433 4444 346788775556678899997 999999999998875422111110 Q ss_pred cccccccc-------cccccceEEEeechhhhhhhhhccceee-----ee-------ecccchhHHHHHHHHHHHhCCce Q lcl|Aclame:pro 260 VTGENNDY-------QVDASALAGLIFHREAAGCIQSVAPTIQ-----TT-------SGDFNVQYQGDLIVGKLAMGCGS 320 (332) Q Consensus 260 ~~g~~~~y-------~~~~~~~~~l~~h~~a~~~~~~~~~~~e-----~~-------~~~~~~~~~~d~i~~~~~~G~~v 320 (332) ..|.+..| .++.+-.-.|++-++|.+++.++....- +. -+..++--|-=.+.=++.||+.+ T Consensus 307 a~~~~~~y~~~~~~~gg~~dVyp~lV~G~dAf~~~~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQ~g~vgwK~~~a~~v 386 (401) T protein:vir:95 307 ATGANPGYRTSMVSGQEHYDVYPMLVVGDDSFTSIGFQTDGKSLKFTVMTKMPGKETADRNDPYGETGFSSIKWYYGILV 386 (401) T ss_pred ccccccccccccccCCCcceeeeeeEEccccceecccccCCccccceeEeecCCcCCCCCCCcccceehhhhhhhhhhhe Confidence 11111122 2233445568888888887766654311 00 01122322333455578899999 Q ss_pred echhheeeeecC Q lcl|Aclame:pro 321 LRTSVAGSFQAA 332 (332) Q Consensus 321 lrpe~~v~i~~A 332 (332) ||||..+-|+++ T Consensus 387 L~~e~m~~ies~ 398 (401) T protein:vir:95 387 KRPERLALIKTV 398 (401) T ss_pred eccceeEEEEee Confidence 999999999999 No 145 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=98.77 E-value=2.3e-10 Score=73.42 Aligned_cols=292 Identities=12% Similarity=0.096 Sum_probs=150.0 Q ss_pred CCCcccccccc------cccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecccc-eeee Q lcl|Aclame:pro 1 MTTLSNFSLPN------QANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGK-LSAG 73 (332) Q Consensus 1 m~~~~~~~r~~------~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG~-~t~~ 73 (332) |.......|-. ........++++ .+.=+.+..++.+..+..+.+.++++..+.. |+ ++||+.+. .... T Consensus 119 ~~~~~~~~~~~~~~~~~~~~~~~~~~~gg---~~vP~~~~~~Ii~~l~~~~~i~~~~~~~~~~-g~-~~ip~~~~~~~a~ 193 (425) T protein:vir:95 119 LKTGEYYKRSEVVEFYEKFRNLRAVAGGE---LTIPEVVVNRIMDIMGDYTTLYPLVDKIRVK-GT-TRILVDTDTSPAT 193 (425) T ss_pred HhhhhhhhhhHHHHHHHHHHhhcccccCc---eeccHHHHHHHHHHHHhhhhHHHhhceeecC-ce-eEEEEecCCcccc Confidence 11100000100 001111112222 2445889999999999999999999876653 44 57887654 4444 Q ss_pred eecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|Aclame:pro 74 YHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVT 153 (332) Q Consensus 74 ~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~ 153 (332) ....|..+........+++++..-+. +.-+.|.+-=-.++..++.+.+.++.++++++..|+.|+.- .+.....|.+ T Consensus 194 ~v~E~~~~~~~~~~~f~~i~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~G--~G~~~~~p~G 270 (425) T protein:vir:95 194 WIEQSGALPTGDVGTIASIDFDGFKV-GKVTFVDNYLLQDSIINLDDYVTKKIARAIAKALDLAIVKG--TGAANKQPLG 270 (425) T ss_pred ccccccccccccccccceeeeeheee-eeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhcc--CCCCccccce Confidence 55667665433223456666655543 23345655323345668999999999999999999987731 1110111111 Q ss_pred cccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhh--ccccccccccccccc Q lcl|Aclame:pro 154 GEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNIL--NREIGNSQGDMNSGK 231 (332) Q Consensus 154 ~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~--~~d~~~~~~~~~~g~ 231 (332) .... ........ .......++.|.++...+..+..+..+-++++.|..|+..|... .... +..|... .-.+ T Consensus 271 il~~-~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~l~~l-~~~kd~~g~~i~~---~~~~- 343 (425) T protein:vir:95 271 IIPS-LPPENQVT-VEADNNLLKNLVKQIGLIDTGDDSVGEIVAVMKRSTYYNRLVEF-SIQVDSNGNVVGK---LPNL- 343 (425) T ss_pred eecc-cccccccc-cccccchHHHHHHHHHhhhhhccccCceEEEEeChHHHHHHHHH-HhhcCCCCceeec---cCCC- Confidence 0000 00000000 11122246677777777766665555544566766554322211 1111 1112111 1122 Q ss_pred eeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHH Q lcl|Aclame:pro 232 GLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIV 311 (332) Q Consensus 232 ~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~ 311 (332) ....++|.+|+.++++|... -+-++|+.. +++-+ .+++++...+. ...+-...++ T Consensus 344 ~~~~l~G~pvv~~~~~~~~~--------------i~~Gd~~~~--~~~~~--------~~~~i~~~~~~-~f~~~~~~~~ 398 (425) T protein:vir:95 344 RTPDLLGLRVVFNNFLDDDT--------------VLFGEFEQY--TLVER--------ENITIDSSTHV-KFTEDQTAFR 398 (425) T ss_pred CCccccceeeEEcCcCCCcc--------------EEEEecccE--EEEee--------cceEEEeeccc-ccccCceEEE Confidence 25679999999999998421 123455442 12211 22233332211 1111123466 Q ss_pred HHHHhCCceechhheeeeecC Q lcl|Aclame:pro 312 GKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 312 ~~~~~G~~vlrpe~~v~i~~A 332 (332) +..++++++++|++.+.+.=. T Consensus 399 ~~~r~d~~~~~~~a~~~~~i~ 419 (425) T protein:vir:95 399 GKGRFDGKPVKPEAFVLVTIT 419 (425) T ss_pred EEEeeCcEeecccceEEEEec Confidence 667889999999998887665 No 146 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=98.77 E-value=4.9e-10 Score=71.63 Aligned_cols=279 Identities=10% Similarity=0.007 Sum_probs=147.7 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhcccccccccccc-ceEEEecccce--eeeeecC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGG-KSKQFMFTGKL--SAGYHTP 77 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G-~tv~i~~iG~~--t~~~~~~ 77 (332) +...-+..+-+ +...+++. .+.=+.|+.++++..+..+.++++++..++..+ .++.++..... ....... T Consensus 98 ~~~~~~~~~~~----~~~~~~gg---~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E 170 (395) T protein:vir:38 98 VKDFKNLVTSG----TTGTGNAG---LTIPEDIQLQIRTLTRSFTSLESLANVENVTTSHGSRVYEKLADITPLKDLDDE 170 (395) T ss_pred HHHHHHHHhhc----cCccCCCc---eecchhHhhHHHHHHHhhcchhhhcceeeccCCcceEEEEeeccCCcccccccc Confidence 11100000100 01111111 134478899999999999999999887766432 34555554432 2333445 Q ss_pred CCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 78 GTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPG 157 (332) Q Consensus 78 g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~ 157 (332) |..+.....++..++++...+.- .-..|.+-=-..+.+|+.+.+.++.++++++..|+.|+.-.. +. T Consensus 171 ~~~~~~~~~~~f~~v~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g----~~-------- 237 (395) T protein:vir:38 171 SALIGDNDDPELTVVKYLIHRYA-GITTVTNTLLKDTVDNIIQWLVNWAAKKDVVTRNAKILEVMG----KA-------- 237 (395) T ss_pred ccccccccccceeeEEeeeeeeE-eehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc----cc-------- Confidence 65554333345566666666543 223455522233567899999999999999999998874211 00 Q ss_pred cceeccccccccCHHHHHHHHHHHHH-HHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeee Q lcl|Aclame:pro 158 GFHVNIGAGNTNDAQAIVDGFFEAAA-VLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSI 236 (332) Q Consensus 158 ~~~i~~~~~~~~~~~~~~d~i~~a~~-~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i 236 (332) ... ....+ ++.|.++.. .|....- .+-.++++|..|..|.+.+|.. ++ +.. ...+..|. -..+ T Consensus 238 -~~~----~~~~~----~~~i~~~~~~~l~~~~~--~~a~~v~n~~~~~~L~~lkd~~--G~-~l~-~~~~~~~~-~~~l 301 (395) T protein:vir:38 238 -PKK----PTISQ----FDNIKDLENNTLDPAIE--STSSFITNQSGYNILSKVKDAD--GR-YLM-QPDVTSPD-KYLI 301 (395) T ss_pred -ccc----ccccc----HHHHHHHHHHhhhhhhc--CCCEEEEcHHHHHHHHHhhccC--Cc-eee-ccCcCCCC-ccee Confidence 000 01111 344444432 3333332 2345679999999987654431 11 111 12233343 5689 Q ss_pred eceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccc-hhHHHHHHHHHHH Q lcl|Aclame:pro 237 AGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFN-VQYQGDLIVGKLA 315 (332) Q Consensus 237 ~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~-~~~~~d~i~~~~~ 315 (332) +|++|+.+.+.|.... .+...-|-++|++. +..+...+++++..+.... ..+-...+++..+ T Consensus 302 ~G~pV~~~~~~~~~~~--------~~~~~i~~gd~~~~---------~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r 364 (395) T protein:vir:38 302 DGKPVIRIADKWLPDV--------SGSHPLYFGDLKQG---------ITLFDRQQMQIDTTNVGAGSFEHDTTKLRFIDR 364 (395) T ss_pred ccceeEEecccccCcC--------CCcceEEEEecccc---------EEEEEecceEEEEeccccchhhcCceEEEEEEe Confidence 9999999987653211 11112233444432 1122233445555433211 1111234566677 Q ss_pred hCCceechhheeeeecC Q lcl|Aclame:pro 316 MGCGSLRTSVAGSFQAA 332 (332) Q Consensus 316 ~G~~vlrpe~~v~i~~A 332 (332) ||.++++|++.+.|.-. T Consensus 365 ~d~~~~~~~a~~~~~~~ 381 (395) T protein:vir:38 365 FDVQLIDDGAFAAASFK 381 (395) T ss_pred eccEEecccceEEEEee Confidence 89999999998877655 No 147 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=98.72 E-value=5.9e-10 Score=71.18 Aligned_cols=289 Identities=13% Similarity=0.040 Sum_probs=150.4 Q ss_pred CCCccccccc--ccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc-cceeeeeecC Q lcl|Aclame:pro 1 MTTLSNFSLP--NQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-GKLSAGYHTP 77 (332) Q Consensus 1 m~~~~~~~r~--~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i-G~~t~~~~~~ 77 (332) +-.+..-.|- +.....+..++++ .|.=+.|..++++..++.+.++++++..+..+ ...+||.. +...+..... T Consensus 69 ~~~l~~~~r~~~~~~~~~~~~~~gg---~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~-~~~~i~~~~~~~~a~~~~E 144 (390) T protein:vir:40 69 ANALTSDESKYYNEVIAGNGFAGVT---ALLPPTVFERVFEDLTVEHPLLSKINFVNTTA-TTEWIISVGDVATAWWGPL 144 (390) T ss_pred chhccHHHHHHHHHHHhccCcccCc---ccccHHHHHHHHHHHHhhhhhhhhceeeecCC-ceeEEEEEcCCcceeeecc Confidence 0000000110 0011122222333 24448999999999999999999998777654 44566764 5556666666 Q ss_pred CCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc-- Q lcl|Aclame:pro 78 GTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGE-- 155 (332) Q Consensus 78 g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~-- 155 (332) +..+....+++.+++++..-++ +.-+.|.+-=-..+.+|+.+.+.++.++++++..|+.|+.- .+ .+.|.+-. T Consensus 145 ~~~~~~~~~~~f~~i~l~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~~i~~~~~~a~l~G--~G--~~~P~Gil~~ 219 (390) T protein:vir:40 145 CAEIKEVLDNGFDKIQTGMYKL-SAYIPVCNAMLDLGPSWLDQYVRTILGEAMALGLEAGIVNG--SG--KDQPIGMMRD 219 (390) T ss_pred ccccCccccccceeeEeeeeeE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhcc--cC--CCccceeeec Confidence 6555433456677787877764 34455655333356778999999999999999999988741 11 11111100 Q ss_pred ccccee---ccccccccCHHHHHHHHHHHHHHHHhcCCCc-CCCEEEEChHHHHHHHhhcCchhhccccccccccccccc Q lcl|Aclame:pro 156 PGGFHV---NIGAGNTNDAQAIVDGFFEAAAVLDERSAPQ-EGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGK 231 (332) Q Consensus 156 ~~~~~i---~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~-~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~ 231 (332) .+.... ............+.+.+..+...+....-+. ..-++++.|..+..+|... ..+ .+.++.+..+. T Consensus 220 ~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~~~-~~~-----~d~~G~~v~~~ 293 (390) T protein:vir:40 220 LNNVTAGEHPVKTATPLTDLTPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIYAA-TSY-----MTPQGVWVTGI 293 (390) T ss_pred cccccccccccccccccchhhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHHHH-hhc-----cCCCCcccccc Confidence 000000 0001111111222333333333333322221 2345678887765555421 111 11222222221 Q ss_pred eeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHH Q lcl|Aclame:pro 232 GLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIV 311 (332) Q Consensus 232 ~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~ 311 (332) ...|.+|+.|+++|... -+.++|+.. +++.+ .+++++...+ ....+-...++ T Consensus 294 ---~~~g~pvv~~~~~p~~~--------------i~~Gd~s~~--~i~~~--------~~~~v~~~~~-~~f~~~~~~~r 345 (390) T protein:vir:40 294 ---LPVPLEIVQSVAVPVGK--------------AVAGRAKDY--FMGIG--------SEQVIRTSTE-YRLLDDETLYY 345 (390) T ss_pred ---CCCceeEEEcCCCCCCc--------------EEEEeeceE--EEEee--------cceEEEecch-hhhhcCcEEEE Confidence 24699999999998421 123455542 23322 2334443321 11111113467 Q ss_pred HHHHhCCceechhheeeeecC Q lcl|Aclame:pro 312 GKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 312 ~~~~~G~~vlrpe~~v~i~~A 332 (332) +.+++++++++|++.+.|.-+ T Consensus 346 ~~~r~dg~v~~~~A~~~l~~~ 366 (390) T protein:vir:40 346 AKQYANGRPKDNSSFLVFDIT 366 (390) T ss_pred EEEEeCCEEecccceEEEEee Confidence 788999999999999877633 No 148 >protein:vir:105610 Length: 430 # NCBI annotation: virion structural protein # Family: family:all:974 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164307;genbank:gi:56692923;genbank:GeneID:3197221 Probab=98.71 E-value=1.2e-08 Score=64.05 Aligned_cols=312 Identities=16% Similarity=0.136 Sum_probs=163.2 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhh----c----------------------cccccc Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIF----K----------------------GLVRSY 54 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~----~----------------------~~v~~r 54 (332) |+- .-|..+ -+ | .+-+++|+.-+++.-.+++.| . +.++.. T Consensus 1 ~~~--a~T~~~-----~~----~---p~a~~~ws~~l~~~~~k~~~~~~kl~G~~~~~~~~~~~~~~~~ts~~~pI~r~~ 66 (430) T protein:vir:10 1 MTA--SKTTMR-----YG----D---PNAMIQQAAGLFALCQGRNSTLNRLTGKMPSGTSDAEKKTKGQSSLELPIVQAQ 66 (430) T ss_pred Ccc--eeeecc-----cC----C---hhHHHHHHHHHHHHHhhhhhhHHHhhccccccccchhhhccCCCCCCccEEEec Confidence 765 233221 11 1 256778887776665443222 1 244444 Q ss_pred cc--cccceEEEecccceeeeeecCCCCCCcc-CCCCCceEEEEEeeeeecchhhh-hHHHHHhchhHHHHHHHHHHHHH Q lcl|Aclame:pro 55 DL--RGGKSKQFMFTGKLSAGYHTPGTPIVGD-AGIKANEKTLVMDDLLVSSQFVY-SLDEIFSQYSTRAEVSKQIGEAL 130 (332) Q Consensus 55 ~~--~~G~tv~i~~iG~~t~~~~~~g~~~~~~-~~~~~~~~~l~ID~~~~~~~~Id-d~D~~q~~~d~~~~~~~~~~~aL 130 (332) ++ ..|++|.|+-+...+-....-+..+.+. +.++...-.|+||+.. .++.+. .+++-.+.+|+|++--..++.=+ T Consensus 67 dL~K~~GD~Vtf~L~~~L~g~gv~Gd~~lEGnee~L~~~~d~l~IDq~R-~~V~~gg~msqQRt~~dlR~~ar~~L~~w~ 145 (430) T protein:vir:10 67 DLGRNKGDEVRFHFVQPANAFPIMGSEYAEGKGTGLKIGSDQLRVNQAR-FPVDLGDVMSQIRNPYDLRRLGRPKAKWFM 145 (430) T ss_pred cCCCCCccEEEEeEeeccccCceecCceeeccccceEEEeeEEEEeeec-cccccCCchhhhhhhhHHHHHHHHHHHHHH Confidence 44 3589999987765544444444444444 3577788899999964 333332 45666788999999999999999 Q ss_pred HHHHHHHHHHHHHHHhh--------------------hccccccccccc-eeccccccc-------------cCHHH-HH Q lcl|Aclame:pro 131 ATHYDERIARVLAKASA--------------------EASPVTGEPGGF-HVNIGAGNT-------------NDAQA-IV 175 (332) Q Consensus 131 a~~~D~~i~~~~~~aa~--------------------~~~~~~~~~~~~-~i~~~~~~~-------------~~~~~-~~ 175 (332) ++..||.++..++.+.. ..+++. .|..+ ++. ..+.+ +..+. -+ T Consensus 146 ~~~~Dq~~~v~laGarg~~~~~~~~~~~~~~~~~~~~~~N~v~-aPt~nrh~~-~~G~at~~~~~~~~~~sl~stD~~s~ 223 (430) T protein:vir:10 146 DAYLDQSMLVHLAGARGNHYNKEWCLPLETHPKLADMLVNRVK-APTKNRHFV-ASADAITGVAPNAGEYNITTADVLDV 223 (430) T ss_pred HHHHHHHHHHHHhhhhcccccccccccccCCcchhhhhccccC-CCCCceeEe-ecccccccccccccccchhhhcccCH Confidence 99999999988865421 011111 12221 111 01111 11111 15 Q ss_pred HHHHHHHHHHHhcCCC-------cCC-------CEEEEChHHHHHHHhhcCchhh-------ccccccccccccccceee Q lcl|Aclame:pro 176 DGFFEAAAVLDERSAP-------QEG-------RVAVLSPRQYYSLISSVDTNIL-------NREIGNSQGDMNSGKGLY 234 (332) Q Consensus 176 d~i~~a~~~Lde~~VP-------~~g-------R~~vv~P~~~~~Ll~~~d~~~~-------~~d~~~~~~~~~~g~~v~ 234 (332) +.|.+++..++..+.| .+. +++++.|.+|..|... +.+. ++..-+.+..+-.|. ++ T Consensus 224 ~~id~a~~~a~~~~~~i~Pv~v~gd~~~g~~~~yV~~~~p~q~~~Lr~d--t~~~~wq~~~~a~a~~g~~nPlF~G~-~g 300 (430) T protein:vir:10 224 DVVDSIATYMDQIELPPPPVKFEGDEAAEDSPIRVLLCSPAQYNSFAKQ--EKFRSWQAAALARASNAKQHPIFRVD-AG 300 (430) T ss_pred HHHHHHHHHHHhhCCCCcceEeecccccCCccEEEEEechHHHHHHhhC--cchHHHHHHHHHhhcccccCCceecc-ee Confidence 6677888888887643 222 7888999999999753 4432 111122345666775 99 Q ss_pred eeeceEEEeeCccc-ccccc--cccccccccc------cccccccccceEEEeechhhhhhhhhccc--eeeeeecccch Q lcl|Aclame:pro 235 SIAGIRILKSNNLA-GLYGQ--DLSSAAVTGE------NNDYQVDASALAGLIFHREAAGCIQSVAP--TIQTTSGDFNV 303 (332) Q Consensus 235 ~i~G~~V~~sn~lp-~~~g~--~~~~~~~~g~------~~~y~~~~~~~~~l~~h~~a~~~~~~~~~--~~e~~~~~~~~ 303 (332) .|.|+-|++..+.= ...|. .++....... ...+++.++-..+|+.-.+|++.+-.+.. -++.+|.|... T Consensus 301 m~ngvii~~~~~virf~~g~~~~~~a~~~~~~~~~~~~~a~~~~~~~v~RalllGaQA~~~A~g~~~~~g~~f~w~Ee~~ 380 (430) T protein:vir:10 301 LWSNTLIIKMPKPIRFYAGDTIKYCAAYNSEAESSAVVSDSFGNQYAVDRALLLGGQALAQAWAASEHSGMPFFWSEKDM 380 (430) T ss_pred eecCeEEecCCceeeecCCCccccccCCcccccccccccccccccccchhhhhccchhheeeeeccCCCCcceeeeeecc Confidence 99999999986431 11111 1111111100 11222333333445554554432221111 11222222211 Q ss_pred hH-HHHHHHHHHHhCCceechh------------heeeeecC Q lcl|Aclame:pro 304 QY-QGDLIVGKLAMGCGSLRTS------------VAGSFQAA 332 (332) Q Consensus 304 ~~-~~d~i~~~~~~G~~vlrpe------------~~v~i~~A 332 (332) .| .--.|.....+|.+=.|=. ++++|-+| T Consensus 381 D~g~~~~i~~~~i~G~kK~rF~~~~~~~~~~~DfGvi~idta 422 (430) T protein:vir:10 381 DHGDKLELLIGAILGCSKIRFAVEATNGLEYTDHGVMAIDTA 422 (430) T ss_pred ccCchhhhhhhHHhccceeeecCCCCCCceeeeeEEEEhhhh Confidence 11 0113444455555444432 34566666 No 149 >protein:vir:2770 Length: 318 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612887;genbank:gi:20065804;genbank:GeneID:935710 Probab=98.70 E-value=2.2e-09 Score=68.01 Aligned_cols=251 Identities=15% Similarity=0.131 Sum_probs=142.9 Q ss_pred CCCccccccccc-------ccccccccccCchhhHHHHHHhHHHHHHHHHhhhhcccc---------ccccc--cccceE Q lcl|Aclame:pro 1 MTTLSNFSLPNQ-------ANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLV---------RSYDL--RGGKSK 62 (332) Q Consensus 1 m~~~~~~~r~~~-------~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v---------~~r~~--~~G~tv 62 (332) ||++..-. |+. ++.+..+ -.++.|++.+...-.+.+-++.+. +..++ ..|++| T Consensus 1 mt~~~~~~-~~~~~~~~~ft~~~~~~--------~~vk~ws~~l~~~~~~~~~~~~~~g~~~~~~I~r~~dL~K~~GD~V 71 (318) T protein:vir:27 1 MTTVTSAQ-ANKLFQVALFTAANRNR--------SMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEV 71 (318) T ss_pred CCccCCCC-hHHHHHHHHHHHHhcCC--------hHHHHHHHhhhhHHHhhhhhhcccCCCCCceEEEeccCCCCCccEE Confidence 99986554 762 1222221 257889998877655554443332 22233 358999 Q ss_pred EEecccceeeeeecCCCCCCcc-CCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 63 QFMFTGKLSAGYHTPGTPIVGD-AGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARV 141 (332) Q Consensus 63 ~i~~iG~~t~~~~~~g~~~~~~-~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~ 141 (332) .|+-+...+-....-+..+.+. +.++...-+|+||+..-.-..=..+++-.+.+|+|++--..++.-+++..|+.++.. T Consensus 72 tf~L~~~L~g~gv~Gd~~lEGnee~L~~~~d~l~IDq~r~~V~~gg~msqqRt~~dlR~~ar~~L~~w~~~~~Dq~~~v~ 151 (318) T protein:vir:27 72 TFSIMHKLSKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVH 151 (318) T ss_pred EEeEeeccccCccccCceeeccccceEEEeeEEEEeeeccccccccchhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 9987765544444434444443 357778888999986432111145677778899999999999999999999999988 Q ss_pred HHHHhhh--------------------ccccccccccceeccccccccCH------HHH-HHHHHHHHHHHHhcCCC--- Q lcl|Aclame:pro 142 LAKASAE--------------------ASPVTGEPGGFHVNIGAGNTNDA------QAI-VDGFFEAAAVLDERSAP--- 191 (332) Q Consensus 142 ~~~aa~~--------------------~~~~~~~~~~~~i~~~~~~~~~~------~~~-~d~i~~a~~~Lde~~VP--- 191 (332) ++.+... .+++. .|..+.+-.+ ++.++. +.+ ++.|-.+.+++++..-| T Consensus 152 laGarg~~~n~~~~~p~~~~~~~~~~~~N~v~-aPt~~r~~~~-g~at~~~~l~stD~~s~~lid~~~~~~~~~a~pi~P 229 (318) T protein:vir:27 152 LAGARGDFVADDTILPTAEHPEFKKIMINDVL-PPTHDRHFFG-GDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQP 229 (318) T ss_pred HhhcccccccccceEecccCccchhhhhcccC-CCCCCcEEec-cCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcc Confidence 8654421 01111 1111111111 111111 111 34455677777663332 Q ss_pred --cC--C-------CEEEEChHHHHHHHhhcC-chhh----ccccc--cccccccccceeeeeeceEEEeeCcccccccc Q lcl|Aclame:pro 192 --QE--G-------RVAVLSPRQYYSLISSVD-TNIL----NREIG--NSQGDMNSGKGLYSIAGIRILKSNNLAGLYGQ 253 (332) Q Consensus 192 --~~--g-------R~~vv~P~~~~~Ll~~~d-~~~~----~~d~~--~~~~~~~~g~~v~~i~G~~V~~sn~lp~~~g~ 253 (332) -+ . +++++.|.+|..|..... ..+. ++... +....+-.|. ++.|.|+-|.+..++|-- T Consensus 230 V~v~g~~~~~~~~~yV~~~~p~q~~~Lrtdt~~~~w~d~q~~A~~r~~g~knPLF~G~-~gm~ngvil~~~~~vpIr--- 305 (318) T protein:vir:27 230 VRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGE-CAMWRNILVRKYAGMPIR--- 305 (318) T ss_pred eeeccccccCCcceEEEEechHHHHHHhhcCCCHHHHHHHHHHHhcccccCCCceecc-eeeecCEEEeecCCccEE--- Confidence 12 2 678899999999975311 1122 22221 2344577776 999999999999998721 Q ss_pred ccccccccccccccccccc Q lcl|Aclame:pro 254 DLSSAAVTGENNDYQVDAS 272 (332) Q Consensus 254 ~~~~~~~~g~~~~y~~~~~ 272 (332) +.+|.+-.|+- .+ T Consensus 306 -----f~~G~~v~~~~-~~ 318 (318) T protein:vir:27 306 -----FYQGQRFWYQR-IT 318 (318) T ss_pred -----EcCCCeeeeee-cC Confidence 11222211110 00 No 150 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=98.70 E-value=2.3e-09 Score=67.90 Aligned_cols=278 Identities=12% Similarity=0.013 Sum_probs=140.8 Q ss_pred CCCcccccccccc--cccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc--cceeeeeec Q lcl|Aclame:pro 1 MTTLSNFSLPNQA--NGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT--GKLSAGYHT 76 (332) Q Consensus 1 m~~~~~~~r~~~~--~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i--G~~t~~~~~ 76 (332) +.......+.+.. .+.+...++. .+--+.+...+... ...+.++.++++.+...+ +..+|.. +........ T Consensus 141 ~~~~~~~~~~~e~~~~~~~~~~~~g---~lvp~~~~~~i~~~-~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 215 (437) T protein:vir:10 141 VTAFADYLKTGEVRDVTGIALKDGK---VIIPETILTPEKEV-HQFPRLGSLVRTESVTTT-TGKLPIFNNSTDLLTAHT 215 (437) T ss_pred hhhhHHHHHhhhhhhhhhccccccc---ccchHHHHHHHHHh-hhhhhhhhcceeEeeccC-ceeeEEeecccccccccc Confidence 1111110000000 0111111111 13346677777654 445556677766655444 3455544 334455555 Q ss_pred CCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|Aclame:pro 77 PGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEP 156 (332) Q Consensus 77 ~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~ 156 (332) .+..+.....++.+++++.+.+. +.-+.|.+-=-..+.+|+.+.+.++.+++|++..|..|+.-... +.+ T Consensus 216 e~~~~~e~~~~~~~~v~~~~~k~-~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~g~g~---------~~~ 285 (437) T protein:vir:10 216 EYGQTTKNATPVITPILWDLKTY-TGGYVFSQELISDSSYDWQAELQSRLIELRDNTDDSLIITALTD---------GIK 285 (437) T ss_pred ccccccccccccceeeeeehhhe-eeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc---------ccc Confidence 55555433334556666666553 23344544222245678999999999999999999888753210 111 Q ss_pred ccceeccccccccCHHHHHHHHHHHH-HHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeee Q lcl|Aclame:pro 157 GGFHVNIGAGNTNDAQAIVDGFFEAA-AVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYS 235 (332) Q Consensus 157 ~~~~i~~~~~~~~~~~~~~d~i~~a~-~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~ 235 (332) + +++. .. ++.|.++. ..|+....+ +-.+|++|..|..|...+|.. ..+.- ...+..|. -+. T Consensus 286 ~------~~~~-~~----~~~~~~~~~~~l~~~~~~--~~~~~~~~~~~~~l~~lkd~~---g~~~~-~~~~~~~~-~~~ 347 (437) T protein:vir:10 286 K------TTST-YL----LGDLKKVLNVTLKPQDSA--AASIVMSQSAYNLFDMATDAM---GRPLL-QPNVTAAT-GYT 347 (437) T ss_pred c------cccc-cc----hhhHHHHHHhhhhhhhhc--CCEEEEcHHHHHHHHHhhccC---CCeee-ccCccCCC-Ccc Confidence 0 0110 11 23344432 245444433 234588999999886644321 11111 11233342 468 Q ss_pred eeceEEEeeCcc--cccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHH Q lcl|Aclame:pro 236 IAGIRILKSNNL--AGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGK 313 (332) Q Consensus 236 i~G~~V~~sn~l--p~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~ 313 (332) ++|.+|+.+++. |..+ +|...-+-++|++... ++-+ .+++++...+ ...+...+++. T Consensus 348 l~G~pv~~~~~~~~~~~~---------~~~~~~~~gd~~~~~~-~~~r--------~~~~~~~~~~---~~~~~~~~~~~ 406 (437) T protein:vir:10 348 LLGKTVVIVDDKLFPSAS---------AGDVNIVVAPLKKAVI-NFKL--------TEITGQFQDT---YDIWYKQLGIF 406 (437) T ss_pred cccceeEEecccccCCcC---------CCceEEEEeeccccEE-EEee--------eceEEEEecc---cccccceeeEE Confidence 999999987654 4321 2222234556654332 2222 2334443321 12234566777 Q ss_pred HHhCCceechhheeeee---cC Q lcl|Aclame:pro 314 LAMGCGSLRTSVAGSFQ---AA 332 (332) Q Consensus 314 ~~~G~~vlrpe~~v~i~---~A 332 (332) ++|++++++|++.+.|. +| T Consensus 407 ~r~d~~~~~~~a~~~l~~~~~~ 428 (437) T protein:vir:10 407 LRQNVVQASKDLIVNLTGKLKA 428 (437) T ss_pred EEEccEEecccceEEEEeeccc Confidence 88999999999977665 22 No 151 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=98.66 E-value=3.9e-09 Score=66.70 Aligned_cols=291 Identities=10% Similarity=-0.039 Sum_probs=156.5 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccc-cccccceEEE----ecccceeeeee Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSY-DLRGGKSKQF----MFTGKLSAGYH 75 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r-~~~~G~tv~i----~~iG~~t~~~~ 75 (332) |++|.+++= .+-|+.- .++. .|-=..|-+.......+...+.++...+ ...++-+|+| |........+. T Consensus 1 ~~~~~~i~s---~~~~~~i-tv~~--ll~~P~~I~~~i~e~~~~~~iad~lf~~~~a~~~~~v~f~~~~p~~~~~d~e~V 74 (318) T protein:vir:10 1 MTAPTGIVS---VSDGPAI-TVRE--LVGNPLWIPTALKKMMVNQFISESLFRNGGANPNGVVAYNEGNPSFLEDDVADV 74 (318) T ss_pred CCCCCccee---eecCCce-ehHH--hhCCchhHHHHHHHHHhccchhhhhhhcccccccceeEEEecccccccCcHhhc Confidence 999866553 3444321 2221 1111344444444444444444444333 3455667888 44556677777 Q ss_pred cCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc--ccc Q lcl|Aclame:pro 76 TPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEAS--PVT 153 (332) Q Consensus 76 ~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~--~~~ 153 (332) .+|.++.-. ...+.+..+-.-+..--.+.|.|--......+......++++.+++++.|+..+..+..+....- ... T Consensus 75 aEggEiP~~-~~~~G~~~ia~~~K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~~~~s~~ 153 (318) T protein:vir:10 75 AEFGEIPVS-AGARGLPRTAFAVKKALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPTLAVPTA 153 (318) T ss_pred cCccccccc-CCCCCchhhhhhehhccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCCcC Confidence 888776533 34454444433223356678888777778999999999999999999999998887654331111 111 Q ss_pred cccccceeccccccccCHHHHHHHHHHHHHHHHh-------cCCCcCCCEEEEChHHHHHHHhhcCchhhcccccccccc Q lcl|Aclame:pro 154 GEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDE-------RSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGD 226 (332) Q Consensus 154 ~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde-------~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~ 226 (332) +..+..... +..++ ++.+-.+.-.+.. .+-.=.--.+|+.|..|..|+. ++.+... |...+.. T Consensus 154 w~~~~~~~~----d~~~A---~e~v~~a~~~~~~a~~~~~~~~~GY~pdtIVlhP~~~~~l~~--n~~~~~~-y~~~a~~ 223 (318) T protein:vir:10 154 WDNGGKVRT----DIAIA---IEQISTAAPTAYPAGVGSSDEYFGFIPDTIVMHYALLPILMD--NENFMKV-YERNANY 223 (318) T ss_pred CCCcccccc----cchhh---hhhhhhhhhhhhhhhhhhhhhccCccceeeEECHHHHHHHhc--chhhhhh-hhccchh Confidence 111111111 11111 1111111111111 1111111378999999999975 3554432 2111110 Q ss_pred c-----cccceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhh-hhccceeeeeecc Q lcl|Aclame:pro 227 M-----NSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCI-QSVAPTIQTTSGD 300 (332) Q Consensus 227 ~-----~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~-~~~~~~~e~~~~~ 300 (332) . ..|..-++++|++|+.|+++|.. + ++++.+..+|+- -..+++.+.++.| T Consensus 224 ~~~~~~~tg~~~g~~lGl~vi~s~~~p~~--~----------------------alvlq~g~vG~~~d~~pl~~t~~~~e 279 (318) T protein:vir:10 224 VSTAPDWTGNFPGSVMGLNVIRSRTFPID--R----------------------VLIMERGTVGFYSDTRPLQFTALYPE 279 (318) T ss_pred hhhcccccccccceeeceEEeecCccCCC--e----------------------eEEEecCCcceeeccccceeeecccC Confidence 0 12322357899999999999952 1 244555555432 2334566666654 Q ss_pred cc----hhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 301 FN----VQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 301 ~~----~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) .. ..-....++..+.....|.+|.++.-|.-= T Consensus 280 gg~~~g~~~~s~~~~~~~~~~~~V~~PkA~~~itgi 315 (318) T protein:vir:10 280 GNGPNGGPTESYRADASHKRALAVDQPKAALWLTGI 315 (318) T ss_pred CCCCCCCcchhhheehheeeeeeeeCcceeEEEeec Confidence 11 122345677788889999999975544433 No 152 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=98.65 E-value=1.9e-09 Score=68.41 Aligned_cols=273 Identities=12% Similarity=0.007 Sum_probs=145.4 Q ss_pred CCC--cccc----cccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc--cceee Q lcl|Aclame:pro 1 MTT--LSNF----SLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT--GKLSA 72 (332) Q Consensus 1 m~~--~~~~----~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i--G~~t~ 72 (332) |.. .... .++......+..++++ .+.=+.|..++.+..+..+.+++++++.+..+ .++|++ +..++ T Consensus 114 ~~~~~~~~~~~~~~~~~~a~~~~t~~~GG---~lIP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~---~~~p~~~~~~~~a 187 (402) T protein:vir:93 114 ILPNEFEKPSMEAQRLLHALPTGNDSGGD---KLLPKTLSKEIVSEPFAKNQLREKARLTNIKG---LEIPRVSYTLDDD 187 (402) T ss_pred HhhhhHHHHHHhHHHHHhhhccCCCcCCc---cccchhHHHHHHHhHHhhhhhhhhceeeecCC---ceeeeeeccCCcc Confidence 100 0000 0000001112222232 14458899999999999999999988776643 345654 33445 Q ss_pred eeecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 73 GYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPV 152 (332) Q Consensus 73 ~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~ 152 (332) .-...|...... +++.+++++.+.+. +.-+.|.+-=-..+.+|+.+.+.++.++++++..++.++... .. T Consensus 188 ~~v~Eg~~~~~~-~~~f~~i~~~~~k~-~~~i~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g--------~g 257 (402) T protein:vir:93 188 DFITDVETAKEL-KAKGDTVKFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVS--------PK 257 (402) T ss_pred cccccccccccc-ccccceeeecceee-eeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcC--------CC Confidence 555666665543 46677777776654 333456542222357889999999999999987766554221 11 Q ss_pred ccccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccce Q lcl|Aclame:pro 153 TGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKG 232 (332) Q Consensus 153 ~~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~ 232 (332) .+.+.+.....+ .....+...+|.|+++...|+..... .+.| ++.+..|..|+..+++. ++.+..|. T Consensus 258 ~g~p~g~~~~~~-~~~~~~~~~~d~l~~~~~~l~~~y~~-na~~-imn~~t~~~~~~~~~d~---------~~~~~~~~- 324 (402) T protein:vir:93 258 SGLEHMSFYNGS-VKEVEGADMYDAIINALADLHEDYRD-NATI-YMRYADYVKIISVLSNG---------TTNFFDTP- 324 (402) T ss_pred ccccceeeeccc-cccccccchHHHHHHHHhccChhhhc-CCEE-EEechHHHHHHHHHhcC---------CCcccccC- Confidence 222222211111 11223344588888888888776553 4466 56777666666433221 12233333 Q ss_pred eeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHH Q lcl|Aclame:pro 233 LYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVG 312 (332) Q Consensus 233 v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~ 312 (332) -..++|.+|+.++..|.. +-++|+.... .+.+ +..+.+++-.. ---.+++ T Consensus 325 ~~~llG~PV~~t~~~~~i----------------~~GDf~~~~~-~~~~----------~~~~~~~~~~~---~~~~~~~ 374 (402) T protein:vir:93 325 AEKVFGKPVVFTDAAVKP----------------IVGDFNYFGI-NYDG----------TTYDTDKDVKK---GEYLFVL 374 (402) T ss_pred CccccccceEEecCCCce----------------eeechhhhhh-hhhh----------hhhhhhhcccC---CceEEEE Confidence 347899999998866521 1233433211 1111 11111111000 0012445 Q ss_pred HHHhCCceechhheeeeec--C Q lcl|Aclame:pro 313 KLAMGCGSLRTSVAGSFQA--A 332 (332) Q Consensus 313 ~~~~G~~vlrpe~~v~i~~--A 332 (332) ..++|+++++|++...++- | T Consensus 375 ~~r~Dg~v~~~~A~~~l~ik~~ 396 (402) T protein:vir:93 375 TAWYDQQRTLDSAFRIAKAKEN 396 (402) T ss_pred EEEeCcEEechhheEEEEeecC Confidence 6688999999999875544 3 No 153 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=98.62 E-value=6.3e-09 Score=65.56 Aligned_cols=273 Identities=12% Similarity=0.014 Sum_probs=144.9 Q ss_pred CC-C--ccc---ccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc--cceee Q lcl|Aclame:pro 1 MT-T--LSN---FSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT--GKLSA 72 (332) Q Consensus 1 m~-~--~~~---~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i--G~~t~ 72 (332) |. + ... ..+.......+..++++ .|-=+.+..++.+..+..+.+++++++.+..+ .++|.+ +..+. T Consensus 64 ~~~~~~~~~~~~~~~~~~al~~~~~~~gG---~lIP~~~~~~Ii~~l~~~s~l~~~~~v~~~~~---~~~p~~~~~~~~a 137 (352) T protein:vir:78 64 ILPNEFEKPSMEAQRLLHALPTGNDSGGD---KLLPKTLSKEIVSEPFAKNQLREKARLTNIKG---LEIPRVSYTLDDD 137 (352) T ss_pred hhhhHHHHHHhhHHHHHHHhccCCCCCCc---eeccHhHHHHHHHHHHhhcchhhheeeEecCC---ceEEEEecCCCcc Confidence 10 0 000 00000011122223333 13348899999999999999999988766543 244543 22445 Q ss_pred eeecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 73 GYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPV 152 (332) Q Consensus 73 ~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~ 152 (332) .-...|..+... +++.+++++.+.++- .-+.|++-=-..+.+|+.+.+.++.++++++..++.++.. + .. T Consensus 138 ~~v~E~~~~~~~-~~~f~~v~~~~~k~~-~~i~is~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~---g-----~g 207 (352) T protein:vir:78 138 DFITDVETAKEL-KLKGDTVKFTTNKFK-VFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAV---S-----PK 207 (352) T ss_pred cccccccccccc-cccceeeeecceeEE-eechhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhhhhc---C-----CC Confidence 555566666543 567777777776642 3355655322335788999999999999987655544421 1 11 Q ss_pred ccccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccce Q lcl|Aclame:pro 153 TGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKG 232 (332) Q Consensus 153 ~~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~ 232 (332) .+.+-+.....+ .....+...+|.|.++...|+..... ...| ++.|..|..|+..+++. ++.+..|. T Consensus 208 ~~~~~g~l~~~~-~~~~t~~~~~d~i~~~~~~l~~~~~~-~a~~-~mn~~t~~~l~~~~~~~---------~~~~~~~~- 274 (352) T protein:vir:78 208 SGLEHMSFYNGS-VKEVEGANMYDAIINALADLHEDYRD-NATI-YMRYADYVKIISVLSNG---------TTNFFDTP- 274 (352) T ss_pred Ccccccceeccc-cccccccchHHHHHHHHhccChhhhc-CCEE-EEehHHHHHHHHHHhcc---------CCcccccC- Confidence 111111111111 11122333478888888777666543 3344 67888888887543321 12233332 Q ss_pred eeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHH Q lcl|Aclame:pro 233 LYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVG 312 (332) Q Consensus 233 v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~ 312 (332) -.+++|.+|+.++..|.. +-++|+.... .+ + ....+...+... --..+++ T Consensus 275 ~~~llG~PV~~~~~~~~~----------------~~Gdf~~~~~-~~--~--------~~~~~~~~~~~~---g~~~f~~ 324 (352) T protein:vir:78 275 AEKVFGKPVVFTDAAVKP----------------IVGDFNYFGI-NY--D--------GTTYDTDKDVKK---GEYLFVL 324 (352) T ss_pred CccccccceEEecCCCce----------------eEeehhhhhh-hh--h--------hheeeeeccccC---CeeEEEE Confidence 347899999998865421 1233332110 00 0 111111111110 0112445 Q ss_pred HHHhCCceechhheeeeecC Q lcl|Aclame:pro 313 KLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 313 ~~~~G~~vlrpe~~v~i~~A 332 (332) .++|++++++|++.+.+..+ T Consensus 325 ~~r~Dg~~~~~eA~~~l~~~ 344 (352) T protein:vir:78 325 TAWYDQQRTLDSAFRIAKAK 344 (352) T ss_pred EeeeCceeechhheEEEEee Confidence 67889999999998877655 No 154 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=98.61 E-value=3.4e-09 Score=67.01 Aligned_cols=272 Identities=13% Similarity=0.042 Sum_probs=144.4 Q ss_pred CCCc-cccccc----ccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc--cceeee Q lcl|Aclame:pro 1 MTTL-SNFSLP----NQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT--GKLSAG 73 (332) Q Consensus 1 m~~~-~~~~r~----~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i--G~~t~~ 73 (332) +... ....+. .-....+..++++ .+.=+.|..++.+..+..+.+++++++.+..+ .++|++ +..++. T Consensus 100 ~~~~~~~~~~~~~~~~~al~~~t~s~gG---~~IP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~---~~~p~~~~~~~~a~ 173 (387) T protein:vir:93 100 LPNEFEKPSMEAQRLLHALPTGNDSGGD---KLLPKTLSKEIVSEPFAKNQLREKARLTNIKG---LEIPRVSYTLDDDD 173 (387) T ss_pred hhhhhhhhhhhhHHHHHhhccCcCCCCc---eeechhHHHHHHHHHHhhchhhhheeeeecCC---ceEEEEeecCCccc Confidence 0000 000000 0001112222232 14458889999999999998999888776643 345543 334555 Q ss_pred eecCCCCCCccCCCCCceEEEEEeeeeecc-hhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 74 YHTPGTPIVGDAGIKANEKTLVMDDLLVSS-QFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPV 152 (332) Q Consensus 74 ~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~-~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~ 152 (332) -...|+.... .+++.+++++..-++ .. +.|++---..+.+|+.+.+.++.++++++..++.++.. ... T Consensus 174 ~v~E~~~~~~-~~~~f~~v~~~~~k~--~~~~~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~--------g~g 242 (387) T protein:vir:93 174 FITDVETAKE-LKLKGDTVKFTTNKF--KVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAV--------SPK 242 (387) T ss_pred cccCcccccc-cccccceeeeeheee--eeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhc--------CCC Confidence 5666766554 346677776666554 44 45654222345788999999999999998877655421 111 Q ss_pred ccccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccce Q lcl|Aclame:pro 153 TGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKG 232 (332) Q Consensus 153 ~~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~ 232 (332) .+.+.+.....+ .....+...+|.|+++...|+.+... ...| ++.+..|..|+..+++. ++.+..|. T Consensus 243 ~g~p~g~l~~~~-~~~v~~~~~~d~i~~~~~~l~~~~~~-~a~~-~mn~~t~~~~~~~~~d~---------~~~~~~~~- 309 (387) T protein:vir:93 243 SGLDHMSFYNGS-VKEVEGADMYDAIINALADLHEDYRD-NATI-YMRYADYVKIISVLSNG---------TTNFFDTP- 309 (387) T ss_pred ccccceeeeccc-cccccccchHHHHHHHHhccChhhhc-CCEE-EEechHHHHHHHHHhcC---------CCcccccC- Confidence 122222111111 11223344588888888888777654 3456 56777776666433221 12233332 Q ss_pred eeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHH Q lcl|Aclame:pro 233 LYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVG 312 (332) Q Consensus 233 v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~ 312 (332) -.+++|.+|+.++..|. -+-++|+.... .+.. +..+.... ...-...+++ T Consensus 310 ~~~llG~PV~~~~~~~~----------------~~~GDf~~~~~-~~~~----------~~~~~~~~---~~~~~~~~~~ 359 (387) T protein:vir:93 310 AEKVFGKPVVFTDAAVK----------------PIVGDFNYFGI-NYDG----------TTYDTDKD---VKKGEYLFVL 359 (387) T ss_pred CccccccceEEecCCCc----------------eeeeehhhhhe-ehhh----------heeeeccc---ccCCceeEEE Confidence 34789999999876542 12234443211 1111 11111110 0000112345 Q ss_pred HHHhCCceechhheeeee--cC Q lcl|Aclame:pro 313 KLAMGCGSLRTSVAGSFQ--AA 332 (332) Q Consensus 313 ~~~~G~~vlrpe~~v~i~--~A 332 (332) ..+||+++++|++.+.+. +| T Consensus 360 ~~r~d~~v~~~eA~~~l~~k~~ 381 (387) T protein:vir:93 360 TAWYDQQRTLDSAFRIAKAKEN 381 (387) T ss_pred EeeeCceeechhheEEEEeecC Confidence 568899999999987543 33 No 155 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=98.58 E-value=4.1e-09 Score=66.54 Aligned_cols=273 Identities=12% Similarity=0.018 Sum_probs=144.8 Q ss_pred CCCcc---ccc---ccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc--cceee Q lcl|Aclame:pro 1 MTTLS---NFS---LPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT--GKLSA 72 (332) Q Consensus 1 m~~~~---~~~---r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i--G~~t~ 72 (332) |.... ... +..-....+..++++ .+.=+.|..++++..+..+.+++++++.+..+ .++|++ +..++ T Consensus 99 ~~~~~~~~~~~~~~~~~~a~~~~~~~~gG---~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~---~~~p~~~~~~~~a 172 (387) T protein:vir:96 99 ILPNEFEKPSMEAQRLLHALPTGNDSGGD---KLLPKTLSKEIVSEPFAKNQLREKARLTNIKG---LEIPRVSYTLDDD 172 (387) T ss_pred HhhhhHHHHHHHHHHHHhhhccCCCCCCc---eeechhHHHHHHHHHHhhchhhhhceeeecCC---ceeeeeeccCCcc Confidence 10000 000 000001112222233 14458899999999999999999888776643 345543 23445 Q ss_pred eeecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 73 GYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPV 152 (332) Q Consensus 73 ~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~ 152 (332) .-...|..... .+++.+++++...++ ..-+.|.+-=-..+.+++.+.+.++.++++++..++.++... .. T Consensus 173 ~~v~Eg~~~~~-~~~~f~~v~l~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g--------~g 242 (387) T protein:vir:96 173 DFITDVETAKE-LKAKGDTVKFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVS--------PK 242 (387) T ss_pred ccccccccccc-cccccceeeechhee-eeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcC--------CC Confidence 55566666654 356777777777664 233456542222356889999999999999987676555321 11 Q ss_pred ccccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccce Q lcl|Aclame:pro 153 TGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKG 232 (332) Q Consensus 153 ~~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~ 232 (332) .+.+.+.....+. ....+...+|.|+++...|+.+..+ ...| ++.+..|..|+..+++. ++.+..|. T Consensus 243 ~g~~~g~~~~~~~-~~~~~~~~~d~i~~~~~~l~~~y~~-na~~-imn~~t~~~~~~~~~~~---------~~~~~~~~- 309 (387) T protein:vir:96 243 SGLEHMSFYNGSV-KEVEGADMYDAIINALADLHEDYRD-NATI-YMRYADYVKIISVLSNG---------TTNFFDTP- 309 (387) T ss_pred ccccceeeecccc-ccccccchHHHHHHHHhccChhhhc-CCEE-EEechHHHHHHHHHhcC---------CCcccccC- Confidence 1222221111111 1122344578888888888776554 3456 56777676666533321 12233343 Q ss_pred eeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHH Q lcl|Aclame:pro 233 LYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVG 312 (332) Q Consensus 233 v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~ 312 (332) -.+++|.+|+.++..|.. +-++|+.... .+. ++..+.+++ ...---.+++ T Consensus 310 ~~~llG~PV~~~~~~~~~----------------~~GDf~~~~~-~~~----------~~~~~~~~~---~~~~~~~~~~ 359 (387) T protein:vir:96 310 AEKVFGKPVVFTDAAVKP----------------IVGDFNYFGI-NYD----------GTTYDTDKD---VKKGEYLFVL 359 (387) T ss_pred CccccccceEEecCCCce----------------eeechhhhhh-hhh----------hhhheeccc---ccCCceEEEE Confidence 357899999998865521 1233332211 111 011111111 0000012344 Q ss_pred HHHhCCceechhheeeeecC Q lcl|Aclame:pro 313 KLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 313 ~~~~G~~vlrpe~~v~i~~A 332 (332) ..+|++++++|++.+.+.-. T Consensus 360 ~~r~Dg~v~~~~A~~~l~~k 379 (387) T protein:vir:96 360 TAWYDQQRTLDSAFRIAKAK 379 (387) T ss_pred EEEeCcEeechhheEEEEee Confidence 56899999999998876653 No 156 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=98.58 E-value=4.1e-09 Score=66.54 Aligned_cols=273 Identities=12% Similarity=0.018 Sum_probs=144.8 Q ss_pred CCCcc---ccc---ccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc--cceee Q lcl|Aclame:pro 1 MTTLS---NFS---LPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT--GKLSA 72 (332) Q Consensus 1 m~~~~---~~~---r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i--G~~t~ 72 (332) |.... ... +..-....+..++++ .+.=+.|..++++..+..+.+++++++.+..+ .++|++ +..++ T Consensus 99 ~~~~~~~~~~~~~~~~~~a~~~~~~~~gG---~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~---~~~p~~~~~~~~a 172 (387) T protein:vir:26 99 ILPNEFEKPSMEAQRLLHALPTGNDSGGD---KLLPKTLSKEIVSEPFAKNQLREKARLTNIKG---LEIPRVSYTLDDD 172 (387) T ss_pred HhhhhHHHHHHHHHHHHhhhccCCCCCCc---eeechhHHHHHHHHHHhhchhhhhceeeecCC---ceeeeeeccCCcc Confidence 10000 000 000001112222233 14458899999999999999999888776643 345543 23445 Q ss_pred eeecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 73 GYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPV 152 (332) Q Consensus 73 ~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~ 152 (332) .-...|..... .+++.+++++...++ ..-+.|.+-=-..+.+++.+.+.++.++++++..++.++... .. T Consensus 173 ~~v~Eg~~~~~-~~~~f~~v~l~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g--------~g 242 (387) T protein:vir:26 173 DFITDVETAKE-LKAKGDTVKFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVS--------PK 242 (387) T ss_pred ccccccccccc-cccccceeeechhee-eeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcC--------CC Confidence 55566666654 356777777777664 233456542222356889999999999999987676555321 11 Q ss_pred ccccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccce Q lcl|Aclame:pro 153 TGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKG 232 (332) Q Consensus 153 ~~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~ 232 (332) .+.+.+.....+. ....+...+|.|+++...|+.+..+ ...| ++.+..|..|+..+++. ++.+..|. T Consensus 243 ~g~~~g~~~~~~~-~~~~~~~~~d~i~~~~~~l~~~y~~-na~~-imn~~t~~~~~~~~~~~---------~~~~~~~~- 309 (387) T protein:vir:26 243 SGLEHMSFYNGSV-KEVEGADMYDAIINALADLHEDYRD-NATI-YMRYADYVKIISVLSNG---------TTNFFDTP- 309 (387) T ss_pred ccccceeeecccc-ccccccchHHHHHHHHhccChhhhc-CCEE-EEechHHHHHHHHHhcC---------CCcccccC- Confidence 1222221111111 1122344578888888888776554 3456 56777676666533321 12233343 Q ss_pred eeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHH Q lcl|Aclame:pro 233 LYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVG 312 (332) Q Consensus 233 v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~ 312 (332) -.+++|.+|+.++..|.. +-++|+.... .+. ++..+.+++ ...---.+++ T Consensus 310 ~~~llG~PV~~~~~~~~~----------------~~GDf~~~~~-~~~----------~~~~~~~~~---~~~~~~~~~~ 359 (387) T protein:vir:26 310 AEKVFGKPVVFTDAAVKP----------------IVGDFNYFGI-NYD----------GTTYDTDKD---VKKGEYLFVL 359 (387) T ss_pred CccccccceEEecCCCce----------------eeechhhhhh-hhh----------hhhheeccc---ccCCceEEEE Confidence 357899999998865521 1233332211 111 011111111 0000012344 Q ss_pred HHHhCCceechhheeeeecC Q lcl|Aclame:pro 313 KLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 313 ~~~~G~~vlrpe~~v~i~~A 332 (332) ..+|++++++|++.+.+.-. T Consensus 360 ~~r~Dg~v~~~~A~~~l~~k 379 (387) T protein:vir:26 360 TAWYDQQRTLDSAFRIAKAK 379 (387) T ss_pred EEEeCcEeechhheEEEEee Confidence 56899999999998876653 No 157 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=98.58 E-value=4.1e-09 Score=66.54 Aligned_cols=273 Identities=12% Similarity=0.018 Sum_probs=144.8 Q ss_pred CCCcc---ccc---ccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc--cceee Q lcl|Aclame:pro 1 MTTLS---NFS---LPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT--GKLSA 72 (332) Q Consensus 1 m~~~~---~~~---r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i--G~~t~ 72 (332) |.... ... +..-....+..++++ .+.=+.|..++++..+..+.+++++++.+..+ .++|++ +..++ T Consensus 99 ~~~~~~~~~~~~~~~~~~a~~~~~~~~gG---~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~---~~~p~~~~~~~~a 172 (387) T protein:vir:94 99 ILPNEFEKPSMEAQRLLHALPTGNDSGGD---KLLPKTLSKEIVSEPFAKNQLREKARLTNIKG---LEIPRVSYTLDDD 172 (387) T ss_pred HhhhhHHHHHHHHHHHHhhhccCCCCCCc---eeechhHHHHHHHHHHhhchhhhhceeeecCC---ceeeeeeccCCcc Confidence 10000 000 000001112222233 14458899999999999999999888776643 345543 23445 Q ss_pred eeecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 73 GYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPV 152 (332) Q Consensus 73 ~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~ 152 (332) .-...|..... .+++.+++++...++ ..-+.|.+-=-..+.+++.+.+.++.++++++..++.++... .. T Consensus 173 ~~v~Eg~~~~~-~~~~f~~v~l~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g--------~g 242 (387) T protein:vir:94 173 DFITDVETAKE-LKAKGDTVKFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVS--------PK 242 (387) T ss_pred ccccccccccc-cccccceeeechhee-eeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcC--------CC Confidence 55566666654 356777777777664 233456542222356889999999999999987676555321 11 Q ss_pred ccccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccce Q lcl|Aclame:pro 153 TGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKG 232 (332) Q Consensus 153 ~~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~ 232 (332) .+.+.+.....+. ....+...+|.|+++...|+.+..+ ...| ++.+..|..|+..+++. ++.+..|. T Consensus 243 ~g~~~g~~~~~~~-~~~~~~~~~d~i~~~~~~l~~~y~~-na~~-imn~~t~~~~~~~~~~~---------~~~~~~~~- 309 (387) T protein:vir:94 243 SGLEHMSFYNGSV-KEVEGADMYDAIINALADLHEDYRD-NATI-YMRYADYVKIISVLSNG---------TTNFFDTP- 309 (387) T ss_pred ccccceeeecccc-ccccccchHHHHHHHHhccChhhhc-CCEE-EEechHHHHHHHHHhcC---------CCcccccC- Confidence 1222221111111 1122344578888888888776554 3456 56777676666533321 12233343 Q ss_pred eeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHH Q lcl|Aclame:pro 233 LYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVG 312 (332) Q Consensus 233 v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~ 312 (332) -.+++|.+|+.++..|.. +-++|+.... .+. ++..+.+++ ...---.+++ T Consensus 310 ~~~llG~PV~~~~~~~~~----------------~~GDf~~~~~-~~~----------~~~~~~~~~---~~~~~~~~~~ 359 (387) T protein:vir:94 310 AEKVFGKPVVFTDAAVKP----------------IVGDFNYFGI-NYD----------GTTYDTDKD---VKKGEYLFVL 359 (387) T ss_pred CccccccceEEecCCCce----------------eeechhhhhh-hhh----------hhhheeccc---ccCCceEEEE Confidence 357899999998865521 1233332211 111 011111111 0000012344 Q ss_pred HHHhCCceechhheeeeecC Q lcl|Aclame:pro 313 KLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 313 ~~~~G~~vlrpe~~v~i~~A 332 (332) ..+|++++++|++.+.+.-. T Consensus 360 ~~r~Dg~v~~~~A~~~l~~k 379 (387) T protein:vir:94 360 TAWYDQQRTLDSAFRIAKAK 379 (387) T ss_pred EEEeCcEeechhheEEEEee Confidence 56899999999998876653 No 158 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=98.57 E-value=2.4e-09 Score=67.81 Aligned_cols=273 Identities=13% Similarity=0.037 Sum_probs=141.2 Q ss_pred CCCccc-ccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccc-eEEEecccceeeeeecCC Q lcl|Aclame:pro 1 MTTLSN-FSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGK-SKQFMFTGKLSAGYHTPG 78 (332) Q Consensus 1 m~~~~~-~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~-tv~i~~iG~~t~~~~~~g 78 (332) +..... ..+. ..+....++. .+-.+.+..++.+. .....+.++++..++..++ .+.++..+.........+ T Consensus 121 ~~~~~~~~~~~---~~~~~~~~~~---~~vp~~~~~~i~~~-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~ 193 (397) T protein:vir:96 121 NAFVKSKGAEK---RDGFTSVEGG---ALIPQELLQPQLEP-KDIVDLSKYVRSVPVNSASGKFPVISKSGSKMATVQQL 193 (397) T ss_pred HHHHHhhhhhh---hhcccccccc---cchhHHHHHHHHHh-hhhhhHHHhhhhccccccceeEEEEeccCCcccccccc Confidence 000000 0000 0111111111 24557888888774 3344445666655554332 334444455555555555 Q ss_pred CCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Q lcl|Aclame:pro 79 TPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGG 158 (332) Q Consensus 79 ~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~~ 158 (332) .......++...++++.+.+. +.-..|.+---.++.+|+.+.+.++.++++++..|..|+.-... +. T Consensus 194 ~~~~~~~~~~~~~i~~~~~~~-~~~~~~s~ell~ds~~~l~~~i~~~l~~~~~~~~~~~i~~g~g~---------~~--- 260 (397) T protein:vir:96 194 EKNPQLANPKMVEIDYSVATR-RGYIPISQEMIDDASYDVTGLIADEIQDQSLNTKNADIAAVLKT---------AT--- 260 (397) T ss_pred ccccccccccccceeecHhHh-hcchhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc---------cc--- Confidence 544432345667777777654 33344544222335678899999999999999999877642110 00 Q ss_pred ceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeeeec Q lcl|Aclame:pro 159 FHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIAG 238 (332) Q Consensus 159 ~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i~G 238 (332) +..... +|.|.++....... .. +-..|++|..|..|..-+|.. ++ +.- ...+..|. -++++| T Consensus 261 ------~~~~~~----~d~~~~~~~~~~~~-~~--~a~~v~n~~~~~~l~~lkd~~--G~-~~~-~~~~~~~~-~~~l~G 322 (397) T protein:vir:96 261 ------AKSVVG----VDGLKDLINKEIKK-VY--DVKLFISASMYSELDKLKDKN--GR-YLL-QDSITAAS-GKQLLG 322 (397) T ss_pred ------cccccc----hHHHHHHHHHhhhh-hc--CcEEEEcHHHHHHHHHhhccC--CC-eEe-ccCccCCC-cccccc Confidence 011112 44455443332221 22 335689999999987644321 11 211 11233342 468999 Q ss_pred eEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHHhCC Q lcl|Aclame:pro 239 IRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAMGC 318 (332) Q Consensus 239 ~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~~G~ 318 (332) .+|+.+++.+... ..|...-+-++|++.+. ++- ..+++++.... .+|...+++.+++|+ T Consensus 323 ~pv~~~~~~~~~~--------~~~~~~~~~gd~~~~~~-~~~--------~~~~~~~~~~~----~~~~~~~~~~~r~d~ 381 (397) T protein:vir:96 323 KEVVVLDDDVIGK--------SVGNVVGFIGDAKAFAS-FFD--------RKQVSVSWVDN----NIYGQLLAGIIRYDV 381 (397) T ss_pred cceEEecccccCC--------CCCceEEEEeehhcceE-eEe--------ecceEEEEecc----cccceeEEEEEEEcc Confidence 9999887653211 11222233455554322 222 22233333222 234456788899999 Q ss_pred ceechhheeeee--cC Q lcl|Aclame:pro 319 GSLRTSVAGSFQ--AA 332 (332) Q Consensus 319 ~vlrpe~~v~i~--~A 332 (332) ++++|++.+.|. +| T Consensus 382 ~~~~~~a~~~~~~~~a 397 (397) T protein:vir:96 382 KATDKKAGFYVTFTIG 397 (397) T ss_pred EEecccceEEEEeecC Confidence 999999998774 44 No 159 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=98.49 E-value=3.2e-08 Score=61.65 Aligned_cols=261 Identities=15% Similarity=0.113 Sum_probs=140.4 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEeccc-ceeeeeecCCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTG-KLSAGYHTPGT 79 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG-~~t~~~~~~g~ 79 (332) |+. .|++-.- ..+...+ . -|+++|+.-+.+-+ .+++.+|.-+...|++++||.-. .....++..|. T Consensus 1 mAe-~nlt~~~--dL~~~~s-i-----dfv~~f~~~i~~L~----~~Lgi~r~~p~a~G~tIt~pK~~~tgda~dVaEGe 67 (295) T protein:vir:99 1 MAE-KNLNTMA--DLGDIKS-I-----DFVNKFSKNINDLL----KLLGVTRRETLTNDLKIQTYKWEVTLDQTDPGEGE 67 (295) T ss_pred CCC-cccccHh--hccCcee-e-----hhhHHhhhhHHHHH----HHhccccccccccCCeEEeeeeeeecccccccCCc Confidence 888 5666431 2222221 2 39999986554432 24577777788999999999865 45668899999 Q ss_pred CCCccCCCCC---ceEEEEEeeeeecchhhhhHHHH-H-h-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|Aclame:pro 80 PIVGDAGIKA---NEKTLVMDDLLVSSQFVYSLDEI-F-S-QYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVT 153 (332) Q Consensus 80 ~~~~~~~~~~---~~~~l~ID~~~~~~~~Idd~D~~-q-~-~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~ 153 (332) .|+.+ .++. +..++++.++.- .+. ||+ | . ..|...+..+++..+|++.+|..++..+..+..+. T Consensus 68 ~Ipls-kvt~~~~~t~t~kikK~rK---~tT--dEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lktat~t~---- 137 (295) T protein:vir:99 68 TIPLS-KVTRTKDKDYTVKWFKKRR---ATT--AEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKTKPTKV---- 137 (295) T ss_pred ccchh-hheeeeeeeeEEEeeeecc---ccc--HHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhccCceee---- Confidence 99765 4553 357777876433 243 455 4 3 45689999999999999999999998773221110 Q ss_pred cccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhcccccccccccccccee Q lcl|Aclame:pro 154 GEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGL 233 (332) Q Consensus 154 ~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v 233 (332) ....-+..++.+..+...+.|.+= ...+++|+|..++.||....-. +...+ ...-..+ T Consensus 138 -------------tg~~lq~a~a~~~~al~~f~Ee~~--~~~V~FVnP~D~a~yl~~A~~~-----~~~a~--~fG~~~L 195 (295) T protein:vir:99 138 -------------KGVGLQKALSASWAKLATFNEFEG--SPLVSFVSPLDVANYLGDTKVG-----ADASN--VFGMTLL 195 (295) T ss_pred -------------ehhhHHHHHHHhhhhhhhcccccC--CceEEEEehHHHHHHHhccccc-----cchhh--hhhhhhh Confidence 001112234444444444433321 1369999999999998642211 11110 0111235 Q ss_pred eeeeceE-EEeeCcccccccccccccccccccccc--------------cccccceEEEeechhhhhhhhhccceeeeee Q lcl|Aclame:pro 234 YSIAGIR-ILKSNNLAGLYGQDLSSAAVTGENNDY--------------QVDASALAGLIFHREAAGCIQSVAPTIQTTS 298 (332) Q Consensus 234 ~~i~G~~-V~~sn~lp~~~g~~~~~~~~~g~~~~y--------------~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~ 298 (332) -+++|++ |+.|+.+|.. +-+.+. ..+....| ..|.+..+|+. |.. T Consensus 196 ~nfLG~q~II~S~kv~~G--~~~aT~-~~Ni~~ay~~~~~g~l~~~f~~~~D~tglIg~~-h~~---------------- 255 (295) T protein:vir:99 196 KNFLGMQNVIVMPSVPEG--KIYSTA-VENLVFASLNVKGGDLGGLFADFTDETGLIAAA-RNR---------------- 255 (295) T ss_pred hhhhccceEEEcccCCCc--eEEEee-ccceEEEEecCCchhhhhhhhhccCcccceEEE-ecc---------------- Confidence 5789997 9999999952 111111 11111111 11222222211 111 Q ss_pred cccchhHHHHHHHHHHHhCCceechhhee-----eeecC Q lcl|Aclame:pro 299 GDFNVQYQGDLIVGKLAMGCGSLRTSVAG-----SFQAA 332 (332) Q Consensus 299 ~~~~~~~~~d~i~~~~~~G~~vlrpe~~v-----~i~~A 332 (332) ...+..+. -+++++-.+=||..- .|..+ T Consensus 256 ~~~~~t~e------t~~~~~~~lfpE~~dgiv~~tI~~~ 288 (295) T protein:vir:99 256 QLSNLTYE------SVFFGANVLFAEIPEGVVEATIEAA 288 (295) T ss_pred ccceeeeh------hhhHhHHHhcccccceEEEEEEecC Confidence 11111111 123334444444332 33333 No 160 >protein:vir:819 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050552;genbank:gi:9633449;genbank:GeneID:1262254 Probab=98.44 E-value=6.3e-08 Score=60.06 Aligned_cols=321 Identities=14% Similarity=0.092 Sum_probs=163.6 Q ss_pred CCCccccccccc-ccccccccccCchhhHHHHHHhHHHHHHHHHhhhhc---------cccccccc--cccceEEEeccc Q lcl|Aclame:pro 1 MTTLSNFSLPNQ-ANGGARNADYDVRYATALKLFSGEVFTAFNNASIFK---------GLVRSYDL--RGGKSKQFMFTG 68 (332) Q Consensus 1 m~~~~~~~r~~~-~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~---------~~v~~r~~--~~G~tv~i~~iG 68 (332) ||.+ +-|+. .+|..+--..-.++.-++++|.+.+...-+..+-+. +.++..++ ..|++|.|+-+. T Consensus 1 ~~~~---~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~ 77 (404) T protein:vir:81 1 MTTV---TSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMH 77 (404) T ss_pred CCCc---CCcchhhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEee Confidence 7663 22221 122111000000112357778776544333222222 22222333 348999998876 Q ss_pred ceeeeeecCCCCCCcc-CCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 69 KLSAGYHTPGTPIVGD-AGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASA 147 (332) Q Consensus 69 ~~t~~~~~~g~~~~~~-~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~ 147 (332) ..+-....-++.+.+. +.++....+|+||+..-.-..=..+++-.+.+|+|++.-..++.-+++..|+.++..++.+.. T Consensus 78 ~L~g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg 157 (404) T protein:vir:81 78 KLSKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARG 157 (404) T ss_pred ecccCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc Confidence 6654444444455554 457788889999986533111146777778999999999999999999999999988875432 Q ss_pred h--------------------ccccccccccceeccccccccCH------HHH-HHHHHHHHHHHHhcCCCc-------C Q lcl|Aclame:pro 148 E--------------------ASPVTGEPGGFHVNIGAGNTNDA------QAI-VDGFFEAAAVLDERSAPQ-------E 193 (332) Q Consensus 148 ~--------------------~~~~~~~~~~~~i~~~~~~~~~~------~~~-~d~i~~a~~~Lde~~VP~-------~ 193 (332) . .+++. .|....+-.+ +++++- +.+ ++.|-++.+.+++..-|- + T Consensus 158 ~~~n~~~~vp~~~~~~~~~~~~N~v~-APt~~r~~~~-g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~ 235 (404) T protein:vir:81 158 DFVADDTILPTAEHPEFKKIMINDVL-PPTHDRHFFG-GDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGD 235 (404) T ss_pred ccccccceeeccccccccceeecccC-CCCCCcEEec-cCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccc Confidence 1 00110 1111111000 111111 111 455667777887744442 1 Q ss_pred C-------CEEEEChHHHHHHHhhcC-chhhc----c--ccccccccccccceeeeeeceEEEeeCcccc--cccccccc Q lcl|Aclame:pro 194 G-------RVAVLSPRQYYSLISSVD-TNILN----R--EIGNSQGDMNSGKGLYSIAGIRILKSNNLAG--LYGQDLSS 257 (332) Q Consensus 194 g-------R~~vv~P~~~~~Ll~~~d-~~~~~----~--d~~~~~~~~~~g~~v~~i~G~~V~~sn~lp~--~~g~~~~~ 257 (332) . ++++++|.+|..|.+... ..+.+ + ...+.+..+-.|. ++.|.|+-|.+..+.|- ..+..... T Consensus 236 ~~~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~-~gm~ngvii~~~~~~~Irf~~g~~~~~ 314 (404) T protein:vir:81 236 ELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGE-CAMWRNILVRKYAGMPIRFYQGSKVLV 314 (404) T ss_pred cccCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecC-eeEEcCEEEEecCCceeeecccceeee Confidence 2 788899999999975311 11222 1 1113445677775 99999999999888772 22221111 Q ss_pred ccc-cc-ccccccccccceEEEeechhhh--hhhhhcc----ceeeeeecccchhHHHHHHHHHHHhCCceec-hh---- Q lcl|Aclame:pro 258 AAV-TG-ENNDYQVDASALAGLIFHREAA--GCIQSVA----PTIQTTSGDFNVQYQGDLIVGKLAMGCGSLR-TS---- 324 (332) Q Consensus 258 ~~~-~g-~~~~y~~~~~~~~~l~~h~~a~--~~~~~~~----~~~e~~~~~~~~~~~~d~i~~~~~~G~~vlr-pe---- 324 (332) +.. .+ .+......++-..+|++-.+|+ +.++.-+ +.-|.+.. .+. -.|.....+|.+=.| |. T Consensus 315 ~~n~~~a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~-g~~----~~i~~~~i~G~kK~rF~~~~g~ 389 (404) T protein:vir:81 315 SENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDM-DNR----TEIAISWINGLKKIRFPEKSGK 389 (404) T ss_pred cCCccccccccccccccchhheeecceeEEEEeeccCCCCceeEeecccc-Cch----hhhhhHHHhhhhhccccCCCCc Confidence 111 00 0111112122223344444443 2222211 22222221 111 235556677877777 52 Q ss_pred ----heeeeecC Q lcl|Aclame:pro 325 ----VAGSFQAA 332 (332) Q Consensus 325 ----~~v~i~~A 332 (332) ++++|-+| T Consensus 390 ~~DfGvi~idta 401 (404) T protein:vir:81 390 MQDHGVIAVDTA 401 (404) T ss_pred eeeEEEEEeccc Confidence 46677777 No 161 >protein:vir:3298 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049514;genbank:gi:9632520;genbank:GeneID:1262006 Probab=98.44 E-value=6.3e-08 Score=60.06 Aligned_cols=321 Identities=14% Similarity=0.092 Sum_probs=163.6 Q ss_pred CCCccccccccc-ccccccccccCchhhHHHHHHhHHHHHHHHHhhhhc---------cccccccc--cccceEEEeccc Q lcl|Aclame:pro 1 MTTLSNFSLPNQ-ANGGARNADYDVRYATALKLFSGEVFTAFNNASIFK---------GLVRSYDL--RGGKSKQFMFTG 68 (332) Q Consensus 1 m~~~~~~~r~~~-~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~---------~~v~~r~~--~~G~tv~i~~iG 68 (332) ||.+ +-|+. .+|..+--..-.++.-++++|.+.+...-+..+-+. +.++..++ ..|++|.|+-+. T Consensus 1 ~~~~---~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~ 77 (404) T protein:vir:32 1 MTTV---TSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMH 77 (404) T ss_pred CCCc---CCcchhhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEee Confidence 7663 22221 122111000000112357778776544333222222 22222333 348999998876 Q ss_pred ceeeeeecCCCCCCcc-CCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 69 KLSAGYHTPGTPIVGD-AGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASA 147 (332) Q Consensus 69 ~~t~~~~~~g~~~~~~-~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~ 147 (332) ..+-....-++.+.+. +.++....+|+||+..-.-..=..+++-.+.+|+|++.-..++.-+++..|+.++..++.+.. T Consensus 78 ~L~g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg 157 (404) T protein:vir:32 78 KLSKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARG 157 (404) T ss_pred ecccCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc Confidence 6654444444455554 457788889999986533111146777778999999999999999999999999988875432 Q ss_pred h--------------------ccccccccccceeccccccccCH------HHH-HHHHHHHHHHHHhcCCCc-------C Q lcl|Aclame:pro 148 E--------------------ASPVTGEPGGFHVNIGAGNTNDA------QAI-VDGFFEAAAVLDERSAPQ-------E 193 (332) Q Consensus 148 ~--------------------~~~~~~~~~~~~i~~~~~~~~~~------~~~-~d~i~~a~~~Lde~~VP~-------~ 193 (332) . .+++. .|....+-.+ +++++- +.+ ++.|-++.+.+++..-|- + T Consensus 158 ~~~n~~~~vp~~~~~~~~~~~~N~v~-APt~~r~~~~-g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~ 235 (404) T protein:vir:32 158 DFVADDTILPTAEHPEFKKIMINDVL-PPTHDRHFFG-GDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGD 235 (404) T ss_pred ccccccceeeccccccccceeecccC-CCCCCcEEec-cCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccc Confidence 1 00110 1111111000 111111 111 455667777887744442 1 Q ss_pred C-------CEEEEChHHHHHHHhhcC-chhhc----c--ccccccccccccceeeeeeceEEEeeCcccc--cccccccc Q lcl|Aclame:pro 194 G-------RVAVLSPRQYYSLISSVD-TNILN----R--EIGNSQGDMNSGKGLYSIAGIRILKSNNLAG--LYGQDLSS 257 (332) Q Consensus 194 g-------R~~vv~P~~~~~Ll~~~d-~~~~~----~--d~~~~~~~~~~g~~v~~i~G~~V~~sn~lp~--~~g~~~~~ 257 (332) . ++++++|.+|..|.+... ..+.+ + ...+.+..+-.|. ++.|.|+-|.+..+.|- ..+..... T Consensus 236 ~~~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~-~gm~ngvii~~~~~~~Irf~~g~~~~~ 314 (404) T protein:vir:32 236 ELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGE-CAMWRNILVRKYAGMPIRFYQGSKVLV 314 (404) T ss_pred cccCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecC-eeEEcCEEEEecCCceeeecccceeee Confidence 2 788899999999975311 11222 1 1113445677775 99999999999888772 22221111 Q ss_pred ccc-cc-ccccccccccceEEEeechhhh--hhhhhcc----ceeeeeecccchhHHHHHHHHHHHhCCceec-hh---- Q lcl|Aclame:pro 258 AAV-TG-ENNDYQVDASALAGLIFHREAA--GCIQSVA----PTIQTTSGDFNVQYQGDLIVGKLAMGCGSLR-TS---- 324 (332) Q Consensus 258 ~~~-~g-~~~~y~~~~~~~~~l~~h~~a~--~~~~~~~----~~~e~~~~~~~~~~~~d~i~~~~~~G~~vlr-pe---- 324 (332) +.. .+ .+......++-..+|++-.+|+ +.++.-+ +.-|.+.. .+. -.|.....+|.+=.| |. T Consensus 315 ~~n~~~a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~-g~~----~~i~~~~i~G~kK~rF~~~~g~ 389 (404) T protein:vir:32 315 SENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDM-DNR----TEIAISWINGLKKIRFPEKSGK 389 (404) T ss_pred cCCccccccccccccccchhheeecceeEEEEeeccCCCCceeEeecccc-Cch----hhhhhHHHhhhhhccccCCCCc Confidence 111 00 0111112122223344444443 2222211 22222221 111 235556677877777 52 Q ss_pred ----heeeeecC Q lcl|Aclame:pro 325 ----VAGSFQAA 332 (332) Q Consensus 325 ----~~v~i~~A 332 (332) ++++|-+| T Consensus 390 ~~DfGvi~idta 401 (404) T protein:vir:32 390 MQDHGVIAVDTA 401 (404) T ss_pred eeeEEEEEeccc Confidence 46677777 No 162 >protein:vir:10123 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859253;genbank:gi:32171009;genbank:GeneID:2653345 Probab=98.44 E-value=6.3e-08 Score=60.06 Aligned_cols=321 Identities=14% Similarity=0.092 Sum_probs=163.6 Q ss_pred CCCccccccccc-ccccccccccCchhhHHHHHHhHHHHHHHHHhhhhc---------cccccccc--cccceEEEeccc Q lcl|Aclame:pro 1 MTTLSNFSLPNQ-ANGGARNADYDVRYATALKLFSGEVFTAFNNASIFK---------GLVRSYDL--RGGKSKQFMFTG 68 (332) Q Consensus 1 m~~~~~~~r~~~-~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~---------~~v~~r~~--~~G~tv~i~~iG 68 (332) ||.+ +-|+. .+|..+--..-.++.-++++|.+.+...-+..+-+. +.++..++ ..|++|.|+-+. T Consensus 1 ~~~~---~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~ 77 (404) T protein:vir:10 1 MTTV---TSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMH 77 (404) T ss_pred CCCc---CCcchhhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEee Confidence 7663 22221 122111000000112357778776544333222222 22222333 348999998876 Q ss_pred ceeeeeecCCCCCCcc-CCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 69 KLSAGYHTPGTPIVGD-AGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASA 147 (332) Q Consensus 69 ~~t~~~~~~g~~~~~~-~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~ 147 (332) ..+-....-++.+.+. +.++....+|+||+..-.-..=..+++-.+.+|+|++.-..++.-+++..|+.++..++.+.. T Consensus 78 ~L~g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg 157 (404) T protein:vir:10 78 KLSKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARG 157 (404) T ss_pred ecccCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc Confidence 6654444444455554 457788889999986533111146777778999999999999999999999999988875432 Q ss_pred h--------------------ccccccccccceeccccccccCH------HHH-HHHHHHHHHHHHhcCCCc-------C Q lcl|Aclame:pro 148 E--------------------ASPVTGEPGGFHVNIGAGNTNDA------QAI-VDGFFEAAAVLDERSAPQ-------E 193 (332) Q Consensus 148 ~--------------------~~~~~~~~~~~~i~~~~~~~~~~------~~~-~d~i~~a~~~Lde~~VP~-------~ 193 (332) . .+++. .|....+-.+ +++++- +.+ ++.|-++.+.+++..-|- + T Consensus 158 ~~~n~~~~vp~~~~~~~~~~~~N~v~-APt~~r~~~~-g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~ 235 (404) T protein:vir:10 158 DFVADDTILPTAEHPEFKKIMINDVL-PPTHDRHFFG-GDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGD 235 (404) T ss_pred ccccccceeeccccccccceeecccC-CCCCCcEEec-cCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccc Confidence 1 00110 1111111000 111111 111 455667777887744442 1 Q ss_pred C-------CEEEEChHHHHHHHhhcC-chhhc----c--ccccccccccccceeeeeeceEEEeeCcccc--cccccccc Q lcl|Aclame:pro 194 G-------RVAVLSPRQYYSLISSVD-TNILN----R--EIGNSQGDMNSGKGLYSIAGIRILKSNNLAG--LYGQDLSS 257 (332) Q Consensus 194 g-------R~~vv~P~~~~~Ll~~~d-~~~~~----~--d~~~~~~~~~~g~~v~~i~G~~V~~sn~lp~--~~g~~~~~ 257 (332) . ++++++|.+|..|.+... ..+.+ + ...+.+..+-.|. ++.|.|+-|.+..+.|- ..+..... T Consensus 236 ~~~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~-~gm~ngvii~~~~~~~Irf~~g~~~~~ 314 (404) T protein:vir:10 236 ELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGE-CAMWRNILVRKYAGMPIRFYQGSKVLV 314 (404) T ss_pred cccCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecC-eeEEcCEEEEecCCceeeecccceeee Confidence 2 788899999999975311 11222 1 1113445677775 99999999999888772 22221111 Q ss_pred ccc-cc-ccccccccccceEEEeechhhh--hhhhhcc----ceeeeeecccchhHHHHHHHHHHHhCCceec-hh---- Q lcl|Aclame:pro 258 AAV-TG-ENNDYQVDASALAGLIFHREAA--GCIQSVA----PTIQTTSGDFNVQYQGDLIVGKLAMGCGSLR-TS---- 324 (332) Q Consensus 258 ~~~-~g-~~~~y~~~~~~~~~l~~h~~a~--~~~~~~~----~~~e~~~~~~~~~~~~d~i~~~~~~G~~vlr-pe---- 324 (332) +.. .+ .+......++-..+|++-.+|+ +.++.-+ +.-|.+.. .+. -.|.....+|.+=.| |. T Consensus 315 ~~n~~~a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~-g~~----~~i~~~~i~G~kK~rF~~~~g~ 389 (404) T protein:vir:10 315 SENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDM-DNR----TEIAISWINGLKKIRFPEKSGK 389 (404) T ss_pred cCCccccccccccccccchhheeecceeEEEEeeccCCCCceeEeecccc-Cch----hhhhhHHHhhhhhccccCCCCc Confidence 111 00 0111112122223344444443 2222211 22222221 111 235556677877777 52 Q ss_pred ----heeeeecC Q lcl|Aclame:pro 325 ----VAGSFQAA 332 (332) Q Consensus 325 ----~~v~i~~A 332 (332) ++++|-+| T Consensus 390 ~~DfGvi~idta 401 (404) T protein:vir:10 390 MQDHGVIAVDTA 401 (404) T ss_pred eeeEEEEEeccc Confidence 46677777 No 163 >protein:vir:104439 Length: 404 # NCBI annotation: putative virion structural protein # Family: family:all:974 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794063;genbank:gi:116222008;genbank:GeneID:4397504 Probab=98.44 E-value=6.3e-08 Score=60.06 Aligned_cols=321 Identities=14% Similarity=0.092 Sum_probs=163.6 Q ss_pred CCCccccccccc-ccccccccccCchhhHHHHHHhHHHHHHHHHhhhhc---------cccccccc--cccceEEEeccc Q lcl|Aclame:pro 1 MTTLSNFSLPNQ-ANGGARNADYDVRYATALKLFSGEVFTAFNNASIFK---------GLVRSYDL--RGGKSKQFMFTG 68 (332) Q Consensus 1 m~~~~~~~r~~~-~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~---------~~v~~r~~--~~G~tv~i~~iG 68 (332) ||.+ +-|+. .+|..+--..-.++.-++++|.+.+...-+..+-+. +.++..++ ..|++|.|+-+. T Consensus 1 ~~~~---~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~ 77 (404) T protein:vir:10 1 MTTV---TSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMH 77 (404) T ss_pred CCCc---CCcchhhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEee Confidence 7663 22221 122111000000112357778776544333222222 22222333 348999998876 Q ss_pred ceeeeeecCCCCCCcc-CCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 69 KLSAGYHTPGTPIVGD-AGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASA 147 (332) Q Consensus 69 ~~t~~~~~~g~~~~~~-~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~ 147 (332) ..+-....-++.+.+. +.++....+|+||+..-.-..=..+++-.+.+|+|++.-..++.-+++..|+.++..++.+.. T Consensus 78 ~L~g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg 157 (404) T protein:vir:10 78 KLSKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARG 157 (404) T ss_pred ecccCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc Confidence 6654444444455554 457788889999986533111146777778999999999999999999999999988875432 Q ss_pred h--------------------ccccccccccceeccccccccCH------HHH-HHHHHHHHHHHHhcCCCc-------C Q lcl|Aclame:pro 148 E--------------------ASPVTGEPGGFHVNIGAGNTNDA------QAI-VDGFFEAAAVLDERSAPQ-------E 193 (332) Q Consensus 148 ~--------------------~~~~~~~~~~~~i~~~~~~~~~~------~~~-~d~i~~a~~~Lde~~VP~-------~ 193 (332) . .+++. .|....+-.+ +++++- +.+ ++.|-++.+.+++..-|- + T Consensus 158 ~~~n~~~~vp~~~~~~~~~~~~N~v~-APt~~r~~~~-g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~ 235 (404) T protein:vir:10 158 DFVADDTILPTAEHPEFKKIMINDVL-PPTHDRHFFG-GDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGD 235 (404) T ss_pred ccccccceeeccccccccceeecccC-CCCCCcEEec-cCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccc Confidence 1 00110 1111111000 111111 111 455667777887744442 1 Q ss_pred C-------CEEEEChHHHHHHHhhcC-chhhc----c--ccccccccccccceeeeeeceEEEeeCcccc--cccccccc Q lcl|Aclame:pro 194 G-------RVAVLSPRQYYSLISSVD-TNILN----R--EIGNSQGDMNSGKGLYSIAGIRILKSNNLAG--LYGQDLSS 257 (332) Q Consensus 194 g-------R~~vv~P~~~~~Ll~~~d-~~~~~----~--d~~~~~~~~~~g~~v~~i~G~~V~~sn~lp~--~~g~~~~~ 257 (332) . ++++++|.+|..|.+... ..+.+ + ...+.+..+-.|. ++.|.|+-|.+..+.|- ..+..... T Consensus 236 ~~~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~-~gm~ngvii~~~~~~~Irf~~g~~~~~ 314 (404) T protein:vir:10 236 ELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGE-CAMWRNILVRKYAGMPIRFYQGSKVLV 314 (404) T ss_pred cccCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecC-eeEEcCEEEEecCCceeeecccceeee Confidence 2 788899999999975311 11222 1 1113445677775 99999999999888772 22221111 Q ss_pred ccc-cc-ccccccccccceEEEeechhhh--hhhhhcc----ceeeeeecccchhHHHHHHHHHHHhCCceec-hh---- Q lcl|Aclame:pro 258 AAV-TG-ENNDYQVDASALAGLIFHREAA--GCIQSVA----PTIQTTSGDFNVQYQGDLIVGKLAMGCGSLR-TS---- 324 (332) Q Consensus 258 ~~~-~g-~~~~y~~~~~~~~~l~~h~~a~--~~~~~~~----~~~e~~~~~~~~~~~~d~i~~~~~~G~~vlr-pe---- 324 (332) +.. .+ .+......++-..+|++-.+|+ +.++.-+ +.-|.+.. .+. -.|.....+|.+=.| |. T Consensus 315 ~~n~~~a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~-g~~----~~i~~~~i~G~kK~rF~~~~g~ 389 (404) T protein:vir:10 315 SENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDM-DNR----TEIAISWINGLKKIRFPEKSGK 389 (404) T ss_pred cCCccccccccccccccchhheeecceeEEEEeeccCCCCceeEeecccc-Cch----hhhhhHHHhhhhhccccCCCCc Confidence 111 00 0111112122223344444443 2222211 22222221 111 235556677877777 52 Q ss_pred ----heeeeecC Q lcl|Aclame:pro 325 ----VAGSFQAA 332 (332) Q Consensus 325 ----~~v~i~~A 332 (332) ++++|-+| T Consensus 390 ~~DfGvi~idta 401 (404) T protein:vir:10 390 MQDHGVIAVDTA 401 (404) T ss_pred eeeEEEEEeccc Confidence 46677777 No 164 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=98.36 E-value=6.2e-08 Score=60.12 Aligned_cols=264 Identities=11% Similarity=0.088 Sum_probs=141.9 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEeccc----ceeeeeec Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTG----KLSAGYHT 76 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG----~~t~~~~~ 76 (332) |+-.+|++-+- ..+...| .| |.++|+.-+.+-++ .++.+|.-++..|.+++++..- .....+.. T Consensus 1 M~~e~nl~~~~--dL~~a~s-iD-----F~~~f~~~i~~L~~----~LGv~r~~pla~Gt~iktyK~~~~~y~gda~dVa 68 (303) T protein:vir:10 1 MSAENNLINVE--ALGKAKS-ID-----FANKLGVGLNKLFE----ALAIQNKIPMNVGSALKQYRFKVEDSEKPNGDVA 68 (303) T ss_pred CCCCcCCcchh--hccccee-eh-----hhhhhhhhHHHHHH----HhhhhccccccCCceeeeeeeeceeecccccccc Confidence 98888887542 2222222 33 99999987765533 3566666677788887766542 24456788 Q ss_pred CCCCCCccCCCC---CceEEEEEeeeeecchhhhhHHHH-Hh--chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc Q lcl|Aclame:pro 77 PGTPIVGDAGIK---ANEKTLVMDDLLVSSQFVYSLDEI-FS--QYSTRAEVSKQIGEALATHYDERIARVLAKASAEAS 150 (332) Q Consensus 77 ~g~~~~~~~~~~---~~~~~l~ID~~~~~~~~Idd~D~~-q~--~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~ 150 (332) .|..|+-+ .++ .+..++++.++.- .+ -||+ |. ..|...+..+++..++++.+|..++..+..+..+.. T Consensus 69 EGe~Ipls-kvt~~~~~t~~~~~kK~rK---~t--TdEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~lktaT~t~~ 142 (303) T protein:vir:10 69 EGDVIPLT-KVTREQVDITELQFAKYRK---ST--SAEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFETLKSAIENGK 142 (303) T ss_pred CCcccchh-hheeeecceEEEEeecccc---cc--cHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhhcccccc Confidence 89888754 454 3457888876433 23 3444 43 445899999999999999999999987754421110 Q ss_pred ccccccccceeccccccccCHHHHHHHHHHHHHH---HHhcCCCcCCCEEEEChHHHHHHHhhcCchhhc--cccccccc Q lcl|Aclame:pro 151 PVTGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAV---LDERSAPQEGRVAVLSPRQYYSLISSVDTNILN--REIGNSQG 225 (332) Q Consensus 151 ~~~~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~---Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~--~d~~~~~~ 225 (332) .+.....+.+.+-++|-....+ ++|.++ .-+++|+|.-.+.||.. ..+.. ..++. T Consensus 143 ------------~t~~t~~s~~glq~Al~~~~~kl~~~~ed~~---~~V~FvNP~Daa~yl~~--A~i~~~~t~fG~--- 202 (303) T protein:vir:10 143 ------------RTNKTKLSAENLQGALSKGRANLSVLLDDEI---TPIAFVNPNDTAEYLAN--GFINSTGAQFGV--- 202 (303) T ss_pred ------------cccceeecHHHHHHHHHhhhhhccccccccc---cEEEEEchHHHHHHhhc--CCcchhhhhhhh--- Confidence 0001111233333333222222 344433 24889999999999853 33321 12211 Q ss_pred cccccceeeeeeceEEEeeCcccccccccccccc------------cccccccccccccceEEEeechhhhhhhhhccce Q lcl|Aclame:pro 226 DMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAA------------VTGENNDYQVDASALAGLIFHREAAGCIQSVAPT 293 (332) Q Consensus 226 ~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~------------~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~ 293 (332) ..+-+++|+.|+.|+.+|.. +-+.+.. .-+...+|..|.+..+|+. |.. T Consensus 203 -----n~L~nfLG~~II~S~kv~~G--~~~~T~~~Ni~~ay~~~~g~l~~~f~~t~D~tglIGv~-h~~----------- 263 (303) T protein:vir:10 203 -----NLLTPYVGVKIVEFADVPQG--EVWMTVAENLNVAYANPRGELSRAFAFATDATGFVGVL-HDI----------- 263 (303) T ss_pred -----hhhhhhhcceEEEeccCCCc--eEEEeeccceEEEEecCchhhhhhhhhccccccceEEE-ecc----------- Confidence 13557899999999999952 1111111 0111223344444333322 111 Q ss_pred eeeeecccchhHHHHHHHHHHHhCCceechh---hee--eeecC Q lcl|Aclame:pro 294 IQTTSGDFNVQYQGDLIVGKLAMGCGSLRTS---VAG--SFQAA 332 (332) Q Consensus 294 ~e~~~~~~~~~~~~d~i~~~~~~G~~vlrpe---~~v--~i~~A 332 (332) ...+..+. -+++++-.+=|| +++ .|..+ T Consensus 264 -----~~~~~t~e------T~~~~~~~lfpE~~dgiv~~ti~~~ 296 (303) T protein:vir:10 264 -----QPQRLTSD------TIYASAISMFPENIDAVIKVTIKKD 296 (303) T ss_pred -----ccceeeeh------hHhHhHHHhcccccceEEEEEEecc Confidence 11111111 123333444444 333 33333 No 165 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=98.34 E-value=2.2e-08 Score=62.59 Aligned_cols=281 Identities=13% Similarity=0.037 Sum_probs=147.9 Q ss_pred CCCcc--cccc-----cccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEeccc-ceee Q lcl|Aclame:pro 1 MTTLS--NFSL-----PNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTG-KLSA 72 (332) Q Consensus 1 m~~~~--~~~r-----~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG-~~t~ 72 (332) +.... .++. -+..+.+.+ ++++ .|.=+.+..++.+...+.+.+++++++.+.. |+ .+||+.. ...+ T Consensus 57 ~~~~~~~~lt~~e~~~~~~~~~~~~-~~gg---~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~~-~~i~~~~~~~~a 130 (381) T protein:vir:10 57 SLPKSAQSLSANQRSFFMDINKNVN-YKEE---KLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LR-LKFLKSETSGVA 130 (381) T ss_pred HhccCcccccHHHHHHHHHHhcccC-CCCc---eecCHHHHHHHHHHHHhhccceeheeeEecC-cc-eEEEEecCCcce Confidence 11100 0100 011112222 2333 2556999999999999999999999877754 44 4677653 4444 Q ss_pred eeecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 73 GYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPV 152 (332) Q Consensus 73 ~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~ 152 (332) .-...+..+....+++..++++..-++ +.-..|..-=-..+.+|+.+.+.++.++++++..|+.++.= .-...|. T Consensus 131 ~w~~e~~~~~~~~~~~f~~i~l~~~kl-~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G----~G~~qP~ 205 (381) T protein:vir:10 131 VWGKIYGEIKGQLDAAFSEETAIQNKL-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKG----TGKDQPI 205 (381) T ss_pred eeecccccccccccccceeeeecceeE-EeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEec----cCCCCce Confidence 444444444433345566666666664 44455654222235678999999999999999999877631 1111111 Q ss_pred cc--cccc-cee---------ccccccccCHHHHHHHHHHHHHHHHhc----C-CCcCCCEEEEChHHHHHHHhhcCchh Q lcl|Aclame:pro 153 TG--EPGG-FHV---------NIGAGNTNDAQAIVDGFFEAAAVLDER----S-APQEGRVAVLSPRQYYSLISSVDTNI 215 (332) Q Consensus 153 ~~--~~~~-~~i---------~~~~~~~~~~~~~~d~i~~a~~~Lde~----~-VP~~gR~~vv~P~~~~~Ll~~~d~~~ 215 (332) +- ..++ ... ..++....+...+++.|..+...|... . .+..+-++++.|..++.|+..++ T Consensus 206 Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~--- 282 (381) T protein:vir:10 206 GLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT--- 282 (381) T ss_pred eeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccc--- Confidence 00 0000 000 001111123334456666555555322 2 23445567899998887754221 Q ss_pred hccccccccccccccceeee-eeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhcccee Q lcl|Aclame:pro 216 LNREIGNSQGDMNSGKGLYS-IAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTI 294 (332) Q Consensus 216 ~~~d~~~~~~~~~~g~~v~~-i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~ 294 (332) +..++ |..+.. -.|.+|+.|+.+|... -..++|+.. +++-+. ++++ T Consensus 283 ----~~~~~-----G~~v~~l~~g~~vv~s~~~p~~~--------------iifgDfs~Y--~i~~r~--------~~~i 329 (381) T protein:vir:10 283 ----HLNAN-----GVYVTALPFNLNVIESTVQEAGK--------------VLTYVKGLY--DGYLAG--------GINV 329 (381) T ss_pred ----cCCCC-----CceeecCCCCceEEecCCCCcCc--------------EEEEecccE--EEEEec--------ccEE Confidence 11222 222211 1377799999998421 122444442 223332 3344 Q ss_pred eeeecccchhHHH---HHHHHHHHhCCceechhheeeee--cC Q lcl|Aclame:pro 295 QTTSGDFNVQYQG---DLIVGKLAMGCGSLRTSVAGSFQ--AA 332 (332) Q Consensus 295 e~~~~~~~~~~~~---d~i~~~~~~G~~vlrpe~~v~i~--~A 332 (332) +...+ .+|- ..+++.+++++++++|++.+++. .+ T Consensus 330 ~~~~~----~~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~ 368 (381) T protein:vir:10 330 QKFKE----TLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) T ss_pred Eeech----hHhhcCCeEEEEEEEEcCEEecCceEEEEEEEec Confidence 43321 2222 25778889999999999987743 33 No 166 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=98.34 E-value=2.2e-08 Score=62.59 Aligned_cols=281 Identities=13% Similarity=0.037 Sum_probs=147.9 Q ss_pred CCCcc--cccc-----cccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEeccc-ceee Q lcl|Aclame:pro 1 MTTLS--NFSL-----PNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTG-KLSA 72 (332) Q Consensus 1 m~~~~--~~~r-----~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG-~~t~ 72 (332) +.... .++. -+..+.+.+ ++++ .|.=+.+..++.+...+.+.+++++++.+.. |+ .+||+.. ...+ T Consensus 57 ~~~~~~~~lt~~e~~~~~~~~~~~~-~~gg---~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~~-~~i~~~~~~~~a 130 (381) T protein:vir:95 57 SLPKSAQSLSANQRSFFMDINKNVN-YKEE---KLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LR-LKFLKSETSGVA 130 (381) T ss_pred HhccCcccccHHHHHHHHHHhcccC-CCCc---eecCHHHHHHHHHHHHhhccceeheeeEecC-cc-eEEEEecCCcce Confidence 11100 0100 011112222 2333 2556999999999999999999999877754 44 4677653 4444 Q ss_pred eeecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 73 GYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPV 152 (332) Q Consensus 73 ~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~ 152 (332) .-...+..+....+++..++++..-++ +.-..|..-=-..+.+|+.+.+.++.++++++..|+.++.= .-...|. T Consensus 131 ~w~~e~~~~~~~~~~~f~~i~l~~~kl-~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G----~G~~qP~ 205 (381) T protein:vir:95 131 VWGKIYGEIKGQLDAAFSEETAIQNKL-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKG----TGKDQPI 205 (381) T ss_pred eeecccccccccccccceeeeecceeE-EeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEec----cCCCCce Confidence 444444444433345566666666664 44455654222235678999999999999999999877631 1111111 Q ss_pred cc--cccc-cee---------ccccccccCHHHHHHHHHHHHHHHHhc----C-CCcCCCEEEEChHHHHHHHhhcCchh Q lcl|Aclame:pro 153 TG--EPGG-FHV---------NIGAGNTNDAQAIVDGFFEAAAVLDER----S-APQEGRVAVLSPRQYYSLISSVDTNI 215 (332) Q Consensus 153 ~~--~~~~-~~i---------~~~~~~~~~~~~~~d~i~~a~~~Lde~----~-VP~~gR~~vv~P~~~~~Ll~~~d~~~ 215 (332) +- ..++ ... ..++....+...+++.|..+...|... . .+..+-++++.|..++.|+..++ T Consensus 206 Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~--- 282 (381) T protein:vir:95 206 GLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT--- 282 (381) T ss_pred eeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccc--- Confidence 00 0000 000 001111123334456666555555322 2 23445567899998887754221 Q ss_pred hccccccccccccccceeee-eeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhcccee Q lcl|Aclame:pro 216 LNREIGNSQGDMNSGKGLYS-IAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTI 294 (332) Q Consensus 216 ~~~d~~~~~~~~~~g~~v~~-i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~ 294 (332) +..++ |..+.. -.|.+|+.|+.+|... -..++|+.. +++-+. ++++ T Consensus 283 ----~~~~~-----G~~v~~l~~g~~vv~s~~~p~~~--------------iifgDfs~Y--~i~~r~--------~~~i 329 (381) T protein:vir:95 283 ----HLNAN-----GVYVTALPFNLNVIESTVQEAGK--------------VLTYVKGLY--DGYLAG--------GINV 329 (381) T ss_pred ----cCCCC-----CceeecCCCCceEEecCCCCcCc--------------EEEEecccE--EEEEec--------ccEE Confidence 11222 222211 1377799999998421 122444442 223332 3344 Q ss_pred eeeecccchhHHH---HHHHHHHHhCCceechhheeeee--cC Q lcl|Aclame:pro 295 QTTSGDFNVQYQG---DLIVGKLAMGCGSLRTSVAGSFQ--AA 332 (332) Q Consensus 295 e~~~~~~~~~~~~---d~i~~~~~~G~~vlrpe~~v~i~--~A 332 (332) +...+ .+|- ..+++.+++++++++|++.+++. .+ T Consensus 330 ~~~~~----~~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~ 368 (381) T protein:vir:95 330 QKFKE----TLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) T ss_pred Eeech----hHhhcCCeEEEEEEEEcCEEecCceEEEEEEEec Confidence 43321 2222 25778889999999999987743 33 No 167 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=98.27 E-value=7.8e-08 Score=59.55 Aligned_cols=292 Identities=10% Similarity=-0.013 Sum_probs=139.9 Q ss_pred CCCcc---cccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecccceeeeee-- Q lcl|Aclame:pro 1 MTTLS---NFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLSAGYH-- 75 (332) Q Consensus 1 m~~~~---~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG~~t~~~~-- 75 (332) |+.-. .+.+..+.+. -..++.+.- .+.-..+..++.+..++.|.++++.++.+..+. +.+|+.+|-...... T Consensus 1 ~~~k~~~~~l~~~~~~~~-~~~~~~~~g-~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~-~~~i~~~~~~~~~~~~~ 77 (321) T protein:vir:31 1 MASRTINNDLSRITEKNA-LTVDDLDAG-GTLPDPLWDEFWTDMIEETPLLDAIRTETVGAK-KTRIPTLNIGERHRRPQ 77 (321) T ss_pred CchHHHHHHHHHHHHhcc-ccccccCCc-ceeCHHHHHHHHHHHHHhhhhhhhceeeeccCc-ceeeeeeccCCcccccc Confidence 55421 3444433211 111122211 134477888888888888999999887766443 356676653222122 Q ss_pred cCCCCCCccCCCCCceEEEEEeeeeecchhhhh--HHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc-- Q lcl|Aclame:pro 76 TPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYS--LDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASP-- 151 (332) Q Consensus 76 ~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd--~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~-- 151 (332) ..|+......+++.+++++..-+.. +...|.+ +|..-...|+.+.+....++++++..+..++.= .++ ...+ T Consensus 78 ~e~~~~~~~~~~~~~~~~~~~~k~~-~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nG--d~~-~~~~~~ 153 (321) T protein:vir:31 78 DEGEWNENESDVSTGTIDISTEKAT-VAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAANG--DED-AEDSFE 153 (321) T ss_pred cccccccccccceeeeeeeeeEEEE-eehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheeec--ccc-CCCccc Confidence 2233332223455666777776653 3334532 333222468999999999999999988866521 111 0110 Q ss_pred ---cccc--cccce-eccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccc Q lcl|Aclame:pro 152 ---VTGE--PGGFH-VNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQG 225 (332) Q Consensus 152 ---~~~~--~~~~~-i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~ 225 (332) .+.. ..... ....++...+ ++.|.++...|+++.--..+-+++++++.+..++..+.++ .....+. T Consensus 154 ~~n~G~l~~a~~~~~~~~~~~~~~~----~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~l~~~----~~~~~~~ 225 (321) T protein:vir:31 154 NQNDGFITVAEGDVETIDAADDILD----NDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYHYTLTDR----DTPLGDN 225 (321) T ss_pred ccchhhhhhhccccccccccccccC----HHHHHHHHHhccHhHhcCCCeEEEechHHHHHHHHHHhcC----CCccccc Confidence 0000 00000 0001111122 4666677777766553222345678999876665322121 1111222 Q ss_pred cccccceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhH Q lcl|Aclame:pro 226 DMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQY 305 (332) Q Consensus 226 ~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~ 305 (332) .+..|. ..+++|++|+.++.+|... -.-+++.+.+ +.+ ..+..++..++...... T Consensus 226 ~l~~~~-~~tl~G~pvv~~~~mP~~~--------------il~t~~~nl~-~~~---------~~~~~~~~~~~~~~~~~ 280 (321) T protein:vir:31 226 VIMGEA-DVNPFSFPIIGSGLWPDDK--------------AMFTDPQNLI-YAL---------YRDLEIDVLTESDKVSE 280 (321) T ss_pred hhhccc-cccccceeEEEcCCCCCCc--------------EEEeccccEE-EEE---------eeccEEEEeecCccccc Confidence 333343 5579999999999999521 1123334432 112 22333444333221110 Q ss_pred HHHHHHHHH--HhCCceechhheeeeecC Q lcl|Aclame:pro 306 QGDLIVGKL--AMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 306 ~~d~i~~~~--~~G~~vlrpe~~v~i~~A 332 (332) ..+.+...+ -++..|-++++++.+.-= T Consensus 281 ~~~~~~~~~~~~~~~~ve~~~a~a~~~~i 309 (321) T protein:vir:31 281 RDLHARYFMRGDDDFAIENTEAVVLAEGL 309 (321) T ss_pred cceeeEeeeeeecceeEeccccEEEEecC Confidence 111122112 245555666654443321 No 168 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=98.26 E-value=4.8e-08 Score=60.73 Aligned_cols=284 Identities=15% Similarity=0.052 Sum_probs=148.4 Q ss_pred CCCcc--ccc---c--cccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc-cceee Q lcl|Aclame:pro 1 MTTLS--NFS---L--PNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-GKLSA 72 (332) Q Consensus 1 m~~~~--~~~---r--~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i-G~~t~ 72 (332) ..+.. .++ | -+....++..++++ .|.=+.+..++.+...+.+.+++++++.+.. | .++||+. +..++ T Consensus 59 ~~~~~~~~lt~ee~~~~~~~~~~~~~~~gg---~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~-~~~i~~~~~~~~a 133 (377) T protein:vir:96 59 DLRDKNRELTAEEIKFFNDIDKNVGGKDKF---KLLPEETMVQVFDDLVAEHPLLKVINFKNTS-L-RLKALTAETSGTA 133 (377) T ss_pred HhccCCcccCHHHHHHHHHHHhcCCCCCCc---eecCHHHHHHHHHHHHhhhhhhhhceeEecC-C-ceEEEEecCCcce Confidence 11100 000 0 00011233334443 1445889999999999999999999877764 3 3567754 34455 Q ss_pred eeecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 73 GYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPV 152 (332) Q Consensus 73 ~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~ 152 (332) .-...+..+....+++..+++|..-++ +.-..|..-=-..+.+|+-+.+.++.++++++..|+.++.= .+ ...|. T Consensus 134 ~wv~e~~~~~~~~~~~f~~i~l~~~kl-~~~~~is~~ll~ds~~~le~~i~~~l~~~~~~~~~~a~i~G--~G--~~~P~ 208 (377) T protein:vir:96 134 VWGDIFGEIKGQLKQAFKEQDFSQFKL-TAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKG--NG--LLQPV 208 (377) T ss_pred eEeecccccccccCccceeEeeeeeeE-EeechhhHHHhhcchhhHHHHHHHHHHHHHHHHHhhceEec--cC--CCcce Confidence 444445454433345666666666554 33345654223346778999999999999999999988631 00 00000 Q ss_pred --------------cccccccee----ccccccccCHHHHHHHHHHHHHHHHhcCC--C---cCCCEEEEChHHHHHHHh Q lcl|Aclame:pro 153 --------------TGEPGGFHV----NIGAGNTNDAQAIVDGFFEAAAVLDERSA--P---QEGRVAVLSPRQYYSLIS 209 (332) Q Consensus 153 --------------~~~~~~~~i----~~~~~~~~~~~~~~d~i~~a~~~Lde~~V--P---~~gR~~vv~P~~~~~Ll~ 209 (332) .+....... ..+.....++..+++.+..+...+....- | ...-+.++.|..|+.++ T Consensus 209 Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~~~- 287 (377) T protein:vir:96 209 GLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLE- 287 (377) T ss_pred eeeeccccccccccccccccceeeccccccccccCChhHHHHHHHHHHHhhccccccccccccCceEEEEchhhHHhcc- Confidence 000000000 00111112345555555555555543321 1 11234678898877653 Q ss_pred hcCchhhccccccccccccccceeeeee--ceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhh Q lcl|Aclame:pro 210 SVDTNILNREIGNSQGDMNSGKGLYSIA--GIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCI 287 (332) Q Consensus 210 ~~d~~~~~~d~~~~~~~~~~g~~v~~i~--G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~ 287 (332) .+.+ +..++ |.. ..++ |..|+.|+.+|... ...++|+.. +++- T Consensus 288 ---~~~~---~~~~~-----G~~-~~~l~~p~~v~~s~~~p~~~--------------i~fgdf~~Y--~i~~------- 332 (377) T protein:vir:96 288 ---AKFT---SRNQF-----GEY-VTVLPHGITILESLAVETGK--------------AIAFVANRY--DAFM------- 332 (377) T ss_pred ---cccc---ccCCC-----CCc-eeccCCCceEEecCCCCccc--------------EEEEEcCcE--EEEE------- Confidence 2222 11222 321 2344 55688899988421 112334331 2222 Q ss_pred hhccceeeeeecccchhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 288 QSVAPTIQTTSGDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 288 ~~~~~~~e~~~~~~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) ..+++++...+ ....+--..+++.+++++++++|++.++|.=+ T Consensus 333 -r~~~~i~~~~~-~~~~~d~~~f~~~~r~dG~~~d~~a~~vl~l~ 375 (377) T protein:vir:96 333 -ATASTIEEYDQ-TFAMEDLQLYLTKNYFYGKAKDNHTAALLTLA 375 (377) T ss_pred -ecccEEEeehh-hhhhcCCeEEEEEEEEcCEEecCCcEEEEEEe Confidence 23334443321 11112223477888999999999999999988 No 169 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=98.25 E-value=5.5e-08 Score=60.37 Aligned_cols=284 Identities=13% Similarity=0.055 Sum_probs=141.2 Q ss_pred CCCc--cccccc-----ccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecccc-eee Q lcl|Aclame:pro 1 MTTL--SNFSLP-----NQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGK-LSA 72 (332) Q Consensus 1 m~~~--~~~~r~-----~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG~-~t~ 72 (332) +... ..++.- +..+.+++ ++++ .|.=+.|..++.+...+.|.++++.++.+. +|. .+||+... .+. T Consensus 57 ~~~~~~~~l~~~e~~~~~~~~~~t~-~~Gg---~lvP~~~~~~I~~~l~~~spir~~a~v~~~-~~~-~~i~~~~~~~~a 130 (381) T protein:vir:10 57 SLPKSAQTLSANQRNFFMDINKSVG-YKEE---KLLPEETIDRIFEDLTTNHPLLADLGIKNA-GLR-LKFLKSETSGVA 130 (381) T ss_pred HhcccccccCHHHHHHHHHHhhcCC-CCCc---eecCHHHHHHHHHHHHhhcceeeeeeeEec-Ccc-eEEEeecCCcce Confidence 0000 001100 01112222 2222 255699999999999999999999987776 343 45665543 333 Q ss_pred eeecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 73 GYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPV 152 (332) Q Consensus 73 ~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~ 152 (332) .-...+..+....+++.+++++..-++ +.-..|..-=-..+.+|+-+.+..+.++++++..|+.++.= .+ +.-|. T Consensus 131 ~W~~e~~~~~~~~~~~f~~i~l~~~kl-~a~i~is~elL~Ds~~~le~~i~~~la~~~a~~~~~afi~G--dG--~~qP~ 205 (381) T protein:vir:10 131 VWGKIYGEIKGQLDAAFSEETAIQNKL-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKG--TG--KDQPI 205 (381) T ss_pred EEeecccccccccCccceeEeecceeE-EeeccccHHHHhccHHHHHHHHHHHHHHHHHHHhhceeEec--cc--CCCce Confidence 222222233322234555666655554 34444543222235678899999999999999999877621 11 11111 Q ss_pred cc---ccccceeccc-------cc--cccCHHHHHHHHHHHHHHHH----hcC-CCcCCCEEEEChHHHHHHHhhcCchh Q lcl|Aclame:pro 153 TG---EPGGFHVNIG-------AG--NTNDAQAIVDGFFEAAAVLD----ERS-APQEGRVAVLSPRQYYSLISSVDTNI 215 (332) Q Consensus 153 ~~---~~~~~~i~~~-------~~--~~~~~~~~~d~i~~a~~~Ld----e~~-VP~~gR~~vv~P~~~~~Ll~~~d~~~ 215 (332) +- .+++..+..+ .+ ...+...+++.+......+. .+. .+..+.++++.|..|+.|+..+. T Consensus 206 Gil~~~~~~~~~~~g~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~~~~--- 282 (381) T protein:vir:10 206 GLNRQVQKGVSVTDGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT--- 282 (381) T ss_pred eeeecCCccccccccccccccccccccccchhhHHHHHHHHHHhhhhhhccccccccCceEEEEchhhHHhhccccc--- Confidence 10 0000000000 00 01122233333333222221 112 23456678899998888753211 Q ss_pred hccccccccccccccceeeee-eceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhcccee Q lcl|Aclame:pro 216 LNREIGNSQGDMNSGKGLYSI-AGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTI 294 (332) Q Consensus 216 ~~~d~~~~~~~~~~g~~v~~i-~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~ 294 (332) +..++| ..+..+ .|.+|+.|+.+|... -.-++|+.. +++-+ .++++ T Consensus 283 ----~~~~~G-----~~v~~lp~g~~vv~~~~~p~~~--------------i~fGDfs~Y--~i~~r--------~~~~i 329 (381) T protein:vir:10 283 ----HLNANG-----VYVTALPFNLNVIESTVQEAGK--------------VLTYVKGLY--DGYLA--------GGINV 329 (381) T ss_pred ----cCCCCC-----ceeecCCCCceeEEcCCCCcCc--------------EEEEEcccE--EEEEe--------cccEE Confidence 122222 222222 488899999998421 123455542 22322 23344 Q ss_pred eeeecccchhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 295 QTTSGDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 295 e~~~~~~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) +...+ ....+--..+++.+++++++++|++.+++.=. T Consensus 330 ~~~~~-~~~~~d~~~f~a~~r~dG~~~~~~A~~v~~l~ 366 (381) T protein:vir:10 330 QKFKE-TLALDDMDLYTAKQFAYGKAKDNKVAAVWKLD 366 (381) T ss_pred Eeech-hhhhcCceEEEEEEEEcCEEecCCcEEEEEEe Confidence 44322 11111113577888999999999998874433 No 170 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=98.07 E-value=4.2e-07 Score=55.52 Aligned_cols=262 Identities=13% Similarity=0.129 Sum_probs=141.3 Q ss_pred CCCcccccccc--c---ccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc--cceeee Q lcl|Aclame:pro 1 MTTLSNFSLPN--Q---ANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT--GKLSAG 73 (332) Q Consensus 1 m~~~~~~~r~~--~---~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i--G~~t~~ 73 (332) |-. .-+-|- . ...+-.. ..| |+++|+.-+.+-++ .++.+|..++..|++++++.- -..... T Consensus 1 ~~~--~~~~~e~nlt~~~dl~~~~-siD-----f~~~f~~~i~~L~~----~LGv~r~~pla~GstIkt~k~~~y~gda~ 68 (296) T protein:vir:98 1 MVT--SRTYPEENLIKSTDLKYPI-TID-----VTNKFQENISKLLE----MLGVTRKISVSEGMTLKTYAGYDVTLAEG 68 (296) T ss_pred CCC--ccccCcCCCcchhhhhhhh-hhh-----hHHHHhhhHHHHHH----HhhhcccccccCCCEEeeccceeeeeccc Confidence 432 111121 0 1111111 122 89999987655433 457777778899999987643 234567 Q ss_pred eecCCCCCCccCCCCC---ceEEEEEeeeeecchhhhhHHHH-H-h-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 74 YHTPGTPIVGDAGIKA---NEKTLVMDDLLVSSQFVYSLDEI-F-S-QYSTRAEVSKQIGEALATHYDERIARVLAKASA 147 (332) Q Consensus 74 ~~~~g~~~~~~~~~~~---~~~~l~ID~~~~~~~~Idd~D~~-q-~-~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~ 147 (332) +...|..|+.+ .++. +..+++|.++.-. +. ||+ | + ..|...+..+++..++++.+|..++..+..+.. T Consensus 69 dVaEGe~Ipls-kvt~~~~~t~t~~ikK~rK~---tT--dEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~LktaT~ 142 (296) T protein:vir:98 69 NVPEGEVIPLS-KVERKIHSEKKIELKKYRKA---TT--GEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTG 142 (296) T ss_pred cccCCcccchh-hheeeecceEEEEeeccccc---cC--HHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHHhcccc Confidence 88889988765 4543 3577888765433 43 555 5 3 446899999999999999999999987743321 Q ss_pred hccccccccccceeccccccccC-HHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhcccccccccc Q lcl|Aclame:pro 148 EASPVTGEPGGFHVNIGAGNTND-AQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGD 226 (332) Q Consensus 148 ~~~~~~~~~~~~~i~~~~~~~~~-~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~ 226 (332) + +. + .... -.+++..+.++..+|.+.+ ....+++|+|...+.+|. +.++. .+. T Consensus 143 t------------~~--~-t~~~lQ~Ala~~~~~l~~~feded--~~~~V~FVnP~D~a~ylg--~a~it-------~qt 196 (296) T protein:vir:98 143 T------------QD--A-LGAGLQGALASAWGKLQVLFEDYG--SERAIVFANSLDVAEYIA--KAGIT-------TQT 196 (296) T ss_pred e------------ee--e-chhhHHHHHHHHhhhhhhhccccC--CCceEEEEehHHHHHHhc--CCccc-------hhh Confidence 1 00 0 0000 1223445566667776653 235899999999999985 34432 122 Q ss_pred ccccceeeeeeceEEEeeCcccccccccccccccccccccc--------------cccccceEEEeechhhhhhhhhccc Q lcl|Aclame:pro 227 MNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDY--------------QVDASALAGLIFHREAAGCIQSVAP 292 (332) Q Consensus 227 ~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y--------------~~~~~~~~~l~~h~~a~~~~~~~~~ 292 (332) ...+..+-+++|..|+.|+.+|. |+-+.+. ..+....| ..+.+..+|+. |.. T Consensus 197 ~fG~tyl~nfLG~~II~S~kV~~--G~~~~T~-~~Ni~~ay~~~~~~~l~~~f~~~~d~tglIGv~-h~~---------- 262 (296) T protein:vir:98 197 AFGLTYLVDFTGTVIISTNDVTK--GEIWATV-PENIIFAYINPNNSELAKEFNLYGDPTGYIGMN-HFQ---------- 262 (296) T ss_pred eechhhhhhccccEEEEcCcCCC--ceEEEee-ecceEEEeecccccchhhhhccccccccceEEE-ecc---------- Confidence 22233344689999999999994 2222211 11111111 22223333221 111 Q ss_pred eeeeeecccchhHHHHHHHHHHHhCCceechh---hee--eeecC Q lcl|Aclame:pro 293 TIQTTSGDFNVQYQGDLIVGKLAMGCGSLRTS---VAG--SFQAA 332 (332) Q Consensus 293 ~~e~~~~~~~~~~~~d~i~~~~~~G~~vlrpe---~~v--~i~~A 332 (332) ...+..+. -+++++-.+=|| +++ .|..| T Consensus 263 ------~~~~~t~e------T~~~~~~~lfpE~~dgiv~~tI~~~ 295 (296) T protein:vir:98 263 ------ENTTLTIQ------TLLVSGMLMYPERIDGIVKVTLTPG 295 (296) T ss_pred ------ccceeeeh------hHhHhHHHhcccccceEEEEEecCC Confidence 11111111 123333344444 332 44444 No 171 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=98.06 E-value=4.8e-07 Score=55.23 Aligned_cols=294 Identities=11% Similarity=0.014 Sum_probs=142.0 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEeccccee--eeeecC- Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLS--AGYHTP- 77 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG~~t--~~~~~~- 77 (332) |-.+..+.+.-..--..+. ++. -|-=++|+ +..+..++.|.++++.++.+-.+..+..|+.+|... ..-... T Consensus 1 ~~~~~~~~~~~k~it~~d~-~gG---~L~P~~~~-~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g~~~~~~~~~~ 75 (314) T protein:vir:41 1 MDFLNKPFQITPKIDVPDL-GKG---ILAVQRFG-EFVREVRENSAIIKDARVLNALKSYEVDISRISLGVELEPGRNTS 75 (314) T ss_pred CchhhhHHHhhcccccccC-CCc---eeChHHHH-HHHHHHHhccchhhheeeecccCccceeecccccCcccccccccc Confidence 5444333332211000111 111 14447775 677888999999999986433334567888887421 122221 Q ss_pred --CCCCCccCCCCCceEEEEEeeeeecchhhhh-HHHHHh-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc- Q lcl|Aclame:pro 78 --GTPIVGDAGIKANEKTLVMDDLLVSSQFVYS-LDEIFS-QYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPV- 152 (332) Q Consensus 78 --g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd-~D~~q~-~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~- 152 (332) ++.... .+++.++++|..-++.. .+.|.+ +=+... ..|+.+.++.+.++++++.....++.= ..+..+..+. T Consensus 76 ~~~~~~~~-~~~tf~~~~l~~~kl~~-~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nG-dg~~~s~~~~~ 152 (314) T protein:vir:41 76 GTKVAPTA-DEVTVSTNTLEMKELVT-KVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHA-DSSLTTGRELY 152 (314) T ss_pred cCCccCCc-ccccccceeeeeEEEEE-eecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhcc-ccCCcCcccch Confidence 222222 24667778888777644 455633 222222 248999999999999999888766531 1111111110 Q ss_pred ---ccc---cccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCC-CEEEEChHHHHHHHhhcCchhhccccccccc Q lcl|Aclame:pro 153 ---TGE---PGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEG-RVAVLSPRQYYSLISSVDTNILNREIGNSQG 225 (332) Q Consensus 153 ---~~~---~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~g-R~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~ 225 (332) .|. .++..+..+... .....+.|.++...|....--..+ -+++++++.+..+.+.++.+ .....+. T Consensus 153 ~~p~G~l~~a~~~~~~~~~~~---~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~l~~~----~~~l~~~ 225 (314) T protein:vir:41 153 RINDGWMKLAGNQYTDAEPED---ENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQLLVR----ETGLGDS 225 (314) T ss_pred hcchhhhhhcccceeecCccc---cccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHHHhcc----CCcccch Confidence 110 111111111111 122345555666666544321111 23457999887776433332 1222333 Q ss_pred cccccceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhH Q lcl|Aclame:pro 226 DMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQY 305 (332) Q Consensus 226 ~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~ 305 (332) .+..|+ -..++|++|+.++.+|..+. +...-+-+++.+.+ .+-..++.++..+.-.. T Consensus 226 ~~~~~~-~~~l~G~PV~~~~~~~~~~~---------~~~~i~fgd~~nlv----------~~~~~~ir~~~~~~a~~--- 282 (314) T protein:vir:41 226 ALIGAT-GLQYDGIPIQYVPALDALGD---------DKARALLTVPTNLV----------YGFWRNIRIEPKRDAAM--- 282 (314) T ss_pred hhhCCC-CceecceeeEecccccccCC---------CCceEEEechhheE----------EEeeceeEEeecccCcC--- Confidence 444444 56789999999999985321 11222233344332 22333444443332111 Q ss_pred HHHHHHHHHHhCCceechhhee--eeecC Q lcl|Aclame:pro 306 QGDLIVGKLAMGCGSLRTSVAG--SFQAA 332 (332) Q Consensus 306 ~~d~i~~~~~~G~~vlrpe~~v--~i~~A 332 (332) -...+...+++++.+..+++++ .+..| T Consensus 283 ~~~~~~~~~r~d~~~~~~~aa~~~~~~~~ 311 (314) T protein:vir:41 283 RRTEYIASLRADCNYEDENAAVAAVIDMS 311 (314) T ss_pred CeEEEEEEEEeceEEEEcCcEEEEEeecc Confidence 1112333445566666555443 22233 No 172 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=97.98 E-value=1.1e-07 Score=58.80 Aligned_cols=283 Identities=14% Similarity=0.076 Sum_probs=137.1 Q ss_pred CCC--ccccc----cc-ccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEeccccee-e Q lcl|Aclame:pro 1 MTT--LSNFS----LP-NQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLS-A 72 (332) Q Consensus 1 m~~--~~~~~----r~-~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG~~t-~ 72 (332) +.. ...++ |- +... .+..++++ .|.=+.|..++.+...+.|.+++++++.+. +|+ .+||+..... + T Consensus 64 ~~~~g~~~lt~~e~~~~~~~~-~~~~~~gg---~lvP~~~~~~I~~~l~~~s~l~~~~~v~~~-~~~-~~i~~~~~~~~a 137 (383) T protein:vir:78 64 SASRTDKNITNEEIKFFNDIN-KEVGYKEE---TLLPQTVVDEIFEDLTTEHPFLASIGMRTT-GLR-TKFLKSETSGVA 137 (383) T ss_pred HhcCChhhhhHHHHHHHHHHh-ccCCCCCc---cccCHHHHHHHHHHHHhhccceeeeeeEec-CCc-eEEEEEcCCcce Confidence 000 00010 00 0001 12223333 255699999999999999999999987765 454 5788765443 3 Q ss_pred eeecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 73 GYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPV 152 (332) Q Consensus 73 ~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~ 152 (332) .-...+..+....+++..+++|..-++ +.-+.|..-=-..+.+|+.+.+.++.++++++..|+.++.= .+ ..-|. T Consensus 138 ~w~~e~~~~~~~~~~~f~~i~l~~~kl-~~~i~is~ell~Ds~~~ie~~i~~~l~~~~a~~~~~a~i~G--~G--~~qP~ 212 (383) T protein:vir:78 138 VWGKIFGEIKGQLDATFSDEESIQNKL-TAFVVVPKDLEKFGPAWVKRFVVTQIEEAFAVALESAYIVG--DG--NDKPI 212 (383) T ss_pred EEeecccccccccCcceeeEeecceee-EeeccchHHHhhccHHHHHHHHHHHHHHHHHHHHhhheEec--cC--CCCce Confidence 333333344333345667777777654 45456654222235678999999999999999999987631 11 11111 Q ss_pred ccc---c-ccceeccc------ccc--ccCHHHHHHHHHHHH---HHHHhcCC-CcCC-CEEEEChHHHHHHHhhcCchh Q lcl|Aclame:pro 153 TGE---P-GGFHVNIG------AGN--TNDAQAIVDGFFEAA---AVLDERSA-PQEG-RVAVLSPRQYYSLISSVDTNI 215 (332) Q Consensus 153 ~~~---~-~~~~i~~~------~~~--~~~~~~~~d~i~~a~---~~Lde~~V-P~~g-R~~vv~P~~~~~Ll~~~d~~~ 215 (332) +-. + ........ ++. ..+...+++.+..+. ..+....- ...+ ...++.|..|+.++- .. T Consensus 213 Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~----~~ 288 (383) T protein:vir:78 213 GLNRKVGKGSTVVDGVYAEKAATGTLTFANPKTTVNELTDVYKYHSVKENGHPLNVAGKVTLLVNPTDAWDVKK----QY 288 (383) T ss_pred eeeeccCCcccccccccccccccchhhhhhhHHHHHHHHHHHhccchhcccchhhhcCceEEEEcCcchhhhcc----ch Confidence 000 0 00000000 000 011112222221111 11111111 1112 234678866655432 11 Q ss_pred hccccccccccccccceeeeee--ceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccce Q lcl|Aclame:pro 216 LNREIGNSQGDMNSGKGLYSIA--GIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPT 293 (332) Q Consensus 216 ~~~d~~~~~~~~~~g~~v~~i~--G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~ 293 (332) . ..+. +|. -..++ |..|++|+.+|... ...++|+.. +++ ...+++ T Consensus 289 ~---~~~~-----~G~-~~t~l~~~~~iv~s~~~p~~~--------------iifgdfs~Y--~i~--------~r~~~~ 335 (383) T protein:vir:78 289 T---SLNA-----NGV-YVTALPFNLNIIESLFVPEKK--------------AISYVAERY--DAL--------IGGPLD 335 (383) T ss_pred h---ccCC-----CCc-eeeecCCCceEEecCCCCccc--------------EEEeeccce--EEE--------ecccce Confidence 1 1111 232 22344 55688899998421 112333332 122 233445 Q ss_pred eeeeecccchhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 294 IQTTSGDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 294 ~e~~~~~~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) ++...+ ....+--..+++.+++++++++|++.++|.=+ T Consensus 336 i~~~~~-~~f~~d~~~f~~~~r~dG~~~~~~A~~vl~~~ 373 (383) T protein:vir:78 336 IGTYDQ-TLAIEDLNLYAAKQFAYGKAKDDKAAAVWTLN 373 (383) T ss_pred EEecch-hhhhcCceEEEEEEEEcCEEecCCeEEEEEEE Confidence 544321 11111124578888999999999997775544 No 173 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=97.78 E-value=6e-07 Score=54.68 Aligned_cols=279 Identities=16% Similarity=0.075 Sum_probs=137.5 Q ss_pred CCCccc--cc---c--cccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc-cceee Q lcl|Aclame:pro 1 MTTLSN--FS---L--PNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-GKLSA 72 (332) Q Consensus 1 m~~~~~--~~---r--~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i-G~~t~ 72 (332) +....+ ++ | -+.....+..++++ .+.=+.+..++.+...+.+.++.++++.+.. |+ ++||+- +..++ T Consensus 59 ~~~~~~~~lt~ee~~~~~~~~~~~~~~~gg---~~vP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~~-~~~~~~~~~~~a 133 (377) T protein:vir:98 59 DLRDKNRELTAEEIKFFNDIDKNVGGKDKF---KLLPEETMVQVFDDLVAEHPLLKVINFKNTS-LR-LKALTAETSGTA 133 (377) T ss_pred HhccCCcccCHHHHHHHHHHHhccCCCCCc---cccCHHHHHHHHHHHHHhhhhhhheeeEecC-cc-eEEEEecCCcce Confidence 111000 00 0 00011123333333 1455889999999999999999999877764 44 467753 45555 Q ss_pred eeecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 73 GYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPV 152 (332) Q Consensus 73 ~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~ 152 (332) .-...+..+....+++..+++|..-++ +.-..|..-=-..+.+|+-+.+.++.++++++..|+.++.= .+ T Consensus 134 ~w~~e~~~~~~~~~~~f~~i~l~~~kl-~a~~~is~elL~ds~~~ie~~i~~~la~~~a~~~~~a~i~G--~G------- 203 (377) T protein:vir:98 134 VWGDIFGEIKGQLKQAFKEQDFSQFKL-TAFVVIPKDALKFGPKWIKQFITEQLKEAIAVALELAIVKG--DG------- 203 (377) T ss_pred eEeecccccCcccCccceeEeecceeE-EeeecccHHhhhccHhHHHHHHHHHHHHHHHHHHhhceEec--cC------- Confidence 555544444432334455555555553 33344543222235778999999999999999999887631 11 Q ss_pred ccccccce-------ec-----cccccccCHHHHHHH--------------HHHHHHHHHhcC-CCcCCCE-EEEChHHH Q lcl|Aclame:pro 153 TGEPGGFH-------VN-----IGAGNTNDAQAIVDG--------------FFEAAAVLDERS-APQEGRV-AVLSPRQY 204 (332) Q Consensus 153 ~~~~~~~~-------i~-----~~~~~~~~~~~~~d~--------------i~~a~~~Lde~~-VP~~gR~-~vv~P~~~ 204 (332) .+.|.|.. +. .+.+...+.+.+.+. ++.-.+...... --..||+ .++.|..| T Consensus 204 ~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~a~~~m~~~t~~~~~klkd~~G~~i~~~n~~~~ 283 (377) T protein:vir:98 204 LLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLTPDNAPKKLVPVMKHLSVNDKKRPLKIAGQVKLILNPEDR 283 (377) T ss_pred CCcceeeeecccccccccccccccccccchhhhHhhhhhhchhHHHHHHHHHHHHHHHHHHhhhhccCCceEEEecccch Confidence 11111110 00 001111111122111 111011000111 1235664 44677766 Q ss_pred HHHHhhcCchhhccccccccccccccceeeeeece--EEEeeCcccccccccccccccccccccccccccceEEEeechh Q lcl|Aclame:pro 205 YSLISSVDTNILNREIGNSQGDMNSGKGLYSIAGI--RILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHRE 282 (332) Q Consensus 205 ~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i~G~--~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~ 282 (332) +.++ +... ..++ +|. -..++|+ .|+.|+.+|... ...++|+.. +++ T Consensus 284 ~~~~----p~~~---~~~~-----~G~-~~t~lg~p~~vv~s~~~p~~~--------------i~fgdf~~Y--~i~--- 331 (377) T protein:vir:98 284 WALE----AQFT---SRNQ-----FGE-YVTVLPHGITILESLAVETGK--------------AIAFVANRY--DAF--- 331 (377) T ss_pred hhcc----cccc---ccCC-----CCc-cccccCCCceEEecCCCCccc--------------EEEEEecce--eEE--- Confidence 6553 2111 1111 222 2245554 578899888421 112333331 222 Q ss_pred hhhhhhhccceeeeeecccchhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 283 AAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 283 a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) ...+++++...+ ....+--..+++.+++|++++.|++.+.|.=+ T Consensus 332 -----~r~~~~i~~~~~-~~~~~d~~~f~~~~r~dg~~~~~~a~~vl~i~ 375 (377) T protein:vir:98 332 -----MATASTIEEYDQ-TFAMEDLQLYLTKNYFYGKAKDNHTAALLTLA 375 (377) T ss_pred -----eecceEEEeech-hhhhcCceEEEEEEEEcCEEeccCcEEEEEEe Confidence 223344444321 11111224477888999999999999998888 No 174 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=97.75 E-value=4e-07 Score=55.67 Aligned_cols=281 Identities=13% Similarity=0.050 Sum_probs=134.9 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEeccccee-eeeecCCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLS-AGYHTPGT 79 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG~~t-~~~~~~g~ 79 (332) .....+..+ .+.+.++++ .+.=+.+..++.+..+..+.+++++++.+..+ .++++.-+..+ +.-...|. T Consensus 141 ~~~~~~~~~-----~~~~~~g~~---~~vP~~~~~~i~~~l~~~~~l~~~~~v~~~~g--~~~~~~~~~~~~a~wv~E~~ 210 (466) T protein:vir:80 141 LAQVRTLAQ-----QKRAVSGAE---LTIPDVMLELLRDNMHRYSKLISKVRLRPLKG--TARQNIAGAIPEGVWTEAVA 210 (466) T ss_pred HHHHHHHhh-----hhhhhcccc---ccccHHHHHHHHHhhhhhhhhhhheeeeecCc--eeEeeeecCCcceeeccccc Confidence 000001111 011111111 23336788888888888888888888777653 34566555433 33334455 Q ss_pred CCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc--cc Q lcl|Aclame:pro 80 PIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGE--PG 157 (332) Q Consensus 80 ~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~--~~ 157 (332) .+.. .+++.+++++.+.++ +.-+.|.+-=-..+..++-+.+.++.+++++...|+.|+.- .....|.+-. .+ T Consensus 211 ~~~~-~~~~f~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~ail~G----~G~~~P~Gil~~~~ 284 (466) T protein:vir:80 211 NLNE-LSLSFSQIEVDGYKV-GGFIPIPNSTLEDSDLNLADEILDAIGQAIGFALDKAILYG----TGTKMPVGIVTRLA 284 (466) T ss_pred cccc-ccccccceeecceee-eeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhheeec----cCCCCcceeeeccc Confidence 5543 246667777776664 33345654222235578999999999999999999988731 1111111100 00 Q ss_pred cceeccc----cc--cccCHHHH----------HHHHHHHHHHHH--hcCCCcCCC-EEEEChHHHHHHHhhcCchhhcc Q lcl|Aclame:pro 158 GFHVNIG----AG--NTNDAQAI----------VDGFFEAAAVLD--ERSAPQEGR-VAVLSPRQYYSLISSVDTNILNR 218 (332) Q Consensus 158 ~~~i~~~----~~--~~~~~~~~----------~d~i~~a~~~Ld--e~~VP~~gR-~~vv~P~~~~~Ll~~~d~~~~~~ 218 (332) ...+... +. ...+...+ +..+.++...+. +... ..++ +.++++..+..|+.-.. . T Consensus 285 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~w~~~~~~~~~l~~~~~-~---- 358 (466) T protein:vir:80 285 QTTQPPNWGTKAPAWTNLSTTNLLKIDPTGKSAEEFFSELVLKLSKARANY-SNGMKFWAMSSNTHAVLMSKAI-T---- 358 (466) T ss_pred ccccccccccccccccccchhhhhhhhhhccchhhHHHHHHHHHHhhhccc-cCCceeEEecchhHHHhhcccc-c---- Confidence 0000000 00 00000000 011111111111 1111 2333 34678888877754221 1 Q ss_pred cccccccccccc-ceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeee Q lcl|Aclame:pro 219 EIGNSQGDMNSG-KGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTT 297 (332) Q Consensus 219 d~~~~~~~~~~g-~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~ 297 (332) .+.++.+..+ ..-..++|.+|+.|+++|... -+.++|... +++-+ .+++++.. T Consensus 359 --~~~~g~~~~~~~~~~~i~G~pvv~s~~~~~~~--------------~~~g~~~~y--~i~~r--------~~~~i~~~ 412 (466) T protein:vir:80 359 --FNSAGALVASLNNTMPIVGGDIVILDFIPDND--------------IIGGYGSLY--LLAER--------ADIKLAQS 412 (466) T ss_pred --ccCCccccccCCCcccccccceeecCccCccc--------------eeeeccccE--EEEee--------cceEEEec Confidence 1111222111 001248899999999998521 122233321 12222 22333332 Q ss_pred ecccchhHHH--HHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 298 SGDFNVQYQG--DLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 298 ~~~~~~~~~~--d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) .+ ..+.- ..+++.+++|+++++|++.+.+.-+ T Consensus 413 ~~---~~f~~d~~~~r~~~r~dg~~~~~~afv~~~~~ 446 (466) T protein:vir:80 413 EH---VRFIEDQTVFKGTARYDGKPVFGEGFVAVNIA 446 (466) T ss_pred hh---hhhhcCcEEEEEEEEEccEEeccCceEEEEec Confidence 11 11111 2356778889999999998887655 No 175 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=97.70 E-value=1.2e-06 Score=53.03 Aligned_cols=285 Identities=11% Similarity=0.034 Sum_probs=137.3 Q ss_pred CCC-------cccccc---c--ccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEeccc Q lcl|Aclame:pro 1 MTT-------LSNFSL---P--NQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTG 68 (332) Q Consensus 1 m~~-------~~~~~r---~--~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG 68 (332) +.. ...++. - +....++. ++++ .|.=+.+..++.+..++.|.+++++++.+.. | .++||... T Consensus 62 ~~~~~~~~r~~~~l~~ee~~~~~~~~~~t~-~~gG---~liP~~~~~~Ii~~l~~~s~i~~~~~v~~~~-~-~~~i~~~~ 135 (395) T protein:vir:95 62 VDNGILAKRSQDPLTSEERKFFNDINYDVG-YTDE---KILPETVVERVFDDLQKDHPLLSKINFQNAG-I-KTRVIKAD 135 (395) T ss_pred HHHHHHhhcCccccchHHHHHHHHHhhccC-CCCc---eeccHHHHHHHHHHHHhhhhhhhhceeEecC-C-ceEEEEec Confidence 000 001110 0 01111122 2222 1455889999999999999999999877663 4 35777654 Q ss_pred c-eeeeeecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 69 K-LSAGYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASA 147 (332) Q Consensus 69 ~-~t~~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~ 147 (332) . ....-...+..+....+++.+++++..-++ +.-..|.+-=-..+..|+-+.+.++.++++++..|+.++.- .++. T Consensus 136 ~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl-~~~~~iS~ell~ds~~~ie~~i~~~la~~ia~~~~~a~i~G--~G~~ 212 (395) T protein:vir:95 136 PAGQAVWGKVFGEIKGQLDAAFREENFTQYKL-TCFVVLPDDLSTFGPAWIERFVRTQIQEAISVALESAIING--GGAA 212 (395) T ss_pred CCcceEEeecccccCccccccceeeeeceeeE-EEeecccHHHHhcchhHHHHHHHHHHHHHHHHHHhhheeec--cCCC Confidence 4 333333333333322345666777666553 34455654223345678999999999999999999877631 1110 Q ss_pred hcccccccc--ccc---eecc-cccccc--CHHHHHHHHHHHHHHHHh----cC-CCcCCCEEEEChHHHHHHHhhcCch Q lcl|Aclame:pro 148 EASPVTGEP--GGF---HVNI-GAGNTN--DAQAIVDGFFEAAAVLDE----RS-APQEGRVAVLSPRQYYSLISSVDTN 214 (332) Q Consensus 148 ~~~~~~~~~--~~~---~i~~-~~~~~~--~~~~~~d~i~~a~~~Lde----~~-VP~~gR~~vv~P~~~~~Ll~~~d~~ 214 (332) ..-|.+-.. ... .... .++..+ +....++.+..+...|.- +. ........++.|..+..+. .+ T Consensus 213 ~~qP~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~~~~~----g~ 288 (395) T protein:vir:95 213 KTQPVGLMKDVNTNSGAVTDKASSGTLTFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDSWDVQ----AR 288 (395) T ss_pred CcCceeeeecccccccccccccccchhhhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhhhhcC----Cc Confidence 000110000 000 0000 011111 112223334333333211 11 1122234578887765442 11 Q ss_pred hhccccccccccccccceeeeee--ceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccc Q lcl|Aclame:pro 215 ILNREIGNSQGDMNSGKGLYSIA--GIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAP 292 (332) Q Consensus 215 ~~~~d~~~~~~~~~~g~~v~~i~--G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~ 292 (332) .+ +.. ..|. ..+++ |.+|+.|+.+|... -..++|+.. +++-+ .++ T Consensus 289 ~~---~~~-----~~G~-~~~~lg~g~~v~~~~~~p~~~--------------i~fgdfs~y--~i~~r--------~~~ 335 (395) T protein:vir:95 289 YT---YLT-----ANGG-FVTVLPYNVTIITSEFVPEGK--------------LVAFVTDRY--NAVRG--------GGL 335 (395) T ss_pred ce---ecc-----CCCc-ceeccCCcceEEEcCCCCCCc--------------EEEEecccE--EEEEe--------cce Confidence 11 111 1233 23444 66789999999421 122455442 22222 233 Q ss_pred eeeeeecccchhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 293 TIQTTSGDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 293 ~~e~~~~~~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) +++...+ ....+-...+++..++|+++++|++.+.|.=. T Consensus 336 ~i~~~~~-~~~~~d~~~f~~~~r~dg~~~~~~A~~~l~i~ 374 (395) T protein:vir:95 336 TVKKFDQ-TLALEDAVLFTAKTFAYGQPDDNKASAVYDLK 374 (395) T ss_pred EEEeccc-hhhhCCcEEEEEEEEECCEEeccccEEEEEee Confidence 3433221 10111112366778899999999998764333 No 176 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=97.62 E-value=3.7e-06 Score=50.35 Aligned_cols=296 Identities=9% Similarity=0.015 Sum_probs=136.3 Q ss_pred CCCccc--ccccccccc--cccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecccce--eeee Q lcl|Aclame:pro 1 MTTLSN--FSLPNQANG--GARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKL--SAGY 74 (332) Q Consensus 1 m~~~~~--~~r~~~~~~--~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG~~--t~~~ 74 (332) |-.|.+ +.+|..... +-...++. .|-=++++ +..+..++.|.++++.++.+..++.+..|+.+|.. .... T Consensus 1 ~~~~~~~~~~~~~~~~k~~t~~d~~Gg---~l~P~~~~-~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~~~~~~~g 76 (315) T protein:vir:41 1 MLTIEDIRGGKPFEIVPKIDVPDLGRG---VLSVDRFG-EFVKAVRDSAVIIPEARIDNALKSYEKDISRLSLVLDVGPG 76 (315) T ss_pred CcccchhhcCChhhhhhhcCCcCCCCc---eechHHHH-HHHHHHHhhhhhhhhceeeeccccccccccccccCcccccc Confidence 444433 223322111 11111111 13335554 46677788899999988755555566667776532 2212 Q ss_pred ecC---CCCCCccCCCCCceEEEEEeeeeecchhhh-h-HHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|Aclame:pro 75 HTP---GTPIVGDAGIKANEKTLVMDDLLVSSQFVY-S-LDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEA 149 (332) Q Consensus 75 ~~~---g~~~~~~~~~~~~~~~l~ID~~~~~~~~Id-d-~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~ 149 (332) +.. +..+.. ..++..+.++..-++. +...|. + +|+..-..|+.+.+..+.++++++..+..++.= ..+ +. T Consensus 77 ~~~~~~~~~~~~-~~~~f~~~~l~~~~l~-~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nG--dg~-s~ 151 (315) T protein:vir:41 77 RDETGQKLAPPE-STAEVKTNTLYMREMV-TKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYLHG--DTS-SS 151 (315) T ss_pred cccccCcCCCCC-Cccccceeeeceeeee-eeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhhcc--CCc-Cc Confidence 221 222221 2356666666666653 334552 2 232222358999999999999999988766531 011 00 Q ss_pred ccccccccc------ceeccccccccCHHHHHHHHHHHHHHHHhcCCCc-CCCEEEEChHHHHHHHhhcCchhhcccccc Q lcl|Aclame:pro 150 SPVTGEPGG------FHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQ-EGRVAVLSPRQYYSLISSVDTNILNREIGN 222 (332) Q Consensus 150 ~~~~~~~~~------~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~-~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~ 222 (332) .+....+.| ..+..+..+........+.|.++...|..+.--. .+-++++++..+..+.+-++.+ ..+ . T Consensus 152 ~p~~~~~~G~l~~a~~~~~~~~~~~~a~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~rklk~~~---g~~-l 227 (315) T protein:vir:41 152 DPLLRMSDGWLKLASEKLTESDVDPEAEDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYRDALKGR---ETG-L 227 (315) T ss_pred CccccccccceecccccccccccccccccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHHHHhccC---CCc-c Confidence 010000111 0111000111111112455556665554433211 1224578999888776544322 112 2 Q ss_pred ccccccccceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccc Q lcl|Aclame:pro 223 SQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFN 302 (332) Q Consensus 223 ~~~~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~ 302 (332) .+..+..|+ -..++|.+|+.++.+|..+... ..-+-+++.+.+ .+-..++.++..++... T Consensus 228 w~~~~~~g~-~~tl~G~PV~~~~~m~~~~~~~---------~~ilf~d~~nl~----------~~~~~~i~i~~~~~a~~ 287 (315) T protein:vir:41 228 GDQALTGAN-SILYDGRPVQYVPALEALNDGK---------SRALFVVPTQLV----------YGFWRNIKVVPDYDAEM 287 (315) T ss_pred ccchhhcCC-CceecccceEecccccccCCCC---------ccEEEecccceE----------EEeccccEEEeeecCCC Confidence 223344453 5689999999999998543211 112223333321 11223344444333221 Q ss_pred hhHHHHHHHHHHHhCCceechhh-eeeeecC Q lcl|Aclame:pro 303 VQYQGDLIVGKLAMGCGSLRTSV-AGSFQAA 332 (332) Q Consensus 303 ~~~~~d~i~~~~~~G~~vlrpe~-~v~i~~A 332 (332) .. ..+....+.|++..-+++ ++.+.+= T Consensus 288 ~~---~~~~~~~r~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 288 RL---TKYVASLRTDNHYEDEEGAVSATITV 315 (315) T ss_pred Cc---eEEEEEEEeceeEEeccceeEeeeeC Confidence 11 122333455665544443 3333333 No 177 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=97.61 E-value=2.7e-05 Score=45.59 Aligned_cols=289 Identities=12% Similarity=0.142 Sum_probs=152.0 Q ss_pred CCCcccccccccccccccccccCchhhHHH-HHHhHHHHHHHHHhhhhc--cccccc-cc-----cccceEEEeccccee Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATAL-KLFSGEVFTAFNNASIFK--GLVRSY-DL-----RGGKSKQFMFTGKLS 71 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~-e~f~g~V~~~f~~~s~~~--~~v~~r-~~-----~~G~tv~i~~iG~~t 71 (332) |+.+++-||.. .+|+ |+|..+|.+.-.+.+-|. +.+... .+ .+|+.+.+|..+... T Consensus 1 M~~~~~~T~l~---------------Dii~pEvF~~Yv~~~~~e~~~l~qSGiv~~d~~l~~~~~~gG~~v~iPf~~~L~ 65 (367) T protein:vir:80 1 MPDFNNQVRLV---------------DAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLD 65 (367) T ss_pred Ccchhhhhhhh---------------hccchhhhhHHHhhhhhhhhhhhhcceeecCHHHHHHhhcCCCEEEeeeeccCC Confidence 99887778762 1455 889888888776654433 222211 22 679999999997764 Q ss_pred eee--ecCCCC---CCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 72 AGY--HTPGTP---IVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKAS 146 (332) Q Consensus 72 ~~~--~~~g~~---~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa 146 (332) -.. |..+++ ++ +..++..+.. -+=..+-.+|...|+-..-+--|.|..+..+.+.--.+...+.++..|...- T Consensus 66 g~~~n~~~d~~~~~~t-~~kittg~~~-a~v~~r~kaw~~~Dla~~lsG~dpm~~Ia~qva~yW~r~~q~~Lla~L~Gvf 143 (367) T protein:vir:80 66 SLEPNYGSDNPNVEAP-IDGLGSGEMK-TTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVY 143 (367) T ss_pred CCccccCCCCCccccc-ccccccchhe-eeeehhcccchhhhHHHHhhCchHHHHHHHHHHHHhhhhhHHHHHHHHHHhh Confidence 322 221221 21 2334433322 2222345677788998888888999999999888888887777777665433 Q ss_pred hhcccc---------------ccccccceeccccccccCHHHH-HHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhh Q lcl|Aclame:pro 147 AEASPV---------------TGEPGGFHVNIGAGNTNDAQAI-VDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISS 210 (332) Q Consensus 147 ~~~~~~---------------~~~~~~~~i~~~~~~~~~~~~~-~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~ 210 (332) ...... .+..+..+..+.+........+ .+.+.+|...|.++ ...=..++|-+.+|..|.+. T Consensus 144 ~~~~a~~~~~~~~~~~~~a~~~~~~~~~~~Dis~~t~~~~~~~s~~~~~~A~~~lGD~--~~~l~~i~mHS~V~~~L~~~ 221 (367) T protein:vir:80 144 KSNLAGNFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDH--VGSIAAIAVHSMVYKRMTNN 221 (367) T ss_pred ccccccchhhhhhhhccccccccccCceeeeeeccCCCccceecHHHHHHHHHHhccc--cccccEEEEchHHHHHHHhc Confidence 221100 1111222222222211111112 56688888888664 23346888999999998653 Q ss_pred cCchhhccccccccccccccceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhc Q lcl|Aclame:pro 211 VDTNILNREIGNSQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSV 290 (332) Q Consensus 211 ~d~~~~~~d~~~~~~~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~ 290 (332) +++.. ...+++ +. .++.++|.+|+.+..+|..... ..+. ..+.+|-.=|++..+.. T Consensus 222 ---~li~~-i~~sd~---~~-~i~ty~G~~VIvDD~~Pv~~~~------a~~~----------yttYlfg~GAi~~~~~~ 277 (367) T protein:vir:80 222 ---DEIEF-IPDSKG---QL-TIPTYMGKVVIVDDGMPVFGTG------ADKT----------YLSILFGGAAFGYADGA 277 (367) T ss_pred ---ccccc-ccCCCC---cc-ccceecceeEEEeCCCcccccC------CCce----------EEEEEEecceeeecccC Confidence 45432 112222 22 3889999999999999964311 0111 12234444444444433 Q ss_pred cc-eeeeeecccchhHH-HHHHH-----HHHHhCCceechhhe--------------------eeeecC Q lcl|Aclame:pro 291 AP-TIQTTSGDFNVQYQ-GDLIV-----GKLAMGCGSLRTSVA--------------------GSFQAA 332 (332) Q Consensus 291 ~~-~~e~~~~~~~~~~~-~d~i~-----~~~~~G~~vlrpe~~--------------------v~i~~A 332 (332) +. .+|.-|++.....- -|.+. -+|.+|.+-.....+ -+|..+ T Consensus 278 ~~~~~E~~Rd~~~~~~gG~d~L~~Rr~~~~hP~G~s~~~~~v~~~~~~~~~~~~~~~~~sPt~~eLa~~ 346 (367) T protein:vir:80 278 PQVPVAVGRRELRGNGSGLEYILERKEWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLANP 346 (367) T ss_pred CccceecccchhhhcCCceEEEEeeeeEEeecceeeecccccccccccccccccccccCCCChHHhcCC Confidence 22 12333332110000 01111 235556554432211 012222 No 178 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=97.43 E-value=7.4e-06 Score=48.72 Aligned_cols=300 Identities=15% Similarity=0.081 Sum_probs=144.3 Q ss_pred CCCcc----------------------------cccccccccccccccccCchhhHHH-HHHhHHHHHHHHHhhhhcccc Q lcl|Aclame:pro 1 MTTLS----------------------------NFSLPNQANGGARNADYDVRYATAL-KLFSGEVFTAFNNASIFKGLV 51 (332) Q Consensus 1 m~~~~----------------------------~~~r~~~~~~~~~~~~~d~~~al~~-e~f~g~V~~~f~~~s~~~~~v 51 (332) |-..- +=+--++++-.-.-..++. ++.| +.-++.|.++-.--.+-..++ T Consensus 28 me~~et~~e~~~~~~~~~~~e~el~E~f~Kmm~G~~p~~eV~~~e~mtt~~a--~IliP~vis~v~~Eaaepl~~~~kl~ 105 (393) T protein:vir:79 28 MERGETLAEADANKLALNEEETQILESFAKMMEGETPTNEVNLREFMATPSA--QILIPRVIVGTMREAAEPLYIGTKML 105 (393) T ss_pred hhhhhhhhhhhhhhhhcchhHHHHHHHHHHHhcCCCchhheehhhhhcCCCc--ceechhhhhhhhhhcccchhHHHHHH Confidence 11100 1010111111000111111 1222 444555554321112222223 Q ss_pred ccccccccceEEEecccceeeeeecCCCCCCccCCCC-CceEEEEEeeeeecc-hhhhhHHHHHhchhHHHHHHHHHHHH Q lcl|Aclame:pro 52 RSYDLRGGKSKQFMFTGKLSAGYHTPGTPIVGDAGIK-ANEKTLVMDDLLVSS-QFVYSLDEIFSQYSTRAEVSKQIGEA 129 (332) Q Consensus 52 ~~r~~~~G~tv~i~~iG~~t~~~~~~g~~~~~~~~~~-~~~~~l~ID~~~~~~-~~Idd~D~~q~~~d~~~~~~~~~~~a 129 (332) ..-.+..|.+-.|+.+|..-..+...|..+... .++ .+.-.+++-+.|+.- ..+.|----.+.+|+++-..+.++++ T Consensus 106 qk~~L~~Grsm~F~~~g~~Ra~~IgEGgE~~~~-sld~~T~dsv~~~~gK~G~~Ia~SqEmIsDSg~Dvin~~l~aA~Ra 184 (393) T protein:vir:79 106 QKIRLKSGQSMIFPSIGIMRAYDVAEGQEIPED-SIDWQTHESPEIRVGKSGIRLRFTDEMISDSQWDLMSMMIKQAGRA 184 (393) T ss_pred HHHhhhcCcceeccchheeeecccccccccccc-chhhhcCCceeEEechhhhhhhhHHHHhhcchHHHHHHHHHHHHHH Confidence 323567899999999998888888887776543 343 222245666655432 22333222236899999999999999 Q ss_pred HHHHHHHHHHHHHHHHhhhc-----cccccccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHH Q lcl|Aclame:pro 130 LATHYDERIARVLAKASAEA-----SPVTGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQY 204 (332) Q Consensus 130 La~~~D~~i~~~~~~aa~~~-----~~~~~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~ 204 (332) |+|..|+-++.+.-+-..+. ....+++.|.....-...+...+.+.|-++. .+..-- .+-++++.|-.| T Consensus 185 MaRkKee~a~n~fk~~ghtvfDa~st~t~ahptGr~~~~~qNGTlSleDllDm~~a---v~~~hy---t~svi~MHPLAW 258 (393) T protein:vir:79 185 MGRHKEQKAYHQFRSHGHTVFDNYSTNKLAHTTGLDKNGVQNDTFSAEDFLDLIIA---VMANEY---TPSDLMMHPLAW 258 (393) T ss_pred HHhhhHHHHHhhhhcccceeeeccccCccceeecCCccccccccccHHHHHHHHHH---HhcccC---CcceEEEcCchh Confidence 99999999998775433211 1111222221111111222334444443332 222222 346788888888 Q ss_pred HHHHhhcCchhhccccccccccccccce-eeeee-----------ceEEEeeCccccccccccccccccccccccccccc Q lcl|Aclame:pro 205 YSLISSVDTNILNREIGNSQGDMNSGKG-LYSIA-----------GIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDAS 272 (332) Q Consensus 205 ~~Ll~~~d~~~~~~d~~~~~~~~~~g~~-v~~i~-----------G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~ 272 (332) +..-+ + ..+..-+.+.-+.+-.... --+.. .|.|+.|+-+|.-.... ...-|.++ . T Consensus 259 nv~AK--n-a~me~~~~na~gN~~~~~~~ts~algp~~i~~~~~~nlnv~~sPfvp~d~k~~--------rFd~~~Vd-~ 326 (393) T protein:vir:79 259 TVFAK--N-ELMGSLQANPYGNYPAKGAPSSMALGPDSIQGRLPFNFNVNLSPFIPLDKKSR--------RFDVYAVD-R 326 (393) T ss_pred hhhhh--h-hhhcceeeccccccCccccchhhhhchhhhccccccceeEEEecccccccccc--------eeeEEEee-c Confidence 76532 1 1222212211111111000 11223 49999999998532111 11122233 3 Q ss_pred ceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 273 ALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 273 ~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) ++++++.-++ +++++.|+++- +--.-|+=.-+||.+||.--.+++.-.- T Consensus 327 NnvgvlLV~D--------~i~tdq~ddk~---rdiq~iKl~ERYG~gvLn~gkaiavakN 375 (393) T protein:vir:79 327 NNVGVLLVRD--------DLKTDQWDEKA---RGLQNIKMIERYGIGILNEGKAIAVAKN 375 (393) T ss_pred CCceEEEEec--------Ccceecccccc---ccceeeeeeeeeceeeeeCCceEEEEec Confidence 4455554333 44666665432 2122345556899999998887765443 No 179 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=96.06 E-value=0.00021 Score=40.78 Aligned_cols=296 Identities=11% Similarity=0.088 Sum_probs=138.2 Q ss_pred CCC--cc--cccccccccccc----------cccccCchhh-HHHHHHhHHHHHHHHHh-hhhccccccccccccceEEE Q lcl|Aclame:pro 1 MTT--LS--NFSLPNQANGGA----------RNADYDVRYA-TALKLFSGEVFTAFNNA-SIFKGLVRSYDLRGGKSKQF 64 (332) Q Consensus 1 m~~--~~--~~~r~~~~~~~~----------~~~~~d~~~a-l~~e~f~g~V~~~f~~~-s~~~~~v~~r~~~~G~tv~i 64 (332) |+- ++ .|.|-|...-|. ..+..| .. |+...-...++..|+.. .-++.+.+.++++.-+..+. T Consensus 331 ~~L~elAr~~L~~~G~~~~~~~~~~~v~~A~~hsTsD--Fp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~ 408 (652) T protein:vir:79 331 MTLREYARMSLTERGIGVSSYNPMQMVGAAFTHSTSD--FGNILLDVANKAILQGWEDAPETYEQWTRKGQLSDFKIAHR 408 (652) T ss_pred ccHHHHHHHHHHhhccCCCCCCHHHHHHHHhhcCcch--HHHHHHHHHHHHHHHHHhhhHHHHHHHhccCCCccccccce Confidence 110 00 111111100000 011222 12 23333334455666644 56778888888888888888 Q ss_pred ecccc-eeeeeecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 65 MFTGK-LSAGYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLA 143 (332) Q Consensus 65 ~~iG~-~t~~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~ 143 (332) .++|. +++.....|.++.+- .+.....++.+.++ ---|.|...--.=-..+.-..+.+..|++-++..++.+...+. T Consensus 409 ~~lg~~~~L~~V~E~gEyk~~-t~~e~~e~~~l~ty-G~~~~iTRqaiINDDL~a~~~ip~~~g~aA~~~~~~~vy~~l~ 486 (652) T protein:vir:79 409 VGMGGFSALRQVREGAEYKYV-TTGDKQATIALATY-GELFSITRQAIINDDLNMLTDVPMKLGRAAKSTIADLVYAILT 486 (652) T ss_pred eecCCCCCccccCCCCcccee-eecCccceeeeecc-cCeeeeehheeeccchhHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 88875 566666667776653 45666777888764 2223343210000023345667778899999999988887775 Q ss_pred HHhhhcccccccccc---ceeccccccccCHHHHHHHHHHHHHHHHhcC-----CCcCCCEEEEChHHHHHHHhhcCchh Q lcl|Aclame:pro 144 KASAEASPVTGEPGG---FHVNIGAGNTNDAQAIVDGFFEAAAVLDERS-----APQEGRVAVLSPRQYYSLISSVDTNI 215 (332) Q Consensus 144 ~aa~~~~~~~~~~~~---~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~-----VP~~gR~~vv~P~~~~~Ll~~~d~~~ 215 (332) .-.... ..|.+-+ .+-++.++.+.+-. .|-.+++.|.++. +--..||++|+|+...... ++ T Consensus 487 ~Np~~~--~DGk~LF~hA~H~Nl~~~aa~~~~----~l~~ar~aM~~Qk~g~~~l~i~P~~llvp~~le~~a~-----~l 555 (652) T protein:vir:79 487 SNPKIS--TDNVSLFDKAKHANVLESAAMDVA----SLDKARQLMRVQKEGERHLNIRPAFVLVPTAMESVAN-----QV 555 (652) T ss_pred cCcccc--cCCceeecccccccccccccCCHH----HHHHHHHHHHHhccCCccccccccEEEecchhHHHHH-----HH Confidence 322111 0222222 23344333333433 3334443333322 2234589999998554332 22 Q ss_pred hccccccccccccccceeeeeec-eEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhcccee Q lcl|Aclame:pro 216 LNREIGNSQGDMNSGKGLYSIAG-IRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTI 294 (332) Q Consensus 216 ~~~d~~~~~~~~~~g~~v~~i~G-~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~ 294 (332) ++...... .....| .+--+.| ++|+..++|...+.+.+-.++..+. . ++=+.|= -+ ...|.+ T Consensus 556 l~s~~v~~-a~~~~~-~~Np~~~~~~~i~eprL~~~s~~~wylaa~~~~-d--------tiev~yL-----~G-~~~P~i 618 (652) T protein:vir:79 556 IRSSSVKG-ADINAG-IINPVKDFATVIAEPRLDDNSQTTFYLAASKGS-D--------TIEVAYL-----NG-VDTPYI 618 (652) T ss_pred hccCCCcc-cccccc-cccccccccccccccccCCCCcccEEEecCCCC-C--------eEEEEEe-----cC-CCCCee Confidence 22111111 111111 1222334 3888899997654444433332221 1 1111110 00 122333 Q ss_pred eeeecccchhHHHHHHHHHHHhCCceechhheeeeec Q lcl|Aclame:pro 295 QTTSGDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQA 331 (332) Q Consensus 295 e~~~~~~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~ 331 (332) |.-... + -.|-.++-.+=||++++...+++=..+ T Consensus 619 e~~~gf-~--~dG~~~kvrlD~G~~~iD~RG~~k~t~ 652 (652) T protein:vir:79 619 DQMEGF-S--VDGVTTKVRIDAGVAPVDHRGLVKCTA 652 (652) T ss_pred eecCCC-C--cceEEEEEEEeccCceeeccceeeecC Confidence 322111 1 112233445678999998887665544 No 180 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=96.06 E-value=0.0011 Score=36.90 Aligned_cols=281 Identities=9% Similarity=0.088 Sum_probs=130.1 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc---cceeeeeecC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT---GKLSAGYHTP 77 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i---G~~t~~~~~~ 77 (332) |+. +|=. .++. |.-......|.+.|.++|-+........+. |++.+.++. +.++..+... T Consensus 25 m~a---lTLa---ea~~----------l~~d~~~~~VIE~l~~~s~iL~~lpf~~ve-~~~~~~~r~~~lp~a~~r~~n~ 87 (330) T protein:vir:94 25 MPT---VTLA---ESAK----------LSQDHLVSGLIETIVEVNPLYEMMPFTEIE-GNALAYNRENVLGDVQFLAVGG 87 (330) T ss_pred hhh---hhhh---HHhh----------cCchhhHHHHHHhhhccchHHhhccccccc-CCcceeeeeecCCcceeeeccc Confidence 332 1211 1111 223445677888888775555555444433 445555544 3444444443 Q ss_pred CCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHh-----chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 78 GTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFS-----QYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPV 152 (332) Q Consensus 78 g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~-----~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~ 152 (332) +.++.. +.+..+++. + .....- +-++|+.-+ ..|.+.+..+...++|++++...++.- ..+ .+- T Consensus 88 ~~~~~~--~~Tf~q~t~--~-l~~l~~-~~~Vd~~iadl~g~~~d~~~~q~~~~ieal~~~~e~~linG--Ds~---~~~ 156 (330) T protein:vir:94 88 TITAKN--PATFTKVTS--E-LTTLIG-DAEVNGLIQATRSDFMDQTSVQVASKAKSIGRQYQASMITG--DGT---GNS 156 (330) T ss_pred cccccC--cceeeeeee--c-hhhhhh-hHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhhcc--CCC---Ccc Confidence 322211 111123333 2 112221 224555442 346788888888889988887655431 000 010 Q ss_pred c-c----ccccceeccc-cccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhcccccccccc Q lcl|Aclame:pro 153 T-G----EPGGFHVNIG-AGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGD 226 (332) Q Consensus 153 ~-~----~~~~~~i~~~-~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~ 226 (332) . | ..+...+..+ .+...+++. .|.++++.- +-|-+.-+++++..+...+-+.. .+...+. ...... T Consensus 157 F~GL~~~~~~~q~i~tg~~gg~~T~d~-LDeLl~~v~-----~~~g~~~~~l~n~a~~r~I~a~~-R~~~~~~-v~~~~~ 228 (330) T protein:vir:94 157 FQGMMGLVAASQTISAGANGGTLTFEL-LDQLLDLVK-----DKDGQVDYLMSSFAMRRKYFSLL-RALGGAA-IGEVMT 228 (330) T ss_pred ccchhhcCCcccEEecCCCCCCCCHHH-HHHHHHHhc-----CCCCCCcEEEechhHHHHHHHHH-HhccCCC-CCCccc Confidence 0 1 1122233322 222233322 233333221 11223458887777665553321 1111111 111223 Q ss_pred ccccceeeeeeceEEEeeCccccccccccccccccccccccccccc------ceEEEeechhhhhhhhhccceeeeee-- Q lcl|Aclame:pro 227 MNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDAS------ALAGLIFHREAAGCIQSVAPTIQTTS-- 298 (332) Q Consensus 227 ~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~------~~~~l~~h~~a~~~~~~~~~~~e~~~-- 298 (332) ...|+.|-.|.|++|+.++-+|...+. ...+|..+-|...+. ..+||-.... -.++++... T Consensus 229 ~~~G~~v~~~~GvPi~~~d~ip~~~~~----~~~~~ttsIyav~~G~~~~~qgV~Gl~~~g~-------~glsVr~~G~~ 297 (330) T protein:vir:94 229 LPSGRQIPTYRGVPWFVNDFIPSNMTQ----GTATNATAIFAGTFDDGSNKYGIAGLTARGS-------AGLRVQNVGAK 297 (330) T ss_pred ccCCCEEeeeCCeEEEecccccCCCCc----ccCCCceeEEEEeecccccccceEeecCCCC-------CcceeeeCCCc Confidence 346777889999999999999965321 122344455655543 2233321111 123333221 Q ss_pred cccchhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 299 GDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 299 ~~~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) ++....++ .| .+-+|..++.|++++.|+-= T Consensus 298 ~~k~v~~~--~v--~~y~~~av~~~~a~~~L~~V 327 (330) T protein:vir:94 298 ENADETIT--RV--KMYCGFANFSQLGLAAIKGL 327 (330) T ss_pred cccceeeE--EE--EEeeeeEEechhheeeeccc Confidence 11111100 11 12369999999999999888 No 181 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=95.40 E-value=0.0021 Score=35.22 Aligned_cols=287 Identities=11% Similarity=0.099 Sum_probs=128.8 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEeccc---ceeeeeec- Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTG---KLSAGYHT- 76 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG---~~t~~~~~- 76 (332) |+ .||..- ++... -| .+ ...|.+.|.+.|-+.......+++ |++.+.++.- ........ T Consensus 1 mp---altLae---a~k~~--~d---~l-----~~~ViE~~~~~s~lL~~LpF~~ve-g~~~~ynR~~~~~~~~~~~v~~ 63 (310) T protein:vir:97 1 MA---SVTLAE---SAKLA--QD---EL-----VAGVIENIITVNRMFDVLPFDSIE-GNSLAYNRENVLGDVIMAGVGT 63 (310) T ss_pred Cc---ccchHH---HhhcC--cc---hH-----HHHHHHHHhccchHHHhCCccccc-CCcceeeEeeccCCcccccccc Confidence 54 455432 22111 11 12 346677887666666666555554 5566666552 22222111 Q ss_pred ----CCCCCCccCCCCCceEEEEEeeeeecchhhhh-HHHHH-h-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|Aclame:pro 77 ----PGTPIVGDAGIKANEKTLVMDDLLVSSQFVYS-LDEIF-S-QYSTRAEVSKQIGEALATHYDERIARVLAKASAEA 149 (332) Q Consensus 77 ----~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd-~D~~q-~-~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~ 149 (332) .|..+. ..+.++++..+--. .-.+.||. +.+.. . -.|.+.+..+...++|++++...++. +-.++ T Consensus 64 ~~~~~g~~~~---~~t~~~~~~~L~i~-~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lIN----GD~a~ 135 (310) T protein:vir:97 64 TFSGAGAGKA---AATFTKVNSNLTTI-MGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLIN----GNGAG 135 (310) T ss_pred cccCCCcccc---ccccceeeeeeeee-eehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhc----cccCC Confidence 122111 12223333333221 12233332 11211 2 34677777888889999988765543 10001 Q ss_pred ccccc----ccccceecccc-ccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhcccccccc Q lcl|Aclame:pro 150 SPVTG----EPGGFHVNIGA-GNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQ 224 (332) Q Consensus 150 ~~~~~----~~~~~~i~~~~-~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~ 224 (332) .+-.| ..+...+..++ +...+++ ..|.++++.-+ -..+..+++..|..+..+-.- -|-.+....-.. T Consensus 136 n~F~GL~~~~~~~q~i~~~~~gg~~t~d-~LDeLl~~v~~-----~~g~p~~~l~~~~~~r~i~A~--~R~~~~~g~~~~ 207 (310) T protein:vir:97 136 NEFAGLIQLCASGQKATTGATGSAISFA-ILDELMDLVVD-----KDGQVDYLTMHARTLRSYKAL--LRALGGASINEV 207 (310) T ss_pred CcccchhhcCCccceeecCCCCCCCCHH-HHHHHHHHHhc-----CCCCCCEEEecHHHHHHHHHH--HHHhcCCCCCCc Confidence 11101 01122333222 2222332 23333332210 112446999999764333211 121111111122 Q ss_pred ccccccceeeeeeceEEEeeCcccccccccccccccccccccccccccc---eEEEeechhhhhhhhhccceeeeee--c Q lcl|Aclame:pro 225 GDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASA---LAGLIFHREAAGCIQSVAPTIQTTS--G 299 (332) Q Consensus 225 ~~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~---~~~l~~h~~a~~~~~~~~~~~e~~~--~ 299 (332) .....|+.|-.|.|++|+.++.+|..... ...+|..+-|...+.. ..|++.-... ..-.++++... + T Consensus 208 ~~~~~G~~v~~~~GiPi~~~d~ip~~~~~----~~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~----~~~glsVr~~G~~~ 279 (310) T protein:vir:97 208 VELPSGAEVPAYSGTPIFRNDYIPTNQTK----GGTTGCTTIFAGTLDDGSRTHGIAGLTAT----QAAGIQVVDVGESE 279 (310) T ss_pred cccCCCCEEeeeCCeEEEEeCccCCCccc----cccCCceeEEEEeeCccccccceeccccC----CccceeEEeCCccc Confidence 23456777899999999999999964211 1223445556555432 2344321110 01123333322 0 Q ss_pred ccchhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 300 DFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 300 ~~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) +.... ...| .+-+|..++.|++++.|.-= T Consensus 280 ~~~v~--~~~V--~~Y~~~av~~~~A~a~L~~V 308 (310) T protein:vir:97 280 DSDEH--IWRV--KWYCGLALFSEKGLACADGI 308 (310) T ss_pred CCcce--eEEE--EEeeeEEEecccceeeeccc Confidence 11111 0011 11269999999999988888 No 182 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=95.10 E-value=0.002 Score=35.42 Aligned_cols=275 Identities=14% Similarity=0.017 Sum_probs=98.8 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecc-cceeeeeecCCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-GKLSAGYHTPGT 79 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~i-G~~t~~~~~~g~ 79 (332) .+.-.+...+. +.... +.++ + +--..+...+...+...+.+...++..++ ....++.. .......+..|. T Consensus 229 ~~~~~~~~~~~--~~~~~-~~~~--~-~~p~~~~~~i~~~~~~~~~i~~~~~~~~i---~~~~~~~~~~~~~a~~~~eG~ 299 (517) T protein:vir:97 229 LTKDPKAAWTA--ELKER-GISG--M-PAPAGILKRIQDAVNDEGSLLPFIRHENL---PTLVVGGDNALTQGTGHTTGT 299 (517) T ss_pred ccccccceeee--ecccc-cccc--c-ccchHHHHHHHHhhhhhccceeeeeeccc---cceeeecccccceeeeeecCC Confidence 11111111110 00000 0000 0 11223334444555555555555554333 23334422 223344555565 Q ss_pred CCCccCCCCCceEEEEEeeeeecc-hhhhhHHHHHhchh----HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 80 PIVGDAGIKANEKTLVMDDLLVSS-QFVYSLDEIFSQYS----TRAEVSKQIGEALATHYDERIARVLAKASAEASPVTG 154 (332) Q Consensus 80 ~~~~~~~~~~~~~~l~ID~~~~~~-~~Idd~D~~q~~~d----~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~ 154 (332) .... .+++..++++.+-+ +.+ +.++..--..+.+| +.+-+..++.++|+++.++.++.- .+ +..+..+ T Consensus 300 ~kp~-s~~tf~~~~~~~~~--ia~~~~~S~qll~Ds~~dd~~~l~s~i~~~l~~~l~~~ee~a~l~G--dG--tg~~~~g 372 (517) T protein:vir:97 300 DKTE-SNITLQTRVLTPQY--VYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMG--GV--TGVSETQ 372 (517) T ss_pred cccc-cccceeeEEeeHhh--hhhhhhhhHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHhcc--cC--CCccccc Confidence 5443 24555555555433 333 22332111112334 677788999999999999877621 01 1111100 Q ss_pred ccccceecc-ccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhcccccccccccccccee Q lcl|Aclame:pro 155 EPGGFHVNI-GAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGL 233 (332) Q Consensus 155 ~~~~~~i~~-~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v 233 (332) .. ..... .+........+.+.+ ..|..+..+..+-.+|++|..|..|.+-||.+ ..|.- +.....+ .. T Consensus 373 i~--~~a~~~~~~~~~~~~~~~d~i----~~l~~a~~~a~~a~~vmn~~t~~~I~klKD~~---G~Yl~-~~~~~~~-~~ 441 (517) T protein:vir:97 373 IY--PVVGDAWATNVTGTTNIQELL----EKLSVATPKAADSTLVIHRNDLAAIRFLKDKN---GNYVF-PVGVSNQ-TI 441 (517) T ss_pred cc--ccccccccccccccchHHHHH----HHHHHHhhhccCCEEEECHHHHHHHHHhhcCC---CCeec-cCcCCcc-cc Confidence 00 00000 001111112222322 22222222222344679999999997766532 22221 1112222 34 Q ss_pred eeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccce-eeeeec-ccchhHHHH--- Q lcl|Aclame:pro 234 YSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPT-IQTTSG-DFNVQYQGD--- 308 (332) Q Consensus 234 ~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~-~e~~~~-~~~~~~~~d--- 308 (332) ..++|+.-+.+ .++. ...... ..+.|.. +. .+.+. .+.|.. +....|... T Consensus 442 ~~l~G~~~~~~-~~~~---~~~~~~----~~~~y~i--------~~---------~~g~~~~~~fd~~~n~~~f~~~~~~ 496 (517) T protein:vir:97 442 ATHFGFNRLVQ-SVAV---DEKTAV----SLSGYVT--------NG---------SRGMEFEQGTILVENNKEYLFEMPI 496 (517) T ss_pred cccCCcccccc-cccc---CceeEe----eccccEE--------Ee---------ecceeeeeeeecccCceeEeeeeee Confidence 55666422221 1211 000000 0011110 00 00000 011110 001111000 Q ss_pred --HHHHHHHhCCceechhhee Q lcl|Aclame:pro 309 --LIVGKLAMGCGSLRTSVAG 327 (332) Q Consensus 309 --~i~~~~~~G~~vlrpe~~v 327 (332) .|+..-++--.+.||..++ T Consensus 497 ~g~i~~~~r~a~~~~~p~~~~ 517 (517) T protein:vir:97 497 SGSLEYKGTTAYGTYTPPVAG 517 (517) T ss_pred ccccccccceEEEEEcCCCCC Confidence 1233333444555555555 No 183 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=94.73 E-value=0.0036 Score=33.96 Aligned_cols=280 Identities=14% Similarity=0.067 Sum_probs=130.3 Q ss_pred cccccccccCchhhHHHHHHhHHHHHHHHHhhhhcccccccc-ccc-cceEEEecccc-eeeeeecCCC-CCCccCCCCC Q lcl|Aclame:pro 14 NGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYD-LRG-GKSKQFMFTGK-LSAGYHTPGT-PIVGDAGIKA 89 (332) Q Consensus 14 ~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~-~~~-G~tv~i~~iG~-~t~~~~~~g~-~~~~~~~~~~ 89 (332) -++-+.+.. .+=+++....+|.+........+.++..++ +-- ..++.++.... ..++-+..+. ++.. .+++- T Consensus 1 ~~~~~~g~f---~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~dip~-~~~~~ 76 (301) T protein:vir:80 1 MQGKITATI---EARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTRSGAAKIIANGADDLPL-VDVDM 76 (301) T ss_pred CCccccchh---hHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeeccceeEEEecCccccccc-ccccc Confidence 111211111 122344555666666666777777776653 332 34566554422 2333343322 2221 13334 Q ss_pred ceEEEEEeee-eecchhhhhHHHHH-hchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc---ccccceec-- Q lcl|Aclame:pro 90 NEKTLVMDDL-LVSSQFVYSLDEIF-SQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTG---EPGGFHVN-- 162 (332) Q Consensus 90 ~~~~l~ID~~-~~~~~~Idd~D~~q-~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~---~~~~~~i~-- 162 (332) ++....|-.. .-+.+.+.++..++ ...++-..-...++.++++..|+.++-=... ..+.| .++-.... T Consensus 77 ~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~-----~g~~GLlN~p~~~~~~~~ 151 (301) T protein:vir:80 77 VRKSVPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKK-----YAIKGAFEATGIQIDVSP 151 (301) T ss_pred eeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeeccc-----ccceeeecCCCccccccc Confidence 4555555553 23455567777775 4777878888899999999999977732111 11111 11100000 Q ss_pred -cccc-----cccCHHHHHHHHHHHHHHHHhcCCCcC-CCEEEEChHHHHHHHhhcCchhhccccccccccccccceee- Q lcl|Aclame:pro 163 -IGAG-----NTNDAQAIVDGFFEAAAVLDERSAPQE-GRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLY- 234 (332) Q Consensus 163 -~~~~-----~~~~~~~~~d~i~~a~~~Lde~~VP~~-gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~- 234 (332) .+++ ...+++.+++.|..+..+|.++.-=.. .-.++|+|+.|..|.. ++..+. . +-.+.+ .+. T Consensus 152 ~~~~~~~~~w~~~t~~ei~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~---~~~~~~-~---~~tvl~--~l~~ 222 (301) T protein:vir:80 152 TTGVGNVSKWEKKTAEQIIDEIGEAHTKITVLPGYGTASLKLCLPPKQFELINK---KRYSNE-D---SRSVLK--VLQD 222 (301) T ss_pred CcccccccccccCCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHhhhh---ccccCC-C---CeeHHH--HHHH Confidence 0111 223588899999999999876521111 1368899999988842 111111 1 111111 121 Q ss_pred eeeceEEEeeCcccccccccccccccccccccccccccceEEEeechh-hhhhhhhccceeeeeecccchhHHHHHHHHH Q lcl|Aclame:pro 235 SIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHRE-AAGCIQSVAPTIQTTSGDFNVQYQGDLIVGK 313 (332) Q Consensus 235 ~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~-a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~ 313 (332) +.-+..|...+.|...++. ..+..-.|..+..+ ..+.++.. -...++.+++..+ +... T Consensus 223 ~~~~~~I~~~p~L~~~g~~------g~~~~v~~~~~~d~-~~~~v~~~~~~~~~e~~~~~~~--------------~~~~ 281 (301) T protein:vir:80 223 NAWFSAIVRVPDLAGMGTA------GSDSFAVIHDSNET-AELIIPMDITRHPEEYSFPRTK--------------VPFE 281 (301) T ss_pred HcCcceEEEcceeccCCCC------cccEEEEEecCCcE-EEEEecCceeeecceecCceeE--------------eeee Confidence 2335677777777532110 00001111111111 11221111 1111222222111 1111 Q ss_pred H-HhCCceechhheeeeecC Q lcl|Aclame:pro 314 L-AMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 314 ~-~~G~~vlrpe~~v~i~~A 332 (332) . ..|.-+.||+++.-+.== T Consensus 282 ~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 282 ERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred eeeEEEEEEccceEEEEecC Confidence 2 237889999986543333 No 184 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=94.24 E-value=0.0015 Score=36.13 Aligned_cols=297 Identities=15% Similarity=0.097 Sum_probs=135.9 Q ss_pred CCC--c--cccccccccccccc----------ccccCchhhHHHHHHhHHHHHHHHHh-hhhccccccccccccceEEEe Q lcl|Aclame:pro 1 MTT--L--SNFSLPNQANGGAR----------NADYDVRYATALKLFSGEVFTAFNNA-SIFKGLVRSYDLRGGKSKQFM 65 (332) Q Consensus 1 m~~--~--~~~~r~~~~~~~~~----------~~~~d~~~al~~e~f~g~V~~~f~~~-s~~~~~v~~r~~~~G~tv~i~ 65 (332) |+- + ..|.+-|....|.+ .+..|=+ .|+...-...++..|+.. .-++.+...++++.-+..+.. T Consensus 366 ~~L~elAr~~L~~rg~~~~~~~~~~~~~~a~~htTSDFp-~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~ 444 (693) T protein:vir:95 366 MTLRELARASLVDRGIGVASLNAPQMVGLAFTHTSSDFG-LILLDVANKSVLAGWEEAEETFPLWTKSGILTDFKPARRV 444 (693) T ss_pred CcHHHHHHHHHHhcCCccCCCCHHHHHHHHHhcCcchhH-HHHHHHHHHHHHHHHHhhhhHHHHHhccCCCCccccccee Confidence 111 0 01111111111100 1111111 133334445666666654 667777777788888877777 Q ss_pred cccc-eeeeeecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 66 FTGK-LSAGYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAK 144 (332) Q Consensus 66 ~iG~-~t~~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~ 144 (332) ++|. +++.....|.++.+. .+....-++.|.++ ---|.|+...-.=-..+....+.+..|++-++..++.+...+.. T Consensus 445 ~lg~~~~L~~V~E~gEyk~~-t~~e~~e~~~l~ty-G~~~~iTRqaiINDDLga~~~ip~~~g~aA~~~~~~~vy~~L~~ 522 (693) T protein:vir:95 445 GLGEFSSLRQVREGAEYKYV-TLGERGEQIILATY-GELFSITRQAIINDDLQMLSDIPFKLGQAAKATIGDLVYAVLTG 522 (693) T ss_pred ecCCCCChhhcCCCCceeee-ecCCccceeehhhc-CCeeeecHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 7876 455566666666542 34555556777654 22234433111111234566677889999999999988887753 Q ss_pred Hhhhccccccccccceeccc--cccccCHHHHHHHHHHHHHHHHhcCC----------CcCCCEEEEChHHHHHHHhhcC Q lcl|Aclame:pro 145 ASAEASPVTGEPGGFHVNIG--AGNTNDAQAIVDGFFEAAAVLDERSA----------PQEGRVAVLSPRQYYSLISSVD 212 (332) Q Consensus 145 aa~~~~~~~~~~~~~~i~~~--~~~~~~~~~~~d~i~~a~~~Lde~~V----------P~~gR~~vv~P~~~~~Ll~~~d 212 (332) -.........+- ..+-++. ++...+ ++.|-.++..|..+.- --..++++|+|+...... T Consensus 523 Np~m~DGk~LFh-adH~Nl~tga~sals----~~sl~~a~~am~~qk~~~~~~~g~~L~i~P~~llvP~~le~~a~---- 593 (693) T protein:vir:95 523 NPAMSDGKTLFH-ADHSNLLTGAASALS----IDSLSKAKTQMATQKAQVEKGKGRTLNIRPGFVLTPVALEDKAN---- 593 (693) T ss_pred CccccCCcceee-ccccccccccccccC----hHHHHHHHHHHHHhhcchhccCCceeecccceEEecchHHHHHH---- Confidence 211111111110 1233332 222223 3444455555544331 124578888887554443 Q ss_pred chhhccccccccccccccceeeeeec-eEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhcc Q lcl|Aclame:pro 213 TNILNREIGNSQGDMNSGKGLYSIAG-IRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVA 291 (332) Q Consensus 213 ~~~~~~d~~~~~~~~~~g~~v~~i~G-~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~ 291 (332) ++++....-. .....| .+--+.| ++++..++|...+++.+-.++.++. ...- +.|= -+ ... T Consensus 594 -~l~~s~~~~~-a~~~~~-~~NP~~~~~~vi~~prL~~~s~~~Wyl~a~~~~-dtie--------~~yL-----~G-~~~ 655 (693) T protein:vir:95 594 -QIINSESVPG-ADVNSG-IVNPIRAFAQVIGEPRLDDASATAWYMAAKKGS-DTIE--------VAYL-----DG-VDT 655 (693) T ss_pred -HHhccccccc-cccccc-cccchhccccccccceecCCCCCceEEecCCCC-CeEE--------EEEe-----cC-CCC Confidence 2333322111 111122 1222335 3788889997666666555544332 1111 1110 00 112 Q ss_pred ceeeeeecccchhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 292 PTIQTTSGDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 292 ~~~e~~~~~~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) +.+|.-... ..-|-.++-.+=||++++...+ ..+.. T Consensus 656 P~ie~~~gf---~~dG~~~kvr~D~G~~~iD~Rg--~~kn~ 691 (693) T protein:vir:95 656 PYLEQQEGF---TVDGVASKVRIDAGVAPLDFRG--LQKSN 691 (693) T ss_pred CeEeecCCC---CcceEEEEEEEeccCceeeccc--cccCC Confidence 233322111 0111123334567888876653 23333 No 185 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=90.78 E-value=0.019 Score=30.01 Aligned_cols=274 Identities=11% Similarity=0.010 Sum_probs=126.9 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHh---HHHHHHHHHhhhhcccccccc-ccc-cceEEEecc---cceee Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFS---GEVFTAFNNASIFKGLVRSYD-LRG-GKSKQFMFT---GKLSA 72 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~---g~V~~~f~~~s~~~~~v~~r~-~~~-G~tv~i~~i---G~~t~ 72 (332) |+- +. .|+..++..+++. ..|.+.....-..+.++..++ +.. -.++.++.. |..+ T Consensus 1 ~~~--------------~~--a~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~- 63 (296) T protein:vir:10 1 MGV--------------DK--ADAAGIWTVKQLTASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDGVGIAQ- 63 (296) T ss_pred Ccc--------------cc--hhhhHHHHHHHHHHHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeeccCcee- Confidence 332 11 1222345555555 344444444456666666554 222 245555443 4443 Q ss_pred eeecCC-CCCCccCCCCCceEEEEEeee-eecchhhhhHHHHHh-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|Aclame:pro 73 GYHTPG-TPIVGDAGIKANEKTLVMDDL-LVSSQFVYSLDEIFS-QYSTRAEVSKQIGEALATHYDERIARVLAKASAEA 149 (332) Q Consensus 73 ~~~~~g-~~~~~~~~~~~~~~~l~ID~~-~~~~~~Idd~D~~q~-~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~ 149 (332) -|..+ .++.. .+++-++....|-.+ .-+.+.+.++..++. ..++-......++.++++..|+.++-=- .. T Consensus 64 -~~~~~~~dip~-v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~-----~~ 136 (296) T protein:vir:10 64 -IVADYTDDLPL-VDALATERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGS-----TA 136 (296) T ss_pred -EeCCCccccce-eeccceeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeec-----cc Confidence 33322 22221 133444555555442 234555677776655 6777777788899999999998776211 11 Q ss_pred ccccc---ccccceeccccccccCHHHHHHHHHHHHHHHHhc--CCCcCCCEEEEChHHHHHHHhhcCchhhcccccccc Q lcl|Aclame:pro 150 SPVTG---EPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDER--SAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQ 224 (332) Q Consensus 150 ~~~~~---~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~--~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~ 224 (332) ..+.| .++-.... ..++=.++..+++.|..+...|.++ .+= ..-.++|+|+.|..|... .+ +++ T Consensus 137 ~g~~GLlN~p~v~~~~-~~~~W~~~t~i~~Di~~~~~~l~~~s~g~~-~p~~l~L~p~~~~~L~~~-----~~----~~~ 205 (296) T protein:vir:10 137 HGIPSVFDYPNINNVV-SGGSWSQPTTAVSDITSLLDIIETSTNGQH-RATHLLLPTTARRIMQNL-----VP----GTS 205 (296) T ss_pred ccceeEeecCCCcccc-ccCCccCHHHHHHHHHHHHHHHHHhhCcee-cceeEEeCHHHHHHHhhc-----cC----CCC Confidence 11111 11111111 1112234567888898988877654 221 112577899999877421 11 111 Q ss_pred ccccccceee-eeeceEEEeeCcccccccccccccccccccccccccccceEEEeech-hhhhhhhhccceeeeeecccc Q lcl|Aclame:pro 225 GDMNSGKGLY-SIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHR-EAAGCIQSVAPTIQTTSGDFN 302 (332) Q Consensus 225 ~~~~~g~~v~-~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~-~a~~~~~~~~~~~e~~~~~~~ 302 (332) -.+. ..+. +..+.+|...+.|....+.+. ...-.|..+-. .+.+.++. -...-++.+++.. T Consensus 206 ~t~l--~~ik~~~~~l~i~~~~~l~~a~~~g~------~~~v~~~~~~~-~~~~~v~~~~~~~~~e~~~l~~-------- 268 (296) T protein:vir:10 206 VSYG--EFFRQNNSGVTVEFVQYLNDYNGTGT------SAAIAYEKDPN-NMAIEIPEATNALPAQPKDLHF-------- 268 (296) T ss_pred ccHH--HHHHHhcCCceEEEeeeeccCCCCcc------eEEEEEEcCCc-eEEEEcCcceeeecccccCceE-------- Confidence 1111 1222 234677777777764322110 00011111111 11122211 1111122222211 Q ss_pred hhHHHHHHHHHHHh-CCceechhheeee---ecC Q lcl|Aclame:pro 303 VQYQGDLIVGKLAM-GCGSLRTSVAGSF---QAA 332 (332) Q Consensus 303 ~~~~~d~i~~~~~~-G~~vlrpe~~v~i---~~A 332 (332) .+....+. |.-+.||++++.+ .=| T Consensus 269 ------~~~~~~~~~Gv~i~~P~ai~~~dGI~~~ 296 (296) T protein:vir:10 269 ------KIPVTSKATGLIVYRPLTMAVMKGITFA 296 (296) T ss_pred ------EEeeEeeEEEEEEECCceeEEEeeeecC Confidence 12222333 6899999987765 344 No 186 >protein:vir:3969 Length: 287 # NCBI annotation: major capsid protein # Family: family:all:3269 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663677;genbank:gi:21716114;genbank:GeneID:951200 Probab=90.65 E-value=0.02 Score=29.93 Aligned_cols=263 Identities=14% Similarity=0.143 Sum_probs=126.9 Q ss_pred chhhHHHHHHhHHHHHHHHHhhhhcccccc-----ccccccceEEEeccc--ceeeeeecCCCCCC-----cc-CCCCCc Q lcl|Aclame:pro 24 VRYATALKLFSGEVFTAFNNASIFKGLVRS-----YDLRGGKSKQFMFTG--KLSAGYHTPGTPIV-----GD-AGIKAN 90 (332) Q Consensus 24 ~~~al~~e~f~g~V~~~f~~~s~~~~~v~~-----r~~~~G~tv~i~~iG--~~t~~~~~~g~~~~-----~~-~~~~~~ 90 (332) -+---|-|+|.|.+.+-|++++.|++..-- .-+.+.++.-=-... .+-++.|..+.... +. .....- T Consensus 1 ~avr~y~Kq~~glL~~vf~~qa~F~~~FGg~lQ~~DGV~~N~taf~vKtsD~pVVi~~Y~Td~Nv~FGtGTg~ssRFG~r 80 (287) T protein:vir:39 1 MAIKYFTKQYAGMLPDLFAKKSAFLRAFGGVLQVKDGVTENDTFMELKVSDTDVVIQAYSTDANVGFGSGTGNTSRFGQR 80 (287) T ss_pred CCcccccHHHHHHHHHHHHHHHhhhhhcccceeeecCCcccceEEEEEecCcceEEecccCCCCcccccCCCccccccce Confidence 011158899999999999999999876531 123333322111121 12344555432211 00 001111 Q ss_pred eEEEEEeeee-e-cchhh-hhHHHHHhchhH---HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccceeccc Q lcl|Aclame:pro 91 EKTLVMDDLL-V-SSQFV-YSLDEIFSQYST---RAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGGFHVNIG 164 (332) Q Consensus 91 ~~~l~ID~~~-~-~~~~I-dd~D~~q~~~d~---~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~~~~i~~~ 164 (332) +-.+.+|+.. | +.+.| .-+|+.-.+-|+ ..+....++.|-++.+|..+-..|...+.... .+ T Consensus 81 kEi~y~dt~V~Y~~~~~ihEGiD~~TVNnd~~aaVAdRL~Lqa~A~t~~~n~~~Gk~ls~~A~~t~-----------~~- 148 (287) T protein:vir:39 81 KEVKSVNKQVSYDAPLAINEGIDDFTVNDIKDQVVAERLALHGVAWAQHVDKLLGKLLSDSASETL-----------TV- 148 (287) T ss_pred eEEEEecccccceeccccccccccccccCChhHHHHHHHHhHHHHHHHHHHHHHHHHHHhhcchhe-----------ee- Confidence 1223333321 1 22222 235555555554 34456668888899999876655544332211 00 Q ss_pred cccccCHHHHHHHHHHHHHHHHhcCCCcCC-CEEEEChHHHHHHHhhcCchhhccccccccccccccceeeeeeceEEEe Q lcl|Aclame:pro 165 AGNTNDAQAIVDGFFEAAAVLDERSAPQEG-RVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIAGIRILK 243 (332) Q Consensus 165 ~~~~~~~~~~~d~i~~a~~~Lde~~VP~~g-R~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i~G~~V~~ 243 (332) +.+.+.+..-+-.|.++.-.++|-... ..+.|.|+.|.+|+.. +..+.. -++...+.+.+ +.+.-||-+-+ T Consensus 149 ---~~t~d~V~~LF~~a~~~yvNn~v~~~~~~~AyV~aevYnaiiD~--~l~Tsa--K~SsaNiDen~-i~kFkGf~l~e 220 (287) T protein:vir:39 149 ---KLDEDSVTKLFSDAHKKFVNNNVSIAVPWVAYVNADIYDLLIDS--KLATTA--KNSSANVDEQT-LYKFKGFILSE 220 (287) T ss_pred ---eecccchHHHHHHHHHHhhccceeeEEEEEEEEChhHHhHHhcc--cccccc--ccceeeeccCC-cceecceEEEe Confidence 112223334455566666666665444 5678999999999864 333332 23444555554 78888999988 Q ss_pred eCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHHhCCceech Q lcl|Aclame:pro 244 SNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAMGCGSLRT 323 (332) Q Consensus 244 sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~~G~~vlrp 323 (332) .+.--.-.|.. .+|.++-++.+=+ -+...|.-.++.+-|-.+-|---||--++.- T Consensus 221 ~P~~~~q~g~~----------------------a~fs~dnig~af~---GI~vaR~i~sEdF~GvalQgAgK~G~~i~e~ 275 (287) T protein:vir:39 221 LPDEKFQLNEG----------------------AYFAADNVGVAGV---GIQVTRAMDSEDFAGTALQAAAKYGKYLPEK 275 (287) T ss_pred cchHhhccCcE----------------------EEEccccceeecc---cceeEEeeecccccceeeecccccccccccc Confidence 66322211211 1222222211100 0111111122223334444444555555544 Q ss_pred hheeeeecC Q lcl|Aclame:pro 324 SVAGSFQAA 332 (332) Q Consensus 324 e~~v~i~~A 332 (332) . ..+|.+| T Consensus 276 N-k~Ai~k~ 283 (287) T protein:vir:39 276 N-KKAILKA 283 (287) T ss_pred c-ceEEEEE Confidence 4 4555555 No 187 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=90.28 E-value=0.022 Score=29.71 Aligned_cols=286 Identities=11% Similarity=0.089 Sum_probs=144.0 Q ss_pred CCCcccccccccccccccccccCchhhHH-HHHHhHHHHHHHHHhhhhc--cccccc-cc-----cccceEEEeccccee Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATA-LKLFSGEVFTAFNNASIFK--GLVRSY-DL-----RGGKSKQFMFTGKLS 71 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~-~e~f~g~V~~~f~~~s~~~--~~v~~r-~~-----~~G~tv~i~~iG~~t 71 (332) |+. ||.. + . =.| +|.|..+|.+.-.+.+-|. +.+... .+ .+|+.+.+|..+... T Consensus 1 Ma~----T~l~------D---~----iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~L~ 63 (349) T protein:vir:78 1 MAI----TTIG------D---I----VTGNIPVLASYMTEDPVEKTAFFDSGILTSTPYAAEIANGPSNIANLPFWKAID 63 (349) T ss_pred CCc----eEEe------e---e----eccCHHHHHHHHHHhhHHhhhhhhccceeccHHHHHHhhcCCCEEEeeeeecCC Confidence 663 4431 1 0 011 3578888887776654433 222211 22 569999999987654 Q ss_pred ee-e--ecC-C-CCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 72 AG-Y--HTP-G-TPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKAS 146 (332) Q Consensus 72 ~~-~--~~~-g-~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa 146 (332) -. + +-. + +....+..++..+. .-+=..+-.+|...|+-..-+--|.|..+.++.+.--.+...+.++..|...- T Consensus 64 g~~e~nv~~D~~~~~~t~~kitt~~~-~a~~~~r~kaw~~~Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf 142 (349) T protein:vir:78 64 TSIEPNYSNDVYQDIATPRAIQTGEM-MARVAYLNEGFGQADLTVELTSQNPLQSVASRLDNFWQRQAQRRLIATALGLY 142 (349) T ss_pred CCcccccCCCCcccccccccccccce-eeeeeeeccccchhHHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHHHHhh Confidence 31 1 111 1 11222334544433 33333456678888888887777999999999888888888887777765433 Q ss_pred hhccc---cccccccceeccccccccCHHHHHHHHHHHHHHHHhcC---CCcCCCEEEEChHHHHHHHhhcCchhhcccc Q lcl|Aclame:pro 147 AEASP---VTGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERS---APQEGRVAVLSPRQYYSLISSVDTNILNREI 220 (332) Q Consensus 147 ~~~~~---~~~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~---VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~ 220 (332) ..... .....+.....+++....++. .+.+|..+|...- ....=..+++-+..|..|.+. +++.. . T Consensus 143 ~~~~~a~~~~~~~~~~t~d~s~~a~~~~~----~~~dA~~~lgda~~Gd~~~~lt~i~mHS~v~~~L~~~---~li~~-i 214 (349) T protein:vir:78 143 NDNVSATDAYHEQNDMVVDVSATLGFDAG----AFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKA---QLIDF-I 214 (349) T ss_pred cccccccchhhhcccceeeeccccCCChh----hhhhhHHHHHHHhccccccceeEEEEchHHHHHHHhh---hhhhh-c Confidence 21100 001111112222222223433 4555555555441 112225788999999998653 44432 1 Q ss_pred ccccccccccceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccc-eeeeeec Q lcl|Aclame:pro 221 GNSQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAP-TIQTTSG 299 (332) Q Consensus 221 ~~~~~~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~-~~e~~~~ 299 (332) ..++ ... .+..++|.+|+.+..+|..+.. .... ..+.+|-+=|++.....+. .+|.-|+ T Consensus 215 ~~s~---~~~-~i~ty~G~~VivDD~~Pv~~~g---------~~~~-------yttylfg~GAi~~~~~~~~~~~et~rd 274 (349) T protein:vir:78 215 RDAE---NNT-MFATYQGYRVIVDDSMTVVGQG---------AQRK-------FISIIFGQGAIGYGEGNPVMPLEYERE 274 (349) T ss_pred cCcc---cCc-ccceecCeEEEEeCCCccccCC---------CCce-------EEEEEeecceEEEccCCCccceeeecc Confidence 1222 122 3889999999999999964311 1111 1224444444444443322 2444444 Q ss_pred ccch------hHHHHHHHHHHHhCCceechhhe-------------eeeecC Q lcl|Aclame:pro 300 DFNV------QYQGDLIVGKLAMGCGSLRTSVA-------------GSFQAA 332 (332) Q Consensus 300 ~~~~------~~~~d~i~~~~~~G~~vlrpe~~-------------v~i~~A 332 (332) .... ..+...-..+|.+|.+-..+... -+|.++ T Consensus 275 ~~~g~~~G~d~l~~R~~~~~hp~G~s~~~a~v~~~~~~~~~~sPt~aeLa~~ 326 (349) T protein:vir:78 275 ASRANGGGVETLWTRKTWLLHPFGYRFTSAVITGNGTETIARSASWQDLANA 326 (349) T ss_pred cccCCcceeEEEEEeeEEEeeeeeeeeccccccCCccccccCCCChHHhcCC Confidence 3211 11111222346666666543211 112222 No 188 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=86.34 E-value=0.046 Score=27.90 Aligned_cols=285 Identities=11% Similarity=0.097 Sum_probs=143.7 Q ss_pred CCCcccccccccccccccccccCchhhHH-HHHHhHHHHHHHHHhhhhc--cccccc-cc-----cccceEEEeccccee Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATA-LKLFSGEVFTAFNNASIFK--GLVRSY-DL-----RGGKSKQFMFTGKLS 71 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~-~e~f~g~V~~~f~~~s~~~--~~v~~r-~~-----~~G~tv~i~~iG~~t 71 (332) |+. ||.. + . =.| +|.|..+|.+.-.+.+-|. +.+..+ .+ .+|+.+.+|..+... T Consensus 1 Ma~----T~l~------D---~----iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~l~ 63 (349) T protein:vir:94 1 MAI----TTIG------N---I----VTGNIPVLASYMTEDPVEKTAFFNSGILTPTPYAAEIARGPSNIANLPFWKAID 63 (349) T ss_pred CCc----eEEe------e---e----eccChHHHHHHHHHhHHHhhhhhhccceeccHHHHHHHhcCCCEEEeeeeecCC Confidence 663 4431 1 0 011 3578888887776654443 222211 12 569999999887643 Q ss_pred ee-e--ecCCCC---CCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 72 AG-Y--HTPGTP---IVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKA 145 (332) Q Consensus 72 ~~-~--~~~g~~---~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~a 145 (332) -. + |.-.++ ++ +..++..+. .-+=..+-.+|...|+-..-+--|+|..+.++.+.--.+...+.++..|... T Consensus 64 g~~e~n~~~dt~~~~~t-~~kit~~~~-~a~~~~r~kaw~~~Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gv 141 (349) T protein:vir:94 64 TSIEPNYSNDVYQDIAT-PRAIQTGEM-MARVAYLNEGFGQADLTVELTSQNPLQSVASRLDNFWQRQAQRRLIATALGL 141 (349) T ss_pred CCcccccCCCCcccccc-cccccccce-eeeeeeeccccchhHHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHHHhh Confidence 22 2 221111 22 233443332 2333345567888888888777799999999999888888888887776543 Q ss_pred hhhc---cccccccccceeccccccccCHHHHHHHHHHHHHHHHhcCC--CcCC-CEEEEChHHHHHHHhhcCchhhccc Q lcl|Aclame:pro 146 SAEA---SPVTGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSA--PQEG-RVAVLSPRQYYSLISSVDTNILNRE 219 (332) Q Consensus 146 a~~~---~~~~~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~V--P~~g-R~~vv~P~~~~~Ll~~~d~~~~~~d 219 (332) .... .............+++....++.. +.+|..+|..+-- ..+. ..+++-+..|..|.+. +++.. T Consensus 142 f~~~~~~~~~~~~~~~~~~d~~~~a~~~~~~----~~~A~~~~Gdaa~Gd~~~~lt~i~mHS~v~~~L~~~---~li~~- 213 (349) T protein:vir:94 142 YNDNVSATDAYHEQNDMVVDVSATSGFDAGA----FIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKA---QLIDF- 213 (349) T ss_pred hcccccccccccccCceeEEecccCCCChhh----HHHHHHHHHHHhccccccceeEEEEchHHHHHHHhc---chhhh- Confidence 3221 111111111222233333344444 4445545444311 1122 5778999999998653 44432 Q ss_pred cccccccccccceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccc-eeeeee Q lcl|Aclame:pro 220 IGNSQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAP-TIQTTS 298 (332) Q Consensus 220 ~~~~~~~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~-~~e~~~ 298 (332) ...+++ .. .+..++|.+|+.+..+|..... ..+ . ....+|-+=|++.....+. .+|.-| T Consensus 214 i~~s~~---~~-~i~ty~G~~VivDD~~Pv~~~g------~~~---~-------yttylfg~GAi~~~~~~~~~~~E~~r 273 (349) T protein:vir:94 214 IRDAEN---NT-MFATYQGYRVIVDDSMTVVGQD------TSR---K-------FISIIFGQGAIGYGEGNPEMPLEYER 273 (349) T ss_pred ccCccc---Cc-ccceecCcEEEEeCCCccccCC------CCc---e-------EEEEEeecceEEeecCCCCcceeeec Confidence 112221 22 3889999999999999964311 111 1 1223444444444444322 244444 Q ss_pred cccch------hHHHHHHHHHHHhCCceechhhe-------------eeeecC Q lcl|Aclame:pro 299 GDFNV------QYQGDLIVGKLAMGCGSLRTSVA-------------GSFQAA 332 (332) Q Consensus 299 ~~~~~------~~~~d~i~~~~~~G~~vlrpe~~-------------v~i~~A 332 (332) +.... ..+...-..+|.+|.+-..+... -+|.++ T Consensus 274 d~~~g~~~G~d~L~~R~~~~~hp~G~s~~~a~v~~~~~~~~~~sPt~aeLa~~ 326 (349) T protein:vir:94 274 EASRANGGGVETLWTRKTWLLHPFGYSFTSAVITGNGTETIARSASWQDLANA 326 (349) T ss_pred ccccCCcceeEEEEEeeEEEeeeeeeeecccccCCCccccccCCCChHHhcCC Confidence 33211 11111122346666666543211 122233 No 189 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=85.41 E-value=0.053 Score=27.58 Aligned_cols=288 Identities=10% Similarity=-0.028 Sum_probs=127.4 Q ss_pred CCCcc-c---c----cccccccccccccccCchhhHHHHHHhHHHH----HHHHHhhhhcccccccc-cccc-ceEEEec Q lcl|Aclame:pro 1 MTTLS-N---F----SLPNQANGGARNADYDVRYATALKLFSGEVF----TAFNNASIFKGLVRSYD-LRGG-KSKQFMF 66 (332) Q Consensus 1 m~~~~-~---~----~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~----~~f~~~s~~~~~v~~r~-~~~G-~tv~i~~ 66 (332) |+++. + + ++..+ .+=.+.+.+ ..+.|+.+...+++ +.....-..+.++..++ +--+ .++.+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~--~~~~~da~~-~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~ 77 (319) T protein:vir:10 1 MTTKKFDEADKSNVEMYLIQ--AGVKQDAAA-TMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVTTELSPTDKTFEYMT 77 (319) T ss_pred CCCcchhHHhhHHHHHHHhh--ccchhhhhh-hhhhHHHHHHHHHHHHHHhhhhcceechhhcccccCCCCceEEEEeee Confidence 55432 1 0 11111 111111111 12356543333444 44444455566666553 2223 3454443 Q ss_pred ---ccceee-eeecCCCCCCccCCCCCceEEEEEeee-eecchhhhhHHHHH-hchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 67 ---TGKLSA-GYHTPGTPIVGDAGIKANEKTLVMDDL-LVSSQFVYSLDEIF-SQYSTRAEVSKQIGEALATHYDERIAR 140 (332) Q Consensus 67 ---iG~~t~-~~~~~g~~~~~~~~~~~~~~~l~ID~~-~~~~~~Idd~D~~q-~~~d~~~~~~~~~~~aLa~~~D~~i~~ 140 (332) +|.++. .++. +++.. .+++-++....|-.+ .-+.+.+.++..++ ...++-..-...+..++++..|+.++- T Consensus 78 ~~~~G~a~~~~d~~--~dip~-v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~ 154 (319) T protein:vir:10 78 FDKVGTAQIIADYT--DDLPL-VDALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQLVNRLVFK 154 (319) T ss_pred eccccceeeecCcc--ccccc-eeccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEe Confidence 355443 2332 22221 123344555555442 33445567777775 467777777888999999999987763 Q ss_pred HHHHHhhhcccccc---ccccceecccc---ccccCHHHHHHHHHHHHHHHHhc--CCCcCCCEEEEChHHHHHHHhhcC Q lcl|Aclame:pro 141 VLAKASAEASPVTG---EPGGFHVNIGA---GNTNDAQAIVDGFFEAAAVLDER--SAPQEGRVAVLSPRQYYSLISSVD 212 (332) Q Consensus 141 ~~~~aa~~~~~~~~---~~~~~~i~~~~---~~~~~~~~~~d~i~~a~~~Lde~--~VP~~gR~~vv~P~~~~~Ll~~~d 212 (332) =.. ...+.| .++-.....+. ..+.+++.+++.|..+..+|..+ .+ ...-.++|+|+.|..|.. .. T Consensus 155 G~~-----~~g~~GLlN~p~~~~~~~~~~~~~~t~t~~~i~~di~~~~~~l~~~s~g~-~~p~~L~L~p~~~~~L~~-~~ 227 (319) T protein:vir:10 155 GSA-----PHKIVSVFNHPNITKITSGKWIDVSTMKPETAEAELTQAIETIETITRGQ-HRATNILIPPSMRKVLAI-RM 227 (319) T ss_pred ecc-----cccceeEEeCCCceeeecCCCCCccccCHHHHHHHHHHHHHHHHHhcCce-eeceEEEecHHHHHhhhc-cc Confidence 211 111111 11111111111 12235778899999998888754 22 112368899999987742 11 Q ss_pred chhhccccccccccccccceeee-eeceEEEeeCcccccccccccccccccccccccccccceEEEeec-hhhhhhhhhc Q lcl|Aclame:pro 213 TNILNREIGNSQGDMNSGKGLYS-IAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFH-REAAGCIQSV 290 (332) Q Consensus 213 ~~~~~~d~~~~~~~~~~g~~v~~-i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h-~~a~~~~~~~ 290 (332) + +. + +...+.+.+ .-+.+|...+.|....+.+. ...-.|.-+-. .+.+.++ +--..-++.+ T Consensus 228 ~---~~-----~--~t~l~~lk~~~~~l~I~~~pel~~ag~~g~------~~~v~y~~~~~-~~~~~v~~~~~~~~~e~~ 290 (319) T protein:vir:10 228 P---ET-----T--MSYLDYFKSQNSGIEIDSIAELEDIDGAGT------KGVLVYEKNPM-NMSIEIPEAFNMLPAQPK 290 (319) T ss_pred C---CC-----C--eeHHHHHHHhcCCceEEEeeeecccCCCcc------eEEEEEecCCc-eEEEecCcceeeeeeeec Confidence 1 11 1 111112222 23677887777764221100 00011111111 1111111 1011111222 Q ss_pred cceeeeeecccchhHHHHHHHHHH-HhCCceechhheeeeecC Q lcl|Aclame:pro 291 APTIQTTSGDFNVQYQGDLIVGKL-AMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 291 ~~~~e~~~~~~~~~~~~d~i~~~~-~~G~~vlrpe~~v~i~~A 332 (332) ++..+ +.... ..|.-+.||+++.-+.== T Consensus 291 ~l~~~--------------~~~~~r~~Gv~i~~P~ai~~~dGI 319 (319) T protein:vir:10 291 DLHFK--------------VPCTSKCTGLTIYRPMTIVLITGV 319 (319) T ss_pred CceEE--------------EeeeeeeEEEEEEccceeEeeecC Confidence 22111 11112 236888999975533333 No 190 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=84.49 E-value=0.06 Score=27.28 Aligned_cols=279 Identities=15% Similarity=0.068 Sum_probs=121.6 Q ss_pred CCCccccccccc-------------ccccccccccCchhhHHHHHHhHHHHHHHHHh---hhhcccc-------cc-ccc Q lcl|Aclame:pro 1 MTTLSNFSLPNQ-------------ANGGARNADYDVRYATALKLFSGEVFTAFNNA---SIFKGLV-------RS-YDL 56 (332) Q Consensus 1 m~~~~~~~r~~~-------------~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~---s~~~~~v-------~~-r~~ 56 (332) |++. .-.+|-| ..|+++.+ |+.-+-.+|.|..-+.....-. .+..+|+ .+ |.+ T Consensus 85 ~~~~-~r~~p~~~~veyRSaGE~lkal~~~~~G--d~~A~~~~e~~r~a~~~~~Tgd~~~~i~~~~v~d~i~li~q~r~i 161 (410) T protein:vir:83 85 AISA-MRGSPVGTEVEYRSAGEYMLDMWNSAQG--NASAADRLEVYARAADHQKTGDLQGVIPDPIVGPVIDFIDSARPL 161 (410) T ss_pred hhcc-CcCCCCCCCcccccHHHHHHHHhccCCc--hHHHHHHHHHHHHhhccCcccccccccchhHhhhHHHHHhhccch Confidence 5441 2233321 12222211 1110111222222111110000 0122222 00 111 Q ss_pred --------cccceEEEecc-cceeeeee-------cCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHH Q lcl|Aclame:pro 57 --------RGGKSKQFMFT-GKLSAGYH-------TPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRA 120 (332) Q Consensus 57 --------~~G~tv~i~~i-G~~t~~~~-------~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~ 120 (332) -.|.|..-+.. .++++.-+ +.|..++. ..+.....+-.|+.+--.. .+...---.++....+ T Consensus 162 ~slf~tLP~~g~T~eY~v~t~~~tV~~q~~~~kqa~EGd~L~~-gKl~~~t~tA~ikTyGGyt-~LSRQ~IERs~v~~L~ 239 (410) T protein:vir:83 162 VSTLGTLPLNNATFYRPIVSQRPAVGLQGVAGGASDEKTELDS-QKMVIDRLTVNAKTLGGYV-NVSRQAIDFSSPSALD 239 (410) T ss_pred hhhhhhCCCCCCeeEEeeecccccccccccccccccccccccc-cceeeeeccceeehhcCcc-cccceeeecCChhhHH Confidence 12556655433 23443222 24666654 3566677777787753222 2333222234555555 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccceeccccccccCHHHHHHHHHHHHHHHHhc--CCCcCCCEEE Q lcl|Aclame:pro 121 EVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDER--SAPQEGRVAV 198 (332) Q Consensus 121 ~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~--~VP~~gR~~v 198 (332) -.++-++.+-|+.....+=..+....... .+...++++.+...|.++....+.+ ++ .=+++. T Consensus 240 ~~lraL~~AYA~atea~vra~L~~t~t~~--------------~a~~~~Tad~~~~~i~da~~~v~da~~~~--~~~~i~ 303 (410) T protein:vir:83 240 LVVNGLGQQYAIETEALVGAALASTSTGA--------------VGYGNATADNVASAIWQAAGAVYTAVKGM--GRLVIA 303 (410) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhh--------------hhhhhccHHHHHHHHHHHHHHHhhhhccc--eeeeEE Confidence 55666666656666554444442221110 1122345677888888988888876 43 226788 Q ss_pred EChHHHHHHHhhcCchhhc--cccccccc--cccc-cceeeeeeceEEEeeCcccccccccccccccccccccccccccc Q lcl|Aclame:pro 199 LSPRQYYSLISSVDTNILN--REIGNSQG--DMNS-GKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASA 273 (332) Q Consensus 199 v~P~~~~~Ll~~~d~~~~~--~d~~~~~~--~~~~-g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~ 273 (332) |+|+.+-.+. ++|.+ .+.....| ..+- .+..|.++|++|.+++.+|..+ +.|-. T Consensus 304 vS~DVl~~~~----~~f~~~~~~~~dt~Gfg~~~lg~gi~G~~~~ipVvm~~~a~AgT-----------------A~f~~ 362 (410) T protein:vir:83 304 IAPDVLGDFG----PLFAPVNPTNAHSTGFEAGRFGQGVMGSISGIPVVMSAALGSGD-----------------AYLFS 362 (410) T ss_pred echhhhhhcc----ceeeccCCCCcccccccccccccchhhhhcccceEEecCCCcCe-----------------eeEec Confidence 9999976664 33432 22222212 1221 2246789999999999887421 11111 Q ss_pred eEEEeechhhhhhhhhccceeeeeecccchhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 274 LAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 274 ~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) ..+|-++.+.++...+.+. +....--.+.|.+ +..+.-|++++=+.-- T Consensus 363 ~~Ai~~~eS~~gp~qL~d~---------~i~nLt~~ySgY~--a~a~~~~~gliPv~g~ 410 (410) T protein:vir:83 363 TAAIECFEQRVGTLQVVEP---------SVFGLQVAYAGYF--STLVVNEDAIVPLVGS 410 (410) T ss_pred cceeeeeecCCceeEeeCC---------chhhhhhhheeee--eeccccccceeeeccC Confidence 2223333333333332221 1111111222333 3344555555544444 No 191 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=83.39 E-value=0.069 Score=26.95 Aligned_cols=268 Identities=11% Similarity=0.080 Sum_probs=99.7 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHh----HHHHHHHHH----h----------hhhcccccccccccc-ce Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFS----GEVFTAFNN----A----------SIFKGLVRSYDLRGG-KS 61 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~----g~V~~~f~~----~----------s~~~~~v~~r~~~~G-~t 61 (332) |.....-..|.. ..+ ..++.++ +.-...|.+ . ++.+.+.+...+... .+ T Consensus 171 ~~~~~~~~~~~~--~~~----------~e~r~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 238 (480) T protein:vir:40 171 REASIPSEKPED--AER----------KFMRELGSKMAEMPEQGFLREFANGADLNVVNSLGSITSKYARKSGIYDGAMK 238 (480) T ss_pred hhhhccccchhh--hhh----------HHHHHHHHHhccchhhhhhhhhhhhccccccccccccccchhhheeechhhhh Confidence 332111111110 000 0111111 100011110 0 011111110000000 00 Q ss_pred E-----EEecccce---eee-eecCCCCCCccCCCCCceEEEEEeee---eecc---hhhhhHHHHHhchhHHHHHHHHH Q lcl|Aclame:pro 62 K-----QFMFTGKL---SAG-YHTPGTPIVGDAGIKANEKTLVMDDL---LVSS---QFVYSLDEIFSQYSTRAEVSKQI 126 (332) Q Consensus 62 v-----~i~~iG~~---t~~-~~~~g~~~~~~~~~~~~~~~l~ID~~---~~~~---~~Idd~D~~q~~~d~~~~~~~~~ 126 (332) + .+...|.. .+. ....+..... . ..+ +..+... ++.. .....+|. ..++.+-+..+. T Consensus 239 ~~~~~~~~~~~g~~~~~~~~e~~~~~~~~~~-~--~~~--~~~~~~~~v~~l~~~~k~t~~lLDD---a~~l~~~i~~~l 310 (480) T protein:vir:40 239 ARFQGLTLAEDGVDDTFISGTFKAGTDKNKS-Q--TAT--KRSLRPQMAEAYLQMDKATVRGVND---SGALSEYVMSEM 310 (480) T ss_pred hhhhcceeeeccccceeeeeeeecccccccc-c--ccc--cchhhHHHHHHHHHhHHHHHHHhhh---hHHHHHHHHHHH Confidence 0 01111110 111 1111211110 0 001 1111110 0100 11111122 234777788899 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhccccccccccce-eccccccccCHHHHHHHHHHHHHHHHhcCCCcCCC-EEEEChHHH Q lcl|Aclame:pro 127 GEALATHYDERIARVLAKASAEASPVTGEPGGFH-VNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGR-VAVLSPRQY 204 (332) Q Consensus 127 ~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~~~~-i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR-~~vv~P~~~ 204 (332) ++.|+++.++.++.-- .....+ +++.. ...+.+....+..+++.|+.+..+-..+ +. .+|++|..| T Consensus 311 ~~~~~~~ee~a~l~G~------g~g~~~-~~g~~~~~~~~~~~~~~~d~id~L~~al~~~y~~-----~a~~~vmn~~t~ 378 (480) T protein:vir:40 311 VNRVIQKVEYNMILGS------VDGSNG-FYGLKTATDGWTKQIEYTDLFEGITDAVAECSIS-----DAITIVMSPQTF 378 (480) T ss_pred HHHHHHHHHHHhhccC------CCCccc-cccceeecccccccchhHHHHHHHHHhhhHHhhC-----CCCEEEECHHHH Confidence 9999999887765310 001111 11111 1111111222334444444433222222 33 578999999 Q ss_pred HHHHhhcCchhhccccccccccccccceeeeeeceEEEeeC-cccccccccccccccccccccccccccceEEEeechhh Q lcl|Aclame:pro 205 YSLISSVDTNILNREIGNSQGDMNSGKGLYSIAGIRILKSN-NLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREA 283 (332) Q Consensus 205 ~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i~G~~V~~sn-~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a 283 (332) ..|.+-||.+ ..|.- +..+..|. ...++|++|+.++ .+|.. . ...+..+. ..+++-+ T Consensus 379 ~~I~klKD~~---G~Yi~-q~~~~~~~-~~~llG~pvv~~~~~~~~~--~-----~~~~~~~~--------~~~~~d~-- 436 (480) T protein:vir:40 379 AELRKAKGTD---GHSRF-NELATKEQ-IAQSFGAVNLETRVWMPKD--E-----VAVYNHDE--------YVLIGDL-- 436 (480) T ss_pred HHHHHhhcCC---CCeec-cCcccccC-cceecccceeeeeccccCC--c-----ceeeeCCc--------cEEEEec-- Confidence 9987766542 22322 22334443 6789999988754 33321 0 01111111 1223322 Q ss_pred hhhhhhccceeeeeecccchhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 284 AGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 284 ~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) .++.+++ .+.++--..+....+.|..+.+|+++.-++.= T Consensus 437 ---------~~~~~~~-~~~~~~~~~~~~e~~v~g~~~~~~~~~~~~~~ 475 (480) T protein:vir:40 437 ---------NVENYND-FDLRYNVEQWLSETLVGGSIRGKNRSAYLKKK 475 (480) T ss_pred ---------ccceecc-cccccchhhhhhhhhhceeeEccccEEEEEec Confidence 2233322 22233335667777889999999987776655 No 192 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=80.91 E-value=0.091 Score=26.30 Aligned_cols=289 Identities=9% Similarity=-0.049 Sum_probs=126.0 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHH----Hhhhhcccccccc-cccc-ceEEEec---cccee Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFN----NASIFKGLVRSYD-LRGG-KSKQFMF---TGKLS 71 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~----~~s~~~~~v~~r~-~~~G-~tv~i~~---iG~~t 71 (332) |-.=+.+-+.......-+..+.|+..+.+.++.. .|+.... ..-..+.++..++ +..+ .++.+.. .|..+ T Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~d~~~~fl~~ql~-~id~~v~e~~~~~~~~~~~i~v~~~~~~~~et~~~~~~e~~G~a~ 81 (314) T protein:vir:10 3 IKFDAEQAKITTHLEQMGVEKADAAGIWAVSQLT-AALNRAYEKEYAENSVVNIFPVTNEIPGHAKYFEYPEFDGVGIAQ 81 (314) T ss_pred cchHHHHHHHHHHHHhhcccchhhhHHHHHHHHH-HHHHHHhhhhccccccceeeccccCCCCceeEEEeeeecccccee Confidence 3332333333221111112333433455555444 4444333 3455556666553 2222 3555443 34443 Q ss_pred e-eeecCCCCCCccCCCCCceEEEEEeee-eecchhhhhHHHHHh-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|Aclame:pro 72 A-GYHTPGTPIVGDAGIKANEKTLVMDDL-LVSSQFVYSLDEIFS-QYSTRAEVSKQIGEALATHYDERIARVLAKASAE 148 (332) Q Consensus 72 ~-~~~~~g~~~~~~~~~~~~~~~l~ID~~-~~~~~~Idd~D~~q~-~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~ 148 (332) . .++. +++.. .+++-++....|-.+ .-+.+.+.++..++. ..++-..-...+..++++..|+.++-=- + T Consensus 82 ~~~d~~--~dip~-vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~---~-- 153 (314) T protein:vir:10 82 IIADYS--DDLPL-VDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDKLVWSGS---A-- 153 (314) T ss_pred eeCCcc--cccce-eecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeec---c-- Confidence 2 2332 22322 133444555555443 223444566666643 6677777778888888888888665211 1 Q ss_pred cccccc---ccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCc-CCCEEEEChHHHHHHHhhcCchhhcccccccc Q lcl|Aclame:pro 149 ASPVTG---EPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQ-EGRVAVLSPRQYYSLISSVDTNILNREIGNSQ 224 (332) Q Consensus 149 ~~~~~~---~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~-~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~ 224 (332) ...+.| .|+-..+. +.++=.+++.+++.|..+..+|.++.-=. ..-.++|+|+.|..| ....+ +.+. T Consensus 154 ~~g~~GLlN~p~v~~~~-~~~~WaT~~ei~~Di~~~~~~l~~~s~g~~~p~~l~Lpp~~~~~L-~~~~~---~~~~---- 224 (314) T protein:vir:10 154 PHGIVSVFDQPNINNVV-ATPNWSVPQNAIDDVTAMIDAVESSTQGLHHVTDILLPASARRVM-QGLVP---QTNL---- 224 (314) T ss_pred cccceeEeecCCCcccc-CCCCcccHHHHHHHHHHHHHHHHHhcCccccceeEEecHHHHHhh-ccccc---CCCc---- Confidence 111111 11100011 11122357888999999999998752100 112578999988655 21111 1111 Q ss_pred ccccccceee-eeeceEEEeeCcccccccccccccccccccccccccccceEEEeech--hhhhhhhhccceeeeeeccc Q lcl|Aclame:pro 225 GDMNSGKGLY-SIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHR--EAAGCIQSVAPTIQTTSGDF 301 (332) Q Consensus 225 ~~~~~g~~v~-~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~--~a~~~~~~~~~~~e~~~~~~ 301 (332) .+.+ .+. +--+++|...+.|-..++.+ ....-.|..+.. .+.+.++. ..+ -++.+++.. T Consensus 225 -tvl~--~l~~n~~~l~I~~~~el~~ag~~g------~~~~v~y~~~~~-~~~~~vp~~~~~l-~~e~~~~~~------- 286 (314) T protein:vir:10 225 -SYGE--LFTRNNPGLTIRFLQFLDNYDGAG------GKAALAFEKSPL-NMSIEIPEVTNVL-PAQPKDLHF------- 286 (314) T ss_pred -cHHH--HHHHhCCCcEEEEcccccccCCCc------ceEEEEEecCCc-EEEEecCccceee-cceecCceE------- Confidence 1111 111 12366777777665321110 000011211111 11122211 111 122222211 Q ss_pred chhHHHHHHHHHHH-hCCceechhhee---eeecC Q lcl|Aclame:pro 302 NVQYQGDLIVGKLA-MGCGSLRTSVAG---SFQAA 332 (332) Q Consensus 302 ~~~~~~d~i~~~~~-~G~~vlrpe~~v---~i~~A 332 (332) .+....+ .|.-+.||.++. -|.=| T Consensus 287 -------~~~~~~r~~Gv~i~~P~ai~~~dGI~~~ 314 (314) T protein:vir:10 287 -------RYPVTSKATGLIVYRPLTMAVIKGITFA 314 (314) T ss_pred -------EEcceeeeEEEEEECcceeEeeeeeecC Confidence 1112223 378899999877 44445 No 193 >protein:vir:4786 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:3269 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150166;swissprot:trembl:q94m45;genbank:gi:15088777;uniprot:Q94M45;genbank:GeneID:955980 Probab=80.25 E-value=0.097 Score=26.15 Aligned_cols=270 Identities=13% Similarity=0.087 Sum_probs=116.1 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhcccccc----ccccccceEEEeccc--ceeeee Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRS----YDLRGGKSKQFMFTG--KLSAGY 74 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~----r~~~~G~tv~i~~iG--~~t~~~ 74 (332) |...+| .+ ---|-|+|.|...+-|+.++.|++..-- .-+.+.++.---... .+-++. T Consensus 1 mp~N~n----------~a-------vr~Y~Kqf~glL~~vf~~qa~F~~~FGglQalDGV~~N~tafsvKt~D~pVVig~ 63 (295) T protein:vir:47 1 MPSNQN----------NA-------VRRYEKQYAGILETVFGVRAAFSNALAPIQILDGVQENSKAFSVKTNNTPVVIGE 63 (295) T ss_pred CCCCCC----------cc-------chhhhHHHHHHHHHHHhHHHHHhhhhcchhhhhCCCccceEEEEeecCcceEeec Confidence 433111 11 1158899999999999999999866531 122222221111111 123345 Q ss_pred ecCCCCCCc------c-CCCCCceEEEEEeee-ee-cchhh-hhHHHHHhchhHHHH---HHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 75 HTPGTPIVG------D-AGIKANEKTLVMDDL-LV-SSQFV-YSLDEIFSQYSTRAE---VSKQIGEALATHYDERIARV 141 (332) Q Consensus 75 ~~~g~~~~~------~-~~~~~~~~~l~ID~~-~~-~~~~I-dd~D~~q~~~d~~~~---~~~~~~~aLa~~~D~~i~~~ 141 (332) |+.+....+ . .....-+-.+-.|+. .| +.+.| .-+|..-.+-|+-.. ....++.|-.+.+|..+-.. T Consensus 64 Y~TdeNvagFGtGTg~SsRFG~rkEi~y~dtdV~Y~~~~~iHEGiD~~TVNnd~~aaVAdRL~LQA~Akt~~~n~~~Gk~ 143 (295) T protein:vir:47 64 YKTGENDGGFGDNSGAQSRFGGVTEVKYENTDVNYDYTLTIHEGLDRYTVNNDLNAAVADRLKLQSEAQTRTVNKRIGKY 143 (295) T ss_pred ccCCCcccccccCCccccccCceeeEEeecccccccccchhhhccccccccCChhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 554443321 0 001111222333332 22 23333 346666556665444 45567888888898766555 Q ss_pred HHHHhhhccccccccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccc Q lcl|Aclame:pro 142 LAKASAEASPVTGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIG 221 (332) Q Consensus 142 ~~~aa~~~~~~~~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~ 221 (332) +...+...... ...+.+.+..-+-.|.++.-..+|-..-| +.|.|+.|..|+.. +..+.. - T Consensus 144 ls~~A~~te~~--------------td~t~d~V~~LF~~as~~yvn~ev~~~~~-AyV~~evYnaiiD~--~l~Tsa--K 204 (295) T protein:vir:47 144 LSDTATKTEAL--------------ADFTDDKVKALFNKLSAFYTNNEVTAPIT-VYLRSEFYNAIVDM--ASVTSA--K 204 (295) T ss_pred HHhhhhhhhhh--------------hcccchhHHHHHHHHHHHhhhhheeeeeE-EEEchhHHHHHhcc--cccccc--c Confidence 54333211100 11112333344445555565566633333 88999999999863 434433 2 Q ss_pred cccccccccceeeeeeceEEEeeCcccccccccccccc------cccccccccccccceEEEeechhhhhhhhhccceee Q lcl|Aclame:pro 222 NSQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAA------VTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQ 295 (332) Q Consensus 222 ~~~~~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~------~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e 295 (332) ++...+.+.+ +.+.-||.|-+.+.--..+|.-...++ -.|.+..=.-+..+..|+.++-- +-.+-+.-++++ T Consensus 205 ~SsaNiDeng-i~~FkGf~i~e~P~~~~q~G~~aifs~dnig~aftGIn~aR~IesEdF~GValQ~~-~~~~~~~~~~~~ 282 (295) T protein:vir:47 205 GATISLDENG-LPKYKGFTLEETPAQYFETGVIAIFSPNGIIIPFVGISTARVIEAENFDGVNCKLL-LRVVLTLLMTIR 282 (295) T ss_pred cceeeeccCC-cceecceEEEeccHhhccCCcEEEEccccceeecccceeeeeeecccccchHHHHH-HHHHHHHHHHHH Confidence 3444555554 788889999886643222222221111 11111111111111111111100 000000000000 Q ss_pred eeecccchhHHHHHHHHHHHhCCceec Q lcl|Aclame:pro 296 TTSGDFNVQYQGDLIVGKLAMGCGSLR 322 (332) Q Consensus 296 ~~~~~~~~~~~~d~i~~~~~~G~~vlr 322 (332) + .|..+-- +.|- | T Consensus 283 -------~-~~~~~~~--~~~~----~ 295 (295) T protein:vir:47 283 -------K-QFTKLQE--LLYR----R 295 (295) T ss_pred -------H-HHHHHHH--Hhhc----C Confidence 0 0100000 0111 1 No 194 >protein:vir:79078 Length: 307 # NCBI annotation: gp8 # Family: family:all:908 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111208;genbank:gi:134288798;genbank:GeneID:4960752 Probab=79.27 E-value=0.11 Score=25.93 Aligned_cols=287 Identities=11% Similarity=0.028 Sum_probs=106.3 Q ss_pred CCCcccccccccccccccccccCchh-hHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEeccccee--eee--e Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRY-ATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLS--AGY--H 75 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~-al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG~~t--~~~--~ 75 (332) |+.+. -.|| +|..+ ++.+.-+.+ .|--..+| +.+. ....+.+++..|+-. +.+ . T Consensus 1 m~~~~-~~~~-----------~dp~LT~~A~gy~n~----~~Iad~lf-P~vp----V~~~~~k~~~f~~e~f~~~~t~r 59 (307) T protein:vir:79 1 MGRLS-KLRI-----------VDPVLTNLAIGYTNA----EFIGQTLM-PVVE----VEKEGGKIPKFGKESFRLYQTER 59 (307) T ss_pred CCCCC-CCcc-----------cCHHHHHHHhhccch----hhhhhhcC-Cccc----ccccccceeeecccccccccccc Confidence 55532 1222 11111 111111111 12222222 2222 223333444444311 111 1 Q ss_pred cCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 76 TPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGE 155 (332) Q Consensus 76 ~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~ 155 (332) .++......+....+..++.+++.-. ...||+.+...+.+|++....+.....+.+..+-.+...+..+... T Consensus 60 a~~~~~~~v~~~~~~~~~~~~~~~~l-~~~id~r~~~~~~~~~~~~Av~~l~d~I~l~~E~~~A~l~~~~~~y------- 131 (307) T protein:vir:79 60 ALRAKSNRMNPEDIDSVDVNLDEHDL-EYPIDYREDQESAFPLEQAAVQTATDAIQLRREKMIADLSQNPSSY------- 131 (307) T ss_pred ccCCCcceeeeeccccccccccccch-hhcccchhcCCCCCCHHHHHHHHHHHHHHhHHHHHHHHHhcccccc------- Confidence 23333322211122345566666433 3457777777888888776655555555555554444444332221 Q ss_pred cccceecccccccc--CHHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhcccccccccccccccee Q lcl|Aclame:pro 156 PGGFHVNIGAGNTN--DAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGL 233 (332) Q Consensus 156 ~~~~~i~~~~~~~~--~~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v 233 (332) +.++.+.++.++.. +....+..|.++.+.+.+..- ...-.+++++..|..|+. |++++++-.....+ +..-+.+ T Consensus 132 ~~~~k~tLsgt~~Wsd~~sDPi~di~~~~~ai~~~~g-~~Pn~~vlg~~a~~~l~~--h~~i~~~lk~~~~g-~it~~~l 207 (307) T protein:vir:79 132 AAGNKKQLSATEKFTAANSDPVGVIEDGKEAIRTKIG-RRPNTMVIGASAYKTLKA--HPQLIEKIKYSMKG-IVTVDLL 207 (307) T ss_pred CCCceEEEccCcccCCCCCCcHHHHHHHHHHHHHhhC-CccceEEeCHHHHHHHhc--CHHHHHHhcCcccc-ccCHHHH Confidence 22233333322111 112235556677777666533 233578899999999975 58888765443333 3333346 Q ss_pred eeeeceEEEee-CcccccccccccccccccccccccccccceEEEeechhhhhhhh------hccceeeeeecccchhHH Q lcl|Aclame:pro 234 YSIAGIRILKS-NNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQ------SVAPTIQTTSGDFNVQYQ 306 (332) Q Consensus 234 ~~i~G~~V~~s-n~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~------~~~~~~e~~~~~~~~~~~ 306 (332) ..++|+.-+.. ...-..... ...--..+.+.|.+.+.+.+... +.+.+.+.-....-..++ T Consensus 208 a~l~~v~~V~vg~a~y~~~~~------------~~~~iw~~~~~l~y~~~~~~~~~~~~~~ps~Gyt~~~~g~~~~d~~~ 275 (307) T protein:vir:79 208 KEIFEVENIAVGEAIYADDKD------------RFTDIWGANIVLAYVPLQRGGQQRTPYEPSYGYTLRKKGNPVVDTRI 275 (307) T ss_pred HHHhCceeEEEeeeeeecccc------------cchhcCCCceEEEecccccCCCCCcccccccceeEEecCceEEeccc Confidence 66777763322 111000000 00000011122222222111110 011111100000000000 Q ss_pred ----HHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 307 ----GDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 307 ----~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) ++.|+.....=-.++=||+---|+-| T Consensus 276 ~~~~~~~vrv~~~~~~~i~~~~~G~li~~~ 305 (307) T protein:vir:79 276 EDGKLELVRATDIFRPYLLGADAGYLISGI 305 (307) T ss_pred CCCceeEEeecccccceeeccccchhhccC Confidence 00000000000011112211122222 No 195 >protein:vir:94528 Length: 286 # NCBI annotation: major head protein # Family: family:all:3269 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223889;genbank:gi:62327101;genbank:GeneID:5075544 Probab=77.34 E-value=0.13 Score=25.52 Aligned_cols=260 Identities=16% Similarity=0.149 Sum_probs=120.9 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhcccccc----ccccccceEEEeccc--ceeeee Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRS----YDLRGGKSKQFMFTG--KLSAGY 74 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~----r~~~~G~tv~i~~iG--~~t~~~ 74 (332) |..-++ - -+ + | .|-|+|.|.+.+-|+.++.|++..-- .-+.+.++.---... .+-+.. T Consensus 1 m~t~N~-n--------~a---v--r--~Y~Kqf~glL~~vf~~qa~F~~~fgglQalDGV~~N~tafsvKt~D~pVVig~ 64 (286) T protein:vir:94 1 MATTNN-D--------LP---V--R--VYSKEFLQLLSTVYQAQSVFTPTFGALQALDGVPNNATAFSVKTNDMAVVVGE 64 (286) T ss_pred CCCCcc-c--------cc---e--e--ehhHHHHHHHHHHHhhHHHhhhhhcchhhhhCCCccceEEEEeecCcceEEec Confidence 544110 0 00 1 1 58899999999999999999876531 122222221111111 123344 Q ss_pred ecCCCCCCc------cCCCCCceEEEEEeee-ee-cchhh-hhHHHHHhchhHHH---HHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 75 HTPGTPIVG------DAGIKANEKTLVMDDL-LV-SSQFV-YSLDEIFSQYSTRA---EVSKQIGEALATHYDERIARVL 142 (332) Q Consensus 75 ~~~g~~~~~------~~~~~~~~~~l~ID~~-~~-~~~~I-dd~D~~q~~~d~~~---~~~~~~~~aLa~~~D~~i~~~~ 142 (332) |..+..... +.....-+-.+-.|+. .| +.+.| .-+|..-.+-|+-. +....++.|-.+.+|..+-..| T Consensus 65 Y~TdeNv~FGtgTg~SsRFG~rkEi~y~dtdV~Y~~~~~iHEGiD~~TVNnd~~aaVAdRL~lQA~Akt~~~n~~~Gk~l 144 (286) T protein:vir:94 65 YSTDANTAFGTGTSNSSRFGEMKEVIYADTDVPYTAGWAIHEGLDQMTVNNDLDAAVADRLNLQAQAKTRLFNVAMGEAL 144 (286) T ss_pred ccCCCccccccCCccccccCceeeEEeecccccccccchhhhccccccccCChhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 554322211 0011111222333332 22 22333 34566555666544 4455678888888887665444 Q ss_pred HHHhhhccccccccccceeccccccccCHHHHHHHHHHHHHHHHhcC----CCcCCCEEEEChHHHHHHHhhcCchhhcc Q lcl|Aclame:pro 143 AKASAEASPVTGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERS----APQEGRVAVLSPRQYYSLISSVDTNILNR 218 (332) Q Consensus 143 ~~aa~~~~~~~~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~----VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~ 218 (332) ..++.. ...+|.+..+..+|.+.. |-.. .-+.|.|+.|..|+.. +..+.. T Consensus 145 s~~A~~-----------------------t~~~D~V~~LF~~as~~yvn~ev~~~-~~ayV~~evYnaiiD~--~l~Tsa 198 (286) T protein:vir:94 145 ATAGTD-----------------------LGAVDDVNALFESAVEKYTDLEVIAP-VRAYVTASVYNAIIDL--ANVTTA 198 (286) T ss_pred Hhhhhh-----------------------hhhhhhHHHHHHHHHHHhhhhheeee-eEEEEchhHHHHHhcc--cccccc Confidence 322211 001244445555555444 4222 3388999999999863 434433 Q ss_pred ccccccccccccceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeee Q lcl|Aclame:pro 219 EIGNSQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTS 298 (332) Q Consensus 219 d~~~~~~~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~ 298 (332) -++...+.+.+ +.+.-||.|-+.+. -.-.|.. .+|.++-++.+=+ -+...| T Consensus 199 --K~SsaNiDeng-i~~FkGf~i~e~P~-~~~~g~~----------------------aifs~dnig~aft---GIn~aR 249 (286) T protein:vir:94 199 --KNSAVNIDTNG-MLSFRGIAITKVPT-QYMGGKA----------------------VIFAPDNVARVFT---GINIAR 249 (286) T ss_pred --ccceeeeccCC-cceecceEEeecch-hhccCce----------------------EEEccccceeeec---cceeee Confidence 23444555554 78888999988763 1111111 1222222211100 001111 Q ss_pred cccchhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 299 GDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 299 ~~~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) .-.++.+-|-.+.|---||--++.-.. .+|.++ T Consensus 250 ~IesEdF~GValQgAGK~G~~I~edNk-~Ai~~~ 282 (286) T protein:vir:94 250 TIQAIDFAGVELQGAGKYGTFILDDNK-KAIFTA 282 (286) T ss_pred eeeccccCceeeeccccccccccccCc-eeEEEe Confidence 111222223344444455655555444 445555 No 196 >protein:vir:5942 Length: 523 # NCBI annotation: similar to major head protein # Family: family:all:364 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835728;genbank:gi:30044131 Probab=70.59 E-value=0.21 Score=24.33 Aligned_cols=307 Identities=13% Similarity=0.012 Sum_probs=125.7 Q ss_pred CCCc------ccccccccc------ccccccccc--CchhhHHHHHHhHHH---HHHHHHhhhhccccccccccccceEE Q lcl|Aclame:pro 1 MTTL------SNFSLPNQA------NGGARNADY--DVRYATALKLFSGEV---FTAFNNASIFKGLVRSYDLRGGKSKQ 63 (332) Q Consensus 1 m~~~------~~~~r~~~~------~~~~~~~~~--d~~~al~~e~f~g~V---~~~f~~~s~~~~~v~~r~~~~G~tv~ 63 (332) |++. ..-++.++. .|+-....+ +.-.+..++.|.+.+ -++|..+.....-........|..-. T Consensus 162 ~s~si~k~~vTa~s~agta~~~li~A~~~q~itg~tga~fa~s~~~an~astAss~Al~gEA~t~~sTd~at~~~Gtt~t 241 (523) T protein:vir:59 162 SSGAVYYVDVPVASLPGVADVNTVRFWQYDDASGDPENTVAYPLPRYNRIVGAVGSALYARLFFVTGSDFATVAGGTPST 241 (523) T ss_pred cccceeeeeccccccccccccccccccccccccccccccccchhhccccccccccccccccccccccccccccCCCcccc Confidence 3331 011122110 000000000 001122233333221 11211111100000001111111000 Q ss_pred -----Eecccc--eeeeeecCCC-CCCccCCCCCceEEEEEeeeee--------cchhhhhHHHHHh-c--hhHHHHHHH Q lcl|Aclame:pro 64 -----FMFTGK--LSAGYHTPGT-PIVGDAGIKANEKTLVMDDLLV--------SSQFVYSLDEIFS-Q--YSTRAEVSK 124 (332) Q Consensus 64 -----i~~iG~--~t~~~~~~g~-~~~~~~~~~~~~~~l~ID~~~~--------~~~~Idd~D~~q~-~--~d~~~~~~~ 124 (332) ...++. .+..--..+. ...+.......+.-+.||++.. +...+.-..+.++ | .|.-.|++. T Consensus 242 ~~~~~lyt~~~g~~t~~~~~~~~~~~~~~~~~~~~eM~FsIeK~tVtAkSRaLKAeYT~ELAQDLKAiH~GLDAE~ELan 321 (523) T protein:vir:59 242 QDLDLVYYIDARNDFEDQSTDPDYPDPGFQSLDIPEINLELRSRPVATKTRKLRAAWTPEAMQDLAAYHKGVDLENEIVT 321 (523) T ss_pred cccccccccccccchhhccccccccccccccccccceeeEEEeEEEeeecccccccccHHHHHHHHHHhcCCChhHHHHH Confidence 011110 0000000000 0001223345678888887643 2344554555566 3 889999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhccccc-cccccceeccccccccCHHH----HHHHHHHHHHHHHhcC--CC-----c Q lcl|Aclame:pro 125 QIGEALATHYDERIARVLAKASAEASPVT-GEPGGFHVNIGAGNTNDAQA----IVDGFFEAAAVLDERS--AP-----Q 192 (332) Q Consensus 125 ~~~~aLa~~~D~~i~~~~~~aa~~~~~~~-~~~~~~~i~~~~~~~~~~~~----~~d~i~~a~~~Lde~~--VP-----~ 192 (332) =++..+..++++.|++.+...+.-..... ...+-..+.........+-+ .++++..+..++++.. +- - T Consensus 322 ILStEImlEINR~ii~~~~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~n~i~~~t~~~ 401 (523) T protein:vir:59 322 LMSQYIAREIDLEILSTIMAHARRTDNYGFWSEVVGEYYDETSGNFVAGNFYGSKQEWLATLMIELNKVSNRIQQKTAVA 401 (523) T ss_pred HHHHHHHHHhhHHHHHhHhhhheeeeeccccccceeeecccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHhcccc Confidence 99999999999999998875543211111 11111111110000001111 1334444433433211 21 2 Q ss_pred CCCEEEEChHHHHHHHhhcCchhhccccccccccccccc-eeeeee-ceEEEeeCccccccccccccccccccccccccc Q lcl|Aclame:pro 193 EGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGK-GLYSIA-GIRILKSNNLAGLYGQDLSSAAVTGENNDYQVD 270 (332) Q Consensus 193 ~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~-~v~~i~-G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~ 270 (332) .+-|+|++|++...|-.+ +-+..+.. ......|. .+|.+. |++||.-++-|. .+..-+.-|....|. T Consensus 402 ~~~~~~~s~~v~~~l~~~--~~~~~~~~---~~~~~~~~~~~g~l~~~~~vy~d~~~~~----dy~~~g~k~~~~~~~-- 470 (523) T protein:vir:59 402 GANFLVTSPQVAALLESM--PGFTPGND---NRDGGTGIFYVGMVQGRYRLYKNIYQNQ----PVIIMGNQDLNTPWQ-- 470 (523) T ss_pred cccEEEEchhHHHHHHhc--cccccCCc---cccccccceeEEEecCceEEEecCCCCc----ceEEEEecccCCccc-- Confidence 457999999998776332 33321111 01111111 234443 789999888763 333333334333333 Q ss_pred ccceEEEeechh-hhhhhhhccceeeeeecccchhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 271 ASALAGLIFHRE-AAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 271 ~~~~~~l~~h~~-a~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) .+|+|.|= .++..+.+ .++.-|-=.|-.+.+||..|.+|...+-|--- T Consensus 471 ----~~~~y~Py~~l~~~~~~----------~dp~s~qp~~~~~tRY~l~v~nP~~~~~~~~~ 519 (523) T protein:vir:59 471 ----TGAVYAPYVPLLFTPTI----------VDPVNFSYRRGLMTRYALEVVRPEFYGLLYVK 519 (523) T ss_pred ----ccceecccchhhccccc----------ccCCcccceeeeeeehhheecchhHhhhhhhh Confidence 25666552 23222221 11222333455677888888888876644333 No 197 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=64.92 E-value=0.29 Score=23.51 Aligned_cols=292 Identities=12% Similarity=0.046 Sum_probs=111.8 Q ss_pred CCCc--------ccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecccceee Q lcl|Aclame:pro 1 MTTL--------SNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLSA 72 (332) Q Consensus 1 m~~~--------~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG~~t~ 72 (332) |++. ++++|....+-+ -++.+. --|=-+++...|...+....+ .++.+.- -...++.+|+++|-... T Consensus 1 ~~~~~~~~~~~n~~~~~i~k~~it--~~~l~~-g~L~p~~a~~Fl~~v~~~t~i-L~~~r~~-~~~s~~~ei~kig~G~r 75 (360) T protein:vir:99 1 MSSNSTIDSVRNQNMNSLSQKDIG--LAELDG-FQLPVDVTEEFLERMQKGVQI-LGMADTM-TLARLEMEVPQFGVPRL 75 (360) T ss_pred CcchhHHHHHhhhHHHHHHhhhcc--ccccCc-eeecHHHHHHHHHHHhhccch-hhhccee-eccccccccccccccee Confidence 5543 245565433211 111111 114446666666555444444 4555432 33456677777765433 Q ss_pred e--eecC-CCCCCccCCCCCceEEE-EEeeeeecchhhhhHHHHHhc-------hhHHHHHHHHHHHHHHHHH---HH-- Q lcl|Aclame:pro 73 G--YHTP-GTPIVGDAGIKANEKTL-VMDDLLVSSQFVYSLDEIFSQ-------YSTRAEVSKQIGEALATHY---DE-- 136 (332) Q Consensus 73 ~--~~~~-g~~~~~~~~~~~~~~~l-~ID~~~~~~~~Idd~D~~q~~-------~d~~~~~~~~~~~aLa~~~---D~-- 136 (332) . .+.. |+....+ +++...+.+ ..+...++...+.+-.+...+ -.+++.++++.++-|.... |. T Consensus 76 ~~r~~~e~~~~~~~~-~~~~~~v~~~~~~~~~~~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l~~~g~~ds 154 (360) T protein:vir:99 76 SGHTRDEEGSRTENS-EAESGSVKFNATDKSYYILVEPKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLMGIRAGASS 154 (360) T ss_pred eccccccCCCCCcCC-cCccccCccccccceeeEeechHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHHHhhccchh Confidence 2 3322 3322222 233333333 455544444333222111111 2345555555555443221 11 Q ss_pred --------------HHHHHHHHHhhhccccccccccceeccccc-------cc----------cCHHHH-HHHHHHHHHH Q lcl|Aclame:pro 137 --------------RIARVLAKASAEASPVTGEPGGFHVNIGAG-------NT----------NDAQAI-VDGFFEAAAV 184 (332) Q Consensus 137 --------------~i~~~~~~aa~~~~~~~~~~~~~~i~~~~~-------~~----------~~~~~~-~d~i~~a~~~ 184 (332) ..-+.+.++.....-+ ..++..+.++.. .+ -++..+ ...|.++.+. T Consensus 155 ~d~~~~~~~d~fl~~~dGwlKka~~~~~~i--d~a~d~t~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~lf~~~~~~ 232 (360) T protein:vir:99 155 GNLQSIGGAAELDNTFKGWIARAEGDAQSV--DDAGDSTRIGLEDTATADADSMPSIANTDGSGNPQPVDTSLFNETIQT 232 (360) T ss_pred cccccCcccchhhhhhHHHHHHhhcccchh--hccccccccccccccccccccchhhhccccccccccchHHHHHHHHHh Confidence 1111222221000000 000000000000 00 001111 1223455555 Q ss_pred HHhcCC--CcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeeeeeceEEEeeCccccccccccccccccc Q lcl|Aclame:pro 185 LDERSA--PQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTG 262 (332) Q Consensus 185 Lde~~V--P~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g 262 (332) |..+.- |...-+.+++|..+..... .+.+++...++..+..+. ..+..|++|+..+.+|.. . T Consensus 233 Lp~kyr~~~~~~~~~~~s~~~~~~yr~----~L~~R~t~LGd~~l~g~~-~~~~~Gipi~~v~~~pd~----~------- 296 (360) T protein:vir:99 233 LDSRYRESDAYSPVLMTSPNQVQSYTM----SLTEREDPLGSAVIFGDS-DITPFSYDLVGVNGFPDE----Y------- 296 (360) T ss_pred cchhhhcCcccceEEEccCchHHHHHH----HHhccCcccchhheeccc-ccccceeeeEEcCCCCCC----c------- Confidence 555532 1112245678876655543 344454444444444443 456789999999999841 1 Q ss_pred ccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHH----HHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 263 ENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDL----IVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 263 ~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~----i~~~~~~G~~vlrpe~~v~i~~A 332 (332) .++-+++=+..+-..++.++...++ .|+.+. +-.+.+.=--+++-+-+|++.+= T Consensus 297 -------------~mlT~p~NLi~g~~~~iri~~~~e~---~~~~~~~~~~~~~~~~~~D~~iee~~Av~~vt~ 354 (360) T protein:vir:99 297 -------------MMFTDPNNLAFGLYEEMELDQSTDT---DKVHEQRLHSRNWLEGQFDFQIKEQQAGVLVTD 354 (360) T ss_pred -------------eEEeccCceeEEeeeeeEEeecccc---hhhhhhceeeeEEEEEEeeEEEEecccEEEEec Confidence 1223443333333344443322221 111110 00000011112233223444333 No 198 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=61.58 E-value=0.35 Score=23.08 Aligned_cols=276 Identities=12% Similarity=0.057 Sum_probs=122.8 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHh-----hhhc----c-ccccccccccceEEEecccce Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNA-----SIFK----G-LVRSYDLRGGKSKQFMFTGKL 70 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~-----s~~~----~-~v~~r~~~~G~tv~i~~iG~~ 70 (332) |+= .| ++.|..++.+++-+. .+|. + .+.......|+-+..|..-.. T Consensus 1 m~l------------------sD------~~vfN~~~~~a~~e~~~q~~~~fn~as~gai~l~~~~~~Gd~~~~pf~~~l 56 (325) T protein:vir:95 1 MAL------------------SD------LAVYSEYAYSAFSETLRQQVDLFNTATGGAIMLQSAAHQGDFSDVAFFAKV 56 (325) T ss_pred Cch------------------hh------hhhhhhhhhhhhhhhhhhhHhhhhhcccceeEeccccccCceeeccccccc Confidence 221 11 234666666655443 1111 1 111122334777777755322 Q ss_pred -----eeeeecCCCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 71 -----SAGYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKA 145 (332) Q Consensus 71 -----t~~~~~~g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~a 145 (332) ...++.....+. +..++..+..-++ -..-..+...|+-..-.-.+.+++++++.|..+++...+.++..+.++ T Consensus 57 ~g~~~~~~~~~~~~~vt-~~kitt~~~~av~-~~r~~g~~~~d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~~~~~~ 134 (325) T protein:vir:95 57 TGGLVRRRNAYGSGTVA-EKVLKHLVDTSVK-VAAGTPPVRLDPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLNVGLGS 134 (325) T ss_pred cccccccccCCCCceec-cceeccccceeeE-EecccCcccccHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 333443333332 3344433332222 222233334454444445678888888999999888777766655443 Q ss_pred hhhccccccccccceeccccccc-cCHHHHHHHHHHHHHHHHhcCCCcCC-CEEEEChHHHHHHHhhcCchhhccccccc Q lcl|Aclame:pro 146 SAEASPVTGEPGGFHVNIGAGNT-NDAQAIVDGFFEAAAVLDERSAPQEG-RVAVLSPRQYYSLISSVDTNILNREIGNS 223 (332) Q Consensus 146 a~~~~~~~~~~~~~~i~~~~~~~-~~~~~~~d~i~~a~~~Lde~~VP~~g-R~~vv~P~~~~~Ll~~~d~~~~~~d~~~~ 223 (332) .... +.. .+.....+.+... .+.....+.|.+|..+|.++ .+. ..+++....|..|.+. ++++....-. T Consensus 135 l~~a--~~~-~~~~v~dis~~~~~~~~~~s~~~l~~A~~klGD~---~~~l~~~~MHS~v~~~L~~~---~L~~~~~~~~ 205 (325) T protein:vir:95 135 VYSA--LSQ-VSDVVYDATANTDAADKLPTWNNLNNGQAKFGDQ---SSQIAAWIMHSTPMHKLYGS---NLTNGERLFT 205 (325) T ss_pred HHHh--hcc-cccceeeeecccCcccccccHHHHHHHHHHhccc---ccceeEEEEchHHHHHHHHh---hccccccccc Confidence 3221 111 1111111111110 00111246788899998765 222 4677899999999753 4554322111 Q ss_pred cccccccceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccch Q lcl|Aclame:pro 224 QGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNV 303 (332) Q Consensus 224 ~~~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~ 303 (332) .+... .+..++|-+|+.+..+|..... .++ .| ..+.|-+-|++.....++.....+..++ T Consensus 206 ~~g~~---~i~t~~G~~VIVdD~~p~~~~g------~~~---~y-------tty~lg~GAi~~~~~~~~~~~~~~~~~~- 265 (325) T protein:vir:95 206 YGTVN---VVRDPFGKLLVMTDSPNLFAAG------TPN---VY-------HILGLVPGGVLIGQNNDFDANEETKNGD- 265 (325) T ss_pred cCCcc---cccccCCcEEEEeCCCCCCCcc------Cce---eE-------EEEEEecCeEEecCCCCccccccccCcc- Confidence 11111 2567889999999999964311 111 11 1233333344333333332222221111 Q ss_pred hHHHHHHH-----HHHHhCCcee------chhheeeeecC Q lcl|Aclame:pro 304 QYQGDLIV-----GKLAMGCGSL------RTSVAGSFQAA 332 (332) Q Consensus 304 ~~~~d~i~-----~~~~~G~~vl------rpe~~v~i~~A 332 (332) ...+..++ .+|.+|.+-- -|-- .+|.++ T Consensus 266 ~~~~~~~~~~~tf~lhp~G~sw~~s~~g~sPt~-aeL~~~ 304 (325) T protein:vir:95 266 ENIIRTYQAEWSYNIGVKGFAWDKANGGKSPTD-AALFTS 304 (325) T ss_pred cceeeeeeeeeeEEeecceeeeecccccCCcCh-HhhcCC Confidence 11111111 2345554441 1111 122222 No 199 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=57.89 E-value=0.42 Score=22.62 Aligned_cols=284 Identities=10% Similarity=0.004 Sum_probs=119.0 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHH----hHHHHHHHHHhhhhcccccccc-cccc-ceEEEecc---ccee Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLF----SGEVFTAFNNASIFKGLVRSYD-LRGG-KSKQFMFT---GKLS 71 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f----~g~V~~~f~~~s~~~~~v~~r~-~~~G-~tv~i~~i---G~~t 71 (332) -.-+++..++. +.-.+.+.- +.|+... ...|.+.-...-..+.++..++ ..-+ .++.++.. |..+ T Consensus 17 ~~~~a~~~~~~-----~~~~~~~~~-~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~~~~~~~~~~t~~~~~~~G~a~ 90 (329) T protein:vir:79 17 ANVIANHMQLR-----GAKNDASDM-GIWTSQELHKIKAQAYEKEYPAGSALRVFPVTSELSDTDKTFEYQTFDKVGHAK 90 (329) T ss_pred hhhHhhhcccc-----cceeccchh-hHHHHHHHHHHHHHHHhhhhcccchhhhcccccCCCCceeEEEeeeeecceeee Confidence 00111112221 111111111 2455333 3445444444455566666553 3323 45555544 4433 Q ss_pred eeeecC-CCCCCccCCCCCceEEEEEeee-eecchhhhhHHHHH-hchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|Aclame:pro 72 AGYHTP-GTPIVGDAGIKANEKTLVMDDL-LVSSQFVYSLDEIF-SQYSTRAEVSKQIGEALATHYDERIARVLAKASAE 148 (332) Q Consensus 72 ~~~~~~-g~~~~~~~~~~~~~~~l~ID~~-~~~~~~Idd~D~~q-~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~ 148 (332) -|.- .+++... +++-.+....|-.+ ..+.+.+.++..++ ...++-..-...+..++++..|+.++-=- + T Consensus 91 --~~~d~~~dip~v-d~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~---~-- 162 (329) T protein:vir:79 91 --IIADYTDDLSTV-DALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQLVNHLVFKGS---K-- 162 (329) T ss_pred --eecCccccccee-ecccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEeec---c-- Confidence 3332 1222211 23333333444332 22345566777775 46777777788888999999998766211 0 Q ss_pred cccccc---ccccceecccc-----ccccCHHHHHHHHHHHHHHHHhcCCCc-CCCEEEEChHHHHHHHhhcCchhhccc Q lcl|Aclame:pro 149 ASPVTG---EPGGFHVNIGA-----GNTNDAQAIVDGFFEAAAVLDERSAPQ-EGRVAVLSPRQYYSLISSVDTNILNRE 219 (332) Q Consensus 149 ~~~~~~---~~~~~~i~~~~-----~~~~~~~~~~d~i~~a~~~Lde~~VP~-~gR~~vv~P~~~~~Ll~~~d~~~~~~d 219 (332) ...+.| .|+-.....++ -...+++.+++.|.++..++.++.-=. ..-.++|+|+.|..|.. ..+ +.+ T Consensus 163 ~~g~~GLlN~p~v~~~~~~~~~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~~~p~~L~Lpp~~~~~L~~-~~~---~~~ 238 (329) T protein:vir:79 163 PHKIISVFEHPNLTTINSAGWNNAAGTGKKPETAQDELEQAIEKIETLTNGQHRANMILIPPSMRKVLMV-RMP---ETT 238 (329) T ss_pred cccceeeecCCCccccccCCCCCccccccCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHHhhc-ccC---CCC Confidence 111111 11111111111 112357889999999999987752211 11368899998877742 111 111 Q ss_pred cccccccccccceeee-eeceEEEeeCcccccccccccccccccc--cccccccccceEEEee-chhhhhhhhhccceee Q lcl|Aclame:pro 220 IGNSQGDMNSGKGLYS-IAGIRILKSNNLAGLYGQDLSSAAVTGE--NNDYQVDASALAGLIF-HREAAGCIQSVAPTIQ 295 (332) Q Consensus 220 ~~~~~~~~~~g~~v~~-i~G~~V~~sn~lp~~~g~~~~~~~~~g~--~~~y~~~~~~~~~l~~-h~~a~~~~~~~~~~~e 295 (332) . ...+.+.+ --+++|...+.|-..+ ..|. .-.|..+-.. +.+.+ .+--..-++.+.+..+ T Consensus 239 ~-------tvl~~lk~~~~~l~I~~~~el~~ag--------~~g~~~~v~y~~~~~~-~~~~vp~~~~~l~~q~~~~~~~ 302 (329) T protein:vir:79 239 M-------SYLDYFKQQNGGITIESISELEDID--------GAGTKAALVYEKDPMN-MSIEIPEAFNMLTAQPKDLHFK 302 (329) T ss_pred c-------cHHHHHHHhCCCcEEEEcccccccC--------CCCceEEEEEecCCce-EEEecCcceeeeeceecCceEE Confidence 1 11111211 1245566655553211 1111 0111111111 11111 1100111122222111 Q ss_pred eeecccchhHHHHHHHHHH-HhCCceechhheeeeecC Q lcl|Aclame:pro 296 TTSGDFNVQYQGDLIVGKL-AMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 296 ~~~~~~~~~~~~d~i~~~~-~~G~~vlrpe~~v~i~~A 332 (332) +.... ..|.-+.||+++.-+.== T Consensus 303 --------------v~~~~r~~Gv~i~~P~ai~~~dGI 326 (329) T protein:vir:79 303 --------------VPCTSKCTGLTIYRPLTLVLIKGL 326 (329) T ss_pred --------------EceeeeEEEEEEECcceeeeeeee Confidence 11112 336888999976532211 No 200 >protein:vir:98871 Length: 314 # NCBI annotation: major capsid protein # Family: family:all:3269 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164418;genbank:gi:56694908;genbank:GeneID:3197261 Probab=57.15 E-value=0.44 Score=22.54 Aligned_cols=282 Identities=10% Similarity=0.089 Sum_probs=121.7 Q ss_pred CCCcc----cccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhcccccc-----ccccccceEEEecccc-- Q lcl|Aclame:pro 1 MTTLS----NFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRS-----YDLRGGKSKQFMFTGK-- 69 (332) Q Consensus 1 m~~~~----~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~-----r~~~~G~tv~i~~iG~-- 69 (332) |--.- .|... +-...+-++-|.+--.|-|+|.|.+.+-|+.++.|++..-- .-+.+.++.---.... T Consensus 1 ~~~~~~~~~~~~~~--~~~~~~t~N~n~avr~Y~Kqf~glL~~vf~~qa~F~~~FGg~lQalDGV~~N~tafsvKtsD~p 78 (314) T protein:vir:98 1 MKKQFKPFLPLNNI--QFFASGTANQNKAARSYQKEFRQLLQAVFRSQAYFRDFFGGGIEALDGVQHNDTAFYVKTSDIP 78 (314) T ss_pred Ccccccccccccce--eeeeeccccCccceeeecHHHHHHHHHHHhhHhhhhhhcccceeeccCCCccceEEEEeecccc Confidence 32211 11111 11111111122222258899999999999999999876632 1222222211111111 Q ss_pred eee-eeecCCCCCC-----c-cCCCCCceEEEEEeee-ee-cchhh-hhHHHHHhchhHHH---HHHHHHHHHHHHHHHH Q lcl|Aclame:pro 70 LSA-GYHTPGTPIV-----G-DAGIKANEKTLVMDDL-LV-SSQFV-YSLDEIFSQYSTRA---EVSKQIGEALATHYDE 136 (332) Q Consensus 70 ~t~-~~~~~g~~~~-----~-~~~~~~~~~~l~ID~~-~~-~~~~I-dd~D~~q~~~d~~~---~~~~~~~~aLa~~~D~ 136 (332) +-+ ..|..+.... + +.....-+-.+-.|+. .| +.+.| .-+|..-.+-|+-. +....++.|-.+.+|. T Consensus 79 VVig~~Y~TdeNvaFGtGTg~SsRFGprkEi~y~dtdVpY~~~~~iHEGiD~~TVNnd~~aaVAdRL~LQA~Akt~~~n~ 158 (314) T protein:vir:98 79 VVVGNEYNKDENVGFGEGTSRSTRFGPRREIIYQDTPVPYTWEWVYHEGIDKHTVNNDFQAAVADRLDLQANAKIKQFNA 158 (314) T ss_pred eeecCcccCCCCcccccCCccccccCceeEEEeecccccccccchhhhccccccccCChhHHHHHHHHHHHHHHHHHHHH Confidence 112 1344322211 0 1111111222333332 22 22333 34566555656544 4455678888888887 Q ss_pred HHHHHHHHHhhhccccccccccceeccccccccCHHHHHHHHHHHHHHHHhcCCCc---CCCEEEEChHHHHHHHhhcCc Q lcl|Aclame:pro 137 RIARVLAKASAEASPVTGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQ---EGRVAVLSPRQYYSLISSVDT 213 (332) Q Consensus 137 ~i~~~~~~aa~~~~~~~~~~~~~~i~~~~~~~~~~~~~~d~i~~a~~~Lde~~VP~---~gR~~vv~P~~~~~Ll~~~d~ 213 (332) .+-..|...+...... + ..+ .|.+..+..+|.+..|-- ....+.|.|+.|..|+.. + T Consensus 159 ~~Gk~lS~~As~te~l-----------t---d~~----~d~V~~LF~~as~~yvn~ev~~~~~AyV~~evYnaiiD~--~ 218 (314) T protein:vir:98 159 QHSKFISSIAEKTETL-----------T---DYS----ADNVLRLFNELSKYYVNIEAIGTKAAKVSPELYNAIVDH--P 218 (314) T ss_pred HHHHHHHhhhhhhhhh-----------h---hcc----hhhHHHHHHHHHhhhhcceeeEEEEEEEchhHHhHhhcc--c Confidence 6655443332211100 0 011 133444444554444421 236788999999999863 4 Q ss_pred hhhccccccccccccccceeeeeeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccce Q lcl|Aclame:pro 214 NILNREIGNSQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPT 293 (332) Q Consensus 214 ~~~~~d~~~~~~~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~ 293 (332) ..+.. -++...+.+.+ +.+.-||.|-+.+.--...+. ...... +-++..|.- +-++. .++ T Consensus 219 l~Tsa--K~SsaNIDeng-i~~FkGf~i~e~P~~~~q~g~-ia~~s~------------dnig~aftG--In~aR--~Ie 278 (314) T protein:vir:98 219 LTTSA--KSSSANIDQNG-IVNFKGFAIQEIPESMLQSGD-VAYTYI------------TNIGKAFTG--INTSR--IIE 278 (314) T ss_pred ccccc--ccceeeeccCC-cceecceEEEecchhhcCCCc-EEEEcc------------ccceeeccc--ceeee--eee Confidence 34433 23444555554 778889999876542211111 100000 111222210 11111 112 Q ss_pred eeeeecccchhHHHHHHHHHHHhCCceechhheeee-ecC Q lcl|Aclame:pro 294 IQTTSGDFNVQYQGDLIVGKLAMGCGSLRTSVAGSF-QAA 332 (332) Q Consensus 294 ~e~~~~~~~~~~~~d~i~~~~~~G~~vlrpe~~v~i-~~A 332 (332) .|- +-|-.+-|---||--++.-.....+ .++ T Consensus 279 sEd--------F~GValQgAGK~G~~I~edNk~Ai~k~t~ 310 (314) T protein:vir:98 279 SED--------FDGVALQGAGKAGEFILDDNKKAVAKVTS 310 (314) T ss_pred ccc--------ccceeeecccccccccccccceeeEEEec Confidence 222 2233333444455445544332222 344 No 201 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=48.97 E-value=0.65 Score=21.60 Aligned_cols=292 Identities=12% Similarity=0.039 Sum_probs=125.5 Q ss_pred CC--CcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccc--cceEEEec---ccceeee Q lcl|Aclame:pro 1 MT--TLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRG--GKSKQFMF---TGKLSAG 73 (332) Q Consensus 1 m~--~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~--G~tv~i~~---iG~~t~~ 73 (332) |- +.....+|.......+++..= -|++-|.+.+.+....--+...++...+.-. -+++.|+. .|.+++ T Consensus 56 md~~~~~~~~~~~~~l~~~~~~g~~----~~l~~~~p~~i~~~tap~~a~~l~pv~t~g~W~~~~~~~~v~e~~G~A~~- 130 (379) T protein:vir:10 56 MDSNDIGPIPTPLSPLSPVSIPGLI----QFLQNWLPGHVRILTAVREADEFLGLSTVGQWDDEQIVQRVLEGLGTAQP- 130 (379) T ss_pred hccccccccccccCccccccccchH----HHHHhhcchHHHHHhhhhhhhhhcccccCCCceeeeEEEeeeeeeeeeEE- Confidence 43 222233332222222232221 3888888776666655555667776655211 14555554 455443 Q ss_pred eecCCCCC-CccCCCCCceEEEEEeeeeecchhhhh--HHHHH-hchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|Aclame:pro 74 YHTPGTPI-VGDAGIKANEKTLVMDDLLVSSQFVYS--LDEIF-SQYSTRAEVSKQIGEALATHYDERIARVLAKASAEA 149 (332) Q Consensus 74 ~~~~g~~~-~~~~~~~~~~~~l~ID~~~~~~~~Idd--~D~~q-~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~ 149 (332) |.-+++. ..+-+.+-....+..=+ ..+.+.+ +..++ +..++-.+-.+.+..+|.+..|+..+-=...+.... T Consensus 131 -ygd~~d~pl~d~~~~~~~r~v~~~~---~g~~yg~~El~~Aa~~g~~l~~~Ka~aA~~ale~~~N~i~f~G~~d~~~~~ 206 (379) T protein:vir:10 131 -YTDGGNMALMSWTPTFETRTVVRFE---AGLQVAPLEEARSSRVQVSSADEKRAMVGEALEVQRNRVAFYGYNDGSGRT 206 (379) T ss_pred -eccccCCCeeeeeeeeeeeeeEEEE---EEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCCCcce Confidence 3323222 11112222222222211 1222333 33333 367777777778888888888775442110010000 Q ss_pred c---ccccccccceeccccc-----cccCHHHHHHHHHHHHHHHHhcC----CCcCCC-EEEEChHHHHHHHhhcCchhh Q lcl|Aclame:pro 150 S---PVTGEPGGFHVNIGAG-----NTNDAQAIVDGFFEAAAVLDERS----APQEGR-VAVLSPRQYYSLISSVDTNIL 216 (332) Q Consensus 150 ~---~~~~~~~~~~i~~~~~-----~~~~~~~~~d~i~~a~~~Lde~~----VP~~gR-~~vv~P~~~~~Ll~~~d~~~~ 216 (332) - +-...++......+++ ...+++.+++.|..+...|-.+. .|.+-+ .++++|..+..|-. + T Consensus 207 yGllNdP~l~a~~t~atg~~~~t~Wa~kT~~eI~~Di~~~~~~l~~qs~g~~~~~~~~~tL~LP~~~~~~L~~---~--- 280 (379) T protein:vir:10 207 FGFLNDPNLPAYVAVPNGAGGSPLWAQKTTLEIIADLRNGLTALQVQSMGRIKSNKTPITIGIPNAYENYITT---P--- 280 (379) T ss_pred EEEEeCCCCcccccccCCcccccccccCCHHHHHHHHHHHHHHHHHhhCCeecccccceeEEecHHHHHhhcc---c--- Confidence 0 0000011111111111 12357888888888877765442 254434 68899998877732 2 Q ss_pred ccccccccccccccceeeeeeceEEEeeCccccccccccccccccccccccccc-------ccceEEEeechhhhh-hhh Q lcl|Aclame:pro 217 NREIGNSQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVD-------ASALAGLIFHREAAG-CIQ 288 (332) Q Consensus 217 ~~d~~~~~~~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~-------~~~~~~l~~h~~a~~-~~~ 288 (332) | .+..+-....+ .+.-+++|...+.|-..++.+.. .--|..+ -...+-++++..... -++ T Consensus 281 n-~~g~Tvl~~lk----~n~Pnl~i~t~pEL~~aggg~~~-------~~~~~~~~~~~~t~~~~~~~~~~p~k~~~l~ve 348 (379) T protein:vir:10 281 T-ELGYSVAQYMR----ESYPNVTFVSAPELNDANGGSSA-------IYYYADAVENNGTDDGRTWLQVVPTKMFTLGVE 348 (379) T ss_pred c-ccCccHHHHHH----HhcCCcEEEEcccccccCCCccE-------EEEEeeccCCCccCCcceEEEecchhhhhccce Confidence 1 12111111111 12336778887777432111000 0001000 001122223222110 011 Q ss_pred hccceeeeeecccchhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 289 SVAPTIQTTSGDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 289 ~~~~~~e~~~~~~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) ...+ .+..+.. -...|+-+.||-+++-+.=| T Consensus 349 ~~~~---~~~~~~~----------~rt~Gv~ir~P~Ai~~~~G~ 379 (379) T protein:vir:10 349 KKIK---GYAEGYT----------NATAGAMLKRPFATYRQTGA 379 (379) T ss_pred ecCc---eeEeccc----------cceeeeeeecchhhheecCC Confidence 1111 1111111 12347888899998777777 No 202 >protein:vir:99888 Length: 309 # NCBI annotation: capsid protein # Family: family:all:908 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164075;genbank:gi:56692607;genbank:GeneID:3192616 Probab=38.72 E-value=1.1 Score=20.46 Aligned_cols=280 Identities=13% Similarity=0.105 Sum_probs=105.0 Q ss_pred CCCccccccc-ccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEeccccee---ee-e- Q lcl|Aclame:pro 1 MTTLSNFSLP-NQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLS---AG-Y- 74 (332) Q Consensus 1 m~~~~~~~r~-~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG~~t---~~-~- 74 (332) |++= -|| ++.. --+.--|. ...|-..++| +.|. ....+.+++..|+.. +. + T Consensus 1 ~~~~---~~~~dp~L------------T~~A~gy~---n~~~Ia~~l~-P~vp----V~~~~~~~~~f~~~e~F~~~~t~ 57 (309) T protein:vir:99 1 MSNA---PFPIDPEL------------TAIAIAYR---NGRMISDEVL-PRVP----VGKQEFKFWKYDLAQGFTVPETL 57 (309) T ss_pred CCCC---CcCcCHhH------------HHHHhhcc---ChhhhhhhcC-Cccc----cCccccceeeechhhcccccchh Confidence 7762 232 2110 00111111 1112223333 3332 233334445555432 11 1 Q ss_pred ecCCCCCCccCCCCCceEEEEEeeee-ecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|Aclame:pro 75 HTPGTPIVGDAGIKANEKTLVMDDLL-VSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVT 153 (332) Q Consensus 75 ~~~g~~~~~~~~~~~~~~~l~ID~~~-~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~ 153 (332) ..++...... ++..++.++.+.+.- ...+...++.++...+|++....+.....|....+..+...+..++.. T Consensus 58 r~~~~~~~~v-~~~~~~~~~~~~~~~L~~~i~~~~~~~a~~~~d~~~~Av~~l~~~i~l~rE~~~A~lv~~~a~y----- 131 (309) T protein:vir:99 58 VGRKSKPNEV-EFSATDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSY----- 131 (309) T ss_pred hccCCCcceE-eecccCceeeecccceeecCCchhhhhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcChhhc----- Confidence 1233333221 223333444444332 223333355566678999888877777766666665555544333322 Q ss_pred cccccceeccccccc-cC-HHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhcccccc-cccccccc Q lcl|Aclame:pro 154 GEPGGFHVNIGAGNT-ND-AQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGN-SQGDMNSG 230 (332) Q Consensus 154 ~~~~~~~i~~~~~~~-~~-~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~-~~~~~~~g 230 (332) +.++.+.++.++. .+ ....+..|.++..++. ...-.++++...|..|.. +++++++.... .+.....- T Consensus 132 --~~~~k~~Lsgt~~wsd~~SDPi~~i~~~~~~~g-----~~PN~~vlg~~~~~~l~~--hp~i~~~ik~~~~~~g~it~ 202 (309) T protein:vir:99 132 --AAGNKTTLSGADQWSDPTSNPLPVITDALDSVI-----LRPNIGVLGRRTATILRR--HPKIVKAYNGSLGDEGMVPM 202 (309) T ss_pred --CCCceEEecCccccCCCCCCcHHHHHHHHHhhC-----CCcceEEechHHHHHHhh--CHHHHHHhcCCCccccccCH Confidence 1222333322211 11 1122333445544431 122478899999998874 58888874322 22222333 Q ss_pred ceeeeeeceE-EEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhhhccceeeeeecccchhHHHHH Q lcl|Aclame:pro 231 KGLYSIAGIR-ILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDL 309 (332) Q Consensus 231 ~~v~~i~G~~-V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~~~~~~~d~ 309 (332) +.+..++|++ |+.....-+++ ..|....+.---.+.+.|++......+.+ +++--.+..+ ..+..|.. T Consensus 203 ~~la~l~~ve~V~vg~a~~n~a--------~~g~~~~~~~iwg~~~~L~y~~~~~~~~~--~ps~G~t~~~-~~r~~g~~ 271 (309) T protein:vir:99 203 AFLQELLELDAIYIGEARLNIA--------RPGQNPNLIRAWGPHASFIYRDRLADTRN--GTTFGLTAQW-GDRVSGSI 271 (309) T ss_pred HHHHHHhCcceEEeecceeecc--------ccccccccccccCCcEEEEEcCCCCCCcc--cccccceeec-ccccCCce Confidence 3467778884 55432221110 01111111111122233333222211111 1111111000 00001111 Q ss_pred HHHHHH-hCCceechhheeeeec----C Q lcl|Aclame:pro 310 IVGKLA-MGCGSLRTSVAGSFQA----A 332 (332) Q Consensus 310 i~~~~~-~G~~vlrpe~~v~i~~----A 332 (332) ++-.+- =|..++| +....+ | T Consensus 272 ~d~~~~~~g~~~vr---~~~~~k~~i~~ 296 (309) T protein:vir:99 272 ADPNIGLRGGQRVR---VGESVKELVTA 296 (309) T ss_pred eeeeeccCCceEEE---Eeccccchhcc Confidence 100000 0223333 121111 1 No 203 >protein:vir:107882 Length: 307 # NCBI annotation: gp34 # Family: family:all:908 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024707;genbank:gi:48696944;genbank:GeneID:2845970 Probab=35.88 E-value=1.2 Score=20.14 Aligned_cols=285 Identities=13% Similarity=0.067 Sum_probs=102.0 Q ss_pred CCCcccccccccccccccccccCchh-hHHHHHHhHHHHHHHHHhhhhccccccccccccceEEEecccceeeee--ecC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRY-ATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLSAGY--HTP 77 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~-al~~e~f~g~V~~~f~~~s~~~~~v~~r~~~~G~tv~i~~iG~~t~~~--~~~ 77 (332) |+.++ -.|+ +|..+ ++-+--+.+ .|-..++| +.+.+ ..++|+-.+|+.-+ .++.+ ..+ T Consensus 1 m~~~~-~~~~-----------~dp~LT~~A~gy~n~----~~ia~~l~-P~vpv-~~~~~k~~~f~~ea-F~~~~t~r~~ 61 (307) T protein:vir:10 1 MGRLS-KLRI-----------VDPVLTNLAIGYTNA----EFIGQSLM-PVVEV-EKEGGKIPKFGKES-FRLYKTERAL 61 (307) T ss_pred CCCCC-CCcc-----------cChhHHHHHHhhcch----hhhhhhcC-Ccccc-cccccceeeECccc-ccchhhhccc Confidence 55532 1222 11100 111111111 12222222 22221 12334444443211 11111 112 Q ss_pred CCCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 78 GTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPG 157 (332) Q Consensus 78 g~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~ 157 (332) +..+.-.+.-..+.....+-+.- -...||+.+...+.||++....+.....+.+..+..+...+...... +. T Consensus 62 ~~~~~~v~~~~~~~~~~~~~~~~-L~~~id~r~~~~~~~~~~~~av~~l~d~I~l~~E~~~A~l~~~~~~y-------~~ 133 (307) T protein:vir:10 62 RARSNRMNPEDLGSIDIVLDEHD-LEYPIDYREDQESAFPLEQAAVQTATEAIQLRREKMVADLAQNPNSY-------AG 133 (307) T ss_pred CCCcceeeccccccccccccccc-ccccCChhhcCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCcccc-------CC Confidence 22221111001112222222211 12346666667778888876666666655555555554443322221 12 Q ss_pred cceecccccccc-C-HHHHHHHHHHHHHHHHhcCCCcCCCEEEEChHHHHHHHhhcCchhhccccccccccccccceeee Q lcl|Aclame:pro 158 GFHVNIGAGNTN-D-AQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYS 235 (332) Q Consensus 158 ~~~i~~~~~~~~-~-~~~~~d~i~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~~~~~~~~g~~v~~ 235 (332) ++.+.++.++.. + ....+..|.++.+++.+..- .....++++++.|..|+. +++++++-.....+ ...-+.+.. T Consensus 134 ~~k~tLsGt~~Wsd~~sDPi~di~~~~~ai~~~~g-~~Pn~~vlg~~a~~al~~--hp~i~e~lk~~~~g-~it~~~la~ 209 (307) T protein:vir:10 134 GNKKQLSATEKFTAAGSDPVGVIEDGKEAIRTKIG-RRPNTMVIGASAYKTLKA--HPQLIEKIKYSMKG-IVTVDLLKE 209 (307) T ss_pred CceEEeccccccCCCCCCcHHHHHHHHHHHHhhhC-CccceEEeCHHHHHHHhc--CHHHHHHhCCcccc-ccCHHHHHH Confidence 233333322111 1 12235556677777666533 233578899999999975 58888764433333 333334667 Q ss_pred eeceEEEeeCcccccccccccccccccccccccccccceEEEeechhhhhhhh------hccceeeeeecccchhHHHHH Q lcl|Aclame:pro 236 IAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQ------SVAPTIQTTSGDFNVQYQGDL 309 (332) Q Consensus 236 i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~a~~~~~------~~~~~~e~~~~~~~~~~~~d~ 309 (332) ++|++.+....--...... .+.--..+.+.|.+.+...+..+ +.+.+.+.-.. .+.|. T Consensus 210 ll~v~~i~vg~a~~~~~~~-----------~~~~iw~~~~vl~yv~~~~~~~~~~~~epsfGyT~~~~g~-----~~~d~ 273 (307) T protein:vir:10 210 IFEVENIAVGEAIYADDKD-----------RFTDIWGANIVLAYVPLQRGGQQRTPYEPSYGYTLRKKGN-----PVVDT 273 (307) T ss_pred HhCceeEEEeeeeeeccCC-----------ccceeCCCceEEEecccccCCCCCcccccccceeEEEcCC-----eEeec Confidence 7887766543211100000 00000011112222222111010 01111110000 00011 Q ss_pred HHH---HHHhCCcee------chhheeeeecC Q lcl|Aclame:pro 310 IVG---KLAMGCGSL------RTSVAGSFQAA 332 (332) Q Consensus 310 i~~---~~~~G~~vl------rpe~~v~i~~A 332 (332) ..+ -..+.++-. =|++---|+-| T Consensus 274 ~~~~~~~~~~r~~~~~~~~i~~~~~G~li~~~ 305 (307) T protein:vir:10 274 RIEDGKLELVRSTDIFRPYLLGADAGYLISGI 305 (307) T ss_pred eecCCceeEEeccccccceeecccccceeccC Confidence 001 001111111 11111111111 No 204 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=30.59 E-value=1.6 Score=19.52 Aligned_cols=285 Identities=12% Similarity=0.024 Sum_probs=121.9 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHh-HHHHHHHHHhhhhccccccccccc--cceEEEec---ccceeeee Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFS-GEVFTAFNNASIFKGLVRSYDLRG--GKSKQFMF---TGKLSAGY 74 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~-g~V~~~f~~~s~~~~~v~~r~~~~--G~tv~i~~---iG~~t~~~ 74 (332) |+--++-..|.. .+++++..- =|+.-|- ..+.+.....-+...++...+.=. -+++.|+. .|.+.+ T Consensus 31 ~a~da~d~~~~~--~t~~~~g~~----~~l~~~i~p~~~~~~~~~~~~~~l~~v~t~g~W~~~~~~~~~~e~~G~a~~-- 102 (336) T protein:vir:78 31 YAMDAADLSPHL--SSTGSSGIP----NYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVAT-- 102 (336) T ss_pred HHHhhhhhcccc--ccCCCcchH----HHHHHhcccceeeehhhhhhhhhhcccccCCCccccEEEEeeeecceeeEE-- Confidence 322233334432 334444333 2777776 333333222233345555444211 14555543 455443 Q ss_pred ecCCCCCCccCCCCCceEEEEEeeee-ecchhhhhHHHHH-hchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 75 HTPGTPIVGDAGIKANEKTLVMDDLL-VSSQFVYSLDEIF-SQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPV 152 (332) Q Consensus 75 ~~~g~~~~~~~~~~~~~~~l~ID~~~-~~~~~Idd~D~~q-~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~ 152 (332) |.-++++. -.+.+-++.+-+|-..- -+.+-+..+..++ +..++-.+-.+.+.++|.+..++..+-=-. .... T Consensus 103 ygd~~D~P-~vd~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~-----~~~~ 176 (336) T protein:vir:78 103 YGDYSSDG-DSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVA-----GLEN 176 (336) T ss_pred eecccCCC-eeecceeeEEEEEEEEEeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCeEEEEecc-----ccce Confidence 32233321 11222233333332211 1222233444443 367777777777888888887764432110 1111 Q ss_pred cc---cccc-ceecccc--ccccCHHHHHHHHHHHHHHHHhcC---C-CcCCCEEEEChHHHHHHHhhcCchhhcccccc Q lcl|Aclame:pro 153 TG---EPGG-FHVNIGA--GNTNDAQAIVDGFFEAAAVLDERS---A-PQEGRVAVLSPRQYYSLISSVDTNILNREIGN 222 (332) Q Consensus 153 ~~---~~~~-~~i~~~~--~~~~~~~~~~d~i~~a~~~Lde~~---V-P~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~~ 222 (332) .| .|.- ..+..++ -..++++.+++.|..+...|.... + |..--.++++|..+..|-. + + .++. T Consensus 177 ~GllN~P~l~a~~t~~~~~w~~~T~~~I~~Di~~~~~~l~~qt~g~~~~~~~~tL~Lp~~~~~~L~~---~---n-~~g~ 249 (336) T protein:vir:78 177 YGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK---T---N-QYGL 249 (336) T ss_pred EEEEeCCCCCcccccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHhccC---C---C-ccCc Confidence 11 1100 0011111 123567889999999888886554 2 3334578899999877732 1 1 1211 Q ss_pred ccccccccceeeeeeceEEEeeCcccccccccccccccccccccccccc--cceEEEeechhhh-hhhhhccceeeeeec Q lcl|Aclame:pro 223 SQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDA--SALAGLIFHREAA-GCIQSVAPTIQTTSG 299 (332) Q Consensus 223 ~~~~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~--~~~~~l~~h~~a~-~~~~~~~~~~e~~~~ 299 (332) +-....+. +.=+++|...+.|-..+| +..--|..+. ..+..+.++..-. .-++...+. +.. T Consensus 250 tv~~~lk~----n~Pnl~i~t~pel~~Agg---------~~~~~~~~~~~~~~t~~~~~p~~f~~lpvq~~~~~---~~v 313 (336) T protein:vir:78 250 SAAAKLKE----IFPKLEFVTIPEYDTASG---------RLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSY---FRQ 313 (336) T ss_pred cHHHHHHH----hcCccEEEEcccccccCc---------ceEEEEEeeccCCcceeeecchhhhccceeecCce---eEe Confidence 11111111 122566777666632111 1111111111 1222333333221 112333322 222 Q ss_pred ccchhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 300 DFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 300 ~~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) +... ...|+-+.||-++.-+.== T Consensus 314 ~~~~----------rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:78 314 KKSA----------GTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred cccc----------ceeeeeeeccchheeeccC Confidence 2221 2448888888877644333 No 205 >protein:vir:78148 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:4955 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294802;genbank:gi:149882823;genbank:GeneID:5309176 Probab=28.62 E-value=0.67 Score=21.54 Aligned_cols=110 Identities=17% Similarity=0.179 Sum_probs=52.9 Q ss_pred EEChHHHHHHHhhc-CchhhccccccccccccccceeeeeeceEEEeeCccccccccccc-ccccccc-------ccccc Q lcl|Aclame:pro 198 VLSPRQYYSLISSV-DTNILNREIGNSQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLS-SAAVTGE-------NNDYQ 268 (332) Q Consensus 198 vv~P~~~~~Ll~~~-d~~~~~~d~~~~~~~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~-~~~~~g~-------~~~y~ 268 (332) +|+--+|..+|-.. ++..+.++- ...+..|+.--+++|.+|+.|+|||... ... ....-|. +..|+ T Consensus 1 vvsdlqfA~~~g~~v~~~aLpRE~---aNp~ltG~lpV~~~GltWl~tpnlpg~~--a~vlDst~lGgmaDE~l~~Pgya 75 (123) T protein:vir:78 1 MLSGAQFAKLIGILVDDKALPREQ---ANIVLTGSLPVSAYGLTWVTSRHITGTD--PWLFDVEQLGGMADEKLLSPEFA 75 (123) T ss_pred CcchhhHHHHhcchhccccccccc---CCceEecCcceeeeceeeeecCCCCCCc--cceeehhhhccccccccCCCccc Confidence 56666688887431 222333332 2344556545579999999999999432 111 1111110 11111 Q ss_pred ccccceEEEeechhhhhhhhhccceeeeeeccc--chhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 269 VDASALAGLIFHREAAGCIQSVAPTIQTTSGDF--NVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 269 ~~~~~~~~l~~h~~a~~~~~~~~~~~e~~~~~~--~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) .. .-.++++...|++. +..| .++++.+-=.-++.|.+-+-|.-- T Consensus 76 ~~-----------------~~~Gvevkt~Red~~~nD~y---riRaRRvTvpiv~EP~Agv~ltg~ 121 (123) T protein:vir:78 76 PA-----------------GNTGVEASTERAHQGVKDGY---LVRGRRNTVAVVTEPMAGVRLTGT 121 (123) T ss_pred CC-----------------CCcceeEEeeccccCCCCce---EEeeeecceeEEecCccceEEeee Confidence 11 11223444444433 2222 466665555666666655444433 No 206 >protein:vir:103181 Length: 457 # NCBI annotation: gp135 # Family: family:all:364 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717802;genbank:gi:113200639;genbank:GeneID:4239190 Probab=21.81 E-value=2.5 Score=18.36 Aligned_cols=301 Identities=11% Similarity=0.056 Sum_probs=123.3 Q ss_pred CCCccccc---ccccccccccccccCchhhHHHHHHhHHHHHHHHHhhhhcccccc---ccccccceEEEecccceeeee Q lcl|Aclame:pro 1 MTTLSNFS---LPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRS---YDLRGGKSKQFMFTGKLSAGY 74 (332) Q Consensus 1 m~~~~~~~---r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~s~~~~~v~~---r~~~~G~tv~i~~iG~~t~~~ 74 (332) |+-|..+- |+--.++.+..+... +-|+| .|.++.|.-..-....... .+..+....-.+.-+..+... T Consensus 97 mTgPTGLIFAmRsrY~~q~~~~~a~~-~EAl~-----nEadt~fSg~~~~~~~~~~~~~~~~~gt~~~~~~~~~~~~~~~ 170 (457) T protein:vir:10 97 MTGPTGLIFAMRTNYGAERNPAAAGY-DEAFF-----NEPNAGFSGGPGAYDPGATGVTNDAEGTNPALLNDSPAGTYEQ 170 (457) T ss_pred CCCcceeeeeeeeeecCccccccccc-cceee-----eccCcccCcccccccccccccccccccccccccCccccccccc Confidence 88887755 432112222111111 12333 3344443321100000000 001111111011000111111 Q ss_pred ecCCCCCC---------ccCCCCCceEEEEEeeeee--------cchhhhhHHHHHh-c-hhHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 75 HTPGTPIV---------GDAGIKANEKTLVMDDLLV--------SSQFVYSLDEIFS-Q-YSTRAEVSKQIGEALATHYD 135 (332) Q Consensus 75 ~~~g~~~~---------~~~~~~~~~~~l~ID~~~~--------~~~~Idd~D~~q~-~-~d~~~~~~~~~~~aLa~~~D 135 (332) +..+..+. +..+....+..+.||++.. +...+.-..+.++ | .|.-.|++.=++..++.++. T Consensus 171 ~~~~~gmsTA~aE~lgd~~~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAiHGLDAEtELaNILStEImlEIN 250 (457) T protein:vir:10 171 ADDATGMSTATVEALDDSTANTAFREMGFSIEKVTVTARARALKAEYSIEMAQDLKAIHGLDAEQELANILSTEILAEIN 250 (457) T ss_pred cccccchhhhhhhccCCCCCccchhhheeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhh Confidence 11111110 0111234677788887643 3344555555566 4 78889999999999999999 Q ss_pred HHHHHHHHHHhhhccccccccccceeccccccccCHHHHHHHH--------HHHHHHHHhcCCCcCCCEEEEChHHHHHH Q lcl|Aclame:pro 136 ERIARVLAKASAEASPVTGEPGGFHVNIGAGNTNDAQAIVDGF--------FEAAAVLDERSAPQEGRVAVLSPRQYYSL 207 (332) Q Consensus 136 ~~i~~~~~~aa~~~~~~~~~~~~~~i~~~~~~~~~~~~~~d~i--------~~a~~~Lde~~VP~~gR~~vv~P~~~~~L 207 (332) +.|++.+...+.-. .+.+........+. ..+++.-..+.. .++.....+- ---.+.++|++|++.+.| T Consensus 251 Reii~~l~~~a~~~-~~~~~~~~gv~dl~--~~~~g~~~~e~~k~L~~~i~~ean~i~~~T-~rg~gn~~i~S~~Va~~L 326 (457) T protein:vir:10 251 REVVRTIYTNAVAG-AQNNTATAGVFDLD--VDSNGRWSVEKFKGLLFQIERDANAIGHQT-RRGKGNILICSADVVSAL 326 (457) T ss_pred HHHHHhHhhhheee-eccccccceeeeee--ccccchhhHHHHHHHHHHHHHHHHHHHHhh-ccccceEEEEchhHHHHH Confidence 99999887555321 11111111111111 111222112222 2222222222 124568999999998887 Q ss_pred HhhcCchhhcc---cccccccccccc-ceeeeee-ceEEEeeCcccccccccccccccccccccccccccceEEEeechh Q lcl|Aclame:pro 208 ISSVDTNILNR---EIGNSQGDMNSG-KGLYSIA-GIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHRE 282 (332) Q Consensus 208 l~~~d~~~~~~---d~~~~~~~~~~g-~~v~~i~-G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~~~~~~l~~h~~ 282 (332) -.+.-..+... +.+.++ .-..| ..+|.+. |++||.-+-....+...+..-+.-| +-.--.+|+|.|= T Consensus 327 ~~sg~l~~~p~~~~~~~~~~-~d~~~~~~~G~l~~r~~vy~D~Ya~~ns~~dy~~vG~KG-------~~~~~~glfy~PY 398 (457) T protein:vir:10 327 GMAGVLDYTPALNGNNGLAG-VDDTSSTLVGTLNGRIKVYVDPYSANVADKHFYVAGYKG-------TSPYDAGLFYCPY 398 (457) T ss_pred hhcccccccchhhccccccc-cccccceeEEEecCCeEEEEecccccCCccceEEEEEeC-------Ccceecceeeccc Confidence 44321122211 111110 00112 2355543 7888887322211112233322222 2222245666552 Q ss_pred h-hhhhhhccceeeeeecccchhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 283 A-AGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 283 a-~~~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) - +--.... + +.-|-=.|-.+.+||. ..+|... .+..+ T Consensus 399 v~l~~~~~~--------d---p~sfqP~~g~~tRY~l-~~NP~~~-~~~~~ 436 (457) T protein:vir:10 399 VPLQQVRAI--------N---PDTFQPKIGFKTRYGM-VSNPFAG-GLTQG 436 (457) T ss_pred ccccccCcc--------C---Cccccceeeeeeeeee-eeccccc-ccccc Confidence 1 1111111 1 2223334555667777 6677753 33322 No 207 >protein:vir:106734 Length: 336 # NCBI annotation: gp13 # Family: family:all:1653 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944321;genbank:gi:38638620;genbank:GeneID:2657363 Probab=21.56 E-value=2.6 Score=18.32 Aligned_cols=284 Identities=12% Similarity=0.029 Sum_probs=118.2 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHh-HHHHHHHHHhhhhccccccccccc---cceEEEec---ccceeee Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFS-GEVFTAFNNASIFKGLVRSYDLRG---GKSKQFMF---TGKLSAG 73 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~-g~V~~~f~~~s~~~~~v~~r~~~~---G~tv~i~~---iG~~t~~ 73 (332) |+--++-..|.. .+++++... =|+.-|- +.+.+.....-....++...+. + -+++.|+. .|.+. T Consensus 31 ~a~da~d~~~~~--~t~~~~g~~----~~l~~~i~p~~~~~~~~~~~~~~l~~v~t~-g~w~~~~~~~~~~e~~G~a~-- 101 (336) T protein:vir:10 31 YAMDAADLSPHL--SSTGSSGIP----NYLTTYVDPSVIDILVAPMKAAELVGESKK-GDWTTLVAAFITAEPTTKVA-- 101 (336) T ss_pred HHHhhhhhcccc--ccCCCcchH----HHHHhhcCcceeeeeechhchhhhcccccC-CCcceeeEEEEeeeeeeeEE-- Confidence 322233333431 334444333 2777776 3333332222233455555442 2 13444443 35553 Q ss_pred eecCCCCCCccCCCCCceEEEEEeee-eecchhhhhHHHHH-hchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|Aclame:pro 74 YHTPGTPIVGDAGIKANEKTLVMDDL-LVSSQFVYSLDEIF-SQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASP 151 (332) Q Consensus 74 ~~~~g~~~~~~~~~~~~~~~l~ID~~-~~~~~~Idd~D~~q-~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~ 151 (332) -|.-++++.- .+.+-+..+-++--. .-+.+-+..+..++ +..++-.+-.+.+..+|.+..++..+-=- .... T Consensus 102 ~ygd~~d~P~-~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd-----~~~~ 175 (336) T protein:vir:10 102 TYGDYSSDGD-SGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGV-----AGLE 175 (336) T ss_pred EccccCCCcc-eeeeeeeeeeeEEEEEEEEeeCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCeEEEEee-----cccc Confidence 2322222211 112222222222111 01222233333333 36677777777777777777775333110 0111 Q ss_pred ccc---cccc-ceecccc--ccccCHHHHHHHHHHHHHHHHhcC---C-CcCCCEEEEChHHHHHHHhhcCchhhccccc Q lcl|Aclame:pro 152 VTG---EPGG-FHVNIGA--GNTNDAQAIVDGFFEAAAVLDERS---A-PQEGRVAVLSPRQYYSLISSVDTNILNREIG 221 (332) Q Consensus 152 ~~~---~~~~-~~i~~~~--~~~~~~~~~~d~i~~a~~~Lde~~---V-P~~gR~~vv~P~~~~~Ll~~~d~~~~~~d~~ 221 (332) ..| .|.- ..+..++ -..++++.+++.|..+..+|.... + |..--.++++|..|..|-. + + .++ T Consensus 176 ~~GllN~P~l~a~~t~~~~~w~~~T~~eI~~Di~~~~~~l~~qt~g~i~~~~~~tL~Lp~~~~~~L~~---~---n-~~g 248 (336) T protein:vir:10 176 NYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK---T---N-QYG 248 (336) T ss_pred eEEEeecCCCCcccccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHhccC---C---C-ccC Confidence 111 1100 0011111 123567899999999888886554 2 3333578899999887732 1 1 121 Q ss_pred cccccccccceeeeeeceEEEeeCcccccccccccccccccccccccccc--cceEEEeechhhhh-hhhhccceeeeee Q lcl|Aclame:pro 222 NSQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDA--SALAGLIFHREAAG-CIQSVAPTIQTTS 298 (332) Q Consensus 222 ~~~~~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~y~~~~--~~~~~l~~h~~a~~-~~~~~~~~~e~~~ 298 (332) .+-....+. +.=+++|...+.|-..+| +..--|..+. ..+..+.++..-.. -++...+. +. T Consensus 249 ~tv~~~lk~----n~Pnl~i~t~pel~~Agg---------~~~~~~~~~~~~~~t~~~~~P~~f~~lpvq~~~~~---~~ 312 (336) T protein:vir:10 249 LSAAAKLKE----IFPKLEFVTIPEYDTASG---------RLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSY---FR 312 (336) T ss_pred ccHHHHHHH----hCCccEEEEcccccccCC---------ceEEEEEecccCCcceeeecChhhhccceeecCce---eE Confidence 111111111 122567777666632111 1111121111 12233334332211 12333222 22 Q ss_pred cccchhHHHHHHHHHHHhCCceechhheeeeecC Q lcl|Aclame:pro 299 GDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) Q Consensus 299 ~~~~~~~~~d~i~~~~~~G~~vlrpe~~v~i~~A 332 (332) .+... ...|+-+.||-++.-+.== T Consensus 313 v~~~~----------rt~Gv~i~rP~ai~~~~GI 336 (336) T protein:vir:10 313 QKKSA----------GTWGAVIFRPFAVAQMLGV 336 (336) T ss_pred ecccc----------ceeeeeeeccchheeeccC Confidence 22221 2448888888877644333 No 208 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=20.14 E-value=2.8 Score=18.11 Aligned_cols=267 Identities=15% Similarity=0.081 Sum_probs=102.2 Q ss_pred CCCcccccccccccccccccccCchhhHHHHHHhHHHHHHHHHh-hhhccccccccccccceEEEecccce-eeeeecCC Q lcl|Aclame:pro 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNA-SIFKGLVRSYDLRGGKSKQFMFTGKL-SAGYHTPG 78 (332) Q Consensus 1 m~~~~~~~r~~~~~~~~~~~~~d~~~al~~e~f~g~V~~~f~~~-s~~~~~v~~r~~~~G~tv~i~~iG~~-t~~~~~~g 78 (332) |-- |+-+ -.+|+. -|.....++|+.. +-+..+.+ +.-+.+++-+-..+|.. .+.... | T Consensus 1 m~i----t~~~-------------l~~l~~-~~~~~~~~~y~~a~~~~~~~a~-~~~sdf~~~~~~~lg~~p~l~e~~-G 60 (302) T protein:vir:10 1 MLI----NKQS-------------LNAAFV-AIKTIFNNAFAAAPTTWQKIAM-EVPSNTSSNDYKWLSTFPKMRRWI-G 60 (302) T ss_pred Ccc----cHHH-------------HHHHHH-HHHHHHHHHHHhhhhhhhceee-ecCCCcceeeceecCCCCCccccc-c Confidence 321 1111 012222 3334444444433 22223322 22233444444555542 222221 3 Q ss_pred CCCCccCCCCCceEEEEEeeeeecchhhhhHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Q lcl|Aclame:pro 79 TPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGG 158 (332) Q Consensus 79 ~~~~~~~~~~~~~~~l~ID~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~aa~~~~~~~~~~~~ 158 (332) .... ..+....-++.+.++ --.+.|+.-+-.--.+..-..+.+++|++-++..|+.+...+..+... ....|..-+ T Consensus 61 e~~~--~~l~~~~~~i~~~~~-g~~v~i~R~~i~nDdlg~~~~~~~~~G~aaa~~~~~lv~~~L~~g~~~-~~~DG~~fF 136 (302) T protein:vir:10 61 AKVV--KNLKAYKYVVENEDF-EATVEVDRNDIEDDQIGIYSPQAKMAGYSAAQLPDELVYEAVNGAFTK-PCFDGQYFI 136 (302) T ss_pred ceee--ccccccceeEEeecc-cceecccHHhhcccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCC-cccCCccee Confidence 3222 234445566666654 233444332222223567778889999999999999999988653221 111222111 Q ss_pred ce---------eccccc------cccCHHHHHHHHHHHHHHH-HhcCCC--cCCCEEEEChHHHHH---HHhhcCchhhc Q lcl|Aclame:pro 159 FH---------VNIGAG------NTNDAQAIVDGFFEAAAVL-DERSAP--QEGRVAVLSPRQYYS---LISSVDTNILN 217 (332) Q Consensus 159 ~~---------i~~~~~------~~~~~~~~~d~i~~a~~~L-de~~VP--~~gR~~vv~P~~~~~---Ll~~~d~~~~~ 217 (332) .. .+++.+ ...+.. .+++.+.+..++ +...-| -..+++||+|..... ||.+ ++.. T Consensus 137 ~~dH~~g~~~~~N~g~~~~~~~~~~l~~~-~~~aa~~am~~~k~~~G~~L~i~P~~LiVp~~le~~A~~ll~~--~~~~- 212 (302) T protein:vir:10 137 DTDHPVGDASVSNKGTAPLSNASQAAAKA-GYGAARTAMKKFKDEEGRSLNVSPNVLLVGPALEDVAKMLLTN--PKLA- 212 (302) T ss_pred cccccccccccccccchhhhhcccccchH-HHHHHHHHHHHHhhhcccccccCCCEEEecchhHHHHHHHhhc--cccC- Confidence 10 011111 111122 233333333332 222223 345789999875433 3321 2211 Q ss_pred cccccccccccccceeeeeeceEEEeeCccccccccccccccccccccc-c-----------cccccceEEEeechhhhh Q lcl|Aclame:pro 218 REIGNSQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENND-Y-----------QVDASALAGLIFHREAAG 285 (332) Q Consensus 218 ~d~~~~~~~~~~g~~v~~i~G~~V~~sn~lp~~~g~~~~~~~~~g~~~~-y-----------~~~~~~~~~l~~h~~a~~ 285 (332) .+. .....|. ++++.++.|.. ++.+-..+.+..-.. | ..+++ .-++.+... T Consensus 213 ---~g~-~Np~~g~-------~~~vv~p~L~s--~~aWyL~a~~~~i~~~~l~g~~~P~~~~~~~~~-~dgv~~k~~--- 275 (302) T protein:vir:10 213 ---DNT-PNPYVGT-------AELVVDGRIES--DTAWFLLDTTKPVKPFIFQPRKQPEFVSQVNLD-SDDVFNLRK--- 275 (302) T ss_pred ---CCC-cceeccc-------eEEEEeeccCC--CCceEEEecCCccceEEEcCccccEEEeccCCC-CCceEEEEE--- Confidence 111 1222232 57777777742 222222221111000 0 00000 000000000 Q ss_pred hhhhccceeeeeecccchhHHHHHHHHHHHhCCceechhhe Q lcl|Aclame:pro 286 CIQSVAPTIQTTSGDFNVQYQGDLIVGKLAMGCGSLRTSVA 326 (332) Q Consensus 286 ~~~~~~~~~e~~~~~~~~~~~~d~i~~~~~~G~~vlrpe~~ 326 (332) .+..+ +.+...-|++. .++||.+. +++ T Consensus 276 ----~d~Gv----d~R~~~G~~~w---q~a~~s~g---~~~ 302 (302) T protein:vir:10 276 ----LKFGA----EARAAAGYGFW---QLAYGSTG---TGA 302 (302) T ss_pred ----EEEee----eeeeecchhhh---hhhhccCc---cCC Confidence 00000 11111123333 46777665 333 Done!