Query lcl|Aclame:protein:vir:739|NCBI_annot:major structural protein 4|genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Match_columns 231 No_of_seqs 119 out of 163 Neff 8.1 Searched_HMMs 1612 Date Sat Nov 30 05:15:11 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_37 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_37_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:739 Length: 231 # 100.0 4.8E-75 3E-78 428.0 23.0 231 1-231 1-231 (231) 2 protein:vir:3613 Length: 272 # 100.0 1.2E-62 7.4E-66 360.1 23.7 231 1-231 40-272 (272) 3 protein:vir:95107 Length: 270 100.0 7.6E-59 4.7E-62 339.2 22.4 225 1-231 38-265 (270) 4 protein:vir:105334 Length: 276 100.0 4.9E-58 3.1E-61 334.8 23.3 226 1-231 40-270 (276) 5 protein:vir:96833 Length: 275 100.0 9.3E-58 5.8E-61 333.3 23.4 227 1-231 41-271 (275) 6 protein:vir:1239 Length: 274 # 100.0 1.3E-56 7.8E-60 327.1 23.6 226 1-231 40-270 (274) 7 protein:vir:97433 Length: 274 100.0 2.5E-56 1.5E-59 325.4 24.1 226 1-231 40-270 (274) 8 protein:vir:94494 Length: 274 100.0 2.5E-56 1.5E-59 325.4 24.1 226 1-231 40-270 (274) 9 protein:vir:96262 Length: 274 100.0 3.8E-56 2.3E-59 324.4 23.3 227 1-231 40-270 (274) 10 protein:vir:95898 Length: 274 100.0 3.8E-56 2.3E-59 324.4 23.3 227 1-231 40-270 (274) 11 protein:vir:93742 Length: 274 100.0 7.3E-55 4.5E-58 317.4 24.1 227 1-231 40-270 (274) 12 protein:vir:96123 Length: 274 100.0 6.7E-55 4.2E-58 317.6 23.9 226 1-231 40-270 (274) 13 protein:vir:80930 Length: 278 100.0 3.6E-53 2.2E-56 308.1 23.6 227 1-231 40-277 (278) 14 protein:vir:3033 Length: 272 # 100.0 8.8E-51 5.4E-54 295.0 23.6 227 1-231 40-269 (272) 15 protein:vir:9820 Length: 272 # 100.0 8.8E-51 5.4E-54 295.0 23.6 227 1-231 40-269 (272) 16 protein:vir:5974 Length: 324 # 100.0 2.4E-40 1.5E-43 237.7 21.5 222 1-231 45-291 (324) 17 protein:vir:102944 Length: 330 100.0 7E-40 4.3E-43 235.3 19.6 226 1-231 45-297 (330) 18 protein:vir:7990 Length: 273 # 100.0 4.4E-39 2.7E-42 230.9 21.9 228 1-231 34-273 (273) 19 protein:vir:105822 Length: 273 100.0 4.5E-38 2.8E-41 225.3 21.8 228 1-231 34-273 (273) 20 protein:vir:102605 Length: 273 100.0 4.5E-38 2.8E-41 225.3 21.8 228 1-231 34-273 (273) 21 protein:vir:1583 Length: 351 # 100.0 1.8E-37 1.1E-40 222.1 20.4 221 1-231 43-295 (351) 22 protein:vir:1541 Length: 347 # 100.0 2.1E-34 1.3E-37 205.2 16.9 229 1-231 51-345 (347) 23 protein:vir:94622 Length: 341 100.0 2.9E-34 1.8E-37 204.5 16.3 229 1-231 47-339 (341) 24 protein:vir:3364 Length: 347 # 100.0 4.5E-34 2.8E-37 203.4 16.0 229 1-231 51-345 (347) 25 protein:vir:9927 Length: 295 # 100.0 6.2E-34 3.8E-37 202.7 15.5 224 1-231 40-288 (295) 26 protein:vir:78739 Length: 332 100.0 3.5E-33 2.2E-36 198.5 15.0 227 1-229 54-332 (332) 27 protein:vir:2201 Length: 345 # 100.0 3.3E-33 2E-36 198.7 12.9 229 1-231 52-345 (345) 28 protein:vir:94711 Length: 347 100.0 5.9E-33 3.7E-36 197.3 14.0 228 1-231 51-346 (347) 29 protein:vir:9875 Length: 296 # 100.0 3.1E-32 1.9E-35 193.3 17.1 221 1-231 46-295 (296) 30 protein:vir:10450 Length: 344 100.0 8.8E-33 5.5E-36 196.3 13.4 229 1-231 52-344 (344) 31 protein:vir:99675 Length: 324 100.0 1.9E-32 1.2E-35 194.5 13.9 229 1-231 2-296 (324) 32 protein:vir:80213 Length: 334 100.0 1.9E-32 1.2E-35 194.4 13.3 230 1-231 48-332 (334) 33 protein:vir:80446 Length: 367 100.0 6.4E-31 4E-34 186.1 19.0 222 1-231 47-319 (367) 34 protein:vir:94576 Length: 347 99.9 2.5E-31 1.5E-34 188.4 13.4 229 1-231 51-347 (347) 35 protein:vir:8885 Length: 347 # 99.9 3.9E-31 2.4E-34 187.3 14.3 228 1-231 52-346 (347) 36 protein:vir:106647 Length: 303 99.9 6.7E-31 4.2E-34 186.0 14.4 224 1-231 41-296 (303) 37 protein:vir:100057 Length: 375 99.9 8E-30 4.9E-33 180.1 17.7 229 1-231 54-370 (375) 38 protein:vir:80180 Length: 381 99.9 1.1E-30 6.7E-34 184.9 12.7 230 1-231 50-381 (381) 39 protein:vir:103323 Length: 364 99.9 1.9E-29 1.2E-32 178.0 18.0 230 1-231 46-339 (364) 40 protein:vir:6324 Length: 335 # 99.9 8.3E-29 5.2E-32 174.5 15.2 230 1-231 46-328 (335) 41 protein:vir:191 Length: 385 # 99.9 1.9E-27 1.2E-30 167.1 20.1 224 1-231 131-384 (385) 42 protein:vir:1886 Length: 385 # 99.9 1.9E-27 1.2E-30 167.1 20.1 224 1-231 131-384 (385) 43 protein:vir:101607 Length: 379 99.9 1.8E-27 1.1E-30 167.2 19.8 226 1-231 142-379 (379) 44 protein:vir:78935 Length: 335 99.9 2.9E-28 1.8E-31 171.6 14.5 230 1-231 46-328 (335) 45 protein:vir:3136 Length: 322 # 99.9 1.3E-28 8.1E-32 173.4 12.5 224 1-231 39-318 (322) 46 protein:vir:94673 Length: 419 99.9 3.1E-27 1.9E-30 165.9 19.7 226 1-231 159-417 (419) 47 protein:vir:94989 Length: 349 99.9 3.4E-27 2.1E-30 165.7 19.4 222 1-231 45-306 (349) 48 protein:vir:78387 Length: 349 99.9 3.5E-27 2.2E-30 165.6 19.2 222 1-231 45-306 (349) 49 protein:vir:41 Length: 299 # N 99.9 4.1E-27 2.5E-30 165.2 19.5 227 1-231 33-298 (299) 50 protein:vir:108211 Length: 318 99.9 1.4E-27 8.7E-31 167.8 15.9 227 1-231 48-317 (318) 51 protein:vir:81070 Length: 390 99.9 6E-27 3.7E-30 164.3 19.2 222 1-229 148-390 (390) 52 protein:vir:97053 Length: 390 99.9 7.4E-27 4.6E-30 163.9 19.5 221 1-229 149-390 (390) 53 protein:vir:10364 Length: 390 99.9 2.1E-26 1.3E-29 161.4 19.9 222 1-229 147-390 (390) 54 protein:vir:2344 Length: 397 # 99.9 1.9E-26 1.2E-29 161.6 19.3 229 1-231 45-306 (397) 55 protein:vir:99749 Length: 324 99.9 2.6E-26 1.6E-29 160.9 20.0 223 1-231 62-315 (324) 56 protein:vir:9309 Length: 324 # 99.9 3.6E-26 2.2E-29 160.1 20.1 223 1-231 59-315 (324) 57 protein:vir:102655 Length: 322 99.9 1E-26 6.2E-30 163.1 16.8 230 1-231 47-321 (322) 58 protein:vir:100135 Length: 418 99.9 3.9E-26 2.4E-29 159.9 20.0 224 1-231 170-415 (418) 59 protein:vir:6242 Length: 390 # 99.9 2.1E-26 1.3E-29 161.3 17.9 225 1-231 147-389 (390) 60 protein:vir:1328 Length: 392 # 99.9 3.8E-26 2.4E-29 159.9 19.2 225 1-231 147-391 (392) 61 protein:vir:4339 Length: 395 # 99.9 6.5E-26 4E-29 158.7 20.2 224 1-231 148-395 (395) 62 protein:vir:97148 Length: 324 99.9 6E-26 3.7E-29 158.9 20.0 223 1-231 62-315 (324) 63 protein:vir:9759 Length: 303 # 99.9 5.6E-26 3.5E-29 159.0 19.0 230 1-231 31-303 (303) 64 protein:vir:4856 Length: 293 # 99.9 9.2E-26 5.7E-29 157.8 20.2 228 1-231 42-281 (293) 65 protein:vir:103955 Length: 324 99.9 9.5E-26 5.9E-29 157.8 19.9 223 1-231 62-315 (324) 66 protein:vir:9410 Length: 415 # 99.9 6.1E-26 3.8E-29 158.8 18.8 228 1-231 158-404 (415) 67 protein:vir:96223 Length: 324 99.9 1.4E-25 8.5E-29 156.9 19.9 223 1-231 62-315 (324) 68 protein:vir:7771 Length: 330 # 99.9 1.4E-25 8.6E-29 156.9 19.7 229 1-231 44-323 (330) 69 protein:vir:4953 Length: 397 # 99.9 9.6E-26 5.9E-29 157.7 18.8 227 1-231 146-385 (397) 70 protein:vir:81100 Length: 415 99.9 1.1E-25 6.8E-29 157.4 19.0 228 1-231 158-404 (415) 71 protein:vir:79987 Length: 415 99.9 1.1E-25 6.8E-29 157.4 19.0 228 1-231 158-404 (415) 72 protein:vir:98339 Length: 415 99.9 1.1E-25 6.8E-29 157.4 19.0 228 1-231 158-404 (415) 73 protein:vir:97031 Length: 402 99.9 8.1E-27 5E-30 163.6 12.3 230 1-231 46-338 (402) 74 protein:vir:9574 Length: 300 # 99.9 1.5E-25 9.2E-29 156.7 19.2 228 1-231 35-300 (300) 75 protein:vir:4600 Length: 415 # 99.9 1.6E-25 1E-28 156.5 18.9 228 1-231 158-404 (415) 76 protein:vir:4700 Length: 415 # 99.9 1.6E-25 1E-28 156.5 18.9 228 1-231 158-404 (415) 77 protein:vir:4997 Length: 397 # 99.9 3.4E-25 2.1E-28 154.8 18.8 228 1-231 146-385 (397) 78 protein:vir:96392 Length: 324 99.9 5.5E-25 3.4E-28 153.6 19.9 223 1-231 62-315 (324) 79 protein:vir:78830 Length: 324 99.9 5.5E-25 3.4E-28 153.6 19.9 223 1-231 62-315 (324) 80 protein:vir:95763 Length: 297 99.9 5.8E-25 3.6E-28 153.5 19.9 220 1-231 45-296 (297) 81 protein:vir:485 Length: 407 # 99.9 3.6E-25 2.2E-28 154.6 18.6 229 1-231 138-400 (407) 82 protein:vir:105905 Length: 304 99.9 4.2E-25 2.6E-28 154.2 18.6 219 1-230 46-304 (304) 83 protein:vir:94142 Length: 304 99.9 4.2E-25 2.6E-28 154.2 18.6 219 1-230 46-304 (304) 84 protein:vir:8102 Length: 543 # 99.9 8.6E-25 5.3E-28 152.5 19.2 227 1-231 285-542 (543) 85 protein:vir:4830 Length: 397 # 99.9 7.5E-25 4.7E-28 152.8 18.9 227 1-231 147-385 (397) 86 protein:vir:100247 Length: 425 99.9 6.1E-25 3.8E-28 153.3 18.3 229 1-231 162-424 (425) 87 protein:vir:104256 Length: 458 99.9 1.1E-24 6.9E-28 151.9 18.9 230 1-231 198-458 (458) 88 protein:vir:7409 Length: 408 # 99.9 1.2E-24 7.5E-28 151.7 18.8 228 1-231 153-393 (408) 89 protein:vir:2430 Length: 318 # 99.9 2.1E-24 1.3E-27 150.4 19.8 227 1-231 49-313 (318) 90 protein:vir:1025 Length: 408 # 99.9 2.2E-24 1.4E-27 150.3 19.3 228 1-231 153-393 (408) 91 protein:vir:4456 Length: 401 # 99.9 1.2E-24 7.6E-28 151.7 17.8 229 1-231 142-401 (401) 92 protein:vir:3991 Length: 404 # 99.9 2.4E-24 1.5E-27 150.1 19.2 228 1-231 153-393 (404) 93 protein:vir:104085 Length: 320 99.9 3E-24 1.9E-27 149.5 19.6 227 1-231 48-317 (320) 94 protein:vir:4226 Length: 326 # 99.9 3.7E-24 2.3E-27 149.1 19.5 227 1-231 46-323 (326) 95 protein:vir:94771 Length: 298 99.9 8.2E-24 5.1E-27 147.2 20.4 228 1-230 32-298 (298) 96 protein:vir:1638 Length: 298 # 99.9 7.9E-24 4.9E-27 147.2 19.6 228 1-230 29-298 (298) 97 protein:vir:80684 Length: 315 99.9 8.2E-24 5.1E-27 147.1 19.3 228 1-231 36-306 (315) 98 protein:vir:81227 Length: 413 99.9 1.5E-23 9.3E-27 145.7 19.9 225 1-231 153-410 (413) 99 protein:vir:78223 Length: 333 99.9 1.2E-23 7.4E-27 146.3 19.2 229 1-231 43-332 (333) 100 protein:vir:7019 Length: 401 # 99.9 3.9E-25 2.4E-28 154.4 10.8 230 1-231 46-333 (401) 101 protein:vir:78523 Length: 338 99.9 2E-23 1.2E-26 145.1 19.3 231 1-231 51-335 (338) 102 protein:vir:81160 Length: 371 99.9 3.1E-23 1.9E-26 144.0 19.4 227 1-231 128-371 (371) 103 protein:vir:3845 Length: 395 # 99.9 3.3E-23 2E-26 143.9 19.1 228 1-231 144-383 (395) 104 protein:vir:8187 Length: 311 # 99.9 4.4E-23 2.7E-26 143.1 19.6 227 1-231 34-310 (311) 105 protein:vir:7855 Length: 497 # 99.9 3.4E-23 2.1E-26 143.8 18.8 228 1-231 186-493 (497) 106 protein:vir:101650 Length: 497 99.9 3.4E-23 2.1E-26 143.8 18.8 228 1-231 186-493 (497) 107 protein:vir:1268 Length: 397 # 99.9 3.1E-23 1.9E-26 144.0 18.5 227 1-231 160-397 (397) 108 protein:vir:99075 Length: 392 99.8 3.2E-23 2E-26 143.9 17.8 227 1-231 34-302 (392) 109 protein:vir:107120 Length: 329 99.8 1.2E-22 7.4E-26 140.8 20.7 227 1-231 68-307 (329) 110 protein:vir:1084 Length: 437 # 99.8 3.9E-23 2.4E-26 143.4 17.9 228 1-231 187-429 (437) 111 protein:vir:102119 Length: 404 99.8 4.6E-23 2.8E-26 143.1 18.3 229 1-231 147-400 (404) 112 protein:vir:100172 Length: 394 99.8 1.2E-22 7.7E-26 140.7 19.8 229 1-231 143-384 (394) 113 protein:vir:1383 Length: 421 # 99.8 8.8E-23 5.4E-26 141.5 18.0 227 1-231 141-383 (421) 114 protein:vir:4511 Length: 409 # 99.8 6.1E-23 3.8E-26 142.4 17.0 229 1-231 153-406 (409) 115 protein:vir:3870 Length: 400 # 99.8 1E-22 6.5E-26 141.1 18.1 225 1-231 165-399 (400) 116 protein:vir:9704 Length: 394 # 99.8 2.2E-22 1.3E-25 139.4 18.8 222 1-231 160-390 (394) 117 protein:vir:97331 Length: 319 99.8 2.4E-22 1.5E-25 139.1 19.0 225 1-231 59-296 (319) 118 protein:vir:94800 Length: 319 99.8 2.4E-22 1.5E-25 139.1 19.0 225 1-231 59-296 (319) 119 protein:vir:93616 Length: 645 99.8 2E-22 1.3E-25 139.5 18.5 224 1-231 370-637 (645) 120 protein:vir:100884 Length: 389 99.8 4E-22 2.5E-25 137.9 19.8 229 1-231 143-382 (389) 121 protein:vir:105645 Length: 400 99.8 1.4E-23 8.7E-27 145.9 11.6 230 1-231 46-333 (400) 122 protein:vir:96762 Length: 632 99.8 1.3E-22 8.1E-26 140.6 16.6 219 1-230 390-632 (632) 123 protein:vir:2504 Length: 305 # 99.8 1.8E-22 1.1E-25 139.7 16.7 222 1-231 36-298 (305) 124 protein:vir:6212 Length: 434 # 99.8 6E-22 3.7E-25 136.9 17.6 231 1-231 176-429 (434) 125 protein:vir:80376 Length: 435 99.8 7.6E-22 4.7E-25 136.4 18.0 225 1-231 164-433 (435) 126 protein:vir:95131 Length: 325 99.8 1.5E-21 9.6E-25 134.7 19.7 227 1-231 39-294 (325) 127 protein:vir:102082 Length: 392 99.8 1.6E-21 1E-24 134.6 19.0 226 1-231 143-384 (392) 128 protein:vir:102873 Length: 392 99.8 1.6E-21 1E-24 134.6 19.0 226 1-231 143-384 (392) 129 protein:vir:107593 Length: 392 99.8 1.6E-21 1E-24 134.6 19.0 226 1-231 143-384 (392) 130 protein:vir:105004 Length: 392 99.8 1.6E-21 1E-24 134.6 19.0 226 1-231 143-384 (392) 131 protein:vir:79928 Length: 393 99.8 8.5E-22 5.3E-25 136.1 16.7 231 1-231 107-381 (393) 132 protein:vir:108303 Length: 418 99.8 1.6E-21 9.7E-25 134.6 17.9 223 1-231 38-279 (418) 133 protein:vir:962 Length: 397 # 99.8 1.5E-21 9.1E-25 134.8 17.2 225 1-231 166-397 (397) 134 protein:vir:5739 Length: 366 # 99.8 1.9E-21 1.2E-24 134.2 17.7 224 1-231 97-366 (366) 135 protein:vir:1433 Length: 435 # 99.8 3E-21 1.9E-24 133.1 18.7 224 1-231 164-433 (435) 136 protein:vir:78640 Length: 352 99.8 4.1E-22 2.5E-25 137.8 13.9 216 1-231 117-346 (352) 137 protein:vir:94424 Length: 387 99.8 3.6E-22 2.2E-25 138.1 13.2 216 1-231 152-381 (387) 138 protein:vir:96978 Length: 387 99.8 3.6E-22 2.2E-25 138.1 13.2 216 1-231 152-381 (387) 139 protein:vir:2685 Length: 387 # 99.8 3.6E-22 2.2E-25 138.1 13.2 216 1-231 152-381 (387) 140 protein:vir:9361 Length: 402 # 99.8 4.7E-22 2.9E-25 137.5 13.5 216 1-231 167-396 (402) 141 protein:vir:99920 Length: 311 99.8 3.3E-21 2.1E-24 132.9 18.0 228 1-230 32-311 (311) 142 protein:vir:105038 Length: 428 99.8 6.2E-21 3.9E-24 131.4 18.3 225 1-231 158-428 (428) 143 protein:vir:3525 Length: 423 # 99.8 2.5E-21 1.6E-24 133.5 15.7 224 1-231 36-309 (423) 144 protein:vir:95376 Length: 425 99.8 7.2E-21 4.4E-24 131.0 17.9 221 1-231 174-421 (425) 145 protein:vir:93881 Length: 387 99.8 2.8E-21 1.7E-24 133.3 14.1 216 1-231 152-381 (387) 146 protein:vir:174 Length: 423 # 99.8 1.4E-20 8.8E-24 129.4 16.8 226 1-231 36-302 (423) 147 protein:vir:4092 Length: 390 # 99.8 3.3E-20 2.1E-23 127.4 18.4 221 1-231 119-368 (390) 148 protein:vir:105374 Length: 423 99.7 6.9E-20 4.3E-23 125.6 16.3 229 1-231 36-300 (423) 149 protein:vir:8420 Length: 477 # 99.7 4E-20 2.5E-23 126.9 13.8 230 1-231 194-471 (477) 150 protein:vir:105522 Length: 423 99.7 1.1E-18 6.7E-22 119.1 16.8 225 1-231 36-302 (423) 151 protein:vir:80128 Length: 466 99.7 1.2E-18 7.4E-22 118.9 15.7 224 1-231 183-448 (466) 152 protein:vir:95963 Length: 395 99.6 2.4E-17 1.5E-20 111.7 15.7 217 1-231 120-376 (395) 153 protein:vir:9643 Length: 377 # 99.6 7.4E-17 4.6E-20 109.0 15.9 217 1-231 113-377 (377) 154 protein:vir:98635 Length: 377 99.6 1.4E-17 8.9E-21 112.9 11.7 224 1-231 116-377 (377) 155 protein:vir:101291 Length: 381 99.6 6.6E-17 4.1E-20 109.3 15.3 217 1-231 110-370 (381) 156 protein:vir:9509 Length: 381 # 99.6 6.6E-17 4.1E-20 109.3 15.3 217 1-231 110-370 (381) 157 protein:vir:4197 Length: 314 # 99.6 7.9E-16 4.9E-19 103.4 19.8 227 1-231 39-311 (314) 158 protein:vir:100632 Length: 381 99.6 1.1E-16 6.9E-20 108.1 14.6 219 1-231 110-368 (381) 159 protein:vir:79008 Length: 299 99.5 1.7E-15 1.1E-18 101.5 20.5 228 1-231 33-297 (299) 160 protein:vir:1781 Length: 221 # 99.5 8.2E-17 5.1E-20 108.8 11.8 176 42-231 1-202 (221) 161 protein:vir:78920 Length: 290 99.5 3.1E-15 1.9E-18 100.1 19.6 229 1-231 30-290 (290) 162 protein:vir:96792 Length: 315 99.5 3.7E-15 2.3E-18 99.7 19.5 221 1-231 42-281 (315) 163 protein:vir:4159 Length: 315 # 99.5 3.1E-15 1.9E-18 100.2 18.7 224 1-228 44-315 (315) 164 protein:vir:8324 Length: 410 # 99.5 4.1E-16 2.5E-19 104.9 12.9 222 1-229 167-410 (410) 165 protein:vir:78350 Length: 383 99.5 5.9E-16 3.6E-19 104.1 12.4 216 1-231 117-374 (383) 166 protein:vir:95875 Length: 401 99.4 4.1E-14 2.5E-17 94.0 15.1 231 1-231 50-400 (401) 167 protein:vir:3158 Length: 321 # 99.3 4.5E-13 2.8E-16 88.3 17.7 223 1-231 55-311 (321) 168 protein:vir:97397 Length: 517 99.3 2.6E-13 1.6E-16 89.6 15.4 224 1-231 266-514 (517) 169 protein:vir:105464 Length: 346 99.3 1.1E-12 7E-16 86.1 18.5 229 1-231 32-299 (346) 170 protein:vir:102335 Length: 312 99.1 8.3E-12 5.1E-15 81.3 17.2 229 1-231 29-308 (312) 171 protein:vir:79712 Length: 285 99.0 6.1E-11 3.8E-14 76.6 16.5 229 1-231 34-285 (285) 172 protein:vir:4074 Length: 480 # 99.0 6.3E-12 3.9E-15 82.0 10.4 221 1-231 213-477 (480) 173 protein:vir:2106 Length: 430 # 99.0 4.6E-11 2.9E-14 77.2 14.6 225 1-231 38-300 (430) 174 protein:vir:100939 Length: 430 98.9 5.9E-11 3.7E-14 76.6 14.0 224 1-231 38-300 (430) 175 protein:vir:9265 Length: 430 # 98.9 5.9E-11 3.7E-14 76.6 14.0 224 1-231 38-300 (430) 176 protein:vir:94933 Length: 330 98.9 8.2E-10 5.1E-13 70.4 19.1 229 1-231 52-329 (330) 177 protein:vir:99523 Length: 311 98.9 6.8E-10 4.2E-13 70.9 18.4 230 1-231 37-311 (311) 178 protein:vir:79548 Length: 652 98.8 3E-09 1.8E-12 67.3 17.7 223 1-228 392-652 (652) 179 protein:vir:97255 Length: 310 98.7 1.3E-08 8.2E-12 63.8 20.3 226 1-231 41-308 (310) 180 protein:vir:78090 Length: 302 98.7 6.9E-09 4.3E-12 65.3 18.1 227 1-231 34-301 (302) 181 protein:vir:95512 Length: 693 98.6 9.1E-09 5.7E-12 64.7 17.0 224 1-229 435-693 (693) 182 protein:vir:93696 Length: 364 98.6 2.6E-09 1.6E-12 67.7 13.8 228 1-231 49-361 (364) 183 protein:vir:95451 Length: 313 98.6 1.4E-09 8.4E-13 69.2 11.6 229 1-231 31-311 (313) 184 protein:vir:104439 Length: 404 98.3 8.1E-08 5E-11 59.5 14.3 222 1-226 62-404 (404) 185 protein:vir:3298 Length: 404 # 98.3 8.1E-08 5E-11 59.5 14.3 222 1-226 62-404 (404) 186 protein:vir:819 Length: 404 # 98.3 8.1E-08 5E-11 59.5 14.3 222 1-226 62-404 (404) 187 protein:vir:10123 Length: 404 98.3 8.1E-08 5E-11 59.5 14.3 222 1-226 62-404 (404) 188 protein:vir:103285 Length: 296 98.3 3.4E-07 2.1E-10 56.0 16.7 225 1-229 41-296 (296) 189 protein:vir:105610 Length: 430 98.2 1E-07 6.3E-11 58.9 13.0 229 1-231 67-424 (430) 190 protein:vir:2770 Length: 318 # 98.2 4E-07 2.5E-10 55.7 15.1 177 1-193 62-318 (318) 191 protein:vir:80068 Length: 301 98.0 2.2E-06 1.3E-09 51.6 16.6 225 1-231 38-301 (301) 192 protein:vir:104342 Length: 314 98.0 1.7E-06 1.1E-09 52.2 15.5 225 1-229 59-314 (314) 193 protein:vir:107687 Length: 319 97.9 2.8E-06 1.7E-09 51.0 15.4 224 1-231 63-319 (319) 194 protein:vir:103886 Length: 302 97.9 6.9E-06 4.3E-09 48.9 17.1 220 1-231 32-302 (302) 195 protein:vir:8843 Length: 317 # 97.8 1.8E-05 1.1E-08 46.6 20.0 221 1-231 41-315 (317) 196 protein:vir:99424 Length: 360 97.6 3.3E-05 2.1E-08 45.1 17.3 225 1-231 47-357 (360) 197 protein:vir:79642 Length: 329 97.5 2.5E-05 1.6E-08 45.8 15.7 226 1-231 68-328 (329) 198 protein:vir:5942 Length: 523 # 96.7 0.00021 1.3E-07 40.8 13.6 224 1-231 222-521 (523) 199 protein:vir:78148 Length: 123 96.5 3.3E-05 2.1E-08 45.1 8.0 111 121-231 1-123 (123) 200 protein:vir:107882 Length: 307 96.1 0.001 6.3E-07 37.0 15.0 228 1-231 36-306 (307) 201 protein:vir:79078 Length: 307 95.7 0.0015 9.5E-07 36.0 14.5 229 1-231 22-306 (307) 202 protein:vir:5255 Length: 304 # 95.1 0.0027 1.7E-06 34.7 13.4 223 1-228 35-304 (304) 203 protein:vir:103370 Length: 418 94.9 0.0032 2E-06 34.2 15.8 223 1-231 89-406 (418) 204 protein:vir:93858 Length: 400 94.8 0.0025 1.6E-06 34.8 11.6 221 1-229 119-400 (400) 205 protein:vir:103181 Length: 457 94.1 0.0053 3.3E-06 33.1 16.2 229 1-231 152-440 (457) 206 protein:vir:96490 Length: 348 93.8 0.0064 4E-06 32.6 12.9 227 1-231 34-348 (348) 207 protein:vir:78558 Length: 336 93.3 0.0074 4.6E-06 32.3 11.3 217 1-231 78-336 (336) 208 protein:vir:5670 Length: 514 # 93.0 0.0093 5.8E-06 31.7 15.6 224 1-231 176-514 (514) 209 protein:vir:2736 Length: 348 # 92.5 0.011 6.9E-06 31.3 13.3 225 1-231 23-348 (348) 210 protein:vir:348 Length: 321 # 91.8 0.014 8.7E-06 30.7 14.0 223 1-229 41-321 (321) 211 protein:vir:101039 Length: 529 91.7 0.015 9.1E-06 30.6 16.7 230 1-231 222-529 (529) 212 protein:vir:104549 Length: 462 91.7 0.015 9.2E-06 30.6 16.8 229 1-231 157-461 (462) 213 protein:vir:106998 Length: 468 91.6 0.015 9.5E-06 30.5 15.7 228 1-231 153-467 (468) 214 protein:vir:94070 Length: 339 91.0 0.018 1.1E-05 30.1 15.2 217 1-231 83-339 (339) 215 protein:vir:103463 Length: 521 90.3 0.022 1.3E-05 29.7 17.6 225 1-231 213-521 (521) 216 protein:vir:101557 Length: 336 90.1 0.023 1.4E-05 29.6 11.6 219 1-231 76-336 (336) 217 protein:vir:99888 Length: 309 89.9 0.024 1.5E-05 29.5 15.2 228 1-231 20-308 (309) 218 protein:vir:95318 Length: 328 89.2 0.028 1.7E-05 29.1 13.3 171 1-175 47-328 (328) 219 protein:vir:3643 Length: 336 # 89.2 0.028 1.7E-05 29.1 11.7 217 1-231 78-336 (336) 220 protein:vir:6901 Length: 522 # 88.7 0.031 1.9E-05 28.9 17.8 226 1-231 214-522 (522) 221 protein:vir:96442 Length: 418 88.2 0.034 2.1E-05 28.7 17.2 224 1-231 89-406 (418) 222 protein:vir:4902 Length: 348 # 87.6 0.038 2.3E-05 28.4 12.7 227 1-231 35-348 (348) 223 protein:vir:98871 Length: 314 87.3 0.04 2.5E-05 28.3 12.6 220 1-231 60-311 (314) 224 protein:vir:104915 Length: 470 86.7 0.044 2.7E-05 28.0 16.3 228 1-231 166-469 (470) 225 protein:vir:80986 Length: 528 86.4 0.046 2.9E-05 27.9 17.0 228 1-231 210-528 (528) 226 protein:vir:107826 Length: 331 86.0 0.049 3E-05 27.8 13.9 172 1-175 48-331 (331) 227 protein:vir:98525 Length: 331 86.0 0.049 3E-05 27.8 13.9 172 1-175 48-331 (331) 228 protein:vir:107388 Length: 331 86.0 0.049 3E-05 27.8 13.9 172 1-175 48-331 (331) 229 protein:vir:101811 Length: 529 85.8 0.05 3.1E-05 27.7 17.4 230 1-231 222-529 (529) 230 protein:vir:107947 Length: 519 85.5 0.052 3.2E-05 27.6 17.2 224 1-231 193-519 (519) 231 protein:vir:106734 Length: 336 85.4 0.053 3.3E-05 27.6 11.4 219 1-231 66-336 (336) 232 protein:vir:106286 Length: 534 84.6 0.059 3.7E-05 27.3 14.9 226 1-231 194-534 (534) 233 protein:vir:80835 Length: 464 81.8 0.083 5.1E-05 26.5 12.1 226 1-231 70-336 (464) 234 protein:vir:7214 Length: 521 # 80.6 0.093 5.8E-05 26.2 17.9 225 1-231 195-521 (521) 235 protein:vir:98143 Length: 524 80.5 0.094 5.8E-05 26.2 17.1 226 1-231 204-524 (524) 236 protein:vir:100603 Length: 529 78.3 0.12 7.2E-05 25.7 18.5 228 1-231 222-529 (529) 237 protein:vir:6601 Length: 528 # 75.4 0.15 9.1E-05 25.1 17.1 228 1-231 203-528 (528) 238 protein:vir:107732 Length: 379 75.3 0.15 9.2E-05 25.1 11.6 218 1-229 104-379 (379) 239 protein:vir:96079 Length: 382 74.9 0.15 9.5E-05 25.1 11.0 216 1-231 106-382 (382) 240 protein:vir:1991 Length: 305 # 74.8 0.15 9.5E-05 25.0 10.9 160 1-167 32-305 (305) 241 protein:vir:98480 Length: 348 74.3 0.16 9.9E-05 25.0 16.9 230 1-230 27-348 (348) 242 protein:vir:3969 Length: 287 # 71.2 0.2 0.00012 24.4 11.5 219 1-231 35-286 (287) 243 protein:vir:106590 Length: 349 69.9 0.22 0.00013 24.2 17.9 221 1-229 34-349 (349) 244 protein:vir:94528 Length: 286 65.3 0.29 0.00018 23.6 10.5 215 1-230 41-286 (286) 245 protein:vir:103759 Length: 330 62.2 0.34 0.00021 23.2 12.3 172 1-175 46-330 (330) 246 protein:vir:7324 Length: 335 # 57.7 0.43 0.00027 22.6 13.0 173 1-176 46-335 (335) 247 protein:vir:6378 Length: 346 # 55.7 0.47 0.00029 22.4 20.3 228 1-229 29-346 (346) 248 protein:vir:99576 Length: 388 51.2 0.59 0.00036 21.9 11.8 219 1-231 110-388 (388) 249 protein:vir:3424 Length: 341 # 50.6 0.6 0.00038 21.8 21.0 223 1-229 30-341 (341) 250 protein:vir:99311 Length: 463 42.0 0.9 0.00056 20.8 13.6 223 1-231 74-339 (463) 251 protein:vir:95603 Length: 463 42.0 0.9 0.00056 20.8 13.6 223 1-231 74-339 (463) 252 protein:vir:63741 Length: 468 36.2 1.2 0.00073 20.2 12.8 220 1-231 74-339 (468) 253 protein:vir:80491 Length: 467 35.0 1.3 0.00078 20.0 12.8 220 1-231 73-338 (467) 254 protein:vir:96666 Length: 462 34.6 1.3 0.0008 20.0 15.1 223 1-231 74-339 (462) 255 protein:vir:393 Length: 341 # 33.8 1.3 0.00083 19.9 19.5 227 1-229 30-341 (341) No 1 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=100.00 E-value=4.8e-75 Score=428.04 Aligned_cols=231 Identities=100% Similarity=1.304 Sum_probs=228.0 Q ss_pred CCCcccCceEEeccccCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVD 80 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~~vd 80 (231) |||+|+||||+||+||||+++++||++|++++|++++.+++|+|+||+|+|+|++.+++++||++++.+|++++||+++| T Consensus 1 ~~~~~~Gdtit~P~~iGda~~v~eG~~i~~~~l~~t~~~atIk~~gk~~~itD~a~l~~~gDp~~ea~~Q~~~~iA~kvD 80 (231) T protein:vir:73 1 ENGINLANLCEYPNDIGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVD 80 (231) T ss_pred CccccCCceEEecccccchhhhcCCCcCChhhccccceeeeEeeeccceeeeHHHHhhccCchHHHHHHHHHHHHHHhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhcccccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccccccCceeeeccceeeccee Q lcl|Aclame:pro 81 DDLLKAAKTTSQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQ 160 (231) Q Consensus 81 ~~~~~~l~t~~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~ 160 (231) ++++++++++++++++.+++|+|++|+++|+|+++.++++||||+++++|||++++....++.++++++||+||+++|+| T Consensus 81 ~di~~~~~~a~l~~~~~~t~d~i~~A~~~fgde~~~~~vivv~p~~~~~Lrk~~~~~~~~~~~g~~i~~~G~iG~i~G~~ 160 (231) T protein:vir:73 81 DDLLKAAKTTSQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQ 160 (231) T ss_pred HHHHHhhccccccccccccHHHHHHHHHHhccccccceEEEEcchHHHhhhhccchhhhhhhhccceeeecccceEcceE Confidence 99999999999999999999999999999999999999999999999999999999888888999999999999999999 Q ss_pred EEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 161 IVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 161 Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) |++|+++|.++++..+++..++|++++.|+++++|++||+++++|.+++++||+++++||+++|++||+|| T Consensus 161 Vi~S~~~~~~~~~~~~~i~~~gAl~~~~k~~~~vEtdRd~~~k~~~i~~~~~y~v~l~~~~~vv~~t~~g~ 231 (231) T protein:vir:73 161 IVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) T ss_pred EEEcCCCCCCceeeeeEEeeccceeeeecccceeeccccccccccEEEEeEEEEEEEEcCccEEEEEeecC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=100.00 E-value=1.2e-62 Score=360.07 Aligned_cols=231 Identities=94% Similarity=1.192 Sum_probs=223.5 Q ss_pred CCCcccCceEEeccc--cCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANK 78 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~~ 78 (231) +..=+.|+||+||+| +|+++++.||++|+++++++++.+++|++++++|+++|++.+++++||++++.+|++++|+++ T Consensus 40 ~l~g~~G~ti~iP~~~~~gda~~~~eg~~i~~~~lt~~~~~~~i~~~~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~ 119 (272) T protein:vir:36 40 TLQGQPGNTLKFPAFTYIGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANK 119 (272) T ss_pred ccccCCCCEEEEeeeccCccccccCCCCccChhhcCCcceeEeeehhhccccccHHHHhhccchHHHHHHHHHHHHHHHH Confidence 444457999999998 899999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcccccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccccccCceeeeccceeecc Q lcl|Aclame:pro 79 VDDDLLKAAKTTSQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLG 158 (231) Q Consensus 79 vd~~~~~~l~t~~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G 158 (231) +|+++++.+.+++.+++...++|.|++|+++|+++++.+++++|||++++.|+|++++.....+.++++++||.||+++| T Consensus 120 ~d~~i~~~l~~~~~~~~~~~~~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G 199 (272) T protein:vir:36 120 VDDDLLSAAKTTSQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLG 199 (272) T ss_pred HHHHHHHHhccccccccccccHHHHHHHHHHhhhcCCCceEEEEcHHHHHHHhcccccccccccccccceeeeccceecC Confidence 99999999999999999999999999999999999999999999999999999999998888888899999999999999 Q ss_pred eeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 159 AQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 159 ~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) +||++||++|++++++..+++.++|++++.++++++|++|++++++|.|++++|||+++++|+++|++|++|| T Consensus 200 ~~Vv~s~~~p~~~~~~~~~~~~~gA~~~~~~~~~~vE~~R~~~~~~d~i~~~~~y~~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 200 AQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 272 (272) T ss_pred eeEEEeCCCCCCceeEEEEEecccceeeeecCCcccccccchhhcCcEEEEEEEEEEEEEcCccEEEEeecCC Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999 No 3 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=100.00 E-value=7.6e-59 Score=339.22 Aligned_cols=225 Identities=30% Similarity=0.459 Sum_probs=209.4 Q ss_pred CCCcccCceEEeccc--cCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANK 78 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~~ 78 (231) ++.-++|+||+||+| +|+++++.||++|++++|++++.+++|+++|++|+++|++.+.+++||++++.+|++.+|+++ T Consensus 38 ~L~g~~G~ti~~P~~~~igdae~~~eg~~i~~~~lt~~~~~a~i~~~gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~ 117 (270) T protein:vir:95 38 TLVGQPGDTITRPKYAYIGAAEDLQEGVAMDTTQMSMTTTKVTVKETGKAVEVTQTAIITNVNGTLQEASRQLAMSLADK 117 (270) T ss_pred ccCCCCCCEEEeeeecCCCccccccCCCccchhhcccchheeeeehhhCcceecHHHHhhhccchHHHHHHHHHHHHHHH Confidence 777789999999988 999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcccccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccccccCceeeeccceeecc Q lcl|Aclame:pro 79 VDDDLLKAAKTTSQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLG 158 (231) Q Consensus 79 vd~~~~~~l~t~~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G 158 (231) +|++++++|++++++++...+++++++|+++|+|+.+.+++++|||++++.|||+..+. ....+.++++||.||+++| T Consensus 118 ~d~~li~~l~~a~~~~~~~~t~~~~~dA~~~lgd~~~~~~~i~vhs~~~~~Lrk~~~~~--~~~~~~~~~~~G~ig~~~G 195 (270) T protein:vir:95 118 VEIDYIAELNKSKQTATVSADATGILDAIEVFNSENDEDYVLYVNPKDYNKLVKSLFKV--GGNVQDRAISKGDLVEIVG 195 (270) T ss_pred HHHHHHHHhcccccccccccCHHHHHHHHHHhccccCCCcEEEEcHHHHHHHHhhhccc--ccccccchhcccccceecc Confidence 99999999999999999999999999999999999999999999999999999987543 3456778899999999999 Q ss_pred eeEEEcCCC-ccCceEEEEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 159 AQIVRSKKL-AEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 159 ~~Vv~s~~~-~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) +||++++++ ++++.+ +.++||++++.++++++|++||+++++|.+++++||+++++||+++|++||+.- T Consensus 196 ~~Viv~s~~~~~~~~~----l~~~gAi~~~~~~~~~vEtdRd~~~~~d~i~~~~~y~v~~~~~skvv~~t~~~a 265 (270) T protein:vir:95 196 VSDIVKSKRVSENTAF----LQRYGAMEIVNKKKPEAYTDFDILKRTHLLSTNYHYSVNLKDETGVVKVTFKPS 265 (270) T ss_pred eeEEEeCCCCCceeEE----EEeccceeeeecCCceeeeccchhhcccEEEeeeEEEEEEEccceEEEEEecCC Confidence 999886655 455544 446999999999999999999999999999999999999999999999998877 No 4 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=100.00 E-value=4.9e-58 Score=334.76 Aligned_cols=226 Identities=47% Similarity=0.705 Sum_probs=210.2 Q ss_pred CCCcccCceEEeccc--cCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANK 78 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~~ 78 (231) ++.-++|+||+||+| +|+++++.||++|++++|++++.+++|+|++++|+++|++.+++++||++++++|++++||++ T Consensus 40 ~l~g~~G~ti~iP~~~~igda~~~~eg~~i~~~~lt~~~~~a~i~~~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~ 119 (276) T protein:vir:10 40 TLVGQPGDTLTFPAFVYSGDATVVPEGQKIPVDKIETNRREAKIHKIGKGTDITDEALLSGYGDPQGEAVRQHGLAIANK 119 (276) T ss_pred cccCCCCCEEEeeeecCCCccccccCCCccCccccccceeeEEeehccccccccHHHHHhhccchHHHHHHHHHHHHHHH Confidence 555578999999988 899999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhccccccccc-ccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhh--hhhhccccccCceeeecccee Q lcl|Aclame:pro 79 VDDDLLKAAKTTSQTVST-KANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDA--NAKNIGSEVGANALINGTYAD 155 (231) Q Consensus 79 vd~~~~~~l~t~~~~~~~-~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~--~~~~~~~~~~~~~~~~G~ig~ 155 (231) +|+++++.++++++..+. +++++.|++|+++|+++++++++++|||++++.|+|+. +|.. .+..+++++++|+||+ T Consensus 120 ~d~~~~~~l~~~~~~~~~~~~t~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~-~s~~g~~~~~~G~ig~ 198 (276) T protein:vir:10 120 VDNDVLEALRGTKLTVSADIGTLAGLEAAIDTFDDEDLEPMVLFINPKDAGKLRSSASDNFTR-ATELGDNIIVKGAFGE 198 (276) T ss_pred HHHHHHHHHhcccccccccccCHHHHHHHHHHhccccCcccEEEEcHHHHHHHHHhccccccc-cccccccceeccccce Confidence 999999999998877665 67999999999999999999999999999999999975 4433 3456778999999999 Q ss_pred ecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 156 VLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 156 ~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) ++|++|++|+++|+++.+. .+++|++++.++++++|++||+++++|.|++++||+++++||+++|++++++. T Consensus 199 ~~G~~Vi~s~~~p~~t~~l----~~~gAi~~~~~~~~~vE~dRd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~ 270 (276) T protein:vir:10 199 ALGAVIVRSKKLDEGEAIL----AKRGAVKLITKRDFFLETDRDPSTKTTALYSDKHYVAYLYDESKAVKVTKGAG 270 (276) T ss_pred ecceeEEEcCCCCcceEEE----EeccceeeeecCCceeecccchhhcccEEEEeeEEEEEEEcCcceEEEecCCc Confidence 9999999999999988764 46999999999999999999999999999999999999999999999999888 No 5 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=100.00 E-value=9.3e-58 Score=333.25 Aligned_cols=227 Identities=44% Similarity=0.655 Sum_probs=209.8 Q ss_pred CCCcccCceEEeccc--cCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANK 78 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~~ 78 (231) +..-++|+||+||+| +|+++++.||++|+++++++++.+++|+|++++|+++|++.+++++||++++++|++++||++ T Consensus 41 ~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~ 120 (275) T protein:vir:96 41 TLVGQPGNTITFPAFVYSGDAKVVPEGEEIPIDLIETKKRQATIRKIGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANK 120 (275) T ss_pred cccCCCCCEEEeeeeccCCccccccCCCCcchhhcccceeeEEeehhcccccccHHHHHhhccchHHHHHHHHHHHHHHH Confidence 444467999999988 899999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcccccccc-cccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhc-cccccCceeeeccceee Q lcl|Aclame:pro 79 VDDDLLKAAKTTSQTVS-TKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNI-GSEVGANALINGTYADV 156 (231) Q Consensus 79 vd~~~~~~l~t~~~~~~-~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~-~~~~~~~~~~~G~ig~~ 156 (231) +|+++++++++++++.. .++++|.|++|+++|+++++.+++++|||++++.|+|++..... ....+.+++++|.||++ T Consensus 121 ~d~~ll~~l~~a~~~~~~~~~~~d~i~dA~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~G~ig~~ 200 (275) T protein:vir:96 121 VDNDVLEALQGATLKVEADITKLAGLQTAIDKFNDEDLEPMVLFVNPLDAGKLRASATDNFTRATLLGDNVIVKGAFGEA 200 (275) T ss_pred HHHHHHHHHhcccccccccccCHHHHHHHHHHhccccCCccEEEeCHHHHHHHHhcccccccccccccccceecccccee Confidence 99999999999887764 56799999999999999999999999999999999998743322 34557788999999999 Q ss_pred cceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 157 LGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 157 ~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) +|++|++||++|+++++.+ +++|++++.++++++|++||+++++|.|++++||++++++|+++|+++++.- T Consensus 201 ~G~~Vi~s~~~p~~t~~i~----~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~ 271 (275) T protein:vir:96 201 LGAIIVRSNKIKEGEAILA----KRGAVKLITKRDFFLETERHASHKSTALFSDKHYVAYLYDESKVVKITKSAS 271 (275) T ss_pred cCeeEEEeCCCCcceEEEE----eccceeeeecCCcccccccchhhcCcEEEEeEEEEEEEEcCccEEEEEeccc Confidence 9999999999999987644 5999999999999999999999999999999999999999999999999987 No 6 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=100.00 E-value=1.3e-56 Score=327.06 Aligned_cols=226 Identities=46% Similarity=0.669 Sum_probs=209.5 Q ss_pred CCCcccCceEEeccc--cCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANK 78 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~~ 78 (231) +..-++|+||+||+| +|+++++.||++|+++++++++.+++|+|++++|+++|++.+++++||++++++|++++|+++ T Consensus 40 ~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~ 119 (274) T protein:vir:12 40 TLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANK 119 (274) T ss_pred cccCCCCCEEEEeeecCCCccccccCCCccchhhcccceeeEEeeeecceeeecHHHHHhcccchHHHHHHHHHHHHHHH Confidence 555578999999988 899999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcccccccc-cccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhh--hhhhccccccCceeeecccee Q lcl|Aclame:pro 79 VDDDLLKAAKTTSQTVS-TKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDA--NAKNIGSEVGANALINGTYAD 155 (231) Q Consensus 79 vd~~~~~~l~t~~~~~~-~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~--~~~~~~~~~~~~~~~~G~ig~ 155 (231) +|+++++.+.+++.+.. .++++|.|++|+++|+++++.+++++|||++++.|+|++ +|.. .+..+.+++++|.||+ T Consensus 120 vd~~~l~~~~~a~~~~~~~a~~~d~i~dA~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~fv~-~s~~g~~~~~~G~ig~ 198 (274) T protein:vir:12 120 VDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTR-ATELGDDIIVKGAFGE 198 (274) T ss_pred HHHHHHHHHhcccccccccccCHHHHHHHHHHhccccccccEEEeCHHHHHHHHhhhhhhccc-cccccccceeccccee Confidence 99999999998887765 467999999999999999999999999999999999986 4443 3456778999999999 Q ss_pred ecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 156 VLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 156 ~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) ++|++|++||++|+++++ +.+++|++++.++++++|++||+++++|.+++++|||++++||+++|++++++= T Consensus 199 ~~G~~Vi~s~~~p~~t~~----l~~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~ 270 (274) T protein:vir:12 199 ALGAIIVRSNKLEAGTAI----LAKKGAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred ecCeeEEEeCCCCcceEE----EEeccceeeeecCCceeccccchhhcccEEEeeeEEEEEEEcCCceEEEEcCCc Confidence 999999999999998875 446999999999999999999999999999999999999999999999998777 No 7 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=100.00 E-value=2.5e-56 Score=325.43 Aligned_cols=226 Identities=46% Similarity=0.665 Sum_probs=209.4 Q ss_pred CCCcccCceEEeccc--cCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANK 78 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~~ 78 (231) +..-++|+||+||+| +|+++++.||++|+++++++++.+++|+|++++|+++|++..++++||++++++|++++|+++ T Consensus 40 ~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~~lt~~~~~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~ 119 (274) T protein:vir:97 40 TLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANK 119 (274) T ss_pred cccCCCCCEEEEeeecCCCccccccCCCcccccccccceeEEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHH Confidence 444568999999988 899999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcccccccc-cccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhh--hhhhccccccCceeeecccee Q lcl|Aclame:pro 79 VDDDLLKAAKTTSQTVS-TKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDA--NAKNIGSEVGANALINGTYAD 155 (231) Q Consensus 79 vd~~~~~~l~t~~~~~~-~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~--~~~~~~~~~~~~~~~~G~ig~ 155 (231) +|+++++.+.++++++. .+++++.+++|.++|+++++.+++++|||++++.|+|++ +|.. .+..+++++++|.||+ T Consensus 120 vd~~~~~~l~~a~~~~~~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~-~s~~g~~~~~~G~ig~ 198 (274) T protein:vir:97 120 VDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLEPMVLFVNPLDAGKLRGDASTNFTR-ATELGDDIIVKGAFGE 198 (274) T ss_pred HHHHHHHHHhccCccccccccCHHHHHHHHHHhhccCCCceEEEeCHHHHHHHHhhhhhhccc-cCcccccceeccccce Confidence 99999999999887764 467899999999999999999999999999999999986 4433 3456778999999999 Q ss_pred ecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 156 VLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 156 ~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) ++|++|++||++|+++++.+ +++|++++.++++++|++||+++++|.+++++|||++++||+++|++++++= T Consensus 199 ~~G~~Vi~s~~~p~~t~~l~----~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~ 270 (274) T protein:vir:97 199 ALGAIIVRTNKLEAGTAILA----KKGAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred ecCeeEEEcCCCCcceEEEE----eCcceEeeecCCceeccccchhhcccEEEEEEEEEEEEEcCCceEEEecCcc Confidence 99999999999998887544 5999999999999999999999999999999999999999999999999988 No 8 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=100.00 E-value=2.5e-56 Score=325.43 Aligned_cols=226 Identities=46% Similarity=0.665 Sum_probs=209.4 Q ss_pred CCCcccCceEEeccc--cCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANK 78 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~~ 78 (231) +..-++|+||+||+| +|+++++.||++|+++++++++.+++|+|++++|+++|++..++++||++++++|++++|+++ T Consensus 40 ~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~~lt~~~~~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~ 119 (274) T protein:vir:94 40 TLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANK 119 (274) T ss_pred cccCCCCCEEEEeeecCCCccccccCCCcccccccccceeEEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHH Confidence 444568999999988 899999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcccccccc-cccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhh--hhhhccccccCceeeecccee Q lcl|Aclame:pro 79 VDDDLLKAAKTTSQTVS-TKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDA--NAKNIGSEVGANALINGTYAD 155 (231) Q Consensus 79 vd~~~~~~l~t~~~~~~-~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~--~~~~~~~~~~~~~~~~G~ig~ 155 (231) +|+++++.+.++++++. .+++++.+++|.++|+++++.+++++|||++++.|+|++ +|.. .+..+++++++|.||+ T Consensus 120 vd~~~~~~l~~a~~~~~~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~-~s~~g~~~~~~G~ig~ 198 (274) T protein:vir:94 120 VDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLEPMVLFVNPLDAGKLRGDASTNFTR-ATELGDDIIVKGAFGE 198 (274) T ss_pred HHHHHHHHHhccCccccccccCHHHHHHHHHHhhccCCCceEEEeCHHHHHHHHhhhhhhccc-cCcccccceeccccce Confidence 99999999999887764 467899999999999999999999999999999999986 4433 3456778999999999 Q ss_pred ecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 156 VLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 156 ~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) ++|++|++||++|+++++.+ +++|++++.++++++|++||+++++|.+++++|||++++||+++|++++++= T Consensus 199 ~~G~~Vi~s~~~p~~t~~l~----~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~ 270 (274) T protein:vir:94 199 ALGAIIVRTNKLEAGTAILA----KKGAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred ecCeeEEEcCCCCcceEEEE----eCcceEeeecCCceeccccchhhcccEEEEEEEEEEEEEcCCceEEEecCcc Confidence 99999999999998887544 5999999999999999999999999999999999999999999999999988 No 9 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=100.00 E-value=3.8e-56 Score=324.44 Aligned_cols=227 Identities=45% Similarity=0.661 Sum_probs=207.2 Q ss_pred CCCcccCceEEeccc--cCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANK 78 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~~ 78 (231) +..-++||||+||+| +|+++++.||++|+++++++++.+++|+|++++|+++|++.+++.+||++++++|++++||++ T Consensus 40 ~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~ 119 (274) T protein:vir:96 40 TLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPTDILETKKREAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANK 119 (274) T ss_pred cccCCCCCEEEeeeecCCCccccccCCCccchhhcccceeEEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHH Confidence 333357999999988 899999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcccccccc-cccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhh-ccccccCceeeeccceee Q lcl|Aclame:pro 79 VDDDLLKAAKTTSQTVS-TKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKN-IGSEVGANALINGTYADV 156 (231) Q Consensus 79 vd~~~~~~l~t~~~~~~-~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~-~~~~~~~~~~~~G~ig~~ 156 (231) +|+++++.+++++.++. .+++++.|++|.++|+++++.+++++|||++++.|+|++.... ..+..+.+++++|.||++ T Consensus 120 vd~~i~~~l~~a~~~~~~~~~~~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~ 199 (274) T protein:vir:96 120 VDDDVLEALKSAKLTVEADITKLTGLQTAIDKFNDEDLEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEA 199 (274) T ss_pred HHHHHHHHHhcccccccccccCHHHHHHHHHHhccccccccEEEeCHHHHHHHHhhccccccccccccccceecccccee Confidence 99999999999887764 4678999999999999999999999999999999999974332 245667789999999999 Q ss_pred cceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 157 LGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 157 ~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) +|++|++||++|+++++ +.+++|++++.++++++|++||+++++|.+++++|||++++||+++|++++..= T Consensus 200 ~G~~Vi~s~~~~~~t~~----l~~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~ 270 (274) T protein:vir:96 200 LGAVIVRSNKLEAGTAI----LAKKGAVKLITKRDFFLETDRDPSTKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred cCeEEEEeCCCCCceEE----EEeccceeeeecCCcccccccccccccCEEEEeEEEEEEEEcCCcEEEEEcCCc Confidence 99999999999988875 346999999999999999999999999999999999999999999999994443 No 10 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=100.00 E-value=3.8e-56 Score=324.44 Aligned_cols=227 Identities=45% Similarity=0.661 Sum_probs=207.2 Q ss_pred CCCcccCceEEeccc--cCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANK 78 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~~ 78 (231) +..-++||||+||+| +|+++++.||++|+++++++++.+++|+|++++|+++|++.+++.+||++++++|++++||++ T Consensus 40 ~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~ 119 (274) T protein:vir:95 40 TLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPTDILETKKREAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANK 119 (274) T ss_pred cccCCCCCEEEeeeecCCCccccccCCCccchhhcccceeEEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHH Confidence 333357999999988 899999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcccccccc-cccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhh-ccccccCceeeeccceee Q lcl|Aclame:pro 79 VDDDLLKAAKTTSQTVS-TKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKN-IGSEVGANALINGTYADV 156 (231) Q Consensus 79 vd~~~~~~l~t~~~~~~-~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~-~~~~~~~~~~~~G~ig~~ 156 (231) +|+++++.+++++.++. .+++++.|++|.++|+++++.+++++|||++++.|+|++.... ..+..+.+++++|.||++ T Consensus 120 vd~~i~~~l~~a~~~~~~~~~~~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~ 199 (274) T protein:vir:95 120 VDDDVLEALKSAKLTVEADITKLTGLQTAIDKFNDEDLEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEA 199 (274) T ss_pred HHHHHHHHHhcccccccccccCHHHHHHHHHHhccccccccEEEeCHHHHHHHHhhccccccccccccccceecccccee Confidence 99999999999887764 4678999999999999999999999999999999999974332 245667789999999999 Q ss_pred cceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 157 LGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 157 ~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) +|++|++||++|+++++ +.+++|++++.++++++|++||+++++|.+++++|||++++||+++|++++..= T Consensus 200 ~G~~Vi~s~~~~~~t~~----l~~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~ 270 (274) T protein:vir:95 200 LGAVIVRSNKLEAGTAI----LAKKGAVKLITKRDFFLETDRDPSTKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred cCeEEEEeCCCCCceEE----EEeccceeeeecCCcccccccccccccCEEEEeEEEEEEEEcCCcEEEEEcCCc Confidence 99999999999988875 346999999999999999999999999999999999999999999999994443 No 11 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=100.00 E-value=7.3e-55 Score=317.37 Aligned_cols=227 Identities=44% Similarity=0.655 Sum_probs=209.6 Q ss_pred CCCcccCceEEeccc--cCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANK 78 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~~ 78 (231) +..-++|+||+||+| +|+++++.||++|+++++++++.+++|++++++|+++|++..++.+||++++.+|++++|+++ T Consensus 40 ~l~g~~G~tv~ip~~~~~g~~~~~~eg~~i~~~~it~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~ 119 (274) T protein:vir:93 40 TLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANK 119 (274) T ss_pred cccCCCCCEEEEEeeccCCCcccccCCCcccccccccceeEEEeeeecccccccHHHHHhhccchHHHHHHHHHHHHHHH Confidence 555568999999998 899999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhccccccccc-ccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhh-ccccccCceeeeccceee Q lcl|Aclame:pro 79 VDDDLLKAAKTTSQTVST-KANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKN-IGSEVGANALINGTYADV 156 (231) Q Consensus 79 vd~~~~~~l~t~~~~~~~-~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~-~~~~~~~~~~~~G~ig~~ 156 (231) +|+++++.+.+++.++.+ +++++.|++|..+|+++++.+++++|||++++.|+|++.... ..+..+++++++|.||++ T Consensus 120 ~d~~~~~~~~~a~~~~~~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~ 199 (274) T protein:vir:93 120 VDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEA 199 (274) T ss_pred HHHHHHHHHhcccccccccccCHHHHHHHHHHhhhccCCccEEEeCHHHHHHHHhhhhhcccccccccccceeeccccee Confidence 999999999998876654 578999999999999999999999999999999999874332 234567788999999999 Q ss_pred cceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 157 LGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 157 ~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) +|++|++||++|+++++.+ +++|++++.++++++|++||+++++|.+++++|||+++++|+++|++++++= T Consensus 200 ~G~~Vi~s~~~p~~t~~l~----~~gai~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~t~~~~ 270 (274) T protein:vir:93 200 LGAIIVRTNKLEAGTAILA----KKGAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred cCeeEEEcCCCCcceEEEE----eCCeEEEEecCCcccccccchhhcccEEEEEEEEEEEEEcCCceEEEeeCcc Confidence 9999999999999887644 5999999999999999999999999999999999999999999999999888 No 12 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=100.00 E-value=6.7e-55 Score=317.58 Aligned_cols=226 Identities=45% Similarity=0.686 Sum_probs=208.9 Q ss_pred CCCcccCceEEeccc--cCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANK 78 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~~ 78 (231) +..-++|+||+||+| +|+++++.||++++++++++++.+++|++++++|+++|++..++.+||++++.+|++++|+++ T Consensus 40 ~l~g~~G~tv~ip~~~~~g~~~~~~~g~~i~~~~it~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~ 119 (274) T protein:vir:96 40 TLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANK 119 (274) T ss_pred cccCCCCCEEEEEeeccCCCccccCCCCcCchhhcccceeEEEEEeeeceeeecHHHHHhhcchHHHHHHHHHHHHHHHH Confidence 555568999999998 899999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcccccccc-cccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhh--hhhccccccCceeeecccee Q lcl|Aclame:pro 79 VDDDLLKAAKTTSQTVS-TKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDAN--AKNIGSEVGANALINGTYAD 155 (231) Q Consensus 79 vd~~~~~~l~t~~~~~~-~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~--~~~~~~~~~~~~~~~G~ig~ 155 (231) +|+++++.+++++.... .+++++.|++|.++|+++++.+++++|||++++.|+|+.. |.. .+..+++++++|.||+ T Consensus 120 ~d~~i~~~l~~a~~~~~~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~-~~~~g~~~~~~g~ig~ 198 (274) T protein:vir:96 120 VDNDVLEALKGATLTVEADITKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTR-PTQLGDNIIVKGAFGE 198 (274) T ss_pred HHHHHHHHHhcCCCCcCcccccHHHHHHHHHHhcccCCCceEEEeCHHHHHHHHhcccccccc-cccccccceeecccce Confidence 99999999998887654 5678999999999999999999999999999999999863 432 3456778999999999 Q ss_pred ecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 156 VLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 156 ~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) ++|++|++||++|+++++.+ +++|++++.++++++|++|++++++|.|++++|||++++||+++|+++..+- T Consensus 199 ~~G~~Vi~s~~~p~~t~~l~----~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~yg~~~~~~~~vv~~t~~~~ 270 (274) T protein:vir:96 199 ALGAVIVRSNKLNKGEALLA----KKGAVKLITKRDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAG 270 (274) T ss_pred ecCeeEEEcCCCCcceEEEE----eCcceeeeecCCcccccccchhhcccEEEEeeEEEEEEEcCccEEEEEcCcc Confidence 99999999999999987644 5999999999999999999999999999999999999999999999998887 No 13 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=100.00 E-value=3.6e-53 Score=308.09 Aligned_cols=227 Identities=40% Similarity=0.556 Sum_probs=203.8 Q ss_pred CCCcccCceEEeccc--cCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANK 78 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~~ 78 (231) +..-.+|+||+||+| +|+++++.||+.|+++++++++.+++|+|++++|+++|++..++.+||++++++|++++|+++ T Consensus 40 ~l~g~~G~tv~ip~~~~~g~a~~~~~g~~i~~~~lt~~~~~~~i~~~~~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~ 119 (278) T protein:vir:80 40 SLEGQPGSEITVPKYKYIGDAQDVAEGAAIDYSALETESVKHGIKKAGKGVKLTDESVLSGYGDPVEEAQKQIRMAIASK 119 (278) T ss_pred cccCCCCCEEEEeeeccCCcceeecCCCcCcccccccceeeEeeehhhccccccHHHHhhccccHHHHHHHHHHHHHHHH Confidence 433467999999998 899999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhccccccccccc-------CHHHHHHHHHHhhccCC-CceEEEECHHHHHHHHhhhhhhhc-cccccCceee Q lcl|Aclame:pro 79 VDDDLLKAAKTTSQTVSTKA-------NVDGVQAALDIFNDEDA-QAYVLIVNPKDAAKIRKDANAKNI-GSEVGANALI 149 (231) Q Consensus 79 vd~~~~~~l~t~~~~~~~~~-------~~d~i~da~~~l~~~~~-~~~v~vv~p~~~~~L~k~~~~~~~-~~~~~~~~~~ 149 (231) +|+++++.+.+++...+... .++.+.++..+|++++. ..++++|||++++.|+|++..... .+..++++++ T Consensus 120 ~d~~l~~~l~~a~~~~~~~~t~~~~~~~~~~~~da~~~l~~~~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~ 199 (278) T protein:vir:80 120 VDNDILEEALTTTLEVKGAINIGLIDKIENTFTDAPDAIEDESITTTGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLV 199 (278) T ss_pred HHHHHHHHHhccccccccccccchhhhHHHHHHHHHHhhcccCCCcccEEEECHHHHHHHHhhhhhhcccccccccccee Confidence 99999999988776654432 47788899999987764 466899999999999999754332 3456778999 Q ss_pred eccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEec Q lcl|Aclame:pro 150 NGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFT 229 (231) Q Consensus 150 ~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~ 229 (231) +|.||+++|++|++||++|+++++.+ .++|++++.++++++|++|++++++|.|++++|||++++||+++|++++. T Consensus 200 ~G~ig~~~G~~Vi~s~~~p~~t~~l~----~~gAi~~~~~~~~~vE~~Rd~~~~~d~i~~~~~yg~~v~~~~~~v~it~~ 275 (278) T protein:vir:80 200 KGAFGELLGWEIVRTKKLADGNALAV----KAGALKTFLKRNLLAESGRDMDHKLTKFNADQHYAVALVDETKAVKVVPV 275 (278) T ss_pred eccceeecceeEEEcCCCCcceEEEE----eccceeeeecCCcccccccchhhccceeeeeeEEEEEEEcCcceEEEeec Confidence 99999999999999999999887654 58999999999999999999999999999999999999999999999999 Q ss_pred cC Q lcl|Aclame:pro 230 GV 231 (231) Q Consensus 230 ~~ 231 (231) |= T Consensus 276 a~ 277 (278) T protein:vir:80 276 AG 277 (278) T ss_pred cC Confidence 88 No 14 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=100.00 E-value=8.8e-51 Score=295.01 Aligned_cols=227 Identities=48% Similarity=0.723 Sum_probs=209.8 Q ss_pred CCCcccCceEEeccc--cCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANK 78 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~~ 78 (231) +..-..|++|+||+| +|++++++||++++++++++++.++++++++++|++||++..++.+|++++..++++++|+++ T Consensus 40 ~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~ 119 (272) T protein:vir:30 40 TLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQLGFKKTTMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHK 119 (272) T ss_pred cccCCCCCEEEEEEecCCCCcccccCCCcccccccccceEEEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHH Confidence 222357999999998 789999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcccccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhc-cccccCceeeeccceeec Q lcl|Aclame:pro 79 VDDDLLKAAKTTSQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNI-GSEVGANALINGTYADVL 157 (231) Q Consensus 79 vd~~~~~~l~t~~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~-~~~~~~~~~~~G~ig~~~ 157 (231) +|+++++.+.+++..++...++++|++|+.+|++++..+++++|||++++.|+++...... ....+.+.+++|.+|+++ T Consensus 120 ~d~~i~~~~~~a~~~~~~~~t~d~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~ 199 (272) T protein:vir:30 120 VDADVLDALSKSTQTVEATATVDGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVL 199 (272) T ss_pred HHHHHHHHhcccccccccccCHHHHHHHHHHHhccCCCccEEEEcHHHHHHHHHhccccccccccccccccccccchhhc Confidence 9999999999999999999999999999999999999999999999999999998544322 345566788999999999 Q ss_pred ceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 158 GAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 158 G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) |+||++|+++|+++++.+ +++|++++.++++.+|++|+++++++.+++++||++++++|+++|++|+++- T Consensus 200 G~~Vi~s~~~p~~t~~~~----~~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a 269 (272) T protein:vir:30 200 GVQIVRSRKCPKGTAYMV----RKGALRIMLKRNTMVETDRDITKAINQIVANKHYGVYLYKAEKAVKITLKDA 269 (272) T ss_pred CeeEEEcCCCCcceEEEE----cCCeEEEEecCCceeeeccccccceeEEEEEEEEEEEEEcCCceEEEEeccc Confidence 999999999999987654 5899999999999999999999999999999999999999999999999999 No 15 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=100.00 E-value=8.8e-51 Score=295.01 Aligned_cols=227 Identities=48% Similarity=0.723 Sum_probs=209.8 Q ss_pred CCCcccCceEEeccc--cCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANK 78 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~~ 78 (231) +..-..|++|+||+| +|++++++||++++++++++++.++++++++++|++||++..++.+|++++..++++++|+++ T Consensus 40 ~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~ 119 (272) T protein:vir:98 40 TLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQLGFKKTTMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHK 119 (272) T ss_pred cccCCCCCEEEEEEecCCCCcccccCCCcccccccccceEEEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHH Confidence 222357999999998 789999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcccccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhc-cccccCceeeeccceeec Q lcl|Aclame:pro 79 VDDDLLKAAKTTSQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNI-GSEVGANALINGTYADVL 157 (231) Q Consensus 79 vd~~~~~~l~t~~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~-~~~~~~~~~~~G~ig~~~ 157 (231) +|+++++.+.+++..++...++++|++|+.+|++++..+++++|||++++.|+++...... ....+.+.+++|.+|+++ T Consensus 120 ~d~~i~~~~~~a~~~~~~~~t~d~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~ 199 (272) T protein:vir:98 120 VDADVLDALSKSTQTVEATATVDGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVL 199 (272) T ss_pred HHHHHHHHhcccccccccccCHHHHHHHHHHHhccCCCccEEEEcHHHHHHHHHhccccccccccccccccccccchhhc Confidence 9999999999999999999999999999999999999999999999999999998544322 345566788999999999 Q ss_pred ceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 158 GAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 158 G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) |+||++|+++|+++++.+ +++|++++.++++.+|++|+++++++.+++++||++++++|+++|++|+++- T Consensus 200 G~~Vi~s~~~p~~t~~~~----~~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a 269 (272) T protein:vir:98 200 GVQIVRSRKCPKGTAYMV----RKGALRIMLKRNTMVETDRDITKAINQIVANKHYGVYLYKAEKAVKITLKDA 269 (272) T ss_pred CeeEEEcCCCCcceEEEE----cCCeEEEEecCCceeeeccccccceeEEEEEEEEEEEEEcCCceEEEEeccc Confidence 999999999999987654 5899999999999999999999999999999999999999999999999999 No 16 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=100.00 E-value=2.4e-40 Score=237.75 Aligned_cols=222 Identities=11% Similarity=0.102 Sum_probs=183.9 Q ss_pred CCCcccCceEEeccc--c-CCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--I-GDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--i-gda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) =++..+|++|+||+| + |+++++.||+++++++|++++..+++++++++|+++|++.+.+++||+.++.+|++.+|++ T Consensus 45 l~~~~~G~~i~~P~~~~l~Gd~~~v~~~~~i~~~~l~t~~~~a~i~~~~k~~~~tD~a~~~sg~dp~~~i~~q~a~~~~~ 124 (324) T protein:vir:59 45 AKKAGGGSTLNMPYWNDLDGDSQVLNDTDDLVPQKINAGQDKAVLILRGNAWSSHDLAATLSGSDPMQAIGSRVAAYWAR 124 (324) T ss_pred hhccCCCCEEEecccccCCCcccccCCCcccchhhcccceeeEEEEeecCceeehhhhhhhccchHHHHHHHHHHHHHHH Confidence 134568999999998 4 9999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcccc-------------cccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhcccccc Q lcl|Aclame:pro 78 KVDDDLLKAAKTTS-------------QTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVG 144 (231) Q Consensus 78 ~vd~~~~~~l~t~~-------------~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~ 144 (231) +.++++++.|++.. ....+.++++.+++|.++|||+.....+++|||.++..|+++..... -.+.. T Consensus 125 ~~~~~lia~l~g~~~~~~~~~~~~dvsa~~~~~~s~~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~li~~-~~~s~ 203 (324) T protein:vir:59 125 EMQKIVFAELAGVFSNDDMKDNKLDISGTADGIYSAETFVDASYKLGDHESLLTAIGMHSATMASAVKQDLIEF-VKDSQ 203 (324) T ss_pred HHHHHHHHHHHHhhhccccccceeeeeccccceecHHHHHHHHHHhCCcccCcEEEEEchHHHHHHHHhhhhhh-ccccc Confidence 99999999886421 11223468899999999999999999999999999999999753332 22222 Q ss_pred CceeeeccceeecceeEEEcCCCccC-----ceEEEEEecCCceEEEeecC-CccceeccchhhcccEEEEEEEEEEEEE Q lcl|Aclame:pro 145 ANALINGTYADVLGAQIVRSKKLAEG-----SALMFKIVSNSPALKLVLKR-GVQVETDRDIVTKTTVITADEHYAAYLY 218 (231) Q Consensus 145 ~~~~~~G~ig~~~G~~Vv~s~~~~~~-----~~~~~~~~~~~~A~~~~~k~-~v~vE~~Rd~~~~~~~i~~~~~y~~~~~ 218 (231) .++.|++++|++|++|++||.. ...+..++.++||+++..++ ++.+|++|++.++.+.++.++||.+|+. T Consensus 204 ----~~~~i~~~~G~~VivdD~~p~~~~~~~~~~y~s~l~~~GAi~~~~~~~~v~vE~dRd~~~g~~~l~~r~~~~~~p~ 279 (324) T protein:vir:59 204 ----SGIRFPTYMNKRVIVDDSMPVETLEDGTKVFTSYLFGAGALGYAEGQPEVPTETARNALGSQDILINRKHFVLHPR 279 (324) T ss_pred ----cCceeeeecccEEEEeCCCCccccCCCCceEEEEEEecCeEEEeecCCCcceecccCccccceEEEEeeEEEeEee Confidence 2457899999999999999842 23456677789999998755 5889999999999999999999998884 Q ss_pred cCCcEEEEEecc---C Q lcl|Aclame:pro 219 DLTKVVNITFTG---V 231 (231) Q Consensus 219 ~~~~vv~l~~~~---~ 231 (231) = ++.+-++ . T Consensus 280 G----~s~~~~~~~~~ 291 (324) T protein:vir:59 280 G----VKFTENAMAGT 291 (324) T ss_pred e----EEecccccCCC Confidence 1 2222222 2 No 17 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=100.00 E-value=7e-40 Score=235.25 Aligned_cols=226 Identities=14% Similarity=0.138 Sum_probs=184.1 Q ss_pred CCCcccCceEEeccc--c-CCcccccCCC-ccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--I-GDAADVAEGG-EISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLA 76 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--i-gda~~v~EG~-~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia 76 (231) .....+|++|+||+| + |+++++.||+ .|+++++++++..+++++++|+|.++|++.+.++.||+.++.+|++.+|+ T Consensus 45 ~~~~~~G~~i~~P~~~~l~G~~~~~~dg~~~i~~~ki~t~~~~a~i~~~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~ 124 (330) T protein:vir:10 45 KNITSGGLLVNMPFWNDLTGDSEVLGNGDKALETGKITAGADIACVLYRGRGWAANELTGVVAGSDPVRAILNRIGAYWL 124 (330) T ss_pred HHhhcCCCEEEecccccCCCcccccCCCccccchhhcccceeEEEEEeecceeeehhhhhhhcchhHHHHHHHHHHHHhh Confidence 344458999999998 4 9999999996 79999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhccccc-------------------ccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhh Q lcl|Aclame:pro 77 NKVDDDLLKAAKTTSQ-------------------TVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAK 137 (231) Q Consensus 77 ~~vd~~~~~~l~t~~~-------------------~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~ 137 (231) ++.++.+++.|++... ..+..++++.+++|.++|||+.....+++|||.++.+|+++.. . T Consensus 125 ~~~q~~lla~l~gvf~~~~~~~~~~~~~~~~~~~~~~~a~~s~~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~l-i 203 (330) T protein:vir:10 125 REDQKALIATLNGIFATGTAGEKGALEETHVSDQSKASTGIDAGMVLDAKQLLGDSADQVTAIAMHSAVYTKLQKDNL-I 203 (330) T ss_pred hhHHHHHHHHHHhhhhhhhcccchhhhhhheecccccccccCHHHHHHHHHHhccccccceEEEEcHHHHHHHHHhhh-h Confidence 9999999987752210 1223468899999999999999999999999999999999643 3 Q ss_pred hccccccCceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEee---cCCccceeccchhhcccEEEEEEEEE Q lcl|Aclame:pro 138 NIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVL---KRGVQVETDRDIVTKTTVITADEHYA 214 (231) Q Consensus 138 ~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~---k~~v~vE~~Rd~~~~~~~i~~~~~y~ 214 (231) +...+. ..++.|++++|++|++|+++|.....+..++.++||+++.. ++.+.+|++|+++++.+.+..++||. T Consensus 204 ~~~~~s----~~~~~i~~~~G~~VivdD~~p~~~~~yt~yl~~~GAi~~~~~~~~~~v~~EtdRd~~~g~~~l~~r~~~~ 279 (330) T protein:vir:10 204 QYIQPT----TATINIPTYLGYRVIIDDGIAPTGDIYTSYLFRTGSIGLNTGNPSGLTTFETSREAAKGNDMIYTRRALV 279 (330) T ss_pred hhhccc----ccCcccccccceEEEEeCCCCCCCCceeEEEEecCceeeecccCCccccccccCCccccceEEEEeeEEE Confidence 322222 12467999999999999999977666667788899999975 44589999999999999999999999 Q ss_pred EEEEcCCcEEE-EEeccC Q lcl|Aclame:pro 215 AYLYDLTKVVN-ITFTGV 231 (231) Q Consensus 215 ~~~~~~~~vv~-l~~~~~ 231 (231) +|++--+-... .+.++. T Consensus 280 ~hp~G~s~~~~~~~~~~~ 297 (330) T protein:vir:10 280 MHPYGVKWTGAEVDAGNI 297 (330) T ss_pred eeeeeeeecccccccCcC Confidence 88532211111 111222 No 18 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=100.00 E-value=4.4e-39 Score=230.88 Aligned_cols=228 Identities=21% Similarity=0.187 Sum_probs=191.2 Q ss_pred CCCcccCceEEeccc--cCCcccccCCCccCccccccceeEEEeehc-cceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKA-AKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--igda~~v~EG~~i~~~~lt~~~~~~tikk~-g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) |.--..||||+||+| .+......+|..+++++++.++.+++|+|. ...+.|+|++..++..|+ .+..+|+++++|+ T Consensus 34 ~~~~~~GdTv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~ 112 (273) T protein:vir:79 34 EGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALAT 112 (273) T ss_pred cccccCCcEEEEeecCcccccccccCCCccCccccccceEEEEEeeecccceeeccHHHHhhcccH-HHHHHHHHHHHHH Confidence 555567999999987 444556789999999999999999999885 668999999999998886 5689999999999 Q ss_pred HHHHHHHHHhccccccc------ccccCHHHHHHHHHHhhccCC--CceEEEECHHHHHHHHhhhhh-hhccccccCcee Q lcl|Aclame:pro 78 KVDDDLLKAAKTTSQTV------STKANVDGVQAALDIFNDEDA--QAYVLIVNPKDAAKIRKDANA-KNIGSEVGANAL 148 (231) Q Consensus 78 ~vd~~~~~~l~t~~~~~------~~~~~~d~i~da~~~l~~~~~--~~~v~vv~p~~~~~L~k~~~~-~~~~~~~~~~~~ 148 (231) ++|+++++.+.++.... +....++.|.+|...|++.+. ++++++++|+.++.|++++.+ ...........+ T Consensus 113 ~vD~~i~~~~~~a~~~~~~~~~~~~~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~~~l 192 (273) T protein:vir:79 113 DTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGL 192 (273) T ss_pred HHHHHHHHHHhhcccccccccccchhhHHHHHHHHHHHhhhccCCccCcEEEECHHHHHHHhhchhhhhhhhhcccccce Confidence 99999998886543322 223457899999999988764 689999999999999998754 333333345678 Q ss_pred eeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEe Q lcl|Aclame:pro 149 INGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITF 228 (231) Q Consensus 149 ~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~ 228 (231) ++|.+|+++|++|+.|+++|.+++... +...++|+++. ++...+|.+|+++++.+.|++++|||+++++|+++++++. T Consensus 193 ~~G~ig~~~G~~i~~s~~lp~~~~~~~-~a~~~~A~~~a-~~~~~~e~~r~~~~~~~~v~~~~~yg~~v~~p~~vv~~~~ 270 (273) T protein:vir:79 193 RAGTIGNLLGARIVESNNLRDTDDEQF-VAFHPSAAAYV-SQIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNK 270 (273) T ss_pred eeeEeeEEeceEEEecccccccCceEE-EEEeccceeee-eehhhhhcccCcccceeeeeeeeeeeeEEecCceEEEEec Confidence 999999999999999999998877543 34567888875 4677999999999999999999999999999999999999 Q ss_pred ccC Q lcl|Aclame:pro 229 TGV 231 (231) Q Consensus 229 ~~~ 231 (231) +|. T Consensus 271 ~g~ 273 (273) T protein:vir:79 271 TGS 273 (273) T ss_pred cCC Confidence 999 No 19 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=100.00 E-value=4.5e-38 Score=225.33 Aligned_cols=228 Identities=21% Similarity=0.183 Sum_probs=189.9 Q ss_pred CCCcccCceEEeccc--cCCcccccCCCccCccccccceeEEEeehc-cceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKA-AKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--igda~~v~EG~~i~~~~lt~~~~~~tikk~-g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) |.-...||||+||++ .+......+|..++++.++.++.+++|+|. ...+.|+|++..++..|+ ....+|+++++|+ T Consensus 34 ~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~alA~ 112 (273) T protein:vir:10 34 EGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALAT 112 (273) T ss_pred ccccccCceEEEeecccccccccccCCCccCccccccceEEEEEeeeeecceEeecHHHhhhhccH-HHHHHHHHHHHHH Confidence 544577999999986 333444678888999999999999999885 567999999999988886 5689999999999 Q ss_pred HHHHHHHHHhccccccc------ccccCHHHHHHHHHHhhccCC--CceEEEECHHHHHHHHhhhhhhh-ccccccCcee Q lcl|Aclame:pro 78 KVDDDLLKAAKTTSQTV------STKANVDGVQAALDIFNDEDA--QAYVLIVNPKDAAKIRKDANAKN-IGSEVGANAL 148 (231) Q Consensus 78 ~vd~~~~~~l~t~~~~~------~~~~~~d~i~da~~~l~~~~~--~~~v~vv~p~~~~~L~k~~~~~~-~~~~~~~~~~ 148 (231) ++|+++++.+.++.... +....++.|++|...|++.+. ++++++++|+.++.|++++.+.. .........+ T Consensus 113 ~vD~~i~~~~~~a~~~~~~~~~~~~~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l 192 (273) T protein:vir:10 113 DTDKFIADMLVDNGTALTGSAPTDADDAFDLIAKALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGL 192 (273) T ss_pred HHHHHHHHHHhccccccccccccchhHHHHHHHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccce Confidence 99999998876543322 223458899999999988764 68999999999999999876443 3434445678 Q ss_pred eeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEe Q lcl|Aclame:pro 149 INGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITF 228 (231) Q Consensus 149 ~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~ 228 (231) ++|.+|+++|++|+.|+++|.+++... +...++|+++.. +..++|..|+++++++.|+++++||+++++|+++++++. T Consensus 193 ~~G~ig~i~G~~v~~s~~lp~~~~~~~-~~~~~~A~~~a~-q~~~~e~~r~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~ 270 (273) T protein:vir:10 193 RAGTIGNLLGARIVESNNLRDTDDEQF-VAFHPSAAAYVS-QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNK 270 (273) T ss_pred eeeeeeEEeceEEEEecccccCCccEE-EEEeccceeeee-eeehhhcccCCCcceeeeeeeeeeeeeEeccceEEEEec Confidence 899999999999999999998877543 345688998764 677999999999999999999999999999999999999 Q ss_pred ccC Q lcl|Aclame:pro 229 TGV 231 (231) Q Consensus 229 ~~~ 231 (231) +|. T Consensus 271 ~g~ 273 (273) T protein:vir:10 271 TGS 273 (273) T ss_pred cCC Confidence 999 No 20 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=100.00 E-value=4.5e-38 Score=225.33 Aligned_cols=228 Identities=21% Similarity=0.183 Sum_probs=189.9 Q ss_pred CCCcccCceEEeccc--cCCcccccCCCccCccccccceeEEEeehc-cceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKA-AKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--igda~~v~EG~~i~~~~lt~~~~~~tikk~-g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) |.-...||||+||++ .+......+|..++++.++.++.+++|+|. ...+.|+|++..++..|+ ....+|+++++|+ T Consensus 34 ~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~alA~ 112 (273) T protein:vir:10 34 EGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALAT 112 (273) T ss_pred ccccccCceEEEeecccccccccccCCCccCccccccceEEEEEeeeeecceEeecHHHhhhhccH-HHHHHHHHHHHHH Confidence 544577999999986 333444678888999999999999999885 567999999999988886 5689999999999 Q ss_pred HHHHHHHHHhccccccc------ccccCHHHHHHHHHHhhccCC--CceEEEECHHHHHHHHhhhhhhh-ccccccCcee Q lcl|Aclame:pro 78 KVDDDLLKAAKTTSQTV------STKANVDGVQAALDIFNDEDA--QAYVLIVNPKDAAKIRKDANAKN-IGSEVGANAL 148 (231) Q Consensus 78 ~vd~~~~~~l~t~~~~~------~~~~~~d~i~da~~~l~~~~~--~~~v~vv~p~~~~~L~k~~~~~~-~~~~~~~~~~ 148 (231) ++|+++++.+.++.... +....++.|++|...|++.+. ++++++++|+.++.|++++.+.. .........+ T Consensus 113 ~vD~~i~~~~~~a~~~~~~~~~~~~~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l 192 (273) T protein:vir:10 113 DTDKFIADMLVDNGTALTGSAPTDADDAFDLIAKALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGL 192 (273) T ss_pred HHHHHHHHHHhccccccccccccchhHHHHHHHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccce Confidence 99999998876543322 223458899999999988764 68999999999999999876443 3434445678 Q ss_pred eeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEe Q lcl|Aclame:pro 149 INGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITF 228 (231) Q Consensus 149 ~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~ 228 (231) ++|.+|+++|++|+.|+++|.+++... +...++|+++.. +..++|..|+++++++.|+++++||+++++|+++++++. T Consensus 193 ~~G~ig~i~G~~v~~s~~lp~~~~~~~-~~~~~~A~~~a~-q~~~~e~~r~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~ 270 (273) T protein:vir:10 193 RAGTIGNLLGARIVESNNLRDTDDEQF-VAFHPSAAAYVS-QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNK 270 (273) T ss_pred eeeeeeEEeceEEEEecccccCCccEE-EEEeccceeeee-eeehhhcccCCCcceeeeeeeeeeeeeEeccceEEEEec Confidence 899999999999999999998877543 345688998764 677999999999999999999999999999999999999 Q ss_pred ccC Q lcl|Aclame:pro 229 TGV 231 (231) Q Consensus 229 ~~~ 231 (231) +|. T Consensus 271 ~g~ 273 (273) T protein:vir:10 271 TGS 273 (273) T ss_pred cCC Confidence 999 No 21 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=100.00 E-value=1.8e-37 Score=222.07 Aligned_cols=221 Identities=11% Similarity=0.067 Sum_probs=176.1 Q ss_pred CCCcccCceEEeccc--c-CCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--I-GDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--i-gda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) +....+|++|+||+| + ||++++.|+.+|++++|++++..+++++++++|+++|++.+.+++||++++.+|++.+|++ T Consensus 43 ~~~~~~G~~it~P~~~~l~Gd~~~~~~~~~i~~~kitt~~~~a~i~~~~kg~~~tD~a~~~sg~dp~~~i~~q~a~~w~~ 122 (351) T protein:vir:15 43 PHLLEAGTRITVPFLNDLTGDPDNWTDSDDIDVNNLTSGKQQGIKFYQTKAYGYTDLGTMISGAPVQETIGNRFAAFWQR 122 (351) T ss_pred HHhhcCCCEEEecccccCCCcccccCCCcccchheecccceeEEEEeeccceehhhhhHhhccchHHHHHHHHHHHHHHH Confidence 444468999999998 5 9999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcccc----------------cccccccCHHHHHHHHHHhhccCC-CceEEEECHHHHHHHHhhhhhhhcc Q lcl|Aclame:pro 78 KVDDDLLKAAKTTS----------------QTVSTKANVDGVQAALDIFNDEDA-QAYVLIVNPKDAAKIRKDANAKNIG 140 (231) Q Consensus 78 ~vd~~~~~~l~t~~----------------~~~~~~~~~d~i~da~~~l~~~~~-~~~v~vv~p~~~~~L~k~~~~~~~~ 140 (231) +.++.+++.|++.. ...+..++++.+++|.++|+|+.. ...+++|||.++..|+++.. .+.- T Consensus 123 ~~q~~lla~l~gv~~~~~~~~~~~~d~t~~~~~~~~is~~~l~~A~~~~GD~~~~~~~~ivmhS~v~~~L~~~~l-i~~~ 201 (351) T protein:vir:15 123 ADQKTLLSVLKGVMGVTKIANSKVYDQTKVSPSEPMFGAKGFTGAIGLMGDLQDTAFGAIAVNSATYSLMKVQGL-IETI 201 (351) T ss_pred HHHHHHHHHHHHHhhchhhcccceeccccccccccccCHHHHHHHHHHhccccccceEEEEEChHHHHHHHhhhh-hhhc Confidence 99999999876321 012335788999999999999765 47999999999999998753 2222 Q ss_pred ccccCceeeeccceeecceeEEEcCCCcc-----CceEEEEEecCCceEEEeecCCccceeccchhh--cccEEEEEEEE Q lcl|Aclame:pro 141 SEVGANALINGTYADVLGAQIVRSKKLAE-----GSALMFKIVSNSPALKLVLKRGVQVETDRDIVT--KTTVITADEHY 213 (231) Q Consensus 141 ~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~-----~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~--~~~~i~~~~~y 213 (231) .+.. .++.||+++|++|++|++||. ....+..++.++||+++.. +...+|++||+++ +.+.++.++|| T Consensus 202 ~~s~----~~~~i~t~~G~~VivdD~~p~~~~~~~~~~ytsyl~~~GAi~~~~-~~~~ve~~rd~~~~~g~d~l~~r~~~ 276 (351) T protein:vir:15 202 QPQN----GATPFEAYNGLRIVLDDDIEIDLTDKTKPVSTSYIFAPGAVRYST-NMRSTETKYDPLINGGQDVIVQKRVG 276 (351) T ss_pred cccc----cCcccceecceEEEEcCCCccccCCCCCceeEEEEEecceeeeec-CCcCcceeecccCCCCceEEEEeeee Confidence 2222 245789999999999999984 2234556777899999765 4557899998776 57899999998 Q ss_pred EEEEEcCCcEEEEEe-----ccC Q lcl|Aclame:pro 214 AAYLYDLTKVVNITF-----TGV 231 (231) Q Consensus 214 ~~~~~~~~~vv~l~~-----~~~ 231 (231) ..|+. ++ +.+. ++. T Consensus 277 ~~hp~---G~-s~~~~~~~~~~~ 295 (351) T protein:vir:15 277 TIHVA---GT-SIKASFSPSKAS 295 (351) T ss_pred eeeee---ee-eecccccccCcC Confidence 86653 22 1111 111 No 22 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=100.00 E-value=2.1e-34 Score=205.21 Aligned_cols=229 Identities=19% Similarity=0.160 Sum_probs=185.9 Q ss_pred CCCcccCceEEeccccCC--cccccCCCccC--ccccccceeEEEeehccc-eeeecHHHHHhcCCCHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGD--AADVAEGGEIS--LDKIGTTTKSVTIKKAAK-GTEITDEAALSGYGDPIGESNKQLGLSL 75 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igd--a~~v~EG~~i~--~~~lt~~~~~~tikk~g~-~~~itD~~~~~~~~d~~~~~~~~~a~~i 75 (231) ..-+++|+|++||+ +|. ++++.+|++++ ++.++.++.+++|++..+ .+.|.|.+..++..|++++..++++++| T Consensus 51 ~~~~~~G~sv~i~~-ig~~t~~~~~~g~~l~~~~~~~~~~e~~ltID~~~~~~~~VddlD~~q~~~D~~~~~~~~~g~aL 129 (347) T protein:vir:15 51 LRSIASGKSAQFPV-IGRTKAAYLKPGENLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESL 129 (347) T ss_pred cccccccceeEeee-ccceeeeeeccCCCCCCCCCCCccceEEEEechhhhhhHHhhhHHHHhcCCcchHHHHHHHHHHH Confidence 22345799999976 565 45789999985 466899999999999888 4899999999999999999999999999 Q ss_pred HHHHHHHHHHHhcccc-------------c---------ccccc---------cCHHHHHHHHHHhhccC--CCceEEEE Q lcl|Aclame:pro 76 ANKVDDDLLKAAKTTS-------------Q---------TVSTK---------ANVDGVQAALDIFNDED--AQAYVLIV 122 (231) Q Consensus 76 a~~vd~~~~~~l~t~~-------------~---------~~~~~---------~~~d~i~da~~~l~~~~--~~~~v~vv 122 (231) |++.|+.++..+.... + ..++. .-++.+.+|...|.+.+ ...+|+++ T Consensus 130 A~~~D~~i~~~l~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv 209 (347) T protein:vir:15 130 AMAADGAVLAELAGLVNLPDASNENIEGLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYT 209 (347) T ss_pred HHHHHHHHHHHHHHHhhccccccccccccCccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEe Confidence 9999999886543110 0 00000 11666777778898765 47899999 Q ss_pred CHHHHHHHHhhhhhhhccccccCceeeeccceeecceeEEEcCCCccCce------------EE---------------- Q lcl|Aclame:pro 123 NPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSA------------LM---------------- 174 (231) Q Consensus 123 ~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~------------~~---------------- 174 (231) +|++|+.|++++++.... +.+...+.+|.+|+++|++|+.||++|.+.+ +. T Consensus 210 ~P~~y~~LL~~~~~~~~d-~~~~~~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~ 288 (347) T protein:vir:15 210 TPDNYSAILAALMPNAAN-YQALIDHERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNV 288 (347) T ss_pred CHHHHHHHhccccccccc-ccccccccceEEEEEeceEEEecccccccccccccccccccccccccccccceeeeccccc Confidence 999999999999887554 4556678999999999999999999985321 00 Q ss_pred EEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 175 FKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 175 ~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) .-.+..+.|++.+..+++++|.+|++.++.|.|.+.+.||+++++|++++.|.++.| T Consensus 289 ~~l~~h~~A~g~v~~~~~~~e~~~~~~~~~d~i~~~~~~G~~vlrP~~av~~~~~~~ 345 (347) T protein:vir:15 289 VGLFQHRSAVGTVKLKDLALERARRANYQADQIIAKYAMGHGGLRPEAAGAIVLPKV 345 (347) T ss_pred eeeeeccceeeeeEeeceeeeecccchhhhhhhehhhhcCCceeccccEEEEecCCC Confidence 112335678889999999999999999999999999999999999999999999999 No 23 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=100.00 E-value=2.9e-34 Score=204.46 Aligned_cols=229 Identities=15% Similarity=0.238 Sum_probs=185.5 Q ss_pred CCCcccCceEEeccccCC--cccccCCCccCccccccceeEEEeehcc-ceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGD--AADVAEGGEISLDKIGTTTKSVTIKKAA-KGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igd--a~~v~EG~~i~~~~lt~~~~~~tikk~g-~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) |.-...||||+||.. |+ +.++.+|..+++++++.++.+++|+|.. ..+.|+|++..++..|++++..++++++||+ T Consensus 47 ~~~~~~Gdtv~ip~~-g~~~~~d~~~~~~i~~~~~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~ 125 (341) T protein:vir:94 47 GAQVKKGDTFHVPRI-SELGVEDKATDVPVGVQPVNDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAK 125 (341) T ss_pred cccccCCceEEEecc-CcceeeeecCCCccccccccCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHH Confidence 554566999999974 55 5689999999999999999999998864 5799999999999999999999999999999 Q ss_pred HHHHHHHHHhcccccc------------c---ccccCHHHHHHHHHHhhccC--CCceEEEECHHHHHHHHhhhhhhhcc Q lcl|Aclame:pro 78 KVDDDLLKAAKTTSQT------------V---STKANVDGVQAALDIFNDED--AQAYVLIVNPKDAAKIRKDANAKNIG 140 (231) Q Consensus 78 ~vd~~~~~~l~t~~~~------------~---~~~~~~d~i~da~~~l~~~~--~~~~v~vv~p~~~~~L~k~~~~~~~~ 140 (231) ++|+++++.+...+.. . ...++++.|.+|...|++.+ .++++++++|+.++.|+++++|.... T Consensus 126 ~~D~~i~~~~a~~~~~~~~~~~~~~~~~~t~~~~~~~~~~i~~a~~~Lde~~VP~~gR~lvv~P~~~~~Ll~~~~~~~~~ 205 (341) T protein:vir:94 126 DMTGSILGLRAAVQNTASQNVFSSSNGAITGNGQAFSFAVFLAARRLLLEADVPEEKIVLLISPGQESALFTIPQFISKD 205 (341) T ss_pred HHHHHHHHHhhhccccccCccccCccccccCchhhhhHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhhchhhhhhh Confidence 9999988765432211 0 12246899999999998875 47899999999999999999987654 Q ss_pred ccccCceeeeccceeecceeEEEcCCCccCceEEEE---------------------------------EecCCceEEEe Q lcl|Aclame:pro 141 SEVGANALINGTYADVLGAQIVRSKKLAEGSALMFK---------------------------------IVSNSPALKLV 187 (231) Q Consensus 141 ~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~---------------------------------~~~~~~A~~~~ 187 (231) ..+...+++|.+|+++|++|+.|+++|.+.+.... +...+.|++.+ T Consensus 206 -~~g~~~l~~G~ig~i~G~~V~~Sn~lp~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~ 284 (341) T protein:vir:94 206 -FINNAPIAQGQIGSLMGVRVIRTSLIGNNSATGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTA 284 (341) T ss_pred -ccccchhheeeeeeEeceEEEEeccccccccccccccccceecccccccccccccccccccccccEEEEEEecccccce Confidence 45566789999999999999999999976543211 11111222222 Q ss_pred -----------ecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 188 -----------LKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 188 -----------~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) ..+...+|.+|++.++.|.|.+++.||+++++|+++|.|...|. T Consensus 285 k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~G~~~lrp~~~v~~~~~~~ 339 (341) T protein:vir:94 285 VMCHMDWAAAVVSKAPRVTQSFENREQVWLMVGRQAYGARLYRPLHAVNIHTTGD 339 (341) T ss_pred eeecchhhhccccccccccccchhhhhhhhhhhhhhhcccccCcceeEEEecCcC Confidence 23446788999999999999999999999999999999999998 No 24 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=100.00 E-value=4.5e-34 Score=203.39 Aligned_cols=229 Identities=20% Similarity=0.167 Sum_probs=187.6 Q ss_pred CCCcccCceEEeccccCC--cccccCCCccC--ccccccceeEEEeehccce-eeecHHHHHhcCCCHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGD--AADVAEGGEIS--LDKIGTTTKSVTIKKAAKG-TEITDEAALSGYGDPIGESNKQLGLSL 75 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igd--a~~v~EG~~i~--~~~lt~~~~~~tikk~g~~-~~itD~~~~~~~~d~~~~~~~~~a~~i 75 (231) ..-+++|++++||. +|. +.++.+|++++ +..++.++.+++|++..++ +.|.|.+..++..|++++..++++++| T Consensus 51 ~r~~~~G~sv~i~~-iG~~t~~~~~~g~~l~~~~~~~~~~e~~ltiD~~~y~~~~VddiD~~q~~~D~~~~~~~~~g~aL 129 (347) T protein:vir:33 51 LRSIASGKSAQFPV-IGRTKAAYLKPGENLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESL 129 (347) T ss_pred cccccccceeEeee-ccceeeeeecCCCCCCCCCCCCccceEEEEechhhhhhHHHhhHHHHhcCCchhHHHHHHHHHHH Confidence 33456899999976 565 45789999985 4668999999999999885 899999999999999999999999999 Q ss_pred HHHHHHHHHHHhcc-----------------c---------ccccc-----cccCHHHHHHHHHHhhccC--CCceEEEE Q lcl|Aclame:pro 76 ANKVDDDLLKAAKT-----------------T---------SQTVS-----TKANVDGVQAALDIFNDED--AQAYVLIV 122 (231) Q Consensus 76 a~~vd~~~~~~l~t-----------------~---------~~~~~-----~~~~~d~i~da~~~l~~~~--~~~~v~vv 122 (231) |++.|+.++..+.. . ++... ....|+.|.+|...|.+.+ ...+|+|+ T Consensus 130 A~~~D~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~~gR~~vv 209 (347) T protein:vir:33 130 AMAADGAVLAELAGLVNLPDGSNENIEGLGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYT 209 (347) T ss_pred HHHHHHHHHHHHHHhhhhhcccccccccccccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCcEEEe Confidence 99999988754311 0 00000 0123788899999998776 46899999 Q ss_pred CHHHHHHHHhhhhhhhccccccCceeeeccceeecceeEEEcCCCccCceEE---------------------------- Q lcl|Aclame:pro 123 NPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALM---------------------------- 174 (231) Q Consensus 123 ~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~---------------------------- 174 (231) +|++|+.|++++++.... +.+.+.+.+|.+++++|++|+.||++|.+.+.. T Consensus 210 ~P~~y~~Ll~~~~~~~~d-~~~~~~~~~G~V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~ 288 (347) T protein:vir:33 210 TPDNYSAILAALMPNAAN-YQALLDPERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNV 288 (347) T ss_pred CHHHHHHHhccccccccc-cccccccccceeEEEeceeEEEecccccCccccccccccccccccccCCcccceeccccce Confidence 999999999999887654 345667899999999999999999998643210 Q ss_pred EEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 175 FKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 175 ~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) .-.+..+.|++.+..+++++|..|++.++.|.|.+.+.||+++++|++++.|.++.| T Consensus 289 ~gl~~h~~A~g~v~~~~~~~e~~r~~~~~~d~i~~~~~~G~~vlrP~~av~i~~~~~ 345 (347) T protein:vir:33 289 VGLFQHRSAVGTVKLKDLALERARRANYQADQIIAKYAMGHGGLRPEAAGAIVLPKV 345 (347) T ss_pred eeeeecchhheeeeeeceeeeeccchhhhhHhhhhhhhcCCceecccceEEEecCCC Confidence 012335678888888999999999999999999999999999999999999999999 No 25 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=100.00 E-value=6.2e-34 Score=202.66 Aligned_cols=224 Identities=24% Similarity=0.219 Sum_probs=165.8 Q ss_pred CCCcccCceEEeccc--cCCcccccCCCccCccccccc---eeEEEeehccceeeecHHHH-HhcCCCHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--IGDAADVAEGGEISLDKIGTT---TKSVTIKKAAKGTEITDEAA-LSGYGDPIGESNKQLGLS 74 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--igda~~v~EG~~i~~~~lt~~---~~~~tikk~g~~~~itD~~~-~~~~~d~~~~~~~~~a~~ 74 (231) ..-+..|+||++|+| +||+++|+||++||.++++.+ ..+++++|++|++ |||++ +.+++||+.++.+||.++ T Consensus 40 ~~p~a~G~tIt~pK~~~tgda~dVaEGe~Iplskvt~~~~~t~t~kikK~rK~t--TdEAIqlsGygdpvgead~qL~~~ 117 (295) T protein:vir:99 40 RETLTNDLKIQTYKWEVTLDQTDPGEGETIPLSKVTRTKDKDYTVKWFKKRRAT--TAEAIARHGAARAITEADKRIMRE 117 (295) T ss_pred ccccccCCeEEeeeeeeecccccccCCcccchhhheeeeeeeeEEEeeeecccc--cHHHHHhcCCCchhHHHHHHHHHH Confidence 455677999999998 999999999999999999976 5899999999976 99997 789999999999999999 Q ss_pred HHHHHHHHHHHHhcccccccccc---cCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhh-ccccccCceeee Q lcl|Aclame:pro 75 LANKVDDDLLKAAKTTSQTVSTK---ANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKN-IGSEVGANALIN 150 (231) Q Consensus 75 ia~~vd~~~~~~l~t~~~~~~~~---~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~-~~~~~~~~~~~~ 150 (231) |++++|+++++.|++++++++.. ..++.+.++++.|.++++.+.|+||||+++++||++....+ +.+..|.+.+.| T Consensus 118 ia~kId~D~~~~lktat~t~tg~~lq~a~a~~~~al~~f~Ee~~~~~V~FVnP~D~a~yl~~A~~~~~~a~~fG~~~L~n 197 (295) T protein:vir:99 118 LQNGIKDAFFTFLKTKPTKVKGVGLQKALSASWAKLATFNEFEGSPLVSFVSPLDVANYLGDTKVGADASNVFGMTLLKN 197 (295) T ss_pred HHHhhhHHHHHHhccCceeeehhhHHHHHHHhhhhhhhcccccCCceEEEEehHHHHHHHhccccccchhhhhhhhhhhh Confidence 99999999999999999888754 36777888899999888889999999999999999987654 234577777664 Q ss_pred ccceeeccee-EEEcCCCccCceEEEEEecCCceEEEeecCCc----cceecc--ch----hhcccEEEEEEE-E-EEEE Q lcl|Aclame:pro 151 GTYADVLGAQ-IVRSKKLAEGSALMFKIVSNSPALKLVLKRGV----QVETDR--DI----VTKTTVITADEH-Y-AAYL 217 (231) Q Consensus 151 G~ig~~~G~~-Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v----~vE~~R--d~----~~~~~~i~~~~~-y-~~~~ 217 (231) ++|++ |++|+++|+|+.+.....+-..|+.-+...++ ..-+|- .+ .+..+-.+.... + |..+ T Consensus 198 -----fLG~q~II~S~kv~~G~~~aT~~~Ni~~ay~~~~~g~l~~~f~~~~D~tglIg~~h~~~~~~~t~et~~~~~~~l 272 (295) T protein:vir:99 198 -----FLGMQNVIVMPSVPEGKIYSTAVENLVFASLNVKGGDLGGLFADFTDETGLIAAARNRQLSNLTYESVFFGANVL 272 (295) T ss_pred -----hhccceEEEcccCCCceEEEeeccceEEEEecCCchhhhhhhhhccCcccceEEEeccccceeeehhhhHhHHHh Confidence 99997 99999999999986543222222222211111 111110 00 000111111111 1 1111 Q ss_pred E--cCCcEEEEEeccC Q lcl|Aclame:pro 218 Y--DLTKVVNITFTGV 231 (231) Q Consensus 218 ~--~~~~vv~l~~~~~ 231 (231) . .+++||+.++.+- T Consensus 273 fpE~~dgiv~~tI~~~ 288 (295) T protein:vir:99 273 FAEIPEGVVEATIEAA 288 (295) T ss_pred cccccceEEEEEEecC Confidence 1 5678999888665 No 26 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=99.96 E-value=3.5e-33 Score=198.51 Aligned_cols=227 Identities=15% Similarity=0.153 Sum_probs=182.3 Q ss_pred CCCcccCceEEeccccCCc--ccccCCCccCcc-ccccceeEEEeehccc-eeeecHHHHHhcCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGDA--ADVAEGGEISLD-KIGTTTKSVTIKKAAK-GTEITDEAALSGYGDPIGESNKQLGLSLA 76 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igda--~~v~EG~~i~~~-~lt~~~~~~tikk~g~-~~~itD~~~~~~~~d~~~~~~~~~a~~ia 76 (231) ++ +.+|+|++||+ +|+. .++.+|+++.++ .++.++++++|++..+ .+.|.|.+..++..|++++..++++++|| T Consensus 54 r~-i~~G~tv~i~~-ig~~~~~~~~~g~~l~~~~~~~~~~~~l~ID~~ky~~~~VddiD~~q~~~dl~~~~~~~~g~aLA 131 (332) T protein:vir:78 54 YD-LRGGKSKQFMF-TGKLSAGYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALA 131 (332) T ss_pred cc-ccccceEEEEe-ccceeEeeecCCCCCCCCCCCCCceEEEEEehhhhhHHHHHhHHHHhcCcchHHHHHHHHHHHHH Confidence 44 35899999976 5665 579999999776 5999999999999777 48999999999999999999999999999 Q ss_pred HHHHHHHHHHhcccc--------------------cccccccCHHHHHHHHHHhhccC--CCceEEEECHHHHHHHHh-- Q lcl|Aclame:pro 77 NKVDDDLLKAAKTTS--------------------QTVSTKANVDGVQAALDIFNDED--AQAYVLIVNPKDAAKIRK-- 132 (231) Q Consensus 77 ~~vd~~~~~~l~t~~--------------------~~~~~~~~~d~i~da~~~l~~~~--~~~~v~vv~p~~~~~L~k-- 132 (231) +++|+.++..+..+. ...+....|+.|.+|...|.+.+ ...+|++++|+.|+.|++ T Consensus 132 ~~~D~~i~~~l~~aa~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~ 211 (332) T protein:vir:78 132 THYDERIARVLAKASAEASPVTGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSV 211 (332) T ss_pred HHHHHHHHHHHHhhhcccCcccccccccccccCCccccCHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHHhhc Confidence 999998887553221 01112235788999999998776 467999999999999998 Q ss_pred hhhhhhccccccCceeeecc-ceeecceeEEEcCCCccCceEE--------------------EEEecCCceEEEeecCC Q lcl|Aclame:pro 133 DANAKNIGSEVGANALINGT-YADVLGAQIVRSKKLAEGSALM--------------------FKIVSNSPALKLVLKRG 191 (231) Q Consensus 133 ~~~~~~~~~~~~~~~~~~G~-ig~~~G~~Vv~s~~~~~~~~~~--------------------~~~~~~~~A~~~~~k~~ 191 (231) ++++........++.+++|. +++++|++|+.||++|.+.+.. ..++..+.|+++....+ T Consensus 212 d~~~~n~~~~~~~~~~~~g~~i~~i~G~~V~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~ 291 (332) T protein:vir:78 212 DTNILNREIGNSQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVA 291 (332) T ss_pred CceeeeeeccccccceecceeeeEEeeeEEEecCccccCcccccccccccccccccccccccceEEeecccceeeeeeec Confidence 66665554444455678874 8999999999999999643211 12344577888888777 Q ss_pred ccc---eeccchhhcccEEEEEEEEEEEEEcCCcEEEEEec Q lcl|Aclame:pro 192 VQV---ETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFT 229 (231) Q Consensus 192 v~v---E~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~ 229 (231) +++ |.+|+++++.|.|.+++.||+++++|++++.|+-. T Consensus 292 ~~~~~t~~~~~~~~~~d~i~~~~~~G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 292 PTIQTTSGDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) T ss_pred cchhhhhcccchhhhHhhhhhhhhhcCceecccceEEEeeC Confidence 655 46889999999999999999999999999999777 No 27 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=99.96 E-value=3.3e-33 Score=198.69 Aligned_cols=229 Identities=19% Similarity=0.173 Sum_probs=187.4 Q ss_pred CCCcccCceEEeccccCCc--ccccCCCccCcc--ccccceeEEEeehccce-eeecHHHHHhcCCCHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGDA--ADVAEGGEISLD--KIGTTTKSVTIKKAAKG-TEITDEAALSGYGDPIGESNKQLGLSL 75 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igda--~~v~EG~~i~~~--~lt~~~~~~tikk~g~~-~~itD~~~~~~~~d~~~~~~~~~a~~i 75 (231) .--+++|++++||. +|.. ..+.+|++++.. .++.++.+++|++..++ +.|.|.+..++..|+.++..+++|++| T Consensus 52 ~r~i~~gks~~~~~-iG~~~~~~~~~G~~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~r~~~s~~~G~aL 130 (345) T protein:vir:22 52 VRSISSGKSAQFPV-LGRTQAAYLAPGENLDDKRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESL 130 (345) T ss_pred eeeccccceEEEee-ecceEEEeeecCCCCCCCCCCcccceEEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHH Confidence 34566899999975 6765 579999999654 57889999999998885 899999999999999999999999999 Q ss_pred HHHHHHHHHHHhcccc------------------------cc------cccccCHHHHHHHHHHhhccC--CCceEEEEC Q lcl|Aclame:pro 76 ANKVDDDLLKAAKTTS------------------------QT------VSTKANVDGVQAALDIFNDED--AQAYVLIVN 123 (231) Q Consensus 76 a~~vd~~~~~~l~t~~------------------------~~------~~~~~~~d~i~da~~~l~~~~--~~~~v~vv~ 123 (231) |++.|+.++..+.... .. .+....|+.|.+|.+.|.+.+ ...+|++++ T Consensus 131 A~~~D~~i~~~l~k~a~~~~~~~~~~~~~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~~~R~~vv~ 210 (345) T protein:vir:22 131 AMAADGAVLAEIAGLCNVESKYNENIEGLGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAADRVFYCD 210 (345) T ss_pred HHHHHHHHHHHHHHhhcccccccccccccccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCccCCEEEeC Confidence 9999998886542110 00 001124888999999998765 467999999 Q ss_pred HHHHHHHHhhhhhhhccccccCceeeeccceeecceeEEEcCCCccCc---------------------eEE-------E Q lcl|Aclame:pro 124 PKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGS---------------------ALM-------F 175 (231) Q Consensus 124 p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~---------------------~~~-------~ 175 (231) |++|+.|++++++.... +.+.+...+|.++++.|++|+.||++|.+. ..+ + T Consensus 211 P~~y~~Ll~~~~~~~~~-~~~~~~~~~G~V~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 289 (345) T protein:vir:22 211 PDSYSAILAALMPNAAN-YAALIDPEKGSIRNVMGFEVVEVPHLTAGGAGTAREGTTGQKHVFPANKGEGNVKVAKDNVI 289 (345) T ss_pred hHHHHHHhccccccccc-cccccccccceEEEEeceEEEecccccccccCccccCcccccccccccccceeeeeccCceE Confidence 99999999998886543 556677789999999999999999987421 111 1 Q ss_pred EEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 176 KIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 176 ~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) -+.+.+.|++.+..+++++|.+|++.++.|.|.+.+.||+++++|+++++|+++=- T Consensus 290 ~l~~h~~A~~~v~~~~~~~e~~r~~~~~~d~I~~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 290 GLFMHRSAVGTVKLRDLALERARRANFQADQIIAKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred EEEEehhheeeeeeecceeeeeechhHHHHHHHHHHhcCCcccccceeEEEEEeeC Confidence 13446789999999999999999999999999999999999999999999888766 No 28 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=99.96 E-value=5.9e-33 Score=197.27 Aligned_cols=228 Identities=19% Similarity=0.208 Sum_probs=183.1 Q ss_pred CCCcccCceEEeccccCCc--ccccCCCcc--CccccccceeEEEeehccce-eeecHHHHHhcCCCHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGDA--ADVAEGGEI--SLDKIGTTTKSVTIKKAAKG-TEITDEAALSGYGDPIGESNKQLGLSL 75 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igda--~~v~EG~~i--~~~~lt~~~~~~tikk~g~~-~~itD~~~~~~~~d~~~~~~~~~a~~i 75 (231) ++ +++|++++||. +|.. .++.+|++| +++.++.++.+++|+++.++ +.|.|.+..++..|++++..++++++| T Consensus 51 r~-i~~G~sv~i~~-iG~~tv~~~t~G~~l~~~~~~~~~~e~~itID~~~~~~~~VddiD~~q~~~D~~~~~~~~~g~aL 128 (347) T protein:vir:94 51 RT-IQNGKSAQFPV-MGRTSGVYLAPGERLSDKRKGIKHTEKVITIDGLLTADVMIFDIEDAMNHYDVAGEYSNQLGEAL 128 (347) T ss_pred cc-ccccceEEEec-ccceeeeeecCCCCcCCCCCCCCcceEEEEecchhhhhHHhhhHHHHhcCcchHHHHHHHHHHHH Confidence 44 45799999976 5664 579999999 45678999999999998875 799999999999999999999999999 Q ss_pred HHHHHHHHHHHhcc---cc------------c----------ccc----cccCHHHHHHHHHHhhccC--CCceEEEECH Q lcl|Aclame:pro 76 ANKVDDDLLKAAKT---TS------------Q----------TVS----TKANVDGVQAALDIFNDED--AQAYVLIVNP 124 (231) Q Consensus 76 a~~vd~~~~~~l~t---~~------------~----------~~~----~~~~~d~i~da~~~l~~~~--~~~~v~vv~p 124 (231) +++.|+.++..+.. .+ . ..+ ....++.|.+|...|.+.+ ..++|++++| T Consensus 129 a~~~D~~i~~~~~~~aa~~~~~~~~~~g~~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P 208 (347) T protein:vir:94 129 AIAADGAVLAEMAILCNLPAASNENIAGLGTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTP 208 (347) T ss_pred HHHHHHHHHHHHHHHhccccccccccCCCcccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCH Confidence 99999988754421 00 0 000 0123677888889998665 4689999999 Q ss_pred HHHHHHHhhhhhhhccccccCceeeeccceeecceeEEEcCCCccCce--------------------------E----- Q lcl|Aclame:pro 125 KDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSA--------------------------L----- 173 (231) Q Consensus 125 ~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~--------------------------~----- 173 (231) +.|+.|++++.+... ...+...+.+|.+|+++|++|+.||++|.+.. + T Consensus 209 ~~~~~Ll~~~~~~~~-~~~~~~~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~ 287 (347) T protein:vir:94 209 DNYSAILAALMPNAA-NYAALIDPETGNIRNVMGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMD 287 (347) T ss_pred HHHHHHhccchhhhh-hccccccccccceEEEeceEEEecCcccccccccccccCcceecCcccccccccchhhhccccc Confidence 999999998877654 34455668899999999999999999984211 0 Q ss_pred -EEEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 174 -MFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 174 -~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) .+-..+.+.|++.+...++++|.+|+++++.|.|.+.+.||+++++|+++++|+.+.- T Consensus 288 ~~~~l~~h~~A~~~v~~~~~~~e~~r~~~~~~d~i~~~~~~G~~~~rP~~a~~~~~~~A 346 (347) T protein:vir:94 288 NVVGLFSHRSAVGTVKLRDLALERDRDVDAQGDLIVGKYAMGHGGLRPEAAGALVFSPA 346 (347) T ss_pred ceeEEEeehhhhhhhhcccccccchhchhhHHHHhhhhhhhcCcccccceeEEEEecCC Confidence 0112345678888888899999999999999999999999999999999999988866 No 29 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=99.96 E-value=3.1e-32 Score=193.34 Aligned_cols=221 Identities=21% Similarity=0.271 Sum_probs=158.9 Q ss_pred CCCcccCceE-Eeccc--cCCcccccCCCccCccccccc---eeEEEeehccceeeecHHHH-HhcCCCHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLC-EYPND--IGDAADVAEGGEISLDKIGTT---TKSVTIKKAAKGTEITDEAA-LSGYGDPIGESNKQLGL 73 (231) Q Consensus 1 ~~~~~~G~ti-~~P~~--igda~~v~EG~~i~~~~lt~~---~~~~tikk~g~~~~itD~~~-~~~~~d~~~~~~~~~a~ 73 (231) .--+..|+|| ++|+| +|++++|+||++||.++++.+ ..+++++|++|++ |||++ +.+++||++++.+||.+ T Consensus 46 ~~pla~GstIkt~k~~~y~gda~dVaEGe~Iplskvt~~~~~t~t~~ikK~rK~t--TdEAIqlsGyg~aVgetd~qL~~ 123 (296) T protein:vir:98 46 KISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVR 123 (296) T ss_pred cccccCCCEEeeccceeeeeccccccCCcccchhhheeeecceEEEEeecccccc--CHHHHHhhcCCchhHHHHHHHHH Confidence 4556679999 55666 999999999999999999986 4999999999996 99997 89999999999999999 Q ss_pred HHHHHHHHHHHHHhcccccccccccCHHHHHHHH--------HHhhccCCCceEEEECHHHHHHHHhhhhhhhccccccC Q lcl|Aclame:pro 74 SLANKVDDDLLKAAKTTSQTVSTKANVDGVQAAL--------DIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGA 145 (231) Q Consensus 74 ~ia~~vd~~~~~~l~t~~~~~~~~~~~d~i~da~--------~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~ 145 (231) +|++++|+++++.|++++++++. +.+.+.+|+ .+|+|+++.+.|+||||.+++++|++... ..++..+. T Consensus 124 ~iq~kId~d~~t~LktaT~t~~~--t~~~lQ~Ala~~~~~l~~~feded~~~~V~FVnP~D~a~ylg~a~i-t~qt~fG~ 200 (296) T protein:vir:98 124 QLQKKIRTDFVTALKTGTGTQDA--LGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGI-TTQTAFGL 200 (296) T ss_pred HHHHhhhHHHHHHHhcccceeee--chhhHHHHHHHHhhhhhhhccccCCCceEEEEehHHHHHHhcCCcc-chhheech Confidence 99999999999999998876442 445666555 78998888899999999999999999865 44444444 Q ss_pred ceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCc----cceeccc--h----hhcccEEEEEEE-E- Q lcl|Aclame:pro 146 NALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGV----QVETDRD--I----VTKTTVITADEH-Y- 213 (231) Q Consensus 146 ~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v----~vE~~Rd--~----~~~~~~i~~~~~-y- 213 (231) ..+.| ++|+.|++|+++|+|+.+.....+-..|+.-+...++ .+-+|-- + .+..+-.+.... + T Consensus 201 tyl~n-----fLG~~II~S~kV~~G~~~~T~~~Ni~~ay~~~~~~~l~~~f~~~~d~tglIGv~h~~~~~~~t~eT~~~~ 275 (296) T protein:vir:98 201 TYLVD-----FTGTVIISTNDVTKGEIWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVS 275 (296) T ss_pred hhhhh-----ccccEEEEcCcCCCceEEEeeecceEEEeecccccchhhhhccccccccceEEEeccccceeeehhHhHh Confidence 44332 9999999999999999987654332223322211111 1111100 0 000111111111 1 Q ss_pred EEEEE--cCCcEEEEEeccC Q lcl|Aclame:pro 214 AAYLY--DLTKVVNITFTGV 231 (231) Q Consensus 214 ~~~~~--~~~~vv~l~~~~~ 231 (231) |..+. .+++|++.++++- T Consensus 276 ~~~lfpE~~dgiv~~tI~~~ 295 (296) T protein:vir:98 276 GMLMYPERIDGIVKVTLTPG 295 (296) T ss_pred HHHhcccccceEEEEEecCC Confidence 11111 5678999999877 No 30 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=99.96 E-value=8.8e-33 Score=196.31 Aligned_cols=229 Identities=19% Similarity=0.188 Sum_probs=182.3 Q ss_pred CCCcccCceEEeccccCCc--ccccCCCccCc--cccccceeEEEeehccce-eeecHHHHHhcCCCHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGDA--ADVAEGGEISL--DKIGTTTKSVTIKKAAKG-TEITDEAALSGYGDPIGESNKQLGLSL 75 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igda--~~v~EG~~i~~--~~lt~~~~~~tikk~g~~-~~itD~~~~~~~~d~~~~~~~~~a~~i 75 (231) .--+++|++++||. +|.. ..+.+|++++. +.+..++.+++|++..++ +.|.|.+..++..|++++..++++++| T Consensus 52 ~r~i~~g~s~~~~~-iG~~~~~~~~~G~~l~~t~~~~~~~e~~l~ID~~~y~~~~VdDiD~~q~~~D~r~~~~~~~G~aL 130 (344) T protein:vir:10 52 VRSISSGKSAQFPV-LGRTQAAYLAPGENLDDIRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESL 130 (344) T ss_pred eeeecccceEEEEe-eceeEEEeeecCCCCCCCCCCcccceEEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHH Confidence 23466899999975 6765 47899999964 568999999999998885 899999999999999999999999999 Q ss_pred HHHHHHHHHHHhcccc-----------------------c--cccc-----ccCHHHHHHHHHHhhccC--CCceEEEEC Q lcl|Aclame:pro 76 ANKVDDDLLKAAKTTS-----------------------Q--TVST-----KANVDGVQAALDIFNDED--AQAYVLIVN 123 (231) Q Consensus 76 a~~vd~~~~~~l~t~~-----------------------~--~~~~-----~~~~d~i~da~~~l~~~~--~~~~v~vv~ 123 (231) |+..|+.++..+.... . ..+. ...|+.|.+|.+.|.+.+ ...+|++++ T Consensus 131 A~~~D~~i~~~la~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~ 210 (344) T protein:vir:10 131 AMAADGAVLAEIAGLCNVESQYNENITGLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPSSDRVFYCD 210 (344) T ss_pred HHHHHHHHHHHHHhhhccccccccccccccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCCEEEeC Confidence 9999998876542100 0 0001 123778888999998765 467999999 Q ss_pred HHHHHHHHhhhhhhhccccccCceeeeccceeecceeEEEcCCCccCce-----------EE----------------EE Q lcl|Aclame:pro 124 PKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSA-----------LM----------------FK 176 (231) Q Consensus 124 p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~-----------~~----------------~~ 176 (231) |++|+.|++++++... .+.+.+...+|.++++.|++|+.||++|.+.. +. +- T Consensus 211 P~~y~~Ll~~~~~~~~-~~~~~~~~~~G~V~~v~G~~V~~Sn~lp~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~ 289 (344) T protein:vir:10 211 PDSYSAILAALMPNAA-NYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAGTSREGTTGQKHAFPATKSGNDKVAKDNVIG 289 (344) T ss_pred hHHHHHHhhccccccc-ccccccceeeeEEEEEeceEEEeccccccccCCcccccccCccccccCCcccceeeecceeEE Confidence 9999999999887544 35567778999999999999999999985310 00 01 Q ss_pred EecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 177 IVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 177 ~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) .+..+.|++.+...++++|.+|+++++.|.|.+.+.||+++++|+++.++.++-- T Consensus 290 l~~h~~A~~~v~~~~~~~e~~r~~~~~~d~i~g~~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 290 LFMHRSAVGTVKLRDLALERARRANFQADQIIAKYAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred EeechhhhhhhhhccceeecccchhHHHHHHHHHhhcccceecccceEEEEeecC Confidence 2335667888888999999999999999999999999999999998855544444 No 31 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=99.96 E-value=1.9e-32 Score=194.51 Aligned_cols=229 Identities=15% Similarity=0.138 Sum_probs=181.4 Q ss_pred CCCcccCceEEeccccCCcc--cccCCCcc--CccccccceeEEEeehccce-eeecHHHHHhcCCCHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGDAA--DVAEGGEI--SLDKIGTTTKSVTIKKAAKG-TEITDEAALSGYGDPIGESNKQLGLSL 75 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igda~--~v~EG~~i--~~~~lt~~~~~~tikk~g~~-~~itD~~~~~~~~d~~~~~~~~~a~~i 75 (231) ---+++|++++||. +|..+ .+..|++| +++.+..++.+++|++..++ +.|.|.+..++..|++++..++++++| T Consensus 2 vr~i~~g~s~~~~~-iG~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~~~VdDiD~~qa~~Dlr~e~s~~~G~aL 80 (324) T protein:vir:99 2 TRTITSGKSAQFPV-MGRTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDVLIYDIEDAMNHYDVRSEYSTQMGEAL 80 (324) T ss_pred eeeeecCceEEEee-eeeeEeccccCCCCcCCCcCCcCcccEEEEecchhhhhhhhhhHHHHhcCccchhHHHHHHHHHH Confidence 33377899999976 67755 79999999 45789999999999999885 899999999999999999999999999 Q ss_pred HHHHHHHHHHHhcc-----cc-------------------cccccc----cCHHHHHHHHHHhhccC--CCceEEEECHH Q lcl|Aclame:pro 76 ANKVDDDLLKAAKT-----TS-------------------QTVSTK----ANVDGVQAALDIFNDED--AQAYVLIVNPK 125 (231) Q Consensus 76 a~~vd~~~~~~l~t-----~~-------------------~~~~~~----~~~d~i~da~~~l~~~~--~~~~v~vv~p~ 125 (231) |+..|+.++..+.. ++ +..... ..++.|.+|.+.|.+.+ ...+|++|+|+ T Consensus 81 A~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~Lde~~VP~~gR~~vv~P~ 160 (324) T protein:vir:99 81 AMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAFAKKYIPAGDRTFYTDPD 160 (324) T ss_pred HHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeChH Confidence 99999988754310 00 001111 23788888999998765 46899999999 Q ss_pred HHHHHHhhhhhhhccccccCceeeeccceeecceeEEEcCCCccCceE-------------------------EE----- Q lcl|Aclame:pro 126 DAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSAL-------------------------MF----- 175 (231) Q Consensus 126 ~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~-------------------------~~----- 175 (231) +|+.|++++.+.. ..+.+.+.+.+|.+++++|++|+.||++|.+.+. .. T Consensus 161 ~y~~Ll~~~~~~~-~~~~~~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ky~~d~~~~ 239 (324) T protein:vir:99 161 TYSAILAALMPNA-ANYAALIDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTGKMTVGADNV 239 (324) T ss_pred HHHHHhhcccccc-cccccccceecceEEEEeceEEEecCCccccccccccccccccccccccccccccccccccccCce Confidence 9998887755543 3445556789999999999999999999964211 00 Q ss_pred -EEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 176 -KIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 176 -~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) -+...+.|++.+...++++|..|+++++.|.|.+.+.||+++++|++++.+++++- T Consensus 240 ~gl~~~~~a~~tv~~~~~~~e~~~~~~~~~d~i~~~~a~G~~~lRPe~a~~v~l~~~ 296 (324) T protein:vir:99 240 VGLFVHRSAVATLKLKDMALERARRPEYQADQIIAKYAMGHGGLRPEAVGAIIFEDG 296 (324) T ss_pred eEEEEehhheEEEeeecceecceechhhHHHhhhhhhhhcCcccccceEEEEEEccC Confidence 02233556777777888999999999999999999999999999999987776554 No 32 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=99.96 E-value=1.9e-32 Score=194.43 Aligned_cols=230 Identities=13% Similarity=0.119 Sum_probs=184.5 Q ss_pred CCCcccCceEEeccccCCc--ccccCCCccCccccccceeEEEeehccce-eeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGDA--ADVAEGGEISLDKIGTTTKSVTIKKAAKG-TEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igda--~~v~EG~~i~~~~lt~~~~~~tikk~g~~-~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) .--+.+|+|++||. +|.+ ..+..|++|+.+.++.++.+++|++..++ +.|.|.+..++..|+.+++.+++|++||+ T Consensus 48 ~r~i~~G~s~~~~~-iG~~~~~~~~~g~~l~~~~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~ 126 (334) T protein:vir:80 48 VRSLRGTNQLRVDR-VGASTIAGRKAGEELVVQKNVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALAR 126 (334) T ss_pred eeeccccceEEEee-ecceeeeeecCCCCCCCCCcccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHH Confidence 22357799999984 6765 57999999999999999999999998774 89999999999999999999999999999 Q ss_pred HHHHHHHHHhcccc--------------c----------ccccccCH----HHHHHHHHHhhccCC-----CceEEEECH Q lcl|Aclame:pro 78 KVDDDLLKAAKTTS--------------Q----------TVSTKANV----DGVQAALDIFNDEDA-----QAYVLIVNP 124 (231) Q Consensus 78 ~vd~~~~~~l~t~~--------------~----------~~~~~~~~----d~i~da~~~l~~~~~-----~~~v~vv~p 124 (231) +.|+.++..+..+. + +.....+. +++.+|.+.|.+.+. .++|++|+| T Consensus 127 ~~D~~~~~~l~kaa~~~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P 206 (334) T protein:vir:80 127 QYDQACIIQLQKCGDFLAPAHLKPAFHDGILLPSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDP 206 (334) T ss_pred HHHHHHHHHHHHhhhhcccccccccccCCcceeecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeCh Confidence 99998775432111 0 01111233 445566777766542 359999999 Q ss_pred HHHHHHHhhhhhhhcc--ccccCceeeeccceeecceeEEEcCCCccCceE-----------------EEEEecCCceEE Q lcl|Aclame:pro 125 KDAAKIRKDANAKNIG--SEVGANALINGTYADVLGAQIVRSKKLAEGSAL-----------------MFKIVSNSPALK 185 (231) Q Consensus 125 ~~~~~L~k~~~~~~~~--~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~-----------------~~~~~~~~~A~~ 185 (231) ++|+.|++++++.++. ...+.....+|.+++++|++|+.||++|.+... .+-....+.|++ T Consensus 207 ~~y~~Ll~~~r~~n~d~~~s~~~~~~~~g~i~~v~G~~V~~Sn~~P~~~~t~~~~g~~~~~~agd~t~~~~~~~~~~Al~ 286 (334) T protein:vir:80 207 VIFSFLLEHDRLMNVEFGAKEGGNSFVGGRIAMLNGVRVVETPRFPQSAITANALGADFNVTDAEVRRKMITFIPSMALI 286 (334) T ss_pred HHHHHHhcccccccceeccccccccccceeEEEEeceEEEeecCCCCccccccccccccccccccccceEEEEEeCceEE Confidence 9999999999987652 223345567899999999999999999965211 011233578999 Q ss_pred EeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 186 LVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 186 ~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) ...-.+++.|.+|++.++++.|.+.+.||+++++|++++.+.++.+ T Consensus 287 t~~~~~~~~e~~~~~~~~~d~i~~~~a~G~g~lRPeaa~vv~~~~~ 332 (334) T protein:vir:80 287 SAQVHPVSAQFWEEKKDFGHYLDTFQSYNIGQRRPDAVAVHDITVT 332 (334) T ss_pred EEEEeecceeeeechhhHHHHHHHHHHcCCceeccceEEEEEEeee Confidence 9999999999999999999999999999999999999999999999 No 33 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=99.95 E-value=6.4e-31 Score=186.11 Aligned_cols=222 Identities=11% Similarity=0.059 Sum_probs=178.7 Q ss_pred CCCcccCceEEeccc---cCCcccccCCC---ccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND---IGDAADVAEGG---EISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLS 74 (231) Q Consensus 1 ~~~~~~G~ti~~P~~---igda~~v~EG~---~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ 74 (231) ..+..+|++|+||.| .|+.+.+.++. ++++.++++++..+.+..++++|..+|.+...++.|||+.+.+|++.. T Consensus 47 ~~~~~gG~~v~iPf~~~L~g~~~n~~~d~~~~~~t~~kittg~~~a~v~~r~kaw~~~Dla~~lsG~dpm~~Ia~qva~y 126 (367) T protein:vir:80 47 QFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVY 126 (367) T ss_pred HHhhcCCCEEEeeeeccCCCCccccCCCCCcccccccccccchheeeeehhcccchhhhHHHHhhCchHHHHHHHHHHHH Confidence 567789999999999 58888887665 589999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhcccc------------------------------------cccccccCHHHHHHHHHHhhccCCCce Q lcl|Aclame:pro 75 LANKVDDDLLKAAKTTS------------------------------------QTVSTKANVDGVQAALDIFNDEDAQAY 118 (231) Q Consensus 75 ia~~vd~~~~~~l~t~~------------------------------------~~~~~~~~~d~i~da~~~l~~~~~~~~ 118 (231) |.+..++.+++.|++.. ......++.+.+++|..+|||...+.. T Consensus 127 W~r~~q~~Lla~L~Gvf~~~~a~~~~~~~~~~~~~a~~~~~~~~~~~Dis~~t~~~~~~~s~~~~~~A~~~lGD~~~~l~ 206 (367) T protein:vir:80 127 WTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIA 206 (367) T ss_pred hhhhhHHHHHHHHHHhhccccccchhhhhhhhccccccccccCceeeeeeccCCCccceecHHHHHHHHHHhcccccccc Confidence 99999999988765211 011234678999999999999999999 Q ss_pred EEEECHHHHHHHHhhhhhhhccccccCceeeeccceeecceeEEEcCCCcc----CceEEEEEecCCceEEEeecCC-cc Q lcl|Aclame:pro 119 VLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAE----GSALMFKIVSNSPALKLVLKRG-VQ 193 (231) Q Consensus 119 v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~----~~~~~~~~~~~~~A~~~~~k~~-v~ 193 (231) .++|||.++..|++.. ....-++... +..|+++.|.+|+++|.||- +...+..+++++||+++....+ +. T Consensus 207 ~i~mHS~V~~~L~~~~-li~~i~~sd~----~~~i~ty~G~~VIvDD~~Pv~~~~a~~~yttYlfg~GAi~~~~~~~~~~ 281 (367) T protein:vir:80 207 AIAVHSMVYKRMTNND-EIEFIPDSKG----QLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVP 281 (367) T ss_pred EEEEchHHHHHHHhcc-ccccccCCCC----ccccceecceeEEEeCCCcccccCCCceEEEEEEecceeeecccCCccc Confidence 9999999999999974 3333333222 34799999999999999994 3557788899999999887665 55 Q ss_pred ceeccchhh----cccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 194 VETDRDIVT----KTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 194 vE~~Rd~~~----~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) +|++||+++ +.|.+..++||..| |.++- .+.+.| T Consensus 282 ~E~~Rd~~~~~~gG~d~L~~Rr~~~~h---P~G~s-~~~~~v 319 (367) T protein:vir:80 282 VAVGRRELRGNGSGLEYILERKEWIVH---PGGFN-WLDADV 319 (367) T ss_pred eecccchhhhcCCceEEEEeeeeEEee---cceee-eccccc Confidence 799999987 46899999986544 44432 222222 No 34 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=99.95 E-value=2.5e-31 Score=188.39 Aligned_cols=229 Identities=16% Similarity=0.155 Sum_probs=183.5 Q ss_pred CCCcccCceEEeccccCCc--ccccCCCccC--ccccccceeEEEeehccce-eeecHHHHHhcCCCHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGDA--ADVAEGGEIS--LDKIGTTTKSVTIKKAAKG-TEITDEAALSGYGDPIGESNKQLGLSL 75 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igda--~~v~EG~~i~--~~~lt~~~~~~tikk~g~~-~~itD~~~~~~~~d~~~~~~~~~a~~i 75 (231) .--+++|++++||. +|.. ..+.+|++++ .+.+..++.+++|++..++ +.|.|.+..++..|+.++..++++++| T Consensus 51 ~rti~~G~sv~~~~-iG~~~~~~~~~G~~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~rs~~~~~~g~AL 129 (347) T protein:vir:94 51 VRSIQSGKSAQFPV-LGRTKAAYLQPGENLDDKRKDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESL 129 (347) T ss_pred heeccccceEEeee-ccceeEeeeecCcCCCCCcCCccccceEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHH Confidence 22356899999975 6764 5789999995 3579999999999999885 899999999999999999999999999 Q ss_pred HHHHHHHHHHHhcccc--------------------------ccc----ccccCHHHHHHHHHHhhccC--CCceEEEEC Q lcl|Aclame:pro 76 ANKVDDDLLKAAKTTS--------------------------QTV----STKANVDGVQAALDIFNDED--AQAYVLIVN 123 (231) Q Consensus 76 a~~vd~~~~~~l~t~~--------------------------~~~----~~~~~~d~i~da~~~l~~~~--~~~~v~vv~ 123 (231) |++.|+.++..+.... ... .....|+.|.+|.+.|.+.+ ..++|+|++ T Consensus 130 A~~~D~~i~~~l~~~a~~~~~~~~~~~g~~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~R~~vv~ 209 (347) T protein:vir:94 130 AMAADGAVLAEMAKLCNLPTANNENIAGLGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSDRVFYTT 209 (347) T ss_pred HHHHHHHHHHHHHHhhccccccccccccCCcceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCCCCCEEEeC Confidence 9999998875442100 000 01123788999999998776 468999999 Q ss_pred HHHHHHHHhhhhhhhccccccCceeeeccceeecceeEEEcCCCccCc-------------------------eEE---- Q lcl|Aclame:pro 124 PKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGS-------------------------ALM---- 174 (231) Q Consensus 124 p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~-------------------------~~~---- 174 (231) |++|+.|++..++... .....+.+.+|.+++++|++|+.||++|.+. .|- T Consensus 210 P~~y~~LLk~~~~~~~-~~~~~~~~~~G~V~~v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~ 288 (347) T protein:vir:94 210 PDNYSAILAALMPNAA-NYQALIDPSTGSIRNVMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALD 288 (347) T ss_pred hHHHHHHHHhhccccc-ccccccccccceeEEeeceEEEEcCccccccCccccccccccccccccccccccccccccccc Confidence 9999999986554433 3334456789999999999999999998432 110 Q ss_pred --EEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 175 --FKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 175 --~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) +-+++.+.|++.+...++++|.+|++.++.|.|.+.+.||+++++|++++.+.++.- T Consensus 289 ~~~~l~~~~~A~~tv~~~~~~~e~~~~~~~~~~~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 289 NVVGLFNHRSAVGTVKLKDMALERARRANFQADQIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred ceEEEEechhhhhhhhhcccceeeeechhhhhhhhhhhhhhcCcccccceeEEEEecCC Confidence 124556778888889999999999999999999999999999999999998887777 No 35 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=99.95 E-value=3.9e-31 Score=187.29 Aligned_cols=228 Identities=18% Similarity=0.167 Sum_probs=182.2 Q ss_pred CCCcccCceEEeccccCCc--ccccCCCccCc--cccccceeEEEeehccc-eeeecHHHHHhcCCCHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGDA--ADVAEGGEISL--DKIGTTTKSVTIKKAAK-GTEITDEAALSGYGDPIGESNKQLGLSL 75 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igda--~~v~EG~~i~~--~~lt~~~~~~tikk~g~-~~~itD~~~~~~~~d~~~~~~~~~a~~i 75 (231) +. +++|++++||+ +|.. ..+.+|++++. +.+..++.+++|++..+ .+.|+|.+..++..|++++..++++++| T Consensus 52 r~-i~~G~sv~~~~-iG~~~~~~~~~g~~l~~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aL 129 (347) T protein:vir:88 52 RT-IQNGKSASFPV-MGRTKGYYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEAL 129 (347) T ss_pred cc-ccCcceEEEee-ecceeeeeeccccCCCCCCCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHH Confidence 44 46899999985 6664 56899999863 57899999999999988 4899999999999999999999999999 Q ss_pred HHHHHHHHHHHhcccc------------------cc------cc-----cccCHHHHHHHHHHhhccC--CCceEEEECH Q lcl|Aclame:pro 76 ANKVDDDLLKAAKTTS------------------QT------VS-----TKANVDGVQAALDIFNDED--AQAYVLIVNP 124 (231) Q Consensus 76 a~~vd~~~~~~l~t~~------------------~~------~~-----~~~~~d~i~da~~~l~~~~--~~~~v~vv~p 124 (231) |++.|+.++..+.... .. .+ ....++.|.+|...|.+.+ ..+++++++| T Consensus 130 A~~~D~~i~~~l~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P 209 (347) T protein:vir:88 130 AIAADGAVLAEMAKLCNLPAASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAP 209 (347) T ss_pred HHHHHHHHHHHHHHhhccccccccccCCccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCH Confidence 9999998876542110 00 00 0123788999999998765 4689999999 Q ss_pred HHHHHHHhhhhhhhccccccCceeeeccceeecceeEEEcCCCccCce--------EE---------------------- Q lcl|Aclame:pro 125 KDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSA--------LM---------------------- 174 (231) Q Consensus 125 ~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~--------~~---------------------- 174 (231) +.|+.|++++.+... .+.....+.+|.+|.++|++|+.|+++|.+.. +. T Consensus 210 ~~y~~Ll~~~~~~~~-~~~~~~~~~~G~vg~i~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~ 288 (347) T protein:vir:88 210 EDYSAILSALMPNAA-NYAALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNN 288 (347) T ss_pred HHHHHHhcchhhhhh-hhccccchhcceeeeeccceEEEeecccccccccccccccccccccccccccccccccccccCc Confidence 999999998876543 34444567899999999999999999984211 00 Q ss_pred -EEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 175 -FKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 175 -~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) +-+...+.|++.+...++++|.+|+++++.|.|.+.+.||+++++|++++.|.++.- T Consensus 289 ~~~l~~~~~a~g~v~~~d~~~e~~r~~~~~~d~i~~~~~~G~~~~rPe~a~~~~~~~a 346 (347) T protein:vir:88 289 VVGLFNHRSAVGTVKLKDMALERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPA 346 (347) T ss_pred EEEEEechhhhhheecccceeeeeechhhHHHHhhhhhhhcCceeccceEEEEEeCCC Confidence 012234567777888889999999999999999999999999999999988877777 No 36 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=99.95 E-value=6.7e-31 Score=185.99 Aligned_cols=224 Identities=21% Similarity=0.261 Sum_probs=163.6 Q ss_pred CCCcccCceEEeccc-----cCCcccccCCCccCccccccc---eeEEEeehccceeeecHHHH-HhcCCCHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND-----IGDAADVAEGGEISLDKIGTT---TKSVTIKKAAKGTEITDEAA-LSGYGDPIGESNKQL 71 (231) Q Consensus 1 ~~~~~~G~ti~~P~~-----igda~~v~EG~~i~~~~lt~~---~~~~tikk~g~~~~itD~~~-~~~~~d~~~~~~~~~ 71 (231) .--+..|.+|+.++| +|++.+|+||+.||.++++.. ..+++++|++|++ |||++ +.+++||++++.+|+ T Consensus 41 ~~pla~Gt~iktyK~~~~~y~gda~dVaEGe~Iplskvt~~~~~t~~~~~kK~rK~t--TdEAIqlsGyg~aVgetd~qL 118 (303) T protein:vir:10 41 KIPMNVGSALKQYRFKVEDSEKPNGDVAEGDVIPLTKVTREQVDITELQFAKYRKST--SAEAIQAHGYDLAINQTDNEM 118 (303) T ss_pred cccccCCceeeeeeeeceeeccccccccCCcccchhhheeeecceEEEEeecccccc--cHHHHHhhcCCchhHHHHHHH Confidence 334457999876653 799999999999999999965 6899999999977 99997 899999999999999 Q ss_pred HHHHHHHHHHHHHHHhcccccc----cccccCHHHHHHHHHHhh------ccCCCceEEEECHHHHHHHHhhhhhhhccc Q lcl|Aclame:pro 72 GLSLANKVDDDLLKAAKTTSQT----VSTKANVDGVQAALDIFN------DEDAQAYVLIVNPKDAAKIRKDANAKNIGS 141 (231) Q Consensus 72 a~~ia~~vd~~~~~~l~t~~~~----~~~~~~~d~i~da~~~l~------~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~ 141 (231) .++|++++|+++++.|++++++ .+.+.+++.|.+|++.+. ++++.+.|+||||.++++||++......++ T Consensus 119 ~~~Iq~kIdnd~~~~lktaT~t~~~t~~t~~s~~glq~Al~~~~~kl~~~~ed~~~~V~FvNP~Daa~yl~~A~i~~~~t 198 (303) T protein:vir:10 119 IKYVQKKFRAKFFETLKSAIENGKRTNKTKLSAENLQGALSKGRANLSVLLDDEITPIAFVNPNDTAEYLANGFINSTGA 198 (303) T ss_pred HHHHHhhhhHHHHHHHhhcccccccccceeecHHHHHHHHHhhhhhccccccccccEEEEEchHHHHHHhhcCCcchhhh Confidence 9999999999999999998865 445678999999998774 345567799999999999999987765557 Q ss_pred cccCceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeec---CCccceecc--ch----hhcccEEEEEEE Q lcl|Aclame:pro 142 EVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLK---RGVQVETDR--DI----VTKTTVITADEH 212 (231) Q Consensus 142 ~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k---~~v~vE~~R--d~----~~~~~~i~~~~~ 212 (231) +.|.+.+.| ++|+.||+|+++|+|+.+.....+-..|+.-+.. +....-+|- .+ ....+-.+.... T Consensus 199 ~fG~n~L~n-----fLG~~II~S~kv~~G~~~~T~~~Ni~~ay~~~~g~l~~~f~~t~D~tglIGv~h~~~~~~~t~eT~ 273 (303) T protein:vir:10 199 QFGVNLLTP-----YVGVKIVEFADVPQGEVWMTVAENLNVAYANPRGELSRAFAFATDATGFVGVLHDIQPQRLTSDTI 273 (303) T ss_pred hhhhhhhhh-----hhcceEEEeccCCCceEEEeeccceEEEEecCchhhhhhhhhccccccceEEEeccccceeeehhH Confidence 778888765 9999999999999999987553322222222210 111111110 00 000111111111 Q ss_pred -E-EEEEE--cCCcEEEEEeccC Q lcl|Aclame:pro 213 -Y-AAYLY--DLTKVVNITFTGV 231 (231) Q Consensus 213 -y-~~~~~--~~~~vv~l~~~~~ 231 (231) + |..+. .+++||+.++++- T Consensus 274 ~~~~~~lfpE~~dgiv~~ti~~~ 296 (303) T protein:vir:10 274 YASAISMFPENIDAVIKVTIKKD 296 (303) T ss_pred hHhHHHhcccccceEEEEEEecc Confidence 1 11111 5678999999776 No 37 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=99.94 E-value=8e-30 Score=180.12 Aligned_cols=229 Identities=15% Similarity=0.164 Sum_probs=183.3 Q ss_pred CCCcccCceEEeccccCCc--ccccCCCccCcc---ccccceeEEEeehccce-eeecHHHHHhcCCCHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGDA--ADVAEGGEISLD---KIGTTTKSVTIKKAAKG-TEITDEAALSGYGDPIGESNKQLGLS 74 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igda--~~v~EG~~i~~~---~lt~~~~~~tikk~g~~-~~itD~~~~~~~~d~~~~~~~~~a~~ 74 (231) .--+.+|++++||. +|.. ..+..|++|... ++..++++++|++..++ +.|.|.+..++..|++++..++++++ T Consensus 54 ~rti~~Gksv~f~~-iG~~t~~~~t~G~~i~~~~~~d~~~te~~l~ID~~~y~~~~VdDiD~aqa~~Dlr~e~s~~~G~a 132 (375) T protein:vir:10 54 KRTLKNGKSLQFIY-TGRMTSSFHTPGTPILGNADKAPPVAEKTIVMDDLLISSAFVYDLDETLAHYELRGEISKKIGYA 132 (375) T ss_pred ccccccCceEEEEe-eeeeEEeeecCCcCcCCccccCCCCCceEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHH Confidence 44566899999965 6765 479999998533 66788899999998774 99999999999999999999999999 Q ss_pred HHHHHHHHHHHHhcccc-----------------------cc-----cccccCHHHHHHHHHHhhccC--CCceEEEECH Q lcl|Aclame:pro 75 LANKVDDDLLKAAKTTS-----------------------QT-----VSTKANVDGVQAALDIFNDED--AQAYVLIVNP 124 (231) Q Consensus 75 ia~~vd~~~~~~l~t~~-----------------------~~-----~~~~~~~d~i~da~~~l~~~~--~~~~v~vv~p 124 (231) ||++.|+.++..+..+. +. .+....|+.|.++...|.+.+ ...+|++++| T Consensus 133 LA~~~D~~i~~~l~kaa~~~~p~~~~~~~~~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P 212 (375) T protein:vir:10 133 LAEKYDRLIFRSITRGARSASPVSATNFVEPGGTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEKGVSSQGRCAVLNP 212 (375) T ss_pred HHHHHHHHHHHHHHHhhhhccccccccccccCcceeeeccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCh Confidence 99999998876553110 00 122345888999999998776 4689999999 Q ss_pred HHHHHHHhhh---hhhhccccccCceeeeccceeecceeEEEcCCCccCceE---------------------------- Q lcl|Aclame:pro 125 KDAAKIRKDA---NAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSAL---------------------------- 173 (231) Q Consensus 125 ~~~~~L~k~~---~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~---------------------------- 173 (231) ++|+.|+++. .+.+.. ..++....+|.+++++|++|+.||++|...+. T Consensus 213 ~~y~~Ll~~~d~~~~~n~d-~~~~~~~~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~ 291 (375) T protein:vir:10 213 RQYYALIQDIGSNGLVNRD-VQGSALQSGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENAN 291 (375) T ss_pred HHHHHHHhcCCccceeeec-ccccceeccceEEEEeceEEEEeccccccccccccccccccccchhhhhccccccCCcce Confidence 9999999863 343332 34666778889999999999999999954321 Q ss_pred ------------------EEEEecCCceEEEeecCCccceec---cchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 174 ------------------MFKIVSNSPALKLVLKRGVQVETD---RDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 174 ------------------~~~~~~~~~A~~~~~k~~v~vE~~---Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) .+-.++.+.|++.+.-.++++|.. |++.++.|.|.+.+-||...+||++++.|+..|+ T Consensus 292 ~~~g~~~~y~~d~~~~~~~~~~~~~~~A~g~v~~~~~~~~~~~~~~~~~~q~~~i~~~~a~G~~~lrp~~av~l~~~~~ 370 (375) T protein:vir:10 292 ATGGVNNDYGTNAELGAKSCGLIFQKEAAGVVEAIGPQVQVTNGDVSVIYQGDVILGRMAMGADYLNPAAAVELYIGAT 370 (375) T ss_pred eeccccccccccccccCceEEEEEchhheeeeeeeccccccccchhhheeeeeeeeeeeeeccCccCceeEEEEecCcC Confidence 111344677888888888888854 7999999999999999999999999999999988 No 38 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=99.94 E-value=1.1e-30 Score=184.87 Aligned_cols=230 Identities=14% Similarity=0.160 Sum_probs=179.6 Q ss_pred CCCcccCceEEeccc-cCCcccccCCCccCccccccceeEEEeehccc-eeeecHHHHHhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND-IGDAADVAEGGEISLDKIGTTTKSVTIKKAAK-GTEITDEAALSGYGDPIGESNKQLGLSLANK 78 (231) Q Consensus 1 ~~~~~~G~ti~~P~~-igda~~v~EG~~i~~~~lt~~~~~~tikk~g~-~~~itD~~~~~~~~d~~~~~~~~~a~~ia~~ 78 (231) +.-...||||+||++ ...+.++.+|..+++++++.++.+++|++... .+.|+|++..+...|+.+++.++++.+||++ T Consensus 50 ~~~~~~GdTV~ip~~g~~~a~d~~~g~~i~~~~~~~~~~~itID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~ 129 (381) T protein:vir:80 50 PFEGKKGDLIHIPNISRAAVYDKQPQTPVNLQARTDSEFTFTVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARD 129 (381) T ss_pred cceeecCceEEeeccCcceeeeecCCCcccccccCCceEEEEEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHH Confidence 333456999999985 33456899999999999999999999988765 5999999999999999999999999999999 Q ss_pred HHHHHHHHhccccc-----------------------ccccccCHHHHHHHHHHhhccC--CCceEEEECHHHHHHHHhh Q lcl|Aclame:pro 79 VDDDLLKAAKTTSQ-----------------------TVSTKANVDGVQAALDIFNDED--AQAYVLIVNPKDAAKIRKD 133 (231) Q Consensus 79 vd~~~~~~l~t~~~-----------------------~~~~~~~~d~i~da~~~l~~~~--~~~~v~vv~p~~~~~L~k~ 133 (231) +|+.++..+..... ..+..++++.|.+|...|++.+ .++++++++|+.++.|+++ T Consensus 130 ~D~~i~~~~~~~~~~~~~~~~t~~~~i~~~~~~~~~t~~~~~~t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~ 209 (381) T protein:vir:80 130 MDNFALAHRAVINAFPSQRIYSYDTTLGDGTVNAHLTGTPAPLTYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLSI 209 (381) T ss_pred HHHHHHHHHhhcccccccccccccccccccccccccccchhhHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhhc Confidence 99998876532110 0112347899999999998876 4679999999999999999 Q ss_pred hhhhhccccccCceeeeccceeecceeEEEcCCCccCceEEEEEe----------------------------------- Q lcl|Aclame:pro 134 ANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIV----------------------------------- 178 (231) Q Consensus 134 ~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~----------------------------------- 178 (231) ++|.... +.+.+.+++|.+|+++|++|+.|+++|.+.+...... T Consensus 210 ~~~~~ad-~~~~~~l~~G~Ig~i~G~~Vv~Sn~lp~~~~t~~~~~agap~~~~~~~~~~~~~g~~s~~a~av~~~k~yd~ 288 (381) T protein:vir:80 210 NQFISVD-FSQVKPVTSGVVGTILGMEVIVTTQIGINSLTGYVNGQGAPTQPTPGVLGSPYLPDQAGTANVVNTGSASDL 288 (381) T ss_pred hhhhhhh-hccchhhhceeeeEEcceEEEeecccccccccceeeeccccccccccccccccccccccceeeeeeeeeece Confidence 9887654 4556679999999999999999999986433211110 Q ss_pred -------------------------------cCCceEEEe---------ecCCccceeccchhhcccEEEEEEEEEEEEE Q lcl|Aclame:pro 179 -------------------------------SNSPALKLV---------LKRGVQVETDRDIVTKTTVITADEHYAAYLY 218 (231) Q Consensus 179 -------------------------------~~~~A~~~~---------~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~ 218 (231) +.+++.+.+ .+..++-|.+|...-+.|+|.++..||++++ T Consensus 289 ~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 368 (381) T protein:vir:80 289 AVSLSYFGLPVFSGAGATAADGGQTLGSFGGANRWATAVVCHPDWLAVGVQQNVKSESSRETMYLADAFVTSCVYGAKVF 368 (381) T ss_pred eeeeeeccceeeecceeeecCCCceeeeehhhhhhhhhcccccccccccceeEeecccchhheeehhhhhhhhhhccccc Confidence 011222222 1122333567777788889999999999999 Q ss_pred cCCcEEEEEeccC Q lcl|Aclame:pro 219 DLTKVVNITFTGV 231 (231) Q Consensus 219 ~~~~vv~l~~~~~ 231 (231) ||..+|-|.-.|. T Consensus 369 ~~~~~~~~~~~~~ 381 (381) T protein:vir:80 369 RPDHCVLLHTSGI 381 (381) T ss_pred cchhhhhhhhcCC Confidence 9999999999999 No 39 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=99.94 E-value=1.9e-29 Score=177.99 Aligned_cols=230 Identities=10% Similarity=0.020 Sum_probs=187.2 Q ss_pred CCCcccCceEEeccccCC--cccccCCCccCccccccceeEEEeehccce-eeecHHHHHhcCCC-HHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGD--AADVAEGGEISLDKIGTTTKSVTIKKAAKG-TEITDEAALSGYGD-PIGESNKQLGLSLA 76 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igd--a~~v~EG~~i~~~~lt~~~~~~tikk~g~~-~~itD~~~~~~~~d-~~~~~~~~~a~~ia 76 (231) .--+.+|+|++||. +|. +..+..|+++.++.+..++.+++|++.-.+ +.|.|.+..++..| +-.+..+++|++|| T Consensus 46 ~rti~~gkS~q~~~-iG~~~~~~~~~G~~ld~~~~~~~k~~itID~ll~a~~~V~diDe~q~~~D~vR~e~s~e~G~ALA 124 (364) T protein:vir:10 46 VQEVVGTNSVSNKY-IGETELQVLSPGKSPDASPTEFDKNRLVVDTTVIARNTVAHFHDVQNDIDGLKSKLSVNQAKKLK 124 (364) T ss_pred eeeecccceEEeee-eeeeEEeeeccCcccCCCCcccCcEEEEecceeeechhhhhHHHHhcCccchhHHHHHHHHHHHH Confidence 44578999999976 565 457999999999999999999999998875 88999999999999 78899999999999 Q ss_pred HHHHHHHHHHhccc--------c---------------cccc-cc----cCHHHHHHHHHHhhccC--CCceEEEECHHH Q lcl|Aclame:pro 77 NKVDDDLLKAAKTT--------S---------------QTVS-TK----ANVDGVQAALDIFNDED--AQAYVLIVNPKD 126 (231) Q Consensus 77 ~~vd~~~~~~l~t~--------~---------------~~~~-~~----~~~d~i~da~~~l~~~~--~~~~v~vv~p~~ 126 (231) +..|+.++..+..+ . ...+ .. ..+++|.+|.+.|.+.+ ...++++++|++ T Consensus 125 ~~~Dq~i~~~v~~aa~a~~~~~~~~~~~~~~g~~i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~~R~~vv~P~~ 204 (364) T protein:vir:10 125 KMEDSMVIQQLVLGGISNTEAIRKNPRVAGHGFSIHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDTSELCGLMPWTA 204 (364) T ss_pred HHHHHHHHHHHHhhhhhcccccccCCcccCCcceeeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCccccEEEeChHH Confidence 99999886543211 0 0000 11 12566778889998776 477999999999 Q ss_pred HHHHHhhhhhhhcc-ccccCceeeeccceeecceeEEEcCCCccC---------------------ce--------EEEE Q lcl|Aclame:pro 127 AAKIRKDANAKNIG-SEVGANALINGTYADVLGAQIVRSKKLAEG---------------------SA--------LMFK 176 (231) Q Consensus 127 ~~~L~k~~~~~~~~-~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~---------------------~~--------~~~~ 176 (231) |+.|++++++.++. ...+.+...+|.++++.|+||+.||++|.. .. ...- T Consensus 205 y~~Ll~~~~lvn~d~~~~~~~~~~~G~v~~v~Gv~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~ 284 (364) T protein:vir:10 205 FNCLRDADRIVDKSYTIAASDNTVDGFVLKSWNTPIVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQA 284 (364) T ss_pred HHHHhcCCccccccccccCCCccccceeEEEeceEEEeccccccccccccccccccccccccccCCcccccccccceeEE Confidence 99999998877643 233455678999999999999999999841 11 1122 Q ss_pred EecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 177 IVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 177 ~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) +.+.+.|++.+...++++|.+|++.++.+.+.+.+.||+++++|++++.++..+- T Consensus 285 ~~f~~~Al~tv~~~~~t~e~~~~~~~~~~~ida~~a~G~g~lRPeaa~~i~~~~~ 339 (364) T protein:vir:10 285 VLFTQDALLVGRTISITGDIFYEKKEKTWYIDTFLAEGAIPDRWEAVAVVTAADT 339 (364) T ss_pred EEEecceEEEEEEecceeeeeeccceeeeeeeeehcccCcccCccceEEEEecCC Confidence 4556789999999999999999999999999999999999999999999988887 No 40 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=99.93 E-value=8.3e-29 Score=174.52 Aligned_cols=230 Identities=15% Similarity=0.141 Sum_probs=184.8 Q ss_pred CCCcccCceEEeccccCC--cccccCCCccCccccccceeEEEeehccce-eeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGD--AADVAEGGEISLDKIGTTTKSVTIKKAAKG-TEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igd--a~~v~EG~~i~~~~lt~~~~~~tikk~g~~-~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) .--+.+|++++||. +|. +..+.+|+++..+....++.+++|+..-.. +.|.|.+..++..|+..+..+++++++|+ T Consensus 46 ~rti~~g~s~~~~~-iG~~~~~~~~pG~~l~~~~~~~~k~~itVD~ll~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~ 124 (335) T protein:vir:63 46 IRDLRGSNVVRLDR-LGNVEAKGRRAGEELERSRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELAR 124 (335) T ss_pred eeeeccceeEEEee-eeeeeeecccCCcCcCCCCccccceEEEecceeechhhhhhHHHHhcCchhHHHHHHHHHHHHHH Confidence 22357899999986 565 557999999998889999999999997764 78999999999999999999999999999 Q ss_pred HHHHHHHHHhccccc-------------------cccc---ccCHHH----HHHHHHHhhccCC-----CceEEEECHHH Q lcl|Aclame:pro 78 KVDDDLLKAAKTTSQ-------------------TVST---KANVDG----VQAALDIFNDEDA-----QAYVLIVNPKD 126 (231) Q Consensus 78 ~vd~~~~~~l~t~~~-------------------~~~~---~~~~d~----i~da~~~l~~~~~-----~~~v~vv~p~~ 126 (231) ..|+.++..+-.+.. ..++ ...++. +.+|.+.|.+.+. ..++++|+|++ T Consensus 125 ~~D~~~~~~i~~aa~~~a~~~~~~~~~~G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~ 204 (335) T protein:vir:63 125 KFDQACLIQVIKAAAMDAPVDLEDAFSPGVLEKLDLTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRV 204 (335) T ss_pred HHHHHHHHHHHhhccccCccccCCCcCCCcceeeeeccCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHH Confidence 999988754321110 0011 123444 4577788876552 34999999999 Q ss_pred HHHHHhhhhhhhcc-c-cccCceeeeccceeecceeEEEcCCCccCceE-----------------EEEEecCCceEEEe Q lcl|Aclame:pro 127 AAKIRKDANAKNIG-S-EVGANALINGTYADVLGAQIVRSKKLAEGSAL-----------------MFKIVSNSPALKLV 187 (231) Q Consensus 127 ~~~L~k~~~~~~~~-~-~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~-----------------~~~~~~~~~A~~~~ 187 (231) |+.|++++++.++. . ..+.+...+|.++.+.|+||+.||++|.+... ..-+...+.|++.+ T Consensus 205 y~~Ll~~~~l~n~~~~~s~~~~~~~~g~v~~v~Gv~V~~sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~ 284 (335) T protein:vir:63 205 FSLLLEHDKLMNVEYQATGATNDYVKSRVAILNGVKVLETPRFATKAIAAHPLGRHFNVSAEESERQIALFLPSKTLITA 284 (335) T ss_pred HHHHhccccccccccccccccccccCceeEEeeceEEEeeccCCCCCcccccccccCCccccccceeEEEEEecceEEEE Confidence 99999999887752 1 22334568899999999999999999953321 13345567899999 Q ss_pred ecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 188 LKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 188 ~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) ...+++.|.+|+..++++.|.+.+.||+++++|++++.++.+|+ T Consensus 285 ~~~~vt~e~~~~~~~~~~~i~~~~a~G~g~lRPe~a~~i~~tg~ 328 (335) T protein:vir:63 285 QVAPVQAKLWEDNEKFSWVLDTFQMYNIGARRPDTAGAIELKGI 328 (335) T ss_pred EEeecccceeeccchhhHHhHHHHHcCCcccccceEEEEEEcCC Confidence 99999999999999999999999999999999999999999999 No 41 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=99.92 E-value=1.9e-27 Score=167.08 Aligned_cols=224 Identities=20% Similarity=0.138 Sum_probs=178.5 Q ss_pred CCC--------cccCceEEeccccC---CcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHH Q lcl|Aclame:pro 1 ENG--------INLANLCEYPNDIG---DAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNK 69 (231) Q Consensus 1 ~~~--------~~~G~ti~~P~~ig---da~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~ 69 (231) ++. --.|.++++|.+.+ .+.+++||+++|..++++.+.++++++++..+.+|++...++ +++.+...+ T Consensus 131 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~-~~l~~~i~~ 209 (385) T protein:vir:19 131 RLTIRDLLAQGRTSSNALEYVREEVFTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQVMDDA-PMLQSYINN 209 (385) T ss_pred ccchhhhcceecccCcceEEEEEecCCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHHHHhhH-HHHHHHHHH Confidence 111 12356789998743 356899999999999999999999999999999999976655 679999999 Q ss_pred HHHHHHHHHHHHHHHHHhccc---------------ccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhh Q lcl|Aclame:pro 70 QLGLSLANKVDDDLLKAAKTT---------------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDA 134 (231) Q Consensus 70 ~~a~~ia~~vd~~~~~~l~t~---------------~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~ 134 (231) ++++++++++|..++.+..+. ....+..++++.|.+++..+......+..++|||.++..|++.+ T Consensus 210 ~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lk 289 (385) T protein:vir:19 210 RLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLNATGDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIALLK 289 (385) T ss_pred HHHHHHHHHHHHHHHhccCCCCcccccccccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhh Confidence 999999999999999753221 11223456899999999999888888899999999999999877 Q ss_pred hhhhccccccCceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccch----hhcccEEEEE Q lcl|Aclame:pro 135 NAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDI----VTKTTVITAD 210 (231) Q Consensus 135 ~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~----~~~~~~i~~~ 210 (231) +..++ +...+ ..+|..++++|+||++|+.+|+++.++.++ ..++.++.+.+++++..+.. .+....++.. T Consensus 290 d~~G~--~l~~~-~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~---~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~ 363 (385) T protein:vir:19 290 DNEGR--YIFGG-PQAFTSNIMWGLPVVPTKAQAAGTFTVGGF---DMASQVWDRMDATVEVSREDRDNFVKNMLTILCE 363 (385) T ss_pred cCCCc--eeccC-cccCCCceecceeeEEcCcCCCCcEEEeec---ccEEEEEEecceEEEEeccccchhhcCcEEEEEE Confidence 65443 22222 346677899999999999999998776553 44677888888888866543 3556788999 Q ss_pred EEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 211 EHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 211 ~~y~~~~~~~~~vv~l~~~~~ 231 (231) .+|++++.+|++++++++++. T Consensus 364 ~r~~~~v~~~~a~~~~~~~aa 384 (385) T protein:vir:19 364 ERLALAHYRPTAIIKGTFSSG 384 (385) T ss_pred EeeccEEecccceEEEEeccC Confidence 999999999999999999999 No 42 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=99.92 E-value=1.9e-27 Score=167.08 Aligned_cols=224 Identities=20% Similarity=0.138 Sum_probs=178.5 Q ss_pred CCC--------cccCceEEeccccC---CcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHH Q lcl|Aclame:pro 1 ENG--------INLANLCEYPNDIG---DAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNK 69 (231) Q Consensus 1 ~~~--------~~~G~ti~~P~~ig---da~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~ 69 (231) ++. --.|.++++|.+.+ .+.+++||+++|..++++.+.++++++++..+.+|++...++ +++.+...+ T Consensus 131 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~-~~l~~~i~~ 209 (385) T protein:vir:18 131 RLTIRDLLAQGRTSSNALEYVREEVFTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQVMDDA-PMLQSYINN 209 (385) T ss_pred ccchhhhcceecccCcceEEEEEecCCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHHHHhhH-HHHHHHHHH Confidence 111 12356789998743 356899999999999999999999999999999999976655 679999999 Q ss_pred HHHHHHHHHHHHHHHHHhccc---------------ccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhh Q lcl|Aclame:pro 70 QLGLSLANKVDDDLLKAAKTT---------------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDA 134 (231) Q Consensus 70 ~~a~~ia~~vd~~~~~~l~t~---------------~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~ 134 (231) ++++++++++|..++.+..+. ....+..++++.|.+++..+......+..++|||.++..|++.+ T Consensus 210 ~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lk 289 (385) T protein:vir:18 210 RLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLNATGDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIALLK 289 (385) T ss_pred HHHHHHHHHHHHHHHhccCCCCcccccccccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhh Confidence 999999999999999753221 11223456899999999999888888899999999999999877 Q ss_pred hhhhccccccCceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccch----hhcccEEEEE Q lcl|Aclame:pro 135 NAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDI----VTKTTVITAD 210 (231) Q Consensus 135 ~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~----~~~~~~i~~~ 210 (231) +..++ +...+ ..+|..++++|+||++|+.+|+++.++.++ ..++.++.+.+++++..+.. .+....++.. T Consensus 290 d~~G~--~l~~~-~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~---~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~ 363 (385) T protein:vir:18 290 DNEGR--YIFGG-PQAFTSNIMWGLPVVPTKAQAAGTFTVGGF---DMASQVWDRMDATVEVSREDRDNFVKNMLTILCE 363 (385) T ss_pred cCCCc--eeccC-cccCCCceecceeeEEcCcCCCCcEEEeec---ccEEEEEEecceEEEEeccccchhhcCcEEEEEE Confidence 65443 22222 346677899999999999999998776553 44677888888888866543 3556788999 Q ss_pred EEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 211 EHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 211 ~~y~~~~~~~~~vv~l~~~~~ 231 (231) .+|++++.+|++++++++++. T Consensus 364 ~r~~~~v~~~~a~~~~~~~aa 384 (385) T protein:vir:18 364 ERLALAHYRPTAIIKGTFSSG 384 (385) T ss_pred EeeccEEecccceEEEEeccC Confidence 999999999999999999999 No 43 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=99.92 E-value=1.8e-27 Score=167.18 Aligned_cols=226 Identities=12% Similarity=0.064 Sum_probs=177.4 Q ss_pred CCCcccCceEEeccccC----CcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIG----DAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLA 76 (231) Q Consensus 1 ~~~~~~G~ti~~P~~ig----da~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia 76 (231) --....+.++++|..+| .+.+++||+.+|..++++++.++.+++++..+.||++...++ +++.+...++++++++ T Consensus 142 ~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~D~-~~l~~~i~~~la~~~~ 220 (379) T protein:vir:10 142 GAVSISGGTYTFVRENGAGEGAIGAQVEGATKGQKDYDISMIDVNTDFIAGFTRYSKKMANNL-PFLTSFIPNALRRDYA 220 (379) T ss_pred eeeeccCCceEEEEeecCCCcccccccCCccccccccceeeeEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHH Confidence 12223466789997643 345799999999999999999999999999999999987765 5788999999999999 Q ss_pred HHHHHHHHHHhcccc----cccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccccccCceeeecc Q lcl|Aclame:pro 77 NKVDDDLLKAAKTTS----QTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGT 152 (231) Q Consensus 77 ~~vd~~~~~~l~t~~----~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ 152 (231) .++|..++..+.+.. ...+...+++.|.+++..+......+..|+|||.++..|++.++..+.+-...+-...+|. T Consensus 221 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~d~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~ 300 (379) T protein:vir:10 221 KAENAAFNAVLAANATASTEIITNKNKVEMLINEIAKQENLDFPVTAIVLRPTDYYDILVTQKSVGAGYGLPGVVTQDNG 300 (379) T ss_pred HHHHHHHhcccccccccccccccCcccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCceeccCCccCCCCC Confidence 999999887765432 2344566799999999999888888899999999999999887655443222122223455 Q ss_pred ceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccch----hhcccEEEEEEEEEEEEEcCCcEEEEEe Q lcl|Aclame:pro 153 YADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDI----VTKTTVITADEHYAAYLYDLTKVVNITF 228 (231) Q Consensus 153 ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~----~~~~~~i~~~~~y~~~~~~~~~vv~l~~ 228 (231) ..+++|+||++|+.||+++.+..++ ..+ .+...+++.++..++. .+....+++..|+++++.+|+++|++++ T Consensus 301 ~~~l~G~pvv~s~~~~ag~~~~gdf--~~~--~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~R~~~~v~~p~a~v~~~~ 376 (379) T protein:vir:10 301 VLRINGIPLFRATWLAANKYYVGDW--TRV--TKVTTEGLSLEFSEVEGTNFVKNNITARIEAQVALAVEQPAALIFGDF 376 (379) T ss_pred cceecceeeEecCCCCCCceEEeec--ccE--EEEEEeceEEEEeecccccccCCcEEEEEEEEeccEEecCccEEEEEe Confidence 5689999999999999998765553 233 3445667788877654 4566788899999999999999999999 Q ss_pred ccC Q lcl|Aclame:pro 229 TGV 231 (231) Q Consensus 229 ~~~ 231 (231) +|| T Consensus 377 ~~~ 379 (379) T protein:vir:10 377 TAV 379 (379) T ss_pred cCC Confidence 999 No 44 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=99.92 E-value=2.9e-28 Score=171.55 Aligned_cols=230 Identities=13% Similarity=0.125 Sum_probs=183.2 Q ss_pred CCCcccCceEEeccccCCc--ccccCCCccCccccccceeEEEeehccce-eeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGDA--ADVAEGGEISLDKIGTTTKSVTIKKAAKG-TEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igda--~~v~EG~~i~~~~lt~~~~~~tikk~g~~-~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) .--+++|++++||. +|.. ..+..|+++..+.+..++.+++|+..-.. +.|.|.+..++..|+..+..+++|+++|+ T Consensus 46 ~rti~~g~s~~~~~-iG~~~~~~~~pG~~l~~~~~~~~k~~itID~ll~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~ 124 (335) T protein:vir:78 46 IRDLRGSNVVRLDR-LGNVEAKGRRAGEELERSRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELAR 124 (335) T ss_pred eeeeccceeEEEee-eeeeeecccccCcccCCCCcccCCeEEEecceeechhhHhhHHHhhcCchhHHHHHHHHHHHHHH Confidence 23468899999984 6764 57999999999999999999999997764 78999999999999999999999999999 Q ss_pred HHHHHHHHHhcccccc----------------------cccccC----HHHHHHHHHHhhccCC-----CceEEEECHHH Q lcl|Aclame:pro 78 KVDDDLLKAAKTTSQT----------------------VSTKAN----VDGVQAALDIFNDEDA-----QAYVLIVNPKD 126 (231) Q Consensus 78 ~vd~~~~~~l~t~~~~----------------------~~~~~~----~d~i~da~~~l~~~~~-----~~~v~vv~p~~ 126 (231) ..|+.++..+..+... .+.... ++++.+|.+.|.+.+. ..++++|+|++ T Consensus 125 ~~Dq~~~~~l~~aa~~~a~~~~~~~~~~G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~ 204 (335) T protein:vir:78 125 KFDQACLIQVIKAAAMDAPVDLEDAFSPGVLEKLDLTGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRV 204 (335) T ss_pred HHHHHHHHHHHhhcccccccccCCCcCCCcceeeeeccccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHH Confidence 9999887544221100 001112 3445556666765432 35899999999 Q ss_pred HHHHHhhhhhhhcc-c-cccCceeeeccceeecceeEEEcCCCccCceE-----------------EEEEecCCceEEEe Q lcl|Aclame:pro 127 AAKIRKDANAKNIG-S-EVGANALINGTYADVLGAQIVRSKKLAEGSAL-----------------MFKIVSNSPALKLV 187 (231) Q Consensus 127 ~~~L~k~~~~~~~~-~-~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~-----------------~~~~~~~~~A~~~~ 187 (231) |+.|++++++.++. . ..+.+...+|.++.+.|+||+.||++|.+... .+-+...+.|++.+ T Consensus 205 y~~Ll~~~~l~n~~~~~s~~~~~~~~g~v~~v~Gv~V~~Sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~ 284 (335) T protein:vir:78 205 FSLLLEHDKLMSVEYQATGATNDYVKSRVAILNGVKVLETPRFATKAISAHPLGRHFNVSAEEAERQIALFLPSKTLITA 284 (335) T ss_pred HHHHhcccccccccccccccccccccceeEEeeceEEEeeccCCCCCCccccccccCCcccccccceEEEEEecceEEEE Confidence 99999999887753 1 22334578899999999999999999954211 12234567899999 Q ss_pred ecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 188 LKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 188 ~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) ...++..|.+|+..++++.|.+.+.||+++++|++++.++.+|. T Consensus 285 ~~~~~~~e~~~~~~~~~~~i~~~~a~G~g~lRPe~a~~i~~tg~ 328 (335) T protein:vir:78 285 QVAPVQAKLWEDHDQFSWVLDTFQMYNIGARRPDTAGAIELKGI 328 (335) T ss_pred EEEecccceeeccchhhHhhhHHHHcCCcccCcceEEEEEecCC Confidence 99999999999999999999999999999999999999999999 No 45 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=99.92 E-value=1.3e-28 Score=173.45 Aligned_cols=224 Identities=13% Similarity=0.102 Sum_probs=168.8 Q ss_pred CCCcccCceEEeccccCCcc--cccCCCccCccccccceeEEEeehccc-eeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGDAA--DVAEGGEISLDKIGTTTKSVTIKKAAK-GTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igda~--~v~EG~~i~~~~lt~~~~~~tikk~g~-~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) .+..--||||+||. ||+++ ++.+++.++++.++..+.+++|+|..+ +|.|+| +..|...|+++.+++++++++++ T Consensus 39 ~~d~g~GDtV~Ins-Ig~~tV~dY~~~~~i~~d~ltt~~~~l~IDq~KYfaf~VdD-D~~Qa~~dl~~~~~~~aa~ala~ 116 (322) T protein:vir:31 39 VVDFPDGDKLTIPS-VGTPVVRSRPEQGDFTFDNLDTGEISIILRDEVYAGNAISK-KLRQDSRWISNVGAMLPAEQARA 116 (322) T ss_pred ccccCCCCeEEecc-ccccccccccCCCCcccccCCCceEEEEEehhhhhccccch-hHHHhhhhHHHHHHHHHHHHHHH Confidence 23333699999966 89876 799999999999999999999999888 599999 88999999999999999999999 Q ss_pred HHHHHHHHHhcccc---------cc------------cccccCHHHHHHHHHHhhccCC--CceEEEECHHHHH------ Q lcl|Aclame:pro 78 KVDDDLLKAAKTTS---------QT------------VSTKANVDGVQAALDIFNDEDA--QAYVLIVNPKDAA------ 128 (231) Q Consensus 78 ~vd~~~~~~l~t~~---------~~------------~~~~~~~d~i~da~~~l~~~~~--~~~v~vv~p~~~~------ 128 (231) .+|+.....|++.. .. ......|+.|+++..+|++.+. ..+|+||+|..+. T Consensus 117 ~~D~fva~lL~~gA~~~~~~~~p~vin~~~~~iv~~gt~~~~ay~~lv~l~~kLdkanVP~~gR~vVV~P~~~~~L~~i~ 196 (322) T protein:vir:31 117 IMERYQTDLLALGNAQFAGQNDPNVINGVPHRFVGTGTDQTMDVTDFSRVNYVMTQSKMPMGGMIGIIDPSVAHHLETIT 196 (322) T ss_pred HHHHHHHHHHHHHhhhhhccCCcceecCCccceeccCCCchhhHHHHHHHHHHhccccCCCCCeEEEeCchhhhhhhhhh Confidence 99997755444321 00 1123579999999999988764 6899999999865 Q ss_pred ---HHHhhhhhhhccccccCceeeecc--ceeecceeEEEcCCCccCceE--------------E--EEEecCC---ceE Q lcl|Aclame:pro 129 ---KIRKDANAKNIGSEVGANALINGT--YADVLGAQIVRSKKLAEGSAL--------------M--FKIVSNS---PAL 184 (231) Q Consensus 129 ---~L~k~~~~~~~~~~~~~~~~~~G~--ig~~~G~~Vv~s~~~~~~~~~--------------~--~~~~~~~---~A~ 184 (231) .|++|++|....... ...|. +|+++|++|++||.+++++.- . +..+... ..+ T Consensus 197 ~~~~l~~D~rf~~i~~sG----~a~g~~~Vg~~~GF~V~~SN~l~~~~~~i~aG~d~~~t~ag~~n~f~~~~~~~~~~~~ 272 (322) T protein:vir:31 197 NISNISNNPRWEGIVESG----IAPDMQFVRSVYGIDLFVSNLLADANETINAGGDARSTTAGKCNMFMNVSDMGLLPFV 272 (322) T ss_pred hhhhhhcccccccccccc----chhhHHHHHHHhceeeeeeccccccccccccCcccccccceeecccccccchhhhhhh Confidence 457777775432211 12232 899999999999998754411 0 0000001 111 Q ss_pred EEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 185 KLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 185 ~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) +.+ |+=.+.|..|+.+++.|.++++.+||.++.+|+.++++.-.+- T Consensus 273 ~~~-~~l~~~e~~r~~~~~~d~~~~~~~~g~g~~r~e~l~~~~a~~~ 318 (322) T protein:vir:31 273 VAW-KEMPTTKSFIDDYNDDLNTATTARWGNGLVRDENLVCVLANAD 318 (322) T ss_pred hHh-hhhhhhhcccCccccccceeeeeeecceeecccceEEEEeccc Confidence 112 2224679999999999999999999999999999999877666 No 46 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=99.92 E-value=3.1e-27 Score=165.92 Aligned_cols=226 Identities=21% Similarity=0.206 Sum_probs=176.6 Q ss_pred CCCcccCceEEecccc----------CCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDI----------GDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQ 70 (231) Q Consensus 1 ~~~~~~G~ti~~P~~i----------gda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~ 70 (231) ..-.-.+.++++|... +.+.+++||+.++..++++++.++++++++..+.||++.+.++ +++.+...++ T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~-~~l~~~i~~~ 237 (419) T protein:vir:94 159 DQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGR 237 (419) T ss_pred eeeeccCCceeeeeeccccccccccCcccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHhH-HHHHHHHHHH Confidence 1111244567777532 2366899999999999999999999999999999999988766 6789999999 Q ss_pred HHHHHHHHHHHHHHHHhccc-------------------ccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHH Q lcl|Aclame:pro 71 LGLSLANKVDDDLLKAAKTT-------------------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIR 131 (231) Q Consensus 71 ~a~~ia~~vd~~~~~~l~t~-------------------~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~ 131 (231) ++++++.++|..++.+-.+. ....+....+++|.+++..+......+.+++|||.++..|+ T Consensus 238 la~a~~~~~d~aii~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~t~~~~~~~l~~~~~~~~~~~~~~~~~v~n~~~~~~l~ 317 (419) T protein:vir:94 238 LTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIE 317 (419) T ss_pred HHHHHHHHHHHHHHhccCcccccceecccccccccccccccccccchhHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHH Confidence 99999999999998642110 11123345689999999999888888889999999999999 Q ss_pred hhhhhhhccccccCceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccch----hhcccEE Q lcl|Aclame:pro 132 KDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDI----VTKTTVI 207 (231) Q Consensus 132 k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~----~~~~~~i 207 (231) +..+..+.. +.....+.+|..++++|+||++++++|+++.++.++ ..++.++.+.+++++.++.. .+....+ T Consensus 318 ~~k~~~~~~-~~~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~gd~---~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~ 393 (419) T protein:vir:94 318 LDQAPGSGV-FRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGF---RQGATLWSRQGITVLMTDSHADFFTANTLVI 393 (419) T ss_pred HHhhcCCCc-eeecCCcccCCCccccceeeEEcCCCCCccEEEeec---cceEEEEEecceEEEEeccccchhhcCcEEE Confidence 876543322 222222345677899999999999999998776553 44566777788888876654 4677889 Q ss_pred EEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 208 TADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 208 ~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) ++..+|++++++|+++++++++++ T Consensus 394 r~~~r~d~~v~~~~a~~~~~~~aa 417 (419) T protein:vir:94 394 LAEFRANLAVYQPKAFVRVTFAAA 417 (419) T ss_pred EEEEeeccEEeccccEEEEEeccC Confidence 999999999999999999999999 No 47 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=99.92 E-value=3.4e-27 Score=165.71 Aligned_cols=222 Identities=12% Similarity=0.072 Sum_probs=173.6 Q ss_pred CCCcccCceEEeccc---cCCcccccCC----CccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND---IGDAADVAEG----GEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGL 73 (231) Q Consensus 1 ~~~~~~G~ti~~P~~---igda~~v~EG----~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~ 73 (231) ..+..+|+++++|.| .|+++.--.| +.+++.++++.+..+.+..++++|..+|.+...++.|||..+.+|++. T Consensus 45 ~~~~~gG~~~~iPf~~~l~g~~e~n~~~dt~~~~~t~~kit~~~~~a~~~~r~kaw~~~Dla~~lsG~dpm~~Ia~~va~ 124 (349) T protein:vir:94 45 EIARGPSNIANLPFWKAIDTSIEPNYSNDVYQDIATPRAIQTGEMMARVAYLNEGFGQADLTVELTSQNPLQSVASRLDN 124 (349) T ss_pred HHHhcCCCEEEeeeeecCCCCcccccCCCCcccccccccccccceeeeeeeeccccchhHHHHHhhCchHHHHHHHHHHH Confidence 566789999999999 4776643333 258899999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhccccc-------------------ccccccCHHHHHHHHHHhhcc-----CCCceEEEECHHHHHH Q lcl|Aclame:pro 74 SLANKVDDDLLKAAKTTSQ-------------------TVSTKANVDGVQAALDIFNDE-----DAQAYVLIVNPKDAAK 129 (231) Q Consensus 74 ~ia~~vd~~~~~~l~t~~~-------------------~~~~~~~~d~i~da~~~l~~~-----~~~~~v~vv~p~~~~~ 129 (231) .|.+.-++.+++.|++... ..+..++...+.+|.++|++. ......++|||.++.. T Consensus 125 yW~r~~q~~Lia~L~Gvf~~~~~~~~~~~~~~~~~~d~~~~a~~~~~~~~~A~~~~Gdaa~Gd~~~~lt~i~mHS~v~~~ 204 (349) T protein:vir:94 125 FWQRQAQRRLIATALGLYNDNVSATDAYHEQNDMVVDVSATSGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQ 204 (349) T ss_pred HHhhHHHHHHHHHHHhhhcccccccccccccCceeEEecccCCCChhhHHHHHHHHHHHhccccccceeEEEEchHHHHH Confidence 9999999999987763211 122346788899999988875 4678899999999999 Q ss_pred HHhhhhhhhccccccCceeeeccceeecceeEEEcCCCcc----CceEEEEEecCCceEEEeecCC-ccceeccchhhc- Q lcl|Aclame:pro 130 IRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAE----GSALMFKIVSNSPALKLVLKRG-VQVETDRDIVTK- 203 (231) Q Consensus 130 L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~----~~~~~~~~~~~~~A~~~~~k~~-v~vE~~Rd~~~~- 203 (231) |++....... +. .-++..|++++|.+|+++|.||- +...+..+++++||+++....+ +.+|++||++++ T Consensus 205 L~~~~li~~i-~~----s~~~~~i~ty~G~~VivDD~~Pv~~~g~~~~yttylfg~GAi~~~~~~~~~~~E~~rd~~~g~ 279 (349) T protein:vir:94 205 ARKAQLIDFI-RD----AENNTMFATYQGYRVIVDDSMTVVGQDTSRKFISIIFGQGAIGYGEGNPEMPLEYEREASRAN 279 (349) T ss_pred HHhcchhhhc-cC----cccCcccceecCcEEEEeCCCccccCCCCceEEEEEeecceEEeecCCCCcceeeecccccCC Confidence 9987543221 11 12334689999999999999984 3446777888999999998765 579999999875 Q ss_pred ---ccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 204 ---TTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 204 ---~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) .|.+..++||..|.. + ++.+-+.| T Consensus 280 ~~G~d~L~~R~~~~~hp~---G-~s~~~a~v 306 (349) T protein:vir:94 280 GGGVETLWTRKTWLLHPF---G-YSFTSAVI 306 (349) T ss_pred cceeEEEEEeeEEEeeee---e-eeeccccc Confidence 599999999877652 2 22222222 No 48 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=99.92 E-value=3.5e-27 Score=165.61 Aligned_cols=222 Identities=11% Similarity=0.049 Sum_probs=172.0 Q ss_pred CCCcccCceEEeccc---cCCccc-c-cCC--CccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND---IGDAAD-V-AEG--GEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGL 73 (231) Q Consensus 1 ~~~~~~G~ti~~P~~---igda~~-v-~EG--~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~ 73 (231) ..+..+|+++++|.| .|+++. + ..+ +.+++.++++.+..+.+..++++|..+|.+...++.|||.++.+|++. T Consensus 45 ~~~~~gG~~~~iPf~~~L~g~~e~nv~~D~~~~~~t~~kitt~~~~a~~~~r~kaw~~~Dla~~lsG~dpm~~Ia~~va~ 124 (349) T protein:vir:78 45 EIANGPSNIANLPFWKAIDTSIEPNYSNDVYQDIATPRAIQTGEMMARVAYLNEGFGQADLTVELTSQNPLQSVASRLDN 124 (349) T ss_pred HHhhcCCCEEEeeeeecCCCCcccccCCCCcccccccccccccceeeeeeeeccccchhHHHHHhhCchHHHHHHHHHHH Confidence 566789999999999 466653 3 333 367999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhccccc-------------------ccccccCHHHHHHHHHHhhcc-----CCCceEEEECHHHHHH Q lcl|Aclame:pro 74 SLANKVDDDLLKAAKTTSQ-------------------TVSTKANVDGVQAALDIFNDE-----DAQAYVLIVNPKDAAK 129 (231) Q Consensus 74 ~ia~~vd~~~~~~l~t~~~-------------------~~~~~~~~d~i~da~~~l~~~-----~~~~~v~vv~p~~~~~ 129 (231) .|.+.-++.+++.|++... ..++.++.+.+++|.++|++. ......++|||.++.. T Consensus 125 yW~r~~q~~Lia~L~Gvf~~~~~a~~~~~~~~~~t~d~s~~a~~~~~~~~dA~~~lgda~~Gd~~~~lt~i~mHS~v~~~ 204 (349) T protein:vir:78 125 FWQRQAQRRLIATALGLYNDNVSATDAYHEQNDMVVDVSATLGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQ 204 (349) T ss_pred HHhhHHHHHHHHHHHHhhcccccccchhhhcccceeeeccccCCChhhhhhhHHHHHHHhccccccceeEEEEchHHHHH Confidence 9999999999887763211 112236788899999998875 4678899999999999 Q ss_pred HHhhhhhhhccccccCceeeeccceeecceeEEEcCCCccC----ceEEEEEecCCceEEEeecCC-ccceeccchhhc- Q lcl|Aclame:pro 130 IRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEG----SALMFKIVSNSPALKLVLKRG-VQVETDRDIVTK- 203 (231) Q Consensus 130 L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~----~~~~~~~~~~~~A~~~~~k~~-v~vE~~Rd~~~~- 203 (231) |++...... -+ +.-++..|++++|.+|+++|.||-. ...+..+++++||+++....+ +.+|++||++++ T Consensus 205 L~~~~li~~-i~----~s~~~~~i~ty~G~~VivDD~~Pv~~~g~~~~yttylfg~GAi~~~~~~~~~~~et~rd~~~g~ 279 (349) T protein:vir:78 205 ARKAQLIDF-IR----DAENNTMFATYQGYRVIVDDSMTVVGQGAQRKFISIIFGQGAIGYGEGNPVMPLEYEREASRAN 279 (349) T ss_pred HHhhhhhhh-cc----CcccCcccceecCeEEEEeCCCccccCCCCceEEEEEeecceEEEccCCCccceeeecccccCC Confidence 998644322 11 1123446899999999999999942 235667888999999987665 569999999875 Q ss_pred ---ccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 204 ---TTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 204 ---~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) .|.+..++||..|.. + ++.+-+.| T Consensus 280 ~~G~d~l~~R~~~~~hp~---G-~s~~~a~v 306 (349) T protein:vir:78 280 GGGVETLWTRKTWLLHPF---G-YRFTSAVI 306 (349) T ss_pred cceeEEEEEeeEEEeeee---e-eeeccccc Confidence 599999999877752 2 22222222 No 49 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=99.92 E-value=4.1e-27 Score=165.25 Aligned_cols=227 Identities=16% Similarity=0.102 Sum_probs=176.2 Q ss_pred CCC-------c-ccCceEEeccccC-CcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHH Q lcl|Aclame:pro 1 ENG-------I-NLANLCEYPNDIG-DAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQL 71 (231) Q Consensus 1 ~~~-------~-~~G~ti~~P~~ig-da~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~ 71 (231) ++. + -.|.+.++|.+.+ .+.+++||++++..++++++.++.+++.+..+.+|+|...++..|+.+...+++ T Consensus 33 ~s~l~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~l 112 (299) T protein:vir:41 33 GSAAMKLAKAVPMTKPEEEFTFMSGVGAFWVDEAERIQTSKPTFTKAKMRSKKMGVIIPTTKENLNYSVTNFFSLMQAEI 112 (299) T ss_pred cchhhhhceeeecCCCcEEEEEEcCCceeeeecCccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHH Confidence 111 1 1356677887743 466899999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhccc-------------ccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhh Q lcl|Aclame:pro 72 GLSLANKVDDDLLKAAKTT-------------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKN 138 (231) Q Consensus 72 a~~ia~~vd~~~~~~l~t~-------------~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~ 138 (231) ++++++++|+.++.+-.+. .....+.+++++|.+++..+.+++..+..++|||..+..|++.++..+ T Consensus 113 ~~a~~~~~d~a~l~G~g~~~~~gil~~~~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G 192 (299) T protein:vir:41 113 VEAFYKKFDQAVFTGVESPYNWNILKSATDASNLVEETANKYDDLNEAIGLIEAEDLEPNGIATIRKQRVKYRSTKDGNG 192 (299) T ss_pred HHHHHHHHHHHHhhcccCcccccccccccccceeeccccccHHHHHHHHHhhhcccCCcCEEEEcHHHHHHHHHhhccCC Confidence 9999999999998643211 111234578999999999999888888999999999999998776543 Q ss_pred ccccccCceeeeccceeecceeEEEcCCCccCceEEEEE-ecCCceEEEeecCCccceeccchh---------------- Q lcl|Aclame:pro 139 IGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKI-VSNSPALKLVLKRGVQVETDRDIV---------------- 201 (231) Q Consensus 139 ~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~-~~~~~A~~~~~k~~v~vE~~Rd~~---------------- 201 (231) . +.....+. +..++++|+||++++++|.++.-...+ ..... +.+...+++.+|..|+.. T Consensus 193 ~--~l~~~~~~-~~~~~l~G~PV~~~~~~~~~~~~~~~~~gdfs~-~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 268 (299) T protein:vir:41 193 M--PIFNTATS-NGVDDVLGLPIAYTPKYTFGDKDISELVGDWNQ-AYYGILRGVEYEILTEATLTTVADETGKPLNLAE 268 (299) T ss_pred c--eeecCCcC-CCCceecceeeEEecccCCCCCceEEEEEeccc-EEEEEecCcEEEEeecccccccccccccchhhhh Confidence 2 32323233 334789999999999999876321111 11123 346777889999888753 Q ss_pred hcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 202 TKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 202 ~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) +....++..+++++++.+|+++++++.++- T Consensus 269 ~~~~~~r~~~~~d~~v~~~~A~~~l~~~aa 298 (299) T protein:vir:41 269 RDMAAIKATFEVGFMVVKDEAFSAVQPKAG 298 (299) T ss_pred cCcEEEEEEEEeccEEecccceEEEEeccC Confidence 345678899999999999999999999999 No 50 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=99.92 E-value=1.4e-27 Score=167.80 Aligned_cols=227 Identities=15% Similarity=0.112 Sum_probs=171.6 Q ss_pred CCCcccCceEEe----ccc-cCCcccccCCCccCcccccccee-EEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEY----PND-IGDAADVAEGGEISLDKIGTTTK-SVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLS 74 (231) Q Consensus 1 ~~~~~~G~ti~~----P~~-igda~~v~EG~~i~~~~lt~~~~-~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ 74 (231) +-+.+.+-.+.| |.| .||+++++||.++|....+++.. .+.++|+|+.++||||++..+..++++.+.++++.. T Consensus 48 ~~~a~~~~~v~f~~~~p~~~~~d~e~VaEggEiP~~~~~~G~~~ia~~~K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nt 127 (318) T protein:vir:10 48 NGGANPNGVVAYNEGNPSFLEDDVADVAEFGEIPVSAGARGLPRTAFAVKKALGVRVSKEMIDENRVGAVNDQMLQLRNT 127 (318) T ss_pred cccccccceeEEEecccccccCcHhhccCcccccccCCCCCchhhhhhehhccceeccHHHHhhcChhHHHHHHHHHHHH Confidence 445666667888 445 79999999999999999999664 457899999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhcccc---ccc--cc---ccCHHHHHHHHH-------Hhh---------ccCCCceEEEECHHHHHHH Q lcl|Aclame:pro 75 LANKVDDDLLKAAKTTS---QTV--ST---KANVDGVQAALD-------IFN---------DEDAQAYVLIVNPKDAAKI 130 (231) Q Consensus 75 ia~~vd~~~~~~l~t~~---~~~--~~---~~~~d~i~da~~-------~l~---------~~~~~~~v~vv~p~~~~~L 130 (231) |++++|+.++.+|..+. ... ++ .....++.+|.+ .+. ..++.++.+||||.++..| T Consensus 128 i~r~~d~~a~dal~sa~t~~~~~s~~w~~~~~~~~d~~~A~e~v~~a~~~~~~a~~~~~~~~~GY~pdtIVlhP~~~~~l 207 (318) T protein:vir:10 128 FIRANDRSAKALLQSPIVPTLAVPTAWDNGGKVRTDIAIAIEQISTAAPTAYPAGVGSSDEYFGFIPDTIVMHYALLPIL 207 (318) T ss_pred HHHHHHHHHHHHHhccccccccCCcCCCCcccccccchhhhhhhhhhhhhhhhhhhhhhhhccCccceeeEECHHHHHHH Confidence 99999999999885432 111 10 011112333321 111 1247889999999999999 Q ss_pred HhhhhhhhccccccCc----eeeeccc-eeecceeEEEcCCCccCceEEEEEecCCceEEEe-ecCCccceeccch---- Q lcl|Aclame:pro 131 RKDANAKNIGSEVGAN----ALINGTY-ADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLV-LKRGVQVETDRDI---- 200 (231) Q Consensus 131 ~k~~~~~~~~~~~~~~----~~~~G~i-g~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~-~k~~v~vE~~Rd~---- 200 (231) ++++++......-.+. .-.+|.| |+++|++|+.|+++|.++++.+. ++++|++ -.+++++++.|.+ T Consensus 208 ~~n~~~~~~y~~~a~~~~~~~~~tg~~~g~~lGl~vi~s~~~p~~~alvlq----~g~vG~~~d~~pl~~t~~~~egg~~ 283 (318) T protein:vir:10 208 MDNENFMKVYERNANYVSTAPDWTGNFPGSVMGLNVIRSRTFPIDRVLIME----RGTVGFYSDTRPLQFTALYPEGNGP 283 (318) T ss_pred hcchhhhhhhhccchhhhhcccccccccceeeceEEeecCccCCCeeEEEe----cCCcceeeccccceeeecccCCCCC Confidence 9998876543211111 1124554 77899999999999999988776 6667665 4455788999976 Q ss_pred ---hhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 201 ---VTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 201 ---~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) ...+..+..++....+|.+|.++|+||-=+. T Consensus 284 ~g~~~~s~~~~~~~~~~~~V~~PkA~~~itgi~~ 317 (318) T protein:vir:10 284 NGGPTESYRADASHKRALAVDQPKAALWLTGIVT 317 (318) T ss_pred CCCcchhhheehheeeeeeeeCcceeEEEeeccC Confidence 6667888999999999999999999976555 No 51 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=99.92 E-value=6e-27 Score=164.35 Aligned_cols=222 Identities=21% Similarity=0.210 Sum_probs=176.6 Q ss_pred CCCcccCceEEeccccC---CcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIG---DAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~ig---da~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) .--...+.++++|.+.+ .+.+++||+++|..++++++.++++++++..+.+|++...++ +++.+...+++++++++ T Consensus 148 ~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~-~~~~~~i~~~l~~~~~~ 226 (390) T protein:vir:81 148 GSGRTDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQILSDA-PQLASYMNNRLIRGLKV 226 (390) T ss_pred ceeeccCCceEEEEEecCCcceeeecCCcccccccceeeEEEEeeeEEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHH Confidence 11112455788888743 456899999999999999999999999999999999988776 58999999999999999 Q ss_pred HHHHHHHHHhccc---------------ccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhcccc Q lcl|Aclame:pro 78 KVDDDLLKAAKTT---------------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSE 142 (231) Q Consensus 78 ~vd~~~~~~l~t~---------------~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~ 142 (231) ++|..++.+-.+. ....+...+++.+.+++..+...+.....++|||..+..|++.++..+. + T Consensus 227 ~~d~a~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~--~ 304 (390) T protein:vir:81 227 KEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYNPSGIVINPIDWAAIELAKDANNQ--Y 304 (390) T ss_pred HHHHHHHhcCCCCCcccceeecccccccccccccchhHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCc--e Confidence 9999998753221 1122345679999999999988888888999999999999987654433 2 Q ss_pred ccCceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchh---hcccEEEEEEEEEEEEEc Q lcl|Aclame:pro 143 VGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIV---TKTTVITADEHYAAYLYD 219 (231) Q Consensus 143 ~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~---~~~~~i~~~~~y~~~~~~ 219 (231) ...+. .+|..++++|+||++++.+|+++.++.++ ..++.++.+.++.++.++... +....+++..||++++.+ T Consensus 305 l~~~~-~~~~~~~l~G~pv~~~~~~p~~~~~~gd~---~~~~~~~~~~~~~v~~~~~~~~~~~~~v~~r~~~r~d~~v~~ 380 (390) T protein:vir:81 305 LIGNA-RGTLTPTLWGLPVVATQAMAPGEFLVGAF---DLAAQIFDQWDARVEIGYVGEDFQRNMITVLAEERLALVVYR 380 (390) T ss_pred eecCc-ccccCceecceeeEEcCCCCCCcEEEEeh---hceEEEEEecceEEEEecccchhhcCcEEEEEEEeeccEEec Confidence 22222 24556799999999999999999776553 446777888899999887532 345578899999999999 Q ss_pred CCcEEEEEec Q lcl|Aclame:pro 220 LTKVVNITFT 229 (231) Q Consensus 220 ~~~vv~l~~~ 229 (231) |+++|++++. T Consensus 381 ~~a~v~~t~a 390 (390) T protein:vir:81 381 PEALISGSFA 390 (390) T ss_pred ccceEEEEeC Confidence 9999999999 No 52 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=99.92 E-value=7.4e-27 Score=163.85 Aligned_cols=221 Identities=21% Similarity=0.196 Sum_probs=176.2 Q ss_pred CCCcccCceEEecccc---CCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDI---GDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~i---gda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) .--+ .+.++++|.+. +.+.+++||++++..++++++.++++++.+..+.+|++.+.++ .++.+...+++++++++ T Consensus 149 ~~~~-~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds-~~l~~~i~~~la~a~~~ 226 (390) T protein:vir:97 149 SGRT-DSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQILSDA-PQLASYMNNRLIRGLKV 226 (390) T ss_pred eeec-cCCceEEEEEecCCcceeeecCCccccccccceeEEEEeeeeEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHH Confidence 1112 35568888763 4567899999999999999999999999999999999987766 68999999999999999 Q ss_pred HHHHHHHHHhccc---------------ccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhcccc Q lcl|Aclame:pro 78 KVDDDLLKAAKTT---------------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSE 142 (231) Q Consensus 78 ~vd~~~~~~l~t~---------------~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~ 142 (231) ++|..++.+-.+. ....+..+.++.+.+++..+.........++|||.++..|++.++..+. + T Consensus 227 ~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~--~ 304 (390) T protein:vir:97 227 KEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYPASGIVINPIDWAAIELAKDANNQ--Y 304 (390) T ss_pred HHHHHHhhcCCCCccccceeeccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCc--e Confidence 9999998642211 1123345678999999999988888899999999999999987654433 2 Q ss_pred ccCceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccch-h--hcccEEEEEEEEEEEEEc Q lcl|Aclame:pro 143 VGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDI-V--TKTTVITADEHYAAYLYD 219 (231) Q Consensus 143 ~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~-~--~~~~~i~~~~~y~~~~~~ 219 (231) ...+. .+|..++++|+||++|+.+|+++.++.++ ..++.++...++.++..++. . +....+++..+|++++.+ T Consensus 305 l~~~~-~~~~~~~l~G~pV~~~~~~~~~~~~~gd~---~~~~~~~~~~~~~i~~~~~~~~f~~~~~~~r~~~r~d~~v~~ 380 (390) T protein:vir:97 305 LIGNA-RGTLTPTLWGLPVVATQAMAPGEFLVGAF---DLAAQIFDQWDARVEIGYVNDDFQRNMVTVLAEERLALVVYR 380 (390) T ss_pred eecCc-cCCCCceecceeeEEcCCCCCCcEEEEec---cceEEEEEecceEEEEeecccccccCcEEEEEEEeeccEEec Confidence 22222 34556799999999999999998776553 45677888899999988753 3 445568888999999999 Q ss_pred CCcEEEEEec Q lcl|Aclame:pro 220 LTKVVNITFT 229 (231) Q Consensus 220 ~~~vv~l~~~ 229 (231) |+++|++++. T Consensus 381 ~~a~v~~~~a 390 (390) T protein:vir:97 381 PEALITGSFA 390 (390) T ss_pred cccEEEEEeC Confidence 9999999999 No 53 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=99.91 E-value=2.1e-26 Score=161.39 Aligned_cols=222 Identities=21% Similarity=0.198 Sum_probs=174.6 Q ss_pred CCCc-ccCceEEeccccC---CcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGI-NLANLCEYPNDIG---DAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLA 76 (231) Q Consensus 1 ~~~~-~~G~ti~~P~~ig---da~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia 76 (231) =.-+ -.+.++++|.+.+ .+.+++||++++..++++++.++++++++..+.+|++.+.++ .++.+...++++++++ T Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~-~~l~~~i~~~l~~~~~ 225 (390) T protein:vir:10 147 IGSGRTDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQILSDA-PQLASYMNNRLIRGLK 225 (390) T ss_pred cceeeccCCceEEEEEecCCcceeeecCCccccccccceeEEEEeeEEEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHH Confidence 0011 1345789998754 456899999999999999999999999999999999987766 5899999999999999 Q ss_pred HHHHHHHHHHhccc---------------ccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccc Q lcl|Aclame:pro 77 NKVDDDLLKAAKTT---------------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGS 141 (231) Q Consensus 77 ~~vd~~~~~~l~t~---------------~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~ 141 (231) +++|..++.+-.+. ....+....++.+.+++..+.........++|||.++..|++.++..+.+ T Consensus 226 ~~~~~~il~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~g~~- 304 (390) T protein:vir:10 226 VKEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYPASGIVINPIDWAAIELAKDANNQY- 304 (390) T ss_pred HHHHHHHhhcCCCCccccccccccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCce- Confidence 99999998753211 11223456789999999999888888899999999999999877654432 Q ss_pred cccCceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccch---hhcccEEEEEEEEEEEEE Q lcl|Aclame:pro 142 EVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDI---VTKTTVITADEHYAAYLY 218 (231) Q Consensus 142 ~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~---~~~~~~i~~~~~y~~~~~ 218 (231) ..... .++.-++++|+||++++.+|+++.++.++ ..++.++.+.++.++..++. .+....+++..+|++++. T Consensus 305 -l~~~~-~~~~~~~l~G~pv~~~~~~p~~~~~~gdf---~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~~~r~d~~v~ 379 (390) T protein:vir:10 305 -LIGNA-RGTLTPTLWGLPVVATQAMAPGEFLVGAF---DLAAQIFDQWDARVEIGYVNDDFQRNMVTVLAEERLALVVY 379 (390) T ss_pred -eecCC-cCcCCceecceeeEEcCCCCCCcEEEEec---cceEEEEEecceEEEEeecccccccCcEEEEEEEeeccEEe Confidence 22222 23445789999999999999999876553 34667777888999877653 335568888899999999 Q ss_pred cCCcEEEEEec Q lcl|Aclame:pro 219 DLTKVVNITFT 229 (231) Q Consensus 219 ~~~~vv~l~~~ 229 (231) +|+++++++++ T Consensus 380 ~~~a~~~~~~a 390 (390) T protein:vir:10 380 RPEALISGSFA 390 (390) T ss_pred ccccEEEEEeC Confidence 99999999999 No 54 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=99.91 E-value=1.9e-26 Score=161.57 Aligned_cols=229 Identities=16% Similarity=0.130 Sum_probs=176.8 Q ss_pred CCCcccCceEEeccccCC--cccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGD--AADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANK 78 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igd--a~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~~ 78 (231) ..-.-.+.+++||++.+. +.+++||++++..++++++.+++++|++..+.||+|...++..|+.+...++++++|+++ T Consensus 45 ~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~f~~v~l~~~k~~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~ 124 (397) T protein:vir:23 45 QKIPMGATGIVIPHWTGDVSAQWIGEGDMKPITKGNMTKRDVHPAKIATIFVASAETVRANPANYLGTMRTKVATAIAMA 124 (397) T ss_pred ceeeccCCceEEEEEcCCcceEEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHH Confidence 111123556889987544 568999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhccc------------ccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccccc--- Q lcl|Aclame:pro 79 VDDDLLKAAKTT------------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEV--- 143 (231) Q Consensus 79 vd~~~~~~l~t~------------~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~--- 143 (231) +|+.++....+. ....+....++.+.+++..|.........++|||..+..|++.++...+.... T Consensus 125 ~d~a~l~G~gt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~i~~~~~ 204 (397) T protein:vir:23 125 FDNAALHGTNAPSAFQGYLDQSNKTQSISPNAYQGLGVSGLTKLVTDGKKWTHTLLDDTVEPVLNGSVDANGRPLFVEST 204 (397) T ss_pred HHHHHhhcccCCcccccccccccceeeecccchhHHHHHHHHhhhhcccCCCEEEEcHHHHHHHHHhhccCCceeecccc Confidence 999999754332 11234456788899999999888888899999999999999877654432211 Q ss_pred cCceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccch----------------hhcccEE Q lcl|Aclame:pro 144 GANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDI----------------VTKTTVI 207 (231) Q Consensus 144 ~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~----------------~~~~~~i 207 (231) .......+..++++|+||++++++|+++...+.-.+. .+ .+...+++.+|..|+. .+....+ T Consensus 205 ~~~~~~~~~~~tl~G~Pv~~s~~~~~g~~~~~~gDfs-~~-~i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ 282 (397) T protein:vir:23 205 YESLTTPFREGRILGRPTILSDHVAEGDVVGYAGDFS-QI-IWGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAV 282 (397) T ss_pred cccccccccCceeeeeeEEEeCCCCCCceEEEEeecc-eE-EEEEEeceEEEEeeeeeeeeccccccceeeeeeccceeE Confidence 1111122234689999999999999988654432332 23 3566677888877764 2345678 Q ss_pred EEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 208 TADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 208 ~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) +...++++++.+|+++++++++.+ T Consensus 283 ra~~r~d~~v~~~~a~~~~~~~~~ 306 (397) T protein:vir:23 283 RVEAEYGLLINDVNAFVKLTFDPV 306 (397) T ss_pred EEEeeeccceecccceEEEeeccc Confidence 889999999999999999999888 No 55 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=99.91 E-value=2.6e-26 Score=160.86 Aligned_cols=223 Identities=13% Similarity=0.081 Sum_probs=176.6 Q ss_pred CCCcccCceEEeccccC--CcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIG--DAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANK 78 (231) Q Consensus 1 ~~~~~~G~ti~~P~~ig--da~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~~ 78 (231) -.-.-.|.++++|.+.+ .+.+++||+++|..++++++.+++.+|.+..+.+|+|...++..++.+...++++++++++ T Consensus 62 ~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~ 141 (324) T protein:vir:99 62 KYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKK 141 (324) T ss_pred ceeeccCCceEEEEEecCcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHH Confidence 11112355789998854 4678999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhccc-------------ccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccccccC Q lcl|Aclame:pro 79 VDDDLLKAAKTT-------------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGA 145 (231) Q Consensus 79 vd~~~~~~l~t~-------------~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~ 145 (231) +|+.++.+..+. .....+.+++++|.++...|.+......+++|||..+..|++..+..+. T Consensus 142 ~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~g~------ 215 (324) T protein:vir:99 142 FDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETK------ 215 (324) T ss_pred HHHHhhhcCCCCccCccccccccccceeccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhcCCCc------ Confidence 999998653221 1123356789999999999988888888999999999999987654332 Q ss_pred ceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccch----------------hhcccEEEE Q lcl|Aclame:pro 146 NALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDI----------------VTKTTVITA 209 (231) Q Consensus 146 ~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~----------------~~~~~~i~~ 209 (231) ..+..+.-++++|+||+.++.++.+++..+..... . +.+...+++++|..|+. .+....+++ T Consensus 216 ~~~~~~~~~~l~G~PVv~~~~~~~~~~~~i~gd~~-~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~ 293 (324) T protein:vir:99 216 ERIYDRNSDTLDGLPVVNLKSSNLKRGELITGDFD-K-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRA 293 (324) T ss_pred eeecCCCCccccceeEEeecCCCCCcceEEEEecc-c-EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEE Confidence 22334556789999999999988777654433332 2 34666778899887764 345678889 Q ss_pred EEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 210 DEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 210 ~~~y~~~~~~~~~vv~l~~~~~ 231 (231) .++|++++.+|+++++|+.+.- T Consensus 294 ~~r~d~~v~~~~a~~~lt~a~~ 315 (324) T protein:vir:99 294 TMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred EEEEccEEecccceEEEEeccC Confidence 9999999999999999987655 No 56 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=99.91 E-value=3.6e-26 Score=160.10 Aligned_cols=223 Identities=13% Similarity=0.083 Sum_probs=175.0 Q ss_pred CC---CcccCceEEeccccCC--cccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHH Q lcl|Aclame:pro 1 EN---GINLANLCEYPNDIGD--AADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSL 75 (231) Q Consensus 1 ~~---~~~~G~ti~~P~~igd--a~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~i 75 (231) .. -.-.|.++++|.+.+. +.+++||+++|..++++++.++++++++..+.+|+|.+.++..|+.+...++++++| T Consensus 59 ~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai 138 (324) T protein:vir:93 59 QLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAF 138 (324) T ss_pred hhcceeeccCCceEEEEEecCcceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHH Confidence 11 1123556889988654 568999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhccc-------------ccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhcccc Q lcl|Aclame:pro 76 ANKVDDDLLKAAKTT-------------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSE 142 (231) Q Consensus 76 a~~vd~~~~~~l~t~-------------~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~ 142 (231) ++++|+.++....+. .....+..++++|.++...+.........++|||..+..|++..+..+ T Consensus 139 a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~G---- 214 (324) T protein:vir:93 139 YKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPET---- 214 (324) T ss_pred HHHHHHHHhcCCCCCCcCccccccccccceeccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhCCCC---- Confidence 999999998643221 112335678999999999998888888899999999999998755433 Q ss_pred ccCceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccch----------------hhcccE Q lcl|Aclame:pro 143 VGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDI----------------VTKTTV 206 (231) Q Consensus 143 ~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~----------------~~~~~~ 206 (231) ..++..+..++++|+||+.++..+.+++..+...+. -+.+...++++++.+|+. .+.... T Consensus 215 --~~~~~~~~~~~l~G~PVv~~~~~~~~~~~i~~gdfs--~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~~ 290 (324) T protein:vir:93 215 --KERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFD--KLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVA 290 (324) T ss_pred --CeeecCCCCCcccceeeEeecCCCCCcceEEEEecc--eEEEEEecCcEEEEeecccccccccccccchhhhhcCcEE Confidence 223445667899999999988766555443322222 234666788899888874 345689 Q ss_pred EEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 207 ITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 207 i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) +++.++|++++.+|+++++|+.+.. T Consensus 291 ~r~~~r~d~~v~~~~a~~~l~~a~~ 315 (324) T protein:vir:93 291 LRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred EEEEEEeccEEecccceEEEecccc Confidence 9999999999999999999975444 No 57 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=99.91 E-value=1e-26 Score=163.12 Aligned_cols=230 Identities=13% Similarity=0.124 Sum_probs=178.5 Q ss_pred CCCcccCceEEeccccCCcccc---------cCCC-ccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGDAADV---------AEGG-EISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQ 70 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igda~~v---------~EG~-~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~ 70 (231) ++.+.++++++.|. +.+...+ ..++ ..|+.....+...+.+.++..++.|.|.+..+...||.+..+++ T Consensus 47 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~d~~~dtp~~~~~~~~r~~~~~d~~~~~~VDd~D~~k~~~D~~~~~~~~ 125 (322) T protein:vir:10 47 KNESSESHNWETLA-SMDPDAVKRKRSRQQSADGTYPTPVNNKPFAKRRTNVDTYDTGHVVEQEDISQMLLDPNSALITS 125 (322) T ss_pred ccccccccceeecc-cccccccccccccccccCcccCCCccccccceEEEeecccccceecchHHHHHhhcCchHHHHHH Confidence 88888888888764 2222222 2222 24555667888888888888899999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhccccc-----------------ccccccCHHHHHHHHHHhhccCC---CceEEEECHHHHHHH Q lcl|Aclame:pro 71 LGLSLANKVDDDLLKAAKTTSQ-----------------TVSTKANVDGVQAALDIFNDEDA---QAYVLIVNPKDAAKI 130 (231) Q Consensus 71 ~a~~ia~~vd~~~~~~l~t~~~-----------------~~~~~~~~d~i~da~~~l~~~~~---~~~v~vv~p~~~~~L 130 (231) ++++|+|+.|+.+++++..... .....++++.|.+|..+|++.+. .++|++++|++++.| T Consensus 126 ~a~AL~R~~D~~I~~a~~g~a~~~~~gt~v~~~ss~~i~~g~~g~t~~kl~~a~~~l~~~dvp~d~~R~~vv~p~~~~~L 205 (322) T protein:vir:10 126 QAYAMARKTDDLIIAGAWKPASIKGTGQPVEFLATQEIGDGTKPISFDYVTEITERFLENEIEPEVSKVIVIGPTQARKL 205 (322) T ss_pred HHHHhhhHHHHHHHhhhhccccccccccccccCCCcccccCccchhHHHHHHHHHHHHhcCCCCCCCeEEEeCHHHHHHH Confidence 9999999999988876543210 01235679999999999987653 358999999999999 Q ss_pred HhhhhhhhccccccCceeeeccceeecceeEEEcCCCccCce--------------EEEEEecCCceEEEeecCCcccee Q lcl|Aclame:pro 131 RKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSA--------------LMFKIVSNSPALKLVLKRGVQVET 196 (231) Q Consensus 131 ~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~--------------~~~~~~~~~~A~~~~~k~~v~vE~ 196 (231) +++++|.............+|.+|+++|++|+.|+++|.... ....+...+.|+++...+++++|- T Consensus 206 L~d~~~ts~D~~~~~~l~~~G~ig~~lGf~~i~s~~lp~~~~t~~~~~~~~~~~~~~~~~~a~~k~Av~~a~~~dv~~~i 285 (322) T protein:vir:10 206 LQITEATSADYTSAMDLQSKGIITNWMGYTWIVSTRLDKFDPTQWGMAAEDGPQGDEIWCIAMTDMALGYHSCKDIWTKV 285 (322) T ss_pred hcchhhhhhhcccchhhhhcCeeeeeeeEEEEEeccCCccccccccccccCCCCccceeEEEEecCceeEEEeeeeeEEe Confidence 999999764333333444789999999999999999984322 112344567899999988999884 Q ss_pred c-cchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 197 D-RDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 197 ~-Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) + |....+...|++.+.||+++++|++||.|..+=. T Consensus 286 ~~~~~~~~a~~I~~~~~~Ga~ri~~~gVv~i~~~e~ 321 (322) T protein:vir:10 286 AEDPSASFAWRIYSAFTADCVRVEDEHIFKLRLKNS 321 (322) T ss_pred eccCCcchhhhhhhhhhhCceEeccCcEEEEEEecc Confidence 4 4555568889999999999999999999988777 No 58 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=99.91 E-value=3.9e-26 Score=159.89 Aligned_cols=224 Identities=18% Similarity=0.177 Sum_probs=177.7 Q ss_pred CCCcccCceEEecccc---CCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDI---GDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~i---gda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) -.-.-.|.++++|.+. +.+.+++||++++..++++++++++.++++..+.||++...++ +++.+...+++++++++ T Consensus 170 ~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds-~~l~~~i~~~l~~a~~~ 248 (418) T protein:vir:10 170 MPGQTSSSSIEYTVETGFTNNAAAVAEGAQKPTSDLKFNLKNQPVRTIAHLFKASRQILDDA-PALQSYIDGRARYGLQL 248 (418) T ss_pred ceeeccCCceeEEEEecCCCceeeeccCccccccccceeeEEEeeeeEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHH Confidence 1111235668888753 3466899999999999999999999999999999999987666 68999999999999999 Q ss_pred HHHHHHHHHhcccc---------------cccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhcccc Q lcl|Aclame:pro 78 KVDDDLLKAAKTTS---------------QTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSE 142 (231) Q Consensus 78 ~vd~~~~~~l~t~~---------------~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~ 142 (231) ++|..++.+-.+.. ...++..++++|++++..+.........++|||.++..|++..+..+ .+ T Consensus 249 ~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G--~~ 326 (418) T protein:vir:10 249 TEEGQILKGDGTGANILGILPQASAFMPSITLANATPIDKIRLALLQAVLAEFPATGIVLNPIDWASIELTKDSQG--RY 326 (418) T ss_pred HHHHHHhccCCCCccccccccccccccccccccccccHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCC--ce Confidence 99999987533211 11234467999999999998888888899999999999998765443 33 Q ss_pred ccCceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccch----hhcccEEEEEEEEEEEEE Q lcl|Aclame:pro 143 VGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDI----VTKTTVITADEHYAAYLY 218 (231) Q Consensus 143 ~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~----~~~~~~i~~~~~y~~~~~ 218 (231) ...+ ..+|..++++|+||++|+.||.++.+..++ ..++.++.+.+++++.+++. .+....+++..++++++. T Consensus 327 i~~~-~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~---s~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~~~d~~~~ 402 (418) T protein:vir:10 327 IVGN-PVNGTTPRLWNLPVVETQAMTANEFLVGAF---SMAAQIFDRMEIEVLLSTENVDDFEKNMVSIRAEERLALAVY 402 (418) T ss_pred eccc-cccCCCceecceeeEEcCCCCCCcEEEeec---cceEEEEEecceEEEEecccchhhhcCceEEEEEEeeccEEe Confidence 3333 346777899999999999999998765542 34566777788888877653 356678889999999999 Q ss_pred cCCcEEEEEeccC Q lcl|Aclame:pro 219 DLTKVVNITFTGV 231 (231) Q Consensus 219 ~~~~vv~l~~~~~ 231 (231) +|+++++++++.. T Consensus 403 ~~~a~~~~~~~~~ 415 (418) T protein:vir:10 403 RPESFVTGALVEQ 415 (418) T ss_pred cccceEEEEeccC Confidence 9999999999988 No 59 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=99.91 E-value=2.1e-26 Score=161.33 Aligned_cols=225 Identities=17% Similarity=0.111 Sum_probs=177.7 Q ss_pred CCCcccCceEEeccccC--CcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIG--DAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANK 78 (231) Q Consensus 1 ~~~~~~G~ti~~P~~ig--da~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~~ 78 (231) ..-+++|..+.+|.+.| .+.+++||+++|.+++++++.++++++++..+.||++.+.++..|+.+...++++++|+.+ T Consensus 147 ~~~~~~~~~~~~p~~~~~~~a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~ 226 (390) T protein:vir:62 147 TFTTSDANPLDFTVITGRSSASIVGETAEIPESYPATAQRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDA 226 (390) T ss_pred eeecCCCceeEEEEEcCCcceeeecccccccccccceeeeEeeeeeEEeehHHHHHHHhhhhHHHHHHHHHHHHHHHHHH Confidence 22245667799998866 5668999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhc------cc--------ccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhcccccc Q lcl|Aclame:pro 79 VDDDLLKAAK------TT--------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVG 144 (231) Q Consensus 79 vd~~~~~~l~------t~--------~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~ 144 (231) +|..++.+-. +. ....++..++++|++++..|........+|+|||..+..|++.++..+. +.. T Consensus 227 ~d~~~l~G~G~p~Gi~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~--~l~ 304 (390) T protein:vir:62 227 MGRHFITGTGQPRGILTDASPATATFLATDTDSKVSDALIDLFHEVPSAYRANAKYVVNDLRAAQMRKLKDANGQ--YLW 304 (390) T ss_pred HHhhhhccCCccccccccccccccceecccccccchHHHHHHHHhhhhhhhcCCEEEEchHHHHHHHHhhccCCC--eee Confidence 9999986421 10 0112345689999999988876555566899999999999887654433 222 Q ss_pred CceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccch--hhcccEEEEEEEEEEEEEcCCc Q lcl|Aclame:pro 145 ANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDI--VTKTTVITADEHYAAYLYDLTK 222 (231) Q Consensus 145 ~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~--~~~~~~i~~~~~y~~~~~~~~~ 222 (231) ..-+.+|..++++|.||++++.+|++..++.++ + .+.+...+++.++...+. .+....+++..++++++++|++ T Consensus 305 ~~~~~~g~~~~l~G~Pv~~~~~~p~~~i~~gd~--s--~~~i~~~~~~~v~~~~~~~~~~~~~~~~~~~r~d~~~~~~~A 380 (390) T protein:vir:62 305 QSGLTVGAPSLFNGKVVETDDGMPADKILFADL--S--KYRVRFAGSLRVDRSVDAKFSTDQIVYRFLQRADGLLVDARG 380 (390) T ss_pred cCCcCCCccceecccceEEecCCCCccEEEeec--c--ceeEEeecceEEEeeccccccCCcEEEEEEEEeCcEeechhh Confidence 223455777899999999999999988655443 2 244666778888766655 4456778999999999999999 Q ss_pred EEEEEeccC Q lcl|Aclame:pro 223 VVNITFTGV 231 (231) Q Consensus 223 vv~l~~~~~ 231 (231) +++|++++- T Consensus 381 ~~~l~~~~~ 389 (390) T protein:vir:62 381 AKVLTVTPG 389 (390) T ss_pred eEEEEeecC Confidence 999999988 No 60 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=99.91 E-value=3.8e-26 Score=159.93 Aligned_cols=225 Identities=18% Similarity=0.137 Sum_probs=177.9 Q ss_pred CCCcccCceEEeccccCC--cccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGD--AADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANK 78 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igd--a~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~~ 78 (231) .--+++|..+.+|.+.|. +.+++||+++|..++++++.++.+++.+..+.||++.+.++..|+.+...++++++++++ T Consensus 147 ~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~ 226 (392) T protein:vir:13 147 TFTTSDANPMDFTVITGRATAGIVGETAEIPESYPATTQRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDA 226 (392) T ss_pred eeecCCCceeEEEEEcCCcceeeecccccccccccceeeEEeeeeeEEeeehhHHHHHhcchHHHHHHHHHHHHHHHHHH Confidence 122456778999987654 557999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcc---------cc-------cccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhcccc Q lcl|Aclame:pro 79 VDDDLLKAAKT---------TS-------QTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSE 142 (231) Q Consensus 79 vd~~~~~~l~t---------~~-------~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~ 142 (231) +|..++.+-.+ .. ...++.++|++|++++..|.........++|||..+..|++..+..+. + T Consensus 227 ~d~~~l~G~Gt~~p~Gil~~~~~~~~~~~~~~~~~~~~d~l~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~--~ 304 (392) T protein:vir:13 227 MGRHFLTGTGTGQPRGILTDATGANAAFGEADADSKVSDALIDLFHEVPSAYRKNAKFVVNDLRAAQMRKLKDANGQ--Y 304 (392) T ss_pred HHHHHhcccCCccccccccccccccccccccccccccHHHHHHHHHhhhhhhhcCCEEEEcHHHHHHHHHhhccCCc--e Confidence 99999964211 00 112345679999999988876555567799999999999887654433 2 Q ss_pred ccCceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchh--hcccEEEEEEEEEEEEEcC Q lcl|Aclame:pro 143 VGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIV--TKTTVITADEHYAAYLYDL 220 (231) Q Consensus 143 ~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~--~~~~~i~~~~~y~~~~~~~ 220 (231) ....-+..|..++++|.||++++.+|+++.++.++ ..+.+...+++.++.+++.. +....+++..++++++.+| T Consensus 305 l~~~~~~~g~~~~l~G~Pv~~~~~~~~~~i~~Gdf----~~~~i~~~~~~~i~~~~~~~~~~~~~~~r~~~r~d~~~~~~ 380 (392) T protein:vir:13 305 LWQSALTVGAPDTFNGKVVETDDGMPADKVLFADL----SKYRVRFAGSLRVDRSVDAKFSTDQIVYRFLQRADGLLVDA 380 (392) T ss_pred eecCCcCCCCCceecceeeEEcCCCCCCcEEEeec----cceeEEeecceEEEeeccccccCCcEEEEEEEEeccEEecc Confidence 22233455667899999999999999998765543 23566677788887766654 4557889999999999999 Q ss_pred CcEEEEEeccC Q lcl|Aclame:pro 221 TKVVNITFTGV 231 (231) Q Consensus 221 ~~vv~l~~~~~ 231 (231) ++++.+++++- T Consensus 381 ~A~~~~~~~~a 391 (392) T protein:vir:13 381 RGAKVLTVTPA 391 (392) T ss_pred cceEEEEeecc Confidence 99999999888 No 61 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=99.90 E-value=6.5e-26 Score=158.68 Aligned_cols=224 Identities=18% Similarity=0.183 Sum_probs=175.5 Q ss_pred CCCcccCceEEeccccC---CcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIG---DAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~ig---da~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) --..-+|.++++|.+.+ .+.+++||+++|..++++++.++++++++..+.+|++...++ +++.+...+++++++++ T Consensus 148 ~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~-~~l~~~v~~~la~a~~~ 226 (395) T protein:vir:43 148 APGTTESNSVEYVRETGFVNNAAPVSEGTQKPYSDLTFELENAPVRTIAHLFKASRQILDDA-SALQSYIDARARYGLML 226 (395) T ss_pred cceecCCCceEEEEEecCCCceeeecCCccccccccceeEEEEeeeeEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHH Confidence 01111355788997633 456899999999999999999999999999999999987655 67888999999999999 Q ss_pred HHHHHHHHHhccc---------c--------cccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhcc Q lcl|Aclame:pro 78 KVDDDLLKAAKTT---------S--------QTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIG 140 (231) Q Consensus 78 ~vd~~~~~~l~t~---------~--------~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~ 140 (231) ++|..++.+-.+. . ...+....++.|.++...+........+|+|||.++..|++..+..++ T Consensus 227 ~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~- 305 (395) T protein:vir:43 227 VEECQLLYGNGTGANLHGIIPQAQAYAPPSGVVVTAEQRIDRIRLAILQAQLAEFPASGIVLNPIDWALIELNKDAENR- 305 (395) T ss_pred HHHHHHHhccCCCCccccccccccccccccccccccchhHHHHHHHHHhhccccCCCcEEEEcHHHHHHHHHhhccCCc- Confidence 9999998743211 1 112234568999999999988777888999999999999887654433 Q ss_pred ccccCceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccch----hhcccEEEEEEEEEEE Q lcl|Aclame:pro 141 SEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDI----VTKTTVITADEHYAAY 216 (231) Q Consensus 141 ~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~----~~~~~~i~~~~~y~~~ 216 (231) +...+ ..+|..++++|+||++|+.||+++.++.++ ..++.++.+.++.++.++.. .+....+++..+|+++ T Consensus 306 -~i~~~-~~~~~~~~l~G~pVv~~~~~~~~~~~~gd~---~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~ 380 (395) T protein:vir:43 306 -YIIGS-PQNGTTPTLWRLPVVETQAITQDEFLTGAF---SLGAQIFDRMDIEVLVSTENDKDFENNMVTIRAEERLAFA 380 (395) T ss_pred -eeccc-cccCCCceecceeeEEcCCCCCCcEEEEec---cceEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccE Confidence 33333 346777899999999999999999765543 34556666778888877643 3556688999999999 Q ss_pred EEcCCcEEEEEeccC Q lcl|Aclame:pro 217 LYDLTKVVNITFTGV 231 (231) Q Consensus 217 ~~~~~~vv~l~~~~~ 231 (231) +.+|++++++++++- T Consensus 381 v~~~~a~~~~~~taa 395 (395) T protein:vir:43 381 VYRPEAFVTGSLTAS 395 (395) T ss_pred EecccceEEEEeccC Confidence 999999999999999 No 62 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=99.90 E-value=6e-26 Score=158.85 Aligned_cols=223 Identities=13% Similarity=0.081 Sum_probs=176.5 Q ss_pred CCCcccCceEEeccccC--CcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIG--DAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANK 78 (231) Q Consensus 1 ~~~~~~G~ti~~P~~ig--da~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~~ 78 (231) ..-.-.|.++++|++.+ .+.+++||+++|..++++++.+++.++.+..+.+|+|...++..++.+...++++++++++ T Consensus 62 ~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~ 141 (324) T protein:vir:97 62 KYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKK 141 (324) T ss_pred ceeeccCCceEEEEEecCcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHH Confidence 11112356789998854 4568999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhccc-------------ccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccccccC Q lcl|Aclame:pro 79 VDDDLLKAAKTT-------------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGA 145 (231) Q Consensus 79 vd~~~~~~l~t~-------------~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~ 145 (231) +|+.++..-.+. .....+.+++++|.++...+...+..+.+++|||..+..|++..+..++ T Consensus 142 ~d~a~l~G~g~~~~~~gi~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~g~------ 215 (324) T protein:vir:97 142 FDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETK------ 215 (324) T ss_pred HHHHhhccCCCCccCccccccccccceeccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhcCCCc------ Confidence 999998653321 1123356789999999999998888888999999999999987654332 Q ss_pred ceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccch----------------hhcccEEEE Q lcl|Aclame:pro 146 NALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDI----------------VTKTTVITA 209 (231) Q Consensus 146 ~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~----------------~~~~~~i~~ 209 (231) ..+..+..++++|+||++++..+.+++..+...+. .+.+...+++++|.+|+. .+....+++ T Consensus 216 ~~~~~~~~~tl~G~PV~~~~~~~~~~~~~~~gd~~--~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~ 293 (324) T protein:vir:97 216 ERIYDRNSDTLDGLPVVNLKSSNLKRGELITGDFD--KLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRA 293 (324) T ss_pred eeecCCCCccccceeeEeecCCCCCcceEEEEecc--cEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEE Confidence 22334566889999999999877666544333232 234666788899888764 345678899 Q ss_pred EEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 210 DEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 210 ~~~y~~~~~~~~~vv~l~~~~~ 231 (231) .++|++++.+|+++++|+.+-- T Consensus 294 ~~r~d~~v~~~~a~~~l~~~~~ 315 (324) T protein:vir:97 294 TMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred EEEeccEEecccceEEEEeccC Confidence 9999999999999999987655 No 63 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=99.90 E-value=5.6e-26 Score=159.02 Aligned_cols=230 Identities=13% Similarity=0.061 Sum_probs=172.3 Q ss_pred CCC---cccCceEEeccccC--CcccccCCCccCccccccceeEEEeehccceeeecHHHHHh---cCCCHHHHHHHHHH Q lcl|Aclame:pro 1 ENG---INLANLCEYPNDIG--DAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALS---GYGDPIGESNKQLG 72 (231) Q Consensus 1 ~~~---~~~G~ti~~P~~ig--da~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~---~~~d~~~~~~~~~a 72 (231) ..+ --.+.++++|.+.+ .+.+++||+++|.+++++++.++..+|.+..+++|+|.+.+ +..++.++..++++ T Consensus 31 ~l~~~~~~~~~~~~ip~~~~~~~a~wv~E~~~~~~s~~~f~~v~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la 110 (303) T protein:vir:97 31 KLSSQKPIPFNGSKEFTFTLDSDIDVVAENGKKTHGGLSLEPVTIVPIKVEYGARLSDEFLYATEEEKIDILKAFNEGFA 110 (303) T ss_pred hhcceeecCCCceEEEEEecCcceEEeecCccccccccceeeEEeeeEEEEEeehhhHHHhhcCccchHHHHHHHHHHHH Confidence 111 12346789998744 57799999999999999999999999999999999998854 45578889999999 Q ss_pred HHHHHHHHHHHHHHhcccc-----------------c---ccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHh Q lcl|Aclame:pro 73 LSLANKVDDDLLKAAKTTS-----------------Q---TVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRK 132 (231) Q Consensus 73 ~~ia~~vd~~~~~~l~t~~-----------------~---~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k 132 (231) ++|++++|+.++.+..+.. . ..+....+++|.++..++...+..+..++|||..+..|++ T Consensus 111 ~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~ 190 (303) T protein:vir:97 111 KKLARGIDLMAMHGINPRTKKASDVIGTNHFDSKVTQVVKFTESEDADANIEAAVNLIQGAEGVVTGLAMDTEFSTALAK 190 (303) T ss_pred HHHHHHHHhhhhcccccCCccccccccccccccccccccccccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHH Confidence 9999999999997632111 0 0123456899999999998878888999999999999998 Q ss_pred hhhhhhccccccCceeeeccceeecceeEEEcCCCccCce-----EEEEEecCCceEEEeecCCccceeccc-------- Q lcl|Aclame:pro 133 DANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSA-----LMFKIVSNSPALKLVLKRGVQVETDRD-------- 199 (231) Q Consensus 133 ~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~-----~~~~~~~~~~A~~~~~k~~v~vE~~Rd-------- 199 (231) .++..+.... ..++-..+..++++|+||++|++||.+.. ..+.+-....++.+..++++++|..+. T Consensus 191 lkd~~g~~~~-~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~ 269 (303) T protein:vir:97 191 VTNGEMGPKM-YPELAWGANPDSINGLKSSVNTTVGAGADEAESKDLVIIGDFESMFKWGYAKQIPMEIIKYGDPDNSGK 269 (303) T ss_pred hhccCCCeEE-ecCccCCCCCceecceeeEEecccCCccccCCCccEEEEeeccccEEEEEecCcEEEEeeccCCCCcch Confidence 7665543322 22233345567999999999999986431 111122224566677778888775432 Q ss_pred --hhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 200 --IVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 200 --~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) -.+....++...+|++++.+|+++++|+.+=| T Consensus 270 ~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~~~~ 303 (303) T protein:vir:97 270 DLKGYNQIYLRAEAYIGWGILDAKSFARVTKGEV 303 (303) T ss_pred hhhhcCcEEEEEEEEeccEeecccceEEeeCCCC Confidence 12345678889999999999999999999999 No 64 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=99.90 E-value=9.2e-26 Score=157.84 Aligned_cols=228 Identities=15% Similarity=0.101 Sum_probs=177.0 Q ss_pred CCCcccCceEEeccc---cCCcccccCCCccCc-cccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND---IGDAADVAEGGEISL-DKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLA 76 (231) Q Consensus 1 ~~~~~~G~ti~~P~~---igda~~v~EG~~i~~-~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia 76 (231) .+-.+..-+..+|++ .+.+++++||++++. +++++++.+++.++.+..+.+|+|.+.++..|+.+...++++++|+ T Consensus 42 ~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~ 121 (293) T protein:vir:48 42 ENVTTLTGSRVYEKWTDITGLANIDDEAGKIADIDDPKLSLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVV 121 (293) T ss_pred eeccCCcceEEEEeecCCCcceeeecCCcccccccccceeEEEEeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHH Confidence 222222335567766 245779999999986 6799999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcccccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccccccCceeeeccceee Q lcl|Aclame:pro 77 NKVDDDLLKAAKTTSQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADV 156 (231) Q Consensus 77 ~~vd~~~~~~l~t~~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~ 156 (231) +++|+.++..+.+... .++..+|++|.+++..+.........++|||..+..|++.++..++ +....-+.+|..+++ T Consensus 122 ~~~~~~i~~g~~~~~~-~~~~~~~d~i~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~--~l~~~~~~~~~~~~l 198 (293) T protein:vir:48 122 VTRNKAILGVVDKLPT-KPTLTKWDDIIDLEAKVDPAIKQTSFFLTNTSGFTALKKVKNALGD--YLMERDVKSPTGYSI 198 (293) T ss_pred HHHHhHHhhccccccc-cccccCHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCc--eEeecCcCCCCCcee Confidence 9999999988765443 3466789999999999987777778899999999999987765443 222223456777899 Q ss_pred cceeEEEcCC--CccCce--EEEEEecCCceEEEeecCCccceeccc----hhhcccEEEEEEEEEEEEEcCCcEEEEEe Q lcl|Aclame:pro 157 LGAQIVRSKK--LAEGSA--LMFKIVSNSPALKLVLKRGVQVETDRD----IVTKTTVITADEHYAAYLYDLTKVVNITF 228 (231) Q Consensus 157 ~G~~Vv~s~~--~~~~~~--~~~~~~~~~~A~~~~~k~~v~vE~~Rd----~~~~~~~i~~~~~y~~~~~~~~~vv~l~~ 228 (231) +|.||+++++ +|.... ..+.+..-..++.++.+++++++.++. -.+....++...+|++++.+|++++++++ T Consensus 199 ~G~Pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~ 278 (293) T protein:vir:48 199 AGFAVKEISDRWLPNASSGVMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASF 278 (293) T ss_pred cceeeEEecccccCCccCCceEEEEEeccceEEEEEecceEEEEecccchhhhcCeEEEEEEEeeCcEEecccceEEEEe Confidence 9999987553 443221 111222224567777788888887764 35667789999999999999999999999 Q ss_pred ccC Q lcl|Aclame:pro 229 TGV 231 (231) Q Consensus 229 ~~~ 231 (231) +++ T Consensus 279 ~~~ 281 (293) T protein:vir:48 279 KAI 281 (293) T ss_pred ecc Confidence 998 No 65 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=99.90 E-value=9.5e-26 Score=157.77 Aligned_cols=223 Identities=13% Similarity=0.081 Sum_probs=175.3 Q ss_pred CCCcccCceEEeccccC--CcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIG--DAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANK 78 (231) Q Consensus 1 ~~~~~~G~ti~~P~~ig--da~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~~ 78 (231) -.-.-.|.++++|.+.+ .+.+++||+++|..++++++.++..++++..+.+|+|...++..++.+...++++++++++ T Consensus 62 ~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~ 141 (324) T protein:vir:10 62 KYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKK 141 (324) T ss_pred ceeeccCCceEEEEEeCCcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHH Confidence 11112355789998854 5678999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhccc-------------ccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccccccC Q lcl|Aclame:pro 79 VDDDLLKAAKTT-------------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGA 145 (231) Q Consensus 79 vd~~~~~~l~t~-------------~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~ 145 (231) +|+.++....+. .....+.+++++|.++...+........+++|||..+..|++..+..++ T Consensus 142 ~d~a~l~G~g~~~~~~~i~~~~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~g~------ 215 (324) T protein:vir:10 142 FDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETK------ 215 (324) T ss_pred HHHHhhhcCCCCccCccccccccccceeccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCc------ Confidence 999998653221 1123356789999999999988877888999999999999987654332 Q ss_pred ceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccch----------------hhcccEEEE Q lcl|Aclame:pro 146 NALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDI----------------VTKTTVITA 209 (231) Q Consensus 146 ~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~----------------~~~~~~i~~ 209 (231) ..+..+..++++|+||+.++.++.+++..+...+. . +.+...+++.+|..++. .+....++. T Consensus 216 ~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~-~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~ 293 (324) T protein:vir:10 216 ERIYDRNSDTLDGLPVVNLKSSNLKRGELITGDFD-K-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRA 293 (324) T ss_pred eeecCCCCccccceeEEeecCCCCCcceEEEEecc-c-EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEE Confidence 22334556789999999999877766554433332 2 34566678888877764 345678899 Q ss_pred EEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 210 DEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 210 ~~~y~~~~~~~~~vv~l~~~~~ 231 (231) .+||++.+.+|+++++|+.+.- T Consensus 294 ~~r~d~~v~~~~A~~~l~~a~~ 315 (324) T protein:vir:10 294 TMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred EEEEccEEecccceEEEEeccC Confidence 9999999999999999987655 No 66 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=99.90 E-value=6.1e-26 Score=158.83 Aligned_cols=228 Identities=14% Similarity=0.047 Sum_probs=178.7 Q ss_pred CCCcccCceEEeccccC--CcccccCCCccCc-cccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIG--DAADVAEGGEISL-DKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~ig--da~~v~EG~~i~~-~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) ..-.++.-++.+|++.+ .+.+++||++++. +..++++.++.+++++..+.||++...++..|+.+...+++++++++ T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~ 237 (415) T protein:vir:94 158 KRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAA 237 (415) T ss_pred eeccCCceeEEEEeecCCccceeccccccccccccccceeeEeeheeeeeechhhHHHHhhchHHHHHHHHHHHHHHHHH Confidence 22223334666777754 4568999999985 46799999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcccc--------------cccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccccc Q lcl|Aclame:pro 78 KVDDDLLKAAKTTS--------------QTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEV 143 (231) Q Consensus 78 ~vd~~~~~~l~t~~--------------~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~ 143 (231) ++|..++....+.. ...++..+|++|.+++..+.........++|||..+..|++.++..+. +. T Consensus 238 ~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~--~l 315 (415) T protein:vir:94 238 TRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGN--YL 315 (415) T ss_pred HHHHHHhhccccCccccccccccccccccccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCC--ee Confidence 99999998764321 223456789999999999988888888999999999999987654443 33 Q ss_pred cCceeeeccceeecceeEEEcCCCccCceEE--EEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCC Q lcl|Aclame:pro 144 GANALINGTYADVLGAQIVRSKKLAEGSALM--FKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLT 221 (231) Q Consensus 144 ~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~--~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~ 221 (231) ...-+.+|..++++|+||++++.+|.+..-. +.+..-..++.++.+.++.++..+.. ...+.+++..++++++.+|+ T Consensus 316 ~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~~~-~~~~~~r~~~r~d~~~~~~~ 394 (415) T protein:vir:94 316 IQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM-HFGECLMIAVRQDCRILDYK 394 (415) T ss_pred eccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEeccc-cCceEEEEEEEeccEEeccc Confidence 3333456777899999999999998655311 11111244566777788888877653 45677899999999999999 Q ss_pred cEEEEEeccC Q lcl|Aclame:pro 222 KVVNITFTGV 231 (231) Q Consensus 222 ~vv~l~~~~~ 231 (231) ++++++++.+ T Consensus 395 a~~~~~~~~~ 404 (415) T protein:vir:94 395 SAIVIEYDDS 404 (415) T ss_pred cEEEEEEecc Confidence 9999999999 No 67 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=99.90 E-value=1.4e-25 Score=156.90 Aligned_cols=223 Identities=13% Similarity=0.082 Sum_probs=173.2 Q ss_pred CCCcccCceEEeccccC--CcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIG--DAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANK 78 (231) Q Consensus 1 ~~~~~~G~ti~~P~~ig--da~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~~ 78 (231) -.-.-.|.++++|.+.+ .+.+++||+++|..+++++++++++++++..+.||+|...++..++.+...++++++++++ T Consensus 62 ~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~ 141 (324) T protein:vir:96 62 KYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKK 141 (324) T ss_pred ceeeccCCceEEEEEecCcceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHH Confidence 11112355789998754 4568999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhccc-------------ccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccccccC Q lcl|Aclame:pro 79 VDDDLLKAAKTT-------------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGA 145 (231) Q Consensus 79 vd~~~~~~l~t~-------------~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~ 145 (231) +|+.++.+..+. .....+.+++++|.++..++.+.+.....++|||..+..|++..+..+. T Consensus 142 ~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~i~~~~~~~~~~i~n~~~~~~L~~lkd~~G~------ 215 (324) T protein:vir:96 142 FDEAGILNQGNNPFGKSIAQSIKKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETK------ 215 (324) T ss_pred HHHHhhhcCCCCCcCccccccccccceecccccchHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhCCCCC------ Confidence 999988643211 1123355789999999999988888888999999999999987554322 Q ss_pred ceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccch----------------hhcccEEEE Q lcl|Aclame:pro 146 NALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDI----------------VTKTTVITA 209 (231) Q Consensus 146 ~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~----------------~~~~~~i~~ 209 (231) -.+..+..++++|+||++++..+.+++..+..... -+.+...+++.+|..|+. .+....+++ T Consensus 216 ~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~s--~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~n~v~~r~ 293 (324) T protein:vir:96 216 ERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFD--KLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRA 293 (324) T ss_pred eeecCCCCCcccceeeEeecCCCCCcceEEEEecc--eEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEE Confidence 22345667899999999988776665433322222 234566678888888764 344678899 Q ss_pred EEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 210 DEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 210 ~~~y~~~~~~~~~vv~l~~~~~ 231 (231) ..||++++.+|+++++|+.+-. T Consensus 294 ~~r~d~~v~~~~a~~~l~~a~~ 315 (324) T protein:vir:96 294 TMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred EEEeccEEecccceEEEecccc Confidence 9999999999999999985444 No 68 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=99.90 E-value=1.4e-25 Score=156.86 Aligned_cols=229 Identities=10% Similarity=0.025 Sum_probs=170.2 Q ss_pred CCCcccCceEEeccccCC--cccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGD--AADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANK 78 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igd--a~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~~ 78 (231) ---...+..+++|.+.+. +.+++||++++..++++++.+++++|.+..+.+|+|...++..|+.+...++++++++++ T Consensus 44 ~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~ 123 (330) T protein:vir:77 44 RKVPMGPTGISIPHWTGAVSASWTGEAERKPITKGSFGKQELEPVKITTIFAESAEVVRLNPLNYLNTMRTKIAEAIALK 123 (330) T ss_pred ceeeccCCceEEEEEcCCcceeEecCCCccccccceeeEEEEeEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHH Confidence 111123556889988554 557999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhccc----------------------ccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhh Q lcl|Aclame:pro 79 VDDDLLKAAKTT----------------------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANA 136 (231) Q Consensus 79 vd~~~~~~l~t~----------------------~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~ 136 (231) +|+.++.+-.+. +........++++.+++..+...+.....++|||.++..|++.++. T Consensus 124 ~~~~~l~G~g~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~ 203 (330) T protein:vir:77 124 FDAAAIHGIDKPSAFKGYLAETTKVVSLADTNLTTASGPQGNAYLAVNNALSLLVNSGKKWTGTLLDNVTEPILNTAVDG 203 (330) T ss_pred HHHHhhcccCCCCccccccccccccceeecccccccccccchhHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHHhcc Confidence 999998642210 0111223458899999999988888888999999999999987665 Q ss_pred hhcccccc---CceeeeccceeecceeEEEcCCCccCce----EEEEEecCCceEEEeecCCccceeccch--------- Q lcl|Aclame:pro 137 KNIGSEVG---ANALINGTYADVLGAQIVRSKKLAEGSA----LMFKIVSNSPALKLVLKRGVQVETDRDI--------- 200 (231) Q Consensus 137 ~~~~~~~~---~~~~~~G~ig~~~G~~Vv~s~~~~~~~~----~~~~~~~~~~A~~~~~k~~v~vE~~Rd~--------- 200 (231) .++.-... .+....+.-++++|+||++++.+|++.. ..+...+. .+.+...+++.++..++. T Consensus 204 ~G~~l~~~~~~~~~~~~~~~~~l~G~PV~~~~~~p~~~~~~~~~~~~gd~s--~~~i~~~~~~~i~~~~e~~~~~~~~~~ 281 (330) T protein:vir:77 204 NGRPLFVESTYTEQVGAIREGRILGRPTYVADNVVNGTVGNRVVGVMGDFS--QVIWGQIGGLSFDVTDQATLDFGEEQG 281 (330) T ss_pred CCceeecCccccccccccCCceecceeeEEeccccCCCCCCccEEEEEecc--eEEEEEecCcEEEEeecceeeeccccc Confidence 44321111 1111222346899999999999997542 11111122 234566667777655542 Q ss_pred -----------hhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 201 -----------VTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 201 -----------~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) .+....++...|+++.+.+|+++++|+.++- T Consensus 282 ~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~~ 323 (330) T protein:vir:77 282 GVWVPKLISLWQHNMVAVRCEAEFAFMVNDKDAFVKLTDQVA 323 (330) T ss_pred ccccccccchhhcCcEEEEEEEEeccEEecccceEEEEeccC Confidence 3456788999999999999999999998888 No 69 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=99.90 E-value=9.6e-26 Score=157.74 Aligned_cols=227 Identities=16% Similarity=0.087 Sum_probs=176.6 Q ss_pred CCC-cccCceEEeccc---cCCcccccCCCccCc-cccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENG-INLANLCEYPND---IGDAADVAEGGEISL-DKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSL 75 (231) Q Consensus 1 ~~~-~~~G~ti~~P~~---igda~~v~EG~~i~~-~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~i 75 (231) .+- ...|+ +.+|++ .+.+.+++||++++. +++++++.++++++.+..+.+|++...++..|+.+...+++++++ T Consensus 146 ~~~~~~~~~-~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~ 224 (397) T protein:vir:49 146 ENVTTLTGS-RVYEKWTDITGLANIDDEAGKIADVDDPKLSLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKV 224 (397) T ss_pred eecccCccc-eEEEeeccCCcceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHhhhHHHHHHHHHHHHHHHH Confidence 222 22343 446655 244678999999985 679999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhcccccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccccccCceeeecccee Q lcl|Aclame:pro 76 ANKVDDDLLKAAKTTSQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYAD 155 (231) Q Consensus 76 a~~vd~~~~~~l~t~~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~ 155 (231) ++++|..++++..+... ..+.+++|+|.++...+.........|+|||.++..|++.++..++ +....-+.+|.-++ T Consensus 225 ~~~~d~ai~~G~g~~~~-~~~~~~~d~i~~~~~~l~~~~~~~a~~vmn~~~~~~l~~lkd~~G~--~l~~~~~~~~~~~~ 301 (397) T protein:vir:49 225 VVTRNKAILEAIAALPT-KPTLTKWDDIIDLEAKVDPAIKQTSFFLTNTSGFTALKKVKNALGD--YLMERDVKSPTGYS 301 (397) T ss_pred HHHHHHHHHhhcccccc-ccccccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhcCCCc--eeeccCcCCCCCce Confidence 99999999988765443 3456789999999999988777888999999999999988665443 22222245677789 Q ss_pred ecceeEEEcC--CCccCceE--EEEEecCCceEEEeecCCccceeccc----hhhcccEEEEEEEEEEEEEcCCcEEEEE Q lcl|Aclame:pro 156 VLGAQIVRSK--KLAEGSAL--MFKIVSNSPALKLVLKRGVQVETDRD----IVTKTTVITADEHYAAYLYDLTKVVNIT 227 (231) Q Consensus 156 ~~G~~Vv~s~--~~~~~~~~--~~~~~~~~~A~~~~~k~~v~vE~~Rd----~~~~~~~i~~~~~y~~~~~~~~~vv~l~ 227 (231) ++|+||++++ .+|.+..- .+.+..-..++.++.++++.++.++. -.+....+++..++++++++|+++++++ T Consensus 302 l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~ 381 (397) T protein:vir:49 302 IDGFAVKEVADRWLANGTGGAMPLYFGDLKQAVTLFDRQHMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVPAS 381 (397) T ss_pred ecceeeEEecccccccccCCceeEEEeeccceEEEEeecceEEEEeccccchhhcCceeEEEEeeeCcEEecccceEEEE Confidence 9999998754 35544321 11222224567778888999987764 4456678999999999999999999999 Q ss_pred eccC Q lcl|Aclame:pro 228 FTGV 231 (231) Q Consensus 228 ~~~~ 231 (231) ++++ T Consensus 382 ~~~~ 385 (397) T protein:vir:49 382 FKAI 385 (397) T ss_pred eecc Confidence 9999 No 70 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=99.90 E-value=1.1e-25 Score=157.44 Aligned_cols=228 Identities=14% Similarity=0.038 Sum_probs=177.2 Q ss_pred CCCcccCceEEeccccC--CcccccCCCccCcc-ccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIG--DAADVAEGGEISLD-KIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~ig--da~~v~EG~~i~~~-~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) .+-.++.-++.+|++.+ .+.+++||++++.. ..++++.++.+++++..+.||++...++..|+.+...+++++++++ T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~ 237 (415) T protein:vir:81 158 KRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAA 237 (415) T ss_pred eeccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHH Confidence 22222223566677755 45689999999864 5799999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhccc--------------ccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccccc Q lcl|Aclame:pro 78 KVDDDLLKAAKTT--------------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEV 143 (231) Q Consensus 78 ~vd~~~~~~l~t~--------------~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~ 143 (231) ++|..++.+..+. ..+.++..+|++|.+++..+.+.......++|||.++..|++.++..++ +. T Consensus 238 ~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~--~l 315 (415) T protein:vir:81 238 TRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGN--YL 315 (415) T ss_pred HHHHHHhhccccCccccccccccccccccccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCc--ee Confidence 9999999876432 1223456789999999999988888888999999999999987654433 33 Q ss_pred cCceeeeccceeecceeEEEcCCCccCceEE--EEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCC Q lcl|Aclame:pro 144 GANALINGTYADVLGAQIVRSKKLAEGSALM--FKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLT 221 (231) Q Consensus 144 ~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~--~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~ 221 (231) ...-+.+|..++++|+||++++++|.+.+-. +.+..-..++.++.+.++.++..++. ...+.+++..++++++.+|+ T Consensus 316 ~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~~-~~~~~~~~~~r~d~~v~~~~ 394 (415) T protein:vir:81 316 IQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM-HFGECLMIAVRQDCRILDYK 394 (415) T ss_pred eccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEeccc-cCceEEEEEEEeccEEeccc Confidence 2223456777899999999999998654311 11111244566777888899877654 34567889999999999999 Q ss_pred cEEEEEeccC Q lcl|Aclame:pro 222 KVVNITFTGV 231 (231) Q Consensus 222 ~vv~l~~~~~ 231 (231) ++++++++.+ T Consensus 395 a~~~~~~~~~ 404 (415) T protein:vir:81 395 SAIVIEYDDS 404 (415) T ss_pred cEEEEEEecc Confidence 9999999999 No 71 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=99.90 E-value=1.1e-25 Score=157.44 Aligned_cols=228 Identities=14% Similarity=0.038 Sum_probs=177.2 Q ss_pred CCCcccCceEEeccccC--CcccccCCCccCcc-ccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIG--DAADVAEGGEISLD-KIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~ig--da~~v~EG~~i~~~-~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) .+-.++.-++.+|++.+ .+.+++||++++.. ..++++.++.+++++..+.||++...++..|+.+...+++++++++ T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~ 237 (415) T protein:vir:79 158 KRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAA 237 (415) T ss_pred eeccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHH Confidence 22222223566677755 45689999999864 5799999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhccc--------------ccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccccc Q lcl|Aclame:pro 78 KVDDDLLKAAKTT--------------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEV 143 (231) Q Consensus 78 ~vd~~~~~~l~t~--------------~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~ 143 (231) ++|..++.+..+. ..+.++..+|++|.+++..+.+.......++|||.++..|++.++..++ +. T Consensus 238 ~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~--~l 315 (415) T protein:vir:79 238 TRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGN--YL 315 (415) T ss_pred HHHHHHhhccccCccccccccccccccccccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCc--ee Confidence 9999999876432 1223456789999999999988888888999999999999987654433 33 Q ss_pred cCceeeeccceeecceeEEEcCCCccCceEE--EEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCC Q lcl|Aclame:pro 144 GANALINGTYADVLGAQIVRSKKLAEGSALM--FKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLT 221 (231) Q Consensus 144 ~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~--~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~ 221 (231) ...-+.+|..++++|+||++++++|.+.+-. +.+..-..++.++.+.++.++..++. ...+.+++..++++++.+|+ T Consensus 316 ~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~~-~~~~~~~~~~r~d~~v~~~~ 394 (415) T protein:vir:79 316 IQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM-HFGECLMIAVRQDCRILDYK 394 (415) T ss_pred eccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEeccc-cCceEEEEEEEeccEEeccc Confidence 2223456777899999999999998654311 11111244566777888899877654 34567889999999999999 Q ss_pred cEEEEEeccC Q lcl|Aclame:pro 222 KVVNITFTGV 231 (231) Q Consensus 222 ~vv~l~~~~~ 231 (231) ++++++++.+ T Consensus 395 a~~~~~~~~~ 404 (415) T protein:vir:79 395 SAIVIEYDDS 404 (415) T ss_pred cEEEEEEecc Confidence 9999999999 No 72 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=99.90 E-value=1.1e-25 Score=157.44 Aligned_cols=228 Identities=14% Similarity=0.038 Sum_probs=177.2 Q ss_pred CCCcccCceEEeccccC--CcccccCCCccCcc-ccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIG--DAADVAEGGEISLD-KIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~ig--da~~v~EG~~i~~~-~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) .+-.++.-++.+|++.+ .+.+++||++++.. ..++++.++.+++++..+.||++...++..|+.+...+++++++++ T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~ 237 (415) T protein:vir:98 158 KRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAA 237 (415) T ss_pred eeccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHH Confidence 22222223566677755 45689999999864 5799999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhccc--------------ccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccccc Q lcl|Aclame:pro 78 KVDDDLLKAAKTT--------------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEV 143 (231) Q Consensus 78 ~vd~~~~~~l~t~--------------~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~ 143 (231) ++|..++.+..+. ..+.++..+|++|.+++..+.+.......++|||.++..|++.++..++ +. T Consensus 238 ~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~--~l 315 (415) T protein:vir:98 238 TRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGN--YL 315 (415) T ss_pred HHHHHHhhccccCccccccccccccccccccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCc--ee Confidence 9999999876432 1223456789999999999988888888999999999999987654433 33 Q ss_pred cCceeeeccceeecceeEEEcCCCccCceEE--EEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCC Q lcl|Aclame:pro 144 GANALINGTYADVLGAQIVRSKKLAEGSALM--FKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLT 221 (231) Q Consensus 144 ~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~--~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~ 221 (231) ...-+.+|..++++|+||++++++|.+.+-. +.+..-..++.++.+.++.++..++. ...+.+++..++++++.+|+ T Consensus 316 ~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~~-~~~~~~~~~~r~d~~v~~~~ 394 (415) T protein:vir:98 316 IQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM-HFGECLMIAVRQDCRILDYK 394 (415) T ss_pred eccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEeccc-cCceEEEEEEEeccEEeccc Confidence 2223456777899999999999998654311 11111244566777888899877654 34567889999999999999 Q ss_pred cEEEEEeccC Q lcl|Aclame:pro 222 KVVNITFTGV 231 (231) Q Consensus 222 ~vv~l~~~~~ 231 (231) ++++++++.+ T Consensus 395 a~~~~~~~~~ 404 (415) T protein:vir:98 395 SAIVIEYDDS 404 (415) T ss_pred cEEEEEEecc Confidence 9999999999 No 73 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=99.90 E-value=8.1e-27 Score=163.63 Aligned_cols=230 Identities=14% Similarity=0.050 Sum_probs=183.3 Q ss_pred CCCcccCceEEeccccCC--cccccCCCccCccccccceeEEEeehccce-eeecHHHHHhcCCC-HHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGD--AADVAEGGEISLDKIGTTTKSVTIKKAAKG-TEITDEAALSGYGD-PIGESNKQLGLSLA 76 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igd--a~~v~EG~~i~~~~lt~~~~~~tikk~g~~-~~itD~~~~~~~~d-~~~~~~~~~a~~ia 76 (231) .--+.+|++++||. +|. +..+..|+++..+.+..++.+++|+..-.+ +.|.|.+..++..| +-.+..+++|++|| T Consensus 46 vrti~~GkS~qf~~-iG~~~a~y~~~G~~ldg~~~~~~k~~ItID~lL~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA 124 (402) T protein:vir:97 46 VQTVTGTNTVSNKY-LGETELQVLAPGQSPNATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLK 124 (402) T ss_pred eeeecccceEEEEE-EeeeEEeeeccccccCCCCcccccEEEEeCceeechhhhhhHHHHHhcccchhHHHHHHHHHHHH Confidence 44578999999975 565 557999999999999999999999997775 88999999999999 78899999999999 Q ss_pred HHHHHHHHHHhcccc-------------------ccccc-----ccC----HHHHHHHHHHhhccC--CCceEEEECHHH Q lcl|Aclame:pro 77 NKVDDDLLKAAKTTS-------------------QTVST-----KAN----VDGVQAALDIFNDED--AQAYVLIVNPKD 126 (231) Q Consensus 77 ~~vd~~~~~~l~t~~-------------------~~~~~-----~~~----~d~i~da~~~l~~~~--~~~~v~vv~p~~ 126 (231) +..|+.++..+..+. ..++. .++ ++.|.+|.+.|.+.+ ...++++++|++ T Consensus 125 ~~~Dq~ii~~i~~aa~a~t~~~~~~~~~~~~g~s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~dRv~vv~P~~ 204 (402) T protein:vir:97 125 RLEDQMAIQQMLLGGIANTKAERNKPRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKF 204 (402) T ss_pred HHHHHHHHHHHHHhhccccccccccCcccccccccccccccchhhcCHHHHHHHHHHHHHHHHhcCCCccccEEEeChHH Confidence 999998875432100 00000 122 466668888887655 467899999999 Q ss_pred HHHHHhhhhhhhcc-ccccCceeeeccceeecceeEEEcCCCccC---------------ceEE--------EEEecCCc Q lcl|Aclame:pro 127 AAKIRKDANAKNIG-SEVGANALINGTYADVLGAQIVRSKKLAEG---------------SALM--------FKIVSNSP 182 (231) Q Consensus 127 ~~~L~k~~~~~~~~-~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~---------------~~~~--------~~~~~~~~ 182 (231) |+.|++++++.++. ...+.+...+|.++.+.|++|+.||++|.+ ..|. .-+++.+. T Consensus 205 y~~Ll~~~rl~n~d~~~~~~g~~~~G~v~~v~Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~ 284 (402) T protein:vir:97 205 FNALRDADRIVDKTYTISQSGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSD 284 (402) T ss_pred HHHHhhcccccchhhccccCCccccceeEEEeceEEEecCccccccccccccccccCCCCccCCcCcccceeEEEEEecc Confidence 99999998877653 224556678999999999999999999852 1111 22345678 Q ss_pred eEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEecc-----C Q lcl|Aclame:pro 183 ALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTG-----V 231 (231) Q Consensus 183 A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~-----~ 231 (231) |++.+.-.+++.|.+||++++++.|...+.||..+.+|+++.+++.+= + T Consensus 285 Av~tvk~~~vT~~~~~d~r~~~~~id~~~a~G~g~~RPeaa~vv~~~~~~t~~~ 338 (402) T protein:vir:97 285 ALLVGRTIEVTGDIFYEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRDATTGD 338 (402) T ss_pred eEEEEEeeccccchhhchhHHHHHHHHHHHhCCcccCccceEEEEEeccccccc Confidence 999999899999999999999999999999999999999998886543 2 No 74 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=99.90 E-value=1.5e-25 Score=156.70 Aligned_cols=228 Identities=13% Similarity=0.042 Sum_probs=170.9 Q ss_pred CCCcccCceEEeccccCC--cccccCCCccCccccccceeEEEeehccceeeecHHHHHh---cCCCHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGD--AADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALS---GYGDPIGESNKQLGLSL 75 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igd--a~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~---~~~d~~~~~~~~~a~~i 75 (231) ---...+..+++|.+.++ +.+++||+++|..++++++.+++.+|.+..+.||+|.+.+ +..++.++..+++++++ T Consensus 35 ~~~~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~s~~~f~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~ai 114 (300) T protein:vir:95 35 PQKPIPFNGQREFVFDFDSDIDIVAENGKKTHGGVSLDPVTIVPLKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKL 114 (300) T ss_pred ceeeccCCceEEEEEecCcceEEeeCCcccccccccceeeEeeeEEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHH Confidence 111123456788887654 5689999999999999999999999999999999998854 45788899999999999 Q ss_pred HHHHHHHHHHHhcccc-------------------cccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhh Q lcl|Aclame:pro 76 ANKVDDDLLKAAKTTS-------------------QTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANA 136 (231) Q Consensus 76 a~~vd~~~~~~l~t~~-------------------~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~ 136 (231) ++++|..++.+....+ ...+....+++|.++..++...+..+.+++|||..+.+|++.++. T Consensus 115 a~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~ 194 (300) T protein:vir:95 115 ARGLDIMSIHGINPRTKQASTIIGDNCFDKKVTQTVPFKDTNPDESMEDAVGMIDGSERDITGAILDPIFTTALSKMKNA 194 (300) T ss_pred HHHHHHhhhhcccCCCCCCcccccccccccccceeecccccchHHHHHHHHHHhhhcCCCccEEEECHHHHHHHHHhhcc Confidence 9999999997632110 012245678999999999988888888999999999999998766 Q ss_pred hhccccccCceeeeccceeecceeEEEcCCCccCce----EEEEEecCCceEEEeecCCccceeccc----------hhh Q lcl|Aclame:pro 137 KNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSA----LMFKIVSNSPALKLVLKRGVQVETDRD----------IVT 202 (231) Q Consensus 137 ~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~----~~~~~~~~~~A~~~~~k~~v~vE~~Rd----------~~~ 202 (231) .++.- ..+...+|..++++|+||++|+.+|.+.. ..+.-.+ ..++.+..+++++++.... -.+ T Consensus 195 ~G~~i--~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~GDf-~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~ 271 (300) T protein:vir:95 195 EGGKL--YPELAWGGVPDAINGLAVDKNRTVSYSQTDPKNTAIVGDF-ETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGY 271 (300) T ss_pred CCCee--ccCccccCCCceecceeeEEecCCCCCCCCCccEEEEeec-cceEEEEEecccEEEEeeccCCCCcchhhhhc Confidence 55432 23444556788999999999999986531 1221112 3344455566766654432 223 Q ss_pred cccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 203 KTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 203 ~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) ....+++.+++++.+.+|+++++|+.+|= T Consensus 272 ~~v~~r~~~r~d~~v~~~~a~~~l~~~~g 300 (300) T protein:vir:95 272 NQIYIRCEAYIGWGIMDAASFARIVKTGG 300 (300) T ss_pred CcEEEEEEEeecceeecccceEEEecCCC Confidence 45678889999999999999999988777 No 75 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=99.90 E-value=1.6e-25 Score=156.51 Aligned_cols=228 Identities=14% Similarity=0.046 Sum_probs=176.2 Q ss_pred CCCcccCceEEeccccC--CcccccCCCccCc-cccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIG--DAADVAEGGEISL-DKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~ig--da~~v~EG~~i~~-~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) .+..++--++.+|++.+ .+.+++||++++. +..++++.++++++++..+.||++...++..|+.+...++++++|++ T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~ 237 (415) T protein:vir:46 158 KRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAA 237 (415) T ss_pred eeccCCceeEEEEEecCCcceeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHH Confidence 22222223444555544 4568999999986 56899999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhccc--------------ccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccccc Q lcl|Aclame:pro 78 KVDDDLLKAAKTT--------------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEV 143 (231) Q Consensus 78 ~vd~~~~~~l~t~--------------~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~ 143 (231) ++|..++.+..+. ....++..++++|.+++..+.+....+..++|||..+..|++..+..+. +. T Consensus 238 ~~d~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~--~i 315 (415) T protein:vir:46 238 TRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGN--YL 315 (415) T ss_pred HHHHHHhhccccCCccccccccccccceeccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCC--ee Confidence 9999999876432 1223456789999999999988888888999999999999887654433 33 Q ss_pred cCceeeeccceeecceeEEEcCCCccCceE--EEEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCC Q lcl|Aclame:pro 144 GANALINGTYADVLGAQIVRSKKLAEGSAL--MFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLT 221 (231) Q Consensus 144 ~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~--~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~ 221 (231) ...-+.+|..++++|+||++++++|.+..- .+.+..-..++.++.+.++.++..+.. ...+.+++..++++++++|+ T Consensus 316 ~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~-~~~~~~~~~~r~d~~v~~~~ 394 (415) T protein:vir:46 316 IQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM-HFGECLMIAVRQDCRILDYK 394 (415) T ss_pred eccCcCCCCCccccceeeEEeccccccCCCccEEEEEehhccEEEEeecceEEEeeccc-cCceEEEEEEEeccEEeccc Confidence 322345677789999999999999865421 111222244566777788888876543 44567899999999999999 Q ss_pred cEEEEEeccC Q lcl|Aclame:pro 222 KVVNITFTGV 231 (231) Q Consensus 222 ~vv~l~~~~~ 231 (231) +++.++++.. T Consensus 395 a~~~~~~~~~ 404 (415) T protein:vir:46 395 SAIVIEYDDS 404 (415) T ss_pred cEEEEEeecc Confidence 9999999999 No 76 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=99.90 E-value=1.6e-25 Score=156.51 Aligned_cols=228 Identities=14% Similarity=0.046 Sum_probs=176.2 Q ss_pred CCCcccCceEEeccccC--CcccccCCCccCc-cccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIG--DAADVAEGGEISL-DKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~ig--da~~v~EG~~i~~-~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) .+..++--++.+|++.+ .+.+++||++++. +..++++.++++++++..+.||++...++..|+.+...++++++|++ T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~ 237 (415) T protein:vir:47 158 KRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAA 237 (415) T ss_pred eeccCCceeEEEEEecCCcceeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHH Confidence 22222223444555544 4568999999986 56899999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhccc--------------ccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccccc Q lcl|Aclame:pro 78 KVDDDLLKAAKTT--------------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEV 143 (231) Q Consensus 78 ~vd~~~~~~l~t~--------------~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~ 143 (231) ++|..++.+..+. ....++..++++|.+++..+.+....+..++|||..+..|++..+..+. +. T Consensus 238 ~~d~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~--~i 315 (415) T protein:vir:47 238 TRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGN--YL 315 (415) T ss_pred HHHHHHhhccccCCccccccccccccceeccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCC--ee Confidence 9999999876432 1223456789999999999988888888999999999999887654433 33 Q ss_pred cCceeeeccceeecceeEEEcCCCccCceE--EEEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCC Q lcl|Aclame:pro 144 GANALINGTYADVLGAQIVRSKKLAEGSAL--MFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLT 221 (231) Q Consensus 144 ~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~--~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~ 221 (231) ...-+.+|..++++|+||++++++|.+..- .+.+..-..++.++.+.++.++..+.. ...+.+++..++++++++|+ T Consensus 316 ~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~-~~~~~~~~~~r~d~~v~~~~ 394 (415) T protein:vir:47 316 IQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM-HFGECLMIAVRQDCRILDYK 394 (415) T ss_pred eccCcCCCCCccccceeeEEeccccccCCCccEEEEEehhccEEEEeecceEEEeeccc-cCceEEEEEEEeccEEeccc Confidence 322345677789999999999999865421 111222244566777788888876543 44567899999999999999 Q ss_pred cEEEEEeccC Q lcl|Aclame:pro 222 KVVNITFTGV 231 (231) Q Consensus 222 ~vv~l~~~~~ 231 (231) +++.++++.. T Consensus 395 a~~~~~~~~~ 404 (415) T protein:vir:47 395 SAIVIEYDDS 404 (415) T ss_pred cEEEEEeecc Confidence 9999999999 No 77 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=99.89 E-value=3.4e-25 Score=154.75 Aligned_cols=228 Identities=16% Similarity=0.099 Sum_probs=176.7 Q ss_pred CCCcccCceEEeccc---cCCcccccCCCccCccc-cccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND---IGDAADVAEGGEISLDK-IGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLA 76 (231) Q Consensus 1 ~~~~~~G~ti~~P~~---igda~~v~EG~~i~~~~-lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia 76 (231) ..-..+.-++.+|++ .+.+.+++||++++..+ .++++.++++++++..+.+|++...++..|+.+...++++++++ T Consensus 146 ~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~ 225 (397) T protein:vir:49 146 ENVTTLTGSRVYEKWADITGLAKLDDEGGQIGQNDDPKLSLIRYAIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVV 225 (397) T ss_pred eeccCCcceEEEEeeccCCcceeeeccccccccccccceeeeEeeeeeeEeehhhHHHHHhhhhHHHHHHHHHHHHHHHH Confidence 221122223556665 34577899999998654 69999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcccccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccccccCceeeeccceee Q lcl|Aclame:pro 77 NKVDDDLLKAAKTTSQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADV 156 (231) Q Consensus 77 ~~vd~~~~~~l~t~~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~ 156 (231) +++|..++.+..+.. ..++.++||+|.++...+.........++|||..+..|++.++..+. +....-+.+|.-+++ T Consensus 226 ~~~d~ail~G~g~~~-~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~g~--~l~~~~~~~g~~~~l 302 (397) T protein:vir:49 226 VTRNKAILEAIGTLP-NKPTLAKWDDIIDLQAKVDPAIKQTSLFLTNTSGFTALKKVKNAMGD--YLMERDVKSPTGYSI 302 (397) T ss_pred HHHHHHHHhcccccc-ccccccCHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCc--eeecccccCCCCcee Confidence 999999998766543 34566789999999999988778889999999999999988655443 222222345667899 Q ss_pred cceeEEEcC--CCccCceEE--EEEecCCceEEEeecCCccceeccch----hhcccEEEEEEEEEEEEEcCCcEEEEEe Q lcl|Aclame:pro 157 LGAQIVRSK--KLAEGSALM--FKIVSNSPALKLVLKRGVQVETDRDI----VTKTTVITADEHYAAYLYDLTKVVNITF 228 (231) Q Consensus 157 ~G~~Vv~s~--~~~~~~~~~--~~~~~~~~A~~~~~k~~v~vE~~Rd~----~~~~~~i~~~~~y~~~~~~~~~vv~l~~ 228 (231) +|+||+++. .+|.+.+-. +.+..-..++.++.++++.++.++.. .+....+++..++++.+.+|++++++++ T Consensus 303 ~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~ 382 (397) T protein:vir:49 303 DGFVVKEISDRFLPNGTGGAMPLYFGDLKQAVTLFDRQHLSLLSTNIGGGAFETDTTKVRVIDRFDVVSTDTEAFVPASF 382 (397) T ss_pred cceeeEEecccccccccCCceeEEEeeccceEEEEeecccEEEEeccccchhhcCeeeEEEEEeeccEEecccceEEEEe Confidence 999998854 455443321 22222245677888899999988754 4566789999999999999999999999 Q ss_pred ccC Q lcl|Aclame:pro 229 TGV 231 (231) Q Consensus 229 ~~~ 231 (231) +++ T Consensus 383 ~~~ 385 (397) T protein:vir:49 383 KAI 385 (397) T ss_pred ccc Confidence 999 No 78 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=99.89 E-value=5.5e-25 Score=153.60 Aligned_cols=223 Identities=13% Similarity=0.081 Sum_probs=174.0 Q ss_pred CCCcccCceEEeccccC--CcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIG--DAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANK 78 (231) Q Consensus 1 ~~~~~~G~ti~~P~~ig--da~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~~ 78 (231) ..-.-.|.++++|.+.+ .+.+++||+++|..++++++.+++.++.+..+.+|+|...++..|+.+...++++++++++ T Consensus 62 ~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~ 141 (324) T protein:vir:96 62 KYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKK 141 (324) T ss_pred ceeeccCCceEEEEEecCcceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHH Confidence 11112356688998754 4668999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhccc-------------ccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccccccC Q lcl|Aclame:pro 79 VDDDLLKAAKTT-------------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGA 145 (231) Q Consensus 79 vd~~~~~~l~t~-------------~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~ 145 (231) +|+.++....+. .....+..++++|.++...+........+++|||..+..|++.++..++ T Consensus 142 ~d~a~l~G~g~~~~~~gi~~~~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~d~~G~------ 215 (324) T protein:vir:96 142 FDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETK------ 215 (324) T ss_pred HHHHHhccCCCCCcCccccccccccceeccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCC------ Confidence 999998653221 1123356789999999999988888888999999999999987654332 Q ss_pred ceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccch----------------hhcccEEEE Q lcl|Aclame:pro 146 NALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDI----------------VTKTTVITA 209 (231) Q Consensus 146 ~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~----------------~~~~~~i~~ 209 (231) -.+..|..++++|+||+.++.++.+++..+...++ . +.+...+++.+|..++. .+....++. T Consensus 216 ~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~-~-~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~ 293 (324) T protein:vir:96 216 ERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFD-K-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRA 293 (324) T ss_pred eeecCCCCCcccceeeEeeCCCCCCcceEEEEecc-e-EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEE Confidence 23345667899999999998766655543322222 2 34666778888887764 245678889 Q ss_pred EEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 210 DEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 210 ~~~y~~~~~~~~~vv~l~~~~~ 231 (231) .++|++.+.+|+++++|+.+-. T Consensus 294 ~~r~d~~v~~~~A~~~l~~a~~ 315 (324) T protein:vir:96 294 TMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred EEEEccEEecccceEEEecccc Confidence 9999999999999999876333 No 79 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=99.89 E-value=5.5e-25 Score=153.60 Aligned_cols=223 Identities=13% Similarity=0.081 Sum_probs=174.0 Q ss_pred CCCcccCceEEeccccC--CcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIG--DAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANK 78 (231) Q Consensus 1 ~~~~~~G~ti~~P~~ig--da~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~~ 78 (231) ..-.-.|.++++|.+.+ .+.+++||+++|..++++++.+++.++.+..+.+|+|...++..|+.+...++++++++++ T Consensus 62 ~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~ 141 (324) T protein:vir:78 62 KYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKK 141 (324) T ss_pred ceeeccCCceEEEEEecCcceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHH Confidence 11112356688998754 4668999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhccc-------------ccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccccccC Q lcl|Aclame:pro 79 VDDDLLKAAKTT-------------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGA 145 (231) Q Consensus 79 vd~~~~~~l~t~-------------~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~ 145 (231) +|+.++....+. .....+..++++|.++...+........+++|||..+..|++.++..++ T Consensus 142 ~d~a~l~G~g~~~~~~gi~~~~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~d~~G~------ 215 (324) T protein:vir:78 142 FDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETK------ 215 (324) T ss_pred HHHHHhccCCCCCcCccccccccccceeccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCC------ Confidence 999998653221 1123356789999999999988888888999999999999987654332 Q ss_pred ceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccch----------------hhcccEEEE Q lcl|Aclame:pro 146 NALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDI----------------VTKTTVITA 209 (231) Q Consensus 146 ~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~----------------~~~~~~i~~ 209 (231) -.+..|..++++|+||+.++.++.+++..+...++ . +.+...+++.+|..++. .+....++. T Consensus 216 ~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~-~-~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~ 293 (324) T protein:vir:78 216 ERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFD-K-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRA 293 (324) T ss_pred eeecCCCCCcccceeeEeeCCCCCCcceEEEEecc-e-EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEE Confidence 23345667899999999998766655543322222 2 34666778888887764 245678889 Q ss_pred EEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 210 DEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 210 ~~~y~~~~~~~~~vv~l~~~~~ 231 (231) .++|++.+.+|+++++|+.+-. T Consensus 294 ~~r~d~~v~~~~A~~~l~~a~~ 315 (324) T protein:vir:78 294 TMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred EEEEccEEecccceEEEecccc Confidence 9999999999999999876333 No 80 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=99.89 E-value=5.8e-25 Score=153.47 Aligned_cols=220 Identities=14% Similarity=0.098 Sum_probs=171.3 Q ss_pred CCCcccCceEEeccccC--CcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIG--DAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANK 78 (231) Q Consensus 1 ~~~~~~G~ti~~P~~ig--da~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~~ 78 (231) .--+.++..+.+|...+ .+.+++||++++..++++++.+++.++++..+.+|+|...++..|+.+...++++++++++ T Consensus 45 ~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~ 124 (297) T protein:vir:95 45 YQEMEGEQEKTVYVQTDGISAYWVNETEKIKTDKPEVVPVTLKAHKLGIILVTSREALNYTWKKFFEDMKPQIVEAFYKK 124 (297) T ss_pred eeecCCCccEEEEEEcCCceeEEeecCccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHH Confidence 11233444567776544 4568999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcc------------cccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccccccCc Q lcl|Aclame:pro 79 VDDDLLKAAKT------------TSQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGAN 146 (231) Q Consensus 79 vd~~~~~~l~t------------~~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~ 146 (231) +|+.++.+..+ +....++.++|++|.+++.++.+.+.....++|||..+..|++..+..+ . T Consensus 125 ~d~a~l~G~g~~~~~gi~~~~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~G-------~ 197 (297) T protein:vir:95 125 IDEAGLLGHDTPFANSVAKAAKDANKVIGGPINYDNILKLQDALYDADVEPNAFVSKIQNRSALREARDGNK-------V 197 (297) T ss_pred HHHHHhcccCCcccccccccccccceecccccCHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCC-------c Confidence 99999864321 1122345678999999999999888888899999999999998654322 3 Q ss_pred eeeeccceeecceeEEEcCCCc--cCceEEEEEecCCceEEEeecCCccceeccch----------------hhcccEEE Q lcl|Aclame:pro 147 ALINGTYADVLGAQIVRSKKLA--EGSALMFKIVSNSPALKLVLKRGVQVETDRDI----------------VTKTTVIT 208 (231) Q Consensus 147 ~~~~G~ig~~~G~~Vv~s~~~~--~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~----------------~~~~~~i~ 208 (231) .+.++..++++|+||+.++..+ ++..++.+ +. .+.+...+++.++..|+. .+....++ T Consensus 198 ~i~~~~~~~l~G~Pv~~~~~~~~~~~~~~~gd--~s--~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r 273 (297) T protein:vir:95 198 SIYDKAANTIDGITTVDLKSARFEKGDLLAGD--FD--NLIYGVPYNITYKISEEGQISTITNADGTPINLFEQEMIAIR 273 (297) T ss_pred eeecCCCCcccceeeEeecCCCCCCceEEEEe--cc--cEEEEEecCeEEEEeeccccccccccCccchhhhhcCcEEEE Confidence 3556677899999999887654 44444333 22 233566778888777664 34567888 Q ss_pred EEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 209 ADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 209 ~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) ...++++++.+|+++++|+.+.- T Consensus 274 ~~~~~d~~v~~~~a~~~l~~at~ 296 (297) T protein:vir:95 274 ATMDIAVMITKTDAFAKLTPAER 296 (297) T ss_pred EEEEeccEeecccceEEEeecCC Confidence 99999999999999999987665 No 81 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=99.89 E-value=3.6e-25 Score=154.61 Aligned_cols=229 Identities=10% Similarity=0.041 Sum_probs=175.5 Q ss_pred CCC---cccCceEEeccccC--CcccccCCCccCcc-ccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENG---INLANLCEYPNDIG--DAADVAEGGEISLD-KIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLS 74 (231) Q Consensus 1 ~~~---~~~G~ti~~P~~ig--da~~v~EG~~i~~~-~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ 74 (231) ..+ ...+.++.+|...+ .+.+++||+.+|.. ..++++.++.+++++..+.+|+|.+.++..|+.+...++++++ T Consensus 138 ~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ 217 (407) T protein:vir:48 138 QEATVITLGGSDYKKLVNLGGTTSGWVGETDARPETATSKLGLIEPFMGEIYGNPQATQKMLDDAFFNVEDWINSELALE 217 (407) T ss_pred hhceeeecCCCceEEEEecCCcceeeecccccccccccccceeEEeeeeeeEeehhhHHHHHhcchHHHHHHHHHHHHHH Confidence 111 12344677776544 35579999999865 4689999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhccc--------------------------ccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHH Q lcl|Aclame:pro 75 LANKVDDDLLKAAKTT--------------------------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAA 128 (231) Q Consensus 75 ia~~vd~~~~~~l~t~--------------------------~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~ 128 (231) ++.++|..++.+-.+. ....++.+++|+|.++...|........+|+|||..+. T Consensus 218 i~~~~~~a~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~~~a~~v~n~~~~~ 297 (407) T protein:vir:48 218 FAEQEEIAFTSGDGSKKPKGFLAYESTDEDDKTRAFGKLQHIASGAASGVTADAIIKLIYTLRKAHRSGAKFMMNNSSLF 297 (407) T ss_pred HHHHHHhhhhccCCCCccceeeecccccccccccccccccccccccccccChHHHHHHHHhhchhhhcCCEEEEcHHHHH Confidence 9999999988642110 01122346799999999999876666678999999999 Q ss_pred HHHhhhhhhhccccccCceeeeccceeecceeEEEcCCCccCce--EEEEEecCCceEEEeecCCccceeccchhhcccE Q lcl|Aclame:pro 129 KIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSA--LMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTV 206 (231) Q Consensus 129 ~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~--~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~ 206 (231) .|++.++..++. ....-+.+|..++++|.||+++++||.... ..+.+..-..++.++.+.+++++.++...++... T Consensus 298 ~L~~lkD~~Gr~--l~~~~~~~g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~i~~d~~~~~~~~~ 375 (407) T protein:vir:48 298 AIRLLKDNDGNY--LWRPGIELGQPSSLAGYGIVENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILRDPYTNKPFVG 375 (407) T ss_pred HHHHhhccCCce--eeccCcCCCCCceecceeeEEecCcCCccCCccEEEEEeccccEEEEEeeceEEEeeccccCCcEE Confidence 998876655432 222224567778999999999999996221 1111112234677787788888887777788889 Q ss_pred EEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 207 ITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 207 i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) +++.++|++++++|++++++++++. T Consensus 376 ~~~~~r~d~~v~~~~a~~~l~~~aa 400 (407) T protein:vir:48 376 FYTTKRTGGMLVDSQAIKLMKIGAA 400 (407) T ss_pred EEEEEEeccEEecccceEEEEeecc Confidence 9999999999999999999999999 No 82 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=99.89 E-value=4.2e-25 Score=154.24 Aligned_cols=219 Identities=14% Similarity=0.092 Sum_probs=168.9 Q ss_pred CCCcccCceEEeccccCC--cccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGD--AADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANK 78 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igd--a~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~~ 78 (231) .+ + ++..+++|++.+. +.+++||+++|..+.++++.+++++|++..+.+|+|...++..|+.+...++++++++++ T Consensus 46 ~~-~-~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~ 123 (304) T protein:vir:10 46 EP-M-TAQKKKFTYLAKGVGAYWVSETERIQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKA 123 (304) T ss_pred ee-c-cCCceEEEEEeCCcceEEeecCcccccccceeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHH Confidence 21 2 3456889988654 568999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhccc-------------c-----cccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhcc Q lcl|Aclame:pro 79 VDDDLLKAAKTT-------------S-----QTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIG 140 (231) Q Consensus 79 vd~~~~~~l~t~-------------~-----~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~ 140 (231) +|..++.+-.+. . ....+..+|++|.+++..+...+..+..++|||..+..|++.++..++ T Consensus 124 ~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~- 202 (304) T protein:vir:10 124 FDQAVIFGTKSPYNTSTSGKPLVEGAEEKGNVVTDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNALDANDR- 202 (304) T ss_pred HHhhheeccCCCcccccccccccccccccccccccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCc- Confidence 999998643211 0 012334679999999999988888888999999999999987543322 Q ss_pred ccccCceeeeccceeecceeEEEcCCCccCc--eEEEEEecCCceEEEeecCCccceeccch------------------ Q lcl|Aclame:pro 141 SEVGANALINGTYADVLGAQIVRSKKLAEGS--ALMFKIVSNSPALKLVLKRGVQVETDRDI------------------ 200 (231) Q Consensus 141 ~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~--~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~------------------ 200 (231) .+.++..++++|+||++++++|... +..+...+ .. +.+...+++.++..|+. T Consensus 203 ------~l~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~gd~-~~-~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f 274 (304) T protein:vir:10 203 ------PLFDANGNEIMGLPLSYTGADVYDKKKSLALMGDW-DY-ARYGILQGIEYAISEDATLTTLQASDASGQPVSLF 274 (304) T ss_pred ------EeecCCCccccceeeEEecccccCCCCcEEEEEeh-hh-EEEEEecceEEEEeecceeeeecccccCccchhhh Confidence 2334455899999999999998532 22221122 22 33555567777665553 Q ss_pred hhcccEEEEEEEEEEEEEcCCcEEEEEecc Q lcl|Aclame:pro 201 VTKTTVITADEHYAAYLYDLTKVVNITFTG 230 (231) Q Consensus 201 ~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~ 230 (231) .+....+++..+|++.+.+|+++++|+.+= T Consensus 275 ~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 275 ERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred hcCcEEEEEEEEeccEeecccceEEEEecC Confidence 234567888999999999999999998888 No 83 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=99.89 E-value=4.2e-25 Score=154.24 Aligned_cols=219 Identities=14% Similarity=0.092 Sum_probs=168.9 Q ss_pred CCCcccCceEEeccccCC--cccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGD--AADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANK 78 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igd--a~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~~ 78 (231) .+ + ++..+++|++.+. +.+++||+++|..+.++++.+++++|++..+.+|+|...++..|+.+...++++++++++ T Consensus 46 ~~-~-~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~ 123 (304) T protein:vir:94 46 EP-M-TAQKKKFTYLAKGVGAYWVSETERIQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKA 123 (304) T ss_pred ee-c-cCCceEEEEEeCCcceEEeecCcccccccceeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHH Confidence 21 2 3456889988654 568999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhccc-------------c-----cccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhcc Q lcl|Aclame:pro 79 VDDDLLKAAKTT-------------S-----QTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIG 140 (231) Q Consensus 79 vd~~~~~~l~t~-------------~-----~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~ 140 (231) +|..++.+-.+. . ....+..+|++|.+++..+...+..+..++|||..+..|++.++..++ T Consensus 124 ~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~- 202 (304) T protein:vir:94 124 FDQAVIFGTKSPYNTSTSGKPLVEGAEEKGNVVTDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNALDANDR- 202 (304) T ss_pred HHhhheeccCCCcccccccccccccccccccccccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCc- Confidence 999998643211 0 012334679999999999988888888999999999999987543322 Q ss_pred ccccCceeeeccceeecceeEEEcCCCccCc--eEEEEEecCCceEEEeecCCccceeccch------------------ Q lcl|Aclame:pro 141 SEVGANALINGTYADVLGAQIVRSKKLAEGS--ALMFKIVSNSPALKLVLKRGVQVETDRDI------------------ 200 (231) Q Consensus 141 ~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~--~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~------------------ 200 (231) .+.++..++++|+||++++++|... +..+...+ .. +.+...+++.++..|+. T Consensus 203 ------~l~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~gd~-~~-~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f 274 (304) T protein:vir:94 203 ------PLFDANGNEIMGLPLSYTGADVYDKKKSLALMGDW-DY-ARYGILQGIEYAISEDATLTTLQASDASGQPVSLF 274 (304) T ss_pred ------EeecCCCccccceeeEEecccccCCCCcEEEEEeh-hh-EEEEEecceEEEEeecceeeeecccccCccchhhh Confidence 2334455899999999999998532 22221122 22 33555567777665553 Q ss_pred hhcccEEEEEEEEEEEEEcCCcEEEEEecc Q lcl|Aclame:pro 201 VTKTTVITADEHYAAYLYDLTKVVNITFTG 230 (231) Q Consensus 201 ~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~ 230 (231) .+....+++..+|++.+.+|+++++|+.+= T Consensus 275 ~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 275 ERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred hcCcEEEEEEEEeccEeecccceEEEEecC Confidence 234567888999999999999999998888 No 84 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=99.88 E-value=8.6e-25 Score=152.52 Aligned_cols=227 Identities=12% Similarity=-0.007 Sum_probs=169.0 Q ss_pred CCCcccCceEEecccc--CCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDI--GDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANK 78 (231) Q Consensus 1 ~~~~~~G~ti~~P~~i--gda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~~ 78 (231) -+-+.+...+.+|..+ +.+.+++||+.++.+++++++.++++++++..+.||++.+.++ .|+.+...++++.+++++ T Consensus 285 ~~~~~~~g~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~-~~~~~~i~~~l~~~~~~~ 363 (543) T protein:vir:81 285 ARQVVATGDVWHGVSSAAVQWSWDAEFEEVSDDSPEFGQPEIPVKKAQGFVPISIEALQDE-ANVTETVALLFAEGKDEL 363 (543) T ss_pred cccccCCcceEEEEecCCcceeecccCccccccccccceeeeeeeeeEeeehhhHHHHhcc-HHHHHHHHHHHHHHHHHH Confidence 2222333356677654 3566899999999999999999999999999999999988776 699999999999999999 Q ss_pred HHHHHHHHhccc-----------------ccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccc Q lcl|Aclame:pro 79 VDDDLLKAAKTT-----------------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGS 141 (231) Q Consensus 79 vd~~~~~~l~t~-----------------~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~ 141 (231) +|..++.+-.+. ....+..++++++.++...+........+|+|||.++..|++..+..+. T Consensus 364 ~d~ail~G~Gt~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~l~~lkd~~G~-- 441 (543) T protein:vir:81 364 EAVTLTTGTGQGNQPTGIVTALAGTAAEIAPVTAETFALADVYAVYEQLAARHRRQGAWLANNLIYNKIRQFDTQGGA-- 441 (543) T ss_pred HHHHHhccCCCCcccccchhhcccccccccccccccccHHHHHHHHHhhhccccCCcEEEEcHHHHHHHHHhhcCCCc-- Confidence 999998653221 0112235689999999999977666667899999999999987654433 Q ss_pred cccCceeeeccceeecceeEEEcCCCccCceE------EEEEecCCceEEEeecCCccceeccc------hhhcccEEEE Q lcl|Aclame:pro 142 EVGANALINGTYADVLGAQIVRSKKLAEGSAL------MFKIVSNSPALKLVLKRGVQVETDRD------IVTKTTVITA 209 (231) Q Consensus 142 ~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~------~~~~~~~~~A~~~~~k~~v~vE~~Rd------~~~~~~~i~~ 209 (231) +...+ +.+|..++++|.||+++++||.+... ...+.+....+.++...++.++.+.+ ..++...+++ T Consensus 442 ~l~~~-~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~i~~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 520 (543) T protein:vir:81 442 GLWTT-IGNGEPSQLLGRPVGEAEAMDANWNTSASADNFVLLYGNFQNYVIADRIGMTVEFIPHLFGTNRRPNGSRGWFA 520 (543) T ss_pred eeccC-cCCCCCccccceeeEEeccccccccccccCCcceEEEeeccceeEEeecccEEEEeccccccchhhcCceEEEE Confidence 22222 34566789999999999999865411 11111111234556667777775442 2345668899 Q ss_pred EEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 210 DEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 210 ~~~y~~~~~~~~~vv~l~~~~~ 231 (231) ..++++.+.+|++++++++++. T Consensus 521 ~~r~d~~v~~~~A~~~l~~~~~ 542 (543) T protein:vir:81 521 YYRMGADVVNPNAFRLLNVETA 542 (543) T ss_pred EEeeccEeecccceEEEEeccc Confidence 9999999999999999999999 No 85 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=99.88 E-value=7.5e-25 Score=152.84 Aligned_cols=227 Identities=15% Similarity=0.077 Sum_probs=174.6 Q ss_pred CCCcccCceEEeccc---cCCcccccCCCccCcc-ccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND---IGDAADVAEGGEISLD-KIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLA 76 (231) Q Consensus 1 ~~~~~~G~ti~~P~~---igda~~v~EG~~i~~~-~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia 76 (231) ......|+.. +|++ .+.+.+++||++++.. ++++++.++++++++..+.||++.+.++..|+.+...++++++|+ T Consensus 147 ~~~~~~~~~~-~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~v~~~l~~~~~ 225 (397) T protein:vir:48 147 NVTTLTGSRV-YEKWADITGLAKLDDEAGSIGTNDDPKLYPIRYAIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVV 225 (397) T ss_pred eccCCcceEE-EEeecCCCcceeeeccccccccccccceeeEEeeheeeeeehhhHHHHHhhchHHHHHHHHHHHHHHHH Confidence 1112222222 3333 2346789999999865 689999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcccccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccccccCceeeeccceee Q lcl|Aclame:pro 77 NKVDDDLLKAAKTTSQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADV 156 (231) Q Consensus 77 ~~vd~~~~~~l~t~~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~ 156 (231) +++|..++++..+.. ..++.+++++|.++...+.........++|||..+..|++.++..++. ....-+.+|..+++ T Consensus 226 ~~~d~~il~G~g~~~-~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~--i~~~~~~~~~~~~l 302 (397) T protein:vir:48 226 VTRNKAILEAIATLP-TKPTLTKWDDIIDLQAKVDPAIKQTSFFLTNTSGFTALKKVKNAFGDY--LMERDVKSPTGYSI 302 (397) T ss_pred HHHHHHHhhcccccc-cccccccHHHHHHHHHHhhhhhcCCCEEEECHHHHHHHHHhhcCCCce--eeccCcCCCCCcee Confidence 999999998765543 345677899999999999887778889999999999999877654432 22223456778899 Q ss_pred cceeEEEcCC--CccCceE--EEEEecCCceEEEeecCCccceeccc----hhhcccEEEEEEEEEEEEEcCCcEEEEEe Q lcl|Aclame:pro 157 LGAQIVRSKK--LAEGSAL--MFKIVSNSPALKLVLKRGVQVETDRD----IVTKTTVITADEHYAAYLYDLTKVVNITF 228 (231) Q Consensus 157 ~G~~Vv~s~~--~~~~~~~--~~~~~~~~~A~~~~~k~~v~vE~~Rd----~~~~~~~i~~~~~y~~~~~~~~~vv~l~~ 228 (231) +|.||++++. +|.+..- .+.+..-..++.++.+.++.+|.++. -.+....+++..+|++++.+|++++++++ T Consensus 303 ~G~PV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~ 382 (397) T protein:vir:48 303 DGFAVKEVADRWLANASSGAMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKIRVIDRFDVVATDTESFVPASF 382 (397) T ss_pred ccceeEEecccccCCcCCCceEEEEEeccceEEEEeecceEEEEeccchhhhhcCceeEEEEeeeccEEecccceEEEEe Confidence 9999988653 4433221 11122224567778888899988774 35566789999999999999999999999 Q ss_pred ccC Q lcl|Aclame:pro 229 TGV 231 (231) Q Consensus 229 ~~~ 231 (231) +++ T Consensus 383 ~~~ 385 (397) T protein:vir:48 383 KAI 385 (397) T ss_pred ccc Confidence 999 No 86 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=99.88 E-value=6.1e-25 Score=153.33 Aligned_cols=229 Identities=10% Similarity=0.032 Sum_probs=176.3 Q ss_pred CCC---cccCceEEeccccC--CcccccCCCccCccc-cccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENG---INLANLCEYPNDIG--DAADVAEGGEISLDK-IGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLS 74 (231) Q Consensus 1 ~~~---~~~G~ti~~P~~ig--da~~v~EG~~i~~~~-lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ 74 (231) ..+ ...+..+++|...+ .+.+++||+++|..+ .++++.++.+++++..+.+|++.+.++..|+.+...++++++ T Consensus 162 ~l~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~~~~~~f~~v~~~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~a 241 (425) T protein:vir:10 162 QLCRVQPVSKAGFSKLFNMGGTTSGWVGEASQRPQTNAATFQPLSFASGEIYANPAATQQILDDAEIDLESWLATEVQTE 241 (425) T ss_pred hhceeeeccCCceEEEEEcCCcceeeeccccccccccccccceeeeeheeeEeehHhHHHHHhcchhHHHHHHHHHHHHH Confidence 111 11233466775544 456899999998765 689999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhc--------c-cc-----------------cccccccCHHHHHHHHHHhhccCCCceEEEECHHHHH Q lcl|Aclame:pro 75 LANKVDDDLLKAAK--------T-TS-----------------QTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAA 128 (231) Q Consensus 75 ia~~vd~~~~~~l~--------t-~~-----------------~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~ 128 (231) +++++|..++.+-. + .+ ...++.+++++|+++...|........+|+|||.++. T Consensus 242 i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~~~~a~~vmn~~~~~ 321 (425) T protein:vir:10 242 FAKQEGKAFLAGDGTNKPNGLLTYIAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVYDLPSAFTGNARFAMNRNTQR 321 (425) T ss_pred HHHHHHhhhhcccCCCCcceeeeccccccccccccccccccccccccccccHHHHHHHHhhhhhhhccCCEEEEchHHHH Confidence 99999999987411 0 00 0123456899999999998876667778999999999 Q ss_pred HHHhhhhhhhccccccCceeeeccceeecceeEEEcCCCccCce--EEEEEecCCceEEEeecCCccceeccchhhcccE Q lcl|Aclame:pro 129 KIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSA--LMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTV 206 (231) Q Consensus 129 ~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~--~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~ 206 (231) .|++.++..++ +....-+.+|.-++++|.||+++++||.... ..+.+..-..++.++.+.++++..++...++... T Consensus 322 ~L~~lkD~~G~--~l~~~~~~~g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~v~~d~~~~~~~~~ 399 (425) T protein:vir:10 322 QVRKLKDGQGN--YLWQPSYVAGQPATLAGYPVTEVPDMPDVAANSTPILFGDFQQTYLIIDRIGVRVLRDPYTAKPYVL 399 (425) T ss_pred HHHHhhcCCCc--eeeccCccCCCCceecceeeEEecCcCCccCCccEEEEEehhccEEEEEecceEEEecccccCCcEE Confidence 99987765543 2222334567778999999999999995321 1111222245677787788888888777788889 Q ss_pred EEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 207 ITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 207 i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) +++..||++++.+|++++++++++. T Consensus 400 ~~~~~r~d~~v~~~~A~~~l~~~as 424 (425) T protein:vir:10 400 FYTTKRVGGGLLNPEPMRAMKVAAS 424 (425) T ss_pred EEEEEEeccEeecccceEEEEeecc Confidence 9999999999999999999999999 No 87 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=99.88 E-value=1.1e-24 Score=151.89 Aligned_cols=230 Identities=14% Similarity=0.069 Sum_probs=174.3 Q ss_pred CCCcccCceEEeccc--cCCcccccCCCccCcc------ccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--IGDAADVAEGGEISLD------KIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLG 72 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--igda~~v~EG~~i~~~------~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a 72 (231) -.-+ .|....+|.. .+.+.+++||+..+.+ +.++++.+++.++++..+.||++.+.++..++.+...++++ T Consensus 198 ~~~~-~~~~~~~~~~~~~~~a~~v~e~~~~~~~~~~~~~~~~~~~i~~~~~k~~~~v~is~ell~ds~~~~~~~i~~~l~ 276 (458) T protein:vir:10 198 ELPM-SSKILTMLVEPDAGKATWVAASTYGTDTTTGEEVKGALKEIHFSTYKLAAKSFITDETEEDAIFSLLPLLRKRLI 276 (458) T ss_pred eeec-CCcceEEEEecCCcceeecccccccccccccccccccceeeEeeeeeEEeeehhhHHHHhcchHHHHHHHHHHHH Confidence 1112 2344556643 3456678888776643 56899999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHhccc--------------------ccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHh Q lcl|Aclame:pro 73 LSLANKVDDDLLKAAKTT--------------------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRK 132 (231) Q Consensus 73 ~~ia~~vd~~~~~~l~t~--------------------~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k 132 (231) ++|++++|..++.+-.+. +.......+|++|+++...+.........++|||..+..|++ T Consensus 277 ~~i~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~ 356 (458) T protein:vir:10 277 EAHAVSIEEAFMTGDGSGKPKGLLTLASEDSAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDL 356 (458) T ss_pred HHHHHHHHHHhhcCCCCCccceeeecccccccceeecccccccccccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHh Confidence 999999999998642210 011123468999999999998777788899999999999988 Q ss_pred hhhhhhccc--cccCceeeeccceeecceeEEEcCCCccCceE-EEEEecCCceEEEeecCCccceeccchhhcccEEEE Q lcl|Aclame:pro 133 DANAKNIGS--EVGANALINGTYADVLGAQIVRSKKLAEGSAL-MFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITA 209 (231) Q Consensus 133 ~~~~~~~~~--~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~-~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~ 209 (231) ..+..++.. .........|..++++|+||++++.||.+..- .+.+.....++.++.+.+++++.|+........++. T Consensus 357 lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~~~~~~~f~~~~~~~~~~~~~v~~d~~~~~~~~~~~~ 436 (458) T protein:vir:10 357 LEDEEWQDVAQVGNDSVKLQGQVGRIYGLPVVVSEYFPAKANSAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYV 436 (458) T ss_pred hcccCCceeeccccccccccCcCceecceeeEEccccccccCCcceEEEEecccEEEEEeeceEEEeecccCCCceEEEE Confidence 766554322 12223445567789999999999999975421 111222234567788888999888777788888999 Q ss_pred EEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 210 DEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 210 ~~~y~~~~~~~~~vv~l~~~~~ 231 (231) .+++|..+++|+++|+.+++|. T Consensus 437 ~~r~~~~v~~~~a~v~~~~aa~ 458 (458) T protein:vir:10 437 TQRVNLQRYFANGVVSGTYAAS 458 (458) T ss_pred EEEecceEecccceEEEeeccC Confidence 9999999999999999999999 No 88 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=99.88 E-value=1.2e-24 Score=151.71 Aligned_cols=228 Identities=16% Similarity=0.068 Sum_probs=172.2 Q ss_pred CCCcccCceEEeccccC---CcccccCCCccCc-cccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIG---DAADVAEGGEISL-DKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLA 76 (231) Q Consensus 1 ~~~~~~G~ti~~P~~ig---da~~v~EG~~i~~-~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia 76 (231) .+-.++--++.+|++.+ .+..++||++++. +++++++.++++++.+..+.+|++...++..|+.+...++++++++ T Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~ 232 (408) T protein:vir:74 153 ESVSTSSGSRVYEKWTDVTPLKAMDEEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVV 232 (408) T ss_pred eeccCCcceEEEEeecCCcccccccccccccccccccceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHH Confidence 22222223455666532 3458999999985 6799999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcccccccccccCHHHHHHHHH-HhhccCCCceEEEECHHHHHHHHhhhhhhhccccccCceeeecccee Q lcl|Aclame:pro 77 NKVDDDLLKAAKTTSQTVSTKANVDGVQAALD-IFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYAD 155 (231) Q Consensus 77 ~~vd~~~~~~l~t~~~~~~~~~~~d~i~da~~-~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~ 155 (231) +++|..++.+..+.. ...+..+++++.+++. .+........+++|||..+..|++.++..+.. ....-+.+|.-++ T Consensus 233 ~~~d~~il~G~G~~~-~~~~~~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~--l~~~~~~~~~~~~ 309 (408) T protein:vir:74 233 VTRNQAIIAAMGTVP-KKPTIANFDDVITMINTSVDPAIIATSSLLTNQSGLNKLALVKTAEGKY--LLEPDPTKPNSYL 309 (408) T ss_pred HHHHHHHhhcccccc-cccccccHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhcCCCce--EeccCcCCCCCce Confidence 999999998765543 3345678999999874 66555556678999999999999876554432 2222234566689 Q ss_pred ecceeEEEcCC--CccCce--EEEEEecCCceEEEeecCCccceeccch----hhcccEEEEEEEEEEEEEcCCcEEEEE Q lcl|Aclame:pro 156 VLGAQIVRSKK--LAEGSA--LMFKIVSNSPALKLVLKRGVQVETDRDI----VTKTTVITADEHYAAYLYDLTKVVNIT 227 (231) Q Consensus 156 ~~G~~Vv~s~~--~~~~~~--~~~~~~~~~~A~~~~~k~~v~vE~~Rd~----~~~~~~i~~~~~y~~~~~~~~~vv~l~ 227 (231) ++|+||+++++ +|.... ..+.+..-..++.++.++++.++.++.. .+....+++..+|++++++|+++++++ T Consensus 310 l~G~pV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~ 389 (408) T protein:vir:74 310 IKGKQVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGS 389 (408) T ss_pred ecceeeEEecCcccccccCCcceEEEEehhccEEEEEecceEEEEeccccchhhcceeeEEEEEeeCcEEecccceEEEE Confidence 99999998753 564322 1112222244677888889999888753 456778999999999999999999999 Q ss_pred eccC Q lcl|Aclame:pro 228 FTGV 231 (231) Q Consensus 228 ~~~~ 231 (231) +++| T Consensus 390 ~~~~ 393 (408) T protein:vir:74 390 FTAI 393 (408) T ss_pred eecc Confidence 9999 No 89 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=99.88 E-value=2.1e-24 Score=150.40 Aligned_cols=227 Identities=12% Similarity=0.100 Sum_probs=170.5 Q ss_pred CCCcccCceEEeccccC--CcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIG--DAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANK 78 (231) Q Consensus 1 ~~~~~~G~ti~~P~~ig--da~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~~ 78 (231) ..-.-.+.++++|.+.+ .+.+++||++++..++++++.+++.++.+..+.+|+|...++..|+.+...++++++++++ T Consensus 49 ~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~ 128 (318) T protein:vir:24 49 QKVPMGTTGQKIPHWVGDVSAQWIGEGDMKPITKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMA 128 (318) T ss_pred ceeeccCCceEEEEEeCCcceEEecCCccccccccceeEEEEeeEEEEEeehhhHHHhhcChHHHHHHHHHHHHHHHHHH Confidence 11112356688998754 4568999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcccc--------------ccc-ccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccccc Q lcl|Aclame:pro 79 VDDDLLKAAKTTS--------------QTV-STKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEV 143 (231) Q Consensus 79 vd~~~~~~l~t~~--------------~~~-~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~ 143 (231) +|..++.+-.+.. ... ......+.+.++...+........+++|||..+..|++.++..+. +. T Consensus 129 ~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~--~l 206 (318) T protein:vir:24 129 FDGAAMHGTDSPFPTYIGQTTKAISIADTTGATTVYDQVAVNGLSLLVNDGKKWTHTLLDDITEPILNGAKDQNGR--PL 206 (318) T ss_pred HHHhhhcccCCCCCcccccccccccccccccccchHHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhccCCc--ee Confidence 9999997543210 011 112233556777888877777888999999999999987654432 22 Q ss_pred cCceeeec-----cceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccch----------------hh Q lcl|Aclame:pro 144 GANALING-----TYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDI----------------VT 202 (231) Q Consensus 144 ~~~~~~~G-----~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~----------------~~ 202 (231) ......+| .-++++|+||++++.+|.++...+...+. -+.+...+++.+|..|+. .+ T Consensus 207 ~~~~~~~~~~~~~~~~~i~g~pv~~~~~~~~~~~~~~~gdfs--~~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~ 284 (318) T protein:vir:24 207 FIESTYGEAASPFRSGRIVARPTILSDHVVEGTTVGFMGDFS--QLIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQH 284 (318) T ss_pred ecCccccCccccccCceEEEEeeEEeCCCCCCccEEEEeecc--eEEEEEecCeEEEEeeccceeccccccccchhhhhc Confidence 11112222 23579999999999999988755433333 234566778888877763 34 Q ss_pred cccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 203 KTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 203 ~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) ....+++.++|++++.+|+++++|+.++- T Consensus 285 ~~~~~r~~~r~d~~v~~~~a~~~i~~~~a 313 (318) T protein:vir:24 285 NLVAVRVEAEYAFHCNDAEAFVALTNVVS 313 (318) T ss_pred CcEEEEEEEEEccEEecccceEEEEeecc Confidence 56788999999999999999999998766 No 90 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=99.88 E-value=2.2e-24 Score=150.27 Aligned_cols=228 Identities=15% Similarity=0.059 Sum_probs=171.7 Q ss_pred CCCcccCceEEeccc---cCCcccccCCCccCc-cccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND---IGDAADVAEGGEISL-DKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLA 76 (231) Q Consensus 1 ~~~~~~G~ti~~P~~---igda~~v~EG~~i~~-~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia 76 (231) .+....--.+.+|++ .+.+.+++||++++. +..++++.++.+++++..+.||++...++..|+.+...++++++++ T Consensus 153 ~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~ 232 (408) T protein:vir:10 153 ESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVV 232 (408) T ss_pred eeccCCcceEEEeeccccccceeeecCccccccccCcceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHH Confidence 221111122344444 345668999999985 5689999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcccccccccccCHHHHHHHHH-HhhccCCCceEEEECHHHHHHHHhhhhhhhccccccCceeeecccee Q lcl|Aclame:pro 77 NKVDDDLLKAAKTTSQTVSTKANVDGVQAALD-IFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYAD 155 (231) Q Consensus 77 ~~vd~~~~~~l~t~~~~~~~~~~~d~i~da~~-~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~ 155 (231) .++|..++++..+.. ...+..++++|.+++. .+.........|+|||..+..|++.++..+.. ....-+.+|..++ T Consensus 233 ~~~~~~il~g~g~~~-~~~~~~~~~~l~~~~~~~~~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~--i~~~~~~~~~~~~ 309 (408) T protein:vir:10 233 VTRNQAIIEVMKAAP-KKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKY--LLEPDPTKPNSYL 309 (408) T ss_pred HHHHHHHhhcccccc-cccccccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCce--EeccCcCCCCCce Confidence 999999998876544 3345678999999874 56544455668999999999999987655443 2222345677789 Q ss_pred ecceeEEEcC--CCccCce--EEEEEecCCceEEEeecCCccceeccch----hhcccEEEEEEEEEEEEEcCCcEEEEE Q lcl|Aclame:pro 156 VLGAQIVRSK--KLAEGSA--LMFKIVSNSPALKLVLKRGVQVETDRDI----VTKTTVITADEHYAAYLYDLTKVVNIT 227 (231) Q Consensus 156 ~~G~~Vv~s~--~~~~~~~--~~~~~~~~~~A~~~~~k~~v~vE~~Rd~----~~~~~~i~~~~~y~~~~~~~~~vv~l~ 227 (231) ++|.||++++ .+|.... ..+.+..-..++.++.++++.++.+++. .+....+++..+|++++.+|+++++++ T Consensus 310 l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~ 389 (408) T protein:vir:10 310 IKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGS 389 (408) T ss_pred ecceeeEEecccccCccCCCceEEEEEehhccEEEEEecceEEEEcccccchhhcCceEEEEEEeeccEEeccccEEEEE Confidence 9999999965 4564322 1111212244567788889999887764 356778999999999999999999999 Q ss_pred eccC Q lcl|Aclame:pro 228 FTGV 231 (231) Q Consensus 228 ~~~~ 231 (231) ++++ T Consensus 390 ~~~~ 393 (408) T protein:vir:10 390 FSAI 393 (408) T ss_pred eecc Confidence 9998 No 91 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=99.88 E-value=1.2e-24 Score=151.68 Aligned_cols=229 Identities=10% Similarity=0.027 Sum_probs=176.3 Q ss_pred CCCcccCceEEeccccC--CcccccCCCccCc-cccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIG--DAADVAEGGEISL-DKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~ig--da~~v~EG~~i~~-~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) ..-...|.+..+|...+ .+.+++||++.+. +..++++.++.+++++..+.+|++.+.++..|+.+...+++++++++ T Consensus 142 ~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~ai~~ 221 (401) T protein:vir:44 142 TVITVGGSDYKKLVNLGGTASGWVGETDTRSQTATSRLGLIEPFMGEIYGNPQATQKMLDDAFFNVEAWINSELATEFAE 221 (401) T ss_pred eeeecCCCceEEEEecCCccceeeccccccCccccccceeeeeehhheeeehhhhHHHHhcchHHHHHHHHHHHHHHHHH Confidence 11122355667776544 3457999999875 55799999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhccc--------------------------ccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHH Q lcl|Aclame:pro 78 KVDDDLLKAAKTT--------------------------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIR 131 (231) Q Consensus 78 ~vd~~~~~~l~t~--------------------------~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~ 131 (231) ++|..++.+-.+. ....+...+|++|+++...|..+.....+|+|||..+..|+ T Consensus 222 ~~~~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~ 301 (401) T protein:vir:44 222 QEEIAFTTGDGTKKPKGFLAYESTEESDKARAFGKLQHIVSGEATAVTADAIIKLIYTLRKAHRTGAKFMMNNNSLFAIR 301 (401) T ss_pred HHHhhhhccCCCCccceeeccccccccccccccccccccccccccccCHHHHHHHHHhcchhhhcCCEEEEcHHHHHHHH Confidence 9999998642210 00122346799999999999876667778999999999998 Q ss_pred hhhhhhhccccccCceeeeccceeecceeEEEcCCCccCce--EEEEEecCCceEEEeecCCccceeccchhhcccEEEE Q lcl|Aclame:pro 132 KDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSA--LMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITA 209 (231) Q Consensus 132 k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~--~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~ 209 (231) +..+..++ +....-+.+|..++++|.||+++++||.... ..+.+..-..++.++.+.+++++.++...+....+++ T Consensus 302 ~lkd~~G~--~l~~~~~~~g~~~~l~G~PVv~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~~~~~~~~~~~~v~~~a 379 (401) T protein:vir:44 302 LLKDTEGN--YLWRPGLELGQPSSLAGYGIAENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILRDPYTNKPFVGFYT 379 (401) T ss_pred HhhccCCc--eeecCCcCCCCCceecceeeEEecCcCCccCCccEEEEeehhccEEEEEecceEEeeeccccCCcEEEEE Confidence 87665443 3222234567778999999999999985321 1111122245677888888888888877778888999 Q ss_pred EEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 210 DEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 210 ~~~y~~~~~~~~~vv~l~~~~~ 231 (231) ..|+++++++|++++.|++++- T Consensus 380 ~~r~d~~~~~~~a~~~l~~~aa 401 (401) T protein:vir:44 380 TKRTGGMLVDSQAIKLLKIAAA 401 (401) T ss_pred EEEeccEEecccceEEEEeecC Confidence 9999999999999999999999 No 92 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=99.88 E-value=2.4e-24 Score=150.06 Aligned_cols=228 Identities=15% Similarity=0.060 Sum_probs=169.7 Q ss_pred CCCcccCceEEeccc---cCCcccccCCCccCc-cccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND---IGDAADVAEGGEISL-DKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLA 76 (231) Q Consensus 1 ~~~~~~G~ti~~P~~---igda~~v~EG~~i~~-~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia 76 (231) ..-.++--++.+|++ .+.+.+++||++++. +++++++.++++++.+..+.||++...++..|+.+...++++++++ T Consensus 153 ~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~ 232 (404) T protein:vir:39 153 ESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVV 232 (404) T ss_pred eeccCCcceEEEEeecCCccceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHH Confidence 221111122334444 234668999999985 6899999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcccccccccccCHHHHHHHHHH-hhccCCCceEEEECHHHHHHHHhhhhhhhccccccCceeeecccee Q lcl|Aclame:pro 77 NKVDDDLLKAAKTTSQTVSTKANVDGVQAALDI-FNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYAD 155 (231) Q Consensus 77 ~~vd~~~~~~l~t~~~~~~~~~~~d~i~da~~~-l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~ 155 (231) +++|+.++.+..+.. ...+..+++++.+++.. +........+++|||..+..|++.++..++ +....-+.+|..++ T Consensus 233 ~~~d~~il~g~g~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~--~l~~~~~~~~~~~~ 309 (404) T protein:vir:39 233 VTRNQAIIAAMGTVP-KKPTIAKFDDVITMINTSVDPAIIATSSLLTNQSGLNKLALVKTAEGK--YLLEPDPTKPNSYL 309 (404) T ss_pred HHHHHHHHhcccccc-cccccccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCc--eeeccCcCCCCcce Confidence 999999998766543 33455679999998864 444344667899999999999987655443 22222245566789 Q ss_pred ecceeEEEcCC--CccCceE--EEEEecCCceEEEeecCCccceeccch----hhcccEEEEEEEEEEEEEcCCcEEEEE Q lcl|Aclame:pro 156 VLGAQIVRSKK--LAEGSAL--MFKIVSNSPALKLVLKRGVQVETDRDI----VTKTTVITADEHYAAYLYDLTKVVNIT 227 (231) Q Consensus 156 ~~G~~Vv~s~~--~~~~~~~--~~~~~~~~~A~~~~~k~~v~vE~~Rd~----~~~~~~i~~~~~y~~~~~~~~~vv~l~ 227 (231) ++|+||+++++ +|..... .+.+..-..++.++.++++.++.++.. .+....+++..+|++.+++|+++++++ T Consensus 310 l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~ 389 (404) T protein:vir:39 310 IKGKKVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKTTDSEALVAGS 389 (404) T ss_pred ecceeEEEecccccCccCCCccEEEEEeccccEEEEeecceEEEEeccchhhhhhceeeEEEEeeeccEEecccceEEEE Confidence 99999999764 4543221 111222244677777889999888764 356678999999999999999999999 Q ss_pred eccC Q lcl|Aclame:pro 228 FTGV 231 (231) Q Consensus 228 ~~~~ 231 (231) ++++ T Consensus 390 ~~~~ 393 (404) T protein:vir:39 390 FTAI 393 (404) T ss_pred eecc Confidence 9999 No 93 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=99.88 E-value=3e-24 Score=149.52 Aligned_cols=227 Identities=11% Similarity=0.075 Sum_probs=167.3 Q ss_pred CCCc-ccCceEEeccccC--CcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGI-NLANLCEYPNDIG--DAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~-~~G~ti~~P~~ig--da~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) =.-+ -.|.++++|.+.+ .+.+++||+++|..++++++.+++++|++..+.+|+|...++..|+.+...+++++++++ T Consensus 48 ~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~ 127 (320) T protein:vir:10 48 AQKVPMGTTGQKIPHWIGDVSAQWIGEGDMKPITKGNMTSQNIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAM 127 (320) T ss_pred cceeeccCCceEEEEEeCCcceEEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcChHHHHHHHHHHHHHHHHH Confidence 0111 2366789998754 566899999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcccc------------ccccccc------CH-HHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhh Q lcl|Aclame:pro 78 KVDDDLLKAAKTTS------------QTVSTKA------NV-DGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKN 138 (231) Q Consensus 78 ~vd~~~~~~l~t~~------------~~~~~~~------~~-d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~ 138 (231) ++|+.++.+-.+.. ...+... .+ +.+.++...+......+.+++|||..+..|++.++..+ T Consensus 128 ~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G 207 (320) T protein:vir:10 128 AFDSAALNGTDSPFPTYLAQTTKSVSLADPGGATASDLTAYDAVAVNGLSLLVNAKKKWTHTLLDDIVEPILNGAKDKNG 207 (320) T ss_pred HHHHHhhcccCCCCCcccccccccccceecccccccccccHHHHHHHHHhhhhcccCCCcEEEEcHHHHHHHHHhhccCC Confidence 99999986432110 0111111 12 24667777777777788899999999999998766543 Q ss_pred ccccccCceeee-----ccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccch------------- Q lcl|Aclame:pro 139 IGSEVGANALIN-----GTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDI------------- 200 (231) Q Consensus 139 ~~~~~~~~~~~~-----G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~------------- 200 (231) .. ...+...+ ..-++++|+||+.++.+|+++...+...+. .+ .+...+++.+|..|+. T Consensus 208 ~~--l~~~~~~~~~~~~~~~~~i~g~pv~~~~~~~~~~~~~~~gd~~-~~-~~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 283 (320) T protein:vir:10 208 RP--LFIESTYTDENSPFRAGRIVSRPTILSDHVADGTTVGYMGDFR-NV-IWGQVGGLSFDVTDQATLNLGTPTEPNFV 283 (320) T ss_pred ce--eeccccccCccccccCceeeeeeeEecCCCCCCceEEEEeecc-eE-EEEEecCeEEEEeecceeeeccccccccc Confidence 32 11111111 123579999999999999988653322222 23 3566678888877764 Q ss_pred ---hhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 201 ---VTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 201 ---~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) .+....++...++++++.+|+++++|+..+. T Consensus 284 ~~f~~~~~~~r~~~~~d~~v~~~~a~~~l~~~~a 317 (320) T protein:vir:10 284 SLWQHNLVAVRVEAEYAFHNNDKDAFVKLTNVVT 317 (320) T ss_pred hhhhcCcEEEEEEEeeccEEecccceEEEEeccC Confidence 2345678888999999999999999986666 No 94 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=99.87 E-value=3.7e-24 Score=149.08 Aligned_cols=227 Identities=12% Similarity=0.111 Sum_probs=165.1 Q ss_pred CC-------Ccc-cCceEEeccccCC--cccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHH Q lcl|Aclame:pro 1 EN-------GIN-LANLCEYPNDIGD--AADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQ 70 (231) Q Consensus 1 ~~-------~~~-~G~ti~~P~~igd--a~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~ 70 (231) ++ -+. .+.++++|.+.+. +..++||+++|..++++++.++.+++++..+.+|+|...++..|+.+...++ T Consensus 46 ~s~i~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~v~iS~ell~~s~~~~~~~i~~~ 125 (326) T protein:vir:42 46 ISIVQQFAQKIPMGTTGQKIPHWTGDVSASWIGEGDMKPITKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTK 125 (326) T ss_pred cchhhhhcceeeccCCceEEEEEeCCcceEEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHH Confidence 11 111 3567889987544 5579999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhccc---------c-----c----ccccccCHHH--HHHHHHHhhccCCCceEEEECHHHHHHH Q lcl|Aclame:pro 71 LGLSLANKVDDDLLKAAKTT---------S-----Q----TVSTKANVDG--VQAALDIFNDEDAQAYVLIVNPKDAAKI 130 (231) Q Consensus 71 ~a~~ia~~vd~~~~~~l~t~---------~-----~----~~~~~~~~d~--i~da~~~l~~~~~~~~v~vv~p~~~~~L 130 (231) +++++++++|+.++.+-.+. . . ..+...++.+ +.++...+........+++|||..+.+| T Consensus 126 l~~a~~~~~d~a~l~G~gs~~p~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~n~~~~~~L 205 (326) T protein:vir:42 126 VATAFAMAFDNAAINGTDSPFPTFLAQTTKEVSLVDPDGTGSNADLTVYDAVAVNALSLLVNAGKKWTHTLLDDITEPIL 205 (326) T ss_pred HHHHHHHHHHHHhhcccCCCccccccccccccceeecccccccccchhHHHHHHHHHhhhhhhccCccEEEEeHHHHHHH Confidence 99999999999998643210 0 0 0111223333 3455555655566778899999999999 Q ss_pred HhhhhhhhccccccCceeee-----ccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchh---- Q lcl|Aclame:pro 131 RKDANAKNIGSEVGANALIN-----GTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIV---- 201 (231) Q Consensus 131 ~k~~~~~~~~~~~~~~~~~~-----G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~---- 201 (231) ++.++..+.. ...+...+ ...++++|+||++++.+|+++...+.-.+... .+...+++.++..++.. T Consensus 206 ~~lkd~~G~~--l~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~Gd~s~~--~~~~~~~~~v~~~~e~~~~~~ 281 (326) T protein:vir:42 206 NGAKDKSGRP--LFIESTYTEENSPFRLGRIVARPTILSDHVASGTVVGYQGDFRQL--VWGQVGGLSFDVTDQATLNLG 281 (326) T ss_pred HHhhccCCce--eeccccccCccccccCceeeeeeEEEcCCCCCCceEEEEeecceE--EEEEecceEEEEeecceeeec Confidence 9876544331 11111222 23468999999999999999876543333322 24455677776655532 Q ss_pred ------------hcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 202 ------------TKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 202 ------------~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) +....++..+++++++.+|+++++|+.++. T Consensus 282 ~~~~~~~~~~~~~d~~~~r~~~~~d~~v~~~~a~~~l~~~~~ 323 (326) T protein:vir:42 282 TPQAPNFVSLWQHNLVAVRVEAEYAFHCNDKDAFVKLTNVDA 323 (326) T ss_pred ccccccchhhhhcCcEEEEEEEEeccEEecccceEEEeeccc Confidence 345788999999999999999999999888 No 95 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=99.87 E-value=8.2e-24 Score=147.16 Aligned_cols=228 Identities=15% Similarity=0.077 Sum_probs=167.3 Q ss_pred CCCcccCceEEeccccCC--cccccCCCccCccccccceeEEEeehccceeeecHHHHHhcC---CCHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGD--AADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGY---GDPIGESNKQLGLSL 75 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igd--a~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~---~d~~~~~~~~~a~~i 75 (231) ..-...+..+++|.+.+. +.+++||+++|..++++++.++..+|.+..+.+|+|...++. .++.+...++++++| T Consensus 32 ~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai 111 (298) T protein:vir:94 32 AQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKV 111 (298) T ss_pred ceeeccCCceEEEEEecCcceEEeeCCccccccccceeEEEEeeeEEEEeeehhHHHhccCCccHHHHHHHHHHHHHHHH Confidence 111123356789987554 558999999999999999999999999999999999886554 467788999999999 Q ss_pred HHHHHHHHHHHhcccc-----------------c----ccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhh Q lcl|Aclame:pro 76 ANKVDDDLLKAAKTTS-----------------Q----TVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDA 134 (231) Q Consensus 76 a~~vd~~~~~~l~t~~-----------------~----~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~ 134 (231) ++++|..++.+..... . .......++++.++..++...+..+.+++|||..+..|++.+ T Consensus 112 ~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lk 191 (298) T protein:vir:94 112 ARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQK 191 (298) T ss_pred HHHHHHHhhcccccCCCcccccccccccccccccccccccccccHHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHhh Confidence 9999999997521100 0 011123478899999999888888889999999999999876 Q ss_pred hhhhccccccCceeeeccceeecceeEEEcCCCccCce---EEEEEecCCceEEEeecCCccceeccch----------h Q lcl|Aclame:pro 135 NAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSA---LMFKIVSNSPALKLVLKRGVQVETDRDI----------V 201 (231) Q Consensus 135 ~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~---~~~~~~~~~~A~~~~~k~~v~vE~~Rd~----------~ 201 (231) +..+ .+...+...+|..++++|+||++++.+|.+.. ..+.+.....++.+...+++.+|..++. . T Consensus 192 d~~G--~~l~~~~~~~~~~~tl~G~PV~~~~~v~~~~~~~~~~~~~Gdfs~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~ 269 (298) T protein:vir:94 192 DLQG--NALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKG 269 (298) T ss_pred ccCC--CeeecCcccCCCCceecceeeEEecccccccCCCccEEEEeeccceEEEEEecCceEEEeecCCCcCcchhhhh Confidence 6543 33333445567778999999999999986421 1111111234455666678888766642 2 Q ss_pred hcccEEEEEEEEEEEEEcCCcEEEEEecc Q lcl|Aclame:pro 202 TKTTVITADEHYAAYLYDLTKVVNITFTG 230 (231) Q Consensus 202 ~~~~~i~~~~~y~~~~~~~~~vv~l~~~~ 230 (231) +....+++..++++++.+|+++++|+.+= T Consensus 270 ~~~v~~r~~~r~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 270 YNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred cCcEEEEEEEEeccEeecccceEEEEecC Confidence 34457888999999999999999994433 No 96 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=99.87 E-value=7.9e-24 Score=147.23 Aligned_cols=228 Identities=15% Similarity=0.079 Sum_probs=167.5 Q ss_pred CCC--c-ccCceEEeccccCC--cccccCCCccCccccccceeEEEeehccceeeecHHHHHhc---CCCHHHHHHHHHH Q lcl|Aclame:pro 1 ENG--I-NLANLCEYPNDIGD--AADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSG---YGDPIGESNKQLG 72 (231) Q Consensus 1 ~~~--~-~~G~ti~~P~~igd--a~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~---~~d~~~~~~~~~a 72 (231) ..+ + -.+..+++|.+.+. +.+++||++++..++++++.++..+|.+..+.+|+|..+++ ..++.+...++++ T Consensus 29 ~l~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~f~~v~l~~~k~a~~~~iS~ell~~s~d~~~~l~~~i~~~la 108 (298) T protein:vir:16 29 RLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVEYGARISDEFMYASDEEKINILQEFNDGFA 108 (298) T ss_pred hhcceeeccCCceEEEEEecCcceEEecCCccccccccceeEEEEeeeeEEEeehhhHHHhhcCcccHHHHHHHHHHHHH Confidence 111 1 12344788987554 56899999999999999999999999999999999998655 4578888999999 Q ss_pred HHHHHHHHHHHHHHhccc---cc------------------ccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHH Q lcl|Aclame:pro 73 LSLANKVDDDLLKAAKTT---SQ------------------TVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIR 131 (231) Q Consensus 73 ~~ia~~vd~~~~~~l~t~---~~------------------~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~ 131 (231) ++|++++|..++.+.... .. .......+++|.+++.++...+.....++|||..+..|+ T Consensus 109 ~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~ 188 (298) T protein:vir:16 109 KKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALA 188 (298) T ss_pred HHHHHHHHHHhhccccCCCCcccccccccccccccccccccccccccHHHHHHHHHHHhhhcCCCccEEEEcHHHHHHHH Confidence 999999999999763211 00 001122477899999999888888889999999999999 Q ss_pred hhhhhhhccccccCceeeeccceeecceeEEEcCCCccCce---EEEEEecCCceEEEeecCCccceeccch-------- Q lcl|Aclame:pro 132 KDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSA---LMFKIVSNSPALKLVLKRGVQVETDRDI-------- 200 (231) Q Consensus 132 k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~---~~~~~~~~~~A~~~~~k~~v~vE~~Rd~-------- 200 (231) +.++..++ +.......+|..++++|+||++++.+|.+.. ..+.+-....++.+...++++++..++. T Consensus 189 ~lkd~~G~--~i~~~~~~~~~~~~l~G~PV~~~~~v~~~~~~~~~~~~~GDfs~~~~~~~~~~~~~~~~~~~~~~~~~~~ 266 (298) T protein:vir:16 189 KQKDLQDN--ALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLD 266 (298) T ss_pred HhhccCCC--eeecCcccCCCCceecceeeEEecccccccCCCccEEEEeeccceEEEEEecCceEEEeeccCCcCcchh Confidence 88765543 3333444567778999999999999986421 1111111244566666777777765542 Q ss_pred --hhcccEEEEEEEEEEEEEcCCcEEEEEecc Q lcl|Aclame:pro 201 --VTKTTVITADEHYAAYLYDLTKVVNITFTG 230 (231) Q Consensus 201 --~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~ 230 (231) .+....+++..++++++.+|+++++|+.+= T Consensus 267 ~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 267 LKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred hhhcCcEEEEEEEEEccEeecccceEEEeecC Confidence 234567888999999999999999995444 No 97 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=99.87 E-value=8.2e-24 Score=147.14 Aligned_cols=228 Identities=11% Similarity=0.013 Sum_probs=164.3 Q ss_pred CCCcccCceEEeccccCC--cccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCH----HHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGD--AADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDP----IGESNKQLGLS 74 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igd--a~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~----~~~~~~~~a~~ 74 (231) ..-...+..++||++.|. +.+++||++++.++.++++.++..+|.+..+.||+|.+.++..+. .+...++++++ T Consensus 36 ~~i~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~f~~v~l~~~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~a 115 (315) T protein:vir:80 36 PEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGAS 115 (315) T ss_pred ceeecCCCceEEEEEeCCcceEEeeCCccccccccceeeeEeeeeeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHH Confidence 111233456899998665 558999999999999999999999999999999999998888774 47778999999 Q ss_pred HHHHHHHHHHHHhcccc----------------cccccccCHHHHHHHHHHhhccC-CCceEEEECHHHHHHHHhhhhhh Q lcl|Aclame:pro 75 LANKVDDDLLKAAKTTS----------------QTVSTKANVDGVQAALDIFNDED-AQAYVLIVNPKDAAKIRKDANAK 137 (231) Q Consensus 75 ia~~vd~~~~~~l~t~~----------------~~~~~~~~~d~i~da~~~l~~~~-~~~~v~vv~p~~~~~L~k~~~~~ 137 (231) |++++|..++++-.... ....+...++++++++.++...+ .....++|||.++..|++..... T Consensus 116 i~~~~d~a~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~imn~~~~~~L~~l~~~~ 195 (315) T protein:vir:80 116 IGRAVDLIAFHGIDPATGKAASAVHTSLNKTKNIVDATDSATADLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPK 195 (315) T ss_pred HHHHHhhheeeccCCCCCccccccccccccccceeeccccchHHHHHHHHHHhhccCccceEEEEcHHHHHHHHHHhhcc Confidence 99999999886532100 00112345888999998886554 34568999999999999886554 Q ss_pred hccc---cccCceeeeccceeecceeEEEcCCCccCce-------EEEEEecCCceEEEeecCCccceeccch------- Q lcl|Aclame:pro 138 NIGS---EVGANALINGTYADVLGAQIVRSKKLAEGSA-------LMFKIVSNSPALKLVLKRGVQVETDRDI------- 200 (231) Q Consensus 138 ~~~~---~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~-------~~~~~~~~~~A~~~~~k~~v~vE~~Rd~------- 200 (231) .... +.. ..+..|..++++|.||++|++||.+.. ..+.-.++... +...+++.+|..++. T Consensus 196 g~~~~g~~~~-~~~~~g~~~tl~G~PV~~~~~~~~~~~~~~~~~~~~~~GDfs~~~--~g~~~~~~i~i~~~~~~~~~~~ 272 (315) T protein:vir:80 196 GSPLAGQPMY-PAAGFAGLDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVH--WGFQRNFPIELIEYGDPDQTGR 272 (315) T ss_pred CCcccccccc-cccccCCCceecceeeEecCcCCcccccccccccEEEEeecccEE--EEEecCeeEEEeccccccCccc Confidence 3221 111 123345568999999999999986532 11111222222 334566677766553 Q ss_pred ---hhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 201 ---VTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 201 ---~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) .+....+++..++++++.+|+++++|+.++- T Consensus 273 ~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~a 306 (315) T protein:vir:80 273 DLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAA 306 (315) T ss_pred chhhcCcEEEEEEEEecceeecccceEEEeeccC Confidence 2445688889999999999999999998886 No 98 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=99.86 E-value=1.5e-23 Score=145.72 Aligned_cols=225 Identities=20% Similarity=0.178 Sum_probs=167.3 Q ss_pred CCCcccCceEEecccc------CCcccccCCCccCcccc-ccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDI------GDAADVAEGGEISLDKI-GTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGL 73 (231) Q Consensus 1 ~~~~~~G~ti~~P~~i------gda~~v~EG~~i~~~~l-t~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~ 73 (231) .-..-.|.++++|..+ +.+.+++||++++..++ +++++++.+++++..+.||++.+.++ +++.+...+++++ T Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds-~~l~~~i~~~la~ 231 (413) T protein:vir:81 153 DNLTMTNTTIKYLMEKANRVVEGGFKTVAEGGKKPYMRFADFDIVTESLSKIAGLTKITDEMIEDY-DFLVSYINARLLE 231 (413) T ss_pred ceeeccCCceeEEEeccccccccccceecCcccccccCcccceeeEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHH Confidence 1112234557777542 34678999999998775 79999999999999999999987776 5688889999999 Q ss_pred HHHHHHHHHHHHHhccc---------cc-----ccccccCHHHHHHHHHHhhcc-CCCceEEEECHHHHHHHHhhhhhhh Q lcl|Aclame:pro 74 SLANKVDDDLLKAAKTT---------SQ-----TVSTKANVDGVQAALDIFNDE-DAQAYVLIVNPKDAAKIRKDANAKN 138 (231) Q Consensus 74 ~ia~~vd~~~~~~l~t~---------~~-----~~~~~~~~d~i~da~~~l~~~-~~~~~v~vv~p~~~~~L~k~~~~~~ 138 (231) ++++++|+.++.+-.+. +. ..++...++.+.+++..+... .+....++|||.++..|++.++..+ T Consensus 232 ~~~~~~d~~~l~G~G~~~~~~Gi~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G 311 (413) T protein:vir:81 232 ELAIEEERQLLLGDGTGNNLTGLLKRDGIQTLAVSNKDELADSIYKAMTNISLATPFQADALVINPLDYQELRLAKDANG 311 (413) T ss_pred HHHHHHHHHHhccCCCCCcccccccccccccccccccchhHHHHHHHHHHhhhhccCCCcEEEEcHHHHHHHHHhhccCC Confidence 99999999998753221 11 112334678888888766543 3566679999999999998776544 Q ss_pred ccccccCcee-------eeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccch----hhcccEE Q lcl|Aclame:pro 139 IGSEVGANAL-------INGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDI----VTKTTVI 207 (231) Q Consensus 139 ~~~~~~~~~~-------~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~----~~~~~~i 207 (231) +.- ..+.+ ..+..++++|+||++|+++|.++.++.++ ..++.++.+.+++++.++.. .+....+ T Consensus 312 ~~l--~~~~~~~~~~~~~~~~~~~l~G~pv~~s~~~~~~~~~~gd~---~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~ 386 (413) T protein:vir:81 312 QYY--GGGVFQGQYGSGGIMLDPAPWGLRTVQSQVVPVGKPVVGAF---RSAASVLRKGGVRIDSTNTNVDDFENNLITV 386 (413) T ss_pred cee--ccccccccccccccccCceecceeeEEcCCCCcccEEEEec---ccEEEEEEecceEEEEeccccchhhcCcEEE Confidence 321 11111 11233589999999999999998766553 34566777788889887764 4566789 Q ss_pred EEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 208 TADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 208 ~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) ++..+|++.+.+|++++++++++. T Consensus 387 r~~~r~d~~~~~~~a~~~l~~~~~ 410 (413) T protein:vir:81 387 RAEERVGLMVTFPEAIVQLDVAEV 410 (413) T ss_pred EEEEeeccEEecccceEEEEecCC Confidence 999999999999999999999998 No 99 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=99.86 E-value=1.2e-23 Score=146.27 Aligned_cols=229 Identities=14% Similarity=0.115 Sum_probs=168.1 Q ss_pred CCCc--------ccCceEEeccccCCc----------ccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCC Q lcl|Aclame:pro 1 ENGI--------NLANLCEYPNDIGDA----------ADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGD 62 (231) Q Consensus 1 ~~~~--------~~G~ti~~P~~igda----------~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d 62 (231) ++.+ -.+..+++|.+.+++ ..+.|++.++..++++++.++..+|.+..+.+|+|.+.++..+ T Consensus 43 ~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~eg~~~~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~s~~~ 122 (333) T protein:vir:78 43 SSLVLRMGEQIPISYGETIIPTTVKRPEVGQVGVGTSNEQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFARMNPSG 122 (333) T ss_pred hchhhhhcceeeccCCceEEEEEeCCceeEeecCcccccccccccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHH Confidence 1111 134567888765443 3456667788999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccc---------------------ccccccccCHHHHHHHHHHhhccC-CCceEE Q lcl|Aclame:pro 63 PIGESNKQLGLSLANKVDDDLLKAAKTT---------------------SQTVSTKANVDGVQAALDIFNDED-AQAYVL 120 (231) Q Consensus 63 ~~~~~~~~~a~~ia~~vd~~~~~~l~t~---------------------~~~~~~~~~~d~i~da~~~l~~~~-~~~~v~ 120 (231) +.+...++++++|++++|..++.+-.+. ........++++|++++..+.... ....++ T Consensus 123 ~~~~i~~~la~ai~~~~d~~~l~G~g~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ 202 (333) T protein:vir:78 123 LYTKLQGDLAYAIGRGIDLAVFHGKSPLTGSALQGIDTDNVIANTTNVDYLQETGDPLLDRLLDGYDLVSANTDVEFNGW 202 (333) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcccCCCCCcccccccccccccccccccccccccchhHHHHHHHHHhhccccccCceEE Confidence 9999999999999999999998643211 011223457999999998886543 466789 Q ss_pred EECHHHHHHHHhhhhhhhc-cccccCceeeeccceeecceeEEEcCCCccCce-------EEEEEecCCceEEEeecCCc Q lcl|Aclame:pro 121 IVNPKDAAKIRKDANAKNI-GSEVGANALINGTYADVLGAQIVRSKKLAEGSA-------LMFKIVSNSPALKLVLKRGV 192 (231) Q Consensus 121 vv~p~~~~~L~k~~~~~~~-~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~-------~~~~~~~~~~A~~~~~k~~v 192 (231) +|||..+..|++.....+. +.+........|..++++|+||++|+++|.+.. ..+...+. -+.+...+++ T Consensus 203 vmn~~~~~~L~~~~~~~d~~G~~i~~~~~~~~~~~~l~G~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~--~~~~g~~~~~ 280 (333) T protein:vir:78 203 AVDPRFRAHLLRAQAYRDANGNVDPSRINLAAQTGDVLGLPAQFGRAVGGDLGAAVDSKTRIIGGDFS--QLKFGFADEI 280 (333) T ss_pred EEcchHHHHHHHHhhhcCCCCceeecCccccCCCceeeceeeEEccccCCCccccCCCccEEEEEecc--cEEEEEeecc Confidence 9999999999876544332 233333445567778999999999999996532 12222222 2445666788 Q ss_pred cceeccch-------------hhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 193 QVETDRDI-------------VTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 193 ~vE~~Rd~-------------~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) +++.+++. .+....+++.+++++++.+|+++++|+.+.- T Consensus 281 ~i~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~a 332 (333) T protein:vir:78 281 RIKMSDTATLTDSGSATVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVDDEQ 332 (333) T ss_pred EEEEeccccccccccceeehhhcCcEEEEEEEEEccEEecccceEEEeccCC Confidence 88877763 2345678899999999999999999987777 No 100 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=99.86 E-value=3.9e-25 Score=154.38 Aligned_cols=230 Identities=13% Similarity=0.062 Sum_probs=175.8 Q ss_pred CCCcccCceEEeccccCCc--ccccCCCccCccccccceeEEEeehccce-eeecHHHHHhcCCC-HHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGDA--ADVAEGGEISLDKIGTTTKSVTIKKAAKG-TEITDEAALSGYGD-PIGESNKQLGLSLA 76 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igda--~~v~EG~~i~~~~lt~~~~~~tikk~g~~-~~itD~~~~~~~~d-~~~~~~~~~a~~ia 76 (231) .--+++|+|++||. +|.. ..+..|+++..+.+..++.+++|+..-.. +.|.|.+..++..| +-.+..+++++++| T Consensus 46 vRti~~gkS~qf~~-~G~s~~~~~~pG~~ld~~~~~~dK~~ItID~lL~a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA 124 (401) T protein:vir:70 46 VQTVTGTNTVSNKY-LGETELQVLAPGQSPAATSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLK 124 (401) T ss_pred eeeecccceEEEEE-eeeeEeeeecCCCCcCCCCcccccEEEEeCceeehhhhhhhHHHHHhcccccchHHHHHHHHHHH Confidence 33489999999975 6765 57999999999999999999999998774 89999999999999 88899999999999 Q ss_pred HHHHHHHHHHhccc-----------cc--------cc-----cccc----CHHHHHHHHHHhhccC--CCceEEEECHHH Q lcl|Aclame:pro 77 NKVDDDLLKAAKTT-----------SQ--------TV-----STKA----NVDGVQAALDIFNDED--AQAYVLIVNPKD 126 (231) Q Consensus 77 ~~vd~~~~~~l~t~-----------~~--------~~-----~~~~----~~d~i~da~~~l~~~~--~~~~v~vv~p~~ 126 (231) +..|+.++..+..+ +. .+ ...+ ..+++.+|.+.|.+.+ ....+++++|.. T Consensus 125 ~~~Dq~iiq~i~~aa~ana~~~~~~p~~~~~G~~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~~r~vvl~pp~~ 204 (401) T protein:vir:70 125 RMEDEMLIQQMMLGGIANTQAKRTNPRVKGHGFSINVEVAEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRY 204 (401) T ss_pred HHHHHHHHHHHHHhccccccccccCCCcCCCceEEeccccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHH Confidence 99999886654211 00 00 0111 3456778888887655 344455556666 Q ss_pred HHHHHhhhhhhhc-cccccCceeeeccceeecceeEEEcCCCccCc---------------eEE--------EEEecCCc Q lcl|Aclame:pro 127 AAKIRKDANAKNI-GSEVGANALINGTYADVLGAQIVRSKKLAEGS---------------ALM--------FKIVSNSP 182 (231) Q Consensus 127 ~~~L~k~~~~~~~-~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~---------------~~~--------~~~~~~~~ 182 (231) |+.|++-++..++ ....+++...+|.+..+.|+||+.||++|.+. .|. .-+++.+. T Consensus 205 Ys~Ll~~d~L~nrd~~~s~~g~~~~G~v~~vaGv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~ 284 (401) T protein:vir:70 205 FNVLRDADRIVDKTYTISQSGATIQGFTLSSYNCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTAD 284 (401) T ss_pred HHHHHhcCcccchhhccccCCccccceEEEEeceEEEeeccccccccccccccccccCCCccCCCCccccceeEEEEehh Confidence 6677765444333 23344566788999999999999999998532 221 12455677 Q ss_pred eEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 183 ALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 183 A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) |++.+.-.+++.|.+||+.++.+.|...+.||....+|+++..++-+.+ T Consensus 285 Av~tvk~~~lt~~~~~d~r~~~~~id~~~a~g~g~~RPeaa~vv~~k~~ 333 (401) T protein:vir:70 285 ALLVGRSIDVTGDIFYEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRN 333 (401) T ss_pred heEEEEeeccccchhhhhhhhHHHHHHHHHhCCcccchhheEEEeecCc Confidence 8888888899999999999999999999999999999999998854444 No 101 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=99.86 E-value=2e-23 Score=145.05 Aligned_cols=231 Identities=15% Similarity=0.146 Sum_probs=167.6 Q ss_pred CCCcccCceEEeccccC----------CcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIG----------DAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQ 70 (231) Q Consensus 1 ~~~~~~G~ti~~P~~ig----------da~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~ 70 (231) ..---.|..+++|.+.+ .+.+++||++++.+++++++.+++.+|.+..+.+|+|.+.++..|+.+...++ T Consensus 51 ~~~~~~~~~~~ip~~~~~~~a~~v~~~~~~~~~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~ 130 (338) T protein:vir:78 51 ENIPISYGETIIPTTVKRPEVGQVGVGTSNEQREGGTKPLSGTAWDTRSVAPIKLATIVTVSEEFARMNPSGLYTKLQAD 130 (338) T ss_pred ceeeccCCceEEEEEecCccceeecccccccccccccccccccceeEEEEEEEEEEEeehhhHHHHhcCHHHHHHHHHHH Confidence 11112356788887543 34468899999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhcccc---------------c------ccccccCHHHHHHHHHHhhcc-CCCceEEEECHHHHH Q lcl|Aclame:pro 71 LGLSLANKVDDDLLKAAKTTS---------------Q------TVSTKANVDGVQAALDIFNDE-DAQAYVLIVNPKDAA 128 (231) Q Consensus 71 ~a~~ia~~vd~~~~~~l~t~~---------------~------~~~~~~~~d~i~da~~~l~~~-~~~~~v~vv~p~~~~ 128 (231) ++++|++++|..++.+-.+.. . .......++.+.++...+... .....+++|||..+. T Consensus 131 la~a~~~~~d~~~l~G~g~~~~~~~~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~ 210 (338) T protein:vir:78 131 LAYAIGRGIDLAVFHGKSPLTGSALQGIDTNNVIVNTTNVDYLQTGTTPLLDRFLDGYDLVSANTDVDFNGWAADPRYRA 210 (338) T ss_pred HHHHHHHHHHHHhhcccCCCccccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccccceEEEEchHHHH Confidence 999999999999997543210 0 011234688898888887543 346778999999999 Q ss_pred HHHhhhhhhhc-cccccCceeeeccceeecceeEEEcCCCccCce-----EEEEEecCCceEEEeecCCccceeccchh- Q lcl|Aclame:pro 129 KIRKDANAKNI-GSEVGANALINGTYADVLGAQIVRSKKLAEGSA-----LMFKIVSNSPALKLVLKRGVQVETDRDIV- 201 (231) Q Consensus 129 ~L~k~~~~~~~-~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~-----~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~- 201 (231) .|++.....+. +.+........|.-++++|+||+++++||.... -...++.....+.+...+++.++..|+.. T Consensus 211 ~L~~~~~l~d~~g~~l~~~~~~~~~~~~l~G~PV~~~~~ip~~~~~~~~~~~~~~~gdfs~~~~~~~~~~~i~~~~~~~~ 290 (338) T protein:vir:78 211 RLLRSQAYRDANGNVDPTRINLAASAGDLLGLPVQFGKAVGGDLGAATDSKVRVVGGDFSQLKYGFADEIRVKMSDTATL 290 (338) T ss_pred HHHHHhhhccCCCceeecccccCCCCceeeeeeEEEccccCccccccCCcccEEEEEecceEEEEeecccEEEEeecccc Confidence 98775443332 223333445567778999999999999985321 01111122223446667788888777642 Q ss_pred ---------------hcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 202 ---------------TKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 202 ---------------~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) +....++...++++++++|+++++|+.+.= T Consensus 291 ~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 335 (338) T protein:vir:78 291 TDNTSPTPQTVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVDDED 335 (338) T ss_pred cccccccccchhhhhcCcEEEEEEEEeccEeecccceEEEecccC Confidence 344678889999999999999999977555 No 102 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=99.85 E-value=3.1e-23 Score=144.00 Aligned_cols=227 Identities=14% Similarity=0.068 Sum_probs=170.9 Q ss_pred CCCcccCceEEeccc--cCCcccccCCCccCc-cccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--IGDAADVAEGGEISL-DKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--igda~~v~EG~~i~~-~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) ..-..+.-++.+|+. .+.+.+++||++++. +++++++.+++.++.+..+.+|++...++..|+.+...+++++++++ T Consensus 128 ~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~a~~~ 207 (371) T protein:vir:81 128 EPVTTLSGSRVFKKRSQQTGFVEVAEGAAIGEKATPQFTLLQYQVKKYAGFFRVTNELLNDSTEAIVNTLVRWIGDESRV 207 (371) T ss_pred eeccCCceeEEEEeecCCcceeeeccccccccccccceeeEEeeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHH Confidence 111111123445544 345678999999974 67999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcccccccccccCHHHHHHHHH-HhhccCCCceEEEECHHHHHHHHhhhhhhhccccccCceeeeccceee Q lcl|Aclame:pro 78 KVDDDLLKAAKTTSQTVSTKANVDGVQAALD-IFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADV 156 (231) Q Consensus 78 ~vd~~~~~~l~t~~~~~~~~~~~d~i~da~~-~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~ 156 (231) ++|..++++..+. ...+..+++++.++.. .+........+++|||..+..|++.++..+. +....-+..|..+++ T Consensus 208 ~~~~~i~~g~g~~--~~~~~~~~~~i~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~--~l~~~~~~~~~~~~l 283 (371) T protein:vir:81 208 TRNGLIINVLNTK--AKTAIADLDGLKQIINVQLDPVFRSTSSVIVNQDAFNWLDTLKDQNGQ--YLLQPSISSPTGRQL 283 (371) T ss_pred HHHHHHHhhcccc--cccccccHHHHHHHHHhhcchhhhcCCEEEEcHHHHHHHHHhhccCCC--eeeecccCCCCCcee Confidence 9999999876543 3455678999988775 4544445667899999999999987655433 222222455777899 Q ss_pred cceeEEEcCCCccCceEE---------EEEecCCceEEEeecCCccceeccch----hhcccEEEEEEEEEEEEEcCCcE Q lcl|Aclame:pro 157 LGAQIVRSKKLAEGSALM---------FKIVSNSPALKLVLKRGVQVETDRDI----VTKTTVITADEHYAAYLYDLTKV 223 (231) Q Consensus 157 ~G~~Vv~s~~~~~~~~~~---------~~~~~~~~A~~~~~k~~v~vE~~Rd~----~~~~~~i~~~~~y~~~~~~~~~v 223 (231) +|.||++++.+|.+.... +.+..-..++.++.+.+++++.++.. .+....+++..+|++++.+|+++ T Consensus 284 ~G~pV~~~~~~~~~~~~~~~~~~~~~~i~~Gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~v~~~~~~r~d~~~~~~~a~ 363 (371) T protein:vir:81 284 LGLPVVIVSNKVLANRVDGGTGAQFAPIIVGDLKEAVVMFDRQRTEIMSSNVAMDAFETDATLWRAIERMDVKMRDDEAF 363 (371) T ss_pred cceeEEEecccccCccccccccCCcceEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccce Confidence 999999999998543211 11111234566677788888877654 35677999999999999999999 Q ss_pred EEEEeccC Q lcl|Aclame:pro 224 VNITFTGV 231 (231) Q Consensus 224 v~l~~~~~ 231 (231) +++++++- T Consensus 364 ~~~~~~~A 371 (371) T protein:vir:81 364 VFGEVQLA 371 (371) T ss_pred EEEEEecC Confidence 99999988 No 103 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=99.85 E-value=3.3e-23 Score=143.85 Aligned_cols=228 Identities=13% Similarity=0.016 Sum_probs=170.2 Q ss_pred CCCcccCceEEeccc---cCCcccccCCCccCcc-ccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND---IGDAADVAEGGEISLD-KIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLA 76 (231) Q Consensus 1 ~~~~~~G~ti~~P~~---igda~~v~EG~~i~~~-~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia 76 (231) .+-++.--.+.+|.. .+.+.+++||++++.. .+++++.+++.++++..+.+|++...++..|+.+...++++++++ T Consensus 144 ~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~ 223 (395) T protein:vir:38 144 ENVTTSHGSRVYEKLADITPLKDLDDESALIGDNDDPELTVVKYLIHRYAGITTVTNTLLKDTVDNIIQWLVNWAAKKDV 223 (395) T ss_pred eeccCCcceEEEEeeccCCccccccccccccccccccceeeEEeeeeeeEeehhhHHHHHhhhHHHHHHHHHHHHHHHHH Confidence 111111112223321 2345689999999854 689999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcccccccccccCHHHHHHHHH-HhhccCCCceEEEECHHHHHHHHhhhhhhhccccccCceeeecccee Q lcl|Aclame:pro 77 NKVDDDLLKAAKTTSQTVSTKANVDGVQAALD-IFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYAD 155 (231) Q Consensus 77 ~~vd~~~~~~l~t~~~~~~~~~~~d~i~da~~-~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~ 155 (231) +++|..++....+.. ...+..++++|.+++. .+........+++|||.++..|++..+..++ +....-+.+|..++ T Consensus 224 ~~~~~~il~g~g~~~-~~~~~~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~--~l~~~~~~~~~~~~ 300 (395) T protein:vir:38 224 VTRNAKILEVMGKAP-KKPTISQFDNIKDLENNTLDPAIESTSSFITNQSGYNILSKVKDADGR--YLMQPDVTSPDKYL 300 (395) T ss_pred HHHHHHHhhcccccc-cccccccHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCc--eeeccCcCCCCcce Confidence 999999998765543 3345668999998875 4554445667899999999999987765543 22222345677789 Q ss_pred ecceeEEEcCCCccCc--eE-EEEEecCCceEEEeecCCccceeccch----hhcccEEEEEEEEEEEEEcCCcEEEEEe Q lcl|Aclame:pro 156 VLGAQIVRSKKLAEGS--AL-MFKIVSNSPALKLVLKRGVQVETDRDI----VTKTTVITADEHYAAYLYDLTKVVNITF 228 (231) Q Consensus 156 ~~G~~Vv~s~~~~~~~--~~-~~~~~~~~~A~~~~~k~~v~vE~~Rd~----~~~~~~i~~~~~y~~~~~~~~~vv~l~~ 228 (231) ++|.||+++++++.+. .- .+.+.....++.++.++++.++..++. .+....++...+|++++.+|++++++++ T Consensus 301 l~G~pV~~~~~~~~~~~~~~~~i~~gd~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~ 380 (395) T protein:vir:38 301 IDGKPVIRIADKWLPDVSGSHPLYFGDLKQGITLFDRQQMQIDTTNVGAGSFEHDTTKLRFIDRFDVQLIDDGAFAAASF 380 (395) T ss_pred eccceeEEecccccCcCCCcceEEEEeccccEEEEEecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEe Confidence 9999999998765332 11 111222244677788889999888754 3567789999999999999999999999 Q ss_pred ccC Q lcl|Aclame:pro 229 TGV 231 (231) Q Consensus 229 ~~~ 231 (231) +++ T Consensus 381 ~~~ 383 (395) T protein:vir:38 381 KTV 383 (395) T ss_pred ecc Confidence 998 No 104 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=99.85 E-value=4.4e-23 Score=143.14 Aligned_cols=227 Identities=11% Similarity=0.028 Sum_probs=163.3 Q ss_pred CCCcccCceEEeccccCC--cccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCC---CHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGD--AADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYG---DPIGESNKQLGLSL 75 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igd--a~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~---d~~~~~~~~~a~~i 75 (231) ..-.-.+..+++|.+.+. +.+++||+++|..++++++.++..+|.+..+.+|+|...++.. ++.+...++++++| T Consensus 34 ~~i~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai 113 (311) T protein:vir:81 34 MAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVAL 113 (311) T ss_pred ceeecCCCceEEEEEeCCceeEEeecCcccccccceeeEEEEeeEEEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHH Confidence 111123346899988554 5689999999999999999999999999999999998875544 57888999999999 Q ss_pred HHHHHHHHHHHhccccc-------------------cccc-ccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhh Q lcl|Aclame:pro 76 ANKVDDDLLKAAKTTSQ-------------------TVST-KANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDAN 135 (231) Q Consensus 76 a~~vd~~~~~~l~t~~~-------------------~~~~-~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~ 135 (231) ++++|..++.+...... .... ...+..+.++..++...+..+..++|||..+..|++.++ T Consensus 114 ~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd 193 (311) T protein:vir:81 114 GRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRD 193 (311) T ss_pred HHHHHHhhhccccCCCCcccccccccccccceeeeecccccchHHHHHHHHHHHhhhcCCCceEEEEcHHHHHHHHhhhc Confidence 99999999976321100 0111 123456777888887777788889999999999999866 Q ss_pred hhhccccccCceeeeccceeecceeEEEcCCCccCceE----------------EEEEecCCceEEEeecCCccceeccc Q lcl|Aclame:pro 136 AKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSAL----------------MFKIVSNSPALKLVLKRGVQVETDRD 199 (231) Q Consensus 136 ~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~----------------~~~~~~~~~A~~~~~k~~v~vE~~Rd 199 (231) ..+.. ...+....|..++++|.||++++.||.+... .+...+. -+.+...+++++|..++ T Consensus 194 ~~G~~--l~~~~~~~~~~~tl~G~Pv~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~gDfs--~~~i~~~~~~~~~~~~~ 269 (311) T protein:vir:81 194 SQGRK--LYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFS--AFRWGVQVSIPLELIEF 269 (311) T ss_pred cCCCe--eecCccccCCCceecceeEEecccccccccccccccchhcccCCccEEEEEecc--cEEEEEeccceEEEecc Confidence 54432 2233344566789999999999999864321 1111111 23344556777777665 Q ss_pred h---------hhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 200 I---------VTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 200 ~---------~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) . .+....+++..++++++.+|+++++|+.+-- T Consensus 270 ~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~a~~ 310 (311) T protein:vir:81 270 GDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADE 310 (311) T ss_pred CCCCcchhhhhcCcEEEEEEEEeccEeecccceEEEEeecc Confidence 4 2345678888999999999999999965444 No 105 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=99.85 E-value=3.4e-23 Score=143.75 Aligned_cols=228 Identities=17% Similarity=0.152 Sum_probs=163.4 Q ss_pred CCCcccCceEEeccccC---CcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIG---DAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~ig---da~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) -.-...+..+++|..++ .+.+++||+.+|..++++++.++.+++.+..+.||++...++ +++.+...++++++|++ T Consensus 186 ~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~a~~~~iS~ell~d~-~~l~~~i~~~l~~~i~~ 264 (497) T protein:vir:78 186 SSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQR 264 (497) T ss_pred cccccCCCceEEEEEcCCCCcceeeccCcccccccccceeeEeeeeeeEeecHhHHHHHHhH-HHHHHHHHHHHHHHHHH Confidence 22223455789997643 467999999999999999999999999999999999988776 57899999999999999 Q ss_pred HHHHHHHHHhccc--------cc--ccc---------------------------------------------------- Q lcl|Aclame:pro 78 KVDDDLLKAAKTT--------SQ--TVS---------------------------------------------------- 95 (231) Q Consensus 78 ~vd~~~~~~l~t~--------~~--~~~---------------------------------------------------- 95 (231) ++|..++.+-.+. +. ..+ T Consensus 265 ~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 344 (497) T protein:vir:78 265 KEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGV 344 (497) T ss_pred HHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccch Confidence 9999998742110 00 000 Q ss_pred ------cccCHHHHHHHHHHhhccC-CCceEEEECHHHHHHHHhhhhhhhccccccC----ceeeeccceeecceeEEEc Q lcl|Aclame:pro 96 ------TKANVDGVQAALDIFNDED-AQAYVLIVNPKDAAKIRKDANAKNIGSEVGA----NALINGTYADVLGAQIVRS 164 (231) Q Consensus 96 ------~~~~~d~i~da~~~l~~~~-~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~----~~~~~G~ig~~~G~~Vv~s 164 (231) .....+.+.+++..+.... ..+..++|||.++..|++.++..+++-.... .....+.-++++|+||+++ T Consensus 345 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t 424 (497) T protein:vir:78 345 AGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTT 424 (497) T ss_pred hccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEec Confidence 0001112223333332222 3456899999999999998776554321111 1111223358999999999 Q ss_pred CCCccCceEEEEEecCCceEEEeecCCccceeccc----hhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 165 KKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRD----IVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 165 ~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd----~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) +.||.++.+..++ +..++.++.+.++.++..+. -.+....+++..++++.+.+|+++++++++++ T Consensus 425 ~~~~~~~~~~Gd~--~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~ 493 (497) T protein:vir:78 425 PLIPLGTILVGHF--APSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) T ss_pred CCCCCCceEEeec--ccceEEEEEecccEEEeecccchhhhcCcEEEEEEEeecceeeccccEEEEEecCC Confidence 9999998765443 45677888888888876643 24567789999999999999999999999999 No 106 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=99.85 E-value=3.4e-23 Score=143.75 Aligned_cols=228 Identities=17% Similarity=0.152 Sum_probs=163.4 Q ss_pred CCCcccCceEEeccccC---CcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIG---DAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~ig---da~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) -.-...+..+++|..++ .+.+++||+.+|..++++++.++.+++.+..+.||++...++ +++.+...++++++|++ T Consensus 186 ~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~a~~~~iS~ell~d~-~~l~~~i~~~l~~~i~~ 264 (497) T protein:vir:10 186 SSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQR 264 (497) T ss_pred cccccCCCceEEEEEcCCCCcceeeccCcccccccccceeeEeeeeeeEeecHhHHHHHHhH-HHHHHHHHHHHHHHHHH Confidence 22223455789997643 467999999999999999999999999999999999988776 57899999999999999 Q ss_pred HHHHHHHHHhccc--------cc--ccc---------------------------------------------------- Q lcl|Aclame:pro 78 KVDDDLLKAAKTT--------SQ--TVS---------------------------------------------------- 95 (231) Q Consensus 78 ~vd~~~~~~l~t~--------~~--~~~---------------------------------------------------- 95 (231) ++|..++.+-.+. +. ..+ T Consensus 265 ~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 344 (497) T protein:vir:10 265 KEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGV 344 (497) T ss_pred HHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccch Confidence 9999998742110 00 000 Q ss_pred ------cccCHHHHHHHHHHhhccC-CCceEEEECHHHHHHHHhhhhhhhccccccC----ceeeeccceeecceeEEEc Q lcl|Aclame:pro 96 ------TKANVDGVQAALDIFNDED-AQAYVLIVNPKDAAKIRKDANAKNIGSEVGA----NALINGTYADVLGAQIVRS 164 (231) Q Consensus 96 ------~~~~~d~i~da~~~l~~~~-~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~----~~~~~G~ig~~~G~~Vv~s 164 (231) .....+.+.+++..+.... ..+..++|||.++..|++.++..+++-.... .....+.-++++|+||+++ T Consensus 345 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t 424 (497) T protein:vir:10 345 AGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTT 424 (497) T ss_pred hccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEec Confidence 0001112223333332222 3456899999999999998776554321111 1111223358999999999 Q ss_pred CCCccCceEEEEEecCCceEEEeecCCccceeccc----hhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 165 KKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRD----IVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 165 ~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd----~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) +.||.++.+..++ +..++.++.+.++.++..+. -.+....+++..++++.+.+|+++++++++++ T Consensus 425 ~~~~~~~~~~Gd~--~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~ 493 (497) T protein:vir:10 425 PLIPLGTILVGHF--APSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) T ss_pred CCCCCCceEEeec--ccceEEEEEecccEEEeecccchhhhcCcEEEEEEEeecceeeccccEEEEEecCC Confidence 9999998765443 45677888888888876643 24567789999999999999999999999999 No 107 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=99.85 E-value=3.1e-23 Score=144.00 Aligned_cols=227 Identities=14% Similarity=0.022 Sum_probs=171.4 Q ss_pred CCCcccCceEEecccc--CCcccccCCCccCc-cccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDI--GDAADVAEGGEISL-DKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~i--gda~~v~EG~~i~~-~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) .+-...--.+.+|... +.+.+++||++++. +.+++++.++..++.+..+.+|++.+.++..++.+...+++++++++ T Consensus 160 ~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~is~e~l~ds~~~l~~~i~~~l~~~~~~ 239 (397) T protein:vir:12 160 EPVTTRSGTRLLEKNADMVPFSPVEELGNLPEIDQPRFTKVSYSIIDYGGIMTLSNSMLNDSDQAIMTYVAKWFAKKSVV 239 (397) T ss_pred eeccCCceeEEEEEecCCcceeeecccccccccccccceeEEeeheeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHH Confidence 1111111245566543 44678999999985 56899999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcccccccccccCHHHHHHHHH-HhhccCCCceEEEECHHHHHHHHhhhhhhhccccccCceeeeccceee Q lcl|Aclame:pro 78 KVDDDLLKAAKTTSQTVSTKANVDGVQAALD-IFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADV 156 (231) Q Consensus 78 ~vd~~~~~~l~t~~~~~~~~~~~d~i~da~~-~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~ 156 (231) ++|..++.+..+ ....+.+++++|.+++. .+.........++|||.++..|++.++..+. +....-+.+|..+++ T Consensus 240 ~~d~~il~G~g~--~~~~g~~~~~~i~~~~~~~l~~~~~~~a~~~~n~~~~~~L~~lkd~~G~--~l~~~~~~~g~~~~l 315 (397) T protein:vir:12 240 TRNNLILAAIAS--LKKVDIDGLDGIKKALNVTLDPMVAPGSIVLTNQDGYDWLDTLKDGTGR--YLLQPDPTNPTKKLL 315 (397) T ss_pred HHHHHHHhcccc--ccccccccHHHHHHHHhhccchhhhCCCEEEEcHHHHHHHHHhhccCCc--eeecccccCCCCccc Confidence 999999987654 34556778999999885 6655555677899999999999987655443 222223456777899 Q ss_pred cceeEEEcCCC-ccCce--EEEEEecCCceEEEeecCCccceeccch----hhcccEEEEEEEEEEEEEcCCcEEEEEec Q lcl|Aclame:pro 157 LGAQIVRSKKL-AEGSA--LMFKIVSNSPALKLVLKRGVQVETDRDI----VTKTTVITADEHYAAYLYDLTKVVNITFT 229 (231) Q Consensus 157 ~G~~Vv~s~~~-~~~~~--~~~~~~~~~~A~~~~~k~~v~vE~~Rd~----~~~~~~i~~~~~y~~~~~~~~~vv~l~~~ 229 (231) +|+||++++++ |.... ..+.+..-..++.++.++++.++.++.. .+....+++..++++++.+|+++++++++ T Consensus 316 ~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~t 395 (397) T protein:vir:12 316 DGRPVVPFTNRVLKTQKGKAPLIIGNLKEAIVLFDREQQSIASTDTGAGAFETNSTKVRGIEREDVRKWDEDAVVFGQIT 395 (397) T ss_pred cceeeEEecccccccCCCccEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEEe Confidence 99999987754 32211 1111212244666777788888877654 35677999999999999999999999999 Q ss_pred cC Q lcl|Aclame:pro 230 GV 231 (231) Q Consensus 230 ~~ 231 (231) += T Consensus 396 ~~ 397 (397) T protein:vir:12 396 VE 397 (397) T ss_pred eC Confidence 99 No 108 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=99.85 E-value=3.2e-23 Score=143.91 Aligned_cols=227 Identities=14% Similarity=0.118 Sum_probs=155.5 Q ss_pred CCCc--ccCceEEeccc-cCCccc-----ccCCCccCccccccceeEEEeehccc-eeeecHHHHHhcCCCHHHHHHHHH Q lcl|Aclame:pro 1 ENGI--NLANLCEYPND-IGDAAD-----VAEGGEISLDKIGTTTKSVTIKKAAK-GTEITDEAALSGYGDPIGESNKQL 71 (231) Q Consensus 1 ~~~~--~~G~ti~~P~~-igda~~-----v~EG~~i~~~~lt~~~~~~tikk~g~-~~~itD~~~~~~~~d~~~~~~~~~ 71 (231) |.-. ..||||+||.+ ...+.+ ..+|.++++++++.++.+++|+|..+ .|.++|++..+...|+.++..+++ T Consensus 34 ~~~~~~~~GdtV~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~id~~k~~~~~i~d~e~~~~~~~~~~~~~~~a 113 (392) T protein:vir:99 34 IGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQ 113 (392) T ss_pred ccccccCCCCeEEEeecccccceeeeccccccCCcccccccccceEEEEEeeeeecceeechHHHhhhhhhhHHHHHHHH Confidence 4333 56999999865 222222 45688899999999999999987655 799999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhccccc-------ccccccCHHHHHHHHHHhhccCC-CceEEEECHHHHHHHHhhhhhhhcccc- Q lcl|Aclame:pro 72 GLSLANKVDDDLLKAAKTTSQ-------TVSTKANVDGVQAALDIFNDEDA-QAYVLIVNPKDAAKIRKDANAKNIGSE- 142 (231) Q Consensus 72 a~~ia~~vd~~~~~~l~t~~~-------~~~~~~~~d~i~da~~~l~~~~~-~~~v~vv~p~~~~~L~k~~~~~~~~~~- 142 (231) +++|++++|.+++..+..++. ..+....|+.|+++...|++.+. +.|+++++|+.++.|++++.|...... T Consensus 114 ~~ala~~vd~~i~~~~~~a~~~~~~~~~~~~~~~~~~~i~~a~~~L~~~~vP~~R~~vv~p~~~~~l~~~~~~~~~~~~g 193 (392) T protein:vir:99 114 VRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQG 193 (392) T ss_pred HHHHHHHHHHHHHHHHhcccccccccccccChhhhHHHHHHHHHHHhhcCCCCCCEEEEcHHHHHHHhcccceeeccccc Confidence 999999999999887764432 22334579999999999988663 578999999999999999887654322 Q ss_pred -ccCceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCcc-------------------ceeccchhh Q lcl|Aclame:pro 143 -VGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQ-------------------VETDRDIVT 202 (231) Q Consensus 143 -~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~-------------------vE~~Rd~~~ 202 (231) .+...+++|.+|+++|++|+.|+++|.++++.+. +.++.+..+.++. .-.+.+... T Consensus 194 ~~~~~~l~~G~vg~i~G~~v~~s~~~~~~t~~a~~----~~a~~~at~a~v~~~~~~~~~s~s~~~~v~~~~~~~~~~t~ 269 (392) T protein:vir:99 194 QSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYH----PTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTI 269 (392) T ss_pred chhhhhhhcceeeeeeeeEEEeecccccccceeee----ccccccccccccccccccceeEEecccceecceeeccccee Confidence 2235688999999999999999999988765432 2222222211111 112223333 Q ss_pred cccEEEEEEEEEEEEEcCCcE----EEEEeccC Q lcl|Aclame:pro 203 KTTVITADEHYAAYLYDLTKV----VNITFTGV 231 (231) Q Consensus 203 ~~~~i~~~~~y~~~~~~~~~v----v~l~~~~~ 231 (231) .++.......+|.+.+..... ...+++++ T Consensus 270 ~s~~~~v~~~~g~~~v~~~~~~~~~~~~~~~~~ 302 (392) T protein:vir:99 270 TSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLI 302 (392) T ss_pred eccccccceeEEEEEEeeccccceeeeeeeeee Confidence 333333444455554432211 11111111 No 109 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=99.85 E-value=1.2e-22 Score=140.77 Aligned_cols=227 Identities=11% Similarity=0.033 Sum_probs=170.8 Q ss_pred CCCcccCceEEeccc-cCCcccccCCCccCccccccceeEEEeehccc-eeeecHHHHHhcCCCH--HHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND-IGDAADVAEGGEISLDKIGTTTKSVTIKKAAK-GTEITDEAALSGYGDP--IGESNKQLGLSLA 76 (231) Q Consensus 1 ~~~~~~G~ti~~P~~-igda~~v~EG~~i~~~~lt~~~~~~tikk~g~-~~~itD~~~~~~~~d~--~~~~~~~~a~~ia 76 (231) +---++|++|+||+. .....++..+..+.++.++.+..+++++|..+ .|.|.+.+..++...+ ...+.+++...++ T Consensus 68 ~~e~~~g~tVkIp~i~~~gl~DY~R~~g~~~g~vt~~~~t~tidqdR~~~F~VD~~D~dEtn~~l~a~~i~~~~~~~~v~ 147 (329) T protein:vir:10 68 DAIFMQGRSFTVIKGDVTELKDYKRNATNEFDHPQIQETTYFLDQEKYWGRFVDALDRRDTEGNIDINYVVAKQASEVVA 147 (329) T ss_pred ceeeccCcEEEEeeecccccccccCCCCccccccccceeEEEeecccceeeecchhhHhhhhhhhhHHHHHHHHHHHHhh Confidence 112458999999986 34466888888899999999999999999544 6999999999887655 4556677888999 Q ss_pred HHHHHHHHHHhcccc-----cccccccCHHHHHHHHHHhhccC-CCceEEEECHHHHHHHHhhhhhhhccccccCceeee Q lcl|Aclame:pro 77 NKVDDDLLKAAKTTS-----QTVSTKANVDGVQAALDIFNDED-AQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALIN 150 (231) Q Consensus 77 ~~vd~~~~~~l~t~~-----~~~~~~~~~d~i~da~~~l~~~~-~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~ 150 (231) ..+|...++.+-+.. ...+....|+.|.++..+|.+.+ -+++|++|+|+.+..|.+++.|... .......+++ T Consensus 148 pEiDay~~skla~~a~~~~~~~~t~~nay~~i~~a~~~Lde~~vp~~Rvl~VtP~~~~~Lk~~~~f~~~-~~~~~~~~~~ 226 (329) T protein:vir:10 148 PYLDNLRFATLARNKAKHLTVGSGADAQYDAVLDVSVELDEIGAGASRILFVTPKFYKGIKKFVIELPQ-GDNRQQVLGK 226 (329) T ss_pred hHHHHHHHHHHHhhcccccccccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhcc-ccccccceee Confidence 999988777663322 12334456899999999998765 3679999999999999999888643 3445567889 Q ss_pred ccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceecc-chhhcccEEEEEEEEEEEEEcCCc--EEEEE Q lcl|Aclame:pro 151 GTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDR-DIVTKTTVITADEHYAAYLYDLTK--VVNIT 227 (231) Q Consensus 151 G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~R-d~~~~~~~i~~~~~y~~~~~~~~~--vv~l~ 227 (231) |.+|++.|++|+.++...- +.+.+ ++..+.|+....|-. .+|.+| .+.++.+.+.++.||++++++|++ |.+.. T Consensus 227 g~Vg~idG~~Ii~vps~~~-k~in~-ii~~~~A~~~~~K~~-~~~~~~p~~~~~a~~v~gr~yyd~~V~~~k~~~I~~~~ 303 (329) T protein:vir:10 227 GVQGELDGFTIVKVPSKML-QGVEA-MAVIGEVMASPIQAN-EAKLNSNVPGMFGTLAEQMLYTGAFVPEHLQKYIFTIG 303 (329) T ss_pred eeeeeecCeEEEEecCCcc-cceeE-EEEcCCceeeeeeee-eeeeeCCCCccchheeeeeeeeeeEEEccccCEEEEec Confidence 9999999999998754322 22322 334577887776655 778777 478889999999999999999984 44433 Q ss_pred eccC Q lcl|Aclame:pro 228 FTGV 231 (231) Q Consensus 228 ~~~~ 231 (231) .++. T Consensus 304 ~~a~ 307 (329) T protein:vir:10 304 GKEV 307 (329) T ss_pred ccCc Confidence 4444 No 110 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=99.85 E-value=3.9e-23 Score=143.43 Aligned_cols=228 Identities=11% Similarity=-0.009 Sum_probs=172.6 Q ss_pred CCC--c-ccCceEEeccc---cCCcccccCCCccCc-cccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHH Q lcl|Aclame:pro 1 ENG--I-NLANLCEYPND---IGDAADVAEGGEISL-DKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGL 73 (231) Q Consensus 1 ~~~--~-~~G~ti~~P~~---igda~~v~EG~~i~~-~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~ 73 (231) ..+ + .....+++|.+ .+.+..++||+.++. ++.++++.++.+++.+..+.+|++...++..|+.+...+.+++ T Consensus 187 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~e~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~ 266 (437) T protein:vir:10 187 SLVRTESVTTTTGKLPIFNNSTDLLTAHTEYGQTTKNATPVITPILWDLKTYTGGYVFSQELISDSSYDWQAELQSRLIE 266 (437) T ss_pred hcceeEeeccCceeeEEeeccccccccccccccccccccccceeeeeehhheeeehhhhHHHHhhhHHHHHHHHHHHHHH Confidence 111 0 11234556643 345678999999984 6689999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhcccccccccccCHHHHHHHHH-HhhccCCCceEEEECHHHHHHHHhhhhhhhccccccCceeeecc Q lcl|Aclame:pro 74 SLANKVDDDLLKAAKTTSQTVSTKANVDGVQAALD-IFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGT 152 (231) Q Consensus 74 ~ia~~vd~~~~~~l~t~~~~~~~~~~~d~i~da~~-~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ 152 (231) +++.+++..++++..+.....+...+++++.+++. .+........+|+|||.++..|++..+..+.+ ....-+.+|. T Consensus 267 ~~~~~~~~~i~~g~g~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~g~~--~~~~~~~~~~ 344 (437) T protein:vir:10 267 LRDNTDDSLIITALTDGIKKTTSTYLLGDLKKVLNVTLKPQDSAAASIVMSQSAYNLFDMATDAMGRP--LLQPNVTAAT 344 (437) T ss_pred HHHHHHHHHHhhhhcccccccccccchhhHHHHHHhhhhhhhhcCCEEEEcHHHHHHHHHhhccCCCe--eeccCccCCC Confidence 99999999999998887777777788899998876 45444345668999999999999986654433 2222345677 Q ss_pred ceeecceeEEEcCCC--ccC---ceEEEEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEE Q lcl|Aclame:pro 153 YADVLGAQIVRSKKL--AEG---SALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNIT 227 (231) Q Consensus 153 ig~~~G~~Vv~s~~~--~~~---~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~ 227 (231) .++++|.||++++++ |.+ +...+ +..-..++.++.+.++.++...+-..+.+.+.+..+|++++++|++++.|+ T Consensus 345 ~~~l~G~pv~~~~~~~~~~~~~~~~~~~-~gd~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~l~ 423 (437) T protein:vir:10 345 GYTLLGKTVVIVDDKLFPSASAGDVNIV-VAPLKKAVINFKLTEITGQFQDTYDIWYKQLGIFLRQNVVQASKDLIVNLT 423 (437) T ss_pred CcccccceeEEecccccCCcCCCceEEE-EeeccccEEEEeeeceEEEEecccccccceeeEEEEEccEEecccceEEEE Confidence 789999999998754 433 22112 212244567777788899877766677788899999999999999999988 Q ss_pred ec--cC Q lcl|Aclame:pro 228 FT--GV 231 (231) Q Consensus 228 ~~--~~ 231 (231) .+ +| T Consensus 424 ~~~~~~ 429 (437) T protein:vir:10 424 GKLKAV 429 (437) T ss_pred eecccc Confidence 54 33 No 111 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=99.85 E-value=4.6e-23 Score=143.05 Aligned_cols=229 Identities=11% Similarity=-0.014 Sum_probs=168.8 Q ss_pred CCCcccCceEEeccccC--CcccccCCCccCcc--ccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIG--DAADVAEGGEISLD--KIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLA 76 (231) Q Consensus 1 ~~~~~~G~ti~~P~~ig--da~~v~EG~~i~~~--~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia 76 (231) .+-....-++.+|+..+ .+.+++||+..+.+ ++++++.++++++++..+.||++...++..++.+...++++++++ T Consensus 147 ~~~~~~~g~~~~~~~~~~~~~~~v~e~~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~ 226 (404) T protein:vir:10 147 EPVFTRSGSRTYEKRSKQKPMKPLSENQQIPTNGDNGKLERFNFKLKDLADFMSIPNDLLKFADKSLEDWIINWFVDKVR 226 (404) T ss_pred eeccCCccceEEEEecCCcceeeccccccccccccccceeeeEeeheeeEeeehhhHHHHhhcHHHHHHHHHHHHHHHHH Confidence 22222333566776644 45689999998765 588999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhccc-------------ccccccccCHHHHHHHHH-HhhccCCCceEEEECHHHHHHHHhhhhhhhcccc Q lcl|Aclame:pro 77 NKVDDDLLKAAKTT-------------SQTVSTKANVDGVQAALD-IFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSE 142 (231) Q Consensus 77 ~~vd~~~~~~l~t~-------------~~~~~~~~~~d~i~da~~-~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~ 142 (231) +++|..++.+..+. ....++..+++++.+++. .+........+++|||..+..|++.++..++. T Consensus 227 ~~~~~~il~G~g~~~~~~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~-- 304 (404) T protein:vir:10 227 ITRNAEILYGAGGDEHATGIMTANKFKKITLPKSPALKDFKKCKNVELLNVFKATSSWIVNQDGFNYLDSLEDKTGRP-- 304 (404) T ss_pred HHHHHHHhhcCCCCCcccceeeccccceeeccccccHHHHHHHHHhhhhccccCCCEEEEcHHHHHHHHHhhccCCce-- Confidence 99999999764432 122345568999998886 44443345567999999999999977655432 Q ss_pred ccCceeeeccceeecceeEEE-cCCCccCceEE--EEEecCCceEEEeecCCccceeccch----hhcccEEEEEEEEEE Q lcl|Aclame:pro 143 VGANALINGTYADVLGAQIVR-SKKLAEGSALM--FKIVSNSPALKLVLKRGVQVETDRDI----VTKTTVITADEHYAA 215 (231) Q Consensus 143 ~~~~~~~~G~ig~~~G~~Vv~-s~~~~~~~~~~--~~~~~~~~A~~~~~k~~v~vE~~Rd~----~~~~~~i~~~~~y~~ 215 (231) ....-+.+|..++++|.||++ ++.++.++.-. +.+.....++.++...+++++..++. .+....+++..++++ T Consensus 305 l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~ 384 (404) T protein:vir:10 305 YLQPDPKDPTQYRFLGLPVIELPNDLLLSTESAIPVLLGDTKEAYKYVSDGAYELATTNIGAGAFETNTTKARIIMRIDG 384 (404) T ss_pred eeccCcCCCCCccccceeeEEecccccCCCCCccEEEEEeccccEEEEEecceEEEEeccccchhhcCceEEEEEEeecc Confidence 222224557778999999985 45555433211 11222244667777788888876654 356778999999999 Q ss_pred EEEcCCcEEEEEeccC Q lcl|Aclame:pro 216 YLYDLTKVVNITFTGV 231 (231) Q Consensus 216 ~~~~~~~vv~l~~~~~ 231 (231) .+.+|++++++++++. T Consensus 385 ~v~~~~a~~~~~~~~a 400 (404) T protein:vir:10 385 NVKDSEALLIAEIPVE 400 (404) T ss_pred EEecccceEEEEeecc Confidence 9999999999999998 No 112 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=99.84 E-value=1.2e-22 Score=140.67 Aligned_cols=229 Identities=12% Similarity=0.029 Sum_probs=169.3 Q ss_pred CCC--c-ccCceEEeccc---cCCcccccCCCccC-ccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHH Q lcl|Aclame:pro 1 ENG--I-NLANLCEYPND---IGDAADVAEGGEIS-LDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGL 73 (231) Q Consensus 1 ~~~--~-~~G~ti~~P~~---igda~~v~EG~~i~-~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~ 73 (231) ..+ . -.+.+.++|.. .+.+..++||++.+ .++++++++++.+++++..+.||++...++..|+.+...+++++ T Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~ 222 (394) T protein:vir:10 143 TLVTKTPVTTPKGTYPILKRATDRFSSVAELAENPALAEPEFEQVDWSVSTYRGAIPLSEEAIADSAVDLTSLVGQSINE 222 (394) T ss_pred hhceeeeccCCceEEEEEecCCCccccccccccccccccccceeEEeeeeeeEeeehhHHHHHhhhhHHHHHHHHHHHHH Confidence 111 1 13345666643 35567899999988 47799999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhccc-ccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhcccc--ccCceeee Q lcl|Aclame:pro 74 SLANKVDDDLLKAAKTT-SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSE--VGANALIN 150 (231) Q Consensus 74 ~ia~~vd~~~~~~l~t~-~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~--~~~~~~~~ 150 (231) +++.++|..++....+. +...++..++|+|.+++...-...+ ..+++|||.++..|++.++..+..-. ...+.... T Consensus 223 ~~~~~~~~~il~g~g~~~~~~~~~~~~~d~l~~~~~~~~~~~~-~a~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~ 301 (394) T protein:vir:10 223 KSVNTYNAMIAPVLQSFTAKATTTDTLVDSLKHILNVDLDPAY-SRALVVTQSLFNTLDTLKDKNGRYLLHDASDSITDG 301 (394) T ss_pred HHHHHHHHHHhhcccccccccccccccHHHHHHHHHhhhhhhc-cCEEEecHHHHHHHHHhhccCCCeeeeccccccccC Confidence 99999999999887654 3455667889999998865544444 46899999999999998765543211 11112223 Q ss_pred ccceeecceeEEEcCCC--ccCceEEEEEe-cCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEE Q lcl|Aclame:pro 151 GTYADVLGAQIVRSKKL--AEGSALMFKIV-SNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNIT 227 (231) Q Consensus 151 G~ig~~~G~~Vv~s~~~--~~~~~~~~~~~-~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~ 227 (231) |.-++++|+||++++++ |.+.+-...++ ....++.++.+++++++..++.. +.+.+++..++++++++|++++.++ T Consensus 302 ~~~~~L~G~PV~~~~~~~~~~~~~~~~i~~gd~s~~~~~~~~~~~~v~~~~~~~-~~~~~~~~~r~d~~~~~~~ai~~~~ 380 (394) T protein:vir:10 302 TAKGTVLGVPVYVVGDALLGSAAGDQKAFVGDLKRGVLFADRQQVTLAWEDSKI-YGRYLGAAFRFGVKQADSNAGYFVT 380 (394) T ss_pred CcccccccceeEEecccccCCCCCceEEEEeeccccEEEEeecceEEEEecccc-cceeEEEEEEeccEEeccccEEEEE Confidence 45578999999987653 33222111111 12335667777888888777654 4567899999999999999999999 Q ss_pred eccC Q lcl|Aclame:pro 228 FTGV 231 (231) Q Consensus 228 ~~~~ 231 (231) ++++ T Consensus 381 ~~~~ 384 (394) T protein:vir:10 381 NTDA 384 (394) T ss_pred eecc Confidence 9999 No 113 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=99.84 E-value=8.8e-23 Score=141.51 Aligned_cols=227 Identities=11% Similarity=0.067 Sum_probs=173.1 Q ss_pred CC--------CcccCceEEeccccC----CcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHH Q lcl|Aclame:pro 1 EN--------GINLANLCEYPNDIG----DAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESN 68 (231) Q Consensus 1 ~~--------~~~~G~ti~~P~~ig----da~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~ 68 (231) .+ -.-.+.++++|.+.+ .+.+++||.+++.+++++++.++++++++..+.+|++...++..|+.+... T Consensus 141 ~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~s~~~f~~i~~~~~k~~~~v~iS~ell~ds~~~l~~~i~ 220 (421) T protein:vir:13 141 YPSLKEHCHVIPVNRNAGKMPVRAGASVDKLANLAKDTELVKAMLKTQPMAYDIDDYGLLAPIDNSLLEDSEINFLEFVN 220 (421) T ss_pred hhhhhhhceeeeccCCceEEEEeecCCccceeeccccccccccccceeEEEeeeeeeEeehhhhHHHHhhhHHHHHHHHH Confidence 11 111233567775432 234699999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccccccCcee Q lcl|Aclame:pro 69 KQLGLSLANKVDDDLLKAAKTTSQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANAL 148 (231) Q Consensus 69 ~~~a~~ia~~vd~~~~~~l~t~~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~ 148 (231) ++++++++.++|..+++.+.+.. ..++..++++|.+++..+.........|+|||..+..|++.++..+. +...+ . T Consensus 221 ~~la~~~~~~~~~~i~~~~~g~~-~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~--~i~~~-~ 296 (421) T protein:vir:13 221 EEFAEFAVNTENAEIVKQAKAVL-AEETINDYAGLVKTINSLVPNARKRAIIVTNSDGRAYLDGLMDKQGR--PLLKE-L 296 (421) T ss_pred HHHHHHHHHHhhhhHhhhhhhcc-ccccccchHHHHHHHHHhhhhhcCCCEEEEcHHHHHHHHHhhcCCCc--eeecC-c Confidence 99999999999999888765432 33455689999999999988777888999999999999987655433 33222 3 Q ss_pred eeccceeecceeEEEcCCCccCce--EEEEEecCCceEEEeecCCccceeccchh--hcccEEEEEEEEEEEEEcCCcEE Q lcl|Aclame:pro 149 INGTYADVLGAQIVRSKKLAEGSA--LMFKIVSNSPALKLVLKRGVQVETDRDIV--TKTTVITADEHYAAYLYDLTKVV 224 (231) Q Consensus 149 ~~G~ig~~~G~~Vv~s~~~~~~~~--~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~--~~~~~i~~~~~y~~~~~~~~~vv 224 (231) .+|..++++|.||++++++|.+.+ ..+.+.....++.++.+++++++..++.. +....+++..+|++.+.+|++++ T Consensus 297 ~~~~~~tl~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~ 376 (421) T protein:vir:13 297 SDGGDLVFKGRPVIELEESIFDVGDETKFIVSDFKTLIKFMDRKQYLIDQSKEAGYTKNETIARIIERFDVNSPLDKSSD 376 (421) T ss_pred CCCCCceecceeeEEeccccccCCCceEEEEEeccccEEEEEecceEEEeecccccccCeeEEEEEeeecceeecchhhh Confidence 456678999999999999986542 11222222345667778899999988875 44567889999999999999976 Q ss_pred EEEeccC Q lcl|Aclame:pro 225 NITFTGV 231 (231) Q Consensus 225 ~l~~~~~ 231 (231) .+...=. T Consensus 377 ~~~~~~~ 383 (421) T protein:vir:13 377 AEKIRKF 383 (421) T ss_pred eeeeccc Confidence 6544422 No 114 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=99.84 E-value=6.1e-23 Score=142.37 Aligned_cols=229 Identities=10% Similarity=0.030 Sum_probs=166.1 Q ss_pred CCCcccCceEEecccc---CCcccccCCCccCccccccceeEEEeehc-cceeeecHHHHHhcCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDI---GDAADVAEGGEISLDKIGTTTKSVTIKKA-AKGTEITDEAALSGYGDPIGESNKQLGLSLA 76 (231) Q Consensus 1 ~~~~~~G~ti~~P~~i---gda~~v~EG~~i~~~~lt~~~~~~tikk~-g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia 76 (231) ---+.+|..+.+|... ..+.+++||++++..++++...+++.+|. +..+.+|++.+.++..|+.+...++++++++ T Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~f~~~~l~~~k~~~~~i~is~ell~ds~~~l~~~i~~~la~a~~ 232 (409) T protein:vir:45 153 ILTTSDGRTMEWATADGTSEVGVLLGENEEAGEEDTDFGMGSLGALKMTSKIIRVSNELLQDSAIDMEAYLARRIAERIG 232 (409) T ss_pred eeecCCCceEEEEeeccCccccccccccccccccccccceeeeeeeeeeeeehhhhHHHHhccHHHHHHHHHHHHHHHHH Confidence 1113345556666542 23568999999999999999999998775 5678999999999999999999999999999 Q ss_pred HHHHHHHHHHhccc----------------ccccccccCHHHHHHHHHHhhccCCC-ceE-EEECHHHHHHHHhhhhhhh Q lcl|Aclame:pro 77 NKVDDDLLKAAKTT----------------SQTVSTKANVDGVQAALDIFNDEDAQ-AYV-LIVNPKDAAKIRKDANAKN 138 (231) Q Consensus 77 ~~vd~~~~~~l~t~----------------~~~~~~~~~~d~i~da~~~l~~~~~~-~~v-~vv~p~~~~~L~k~~~~~~ 138 (231) .++|..++.+-.+. ....+..+++++|++++..|...... ..| ++|||..+..|++.++..+ T Consensus 233 ~~~~~a~l~G~G~~~~~~p~Gil~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~~~a~~~~~~n~~~~~~l~~lkd~~G 312 (409) T protein:vir:45 233 RGEARYLIQGTGAGTPKQPKGLAASVTGTTQTAAANAVKWQEILALKHSIDPAYRRGPKFRLAFNDNTLKLISEMEDGQG 312 (409) T ss_pred HHHHHHhhccCCCCCccccceeeeccccccccccccccchHHHHHHHHhhhhhhccCCeEEEEECHHHHHHHHHhhcCCC Confidence 99999998643221 11223457899999999999765433 344 6789999999988765544 Q ss_pred ccccccCceeeeccceeecceeEEEcCCCccCc-eEEEEEecCCceEEEeecCCccceeccch--hhcccEEEEEEEEEE Q lcl|Aclame:pro 139 IGSEVGANALINGTYADVLGAQIVRSKKLAEGS-ALMFKIVSNSPALKLVLKRGVQVETDRDI--VTKTTVITADEHYAA 215 (231) Q Consensus 139 ~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~-~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~--~~~~~~i~~~~~y~~ 215 (231) + +....-+.+|..++++|.||+++++||... +-...+...-.-+.+..+.++.++..+|. .+....+++..+|++ T Consensus 313 ~--~i~~~~~~~~~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~i~~~~~~~~~~~~d~~~~~~~~~~~~~~r~d~ 390 (409) T protein:vir:45 313 R--PLWLPDIVGVAPASVLNVPYVIDQEIDDIGAGKKFMFCGDFDRFIIRRVRYMILKRLVERYAEYDQTGFLAFHRFDC 390 (409) T ss_pred c--eeeccCcCCCCCceecceeeEEecCcCCccCCccEEEEeehhhhheeeccceEEEEeecccccCCcEEEEEEEEecc Confidence 3 222223455667899999999999998622 11111111111233455667777766654 445667999999999 Q ss_pred EEEcCCcEEEEEeccC Q lcl|Aclame:pro 216 YLYDLTKVVNITFTGV 231 (231) Q Consensus 216 ~~~~~~~vv~l~~~~~ 231 (231) ++.+|++++++++++. T Consensus 391 ~~~~~~A~~~l~~k~s 406 (409) T protein:vir:45 391 ILEDTSAIKALVGKGS 406 (409) T ss_pred EeechhheEEEEeccC Confidence 9999999999999888 No 115 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=99.84 E-value=1e-22 Score=141.10 Aligned_cols=225 Identities=14% Similarity=0.050 Sum_probs=170.9 Q ss_pred CCCc----ccCceEEeccc---cCCcccccCCCccCc-cccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHH Q lcl|Aclame:pro 1 ENGI----NLANLCEYPND---IGDAADVAEGGEISL-DKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLG 72 (231) Q Consensus 1 ~~~~----~~G~ti~~P~~---igda~~v~EG~~i~~-~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a 72 (231) -+.. -.+.++++|.+ .+.+.+++||.+.+. +++++++.++++++++..+.+|++.+.++..|+.+...+.++ T Consensus 165 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~ 244 (400) T protein:vir:38 165 KPFTNVFQASTQKGTYPTVANATTKMVTVAELEKNPAMAKPEFKPVNWSVETYRQALPVSQESIDDSAIDLVGLIAQNGQ 244 (400) T ss_pred hhcceeEeccCcceEEEEEecCCCccccccccccccccccccceeeEeehhheeeehhhHHHHHhhhHHHHHHHHHHHHH Confidence 1111 12445677754 355778999999975 679999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccccccCceeeecc Q lcl|Aclame:pro 73 LSLANKVDDDLLKAAKTTSQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGT 152 (231) Q Consensus 73 ~~ia~~vd~~~~~~l~t~~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ 152 (231) ++++.++|..++....+. +..+..+++++.++....-+.. ...+++|||..+..|++.++..+. +....-+.+|. T Consensus 245 ~~~~~~~~~~i~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~-~~a~~v~~~~~~~~l~~lkd~~G~--~i~~~~~~~~~ 319 (400) T protein:vir:38 245 QIKVNTTNGAVATLLKGF--TAKTISSVDDLKHINNVDLDPA-YSRVIIASQSFYNFLDTVKDGNGR--YLLQDSILTPS 319 (400) T ss_pred HHHHHHHHHhhhhccccc--cccccccHHHHHHHHHhhhhhh-hCcEEEEcHHHHHHHHHhhccCCC--eeeecCcCCCC Confidence 999999999888766543 3456678999998877554333 356899999999999987665443 22222345677 Q ss_pred ceeecceeEEEcCCCccCce--EEEEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEecc Q lcl|Aclame:pro 153 YADVLGAQIVRSKKLAEGSA--LMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTG 230 (231) Q Consensus 153 ig~~~G~~Vv~s~~~~~~~~--~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~ 230 (231) .++++|.||++++++|.+.. ..+.+..-..++.++...++.++..++. .+...+++..+|++++.+|++++.|+++. T Consensus 320 ~~~l~G~pv~~~~~~~~~~~g~~~~~~gd~s~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~r~d~~~~~~~a~~~l~~~~ 398 (400) T protein:vir:38 320 GKSVLGMPIAVVSDDTLGAAGEAHAFLGDIKRAILFANRADFMVRWVDDQ-IYGQFLQAGMRFGVSVADEKAGYFLTYTP 398 (400) T ss_pred ccccccceeEEecccccCCCCceEEEEEeccccEEEEeecceEEEEeccc-ccceeEEEEEEeccEEecccceEEEEeec Confidence 78999999999999885432 1111212234566777778888877664 45678999999999999999999999999 Q ss_pred C Q lcl|Aclame:pro 231 V 231 (231) Q Consensus 231 ~ 231 (231) . T Consensus 399 ~ 399 (400) T protein:vir:38 399 K 399 (400) T ss_pred C Confidence 9 No 116 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=99.83 E-value=2.2e-22 Score=139.37 Aligned_cols=222 Identities=15% Similarity=0.069 Sum_probs=167.7 Q ss_pred CC--Cc-ccCceEEeccc---cCCcccccCCCccCc-cccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHH Q lcl|Aclame:pro 1 EN--GI-NLANLCEYPND---IGDAADVAEGGEISL-DKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGL 73 (231) Q Consensus 1 ~~--~~-~~G~ti~~P~~---igda~~v~EG~~i~~-~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~ 73 (231) .. -+ -.+.+.++|.+ .+.+.+++||++++. +++++++.++..++++..+.+|++.+.++..|+.+...+++++ T Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~i~is~ell~ds~~~~~~~i~~~la~ 239 (394) T protein:vir:97 160 PFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQ 239 (394) T ss_pred hhceeeeccCcceEEEEEecCCCccceecccccccccccccceeEEeehhheeeehhhHHHHHhhhhHHHHHHHHHHHHH Confidence 11 11 12234677755 234568999999985 6799999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhcccccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccccccCceeeeccc Q lcl|Aclame:pro 74 SLANKVDDDLLKAAKTTSQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTY 153 (231) Q Consensus 74 ~ia~~vd~~~~~~l~t~~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~i 153 (231) ++++++|..++..+.+. +..+..++++|.+++...-+. .....|+|||.++..|++..+..++ +....-+.+|.- T Consensus 240 ~~~~~~~~~i~~g~~~~--~~~~~~~~~~~~~~~~~~~~~-~~~a~~v~n~~~~~~l~~lkd~~G~--~i~~~~~~~~~~ 314 (394) T protein:vir:97 240 IKVNTTNDAIAKVLKSF--TTKTVKNLDEIKALLNGGFDP-AYNVSLIVSQSFYQTLDTLKDGNGR--YLLQDDITAVSG 314 (394) T ss_pred HHHHHHHHHHhhccccc--cccccccHHHHHHHHHhhhhh-hhCCEEEEcHHHHHHHHHhhccCCC--eeeecCcCCCCC Confidence 99999999988876543 344567899999888765433 2345789999999999987655443 222223456677 Q ss_pred eeecceeEEEcCCCc--cCceEEEEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 154 ADVLGAQIVRSKKLA--EGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 154 g~~~G~~Vv~s~~~~--~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) ++++|+||++++... ++..+..++ ..++.++.++++.++..++.. ..+.+++.++|++++.+|+++++++++.+ T Consensus 315 ~~l~G~pv~~~~~~~~~~~~~~~gd~---~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 390 (394) T protein:vir:97 315 KVLLGKPVFVLSDEVLGANKAFIGDF---KRGVLFADRKDLGLRWADNEI-YGQYLQAVLRFGVSKVDDKAGYYVTFTPE 390 (394) T ss_pred ceeccceeEEecccccCCccEEEeec---cccEEEEEecceEEEEecccc-cceeEEEEEEEccEEecccceEEEEeccc Confidence 899999999976544 444443332 334567777888888776653 45678999999999999999999999999 No 117 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=99.83 E-value=2.4e-22 Score=139.10 Aligned_cols=225 Identities=15% Similarity=0.021 Sum_probs=169.2 Q ss_pred CCCcccCceEEeccc-cCCcccccCCCccCccccccceeEEEeehcc-ceeeecHHHHHhcCCCH--HHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND-IGDAADVAEGGEISLDKIGTTTKSVTIKKAA-KGTEITDEAALSGYGDP--IGESNKQLGLSLA 76 (231) Q Consensus 1 ~~~~~~G~ti~~P~~-igda~~v~EG~~i~~~~lt~~~~~~tikk~g-~~~~itD~~~~~~~~d~--~~~~~~~~a~~ia 76 (231) |. ++|++|+||+. .....++..+..+.+++++.+..+++++|.. ..|.|.+.+..++..++ ...+.+++...++ T Consensus 59 e~--~gg~tVkIp~i~~~gl~DY~R~~g~~~g~vt~~~~t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~ 136 (319) T protein:vir:97 59 IF--MEGRSFTVMKGDTTELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVA 136 (319) T ss_pred Ee--ccCcEEEEeeecccccccccCCCCcccCCcccceeEEEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhh Confidence 33 58999999986 3446688888889999999999999999954 46999999999987655 4556778888888 Q ss_pred HHHHHHHHHHhcccc-----cccccccCHHHHHHHHHHhhccCC-CceEEEECHHHHHHHHhhhhhhhccccccCceeee Q lcl|Aclame:pro 77 NKVDDDLLKAAKTTS-----QTVSTKANVDGVQAALDIFNDEDA-QAYVLIVNPKDAAKIRKDANAKNIGSEVGANALIN 150 (231) Q Consensus 77 ~~vd~~~~~~l~t~~-----~~~~~~~~~d~i~da~~~l~~~~~-~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~ 150 (231) ..+|...++.+-+.. ...+....|+.|.++..+|.+.+. +++|++|+|+.+..|.++++|.... ..+...+.+ T Consensus 137 PEiDay~~skla~~a~~~~~~~~t~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f~~~~-~~~~~~~~~ 215 (319) T protein:vir:97 137 PYLDNLRFATLARNKAKHLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQG-DTRQQVLGK 215 (319) T ss_pred hhhhHHHHHHHHhhcccccccccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccc-cccccceee Confidence 899988777654322 223445579999999999987653 5799999999999999999887543 345567889 Q ss_pred ccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceecc-chhhcccEEEEEEEEEEEEEcCCcEEEEE-- Q lcl|Aclame:pro 151 GTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDR-DIVTKTTVITADEHYAAYLYDLTKVVNIT-- 227 (231) Q Consensus 151 G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~R-d~~~~~~~i~~~~~y~~~~~~~~~vv~l~-- 227 (231) |.+|++.|++|+.++... .+.+.+ ++..+.|+....| -..+|.+| .+.++.+.+.++.+|+++|++|.+..... T Consensus 216 g~Vg~idG~~Vi~vps~~-~k~in~-i~~h~~A~~~~~k-~~~~~~~~p~~~~~a~~v~gr~y~d~~V~~~k~~~Iy~~~ 292 (319) T protein:vir:97 216 GVQGELDGFVIVKVPTKL-LQGLQA-IAVVGEVLASPIQ-ADLAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIG 292 (319) T ss_pred eeceeecCeEEEEecccc-cccceE-EEEcCCeeeeeee-eeeeeccCCCccccceeeeeeeeeeeEEeccccceEEEee Confidence 999999999999875422 122222 3334677766654 33677766 57888999999999999999998544443 Q ss_pred eccC Q lcl|Aclame:pro 228 FTGV 231 (231) Q Consensus 228 ~~~~ 231 (231) -+.+ T Consensus 293 ~~~~ 296 (319) T protein:vir:97 293 GTEV 296 (319) T ss_pred cCCc Confidence 3333 No 118 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=99.83 E-value=2.4e-22 Score=139.10 Aligned_cols=225 Identities=15% Similarity=0.021 Sum_probs=169.2 Q ss_pred CCCcccCceEEeccc-cCCcccccCCCccCccccccceeEEEeehcc-ceeeecHHHHHhcCCCH--HHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND-IGDAADVAEGGEISLDKIGTTTKSVTIKKAA-KGTEITDEAALSGYGDP--IGESNKQLGLSLA 76 (231) Q Consensus 1 ~~~~~~G~ti~~P~~-igda~~v~EG~~i~~~~lt~~~~~~tikk~g-~~~~itD~~~~~~~~d~--~~~~~~~~a~~ia 76 (231) |. ++|++|+||+. .....++..+..+.+++++.+..+++++|.. ..|.|.+.+..++..++ ...+.+++...++ T Consensus 59 e~--~gg~tVkIp~i~~~gl~DY~R~~g~~~g~vt~~~~t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~ 136 (319) T protein:vir:94 59 IF--MEGRSFTVMKGDTTELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVA 136 (319) T ss_pred Ee--ccCcEEEEeeecccccccccCCCCcccCCcccceeEEEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhh Confidence 33 58999999986 3446688888889999999999999999954 46999999999987655 4556778888888 Q ss_pred HHHHHHHHHHhcccc-----cccccccCHHHHHHHHHHhhccCC-CceEEEECHHHHHHHHhhhhhhhccccccCceeee Q lcl|Aclame:pro 77 NKVDDDLLKAAKTTS-----QTVSTKANVDGVQAALDIFNDEDA-QAYVLIVNPKDAAKIRKDANAKNIGSEVGANALIN 150 (231) Q Consensus 77 ~~vd~~~~~~l~t~~-----~~~~~~~~~d~i~da~~~l~~~~~-~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~ 150 (231) ..+|...++.+-+.. ...+....|+.|.++..+|.+.+. +++|++|+|+.+..|.++++|.... ..+...+.+ T Consensus 137 PEiDay~~skla~~a~~~~~~~~t~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f~~~~-~~~~~~~~~ 215 (319) T protein:vir:94 137 PYLDNLRFATLARNKAKHLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQG-DTRQQVLGK 215 (319) T ss_pred hhhhHHHHHHHHhhcccccccccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccc-cccccceee Confidence 899988777654322 223445579999999999987653 5799999999999999999887543 345567889 Q ss_pred ccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceecc-chhhcccEEEEEEEEEEEEEcCCcEEEEE-- Q lcl|Aclame:pro 151 GTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDR-DIVTKTTVITADEHYAAYLYDLTKVVNIT-- 227 (231) Q Consensus 151 G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~R-d~~~~~~~i~~~~~y~~~~~~~~~vv~l~-- 227 (231) |.+|++.|++|+.++... .+.+.+ ++..+.|+....| -..+|.+| .+.++.+.+.++.+|+++|++|.+..... T Consensus 216 g~Vg~idG~~Vi~vps~~-~k~in~-i~~h~~A~~~~~k-~~~~~~~~p~~~~~a~~v~gr~y~d~~V~~~k~~~Iy~~~ 292 (319) T protein:vir:94 216 GVQGELDGFVIVKVPTKL-LQGLQA-IAVVGEVLASPIQ-ADLAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIG 292 (319) T ss_pred eeceeecCeEEEEecccc-cccceE-EEEcCCeeeeeee-eeeeeccCCCccccceeeeeeeeeeeEEeccccceEEEee Confidence 999999999999875422 122222 3334677766654 33677766 57888999999999999999998544443 Q ss_pred eccC Q lcl|Aclame:pro 228 FTGV 231 (231) Q Consensus 228 ~~~~ 231 (231) -+.+ T Consensus 293 ~~~~ 296 (319) T protein:vir:94 293 GTEV 296 (319) T ss_pred cCCc Confidence 3333 No 119 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=99.83 E-value=2e-22 Score=139.49 Aligned_cols=224 Identities=17% Similarity=0.182 Sum_probs=153.8 Q ss_pred CCC---ccc----CceEEeccccCC--cccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHH Q lcl|Aclame:pro 1 ENG---INL----ANLCEYPNDIGD--AADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQL 71 (231) Q Consensus 1 ~~~---~~~----G~ti~~P~~igd--a~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~ 71 (231) ..+ +.+ --.+++|..+++ +.+++||+++|.+++++++.+++.+|.+..+.+|+|.+.++..|+.+...+++ T Consensus 370 ~l~~~~~~~~~~~~~~~~ip~~t~~~~a~wv~Eg~~~~~s~~~f~~v~l~~~kla~~~~iS~ell~ds~~~~~~~i~~~l 449 (645) T protein:vir:93 370 RFGQGGIPALRQVPFNIRVHAQVSGGAAGWVGEGKTKPLTKFDFESITFSHAKVSAIAVLTEELIRFSSPAADALVRNAL 449 (645) T ss_pred hhccccccccccccCceeeeeeecCcceEEeccCccccccccceeEEEEeeEEEEEeehhHHHHHhhchHHHHHHHHHHH Confidence 111 111 125788988655 45799999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhccc-----cc--------ccccccCHHHHHHHHHHhhccCCC--ceEEEECHHHHHHHHhhhhh Q lcl|Aclame:pro 72 GLSLANKVDDDLLKAAKTT-----SQ--------TVSTKANVDGVQAALDIFNDEDAQ--AYVLIVNPKDAAKIRKDANA 136 (231) Q Consensus 72 a~~ia~~vd~~~~~~l~t~-----~~--------~~~~~~~~d~i~da~~~l~~~~~~--~~v~vv~p~~~~~L~k~~~~ 136 (231) +++|+.++|..++....+. +. ..+....+.++..++..|...+.. ..+++|||.++..|++.++. T Consensus 450 ~~aia~~~d~a~l~g~g~~~~~~~p~gi~~~~~~~~~~~~~~~d~~~~~~~~~~a~~~~~~a~~vmn~~~~~~L~~lkd~ 529 (645) T protein:vir:93 450 AEAVVARLDTDFVDPKKAAVADVSPASITHDVKGTASSGNPDADAEAAFGQFVAANLQPTGAVWLMSSTNALALSMRKNA 529 (645) T ss_pred HHHHHHHHHHHhhcCCCcccCCccccceeccccccccccchHHHHHHHHHHHHhcCCCccccEEEEcHHHHHHHHhcccc Confidence 9999999999999643221 11 111234566788888888665543 45799999999999998765 Q ss_pred hhccccccCceeeeccceeecceeEEEcCCCccCceEEEE----EecCCceEEEeecCCccceeccc------------- Q lcl|Aclame:pro 137 KNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFK----IVSNSPALKLVLKRGVQVETDRD------------- 199 (231) Q Consensus 137 ~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~----~~~~~~A~~~~~k~~v~vE~~Rd------------- 199 (231) .+.. ...+.-..| ++++|+||++|+.+|++-.+. + ++...+.+.+...+...++-.-. T Consensus 530 ~G~~--~~~~~~~~~--~tL~G~PV~~s~~vp~~~~~g-d~s~~~ig~~~~v~i~~s~~a~~~~~~~~~~~~~~~~~~~~ 604 (645) T protein:vir:93 530 LGQK--EYPDMTLLG--GSFQGLPVIVSQYVGDQLVLV-NAPDIYLADDGGVAVDMSREASLEMQSEPTGDSTTPSPVEL 604 (645) T ss_pred CCce--eecCCCCCC--ceeeceeeEEeccCCcceeEe-ccccEEEEEecceEEEeecceeEEEeecccccccccccccc Confidence 4322 222221222 689999999999999753221 1 00111122222222222221100 Q ss_pred ---hhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 200 ---IVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 200 ---~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) -.+....|+.-.++++.+.+|+++++|+ || T Consensus 605 v~lf~~d~vaira~~r~d~~~~~p~a~~~lt--~~ 637 (645) T protein:vir:93 605 VSMFQTGSVAIRAERWINWRRRRTAAVAVIT--GV 637 (645) T ss_pred hhHhhcCceEEEEEEEEcceeeCccceEEEe--cc Confidence 1134568888899999999999999876 77 No 120 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=99.83 E-value=4e-22 Score=137.92 Aligned_cols=229 Identities=11% Similarity=0.015 Sum_probs=168.2 Q ss_pred CCCcc-cCceEEeccc---cCCcccccCCCccC-ccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGIN-LANLCEYPND---IGDAADVAEGGEIS-LDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSL 75 (231) Q Consensus 1 ~~~~~-~G~ti~~P~~---igda~~v~EG~~i~-~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~i 75 (231) -+-++ .+.+.++|.+ .+.+..++||++.+ .++.++++.++.+++++..+.+|++...++..|+.+...+++++++ T Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~ 222 (389) T protein:vir:10 143 VTKTPVTTPKGTYPILKRATDRFSSVAELAENPKLAEPEFNKVDWSVATYRGAIPLSEEAIADSAVDLTALVGQSIKEKS 222 (389) T ss_pred cceeeccCCeeEEEEEecCCCccccccccccccccccccceeeeeeheeeEeeehhhHHHHhhhhHHHHHHHHHHHHHHH Confidence 11111 2345677744 34456789999887 4789999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhccc-ccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhcccc--ccCceeeecc Q lcl|Aclame:pro 76 ANKVDDDLLKAAKTT-SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSE--VGANALINGT 152 (231) Q Consensus 76 a~~vd~~~~~~l~t~-~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~--~~~~~~~~G~ 152 (231) +++.|..++..+.+. +..+++..+++++.+++...-+..+ ..+++|||..+..|++.++..+++-. ...+....|. T Consensus 223 ~~~~~~~i~~g~~~~~~~~~~~~~~~d~l~~~~~~~~~~~~-~a~~~~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~ 301 (389) T protein:vir:10 223 VNTYNAMIAPVLQSFTAKKTTTDTLVDSLKHILNVDLDPAY-SRALVVTQSLFNTLDTLKDKNGRYLLHDASDSITDGTA 301 (389) T ss_pred HHHHHHHHhhhhcccccccccccccHHHHHHHHHhhhhhhh-CcEEEecHHHHHHHHHhhccCCCeeeecCccccccccc Confidence 999999999887654 3455677899999988864333333 46899999999999987765443221 1112222345 Q ss_pred ceeecceeEEEcCCC-cc-Cce-EEEEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEec Q lcl|Aclame:pro 153 YADVLGAQIVRSKKL-AE-GSA-LMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFT 229 (231) Q Consensus 153 ig~~~G~~Vv~s~~~-~~-~~~-~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~ 229 (231) .++++|+||++++++ +. ..+ ..+.+..-..++.++.++++.++..++. .+.+.+++..++++++.+|+++++++++ T Consensus 302 ~~~l~G~pV~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-~~~~~~~~~~r~d~~~~~~~a~~~~~~~ 380 (389) T protein:vir:10 302 KGTILGVPVYVVGDTLLGSLAGDQKAFVGDLKRGVLFTDRQQVTLAWEDSK-IYGKYLGAAFRFGVQKADSKAGYFVTNT 380 (389) T ss_pred ccccccceeEEecccccCCCCCceEEEEeeccccEEEEeecceEEEeeccc-cccceEEEEEEeccEEecccceEEEEee Confidence 578999999876543 32 211 1111222244566777788999888764 4556789999999999999999999999 Q ss_pred cC Q lcl|Aclame:pro 230 GV 231 (231) Q Consensus 230 ~~ 231 (231) ++ T Consensus 381 ~~ 382 (389) T protein:vir:10 381 DV 382 (389) T ss_pred cc Confidence 98 No 121 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=99.83 E-value=1.4e-23 Score=145.86 Aligned_cols=230 Identities=13% Similarity=0.064 Sum_probs=174.1 Q ss_pred CCCcccCceEEeccccCC--cccccCCCccCccccccceeEEEeehccce-eeecHHHHHhcCCC-HHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGD--AADVAEGGEISLDKIGTTTKSVTIKKAAKG-TEITDEAALSGYGD-PIGESNKQLGLSLA 76 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igd--a~~v~EG~~i~~~~lt~~~~~~tikk~g~~-~~itD~~~~~~~~d-~~~~~~~~~a~~ia 76 (231) .--+++|+|++||. +|. +..+..|++|..+.+..++..++|+..-.. +.|.|.+..++..| +-.+..+++|+++| T Consensus 46 vRtI~~gkS~qf~~-lG~s~a~y~~pG~~ldg~~~~~dk~~ItIDtLL~a~~~V~dlDd~q~~yD~vRse~s~e~G~ALA 124 (400) T protein:vir:10 46 VQTVTGTNTVSNKY-LGETELQVLAPGQSPAATSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLK 124 (400) T ss_pred eeeecccceEEEEE-eeeeEEeeecCCCCcCCCCcccCcEEEEeCceeeecchhhhHHHHhhccccccHHHHHHHHHHHH Confidence 33489999999975 565 457999999999999999999999997764 88999999999999 89999999999999 Q ss_pred HHHHHHHHHHhcccc------------c-------cc-----ccccC----HHHHHHHHHHhhccC--CCceEEEECHHH Q lcl|Aclame:pro 77 NKVDDDLLKAAKTTS------------Q-------TV-----STKAN----VDGVQAALDIFNDED--AQAYVLIVNPKD 126 (231) Q Consensus 77 ~~vd~~~~~~l~t~~------------~-------~~-----~~~~~----~d~i~da~~~l~~~~--~~~~v~vv~p~~ 126 (231) +..|+.++..+..+. + .+ ....+ .+++.+|.+.|.+.+ ....+++++|.. T Consensus 125 ~~~Dq~iiq~i~~a~~a~t~~~~~~~~g~~~g~s~~v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP~~d~vvl~pp~~ 204 (400) T protein:vir:10 125 KMEDEMLIQQMLLGGIANTQAKRTNPRVKGHGFSVNVEVNEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRY 204 (400) T ss_pred HHHHHHHHHHHHHhcccccccccccCCccccccceeecccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHH Confidence 999998875432110 0 00 01122 334557777776554 344456666666 Q ss_pred HHHHHhhhhhhhc-cccccCceeeeccceeecceeEEEcCCCccCc---------------eEE--------EEEecCCc Q lcl|Aclame:pro 127 AAKIRKDANAKNI-GSEVGANALINGTYADVLGAQIVRSKKLAEGS---------------ALM--------FKIVSNSP 182 (231) Q Consensus 127 ~~~L~k~~~~~~~-~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~---------------~~~--------~~~~~~~~ 182 (231) |+.|+..+++.++ ....+++....|.+..+.|+||+.||++|.+. .|. .-+++.+. T Consensus 205 Ys~Ll~~dkLvnrdf~~s~~g~~~~g~v~~v~Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~s 284 (400) T protein:vir:10 205 FNVLRDADRIVDKSYTISQSGATIQGFVLSSYNCPVIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTAD 284 (400) T ss_pred HHHHHhCCcccchhccccCCCccccceEEEEeceEEEeeCcCCcccCcccccccccCCCCccCCccccccceeEEEEehh Confidence 6677765444433 22233455678899999999999999998421 111 12455678 Q ss_pred eEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 183 ALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 183 A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) |++.+.-.+++.|.+||++++++.|...+.||....+|+++..++.+=- T Consensus 285 Av~tvk~~~lt~~~~~d~r~~~~~id~~~a~G~g~~RPeaa~vv~~~~~ 333 (400) T protein:vir:10 285 ALLVGRSIDVIGDIFYEKKEKTYYIDTFMSEGAIPDRWEAVSVVTTKRQ 333 (400) T ss_pred heEEEEeeccccccccchhhHHHHHHHHHHhCCcccchhheEEEEecCC Confidence 8888888999999999999999999999999999999999999988654 No 122 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=99.83 E-value=1.3e-22 Score=140.56 Aligned_cols=219 Identities=11% Similarity=0.051 Sum_probs=166.3 Q ss_pred CCCc----ccCceEEeccccCC--cccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGI----NLANLCEYPNDIGD--AADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLS 74 (231) Q Consensus 1 ~~~~----~~G~ti~~P~~igd--a~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ 74 (231) ..+. .....++||+..+. +.+++||++++..++++++.++..++++..+.||++.+.++..++.+...+.++.+ T Consensus 390 ~l~~~~~~~~~g~~~ip~~~~~~~a~wv~E~~~~~~s~~~f~~i~l~~~k~~~~v~iS~ell~ds~~~~~~~i~~~l~~a 469 (632) T protein:vir:96 390 QMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEG 469 (632) T ss_pred hhcceEeecCCcceEEEEEeCCceeEeecCCccccccccceeeEEeeeeEEEEehhhHHHHHhccchHHHHHHHHHHHHH Confidence 2111 11235889987654 45799999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhccc--c------------cccccccCHHHHHHHHHHhhccC--CCceEEEECHHHHHHHHhhhhhhh Q lcl|Aclame:pro 75 LANKVDDDLLKAAKTT--S------------QTVSTKANVDGVQAALDIFNDED--AQAYVLIVNPKDAAKIRKDANAKN 138 (231) Q Consensus 75 ia~~vd~~~~~~l~t~--~------------~~~~~~~~~d~i~da~~~l~~~~--~~~~v~vv~p~~~~~L~k~~~~~~ 138 (231) ++.++|..++.+-.+. + ...+..++++++.++..++...+ ....+++|||..+..|++...+.. T Consensus 470 ~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~i~~~~~~i~~~~~~~~~~~~~~~~~~~~~l~~~~l~d~ 549 (632) T protein:vir:96 470 IGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDN 549 (632) T ss_pred HHHHHHHHhhcccCCCCccceeeecccccceecccccCCHHHHHHHHHHHhhcccccCccEEEEchhHHHHHHHHhccCC Confidence 9999999999754321 1 01234568999999998887655 345679999999888876443322 Q ss_pred ccccccCceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceecc--chhhcccEEEEEEEEEEE Q lcl|Aclame:pro 139 IGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDR--DIVTKTTVITADEHYAAY 216 (231) Q Consensus 139 ~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~R--d~~~~~~~i~~~~~y~~~ 216 (231) .+.+ +.. -++++|.||++|+.+|.+..++.++ ...- +....++.++.++ ........+++.++++++ T Consensus 550 ~G~~-----i~~--~~~l~G~pv~~s~~ip~~~~~~gd~--s~~~--i~~~~~~~i~~~~~~~~~~~~v~~~~~~~~d~~ 618 (632) T protein:vir:96 550 TGER-----IWQ--NNEVNGYRAEASNQIPADTWIFGDW--SQIV--IAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAG 618 (632) T ss_pred CCce-----eec--CCeecccceEeccccccCcEEEeec--ceEE--EEEecceEEEEccccccccCceEEEEEeecCce Confidence 2222 222 2578999999999999998765543 2222 3344567776665 445677799999999999 Q ss_pred EEcCCcEEEEEecc Q lcl|Aclame:pro 217 LYDLTKVVNITFTG 230 (231) Q Consensus 217 ~~~~~~vv~l~~~~ 230 (231) +.+|++++.++++| T Consensus 619 v~~~~af~~~k~~A 632 (632) T protein:vir:96 619 VRRKEAFCIAKKGA 632 (632) T ss_pred eechhhhhheeecC Confidence 99999999999999 No 123 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=99.82 E-value=1.8e-22 Score=139.74 Aligned_cols=222 Identities=12% Similarity=0.075 Sum_probs=156.6 Q ss_pred CCCcccCceEEeccccC--CcccccCCCcc-----CccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIG--DAADVAEGGEI-----SLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGL 73 (231) Q Consensus 1 ~~~~~~G~ti~~P~~ig--da~~v~EG~~i-----~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~ 73 (231) ..-.-.+.++++|.+.+ .+.+++||+.. +.+++++++.+++.+|.+..+.||+|...++..|+.+...+++++ T Consensus 36 ~~~~~~~~~~~~p~~~~~~~a~wv~E~~~~~~~~~~~s~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~ 115 (305) T protein:vir:25 36 QNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQ 115 (305) T ss_pred ceeeccCCcEEEEEEeCCcceEEeecccccccccccccccceeeEEeeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHH Confidence 11111355788998754 46689999864 556789999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhcccc---------------c---ccccccCHHHHHHH----HHHhhccCCCceEEEECHHHHHHHH Q lcl|Aclame:pro 74 SLANKVDDDLLKAAKTTS---------------Q---TVSTKANVDGVQAA----LDIFNDEDAQAYVLIVNPKDAAKIR 131 (231) Q Consensus 74 ~ia~~vd~~~~~~l~t~~---------------~---~~~~~~~~d~i~da----~~~l~~~~~~~~v~vv~p~~~~~L~ 131 (231) +|++++|+.++.+-.+.. . .......++++.++ ...+.........++|||..+..|+ T Consensus 116 ~~a~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~ 195 (305) T protein:vir:25 116 AIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVA 195 (305) T ss_pred HHHHHHhhhheeccCCCCCccccccccccccccccccccccchhhhHHHHHHHHHHHhhhhcccccceeEecHHHHHHHH Confidence 999999999996432110 0 01112233444443 4444444556667999999999999 Q ss_pred hhhhhhhccccccCceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccch----------- Q lcl|Aclame:pro 132 KDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDI----------- 200 (231) Q Consensus 132 k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~----------- 200 (231) +.++..++. +.. -++++|+||++++.+|....-...+......+.+...+++.+|..++. T Consensus 196 ~lkd~~G~~-------i~~--~~~l~G~Pv~~~~~~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~ 266 (305) T protein:vir:25 196 NIRDANGNP-------VFR--DDSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINL 266 (305) T ss_pred HhhccCCce-------eec--CCcccccceEEcCccCCCCCccEEEEEecceEEEEEecCeEEEEeeeeeeecCCceeee Confidence 876543322 111 157999999999998853221112222223355667778888877764 Q ss_pred -hhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 201 -VTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 201 -~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) .+....+++..||++.+.||+++++++..-+ T Consensus 267 ~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~ 298 (305) T protein:vir:25 267 AERDMVALRLKARFAYVLGVSATAQGANKTPV 298 (305) T ss_pred eecCcEEEEEEEeecceeeCcccEEEEccccc Confidence 2335678888999999999999999999766 No 124 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=99.81 E-value=6e-22 Score=136.94 Aligned_cols=231 Identities=10% Similarity=-0.004 Sum_probs=163.9 Q ss_pred CCCcccCceEEeccccC--Cccc---ccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIG--DAAD---VAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSL 75 (231) Q Consensus 1 ~~~~~~G~ti~~P~~ig--da~~---v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~i 75 (231) =+.++.+..+.+|.+.+ .+.. .+||..++..++++++.++.+++++..+.+|++...++..|+.+...+++++++ T Consensus 176 ~~~~~~~~~~~~p~~~~~~~a~~~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~ 255 (434) T protein:vir:62 176 GTGVKTKENIKYPVLVKKAEAQGHKNERTNNEMPETDIEFDEIELSPTEFDALATVTKKLLARTGLPIEQIVMDELKKAY 255 (434) T ss_pred cceeccCCceEEEEEecCCcccceecccccccccccccceeeEEeeheeeEeehhhHHHHHhcchHHHHHHHHHHHHHHH Confidence 12223344578887643 3333 467889999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhccc------------ccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccccc Q lcl|Aclame:pro 76 ANKVDDDLLKAAKTT------------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEV 143 (231) Q Consensus 76 a~~vd~~~~~~l~t~------------~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~ 143 (231) +.++|..++.+-.+. ....+..+++++|+++...+........+|+|||.++..|++.++..++.-.. T Consensus 256 ~~~~d~~~l~G~G~~~~~~g~~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~ 335 (434) T protein:vir:62 256 VRKETQYMVNGDEANNINDGALAKKAVEFKTDEKNLYDALVKMKNTPVKEVRKKARWVLNTAALTKIETMKTDDGFPLLR 335 (434) T ss_pred HHHHHHHHhccCCCCccccceeecccccccccccchhhHHHHHHhhcchhhhcCCEEEEcHHHHHHHHHhhccCCCEeec Confidence 999999999643321 12234456799999999999776666778999999999999877665443222 Q ss_pred cCceeeeccceeecceeEEEcCCCccCceE--EEEEecCCceEEEeecC-Cccceeccc--hhhcccEEEEEEEEEEEEE Q lcl|Aclame:pro 144 GANALINGTYADVLGAQIVRSKKLAEGSAL--MFKIVSNSPALKLVLKR-GVQVETDRD--IVTKTTVITADEHYAAYLY 218 (231) Q Consensus 144 ~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~--~~~~~~~~~A~~~~~k~-~v~vE~~Rd--~~~~~~~i~~~~~y~~~~~ 218 (231) .......|...+++|.||++++.+|.+.+- .......-..+.++... .+.++..++ ..++...+++.++++++++ T Consensus 336 ~~~~~~~g~~~tl~G~pV~~~~~~~~~~~~~~~~i~~Gdfs~~~i~~~~g~~~i~~~~~~~~~~~~v~~~~~~r~Dgk~i 415 (434) T protein:vir:62 336 PFNQAEGGIGYTLLGFPVEEEDAIDIPDSPDTPVFYFGDFSKFYIQDVIGSLEVQKLVELFSRTNRVGFRIWNLLDAQLI 415 (434) T ss_pred cCCCccCCCCceecceeeEEecCccCccCCCceEEEEeeccceEEEEeeceeEEEeehhhhcccCceEEEEEeeecceee Confidence 223344566678999999999999854421 11111111223344333 344544333 3455667899999999977 Q ss_pred c-CCcEEEEEeccC Q lcl|Aclame:pro 219 D-LTKVVNITFTGV 231 (231) Q Consensus 219 ~-~~~vv~l~~~~~ 231 (231) . |.++.++++++- T Consensus 416 ~~~~~~~~~~~~~~ 429 (434) T protein:vir:62 416 HSPFEVPVYKYVLK 429 (434) T ss_pred cCcccceEEEEEec Confidence 5 988888877654 No 125 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=99.81 E-value=7.6e-22 Score=136.38 Aligned_cols=225 Identities=11% Similarity=0.050 Sum_probs=160.6 Q ss_pred CCC---c-ccCceEEeccccCCc--ccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCC--CHHHHHHHHHH Q lcl|Aclame:pro 1 ENG---I-NLANLCEYPNDIGDA--ADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYG--DPIGESNKQLG 72 (231) Q Consensus 1 ~~~---~-~~G~ti~~P~~igda--~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~--d~~~~~~~~~a 72 (231) ..+ + .....+++|.+.+.+ .+++||+.+|..+++++++++.+++++..+.+|++...++.. ++.+...++++ T Consensus 164 ~~~~~~v~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~l~~~i~~~l~ 243 (435) T protein:vir:80 164 KLGARTLPLSNGNITIPRLKGGAIVGYIGADTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVDQIVVGDLT 243 (435) T ss_pred hccceeeecCCCceEEEEEeCCcceeeeccCccccccccceeeEEEeeEEEEEeehhhHHHHHhhcccHHHHHHHHHHHH Confidence 111 1 112247888876554 579999999999999999999999999999999999988854 67789999999 Q ss_pred HHHHHHHHHHHHHHhcccc--c------------ccc----cccCHHHHHHHHHHhhccC--CCceEEEECHHHHHHHHh Q lcl|Aclame:pro 73 LSLANKVDDDLLKAAKTTS--Q------------TVS----TKANVDGVQAALDIFNDED--AQAYVLIVNPKDAAKIRK 132 (231) Q Consensus 73 ~~ia~~vd~~~~~~l~t~~--~------------~~~----~~~~~d~i~da~~~l~~~~--~~~~v~vv~p~~~~~L~k 132 (231) +++++++|..++.+-.+.. . ..+ ....+.++.+++..|.... ....+++|||.++..|++ T Consensus 244 ~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~ 323 (435) T protein:vir:80 244 AAIGAREDKAFIRDDGTANTPKGLRFWALPGNVITASDGSTLQKIETDLGKAILALENADANLTQPGWIMAPRTFRFLEG 323 (435) T ss_pred HHHHHHHHHHhhccCCCCCcccceeecccccceeecccccchhhHHHHHHHHHHHhhccccccccCEEEEcHHHHHHHHh Confidence 9999999999997633211 0 011 1122456777777775432 345679999999999988 Q ss_pred hhhhhhccccccCceeeeccceeecceeEEEcCCCccCceE----EEEEecCCceEEEeecCCccceeccchh------- Q lcl|Aclame:pro 133 DANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSAL----MFKIVSNSPALKLVLKRGVQVETDRDIV------- 201 (231) Q Consensus 133 ~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~----~~~~~~~~~A~~~~~k~~v~vE~~Rd~~------- 201 (231) .++..+.. ...+ + .-|+++|+||++++.||..... ...+.....-+-+...+++.++..++.. T Consensus 324 lkd~~G~~--l~~~-~---~~~~l~G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~~~ 397 (435) T protein:vir:80 324 LRDGNGNK--VYPE-L---ANGMLKGYPVGKTTQVPINLGEAGKESEIYFTDFGDVFIGEEETLEIDYSKEATYKDADGH 397 (435) T ss_pred hhccCCce--eccC-C---CCCeEeeeeeEEeccccccccCCCCcceEEEEEcccEEEEeecceEEEEeccccccccccc Confidence 76654432 2211 1 2258999999999999864211 0111111122335667889998888753 Q ss_pred ------hcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 202 ------TKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 202 ------~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) +....+++..+|++++.+|+++++|+-.++ T Consensus 398 ~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~~~~ 433 (435) T protein:vir:80 398 MVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSGVAW 433 (435) T ss_pred hhhhhhcCcceeeeeeeeCcEeecccceEEEeccCC Confidence 456789999999999999999999988888 No 126 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=99.81 E-value=1.5e-21 Score=134.68 Aligned_cols=227 Identities=9% Similarity=0.070 Sum_probs=156.3 Q ss_pred CCCcccCceEEeccc---cCCc---ccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND---IGDA---ADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLS 74 (231) Q Consensus 1 ~~~~~~G~ti~~P~~---igda---~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ 74 (231) -|..-.||.+++|.| +|+. +++.+...+++.++++.+..+....++++|..+|++......+|++.++++++.+ T Consensus 39 ~~~~~~Gd~~~~pf~~~l~g~~~~~~~~~~~~~vt~~kitt~~~~av~~~r~~g~~~~d~~~~~~g~~~~~~~~~~Ig~~ 118 (325) T protein:vir:95 39 QSAAHQGDFSDVAFFAKVTGGLVRRRNAYGSGTVAEKVLKHLVDTSVKVAAGTPPVRLDPGQFRWIQQNPEVAGAAMGQQ 118 (325) T ss_pred ccccccCceeeccccccccccccccccCCCCceeccceeccccceeeEEecccCcccccHHHHhhcCCCHHHHHHHHHHH Confidence 233345999999998 5643 4677888899999999999999899999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhc----cc--------------ccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhh Q lcl|Aclame:pro 75 LANKVDDDLLKAAK----TT--------------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANA 136 (231) Q Consensus 75 ia~~vd~~~~~~l~----t~--------------~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~ 136 (231) +++....++++.+. .+ +...+..++++.+.+|.++|||.......++|||.++..|+++.-. T Consensus 119 ~a~~~~~~~l~~~~~~l~~a~~~~~~~v~dis~~~~~~~~~~s~~~l~~A~~klGD~~~~l~~~~MHS~v~~~L~~~~L~ 198 (325) T protein:vir:95 119 LAVDTMADMLNVGLGSVYSALSQVSDVVYDATANTDAADKLPTWNNLNNGQAKFGDQSSQIAAWIMHSTPMHKLYGSNLT 198 (325) T ss_pred HHHHHHHHHHHHHHHHHHHhhcccccceeeeecccCcccccccHHHHHHHHHHhcccccceeEEEEchHHHHHHHHhhcc Confidence 99887777655432 11 0111223578999999999999999999999999999999986433 Q ss_pred hhccccccCceeeeccceeecceeEEEcCCCccCc----eEEEEEecCCceEEEeecCCccceeccchhhcccEEEEEEE Q lcl|Aclame:pro 137 KNIGSEVGANALINGTYADVLGAQIVRSKKLAEGS----ALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEH 212 (231) Q Consensus 137 ~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~----~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~ 212 (231) .. ......+-+ ..+++++|.+|+++|.+|... ..+..+...+||+++....++..+.++......-....+.+ T Consensus 199 ~~-~~~~~~~g~--~~i~t~~G~~VIVdD~~p~~~~g~~~~ytty~lg~GAi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 275 (325) T protein:vir:95 199 NG-ERLFTYGTV--NVVRDPFGKLLVMTDSPNLFAAGTPNVYHILGLVPGGVLIGQNNDFDANEETKNGDENIIRTYQAE 275 (325) T ss_pred cc-ccccccCCc--ccccccCCcEEEEeCCCCCCCccCceeEEEEEEecCeEEecCCCCccccccccCcccceeeeeeee Confidence 21 111111111 146789999999999998532 24556677899999988777555444332221111111222 Q ss_pred EEEEEEcCCcEEEE-EeccC Q lcl|Aclame:pro 213 YAAYLYDLTKVVNI-TFTGV 231 (231) Q Consensus 213 y~~~~~~~~~vv~l-~~~~~ 231 (231) | .+++.|.++--- +..++ T Consensus 276 ~-tf~lhp~G~sw~~s~~g~ 294 (325) T protein:vir:95 276 W-SYNIGVKGFAWDKANGGK 294 (325) T ss_pred e-eEEeecceeeeecccccC Confidence 3 244455554431 11222 No 127 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=99.81 E-value=1.6e-21 Score=134.58 Aligned_cols=226 Identities=12% Similarity=0.040 Sum_probs=165.0 Q ss_pred CCCcccCceEEeccccC--CcccccCCCccCcc-ccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIG--DAADVAEGGEISLD-KIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~ig--da~~v~EG~~i~~~-~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) .+-..+.-+..+|+..+ .+.+++||++++.. .++++++++..++.+..+.||++.+.++..|+.+...+++++++++ T Consensus 143 ~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~ 222 (392) T protein:vir:10 143 EPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKV 222 (392) T ss_pred eeccCCceeEEEEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHH Confidence 11111112344555543 46689999999865 6899999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcccccccccccCHHHHHHHHH-HhhccCCCceEEEECHHHHHHHHhhhhhhhccccccCceeeeccceee Q lcl|Aclame:pro 78 KVDDDLLKAAKTTSQTVSTKANVDGVQAALD-IFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADV 156 (231) Q Consensus 78 ~vd~~~~~~l~t~~~~~~~~~~~d~i~da~~-~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~ 156 (231) ++|..++.+..+. ...+..++++|.+++. .+.........++|||.++..|++.++..++ +....-+.+|..+++ T Consensus 223 ~~d~~~~~g~g~~--~~~~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~--~l~~~~~~~~~~~tl 298 (392) T protein:vir:10 223 TRNVLILGVIEKL--TKQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGK--YILQSDPTQKNKKLF 298 (392) T ss_pred HHHHHHhhccccc--cccCccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCC--eEeecCccCCccccc Confidence 9999998876543 3445678999999884 6655555677899999999999987655433 222222445777899 Q ss_pred cceeEEEc-CC-Ccc------CceEEEEEecCCceEEEeecCCccceeccch----hhcccEEEEEEEEEEEEEcCCcEE Q lcl|Aclame:pro 157 LGAQIVRS-KK-LAE------GSALMFKIVSNSPALKLVLKRGVQVETDRDI----VTKTTVITADEHYAAYLYDLTKVV 224 (231) Q Consensus 157 ~G~~Vv~s-~~-~~~------~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~----~~~~~~i~~~~~y~~~~~~~~~vv 224 (231) +|.|+++. +. .+. +... +.+..-..++.+..+.++.++.++.. .+....+++..++++++.+|++++ T Consensus 299 lG~~~v~~~~~~~~~~~~~~~~~~~-~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~ 377 (392) T protein:vir:10 299 AGTNPVVVVSNRFLKSKGTTAKKAP-LIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAV 377 (392) T ss_pred cCcccEEEecccccCCCcccCCceE-EEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceE Confidence 99976653 22 221 2211 11222234566777788888877643 345667999999999999999999 Q ss_pred EEEeccC Q lcl|Aclame:pro 225 NITFTGV 231 (231) Q Consensus 225 ~l~~~~~ 231 (231) +++++.. T Consensus 378 ~l~~~~~ 384 (392) T protein:vir:10 378 YGEIDLS 384 (392) T ss_pred EEEeccc Confidence 9988776 No 128 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=99.81 E-value=1.6e-21 Score=134.58 Aligned_cols=226 Identities=12% Similarity=0.040 Sum_probs=165.0 Q ss_pred CCCcccCceEEeccccC--CcccccCCCccCcc-ccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIG--DAADVAEGGEISLD-KIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~ig--da~~v~EG~~i~~~-~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) .+-..+.-+..+|+..+ .+.+++||++++.. .++++++++..++.+..+.||++.+.++..|+.+...+++++++++ T Consensus 143 ~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~ 222 (392) T protein:vir:10 143 EPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKV 222 (392) T ss_pred eeccCCceeEEEEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHH Confidence 11111112344555543 46689999999865 6899999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcccccccccccCHHHHHHHHH-HhhccCCCceEEEECHHHHHHHHhhhhhhhccccccCceeeeccceee Q lcl|Aclame:pro 78 KVDDDLLKAAKTTSQTVSTKANVDGVQAALD-IFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADV 156 (231) Q Consensus 78 ~vd~~~~~~l~t~~~~~~~~~~~d~i~da~~-~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~ 156 (231) ++|..++.+..+. ...+..++++|.+++. .+.........++|||.++..|++.++..++ +....-+.+|..+++ T Consensus 223 ~~d~~~~~g~g~~--~~~~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~--~l~~~~~~~~~~~tl 298 (392) T protein:vir:10 223 TRNVLILGVIEKL--TKQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGK--YILQSDPTQKNKKLF 298 (392) T ss_pred HHHHHHhhccccc--cccCccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCC--eEeecCccCCccccc Confidence 9999998876543 3445678999999884 6655555677899999999999987655433 222222445777899 Q ss_pred cceeEEEc-CC-Ccc------CceEEEEEecCCceEEEeecCCccceeccch----hhcccEEEEEEEEEEEEEcCCcEE Q lcl|Aclame:pro 157 LGAQIVRS-KK-LAE------GSALMFKIVSNSPALKLVLKRGVQVETDRDI----VTKTTVITADEHYAAYLYDLTKVV 224 (231) Q Consensus 157 ~G~~Vv~s-~~-~~~------~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~----~~~~~~i~~~~~y~~~~~~~~~vv 224 (231) +|.|+++. +. .+. +... +.+..-..++.+..+.++.++.++.. .+....+++..++++++.+|++++ T Consensus 299 lG~~~v~~~~~~~~~~~~~~~~~~~-~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~ 377 (392) T protein:vir:10 299 AGTNPVVVVSNRFLKSKGTTAKKAP-LIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAV 377 (392) T ss_pred cCcccEEEecccccCCCcccCCceE-EEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceE Confidence 99976653 22 221 2211 11222234566777788888877643 345667999999999999999999 Q ss_pred EEEeccC Q lcl|Aclame:pro 225 NITFTGV 231 (231) Q Consensus 225 ~l~~~~~ 231 (231) +++++.. T Consensus 378 ~l~~~~~ 384 (392) T protein:vir:10 378 YGEIDLS 384 (392) T ss_pred EEEeccc Confidence 9988776 No 129 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=99.81 E-value=1.6e-21 Score=134.58 Aligned_cols=226 Identities=12% Similarity=0.040 Sum_probs=165.0 Q ss_pred CCCcccCceEEeccccC--CcccccCCCccCcc-ccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIG--DAADVAEGGEISLD-KIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~ig--da~~v~EG~~i~~~-~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) .+-..+.-+..+|+..+ .+.+++||++++.. .++++++++..++.+..+.||++.+.++..|+.+...+++++++++ T Consensus 143 ~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~ 222 (392) T protein:vir:10 143 EPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKV 222 (392) T ss_pred eeccCCceeEEEEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHH Confidence 11111112344555543 46689999999865 6899999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcccccccccccCHHHHHHHHH-HhhccCCCceEEEECHHHHHHHHhhhhhhhccccccCceeeeccceee Q lcl|Aclame:pro 78 KVDDDLLKAAKTTSQTVSTKANVDGVQAALD-IFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADV 156 (231) Q Consensus 78 ~vd~~~~~~l~t~~~~~~~~~~~d~i~da~~-~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~ 156 (231) ++|..++.+..+. ...+..++++|.+++. .+.........++|||.++..|++.++..++ +....-+.+|..+++ T Consensus 223 ~~d~~~~~g~g~~--~~~~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~--~l~~~~~~~~~~~tl 298 (392) T protein:vir:10 223 TRNVLILGVIEKL--TKQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGK--YILQSDPTQKNKKLF 298 (392) T ss_pred HHHHHHhhccccc--cccCccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCC--eEeecCccCCccccc Confidence 9999998876543 3445678999999884 6655555677899999999999987655433 222222445777899 Q ss_pred cceeEEEc-CC-Ccc------CceEEEEEecCCceEEEeecCCccceeccch----hhcccEEEEEEEEEEEEEcCCcEE Q lcl|Aclame:pro 157 LGAQIVRS-KK-LAE------GSALMFKIVSNSPALKLVLKRGVQVETDRDI----VTKTTVITADEHYAAYLYDLTKVV 224 (231) Q Consensus 157 ~G~~Vv~s-~~-~~~------~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~----~~~~~~i~~~~~y~~~~~~~~~vv 224 (231) +|.|+++. +. .+. +... +.+..-..++.+..+.++.++.++.. .+....+++..++++++.+|++++ T Consensus 299 lG~~~v~~~~~~~~~~~~~~~~~~~-~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~ 377 (392) T protein:vir:10 299 AGTNPVVVVSNRFLKSKGTTAKKAP-LIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAV 377 (392) T ss_pred cCcccEEEecccccCCCcccCCceE-EEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceE Confidence 99976653 22 221 2211 11222234566777788888877643 345667999999999999999999 Q ss_pred EEEeccC Q lcl|Aclame:pro 225 NITFTGV 231 (231) Q Consensus 225 ~l~~~~~ 231 (231) +++++.. T Consensus 378 ~l~~~~~ 384 (392) T protein:vir:10 378 YGEIDLS 384 (392) T ss_pred EEEeccc Confidence 9988776 No 130 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=99.81 E-value=1.6e-21 Score=134.58 Aligned_cols=226 Identities=12% Similarity=0.040 Sum_probs=165.0 Q ss_pred CCCcccCceEEeccccC--CcccccCCCccCcc-ccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIG--DAADVAEGGEISLD-KIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~ig--da~~v~EG~~i~~~-~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) .+-..+.-+..+|+..+ .+.+++||++++.. .++++++++..++.+..+.||++.+.++..|+.+...+++++++++ T Consensus 143 ~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~ 222 (392) T protein:vir:10 143 EPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKV 222 (392) T ss_pred eeccCCceeEEEEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHH Confidence 11111112344555543 46689999999865 6899999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcccccccccccCHHHHHHHHH-HhhccCCCceEEEECHHHHHHHHhhhhhhhccccccCceeeeccceee Q lcl|Aclame:pro 78 KVDDDLLKAAKTTSQTVSTKANVDGVQAALD-IFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADV 156 (231) Q Consensus 78 ~vd~~~~~~l~t~~~~~~~~~~~d~i~da~~-~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~ 156 (231) ++|..++.+..+. ...+..++++|.+++. .+.........++|||.++..|++.++..++ +....-+.+|..+++ T Consensus 223 ~~d~~~~~g~g~~--~~~~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~--~l~~~~~~~~~~~tl 298 (392) T protein:vir:10 223 TRNVLILGVIEKL--TKQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGK--YILQSDPTQKNKKLF 298 (392) T ss_pred HHHHHHhhccccc--cccCccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCC--eEeecCccCCccccc Confidence 9999998876543 3445678999999884 6655555677899999999999987655433 222222445777899 Q ss_pred cceeEEEc-CC-Ccc------CceEEEEEecCCceEEEeecCCccceeccch----hhcccEEEEEEEEEEEEEcCCcEE Q lcl|Aclame:pro 157 LGAQIVRS-KK-LAE------GSALMFKIVSNSPALKLVLKRGVQVETDRDI----VTKTTVITADEHYAAYLYDLTKVV 224 (231) Q Consensus 157 ~G~~Vv~s-~~-~~~------~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~----~~~~~~i~~~~~y~~~~~~~~~vv 224 (231) +|.|+++. +. .+. +... +.+..-..++.+..+.++.++.++.. .+....+++..++++++.+|++++ T Consensus 299 lG~~~v~~~~~~~~~~~~~~~~~~~-~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~ 377 (392) T protein:vir:10 299 AGTNPVVVVSNRFLKSKGTTAKKAP-LIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAV 377 (392) T ss_pred cCcccEEEecccccCCCcccCCceE-EEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceE Confidence 99976653 22 221 2211 11222234566777788888877643 345667999999999999999999 Q ss_pred EEEeccC Q lcl|Aclame:pro 225 NITFTGV 231 (231) Q Consensus 225 ~l~~~~~ 231 (231) +++++.. T Consensus 378 ~l~~~~~ 384 (392) T protein:vir:10 378 YGEIDLS 384 (392) T ss_pred EEEeccc Confidence 9988776 No 131 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=99.81 E-value=8.5e-22 Score=136.10 Aligned_cols=231 Identities=15% Similarity=0.140 Sum_probs=176.7 Q ss_pred CCCcccCceEEeccc-cCCcccccCCCccCccccc---cceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND-IGDAADVAEGGEISLDKIG---TTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLA 76 (231) Q Consensus 1 ~~~~~~G~ti~~P~~-igda~~v~EG~~i~~~~lt---~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia 76 (231) |-..+.|....||.+ +=-+-+++||++++...|+ .++.++..+|+|-.+++|||++.+|.+|+++.+.++++++|+ T Consensus 107 k~~L~~Grsm~F~~~g~~Ra~~IgEGgE~~~~sld~~T~dsv~~~~gK~G~~Ia~SqEmIsDSg~Dvin~~l~aA~RaMa 186 (393) T protein:vir:79 107 KIRLKSGQSMIFPSIGIMRAYDVAEGQEIPEDSIDWQTHESPEIRVGKSGIRLRFTDEMISDSQWDLMSMMIKQAGRAMG 186 (393) T ss_pred HHhhhcCcceeccchheeeeccccccccccccchhhhcCCceeEEechhhhhhhhHHHHhhcchHHHHHHHHHHHHHHHH Confidence 777888999999864 2235689999999988776 567888888899999999999999999999999999999999 Q ss_pred HHHHHHHHHHhccccc---------------------ccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhh Q lcl|Aclame:pro 77 NKVDDDLLKAAKTTSQ---------------------TVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDAN 135 (231) Q Consensus 77 ~~vd~~~~~~l~t~~~---------------------~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~ 135 (231) ++.+..+++.+++... ...++++.++|.|..-......+.+.+++|||.+|....|+.. T Consensus 187 RkKee~a~n~fk~~ghtvfDa~st~t~ahptGr~~~~~qNGTlSleDllDm~~av~~~hyt~svi~MHPLAWnv~AKna~ 266 (393) T protein:vir:79 187 RHKEQKAYHQFRSHGHTVFDNYSTNKLAHTTGLDKNGVQNDTFSAEDFLDLIIAVMANEYTPSDLMMHPLAWTVFAKNEL 266 (393) T ss_pred hhhHHHHHhhhhcccceeeeccccCccceeecCCccccccccccHHHHHHHHHHHhcccCCcceEEEcCchhhhhhhhhh Confidence 9999999998765322 1234678999999999888889999999999999999999876 Q ss_pred hhhccccccCceeeecc-ceeecc-----------eeEEEcCCCcc---CceEEEEEecCCceEEEeecCCccceeccch Q lcl|Aclame:pro 136 AKNIGSEVGANALINGT-YADVLG-----------AQIVRSKKLAE---GSALMFKIVSNSPALKLVLKRGVQVETDRDI 200 (231) Q Consensus 136 ~~~~~~~~~~~~~~~G~-ig~~~G-----------~~Vv~s~~~~~---~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~ 200 (231) ....+.....++--.+. -.+.+| ++|++|+.+|- .+.+-+......+--.++.+.+++++.+-|+ T Consensus 267 me~~~~na~gN~~~~~~~ts~algp~~i~~~~~~nlnv~~sPfvp~d~k~~rFd~~~Vd~NnvgvlLV~D~i~tdq~ddk 346 (393) T protein:vir:79 267 MGSLQANPYGNYPAKGAPSSMALGPDSIQGRLPFNFNVNLSPFIPLDKKSRRFDVYAVDRNNVGVLLVRDDLKTDQWDEK 346 (393) T ss_pred hcceeeccccccCccccchhhhhchhhhccccccceeEEEecccccccccceeeEEEeecCCceEEEEecCcceeccccc Confidence 65443222112111110 112344 89999999983 3344444344455555666778999999999 Q ss_pred hhcccEEEEEEEEEEEEEcCCcEEEE----EeccC Q lcl|Aclame:pro 201 VTKTTVITADEHYAAYLYDLTKVVNI----TFTGV 231 (231) Q Consensus 201 ~~~~~~i~~~~~y~~~~~~~~~vv~l----~~~~~ 231 (231) -++-+.|+-.++||.+++|..+.+-+ +++-. T Consensus 347 ~rdiq~iKl~ERYG~gvLn~gkaiavakNI~~~k~ 381 (393) T protein:vir:79 347 ARGLQNIKMIERYGIGILNEGKAIAVAKNISMDKS 381 (393) T ss_pred cccceeeeeeeeeceeeeeCCceEEEEecceeecc Confidence 99999999999999999999887754 22221 No 132 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=99.80 E-value=1.6e-21 Score=134.64 Aligned_cols=223 Identities=12% Similarity=0.010 Sum_probs=146.3 Q ss_pred CCCcccCceEEeccccCCcccccCCCccCccccccceeEEEeehccc-eeeecHHHHHhcCCCHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGDAADVAEGGEISLDKIGTTTKSVTIKKAAK-GTEITDEAALSGYGDPIGESNKQLGLSLANKV 79 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igda~~v~EG~~i~~~~lt~~~~~~tikk~g~-~~~itD~~~~~~~~d~~~~~~~~~a~~ia~~v 79 (231) +.-.+.||||+||. .+... +.+|..+++++++.++.+++|+|..+ .|+++|++..++..|+.++.+++.+++||+++ T Consensus 38 ~e~~~~GDTV~I~v-p~~~~-v~dg~~~~~~~~te~~v~l~id~~k~~~~~itD~e~a~~~~d~~~~~l~~A~~aLA~~v 115 (418) T protein:vir:10 38 KTFGKVGDTIRLKL-PYRVK-SASGRTLVKQPMVDQTIPFKIAYQEHVGLEYTVKDKTLDIMQFSERYLKSGMVQIANQI 115 (418) T ss_pred hHHhhCCCEEEEee-CCcee-ecccCCccccccccceEEEEEecccccceeechHHHhhhhhHHHHHHHHHHHHHHHHHH Confidence 22256799999997 33322 44688899999999999999988766 69999999999999999999999999999999 Q ss_pred HHHHHHHhccccccc----ccccCHHHHHHHHHHhhccCC--C-ceEEEECHHHHHHHHhhhhhhhccccccCceeeecc Q lcl|Aclame:pro 80 DDDLLKAAKTTSQTV----STKANVDGVQAALDIFNDEDA--Q-AYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGT 152 (231) Q Consensus 80 d~~~~~~l~t~~~~~----~~~~~~d~i~da~~~l~~~~~--~-~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ 152 (231) |.+++.....++... +....|+.++++..+|++.+. + +|++|++|+.++.|+++..+.. ......+.+++|. T Consensus 116 D~~ia~l~~~a~~~~gt~gt~~~~~~~i~~a~~~Ld~~~VP~~G~R~lVv~P~~~~~L~~~~~~~~-~~~~~~~~lr~G~ 194 (418) T protein:vir:10 116 DRSLALTLKKAFHSSGTPGVRPGAFIDFANAGAKQTTYAVPQDGMRHAVLDPFTCASLSDEVTKLF-KESMVEQAYKMGY 194 (418) T ss_pred HHHHHHHHhhcccccccCCcCcchHHHHHHHHHHHHhcCCCCCCceEEEeCHHHHHHHhhhccccc-cccccchhhheee Confidence 999988766554332 233469999999999988764 3 5999999999999999887643 4455566899999 Q ss_pred ceeecceeEEEcCCCccCceE---EEEEecCCceEEEeecCCccc----eeccchhhcccEEEEEEEEEEEEE----cCC Q lcl|Aclame:pro 153 YADVLGAQIVRSKKLAEGSAL---MFKIVSNSPALKLVLKRGVQV----ETDRDIVTKTTVITADEHYAAYLY----DLT 221 (231) Q Consensus 153 ig~~~G~~Vv~s~~~~~~~~~---~~~~~~~~~A~~~~~k~~v~v----E~~Rd~~~~~~~i~~~~~y~~~~~----~~~ 221 (231) ||+++|++|+.|+++|..+.. ...+. .+|.... ..+.+ -+.-.....-|.+...-+++++-+ .+ T Consensus 195 IG~i~GF~V~~S~nip~~tag~~~~t~~v--~ga~~~~--~~~~~~~~t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~~~- 269 (418) T protein:vir:10 195 RGNVAAYEVYESQNLPKHTVGDHGGTPLV--NGTVVNG--DTVGFDGGTASTTGFLKAGDVITFGGVFGVNPQNYETTG- 269 (418) T ss_pred eeeeeceEEEEecCCCcccccccccceee--ecccccc--eeEEEeecceeeccceeeccEEEECceeecccccccccc- Confidence 999999999999999953321 10111 1111100 00110 011111222222222222211111 00 Q ss_pred cEEEEEeccC Q lcl|Aclame:pro 222 KVVNITFTGV 231 (231) Q Consensus 222 ~vv~l~~~~~ 231 (231) ..-...+++. T Consensus 270 ~~~~f~V~~~ 279 (418) T protein:vir:10 270 LLQEFVVLED 279 (418) T ss_pred cceEEEEEee Confidence 1111111111 No 133 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=99.80 E-value=1.5e-21 Score=134.80 Aligned_cols=225 Identities=13% Similarity=0.051 Sum_probs=166.9 Q ss_pred CCCcccCceEEeccc---cCCcccccCCCccCc-cccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND---IGDAADVAEGGEISL-DKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLA 76 (231) Q Consensus 1 ~~~~~~G~ti~~P~~---igda~~v~EG~~i~~-~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia 76 (231) ..-...+...++|.. .+.+..++||++.+. .+.++++.++++++.+..+.+|++...++..|+.+...+++++.++ T Consensus 166 ~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~~~~~~~~~s~ell~ds~~~l~~~i~~~l~~~~~ 245 (397) T protein:vir:96 166 RSVPVNSASGKFPVISKSGSKMATVQQLEKNPQLANPKMVEIDYSVATRRGYIPISQEMIDDASYDVTGLIADEIQDQSL 245 (397) T ss_pred hhccccccceeEEEEeccCCccccccccccccccccccccceeecHhHhhcchhhHHHHHhhhHHHHHHHHHHHHHHHHH Confidence 111112234455532 345567999999874 6899999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcccccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccccccCceeeeccceee Q lcl|Aclame:pro 77 NKVDDDLLKAAKTTSQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADV 156 (231) Q Consensus 77 ~~vd~~~~~~l~t~~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~ 156 (231) .+++..+++..... ..++..++|+|.+++....+. ....+|+|||.++..|++.++..+. +....-+.+|..+++ T Consensus 246 ~~~~~~i~~g~g~~--~~~~~~~~d~~~~~~~~~~~~-~~~a~~v~n~~~~~~l~~lkd~~G~--~~~~~~~~~~~~~~l 320 (397) T protein:vir:96 246 NTKNADIAAVLKTA--TAKSVVGVDGLKDLINKEIKK-VYDVKLFISASMYSELDKLKDKNGR--YLLQDSITAASGKQL 320 (397) T ss_pred HHHHHHHhhccccc--ccccccchHHHHHHHHHhhhh-hcCcEEEEcHHHHHHHHHhhccCCC--eEeccCccCCCcccc Confidence 99999998776543 345667899999988765443 3456899999999999987655443 333233556777899 Q ss_pred cceeEEEcCCCccCce---EEEEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 157 LGAQIVRSKKLAEGSA---LMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 157 ~G~~Vv~s~~~~~~~~---~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) +|.||+++++++.+.. ..+.+..-..++.++...++.++..++ ..+.+.+++..++++++.+|++++++++++- T Consensus 321 ~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~a 397 (397) T protein:vir:96 321 LGKEVVVLDDDVIGKSVGNVVGFIGDAKAFASFFDRKQVSVSWVDN-NIYGQLLAGIIRYDVKATDKKAGFYVTFTIG 397 (397) T ss_pred cccceEEecccccCCCCCceEEEEeehhcceEeEeecceEEEEecc-cccceeEEEEEEEccEEecccceEEEEeecC Confidence 9999998876443221 111122223345677778888887765 4456778999999999999999999998877 No 134 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=99.80 E-value=1.9e-21 Score=134.16 Aligned_cols=224 Identities=10% Similarity=-0.024 Sum_probs=156.9 Q ss_pred CCC-----cccCceEEeccccCC--cccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHH Q lcl|Aclame:pro 1 ENG-----INLANLCEYPNDIGD--AADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGL 73 (231) Q Consensus 1 ~~~-----~~~G~ti~~P~~igd--a~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~ 73 (231) ..+ +..| .+++|++++. +.+++||+++|.+++++++.++..++.+..+.+|+|.+.++..++.+...+++++ T Consensus 97 ~lg~~~v~~~~g-~~~~p~~t~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~~~~~i~~~l~~ 175 (366) T protein:vir:57 97 ILGARSIPLPNG-NLSMPRLSGGATAGYVGEGKDVVATGATFDDVKLSAKTMIALVPVSNQLIGRAGFNVEQLLLGDILS 175 (366) T ss_pred hhceeeeecCCC-ceEEEEEeCCcceeeeccCccccccccceeEEEEeeEEEEEeehhhHHHHhhhhHHHHHHHHHHHHH Confidence 211 1223 4889988654 5579999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhccc--c-----------cc---cccccCH---HHHHHHHHHhhc---cCCCceEEEECHHHHHHHH Q lcl|Aclame:pro 74 SLANKVDDDLLKAAKTT--S-----------QT---VSTKANV---DGVQAALDIFND---EDAQAYVLIVNPKDAAKIR 131 (231) Q Consensus 74 ~ia~~vd~~~~~~l~t~--~-----------~~---~~~~~~~---d~i~da~~~l~~---~~~~~~v~vv~p~~~~~L~ 131 (231) ++++++|+.++..-.+. + .. ..+..++ +...+.+..... .......++|||..+..|+ T Consensus 176 a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~vmn~~~~~~L~ 255 (366) T protein:vir:57 176 AIATREDKAFLRDDGTGDTPKGMKAVATAANRLVAWTGTAINLTTIDEYLDSLILKHMDSNSNMIRCGWGLSNRTYMTLF 255 (366) T ss_pred HHHHHHHHHhhccCCCCccccceeeccccccceeeccccccchhhHHHHHHHHHHhhhccccccccCEEEecHHHHHHHH Confidence 99999999998653221 0 00 0112233 333333332221 2234567899999999999 Q ss_pred hhhhhhhccccccCceeeeccceeecceeEEEcCCCccCceE----EEEEecCCceEEEeecCCccceeccchh------ Q lcl|Aclame:pro 132 KDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSAL----MFKIVSNSPALKLVLKRGVQVETDRDIV------ 201 (231) Q Consensus 132 k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~----~~~~~~~~~A~~~~~k~~v~vE~~Rd~~------ 201 (231) +.++..+.... . ...-|+++|+||++|+.+|+..+. ...+.....-+-+....++.++..|++. T Consensus 256 ~lkd~~G~~l~--~----~~~~g~l~G~Pvv~s~~ip~~~~~~~~~~~i~~gdfs~~~i~~~~~i~i~~~~ea~~~~~~g 329 (366) T protein:vir:57 256 GLRDGNGNKVY--P----EMSQGILKGYPIQRTSAIPANLGDDGNESEIYFCDFNDVVIGEDGMMKVDFSTEATYKDADG 329 (366) T ss_pred hhhccCCceec--c----CCCCCeecceeeEEccccccccccCCCccEEEEEecceEEEEEecceEEEEeeccccccccc Confidence 87654433221 1 122368999999999999964221 1111122223446667788888777642 Q ss_pred -------hcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 202 -------TKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 202 -------~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) +....++..++|++++.+|+++++++-.+- T Consensus 330 ~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 330 QLVSAFARNQSLIRVVTEHDIGFRHPEGLVLGTGVIW 366 (366) T ss_pred cchhhhhcCceeEEeeeeeCcEeeccccEEEEecccC Confidence 345689999999999999999999865555 No 135 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=99.80 E-value=3e-21 Score=133.09 Aligned_cols=224 Identities=8% Similarity=0.018 Sum_probs=160.3 Q ss_pred CC-----CcccCceEEeccccCC--cccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCC--HHHHHHHHH Q lcl|Aclame:pro 1 EN-----GINLANLCEYPNDIGD--AADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGD--PIGESNKQL 71 (231) Q Consensus 1 ~~-----~~~~G~ti~~P~~igd--a~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d--~~~~~~~~~ 71 (231) .. -+..| .+++|.+.+. +.+++||+.+|..+++++++++.+++++..+.+|++...++..+ +.+...+++ T Consensus 164 ~~~~~~~~~~~~-~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~~~l~~~i~~~l 242 (435) T protein:vir:14 164 KLGARTLPLSNG-NITIPRLKGGAIVGYIGADTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVDQIVVGDL 242 (435) T ss_pred hhcceeeecCCC-ceEEEEEeCCcceeeeccCccccccccceeEEEeeeEEEEEeehhhHHHHHhhccCHHHHHHHHHHH Confidence 11 11223 5788987654 45799999999999999999999999999999999999998655 678899999 Q ss_pred HHHHHHHHHHHHHHHhccc--c---------c---cccccc----CHHHHHHHHHHhhcc--CCCceEEEECHHHHHHHH Q lcl|Aclame:pro 72 GLSLANKVDDDLLKAAKTT--S---------Q---TVSTKA----NVDGVQAALDIFNDE--DAQAYVLIVNPKDAAKIR 131 (231) Q Consensus 72 a~~ia~~vd~~~~~~l~t~--~---------~---~~~~~~----~~d~i~da~~~l~~~--~~~~~v~vv~p~~~~~L~ 131 (231) +++|++++|+.++.+-.+. + . ..+... .++++.+++..+... .....+++|||.++..|+ T Consensus 243 ~~ai~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~v~n~~~~~~L~ 322 (435) T protein:vir:14 243 TAAIGAREDKAFIRDDGTANTPKGLRFWALPSNVITASDASTLQKIETDLGKVILALENADANLTQPGWIMAPRTFRFLE 322 (435) T ss_pred HHHHHHHHHHHhhccCCCCccccceeecccccceeccccccchhhHHHHHHHHHHHhhhccccccCCEEEEcHHHHHHHH Confidence 9999999999998653321 1 0 011112 245566666666543 345668999999999999 Q ss_pred hhhhhhhccccccCceeeeccceeecceeEEEcCCCccCceE----EEEEecCCceEEEeecCCccceeccchh------ Q lcl|Aclame:pro 132 KDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSAL----MFKIVSNSPALKLVLKRGVQVETDRDIV------ 201 (231) Q Consensus 132 k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~----~~~~~~~~~A~~~~~k~~v~vE~~Rd~~------ 201 (231) +.++..+.. ...+ ..-|+++|+||++++.||...+. ...+...-..+.+....++.++..++.. T Consensus 323 ~lkd~~G~~--l~~~----~~~g~l~G~Pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~~~~~~~~~~~~~~~~~ 396 (435) T protein:vir:14 323 GLRDGNGNK--VYPE----LANGMLKGYPVGKTTQVPINLGETGKESEIYFTDFGDVFIGEEETLEIDYSKEATYKDADG 396 (435) T ss_pred HhhccCCce--eccC----CCCCeeecceeEeeccccccccCCCccceEEEeecccEEEEEecccEEEEecccccccccc Confidence 877644332 2111 12358999999999999864211 0111112223446677888998887643 Q ss_pred -------hcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 202 -------TKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 202 -------~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) +....+++.+++++++.+|+++++++-.+. T Consensus 397 ~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~ 433 (435) T protein:vir:14 397 HMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLAGVAW 433 (435) T ss_pred chhhhhhcChhheeeeeeeCceeecccceEEEecCCC Confidence 456899999999999999999999988777 No 136 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=99.80 E-value=4.1e-22 Score=137.85 Aligned_cols=216 Identities=15% Similarity=0.099 Sum_probs=162.1 Q ss_pred CCCcccCceEEecc--c-cCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPN--D-IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~--~-igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) -+-++.|. .++|. . .+++.+++||+.++..+++++++++.+++++..+.||++.+.++..|+.+...+++++++++ T Consensus 117 ~~v~~~~~-~~~p~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~i~is~ell~Ds~~~l~~~i~~~la~~~~~ 195 (352) T protein:vir:78 117 ARLTNIKG-LEIPRVSYTLDDDDFITDVETAKELKLKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAA 195 (352) T ss_pred eeeEecCC-ceEEEEecCCCcccccccccccccccccceeeeecceeEEeechhhHHHHhhhhHHHHHHHHHHHHHHHHH Confidence 11122222 24454 3 35688999999999999999999999999999999999999999999999999999999998 Q ss_pred HHHHHHHHHhccc-----------ccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccccccCc Q lcl|Aclame:pro 78 KVDDDLLKAAKTT-----------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGAN 146 (231) Q Consensus 78 ~vd~~~~~~l~t~-----------~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~ 146 (231) +.+..++....++ ...+++...||+|.+++..|.........++|||..+..|++..+..+. T Consensus 196 ~e~~~~~~~g~g~~~~~g~l~~~~~~~~t~~~~~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~l~~~~~~~~~------- 268 (352) T protein:vir:78 196 KERKDALAVSPKSGLEHMSFYNGSVKEVEGANMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTT------- 268 (352) T ss_pred HHHHhhhhcCCCCcccccceeccccccccccchHHHHHHHHhccChhhhcCCEEEEehHHHHHHHHHHhccCC------- Confidence 7566566432211 1223455669999999998876655677899999999998876543222 Q ss_pred eeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEE Q lcl|Aclame:pro 147 ALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNI 226 (231) Q Consensus 147 ~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l 226 (231) -+..|.-.+++|.||++++.++. .+.-++ + .+. ....++.++.+++..++...+++.++|++++++|++++.+ T Consensus 269 ~~~~~~~~~llG~PV~~~~~~~~--~~~Gdf--~-~~~--~~~~~~~~~~~~~~~~g~~~f~~~~r~Dg~~~~~eA~~~l 341 (352) T protein:vir:78 269 NFFDTPAEKVFGKPVVFTDAAVK--PIVGDF--N-YFG--INYDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIA 341 (352) T ss_pred cccccCCccccccceEEecCCCc--eeEeeh--h-hhh--hhhhhheeeeeccccCCeeEEEEEeeeCceeechhheEEE Confidence 12345556899999999997764 222121 1 122 2234567788888888889999999999999999999999 Q ss_pred EeccC Q lcl|Aclame:pro 227 TFTGV 231 (231) Q Consensus 227 ~~~~~ 231 (231) ++++. T Consensus 342 ~~~a~ 346 (352) T protein:vir:78 342 KAKES 346 (352) T ss_pred Eeecc Confidence 99999 No 137 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=99.80 E-value=3.6e-22 Score=138.14 Aligned_cols=216 Identities=15% Similarity=0.071 Sum_probs=162.8 Q ss_pred CCCcccCceEEecc--c-cCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPN--D-IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~--~-igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) =+-++.| ..++|+ + .+++.+++||+..+..++++++.++.+++++..+.||++...++..|+.+...+++++++++ T Consensus 152 ~~~~~~~-~~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~ 230 (387) T protein:vir:94 152 ARLTNIK-GLEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAA 230 (387) T ss_pred ceeeecC-CceeeeeeccCCccccccccccccccccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHH Confidence 1112222 244564 2 34677899999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhccc-----------ccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccccccCc Q lcl|Aclame:pro 78 KVDDDLLKAAKTT-----------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGAN 146 (231) Q Consensus 78 ~vd~~~~~~l~t~-----------~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~ 146 (231) +.++.++....++ ...+++...+|+|.+++..|.........|+||+.++..+++..+..+. T Consensus 231 ~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~------- 303 (387) T protein:vir:94 231 KERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTT------- 303 (387) T ss_pred HHHHhHhhcCCCccccceeeeccccccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCC------- Confidence 8777777543321 1223455679999999998876655666789999998887765432221 Q ss_pred eeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEE Q lcl|Aclame:pro 147 ALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNI 226 (231) Q Consensus 147 ~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l 226 (231) -+..|.-.+++|.||++++.++. .+..++ ..++. ..++..++.+|+...+...+++.++|++++++|++++.+ T Consensus 304 ~~~~~~~~~llG~PV~~~~~~~~--~~~GDf---~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l 376 (387) T protein:vir:94 304 NFFDTPAEKVFGKPVVFTDAAVK--PIVGDF---NYFGI--NYDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIA 376 (387) T ss_pred cccccCCccccccceEEecCCCc--eeeech---hhhhh--hhhhhhheecccccCCceEEEEEEEeCcEeechhheEEE Confidence 22345567899999999998764 222221 12222 224567788888888899999999999999999999999 Q ss_pred EeccC Q lcl|Aclame:pro 227 TFTGV 231 (231) Q Consensus 227 ~~~~~ 231 (231) +++|- T Consensus 377 ~~ka~ 381 (387) T protein:vir:94 377 KAKEN 381 (387) T ss_pred EeecC Confidence 99888 No 138 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=99.80 E-value=3.6e-22 Score=138.14 Aligned_cols=216 Identities=15% Similarity=0.071 Sum_probs=162.8 Q ss_pred CCCcccCceEEecc--c-cCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPN--D-IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~--~-igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) =+-++.| ..++|+ + .+++.+++||+..+..++++++.++.+++++..+.||++...++..|+.+...+++++++++ T Consensus 152 ~~~~~~~-~~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~ 230 (387) T protein:vir:96 152 ARLTNIK-GLEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAA 230 (387) T ss_pred ceeeecC-CceeeeeeccCCccccccccccccccccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHH Confidence 1112222 244564 2 34677899999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhccc-----------ccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccccccCc Q lcl|Aclame:pro 78 KVDDDLLKAAKTT-----------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGAN 146 (231) Q Consensus 78 ~vd~~~~~~l~t~-----------~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~ 146 (231) +.++.++....++ ...+++...+|+|.+++..|.........|+||+.++..+++..+..+. T Consensus 231 ~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~------- 303 (387) T protein:vir:96 231 KERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTT------- 303 (387) T ss_pred HHHHhHhhcCCCccccceeeeccccccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCC------- Confidence 8777777543321 1223455679999999998876655666789999998887765432221 Q ss_pred eeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEE Q lcl|Aclame:pro 147 ALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNI 226 (231) Q Consensus 147 ~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l 226 (231) -+..|.-.+++|.||++++.++. .+..++ ..++. ..++..++.+|+...+...+++.++|++++++|++++.+ T Consensus 304 ~~~~~~~~~llG~PV~~~~~~~~--~~~GDf---~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l 376 (387) T protein:vir:96 304 NFFDTPAEKVFGKPVVFTDAAVK--PIVGDF---NYFGI--NYDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIA 376 (387) T ss_pred cccccCCccccccceEEecCCCc--eeeech---hhhhh--hhhhhhheecccccCCceEEEEEEEeCcEeechhheEEE Confidence 22345567899999999998764 222221 12222 224567788888888899999999999999999999999 Q ss_pred EeccC Q lcl|Aclame:pro 227 TFTGV 231 (231) Q Consensus 227 ~~~~~ 231 (231) +++|- T Consensus 377 ~~ka~ 381 (387) T protein:vir:96 377 KAKEN 381 (387) T ss_pred EeecC Confidence 99888 No 139 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=99.80 E-value=3.6e-22 Score=138.14 Aligned_cols=216 Identities=15% Similarity=0.071 Sum_probs=162.8 Q ss_pred CCCcccCceEEecc--c-cCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPN--D-IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~--~-igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) =+-++.| ..++|+ + .+++.+++||+..+..++++++.++.+++++..+.||++...++..|+.+...+++++++++ T Consensus 152 ~~~~~~~-~~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~ 230 (387) T protein:vir:26 152 ARLTNIK-GLEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAA 230 (387) T ss_pred ceeeecC-CceeeeeeccCCccccccccccccccccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHH Confidence 1112222 244564 2 34677899999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhccc-----------ccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccccccCc Q lcl|Aclame:pro 78 KVDDDLLKAAKTT-----------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGAN 146 (231) Q Consensus 78 ~vd~~~~~~l~t~-----------~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~ 146 (231) +.++.++....++ ...+++...+|+|.+++..|.........|+||+.++..+++..+..+. T Consensus 231 ~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~------- 303 (387) T protein:vir:26 231 KERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTT------- 303 (387) T ss_pred HHHHhHhhcCCCccccceeeeccccccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCC------- Confidence 8777777543321 1223455679999999998876655666789999998887765432221 Q ss_pred eeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEE Q lcl|Aclame:pro 147 ALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNI 226 (231) Q Consensus 147 ~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l 226 (231) -+..|.-.+++|.||++++.++. .+..++ ..++. ..++..++.+|+...+...+++.++|++++++|++++.+ T Consensus 304 ~~~~~~~~~llG~PV~~~~~~~~--~~~GDf---~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l 376 (387) T protein:vir:26 304 NFFDTPAEKVFGKPVVFTDAAVK--PIVGDF---NYFGI--NYDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIA 376 (387) T ss_pred cccccCCccccccceEEecCCCc--eeeech---hhhhh--hhhhhhheecccccCCceEEEEEEEeCcEeechhheEEE Confidence 22345567899999999998764 222221 12222 224567788888888899999999999999999999999 Q ss_pred EeccC Q lcl|Aclame:pro 227 TFTGV 231 (231) Q Consensus 227 ~~~~~ 231 (231) +++|- T Consensus 377 ~~ka~ 381 (387) T protein:vir:26 377 KAKEN 381 (387) T ss_pred EeecC Confidence 99888 No 140 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=99.80 E-value=4.7e-22 Score=137.50 Aligned_cols=216 Identities=15% Similarity=0.081 Sum_probs=162.0 Q ss_pred CCCcccCceEEecc--c-cCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPN--D-IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~--~-igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) =+-++.|. .++|. + .+++.+++||+..+..++++++.++.+++++..+.||++.+.++..|+.+...+++++++++ T Consensus 167 ~~v~~~~~-~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~i~iS~ell~Ds~~~l~~~i~~~la~~~~~ 245 (402) T protein:vir:93 167 ARLTNIKG-LEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAA 245 (402) T ss_pred ceeeecCC-ceeeeeeccCCccccccccccccccccccceeeecceeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHH Confidence 11122222 34564 2 24577899999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhccc-----------ccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccccccCc Q lcl|Aclame:pro 78 KVDDDLLKAAKTT-----------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGAN 146 (231) Q Consensus 78 ~vd~~~~~~l~t~-----------~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~ 146 (231) +.++.+|....++ ...+++...+|+|.+++..|.........|+||+.++..|++..+..+ . T Consensus 246 ~e~~~~~~~g~g~g~p~g~~~~~~~~~~~~~~~~d~l~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~d~~-------~ 318 (402) T protein:vir:93 246 KERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGT-------T 318 (402) T ss_pred HHHHhHhhcCCCccccceeeeccccccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCC-------C Confidence 8777766543321 122345567899999999887665566678999999888776543221 1 Q ss_pred eeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEE Q lcl|Aclame:pro 147 ALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNI 226 (231) Q Consensus 147 ~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l 226 (231) -+..|.-.+++|.||++++.++. .+..++ ..++.. .+++.++.+|++..+...+++.++++++++||++++.+ T Consensus 319 ~~~~~~~~~llG~PV~~t~~~~~--i~~GDf---~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l 391 (402) T protein:vir:93 319 NFFDTPAEKVFGKPVVFTDAAVK--PIVGDF---NYFGIN--YDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIA 391 (402) T ss_pred cccccCCccccccceEEecCCCc--eeeech---hhhhhh--hhhhhhhhhhcccCCceEEEEEEEeCcEEechhheEEE Confidence 22345567899999999998763 222221 222222 24567788888888999999999999999999999999 Q ss_pred EeccC Q lcl|Aclame:pro 227 TFTGV 231 (231) Q Consensus 227 ~~~~~ 231 (231) ++||- T Consensus 392 ~ik~~ 396 (402) T protein:vir:93 392 KAKEN 396 (402) T ss_pred EeecC Confidence 99988 No 141 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=99.79 E-value=3.3e-21 Score=132.86 Aligned_cols=228 Identities=14% Similarity=0.034 Sum_probs=158.4 Q ss_pred CCC---cccCceEEeccccCC--cccccCCCccCccccccceeEEEeehccceeeecHHHHHh---cCCCHHHHHHHHHH Q lcl|Aclame:pro 1 ENG---INLANLCEYPNDIGD--AADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALS---GYGDPIGESNKQLG 72 (231) Q Consensus 1 ~~~---~~~G~ti~~P~~igd--a~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~---~~~d~~~~~~~~~a 72 (231) ..+ ...+..+++|.+.+. +.+++||+++|..++++++.++..+|.+..+.+|+|.+.+ +..++.+...++++ T Consensus 32 ~~~~~i~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la 111 (311) T protein:vir:99 32 VLSARKPQRFGNEDIITFNGRPKAEFVGEGQQKSSTTGEFDFVTSTPKKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGA 111 (311) T ss_pred hhcceeeccCCceEEEEEeCCceeEEeecCcccccccceeeEEEEeeEEEEEeehhhHHHhhcccccHHHHHHHHHHHHH Confidence 111 122345789988655 4589999999999999999999999999999999998754 45678999999999 Q ss_pred HHHHHHHHHHHHHHhcccc---------------cc----ccc-ccCHHHHHHHHHHhhccC--CCceEEEECHHHHHHH Q lcl|Aclame:pro 73 LSLANKVDDDLLKAAKTTS---------------QT----VST-KANVDGVQAALDIFNDED--AQAYVLIVNPKDAAKI 130 (231) Q Consensus 73 ~~ia~~vd~~~~~~l~t~~---------------~~----~~~-~~~~d~i~da~~~l~~~~--~~~~v~vv~p~~~~~L 130 (231) ++|++++|+.++.+..+.. .. ... ...++++.++..++.... .....++|||..+..| T Consensus 112 ~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L 191 (311) T protein:vir:99 112 EALARALDLGLYHRINPLTGTVIPGWSNYLGAASKRVELTADTIANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGL 191 (311) T ss_pred HHHHHHHHHHhhcccCcccCccccccccccccccceeeccccccchhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHH Confidence 9999999999997543211 00 011 123456677777776543 4456699999999999 Q ss_pred HhhhhhhhccccccCceeeeccceeecceeEEEcCCCccCceEEEE------------Ee-cCCceEEEeecCCccceec Q lcl|Aclame:pro 131 RKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFK------------IV-SNSPALKLVLKRGVQVETD 197 (231) Q Consensus 131 ~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~------------~~-~~~~A~~~~~k~~v~vE~~ 197 (231) ++.++..++ +........+..++++|+||++|+.+|.+...... ++ .....+.+...++++++.. T Consensus 192 ~~lkd~~G~--~l~~~~~~~~~~~~l~G~Pv~~s~~i~~~~~~~~~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~ 269 (311) T protein:vir:99 192 STARYTDGR--KKFPELGLGIGVSSFEGIDASVSDTVNGGDEADPDDEDLDAARAVRGIVGDFANGIHWGVQRDIPVELI 269 (311) T ss_pred HhhhccCCC--eeecCcccCCCCceecceeeEeecccccccccccccchhhccCcceEEEeeccccEEEEEecCceEEEe Confidence 987765433 33233334456689999999999998854332110 11 1123344555666666655 Q ss_pred cch---------hhcccEEEEEEEEEEEEEcCCcEEEEEecc Q lcl|Aclame:pro 198 RDI---------VTKTTVITADEHYAAYLYDLTKVVNITFTG 230 (231) Q Consensus 198 Rd~---------~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~ 230 (231) +.. .+....+++..+|++.+.+|+.++..+.+| T Consensus 270 ~~~~~~~~~~~~~~d~~~~r~~~r~d~~v~~~~~v~~~~~~A 311 (311) T protein:vir:99 270 KYGDPDGQGDLKRHNQIALRLEIVYGWYVFTDRFVVIENAVA 311 (311) T ss_pred ecCCCCcchhhhhcCcEEEEEEEeecceecChhHeeeecccC Confidence 432 245567888999999999997777666666 No 142 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=99.79 E-value=6.2e-21 Score=131.37 Aligned_cols=225 Identities=10% Similarity=0.013 Sum_probs=157.5 Q ss_pred CCCcc----cCceEEeccccC--CcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGIN----LANLCEYPNDIG--DAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLS 74 (231) Q Consensus 1 ~~~~~----~G~ti~~P~~ig--da~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ 74 (231) ..+.+ ....+++|++.+ .+.+++||+++|..++++++.++.+++++..+.+|++.+.++..++.+...++++++ T Consensus 158 ~~~~~~~~~~~g~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~v~is~ell~ds~~~l~~~i~~~l~~a 237 (428) T protein:vir:10 158 KLGARSIPLPNGNMSLPRLAGGATASYTGENQDAKVSEARFDDVKLTAKTMIAMVPISNALIGRAGFNVEQLVLQDILTA 237 (428) T ss_pred hhcceeeecCCcceEEEEEeCCcceeeeccCccccccccceeeEEeeeEEEEEeehhhHHHHhhhhHHHHHHHHHHHHHH Confidence 22211 112378898754 456899999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhccc--c---------------cccccccCHHHHHHHHHHhh------ccCCCceEEEECHHHHHHHH Q lcl|Aclame:pro 75 LANKVDDDLLKAAKTT--S---------------QTVSTKANVDGVQAALDIFN------DEDAQAYVLIVNPKDAAKIR 131 (231) Q Consensus 75 ia~~vd~~~~~~l~t~--~---------------~~~~~~~~~d~i~da~~~l~------~~~~~~~v~vv~p~~~~~L~ 131 (231) |++++|..++.+-.+. + .......+++.+...+..+. ........++|||..+..|+ T Consensus 238 i~~~~d~~~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~ 317 (428) T protein:vir:10 238 ISVREDKAFMRDDGTGDTPIGMKARATQWNRLLPWAADAAVNLDTIDTYLDSIILMSMDGNSNMISSGWGMSNRTYMKLF 317 (428) T ss_pred HHHHHHHHHhccCCCCccccccccccccccccccccccccccHHHHHHHHHHHHHhhhccccccccCEEEEcHHHHHHHH Confidence 9999999998653321 0 00112334444443333322 12234567999999999999 Q ss_pred hhhhhhhccccccCceeeeccceeecceeEEEcCCCccCceE----EEEEecCCceEEEeecCCccceeccchh------ Q lcl|Aclame:pro 132 KDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSAL----MFKIVSNSPALKLVLKRGVQVETDRDIV------ 201 (231) Q Consensus 132 k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~----~~~~~~~~~A~~~~~k~~v~vE~~Rd~~------ 201 (231) +..+..+.. ...+ ..-|+++|+||++++.+|.+.+. ...+......+.+...+++.++..|+.. T Consensus 318 ~lkd~~G~~--i~~~----~~~g~l~G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~i~i~~~~~~~~~~~~~ 391 (428) T protein:vir:10 318 GLRDGNGNK--VYPE----MAQGMLKGYPIQRTSAIPANLGEGGKESEIYFADFNDVVIGEDGNMKVDFSKEASYIDTDG 391 (428) T ss_pred HhhccCCce--eccC----CCCCeeeceeeEEeccccccccCCCccceEEEEecceEEEEEecceEEEeecccccccccc Confidence 876544332 2111 12358999999999999875321 1111122223445666788888887642 Q ss_pred -------hcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 202 -------TKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 202 -------~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) +....+++..++++.+.+|+++++++--.- T Consensus 392 ~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 392 KLVSAFSRNQSLIRVVTEHDIGFRHPEGLVLGTGVLF 428 (428) T ss_pred cccchhhcchhheeeeeeeCceeeccceEEEEeccCC Confidence 345788999999999999999999865555 No 143 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=99.79 E-value=2.5e-21 Score=133.50 Aligned_cols=224 Identities=11% Similarity=0.075 Sum_probs=143.7 Q ss_pred CCCc---ccCceEEeccccCC--ccccc--CCCccCccccccceeEEEeehccc-eeeecHHHHHhcCCCHHHHHHHHHH Q lcl|Aclame:pro 1 ENGI---NLANLCEYPNDIGD--AADVA--EGGEISLDKIGTTTKSVTIKKAAK-GTEITDEAALSGYGDPIGESNKQLG 72 (231) Q Consensus 1 ~~~~---~~G~ti~~P~~igd--a~~v~--EG~~i~~~~lt~~~~~~tikk~g~-~~~itD~~~~~~~~d~~~~~~~~~a 72 (231) |... +.||||+||. .++ +.++. .+..+.+++++..+..++|+|..+ .|+++|++..++..|+ +...++.+ T Consensus 36 ~ge~~~a~~GDTV~I~~-p~~~~v~d~~~~~~~~~~~~~~~e~~v~l~id~~k~~a~~v~d~e~~l~i~~~-~~~l~~a~ 113 (423) T protein:vir:35 36 LSGEINSNTGDSVSFKR-PHQFKSERTETGDITGKDKNGLFSAKATGKVGKYITVAVEWTQIEEALKLNQL-DQILSPIH 113 (423) T ss_pred CcccccccCCCEEEEee-CCcceeecccCcCCCCccccccccceeeEEeccceeccceeCHHHHHhhHHHH-HHHHHHHH Confidence 4333 6799999984 344 34453 356788999999999999999887 6999999999988887 56777888 Q ss_pred HHHHHHHHHHHHHHhcc-cccc---c-ccccCHHHHHHHHHHhhccC--CCceEEEECHHHHHHHHhhhhhhhccccccC Q lcl|Aclame:pro 73 LSLANKVDDDLLKAAKT-TSQT---V-STKANVDGVQAALDIFNDED--AQAYVLIVNPKDAAKIRKDANAKNIGSEVGA 145 (231) Q Consensus 73 ~~ia~~vd~~~~~~l~t-~~~~---~-~~~~~~d~i~da~~~l~~~~--~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~ 145 (231) +++++.+|.+++..+.. ++.. . +....|+.+.++-..|++.+ ..+|++|++|+.+..|+++..+.......+. T Consensus 114 ~ala~~vd~~l~~~l~~~a~~~vgt~~t~~~~~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~ 193 (423) T protein:vir:35 114 ERMVTDLETELAHFMMNNGALSLGSPNTAIKKWADVAQTASFIKDIGIKTGENYAIMDPWSAQRLADAQSGLHAADQLVR 193 (423) T ss_pred HHHHHHHHHHHHHHHhhccccccccccCCcchHHHHHHHHHHHHHhcCCcCCCEEEeCHHHHHHHhccccceeccccchh Confidence 99999999999876543 2222 1 22346899999999998776 3579999999999999976554444445567 Q ss_pred ceeeeccc-eeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchhhcccEEEEE-------------- Q lcl|Aclame:pro 146 NALINGTY-ADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITAD-------------- 210 (231) Q Consensus 146 ~~~~~G~i-g~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~-------------- 210 (231) +.+++|.+ |+++|++|+.|+++|..+.... +++.......-+....-++.......+.++ T Consensus 194 ~alr~g~i~G~i~GFdv~~Snnvp~~T~gt~-----~~~~~v~~a~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~ 268 (423) T protein:vir:35 194 TAWENAQISGNFGGIRALMSNGLASRKQGDF-----DGAITVKTAPNVDYLSVKDSYQFTVALTGATPSKTGFLKAGDQL 268 (423) T ss_pred HHHhhccceeeecceEEEEcCCCcccccccc-----ccceeeccccccccccccccccceeeeeeeeeccCCcEEecceE Confidence 78899876 9999999999999997544321 111110000011111222222222222222 Q ss_pred EEEEEEEEcCCcEEE-----------EEec-c--------C Q lcl|Aclame:pro 211 EHYAAYLYDLTKVVN-----------ITFT-G--------V 231 (231) Q Consensus 211 ~~y~~~~~~~~~vv~-----------l~~~-~--------~ 231 (231) .+-|++.++|-.-.+ ..++ + . T Consensus 269 t~aGv~~v~~~t~~~~~~~~t~~~~~~~V~~~~~~~a~g~~ 309 (423) T protein:vir:35 269 KFTSTHWLNQQSKQTLYNGSTAMSFTATVLEETNSTASGDV 309 (423) T ss_pred EeeeeeeccccccceeecccCCceeEEEEeccccccccCce Confidence 122444443322221 1111 0 0 No 144 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=99.78 E-value=7.2e-21 Score=131.03 Aligned_cols=221 Identities=13% Similarity=0.150 Sum_probs=162.8 Q ss_pred CCCcccCceEEeccc--cCCcccccCCCccCccc-cccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--IGDAADVAEGGEISLDK-IGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--igda~~v~EG~~i~~~~-lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) .--++ |+ +++|.. .+.+.+++||++++..+ .++++.++++++++..+.||++...++..++.+...++++++|+. T Consensus 174 ~~~~~-g~-~~ip~~~~~~~a~~v~E~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~ 251 (425) T protein:vir:95 174 KIRVK-GT-TRILVDTDTSPATWIEQSGALPTGDVGTIASIDFDGFKVGKVTFVDNYLLQDSIINLDDYVTKKIARAIAK 251 (425) T ss_pred eeecC-ce-eEEEEecCCccccccccccccccccccccceeeeeheeeeeeehhhHHHHhccHHHHHHHHHHHHHHHHHH Confidence 11233 43 578865 45567899999998777 589999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcccc----------------cccccccCHHHHHHHHHHhhccC--CCceEEEECHHHHHH-H---Hhhhh Q lcl|Aclame:pro 78 KVDDDLLKAAKTTS----------------QTVSTKANVDGVQAALDIFNDED--AQAYVLIVNPKDAAK-I---RKDAN 135 (231) Q Consensus 78 ~vd~~~~~~l~t~~----------------~~~~~~~~~d~i~da~~~l~~~~--~~~~v~vv~p~~~~~-L---~k~~~ 135 (231) ++|..++.+-.+.. ...+...++++++++...+.... ....+++|||.++.. | ++.++ T Consensus 252 ~~d~~il~G~G~~~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~l~~l~~~kd 331 (425) T protein:vir:95 252 ALDLAIVKGTGAANKQPLGIIPSLPPENQVTVEADNNLLKNLVKQIGLIDTGDDSVGEIVAVMKRSTYYNRLVEFSIQVD 331 (425) T ss_pred HHHHHhhccCCCCccccceeecccccccccccccccchHHHHHHHHHhhhhhccccCceEEEEeChHHHHHHHHHHhhcC Confidence 99999997532210 11234567999999988876543 245568899987532 3 33222 Q ss_pred hhhccccccCceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchh--hcccEEEEEEEE Q lcl|Aclame:pro 136 AKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIV--TKTTVITADEHY 213 (231) Q Consensus 136 ~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~--~~~~~i~~~~~y 213 (231) ..++ +... .-++..++++|.||++|+.+|++..++.++ + . +.++..+++.++..++.. +....+++..++ T Consensus 332 ~~g~--~i~~--~~~~~~~~l~G~pvv~~~~~~~~~i~~Gd~--~-~-~~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~ 403 (425) T protein:vir:95 332 SNGN--VVGK--LPNLRTPDLLGLRVVFNNFLDDDTVLFGEF--E-Q-YTLVERENITIDSSTHVKFTEDQTAFRGKGRF 403 (425) T ss_pred CCCc--eeec--cCCCCCccccceeeEEcCcCCCccEEEEec--c-c-EEEEeecceEEEeecccccccCceEEEEEEee Confidence 2222 2111 124566889999999999999987655442 2 2 445566777777766653 456789999999 Q ss_pred EEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 214 AAYLYDLTKVVNITFTGV 231 (231) Q Consensus 214 ~~~~~~~~~vv~l~~~~~ 231 (231) ++++.+|+++++++++.- T Consensus 404 d~~~~~~~a~~~~~i~~~ 421 (425) T protein:vir:95 404 DGKPVKPEAFVLVTITDP 421 (425) T ss_pred CcEeecccceEEEEecCc Confidence 999999999999999994 No 145 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=99.77 E-value=2.8e-21 Score=133.29 Aligned_cols=216 Identities=15% Similarity=0.083 Sum_probs=160.1 Q ss_pred CCCcccCceEEecc--c-cCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPN--D-IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~--~-igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) -+-++.| ..++|. . .+.+.+++||+..+..++++++.++..++++..+.||++.+.++..|+.+...+++++++++ T Consensus 152 ~~v~~~~-~~~~p~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~Ds~~~l~~~i~~~la~~~~~ 230 (387) T protein:vir:93 152 ARLTNIK-GLEIPRVSYTLDDDDFITDVETAKELKLKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAA 230 (387) T ss_pred eeeeecC-CceEEEEeecCCccccccCcccccccccccceeeeeheeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHH Confidence 1112222 234564 2 24577899999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhccc-----------ccccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccccccCc Q lcl|Aclame:pro 78 KVDDDLLKAAKTT-----------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGAN 146 (231) Q Consensus 78 ~vd~~~~~~l~t~-----------~~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~ 146 (231) +.++.++....++ ...++...++|+|.+++..|.........|+||+.++..+++.....+. T Consensus 231 ~e~~~~~~~g~g~g~p~g~l~~~~~~~v~~~~~~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~~~~~~~d~~~------- 303 (387) T protein:vir:93 231 KERKDALAVSPKSGLDHMSFYNGSVKEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTT------- 303 (387) T ss_pred HHHHhHhhcCCCccccceeeeccccccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCC------- Confidence 8887777543221 1223445679999999998877665666789999998777654332211 Q ss_pred eeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEE Q lcl|Aclame:pro 147 ALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNI 226 (231) Q Consensus 147 ~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l 226 (231) -+..|.-.+++|.||++++.++. .+.-++ +. ++. ...++.++.+++..++...++...+|++++++|++++.+ T Consensus 304 ~~~~~~~~~llG~PV~~~~~~~~--~~~GDf--~~-~~~--~~~~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~eA~~~l 376 (387) T protein:vir:93 304 NFFDTPAEKVFGKPVVFTDAAVK--PIVGDF--NY-FGI--NYDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIA 376 (387) T ss_pred cccccCCccccccceEEecCCCc--eeeeeh--hh-hhe--ehhhheeeecccccCCceeEEEEeeeCceeechhheEEE Confidence 12234556899999999998764 222221 22 222 234567788888888899999999999999999999999 Q ss_pred EeccC Q lcl|Aclame:pro 227 TFTGV 231 (231) Q Consensus 227 ~~~~~ 231 (231) ++++- T Consensus 377 ~~k~~ 381 (387) T protein:vir:93 377 KAKEN 381 (387) T ss_pred EeecC Confidence 99887 No 146 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=99.77 E-value=1.4e-20 Score=129.41 Aligned_cols=226 Identities=8% Similarity=0.053 Sum_probs=145.0 Q ss_pred CCC---cccCceEEeccccCCcc--cc--cCCCccCccccccceeEEEeehccc-eeeecHHHHHhcCCCHHHHHHHHHH Q lcl|Aclame:pro 1 ENG---INLANLCEYPNDIGDAA--DV--AEGGEISLDKIGTTTKSVTIKKAAK-GTEITDEAALSGYGDPIGESNKQLG 72 (231) Q Consensus 1 ~~~---~~~G~ti~~P~~igda~--~v--~EG~~i~~~~lt~~~~~~tikk~g~-~~~itD~~~~~~~~d~~~~~~~~~a 72 (231) |.. -+.||||+||. .++.+ +. ..+..+++++++..+..++|+|..+ .|+++|++..+...++ ++.+++.. T Consensus 36 ~~e~~~~k~GDTV~I~~-p~~~~~~~~~~~~~~~~~~~~l~e~~v~l~id~~k~va~~v~d~E~~~~i~~~-~~~l~~A~ 113 (423) T protein:vir:17 36 LAGEINSSTGDSVSFKR-PHQFSSLRTPTGDISGQNKNNLISGKATGRVGNYITVAVEYQQLEEAIKLNQL-EEILAPVR 113 (423) T ss_pred CcchhhcccCCEEEEee-CCcceeecccCcccCCcccCccccceeEEEeeceeeeeeeecHHHHhcChhHH-HHHHHHHH Confidence 322 25799999984 45433 23 3445678899999999999999877 6999999988888886 88999999 Q ss_pred HHHHHHHHHHHHHHhcc-cccc---c-ccccCHHHHHHHHHHhhccC--CCceEEEECHHHHHHHHhhhhhhhccccccC Q lcl|Aclame:pro 73 LSLANKVDDDLLKAAKT-TSQT---V-STKANVDGVQAALDIFNDED--AQAYVLIVNPKDAAKIRKDANAKNIGSEVGA 145 (231) Q Consensus 73 ~~ia~~vd~~~~~~l~t-~~~~---~-~~~~~~d~i~da~~~l~~~~--~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~ 145 (231) ++||+.+|.+++..+.. +... . +....|+.+.++-..|++.+ ..+|++|++|+.++.|+++..+.......+. T Consensus 114 ~aLA~~vd~~ia~~~~~~a~~~~gt~~t~~~a~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~ 193 (423) T protein:vir:17 114 QRIVTDLETELAHFMMNNGALSLGSPNTPITKWSDVAQTASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVR 193 (423) T ss_pred HHHHHHHHHHHHHHHhhccccccccCCcccccHHHHHHHHHHHHhccCCcCCCEEEeChHHHHHHhccccceecccccch Confidence 99999999999876532 2211 1 22346999999999998776 4689999999999999987665444455677 Q ss_pred ceeeeccc-eeecceeEEEcCCCccCceEEEE--EecC-----CceEEEee-cCCc----cceeccchhhcccEEEEEEE Q lcl|Aclame:pro 146 NALINGTY-ADVLGAQIVRSKKLAEGSALMFK--IVSN-----SPALKLVL-KRGV----QVETDRDIVTKTTVITADEH 212 (231) Q Consensus 146 ~~~~~G~i-g~~~G~~Vv~s~~~~~~~~~~~~--~~~~-----~~A~~~~~-k~~v----~vE~~Rd~~~~~~~i~~~~~ 212 (231) +.+++|.+ |+++|++|+.|+++|..+....- .... +++...-. +.-+ ...++.+-....|++.- T Consensus 194 ~alr~g~i~G~i~GFdvy~Snnip~~T~gt~~~t~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~t~--- 270 (423) T protein:vir:17 194 TAWENAQIPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQPTVTYNAVKDSYQFTVTLTGATTSVTGFLKAGDQVKF--- 270 (423) T ss_pred HHHhhccceeeecceEEEEeCCCccccccceeceeeecccccccccccccccceeeeeeeeeeeccCceeecceEEe--- Confidence 78999987 99999999999999975543221 0000 01100000 0000 11122233334444433 Q ss_pred EEEEEEcCCcE-----------EEEEecc--C Q lcl|Aclame:pro 213 YAAYLYDLTKV-----------VNITFTG--V 231 (231) Q Consensus 213 y~~~~~~~~~v-----------v~l~~~~--~ 231 (231) -|++.++|-.= -..++++ . T Consensus 271 aGv~~v~~~tk~v~~~~~t~~~~~~~v~~~~~ 302 (423) T protein:vir:17 271 TNTYWLQQQTKQALYNGATPISFTATVTADAN 302 (423) T ss_pred cceeeecccccccccccccccceEEEEEeccc Confidence 23333332211 1111211 0 No 147 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=99.76 E-value=3.3e-20 Score=127.37 Aligned_cols=221 Identities=12% Similarity=-0.016 Sum_probs=154.7 Q ss_pred CCCcccCceEEeccc--cCCcccccCCCccCc-cccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--IGDAADVAEGGEISL-DKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--igda~~v~EG~~i~~-~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) .--.-+|....+|.+ .+.+.++.||++++. .+.++++.++++++++..+.||++.+.++..|+.+...++++++|+. T Consensus 119 ~~~~~~~~~~~i~~~~~~~~a~~~~E~~~~~~~~~~~f~~i~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~i~~ 198 (390) T protein:vir:40 119 NFVNTTATTEWIISVGDVATAWWGPLCAEIKEVLDNGFDKIQTGMYKLSAYIPVCNAMLDLGPSWLDQYVRTILGEAMAL 198 (390) T ss_pred eeeecCCceeEEEEEcCCcceeeeccccccCccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHH Confidence 111123445668876 445678999999875 58999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhccc--------c---------cccccccCHHHHHHHHHHhhc-------cCCCceEEEECHHHHHHHHhh Q lcl|Aclame:pro 78 KVDDDLLKAAKTT--------S---------QTVSTKANVDGVQAALDIFND-------EDAQAYVLIVNPKDAAKIRKD 133 (231) Q Consensus 78 ~vd~~~~~~l~t~--------~---------~~~~~~~~~d~i~da~~~l~~-------~~~~~~v~vv~p~~~~~L~k~ 133 (231) ++|..++.+-.+. . .......+.+++.++...+.. ......+++|||.++..+++. T Consensus 199 ~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~~ 278 (390) T protein:vir:40 199 GLEAGIVNGSGKDQPIGMMRDLNNVTAGEHPVKTATPLTDLTPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIYA 278 (390) T ss_pred HHHhhhhcccCCCccceeeeccccccccccccccccccchhhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHHH Confidence 9999998742110 0 011122344444444433322 123456799999887544432 Q ss_pred hhhhhccccccCceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccch--hhcccEEEEEE Q lcl|Aclame:pro 134 ANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDI--VTKTTVITADE 211 (231) Q Consensus 134 ~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~--~~~~~~i~~~~ 211 (231) .... ....+..+.. ..++|.||+.|+.||+++.++.++ +. +.+...+++.++.+++. .+..+.+++.+ T Consensus 279 ~~~~---~d~~G~~v~~---~~~~g~pvv~~~~~p~~~i~~Gd~--s~--~~i~~~~~~~v~~~~~~~f~~~~~~~r~~~ 348 (390) T protein:vir:40 279 ATSY---MTPQGVWVTG---ILPVPLEIVQSVAVPVGKAVAGRA--KD--YFMGIGSEQVIRTSTEYRLLDDETLYYAKQ 348 (390) T ss_pred Hhhc---cCCCCccccc---cCCCceeEEEcCCCCCCcEEEEee--ce--EEEEeecceEEEecchhhhhcCcEEEEEEE Confidence 1110 0011111111 235799999999999998765543 22 44566778888887765 55778999999 Q ss_pred EEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 212 HYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 212 ~y~~~~~~~~~vv~l~~~~~ 231 (231) ++++++++|++++++++++. T Consensus 349 r~dg~v~~~~A~~~l~~~~~ 368 (390) T protein:vir:40 349 YANGRPKDNSSFLVFDITGL 368 (390) T ss_pred EeCCEEecccceEEEEeecc Confidence 99999999999999999999 No 148 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=99.74 E-value=6.9e-20 Score=125.62 Aligned_cols=229 Identities=8% Similarity=0.039 Sum_probs=143.1 Q ss_pred CCCc---ccCceEEeccccCCcc--ccc--CCCccCccccccceeEEEeehccc-eeeecHHHHHhcCCCHHHHHHHHHH Q lcl|Aclame:pro 1 ENGI---NLANLCEYPNDIGDAA--DVA--EGGEISLDKIGTTTKSVTIKKAAK-GTEITDEAALSGYGDPIGESNKQLG 72 (231) Q Consensus 1 ~~~~---~~G~ti~~P~~igda~--~v~--EG~~i~~~~lt~~~~~~tikk~g~-~~~itD~~~~~~~~d~~~~~~~~~a 72 (231) |... +.||||+||. .++.+ ++. ++..+.+++++..+..++|+|..+ .|+++|++..+...+. ++.+++.. T Consensus 36 ~~ef~~~k~GDTV~I~~-p~~~~~~d~~~~~~~~~~~~dl~e~~v~l~id~~k~va~~v~d~E~~~~i~~~-~~~l~~A~ 113 (423) T protein:vir:10 36 LAGEINSSTGDSVSFKR-PHQFSSLRTPTGDISGQNKNNLISGKATGRVGNYITVAVEYQQLEEAIKLNQL-EEILAPVR 113 (423) T ss_pred CCcccccccCCEEEEee-CCceeeeccCCccccccccCccccceeEEEeeceeeeeeeechHHHhcChhhH-HHHHHHHH Confidence 3332 5799999974 44433 333 455688999999999999999877 6999999988887775 88999999 Q ss_pred HHHHHHHHHHHHHHhccc-cc---cc-ccccCHHHHHHHHHHhhccC--CCceEEEECHHHHHHHHhhhhhhhccccccC Q lcl|Aclame:pro 73 LSLANKVDDDLLKAAKTT-SQ---TV-STKANVDGVQAALDIFNDED--AQAYVLIVNPKDAAKIRKDANAKNIGSEVGA 145 (231) Q Consensus 73 ~~ia~~vd~~~~~~l~t~-~~---~~-~~~~~~d~i~da~~~l~~~~--~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~ 145 (231) ++||+.+|.+++...... .. .. +....|+.+.++-..|++.+ ..+|++|++|+.+..|+++..+.......+. T Consensus 114 ~aLA~~vd~~ia~~~~~~~~~~~gt~~t~~~a~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~ 193 (423) T protein:vir:10 114 QRIVTDLETELAHFMMNNGALSLGSPNTPITKWSDVAQTASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVR 193 (423) T ss_pred HHHHHHHHHHHHHHHhhccccccccCCcccchHHHHHHHHHHHHhccCCcCCCEEEeChHHHHHHhccccceecccccch Confidence 999999999998764332 11 11 12346899999999998776 4689999999999999987665555556777 Q ss_pred ceeeeccc-eeecceeEEEcCCCccCceEEEE----EecC---CceEEEee-cCCcc----ceeccchhhcccEEEEEEE Q lcl|Aclame:pro 146 NALINGTY-ADVLGAQIVRSKKLAEGSALMFK----IVSN---SPALKLVL-KRGVQ----VETDRDIVTKTTVITADEH 212 (231) Q Consensus 146 ~~~~~G~i-g~~~G~~Vv~s~~~~~~~~~~~~----~~~~---~~A~~~~~-k~~v~----vE~~Rd~~~~~~~i~~~~~ 212 (231) +.+++|.+ |+++|++|+.|+++|..+....- +... +++...-. +..+. ..++..-...-|.+.-.-. T Consensus 194 ~alr~g~i~G~i~GFdv~~Snnip~~T~gt~~~t~~~~~~~~v~~~a~~~a~~~~~~~~~~~~~~~~~l~~GD~~t~aGv 273 (423) T protein:vir:10 194 TAWENAQIPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQPTVTYNAVKDSYQFTVTLTGATASVTGFLKAGDQVKFTNT 273 (423) T ss_pred hhhhhccceeeecceEEEEeCCCccccccccccceeeeecceeccccccccceeeeeeeeccccccCceeecceEEecce Confidence 88999987 99999999999999975443210 0000 00000000 00000 0111122222333322222 Q ss_pred EEEEEEcCC--------cEEEEEeccC Q lcl|Aclame:pro 213 YAAYLYDLT--------KVVNITFTGV 231 (231) Q Consensus 213 y~~~~~~~~--------~vv~l~~~~~ 231 (231) +.++-..-. ..-..++++. T Consensus 274 ~~v~~~tk~~~~~~~t~~~~~~~v~a~ 300 (423) T protein:vir:10 274 YWLQQQTKQALYNGATPISFTATVTAD 300 (423) T ss_pred eeecccccccccccccCcceEEEEEee Confidence 222222211 0011111111 No 149 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=99.73 E-value=4e-20 Score=126.93 Aligned_cols=230 Identities=13% Similarity=0.064 Sum_probs=150.1 Q ss_pred CCCcccCceEEecccc-CC--cccccCCCcc-----CccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDI-GD--AADVAEGGEI-----SLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLG 72 (231) Q Consensus 1 ~~~~~~G~ti~~P~~i-gd--a~~v~EG~~i-----~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a 72 (231) ..-...+..+++|+.. |. +..++||+.+ |..+++++..++..++++..+.||++.+.++..++.+...++++ T Consensus 194 ~~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~ 273 (477) T protein:vir:84 194 EPLPGGTSSINIPKILTGTSTAIQAADNAALTAPSAHEVDLTDGFVQANVKTIAGQQGIAIQLLDQAAVSVDEFVFRDLA 273 (477) T ss_pred eeecCCcceeEEEEEecCcceeeeeccCcccccccccccccceeeEEEeeeeEEeeeHHHHHHHhccchhHHHHHHHHHH Confidence 1112345568888753 33 3468898754 56678999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHhccc---------cc----ccc-cc-------cCHHHHHHHHHHhhccCC-CceEEEECHHHHHHH Q lcl|Aclame:pro 73 LSLANKVDDDLLKAAKTT---------SQ----TVS-TK-------ANVDGVQAALDIFNDEDA-QAYVLIVNPKDAAKI 130 (231) Q Consensus 73 ~~ia~~vd~~~~~~l~t~---------~~----~~~-~~-------~~~d~i~da~~~l~~~~~-~~~v~vv~p~~~~~L 130 (231) ++|+.++|..++.+-.+. +. ..+ .. ..++.|.++...+..... ...+++|||..+..| T Consensus 274 ~~~~~~~d~~~l~G~Gt~~~p~Gi~~~~~~~~~~~~~~~~t~~~~~~~~~~i~~~~~~~~~~~~~~~~~~v~~~~~~~~l 353 (477) T protein:vir:84 274 ADYANKLNVQVISGTGSNNQVVGVRATAGITQVTATSAGSALEKHQIIYQKIADAIQRVHTSRFLEPEVIVMHPRRWASF 353 (477) T ss_pred HHHHHHHHHHHhccCCCCCccceeeeccccccccccccccchhhHHHHHHHHHHHHhhccccccCCccEEEEcHHHHHHH Confidence 999999999998653321 00 011 11 134556666665554332 456899999999999 Q ss_pred Hhhhhhhhccccc-----------cCceeeeccceeecceeEEEcCCCccCceEEE----EEecCCceEEEeecCCccce Q lcl|Aclame:pro 131 RKDANAKNIGSEV-----------GANALINGTYADVLGAQIVRSKKLAEGSALMF----KIVSNSPALKLVLKRGVQVE 195 (231) Q Consensus 131 ~k~~~~~~~~~~~-----------~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~----~~~~~~~A~~~~~k~~v~vE 195 (231) ++.++..++.-.. ....+.+|..|+++|+||++|+.||++.+... .+...-..+-+. ..++.++ T Consensus 354 ~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G~pVv~s~~~p~~~~~~~d~~~i~~gd~~~~~i~-~~~~~~~ 432 (477) T protein:vir:84 354 HAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHGLPVVTDPTLPTTLGTGTDQDVIHVLRASDLALF-ESSVRMR 432 (477) T ss_pred HHhhccCCCeeeecCcccccccccccccccccccchhcccceEecCcccccccccCCcceEEEEEeceEEEE-eeceeEE Confidence 8877655432211 11234556778999999999999997544211 111112223333 3455554 Q ss_pred eccc--hhhcccEEEEEEEEEEEEE-cCCcEEEEEeccC Q lcl|Aclame:pro 196 TDRD--IVTKTTVITADEHYAAYLY-DLTKVVNITFTGV 231 (231) Q Consensus 196 ~~Rd--~~~~~~~i~~~~~y~~~~~-~~~~vv~l~~~~~ 231 (231) .+++ .......++...+++...+ .|+++|.+|.+|- T Consensus 433 ~~~~~~~~~~~~~~~v~~~~~~~~~r~~~afv~~t~~~~ 471 (477) T protein:vir:84 433 ALQETRAENLSVLLQVYGYLAFTAARFPQSVVEIGGTAL 471 (477) T ss_pred eccccccccceeeeeehhhhhhhhhccccceEEeecccc Confidence 4433 2223333333334444344 5999999999999 No 150 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=99.69 E-value=1.1e-18 Score=119.10 Aligned_cols=225 Identities=9% Similarity=0.063 Sum_probs=137.1 Q ss_pred CCC---cccCceEEecc-ccCCcccccCCCcc---CccccccceeEEEeehccc-eeeecHHHHHhcCCCHHHHHHHHHH Q lcl|Aclame:pro 1 ENG---INLANLCEYPN-DIGDAADVAEGGEI---SLDKIGTTTKSVTIKKAAK-GTEITDEAALSGYGDPIGESNKQLG 72 (231) Q Consensus 1 ~~~---~~~G~ti~~P~-~igda~~v~EG~~i---~~~~lt~~~~~~tikk~g~-~~~itD~~~~~~~~d~~~~~~~~~a 72 (231) |.. -+.||||+||. -.+.+.+ ..+..+ ++++++..+..++|+|..+ .|+++|++..++..+. ++++++.. T Consensus 36 ~~ef~~ak~GDTV~I~~P~~~~~~d-~~~~~~t~~~~~~l~e~~v~l~id~~k~~a~~v~d~E~~l~i~~~-~~~l~~A~ 113 (423) T protein:vir:10 36 LAGEINSSTGDSVSFKRPHQFKSER-TMDGDITGKSKNSLISAKATGEVGNYITVAVEYRQIEEALKLNQL-DQILVPIN 113 (423) T ss_pred CccccccccCCEEEEeeCCceeeec-ccCcccCcccccccccceEEEEecceeeeeeeeChHHHhcChhHH-HHHHHHHH Confidence 322 35799999974 1223333 222222 4567888889999999877 6999999988787777 78999999 Q ss_pred HHHHHHHHHHHHHHhcc-ccccc----ccccCHHHHHHHHHHhhccC--CCceEEEECHHHHHHHHhhhhhhhccccccC Q lcl|Aclame:pro 73 LSLANKVDDDLLKAAKT-TSQTV----STKANVDGVQAALDIFNDED--AQAYVLIVNPKDAAKIRKDANAKNIGSEVGA 145 (231) Q Consensus 73 ~~ia~~vd~~~~~~l~t-~~~~~----~~~~~~d~i~da~~~l~~~~--~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~ 145 (231) ++||+.+|.++...+.. ++..+ +....|+.+.++-..|++.+ ..+|++|++|+.++.|+++..+.......+. T Consensus 114 ~aLA~~vd~~ia~~~~~~~~~~vgt~~t~~~a~~~~a~a~~~L~~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~ 193 (423) T protein:vir:10 114 ERMVTDLETELALFMMKHGALSLGSPNTPIKKWSDVAQTASFLKDLGINSGENYAVMDPWAAQRLADAQSGLHVSEQLVR 193 (423) T ss_pred HHHHHHHHHHHHHHhhhcccccccccccccccHHHHHHHHHHHhhccCCcCCCEEEeCHHHHHHHhhhhhhhccccccch Confidence 99999999999754422 22211 22345899999999998766 4679999999999999876665555556677 Q ss_pred ceeeeccc-eeecceeEEEcCCCcc---Cce-EEEEEecCCceEEEeecCCcc-ceecc-----------chhhcccEEE Q lcl|Aclame:pro 146 NALINGTY-ADVLGAQIVRSKKLAE---GSA-LMFKIVSNSPALKLVLKRGVQ-VETDR-----------DIVTKTTVIT 208 (231) Q Consensus 146 ~~~~~G~i-g~~~G~~Vv~s~~~~~---~~~-~~~~~~~~~~A~~~~~k~~v~-vE~~R-----------d~~~~~~~i~ 208 (231) +.+++|.+ |+++|++|+.|+++|. ++. .... ..+...+..-.+. .+..+ --....|++. T Consensus 194 ~alr~~~i~G~~~GFdi~~Sn~vp~~T~g~~~ga~~----~~~~~~vt~a~~~~~~~~~~~~~~~T~s~~g~l~~GD~~t 269 (423) T protein:vir:10 194 TAWENAQISGNFGGIRALMSNGLASRTQGAFGGKLT----VKGTPEVNYDSVKDSYAFTATLTGATASKKGFLKVGDQLQ 269 (423) T ss_pred HHHHhcccceeecceEEEEecCCcccccccccceee----eeeeeEEEecccccccccccceeeccceeceeEEecceEe Confidence 88999976 9999999999999984 221 1111 0111111111110 00000 0112222222 Q ss_pred EEEEEEEEEEcCCcE--------EEEEecc--C Q lcl|Aclame:pro 209 ADEHYAAYLYDLTKV--------VNITFTG--V 231 (231) Q Consensus 209 ~~~~y~~~~~~~~~v--------v~l~~~~--~ 231 (231) ..-.+.++.+.-..+ -..++.+ + T Consensus 270 ~aGv~~v~~~tk~~l~~~~~~~~~~~~V~~~~~ 302 (423) T protein:vir:10 270 FDDTHWLNQQSKQTLYNGASALSFTATVMEDAN 302 (423) T ss_pred ecceeeecccccceeecccCCcceEEEEEeccc Confidence 222222222222110 1112211 0 No 151 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=99.68 E-value=1.2e-18 Score=118.86 Aligned_cols=224 Identities=12% Similarity=0.057 Sum_probs=154.7 Q ss_pred CCCcccCceEEeccc--cCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANK 78 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~~ 78 (231) =.-.+.+.++++|.. ...+.+++||++++..++++++.++.+++++..+.||++...++..|+.+...++++++++.+ T Consensus 183 ~~v~~~~g~~~~~~~~~~~~a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~ 262 (466) T protein:vir:80 183 VRLRPLKGTARQNIAGAIPEGVWTEAVANLNELSLSFSQIEVDGYKVGGFIPIPNSTLEDSDLNLADEILDAIGQAIGFA 262 (466) T ss_pred eeeeecCceeEeeeecCCcceeecccccccccccccccceeecceeeeeehhhhHHHHhcchHHHHHHHHHHHHHHHHHH Confidence 001111223455533 345678999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhccc------------cc--c---cc---cccCHHHH--------------HHH---HHHhhccC-CCceEE Q lcl|Aclame:pro 79 VDDDLLKAAKTT------------SQ--T---VS---TKANVDGV--------------QAA---LDIFNDED-AQAYVL 120 (231) Q Consensus 79 vd~~~~~~l~t~------------~~--~---~~---~~~~~d~i--------------~da---~~~l~~~~-~~~~v~ 120 (231) +|..++++-.+. +. . .+ ..++...+ .+. ...+.... ....+| T Consensus 263 ~~~ail~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w 342 (466) T protein:vir:80 263 LDKAILYGTGTKMPVGIVTRLAQTTQPPNWGTKAPAWTNLSTTNLLKIDPTGKSAEEFFSELVLKLSKARANYSNGMKFW 342 (466) T ss_pred HhhheeeccCCCCcceeeecccccccccccccccccccccchhhhhhhhhhccchhhHHHHHHHHHHhhhccccCCceeE Confidence 999988642211 00 0 00 01111111 111 11222222 344578 Q ss_pred EECHHHHHHHHhhhhhhhccccccCceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccch Q lcl|Aclame:pro 121 IVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDI 200 (231) Q Consensus 121 vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~ 200 (231) ++|+..+..|++.....+........ ..++ ..++|.||+.|++||++..++.+ ...+.++..+++.++.+++. T Consensus 343 ~~~~~~~~~l~~~~~~~~~~g~~~~~-~~~~--~~i~G~pvv~s~~~~~~~~~~g~----~~~y~i~~r~~~~i~~~~~~ 415 (466) T protein:vir:80 343 AMSSNTHAVLMSKAITFNSAGALVAS-LNNT--MPIVGGDIVILDFIPDNDIIGGY----GSLYLLAERADIKLAQSEHV 415 (466) T ss_pred EecchhHHHhhcccccccCCcccccc-CCCc--ccccccceeecCccCccceeeec----cccEEEEeecceEEEechhh Confidence 99999999988765432221111111 1122 35899999999999998854433 23455777788888888766 Q ss_pred h--hcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 201 V--TKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 201 ~--~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) . ...+.+++.+|+++++++|+++++++++.. T Consensus 416 ~f~~d~~~~r~~~r~dg~~~~~~afv~~~~~~~ 448 (466) T protein:vir:80 416 RFIEDQTVFKGTARYDGKPVFGEGFVAVNIANA 448 (466) T ss_pred hhhcCcEEEEEEEEEccEEeccCceEEEEecCC Confidence 5 566789999999999999999999999888 No 152 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=99.61 E-value=2.4e-17 Score=111.72 Aligned_cols=217 Identities=16% Similarity=0.117 Sum_probs=153.0 Q ss_pred CCCcccCceEEeccc--cCCcccccCCCccC-ccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--IGDAADVAEGGEIS-LDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--igda~~v~EG~~i~-~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) -+-++.|-++.+|.. .+.+.++.|+.+++ ..+.++++.++..++....+.||++...++..|+.+...++++.+++. T Consensus 120 ~~v~~~~~~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~iS~ell~ds~~~ie~~i~~~la~~ia~ 199 (395) T protein:vir:95 120 INFQNAGIKTRVIKADPAGQAVWGKVFGEIKGQLDAAFREENFTQYKLTCFVVLPDDLSTFGPAWIERFVRTQIQEAISV 199 (395) T ss_pred ceeEecCCceEEEEecCCcceEEeecccccCccccccceeeeeceeeEEEeecccHHHHhcchhHHHHHHHHHHHHHHHH Confidence 112223335678865 45566778877775 468999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhccc---c----------------cccccccCHHHHHHHHHHhhc--------------cCCCceEEEECH Q lcl|Aclame:pro 78 KVDDDLLKAAKTT---S----------------QTVSTKANVDGVQAALDIFND--------------EDAQAYVLIVNP 124 (231) Q Consensus 78 ~vd~~~~~~l~t~---~----------------~~~~~~~~~d~i~da~~~l~~--------------~~~~~~v~vv~p 124 (231) ++|+.++.+-.+. + ...+...+++.+..+...+.+ .......++||| T Consensus 200 ~~~~a~i~G~G~~~~qP~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~ 279 (395) T protein:vir:95 200 ALESAIINGGGAAKTQPVGLMKDVNTNSGAVTDKASSGTLTFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNP 279 (395) T ss_pred HHhhheeeccCCCCcCceeeeecccccccccccccccchhhhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcc Confidence 9999988643221 1 011122344444433333221 112334689999 Q ss_pred HHHHHHHhhhhhhhccccccCceeeeccceeec--ceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccch-- Q lcl|Aclame:pro 125 KDAAKIRKDANAKNIGSEVGANALINGTYADVL--GAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDI-- 200 (231) Q Consensus 125 ~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~--G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~-- 200 (231) .++..+....-+.. .+|...+++ |+||+.|+.||+++.++.++ +. +.+..+.++.++..++. T Consensus 280 ~t~~~~~g~~~~~~----------~~G~~~~~lg~g~~v~~~~~~p~~~i~fgdf--s~--y~i~~r~~~~i~~~~~~~~ 345 (395) T protein:vir:95 280 RDSWDVQARYTYLT----------ANGGFVTVLPYNVTIITSEFVPEGKLVAFVT--DR--YNAVRGGGLTVKKFDQTLA 345 (395) T ss_pred hhhhhcCCcceecc----------CCCcceeccCCcceEEEcCCCCCCcEEEEec--cc--EEEEEecceEEEeccchhh Confidence 99887765433221 135555665 66799999999998665543 22 56677778888766654 Q ss_pred hhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 201 VTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 201 ~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) .++.+.+++..|++.+++||++++.++++.- T Consensus 346 ~~d~~~f~~~~r~dg~~~~~~A~~~l~i~~~ 376 (395) T protein:vir:95 346 LEDAVLFTAKTFAYGQPDDNKASAVYDLKVA 376 (395) T ss_pred hCCcEEEEEEEEECCEEeccccEEEEEeecc Confidence 4577889999999999999999999999844 No 153 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=99.58 E-value=7.4e-17 Score=109.00 Aligned_cols=217 Identities=18% Similarity=0.103 Sum_probs=154.4 Q ss_pred CCCcccCceEEeccc--cCCcccccCCCccCc-cccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--IGDAADVAEGGEISL-DKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--igda~~v~EG~~i~~-~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) =+-++.+....+|.- .+.+.+++|+++++. .+.++++.++..++....+.||.+...++..|+.+...+++++++++ T Consensus 113 ~~v~~~~~~~~i~~~~~~~~a~wv~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~~ll~ds~~~le~~i~~~l~~~~~~ 192 (377) T protein:vir:96 113 INFKNTSLRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAV 192 (377) T ss_pred ceeEecCCceEEEEecCCcceeEeecccccccccCccceeEeeeeeeEEeechhhHHHhhcchhhHHHHHHHHHHHHHHH Confidence 011223345677754 456778999999864 57999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhccc------------c-----cc----------c---ccccCHHHHHHHHHHh----hccC-------CC Q lcl|Aclame:pro 78 KVDDDLLKAAKTT------------S-----QT----------V---STKANVDGVQAALDIF----NDED-------AQ 116 (231) Q Consensus 78 ~vd~~~~~~l~t~------------~-----~~----------~---~~~~~~d~i~da~~~l----~~~~-------~~ 116 (231) +++..++++-.+. . .. . ....+.+.+.+.+..+ ...+ .. T Consensus 193 ~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~ 272 (377) T protein:vir:96 193 ALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAG 272 (377) T ss_pred HHhhceEeccCCCcceeeeeccccccccccccccccceeeccccccccccCChhHHHHHHHHHHHhhccccccccccccC Confidence 9999988642110 0 00 0 0112334444433332 1111 13 Q ss_pred ceEEEECHHHHHHHHhhhhhhhccccccCceeeeccceeecce--eEEEcCCCccCceEEEEEecCCceEEEeecCCccc Q lcl|Aclame:pro 117 AYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGA--QIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQV 194 (231) Q Consensus 117 ~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~--~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~v 194 (231) ..+++|||.++..++....+.. .+|...+++|+ +|+.|+.+|+++.+..++ + .+.+..+.++.+ T Consensus 273 ~a~~~mn~~t~~~~~~~~~~~~----------~~G~~~~~l~~p~~v~~s~~~p~~~i~fgdf--~--~Y~i~~r~~~~i 338 (377) T protein:vir:96 273 QVKLLLNPEDRWTLEAKFTSRN----------QFGEYVTVLPHGITILESLAVETGKAIAFVA--N--RYDAFMATASTI 338 (377) T ss_pred ceEEEEchhhHHhccccccccC----------CCCCceeccCCCceEEecCCCCcccEEEEEc--C--cEEEEEecccEE Confidence 4579999999988764333221 23555566654 588899999998665442 3 367788888888 Q ss_pred eeccch--hhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 195 ETDRDI--VTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 195 E~~Rd~--~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) +..++. .++.+.+++.+|++.++++|++++++++++= T Consensus 339 ~~~~~~~~~~d~~~f~~~~r~dG~~~d~~a~~vl~l~~~ 377 (377) T protein:vir:96 339 EEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred EeehhhhhhcCCeEEEEEEEEcCEEecCCcEEEEEEecC Confidence 876654 4677899999999999999999999998888 No 154 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=99.58 E-value=1.4e-17 Score=112.93 Aligned_cols=224 Identities=16% Similarity=0.084 Sum_probs=152.5 Q ss_pred CCCcccCceEEeccc--cCCcccccCCCccC-ccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--IGDAADVAEGGEIS-LDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--igda~~v~EG~~i~-~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) .+ + +|+ +++|.. .+.+.++.|+++++ ..+.++++.++..++....+.+|.+...++..|+.+...+++++++++ T Consensus 116 ~~-~-~~~-~~~~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~a~~~is~elL~ds~~~ie~~i~~~la~~~a~ 192 (377) T protein:vir:98 116 KN-T-SLR-LKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWIKQFITEQLKEAIAV 192 (377) T ss_pred Ee-c-Ccc-eEEEEecCCcceeEeecccccCcccCccceeEeecceeEEeeecccHHhhhccHhHHHHHHHHHHHHHHHH Confidence 32 1 243 578865 45567899998886 457899999999999988899999999999999999999999999999 Q ss_pred HHHHHHHHHhccc---------cc----------ccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhh Q lcl|Aclame:pro 78 KVDDDLLKAAKTT---------SQ----------TVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKN 138 (231) Q Consensus 78 ~vd~~~~~~l~t~---------~~----------~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~ 138 (231) +++..++.+-.+. +. +.+.....+.+.+....+....-...+++||+.....+++.++..+ T Consensus 193 ~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~a~~~m~~~t~~~~~klkd~~G 272 (377) T protein:vir:98 193 ALELAIVKGDGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLTPDNAPKKLVPVMKHLSVNDKKRPLKIAG 272 (377) T ss_pred HHhhceEeccCCCcceeeeecccccccccccccccccccchhhhHhhhhhhchhHHHHHHHHHHHHHHHHHHhhhhccCC Confidence 9999988753211 00 0001112233433333332222223345555555555555443332 Q ss_pred cccc------------ccCceeeeccceeeccee--EEEcCCCccCceEEEEEecCCceEEEeecCCccceeccch--hh Q lcl|Aclame:pro 139 IGSE------------VGANALINGTYADVLGAQ--IVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDI--VT 202 (231) Q Consensus 139 ~~~~------------~~~~~~~~G~ig~~~G~~--Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~--~~ 202 (231) .... .......+|...+++|+| |+.|+.+|+++....++ + .+.++.+.++.++..++. .+ T Consensus 273 ~~i~~~n~~~~~~~~p~~~~~~~~G~~~t~lg~p~~vv~s~~~p~~~i~fgdf--~--~Y~i~~r~~~~i~~~~~~~~~~ 348 (377) T protein:vir:98 273 QVKLILNPEDRWALEAQFTSRNQFGEYVTVLPHGITILESLAVETGKAIAFVA--N--RYDAFMATASTIEEYDQTFAME 348 (377) T ss_pred ceEEEecccchhhccccccccCCCCccccccCCCceEEecCCCCcccEEEEEe--c--ceeEEeecceEEEeechhhhhc Confidence 2111 000112357777888776 78899999998765442 2 367778888888876554 45 Q ss_pred cccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 203 KTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 203 ~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) +.+.+++..|++.+++||++++++++++= T Consensus 349 d~~~f~~~~r~dg~~~~~~a~~vl~i~~~ 377 (377) T protein:vir:98 349 DLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred CceEEEEEEEEcCEEeccCcEEEEEEecC Confidence 77899999999999999999999999988 No 155 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=99.58 E-value=6.6e-17 Score=109.30 Aligned_cols=217 Identities=15% Similarity=0.122 Sum_probs=152.5 Q ss_pred CCCcccCceEEeccc--cCCcccccCCCccCc-cccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--IGDAADVAEGGEISL-DKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--igda~~v~EG~~i~~-~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) =+-++.+-...+|.. .+.+.+++|+++++. .+.++++.++..++.+..+.||.+...++..|+.+...++++++++. T Consensus 110 ~~v~~~~~~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~ 189 (381) T protein:vir:10 110 LGIKNAGLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAV 189 (381) T ss_pred eeeEecCcceEEEEecCCcceeeecccccccccccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHH Confidence 011122234577765 455678999998864 47899999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhccc---------cc------------ccccc-------cCHHHHHHHHHHhhcc------C-CCceEEEE Q lcl|Aclame:pro 78 KVDDDLLKAAKTT---------SQ------------TVSTK-------ANVDGVQAALDIFNDE------D-AQAYVLIV 122 (231) Q Consensus 78 ~vd~~~~~~l~t~---------~~------------~~~~~-------~~~d~i~da~~~l~~~------~-~~~~v~vv 122 (231) +++..++.+-.+. .. ..... ..++.+.+....+.-. . ....+++| T Consensus 190 ~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~m 269 (381) T protein:vir:10 190 ALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVV 269 (381) T ss_pred HhhheeEeccCCCCceeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEE Confidence 9999888643210 00 01111 1244455444444311 1 23456899 Q ss_pred CHHHHHHHHhhhhhhhccccccCceeeeccceee--cceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccc- Q lcl|Aclame:pro 123 NPKDAAKIRKDANAKNIGSEVGANALINGTYADV--LGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRD- 199 (231) Q Consensus 123 ~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~--~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd- 199 (231) ||.+++.|++.+...+. +|.+-.. +|.+|+.|+.||+++.++.++ + .+.++.+.++.++..++ T Consensus 270 n~~t~~~l~~~~~~~~~----------~G~~v~~l~~g~~vv~s~~~p~~~iifgDf--s--~Y~i~~r~~~~i~~~~~~ 335 (381) T protein:vir:10 270 NPSDAFEVQAQYTHLNA----------NGVYVTALPFNLNVIESTVQEAGKVLTYVK--G--LYDGYLAGGINVQKFKET 335 (381) T ss_pred ccccHHhhccccccCCC----------CCceeecCCCCceEEecCCCCcCcEEEEec--c--cEEEEEecccEEEeechh Confidence 99999999876544321 2222222 577899999999998765543 2 36677788888877655 Q ss_pred -hhhcccEEEEEEEEEEEEEcCCcEEEEEec--cC Q lcl|Aclame:pro 200 -IVTKTTVITADEHYAAYLYDLTKVVNITFT--GV 231 (231) Q Consensus 200 -~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~--~~ 231 (231) ..++.+.+++..|++.++++|+++++++++ +. T Consensus 336 ~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~ 370 (381) T protein:vir:10 336 LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGH 370 (381) T ss_pred HhhcCCeEEEEEEEEcCEEecCceEEEEEEEecCC Confidence 566778999999999999999998885544 45 No 156 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=99.58 E-value=6.6e-17 Score=109.30 Aligned_cols=217 Identities=15% Similarity=0.122 Sum_probs=152.5 Q ss_pred CCCcccCceEEeccc--cCCcccccCCCccCc-cccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--IGDAADVAEGGEISL-DKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--igda~~v~EG~~i~~-~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) =+-++.+-...+|.. .+.+.+++|+++++. .+.++++.++..++.+..+.||.+...++..|+.+...++++++++. T Consensus 110 ~~v~~~~~~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~ 189 (381) T protein:vir:95 110 LGIKNAGLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAV 189 (381) T ss_pred eeeEecCcceEEEEecCCcceeeecccccccccccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHH Confidence 011122234577765 455678999998864 47899999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhccc---------cc------------ccccc-------cCHHHHHHHHHHhhcc------C-CCceEEEE Q lcl|Aclame:pro 78 KVDDDLLKAAKTT---------SQ------------TVSTK-------ANVDGVQAALDIFNDE------D-AQAYVLIV 122 (231) Q Consensus 78 ~vd~~~~~~l~t~---------~~------------~~~~~-------~~~d~i~da~~~l~~~------~-~~~~v~vv 122 (231) +++..++.+-.+. .. ..... ..++.+.+....+.-. . ....+++| T Consensus 190 ~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~m 269 (381) T protein:vir:95 190 ALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVV 269 (381) T ss_pred HhhheeEeccCCCCceeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEE Confidence 9999888643210 00 01111 1244455444444311 1 23456899 Q ss_pred CHHHHHHHHhhhhhhhccccccCceeeeccceee--cceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccc- Q lcl|Aclame:pro 123 NPKDAAKIRKDANAKNIGSEVGANALINGTYADV--LGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRD- 199 (231) Q Consensus 123 ~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~--~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd- 199 (231) ||.+++.|++.+...+. +|.+-.. +|.+|+.|+.||+++.++.++ + .+.++.+.++.++..++ T Consensus 270 n~~t~~~l~~~~~~~~~----------~G~~v~~l~~g~~vv~s~~~p~~~iifgDf--s--~Y~i~~r~~~~i~~~~~~ 335 (381) T protein:vir:95 270 NPSDAFEVQAQYTHLNA----------NGVYVTALPFNLNVIESTVQEAGKVLTYVK--G--LYDGYLAGGINVQKFKET 335 (381) T ss_pred ccccHHhhccccccCCC----------CCceeecCCCCceEEecCCCCcCcEEEEec--c--cEEEEEecccEEEeechh Confidence 99999999876544321 2222222 577899999999998765543 2 36677788888877655 Q ss_pred -hhhcccEEEEEEEEEEEEEcCCcEEEEEec--cC Q lcl|Aclame:pro 200 -IVTKTTVITADEHYAAYLYDLTKVVNITFT--GV 231 (231) Q Consensus 200 -~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~--~~ 231 (231) ..++.+.+++..|++.++++|+++++++++ +. T Consensus 336 ~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~ 370 (381) T protein:vir:95 336 LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGH 370 (381) T ss_pred HhhcCCeEEEEEEEEcCEEecCceEEEEEEEecCC Confidence 566778999999999999999998885544 45 No 157 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=99.56 E-value=7.9e-16 Score=103.38 Aligned_cols=227 Identities=15% Similarity=0.108 Sum_probs=162.0 Q ss_pred CCCcc---------cCceEEeccc-cC-----CcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcC--CCH Q lcl|Aclame:pro 1 ENGIN---------LANLCEYPND-IG-----DAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGY--GDP 63 (231) Q Consensus 1 ~~~~~---------~G~ti~~P~~-ig-----da~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~--~d~ 63 (231) ++.++ +-.+.++|+. .| ...+-+|+++.+.++.++++.++..+|....+.|+++...++. .|+ T Consensus 39 ~s~i~~~a~vi~t~~s~~~~i~~i~~g~~~~~~~~~~~~~~~~~~~~~tf~~~~l~~~kl~~~v~is~e~L~D~a~~~~l 118 (314) T protein:vir:41 39 NSAIIKDARVLNALKSYEVDISRISLGVELEPGRNTSGTKVAPTADEVTVSTNTLEMKELVTKVVLEDEALEDNIEQSAF 118 (314) T ss_pred ccchhhheeeecccCccceeecccccCcccccccccccCCccCCcccccccceeeeeEEEEEeecccHHHHHhhhchhhH Confidence 22222 2234566653 11 1223456667788999999999999999999999999998885 499 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcc-----------------cccc------cccccCHHHHHHHHHHhhccC---CCc Q lcl|Aclame:pro 64 IGESNKQLGLSLANKVDDDLLKAAKT-----------------TSQT------VSTKANVDGVQAALDIFNDED---AQA 117 (231) Q Consensus 64 ~~~~~~~~a~~ia~~vd~~~~~~l~t-----------------~~~~------~~~~~~~d~i~da~~~l~~~~---~~~ 117 (231) .+..++++|+++++.....++++-.+ +... .+..++.+.+.+++..|.... ... T Consensus 119 e~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~~~~~~~p~G~l~~a~~~~~~~~~~~~~~~~~~~~~l~~sl~~~yr~~~~~ 198 (314) T protein:vir:41 119 EQTITSLLASGVTYDLECFFLHADSSLTTGRELYRINDGWMKLAGNQYTDAEPEDENWPLNLFDGMMDELDTRYLQLKPR 198 (314) T ss_pred HHHHHHHHHHHHHHHHHHHhhccccCCcCcccchhcchhhhhhcccceeecCccccccHHHHHHHHHHhcCchhhcCCCc Confidence 99999999999999998887765221 1111 112345666788888887643 235 Q ss_pred eEEEECHHHHHHHHhhhhhhhccccccCceeeeccceeecceeEEEcCCCcc---CceEEEEEecCCceEEEeecCCccc Q lcl|Aclame:pro 118 YVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAE---GSALMFKIVSNSPALKLVLKRGVQV 194 (231) Q Consensus 118 ~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~---~~~~~~~~~~~~~A~~~~~k~~v~v 194 (231) .+++||++++.++++...-. .+..+...+..|.-.+++|.||+.++.||. ++...+.. ...-+.+...+.+.+ T Consensus 199 ~~~~m~~~t~~~~r~~l~~~--~~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~i~fg--d~~nlv~~~~~~ir~ 274 (314) T protein:vir:41 199 MKFYVSNEIYNGYRKQLLVR--ETGLGDSALIGATGLQYDGIPIQYVPALDALGDDKARALLT--VPTNLVYGFWRNIRI 274 (314) T ss_pred eEEEecHHHHHHHHHHHhcc--CCcccchhhhCCCCceecceeeEecccccccCCCCceEEEe--chhheEEEeeceeEE Confidence 57999999999999865433 344555566667777899999999998874 22222222 233455566788999 Q ss_pred eeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 195 ETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 195 E~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) |.+|+...+...++.+.++++.+.++.++|+..+.=- T Consensus 275 ~~~~~a~~~~~~~~~~~r~d~~~~~~~aa~~~~~~~~ 311 (314) T protein:vir:41 275 EPKRDAAMRRTEYIASLRADCNYEDENAAVAAVIDMS 311 (314) T ss_pred eecccCcCCeEEEEEEEEeceEEEEcCcEEEEEeecc Confidence 9999999999999999999999998877776554433 No 158 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=99.56 E-value=1.1e-16 Score=108.05 Aligned_cols=219 Identities=13% Similarity=0.058 Sum_probs=151.0 Q ss_pred CCCcccCceEEeccc--cCCcccccCCCccC-ccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--IGDAADVAEGGEIS-LDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--igda~~v~EG~~i~-~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) =+-++.|....+|.- .+.+.++.|+++++ ..+.++++.++..++....+.+|.+...++..|+.+...+++++++++ T Consensus 110 a~v~~~~~~~~i~~~~~~~~a~W~~e~~~~~~~~~~~f~~i~l~~~kl~a~i~is~elL~Ds~~~le~~i~~~la~~~a~ 189 (381) T protein:vir:10 110 LGIKNAGLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAV 189 (381) T ss_pred eeeEecCcceEEEeecCCcceEEeecccccccccCccceeEeecceeEEeeccccHHHHhccHHHHHHHHHHHHHHHHHH Confidence 011222334567754 45667889988876 457899999999999998999999999999999999999999999999 Q ss_pred HHHHHHHHHhccc---------cc------------ccccccCHHHHHHHHHHhhc-------------c-CCCceEEEE Q lcl|Aclame:pro 78 KVDDDLLKAAKTT---------SQ------------TVSTKANVDGVQAALDIFND-------------E-DAQAYVLIV 122 (231) Q Consensus 78 ~vd~~~~~~l~t~---------~~------------~~~~~~~~d~i~da~~~l~~-------------~-~~~~~v~vv 122 (231) +++..++.+-.+. .. .....+++..+...+..+.. . .....+++| T Consensus 190 ~~~~afi~GdG~~qP~Gil~~~~~~~~~~~g~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vm 269 (381) T protein:vir:10 190 ALETAFLKGTGKDQPIGLNRQVQKGVSVTDGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVV 269 (381) T ss_pred HhhceeEecccCCCceeeeecCCccccccccccccccccccccccchhhHHHHHHHHHHhhhhhhccccccccCceEEEE Confidence 9999888652210 00 01112222222222211110 0 123457899 Q ss_pred CHHHHHHHHhhhhhhhccccccCceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccc--h Q lcl|Aclame:pro 123 NPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRD--I 200 (231) Q Consensus 123 ~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd--~ 200 (231) ||.+++.|++.+.+.+. .+..+ + .-.+|.+|+.|+.||+++.++.++ + .+.+..+.++.++..++ . T Consensus 270 n~~t~~~l~~~~~~~~~----~G~~v-~---~lp~g~~vv~~~~~p~~~i~fGDf--s--~Y~i~~r~~~~i~~~~~~~~ 337 (381) T protein:vir:10 270 NPSDAFEVQAQYTHLNA----NGVYV-T---ALPFNLNVIESTVQEAGKVLTYVK--G--LYDGYLAGGINVQKFKETLA 337 (381) T ss_pred chhhHHhhccccccCCC----CCcee-e---cCCCCceeEEcCCCCcCcEEEEEc--c--cEEEEEecccEEEeechhhh Confidence 99999999986644321 11111 1 123688999999999998665442 2 26677788888877654 4 Q ss_pred hhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 201 VTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 201 ~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) .++.+.+++..|++.++++|+++++++++.. T Consensus 338 ~~d~~~f~a~~r~dG~~~~~~A~~v~~l~~~ 368 (381) T protein:vir:10 338 LDDMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) T ss_pred hcCceEEEEEEEEcCEEecCCcEEEEEEeec Confidence 5677899999999999999999999877755 No 159 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=99.55 E-value=1.7e-15 Score=101.52 Aligned_cols=228 Identities=9% Similarity=0.072 Sum_probs=142.5 Q ss_pred CCCc--ccCceEEeccc-cCCcccccCCC-ccCccccccceeEEEeehc-cceeeecHHHHHhcCCC--HHHHHHHHHHH Q lcl|Aclame:pro 1 ENGI--NLANLCEYPND-IGDAADVAEGG-EISLDKIGTTTKSVTIKKA-AKGTEITDEAALSGYGD--PIGESNKQLGL 73 (231) Q Consensus 1 ~~~~--~~G~ti~~P~~-igda~~v~EG~-~i~~~~lt~~~~~~tikk~-g~~~~itD~~~~~~~~d--~~~~~~~~~a~ 73 (231) +..+ ++|++|+||+- .....++..+. -.....++.+..++++.|- +..|.|.+.++.++... .-....+++.. T Consensus 33 ~~~v~~~gg~tVkI~~i~~~gl~DY~R~~~g~~~g~~~~~~~t~~ldqdr~~~f~vD~~Dvdet~~~~~~a~v~~~~~~~ 112 (299) T protein:vir:79 33 NGRYRWTGSKTIEIPTISTTGRVDSNRDTIAVAQRNYDNAWEPKVLTNQRKWSTLVHPADINQTNYVASIGNITKVYNEE 112 (299) T ss_pred cceeeecCCCEEEEeccccccccccccCCCcccccccCcceeEEEeeccccceeccchhhHHHHhhhhHHHHHHHHHHHH Confidence 3333 57999999974 33445777654 4455678888999999985 44699996655554333 22334455666 Q ss_pred HHHHHHHHHHHHHhcccc---------cccccccCHHHHHHHHHHhhccCC--CceEEEECHHHHHHHHhhhhhhhcccc Q lcl|Aclame:pro 74 SLANKVDDDLLKAAKTTS---------QTVSTKANVDGVQAALDIFNDEDA--QAYVLIVNPKDAAKIRKDANAKNIGSE 142 (231) Q Consensus 74 ~ia~~vd~~~~~~l~t~~---------~~~~~~~~~d~i~da~~~l~~~~~--~~~v~vv~p~~~~~L~k~~~~~~~~~~ 142 (231) .++-.+|...++.|-+.. .+.+...-|+.|.++..+|.+.+. +++|++|+|+.+..|.++++|...... T Consensus 113 ~v~pEiDay~~skl~~~a~~~g~~~~~~~~T~~n~y~~i~~~~~~lde~~vP~~~rvl~vtp~~~~~L~~~~~f~k~~~~ 192 (299) T protein:vir:79 113 QKFPEMDAYCISKIYADWTALGNTADTTVLTTTNVLEVFDKLMEKMTEARVPENGRILYVTPVVNTLIKNAKEIQRTVNI 192 (299) T ss_pred HhhhHhhHHHHHHHHHhhhhcCCcccccccCHHHHHHHHHHHHHHHHhcCCCCCCeEEEeCHHHHHHHhhchhhhccccc Confidence 777788887666542111 122344568999999999988764 689999999999999999887644333 Q ss_pred ccCceeeeccceeecceeEEE--cCCCcc------C-----ce--EEEEEecCCceEEEeecCCccceeccchhhcc--- Q lcl|Aclame:pro 143 VGANALINGTYADVLGAQIVR--SKKLAE------G-----SA--LMFKIVSNSPALKLVLKRGVQVETDRDIVTKT--- 204 (231) Q Consensus 143 ~~~~~~~~G~ig~~~G~~Vv~--s~~~~~------~-----~~--~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~--- 204 (231) .......+|.+|++.|++|+. |+.++. | .+ +-+ ++..+.|..-..|-+ .+.-+. |...+ T Consensus 193 ~~~~~~~~g~Vg~idG~~Ii~Vps~r~~t~~~~~~G~~~~~~ak~in~-ii~~~~a~~~~~K~~-~~~~~~-P~~~~~~~ 269 (299) T protein:vir:79 193 KDAGTSLNRQTTDIDTVKIIKVPSNLMKTAYDFTTGWKVGAGAKQIFM-SLVHPSAIITPVSYQ-FSKLDE-PTAVTEGK 269 (299) T ss_pred ccccceeeeeeeeecceEEEEechhhcCccceeccCccccCcccccce-EEEcCCeeeeeEeee-eEEeec-CCCCCccc Confidence 334467899999999999987 455542 1 11 112 222345554333333 222222 22222 Q ss_pred cEEEEEEEEEEEEEcCC-cEEEEEeccC Q lcl|Aclame:pro 205 TVITADEHYAAYLYDLT-KVVNITFTGV 231 (231) Q Consensus 205 ~~i~~~~~y~~~~~~~~-~vv~l~~~~~ 231 (231) ..+..+.|+.+-+++.. +.|.+.+++- T Consensus 270 ~~~~~r~y~d~~v~~nk~~~i~~~~~~a 297 (299) T protein:vir:79 270 YFYFEESFEDVFILNKKADAIQFVVEGA 297 (299) T ss_pred eeeeeeeeeeeeeeccccCeEEEEeeec Confidence 25566777777777543 3444555555 No 160 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=99.53 E-value=8.2e-17 Score=108.79 Aligned_cols=176 Identities=16% Similarity=0.132 Sum_probs=117.7 Q ss_pred eehccc-eeeecHHHHHhcCCCHHHHHHHHHHHHHHHHHHHHHHHHhcccc----------------cc----cccccCH Q lcl|Aclame:pro 42 IKKAAK-GTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLKAAKTTS----------------QT----VSTKANV 100 (231) Q Consensus 42 ikk~g~-~~~itD~~~~~~~~d~~~~~~~~~a~~ia~~vd~~~~~~l~t~~----------------~~----~~~~~~~ 100 (231) |+.--. .+.|.|.+..|+..|+++++++|++++||+..|+.++..+..+. .. .+....| T Consensus 1 iD~lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~~g~~~~~~a~~t~~~~~l~ 80 (221) T protein:vir:17 1 MDDLLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQDGGFSVNIGAGNTNNAQAIV 80 (221) T ss_pred CCcchhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccccccCcceeccccccCCHHHHH Confidence 565433 48899999999999999999999999999999999876553221 00 1112347 Q ss_pred HHHHHHHHHhhccC--CCceEEEECHHHHHHHHh--hhhhhhccccccCceeeec-cceeecceeEEEcCCCccCceEEE Q lcl|Aclame:pro 101 DGVQAALDIFNDED--AQAYVLIVNPKDAAKIRK--DANAKNIGSEVGANALING-TYADVLGAQIVRSKKLAEGSALMF 175 (231) Q Consensus 101 d~i~da~~~l~~~~--~~~~v~vv~p~~~~~L~k--~~~~~~~~~~~~~~~~~~G-~ig~~~G~~Vv~s~~~~~~~~~~~ 175 (231) +.|.++.++|.+.+ ...+|++++|..|+.|++ ++.+...........+.+| .+++++|++|+.||++|...+..+ T Consensus 81 dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~~~~g~~i~~v~G~~V~~SnnlP~~~gt~~ 160 (221) T protein:vir:17 81 DGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNREIGNTQGDMNTGKGLYVNAGIRIYKSNVLASLYGTNL 160 (221) T ss_pred HHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeeeecccccccccccceeeeecCcEEEEeccCCccccccc Confidence 88899999998766 478999999998888886 3444333223333457777 699999999999999997544322 Q ss_pred EEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 176 KIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 176 ~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) . .+|..+.. ..-..|.+|-. +++.+ ..+..|+++-+++.=|- T Consensus 161 ~----~~ag~~~~-~~~~~~~yr~~--fs~~~-------glv~~~~Avgtvkl~~~ 202 (221) T protein:vir:17 161 V----TDPGDATT-SGENNGSYRPA--ITDRA-------GLVFHKEAADTVEVLLP 202 (221) T ss_pred c----cCCccccc-ccccccccccc--ccceE-------EEEEcchheeeeeeecC Confidence 2 22222211 11122333322 22222 45667778777776666 No 161 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=99.52 E-value=3.1e-15 Score=100.14 Aligned_cols=229 Identities=13% Similarity=0.021 Sum_probs=159.5 Q ss_pred CCCcccCceEEeccc-cCCcccccCCCccCccccccceeEEEeehc-cceeeec--HHHHHhcCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND-IGDAADVAEGGEISLDKIGTTTKSVTIKKA-AKGTEIT--DEAALSGYGDPIGESNKQLGLSLA 76 (231) Q Consensus 1 ~~~~~~G~ti~~P~~-igda~~v~EG~~i~~~~lt~~~~~~tikk~-g~~~~it--D~~~~~~~~d~~~~~~~~~a~~ia 76 (231) .---++|++|+||+- .....++..+..+...+++.+..+.++.|- +..|.|. |++..+....+.....+++.+.++ T Consensus 30 ~~~~~ggktVkI~~i~~~gl~DY~R~~g~~~g~v~~~~et~tl~qdR~~~F~vD~~DvDEt~~~~~~~nv~~ef~~~~v~ 109 (290) T protein:vir:78 30 NLLWLDAKTFKIQTITTTGLKAHTRNKGYNEGSASNTNKSYTIDFDRDVEFFVDVMDVDETGQALSAANVTKEFNSRHAG 109 (290) T ss_pred ceeeccCCEEEEeeeccCcccccccCCCcccCccccceeeEEeeccccceeeccccchhHHhhhhhHHHHHHHHHHHHhh Confidence 112358999999974 455668888888888888888888998884 5579998 877777777888888889999999 Q ss_pred HHHHHHHHHHhcccc--------cccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccc-cccCce Q lcl|Aclame:pro 77 NKVDDDLLKAAKTTS--------QTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGS-EVGANA 147 (231) Q Consensus 77 ~~vd~~~~~~l~t~~--------~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~-~~~~~~ 147 (231) -.+|...++.|-+.. .+.+...-|+.|.++..+|.+.+.+++|++|+|+.+..|.++++|.-... ...... T Consensus 110 PEiDayr~skla~~a~~~~~~~~~t~t~~n~~~~i~~~~~~ldevp~~~rvl~vtp~~~~lL~~~~~f~r~~~~~~~~~~ 189 (290) T protein:vir:78 110 PEMDAYRFSKLATAAKTNSNSVAEEITKDNVFTKLKAAIRKVKKYGTQNLVMYVSPDVMAALELSDDFVRAINVQNIGPS 189 (290) T ss_pred hhhhHHHHHHHHhhhhccCcccccccCHHHHHHHHHHHHHHHHhcCCCCeEEEECHHHHHHHhhChhhhccccccccccc Confidence 999988776553221 12234456888999999998877889999999999999999888864221 111223 Q ss_pred eeeccceeecceeEEEcCC---C-----------cc--CceEEEEEecCCceEEEeecCCccceec---cchhhcccEEE Q lcl|Aclame:pro 148 LINGTYADVLGAQIVRSKK---L-----------AE--GSALMFKIVSNSPALKLVLKRGVQVETD---RDIVTKTTVIT 208 (231) Q Consensus 148 ~~~G~ig~~~G~~Vv~s~~---~-----------~~--~~~~~~~~~~~~~A~~~~~k~~v~vE~~---Rd~~~~~~~i~ 208 (231) ..+|.++++.|++|+..+. + +. ++.+-+.++ .+.|..-..|.+ .+.-+ .........+. T Consensus 190 ~i~~~V~~idG~~ii~vps~~r~~t~~~f~~G~~~~~~ak~in~ii~-~~~a~i~~~K~~-~~~~~~P~~~~~~d~~~~~ 267 (290) T protein:vir:78 190 SIETRITAIDGTRIVEVEAEDRFYDTFDFTDGYKPAAGAKKLNFLLV-NKGSVVGGAKHA-SIYLHAPGSVGQGDGWLYQ 267 (290) T ss_pred cccceeeeecCcEEEEecccchhhhhhhhcccccccCCccceeEEEE-cCCceeeeeeee-EEEeeCCCCCcCcceeeee Confidence 3589999999999987541 1 11 222323222 244554444433 22222 22233456999 Q ss_pred EEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 209 ADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 209 ~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) .+.|+.+-+++-.+-.+..=.+| T Consensus 268 ~r~y~d~~v~~nk~~~i~~~~~~ 290 (290) T protein:vir:78 268 YRVYHDIFVLDQQKDGVIASTEV 290 (290) T ss_pred eeeeeeeeeeccccCeeEEEeeC Confidence 99999999998877777666777 No 162 >protein:vir:96792 Length: 315 # NCBI annotation: major capsid protein # Family: family:all:47 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224246;genbank:gi:62362381;genbank:GeneID:3345731 Probab=99.51 E-value=3.7e-15 Score=99.72 Aligned_cols=221 Identities=14% Similarity=0.056 Sum_probs=137.4 Q ss_pred CCCcccCceEEeccc-cCCcc---cccCCCccCccccccceeEEEeehccce-eeecHHHHHhcCCCHHHHHH---HHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND-IGDAA---DVAEGGEISLDKIGTTTKSVTIKKAAKG-TEITDEAALSGYGDPIGESN---KQLG 72 (231) Q Consensus 1 ~~~~~~G~ti~~P~~-igda~---~v~EG~~i~~~~lt~~~~~~tikk~g~~-~~itD~~~~~~~~d~~~~~~---~~~a 72 (231) -|..--||-.+.|.| ||+.. ++--..++++.+|+..+.......++.. ++.+.........||+.... ++++ T Consensus 42 ~~~~~~GDf~~~~ff~i~~~~~~rnv~~~~~~t~~kit~~~dvaVk~~~~~~~~~~~~~~~a~~g~dp~~~~~~i~~~~~ 121 (315) T protein:vir:96 42 NSELIEGDLKLRSFYKVGGAIADRDVNSTATVAGTKIAADEMVSVKVPWKYGPYETTEEAFKRRARSPEEFSMLIGQDMA 121 (315) T ss_pred cccccccccccccccccccchhhcccCCCccccceecccccceeEEEeecCCchhccHHHHHHhhcCHHHHHHHHHHHHH Confidence 233334888888888 66642 4555667999999998876654444543 66676666666778875544 3333 Q ss_pred HHHHHHHHHHHHHHhcc----cc----cccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhcccccc Q lcl|Aclame:pro 73 LSLANKVDDDLLKAAKT----TS----QTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVG 144 (231) Q Consensus 73 ~~ia~~vd~~~~~~l~t----~~----~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~ 144 (231) .++-+.+-+..++++.. .+ ...+...+...+.+|.++|||.......++|||.++..|.+. .....-.... T Consensus 122 ~~~l~~~l~~~l~~~~aai~~~t~~~~~~~~a~~~~~~l~dA~~klGD~~~~l~~~vMHS~v~~~L~~q-~L~~~~~~~~ 200 (315) T protein:vir:96 122 DATMAGWIGYALNALQGAIGSNAGMNVSGELATEGKKVLTKGLRTMGDKASSIAIWVMDSTSYFDIVDE-AIDNKLYEEA 200 (315) T ss_pred HHHHHHHHHHHHhhhhhhhcccccccccccccccCHHHHHHHHHHhcccccCeeEEEEchHHHHHHHHh-hhhhhccccc Confidence 33333332323333321 11 112345788999999999999999999999999999999994 3333222223 Q ss_pred CceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCC---ccceeccchhhcccEEEEEEEEEEEEEcCC Q lcl|Aclame:pro 145 ANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRG---VQVETDRDIVTKTTVITADEHYAAYLYDLT 221 (231) Q Consensus 145 ~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~---v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~ 221 (231) +.+++.+. ..++|.||+++|.||.... +...+||+++...++ +..|..|+ .+++...++-..+.+.|. T Consensus 201 ~~~~~~~~-~~~lGkrViVdD~~P~~~~----~gl~~GAi~~~~~~~~~~~~~~~~g~----e~l~~~~r~e~tf~l~p~ 271 (315) T protein:vir:96 201 GVVVYGGT-PGTLGKPVLVTDQCPATKI----FGLVAGAVMITESQAPGMRSYQIDDQ----ENLAIGFRAEGTANVEVL 271 (315) T ss_pred ceeEecCc-CcccccEEEEECCCCccee----eeeecceeeecCCCccccccccCCCc----ceeEEEEeeeeEeeeeee Confidence 33333333 3466999999999997543 334589998877666 33455444 555665555445566665 Q ss_pred cEEEEEeccC Q lcl|Aclame:pro 222 KVVNITFTGV 231 (231) Q Consensus 222 ~vv~l~~~~~ 231 (231) ++---+.++. T Consensus 272 G~sw~~~~~~ 281 (315) T protein:vir:96 272 GYKWKTKTNV 281 (315) T ss_pred eEEeecCCCc Confidence 5544332222 No 163 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=99.51 E-value=3.1e-15 Score=100.15 Aligned_cols=224 Identities=12% Similarity=0.058 Sum_probs=158.5 Q ss_pred CCCcc---------cCceEEeccc-cC-----CcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcC--CCH Q lcl|Aclame:pro 1 ENGIN---------LANLCEYPND-IG-----DAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGY--GDP 63 (231) Q Consensus 1 ~~~~~---------~G~ti~~P~~-ig-----da~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~--~d~ 63 (231) ++.++ .+.+.++++- .| ...+.+|+++.+..+.++++.++..++....+.|+++...++. .|+ T Consensus 44 ~s~~l~~~~vi~~~~~~~~~i~~~g~~~~~~~g~~~~~~~~~~~~~~~~f~~~~l~~~~l~~~~~it~elL~D~~~~~~~ 123 (315) T protein:vir:41 44 SAVIIPEARIDNALKSYEKDISRLSLVLDVGPGRDETGQKLAPPESTAEVKTNTLYMREMVTKVVIHEDAIEDNIEGKAF 123 (315) T ss_pred hhhhhhhceeeeccccccccccccccCcccccccccccCcCCCCCCccccceeeeceeeeeeeccccHHHHHhhhccccH Confidence 11111 1233334321 11 1235567777888889999999999999888999999998874 699 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccc---------------ccc--------cccccCHHHHHHHHHHhhccC---CCc Q lcl|Aclame:pro 64 IGESNKQLGLSLANKVDDDLLKAAKTT---------------SQT--------VSTKANVDGVQAALDIFNDED---AQA 117 (231) Q Consensus 64 ~~~~~~~~a~~ia~~vd~~~~~~l~t~---------------~~~--------~~~~~~~d~i~da~~~l~~~~---~~~ 117 (231) .+...+++++++++..+..++++-.++ ... .+...+.+.+.+++..|.... .+. T Consensus 124 e~~l~~~~a~~~a~~~~~~~~nGdg~s~~p~~~~~~G~l~~a~~~~~~~~~~~~a~~~~~d~l~~l~~sl~~~yr~~~~~ 203 (315) T protein:vir:41 124 EQKIVTLLGEGISYVLEKYYLHGDTSSSDPLLRMSDGWLKLASEKLTESDVDPEAEDWPMNLFDTMIESLPTPYRNNLPN 203 (315) T ss_pred HHHHHHHHHHHHHHHHHHHhhccCCcCcCccccccccceecccccccccccccccccccHHHHHHHHHhcChHHhhcCCc Confidence 999999999999999999888762211 000 112235677888888886533 245 Q ss_pred eEEEECHHHHHHHHhhhhhhhccccccCceeeeccceeecceeEEEcCCCcc---CceEEEEEecCCceEEEeecCCccc Q lcl|Aclame:pro 118 YVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAE---GSALMFKIVSNSPALKLVLKRGVQV 194 (231) Q Consensus 118 ~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~---~~~~~~~~~~~~~A~~~~~k~~v~v 194 (231) .+++||+.++.++||.++.. +++.....+..|.-.+++|.||+..+.||+ +....+....+ -+.+...+.+.+ T Consensus 204 ~~~imn~~t~~~~rklk~~~--g~~lw~~~~~~g~~~tl~G~PV~~~~~m~~~~~~~~~ilf~d~~--nl~~~~~~~i~i 279 (315) T protein:vir:41 204 MKFYVTWDIYRAYRDALKGR--ETGLGDQALTGANSILYDGRPVQYVPALEALNDGKSRALFVVPT--QLVYGFWRNIKV 279 (315) T ss_pred eEEEEcHHHHHHHHHHhccC--CCccccchhhcCCCceecccceEecccccccCCCCccEEEeccc--ceEEEeccccEE Confidence 68999999999999987643 445566666778888999999999999975 22222222222 244566688999 Q ss_pred eeccchhhcccEEEEEEEEEEEEEcCCc--EEEEEe Q lcl|Aclame:pro 195 ETDRDIVTKTTVITADEHYAAYLYDLTK--VVNITF 228 (231) Q Consensus 195 E~~Rd~~~~~~~i~~~~~y~~~~~~~~~--vv~l~~ 228 (231) |.+|+.......++.+.+.++.+.++++ +..+++ T Consensus 280 ~~~~~a~~~~~~~~~~~r~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 280 VPDYDAEMRLTKYVASLRTDNHYEDEEGAVSATITV 315 (315) T ss_pred EeeecCCCCceEEEEEEEeceeEEeccceeEeeeeC Confidence 9999999999999999999998887776 444555 No 164 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=99.49 E-value=4.1e-16 Score=104.94 Aligned_cols=222 Identities=14% Similarity=0.106 Sum_probs=161.3 Q ss_pred CCCcccCceEEeccccCCccc---------ccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGDAAD---------VAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQL 71 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igda~~---------v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~ 71 (231) -+-- .|.|++.|--..+.++ =-||++++..||+++..++.||.+|....+|.+++..|....++-.+|-+ T Consensus 167 tLP~-~g~T~eY~v~t~~~tV~~q~~~~kqa~EGd~L~~gKl~~~t~tA~ikTyGGyt~LSRQ~IERs~v~~L~~~lraL 245 (410) T protein:vir:83 167 TLPL-NNATFYRPIVSQRPAVGLQGVAGGASDEKTELDSQKMVIDRLTVNAKTLGGYVNVSRQAIDFSSPSALDLVVNGL 245 (410) T ss_pred hCCC-CCCeeEEeeecccccccccccccccccccccccccceeeeeccceeehhcCcccccceeeecCChhhHHHHHHHH Confidence 1001 1678888633333321 13999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhccc------ccccccccCHHHHHHHHHHhhcc--CCCceEEEECHHHHHHHHhh---hhhhhcc Q lcl|Aclame:pro 72 GLSLANKVDDDLLKAAKTT------SQTVSTKANVDGVQAALDIFNDE--DAQAYVLIVNPKDAAKIRKD---ANAKNIG 140 (231) Q Consensus 72 a~~ia~~vd~~~~~~l~t~------~~~~~~~~~~d~i~da~~~l~~~--~~~~~v~vv~p~~~~~L~k~---~~~~~~~ 140 (231) +.+-|+...+..-+.|..+ ....+...+...|.|+..++.+. +....++.++|+++.++.+- .+-.+.. T Consensus 246 ~~AYA~atea~vra~L~~t~t~~~a~~~~Tad~~~~~i~da~~~v~da~~~~~~~~i~vS~DVl~~~~~~f~~~~~~~~d 325 (410) T protein:vir:83 246 GQQYAIETEALVGAALASTSTGAVGYGNATADNVASAIWQAAGAVYTAVKGMGRLVIAIAPDVLGDFGPLFAPVNPTNAH 325 (410) T ss_pred HHHHHHHHHHHHHHHHHHhhhhhhhhhhccHHHHHHHHHHHHHHHhhhhccceeeeEEechhhhhhccceeeccCCCCcc Confidence 7776666665444433221 12234445667788988999886 67888999999998766542 1111111 Q ss_pred cc-ccCceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCC-ccceeccchhhcccEEEEEEEEEEEEE Q lcl|Aclame:pro 141 SE-VGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRG-VQVETDRDIVTKTTVITADEHYAAYLY 218 (231) Q Consensus 141 ~~-~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~-v~vE~~Rd~~~~~~~i~~~~~y~~~~~ 218 (231) +. .+.+-+..|.-|.++|+||+++++.+++++++++ +-|+..+.... ..--++-++..-...++ -||+.... T Consensus 326 t~Gfg~~~lg~gi~G~~~~ipVvm~~~a~AgTA~f~~----~~Ai~~~eS~~gp~qL~d~~i~nLt~~yS--gY~a~a~~ 399 (410) T protein:vir:83 326 STGFEAGRFGQGVMGSISGIPVVMSAALGSGDAYLFS----TAAIECFEQRVGTLQVVEPSVFGLQVAYA--GYFSTLVV 399 (410) T ss_pred cccccccccccchhhhhcccceEEecCCCcCeeeEec----cceeeeeecCCceeEeeCCchhhhhhhhe--eeeeeccc Confidence 11 1344455788899999999999999999998775 88999998773 33345666766666666 57788888 Q ss_pred cCCcEEEEEec Q lcl|Aclame:pro 219 DLTKVVNITFT 229 (231) Q Consensus 219 ~~~~vv~l~~~ 229 (231) +|.+++-+.-. T Consensus 400 ~~~gliPv~g~ 410 (410) T protein:vir:83 400 NEDAIVPLVGS 410 (410) T ss_pred cccceeeeccC Confidence 99999998655 No 165 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=99.48 E-value=5.9e-16 Score=104.09 Aligned_cols=216 Identities=14% Similarity=0.102 Sum_probs=146.9 Q ss_pred CCCcccCceEEeccc--cCCcccccCCCccC-ccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--IGDAADVAEGGEIS-LDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--igda~~v~EG~~i~-~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~ 77 (231) =+-++.|...++|.. .+.+.+++|+.+++ ..+.++++.++..++.+..+.||.+...++..|+.+...++++++|++ T Consensus 117 ~~v~~~~~~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~i~is~ell~Ds~~~ie~~i~~~l~~~~a~ 196 (383) T protein:vir:78 117 IGMRTTGLRTKFLKSETSGVAVWGKIFGEIKGQLDATFSDEESIQNKLTAFVVVPKDLEKFGPAWVKRFVVTQIEEAFAV 196 (383) T ss_pred eeeEecCCceEEEEEcCCcceEEeecccccccccCcceeeEeecceeeEeeccchHHHhhccHHHHHHHHHHHHHHHHHH Confidence 111222233578865 45667899988875 557999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhccc------------c---------cccccccCHHHHHHHHHHhhcc---------C-----CCceEEEE Q lcl|Aclame:pro 78 KVDDDLLKAAKTT------------S---------QTVSTKANVDGVQAALDIFNDE---------D-----AQAYVLIV 122 (231) Q Consensus 78 ~vd~~~~~~l~t~------------~---------~~~~~~~~~d~i~da~~~l~~~---------~-----~~~~v~vv 122 (231) ++|..++.+-.+. . ...+..++++++......+..- . ...-.+++ T Consensus 197 ~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 276 (383) T protein:vir:78 197 ALESAYIVGDGNDKPIGLNRKVGKGSTVVDGVYAEKAATGTLTFANPKTTVNELTDVYKYHSVKENGHPLNVAGKVTLLV 276 (383) T ss_pred HHhhheEeccCCCCceeeeeccCCcccccccccccccccchhhhhhhHHHHHHHHHHHhccchhcccchhhhcCceEEEE Confidence 9999988642210 0 0112223444443333332210 0 11234788 Q ss_pred CHHHHHHHHhhhhhhhccccccCceeeeccceeecc--eeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccch Q lcl|Aclame:pro 123 NPKDAAKIRKDANAKNIGSEVGANALINGTYADVLG--AQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDI 200 (231) Q Consensus 123 ~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G--~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~ 200 (231) ||..++.+....... ..+|...+++| ++|+.|+.||+++.+..+ ++ .+.++.+.++.++..++. T Consensus 277 n~~~~~~~~~~~~~~----------~~~G~~~t~l~~~~~iv~s~~~p~~~iifgd--fs--~Y~i~~r~~~~i~~~~~~ 342 (383) T protein:vir:78 277 NPTDAWDVKKQYTSL----------NANGVYVTALPFNLNIIESLFVPEKKAISYV--AE--RYDALIGGPLDIGTYDQT 342 (383) T ss_pred cCcchhhhccchhcc----------CCCCceeeecCCCceEEecCCCCcccEEEee--cc--ceEEEecccceEEecchh Confidence 887776554321111 12344445554 558889999999865444 22 366778888888876543 Q ss_pred --hhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 201 --VTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 201 --~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) .++.+.+++..|++.+++||+++++++++ + T Consensus 343 ~f~~d~~~f~~~~r~dG~~~~~~A~~vl~~~-~ 374 (383) T protein:vir:78 343 LAIEDLNLYAAKQFAYGKAKDDKAAAVWTLN-I 374 (383) T ss_pred hhhcCceEEEEEEEEcCEEecCCeEEEEEEE-e Confidence 55678999999999999999998887776 5 No 166 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=99.36 E-value=4.1e-14 Score=93.98 Aligned_cols=231 Identities=13% Similarity=0.072 Sum_probs=154.0 Q ss_pred CCCcccCceEEeccccCCcc---cccCCCccCcc----------------------------------ccccceeEEEee Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGDAA---DVAEGGEISLD----------------------------------KIGTTTKSVTIK 43 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igda~---~v~EG~~i~~~----------------------------------~lt~~~~~~tik 43 (231) ..--|.|.||+|.+|.--+. .+.||...... +++-.+...+++ T Consensus 50 piPkn~GkTIk~r~y~pl~~~~~pl~eGv~a~G~~~~~g~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~ 129 (401) T protein:vir:95 50 NMPKHYGKTIKVYEYVPLLDDRNINDQGIDASGATIVNGNLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIH 129 (401) T ss_pred ccccccCCeEEEEecccccccccchhcCCCcccccccCccccccccccceeecccccccccccccccccceeeeeeeeee Confidence 56678999999999865444 56777643332 334455777899 Q ss_pred hccceeeecHHHHHhcCCCHHHHH--HHHHHH---HHHHHHHHHHHHHhc------c--------cccccccccCHHHHH Q lcl|Aclame:pro 44 KAAKGTEITDEAALSGYGDPIGES--NKQLGL---SLANKVDDDLLKAAK------T--------TSQTVSTKANVDGVQ 104 (231) Q Consensus 44 k~g~~~~itD~~~~~~~~d~~~~~--~~~~a~---~ia~~vd~~~~~~l~------t--------~~~~~~~~~~~d~i~ 104 (231) |||.+.++||+.......+.+-++ .+.++- ..-+.+..++++... + ....+.+..+++.+. T Consensus 130 qyG~~~e~Td~~~dt~~D~~l~~h~s~ell~g~~~~t~d~i~~dll~ag~~viyAg~ats~At~~~~~~~~t~vt~~~l~ 209 (401) T protein:vir:95 130 KFGFFYEFTQESIDFDSDDGLMEHLSRELMNGATQITEAVLQKDLLAAAGTVLYAGAATSDATITGEGSTPSVVSYKNLM 209 (401) T ss_pred eccCccchhhhhhhhhcchHHHHHHHHHHhhhhhhhHHHHHHHHHHhhcCeeecCCccceeeeccccccccceechhHHH Confidence 999999999998888777776654 222222 223455667776541 1 122344567899999 Q ss_pred HHHHHhhccC-------------------CCceEEEECH------HHHHHHHhhhhhhhccccccCceeeeccceeecce Q lcl|Aclame:pro 105 AALDIFNDED-------------------AQAYVLIVNP------KDAAKIRKDANAKNIGSEVGANALINGTYADVLGA 159 (231) Q Consensus 105 da~~~l~~~~-------------------~~~~v~vv~p------~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~ 159 (231) ++...|.++. ...++.+||| ..+++|..++.|.....+...+-+.+|+||.+-++ T Consensus 210 rl~~~L~~nRapk~t~~i~~s~~~dTk~i~~s~va~~h~~L~~di~a~~D~~~~~~fi~v~kYa~~~~i~~gEiG~i~~v 289 (401) T protein:vir:95 210 RLDQILTENRTPTQTTIITGSRMIDTKVIGATRVMYVGSELVPELKAMKDLFGNKAFIETQHYADAGTIMNGEVGSIDKF 289 (401) T ss_pred HHHHHHHhcccccchhhhhhhhccCccccccceEEEEecCchhHHHHHHHhcCCCCceehhhcCCccccccccccccCce Confidence 9988886521 1246899999 55677778899999999999999999999999999 Q ss_pred eEEEcCCCc--------cCce----------------EEEEEecCCceEEEee--------------cCCccceecc-ch Q lcl|Aclame:pro 160 QIVRSKKLA--------EGSA----------------LMFKIVSNSPALKLVL--------------KRGVQVETDR-DI 200 (231) Q Consensus 160 ~Vv~s~~~~--------~~~~----------------~~~~~~~~~~A~~~~~--------------k~~v~vE~~R-d~ 200 (231) |+++++.+. .+.. ++...+.+..|++... |++=.=-.+| |+ T Consensus 290 R~i~~p~~~~w~~ag~~a~~~~~~y~~~~~~~gg~~dVyp~lV~G~dAf~~~~l~g~g~~~~~~~ivk~pG~~~ad~~DP 369 (401) T protein:vir:95 290 RIIQVPEMLHWAGAGAQATGANPGYRTSMVSGQEHYDVYPMLVVGDDSFTSIGFQTDGKSLKFTVMTKMPGKETADRNDP 369 (401) T ss_pred eEEecccceeecCCcccccccccccccccccCCCcceeeeeeEEccccceecccccCCccccceeEeecCCcCCCCCCCc Confidence 999999743 2111 2222233344444421 1110001244 44 Q ss_pred hhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 201 VTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 201 ~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) .-+.=.+.--.+|+..+++|+-.++|.-.+- T Consensus 370 lgQ~g~vgwK~~~a~~vL~~e~m~~ies~a~ 400 (401) T protein:vir:95 370 YGETGFSSIKWYYGILVKRPERLALIKTVAP 400 (401) T ss_pred ccceehhhhhhhhhhheeccceeEEEEeecC Confidence 5555555556679999999999999977776 No 167 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=99.30 E-value=4.5e-13 Score=88.28 Aligned_cols=223 Identities=13% Similarity=0.107 Sum_probs=148.2 Q ss_pred CCCcccCceEEeccc-cCC-cccc-cCCC-ccCccccccceeEEEeehccceeeecHHHHHhc--CCCHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND-IGD-AADV-AEGG-EISLDKIGTTTKSVTIKKAAKGTEITDEAALSG--YGDPIGESNKQLGLS 74 (231) Q Consensus 1 ~~~~~~G~ti~~P~~-igd-a~~v-~EG~-~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~--~~d~~~~~~~~~a~~ 74 (231) .. + ....-++|++ .|. +... .||. ..+..++++++.++..++....+.||++.+.++ ..|+.+...++++++ T Consensus 55 ~~-v-~~~~~~i~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~ 132 (321) T protein:vir:31 55 ET-V-GAKKTRIPTLNIGERHRRPQDEGEWNENESDVSTGTIDISTEKATVAWDLPREVVQENPEGEALADRILNLMTDA 132 (321) T ss_pred ee-c-cCcceeeeeeccCCcccccccccccccccccceeeeeeeeeEEEEeehhccHHHHHhhhcchhHHHHHHHHHHHH Confidence 11 1 1112345554 222 2222 2443 456678899999999999999999999988776 469999999999999 Q ss_pred HHHHHHHHHHHHhcccc---------------------cccccccCHHHHHHHHHHhhccC--CCceEEEECHHHHHHHH Q lcl|Aclame:pro 75 LANKVDDDLLKAAKTTS---------------------QTVSTKANVDGVQAALDIFNDED--AQAYVLIVNPKDAAKIR 131 (231) Q Consensus 75 ia~~vd~~~~~~l~t~~---------------------~~~~~~~~~d~i~da~~~l~~~~--~~~~v~vv~p~~~~~L~ 131 (231) ++..++..++++-..+. ...+..+++|.+.+++..|.+.. ...-+++||++.+.+++ T Consensus 133 ~a~~~~~~~~nGd~~~~~~~~~~n~G~l~~a~~~~~~~~~~~~~~~~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~ 212 (321) T protein:vir:31 133 WSADVEDLAANGDEDAEDSFENQNDGFITVAEGDVETIDAADDILDNDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYH 212 (321) T ss_pred HHHHHHhheeeccccCCCcccccchhhhhhhccccccccccccccCHHHHHHHHHhccHhHhcCCCeEEEechHHHHHHH Confidence 99999988776522110 01123467899999999986543 23457999999988776 Q ss_pred hhhhhhhccccccCceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchhh---cccEEE Q lcl|Aclame:pro 132 KDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVT---KTTVIT 208 (231) Q Consensus 132 k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~---~~~~i~ 208 (231) +-.. +..+......+.+|...+++|+||+.+++||++....... .-+.+...+++.++..|+... +.+-++ T Consensus 213 ~~l~--~~~~~~~~~~l~~~~~~tl~G~pvv~~~~mP~~~il~t~~----~nl~~~~~~~~~~~~~~~~~~~~~~~~~~~ 286 (321) T protein:vir:31 213 YTLT--DRDTPLGDNVIMGEADVNPFSFPIIGSGLWPDDKAMFTDP----QNLIYALYRDLEIDVLTESDKVSERDLHAR 286 (321) T ss_pred HHHh--cCCCccccchhhccccccccceeEEEcCCCCCCcEEEecc----ccEEEEEeeccEEEEeecCccccccceeeE Confidence 5322 2233445556667777789999999999999987655442 223344456777776666443 223333 Q ss_pred --EEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 209 --ADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 209 --~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) .+..+++.+-++++++.++--.+ T Consensus 287 ~~~~~~~~~~ve~~~a~a~~~~i~~ 311 (321) T protein:vir:31 287 YFMRGDDDFAIENTEAVVLAEGLGD 311 (321) T ss_pred eeeeeecceeEeccccEEEEecCCc Confidence 34457888889999998874334 No 168 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=99.28 E-value=2.6e-13 Score=89.57 Aligned_cols=224 Identities=15% Similarity=0.068 Sum_probs=141.5 Q ss_pred CCCc------ccCceEEecccc--CCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCC----HHHHHH Q lcl|Aclame:pro 1 ENGI------NLANLCEYPNDI--GDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGD----PIGESN 68 (231) Q Consensus 1 ~~~~------~~G~ti~~P~~i--gda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d----~~~~~~ 68 (231) .+.+ ++.....+|... +.+..+.||+..|.+++++.+.++.+++.+..+++|.+.+.++..| +.+... T Consensus 266 ~~~i~~~~~~~~i~~~~~~~~~~~~~a~~~~eG~~kp~s~~tf~~~~~~~~~ia~~~~~S~qll~Ds~~dd~~~l~s~i~ 345 (517) T protein:vir:97 266 EGSLLPFIRHENLPTLVVGGDNALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVM 345 (517) T ss_pred hccceeeeeeccccceeeecccccceeeeeecCCcccccccceeeEEeeHhhhhhhhhhhHHHHHHhhhccHHHHHHHHH Confidence 1111 111223333221 2345789999999999999999999999999999999999988877 788899 Q ss_pred HHHHHHHHHHHHHHHHHHhccccc---------c--cccccCHHHHHHHHHHhhccC--CCceEEEECHHHHHHHHhhhh Q lcl|Aclame:pro 69 KQLGLSLANKVDDDLLKAAKTTSQ---------T--VSTKANVDGVQAALDIFNDED--AQAYVLIVNPKDAAKIRKDAN 135 (231) Q Consensus 69 ~~~a~~ia~~vd~~~~~~l~t~~~---------~--~~~~~~~d~i~da~~~l~~~~--~~~~v~vv~p~~~~~L~k~~~ 135 (231) +++++.++++.+..++.+-.+... . ..+....+.+.+.+..|.... ....+++|||.++..|++.++ T Consensus 346 ~~l~~~l~~~ee~a~l~GdGtg~~~~gi~~~a~~~~~~~~~~~~~~~d~i~~l~~a~~~a~~a~~vmn~~t~~~I~klKD 425 (517) T protein:vir:97 346 NRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVATPKAADSTLVIHRNDLAAIRFLKD 425 (517) T ss_pred HHHHHHHHHHHHHHHhcccCCCcccccccccccccccccccccchHHHHHHHHHHHhhhccCCEEEECHHHHHHHHHhhc Confidence 999999999999999976332210 0 011122233444444443322 235678999999999999887 Q ss_pred hhhccccccCceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchhhcccEEEEEEEEEE Q lcl|Aclame:pro 136 AKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAA 215 (231) Q Consensus 136 ~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~ 215 (231) ..+. +.....+.++...+++|+.-+.+ .++.+..... + ...+.++-..++.+-.+-|.....+.+...++.+. T Consensus 426 ~~G~--Yl~~~~~~~~~~~~l~G~~~~~~-~~~~~~~~~~---~-~~~y~i~~~~g~~~~~~fd~~~n~~~f~~~~~~~g 498 (517) T protein:vir:97 426 KNGN--YVFPVGVSNQTIATHFGFNRLVQ-SVAVDEKTAV---S-LSGYVTNGSRGMEFEQGTILVENNKEYLFEMPISG 498 (517) T ss_pred CCCC--eeccCcCCcccccccCCcccccc-ccccCceeEe---e-ccccEEEeecceeeeeeeecccCceeEeeeeeecc Confidence 6554 44445556667777888533332 2232322111 1 11223333333332222222345667778888888 Q ss_pred EEEcCCcEEEEEeccC Q lcl|Aclame:pro 216 YLYDLTKVVNITFTGV 231 (231) Q Consensus 216 ~~~~~~~vv~l~~~~~ 231 (231) .|..|++++..+++-. T Consensus 499 ~i~~~~r~a~~~~~p~ 514 (517) T protein:vir:97 499 SLEYKGTTAYGTYTPP 514 (517) T ss_pred ccccccceEEEEEcCC Confidence 9999999999888776 No 169 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=99.27 E-value=1.1e-12 Score=86.09 Aligned_cols=229 Identities=11% Similarity=0.040 Sum_probs=138.4 Q ss_pred CCC---cccCceEEecccc--CCcccccCCCccC-ccccccceeEEEeehc-cceeeec--HHHHHhcCCCHHHHHHHHH Q lcl|Aclame:pro 1 ENG---INLANLCEYPNDI--GDAADVAEGGEIS-LDKIGTTTKSVTIKKA-AKGTEIT--DEAALSGYGDPIGESNKQL 71 (231) Q Consensus 1 ~~~---~~~G~ti~~P~~i--gda~~v~EG~~i~-~~~lt~~~~~~tikk~-g~~~~it--D~~~~~~~~d~~~~~~~~~ 71 (231) .++ -++|++|+||+-. ....++....-.. ...++.+..+.++.|- +..|.|. |++.......+-....+++ T Consensus 32 ~~~~v~~~ggktVkIp~is~tsGl~DY~R~~g~~~~g~v~~~~et~tl~qDR~~~F~vD~mDvDETn~~~~~anv~~ef~ 111 (346) T protein:vir:10 32 SNSIIKFDGAKHIKVPRLEITSGRKDRQRRTITTPVANYSNDWDSYELKNERYWSTLVDPSDIDETNMVVSLANITKQFN 111 (346) T ss_pred cccceEecCCCEEEEEEeeeecccccccccCCcccccccccceeEEEeeccccceecccccchHHHHHHhHHHHHHHHHH Confidence 111 1589999999853 2466787655553 4678888888888884 5578888 5554332223333334455 Q ss_pred HHHHHHHHHHHHHHHhccc----------ccccccccCHHHHHHHHHHhhccC--CCceEEEECHHHHHHHHhhhhhhhc Q lcl|Aclame:pro 72 GLSLANKVDDDLLKAAKTT----------SQTVSTKANVDGVQAALDIFNDED--AQAYVLIVNPKDAAKIRKDANAKNI 139 (231) Q Consensus 72 a~~ia~~vd~~~~~~l~t~----------~~~~~~~~~~d~i~da~~~l~~~~--~~~~v~vv~p~~~~~L~k~~~~~~~ 139 (231) ....+-.+|.-.|+.|-+. ..+.+...-|+.|.++...|.+.. .++++++|+|+.+..|.+++.|... T Consensus 112 r~~vvPEiDayrfskLa~~a~~~~~~~~~~~a~T~~ni~~~i~~~~~~lde~~vp~~~rvl~vTp~~~~lLk~s~~f~k~ 191 (346) T protein:vir:10 112 LDSKMPEKDRYMFSHLYSGKEAAHDGGITTNTLDEKNILPAFDNMMLDFDEARIPSTNRILYVTPKTNAILKRAEAMNRA 191 (346) T ss_pred HHhhcchhhHHHHHHHHHhhhhhccccccccccCHHHHHHHHHHHHHHHHHccCCCCCeEEEECHHHHHHHhhchhheec Confidence 5556667787765544211 112234456788999999998765 4789999999999988888877543 Q ss_pred cccccCceeeeccceeecceeEEE--cCCCcc-------------CceEEEEEecCCceEEEeecCC-ccceeccchhhc Q lcl|Aclame:pro 140 GSEVGANALINGTYADVLGAQIVR--SKKLAE-------------GSALMFKIVSNSPALKLVLKRG-VQVETDRDIVTK 203 (231) Q Consensus 140 ~~~~~~~~~~~G~ig~~~G~~Vv~--s~~~~~-------------~~~~~~~~~~~~~A~~~~~k~~-v~vE~~Rd~~~~ 203 (231) . ..++....+|.+|++.|++|+. |+.++. ++.+-+.++ .+.|..-..|.+ +.+-+.-....+ T Consensus 192 ~-~v~~~~~i~~~V~siDGv~Ii~VPs~r~~t~~~f~~G~~~~t~ak~INfiiv-~~~A~ia~~K~~~~~if~P~~~~~g 269 (346) T protein:vir:10 192 L-TLKDPNNIQRTVYSLDDVTIRVVPSDLMQTAYDFSDGSKIIDTAKQIEMFLI-YNGVQIAPEKYSFVGFDQPSAATSG 269 (346) T ss_pred c-ccccccccceeeeeecCeEEEEcchhhcccchhhccCccccCCccceeEEEE-CCceeeeeeeeeeeEeeCCCCCccc Confidence 2 2233334589999999999986 454431 112222222 344444333333 222222133444 Q ss_pred ccEEEEEEEEEEEEEcCCcE-EEEEec-cC Q lcl|Aclame:pro 204 TTVITADEHYAAYLYDLTKV-VNITFT-GV 231 (231) Q Consensus 204 ~~~i~~~~~y~~~~~~~~~v-v~l~~~-~~ 231 (231) ...+..+.||.+-+++-.+- |.+.++ +. T Consensus 270 ~~l~~~R~Y~D~fv~~nk~~~Iyv~~~~a~ 299 (346) T protein:vir:10 270 NYLYYEQSYDDVLLLNTKTKGIQFVVSDKP 299 (346) T ss_pred ceeeeeeeeeeeeeeccccceEEEeeeccc Confidence 56899999999999865432 222221 11 No 170 >protein:vir:102335 Length: 312 # NCBI annotation: putative capsid protein # Family: family:all:701 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529560;genbank:gi:90592716;genbank:GeneID:3974467 Probab=99.14 E-value=8.3e-12 Score=81.34 Aligned_cols=229 Identities=10% Similarity=0.114 Sum_probs=138.7 Q ss_pred CCC-----cccCceEEeccc-cCCcccccCCCc--cCccccccceeEEEeeh-ccceeeec--HHHHHhcCCCHHHHHHH Q lcl|Aclame:pro 1 ENG-----INLANLCEYPND-IGDAADVAEGGE--ISLDKIGTTTKSVTIKK-AAKGTEIT--DEAALSGYGDPIGESNK 69 (231) Q Consensus 1 ~~~-----~~~G~ti~~P~~-igda~~v~EG~~--i~~~~lt~~~~~~tikk-~g~~~~it--D~~~~~~~~d~~~~~~~ 69 (231) |.. -++|++|+||+- +....++..+.. .+...++.+..+.++.| ++..|.|. |++........-....+ T Consensus 29 ~~~~~~v~~~ggktVkIp~i~~~gl~DY~R~~g~~~~~g~v~~~~et~tl~qDR~~~F~vD~mDvDETn~~~s~anv~~e 108 (312) T protein:vir:10 29 DSNAKQIKYEGGKEVKIGKLSTDGLGDYSRGSANAYVGGDVKFEYETKTMTQDRGRKFTLDAMDVDETNFLVTATTVMGE 108 (312) T ss_pred cCCCceEEEecCcEEEEEeeecccccccccccCCccccccccccceeEEeeecccceeeccccchhhHhhHHHHHHHHHH Confidence 211 478999999974 566677775444 66667888888888777 45578888 66554444445555566 Q ss_pred HHHHHHHHHHHHHHHHHhccc------------ccccccccCHHHHHHHHHHhhccCC-CceEEEECHHHHHHHHhhhhh Q lcl|Aclame:pro 70 QLGLSLANKVDDDLLKAAKTT------------SQTVSTKANVDGVQAALDIFNDEDA-QAYVLIVNPKDAAKIRKDANA 136 (231) Q Consensus 70 ~~a~~ia~~vd~~~~~~l~t~------------~~~~~~~~~~d~i~da~~~l~~~~~-~~~v~vv~p~~~~~L~k~~~~ 136 (231) ++.....-.+|.-.|+.|-+. +.+.+...-|+.|.++..+|.+.+. .+++++|+|+.+..|.++..+ T Consensus 109 f~r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~~T~~ni~~~i~~~~~~lde~~vp~~rvl~vTp~~~~lLk~~~~~ 188 (312) T protein:vir:10 109 FQRLKVIPEIDAYRLSRLATIAIGIKGDTNVEYSYSVNSSTIINKIKTGIKIIRENGYNGPLVCHLTYDSMFAIEEKVLE 188 (312) T ss_pred HHHhhhcchhhHHHHHHHHhhhhccccccccccccccCHHHHHHHHHHHHHHHHHccCCCceEEEeChHHHHHHhhhhhc Confidence 677777778888766644311 1122344567888999999988654 589999999999666654333 Q ss_pred hhccccccCceeeeccceeecceeEEEcC--CCc------cC----------------ceEEEEEecCCceEEEeecCC- Q lcl|Aclame:pro 137 KNIGSEVGANALINGTYADVLGAQIVRSK--KLA------EG----------------SALMFKIVSNSPALKLVLKRG- 191 (231) Q Consensus 137 ~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~--~~~------~~----------------~~~~~~~~~~~~A~~~~~k~~- 191 (231) . ...........++.++.+.|++|+.-+ .+. +| +.+-+.++ .+.|..-..|.+ T Consensus 189 ~-~~~~~~~~~~i~~~V~~iDgv~Ii~VPs~r~~t~~~f~dG~t~~~~~gg~~~~~~ak~INfiiv-~~~a~i~~~K~~~ 266 (312) T protein:vir:10 189 K-LTAVTFAQGGIQTQVPSIDGCALIKTPQNRMYSSILLNDGTTSNQTAGGYLKGTKALDTNFIIA-PVDVPLAITKQDK 266 (312) T ss_pred e-ecccccccceeeeeeeeecccEEEEchhhhccceeeeccCcccccccCceeecCcccccceEEe-CCceeeceeeeee Confidence 2 223333444568899999999999643 221 11 11111111 123333222222 Q ss_pred ccc-eeccchhhcccEEEEEEEEEEEEEcCCc-EEEEEeccC Q lcl|Aclame:pro 192 VQV-ETDRDIVTKTTVITADEHYAAYLYDLTK-VVNITFTGV 231 (231) Q Consensus 192 v~v-E~~Rd~~~~~~~i~~~~~y~~~~~~~~~-vv~l~~~~~ 231 (231) +.+ +.+-........+..+.|+.+-+++-.+ .+.++++.- T Consensus 267 ~~if~P~~~~~~d~~~~~~R~Y~D~fv~~nk~~~Iyv~~k~a 308 (312) T protein:vir:10 267 MRIFDPETNQTANAWSMDYRRYHDLWVTDNKANSVYANFKDA 308 (312) T ss_pred eeeeCCCCCCCcceeeeeeeeeeeeeeeccccCeEEEEeecc Confidence 111 1112233345699999999999986543 344555544 No 171 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=99.01 E-value=6.1e-11 Score=76.60 Aligned_cols=229 Identities=14% Similarity=0.143 Sum_probs=140.0 Q ss_pred CCCcccCceEEeccc-c-CCcccccCCCccCccccccceeEEEeeh-ccceeeecHHHHHhcCCCHHHHHHHH-HHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND-I-GDAADVAEGGEISLDKIGTTTKSVTIKK-AAKGTEITDEAALSGYGDPIGESNKQ-LGLSLA 76 (231) Q Consensus 1 ~~~~~~G~ti~~P~~-i-gda~~v~EG~~i~~~~lt~~~~~~tikk-~g~~~~itD~~~~~~~~d~~~~~~~~-~a~~ia 76 (231) +---+||++|+||+- + ....++..+...+...++.+..+.++.| ++..|.|...++..+..-.++.++++ +..... T Consensus 34 ~~~~~gak~VkIp~ist~~gl~dY~R~~g~~~g~v~~~~et~tl~~DR~~~f~iD~mDvdEn~~~~~~ni~~ef~~~~vv 113 (285) T protein:vir:79 34 TQRFRGHNEVRINKLSGFVDATAYKRGQDNARKTISVGKETVKLTHEDWFGYDLDQFDMDENGAYTVENVVREHNKMITI 113 (285) T ss_pred eeEecCCCEEEEeeecccccccccccccCccccccceeeeEEEeeccccceecccccchhhhhhhhHHHHHHHHHhhhhc Confidence 112367999999985 3 4677888888888888998888888888 45568888433322222224444443 333444 Q ss_pred HHHHHHHHHHhcccc-----cccccccCHHHHHHHHHHhhccCC-CceEEEECHHHHHHHHhhhhhhhccccccCcee-- Q lcl|Aclame:pro 77 NKVDDDLLKAAKTTS-----QTVSTKANVDGVQAALDIFNDEDA-QAYVLIVNPKDAAKIRKDANAKNIGSEVGANAL-- 148 (231) Q Consensus 77 ~~vd~~~~~~l~t~~-----~~~~~~~~~d~i~da~~~l~~~~~-~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~-- 148 (231) -.+|...|+.+-+.. .+.+...-|+.|.++..+|.+.+. .+++++|+|+.+..|.+.+.|.... ....+.. T Consensus 114 PEiDayrfskla~~a~~~~~~~~T~~nv~~~i~~~~~~lde~~vp~~rvl~vTp~~~~~Lk~s~~~~r~~-~~~~~~~~~ 192 (285) T protein:vir:79 114 PHRDKVAVQKLFDSAAKKATDSITKDNALDAYDTAEAYMFDNEVPGGFVMFVSSAYYTALKQSAAVTRTF-STDGTMVIN 192 (285) T ss_pred chhhHHHHHHHHhhcccccccccCHHHHHHHHHHHHHHHHHcCCCCceEEEEChHHHHHHHhhhhhheec-ccccceecc Confidence 567776665543222 223444568888999999988764 7899999999999999888775432 1111111 Q ss_pred -eeccceeecc-eeEEEc--CCCcc---CceEEEEEecCCceEEEeecCCcc-c-eeccchhhcccEEEEEEEEEEEEEc Q lcl|Aclame:pro 149 -INGTYADVLG-AQIVRS--KKLAE---GSALMFKIVSNSPALKLVLKRGVQ-V-ETDRDIVTKTTVITADEHYAAYLYD 219 (231) Q Consensus 149 -~~G~ig~~~G-~~Vv~s--~~~~~---~~~~~~~~~~~~~A~~~~~k~~v~-v-E~~Rd~~~~~~~i~~~~~y~~~~~~ 219 (231) .++.++.+.| ++|+.- +.++. ++.+-+.++ .+.|..-..|.+-. + +..-........+..+.||.+-+++ T Consensus 193 ~i~~~V~~lDg~v~ii~Vps~r~kt~~~~k~Infiiv-~~~a~i~~~K~~~~~~f~P~~~~~~d~~~~~~R~Y~d~fv~~ 271 (285) T protein:vir:79 193 GIDRRVAQLDGGVPIVRVSSDRLKGLGITNHVNFILT-PLSAIAPIVKYDSVSVIDPSTDRSGNRWTIKGLSYYDAIVLD 271 (285) T ss_pred ceeeeeccccceeEEEEcchhhccCcCcchhccEEEe-cCceeccceeeeeeEeECCCCCCCcceeeeeeeeeeeeeehh Confidence 2445889998 899874 45542 222333322 24444333333311 1 1111223346799999999999986 Q ss_pred CC--cEEEEEeccC Q lcl|Aclame:pro 220 LT--KVVNITFTGV 231 (231) Q Consensus 220 ~~--~vv~l~~~~~ 231 (231) -. +|.+-..+|| T Consensus 272 nk~~~Iy~~~~a~~ 285 (285) T protein:vir:79 272 NAKKGIYVAATAGV 285 (285) T ss_pred hccceeeeeecccC Confidence 53 3445556666 No 172 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=98.99 E-value=6.3e-12 Score=82.00 Aligned_cols=221 Identities=11% Similarity=0.056 Sum_probs=125.6 Q ss_pred CCCcccCceEEeccc----------------------cC-C-cccccCC----CccCccccccceeEEE---eehcccee Q lcl|Aclame:pro 1 ENGINLANLCEYPND----------------------IG-D-AADVAEG----GEISLDKIGTTTKSVT---IKKAAKGT 49 (231) Q Consensus 1 ~~~~~~G~ti~~P~~----------------------ig-d-a~~v~EG----~~i~~~~lt~~~~~~t---ikk~g~~~ 49 (231) -+.++.|-. -+|.+ .| + .-.++|+ ++.++.. ..+..+. ++++.... T Consensus 213 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~e~~~~~~~~~~~~--~~~~~~~~~~v~~l~~~~ 289 (480) T protein:vir:40 213 LNVVNSLGS-ITSKYARKSGIYDGAMKARFQGLTLAEDGVDDTFISGTFKAGTDKNKSQT--ATKRSLRPQMAEAYLQMD 289 (480) T ss_pred ccccccccc-cccchhhheeechhhhhhhhhcceeeeccccceeeeeeeecccccccccc--cccchhhHHHHHHHHHhH Confidence 011111111 01111 01 0 0112222 2222111 1112222 12222233 Q ss_pred eecHHHHHhcCCCHHHHHHHHHHHHHHHHHHHHHHHHhcccc----------cccccccCH-HHHHHHHHHhhccCCCce Q lcl|Aclame:pro 50 EITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLKAAKTTS----------QTVSTKANV-DGVQAALDIFNDEDAQAY 118 (231) Q Consensus 50 ~itD~~~~~~~~d~~~~~~~~~a~~ia~~vd~~~~~~l~t~~----------~~~~~~~~~-d~i~da~~~l~~~~~~~~ 118 (231) .+|.+..-++ .++.+...+++++.++++.++.++.+-.+.. ...+...+. +.|.+....+......+. T Consensus 290 k~t~~lLDDa-~~l~~~i~~~l~~~~~~~ee~a~l~G~g~g~~~~~g~~~~~~~~~~~~~~~d~id~L~~al~~~y~~~a 368 (480) T protein:vir:40 290 KATVRGVNDS-GALSEYVMSEMVNRVIQKVEYNMILGSVDGSNGFYGLKTATDGWTKQIEYTDLFEGITDAVAECSISDA 368 (480) T ss_pred HHHHHHhhhh-HHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceeecccccccchhHHHHHHHHHhhhHHhhCCC Confidence 4454433333 3688899999999999999999998732211 111122233 333345555544333445 Q ss_pred -EEEECHHHHHHHHhhhhhhhccccccCceeeeccceeecceeEEEcC-CCccCceEEEEEecCCceEEEeecCCcccee Q lcl|Aclame:pro 119 -VLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSK-KLAEGSALMFKIVSNSPALKLVLKRGVQVET 196 (231) Q Consensus 119 -v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~-~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~ 196 (231) ++||||.++..|++.++..++ +.-...+..|...+++|+||++++ .+|.+... +.....++.+.- +++.... T Consensus 369 ~~~vmn~~t~~~I~klKD~~G~--Yi~q~~~~~~~~~~llG~pvv~~~~~~~~~~~~---~~~~~~~~~~~d-~~~~~~~ 442 (480) T protein:vir:40 369 ITIVMSPQTFAELRKAKGTDGH--SRFNELATKEQIAQSFGAVNLETRVWMPKDEVA---VYNHDEYVLIGD-LNVENYN 442 (480) T ss_pred CEEEECHHHHHHHHHhhcCCCC--eeccCcccccCcceecccceeeeeccccCCcce---eeeCCccEEEEe-cccceec Confidence 689999999999999887654 444455667888999999988764 55655432 222333444444 3445555 Q ss_pred ccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 197 DRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 197 ~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) +.+..+-...+....+.|..+.+|++++.++++|- T Consensus 443 ~~~~~~~~~~~~~e~~v~g~~~~~~~~~~~~~~~~ 477 (480) T protein:vir:40 443 DFDLRYNVEQWLSETLVGGSIRGKNRSAYLKKKGS 477 (480) T ss_pred ccccccchhhhhhhhhhceeeEccccEEEEEeccC Confidence 55666777778888899999999999999999998 No 173 >protein:vir:2106 Length: 430 # NCBI annotation: coat protein # Family: family:all:1412 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:NP_059630;genbank:gi:9635538;genbank:GeneID:1262831 Probab=98.97 E-value=4.6e-11 Score=77.24 Aligned_cols=225 Identities=16% Similarity=0.125 Sum_probs=130.7 Q ss_pred CCCcccCceEEecc-ccCCcccccCCCccC--ccccccceeEEEeehccc-eeeecHHHHHhcCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPN-DIGDAADVAEGGEIS--LDKIGTTTKSVTIKKAAK-GTEITDEAALSGYGDPIGESNKQLGLSLA 76 (231) Q Consensus 1 ~~~~~~G~ti~~P~-~igda~~v~EG~~i~--~~~lt~~~~~~tikk~g~-~~~itD~~~~~~~~d~~~~~~~~~a~~ia 76 (231) |..-|.||||.+|. +.+- ..+|..++ ++.+...+..+++++... .|++++++ ....+...+..+...++|| T Consensus 38 ~~~~r~Gdti~ip~p~~~~---~~~G~~~t~~~~~~~e~~v~~~~~~~~~V~~~~~~kE--l~~~~~~er~l~pAm~~LA 112 (430) T protein:vir:21 38 ASMQRSSNTIWMPVEQESP---TQEGWDLTDKATGLLELNVAVNMGEPDNDFFQLRADD--LRDETAYRRRIQSAARKLA 112 (430) T ss_pred hhhhcccceEEeecccccc---ccccccccCCCccceeeeEeEEEeeeccceEEeehhH--hcChhhHHHHHHHHHHHHH Confidence 66669999999994 2221 12232221 235777888888888654 68888776 3577888999999999999 Q ss_pred HHHHHHHHHHhccc--------cc-ccccccCHHHHHHHHHHhhccCC---CceEEEECHHHHHHHHhhhhhhhcccccc Q lcl|Aclame:pro 77 NKVDDDLLKAAKTT--------SQ-TVSTKANVDGVQAALDIFNDEDA---QAYVLIVNPKDAAKIRKDANAKNIGSEVG 144 (231) Q Consensus 77 ~~vd~~~~~~l~t~--------~~-~~~~~~~~d~i~da~~~l~~~~~---~~~v~vv~p~~~~~L~k~~~~~~~~~~~~ 144 (231) ++||.++++..... .. ..++...+.++.++-+.|.+... .++.+|++|..+..|.............. T Consensus 113 ~~Vd~dl~~~~~~~~~~v~~~~~~t~~~~~~~~~~~A~a~~~L~~~~vP~~~~R~~~~~p~~~~~l~~~l~~~~~~~~~~ 192 (430) T protein:vir:21 113 NNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAEEIMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIP 192 (430) T ss_pred HHHHHHHHHHhhhhhhccccccCCCCCCCCcchhhHHHHHHHHHHhcCCCCCCcEEEeChHHHHHHhhhhccccccccch Confidence 99999998764321 11 12233467888888888877653 46899999999998866432222223445 Q ss_pred Cceeeecccee-eccee-EEEcCCCcc---CceEEEEEecCCceEE----EeecCCccc----------eeccchhhccc Q lcl|Aclame:pro 145 ANALINGTYAD-VLGAQ-IVRSKKLAE---GSALMFKIVSNSPALK----LVLKRGVQV----------ETDRDIVTKTT 205 (231) Q Consensus 145 ~~~~~~G~ig~-~~G~~-Vv~s~~~~~---~~~~~~~~~~~~~A~~----~~~k~~v~v----------E~~Rd~~~~~~ 205 (231) .+..++|+||+ +.|++ ++.++.+|. +.+-.+. +.+.+..+ .+...++.. -+.---...-| T Consensus 193 ~~A~r~g~i~r~~~Gfd~~~~s~~~~~~t~gt~t~~t-v~gA~~~~~~~~tv~~~g~~~~~d~~~~~it~s~tg~l~~GD 271 (430) T protein:vir:21 193 EEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGIT-VSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGMKRGD 271 (430) T ss_pred hHHHhhcccccccchhhhhhhcCCcccccCccCcCce-eccccccccccceeccccccccccccceeeeeecccceeccc Confidence 67789999997 88997 677887774 3322222 11111100 000011110 00001234445 Q ss_pred EEEEEEEEEEEEEcCCcE---EEEEeccC Q lcl|Aclame:pro 206 VITADEHYAAYLYDLTKV---VNITFTGV 231 (231) Q Consensus 206 ~i~~~~~y~~~~~~~~~v---v~l~~~~~ 231 (231) .+...-+|.++.+..... -...++++ T Consensus 272 ~ftiaGV~~v~~itk~~~~~l~qf~V~a~ 300 (430) T protein:vir:21 272 KISFAGVKFLGQMAKNVLAQDATFSVVRV 300 (430) T ss_pred EEEecceeeeccccccccCCcceEEEEEe Confidence 555544454444432221 11122221 No 174 >protein:vir:100939 Length: 430 # NCBI annotation: Gp5 # Family: family:all:1412 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006408;genbank:gi:46358700;genbank:GeneID:2777089 Probab=98.94 E-value=5.9e-11 Score=76.65 Aligned_cols=224 Identities=16% Similarity=0.120 Sum_probs=130.0 Q ss_pred CCCcccCceEEecc-ccCCcccccCCCccC--ccccccceeEEEeehccc-eeeecHHHHHhcCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPN-DIGDAADVAEGGEIS--LDKIGTTTKSVTIKKAAK-GTEITDEAALSGYGDPIGESNKQLGLSLA 76 (231) Q Consensus 1 ~~~~~~G~ti~~P~-~igda~~v~EG~~i~--~~~lt~~~~~~tikk~g~-~~~itD~~~~~~~~d~~~~~~~~~a~~ia 76 (231) +..-|.||||.+|. +.+-.. +|..++ ++.+...+..+++++... .|++++.+. ...+...+..+...++|| T Consensus 38 ~~~~r~Gdti~~p~~~~~~~~---~G~~~t~~~~~i~e~~v~~~v~~~k~V~~~~~~kel--~~~~~~~~~i~~Am~~LA 112 (430) T protein:vir:10 38 ASMQRSSNTIWMPVEQESPTQ---EGWDLTDKATGLLELNVAVNMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLA 112 (430) T ss_pred hhhhcccceEEeccccccccc---cCcccCCCCCccccceEEEEEeeeccceEEechhHh--cChhHHHHHhHHHHHHHH Confidence 56669999999994 333222 244332 235777788888888655 699998773 566777888888889999 Q ss_pred HHHHHHHHHHhccc--------cc-ccccccCHHHHHHHHHHhhccCC---CceEEEECHHHHHHHHhhhhhhhcccccc Q lcl|Aclame:pro 77 NKVDDDLLKAAKTT--------SQ-TVSTKANVDGVQAALDIFNDEDA---QAYVLIVNPKDAAKIRKDANAKNIGSEVG 144 (231) Q Consensus 77 ~~vd~~~~~~l~t~--------~~-~~~~~~~~d~i~da~~~l~~~~~---~~~v~vv~p~~~~~L~k~~~~~~~~~~~~ 144 (231) ++||.++++....- .. ...+...+.++.++-+.|.+... .++.+|++|..++.|.............. T Consensus 113 ~~Vd~dl~~~~~~~~~~v~~~~~~t~~~~~~~~~~~A~a~~~L~~~~vP~~~~R~~vldp~~~~~l~~~l~~l~~~~~~~ 192 (430) T protein:vir:10 113 NNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIP 192 (430) T ss_pred HHHHHHHHHHhhhcccccccccccCCCcCCcchhhHHHHHHHHHHhcCCCCCCcEEEeChHHHHHHHhhhccccccccch Confidence 99999998764321 11 12234457888888888877653 35899999999999865432222223345 Q ss_pred Cceeeecccee-eccee-EEEcCCCcc---CceEEEEEecCCceEE----EeecCCc-----------cceeccchhhcc Q lcl|Aclame:pro 145 ANALINGTYAD-VLGAQ-IVRSKKLAE---GSALMFKIVSNSPALK----LVLKRGV-----------QVETDRDIVTKT 204 (231) Q Consensus 145 ~~~~~~G~ig~-~~G~~-Vv~s~~~~~---~~~~~~~~~~~~~A~~----~~~k~~v-----------~vE~~Rd~~~~~ 204 (231) .+..++|+||+ +.|+. ++.++.+|. +.+-.+. +.+.+..+ .+...++ ++ +.---...- T Consensus 193 ~~A~r~g~i~~~~~Gfd~~~~~~~~~~~t~g~~t~~t-v~gA~~~~~~~~~v~~~g~~~~~d~~~~tit~-s~tg~l~~G 270 (430) T protein:vir:10 193 EEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGIT-VSGAQSFKPVAWQLDNDGNKVNVDNRFATVTL-SATTGLKRG 270 (430) T ss_pred hHHHhhccccccchhhhhhhhcCCcccccCccCcCce-eccccccccccceecccccccccccccceeee-ecccceecc Confidence 57789999997 88996 677887774 2222222 11111100 0000111 11 000123444 Q ss_pred cEEEEEEEEEEEEEcCCc---EEEEEeccC Q lcl|Aclame:pro 205 TVITADEHYAAYLYDLTK---VVNITFTGV 231 (231) Q Consensus 205 ~~i~~~~~y~~~~~~~~~---vv~l~~~~~ 231 (231) |.+...-+|+++.+.... +....++++ T Consensus 271 D~ftiaGV~~v~~~tkq~~~~l~~F~Vt~~ 300 (430) T protein:vir:10 271 DKISFTGVKFLGQMAKNVLAQDATFSVVRV 300 (430) T ss_pred cEEEecceeeeccccccccCCccEEEEEEe Confidence 555554444444443322 111122221 No 175 >protein:vir:9265 Length: 430 # NCBI annotation: 5 # Family: family:all:1412 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720329;genbank:gi:24371587;genbank:GeneID:955820 Probab=98.94 E-value=5.9e-11 Score=76.65 Aligned_cols=224 Identities=16% Similarity=0.120 Sum_probs=130.0 Q ss_pred CCCcccCceEEecc-ccCCcccccCCCccC--ccccccceeEEEeehccc-eeeecHHHHHhcCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPN-DIGDAADVAEGGEIS--LDKIGTTTKSVTIKKAAK-GTEITDEAALSGYGDPIGESNKQLGLSLA 76 (231) Q Consensus 1 ~~~~~~G~ti~~P~-~igda~~v~EG~~i~--~~~lt~~~~~~tikk~g~-~~~itD~~~~~~~~d~~~~~~~~~a~~ia 76 (231) +..-|.||||.+|. +.+-.. +|..++ ++.+...+..+++++... .|++++.+. ...+...+..+...++|| T Consensus 38 ~~~~r~Gdti~~p~~~~~~~~---~G~~~t~~~~~i~e~~v~~~v~~~k~V~~~~~~kel--~~~~~~~~~i~~Am~~LA 112 (430) T protein:vir:92 38 ASMQRSSNTIWMPVEQESPTQ---EGWDLTDKATGLLELNVAVNMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLA 112 (430) T ss_pred hhhhcccceEEeccccccccc---cCcccCCCCCccccceEEEEEeeeccceEEechhHh--cChhHHHHHhHHHHHHHH Confidence 56669999999994 333222 244332 235777788888888655 699998773 566777888888889999 Q ss_pred HHHHHHHHHHhccc--------cc-ccccccCHHHHHHHHHHhhccCC---CceEEEECHHHHHHHHhhhhhhhcccccc Q lcl|Aclame:pro 77 NKVDDDLLKAAKTT--------SQ-TVSTKANVDGVQAALDIFNDEDA---QAYVLIVNPKDAAKIRKDANAKNIGSEVG 144 (231) Q Consensus 77 ~~vd~~~~~~l~t~--------~~-~~~~~~~~d~i~da~~~l~~~~~---~~~v~vv~p~~~~~L~k~~~~~~~~~~~~ 144 (231) ++||.++++....- .. ...+...+.++.++-+.|.+... .++.+|++|..++.|.............. T Consensus 113 ~~Vd~dl~~~~~~~~~~v~~~~~~t~~~~~~~~~~~A~a~~~L~~~~vP~~~~R~~vldp~~~~~l~~~l~~l~~~~~~~ 192 (430) T protein:vir:92 113 NNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIP 192 (430) T ss_pred HHHHHHHHHHhhhcccccccccccCCCcCCcchhhHHHHHHHHHHhcCCCCCCcEEEeChHHHHHHHhhhccccccccch Confidence 99999998764321 11 12234457888888888877653 35899999999999865432222223345 Q ss_pred Cceeeecccee-eccee-EEEcCCCcc---CceEEEEEecCCceEE----EeecCCc-----------cceeccchhhcc Q lcl|Aclame:pro 145 ANALINGTYAD-VLGAQ-IVRSKKLAE---GSALMFKIVSNSPALK----LVLKRGV-----------QVETDRDIVTKT 204 (231) Q Consensus 145 ~~~~~~G~ig~-~~G~~-Vv~s~~~~~---~~~~~~~~~~~~~A~~----~~~k~~v-----------~vE~~Rd~~~~~ 204 (231) .+..++|+||+ +.|+. ++.++.+|. +.+-.+. +.+.+..+ .+...++ ++ +.---...- T Consensus 193 ~~A~r~g~i~~~~~Gfd~~~~~~~~~~~t~g~~t~~t-v~gA~~~~~~~~~v~~~g~~~~~d~~~~tit~-s~tg~l~~G 270 (430) T protein:vir:92 193 EEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGIT-VSGAQSFKPVAWQLDNDGNKVNVDNRFATVTL-SATTGLKRG 270 (430) T ss_pred hHHHhhccccccchhhhhhhhcCCcccccCccCcCce-eccccccccccceecccccccccccccceeee-ecccceecc Confidence 57789999997 88996 677887774 2222222 11111100 0000111 11 000123444 Q ss_pred cEEEEEEEEEEEEEcCCc---EEEEEeccC Q lcl|Aclame:pro 205 TVITADEHYAAYLYDLTK---VVNITFTGV 231 (231) Q Consensus 205 ~~i~~~~~y~~~~~~~~~---vv~l~~~~~ 231 (231) |.+...-+|+++.+.... +....++++ T Consensus 271 D~ftiaGV~~v~~~tkq~~~~l~~F~Vt~~ 300 (430) T protein:vir:92 271 DKISFTGVKFLGQMAKNVLAQDATFSVVRV 300 (430) T ss_pred cEEEecceeeeccccccccCCccEEEEEEe Confidence 555554444444443322 111122221 No 176 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=98.90 E-value=8.2e-10 Score=70.39 Aligned_cols=229 Identities=14% Similarity=0.074 Sum_probs=141.2 Q ss_pred CC---------CcccCceEEecc--ccCCcccccCCCccCccc-cccceeEEEeehccceeeecHHHHHhcCCCHH---H Q lcl|Aclame:pro 1 EN---------GINLANLCEYPN--DIGDAADVAEGGEISLDK-IGTTTKSVTIKKAAKGTEITDEAALSGYGDPI---G 65 (231) Q Consensus 1 ~~---------~~~~G~ti~~P~--~igda~~v~EG~~i~~~~-lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~---~ 65 (231) +| .++ |++.++++ -.+.++....++.++++. -|+.+.+..+...+..++|+... .+-+++++ . T Consensus 52 ~s~iL~~lpf~~ve-~~~~~~~r~~~lp~a~~r~~n~~~~~~~~~Tf~q~t~~l~~l~~~~~Vd~~i-adl~g~~~d~~~ 129 (330) T protein:vir:94 52 VNPLYEMMPFTEIE-GNALAYNRENVLGDVQFLAVGGTITAKNPATFTKVTSELTTLIGDAEVNGLI-QATRSDFMDQTS 129 (330) T ss_pred cchHHhhccccccc-CCcceeeeeecCCcceeeeccccccccCcceeeeeeechhhhhhhHHHHHHH-HHhcCCHHHHHH Confidence 00 011 33344432 256677777788888765 46788999988888888887764 33344554 4 Q ss_pred HHHHHHHHHHHHHHHHHHHHH---------hcc---cc-----cccccccCHHHHHHHHHHhhccCCCceEEEECHHHHH Q lcl|Aclame:pro 66 ESNKQLGLSLANKVDDDLLKA---------AKT---TS-----QTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAA 128 (231) Q Consensus 66 ~~~~~~a~~ia~~vd~~~~~~---------l~t---~~-----~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~ 128 (231) +..++..++|+.+....+++. |.. .. +..+++++.|++.+.+++....+.++.+++||+.... T Consensus 130 ~q~~~~ieal~~~~e~~linGDs~~~~F~GL~~~~~~~q~i~tg~~gg~~T~d~LDeLl~~v~~~~g~~~~~l~n~a~~r 209 (330) T protein:vir:94 130 VQVASKAKSIGRQYQASMITGDGTGNSFQGMMGLVAASQTISAGANGGTLTFELLDQLLDLVKDKDGQVDYLMSSFAMRR 209 (330) T ss_pred HHHHHHHHHHHHHHHHHhhccCCCCccccchhhcCCcccEEecCCCCCCCCHHHHHHHHHHhcCCCCCCcEEEechhHHH Confidence 555666678888888888873 211 11 1234677899998888888665668899999999888 Q ss_pred HHHhhhhhhhccccccCceeeec-cceeecceeEEEcCCCccCce----------EEEEEec---CCceEEEeecC--Cc Q lcl|Aclame:pro 129 KIRKDANAKNIGSEVGANALING-TYADVLGAQIVRSKKLAEGSA----------LMFKIVS---NSPALKLVLKR--GV 192 (231) Q Consensus 129 ~L~k~~~~~~~~~~~~~~~~~~G-~ig~~~G~~Vv~s~~~~~~~~----------~~~~~~~---~~~A~~~~~k~--~v 192 (231) ++++..+..............-| .+-++.|+||+.+|++|.++. |.+.+-. ..|-.++-... ++ T Consensus 210 ~I~a~~R~~~~~~v~~~~~~~~G~~v~~~~GvPi~~~d~ip~~~~~~~~~~ttsIyav~~G~~~~~qgV~Gl~~~g~~gl 289 (330) T protein:vir:94 210 KYFSLLRALGGAAIGEVMTLPSGRQIPTYRGVPWFVNDFIPSNMTQGTATNATAIFAGTFDDGSNKYGIAGLTARGSAGL 289 (330) T ss_pred HHHHHHHhccCCCCCCcccccCCCEEeeeCCeEEEecccccCCCCcccCCCceeEEEEeecccccccceEeecCCCCCcc Confidence 88887764332111111111123 345688999999999987432 2222211 12334443322 45 Q ss_pred cceecc-chhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 193 QVETDR-DIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 193 ~vE~~R-d~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) .|+.-- ...+..-......||++.+.+|.++.+|.--.+ T Consensus 290 sVr~~G~~~~k~v~~~~v~~y~~~av~~~~a~~~L~~V~~ 329 (330) T protein:vir:94 290 RVQNVGAKENADETITRVKMYCGFANFSQLGLAAIKGLIP 329 (330) T ss_pred eeeeCCCccccceeeEEEEEeeeeEEechhheeeeccccC Confidence 664432 123333344567799999999999999876555 No 177 >protein:vir:99523 Length: 311 # NCBI annotation: putative protein # Family: family:all:701 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958538;genbank:gi:41179320;genbank:GeneID:2717161 Probab=98.90 E-value=6.8e-10 Score=70.85 Aligned_cols=230 Identities=11% Similarity=0.102 Sum_probs=132.0 Q ss_pred CCC-cccCceEEeccc-cCCcccccCCCccCccccccceeEEEeeh-ccceeeec--HHHHHhcCCCHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENG-INLANLCEYPND-IGDAADVAEGGEISLDKIGTTTKSVTIKK-AAKGTEIT--DEAALSGYGDPIGESNKQLGLSL 75 (231) Q Consensus 1 ~~~-~~~G~ti~~P~~-igda~~v~EG~~i~~~~lt~~~~~~tikk-~g~~~~it--D~~~~~~~~d~~~~~~~~~a~~i 75 (231) ... .+||++|+||+- .....++.-+.-....+++.+..+.++.| ++..|.|. |++.-.....+-....+++.... T Consensus 37 ~~~~~~Gak~VkIp~i~~~gl~dY~R~~g~~~g~v~~~~et~tl~~DR~~~f~vD~mDvdETn~~~~~ani~~~f~r~~v 116 (311) T protein:vir:99 37 EVDLVNGGRSFTLKTISTSGLKDHTRGKGFNSGTISDEKTIYTMGQDRDVEFYLDRQDVDETDNELAMANISNVFITEHV 116 (311) T ss_pred chheeecCCEEEEEeeeeccccccccccCccccceeeeeeEEEeeeccceeeecchhchhhhhhhhHHHHHHHHHHHhhh Confidence 222 368999999974 55566776666677788888888888877 46678888 54432222222333344444555 Q ss_pred HHHHHHHHHHHhcccc--------------------cccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhh Q lcl|Aclame:pro 76 ANKVDDDLLKAAKTTS--------------------QTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDAN 135 (231) Q Consensus 76 a~~vd~~~~~~l~t~~--------------------~~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~ 135 (231) .=.+|.-.++.+-+.. ...+...-++.|..++..+.+...++++++|+|+.+..|...+. T Consensus 117 vPEiDayrfskla~~a~~~~~~~~~~~~~~~~~~~~~~lt~~nvl~~l~~~~~~~~~v~~~~rvl~vTp~~~~lLk~~~~ 196 (311) T protein:vir:99 117 QPELDSYRFSKIATSFDNLDGTDTEGTLLAKTHKTEETLDETNAYSQLKTGIGKVRKYGTQNLVGYVSSEVMDALERSKE 196 (311) T ss_pred cchhhHHHHHHHHhhhhcccccccchhhhccccccccccCHHHHHHHHHHHHHHHHhcCCCCeEEEEChHHHHHHhhchh Confidence 5567765554432110 01122234777888888887777789999999999997766655 Q ss_pred hhhc-cccccCceeeeccceeecceeEEEc-C--CCc------cC-------ceEEEEEecCCceEEEeecCC-ccc-ee Q lcl|Aclame:pro 136 AKNI-GSEVGANALINGTYADVLGAQIVRS-K--KLA------EG-------SALMFKIVSNSPALKLVLKRG-VQV-ET 196 (231) Q Consensus 136 ~~~~-~~~~~~~~~~~G~ig~~~G~~Vv~s-~--~~~------~~-------~~~~~~~~~~~~A~~~~~k~~-v~v-E~ 196 (231) |.-. ..........++.++.+.|++|+.. + .+. +| +.+-+.++ .+.|..-..|.. +.+ +. T Consensus 197 ~~r~~~~~~~~~~~i~~~V~~lDgv~Ii~V~ps~r~~t~~~ft~G~~~~~~ak~INfiiv-~~~a~i~~~K~~~v~~f~P 275 (311) T protein:vir:99 197 FTRNITNQNVGTTALESRITSIDGVQLIEVYESNRFMTKYDFTDGAKPTEDAKAINFLVV-AKPAVISIVKENAVFLFAP 275 (311) T ss_pred hheeeecccccccccccccceecCeEEEEecCchhhcchhhhcCCccccCcccccceEEe-CCCeeeeeeeeeeeeeeCC Confidence 5421 1111112224677899999998855 3 332 11 11222222 234444333332 111 11 Q ss_pred ccchhhcccEEEEEEEEEEEEEcCC-cEEEEEeccC Q lcl|Aclame:pro 197 DRDIVTKTTVITADEHYAAYLYDLT-KVVNITFTGV 231 (231) Q Consensus 197 ~Rd~~~~~~~i~~~~~y~~~~~~~~-~vv~l~~~~~ 231 (231) .-........+..+.|+.+-+++-. +.+.+.++.- T Consensus 276 ~~~~~gd~~l~~~R~Y~D~fv~~nk~~~Iyv~~k~A 311 (311) T protein:vir:99 276 GQHTDGDGYLYQNRLYHDLFIKKHKRDGIFVSVKKA 311 (311) T ss_pred CCCCCcceeeeeeeeeeeeeeeccccCeEEEeeecC Confidence 1122344689999999999998653 3333443322 No 178 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=98.76 E-value=3e-09 Score=67.33 Aligned_cols=223 Identities=12% Similarity=0.131 Sum_probs=146.4 Q ss_pred CCCcccC--------ceEEeccccCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLA--------NLCEYPNDIGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLG 72 (231) Q Consensus 1 ~~~~~~G--------~ti~~P~~igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a 72 (231) +.=++.| ..+.+ -.-|+...|.||.++....++-...+..+.+||+.|.||.+++..--.+.+...-+.++ T Consensus 392 ~~~~~~~~~~DFk~~~~~~l-g~~~~L~~V~E~gEyk~~t~~e~~e~~~l~tyG~~~~iTRqaiINDDL~a~~~ip~~~g 470 (652) T protein:vir:79 392 EQWTRKGQLSDFKIAHRVGM-GGFSALRQVREGAEYKYVTTGDKQATIALATYGELFSITRQAIINDDLNMLTDVPMKLG 470 (652) T ss_pred HHHhccCCCccccccceeec-CCCCCccccCCCCccceeeecCccceeeeecccCeeeeehheeeccchhHHHHHHHHHH Confidence 2222223 22322 12356678999999999888888889999999999999999988777788999999999 Q ss_pred HHHHHHHHHHHHHHhccccc-----------------ccccccCHHHHHHHHHHhhcc-------CCCceEEEECHHHHH Q lcl|Aclame:pro 73 LSLANKVDDDLLKAAKTTSQ-----------------TVSTKANVDGVQAALDIFNDE-------DAQAYVLIVNPKDAA 128 (231) Q Consensus 73 ~~ia~~vd~~~~~~l~t~~~-----------------~~~~~~~~d~i~da~~~l~~~-------~~~~~v~vv~p~~~~ 128 (231) ++.++.+++.+++.|...+. ..+..++.+.+..|..++... +..|++++|+|.... T Consensus 471 ~aA~~~~~~~vy~~l~~Np~~~~DGk~LF~hA~H~Nl~~~aa~~~~~l~~ar~aM~~Qk~g~~~l~i~P~~llvp~~le~ 550 (652) T protein:vir:79 471 RAAKSTIADLVYAILTSNPKISTDNVSLFDKAKHANVLESAAMDVASLDKARQLMRVQKEGERHLNIRPAFVLVPTAMES 550 (652) T ss_pred HHHHHHHHHHHHHHHhcCcccccCCceeecccccccccccccCCHHHHHHHHHHHHHhccCCccccccccEEEecchhHH Confidence 99999999988877643221 112456777777776555321 236789999998765 Q ss_pred HHHhhhhhhhccccccCceeeeccceeecce-eEEEcCCCccCce--EEEEEecCCceEEE--eec-CCccceeccchhh Q lcl|Aclame:pro 129 KIRKDANAKNIGSEVGANALINGTYADVLGA-QIVRSKKLAEGSA--LMFKIVSNSPALKL--VLK-RGVQVETDRDIVT 202 (231) Q Consensus 129 ~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~-~Vv~s~~~~~~~~--~~~~~~~~~~A~~~--~~k-~~v~vE~~Rd~~~ 202 (231) ...+...... ..+.+ .-+|.+--+.|. +||+++.+..... +++---.....+.. +.. +.+.+|+.-.-.. T Consensus 551 ~a~~ll~s~~---v~~a~-~~~~~~Np~~~~~~~i~eprL~~~s~~~wylaa~~~~dtiev~yL~G~~~P~ie~~~gf~~ 626 (652) T protein:vir:79 551 VANQVIRSSS---VKGAD-INAGIINPVKDFATVIAEPRLDDNSQTTFYLAASKGSDTIEVAYLNGVDTPYIDQMEGFSV 626 (652) T ss_pred HHHHHhccCC---Ccccc-cccccccccccccccccccccCCCCcccEEEecCCCCCeEEEEEecCCCCCeeeecCCCCc Confidence 5554322111 11111 123344455664 7888888865433 43321111122333 222 2356777655566 Q ss_pred cccEEEEEEEEEEEEEcCCcEEEEEe Q lcl|Aclame:pro 203 KTTVITADEHYAAYLYDLTKVVNITF 228 (231) Q Consensus 203 ~~~~i~~~~~y~~~~~~~~~vv~l~~ 228 (231) +--.++.++-||++++|=.++++.|- T Consensus 627 dG~~~kvrlD~G~~~iD~RG~~k~t~ 652 (652) T protein:vir:79 627 DGVTTKVRIDAGVAPVDHRGLVKCTA 652 (652) T ss_pred ceEEEEEEEeccCceeeccceeeecC Confidence 66788899999999999999999987 No 179 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=98.73 E-value=1.3e-08 Score=63.78 Aligned_cols=226 Identities=13% Similarity=0.087 Sum_probs=129.6 Q ss_pred CCCcccCceEEeccccCCcccccCC-----CccCccccccceeEEEeehccceeeecHHHHHhcCCCHHH---HHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGDAADVAEG-----GEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIG---ESNKQLG 72 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igda~~v~EG-----~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~---~~~~~~a 72 (231) |.+.-.++..+- .+++...+-+ ...++..-++++.+..++-.+..++|...-......+|++ +..++-. T Consensus 41 eg~~~~ynR~~~---~~~~~~~~v~~~~~~~g~~~~~~t~~~~~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~i 117 (310) T protein:vir:97 41 EGNSLAYNRENV---LGDVIMAGVGTTFSGAGAGKAAATFTKVNSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKA 117 (310) T ss_pred cCCcceeeEeec---cCCcccccccccccCCCccccccccceeeeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHH Confidence 432222222221 1222211111 1234455667888888888888888876533332344544 4467777 Q ss_pred HHHHHHHHHHHHHH---------hcc---ccc-----ccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhh Q lcl|Aclame:pro 73 LSLANKVDDDLLKA---------AKT---TSQ-----TVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDAN 135 (231) Q Consensus 73 ~~ia~~vd~~~~~~---------l~t---~~~-----~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~ 135 (231) ++|.++....+++. |.. .++ ...+.++.|++...++..-..+-++.++++||+...+++...+ T Consensus 118 ea~~~~~e~~lINGD~a~n~F~GL~~~~~~~q~i~~~~~gg~~t~d~LDeLl~~v~~~~g~p~~~l~~~~~~r~i~A~~R 197 (310) T protein:vir:97 118 KSAGRKYQDQLINGNGAGNEFAGLIQLCASGQKATTGATGSAISFAILDELMDLVVDKDGQVDYLTMHARTLRSYKALLR 197 (310) T ss_pred HHHHHHHHHHhhccccCCCcccchhhcCCccceeecCCCCCCCCHHHHHHHHHHHhcCCCCCCEEEecHHHHHHHHHHHH Confidence 88899999888873 221 111 1235678898888888875556688999999998777776655 Q ss_pred hhhc-cccccCceeeeccceeecceeEEEcCCCccCc------e----EEEEEec---CCceEEEeec--CCccceeccc Q lcl|Aclame:pro 136 AKNI-GSEVGANALINGTYADVLGAQIVRSKKLAEGS------A----LMFKIVS---NSPALKLVLK--RGVQVETDRD 199 (231) Q Consensus 136 ~~~~-~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~------~----~~~~~~~---~~~A~~~~~k--~~v~vE~~Rd 199 (231) --.. +-+-....+..-.+-++.|+||+.++++|.++ + |.+.+-. +.|-.++... -++.|+.-.. T Consensus 198 ~~~~~g~~~~~~~~~G~~v~~~~GiPi~~~d~ip~~~~~~~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~~~~glsVr~~G~ 277 (310) T protein:vir:97 198 ALGGASINEVVELPSGAEVPAYSGTPIFRNDYIPTNQTKGGTTGCTTIFAGTLDDGSRTHGIAGLTATQAAGIQVVDVGE 277 (310) T ss_pred HhcCCCCCCccccCCCCEEeeeCCeEEEEeCccCCCccccccCCceeEEEEeeCccccccceeccccCCccceeEEeCCc Confidence 3321 11111111122245688999999999998642 1 2222211 1233333211 2366665442 Q ss_pred -hhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 200 -IVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 200 -~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) .++..-......||++.+.+|.++.+| ..| T Consensus 278 ~~~~~v~~~~V~~Y~~~av~~~~A~a~L--~~V 308 (310) T protein:vir:97 278 SEDSDEHIWRVKWYCGLALFSEKGLACA--DGI 308 (310) T ss_pred ccCCcceeEEEEEeeeEEEecccceeee--ccc Confidence 233333455567999999999999987 456 No 180 >protein:vir:78090 Length: 302 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468790;genbank:gi:157325371;genbank:GeneID:5601852 Probab=98.70 E-value=6.9e-09 Score=65.33 Aligned_cols=227 Identities=11% Similarity=0.090 Sum_probs=125.8 Q ss_pred CCCcccCceEEecccc------CCcccccCCCccCccccccceeEEEeeh-ccceeeecHHHHHhc--CCCHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDI------GDAADVAEGGEISLDKIGTTTKSVTIKK-AAKGTEITDEAALSG--YGDPIGESNKQL 71 (231) Q Consensus 1 ~~~~~~G~ti~~P~~i------gda~~v~EG~~i~~~~lt~~~~~~tikk-~g~~~~itD~~~~~~--~~d~~~~~~~~~ 71 (231) ---.+||++|+||+.. ....++.-+.-.....++.+..+.++.+ ++..|.|.-.++..+ ....-....+++ T Consensus 34 ~v~~~Gak~vkIp~is~~~~~TsGl~dy~R~~g~~~g~v~~~~et~tlt~DR~~~f~vD~mDvdETn~~~~~ani~~ef~ 113 (302) T protein:vir:78 34 NVQYNGGNTIKIADISFGSGTTGDLKAYNRSTGFTQGSVTLAWSDYTLDYDLAQSFQIDAMDVDETKNLATVGNVLSEYQ 113 (302) T ss_pred eEEEecCcEEEEEEEEeeccccccccccccccCccccceeeeeeeEEeeeccceeeeccccchhhhhhhhHHHHHHHHHH Confidence 1126889999999863 3455676666666666776666666666 466687773332222 222222233344 Q ss_pred HHHHHHHHHHHHHHHhcc----cc---c----ccccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhcc Q lcl|Aclame:pro 72 GLSLANKVDDDLLKAAKT----TS---Q----TVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIG 140 (231) Q Consensus 72 a~~ia~~vd~~~~~~l~t----~~---~----~~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~ 140 (231) .....=.+|.-.|+.+-+ .. . ..+...-++.|..++..+++. ++++++|.|..+..|...+.+...- T Consensus 114 r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~t~~nvl~~i~~~~~~~~e~--~~~vl~vtp~~~~~Lk~a~~~~~~~ 191 (302) T protein:vir:78 114 RTKIVPAIDKYRFTKLANDGTGVGGVIDLSKPDASAQALMGDIATAMELVDDS--NQLILVTSPTTLAGLLNTALIRESK 191 (302) T ss_pred HhhhcchhhHHHHHHHHHhhhccCccccccccchhHHHHHHHHHHHHHHhhcc--CCeEEEEChHHHHHHhcchhhccce Confidence 455556677766654421 11 1 122233456667777777764 6999999999999887665554321 Q ss_pred -ccccCceeeeccceeecceeEEEcC--CCc-------------cCceEEEEEecCCceEEEeecCC-ccceeccchhhc Q lcl|Aclame:pro 141 -SEVGANALINGTYADVLGAQIVRSK--KLA-------------EGSALMFKIVSNSPALKLVLKRG-VQVETDRDIVTK 203 (231) Q Consensus 141 -~~~~~~~~~~G~ig~~~G~~Vv~s~--~~~-------------~~~~~~~~~~~~~~A~~~~~k~~-v~vE~~Rd~~~~ 203 (231) ......--.++.++.+.|++|+.-+ .+. .++.+-+.++ .+.|..-..|.+ +.+- .=+..+. T Consensus 192 ~~~~~~~~~i~~~V~~lDgv~Ii~VPs~r~~t~~~f~~G~~~~~~ak~INfiiv-~~~a~ia~~K~~~~~if-~P~~~~~ 269 (302) T protein:vir:78 192 NTQVLRRGEVDTKITFIQDVEVLQVPSEYLYDKVAPKVGVPDYTGAKKIPYMIF-KRDAPTGIVKTDKVRVF-EPDTNQS 269 (302) T ss_pred eccccccccccceeeeecccEEEEchhhhcccceeccCCccccCCccceeEEEE-CCCeeeeeeeeeeeEee-CCCCCCC Confidence 1111111236679999999998654 222 1122222222 234443333333 1111 1123333 Q ss_pred c--cEEEEEEEEEEEEEcCC-cEEEEE-eccC Q lcl|Aclame:pro 204 T--TVITADEHYAAYLYDLT-KVVNIT-FTGV 231 (231) Q Consensus 204 ~--~~i~~~~~y~~~~~~~~-~vv~l~-~~~~ 231 (231) . ..+..+.|+.+-+++.. +.+.+. +++| T Consensus 270 gd~~l~~~R~Y~D~fV~~nk~~gI~~~~~~~~ 301 (302) T protein:vir:78 270 ADAYKVDLRLYHDLIVPKNQRPGIIKASFGTI 301 (302) T ss_pred cceeeeeeeeEeeeeeeccccCeEEEeecccc Confidence 3 49999999999999765 334444 4445 No 181 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=98.64 E-value=9.1e-09 Score=64.65 Aligned_cols=224 Identities=12% Similarity=0.137 Sum_probs=146.4 Q ss_pred CCCcccCceEEeccccCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVD 80 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia~~vd 80 (231) =..-+.-..+.+ -+-|+...|.||.++....+.-...+..+..||+.|.||.+++..--.+.+...-+.++++.++.++ T Consensus 435 ~~DFk~~~~~~l-g~~~~L~~V~E~gEyk~~t~~e~~e~~~l~tyG~~~~iTRqaiINDDLga~~~ip~~~g~aA~~~~~ 513 (693) T protein:vir:95 435 LTDFKPARRVGL-GEFSSLRQVREGAEYKYVTLGERGEQIILATYGELFSITRQAIINDDLQMLSDIPFKLGQAAKATIG 513 (693) T ss_pred CCcccccceeec-CCCCChhhcCCCCceeeeecCCccceeehhhcCCeeeecHHhhhccchHHHHHHHHHHHHHHHHHHH Confidence 122222233333 2234567899999999988888888999999999999999999887778899999999999999999 Q ss_pred HHHHHHhccccc----------------c-cccccCHHHHHHHHHHhhcc------------CCCceEEEECHHHHHHHH Q lcl|Aclame:pro 81 DDLLKAAKTTSQ----------------T-VSTKANVDGVQAALDIFNDE------------DAQAYVLIVNPKDAAKIR 131 (231) Q Consensus 81 ~~~~~~l~t~~~----------------~-~~~~~~~d~i~da~~~l~~~------------~~~~~v~vv~p~~~~~L~ 131 (231) .-+++.|...+. + ....++.+.+..+..++... +..+++++++|......+ T Consensus 514 ~~vy~~L~~Np~m~DGk~LFhadH~Nl~tga~sals~~sl~~a~~am~~qk~~~~~~~g~~L~i~P~~llvP~~le~~a~ 593 (693) T protein:vir:95 514 DLVYAVLTGNPAMSDGKTLFHADHSNLLTGAASALSIDSLSKAKTQMATQKAQVEKGKGRTLNIRPGFVLTPVALEDKAN 593 (693) T ss_pred HHHHHHHhcCccccCCcceeeccccccccccccccChHHHHHHHHHHHHhhcchhccCCceeecccceEEecchHHHHHH Confidence 988887753320 1 23467788887776555321 135788999998776666 Q ss_pred hhhhhhhccccccCceeeeccceeecce-eEEEcCCCcc--CceEEEEEecCCceEEEe--ec-CCccceeccchhhccc Q lcl|Aclame:pro 132 KDANAKNIGSEVGANALINGTYADVLGA-QIVRSKKLAE--GSALMFKIVSNSPALKLV--LK-RGVQVETDRDIVTKTT 205 (231) Q Consensus 132 k~~~~~~~~~~~~~~~~~~G~ig~~~G~-~Vv~s~~~~~--~~~~~~~~~~~~~A~~~~--~k-~~v~vE~~Rd~~~~~~ 205 (231) +....... . +.+ .-+|.+--+.|+ +||.++.+.+ ++.+++........+... .. +...+|+.-.-..+-- T Consensus 594 ~l~~s~~~--~-~a~-~~~~~~NP~~~~~~vi~~prL~~~s~~~Wyl~a~~~~dtie~~yL~G~~~P~ie~~~gf~~dG~ 669 (693) T protein:vir:95 594 QIINSESV--P-GAD-VNSGIVNPIRAFAQVIGEPRLDDASATAWYMAAKKGSDTIEVAYLDGVDTPYLEQQEGFTVDGV 669 (693) T ss_pred HHhccccc--c-ccc-cccccccchhccccccccceecCCCCCceEEecCCCCCeEEEEEecCCCCCeEeecCCCCcceE Confidence 54321111 1 111 122333345553 7888888864 456665322211223332 22 2255666666666667 Q ss_pred EEEEEEEEEEEEEcCCcEEEEEec Q lcl|Aclame:pro 206 VITADEHYAAYLYDLTKVVNITFT 229 (231) Q Consensus 206 ~i~~~~~y~~~~~~~~~vv~l~~~ 229 (231) .++.++-||++++|=.++++=.-+ T Consensus 670 ~~kvr~D~G~~~iD~Rg~~kn~GA 693 (693) T protein:vir:95 670 ASKVRIDAGVAPLDFRGLQKSNGA 693 (693) T ss_pred EEEEEEeccCceeeccccccCCCC Confidence 888999999999998888776444 No 182 >protein:vir:93696 Length: 364 # NCBI annotation: Bcep22gp55 # Family: family:all:974 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944284;genbank:gi:38640361;genbank:GeneID:2658350 Probab=98.63 E-value=2.6e-09 Score=67.69 Aligned_cols=228 Identities=11% Similarity=0.047 Sum_probs=137.9 Q ss_pred CCCcccCceEEecc---ccCCcccccCCCcc--CccccccceeEEEeehccceeeecH-HHHHhcCCCHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPN---DIGDAADVAEGGEI--SLDKIGTTTKSVTIKKAAKGTEITD-EAALSGYGDPIGESNKQLGLS 74 (231) Q Consensus 1 ~~~~~~G~ti~~P~---~igda~~v~EG~~i--~~~~lt~~~~~~tikk~g~~~~itD-~~~~~~~~d~~~~~~~~~a~~ 74 (231) |+..+.||+|+|+- ..|+ .+..++.+ -.+.|++.+.+++|++...++.... .....+..|+..++.+.++.. T Consensus 49 dL~k~~Gd~v~f~L~~~L~g~--gv~Gd~~leGnee~L~~~~~~i~idq~r~~V~~~g~ms~qRt~~dlr~~ar~~L~~w 126 (364) T protein:vir:93 49 ELESDAGDRITFDLSVHLRGK--PTYGDARVEGKEESLRFYQDEVRIDQVRHSVSAGGRMSRKRTVHNIRRIARDRLGDY 126 (364) T ss_pred ecCCCCCceEEeeeeeecccC--CcccCceeeccccceeEEeeEEEEeeccccccccCchhhhhhHHHHHHHHHHHHHHH Confidence 88899999999962 2333 34444444 3578999999999999988887544 345557889999999999999 Q ss_pred HHHHHHHHHHHHhcccc---------------------------------------cccccccCHHHHHHHHHHhhcc-- Q lcl|Aclame:pro 75 LANKVDDDLLKAAKTTS---------------------------------------QTVSTKANVDGVQAALDIFNDE-- 113 (231) Q Consensus 75 ia~~vd~~~~~~l~t~~---------------------------------------~~~~~~~~~d~i~da~~~l~~~-- 113 (231) |++..|+.++-.|.++. .+++...+++.|.+|...+... T Consensus 127 ~~~~~d~~~f~~laGarg~~~~~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~l~stD~~sl~~id~a~~~a~~~~~ 206 (364) T protein:vir:93 127 FYKFTDELLFIYLSGARGINLDFIETPDFTGYAGNPLDAPDVDHLLYGGVATSKASLAATDIMAPLVIEKAVEKAAMMQA 206 (364) T ss_pred HHHHHHHHHHHHhhcccccccccccccCcccccccccCCCCCCcEEeccccCchhhccccccccHHHHHHHHHHHHHhCC Confidence 99999998875543211 1123346788888877765432 Q ss_pred --------------CCCceEEEECHHHHHHHHhh--hhhhhccc-----cccCceeeeccceeecceeEEEcCCCccCce Q lcl|Aclame:pro 114 --------------DAQAYVLIVNPKDAAKIRKD--ANAKNIGS-----EVGANALINGTYADVLGAQIVRSKKLAEGSA 172 (231) Q Consensus 114 --------------~~~~~v~vv~p~~~~~L~k~--~~~~~~~~-----~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~ 172 (231) +.+.+++++||.++..|+.+ +.|.+... ...++-+.+|.+|++-|+.|....+++.... T Consensus 207 ~~~~~~~~~Pv~~~g~~~yV~~l~p~q~~~Lr~~t~~~w~d~qk~A~~~~g~~nPlF~G~~gm~ngvii~~~~~vi~~~~ 286 (364) T protein:vir:93 207 ENPDVANMVPVSIDGDDHYVCVMSEYQATDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFND 286 (364) T ss_pred CCCCCcccceeEecCcceeEEEEcchhhhhhhhcCCHHHHHHHHHhhhcccccCCceecCeeeEcCeEEeccCCcccccc Confidence 12467999999999999954 34443322 2234568889999999998888777653221 Q ss_pred ------EEEE--EecCCceEEEe--ecCCc---cceeccchhhcccEEEEEEEEEEEEE----cCCcEEEEEeccC Q lcl|Aclame:pro 173 ------LMFK--IVSNSPALKLV--LKRGV---QVETDRDIVTKTTVITADEHYAAYLY----DLTKVVNITFTGV 231 (231) Q Consensus 173 ------~~~~--~~~~~~A~~~~--~k~~v---~vE~~Rd~~~~~~~i~~~~~y~~~~~----~~~~vv~l~~~~~ 231 (231) +.+. ++.+.-|+.+. ...+. -+|...|-.++.- |......|++-+ .+=+++.|.-.++ T Consensus 287 ~~~~~~v~~~ralllGaQA~~~a~g~~~g~~~~w~Ee~~D~gn~~~-i~~~~i~G~kK~rF~~~DfGvi~idtaa~ 361 (364) T protein:vir:93 287 YGAGANVEAARALFMGRQAGVIAYGTANGLRFDWEETVKDYGNEPA-IAAGFIAGMKKARFNNKDFGVISIDTAAK 361 (364) T ss_pred cccCccccchhhheecceeeEEEeecCCCCCceeeecccCCCCchh-hhhhhHhhhhhcccCCccceEEEeccccc Confidence 2111 22234444333 22221 1333333322221 222111111111 1334444444444 No 183 >protein:vir:95451 Length: 313 # NCBI annotation: hypothetical protein ORF044 # Family: family:all:11728 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294637;genbank:gi:149408203;genbank:GeneID:5237018 Probab=98.61 E-value=1.4e-09 Score=69.21 Aligned_cols=229 Identities=16% Similarity=0.187 Sum_probs=157.0 Q ss_pred CCCcc------cCceEEeccccCCcc--cccCCCccCccccccceeEEEeehc-cceeeecHHHHHhc--CCCHHHHHHH Q lcl|Aclame:pro 1 ENGIN------LANLCEYPNDIGDAA--DVAEGGEISLDKIGTTTKSVTIKKA-AKGTEITDEAALSG--YGDPIGESNK 69 (231) Q Consensus 1 ~~~~~------~G~ti~~P~~igda~--~v~EG~~i~~~~lt~~~~~~tikk~-g~~~~itD~~~~~~--~~d~~~~~~~ 69 (231) |+--| .|++++||. +|.+. .-.|...+..+.|.+++.+.-+..| |.+|-|||....++ ..+++++-.- T Consensus 31 ~~~~R~V~DF~~G~~L~I~t-iGs~~~~~~~E~~~~~~~~i~TGEIt~~i~~Y~G~A~~vt~~LR~D~~~I~~~~A~~~A 109 (313) T protein:vir:95 31 ETFYRNVSDFGSGETLHIKT-IGSVTLQEAEEDTPLIYNPIETGEITFQITEYKGDAWYVTDDLREDGTDIDRLMAERAA 109 (313) T ss_pred hhhhhhhccCCCCCEEEecc-cCceeeeccccCCCeeecccccceEEEEEEeecCChhhhhhhhhhcchhHHHHhhhcch Confidence 44444 799999976 78876 4678889999999999999999987 55899999887766 4567777777 Q ss_pred HHHHHHHHHHHHHHHHHhcc------cccc------------cccccCHHHHHHHHHHhhccC--CCceEEEECHHHHHH Q lcl|Aclame:pro 70 QLGLSLANKVDDDLLKAAKT------TSQT------------VSTKANVDGVQAALDIFNDED--AQAYVLIVNPKDAAK 129 (231) Q Consensus 70 ~~a~~ia~~vd~~~~~~l~t------~~~~------------~~~~~~~d~i~da~~~l~~~~--~~~~v~vv~p~~~~~ 129 (231) +.+++|-...+.|+++.... .+.. ......+.++...--.|...+ .+.++.++.|..... T Consensus 110 E~~RAI~E~~~TD~L~~G~~~FA~~~~P~~vNG~PH~~V~~~T~~~~~~~~~~~~~~~~~~a~~P~~G~v~IvDP~~~~~ 189 (313) T protein:vir:95 110 ESTRAIQETFETDFLKTGAEYFAANPGPHNVNGFPHVIVSAETNGVFALKHLIAMRLAFDKANVPAEGRVFIVDPVAEAT 189 (313) T ss_pred hhHHHHHHHHhhHHHhhchhhhccCCCCcccccccceEEeccCCceehhhHHHHhhhhhhhccCCccceEEEEcchhhhh Confidence 88889999999999875321 1111 112234555665556665544 578999999999988 Q ss_pred HHhhhhhhhccccccCceeeecc------ceeecceeEEEcCCCcc---------CceE-----EEEE-ecCCceEEEee Q lcl|Aclame:pro 130 IRKDANAKNIGSEVGANALINGT------YADVLGAQIVRSKKLAE---------GSAL-----MFKI-VSNSPALKLVL 188 (231) Q Consensus 130 L~k~~~~~~~~~~~~~~~~~~G~------ig~~~G~~Vv~s~~~~~---------~~~~-----~~~~-~~~~~A~~~~~ 188 (231) |-.......--+..+.=++-+|. +-.++|+.+.+||.+.. +.++ .... .+..+-++-+. T Consensus 190 L~~l~~It~~vt~~~k~I~ESG~A~~~~Fi~~~YG~Di~~SN~L~~AN~~D~~tT~~G~~~NlFM~i~D~~~~P~~~AWr 269 (313) T protein:vir:95 190 LNGLVTITHDVTDFGKMILESGMARGQRFIMNLYGWDILTSNRLHVANYNDGTTTGNGYVGNLFMCILDDQTKPIMGAWR 269 (313) T ss_pred hhhhheeecccccccceeeeccCCchhHHHHHHhhhhhhhhhhhhhccccccccccCceeeeeeeeeecccccceeeeec Confidence 87755443322223333555554 33579999999997642 1111 1110 01122233333 Q ss_pred cCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 189 KRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 189 k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) +-.+.|..|+..+..+.-..+-+||.++.+.+-++++--.|. T Consensus 270 -~MP~s~~~~~~~~~~~~~~~~~R~G~Gi~R~~~L~~~~~~A~ 311 (313) T protein:vir:95 270 -RMPKSEGERNKDRARDEHVVRCRYGFGIQRLDTLGLLATSAT 311 (313) T ss_pred -cccccccccccccccccceeeeeecccceeecceeEEEeccc Confidence 445788999998888888888899999999999888877777 No 184 >protein:vir:104439 Length: 404 # NCBI annotation: putative virion structural protein # Family: family:all:974 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794063;genbank:gi:116222008;genbank:GeneID:4397504 Probab=98.31 E-value=8.1e-08 Score=59.47 Aligned_cols=222 Identities=11% Similarity=0.074 Sum_probs=130.3 Q ss_pred CCCcccCceEEecc---ccCCcccccCCCcc--CccccccceeEEEeehccceeeecHH-HHHhcCCCHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPN---DIGDAADVAEGGEI--SLDKIGTTTKSVTIKKAAKGTEITDE-AALSGYGDPIGESNKQLGLS 74 (231) Q Consensus 1 ~~~~~~G~ti~~P~---~igda~~v~EG~~i--~~~~lt~~~~~~tikk~g~~~~itD~-~~~~~~~d~~~~~~~~~a~~ 74 (231) |+..+.||+|+|+- ..|+ .+..++.+ -.+.|++.+.+++|++....+..... ....+..|+..++.+.++.. T Consensus 62 dL~K~aGd~vtf~L~~~L~g~--gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w 139 (404) T protein:vir:10 62 DLNKQAGDEVTFSIMHKLSKR--PTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTY 139 (404) T ss_pred cCCCCCCcEEEEeEeeecccC--CcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHH Confidence 88889999999972 2333 34444444 36789999999999999888866554 34557889999999999999 Q ss_pred HHHHHHHHHHHHhccccc---------------------------------------------ccccccCHHHHHHHHHH Q lcl|Aclame:pro 75 LANKVDDDLLKAAKTTSQ---------------------------------------------TVSTKANVDGVQAALDI 109 (231) Q Consensus 75 ia~~vd~~~~~~l~t~~~---------------------------------------------~~~~~~~~d~i~da~~~ 109 (231) |++..|+-++-.|.+++. +++...+++-|.++... T Consensus 140 ~~~~~d~~~~~~laG~rg~~~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~ 219 (404) T protein:vir:10 140 FNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLF 219 (404) T ss_pred HHHHHHHHHHHHHhccccccccccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHH Confidence 999999988754432110 11122355556565554 Q ss_pred hhcc----------CCC------ceEEEECHHHHHHHHhhhh---hhhccc------cccCceeeeccceeecceeEEEc Q lcl|Aclame:pro 110 FNDE----------DAQ------AYVLIVNPKDAAKIRKDAN---AKNIGS------EVGANALINGTYADVLGAQIVRS 164 (231) Q Consensus 110 l~~~----------~~~------~~v~vv~p~~~~~L~k~~~---~~~~~~------~~~~~~~~~G~ig~~~G~~Vv~s 164 (231) +... +++ .+++++||.++..|+.|+. |.+... ...++-+.+|..|++-|+.|..- T Consensus 220 ~~~~~~pi~Pv~~~g~~~~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~ 299 (404) T protein:vir:10 220 IDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKY 299 (404) T ss_pred HHHhCCCCcceEeccccccCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEec Confidence 4221 122 4899999999999999963 443222 13456788899999999888754 Q ss_pred CCCc------------------------cCceEEEEEecCCceEEEe--ecCC---ccceeccchhhcccEE-------- Q lcl|Aclame:pro 165 KKLA------------------------EGSALMFKIVSNSPALKLV--LKRG---VQVETDRDIVTKTTVI-------- 207 (231) Q Consensus 165 ~~~~------------------------~~~~~~~~~~~~~~A~~~~--~k~~---v~vE~~Rd~~~~~~~i-------- 207 (231) .++| ++..+..-++.+.-|+.+. ...+ --.|...|-.++.-+- T Consensus 300 ~~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~~~i~~~~i~G~k 379 (404) T protein:vir:10 300 AGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLK 379 (404) T ss_pred CCceeeecccceeeecCCccccccccccccccchhheeecceeEEEEeeccCCCCceeEeeccccCchhhhhhHHHhhhh Confidence 4433 0000001122334444333 2222 1234433322221111 Q ss_pred --------EEEEEEEEEEEcCCcEEEE Q lcl|Aclame:pro 208 --------TADEHYAAYLYDLTKVVNI 226 (231) Q Consensus 208 --------~~~~~y~~~~~~~~~vv~l 226 (231) -.-+-||+-+++- .++| T Consensus 380 K~rF~~~~g~~~DfGvi~idt--a~~~ 404 (404) T protein:vir:10 380 KIRFPEKSGKMQDHGVIAVDT--AVKL 404 (404) T ss_pred hccccCCCCceeeEEEEEecc--cccC Confidence 1112344444432 1222 No 185 >protein:vir:3298 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049514;genbank:gi:9632520;genbank:GeneID:1262006 Probab=98.31 E-value=8.1e-08 Score=59.47 Aligned_cols=222 Identities=11% Similarity=0.074 Sum_probs=130.3 Q ss_pred CCCcccCceEEecc---ccCCcccccCCCcc--CccccccceeEEEeehccceeeecHH-HHHhcCCCHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPN---DIGDAADVAEGGEI--SLDKIGTTTKSVTIKKAAKGTEITDE-AALSGYGDPIGESNKQLGLS 74 (231) Q Consensus 1 ~~~~~~G~ti~~P~---~igda~~v~EG~~i--~~~~lt~~~~~~tikk~g~~~~itD~-~~~~~~~d~~~~~~~~~a~~ 74 (231) |+..+.||+|+|+- ..|+ .+..++.+ -.+.|++.+.+++|++....+..... ....+..|+..++.+.++.. T Consensus 62 dL~K~aGd~vtf~L~~~L~g~--gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w 139 (404) T protein:vir:32 62 DLNKQAGDEVTFSIMHKLSKR--PTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTY 139 (404) T ss_pred cCCCCCCcEEEEeEeeecccC--CcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHH Confidence 88889999999972 2333 34444444 36789999999999999888866554 34557889999999999999 Q ss_pred HHHHHHHHHHHHhccccc---------------------------------------------ccccccCHHHHHHHHHH Q lcl|Aclame:pro 75 LANKVDDDLLKAAKTTSQ---------------------------------------------TVSTKANVDGVQAALDI 109 (231) Q Consensus 75 ia~~vd~~~~~~l~t~~~---------------------------------------------~~~~~~~~d~i~da~~~ 109 (231) |++..|+-++-.|.+++. +++...+++-|.++... T Consensus 140 ~~~~~d~~~~~~laG~rg~~~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~ 219 (404) T protein:vir:32 140 FNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLF 219 (404) T ss_pred HHHHHHHHHHHHHhccccccccccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHH Confidence 999999988754432110 11122355556565554 Q ss_pred hhcc----------CCC------ceEEEECHHHHHHHHhhhh---hhhccc------cccCceeeeccceeecceeEEEc Q lcl|Aclame:pro 110 FNDE----------DAQ------AYVLIVNPKDAAKIRKDAN---AKNIGS------EVGANALINGTYADVLGAQIVRS 164 (231) Q Consensus 110 l~~~----------~~~------~~v~vv~p~~~~~L~k~~~---~~~~~~------~~~~~~~~~G~ig~~~G~~Vv~s 164 (231) +... +++ .+++++||.++..|+.|+. |.+... ...++-+.+|..|++-|+.|..- T Consensus 220 ~~~~~~pi~Pv~~~g~~~~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~ 299 (404) T protein:vir:32 220 IDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKY 299 (404) T ss_pred HHHhCCCCcceEeccccccCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEec Confidence 4221 122 4899999999999999963 443222 13456788899999999888754 Q ss_pred CCCc------------------------cCceEEEEEecCCceEEEe--ecCC---ccceeccchhhcccEE-------- Q lcl|Aclame:pro 165 KKLA------------------------EGSALMFKIVSNSPALKLV--LKRG---VQVETDRDIVTKTTVI-------- 207 (231) Q Consensus 165 ~~~~------------------------~~~~~~~~~~~~~~A~~~~--~k~~---v~vE~~Rd~~~~~~~i-------- 207 (231) .++| ++..+..-++.+.-|+.+. ...+ --.|...|-.++.-+- T Consensus 300 ~~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~~~i~~~~i~G~k 379 (404) T protein:vir:32 300 AGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLK 379 (404) T ss_pred CCceeeecccceeeecCCccccccccccccccchhheeecceeEEEEeeccCCCCceeEeeccccCchhhhhhHHHhhhh Confidence 4433 0000001122334444333 2222 1234433322221111 Q ss_pred --------EEEEEEEEEEEcCCcEEEE Q lcl|Aclame:pro 208 --------TADEHYAAYLYDLTKVVNI 226 (231) Q Consensus 208 --------~~~~~y~~~~~~~~~vv~l 226 (231) -.-+-||+-+++- .++| T Consensus 380 K~rF~~~~g~~~DfGvi~idt--a~~~ 404 (404) T protein:vir:32 380 KIRFPEKSGKMQDHGVIAVDT--AVKL 404 (404) T ss_pred hccccCCCCceeeEEEEEecc--cccC Confidence 1112344444432 1222 No 186 >protein:vir:819 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050552;genbank:gi:9633449;genbank:GeneID:1262254 Probab=98.31 E-value=8.1e-08 Score=59.47 Aligned_cols=222 Identities=11% Similarity=0.074 Sum_probs=130.3 Q ss_pred CCCcccCceEEecc---ccCCcccccCCCcc--CccccccceeEEEeehccceeeecHH-HHHhcCCCHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPN---DIGDAADVAEGGEI--SLDKIGTTTKSVTIKKAAKGTEITDE-AALSGYGDPIGESNKQLGLS 74 (231) Q Consensus 1 ~~~~~~G~ti~~P~---~igda~~v~EG~~i--~~~~lt~~~~~~tikk~g~~~~itD~-~~~~~~~d~~~~~~~~~a~~ 74 (231) |+..+.||+|+|+- ..|+ .+..++.+ -.+.|++.+.+++|++....+..... ....+..|+..++.+.++.. T Consensus 62 dL~K~aGd~vtf~L~~~L~g~--gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w 139 (404) T protein:vir:81 62 DLNKQAGDEVTFSIMHKLSKR--PTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTY 139 (404) T ss_pred cCCCCCCcEEEEeEeeecccC--CcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHH Confidence 88889999999972 2333 34444444 36789999999999999888866554 34557889999999999999 Q ss_pred HHHHHHHHHHHHhccccc---------------------------------------------ccccccCHHHHHHHHHH Q lcl|Aclame:pro 75 LANKVDDDLLKAAKTTSQ---------------------------------------------TVSTKANVDGVQAALDI 109 (231) Q Consensus 75 ia~~vd~~~~~~l~t~~~---------------------------------------------~~~~~~~~d~i~da~~~ 109 (231) |++..|+-++-.|.+++. +++...+++-|.++... T Consensus 140 ~~~~~d~~~~~~laG~rg~~~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~ 219 (404) T protein:vir:81 140 FNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLF 219 (404) T ss_pred HHHHHHHHHHHHHhccccccccccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHH Confidence 999999988754432110 11122355556565554 Q ss_pred hhcc----------CCC------ceEEEECHHHHHHHHhhhh---hhhccc------cccCceeeeccceeecceeEEEc Q lcl|Aclame:pro 110 FNDE----------DAQ------AYVLIVNPKDAAKIRKDAN---AKNIGS------EVGANALINGTYADVLGAQIVRS 164 (231) Q Consensus 110 l~~~----------~~~------~~v~vv~p~~~~~L~k~~~---~~~~~~------~~~~~~~~~G~ig~~~G~~Vv~s 164 (231) +... +++ .+++++||.++..|+.|+. |.+... ...++-+.+|..|++-|+.|..- T Consensus 220 ~~~~~~pi~Pv~~~g~~~~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~ 299 (404) T protein:vir:81 220 IDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKY 299 (404) T ss_pred HHHhCCCCcceEeccccccCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEec Confidence 4221 122 4899999999999999963 443222 13456788899999999888754 Q ss_pred CCCc------------------------cCceEEEEEecCCceEEEe--ecCC---ccceeccchhhcccEE-------- Q lcl|Aclame:pro 165 KKLA------------------------EGSALMFKIVSNSPALKLV--LKRG---VQVETDRDIVTKTTVI-------- 207 (231) Q Consensus 165 ~~~~------------------------~~~~~~~~~~~~~~A~~~~--~k~~---v~vE~~Rd~~~~~~~i-------- 207 (231) .++| ++..+..-++.+.-|+.+. ...+ --.|...|-.++.-+- T Consensus 300 ~~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~~~i~~~~i~G~k 379 (404) T protein:vir:81 300 AGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLK 379 (404) T ss_pred CCceeeecccceeeecCCccccccccccccccchhheeecceeEEEEeeccCCCCceeEeeccccCchhhhhhHHHhhhh Confidence 4433 0000001122334444333 2222 1234433322221111 Q ss_pred --------EEEEEEEEEEEcCCcEEEE Q lcl|Aclame:pro 208 --------TADEHYAAYLYDLTKVVNI 226 (231) Q Consensus 208 --------~~~~~y~~~~~~~~~vv~l 226 (231) -.-+-||+-+++- .++| T Consensus 380 K~rF~~~~g~~~DfGvi~idt--a~~~ 404 (404) T protein:vir:81 380 KIRFPEKSGKMQDHGVIAVDT--AVKL 404 (404) T ss_pred hccccCCCCceeeEEEEEecc--cccC Confidence 1112344444432 1222 No 187 >protein:vir:10123 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859253;genbank:gi:32171009;genbank:GeneID:2653345 Probab=98.31 E-value=8.1e-08 Score=59.47 Aligned_cols=222 Identities=11% Similarity=0.074 Sum_probs=130.3 Q ss_pred CCCcccCceEEecc---ccCCcccccCCCcc--CccccccceeEEEeehccceeeecHH-HHHhcCCCHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPN---DIGDAADVAEGGEI--SLDKIGTTTKSVTIKKAAKGTEITDE-AALSGYGDPIGESNKQLGLS 74 (231) Q Consensus 1 ~~~~~~G~ti~~P~---~igda~~v~EG~~i--~~~~lt~~~~~~tikk~g~~~~itD~-~~~~~~~d~~~~~~~~~a~~ 74 (231) |+..+.||+|+|+- ..|+ .+..++.+ -.+.|++.+.+++|++....+..... ....+..|+..++.+.++.. T Consensus 62 dL~K~aGd~vtf~L~~~L~g~--gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w 139 (404) T protein:vir:10 62 DLNKQAGDEVTFSIMHKLSKR--PTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTY 139 (404) T ss_pred cCCCCCCcEEEEeEeeecccC--CcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHH Confidence 88889999999972 2333 34444444 36789999999999999888866554 34557889999999999999 Q ss_pred HHHHHHHHHHHHhccccc---------------------------------------------ccccccCHHHHHHHHHH Q lcl|Aclame:pro 75 LANKVDDDLLKAAKTTSQ---------------------------------------------TVSTKANVDGVQAALDI 109 (231) Q Consensus 75 ia~~vd~~~~~~l~t~~~---------------------------------------------~~~~~~~~d~i~da~~~ 109 (231) |++..|+-++-.|.+++. +++...+++-|.++... T Consensus 140 ~~~~~d~~~~~~laG~rg~~~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~ 219 (404) T protein:vir:10 140 FNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLF 219 (404) T ss_pred HHHHHHHHHHHHHhccccccccccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHH Confidence 999999988754432110 11122355556565554 Q ss_pred hhcc----------CCC------ceEEEECHHHHHHHHhhhh---hhhccc------cccCceeeeccceeecceeEEEc Q lcl|Aclame:pro 110 FNDE----------DAQ------AYVLIVNPKDAAKIRKDAN---AKNIGS------EVGANALINGTYADVLGAQIVRS 164 (231) Q Consensus 110 l~~~----------~~~------~~v~vv~p~~~~~L~k~~~---~~~~~~------~~~~~~~~~G~ig~~~G~~Vv~s 164 (231) +... +++ .+++++||.++..|+.|+. |.+... ...++-+.+|..|++-|+.|..- T Consensus 220 ~~~~~~pi~Pv~~~g~~~~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~ 299 (404) T protein:vir:10 220 IDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKY 299 (404) T ss_pred HHHhCCCCcceEeccccccCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEec Confidence 4221 122 4899999999999999963 443222 13456788899999999888754 Q ss_pred CCCc------------------------cCceEEEEEecCCceEEEe--ecCC---ccceeccchhhcccEE-------- Q lcl|Aclame:pro 165 KKLA------------------------EGSALMFKIVSNSPALKLV--LKRG---VQVETDRDIVTKTTVI-------- 207 (231) Q Consensus 165 ~~~~------------------------~~~~~~~~~~~~~~A~~~~--~k~~---v~vE~~Rd~~~~~~~i-------- 207 (231) .++| ++..+..-++.+.-|+.+. ...+ --.|...|-.++.-+- T Consensus 300 ~~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~~~i~~~~i~G~k 379 (404) T protein:vir:10 300 AGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLK 379 (404) T ss_pred CCceeeecccceeeecCCccccccccccccccchhheeecceeEEEEeeccCCCCceeEeeccccCchhhhhhHHHhhhh Confidence 4433 0000001122334444333 2222 1234433322221111 Q ss_pred --------EEEEEEEEEEEcCCcEEEE Q lcl|Aclame:pro 208 --------TADEHYAAYLYDLTKVVNI 226 (231) Q Consensus 208 --------~~~~~y~~~~~~~~~vv~l 226 (231) -.-+-||+-+++- .++| T Consensus 380 K~rF~~~~g~~~DfGvi~idt--a~~~ 404 (404) T protein:vir:10 380 KIRFPEKSGKMQDHGVIAVDT--AVKL 404 (404) T ss_pred hccccCCCCceeeEEEEEecc--cccC Confidence 1112344444432 1222 No 188 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=98.25 E-value=3.4e-07 Score=56.05 Aligned_cols=225 Identities=14% Similarity=0.088 Sum_probs=135.5 Q ss_pred CCCccc-CceEEeccc--cCCcccccCCC-ccCccccccceeEEEeehccceeeecHHHHHhc---CCCHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINL-ANLCEYPND--IGDAADVAEGG-EISLDKIGTTTKSVTIKKAAKGTEITDEAALSG---YGDPIGESNKQLGL 73 (231) Q Consensus 1 ~~~~~~-G~ti~~P~~--igda~~v~EG~-~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~---~~d~~~~~~~~~a~ 73 (231) ..-... -.+++++.+ .|.+..++.++ ++|..+...+.....+.+++..|.++.+++.++ +.++-..-....++ T Consensus 41 ~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~ 120 (296) T protein:vir:10 41 SNEIPGYAKYFEYPVFDGVGIAQIVADYTDDLPLVDALATERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFE 120 (296) T ss_pred ccCCCCceeEEEeeeeeccCceeEeCCCccccceeeccceeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHH Confidence 222222 235676655 67788887654 589999999999999999999999887766443 66777777777788 Q ss_pred HHHHHHHHHHHHH--------hccccc---ccc---c---ccCHHHHHHHHHHhhcc---CCCceEEEECHHHHHHHHhh Q lcl|Aclame:pro 74 SLANKVDDDLLKA--------AKTTSQ---TVS---T---KANVDGVQAALDIFNDE---DAQAYVLIVNPKDAAKIRKD 133 (231) Q Consensus 74 ~ia~~vd~~~~~~--------l~t~~~---~~~---~---~~~~d~i~da~~~l~~~---~~~~~v~vv~p~~~~~L~k~ 133 (231) +++.+.|+-+|.+ |-..++ ..+ + ..-+++|..++..+... ...++.++++|..+..|..- T Consensus 121 ~~~~~~n~~~f~G~~~~g~~GLlN~p~v~~~~~~~~W~~~t~i~~Di~~~~~~l~~~s~g~~~p~~l~L~p~~~~~L~~~ 200 (296) T protein:vir:10 121 AHDKLLDKLVWSGSTAHGIPSVFDYPNINNVVSGGSWSQPTTAVSDITSLLDIIETSTNGQHRATHLLLPTTARRIMQNL 200 (296) T ss_pred HHHHhhceEEEeecccccceeEeecCCCccccccCCccCHHHHHHHHHHHHHHHHHhhCceecceeEEeCHHHHHHHhhc Confidence 8888888765533 111111 111 1 12378888888766532 25678899999999888642 Q ss_pred hhhhhccccccCceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchhhcccEEEEEEEE Q lcl|Aclame:pro 134 ANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHY 213 (231) Q Consensus 134 ~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y 213 (231) .. ..+.....-+-.++...+|.++|.+.+.. ..++...+.+..++.-+.+....++.+..- ........+.....+ T Consensus 201 ~~--~~~~t~l~~ik~~~~~l~i~~~~~l~~a~-~~g~~~~v~~~~~~~~~~~~v~~~~~~~~~-e~~~l~~~~~~~~~~ 276 (296) T protein:vir:10 201 VP--GTSVSYGEFFRQNNSGVTVEFVQYLNDYN-GTGTSAAIAYEKDPNNMAIEIPEATNALPA-QPKDLHFKIPVTSKA 276 (296) T ss_pred cC--CCCccHHHHHHHhcCCceEEEeeeeccCC-CCcceEEEEEEcCCceEEEEcCcceeeecc-cccCceEEEeeEeeE Confidence 11 11111111112223334455555544332 224444555555566666655455444321 223345556666655 Q ss_pred -EEEEEcCCcEEEE---Eec Q lcl|Aclame:pro 214 -AAYLYDLTKVVNI---TFT 229 (231) Q Consensus 214 -~~~~~~~~~vv~l---~~~ 229 (231) |+-+..|.+++.+ ||+ T Consensus 277 ~Gv~i~~P~ai~~~dGI~~~ 296 (296) T protein:vir:10 277 TGLIVYRPLTMAVMKGITFA 296 (296) T ss_pred EEEEEECCceeEEEeeeecC Confidence 6999999999997 777 No 189 >protein:vir:105610 Length: 430 # NCBI annotation: virion structural protein # Family: family:all:974 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164307;genbank:gi:56692923;genbank:GeneID:3197221 Probab=98.22 E-value=1e-07 Score=58.91 Aligned_cols=229 Identities=13% Similarity=0.080 Sum_probs=130.1 Q ss_pred CCCcccCceEEecc---ccCCcccccCCCcc--CccccccceeEEEeehccceeeecHH-HHHhcCCCHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPN---DIGDAADVAEGGEI--SLDKIGTTTKSVTIKKAAKGTEITDE-AALSGYGDPIGESNKQLGLS 74 (231) Q Consensus 1 ~~~~~~G~ti~~P~---~igda~~v~EG~~i--~~~~lt~~~~~~tikk~g~~~~itD~-~~~~~~~d~~~~~~~~~a~~ 74 (231) |+.-+.||+|+|+- ..|++ +..++.+ -.+.|++.+..++|++....+.+-.. +...+..|+..++.+.++.- T Consensus 67 dL~K~~GD~Vtf~L~~~L~g~g--v~Gd~~lEGnee~L~~~~d~l~IDq~R~~V~~gg~msqQRt~~dlR~~ar~~L~~w 144 (430) T protein:vir:10 67 DLGRNKGDEVRFHFVQPANAFP--IMGSEYAEGKGTGLKIGSDQLRVNQARFPVDLGDVMSQIRNPYDLRRLGRPKAKWF 144 (430) T ss_pred cCCCCCccEEEEeEeeccccCc--eecCceeeccccceEEEeeEEEEeeeccccccCCchhhhhhhhHHHHHHHHHHHHH Confidence 77789999999973 23443 2223333 35789999999999999888877654 34557889999999999999 Q ss_pred HHHHHHHHHHHHhccc-----------------------------c------------------------cccccccCHH Q lcl|Aclame:pro 75 LANKVDDDLLKAAKTT-----------------------------S------------------------QTVSTKANVD 101 (231) Q Consensus 75 ia~~vd~~~~~~l~t~-----------------------------~------------------------~~~~~~~~~d 101 (231) |++..|+-+|-.|.++ + .+.+...+++ T Consensus 145 ~~~~~Dq~~~v~laGarg~~~~~~~~~~~~~~~~~~~~~~N~v~aPt~nrh~~~~G~at~~~~~~~~~~sl~stD~~s~~ 224 (430) T protein:vir:10 145 MDAYLDQSMLVHLAGARGNHYNKEWCLPLETHPKLADMLVNRVKAPTKNRHFVASADAITGVAPNAGEYNITTADVLDVD 224 (430) T ss_pred HHHHHHHHHHHHHhhhhcccccccccccccCCcchhhhhccccCCCCCceeEeecccccccccccccccchhhhcccCHH Confidence 9999998765433211 0 0112224566 Q ss_pred HHHHHHHHhhcc----------CCC------ceEEEECHHHHHHHHhhhhhhhcc-------ccccCceeeeccceeecc Q lcl|Aclame:pro 102 GVQAALDIFNDE----------DAQ------AYVLIVNPKDAAKIRKDANAKNIG-------SEVGANALINGTYADVLG 158 (231) Q Consensus 102 ~i~da~~~l~~~----------~~~------~~v~vv~p~~~~~L~k~~~~~~~~-------~~~~~~~~~~G~ig~~~G 158 (231) -|.+|....... +++ .+++++||.++..|+.|+.+.+.. ....++-+.+|.+|++-| T Consensus 225 ~id~a~~~a~~~~~~i~Pv~v~gd~~~g~~~~yV~~~~p~q~~~Lr~dt~~~~wq~~~~a~a~~g~~nPlF~G~~gm~ng 304 (430) T protein:vir:10 225 VVDSIATYMDQIELPPPPVKFEGDEAAEDSPIRVLLCSPAQYNSFAKQEKFRSWQAAALARASNAKQHPIFRVDAGLWSN 304 (430) T ss_pred HHHHHHHHHHhhCCCCcceEeecccccCCccEEEEEechHHHHHHhhCcchHHHHHHHHHhhcccccCCceecceeeecC Confidence 676666655432 122 389999999999999999875321 122346788999999999 Q ss_pred eeEEEcCCC------------cc-----------------CceEEEEEecCCceEEEeecCC-------ccceeccchhh Q lcl|Aclame:pro 159 AQIVRSKKL------------AE-----------------GSALMFKIVSNSPALKLVLKRG-------VQVETDRDIVT 202 (231) Q Consensus 159 ~~Vv~s~~~------------~~-----------------~~~~~~~~~~~~~A~~~~~k~~-------v~vE~~Rd~~~ 202 (231) +.|..-.++ .+ +..+.--++.+..|+.+..... .=.|...|-.+ T Consensus 305 vii~~~~~virf~~g~~~~~~a~~~~~~~~~~~~~a~~~~~~~v~RalllGaQA~~~A~g~~~~~g~~f~w~Ee~~D~g~ 384 (430) T protein:vir:10 305 TLIIKMPKPIRFYAGDTIKYCAAYNSEAESSAVVSDSFGNQYAVDRALLLGGQALAQAWAASEHSGMPFFWSEKDMDHGD 384 (430) T ss_pred eEEecCCceeeecCCCccccccCCcccccccccccccccccccchhhhhccchhheeeeeccCCCCcceeeeeeccccCc Confidence 988754321 10 0000001122333443332221 11454444333 Q ss_pred cccEEE----EEE--EEEEE-----EEcCCcEEEEEeccC Q lcl|Aclame:pro 203 KTTVIT----ADE--HYAAY-----LYDLTKVVNITFTGV 231 (231) Q Consensus 203 ~~~~i~----~~~--~y~~~-----~~~~~~vv~l~~~~~ 231 (231) +.-+-. |.. +|... -+++=+++.|.-.+. T Consensus 385 ~~~i~~~~i~G~kK~rF~~~~~~~~~~~DfGvi~idtaa~ 424 (430) T protein:vir:10 385 KLELLIGAILGCSKIRFAVEATNGLEYTDHGVMAIDTAVK 424 (430) T ss_pred hhhhhhhHHhccceeeecCCCCCCceeeeeEEEEhhhhhh Confidence 222111 100 11110 112233333322222 No 190 >protein:vir:2770 Length: 318 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612887;genbank:gi:20065804;genbank:GeneID:935710 Probab=98.15 E-value=4e-07 Score=55.68 Aligned_cols=177 Identities=8% Similarity=0.074 Sum_probs=115.1 Q ss_pred CCCcccCceEEecc---ccCCcccccCCCcc--CccccccceeEEEeehccceeeecHH-HHHhcCCCHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPN---DIGDAADVAEGGEI--SLDKIGTTTKSVTIKKAAKGTEITDE-AALSGYGDPIGESNKQLGLS 74 (231) Q Consensus 1 ~~~~~~G~ti~~P~---~igda~~v~EG~~i--~~~~lt~~~~~~tikk~g~~~~itD~-~~~~~~~d~~~~~~~~~a~~ 74 (231) |+.-+.||+|+|+- ..|++ +..++.+ -.+.|++.+.+++|++....|..... +...+..|+..++.+.++.. T Consensus 62 dL~K~~GD~Vtf~L~~~L~g~g--v~Gd~~lEGnee~L~~~~d~l~IDq~r~~V~~gg~msqqRt~~dlR~~ar~~L~~w 139 (318) T protein:vir:27 62 DLNKQAGDEVTFSIMHKLSKRP--TMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTY 139 (318) T ss_pred cCCCCCccEEEEeEeeccccCc--cccCceeeccccceEEEeeEEEEeeeccccccccchhhhhhhHHHHHHHHHHHHHH Confidence 78889999999973 24443 2223333 35789999999999999888876654 34456789999999999999 Q ss_pred HHHHHHHHHHHHhccccc---------------------------------------------ccccccCHHHHHHHHHH Q lcl|Aclame:pro 75 LANKVDDDLLKAAKTTSQ---------------------------------------------TVSTKANVDGVQAALDI 109 (231) Q Consensus 75 ia~~vd~~~~~~l~t~~~---------------------------------------------~~~~~~~~d~i~da~~~ 109 (231) |++..|+-+|-.|.++++ +.+...+++-|.++... T Consensus 140 ~~~~~Dq~~~v~laGarg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~g~at~~~~l~stD~~s~~lid~~~~~ 219 (318) T protein:vir:27 140 FNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLF 219 (318) T ss_pred HHHHHHHHHHHHHhhcccccccccceEecccCccchhhhhcccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHH Confidence 999999987655422111 11122345555555444 Q ss_pred hhc----------cCCC------ceEEEECHHHHHHHHhhh---hhhhccc------cccCceeeeccceeecceeEEEc Q lcl|Aclame:pro 110 FND----------EDAQ------AYVLIVNPKDAAKIRKDA---NAKNIGS------EVGANALINGTYADVLGAQIVRS 164 (231) Q Consensus 110 l~~----------~~~~------~~v~vv~p~~~~~L~k~~---~~~~~~~------~~~~~~~~~G~ig~~~G~~Vv~s 164 (231) +.. ++++ .+++++||.++..|+.+. +|.+..+ ...++-+.+|.+|++-|+=+..- T Consensus 220 ~~~~a~pi~PV~v~g~~~~~~~~~yV~~~~p~q~~~Lrtdt~~~~w~d~q~~A~~r~~g~knPLF~G~~gm~ngvil~~~ 299 (318) T protein:vir:27 220 IDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKY 299 (318) T ss_pred HHHhCCCCcceeeccccccCCcceEEEEechHHHHHHhhcCCCHHHHHHHHHHHhcccccCCCceecceeeecCEEEeec Confidence 422 1122 489999999999999886 3444322 12345688999999999977777 Q ss_pred CCCc----cCceEEEEEecCCceEEEeecCCcc Q lcl|Aclame:pro 165 KKLA----EGSALMFKIVSNSPALKLVLKRGVQ 193 (231) Q Consensus 165 ~~~~----~~~~~~~~~~~~~~A~~~~~k~~v~ 193 (231) .++| .|.-+.+ ..++ T Consensus 300 ~~vpIrf~~G~~v~~--------------~~~~ 318 (318) T protein:vir:27 300 AGMPIRFYQGQRFWY--------------QRIT 318 (318) T ss_pred CCccEEEcCCCeeee--------------eecC Confidence 7666 1221111 0111 No 191 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=98.01 E-value=2.2e-06 Score=51.63 Aligned_cols=225 Identities=14% Similarity=0.126 Sum_probs=127.1 Q ss_pred CCCcc-cCceEEeccc--cCCcccccCCC-ccCccccccceeEEEeehccceeeecHHHHHh---cCCCHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGIN-LANLCEYPND--IGDAADVAEGG-EISLDKIGTTTKSVTIKKAAKGTEITDEAALS---GYGDPIGESNKQLGL 73 (231) Q Consensus 1 ~~~~~-~G~ti~~P~~--igda~~v~EG~-~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~---~~~d~~~~~~~~~a~ 73 (231) ..-.. +-.++.++.+ .|.++.++.+. ++|..+...+.....+.+++..|.++..+... .+.++-..-....++ T Consensus 38 ~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~dip~~~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~ 117 (301) T protein:vir:80 38 KFDVNEGAESYSFDVMTRSGAAKIIANGADDLPLVDVDMVRKSVPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRR 117 (301) T ss_pred ccCCCCceEEEEEeeeccceeEEEecCcccccccccccceeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHH Confidence 22222 3344666643 67788887765 48999999999999999999988887766544 467777777888888 Q ss_pred HHHHHHHHHHHHHh--------ccccc----------cc---cc-----ccCHHHHHHHHHHhhcc---CCCceEEEECH Q lcl|Aclame:pro 74 SLANKVDDDLLKAA--------KTTSQ----------TV---ST-----KANVDGVQAALDIFNDE---DAQAYVLIVNP 124 (231) Q Consensus 74 ~ia~~vd~~~~~~l--------~t~~~----------~~---~~-----~~~~d~i~da~~~l~~~---~~~~~v~vv~p 124 (231) +++.+.|+-+|.+- .+.++ .. .. .--+++|.+++.++... ...+..++++| T Consensus 118 ~~~~~~n~~~f~G~~~~g~~GLlN~p~~~~~~~~~~~~~~~~~w~~~t~~ei~~di~~~~~~l~~~s~g~~~p~~L~L~p 197 (301) T protein:vir:80 118 AIAEKENSIAFRGEKKYAIKGAFEATGIQIDVSPTTGVGNVSKWEKKTAEQIIDEIGEAHTKITVLPGYGTASLKLCLPP 197 (301) T ss_pred HHHHhhceEEeeecccccceeeecCCCcccccccCcccccccccccCCHHHHHHHHHHHHHHHHHhcCceecccEEEecH Confidence 88888887665431 11111 00 00 11267788888887542 24678999999 Q ss_pred HHHHHHHhhhhhhhccccccCcee-eeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCcccee-ccchhh Q lcl|Aclame:pro 125 KDAAKIRKDANAKNIGSEVGANAL-INGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVET-DRDIVT 202 (231) Q Consensus 125 ~~~~~L~k~~~~~~~~~~~~~~~~-~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~-~Rd~~~ 202 (231) ..+..|..-......+. ...+.+ .+....+|.++|.+.+.. ..++...+.+..++.-+.+....++.... .+... T Consensus 198 ~~~~~L~~~~~~~~~~~-tvl~~l~~~~~~~~I~~~p~L~~~g-~~g~~~~v~~~~~~d~~~~~v~~~~~~~~~e~~~~- 274 (301) T protein:vir:80 198 KQFELINKKRYSNEDSR-SVLKVLQDNAWFSAIVRVPDLAGMG-TAGSDSFAVIHDSNETAELIIPMDITRHPEEYSFP- 274 (301) T ss_pred HHHHhhhhccccCCCCe-eHHHHHHHHcCcceEEEcceeccCC-CCcccEEEEEecCCcEEEEEecCceeeecceecCc- Confidence 99999853211011111 111222 223334455555444322 23444444444444434443333332211 11111 Q ss_pred cccEEEEEE-EEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 203 KTTVITADE-HYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 203 ~~~~i~~~~-~y~~~~~~~~~vv~l~~~~~ 231 (231) ...+.... ..|+-+..|.+++.++ |+ T Consensus 275 -~~~~~~~~r~~Gv~i~~P~ai~~~~--GI 301 (301) T protein:vir:80 275 -RTKVPFEERTAGVVVRFPAAIVRVD--GI 301 (301) T ss_pred -eeEeeeeeeeEEEEEEccceEEEEe--cC Confidence 22222223 3578999999999874 55 No 192 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=97.98 E-value=1.7e-06 Score=52.22 Aligned_cols=225 Identities=13% Similarity=0.069 Sum_probs=127.3 Q ss_pred CCCcccCc-eEEeccc--cCCcccccCCC-ccCccccccceeEEEeehccceeeecHHHHHhc---CCCHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLAN-LCEYPND--IGDAADVAEGG-EISLDKIGTTTKSVTIKKAAKGTEITDEAALSG---YGDPIGESNKQLGL 73 (231) Q Consensus 1 ~~~~~~G~-ti~~P~~--igda~~v~EG~-~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~---~~d~~~~~~~~~a~ 73 (231) ......|+ +++++.+ .|.+..++.+. ++|..+...+.....+..++..+.++..++..+ +.++-..-...+++ T Consensus 59 ~~~~~~~~et~~~~~~e~~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~ 138 (314) T protein:vir:10 59 TNEIPGHAKYFEYPEFDGVGIAQIIADYSDDLPLVDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFE 138 (314) T ss_pred ccCCCCceeEEEeeeeccccceeeeCCcccccceeecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHH Confidence 22222232 6777654 78888888765 489999999999999999999999887766554 66666666777777 Q ss_pred HHHHHHHHHHHHHh--------ccccc------ccccc---cCHHHHHHHHHHhhcc---CCCceEEEECHHHHHHHHhh Q lcl|Aclame:pro 74 SLANKVDDDLLKAA--------KTTSQ------TVSTK---ANVDGVQAALDIFNDE---DAQAYVLIVNPKDAAKIRKD 133 (231) Q Consensus 74 ~ia~~vd~~~~~~l--------~t~~~------~~~~~---~~~d~i~da~~~l~~~---~~~~~v~vv~p~~~~~L~k~ 133 (231) +++.+.|+-+|-+- -..+. ...++ --+++|+.++..+... ...+..++++|..+..|..- T Consensus 139 ~~~~~~n~i~f~G~~~~g~~GLlN~p~v~~~~~~~~WaT~~ei~~Di~~~~~~l~~~s~g~~~p~~l~Lpp~~~~~L~~~ 218 (314) T protein:vir:10 139 AHDNLLDKLVWSGSAPHGIVSVFDQPNINNVVATPNWSVPQNAIDDVTAMIDAVESSTQGLHHVTDILLPASARRVMQGL 218 (314) T ss_pred HHHHhhceEEEeecccccceeEeecCCCccccCCCCcccHHHHHHHHHHHHHHHHHhcCccccceeEEecHHHHHhhccc Confidence 77777776554321 11111 11111 1267777777777542 24678899999988777431 Q ss_pred hhhhhccccccCceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchhhcccEEEEEEE- Q lcl|Aclame:pro 134 ANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEH- 212 (231) Q Consensus 134 ~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~- 212 (231) ....+.....-+..|+..-+|.++|-+.+.. ..++...+.+..++.-+.+..-.+++.-.- ........+....+ T Consensus 219 --~~~~~~tvl~~l~~n~~~l~I~~~~el~~ag-~~g~~~~v~y~~~~~~~~~~vp~~~~~l~~-e~~~~~~~~~~~~r~ 294 (314) T protein:vir:10 219 --VPQTNLSYGELFTRNNPGLTIRFLQFLDNYD-GAGGKAALAFEKSPLNMSIEIPEVTNVLPA-QPKDLHFRYPVTSKA 294 (314) T ss_pred --ccCCCccHHHHHHHhCCCcEEEEcccccccC-CCcceEEEEEecCCcEEEEecCccceeecc-eecCceEEEcceeee Confidence 1111111111122233333455555444322 223333444444554444433333332211 11223334433333 Q ss_pred EEEEEEcCCcEEE---EEec Q lcl|Aclame:pro 213 YAAYLYDLTKVVN---ITFT 229 (231) Q Consensus 213 y~~~~~~~~~vv~---l~~~ 229 (231) .|+-+..|.+++. |||+ T Consensus 295 ~Gv~i~~P~ai~~~dGI~~~ 314 (314) T protein:vir:10 295 TGLIVYRPLTMAVIKGITFA 314 (314) T ss_pred EEEEEECcceeEeeeeeecC Confidence 5789999999996 5666 No 193 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=97.90 E-value=2.8e-06 Score=51.04 Aligned_cols=224 Identities=10% Similarity=0.059 Sum_probs=130.9 Q ss_pred CCCcccCc-eEEeccc--cCCcccccCCC-ccCccccccceeEEEeehccceeeecHHHHHh---cCCCHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLAN-LCEYPND--IGDAADVAEGG-EISLDKIGTTTKSVTIKKAAKGTEITDEAALS---GYGDPIGESNKQLGL 73 (231) Q Consensus 1 ~~~~~~G~-ti~~P~~--igda~~v~EG~-~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~---~~~d~~~~~~~~~a~ 73 (231) ......|+ +++++.+ .|.+..++.+. ++|..+...+.....+.+++..|.++..+... .+.++-..-....++ T Consensus 63 ~~~~~~~~~~~~~~~~~~~G~a~~~~d~~~dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~ 142 (319) T protein:vir:10 63 TTELSPTDKTFEYMTFDKVGTAQIIADYTDDLPLVDALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQL 142 (319) T ss_pred ccCCCCceEEEEeeeeccccceeeecCccccccceeccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHH Confidence 33333333 4666644 68888887755 48999999999999999999999888766544 366777777777788 Q ss_pred HHHHHHHHHHHHH--------hccccc----ccc---------cccCHHHHHHHHHHhhcc---CCCceEEEECHHHHHH Q lcl|Aclame:pro 74 SLANKVDDDLLKA--------AKTTSQ----TVS---------TKANVDGVQAALDIFNDE---DAQAYVLIVNPKDAAK 129 (231) Q Consensus 74 ~ia~~vd~~~~~~--------l~t~~~----~~~---------~~~~~d~i~da~~~l~~~---~~~~~v~vv~p~~~~~ 129 (231) +++.+.|+-+|.+ |.+.++ ..+ ...-+++|..++..+... ...++.++++|..+.. T Consensus 143 ~~~~~~n~i~f~G~~~~g~~GLlN~p~~~~~~~~~~~~~~t~t~~~i~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~ 222 (319) T protein:vir:10 143 AHDQLVNRLVFKGSAPHKIVSVFNHPNITKITSGKWIDVSTMKPETAEAELTQAIETIETITRGQHRATNILIPPSMRKV 222 (319) T ss_pred HHHHhhceEEEeecccccceeEEeCCCceeeecCCCCCccccCHHHHHHHHHHHHHHHHHhcCceeeceEEEecHHHHHh Confidence 8888888755532 111111 011 112356677777766422 2578899999999998 Q ss_pred HHh-hhhhhhccccccCceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchhhcccEEE Q lcl|Aclame:pro 130 IRK-DANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVIT 208 (231) Q Consensus 130 L~k-~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~ 208 (231) |.. -+. .+.....-+-.++...+|.++|.+..-. ..++...+.+..++.-+.+....++++..- ........+. T Consensus 223 L~~~~~~---~~~t~l~~lk~~~~~l~I~~~pel~~ag-~~g~~~~v~y~~~~~~~~~~v~~~~~~~~~-e~~~l~~~~~ 297 (319) T protein:vir:10 223 LAIRMPE---TTMSYLDYFKSQNSGIEIDSIAELEDID-GAGTKGVLVYEKNPMNMSIEIPEAFNMLPA-QPKDLHFKVP 297 (319) T ss_pred hhcccCC---CCeeHHHHHHHhcCCceEEEeeeecccC-CCcceEEEEEecCCceEEEecCcceeeeee-eecCceEEEe Confidence 853 221 1111111122233334555555544432 234444455555555555554444443321 1122333443 Q ss_pred EEE-EEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 209 ADE-HYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 209 ~~~-~y~~~~~~~~~vv~l~~~~~ 231 (231) ... ..|+-+..|.+++.++ |+ T Consensus 298 ~~~r~~Gv~i~~P~ai~~~d--GI 319 (319) T protein:vir:10 298 CTSKCTGLTIYRPMTIVLIT--GV 319 (319) T ss_pred eeeeeEEEEEEccceeEeee--cC Confidence 444 3568899999999874 55 No 194 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=97.87 E-value=6.9e-06 Score=48.87 Aligned_cols=220 Identities=14% Similarity=0.114 Sum_probs=128.9 Q ss_pred CCCcc---cCceEEeccccCCcccccC-CCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGIN---LANLCEYPNDIGDAADVAE-GGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLA 76 (231) Q Consensus 1 ~~~~~---~G~ti~~P~~igda~~v~E-G~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia 76 (231) .-+++ -...-+. -|.|+.-.+.| ..++....|+-.+.+++.+++|+.|.|+.+++..-....+....++++++.+ T Consensus 32 ~~a~~~~sdf~~~~~-~~lg~~p~l~e~~Ge~~~~~l~~~~~~i~~~~~g~~v~i~R~~i~nDdlg~~~~~~~~~G~aaa 110 (302) T protein:vir:10 32 KIAMEVPSNTSSNDY-KWLSTFPKMRRWIGAKVVKNLKAYKYVVENEDFEATVEVDRNDIEDDQIGIYSPQAKMAGYSAA 110 (302) T ss_pred ceeeecCCCcceeec-eecCCCCCccccccceeeccccccceeEEeecccceecccHHhhcccccchhHHHHHHHHHHHH Confidence 22222 2222222 46787766666 5778888999999999999999999999999999888899999999999999 Q ss_pred HHHHHHHHHHhcccccc------------------------------cccccCHHHHHHH---HHHhhccC-----CCce Q lcl|Aclame:pro 77 NKVDDDLLKAAKTTSQT------------------------------VSTKANVDGVQAA---LDIFNDED-----AQAY 118 (231) Q Consensus 77 ~~vd~~~~~~l~t~~~~------------------------------~~~~~~~d~i~da---~~~l~~~~-----~~~~ 118 (231) +..|.-+++.|...... ....++.+.+..+ +..+.+.. ..++ T Consensus 111 ~~~~~lv~~~L~~g~~~~~~DG~~fF~~dH~~g~~~~~N~g~~~~~~~~~~l~~~~~~aa~~am~~~k~~~G~~L~i~P~ 190 (302) T protein:vir:10 111 QLPDELVYEAVNGAFTKPCFDGQYFIDTDHPVGDASVSNKGTAPLSNASQAAAKAGYGAARTAMKKFKDEEGRSLNVSPN 190 (302) T ss_pred hhHHHHHHHHHhccCCCcccCCcceecccccccccccccccchhhhhcccccchHHHHHHHHHHHHHhhhcccccccCCC Confidence 99999888877542110 0012333334444 33333222 3678 Q ss_pred EEEECHHHHHHHHhhhhhhhccccccCceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEE--ee-cCCccce Q lcl|Aclame:pro 119 VLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKL--VL-KRGVQVE 195 (231) Q Consensus 119 v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~--~~-k~~v~vE 195 (231) .|+|+|.-...-++.-...... ....+. ..|. +.+++++.+.+++.+++.- .+.++.. +. .+...+| T Consensus 191 ~LiVp~~le~~A~~ll~~~~~~-~g~~Np----~~g~---~~~vv~p~L~s~~aWyL~a--~~~~i~~~~l~g~~~P~~~ 260 (302) T protein:vir:10 191 VLLVGPALEDVAKMLLTNPKLA-DNTPNP----YVGT---AELVVDGRIESDTAWFLLD--TTKPVKPFIFQPRKQPEFV 260 (302) T ss_pred EEEecchhHHHHHHHhhccccC-CCCcce----eccc---eEEEEeeccCCCCceEEEe--cCCccceEEEcCccccEEE Confidence 8999997665554432111100 001111 1122 6899999998888877542 2223222 22 2235566 Q ss_pred eccchhhcccEEEEEEEEEE------EEEcCCcEEEEEeccC Q lcl|Aclame:pro 196 TDRDIVTKTTVITADEHYAA------YLYDLTKVVNITFTGV 231 (231) Q Consensus 196 ~~Rd~~~~~~~i~~~~~y~~------~~~~~~~vv~l~~~~~ 231 (231) +.-+.....-.++....||+ +...+.....=+-+|- T Consensus 261 ~~~~~~~dgv~~k~~~d~Gvd~R~~~G~~~wq~a~~s~g~~~ 302 (302) T protein:vir:10 261 SQVNLDSDDVFNLRKLKFGAEARAAAGYGFWQLAYGSTGTGA 302 (302) T ss_pred eccCCCCCceEEEEEEEEeeeeeeecchhhhhhhhccCccCC Confidence 55555444445555555654 3333333333222222 No 195 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=97.81 E-value=1.8e-05 Score=46.63 Aligned_cols=221 Identities=12% Similarity=0.046 Sum_probs=129.6 Q ss_pred CCCcccCceEEeccc----cCCcc--cccCCCccCccccccceeEEEeehc-cceeeecHHHHHh---cCCCHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND----IGDAA--DVAEGGEISLDKIGTTTKSVTIKKA-AKGTEITDEAALS---GYGDPIGESNKQ 70 (231) Q Consensus 1 ~~~~~~G~ti~~P~~----igda~--~v~EG~~i~~~~lt~~~~~~tikk~-g~~~~itD~~~~~---~~~d~~~~~~~~ 70 (231) +.+++ .++ .| ..++. ...||.+.+....+.....-...|+ .+.++||.-+... +..|.++.-..+ T Consensus 41 ~~a~~--~~~---~W~~d~l~~~~~~~~~EG~da~~~~~~~r~~~~N~tQIf~k~v~VSgTa~av~~~G~~~ela~q~~k 115 (317) T protein:vir:88 41 GVATA--ITH---EWQTDELRQPGKNTRVEGEDATIKAGSFTTMLNNYCQISDETLQVTGTADRVKKAGRKNELAYQLAK 115 (317) T ss_pred ceecc--cEE---EEEeeecCCccccccccCcccccccccCCEEeccEEEEEEeEEEEeehhhhhhhcCccchhHHHHHH Confidence 33321 122 45 22332 2458888776665555544444553 5667777755443 235666666666 Q ss_pred HHHHHHHHHHHHHHHHhcc-----c--c----c----------------------------ccccccCHHHHHHHHHHhh Q lcl|Aclame:pro 71 LGLSLANKVDDDLLKAAKT-----T--S----Q----------------------------TVSTKANVDGVQAALDIFN 111 (231) Q Consensus 71 ~a~~ia~~vd~~~~~~l~t-----~--~----~----------------------------~~~~~~~~d~i~da~~~l~ 111 (231) -...|.+.++..++..-+. + + + ....+++-+.|.++++++- T Consensus 116 k~~EikrdmE~~li~g~~a~~~~~~t~~r~~~Gl~~~i~t~~~~~~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~ 195 (317) T protein:vir:88 116 KSKELKLDMEYALVGAPQAKVQRNTTTPGQMANIFAYYKTNGSLGANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIW 195 (317) T ss_pred HHHHHHHHHHHHHhcCeeeccCCCCccchhhhhHHHHhccCceeccCccccccCCCccccccccccccHHHHHHHHHHHH Confidence 6777778888777754221 0 0 0 0011368888999999999 Q ss_pred ccCCCceEEEECHHHHHHHHhhhhhhhccc--cccCceeeec--cceeecc-eeEEEcCCCccCceEEEEEecCCceEEE Q lcl|Aclame:pro 112 DEDAQAYVLIVNPKDAAKIRKDANAKNIGS--EVGANALING--TYADVLG-AQIVRSKKLAEGSALMFKIVSNSPALKL 186 (231) Q Consensus 112 ~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~--~~~~~~~~~G--~ig~~~G-~~Vv~s~~~~~~~~~~~~~~~~~~A~~~ 186 (231) +++..++.++|+|.....+-+.-....... ......+... .+-.-+| ++|+.+++||+++.+.++ +..+.+ T Consensus 196 ~~Gg~~~~i~v~a~~k~~i~~~~~~~~~~i~~~~~~~~~g~~v~~~~tdfG~v~ii~~r~lp~~~~~~~D----~~~~~l 271 (317) T protein:vir:88 196 RNGGQANSIQTSSSIKKAISKNMKGRATEITLDASDNRIAQTVDVYESDFGKYTIRANRWFHENTLFVFD----PKMHSL 271 (317) T ss_pred hcCCCCCEEEeChHHHHHHHHHhcCCceeEEEcccCeEEEEEEEEEEeCCeEEEEEeCCCCCCCeEEEEc----ccccce Confidence 999888899999998877766532111110 1111111110 1111244 699999999999887766 444555 Q ss_pred eecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 187 VLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 187 ~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) ..-|++..|.- -..-.++......-|+..+.||.+..+|+--.- T Consensus 272 ~~Lr~~~~e~l-aKtGd~~k~~i~~E~tLe~~N~~a~a~i~~l~~ 315 (317) T protein:vir:88 272 CYLRPFFQHEL-AKTGDSEKRQLLVEYTFRVNNEKSGALIRDVVA 315 (317) T ss_pred eecccceeecc-CCCcccceeEEEEEEEEEEcCccceeEEEEecc Confidence 44466654411 112234445555569999999999998875444 No 196 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=97.62 E-value=3.3e-05 Score=45.13 Aligned_cols=225 Identities=9% Similarity=0.042 Sum_probs=109.1 Q ss_pred CC-C--------cccCceEEeccc-cCCccc--ccC-CCccCccccccceeEE-EeehccceeeecHHHHHhcCCC---- Q lcl|Aclame:pro 1 EN-G--------INLANLCEYPND-IGDAAD--VAE-GGEISLDKIGTTTKSV-TIKKAAKGTEITDEAALSGYGD---- 62 (231) Q Consensus 1 ~~-~--------~~~G~ti~~P~~-igda~~--v~E-G~~i~~~~lt~~~~~~-tikk~g~~~~itD~~~~~~~~d---- 62 (231) |+ . .-.-.+.+||+. +|---- -.| |+.....+.+..+... +.++.-..+.++.+........ T Consensus 47 ~~t~iL~~~r~~~~~s~~~ei~kig~G~r~~r~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~~i~~~~~~~n~~~~~~~ 126 (360) T protein:vir:99 47 KGVQILGMADTMTLARLEMEVPQFGVPRLSGHTRDEEGSRTENSEAESGSVKFNATDKSYYILVEPKRDALKNTHYGPDQ 126 (360) T ss_pred hccchhhhcceeecccccccccccccceeeccccccCCCCCcCCcCccccCccccccceeeEeechHHHHHhhhhcccch Confidence 00 0 001111122211 111000 001 1111111122222111 3445556677777776664332 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh-----------------------ccc---c------ccccc-------------- Q lcl|Aclame:pro 63 PIGESNKQLGLSLANKVDDDLLKAA-----------------------KTT---S------QTVST-------------- 96 (231) Q Consensus 63 ~~~~~~~~~a~~ia~~vd~~~~~~l-----------------------~t~---~------~~~~~-------------- 96 (231) +....+++++..+++-+..-.+.+- +.+ . ..++. T Consensus 127 f~~~i~~~~ae~~~~Dle~l~~~g~~ds~d~~~~~~~d~fl~~~dGwlKka~~~~~~id~a~d~t~~~~~~~~~~~~~~~ 206 (360) T protein:vir:99 127 FGDYIVDQFIERYGNDLGLMGIRAGASSGNLQSIGGAAELDNTFKGWIARAEGDAQSVDDAGDSTRIGLEDTATADADSM 206 (360) T ss_pred hHHHHHHHHHHHHHHHHHHHHhhccchhcccccCcccchhhhhhHHHHHHhhcccchhhccccccccccccccccccccc Confidence 2344455666555553332221110 000 0 00000 Q ss_pred -------------ccCHHHHHHHHHHhhccCC--C--ceEEEECHHHHHHHHhhhhhhhccccccCceeeeccceeecce Q lcl|Aclame:pro 97 -------------KANVDGVQAALDIFNDEDA--Q--AYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGA 159 (231) Q Consensus 97 -------------~~~~d~i~da~~~l~~~~~--~--~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~ 159 (231) +..-+-+.+++..|-+..- . .-++++||..+...+.-. ..+.+.+|+.++.++..-.++|+ T Consensus 207 ~~~~~~~g~~~~~~~~~~lf~~~~~~Lp~kyr~~~~~~~~~~~s~~~~~~yr~~L--~~R~t~LGd~~l~g~~~~~~~Gi 284 (360) T protein:vir:99 207 PSIANTDGSGNPQPVDTSLFNETIQTLDSRYRESDAYSPVLMTSPNQVQSYTMSL--TEREDPLGSAVIFGDSDITPFSY 284 (360) T ss_pred hhhhccccccccccchHHHHHHHHHhcchhhhcCcccceEEEccCchHHHHHHHH--hccCcccchhheeccccccccee Confidence 0011123455555544321 1 227999999888888643 34566777777776655578999 Q ss_pred eEEEcCCCccCceEEEEEecCCceEEEeecCCccc----eeccchhhcccEEEE-EEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 160 QIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQV----ETDRDIVTKTTVITA-DEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 160 ~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~v----E~~Rd~~~~~~~i~~-~~~y~~~~~~~~~vv~l~~~~~ 231 (231) ||+..+.+|++...... +.-+.++.-+.+.+ |.+|...+...+++- +..+.+.+-++++||.++---+ T Consensus 285 pi~~v~~~pd~~~mlT~----p~NLi~g~~~~iri~~~~e~~~~~~~~~~~~~~~~~~~D~~iee~~Av~~vt~~~~ 357 (360) T protein:vir:99 285 DLVGVNGFPDEYMMFTD----PNNLAFGLYEEMELDQSTDTDKVHEQRLHSRNWLEGQFDFQIKEQQAGVLVTDLET 357 (360) T ss_pred eeEEcCCCCCCceEEec----cCceeEEeeeeeEEeecccchhhhhhceeeeEEEEEEeeEEEEecccEEEEecCCC Confidence 99999999998754432 44445555556654 555544444433333 3456667777888888876555 No 197 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=97.54 E-value=2.5e-05 Score=45.80 Aligned_cols=226 Identities=9% Similarity=0.039 Sum_probs=126.3 Q ss_pred CCCcccC-ceEEeccc--cCCcccccCC-CccCccccccceeEEEeehccceeeecHHHHHhc---CCCHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLA-NLCEYPND--IGDAADVAEG-GEISLDKIGTTTKSVTIKKAAKGTEITDEAALSG---YGDPIGESNKQLGL 73 (231) Q Consensus 1 ~~~~~~G-~ti~~P~~--igda~~v~EG-~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~---~~d~~~~~~~~~a~ 73 (231) .....-| .+++++.+ .|.+..++.+ +.+|..+.........+.+++..|.++..+...+ +.++-..-....++ T Consensus 68 ~~~~~~~~~~~t~~~~~~~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~ 147 (329) T protein:vir:79 68 TSELSDTDKTFEYQTFDKVGHAKIIADYTDDLSTVDALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQN 147 (329) T ss_pred ccCCCCceeEEEeeeeecceeeeeecCcccccceeecccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHH Confidence 2222222 25666644 6778888765 4788889999999999999999988887665543 66666666777777 Q ss_pred HHHHHHHHHHHHH--------hccccc-------c--c-cc-----ccCHHHHHHHHHHhhcc---CCCceEEEECHHHH Q lcl|Aclame:pro 74 SLANKVDDDLLKA--------AKTTSQ-------T--V-ST-----KANVDGVQAALDIFNDE---DAQAYVLIVNPKDA 127 (231) Q Consensus 74 ~ia~~vd~~~~~~--------l~t~~~-------~--~-~~-----~~~~d~i~da~~~l~~~---~~~~~v~vv~p~~~ 127 (231) +++.+.|+-+|.+ |.+.+. + . .. .--+++|.+++..+... ...+..++++|..+ T Consensus 148 ~~~~~~n~i~f~G~~~~g~~GLlN~p~v~~~~~~~~~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~~~p~~L~Lpp~~~ 227 (329) T protein:vir:79 148 AHDQLVNHLVFKGSKPHKIISVFEHPNLTTINSAGWNNAAGTGKKPETAQDELEQAIEKIETLTNGQHRANMILIPPSMR 227 (329) T ss_pred HHHHhhccEEEeecccccceeeecCCCccccccCCCCCccccccCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHH Confidence 8888777655432 111111 0 0 01 11367777777777543 24678899999999 Q ss_pred HHHH-hhhhhhhccccccCceeeeccceeecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchhhcccE Q lcl|Aclame:pro 128 AKIR-KDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTV 206 (231) Q Consensus 128 ~~L~-k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~~ 206 (231) ..|. +.+.. +.....-+-.++...+|.++|-+.+.. ..++...+.+..++.-+.+..-.+.+...- ........ T Consensus 228 ~~L~~~~~~~---~~tvl~~lk~~~~~l~I~~~~el~~ag-~~g~~~~v~y~~~~~~~~~~vp~~~~~l~~-q~~~~~~~ 302 (329) T protein:vir:79 228 KVLMVRMPET---TMSYLDYFKQQNGGITIESISELEDID-GAGTKAALVYEKDPMNMSIEIPEAFNMLTA-QPKDLHFK 302 (329) T ss_pred HHhhcccCCC---CccHHHHHHHhCCCcEEEEcccccccC-CCCceEEEEEecCCceEEEecCcceeeeec-eecCceEE Confidence 8884 32221 111111111223333444454443322 223344444444554444443344333211 12222233 Q ss_pred EEEEE-EEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 207 ITADE-HYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 207 i~~~~-~y~~~~~~~~~vv~l~~~~~ 231 (231) +.... ..|+-+..|.+++.++-=-| T Consensus 303 v~~~~r~~Gv~i~~P~ai~~~dGI~~ 328 (329) T protein:vir:79 303 VPCTSKCTGLTIYRPLTLVLIKGLVV 328 (329) T ss_pred EceeeeEEEEEEECcceeeeeeeeee Confidence 33333 46789999999998754333 No 198 >protein:vir:5942 Length: 523 # NCBI annotation: similar to major head protein # Family: family:all:364 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835728;genbank:gi:30044131 Probab=96.74 E-value=0.00021 Score=40.80 Aligned_cols=224 Identities=15% Similarity=0.004 Sum_probs=127.1 Q ss_pred CC---------CcccCc------e--EEec--cccCCcccc-------cCCCccCccccccceeEEEeehccceeeecHH Q lcl|Aclame:pro 1 EN---------GINLAN------L--CEYP--NDIGDAADV-------AEGGEISLDKIGTTTKSVTIKKAAKGTEITDE 54 (231) Q Consensus 1 ~~---------~~~~G~------t--i~~P--~~igda~~v-------~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~ 54 (231) |- +..+|. . .++- .-++.++.. .++.+++.=.++.++.+++.|-++..=+.|=| T Consensus 222 EA~t~~sTd~at~~~Gtt~t~~~~~lyt~~~g~~t~~~~~~~~~~~~~~~~~~~~eM~FsIeK~tVtAkSRaLKAeYT~E 301 (523) T protein:vir:59 222 RLFFVTGSDFATVAGGTPSTQDLDLVYYIDARNDFEDQSTDPDYPDPGFQSLDIPEINLELRSRPVATKTRKLRAAWTPE 301 (523) T ss_pred cccccccccccccCCCcccccccccccccccccchhhccccccccccccccccccceeeEEEeEEEeeecccccccccHH Confidence 00 000010 0 0000 001222222 23445666677888999999988887777777 Q ss_pred HHHh-----cCCCHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-------------cccc--------cCHHHHHHHHH Q lcl|Aclame:pro 55 AALS-----GYGDPIGESNKQLGLSLANKVDDDLLKAAKTTSQT-------------VSTK--------ANVDGVQAALD 108 (231) Q Consensus 55 ~~~~-----~~~d~~~~~~~~~a~~ia~~vd~~~~~~l~t~~~~-------------~~~~--------~~~d~i~da~~ 108 (231) ...+ .+.|...+..+-|+..|...|+.+++..+.+.+.. .... .+|....+... T Consensus 302 LAQDLKAiH~GLDAE~ELanILStEImlEINR~ii~~~~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~e~~~ 381 (523) T protein:vir:59 302 AMQDLAAYHKGVDLENEIVTLMSQYIAREIDLEILSTIMAHARRTDNYGFWSEVVGEYYDETSGNFVAGNFYGSKQEWLA 381 (523) T ss_pred HHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhhhheeeeeccccccceeeecccccchhhhhhhhhhhHHHHH Confidence 5432 36899999999999999999999999987654311 0000 11222222222 Q ss_pred ----Hhhcc---------CCCceEEEECHHHHHHHHhhhhhhhccccccCceeeec--cceeecc-eeEEEcCCCccCce Q lcl|Aclame:pro 109 ----IFNDE---------DAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALING--TYADVLG-AQIVRSKKLAEGSA 172 (231) Q Consensus 109 ----~l~~~---------~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G--~ig~~~G-~~Vv~s~~~~~~~~ 172 (231) .+..+ -...++++|+|++++.|...+-+........ --+| ..|.+.| ++|++++..+..=. T Consensus 382 ~l~~~~~~~~n~i~~~t~~~~~~~~~~s~~v~~~l~~~~~~~~~~~~~~---~~~~~~~~g~l~~~~~vy~d~~~~~dy~ 458 (523) T protein:vir:59 382 TLMIELNKVSNRIQQKTAVAGANFLVTSPQVAALLESMPGFTPGNDNRD---GGTGIFYVGMVQGRYRLYKNIYQNQPVI 458 (523) T ss_pred HHHHHHHHHHHHHHHhcccccccEEEEchhHHHHHHhccccccCCcccc---ccccceeEEEecCceEEEecCCCCcceE Confidence 22211 1357899999999999987665543221111 1112 2355543 69999988664211 Q ss_pred EEEEEecCCceE-----EEeecCCccceec---cchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 173 LMFKIVSNSPAL-----KLVLKRGVQVETD---RDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 173 ~~~~~~~~~~A~-----~~~~k~~v~vE~~---Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) . +..+++. +++.--=+.++.. .|+..+.-.+-...|||..|.||-....+-++=. T Consensus 459 ---~-~g~k~~~~~~~~~~~y~Py~~l~~~~~~~dp~s~qp~~~~~tRY~l~v~nP~~~~~~~~~~~ 521 (523) T protein:vir:59 459 ---I-MGNQDLNTPWQTGAVYAPYVPLLFTPTIVDPVNFSYRRGLMTRYALEVVRPEFYGLLYVKLL 521 (523) T ss_pred ---E-EEecccCCcccccceecccchhhcccccccCCcccceeeeeeehhheecchhHhhhhhhhhc Confidence 1 1111111 1111111233333 3789999999999999999999987777666555 No 199 >protein:vir:78148 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:4955 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294802;genbank:gi:149882823;genbank:GeneID:5309176 Probab=96.54 E-value=3.3e-05 Score=45.14 Aligned_cols=111 Identities=16% Similarity=0.195 Sum_probs=81.5 Q ss_pred EECHHHHHHHHhhhhhhhccccccCceeeeccce-eecceeEEEcCCCccCceEEEEE-------ecCCceEEEeecC-- Q lcl|Aclame:pro 121 IVNPKDAAKIRKDANAKNIGSEVGANALINGTYA-DVLGAQIVRSKKLAEGSALMFKI-------VSNSPALKLVLKR-- 190 (231) Q Consensus 121 vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig-~~~G~~Vv~s~~~~~~~~~~~~~-------~~~~~A~~~~~k~-- 190 (231) +++-.+++++..++.....-.--..+.+.+|-+. +++|.+.+.|+++|.+.+..++. ..+-++-++...+ T Consensus 1 vvsdlqfA~~~g~~v~~~aLpRE~aNp~ltG~lpV~~~GltWl~tpnlpg~~a~vlDst~lGgmaDE~l~~Pgya~~~~~ 80 (123) T protein:vir:78 1 MLSGAQFAKLIGILVDDKALPREQANIVLTGSLPVSAYGLTWVTSRHITGTDPWLFDVEQLGGMADEKLLSPEFAPAGNT 80 (123) T ss_pred CcchhhHHHHhcchhcccccccccCCceEecCcceeeeceeeeecCCCCCCccceeehhhhccccccccCCCcccCCCCc Confidence 6666778888877654433222234667777665 49999999999999888776653 1122333444333 Q ss_pred Cccceeccchh--hcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 191 GVQVETDRDIV--TKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 191 ~v~vE~~Rd~~--~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) ++++.+.|... .+..-++++..-..-++.|.+.++|+-.|. T Consensus 81 Gvevkt~Red~~~nD~yriRaRRvTvpiv~EP~Agv~ltg~g~ 123 (123) T protein:vir:78 81 GVEASTERAHQGVKDGYLVRGRRNTVAVVTEPMAGVRLTGTGL 123 (123) T ss_pred ceeEEeeccccCCCCceEEeeeecceeEEecCccceEEeeecC Confidence 46777889888 888999999999999999999999998888 No 200 >protein:vir:107882 Length: 307 # NCBI annotation: gp34 # Family: family:all:908 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024707;genbank:gi:48696944;genbank:GeneID:2845970 Probab=96.10 E-value=0.001 Score=37.00 Aligned_cols=228 Identities=7% Similarity=0.034 Sum_probs=127.7 Q ss_pred CCCcccCceEEecc--ccCCcccccCCCccCcccccc---ceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPN--DIGDAADVAEGGEISLDKIGT---TTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSL 75 (231) Q Consensus 1 ~~~~~~G~ti~~P~--~igda~~v~EG~~i~~~~lt~---~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~i 75 (231) +-..++|.-.+|++ |.=.-+.++.+... ..+++ +.....+...+-...+.+.+...+..||...+++.+...| T Consensus 36 pv~~~~~k~~~f~~eaF~~~~t~r~~~~~~--~~v~~~~~~~~~~~~~~~~L~~~id~r~~~~~~~~~~~~av~~l~d~I 113 (307) T protein:vir:10 36 EVEKEGGKIPKFGKESFRLYKTERALRARS--NRMNPEDLGSIDIVLDEHDLEYPIDYREDQESAFPLEQAAVQTATEAI 113 (307) T ss_pred cccccccceeeECcccccchhhhcccCCCc--ceeecccccccccccccccccccCChhhcCCCCCCHHHHHHHHHHHHH Confidence 44444555555542 10001123333322 22222 2234455666666777787777788899999999999888 Q ss_pred HHHHHHHHHHHhccc-------c--c------ccccccCHHHHHHHHHHhhcc-CCCceEEEECHHHHHHHHhhhhhhhc Q lcl|Aclame:pro 76 ANKVDDDLLKAAKTT-------S--Q------TVSTKANVDGVQAALDIFNDE-DAQAYVLIVNPKDAAKIRKDANAKNI 139 (231) Q Consensus 76 a~~vd~~~~~~l~t~-------~--~------~~~~~~~~d~i~da~~~l~~~-~~~~~v~vv~p~~~~~L~k~~~~~~~ 139 (231) .+..+-.+.+.+... . + +-.+..-+.+|.++..++.+. +..++++++.++.+..|+.++..... T Consensus 114 ~l~~E~~~A~l~~~~~~y~~~~k~tLsGt~~Wsd~~sDPi~di~~~~~ai~~~~g~~Pn~~vlg~~a~~al~~hp~i~e~ 193 (307) T protein:vir:10 114 QLRREKMVADLAQNPNSYAGGNKKQLSATEKFTAAGSDPVGVIEDGKEAIRTKIGRRPNTMVIGASAYKTLKAHPQLIEK 193 (307) T ss_pred HHHHHHHHHHHhcCccccCCCceEEeccccccCCCCCCcHHHHHHHHHHHHhhhCCccceEEeCHHHHHHHhcCHHHHHH Confidence 776665444332211 1 1 111223466777777777544 57899999999999999999998876 Q ss_pred cccccCceeeeccceeeccee-EEEcCCC-----------ccCceEEEEE----ecCC-----ceEEEeec-CCccceec Q lcl|Aclame:pro 140 GSEVGANALINGTYADVLGAQ-IVRSKKL-----------AEGSALMFKI----VSNS-----PALKLVLK-RGVQVETD 197 (231) Q Consensus 140 ~~~~~~~~~~~G~ig~~~G~~-Vv~s~~~-----------~~~~~~~~~~----~~~~-----~A~~~~~k-~~v~vE~~ 197 (231) ........+.--.+-.++|+. |++.... ..+..++.-+ ..+. .++|+-.+ ++-.+... T Consensus 194 lk~~~~g~it~~~la~ll~v~~i~vg~a~~~~~~~~~~~iw~~~~vl~yv~~~~~~~~~~~~epsfGyT~~~~g~~~~d~ 273 (307) T protein:vir:10 194 IKYSMKGIVTVDLLKEIFEVENIAVGEAIYADDKDRFTDIWGANIVLAYVPLQRGGQQRTPYEPSYGYTLRKKGNPVVDT 273 (307) T ss_pred hCCccccccCHHHHHHHhCceeEEEeeeeeeccCCccceeCCCceEEEecccccCCCCCcccccccceeEEEcCCeEeec Confidence 555444444444556677764 4432211 1122221110 0011 13444332 34344445 Q ss_pred cchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 198 RDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 198 Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) |+...+...++...++--.+.-|.+-..| -.+| T Consensus 274 ~~~~~~~~~~r~~~~~~~~i~~~~~G~li-~~~~ 306 (307) T protein:vir:10 274 RIEDGKLELVRSTDIFRPYLLGADAGYLI-SGIN 306 (307) T ss_pred eecCCceeEEeccccccceeeccccccee-ccCC Confidence 66778888888888877777777654444 3344 No 201 >protein:vir:79078 Length: 307 # NCBI annotation: gp8 # Family: family:all:908 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111208;genbank:gi:134288798;genbank:GeneID:4960752 Probab=95.74 E-value=0.0015 Score=36.00 Aligned_cols=229 Identities=8% Similarity=0.058 Sum_probs=125.0 Q ss_pred CCCcccCceEEeccc-c----C-----Ccc-------cccCCCccCcccc-ccceeEEEeehccceeeecHHHHHhcCCC Q lcl|Aclame:pro 1 ENGINLANLCEYPND-I----G-----DAA-------DVAEGGEISLDKI-GTTTKSVTIKKAAKGTEITDEAALSGYGD 62 (231) Q Consensus 1 ~~~~~~G~ti~~P~~-i----g-----da~-------~v~EG~~i~~~~l-t~~~~~~tikk~g~~~~itD~~~~~~~~d 62 (231) .|+-..|+.+ ||.- + + +.+ ..+.|.....-.. ..+..+..+++.+-...+.+.+...+..| T Consensus 22 ~n~~~Iad~l-fP~vpV~~~~~k~~~f~~e~f~~~~t~ra~~~~~~~v~~~~~~~~~~~~~~~~l~~~id~r~~~~~~~~ 100 (307) T protein:vir:79 22 TNAEFIGQTL-MPVVEVEKEGGKIPKFGKESFRLYQTERALRAKSNRMNPEDIDSVDVNLDEHDLEYPIDYREDQESAFP 100 (307) T ss_pred cchhhhhhhc-CCcccccccccceeeeccccccccccccccCCCcceeeeeccccccccccccchhhcccchhcCCCCCC Confidence 3333455554 4421 0 0 112 1233332221111 12334445566565566777776777889 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcccc-------ccc--------ccccCHHHHHHHHHHhhcc-CCCceEEEECHHH Q lcl|Aclame:pro 63 PIGESNKQLGLSLANKVDDDLLKAAKTTS-------QTV--------STKANVDGVQAALDIFNDE-DAQAYVLIVNPKD 126 (231) Q Consensus 63 ~~~~~~~~~a~~ia~~vd~~~~~~l~t~~-------~~~--------~~~~~~d~i~da~~~l~~~-~~~~~v~vv~p~~ 126 (231) |...+++.+...|....+-.+.+.+.... .+. .+..-+.+|.++..++.+. +..++++++.++. T Consensus 101 ~~~~Av~~l~d~I~l~~E~~~A~l~~~~~~y~~~~k~tLsgt~~Wsd~~sDPi~di~~~~~ai~~~~g~~Pn~~vlg~~a 180 (307) T protein:vir:79 101 LEQAAVQTATDAIQLRREKMIADLSQNPSSYAAGNKKQLSATEKFTAANSDPVGVIEDGKEAIRTKIGRRPNTMVIGASA 180 (307) T ss_pred HHHHHHHHHHHHHHhHHHHHHHHHhccccccCCCceEEEccCcccCCCCCCcHHHHHHHHHHHHHhhCCccceEEeCHHH Confidence 99999999888887777665554442211 111 1223466777777777644 5789999999999 Q ss_pred HHHHHhhhhhhhccccccCceeeeccceeeccee-EEEcCCC-----------ccCceEEEEEec----C-----CceEE Q lcl|Aclame:pro 127 AAKIRKDANAKNIGSEVGANALINGTYADVLGAQ-IVRSKKL-----------AEGSALMFKIVS----N-----SPALK 185 (231) Q Consensus 127 ~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~-Vv~s~~~-----------~~~~~~~~~~~~----~-----~~A~~ 185 (231) +..|+.++.....-.......+.--.+..++|+. |.+-... ..+..++.-+.. + ..+++ T Consensus 181 ~~~l~~h~~i~~~lk~~~~g~it~~~la~l~~v~~V~vg~a~y~~~~~~~~~iw~~~~~l~y~~~~~~~~~~~~~~ps~G 260 (307) T protein:vir:79 181 YKTLKAHPQLIEKIKYSMKGIVTVDLLKEIFEVENIAVGEAIYADDKDRFTDIWGANIVLAYVPLQRGGQQRTPYEPSYG 260 (307) T ss_pred HHHHhcCHHHHHHhcCccccccCHHHHHHHhCceeEEEeeeeeecccccchhcCCCceEEEecccccCCCCCcccccccc Confidence 9999999988876554444444444556778875 5543311 111222111100 0 01233 Q ss_pred Eeec-CCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 186 LVLK-RGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 186 ~~~k-~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) +-.. ++-.+...|....++..++...+.--+++-|++-..| ..+| T Consensus 261 yt~~~~g~~~~d~~~~~~~~~~vrv~~~~~~~i~~~~~G~li-~~~v 306 (307) T protein:vir:79 261 YTLRKKGNPVVDTRIEDGKLELVRATDIFRPYLLGADAGYLI-SGIN 306 (307) T ss_pred eeEEecCceEEecccCCCceeEEeecccccceeeccccchhh-ccCC Confidence 3322 2333334455677788888888777777766644433 3334 No 202 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=95.14 E-value=0.0027 Score=34.69 Aligned_cols=223 Identities=8% Similarity=-0.020 Sum_probs=111.5 Q ss_pred CCCcccCc-eEEeccc--cCCcc--cccC-CCccCccccccceeEEEeehccceeeecHHHHHh--cCCCHHH-HHHHHH Q lcl|Aclame:pro 1 ENGINLAN-LCEYPND--IGDAA--DVAE-GGEISLDKIGTTTKSVTIKKAAKGTEITDEAALS--GYGDPIG-ESNKQL 71 (231) Q Consensus 1 ~~~~~~G~-ti~~P~~--igda~--~v~E-G~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~--~~~d~~~-~~~~~~ 71 (231) .....-++ +++++.+ .|.+. .++. .++||..+...++.+.+|..++.++.++-+++.. ..+-++. .=.+.+ T Consensus 35 ~t~~~~~~~~~~~~~~d~~G~a~~~~i~~~a~dip~vd~~~~~~~~~i~~~~~~~~y~~~El~~a~~~g~~l~~~ka~aa 114 (304) T protein:vir:52 35 DQQTAVGITEKLHYGADEHGSLDDGLITVGTSTLDQVEVGFTPTRSYIVPWAKSVTWTKPELEQGKLLGLALNTAKIMAL 114 (304) T ss_pred cCCCCcccceEEEeeeeccCcccccccCCcCCccceeecccceeEEEEEEEeeeeeecHHHHHHHHHhCCCcHHHHHHHH Confidence 33333344 6777655 68888 4454 4679999999999999999999988776554433 3333333 222333 Q ss_pred HHHHHHHHHHHHHHH---------hcccc--------ccccc----ccC----HHHHHHHHHHhhccC---CCceEEEEC Q lcl|Aclame:pro 72 GLSLANKVDDDLLKA---------AKTTS--------QTVST----KAN----VDGVQAALDIFNDED---AQAYVLIVN 123 (231) Q Consensus 72 a~~ia~~vd~~~~~~---------l~t~~--------~~~~~----~~~----~d~i~da~~~l~~~~---~~~~v~vv~ 123 (231) .+++..++|+-.+-+ |..-+ ...+. ..| +++|.+++..+-... ..+..++++ T Consensus 115 ~~a~~~~~n~v~~~Gd~~~~g~~GllN~p~v~~~~~~~~~a~~~w~~~T~~eI~~di~~~~~~i~~~s~~~~~p~tl~Lp 194 (304) T protein:vir:52 115 NKNAQQTLQKVAFLGHAKDSRLTGLLNNKSVEVYAIKGAAQNTKVQAMDFDKAVAFFKEIFLKGMEKTKRIEAPNTFAID 194 (304) T ss_pred HHHHHhhhceEEEEeeccccceEEEEeCCCcceeeecCCccCCccccCCHHHHHHHHHHHHHHHHhccCceecCceEEeC Confidence 345666665433322 11000 00010 113 455555666654322 457789999 Q ss_pred HHHHHHHHhhhhhhhccccccCceeeeccceeecceeEEE--cC---CCccCceEEEEEecCCceEEEeecCCccceecc Q lcl|Aclame:pro 124 PKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVR--SK---KLAEGSALMFKIVSNSPALKLVLKRGVQVETDR 198 (231) Q Consensus 124 p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~--s~---~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~R 198 (231) |..+..|..- .....+.....=+..|.....-.++.|.. +. .-..|+...+.+...+.-+.+-. .+...+ T Consensus 195 p~~~~~l~~~-~~~~~~~Tvl~~l~~n~~~~~g~~l~I~~v~~~~~~~g~~g~~r~vvY~~d~~~~~~~v----P~p~~~ 269 (304) T protein:vir:52 195 SLDLAHLALV-QRANTDTTALEFLTKHLSAAAGRQVAIKALPSNYGTRVTDGKTRAMVYVNSKEHVIFDV----PMSPTV 269 (304) T ss_pred HHHHHHHhhc-cCCCCCchHHHHHHHhcccccCCcceEEEecccccccCCCCceEEEEEecChhheEEec----Cccccc Confidence 9999888531 11111111111111122111111122222 22 22234444555544444444422 222222 Q ss_pred chhhccc----EE-EEEEEEEEEEEcCCcEEEEEe Q lcl|Aclame:pro 199 DIVTKTT----VI-TADEHYAAYLYDLTKVVNITF 228 (231) Q Consensus 199 d~~~~~~----~i-~~~~~y~~~~~~~~~vv~l~~ 228 (231) .+-...+ .+ .....-|+-+..|.+++.+.. T Consensus 270 l~~q~~~~~~~~vp~~~r~gGv~v~~P~a~~y~D~ 304 (304) T protein:vir:52 270 LDAQPKGLLAFESGLRMAFGGVTFMEPDSALYVDY 304 (304) T ss_pred cchhhcCCceEEecceeeeeeEEEEccceeeeecC Confidence 2222222 12 333356888999999999999 No 203 >protein:vir:103370 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:11266 # MgeID: mge:1621 # MgeName: PaP2 # Cross-refs: genbank:acc:YP_024741;genbank:gi:48697083;genbank:GeneID:2846038 Probab=94.89 E-value=0.0032 Score=34.23 Aligned_cols=223 Identities=15% Similarity=0.095 Sum_probs=109.9 Q ss_pred CCC---------------cccCceEEeccccCC--cc-------------cccCCCccCccccccceeEEEeehc-ccee Q lcl|Aclame:pro 1 ENG---------------INLANLCEYPNDIGD--AA-------------DVAEGGEISLDKIGTTTKSVTIKKA-AKGT 49 (231) Q Consensus 1 ~~~---------------~~~G~ti~~P~~igd--a~-------------~v~EG~~i~~~~lt~~~~~~tikk~-g~~~ 49 (231) |+. + .|+++++-+--|+ ++ ...||.+.+...-......-.+.|+ .+.+ T Consensus 89 ~~~l~~~~~~~Evirv~sV-ng~~lTV~Rg~~~t~aaaia~n~~~~~Ig~~~eEGsd~~ta~~~k~~~vsNvtQIF~~av 167 (418) T protein:vir:10 89 KGMIFYNEATGENMRLELV-NGLNLTVKRQTGRISAAIIAANTKLIVIGTAFEEGSQRPTARSIQPVYVPNFTQIFRNAW 167 (418) T ss_pred cccEEEEccCCeEEEEEEE-eCCEEEEEEecCCeeEEEEecCceEEEeccccccccccCCcceecceeccchhhhhhhhh Confidence 222 3 3788777542121 11 2467777655432222222244453 4577 Q ss_pred eecHHHHHh----cCCCHHHHHHHHHHHHHHHHHHHHHHHHhcc------c----------------cc---cc--cccc Q lcl|Aclame:pro 50 EITDEAALS----GYGDPIGESNKQLGLSLANKVDDDLLKAAKT------T----------------SQ---TV--STKA 98 (231) Q Consensus 50 ~itD~~~~~----~~~d~~~~~~~~~a~~ia~~vd~~~~~~l~t------~----------------~~---~~--~~~~ 98 (231) ++|+-+... +..|+.+.-.+.. +..+..+++.++..-.. . ++ ++ .+++ T Consensus 168 svSgTaqAs~~q~Gvsn~~ese~drk-~~~av~iEkalI~G~~~~~~~~~g~~R~m~GIl~~vr~~~~gnVv~a~~~t~~ 246 (418) T protein:vir:10 168 ALTDTARASYAEAGYSNITESRRDCM-DFHATEQETAIFFGQAFMGTYNGQPLHTTQGIVDAVRQYAPDNVNAMPNPTAV 246 (418) T ss_pred hhhhhhhhccccccCchHHHHHHHHH-HHHHHHHHHHHhcccccCCCcCCcchhhHHHHHHHHhhhcccceeccCCCCcc Confidence 888865542 4455554432222 22234566667655311 0 00 11 1356 Q ss_pred CHHHHHHHHHHhhc----cCC----CceEEEECHHHHHHHHhhhhhh-hccccccCceeeeccceeecceeEE------- Q lcl|Aclame:pro 99 NVDGVQAALDIFND----EDA----QAYVLIVNPKDAAKIRKDANAK-NIGSEVGANALINGTYADVLGAQIV------- 162 (231) Q Consensus 99 ~~d~i~da~~~l~~----~~~----~~~v~vv~p~~~~~L~k~~~~~-~~~~~~~~~~~~~G~ig~~~G~~Vv------- 162 (231) ++|.+.+++...-+ .+. ..++++|++++...+-+.-... ....+..-+.+.... .+....|+ T Consensus 247 s~d~l~~a~~~af~~g~~~G~~~q~~~f~~~V~~~~k~~I~k~~~~I~~~~~e~~~G~vv~~~--~~~~G~I~L~~~p~~ 324 (418) T protein:vir:10 247 TYDDVVDATIDAFKWSVNVGDNTQRVMFCDTVGMRTMQDIGRFFGEVTVTQRETSYGMVFTEW--KFFKGRLILKEHPLF 324 (418) T ss_pred CHHHHHHHHHHHhhccCCCcccccceeEEEEeChHHHHHhhhhhhheeecccceeeeEEEEEE--EcceEEEEeeccccc Confidence 89999998876422 111 2377999999887776543211 111122222222221 12112222 Q ss_pred EcCCCccCceEEEEEecCCceEEEeec--CCccc-------------eeccchhhcccEEEEEE--EEEEEEEcCCcEEE Q lcl|Aclame:pro 163 RSKKLAEGSALMFKIVSNSPALKLVLK--RGVQV-------------ETDRDIVTKTTVITADE--HYAAYLYDLTKVVN 225 (231) Q Consensus 163 ~s~~~~~~~~~~~~~~~~~~A~~~~~k--~~v~v-------------E~~Rd~~~~~~~i~~~~--~y~~~~~~~~~vv~ 225 (231) ..=+||+++...++ +.++.+-.- |.... +++..-...-|...+.. -|..+++||.++++ T Consensus 325 ~~~~lp~g~mlVvD----~~~vkL~~L~~R~~~~E~l~k~G~~~~~~~~~~~~~~~~D~~kG~iv~E~tLe~~N~~a~av 400 (418) T protein:vir:10 325 SAIGISPGFAVVVD----VPAVKLAYMDGRNAKVENYGQGGGENKSGATDYSYGHGVDAQGGSLTSEWALELLNPQGCAV 400 (418) T ss_pred ccccCCCceEEEEc----cccceEEEeccccccchhcccCCCcccccccccccccccccccceEEEEeeeeeecccceEE Confidence 22269999877665 333333222 33332 33333333345554444 39999999999999 Q ss_pred EEeccC Q lcl|Aclame:pro 226 ITFTGV 231 (231) Q Consensus 226 l~~~~~ 231 (231) ++-=+- T Consensus 401 itgl~~ 406 (418) T protein:vir:10 401 ITGLQK 406 (418) T ss_pred eeccce Confidence 864333 No 204 >protein:vir:93858 Length: 400 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764266;genbank:gi:115315579;genbank:GeneID:5141552 Probab=94.77 E-value=0.0025 Score=34.80 Aligned_cols=221 Identities=13% Similarity=0.103 Sum_probs=130.6 Q ss_pred CCCcccCce---------------------E-------Eeccc-cCC-------cccccCCCccCccccccceeEEEeeh Q lcl|Aclame:pro 1 ENGINLANL---------------------C-------EYPND-IGD-------AADVAEGGEISLDKIGTTTKSVTIKK 44 (231) Q Consensus 1 ~~~~~~G~t---------------------i-------~~P~~-igd-------a~~v~EG~~i~~~~lt~~~~~~tikk 44 (231) |+|+.+=|+ + ++|.+ +++ +..+.-|++-..+.+++...+++.+- T Consensus 119 E~gvt~td~n~iLP~~il~aIq~al~~~~~~~~f~~v~n~p~l~V~~~~dt~~qa~gHk~G~~K~eq~~tl~~rtL~P~~ 198 (400) T protein:vir:93 119 ENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDSANEAQVHKDGQTKTEQAATLTIDTLEPVM 198 (400) T ss_pred hcccccCCchhhcchHHHHHHHHhhhccCCcccceeeecCCceeeecchhhhcccceeccCCcccceeeeeeeeccCHHH Confidence 555544333 1 12222 221 12255666677777777777777665 Q ss_pred ccceeeecHHHHHh---cCCCHHHHHHHHHHHHHHHH-HHHHHHHHhccc-------------------ccccccccCHH Q lcl|Aclame:pro 45 AAKGTEITDEAALS---GYGDPIGESNKQLGLSLANK-VDDDLLKAAKTT-------------------SQTVSTKANVD 101 (231) Q Consensus 45 ~g~~~~itD~~~~~---~~~d~~~~~~~~~a~~ia~~-vd~~~~~~l~t~-------------------~~~~~~~~~~d 101 (231) .-+..++ ++.... +++.++...++++...+-.+ ++..++-+-.+. ...++..+.+. T Consensus 199 VYk~~~l-a~~~~~~~~tygaL~nYVm~EL~q~vI~k~Ve~Aii~GdG~Ngf~~~dk~t~Ik~I~~dt~kt~~a~~~~~q 277 (400) T protein:vir:93 199 VYKLQSL-AERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFA 277 (400) T ss_pred HHHHhhh-hhhhhhccccHHHHHHHHHHHHHHHHHHHHhhhheeecccccccCCCcchhhhhhhhhhhhhhhhcCCccHH Confidence 4444444 333333 24555788888888877754 677666441111 11123344444 Q ss_pred HHHH-HHHHhhccCCCceEEEECHHHHHHHHhhhhhhhccccccCceeeeccceeecce-eEEEcCCCccCceEEEEEec Q lcl|Aclame:pro 102 GVQA-ALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGA-QIVRSKKLAEGSALMFKIVS 179 (231) Q Consensus 102 ~i~d-a~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~-~Vv~s~~~~~~~~~~~~~~~ 179 (231) ++.. +++-...-..+...+|++|+.++.|++..+..... ....-..+-.|.+-+|+ ++++...+|..+.... + . T Consensus 278 dl~E~~~d~~~~~aad~~~Iv~s~d~~A~L~~lk~a~~~a--~f~~~n~d~~IA~~fGv~~Lv~~Tr~~~~kp~V~-V-D 353 (400) T protein:vir:93 278 DAIEEAVDFVRPTAGRRYLIVKAEDRKALLDELRQATANA--NVRIKNDDTEIASEVGVDEIIVYTGSKALKPTVL-V-D 353 (400) T ss_pred HHHHHHHhhhhhccCCceeEEeccchHHHHHHhcCCccee--eeeeccccchhhhhcccceeeeeccCCCCCceee-e-e Confidence 4332 23322223356677889999999998876543321 11111223356677887 6777888877664321 1 1 Q ss_pred CCceEEEeecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEec Q lcl|Aclame:pro 180 NSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFT 229 (231) Q Consensus 180 ~~~A~~~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~ 229 (231) ...++ .-++...-.+|+...-+..|-...+.+.++--|++.+.+++. T Consensus 354 ek~~i---~~~~~~t~~sf~~~tNs~~ilvetlv~Gsi~~~N~~ay~~v~ 400 (400) T protein:vir:93 354 QKYHI---DMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred hhhhc---cccCceeccceeeeeccceEEeeeeeccceecccceeeEeeC Confidence 12222 335666667889999999999999999999999999999999 No 205 >protein:vir:103181 Length: 457 # NCBI annotation: gp135 # Family: family:all:364 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717802;genbank:gi:113200639;genbank:GeneID:4239190 Probab=94.12 E-value=0.0053 Score=33.05 Aligned_cols=229 Identities=12% Similarity=0.048 Sum_probs=118.1 Q ss_pred CCCcccCc-------eEEeccc-----cCCcccccCCC---ccCccccccceeEEEeehccceeeecHHHHHh----cCC Q lcl|Aclame:pro 1 ENGINLAN-------LCEYPND-----IGDAADVAEGG---EISLDKIGTTTKSVTIKKAAKGTEITDEAALS----GYG 61 (231) Q Consensus 1 ~~~~~~G~-------ti~~P~~-----igda~~v~EG~---~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~----~~~ 61 (231) -.+++.+. +.+..++ +..+|.+++++ .++.-.++.++.+++.|-++..-+.|=|...+ -+. T Consensus 152 ~~gt~~~~~~~~~~~~~~~~~~~~gmsTA~aE~lgd~~~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAiHGL 231 (457) T protein:vir:10 152 AEGTNPALLNDSPAGTYEQADDATGMSTATVEALDDSTANTAFREMGFSIEKVTVTARARALKAEYSIEMAQDLKAIHGL 231 (457) T ss_pred cccccccccCccccccccccccccchhhhhhhccCCCCCccchhhheeEEEEEEEeeeccceeccccHHHHHHHHHhcCC Confidence 00000000 0111111 23445555443 35555667789999999888877777775433 368 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-------------ccccc----CHHHHHHHHHHh---------hccCC Q lcl|Aclame:pro 62 DPIGESNKQLGLSLANKVDDDLLKAAKTTSQT-------------VSTKA----NVDGVQAALDIF---------NDEDA 115 (231) Q Consensus 62 d~~~~~~~~~a~~ia~~vd~~~~~~l~t~~~~-------------~~~~~----~~d~i~da~~~l---------~~~~~ 115 (231) |.-++..+-|+..|...|+.+++..+.+.+.. ..... ..+.....+-.+ ..... T Consensus 232 DAEtELaNILStEImlEINReii~~l~~~a~~~~~~~~~~~gv~dl~~~~~g~~~~e~~k~L~~~i~~ean~i~~~T~rg 311 (457) T protein:vir:10 232 DAEQELANILSTEILAEINREVVRTIYTNAVAGAQNNTATAGVFDLDVDSNGRWSVEKFKGLLFQIERDANAIGHQTRRG 311 (457) T ss_pred ChhHHHHHHHHHHHHHHhhHHHHHhHhhhheeeeccccccceeeeeeccccchhhHHHHHHHHHHHHHHHHHHHHhhccc Confidence 99999999999999999999999987653311 01111 111111111111 11235 Q ss_pred CceEEEECHHHHHHHHhhhhhhhccc---c-ccC--ceeeeccceeec-ceeEEEcCCCccCceEEEEEecCCc-----e Q lcl|Aclame:pro 116 QAYVLIVNPKDAAKIRKDANAKNIGS---E-VGA--NALINGTYADVL-GAQIVRSKKLAEGSALMFKIVSNSP-----A 183 (231) Q Consensus 116 ~~~v~vv~p~~~~~L~k~~~~~~~~~---~-~~~--~~~~~G~ig~~~-G~~Vv~s~~~~~~~~~~~~~~~~~~-----A 183 (231) ..++++|+|.+++.|...--...... . .+. +-..+...|.+. |++|+++.-...+...-+.++..+| + T Consensus 312 ~gn~~i~S~~Va~~L~~sg~l~~~p~~~~~~~~~~~d~~~~~~~G~l~~r~~vy~D~Ya~~ns~~dy~~vG~KG~~~~~~ 391 (457) T protein:vir:10 312 KGNILICSADVVSALGMAGVLDYTPALNGNNGLAGVDDTSSTLVGTLNGRIKVYVDPYSANVADKHFYVAGYKGTSPYDA 391 (457) T ss_pred cceEEEEchhHHHHHhhcccccccchhhccccccccccccceeEEEecCCeEEEEecccccCCccceEEEEEeCCcceec Confidence 78899999999998876211111100 0 000 112233456664 4799998544322111111111111 1 Q ss_pred EEEeecCCccceecc--chhhcccEEEEEEEEEEEEEcCCcEEEEEec-cC Q lcl|Aclame:pro 184 LKLVLKRGVQVETDR--DIVTKTTVITADEHYAAYLYDLTKVVNITFT-GV 231 (231) Q Consensus 184 ~~~~~k~~v~vE~~R--d~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~-~~ 231 (231) ..++. -=+..+..| |+..+.-.+-...|||. ..||-..-.=... +. T Consensus 392 glfy~-PYv~l~~~~~~dp~sfqP~~g~~tRY~l-~~NP~~~~~~~~~~~~ 440 (457) T protein:vir:10 392 GLFYC-PYVPLQQVRAINPDTFQPKIGFKTRYGM-VSNPFAGGLTQGSGAL 440 (457) T ss_pred ceeec-ccccccccCccCCccccceeeeeeeeee-eecccccccccccccc Confidence 11110 002222222 88999999999999999 7788532111000 00 No 206 >protein:vir:96490 Length: 348 # NCBI annotation: head protein # Family: family:all:1083 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238492;genbank:gi:66391768;genbank:GeneID:5176912 Probab=93.77 E-value=0.0064 Score=32.60 Aligned_cols=227 Identities=12% Similarity=0.109 Sum_probs=98.5 Q ss_pred CCCcccCceEEecc-ccCC---cccccCCCccCcc-ccccceeEEEeehccceeeecHHHHH------hc-CCCHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPN-DIGD---AADVAEGGEISLD-KIGTTTKSVTIKKAAKGTEITDEAAL------SG-YGDPIGESN 68 (231) Q Consensus 1 ~~~~~~G~ti~~P~-~igd---a~~v~EG~~i~~~-~lt~~~~~~tikk~g~~~~itD~~~~------~~-~~d~~~~~~ 68 (231) .+--..+..+.+-+ ..+. +..+.++.+-+.. .=.......++-..+....++..+.. .+ .......+. T Consensus 34 p~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~r~~~~~~~~~~p~i~~~~~i~~~d~~~l~~~~~~~~~~~~~~~~ 113 (348) T protein:vir:96 34 PARKQLGTKLSYIKGASGQSVALKAAAFDTNVTIRDRVSAEIHDEQMPFFKEALLVKENDRQQLNLVKDTGNEALINTIV 113 (348) T ss_pred CCccccceeEEEEeecCCceeEeeeecCCCCcceecccceeeeeeecCccccccccCHHHHHHHHhhhccCCchHHHHHH Confidence 11001111111111 0111 2234444433322 12233444444444444444433321 11 112233334 Q ss_pred HHHHH-------HHHHHHHHHHHHHhcccc-------------------c--------ccccccCHHHHHHHHHHhhccC Q lcl|Aclame:pro 69 KQLGL-------SLANKVDDDLLKAAKTTS-------------------Q--------TVSTKANVDGVQAALDIFNDED 114 (231) Q Consensus 69 ~~~a~-------~ia~~vd~~~~~~l~t~~-------------------~--------~~~~~~~~d~i~da~~~l~~~~ 114 (231) +++++ .+.+.++--+..+|.+.. . +.++..-+.+|.++...+.+.+ T Consensus 114 ~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~~~~~~~~~vdfg~~~~~~~t~~~~W~~~~adp~~di~~~~~~~~~~G 193 (348) T protein:vir:96 114 AGIFNDDVTLINGARARLEAMRMQVLATGKIAFTSDGVNKDIDYGVKADHKKQVSKSWAEPGATPLADLEDAIETARELG 193 (348) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEeecCCeeEEEeccCCcccceeeccccCCCCCCHHHHHHHHHHHHHhcC Confidence 44333 333333333344443211 0 1112233677777777777777 Q ss_pred CCceEEEECHHHHHHHHhhhhhhhcccccc--Cce----eeeccceeecceeEEEcCC------------CccCceEEEE Q lcl|Aclame:pro 115 AQAYVLIVNPKDAAKIRKDANAKNIGSEVG--ANA----LINGTYADVLGAQIVRSKK------------LAEGSALMFK 176 (231) Q Consensus 115 ~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~--~~~----~~~G~ig~~~G~~Vv~s~~------------~~~~~~~~~~ 176 (231) ..+..++|+++.+..|++++.+........ ... ..+..++.+.|++|++=+. +|++..+++. T Consensus 194 ~~~~~~i~~~~~~~~l~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~y~~~y~d~~G~~~~~~p~~~v~l~~ 273 (348) T protein:vir:96 194 LNPERAIMNAKTFGLIRKAASTVKAIKPLAGDGSSVTKAELQNYVADNYGVEIVLENGTYRNEKGEVSKFFPDGHLTLIP 273 (348) T ss_pred CcccEEEeCHHHHHHHhcCHHHHHHHhccCCccccccHHHHHHHHhhhcCceEEEEccEEEecCCcEeccccCCeEEEEc Confidence 888999999999999999988876432111 111 1123345667888775431 2333332221 Q ss_pred EecCCceEEEeecCC-----------------------ccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEec-cC Q lcl|Aclame:pro 177 IVSNSPALKLVLKRG-----------------------VQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFT-GV 231 (231) Q Consensus 177 ~~~~~~A~~~~~k~~-----------------------v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~-~~ 231 (231) .+.++...-.. +-+-++.+.+-....+.+..+--..+.+|++++++++- || T Consensus 274 ----~~~~G~~~yg~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~dP~~~~~~~~s~plPv~~~~~~~~~a~Vl~~~ 348 (348) T protein:vir:96 274 ----NGPLGNTVFGTTPEESDLFADNTVNADVEIVDSGIAVTTTKTTDPVNVQTKVSMVALPSFERLGDVYMLTVIPGV 348 (348) T ss_pred ----CCCceeEEeccChhhhhhhhcccccccceecCCeeEEEeeecCCCceEEEEEeeeeeccccCCCcEEEEEEecCC Confidence 22222111110 00111111111122233444455666788899888865 45 No 207 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=93.34 E-value=0.0074 Score=32.28 Aligned_cols=217 Identities=13% Similarity=0.080 Sum_probs=115.5 Q ss_pred CCCcccC----ceEEeccc--cCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHh---cCCCHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLA----NLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALS---GYGDPIGESNKQL 71 (231) Q Consensus 1 ~~~~~~G----~ti~~P~~--igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~---~~~d~~~~~~~~~ 71 (231) + ++.| .+++||.+ .|-+..++.+.++|..+...+..+.+++.++-++.++.++... .+.+..++=.... T Consensus 78 v--~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~vd~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA 155 (336) T protein:vir:78 78 E--SKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSS 155 (336) T ss_pred c--ccCCCccccEEEEeeeecceeeEEeecccCCCeeecceeeEEEEEEEEEeeeeecHHHHHHHHHhCCCcHHHHHHHH Confidence 2 1122 46788754 7999999999999999999999999999999999999766544 4667777666677 Q ss_pred HHHHHHHHHHHHHHH--------hc----c-ccccccc--------ccCHHHHHHHHHHhhccC------CCceEEEECH Q lcl|Aclame:pro 72 GLSLANKVDDDLLKA--------AK----T-TSQTVST--------KANVDGVQAALDIFNDED------AQAYVLIVNP 124 (231) Q Consensus 72 a~~ia~~vd~~~~~~--------l~----t-~~~~~~~--------~~~~d~i~da~~~l~~~~------~~~~v~vv~p 124 (231) ++++..++++-.+-. +- . +..+.++ .--+++|..++..+.... ..+..++++| T Consensus 156 ~~ale~~~N~~~~~Gd~~~~~~GllN~P~l~a~~t~~~~~w~~~T~~~I~~Di~~~~~~l~~qt~g~~~~~~~~tL~Lp~ 235 (336) T protein:vir:78 156 ALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPP 235 (336) T ss_pred HHHHHHhhCeEEEEeccccceEEEEeCCCCCcccccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEech Confidence 777777776533211 10 0 1111111 113566666666663322 2356899999 Q ss_pred HHHHHHHhhhhhhhccccccCceeeeccceeecceeEEEcCCCcc-CceEEEEE---ecCCceEEEeecCCccc-eeccc Q lcl|Aclame:pro 125 KDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAE-GSALMFKI---VSNSPALKLVLKRGVQV-ETDRD 199 (231) Q Consensus 125 ~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~-~~~~~~~~---~~~~~A~~~~~k~~v~v-E~~Rd 199 (231) ..+..|.+-..+ +. ...+.+.. .+=+++|+..+.+.. +.....-+ ..++.-+.+..-...+. ...+. T Consensus 236 ~~~~~L~~~n~~---g~-tv~~~lk~----n~Pnl~i~t~pel~~Agg~~~~~~~~~~~~~~t~~~~~p~~f~~lpvq~~ 307 (336) T protein:vir:78 236 TAMSDLSKTNQY---GL-SAAAKLKE----IFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERY 307 (336) T ss_pred HHHHhccCCCcc---Cc-cHHHHHHH----hcCccEEEEcccccccCcceEEEEEeeccCCcceeeecchhhhccceeec Confidence 999888642111 00 01111111 122355655554432 22111111 11111111111111100 01111 Q ss_pred hhhcccEEEEE-EEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 200 IVTKTTVITAD-EHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 200 ~~~~~~~i~~~-~~y~~~~~~~~~vv~l~~~~~ 231 (231) .....+... ...|+-++.|.+++++ .|+ T Consensus 308 --~~~~~v~~~~rt~Gv~i~~P~ai~~~--~GI 336 (336) T protein:vir:78 308 --SSYFRQKKSAGTWGAVIFRPFAVAQM--IGV 336 (336) T ss_pred --CceeEeccccceeeeeeeccchheee--ccC Confidence 122222222 2478889999998886 455 No 208 >protein:vir:5670 Length: 514 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899609;genbank:gi:34419596;genbank:GeneID:2546039 Probab=92.96 E-value=0.0093 Score=31.72 Aligned_cols=224 Identities=12% Similarity=0.064 Sum_probs=114.1 Q ss_pred CCCcccCceEEecc----------------------------c-c--CCcccccC---------CCccCccccccceeEE Q lcl|Aclame:pro 1 ENGINLANLCEYPN----------------------------D-I--GDAADVAE---------GGEISLDKIGTTTKSV 40 (231) Q Consensus 1 ~~~~~~G~ti~~P~----------------------------~-i--gda~~v~E---------G~~i~~~~lt~~~~~~ 40 (231) .....+|+...... | + |-.+..+| +.+++.-.++.++.++ T Consensus 176 ~~t~~~gd~~~~~~~~~~~~~~~~~~~~~~t~~~~~~a~~~~y~~~~Gm~Ta~aEal~~lggs~~~~f~EMaFsIdK~tV 255 (514) T protein:vir:56 176 EVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVV 255 (514) T ss_pred cccccccccccccccccccccccccccccccccccccccchhhhhhhhhhhhhhhhcccCCCCcccccceeeeEEEEEEE Confidence 22222222221100 0 1 11122222 3346666788889999 Q ss_pred EeehccceeeecHHHHHh----cCCCHHHHHHHHHHHHHHHHHHHHHHHHhcccccc--------cc--------ccc-- Q lcl|Aclame:pro 41 TIKKAAKGTEITDEAALS----GYGDPIGESNKQLGLSLANKVDDDLLKAAKTTSQT--------VS--------TKA-- 98 (231) Q Consensus 41 tikk~g~~~~itD~~~~~----~~~d~~~~~~~~~a~~ia~~vd~~~~~~l~t~~~~--------~~--------~~~-- 98 (231) +.|-++..-+.|=|...+ -+.|.-++..+-|+..|...|+.+++..+...... .+ ... T Consensus 256 tAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILSTEImlEINReii~~l~~~atv~~~~~~~~~~~~G~~d~~~~~d~ 335 (514) T protein:vir:56 256 EAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDV 335 (514) T ss_pred eeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHHhheeehhccccccccccccccccccccc Confidence 999888777777765432 36899999999999999999999998776432110 01 001 Q ss_pred -----CHHHHHHHHHHhhcc---------CCCceEEEECHHHHHHHHhhh--------hhhhccccc-cCceeeecccee Q lcl|Aclame:pro 99 -----NVDGVQAALDIFNDE---------DAQAYVLIVNPKDAAKIRKDA--------NAKNIGSEV-GANALINGTYAD 155 (231) Q Consensus 99 -----~~d~i~da~~~l~~~---------~~~~~v~vv~p~~~~~L~k~~--------~~~~~~~~~-~~~~~~~G~ig~ 155 (231) .++.+......+..+ -...++++|+|.+++.|..-- .+....... ....+.- |. T Consensus 336 ~~~~~~~e~~~~l~~~i~~~an~i~~~T~rg~gn~~i~S~~Va~~L~~sg~l~~~~~~g~~~~~~~~d~~~~~~a---G~ 412 (514) T protein:vir:56 336 KGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFA---GV 412 (514) T ss_pred ccchHHHHHHHHHHHHHHHHHHHHHhhcccccccEEEEchhHHHHHHhhhhhccccccCccccccccccCcceEE---EE Confidence 111112122222211 146789999999999986411 111000000 0001111 33 Q ss_pred e-cceeEEEcCCCccCceEEEEEecCCceE--EEeecCC-ccceec--cchhhcccEEEEEEEEEEEEEcCCc-----E- Q lcl|Aclame:pro 156 V-LGAQIVRSKKLAEGSALMFKIVSNSPAL--KLVLKRG-VQVETD--RDIVTKTTVITADEHYAAYLYDLTK-----V- 223 (231) Q Consensus 156 ~-~G~~Vv~s~~~~~~~~~~~~~~~~~~A~--~~~~k~~-v~vE~~--Rd~~~~~~~i~~~~~y~~~~~~~~~-----v- 223 (231) + -|++|++++..+.. .+.+=+. +...+ +++. .+ +..+.. -|+..+.-.+-...|||..+ ||=. . T Consensus 413 l~~~~~vy~D~y~~~d-y~~vG~K-G~~~~~~glfy-aPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NPy~~~~~~~~ 488 (514) T protein:vir:56 413 LGGRFKVYIDQYAVND-YFTVGFK-GSTEMDAGVFY-SPYVPLTPLRGSDSKNFQPVIGFKTRYGVQV-NPFADPTASAT 488 (514) T ss_pred ecCceEEEecCCCCcc-eEEEEEe-cCcceecceee-ccccccccccccCCccccceeeeeeeeceee-CCCCCcccccc Confidence 3 35799999887742 1111000 11000 1111 11 222223 38888888888889998865 5521 0 Q ss_pred ------------------EEEEeccC Q lcl|Aclame:pro 224 ------------------VNITFTGV 231 (231) Q Consensus 224 ------------------v~l~~~~~ 231 (231) -++.+++. T Consensus 489 ~~~~~~~~~a~~~~n~y~r~v~v~~l 514 (514) T protein:vir:56 489 KVGNGAPVAASMGKNAYFRRVFVKGL 514 (514) T ss_pred ccCCcchhhhcccccceeeeEEEecC Confidence 12223333 No 209 >protein:vir:2736 Length: 348 # NCBI annotation: putative structural protein # Family: family:all:1083 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695109;genbank:gi:23455878;genbank:GeneID:955608 Probab=92.52 E-value=0.011 Score=31.31 Aligned_cols=225 Identities=12% Similarity=0.126 Sum_probs=97.7 Q ss_pred CCCcccCceEEecc-------c---c---CC---cccccCCCccCccc-cccceeEEEeehccceeeecHHHHH-----h Q lcl|Aclame:pro 1 ENGINLANLCEYPN-------D---I---GD---AADVAEGGEISLDK-IGTTTKSVTIKKAAKGTEITDEAAL-----S 58 (231) Q Consensus 1 ~~~~~~G~ti~~P~-------~---i---gd---a~~v~EG~~i~~~~-lt~~~~~~tikk~g~~~~itD~~~~-----~ 58 (231) -...--++++ ||. | . +. +..++.+.+-+..+ =.....+.++-..+....++..+.. . T Consensus 23 ~~~~~l~~~~-Fp~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~r~~~~~~~~~~p~i~~~~~i~~~d~~~~~~~~ 101 (348) T protein:vir:27 23 NVSSTLGESI-FPARKQLGTKLSYIKGASGQSVALKAAAFDTNVTIRDRVSAEMHDEQMPFFKEAMLVKENDRQQLNLVK 101 (348) T ss_pred hhhhhhHhhc-CCCccccceeEEEEeeccCceeEeeeecCCCCcceecccceeeeeeecCccccccccCHHHHHHHHHhh Confidence 0111111111 110 0 1 11 12334443332221 1223334444444444455444322 1 Q ss_pred cCCCH--HHHHHHHHH-------HHHHHHHHHHHHHHhcccc---------------------------cccccccCHHH Q lcl|Aclame:pro 59 GYGDP--IGESNKQLG-------LSLANKVDDDLLKAAKTTS---------------------------QTVSTKANVDG 102 (231) Q Consensus 59 ~~~d~--~~~~~~~~a-------~~ia~~vd~~~~~~l~t~~---------------------------~~~~~~~~~d~ 102 (231) +..++ ...+.++++ ..+.+.++--+..+|.+.. .+.++..-+++ T Consensus 102 ~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~~al~~Gki~i~~~~~~~~vdfg~~~~~~~t~~~~W~~~~adp~~d 181 (348) T protein:vir:27 102 DSGNAVLVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFTSDGVNKDIDYGVKPDHKKQVSKSWAEPGATPLAD 181 (348) T ss_pred ccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEEecCCeeEEEeecCCcccceeeeeccCCCCCCHHHH Confidence 11111 222223322 3333333333344443211 01112234677 Q ss_pred HHHHHHHhhccCCCceEEEECHHHHHHHHhhhhhhhcccccc--Ccee----eeccceeecceeEEEcCC---------- Q lcl|Aclame:pro 103 VQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVG--ANAL----INGTYADVLGAQIVRSKK---------- 166 (231) Q Consensus 103 i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~--~~~~----~~G~ig~~~G~~Vv~s~~---------- 166 (231) |.+....+.+.+..+..++|+++.+..|++++.+........ ...+ .+-.++++.|++|++=+. T Consensus 182 i~~~~~~~~~~G~~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~~~i~~~~~~~~~~~~~g~~i~~yd~~y~d~~G~~~ 261 (348) T protein:vir:27 182 LEDAIETARELGLNPERAVMNAKTFGLIRKAASTVKVIKPLAGDGSAVTKAELENYIADNFGVSIVLENGTYRNDKGEVS 261 (348) T ss_pred HHHHHHHHHhcCCcccEEEECHHHHHHHhcCHHHHHHhcccCccccccCHHHHHHHHHhhcCceEEEEeeEEEcCCCcCc Confidence 887777787777888999999999999999988775432111 1112 122345567787765431 Q ss_pred --CccCceEEEEEecCCceEEEeecCCccce------------------------eccchhhcccEEEEEEEEEEEEEcC Q lcl|Aclame:pro 167 --LAEGSALMFKIVSNSPALKLVLKRGVQVE------------------------TDRDIVTKTTVITADEHYAAYLYDL 220 (231) Q Consensus 167 --~~~~~~~~~~~~~~~~A~~~~~k~~v~vE------------------------~~Rd~~~~~~~i~~~~~y~~~~~~~ 220 (231) +|++..+++. .+.++...-. ...| .+...+--...+.+..+--..+.+| T Consensus 262 ~~~p~~~vvl~~----~~~~G~~~yG-~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~dP~~~~~~~~s~~lPv~~~~ 336 (348) T protein:vir:27 262 KFYPDGHLTLIP----NGPLGNTVFG-TTPEESDLFADNTVNAEVEIVDNGIAVTTTKTTDPVNVQTKVSMVALPSFERL 336 (348) T ss_pred ccccCCeEEEEc----CCcceeEEec-cCcchhhhhhccccccceeeeCCeeEEEeeecCCCceEEEEEeeeeeccccCC Confidence 2333333221 2222222111 1111 1111111122333444455666788 Q ss_pred CcEEEEEe-ccC Q lcl|Aclame:pro 221 TKVVNITF-TGV 231 (231) Q Consensus 221 ~~vv~l~~-~~~ 231 (231) ++++++++ .|| T Consensus 337 ~~~~~a~Vl~~~ 348 (348) T protein:vir:27 337 DDVYMLTVIPAV 348 (348) T ss_pred CcEEEEEEecCC Confidence 88888865 455 No 210 >protein:vir:348 Length: 321 # NCBI annotation: major virion structural protein # Family: family:all:3198 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203462;genbank:gi:15320618;genbank:GeneID:921734 Probab=91.84 E-value=0.014 Score=30.75 Aligned_cols=223 Identities=8% Similarity=-0.045 Sum_probs=125.1 Q ss_pred CCCcccCceEEec-cccCC--cccccCCCccCcc-ccccceeEEEeehccceeeecHHHHHhcC-----CCHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYP-NDIGD--AADVAEGGEISLD-KIGTTTKSVTIKKAAKGTEITDEAALSGY-----GDPIGESNKQL 71 (231) Q Consensus 1 ~~~~~~G~ti~~P-~~igd--a~~v~EG~~i~~~-~lt~~~~~~tikk~g~~~~itD~~~~~~~-----~d~~~~~~~~~ 71 (231) ..=..+|-+|..| .|-.+ ...+.-=+.+.+. .-.+..-+..++|+..++.||-+.+++.. .|++.+-++.. T Consensus 41 ~~~~~gg~~I~~~l~y~~~s~~~wy~Gyd~l~~~p~d~~~~Aef~wk~aa~~~~isg~e~l~n~g~~~~idll~~~~~~a 120 (321) T protein:vir:34 41 PRLVSGGYTILEELSFSGNSNGGWYSGYDVLPTAPQDVISSAEYALKQYAVPVVISGLEMLQNSGKEAQLDLLEARMNVA 120 (321) T ss_pred ccccCCCeeEEEEEeeccCcceeEEEeeeeeccchhhhccccccchhheeEeeEEehhHHhhccchHHHHHHHHHHHHHH Confidence 2234778888888 45333 3333322233322 23455667788999999999998887764 47888889999 Q ss_pred HHHHHHHHHHHHHHH-----------hcc------cccc------------------cccccCHHHHH----HHHHHhhc Q lcl|Aclame:pro 72 GLSLANKVDDDLLKA-----------AKT------TSQT------------------VSTKANVDGVQ----AALDIFND 112 (231) Q Consensus 72 a~~ia~~vd~~~~~~-----------l~t------~~~~------------------~~~~~~~d~i~----da~~~l~~ 112 (231) -+.++++++.++..- |+. ++++ ..+..+...|+ .+|..+.- T Consensus 121 e~t~~n~l~~~l~sdGTa~g~~~i~GL~~lv~~~p~tGtvGGIdra~~~~WRn~~~d~~~~~t~~tl~~~m~~~w~~~~R 200 (321) T protein:vir:34 121 EATMANDISAALYGDGTAFGGRAINGLDGAVPVDPTVGTYGGINRALWPFWRSQVEDMAAVATINTIQPAMTKLWSRCVR 200 (321) T ss_pred HHHHHhhhhHhhhccccccccchhhhhhhhcccCCCCceeccccccchhhhhhhhhhhhhcccHHHHHHHHHHHHHhhcc Confidence 999999999988751 110 1111 01112333333 44444554 Q ss_pred cCCCceEEEECHHHHHHHHhhhhhhhccccccCceeeeccce-eecceeEEEcC----CCccCceEEEEEecCCceEEEe Q lcl|Aclame:pro 113 EDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYA-DVLGAQIVRSK----KLAEGSALMFKIVSNSPALKLV 187 (231) Q Consensus 113 ~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig-~~~G~~Vv~s~----~~~~~~~~~~~~~~~~~A~~~~ 187 (231) ..+.|+++++..+.|...++......+.+ ..+-..-|..+ .|.|+.|+.+. .+|++++|.+. ...+.+. T Consensus 201 g~~~PDlii~~~~~y~~y~~s~q~~qR~~--~~~~a~~Gf~~Lky~~~div~D~~~g~~~pan~~yfiN----T~yl~~r 274 (321) T protein:vir:34 201 GADMPDLIMSGNDAWTTYSNSLQVLQRFT--SAEEANLGFRSLKFLSTDVVLDGGIGGFAGANTMYFLN----TKYLHFR 274 (321) T ss_pred CCCCccEEEechHHHHHHHHhhheeeeec--ccccccccceeeeeeeEEEEEeCCCCCCccccceeeee----cceEEEE Confidence 55789999999999999988543333322 22222223333 57899999998 68999988765 4566666 Q ss_pred ecCCccceeccchh---hcccEEEEEE--EEEEEEEcCCcEEEEEec Q lcl|Aclame:pro 188 LKRGVQVETDRDIV---TKTTVITADE--HYAAYLYDLTKVVNITFT 229 (231) Q Consensus 188 ~k~~v~vE~~Rd~~---~~~~~i~~~~--~y~~~~~~~~~vv~l~~~ 229 (231) ..++=.+--.++.. ...|+++-.. +-..-+-||..=.+|.-. T Consensus 275 ~h~~~~~~pi~p~r~~~~NqdA~~q~I~~~GnL~~sn~~~~~vL~~~ 321 (321) T protein:vir:34 275 PHKDRNMVPLSPSRRAAFNQDAEAQILAWAGNLTCSGAQFQGRLIAE 321 (321) T ss_pred EcCCCceeecCcccccccchhHHhhhhhhhheeeeecccceeEEeeC Confidence 44442222222221 1222222222 222233344333333222 No 211 >protein:vir:101039 Length: 529 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932516;genbank:gi:37651642;genbank:GeneID:2610532 Probab=91.70 E-value=0.015 Score=30.64 Aligned_cols=230 Identities=12% Similarity=0.014 Sum_probs=116.4 Q ss_pred CCCcccCceEEecc--ccCCccccc-----CCCccCccccccceeEEEeehccceeeecHHHHHh----cCCCHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPN--DIGDAADVA-----EGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALS----GYGDPIGESNK 69 (231) Q Consensus 1 ~~~~~~G~ti~~P~--~igda~~v~-----EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~----~~~d~~~~~~~ 69 (231) -.++..|.+.++.. .+..+|.+. .+.+++.=.++.++.+++.|-++..=+.|=|...+ -+.|.-++..+ T Consensus 222 ~~~~a~~~~~~~~~Gm~Ta~aEaL~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsN 301 (529) T protein:vir:10 222 NAAIGEGKLAEIAEGMATSIAELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNG 301 (529) T ss_pred ccccccccccccccccchhhhhccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHH Confidence 11112222333321 122222221 12346666788889999999888777777775433 36899999999 Q ss_pred HHHHHHHHHHHHHHHHHhcccccc-----------cccccCH-------------HHHHHHHHHhhcc--------C-CC Q lcl|Aclame:pro 70 QLGLSLANKVDDDLLKAAKTTSQT-----------VSTKANV-------------DGVQAALDIFNDE--------D-AQ 116 (231) Q Consensus 70 ~~a~~ia~~vd~~~~~~l~t~~~~-----------~~~~~~~-------------d~i~da~~~l~~~--------~-~~ 116 (231) -|+..|...|+.+++..+.+.+.. .++.+++ +........+..+ . .. T Consensus 302 ILStEImlEINReii~~l~~~a~~~k~~g~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~k~L~~~i~~~an~I~~~T~rg~ 381 (529) T protein:vir:10 302 ILANEVMLEINREVIDWINYTAQVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGA 381 (529) T ss_pred HHHHHHHHHhhHHHHHhHhhhhhhhhcccccccccccceeecccCccccccchHHHHHHHHHHHHHHHHHHHHHhhcccc Confidence 999999999999999887643310 0111111 1111111222211 1 35 Q ss_pred ceEEEECHHHHHHHHhhhhhhhc-----cccccCceeeeccceeec-ceeEEEcCCCccCceE-EEEEecCCceEEEe-e Q lcl|Aclame:pro 117 AYVLIVNPKDAAKIRKDANAKNI-----GSEVGANALINGTYADVL-GAQIVRSKKLAEGSAL-MFKIVSNSPALKLV-L 188 (231) Q Consensus 117 ~~v~vv~p~~~~~L~k~~~~~~~-----~~~~~~~~~~~G~ig~~~-G~~Vv~s~~~~~~~~~-~~~~~~~~~A~~~~-~ 188 (231) .++++|+|+++..|..---+... ......+-..+...|.+. |++|++++..+..=.. .++-.....+..++ . T Consensus 382 ~n~vi~S~~Va~~L~~~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~P 461 (529) T protein:vir:10 382 GNFIIASRNVVSALALIDTNISPAAQGMASGLNADTTKGVFAGILGGRYKVYIDQYARQDYFTMGYRGANNLDAGIYYCP 461 (529) T ss_pred ceEEEEchHHHHHHHhhhhhccccccccccccccccCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeecc Confidence 78999999999998731111000 000001111112345543 3799999887642110 01100000011111 1 Q ss_pred cCCccceeccchhhcccEEEEEEEEEEEEEcCCc--------------------------EEEEEeccC Q lcl|Aclame:pro 189 KRGVQVETDRDIVTKTTVITADEHYAAYLYDLTK--------------------------VVNITFTGV 231 (231) Q Consensus 189 k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~--------------------------vv~l~~~~~ 231 (231) --+...-.--|+..+.-.+-...|||..+ ||=. ..++.+|+. T Consensus 462 Yv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~~~Vk~l 529 (529) T protein:vir:10 462 YVALTPLRGSDPKNFQPVMGFKTRYAIGV-NPFAESRTQAPQGRITSGMPGVNSVGKNAYFRRVWVKGL 529 (529) T ss_pred ccccccccccCCCcccceeeeeeeeceee-cCccccccccccccccCCcchhhhcCccceeEEeeeccC Confidence 11111111248888999999999998865 5511 123333444 No 212 >protein:vir:104549 Length: 462 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214669;genbank:gi:61806310;genbank:GeneID:3294604 Probab=91.67 E-value=0.015 Score=30.62 Aligned_cols=229 Identities=12% Similarity=0.067 Sum_probs=117.0 Q ss_pred CCCccc-------CceEEeccc-----cCCcccccCC---CccCccccccceeEEEeehccceeeecHHHHHh----cCC Q lcl|Aclame:pro 1 ENGINL-------ANLCEYPND-----IGDAADVAEG---GEISLDKIGTTTKSVTIKKAAKGTEITDEAALS----GYG 61 (231) Q Consensus 1 ~~~~~~-------G~ti~~P~~-----igda~~v~EG---~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~----~~~ 61 (231) -.+++. ..+.++..+ +..+|.++.+ .+++.-.++.++.+++.|-++..-+.|=|...+ -+. T Consensus 157 ~~g~~~~~~~~~~~g~~~~~~~~~GM~Ta~aE~lg~~s~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAIHGL 236 (462) T protein:vir:10 157 AEGANPGLLNDSPAGTYEVTGDATGMATATAEALDDSSASTAFREMGFSIEKVTVTAKSRALKAEYSIEMAQDLKAIHGL 236 (462) T ss_pred cccccceeecCCCccceecccccccccchhccccCCccCCcchhhceeEEEEEEEeeeccceeccccHHHHHHHHHhcCC Confidence 000000 001111110 1233444433 346666778889999999887777777765432 468 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-----c--------cccc----CHHHHHHHHHHhh---------ccCC Q lcl|Aclame:pro 62 DPIGESNKQLGLSLANKVDDDLLKAAKTTSQT-----V--------STKA----NVDGVQAALDIFN---------DEDA 115 (231) Q Consensus 62 d~~~~~~~~~a~~ia~~vd~~~~~~l~t~~~~-----~--------~~~~----~~d~i~da~~~l~---------~~~~ 115 (231) |.-++..+-|+..|...|+.+++..+.+.+.. + ..+. .++.....+-.+. -... T Consensus 237 DAEtELaNILSTEImlEINReii~~l~~~a~~~k~~~~~~~Gv~dl~~~~~gr~~~e~~k~l~~qi~~ean~i~~~t~r~ 316 (462) T protein:vir:10 237 DAESELANILSTEILAEINREVVRTIYVNAVKGAIANTATDGIFDLDVDSNGRWSVEKFKGLLFQIERDSNAIGQETRRG 316 (462) T ss_pred ChhHHHHHHHHHHHHHHhhHHHHhhhhhhheeeecccccccceeeeccccchHHHHHHHHHHHHHHHHHHHHHHHHhccc Confidence 99999999999999999999999988654321 0 1111 1222222222221 1125 Q ss_pred CceEEEECHHHHHHHHhhhhhhhcc-----cccc--Cceeeeccceeec-ceeEEEcCCCccCceEEEEEecCCc----- Q lcl|Aclame:pro 116 QAYVLIVNPKDAAKIRKDANAKNIG-----SEVG--ANALINGTYADVL-GAQIVRSKKLAEGSALMFKIVSNSP----- 182 (231) Q Consensus 116 ~~~v~vv~p~~~~~L~k~~~~~~~~-----~~~~--~~~~~~G~ig~~~-G~~Vv~s~~~~~~~~~~~~~~~~~~----- 182 (231) ..++++|+|++++.|.. ..+.+.. ...+ .+-..+...|.+. |++|+++.-...+...-+.++..+| T Consensus 317 ~~n~~i~S~~Va~~La~-sG~l~~~p~~~~~~~~~~~d~~~~~~~G~l~~r~~vy~D~Y~~~ns~~dy~~vG~KG~~~~~ 395 (462) T protein:vir:10 317 KGNILICSADVASALGM-AGVLDYAPGLQGNSALTGVDDTSSTLVGTLNGRIKVYVDPYSSNVADKHFYVAGYKGTSPYD 395 (462) T ss_pred cceEEEEchhHHHHhhh-ccchhccccccccccccccccccceeEEEecCceEEEEecccCCCcccceEEEEEeCCcccc Confidence 78899999999998843 2221110 0001 0111223356664 4788888643322111111111111 Q ss_pred eEEEe-ecCCccceeccchhhcccEEEEEEEEEEEEEcCC--------cE---------EEEEeccC Q lcl|Aclame:pro 183 ALKLV-LKRGVQVETDRDIVTKTTVITADEHYAAYLYDLT--------KV---------VNITFTGV 231 (231) Q Consensus 183 A~~~~-~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~--------~v---------v~l~~~~~ 231 (231) +..++ .--+.....-.|+..+.-.+-...|||..+ ||= +- -++.+++. T Consensus 396 ~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP~t~~~~~~~~~~~~~~n~y~r~~~v~~l 461 (462) T protein:vir:10 396 AGLFYCPYVPLQQVRAINPNTFQPKIGFKTRYGMVS-NPFSGGLTQGSGALTANANKYYRRVQVANL 461 (462) T ss_pred cceeeccccccccccccCCccccceeeeeeeeeeee-cCCCCCcCCccccccccCcceeeeEEeecc Confidence 11111 111111112238888888888888998764 442 10 12223333 No 213 >protein:vir:106998 Length: 468 # NCBI annotation: major capsid protein gp23 # Family: family:all:364 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195142;genbank:gi:58532919;uniprot:Q5GQN0;genbank:GeneID:3260495 Probab=91.55 E-value=0.015 Score=30.54 Aligned_cols=228 Identities=15% Similarity=0.098 Sum_probs=117.7 Q ss_pred CCC--------cc-------cCceEEeccc--cCCcccccCC-CccCccccccceeEEEeehccceeeecHHHHHh---- Q lcl|Aclame:pro 1 ENG--------IN-------LANLCEYPND--IGDAADVAEG-GEISLDKIGTTTKSVTIKKAAKGTEITDEAALS---- 58 (231) Q Consensus 1 ~~~--------~~-------~G~ti~~P~~--igda~~v~EG-~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~---- 58 (231) .+. ++ .+.+.++... +..+|.++++ .+++.-.++.++.+++.|-++..-+.|=|...+ T Consensus 153 ~~~~~~~~~~g~~~~~~~~a~~~~~~~g~gMsTa~aE~lG~~~~~f~EMaFsIeK~tVtAKSRaLKAeYTiELAQDLKAi 232 (468) T protein:vir:10 153 TGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAI 232 (468) T ss_pred cccccccCCCCCcccccccccccccccccccchHHHhhcCCCCcccceeeeEEEEEEEeeeccceeccccHHHHHHHHHh Confidence 000 00 1111222211 2334445543 346666778889999999888777777765432 Q ss_pred cCCCHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-------ccc------ccC----HHHHHHHHHHhh---------c Q lcl|Aclame:pro 59 GYGDPIGESNKQLGLSLANKVDDDLLKAAKTTSQT-------VST------KAN----VDGVQAALDIFN---------D 112 (231) Q Consensus 59 ~~~d~~~~~~~~~a~~ia~~vd~~~~~~l~t~~~~-------~~~------~~~----~d~i~da~~~l~---------~ 112 (231) -+.|.-++..+-|+..|...|+.+++..+.+.+.. .++ ..+ .+.....+-.+. - T Consensus 233 HGLDAEtELaNILStEImlEINReii~~l~~va~~~k~~g~~~~Gv~d~~~~~~~rw~~e~~k~L~~~i~~ean~i~~~T 312 (468) T protein:vir:10 233 HGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQET 312 (468) T ss_pred cCCChhHHHHHHHHHHHHHHhcHHHHHhHhhhhhheecccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHhh Confidence 46899999999999999999999999887654321 111 111 111111111111 1 Q ss_pred cCCCceEEEECHHHHHHHHhhhhhhhccccccC-------ceeeec--cceeec-ceeEEEcCCCccCceEEEEEecCCc Q lcl|Aclame:pro 113 EDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGA-------NALING--TYADVL-GAQIVRSKKLAEGSALMFKIVSNSP 182 (231) Q Consensus 113 ~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~-------~~~~~G--~ig~~~-G~~Vv~s~~~~~~~~~~~~~~~~~~ 182 (231) .....++++++|.+++.|.. ..+......... +.--+| ..|.+. |++|+++.-+......-+.++..+| T Consensus 313 ~rg~gn~ii~S~~Va~~L~~-sG~l~~~~~~~~~~~~~~~~~D~tg~~~~G~l~~r~~vy~D~Ya~~~s~~dY~~vG~KG 391 (468) T protein:vir:10 313 RRGKGNFLICSADVASALAM-AGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKG 391 (468) T ss_pred ccccccEEEechhHHHHHhh-cCcceecccccccccccccccccCcceEEEEecCceEEEEccccccCCccceEEEEEec Confidence 13578899999999999985 222221111000 111112 245554 4799998644322211111111111 Q ss_pred eE----EEeecCC-ccceecc--chhhcccEEEEEEEEEEEEEcCCcE----------------------EEEEeccC Q lcl|Aclame:pro 183 AL----KLVLKRG-VQVETDR--DIVTKTTVITADEHYAAYLYDLTKV----------------------VNITFTGV 231 (231) Q Consensus 183 A~----~~~~k~~-v~vE~~R--d~~~~~~~i~~~~~y~~~~~~~~~v----------------------v~l~~~~~ 231 (231) .- +++. .+ +..+..| |+..+.-.+-...|||..+ ||=.. -++.+++. T Consensus 392 ~~~~d~glfy-aPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP~~~~~~~~~g~~~~~~~~~~~N~y~r~~~v~~l 467 (468) T protein:vir:10 392 TSPYDAGLFY-CPYVPLQMVRSIDPNTFQPKIGFKTRYGMVS-NPFVTTNGLYNGTPDGEALTPNANMYYRRVQVTNL 467 (468) T ss_pred Ccceeceeee-ccccccccccccCCCcccceeeeeeeeceee-cccceeccccCCCcccccccccccceeeeEEEecc Confidence 11 1111 01 1222222 8888888999999998865 66221 01222222 No 214 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=90.97 E-value=0.018 Score=30.14 Aligned_cols=217 Identities=14% Similarity=0.136 Sum_probs=112.2 Q ss_pred CCCcccC-ceEEeccc--cCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHh---cCCCHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLA-NLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALS---GYGDPIGESNKQLGLS 74 (231) Q Consensus 1 ~~~~~~G-~ti~~P~~--igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~---~~~d~~~~~~~~~a~~ 74 (231) .+--.-+ .|++++.+ .|-+..++.+++.|..+...+..+-++..+.-+++++.++... .+.|..++=.....++ T Consensus 83 ~t~g~w~~~t~~y~~~e~~G~a~~ygd~ad~Pl~~~~v~~~~~~v~~~~~g~~y~~~E~~~A~~~g~~l~~~Ka~aA~~a 162 (339) T protein:vir:94 83 VKKGDWTTTYGVFIIAEPVGQVATYSDWSANGMSKANVNFESRQNYRYQTWTEYGDLEMATYGEAGIDYVARQEISASLV 162 (339) T ss_pred ccCCCCcccEEEEeeeecccceEEcccccCCCcccccceeeEEeEEEEEEEEeecHHHHHHHHhhCCChHHHHHHHHHHH Confidence 2211222 37888865 7999999999999888877777777777776777777765544 3556666666666677 Q ss_pred HHHHHHHHHHHH--------hcc-----ccccccc---ccC----HHHHHHHHHHhhccC------CCceEEEECHHHHH Q lcl|Aclame:pro 75 LANKVDDDLLKA--------AKT-----TSQTVST---KAN----VDGVQAALDIFNDED------AQAYVLIVNPKDAA 128 (231) Q Consensus 75 ia~~vd~~~~~~--------l~t-----~~~~~~~---~~~----~d~i~da~~~l~~~~------~~~~v~vv~p~~~~ 128 (231) ++.++|+-.+-+ |-. +..+.++ ..+ +++|..++..+.... ..+..++++|..+. T Consensus 163 l~~~~N~i~~~Gd~~~~~~GLlN~P~l~~~v~~s~~Wa~kT~~eI~~Di~~~~~~l~~~s~g~~~~~~~~~L~LP~~~~~ 242 (339) T protein:vir:94 163 MAKFANSSYLLGVAGIANYGLMNDPSLPAPVAATVNWATAAPEDIANDVVAMVGRLISQSGGLITGQERMVMALAPSALN 242 (339) T ss_pred HHHhhceEEeeeecccceEEEEeCCCccccccCCCCcccCCHHHHHHHHHHHHHHHHHhcCCeeeeccCcEEEecHHHHH Confidence 777776543321 110 0001111 012 566677777664322 23567999999998 Q ss_pred HHHhhhhhhhccccccCceeeeccceeecceeEEEcCCCccC--ceEEE--EEecCCceEEEeecCC---ccceeccchh Q lcl|Aclame:pro 129 KIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEG--SALMF--KIVSNSPALKLVLKRG---VQVETDRDIV 201 (231) Q Consensus 129 ~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~--~~~~~--~~~~~~~A~~~~~k~~---v~vE~~Rd~~ 201 (231) .|.+-..+ + ....+.+... +-+++|+..+.+... +.... ....++.-+.+..-.. ..+| .. T Consensus 243 ~L~~~n~~---~-~Tvl~~lk~n----~pnl~i~~~~el~~a~g~~~~~~~~~~~~~~~~~~~~p~~~~~lpvq----~~ 310 (339) T protein:vir:94 243 NVNRTNNF---G-LSAGAKIAQT----YPNIQFVAVPEFDTASGRLVQLWVPEVNGQPTGEVAFAEKLRSHSIE----RY 310 (339) T ss_pred hcccCCcC---C-ccHHHHHHHh----cCCcEEEEccccccCCCceEEEEEEeccCCcceEEEcchhhhccccE----Ec Confidence 88642111 0 0111111111 224555555544321 11111 1111111111111111 1111 12 Q ss_pred hcccEEEEEE-EEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 202 TKTTVITADE-HYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 202 ~~~~~i~~~~-~y~~~~~~~~~vv~l~~~~~ 231 (231) .....+.... ..|+-++.|.+++.+ .|+ T Consensus 311 ~~~~~v~~~~rt~Gv~i~~P~ai~~~--~GI 339 (339) T protein:vir:94 311 STTTRQKHSGATFGAVIYQPWAVTQE--LGV 339 (339) T ss_pred CceEEecceeeeeeEEEEccceeeee--ecC Confidence 2233333333 379999999998886 455 No 215 >protein:vir:103463 Length: 521 # NCBI annotation: major head subunit precursor # Family: family:all:364 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803115;genbank:gi:116326395;genbank:GeneID:4405492 Probab=90.29 E-value=0.022 Score=29.71 Aligned_cols=225 Identities=10% Similarity=0.033 Sum_probs=116.8 Q ss_pred CCCcccCceEEeccccCCcccccC---------CCccCccccccceeEEEeehccceeeecHHHHHh----cCCCHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGDAADVAE---------GGEISLDKIGTTTKSVTIKKAAKGTEITDEAALS----GYGDPIGES 67 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igda~~v~E---------G~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~----~~~d~~~~~ 67 (231) -+++..|.+.+++. |-.+..+| +.+++.-.++.++.+++.|-++..-+.|=|...+ -+.|.-++. T Consensus 213 ~~~~~~~~~y~~~~--GmsTa~aEal~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtEL 290 (521) T protein:vir:10 213 KKQMEAGALVEIAE--GMATSIAELQESFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAEL 290 (521) T ss_pred cccccccceeeccc--ccchhhHhhhccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHH Confidence 12233444444442 22233333 3356777788899999999888777777765432 368999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcccc-----c------cccc------ccC---HHHHH----HHHHHhh--------cc-C Q lcl|Aclame:pro 68 NKQLGLSLANKVDDDLLKAAKTTS-----Q------TVST------KAN---VDGVQ----AALDIFN--------DE-D 114 (231) Q Consensus 68 ~~~~a~~ia~~vd~~~~~~l~t~~-----~------~~~~------~~~---~d~i~----da~~~l~--------~~-~ 114 (231) .+-|+..|...|+.+++..+...+ . ...+ ..+ +.... ..+..+. .- - T Consensus 291 aNILSTEImlEINReii~~i~~sa~~~~~g~t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r 370 (521) T protein:vir:10 291 SGILATEIMLEINREVVDWINYSAQVGKSGMTLTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGR 370 (521) T ss_pred HHHHHHHHHHHhhHHHhhhhhheeeeeeeeeeeccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhccc Confidence 999999999999999997653211 0 0011 111 11111 1111111 11 2 Q ss_pred CCceEEEECHHHHHHHHhhhhhhhcccc---ccCceeeec--cceeec-ceeEEEcCCCccCceE-EEEEecCCc---eE Q lcl|Aclame:pro 115 AQAYVLIVNPKDAAKIRKDANAKNIGSE---VGANALING--TYADVL-GAQIVRSKKLAEGSAL-MFKIVSNSP---AL 184 (231) Q Consensus 115 ~~~~v~vv~p~~~~~L~k~~~~~~~~~~---~~~~~~~~G--~ig~~~-G~~Vv~s~~~~~~~~~-~~~~~~~~~---A~ 184 (231) ...++++|+|++++.|..-..+...... .+-..-.++ ..|.+. |++|++++..+..=.. .++ +.. +. T Consensus 371 ~~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~K---G~~~~~~g 447 (521) T protein:vir:10 371 GEGNFIIASRNVVNVLASVDTGISYAAQGLATGFNTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYK---GPNEMDAG 447 (521) T ss_pred ccceEEEEchHHHHHHhhcccccccccccccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEe---CCcccccc Confidence 5678999999999998853222111111 010000111 135553 4789999877642110 011 000 11 Q ss_pred EEe-ecCCccceeccchhhcccEEEEEEEEEEEEEcCCcEE---------------------------EEEeccC Q lcl|Aclame:pro 185 KLV-LKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVV---------------------------NITFTGV 231 (231) Q Consensus 185 ~~~-~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv---------------------------~l~~~~~ 231 (231) .++ .--+...-.--|+..+.-.+-...|||..+ ||=..- ++.+++. T Consensus 448 lfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP~~~~~~~~~~~~i~~~~~~~~a~~~~~sy~r~v~v~~l 521 (521) T protein:vir:10 448 IYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFAESAAQAPASRIQSGMPSILNSLGKNAYFRRVYVKGI 521 (521) T ss_pred eeeccccccccccccCCccccceeeeeeeeceee-cCcccccCCccceeecccchhhhccccccceeeeeeecCC Confidence 111 111111111248888888898899998765 552211 1112222 No 216 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=90.09 E-value=0.023 Score=29.60 Aligned_cols=219 Identities=13% Similarity=0.080 Sum_probs=115.7 Q ss_pred CCCcccC----ceEEeccc--cCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHh---cCCCHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLA----NLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALS---GYGDPIGESNKQL 71 (231) Q Consensus 1 ~~~~~~G----~ti~~P~~--igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~---~~~d~~~~~~~~~ 71 (231) =.-.+.| .++.||.+ .|-+..++.++++|..+......+.++..++-++.++.++... ...|..++=.... T Consensus 76 ~pv~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA 155 (336) T protein:vir:10 76 VGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSS 155 (336) T ss_pred ccccccCCccceeEEEeeeeceeeEEEeeccCCCceeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHHHHHH Confidence 0001222 35677753 7888999999999999999999999999999999999765443 4667777777777 Q ss_pred HHHHHHHHHHHHHHH--------hcc-----ccccccc--------ccCHHHHHHHHHHhhccC------CCceEEEECH Q lcl|Aclame:pro 72 GLSLANKVDDDLLKA--------AKT-----TSQTVST--------KANVDGVQAALDIFNDED------AQAYVLIVNP 124 (231) Q Consensus 72 a~~ia~~vd~~~~~~--------l~t-----~~~~~~~--------~~~~d~i~da~~~l~~~~------~~~~v~vv~p 124 (231) .+++..++++-.+-. +-. +..+.++ .-.+++|..++..|.... ..+..++++| T Consensus 156 ~~ale~~~N~i~~~Gd~~~~~yGllN~P~l~a~~t~~t~~~~~~t~eei~~Di~~~~~~l~~qs~G~i~~~~~~tL~LP~ 235 (336) T protein:vir:10 156 ALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPP 235 (336) T ss_pred HHHHHHhhCcEEEEeccccceEEEEeCCCCccccccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeecccCcceEEecH Confidence 777777777533311 100 0111111 123667777777665422 3478899999 Q ss_pred HHHHHHHhhhhhhhccccccCceeeeccceeecceeEEEcCCCcc--CceEEEEE--ecCCceEEEeecCCccc-eeccc Q lcl|Aclame:pro 125 KDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAE--GSALMFKI--VSNSPALKLVLKRGVQV-ETDRD 199 (231) Q Consensus 125 ~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~--~~~~~~~~--~~~~~A~~~~~k~~v~v-E~~Rd 199 (231) ..+..|.+-..+ +. ...+.+.. .+=+++|+..+.+.. |+...+.+ ..+..-..+..-...+. ... T Consensus 236 ~~~~~Ls~~n~~---g~-Tvl~~lk~----n~Pnl~i~t~pEl~~a~G~~~~l~~~~~~~~~t~~~~~p~~~~~l~vq-- 305 (336) T protein:vir:10 236 TAMSDLSKTNQY---GL-AAAAKLKD----IFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIE-- 305 (336) T ss_pred HHHHhccCCCcc---Cc-cHHHHHHH----hcCccEEEEccccccCCCceEEEEEEecCCCcceeeecchhhhcccee-- Confidence 988877532111 00 01111111 122455555554432 22111111 11111111111111000 001 Q ss_pred hhhcccEEEEE-EEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 200 IVTKTTVITAD-EHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 200 ~~~~~~~i~~~-~~y~~~~~~~~~vv~l~~~~~ 231 (231) .......+... ...|+-++.|.+++++ .|+ T Consensus 306 ~~~~~~~v~~~~rt~Gv~i~~P~ai~~~--~GI 336 (336) T protein:vir:10 306 RYSSYFRQKKSAGTWGAVIFRPFAVAQM--IGV 336 (336) T ss_pred ecCceeEeccccceeeeeeeccchheee--ecC Confidence 11112222222 2478999999999886 455 No 217 >protein:vir:99888 Length: 309 # NCBI annotation: capsid protein # Family: family:all:908 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164075;genbank:gi:56692607;genbank:GeneID:3192616 Probab=89.85 E-value=0.024 Score=29.46 Aligned_cols=228 Identities=8% Similarity=0.004 Sum_probs=116.5 Q ss_pred CCCcccCce--------------EEeccc---cCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHh--cCC Q lcl|Aclame:pro 1 ENGINLANL--------------CEYPND---IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALS--GYG 61 (231) Q Consensus 1 ~~~~~~G~t--------------i~~P~~---igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~--~~~ 61 (231) .|+-..|+. .+|++. .=.-+.++.|.....-+++.++.+..++..+-...|..+++.. +.. T Consensus 20 ~n~~~Ia~~l~P~vpV~~~~~~~~~f~~~e~F~~~~t~r~~~~~~~~v~~~~~~~~~~~~~~~L~~~i~~~~~~~a~~~~ 99 (309) T protein:vir:99 20 RNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDETGSTEDHGLDAPVPQADIDNAPTNY 99 (309) T ss_pred cChhhhhhhcCCccccCccccceeeechhhcccccchhhccCCCcceEeecccCceeeecccceeecCCchhhhhccCCC Confidence 333333443 333321 1111345677666655666677788888888877777776554 468 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHhcccc---------ccc------ccccCHHHHHHHHHHhhccCCCceEEEECHHH Q lcl|Aclame:pro 62 DPIGESNKQLGLSLANKVDDDLLKAAKTTS---------QTV------STKANVDGVQAALDIFNDEDAQAYVLIVNPKD 126 (231) Q Consensus 62 d~~~~~~~~~a~~ia~~vd~~~~~~l~t~~---------~~~------~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~ 126 (231) ||...+++.+...|....+..+.+-+.... ++- ....-+..|.++...+ ...++.+++..+. T Consensus 100 d~~~~Av~~l~~~i~l~rE~~~A~lv~~~a~y~~~~k~~Lsgt~~wsd~~SDPi~~i~~~~~~~---g~~PN~~vlg~~~ 176 (309) T protein:vir:99 100 NPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSNPLPVITDALDSV---ILRPNIGVLGRRT 176 (309) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHhcChhhcCCCceEEecCccccCCCCCCcHHHHHHHHHhh---CCCcceEEechHH Confidence 999999999988887666654433222111 111 1122355666666665 4689999999999 Q ss_pred HHHHHhhhhhhhccccccC--ceeeeccceeecce-eEEEcCCC-----ccCceEEEEEecCCceEEEeecCC------- Q lcl|Aclame:pro 127 AAKIRKDANAKNIGSEVGA--NALINGTYADVLGA-QIVRSKKL-----AEGSALMFKIVSNSPALKLVLKRG------- 191 (231) Q Consensus 127 ~~~L~k~~~~~~~~~~~~~--~~~~~G~ig~~~G~-~Vv~s~~~-----~~~~~~~~~~~~~~~A~~~~~k~~------- 191 (231) +..|+..+.......+... ..+.--.+-.++|+ +|++.... +..++.+-.+-...-++.+..... T Consensus 177 ~~~l~~hp~i~~~ik~~~~~~g~it~~~la~l~~ve~V~vg~a~~n~a~~g~~~~~~~iwg~~~~L~y~~~~~~~~~~ps 256 (309) T protein:vir:99 177 ATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNIARPGQNPNLIRAWGPHASFIYRDRLADTRNGTT 256 (309) T ss_pred HHHHhhCHHHHHHhcCCCccccccCHHHHHHHhCcceEEeecceeeccccccccccccccCCcEEEEEcCCCCCCccccc Confidence 9999999988776443322 23434455678888 57764322 111111111111111111111111 Q ss_pred ------------ccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 192 ------------VQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 192 ------------v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) =.++..+-.+.+...|+...++--+++-+..-..|+-..- T Consensus 257 ~G~t~~~~~r~~g~~~d~~~~~~g~~~vr~~~~~k~~i~~~d~G~li~~~va 308 (309) T protein:vir:99 257 FGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAVA 308 (309) T ss_pred ccceeecccccCCceeeeeeccCCceEEEEeccccchhcchhcchhhhhccc Confidence 1122222223333344444444433333333222211111 No 218 >protein:vir:95318 Length: 328 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512264;genbank:gi:89152431;genbank:GeneID:3952987 Probab=89.22 E-value=0.028 Score=29.13 Aligned_cols=171 Identities=12% Similarity=0.050 Sum_probs=101.6 Q ss_pred CCCcccCceEEeccccCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHH---HHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGE---SNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~---~~~~~a~~ia~ 77 (231) ++++.+.-+ +..-...++...=++.+++++-++.+++.++.-.+..++|+.... +-.++.-+. -.++..+++.. T Consensus 47 n~gt~~~~~--v~~~LP~~~fR~lN~g~~~s~~tt~q~t~~l~ilgg~~eVDr~la-~~~Gn~~~~ra~q~~~~~ka~~~ 123 (328) T protein:vir:95 47 NLPTGHRTT--IRSGLPSATWRLLNYGVQPSKSTTVQVTDSVGMLETYAEVDKSLA-DLNGNTAEFRLSEDRAFIEAMNQ 123 (328) T ss_pred ccCCcceee--EeeccCCceeeecCCccCcccceeEEEEEEEEEEecceeechHHH-hhcCCHHHHHHHHHHHHHHHHHH Confidence 223322222 222234556667777799999999999999999999999988554 444555433 44556677777 Q ss_pred HHHHHHHHH-----------hc------c------------c--ccc--------------------------------- Q lcl|Aclame:pro 78 KVDDDLLKA-----------AK------T------------T--SQT--------------------------------- 93 (231) Q Consensus 78 ~vd~~~~~~-----------l~------t------------~--~~~--------------------------------- 93 (231) ++...++.+ |. + + ..+ T Consensus 124 ~~~~~~iyGdsa~~p~~F~GL~~R~~~~s~~~a~qiidaGgtg~~~TSi~~v~~g~~~~~giyPkG~~~Gl~~~d~g~~~ 203 (328) T protein:vir:95 124 QMAQTLFYGDSSVNPQQFMGLSSRYSSLSAGNAQNIIDAGGTGTDNTSIWLVVWGENTVHGIFPKGKKAGIQMEDKGQVT 203 (328) T ss_pred HHHHHHhcCCccCChhhhcchhhhcCccccccccceeecccCCCCceEEEEEEEcCCeEEEecccccccCceeeecCcee Confidence 777766631 10 0 0 000 Q ss_pred --------------------------------------------cccccCHHHHHHHHHHhhccCCCceEEEECHHHHHH Q lcl|Aclame:pro 94 --------------------------------------------VSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAK 129 (231) Q Consensus 94 --------------------------------------------~~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~ 129 (231) +......+.+.+|+..+-.......+++||..+... T Consensus 204 ~~~~~g~~y~~y~~~~~w~~Gl~i~d~r~vvrI~NId~~~l~~~~~~~~l~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~ 283 (328) T protein:vir:95 204 LEDANGGKYEGYRTHYKWDNGLALRDWRYVVRIANIDVSNLSEPSSAANIAKLMVKALHRIPNRGMGRPVFYMNRTVGQA 283 (328) T ss_pred eecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEecCcccccccccChhhHHHHHHHHHHHhccCCCCcceeehhHHHHHH Confidence 000011234456666664344567889999999999 Q ss_pred HHhhhhhhhccccccCceeeeccceeecceeEEEcCCCccCceEEE Q lcl|Aclame:pro 130 IRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMF 175 (231) Q Consensus 130 L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~ 175 (231) |++....... .+..-+-.....+-.+.|+||.+++.+-.++...+ T Consensus 284 L~~q~~~~~n-~~~~~~~~~g~~~t~~~gipir~~dai~~tE~~vv 328 (328) T protein:vir:95 284 LDLQSLEKTS-LAISVKETEGEWWTSFRGVPIRETDALLETEARVV 328 (328) T ss_pred HHHHHhcCcc-eeeeeeccCCcceeEECCeEEEEEeeeecCccccC Confidence 9885432221 11111212223456789999999998877766433 No 219 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=89.21 E-value=0.028 Score=29.12 Aligned_cols=217 Identities=13% Similarity=0.090 Sum_probs=115.1 Q ss_pred CCCcccC----ceEEeccc--cCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHh---cCCCHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLA----NLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALS---GYGDPIGESNKQL 71 (231) Q Consensus 1 ~~~~~~G----~ti~~P~~--igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~---~~~d~~~~~~~~~ 71 (231) + .+.| .++.||.+ .|-+..++.++++|..+......+.++..++-++.++.++... ...|..++=.... T Consensus 78 v--~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~~d~~~~~~~~~v~~~~~g~~yg~~E~~~Aa~~~~~l~~~Ka~aA 155 (336) T protein:vir:36 78 E--SKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSS 155 (336) T ss_pred c--cccCCccceeEEEeeeeceeeEEEeeccCCCceeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHHHHHH Confidence 2 1222 35677743 7888999999999999999999999999999999998655433 4667777766667 Q ss_pred HHHHHHHHHHHHHHH--------hcc-----ccccccc--------ccCHHHHHHHHHHhhccC------CCceEEEECH Q lcl|Aclame:pro 72 GLSLANKVDDDLLKA--------AKT-----TSQTVST--------KANVDGVQAALDIFNDED------AQAYVLIVNP 124 (231) Q Consensus 72 a~~ia~~vd~~~~~~--------l~t-----~~~~~~~--------~~~~d~i~da~~~l~~~~------~~~~v~vv~p 124 (231) .+++..++++-.+-. +-. +..+.++ .-.+++|..++..+.... ..+..++++| T Consensus 156 ~~ale~~~N~i~~~Gd~~~~~yGllNdP~l~a~~t~~t~~~~~~t~~ei~~Di~~~~~~l~~qt~G~i~~~~~~tL~LP~ 235 (336) T protein:vir:36 156 ALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPP 235 (336) T ss_pred HHHHHHhhCcEEEEeccccceEEEEecCCCccccccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccEEEech Confidence 777777777533311 100 0011111 123667777777665422 3477899999 Q ss_pred HHHHHHHhhhhhhhccccccCceeeeccceeecceeEEEcCCCcc--CceEEEEE--ecCCceEEEeecCCccc-eeccc Q lcl|Aclame:pro 125 KDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAE--GSALMFKI--VSNSPALKLVLKRGVQV-ETDRD 199 (231) Q Consensus 125 ~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~--~~~~~~~~--~~~~~A~~~~~k~~v~v-E~~Rd 199 (231) ..+..|.+-..+ +. ...+.+.. .+=+++|+..+.+.. |+...+.+ ..+..-..+..-...+. ... T Consensus 236 ~~~~~Ls~~n~~---g~-Tvl~~lk~----n~Pnl~i~t~pEl~~a~g~~~~l~~~~~~~~~t~~~~~p~~~~~l~vq-- 305 (336) T protein:vir:36 236 TAMSDLSKTNQY---GL-AAAAKLKD----IFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIE-- 305 (336) T ss_pred HHHHhccCCCcc---Cc-cHHHHHHH----hcCccEEEEccccccCCCceEEEEEEecCCCcceeeecchhhhcccee-- Confidence 988877532111 00 01111111 122355555554432 22221111 11111111111111100 001 Q ss_pred hhhcccEEEEE-EEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 200 IVTKTTVITAD-EHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 200 ~~~~~~~i~~~-~~y~~~~~~~~~vv~l~~~~~ 231 (231) .......+... ...|+-++.|.+++++ .|+ T Consensus 306 ~~~~~~~v~~~~rt~Gv~i~~P~ai~~~--~GI 336 (336) T protein:vir:36 306 RYSSYFRQKKSAGTWGAVIFRPFAVAQM--IGV 336 (336) T ss_pred ecCceeEeccccceeeeeeeccchheee--ecC Confidence 11112222222 2478899999999886 455 No 220 >protein:vir:6901 Length: 522 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861877;genbank:gi:32453668;genbank:GeneID:1494303 Probab=88.69 E-value=0.031 Score=28.87 Aligned_cols=226 Identities=10% Similarity=0.032 Sum_probs=115.6 Q ss_pred CCCcccCceEEeccccCCcccccC---------CCccCccccccceeEEEeehccceeeecHHHHHh----cCCCHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGDAADVAE---------GGEISLDKIGTTTKSVTIKKAAKGTEITDEAALS----GYGDPIGES 67 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igda~~v~E---------G~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~----~~~d~~~~~ 67 (231) -++.-.|.+.++.. |-.+..+| +.+++.-.++.++.+++.|-++..=+.|=|...+ -+.|.-++. T Consensus 214 ~s~~~~~~~y~~g~--GmsTa~aEal~~lggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtEL 291 (522) T protein:vir:69 214 IKQMEAGALVEIAE--GMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAEL 291 (522) T ss_pred ccccccccceeecc--ccchhhhhhcccCCCCcccchhhhcceEeeEEEeeecccccccccHHHHHHHHHhcCCChHHHH Confidence 11222333344431 22232333 2356777788899999999888777777765432 368999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcccc-----c------c------cccccC-------HHHHHHHHHHhhc--------cC- Q lcl|Aclame:pro 68 NKQLGLSLANKVDDDLLKAAKTTS-----Q------T------VSTKAN-------VDGVQAALDIFND--------ED- 114 (231) Q Consensus 68 ~~~~a~~ia~~vd~~~~~~l~t~~-----~------~------~~~~~~-------~d~i~da~~~l~~--------~~- 114 (231) .+-|+..|...|+.+++..+...+ + + .....+ .+.....+..+.. -. T Consensus 292 aNILSTEImlEINReii~~i~~sa~~~~~g~t~~~~~~~Gv~Dl~~~~~~~~~rw~~e~~k~L~~~i~~~an~i~~~T~r 371 (522) T protein:vir:69 292 SGILATEIMLEINREVVDWINYSAQVGKSGMTNIVGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGR 371 (522) T ss_pred HHHHHHHHHHHhhHHHHhhhhhhheeeccccccccccccceeecccccccccchhHHHHHHHHHHHHHHHHHHHHHhccc Confidence 999999999999999997653111 0 0 011111 1111111112211 11 Q ss_pred CCceEEEECHHHHHHHHhhhhhhhcccc---ccCceeeec--cceeec-ceeEEEcCCCccCceEEEEEecCCc---eEE Q lcl|Aclame:pro 115 AQAYVLIVNPKDAAKIRKDANAKNIGSE---VGANALING--TYADVL-GAQIVRSKKLAEGSALMFKIVSNSP---ALK 185 (231) Q Consensus 115 ~~~~v~vv~p~~~~~L~k~~~~~~~~~~---~~~~~~~~G--~ig~~~-G~~Vv~s~~~~~~~~~~~~~~~~~~---A~~ 185 (231) ...++++|+|++.+.|..-..+...... .+-..-.++ ..|.+. |++|++++..+..=. .+=+. +.. +.. T Consensus 372 g~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~-~vG~K-G~~~~~~gl 449 (522) T protein:vir:69 372 GEGNFIIASRNVVNVLASVDTGISYAAQGLASGFNTDTTKSVFAGVLGGKYRVYIDQYAKQDYF-TVGYK-GANEMDAGI 449 (522) T ss_pred ccccEEEEchhHHHHHhhcccccccccccccccccccCCCceEEEEecCceEEEecCCCCcceE-EEEEe-CCcccccce Confidence 3678999999999999753211111110 010111111 135553 479999987664211 00000 100 111 Q ss_pred Ee-ecCCccceeccchhhcccEEEEEEEEEEEEEcCCcE-------EEEE--------------------eccC Q lcl|Aclame:pro 186 LV-LKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKV-------VNIT--------------------FTGV 231 (231) Q Consensus 186 ~~-~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~v-------v~l~--------------------~~~~ 231 (231) ++ .--+...-.--|+..+.-.+-...|||..+ ||=.. .+|. +|+. T Consensus 450 fyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~v-NP~~~~~~~~~~~ri~~g~p~~~~~~~~n~y~r~v~v~~~ 522 (522) T protein:vir:69 450 YYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGV-NPFAESSLQAPGARIQSGMPSILNSLGKNAYFRRVYVKGI 522 (522) T ss_pred eeccccccccccccCCccccceeeeeeeeceee-cCcccccCCcccceeecccchhhcccCCcceeeEEEeecC Confidence 11 111111112248899999999999998865 55211 1111 1111 No 221 >protein:vir:96442 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:11266 # MgeID: mge:1616 # MgeName: 119X # Cross-refs: genbank:acc:YP_001218814;genbank:gi:147917331;genbank:GeneID:5142645 Probab=88.23 E-value=0.034 Score=28.66 Aligned_cols=224 Identities=15% Similarity=0.099 Sum_probs=104.6 Q ss_pred CCC---------------cccCceEEeccccCC--cc-------------cccCCCccCccccccceeEEEeehc-ccee Q lcl|Aclame:pro 1 ENG---------------INLANLCEYPNDIGD--AA-------------DVAEGGEISLDKIGTTTKSVTIKKA-AKGT 49 (231) Q Consensus 1 ~~~---------------~~~G~ti~~P~~igd--a~-------------~v~EG~~i~~~~lt~~~~~~tikk~-g~~~ 49 (231) |++ + .||++++-+=-++ ++ .+.||.+.+...-......-.+.|+ -+.+ T Consensus 89 ~~~l~~~~~~~EvirVtsV-ng~~lTV~RG~~~t~aa~iaag~~~~~ig~~~eEGsd~~ta~~~k~~~vsN~tQIf~e~v 167 (418) T protein:vir:96 89 KGMIFYNEATGENMRLELV-NGLNLTVKRQTGRIAAAIIAANTKLIVIGTAFEEGSQRPTARSIQPVYVPNFTQIFRNAW 167 (418) T ss_pred cccEEEEecCCeEEEEEEE-eCCEEEEEEccCCeeeeeeecCceEEEeecCcccccccCCcceecceeccchhheehhhh Confidence 222 3 3777777542122 11 2456666654431111111133332 3456 Q ss_pred eecHHHHH---h-cCCCHHHHHHHHHHHHHHHHHHHHHHHHh------cccc-------------------ccc--cccc Q lcl|Aclame:pro 50 EITDEAAL---S-GYGDPIGESNKQLGLSLANKVDDDLLKAA------KTTS-------------------QTV--STKA 98 (231) Q Consensus 50 ~itD~~~~---~-~~~d~~~~~~~~~a~~ia~~vd~~~~~~l------~t~~-------------------~~~--~~~~ 98 (231) +||+-+.. + +..|....-.+.+-.. ..+++..++..- .+.+ .+. ...+ T Consensus 168 sVSgTAqA~v~qaGvsn~~~~e~d~l~~~-kv~iE~ali~g~~~~~~~ng~p~~~t~R~m~gI~~f~~~Nvi~ag~~~~~ 246 (418) T protein:vir:96 168 ALTDTARASYAEAGYSNITESRRDCMDFH-ATEQETAIFFGQAFMGTYNGQPLHTTQGIVDAIRQYAPDNVNAMPNPTAV 246 (418) T ss_pred hhhhhhhhhhhhcCcchhHHHHHHHHHHH-HHHHHHhhhccccccCCCCCcccccccchhHHHHhhccccccccCCCCcC Confidence 66665433 2 4445543332223222 122333222110 1000 111 1246 Q ss_pred CHHHHHHHHHHhhc-----cCCC---ceEEEECHHHHHHHHhhhhhhh-ccccccCceeeeccceeecc-eeEEEcCCCc Q lcl|Aclame:pro 99 NVDGVQAALDIFND-----EDAQ---AYVLIVNPKDAAKIRKDANAKN-IGSEVGANALINGTYADVLG-AQIVRSKKLA 168 (231) Q Consensus 99 ~~d~i~da~~~l~~-----~~~~---~~v~vv~p~~~~~L~k~~~~~~-~~~~~~~~~~~~G~ig~~~G-~~Vv~s~~~~ 168 (231) ++|.+++++...-. .+.. .++++|++++...+-|.-.... ..++..-....+.. -.-+| ++|+.++++| T Consensus 247 t~d~L~~~~~~a~~~g~n~G~~~~~~~y~~~V~a~~k~~I~k~~~~I~~~~~en~~G~vv~~~-~Td~G~v~ii~n~~~p 325 (418) T protein:vir:96 247 TYDDVVDATIDAFKWSVNVGDNTQRVMFCDTVGMRTMQDIGRFFGEVTVTQRETSYGMVFTEW-KFFKGRLIIKEHPLFS 325 (418) T ss_pred CHHHHHHHHHHHHhhcCCCCCcccceEEEEEeChHHHHHHhhhhceeEeccccceeceEEEEE-EeeccEEEEEecCCCC Confidence 89999888765322 1122 2668999999888876532110 11122222233322 23357 5999999776 Q ss_pred cCc-----eEEEEEecCCceEEEeec--CCcccee-------------ccchhhcccEEEEEE--EEEEEEEcCCcEEEE Q lcl|Aclame:pro 169 EGS-----ALMFKIVSNSPALKLVLK--RGVQVET-------------DRDIVTKTTVITADE--HYAAYLYDLTKVVNI 226 (231) Q Consensus 169 ~~~-----~~~~~~~~~~~A~~~~~k--~~v~vE~-------------~Rd~~~~~~~i~~~~--~y~~~~~~~~~vv~l 226 (231) +.+ .+.++ +.++.+-.- |+...|. +..-...-|...+.- -|...+.||.++++| T Consensus 326 ad~I~~g~mlVvD----~~~vkL~yL~~R~~~~E~l~k~G~~~~~~~~~~~~~~~~D~~~G~l~~Eltle~~N~~a~a~i 401 (418) T protein:vir:96 326 AIGISPGFAVVVD----VPAVKLAYMDGRNAKVENYGQGGGENKSGATDYSYGHGVDAQGGSLTSEWALELLNPQGCAVI 401 (418) T ss_pred ccccCcceEEEEe----cCceEEEEecCCCccchhcccCCCcccccccccccccccccccCEEEEEEEEEeecccccEEe Confidence 544 34333 444444332 4444332 222222234443333 389999999999998 Q ss_pred EeccC Q lcl|Aclame:pro 227 TFTGV 231 (231) Q Consensus 227 ~~~~~ 231 (231) +-=+- T Consensus 402 tgl~~ 406 (418) T protein:vir:96 402 TGLQK 406 (418) T ss_pred ecccc Confidence 64333 No 222 >protein:vir:4902 Length: 348 # NCBI annotation: gp348 # Family: family:all:1083 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056680;genbank:gi:9635015;genbank:GeneID:1262657 Probab=87.62 E-value=0.038 Score=28.40 Aligned_cols=227 Identities=11% Similarity=0.098 Sum_probs=95.3 Q ss_pred CCCcccCceEEeccccCC---cccccCCCccCccc-cccceeEEEeehccceeeecHHHH--HhcCCCH-----HHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGD---AADVAEGGEISLDK-IGTTTKSVTIKKAAKGTEITDEAA--LSGYGDP-----IGESNK 69 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igd---a~~v~EG~~i~~~~-lt~~~~~~tikk~g~~~~itD~~~--~~~~~d~-----~~~~~~ 69 (231) .--+..-+-+.+....|. +..+.++.+-+..+ =.....+.++-..+....++..+. .....++ ...+.+ T Consensus 35 ~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~r~~~~~~~~~~p~i~~~~~i~~~d~~~l~~~~~~~~~~~~~~~~~ 114 (348) T protein:vir:49 35 ARKQLGTKLSYITGASGQSVALKAAAFDTNVTVRDRVSAEMHDEQMPFFKEAMLVKENDRQQLNLVKDSGNAALVNTIVA 114 (348) T ss_pred CccccCceeEEEEeecCceeeeeeecCCCCcceecccceeeeeeecCccccccccCHHHHHHHHHHhccCCHHHHHHHHH Confidence 111111111111111111 22344443332221 123334444444444444544332 1111111 222333 Q ss_pred HHHH-------HHHHHHHHHHHHHhcccc---------------------------cccccccCHHHHHHHHHHhhccCC Q lcl|Aclame:pro 70 QLGL-------SLANKVDDDLLKAAKTTS---------------------------QTVSTKANVDGVQAALDIFNDEDA 115 (231) Q Consensus 70 ~~a~-------~ia~~vd~~~~~~l~t~~---------------------------~~~~~~~~~d~i~da~~~l~~~~~ 115 (231) +++. .+.+.++--+..+|.+.. .+.++..-+.+|.+....+.+.+. T Consensus 115 ~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~i~~~g~~~~vdyg~~~~~~~t~~~~W~~~~adp~~di~~~~~~~~~~G~ 194 (348) T protein:vir:49 115 GIFNDNLTLVNGARARLEAMRMQVLATGKIAFTSDGVNKDIDYGVKPDHKKQVSKSWAEPGATPLADLEDAIETARELGL 194 (348) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEecCCceEEEeecCCcccceeeeeccCCCCCCHHHHHHHHHHHHHhcCC Confidence 3333 333333333344443211 011122336777777777877778 Q ss_pred CceEEEECHHHHHHHHhhhhhhhcccccc--Cceee----eccceeecceeEEEcC-C-----------CccCceEEEEE Q lcl|Aclame:pro 116 QAYVLIVNPKDAAKIRKDANAKNIGSEVG--ANALI----NGTYADVLGAQIVRSK-K-----------LAEGSALMFKI 177 (231) Q Consensus 116 ~~~v~vv~p~~~~~L~k~~~~~~~~~~~~--~~~~~----~G~ig~~~G~~Vv~s~-~-----------~~~~~~~~~~~ 177 (231) .+..++|+++.+..|++++.+........ ...+. +..++.+.|++|++=+ . +|+++.+++. T Consensus 195 ~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~~~i~~~~~~~~~~~~~g~~i~~y~~~y~d~dG~~~~~~p~~~v~l~~- 273 (348) T protein:vir:49 195 NPERAVMNAKTFGLIRKAASTVKVIKPLAGDGSSVTKAELDNYIADNFGVTVVLENGTYRNEKGEVSKFFPDGHLTLIP- 273 (348) T ss_pred cccEEEeCHHHHHHHhcCHHHHHHhhccCcccccccHHHHHHHHHhhcCceEEEEeeEEEecCCcEeeeecCCeEEEec- Confidence 88999999999999999988765422111 11111 1223456677776543 1 2222222211 Q ss_pred ecCCceEEEeecCC-----------------------ccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEec-cC Q lcl|Aclame:pro 178 VSNSPALKLVLKRG-----------------------VQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFT-GV 231 (231) Q Consensus 178 ~~~~~A~~~~~k~~-----------------------v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~-~~ 231 (231) .+.++...-.. +-+-.+...+--...+.+..+--..+.+|+++.++++- || T Consensus 274 ---~~~~G~~~yg~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl~~~ 348 (348) T protein:vir:49 274 ---NGPLGNTVFGTTPEESDLFADNTVNADVEIVDNGIAVTTTKTTDPVNVQTKVSMVALPSFERLDDVYMLTVIPAV 348 (348) T ss_pred ---CCCcceeEEecChhhhhhccccccccceeecCCeEEEeeeecCCCceEEEEEeeeccccccCCCcEEEEEEecCC Confidence 22222111100 00111111111112233333344556788888888765 44 No 223 >protein:vir:98871 Length: 314 # NCBI annotation: major capsid protein # Family: family:all:3269 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164418;genbank:gi:56694908;genbank:GeneID:3197261 Probab=87.26 E-value=0.04 Score=28.25 Aligned_cols=220 Identities=13% Similarity=0.085 Sum_probs=117.8 Q ss_pred CCCcccCceEEeccccCCcccccCCCccC-ccccccceeEEEeehcc---------------ceeeecH-HHHHhcCCCH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGDAADVAEGGEIS-LDKIGTTTKSVTIKKAA---------------KGTEITD-EAALSGYGDP 63 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igda~~v~EG~~i~-~~~lt~~~~~~tikk~g---------------~~~~itD-~~~~~~~~d~ 63 (231) +.|+..-+|.-.-| +.|.- |.=|.++. -+..-+++-+-.-.+.| ..|.+-+ .+...-+.|+ T Consensus 60 lDGV~~N~tafsvK-tsD~p-VVig~~Y~TdeNvaFGtGTg~SsRFGprkEi~y~dtdVpY~~~~~iHEGiD~~TVNnd~ 137 (314) T protein:vir:98 60 LDGVQHNDTAFYVK-TSDIP-VVVGNEYNKDENVGFGEGTSRSTRFGPRREIIYQDTPVPYTWEWVYHEGIDKHTVNNDF 137 (314) T ss_pred ccCCCccceEEEEe-ecccc-eeecCcccCCCCcccccCCccccccCceeEEEeecccccccccchhhhccccccccCCh Confidence 67777766642111 11111 00121111 11111111111111111 1222111 1222333344 Q ss_pred ---HHHHHHHHHHHHHHHHHHHHHHHhcccccccc--cccCHHHHHHHHHHhhccC-----CCceEEEECHHHHHHHHhh Q lcl|Aclame:pro 64 ---IGESNKQLGLSLANKVDDDLLKAAKTTSQTVS--TKANVDGVQAALDIFNDED-----AQAYVLIVNPKDAAKIRKD 133 (231) Q Consensus 64 ---~~~~~~~~a~~ia~~vd~~~~~~l~t~~~~~~--~~~~~d~i~da~~~l~~~~-----~~~~v~vv~p~~~~~L~k~ 133 (231) +++=++.+|.++++.+|..+=..|........ +.++.|.+.+....+...- ..+-+..+||..|..|... T Consensus 138 ~aaVAdRL~LQA~Akt~~~n~~~Gk~lS~~As~te~ltd~~~d~V~~LF~~as~~yvn~ev~~~~~AyV~~evYnaiiD~ 217 (314) T protein:vir:98 138 QAAVADRLDLQANAKIKQFNAQHSKFISSIAEKTETLTDYSADNVLRLFNELSKYYVNIEAIGTKAAKVSPELYNAIVDH 217 (314) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhcchhhHHHHHHHHHhhhhcceeeEEEEEEEchhHHhHhhcc Confidence 55556677888888888866666654443322 3456677776665554322 2345689999999999877 Q ss_pred hhhhhccccccCceeeeccceeecceeEEEcC--CCccCceEEEEEecCCceEEEeecCCccceecc---chhhcccEEE Q lcl|Aclame:pro 134 ANAKNIGSEVGANALINGTYADVLGAQIVRSK--KLAEGSALMFKIVSNSPALKLVLKRGVQVETDR---DIVTKTTVIT 208 (231) Q Consensus 134 ~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~--~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~R---d~~~~~~~i~ 208 (231) +... ..+.+..++-.|| +..+-|+-+...+ .+..+....... .-++. ..+-+.+-| .++..-..+. T Consensus 218 ~l~T-saK~SsaNIDeng-i~~FkGf~i~e~P~~~~q~g~ia~~s~----dnig~---aftGIn~aR~IesEdF~GValQ 288 (314) T protein:vir:98 218 PLTT-SAKSSSANIDQNG-IVNFKGFAIQEIPESMLQSGDVAYTYI----TNIGK---AFTGINTSRIIESEDFDGVALQ 288 (314) T ss_pred cccc-ccccceeeeccCC-cceecceEEEecchhhcCCCcEEEEcc----cccee---ecccceeeeeeecccccceeee Confidence 6543 2333344444444 5788999888776 344454433221 11221 122333444 3456678899 Q ss_pred EEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 209 ADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 209 ~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) +---||-++.+.++.+.+++++. T Consensus 289 gAGK~G~~I~edNk~Ai~k~t~t 311 (314) T protein:vir:98 289 GAGKAGEFILDDNKKAVAKVTST 311 (314) T ss_pred cccccccccccccceeeEEEecC Confidence 99999999999999999999999 No 224 >protein:vir:104915 Length: 470 # NCBI annotation: T4-like major capsid protein # Family: family:all:364 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214367;genbank:gi:61806007;genbank:GeneID:3294435 Probab=86.69 E-value=0.044 Score=28.03 Aligned_cols=228 Identities=14% Similarity=0.134 Sum_probs=115.6 Q ss_pred CCCcccC-----------ceEEecc--ccCCcccccC--CCccCccccccceeEEEeehccceeeecHHHHHh----cCC Q lcl|Aclame:pro 1 ENGINLA-----------NLCEYPN--DIGDAADVAE--GGEISLDKIGTTTKSVTIKKAAKGTEITDEAALS----GYG 61 (231) Q Consensus 1 ~~~~~~G-----------~ti~~P~--~igda~~v~E--G~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~----~~~ 61 (231) ..+++.+ .+.++-. -+..+|.+.+ +.+++.-.++.++.+++.|-++..-+.|=|...+ -+. T Consensus 166 ~~gt~~~~~~~~~~~a~~~~y~~~~GMsTa~aE~lg~s~~~~f~EMaFsIeK~tVtAKSRaLKAeYTiELAQDLKAiHGL 245 (470) T protein:vir:10 166 QQGSNPGLLNSTAAQTNATDYNVGQGMRTDSAEDLGDGTGDQFNQMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGL 245 (470) T ss_pred cccccccccccccccccccccccccccchHHhhhcCCCCCcccceeeeEEEEEEEEeeccceeccccHHHHHHHHHhcCC Confidence 1111111 0111100 0112233332 3456666778889999999887777777665432 468 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-----cc--------ccc----CHHHHHHHHHHhh---------ccCC Q lcl|Aclame:pro 62 DPIGESNKQLGLSLANKVDDDLLKAAKTTSQT-----VS--------TKA----NVDGVQAALDIFN---------DEDA 115 (231) Q Consensus 62 d~~~~~~~~~a~~ia~~vd~~~~~~l~t~~~~-----~~--------~~~----~~d~i~da~~~l~---------~~~~ 115 (231) |.-++..+-|+..|...|+.+++..+.+.+.. ++ ... ..+.....+-.+. .... T Consensus 246 DAEtELaNILStEImlEINReii~~l~~~a~~~k~~~~~~~Gv~Dl~~~~~gr~~~e~~~~l~~~i~~ean~i~~~t~r~ 325 (470) T protein:vir:10 246 NAEAELANILSTEILAEINREVIRTIYNVAEPGAQANVAAAGTFDLDTDSNGRWSVEKFKGLIFQIERDANAIAQRTRRG 325 (470) T ss_pred ChhHHHHHHHHHHHHHHhcHHHHHHHhhhhhhceeccccccceEEeecccchhHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 99999999999999999999999988654321 11 111 1222222222221 1235 Q ss_pred CceEEEECHHHHHHHHhhhhhhhcc----ccccCceeeeccceeecc-eeEEEcCCCccC--ceEEEEEecCCceE---- Q lcl|Aclame:pro 116 QAYVLIVNPKDAAKIRKDANAKNIG----SEVGANALINGTYADVLG-AQIVRSKKLAEG--SALMFKIVSNSPAL---- 184 (231) Q Consensus 116 ~~~v~vv~p~~~~~L~k~~~~~~~~----~~~~~~~~~~G~ig~~~G-~~Vv~s~~~~~~--~~~~~~~~~~~~A~---- 184 (231) ..++++++|.+++.|.. ..+.... .....|-..+-..|.+.| ++|+++.-+..+ ...-+.++..+|.- T Consensus 326 ~~n~~i~S~~Va~~La~-sG~l~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~d~y~~~~~~a~~dy~~vG~KG~~~~~~ 404 (470) T protein:vir:10 326 KGNMILCSADVASALTM-AGVLDYTPALNANLNVDDTGNTFAGILQGKYRVYIDPFSASGGAAATQYYVVGYKGSSPYDA 404 (470) T ss_pred cceEEEEchhHHhHhhh-ccccccccccccccccCCCCceEEEEecCceEEEeeccccccCcccccEEEEEEecCcceec Confidence 77899999999998843 2322211 011111111112466544 789988633321 11111111111111 Q ss_pred EEeecCC-ccceecc--chhhcccEEEEEEEEEEEEEcCCcE-----------------EEEEeccC Q lcl|Aclame:pro 185 KLVLKRG-VQVETDR--DIVTKTTVITADEHYAAYLYDLTKV-----------------VNITFTGV 231 (231) Q Consensus 185 ~~~~k~~-v~vE~~R--d~~~~~~~i~~~~~y~~~~~~~~~v-----------------v~l~~~~~ 231 (231) +++. .+ +..+..| |+..+.-.+-...|||..+ ||=.. -++.+++. T Consensus 405 glfy-~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP~~~~~~~~~~~i~~~~n~y~r~~~v~~l 469 (470) T protein:vir:10 405 GLFY-CPYVPLQMVRAVGQDTFQPKIGFKTRYGLVE-NPFSQGTTQGLGTLTRNSNRYYRRVKVANL 469 (470) T ss_pred ceee-ccccccccCCCCCCccccceeeeeeeeceee-cCcccCCCcccccccCCCCceeeEEEeecc Confidence 1111 11 2222223 7888888888888988765 44211 12333333 No 225 >protein:vir:80986 Length: 528 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469506;genbank:gi:157311463;genbank:GeneID:5602119 Probab=86.35 E-value=0.046 Score=27.91 Aligned_cols=228 Identities=12% Similarity=0.070 Sum_probs=115.0 Q ss_pred CCCcccCc-----------eEEeccccCCcccccC---------CCccCccccccceeEEEeehccceeeecHHHHHh-- Q lcl|Aclame:pro 1 ENGINLAN-----------LCEYPNDIGDAADVAE---------GGEISLDKIGTTTKSVTIKKAAKGTEITDEAALS-- 58 (231) Q Consensus 1 ~~~~~~G~-----------ti~~P~~igda~~v~E---------G~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~-- 58 (231) -.+...++ +.++. .|-++..+| +.+++.=.++.++.+++.|-++..=+.|=|...+ T Consensus 210 ~~gt~~~~~~~~~~~~~~~~~~~~--~Gm~Ta~AE~le~lg~ss~~~f~EMaFsIEKvTVtAKSRaLKAEYTiELAQDLK 287 (528) T protein:vir:80 210 KAGSESEDEVVMKLMEEGKLAEIA--FGMATSIAEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLR 287 (528) T ss_pred ccCCcccccccccccccccccccc--cccchhhhhhhcccCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHH Confidence 11111222 22221 122333333 3346666778889999999887777777765432 Q ss_pred --cCCCHHHHHHHHHHHHHHHHHHHHHHHHhccccc-----------c------cccccC-------HHHHHHHHHHhhc Q lcl|Aclame:pro 59 --GYGDPIGESNKQLGLSLANKVDDDLLKAAKTTSQ-----------T------VSTKAN-------VDGVQAALDIFND 112 (231) Q Consensus 59 --~~~d~~~~~~~~~a~~ia~~vd~~~~~~l~t~~~-----------~------~~~~~~-------~d~i~da~~~l~~ 112 (231) -+-|.-++..+-|+..|...|+.+++..+..... + .+.+.+ ++.....+..+.. T Consensus 288 AIHGLDAEtELaNILStEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~~dl~~~~d~~g~r~~~e~~k~L~~~i~~ 367 (528) T protein:vir:80 288 AVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDK 367 (528) T ss_pred HhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhheeeeeeeeeeeccccccceeeccccccccccchhHHHHHHHHHHHHH Confidence 3689999999999999999999999876632110 0 011111 2221222222221 Q ss_pred c--------C-CCceEEEECHHHHHHHHhhh-hhhh--ccccccCceeeec--cceeec-ceeEEEcCCCccCceE-EEE Q lcl|Aclame:pro 113 E--------D-AQAYVLIVNPKDAAKIRKDA-NAKN--IGSEVGANALING--TYADVL-GAQIVRSKKLAEGSAL-MFK 176 (231) Q Consensus 113 ~--------~-~~~~v~vv~p~~~~~L~k~~-~~~~--~~~~~~~~~~~~G--~ig~~~-G~~Vv~s~~~~~~~~~-~~~ 176 (231) + . ...++++|+|++++.|...- .+-. .+...+.+.-.++ ..|.+. |++|++++..+..=.. .++ T Consensus 368 ~an~I~~~T~~~~gn~vi~S~~Va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~K 447 (528) T protein:vir:80 368 EAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAKGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYK 447 (528) T ss_pred HHHHHHHhhccccccEEEEchHHHHHHhhccccccccccccccccccCCCCceEEEEecCceEEEecCCCCcceEEEEEe Confidence 1 1 24589999999999997631 0000 0000000100111 245554 4799999877642110 011 Q ss_pred EecCCceEE-EeecCCccceeccchhhcccEEEEEEEEEEEEEcCCc--------------------------EEEEEec Q lcl|Aclame:pro 177 IVSNSPALK-LVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTK--------------------------VVNITFT 229 (231) Q Consensus 177 ~~~~~~A~~-~~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~--------------------------vv~l~~~ 229 (231) -.....+.. +..--+...-.-.|+..+.-.+-...|||..+ ||=. ..++.+| T Consensus 448 G~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~~~Vk 526 (528) T protein:vir:80 448 GDNEMDAGIYYAPYVALTPLRATDPQSFHPVLGFKTRYGIGI-NPFADSKSQAPSARITSGMLSKDSVGKNAYFRRVWVK 526 (528) T ss_pred CCcccccceeecccccceeeEeeCCccccceeeeeeeeceee-cCcccccCCcccccccccchhhhhcCccceeEEeeec Confidence 000000111 11111122223458889999999999998865 4411 1222333 Q ss_pred cC Q lcl|Aclame:pro 230 GV 231 (231) Q Consensus 230 ~~ 231 (231) +. T Consensus 527 ~~ 528 (528) T protein:vir:80 527 GC 528 (528) T ss_pred cC Confidence 33 No 226 >protein:vir:107826 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996627;genbank:gi:45580761;genbank:GeneID:2767902 Probab=85.99 E-value=0.049 Score=27.78 Aligned_cols=172 Identities=10% Similarity=0.040 Sum_probs=96.5 Q ss_pred CCCcccCceEEeccccCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHH---HHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGE---SNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~---~~~~~a~~ia~ 77 (231) .+++-+.-+| ..-...++...=++.+++++-++.+++.++.-.+..++|+.. +.+-.++.-+. -.++..++|.. T Consensus 48 N~~t~~~~~v--rt~LP~~~fR~lN~g~~~s~~tt~q~t~~l~ilgg~~eVDk~-la~~~Gn~~~~ra~e~~~~ik~m~~ 124 (331) T protein:vir:10 48 NGFTEHKTTV--RSGLPTGTWRKLNYGVQPEKSRTVQVKDSMGMLETYAEVDKA-LADLNGNSAAWRLSEDRAFIEGMNQ 124 (331) T ss_pred cCCccceeeE--EeccCCchhhccCCccCcccceeEEEEEEEEEeccceeechH-HHhhcCCHHHHHHHHHHHHHHHHHH Confidence 2222222222 222334556666777999999999999999999999999875 44444555433 34446667777 Q ss_pred HHHHHHHHH-----------hc------------------cc--ccc--------------------------------- Q lcl|Aclame:pro 78 KVDDDLLKA-----------AK------------------TT--SQT--------------------------------- 93 (231) Q Consensus 78 ~vd~~~~~~-----------l~------------------t~--~~~--------------------------------- 93 (231) ++...++.+ |. ++ ..+ T Consensus 125 ~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~ 204 (331) T protein:vir:10 125 TQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDT 204 (331) T ss_pred HHHHHHhcCCcccChhhhccchhhccccccccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCcee Confidence 776666531 10 00 000 Q ss_pred -------------------------------------c--------ccccCHHHHHHHHHHhhccCCCceEEEECHHHHH Q lcl|Aclame:pro 94 -------------------------------------V--------STKANVDGVQAALDIFNDEDAQAYVLIVNPKDAA 128 (231) Q Consensus 94 -------------------------------------~--------~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~ 128 (231) + +...-.+.+.+|...+-.-.....+++||..... T Consensus 205 ~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~ 284 (331) T protein:vir:10 205 LIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRS 284 (331) T ss_pred eecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHH Confidence 0 0000113344555555333456678999999999 Q ss_pred HHHhhhhhhhccccccCceeeeccceeecceeEEEcCCCccCceEEE Q lcl|Aclame:pro 129 KIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMF 175 (231) Q Consensus 129 ~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~ 175 (231) .|++.............+-...-.+-.+.|+||-+++.+-.++...+ T Consensus 285 ~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipir~~dai~~tE~~Vv 331 (331) T protein:vir:10 285 FLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRRTDALLLTEARVV 331 (331) T ss_pred HHHHHHhhccceeeeeeeecCCcceeEECCeeEEEeeeeecCccccC Confidence 99885332211110111111112345789999999998777665433 No 227 >protein:vir:98525 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996579;genbank:gi:45569510;genbank:GeneID:2767853 Probab=85.99 E-value=0.049 Score=27.78 Aligned_cols=172 Identities=10% Similarity=0.040 Sum_probs=96.5 Q ss_pred CCCcccCceEEeccccCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHH---HHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGE---SNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~---~~~~~a~~ia~ 77 (231) .+++-+.-+| ..-...++...=++.+++++-++.+++.++.-.+..++|+.. +.+-.++.-+. -.++..++|.. T Consensus 48 N~~t~~~~~v--rt~LP~~~fR~lN~g~~~s~~tt~q~t~~l~ilgg~~eVDk~-la~~~Gn~~~~ra~e~~~~ik~m~~ 124 (331) T protein:vir:98 48 NGFTEHKTTV--RSGLPTGTWRKLNYGVQPEKSRTVQVKDSMGMLETYAEVDKA-LADLNGNSAAWRLSEDRAFIEGMNQ 124 (331) T ss_pred cCCccceeeE--EeccCCchhhccCCccCcccceeEEEEEEEEEeccceeechH-HHhhcCCHHHHHHHHHHHHHHHHHH Confidence 2222222222 222334556666777999999999999999999999999875 44444555433 34446667777 Q ss_pred HHHHHHHHH-----------hc------------------cc--ccc--------------------------------- Q lcl|Aclame:pro 78 KVDDDLLKA-----------AK------------------TT--SQT--------------------------------- 93 (231) Q Consensus 78 ~vd~~~~~~-----------l~------------------t~--~~~--------------------------------- 93 (231) ++...++.+ |. ++ ..+ T Consensus 125 ~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~ 204 (331) T protein:vir:98 125 TQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDT 204 (331) T ss_pred HHHHHHhcCCcccChhhhccchhhccccccccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCcee Confidence 776666531 10 00 000 Q ss_pred -------------------------------------c--------ccccCHHHHHHHHHHhhccCCCceEEEECHHHHH Q lcl|Aclame:pro 94 -------------------------------------V--------STKANVDGVQAALDIFNDEDAQAYVLIVNPKDAA 128 (231) Q Consensus 94 -------------------------------------~--------~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~ 128 (231) + +...-.+.+.+|...+-.-.....+++||..... T Consensus 205 ~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~ 284 (331) T protein:vir:98 205 LIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRS 284 (331) T ss_pred eecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHH Confidence 0 0000113344555555333456678999999999 Q ss_pred HHHhhhhhhhccccccCceeeeccceeecceeEEEcCCCccCceEEE Q lcl|Aclame:pro 129 KIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMF 175 (231) Q Consensus 129 ~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~ 175 (231) .|++.............+-...-.+-.+.|+||-+++.+-.++...+ T Consensus 285 ~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipir~~dai~~tE~~Vv 331 (331) T protein:vir:98 285 FLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRRTDALLLTEARVV 331 (331) T ss_pred HHHHHHhhccceeeeeeeecCCcceeEECCeeEEEeeeeecCccccC Confidence 99885332211110111111112345789999999998777665433 No 228 >protein:vir:107388 Length: 331 # NCBI annotation: Bbp17 # Family: family:all:1903 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958686;genbank:gi:41179378;genbank:GeneID:2717182 Probab=85.99 E-value=0.049 Score=27.78 Aligned_cols=172 Identities=10% Similarity=0.040 Sum_probs=96.5 Q ss_pred CCCcccCceEEeccccCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHH---HHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGE---SNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~---~~~~~a~~ia~ 77 (231) .+++-+.-+| ..-...++...=++.+++++-++.+++.++.-.+..++|+.. +.+-.++.-+. -.++..++|.. T Consensus 48 N~~t~~~~~v--rt~LP~~~fR~lN~g~~~s~~tt~q~t~~l~ilgg~~eVDk~-la~~~Gn~~~~ra~e~~~~ik~m~~ 124 (331) T protein:vir:10 48 NGFTEHKTTV--RSGLPTGTWRKLNYGVQPEKSRTVQVKDSMGMLETYAEVDKA-LADLNGNSAAWRLSEDRAFIEGMNQ 124 (331) T ss_pred cCCccceeeE--EeccCCchhhccCCccCcccceeEEEEEEEEEeccceeechH-HHhhcCCHHHHHHHHHHHHHHHHHH Confidence 2222222222 222334556666777999999999999999999999999875 44444555433 34446667777 Q ss_pred HHHHHHHHH-----------hc------------------cc--ccc--------------------------------- Q lcl|Aclame:pro 78 KVDDDLLKA-----------AK------------------TT--SQT--------------------------------- 93 (231) Q Consensus 78 ~vd~~~~~~-----------l~------------------t~--~~~--------------------------------- 93 (231) ++...++.+ |. ++ ..+ T Consensus 125 ~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~ 204 (331) T protein:vir:10 125 TQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDT 204 (331) T ss_pred HHHHHHhcCCcccChhhhccchhhccccccccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCcee Confidence 776666531 10 00 000 Q ss_pred -------------------------------------c--------ccccCHHHHHHHHHHhhccCCCceEEEECHHHHH Q lcl|Aclame:pro 94 -------------------------------------V--------STKANVDGVQAALDIFNDEDAQAYVLIVNPKDAA 128 (231) Q Consensus 94 -------------------------------------~--------~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~ 128 (231) + +...-.+.+.+|...+-.-.....+++||..... T Consensus 205 ~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~ 284 (331) T protein:vir:10 205 LIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRS 284 (331) T ss_pred eecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHH Confidence 0 0000113344555555333456678999999999 Q ss_pred HHHhhhhhhhccccccCceeeeccceeecceeEEEcCCCccCceEEE Q lcl|Aclame:pro 129 KIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMF 175 (231) Q Consensus 129 ~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~ 175 (231) .|++.............+-...-.+-.+.|+||-+++.+-.++...+ T Consensus 285 ~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipir~~dai~~tE~~Vv 331 (331) T protein:vir:10 285 FLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRRTDALLLTEARVV 331 (331) T ss_pred HHHHHHhhccceeeeeeeecCCcceeEECCeeEEEeeeeecCccccC Confidence 99885332211110111111112345789999999998777665433 No 229 >protein:vir:101811 Length: 529 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238888;genbank:gi:66391963;genbank:GeneID:3416638 Probab=85.81 E-value=0.05 Score=27.71 Aligned_cols=230 Identities=12% Similarity=0.012 Sum_probs=116.9 Q ss_pred CCCcccCceEEecc--ccCCccccc-----CCCccCccccccceeEEEeehccceeeecHHHHHh----cCCCHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPN--DIGDAADVA-----EGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALS----GYGDPIGESNK 69 (231) Q Consensus 1 ~~~~~~G~ti~~P~--~igda~~v~-----EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~----~~~d~~~~~~~ 69 (231) -.++..|...++.. .+..+|.+. -+.+++.=.++.++.+++.|-++..=+.|=|...+ -+.|.-++..+ T Consensus 222 ~~~~a~~~~~~~~~GmsTa~aEaL~~~ggss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsN 301 (529) T protein:vir:10 222 NAAIGEGKLAEIAEGMATSIAELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNG 301 (529) T ss_pred ccccccccccccccchhhhhhhccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHH Confidence 11222233333321 122222221 12346666778889999999888777777775433 36899999999 Q ss_pred HHHHHHHHHHHHHHHHHhccccc-----------ccccccCH-------------HHHHHHHHHhhcc--------C-CC Q lcl|Aclame:pro 70 QLGLSLANKVDDDLLKAAKTTSQ-----------TVSTKANV-------------DGVQAALDIFNDE--------D-AQ 116 (231) Q Consensus 70 ~~a~~ia~~vd~~~~~~l~t~~~-----------~~~~~~~~-------------d~i~da~~~l~~~--------~-~~ 116 (231) -|+..|...|+.+++..+.+.+. +-++.+++ +........+..+ . .. T Consensus 302 ILStEImlEINReii~~l~~~a~~~~~~~~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~~~L~~~i~~~an~I~~~T~rg~ 381 (529) T protein:vir:10 302 ILANEVMLEINREVIDWINYTAQVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGA 381 (529) T ss_pred HHHHHHHHHhhHHHHHHHhhhhhhhccccccccccccceeecccCccccccchHHHHHHHHHHHHHHHHHHHHHhhcccc Confidence 99999999999999988764331 00111111 1111111222211 1 35 Q ss_pred ceEEEECHHHHHHHHhhhhhhhc-----cccccCceeeeccceeec-ceeEEEcCCCccCceE-EEEEecCCceEEEe-e Q lcl|Aclame:pro 117 AYVLIVNPKDAAKIRKDANAKNI-----GSEVGANALINGTYADVL-GAQIVRSKKLAEGSAL-MFKIVSNSPALKLV-L 188 (231) Q Consensus 117 ~~v~vv~p~~~~~L~k~~~~~~~-----~~~~~~~~~~~G~ig~~~-G~~Vv~s~~~~~~~~~-~~~~~~~~~A~~~~-~ 188 (231) .++++|+|+++..|..---.... ......+-..+...|.+. |++|++++..+..=.. .++-.....+..++ . T Consensus 382 ~n~vi~S~~Va~~L~~~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~P 461 (529) T protein:vir:10 382 GNFIIASRNVVSALALIDTNISPAAQGMASGLNADTTKGVFAGILGGRYKVYIDQYARQDYFTMGYRGANNLDAGIYYCP 461 (529) T ss_pred ceEEEEchHHHHHHHhhcccccccccccccccccccCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeecc Confidence 78999999999998731100000 000000111112335543 3799999877642110 01100000011111 1 Q ss_pred cCCccceeccchhhcccEEEEEEEEEEEEEcCCc--------------------------EEEEEeccC Q lcl|Aclame:pro 189 KRGVQVETDRDIVTKTTVITADEHYAAYLYDLTK--------------------------VVNITFTGV 231 (231) Q Consensus 189 k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~--------------------------vv~l~~~~~ 231 (231) --+...-.--|+..+.-.+-...|||..+ ||=. ..++.+|+. T Consensus 462 Yv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~~~Vk~l 529 (529) T protein:vir:10 462 YVALTPLRGFDPKNFQPVMGFKTRYAIGV-NPFAESRTQAPQGRITSGMPGVNSVGKNAYFRRVWVKGL 529 (529) T ss_pred ccccccccccCCCcccceeeeeeeeceee-cCccccccccccccccCCcchhhhcCccceeEEeeeccC Confidence 11111111248899999999999998865 5511 123333444 No 230 >protein:vir:107947 Length: 519 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595301;genbank:gi:161622607;genbank:GeneID:5783666 Probab=85.51 E-value=0.052 Score=27.61 Aligned_cols=224 Identities=11% Similarity=0.041 Sum_probs=113.0 Q ss_pred CCCcccC------------------ceEEeccccCCcccccC---------CCccCccccccceeEEEeehccceeeecH Q lcl|Aclame:pro 1 ENGINLA------------------NLCEYPNDIGDAADVAE---------GGEISLDKIGTTTKSVTIKKAAKGTEITD 53 (231) Q Consensus 1 ~~~~~~G------------------~ti~~P~~igda~~v~E---------G~~i~~~~lt~~~~~~tikk~g~~~~itD 53 (231) ...++.| ++.+++ .|-.+..+| +.+++.-.++.++.+++.|-++..=+.|= T Consensus 193 ~~t~~ag~t~~~~~~~a~~~~~~~~~~~~~~--~gmsTa~aEal~~lggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTi 270 (519) T protein:vir:10 193 AVTVDAGATDAAKLDAAVTALVEAGQLAEIA--EGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSI 270 (519) T ss_pred ccccCCCCcCccccccccccccccccccccc--cccccchhhccccCCCccccchhhhceeEEEEEEeeecccccccccH Confidence 1111122 222221 122233333 23466667888899999998887777777 Q ss_pred HHHHh----cCCCHHHHHHHHHHHHHHHHHHHHHHHHhcccc-----ccc------ccccC-------------HHHHHH Q lcl|Aclame:pro 54 EAALS----GYGDPIGESNKQLGLSLANKVDDDLLKAAKTTS-----QTV------STKAN-------------VDGVQA 105 (231) Q Consensus 54 ~~~~~----~~~d~~~~~~~~~a~~ia~~vd~~~~~~l~t~~-----~~~------~~~~~-------------~d~i~d 105 (231) |...+ -+.|.-++..+-|+..|...|+.+++..+..++ +-+ .+-++ .+.... T Consensus 271 ELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~~g~t~~~~~~aGv~d~~~~~d~~~~rw~~e~~k~ 350 (519) T protein:vir:10 271 ELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKA 350 (519) T ss_pred HHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhhhcceeecccCcccccceeecccccccccchHHHHHHHH Confidence 65432 368999999999999999999999997552111 000 01111 111111 Q ss_pred HHHHhhcc--------C-CCceEEEECHHHHHHHHhhhhhhhcccc---ccCceeeec--cceeec-ceeEEEcCCCccC Q lcl|Aclame:pro 106 ALDIFNDE--------D-AQAYVLIVNPKDAAKIRKDANAKNIGSE---VGANALING--TYADVL-GAQIVRSKKLAEG 170 (231) Q Consensus 106 a~~~l~~~--------~-~~~~v~vv~p~~~~~L~k~~~~~~~~~~---~~~~~~~~G--~ig~~~-G~~Vv~s~~~~~~ 170 (231) .+..+..+ . ...++++|+|+++..|....-....... .+.+.-..+ ..|.+. |++|++++..+.. T Consensus 351 L~~~i~~~an~I~~~T~r~~gn~ii~S~~Va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d 430 (519) T protein:vir:10 351 LLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSD 430 (519) T ss_pred HHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhccchhccccccccccccccCCCceEEEEecCceEEEecCCCCcc Confidence 11222111 1 3458999999999999875521111100 000000011 135553 4799999887742 Q ss_pred ceEEEEEecCCc---eEEEe-ecCCcccee--ccchhhcccEEEEEEEEEEEEEcCCcE--------------------- Q lcl|Aclame:pro 171 SALMFKIVSNSP---ALKLV-LKRGVQVET--DRDIVTKTTVITADEHYAAYLYDLTKV--------------------- 223 (231) Q Consensus 171 ~~~~~~~~~~~~---A~~~~-~k~~v~vE~--~Rd~~~~~~~i~~~~~y~~~~~~~~~v--------------------- 223 (231) .+.+=+. +.. +..++ .- +..+. --|+..+.-.+-...|||..+ ||=.- T Consensus 431 -y~~vG~K-G~~~~~~glfyaPY--v~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP~~~~~~~~~~~~i~~g~~~~a~~~ 505 (519) T protein:vir:10 431 -YFTIGYK-GSNEMDAGIYYAPY--VALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFADPAAQAPTKRIQNGMPDIVNSL 505 (519) T ss_pred -eEEEEEe-cCcccccceeeccc--cccccccccCCccccceeeeeeeeceee-cCcccccccCccceeccCchhhhccc Confidence 1110000 100 11111 11 11122 238888888999999998764 55210 Q ss_pred ------EEEEeccC Q lcl|Aclame:pro 224 ------VNITFTGV 231 (231) Q Consensus 224 ------v~l~~~~~ 231 (231) -++.+|+. T Consensus 506 ~~n~y~r~v~v~~~ 519 (519) T protein:vir:10 506 GLNGYFRRVYVKGI 519 (519) T ss_pred cCceeeeeeeeecC Confidence 01111111 No 231 >protein:vir:106734 Length: 336 # NCBI annotation: gp13 # Family: family:all:1653 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944321;genbank:gi:38638620;genbank:GeneID:2657363 Probab=85.38 E-value=0.053 Score=27.57 Aligned_cols=219 Identities=13% Similarity=0.072 Sum_probs=113.1 Q ss_pred CC----------CcccC----ceEEecc--ccCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHh---cCC Q lcl|Aclame:pro 1 EN----------GINLA----NLCEYPN--DIGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALS---GYG 61 (231) Q Consensus 1 ~~----------~~~~G----~ti~~P~--~igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~---~~~ 61 (231) +- -.+.| .++.||. ..|-+..++.+.++|..+...+..+.++..++-++.++.++... .+. T Consensus 66 ~~~~~~~~~l~~v~t~g~w~~~~~~~~~~e~~G~a~~ygd~~d~P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~ 145 (336) T protein:vir:10 66 LVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRV 145 (336) T ss_pred eechhchhhhcccccCCCcceeeEEEEeeeeeeeEEEccccCCCcceeeeeeeeeeeEEEEEEEEeeCHHHHHHHHHhCC Confidence 00 01123 3456653 36888888888999999999999999999999999999766544 366 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHH--------hc-----cccccccc--------ccCHHHHHHHHHHhhccC------ Q lcl|Aclame:pro 62 DPIGESNKQLGLSLANKVDDDLLKA--------AK-----TTSQTVST--------KANVDGVQAALDIFNDED------ 114 (231) Q Consensus 62 d~~~~~~~~~a~~ia~~vd~~~~~~--------l~-----t~~~~~~~--------~~~~d~i~da~~~l~~~~------ 114 (231) +..++=.+...+++..++++-.+-. +- .+..+.++ .--+++|..++..+.... T Consensus 146 ~l~~~Ka~aA~~ale~~~N~~~~~Gd~~~~~~GllN~P~l~a~~t~~~~~w~~~T~~eI~~Di~~~~~~l~~qt~g~i~~ 225 (336) T protein:vir:10 146 DLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQ 225 (336) T ss_pred CcHHHHHHHHHHHHHHhhCeEEEEeecccceEEEeecCCCCcccccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeee Confidence 7776666666677777666432211 00 01111111 123566777776664322 Q ss_pred CCceEEEECHHHHHHHHhhhhhhhccccccCceeeeccceeecceeEEEcCCCcc-CceEEEEE---ecCCceEEEeecC Q lcl|Aclame:pro 115 AQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAE-GSALMFKI---VSNSPALKLVLKR 190 (231) Q Consensus 115 ~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~-~~~~~~~~---~~~~~A~~~~~k~ 190 (231) ..+..++++|..+..|.+-..+ +. ...+.+.. .+=+++|+..+.+.. +......+ ..++.-+.+..-. T Consensus 226 ~~~~tL~Lp~~~~~~L~~~n~~---g~-tv~~~lk~----n~Pnl~i~t~pel~~Agg~~~~~~~~~~~~~~t~~~~~P~ 297 (336) T protein:vir:10 226 EAVLHMGLPPTAMSDLSKTNQY---GL-SAAAKLKE----IFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTE 297 (336) T ss_pred ccceEEEechHHHHhccCCCcc---Cc-cHHHHHHH----hCCccEEEEcccccccCCceEEEEEecccCCcceeeecCh Confidence 2356899999999888642111 00 01111111 122456666554432 22111111 1111111111111 Q ss_pred Cccc-eeccchhhcccEEEEEE-EEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 191 GVQV-ETDRDIVTKTTVITADE-HYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 191 ~v~v-E~~Rd~~~~~~~i~~~~-~y~~~~~~~~~vv~l~~~~~ 231 (231) .... ...+ ......+.... -.|+-++.|-+++++ .|+ T Consensus 298 ~f~~lpvq~--~~~~~~v~~~~rt~Gv~i~rP~ai~~~--~GI 336 (336) T protein:vir:10 298 KMRAHSIER--YSSYFRQKKSAGTWGAVIFRPFAVAQM--LGV 336 (336) T ss_pred hhhccceee--cCceeEeccccceeeeeeeccchheee--ccC Confidence 1110 0111 11222222222 368888999998876 455 No 232 >protein:vir:106286 Length: 534 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944113;genbank:gi:38640157;genbank:GeneID:2658034 Probab=84.63 E-value=0.059 Score=27.32 Aligned_cols=226 Identities=12% Similarity=0.036 Sum_probs=115.9 Q ss_pred CCCccc----------------------C----------ceEEeccccCCcccccC---------CCccCccccccceeE Q lcl|Aclame:pro 1 ENGINL----------------------A----------NLCEYPNDIGDAADVAE---------GGEISLDKIGTTTKS 39 (231) Q Consensus 1 ~~~~~~----------------------G----------~ti~~P~~igda~~v~E---------G~~i~~~~lt~~~~~ 39 (231) +....+ | .+.++- .|-.+..+| +.+++.-.++.++.+ T Consensus 194 t~~~~Gt~~~~~~~~~~v~~~~~~~~~ag~~~~~~~~~~~~y~~~--~gm~Ta~AE~lg~~ggs~~~~f~EMsFsIdKvt 271 (534) T protein:vir:10 194 TGVQAGTKTVQFIKDYAVDALPADQTEAGLAYKWLLANGYAVETS--SAMATAFAELQQGFNGSADNEWNEMSFRIDKQV 271 (534) T ss_pred ccccccccccccccccccccccCCccccccccccccccccceecc--cccchhhHhhhccCCCCcccchhhcceEEEEEE Confidence 000000 1 111111 122222222 124666678888999 Q ss_pred EEeehccceeeecHHHHHh----cCCCHHHHHHHHHHHHHHHHHHHHHHHHhcccccc--c---------------cccc Q lcl|Aclame:pro 40 VTIKKAAKGTEITDEAALS----GYGDPIGESNKQLGLSLANKVDDDLLKAAKTTSQT--V---------------STKA 98 (231) Q Consensus 40 ~tikk~g~~~~itD~~~~~----~~~d~~~~~~~~~a~~ia~~vd~~~~~~l~t~~~~--~---------------~~~~ 98 (231) ++.|-++..-+.|=|...+ -+.|.-++..+-|+..|...|+.+++..+.+.+.. . .... T Consensus 272 VtAKSRaLKAEYTiELAQDLKAIHGLDAEtELsNILSTEImlEINReii~~l~~~a~~~k~~~~~~~~~~~G~~d~~~~~ 351 (534) T protein:vir:10 272 VEAKSRQLKAQYSIEMAQDLRAVHGLDADSELSSILANEIMHEINREMVLWINATAKVGKTGWTNMHGGKAGVFDFQDTK 351 (534) T ss_pred EeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhhhheeecccccccccccceeeeeccc Confidence 9999887777777765432 36899999999999999999999999887643211 0 0111 Q ss_pred C-------HHHHHHHHHHhhcc---------CCCceEEEECHHHHHHHHhhhhhhhcc---ccccCceeeec--cceeec Q lcl|Aclame:pro 99 N-------VDGVQAALDIFNDE---------DAQAYVLIVNPKDAAKIRKDANAKNIG---SEVGANALING--TYADVL 157 (231) Q Consensus 99 ~-------~d~i~da~~~l~~~---------~~~~~v~vv~p~~~~~L~k~~~~~~~~---~~~~~~~~~~G--~ig~~~ 157 (231) + .+.+......+..+ -...++++|+|++++.|....-..... ...+.+.-.++ ..|.+. T Consensus 352 ~~~~~~~~~e~~~~L~~~i~~~an~i~~~T~rg~~n~~v~S~~Va~~L~~~g~l~~~~~~~~~~~~~~d~~~~~~~G~l~ 431 (534) T protein:vir:10 352 DIRGARWAGESYKALVVQIDKEANEIARQTGRGQGNFIICSRNVAAALGHTDMLMTPAVMGANTTMNTDTTSSLFAGVLA 431 (534) T ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchhHHHHHhhccchhccccccccccccccCCCceEEEEec Confidence 1 12222222222221 135789999999999997643221110 00000000111 245554 Q ss_pred -ceeEEEcCCCccCceEEEEEecCCc---eEEE-eecCCccceeccchhhcccEEEEEEEEEEEEEcCCc---------- Q lcl|Aclame:pro 158 -GAQIVRSKKLAEGSALMFKIVSNSP---ALKL-VLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTK---------- 222 (231) Q Consensus 158 -G~~Vv~s~~~~~~~~~~~~~~~~~~---A~~~-~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~---------- 222 (231) |++|++++..+.. .+.+=+. +.. +..+ ..--+......-|+..+.-.+-...|||..+ ||=. T Consensus 432 ~~~~vy~D~y~~~d-y~~vG~K-G~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP~~~~~~~~~~~~ 508 (534) T protein:vir:10 432 GKYRVYIDQYAVED-YFTVGYK-GASEMDAGLYYCPYVALTPLRGTDPKNFQPVLGFKTRYGVKL-HPMADATQNKGFAK 508 (534) T ss_pred CceEEEecCCCCcc-eEEEEEe-CCcccccceeeccccccccccccCCccccceeeeeeeeceee-cCcccccCCccccc Confidence 4799999887742 1111000 100 1111 1111112222348888888999999998765 4411 Q ss_pred -----------------EEEEEeccC Q lcl|Aclame:pro 223 -----------------VVNITFTGV 231 (231) Q Consensus 223 -----------------vv~l~~~~~ 231 (231) ..++.+|+. T Consensus 509 i~~g~~~~~~~ag~n~~~~~~~Vk~l 534 (534) T protein:vir:10 509 ISNGMPQHTNMFGKNAFFRRVLVAGV 534 (534) T ss_pred cccCCcchhhhcccccceeeeeeecC Confidence 122333333 No 233 >protein:vir:80835 Length: 464 # NCBI annotation: putative major capsid protein # Family: family:all:2450 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504125;genbank:gi:158079312;genbank:GeneID:5666484 Probab=81.79 E-value=0.083 Score=26.52 Aligned_cols=226 Identities=11% Similarity=0.073 Sum_probs=114.2 Q ss_pred CCCcccCceEEeccc--cCCcccccCCCccCccccccceeEEEeehccceeeecHHHH--HhcCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAA--LSGYGDPIGESNKQLGLSLA 76 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~--~~~~~d~~~~~~~~~a~~ia 76 (231) .+-++--+-+. .| +|+...+.|+..++.++.+...+++.+|-. ...++.+.+. .++..||+.+..+.....++ T Consensus 70 ~STV~~y~~~~--~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~Kfl-~~~r~vsia~~lvn~~~d~~~~~~~dai~~va 146 (464) T protein:vir:80 70 TSTVAKYDVYL--AHGRVGHTRFTREIGVAPISDPNLRQKTVNMKYV-SDTKNMSIATGLVNNIEDPMRILTDDAISVVA 146 (464) T ss_pred hhhhhhhheee--ccCccccccccccccccccCCCceEEEEEEeeee-ecceeeeeehhhhcchhhHHHHHHHHHHHHHH Confidence 22222221111 22 577788999999999999999999998843 3333333332 55677999998888888888 Q ss_pred HHHHHHHHHHhc---ccccc------------c---------ccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHh Q lcl|Aclame:pro 77 NKVDDDLLKAAK---TTSQT------------V---------STKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRK 132 (231) Q Consensus 77 ~~vd~~~~~~l~---t~~~~------------~---------~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k 132 (231) ..++-.+|-+-. ..+.. + ...++-+.|..|....+..-..++-++||+.+.++++. T Consensus 147 ~tiE~a~FyGds~l~~~~~~~~gleFDGl~~lI~~~NViDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~v~a~f~n 226 (464) T protein:vir:80 147 KTIEWASFYGDSDLSENPDAGSGLEFDGLAKLIDKHNVLDAKGASLTEALLNQASVLVGKGYGTPTDAYMPIGVQADFVN 226 (464) T ss_pred HHHHHHHhhhccccCCCCCCccccchhhhHhhcCCCceeecCCCCcCHHHHhhhhhhhhcccCChhhcccchhHHHHHHh Confidence 888877764321 11110 0 11245555666666665444567789999999988743 Q ss_pred hhhhhhcc--ccccC----ceeeeccceeecc-eeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchhhccc Q lcl|Aclame:pro 133 DANAKNIG--SEVGA----NALINGTYADVLG-AQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTT 205 (231) Q Consensus 133 ~~~~~~~~--~~~~~----~~~~~G~ig~~~G-~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~~~ 205 (231) .--..... ..-+. +.-.+|.++. .| +++--|.-|......--.-...++|..= .+-.+++++.-..+.... T Consensus 227 ~~l~~q~~~~~~n~~~~~~G~~v~~f~sa-~G~i~L~~s~~m~~~~~ld~~~~~~~~apaa-psvt~tv~~~~~g~f~~~ 304 (464) T protein:vir:80 227 QQLDRQVQVISDNGQNATMGFNVKGFNSA-RGFIRLHGSTVMELEQILDENRMQLPNAPQK-ATVKATLEAGTKGKFRDE 304 (464) T ss_pred hhcCceeEEEcCCCCcceeeeeccccccc-ccceeccCccccCcccccccccccCCCCcCC-ceeEEEecCCcccCCccc Confidence 21111100 00000 1111222221 11 1222222111111000000011122110 011134556555444555 Q ss_pred EEEEEEEEEEEEEc------CCcEEEEEeccC Q lcl|Aclame:pro 206 VITADEHYAAYLYD------LTKVVNITFTGV 231 (231) Q Consensus 206 ~i~~~~~y~~~~~~------~~~vv~l~~~~~ 231 (231) .+.+...|.+.+.| |+.++..++.++ T Consensus 305 ~~~~~~~Ykv~~vn~~GeS~ps~~~~~ti~~~ 336 (464) T protein:vir:80 305 DLTIDTEYKVVVVSDDAESAPSDVASVVIDDK 336 (464) T ss_pred cccceeEEEEEEECCCCccccceeeeeeecCc Confidence 55666677777775 445666666666 No 234 >protein:vir:7214 Length: 521 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049787;genbank:gi:9632597;genbank:GeneID:1258751 Probab=80.60 E-value=0.093 Score=26.23 Aligned_cols=225 Identities=10% Similarity=0.039 Sum_probs=113.6 Q ss_pred CCCcccC------------------ceEEeccccCCcccccC---------CCccCccccccceeEEEeehccceeeecH Q lcl|Aclame:pro 1 ENGINLA------------------NLCEYPNDIGDAADVAE---------GGEISLDKIGTTTKSVTIKKAAKGTEITD 53 (231) Q Consensus 1 ~~~~~~G------------------~ti~~P~~igda~~v~E---------G~~i~~~~lt~~~~~~tikk~g~~~~itD 53 (231) ......| .+.++.. |-.+..+| +..++.=.++.++.+++.|-++..-+.|= T Consensus 195 ~~~~~~g~t~~~~t~~~v~~~~~a~~~y~~g~--gm~Ta~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTi 272 (521) T protein:vir:72 195 QVTIDAGATDAAKLDAEIKKQMEAGALVEIAE--GMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSI 272 (521) T ss_pred ccccCCCCCCccccccccccccccCceeeeec--ccchhhhhhhcccCCcccccccceeeEEEEEEEeeeccceeccccH Confidence 1112222 2222221 22222222 23456666777899999998887777777 Q ss_pred HHHHh----cCCCHHHHHHHHHHHHHHHHHHHHHHHHhcccc-----c------cccc------ccC-------HHHHHH Q lcl|Aclame:pro 54 EAALS----GYGDPIGESNKQLGLSLANKVDDDLLKAAKTTS-----Q------TVST------KAN-------VDGVQA 105 (231) Q Consensus 54 ~~~~~----~~~d~~~~~~~~~a~~ia~~vd~~~~~~l~t~~-----~------~~~~------~~~-------~d~i~d 105 (231) |...+ -+.|.-++..+-|+..|...|+.+++..+...+ . ...+ ..+ .+.... T Consensus 273 ELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~g~~g~t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~ 352 (521) T protein:vir:72 273 ELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKSGMTLTPGSKAGVFDFQDPIDIRGARWAGESFKA 352 (521) T ss_pred HHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhheeeeeeeeeeeccCccccceecccccccccchHHHHHHHH Confidence 75433 368999999999999999999999997653211 0 0011 111 111111 Q ss_pred HHHHhh--------cc-CCCceEEEECHHHHHHHHhhhhhhhcccc---ccCceeeec--cceee-cceeEEEcCCCccC Q lcl|Aclame:pro 106 ALDIFN--------DE-DAQAYVLIVNPKDAAKIRKDANAKNIGSE---VGANALING--TYADV-LGAQIVRSKKLAEG 170 (231) Q Consensus 106 a~~~l~--------~~-~~~~~v~vv~p~~~~~L~k~~~~~~~~~~---~~~~~~~~G--~ig~~-~G~~Vv~s~~~~~~ 170 (231) .+..+. .- -...++++|+|++++.|..-.-+.....+ .+-..-.++ ..|.+ -|++|++++..+.. T Consensus 353 L~~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~d 432 (521) T protein:vir:72 353 LLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQGLATGFSTDTTKSVFAGVLGGKYRVYIDQYAKQD 432 (521) T ss_pred HHHHHHHHHHHHHHhcccccceEEEEchHHHHHHhhcccccccccccccccccccCCCceEEEEccCceEEEecCCCCcc Confidence 111111 11 25678999999999998853222211111 000000001 13444 34799999877642 Q ss_pred ceE-EEEEecCCc---eEEEe-ecCCccceeccchhhcccEEEEEEEEEEEEEcCCcE-------E-------------- Q lcl|Aclame:pro 171 SAL-MFKIVSNSP---ALKLV-LKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKV-------V-------------- 224 (231) Q Consensus 171 ~~~-~~~~~~~~~---A~~~~-~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~v-------v-------------- 224 (231) =.. .++ +.. +..++ .--+...-.--|+..+.-.+-...|||..+ ||=.. . T Consensus 433 y~~vG~K---G~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP~~~~~~~~~a~~i~~~~~~~~a~~~ 508 (521) T protein:vir:72 433 YFTVGYK---GPNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFAESAAQAPASRIQSGMPSILNSLG 508 (521) T ss_pred eEEEEEe---CCcccccceeeccccccccccccCCccccceeeeeeeeceee-cCcccccCcccceeecCcChhhhcCcc Confidence 110 011 000 11111 111111111248889999999999998865 66111 1 Q ss_pred ------EEEeccC Q lcl|Aclame:pro 225 ------NITFTGV 231 (231) Q Consensus 225 ------~l~~~~~ 231 (231) ++.+++. T Consensus 509 ~~sy~r~v~v~~l 521 (521) T protein:vir:72 509 KNAYFRRVYVKGI 521 (521) T ss_pred ccceeeeeeecCC Confidence 1222222 No 235 >protein:vir:98143 Length: 524 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239203;genbank:gi:66391678;genbank:GeneID:3416245 Probab=80.51 E-value=0.094 Score=26.21 Aligned_cols=226 Identities=10% Similarity=0.030 Sum_probs=114.8 Q ss_pred CC-------------CcccCceEEeccc--cCCccccc-----CCCccCccccccceeEEEeehccceeeecHHHHHh-- Q lcl|Aclame:pro 1 EN-------------GINLANLCEYPND--IGDAADVA-----EGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALS-- 58 (231) Q Consensus 1 ~~-------------~~~~G~ti~~P~~--igda~~v~-----EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~-- 58 (231) .- +...|...+++.. +..+|.+. .+.+++.-.++.++.+++.|-++..-+.|=|...+ T Consensus 204 ~tgt~p~~~~~a~~~~~~~g~~~~~~~GmsTA~aEaL~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLK 283 (524) T protein:vir:98 204 VTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLR 283 (524) T ss_pred ccccccccccccccccccccceeecccccchhhhhhhccCCCCccccccceeeEEEEEEEeeecccccccccHHHHHHHH Confidence 01 1112223333321 22223221 14456777788899999999888777777765432 Q ss_pred --cCCCHHHHHHHHHHHHHHHHHHHHHHHHhccccc---------cc--cccc-------------CHHHHHHHHHHhhc Q lcl|Aclame:pro 59 --GYGDPIGESNKQLGLSLANKVDDDLLKAAKTTSQ---------TV--STKA-------------NVDGVQAALDIFND 112 (231) Q Consensus 59 --~~~d~~~~~~~~~a~~ia~~vd~~~~~~l~t~~~---------~~--~~~~-------------~~d~i~da~~~l~~ 112 (231) -+.|.-++..+-|+..|...|+.+++..+..++. .. ++.+ ..+........+.. T Consensus 284 AVHGLDAEtELsNILSTEImlEINReii~~i~~~a~~~~~g~t~~~~~~~G~~dl~~~~d~~~~r~~~e~~~~L~~~i~~ 363 (524) T protein:vir:98 284 AVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDK 363 (524) T ss_pred HhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhhheeceeecccccccccceeeccccccccccchhHHHHHHHHHHHHH Confidence 3689999999999999999999999976532110 00 0111 11221212222221 Q ss_pred c--------C-CCceEEEECHHHHHHHHh-hhhhhhccccc--cC--c---eeeeccceeecceeEEEcCCCccCceEEE Q lcl|Aclame:pro 113 E--------D-AQAYVLIVNPKDAAKIRK-DANAKNIGSEV--GA--N---ALINGTYADVLGAQIVRSKKLAEGSALMF 175 (231) Q Consensus 113 ~--------~-~~~~v~vv~p~~~~~L~k-~~~~~~~~~~~--~~--~---~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~ 175 (231) + . ...++++|+|++++.|.. ++.+...+... .. | .+.-|.++ -|++|++++..+..=. .+ T Consensus 364 ~an~I~~~T~rg~~n~~i~S~~Va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~--~~~~vy~D~y~~~dy~-~v 440 (524) T protein:vir:98 364 EANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLG--GTYKVYIDQYARQDYF-TV 440 (524) T ss_pred HHHHHHHhhccccccEEEEchHHHHHHhhhhcccccccchhhcccccCCccceEEEEec--CceEEEecCCCCcceE-EE Confidence 1 1 357899999999999885 23332211111 10 1 12223322 3579999988764211 00 Q ss_pred EEecCCc---eEEEe-ecCCccceeccchhhcccEEEEEEEEEEEEEcCCc--------------------------EEE Q lcl|Aclame:pro 176 KIVSNSP---ALKLV-LKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTK--------------------------VVN 225 (231) Q Consensus 176 ~~~~~~~---A~~~~-~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~--------------------------vv~ 225 (231) =+. +.. +..++ .--+...-.--|+..+.-.+-...|||..+ ||=. ..+ T Consensus 441 G~K-G~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP~~~~~~~~~~~ri~~g~~~~~~ag~n~~~r~ 518 (524) T protein:vir:98 441 GFK-GDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFANSRSQAPADRITSGMISKEMCGKNAYFRK 518 (524) T ss_pred Eee-CCcccccceeeccccccccccccCCccccceeeeeeeeceee-cCcccccCCccccccccCcchHhhcCccceeeE Confidence 000 000 11111 111111111238888888898899998765 4411 112 Q ss_pred EEeccC Q lcl|Aclame:pro 226 ITFTGV 231 (231) Q Consensus 226 l~~~~~ 231 (231) +.+|+. T Consensus 519 ~~Vk~l 524 (524) T protein:vir:98 519 VWVKGL 524 (524) T ss_pred eeeccC Confidence 222333 No 236 >protein:vir:100603 Length: 529 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656387;genbank:gi:109290138;genbank:GeneID:4156581 Probab=78.34 E-value=0.12 Score=25.72 Aligned_cols=228 Identities=13% Similarity=0.045 Sum_probs=113.4 Q ss_pred CCCcccCceEEeccccCCcccccC---------CCccCccccccceeEEEeehccceeeecHHHHHh----cCCCHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGDAADVAE---------GGEISLDKIGTTTKSVTIKKAAKGTEITDEAALS----GYGDPIGES 67 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igda~~v~E---------G~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~----~~~d~~~~~ 67 (231) ...+..|.+.+++. |-.+..+| +..++.=.++.++.+++.|-++..=+.|=|...+ -+.|.-++. T Consensus 222 ~~~~a~~~~~~~~~--gmsTa~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAvHGLDAEtEL 299 (529) T protein:vir:10 222 SAKIAAGELAEIAE--GMATSIAELRQGFNGTTDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSEL 299 (529) T ss_pred cccccccccccccc--ccchhhhhccccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHH Confidence 01111122222221 22222333 3456777788899999999888777777775433 368999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhccccc-----------cccccc-------------CHHHHHHHHHHhhcc--------C- Q lcl|Aclame:pro 68 NKQLGLSLANKVDDDLLKAAKTTSQ-----------TVSTKA-------------NVDGVQAALDIFNDE--------D- 114 (231) Q Consensus 68 ~~~~a~~ia~~vd~~~~~~l~t~~~-----------~~~~~~-------------~~d~i~da~~~l~~~--------~- 114 (231) .+-|+..|...|+.+++..+...+. +..+-+ ..+........+..+ . T Consensus 300 sNILStEImlEINReii~~i~~~a~~~~~g~~~~~~~~~gv~d~~~~~d~~~~~~~~e~~~~L~~~i~~~an~I~~~T~r 379 (529) T protein:vir:10 300 NGILANEVMLEINREVIDWINYTAQVGKSGWTQTVGSAAGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGR 379 (529) T ss_pred HHHHHHHHHHHhhHHHHHHhhhhceeeeeeeeccccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhcc Confidence 9999999999999999985432110 000111 111111111222211 1 Q ss_pred CCceEEEECHHHHHHHHhh-hhhhhccc--cccC--ceeeeccceeec-ceeEEEcCCCccCceE-EEEEecCCceEEEe Q lcl|Aclame:pro 115 AQAYVLIVNPKDAAKIRKD-ANAKNIGS--EVGA--NALINGTYADVL-GAQIVRSKKLAEGSAL-MFKIVSNSPALKLV 187 (231) Q Consensus 115 ~~~~v~vv~p~~~~~L~k~-~~~~~~~~--~~~~--~~~~~G~ig~~~-G~~Vv~s~~~~~~~~~-~~~~~~~~~A~~~~ 187 (231) ...++++|+|+++..|..- ........ ..+- +...+-..|.+. |++|++++..+..=.. .++-.....+..++ T Consensus 380 g~~n~vi~S~~Va~~L~~~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy 459 (529) T protein:vir:10 380 GAGNFIIASRNVVSALALVDAGITPAAQGMASGLNADTTKGVFAGVLGGRYKVYIDQYARQDYFTMGYRGANNLDAGIYY 459 (529) T ss_pred ccceEEEEchHHHHHHhhhccccccccccccccceeecCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceee Confidence 3578999999999999631 10000000 0000 000001235553 4789998876642110 01100000011111 Q ss_pred -ecCCccceeccchhhcccEEEEEEEEEEEEEcCCc--------------------------EEEEEeccC Q lcl|Aclame:pro 188 -LKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTK--------------------------VVNITFTGV 231 (231) Q Consensus 188 -~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~--------------------------vv~l~~~~~ 231 (231) .--+...-.--|+..+.-.+-...|||..+ ||=. ..++.+|+. T Consensus 460 ~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~~~Vk~l 529 (529) T protein:vir:10 460 CPYVALTPLRGSDPKNFQPVMGFKTRYAIGV-NPFAESRTQAPTSRISNGMPGAHSVGKNAYFRRVWVKGL 529 (529) T ss_pred ccccccccccccCCCcccceeeeeeeeceee-cCccccccccccccccCCcchhhhcCccceeeEeeeccC Confidence 111111112248888888999999998765 5521 122233333 No 237 >protein:vir:6601 Length: 528 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891732;genbank:gi:33620668;genbank:GeneID:1725275 Probab=75.41 E-value=0.15 Score=25.15 Aligned_cols=228 Identities=11% Similarity=0.061 Sum_probs=114.1 Q ss_pred CC-------CcccCc-----------eEEeccccCCcccccC---------CCccCccccccceeEEEeehccceeeecH Q lcl|Aclame:pro 1 EN-------GINLAN-----------LCEYPNDIGDAADVAE---------GGEISLDKIGTTTKSVTIKKAAKGTEITD 53 (231) Q Consensus 1 ~~-------~~~~G~-----------ti~~P~~igda~~v~E---------G~~i~~~~lt~~~~~~tikk~g~~~~itD 53 (231) -+ +.+.|+ +.++. .|-++..+| +.+++.=.++.++.+++.|-++..=+.|= T Consensus 203 ~~~~~~~~~g~~~g~~~~~~~~a~~~~~~~~--~Gm~Ta~aEale~lg~~s~~~f~EMaFsIeK~tVtAKSRaLKAEYTi 280 (528) T protein:vir:66 203 GDSVTPQKVGSESEDEVVMKLIEEGKLAEIA--FGMATSIAEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSI 280 (528) T ss_pred ccccccCcccccccccccccccccccceecc--cccchhhhhhhcccCCCcccchhhcceEEEeEEEEeeccceeccccH Confidence 01 111111 11111 122222222 23466667788899999998887777777 Q ss_pred HHHHh----cCCCHHHHHHHHHHHHHHHHHHHHHHHHhccccc-----------c------cccccC-------HHHHHH Q lcl|Aclame:pro 54 EAALS----GYGDPIGESNKQLGLSLANKVDDDLLKAAKTTSQ-----------T------VSTKAN-------VDGVQA 105 (231) Q Consensus 54 ~~~~~----~~~d~~~~~~~~~a~~ia~~vd~~~~~~l~t~~~-----------~------~~~~~~-------~d~i~d 105 (231) |...+ -+.|.-.+..+-|+..|...|+.+++..+..... + .+.+.+ ++.... T Consensus 281 ELAQDLKAIHGLDAEtELsNILStEImlEINREii~~i~~~a~~~~~~~t~~~~~~aG~~dl~~~~d~~g~rw~~e~~k~ 360 (528) T protein:vir:66 281 EVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKS 360 (528) T ss_pred HHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhheeeeeeeeeeeccccccceeecccccccccchhHHHHHHH Confidence 75433 3689999999999999999999999876632110 0 011111 122122 Q ss_pred HHHHhhcc--------C-CCceEEEECHHHHHHHHhhh-h----hhhccccccCceeeeccceeec-ceeEEEcCCCccC Q lcl|Aclame:pro 106 ALDIFNDE--------D-AQAYVLIVNPKDAAKIRKDA-N----AKNIGSEVGANALINGTYADVL-GAQIVRSKKLAEG 170 (231) Q Consensus 106 a~~~l~~~--------~-~~~~v~vv~p~~~~~L~k~~-~----~~~~~~~~~~~~~~~G~ig~~~-G~~Vv~s~~~~~~ 170 (231) .+..+..+ . ...++++|+|+++..|...- . ..........+....=..|.+. |++|++++..+.. T Consensus 361 L~~~i~~~an~I~~~T~r~~gn~vi~S~~Va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d 440 (528) T protein:vir:66 361 LIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAKGLNTDTTKAVFAGVLAGKYKVFIDQYARQD 440 (528) T ss_pred HHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhccccccccccccccccccCCCCceeEEEecCceEEEecCCCCcc Confidence 22222211 1 24589999999999997531 0 0000000000110001135554 4799999877642 Q ss_pred ceE-EEEEecCCceEEE-eecCCccceeccchhhcccEEEEEEEEEEEEEcCCc-------------------------- Q lcl|Aclame:pro 171 SAL-MFKIVSNSPALKL-VLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTK-------------------------- 222 (231) Q Consensus 171 ~~~-~~~~~~~~~A~~~-~~k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~-------------------------- 222 (231) =.. .++-.....+..+ ..--+...-.-.|+..+.-.+-...|||..+ ||=. T Consensus 441 y~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~v-NP~~~~~~~~~~~ri~~g~~~~~~ag~n~~ 519 (528) T protein:vir:66 441 YFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVLGFKTRYGIGI-NPFADSKSQEPSARITSGMLSKDSVGKNAY 519 (528) T ss_pred eEEEEEeCCcccccceeecccccceeeEeeCCccccceeeeeeeeceee-cCcccccCccccccccccchhhhhcCccce Confidence 110 0110000001111 1111122223458889999999999998765 4421 Q ss_pred EEEEEeccC Q lcl|Aclame:pro 223 VVNITFTGV 231 (231) Q Consensus 223 vv~l~~~~~ 231 (231) ..++.+|+. T Consensus 520 ~r~~~Vk~~ 528 (528) T protein:vir:66 520 FRRVWVKGC 528 (528) T ss_pred eEEeeeccC Confidence 122333333 No 238 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=75.30 E-value=0.15 Score=25.13 Aligned_cols=218 Identities=10% Similarity=0.048 Sum_probs=114.1 Q ss_pred CCCcccC----ceEEecc--ccCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHh---cCCCHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLA----NLCEYPN--DIGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALS---GYGDPIGESNKQL 71 (231) Q Consensus 1 ~~~~~~G----~ti~~P~--~igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~---~~~d~~~~~~~~~ 71 (231) =.-.+.| .++.||. ..|-+..++.++++|..+...+..+.++..++-++++++++... .+.+..++=.+.. T Consensus 104 ~pv~t~g~W~~~~~~~~v~e~~G~A~~ygd~~d~pl~d~~~~~~~r~v~~~~~g~~yg~~El~~Aa~~g~~l~~~Ka~aA 183 (379) T protein:vir:10 104 LGLSTVGQWDDEQIVQRVLEGLGTAQPYTDGGNMALMSWTPTFETRTVVRFEAGLQVAPLEEARSSRVQVSSADEKRAMV 183 (379) T ss_pred cccccCCCceeeeEEEeeeeeeeeeEEeccccCCCeeeeeeeeeeeeeEEEEEEEeecHHHHHHHHHhCCChHHHHHHHH Confidence 0011223 3567764 37999999999999999999888889999988889998876544 3667777777777 Q ss_pred HHHHHHHHHHHHHHHh-----c-----cc----------ccccccc----cC----HHHHHHHHHHhhcc-------CCC Q lcl|Aclame:pro 72 GLSLANKVDDDLLKAA-----K-----TT----------SQTVSTK----AN----VDGVQAALDIFNDE-------DAQ 116 (231) Q Consensus 72 a~~ia~~vd~~~~~~l-----~-----t~----------~~~~~~~----~~----~d~i~da~~~l~~~-------~~~ 116 (231) .+++..++|+-.|-+- + .- ++..+.+ .+ +++|..++..+-.. +.. T Consensus 184 ~~ale~~~N~i~f~G~~d~~~~~yGllNdP~l~a~~t~atg~~~~t~Wa~kT~~eI~~Di~~~~~~l~~qs~g~~~~~~~ 263 (379) T protein:vir:10 184 GEALEVQRNRVAFYGYNDGSGRTFGFLNDPNLPAYVAVPNGAGGSPLWAQKTTLEIIADLRNGLTALQVQSMGRIKSNKT 263 (379) T ss_pred HHHHHHhhceEEEEeecCCCcceEEEEeCCCCcccccccCCcccccccccCCHHHHHHHHHHHHHHHHHhhCCeeccccc Confidence 7778777776444331 0 00 0000011 12 45566565554322 123 Q ss_pred ceEEEECHHHHHHHHhhhhhhhccccccCceeeeccceeecceeEEEcCCCcc-Cc---eEEEEEecCCc--------eE Q lcl|Aclame:pro 117 AYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAE-GS---ALMFKIVSNSP--------AL 184 (231) Q Consensus 117 ~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~-~~---~~~~~~~~~~~--------A~ 184 (231) +..++++|..+..|.+-..+. ....+.+.. .+=+++|+..+.+.. +. ..++. ..... -+ T Consensus 264 ~~tL~LP~~~~~~L~~~n~~g----~Tvl~~lk~----n~Pnl~i~t~pEL~~aggg~~~~~~~-~~~~~~~~t~~~~~~ 334 (379) T protein:vir:10 264 PITIGIPNAYENYITTPTELG----YSVAQYMRE----SYPNVTFVSAPELNDANGGSSAIYYY-ADAVENNGTDDGRTW 334 (379) T ss_pred ceeEEecHHHHHhhccccccC----ccHHHHHHH----hcCCcEEEEcccccccCCCccEEEEE-eeccCCCccCCcceE Confidence 447999999998886421110 001111110 122456666555432 11 11111 11000 00 Q ss_pred EEeecCCccc-eeccchhhcccEEEEE-EEEEEEEEcCCcEEEEEec Q lcl|Aclame:pro 185 KLVLKRGVQV-ETDRDIVTKTTVITAD-EHYAAYLYDLTKVVNITFT 229 (231) Q Consensus 185 ~~~~k~~v~v-E~~Rd~~~~~~~i~~~-~~y~~~~~~~~~vv~l~~~ 229 (231) .+......+. ... ....+..+... ...|+-++.|.+++.++-+ T Consensus 335 ~~~~p~k~~~l~ve--~~~~~~~~~~~~rt~Gv~ir~P~Ai~~~~G~ 379 (379) T protein:vir:10 335 LQVVPTKMFTLGVE--KKIKGYAEGYTNATAGAMLKRPFATYRQTGA 379 (379) T ss_pred EEecchhhhhccce--ecCceeEeccccceeeeeeecchhhheecCC Confidence 0110010000 001 11112222222 2479999999999999777 No 239 >protein:vir:96079 Length: 382 # NCBI annotation: hypothetical protein ORF023 # Family: family:all:1653 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294440;genbank:gi:149408337;genbank:GeneID:5237198 Probab=74.87 E-value=0.15 Score=25.05 Aligned_cols=216 Identities=12% Similarity=0.049 Sum_probs=106.5 Q ss_pred CCCcccC----ceEEeccc--cCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHh---cCCCHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLA----NLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALS---GYGDPIGESNKQL 71 (231) Q Consensus 1 ~~~~~~G----~ti~~P~~--igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~---~~~d~~~~~~~~~ 71 (231) + .+.| .+++||.+ .|-+..++.++++|..+...+..+-++..++-++++.++...+ ...|..++=.... T Consensus 106 v--~t~g~W~~~t~ty~~~e~~G~A~~ygd~~D~Pl~d~~~~~~~r~v~~~~~g~~yg~lE~~rAa~~~~~l~~~Ka~aA 183 (382) T protein:vir:96 106 I--DTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTSWNANFERRTIVRGELGLLVGTLEEGRASAIRLNSAETKRQQA 183 (382) T ss_pred c--cccCCccceEEEEeeeecccceEEeecccCCCccccccceeEEEEEEEEEeeeecHHHHHHHHhhCCCcHHHHHHHH Confidence 1 1112 47788854 7999999999999988888888888888877788887654443 3677777666666 Q ss_pred HHHHHHHHHHHHHHH----hc-------c-----cccccccc----cC----HHHHHHHHHHhhccC-------CCceEE Q lcl|Aclame:pro 72 GLSLANKVDDDLLKA----AK-------T-----TSQTVSTK----AN----VDGVQAALDIFNDED-------AQAYVL 120 (231) Q Consensus 72 a~~ia~~vd~~~~~~----l~-------t-----~~~~~~~~----~~----~d~i~da~~~l~~~~-------~~~~v~ 120 (231) .++++.++++-.|-+ +. . +..+.++. .+ +++|..++..+.... ..+..+ T Consensus 184 ~~ale~~~N~i~f~G~~~g~~~~~yGllNdP~l~a~~t~a~~~Wa~kT~~eI~~Di~~l~~~i~~qt~G~~~~~~~~~~L 263 (382) T protein:vir:96 184 AIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPPFQTPPSQGWATADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITM 263 (382) T ss_pred HHHHHHhhceEEEEeeecCcCcceEEEEeCCCcccccccCCCCcccccHHHHHHHHHHHHHHHHhccCCeeeecccceEE Confidence 667777766544421 11 0 11111111 12 566667777663322 224478 Q ss_pred EECHHHHHHHHhhhhhhhccccccCceeeeccceeecceeEEEcCCCcc----C---ceEEEEEecC-----------Cc Q lcl|Aclame:pro 121 IVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAE----G---SALMFKIVSN-----------SP 182 (231) Q Consensus 121 vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~----~---~~~~~~~~~~-----------~~ 182 (231) +++|..+..|.+-..+ + ....+.+.. .+=+++|+.-+.+.. + ....+.+... +. T Consensus 264 ~LP~~~~~~Ls~~n~~---g-~Tvl~~lk~----n~Pnl~i~t~peL~~a~~~g~g~~~~~~~~~~e~~~~~~~s~~~p~ 335 (382) T protein:vir:96 264 ALATSKVDYLSVTTPY---G-ISVSDWIEQ----TYPKMRIVSAPELSGVQMQGKTPEDALVLFVEEVDASVDGSTDGGS 335 (382) T ss_pred eechHHHhhccccCcc---C-ccHHHHHHH----hcCCcEEEEccccccccCCCccceeEEEEecchhhhhcccccccCc Confidence 9999888777431110 0 000011110 122345554443321 1 1111111000 00 Q ss_pred eEEEeecCCccceeccchhh--cccEEEE-EEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 183 ALKLVLKRGVQVETDRDIVT--KTTVITA-DEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 183 A~~~~~k~~v~vE~~Rd~~~--~~~~i~~-~~~y~~~~~~~~~vv~l~~~~~ 231 (231) ++. .+-+.. .......+ .+..... ....|+-++.|.+++.+ .|+ T Consensus 336 ~f~--q~~p~~-~~~l~ve~~~~~~~~~~s~~t~Gv~i~~P~ai~~~--~GI 382 (382) T protein:vir:96 336 VFS--QLVQSK-FITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRY--LGI 382 (382) T ss_pred cee--ccccce-eeeccceeecceeEeccccceeeeEEEcchhhhhc--cCC Confidence 100 000000 00000000 1111111 12478999999998886 455 No 240 >protein:vir:1991 Length: 305 # NCBI annotation: major head subunit # Family: family:all:776 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050638;genbank:gi:9633525;genbank:GeneID:2636267 Probab=74.83 E-value=0.15 Score=25.04 Aligned_cols=160 Identities=14% Similarity=0.138 Sum_probs=92.5 Q ss_pred CCCcccCce---EEeccccCCcccccC-CCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANL---CEYPNDIGDAADVAE-GGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLA 76 (231) Q Consensus 1 ~~~~~~G~t---i~~P~~igda~~v~E-G~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~~~~~~a~~ia 76 (231) .-|++.=.| =++ -|.|+.--+.| =.+....+|+..+-+++-|.+...|.|...++.+-.........+++|++.+ T Consensus 32 ~iA~~vpSt~~~~tY-~wLg~fP~lrewiGer~i~~l~~~~y~i~Nk~fe~tV~V~R~dIeDD~lG~y~p~~~~~G~~aa 110 (305) T protein:vir:19 32 KIAMVVNSSTRSNTY-GWLGKFPTLKEWVGKRTIQQMEAHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAA 110 (305) T ss_pred eEEeEecCCCCcccc-cccccCCccchhhcceeeeeccccceeEeeccccceeccchhhccccccCchHHHHHHHHHHHh Confidence 111111110 011 24555322111 1357778899999999999999999999999999888888888999999888 Q ss_pred HHHHHHHHHHhccc------------------------c----------------------------------------- Q lcl|Aclame:pro 77 NKVDDDLLKAAKTT------------------------S----------------------------------------- 91 (231) Q Consensus 77 ~~vd~~~~~~l~t~------------------------~----------------------------------------- 91 (231) ..=|.-++..|+.. + T Consensus 111 ~~pd~lv~~lL~~Gf~~~cyDGq~FFdtDHpv~~~~~~tg~~~~vsn~~~~~~~~g~~w~Lld~~~~ikP~I~Q~Rk~~~ 190 (305) T protein:vir:19 111 VQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYPNVDGTGSAVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPE 190 (305) T ss_pred hchhhHHHHHHHhcCCccCCCCCcccCCCCCcccCCcccccccchhhhhcCCCCCCceeeeeecCCcceeEEEecccccc Confidence 76555444432110 0 Q ss_pred ------------------------------------cccccccCHHHHHHHHHHhhc---cC-----CCceEEEECHHHH Q lcl|Aclame:pro 92 ------------------------------------QTVSTKANVDGVQAALDIFND---ED-----AQAYVLIVNPKDA 127 (231) Q Consensus 92 ------------------------------------~~~~~~~~~d~i~da~~~l~~---~~-----~~~~v~vv~p~~~ 127 (231) -.++.+++.+.+..|..++.. .+ ..++.++|+|.-. T Consensus 191 ~~~~~~~~d~~vf~~~e~~ygvd~R~n~Gygfwq~a~gS~~~Ls~~nl~aar~aM~~qk~d~G~pL~I~P~~LvVPp~LE 270 (305) T protein:vir:19 191 LVARTRIDDDHVFMDNEFLFGASTRRAAGYGFWQMAVAVKGDLTLDNLWKGWQLMRSFEGDGGKKLGLKPTHIVVPVGLE 270 (305) T ss_pred eeeccCCCchhhhhhceeeeeeeeeeeccccchhheecCCCCCCHHHHHHHHHHHHhhcCCCCceeeeecCeEEeCchhH Confidence 001233455666666655532 21 3577899999866 Q ss_pred HHHHhhhhhhhccccccCceeeeccceeecc-eeEEEcCCC Q lcl|Aclame:pro 128 AKIRKDANAKNIGSEVGANALINGTYADVLG-AQIVRSKKL 167 (231) Q Consensus 128 ~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G-~~Vv~s~~~ 167 (231) ..-++....... ..+.. |..-.+.| +.+++++.+ T Consensus 271 ~~A~qll~s~~i--~~g~~----~~~Np~~g~~eliV~P~L 305 (305) T protein:vir:19 271 KAAEQLLNRELF--ADGNT----TVSNEMKGKLQLVVADYL 305 (305) T ss_pred HHHHHHHhhccc--CCccc----cccceecceEEEEecccC Confidence 555554322111 11111 11223455 789999999 No 241 >protein:vir:98480 Length: 348 # NCBI annotation: ORFp38 # Family: family:all:1083 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958280;genbank:gi:41057254;uniprot:Q38595;genbank:GeneID:2732864 Probab=74.34 E-value=0.16 Score=24.96 Aligned_cols=230 Identities=9% Similarity=0.015 Sum_probs=103.2 Q ss_pred CCCcccCce---------EEeccccCC------cccccCCCccCcccc-ccceeEEEeehccceeeecHHHHHhcCCCHH Q lcl|Aclame:pro 1 ENGINLANL---------CEYPNDIGD------AADVAEGGEISLDKI-GTTTKSVTIKKAAKGTEITDEAALSGYGDPI 64 (231) Q Consensus 1 ~~~~~~G~t---------i~~P~~igd------a~~v~EG~~i~~~~l-t~~~~~~tikk~g~~~~itD~~~~~~~~d~~ 64 (231) .-..--+++ ++|-...|. +..++.+.+.+..+- .+.+.+.++-..+....++..+.+...-.+. T Consensus 27 ~~~~~l~~~~fp~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~r~g~~~~~~~~~~i~~~~~i~~~d~~~~~~~~~ 106 (348) T protein:vir:98 27 QVNRFRLARWLPNVDVDDITFEFLRGGGGLAETASYRSWDTESKIGRREGLAKVMGELPPISEKIPLNEYDRLRLRKLSR 106 (348) T ss_pred CcchhhHHhcCCCccccceEEEEEeccCCceeeeeeecCCCccceeecccceeeeeeccccccccccCHHHHHHhcCChH Confidence 000011222 222111221 223455555443332 3455556666666666777766655444444 Q ss_pred HHHHHHHHHH-------HHHHHHHHHHHHhcccc-----------------cc---------cccccCHHHHHHHHHHhh Q lcl|Aclame:pro 65 GESNKQLGLS-------LANKVDDDLLKAAKTTS-----------------QT---------VSTKANVDGVQAALDIFN 111 (231) Q Consensus 65 ~~~~~~~a~~-------ia~~vd~~~~~~l~t~~-----------------~~---------~~~~~~~d~i~da~~~l~ 111 (231) .++.+.++.. +.+.++--+..++.+.. .. .++..-+++|.+....+. T Consensus 107 ~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~~g~~~~vDyg~~~~~~~t~~~~Ws~~~~adp~~di~~~~~~~~ 186 (348) T protein:vir:98 107 DEALPFIARDAQRLARNIGARFEVARGSALVNATVPVTELQQTVDFGRIGSHSVVAAVLWSVHATATPISDLESWVATYE 186 (348) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEecCceEEccccCcccccccccccCCCCCCCHHHHHHHHHHHHH Confidence 4444444433 33333332333333211 00 112234677777777776 Q ss_pred cc-CCCceEEEECHHHHHHHHhhhhhhhcccccc----Cceeeecccee---ecce-eEEEcCC-----------CccCc Q lcl|Aclame:pro 112 DE-DAQAYVLIVNPKDAAKIRKDANAKNIGSEVG----ANALINGTYAD---VLGA-QIVRSKK-----------LAEGS 171 (231) Q Consensus 112 ~~-~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~----~~~~~~G~ig~---~~G~-~Vv~s~~-----------~~~~~ 171 (231) +. +..+..++|+++.+..|++++.+........ ...+..+.+.. .+|. +|.+-+. +|++. T Consensus 187 ~~~G~~p~~~vm~~~~~~~l~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~i~~~d~~~~~~g~~~~~~p~~~ 266 (348) T protein:vir:98 187 DTNGQSPGVILMPKAAVSHMRQCEEVIRQVFPLAPSGTAPMVSVEQLNTVLSSMGLPPIEVYDAKVAVDGVSTRITPANA 266 (348) T ss_pred HccCCcceEEEeCHHHHHHHhcCHHHHHHHhccCccccccccCHHHHHHHHHhhCCeEEEEeeeEEEcCCceeceecCCe Confidence 54 5688999999999999999888775432111 11122222221 2344 3443321 23333 Q ss_pred eEEEEEec-----CCceEEEee------------------cCCccceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEe Q lcl|Aclame:pro 172 ALMFKIVS-----NSPALKLVL------------------KRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITF 228 (231) Q Consensus 172 ~~~~~~~~-----~~~A~~~~~------------------k~~v~vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~ 228 (231) .+++.-.. ....++... .-++-+..+++.+--...+.+..+--..+.+|++++++++ T Consensus 267 i~l~p~~~~~~~~~~~~~G~t~~G~~~e~~~~~~~~~~~~~~~i~~~~~~~~dP~~~~~~~~s~~lPv~~~~~~~~~a~V 346 (348) T protein:vir:98 267 IALLPEPGATDAAQPTELGATLLGTTAESLEDDYALAPGEQPGIVAATWKTKDPVRLWTHAAAVGIPVLREPNLTFKAQV 346 (348) T ss_pred EEEEecCCcccccccccccceecccchhhhccccccceeccCceeeeeeeecCCcEEEEEEeeeeeccccCCCcEEEEEE Confidence 22211000 000011000 0001111222222222334444445566789999999999 Q ss_pred cc Q lcl|Aclame:pro 229 TG 230 (231) Q Consensus 229 ~~ 230 (231) =| T Consensus 347 l~ 348 (348) T protein:vir:98 347 LA 348 (348) T ss_pred eC Confidence 99 No 242 >protein:vir:3969 Length: 287 # NCBI annotation: major capsid protein # Family: family:all:3269 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663677;genbank:gi:21716114;genbank:GeneID:951200 Probab=71.20 E-value=0.2 Score=24.43 Aligned_cols=219 Identities=16% Similarity=0.114 Sum_probs=116.5 Q ss_pred CCCcccCceEEe------c----cc-cCCcccccCCCccCccccccce--eEEEeeh---ccceee----ecHHHHHhcC Q lcl|Aclame:pro 1 ENGINLANLCEY------P----ND-IGDAADVAEGGEISLDKIGTTT--KSVTIKK---AAKGTE----ITDEAALSGY 60 (231) Q Consensus 1 ~~~~~~G~ti~~------P----~~-igda~~v~EG~~i~~~~lt~~~--~~~tikk---~g~~~~----itD~~~~~~~ 60 (231) +.|+..-+|.-. | .| ++.=.-.+.|+.-. -.+++ ...-.++ |-..|. |.+..+.... T Consensus 35 ~DGV~~N~taf~vKtsD~pVVi~~Y~Td~Nv~FGtGTg~s---sRFG~rkEi~y~dt~V~Y~~~~~ihEGiD~~TVNnd~ 111 (287) T protein:vir:39 35 KDGVTENDTFMELKVSDTDVVIQAYSTDANVGFGSGTGNT---SRFGQRKEVKSVNKQVSYDAPLAINEGIDDFTVNDIK 111 (287) T ss_pred ecCCcccceEEEEEecCcceEEecccCCCCcccccCCCcc---ccccceeEEEEecccccceeccccccccccccccCCh Confidence 667766666321 1 12 11111233333211 11111 0001111 111222 2222233333 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhccccc-ccccccCHHHHHHHHHHhhccC----C---CceEEEECHHHHHHHHh Q lcl|Aclame:pro 61 GDPIGESNKQLGLSLANKVDDDLLKAAKTTSQ-TVSTKANVDGVQAALDIFNDED----A---QAYVLIVNPKDAAKIRK 132 (231) Q Consensus 61 ~d~~~~~~~~~a~~ia~~vd~~~~~~l~t~~~-~~~~~~~~d~i~da~~~l~~~~----~---~~~v~vv~p~~~~~L~k 132 (231) ...+++=+..+|.++++.+|.-+=..|..+.. +.+..++-|.+.+....+...- . -+-+..+||+.|..|.. T Consensus 112 ~aaVAdRL~Lqa~A~t~~~n~~~Gk~ls~~A~~t~~~~~t~d~V~~LF~~a~~~yvNn~v~~~~~~~AyV~aevYnaiiD 191 (287) T protein:vir:39 112 DQVVAERLALHGVAWAQHVDKLLGKLLSDSASETLTVKLDEDSVTKLFSDAHKKFVNNNVSIAVPWVAYVNADIYDLLID 191 (287) T ss_pred hHHHHHHHHhHHHHHHHHHHHHHHHHHHhhcchheeeeecccchHHHHHHHHHHhhccceeeEEEEEEEEChhHHhHHhc Confidence 34466666777889999998866555544332 2222355566665555543321 1 24468899999999987 Q ss_pred hhhhhhccccccCceeeeccceeecceeEEEcC--CCccCceEEEEEecCCceEEEeecCCccceecc---chhhcccEE Q lcl|Aclame:pro 133 DANAKNIGSEVGANALINGTYADVLGAQIVRSK--KLAEGSALMFKIVSNSPALKLVLKRGVQVETDR---DIVTKTTVI 207 (231) Q Consensus 133 ~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~--~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~R---d~~~~~~~i 207 (231) .+... ..+.+..++--+| +..+-|+-+...+ +...|+..++. +.-++. ..+-+.+-| .++..-..+ T Consensus 192 ~~l~T-saK~SsaNiDen~-i~kFkGf~l~e~P~~~~q~g~~a~fs----~dnig~---af~GI~vaR~i~sEdF~Gval 262 (287) T protein:vir:39 192 SKLAT-TAKNSSANVDEQT-LYKFKGFILSELPDEKFQLNEGAYFA----ADNVGV---AGVGIQVTRAMDSEDFAGTAL 262 (287) T ss_pred ccccc-ccccceeeeccCC-cceecceEEEecchHhhccCcEEEEc----ccccee---ecccceeEEeeecccccceee Confidence 76543 3333344444444 5788999888776 44556654433 222221 123333444 345677889 Q ss_pred EEEEEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 208 TADEHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 208 ~~~~~y~~~~~~~~~vv~l~~~~~ 231 (231) .+---||-++.+.++.+.++.+.- T Consensus 263 QgAgK~G~~i~e~Nk~Ai~k~t~~ 286 (287) T protein:vir:39 263 QAAAKYGKYLPEKNKKAILKATVT 286 (287) T ss_pred ecccccccccccccceEEEEEecC Confidence 999999999999998888776666 No 243 >protein:vir:106590 Length: 349 # NCBI annotation: putative major head protein # Family: family:all:1083 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958585;genbank:gi:41179245;genbank:GeneID:2717126 Probab=69.93 E-value=0.22 Score=24.23 Aligned_cols=221 Identities=12% Similarity=0.037 Sum_probs=96.9 Q ss_pred CCCcccCceEEecc-------c---cC---C---cccccCCCccCccccccceeEEEeehccceeeecHHHHH--hcCCC Q lcl|Aclame:pro 1 ENGINLANLCEYPN-------D---IG---D---AADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAAL--SGYGD 62 (231) Q Consensus 1 ~~~~~~G~ti~~P~-------~---ig---d---a~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~--~~~~d 62 (231) ....--+++. ||. | .| . +..++.+.+.+..+-+....+..+-..+....++..+.+ .+..+ T Consensus 34 ~~~~~l~~~~-Fp~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~r~~~~~~~~~p~ik~~~~i~e~dl~~~~~~~~ 112 (349) T protein:vir:10 34 QYPEMLGDTL-FPAVKVPTLEVDILKAGSRVPTIASVSAFDAEAEIGTREASKMTAELAYVKRKMQITEEMLIKLQSPRN 112 (349) T ss_pred CcchhhHhhc-CCccccccceeEEEeeccCcceeeeeecCCCCcceecccceeEEeeccccccccccCHHHHHHHhhccC Confidence 1011223332 331 0 11 1 223445555444444433444445455555666655432 33222 Q ss_pred --HHHHHHHHHH-------HHHHHHHHHHHHHHhcccc-----------------c--cc--------ccccCHHHHHHH Q lcl|Aclame:pro 63 --PIGESNKQLG-------LSLANKVDDDLLKAAKTTS-----------------Q--TV--------STKANVDGVQAA 106 (231) Q Consensus 63 --~~~~~~~~~a-------~~ia~~vd~~~~~~l~t~~-----------------~--~~--------~~~~~~d~i~da 106 (231) ......++++ ..+.+.++--+..++.+.. . .. ++..-+++|.+. T Consensus 113 ~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~q~l~~Gki~~~~~g~~vD~g~~~~~~~~lt~~~~Ws~~~adpi~Di~~~ 192 (349) T protein:vir:10 113 TAEENYLKQYVFDDIDAMVQAVKARGEKMTMEMFATGKITDKKNGIAIDYGVPKKHQETLSGTKTWDKSDASIIDNLQDW 192 (349) T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeeEEcCCcEEEecccCccceeEecCcccCCCCCCCHHHHHHHH Confidence 2223333333 3333333333444443221 0 01 111225666655 Q ss_pred HHHhhccCCCceEEEECHHHHHHHHhhhhhhhccccccCceee-----eccceeecceeEEEcCC--------------- Q lcl|Aclame:pro 107 LDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALI-----NGTYADVLGAQIVRSKK--------------- 166 (231) Q Consensus 107 ~~~l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~-----~G~ig~~~G~~Vv~s~~--------------- 166 (231) .+.+ +..+..++|+++++..|++++.+...........+. +..++.+.|.+|++-+. T Consensus 193 ~~~~---g~~p~~~vm~~~~~~~l~~~~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~~i~~yd~~y~d~~~~~~~t~~~ 269 (349) T protein:vir:10 193 SDSL---DVTPTRALTSKKVLRILMRSTEIKEAIFGKDTGRVVGQADLDQWMTAQGLPIIRAYDGKYRDEDSRGNLTTNS 269 (349) T ss_pred HHHh---CCCccEEEeCHHHHHHHhcCHHHHHHhcccccccccCHHHHHHHHHhcCCceEEEEeeEEEeecCCCceeecc Confidence 5555 467889999999999999998876653222222111 23445556666665431 Q ss_pred -CccCceEEEEEecCCceEEEeecCCc-----------cc---eecc------chhhcccEEEEEEEEEEEEEcCCcEEE Q lcl|Aclame:pro 167 -LAEGSALMFKIVSNSPALKLVLKRGV-----------QV---ETDR------DIVTKTTVITADEHYAAYLYDLTKVVN 225 (231) Q Consensus 167 -~~~~~~~~~~~~~~~~A~~~~~k~~v-----------~v---E~~R------d~~~~~~~i~~~~~y~~~~~~~~~vv~ 225 (231) +|++..+++. .+..+...-..+ .. +..+ +.+--.-.+.+..+--..+.+|+++++ T Consensus 270 ~~p~~~v~l~~----~~~~G~~~yG~~~e~~~~~~g~~~~~~~~~~~~~~~~~~~dP~~~~~~~~s~~lPv~~~~~~~~~ 345 (349) T protein:vir:10 270 YFPEDRIVLFN----DEVPGQKIYGPTPEENRLISSNAQVSNVGNIMAKIYETSEDPIGTWILASATMLPSFASADDVFQ 345 (349) T ss_pred cccCCeEEEec----CCCceeEEeeccchhhhhcccccceeeccceEEEeeeecCCCceEEEEEeeeeeeeecCCCcEEE Confidence 2333332221 222221111010 00 0001 111112233344445566678888888 Q ss_pred EEec Q lcl|Aclame:pro 226 ITFT 229 (231) Q Consensus 226 l~~~ 229 (231) +++= T Consensus 346 a~Vl 349 (349) T protein:vir:10 346 AKVL 349 (349) T ss_pred EEeC Confidence 8877 No 244 >protein:vir:94528 Length: 286 # NCBI annotation: major head protein # Family: family:all:3269 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223889;genbank:gi:62327101;genbank:GeneID:5075544 Probab=65.28 E-value=0.29 Score=23.56 Aligned_cols=215 Identities=14% Similarity=0.094 Sum_probs=109.1 Q ss_pred CCCcccCceEEe------c----cc-cCCcccccCCCccCcccccccee-EE-Eeeh---ccceeeecH-HHHHhcCCCH Q lcl|Aclame:pro 1 ENGINLANLCEY------P----ND-IGDAADVAEGGEISLDKIGTTTK-SV-TIKK---AAKGTEITD-EAALSGYGDP 63 (231) Q Consensus 1 ~~~~~~G~ti~~------P----~~-igda~~v~EG~~i~~~~lt~~~~-~~-tikk---~g~~~~itD-~~~~~~~~d~ 63 (231) +.|+..-+|.-. | .| ++.=.-.++|+.-. -.++.. ++ -.+. |-..|.+-+ .+...-+.|+ T Consensus 41 lDGV~~N~tafsvKt~D~pVVig~Y~TdeNv~FGtgTg~S---sRFG~rkEi~y~dtdV~Y~~~~~iHEGiD~~TVNnd~ 117 (286) T protein:vir:94 41 LDGVPNNATAFSVKTNDMAVVVGEYSTDANTAFGTGTSNS---SRFGEMKEVIYADTDVPYTAGWAIHEGLDQMTVNNDL 117 (286) T ss_pred hhCCCccceEEEEeecCcceEEecccCCCccccccCCccc---cccCceeeEEeecccccccccchhhhccccccccCCh Confidence 777777766421 2 12 11111233333211 111110 00 0111 111222211 1223333344 Q ss_pred ---HHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHHhhccC-----CCceEEEECHHHHHHHHhhhh Q lcl|Aclame:pro 64 ---IGESNKQLGLSLANKVDDDLLKAAKTTSQTVSTKANVDGVQAALDIFNDED-----AQAYVLIVNPKDAAKIRKDAN 135 (231) Q Consensus 64 ---~~~~~~~~a~~ia~~vd~~~~~~l~t~~~~~~~~~~~d~i~da~~~l~~~~-----~~~~v~vv~p~~~~~L~k~~~ 135 (231) +++=++.+|.++.+.+|..+=..|..+... ..++|.+.+....+...- ..+-...+||..|..|...+. T Consensus 118 ~aaVAdRL~lQA~Akt~~~n~~~Gk~ls~~A~~---t~~~D~V~~LF~~as~~yvn~ev~~~~~ayV~~evYnaiiD~~l 194 (286) T protein:vir:94 118 DAAVADRLNLQAQAKTRLFNVAMGEALATAGTD---LGAVDDVNALFESAVEKYTDLEVIAPVRAYVTASVYNAIIDLAN 194 (286) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh---hhhhhhHHHHHHHHHHHhhhhheeeeeEEEEchhHHHHHhcccc Confidence 555566778888888887665555433322 233466665555543321 234458999999999987765 Q ss_pred hhhccccccCceeeeccceeecceeEEEcC-CCccCceEEEEEecCCceEEEeecCCccceecc---chhhcccEEEEEE Q lcl|Aclame:pro 136 AKNIGSEVGANALINGTYADVLGAQIVRSK-KLAEGSALMFKIVSNSPALKLVLKRGVQVETDR---DIVTKTTVITADE 211 (231) Q Consensus 136 ~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~-~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~R---d~~~~~~~i~~~~ 211 (231) .. ..+.+..++-.|| +..+-|+-+...+ ++-.|+...+. +.-++. ..+-+.+-| .++..-..+.+-- T Consensus 195 ~T-saK~SsaNiDeng-i~~FkGf~i~e~P~~~~~g~~aifs----~dnig~---aftGIn~aR~IesEdF~GValQgAG 265 (286) T protein:vir:94 195 VT-TAKNSAVNIDTNG-MLSFRGIAITKVPTQYMGGKAVIFA----PDNVAR---VFTGINIARTIQAIDFAGVELQGAG 265 (286) T ss_pred cc-ccccceeeeccCC-cceecceEEeecchhhccCceEEEc----ccccee---eeccceeeeeeeccccCceeeeccc Confidence 43 2333344444444 5788998887776 33345543332 222222 122333444 3455677888888 Q ss_pred EEEEEEEcCCcEEE--EEecc Q lcl|Aclame:pro 212 HYAAYLYDLTKVVN--ITFTG 230 (231) Q Consensus 212 ~y~~~~~~~~~vv~--l~~~~ 230 (231) -||-++.+.++.+. .+.+| T Consensus 266 K~G~~I~edNk~Ai~~~~~k~ 286 (286) T protein:vir:94 266 KYGTFILDDNKKAIFTATPKA 286 (286) T ss_pred cccccccccCceeEEEeecCC Confidence 99999998877665 33344 No 245 >protein:vir:103759 Length: 330 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024928;genbank:gi:48697198;genbank:GeneID:2846083 Probab=62.17 E-value=0.34 Score=23.15 Aligned_cols=172 Identities=12% Similarity=0.029 Sum_probs=95.9 Q ss_pred CCCcccCceEEeccccCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHHHH---HHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGE---SNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~~~---~~~~~a~~ia~ 77 (231) =|.-.++.+ .+..-...++...=++.+++++-++.+++.+++-.+..++|... +.+-.+|.-+. ..++..++|.. T Consensus 46 ~N~~tg~~t-~vrt~LP~~~fR~lN~g~~~s~~tt~qvt~~l~ilgg~~eVDr~-la~~~Gn~a~~ra~e~~~~ikam~q 123 (330) T protein:vir:10 46 GNLPTGHRT-SVRTGLPTPTWRKLYGGVLPNKSSTAQVTDNCGMLEAYAEVDKA-LADLNGNTAAFRLSEDRAQIEGMNQ 123 (330) T ss_pred ccCCcccce-eEEeecCCchhhhcCCccccccceEEEEEEEeEEecchhhhhhH-HHhhcCCHHHHHHHHHHHHHHHHHH Confidence 122222333 12222234555666677999999999999999999999998775 45555566433 44556667777 Q ss_pred HHHHHHHHH-----------hc------cc--------------c--------------------c-------------c Q lcl|Aclame:pro 78 KVDDDLLKA-----------AK------TT--------------S--------------------Q-------------T 93 (231) Q Consensus 78 ~vd~~~~~~-----------l~------t~--------------~--------------------~-------------~ 93 (231) ++...+|.+ |. ++ . + . T Consensus 124 ~~~~~~iyGD~a~~p~~F~GL~kR~~~~ta~~~~qvIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~ 203 (330) T protein:vir:10 124 EVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVT 203 (330) T ss_pred HHHHHhccCCCCCChhhccchhhhcCCCCCCchhheeeccccccCceEEEEEEEcCCeEEEEcccCccccceeeecccee Confidence 776666542 10 00 0 0 0 Q ss_pred ---cc---------------------------------------cccCH----HHHHHHHHHhhccCCCceEEEECHHHH Q lcl|Aclame:pro 94 ---VS---------------------------------------TKANV----DGVQAALDIFNDEDAQAYVLIVNPKDA 127 (231) Q Consensus 94 ---~~---------------------------------------~~~~~----d~i~da~~~l~~~~~~~~v~vv~p~~~ 127 (231) .. +.... +-+.+|...+-.-.....+++||..+. T Consensus 204 ~~~~dg~gg~y~~~~~~~~w~~Gl~i~d~r~vvRI~NIdvs~l~~~~~~~~li~lm~~A~~~ip~~~~g~~~~y~n~~v~ 283 (330) T protein:vir:10 204 IENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLR 283 (330) T ss_pred eecccCCCCceeEEeeeeeeeeeeEEeCcccEEEEeecccccCCCCccHHHHHHHHHHHHHhccCCCCCcceeeechHHH Confidence 00 00011 223334444432334567899999999 Q ss_pred HHHHhhhhhhhccccccCceeeeccceeecceeEEEcCCCccCceEEE Q lcl|Aclame:pro 128 AKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMF 175 (231) Q Consensus 128 ~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~ 175 (231) ..|++...... +....-+-...-.+-.+.|+||.+++.+-.++...+ T Consensus 284 ~~L~~q~~~k~-n~~l~~~~~~g~~~t~~~gipir~~Dail~tE~~vv 330 (330) T protein:vir:10 284 EKLRLGIVDKI-ANNLTWETVSGERVMTFDGIPVQRTDALLNTESRVV 330 (330) T ss_pred HHHHHHHhhcc-cceeeeeecCCeeeEEECCeEEEEEeeeecCccccC Confidence 99998642221 111111111111235789999999998877776443 No 246 >protein:vir:7324 Length: 335 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848215;genbank:gi:30387386;genbank:GeneID:2641870 Probab=57.72 E-value=0.43 Score=22.60 Aligned_cols=173 Identities=14% Similarity=-0.004 Sum_probs=95.0 Q ss_pred CCCcccCceEEeccccCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhcCCCHH---HHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPI---GESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~~~d~~---~~~~~~~a~~ia~ 77 (231) =|.-.++.+ .+..-...++...=++.+++++-++.+++.+++-.+..++|.. .+.+-.+|.- ....++..++|.. T Consensus 46 ~N~~tg~~~-~vrt~LP~~~fR~lN~g~~~s~~tt~qvt~~l~ilgg~~eVDr-~La~~~Gn~a~~ra~e~~~~ikam~q 123 (335) T protein:vir:73 46 CNDGSKHKT-TIRAGIPEPVWRRYNQGVQPTKTQTVPVTDTTGMLYDLGFVDK-ALADRSNNAAAFRVSENMGKLQGFNN 123 (335) T ss_pred ccCCcccce-eEEEecCCchhhhcCCccccccceEEEEEEEEEEecchhhhhH-HHHhhcCCHHHHHHHHHHHHHHHHHH Confidence 122222333 1222123455566667799999999999999999999999886 5566666764 3334446667777 Q ss_pred HHHHHHHHH-----------hc-------c--c--------------ccc------------------------------ Q lcl|Aclame:pro 78 KVDDDLLKA-----------AK-------T--T--------------SQT------------------------------ 93 (231) Q Consensus 78 ~vd~~~~~~-----------l~-------t--~--------------~~~------------------------------ 93 (231) ++...+|.+ |. + + ..+ T Consensus 124 ~~~~~~iyGDsa~~p~~FdGL~kR~~~~st~~a~~a~~iIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g 203 (335) T protein:vir:73 124 KVARYSIYGNTDAEPEAFMGLAPRFNTLSTSKAASAENVFSAGGSGSTNTSIWFMSWGENTAHMIYPEGMVAGFQHEDLG 203 (335) T ss_pred HHHHHhccCCcCCChhhccchhhhhcCccccccCcccceeeccccccCceEEEEEEEcCCeeEEEcccCccccceeeecc Confidence 776666542 10 0 0 000 Q ss_pred ----------------------------------------ccc----ccC----HHHHHHHHH--HhhccCCCceEEEEC Q lcl|Aclame:pro 94 ----------------------------------------VST----KAN----VDGVQAALD--IFNDEDAQAYVLIVN 123 (231) Q Consensus 94 ----------------------------------------~~~----~~~----~d~i~da~~--~l~~~~~~~~v~vv~ 123 (231) ++. +.+ .+.+++|+- .+-.-.....+++|| T Consensus 204 ~~~~~d~~G~~y~~~~~~~~w~~Gl~i~d~r~vvRI~NIdvs~l~~d~~~~~~l~~lmi~a~~~~~ip~~~~~~~~~y~n 283 (335) T protein:vir:73 204 DDLVSDGNGGQFRAYRDEFKWDIGLSVRDWRSISRICNIDVTTLTKDASTGADLISMMVDAYYARDVAMLGDGKEVIYAN 283 (335) T ss_pred ceeeecCCCCEEeEEEeeeeeeeeeEEeCcccEEEEeecccccccccccchhhHHhhHHHHHHHHhccCCCCCceEEEec Confidence 000 000 112233331 122223445789999 Q ss_pred HHHHHHHHhhhhhhhccccccCceeeeccceeecceeEEEcCCCccCceEEEE Q lcl|Aclame:pro 124 PKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFK 176 (231) Q Consensus 124 p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~~~~~~~~ 176 (231) ..+...|++....... .....+-...-.+-.+.|+||.+++.+-.++...+- T Consensus 284 ~~v~~~L~~q~~~~~n-~~l~~~~~~g~~~t~~~gipir~~Dail~tE~~v~~ 335 (335) T protein:vir:73 284 KTIHAWLHKQAMNAKN-VNLTIEEYGGKKIVSFLGIPIRRVDAILNTESAVTA 335 (335) T ss_pred hHHHHHHHHHHhccCc-eeeeeeccCCceeEEECCeEEEEEeeeecCcccccC Confidence 9999999986443221 111111111112357889999999988777654321 No 247 >protein:vir:6378 Length: 346 # NCBI annotation: capsid protein E # Family: family:all:1021 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918991;genbank:gi:34610166;genbank:GeneID:2559600 Probab=55.72 E-value=0.47 Score=22.37 Aligned_cols=228 Identities=10% Similarity=-0.041 Sum_probs=99.1 Q ss_pred CCCcccCceEEeccccCC---cccccCCCccC-ccccccceeEEEeehccceeeecHHHHH---------hcCCCHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGD---AADVAEGGEIS-LDKIGTTTKSVTIKKAAKGTEITDEAAL---------SGYGDPIGES 67 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igd---a~~v~EG~~i~-~~~lt~~~~~~tikk~g~~~~itD~~~~---------~~~~d~~~~~ 67 (231) +--....++|.|=...|. +..+.+|..-. ...=.......++-..+....++-.+.. .+..++.... T Consensus 29 ~~~~~~t~~i~i~~~~g~~~la~~v~~~~~~~~~~~~g~~~~~~~~p~i~~~~~i~~~d~~~~~~~~~~~~~~~~~~~~~ 108 (346) T protein:vir:63 29 NEITFDTDEILFDLVFKDKKLAPFVAPNVQGRVIAARGYTTKTFRPAYVKPKDVINPNRTLKRRAGEQPIIGGMSLQERF 108 (346) T ss_pred cccccccceEEEEEecCceeeeeeecCCCCcceecccceeeeEeecCccCccceeCHHHHHHHhhhhhhccCCcCHHHHH Confidence 111222344444211232 22344443221 1111222333444444444445443322 1233444444 Q ss_pred HHHHH-------HHHHHHHHHHHHHHhcccccc------------------------------cccccCHHHHHHHHHHh Q lcl|Aclame:pro 68 NKQLG-------LSLANKVDDDLLKAAKTTSQT------------------------------VSTKANVDGVQAALDIF 110 (231) Q Consensus 68 ~~~~a-------~~ia~~vd~~~~~~l~t~~~~------------------------------~~~~~~~d~i~da~~~l 110 (231) .+.++ +.+....+.-+..+|.+.... .++..-+.+|.++...+ T Consensus 109 ~~~i~~~~~~l~~~i~~~~E~m~~~al~~gki~~~g~~~~~~~vdfg~~~~~~~~lt~~~~W~~~~adp~~di~~~~~~~ 188 (346) T protein:vir:63 109 QAVVADSQLEQRQRIENRIEWMCAMATIYGYVDVVGEAFPMQRVDFGRDPALTVQLTGGAAWDQATSDPLGNIQTMRTTA 188 (346) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCEEEeeCCceeEEEEeeCCCccceeeecccccCCCCCCCHHHHHHHHHHHH Confidence 33333 344444444455555432100 01112267777777777 Q ss_pred hcc-CCCceEEEECHHHHHHHHhhhhhhhcccc--cc--Cc----eeee-------ccce---eecceeEEEcC------ Q lcl|Aclame:pro 111 NDE-DAQAYVLIVNPKDAAKIRKDANAKNIGSE--VG--AN----ALIN-------GTYA---DVLGAQIVRSK------ 165 (231) Q Consensus 111 ~~~-~~~~~v~vv~p~~~~~L~k~~~~~~~~~~--~~--~~----~~~~-------G~ig---~~~G~~Vv~s~------ 165 (231) .+. +..+..++|+|+.+..|++++.+...... .+ .. .+.. |.+. .+.|+.|+.-+ T Consensus 189 ~~~~g~~~~~~i~~~~~~~~l~~~~~v~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~gi~i~~y~~~y~d~ 268 (346) T protein:vir:63 189 WKKSNSTITRLTMGLDAWSLFSQKPAVVELLNLFYKGSTSDFNRSRLDDGSPVQYQGTIGGYNGMGTLELYTYHDTYTGD 268 (346) T ss_pred HHccCCceEEEEECHHHHHHHhcCHHHHHHHhhhccccccccchhhcccchhhhhhhhHhhhhccCCeEEEEeccEEEcC Confidence 554 35788999999999999998877653211 00 00 1111 1111 23466766422 Q ss_pred ------CCccCceEEEEEecCCceEEEee-----cCCcc----ceeccchhhcccEEEEEEEEEEEEEcCCcEEEEEec Q lcl|Aclame:pro 166 ------KLAEGSALMFKIVSNSPALKLVL-----KRGVQ----VETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFT 229 (231) Q Consensus 166 ------~~~~~~~~~~~~~~~~~A~~~~~-----k~~v~----vE~~Rd~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~ 229 (231) -+|+++.+++.. ...|.+.+.. ..... ...+...+-..-.+.+..+--..+.+|+++++++++ T Consensus 269 ~G~~~~~ip~~~v~~~p~-~~~g~~~yg~~~d~~~~~~~~~~~~~~~~~~dp~~~~~~~~s~plPv~~~p~~~~~~~V~ 346 (346) T protein:vir:63 269 DNTEQEILGSYDVVGTGP-GLQGTQCFGAIMDFKNGLVPTRMFPKMWEEEDPSVAMLMTQSAPLMVPAQPNASFRMTVK 346 (346) T ss_pred CCceeccccCCeEEEEec-CCcceEEEeeccccccCcccceeeeEEEEecCCCEEEEEEeeeccceecCCCcEEEEEeC Confidence 245555444321 0011111100 00000 011111112223344444455666899999999999 No 248 >protein:vir:99576 Length: 388 # NCBI annotation: hypothetical protein # Family: family:all:1653 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039801;genbank:gi:126011051;genbank:GeneID:4818271 Probab=51.24 E-value=0.59 Score=21.85 Aligned_cols=219 Identities=10% Similarity=0.020 Sum_probs=109.2 Q ss_pred CCCcccC----ceEEeccc--cCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHhc---CCCHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLA----NLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSG---YGDPIGESNKQL 71 (231) Q Consensus 1 ~~~~~~G----~ti~~P~~--igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~~---~~d~~~~~~~~~ 71 (231) + .+.| .+++||.+ .|-+..++.++++|..+...+..+-++..++-++++++++...+ +.|..++=.... T Consensus 110 v--~t~g~W~~~~~~f~v~e~~G~A~~ygd~~D~Pl~d~~~~~~~r~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~AA 187 (388) T protein:vir:99 110 V--KTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGA 187 (388) T ss_pred c--cccCCccceeEEEeeeecceeEEEeecccCCCceeccceeeeeeEEEEEeeeeecHHHHHHHHhhCCCcHHHHHHHH Confidence 1 1122 26777754 68999999999999999888888888888888888888765543 667766666666 Q ss_pred HHHHHHHHHHHHHHH---h--------cc-----cccccc-------cc-cC----HHHHHHHHHHhhccC-------CC Q lcl|Aclame:pro 72 GLSLANKVDDDLLKA---A--------KT-----TSQTVS-------TK-AN----VDGVQAALDIFNDED-------AQ 116 (231) Q Consensus 72 a~~ia~~vd~~~~~~---l--------~t-----~~~~~~-------~~-~~----~d~i~da~~~l~~~~-------~~ 116 (231) .++++.+.++-.|-+ . .. +....+ ++ .+ +++|..++..+.... .. T Consensus 188 ~~ale~~~N~i~f~G~~g~~~~~~yGllNdP~l~a~v~at~~~~~~~Wa~kT~~eI~~Di~~~~~~i~~qs~g~~~~~~~ 267 (388) T protein:vir:99 188 AVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDV 267 (388) T ss_pred HHHHHhhhceEEEEeecCCCccceEEEeeCCCcccccccccCCcCcccccCCHHHHHHHHHHHHHHHHHhcCCeeeeccc Confidence 677777666544311 1 00 000010 11 12 566666666663221 22 Q ss_pred ceEEEECHHHHHHHHhhhhhhhccccccCceeeeccceeecceeEEEcCCCcc-----CceEEEEEecCCceEEEeecCC Q lcl|Aclame:pro 117 AYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAE-----GSALMFKIVSNSPALKLVLKRG 191 (231) Q Consensus 117 ~~v~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~~~-----~~~~~~~~~~~~~A~~~~~k~~ 191 (231) +..++++|..+..|.+-..+ +. ...+.+.. .+=+++|+.-+.+.. +....+-+...-.......-.+ T Consensus 268 ~~tL~LP~~~~~~Ls~~n~~---g~-Tvl~~lk~----n~Pnl~i~t~pEl~~a~~tgg~~~~~~~~~~~~~~~~~~~~~ 339 (388) T protein:vir:99 268 DITLVLPMNKVDMLSVVTDL---GI-SVRDWLKQ----TYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDG 339 (388) T ss_pred ceEEEechHHHHhccccCcC---Cc-cHHHHHHH----hcCCcEEEEecccccccccCCceeEEEEecccccccccCccC Confidence 34799999888888532111 00 00011110 122455555443321 1111111110000000000000 Q ss_pred ------ccceeccc----hhhcccEEEEE-EEEEEEEEcCCcEEEEEeccC Q lcl|Aclame:pro 192 ------VQVETDRD----IVTKTTVITAD-EHYAAYLYDLTKVVNITFTGV 231 (231) Q Consensus 192 ------v~vE~~Rd----~~~~~~~i~~~-~~y~~~~~~~~~vv~l~~~~~ 231 (231) .--|..|- .......+... ..+|+-++.|.+++.+ .|+ T Consensus 340 ~~t~~~~~p~~~~~l~vq~~~~~~~~~~~~rt~Gv~ir~P~Ai~~~--~GI 388 (388) T protein:vir:99 340 GDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRL--IGL 388 (388) T ss_pred cceeEEecccccccccceecCceeEeccccceeeeEEeccchhhee--ccC Confidence 00011110 11111222222 3478999999999886 455 No 249 >protein:vir:3424 Length: 341 # NCBI annotation: capsid component # Family: family:all:1021 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040587;genbank:gi:9626251;genbank:GeneID:2703482 Probab=50.61 E-value=0.6 Score=21.78 Aligned_cols=223 Identities=15% Similarity=0.042 Sum_probs=100.3 Q ss_pred CCCcccCceEEeccccCC---cccccCCC---ccCccccccceeEEEeehccceeeecHHHHH--hc------CCCHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGD---AADVAEGG---EISLDKIGTTTKSVTIKKAAKGTEITDEAAL--SG------YGDPIGE 66 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igd---a~~v~EG~---~i~~~~lt~~~~~~tikk~g~~~~itD~~~~--~~------~~d~~~~ 66 (231) +--...-++|.+-...|. +..+.+|. .+... .....+.++-..+....++-.+.. .. .-++... T Consensus 30 ~~~~~~t~~v~~~~~~~~~~lap~v~~~~~~~~~~~~--~~~~~~~~~p~i~~~~~i~~~d~~~r~~g~~~~~~~~~~~~ 107 (341) T protein:vir:34 30 ESYPFTTEKVYLSQIPGLVNMALYVSPIVSGEVIRSR--GGSTSEFTPGYVKPKHEVNPQMTLRRLPDEDPQNLADPAYR 107 (341) T ss_pred cccccccceEEEEEeeCCeeEEEeecCCCCcceeccC--ceeeeEEecCccCccceeCHHHHHHHhhccccccCcCHHHH Confidence 211222233444211232 22233333 33332 233445555555555555543332 11 2234443 Q ss_pred HHHHHHHH-------HHHHHHHHHHHHhcccc--------------------cc--ccc--------ccCHHHHHHHHHH Q lcl|Aclame:pro 67 SNKQLGLS-------LANKVDDDLLKAAKTTS--------------------QT--VST--------KANVDGVQAALDI 109 (231) Q Consensus 67 ~~~~~a~~-------ia~~vd~~~~~~l~t~~--------------------~~--~~~--------~~~~d~i~da~~~ 109 (231) ..+++... +.+.++--+..+|.+.. .. .+. ...++.+.+.... T Consensus 108 ~~~~i~~~l~~l~~~i~~~~E~m~~qaL~~Gki~~~~~g~~~~~vDfg~~~~~~~~~t~~~~W~~~~~~~~d~l~di~~~ 187 (341) T protein:vir:34 108 RRRIIMQNMRDEELAIAQVEEMQAVSAVLKGKYTMTGEAFDPVEVDMGRSEENNITQSGGTEWSKRDKSTYDPTDDIEAY 187 (341) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEEecCCccEEEEEeCCCCccceEecCCccCCcCCCchHHHHHHHHHH Confidence 44444433 33333333444443211 00 111 1234555665566 Q ss_pred hhccCCCceEEEECHHHHHHHHhhhhhhhcccc-cc-Cce-------eeec--cceeecceeEEEcC-----------CC Q lcl|Aclame:pro 110 FNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSE-VG-ANA-------LING--TYADVLGAQIVRSK-----------KL 167 (231) Q Consensus 110 l~~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~-~~-~~~-------~~~G--~ig~~~G~~Vv~s~-----------~~ 167 (231) +...+..+..++|+++.+..|++++.+...... .+ ... +..| .++++.|++|++-+ .+ T Consensus 188 ~~~~g~~~~~~i~~~~~~~~l~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~y~~~y~ddG~~~~~i 267 (341) T protein:vir:34 188 ALNASGVVNIIVFDPKGWALFRSFKAVKEKLDTRRGSNSELETAVKDLGKAVSYKGMYGDVAIVVYSGQYVENGVKKNFL 267 (341) T ss_pred HHhcCCceEEEEeCHHHHHHHhcCHHHHHHHhhcccccccccccccccccceeeeeecCCceEEEEcCEEEECCcEEeee Confidence 666677889999999999999999887643211 11 000 1111 23456687776433 25 Q ss_pred ccCceEEEEEecCCceEEE-eecCCcc--------ceecc------ch-hhcccEEEEEEEEEEEEEcCCcEEEEEec Q lcl|Aclame:pro 168 AEGSALMFKIVSNSPALKL-VLKRGVQ--------VETDR------DI-VTKTTVITADEHYAAYLYDLTKVVNITFT 229 (231) Q Consensus 168 ~~~~~~~~~~~~~~~A~~~-~~k~~v~--------vE~~R------d~-~~~~~~i~~~~~y~~~~~~~~~vv~l~~~ 229 (231) |+++.+.+.. ++.+. ....... .+..| +. +-..-.+.+..+--..+.+|+++++++++ T Consensus 268 p~~~v~l~p~----g~~g~~~yg~~~d~~~~~~~~~~~~~~~~~~~~~~dp~~~~~~~~s~pLPv~~~pd~~~~a~V~ 341 (341) T protein:vir:34 268 PDNTMVLGNT----QARGLRTYGCIQDADAQREGINASARYPKNWVTTGDPAREFTMIQSAPLMLLADPDEFVSVQLA 341 (341) T ss_pred cCCeEEEeeC----CCcceEEEeecccccccccceeeeeEeeeeeeecCCCcEEEEEEcccceeeeeCCCcEEEEEeC Confidence 6665554431 11111 0000000 01111 11 11122344444455677899999999999 No 250 >protein:vir:99311 Length: 463 # NCBI annotation: putative capsid protein # Family: family:all:2450 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024474;genbank:gi:48696433;genbank:GeneID:2948039 Probab=41.98 E-value=0.9 Score=20.82 Aligned_cols=223 Identities=13% Similarity=0.111 Sum_probs=112.6 Q ss_pred CCCcccCceEEeccc--cCCcccccCCCccCccccccceeEEEeehccceeeecHHHH-HhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAA-LSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~-~~~~~d~~~~~~~~~a~~ia~ 77 (231) .+.++--+.+. .| +|+...+.|+..++..+.+....++.+|-.+-.-.+|-.+- ..+..||+....+.....++. T Consensus 74 ~STV~~y~~~~--~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~dai~~ia~ 151 (463) T protein:vir:99 74 QSTVVKYDQYL--RHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAK 151 (463) T ss_pred hhhhhhheeee--ccCccccccccccccccccCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHHHHHHHHHH Confidence 22222222221 22 57788899999999999999999999998777666666553 345779999999999999999 Q ss_pred HHHHHHHHHhcc-ccc-c------------cc---------cccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhh Q lcl|Aclame:pro 78 KVDDDLLKAAKT-TSQ-T------------VS---------TKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDA 134 (231) Q Consensus 78 ~vd~~~~~~l~t-~~~-~------------~~---------~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~ 134 (231) .+.-.+|-+-.. ++. . +. ..++-+.|..|.......-..++-++||+.+.+.|...- T Consensus 152 tiE~a~FyGds~l~~~~~~~gleFDGl~~lId~enviDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~vka~f~~~~ 231 (463) T protein:vir:99 152 TIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSI 231 (463) T ss_pred HHHHHHhhhhhccCCCcCccccchhhhhhhcCCCCeeecCCCcccHHHHhhhhhhhhcccCChhheecchHHHHHHHHHh Confidence 998877643211 110 0 11 123445566666666544456778999999999887431 Q ss_pred hhhhcc-ccccCc-----eeeeccc---e--eecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchhhc Q lcl|Aclame:pro 135 NAKNIG-SEVGAN-----ALINGTY---A--DVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTK 203 (231) Q Consensus 135 ~~~~~~-~~~~~~-----~~~~G~i---g--~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~ 203 (231) --..+. .....+ .-.+|.+ | .+.|-.++-.+..=.-+ ....++|..-.. --.++++.-....+ T Consensus 232 l~~qrv~~~~N~~~~~~G~~v~~f~s~~G~I~L~~s~~m~~~~il~~~-----~~~~p~ap~~~~-~tatv~~~~~~~~~ 305 (463) T protein:vir:99 232 LGRQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDES-----LQPLPNAPQPAK-VTATVETKQKGAFE 305 (463) T ss_pred cCceEEEEcCCCCceeeeeeccceeeeeeeeeeCCceecCCcccccch-----hhcCCCCccCce-eEEEEeeccCCCCC Confidence 111110 000000 0011100 0 11221122111100000 000012211110 01244442222222 Q ss_pred ccEEEEEEEEEEEEEc------CCcEEEEEeccC Q lcl|Aclame:pro 204 TTVITADEHYAAYLYD------LTKVVNITFTGV 231 (231) Q Consensus 204 ~~~i~~~~~y~~~~~~------~~~vv~l~~~~~ 231 (231) +..=.+.+.|.+.+.+ |+.++..|++.| T Consensus 306 ~~~~~a~~~Y~vv~~s~~geS~pS~ivtaT~a~~ 339 (463) T protein:vir:99 306 NEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNV 339 (463) T ss_pred CcccccceEEEEEEECCCCCcccchheeeeeeec Confidence 2222333455555554 666666666643 No 251 >protein:vir:95603 Length: 463 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240903;genbank:gi:66394965;genbank:GeneID:5132544 Probab=41.98 E-value=0.9 Score=20.82 Aligned_cols=223 Identities=13% Similarity=0.111 Sum_probs=112.6 Q ss_pred CCCcccCceEEeccc--cCCcccccCCCccCccccccceeEEEeehccceeeecHHHH-HhcCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAA-LSGYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~-~~~~~d~~~~~~~~~a~~ia~ 77 (231) .+.++--+.+. .| +|+...+.|+..++..+.+....++.+|-.+-.-.+|-.+- ..+..||+....+.....++. T Consensus 74 ~STV~~y~~~~--~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~dai~~ia~ 151 (463) T protein:vir:95 74 QSTVVKYDQYL--RHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAK 151 (463) T ss_pred hhhhhhheeee--ccCccccccccccccccccCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHHHHHHHHHH Confidence 22222222221 22 57788899999999999999999999998777666666553 345779999999999999999 Q ss_pred HHHHHHHHHhcc-ccc-c------------cc---------cccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhh Q lcl|Aclame:pro 78 KVDDDLLKAAKT-TSQ-T------------VS---------TKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDA 134 (231) Q Consensus 78 ~vd~~~~~~l~t-~~~-~------------~~---------~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~ 134 (231) .+.-.+|-+-.. ++. . +. ..++-+.|..|.......-..++-++||+.+.+.|...- T Consensus 152 tiE~a~FyGds~l~~~~~~~gleFDGl~~lId~enviDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~vka~f~~~~ 231 (463) T protein:vir:95 152 TIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSI 231 (463) T ss_pred HHHHHHhhhhhccCCCcCccccchhhhhhhcCCCCeeecCCCcccHHHHhhhhhhhhcccCChhheecchHHHHHHHHHh Confidence 998877643211 110 0 11 123445566666666544456778999999999887431 Q ss_pred hhhhcc-ccccCc-----eeeeccc---e--eecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchhhc Q lcl|Aclame:pro 135 NAKNIG-SEVGAN-----ALINGTY---A--DVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTK 203 (231) Q Consensus 135 ~~~~~~-~~~~~~-----~~~~G~i---g--~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~ 203 (231) --..+. .....+ .-.+|.+ | .+.|-.++-.+..=.-+ ....++|..-.. --.++++.-....+ T Consensus 232 l~~qrv~~~~N~~~~~~G~~v~~f~s~~G~I~L~~s~~m~~~~il~~~-----~~~~p~ap~~~~-~tatv~~~~~~~~~ 305 (463) T protein:vir:95 232 LGRQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDES-----LQPLPNAPQPAK-VTATVETKQKGAFE 305 (463) T ss_pred cCceEEEEcCCCCceeeeeeccceeeeeeeeeeCCceecCCcccccch-----hhcCCCCccCce-eEEEEeeccCCCCC Confidence 111110 000000 0011100 0 11221122111100000 000012211110 01244442222222 Q ss_pred ccEEEEEEEEEEEEEc------CCcEEEEEeccC Q lcl|Aclame:pro 204 TTVITADEHYAAYLYD------LTKVVNITFTGV 231 (231) Q Consensus 204 ~~~i~~~~~y~~~~~~------~~~vv~l~~~~~ 231 (231) +..=.+.+.|.+.+.+ |+.++..|++.| T Consensus 306 ~~~~~a~~~Y~vv~~s~~geS~pS~ivtaT~a~~ 339 (463) T protein:vir:95 306 NEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNV 339 (463) T ss_pred CcccccceEEEEEEECCCCCcccchheeeeeeec Confidence 2222333455555554 666666666643 No 252 >protein:vir:63741 Length: 468 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547622;genbank:GeneID:3783474 Probab=36.24 E-value=1.2 Score=20.18 Aligned_cols=220 Identities=13% Similarity=0.139 Sum_probs=118.4 Q ss_pred CCCcccCceEEeccc--cCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHh-cCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALS-GYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~-~~~d~~~~~~~~~a~~ia~ 77 (231) ++-++--+.+. .| +|+...+.|+..++..+.+...+++.+|-.+-.-.+|..+-++ +..||++...+.....++. T Consensus 74 ~stv~~y~~~~--~~G~~g~~~f~~E~g~~~~~~~~~~r~~~~~k~l~~~~~vs~~~~l~n~i~d~~~~~~~~ai~~~a~ 151 (468) T protein:vir:63 74 TSTVAKYDVYM--QHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAK 151 (468) T ss_pred hhhhhhheeee--ccCccccccccccccccccCCCceEEEEEEeeeeeeeeeehhhhhhhcchhhHHHHHHHHHHHHHHH Confidence 44444433332 33 5788889999999999999999999999888888889887655 5779998888888889999 Q ss_pred HHHHHHHHH---hcccccc------------cc---------cccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhh Q lcl|Aclame:pro 78 KVDDDLLKA---AKTTSQT------------VS---------TKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKD 133 (231) Q Consensus 78 ~vd~~~~~~---l~t~~~~------------~~---------~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~ 133 (231) .+.-.+|-+ +..++.. +. ..++-+.|..|....+..-..++-++||+.+.+.|... T Consensus 152 tiE~a~FyGds~l~~s~~~~~glqfDGi~~li~~enviDa~G~~ls~~~lneaa~~i~~gfG~~td~~~~~~v~a~~~~~ 231 (468) T protein:vir:63 152 TIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQ 231 (468) T ss_pred HHHHHhhhcccccccCCCccccccccceeEEecCCceeccCCCccCHHHHHHHhhhccccccChhhhhcchhHHhhhhhh Confidence 998877742 2112111 00 12344555666666655434566789999998888543 Q ss_pred hhhhhc--------cccccCceeeeccc---e--eecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccch Q lcl|Aclame:pro 134 ANAKNI--------GSEVGANALINGTY---A--DVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDI 200 (231) Q Consensus 134 ~~~~~~--------~~~~~~~~~~~G~i---g--~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~ 200 (231) --.... ....+-+ .+|.+ | .+.|-.|+.+.+++.-..........+.-+ . .+.+..... T Consensus 232 ~L~~q~~v~~~n~~~~~~G~~--v~g~~sa~G~I~l~gs~il~~~~~l~~~~~~~~~Apsp~~v-s-----aT~~~~~~g 303 (468) T protein:vir:63 232 QLSKQTQLVRDNGNNVSVGFN--IQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQPAKV-T-----ATQEAGKKG 303 (468) T ss_pred hcCceEEEEcCCCCceeeeec--ccceecceeeeeecCceeeccccCCCcccccccccccCCcc-c-----eeeecccCC Confidence 211101 0011111 11211 1 123444444444443222211111111100 0 011111111 Q ss_pred hhcccEEEEEEEEEEEEEc------CCcEEEEEeccC Q lcl|Aclame:pro 201 VTKTTVITADEHYAAYLYD------LTKVVNITFTGV 231 (231) Q Consensus 201 ~~~~~~i~~~~~y~~~~~~------~~~vv~l~~~~~ 231 (231) . +..---+.+.|.+.+.+ |+.++.+++++. T Consensus 304 ~-~~~~~~a~y~Y~v~~vs~~GES~pS~~vtvTVaa~ 339 (468) T protein:vir:63 304 Q-FRAEDLAAHEYKVVVSSDDAESIASEVATATVTAK 339 (468) T ss_pred c-ccCCCcceEEEEEEEECCCCccccccceEEEecCc Confidence 1 11112223556666664 456666776665 No 253 >protein:vir:80491 Length: 467 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468466;genbank:gi:157325041;genbank:GeneID:5601449 Probab=35.02 E-value=1.3 Score=20.04 Aligned_cols=220 Identities=13% Similarity=0.139 Sum_probs=118.3 Q ss_pred CCCcccCceEEeccc--cCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHh-cCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALS-GYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~-~~~d~~~~~~~~~a~~ia~ 77 (231) ++-++--+-+. .| +|+...+.|+..++..+.+...+++.+|-.+-.-.+|..+-++ +..||++...+.....++. T Consensus 73 ~stv~~y~~~~--~~G~~g~~~f~~E~g~~~~~~~~~~r~~~~~k~l~~~~~vs~~~~l~n~i~d~~~~~~~~ai~~~a~ 150 (467) T protein:vir:80 73 TSTVAKYDVYM--QHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAK 150 (467) T ss_pred hhhhhhheeee--ccCccccccccccccccccCCCceEEEEEEeeeeeeeeeehhhhhhhcchhhHHHHHHHHHHHHHHH Confidence 44444433332 33 5788889999999999999999999999888888889887655 5779998888888889999 Q ss_pred HHHHHHHHH---hcccccc------------cc---------cccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhh Q lcl|Aclame:pro 78 KVDDDLLKA---AKTTSQT------------VS---------TKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKD 133 (231) Q Consensus 78 ~vd~~~~~~---l~t~~~~------------~~---------~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~ 133 (231) .++-.+|-+ +..++.. +. ..++-+.|..|....+..-..++-++||+.+.+.|... T Consensus 151 tiE~a~FyGds~l~~s~~~~~glqfDGi~~li~~enviDa~G~~ls~~~lneaa~~i~~gfG~~td~~~p~~v~a~~~~~ 230 (467) T protein:vir:80 151 TIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQ 230 (467) T ss_pred HHHHHhhhcccccccCCCccccccccceeEEecCCceeccCCCccCHHHHHHHhhhccccccChhhhhcchhHHhhhhhh Confidence 998877742 2112111 00 12344555666666655434566789999998888543 Q ss_pred hhhhhc--------cccccCceeeeccc---e--eecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccch Q lcl|Aclame:pro 134 ANAKNI--------GSEVGANALINGTY---A--DVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDI 200 (231) Q Consensus 134 ~~~~~~--------~~~~~~~~~~~G~i---g--~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~ 200 (231) --.... ....+-+ .+|.+ | .+.|-.|+.+.+++.-..........+.-+ . .+.+..... T Consensus 231 ~L~~q~~v~~~n~~~~~~G~~--v~g~~sa~G~I~l~gs~il~~~~~l~~~~~~~~~Apsp~~v-s-----aT~~~~~~g 302 (467) T protein:vir:80 231 QLSKQTQLVRDNGNNVSVGFN--IQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQPAKV-T-----ATQEAGKKG 302 (467) T ss_pred hcCceEEEEcCCCCceeeeec--ccceecceeeeeecCceeeccccCCCcccccccccccCCcc-c-----eeeecccCC Confidence 211101 0011111 11211 1 123444444444443222211111111100 0 011111111 Q ss_pred hhcccEEEEEEEEEEEEEc------CCcEEEEEeccC Q lcl|Aclame:pro 201 VTKTTVITADEHYAAYLYD------LTKVVNITFTGV 231 (231) Q Consensus 201 ~~~~~~i~~~~~y~~~~~~------~~~vv~l~~~~~ 231 (231) . +..---+.+.|.+.+.+ |+.++.+++++. T Consensus 303 ~-~~~~~~a~y~Y~v~~vs~~GES~pS~~vtvTVaa~ 338 (467) T protein:vir:80 303 Q-FRAEDLAAHEYKVVVSSDDAESIASEVATATVTAK 338 (467) T ss_pred c-ccCCCcceEEEEEEEECCCCccccccceEEEecCc Confidence 1 11112223556666664 456666776665 No 254 >protein:vir:96666 Length: 462 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238545;genbank:gi:66391271;genbank:GeneID:5130448 Probab=34.59 E-value=1.3 Score=19.99 Aligned_cols=223 Identities=13% Similarity=0.144 Sum_probs=112.4 Q ss_pred CCCcccCceEEeccc--cCCcccccCCCccCccccccceeEEEeehccceeeecHHHHHh-cCCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALS-GYGDPIGESNKQLGLSLAN 77 (231) Q Consensus 1 ~~~~~~G~ti~~P~~--igda~~v~EG~~i~~~~lt~~~~~~tikk~g~~~~itD~~~~~-~~~d~~~~~~~~~a~~ia~ 77 (231) .+-++--+-+. .| +|+...+.|+..++.++.+...+++.+|-.+-.-.+|..+-++ +..||++...+..-..++. T Consensus 74 ~sTv~~y~~~~--~~G~~g~~~f~~E~g~~~~~d~~~~R~~~~~k~l~~t~~vsi~~tl~n~~~d~~~~~~~dai~~~a~ 151 (462) T protein:vir:96 74 QSTVQKYDVYL--RHGNVGHSRFVREVGVAPVSDPNIRQKTVEMKYVSDTKNLSIASTLVNNIQDPMQILTEDAIAVVAK 151 (462) T ss_pred hhhhhhheeee--ccCccccccccccccccccCCCceEEEEEEEEEEeeeeeechhhhhccchhhHHHHHHHHHHHHHHH Confidence 22222222111 22 5778889999999999999999999999888888888887655 5789999888888889999 Q ss_pred HHHHHHHHHhcc-ccccc----------------------ccccCHHHHHHHHHHhhccCCCceEEEECHHHHHHHHhhh Q lcl|Aclame:pro 78 KVDDDLLKAAKT-TSQTV----------------------STKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDA 134 (231) Q Consensus 78 ~vd~~~~~~l~t-~~~~~----------------------~~~~~~d~i~da~~~l~~~~~~~~v~vv~p~~~~~L~k~~ 134 (231) .++-.+|-+-.. ++... ...++-+.|..|....+..-..++-++||+.+.++|...- T Consensus 152 tiE~a~Fygds~l~~~~~~~gleFDGl~~lI~~~NViDarG~~Ls~~~ln~aa~~i~~~fGt~TD~~~p~~v~a~f~~~~ 231 (462) T protein:vir:96 152 TIEWASFYGDASLTADPTGQGLEFDGLAKLIDKDNVIDAKGESLTETLLNRSAVLIGKSFGTATDAYMPIGVHADFVNSV 231 (462) T ss_pred HHHHHHhhhhcccCCCccccccchhhhhhhcCCCceeecCCCCccHHHHhhhhhhcccccCChhheecchHHHHHHHHhh Confidence 998877643211 11100 1123445555555555433346777999999998887431 Q ss_pred hhhhcc-ccccCc-----eeeeccc---e--eecceeEEEcCCCccCceEEEEEecCCceEEEeecCCccceeccchhhc Q lcl|Aclame:pro 135 NAKNIG-SEVGAN-----ALINGTY---A--DVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTK 203 (231) Q Consensus 135 ~~~~~~-~~~~~~-----~~~~G~i---g--~~~G~~Vv~s~~~~~~~~~~~~~~~~~~A~~~~~k~~v~vE~~Rd~~~~ 203 (231) --..+. .....+ .-.+|.+ | .+.|-.++-.+..=.-+ ..-.....+...+ ..++++...-... T Consensus 232 l~~qrv~~~~n~g~~~~G~~v~~f~s~~G~I~L~~s~~m~~~~i~~~~---~~~~p~ap~~~~v---saTv~t~~~g~f~ 305 (462) T protein:vir:96 232 LGRQMQLMQDNSGNVNAGYNVQGFYSSRGFIKLHGSTVMENELILDES---LQPLPNAPQPATV---KATVETGKKGLFT 305 (462) T ss_pred cCceEEEEcCCCCceeeeeeccceeeeeeeeeeCCceecCcccccccc---cccCCCCCCCCce---eEEEEeCCCCCCC Confidence 111110 000000 0011110 0 11111111111100000 0000000000000 0123333222111 Q ss_pred ccEEEEEEEEEEEEEc------CCcEEEEEeccC Q lcl|Aclame:pro 204 TTVITADEHYAAYLYD------LTKVVNITFTGV 231 (231) Q Consensus 204 ~~~i~~~~~y~~~~~~------~~~vv~l~~~~~ 231 (231) ...=.+-+.|.+...+ |+.+|.+|++.+ T Consensus 306 ~~~d~~~y~Y~V~avs~dgeS~PS~~VtaTva~~ 339 (462) T protein:vir:96 306 DEHDRAELTYKVVVNSDDAQSAPSEAVTATVNNA 339 (462) T ss_pred CccCceeEEEEEEEECCCCccccceeeEeeeecc Confidence 1111355566666665 566677777644 No 255 >protein:vir:393 Length: 341 # NCBI annotation: gp8 # Family: family:all:1021 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046903;genbank:gi:9630472;genbank:GeneID:1261647 Probab=33.81 E-value=1.3 Score=19.90 Aligned_cols=227 Identities=12% Similarity=0.018 Sum_probs=96.4 Q ss_pred CCCcccCceEEeccccCC---cccccCCCccC-ccccccceeEEEeehccceeeecHHHHH--h------cCCCHHHHHH Q lcl|Aclame:pro 1 ENGINLANLCEYPNDIGD---AADVAEGGEIS-LDKIGTTTKSVTIKKAAKGTEITDEAAL--S------GYGDPIGESN 68 (231) Q Consensus 1 ~~~~~~G~ti~~P~~igd---a~~v~EG~~i~-~~~lt~~~~~~tikk~g~~~~itD~~~~--~------~~~d~~~~~~ 68 (231) +--...-++|.+-...|. +..+..+..-. ...-.....+.++-..+....++-.+.. + +.-++..... T Consensus 30 ~~~~~~t~~v~~~~~~~~~~lap~v~~~~~~~~~~~~~~~~~~~~~p~i~~~~~i~~~d~~~r~~g~~~~~~~~~~~~~~ 109 (341) T protein:vir:39 30 ETYPFSTEKVYLSQIPGLVNMALYVSPIVSGKVIRSRGGSTSEFTPGYVKPKHEVNPLMTLRRLPDEDPQNLADPVYRRR 109 (341) T ss_pred cccccCcceEEEEEecCCceeeEEecCCCCcceecccceeeeeEeccccCcccccCHHHHHHHhhcccccccCCHHHHHH Confidence 111112233333212232 22233333221 1122233444555555444455544332 1 1223444444 Q ss_pred HHHHHHHHH-------HHHHHHHHHhcccc--------------------cc--ccc--------ccCHHHHHHHHHHhh Q lcl|Aclame:pro 69 KQLGLSLAN-------KVDDDLLKAAKTTS--------------------QT--VST--------KANVDGVQAALDIFN 111 (231) Q Consensus 69 ~~~a~~ia~-------~vd~~~~~~l~t~~--------------------~~--~~~--------~~~~d~i~da~~~l~ 111 (231) +.+...+.+ .++--+..+|.+.. .. .+. ....+-+.+..+.+. T Consensus 110 ~~i~~~~~~l~~~i~~r~E~m~~qaL~~Gki~i~~~g~~~~~vDfg~~~~~~~~lt~~~~W~~~~~~~~d~l~di~~~~~ 189 (341) T protein:vir:39 110 RIILQNMKDEELAIAQVEEKQAVAAVLSGKYTMTGEAFEPVEVDMGRSAGNNIVQAGAAAWSSRDKETYDPTDDIEAYAL 189 (341) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCceEEEcCCCcEEEEeccCCccceeEecCCccCCCCCCchHHHHHHHHHHHH Confidence 444433332 22322333442211 00 011 112344444444455 Q ss_pred ccCCCceEEEECHHHHHHHHhhhhhhhcccc--ccCceee-------ec--cceeecceeEEEcCC-----------Ccc Q lcl|Aclame:pro 112 DEDAQAYVLIVNPKDAAKIRKDANAKNIGSE--VGANALI-------NG--TYADVLGAQIVRSKK-----------LAE 169 (231) Q Consensus 112 ~~~~~~~v~vv~p~~~~~L~k~~~~~~~~~~--~~~~~~~-------~G--~ig~~~G~~Vv~s~~-----------~~~ 169 (231) +.+..+..++|+++.+..|++++.+...... .....+. .| .++++.|++|++-+. +|+ T Consensus 190 ~~g~~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~y~~~y~d~g~~~~~ip~ 269 (341) T protein:vir:39 190 NASGVVNIIVFDPKGWALFRSFKAVKEKLDTRRGSNSELETALKDLGKAVSYKGMYGDVAIVVYSGQYIENDVKKNYLPD 269 (341) T ss_pred hcCCceEEEEeChHHHHHHhcCHHHHHHHhhcccccccccchhhhhhhHhhhhhhhcCceEEEEccEEEecCcEEeeecC Confidence 5567788999999999999998877654211 1111111 11 234567888766332 555 Q ss_pred CceEEEEEecCCceEEEee-------cCCccceeccc-------hhhcccEEEEEEEEEEEEEcCCcEEEEEec Q lcl|Aclame:pro 170 GSALMFKIVSNSPALKLVL-------KRGVQVETDRD-------IVTKTTVITADEHYAAYLYDLTKVVNITFT 229 (231) Q Consensus 170 ~~~~~~~~~~~~~A~~~~~-------k~~v~vE~~Rd-------~~~~~~~i~~~~~y~~~~~~~~~vv~l~~~ 229 (231) ++.+++..- ..|...+.. ..+ .....|- .+-..-.+.+...--..+.+|+++++++++ T Consensus 270 ~~~~l~p~~-~~g~~~yg~~~d~~~~~~~-~~~~~~~~~~~~~~~dp~~~~~~~~s~plPv~~~p~~~~~a~V~ 341 (341) T protein:vir:39 270 LTMVLGNTQ-ARGLRTYGCILDADAQREG-INASTRYPKNWVQTGDPAREFTMIQSAPLMLLADPDEFVSVKLA 341 (341) T ss_pred CeEEEeeCC-CcceEEEecccchhhcccc-eeeeeeeeeeeeecCCCcEEEEEEeccccceeeCCCcEEEEEeC Confidence 555443310 011111100 000 0111110 011123333444455667899999999999 Done!