Query lcl|Aclame:protein:vir:96123|NCBI_annot:ORF013|genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103
Match_columns 274
No_of_seqs 116 out of 479
Neff 9.3
Searched_HMMs 1612
Date Sun Dec 1 19:16:47 2013
Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_14 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_14_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:96123 Length: 274 100.0 2.5E-64 1.5E-67 369.3 26.5 274 1-274 1-274 (274)
2 protein:vir:93742 Length: 274 100.0 2.3E-63 1.4E-66 364.0 27.5 274 1-274 1-274 (274)
3 protein:vir:1239 Length: 274 # 100.0 3.5E-63 2.1E-66 363.0 26.4 274 1-274 1-274 (274)
4 protein:vir:94494 Length: 274 100.0 7E-63 4.4E-66 361.3 27.1 274 1-274 1-274 (274)
5 protein:vir:97433 Length: 274 100.0 7E-63 4.4E-66 361.3 27.1 274 1-274 1-274 (274)
6 protein:vir:96262 Length: 274 100.0 1.9E-62 1.2E-65 359.0 26.1 274 1-274 1-274 (274)
7 protein:vir:95898 Length: 274 100.0 1.9E-62 1.2E-65 359.0 26.1 274 1-274 1-274 (274)
8 protein:vir:105334 Length: 276 100.0 4.9E-62 3E-65 356.7 26.8 274 1-274 1-274 (276)
9 protein:vir:96833 Length: 275 100.0 2E-61 1.2E-64 353.4 26.7 274 1-274 1-275 (275)
10 protein:vir:3033 Length: 272 # 100.0 1.9E-60 1.2E-63 347.9 27.9 272 1-273 1-272 (272)
11 protein:vir:9820 Length: 272 # 100.0 1.9E-60 1.2E-63 347.9 27.9 272 1-273 1-272 (272)
12 protein:vir:80930 Length: 278 100.0 2.5E-60 1.5E-63 347.4 26.3 271 1-271 1-278 (278)
13 protein:vir:3613 Length: 272 # 100.0 3.7E-58 2.3E-61 335.5 25.8 268 1-270 1-272 (272)
14 protein:vir:95107 Length: 270 100.0 3E-55 1.9E-58 319.5 25.7 268 1-274 1-270 (270)
15 protein:vir:739 Length: 231 # 100.0 2.1E-44 1.3E-47 260.1 20.1 227 40-270 1-231 (231)
16 protein:vir:102944 Length: 330 100.0 1.1E-41 6.6E-45 245.2 21.4 265 1-274 1-305 (330)
17 protein:vir:7990 Length: 273 # 100.0 2.6E-41 1.6E-44 243.1 22.1 261 1-270 1-273 (273)
18 protein:vir:5974 Length: 324 # 100.0 1.2E-40 7.3E-44 239.5 22.6 262 1-274 1-292 (324)
19 protein:vir:105822 Length: 273 100.0 1.4E-40 8.5E-44 239.1 22.9 261 1-270 1-273 (273)
20 protein:vir:102605 Length: 273 100.0 1.4E-40 8.5E-44 239.1 22.9 261 1-270 1-273 (273)
21 protein:vir:80684 Length: 315 100.0 1.3E-39 7.9E-43 233.8 23.2 269 1-274 1-310 (315)
22 protein:vir:41 Length: 299 # N 100.0 2.8E-39 1.7E-42 232.0 23.2 262 1-271 6-299 (299)
23 protein:vir:1583 Length: 351 # 100.0 1.1E-39 7E-43 234.1 20.6 262 1-274 1-295 (351)
24 protein:vir:9574 Length: 300 # 100.0 4.5E-39 2.8E-42 230.8 22.8 262 1-270 1-300 (300)
25 protein:vir:9309 Length: 324 # 100.0 3.4E-38 2.1E-41 226.0 23.7 263 1-274 27-319 (324)
26 protein:vir:97148 Length: 324 100.0 4.9E-38 3E-41 225.1 23.8 263 1-274 27-319 (324)
27 protein:vir:100247 Length: 425 100.0 2.5E-38 1.6E-41 226.7 22.0 264 1-271 130-425 (425)
28 protein:vir:6242 Length: 390 # 100.0 2.7E-38 1.7E-41 226.6 21.9 265 1-271 110-390 (390)
29 protein:vir:96392 Length: 324 100.0 7E-38 4.4E-41 224.3 23.7 263 1-274 27-320 (324)
30 protein:vir:78830 Length: 324 100.0 7E-38 4.4E-41 224.3 23.7 263 1-274 27-320 (324)
31 protein:vir:485 Length: 407 # 100.0 5.4E-38 3.3E-41 224.9 22.8 267 1-274 106-404 (407)
32 protein:vir:4456 Length: 401 # 100.0 5.4E-38 3.4E-41 224.9 22.4 263 1-270 107-401 (401)
33 protein:vir:1328 Length: 392 # 100.0 8.2E-38 5.1E-41 223.9 22.7 265 1-271 110-392 (392)
34 protein:vir:96223 Length: 324 100.0 1.1E-37 7E-41 223.1 23.5 263 1-274 27-319 (324)
35 protein:vir:94622 Length: 341 100.0 1.1E-38 7.1E-42 228.6 17.9 268 1-272 1-341 (341)
36 protein:vir:2344 Length: 397 # 100.0 7.4E-38 4.6E-41 224.1 21.9 270 1-274 10-310 (397)
37 protein:vir:99749 Length: 324 100.0 1.7E-37 1E-40 222.2 23.7 263 1-274 27-319 (324)
38 protein:vir:103955 Length: 324 100.0 3.1E-37 2E-40 220.7 23.8 263 1-274 27-319 (324)
39 protein:vir:9759 Length: 303 # 100.0 2.3E-37 1.4E-40 221.4 22.5 262 1-270 1-303 (303)
40 protein:vir:94142 Length: 304 100.0 2.9E-37 1.8E-40 220.9 22.5 257 1-269 1-304 (304)
41 protein:vir:105905 Length: 304 100.0 2.9E-37 1.8E-40 220.9 22.5 257 1-269 1-304 (304)
42 protein:vir:7771 Length: 330 # 100.0 5.4E-37 3.4E-40 219.4 23.1 270 1-274 1-327 (330)
43 protein:vir:95763 Length: 297 100.0 1.2E-36 7.2E-40 217.6 23.4 260 1-271 9-297 (297)
44 protein:vir:1886 Length: 385 # 100.0 1.1E-36 7E-40 217.7 22.6 261 1-271 105-385 (385)
45 protein:vir:191 Length: 385 # 100.0 1.1E-36 7E-40 217.7 22.6 261 1-271 105-385 (385)
46 protein:vir:8187 Length: 311 # 100.0 1.2E-36 7.3E-40 217.6 22.4 262 1-271 1-311 (311)
47 protein:vir:104085 Length: 320 100.0 1.2E-36 7.4E-40 217.5 22.3 268 1-272 14-320 (320)
48 protein:vir:2430 Length: 318 # 100.0 1.3E-36 7.9E-40 217.4 22.2 270 1-274 14-317 (318)
49 protein:vir:100135 Length: 418 100.0 2.8E-36 1.7E-39 215.5 23.5 264 1-273 135-418 (418)
50 protein:vir:97053 Length: 390 100.0 2.2E-36 1.4E-39 216.1 22.6 259 1-268 113-390 (390)
51 protein:vir:94771 Length: 298 100.0 2.3E-36 1.4E-39 216.0 22.4 258 1-269 1-298 (298)
52 protein:vir:78223 Length: 333 100.0 2.8E-36 1.7E-39 215.5 22.6 267 1-271 1-333 (333)
53 protein:vir:104256 Length: 458 100.0 3.9E-36 2.4E-39 214.7 23.2 265 1-270 161-458 (458)
54 protein:vir:81070 Length: 390 100.0 4.1E-36 2.5E-39 214.6 22.6 259 1-268 113-390 (390)
55 protein:vir:101607 Length: 379 100.0 7.3E-36 4.5E-39 213.2 23.6 264 1-270 106-379 (379)
56 protein:vir:1638 Length: 298 # 100.0 6.5E-36 4E-39 213.5 22.6 258 1-269 1-298 (298)
57 protein:vir:10364 Length: 390 100.0 6.6E-36 4.1E-39 213.5 22.4 259 1-268 113-390 (390)
58 protein:vir:4856 Length: 293 # 100.0 1E-35 6.3E-39 212.4 23.4 267 1-274 5-285 (293)
59 protein:vir:4339 Length: 395 # 100.0 9.4E-36 5.9E-39 212.6 23.2 261 1-270 113-395 (395)
60 protein:vir:94673 Length: 419 100.0 1.3E-35 8.2E-39 211.8 23.4 265 1-272 123-419 (419)
61 protein:vir:78523 Length: 338 100.0 1.3E-35 7.8E-39 211.9 22.9 269 1-273 1-338 (338)
62 protein:vir:4226 Length: 326 # 100.0 1.3E-35 8.2E-39 211.8 22.1 268 1-273 20-326 (326)
63 protein:vir:81160 Length: 371 100.0 1.7E-35 1E-38 211.3 22.7 262 1-270 91-371 (371)
64 protein:vir:80446 Length: 367 100.0 7.9E-36 4.9E-39 213.0 20.4 265 1-274 1-335 (367)
65 protein:vir:4953 Length: 397 # 100.0 2.4E-35 1.5E-38 210.3 22.9 267 1-274 109-389 (397)
66 protein:vir:96978 Length: 387 100.0 3.2E-36 2E-39 215.2 17.7 257 1-274 118-385 (387)
67 protein:vir:94424 Length: 387 100.0 3.2E-36 2E-39 215.2 17.7 257 1-274 118-385 (387)
68 protein:vir:2685 Length: 387 # 100.0 3.2E-36 2E-39 215.2 17.7 257 1-274 118-385 (387)
69 protein:vir:8102 Length: 543 # 100.0 3.8E-35 2.4E-38 209.3 22.8 261 1-271 249-543 (543)
70 protein:vir:4600 Length: 415 # 100.0 5.7E-35 3.5E-38 208.3 23.8 266 1-274 120-408 (415)
71 protein:vir:4700 Length: 415 # 100.0 5.7E-35 3.5E-38 208.3 23.8 266 1-274 120-408 (415)
72 protein:vir:4830 Length: 397 # 100.0 7.5E-35 4.7E-38 207.7 23.5 267 1-274 109-389 (397)
73 protein:vir:9410 Length: 415 # 100.0 9.6E-35 6E-38 207.1 23.9 268 1-274 120-408 (415)
74 protein:vir:79987 Length: 415 100.0 1E-34 6.5E-38 206.9 24.0 268 1-274 120-408 (415)
75 protein:vir:81100 Length: 415 100.0 1E-34 6.5E-38 206.9 24.0 268 1-274 120-408 (415)
76 protein:vir:98339 Length: 415 100.0 1E-34 6.5E-38 206.9 24.0 268 1-274 120-408 (415)
77 protein:vir:78640 Length: 352 100.0 7.6E-36 4.7E-39 213.1 17.8 257 1-274 83-350 (352)
78 protein:vir:95376 Length: 425 100.0 6.8E-35 4.2E-38 207.9 22.3 262 1-274 138-425 (425)
79 protein:vir:93881 Length: 387 100.0 2.4E-35 1.5E-38 210.4 18.8 257 1-274 118-385 (387)
80 protein:vir:9361 Length: 402 # 100.0 1.7E-35 1.1E-38 211.2 17.7 257 1-274 133-400 (402)
81 protein:vir:7855 Length: 497 # 100.0 1.4E-34 8.8E-38 206.1 22.5 269 1-274 151-497 (497)
82 protein:vir:101650 Length: 497 100.0 1.4E-34 8.8E-38 206.1 22.5 269 1-274 151-497 (497)
83 protein:vir:4997 Length: 397 # 100.0 2.4E-34 1.5E-37 204.9 23.8 267 1-274 109-389 (397)
84 protein:vir:2504 Length: 305 # 100.0 8.2E-35 5.1E-38 207.5 21.1 260 1-274 1-302 (305)
85 protein:vir:1025 Length: 408 # 100.0 2.3E-34 1.4E-37 205.0 23.0 267 1-274 116-397 (408)
86 protein:vir:1268 Length: 397 # 100.0 2.3E-34 1.4E-37 205.0 22.7 262 1-270 123-397 (397)
87 protein:vir:100172 Length: 394 100.0 6.8E-34 4.2E-37 202.4 23.4 267 1-274 111-388 (394)
88 protein:vir:99920 Length: 311 100.0 3.2E-34 2E-37 204.2 21.3 261 1-270 1-311 (311)
89 protein:vir:100884 Length: 389 100.0 1.1E-33 6.6E-37 201.4 23.9 267 1-274 109-386 (389)
90 protein:vir:1383 Length: 421 # 100.0 4.6E-34 2.9E-37 203.3 21.6 264 1-274 114-387 (421)
91 protein:vir:3991 Length: 404 # 100.0 1.1E-33 6.6E-37 201.3 23.3 267 1-274 116-397 (404)
92 protein:vir:3364 Length: 347 # 100.0 2.1E-34 1.3E-37 205.2 19.3 265 1-272 1-347 (347)
93 protein:vir:96762 Length: 632 100.0 3.5E-34 2.2E-37 204.0 20.2 257 1-269 357-632 (632)
94 protein:vir:81227 Length: 413 100.0 1.9E-33 1.2E-36 200.0 23.3 268 1-273 118-413 (413)
95 protein:vir:3870 Length: 400 # 100.0 1.4E-33 8.6E-37 200.7 22.3 259 1-271 133-400 (400)
96 protein:vir:9704 Length: 394 # 100.0 2.7E-33 1.7E-36 199.1 23.7 262 1-274 127-394 (394)
97 protein:vir:93616 Length: 645 100.0 2E-33 1.2E-36 199.8 22.6 266 1-273 338-645 (645)
98 protein:vir:107593 Length: 392 100.0 2.2E-33 1.4E-36 199.6 22.7 266 1-274 106-388 (392)
99 protein:vir:102873 Length: 392 100.0 2.2E-33 1.4E-36 199.6 22.7 266 1-274 106-388 (392)
100 protein:vir:105004 Length: 392 100.0 2.2E-33 1.4E-36 199.6 22.7 266 1-274 106-388 (392)
101 protein:vir:102082 Length: 392 100.0 2.2E-33 1.4E-36 199.6 22.7 266 1-274 106-388 (392)
102 protein:vir:4511 Length: 409 # 100.0 2.1E-33 1.3E-36 199.8 22.5 267 1-273 117-409 (409)
103 protein:vir:7409 Length: 408 # 100.0 3E-33 1.8E-36 198.9 22.8 267 1-274 116-397 (408)
104 protein:vir:3845 Length: 395 # 100.0 4.6E-33 2.8E-36 197.9 23.0 267 1-274 105-387 (395)
105 protein:vir:1541 Length: 347 # 100.0 1.5E-33 9E-37 200.6 20.2 265 1-272 1-347 (347)
106 protein:vir:102119 Length: 404 100.0 5.6E-33 3.5E-36 197.4 23.2 269 1-274 110-404 (404)
107 protein:vir:1433 Length: 435 # 100.0 3.7E-33 2.3E-36 198.4 22.0 262 1-272 132-435 (435)
108 protein:vir:6212 Length: 434 # 100.0 3.5E-33 2.2E-36 198.5 21.5 267 1-273 141-434 (434)
109 protein:vir:5739 Length: 366 # 100.0 3E-33 1.9E-36 198.9 20.9 259 1-270 64-366 (366)
110 protein:vir:1084 Length: 437 # 100.0 5.4E-33 3.3E-36 197.5 22.2 265 1-274 156-431 (437)
111 protein:vir:94711 Length: 347 100.0 6.5E-34 4E-37 202.5 17.2 264 1-271 1-347 (347)
112 protein:vir:80376 Length: 435 100.0 8.9E-33 5.5E-36 196.3 22.0 261 1-272 130-435 (435)
113 protein:vir:105038 Length: 428 100.0 9.4E-33 5.8E-36 196.2 21.4 260 1-270 125-428 (428)
114 protein:vir:10450 Length: 344 100.0 1.5E-33 9.3E-37 200.5 16.7 261 1-268 1-344 (344)
115 protein:vir:8885 Length: 347 # 100.0 1.3E-33 7.9E-37 200.9 15.9 264 1-271 1-347 (347)
116 protein:vir:78739 Length: 332 100.0 4.4E-33 2.7E-36 198.0 17.9 263 1-268 7-332 (332)
117 protein:vir:4092 Length: 390 # 100.0 2.6E-32 1.6E-35 193.8 22.0 263 1-274 84-374 (390)
118 protein:vir:94576 Length: 347 100.0 2.6E-33 1.6E-36 199.2 16.3 263 1-270 1-347 (347)
119 protein:vir:2201 Length: 345 # 100.0 5.6E-33 3.5E-36 197.4 17.6 263 1-270 1-345 (345)
120 protein:vir:94989 Length: 349 100.0 7.2E-32 4.5E-35 191.3 21.5 263 1-274 1-324 (349)
121 protein:vir:78387 Length: 349 100.0 8E-32 4.9E-35 191.1 21.2 263 1-274 1-324 (349)
122 protein:vir:962 Length: 397 # 100.0 1E-31 6.3E-35 190.5 21.6 257 1-270 132-397 (397)
123 protein:vir:8420 Length: 477 # 100.0 1.8E-31 1.1E-34 189.2 17.8 272 1-274 157-475 (477)
124 protein:vir:80128 Length: 466 100.0 5.5E-31 3.4E-34 186.5 19.6 265 1-274 148-452 (466)
125 protein:vir:98635 Length: 377 100.0 1.7E-31 1E-34 189.3 15.9 265 1-270 79-377 (377)
126 protein:vir:80180 Length: 381 100.0 8.1E-31 5E-34 185.6 19.2 270 1-274 1-328 (381)
127 protein:vir:101291 Length: 381 99.9 2.6E-30 1.6E-33 182.8 18.9 258 1-274 76-374 (381)
128 protein:vir:9509 Length: 381 # 99.9 2.6E-30 1.6E-33 182.8 18.9 258 1-274 76-374 (381)
129 protein:vir:3136 Length: 322 # 99.9 1.6E-30 9.7E-34 184.0 17.6 265 1-274 1-322 (322)
130 protein:vir:95963 Length: 395 99.9 4.2E-30 2.6E-33 181.6 19.2 258 1-274 86-380 (395)
131 protein:vir:9643 Length: 377 # 99.9 6.6E-30 4.1E-33 180.6 19.6 254 1-270 79-377 (377)
132 protein:vir:100057 Length: 375 99.9 2.3E-29 1.4E-32 177.6 21.1 269 1-274 1-375 (375)
133 protein:vir:80213 Length: 334 99.9 6.3E-30 3.9E-33 180.7 16.6 265 1-270 1-334 (334)
134 protein:vir:99075 Length: 392 99.9 5.6E-29 3.5E-32 175.5 20.4 265 1-274 1-309 (392)
135 protein:vir:108303 Length: 418 99.9 2.1E-28 1.3E-31 172.3 23.5 262 1-271 1-418 (418)
136 protein:vir:103323 Length: 364 99.9 7.2E-29 4.5E-32 174.9 20.9 270 1-274 1-343 (364)
137 protein:vir:100632 Length: 381 99.9 6.5E-29 4E-32 175.1 18.1 260 1-274 76-372 (381)
138 protein:vir:78350 Length: 383 99.9 3.4E-29 2.1E-32 176.7 16.6 258 1-274 83-381 (383)
139 protein:vir:99675 Length: 324 99.9 7.7E-29 4.8E-32 174.7 15.2 233 34-274 1-302 (324)
140 protein:vir:78935 Length: 335 99.9 4.2E-28 2.6E-31 170.7 17.9 269 1-274 1-332 (335)
141 protein:vir:6324 Length: 335 # 99.9 9E-28 5.6E-31 168.9 17.8 269 1-274 1-332 (335)
142 protein:vir:108211 Length: 318 99.9 1E-27 6.3E-31 168.6 18.0 266 1-271 1-318 (318)
143 protein:vir:4197 Length: 314 # 99.9 5.4E-27 3.3E-30 164.6 21.5 265 1-273 14-314 (314)
144 protein:vir:102655 Length: 322 99.9 2.9E-26 1.8E-29 160.6 18.8 263 1-271 13-322 (322)
145 protein:vir:4159 Length: 315 # 99.9 1E-25 6.5E-29 157.5 21.2 262 1-269 18-315 (315)
146 protein:vir:79928 Length: 393 99.9 3.4E-25 2.1E-28 154.7 18.2 267 1-274 74-383 (393)
147 protein:vir:97031 Length: 402 99.9 1.2E-25 7.4E-29 157.2 15.5 270 1-274 1-339 (402)
148 protein:vir:9927 Length: 295 # 99.9 4.2E-25 2.6E-28 154.2 17.5 264 1-274 1-292 (295)
149 protein:vir:3525 Length: 423 # 99.9 1.4E-24 8.8E-28 151.3 19.3 263 1-274 1-312 (423)
150 protein:vir:105522 Length: 423 99.9 8E-24 4.9E-27 147.2 23.3 262 1-269 1-423 (423)
151 protein:vir:174 Length: 423 # 99.9 5.2E-24 3.3E-27 148.2 19.9 264 1-274 1-312 (423)
152 protein:vir:107120 Length: 329 99.9 3E-23 1.8E-26 144.1 23.6 268 1-274 30-311 (329)
153 protein:vir:97331 Length: 319 99.9 3.4E-23 2.1E-26 143.7 23.7 269 1-274 19-300 (319)
154 protein:vir:94800 Length: 319 99.9 3.4E-23 2.1E-26 143.7 23.7 269 1-274 19-300 (319)
155 protein:vir:105374 Length: 423 99.9 1.2E-23 7.6E-27 146.2 20.4 264 1-274 1-312 (423)
156 protein:vir:3158 Length: 321 # 99.8 2.2E-22 1.3E-25 139.4 21.6 267 1-274 1-316 (321)
157 protein:vir:105645 Length: 400 99.8 4.4E-23 2.7E-26 143.2 16.2 270 1-274 1-337 (400)
158 protein:vir:7019 Length: 401 # 99.8 3.2E-23 2E-26 143.9 15.1 270 1-274 1-343 (401)
159 protein:vir:97397 Length: 517 99.8 5.4E-22 3.3E-25 137.2 19.4 263 1-273 237-517 (517)
160 protein:vir:95131 Length: 325 99.8 1E-20 6.3E-24 130.2 21.0 260 1-274 1-295 (325)
161 protein:vir:106647 Length: 303 99.8 2.1E-20 1.3E-23 128.5 19.2 266 1-274 1-300 (303)
162 protein:vir:9875 Length: 296 # 99.7 1.9E-19 1.2E-22 123.2 19.0 256 1-271 1-296 (296)
163 protein:vir:79008 Length: 299 99.7 4.9E-18 3.1E-21 115.5 22.1 259 1-272 1-299 (299)
164 protein:vir:8324 Length: 410 # 99.7 6.7E-19 4.2E-22 120.2 16.0 262 1-268 127-410 (410)
165 protein:vir:96792 Length: 315 99.7 2.8E-17 1.7E-20 111.4 21.3 265 1-274 1-287 (315)
166 protein:vir:78920 Length: 290 99.7 3.1E-17 1.9E-20 111.1 21.0 255 1-269 1-290 (290)
167 protein:vir:4074 Length: 480 # 99.6 3.2E-18 2E-21 116.5 12.6 253 1-273 184-480 (480)
168 protein:vir:105464 Length: 346 99.5 5.2E-15 3.2E-18 98.9 21.2 264 1-274 1-302 (346)
169 protein:vir:102335 Length: 312 99.5 7.3E-15 4.5E-18 98.1 19.9 265 1-274 1-312 (312)
170 protein:vir:94933 Length: 330 99.5 2.5E-14 1.6E-17 95.1 21.1 267 1-271 25-330 (330)
171 protein:vir:79712 Length: 285 99.4 3.1E-14 1.9E-17 94.6 19.9 261 1-271 1-285 (285)
172 protein:vir:99523 Length: 311 99.4 1.8E-13 1.1E-16 90.4 20.7 264 1-270 1-311 (311)
173 protein:vir:2106 Length: 430 # 99.3 3.5E-13 2.2E-16 88.8 18.8 258 1-270 1-430 (430)
174 protein:vir:78090 Length: 302 99.3 1.2E-12 7.3E-16 86.0 21.2 263 1-272 1-302 (302)
175 protein:vir:1781 Length: 221 # 99.3 7.2E-14 4.5E-17 92.6 12.4 160 83-242 1-221 (221)
176 protein:vir:97255 Length: 310 99.2 1.3E-11 8.3E-15 80.2 21.6 266 1-270 1-310 (310)
177 protein:vir:9265 Length: 430 # 99.1 1.3E-11 8.4E-15 80.2 18.5 261 1-274 1-307 (430)
178 protein:vir:100939 Length: 430 99.1 1.3E-11 8.4E-15 80.2 18.5 261 1-274 1-307 (430)
179 protein:vir:79548 Length: 652 99.0 9.1E-11 5.6E-14 75.6 18.7 258 1-267 359-652 (652)
180 protein:vir:95512 Length: 693 99.0 1.5E-10 9.3E-14 74.4 18.5 259 1-268 394-693 (693)
181 protein:vir:95451 Length: 313 98.9 1.8E-10 1.1E-13 74.0 14.9 265 3-273 1-313 (313)
182 protein:vir:93696 Length: 364 98.9 3.7E-10 2.3E-13 72.3 16.5 270 1-272 1-364 (364)
183 protein:vir:99424 Length: 360 98.8 2.2E-09 1.4E-12 68.1 19.4 262 1-272 23-360 (360)
184 protein:vir:95875 Length: 401 98.8 9.6E-10 5.9E-13 70.0 16.8 270 1-271 9-401 (401)
185 protein:vir:103285 Length: 296 98.6 1.6E-08 1E-11 63.3 17.4 262 1-268 1-296 (296)
186 protein:vir:103886 Length: 302 98.6 4.2E-08 2.6E-11 61.0 18.8 253 1-270 1-302 (302)
187 protein:vir:8843 Length: 317 # 98.5 3.4E-08 2.1E-11 61.5 17.4 266 1-272 1-317 (317)
188 protein:vir:107687 Length: 319 98.5 2.3E-08 1.4E-11 62.5 16.3 262 1-268 21-319 (319)
189 protein:vir:80068 Length: 301 98.5 6.2E-08 3.9E-11 60.1 17.9 260 1-268 1-301 (301)
190 protein:vir:104342 Length: 314 98.3 1.2E-07 7.4E-11 58.5 15.5 263 1-268 1-314 (314)
191 protein:vir:79642 Length: 329 98.1 7.2E-07 4.5E-10 54.2 14.9 264 1-271 26-329 (329)
192 protein:vir:105610 Length: 430 98.0 2.1E-06 1.3E-09 51.7 15.9 270 1-274 1-426 (430)
193 protein:vir:104439 Length: 404 97.9 1.9E-06 1.2E-09 52.0 14.1 262 1-265 14-404 (404)
194 protein:vir:10123 Length: 404 97.9 1.9E-06 1.2E-09 52.0 14.1 262 1-265 14-404 (404)
195 protein:vir:819 Length: 404 # 97.9 1.9E-06 1.2E-09 52.0 14.1 262 1-265 14-404 (404)
196 protein:vir:3298 Length: 404 # 97.9 1.9E-06 1.2E-09 52.0 14.1 262 1-265 14-404 (404)
197 protein:vir:93858 Length: 400 97.9 3E-06 1.8E-09 50.9 15.0 257 1-268 117-400 (400)
198 protein:vir:95318 Length: 328 97.8 2.1E-06 1.3E-09 51.7 12.7 212 1-218 1-328 (328)
199 protein:vir:2770 Length: 318 # 97.7 1.3E-05 8.1E-09 47.4 16.4 224 1-230 1-318 (318)
200 protein:vir:5942 Length: 523 # 97.5 5.5E-06 3.4E-09 49.4 11.1 265 1-272 193-523 (523)
201 protein:vir:94070 Length: 339 97.5 1.7E-05 1.1E-08 46.7 13.8 256 1-268 35-339 (339)
202 protein:vir:78148 Length: 123 97.4 1.3E-06 7.9E-10 52.9 7.0 108 163-270 1-123 (123)
203 protein:vir:2736 Length: 348 # 97.2 0.00014 8.8E-08 41.7 20.1 263 1-271 1-348 (348)
204 protein:vir:101557 Length: 336 97.2 2.1E-05 1.3E-08 46.2 11.4 256 1-268 34-336 (336)
205 protein:vir:98525 Length: 331 97.1 6.8E-05 4.2E-08 43.4 13.5 213 1-218 1-331 (331)
206 protein:vir:107826 Length: 331 97.1 6.8E-05 4.2E-08 43.4 13.5 213 1-218 1-331 (331)
207 protein:vir:107388 Length: 331 97.1 6.8E-05 4.2E-08 43.4 13.5 213 1-218 1-331 (331)
208 protein:vir:103759 Length: 330 97.0 7.1E-05 4.4E-08 43.3 12.8 211 1-218 1-330 (330)
209 protein:vir:106286 Length: 534 96.9 0.00027 1.7E-07 40.1 15.2 273 1-274 174-525 (534)
210 protein:vir:3643 Length: 336 # 96.8 5E-05 3.1E-08 44.2 10.8 256 1-268 34-336 (336)
211 protein:vir:78558 Length: 336 96.8 4E-05 2.5E-08 44.7 10.2 256 1-268 31-336 (336)
212 protein:vir:107732 Length: 379 96.7 0.00016 9.7E-08 41.5 12.6 257 1-268 56-379 (379)
213 protein:vir:99888 Length: 309 96.7 0.00027 1.7E-07 40.1 13.7 258 1-271 1-309 (309)
214 protein:vir:103181 Length: 457 96.6 0.00011 7.1E-08 42.2 11.5 268 1-274 114-442 (457)
215 protein:vir:107882 Length: 307 96.6 0.00049 3E-07 38.7 16.3 259 1-270 1-307 (307)
216 protein:vir:96490 Length: 348 96.5 0.00055 3.4E-07 38.4 20.8 263 1-271 1-348 (348)
217 protein:vir:5255 Length: 304 # 96.5 0.00037 2.3E-07 39.4 13.3 257 1-267 1-304 (304)
218 protein:vir:106734 Length: 336 96.5 6.3E-05 3.9E-08 43.6 9.1 257 1-268 31-336 (336)
219 protein:vir:79078 Length: 307 96.5 0.00061 3.8E-07 38.2 15.5 259 1-270 1-307 (307)
220 protein:vir:107947 Length: 519 96.2 0.00074 4.6E-07 37.8 13.4 271 1-274 153-494 (519)
221 protein:vir:7324 Length: 335 # 96.2 0.00067 4.1E-07 38.0 13.1 212 1-219 1-335 (335)
222 protein:vir:4902 Length: 348 # 96.1 0.00097 6E-07 37.1 20.0 264 1-271 1-348 (348)
223 protein:vir:6901 Length: 522 # 95.7 0.0015 9.5E-07 36.0 15.6 272 1-274 167-506 (522)
224 protein:vir:5670 Length: 514 # 95.6 0.0016 9.9E-07 35.9 13.0 269 1-274 142-497 (514)
225 protein:vir:98480 Length: 348 95.1 0.0027 1.7E-06 34.6 20.8 261 1-269 1-348 (348)
226 protein:vir:1991 Length: 305 # 94.9 0.0033 2E-06 34.2 13.1 194 1-210 1-305 (305)
227 protein:vir:7214 Length: 521 # 94.7 0.0036 2.2E-06 34.0 16.3 270 1-274 166-505 (521)
228 protein:vir:3424 Length: 341 # 94.6 0.0039 2.4E-06 33.8 19.9 258 9-268 1-341 (341)
229 protein:vir:348 Length: 321 # 94.3 0.0049 3E-06 33.2 12.3 255 1-274 1-318 (321)
230 protein:vir:106590 Length: 349 94.1 0.0053 3.3E-06 33.0 18.1 260 1-268 1-349 (349)
231 protein:vir:103463 Length: 521 94.0 0.0056 3.5E-06 32.9 16.7 272 1-274 166-510 (521)
232 protein:vir:393 Length: 341 # 93.9 0.0061 3.8E-06 32.7 19.6 257 9-268 1-341 (341)
233 protein:vir:98143 Length: 524 93.7 0.0066 4.1E-06 32.5 15.2 268 1-274 167-515 (524)
234 protein:vir:80986 Length: 528 93.6 0.007 4.3E-06 32.4 14.9 269 1-274 174-519 (528)
235 protein:vir:106998 Length: 468 93.6 0.007 4.3E-06 32.4 14.3 271 1-274 121-451 (468)
236 protein:vir:100603 Length: 529 93.6 0.007 4.3E-06 32.4 14.0 273 1-274 160-520 (529)
237 protein:vir:104915 Length: 470 93.3 0.0081 5E-06 32.0 14.5 272 1-274 107-455 (470)
238 protein:vir:96079 Length: 382 92.6 0.011 6.7E-06 31.4 12.7 255 1-268 63-382 (382)
239 protein:vir:104549 Length: 462 92.3 0.012 7.4E-06 31.1 14.1 253 1-274 142-446 (462)
240 protein:vir:99576 Length: 388 92.2 0.012 7.7E-06 31.0 11.8 257 1-268 65-388 (388)
241 protein:vir:6378 Length: 346 # 91.4 0.016 9.9E-06 30.4 18.4 257 9-268 1-346 (346)
242 protein:vir:94528 Length: 286 89.9 0.024 1.5E-05 29.5 16.7 261 1-269 1-286 (286)
243 protein:vir:6601 Length: 528 # 89.7 0.025 1.6E-05 29.4 16.8 263 1-274 160-519 (528)
244 protein:vir:101039 Length: 529 87.9 0.036 2.2E-05 28.5 17.3 272 1-274 165-520 (529)
245 protein:vir:101811 Length: 529 85.8 0.05 3.1E-05 27.7 17.1 265 1-274 171-520 (529)
246 protein:vir:3969 Length: 287 # 84.9 0.056 3.5E-05 27.4 16.8 257 1-271 1-287 (287)
247 protein:vir:79246 Length: 304 77.1 0.13 8E-05 25.5 10.5 193 1-211 1-304 (304)
248 protein:vir:99228 Length: 304 76.5 0.14 8.4E-05 25.3 10.8 193 1-211 1-304 (304)
249 protein:vir:1153 Length: 338 # 61.6 0.35 0.00022 23.1 14.5 256 1-264 16-338 (338)
250 protein:vir:79157 Length: 339 57.5 0.43 0.00027 22.6 14.3 258 1-269 16-339 (339)
251 protein:vir:78186 Length: 337 56.2 0.46 0.00029 22.4 13.8 257 1-270 16-337 (337)
252 protein:vir:96442 Length: 418 54.2 0.51 0.00032 22.2 17.6 262 1-274 61-412 (418)
253 protein:vir:100331 Length: 342 52.2 0.56 0.00035 22.0 14.0 261 1-271 16-342 (342)
254 protein:vir:1829 Length: 355 # 49.3 0.64 0.0004 21.6 15.0 259 1-274 16-351 (355)
255 protein:vir:79171 Length: 337 47.7 0.69 0.00043 21.5 15.8 257 1-270 16-337 (337)
256 protein:vir:104011 Length: 337 46.0 0.75 0.00046 21.3 15.8 257 1-270 16-337 (337)
257 protein:vir:98566 Length: 355 45.5 0.77 0.00048 21.2 15.0 261 1-274 16-351 (355)
258 protein:vir:98871 Length: 314 41.9 0.91 0.00056 20.8 16.9 268 1-272 17-314 (314)
259 protein:vir:270 Length: 341 # 40.8 0.95 0.00059 20.7 13.0 261 1-274 1-338 (341)
260 protein:vir:5694 Length: 357 # 38.1 1.1 0.00067 20.4 14.4 262 1-274 16-354 (357)
261 protein:vir:2016 Length: 357 # 30.9 1.5 0.00095 19.6 14.6 262 1-274 16-354 (357)
262 protein:vir:6061 Length: 357 # 29.6 1.6 0.001 19.4 14.4 262 1-274 16-354 (357)

No 1
>protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103
Probab=100.00 E-value=2.5e-64 Score=369.32 Aligned_cols=274 Identities=100% Similarity=1.361 Sum_probs=267.3

Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE
Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274)
Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274)
||+++|+.+++++||+|++++++++.+.+++++++++++++++++|++++||+|+.+++++++.||++++.+++++++.+
T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~~g~~i~~~~it~~~~~ 80 (274)
T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274)
T ss_pred CCccccchhhhhhhHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCccccCCCCcCchhhcccceeE
Confidence 99999999999999999999999999999999999999999999999999999998899999999999999999999999

Q ss_pred EeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcccCHHHHHHHHHHHhhcCCCcc
Q lcl|Aclame:pro 81 AKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAIDKFNDEDLEPM 160 (274)
Q Consensus 81 ~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~~~d~iv~a~~~l~~~~~~~~ 160 (274)
++++++++.|.++|++..++..|++..+.++++++|++++|+.+++.+++++..+.++.+++|.|++|..+|+++++.++
T Consensus 81 ~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~~~~~~~~~~~d~i~dA~~~l~d~~~~~~ 160 (274)
T protein:vir:96 81 AKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAIDKFNDEDLEPM 160 (274)
T ss_pred EEEEeeeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCcccccHHHHHHHHHHhcccCCCce
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999

Q ss_pred EEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEEcCCeEEEEeccCceeeeccccc
Q lcl|Aclame:pro 161 VLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGAVKLITKRDFFLEKDRDAS 240 (274)
Q Consensus 161 ~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~~~~~~~ve~~r~~~ 240 (274)
+++|||.+++.|+|++..+|...++.+++.+++|.+|+++|++|++|+++|++++|+++++++++..++++.+|++|+++
T Consensus 161 ~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~g~ig~~~G~~Vi~s~~~p~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~ 240 (274)
T protein:vir:96 161 VLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGAVKLITKRDFFLEKDRDAS 240 (274)
T ss_pred EEEeCHHHHHHHHhcccccccccccccccceeecccceecCeeEEEcCCCCcceEEEEeCcceeeeecCCcccccccchh
Confidence 99999999999999988889988888889999999999999999999999999999999999999999999999999999

Q ss_pred cCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC
Q lcl|Aclame:pro 241 RKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274)
Q Consensus 241 ~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274)
++++.++++++||+++++|+++|++++++|..||
T Consensus 241 ~~~d~i~~~~~yg~~~~~~~~vv~~t~~~~~~~~ 274 (274)
T protein:vir:96 241 RKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274)
T ss_pred hcccEEEEeeEEEEEEEcCccEEEEEcCcccccC
Confidence 9999999999999999999999999999999999

No 2
>protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511
Probab=100.00 E-value=2.3e-63 Score=363.97 Aligned_cols=274 Identities=80% Similarity=1.158 Sum_probs=267.4

Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE
Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274)
Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274)
||+++|+.+++++||+|++++.+++++.+++++++++++++++++|++++||+|+.+++++++.||++++.+++++++.+
T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~eg~~i~~~~it~~~~~ 80 (274)
T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274)
T ss_pred CCccceehhheechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCcccccCCCcccccccccceeE
Confidence 99999999999999999999999999999999999999999999999999999998899999999999999999999999

Q ss_pred EeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcccCHHHHHHHHHHHhhcCCCcc
Q lcl|Aclame:pro 81 AKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAIDKFNDEDLEPM 160 (274)
Q Consensus 81 ~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~~~d~iv~a~~~l~~~~~~~~ 160 (274)
++++++++.|.++|+...++..|+++.+.++++++|++++|+.+++.+.++..++.++++++|.|++|..+|++++..++
T Consensus 81 ~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~dA~~~l~d~~~~~~ 160 (274)
T protein:vir:93 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLEPM 160 (274)
T ss_pred EEeeeecccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHHhhhccCCcc
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999

Q ss_pred EEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEEcCCeEEEEeccCceeeeccccc
Q lcl|Aclame:pro 161 VLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGAVKLITKRDFFLEKDRDAS 240 (274)
Q Consensus 161 ~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~~~~~~~ve~~r~~~ 240 (274)
+++|||.+++.|+|++..+|...+..+++.+++|.+|+++|++|++|+++|++++|++++++++++.++++.+|++|++.
T Consensus 161 ~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gai~~~~~~~~~vE~~Rd~~ 240 (274)
T protein:vir:93 161 VLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKLILKRDFFLEVARDAS 240 (274)
T ss_pred EEEeCHHHHHHHHhhhhhcccccccccccceeecccceecCeeEEEcCCCCcceEEEEeCCeEEEEecCCcccccccchh
Confidence 99999999999999988889988998889999999999999999999999999999999999999999999999999999

Q ss_pred cCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC
Q lcl|Aclame:pro 241 RKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274)
Q Consensus 241 ~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274)
++.+.++++++|++++++|+++|++++++||-=|
T Consensus 241 ~~~d~i~~~~~y~~~~~~~~~~v~~t~~~~s~~~ 274 (274)
T protein:vir:93 241 TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274)
T ss_pred hcccEEEEEEEEEEEEEcCCceEEEeeCccccCC
Confidence 9999999999999999999999999999999999

No 3
>protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376
Probab=100.00 E-value=3.5e-63 Score=363.02 Aligned_cols=274 Identities=81% Similarity=1.161 Sum_probs=267.5

Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE
Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274)
Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274)
||+.+|+.+++|+||+|+++|.+++.+.+++++++.+++++.+++|++|+||+|+.+++++++.||++++.++++.++..
T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~~ 80 (274)
T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274)
T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccchhhcccceee
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999

Q ss_pred EeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcccCHHHHHHHHHHHhhcCCCcc
Q lcl|Aclame:pro 81 AKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAIDKFNDEDLEPM 160 (274)
Q Consensus 81 ~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~~~d~iv~a~~~l~~~~~~~~ 160 (274)
++++++++.|.++|+...++..|++..+.++++.+|++++|+.+++.+.++..+...+++++|.|++|..+|++++..++
T Consensus 81 ~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~~~~~~a~~~d~i~dA~~~lgd~~~~~~ 160 (274)
T protein:vir:12 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLEPM 160 (274)
T ss_pred EEeeeecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHHhcccccccc
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999

Q ss_pred EEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEEcCCeEEEEeccCceeeeccccc
Q lcl|Aclame:pro 161 VLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGAVKLITKRDFFLEKDRDAS 240 (274)
Q Consensus 161 ~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~~~~~~~ve~~r~~~ 240 (274)
+++|||.+++.|++++..+|...++.+++.+++|.+|+++|++|++|+.+|++++|+++++|+++..++++.+|++|+++
T Consensus 161 ~ivv~p~~~~~L~k~~~~~fv~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~ 240 (274)
T protein:vir:12 161 VLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRSNKLEAGTAILAKKGAVKLILKRDFFLEVARDAS 240 (274)
T ss_pred EEEeCHHHHHHHHhhhhhhccccccccccceecccceeecCeeEEEeCCCCcceEEEEeccceeeeecCCceeccccchh
Confidence 99999999999999988889998888889999999999999999999999999999999999999999999999999999

Q ss_pred cCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC
Q lcl|Aclame:pro 241 RKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274)
Q Consensus 241 ~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274)
++.+.++++++|++++++|+++|++++++||-=|
T Consensus 241 ~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274)
T protein:vir:12 241 TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274)
T ss_pred hcccEEEeeeEEEEEEEcCCceEEEEcCCccccC
Confidence 9999999999999999999999999999999999

No 4
>protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758
Probab=100.00 E-value=7e-63 Score=361.33 Aligned_cols=274 Identities=80% Similarity=1.156 Sum_probs=267.6

Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE
Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274)
Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274)
||+++|+.+++|+||+|++++.+++.+.+++++++.+++++++++|++|+||+|+.+++++++.||++++.+++++++.+
T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~~lt~~~~~ 80 (274)
T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274)
T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCcccccccccceeE
Confidence 99999999999999999999999999999999999999999999999999999998899999999999999999999999

Q ss_pred EeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcccCHHHHHHHHHHHhhcCCCcc
Q lcl|Aclame:pro 81 AKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAIDKFNDEDLEPM 160 (274)
Q Consensus 81 ~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~~~d~iv~a~~~l~~~~~~~~ 160 (274)
++++++++.|.++|+...++..|+++.+.++++++|++++|+.+++.+.++..+..++++++|.|++|..+|++++..++
T Consensus 81 ~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~~~~~~~~d~i~dA~~~l~d~~~~~~ 160 (274)
T protein:vir:94 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLEPM 160 (274)
T ss_pred EEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccCHHHHHHHHHHhhccCCCce
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999

Q ss_pred EEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEEcCCeEEEEeccCceeeeccccc
Q lcl|Aclame:pro 161 VLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGAVKLITKRDFFLEKDRDAS 240 (274)
Q Consensus 161 ~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~~~~~~~ve~~r~~~ 240 (274)
+++|||.+++.|+|++..+|...++.+++.+++|.+|+++|++|++|+++|++++|++++++++++.++++.+|++|++.
T Consensus 161 ~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~ 240 (274)
T protein:vir:94 161 VLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKLILKRDFFLEVARDAS 240 (274)
T ss_pred EEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecCeeEEEcCCCCcceEEEEeCcceEeeecCCceeccccchh
Confidence 99999999999999988889999998889999999999999999999999999999999999999999999999999999

Q ss_pred cCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC
Q lcl|Aclame:pro 241 RKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274)
Q Consensus 241 ~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274)
++.+.++++++|++++++|+++|++|+++||-=|
T Consensus 241 ~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274)
T protein:vir:94 241 TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274)
T ss_pred hcccEEEEEEEEEEEEEcCCceEEEecCcccccC
Confidence 9999999999999999999999999999999999

No 5
>protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789
Probab=100.00 E-value=7e-63 Score=361.33 Aligned_cols=274 Identities=80% Similarity=1.156 Sum_probs=267.6

Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE
Q lcl|Aclame:pro 1
MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274) ||+++|+.+++|+||+|++++.+++.+.+++++++.+++++++++|++|+||+|+.+++++++.||++++.+++++++.+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCcccccccccceeE Confidence 99999999999999999999999999999999999999999999999999999998899999999999999999999999 Q ss_pred EeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcccCHHHHHHHHHHHhhcCCCcc Q lcl|Aclame:pro 81 AKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAIDKFNDEDLEPM 160 (274) Q Consensus 81 ~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~~~d~iv~a~~~l~~~~~~~~ 160 (274) ++++++++.|.++|+...++..|+++.+.++++++|++++|+.+++.+.++..+..++++++|.|++|..+|++++..++ T Consensus 81 ~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~~~~~~~~d~i~dA~~~l~d~~~~~~ 160 (274) T protein:vir:97 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLEPM 160 (274) T ss_pred EEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccCHHHHHHHHHHhhccCCCce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEEcCCeEEEEeccCceeeeccccc Q lcl|Aclame:pro 161 VLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGAVKLITKRDFFLEKDRDAS 240 (274) Q Consensus 161 ~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~~~~~~~ve~~r~~~ 240 (274) +++|||.+++.|+|++..+|...++.+++.+++|.+|+++|++|++|+++|++++|++++++++++.++++.+|++|++. 
T Consensus 161 ~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~ 240 (274) T protein:vir:97 161 VLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKLILKRDFFLEVARDAS 240 (274) T ss_pred EEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecCeeEEEcCCCCcceEEEEeCcceEeeecCCceeccccchh Confidence 99999999999999988889999998889999999999999999999999999999999999999999999999999999 Q ss_pred cCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 241 RKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 241 ~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) ++.+.++++++|++++++|+++|++|+++||-=| T Consensus 241 ~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:97 241 TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred hcccEEEEEEEEEEEEEcCCceEEEecCcccccC Confidence 9999999999999999999999999999999999 No 6 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=100.00 E-value=1.9e-62 Score=359.01 Aligned_cols=274 Identities=81% Similarity=1.171 Sum_probs=267.4 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274) ||+.+|+.+++|+||+|++++.+++.+.+++++++.+++.+++++|++|+||+|+.+++++++.+|++++.++++.++.+ T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccchhhcccceeE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred 
EeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcccCHHHHHHHHHHHhhcCCCcc Q lcl|Aclame:pro 81 AKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAIDKFNDEDLEPM 160 (274) Q Consensus 81 ~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~~~d~iv~a~~~l~~~~~~~~ 160 (274) ++++++++.|.++|++..++..|++..+.++++.+|++++|+.+++.++++....+.++++++.|++|..+|++++..++ T Consensus 81 ~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~~~~~~~~d~i~~A~~~lgd~~~~~~ 160 (274) T protein:vir:96 81 AKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLTVEADITKLTGLQTAIDKFNDEDLEPM 160 (274) T ss_pred EEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHHhcccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEEcCCeEEEEeccCceeeeccccc Q lcl|Aclame:pro 161 VLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGAVKLITKRDFFLEKDRDAS 240 (274) Q Consensus 161 ~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~~~~~~~ve~~r~~~ 240 (274) +++|||.+++.|++++..+|...++.+++.+++|.+|+++|++|++|+++|++++|+++++|+++..++++.+|++|+++ T Consensus 161 ~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~~~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~ 240 (274) T protein:vir:96 161 VLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALGAVIVRSNKLEAGTAILAKKGAVKLITKRDFFLETDRDPS 240 (274) T ss_pred EEEeCHHHHHHHHhhccccccccccccccceeccccceecCeEEEEeCCCCCceEEEEeccceeeeecCCcccccccccc Confidence 99999999999999988889999898889999999999999999999999999999999999999999999999999999 Q ss_pred cCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 241 RKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 241 ~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) ++++.++++++|++++++|+++|++++++.|-=| T Consensus 241 ~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~~~~~ 274 (274) T protein:vir:96 241 TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred cccCEEEEeEEEEEEEEcCCcEEEEEcCCccccC Confidence 9999999999999999999999999999999999 No 
7 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=100.00 E-value=1.9e-62 Score=359.01 Aligned_cols=274 Identities=81% Similarity=1.171 Sum_probs=267.4 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274) ||+.+|+.+++|+||+|++++.+++.+.+++++++.+++.+++++|++|+||+|+.+++++++.+|++++.++++.++.+ T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccchhhcccceeE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcccCHHHHHHHHHHHhhcCCCcc Q lcl|Aclame:pro 81 AKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAIDKFNDEDLEPM 160 (274) Q Consensus 81 ~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~~~d~iv~a~~~l~~~~~~~~ 160 (274) ++++++++.|.++|++..++..|++..+.++++.+|++++|+.+++.++++....+.++++++.|++|..+|++++..++ T Consensus 81 ~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~~~~~~~~d~i~~A~~~lgd~~~~~~ 160 (274) T protein:vir:95 81 AKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLTVEADITKLTGLQTAIDKFNDEDLEPM 160 (274) T ss_pred EEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHHhcccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEEcCCeEEEEeccCceeeeccccc Q lcl|Aclame:pro 161 
VLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGAVKLITKRDFFLEKDRDAS 240 (274) Q Consensus 161 ~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~~~~~~~ve~~r~~~ 240 (274) +++|||.+++.|++++..+|...++.+++.+++|.+|+++|++|++|+++|++++|+++++|+++..++++.+|++|+++ T Consensus 161 ~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~~~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~ 240 (274) T protein:vir:95 161 VLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALGAVIVRSNKLEAGTAILAKKGAVKLITKRDFFLETDRDPS 240 (274) T ss_pred EEEeCHHHHHHHHhhccccccccccccccceeccccceecCeEEEEeCCCCCceEEEEeccceeeeecCCcccccccccc Confidence 99999999999999988889999898889999999999999999999999999999999999999999999999999999 Q ss_pred cCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 241 RKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 241 ~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) ++++.++++++|++++++|+++|++++++.|-=| T Consensus 241 ~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~~~~~ 274 (274) T protein:vir:95 241 TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred cccCEEEEeEEEEEEEEcCCcEEEEEcCCccccC Confidence 9999999999999999999999999999999999 No 8 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=100.00 E-value=4.9e-62 Score=356.70 Aligned_cols=274 Identities=81% Similarity=1.189 Sum_probs=266.2 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274) ||+++|+.+++|+||+|+++|.+++++.+++++++.+++++.+++|++++||+|+.++++++++||++++.+++++++.. 
T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg~~i~~~~lt~~~~~ 80 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEGQKIPVDKIETNRRE 80 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCCCccccccCCCccCccccccceee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcccCHHHHHHHHHHHhhcCCCcc Q lcl|Aclame:pro 81 AKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAIDKFNDEDLEPM 160 (274) Q Consensus 81 ~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~~~d~iv~a~~~l~~~~~~~~ 160 (274) ++++++++.|.++|+....+..|++..+.++++.+|++++|+.+++.+.++..+.+++.+++|.|++|..+|+++++.++ T Consensus 81 a~i~~~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~~~~~~~~~t~d~i~~A~~~lgd~~~~~~ 160 (276) T protein:vir:10 81 AKIHKIGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTKLTVSADIGTLAGLEAAIDTFDDEDLEPM 160 (276) T ss_pred EEeehccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHHhccccCccc Confidence 99999999999999999999999999999999999999999999999999999999989999999999999999999999 Q ss_pred EEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEEcCCeEEEEeccCceeeeccccc Q lcl|Aclame:pro 161 VLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGAVKLITKRDFFLEKDRDAS 240 (274) Q Consensus 161 ~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~~~~~~~ve~~r~~~ 240 (274) +++|||++++.|+|++..+|...++.+++.+++|++|+++|++|++|+++|++++|+++++|++++.++++.+|++|+++ T Consensus 161 ~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gAi~~~~~~~~~vE~dRd~~ 240 (276) T protein:vir:10 161 VLFINPKDAGKLRSSASDNFTRATELGDNIIVKGAFGEALGAVIVRSKKLDEGEAILAKRGAVKLITKRDFFLETDRDPS 240 (276) T ss_pred EEEEcHHHHHHHHHhccccccccccccccceeccccceecceeEEEcCCCCcceEEEEeccceeeeecCCceeecccchh Confidence 99999999999999988899999988899999999999999999999999999999999999999999999999999999 Q ss_pred 
cCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 241 RKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 241 ~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) ++.+.++++++|++++++|+++|++++++.|.=- T Consensus 241 ~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (276) T protein:vir:10 241 TKTTALYSDKHYVAYLYDESKAVKVTKGAGTTDS 274 (276) T ss_pred hcccEEEEeeEEEEEEEcCcceEEEecCCcCCcC Confidence 9999999999999999999999999999877655 No 9 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=100.00 E-value=2e-61 Score=353.42 Aligned_cols=274 Identities=80% Similarity=1.169 Sum_probs=263.1 Q ss_pred CCc-cccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCccccccccccee Q lcl|Aclame:pro 1 MAQ-GTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKR 79 (274) Q Consensus 1 ma~-~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~ 79 (274) ||+ +.|+.+|+|+||+|++++++++.+.+++++++.+++++.+++|++++||+|+.+++++++.||++++.+++++++. 
T Consensus 1 ~~~~~~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~ 80 (275) T protein:vir:96 1 MALENMTKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAKVVPEGEEIPIDLIETKKR 80 (275) T ss_pred CCCcccchhhhhhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeeccCCccccccCCCCcchhhccccee Confidence 776 4599999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcccCHHHHHHHHHHHhhcCCCc Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAIDKFNDEDLEP 159 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~~~d~iv~a~~~l~~~~~~~ 159 (274) +++++++++.|+++|++..++..|++..+.++++.+|++++|+.+++.+++++.+.+++.+++|.|++|..+|++++..+ T Consensus 81 ~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~~~~~~~~~~~d~i~dA~~~lgd~~~~~ 160 (275) T protein:vir:96 81 QATIRKIGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGATLKVEADITKLAGLQTAIDKFNDEDLEP 160 (275) T ss_pred eEEeehhcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHHhccccCCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEEcCCeEEEEeccCceeeecccc Q lcl|Aclame:pro 160 MVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGAVKLITKRDFFLEKDRDA 239 (274) Q Consensus 160 ~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~~~~~~~ve~~r~~ 239 (274) ++++|||.+++.|+|++..+|...+..+++.+++|.+|+++|++|++|+++|++++|+++++|+++..++++.+|++|++ T Consensus 161 ~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~i~~~gA~~~~~~~~~~vE~~Rd~ 240 (275) T protein:vir:96 161 MVLFVNPLDAGKLRASATDNFTRATLLGDNVIVKGAFGEALGAIIVRSNKIKEGEAILAKRGAVKLITKRDFFLETERHA 240 (275) T ss_pred cEEEeCHHHHHHHHhcccccccccccccccceeccccceecCeeEEEeCCCCcceEEEEeccceeeeecCCcccccccch Confidence 99999999999999998888998888888999999999999999999999999999999999999999999999999999 Q ss_pred 
ccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 240 SRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 240 ~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) +++++.++++++|++++++|+++|+++++.+.-=. T Consensus 241 ~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 275 (275) T protein:vir:96 241 SHKSTALFSDKHYVAYLYDESKVVKITKSASGLGV 275 (275) T ss_pred hhcCcEEEEeEEEEEEEEcCccEEEEEecccccCC Confidence 99999999999999999999999999998777555 No 10 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=100.00 E-value=1.9e-60 Score=347.94 Aligned_cols=272 Identities=48% Similarity=0.760 Sum_probs=260.0 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274) ||+++|+.+++++||+|++++++++.+++++++++.+++.+.+++|++++||+++..++++|++||+++|.++++++++. 
T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~~~~~~~~ 80 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQLGFKKTT 80 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCcccccccccceEE Confidence 99999999999999999999999999999999999999888999999999999998899999999999999999999999 Q ss_pred EeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcccCHHHHHHHHHHHhhcCCCcc Q lcl|Aclame:pro 81 AKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAIDKFNDEDLEPM 160 (274) Q Consensus 81 ~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~~~d~iv~a~~~l~~~~~~~~ 160 (274) ++++++++.|++||++..++.+|+++.+.++++++|++++|+.+++.+.++.....+ ..++|.|++|+.+|++++..++ T Consensus 81 ~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~~~~-~~t~d~i~da~~~l~~~~~~~~ 159 (272) T protein:vir:30 81 MTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQTVEA-TATVDGVSKALDIFNDEDDAET 159 (272) T ss_pred EEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc-ccCHHHHHHHHHHHhccCCCcc Confidence 999999999999999999999999999999999999999999999999888776654 5689999999999999999999 Q ss_pred EEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEEcCCeEEEEeccCceeeeccccc Q lcl|Aclame:pro 161 VLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGAVKLITKRDFFLEKDRDAS 240 (274) Q Consensus 161 ~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~~~~~~~ve~~r~~~ 240 (274) +|+|||.+++.|++++..++...++.+++.+.+|.+|+++|+||++|+++|++++|++++++++++.++++.+|++|++. 
T Consensus 160 ~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s~~~p~~t~~~~~~~a~~~~~~~~~~ve~~r~~~ 239 (272) T protein:vir:30 160 VIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRSRKCPKGTAYMVRKGALRIMLKRNTMVETDRDIT 239 (272) T ss_pred EEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEcCCCCcceEEEEcCCeEEEEecCCceeeeccccc Confidence 99999999999999988888888888888899999999999999999999999999999999999999999999999999 Q ss_pred cCccEEEEEEEEEEEEEcCcceEEEEeCCCccc Q lcl|Aclame:pro 241 RKSTALYSDKHYVAYLYDESKVVKITKGAGDEV 273 (274) Q Consensus 241 ~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~ 273 (274) ++.+.+++++||++++++|+++|++|.++|--- T Consensus 240 ~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:30 240 KAINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred cceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 999999999999999999999999999988777 No 11 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=100.00 E-value=1.9e-60 Score=347.94 Aligned_cols=272 Identities=48% Similarity=0.760 Sum_probs=260.0 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274) ||+++|+.+++++||+|++++++++.+++++++++.+++.+.+++|++++||+++..++++|++||+++|.++++++++. 
T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~~~~~~~~ 80 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQLGFKKTT 80 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCcccccccccceEE Confidence 99999999999999999999999999999999999999888999999999999998899999999999999999999999 Q ss_pred EeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcccCHHHHHHHHHHHhhcCCCcc Q lcl|Aclame:pro 81 AKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAIDKFNDEDLEPM 160 (274) Q Consensus 81 ~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~~~d~iv~a~~~l~~~~~~~~ 160 (274) ++++++++.|++||++..++.+|+++.+.++++++|++++|+.+++.+.++.....+ ..++|.|++|+.+|++++..++ T Consensus 81 ~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~~~~-~~t~d~i~da~~~l~~~~~~~~ 159 (272) T protein:vir:98 81 MTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQTVEA-TATVDGVSKALDIFNDEDDAET 159 (272) T ss_pred EEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc-ccCHHHHHHHHHHHhccCCCcc Confidence 999999999999999999999999999999999999999999999999888776654 5689999999999999999999 Q ss_pred EEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEEcCCeEEEEeccCceeeeccccc Q lcl|Aclame:pro 161 VLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGAVKLITKRDFFLEKDRDAS 240 (274) Q Consensus 161 ~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~~~~~~~ve~~r~~~ 240 (274) +|+|||.+++.|++++..++...++.+++.+.+|.+|+++|+||++|+++|++++|++++++++++.++++.+|++|++. 
T Consensus 160 ~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s~~~p~~t~~~~~~~a~~~~~~~~~~ve~~r~~~ 239 (272) T protein:vir:98 160 VIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRSRKCPKGTAYMVRKGALRIMLKRNTMVETDRDIT 239 (272) T ss_pred EEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEcCCCCcceEEEEcCCeEEEEecCCceeeeccccc Confidence 99999999999999988888888888888899999999999999999999999999999999999999999999999999 Q ss_pred cCccEEEEEEEEEEEEEcCcceEEEEeCCCccc Q lcl|Aclame:pro 241 RKSTALYSDKHYVAYLYDESKVVKITKGAGDEV 273 (274) Q Consensus 241 ~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~ 273 (274) ++.+.+++++||++++++|+++|++|.++|--- T Consensus 240 ~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:98 240 KAINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred cceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 999999999999999999999999999988777 No 12 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=100.00 E-value=2.5e-60 Score=347.35 Aligned_cols=271 Identities=46% Similarity=0.699 Sum_probs=255.0 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274) ||+++|+.+++|+||+|++++.+++++.+++++++.+++.+++.+|++|+||+|+.+++++++.||+.++.+++++++.+ T Consensus 1 Ma~~~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~~~~~g~~i~~~~lt~~~~~ 80 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQDVAEGAAIDYSALETESVK 80 (278) T ss_pred CCCcceehhheecHHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeeccCCcceeecCCCcCcccccccceee Confidence 99999999999999999999999999999999999999999999999999999998899999999999999999999999 Q ss_pred 
EeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcc------cCHHHHHHHHHHHhh Q lcl|Aclame:pro 81 AKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADI------TKLDGLQTAIDKFND 154 (274) Q Consensus 81 ~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~------~~~d~iv~a~~~l~~ 154 (274) ++++++++.|+++|++..++..|+++.+.++++++|++++|+.+++.+.++......+. ..++.++++..+|.. T Consensus 81 ~~i~~~~~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~~~~~~~t~~~~~~~~~~~~da~~~l~~ 160 (278) T protein:vir:80 81 HGIKKAGKGVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTLEVKGAINIGLIDKIENTFTDAPDAIED 160 (278) T ss_pred EeeehhhccccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhhhHHHHHHHHHHhhcc Confidence 99999999999999999999999999999999999999999999999988776544322 347889999999987 Q ss_pred cCC-CccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEEcCCeEEEEeccCcee Q lcl|Aclame:pro 155 EDL-EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGAVKLITKRDFFL 233 (274) Q Consensus 155 ~~~-~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~~~~~~~v 233 (274) ++. 
..++++|||.+|+.|+|++..+|...+..+++.+++|.+|+++|++|++|+++|.+++|+++++++++..++++.+ T Consensus 161 ~~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gAi~~~~~~~~~v 240 (278) T protein:vir:80 161 ESITTTGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVKGAFGELLGWEIVRTKKLADGNALAVKAGALKTFLKRNLLA 240 (278) T ss_pred cCCCcccEEEECHHHHHHHHhhhhhhccccccccccceeeccceeecceeEEEcCCCCcceEEEEeccceeeeecCCccc Confidence 765 4567999999999999998888988888889999999999999999999999999999999999999999999999 Q ss_pred eeccccccCccEEEEEEEEEEEEEcCcceEEEEeCCCc Q lcl|Aclame:pro 234 EKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAGD 271 (274) Q Consensus 234 e~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~ 271 (274) |++|+++++++.++++++|++++++|+++|++++.|++ T Consensus 241 E~~Rd~~~~~d~i~~~~~yg~~v~~~~~~v~it~~a~~ 278 (278) T protein:vir:80 241 ESGRDMDHKLTKFNADQHYAVALVDETKAVKVVPVAGN 278 (278) T ss_pred ccccchhhccceeeeeeEEEEEEEcCcceEEEeeccCC Confidence 99999999999999999999999999999999999999 No 13 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=100.00 E-value=3.7e-58 Score=335.46 Aligned_cols=268 Identities=49% Similarity=0.790 Sum_probs=251.6 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274) ||++.|+.+++++||+|++++.+++.+.+++++++..++.+.+.+|++|+||+|+..+++++++||++++.++++.++.+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~~~~eg~~i~~~~lt~~~~~ 80 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAEGGEISLDKIGTTTKS 80 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeeccCccccccCCCCccChhhcCCccee Confidence 
99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcccCHHHHHHHHHHHhhcCCCcc Q lcl|Aclame:pro 81 AKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAIDKFNDEDLEPM 160 (274) Q Consensus 81 ~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~~~d~iv~a~~~l~~~~~~~~ 160 (274) ++++++++.|+++|++..++..|+++.+.++++.+|++++|+.+++.+.++.....+ .+++|.|++|+.+|++++..++ T Consensus 81 ~~i~~~~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~~~~~~~-~~~~d~i~~A~~~lgd~~~~~~ 159 (272) T protein:vir:36 81 VTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTTSQTVST-KANVDGVQAALDIFNDEDAQAY 159 (272) T ss_pred EeeehhhccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc-cccHHHHHHHHHHhhhcCCCce Confidence 999999999999999999999999999999999999999999999999988776654 5689999999999999999999 Q ss_pred EEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceE----EEEcCCeEEEEeccCceeeec Q lcl|Aclame:pro 161 VLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEA----LLAKKGAVKLITKRDFFLEKD 236 (274) Q Consensus 161 ~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~----~l~~~~a~~~~~~~~~~ve~~ 236 (274) +++|||.+++.|+|++...+. ....+++.+++|.+|+++|++|++|+++|+++. 
|++.++|+++..++++.+|++ T Consensus 160 ~ivv~p~~~~~L~k~~~~~~~-~~~~~~~~~~~G~ig~~~G~~Vv~s~~~p~~~~~~~~~~~~~gA~~~~~~~~~~vE~~ 238 (272) T protein:vir:36 160 VLIVNPKDAAKIRKDANAKNI-GSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETD 238 (272) T ss_pred EEEEcHHHHHHHhcccccccc-cccccccceeeeccceecCeeEEEeCCCCCCceeEEEEEecccceeeeecCCcccccc Confidence 999999999999998765443 344566789999999999999999999998776 788999999999999999999 Q ss_pred cccccCccEEEEEEEEEEEEEcCcceEEEEeCCC Q lcl|Aclame:pro 237 RDASRKSTALYSDKHYVAYLYDESKVVKITKGAG 270 (274) Q Consensus 237 r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa 270 (274) |+++++++.++++++|++++++|+++|++|.+.= T Consensus 239 R~~~~~~d~i~~~~~y~~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 239 RDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 272 (272) T ss_pred cchhhcCcEEEEEEEEEEEEEcCccEEEEeecCC Confidence 9999999999999999999999999999999887 No 14 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=100.00 E-value=3e-55 Score=319.46 Aligned_cols=268 Identities=34% Similarity=0.505 Sum_probs=249.8 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274) || .|+.+++++||+|++||.+++.+.+++++++.+++++.+++|++|+||+|+.+++++.+.||++++.+++++++.. 
T Consensus 1 Ma--~T~~~d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~igdae~~~eg~~i~~~~lt~~~~~ 78 (270) T protein:vir:95 1 MT--QTKKANLINPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYAYIGAAEDLQEGVAMDTTQMSMTTTK 78 (270) T ss_pred CC--ceehhhhcchHHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeecCCCccccccCCCccchhhcccchhe Confidence 77 5889999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcccCHHHHHHHHHHHhhcCCCcc Q lcl|Aclame:pro 81 AKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAIDKFNDEDLEPM 160 (274) Q Consensus 81 ~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~~~d~iv~a~~~l~~~~~~~~ 160 (274) ++++++++.|.++|+....+..|++..+.++++.+|++++|+.+++.++++..+.+. .++++.|++|+.+|+++...+. T Consensus 79 a~i~~~gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~~a~~~~~~-~~t~~~~~dA~~~lgd~~~~~~ 157 (270) T protein:vir:95 79 VTVKETGKAVEVTQTAIITNVNGTLQEASRQLAMSLADKVEIDYIAELNKSKQTATV-SADATGILDAIEVFNSENDEDY 157 (270) T ss_pred eeeehhhCcceecHHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHhccccccccc-ccCHHHHHHHHHHhccccCCCc Confidence 999999999999999999999999999999999999999999999999998877644 5689999999999999999999 Q ss_pred EEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcC-CCCcceEEEEcCCeEEEEeccCceeeecccc Q lcl|Aclame:pro 161 VLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSN-KLNKGEALLAKKGAVKLITKRDFFLEKDRDA 239 (274) Q Consensus 161 ~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~-~~p~~~~~l~~~~a~~~~~~~~~~ve~~r~~ 239 (274) +++|||++++.|+|+.. 
....+.+++.+++|.+|+++|++|++++ ..+++++|+++++|+++...+++.+|++|++ T Consensus 158 ~i~vhs~~~~~Lrk~~~---~~~~~~~~~~~~~G~ig~~~G~~Viv~s~~~~~~~~~l~~~gAi~~~~~~~~~vEtdRd~ 234 (270) T protein:vir:95 158 VLYVNPKDYNKLVKSLF---KVGGNVQDRAISKGDLVEIVGVSDIVKSKRVSENTAFLQRYGAMEIVNKKKPEAYTDFDI 234 (270) T ss_pred EEEEcHHHHHHHHhhhc---ccccccccchhcccccceecceeEEEeCCCCCceeEEEEeccceeeeecCCceeeeccch Confidence 99999999999998863 3455667788999999999999987755 5678999999999999999999999999999 Q ss_pred ccCccEEEEEEEEEEEEEcCcceEEEE-eCCCcccC Q lcl|Aclame:pro 240 SRKSTALYSDKHYVAYLYDESKVVKIT-KGAGDEVM 274 (274) Q Consensus 240 ~~~~~~i~~~~~~~~~v~~~~avv~l~-~~aa~~~~ 274 (274) +++.+.++++.+|++++++|+++|++| ++|+|.=| T Consensus 235 ~~~~d~i~~~~~y~v~~~~~skvv~~t~~~a~~~~~ 270 (270) T protein:vir:95 235 LKRTHLLSTNYHYSVNLKDETGVVKVTFKPSGSLEM 270 (270) T ss_pred hhcccEEEeeeEEEEEEEccceEEEEEecCCCCcCC Confidence 999999999999999999999999999 56777777 No 15 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=100.00 E-value=2.1e-44 Score=260.09 Aligned_cols=227 Identities=44% Similarity=0.681 Sum_probs=209.8 Q ss_pred cccccCCCEEEEEeecCCCCcccccCCCcccccccccceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 40 TLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANK 119 (274) Q Consensus 40 ~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~ 119 (274) +-.-..|++|+||+| +|+++.++||++++.+++++++.++++++.++.|+++|+....+..|+.....++++.+++++ T Consensus 1 ~~~~~~Gdtit~P~~--iGda~~v~eG~~i~~~~l~~t~~~atIk~~gk~~~itD~a~l~~~gDp~~ea~~Q~~~~iA~k 78 (231) T protein:vir:73 1 ENGINLANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANK 78 (231) T ss_pred CccccCCceEEeccc--ccchhhhcCCCcCChhhccccceeeeEeeeccceeeeHHHHhhccCchHHHHHHHHHHHHHHh Confidence 222356999999988 
789999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhccccccccCcccCHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchh Q lcl|Aclame:pro 120 VDNDVLEALKGATLTVEADITKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEA 199 (274) Q Consensus 120 ~d~~~i~~~~~a~~~~~~~~~~~d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i 199 (274) +|+.+++.+.+++.+... .+++|.|++|..+|++++..+.+++|||+.++.||++.+... ..++.+++.+++|.+|++ T Consensus 79 vD~di~~~~~~a~l~~~~-~~t~d~i~~A~~~fgde~~~~~vivv~p~~~~~Lrk~~~~~~-~~~~~g~~i~~~G~iG~i 156 (231) T protein:vir:73 79 VDDDLLKAAKTTSQTVST-KANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKN-IGSEVGANALINGTYADV 156 (231) T ss_pred hhHHHHHhhccccccccc-cccHHHHHHHHHHhccccccceEEEEcchHHHhhhhccchhh-hhhhhccceeeecccceE Confidence 999999999998887665 579999999999999999999999999999999999875433 356778899999999999 Q ss_pred cceeeEEcCCCCcceE----EEEcCCeEEEEeccCceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeCCC Q lcl|Aclame:pro 200 LGAVIVRSNKLNKGEA----LLAKKGAVKLITKRDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAG 270 (274) Q Consensus 200 ~G~~Vv~s~~~p~~~~----~l~~~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa 270 (274) +|++|++|+++|.++. 
|++.++|+++...+++.+|++|+++++.+.++++.+|++++++|+++|++|.+.- T Consensus 157 ~G~~Vi~S~~~~~~~~~~~~~i~~~gAl~~~~k~~~~vEtdRd~~~k~~~i~~~~~y~v~l~~~~~vv~~t~~g~ 231 (231) T protein:vir:73 157 LGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) T ss_pred cceEEEEcCCCCCCceeeeeEEeeccceeeeecccceeeccccccccccEEEEeEEEEEEEEcCccEEEEEeecC Confidence 9999999999999887 4567999999999999999999999999999999999999999999999999987 No 16 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=100.00 E-value=1.1e-41 Score=245.22 Aligned_cols=265 Identities=17% Similarity=0.184 Sum_probs=218.7 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhh--cccccccccccc---cCCCEEEEEeecCC-CCcccccCCC-cccccc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRF--AQFADIDSTLVG---QPGDTLTFPAFTYS-GDAQVIAEGE-KIPVDQ 73 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~--~~l~~~~~~~~~---~~G~~v~ip~~~~~-~~a~~~~eg~-~~~~~~ 73 (274) ||+++|+.+++|+||+|.+|+.+++.+++.+ ++++..+.++.+ .+|+++++|+|+.+ ++++.+.||+ .++.++ T Consensus 1 Ma~~~T~l~d~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~~~~l~G~~~~~~dg~~~i~~~k 80 (330) T protein:vir:10 1 MANELTKILDTITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGLLVNMPFWNDLTGDSEVLGNGDKALETGK 80 (330) T ss_pred CCCCceEeeeeechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCccccchhh Confidence 9999999999999999999999999776544 555555444332 38999999999976 7899999996 799999 Q ss_pred cccceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc------------------cc Q lcl|Aclame:pro 74 IGTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL------------------TV 135 (274) Q Consensus 74 ~~~~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~------------------~~ 135 (274) ++.++....++++++.|.++|+....+..|++..+.++++.+|+++.++.+++.+.+... +. 
T Consensus 81 i~t~~~~a~i~~~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~l~gvf~~~~~~~~~~~~~~~~~~~~~ 160 (330) T protein:vir:10 81 ITAGADIACVLYRGRGWAANELTGVVAGSDPVRAILNRIGAYWLREDQKALIATLNGIFATGTAGEKGALEETHVSDQSK 160 (330) T ss_pred cccceeEEEEEeecceeeehhhhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHHHHhhhhhhhcccchhhhhhheecccc Confidence 999999999999999999999999999999999999999999999999999987763211 12 Q ss_pred cCcccCHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCC---- Q lcl|Aclame:pro 136 EADITKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLN---- 211 (274) Q Consensus 136 ~~~~~~~d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p---- 211 (274) ..+.++++.+++|..+++++.....+++|||.++..|++++..++...+.. ++.+++++|++|++|+.+| T Consensus 161 ~~a~~s~~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~li~~~~~s~~------~~~i~~~~G~~VivdD~~p~~~~ 234 (330) T protein:vir:10 161 ASTGIDAGMVLDAKQLLGDSADQVTAIAMHSAVYTKLQKDNLIQYIQPTTA------TINIPTYLGYRVIIDDGIAPTGD 234 (330) T ss_pred cccccCHHHHHHHHHHhccccccceEEEEcHHHHHHHHHhhhhhhhccccc------CcccccccceEEEEeCCCCCCCC Confidence 334567899999999999999999999999999999999887777665443 3568999999999999998 Q ss_pred cceEEEEcCCeEEEEecc---CceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeC------CCccc--C Q lcl|Aclame:pro 212 KGEALLAKKGAVKLITKR---DFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKG------AGDEV--M 274 (274) Q Consensus 212 ~~~~~l~~~~a~~~~~~~---~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~------aa~~~--~ 274 (274) ++++|++.++|+++..+. .+.+|++|++..+++.+..|.+|..+ |.++..-... +||-- . 
T Consensus 235 ~yt~yl~~~GAi~~~~~~~~~~v~~EtdRd~~~g~~~l~~r~~~~~h---p~G~s~~~~~~~~~~~sPt~~~L~ 305 (330) T protein:vir:10 235 IYTSYLFRTGSIGLNTGNPSGLTTFETSREAAKGNDMIYTRRALVMH---PYGVKWTGAEVDAGNITPSNADLA 305 (330) T ss_pred ceeEEEEecCceeeecccCCccccccccCCccccceEEEEeeEEEee---eeeeeecccccccCcCCcChHHhc Confidence 577899999999998654 47899999999999999999997655 4444432211 11111 0 No 17 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=100.00 E-value=2.6e-41 Score=243.07 Aligned_cols=261 Identities=19% Similarity=0.204 Sum_probs=219.1 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274) ||++ .|+||+|+.++++++++++++.++++++.+..+..|++|+||+++..+......+|..++.++++.++++ T Consensus 1 MA~~------~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 74 (273) T protein:vir:79 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) T ss_pred Ccch------hhhHHHHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecCcccccccccCCCccCccccccceEE Confidence 8874 3889999999999999999999999888887788899999999987654455678999999999999999 Q ss_pred Eeehh-hhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc-----CcccCHHHHHHHHHHHhh Q lcl|Aclame:pro 81 AKVRK-IGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVE-----ADITKLDGLQTAIDKFND 154 (274) Q Consensus 81 ~~~~~-~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~-----~~~~~~d~iv~a~~~l~~ 154 (274) +++++ .+..+.|+|++..++..++.+ +.+++++++++++|+.+++.+.++..... 
.....++.|++|...|++ T Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~vD~~i~~~~~~a~~~~~~~~~~~~~~~~~~i~~a~~~ld~ 153 (273) T protein:vir:79 75 LLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTK 153 (273) T ss_pred EEEeeecccceeeccHHHHhhcccHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccccchhhHHHHHHHHHHHhhh Confidence 99977 467799999988888889865 77889999999999999988876543221 122357889999999998 Q ss_pred cCC--CccEEEEcHHHHHHHHhhhcccccccccc-ccccccccccchhcceeeEEcCCCCcce---EEEEcCCeEEEEec Q lcl|Aclame:pro 155 EDL--EPMVLFVNPLDAGGLRTSASDNFTRPTQL-GDNIIVKGAFGEALGAVIVRSNKLNKGE---ALLAKKGAVKLITK 228 (274) Q Consensus 155 ~~~--~~~~~v~~p~~~~~L~~~~~~~~~~~~~~-~~~~~~~g~~~~i~G~~Vv~s~~~p~~~---~~l~~~~a~~~~~~ 228 (274) +++ .+|+++++|+.++.|+++.. .+...... .+..+++|.+|+++|++|+.|+++|.++ ++.++++++++..+ T Consensus 154 ~~vP~~~R~lvv~p~~~~~Ll~~~~-~~~~~~~~~~~~~l~~G~ig~~~G~~i~~s~~lp~~~~~~~~a~~~~A~~~a~~ 232 (273) T protein:vir:79 154 ANVPNVGRVVVVNAEMAFWLRSSGS-KLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQ 232 (273) T ss_pred ccCCccCcEEEECHHHHHHHhhchh-hhhhhhhcccccceeeeEeeEEeceEEEecccccccCceEEEEEeccceeeeee Confidence 875 78999999999999998642 12222222 3356889999999999999999999654 56789999998776 Q ss_pred cCceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeCCC Q lcl|Aclame:pro 229 RDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAG 270 (274) Q Consensus 229 ~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa 270 (274) . ..+|.+|++.++.+.++++++||+++++|+++|.|++++. 
T Consensus 233 ~-~~~e~~r~~~~~~~~v~~~~~yg~~v~~p~~vv~~~~~g~ 273 (273) T protein:vir:79 233 I-DTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred h-hhhhcccCcccceeeeeeeeeeeeEEecCceEEEEeccCC Confidence 5 4889999999999999999999999999999999998888 No 18 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=100.00 E-value=1.2e-40 Score=239.50 Aligned_cols=262 Identities=20% Similarity=0.198 Sum_probs=215.4 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhh--cccccccc----ccc-ccCCCEEEEEeecCC-CCcccccCCCccccc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRF--AQFADIDS----TLV-GQPGDTLTFPAFTYS-GDAQVIAEGEKIPVD 72 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~--~~l~~~~~----~~~-~~~G~~v~ip~~~~~-~~a~~~~eg~~~~~~ 72 (274) || +|..+++|+||+|.+|+.+++.+++.+ ++++..+. .+. ..+|+++++|+|+++ ++++.+.++.+++.+ T Consensus 1 MA--~T~lsd~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~~v~~~~~i~~~ 78 (324) T protein:vir:59 1 MA--YTKISDVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQVLNDTDDLVPQ 78 (324) T ss_pred CC--ceeeeceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCCCcccccCCCcccchh Confidence 99 788999999999999999999887655 44444333 332 357999999999987 899999999999999 Q ss_pred ccccceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc---------c---ccCccc Q lcl|Aclame:pro 73 QIGTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL---------T---VEADIT 140 (274) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~---------~---~~~~~~ 140 (274) +++.++....++++++.|.++|+....+..|++..+.++++.+|+++.++.+|+.+++... . 
.....+ T Consensus 79 ~l~t~~~~a~i~~~~k~~~~tD~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l~g~~~~~~~~~~~~dvsa~~~~~~ 158 (324) T protein:vir:59 79 KINAGQDKAVLILRGNAWSSHDLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAELAGVFSNDDMKDNKLDISGTADGIY 158 (324) T ss_pred hcccceeeEEEEeecCceeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccceeeeecccccee Confidence 9999999999999999999999999999999999999999999999999999988865211 1 122336 Q ss_pred CHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCC--------- Q lcl|Aclame:pro 141 KLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLN--------- 211 (274) Q Consensus 141 ~~d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p--------- 211 (274) +++.+++|.++|+++.....+++|||+++..|++++..++...++. ++.+++++|++|++|+.+| T Consensus 159 s~~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~li~~~~~s~~------~~~i~~~~G~~VivdD~~p~~~~~~~~~ 232 (324) T protein:vir:59 159 SAETFVDASYKLGDHESLLTAIGMHSATMASAVKQDLIEFVKDSQS------GIRFPTYMNKRVIVDDSMPVETLEDGTK 232 (324) T ss_pred cHHHHHHHHHHhCCcccCcEEEEEchHHHHHHHHhhhhhhcccccc------CceeeeecccEEEEeCCCCccccCCCCc Confidence 8899999999999999999999999999999999987777665443 3468899999999999998 Q ss_pred cceEEEEcCCeEEEEec-cCceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 212 KGEALLAKKGAVKLITK-RDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 212 ~~~~~l~~~~a~~~~~~-~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) ++++|++.++|+++... .++.+|++|++..+.+.++.+.+|..++. ++.. 
+.++-..+- T Consensus 233 ~y~s~l~~~GAi~~~~~~~~v~vE~dRd~~~g~~~l~~r~~~~~~p~---G~s~-~~~~~~~~s 292 (324) T protein:vir:59 233 VFTSYLFGAGALGYAEGQPEVPTETARNALGSQDILINRKHFVLHPR---GVKF-TENAMAGTT 292 (324) T ss_pred eEEEEEEecCeEEEeecCCCcceecccCccccceEEEEeeEEEeEee---eEEe-cccccCCCC Confidence 35789999999999874 45789999999999999999999765543 3322 222111111 No 19 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=100.00 E-value=1.4e-40 Score=239.12 Aligned_cols=261 Identities=20% Similarity=0.212 Sum_probs=217.4 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274) ||. +.|+||+|+..+++++++.+++.++++++.+..+..|++++||+++..+......+|..++.++++.++++ T Consensus 1 MA~------~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 74 (273) T protein:vir:10 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) T ss_pred Ccc------hhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccCccccccceEE Confidence 776 44889999999999999999999999888877778899999999987653444567888889999999999 Q ss_pred Eeehh-hhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc-----CcccCHHHHHHHHHHHhh Q lcl|Aclame:pro 81 AKVRK-IGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVE-----ADITKLDGLQTAIDKFND 154 (274) Q Consensus 81 ~~~~~-~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~-----~~~~~~d~iv~a~~~l~~ 154 (274) +++++ .+..+.|+|++..++..++.+ +.+++++++++++|+.+++.+.++..... 
.....++.|++|...|++ T Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~~~~~~~~~~~~~~~~~i~~a~~~ld~ 153 (273) T protein:vir:10 75 LLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTALTGSAPTDADDAFDLIAKALKELTK 153 (273) T ss_pred EEEeeeeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhccccccccccccchhHHHHHHHHHHHHhhh Confidence 99976 477799999988888888865 78889999999999999988877644322 112347899999999998 Q ss_pred cCC--CccEEEEcHHHHHHHHhhhcccccccccc-ccccccccccchhcceeeEEcCCCCcc---eEEEEcCCeEEEEec Q lcl|Aclame:pro 155 EDL--EPMVLFVNPLDAGGLRTSASDNFTRPTQL-GDNIIVKGAFGEALGAVIVRSNKLNKG---EALLAKKGAVKLITK 228 (274) Q Consensus 155 ~~~--~~~~~v~~p~~~~~L~~~~~~~~~~~~~~-~~~~~~~g~~~~i~G~~Vv~s~~~p~~---~~~l~~~~a~~~~~~ 228 (274) +++ .+|+++++|+.++.|+++... +...... ++..+++|.+|+++|++|+.|+++|.+ +++.++++++++..+ T Consensus 154 ~~vP~~~R~lvv~p~~~~~L~~~~~~-~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~~~~~~~A~~~a~q 232 (273) T protein:vir:10 154 ANVPNVGRVVVVNAEMAFWLRSSGSK-LTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQ 232 (273) T ss_pred cCCCcCCCEEEECHHHHHHHhcchhh-hhhhhccccccceeeeeeeEEeceEEEEecccccCCccEEEEEeccceeeeee Confidence 875 789999999999999986421 2222222 335688999999999999999999964 467889999999876 Q ss_pred cCceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeCCC Q lcl|Aclame:pro 229 RDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAG 270 (274) Q Consensus 229 ~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa 270 (274) .. .+|..|++.++.+.++++.+||++|++|++++.|++++. 
T Consensus 233 ~~-~~e~~r~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 233 ID-TVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred ee-hhhcccCCCcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 44 889999999999999999999999999999999998888 No 20 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=100.00 E-value=1.4e-40 Score=239.12 Aligned_cols=261 Identities=20% Similarity=0.212 Sum_probs=217.4 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274) ||. +.|+||+|+..+++++++.+++.++++++.+..+..|++++||+++..+......+|..++.++++.++++ T Consensus 1 MA~------~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 74 (273) T protein:vir:10 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) T ss_pred Ccc------hhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccCccccccceEE Confidence 776 44889999999999999999999999888877778899999999987653444567888889999999999 Q ss_pred Eeehh-hhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc-----CcccCHHHHHHHHHHHhh Q lcl|Aclame:pro 81 AKVRK-IGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVE-----ADITKLDGLQTAIDKFND 154 (274) Q Consensus 81 ~~~~~-~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~-----~~~~~~d~iv~a~~~l~~ 154 (274) +++++ .+..+.|+|++..++..++.+ +.+++++++++++|+.+++.+.++..... 
.....++.|++|...|++ T Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~~~~~~~~~~~~~~~~~i~~a~~~ld~ 153 (273) T protein:vir:10 75 LLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTALTGSAPTDADDAFDLIAKALKELTK 153 (273) T ss_pred EEEeeeeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhccccccccccccchhHHHHHHHHHHHHhhh Confidence 99976 477799999988888888865 78889999999999999988877644322 112347899999999998 Q ss_pred cCC--CccEEEEcHHHHHHHHhhhcccccccccc-ccccccccccchhcceeeEEcCCCCcc---eEEEEcCCeEEEEec Q lcl|Aclame:pro 155 EDL--EPMVLFVNPLDAGGLRTSASDNFTRPTQL-GDNIIVKGAFGEALGAVIVRSNKLNKG---EALLAKKGAVKLITK 228 (274) Q Consensus 155 ~~~--~~~~~v~~p~~~~~L~~~~~~~~~~~~~~-~~~~~~~g~~~~i~G~~Vv~s~~~p~~---~~~l~~~~a~~~~~~ 228 (274) +++ .+|+++++|+.++.|+++... +...... ++..+++|.+|+++|++|+.|+++|.+ +++.++++++++..+ T Consensus 154 ~~vP~~~R~lvv~p~~~~~L~~~~~~-~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~~~~~~~A~~~a~q 232 (273) T protein:vir:10 154 ANVPNVGRVVVVNAEMAFWLRSSGSK-LTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQ 232 (273) T ss_pred cCCCcCCCEEEECHHHHHHHhcchhh-hhhhhccccccceeeeeeeEEeceEEEEecccccCCccEEEEEeccceeeeee Confidence 875 789999999999999986421 2222222 335688999999999999999999964 467889999999876 Q ss_pred cCceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeCCC Q lcl|Aclame:pro 229 RDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAG 270 (274) Q Consensus 229 ~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa 270 (274) .. .+|..|++.++.+.++++.+||++|++|++++.|++++. 
T Consensus 233 ~~-~~e~~r~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 233 ID-TVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred ee-hhhcccCCCcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 44 889999999999999999999999999999999998888 No 21 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=100.00 E-value=1.3e-39 Score=233.83 Aligned_cols=269 Identities=14% Similarity=0.085 Sum_probs=210.7 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274) ||+.++..+..++|+.+++.|++.+++.+++++++++. ..++..++||++...+.+.|++||+.+|+++++|++++ T Consensus 1 Ma~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i----~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~f~~v~ 76 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQ----PTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFT 76 (315) T ss_pred CCCCcCCcCceEcchHHHHHHHHHHHhhchhhhhccee----ecCCCceEEEEEeCCcceEEeeCCccccccccceeeeE Confidence 99999999999999999999999999999999988653 23345699999988889999999999999999999999 Q ss_pred EeehhhhcchhccHHHHhccCcc----HHHHHHHHHHHHHHHHHHHHHHHHhcccc---------------ccccCcccC Q lcl|Aclame:pro 81 AKVRKIGKGTELTDEAVLSGFGD----PQGEAVRQHGLAIANKVDNDVLEALKGAT---------------LTVEADITK 141 (274) Q Consensus 81 ~~~~~~~~~~~is~e~~~~s~~d----~~~~~~~~~a~~~a~~~d~~~i~~~~~a~---------------~~~~~~~~~ 141 (274) +.++|++..+++|+|+++++..+ +++++.+++++++++++|+.++....... ......... 
T Consensus 77 l~~~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (315) T protein:vir:80 77 AQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSLNKTKNIVDATDSA 156 (315) T ss_pred eeeeeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCccccccccccccccceeeccccc Confidence 99999999999999999988776 67888999999999999999985432111 111223345 Q ss_pred HHHHHHHHHHHhhcC-CCccEEEEcHHHHHHHHhhhccccc--cccccccccccccccchhcceeeEEcCCCCcce---- Q lcl|Aclame:pro 142 LDGLQTAIDKFNDED-LEPMVLFVNPLDAGGLRTSASDNFT--RPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGE---- 214 (274) Q Consensus 142 ~d~iv~a~~~l~~~~-~~~~~~v~~p~~~~~L~~~~~~~~~--~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~---- 214 (274) +++++++...+..+. ...+.|+|||.++..|++....+.. ...... +.+..|..++|+|+||++++++|.+. T Consensus 157 ~~d~~~~~~~~~~~~~~~~~~~imn~~~~~~L~~l~~~~g~~~~g~~~~-~~~~~g~~~tl~G~PV~~~~~~~~~~~~~~ 235 (315) T protein:vir:80 157 TADLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMY-PAAGFAGLDNWRGLNVGASSTVSGAPEMSP 235 (315) T ss_pred hHHHHHHHHHHhhccCccceEEEEcHHHHHHHHHHhhccCCcccccccc-cccccCCCceecceeeEecCcCCccccccc Confidence 789999998886554 4567899999999999987543221 111111 23455667899999999999998642 Q ss_pred -----EEEEcCCeEEEEeccCceeeecccc----------ccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 215 -----ALLAKKGAVKLITKRDFFLEKDRDA----------SRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 215 -----~~l~~~~a~~~~~~~~~~ve~~r~~----------~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) .++.+.+.+.+...+++.++..++. 
.+++..+++..|+|++|.+|+++++|+.++|.-.- T Consensus 236 ~~~~~~~~GDfs~~~~g~~~~~~i~i~~~~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~a~~~~ 310 (315) T protein:vir:80 236 ASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPN 310 (315) T ss_pred ccccEEEEeecccEEEEEecCeeEEEeccccccCcccchhhcCcEEEEEEEEecceeecccceEEEeeccCCCCC Confidence 3344455566666777777665543 24567899999999999999999999988875555 No 22 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=100.00 E-value=2.8e-39 Score=231.96 Aligned_cols=262 Identities=17% Similarity=0.131 Sum_probs=215.6 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274) |+..++..+..++|+.+++.|++.+++.+++.++++. ...++...++|.... 
+.+.|++||+++|+++++|++++ T Consensus 6 ~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~----~~~~~~~~~~~~~~~-~~a~~v~E~~~~~~~~~~f~~v~ 80 (299) T protein:vir:41 6 DTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKA----VPMTKPEEEFTFMSG-VGAFWVDEAERIQTSKPTFTKAK 80 (299) T ss_pred CcccccCCCceecchhHHHHHHHHHHhcchhhhhcee----eecCCCcEEEEEEcC-CceeeeecCccccccccceeEEE Confidence 7777777778899999999999999999999888864 224456788998864 67999999999999999999999 Q ss_pred EeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcc------------ccccccCcccCHHHHHHH Q lcl|Aclame:pro 81 AKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKG------------ATLTVEADITKLDGLQTA 148 (274) Q Consensus 81 ~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~------------a~~~~~~~~~~~d~iv~a 148 (274) +.+++++..+++|+|+++++..++++++.+++++++++++|+.++....+ +......+..++++++++ T Consensus 81 l~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~~~gil~~~~~~~~~~~~~~~~~~~l~~~ 160 (299) T protein:vir:41 81 MRSKKMGVIIPTTKENLNYSVTNFFSLMQAEIVEAFYKKFDQAVFTGVESPYNWNILKSATDASNLVEETANKYDDLNEA 160 (299) T ss_pred EeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccccccccccceeeccccccHHHHHHH Confidence 99999999999999999999999999999999999999999999853321 122334456789999999 Q ss_pred HHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcce----EEEEcCCeEE Q lcl|Aclame:pro 149 IDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGE----ALLAKKGAVK 224 (274) Q Consensus 149 ~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~----~~l~~~~a~~ 224 (274) +..+..++..+..|+|||.++..|++..+.+. .....+.... ..++++|+||++++++|.++ .++.+.+.+. 
T Consensus 161 ~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G---~~l~~~~~~~-~~~~l~G~PV~~~~~~~~~~~~~~~~~gdfs~~~ 236 (299) T protein:vir:41 161 IGLIEAEDLEPNGIATIRKQRVKYRSTKDGNG---MPIFNTATSN-GVDDVLGLPIAYTPKYTFGDKDISELVGDWNQAY 236 (299) T ss_pred HHhhhcccCCcCEEEEcHHHHHHHHHhhccCC---ceeecCCcCC-CCceecceeeEEecccCCCCCceEEEEEecccEE Confidence 99999999999999999999999997654321 1222222333 34689999999999999876 4555666666 Q ss_pred EEeccCceeeeccccc----------------cCccEEEEEEEEEEEEEcCcceEEEEeCCCc Q lcl|Aclame:pro 225 LITKRDFFLEKDRDAS----------------RKSTALYSDKHYVAYLYDESKVVKITKGAGD 271 (274) Q Consensus 225 ~~~~~~~~ve~~r~~~----------------~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~ 271 (274) +..+.++.++..++.. ++...+++..|+|+++.+|+|+++|+.++|+ T Consensus 237 i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~~l~~~aa~ 299 (299) T protein:vir:41 237 YGILRGVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFEVGFMVVKDEAFSAVQPKAGN 299 (299) T ss_pred EEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccCC Confidence 7788888888777643 3456789999999999999999999999999 No 23 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=100.00 E-value=1.1e-39 Score=234.11 Aligned_cols=262 Identities=18% Similarity=0.168 Sum_probs=211.3 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhh--ccccccccccc---ccCCCEEEEEeecCC-CCcccccCCCccccccc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRF--AQFADIDSTLV---GQPGDTLTFPAFTYS-GDAQVIAEGEKIPVDQI 74 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~--~~l~~~~~~~~---~~~G~~v~ip~~~~~-~~a~~~~eg~~~~~~~~ 74 (274) || +|+.+++|+||+|.+|+.+++.+++.+ ++++..+.++. 
..+|++++||+|+.+ |+++.+.|+.+++.+++ T Consensus 1 MA--~T~lsd~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~Gd~~~~~~~~~i~~~ki 78 (351) T protein:vir:15 1 MA--ETHLSDLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLTGDPDNWTDSDDIDVNNL 78 (351) T ss_pred CC--ceeeeeeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCcccchhee Confidence 99 688899999999999999988766555 55555444332 248999999999986 89999999999999999 Q ss_pred ccceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc---------------ccccCcc Q lcl|Aclame:pro 75 GTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT---------------LTVEADI 139 (274) Q Consensus 75 ~~~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~---------------~~~~~~~ 139 (274) +.++....++++++.|.++|+....+..|++..+.++++.+|+++.++.+|+.+++.. .+..... T Consensus 79 tt~~~~a~i~~~~kg~~~tD~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~l~gv~~~~~~~~~~~~d~t~~~~~~~~ 158 (351) T protein:vir:15 79 TSGKQQGIKFYQTKAYGYTDLGTMISGAPVQETIGNRFAAFWQRADQKTLLSVLKGVMGVTKIANSKVYDQTKVSPSEPM 158 (351) T ss_pred cccceeEEEEeeccceehhhhhHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhchhhcccceeccccccccccc Confidence 9999999999999999999999999999999999999999999999999998876421 1123445 Q ss_pred cCHHHHHHHHHHHhhcCC-CccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCc------ Q lcl|Aclame:pro 140 TKLDGLQTAIDKFNDEDL-EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNK------ 212 (274) Q Consensus 140 ~~~d~iv~a~~~l~~~~~-~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~------ 212 (274) ++++.+++|.+++++... ....|+|||.++..|++++..+|...+.. ++.+++++|++|++|+.+|. 
T Consensus 159 is~~~l~~A~~~~GD~~~~~~~~ivmhS~v~~~L~~~~li~~~~~s~~------~~~i~t~~G~~VivdD~~p~~~~~~~ 232 (351) T protein:vir:15 159 FGAKGFTGAIGLMGDLQDTAFGAIAVNSATYSLMKVQGLIETIQPQNG------ATPFEAYNGLRIVLDDDIEIDLTDKT 232 (351) T ss_pred cCHHHHHHHHHHhccccccceEEEEEChHHHHHHHhhhhhhhcccccc------CcccceecceEEEEcCCCccccCCCC Confidence 788999999999999754 57999999999999999988888766442 34689999999999999983 Q ss_pred ---ceEEEEcCCeEEEEeccCceeeecccccc--CccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 213 ---GEALLAKKGAVKLITKRDFFLEKDRDASR--KSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 213 ---~~~~l~~~~a~~~~~~~~~~ve~~r~~~~--~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) +++|++.++|++|..+.+ .+|++|++.. +.+.++.|.+|. ++|.++.--....++... T Consensus 233 ~~~ytsyl~~~GAi~~~~~~~-~ve~~rd~~~~~g~d~l~~r~~~~---~hp~G~s~~~~~~~~~~~ 295 (351) T protein:vir:15 233 KPVSTSYIFAPGAVRYSTNMR-STETKYDPLINGGQDVIVQKRVGT---IHVAGTSIKASFSPSKAS 295 (351) T ss_pred CceeEEEEEecceeeeecCCc-CcceeecccCCCCceEEEEeeeee---eeeeeeeecccccccCcC Confidence 578999999999877654 6788887765 678888888855 555554432111111111 No 24 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=100.00 E-value=4.5e-39 Score=230.81 Aligned_cols=262 Identities=13% Similarity=0.121 Sum_probs=206.5 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274) ||+.++..+.+ +|+.++..+++.+++.+.++++++.. 
..++..+++|++...+.+.|++||+++|+++++|++++ T Consensus 1 ma~~t~~~G~l-ip~~~~~~ii~~l~~~s~i~~l~~~~----~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~s~~~f~~v~ 75 (300) T protein:vir:95 1 MSEAQLSKGNL-FNPELVTKVINKVKGHSSIAKLSPQK----PIPFNGQREFVFDFDSDIDIVAENGKKTHGGVSLDPVT 75 (300) T ss_pred CcccccCCcce-echhhHHHHHHHHHhhhhhhhhccee----eccCCceEEEEEecCcceEEeeCCcccccccccceeeE Confidence 99998888775 67779999999999999988887642 23344689999988889999999999999999999999 Q ss_pred EeehhhhcchhccHHHHh---ccCccHHHHHHHHHHHHHHHHHHHHHHHHhcc---ccc--------------c-ccCcc Q lcl|Aclame:pro 81 AKVRKIGKGTELTDEAVL---SGFGDPQGEAVRQHGLAIANKVDNDVLEALKG---ATL--------------T-VEADI 139 (274) Q Consensus 81 ~~~~~~~~~~~is~e~~~---~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~---a~~--------------~-~~~~~ 139 (274) ++++|++..+++|+|+++ ++.+++++++.+++++++++++|+.++..... ... . ...+. T Consensus 76 l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 155 (300) T protein:vir:95 76 IVPLKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKKVTQTVPFKDT 155 (300) T ss_pred eeeEEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccccceeeccccc Confidence 999999999999999985 45688999999999999999999999965321 110 0 12344 Q ss_pred cCHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcce----- Q lcl|Aclame:pro 140 TKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGE----- 214 (274) Q Consensus 140 ~~~d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~----- 214 (274) ..++++.++...+...+..+..|+|||.++..|++.++.+... ...+....|..++++|+||++++.+|.+. 
T Consensus 156 ~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~---i~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~ 232 (300) T protein:vir:95 156 NPDESMEDAVGMIDGSERDITGAILDPIFTTALSKMKNAEGGK---LYPELAWGGVPDAINGLAVDKNRTVSYSQTDPKN 232 (300) T ss_pred chHHHHHHHHHHhhhcCCCccEEEECHHHHHHHHHhhccCCCe---eccCccccCCCceecceeeEEecCCCCCCCCCcc Confidence 5689999999999988889999999999999998776433211 11233445677899999999999998643 Q ss_pred -EEEEcCC-eEEEEeccCceeeeccc----------cccCccEEEEEEEEEEEEEcCcceEEEEeCCC Q lcl|Aclame:pro 215 -ALLAKKG-AVKLITKRDFFLEKDRD----------ASRKSTALYSDKHYVAYLYDESKVVKITKGAG 270 (274) Q Consensus 215 -~~l~~~~-a~~~~~~~~~~ve~~r~----------~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa 270 (274) .++.+.+ .+.+..+.++.++...+ -.+++..+|+..|+|+++.+|+++++|+++++ T Consensus 233 ~~~~GDf~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~g 300 (300) T protein:vir:95 233 TAIVGDFETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYIGWGIMDAASFARIVKTGG 300 (300) T ss_pred EEEEeeccceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEeecceeecccceEEEecCCC Confidence 3433433 34455666666655433 23456788999999999999999999999999 No 25 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=100.00 E-value=3.4e-38 Score=226.01 Aligned_cols=263 Identities=14% Similarity=0.090 Sum_probs=214.2 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274) +...++..+..++|+.+++.+++.+++.+.+.++++.. 
..++..++||++...+.++|++||+.+|+.+++|++++ T Consensus 27 ~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~----~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~i~ 102 (324) T protein:vir:93 27 DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYE----PMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNAT 102 (324) T ss_pred ccccccCCCcceechhHHHHHHHHHHhhchhhhhccee----eccCCceEEEEEecCcceeeecCCccccccccceeEEE Confidence 22233344557899999999999999999998888653 23455699999988889999999999999999999999 Q ss_pred EeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccc------------cccccCcccCHHHHHHH Q lcl|Aclame:pro 81 AKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGA------------TLTVEADITKLDGLQTA 148 (274) Q Consensus 81 ~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a------------~~~~~~~~~~~d~iv~a 148 (274) ++++|++..+++|+|++.++.+++++++.+++++++++++|+.+|....+. ......+..++++++++ T Consensus 103 ~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~ 182 (324) T protein:vir:93 103 MRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDL 182 (324) T ss_pred EEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCccccccccccceeccccccHHHHHHH Confidence 999999999999999999999999999999999999999999998543221 11233456789999999 Q ss_pred HHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCC--CCcceEEEEcCCeEEEE Q lcl|Aclame:pro 149 IDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNK--LNKGEALLAKKGAVKLI 226 (274) Q Consensus 149 ~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~--~p~~~~~l~~~~a~~~~ 226 (274) ...+..++..+..|+|||..|..|++..+.+ |...+..+..++++|+||+.++. .+++..++.+.+.+.+. 
T Consensus 183 ~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~-------G~~~~~~~~~~~l~G~PVv~~~~~~~~~~~i~~gdfs~~~~~ 255 (324) T protein:vir:93 183 EALLEDDELEANAFISKTQNRSLLRKIVDPE-------TKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYG 255 (324) T ss_pred HHhhhhccCCCCEEEEcHHHHHHHHHhhCCC-------CCeeecCCCCCcccceeeEeecCCCCCcceEEEEecceEEEE Confidence 9999999988999999999999998764322 23334556678999999999776 45667777777777777 Q ss_pred eccCceeeecccc----------------ccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 227 TKRDFFLEKDRDA----------------SRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 227 ~~~~~~ve~~r~~----------------~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) ...++.++.+++. .+++..+++..|+|+++.+|+++++|+.+.+.+=. T Consensus 256 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~v~~~~a~~~l~~a~~~~~~ 319 (324) T protein:vir:93 256 IPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDS 319 (324) T ss_pred EecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEecccccCCC Confidence 8888888877764 34668899999999999999999999976665522 No 26 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=100.00 E-value=4.9e-38 Score=225.13 Aligned_cols=263 Identities=14% Similarity=0.088 Sum_probs=215.4 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274) +....+..+..++|+.+++.|++.+++.+.+.++++.. 
..++.+++||++...+.+.|++||+.+|+++++|++++ T Consensus 27 ~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~----~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~v~ 102 (324) T protein:vir:97 27 DNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYE----PMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNAT 102 (324) T ss_pred ccccccCCCcceechhHHHHHHHHHHhhcchhhhccee----eccCCceEEEEEecCcceeEeccCccccccccceeEEE Confidence 32334455678999999999999999999998888653 23456799999988889999999999999999999999 Q ss_pred EeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccc------------cccccCcccCHHHHHHH Q lcl|Aclame:pro 81 AKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGA------------TLTVEADITKLDGLQTA 148 (274) Q Consensus 81 ~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a------------~~~~~~~~~~~d~iv~a 148 (274) ++++|++..+++|+|+++++.+++.+++.+++++++++++|+.++....+. ......+..++++|+++ T Consensus 103 ~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~~~~~~~~~~i~~~ 182 (324) T protein:vir:97 103 MRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDL 182 (324) T ss_pred EeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccCccccccccccceeccccCCHHHHHHH Confidence 999999999999999999999999999999999999999999998543321 12233456789999999 Q ss_pred HHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCC--cceEEEEcCCeEEEE Q lcl|Aclame:pro 149 IDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLN--KGEALLAKKGAVKLI 226 (274) Q Consensus 149 ~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p--~~~~~l~~~~a~~~~ 226 (274) ...+..++..+..|+|||.++..|++..+.+ +...+..+..++++|+||+.++..+ +++.++.+.+.+.+. 
T Consensus 183 ~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~-------g~~~~~~~~~~tl~G~PV~~~~~~~~~~~~~~~gd~~~~~i~ 255 (324) T protein:vir:97 183 EALLEDDELEANAFISKTQNRSLLRKIVDPE-------TKERIYDRNSDTLDGLPVVNLKSSNLKRGELITGDFDKLIYG 255 (324) T ss_pred HHhhhhccCCCCEEEEcHHHHHHHHHhhcCC-------CceeecCCCCccccceeeEeecCCCCCcceEEEEecccEEEE Confidence 9999999999999999999999998765432 2233345566789999999988755 566677777777777 Q ss_pred eccCceeeecccc----------------ccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 227 TKRDFFLEKDRDA----------------SRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 227 ~~~~~~ve~~r~~----------------~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) .+.++.++.+++. .++...+++..|+|+++.+|+++++|+.+.|-.-- T Consensus 256 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~ 319 (324) T protein:vir:97 256 IPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDS 319 (324) T ss_pred EecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccCCCCC Confidence 8888888877663 34668899999999999999999999998874433 No 27 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=100.00 E-value=2.5e-38 Score=226.73 Aligned_cols=264 Identities=10% Similarity=0.029 Sum_probs=216.0 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccc-ccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQ-IGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~-~~~~~~ 79 (274) |...++..+..++|+.+++.|++.+++.+.+.++++... 
..+...++|+....+.+.|++||+.+|..+ ++|+++ T Consensus 130 l~~~t~~~gG~lvP~~~~~~ii~~~~~~s~l~~l~~~~~----~~~~~~~~~~~~~~~~a~wv~E~~~~~~~~~~~f~~v 205 (425) T protein:vir:10 130 LNKGEDSEGGYLTPIEWDRTITNKLVLISPMRQLCRVQP----VSKAGFSKLFNMGGTTSGWVGEASQRPQTNAATFQPL 205 (425) T ss_pred hhcCcCCCCceeccHhHHHHHHHHHHhhhhhhhhceeee----ccCCceEEEEEcCCcceeeecccccccccccccccee Confidence 666666667789999999999999999999999886532 233458899887778899999999999876 689999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccc---------c----------------cc Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGA---------T----------------LT 134 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a---------~----------------~~ 134 (274) ++.+++++.++++|+|++.++.+++.+++.+++++++++++|+.++.+-... + .+ T Consensus 206 ~~~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~ai~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~ 285 (425) T protein:vir:10 206 SFASGEIYANPAATQQILDDAEIDLESWLATEVQTEFAKQEGKAFLAGDGTNKPNGLLTYIAGGANAAKHPFGAIEVVNS 285 (425) T ss_pred eeeheeeEeehHhHHHHHhcchhHHHHHHHHHHHHHHHHHHHhhhhcccCCCCcceeeeccccccccccccccccccccc Confidence 9999999999999999999999999999999999999999999998642110 0 11 Q ss_pred ccCcccCHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCc-- Q lcl|Aclame:pro 135 VEADITKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNK-- 212 (274) Q Consensus 135 ~~~~~~~~d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~-- 212 (274) ..++.+++|+++++...|...+..+..|+|||.++..|++..+.+. .....+.+.+|..++|+|+||++++++|. 
T Consensus 286 ~~~~~~~~d~l~~l~~~l~~~~~~~a~~vmn~~~~~~L~~lkD~~G---~~l~~~~~~~g~~~~l~G~PV~~~~~~p~~~ 362 (425) T protein:vir:10 286 GAAADITSDGIIDLVYDLPSAFTGNARFAMNRNTQRQVRKLKDGQG---NYLWQPSYVAGQPATLAGYPVTEVPDMPDVA 362 (425) T ss_pred cccccccHHHHHHHHhhhhhhhccCCEEEEchHHHHHHHHhhcCCC---ceeeccCccCCCCceecceeeEEecCcCCcc Confidence 2345568999999999999888888999999999999987654332 22223345667778999999999999994 Q ss_pred --ceEEEEcCC--eEEEEeccCceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeCCCc Q lcl|Aclame:pro 213 --GEALLAKKG--AVKLITKRDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAGD 271 (274) Q Consensus 213 --~~~~l~~~~--a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~ 271 (274) .+.++|+.. ++.++.+..+.+..+.+..++.+.+++..|+|++|.+|+++++++.+|+. T Consensus 363 ~~~~~i~~Gd~~~~~~i~~~~~~~v~~d~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~as~ 425 (425) T protein:vir:10 363 ANSTPILFGDFQQTYLIIDRIGVRVLRDPYTAKPYVLFYTTKRVGGGLLNPEPMRAMKVAASE 425 (425) T ss_pred CCccEEEEEehhccEEEEEecceEEEecccccCCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 334566543 35566777788888888888999999999999999999999999999999 No 28 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=100.00 E-value=2.7e-38 Score=226.56 Aligned_cols=265 Identities=18% Similarity=0.169 Sum_probs=211.4 Q ss_pred CCccccc-hhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCccccccccccee Q lcl|Aclame:pro 1 MAQGTTK-VSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKR 79 (274) Q Consensus 1 ma~~~T~-~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~ 79 (274) ....+++ .+..++|+.+...|.+.+++..++++++++... 
..+..++||+....+.+.|++||+.+|+++++|+++ T Consensus 110 ~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~---~~~~~~~~p~~~~~~~a~wv~E~~~~~~~~~~f~~i 186 (390) T protein:vir:62 110 KRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGATTFTT---SDANPLDFTVITGRSSASIVGETAEIPESYPATAQR 186 (390) T ss_pred hhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeec---CCCceeEEEEEcCCcceeeecccccccccccceeee Confidence 2223333 344677777777888888888888888765322 234569999998888999999999999999999999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhc-------cc------cccccCcccCHHHHH Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALK-------GA------TLTVEADITKLDGLQ 146 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~-------~a------~~~~~~~~~~~d~iv 146 (274) ++++++++..+++|+|++.++.+++.+++.+++++++++++|+.++.+-. .. .....++..++++++ T Consensus 187 ~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~p~Gi~~~~~~~~~~~~~~~~~~~~~~~l~ 266 (390) T protein:vir:62 187 SMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFITGTGQPRGILTDASPATATFLATDTDSKVSDALI 266 (390) T ss_pred EeeeeeEEeehHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhhhccCCccccccccccccccceecccccccchHHHH Confidence 99999999999999999999999999999999999999999999985321 11 112234456899999 Q ss_pred HHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEEcCCeEEEE Q lcl|Aclame:pro 147 TAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGAVKLI 226 (274) Q Consensus 147 ~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~ 226 (274) ++...|...+.....|+|||.++..|++..+.+ +.....+.+..|..++|+|+||++++++|.+..++.+.+.+.+. 
T Consensus 267 ~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~---g~~l~~~~~~~g~~~~l~G~Pv~~~~~~p~~~i~~gd~s~~~i~ 343 (390) T protein:vir:62 267 DLFHEVPSAYRANAKYVVNDLRAAQMRKLKDAN---GQYLWQSGLTVGAPSLFNGKVVETDDGMPADKILFADLSKYRVR 343 (390) T ss_pred HHHHhhhhhhhcCCEEEEchHHHHHHHHhhccC---CCeeecCCcCCCccceecccceEEecCCCCccEEEeeccceeEE Confidence 999999887777788999999999997664332 12222344566777899999999999999998877666666677 Q ss_pred eccCceeeecccc--ccCccEEEEEEEEEEEEEcCcceEEEEeCCCc Q lcl|Aclame:pro 227 TKRDFFLEKDRDA--SRKSTALYSDKHYVAYLYDESKVVKITKGAGD 271 (274) Q Consensus 227 ~~~~~~ve~~r~~--~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~ 271 (274) .+.++.++.+.+. .++.+.+++..|+|+++++|+|+++|+.++|. T Consensus 344 ~~~~~~v~~~~~~~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~~~~a 390 (390) T protein:vir:62 344 FAGSLRVDRSVDAKFSTDQIVYRFLQRADGLLVDARGAKVLTVTPGA 390 (390) T ss_pred eecceEEEeeccccccCCcEEEEEEEEeCcEeechhheEEEEeecCC Confidence 7788887765554 45778899999999999999999999999888 No 29 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=100.00 E-value=7e-38 Score=224.27 Aligned_cols=263 Identities=14% Similarity=0.090 Sum_probs=215.2 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274) +...++..+..++|+.+.+.|++.+++.+.+.++++.. 
..+|..++||++...+.+.|++||+++|+++++|++++ T Consensus 27 ~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~----~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~ 102 (324) T protein:vir:96 27 DNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYE----PMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNAT 102 (324) T ss_pred ccccccCcCccccchhHHHHHHHHHHhhchhhhhccee----eccCCceEEEEEecCcceeEecCCccccccccceeEEE Confidence 44444566678999999999999999999998888653 24456699999988889999999999999999999999 Q ss_pred EeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccc------------cccccCcccCHHHHHHH Q lcl|Aclame:pro 81 AKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGA------------TLTVEADITKLDGLQTA 148 (274) Q Consensus 81 ~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a------------~~~~~~~~~~~d~iv~a 148 (274) +.+++++..+++|+|++.++.+++.+++.+++++++++++|+.+|....+. ......+..++++|+++ T Consensus 103 ~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~~~~~~t~~~i~~~ 182 (324) T protein:vir:96 103 MRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDL 182 (324) T ss_pred EeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcCccccccccccceeccccccHHHHHHH Confidence 999999999999999999999999999999999999999999998543221 11223455689999999 Q ss_pred HHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCC--CcceEEEEcCCeEEEE Q lcl|Aclame:pro 149 IDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKL--NKGEALLAKKGAVKLI 226 (274) Q Consensus 149 ~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~--p~~~~~l~~~~a~~~~ 226 (274) ...+..++..+..|+|||+++..|++..+.+ +...+..+..++++|+||+.++.. +++..++.+.+.+.+. 
T Consensus 183 ~~~l~~~~~~~~~~vmn~~~~~~L~~l~d~~-------G~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~~~~~g 255 (324) T protein:vir:96 183 EALLEDDELEANAFISKTQNRSLLRKIVDPE-------TKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYG 255 (324) T ss_pred HHhhhhccCCCCEEEEcHHHHHHHHHhhccC-------CCeeecCCCCCcccceeeEeeCCCCCCcceEEEEecceEEEE Confidence 9999999999999999999999998765432 223345567789999999997764 5566777777777777 Q ss_pred eccCceeeecccc----------------ccCccEEEEEEEEEEEEEcCcceEEEEeCCCcc-cC Q lcl|Aclame:pro 227 TKRDFFLEKDRDA----------------SRKSTALYSDKHYVAYLYDESKVVKITKGAGDE-VM 274 (274) Q Consensus 227 ~~~~~~ve~~r~~----------------~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~-~~ 274 (274) ...++.++.+++. .+++..+++..|+|+++.+|+|+++|+.+-+.+ +- T Consensus 256 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~ 320 (324) T protein:vir:96 256 IPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSV 320 (324) T ss_pred EecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEecccccCCCC Confidence 8888888877653 346678999999999999999999999854443 33 No 30 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=100.00 E-value=7e-38 Score=224.27 Aligned_cols=263 Identities=14% Similarity=0.090 Sum_probs=215.2 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274) +...++..+..++|+.+.+.|++.+++.+.+.++++.. 
..+|..++||++...+.+.|++||+++|+++++|++++ T Consensus 27 ~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~----~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~ 102 (324) T protein:vir:78 27 DNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYE----PMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNAT 102 (324) T ss_pred ccccccCcCccccchhHHHHHHHHHHhhchhhhhccee----eccCCceEEEEEecCcceeEecCCccccccccceeEEE Confidence 44444566678999999999999999999998888653 24456699999988889999999999999999999999 Q ss_pred EeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccc------------cccccCcccCHHHHHHH Q lcl|Aclame:pro 81 AKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGA------------TLTVEADITKLDGLQTA 148 (274) Q Consensus 81 ~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a------------~~~~~~~~~~~d~iv~a 148 (274) +.+++++..+++|+|++.++.+++.+++.+++++++++++|+.+|....+. ......+..++++|+++ T Consensus 103 ~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~~~~~~t~~~i~~~ 182 (324) T protein:vir:78 103 MRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDL 182 (324) T ss_pred EeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcCccccccccccceeccccccHHHHHHH Confidence 999999999999999999999999999999999999999999998543221 11223455689999999 Q ss_pred HHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCC--CcceEEEEcCCeEEEE Q lcl|Aclame:pro 149 IDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKL--NKGEALLAKKGAVKLI 226 (274) Q Consensus 149 ~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~--p~~~~~l~~~~a~~~~ 226 (274) ...+..++..+..|+|||+++..|++..+.+ +...+..+..++++|+||+.++.. +++..++.+.+.+.+. 
T Consensus 183 ~~~l~~~~~~~~~~vmn~~~~~~L~~l~d~~-------G~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~~~~~g 255 (324) T protein:vir:78 183 EALLEDDELEANAFISKTQNRSLLRKIVDPE-------TKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYG 255 (324) T ss_pred HHhhhhccCCCCEEEEcHHHHHHHHHhhccC-------CCeeecCCCCCcccceeeEeeCCCCCCcceEEEEecceEEEE Confidence 9999999999999999999999998765432 223345567789999999997764 5566777777777777 Q ss_pred eccCceeeecccc----------------ccCccEEEEEEEEEEEEEcCcceEEEEeCCCcc-cC Q lcl|Aclame:pro 227 TKRDFFLEKDRDA----------------SRKSTALYSDKHYVAYLYDESKVVKITKGAGDE-VM 274 (274) Q Consensus 227 ~~~~~~ve~~r~~----------------~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~-~~ 274 (274) ...++.++.+++. .+++..+++..|+|+++.+|+|+++|+.+-+.+ +- T Consensus 256 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~ 320 (324) T protein:vir:78 256 IPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSV 320 (324) T ss_pred EecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEecccccCCCC Confidence 8888888877653 346678999999999999999999999854443 33 No 31 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=100.00 E-value=5.4e-38 Score=224.90 Aligned_cols=267 Identities=13% Similarity=0.077 Sum_probs=215.8 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccc-ccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQ-IGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~-~~~~~~ 79 (274) |...++..+..++|+.+++.|++.+++.+.+.++++... 
..+..+++|+....+.+.|++||+.+|+.+ .+|+++ T Consensus 106 ~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~l~~~~~~~~----~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~i 181 (407) T protein:vir:48 106 LQVGNDEDGGYAIPEELDRTILTLLKDEVVMRQEATVIT----LGGSDYKKLVNLGGTTSGWVGETDARPETATSKLGLI 181 (407) T ss_pred hhcccCCCCcccccHhHHHHHHHHHHhhhhhhhhceeee----cCCCceEEEEecCCcceeeecccccccccccccceeE Confidence 665666666789999999999999999999888886532 234468899887778899999999999865 799999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccc-------------------------ccc Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGA-------------------------TLT 134 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a-------------------------~~~ 134 (274) ++.+++++..+++|+|++.++.+++.+++.+++++++++++|+.++..-.+. ..+ T Consensus 182 ~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~~~a~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~ 261 (407) T protein:vir:48 182 EPFMGEIYGNPQATQKMLDDAFFNVEDWINSELALEFAEQEEIAFTSGDGSKKPKGFLAYESTDEDDKTRAFGKLQHIAS 261 (407) T ss_pred EeeeeeeEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeeeccccccccccccccccccccc Confidence 9999999999999999999999999999999999999999999988532110 011 Q ss_pred ccCcccCHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCc-- Q lcl|Aclame:pro 135 VEADITKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNK-- 212 (274) Q Consensus 135 ~~~~~~~~d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~-- 212 (274) ..++.+++|+++++...|...+..+..|+|||.++..|++.++.+. .....+.+..|..++|+|+||++++++|. 
T Consensus 262 ~~~~~~~~d~i~~l~~~l~~~~~~~a~~v~n~~~~~~L~~lkD~~G---r~l~~~~~~~g~~~~l~G~PV~~~~~~p~~~ 338 (407) T protein:vir:48 262 GAASGVTADAIIKLIYTLRKAHRSGAKFMMNNSSLFAIRLLKDNDG---NYLWRPGIELGQPSSLAGYGIVENEQMPDIA 338 (407) T ss_pred ccccccChHHHHHHHHhhchhhhcCCEEEEcHHHHHHHHHhhccCC---ceeeccCcCCCCCceecceeeEEecCcCCcc Confidence 2334567999999999999888888899999999999987654332 12223335667778999999999999995 Q ss_pred --ceEEEEcCC--eEEEEeccCceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 213 --GEALLAKKG--AVKLITKRDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 213 --~~~~l~~~~--a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) ..+++|+.. ++.+..+.++.++.+++..++...+++..|+|++|.+|+++++++.++|+.=- T Consensus 339 ~~~~~i~~Gd~~~~~~i~~~~~~~i~~d~~~~~~~~~~~~~~r~d~~v~~~~a~~~l~~~aa~~~~ 404 (407) T protein:vir:48 339 ADAKAIAFGNFKRGYTIVDRIGTRILRDPYTNKPFVGFYTTKRTGGMLVDSQAIKLMKIGAATRQK 404 (407) T ss_pred CCccEEEEEeccccEEEEEeeceEEEeeccccCCcEEEEEEEEeccEEecccceEEEEeeccCCCC Confidence 334565543 45566677888888877778899999999999999999999999999887655 No 32 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=100.00 E-value=5.4e-38 Score=224.89 Aligned_cols=263 Identities=13% Similarity=0.058 Sum_probs=214.2 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCccccc-cccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVD-QIGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~-~~~~~~~ 79 (274) |+..+...+..++|+.+++.|++.+++.+.+.++++.. ..++...++|.....+.+.|++||+.+|.. 
.++|+++ T Consensus 107 ~~~~~~~~GG~~iP~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~a~wv~E~~~~~~~~~~~~~~v 182 (401) T protein:vir:44 107 LQVGTDEDGGYAVPEELDRSILSLLKDEVVMRQEATVI----TVGGSDYKKLVNLGGTASGWVGETDTRSQTATSRLGLI 182 (401) T ss_pred hhcCCCCCCceeccHhHHHHHHHHHHhhhhhhhhceee----ecCCCceEEEEecCCccceeeccccccCccccccceee Confidence 66666666678999999999999999999888888653 234556888888777789999999999875 5799999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccc-------------------------ccc Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGA-------------------------TLT 134 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a-------------------------~~~ 134 (274) ++.+++++.++.+|+|++.++.+++.+++.+++++++++++|..++..-.+. ..+ T Consensus 183 ~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~ai~~~~~~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~t 262 (401) T protein:vir:44 183 EPFMGEIYGNPQATQKMLDDAFFNVEAWINSELATEFAEQEEIAFTTGDGTKKPKGFLAYESTEESDKARAFGKLQHIVS 262 (401) T ss_pred eeehhheeeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeecccccccccccccccccccccc Confidence 9999999999999999999999999999999999999999999998542211 011 Q ss_pred ccCcccCHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCc-- Q lcl|Aclame:pro 135 VEADITKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNK-- 212 (274) Q Consensus 135 ~~~~~~~~d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~-- 212 (274) ..++.++|+++++++..|..++..+.+|+|||.++..|++..+.+. .....+.+.+|..++|+|+||++++++|. 
T Consensus 263 ~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G---~~l~~~~~~~g~~~~l~G~PVv~~~~~p~~~ 339 (401) T protein:vir:44 263 GEATAVTADAIIKLIYTLRKAHRTGAKFMMNNNSLFAIRLLKDTEG---NYLWRPGLELGQPSSLAGYGIAENEQMPDIA 339 (401) T ss_pred ccccccCHHHHHHHHHhcchhhhcCCEEEEcHHHHHHHHHhhccCC---ceeecCCcCCCCCceecceeeEEecCcCCcc Confidence 2344567999999999999888888899999999999987654321 12223335667778999999999999985 Q ss_pred --ceEEEEcCC--eEEEEeccCceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeCCC Q lcl|Aclame:pro 213 --GEALLAKKG--AVKLITKRDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAG 270 (274) Q Consensus 213 --~~~~l~~~~--a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa 270 (274) +..++|+.. ++.+..+.+++++.+++..++...+++..|+|+++++|+++++|+.+|| T Consensus 340 ~~~~~i~~Gd~~~~~~i~~~~~~~~~~~~~~~~~~v~~~a~~r~d~~~~~~~a~~~l~~~aa 401 (401) T protein:vir:44 340 ADAKAIAFGNFKRGYTIVDRIGTRILRDPYTNKPFVGFYTTKRTGGMLVDSQAIKLLKIAAA 401 (401) T ss_pred CCccEEEEeehhccEEEEEecceEEeeeccccCCcEEEEEEEEeccEEecccceEEEEeecC Confidence 233555543 4666777888888888878888999999999999999999999999999 No 33 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=100.00 E-value=8.2e-38 Score=223.90 Aligned_cols=265 Identities=17% Similarity=0.154 Sum_probs=209.6 Q ss_pred CCccccchh-hccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCccccccccccee Q lcl|Aclame:pro 1 MAQGTTKVS-NLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKR 79 (274) Q Consensus 1 ma~~~T~~~-~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~ 79 (274) ....|++.+ .++.|+++...|.+.+.+..++++++.+. 
....+..+.+|+....+.+.|++||+.+|+++++|+++ T Consensus 110 ~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~---~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~f~~v 186 (392) T protein:vir:13 110 KRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGASTF---TTSDANPMDFTVITGRATAGIVGETAEIPESYPATTQR 186 (392) T ss_pred hhcccccCCCccccccchHHHHHHHHhhhhhhhhcceee---ecCCCceeEEEEEcCCcceeeecccccccccccceeeE Confidence 222333333 35666666666767777777777777543 22345668999998888999999999999999999999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcc---------cc------ccccCcccCHHH Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKG---------AT------LTVEADITKLDG 144 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~---------a~------~~~~~~~~~~d~ 144 (274) .+.+++++..+.+|++++.++.+++.+++.+++++++++++|+.++..-.+ .+ ....++..+||+ T Consensus 187 ~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~Gt~~p~Gil~~~~~~~~~~~~~~~~~~~~d~ 266 (392) T protein:vir:13 187 SMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFLTGTGTGQPRGILTDATGANAAFGEADADSKVSDA 266 (392) T ss_pred EeeeeeEEeeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcccCCccccccccccccccccccccccccccHHH Confidence 999999999999999999999999999999999999999999999953211 10 122345577999 Q ss_pred HHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEEcCCeEE Q lcl|Aclame:pro 145 LQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGAVK 224 (274) Q Consensus 145 iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~ 224 (274) ++++...|...+..+..|+|||.++..|++..+.+. .....+.+..|..++|+|+||++++++|.++.++.+.+.+. 
T Consensus 267 l~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G---~~l~~~~~~~g~~~~l~G~Pv~~~~~~~~~~i~~Gdf~~~~ 343 (392) T protein:vir:13 267 LIDLFHEVPSAYRKNAKFVVNDLRAAQMRKLKDANG---QYLWQSALTVGAPDTFNGKVVETDDGMPADKVLFADLSKYR 343 (392) T ss_pred HHHHHHhhhhhhhcCCEEEEcHHHHHHHHHhhccCC---ceeecCCcCCCCCceecceeeEEcCCCCCCcEEEeecccee Confidence 999999998887788899999999999986543221 12222345667778999999999999999998877777777 Q ss_pred EEeccCceeeeccccc--cCccEEEEEEEEEEEEEcCcceEEEEeCCCc Q lcl|Aclame:pro 225 LITKRDFFLEKDRDAS--RKSTALYSDKHYVAYLYDESKVVKITKGAGD 271 (274) Q Consensus 225 ~~~~~~~~ve~~r~~~--~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~ 271 (274) +..+.++.++.+++.. ++.+.+++..|+|+++.+|++++.++.++|. T Consensus 344 i~~~~~~~i~~~~~~~~~~~~~~~r~~~r~d~~~~~~~A~~~~~~~~aa 392 (392) T protein:vir:13 344 VRFAGSLRVDRSVDAKFSTDQIVYRFLQRADGLLVDARGAKVLTVTPAA 392 (392) T ss_pred EEeecceEEEeeccccccCCcEEEEEEEEeccEEecccceEEEEeeccC Confidence 7888888887665554 5678899999999999999999999988777 No 34 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=100.00 E-value=1.1e-37 Score=223.14 Aligned_cols=263 Identities=14% Similarity=0.085 Sum_probs=212.4 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274) +....+..++.++|+.+++.|++.+++.+.+.++++... 
.++..++||++...+.+.|++||+.+|+.+++|++++ T Consensus 27 ~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~~~----~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~v~ 102 (324) T protein:vir:96 27 DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEP----MEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNAT 102 (324) T ss_pred ccccccCCCcceechhHHHHHHHHHHhhchhhhhcceee----ccCCceEEEEEecCcceeeecCCccccccccceeEEE Confidence 111122334568999999999999999999888886532 3456699999988889999999999999999999999 Q ss_pred EeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccc------------cccccCcccCHHHHHHH Q lcl|Aclame:pro 81 AKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGA------------TLTVEADITKLDGLQTA 148 (274) Q Consensus 81 ~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a------------~~~~~~~~~~~d~iv~a 148 (274) +.+++++..+++|+|++.++.+++.+++.+++++++++++|+.+|....+. ......+..++++|+++ T Consensus 103 ~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~ 182 (324) T protein:vir:96 103 MRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIKKTNKVIKGDFTQDNIIDL 182 (324) T ss_pred EEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCCcCccccccccccceecccccchHHHHHH Confidence 999999999999999999999999999999999999999999998543221 11223455689999999 Q ss_pred HHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCC--CcceEEEEcCCeEEEE Q lcl|Aclame:pro 149 IDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKL--NKGEALLAKKGAVKLI 226 (274) Q Consensus 149 ~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~--p~~~~~l~~~~a~~~~ 226 (274) ...+..++..+..|+|||..+..|++..+.+ |...+..+..++++|+||++++.. +++..++.+.+.+.+. 
T Consensus 183 ~~~i~~~~~~~~~~i~n~~~~~~L~~lkd~~-------G~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~s~~~~~ 255 (324) T protein:vir:96 183 EALLEDDELEANAFISKTQNRSLLRKIVDPE-------TKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYG 255 (324) T ss_pred HHhhhhccCCCCEEEEcHHHHHHHHHhhCCC-------CCeeecCCCCCcccceeeEeecCCCCCcceEEEEecceEEEE Confidence 9999998889999999999999998765422 223345566789999999997764 4566777777777777 Q ss_pred eccCceeeecccc----------------ccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 227 TKRDFFLEKDRDA----------------SRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 227 ~~~~~~ve~~r~~----------------~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) ...++.++.+++. .++...+++..|+|+++.+|+++++|+.+.+..-. T Consensus 256 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~n~v~~r~~~r~d~~v~~~~a~~~l~~a~~~~~~ 319 (324) T protein:vir:96 256 IPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDS 319 (324) T ss_pred EecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEecccccCCC Confidence 7888888877663 34567899999999999999999999977665544 No 35 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=100.00 E-value=1.1e-38 Score=228.60 Aligned_cols=268 Identities=16% Similarity=0.180 Sum_probs=217.7 Q ss_pred CC--ccccc------hhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCccccc Q lcl|Aclame:pro 1 MA--QGTTK------VSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVD 72 (274) Q Consensus 1 ma--~~~T~------~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~ 72 (274) || |+.|. ..+.||||+|+.++++.+++++++.+++. +.+.....|++|+||+++. 
+++.++.+|.+++.+ T Consensus 1 ~~~~~~~~~~~~~t~~v~~fipei~s~~i~~~l~~~~v~~~~~~-d~~~~~~~Gdtv~ip~~g~-~~~~d~~~~~~i~~~ 78 (341) T protein:vir:94 1 MALGNTITGPSINTQRGQQFIPEQWLSEVQMFRKAKMLDTSVVK-TWGAQVKKGDTFHVPRISE-LGVEDKATDVPVGVQ 78 (341) T ss_pred CcchhhhccccccchhHHHHHHHHHHHHHHHHHHhhcchhhccc-cccccccCCceEEEeccCc-ceeeeecCCCccccc Confidence 55 44333 33468999999999999999999988874 4444556699999999874 578999999999999 Q ss_pred ccccceeEEeehhh-hcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc--------------cccC Q lcl|Aclame:pro 73 QIGTSKREAKVRKI-GKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL--------------TVEA 137 (274) Q Consensus 73 ~~~~~~~~~~~~~~-~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~--------------~~~~ 137 (274) +++..++++++.+. ...+.|+|++..++..|+++.+.+++++++++++|+.++..+..+.. +... T Consensus 79 ~~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~~~~~~~~~t~~~ 158 (341) T protein:vir:94 79 PVNDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNVFSSSNGAITGNG 158 (341) T ss_pred cccCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCccccCccccccCch Confidence 99999999999664 67799999999999999999999999999999999999877643321 1222 Q ss_pred cccCHHHHHHHHHHHhhcCC--CccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceE Q lcl|Aclame:pro 138 DITKLDGLQTAIDKFNDEDL--EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEA 215 (274) Q Consensus 138 ~~~~~d~iv~a~~~l~~~~~--~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~ 215 (274) ..++++.|++|...|+++++ .+|+++++|+.++.|+++. ++......++..+++|.+|+++|++|++|+++|.++. 
T Consensus 159 ~~~~~~~i~~a~~~Lde~~VP~~gR~lvv~P~~~~~Ll~~~--~~~~~~~~g~~~l~~G~ig~i~G~~V~~Sn~lp~~~~ 236 (341) T protein:vir:94 159 QAFSFAVFLAARRLLLEADVPEEKIVLLISPGQESALFTIP--QFISKDFINNAPIAQGQIGSLMGVRVIRTSLIGNNSA 236 (341) T ss_pred hhhhHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhhch--hhhhhhccccchhheeeeeeEeceEEEEecccccccc Confidence 34568999999999998865 7899999999999999764 4555555666779999999999999999999996442 Q ss_pred -------------------------------------EEEcCCeEEEE-----------eccCceeeeccccccCccEEE Q lcl|Aclame:pro 216 -------------------------------------LLAKKGAVKLI-----------TKRDFFLEKDRDASRKSTALY 247 (274) Q Consensus 216 -------------------------------------~l~~~~a~~~~-----------~~~~~~ve~~r~~~~~~~~i~ 247 (274) +++++++++.+ ..+....+.+|++.++.+.|+ T Consensus 237 ~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 316 (341) T protein:vir:94 237 TGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVVSKAPRVTQSFENREQVWLMV 316 (341) T ss_pred ccccccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhhccccccccccccchhhhhhhhhh Confidence 33344444433 133456677888889999999 Q ss_pred EEEEEEEEEEcCcceEEEEeCCCcc Q lcl|Aclame:pro 248 SDKHYVAYLYDESKVVKITKGAGDE 272 (274) Q Consensus 248 ~~~~~~~~v~~~~avv~l~~~aa~~ 272 (274) ++..||+++++|+++|.|+.+++|. 
T Consensus 317 ~~~~~G~~~lrp~~~v~~~~~~~~~ 341 (341) T protein:vir:94 317 GRQAYGARLYRPLHAVNIHTTGDTV 341 (341) T ss_pred hhhhhcccccCcceeEEEecCcCCC Confidence 9999999999999999999999998 No 36 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=100.00 E-value=7.4e-38 Score=224.14 Aligned_cols=270 Identities=17% Similarity=0.096 Sum_probs=210.0 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274) |+..+|.....++|+.+.+.+++.+++.+.++++++.. ..++..++||++...+.+.|++||+++++++++|++++ T Consensus 10 ~~~~~t~~~~g~l~~~~~~~ii~~l~~~s~i~~l~~~~----~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~f~~v~ 85 (397) T protein:vir:23 10 IAQTKDTMFTGYLDPVQAKDYFAEAEKTSIVQRVAQKI----PMGATGIVIPHWTGDVSAQWIGEGDMKPITKGNMTKRD 85 (397) T ss_pred HhhccCCCCccccchhHHHHHHHHHHhccchhhhccee----eccCCceEEEEEcCCcceEEecCCccccccccceeEEE Confidence 66555555554555557888889999888888887652 24456699999988889999999999999999999999 Q ss_pred EeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc-----------ccccCcccCHHHHHHHH Q lcl|Aclame:pro 81 AKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT-----------LTVEADITKLDGLQTAI 149 (274) Q Consensus 81 ~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~-----------~~~~~~~~~~d~iv~a~ 149 (274) +.++|++..+.+|+|+++++.+++++++.+++++++++++|+.++....+.. .........++.++++. 
T Consensus 86 l~~~k~~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 165 (397) T protein:vir:23 86 VHPAKIATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAALHGTNAPSAFQGYLDQSNKTQSISPNAYQGLGVSGL 165 (397) T ss_pred EeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccCCcccccccccccceeeecccchhHHHHHHH Confidence 9999999999999999999999999999999999999999999996443211 11123445678899999 Q ss_pred HHHhhcCCCccEEEEcHHHHHHHHhhhccc--cccccccccccccccccchhcceeeEEcCCCCcceEE--EEcCCeEEE Q lcl|Aclame:pro 150 DKFNDEDLEPMVLFVNPLDAGGLRTSASDN--FTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEAL--LAKKGAVKL 225 (274) Q Consensus 150 ~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~--~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~--l~~~~a~~~ 225 (274) ..|..++..+..|+|||..+..|++.++.+ ++.......+....+..++++|+||++++++|.++.. +.+...+.+ T Consensus 166 ~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~tl~G~Pv~~s~~~~~g~~~~~~gDfs~~~i 245 (397) T protein:vir:23 166 TKLVTDGKKWTHTLLDDTVEPVLNGSVDANGRPLFVESTYESLTTPFREGRILGRPTILSDHVAEGDVVGYAGDFSQIIW 245 (397) T ss_pred HhhhhcccCCCEEEEcHHHHHHHHHhhccCCceeecccccccccccccCceeeeeeEEEeCCCCCCceEEEEeecceEEE Confidence 999999989999999999999998765432 2222222222233344568999999999999998864 334455556 Q ss_pred EeccCceeeecccc----------------ccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 226 ITKRDFFLEKDRDA----------------SRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 226 ~~~~~~~ve~~r~~----------------~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) ....++.++.+++. 
.+++..+|+..|+|+++.+|+++++++.....+-- T Consensus 246 ~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~~~~~~ 310 (397) T protein:vir:23 246 GQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDPVLTTY 310 (397) T ss_pred EEEeceEEEEeeeeeeeeccccccceeeeeeccceeEEEEeeeccceecccceEEEeecccccee Confidence 67777777766653 34567899999999999999999999987664443 No 37 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=100.00 E-value=1.7e-37 Score=222.21 Aligned_cols=263 Identities=14% Similarity=0.082 Sum_probs=212.7 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274) +....+..++.++|+.+++.|++.+++.+.+.++++... .++.+++||++...+.+.|++||+.+|+.+++|++++ T Consensus 27 ~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~~----~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~ 102 (324) T protein:vir:99 27 DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYEP----MEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNAT 102 (324) T ss_pred cceeccCCCcceechhHHHHHHHHHHhhchhhhhcceee----ccCCceEEEEEecCcceeEeccCccccccccceeEEE Confidence 222223344568999999999999999999888886532 3355699999988889999999999999999999999 Q ss_pred EeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccc------------cccccCcccCHHHHHHH Q lcl|Aclame:pro 81 AKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGA------------TLTVEADITKLDGLQTA 148 (274) Q Consensus 81 ~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a------------~~~~~~~~~~~d~iv~a 148 (274) +.++|++..+++|+|++.++.+++.+++.+++++++++++|+.++....+. 
......+..++++++++ T Consensus 103 ~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~ 182 (324) T protein:vir:99 103 MRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDL 182 (324) T ss_pred EeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCccCccccccccccceeccccCCHHHHHHH Confidence 999999999999999999999999999999999999999999998543322 12233456789999999 Q ss_pred HHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCc--ceEEEEcCCeEEEE Q lcl|Aclame:pro 149 IDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNK--GEALLAKKGAVKLI 226 (274) Q Consensus 149 ~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~--~~~~l~~~~a~~~~ 226 (274) ...|..++..+..|+|||.+|..|++..+.+ +...+..+..++++|+||+.++..+. +..++.+...+.+. T Consensus 183 ~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~-------g~~~~~~~~~~~l~G~PVv~~~~~~~~~~~~i~gd~~~~~~~ 255 (324) T protein:vir:99 183 EALLEDDELEANAFISKTQNRSLLRKIVDPE-------TKERIYDRNSDTLDGLPVVNLKSSNLKRGELITGDFDKLIYG 255 (324) T ss_pred HHhhhhccCCCCEEEEcHHHHHHHHHhhcCC-------CceeecCCCCccccceeEEeecCCCCCcceEEEEecccEEEE Confidence 9999999888899999999999998765322 12333455667899999999988664 55666677777777 Q ss_pred eccCceeeecccc----------------ccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 227 TKRDFFLEKDRDA----------------SRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 227 ~~~~~~ve~~r~~----------------~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) ...++.++..++. 
.+++..+++..|+|+++.+|+++++|+.+.+-.=- T Consensus 256 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a~~~~~~ 319 (324) T protein:vir:99 256 IPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDS 319 (324) T ss_pred EecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCCCC Confidence 8888888877663 34678899999999999999999999987654433 No 38 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=100.00 E-value=3.1e-37 Score=220.70 Aligned_cols=263 Identities=14% Similarity=0.087 Sum_probs=212.8 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274) +....+..+..++|+.+++.|++.+++.+.+.++++.. ..++.+++||++...+.++|++||+++|+++++|++++ T Consensus 27 ~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~~~----~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~ 102 (324) T protein:vir:10 27 DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYE----PMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNAT 102 (324) T ss_pred cceeccCCCcceechhHHHHHHHHHHhhchhhhhccee----eccCCceEEEEEeCCcceeEeccCccccccccceeEEE Confidence 22222334456999999999999999999988888653 23455699999988889999999999999999999999 Q ss_pred EeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccc------------cccccCcccCHHHHHHH Q lcl|Aclame:pro 81 AKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGA------------TLTVEADITKLDGLQTA 148 (274) Q Consensus 81 ~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a------------~~~~~~~~~~~d~iv~a 148 (274) +.++|++..+++|+|++.++.+++.+++.+++++++++++|+.+|....+. 
......+..++++++++ T Consensus 103 ~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~a~l~G~g~~~~~~~i~~~~~~~~~~~~~~~t~~~i~~~ 182 (324) T protein:vir:10 103 MRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDL 182 (324) T ss_pred EeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCccCccccccccccceeccccCCHHHHHHH Confidence 999999999999999999999999999999999999999999998543322 11233456789999999 Q ss_pred HHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCC--cceEEEEcCCeEEEE Q lcl|Aclame:pro 149 IDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLN--KGEALLAKKGAVKLI 226 (274) Q Consensus 149 ~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p--~~~~~l~~~~a~~~~ 226 (274) ...+..++..+..|+|||..|..|++..+.+ +...+..+..++++|+||+.++..+ ++..++.+.+.+.+. T Consensus 183 ~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~-------g~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~~~~~~ 255 (324) T protein:vir:10 183 EALLEDDELEANAFISKTQNRSLLRKIVDPE-------TKERIYDRNSDTLDGLPVVNLKSSNLKRGELITGDFDKLIYG 255 (324) T ss_pred HHhhhhccCCCCEEEEcHHHHHHHHHhhccC-------CceeecCCCCccccceeEEeecCCCCCcceEEEEecccEEEE Confidence 9999999888999999999999998765322 1223445566789999999988755 556677777777777 Q ss_pred eccCceeeecccc----------------ccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 227 TKRDFFLEKDRDA----------------SRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 227 ~~~~~~ve~~r~~----------------~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) ...++.++..++. .++...+++..|+|+++.+|+++++|+.+.+-.-. 
T Consensus 256 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~ 319 (324) T protein:vir:10 256 IPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDS 319 (324) T ss_pred EecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCCCC Confidence 7888888776653 34667899999999999999999999987766543 No 39 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=100.00 E-value=2.3e-37 Score=221.44 Aligned_cols=262 Identities=12% Similarity=0.070 Sum_probs=203.7 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274) ||+. +.+..++|+.+++.|++.+++.+.++++++... .++..++||++...+.+.|++||+++|+++++|++++ T Consensus 1 m~t~--t~gg~liP~~~~~~ii~~l~~~s~i~~l~~~~~----~~~~~~~ip~~~~~~~a~wv~E~~~~~~s~~~f~~v~ 74 (303) T protein:vir:97 1 MGTE--TSKASLFDKHLVSDLINKVKGHSSLAKLSSQKP----IPFNGSKEFTFTLDSDIDVVAENGKKTHGGLSLEPVT 74 (303) T ss_pred Cccc--CCCCeEcchhHHHHHHHHHHhhchhhhhcceee----cCCCceEEEEEecCcceEEeecCccccccccceeeEE Confidence 8844 445689999999999999999999999986532 3445699999988889999999999999999999999 Q ss_pred EeehhhhcchhccHHHHh---ccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc-------------------ccccCc Q lcl|Aclame:pro 81 AKVRKIGKGTELTDEAVL---SGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT-------------------LTVEAD 138 (274) Q Consensus 81 ~~~~~~~~~~~is~e~~~---~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~-------------------~~~~~~ 138 (274) +.++|++..+++|+|+++ ++.+++++++.+++++++++++|+.++....+.. .+.... 
T Consensus 75 l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 154 (303) T protein:vir:97 75 IVPIKVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNHFDSKVTQVVKFTES 154 (303) T ss_pred eeeEEEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCccccccccccccccccccccccccc Confidence 999999999999999985 5567899999999999999999999996532110 011123 Q ss_pred ccCHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcce---- Q lcl|Aclame:pro 139 ITKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGE---- 214 (274) Q Consensus 139 ~~~~d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~---- 214 (274) ...+++|+++...+..++..++.|+|||.++..|++.++.+..... ..+.-.++..++|+|+||++++++|... T Consensus 155 ~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~g~~~~--~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~ 232 (303) T protein:vir:97 155 EDADANIEAAVNLIQGAEGVVTGLAMDTEFSTALAKVTNGEMGPKM--YPELAWGANPDSINGLKSSVNTTVGAGADEAE 232 (303) T ss_pred cchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCCeEE--ecCccCCCCCceecceeeEEecccCCccccCC Confidence 3468999999999988888999999999999999876543221111 1111223455689999999999998532 Q ss_pred ---EEEEcC--CeEEEEeccCceeeeccc----------cccCccEEEEEEEEEEEEEcCcceEEEEeCCC Q lcl|Aclame:pro 215 ---ALLAKK--GAVKLITKRDFFLEKDRD----------ASRKSTALYSDKHYVAYLYDESKVVKITKGAG 270 (274) Q Consensus 215 ---~~l~~~--~a~~~~~~~~~~ve~~r~----------~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa 270 (274) .++++. 
..+.+..+.++.++..++ ..+++..+|+..|+|+++.+|+++++|+++== T Consensus 233 ~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~~~~ 303 (303) T protein:vir:97 233 SKDLVIIGDFESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTKGEV 303 (303) T ss_pred CccEEEEeeccccEEEEEecCcEEEEeeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEeeCCCC Confidence 244443 456677777777765433 23455689999999999999999999998765 No 40 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=100.00 E-value=2.9e-37 Score=220.87 Aligned_cols=257 Identities=16% Similarity=0.111 Sum_probs=207.1 Q ss_pred CC--------ccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCccccc Q lcl|Aclame:pro 1 MA--------QGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVD 72 (274) Q Consensus 1 ma--------~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~ 72 (274) || ..+|..+..++|+.+++.|++.+++.+.+.++++.. ..++...+||++...+.+.|++|++++|++ T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~ 76 (304) T protein:vir:94 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNE----PMTAQKKKFTYLAKGVGAYWVSETERIQTS 76 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhccee----eccCCceEEEEEeCCcceEEeecCcccccc Confidence 66 234555567999999999999999999988887653 234556899999888889999999999999 Q ss_pred ccccceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccc-----------------cccc Q lcl|Aclame:pro 73 QIGTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGA-----------------TLTV 135 (274) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a-----------------~~~~ 135 (274) +++|++++++++|++..+++|+|++.++.+++++++.+++++++++++|+.++...... ..+. 
T Consensus 77 ~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~ 156 (304) T protein:vir:94 77 KPEYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKGNVV 156 (304) T ss_pred cceeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999998543211 1112 Q ss_pred cCcccCHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCc--- Q lcl|Aclame:pro 136 EADITKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNK--- 212 (274) Q Consensus 136 ~~~~~~~d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~--- 212 (274) .....+|++|+++...+..++..+..|+|||+++..|++..+.+. ..+..+..++++|+||++++++|. T Consensus 157 ~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G--------~~l~~~~~~~l~G~PV~~~~~~~~~~~ 228 (304) T protein:vir:94 157 TDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNALDAND--------RPLFDANGNEIMGLPLSYTGADVYDKK 228 (304) T ss_pred ccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCC--------cEeecCCCccccceeeEEecccccCCC Confidence 234456999999999999999999999999999999987643221 123334457899999999999984 Q ss_pred -ceEEEEcCCeEEEEeccCceeeecccc------------------ccCccEEEEEEEEEEEEEcCcceEEEEeCC Q lcl|Aclame:pro 213 -GEALLAKKGAVKLITKRDFFLEKDRDA------------------SRKSTALYSDKHYVAYLYDESKVVKITKGA 269 (274) Q Consensus 213 -~~~~l~~~~a~~~~~~~~~~ve~~r~~------------------~~~~~~i~~~~~~~~~v~~~~avv~l~~~a 269 (274) +..++.+.+.+.+..+.++.++..++. 
.+++..+|+..|+|+++.+|+++++|+.+- T Consensus 229 ~~~~~~gd~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 229 KSLALMGDWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred CcEEEEEehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 345555666666666667766655542 345678999999999999999999999998 No 41 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=100.00 E-value=2.9e-37 Score=220.87 Aligned_cols=257 Identities=16% Similarity=0.111 Sum_probs=207.1 Q ss_pred CC--------ccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCccccc Q lcl|Aclame:pro 1 MA--------QGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVD 72 (274) Q Consensus 1 ma--------~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~ 72 (274) || ..+|..+..++|+.+++.|++.+++.+.+.++++.. ..++...+||++...+.+.|++|++++|++ T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~ 76 (304) T protein:vir:10 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNE----PMTAQKKKFTYLAKGVGAYWVSETERIQTS 76 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhccee----eccCCceEEEEEeCCcceEEeecCcccccc Confidence 66 234555567999999999999999999988887653 234556899999888889999999999999 Q ss_pred ccccceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccc-----------------cccc Q lcl|Aclame:pro 73 QIGTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGA-----------------TLTV 135 (274) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a-----------------~~~~ 135 (274) +++|++++++++|++..+++|+|++.++.+++++++.+++++++++++|+.++...... ..+. 
T Consensus 77 ~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~ 156 (304) T protein:vir:10 77 KPEYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKGNVV 156 (304) T ss_pred cceeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999998543211 1112 Q ss_pred cCcccCHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCc--- Q lcl|Aclame:pro 136 EADITKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNK--- 212 (274) Q Consensus 136 ~~~~~~~d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~--- 212 (274) .....+|++|+++...+..++..+..|+|||+++..|++..+.+. ..+..+..++++|+||++++++|. T Consensus 157 ~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G--------~~l~~~~~~~l~G~PV~~~~~~~~~~~ 228 (304) T protein:vir:10 157 TDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNALDAND--------RPLFDANGNEIMGLPLSYTGADVYDKK 228 (304) T ss_pred ccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCC--------cEeecCCCccccceeeEEecccccCCC Confidence 234456999999999999999999999999999999987643221 123334457899999999999984 Q ss_pred -ceEEEEcCCeEEEEeccCceeeecccc------------------ccCccEEEEEEEEEEEEEcCcceEEEEeCC Q lcl|Aclame:pro 213 -GEALLAKKGAVKLITKRDFFLEKDRDA------------------SRKSTALYSDKHYVAYLYDESKVVKITKGA 269 (274) Q Consensus 213 -~~~~l~~~~a~~~~~~~~~~ve~~r~~------------------~~~~~~i~~~~~~~~~v~~~~avv~l~~~a 269 (274) +..++.+.+.+.+..+.++.++..++. 
.+++..+|+..|+|+++.+|+++++|+.+- T Consensus 229 ~~~~~~gd~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 229 KSLALMGDWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred CcEEEEEehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 345555666666666667766655542 345678999999999999999999999998 No 42 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=100.00 E-value=5.4e-37 Score=219.41 Aligned_cols=270 Identities=14% Similarity=0.071 Sum_probs=207.0 Q ss_pred CCcc------c--cchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCccccc Q lcl|Aclame:pro 1 MAQG------T--TKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVD 72 (274) Q Consensus 1 ma~~------~--T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~ 72 (274) ||.+ + |..+..++|+.+.+.+++.+++.+++.++++.. ...+..+++|++...+.+.|++||+++|++ T Consensus 1 m~~~~~~a~~~~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~----~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~ 76 (330) T protein:vir:77 1 MAGSTVPSTQVALTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARKV----PMGPTGISIPHWTGAVSASWTGEAERKPIT 76 (330) T ss_pred CcccccchhhccccCCCcceechhHHHHHHHHHHhccchhhhccee----eccCCceEEEEEcCCcceeEecCCCccccc Confidence 6632 1 222334566667889999999999988888652 244556999999888899999999999999 Q ss_pred ccccceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc------------------- Q lcl|Aclame:pro 73 QIGTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL------------------- 133 (274) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~------------------- 133 (274) +++|+++++.++|++..+++|+|+++++.+++++++.+++++++++++|+.++..-..... 
T Consensus 77 ~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~~~~~~~~~~~ 156 (330) T protein:vir:77 77 KGSFGKQELEPVKITTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETTKVVSLADTNL 156 (330) T ss_pred cceeeEEEEeEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccccccccccccceeecccc Confidence 9999999999999999999999999999999999999999999999999999953221100 Q ss_pred --cccCcccCHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccc--cccccccccccccccccchhcceeeEEcCC Q lcl|Aclame:pro 134 --TVEADITKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDN--FTRPTQLGDNIIVKGAFGEALGAVIVRSNK 209 (274) Q Consensus 134 --~~~~~~~~~d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~--~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~ 209 (274) ........++++++++..+..++..+..|+|||.++..|++.++.+ ++.......+....+..++++|+||+++++ T Consensus 157 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~PV~~~~~ 236 (330) T protein:vir:77 157 TTASGPQGNAYLAVNNALSLLVNSGKKWTGTLLDNVTEPILNTAVDGNGRPLFVESTYTEQVGAIREGRILGRPTYVADN 236 (330) T ss_pred cccccccchhHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHHhccCCceeecCccccccccccCCceecceeeEEecc Confidence 0011223478899999999888888999999999999998765433 221111111222333456899999999999 Q ss_pred CCcce------EEEEcCCeEEEEeccCceeeecccc--------------------ccCccEEEEEEEEEEEEEcCcceE Q lcl|Aclame:pro 210 LNKGE------ALLAKKGAVKLITKRDFFLEKDRDA--------------------SRKSTALYSDKHYVAYLYDESKVV 263 (274) Q Consensus 210 ~p~~~------~~l~~~~a~~~~~~~~~~ve~~r~~--------------------~~~~~~i~~~~~~~~~v~~~~avv 263 (274) +|.++ .++.+.+.+.+....++.++..++. 
.++...+|+..|+|+++.+|+|++ T Consensus 237 ~p~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~ 316 (330) T protein:vir:77 237 VVNGTVGNRVVGVMGDFSQVIWGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLWQHNMVAVRCEAEFAFMVNDKDAFV 316 (330) T ss_pred ccCCCCCCccEEEEEecceEEEEEecCcEEEEeecceeeecccccccccccccchhhcCcEEEEEEEEeccEEecccceE Confidence 99764 3455566666777777777655542 245678999999999999999999 Q ss_pred EEEeCCCcccC Q lcl|Aclame:pro 264 KITKGAGDEVM 274 (274) Q Consensus 264 ~l~~~aa~~~~ 274 (274) +|+.++|.+=- T Consensus 317 ~i~~~~~~~~~ 327 (330) T protein:vir:77 317 KLTDQVAGTDP 327 (330) T ss_pred EEEeccCCcCC Confidence 99999988777 No 43 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=100.00 E-value=1.2e-36 Score=217.61 Aligned_cols=260 Identities=15% Similarity=0.098 Sum_probs=211.9 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274) |...+|+.+..++|+.+++.|++.+++.+.+.++++.... 
.++..+.+|+....+.+.|++||+++|+.+++|++++ T Consensus 9 ~~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~---~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~ 85 (297) T protein:vir:95 9 ENVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEM---EGEQEKTVYVQTDGISAYWVNETEKIKTDKPEVVPVT 85 (297) T ss_pred ccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeec---CCCccEEEEEEcCCceeEEeecCccccccccceeEEE Confidence 5555666777899999999999999999998888866321 1233478888887788999999999999999999999 Q ss_pred EeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccc-----------cccccCcccCHHHHHHHH Q lcl|Aclame:pro 81 AKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGA-----------TLTVEADITKLDGLQTAI 149 (274) Q Consensus 81 ~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a-----------~~~~~~~~~~~d~iv~a~ 149 (274) +++++++..+++|+|+++++.+++++++.+++++++++++|+.++....+. .....++.++|++++++. T Consensus 86 l~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~gi~~~~~~~~~~~~~~~t~~~i~~~~ 165 (297) T protein:vir:95 86 LKAHKLGIILVTSREALNYTWKKFFEDMKPQIVEAFYKKIDEAGLLGHDTPFANSVAKAAKDANKVIGGPINYDNILKLQ 165 (297) T ss_pred EeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCcccccccccccccceecccccCHHHHHHHH Confidence 999999999999999999999999999999999999999999998533221 112334567899999999 Q ss_pred HHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCC--CCcceEEEEcCCeEEEEe Q lcl|Aclame:pro 150 DKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNK--LNKGEALLAKKGAVKLIT 227 (274) Q Consensus 150 ~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~--~p~~~~~l~~~~a~~~~~ 227 (274) ..+.+++..+..|+|||+.+..|++..+.. | ..+.++..++++|+||+.++. +++++.++.+.+.+.+.. 
T Consensus 166 ~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~-------G-~~i~~~~~~~l~G~Pv~~~~~~~~~~~~~~~gd~s~~~~~~ 237 (297) T protein:vir:95 166 DALYDADVEPNAFVSKIQNRSALREARDGN-------K-VSIYDKAANTIDGITTVDLKSARFEKGDLLAGDFDNLIYGV 237 (297) T ss_pred HHhhhccCCcCEEEEcHHHHHHHHHhhccC-------C-ceeecCCCCcccceeeEeecCCCCCCceEEEEecccEEEEE Confidence 999999988999999999999998764321 1 224456678999999997654 567888888877777778 Q ss_pred ccCceeeecccc----------------ccCccEEEEEEEEEEEEEcCcceEEEEeCCCc Q lcl|Aclame:pro 228 KRDFFLEKDRDA----------------SRKSTALYSDKHYVAYLYDESKVVKITKGAGD 271 (274) Q Consensus 228 ~~~~~ve~~r~~----------------~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~ 271 (274) ..++.++..++. .++...+|+..|+|+++.+|+++++|+.+.+= T Consensus 238 ~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~at~~ 297 (297) T protein:vir:95 238 PYNITYKISEEGQISTITNADGTPINLFEQEMIAIRATMDIAVMITKTDAFAKLTPAERV 297 (297) T ss_pred ecCeEEEEeeccccccccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEeecCCC Confidence 888887766653 24567889999999999999999999866554 No 44 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=100.00 E-value=1.1e-36 Score=217.67 Aligned_cols=261 Identities=14% Similarity=0.132 Sum_probs=207.7 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecC-CCCcccccCCCccccccccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTY-SGDAQVIAEGEKIPVDQIGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~-~~~a~~~~eg~~~~~~~~~~~~~ 79 (274) |...++. +..++|+.+...|++.+++.+.+.++++.. ..++..+++|+... 
.+.+.|++||+.+|+.+++|+++ T Consensus 105 ~~~~~~~-~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~ 179 (385) T protein:vir:18 105 LGSDADS-AGSLIQPMQIPGIIMPGLRRLTIRDLLAQG----RTSSNALEYVREEVFTNNADVVAEKALKPESDITFSKQ 179 (385) T ss_pred hcccccc-CCceecchhhhHHHHHhhhccchhhhccee----cccCcceEEEEEecCCcceeeeccCccccccccceeEE Confidence 4333333 344566678899999999998888887653 23455789999864 46789999999999999999999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------------ccccCcccCHHHH Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT--------------LTVEADITKLDGL 145 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~--------------~~~~~~~~~~d~i 145 (274) ++.+++++..+.+|++++.++ .++..++.+++++++++++|+.++.....+. ....++.+.+|.+ T Consensus 180 ~~~~~k~~~~~~is~ell~d~-~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~d~i 258 (385) T protein:vir:18 180 TANVKTIAHWVQASRQVMDDA-PMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLNATGDTRADII 258 (385) T ss_pred EEeeeeEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccccccchHHHH Confidence 999999999999999988876 6799999999999999999999986432211 1112344678999 Q ss_pred HHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEEcCC-eEE Q lcl|Aclame:pro 146 QTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKG-AVK 224 (274) Q Consensus 146 v~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~-a~~ 224 (274) +++...+...+..+..|+|||.++..|++..+.+. .....+ ..+|..++++|+||++++.+|.+++++.+.+ ++. 
T Consensus 259 ~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~G---~~l~~~-~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~~~~~~ 334 (385) T protein:vir:18 259 AHAIYQVTESEFSASGIVLNPRDWHNIALLKDNEG---RYIFGG-PQAFTSNIMWGLPVVPTKAQAAGTFTVGGFDMASQ 334 (385) T ss_pred HHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCC---ceeccC-cccCCCceecceeeEEcCcCCCCcEEEeecccEEE Confidence 99999999888899999999999999987653221 112222 3456678999999999999999998887755 566 Q ss_pred EEeccCceeeecccc----ccCccEEEEEEEEEEEEEcCcceEEEEeCCCc Q lcl|Aclame:pro 225 LITKRDFFLEKDRDA----SRKSTALYSDKHYVAYLYDESKVVKITKGAGD 271 (274) Q Consensus 225 ~~~~~~~~ve~~r~~----~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~ 271 (274) ++.+.++.++..++. .++...+++..|+|+++.+|+++++++.++|+ T Consensus 335 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:18 335 VWDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRPTAIIKGTFSSGS 385 (385) T ss_pred EEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEeccCC Confidence 777888888765543 35677899999999999999999999999999 No 45 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=100.00 E-value=1.1e-36 Score=217.67 Aligned_cols=261 Identities=14% Similarity=0.132 Sum_probs=207.7 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecC-CCCcccccCCCccccccccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTY-SGDAQVIAEGEKIPVDQIGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~-~~~a~~~~eg~~~~~~~~~~~~~ 79 (274) |...++. +..++|+.+...|++.+++.+.+.++++.. ..++..+++|+... 
.+.+.|++||+.+|+.+++|+++ T Consensus 105 ~~~~~~~-~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~ 179 (385) T protein:vir:19 105 LGSDADS-AGSLIQPMQIPGIIMPGLRRLTIRDLLAQG----RTSSNALEYVREEVFTNNADVVAEKALKPESDITFSKQ 179 (385) T ss_pred hcccccc-CCceecchhhhHHHHHhhhccchhhhccee----cccCcceEEEEEecCCcceeeeccCccccccccceeEE Confidence 4333333 344566678899999999998888887653 23455789999864 46789999999999999999999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------------ccccCcccCHHHH Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT--------------LTVEADITKLDGL 145 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~--------------~~~~~~~~~~d~i 145 (274) ++.+++++..+.+|++++.++ .++..++.+++++++++++|+.++.....+. ....++.+.+|.+ T Consensus 180 ~~~~~k~~~~~~is~ell~d~-~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~d~i 258 (385) T protein:vir:19 180 TANVKTIAHWVQASRQVMDDA-PMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLNATGDTRADII 258 (385) T ss_pred EEeeeeEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccccccchHHHH Confidence 999999999999999988876 6799999999999999999999986432211 1112344678999 Q ss_pred HHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEEcCC-eEE Q lcl|Aclame:pro 146 QTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKG-AVK 224 (274) Q Consensus 146 v~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~-a~~ 224 (274) +++...+...+..+..|+|||.++..|++..+.+. .....+ ..+|..++++|+||++++.+|.+++++.+.+ ++. 
T Consensus 259 ~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~G---~~l~~~-~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~~~~~~ 334 (385) T protein:vir:19 259 AHAIYQVTESEFSASGIVLNPRDWHNIALLKDNEG---RYIFGG-PQAFTSNIMWGLPVVPTKAQAAGTFTVGGFDMASQ 334 (385) T ss_pred HHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCC---ceeccC-cccCCCceecceeeEEcCcCCCCcEEEeecccEEE Confidence 99999999888899999999999999987653221 112222 3456678999999999999999998887755 566 Q ss_pred EEeccCceeeecccc----ccCccEEEEEEEEEEEEEcCcceEEEEeCCCc Q lcl|Aclame:pro 225 LITKRDFFLEKDRDA----SRKSTALYSDKHYVAYLYDESKVVKITKGAGD 271 (274) Q Consensus 225 ~~~~~~~~ve~~r~~----~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~ 271 (274) ++.+.++.++..++. .++...+++..|+|+++.+|+++++++.++|+ T Consensus 335 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:19 335 VWDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRPTAIIKGTFSSGS 385 (385) T ss_pred EEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEeccCC Confidence 777888888765543 35677899999999999999999999999999 No 46 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=100.00 E-value=1.2e-36 Score=217.58 Aligned_cols=262 Identities=13% Similarity=0.083 Sum_probs=200.1 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274) ||.. +.+..++|+.+++.|++.+++.++++++++... 
.++..+++|++...+.+.|++||+++|+++++|++++ T Consensus 1 mat~--~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i~----~~~~~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f~~v~ 74 (311) T protein:vir:81 1 MVAL--ATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEP----QEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVT 74 (311) T ss_pred Ccee--cCCceEcchhHHHHHHHHHHhcchhhhhcceee----cCCCceEEEEEeCCceeEEeecCcccccccceeeEEE Confidence 8843 345789999999999999999999999886532 3344699999988889999999999999999999999 Q ss_pred EeehhhhcchhccHHHHhc---cCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc---c---------------cccCcc Q lcl|Aclame:pro 81 AKVRKIGKGTELTDEAVLS---GFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT---L---------------TVEADI 139 (274) Q Consensus 81 ~~~~~~~~~~~is~e~~~~---s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~---~---------------~~~~~~ 139 (274) +.++|++..+++|+|++++ +..++++.+.+++++++++++|+.++....... . ....+. T Consensus 75 l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~~~~~~~ 154 (311) T protein:vir:81 75 AIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTS 154 (311) T ss_pred EeeEEEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccccceeeeeccccc Confidence 9999999999999999864 446699999999999999999999996532111 0 011111 Q ss_pred cC-HHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcce---- Q lcl|Aclame:pro 140 TK-LDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGE---- 214 (274) Q Consensus 140 ~~-~d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~---- 214 (274) .. ++.+.++..++...+..++.|+|||.++..|++.++.+.. ....+....+..++++|+||++++.+|.+. 
T Consensus 155 ~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~---~l~~~~~~~~~~~tl~G~Pv~~~~~i~~~~~~~~ 231 (311) T protein:vir:81 155 ATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGR---KLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVT 231 (311) T ss_pred chHHHHHHHHHHHhhhcCCCceEEEEcHHHHHHHHhhhccCCC---eeecCccccCCCceecceeEEecccccccccccc Confidence 22 3456667777777777888999999999999987543321 122233455677899999999999998432 Q ss_pred --------------EEEEcCCeEEEEeccCceeeecccc---------ccCccEEEEEEEEEEEEEcCcceEEEEeCCCc Q lcl|Aclame:pro 215 --------------ALLAKKGAVKLITKRDFFLEKDRDA---------SRKSTALYSDKHYVAYLYDESKVVKITKGAGD 271 (274) Q Consensus 215 --------------~~l~~~~a~~~~~~~~~~ve~~r~~---------~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~ 271 (274) .++.+.+.+.+....++.++..++. .++...+|+..|+|++|.+|+++++|+.+.=. T Consensus 232 ~~~~~~~~~~~~~~~~~gDfs~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~a~~~ 311 (311) T protein:vir:81 232 ASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) T ss_pred cccchhcccCCccEEEEEecccEEEEEeccceEEEeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 3445555566667777777765553 24567899999999999999999999765433 No 47 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=100.00 E-value=1.2e-36 Score=217.53 Aligned_cols=268 Identities=15% Similarity=0.054 Sum_probs=203.4 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274) |+..++..+..++|+.+++.+++.+++.+.+.++++.. 
..++..++||++...+++.|++||+++|+++++|++++ T Consensus 14 ~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~----~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~~v~ 89 (320) T protein:vir:10 14 IAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKV----PMGTTGQKIPHWIGDVSAQWIGEGDMKPITKGNMTSQN 89 (320) T ss_pred hhccccccccccccHHHHHHHHHHHHhccchhhhccee----eccCCceEEEEEeCCcceEEecCCccccccccceeEEE Confidence 66555555556788889999999999999888887642 33456799999988889999999999999999999999 Q ss_pred EeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc-----------cccC-----cc-cCH- Q lcl|Aclame:pro 81 AKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL-----------TVEA-----DI-TKL- 142 (274) Q Consensus 81 ~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~-----------~~~~-----~~-~~~- 142 (274) +.++|++..+++|+|+++++.+++++++.+++++++++++|+.++..-.+... .... +. ..+ T Consensus 90 ~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 169 (320) T protein:vir:10 90 IAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALNGTDSPFPTYLAQTTKSVSLADPGGATASDLTAYD 169 (320) T ss_pred EeeEEEEEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCcccccccccccceecccccccccccHH Confidence 99999999999999999999999999999999999999999999854332110 0011 11 112 Q ss_pred HHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccc--cccccccccccccccccchhcceeeEEcCCCCcceEEE--E Q lcl|Aclame:pro 143 DGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDN--FTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALL--A 218 (274) Q Consensus 143 d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~--~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l--~ 218 (274) +.++++...+...+..+.+|+|||..+..|++.++.+ ++.......+......-++++|+||++++++|.++..+ . 
T Consensus 170 ~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~~~~~~~~~~~g 249 (320) T protein:vir:10 170 AVAVNGLSLLVNAKKKWTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDENSPFRAGRIVSRPTILSDHVADGTTVGYMG 249 (320) T ss_pred HHHHHHHhhhhcccCCCcEEEEcHHHHHHHHHhhccCCceeeccccccCccccccCceeeeeeeEecCCCCCCceEEEEe Confidence 2467777888888888999999999999998765432 22111111111112223579999999999999988643 3 Q ss_pred cCCeEEEEeccCceeeecccc----------------ccCccEEEEEEEEEEEEEcCcceEEEE-eCCCcc Q lcl|Aclame:pro 219 KKGAVKLITKRDFFLEKDRDA----------------SRKSTALYSDKHYVAYLYDESKVVKIT-KGAGDE 272 (274) Q Consensus 219 ~~~a~~~~~~~~~~ve~~r~~----------------~~~~~~i~~~~~~~~~v~~~~avv~l~-~~aa~~ 272 (274) +...+.+..+.++.++.+++. .+++..+++..|+|+++.+|+++++|+ .+||.| T Consensus 250 d~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~~~~a~~~l~~~~ap~~ 320 (320) T protein:vir:10 250 DFRNVIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFHNNDKDAFVKLTNVVTPDA 320 (320) T ss_pred ecceEEEEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEEEeeccEEecccceEEEEeccCCCC Confidence 445555677777777766553 235678899999999999999999999 455555 No 48 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=100.00 E-value=1.3e-36 Score=217.38 Aligned_cols=270 Identities=14% Similarity=0.052 Sum_probs=210.1 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274) |+..+++.+..++|+.+.+.|++.+++.+++.++++.. 
..++..++||++...+.+.|++||+++|+++++|++++ T Consensus 14 ~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~i~ 89 (318) T protein:vir:24 14 IAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKV----PMGTTGQKIPHWVGDVSAQWIGEGDMKPITKGNMTSQT 89 (318) T ss_pred hhcccCcccceeechhHHHHHHHHHHhhchhhhhccee----eccCCceEEEEEeCCcceEEecCCccccccccceeEEE Confidence 67666676777899999999999999999998888653 23456799999998889999999999999999999999 Q ss_pred EeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc--------------cccCcccCHHHHH Q lcl|Aclame:pro 81 AKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL--------------TVEADITKLDGLQ 146 (274) Q Consensus 81 ~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~--------------~~~~~~~~~d~iv 146 (274) ++++|++..+++|+|++.++.+++++++.+++++++++++|+.++....+... .........+.++ T Consensus 90 ~~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 169 (318) T protein:vir:24 90 IAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAMHGTDSPFPTYIGQTTKAISIADTTGATTVYDQVAV 169 (318) T ss_pred EeeEEEEEeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhhcccCCCCCcccccccccccccccccccchHHHHHH Confidence 99999999999999999999999999999999999999999999864432110 0111122335677 Q ss_pred HHHHHHhhcCCCccEEEEcHHHHHHHHhhhccc--cccccccccccccccccchhcceeeEEcCCCCcceEE--EEcCCe Q lcl|Aclame:pro 147 TAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDN--FTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEAL--LAKKGA 222 (274) Q Consensus 147 ~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~--~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~--l~~~~a 222 (274) ++...+...+..+.+|+|||..+..|++.++.+ ++.......+.......++++|+||++++++|.++.. +.+.+. 
T Consensus 170 ~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~~~~~~~~~~~gdfs~ 249 (318) T protein:vir:24 170 NGLSLLVNDGKKWTHTLLDDITEPILNGAKDQNGRPLFIESTYGEAASPFRSGRIVARPTILSDHVVEGTTVGFMGDFSQ 249 (318) T ss_pred HHHHhhccccCCCCEEEEcHHHHHHHHHhhccCCceeecCccccCccccccCceEEEEeeEEeCCCCCCccEEEEeecce Confidence 888888888888899999999999998765432 2211111111112222357999999999999988764 335555 Q ss_pred EEEEeccCceeeecccc----------------ccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 223 VKLITKRDFFLEKDRDA----------------SRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 223 ~~~~~~~~~~ve~~r~~----------------~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) +.+....++.++..++. .+++..+++..|+|+++.+|+++++|+..+|..-- T Consensus 250 ~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~a~~~~ 317 (318) T protein:vir:24 250 LIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAEAFVALTNVVSGGGE 317 (318) T ss_pred EEEEEecCeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeeccCCCC Confidence 66777888888776653 34667899999999999999999999997766655 No 49 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=100.00 E-value=2.8e-36 Score=215.53 Aligned_cols=264 Identities=12% Similarity=0.122 Sum_probs=212.6 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecC-CCCcccccCCCccccccccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTY-SGDAQVIAEGEKIPVDQIGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~-~~~a~~~~eg~~~~~~~~~~~~~ 79 (274) ....++..+..++|+.+++.|++.+++.+.+.++++.. ..++.++++|+... 
.+.+.|++||+++|+++++|+++ T Consensus 135 ~~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~f~~v 210 (418) T protein:vir:10 135 TVGSGVSGSNSLVVADRQAGIIAPPQRKMTIRDLLMPG----QTSSSSIEYTVETGFTNNAAAVAEGAQKPTSDLKFNLK 210 (418) T ss_pred hccCCCCCCccccchhHHHHHHHHHhhhhhHHhhccee----eccCCceeEEEEecCCCceeeeccCccccccccceeeE Confidence 33345555677999999999999999999998888653 23456689999765 46789999999999999999999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------------ccccCcccCHHHH Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT--------------LTVEADITKLDGL 145 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~--------------~~~~~~~~~~d~i 145 (274) ++.+++++..+++|++++.++ .++.+++.+++++++++++|+.++....++. .....+..+++++ T Consensus 211 ~~~~~k~~~~~~is~ell~ds-~~l~~~i~~~l~~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~i 289 (418) T protein:vir:10 211 NQPVRTIAHLFKASRQILDDA-PALQSYIDGRARYGLQLTEEGQILKGDGTGANILGILPQASAFMPSITLANATPIDKI 289 (418) T ss_pred EEeeeeEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccccccHHHH Confidence 999999999999999998877 6899999999999999999999986432211 0112234578999 Q ss_pred HHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEEcCCe-EE Q lcl|Aclame:pro 146 QTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGA-VK 224 (274) Q Consensus 146 v~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~a-~~ 224 (274) +++...+...+..+..|+|||.+|..|++..+.+. .....+ ..+|..++|+|+||++++++|.++.++.+.+. +. 
T Consensus 290 ~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G---~~i~~~-~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~s~~~~ 365 (418) T protein:vir:10 290 RLALLQAVLAEFPATGIVLNPIDWASIELTKDSQG---RYIVGN-PVNGTTPRLWNLPVVETQAMTANEFLVGAFSMAAQ 365 (418) T ss_pred HHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCC---ceeccc-cccCCCceecceeeEEcCCCCCCcEEEeeccceEE Confidence 99999999888888999999999999987653221 122222 34567789999999999999999988777654 55 Q ss_pred EEeccCceeeecccc----ccCccEEEEEEEEEEEEEcCcceEEEEeCCCccc Q lcl|Aclame:pro 225 LITKRDFFLEKDRDA----SRKSTALYSDKHYVAYLYDESKVVKITKGAGDEV 273 (274) Q Consensus 225 ~~~~~~~~ve~~r~~----~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~ 273 (274) ++.+.++.++.+++. .++.+.+++..|+|+++.+|+++|+++.++|.+= T Consensus 366 ~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~~~d~~~~~~~a~~~~~~~~~~~g 418 (418) T protein:vir:10 366 IFDRMEIEVLLSTENVDDFEKNMVSIRAEERLALAVYRPESFVTGALVEQAGG 418 (418) T ss_pred EEEecceEEEEecccchhhhcCceEEEEEEeeccEEecccceEEEEeccCCCC Confidence 666778888776654 3677889999999999999999999999988777 No 50 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=100.00 E-value=2.2e-36 Score=216.06 Aligned_cols=259 Identities=14% Similarity=0.150 Sum_probs=209.0 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecC-CCCcccccCCCccccccccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTY-SGDAQVIAEGEKIPVDQIGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~-~~~a~~~~eg~~~~~~~~~~~~~ 79 (274) +...+|..+..++|+.+++.|++.+++.+.+.++++.. ..++..+++|+... 
.+.+.|++||+++|+++++|+++ T Consensus 113 ~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~----~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i 188 (390) T protein:vir:97 113 ASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSG----RTDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKK 188 (390) T ss_pred hhcccccccccccchhhhHHHHHHHhhhhhhHhhccee----eccCCceEEEEEecCCcceeeecCCccccccccceeEE Confidence 55566666777888889999999999998888887643 23455689999865 36789999999999999999999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------------ccccCcccCHHHH Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT--------------LTVEADITKLDGL 145 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~--------------~~~~~~~~~~d~i 145 (274) ++.+++++..+++|++++.++ .++.+++.+++++++++++|+.++..-.++. ....++...++.+ T Consensus 189 ~~~~~k~~~~~~is~ell~ds-~~l~~~i~~~la~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~~~~~~d~~ 267 (390) T protein:vir:97 189 TDTTHVIAHTMKATRQILSDA-PQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQL 267 (390) T ss_pred EEeeeeEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHhhcCCCCccccceeeccccccccccccccchHHHH Confidence 999999999999999998887 6899999999999999999999985422111 1122345668899 Q ss_pred HHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEEcCC-eEE Q lcl|Aclame:pro 146 QTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKG-AVK 224 (274) Q Consensus 146 v~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~-a~~ 224 (274) +++...+...+..+..|+|||.+|..|++..+... .....+ ..++..++|+|+||++++.+|+++.++.+.+ ++. 
T Consensus 268 ~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G---~~l~~~-~~~~~~~~l~G~pV~~~~~~~~~~~~~gd~~~~~~ 343 (390) T protein:vir:97 268 RLAMLQASLAEYPASGIVINPIDWAAIELAKDANN---QYLIGN-ARGTLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQ 343 (390) T ss_pred HHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCC---ceeecC-ccCCCCceecceeeEEcCCCCCCcEEEEeccceEE Confidence 99999999999999999999999999987653221 111112 2345567999999999999999998887765 466 Q ss_pred EEeccCceeeecccc---ccCccEEEEEEEEEEEEEcCcceEEEEeC Q lcl|Aclame:pro 225 LITKRDFFLEKDRDA---SRKSTALYSDKHYVAYLYDESKVVKITKG 268 (274) Q Consensus 225 ~~~~~~~~ve~~r~~---~~~~~~i~~~~~~~~~v~~~~avv~l~~~ 268 (274) ++.+.++.++..++. .++...+++..|+|+++.+|+++|+++.+ T Consensus 344 ~~~~~~~~i~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 344 IFDQWDARVEIGYVNDDFQRNMVTVLAEERLALVVYRPEALITGSFA 390 (390) T ss_pred EEEecceEEEEeecccccccCcEEEEEEEeeccEEeccccEEEEEeC Confidence 778889998887653 34667789999999999999999999999 No 51 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=100.00 E-value=2.3e-36 Score=216.00 Aligned_cols=258 Identities=12% Similarity=0.067 Sum_probs=200.2 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274) ||. .+..++|+.+.+.|++.+++.+++.++++... 
.++..++||++...+.+.|++||+++|+++++|++++ T Consensus 1 ma~----~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~~----~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~v~ 72 (298) T protein:vir:94 1 MVL----NKGTLFDPELVTDLISKVAGKSSIARLSAQKP----IPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQT 72 (298) T ss_pred Cee----ccccccChhHHHHHHHHHHhhchhhhhcceee----ccCCceEEEEEecCcceEEeeCCccccccccceeEEE Confidence 763 33568899999999999999999888886532 3334589999988889999999999999999999999 Q ss_pred EeehhhhcchhccHHHHhcc---CccHHHHHHHHHHHHHHHHHHHHHHHHhccc---cc-----------------cccC Q lcl|Aclame:pro 81 AKVRKIGKGTELTDEAVLSG---FGDPQGEAVRQHGLAIANKVDNDVLEALKGA---TL-----------------TVEA 137 (274) Q Consensus 81 ~~~~~~~~~~~is~e~~~~s---~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a---~~-----------------~~~~ 137 (274) +.++|++..+++|+|+++++ ..++++++.+++++++++++|+.++...... .. .... T Consensus 73 l~~~k~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 152 (298) T protein:vir:94 73 MVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRG 152 (298) T ss_pred EeeeEEEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccccccccccccccccccccccc Confidence 99999999999999998644 4678899999999999999999999652110 00 0111 Q ss_pred cccCHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcc---- Q lcl|Aclame:pro 138 DITKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKG---- 213 (274) Q Consensus 138 ~~~~~d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~---- 213 (274) ....++++++++.++..++..+..|+|||.++..|++.++.+. 
.....+...+|..++|+|+||++++.+|.+ T Consensus 153 ~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G---~~l~~~~~~~~~~~tl~G~PV~~~~~v~~~~~~~ 229 (298) T protein:vir:94 153 IADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQG---NALFPELKWGATPDTINGLPVDVNKTVSDMSLTQ 229 (298) T ss_pred cccHHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHhhccCC---CeeecCcccCCCCceecceeeEEecccccccCCC Confidence 2234778999999999888888999999999999987654332 222234456677789999999999999853 Q ss_pred -eEEEEcCC--eEEEEeccCceeeecccc----------ccCccEEEEEEEEEEEEEcCcceEEEEeCC Q lcl|Aclame:pro 214 -EALLAKKG--AVKLITKRDFFLEKDRDA----------SRKSTALYSDKHYVAYLYDESKVVKITKGA 269 (274) Q Consensus 214 -~~~l~~~~--a~~~~~~~~~~ve~~r~~----------~~~~~~i~~~~~~~~~v~~~~avv~l~~~a 269 (274) ..++++.+ .+.+..+.++.++..++. .+++..+++..|+|+++.+|+++++|+.+. T Consensus 230 ~~~~~~Gdfs~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 230 RDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred ccEEEEeeccceEEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 23444433 345667777777665532 245667899999999999999999999888 No 52 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=100.00 E-value=2.8e-36 Score=215.54 Aligned_cols=267 Identities=13% Similarity=0.087 Sum_probs=203.9 Q ss_pred CC---------ccc------cchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccC Q lcl|Aclame:pro 1 MA---------QGT------TKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAE 65 (274) Q Consensus 1 ma---------~~~------T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~e 65 (274) || ..+ +.....++|+.+.+.|++.+++.+.++++++... 
.++...++|+....+.+.|++| T Consensus 1 ~a~l~el~~~~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~----~~~~~~~~p~~~~~~~a~~v~e 76 (333) T protein:vir:78 1 MATLNELLPNSAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIP----ISYGETIIPTTVKRPEVGQVGV 76 (333) T ss_pred CchhHHhhhhcccccccCceecCCccccchhHHHHHHHHHHhhchhhhhcceee----ccCCceEEEEEeCCceeEeecC Confidence 33 221 1122238999999999999999999988886532 3445689999988777777766 Q ss_pred C--------CcccccccccceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc----- Q lcl|Aclame:pro 66 G--------EKIPVDQIGTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT----- 132 (274) Q Consensus 66 g--------~~~~~~~~~~~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~----- 132 (274) | +.+|.++++|+++++.++|++..+++|+|++.++.+++.+++.+++++++++++|+.++..-.... T Consensus 77 g~~~~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~~~~~ 156 (333) T protein:vir:78 77 GTSNEQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFARMNPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTGSALQ 156 (333) T ss_pred cccccccccccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCCcccc Confidence 5 567888999999999999999999999999999999999999999999999999999985332111 Q ss_pred ---------------ccccCcccCHHHHHHHHHHHhhcC-CCccEEEEcHHHHHHHHhhhcccccccccccccccccccc Q lcl|Aclame:pro 133 ---------------LTVEADITKLDGLQTAIDKFNDED-LEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAF 196 (274) Q Consensus 133 ---------------~~~~~~~~~~d~iv~a~~~l~~~~-~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~ 196 (274) .....+..++++++++...+..++ ...+.|+|||..|..|++........+..........+.. 
T Consensus 157 g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i~~~~~~~~~~ 236 (333) T protein:vir:78 157 GIDTDNVIANTTNVDYLQETGDPLLDRLLDGYDLVSANTDVEFNGWAVDPRFRAHLLRAQAYRDANGNVDPSRINLAAQT 236 (333) T ss_pred cccccccccccccccccccccchhHHHHHHHHHhhccccccCceEEEEcchHHHHHHHHhhhcCCCCceeecCccccCCC Confidence 112234457899999998886654 4677899999999998765432222222223344566777 Q ss_pred chhcceeeEEcCCCCcc---------eEEEEcCCeEEEEeccCceeeecccc-------------ccCccEEEEEEEEEE Q lcl|Aclame:pro 197 GEALGAVIVRSNKLNKG---------EALLAKKGAVKLITKRDFFLEKDRDA-------------SRKSTALYSDKHYVA 254 (274) Q Consensus 197 ~~i~G~~Vv~s~~~p~~---------~~~l~~~~a~~~~~~~~~~ve~~r~~-------------~~~~~~i~~~~~~~~ 254 (274) ++|+|+||++++++|.+ ..++.+...+.+..+.++.++.+++. .+++..+|+..|+|+ T Consensus 237 ~~l~G~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~ 316 (333) T protein:vir:78 237 GDVLGLPAQFGRAVGGDLGAAVDSKTRIIGGDFSQLKFGFADEIRIKMSDTATLTDSGSATVSMWQTNQIAILIEVTFGW 316 (333) T ss_pred ceeeceeeEEccccCCCccccCCCccEEEEEecccEEEEEeeccEEEEeccccccccccceeehhhcCcEEEEEEEEEcc Confidence 89999999999999964 35556666676777788888776653 345677899999999 Q ss_pred EEEcCcceEEEEeCCCc Q lcl|Aclame:pro 255 YLYDESKVVKITKGAGD 271 (274) Q Consensus 255 ~v~~~~avv~l~~~aa~ 271 (274) ++.+|+++++|+++.|= T Consensus 317 ~v~~~~a~~~l~~~~a~ 333 (333) T protein:vir:78 317 LLGDKQAFVKFVDDEQP 333 (333) T ss_pred EEecccceEEEeccCCC Confidence 99999999999977777 No 53 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=100.00 E-value=3.9e-36 Score=214.70 Aligned_cols=265 Identities=14% Similarity=0.080 Sum_probs=209.5 Q ss_pred CCc-cccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCccccc------c Q lcl|Aclame:pro 1 MAQ-GTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVD------Q 73 (274) Q Consensus 1 
ma~-~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~------~ 73 (274) +.. .++..+..++|+.+++.|++.+++.+.+.++++.. ..++...++|+....+.+.|++|+...+.+ + T Consensus 161 ~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~a~~v~e~~~~~~~~~~~~~~ 236 (458) T protein:vir:10 161 VNQSSSVEVSSESYETIFSQRIIRDLQKELVVGALFEEL----PMSSKILTMLVEPDAGKATWVAASTYGTDTTTGEEVK 236 (458) T ss_pred hhhcccCccccceehhhHhHHHHHHHHhhhhHHhhccee----ecCCcceEEEEecCCcceeeccccccccccccccccc Confidence 211 22334667999999999999999999888887653 234566888988888889999999988764 5 Q ss_pred cccceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc-------------------cc Q lcl|Aclame:pro 74 IGTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT-------------------LT 134 (274) Q Consensus 74 ~~~~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~-------------------~~ 134 (274) ++|+++++.+++++..+++|++++.++.+++.+++.+++++++++++|..+|..-.+.. .. T Consensus 237 ~~~~~i~~~~~k~~~~v~is~ell~ds~~~~~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~ 316 (458) T protein:vir:10 237 GALKEIHFSTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMTGDGSGKPKGLLTLASEDSAKVVTEAKA 316 (458) T ss_pred ccceeeEeeeeeEEeeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceeeecccccccceeecccc Confidence 68999999999999999999999999999999999999999999999999985321110 11 Q ss_pred ccCcccCHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccc--cccccccccccccccccchhcceeeEEcCCCCc Q lcl|Aclame:pro 135 VEADITKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDN--FTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNK 212 (274) Q Consensus 135 ~~~~~~~~d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~--~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~ 212 (274) ...+..+|++|++++..+..++..+..|+|||.+|..|++..+.+ ++.... .......|..++|+|+||++++.+|. 
T Consensus 317 ~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~-~~~~~~~~~~~~l~G~pv~~~~~~p~ 395 (458) T protein:vir:10 317 DGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVG-NDSVKLQGQVGRIYGLPVVVSEYFPA 395 (458) T ss_pred cccccccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHhhcccCCceeeccc-cccccccCcCceecceeeEEcccccc Confidence 123346899999999999988888899999999999998765433 221111 12335566778999999999999997 Q ss_pred ce----EEEEcC-CeEEEEeccCceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeCCC Q lcl|Aclame:pro 213 GE----ALLAKK-GAVKLITKRDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAG 270 (274) Q Consensus 213 ~~----~~l~~~-~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa 270 (274) +. .++.+. ..+.++.+.++.++.+++...+.+.++...|+|+.+.+|+++|+.+++|+ T Consensus 396 ~~~~~~~~~~~f~~~~~~~~~~~~~v~~d~~~~~~~~~~~~~~r~~~~v~~~~a~v~~~~aa~ 458 (458) T protein:vir:10 396 KANSAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFANGVVSGTYAAS 458 (458) T ss_pred ccCCcceEEEEecccEEEEEeeceEEEeecccCCCceEEEEEEEecceEecccceEEEeeccC Confidence 43 233333 34667778888888888888888999999999999999999999999888 No 54 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=100.00 E-value=4.1e-36 Score=214.61 Aligned_cols=259 Identities=15% Similarity=0.162 Sum_probs=206.8 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecC-CCCcccccCCCccccccccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTY-SGDAQVIAEGEKIPVDQIGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~-~~~a~~~~eg~~~~~~~~~~~~~ 79 (274) +...++..+..++|+.+...+++.+++.+.+.++++... .++..+++|++.. 
.+.+.|++||+.+|..+++|+++ T Consensus 113 ~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~----~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i 188 (390) T protein:vir:81 113 ASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGR----TDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKK 188 (390) T ss_pred hccccccCCcceechhhhHHHHHHHhhhhhhhhhcceee----ccCCceEEEEEecCCcceeeecCCcccccccceeeEE Confidence 444455556667777788899999999998888876532 3455689999865 36789999999999999999999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------------ccccCcccCHHHH Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT--------------LTVEADITKLDGL 145 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~--------------~~~~~~~~~~d~i 145 (274) ++.+++++..+++|++++.++ +++.+++.+++++++++++|+.++....++. ....++...++.+ T Consensus 189 ~~~~~k~~~~~~is~ell~d~-~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~ 267 (390) T protein:vir:81 189 TDTTHVIAHTMKATRQILSDA-PQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQL 267 (390) T ss_pred EEeeeEEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccceeecccccccccccccchhHHHH Confidence 999999999999999999887 6899999999999999999999986432211 1122344678999 Q ss_pred HHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEEcCC-eEE Q lcl|Aclame:pro 146 QTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKG-AVK 224 (274) Q Consensus 146 v~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~-a~~ 224 (274) +++...+...+..+..|+|||.+|..|++..+... .....+ ...+..++++|+||++++.+|+++.++.+.+ ++. 
T Consensus 268 ~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G---~~l~~~-~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~ 343 (390) T protein:vir:81 268 RLAMLQASLAEYNPSGIVINPIDWAAIELAKDANN---QYLIGN-ARGTLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQ 343 (390) T ss_pred HHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCC---ceeecC-cccccCceecceeeEEcCCCCCCcEEEEehhceEE Confidence 99999999999999999999999999987653221 111222 2344557899999999999999998887765 466 Q ss_pred EEeccCceeeecccc---ccCccEEEEEEEEEEEEEcCcceEEEEeC Q lcl|Aclame:pro 225 LITKRDFFLEKDRDA---SRKSTALYSDKHYVAYLYDESKVVKITKG 268 (274) Q Consensus 225 ~~~~~~~~ve~~r~~---~~~~~~i~~~~~~~~~v~~~~avv~l~~~ 268 (274) ++.+.++.++.+++. .++...+++..|+|+++.+|+++|+++.+ T Consensus 344 ~~~~~~~~v~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 344 IFDQWDARVEIGYVGEDFQRNMITVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred EEEecceEEEEecccchhhcCcEEEEEEEeeccEEecccceEEEEeC Confidence 677788888887653 34667899999999999999999999999 No 55 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=100.00 E-value=7.3e-36 Score=213.20 Aligned_cols=264 Identities=11% Similarity=0.003 Sum_probs=208.9 Q ss_pred CCc-cccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCC--CCcccccCCCcccccccccc Q lcl|Aclame:pro 1 MAQ-GTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYS--GDAQVIAEGEKIPVDQIGTS 77 (274) Q Consensus 1 ma~-~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~--~~a~~~~eg~~~~~~~~~~~ 77 (274) .+. .+++....++|+.++..|++.+...+.+.++++... ..+.+++||+.... 
+.+.|++||+.+|+.+++|+ T Consensus 106 ~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~f~ 181 (379) T protein:vir:10 106 VGDMTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGAVS----ISGGTYTFVRENGAGEGAIGAQVEGATKGQKDYDIS 181 (379) T ss_pred hcccccCCCCccccchhhhhHHHHhHHhhhhHHhhceeee----ccCCceEEEEeecCCCcccccccCCcccccccccee Confidence 222 223333457899999999999999888888886532 34556899987533 45678999999999999999 Q ss_pred eeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc---cccCcccCHHHHHHHHHHHhh Q lcl|Aclame:pro 78 KREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL---TVEADITKLDGLQTAIDKFND 154 (274) Q Consensus 78 ~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~---~~~~~~~~~d~iv~a~~~l~~ 154 (274) ++++.+++++..+++|++++.++ +++.+++.+++++.+++++|..+++.+.+... ....+..++|.+++++..+.. T Consensus 182 ~i~~~~~k~~~~~~iS~ell~D~-~~l~~~i~~~la~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~d~i~~~~~~~~~ 260 (379) T protein:vir:10 182 MIDVNTDFIAGFTRYSKKMANNL-PFLTSFIPNALRRDYAKAENAAFNAVLAANATASTEIITNKNKVEMLINEIAKQEN 260 (379) T ss_pred eeEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccCcccHHHHHHHHHhhhh Confidence 99999999999999999999887 57999999999999999999999887665432 233455678999999999999 Q ss_pred cCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEEcCCeEEEEeccCceee Q lcl|Aclame:pro 155 EDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGAVKLITKRDFFLE 234 (274) Q Consensus 155 ~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~~~~~~~ve 234 (274) .+..++.|+|||.+|..|++.++.+.....+. 
+....+|...+++|+||++++.+|+|+.++.+.+.+....++++.++ T Consensus 261 ~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~-~~~~~~~~~~~l~G~pvv~s~~~~ag~~~~gdf~~~~~~~~~~~~i~ 339 (379) T protein:vir:10 261 LDFPVTAIVLRPTDYYDILVTQKSVGAGYGLP-GVVTQDNGVLRINGIPLFRATWLAANKYYVGDWTRVTKVTTEGLSLE 339 (379) T ss_pred ccCCCCEEEEcHHHHHHHHHhhccCCceeccC-CccCCCCCcceecceeeEecCCCCCCceEEeecccEEEEEEeceEEE Confidence 99999999999999999987654332111111 11123455668999999999999999988877777666677777777 Q ss_pred ecccc----ccCccEEEEEEEEEEEEEcCcceEEEEeCCC Q lcl|Aclame:pro 235 KDRDA----SRKSTALYSDKHYVAYLYDESKVVKITKGAG 270 (274) Q Consensus 235 ~~r~~----~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa 270 (274) ..++. .++.+.+++..|+|++|.+|+++|+++.++= T Consensus 340 ~~~~~~~~f~~~~~~~r~~~R~~~~v~~p~a~v~~~~~~~ 379 (379) T protein:vir:10 340 FSEVEGTNFVKNNITARIEAQVALAVEQPAALIFGDFTAV 379 (379) T ss_pred EeecccccccCCcEEEEEEEEeccEEecCccEEEEEecCC Confidence 65543 4567789999999999999999999999987 No 56 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=100.00 E-value=6.5e-36 Score=213.51 Aligned_cols=258 Identities=12% Similarity=0.095 Sum_probs=199.6 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274) || ++.+ .++|+.++..|++.+++.+.+.+++.... 
..+..++||++...+.+.|++||+++|+++++|++++ T Consensus 1 ma---~~gG-~lvp~~~~~~ii~~~~~~s~i~~l~~~~~----~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~f~~v~ 72 (298) T protein:vir:16 1 MV---LNKG-TLFDPTLVTDLISKVAGKSSIARLSAQKP----IPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQT 72 (298) T ss_pred Cc---ccCc-ceechhHHHHHHHHHHhhhhhhhhcceee----ccCCceEEEEEecCcceEEecCCccccccccceeEEE Confidence 88 3333 46777789999999999999888886532 3334589999988889999999999999999999999 Q ss_pred EeehhhhcchhccHHHHhc---cCccHHHHHHHHHHHHHHHHHHHHHHHHhc---cccc-----------------cccC Q lcl|Aclame:pro 81 AKVRKIGKGTELTDEAVLS---GFGDPQGEAVRQHGLAIANKVDNDVLEALK---GATL-----------------TVEA 137 (274) Q Consensus 81 ~~~~~~~~~~~is~e~~~~---s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~---~a~~-----------------~~~~ 137 (274) +.++|++..+++|+|++++ +..++++++.+++++++++++|+.++.... +... .... T Consensus 73 l~~~k~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 152 (298) T protein:vir:16 73 MVPIKVEYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRG 152 (298) T ss_pred EeeeeEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcccccccccccccccccccccccc Confidence 9999999999999999964 456899999999999999999999996531 1110 0111 Q ss_pred cccCHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcc---- Q lcl|Aclame:pro 138 DITKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKG---- 213 (274) Q Consensus 138 ~~~~~d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~---- 213 (274) ....++++++++..+..++..+..|+|||.++..|++.++.+. 
.....+....|..++|+|+||++++.+|.+ T Consensus 153 ~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G---~~i~~~~~~~~~~~~l~G~PV~~~~~v~~~~~~~ 229 (298) T protein:vir:16 153 IADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQD---NALFPELKWGATPDTINGLPVDVNKTVSDMSLTQ 229 (298) T ss_pred cccHHHHHHHHHHHhhhcCCCccEEEEcHHHHHHHHHhhccCC---CeeecCcccCCCCceecceeeEEecccccccCCC Confidence 1223678999999999888888999999999999987754332 222234456677789999999999999863 Q ss_pred -eEEEEcC--CeEEEEeccCceeeecccc----------ccCccEEEEEEEEEEEEEcCcceEEEEeCC Q lcl|Aclame:pro 214 -EALLAKK--GAVKLITKRDFFLEKDRDA----------SRKSTALYSDKHYVAYLYDESKVVKITKGA 269 (274) Q Consensus 214 -~~~l~~~--~a~~~~~~~~~~ve~~r~~----------~~~~~~i~~~~~~~~~v~~~~avv~l~~~a 269 (274) ..++++. .++.+..+.++.++..++. .+++..+++.+|+|+++.+|+++++|+.+- T Consensus 230 ~~~~~~GDfs~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 230 RDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred ccEEEEeeccceEEEEEecCceEEEeeccCCcCcchhhhhcCcEEEEEEEEEccEeecccceEEEeecC Confidence 2344443 3455666777776655432 236678999999999999999999998887 No 57 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=100.00 E-value=6.6e-36 Score=213.47 Aligned_cols=259 Identities=14% Similarity=0.150 Sum_probs=202.6 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCC-CCcccccCCCccccccccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYS-GDAQVIAEGEKIPVDQIGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~-~~a~~~~eg~~~~~~~~~~~~~ 79 (274) +...++..+..++|+.+.+.+++.+++.+.+.++++.. ..++..+++|++... 
+.+.|++||+.+|+.+++|+++ T Consensus 113 ~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i 188 (390) T protein:vir:10 113 ASTDAAGSAGALTTPNRLPGFITQPDARLTVRDLIGSG----RTDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKK 188 (390) T ss_pred hhcccccccccccchhHHHHHHHHHHhhchhhhhccee----eccCCceEEEEEecCCcceeeecCCccccccccceeEE Confidence 33333444445566667788999999988888887653 234556899998753 6789999999999999999999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------------ccccCcccCHHHH Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT--------------LTVEADITKLDGL 145 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~--------------~~~~~~~~~~d~i 145 (274) ++.+++++..+++|++++.++ +++.+++.+++++++++++|+.++..-.++. ....++...++.+ T Consensus 189 ~~~~~k~~~~~~is~ell~d~-~~l~~~i~~~l~~~~~~~~~~~il~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~ 267 (390) T protein:vir:10 189 TDTTHVIAHTMKATRQILSDA-PQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQL 267 (390) T ss_pred EEeeEEEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHhhcCCCCccccccccccccccccccccccchHHHH Confidence 999999999999999998887 5899999999999999999999985422111 1122344568999 Q ss_pred HHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEEcCC-eEE Q lcl|Aclame:pro 146 QTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKG-AVK 224 (274) Q Consensus 146 v~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~-a~~ 224 (274) +++...+...+..+..|+|||.+|..|++..+.+. .....+. ..+..++++|+||++++.+|.++.++.+.+ ++. 
T Consensus 268 ~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~g---~~l~~~~-~~~~~~~l~G~pv~~~~~~p~~~~~~gdf~~~~~ 343 (390) T protein:vir:10 268 RLAMLQASLAEYPASGIVINPIDWAAIELAKDANN---QYLIGNA-RGTLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQ 343 (390) T ss_pred HHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCC---ceeecCC-cCcCCceecceeeEEcCCCCCCcEEEEeccceEE Confidence 99999999999999999999999999987654321 1122222 234456899999999999999998877765 455 Q ss_pred EEeccCceeeecccc---ccCccEEEEEEEEEEEEEcCcceEEEEeC Q lcl|Aclame:pro 225 LITKRDFFLEKDRDA---SRKSTALYSDKHYVAYLYDESKVVKITKG 268 (274) Q Consensus 225 ~~~~~~~~ve~~r~~---~~~~~~i~~~~~~~~~v~~~~avv~l~~~ 268 (274) ++.+.++.++.+++. .++...+++..|+|+++.+|+|+|+++.+ T Consensus 344 ~~~~~~~~i~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 344 IFDQWDARVEIGYVNDDFQRNMVTVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred EEEecceEEEEeecccccccCcEEEEEEEeeccEEeccccEEEEEeC Confidence 677788888876543 34677889999999999999999999999 No 58 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=100.00 E-value=1e-35 Score=212.42 Aligned_cols=267 Identities=11% Similarity=0.073 Sum_probs=212.1 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecC-CCCcccccCCCcccc-cccccce Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTY-SGDAQVIAEGEKIPV-DQIGTSK 78 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~-~~~a~~~~eg~~~~~-~~~~~~~ 78 (274) |+..++..+..++|+.+++.|++.+++.+.+.++++.... .+. ..+..||++.. 
.+.++|++||+++++ ++++|++ T Consensus 5 ~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~-~~~-~g~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~~ 82 (293) T protein:vir:48 5 KTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENV-TTL-TGSRVYEKWTDITGLANIDDEAGKIADIDDPKLSL 82 (293) T ss_pred ecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeeec-cCC-cceEEEEeecCCCcceeeecCCcccccccccceeE Confidence 8888888888999999999999999999999888765321 112 23477877753 467899999999997 5789999 Q ss_pred eEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcccCHHHHHHHHHHHhhcCCC Q lcl|Aclame:pro 79 REAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAIDKFNDEDLE 158 (274) Q Consensus 79 ~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~~~d~iv~a~~~l~~~~~~ 158 (274) +++.++|++..+++|+|+++++.+++++++.+++++++++++|+.+++.+.+... ..+.++||+|+++...+..++.. T Consensus 83 i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~--~~~~~~~d~i~~~~~~l~~~~~~ 160 (293) T protein:vir:48 83 IKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGVVDKLPT--KPTLTKWDDIIDLEAKVDPAIKQ 160 (293) T ss_pred EEEeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhccccccc--cccccCHHHHHHHHHhhhhhhcC Confidence 9999999999999999999999999999999999999999999999987765443 34567899999999999988888 Q ss_pred ccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcC--CCCcce----EEEEcC-C-eEEEEeccC Q lcl|Aclame:pro 159 PMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSN--KLNKGE----ALLAKK-G-AVKLITKRD 230 (274) Q Consensus 159 ~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~--~~p~~~----~~l~~~-~-a~~~~~~~~ 230 (274) ...|+|||.++..|++.++.+. .....+.+.+|..++|+|+||++.+ .+|..+ .++++. 
+ ++.+..+.+ T Consensus 161 ~a~~vmn~~~~~~L~~lkd~~g---~~l~~~~~~~~~~~~l~G~Pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~ 237 (293) T protein:vir:48 161 TSFFLTNTSGFTALKKVKNALG---DYLMERDVKSPTGYSIAGFAVKEISDRWLPNASSGVMPLYFGDLKQAVTLFDRQQ 237 (293) T ss_pred CCEEEEcHHHHHHHHHhhccCC---ceEeecCcCCCCCceecceeeEEecccccCCccCCceEEEEEeccceEEEEEecc Confidence 8899999999999987654322 2222334566777899999998754 344322 244443 3 456667778 Q ss_pred ceeeeccc----cccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 231 FFLEKDRD----ASRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 231 ~~ve~~r~----~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) +.++.+++ ..+++..+++..|+|+++.+|+++++++.+++.+=- T Consensus 238 ~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~~~ 285 (293) T protein:vir:48 238 MSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIADQK 285 (293) T ss_pred eEEEEecccchhhhcCeEEEEEEEeeCcEEecccceEEEEeeccccCC Confidence 88877653 346778899999999999999999999977654433 No 59 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=100.00 E-value=9.4e-36 Score=212.60 Aligned_cols=261 Identities=11% Similarity=0.114 Sum_probs=206.0 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecC-CCCcccccCCCccccccccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTY-SGDAQVIAEGEKIPVDQIGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~-~~~a~~~~eg~~~~~~~~~~~~~ 79 (274) +...++..+..++|+.++..|++.+++.+.+.++++... .++..+++|+... 
.+.+.|++||+.+|+++++|+++ T Consensus 113 ~~~~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~----~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~i 188 (395) T protein:vir:43 113 AITSIDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGT----TESNSVEYVRETGFVNNAAPVSEGTQKPYSDLTFELE 188 (395) T ss_pred hhcccCCCCccccchhhHHHHHHHHHhhhhHHhhcccee----cCCCceEEEEEecCCCceeeecCCccccccccceeEE Confidence 223444445567888899999999999999988887532 3455689999755 46889999999999999999999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc----------------cccCcccCHH Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL----------------TVEADITKLD 143 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~----------------~~~~~~~~~d 143 (274) ++++++++..+++|++++.++ .++..++.+++++++++.+|..++........ ....+...++ T Consensus 189 ~~~~~k~~~~~~is~ell~d~-~~l~~~v~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~ 267 (395) T protein:vir:43 189 NAPVRTIAHLFKASRQILDDA-SALQSYIDARARYGLMLVEECQLLYGNGTGANLHGIIPQAQAYAPPSGVVVTAEQRID 267 (395) T ss_pred EEeeeeEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccccchhHH Confidence 999999999999999998876 57999999999999999999999864221100 0112234589 Q ss_pred HHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEEcCCe- Q lcl|Aclame:pro 144 GLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGA- 222 (274) Q Consensus 144 ~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~a- 222 (274) .++++...+..++..+.+|+|||.++..|++..+.+ +.....+ ..+|..++++|+||++++.+|.++.++.+.+. 
T Consensus 268 ~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~---G~~i~~~-~~~~~~~~l~G~pVv~~~~~~~~~~~~gd~~~~ 343 (395) T protein:vir:43 268 RIRLAILQAQLAEFPASGIVLNPIDWALIELNKDAE---NRYIIGS-PQNGTTPTLWRLPVVETQAITQDEFLTGAFSLG 343 (395) T ss_pred HHHHHHHhhccccCCCcEEEEcHHHHHHHHHhhccC---Cceeccc-cccCCCceecceeeEEcCCCCCCcEEEEeccce Confidence 999999999988888899999999999998765322 1112222 34566789999999999999999988777554 Q ss_pred EEEEeccCceeeecccc----ccCccEEEEEEEEEEEEEcCcceEEEEeCCC Q lcl|Aclame:pro 223 VKLITKRDFFLEKDRDA----SRKSTALYSDKHYVAYLYDESKVVKITKGAG 270 (274) Q Consensus 223 ~~~~~~~~~~ve~~r~~----~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa 270 (274) +.+..+.++.++.+++. .++...+++..|+|+++.+|+++|+++.++| T Consensus 344 ~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~taa 395 (395) T protein:vir:43 344 AQIFDRMDIEVLVSTENDKDFENNMVTIRAEERLAFAVYRPEAFVTGSLTAS 395 (395) T ss_pred EEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEeccC Confidence 44556667777766543 3567789999999999999999999999999 No 60 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=100.00 E-value=1.3e-35 Score=211.81 Aligned_cols=265 Identities=13% Similarity=0.120 Sum_probs=206.9 Q ss_pred CCc-cccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEee--------cCCCCcccccCCCcccc Q lcl|Aclame:pro 1 MAQ-GTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAF--------TYSGDAQVIAEGEKIPV 71 (274) Q Consensus 1 ma~-~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~--------~~~~~a~~~~eg~~~~~ 71 (274) |.. .++.....++|+.+.+.+.......+.+++++.... ..++.+++|+. 
...+.+.|++||+.+|+ T Consensus 123 ~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~ 198 (419) T protein:vir:94 123 APAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQN----ADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQ 198 (419) T ss_pred cccccccCCcccccchhhhHHHHHHHhhhhhhhhcceeee----ccCCceeeeeeccccccccccCcccceecCCccccc Confidence 222 234445578999999999988888888888876532 23445666654 33456889999999999 Q ss_pred cccccceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc------------------c Q lcl|Aclame:pro 72 DQIGTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT------------------L 133 (274) Q Consensus 72 ~~~~~~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~------------------~ 133 (274) ++++|+++++.+++++..+++|++++.++ .++.+++.+++++++++++|+.+|.+-.+.. . T Consensus 199 ~~~~~~~i~~~~~k~~~~~~is~ell~d~-~~l~~~i~~~la~a~~~~~d~aii~G~G~~~p~Gi~~~~~~~~~~~~~~~ 277 (419) T protein:vir:94 199 STLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPT 277 (419) T ss_pred cccceeeEEeeeeeEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceecccccccccccccc Confidence 99999999999999999999999999877 6899999999999999999999985322110 0 Q ss_pred cccCcccCHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcc Q lcl|Aclame:pro 134 TVEADITKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKG 213 (274) Q Consensus 134 ~~~~~~~~~d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~ 213 (274) ........++++++++..+...+..+..|+|||.+|..|++..+... 
......+...+|..++|+|+||++++++|++ T Consensus 278 ~~~t~~~~~~~l~~~~~~~~~~~~~~~~~v~n~~~~~~l~~~k~~~~--~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~ 355 (419) T protein:vir:94 278 APATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGS--GVFRVIANVQGEATPRIWGLNVVSTVAIAQG 355 (419) T ss_pred cccccchhHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHHhhcCC--CceeecCCcccCCCccccceeeEEcCCCCCc Confidence 11123345899999999999988899999999999999987654321 1112223355677789999999999999999 Q ss_pred eEEEEcCC-eEEEEeccCceeeecccc----ccCccEEEEEEEEEEEEEcCcceEEEEeCCCcc Q lcl|Aclame:pro 214 EALLAKKG-AVKLITKRDFFLEKDRDA----SRKSTALYSDKHYVAYLYDESKVVKITKGAGDE 272 (274) Q Consensus 214 ~~~l~~~~-a~~~~~~~~~~ve~~r~~----~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~ 272 (274) +.++.+.. ++.++.+.++.++.+++. .++...+++..|+|+++.+|+++|+++.+++++ T Consensus 356 ~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~~aa~~ 419 (419) T protein:vir:94 356 TALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) T ss_pred cEEEeeccceEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEeccccEEEEEeccCCC Confidence 98876655 455667778888776654 367788999999999999999999999999999 No 61 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=100.00 E-value=1.3e-35 Score=211.91 Aligned_cols=269 Identities=13% Similarity=0.099 Sum_probs=202.5 Q ss_pred CC---------cc------ccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCC-------- Q lcl|Aclame:pro 1 MA---------QG------TTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYS-------- 57 (274) Q Consensus 1 ma---------~~------~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~-------- 57 (274) || +. .+.....++|+.+++.|++.+++.+.+.++++. ...++..++||++... 
T Consensus 1 ~~~~~e~~~~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~----~~~~~~~~~ip~~~~~~~a~~v~~ 76 (338) T protein:vir:78 1 MATLNELAPNTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGEN----IPISYGETIIPTTVKRPEVGQVGV 76 (338) T ss_pred CcchHHhhhhhcccccccceecccccccchHHHHHHHHHHHhhchhhhhcce----eeccCCceEEEEEecCccceeecc Confidence 32 21 122233489999999999999999999998865 2345667999997643 Q ss_pred CCcccccCCCcccccccccceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc---- Q lcl|Aclame:pro 58 GDAQVIAEGEKIPVDQIGTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL---- 133 (274) Q Consensus 58 ~~a~~~~eg~~~~~~~~~~~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~---- 133 (274) +.+.|++||+++|+++++|++++++++|++..+++|+|++.++.+++++++.+++++++++++|+.++........ T Consensus 77 ~~~~~~~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~~~ 156 (338) T protein:vir:78 77 GTSNEQREGGTKPLSGTAWDTRSVAPIKLATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTGSALQ 156 (338) T ss_pred cccccccccccccccccceeEEEEEEEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccc Confidence 4466778999999999999999999999999999999999999999999999999999999999999964432110 Q ss_pred ------------c----ccCcccCHHHHHHHHHHHhhc-CCCccEEEEcHHHHHHHHhhhcccccccccccccccccccc Q lcl|Aclame:pro 134 ------------T----VEADITKLDGLQTAIDKFNDE-DLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAF 196 (274) Q Consensus 134 ------------~----~~~~~~~~d~iv~a~~~l~~~-~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~ 196 (274) + .......++.+.++...+..+ ......|+|||..+..|++........+.....+....|.. 
T Consensus 157 gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~~~~l~d~~g~~l~~~~~~~~~~ 236 (338) T protein:vir:78 157 GIDTNNVIVNTTNVDYLQTGTTPLLDRFLDGYDLVSANTDVDFNGWAADPRYRARLLRSQAYRDANGNVDPTRINLAASA 236 (338) T ss_pred ccccccccccccccccccccchhhHHHHHHHHHHhhhhccccceEEEEchHHHHHHHHHhhhccCCCceeecccccCCCC Confidence 0 011223578888888877543 45677899999999988654321111122222333566777 Q ss_pred chhcceeeEEcCCCCc---------ceEEEEcCCeEEEEeccCceeeecccc----------------ccCccEEEEEEE Q lcl|Aclame:pro 197 GEALGAVIVRSNKLNK---------GEALLAKKGAVKLITKRDFFLEKDRDA----------------SRKSTALYSDKH 251 (274) Q Consensus 197 ~~i~G~~Vv~s~~~p~---------~~~~l~~~~a~~~~~~~~~~ve~~r~~----------------~~~~~~i~~~~~ 251 (274) ++|+|+||++++++|. +.+++.+.+.+.+..+.++.++..++. .+++..+|+..| T Consensus 237 ~~l~G~PV~~~~~ip~~~~~~~~~~~~~~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r 316 (338) T protein:vir:78 237 GDLLGLPVQFGKAVGGDLGAATDSKVRVVGGDFSQLKYGFADEIRVKMSDTATLTDNTSPTPQTVSMWQTNQIAILIEVT 316 (338) T ss_pred ceeeeeeEEEccccCccccccCCcccEEEEEecceEEEEeecccEEEEeecccccccccccccchhhhhcCcEEEEEEEE Confidence 8999999999999984 224555666676777888888776653 245577899999 Q ss_pred EEEEEEcCcceEEEEeCCCccc Q lcl|Aclame:pro 252 YVAYLYDESKVVKITKGAGDEV 273 (274) Q Consensus 252 ~~~~v~~~~avv~l~~~aa~~~ 273 (274) +|+++++|+++++|+++.+... 
T Consensus 317 ~d~~v~~~~a~~~l~~~~~~~~ 338 (338) T protein:vir:78 317 FGWLLGDKQAFVKFVDDEDPDA 338 (338) T ss_pred eccEeecccceEEEecccCCCC Confidence 9999999999999999876666 No 62 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=100.00 E-value=1.3e-35 Score=211.81 Aligned_cols=268 Identities=14% Similarity=0.060 Sum_probs=199.5 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274) |... |..+..++|+.+++.+++.+++.+.+.++++.. ..++...++|+....+.+.|++||+++|+++++|++++ T Consensus 20 ~~~~-~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~----~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~i~ 94 (326) T protein:vir:42 20 AQTG-DSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKI----PMGTTGQKIPHWTGDVSASWIGEGDMKPITKGNMTSQT 94 (326) T ss_pred eecc-ccCCcceechhhHHHHHHHHHhcchhhhhccee----eccCCceEEEEEeCCcceEEecCCccccccccceeEEE Confidence 3222 333344788889999999999998888887653 23456799999988889999999999999999999999 Q ss_pred EeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc------------cc-----ccCcccCHH Q lcl|Aclame:pro 81 AKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT------------LT-----VEADITKLD 143 (274) Q Consensus 81 ~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~------------~~-----~~~~~~~~d 143 (274) +.+++++..+++|+|++.++.+++++++.+++++++++++|+.++..-.+.. .. 
...+..+++ T Consensus 95 ~~~~k~~~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~~p~gi~~~~~~~~~~~~~~~~~~~~~~~~ 174 (326) T protein:vir:42 95 IAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAINGTDSPFPTFLAQTTKEVSLVDPDGTGSNADLTVY 174 (326) T ss_pred EeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccccccccceeecccccccccchhH Confidence 9999999999999999999999999999999999999999999985432110 00 011112222 Q ss_pred H--HHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccc--cccccccccccccccccchhcceeeEEcCCCCcceEEE-E Q lcl|Aclame:pro 144 G--LQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDN--FTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALL-A 218 (274) Q Consensus 144 ~--iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~--~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l-~ 218 (274) + +.++...+...+.....|+|||..+..|++.++.+ ++.......+.......++++|+||++++.+|.++.++ + T Consensus 175 ~~~~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~ 254 (326) T protein:vir:42 175 DAVAVNALSLLVNAGKKWTHTLLDDITEPILNGAKDKSGRPLFIESTYTEENSPFRLGRIVARPTILSDHVASGTVVGYQ 254 (326) T ss_pred HHHHHHHHhhhhhhccCccEEEEeHHHHHHHHHhhccCCceeeccccccCccccccCceeeeeeEEEcCCCCCCceEEEE Confidence 2 34555566666778889999999999998765432 22111111111222234579999999999999998764 3 Q ss_pred -cCCeEEEEeccCceeeecccc----------------ccCccEEEEEEEEEEEEEcCcceEEEEeCCCccc Q lcl|Aclame:pro 219 -KKGAVKLITKRDFFLEKDRDA----------------SRKSTALYSDKHYVAYLYDESKVVKITKGAGDEV 273 (274) Q Consensus 219 -~~~a~~~~~~~~~~ve~~r~~----------------~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~ 273 (274) +...+.+....++.++..++. 
.+++..+++..|+|+++.+|+++++|+.+++..- T Consensus 255 Gd~s~~~~~~~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~~d~~v~~~~a~~~l~~~~~~~~ 326 (326) T protein:vir:42 255 GDFRQLVWGQVGGLSFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYAFHCNDKDAFVKLTNVDATEA 326 (326) T ss_pred eecceEEEEEecceEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEecccceEEEeeccccCC Confidence 334455666667776655443 2456789999999999999999999999888877 No 63 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=100.00 E-value=1.7e-35 Score=211.26 Aligned_cols=262 Identities=15% Similarity=0.106 Sum_probs=209.3 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccc-ccccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPV-DQIGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~-~~~~~~~~ 79 (274) |...++..+..++|+.+++.|++.+++.+.+.++++.... .+ ....+.+|+....+.+.|++||+.++. ++++|+++ T Consensus 91 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~-~~-~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i 168 (371) T protein:vir:81 91 MSEGSNQDGGYTVPQDIQTRINELRESKDALQNLITVEPV-TT-LSGSRVFKKRSQQTGFVEVAEGAAIGEKATPQFTLL 168 (371) T ss_pred hccCCCccCceeecHhHHHHHHHHHHhhhhhhhhceeeec-cC-CceeEEEEeecCCcceeeeccccccccccccceeeE Confidence 8778888888999999999999999999999888765321 11 122466777766678999999999986 57999999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcccCHHHHHHHHH-HHhhcCCC Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAID-KFNDEDLE 158 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~~~d~iv~a~~-~l~~~~~~ 158 (274) ++++++++..+++|+|++.++.+++.+++.+++++++++.+|+.++....+.. ..+..+++++.++.. .|...+.. 
T Consensus 169 ~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~a~~~~~~~~i~~g~g~~~---~~~~~~~~~i~~~~~~~l~~~~~~ 245 (371) T protein:vir:81 169 QYQVKKYAGFFRVTNELLNDSTEAIVNTLVRWIGDESRVTRNGLIINVLNTKA---KTAIADLDGLKQIINVQLDPVFRS 245 (371) T ss_pred EeeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc---ccccccHHHHHHHHHhhcchhhhc Confidence 99999999999999999999999999999999999999999999988655432 344567899988774 56666677 Q ss_pred ccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcce-----------EEEEcCC--eEEE Q lcl|Aclame:pro 159 PMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGE-----------ALLAKKG--AVKL 225 (274) Q Consensus 159 ~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~-----------~~l~~~~--a~~~ 225 (274) +..|+|||.+|..|++..+.+ +.....+.+.+|..++|+|+||++++++|.+. .++|+.. .+.+ T Consensus 246 ~a~~vmn~~~~~~L~~lkd~~---g~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~~~i~~Gd~~~~~~~ 322 (371) T protein:vir:81 246 TSSVIVNQDAFNWLDTLKDQN---GQYLLQPSISSPTGRQLLGLPVVIVSNKVLANRVDGGTGAQFAPIIVGDLKEAVVM 322 (371) T ss_pred CCEEEEcHHHHHHHHHhhccC---CCeeeecccCCCCCceecceeEEEecccccCccccccccCCcceEEEEehhceEEE Confidence 889999999999998765432 12222333566777899999999999998542 2445432 3555 Q ss_pred EeccCceeeecccc----ccCccEEEEEEEEEEEEEcCcceEEEEeCCC Q lcl|Aclame:pro 226 ITKRDFFLEKDRDA----SRKSTALYSDKHYVAYLYDESKVVKITKGAG 270 (274) Q Consensus 226 ~~~~~~~ve~~r~~----~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa 270 (274) ..+.++.++.+++. 
.+++..+++..|+|+++.+|+++++++.++| T Consensus 323 ~~~~~~~i~~~~~~~~~f~~~~v~~~~~~r~d~~~~~~~a~~~~~~~~A 371 (371) T protein:vir:81 323 FDRQRTEIMSSNVAMDAFETDATLWRAIERMDVKMRDDEAFVFGEVQLA 371 (371) T ss_pred EeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEEecC Confidence 66778888776654 3577899999999999999999999999999 No 64 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=100.00 E-value=7.9e-36 Score=213.04 Aligned_cols=265 Identities=11% Similarity=0.119 Sum_probs=215.5 Q ss_pred CC--ccccchhhccchHHHHHHHHHHHHHh--hhhccccccccccc---ccCCCEEEEEeecCC-CCcccccCCC---cc Q lcl|Aclame:pro 1 MA--QGTTKVSNLIVPEVLAPMMQAELDKK--LRFAQFADIDSTLV---GQPGDTLTFPAFTYS-GDAQVIAEGE---KI 69 (274) Q Consensus 1 ma--~~~T~~~~~~iPe~~~~~v~~~~~~~--~~~~~l~~~~~~~~---~~~G~~v~ip~~~~~-~~a~~~~eg~---~~ 69 (274) |+ +.-|+.+|+|+||+|..|+.++..++ ++.++++..+.++. ..+|++++||.|+++ ++.+.+.+.. ++ T Consensus 1 M~~~~~~T~l~Dii~pEvF~~Yv~~~~~e~~~l~qSGiv~~d~~l~~~~~~gG~~v~iPf~~~L~g~~~n~~~d~~~~~~ 80 (367) T protein:vir:80 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) T ss_pred CcchhhhhhhhhccchhhhhHHHhhhhhhhhhhhhcceeecCHHHHHHhhcCCCEEEeeeeccCCCCccccCCCCCcccc Confidence 99 46699999999999999999888654 44577777776654 368999999999877 5566676654 47 Q ss_pred cccccccceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc---------------- Q lcl|Aclame:pro 70 PVDQIGTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL---------------- 133 (274) Q Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~---------------- 133 (274) ++.+++.++....+..+++.|..+|.....+..|++..+.++++.+|.|..++.+|+.+++-.. 
T Consensus 81 t~~kittg~~~a~v~~r~kaw~~~Dla~~lsG~dpm~~Ia~qva~yW~r~~q~~Lla~L~Gvf~~~~a~~~~~~~~~~~~ 160 (367) T protein:vir:80 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRV 160 (367) T ss_pred cccccccchheeeeehhcccchhhhHHHHhhCchHHHHHHHHHHHHhhhhhHHHHHHHHHHhhccccccchhhhhhhhcc Confidence 8889999999999999999999999999999999999999999999999999999987764210 Q ss_pred ------------------cc-cCcccCHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhcccccccccccccccccc Q lcl|Aclame:pro 134 ------------------TV-EADITKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKG 194 (274) Q Consensus 134 ------------------~~-~~~~~~~d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g 194 (274) +. ....++++.+++|..+|+++......++|||.+++.|++++..+|...++. +. T Consensus 161 ~a~~~~~~~~~~~Dis~~t~~~~~~~s~~~~~~A~~~lGD~~~~l~~i~mHS~V~~~L~~~~li~~i~~sd~------~~ 234 (367) T protein:vir:80 161 PAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKG------QL 234 (367) T ss_pred ccccccccCceeeeeeccCCCccceecHHHHHHHHHHhccccccccEEEEchHHHHHHHhccccccccCCCC------cc Confidence 00 123467899999999999999999999999999999999998888876653 35 Q ss_pred ccchhcceeeEEcCCCC--------cceEEEEcCCeEEEEeccCc-eeeecccccc----CccEEEEEEEEEEEEEcCcc Q lcl|Aclame:pro 195 AFGEALGAVIVRSNKLN--------KGEALLAKKGAVKLITKRDF-FLEKDRDASR----KSTALYSDKHYVAYLYDESK 261 (274) Q Consensus 195 ~~~~i~G~~Vv~s~~~p--------~~~~~l~~~~a~~~~~~~~~-~ve~~r~~~~----~~~~i~~~~~~~~~v~~~~a 261 (274) .+++++|++|+++|.+| ++++|||+.+||+|....+. 
.+|++|++.+ +.+.++.|+| .+++|.+ T Consensus 235 ~i~ty~G~~VIvDD~~Pv~~~~a~~~yttYlfg~GAi~~~~~~~~~~~E~~Rd~~~~~~gG~d~L~~Rr~---~~~hP~G 311 (367) T protein:vir:80 235 TIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKE---WIVHPGG 311 (367) T ss_pred ccceecceeEEEeCCCcccccCCCceEEEEEEecceeeecccCCccceecccchhhhcCCceEEEEeeee---EEeecce Confidence 68999999999999999 47789999999999876654 4899999986 5688899988 6778887 Q ss_pred eEEEEeC--CCc---------ccC Q lcl|Aclame:pro 262 VVKITKG--AGD---------EVM 274 (274) Q Consensus 262 vv~l~~~--aa~---------~~~ 274 (274) +...... +++ ... T Consensus 312 ~s~~~~~v~~~~~~~~~~~~~~~~ 335 (367) T protein:vir:80 312 FNWLDADVTIPDNTGSPSGITSGP 335 (367) T ss_pred eeeccccccccccccccccccccc Confidence 7653321 111 011 No 65 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=100.00 E-value=2.4e-35 Score=210.34 Aligned_cols=267 Identities=13% Similarity=0.100 Sum_probs=210.7 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecC-CCCcccccCCCcccc-cccccce Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTY-SGDAQVIAEGEKIPV-DQIGTSK 78 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~-~~~a~~~~eg~~~~~-~~~~~~~ 78 (274) |+..++..+..++|+.+++.|++++++.+.+.++++.... .+..|+ +.+|+... .+.+.|++||+++|. 
++++|++ T Consensus 109 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~-~~~~~~-~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~ 186 (397) T protein:vir:49 109 KTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVENV-TTLTGS-RVYEKWTDITGLANIDDEAGKIADVDDPKLSL 186 (397) T ss_pred hhccccccCcccccHhHHHHHHHHHHhhhhHHhhhceeec-ccCccc-eEEEeeccCCcceeeecCccccccccccceee Confidence 7777777788899999999999999999999888765432 122233 56676654 367899999999997 5799999 Q ss_pred eEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcccCHHHHHHHHHHHhhcCCC Q lcl|Aclame:pro 79 REAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAIDKFNDEDLE 158 (274) Q Consensus 79 ~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~~~d~iv~a~~~l~~~~~~ 158 (274) +++.+++++..+++|++++.++.+++.+++.+++++++++.+|+.++....++.. ....+++|+++++...+..++.. T Consensus 187 i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ai~~G~g~~~~--~~~~~~~d~i~~~~~~l~~~~~~ 264 (397) T protein:vir:49 187 IKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIAALPT--KPTLTKWDDIIDLEAKVDPAIKQ 264 (397) T ss_pred EEeeeeeEEeeehhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--ccccccHHHHHHHHHhhhhhhcC Confidence 9999999999999999999999999999999999999999999999987654433 34557899999999999998888 Q ss_pred ccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcC--CCCcce----EEEEc-CC-eEEEEeccC Q lcl|Aclame:pro 159 PMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSN--KLNKGE----ALLAK-KG-AVKLITKRD 230 (274) Q Consensus 159 ~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~--~~p~~~----~~l~~-~~-a~~~~~~~~ 230 (274) +..|+|||.+|..|++.++.+. 
.....+.+.+|..++|+|+||++.+ .+|.++ .++++ .+ ++.+..+.+ T Consensus 265 ~a~~vmn~~~~~~l~~lkd~~G---~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~ 341 (397) T protein:vir:49 265 TSFFLTNTSGFTALKKVKNALG---DYLMERDVKSPTGYSIDGFAVKEVADRWLANGTGGAMPLYFGDLKQAVTLFDRQH 341 (397) T ss_pred CCEEEEcHHHHHHHHHhhcCCC---ceeeccCcCCCCCceecceeeEEecccccccccCCceeEEEeeccceEEEEeecc Confidence 8999999999999987754321 1222233566777899999998754 355433 25554 33 455667788 Q ss_pred ceeeecccc----ccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 231 FFLEKDRDA----SRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 231 ~~ve~~r~~----~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) +.++.+++. .++...+++..|+|+++.+|+++++++.+++.+=- T Consensus 342 ~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~ 389 (397) T protein:vir:49 342 MSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIADQK 389 (397) T ss_pred eEEEEeccccchhhcCceeEEEEeeeCcEEecccceEEEEeecccCCC Confidence 888876643 35677899999999999999999999988755433 No 66 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=100.00 E-value=3.2e-36 Score=215.21 Aligned_cols=257 Identities=12% Similarity=0.101 Sum_probs=206.8 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecC-CCCcccccCCCccccccccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTY-SGDAQVIAEGEKIPVDQIGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~-~~~a~~~~eg~~~~~~~~~~~~~ 79 (274) |...++..+..+||+.+++.|++.+++.+.++++++... .++ .++|+... 
.+++.|++||+..++++++|+++ T Consensus 118 ~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~----~~~--~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v 191 (387) T protein:vir:96 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTN----IKG--LEIPRVSYTLDDDDFITDVETAKELKAKGDTV 191 (387) T ss_pred hccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeee----cCC--ceeeeeeccCCcccccccccccccccccccee Confidence 555555666789999999999999999888888876532 222 55777553 46789999999999999999999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc----------ccccCcccCHHHHHHHH Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT----------LTVEADITKLDGLQTAI 149 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~----------~~~~~~~~~~d~iv~a~ 149 (274) ++.+++++..+++|+|++.++.+++.+++.+++++++++..++.++....+.. ....++..++|++++++ T Consensus 192 ~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~~~~d~i~~~~ 271 (387) T protein:vir:96 192 KFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINAL 271 (387) T ss_pred eechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccccchHHHHHHHH Confidence 99999999999999999999999999999999999999988877775544321 12223445699999999 Q ss_pred HHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEEcCCeEEEEecc Q lcl|Aclame:pro 150 DKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGAVKLITKR 229 (274) Q Consensus 150 ~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~~~~ 229 (274) ..|..++..+..|+||+.++..|++..... ++.+..|...+|+|+||++++.++. ++|+.+...|.... 
T Consensus 272 ~~l~~~y~~na~~imn~~t~~~~~~~~~~~--------~~~~~~~~~~~llG~PV~~~~~~~~---~~~GDf~~~~~~~~ 340 (387) T protein:vir:96 272 ADLHEDYRDNATIYMRYADYVKIISVLSNG--------TTNFFDTPAEKVFGKPVVFTDAAVK---PIVGDFNYFGINYD 340 (387) T ss_pred hccChhhhcCCEEEEechHHHHHHHHHhcC--------CCcccccCCccccccceEEecCCCc---eeeechhhhhhhhh Confidence 999988888889999999988876543211 1234456677999999999998764 45555544444455 Q ss_pred CceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 230 DFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 230 ~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) ++.+..+++...+.+.+++..|+|+++++|+|++.++.++|+... T Consensus 341 ~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~~~~~ 385 (387) T protein:vir:96 341 GTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENTGPL 385 (387) T ss_pred hhhheecccccCCceEEEEEEEeCcEeechhheEEEEeecCCCCC Confidence 667778888888999999999999999999999999999999999 No 67 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=100.00 E-value=3.2e-36 Score=215.21 Aligned_cols=257 Identities=12% Similarity=0.101 Sum_probs=206.8 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecC-CCCcccccCCCccccccccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTY-SGDAQVIAEGEKIPVDQIGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~-~~~a~~~~eg~~~~~~~~~~~~~ 79 (274) |...++..+..+||+.+++.|++.+++.+.++++++... .++ .++|+... 
.+++.|++||+..++++++|+++ T Consensus 118 ~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~----~~~--~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v 191 (387) T protein:vir:94 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTN----IKG--LEIPRVSYTLDDDDFITDVETAKELKAKGDTV 191 (387) T ss_pred hccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeee----cCC--ceeeeeeccCCcccccccccccccccccccee Confidence 555555666789999999999999999888888876532 222 55777553 46789999999999999999999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc----------ccccCcccCHHHHHHHH Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT----------LTVEADITKLDGLQTAI 149 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~----------~~~~~~~~~~d~iv~a~ 149 (274) ++.+++++..+++|+|++.++.+++.+++.+++++++++..++.++....+.. ....++..++|++++++ T Consensus 192 ~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~~~~d~i~~~~ 271 (387) T protein:vir:94 192 KFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINAL 271 (387) T ss_pred eechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccccchHHHHHHHH Confidence 99999999999999999999999999999999999999988877775544321 12223445699999999 Q ss_pred HHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEEcCCeEEEEecc Q lcl|Aclame:pro 150 DKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGAVKLITKR 229 (274) Q Consensus 150 ~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~~~~ 229 (274) ..|..++..+..|+||+.++..|++..... ++.+..|...+|+|+||++++.++. ++|+.+...|.... 
T Consensus 272 ~~l~~~y~~na~~imn~~t~~~~~~~~~~~--------~~~~~~~~~~~llG~PV~~~~~~~~---~~~GDf~~~~~~~~ 340 (387) T protein:vir:94 272 ADLHEDYRDNATIYMRYADYVKIISVLSNG--------TTNFFDTPAEKVFGKPVVFTDAAVK---PIVGDFNYFGINYD 340 (387) T ss_pred hccChhhhcCCEEEEechHHHHHHHHHhcC--------CCcccccCCccccccceEEecCCCc---eeeechhhhhhhhh Confidence 999988888889999999988876543211 1234456677999999999998764 45555544444455 Q ss_pred CceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 230 DFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 230 ~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) ++.+..+++...+.+.+++..|+|+++++|+|++.++.++|+... T Consensus 341 ~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~~~~~ 385 (387) T protein:vir:94 341 GTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENTGPL 385 (387) T ss_pred hhhheecccccCCceEEEEEEEeCcEeechhheEEEEeecCCCCC Confidence 667778888888999999999999999999999999999999999 No 68 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=100.00 E-value=3.2e-36 Score=215.21 Aligned_cols=257 Identities=12% Similarity=0.101 Sum_probs=206.8 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecC-CCCcccccCCCccccccccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTY-SGDAQVIAEGEKIPVDQIGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~-~~~a~~~~eg~~~~~~~~~~~~~ 79 (274) |...++..+..+||+.+++.|++.+++.+.++++++... .++ .++|+... 
.+++.|++||+..++++++|+++ T Consensus 118 ~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~----~~~--~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v 191 (387) T protein:vir:26 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTN----IKG--LEIPRVSYTLDDDDFITDVETAKELKAKGDTV 191 (387) T ss_pred hccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeee----cCC--ceeeeeeccCCcccccccccccccccccccee Confidence 555555666789999999999999999888888876532 222 55777553 46789999999999999999999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc----------ccccCcccCHHHHHHHH Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT----------LTVEADITKLDGLQTAI 149 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~----------~~~~~~~~~~d~iv~a~ 149 (274) ++.+++++..+++|+|++.++.+++.+++.+++++++++..++.++....+.. ....++..++|++++++ T Consensus 192 ~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~~~~d~i~~~~ 271 (387) T protein:vir:26 192 KFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINAL 271 (387) T ss_pred eechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccccchHHHHHHHH Confidence 99999999999999999999999999999999999999988877775544321 12223445699999999 Q ss_pred HHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEEcCCeEEEEecc Q lcl|Aclame:pro 150 DKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGAVKLITKR 229 (274) Q Consensus 150 ~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~~~~ 229 (274) ..|..++..+..|+||+.++..|++..... ++.+..|...+|+|+||++++.++. ++|+.+...|.... 
T Consensus 272 ~~l~~~y~~na~~imn~~t~~~~~~~~~~~--------~~~~~~~~~~~llG~PV~~~~~~~~---~~~GDf~~~~~~~~ 340 (387) T protein:vir:26 272 ADLHEDYRDNATIYMRYADYVKIISVLSNG--------TTNFFDTPAEKVFGKPVVFTDAAVK---PIVGDFNYFGINYD 340 (387) T ss_pred hccChhhhcCCEEEEechHHHHHHHHHhcC--------CCcccccCCccccccceEEecCCCc---eeeechhhhhhhhh Confidence 999988888889999999988876543211 1234456677999999999998764 45555544444455 Q ss_pred CceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 230 DFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 230 ~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) ++.+..+++...+.+.+++..|+|+++++|+|++.++.++|+... T Consensus 341 ~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~~~~~ 385 (387) T protein:vir:26 341 GTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENTGPL 385 (387) T ss_pred hhhheecccccCCceEEEEEEEeCcEeechhheEEEEeecCCCCC Confidence 667778888888999999999999999999999999999999999 No 69 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=100.00 E-value=3.8e-35 Score=209.28 Aligned_cols=261 Identities=13% Similarity=0.034 Sum_probs=201.1 Q ss_pred CCc-cccchhhccchHHHHHHHH-HHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccce Q lcl|Aclame:pro 1 MAQ-GTTKVSNLIVPEVLAPMMQ-AELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSK 78 (274) Q Consensus 1 ma~-~~T~~~~~~iPe~~~~~v~-~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~ 78 (274) ++. .++..+..+||+.+.+.++ +.+++...+.++++.. 
...| .+++|+....+.+.|++||+.+|.++++|++ T Consensus 249 ~~~~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~~~----~~~g-~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~ 323 (543) T protein:vir:81 249 RAMGLTKADGGYLVPFQLDPTVIITSNGSLNDIRRFARQV----VATG-DVWHGVSSAAVQWSWDAEFEEVSDDSPEFGQ 323 (543) T ss_pred hhcccccccCcccCchhhhhHHHHHHHhhhchhhhhcccc----cCCc-ceEEEEecCCcceeecccCccccccccccce Confidence 333 3344455789998887765 6667777777776542 2234 4789998888899999999999999999999 Q ss_pred eEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccc----------------cccccCcccCH Q lcl|Aclame:pro 79 REAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGA----------------TLTVEADITKL 142 (274) Q Consensus 79 ~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a----------------~~~~~~~~~~~ 142 (274) +++.+++++..++||++++.++ +++.+++.+.+++++++++|..+|.+-.++ ..++..+.+++ T Consensus 324 i~~~~~k~~~~~~is~ell~d~-~~~~~~i~~~l~~~~~~~~d~ail~G~Gt~~~p~Gi~~~~~~~~~~~~~~~~~~~~~ 402 (543) T protein:vir:81 324 PEIPVKKAQGFVPISIEALQDE-ANVTETVALLFAEGKDELEAVTLTTGTGQGNQPTGIVTALAGTAAEIAPVTAETFAL 402 (543) T ss_pred eeeeeeeeEeeehhhHHHHhcc-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccchhhcccccccccccccccccH Confidence 9999999999999999999887 799999999999999999999998542211 11233455789 Q ss_pred HHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcce-------- Q lcl|Aclame:pro 143 DGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGE-------- 214 (274) Q Consensus 143 d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~-------- 214 (274) ++++++...+..++.....|+|||.+|..|++..+.+. .....+ +..|..++|+|+||++++++|.+. 
T Consensus 403 ~~~~~~~~~l~~~~~~~~~~v~n~~~~~~l~~lkd~~G---~~l~~~-~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~ 478 (543) T protein:vir:81 403 ADVYAVYEQLAARHRRQGAWLANNLIYNKIRQFDTQGG---AGLWTT-IGNGEPSQLLGRPVGEAEAMDANWNTSASADN 478 (543) T ss_pred HHHHHHHHhhhccccCCcEEEEcHHHHHHHHHhhcCCC---ceeccC-cCCCCCccccceeeEEeccccccccccccCCc Confidence 99999999998888788899999999999987654322 112222 345667899999999999999654 Q ss_pred -EEEE-cCCeEEEEeccCceeeecccc------ccCccEEEEEEEEEEEEEcCcceEEEEeCCCc Q lcl|Aclame:pro 215 -ALLA-KKGAVKLITKRDFFLEKDRDA------SRKSTALYSDKHYVAYLYDESKVVKITKGAGD 271 (274) Q Consensus 215 -~~l~-~~~a~~~~~~~~~~ve~~r~~------~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~ 271 (274) .++| +.+.+.++...++.++.+.+. .++...+++..|+|+++.+|+|+++++.+++. T Consensus 479 ~~i~~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~~~a 543 (543) T protein:vir:81 479 FVLLYGNFQNYVIADRIGMTVEFIPHLFGTNRRPNGSRGWFAYYRMGADVVNPNAFRLLNVETAS 543 (543) T ss_pred ceEEEeeccceeEEeecccEEEEeccccccchhhcCceEEEEEEeeccEeecccceEEEEecccC Confidence 1344 455566666677777654432 24567899999999999999999999999888 No 70 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=100.00 E-value=5.7e-35 Score=208.31 Aligned_cols=266 Identities=12% Similarity=0.047 Sum_probs=207.4 Q ss_pred CC-ccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEE--EEeecCCCCcccccCCCcccc-ccccc Q lcl|Aclame:pro 1 MA-QGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLT--FPAFTYSGDAQVIAEGEKIPV-DQIGT 76 (274) Q Consensus 1 ma-~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~--ip~~~~~~~a~~~~eg~~~~~-~~~~~ 76 (274) ++ ..+|..+..++|+.+++.|++.+++.+.+.++++... 
..+...+ +++....+.+.|++||+++|+ +.++| T Consensus 120 ~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~ 195 (415) T protein:vir:46 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR----VTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPF 195 (415) T ss_pred hhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceee----ccCCceeEEEEEecCCcceeecccccccccccccce Confidence 22 2344556689999999999999999999988886532 2222344 444455667899999999997 46899 Q ss_pred ceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc-------------cccCcccCHH Q lcl|Aclame:pro 77 SKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL-------------TVEADITKLD 143 (274) Q Consensus 77 ~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~-------------~~~~~~~~~d 143 (274) +++++.+++++..+++|++++.++.+++.+++.+++++++++.+|+.++....++.. ....+..+++ T Consensus 196 ~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~ 275 (415) T protein:vir:46 196 FQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLD 275 (415) T ss_pred eeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccccccceeccccccchH Confidence 999999999999999999999999999999999999999999999999976643211 1234456899 Q ss_pred HHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcce----EEEEc Q lcl|Aclame:pro 144 GLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGE----ALLAK 219 (274) Q Consensus 144 ~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~----~~l~~ 219 (274) ++++++..+...+..+..|+|||..|..|++..+.+. 
.....+.+.+|..++|+|+||++++++|.++ .++|+ T Consensus 276 ~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G---~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~g 352 (415) T protein:vir:46 276 DIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLG---NYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIG 352 (415) T ss_pred HHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCC---CeeeccCcCCCCCccccceeeEEeccccccCCCccEEEEE Confidence 9999999999988899999999999999987643221 1222233566777899999999999998644 24554 Q ss_pred C-C-eEEEEeccCceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 220 K-G-AVKLITKRDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 220 ~-~-a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) . . ++.+..+.++.++..++. ..+..+++..|+|+++.+|+++++++.+++.+=- T Consensus 353 d~~~~~~~~~~~~~~v~~~~~~-~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~~~ 408 (415) T protein:vir:46 353 NLKDAIVLFDRSQYQASWTDYM-HFGECLMIAVRQDCRILDYKSAIVIEYDDSERGE 408 (415) T ss_pred ehhccEEEEeecceEEEeeccc-cCceEEEEEEEeccEEeccccEEEEEeeccCCCC Confidence 3 3 355667778888776543 3456789999999999999999999988776655 No 71 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=100.00 E-value=5.7e-35 Score=208.31 Aligned_cols=266 Identities=12% Similarity=0.047 Sum_probs=207.4 Q ss_pred CC-ccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEE--EEeecCCCCcccccCCCcccc-ccccc Q lcl|Aclame:pro 1 MA-QGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLT--FPAFTYSGDAQVIAEGEKIPV-DQIGT 76 (274) Q Consensus 1 ma-~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~--ip~~~~~~~a~~~~eg~~~~~-~~~~~ 76 (274) ++ ..+|..+..++|+.+++.|++.+++.+.+.++++... 
..+...+ +++....+.+.|++||+++|+ +.++| T Consensus 120 ~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~ 195 (415) T protein:vir:47 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR----VTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPF 195 (415) T ss_pred hhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceee----ccCCceeEEEEEecCCcceeecccccccccccccce Confidence 22 2344556689999999999999999999988886532 2222344 444455667899999999997 46899 Q ss_pred ceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc-------------cccCcccCHH Q lcl|Aclame:pro 77 SKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL-------------TVEADITKLD 143 (274) Q Consensus 77 ~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~-------------~~~~~~~~~d 143 (274) +++++.+++++..+++|++++.++.+++.+++.+++++++++.+|+.++....++.. ....+..+++ T Consensus 196 ~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~ 275 (415) T protein:vir:47 196 FQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLD 275 (415) T ss_pred eeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccccccceeccccccchH Confidence 999999999999999999999999999999999999999999999999976643211 1234456899 Q ss_pred HHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcce----EEEEc Q lcl|Aclame:pro 144 GLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGE----ALLAK 219 (274) Q Consensus 144 ~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~----~~l~~ 219 (274) ++++++..+...+..+..|+|||..|..|++..+.+. 
.....+.+.+|..++|+|+||++++++|.++ .++|+ T Consensus 276 ~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G---~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~g 352 (415) T protein:vir:47 276 DIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLG---NYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIG 352 (415) T ss_pred HHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCC---CeeeccCcCCCCCccccceeeEEeccccccCCCccEEEEE Confidence 9999999999988899999999999999987643221 1222233566777899999999999998644 24554 Q ss_pred C-C-eEEEEeccCceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 220 K-G-AVKLITKRDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 220 ~-~-a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) . . ++.+..+.++.++..++. ..+..+++..|+|+++.+|+++++++.+++.+=- T Consensus 353 d~~~~~~~~~~~~~~v~~~~~~-~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~~~ 408 (415) T protein:vir:47 353 NLKDAIVLFDRSQYQASWTDYM-HFGECLMIAVRQDCRILDYKSAIVIEYDDSERGE 408 (415) T ss_pred ehhccEEEEeecceEEEeeccc-cCceEEEEEEEeccEEeccccEEEEEeeccCCCC Confidence 3 3 355667778888776543 3456789999999999999999999988776655 No 72 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=100.00 E-value=7.5e-35 Score=207.67 Aligned_cols=267 Identities=11% Similarity=0.084 Sum_probs=208.2 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEee-cCCCCcccccCCCccccc-ccccce Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAF-TYSGDAQVIAEGEKIPVD-QIGTSK 78 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~-~~~~~a~~~~eg~~~~~~-~~~~~~ 78 (274) |+..+++.+..++|+.+++.|++.+++.+.+.++++.... .+..|+ ..++.. ...+.+.|++||+.++.. 
+++|++ T Consensus 109 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~-~~~~~~-~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~ 186 (397) T protein:vir:48 109 KTDASGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVENV-TTLTGS-RVYEKWADITGLAKLDDEAGSIGTNDDPKLYP 186 (397) T ss_pred hhccCCccccccccHHHHHHHHHHHHHHHHHHhhhceeec-cCCcce-EEEEeecCCCcceeeeccccccccccccceee Confidence 7777777778899999999999999999999888865331 222233 233333 334568899999999976 589999 Q ss_pred eEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcccCHHHHHHHHHHHhhcCCC Q lcl|Aclame:pro 79 REAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAIDKFNDEDLE 158 (274) Q Consensus 79 ~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~~~d~iv~a~~~l~~~~~~ 158 (274) +++++++++..+++|++++.++..++.+++.+++++++++++|+.++....++. .....+++|+|+++...|..++.. T Consensus 187 v~~~~~k~~~~~~iS~ell~ds~~~l~~~v~~~l~~~~~~~~d~~il~G~g~~~--~~~~~~~~d~i~~~~~~l~~~~~~ 264 (397) T protein:vir:48 187 IRYAIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIATLP--TKPTLTKWDDIIDLQAKVDPAIKQ 264 (397) T ss_pred EEeeheeeeeehhhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--cccccccHHHHHHHHHHhhhhhcC Confidence 999999999999999999999999999999999999999999999998765443 334567899999999999998888 Q ss_pred ccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCC--CCcce----EEEEc-CC-eEEEEeccC Q lcl|Aclame:pro 159 PMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNK--LNKGE----ALLAK-KG-AVKLITKRD 230 (274) Q Consensus 159 ~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~--~p~~~----~~l~~-~~-a~~~~~~~~ 230 (274) +..|+|||.+|..|++.++.+. .......+.+|..++|+|+||++.++ +|.+. 
.++++ .+ ++.++.+.+ T Consensus 265 ~a~~v~n~~~~~~L~~lkd~~G---~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~ 341 (397) T protein:vir:48 265 TSFFLTNTSGFTALKKVKNAFG---DYLMERDVKSPTGYSIDGFAVKEVADRWLANASSGAMPLYFGDLKQAVTLFDRQQ 341 (397) T ss_pred CCEEEECHHHHHHHHHhhcCCC---ceeeccCcCCCCCceeccceeEEecccccCCcCCCceEEEEEeccceEEEEeecc Confidence 9999999999999987754321 12222335667778999999987543 44322 34444 33 466777888 Q ss_pred ceeeeccc----cccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 231 FFLEKDRD----ASRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 231 ~~ve~~r~----~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) +.++.+++ ..++.+.+++..|+|+++.+|+++++++.+++++=- T Consensus 342 ~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~ 389 (397) T protein:vir:48 342 MSLLSTNIGGGAFETDTTKIRVIDRFDVVATDTESFVPASFKAIADQK 389 (397) T ss_pred eEEEEeccchhhhhcCceeEEEEeeeccEEecccceEEEEecccccCC Confidence 88887664 346778899999999999999999999988775433 No 73 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=100.00 E-value=9.6e-35 Score=207.08 Aligned_cols=268 Identities=12% Similarity=0.041 Sum_probs=209.7 Q ss_pred CC-ccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCccccc-ccccce Q lcl|Aclame:pro 1 MA-QGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVD-QIGTSK 78 (274) Q Consensus 1 ma-~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~-~~~~~~ 78 (274) .+ ..+|..+..++|+.+++.|++.+++.+.+.++++.... ..+..++.+|+....+.+.|++||+++|+. 
.++|++ T Consensus 120 ~~~~~~~~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~ 197 (415) T protein:vir:94 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV--TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQ 197 (415) T ss_pred hhhccccccccccCcHHHHHHHHHHHHhhhhhhhhcceeec--cCCceeEEEEeecCCccceecccccccccccccccee Confidence 22 23445567899999999999999999999888865431 111224667777677789999999999975 689999 Q ss_pred eEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc-------------cccCcccCHHHH Q lcl|Aclame:pro 79 REAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL-------------TVEADITKLDGL 145 (274) Q Consensus 79 ~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~-------------~~~~~~~~~d~i 145 (274) +++.+++++..+++|++++.++.+++.+++.+++++++++.+|+.++.....+.. ....+..+|++| T Consensus 198 i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i 277 (415) T protein:vir:94 198 LAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDI 277 (415) T ss_pred eEeeheeeeeechhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchHHH Confidence 9999999999999999999999999999999999999999999999976543321 122345689999 Q ss_pred HHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcce----EEEEc-C Q lcl|Aclame:pro 146 QTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGE----ALLAK-K 220 (274) Q Consensus 146 v~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~----~~l~~-~ 220 (274) ++++..+...+..+..|+|||.+|..|++.++.+. .....+.+.+|..++|+|+||++++++|.++ .++++ . 
T Consensus 278 ~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G---~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~i~~gd~ 354 (415) T protein:vir:94 278 KDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLG---NYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNL 354 (415) T ss_pred HHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCC---CeeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEeh Confidence 99999998888889999999999999987653221 1222233566777899999999999999655 24444 3 Q ss_pred C-eEEEEeccCceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 221 G-AVKLITKRDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 221 ~-a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) . ++.+..+.++.++..++. .+++.+++..|+|+++.+|+++++++.+.+.+=- T Consensus 355 ~~~~~~~~~~~~~v~~~~~~-~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~ 408 (415) T protein:vir:94 355 KDAIVLFDRSQYQASWTDYM-HFGECLMIAVRQDCRILDYKSAIVIEYDDSERGE 408 (415) T ss_pred hccEEEEeecceEEEEeccc-cCceEEEEEEEeccEEeccccEEEEEEeccCCCC Confidence 3 355667778888876653 4567899999999999999999999987766554 No 74 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=100.00 E-value=1e-34 Score=206.87 Aligned_cols=268 Identities=12% Similarity=0.044 Sum_probs=209.2 Q ss_pred CC-ccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCccccc-ccccce Q lcl|Aclame:pro 1 MA-QGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVD-QIGTSK 78 (274) Q Consensus 1 ma-~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~-~~~~~~ 78 (274) ++ ..+|..+..++|+.+++.|++.+++.+.+.++++.... .+ ...++.+|+......+.|++||+++|+. 
.++|++ T Consensus 120 ~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~-~~-~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~ 197 (415) T protein:vir:79 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV-TN-GSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQ 197 (415) T ss_pred hhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeec-cC-CceeEEEEeecCCccceeeccccccCcccccceee Confidence 22 33444566899999999999999999998888865331 11 1124566676666788999999999975 589999 Q ss_pred eEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc-------------ccccCcccCHHHH Q lcl|Aclame:pro 79 REAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT-------------LTVEADITKLDGL 145 (274) Q Consensus 79 ~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~-------------~~~~~~~~~~d~i 145 (274) +++.+++++..+++|++++.++.+++.+++.+++++++++.+|+.++.....+. .....+..+|++| T Consensus 198 v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i 277 (415) T protein:vir:79 198 LAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDI 277 (415) T ss_pred EEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchhHH Confidence 999999999999999999999999999999999999999999999997654321 1122345789999 Q ss_pred HHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcce----EEEEcC- Q lcl|Aclame:pro 146 QTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGE----ALLAKK- 220 (274) Q Consensus 146 v~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~----~~l~~~- 220 (274) ++++..+...+..+..|+|||.+|..|++.++.+. .....+.+.+|..++|+|+||++++++|.++ .++|+. 
T Consensus 278 ~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G---~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~ 354 (415) T protein:vir:79 278 KDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLG---NYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNL 354 (415) T ss_pred HHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCC---ceeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEeh Confidence 99999999888899999999999999987653221 1222233566777899999999999998644 245543 Q ss_pred C-eEEEEeccCceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 221 G-AVKLITKRDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 221 ~-a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) . ++.++.+.++.++..++.. .++.+++..|+|+++.+|+++++++.+++.+=- T Consensus 355 ~~~~~~~~~~~~~v~~~~~~~-~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~~~ 408 (415) T protein:vir:79 355 KDAIVLFDRSQYQASWTDYMH-FGECLMIAVRQDCRILDYKSAIVIEYDDSERGE 408 (415) T ss_pred hccEEEEeecceEEEEecccc-CceEEEEEEEeccEEeccccEEEEEEeccCCCC Confidence 3 4556777888888776543 456789999999999999999999988776554 No 75 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=100.00 E-value=1e-34 Score=206.87 Aligned_cols=268 Identities=12% Similarity=0.044 Sum_probs=209.2 Q ss_pred CC-ccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCccccc-ccccce Q lcl|Aclame:pro 1 MA-QGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVD-QIGTSK 78 (274) Q Consensus 1 ma-~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~-~~~~~~ 78 (274) ++ ..+|..+..++|+.+++.|++.+++.+.+.++++.... .+ ...++.+|+......+.|++||+++|+. 
.++|++ T Consensus 120 ~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~-~~-~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~ 197 (415) T protein:vir:81 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV-TN-GSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQ 197 (415) T ss_pred hhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeec-cC-CceeEEEEeecCCccceeeccccccCcccccceee Confidence 22 33444566899999999999999999998888865331 11 1124566676666788999999999975 589999 Q ss_pred eEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc-------------ccccCcccCHHHH Q lcl|Aclame:pro 79 REAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT-------------LTVEADITKLDGL 145 (274) Q Consensus 79 ~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~-------------~~~~~~~~~~d~i 145 (274) +++.+++++..+++|++++.++.+++.+++.+++++++++.+|+.++.....+. .....+..+|++| T Consensus 198 v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i 277 (415) T protein:vir:81 198 LAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDI 277 (415) T ss_pred EEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchhHH Confidence 999999999999999999999999999999999999999999999997654321 1122345789999 Q ss_pred HHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcce----EEEEcC- Q lcl|Aclame:pro 146 QTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGE----ALLAKK- 220 (274) Q Consensus 146 v~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~----~~l~~~- 220 (274) ++++..+...+..+..|+|||.+|..|++.++.+. .....+.+.+|..++|+|+||++++++|.++ .++|+. 
T Consensus 278 ~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G---~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~ 354 (415) T protein:vir:81 278 KDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLG---NYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNL 354 (415) T ss_pred HHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCC---ceeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEeh Confidence 99999999888899999999999999987653221 1222233566777899999999999998644 245543 Q ss_pred C-eEEEEeccCceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 221 G-AVKLITKRDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 221 ~-a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) . ++.++.+.++.++..++.. .++.+++..|+|+++.+|+++++++.+++.+=- T Consensus 355 ~~~~~~~~~~~~~v~~~~~~~-~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~~~ 408 (415) T protein:vir:81 355 KDAIVLFDRSQYQASWTDYMH-FGECLMIAVRQDCRILDYKSAIVIEYDDSERGE 408 (415) T ss_pred hccEEEEeecceEEEEecccc-CceEEEEEEEeccEEeccccEEEEEEeccCCCC Confidence 3 4556777888888776543 456789999999999999999999988776554 No 76 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=100.00 E-value=1e-34 Score=206.87 Aligned_cols=268 Identities=12% Similarity=0.044 Sum_probs=209.2 Q ss_pred CC-ccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCccccc-ccccce Q lcl|Aclame:pro 1 MA-QGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVD-QIGTSK 78 (274) Q Consensus 1 ma-~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~-~~~~~~ 78 (274) ++ ..+|..+..++|+.+++.|++.+++.+.+.++++.... .+ ...++.+|+......+.|++||+++|+. 
.++|++ T Consensus 120 ~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~-~~-~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~ 197 (415) T protein:vir:98 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV-TN-GSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQ 197 (415) T ss_pred hhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeec-cC-CceeEEEEeecCCccceeeccccccCcccccceee Confidence 22 33444566899999999999999999998888865331 11 1124566676666788999999999975 589999 Q ss_pred eEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc-------------ccccCcccCHHHH Q lcl|Aclame:pro 79 REAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT-------------LTVEADITKLDGL 145 (274) Q Consensus 79 ~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~-------------~~~~~~~~~~d~i 145 (274) +++.+++++..+++|++++.++.+++.+++.+++++++++.+|+.++.....+. .....+..+|++| T Consensus 198 v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i 277 (415) T protein:vir:98 198 LAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDI 277 (415) T ss_pred EEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchhHH Confidence 999999999999999999999999999999999999999999999997654321 1122345789999 Q ss_pred HHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcce----EEEEcC- Q lcl|Aclame:pro 146 QTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGE----ALLAKK- 220 (274) Q Consensus 146 v~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~----~~l~~~- 220 (274) ++++..+...+..+..|+|||.+|..|++.++.+. .....+.+.+|..++|+|+||++++++|.++ .++|+. 
T Consensus 278 ~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G---~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~ 354 (415) T protein:vir:98 278 KDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLG---NYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNL 354 (415) T ss_pred HHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCC---ceeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEeh Confidence 99999999888899999999999999987653221 1222233566777899999999999998644 245543 Q ss_pred C-eEEEEeccCceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 221 G-AVKLITKRDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 221 ~-a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) . ++.++.+.++.++..++.. .++.+++..|+|+++.+|+++++++.+++.+=- T Consensus 355 ~~~~~~~~~~~~~v~~~~~~~-~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~~~ 408 (415) T protein:vir:98 355 KDAIVLFDRSQYQASWTDYMH-FGECLMIAVRQDCRILDYKSAIVIEYDDSERGE 408 (415) T ss_pred hccEEEEeecceEEEEecccc-CceEEEEEEEeccEEeccccEEEEEEeccCCCC Confidence 3 4556777888888776543 456789999999999999999999988776554 No 77 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=100.00 E-value=7.6e-36 Score=213.13 Aligned_cols=257 Identities=12% Similarity=0.108 Sum_probs=206.0 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecC-CCCcccccCCCccccccccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTY-SGDAQVIAEGEKIPVDQIGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~-~~~a~~~~eg~~~~~~~~~~~~~ 79 (274) |...++..+..+||+.+++.|++.+++.+.++++++... .+| .++|+... 
.+++.|++||+.+++++++|+++ T Consensus 83 l~~~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~v~~----~~~--~~~p~~~~~~~~a~~v~E~~~~~~~~~~f~~v 156 (352) T protein:vir:78 83 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTN----IKG--LEIPRVSYTLDDDDFITDVETAKELKLKGDTV 156 (352) T ss_pred hccCCCCCCceeccHhHHHHHHHHHHhhcchhhheeeEe----cCC--ceEEEEecCCCcccccccccccccccccceee Confidence 555566667789999999999999999988888886532 223 45677553 36799999999999999999999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc----------ccccCcccCHHHHHHHH Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT----------LTVEADITKLDGLQTAI 149 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~----------~~~~~~~~~~d~iv~a~ 149 (274) ++.+++++.++++|++++.++.+++++++.+++++++++..++.++....+.. ....++...||+++++. T Consensus 157 ~~~~~k~~~~i~is~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~~~~~g~l~~~~~~~~t~~~~~d~i~~~~ 236 (352) T protein:vir:78 157 KFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGANMYDAIINAL 236 (352) T ss_pred eecceeEEeechhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCcccccceeccccccccccchHHHHHHHH Confidence 99999999999999999999999999999999999999886676665433321 11123345699999999 Q ss_pred HHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEEcCCeEEEEecc Q lcl|Aclame:pro 150 DKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGAVKLITKR 229 (274) Q Consensus 150 ~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~~~~ 229 (274) ..|...+..+..|+||+.++..|++..+.. ++.+..|...+|+|+||++++.++. ++|+.+...+.... 
T Consensus 237 ~~l~~~~~~~a~~~mn~~t~~~l~~~~~~~--------~~~~~~~~~~~llG~PV~~~~~~~~---~~~Gdf~~~~~~~~ 305 (352) T protein:vir:78 237 ADLHEDYRDNATIYMRYADYVKIISVLSNG--------TTNFFDTPAEKVFGKPVVFTDAAVK---PIVGDFNYFGINYD 305 (352) T ss_pred hccChhhhcCCEEEEehHHHHHHHHHHhcc--------CCcccccCCccccccceEEecCCCc---eeEeehhhhhhhhh Confidence 999888888899999999998887654321 1224456667899999999998765 45655555455555 Q ss_pred CceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 230 DFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 230 ~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) ++.++.+++...+...++++.|+|+++++|+|++.++.+++++-. T Consensus 306 ~~~~~~~~~~~~g~~~f~~~~r~Dg~~~~~eA~~~l~~~a~~~~~ 350 (352) T protein:vir:78 306 GTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKESTGSL 350 (352) T ss_pred hheeeeeccccCCeeEEEEEeeeCceeechhheEEEEeecccCCC Confidence 677788888888999999999999999999999999999888888 No 78 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=100.00 E-value=6.8e-35 Score=207.90 Aligned_cols=262 Identities=16% Similarity=0.145 Sum_probs=207.0 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccc-ccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQ-IGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~-~~~~~~ 79 (274) ++..+|+.+..++|+.+.+.|++.+++.+.+.++++... 
.+|+ .+||+....+.+.|++||+++|+++ .+|+++ T Consensus 138 ~~~~~~~~gg~~vP~~~~~~Ii~~l~~~~~i~~~~~~~~----~~g~-~~ip~~~~~~~a~~v~E~~~~~~~~~~~f~~i 212 (425) T protein:vir:95 138 RNLRAVAGGELTIPEVVVNRIMDIMGDYTTLYPLVDKIR----VKGT-TRILVDTDTSPATWIEQSGALPTGDVGTIASI 212 (425) T ss_pred HhhcccccCceeccHHHHHHHHHHHHhhhhHHHhhceee----cCce-eEEEEecCCcccccccccccccccccccccee Confidence 444556667789999999999999999999988886532 3454 6899998889999999999999887 589999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc---------------ccccCcccCHHH Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT---------------LTVEADITKLDG 144 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~---------------~~~~~~~~~~d~ 144 (274) ++.+++++..+++|++++.++.+++.+++.+++++++++++|+.+|.+-..+. ....++..++++ T Consensus 213 ~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~G~G~~~~~p~Gil~~~~~~~~~~~~~~~~~~~~ 292 (425) T protein:vir:95 213 DFDGFKVGKVTFVDNYLLQDSIINLDDYVTKKIARAIAKALDLAIVKGTGAANKQPLGIIPSLPPENQVTVEADNNLLKN 292 (425) T ss_pred eeeheeeeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccccccccccccchHHH Confidence 99999999999999999999999999999999999999999999996532110 112345567899 Q ss_pred HHHHHHHHhhcC--CCccEEEEcHHHHHH----HHh--hhccccccccccccccccccccchhcceeeEEcCCCCcceEE Q lcl|Aclame:pro 145 LQTAIDKFNDED--LEPMVLFVNPLDAGG----LRT--SASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEAL 216 (274) Q Consensus 145 iv~a~~~l~~~~--~~~~~~v~~p~~~~~----L~~--~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~ 216 (274) ++++...+..++ ....+|+||+.++.. |++ +...+++.. 
...+..++++|+||+.++.+|.++.+ T Consensus 293 ~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~l~~l~~~kd~~g~~i~~-------~~~~~~~~l~G~pvv~~~~~~~~~i~ 365 (425) T protein:vir:95 293 LVKQIGLIDTGDDSVGEIVAVMKRSTYYNRLVEFSIQVDSNGNVVGK-------LPNLRTPDLLGLRVVFNNFLDDDTVL 365 (425) T ss_pred HHHHHHhhhhhccccCceEEEEeChHHHHHHHHHHhhcCCCCceeec-------cCCCCCccccceeeEEcCcCCCccEE Confidence 999988887654 355679999997643 332 222222211 12345678999999999999999877 Q ss_pred EEcCCeEEEEeccCceeeecccc--ccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 217 LAKKGAVKLITKRDFFLEKDRDA--SRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 217 l~~~~a~~~~~~~~~~ve~~r~~--~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) +.+.+.+.++.+.++.++.+++. .++.+.+++..|+|+++.+|+|+++++.+.|.+=. T Consensus 366 ~Gd~~~~~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~d~~~~~~~a~~~~~i~~~~~g~ 425 (425) T protein:vir:95 366 FGEFEQYTLVERENITIDSSTHVKFTEDQTAFRGKGRFDGKPVKPEAFVLVTITDPVQGA 425 (425) T ss_pred EEecccEEEEeecceEEEeecccccccCceEEEEEEeeCcEeecccceEEEEecCcCCCC Confidence 76666666777777777766554 45678899999999999999999999999876666 No 79 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=100.00 E-value=2.4e-35 Score=210.35 Aligned_cols=257 Identities=12% Similarity=0.094 Sum_probs=205.2 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeec-CCCCcccccCCCccccccccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFT-YSGDAQVIAEGEKIPVDQIGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~-~~~~a~~~~eg~~~~~~~~~~~~~ 79 (274) |...++..+..+||+.+++.|++.+++.+.+++++++.. .++ .++|+.. 
..+++.|++||+..+.++++|+++ T Consensus 118 l~~~t~s~gG~~IP~~~~~~Ii~~~~~~~~l~~~~~v~~----~~~--~~~p~~~~~~~~a~~v~E~~~~~~~~~~f~~v 191 (387) T protein:vir:93 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTN----IKG--LEIPRVSYTLDDDDFITDVETAKELKLKGDTV 191 (387) T ss_pred hccCcCCCCceeechhHHHHHHHHHHhhchhhhheeeee----cCC--ceEEEEeecCCccccccCccccccccccccee Confidence 555566667789999999999999999888888876532 223 4577754 346789999999999999999999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc----------ccccCcccCHHHHHHHH Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT----------LTVEADITKLDGLQTAI 149 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~----------~~~~~~~~~~d~iv~a~ 149 (274) ++.+++++..+++|+|++.++.+|+.+++.+++++++++..++.++....+.. .+..++..+||+|++++ T Consensus 192 ~~~~~k~~~~~~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~l~~~~~~~v~~~~~~d~i~~~~ 271 (387) T protein:vir:93 192 KFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLDHMSFYNGSVKEVEGADMYDAIINAL 271 (387) T ss_pred eeeheeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccccchHHHHHHHH Confidence 99999999999999999999999999999999999999998887775544322 12223445699999999 Q ss_pred HHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEEcCCeEEEEecc Q lcl|Aclame:pro 150 DKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGAVKLITKR 229 (274) Q Consensus 150 ~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~~~~ 229 (274) ..|...+.....|+||+.++..+++..... ++.+..|...+|+|+||++++.++. ++|+.+...|.... 
T Consensus 272 ~~l~~~~~~~a~~~mn~~t~~~~~~~~~d~--------~~~~~~~~~~~llG~PV~~~~~~~~---~~~GDf~~~~~~~~ 340 (387) T protein:vir:93 272 ADLHEDYRDNATIYMRYADYVKIISVLSNG--------TTNFFDTPAEKVFGKPVVFTDAAVK---PIVGDFNYFGINYD 340 (387) T ss_pred hccChhhhcCCEEEEechHHHHHHHHHhcC--------CCcccccCCccccccceEEecCCCc---eeeeehhhhheehh Confidence 999988888889999999987765432211 1223345567899999999998764 45555555555566 Q ss_pred CceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 230 DFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 230 ~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) ++.+..+++..++.+.++++.|+|+++++|+|++.++.++|++-. T Consensus 341 ~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~eA~~~l~~k~~~~~~ 385 (387) T protein:vir:93 341 GTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENTGSL 385 (387) T ss_pred hheeeecccccCCceeEEEEeeeCceeechhheEEEEeecCCCCC Confidence 677788888888999999999999999999999999998888777 No 80 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=100.00 E-value=1.7e-35 Score=211.16 Aligned_cols=257 Identities=12% Similarity=0.101 Sum_probs=205.7 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecC-CCCcccccCCCccccccccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTY-SGDAQVIAEGEKIPVDQIGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~-~~~a~~~~eg~~~~~~~~~~~~~ 79 (274) |...++..+..+||+.++..|++.+++.+.++++++... .++ .++|+... 
.+++.|++||+.++.++++|+++ T Consensus 133 ~~~~t~~~GG~lIP~~~~~~Ii~~~~~~~~l~~~~~v~~----~~~--~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~i 206 (402) T protein:vir:93 133 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTN----IKG--LEIPRVSYTLDDDDFITDVETAKELKAKGDTV 206 (402) T ss_pred hccCCCcCCccccchhHHHHHHHhHHhhhhhhhhceeee----cCC--ceeeeeeccCCcccccccccccccccccccee Confidence 444555556789999999999999999888888886532 222 55777653 45789999999999999999999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc----------ccccCcccCHHHHHHHH Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT----------LTVEADITKLDGLQTAI 149 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~----------~~~~~~~~~~d~iv~a~ 149 (274) ++.+++++..+++|+|++.++.+++.+++.+++++++++..++.++....+.. ....++...+|+++++. T Consensus 207 ~~~~~k~~~~i~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~~~~~~~~~~~~~~~~d~l~~~~ 286 (402) T protein:vir:93 207 KFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINAL 286 (402) T ss_pred eecceeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccccchHHHHHHHH Confidence 99999999999999999999999999999999999999988777775443321 12223445689999999 Q ss_pred HHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEEcCCeEEEEecc Q lcl|Aclame:pro 150 DKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGAVKLITKR 229 (274) Q Consensus 150 ~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~~~~ 229 (274) ..|...+..+..|+||+.++..|++..... ++.+..|...+|+|+||++++.++. ++|+.++..|.... 
T Consensus 287 ~~l~~~y~~na~~imn~~t~~~~~~~~~d~--------~~~~~~~~~~~llG~PV~~t~~~~~---i~~GDf~~~~~~~~ 355 (402) T protein:vir:93 287 ADLHEDYRDNATIYMRYADYVKIISVLSNG--------TTNFFDTPAEKVFGKPVVFTDAAVK---PIVGDFNYFGINYD 355 (402) T ss_pred hccChhhhcCCEEEEechHHHHHHHHHhcC--------CCcccccCCccccccceEEecCCCc---eeeechhhhhhhhh Confidence 999888888889999999988876543211 1234456677999999999998874 55665554455555 Q ss_pred CceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 230 DFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 230 ~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) ++.+..++++..+...+++..|+|++|++|+|++.++.++++..- T Consensus 356 ~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~ik~~~~~~ 400 (402) T protein:vir:93 356 GTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENTGPL 400 (402) T ss_pred hhhhhhhhcccCCceEEEEEEEeCcEEechhheEEEEeecCCCCC Confidence 667778888888999999999999999999999999999998888 No 81 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=100.00 E-value=1.4e-34 Score=206.14 Aligned_cols=269 Identities=14% Similarity=0.108 Sum_probs=199.4 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecC-CCCcccccCCCccccccccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTY-SGDAQVIAEGEKIPVDQIGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~-~~~a~~~~eg~~~~~~~~~~~~~ 79 (274) |+..++.....++|+.+.+.|++.+++.+.++++++... .++..++||+... 
.+.+.|++||+.+|+++++|+++ T Consensus 151 ~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~----~~~~~~~~~~~~~~~~~a~wv~E~~~~~~s~~~f~~i 226 (497) T protein:vir:78 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRP----VTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARV 226 (497) T ss_pred hhcccCcccccccchhhhHHHHHHHHhhhhHHhhccccc----cCCCceEEEEEcCCCCcceeeccCcccccccccceee Confidence 666677777789999999999999999999999886532 2345689999754 46789999999999999999999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc----------ccccC------------ Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT----------LTVEA------------ 137 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~----------~~~~~------------ 137 (274) ++.+++++.++++|+|++.++ +++.+++.+++++++++++|+.+|..-.+.- .+... T Consensus 227 ~~~~~k~a~~~~iS~ell~d~-~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~ 305 (497) T protein:vir:78 227 YEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATV 305 (497) T ss_pred EeeeeeeEeecHhHHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhh Confidence 999999999999999999887 5799999999999999999999986422110 00000 Q ss_pred ---------------------------------------------cccCHHHHHHHHHHHhhc-CCCccEEEEcHHHHHH Q lcl|Aclame:pro 138 ---------------------------------------------DITKLDGLQTAIDKFNDE-DLEPMVLFVNPLDAGG 171 (274) Q Consensus 138 ---------------------------------------------~~~~~d~iv~a~~~l~~~-~~~~~~~v~~p~~~~~ 171 (274) .....+.+..+...+... +..+..|+|||.+|.. 
T Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~ 385 (497) T protein:vir:78 306 SNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWEL 385 (497) T ss_pred hhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHH Confidence 000112233333333333 3456789999999999 Q ss_pred HHhhhccc--cccccccc-cccccccccchhcceeeEEcCCCCcceEEEEcC--CeEEEEeccCceeeeccc----cccC Q lcl|Aclame:pro 172 LRTSASDN--FTRPTQLG-DNIIVKGAFGEALGAVIVRSNKLNKGEALLAKK--GAVKLITKRDFFLEKDRD----ASRK 242 (274) Q Consensus 172 L~~~~~~~--~~~~~~~~-~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~--~a~~~~~~~~~~ve~~r~----~~~~ 242 (274) |++.++.+ ++.....+ ......+..++|+|+||++++.+|.++.++.+. .++.++.+.++.++..++ -.++ T Consensus 386 l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~~~f~~n 465 (497) T protein:vir:78 386 LRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDG 465 (497) T ss_pred HHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCCceEEeecccceEEEEEecccEEEeecccchhhhcC Confidence 98765433 33222111 111223344589999999999999999876443 346667788888877543 3457 Q ss_pred ccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 243 STALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 243 ~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) .+.+++..|+|+.|.+|+++|+++.+++.+-- T Consensus 466 ~v~~r~~~r~~~~v~~p~A~~~l~~~~~~~~~ 497 (497) T protein:vir:78 466 KVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) T ss_pred cEEEEEEEeecceeeccccEEEEEecCCccCC Confidence 78899999999999999999999998776666 No 82 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=100.00 E-value=1.4e-34 Score=206.14 Aligned_cols=269 Identities=14% Similarity=0.108 Sum_probs=199.4 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecC-CCCcccccCCCccccccccccee Q lcl|Aclame:pro 1 
MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTY-SGDAQVIAEGEKIPVDQIGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~-~~~a~~~~eg~~~~~~~~~~~~~ 79 (274) |+..++.....++|+.+.+.|++.+++.+.++++++... .++..++||+... .+.+.|++||+.+|+++++|+++ T Consensus 151 ~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~----~~~~~~~~~~~~~~~~~a~wv~E~~~~~~s~~~f~~i 226 (497) T protein:vir:10 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRP----VTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARV 226 (497) T ss_pred hhcccCcccccccchhhhHHHHHHHHhhhhHHhhccccc----cCCCceEEEEEcCCCCcceeeccCcccccccccceee Confidence 666677777789999999999999999999999886532 2345689999754 46789999999999999999999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc----------ccccC------------ Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT----------LTVEA------------ 137 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~----------~~~~~------------ 137 (274) ++.+++++.++++|+|++.++ +++.+++.+++++++++++|+.+|..-.+.- .+... T Consensus 227 ~~~~~k~a~~~~iS~ell~d~-~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~ 305 (497) T protein:vir:10 227 YEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATV 305 (497) T ss_pred EeeeeeeEeecHhHHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhh Confidence 999999999999999999887 5799999999999999999999986422110 00000 Q ss_pred ---------------------------------------------cccCHHHHHHHHHHHhhc-CCCccEEEEcHHHHHH Q lcl|Aclame:pro 138 ---------------------------------------------DITKLDGLQTAIDKFNDE-DLEPMVLFVNPLDAGG 171 (274) Q Consensus 138 ---------------------------------------------~~~~~d~iv~a~~~l~~~-~~~~~~~v~~p~~~~~ 171 (274) .....+.+..+...+... +..+..|+|||.+|.. 
T Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~ 385 (497) T protein:vir:10 306 SNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWEL 385 (497) T ss_pred hhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHH Confidence 000112233333333333 3456789999999999 Q ss_pred HHhhhccc--cccccccc-cccccccccchhcceeeEEcCCCCcceEEEEcC--CeEEEEeccCceeeeccc----cccC Q lcl|Aclame:pro 172 LRTSASDN--FTRPTQLG-DNIIVKGAFGEALGAVIVRSNKLNKGEALLAKK--GAVKLITKRDFFLEKDRD----ASRK 242 (274) Q Consensus 172 L~~~~~~~--~~~~~~~~-~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~--~a~~~~~~~~~~ve~~r~----~~~~ 242 (274) |++.++.+ ++.....+ ......+..++|+|+||++++.+|.++.++.+. .++.++.+.++.++..++ -.++ T Consensus 386 l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~~~f~~n 465 (497) T protein:vir:10 386 LRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDG 465 (497) T ss_pred HHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCCceEEeecccceEEEEEecccEEEeecccchhhhcC Confidence 98765433 33222111 111223344589999999999999999876443 346667788888877543 3457 Q ss_pred ccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 243 STALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 243 ~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) .+.+++..|+|+.|.+|+++|+++.+++.+-- T Consensus 466 ~v~~r~~~r~~~~v~~p~A~~~l~~~~~~~~~ 497 (497) T protein:vir:10 466 KVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) T ss_pred cEEEEEEEeecceeeccccEEEEEecCCccCC Confidence 78899999999999999999999998776666 No 83 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=100.00 E-value=2.4e-34 Score=204.88 Aligned_cols=267 Identities=12% Similarity=0.106 Sum_probs=210.6 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecC-CCCcccccCCCcccccc-cccce Q lcl|Aclame:pro 1 
MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTY-SGDAQVIAEGEKIPVDQ-IGTSK 78 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~-~~~a~~~~eg~~~~~~~-~~~~~ 78 (274) |+..++..+..++|+.+++.|++.+++.+.+.++++.... ....| ++.+|+... .+.+.|++||+.+|..+ ++|++ T Consensus 109 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~-~~~~~-~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~ 186 (397) T protein:vir:49 109 KTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVENV-TTLTG-SRVYEKWADITGLAKLDDEGGQIGQNDDPKLSL 186 (397) T ss_pred hhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcceeec-cCCcc-eEEEEeeccCCcceeeeccccccccccccceee Confidence 7777777788999999999999999999998888765331 11122 366776643 46789999999999875 79999 Q ss_pred eEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcccCHHHHHHHHHHHhhcCCC Q lcl|Aclame:pro 79 REAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAIDKFNDEDLE 158 (274) Q Consensus 79 ~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~~~d~iv~a~~~l~~~~~~ 158 (274) +++.+++++..+++|++++.++.+++.+++.+++++++++.+|+.++....++. .....++||+++++...+..++.. T Consensus 187 v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ail~G~g~~~--~~~~~~~~d~i~~~~~~l~~~~~~ 264 (397) T protein:vir:49 187 IRYAIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIGTLP--NKPTLAKWDDIIDLQAKVDPAIKQ 264 (397) T ss_pred eEeeeeeeEeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccc--ccccccCHHHHHHHHHhhhhhhcC Confidence 999999999999999999999999999999999999999999999997755433 334567899999999999999989 Q ss_pred ccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcC--CCCcce----EEEEcC-C-eEEEEeccC Q lcl|Aclame:pro 159 PMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSN--KLNKGE----ALLAKK-G-AVKLITKRD 230 (274) Q Consensus 159 ~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~--~~p~~~----~~l~~~-~-a~~~~~~~~ 230 (274) +..|+|||.+|..|++.++.+. .....+.+.+|..++|+|+||++++ .+|.++ .++|+. 
+ ++.++.+.+ T Consensus 265 ~a~~v~n~~~~~~l~~lkd~~g---~~l~~~~~~~g~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~ 341 (397) T protein:vir:49 265 TSLFLTNTSGFTALKKVKNAMG---DYLMERDVKSPTGYSIDGFVVKEISDRFLPNGTGGAMPLYFGDLKQAVTLFDRQH 341 (397) T ss_pred CCEEEEcHHHHHHHHHhhccCC---ceeecccccCCCCceecceeeEEecccccccccCCceeEEEeeccceEEEEeecc Confidence 9999999999999987654321 1122223556777899999998754 455432 245543 3 466777888 Q ss_pred ceeeecccc----ccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 231 FFLEKDRDA----SRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 231 ~~ve~~r~~----~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) +.++.+++. .++...+++..|+|+++.+|+++++++.+++..-- T Consensus 342 ~~i~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~~~~~~ 389 (397) T protein:vir:49 342 LSLLSTNIGGGAFETDTTKVRVIDRFDVVSTDTEAFVPASFKAIADQK 389 (397) T ss_pred cEEEEeccccchhhcCeeeEEEEEeeccEEecccceEEEEeccccccc Confidence 888877653 46778899999999999999999999977655433 No 84 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=100.00 E-value=8.2e-35 Score=207.46 Aligned_cols=260 Identities=13% Similarity=0.071 Sum_probs=200.9 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcc-----cccccc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKI-----PVDQIG 75 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~-----~~~~~~ 75 (274) ||+.+|+.+..++|+.+++.|++.+++.+.+.++++.. ...+.+++||+....+.+.|++||+.. 
|.++++ T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~----~~~~~~~~~p~~~~~~~a~wv~E~~~~~~~~~~~s~~~ 76 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNV----NMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVT 76 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhccee----eccCCcEEEEEEeCCcceEEeecccccccccccccccc Confidence 99999999999999999999999999999988888653 334567999999988899999999864 556889 Q ss_pred cceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc------------c-----cccCc Q lcl|Aclame:pro 76 TSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT------------L-----TVEAD 138 (274) Q Consensus 76 ~~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~------------~-----~~~~~ 138 (274) |+++++.++|++..+++|+|+++++.+++++++.+++++++++++|+.++....... . ..... T Consensus 77 f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (305) T protein:vir:25 77 WANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGG 156 (305) T ss_pred eeeEEeeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhheeccCCCCCcccccccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999995432110 0 00111 Q ss_pred ccCHHHHHHHHH----HHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCc-- Q lcl|Aclame:pro 139 ITKLDGLQTAID----KFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNK-- 212 (274) Q Consensus 139 ~~~~d~iv~a~~----~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~-- 212 (274) ...++++.++.. .+.......+.|+|||..+..|++.++.+. ..... .++++|+||++++.+|. 
T Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G-------~~i~~---~~~l~G~Pv~~~~~~~~~~ 226 (305) T protein:vir:25 157 VANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANG-------NPVFR---DDSFAGFRTFFNRNGAWDA 226 (305) T ss_pred chhhhHHHHHHHHHHHhhhhcccccceeEecHHHHHHHHHhhccCC-------ceeec---CCcccccceEEcCccCCCC Confidence 223344444433 344445666789999999999987653221 11111 24799999999999874 Q ss_pred --ceEEEEcCCeEEEEeccCceeeecccc------------ccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 213 --GEALLAKKGAVKLITKRDFFLEKDRDA------------SRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 213 --~~~~l~~~~a~~~~~~~~~~ve~~r~~------------~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) +..++.+.+.+.+..+.++.++.+++. .+++..+|+..|+|+.|.||+++++++..-...|- T Consensus 227 ~~~~~~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~~~~~ 302 (305) T protein:vir:25 227 DAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVA 302 (305) T ss_pred CccEEEEEecceEEEEEecCeEEEEeeeeeeecCCceeeeeecCcEEEEEEEeecceeeCcccEEEEccccccccC Confidence 345666667777777778888776653 23567889999999999999999999997555554 No 85 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=100.00 E-value=2.3e-34 Score=205.04 Aligned_cols=267 Identities=14% Similarity=0.106 Sum_probs=205.5 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecC-CCCcccccCCCccccc-ccccce Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTY-SGDAQVIAEGEKIPVD-QIGTSK 78 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~-~~~a~~~~eg~~~~~~-~~~~~~ 78 (274) |...++..+..++|+.+++.|++.+++.+.+.++++..... +..| .+.+|+... .+.+.|++||+++|+. 
.++|++ T Consensus 116 ~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~-~~~~-~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~ 193 (408) T protein:vir:10 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVS-TSNG-SRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTI 193 (408) T ss_pred hhcccccCCceeccHhHHHHHHHHHHhhchhhhhcceeecc-CCcc-eEEEeeccccccceeeecCccccccccCcceee Confidence 55555555668999999999999999999998888653321 1112 355555543 3567899999999975 589999 Q ss_pred eEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcccCHHHHHHHHH-HHhhcCC Q lcl|Aclame:pro 79 REAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAID-KFNDEDL 157 (274) Q Consensus 79 ~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~~~d~iv~a~~-~l~~~~~ 157 (274) +++.+++++..+++|++++.++..++.+++.+++++++++.+|+.++....++... ....+++++++++. .+...+. T Consensus 194 i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~~~~~--~~~~~~~~l~~~~~~~~~~~~~ 271 (408) T protein:vir:10 194 IKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKK--PTIAKFDDVITMINTAVDPAII 271 (408) T ss_pred EEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc--cccccHHHHHHHHHHhhhhhhc Confidence 99999999999999999999999999999999999999999999999876655432 34567999999874 5666666 Q ss_pred CccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcC--CCCcce----EEEE-cCC-eEEEEecc Q lcl|Aclame:pro 158 EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSN--KLNKGE----ALLA-KKG-AVKLITKR 229 (274) Q Consensus 158 ~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~--~~p~~~----~~l~-~~~-a~~~~~~~ 229 (274) .+..|+|||.+|..|++.++.+. .....+.+.+|..++|+|+||++++ .+|... .+++ +.+ ++.+..+. 
T Consensus 272 ~~a~~v~n~~~~~~l~~lkd~~G---~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~ 348 (408) T protein:vir:10 272 ATSSLLTNQSGLNKLALVKTAEG---KYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRE 348 (408) T ss_pred cCCEEEEcHHHHHHHHHhhccCC---ceEeccCcCCCCCceecceeeEEecccccCccCCCceEEEEEehhccEEEEEec Confidence 77889999999999987754332 1222233556777899999999965 456433 1444 434 35677778 Q ss_pred Cceeeecccc----ccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 230 DFFLEKDRDA----SRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 230 ~~~ve~~r~~----~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) ++.++.+++. .++...+++..|+|+++.+|+++++++.++++... T Consensus 349 ~~~v~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~~~~~~~~ 397 (408) T protein:vir:10 349 NMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQV 397 (408) T ss_pred ceEEEEcccccchhhcCceEEEEEEeeccEEeccccEEEEEeeccccCC Confidence 8888877654 35678899999999999999999999988866555 No 86 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=100.00 E-value=2.3e-34 Score=205.01 Aligned_cols=262 Identities=14% Similarity=0.070 Sum_probs=207.0 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCccccc-cccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVD-QIGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~-~~~~~~~ 79 (274) |+..++..+..++|+.+++.|++.+++.+.+.++++....- +.. ..+.+|+....+.+.|++||+++|.. 
.++|+++ T Consensus 123 ~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~-~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~~v 200 (397) T protein:vir:12 123 MSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQYVTVEPVT-TRS-GTRLLEKNADMVPFSPVEELGNLPEIDQPRFTKV 200 (397) T ss_pred ccccccccCcccCchhHHHHHHHhhhhhhhHHhhcceeecc-CCc-eeEEEEEecCCcceeeecccccccccccccceeE Confidence 66677777788999999999999999999888887653321 112 24778887777889999999999975 6899999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcccCHHHHHHHHH-HHhhcCCC Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAID-KFNDEDLE 158 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~~~d~iv~a~~-~l~~~~~~ 158 (274) ++.+++++..+++|++++.++.+++.+++.+++++++++++|..++....+.. +.+.+++++++++.. .+...+.. T Consensus 201 ~~~~~k~~~~~~is~e~l~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~g~~~---~~g~~~~~~i~~~~~~~l~~~~~~ 277 (397) T protein:vir:12 201 SYSIIDYGGIMTLSNSMLNDSDQAIMTYVAKWFAKKSVVTRNNLILAAIASLK---KVDIDGLDGIKKALNVTLDPMVAP 277 (397) T ss_pred EeeheeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhcccccc---ccccccHHHHHHHHhhccchhhhC Confidence 99999999999999999999999999999999999999999999987655432 345678999999884 67777778 Q ss_pred ccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCC-CCcc---e-EEEEcC-C-eEEEEeccCc Q lcl|Aclame:pro 159 PMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNK-LNKG---E-ALLAKK-G-AVKLITKRDF 231 (274) Q Consensus 159 ~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~-~p~~---~-~~l~~~-~-a~~~~~~~~~ 231 (274) +..|+|||.+|..|++..+.+. .....+.+.+|..++|+|+||+++++ +|.. . .++++. . 
++.+..+.++ T Consensus 278 ~a~~~~n~~~~~~L~~lkd~~G---~~l~~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~ 354 (397) T protein:vir:12 278 GSIVLTNQDGYDWLDTLKDGTG---RYLLQPDPTNPTKKLLDGRPVVPFTNRVLKTQKGKAPLIIGNLKEAIVLFDREQQ 354 (397) T ss_pred CCEEEEcHHHHHHHHHhhccCC---ceeecccccCCCCccccceeeEEecccccccCCCccEEEEEehhceEEEEeecce Confidence 8899999999999987653322 12223335677778999999987665 3421 1 245543 3 4556677788 Q ss_pred eeeeccccc----cCccEEEEEEEEEEEEEcCcceEEEEeCCC Q lcl|Aclame:pro 232 FLEKDRDAS----RKSTALYSDKHYVAYLYDESKVVKITKGAG 270 (274) Q Consensus 232 ~ve~~r~~~----~~~~~i~~~~~~~~~v~~~~avv~l~~~aa 270 (274) .++.+++.. .+...+++..|+|+++.+|+++++++.++= T Consensus 355 ~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 355 SIASTDTGAGAFETNSTKVRGIEREDVRKWDEDAVVFGQITVE 397 (397) T ss_pred EEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEEeeC Confidence 887665443 567889999999999999999999999988 No 87 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=100.00 E-value=6.8e-34 Score=202.43 Aligned_cols=267 Identities=15% Similarity=0.086 Sum_probs=205.8 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecC-CCCcccccCCCcccc-cccccce Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTY-SGDAQVIAEGEKIPV-DQIGTSK 78 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~-~~~a~~~~eg~~~~~-~~~~~~~ 78 (274) |...++..+..++|+.|+..|++.+++...+.++++.. ..++.+.++|.... .+.+.|++|+++++. 
++++|++ T Consensus 111 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~ 186 (394) T protein:vir:10 111 AGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKT----PVTTPKGTYPILKRATDRFSSVAELAENPALAEPEFEQ 186 (394) T ss_pred hcccccccCceeccHHHHHHHHHHHHhhhhhhhhceee----eccCCceEEEEEecCCCcccccccccccccccccccee Confidence 44455566668999999999999999999999988653 23445677777653 467899999999996 6799999 Q ss_pred eEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcccCHHHHHHHHHHHhhcCCC Q lcl|Aclame:pro 79 REAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAIDKFNDEDLE 158 (274) Q Consensus 79 ~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~~~d~iv~a~~~l~~~~~~ 158 (274) +++.+++++.++++|+|++.++.+++.+++.+++++.+++.+|+.++....++......+..++|++.++....-...+ T Consensus 187 v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~~~~~~~~~~~d~l~~~~~~~~~~~~- 265 (394) T protein:vir:10 187 VDWSVSTYRGAIPLSEEAIADSAVDLTSLVGQSINEKSVNTYNAMIAPVLQSFTAKATTTDTLVDSLKHILNVDLDPAY- 265 (394) T ss_pred EEeeeeeeEeeehhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccHHHHHHHHHhhhhhhc- Confidence 9999999999999999999999999999999999999999999999998887776666777889999998765443333 Q ss_pred ccEEEEcHHHHHHHHhhhcccc--ccccccccccccccccchhcceeeEEcCCC--Cc--ce-EEEEc-CC-eEEEEecc Q lcl|Aclame:pro 159 PMVLFVNPLDAGGLRTSASDNF--TRPTQLGDNIIVKGAFGEALGAVIVRSNKL--NK--GE-ALLAK-KG-AVKLITKR 229 (274) Q Consensus 159 ~~~~v~~p~~~~~L~~~~~~~~--~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~--p~--~~-~~l~~-~~-a~~~~~~~ 229 (274) ...|+|||.+|..|++..+.+. +..... .+....|..++|+|+||+++++. |. ++ .++++ .+ ++.++.+. 
T Consensus 266 ~a~~vmn~~~~~~l~~lkd~~G~~i~~~~~-~~~~~~~~~~~L~G~PV~~~~~~~~~~~~~~~~i~~gd~s~~~~~~~~~ 344 (394) T protein:vir:10 266 SRALVVTQSLFNTLDTLKDKNGRYLLHDAS-DSITDGTAKGTVLGVPVYVVGDALLGSAAGDQKAFVGDLKRGVLFADRQ 344 (394) T ss_pred cCEEEecHHHHHHHHHhhccCCCeeeeccc-cccccCCcccccccceeEEecccccCCCCCceEEEEeeccccEEEEeec Confidence 4789999999999987754332 111111 12233455678999999986643 32 22 24444 33 35566677 Q ss_pred CceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 230 DFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 230 ~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) ++.++..++..+ ...+++..|+|+++.+|++++.++.+.+.+=- T Consensus 345 ~~~v~~~~~~~~-~~~~~~~~r~d~~~~~~~ai~~~~~~~~~~~~ 388 (394) T protein:vir:10 345 QVTLAWEDSKIY-GRYLGAAFRFGVKQADSNAGYFVTNTDAASGS 388 (394) T ss_pred ceEEEEeccccc-ceeEEEEEEeccEEeccccEEEEEeecccCCC Confidence 888887776554 45789999999999999999999988876655 No 88 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=100.00 E-value=3.2e-34 Score=204.20 Aligned_cols=261 Identities=15% Similarity=0.086 Sum_probs=192.9 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274) ||+.+|. +..++|+.+++.|++.+++.++++++++... 
.++..++||++...+.+.|++||+++|+++++|++++ T Consensus 1 Mat~tt~-~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~----~~~~~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f~~v~ 75 (311) T protein:vir:99 1 MATFGTG-NLKNLPRNIADGMVKDVVQGSTVAVLSARKP----QRFGNEDIITFNGRPKAEFVGEGQQKSSTTGEFDFVT 75 (311) T ss_pred CceecCC-CceeccHHHHHHHHHHHHhhchhhhhcceee----ccCCceEEEEEeCCceeEEeecCcccccccceeeEEE Confidence 9977655 4467899999999999999999988886532 2334589999988889999999999999999999999 Q ss_pred EeehhhhcchhccHHHHh---ccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc---c------------cc---cCcc Q lcl|Aclame:pro 81 AKVRKIGKGTELTDEAVL---SGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT---L------------TV---EADI 139 (274) Q Consensus 81 ~~~~~~~~~~~is~e~~~---~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~---~------------~~---~~~~ 139 (274) +.++|++..+++|+|+++ ++..++.+++.+++++++++++|+.++....... . .. ..+. T Consensus 76 l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~~~~~~~~~ 155 (311) T protein:vir:99 76 STPKKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASKRVELTADTI 155 (311) T ss_pred EeeEEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccceeecccccc Confidence 999999999999999985 5568899999999999999999999996543111 0 00 1111 Q ss_pred c-CHHHHHHHHHHHhhcC--CCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceE- Q lcl|Aclame:pro 140 T-KLDGLQTAIDKFNDED--LEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEA- 215 (274) Q Consensus 140 ~-~~d~iv~a~~~l~~~~--~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~- 215 (274) . .++++.++...+...+ ...+.|+|||.++..|++.++.+. .....+....+..++++|+||++++.+|.+.. 
T Consensus 156 ~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G---~~l~~~~~~~~~~~~l~G~Pv~~s~~i~~~~~~ 232 (311) T protein:vir:99 156 ANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTARYTDG---RKKFPELGLGIGVSSFEGIDASVSDTVNGGDEA 232 (311) T ss_pred chhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhhccCC---CeeecCcccCCCCceecceeeEeeccccccccc Confidence 2 2345566666665543 455679999999999987764332 12223334455667999999999999874221 Q ss_pred --------------EEEcC--CeEEEEeccCceeeecccc---------ccCccEEEEEEEEEEEEEcCcceEEEEeCCC Q lcl|Aclame:pro 216 --------------LLAKK--GAVKLITKRDFFLEKDRDA---------SRKSTALYSDKHYVAYLYDESKVVKITKGAG 270 (274) Q Consensus 216 --------------~l~~~--~a~~~~~~~~~~ve~~r~~---------~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa 270 (274) ++++. ..+.+..++++.++..++. .+++..+|+..|+|++|.+| ++++++.++| T Consensus 233 ~~~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~r~~~r~d~~v~~~-~~v~~~~~~A 311 (311) T protein:vir:99 233 DPDDEDLDAARAVRGIVGDFANGIHWGVQRDIPVELIKYGDPDGQGDLKRHNQIALRLEIVYGWYVFTD-RFVVIENAVA 311 (311) T ss_pred ccccchhhccCcceEEEeeccccEEEEEecCceEEEeecCCCCcchhhhhcCcEEEEEEEeecceecCh-hHeeeecccC Confidence 22222 2344666777766654432 35667889999999999987 5777788888 No 89 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=100.00 E-value=1.1e-33 Score=201.36 Aligned_cols=267 Identities=15% Similarity=0.090 Sum_probs=207.6 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecC-CCCcccccCCCcccc-cccccce Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTY-SGDAQVIAEGEKIPV-DQIGTSK 78 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~-~~~a~~~~eg~~~~~-~~~~~~~ 78 (274) |+..++..+..++|+.+...|++.+++.+.+.++++... .++.+.++|.... .+.+.|++||++++. 
++++|++ T Consensus 109 ~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~ 184 (389) T protein:vir:10 109 TSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTP----VTTPKGTYPILKRATDRFSSVAELAENPKLAEPEFNK 184 (389) T ss_pred hcccccCCcceeehHHHHHHHHHHHHhhhhHHhhcceee----ccCCeeEEEEEecCCCcccccccccccccccccccee Confidence 776777777789999999999999999999988886532 3344577777653 456689999999985 6899999 Q ss_pred eEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcccCHHHHHHHHHHHhhcCCC Q lcl|Aclame:pro 79 REAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAIDKFNDEDLE 158 (274) Q Consensus 79 ~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~~~d~iv~a~~~l~~~~~~ 158 (274) +++.++++++.+++|++++.++.+++.+++.+++++++++.+|..++..++++......+..++|++.+++...-+..+ T Consensus 185 i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~~~~~~~~~~d~l~~~~~~~~~~~~- 263 (389) T protein:vir:10 185 VDWSVATYRGAIPLSEEAIADSAVDLTALVGQSIKEKSVNTYNAMIAPVLQSFTAKKTTTDTLVDSLKHILNVDLDPAY- 263 (389) T ss_pred eeeeheeeEeeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccccHHHHHHHHHhhhhhhh- Confidence 9999999999999999999999999999999999999999999999999888877777788899999998764322222 Q ss_pred ccEEEEcHHHHHHHHhhhccc--cccccccccccccccccchhcceeeEEcCC-CCcc---e-EEEEcC-C-eEEEEecc Q lcl|Aclame:pro 159 PMVLFVNPLDAGGLRTSASDN--FTRPTQLGDNIIVKGAFGEALGAVIVRSNK-LNKG---E-ALLAKK-G-AVKLITKR 229 (274) Q Consensus 159 ~~~~v~~p~~~~~L~~~~~~~--~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~-~p~~---~-~~l~~~-~-a~~~~~~~ 229 (274) ...|+|||.+|..|++.++.+ ++..... .+....|..++|+|+||++.++ ++.+ . .++|+. + ++.+..+. 
T Consensus 264 ~a~~~~n~~~~~~L~~lkd~~G~~i~~~~~-~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~ 342 (389) T protein:vir:10 264 SRALVVTQSLFNTLDTLKDKNGRYLLHDAS-DSITDGTAKGTILGVPVYVVGDTLLGSLAGDQKAFVGDLKRGVLFTDRQ 342 (389) T ss_pred CcEEEecHHHHHHHHHhhccCCCeeeecCc-ccccccccccccccceeEEecccccCCCCCceEEEEeeccccEEEEeec Confidence 578999999999998765432 2221111 1223345567899999987554 3322 2 355543 3 36677788 Q ss_pred CceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 230 DFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 230 ~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) ++.++..++.. +.+.+++..|+|+++.+|+++++++.+.+.+.- T Consensus 343 ~~~i~~~~~~~-~~~~~~~~~r~d~~~~~~~a~~~~~~~~~~~~~ 386 (389) T protein:vir:10 343 QVTLAWEDSKI-YGKYLGAAFRFGVQKADSKAGYFVTNTDVPGSA 386 (389) T ss_pred ceEEEeecccc-ccceEEEEEEeccEEecccceEEEEeeccCCCC Confidence 88988877644 446789999999999999999999988666665 No 90 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=100.00 E-value=4.6e-34 Score=203.34 Aligned_cols=264 Identities=8% Similarity=0.052 Sum_probs=210.5 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCC--CcccccCCCcccccccccce Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSG--DAQVIAEGEKIPVDQIGTSK 78 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~--~a~~~~eg~~~~~~~~~~~~ 78 (274) .+..++..+..++|+.++..|++.+++.+.+.++++.. ...+..+++|...... 
.+.|++||.+++.++++|++ T Consensus 114 ra~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~l~~~~----~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~s~~~f~~ 189 (421) T protein:vir:13 114 RDIMSSTNNGAVIPQEFVNEFEKLKEGYPSLKEHCHVI----PVNRNAGKMPVRAGASVDKLANLAKDTELVKAMLKTQP 189 (421) T ss_pred hhccccCCcceecchhhHHHHHHHHHhhhhhhhhceee----eccCCceEEEEeecCCccceeeccccccccccccceeE Confidence 33345555678999999999999999999888888653 2334457888765543 45679999999999999999 Q ss_pred eEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcccCHHHHHHHHHHHhhcCCC Q lcl|Aclame:pro 79 REAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAIDKFNDEDLE 158 (274) Q Consensus 79 ~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~~~d~iv~a~~~l~~~~~~ 158 (274) +++.+++++..+++|++++.++..++++++.+++++.+++.+|..+++..++... .++..+||+|++++..+..++.. T Consensus 190 i~~~~~k~~~~v~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~~~~g~~~--~~~~~~~d~i~~~~~~l~~~~~~ 267 (421) T protein:vir:13 190 MAYDIDDYGLLAPIDNSLLEDSEINFLEFVNEEFAEFAVNTENAEIVKQAKAVLA--EETINDYAGLVKTINSLVPNARK 267 (421) T ss_pred EEeeeeeeEeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHhhhhHhhhhhhccc--cccccchHHHHHHHHHhhhhhcC Confidence 9999999999999999999999999999999999999999999999887765433 23345799999999999988888 Q ss_pred ccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcce-----EEEEcCCe-EEEEeccCce Q lcl|Aclame:pro 159 PMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGE-----ALLAKKGA-VKLITKRDFF 232 (274) Q Consensus 159 ~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~-----~~l~~~~a-~~~~~~~~~~ 232 (274) +..|+|||..|..|++..+.+. .....+ ...|..++|+|+||++++++|.+. .++.+.+. +.+..+.++. 
T Consensus 268 ~a~~v~n~~~~~~l~~lkd~~G---~~i~~~-~~~~~~~tl~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~ 343 (421) T protein:vir:13 268 RAIIVTNSDGRAYLDGLMDKQG---RPLLKE-LSDGGDLVFKGRPVIELEESIFDVGDETKFIVSDFKTLIKFMDRKQYL 343 (421) T ss_pred CCEEEEcHHHHHHHHHhhcCCC---ceeecC-cCCCCCceecceeeEEeccccccCCCceEEEEEeccccEEEEEecceE Confidence 8999999999999987654321 111112 345667899999999999998543 24444443 5567788888 Q ss_pred eeeccccc--cCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 233 LEKDRDAS--RKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 233 ve~~r~~~--~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) ++..++.. ++...+++..|+|+++.+|+++..+....+.+.+ T Consensus 344 v~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~a~v 387 (421) T protein:vir:13 344 IDQSKEAGYTKNETIARIIERFDVNSPLDKSSDAEKIRKFGVIV 387 (421) T ss_pred EEeecccccccCeeEEEEEeeecceeecchhhheeeecccceee Confidence 88887765 4567899999999999999999888888777766 No 91 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=100.00 E-value=1.1e-33 Score=201.35 Aligned_cols=267 Identities=13% Similarity=0.112 Sum_probs=203.6 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeec-CCCCcccccCCCcccc-cccccce Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFT-YSGDAQVIAEGEKIPV-DQIGTSK 78 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~-~~~~a~~~~eg~~~~~-~~~~~~~ 78 (274) |...++..+..++|+.+++.|++.+++.+.+.++++.... .+..| .+.+++.. 
..+.+.|++||+.+|+ +.++|++ T Consensus 116 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~-~~~~~-~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~ 193 (404) T protein:vir:39 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESV-STSNG-SRVYEKWTDVTPLTVMDAEDGKIPDLDNPRLTI 193 (404) T ss_pred hhcccccCCceeccHHHHHHHHHHHHhhhhHHhhcceeec-cCCcc-eEEEEeecCCccceeeecCccccccccccceee Confidence 5555666667899999999999999999999888865321 11112 24445443 3467899999999997 5799999 Q ss_pred eEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcccCHHHHHHHHH-HHhhcCC Q lcl|Aclame:pro 79 REAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAID-KFNDEDL 157 (274) Q Consensus 79 ~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~~~d~iv~a~~-~l~~~~~ 157 (274) ++++++++++.+++|++++.++.+++.+++.+++++++++++|+.++....+.. ......+++++++++. .+...+. T Consensus 194 i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~g~g~~~--~~~~~~~~~~i~~~~~~~~~~~~~ 271 (404) T protein:vir:39 194 IKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAMGTVP--KKPTIAKFDDVITMINTSVDPAII 271 (404) T ss_pred EEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhcccccc--cccccccHHHHHHHHHHhhhhhhc Confidence 999999999999999999999999999999999999999999999998755443 3344567999999876 4555566 Q ss_pred CccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCC--CCcce----EEEE-cCC-eEEEEecc Q lcl|Aclame:pro 158 EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNK--LNKGE----ALLA-KKG-AVKLITKR 229 (274) Q Consensus 158 ~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~--~p~~~----~~l~-~~~-a~~~~~~~ 229 (274) ....|+|||.+|..|++.++.+. .....+.+.++..++|+|+||+++++ +|..+ .+++ +.. ++.++.+. 
T Consensus 272 ~~a~~v~n~~~~~~L~~lkd~~G---~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~ 348 (404) T protein:vir:39 272 ATSSLLTNQSGLNKLALVKTAEG---KYLLEPDPTKPNSYLIKGKKVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRE 348 (404) T ss_pred cCCEEEEcHHHHHHHHHhhccCC---ceeeccCcCCCCcceecceeEEEecccccCccCCCccEEEEEeccccEEEEeec Confidence 77889999999999987653221 12222335566678999999999664 45332 2444 433 46667778 Q ss_pred Cceeeecccc----ccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 230 DFFLEKDRDA----SRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 230 ~~~ve~~r~~----~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) ++.++.+++. .++...+++..|+|+++.+|+++++++.++++.-- T Consensus 349 ~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~a~~~ 397 (404) T protein:vir:39 349 NMSLLPTNIGAGAFETDTTKIRVIDRFDVKTTDSEALVAGSFTAIADQV 397 (404) T ss_pred ceEEEEeccchhhhhhceeeEEEEeeeccEEecccceEEEEeeccccCC Confidence 8888877654 35677899999999999999999999977776644 No 92 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=100.00 E-value=2.1e-34 Score=205.23 Aligned_cols=265 Identities=15% Similarity=0.155 Sum_probs=217.3 Q ss_pred CCccccch------------h--h-ccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccC Q lcl|Aclame:pro 1 MAQGTTKV------------S--N-LIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAE 65 (274) Q Consensus 1 ma~~~T~~------------~--~-~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~e 65 (274) ||+..+.- + + +++ |+|+.+|.+.+++.+++.++.... ++ .+|++++||+++.. 
++..+.+ T Consensus 1 ~~~~~~~~~~~t~~g~~~~~~~~~al~i-e~~~g~V~~~f~~~s~~~~~v~~r-~~--~~G~sv~i~~iG~~-t~~~~~~ 75 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQSAADKLALFL-KVFGGEVLTAFARTSVTMPRHMLR-SI--ASGKSAQFPVIGRT-KAAYLKP 75 (347) T ss_pred CCCCccCcccccccccCCcccchHHHHH-HHHHHHHHHHHHHHHhhhhhhccc-cc--cccceeEeeeccce-eeeeecC Confidence 88755543 1 1 577 999999999999999999998763 22 46999999999874 7899999 Q ss_pred CCcccc--cccccceeEEeehhhhc-chhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc---------- Q lcl|Aclame:pro 66 GEKIPV--DQIGTSKREAKVRKIGK-GTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT---------- 132 (274) Q Consensus 66 g~~~~~--~~~~~~~~~~~~~~~~~-~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~---------- 132 (274) |.+++. .+++..+.++++.++.. .+.|.|.+..++..|+++.+.++++.+++++.|+.++..+.... T Consensus 76 g~~l~~~~~~~~~~e~~ltiD~~~y~~~~VddiD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~ 155 (347) T protein:vir:33 76 GENLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGSNENI 155 (347) T ss_pred CCCCCCCCCCCccceEEEEechhhhhhHHHhhHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccc Confidence 998865 56888999999988754 48899999999999999999999999999999999875442110 Q ss_pred -------cc----ccCc---------ccCHHHHHHHHHHHhhcCC--CccEEEEcHHHHHHHHhhhcccccccccccccc Q lcl|Aclame:pro 133 -------LT----VEAD---------ITKLDGLQTAIDKFNDEDL--EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNI 190 (274) Q Consensus 133 -------~~----~~~~---------~~~~d~iv~a~~~l~~~~~--~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~ 190 (274) .. ..++ ...|+.|++|...|.++++ .+|+++++|..|+.|+++.. +......++.. 
T Consensus 156 ~~~~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~--~~~~d~~~~~~ 233 (347) T protein:vir:33 156 EGLGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALM--PNAANYQALLD 233 (347) T ss_pred ccccccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCcEEEeCHHHHHHHhcccc--ccccccccccc Confidence 00 0010 1126788899999988775 68999999999999998753 33333445567 Q ss_pred ccccccchhcceeeEEcCCCCcce--------------------------------EEEEcCCeEEEEeccCceeeeccc Q lcl|Aclame:pro 191 IVKGAFGEALGAVIVRSNKLNKGE--------------------------------ALLAKKGAVKLITKRDFFLEKDRD 238 (274) Q Consensus 191 ~~~g~~~~i~G~~Vv~s~~~p~~~--------------------------------~~l~~~~a~~~~~~~~~~ve~~r~ 238 (274) +.+|.+++++|++|+.|+++|.+. .++|++++++....+++++|.+|+ T Consensus 234 ~~~G~V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e~~r~ 313 (347) T protein:vir:33 234 PERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARR 313 (347) T ss_pred cccceeEEEeceeEEEecccccCccccccccccccccccccCCcccceeccccceeeeeecchhheeeeeeceeeeeccc Confidence 899999999999999999998531 257899999999999999999999 Q ss_pred cccCccEEEEEEEEEEEEEcCcceEEEEeCCCcc Q lcl|Aclame:pro 239 ASRKSTALYSDKHYVAYLYDESKVVKITKGAGDE 272 (274) Q Consensus 239 ~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~ 272 (274) +.++.+.|++.+.||+++++|+++|.|..+--++ T Consensus 314 ~~~~~d~i~~~~~~G~~vlrP~~av~i~~~~~~~ 347 (347) T protein:vir:33 314 ANYQADQIIAKYAMGHGGLRPEAAGAIVLPKVSE 347 (347) T ss_pred hhhhhHhhhhhhhcCCceecccceEEEecCCCCC Confidence 9999999999999999999999999999988888 No 93 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=100.00 E-value=3.5e-34 Score=203.99 Aligned_cols=257 Identities=15% Similarity=0.092 Sum_probs=200.5 Q ss_pred CCccccchhhccchH-HHHHHHHHHHHHhhhhcccc-cccccccccCCCEEEEEeecCCCCcccccCCCcccccccccce Q lcl|Aclame:pro 1 
MAQGTTKVSNLIVPE-VLAPMMQAELDKKLRFAQFA-DIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSK 78 (274) Q Consensus 1 ma~~~T~~~~~~iPe-~~~~~v~~~~~~~~~~~~l~-~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~ 78 (274) |...++..+..++|+ .++..+++.+++.++++++. +. +....| .++||+....+.+.|++||+.++.++++|++ T Consensus 357 ~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l~~~~---~~~~~g-~~~ip~~~~~~~a~wv~E~~~~~~s~~~f~~ 432 (632) T protein:vir:96 357 LEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARM---LPGLVG-DVDIPKKTSGANFYWIGEDEDVQDSDFDFTT 432 (632) T ss_pred hhcccccccccccccccchHHHHHHHhhcchhhhhcceE---eecCCc-ceEEEEEeCCceeEeecCCccccccccceee Confidence 344444445556665 56789999999988887763 32 233334 4999999888899999999999999999999 Q ss_pred eEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc-------------ccccCcccCHHHH Q lcl|Aclame:pro 79 REAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT-------------LTVEADITKLDGL 145 (274) Q Consensus 79 ~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~-------------~~~~~~~~~~d~i 145 (274) +++.+++++..+.+|++++.++.+++.+.+.++|++++++++|+.+|.....+. .+...+.++|+++ T Consensus 433 i~l~~~k~~~~v~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~i 512 (632) T protein:vir:96 433 LSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASV 512 (632) T ss_pred EEeeeeEEEEehhhHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccceeeecccccceecccccCCHHHH Confidence 999999999999999999999999999999999999999999999996432111 1122345689999 Q ss_pred HHHHHHHhhcCC--CccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEEcCCeE Q lcl|Aclame:pro 146 QTAIDKFNDEDL--EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGAV 223 (274) Q Consensus 146 v~a~~~l~~~~~--~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~a~ 223 (274) +++...+...+. .+..|+|||..+..|++....+. .|.. +.. 
.++++|+||++++++|.++.++.+.+.+ T Consensus 513 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~l~~~~l~d~-----~G~~-i~~--~~~l~G~pv~~s~~ip~~~~~~gd~s~~ 584 (632) T protein:vir:96 513 VDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDN-----TGER-IWQ--NNEVNGYRAEASNQIPADTWIFGDWSQI 584 (632) T ss_pred HHHHHHHhhcccccCccEEEEchhHHHHHHHHhccCC-----CCce-eec--CCeecccceEeccccccCcEEEeecceE Confidence 999988877653 45689999999888875432111 1111 221 2579999999999999999887777766 Q ss_pred EEEeccCceeeecc--ccccCccEEEEEEEEEEEEEcCcceEEEEeCC Q lcl|Aclame:pro 224 KLITKRDFFLEKDR--DASRKSTALYSDKHYVAYLYDESKVVKITKGA 269 (274) Q Consensus 224 ~~~~~~~~~ve~~r--~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~a 269 (274) .+....++.+..++ +...+++.++++.++|+++.+|++++.++++| T Consensus 585 ~i~~~~~~~i~~~~~~~~~~~~v~~~~~~~~d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 585 VIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) T ss_pred EEEEecceEEEEccccccccCceEEEEEeecCceeechhhhhheeecC Confidence 66666666666555 44578889999999999999999999999999 No 94 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=100.00 E-value=1.9e-33 Score=199.99 Aligned_cols=268 Identities=14% Similarity=0.055 Sum_probs=201.7 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCC----CCcccccCCCcccccc-cc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYS----GDAQVIAEGEKIPVDQ-IG 75 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~----~~a~~~~eg~~~~~~~-~~ 75 (274) ++..++..+..++|+.+++.|++.+++.+.+++++... ..++..+++|+.... 
+.+.|++||+.+|+++ .+ T Consensus 118 ~~~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~ 193 (413) T protein:vir:81 118 STATLTDEFQGGYGTTWNRNIIYRRREKLVVADLMDNL----TMTNTTIKYLMEKANRVVEGGFKTVAEGGKKPYMRFAD 193 (413) T ss_pred hhcccccccccccchhhHHHHHHHHhhhhhHHhhccee----eccCCceeEEEeccccccccccceecCcccccccCccc Confidence 44455666778999999999999999999988888643 334556788876542 4578999999999987 58 Q ss_pred cceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc-------------cccCcccCH Q lcl|Aclame:pro 76 TSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL-------------TVEADITKL 142 (274) Q Consensus 76 ~~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~-------------~~~~~~~~~ 142 (274) |+++++.+++++..++||++++.++ ..+.+++.+++++++++.+|+.+|....++.. +...+...+ T Consensus 194 f~~i~~~~~k~~~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~Gi~~~~~~~~~~~~~~~~~~ 272 (413) T protein:vir:81 194 FDIVTESLSKIAGLTKITDEMIEDY-DFLVSYINARLLEELAIEEERQLLLGDGTGNNLTGLLKRDGIQTLAVSNKDELA 272 (413) T ss_pred ceeeEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcccccccccccccccccccchhH Confidence 9999999999999999999999888 56999999999999999999999864321110 111223346 Q ss_pred HHHHHHHHHHhhc-CCCccEEEEcHHHHHHHHhhhccc--ccccccc--ccccccccccchhcceeeEEcCCCCcceEEE Q lcl|Aclame:pro 143 DGLQTAIDKFNDE-DLEPMVLFVNPLDAGGLRTSASDN--FTRPTQL--GDNIIVKGAFGEALGAVIVRSNKLNKGEALL 217 (274) Q Consensus 143 d~iv~a~~~l~~~-~~~~~~~v~~p~~~~~L~~~~~~~--~~~~~~~--~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l 217 (274) +.+.++...+..+ .+....|+|||.+|..|++.++.+ ++..... 
+......+..++++|+||++++++|.+++++ T Consensus 273 ~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~l~G~pv~~s~~~~~~~~~~ 352 (413) T protein:vir:81 273 DSIYKAMTNISLATPFQADALVINPLDYQELRLAKDANGQYYGGGVFQGQYGSGGIMLDPAPWGLRTVQSQVVPVGKPVV 352 (413) T ss_pred HHHHHHHHHhhhhccCCCcEEEEcHHHHHHHHHhhccCCceeccccccccccccccccCceecceeeEEcCCCCcccEEE Confidence 7777777666543 456677999999999998765432 2211110 0011112234589999999999999999887 Q ss_pred EcCC-eEEEEeccCceeeecccc----ccCccEEEEEEEEEEEEEcCcceEEEEeCCCccc Q lcl|Aclame:pro 218 AKKG-AVKLITKRDFFLEKDRDA----SRKSTALYSDKHYVAYLYDESKVVKITKGAGDEV 273 (274) Q Consensus 218 ~~~~-a~~~~~~~~~~ve~~r~~----~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~ 273 (274) .+.+ ++.+..+.++.++.+++. .+++..+++..|+|+++.+|+++++++.++|.+= T Consensus 353 gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~p 413 (413) T protein:vir:81 353 GAFRSAASVLRKGGVRIDSTNTNVDDFENNLITVRAEERVGLMVTFPEAIVQLDVAEVVTP 413 (413) T ss_pred EecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEecCCCCC Confidence 7655 455666778888776654 4577899999999999999999999999887777 No 95 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=100.00 E-value=1.4e-33 Score=200.73 Aligned_cols=259 Identities=18% Similarity=0.141 Sum_probs=200.2 Q ss_pred CCcc-ccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeec-CCCCcccccCCCcccc-cccccc Q lcl|Aclame:pro 1 MAQG-TTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFT-YSGDAQVIAEGEKIPV-DQIGTS 77 (274) Q Consensus 1 ma~~-~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~-~~~~a~~~~eg~~~~~-~~~~~~ 77 (274) |... ++..+..++|+.+++.|++.+++.+.+.++++... .++.+.++|... ..+.+.|++||+.++. 
++++|+ T Consensus 133 ~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~ 208 (400) T protein:vir:38 133 VNAGVKAADAASTIPETISNTPQRELQTVVDLKPFTNVFQ----ASTQKGTYPTVANATTKMVTVAELEKNPAMAKPEFK 208 (400) T ss_pred HhhcccccCCcccccHHHHHHHHHHHHhhhhhhhcceeEe----ccCcceEEEEEecCCCccccccccccccccccccce Confidence 3333 33445679999999999999999988888886532 234457888765 3467899999999986 579999 Q ss_pred eeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcccCHHHHHHHHHHHhhcCC Q lcl|Aclame:pro 78 KREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAIDKFNDEDL 157 (274) Q Consensus 78 ~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~~~d~iv~a~~~l~~~~~ 157 (274) ++++.+++++..+++|+|++.++.+++.+++.+.+++++++.+|..++...++.. ..+..+++++.++....-+. . T Consensus 209 ~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~~~---~~~~~~~~~~~~~~~~~~~~-~ 284 (400) T protein:vir:38 209 PVNWSVETYRQALPVSQESIDDSAIDLVGLIAQNGQQIKVNTTNGAVATLLKGFT---AKTISSVDDLKHINNVDLDP-A 284 (400) T ss_pred eeEeehhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc---ccccccHHHHHHHHHhhhhh-h Confidence 9999999999999999999999999999999999999999999999887665443 23456789998887654333 2 Q ss_pred CccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcce----EEEEcC-C-eEEEEeccCc Q lcl|Aclame:pro 158 EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGE----ALLAKK-G-AVKLITKRDF 231 (274) Q Consensus 158 ~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~----~~l~~~-~-a~~~~~~~~~ 231 (274) ...+|+|||.+|..|++..+.+. .....+.+.+|..++|+|+||++++++|.+. .++|+. 
+ ++.++.+.++ T Consensus 285 ~~a~~v~~~~~~~~l~~lkd~~G---~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~~~g~~~~~~gd~s~~~~~~~~~~~ 361 (400) T protein:vir:38 285 YSRVIIASQSFYNFLDTVKDGNG---RYLLQDSILTPSGKSVLGMPIAVVSDDTLGAAGEAHAFLGDIKRAILFANRADF 361 (400) T ss_pred hCcEEEEcHHHHHHHHHhhccCC---CeeeecCcCCCCccccccceeEEecccccCCCCceEEEEEeccccEEEEeecce Confidence 35789999999999987654321 1122233566777899999999999988533 245543 3 3556667788 Q ss_pred eeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeCCCc Q lcl|Aclame:pro 232 FLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAGD 271 (274) Q Consensus 232 ~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~ 271 (274) .+...++.. +...+++..|+|+++.+|+++++|+.+++- T Consensus 362 ~~~~~~~~~-~~~~~~~~~r~d~~~~~~~a~~~l~~~~~a 400 (400) T protein:vir:38 362 MVRWVDDQI-YGQFLQAGMRFGVSVADEKAGYFLTYTPKA 400 (400) T ss_pred EEEEecccc-cceeEEEEEEeccEEecccceEEEEeecCC Confidence 887776644 456899999999999999999999997666 No 96 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=100.00 E-value=2.7e-33 Score=199.12 Aligned_cols=262 Identities=15% Similarity=0.095 Sum_probs=202.3 Q ss_pred CCccc-cchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecC-CCCcccccCCCcccc-cccccc Q lcl|Aclame:pro 1 MAQGT-TKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTY-SGDAQVIAEGEKIPV-DQIGTS 77 (274) Q Consensus 1 ma~~~-T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~-~~~a~~~~eg~~~~~-~~~~~~ 77 (274) ++... +..+..++|+.+++.|++.+++.+.+.++++... ..+.+.++|.... .+.+.|++||+.+|. 
++++|+ T Consensus 127 ~~~~~t~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~----~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~ 202 (394) T protein:vir:97 127 QKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQ----AKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFK 202 (394) T ss_pred hccccccccccccChHHHHHHHHHHhhhhhhhhhhceeee----ccCcceEEEEEecCCCccceecccccccccccccce Confidence 33333 3345679999999999999999999888886532 2334577888753 456899999999997 579999 Q ss_pred eeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcccCHHHHHHHHHHHhhcCC Q lcl|Aclame:pro 78 KREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAIDKFNDEDL 157 (274) Q Consensus 78 ~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~~~d~iv~a~~~l~~~~~ 157 (274) ++++.+++++..+++|+|++.++.+++.+++.+++++.+++.+|..++..+++.. ..+..++++++++.....+. . T Consensus 203 ~v~l~~~k~~~~i~is~ell~ds~~~~~~~i~~~la~~~~~~~~~~i~~g~~~~~---~~~~~~~~~~~~~~~~~~~~-~ 278 (394) T protein:vir:97 203 DVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFT---TKTVKNLDEIKALLNGGFDP-A 278 (394) T ss_pred eEEeehhheeeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcccccc---ccccccHHHHHHHHHhhhhh-h Confidence 9999999999999999999999999999999999999999999999987665433 34456799999988655433 3 Q ss_pred CccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCC--CCcceEEEEcCCe-EEEEeccCceee Q lcl|Aclame:pro 158 EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNK--LNKGEALLAKKGA-VKLITKRDFFLE 234 (274) Q Consensus 158 ~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~--~p~~~~~l~~~~a-~~~~~~~~~~ve 234 (274) ....|+|||.+|..|++..+.+. .....+.+.+|..++|+|+||+++++ ++.+++++.+.+. 
+.+..+.++.++ T Consensus 279 ~~a~~v~n~~~~~~l~~lkd~~G---~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~~ 355 (394) T protein:vir:97 279 YNVSLIVSQSFYQTLDTLKDGNG---RYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLR 355 (394) T ss_pred hCCEEEEcHHHHHHHHHhhccCC---CeeeecCcCCCCCceeccceeEEecccccCCccEEEeeccccEEEEEecceEEE Confidence 35679999999999987654332 12222335667778999999999554 5666666655443 556777888888 Q ss_pred eccccccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 235 KDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 235 ~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) ..++.. ....+++..|+|+++.+|+++++++.+.+.+=. T Consensus 356 ~~~~~~-~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~p~ 394 (394) T protein:vir:97 356 WADNEI-YGQYLQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) T ss_pred Eecccc-cceeEEEEEEEccEEecccceEEEEecccccCC Confidence 766554 456889999999999999999999997666655 No 97 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=100.00 E-value=2e-33 Score=199.84 Aligned_cols=266 Identities=15% Similarity=0.098 Sum_probs=190.3 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhccccccc-ccccccCCCEEEEEeecCCCCcccccCCCccccccccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADID-STLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~-~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~ 79 (274) |.+.++..+.+++|+.+++.|++.+++.+++.++.... 
..+...++ .++||+....+.+.|++||+.+|+++++|+++ T Consensus 338 ~~~~~~~~Gg~~vp~~~~~~ii~~l~~~svv~~l~~~~~~~~~~~~~-~~~ip~~t~~~~a~wv~Eg~~~~~s~~~f~~v 416 (645) T protein:vir:93 338 TTTDPQWAGSLSEYQEYAQDFIDYLRPQTIIGRFGQGGIPALRQVPF-NIRVHAQVSGGAAGWVGEGKTKPLTKFDFESI 416 (645) T ss_pred ccccccccCCccCchhhHHHHHHhhhhhhhHHhhccccccccccccC-ceeeeeeecCcceEEeccCccccccccceeEE Confidence 32333334678999999999999999999988876432 12222233 48999998888999999999999999999999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc------------ccccCcccCHHHHHH Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT------------LTVEADITKLDGLQT 147 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~------------~~~~~~~~~~d~iv~ 147 (274) ++.++|++.++++|+|++.++.+++.+++.+++++++++++|+.+|....++. ....+....++++.. T Consensus 417 ~l~~~kla~~~~iS~ell~ds~~~~~~~i~~~l~~aia~~~d~a~l~g~g~~~~~~~p~gi~~~~~~~~~~~~~~~d~~~ 496 (645) T protein:vir:93 417 TFSHAKVSAIAVLTEELIRFSSPAADALVRNALAEAVVARLDTDFVDPKKAAVADVSPASITHDVKGTASSGNPDADAEA 496 (645) T ss_pred EEeeEEEEEeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccCCccccceeccccccccccchHHHHHH Confidence 99999999999999999999999999999999999999999999995432221 011122234567777 Q ss_pred HHHHHhhcCC--CccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEEcCCeEEE Q lcl|Aclame:pro 148 AIDKFNDEDL--EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGAVKL 225 (274) Q Consensus 148 a~~~l~~~~~--~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~ 225 (274) ++..+..++. ...+|+|||.++..|++.++.+.... ..+ + ...-++|+|+||++++++|.+.. 
+.+.+.+.+ T Consensus 497 ~~~~~~~a~~~~~~a~~vmn~~~~~~L~~lkd~~G~~~---~~~-~-~~~~~tL~G~PV~~s~~vp~~~~-~gd~s~~~i 570 (645) T protein:vir:93 497 AFGQFVAANLQPTGAVWLMSSTNALALSMRKNALGQKE---YPD-M-TLLGGSFQGLPVIVSQYVGDQLV-LVNAPDIYL 570 (645) T ss_pred HHHHHHhcCCCccccEEEEcHHHHHHHHhccccCCcee---ecC-C-CCCCceeeceeeEEeccCCccee-EeccccEEE Confidence 7777765554 44689999999999988764322111 111 1 12235899999999999997543 334444434 Q ss_pred EeccCceeeecccc------------------------ccCccEEEEEEEEEEEEEcCcceEEEEeC---CCccc Q lcl|Aclame:pro 226 ITKRDFFLEKDRDA------------------------SRKSTALYSDKHYVAYLYDESKVVKITKG---AGDEV 273 (274) Q Consensus 226 ~~~~~~~ve~~r~~------------------------~~~~~~i~~~~~~~~~v~~~~avv~l~~~---aa~~~ 273 (274) ....++.+...++. ..++..+++..|+|+++.+|+++++||.. +|+-= T Consensus 571 g~~~~v~i~~s~~a~~~~~~~~~~~~~~~~~~~~v~lf~~d~vaira~~r~d~~~~~p~a~~~lt~~~~g~~~~~ 645 (645) T protein:vir:93 571 ADDGGVAVDMSREASLEMQSEPTGDSTTPSPVELVSMFQTGSVAIRAERWINWRRRRTAAVAVITGVNYGSASGG 645 (645) T ss_pred EEecceEEEeecceeEEEeecccccccccccccchhHhhcCceEEEEEEEEcceeeCccceEEEecccCCcccCC Confidence 33333333322221 13556789999999999999999999842 33322 No 98 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=100.00 E-value=2.2e-33 Score=199.61 Aligned_cols=266 Identities=13% Similarity=0.060 Sum_probs=203.3 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCccccc-cccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVD-QIGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~-~~~~~~~ 79 (274) |...++..+..++|+.+++.|++.+++.+.+.+++..... .+. .....+|+....+.+.|++||++++.. 
.++|+++ T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~-~~~-~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v 183 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPV-RTR-SGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNV 183 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeec-cCC-ceeEEEEeecCCccceeecccccccccccccceeE Confidence 6666666677899999999999999999998888765321 111 223567777777789999999999976 5899999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcccCHHHHHHHH-HHHhhcCCC Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAI-DKFNDEDLE 158 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~~~d~iv~a~-~~l~~~~~~ 158 (274) ++.+++++..+++|++++.++.+++.+++.+.+++++++.+|..++...+++. ..+.++++++++++ ..|...+.. T Consensus 184 ~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~---~~~~~~~d~i~~~~~~~l~~~~~~ 260 (392) T protein:vir:10 184 QYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLT---KQAIKSLDDIKDVLNVKLDPAISP 260 (392) T ss_pred EeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc---ccCccCHHHHHHHHHHhhhhhhcc Confidence 99999999999999999999999999999999999999999999987665433 34557899999987 467777778 Q ss_pred ccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEE-cCC-CC------cce-EEEEcCC--eEEEEe Q lcl|Aclame:pro 159 PMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVR-SNK-LN------KGE-ALLAKKG--AVKLIT 227 (274) Q Consensus 159 ~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~-s~~-~p------~~~-~~l~~~~--a~~~~~ 227 (274) +..|+|||.+|..|++.++.+ +.....+.+.+|..++|+|+|+++ ++. .| .++ .++++.+ .+.+.. 
T Consensus 261 ~a~~vm~~~~~~~L~~lkd~~---G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~ 337 (392) T protein:vir:10 261 NAILLTNQDGFNYLDKLKDKD---GKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFK 337 (392) T ss_pred CCEEEEcHHHHHHHHHhhccC---CCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEe Confidence 889999999999998765322 122222335567778999987665 322 22 122 2455543 345566 Q ss_pred ccCceeeecccc----ccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 228 KRDFFLEKDRDA----SRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 228 ~~~~~ve~~r~~----~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) +.++.++.+++. .++...+++..|+|+++.+|+++++++.+.+...- T Consensus 338 ~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~ 388 (392) T protein:vir:10 338 REDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVE 388 (392) T ss_pred ecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeccccccc Confidence 777887776543 34667899999999999999999999886655555 No 99 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=100.00 E-value=2.2e-33 Score=199.61 Aligned_cols=266 Identities=13% Similarity=0.060 Sum_probs=203.3 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCccccc-cccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVD-QIGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~-~~~~~~~ 79 (274) |...++..+..++|+.+++.|++.+++.+.+.+++..... .+. .....+|+....+.+.|++||++++.. 
.++|+++ T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~-~~~-~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v 183 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPV-RTR-SGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNV 183 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeec-cCC-ceeEEEEeecCCccceeecccccccccccccceeE Confidence 6666666677899999999999999999998888765321 111 223567777777789999999999976 5899999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcccCHHHHHHHH-HHHhhcCCC Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAI-DKFNDEDLE 158 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~~~d~iv~a~-~~l~~~~~~ 158 (274) ++.+++++..+++|++++.++.+++.+++.+.+++++++.+|..++...+++. ..+.++++++++++ ..|...+.. T Consensus 184 ~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~---~~~~~~~d~i~~~~~~~l~~~~~~ 260 (392) T protein:vir:10 184 QYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLT---KQAIKSLDDIKDVLNVKLDPAISP 260 (392) T ss_pred EeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc---ccCccCHHHHHHHHHHhhhhhhcc Confidence 99999999999999999999999999999999999999999999987665433 34557899999987 467777778 Q ss_pred ccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEE-cCC-CC------cce-EEEEcCC--eEEEEe Q lcl|Aclame:pro 159 PMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVR-SNK-LN------KGE-ALLAKKG--AVKLIT 227 (274) Q Consensus 159 ~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~-s~~-~p------~~~-~~l~~~~--a~~~~~ 227 (274) +..|+|||.+|..|++.++.+ +.....+.+.+|..++|+|+|+++ ++. .| .++ .++++.+ .+.+.. 
T Consensus 261 ~a~~vm~~~~~~~L~~lkd~~---G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~ 337 (392) T protein:vir:10 261 NAILLTNQDGFNYLDKLKDKD---GKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFK 337 (392) T ss_pred CCEEEEcHHHHHHHHHhhccC---CCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEe Confidence 889999999999998765322 122222335567778999987665 322 22 122 2455543 345566 Q ss_pred ccCceeeecccc----ccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 228 KRDFFLEKDRDA----SRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 228 ~~~~~ve~~r~~----~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) +.++.++.+++. .++...+++..|+|+++.+|+++++++.+.+...- T Consensus 338 ~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~ 388 (392) T protein:vir:10 338 REDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVE 388 (392) T ss_pred ecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeccccccc Confidence 777887776543 34667899999999999999999999886655555 No 100 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=100.00 E-value=2.2e-33 Score=199.61 Aligned_cols=266 Identities=13% Similarity=0.060 Sum_probs=203.3 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCccccc-cccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVD-QIGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~-~~~~~~~ 79 (274) |...++..+..++|+.+++.|++.+++.+.+.+++..... .+. .....+|+....+.+.|++||++++.. 
.++|+++ T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~-~~~-~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v 183 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPV-RTR-SGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNV 183 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeec-cCC-ceeEEEEeecCCccceeecccccccccccccceeE Confidence 6666666677899999999999999999998888765321 111 223567777777789999999999976 5899999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcccCHHHHHHHH-HHHhhcCCC Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAI-DKFNDEDLE 158 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~~~d~iv~a~-~~l~~~~~~ 158 (274) ++.+++++..+++|++++.++.+++.+++.+.+++++++.+|..++...+++. ..+.++++++++++ ..|...+.. T Consensus 184 ~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~---~~~~~~~d~i~~~~~~~l~~~~~~ 260 (392) T protein:vir:10 184 QYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLT---KQAIKSLDDIKDVLNVKLDPAISP 260 (392) T ss_pred EeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc---ccCccCHHHHHHHHHHhhhhhhcc Confidence 99999999999999999999999999999999999999999999987665433 34557899999987 467777778 Q ss_pred ccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEE-cCC-CC------cce-EEEEcCC--eEEEEe Q lcl|Aclame:pro 159 PMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVR-SNK-LN------KGE-ALLAKKG--AVKLIT 227 (274) Q Consensus 159 ~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~-s~~-~p------~~~-~~l~~~~--a~~~~~ 227 (274) +..|+|||.+|..|++.++.+ +.....+.+.+|..++|+|+|+++ ++. .| .++ .++++.+ .+.+.. 
T Consensus 261 ~a~~vm~~~~~~~L~~lkd~~---G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~ 337 (392) T protein:vir:10 261 NAILLTNQDGFNYLDKLKDKD---GKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFK 337 (392) T ss_pred CCEEEEcHHHHHHHHHhhccC---CCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEe Confidence 889999999999998765322 122222335567778999987665 322 22 122 2455543 345566 Q ss_pred ccCceeeecccc----ccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 228 KRDFFLEKDRDA----SRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 228 ~~~~~ve~~r~~----~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) +.++.++.+++. .++...+++..|+|+++.+|+++++++.+.+...- T Consensus 338 ~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~ 388 (392) T protein:vir:10 338 REDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVE 388 (392) T ss_pred ecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeccccccc Confidence 777887776543 34667899999999999999999999886655555 No 101 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=100.00 E-value=2.2e-33 Score=199.61 Aligned_cols=266 Identities=13% Similarity=0.060 Sum_probs=203.3 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCccccc-cccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVD-QIGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~-~~~~~~~ 79 (274) |...++..+..++|+.+++.|++.+++.+.+.+++..... .+. .....+|+....+.+.|++||++++.. 
.++|+++ T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~-~~~-~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v 183 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPV-RTR-SGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNV 183 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeec-cCC-ceeEEEEeecCCccceeecccccccccccccceeE Confidence 6666666677899999999999999999998888765321 111 223567777777789999999999976 5899999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcccCHHHHHHHH-HHHhhcCCC Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAI-DKFNDEDLE 158 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~~~d~iv~a~-~~l~~~~~~ 158 (274) ++.+++++..+++|++++.++.+++.+++.+.+++++++.+|..++...+++. ..+.++++++++++ ..|...+.. T Consensus 184 ~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~---~~~~~~~d~i~~~~~~~l~~~~~~ 260 (392) T protein:vir:10 184 QYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLT---KQAIKSLDDIKDVLNVKLDPAISP 260 (392) T ss_pred EeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc---ccCccCHHHHHHHHHHhhhhhhcc Confidence 99999999999999999999999999999999999999999999987665433 34557899999987 467777778 Q ss_pred ccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEE-cCC-CC------cce-EEEEcCC--eEEEEe Q lcl|Aclame:pro 159 PMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVR-SNK-LN------KGE-ALLAKKG--AVKLIT 227 (274) Q Consensus 159 ~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~-s~~-~p------~~~-~~l~~~~--a~~~~~ 227 (274) +..|+|||.+|..|++.++.+ +.....+.+.+|..++|+|+|+++ ++. .| .++ .++++.+ .+.+.. 
T Consensus 261 ~a~~vm~~~~~~~L~~lkd~~---G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~ 337 (392) T protein:vir:10 261 NAILLTNQDGFNYLDKLKDKD---GKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFK 337 (392) T ss_pred CCEEEEcHHHHHHHHHhhccC---CCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEe Confidence 889999999999998765322 122222335567778999987665 322 22 122 2455543 345566 Q ss_pred ccCceeeecccc----ccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 228 KRDFFLEKDRDA----SRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 228 ~~~~~ve~~r~~----~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) +.++.++.+++. .++...+++..|+|+++.+|+++++++.+.+...- T Consensus 338 ~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~ 388 (392) T protein:vir:10 338 REDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVE 388 (392) T ss_pred ecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeccccccc Confidence 777887776543 34667899999999999999999999886655555 No 102 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=100.00 E-value=2.1e-33 Score=199.76 Aligned_cols=267 Identities=13% Similarity=0.069 Sum_probs=202.9 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCC-CCcccccCCCccccccccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYS-GDAQVIAEGEKIPVDQIGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~-~~a~~~~eg~~~~~~~~~~~~~ 79 (274) |...+...+..++|+.+++.|++.+++.+.+.+++++... ..+..+.+|..... 
..+.|++||+.+|+++++|+++ T Consensus 117 ~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~f~~~ 193 (409) T protein:vir:45 117 QGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQILTT---SDGRTMEWATADGTSEVGVLLGENEEAGEEDTDFGMG 193 (409) T ss_pred ccCccCcCCceeccHhHHHHHHHHHHhhhhhhhhceeeec---CCCceEEEEeeccCcccccccccccccccccccccee Confidence 4444444456799999999999999999888888765432 23445677776543 4567999999999999999999 Q ss_pred EEeehhhh-cchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccc---------------cccccCcccCHH Q lcl|Aclame:pro 80 EAKVRKIG-KGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGA---------------TLTVEADITKLD 143 (274) Q Consensus 80 ~~~~~~~~-~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a---------------~~~~~~~~~~~d 143 (274) .+..+|++ +.+++|++++.++.+++++++.+++++++++++|+.++..-... ......+.+++| T Consensus 194 ~l~~~k~~~~~i~is~ell~ds~~~l~~~i~~~la~a~~~~~~~a~l~G~G~~~~~~p~Gil~~~~~~~~~~~~~~~~~d 273 (409) T protein:vir:45 194 SLGALKMTSKIIRVSNELLQDSAIDMEAYLARRIAERIGRGEARYLIQGTGAGTPKQPKGLAASVTGTTQTAAANAVKWQ 273 (409) T ss_pred eeeeeeeeeeehhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCccccceeeeccccccccccccccchH Confidence 99999875 67899999999999999999999999999999999998533221 112234557899 Q ss_pred HHHHHHHHHhhcCCCccE--EEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCc----ceEEE Q lcl|Aclame:pro 144 GLQTAIDKFNDEDLEPMV--LFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNK----GEALL 217 (274) Q Consensus 144 ~iv~a~~~l~~~~~~~~~--~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~----~~~~l 217 (274) +++++...|..++..+.. |+|||.++..|++.++.+. .....+.+.+|..++|+|+||++++++|. 
...++ T Consensus 274 ~i~~l~~~l~~~~~~~a~~~~~~n~~~~~~l~~lkd~~G---~~i~~~~~~~~~~~~l~G~PV~~~~~~p~~~~~~~~i~ 350 (409) T protein:vir:45 274 EILALKHSIDPAYRRGPKFRLAFNDNTLKLISEMEDGQG---RPLWLPDIVGVAPASVLNVPYVIDQEIDDIGAGKKFMF 350 (409) T ss_pred HHHHHHHhhhhhhccCCeEEEEECHHHHHHHHHhhcCCC---ceeeccCcCCCCCceecceeeEEecCcCCccCCccEEE Confidence 999999999887766554 5779999999977653221 12223345667778999999999999985 23355 Q ss_pred Ec-CCeEEEEeccCceeeeccc--cccCccEEEEEEEEEEEEEcCcceEEEEeCCCccc Q lcl|Aclame:pro 218 AK-KGAVKLITKRDFFLEKDRD--ASRKSTALYSDKHYVAYLYDESKVVKITKGAGDEV 273 (274) Q Consensus 218 ~~-~~a~~~~~~~~~~ve~~r~--~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~ 273 (274) |+ .+.+.+....++.++..++ ..++.+.+++..|+|+++.+|+++++++.+++..- T Consensus 351 ~Gd~~~~~i~~~~~~~~~~~~d~~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~k~s~~~ 409 (409) T protein:vir:45 351 CGDFDRFIIRRVRYMILKRLVERYAEYDQTGFLAFHRFDCILEDTSAIKALVGKGSVGG 409 (409) T ss_pred EeehhhhheeeccceEEEEeecccccCCcEEEEEEEEeccEeechhheEEEEeccCCCC Confidence 54 3444455556666665444 44677889999999999999999999999877777 No 103 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=100.00 E-value=3e-33 Score=198.90 Aligned_cols=267 Identities=12% Similarity=0.091 Sum_probs=204.0 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCC-CCcccccCCCcccc-cccccce Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYS-GDAQVIAEGEKIPV-DQIGTSK 78 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~-~~a~~~~eg~~~~~-~~~~~~~ 78 (274) |...++..+..++|+.+++.|++.+++.+.+.++++....- +.. ..+.+|+.... 
+.+.|++||+++++ ++++|++ T Consensus 116 ~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~-~~~-~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~ 193 (408) T protein:vir:74 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVS-TSS-GSRVYEKWTDVTPLKAMDEEDGKIPDLDNPRLTI 193 (408) T ss_pred hcccccCCCceeechhHhhHHHHHHhhhcchhhhcceeecc-CCc-ceEEEEeecCCcccccccccccccccccccceee Confidence 55555666678999999999999999999888887653321 111 23667766543 45679999999997 5699999 Q ss_pred eEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcccCHHHHHHHH-HHHhhcCC Q lcl|Aclame:pro 79 REAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAI-DKFNDEDL 157 (274) Q Consensus 79 ~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~~~d~iv~a~-~~l~~~~~ 157 (274) +++++++++..+++|+|++.++.+++.+++.+++++++++++|+.++...++.. ......+++++++++ ..+...+. T Consensus 194 i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~G~~~--~~~~~~~~~~i~~~~~~~l~~~~~ 271 (408) T protein:vir:74 194 IKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAMGTVP--KKPTIANFDDVITMINTSVDPAII 271 (408) T ss_pred EEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--cccccccHHHHHHHHHHhhhhhhc Confidence 999999999999999999999999999999999999999999999997755433 234456899999987 46777777 Q ss_pred CccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCC--CCcce----EEEE-cCC-eEEEEecc Q lcl|Aclame:pro 158 EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNK--LNKGE----ALLA-KKG-AVKLITKR 229 (274) Q Consensus 158 ~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~--~p~~~----~~l~-~~~-a~~~~~~~ 229 (274) ....|+|||.+|..|++.++.+. .....+.+.+|..++|+|+||+++++ +|... .+++ +.+ ++.++.+. 
T Consensus 272 ~~a~~v~n~~~~~~l~~lkd~~G---~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~ 348 (408) T protein:vir:74 272 ATSSLLTNQSGLNKLALVKTAEG---KYLLEPDPTKPNSYLIKGKQVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRE 348 (408) T ss_pred CCCEEEEcHHHHHHHHHhhcCCC---ceEeccCcCCCCCceecceeeEEecCcccccccCCcceEEEEehhccEEEEEec Confidence 78899999999999987653322 12222335566678999999998654 55322 2444 433 46677788 Q ss_pred Cceeeecccc----ccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 230 DFFLEKDRDA----SRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 230 ~~~ve~~r~~----~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) ++.++.+++. .++...+++..|+|+++++|+++++++.++.+.-- T Consensus 349 ~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~ 397 (408) T protein:vir:74 349 NMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFTAIADQV 397 (408) T ss_pred ceEEEEeccccchhhcceeeEEEEEeeCcEEecccceEEEEeecccCCC Confidence 8888877654 35678899999999999999999999985443333 No 104 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=100.00 E-value=4.6e-33 Score=197.89 Aligned_cols=267 Identities=11% Similarity=0.070 Sum_probs=203.3 Q ss_pred CCccccc--hhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecC-CCCcccccCCCccccc-cccc Q lcl|Aclame:pro 1 MAQGTTK--VSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTY-SGDAQVIAEGEKIPVD-QIGT 76 (274) Q Consensus 1 ma~~~T~--~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~-~~~a~~~~eg~~~~~~-~~~~ 76 (274) |+..++. .+..++|+.+++.|++.+++.+.+.++++.... .+..| .+.+|.... .+.+.|++||+.+|+. 
.++| T Consensus 105 ~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~-~~~~~-~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f 182 (395) T protein:vir:38 105 VTSGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVENV-TTSHG-SRVYEKLADITPLKDLDDESALIGDNDDPEL 182 (395) T ss_pred HhhccCccCCCceecchhHhhHHHHHHHhhcchhhhcceeec-cCCcc-eEEEEeeccCCccccccccccccccccccce Confidence 4444433 455799999999999999999999888865321 11122 345555433 3567899999999976 5899 Q ss_pred ceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcccCHHHHHHHHH-HHhhc Q lcl|Aclame:pro 77 SKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAID-KFNDE 155 (274) Q Consensus 77 ~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~~~d~iv~a~~-~l~~~ 155 (274) +++++++++++..+++|++++.++.+++.+++.+++++++++.+|+.++....++.. .....+++++++++. .+... T Consensus 183 ~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~~--~~~~~~~~~i~~~~~~~l~~~ 260 (395) T protein:vir:38 183 TVVKYLIHRYAGITTVTNTLLKDTVDNIIQWLVNWAAKKDVVTRNAKILEVMGKAPK--KPTISQFDNIKDLENNTLDPA 260 (395) T ss_pred eeEEeeeeeeEeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc--ccccccHHHHHHHHHHhhhhh Confidence 999999999999999999999999999999999999999999999999987554432 234567999999875 56667 Q ss_pred CCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcc-----eEEEE-cCC-eEEEEec Q lcl|Aclame:pro 156 DLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKG-----EALLA-KKG-AVKLITK 228 (274) Q Consensus 156 ~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~-----~~~l~-~~~-a~~~~~~ 228 (274) +..+..|+|||.+|..|++..+.+. 
.....+.+.+|..++|+|+||+++++.+.+ ..++| +.+ .+.+..+ T Consensus 261 ~~~~a~~v~n~~~~~~L~~lkd~~G---~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~i~~gd~~~~~~i~~~ 337 (395) T protein:vir:38 261 IESTSSFITNQSGYNILSKVKDADG---RYLMQPDVTSPDKYLIDGKPVIRIADKWLPDVSGSHPLYFGDLKQGITLFDR 337 (395) T ss_pred hcCCCEEEEcHHHHHHHHHhhccCC---ceeeccCcCCCCcceeccceeEEecccccCcCCCcceEEEEeccccEEEEEe Confidence 7778899999999999987654322 222233456677789999999998865421 22444 434 4556777 Q ss_pred cCceeeecccc----ccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 229 RDFFLEKDRDA----SRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 229 ~~~~ve~~r~~----~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) .++.++..++. .+++..+++..|+|+++.+|+++++++.+++++-- T Consensus 338 ~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~ 387 (395) T protein:vir:38 338 QQMQIDTTNVGAGSFEHDTTKLRFIDRFDVQLIDDGAFAAASFKTVANQA 387 (395) T ss_pred cceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeecccCCC Confidence 88888877654 35678899999999999999999999998765544 No 105 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=100.00 E-value=1.5e-33 Score=200.62 Aligned_cols=265 Identities=16% Similarity=0.146 Sum_probs=214.3 Q ss_pred CCccccchh--------------h-ccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccC Q lcl|Aclame:pro 1 MAQGTTKVS--------------N-LIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAE 65 (274) Q Consensus 1 ma~~~T~~~--------------~-~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~e 65 (274) ||+..+... + +++ |+|+.+|...+++.+++.++.... ...+|++++||+++. 
.+++.+.+ T Consensus 1 ma~~~~~~~~~t~~~~~~~~~~~~a~~i-e~f~g~V~~~f~~~s~~~~~~~~~---~~~~G~sv~i~~ig~-~t~~~~~~ 75 (347) T protein:vir:15 1 MANIQGGQQIGTNQGKGQSAADKLALFL-KVFGGEVLTAFARTSVTMPRHMLR---SIASGKSAQFPVIGR-TKAAYLKP 75 (347) T ss_pred CCccccCCccccccccCCCcchHHHHHH-HHHHHHHHHHHHHhhhhhhccccc---cccccceeEeeeccc-eeeeeecc Confidence 998766431 1 233 778899999999999999988763 224689999999987 47899999 Q ss_pred CCcccc--cccccceeEEeehhhhc-chhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-------- Q lcl|Aclame:pro 66 GEKIPV--DQIGTSKREAKVRKIGK-GTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLT-------- 134 (274) Q Consensus 66 g~~~~~--~~~~~~~~~~~~~~~~~-~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~-------- 134 (274) |.+++. .+++..+.++++.++.. .+.|.|.+..++..|+++.+.++++.+++++.|+.++..+..+... T Consensus 76 g~~l~~~~~~~~~~e~~ltID~~~~~~~~VddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~ 155 (347) T protein:vir:15 76 GENLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDASNENI 155 (347) T ss_pred CCCCCCCCCCCccceEEEEechhhhhhHHhhhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Confidence 998855 56889999999987644 5889999999999999999999999999999999998665422100 Q ss_pred -------------ccC-ccc--------CHHHHHHHHHHHhhcCC--CccEEEEcHHHHHHHHhhhcccccccccccccc Q lcl|Aclame:pro 135 -------------VEA-DIT--------KLDGLQTAIDKFNDEDL--EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNI 190 (274) Q Consensus 135 -------------~~~-~~~--------~~d~iv~a~~~l~~~~~--~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~ 190 (274) ..+ ... -++.+.+|...|.++++ .+|+++++|..|..|+++.. +......+... 
T Consensus 156 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL~~~~--~~~~d~~~~~~ 233 (347) T protein:vir:15 156 EGLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALM--PNAANYQALID 233 (347) T ss_pred cccCccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhcccc--ccccccccccc Confidence 000 011 15666777788887765 77999999999999998753 44444455566 Q ss_pred ccccccchhcceeeEEcCCCCcc--------------------------------eEEEEcCCeEEEEeccCceeeeccc Q lcl|Aclame:pro 191 IVKGAFGEALGAVIVRSNKLNKG--------------------------------EALLAKKGAVKLITKRDFFLEKDRD 238 (274) Q Consensus 191 ~~~g~~~~i~G~~Vv~s~~~p~~--------------------------------~~~l~~~~a~~~~~~~~~~ve~~r~ 238 (274) +++|.+++++|++|+.|+++|.+ ..+++|+.|++.+..+++.+|.+|+ T Consensus 234 ~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e~~~~ 313 (347) T protein:vir:15 234 HERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARR 313 (347) T ss_pred ccceEEEEEeceEEEecccccccccccccccccccccccccccccceeeeccccceeeeeccceeeeeEeeceeeeeccc Confidence 89999999999999999999831 1357899999999999999999999 Q ss_pred cccCccEEEEEEEEEEEEEcCcceEEEEeCCCcc Q lcl|Aclame:pro 239 ASRKSTALYSDKHYVAYLYDESKVVKITKGAGDE 272 (274) Q Consensus 239 ~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~ 272 (274) +.++.+.|++.+.||+++++|+++|.|..+--++ T Consensus 314 ~~~~~d~i~~~~~~G~~vlrP~~av~~~~~~~~~ 347 (347) T protein:vir:15 314 ANYQADQIIAKYAMGHGGLRPEAAGAIVLPKVSE 347 (347) T ss_pred chhhhhhhehhhhcCCceeccccEEEEecCCCCC Confidence 9999999999999999999999999999888888 No 106 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=100.00 E-value=5.6e-33 Score=197.40 Aligned_cols=269 Identities=10% Similarity=0.059 Sum_probs=205.8 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCccccc--ccccce Q lcl|Aclame:pro 1 
MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVD--QIGTSK 78 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~--~~~~~~ 78 (274) |...++..+..++|+.+.+.|++.+++.+.+.+++..... .+. ...+.+|+....+.+.|++||+..+.+ +++|++ T Consensus 110 ~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~~-~~~-~g~~~~~~~~~~~~~~~v~e~~~~~~~~~~~~f~~ 187 (404) T protein:vir:10 110 ISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEPV-FTR-SGSRTYEKRSKQKPMKPLSENQQIPTNGDNGKLER 187 (404) T ss_pred hccccCCCCceeechhHHHHHHHHHhhhhhHhhhhceeec-cCC-ccceEEEEecCCcceeeccccccccccccccceee Confidence 6666667777899999999999999999988888765321 122 234788888777889999999999886 588999 Q ss_pred eEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc------------ccccCcccCHHHHH Q lcl|Aclame:pro 79 REAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT------------LTVEADITKLDGLQ 146 (274) Q Consensus 79 ~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~------------~~~~~~~~~~d~iv 146 (274) +++++++++..+++|++++.++.+++.+++.+.+++++++++|+.++....+.. ........+++++. T Consensus 188 i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~G~g~~~~~~gi~~~~~~~~~~~~~~~~~~~~~ 267 (404) T protein:vir:10 188 FNFKLKDLADFMSIPNDLLKFADKSLEDWIINWFVDKVRITRNAEILYGAGGDEHATGIMTANKFKKITLPKSPALKDFK 267 (404) T ss_pred eEeeheeeEeeehhhHHHHhhcHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcccceeeccccceeeccccccHHHHH Confidence 999999999999999999999999999999999999999999999996543221 01123345788998 Q ss_pred HHHH-HHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEE-cCCCCcce----EEEEc- Q lcl|Aclame:pro 147 TAID-KFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVR-SNKLNKGE----ALLAK- 219 (274) Q Consensus 147 ~a~~-~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~-s~~~p~~~----~~l~~- 219 (274) +++. 
.+...+..+.+|+|||.+|..|++.++.+ +.....+.+.+|..++|+|+||++ ++.+|.++ .++++ T Consensus 268 ~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~---G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~gd 344 (404) T protein:vir:10 268 KCKNVELLNVFKATSSWIVNQDGFNYLDSLEDKT---GRPYLQPDPKDPTQYRFLGLPVIELPNDLLLSTESAIPVLLGD 344 (404) T ss_pred HHHHhhhhccccCCCEEEEcHHHHHHHHHhhccC---CceeeccCcCCCCCccccceeeEEecccccCCCCCccEEEEEe Confidence 8876 45555566778999999999998765422 122222335667778999999985 45555432 34444 Q ss_pred CC-eEEEEeccCceeeecccc----ccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 220 KG-AVKLITKRDFFLEKDRDA----SRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 220 ~~-a~~~~~~~~~~ve~~r~~----~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) .+ ++.+..+.++.++.+++. .++...+++..|+|+++.+|+++++++.+++.+=- T Consensus 345 ~s~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~aa~~~ 404 (404) T protein:vir:10 345 TKEAYKYVSDGAYELATTNIGAGAFETNTTKARIIMRIDGNVKDSEALLIAEIPVESVQA 404 (404) T ss_pred ccccEEEEEecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeecccCCC Confidence 33 455666778888776554 35778899999999999999999999988877666 No 107 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=100.00 E-value=3.7e-33 Score=198.35 Aligned_cols=262 Identities=15% Similarity=0.102 Sum_probs=196.7 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274) |...++..+..++|+.+.+.|++.+++.+.+.++... 
......| .+++|++...+.+.|++||+.+|+++++|++++ T Consensus 132 ~~~~t~~~gg~~vP~~~~~~ii~~l~~~~~i~~~~~~--~~~~~~~-~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~~i~ 208 (435) T protein:vir:14 132 LNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGAR--TLPLSNG-NITIPRLKGGAIVGYIGADTDIPTTQQQFDDLK 208 (435) T ss_pred cccCCcCCCccccchhHHHHHHHHHhhhchhhhhcce--eeecCCC-ceEEEEEeCCcceeeeccCccccccccceeEEE Confidence 2233334445799999999999999998887776321 1222233 589999988888999999999999999999999 Q ss_pred EeehhhhcchhccHHHHhccCc--cHHHHHHHHHHHHHHHHHHHHHHHHhcccc--c-----------cccCcccC---- Q lcl|Aclame:pro 81 AKVRKIGKGTELTDEAVLSGFG--DPQGEAVRQHGLAIANKVDNDVLEALKGAT--L-----------TVEADITK---- 141 (274) Q Consensus 81 ~~~~~~~~~~~is~e~~~~s~~--d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~--~-----------~~~~~~~~---- 141 (274) +.+++++..+++|+|++.++.. ++++++.+++++++++++|+.++..-.++. . .......+ T Consensus 209 ~~~~k~~~~~~iS~ell~ds~~~~~l~~~i~~~l~~ai~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~ 288 (435) T protein:vir:14 209 LTAKKMAALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPSNVITASDASTLQKI 288 (435) T ss_pred eeeEEEEEeehhhHHHHHhhccCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccceeccccccchhhH Confidence 9999999999999999999864 589999999999999999999986432211 0 00011122 Q ss_pred HHHHHHHHHHHhhc--CCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcc------ Q lcl|Aclame:pro 142 LDGLQTAIDKFNDE--DLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKG------ 213 (274) Q Consensus 142 ~d~iv~a~~~l~~~--~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~------ 213 (274) ++++.+++..+... +..+..|+|||.++..|++.++.+.. 
..+....-++|+|+||++++.+|.+ T Consensus 289 ~~~~~~l~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~-------~l~~~~~~g~l~G~Pv~~~~~~p~~~~~~~~ 361 (435) T protein:vir:14 289 ETDLGKVILALENADANLTQPGWIMAPRTFRFLEGLRDGNGN-------KVYPELANGMLKGYPVGKTTQVPINLGETGK 361 (435) T ss_pred HHHHHHHHHHhhhccccccCCEEEEcHHHHHHHHHhhccCCc-------eeccCCCCCeeecceeEeeccccccccCCCc Confidence 34566666666554 34577899999999999876543211 1111223458999999999999863 Q ss_pred --eEEEEcCCeEEEEeccCceeeeccccc-------------cCccEEEEEEEEEEEEEcCcceEEEEeCCCcc Q lcl|Aclame:pro 214 --EALLAKKGAVKLITKRDFFLEKDRDAS-------------RKSTALYSDKHYVAYLYDESKVVKITKGAGDE 272 (274) Q Consensus 214 --~~~l~~~~a~~~~~~~~~~ve~~r~~~-------------~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~ 272 (274) ..++.+.+.+.+..+.++.++.+++.. +++..+++..|+|+++.+|+++++|+..+.=+ T Consensus 362 ~~~i~~gd~s~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:14 362 ESEIYFTDFGDVFIGEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLAGVAWGA 435 (435) T ss_pred cceEEEeecccEEEEEecccEEEEeccccccccccchhhhhhcChhheeeeeeeCceeecccceEEEecCCCCC Confidence 345555666667778888888877643 45688999999999999999999999988888 No 108 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=100.00 E-value=3.5e-33 Score=198.51 Aligned_cols=267 Identities=11% Similarity=0.012 Sum_probs=198.1 Q ss_pred CCcc-ccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccc---cCCCccccccccc Q lcl|Aclame:pro 1 MAQG-TTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVI---AEGEKIPVDQIGT 76 (274) Q Consensus 1 ma~~-~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~---~eg~~~~~~~~~~ 76 (274) +|.. +|..+..+||+.+++.|++.+++.+.++++++... .+| .+++|++...+.+.|. 
+||..+|.++++| T Consensus 141 ~a~~~~t~~GG~lvP~~~~~~Ii~~l~~~~~i~~~~~~~~----~~~-~~~~p~~~~~~~a~~~~~~~e~~~~~~~~~~f 215 (434) T protein:vir:62 141 RALGLVTGNGSVTIPDFLSKEIITYAQEENFLRRLGTGVK----TKE-NIKYPVLVKKAEAQGHKNERTNNEMPETDIEF 215 (434) T ss_pred hhhcccccccceecchhhHHHHHHhhhhhhhhhhhcceec----cCC-ceEEEEEecCCcccceecccccccccccccce Confidence 3322 23335578999999999999999999999886532 223 4889998766677775 5678899999999 Q ss_pred ceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc-----------ccccCcccCHHHH Q lcl|Aclame:pro 77 SKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT-----------LTVEADITKLDGL 145 (274) Q Consensus 77 ~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~-----------~~~~~~~~~~d~i 145 (274) +++++.+++++..+++|++++.++.+++.+++.+++++++++++|+.++..-.... .....+..++|++ T Consensus 216 ~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~g~~~~~~~~~~~~~~~~~d~l 295 (434) T protein:vir:62 216 DEIELSPTEFDALATVTKKLLARTGLPIEQIVMDELKKAYVRKETQYMVNGDEANNINDGALAKKAVEFKTDEKNLYDAL 295 (434) T ss_pred eeEEeeheeeEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccceeecccccccccccchhhHH Confidence 99999999999999999999999999999999999999999999999995432111 1122344579999 Q ss_pred HHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcce-----EEEEc- Q lcl|Aclame:pro 146 QTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGE-----ALLAK- 219 (274) Q Consensus 146 v~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~-----~~l~~- 219 (274) +++...+..++..+.+|+|||.++..|++.++.+....... ......|...+|+|+||++++.+|.+. 
.++|+ T Consensus 296 ~~l~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~-~~~~~~g~~~tl~G~pV~~~~~~~~~~~~~~~~i~~Gd 374 (434) T protein:vir:62 296 VKMKNTPVKEVRKKARWVLNTAALTKIETMKTDDGFPLLRP-FNQAEGGIGYTLLGFPVEEEDAIDIPDSPDTPVFYFGD 374 (434) T ss_pred HHHHhhcchhhhcCCEEEEcHHHHHHHHHhhccCCCEeecc-CCCccCCCCceecceeeEEecCccCccCCCceEEEEee Confidence 99999998888888899999999999987654432211111 112445666789999999999998543 14444 Q ss_pred CCeEEEEeccC-ceee--eccccccCccEEEEEEEEEEEEEc-CcceE--EEEeCCCccc Q lcl|Aclame:pro 220 KGAVKLITKRD-FFLE--KDRDASRKSTALYSDKHYVAYLYD-ESKVV--KITKGAGDEV 273 (274) Q Consensus 220 ~~a~~~~~~~~-~~ve--~~r~~~~~~~~i~~~~~~~~~v~~-~~avv--~l~~~aa~~~ 273 (274) .+.+.++.+.+ +.++ .+++...+++.+++..|+|+++++ |.++. +++.++|+.- T Consensus 375 fs~~~i~~~~g~~~i~~~~~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:62 375 FSKFYIQDVIGSLEVQKLVELFSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYVLKAPTGA 434 (434) T ss_pred ccceEEEEeeceeEEEeehhhhcccCceEEEEEeeecceeecCcccceEEEEEeccCCCC Confidence 44444444433 3344 345556677889999999999875 66654 4565666666 No 109 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=100.00 E-value=3e-33 Score=198.86 Aligned_cols=259 Identities=12% Similarity=0.038 Sum_probs=192.1 Q ss_pred CCccccc-hhhccchHHHHHHHHHHHHHhhhhccc-ccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccce Q lcl|Aclame:pro 1 MAQGTTK-VSNLIVPEVLAPMMQAELDKKLRFAQF-ADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSK 78 (274) Q Consensus 1 ma~~~T~-~~~~~iPe~~~~~v~~~~~~~~~~~~l-~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~ 78 (274) |+..+|. .+..++|+.+.+.|++.+++.++++++ ++. 
+....| .+++|++...+.+.|++||+.+|+++++|++ T Consensus 64 ~a~~~~~~~Gg~lvP~~~~~~ii~~l~~~s~l~~lg~~~---v~~~~g-~~~~p~~t~~~~a~wv~E~~~~~~s~~~f~~ 139 (366) T protein:vir:57 64 MAISTAAGSGGALIPQNMQNEVIELLRDRTVVRILGARS---IPLPNG-NLSMPRLSGGATAGYVGEGKDVVATGATFDD 139 (366) T ss_pred hhccccccCCccccchhHHHHHHHHHhhhcchhhhceee---eecCCC-ceEEEEEeCCcceeeeccCccccccccceeE Confidence 5544333 355789999999999999999888877 433 222234 4999999888899999999999999999999 Q ss_pred eEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccc--c-------------ccccCcccC-- Q lcl|Aclame:pro 79 REAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGA--T-------------LTVEADITK-- 141 (274) Q Consensus 79 ~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a--~-------------~~~~~~~~~-- 141 (274) +++.++|++..+++|+|++.++.+++++++.+++++++++++|+.++..-..+ + ........+ T Consensus 140 i~~~~~k~~~~~~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~t~~~~~ 219 (366) T protein:vir:57 140 VKLSAKTMIALVPVSNQLIGRAGFNVEQLLLGDILSAIATREDKAFLRDDGTGDTPKGMKAVATAANRLVAWTGTAINLT 219 (366) T ss_pred EEEeeEEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeeccccccceeeccccccchh Confidence 99999999999999999999999999999999999999999999998543211 0 000111222 Q ss_pred -HHHHHHHHHHH---hhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcc---- Q lcl|Aclame:pro 142 -LDGLQTAIDKF---NDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKG---- 213 (274) Q Consensus 142 -~d~iv~a~~~l---~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~---- 213 (274) .+..++.+... ...+.....|+|||.++..|++..+.+.. 
........++|+|+||++++++|.+ T Consensus 220 ~~~~~~~~~~~~~~~~~~~~~~a~~vmn~~~~~~L~~lkd~~G~-------~l~~~~~~g~l~G~Pvv~s~~ip~~~~~~ 292 (366) T protein:vir:57 220 TIDEYLDSLILKHMDSNSNMIRCGWGLSNRTYMTLFGLRDGNGN-------KVYPEMSQGILKGYPIQRTSAIPANLGDD 292 (366) T ss_pred hHHHHHHHHHHhhhccccccccCEEEecHHHHHHHHhhhccCCc-------eeccCCCCCeecceeeEEccccccccccC Confidence 33334433322 23345678899999999999876543221 1111223468999999999999963 Q ss_pred ----eEEEEcCCeEEEEeccCceeeeccccc-------------cCccEEEEEEEEEEEEEcCcceEEEEeCCC Q lcl|Aclame:pro 214 ----EALLAKKGAVKLITKRDFFLEKDRDAS-------------RKSTALYSDKHYVAYLYDESKVVKITKGAG 270 (274) Q Consensus 214 ----~~~l~~~~a~~~~~~~~~~ve~~r~~~-------------~~~~~i~~~~~~~~~v~~~~avv~l~~~aa 270 (274) ..++.+.+.+.+....++.++..+++. +++..+++..|+|+++.+|+++++++..-= T Consensus 293 ~~~~~i~~gdfs~~~i~~~~~i~i~~~~ea~~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 293 GNESEIYFCDFNDVVIGEDGMMKVDFSTEATYKDADGQLVSAFARNQSLIRVVTEHDIGFRHPEGLVLGTGVIW 366 (366) T ss_pred CCccEEEEEecceEEEEEecceEEEEeeccccccccccchhhhhcCceeEEeeeeeCcEeeccccEEEEecccC Confidence 234445666666777788888776643 344689999999999999999999987665 No 110 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=100.00 E-value=5.4e-33 Score=197.51 Aligned_cols=265 Identities=12% Similarity=0.025 Sum_probs=200.0 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecC-CCCcccccCCCcccc-cccccce Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTY-SGDAQVIAEGEKIPV-DQIGTSK 78 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~-~~~a~~~~eg~~~~~-~~~~~~~ 78 (274) ++..++..+..++|+.+.+.+.+. .+...+..+++... ......++|.... .+.+.|++|++.++. 
++++|++ T Consensus 156 ~~~~~~~~~g~lvp~~~~~~i~~~-~~~~~l~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~e~~~~~e~~~~~~~~ 230 (437) T protein:vir:10 156 VTGIALKDGKVIIPETILTPEKEV-HQFPRLGSLVRTES----VTTTTGKLPIFNNSTDLLTAHTEYGQTTKNATPVITP 230 (437) T ss_pred hhhcccccccccchHHHHHHHHHh-hhhhhhhhcceeEe----eccCceeeEEeecccccccccccccccccccccccee Confidence 445556666789999999887654 45555566654422 2233477877643 467899999999986 5689999 Q ss_pred eEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcccCHHHHHHHHH-HHhhcCC Q lcl|Aclame:pro 79 REAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAID-KFNDEDL 157 (274) Q Consensus 79 ~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~~~d~iv~a~~-~l~~~~~ 157 (274) +++.+++++.++++|++++.++.+++.+++.+.+++++++.+|..++.+.+++.... ....+++++.+++. .+...+. T Consensus 231 v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~g~g~~~~~~-~~~~~~~~~~~~~~~~l~~~~~ 309 (437) T protein:vir:10 231 ILWDLKTYTGGYVFSQELISDSSYDWQAELQSRLIELRDNTDDSLIITALTDGIKKT-TSTYLLGDLKKVLNVTLKPQDS 309 (437) T ss_pred eeeehhheeeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc-ccccchhhHHHHHHhhhhhhhh Confidence 999999999999999999999999999999999999999999999998876654443 34556888988875 5666666 Q ss_pred CccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCC--Ccc---e-EEEEc-CC-eEEEEecc Q lcl|Aclame:pro 158 EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKL--NKG---E-ALLAK-KG-AVKLITKR 229 (274) Q Consensus 158 ~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~--p~~---~-~~l~~-~~-a~~~~~~~ 229 (274) .+.+|+|||.++..|++..+.+. .....+.+.+|..++|+|+||++++++ |.+ + .++|+ .+ ++.++.+. 
T Consensus 310 ~~~~~~~~~~~~~~l~~lkd~~g---~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~r~ 386 (437) T protein:vir:10 310 AAASIVMSQSAYNLFDMATDAMG---RPLLQPNVTAATGYTLLGKTVVIVDDKLFPSASAGDVNIVVAPLKKAVINFKLT 386 (437) T ss_pred cCCEEEEcHHHHHHHHHhhccCC---CeeeccCccCCCCcccccceeEEecccccCCcCCCceEEEEeeccccEEEEeee Confidence 77899999999999987654322 122223355677789999999997654 532 2 24444 34 45677788 Q ss_pred CceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 230 DFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 230 ~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) ++.++...+...+.+.+++..|+|+++++|+++|+|+...++... T Consensus 387 ~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~~~~~~ 431 (437) T protein:vir:10 387 EITGQFQDTYDIWYKQLGIFLRQNVVQASKDLIVNLTGKLKAVTV 431 (437) T ss_pred ceEEEEecccccccceeeEEEEEccEEecccceEEEEeecccccc Confidence 888887766667778899999999999999999999976544444 No 111 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=100.00 E-value=6.5e-34 Score=202.54 Aligned_cols=264 Identities=18% Similarity=0.177 Sum_probs=209.8 Q ss_pred CCccccchh--------------hccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCC Q lcl|Aclame:pro 1 MAQGTTKVS--------------NLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEG 66 (274) Q Consensus 1 ma~~~T~~~--------------~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg 66 (274) ||+.+=... ++++ |+|.++|...+++.+++.++.... ...+|++++||+++.. 
++..+.+| T Consensus 1 m~~~~~~~~~t~~g~~~~~~d~~al~i-k~f~~eV~~~f~~~s~~~~~~~~r---~i~~G~sv~i~~iG~~-tv~~~t~G 75 (347) T protein:vir:94 1 MANVPGQKIGTDQGKGKSSSDALALFL-KVFAGEVLTAFTRRSVTADKHIVR---TIQNGKSAQFPVMGRT-SGVYLAPG 75 (347) T ss_pred CCCCCccccccccccCCccccHHHHHH-HHHhHHHHHHHHHHHhhhcccccc---cccccceEEEecccce-eeeeecCC Confidence 776432211 3455 778888888888888888888654 2256999999999864 79999999 Q ss_pred Ccccc--cccccceeEEeehhhh-cchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcc---ccc------c Q lcl|Aclame:pro 67 EKIPV--DQIGTSKREAKVRKIG-KGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKG---ATL------T 134 (274) Q Consensus 67 ~~~~~--~~~~~~~~~~~~~~~~-~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~---a~~------~ 134 (274) ++++. .+++..++++++.++. ..+.|.|.+..++..|+++.+.++++.++++..|+.++..+.. ... . T Consensus 76 ~~l~~~~~~~~~~e~~itID~~~~~~~~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~~~~~ 155 (347) T protein:vir:94 76 ERLSDKRKGIKHTEKVITIDGLLTADVMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAASNENIA 155 (347) T ss_pred CCcCCCCCCCCcceEEEEecchhhhhHHhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccC Confidence 99854 4678888899998864 4578999999999999999999999999999999998764431 100 0 Q ss_pred c----------c-Cc--------ccCHHHHHHHHHHHhhcCC--CccEEEEcHHHHHHHHhhhccccccccccccccccc Q lcl|Aclame:pro 135 V----------E-AD--------ITKLDGLQTAIDKFNDEDL--EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVK 193 (274) Q Consensus 135 ~----------~-~~--------~~~~d~iv~a~~~l~~~~~--~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~ 193 (274) + . +. ...++.|++|...|.+++. .+|+++++|..|+.|+++.. 
+......++..+.+ T Consensus 156 g~~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~--~~~~~~~~~~~~~~ 233 (347) T protein:vir:94 156 GLGTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILAALM--PNAANYAALIDPET 233 (347) T ss_pred CCcccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhccch--hhhhhccccccccc Confidence 0 0 00 1125677788888987764 67999999999999986643 33344445566888 Q ss_pred cccchhcceeeEEcCCCCcc------------------------------------eEEEEcCCeEEEEeccCceeeecc Q lcl|Aclame:pro 194 GAFGEALGAVIVRSNKLNKG------------------------------------EALLAKKGAVKLITKRDFFLEKDR 237 (274) Q Consensus 194 g~~~~i~G~~Vv~s~~~p~~------------------------------------~~~l~~~~a~~~~~~~~~~ve~~r 237 (274) |.+++++|++|++|+++|.+ ...+|++.|++.+..+++++|.+| T Consensus 234 G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r 313 (347) T protein:vir:94 234 GNIRNVMGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDLALERDR 313 (347) T ss_pred cceEEEeceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhcccccccchh Confidence 99999999999999999831 236788899999999999999999 Q ss_pred ccccCccEEEEEEEEEEEEEcCcceEEEEeCCCc Q lcl|Aclame:pro 238 DASRKSTALYSDKHYVAYLYDESKVVKITKGAGD 271 (274) Q Consensus 238 ~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~ 271 (274) ++.++.+.|++++.||+++++|++++.|+.++|. 
T Consensus 314 ~~~~~~d~i~~~~~~G~~~~rP~~a~~~~~~~A~ 347 (347) T protein:vir:94 314 DVDAQGDLIVGKYAMGHGGLRPEAAGALVFSPAE 347 (347) T ss_pred chhhHHHHhhhhhhhcCcccccceeEEEEecCCC Confidence 9999999999999999999999999999999999 No 112 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=100.00 E-value=8.9e-33 Score=196.29 Aligned_cols=261 Identities=15% Similarity=0.106 Sum_probs=194.3 Q ss_pred CC--ccccchhhccchHHHHHHHHHHHHHhhhhccc-ccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccc Q lcl|Aclame:pro 1 MA--QGTTKVSNLIVPEVLAPMMQAELDKKLRFAQF-ADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTS 77 (274) Q Consensus 1 ma--~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l-~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~ 77 (274) ++ ..++..+..++|+.+++.|++.+++.+.+.++ ++. +....| .+++|++...+.+.|++||+.+|+++++|+ T Consensus 130 ~~~~~~~~~~gg~lvP~~~~~~ii~~l~~~~~i~~~~~~~---v~~~~~-~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~ 205 (435) T protein:vir:80 130 MSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGART---LPLSNG-NITIPRLKGGAIVGYIGADTDIPTTQQQFD 205 (435) T ss_pred hhhcccCCCCCccccchhHHHHHHHHHhhhchhhhcccee---eecCCC-ceEEEEEeCCcceeeeccCcccccccccee Confidence 32 23333355799999999999999998888776 332 222233 489999988889999999999999999999 Q ss_pred eeEEeehhhhcchhccHHHHhccC--ccHHHHHHHHHHHHHHHHHHHHHHHHhcccc--c-----------cccCccc-- Q lcl|Aclame:pro 78 KREAKVRKIGKGTELTDEAVLSGF--GDPQGEAVRQHGLAIANKVDNDVLEALKGAT--L-----------TVEADIT-- 140 (274) Q Consensus 78 ~~~~~~~~~~~~~~is~e~~~~s~--~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~--~-----------~~~~~~~-- 140 (274) ++++.+++++..+++|++++.++. +++++++.+++++++++++|+.++..-.++. . ....... 
T Consensus 206 ~i~~~~~k~~~~~~is~ell~ds~~~~~l~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~ 285 (435) T protein:vir:80 206 DLKLTAKKMAALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPGNVITASDGSTL 285 (435) T ss_pred eEEEeeEEEEEeehhhHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCcccceeecccccceeecccccch Confidence 999999999999999999999985 4789999999999999999999996532210 0 0011111 Q ss_pred --CHHHHHHHHHHHhhc--CCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcc--- Q lcl|Aclame:pro 141 --KLDGLQTAIDKFNDE--DLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKG--- 213 (274) Q Consensus 141 --~~d~iv~a~~~l~~~--~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~--- 213 (274) .+.++.++...+... +..+..|+|||.++..|++..+.+.. .......-++|+|+||++++.+|.+ T Consensus 286 ~~~~~d~~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~-------~l~~~~~~~~l~G~pv~~~~~~p~~~~~ 358 (435) T protein:vir:80 286 QKIETDLGKAILALENADANLTQPGWIMAPRTFRFLEGLRDGNGN-------KVYPELANGMLKGYPVGKTTQVPINLGE 358 (435) T ss_pred hhHHHHHHHHHHHhhccccccccCEEEEcHHHHHHHHhhhccCCc-------eeccCCCCCeEeeeeeEEeccccccccC Confidence 234667777776554 34677899999999999876543211 1111122358999999999999863 Q ss_pred -----eEEEEcCCeEEEEeccCceeeeccccc-------------cCccEEEEEEEEEEEEEcCcceEEEEeCCCcc Q lcl|Aclame:pro 214 -----EALLAKKGAVKLITKRDFFLEKDRDAS-------------RKSTALYSDKHYVAYLYDESKVVKITKGAGDE 272 (274) Q Consensus 214 -----~~~l~~~~a~~~~~~~~~~ve~~r~~~-------------~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~ 272 (274) ..++.+.+.+.+..+.++.++..++.. 
++...+++..|+|+++.+|+++++|+...==+ T Consensus 359 ~~~~~~i~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:80 359 AGKESEIYFTDFGDVFIGEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSGVAWGA 435 (435) T ss_pred CCCcceEEEEEcccEEEEeecceEEEEeccccccccccchhhhhhcCcceeeeeeeeCcEeecccceEEEeccCCCC Confidence 234445555556677888888877653 45678999999999999999999999876555 No 113 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=100.00 E-value=9.4e-33 Score=196.17 Aligned_cols=260 Identities=12% Similarity=0.050 Sum_probs=191.7 Q ss_pred CCcccc-chhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCccccccccccee Q lcl|Aclame:pro 1 MAQGTT-KVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKR 79 (274) Q Consensus 1 ma~~~T-~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~ 79 (274) ++..++ ..+..+||+.+.+.|++.+++.+++.++... .+....| .++||++...+.+.|++||+.+|+++++|+++ T Consensus 125 ~~~~~~~~~gg~liP~~~~~~ii~~l~~~~~l~~~~~~--~~~~~~g-~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~i 201 (428) T protein:vir:10 125 MAISTAAGSGGVLIPQNIHSEVIELLRDRTIVRKLGAR--SIPLPNG-NMSLPRLAGGATASYTGENQDAKVSEARFDDV 201 (428) T ss_pred hhhcccccCCccccchhHHHHHHHHHhhhchhhhhcce--eeecCCc-ceEEEEEeCCcceeeeccCccccccccceeeE Confidence 333332 3345789999999999999999888887322 1222233 38999998888899999999999999999999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccc--cc--------------cccCcccCHH Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGA--TL--------------TVEADITKLD 143 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a--~~--------------~~~~~~~~~d 143 (274) ++.+++++..+++|+|++.++.+++++++.+++++++++++|+.++..-.++ +. 
.......+++ T Consensus 202 ~~~~~k~~~~v~is~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~ 281 (428) T protein:vir:10 202 KLTAKTMIAMVPISNALIGRAGFNVEQLVLQDILTAISVREDKAFMRDDGTGDTPIGMKARATQWNRLLPWAADAAVNLD 281 (428) T ss_pred EeeeEEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccccccccHH Confidence 9999999999999999999999999999999999999999999998543221 00 0111223344 Q ss_pred H---HHHHHHHH---hhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcce--- Q lcl|Aclame:pro 144 G---LQTAIDKF---NDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGE--- 214 (274) Q Consensus 144 ~---iv~a~~~l---~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~--- 214 (274) . .++++..+ ...+.....|+|||..+..|++..+.+.. ........++|+|+||++++++|.+. T Consensus 282 ~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~-------~i~~~~~~g~l~G~pv~~~~~~p~~~~~~ 354 (428) T protein:vir:10 282 TIDTYLDSIILMSMDGNSNMISSGWGMSNRTYMKLFGLRDGNGN-------KVYPEMAQGMLKGYPIQRTSAIPANLGEG 354 (428) T ss_pred HHHHHHHHHHHhhhccccccccCEEEEcHHHHHHHHHhhccCCc-------eeccCCCCCeeeceeeEEeccccccccCC Confidence 3 33433332 23344567899999999999876532211 11112233589999999999999643 Q ss_pred -----EEEEcCCeEEEEeccCceeeeccccc-------------cCccEEEEEEEEEEEEEcCcceEEEEeCCC Q lcl|Aclame:pro 215 -----ALLAKKGAVKLITKRDFFLEKDRDAS-------------RKSTALYSDKHYVAYLYDESKVVKITKGAG 270 (274) Q Consensus 215 -----~~l~~~~a~~~~~~~~~~ve~~r~~~-------------~~~~~i~~~~~~~~~v~~~~avv~l~~~aa 270 (274) .++.+.+.+.+....++.++.+++.. 
.+...+++..|+|+++.+|++++.++...= T Consensus 355 ~~~~~i~~gd~s~~~i~~~~~i~i~~~~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 355 GKESEIYFADFNDVVIGEDGNMKVDFSKEASYIDTDGKLVSAFSRNQSLIRVVTEHDIGFRHPEGLVLGTGVLF 428 (428) T ss_pred CccceEEEEecceEEEEEecceEEEeecccccccccccccchhhcchhheeeeeeeCceeeccceEEEEeccCC Confidence 34445555666777788888777642 345688999999999999999999988776 No 114 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=100.00 E-value=1.5e-33 Score=200.53 Aligned_cols=261 Identities=17% Similarity=0.184 Sum_probs=211.0 Q ss_pred CCccccch----------------hhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCccccc Q lcl|Aclame:pro 1 MAQGTTKV----------------SNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIA 64 (274) Q Consensus 1 ma~~~T~~----------------~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~ 64 (274) ||+.+|.. -++++ |+|+.+|.+.++..++++++.... ++ .+|++++||+++.. ++..+. T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~~~al~i-e~~~geV~~~f~~~s~~~~~~~~r-~i--~~g~s~~~~~iG~~-~~~~~~ 75 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGDKLALFL-KVFGGEVLTAFARTSVTTSRHMVR-SI--SSGKSAQFPVLGRT-QAAYLA 75 (344) T ss_pred CccccccccCCcccCCccCCccchhHHHH-HHHHHHHHHHHHHHhhhcccceee-ee--cccceEEEEeecee-EEEeee Confidence 99876652 11366 899999999999999999998764 33 45999999999875 688999 Q ss_pred CCCcccc--cccccceeEEeehhh-hcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc-------- Q lcl|Aclame:pro 65 EGEKIPV--DQIGTSKREAKVRKI-GKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL-------- 133 (274) Q Consensus 65 eg~~~~~--~~~~~~~~~~~~~~~-~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~-------- 133 (274) +|++++. .++..+++++++.+. ...+.|.|.+..++..|+++.+.+++++++++..|+.++..+..+.. 
T Consensus 76 ~G~~l~~t~~~~~~~e~~l~ID~~~y~~~~VdDiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~~~~~~ 155 (344) T protein:vir:10 76 PGENLDDIRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESQYNEN 155 (344) T ss_pred cCCCCCCCCCCcccceEEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Confidence 9999865 468899999999885 45589999999999999999999999999999999999865532110 Q ss_pred -c---------c-------cCc----ccCHHHHHHHHHHHhhcCC--CccEEEEcHHHHHHHHhhhcccccccccccccc Q lcl|Aclame:pro 134 -T---------V-------EAD----ITKLDGLQTAIDKFNDEDL--EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNI 190 (274) Q Consensus 134 -~---------~-------~~~----~~~~d~iv~a~~~l~~~~~--~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~ 190 (274) + . ... ..-++.|++|...|.++++ .+|+++++|+.|+.|+++.. +......+++. T Consensus 156 ~~g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~--~~~~~~~~~~~ 233 (344) T protein:vir:10 156 ITGLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPSSDRVFYCDPDSYSAILAALM--PNAANYAALID 233 (344) T ss_pred cccccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCCEEEeChHHHHHHhhccc--ccccccccccc Confidence 0 0 000 1126678888899988765 67999999999999987654 33344455667 Q ss_pred ccccccchhcceeeEEcCCCCcc-------------------------------eEEEEcCCeEEEEeccCceeeecccc Q lcl|Aclame:pro 191 IVKGAFGEALGAVIVRSNKLNKG-------------------------------EALLAKKGAVKLITKRDFFLEKDRDA 239 (274) Q Consensus 191 ~~~g~~~~i~G~~Vv~s~~~p~~-------------------------------~~~l~~~~a~~~~~~~~~~ve~~r~~ 239 (274) ..+|.+++++|++|+.|+++|.+ ...+|++.|++....+++++|..|++ T Consensus 234 ~~~G~V~~v~G~~V~~Sn~lp~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~~~e~~r~~ 313 (344) T protein:vir:10 234 PEKGSIRNVMGFEVVEVPHLTAGGAGTSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLALERARRA 313 (344) T ss_pred eeeeEEEEEeceEEEeccccccccCCcccccccCccccccCCcccceeeecceeEEEeechhhhhhhhhccceeecccch Confidence 88999999999999999999842 12578889999999999999999999 Q ss_pred ccCccEEEEEEEEEEEEEcCcce--EEEEeC Q lcl|Aclame:pro 240 SRKSTALYSDKHYVAYLYDESKV--VKITKG 268 (274) Q Consensus 240 
~~~~~~i~~~~~~~~~v~~~~av--v~l~~~ 268 (274) .++.+.|++++.||+++++|+++ |+++.+ T Consensus 314 ~~~~d~i~g~~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 314 NFQADQIIAKYAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred hHHHHHHHHHhhcccceecccceEEEEeecC Confidence 99999999999999999999988 666666 No 115 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=100.00 E-value=1.3e-33 Score=200.93 Aligned_cols=264 Identities=16% Similarity=0.156 Sum_probs=212.3 Q ss_pred CCccccchh---------------hccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccC Q lcl|Aclame:pro 1 MAQGTTKVS---------------NLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAE 65 (274) Q Consensus 1 ma~~~T~~~---------------~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~e 65 (274) ||+.++... ++++ |+|+.+|...++..+++.++..... ..+|++++||+++.. ++..+.+ T Consensus 1 ~a~~~~~~~~~~~~g~~~~~~d~~al~i-e~~~geV~~~f~~~s~~~~~~~~r~---i~~G~sv~~~~iG~~-~~~~~~~ 75 (347) T protein:vir:88 1 MANATGGQQIGANQGKGQSAADKLALFL-KVFGGEVLTAFVRRSVTMDKHMVRT---IQNGKSASFPVMGRT-KGYYLAP 75 (347) T ss_pred CCCcccchhhhccCCCCccccchHHHHH-HHHHHHHHHHHHHHhhhhhcccccc---ccCcceEEEeeecce-eeeeecc Confidence 997665433 3566 9999999999999999999987642 256999999999875 6888899 Q ss_pred CCcccc--cccccceeEEeehhh-hcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc--------- Q lcl|Aclame:pro 66 GEKIPV--DQIGTSKREAKVRKI-GKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL--------- 133 (274) Q Consensus 66 g~~~~~--~~~~~~~~~~~~~~~-~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~--------- 133 (274) |.++.. .++..+++.+++.++ ...+.|.|.+..++..|+.+.+.++++++++++.|+.++..+..+.. 
T Consensus 76 g~~l~~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~~~~~~ 155 (347) T protein:vir:88 76 GENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENI 155 (347) T ss_pred ccCCCCCCCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Confidence 998754 578999999999886 45689999999999999999999999999999999999865532210 Q ss_pred c----------ccC---------cccCHHHHHHHHHHHhhcCC--CccEEEEcHHHHHHHHhhhcccccccccccccccc Q lcl|Aclame:pro 134 T----------VEA---------DITKLDGLQTAIDKFNDEDL--EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIV 192 (274) Q Consensus 134 ~----------~~~---------~~~~~d~iv~a~~~l~~~~~--~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~ 192 (274) + ..+ ....++.|++|...|.+++. .+|+++++|+.|..|++... +......++..+. T Consensus 156 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~--~~~~~~~~~~~~~ 233 (347) T protein:vir:88 156 AGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALM--PNAANYAALIDPE 233 (347) T ss_pred CCccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCHHHHHHHhcchh--hhhhhhccccchh Confidence 0 000 01127889999999988765 68999999999999987643 3333334445678 Q ss_pred ccccchhcceeeEEcCCCCcce-----------------------------------EEEEcCCeEEEEeccCceeeecc Q lcl|Aclame:pro 193 KGAFGEALGAVIVRSNKLNKGE-----------------------------------ALLAKKGAVKLITKRDFFLEKDR 237 (274) Q Consensus 193 ~g~~~~i~G~~Vv~s~~~p~~~-----------------------------------~~l~~~~a~~~~~~~~~~ve~~r 237 (274) +|.+++++|++|++|+++|.+. 
.+++++.+++.+...++++|.+| T Consensus 234 ~G~vg~i~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~d~~~e~~r 313 (347) T protein:vir:88 234 TGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERAR 313 (347) T ss_pred cceeeeeccceEEEeecccccccccccccccccccccccccccccccccccccCcEEEEEechhhhhheecccceeeeee Confidence 8999999999999999998311 15577888888888999999999 Q ss_pred ccccCccEEEEEEEEEEEEEcCcceEEEEeCCCc Q lcl|Aclame:pro 238 DASRKSTALYSDKHYVAYLYDESKVVKITKGAGD 271 (274) Q Consensus 238 ~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~ 271 (274) ++.++.+.|++++.||+++++|++++.|+.++|- T Consensus 314 ~~~~~~d~i~~~~~~G~~~~rPe~a~~~~~~~a~ 347 (347) T protein:vir:88 314 RPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) T ss_pred chhhHHHHhhhhhhhcCceeccceEEEEEeCCCC Confidence 9999999999999999999999999887776665 No 116 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=100.00 E-value=4.4e-33 Score=197.99 Aligned_cols=263 Identities=14% Similarity=0.103 Sum_probs=209.2 Q ss_pred CCccccc-------hh--h--ccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcc Q lcl|Aclame:pro 1 MAQGTTK-------VS--N--LIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKI 69 (274) Q Consensus 1 ma~~~T~-------~~--~--~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~ 69 (274) |.+.-.. .+ + +++ |+|+.+|.+.+++.+++.++..... + .+|++++||+++.. 
++..+.+|.++ T Consensus 7 ~~~~~~~~~~~~~~~~d~~~al~l-e~~~geV~~~f~~~s~~~~~~~~r~-i--~~G~tv~i~~ig~~-~~~~~~~g~~l 81 (332) T protein:vir:78 7 FSLPNQANGGARNADYDVRYATAL-KLFSGEVFTAFNNASIFKGLVRSYD-L--RGGKSKQFMFTGKL-SAGYHTPGTPI 81 (332) T ss_pred ccCCccccCCccccccccchhhhh-hhhhhhHHHHHHHHhhhhhcccccc-c--cccceEEEEeccce-eEeeecCCCCC Confidence 4433222 22 2 677 9999999999999999999887642 2 46999999999864 78999999998 Q ss_pred ccc-ccccceeEEeehh-hhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc------------- Q lcl|Aclame:pro 70 PVD-QIGTSKREAKVRK-IGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLT------------- 134 (274) Q Consensus 70 ~~~-~~~~~~~~~~~~~-~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~------------- 134 (274) ... +++.+++++++.+ ....+.|.|.+..++..|+++.+.++.++++++++|+.++..+..+..+ T Consensus 82 ~~~~~~~~~~~~l~ID~~ky~~~~VddiD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~~~~~~~g~~~~ 161 (332) T protein:vir:78 82 VGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGGFHV 161 (332) T ss_pred CCCCCCCCceEEEEEehhhhhHHHHHhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCccccccccccc Confidence 665 6899999999987 4566899999999999999999999999999999999998766543211 Q ss_pred --ccCcc----cCHHHHHHHHHHHhhcCC--CccEEEEcHHHHHHHHhhhccccccccc-cccccccccc-cchhcceee Q lcl|Aclame:pro 135 --VEADI----TKLDGLQTAIDKFNDEDL--EPMVLFVNPLDAGGLRTSASDNFTRPTQ-LGDNIIVKGA-FGEALGAVI 204 (274) Q Consensus 135 --~~~~~----~~~d~iv~a~~~l~~~~~--~~~~~v~~p~~~~~L~~~~~~~~~~~~~-~~~~~~~~g~-~~~i~G~~V 204 (274) ..+.. ..|+.|++|...|.++++ .+|+++++|..|..|++..+.++..... ..++.+++|. 
+++++|++| T Consensus 162 ~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~d~~~~n~~~~~~~~~~~~g~~i~~i~G~~V 241 (332) T protein:vir:78 162 NIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIAGIRI 241 (332) T ss_pred ccCCccccCHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHHhhcCceeeeeeccccccceecceeeeEEeeeEE Confidence 01112 236889999999988876 6789999999999999854444433211 2234567764 889999999 Q ss_pred EEcCCCCcc------------------------eEEEEcCCeEEEEeccCceee---eccccccCccEEEEEEEEEEEEE Q lcl|Aclame:pro 205 VRSNKLNKG------------------------EALLAKKGAVKLITKRDFFLE---KDRDASRKSTALYSDKHYVAYLY 257 (274) Q Consensus 205 v~s~~~p~~------------------------~~~l~~~~a~~~~~~~~~~ve---~~r~~~~~~~~i~~~~~~~~~v~ 257 (274) +.|+++|.. .+++|+++++++....+..++ .+|++.++.+.|++++.||++++ T Consensus 242 ~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~~~~~~~~~d~i~~~~~~G~~v~ 321 (332) T protein:vir:78 242 LKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAMGCGSL 321 (332) T ss_pred EecCccccCcccccccccccccccccccccccceEEeecccceeeeeeeccchhhhhcccchhhhHhhhhhhhhhcCcee Confidence 999999942 247889999999988877665 47889999999999999999999 Q ss_pred cCcceEEEEeC Q lcl|Aclame:pro 258 DESKVVKITKG 268 (274) Q Consensus 258 ~~~avv~l~~~ 268 (274) +|++++.|+.+ T Consensus 322 rPe~~v~l~~a 332 (332) T protein:vir:78 322 RTSVAGSFQAA 332 (332) T ss_pred cccceEEEeeC Confidence 99999999988 No 117 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=100.00 E-value=2.6e-32 Score=193.75 Aligned_cols=263 Identities=12% Similarity=-0.006 Sum_probs=197.8 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccc-ccccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPV-DQIGTSKR 79 (274) Q Consensus 1 
ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~-~~~~~~~~ 79 (274) ++..+++.+..++|+.+.+.|++.+++.+.+.++++.. ..++...+||+....+.+.|++|+++++. .+++|+++ T Consensus 84 ~~~~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~----~~~~~~~~i~~~~~~~~a~~~~E~~~~~~~~~~~f~~i 159 (390) T protein:vir:40 84 IAGNGFAGVTALLPPTVFERVFEDLTVEHPLLSKINFV----NTTATTEWIISVGDVATAWWGPLCAEIKEVLDNGFDKI 159 (390) T ss_pred HhccCcccCcccccHHHHHHHHHHHHhhhhhhhhceee----ecCCceeEEEEEcCCcceeeeccccccCccccccceee Confidence 66667777889999999999999999998888887653 23455688999888889999999999875 58999999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc----------------ccccCcccCHH Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT----------------LTVEADITKLD 143 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~----------------~~~~~~~~~~d 143 (274) ++.+++++..+.+|++++.++..++++++.+++++++++++|+.++.+-.+.. ....+..++++ T Consensus 160 ~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~i~~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~t~~ 239 (390) T protein:vir:40 160 QTGMYKLSAYIPVCNAMLDLGPSWLDQYVRTILGEAMALGLEAGIVNGSGKDQPIGMMRDLNNVTAGEHPVKTATPLTDL 239 (390) T ss_pred EeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhcccCCCccceeeeccccccccccccccccccchh Confidence 99999999999999999999999999999999999999999999986422111 01122334555 Q ss_pred HHHHHHHHHhhc-------CCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEE Q lcl|Aclame:pro 144 GLQTAIDKFNDE-------DLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEAL 216 (274) Q Consensus 144 ~iv~a~~~l~~~-------~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~ 216 (274) +..++...+... .....+|+|||.++..+++.. ...... .| ..+.. 
...+|+||+.++++|+++.+ T Consensus 240 ~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~~~--~~~~d~-~G-~~v~~---~~~~g~pvv~~~~~p~~~i~ 312 (390) T protein:vir:40 240 TPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIYAA--TSYMTP-QG-VWVTG---ILPVPLEIVQSVAVPVGKAV 312 (390) T ss_pred hHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHHHH--hhccCC-CC-ccccc---cCCCceeEEEcCCCCCCcEE Confidence 555554444322 235667999999865433210 000000 01 11111 23579999999999999987 Q ss_pred EEcCCeEEEEeccCceeeecccc--ccCccEEEEEEEEEEEEEcCcceEEEEeCCCcc--cC Q lcl|Aclame:pro 217 LAKKGAVKLITKRDFFLEKDRDA--SRKSTALYSDKHYVAYLYDESKVVKITKGAGDE--VM 274 (274) Q Consensus 217 l~~~~a~~~~~~~~~~ve~~r~~--~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~--~~ 274 (274) +.+.+.+.+..+.++.++.+++. .++.+.+++..|+|+++.+|+|+++++.+++.. .. T Consensus 313 ~Gd~s~~~i~~~~~~~v~~~~~~~f~~~~~~~r~~~r~dg~v~~~~A~~~l~~~~~~~~~~~ 374 (390) T protein:vir:40 313 AGRAKDYFMGIGSEQVIRTSTEYRLLDDETLYYAKQYANGRPKDNSSFLVFDITGLEGSPAI 374 (390) T ss_pred EEeeceEEEEeecceEEEecchhhhhcCcEEEEEEEEeCCEEecccceEEEEeeccCCCCCC Confidence 77776776777888888877655 558899999999999999999999998877732 33 No 118 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=100.00 E-value=2.6e-33 Score=199.20 Aligned_cols=263 Identities=14% Similarity=0.155 Sum_probs=213.8 Q ss_pred CCccccch------------h--h-ccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccC Q lcl|Aclame:pro 1 MAQGTTKV------------S--N-LIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAE 65 (274) Q Consensus 1 ma~~~T~~------------~--~-~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~e 65 (274) |||.+|.. + + +++ |+|+.+|.+.++..++++++.... ++ .+|++++||+++.. 
+++.+.+ T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d~~al~i-e~~~geV~~~f~~~s~~~~~~~~r-ti--~~G~sv~~~~iG~~-~~~~~~~ 75 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGDKLALFL-KVFGGEVLTAFTRTSVTMNKHLVR-SI--QSGKSAQFPVLGRT-KAAYLQP 75 (347) T ss_pred CCccccccccccccccCCcccchHHHHH-HHHhHHHHHHHHHHHhhhhhhhhe-ec--cccceEEeeeccce-eEeeeec Confidence 88766554 1 1 566 999999999999999999998763 23 46999999999875 6899999 Q ss_pred CCcccc--cccccceeEEeehhh-hcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc--------- Q lcl|Aclame:pro 66 GEKIPV--DQIGTSKREAKVRKI-GKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL--------- 133 (274) Q Consensus 66 g~~~~~--~~~~~~~~~~~~~~~-~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~--------- 133 (274) |.++.. .++..+++++++.++ ...+.|.|.+..++..|+.+.+.++++.+++++.|+.++..+..+.. T Consensus 76 G~~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~~~~~ 155 (347) T protein:vir:94 76 GENLDDKRKDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTANNENI 155 (347) T ss_pred CcCCCCCcCCccccceEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Confidence 999854 578999999999986 45588999999999999999999999999999999988865422110 Q ss_pred cc---------------c-----CcccCHHHHHHHHHHHhhcCC--CccEEEEcHHHHHHHHhhhccccccccccccccc Q lcl|Aclame:pro 134 TV---------------E-----ADITKLDGLQTAIDKFNDEDL--EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNII 191 (274) Q Consensus 134 ~~---------------~-----~~~~~~d~iv~a~~~l~~~~~--~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~ 191 (274) .+ . 
.+...|+.|.+|...|.++++ .+++++++|+.|..|++.....+ ......+.+ T Consensus 156 ~g~~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~R~~vv~P~~y~~LLk~~~~~~--~~~~~~~~~ 233 (347) T protein:vir:94 156 AGLGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSDRVFYTTPDNYSAILAALMPNA--ANYQALIDP 233 (347) T ss_pred ccCCcceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCCCCCEEEeChHHHHHHHHhhcccc--ccccccccc Confidence 00 0 011226789999999988765 68999999999999997543322 223334557 Q ss_pred cccccchhcceeeEEcCCCCcc-----------------------------------eEEEEcCCeEEEEeccCceeeec Q lcl|Aclame:pro 192 VKGAFGEALGAVIVRSNKLNKG-----------------------------------EALLAKKGAVKLITKRDFFLEKD 236 (274) Q Consensus 192 ~~g~~~~i~G~~Vv~s~~~p~~-----------------------------------~~~l~~~~a~~~~~~~~~~ve~~ 236 (274) .+|.+++++|++|+.|+++|.+ ..++|++.+++.+...++++|.. T Consensus 234 ~~G~V~~v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~~~~~e~~ 313 (347) T protein:vir:94 234 STGSIRNVMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLKDMALERA 313 (347) T ss_pred ccceeEEeeceEEEEcCccccccCcccccccccccccccccccccccccccccccceEEEEechhhhhhhhhcccceeee Confidence 8899999999999999999842 13678888999888999999999 Q ss_pred cccccCccEEEEEEEEEEEEEcCcceEEEEeCCC Q lcl|Aclame:pro 237 RDASRKSTALYSDKHYVAYLYDESKVVKITKGAG 270 (274) Q Consensus 237 r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa 270 (274) |++.++.+.|.++..||++++||++++.++.++| T Consensus 314 ~~~~~~~~~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 314 RRANFQADQIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred echhhhhhhhhhhhhhcCcccccceeEEEEecCC Confidence 9999999999999999999999999999999999 No 119 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=100.00 E-value=5.6e-33 Score=197.38 Aligned_cols=263 Identities=16% Similarity=0.169 Sum_probs=214.2 Q ss_pred 
CCccccch--------------h--hccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCccccc Q lcl|Aclame:pro 1 MAQGTTKV--------------S--NLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIA 64 (274) Q Consensus 1 ma~~~T~~--------------~--~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~ 64 (274) ||+.++.. . ++++ |+|+.+|.+.++..++++++.... ++ .+|++++||+.+.. ++..+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~al~l-e~f~geV~~~f~~~s~~~~~~~~r-~i--~~gks~~~~~iG~~-~~~~~~ 75 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVVAAGDKLALFL-KVFGGEVLTAFARTSVTTSRHMVR-SI--SSGKSAQFPVLGRT-QAAYLA 75 (345) T ss_pred CcccccchhcccccccccccCCchhHHHH-HHHhHHHHHHHHHHhhhcccceee-ec--cccceEEEeeecce-EEEeee Confidence 88755521 1 2456 899999999999999999998763 33 45899999999865 789999 Q ss_pred CCCccccc--ccccceeEEeehhhh-cchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc-------- Q lcl|Aclame:pro 65 EGEKIPVD--QIGTSKREAKVRKIG-KGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL-------- 133 (274) Q Consensus 65 eg~~~~~~--~~~~~~~~~~~~~~~-~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~-------- 133 (274) +|+++..+ ++..++..+++.+.. ..+.|.|.+..++..|+++.+.+++++++++..|+.++..+..+.. T Consensus 76 ~G~~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~~~ 155 (345) T protein:vir:22 76 PGENLDDKRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESKYNEN 155 (345) T ss_pred cCCCCCCCCCCcccceEEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Confidence 99998664 578899999998764 4589999999999999999999999999999999998865532110 Q ss_pred ------------cccC---------cccCHHHHHHHHHHHhhcCC--CccEEEEcHHHHHHHHhhhcccccccccccccc Q lcl|Aclame:pro 134 ------------TVEA---------DITKLDGLQTAIDKFNDEDL--EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNI 190 (274) Q Consensus 134 ------------~~~~---------~~~~~d~iv~a~~~l~~~~~--~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~ 190 (274) +..+ +...|+.|++|...|.++++ ..++++++|+.|+.|+++.. +......+++. 
T Consensus 156 ~~~~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~~--~~~~~~~~~~~ 233 (345) T protein:vir:22 156 IEGLGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAADRVFYCDPDSYSAILAALM--PNAANYAALID 233 (345) T ss_pred ccccccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCccCCEEEeChHHHHHHhcccc--ccccccccccc Confidence 0001 11237889999999988765 67999999999999987654 33444456666 Q ss_pred ccccccchhcceeeEEcCCCCcc--------------------------------eEEEEcCCeEEEEeccCceeeeccc Q lcl|Aclame:pro 191 IVKGAFGEALGAVIVRSNKLNKG--------------------------------EALLAKKGAVKLITKRDFFLEKDRD 238 (274) Q Consensus 191 ~~~g~~~~i~G~~Vv~s~~~p~~--------------------------------~~~l~~~~a~~~~~~~~~~ve~~r~ 238 (274) ..+|.+++++|++|+.|+++|.+ ...+|++.|++.+..+++++|..|+ T Consensus 234 ~~~G~V~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~ 313 (345) T protein:vir:22 234 PEKGSIRNVMGFEVVEVPHLTAGGAGTAREGTTGQKHVFPANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERARR 313 (345) T ss_pred cccceEEEEeceEEEecccccccccCccccCcccccccccccccceeeeeccCceEEEEEehhheeeeeeecceeeeeec Confidence 88999999999999999998731 2367899999999999999999999 Q ss_pred cccCccEEEEEEEEEEEEEcCcceEEEEeCCC Q lcl|Aclame:pro 239 ASRKSTALYSDKHYVAYLYDESKVVKITKGAG 270 (274) Q Consensus 239 ~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa 270 (274) +.++.+.|++++.||+++++|++++.|+.+-- T Consensus 314 ~~~~~d~I~~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 314 ANFQADQIIAKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred hhHHHHHHHHHHhcCCcccccceeEEEEEeeC Confidence 99999999999999999999999999888777 No 120 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=99.96 E-value=7.2e-32 Score=191.32 Aligned_cols=263 Identities=15% Similarity=0.140 Sum_probs=206.5 Q ss_pred CCccccchhhccchH--HHHHHHHHHHHHh--hhhccccccccccc---ccCCCEEEEEeecCC-CCccc-ccCC---Cc Q lcl|Aclame:pro 1 
MAQGTTKVSNLIVPE--VLAPMMQAELDKK--LRFAQFADIDSTLV---GQPGDTLTFPAFTYS-GDAQV-IAEG---EK 68 (274) Q Consensus 1 ma~~~T~~~~~~iPe--~~~~~v~~~~~~~--~~~~~l~~~~~~~~---~~~G~~v~ip~~~~~-~~a~~-~~eg---~~ 68 (274) || +|+.+|+++|| +|.+|+.++..++ ++.++++..+..+. ..+|+.+++|+|+++ ++.+. +... .. T Consensus 1 Ma--~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~l~g~~e~n~~~dt~~~~ 78 (349) T protein:vir:94 1 MA--ITTIGNIVTGNIPVLASYMTEDPVEKTAFFNSGILTPTPYAAEIARGPSNIANLPFWKAIDTSIEPNYSNDVYQDI 78 (349) T ss_pred CC--ceEEeeeeccChHHHHHHHHHhHHHhhhhhhccceeccHHHHHHHhcCCCEEEeeeeecCCCCcccccCCCCcccc Confidence 88 88899999998 7999999887654 44477777766654 367999999999875 55442 3322 25 Q ss_pred ccccccccceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-------------- Q lcl|Aclame:pro 69 IPVDQIGTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLT-------------- 134 (274) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~-------------- 134 (274) ++..+++.++.......+++.|..+|.....+..|++..+.++++.+|.|..++.+|+.+++.... 
T Consensus 79 ~t~~kit~~~~~a~~~~r~kaw~~~Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~~~~~~~~~~~~ 158 (349) T protein:vir:94 79 ATPRAIQTGEMMARVAYLNEGFGQADLTVELTSQNPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSATDAYHEQNDM 158 (349) T ss_pred cccccccccceeeeeeeeccccchhHHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHHHhhhcccccccccccccCce Confidence 788999999999999999999999999999888999999999999999999999999888653210 Q ss_pred ----ccCcccCHHHHHHHHHHHhhc-----CCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeE Q lcl|Aclame:pro 135 ----VEADITKLDGLQTAIDKFNDE-----DLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIV 205 (274) Q Consensus 135 ----~~~~~~~~d~iv~a~~~l~~~-----~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv 205 (274) ...+.++...+++|..+++++ ......++|||.+++.|++.+..+|...++ ....+++++|++|+ T Consensus 159 ~~d~~~~a~~~~~~~~~A~~~~Gdaa~Gd~~~~lt~i~mHS~v~~~L~~~~li~~i~~s~------~~~~i~ty~G~~Vi 232 (349) T protein:vir:94 159 VVDVSATSGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLIDFIRDAE------NNTMFATYQGYRVI 232 (349) T ss_pred eEEecccCCCChhhHHHHHHHHHHHhccccccceeEEEEchHHHHHHHhcchhhhccCcc------cCcccceecCcEEE Confidence 123346788999999998886 457789999999999999999888876544 23457899999999 Q ss_pred EcCCCC--------cceEEEEcCCeEEEEeccC-ceeeecccccc----CccEEEEEEEEEEEEEcCcceEEEEeC---- Q lcl|Aclame:pro 206 RSNKLN--------KGEALLAKKGAVKLITKRD-FFLEKDRDASR----KSTALYSDKHYVAYLYDESKVVKITKG---- 268 (274) Q Consensus 206 ~s~~~p--------~~~~~l~~~~a~~~~~~~~-~~ve~~r~~~~----~~~~i~~~~~~~~~v~~~~avv~l~~~---- 268 (274) ++|.+| ++++|+|+.+|++|....+ +.+|++|++.+ +++.+..|++ .+++|.++.-.... 
T Consensus 233 vDD~~Pv~~~g~~~~yttylfg~GAi~~~~~~~~~~~E~~rd~~~g~~~G~d~L~~R~~---~~~hp~G~s~~~a~v~~~ 309 (349) T protein:vir:94 233 VDDSMTVVGQDTSRKFISIIFGQGAIGYGEGNPEMPLEYEREASRANGGGVETLWTRKT---WLLHPFGYSFTSAVITGN 309 (349) T ss_pred EeCCCccccCCCCceEEEEEeecceEEeecCCCCcceeeecccccCCcceeEEEEEeeE---EEeeeeeeeecccccCCC Confidence 999999 4678999999999988764 56899999875 4689999999 45666665554321 Q ss_pred -------CCcc--cC Q lcl|Aclame:pro 269 -------AGDE--VM 274 (274) Q Consensus 269 -------aa~~--~~ 274 (274) +||= .. T Consensus 310 ~~~~~~~sPt~aeLa 324 (349) T protein:vir:94 310 GTETIARSASWQDLA 324 (349) T ss_pred ccccccCCCChHHhc Confidence 1110 00 No 121 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=99.96 E-value=8e-32 Score=191.08 Aligned_cols=263 Identities=14% Similarity=0.131 Sum_probs=205.1 Q ss_pred CCccccchhhccchH--HHHHHHHHHHHHh--hhhccccccccccc---ccCCCEEEEEeecCC-CCccc-c-cC--CCc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPE--VLAPMMQAELDKK--LRFAQFADIDSTLV---GQPGDTLTFPAFTYS-GDAQV-I-AE--GEK 68 (274) Q Consensus 1 ma~~~T~~~~~~iPe--~~~~~v~~~~~~~--~~~~~l~~~~~~~~---~~~G~~v~ip~~~~~-~~a~~-~-~e--g~~ 68 (274) || +|+.+|+++|| +|.+|+.++..++ ++.++++..+.++. ..+|+.+++|+|+++ ++.+. + .. ... 
T Consensus 1 Ma--~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~L~g~~e~nv~~D~~~~~ 78 (349) T protein:vir:78 1 MA--ITTIGDIVTGNIPVLASYMTEDPVEKTAFFDSGILTSTPYAAEIANGPSNIANLPFWKAIDTSIEPNYSNDVYQDI 78 (349) T ss_pred CC--ceEEeeeeccCHHHHHHHHHHhhHHhhhhhhccceeccHHHHHHhhcCCCEEEeeeeecCCCCcccccCCCCcccc Confidence 88 88899999998 8999999887654 44477777666554 367999999999875 44432 3 22 235 Q ss_pred ccccccccceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-------------- Q lcl|Aclame:pro 69 IPVDQIGTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLT-------------- 134 (274) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~-------------- 134 (274) ++..+++.++.......+++.|..+|.....+..|++..+.++++.+|.|..++.+|+.+++.... T Consensus 79 ~t~~kitt~~~~a~~~~r~kaw~~~Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~a~~~~~~~~~~ 158 (349) T protein:vir:78 79 ATPRAIQTGEMMARVAYLNEGFGQADLTVELTSQNPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSATDAYHEQNDM 158 (349) T ss_pred cccccccccceeeeeeeeccccchhHHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcccccccchhhhcccc Confidence 688999999999999999999999999988888999999999999999999999999888643210 Q ss_pred ----ccCcccCHHHHHHHHHHHhhc-----CCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeE Q lcl|Aclame:pro 135 ----VEADITKLDGLQTAIDKFNDE-----DLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIV 205 (274) Q Consensus 135 ----~~~~~~~~d~iv~a~~~l~~~-----~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv 205 (274) ...+.++.+.+++|..+|+++ ......++|||.+++.|++.+..+|...++ ....+++++|++|+ T Consensus 159 t~d~s~~a~~~~~~~~dA~~~lgda~~Gd~~~~lt~i~mHS~v~~~L~~~~li~~i~~s~------~~~~i~ty~G~~Vi 232 (349) T protein:vir:78 159 VVDVSATLGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLIDFIRDAE------NNTMFATYQGYRVI 232 (349) T ss_pred eeeeccccCCChhhhhhhHHHHHHHhccccccceeEEEEchHHHHHHHhhhhhhhccCcc------cCcccceecCeEEE Confidence 112236788999999998886 467789999999999999998888876544 23457899999999 Q ss_pred 
EcCCCC--------cceEEEEcCCeEEEEeccC-ceeeecccccc----CccEEEEEEEEEEEEEcCcceEEEEeC---- Q lcl|Aclame:pro 206 RSNKLN--------KGEALLAKKGAVKLITKRD-FFLEKDRDASR----KSTALYSDKHYVAYLYDESKVVKITKG---- 268 (274) Q Consensus 206 ~s~~~p--------~~~~~l~~~~a~~~~~~~~-~~ve~~r~~~~----~~~~i~~~~~~~~~v~~~~avv~l~~~---- 268 (274) ++|.+| ++++|+|+.+|++|....+ ..+|++|++.+ +++.+..|++| +++|.++.-.... T Consensus 233 vDD~~Pv~~~g~~~~yttylfg~GAi~~~~~~~~~~~et~rd~~~g~~~G~d~l~~R~~~---~~hp~G~s~~~a~v~~~ 309 (349) T protein:vir:78 233 VDDSMTVVGQGAQRKFISIIFGQGAIGYGEGNPVMPLEYEREASRANGGGVETLWTRKTW---LLHPFGYRFTSAVITGN 309 (349) T ss_pred EeCCCccccCCCCceEEEEEeecceEEEccCCCccceeeecccccCCcceeEEEEEeeEE---EeeeeeeeeccccccCC Confidence 999999 4578999999999987655 46899999975 56899999994 5566655543321 Q ss_pred -------CCcc--cC Q lcl|Aclame:pro 269 -------AGDE--VM 274 (274) Q Consensus 269 -------aa~~--~~ 274 (274) +||= .. T Consensus 310 ~~~~~~~sPt~aeLa 324 (349) T protein:vir:78 310 GTETIARSASWQDLA 324 (349) T ss_pred ccccccCCCChHHhc Confidence 1110 00 No 122 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=99.96 E-value=1e-31 Score=190.49 Aligned_cols=257 Identities=15% Similarity=0.091 Sum_probs=192.9 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecC-CCCcccccCCCcccc-cccccce Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTY-SGDAQVIAEGEKIPV-DQIGTSK 78 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~-~~~a~~~~eg~~~~~-~~~~~~~ 78 (274) |+..++..+...+|+.+...++++... ..+.+.+... ..++...++|.... .+.+.|++|++..+. 
++++|++ T Consensus 132 ~~~~~~~~~~~~vp~~~~~~i~~~~~~-~~l~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~ 206 (397) T protein:vir:96 132 RDGFTSVEGGALIPQELLQPQLEPKDI-VDLSKYVRSV----PVNSASGKFPVISKSGSKMATVQQLEKNPQLANPKMVE 206 (397) T ss_pred hhcccccccccchhHHHHHHHHHhhhh-hhHHHhhhhc----cccccceeEEEEeccCCccccccccccccccccccccc Confidence 555566667789999999999875443 3344444432 12233455665442 356889999999986 6899999 Q ss_pred eEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcccCHHHHHHHHHHHhhcCCC Q lcl|Aclame:pro 79 REAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAIDKFNDEDLE 158 (274) Q Consensus 79 ~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~~~d~iv~a~~~l~~~~~~ 158 (274) +++.+++++..+++|++++.++.+++.+++.+++++.+++.+|..+++..+.+. ..+..+||++.+++....... . T Consensus 207 i~~~~~~~~~~~~~s~ell~ds~~~l~~~i~~~l~~~~~~~~~~~i~~g~g~~~---~~~~~~~d~~~~~~~~~~~~~-~ 282 (397) T protein:vir:96 207 IDYSVATRRGYIPISQEMIDDASYDVTGLIADEIQDQSLNTKNADIAAVLKTAT---AKSVVGVDGLKDLINKEIKKV-Y 282 (397) T ss_pred eeecHhHhhcchhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc---cccccchHHHHHHHHHhhhhh-c Confidence 999999999999999999999999999999999999999999999887655433 345678999999987654443 3 Q ss_pred ccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcce-----EEEEc-CC-eEEEEeccCc Q lcl|Aclame:pro 159 PMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGE-----ALLAK-KG-AVKLITKRDF 231 (274) Q Consensus 159 ~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~-----~~l~~-~~-a~~~~~~~~~ 231 (274) +..|+|||.+|..|++..+.+. .....+.+.+|..++|+|+||+++++.+.+. 
.++|+ .+ .+.++.+.++ T Consensus 283 ~a~~v~n~~~~~~l~~lkd~~G---~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~ 359 (397) T protein:vir:96 283 DVKLFISASMYSELDKLKDKNG---RYLLQDSITAASGKQLLGKEVVVLDDDVIGKSVGNVVGFIGDAKAFASFFDRKQV 359 (397) T ss_pred CcEEEEcHHHHHHHHHhhccCC---CeEeccCccCCCcccccccceEEecccccCCCCCceEEEEeehhcceEeEeecce Confidence 5789999999999987653221 2222334566777899999999866543222 34554 34 3556777888 Q ss_pred eeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeCCC Q lcl|Aclame:pro 232 FLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAG 270 (274) Q Consensus 232 ~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa 270 (274) .+...++. ++.+.+++..|+|+++.+|+++|+++.++| T Consensus 360 ~~~~~~~~-~~~~~~~~~~r~d~~~~~~~a~~~~~~~~a 397 (397) T protein:vir:96 360 SVSWVDNN-IYGQLLAGIIRYDVKATDKKAGFYVTFTIG 397 (397) T ss_pred EEEEeccc-ccceeEEEEEEEccEEecccceEEEEeecC Confidence 88876654 446789999999999999999999999999 No 123 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=99.96 E-value=1.8e-31 Score=189.19 Aligned_cols=272 Identities=12% Similarity=0.075 Sum_probs=187.0 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCC-CcccccCCCc-----cccccc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSG-DAQVIAEGEK-----IPVDQI 74 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~-~a~~~~eg~~-----~~~~~~ 74 (274) +...++..+.+++|+.+.+.|++.+++.+.+.+++.... +. ..+..++||+....+ .+.|++||+. 
.|.+++ T Consensus 157 ~~~~~~~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~-~~-~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~~ 234 (477) T protein:vir:84 157 LDRNGGTGGYAVPPLWMMNRFIELARAGRTYANLCPTEP-LP-GGTSSINIPKILTGTSTAIQAADNAALTAPSAHEVDL 234 (477) T ss_pred ccccCCCcceeeccchhHHHHHHHhhhcchHHHhhceee-ec-CCcceeEEEEEecCcceeeeeccCccccccccccccc Confidence 222222233456777778999999999888887765432 12 233458999976543 3568999875 467788 Q ss_pred ccceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc--c-----------cccCccc- Q lcl|Aclame:pro 75 GTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT--L-----------TVEADIT- 140 (274) Q Consensus 75 ~~~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~--~-----------~~~~~~~- 140 (274) +|+++++.+++++..+.+|++++.++.+++.+++.+++++++++++|..++..-.++. . +...+.. T Consensus 235 ~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~~p~Gi~~~~~~~~~~~~~~~~t 314 (477) T protein:vir:84 235 TDGFVQANVKTIAGQQGIAIQLLDQAAVSVDEFVFRDLAADYANKLNVQVISGTGSNNQVVGVRATAGITQVTATSAGSA 314 (477) T ss_pred ceeeEEEeeeeEEeeeHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHhccCCCCCccceeeeccccccccccccccc Confidence 9999999999999999999999999999999999999999999999999985432110 0 0111111 Q ss_pred ------CHHHHHHHHHHHhhcCC-CccEEEEcHHHHHHHHhhhcccccccccc----------ccccccccccchhccee Q lcl|Aclame:pro 141 ------KLDGLQTAIDKFNDEDL-EPMVLFVNPLDAGGLRTSASDNFTRPTQL----------GDNIIVKGAFGEALGAV 203 (274) Q Consensus 141 ------~~d~iv~a~~~l~~~~~-~~~~~v~~p~~~~~L~~~~~~~~~~~~~~----------~~~~~~~g~~~~i~G~~ 203 (274) .++.++++...+...+. ...+|+|||.+|..|++..+.+....... 
..+.+..+..++++|+| T Consensus 315 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G~p 394 (477) T protein:vir:84 315 LEKHQIIYQKIADAIQRVHTSRFLEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHGLP 394 (477) T ss_pred hhhHHHHHHHHHHHHhhccccccCCccEEEEcHHHHHHHHHhhccCCCeeeecCcccccccccccccccccccchhcccc Confidence 24566777776666554 45689999999999987654332111111 11235566778999999 Q ss_pred eEEcCCCCcce-------EEEEcCCeEEEEeccCceee--eccccccCccEEEEEEEEEEEEE-cCcceEEEEeCCCccc Q lcl|Aclame:pro 204 IVRSNKLNKGE-------ALLAKKGAVKLITKRDFFLE--KDRDASRKSTALYSDKHYVAYLY-DESKVVKITKGAGDEV 273 (274) Q Consensus 204 Vv~s~~~p~~~-------~~l~~~~a~~~~~~~~~~ve--~~r~~~~~~~~i~~~~~~~~~v~-~~~avv~l~~~aa~~~ 273 (274) |++++.+|.+. .++|+.++-.+.....+.++ +++........+++..++++..+ +|+++|.+|.++.++- T Consensus 395 Vv~s~~~p~~~~~~~d~~~i~~gd~~~~~i~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~r~~~afv~~t~~~~~~~ 474 (477) T protein:vir:84 395 VVTDPTLPTTLGTGTDQDVIHVLRASDLALFESSVRMRALQETRAENLSVLLQVYGYLAFTAARFPQSVVEIGGTALTAP 474 (477) T ss_pred eEecCcccccccccCCcceEEEEEeceEEEEeeceeEEeccccccccceeeeeehhhhhhhhhccccceEEeeccccccc Confidence 99999999642 34454433223223344444 34445556666777767777665 5999999999999988 Q ss_pred C Q lcl|Aclame:pro 274 M 274 (274) Q Consensus 274 ~ 274 (274) - T Consensus 475 ~ 475 (477) T protein:vir:84 475 T 475 (477) T ss_pred c Confidence 7 No 124 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=99.95 E-value=5.5e-31 Score=186.49 Aligned_cols=265 Identities=11% Similarity=0.041 Sum_probs=193.5 Q ss_pred CCcccc-chhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCccccccccccee Q lcl|Aclame:pro 1 MAQGTT-KVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKR 79 (274) Q Consensus 1 ma~~~T-~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~ 79 (274) +....+ 
.....++|+.+.+.|++.+++...+.++++... .+| ..++|.....+.+.|++||+++++.+++|+++ T Consensus 148 ~~~~~~~~g~~~~vP~~~~~~i~~~l~~~~~l~~~~~v~~----~~g-~~~~~~~~~~~~a~wv~E~~~~~~~~~~f~~i 222 (466) T protein:vir:80 148 AQQKRAVSGAELTIPDVMLELLRDNMHRYSKLISKVRLRP----LKG-TARQNIAGAIPEGVWTEAVANLNELSLSFSQI 222 (466) T ss_pred hhhhhhhccccccccHHHHHHHHHhhhhhhhhhhheeeee----cCc-eeEeeeecCCcceeecccccccccccccccce Confidence 222222 223479999999999999999888888776432 233 47888888888899999999999999999999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccc------------c-------Cccc Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTV------------E-------ADIT 140 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~------------~-------~~~~ 140 (274) ++.+++++..+.+|++++.++.+++.+++.+++++++++.+|+.++.+..+.-..+ . ...+ T Consensus 223 ~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~ail~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 302 (466) T protein:vir:80 223 EVDGYKVGGFIPIPNSTLEDSDLNLADEILDAIGQAIGFALDKAILYGTGTKMPVGIVTRLAQTTQPPNWGTKAPAWTNL 302 (466) T ss_pred eecceeeeeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhheeeccCCCCcceeeeccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999986432211000 0 0001 Q ss_pred CHHHH-----------------HHHHHHHhhcCCCc-cEEEEcHHHHHHHHhhhccccccccccccccccccccchhcce Q lcl|Aclame:pro 141 KLDGL-----------------QTAIDKFNDEDLEP-MVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGA 202 (274) Q Consensus 141 ~~d~i-----------------v~a~~~l~~~~~~~-~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~ 202 (274) +...+ +.+...+......+ .+|+||+..+..|++....... .+... 
...+.-..++|+ T Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~l~~~~~~~~~-~g~~~---~~~~~~~~i~G~ 378 (466) T protein:vir:80 303 STTNLLKIDPTGKSAEEFFSELVLKLSKARANYSNGMKFWAMSSNTHAVLMSKAITFNS-AGALV---ASLNNTMPIVGG 378 (466) T ss_pred chhhhhhhhhhccchhhHHHHHHHHHHhhhccccCCceeEEecchhHHHhhcccccccC-Ccccc---ccCCCccccccc Confidence 11111 11222223333444 4699999999988765421111 11111 111112358999 Q ss_pred eeEEcCCCCcceEEEEcCCeEEEEeccCceeeeccccc--cCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 203 VIVRSNKLNKGEALLAKKGAVKLITKRDFFLEKDRDAS--RKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 203 ~Vv~s~~~p~~~~~l~~~~a~~~~~~~~~~ve~~r~~~--~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) ||+.++++|.++.++.+...+.+..+.++.++.+++.. ++.+.+++..|+|+++.+|+++|+++.+..+... T Consensus 379 pvv~s~~~~~~~~~~g~~~~y~i~~r~~~~i~~~~~~~f~~d~~~~r~~~r~dg~~~~~~afv~~~~~~~~~~~ 452 (466) T protein:vir:80 379 DIVILDFIPDNDIIGGYGSLYLLAERADIKLAQSEHVRFIEDQTVFKGTARYDGKPVFGEGFVAVNIANANPTT 452 (466) T ss_pred ceeecCccCccceeeeccccEEEEeecceEEEechhhhhhcCcEEEEEEEEEccEEeccCceEEEEecCCCccc Confidence 99999999999988877777778888899888877665 5778899999999999999999999888765544 No 125 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=99.95 E-value=1.7e-31 Score=189.34 Aligned_cols=265 Identities=12% Similarity=0.001 Sum_probs=206.9 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCccc-cccccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIP-VDQIGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~-~~~~~~~~~ 79 (274) +...++..+..++|+.+.+.|++.+.+.+.+.+++++. 
+.+|+ +++|+....+.+.|++|+++++ ..+++|+++ T Consensus 79 ~~~~~~~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~v~----~~~~~-~~~~~~~~~~~a~w~~e~~~~~~~~~~~f~~i 153 (377) T protein:vir:98 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFK----NTSLR-LKALTAETSGTAVWGDIFGEIKGQLKQAFKEQ 153 (377) T ss_pred HhccCCCCCccccCHHHHHHHHHHHHHhhhhhhheeeE----ecCcc-eEEEEecCCcceeEeecccccCcccCccceeE Confidence 66666666778999999999999999998888888653 23454 7899988888999999988876 468999999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc---------------c---ccCcccC Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL---------------T---VEADITK 141 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~---------------~---~~~~~~~ 141 (274) ++..++++..+.+|++++.++.+|+++++.+++++++++.+|++++.+-...-. . ....... T Consensus 154 ~l~~~kl~a~~~is~elL~ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~ 233 (377) T protein:vir:98 154 DFSQFKLTAFVVIPKDALKFGPKWIKQFITEQLKEAIAVALELAIVKGDGLLQPVGLLKDLSQPTVDQSTGRDITTYKTD 233 (377) T ss_pred eecceeEEeeecccHHhhhccHhHHHHHHHHHHHHHHHHHHhhceEeccCCCcceeeeecccccccccccccccccccch Confidence 999999999999999999999999999999999999999999999854322110 0 0011122 Q ss_pred HHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhcccc--ccccccc-----c----ccccccccchhccee--eEEcC Q lcl|Aclame:pro 142 LDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNF--TRPTQLG-----D----NIIVKGAFGEALGAV--IVRSN 208 (274) Q Consensus 142 ~d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~--~~~~~~~-----~----~~~~~g~~~~i~G~~--Vv~s~ 208 (274) .+.+.++...+...+.....|+||+.++..+++.++.+. +...... . 
....+|+..+++|+| |+.++ T Consensus 234 ~~~~~~l~~~~~~~~~~~a~~~m~~~t~~~~~klkd~~G~~i~~~n~~~~~~~~p~~~~~~~~G~~~t~lg~p~~vv~s~ 313 (377) T protein:vir:98 234 KEAIADLSDLTPDNAPKKLVPVMKHLSVNDKKRPLKIAGQVKLILNPEDRWALEAQFTSRNQFGEYVTVLPHGITILESL 313 (377) T ss_pred hhhHhhhhhhchhHHHHHHHHHHHHHHHHHHhhhhccCCceEEEecccchhhccccccccCCCCccccccCCCceEEecC Confidence 456777777777777777789999999888876544332 2210100 0 112456777888887 67899 Q ss_pred CCCcceEEEEcCCeEEEEeccCceeeecccc--ccCccEEEEEEEEEEEEEcCcceEEEEeCCC Q lcl|Aclame:pro 209 KLNKGEALLAKKGAVKLITKRDFFLEKDRDA--SRKSTALYSDKHYVAYLYDESKVVKITKGAG 270 (274) Q Consensus 209 ~~p~~~~~l~~~~a~~~~~~~~~~ve~~r~~--~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa 270 (274) ++|++++++.+.+.+.+..+.++.++.+++. .++++.++++.|+|+++++|+++++|+.+.+ T Consensus 314 ~~p~~~i~fgdf~~Y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dg~~~~~~a~~vl~i~~~ 377 (377) T protein:vir:98 314 AVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred CCCcccEEEEEecceeEEeecceEEEeechhhhhcCceEEEEEEEEcCEEeccCcEEEEEEecC Confidence 9999998888888888888888888876554 4688899999999999999999999999999 No 126 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=99.95 E-value=8.1e-31 Score=185.56 Aligned_cols=270 Identities=14% Similarity=0.086 Sum_probs=196.0 Q ss_pred CCc-----------cccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcc Q lcl|Aclame:pro 1 MAQ-----------GTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKI 69 (274) Q Consensus 1 ma~-----------~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~ 69 (274) ||+ -.|+....|+||+|+..+++.+++++++.+++.. ..+.+..|++++||+++. 
+++.++.+|.++ T Consensus 1 ~~~~~~~~~~~~~~~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~-~~~~~~~GdTV~ip~~g~-~~a~d~~~g~~i 78 (381) T protein:vir:80 1 MATIQGTGGYKGSAVDLSNVQVFIPEVWSSEVRMFRDQKFAALEATKK-IPFEGKKGDLIHIPNISR-AAVYDKQPQTPV 78 (381) T ss_pred CceecccccccCcccchhhHHhhhhHHHHHHHHHHHHHhhhhhhcccc-ccceeecCceEEeeccCc-ceeeeecCCCcc Confidence 442 1234446788999999999999999999888753 456667799999999985 578999999999 Q ss_pred cccccccceeEEeehhh-hcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc--------------- Q lcl|Aclame:pro 70 PVDQIGTSKREAKVRKI-GKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL--------------- 133 (274) Q Consensus 70 ~~~~~~~~~~~~~~~~~-~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~--------------- 133 (274) +.++++..++++++.+. ...+.|++++..++..|+.+.+.++++.++++++|+.++..+..... T Consensus 79 ~~~~~~~~~~~itID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~~t~~~~i~~ 158 (381) T protein:vir:80 79 NLQARTDSEFTFTVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRIYSYDTTLGD 158 (381) T ss_pred cccccCCceEEEEEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccccc Confidence 99999999999999764 55699999999999999999999999999999999999876532111 Q ss_pred -------cccCcccCHHHHHHHHHHHhhcCC--CccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceee Q lcl|Aclame:pro 134 -------TVEADITKLDGLQTAIDKFNDEDL--EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVI 204 (274) Q Consensus 134 -------~~~~~~~~~d~iv~a~~~l~~~~~--~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~V 204 (274) +......+++.|++|...|++++. ++++++++|+.++.|+++. 
++......++..+++|.+++++|++| T Consensus 159 ~~~~~~~t~~~~~~t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~~--~~~~ad~~~~~~l~~G~Ig~i~G~~V 236 (381) T protein:vir:80 159 GTVNAHLTGTPAPLTYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLSIN--QFISVDFSQVKPVTSGVVGTILGMEV 236 (381) T ss_pred cccccccccchhhHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhhch--hhhhhhhccchhhhceeeeEEcceEE Confidence 011223468899999999998875 7899999999999999874 34433344556799999999999999 Q ss_pred EEcCCCCcceEEEEcCCeEEEEeccCce--eeeccccccCccEEEEEEEEEEEEEcCcceEEEEe--------------- Q lcl|Aclame:pro 205 VRSNKLNKGEALLAKKGAVKLITKRDFF--LEKDRDASRKSTALYSDKHYVAYLYDESKVVKITK--------------- 267 (274) Q Consensus 205 v~s~~~p~~~~~l~~~~a~~~~~~~~~~--ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~--------------- 267 (274) ++|+++|.+....+...+..-....+.. .....+.......++..+.||.++...-..+..-. T Consensus 237 v~Sn~lp~~~~t~~~~~agap~~~~~~~~~~~~~g~~s~~a~av~~~k~yd~~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 316 (381) T protein:vir:80 237 IVTTQIGINSLTGYVNGQGAPTQPTPGVLGSPYLPDQAGTANVVNTGSASDLAVSLSYFGLPVFSGAGATAADGGQTLGS 316 (381) T ss_pred EeecccccccccceeeeccccccccccccccccccccccceeeeeeeeeeceeeeeeeccceeeecceeeecCCCceeee Confidence 9999999865433222221111111111 11112233456778888888888764444443211 Q ss_pred -----CCCcccC Q lcl|Aclame:pro 268 -----GAGDEVM 274 (274) Q Consensus 268 -----~aa~~~~ 274 (274) ..+++|+ T Consensus 317 ~~~~~~~~~~~~ 328 (381) T protein:vir:80 317 FGGANRWATAVV 328 (381) T ss_pred ehhhhhhhhhcc Confidence 2234444 No 127 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=99.95 E-value=2.6e-30 Score=182.77 Aligned_cols=258 Identities=14% Similarity=0.044 Sum_probs=195.0 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccc-ccccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPV-DQIGTSKR 79 (274) Q Consensus 1 
ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~-~~~~~~~~ 79 (274) |...++..+..++|+.+.+.|++.+.+.+.+++++++. +.+|+ .+||+....+.+.|++|+++++. .+++|+++ T Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~----~~~~~-~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i 150 (381) T protein:vir:10 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIK----NAGLR-LKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) T ss_pred HhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeE----ecCcc-eEEEEecCCcceeeecccccccccccccceee Confidence 65666667778999999999999999999999988653 23444 78999888889999999998875 48999999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc-----------cc---------cCcc Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL-----------TV---------EADI 139 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~-----------~~---------~~~~ 139 (274) ++.+++++..+++|++++.++.+++++++.+++++++++.+|++++.+-...-. .. .... T Consensus 151 ~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~~t 230 (381) T protein:vir:10 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) T ss_pred eecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEeccCCCCceeeeeccCcccccccccccccccccc Confidence 999999999999999999999999999999999999999999998854321100 00 0111 Q ss_pred -------cCHHHHHHHHHHHhhc-------CCCccEEEEcHHHHHHHHhhhccccccccccccccccccccch--hccee Q lcl|Aclame:pro 140 -------TKLDGLQTAIDKFNDE-------DLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGE--ALGAV 203 (274) Q Consensus 140 -------~~~d~iv~a~~~l~~~-------~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~--i~G~~ 203 (274) ..++.+.+....+... +.....|+|||.++..|++...... . +|..-. 
.+|.+ T Consensus 231 ~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~~----~-------~G~~v~~l~~g~~ 299 (381) T protein:vir:10 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN----A-------NGVYVTALPFNLN 299 (381) T ss_pred cccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccCC----C-------CCceeecCCCCce Confidence 1234555555555321 2244579999999999876542111 1 122112 24777 Q ss_pred eEEcCCCCcceEEEEcCCeEEEEeccCceeeeccc--cccCccEEEEEEEEEEEEEcCcceEEEEeCC--CcccC Q lcl|Aclame:pro 204 IVRSNKLNKGEALLAKKGAVKLITKRDFFLEKDRD--ASRKSTALYSDKHYVAYLYDESKVVKITKGA--GDEVM 274 (274) Q Consensus 204 Vv~s~~~p~~~~~l~~~~a~~~~~~~~~~ve~~r~--~~~~~~~i~~~~~~~~~v~~~~avv~l~~~a--a~~~~ 274 (274) |+.++.+|++++++.+.+.+.+..+.++.++.+++ ..++++.++++.|+|+++++|+|+++++.+. ++.+. T Consensus 300 vv~s~~~p~~~iifgDfs~Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~~~~~ 374 (381) T protein:vir:10 300 VIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPAL 374 (381) T ss_pred EEecCCCCcCcEEEEecccEEEEEecccEEEeechhHhhcCCeEEEEEEEEcCEEecCceEEEEEEEecCCCcCc Confidence 99999999999888887778788888888877655 4568889999999999999999998855443 44444 No 128 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=99.95 E-value=2.6e-30 Score=182.77 Aligned_cols=258 Identities=14% Similarity=0.044 Sum_probs=195.0 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccc-ccccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPV-DQIGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~-~~~~~~~~ 79 (274) |...++..+..++|+.+.+.|++.+.+.+.+++++++. +.+|+ .+||+....+.+.|++|+++++. 
.+++|+++ T Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~----~~~~~-~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i 150 (381) T protein:vir:95 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIK----NAGLR-LKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) T ss_pred HhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeE----ecCcc-eEEEEecCCcceeeecccccccccccccceee Confidence 65666667778999999999999999999999988653 23444 78999888889999999998875 48999999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc-----------cc---------cCcc Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL-----------TV---------EADI 139 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~-----------~~---------~~~~ 139 (274) ++.+++++..+++|++++.++.+++++++.+++++++++.+|++++.+-...-. .. .... T Consensus 151 ~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~~t 230 (381) T protein:vir:95 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) T ss_pred eecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEeccCCCCceeeeeccCcccccccccccccccccc Confidence 999999999999999999999999999999999999999999998854321100 00 0111 Q ss_pred -------cCHHHHHHHHHHHhhc-------CCCccEEEEcHHHHHHHHhhhccccccccccccccccccccch--hccee Q lcl|Aclame:pro 140 -------TKLDGLQTAIDKFNDE-------DLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGE--ALGAV 203 (274) Q Consensus 140 -------~~~d~iv~a~~~l~~~-------~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~--i~G~~ 203 (274) ..++.+.+....+... +.....|+|||.++..|++...... . +|..-. 
.+|.+ T Consensus 231 ~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~~----~-------~G~~v~~l~~g~~ 299 (381) T protein:vir:95 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN----A-------NGVYVTALPFNLN 299 (381) T ss_pred cccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccCC----C-------CCceeecCCCCce Confidence 1234555555555321 2244579999999999876542111 1 122112 24777 Q ss_pred eEEcCCCCcceEEEEcCCeEEEEeccCceeeeccc--cccCccEEEEEEEEEEEEEcCcceEEEEeCC--CcccC Q lcl|Aclame:pro 204 IVRSNKLNKGEALLAKKGAVKLITKRDFFLEKDRD--ASRKSTALYSDKHYVAYLYDESKVVKITKGA--GDEVM 274 (274) Q Consensus 204 Vv~s~~~p~~~~~l~~~~a~~~~~~~~~~ve~~r~--~~~~~~~i~~~~~~~~~v~~~~avv~l~~~a--a~~~~ 274 (274) |+.++.+|++++++.+.+.+.+..+.++.++.+++ ..++++.++++.|+|+++++|+|+++++.+. ++.+. T Consensus 300 vv~s~~~p~~~iifgDfs~Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~~~~~ 374 (381) T protein:vir:95 300 VIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPAL 374 (381) T ss_pred EEecCCCCcCcEEEEecccEEEEEecccEEEeechhHhhcCCeEEEEEEEEcCEEecCceEEEEEEEecCCCcCc Confidence 99999999999888887778788888888877655 4568889999999999999999998855443 44444 No 129 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=99.95 E-value=1.6e-30 Score=184.00 Aligned_cols=265 Identities=15% Similarity=0.115 Sum_probs=199.4 Q ss_pred CCccccch-hh-ccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccce Q lcl|Aclame:pro 1 MAQGTTKV-SN-LIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSK 78 (274) Q Consensus 1 ma~~~T~~-~~-~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~ 78 (274) |+.+.-+. .. .|+||+|+.+++..+.++++...+..+.. + ..|++|+||.++. 
+...+|.++++++.++++..+ T Consensus 1 ~~~~n~ts~~qafi~~EiWsa~il~~l~~~Lv~~~~~~~~d-~--g~GDtV~InsIg~-~tV~dY~~~~~i~~d~ltt~~ 76 (322) T protein:vir:31 1 MSTGNNTSNTQALIVSEIWADEIEDILHEKLLDVNIARVVD-F--PDGDKLTIPSVGT-PVVRSRPEQGDFTFDNLDTGE 76 (322) T ss_pred CCCCCCcccceEEeehhhhHHHHHHHhhhhhhhhhhhcccc-c--CCCCeEEeccccc-cccccccCCCCcccccCCCce Confidence 99644333 22 45699999999999999999888765422 2 3599999999986 579999999999999999999 Q ss_pred eEEeehhh-hcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc---------c-----------cccC Q lcl|Aclame:pro 79 REAKVRKI-GKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT---------L-----------TVEA 137 (274) Q Consensus 79 ~~~~~~~~-~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~---------~-----------~~~~ 137 (274) +++.+.+- +..|.++| +..+...++++...++++++++..+|+.+.+.+..+. . +++. T Consensus 77 ~~l~IDq~KYfaf~VdD-D~~Qa~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~~p~vin~~~~~iv~~gt~ 155 (322) T protein:vir:31 77 ISIILRDEVYAGNAISK-KLRQDSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQNDPNVINGVPHRFVGTGTD 155 (322) T ss_pred EEEEEehhhhhccccch-hHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCcceecCCccceeccCCC Confidence 99999874 55699999 7788899999999999999999999998876554221 1 1233 Q ss_pred cccCHHHHHHHHHHHhhcCC--CccEEEEcHHHHHHHHhh-------hccccccccccccccccccc--cchhcceeeEE Q lcl|Aclame:pro 138 DITKLDGLQTAIDKFNDEDL--EPMVLFVNPLDAGGLRTS-------ASDNFTRPTQLGDNIIVKGA--FGEALGAVIVR 206 (274) Q Consensus 138 ~~~~~d~iv~a~~~l~~~~~--~~~~~v~~p~~~~~L~~~-------~~~~~~~~~~~~~~~~~~g~--~~~i~G~~Vv~ 206 (274) ..+.|+.|+++..+|.++++ .+|++||+|..+..|... 
++.+|......| ...|+ +|+++|+.|++ T Consensus 156 ~~~ay~~lv~l~~kLdkanVP~~gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG---~a~g~~~Vg~~~GF~V~~ 232 (322) T protein:vir:31 156 QTMDVTDFSRVNYVMTQSKMPMGGMIGIIDPSVAHHLETITNISNISNNPRWEGIVESG---IAPDMQFVRSVYGIDLFV 232 (322) T ss_pred chhhHHHHHHHHHHhccccCCCCCeEEEeCchhhhhhhhhhhhhhhhcccccccccccc---chhhHHHHHHHhceeeee Confidence 45689999999999998876 689999999998766332 122333222222 23343 89999999999 Q ss_pred cCCCCcce--EEEEcCCe---------------------EEEEeccCceeeeccccccCccEEEEEEEEEEEEEcCcceE Q lcl|Aclame:pro 207 SNKLNKGE--ALLAKKGA---------------------VKLITKRDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVV 263 (274) Q Consensus 207 s~~~p~~~--~~l~~~~a---------------------~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv 263 (274) |+++|.+. ++..+.++ ++.+ ++-.+.|..|+.+++.+.+++++|||.++++|+.++ T Consensus 233 SN~l~~~~~~i~aG~d~~~t~ag~~n~f~~~~~~~~~~~~~~~-~~l~~~e~~r~~~~~~d~~~~~~~~g~g~~r~e~l~ 311 (322) T protein:vir:31 233 SNLLADANETINAGGDARSTTAGKCNMFMNVSDMGLLPFVVAW-KEMPTTKSFIDDYNDDLNTATTARWGNGLVRDENLV 311 (322) T ss_pred eccccccccccccCcccccccceeecccccccchhhhhhhhHh-hhhhhhhcccCccccccceeeeeeecceeecccceE Confidence 99997433 11111111 1111 112255889999999999999999999999999999 Q ss_pred EEEeCCCcccC Q lcl|Aclame:pro 264 KITKGAGDEVM 274 (274) Q Consensus 264 ~l~~~aa~~~~ 274 (274) .|.-.++-.-. 
T Consensus 312 ~~~a~~~~~~~ 322 (322) T protein:vir:31 312 CVLANADKVTF 322 (322) T ss_pred EEEeccccccC Confidence 99887776666 No 130 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=99.95 E-value=4.2e-30 Score=181.64 Aligned_cols=258 Identities=14% Similarity=0.071 Sum_probs=193.9 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCccc-cccccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIP-VDQIGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~-~~~~~~~~~ 79 (274) |...++..+..++|+.+++.|++.+++.+.+++++++. ..+| ..+||+....+.+.|+.|+.+++ ..+++|+++ T Consensus 86 ~~~~t~~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v~----~~~~-~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i 160 (395) T protein:vir:95 86 INYDVGYTDEKILPETVVERVFDDLQKDHPLLSKINFQ----NAGI-KTRVIKADPAGQAVWGKVFGEIKGQLDAAFREE 160 (395) T ss_pred HhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeE----ecCC-ceEEEEecCCcceEEeecccccCccccccceee Confidence 55556666678999999999999999999999988653 2344 47899998888999998877775 578999999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccc---cc---c------------ccCcccC Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGA---TL---T------------VEADITK 141 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a---~~---~------------~~~~~~~ 141 (274) ++.+++++..+++|++++.++.+++++++.+++++++++++|+.++.+-... +. . 
..++.++ T Consensus 161 ~l~~~kl~~~~~iS~ell~ds~~~ie~~i~~~la~~ia~~~~~a~i~G~G~~~~qP~Gil~~~~~~~~~~~~~~~~~~~t 240 (395) T protein:vir:95 161 NFTQYKLTCFVVLPDDLSTFGPAWIERFVRTQIQEAISVALESAIINGGGAAKTQPVGLMKDVNTNSGAVTDKASSGTLT 240 (395) T ss_pred eeceeeEEEeecccHHHHhcchhHHHHHHHHHHHHHHHHHHhhheeeccCCCCcCceeeeecccccccccccccccchhh Confidence 9999999999999999999999999999999999999999999998543321 10 0 0112223 Q ss_pred HHHHHHHHHHHhh--------------cCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhc--ceeeE Q lcl|Aclame:pro 142 LDGLQTAIDKFND--------------EDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEAL--GAVIV 205 (274) Q Consensus 142 ~d~iv~a~~~l~~--------------~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~--G~~Vv 205 (274) ++++..+...+.. .......|+|||.++..+... ++... .+|...+++ |+||+ T Consensus 241 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~~~~~g~----~~~~~-------~~G~~~~~lg~g~~v~ 309 (395) T protein:vir:95 241 FADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDSWDVQAR----YTYLT-------ANGGFVTVLPYNVTII 309 (395) T ss_pred hhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhhhhcCCc----ceecc-------CCCcceeccCCcceEE Confidence 4433333222221 122445799999998765422 22111 234455665 66789 Q ss_pred EcCCCCcceEEEEcCCeEEEEeccCceeeecccc--ccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 206 RSNKLNKGEALLAKKGAVKLITKRDFFLEKDRDA--SRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 206 ~s~~~p~~~~~l~~~~a~~~~~~~~~~ve~~r~~--~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) .++.+|++++++.+.+.+.+..+.++.++.+++. .++++.++++.|+|+++++|+|++.|+.+.+++-. 
T Consensus 310 ~~~~~p~~~i~fgdfs~y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dg~~~~~~A~~~l~i~~~~~~~ 380 (395) T protein:vir:95 310 TSEFVPEGKLVAFVTDRYNAVRGGGLTVKKFDQTLALEDAVLFTAKTFAYGQPDDNKASAVYDLKVASAPR 380 (395) T ss_pred EcCCCCCCcEEEEecccEEEEEecceEEEeccchhhhCCcEEEEEEEEECCEEeccccEEEEEeeccCCCC Confidence 9999999997777766666677778877766654 45788999999999999999999999999888776 No 131 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=99.95 E-value=6.6e-30 Score=180.58 Aligned_cols=254 Identities=14% Similarity=0.027 Sum_probs=195.2 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccc-ccccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPV-DQIGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~-~~~~~~~~ 79 (274) +...++..+..++|+.+.+.|++.+.+.+.+++++++.+ .+| ..+||+....+.+.|++|+++++. 
.+++|+++ T Consensus 79 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~----~~~-~~~i~~~~~~~~a~wv~e~~~~~~~~~~~f~~i 153 (377) T protein:vir:96 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKN----TSL-RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQ 153 (377) T ss_pred HhcCCCCCCceecCHHHHHHHHHHHHhhhhhhhhceeEe----cCC-ceEEEEecCCcceeEeecccccccccCccceeE Confidence 444555556789999999999999999999999887533 233 478999888889999999998865 58999999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc---------------cc---------- Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT---------------LT---------- 134 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~---------------~~---------- 134 (274) ++.+++++..+++|++++.++.+++++++.+++++++++.+|++++..-...- .+ T Consensus 154 ~l~~~kl~~~~~is~~ll~ds~~~le~~i~~~l~~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 233 (377) T protein:vir:96 154 DFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTD 233 (377) T ss_pred eeeeeeEEeechhhHHHhhcchhhHHHHHHHHHHHHHHHHHhhceEeccCCCcceeeeeccccccccccccccccceeec Confidence 99999999999999999999999999999999999999999999985322110 00 Q ss_pred ----ccCcccCHHHHHHHHHHHhhcC-----------CCccEEEEcHHHHHHHHhhhccccccccccccccccccccchh Q lcl|Aclame:pro 135 ----VEADITKLDGLQTAIDKFNDED-----------LEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEA 199 (274) Q Consensus 135 ----~~~~~~~~d~iv~a~~~l~~~~-----------~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i 199 (274) ......+.+.+++.+..+.... ....+|+|||.++..++.. +.... 
.+|...++ T Consensus 234 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~----~~~~~-------~~G~~~~~ 302 (377) T protein:vir:96 234 KEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAK----FTSRN-------QFGEYVTV 302 (377) T ss_pred cccccccccCChhHHHHHHHHHHHhhccccccccccccCceEEEEchhhHHhcccc----ccccC-------CCCCceec Confidence 0011234566666554443221 1345699999998876421 11111 23445567 Q ss_pred ccee--eEEcCCCCcceEEEEcCCeEEEEeccCceeeecccc--ccCccEEEEEEEEEEEEEcCcceEEEEeCCC Q lcl|Aclame:pro 200 LGAV--IVRSNKLNKGEALLAKKGAVKLITKRDFFLEKDRDA--SRKSTALYSDKHYVAYLYDESKVVKITKGAG 270 (274) Q Consensus 200 ~G~~--Vv~s~~~p~~~~~l~~~~a~~~~~~~~~~ve~~r~~--~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa 270 (274) +|+| |+.++.+|++++++++.+.+.+..+.++.++.+++. .++++.+++.+|+|+++++|+++++|+.+.+ T Consensus 303 l~~p~~v~~s~~~p~~~i~fgdf~~Y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dG~~~d~~a~~vl~l~~~ 377 (377) T protein:vir:96 303 LPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred cCCCceEEecCCCCcccEEEEEcCcEEEEEecccEEEeehhhhhhcCCeEEEEEEEEcCEEecCCcEEEEEEecC Confidence 7665 778999999998888888888888889888876654 4688999999999999999999999999999 No 132 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=99.94 E-value=2.3e-29 Score=177.62 Aligned_cols=269 Identities=14% Similarity=0.125 Sum_probs=209.7 Q ss_pred CCc----------cccchh--------hccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCccc Q lcl|Aclame:pro 1 MAQ----------GTTKVS--------NLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQV 62 (274) Q Consensus 1 ma~----------~~T~~~--------~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~ 62 (274) |++ .-|... ++++ |+|+.+|...++..++++++.+.. ++ .+|++++||+++.. ++.. 
T Consensus 1 ~~~~~~~~~~~~n~~t~~~~~~~~~~~al~l-e~f~geV~~~f~~~si~~~~~~~r-ti--~~Gksv~f~~iG~~-t~~~ 75 (375) T protein:vir:10 1 MANANQVALGRSNLSTGTGYGGATDKYALYL-KLFSGEMFKGFQHETIARDLVTKR-TL--KNGKSLQFIYTGRM-TSSF 75 (375) T ss_pred CccccccccCccccCCccccccccchHHHHH-HHHhHHHHHHHHHHHhhhcccccc-cc--ccCceEEEEeeeee-EEee Confidence 443 222111 3566 899999999999999999988763 33 46899999999865 7899 Q ss_pred ccCCCcccc---cccccceeEEeehhh-hcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc---- Q lcl|Aclame:pro 63 IAEGEKIPV---DQIGTSKREAKVRKI-GKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLT---- 134 (274) Q Consensus 63 ~~eg~~~~~---~~~~~~~~~~~~~~~-~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~---- 134 (274) +..|.++.. .++...++++++.+. ...+.|.|.+..++..|+++.+.+++++++++..|+.++..+..+... T Consensus 76 ~t~G~~i~~~~~~d~~~te~~l~ID~~~y~~~~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~~p~ 155 (375) T protein:vir:10 76 HTPGTPILGNADKAPPVAEKTIVMDDLLISSAFVYDLDETLAHYELRGEISKKIGYALAEKYDRLIFRSITRGARSASPV 155 (375) T ss_pred ecCCcCcCCccccCCCCCceEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc Confidence 999998743 477788888999876 556899999999999999999999999999999999998665422100 Q ss_pred -------------------c----cCcccCHHHHHHHHHHHhhcCC--CccEEEEcHHHHHHHHhhhccc-ccccccccc Q lcl|Aclame:pro 135 -------------------V----EADITKLDGLQTAIDKFNDEDL--EPMVLFVNPLDAGGLRTSASDN-FTRPTQLGD 188 (274) Q Consensus 135 -------------------~----~~~~~~~d~iv~a~~~l~~~~~--~~~~~v~~p~~~~~L~~~~~~~-~~~~~~~~~ 188 (274) . 
..+...|+.|++|...|.++++ .+|+++|+|+.|+.|+++.+.+ +......++ T Consensus 156 ~~~~~~~~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~d~~~~~n~d~~~~ 235 (375) T protein:vir:10 156 SATNFVEPGGTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEKGVSSQGRCAVLNPRQYYALIQDIGSNGLVNRDVQGS 235 (375) T ss_pred ccccccccCcceeeeccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeChHHHHHHHhcCCccceeeeccccc Confidence 0 0112247888999999988765 6899999999999998765433 333333455 Q ss_pred ccccccccchhcceeeEEcCCCCcce--------------------------------------------------EEEE Q lcl|Aclame:pro 189 NIIVKGAFGEALGAVIVRSNKLNKGE--------------------------------------------------ALLA 218 (274) Q Consensus 189 ~~~~~g~~~~i~G~~Vv~s~~~p~~~--------------------------------------------------~~l~ 218 (274) +...+|.++++.|++|+.|+++|..+ ..+| T Consensus 236 ~~~~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~~~~~~~~ 315 (375) T protein:vir:10 236 ALQSGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAKSCGLIF 315 (375) T ss_pred ceeccceEEEEeceEEEEeccccccccccccccccccccchhhhhccccccCCcceeeccccccccccccccCceEEEEE Confidence 66788889999999999999999311 3678 Q ss_pred cCCeEEEEeccCceeeec---cccccCccEEEEEEEEEEEEEcCcceEEEEeCC-CcccC Q lcl|Aclame:pro 219 KKGAVKLITKRDFFLEKD---RDASRKSTALYSDKHYVAYLYDESKVVKITKGA-GDEVM 274 (274) Q Consensus 219 ~~~a~~~~~~~~~~ve~~---r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~a-a~~~~ 274 (274) +++|++.+.-.++++|.. ++..++.+.|.++.-+|++++||+++|.|+.++ |.+.. 
T Consensus 316 ~~~A~g~v~~~~~~~~~~~~~~~~~~q~~~i~~~~a~G~~~lrp~~av~l~~~~~~~~~~ 375 (375) T protein:vir:10 316 QKEAAGVVEAIGPQVQVTNGDVSVIYQGDVILGRMAMGADYLNPAAAVELYIGATAPSAF 375 (375) T ss_pred chhheeeeeeeccccccccchhhheeeeeeeeeeeeeccCccCceeEEEEecCcCccccC Confidence 888999888888888864 689999999999999999999999999998884 22222 No 133 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=99.94 E-value=6.3e-30 Score=180.66 Aligned_cols=265 Identities=15% Similarity=0.097 Sum_probs=204.8 Q ss_pred CCcc--ccc--------hh--hccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCc Q lcl|Aclame:pro 1 MAQG--TTK--------VS--NLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEK 68 (274) Q Consensus 1 ma~~--~T~--------~~--~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~ 68 (274) |++. -+. .+ ++++ |+|+.+|...++..+++.++..+. ++ .+|++++||+++.. +++.+..|++ T Consensus 1 m~~~~~~~~t~~~~~~~~~~~~l~l-e~~~geV~~af~~~s~~~~~~~~r-~i--~~G~s~~~~~iG~~-~~~~~~~g~~ 75 (334) T protein:vir:80 1 MTYPAANTHTRPGWGGANSDVSLHI-EEHLGLVDASFMYSSKFASWMNVR-SL--RGTNQLRVDRVGAS-TIAGRKAGEE 75 (334) T ss_pred CCCCcCCCccccccccccchheehh-hhhhhHHHHHHHHhhhhhccceee-ec--cccceEEEeeecce-eeeeecCCCC Confidence 8864 111 12 2344 999999999999999999998764 23 56999999999864 7899999999 Q ss_pred ccccccccceeEEeehhh-hcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc------------- Q lcl|Aclame:pro 69 IPVDQIGTSKREAKVRKI-GKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLT------------- 134 (274) Q Consensus 69 ~~~~~~~~~~~~~~~~~~-~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~------------- 134 (274) +..+.++.+++++++..+ ...+.|.|.+..++..|+.+.+.+++++++|++.|+.++..+..+... 
T Consensus 76 l~~~~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~~~~~~G 155 (334) T protein:vir:80 76 LVVQKNVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLKPAFHDG 155 (334) T ss_pred CCCCCcccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccCC Confidence 999999999999999985 455899999999999999999999999999999999887544322110 Q ss_pred --------cc--CcccCH----HHHHHHHHHHhhcCC-----CccEEEEcHHHHHHHHhhhcccccc-cccccccccccc Q lcl|Aclame:pro 135 --------VE--ADITKL----DGLQTAIDKFNDEDL-----EPMVLFVNPLDAGGLRTSASDNFTR-PTQLGDNIIVKG 194 (274) Q Consensus 135 --------~~--~~~~~~----d~iv~a~~~l~~~~~-----~~~~~v~~p~~~~~L~~~~~~~~~~-~~~~~~~~~~~g 194 (274) +. ....+. +.+.+|...|.+.+. .+++++|+|+.|+.|++++..-... ....+.....+| T Consensus 156 ~~~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d~~~s~~~~~~~~g 235 (334) T protein:vir:80 156 ILLPSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNVEFGAKEGGNSFVGG 235 (334) T ss_pred cceeecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccccceeccccccccccce Confidence 00 011223 345566777766654 3589999999999999875322111 111223457889 Q ss_pred ccchhcceeeEEcCCCCcce---------------------EEEEcCCeEEEEeccCceeeeccccccCccEEEEEEEEE Q lcl|Aclame:pro 195 AFGEALGAVIVRSNKLNKGE---------------------ALLAKKGAVKLITKRDFFLEKDRDASRKSTALYSDKHYV 253 (274) Q Consensus 195 ~~~~i~G~~Vv~s~~~p~~~---------------------~~l~~~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~ 253 (274) .+++++|++|+.|+++|.+. 
..++++.|++....+++..|.+|++.++.+.|.+...|| T Consensus 236 ~i~~v~G~~V~~Sn~~P~~~~t~~~~g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~~d~i~~~~a~G 315 (334) T protein:vir:80 236 RIAMLNGVRVVETPRFPQSAITANALGADFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFWEEKKDFGHYLDTFQSYN 315 (334) T ss_pred eEEEEeceEEEeecCCCCccccccccccccccccccccceEEEEEeCceEEEEEEeecceeeeechhhHHHHHHHHHHcC Confidence 99999999999999999431 246788999999999999999999999999999999999 Q ss_pred EEEEcCcceE--EEEeCCC Q lcl|Aclame:pro 254 AYLYDESKVV--KITKGAG 270 (274) Q Consensus 254 ~~v~~~~avv--~l~~~aa 270 (274) +++++|++++ +|+.+-| T Consensus 316 ~g~lRPeaa~vv~~~~~~~ 334 (334) T protein:vir:80 316 IGQRRPDAVAVHDITVTNP 334 (334) T ss_pred CceeccceEEEEEEeeecC Confidence 9999997655 5566666 No 134 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=99.94 E-value=5.6e-29 Score=175.45 Aligned_cols=265 Identities=16% Similarity=0.156 Sum_probs=186.4 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccc--cccccCCCEEEEEeecCCCCcccc-----cCCCcccccc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDS--TLVGQPGDTLTFPAFTYSGDAQVI-----AEGEKIPVDQ 73 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~--~~~~~~G~~v~ip~~~~~~~a~~~-----~eg~~~~~~~ 73 (274) ||+ .+|+||+|+.++++.|++++++.+++++++ ++.++.||+|+||.+.. 
..+.++ .++.++..++ T Consensus 1 Ma~------~~~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~-~~~~~~~~~~~~~~~~~~~~~ 73 (392) T protein:vir:99 1 MAN------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLTVSD 73 (392) T ss_pred Ccc------ccccHHHHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeeccc-ccceeeeccccccCCcccccc Confidence 775 449999999999999999999999998775 45567899999998765 345554 4577788999 Q ss_pred cccceeEEeehh-hhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccc------cCcccCHHHHH Q lcl|Aclame:pro 74 IGTSKREAKVRK-IGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTV------EADITKLDGLQ 146 (274) Q Consensus 74 ~~~~~~~~~~~~-~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~------~~~~~~~d~iv 146 (274) ++.+.+++++.+ .+..|.++|++..+...|+...+.++.++++++++|+.++..+.++.... ......|+.|+ T Consensus 74 ~~~~~~~~~id~~k~~~~~i~d~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~~~a~~~~~~~~~~~~~~~~~~~i~ 153 (392) T protein:vir:99 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVN 153 (392) T ss_pred cccceEEEEEeeeeecceeechHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccChhhhHHHHH Confidence 999999999965 56779999999999999999999999999999999999998876654322 22334689999 Q ss_pred HHHHHHhhcCC-CccEEEEcHHHHHHHHhhhccccccccccc---cccccccccchhcceeeEEcCCCCcceEEEEcCCe Q lcl|Aclame:pro 147 TAIDKFNDEDL-EPMVLFVNPLDAGGLRTSASDNFTRPTQLG---DNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGA 222 (274) Q Consensus 147 ~a~~~l~~~~~-~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~---~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~a 222 (274) +|...|++++. .+|+++++|..+..|+++.. 
|......+ ...+++|.+|+++|++|+.++++|.++.+.+++.+ T Consensus 154 ~a~~~L~~~~vP~~R~~vv~p~~~~~l~~~~~--~~~~~~~g~~~~~~l~~G~vg~i~G~~v~~s~~~~~~t~~a~~~~a 231 (392) T protein:vir:99 154 GARRALNELYIPQGRVLVVGTAVTEQILNDDR--FIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTA 231 (392) T ss_pred HHHHHHhhcCCCCCCEEEEcHHHHHHHhcccc--eeecccccchhhhhhhcceeeeeeeeEEEeecccccccceeeeccc Confidence 99999998765 67899999999999998743 44333333 24588999999999999999999999988888877 Q ss_pred EEEEeccCceeeec-------------------cccccCccEEEEEEEEEEEEEcCcceEE------EEeCCCcc-cC Q lcl|Aclame:pro 223 VKLITKRDFFLEKD-------------------RDASRKSTALYSDKHYVAYLYDESKVVK------ITKGAGDE-VM 274 (274) Q Consensus 223 ~~~~~~~~~~ve~~-------------------r~~~~~~~~i~~~~~~~~~v~~~~avv~------l~~~aa~~-~~ 274 (274) +.+....++..... .+.....+.......++.+.+...+... ++....+. +. T Consensus 232 ~~~at~a~v~~~~~~~~~s~s~~~~v~~~~~~~~~~t~~s~~~~v~~~~g~~~v~~~~~~~~~~~~~~~~~~~~v~v~ 309 (392) T protein:vir:99 232 FIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVA 309 (392) T ss_pred cccccccccccccccceeEEecccceecceeecccceeeccccccceeEEEEEEeeccccceeeeeeeeeecceeeee Confidence 76554443221110 0111111111111122233221111100 00000000 00 No 135 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=99.94 E-value=2.1e-28 Score=172.29 Aligned_cols=262 Identities=17% Similarity=0.137 Sum_probs=203.9 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhccccccccccc-ccCCCEEEEEeecCCCCcccccCCCccccccccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLV-GQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~-~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~ 79 (274) || +..++++.||+|+..+++.|++++++.+++.++++-+ ...|++|+||+.... . 
+.+|..++.++++...+ T Consensus 1 m~---~~~N~~ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp~~~-~---v~dg~~~~~~~~te~~v 73 (418) T protein:vir:10 1 MA---VQDNNLLTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKLPYRV-K---SASGRTLVKQPMVDQTI 73 (418) T ss_pred CC---ccccccccHHHHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEeeCCce-e---ecccCCccccccccceE Confidence 99 7778888999999999999999999999998876543 456999999986532 2 34567788999999999 Q ss_pred EEeehh-hhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc---CcccCHHHHHHHHHHHhhc Q lcl|Aclame:pro 80 EAKVRK-IGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVE---ADITKLDGLQTAIDKFNDE 155 (274) Q Consensus 80 ~~~~~~-~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~---~~~~~~d~iv~a~~~l~~~ 155 (274) ++++.+ .+..|+++++++.++..++.+.+.++.++++++++|+.++..+.++..... .....|+.++++...|+++ T Consensus 74 ~l~id~~k~~~~~itD~e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l~~~a~~~~gt~gt~~~~~~~i~~a~~~Ld~~ 153 (418) T protein:vir:10 74 PFKIAYQEHVGLEYTVKDKTLDIMQFSERYLKSGMVQIANQIDRSLALTLKKAFHSSGTPGVRPGAFIDFANAGAKQTTY 153 (418) T ss_pred EEEEecccccceeechHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccCCcCcchHHHHHHHHHHHHhc Confidence 999965 466799999999999999999999999999999999999988877654332 2334699999999999988 Q ss_pred CC--C-ccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCc-------------------- Q lcl|Aclame:pro 156 DL--E-PMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNK-------------------- 212 (274) Q Consensus 156 ~~--~-~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~-------------------- 212 (274) +. . .|++|++|..+..|+++.. +.......+..+++|.+|+++|+.|++|+++|. 
T Consensus 154 ~VP~~G~R~lVv~P~~~~~L~~~~~--~~~~~~~~~~~lr~G~IG~i~GF~V~~S~nip~~tag~~~~t~~v~ga~~~~~ 231 (418) T protein:vir:10 154 AVPQDGMRHAVLDPFTCASLSDEVT--KLFKESMVEQAYKMGYRGNVAAYEVYESQNLPKHTVGDHGGTPLVNGTVVNGD 231 (418) T ss_pred CCCCCCceEEEeCHHHHHHHhhhcc--ccccccccchhhheeeeeeeeceEEEEecCCCcccccccccceeeecccccce Confidence 76 3 4899999999999887654 333334445679999999999999999999982 Q ss_pred -----------------ceE------------------------------------------------------------ Q lcl|Aclame:pro 213 -----------------GEA------------------------------------------------------------ 215 (274) Q Consensus 213 -----------------~~~------------------------------------------------------------ 215 (274) |+. T Consensus 232 ~~~~~~~t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~~~~~~~f~V~~~~~~~~~~~~tv~i~p~~~~~~~~~~~~~~~~~ 311 (418) T protein:vir:10 232 TVGFDGGTASTTGFLKAGDVITFGGVFGVNPQNYETTGLLQEFVVLEDVDTDAGGAGSIKISPSLNDGTATINNENGDPV 311 (418) T ss_pred eEEEeecceeeccceeeccEEEECceeecccccccccccceEEEEEeeccccccCcceeEeccccccccccccccccccc Confidence 111 Q ss_pred -------------------------------EEEcCCeEEEEecc--------------------CceeeeccccccCcc Q lcl|Aclame:pro 216 -------------------------------LLAKKGAVKLITKR--------------------DFFLEKDRDASRKST 244 (274) Q Consensus 216 -------------------------------~l~~~~a~~~~~~~--------------------~~~ve~~r~~~~~~~ 244 (274) +.|+++++....+. 
.+++.+++|...+.+ T Consensus 312 ~~~~~~~v~a~~a~~~~it~~~~a~~~~~~nl~f~~~a~~l~~~~l~~p~g~~~~~~~~~~~~G~s~r~~~~~d~~~~~~ 391 (418) T protein:vir:10 312 SLTAYQNVTALPADNAPITVLGAANTTYEQNYLFHRDAIALAMIDLELPQSAVIKSRAADPETGLSLTLTGAYDINEQSE 391 (418) T ss_pred cccCCCcccccccCcceeeeecccccceeeeeeeecceEEEEEeeccCCCCCCcceEEEeccCCeEEEEEEcccccccce Confidence 11223333222110 122334455666677 Q ss_pred EEEEEEEEEEEEEcCcceEEEEeCCCc Q lcl|Aclame:pro 245 ALYSDKHYVAYLYDESKVVKITKGAGD 271 (274) Q Consensus 245 ~i~~~~~~~~~v~~~~avv~l~~~aa~ 271 (274) .++.+.-||++.++|+=.|++-..++| T Consensus 392 ~~r~d~l~g~~~~~p~~~~~~~g~~~~ 418 (418) T protein:vir:10 392 IHRIDAVWGADMIYGELALRLWGAASS 418 (418) T ss_pred EEEEEeecCceeecccceEEEEeecCC Confidence 788888999999999999999999999 No 136 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=99.94 E-value=7.2e-29 Score=174.87 Aligned_cols=270 Identities=13% Similarity=0.044 Sum_probs=212.6 Q ss_pred CCccccchh--------h-ccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccc Q lcl|Aclame:pro 1 MAQGTTKVS--------N-LIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPV 71 (274) Q Consensus 1 ma~~~T~~~--------~-~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~ 71 (274) |+...+.+. + .+-=|+|+.+|.+.++..+++.++.... ++ .+|++++||+.+.. +++.+..|+++.. 
T Consensus 1 ms~~n~~t~~~~~~~~~~~al~le~f~geV~taf~~~s~~~~~~~~r-ti--~~gkS~q~~~iG~~-~~~~~~~G~~ld~ 76 (364) T protein:vir:10 1 MSNPNVLTQPAVSASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDVQ-EV--VGTNSVSNKYIGET-ELQVLSPGKSPDA 76 (364) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceee-ee--cccceEEeeeeeee-EEeeeccCcccCC Confidence 876443331 1 2333889999999999999998887663 33 46899999999875 6899999999988 Q ss_pred cccccceeEEeehhhhc-chhccHHHHhccCcc-HHHHHHHHHHHHHHHHHHHHHHHHhcccc--c--------c----- Q lcl|Aclame:pro 72 DQIGTSKREAKVRKIGK-GTELTDEAVLSGFGD-PQGEAVRQHGLAIANKVDNDVLEALKGAT--L--------T----- 134 (274) Q Consensus 72 ~~~~~~~~~~~~~~~~~-~~~is~e~~~~s~~d-~~~~~~~~~a~~~a~~~d~~~i~~~~~a~--~--------~----- 134 (274) +.+..++..+++..+-. .+.|.+.+..++.+| +.+.+.+++++++++..|+.++..+..+. . . T Consensus 77 ~~~~~~k~~itID~ll~a~~~V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~~~~~~g 156 (364) T protein:vir:10 77 SPTEFDKNRLVVDTTVIARNTVAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNPRVAGHG 156 (364) T ss_pred CCcccCcEEEEecceeeechhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCCcccCCc Confidence 89999999999987644 478999999999999 89999999999999999999875443211 0 0 Q ss_pred ------cc--CcccC----HHHHHHHHHHHhhcCC--CccEEEEcHHHHHHHHhhhccccccccccccccccccccchhc Q lcl|Aclame:pro 135 ------VE--ADITK----LDGLQTAIDKFNDEDL--EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEAL 200 (274) Q Consensus 135 ------~~--~~~~~----~d~iv~a~~~l~~~~~--~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~ 200 (274) .. ...++ ++.|.+|...|.+.+. ..++++|+|..|+.|++....-.......+++...+|.++.+. 
T Consensus 157 ~~i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~~R~~vv~P~~y~~Ll~~~~lvn~d~~~~~~~~~~~G~v~~v~ 236 (364) T protein:vir:10 157 FSIHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDTSELCGLMPWTAFNCLRDADRIVDKSYTIAASDNTVDGFVLKSW 236 (364) T ss_pred ceeeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCccccEEEeChHHHHHHhcCCccccccccccCCCccccceeEEEe Confidence 00 01112 3456678888888765 7799999999999999864322112122234557899999999 Q ss_pred ceeeEEcCCCCc---------------------------------ceEEEEcCCeEEEEeccCceeeeccccccCccEEE Q lcl|Aclame:pro 201 GAVIVRSNKLNK---------------------------------GEALLAKKGAVKLITKRDFFLEKDRDASRKSTALY 247 (274) Q Consensus 201 G~~Vv~s~~~p~---------------------------------~~~~l~~~~a~~~~~~~~~~ve~~r~~~~~~~~i~ 247 (274) |+||+.|+++|. ....+|++.+++.+..+++..|.+++..++.+.+. T Consensus 237 Gv~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e~~~~~~~~~~~id 316 (364) T protein:vir:10 237 NTPIVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISITGDIFYEKKEKTWYID 316 (364) T ss_pred ceEEEeccccccccccccccccccccccccccCCcccccccccceeEEEEEecceEEEEEEecceeeeeeccceeeeeee Confidence 999999999982 11467899999999999999999999999999999 Q ss_pred EEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 248 SDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 248 ~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) ++..||++++||++++.|+.+++..-. 
T Consensus 317 a~~a~G~g~lRPeaa~~i~~~~~~~~~ 343 (364) T protein:vir:10 317 TFLAEGAIPDRWEAVAVVTAADTAELA 343 (364) T ss_pred eehcccCcccCccceEEEEecCCCCCc Confidence 999999999999999999999888766 No 137 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=99.94 E-value=6.5e-29 Score=175.12 Aligned_cols=260 Identities=13% Similarity=0.015 Sum_probs=189.8 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCccc-cccccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIP-VDQIGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~-~~~~~~~~~ 79 (274) |...++..+..++|+.+.+.|++.+.+.+.+++++++.. .+| ..++|+....+.+.|.+|+++++ ..+++|+++ T Consensus 76 ~~~~t~~~Gg~lvP~~~~~~I~~~l~~~spir~~a~v~~----~~~-~~~i~~~~~~~~a~W~~e~~~~~~~~~~~f~~i 150 (381) T protein:vir:10 76 INKSVGYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKN----AGL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) T ss_pred HhhcCCCCCceecCHHHHHHHHHHHHhhcceeeeeeeEe----cCc-ceEEEeecCCcceEEeecccccccccCccceeE Confidence 555666667789999999999999999999999887532 334 46899988888899999988876 458999999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc-----------cc---------cCcc Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL-----------TV---------EADI 139 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~-----------~~---------~~~~ 139 (274) ++..++++..+++|++++.++.+|+++++.+++++++++++|++++.+-.+.-. .. .... 
T Consensus 151 ~l~~~kl~a~i~is~elL~Ds~~~le~~i~~~la~~~a~~~~~afi~GdG~~qP~Gil~~~~~~~~~~~g~~~~~~~~~~ 230 (381) T protein:vir:10 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTDGAYPEKEEQGT 230 (381) T ss_pred eecceeEEeeccccHHHHhccHHHHHHHHHHHHHHHHHHHhhceeEecccCCCceeeeecCCcccccccccccccccccc Confidence 999999999999999999999999999999999999999999988854321100 00 0111 Q ss_pred cCHHHHHHH-------HHHHhh-------cCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeE Q lcl|Aclame:pro 140 TKLDGLQTA-------IDKFND-------EDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIV 205 (274) Q Consensus 140 ~~~d~iv~a-------~~~l~~-------~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv 205 (274) +++.++... ...+.. .+..+..|+|||.++..|++...... ..|.. +. ...+|.+|+ T Consensus 231 ~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~~~~~~~----~~G~~-v~----~lp~g~~vv 301 (381) T protein:vir:10 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN----ANGVY-VT----ALPFNLNVI 301 (381) T ss_pred ccccchhhHHHHHHHHHHhhhhhhccccccccCceEEEEchhhHHhhccccccCC----CCCce-ee----cCCCCceeE Confidence 222222221 111111 12245679999999999876542111 11111 11 012588999 Q ss_pred EcCCCCcceEEEEcCCeEEEEeccCceeeeccc--cccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 206 RSNKLNKGEALLAKKGAVKLITKRDFFLEKDRD--ASRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 206 ~s~~~p~~~~~l~~~~a~~~~~~~~~~ve~~r~--~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) .++.+|++++++.+.+.+.+..+.++.++..++ ..++++.++++.|+|+++++|+++++++.+.--..- T Consensus 302 ~~~~~p~~~i~fGDfs~Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~dG~~~~~~A~~v~~l~~~~~~~ 372 (381) T protein:vir:10 302 ESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKP 372 (381) T ss_pred EcCCCCcCcEEEEEcccEEEEEecccEEEeechhhhhcCceEEEEEEEEcCEEecCCcEEEEEEeecCCcc Confidence 999999999887777777778888888877654 456888999999999999999999997765322221 No 138 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: 
genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=99.94 E-value=3.4e-29 Score=176.65 Aligned_cols=258 Identities=13% Similarity=0.036 Sum_probs=188.8 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCccc-cccccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIP-VDQIGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~-~~~~~~~~~ 79 (274) |...++..+..++|+.+.+.|++.+.+.+.+.+++++. ..+|+ .+||+....+.+.|++|+.+++ ..+++|+++ T Consensus 83 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~l~~~~~v~----~~~~~-~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i 157 (383) T protein:vir:78 83 INKEVGYKEETLLPQTVVDEIFEDLTTEHPFLASIGMR----TTGLR-TKFLKSETSGVAVWGKIFGEIKGQLDATFSDE 157 (383) T ss_pred HhccCCCCCccccCHHHHHHHHHHHHhhccceeeeeeE----ecCCc-eEEEEEcCCcceEEeecccccccccCcceeeE Confidence 77777788889999999999999999999988888653 33454 6899998888999999988875 468999999 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccc-c----------ccc---------cCcc Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGA-T----------LTV---------EADI 139 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a-~----------~~~---------~~~~ 139 (274) ++.+++++..+++|++++.++.+++++++.+++++++++++|++++..-... + ... .... 
T Consensus 158 ~l~~~kl~~~i~is~ell~Ds~~~ie~~i~~~l~~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~ 237 (383) T protein:vir:78 158 ESIQNKLTAFVVVPKDLEKFGPAWVKRFVVTQIEEAFAVALESAYIVGDGNDKPIGLNRKVGKGSTVVDGVYAEKAATGT 237 (383) T ss_pred eecceeeEeeccchHHHhhccHHHHHHHHHHHHHHHHHHHHhhheEeccCCCCceeeeeccCCcccccccccccccccch Confidence 9999999999999999999999999999999999999999999998543211 0 000 1112 Q ss_pred cCHHHHHHHHHHHhhc--------------CCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhccee-- Q lcl|Aclame:pro 140 TKLDGLQTAIDKFNDE--------------DLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAV-- 203 (274) Q Consensus 140 ~~~d~iv~a~~~l~~~--------------~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~-- 203 (274) ++++++......+... ......|+|||..+..+..... . . ..+|+..+++|+| T Consensus 238 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~--~----~-----~~~G~~~t~l~~~~~ 306 (383) T protein:vir:78 238 LTFANPKTTVNELTDVYKYHSVKENGHPLNVAGKVTLLVNPTDAWDVKKQYT--S----L-----NANGVYVTALPFNLN 306 (383) T ss_pred hhhhhhHHHHHHHHHHHhccchhcccchhhhcCceEEEEcCcchhhhccchh--c----c-----CCCCceeeecCCCce Confidence 2333333333332210 0122358888877665432110 0 0 1234444566555 Q ss_pred eEEcCCCCcceEEEEcCCeEEEEeccCceeeecccc--ccCccEEEEEEEEEEEEEcCcceEEEEeCCCcc--cC Q lcl|Aclame:pro 204 IVRSNKLNKGEALLAKKGAVKLITKRDFFLEKDRDA--SRKSTALYSDKHYVAYLYDESKVVKITKGAGDE--VM 274 (274) Q Consensus 204 Vv~s~~~p~~~~~l~~~~a~~~~~~~~~~ve~~r~~--~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~--~~ 274 (274) |+.++.+|++++++.+.+.+.+..+.++.++.+++. .++++.+++..|+|+++++|+|++.++.+-+.. 
+- T Consensus 307 iv~s~~~p~~~iifgdfs~Y~i~~r~~~~i~~~~~~~f~~d~~~f~~~~r~dG~~~~~~A~~vl~~~~~~~~~~~ 381 (383) T protein:vir:78 307 IIESLFVPEKKAISYVAERYDALIGGPLDIGTYDQTLAIEDLNLYAAKQFAYGKAKDDKAAAVWTLNINPAEQTP 381 (383) T ss_pred EEecCCCCcccEEEeeccceEEEecccceEEecchhhhhcCceEEEEEEEEcCEEecCCeEEEEEEEecCCCCCC Confidence 778999999998877777787888888888876554 457889999999999999999988866553332 22 No 139 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=99.93 E-value=7.7e-29 Score=174.70 Aligned_cols=233 Identities=14% Similarity=0.178 Sum_probs=183.4 Q ss_pred cccccccccccCCCEEEEEeecCCCCcccccCCCccc--ccccccceeEEeehhhh-cchhccHHHHhccCccHHHHHHH Q lcl|Aclame:pro 34 FADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIP--VDQIGTSKREAKVRKIG-KGTELTDEAVLSGFGDPQGEAVR 110 (274) Q Consensus 34 l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~--~~~~~~~~~~~~~~~~~-~~~~is~e~~~~s~~d~~~~~~~ 110 (274) ++ .++ .+|++++||+++.. ++..+..|.++. ..++...+..+++.++. ..+.|.|.+..++..|+++.+.+ T Consensus 1 ~v---r~i--~~g~s~~~~~iG~~-~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~~~VdDiD~~qa~~Dlr~e~s~ 74 (324) T protein:vir:99 1 MT---RTI--TSGKSAQFPVMGRT-KARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDVLIYDIEDAMNHYDVRSEYST 74 (324) T ss_pred Ce---eee--ecCceEEEeeeeee-EeccccCCCCcCCCcCCcCcccEEEEecchhhhhhhhhhHHHHhcCccchhHHHH Confidence 22 223 45899999999875 789999999984 46789999999998764 45899999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcccc-------------------cccc----Ccc----cCHHHHHHHHHHHhhcCC--CccE Q lcl|Aclame:pro 111 QHGLAIANKVDNDVLEALKGAT-------------------LTVE----ADI----TKLDGLQTAIDKFNDEDL--EPMV 161 (274) Q Consensus 111 ~~a~~~a~~~d~~~i~~~~~a~-------------------~~~~----~~~----~~~d~iv~a~~~l~~~~~--~~~~ 161 (274) ++++++++..|+.++..+.... .... ... ..++.|++|...|.+++. 
.+|+ T Consensus 75 ~~G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~Lde~~VP~~gR~ 154 (324) T protein:vir:99 75 QMGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAFAKKYIPAGDRT 154 (324) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCE Confidence 9999999999998875542100 0000 011 126788888899988765 6799 Q ss_pred EEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcce--------------------------- Q lcl|Aclame:pro 162 LFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGE--------------------------- 214 (274) Q Consensus 162 ~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~--------------------------- 214 (274) ++|+|+.|+.|+++.. +......+.+.+.+|.+++++|++|+.|+++|... T Consensus 155 ~vv~P~~y~~Ll~~~~--~~~~~~~~~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ky 232 (324) T protein:vir:99 155 FYTDPDTYSAILAALM--PNAANYAALIDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTGKM 232 (324) T ss_pred EEeChHHHHHHhhccc--ccccccccccceecceEEEEeceEEEecCCcccccccccccccccccccccccccccccccc Confidence 9999999998875533 22233345567899999999999999999998420 Q ss_pred --------EEEEcCCeEEEEeccCceeeeccccccCccEEEEEEEEEEEEEcCcce--EEEEeCCCcccC Q lcl|Aclame:pro 215 --------ALLAKKGAVKLITKRDFFLEKDRDASRKSTALYSDKHYVAYLYDESKV--VKITKGAGDEVM 274 (274) Q Consensus 215 --------~~l~~~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~av--v~l~~~aa~~~~ 274 (274) .++|++++++....+++++|..|++.++.+.|++++.||++++||+++ |.+...++..|. 
T Consensus 233 ~~d~~~~~gl~~~~~a~~tv~~~~~~~e~~~~~~~~~d~i~~~~a~G~~~lRPe~a~~v~l~~~~~~~~~ 302 (324) T protein:vir:99 233 TVGADNVVGLFVHRSAVATLKLKDMALERARRPEYQADQIIAKYAMGHGGLRPEAVGAIIFEDGETPAVA 302 (324) T ss_pred ccccCceeEEEEehhheEEEeeecceecceechhhHHHhhhhhhhhcCcccccceEEEEEEccCcccccc Confidence 157788888888899999999999999999999999999999999977 566666666666 No 140 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=99.93 E-value=4.2e-28 Score=170.66 Aligned_cols=269 Identities=13% Similarity=0.063 Sum_probs=210.2 Q ss_pred CCcccc----------chhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCccc Q lcl|Aclame:pro 1 MAQGTT----------KVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIP 70 (274) Q Consensus 1 ma~~~T----------~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~ 70 (274) |.+..+ .-.++++ |+|+.+|.+.++..+++.++..+.. + .+|++++||+.+.. +++.+.+|.++. T Consensus 1 ms~~~~~t~~~~~~s~~d~al~l-e~f~geV~~af~~~s~~~~~~~~rt-i--~~g~s~~~~~iG~~-~~~~~~pG~~l~ 75 (335) T protein:vir:78 1 MSFLNDLTRPNYAGKNADVDIHL-EEHLGIVDKHFAYTSKFAPLMNIRD-L--RGSNVVRLDRLGNV-EAKGRRAGEELE 75 (335) T ss_pred CCccccccccccccccchhhhhh-hhhhhHHHHHHHHhhhhccccceee-e--ccceeEEEeeeeee-eecccccCcccC Confidence 765431 2234677 9999999999999999999887653 2 56999999999875 689999999999 Q ss_pred ccccccceeEEeehhhh-cchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc---------------- Q lcl|Aclame:pro 71 VDQIGTSKREAKVRKIG-KGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL---------------- 133 (274) Q Consensus 71 ~~~~~~~~~~~~~~~~~-~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~---------------- 133 (274) .+.+..++..+++..+- ..+.|.+.+..++..|+.+.+.+++++++|+..|+.++..+..+.. 
T Consensus 76 ~~~~~~~k~~itID~ll~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~~~~G~~ 155 (335) T protein:vir:78 76 RSRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGVL 155 (335) T ss_pred CCCcccCCeEEEecceeechhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCcCCCcc Confidence 99999999999998754 3467999999999999999999999999999999988744432221 Q ss_pred -----cccCcccCHHHHH----HHHHHHhhcCC-----CccEEEEcHHHHHHHHhhhcccccccc-ccccccccccccch Q lcl|Aclame:pro 134 -----TVEADITKLDGLQ----TAIDKFNDEDL-----EPMVLFVNPLDAGGLRTSASDNFTRPT-QLGDNIIVKGAFGE 198 (274) Q Consensus 134 -----~~~~~~~~~d~iv----~a~~~l~~~~~-----~~~~~v~~p~~~~~L~~~~~~~~~~~~-~~~~~~~~~g~~~~ 198 (274) ++.....+++.++ +|...|.+.+. ..++++|+|+.|+.|++.+..-..... ..+.+...+|.++. T Consensus 156 ~~~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g~v~~ 235 (335) T protein:vir:78 156 EKLDLTGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMSVEYQATGATNDYVKSRVAI 235 (335) T ss_pred eeeeeccccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhcccccccccccccccccccccceeEE Confidence 0111112344444 44555665543 258999999999999986432111111 11234578899999 Q ss_pred hcceeeEEcCCCCcc---------------------eEEEEcCCeEEEEeccCceeeeccccccCccEEEEEEEEEEEEE Q lcl|Aclame:pro 199 ALGAVIVRSNKLNKG---------------------EALLAKKGAVKLITKRDFFLEKDRDASRKSTALYSDKHYVAYLY 257 (274) Q Consensus 199 i~G~~Vv~s~~~p~~---------------------~~~l~~~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~ 257 (274) +.|+||+.|+++|.+ .++++++.+++.+..+++..|.+++..++.+.|.++..||++++ T Consensus 236 v~Gv~V~~Sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~~~~i~~~~a~G~g~l 315 (335) T protein:vir:78 236 LNGVKVLETPRFATKAISAHPLGRHFNVSAEEAERQIALFLPSKTLITAQVAPVQAKLWEDHDQFSWVLDTFQMYNIGAR 315 (335) T ss_pred eeceEEEeeccCCCCCCccccccccCCcccccccceEEEEEecceEEEEEEEecccceeeccchhhHhhhHHHHcCCccc Confidence 999999999999932 24678899999999999999999999999999999999999999 Q ss_pred cCcceEEEEeCCCcccC Q lcl|Aclame:pro 258 DESKVVKITKGAGDEVM 274 (274) Q Consensus 258 
~~~avv~l~~~aa~~~~ 274 (274) ||++++.++.+..-++- T Consensus 316 RPe~a~~i~~tg~~~~~ 332 (335) T protein:vir:78 316 RPDTAGAIELKGIEAFD 332 (335) T ss_pred CcceEEEEEecCCCccc Confidence 99999999999888877 No 141 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=99.92 E-value=9e-28 Score=168.86 Aligned_cols=269 Identities=14% Similarity=0.067 Sum_probs=210.6 Q ss_pred CCccc----------cchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCccc Q lcl|Aclame:pro 1 MAQGT----------TKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIP 70 (274) Q Consensus 1 ma~~~----------T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~ 70 (274) |.+.. ..-.++++ |+|+.+|...++..+++.++..... + .+|++++||+.+.. +++.+.+|.++. T Consensus 1 ms~~~~~tr~~~~~s~~d~al~l-e~f~geV~~af~~~s~~~~~~~~rt-i--~~g~s~~~~~iG~~-~~~~~~pG~~l~ 75 (335) T protein:vir:63 1 MSFLNDLTRPNYAGKNADVDIHL-EEHLGIVDKHFAYTSKFAPLMNIRD-L--RGSNVVRLDRLGNV-EAKGRRAGEELE 75 (335) T ss_pred CCCcccchhhhcccccchhheeh-hhhhhhHHHHHHhhhhhccccceee-e--ccceeEEEeeeeee-eeecccCCcCcC Confidence 76543 12234566 9999999999999999999887653 2 56999999999875 799999999999 Q ss_pred ccccccceeEEeehhhh-cchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc---------------- Q lcl|Aclame:pro 71 VDQIGTSKREAKVRKIG-KGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL---------------- 133 (274) Q Consensus 71 ~~~~~~~~~~~~~~~~~-~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~---------------- 133 (274) .+....++..+++..+- ..+.|.+.+..++..|+.+.+.+++++++|+..|+.++..+..+.. 
T Consensus 76 ~~~~~~~k~~itVD~ll~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~~~~G~~ 155 (335) T protein:vir:63 76 RSRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGVL 155 (335) T ss_pred CCCccccceEEEecceeechhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCCCcCCCcc Confidence 99899999999998754 3467999999999999999999999999999999988744322211 Q ss_pred -----cccCcccCHHHHH----HHHHHHhhcCC-----CccEEEEcHHHHHHHHhhhcccccccc-ccccccccccccch Q lcl|Aclame:pro 134 -----TVEADITKLDGLQ----TAIDKFNDEDL-----EPMVLFVNPLDAGGLRTSASDNFTRPT-QLGDNIIVKGAFGE 198 (274) Q Consensus 134 -----~~~~~~~~~d~iv----~a~~~l~~~~~-----~~~~~v~~p~~~~~L~~~~~~~~~~~~-~~~~~~~~~g~~~~ 198 (274) +.......++.++ +|...|.+++. ..++++|+|+.|+.|++.+..-..... ..+.+...+|.++. T Consensus 156 ~~~~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g~v~~ 235 (335) T protein:vir:63 156 EKLDLTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQATGATNDYVKSRVAI 235 (335) T ss_pred eeeeeccCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhccccccccccccccccccccCceeEE Confidence 1111122455554 66677776654 348999999999999986432111111 11234578899999 Q ss_pred hcceeeEEcCCCCcc---------------------eEEEEcCCeEEEEeccCceeeeccccccCccEEEEEEEEEEEEE Q lcl|Aclame:pro 199 ALGAVIVRSNKLNKG---------------------EALLAKKGAVKLITKRDFFLEKDRDASRKSTALYSDKHYVAYLY 257 (274) Q Consensus 199 i~G~~Vv~s~~~p~~---------------------~~~l~~~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~ 257 (274) +.|+||+.|+++|.+ .++++++.+++....++++.|.+++..++.+.|.++..||++++ T Consensus 236 v~Gv~V~~sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~~~~~~i~~~~a~G~g~l 315 (335) T protein:vir:63 236 LNGVKVLETPRFATKAIAAHPLGRHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNEKFSWVLDTFQMYNIGAR 315 (335) T ss_pred eeceEEEeeccCCCCCcccccccccCCccccccceeEEEEEecceEEEEEEeecccceeeccchhhHHhHHHHHcCCccc Confidence 999999999999931 24678899999999999999999999999999999999999999 Q ss_pred cCcceEEEEeCCCcccC Q lcl|Aclame:pro 258 DESKVVKITKGAGDEVM 274 (274) Q Consensus 258 
~~~avv~l~~~aa~~~~ 274 (274) ||++++.++.+..=++- T Consensus 316 RPe~a~~i~~tg~~~~~ 332 (335) T protein:vir:63 316 RPDTAGAIELKGIGAFD 332 (335) T ss_pred ccceEEEEEEcCCCcee Confidence 99999999988777666 No 142 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=99.92 E-value=1e-27 Score=168.58 Aligned_cols=266 Identities=14% Similarity=0.127 Sum_probs=189.4 Q ss_pred CCccc----------cchhhccc-hHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEE----EeecCCCCcccccC Q lcl|Aclame:pro 1 MAQGT----------TKVSNLIV-PEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTF----PAFTYSGDAQVIAE 65 (274) Q Consensus 1 ma~~~----------T~~~~~~i-Pe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~i----p~~~~~~~a~~~~e 65 (274) |.+.+ -+.++++- |+.+..++.+.+++..+...+.+... ...+-.+.+ |.+. .++++.++| T Consensus 1 ~~~~~~i~s~~~~~~itv~~ll~~P~~I~~~i~e~~~~~~iad~lf~~~~---a~~~~~v~f~~~~p~~~-~~d~e~VaE 76 (318) T protein:vir:10 1 MTAPTGIVSVSDGPAITVRELVGNPLWIPTALKKMMVNQFISESLFRNGG---ANPNGVVAYNEGNPSFL-EDDVADVAE 76 (318) T ss_pred CCCCCcceeeecCCceehHHhhCCchhHHHHHHHHHhccchhhhhhhccc---ccccceeEEEecccccc-cCcHhhccC Confidence 66542 12234444 88888888888776655555554321 112334665 3343 368999999 Q ss_pred CCcccccccccceeEE-eehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc---ccCcccC Q lcl|Aclame:pro 66 GEKIPVDQIGTSKREA-KVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLT---VEADITK 141 (274) Q Consensus 66 g~~~~~~~~~~~~~~~-~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~---~~~~~~~ 141 (274) |+++|....++++..+ ..+|+|..++||+|.+..+..+..+...+++++.++++.|+.+++.+..+.+. +++.+.. 
T Consensus 77 ggEiP~~~~~~G~~~ia~~~K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~~~~s~~w~~ 156 (318) T protein:vir:10 77 FGEIPVSAGARGLPRTAFAVKKALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPTLAVPTAWDN 156 (318) T ss_pred cccccccCCCCCchhhhhhehhccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCCcCCCC Confidence 9999999999987765 66899999999999999999999999999999999999999999998765422 2222211 Q ss_pred ----HHHHHHHHHHH----------------hhcCCCccEEEEcHHHHHHHHhhhcccccccccccccc----ccccc-c Q lcl|Aclame:pro 142 ----LDGLQTAIDKF----------------NDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNI----IVKGA-F 196 (274) Q Consensus 142 ----~d~iv~a~~~l----------------~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~----~~~g~-~ 196 (274) ..++++|.... ...++.++.++|||..|..|+++.......... .+.. -..|. . T Consensus 157 ~~~~~~d~~~A~e~v~~a~~~~~~a~~~~~~~~~GY~pdtIVlhP~~~~~l~~n~~~~~~y~~~-a~~~~~~~~~tg~~~ 235 (318) T protein:vir:10 157 GGKVRTDIAIAIEQISTAAPTAYPAGVGSSDEYFGFIPDTIVMHYALLPILMDNENFMKVYERN-ANYVSTAPDWTGNFP 235 (318) T ss_pred cccccccchhhhhhhhhhhhhhhhhhhhhhhhccCccceeeEECHHHHHHHhcchhhhhhhhcc-chhhhhccccccccc Confidence 11333333211 123578899999999999998775432211111 1111 11233 3 Q ss_pred chhcceeeEEcCCCCcceEEEEcCCeEEE-EeccCceeeecccc-------ccCccEEEEEEEEEEEEEcCcceEEEEeC Q lcl|Aclame:pro 197 GEALGAVIVRSNKLNKGEALLAKKGAVKL-ITKRDFFLEKDRDA-------SRKSTALYSDKHYVAYLYDESKVVKITKG 268 (274) Q Consensus 197 ~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~-~~~~~~~ve~~r~~-------~~~~~~i~~~~~~~~~v~~~~avv~l~~~ 268 (274) ++++|+.|+.|+.+|.+++|+++++.+|+ +..+++.+++.|.+ ...++.+++++....+|.+|.|+|+||.= T Consensus 236 g~~lGl~vi~s~~~p~~~alvlq~g~vG~~~d~~pl~~t~~~~egg~~~g~~~~s~~~~~~~~~~~~V~~PkA~~~itgi 315 (318) T protein:vir:10 236 GSVMGLNVIRSRTFPIDRVLIMERGTVGFYSDTRPLQFTALYPEGNGPNGGPTESYRADASHKRALAVDQPKAALWLTGI 315 (318) T ss_pred ceeeceEEeecCccCCCeeEEEecCCcceeeccccceeeecccCCCCCCCCcchhhheehheeeeeeeeCcceeEEEeec Confidence 67899999999999999999999999996 46777888887765 45567889999999999999999999863 Q ss_pred CCc Q lcl|Aclame:pro 269 AGD 271 (274) Q Consensus 269 aa~ 
271 (274) .-- T Consensus 316 ~~~ 318 (318) T protein:vir:10 316 VTP 318 (318) T ss_pred cCC Confidence 322 No 143 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=99.92 E-value=5.4e-27 Score=164.61 Aligned_cols=265 Identities=15% Similarity=0.107 Sum_probs=203.1 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCC----CCcccccCCCccccccccc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYS----GDAQVIAEGEKIPVDQIGT 76 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~----~~a~~~~eg~~~~~~~~~~ 76 (274) |... ...+..+.|+++. ++++.+++.+.+.+++++.... +..+..||+++.. +...|.++....++++++| T Consensus 14 it~~-d~~gG~L~P~~~~-~~i~~l~e~s~i~~~a~vi~t~---~s~~~~i~~i~~g~~~~~~~~~~~~~~~~~~~~~tf 88 (314) T protein:vir:41 14 IDVP-DLGKGILAVQRFG-EFVREVRENSAIIKDARVLNAL---KSYEVDISRISLGVELEPGRNTSGTKVAPTADEVTV 88 (314) T ss_pred cccc-cCCCceeChHHHH-HHHHHHHhccchhhheeeeccc---CccceeecccccCcccccccccccCCccCCcccccc Confidence 5332 2234579999985 6889999999999999865332 2345788887542 2345567777889999999 Q ss_pred ceeEEeehhhhcchhccHHHHhccC--ccHHHHHHHHHHHHHHHHHHHHHHHHhccc-----------------cc---c Q lcl|Aclame:pro 77 SKREAKVRKIGKGTELTDEAVLSGF--GDPQGEAVRQHGLAIANKVDNDVLEALKGA-----------------TL---T 134 (274) Q Consensus 77 ~~~~~~~~~~~~~~~is~e~~~~s~--~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a-----------------~~---~ 134 (274) +++.+.++++...++||++++.++. +|+++++.+++++++++.++..++.+-... .. . 
T Consensus 89 ~~~~l~~~kl~~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~~~~~~~p~G~l~~a~~~~~~ 168 (314) T protein:vir:41 89 STNTLEMKELVTKVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSLTTGRELYRINDGWMKLAGNQYTD 168 (314) T ss_pred cceeeeeEEEEEeecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCCcCcccchhcchhhhhhcccceee Confidence 9999999999999999999999985 499999999999999999999888543211 00 0 Q ss_pred --ccCcccCHHHHHHHHHHHhhcCC---CccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCC Q lcl|Aclame:pro 135 --VEADITKLDGLQTAIDKFNDEDL---EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNK 209 (274) Q Consensus 135 --~~~~~~~~d~iv~a~~~l~~~~~---~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~ 209 (274) ..+..++.+.+.++...|...+. ...+|+||++++..+++....+ .+..++..+..|...+++|+||+..+. T Consensus 169 ~~~~~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~l~~~---~~~l~~~~~~~~~~~~l~G~PV~~~~~ 245 (314) T protein:vir:41 169 AEPEDENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQLLVR---ETGLGDSALIGATGLQYDGIPIQYVPA 245 (314) T ss_pred cCccccccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHHHhcc---CCcccchhhhCCCCceecceeeEeccc Confidence 11222445667888899987664 3457999999999998764332 333566667778888999999999998 Q ss_pred CC-----cceEEEEcCCeEEEEeccCceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeCCCccc Q lcl|Aclame:pro 210 LN-----KGEALLAKKGAVKLITKRDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAGDEV 273 (274) Q Consensus 210 ~p-----~~~~~l~~~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~ 273 (274) +| ++.+++.++..+.+.....++++.+|+.+.++..++.+.|+||.+..++++|+....-+++= T Consensus 246 ~~~~~~~~~~i~fgd~~nlv~~~~~~ir~~~~~~a~~~~~~~~~~~r~d~~~~~~~aa~~~~~~~~~~~ 314 (314) T protein:vir:41 246 LDALGDDKARALLTVPTNLVYGFWRNIRIEPKRDAAMRRTEYIASLRADCNYEDENAAVAAVIDMSSGG 314 (314) T ss_pred ccccCCCCceEEEechhheEEEeeceeEEeecccCcCCeEEEEEEEEeceEEEEcCcEEEEEeeccCCC Confidence 85 45555666777878889999999999999999999999999999998888777766655555 No 144 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # 
Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=99.91 E-value=2.9e-26 Score=160.62 Aligned_cols=263 Identities=15% Similarity=0.181 Sum_probs=192.3 Q ss_pred CCccccchhhccchHHHHHHHHHHHHH-hhhhcccccccccccccCCCEEEEEeecCCCCcc------cccCCC-ccccc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDK-KLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQ------VIAEGE-KIPVD 72 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~-~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~------~~~eg~-~~~~~ 72 (274) |+ |+....|+ ++|++.+...+++ .+.+.+-++..+ ...++++++.+.....+..+ ....+. +.|.. T Consensus 13 Ms---~~i~~~fv-~qy~~~v~~~~qq~~s~L~~tV~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~dtp~~ 86 (322) T protein:vir:10 13 IA---GDIDQAFV-QTYETTLRILSQQKSAKLKQYCQHKN--ESSESHNWETLASMDPDAVKRKRSRQQSADGTYPTPVN 86 (322) T ss_pred ee---chhhhHHH-HHHHHHHHHHHHHhhhhhhccccccc--ccccccceeecccccccccccccccccccCcccCCCcc Confidence 55 34566777 8899988777664 344555444322 22334445554433222111 111122 35666 Q ss_pred ccccceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc----------------cc Q lcl|Aclame:pro 73 QIGTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLT----------------VE 136 (274) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~----------------~~ 136 (274) ....+...+....+...+.|.+.++.+...|+.+...+..+.+++|+.|+.+++.+.+.... .. 
T Consensus 87 ~~~~~~r~~~~~d~~~~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~~~~~gt~v~~~ss~~i~~g 166 (322) T protein:vir:10 87 NKPFAKRRTNVDTYDTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPASIKGTGQPVEFLATQEIGDG 166 (322) T ss_pred ccccceEEEeecccccceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhccccccccccccccCCCcccccC Confidence 67777788888888888999999999999999999999999999999999988655432211 11 Q ss_pred CcccCHHHHHHHHHHHhhcCC---CccEEEEcHHHHHHHHhhhccccccccccccccc-cccccchhcceeeEEcCCCCc Q lcl|Aclame:pro 137 ADITKLDGLQTAIDKFNDEDL---EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNII-VKGAFGEALGAVIVRSNKLNK 212 (274) Q Consensus 137 ~~~~~~d~iv~a~~~l~~~~~---~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~-~~g~~~~i~G~~Vv~s~~~p~ 212 (274) +..++++.+++|..+|++++. .+++++++|..|..|+++.. +......+...+ .+|.+++|+|++|+.++++|. T Consensus 167 ~~g~t~~kl~~a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~d~~--~ts~D~~~~~~l~~~G~ig~~lGf~~i~s~~lp~ 244 (322) T protein:vir:10 167 TKPISFDYVTEITERFLENEIEPEVSKVIVIGPTQARKLLQITE--ATSADYTSAMDLQSKGIITNWMGYTWIVSTRLDK 244 (322) T ss_pred ccchhHHHHHHHHHHHHhcCCCCCCCeEEEeCHHHHHHHhcchh--hhhhhcccchhhhhcCeeeeeeeEEEEEeccCCc Confidence 235679999999999988765 35899999999999997753 443333343444 679999999999999999983 Q ss_pred ------------------ceEEEEcCCeEEEEeccCceeeecccccc-CccEEEEEEEEEEEEEcCcceEEEEeCCCc Q lcl|Aclame:pro 213 ------------------GEALLAKKGAVKLITKRDFFLEKDRDASR-KSTALYSDKHYVAYLYDESKVVKITKGAGD 271 (274) Q Consensus 213 ------------------~~~~l~~~~a~~~~~~~~~~ve~~r~~~~-~~~~i~~~~~~~~~v~~~~avv~l~~~aa~ 271 (274) ..+++++++|++|....++..+...++++ ..+.+++.+.||+++++|++||.|...-+= T Consensus 245 ~~~t~~~~~~~~~~~~~~~~~~a~~k~Av~~a~~~dv~~~i~~~~~~~~a~~I~~~~~~Ga~ri~~~gVv~i~~~e~~ 322 (322) T protein:vir:10 245 FDPTQWGMAAEDGPQGDEIWCIAMTDMALGYHSCKDIWTKVAEDPSASFAWRIYSAFTADCVRVEDEHIFKLRLKNSL 322 (322) T ss_pred cccccccccccCCCCccceeEEEEecCceeEEEeeeeeEEeeccCCcchhhhhhhhhhhCceEeccCcEEEEEEeccC Confidence 34789999999999999988886665554 468899999999999999999999985444 No 145 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: 
family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=99.90 E-value=1e-25 Score=157.54 Aligned_cols=262 Identities=13% Similarity=0.062 Sum_probs=193.2 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCC----CCcccccCCCccccccccc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYS----GDAQVIAEGEKIPVDQIGT 76 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~----~~a~~~~eg~~~~~~~~~~ 76 (274) .-+.+...+..++|+.+. ++++.+.+.+.+.+++++.... .+.+..|+..+.. ....|.+|+...++++++| T Consensus 18 ~~t~~d~~Gg~l~P~~~~-~~i~~~~e~s~~l~~~~vi~~~---~~~~~~i~~~g~~~~~~~g~~~~~~~~~~~~~~~~f 93 (315) T protein:vir:41 18 KIDVPDLGRGVLSVDRFG-EFVKAVRDSAVIIPEARIDNAL---KSYEKDISRLSLVLDVGPGRDETGQKLAPPESTAEV 93 (315) T ss_pred hcCCcCCCCceechHHHH-HHHHHHHhhhhhhhhceeeecc---ccccccccccccCcccccccccccCcCCCCCCcccc Confidence 111122223368899875 5778888889999988764322 2233445544321 2345778888889999999 Q ss_pred ceeEEeehhhhcchhccHHHHhccC--ccHHHHHHHHHHHHHHHHHHHHHHHHhcccc---------------------- Q lcl|Aclame:pro 77 SKREAKVRKIGKGTELTDEAVLSGF--GDPQGEAVRQHGLAIANKVDNDVLEALKGAT---------------------- 132 (274) Q Consensus 77 ~~~~~~~~~~~~~~~is~e~~~~s~--~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~---------------------- 132 (274) +++.+.++++...+.+|++++.++. +|+++++..+++++++++++..++..-.++. 
T Consensus 94 ~~~~l~~~~l~~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nGdg~s~~p~~~~~~G~l~~a~~~~~~~~ 173 (315) T protein:vir:41 94 KTNTLYMREMVTKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYLHGDTSSSDPLLRMSDGWLKLASEKLTESD 173 (315) T ss_pred ceeeeceeeeeeeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhhccCCcCcCccccccccceecccccccccc Confidence 9999999999999999999999985 5999999999999999999999986633210 Q ss_pred ccccCcccCHHHHHHHHHHHhhcCC---CccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCC Q lcl|Aclame:pro 133 LTVEADITKLDGLQTAIDKFNDEDL---EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNK 209 (274) Q Consensus 133 ~~~~~~~~~~d~iv~a~~~l~~~~~---~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~ 209 (274) .+..+...+.|.++++...|...+. .+.+|+||++++..+++.++.+ ....++..+..|+..+|+|+||+..+. T Consensus 174 ~~~~a~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~rklk~~~---g~~lw~~~~~~g~~~tl~G~PV~~~~~ 250 (315) T protein:vir:41 174 VDPEAEDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYRDALKGR---ETGLGDQALTGANSILYDGRPVQYVPA 250 (315) T ss_pred cccccccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHHHHhccC---CCccccchhhcCCCceecccceEeccc Confidence 0011223457788899999987664 4568999999999998876433 344566778888889999999999999 Q ss_pred CCc-----ceEEEEcCCeEEEEeccCceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeCC Q lcl|Aclame:pro 210 LNK-----GEALLAKKGAVKLITKRDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGA 269 (274) Q Consensus 210 ~p~-----~~~~l~~~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~a 269 (274) ||. 
+.+++.+...+.+..+++++++.+|+...+...++.+.|+|+.+..+++.+.-..+- T Consensus 251 m~~~~~~~~~ilf~d~~nl~~~~~~~i~i~~~~~a~~~~~~~~~~~r~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 251 LEALNDGKSRALFVVPTQLVYGFWRNIKVVPDYDAEMRLTKYVASLRTDNHYEDEEGAVSATITV 315 (315) T ss_pred ccccCCCCccEEEecccceEEEeccccEEEeeecCCCCceEEEEEEEeceeEEeccceeEeeeeC Confidence 863 444555566677888899999999999999999999999999887666633323222 No 146 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=99.89 E-value=3.4e-25 Score=154.73 Aligned_cols=267 Identities=17% Similarity=0.219 Sum_probs=204.1 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccc---cc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIG---TS 77 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~---~~ 77 (274) |+ |..++++||.++++.++|..++....+.+++.... ..|.+..||.++.+ -+.+++||.++|...+. 
++ T Consensus 74 mt---t~~a~IliP~vis~v~~Eaaepl~~~~kl~qk~~L---~~Grsm~F~~~g~~-Ra~~IgEGgE~~~~sld~~T~d 146 (393) T protein:vir:79 74 MA---TPSAQILIPRVIVGTMREAAEPLYIGTKMLQKIRL---KSGQSMIFPSIGIM-RAYDVAEGQEIPEDSIDWQTHE 146 (393) T ss_pred hc---CCCcceechhhhhhhhhhcccchhHHHHHHHHHhh---hcCcceeccchhee-eeccccccccccccchhhhcCC Confidence 55 77789999999999999988877666777765432 24667888888866 47789999999998654 78 Q ss_pred eeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc--------------------cccC Q lcl|Aclame:pro 78 KREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL--------------------TVEA 137 (274) Q Consensus 78 ~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~--------------------~~~~ 137 (274) .+++..+|.|..+.+|+|++.+|++|++..+...+.++++|+.+..++..+.+..+ ...+ T Consensus 147 sv~~~~gK~G~~Ia~SqEmIsDSg~Dvin~~l~aA~RaMaRkKee~a~n~fk~~ghtvfDa~st~t~ahptGr~~~~~qN 226 (393) T protein:vir:79 147 SPEIRVGKSGIRLRFTDEMISDSQWDLMSMMIKQAGRAMGRHKEQKAYHQFRSHGHTVFDNYSTNKLAHTTGLDKNGVQN 226 (393) T ss_pred ceeEEechhhhhhhhHHHHhhcchHHHHHHHHHHHHHHHHhhhHHHHHhhhhcccceeeeccccCccceeecCCcccccc Confidence 89999999999999999999999999999999999999999999999977755432 1235 Q ss_pred cccCHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchh-----------cceeeEE Q lcl|Aclame:pro 138 DITKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEA-----------LGAVIVR 206 (274) Q Consensus 138 ~~~~~d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i-----------~G~~Vv~ 206 (274) ++++.++++|+......+.+++++++|||-.|..+.|....+.......++-.-+.-...+- +.+.|++ T Consensus 227 GTlSleDllDm~~av~~~hyt~svi~MHPLAWnv~AKna~me~~~~na~gN~~~~~~~ts~algp~~i~~~~~~nlnv~~ 306 (393) T protein:vir:79 227 DTFSAEDFLDLIIAVMANEYTPSDLMMHPLAWTVFAKNELMGSLQANPYGNYPAKGAPSSMALGPDSIQGRLPFNFNVNL 306 (393) T ss_pred ccccHHHHHHHHHHHhcccCCcceEEEcCchhhhhhhhhhhcceeeccccccCccccchhhhhchhhhccccccceeEEE Confidence 67889999999999999999999999999999999887654443333222211111111122 3489999 Q ss_pred 
cCCCCcc------eEEEEcCCeEEEE-eccCceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeC--CCcccC Q lcl|Aclame:pro 207 SNKLNKG------EALLAKKGAVKLI-TKRDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKG--AGDEVM 274 (274) Q Consensus 207 s~~~p~~------~~~l~~~~a~~~~-~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~--aa~~~~ 274 (274) |+.+|-. ..|..+++.++.. ...+++++...|.-++-+.+.-+.|||.+|++....+..-+- -+..+- T Consensus 307 sPfvp~d~k~~rFd~~~Vd~NnvgvlLV~D~i~tdq~ddk~rdiq~iKl~ERYG~gvLn~gkaiavakNI~~~k~y~ 383 (393) T protein:vir:79 307 SPFIPLDKKSRRFDVYAVDRNNVGVLLVRDDLKTDQWDEKARGLQNIKMIERYGIGILNEGKAIAVAKNISMDKSYA 383 (393) T ss_pred ecccccccccceeeEEEeecCCceEEEEecCcceeccccccccceeeeeeeeeceeeeeCCceEEEEecceeecccc Confidence 9999943 3467788888754 566788888888889999999999999999988766543221 111111 No 147 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=99.89 E-value=1.2e-25 Score=157.23 Aligned_cols=270 Identities=11% Similarity=0.039 Sum_probs=205.3 Q ss_pred CCccccchh--------h-ccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccc Q lcl|Aclame:pro 1 MAQGTTKVS--------N-LIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPV 71 (274) Q Consensus 1 ma~~~T~~~--------~-~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~ 71 (274) |+...+.+. + .+-=|+|+.+|.+.++..+++.++.... ++ .+|++++||+.+.. +++.+..|.++.. 
T Consensus 1 Ms~~n~~t~~~~~~s~~~~al~le~f~geV~taF~~~si~~~~~~vr-ti--~~GkS~qf~~iG~~-~a~y~~~G~~ldg 76 (402) T protein:vir:97 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQ-TV--TGTNTVSNKYLGET-ELQVLAPGQSPNA 76 (402) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceee-ee--cccceEEEEEEeee-EEeeeccccccCC Confidence 876443331 1 2333889999999999999998888763 33 46899999999875 6889999999988 Q ss_pred cccccceeEEeehhhhc-chhccHHHHhccCcc-HHHHHHHHHHHHHHHHHHHHHHHHhccccc---------------- Q lcl|Aclame:pro 72 DQIGTSKREAKVRKIGK-GTELTDEAVLSGFGD-PQGEAVRQHGLAIANKVDNDVLEALKGATL---------------- 133 (274) Q Consensus 72 ~~~~~~~~~~~~~~~~~-~~~is~e~~~~s~~d-~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~---------------- 133 (274) +.+..++..+++..+-. .+.|.+.+..++.+| +.+.+.+++++++++..|+.++..+..+.. T Consensus 77 ~~~~~~k~~ItID~lL~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~~~~~~g 156 (402) T protein:vir:97 77 TPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHG 156 (402) T ss_pred CCcccccEEEEeCceeechhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCcccccc Confidence 88999999999987544 477999999999999 899999999999999999988764432110 Q ss_pred -----ccc--CcccCH----HHHHHHHHHHhhcCC--CccEEEEcHHHHHHHHhhhccccccccccccccccccccchhc Q lcl|Aclame:pro 134 -----TVE--ADITKL----DGLQTAIDKFNDEDL--EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEAL 200 (274) Q Consensus 134 -----~~~--~~~~~~----d~iv~a~~~l~~~~~--~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~ 200 (274) +.+ ...++. +.|.+|...|.+.+. ..++++++|+.|+.|++.+..-.......+.+...+|.++.+. 
T Consensus 157 ~s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~dRv~vv~P~~y~~Ll~~~rl~n~d~~~~~~g~~~~G~v~~v~ 236 (402) T protein:vir:97 157 FSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSSY 236 (402) T ss_pred cccccccccchhhcCHHHHHHHHHHHHHHHHhcCCCccccEEEeChHHHHHHhhcccccchhhccccCCccccceeEEEe Confidence 000 111233 455677778877654 6799999999999999865422122222344557899999999 Q ss_pred ceeeEEcCCCCcc---------------------------eEEEEcCCeEEEEeccCceeeeccccccCccEEEEEEEEE Q lcl|Aclame:pro 201 GAVIVRSNKLNKG---------------------------EALLAKKGAVKLITKRDFFLEKDRDASRKSTALYSDKHYV 253 (274) Q Consensus 201 G~~Vv~s~~~p~~---------------------------~~~l~~~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~ 253 (274) |++|+.|+++|.+ .+++|++.|++...-.+++.+.+++..++.+.|.....|| T Consensus 237 Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~vT~~~~~d~r~~~~~id~~~a~G 316 (402) T protein:vir:97 237 NCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMAEG 316 (402) T ss_pred ceEEEecCccccccccccccccccCCCCccCCcCcccceeEEEEEecceEEEEEeeccccchhhchhHHHHHHHHHHHhC Confidence 9999999999831 1367899999999999999999999999999999999999 Q ss_pred EEEEcCcceEEEEeCC--CcccC Q lcl|Aclame:pro 254 AYLYDESKVVKITKGA--GDEVM 274 (274) Q Consensus 254 ~~v~~~~avv~l~~~a--a~~~~ 274 (274) .++.+|+++..++..- -+++- T Consensus 317 ~g~~RPeaa~vv~~~~~~t~~~~ 339 (402) T protein:vir:97 317 AIPDRWEAVSVVTTKRDATTGDA 339 (402) T ss_pred CcccCccceEEEEEecccccccC Confidence 9999999988874332 11111 No 148 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=99.88 E-value=4.2e-25 Score=154.25 Aligned_cols=264 Identities=17% Similarity=0.145 Sum_probs=189.6 Q ss_pred CCcc-ccchhhccchHHHH--HHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccc Q lcl|Aclame:pro 1 MAQG-TTKVSNLIVPEVLA--PMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTS 77 (274) Q 
Consensus 1 ma~~-~T~~~~~~iPe~~~--~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~ 77 (274) ||.+ .|+..|+..|+.+. +.+...+.+.+..-++. +..+-..|+++++|+|..++++++++||+.||.++++.. T Consensus 1 mAe~nlt~~~dL~~~~sidfv~~f~~~i~~L~~~Lgi~---r~~p~a~G~tIt~pK~~~tgda~dVaEGe~Iplskvt~~ 77 (295) T protein:vir:99 1 MAEKNLNTMADLGDIKSIDFVNKFSKNINDLLKLLGVT---RRETLTNDLKIQTYKWEVTLDQTDPGEGETIPLSKVTRT 77 (295) T ss_pred CCCcccccHhhccCceeehhhHHhhhhHHHHHHHhccc---cccccccCCeEEeeeeeeecccccccCCcccchhhheee Confidence 9976 77777777676433 33322222322222222 233334589999999999999999999999999999986 Q ss_pred ---eeEEeehhhhcchhccHHHH-hccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCccc--CHHHHHHHHHH Q lcl|Aclame:pro 78 ---KREAKVRKIGKGTELTDEAV-LSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADIT--KLDGLQTAIDK 151 (274) Q Consensus 78 ---~~~~~~~~~~~~~~is~e~~-~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~--~~d~iv~a~~~ 151 (274) ..+++++|+++.+ |+|.+ .....++.....++|...+++++|+.+++.+++++.+.....+ .++.+.+++.. T Consensus 78 ~~~t~t~kikK~rK~t--TdEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lktat~t~tg~~lq~a~a~~~~al~~ 155 (295) T protein:vir:99 78 KDKDYTVKWFKKRRAT--TAEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKTKPTKVKGVGLQKALSASWAKLAT 155 (295) T ss_pred eeeeeEEEeeeecccc--cHHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhccCceeeehhhHHHHHHHhhhhhhh Confidence 4778888988865 99997 5666789999999999999999999999999999988776643 67777888888 Q ss_pred HhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhccee-eEEcCCCCcceEEEEcCCeEEEE--ec Q lcl|Aclame:pro 152 FNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAV-IVRSNKLNKGEALLAKKGAVKLI--TK 228 (274) Q Consensus 152 l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~-Vv~s~~~p~~~~~l~~~~a~~~~--~~ 228 (274) +.+.+..+.+++|||.+.+.||+++..++...+..|.+.+.+ ++|+. |++|+.+|+|++|......+.+. 
.- T Consensus 156 f~Ee~~~~~V~FVnP~D~a~yl~~A~~~~~~a~~fG~~~L~n-----fLG~q~II~S~kv~~G~~~aT~~~Ni~~ay~~~ 230 (295) T protein:vir:99 156 FNEFEGSPLVSFVSPLDVANYLGDTKVGADASNVFGMTLLKN-----FLGMQNVIVMPSVPEGKIYSTAVENLVFASLNV 230 (295) T ss_pred cccccCCceEEEEehHHHHHHHhccccccchhhhhhhhhhhh-----hhccceEEEcccCCCceEEEeeccceEEEEecC Confidence 888777888999999999999999887777776677776664 99996 99999999999998877665542 22 Q ss_pred cCceeeeccccccCccEEEEEEE-------------EEEE---EEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 229 RDFFLEKDRDASRKSTALYSDKH-------------YVAY---LYDESKVVKITKGAGDEVM 274 (274) Q Consensus 229 ~~~~ve~~r~~~~~~~~i~~~~~-------------~~~~---v~~~~avv~l~~~aa~~~~ 274 (274) ..-.+.....-..+.+.+.+..+ +.+- .-.+++||+.+..++-.-- T Consensus 231 ~~g~l~~~f~~~~D~tglIg~~h~~~~~~~t~et~~~~~~~lfpE~~dgiv~~tI~~~~~~~ 292 (295) T protein:vir:99 231 KGGDLGGLFADFTDETGLIAAARNRQLSNLTYESVFFGANVLFAEIPEGVVEATIEAAAVPG 292 (295) T ss_pred CchhhhhhhhhccCcccceEEEeccccceeeehhhhHhHHHhcccccceEEEEEEecCcCCC Confidence 21111111111223333333222 1111 2267889999886654333 No 149 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=99.88 E-value=1.4e-24 Score=151.34 Aligned_cols=263 Identities=12% Similarity=0.059 Sum_probs=176.1 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhccccccccccc---ccCCCEEEEEeecCCCCcccccC--CCcccccccc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLV---GQPGDTLTFPAFTYSGDAQVIAE--GEKIPVDQIG 75 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~---~~~G~~v~ip~~~~~~~a~~~~e--g~~~~~~~~~ 75 (274) |||+..+ ++||.|+..+++.+++++++.++++++++-+ ++.|++|+||+... ..+.++.. 
+..+..++++ T Consensus 1 MAN~llT----~iP~iia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p~~-~~v~d~~~~~~~~~~~~~~~ 75 (423) T protein:vir:35 1 MANNLES----NISQIVLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRPHQ-FKSERTETGDITGKDKNGLF 75 (423) T ss_pred Cccchhh----hhHHHHHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeCCc-ceeecccCcCCCCccccccc Confidence 9976543 6899999999999999999999998877533 35699999998764 35666643 4567888999 Q ss_pred cceeEEeehhh-hcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcc-ccccc-c--CcccCHHHHHHHHH Q lcl|Aclame:pro 76 TSKREAKVRKI-GKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKG-ATLTV-E--ADITKLDGLQTAID 150 (274) Q Consensus 76 ~~~~~~~~~~~-~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~-a~~~~-~--~~~~~~d~iv~a~~ 150 (274) ..++++++.+. +..|+++++++.++..++++ ..++.++++++++|+.++..+.. +.... + .....|+.++++.. T Consensus 76 e~~v~l~id~~k~~a~~v~d~e~~l~i~~~~~-~l~~a~~ala~~vd~~l~~~l~~~a~~~vgt~~t~~~~~~~i~~a~~ 154 (423) T protein:vir:35 76 SAKATGKVGKYITVAVEWTQIEEALKLNQLDQ-ILSPIHERMVTDLETELAHFMMNNGALSLGSPNTAIKKWADVAQTAS 154 (423) T ss_pred cceeeEEeccceeccceeCHHHHHhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhccccccccccCCcchHHHHHHHHH Confidence 99999999875 55799999999988888865 55566788999999999876643 33221 1 12246899999999 Q ss_pred HHhhcCC--CccEEEEcHHHHHHHHhhhcccccccccccccccccccc-chhcceeeEEcCCCCcceEEEEcCCeEEEEe Q lcl|Aclame:pro 151 KFNDEDL--EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAF-GEALGAVIVRSNKLNKGEALLAKKGAVKLIT 227 (274) Q Consensus 151 ~l~~~~~--~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~-~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~~ 227 (274) .|++.+. .+|++|++|..+..|++... .+......+...+++|++ |+++|+.|++|+++|..+..-++...... . 
T Consensus 155 ~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~-~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnvp~~T~gt~~~~~~v~-~ 232 (423) T protein:vir:35 155 FIKDIGIKTGENYAIMDPWSAQRLADAQS-GLHAADQLVRTAWENAQISGNFGGIRALMSNGLASRKQGDFDGAITVK-T 232 (423) T ss_pred HHHHhcCCcCCCEEEeCHHHHHHHhcccc-ceeccccchhHHHhhccceeeecceEEEEcCCCccccccccccceeec-c Confidence 9998765 67999999999999986532 344444555677888876 99999999999999986665443322110 0 Q ss_pred ccCcee---------------e--eccccccCccEEEEEEEEEEEEEcCcceEE-----------EEeC--------CCc Q lcl|Aclame:pro 228 KRDFFL---------------E--KDRDASRKSTALYSDKHYVAYLYDESKVVK-----------ITKG--------AGD 271 (274) Q Consensus 228 ~~~~~v---------------e--~~r~~~~~~~~i~~~~~~~~~v~~~~avv~-----------l~~~--------aa~ 271 (274) ...+.. . ...+.....|. ...-|.+.++|..-.. .+.. +++ T Consensus 233 a~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD~---~t~aGv~~v~~~t~~~~~~~~t~~~~~~~V~~~~~~~a~g~~ 309 (423) T protein:vir:35 233 APNVDYLSVKDSYQFTVALTGATPSKTGFLKAGDQ---LKFTSTHWLNQQSKQTLYNGSTAMSFTATVLEETNSTASGDV 309 (423) T ss_pred ccccccccccccccceeeeeeeeeccCCcEEecce---EEeeeeeeccccccceeecccCCceeEEEEeccccccccCce Confidence 000000 0 11111122221 2233444443333221 1111 122 Q ss_pred ccC Q lcl|Aclame:pro 272 EVM 274 (274) Q Consensus 272 ~~~ 274 (274) .|- T Consensus 310 ~v~ 312 (423) T protein:vir:35 310 TVK 312 (423) T ss_pred eEE Confidence 221 No 150 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=99.88 E-value=8e-24 Score=147.22 Aligned_cols=262 Identities=14% Similarity=0.107 Sum_probs=184.9 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhccccccccccc---ccCCCEEEEEeecCCCCcccccCC--Ccccccccc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLV---GQPGDTLTFPAFTYSGDAQVIAEG--EKIPVDQIG 75 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~---~~~G~~v~ip~~~~~~~a~~~~eg--~~~~~~~~~ 75 (274) |||..|. ++|++|+..+++.+++++++.++++++++-+ ++.||+|+||+.... 
.+.....+ ...+.+++. T Consensus 1 MANsl~~----l~p~iia~~al~~l~~~lV~~~lV~r~y~~ef~~ak~GDTV~I~~P~~~-~~~d~~~~~~t~~~~~~l~ 75 (423) T protein:vir:10 1 MANNLDA----NVSQIVLKKFLPGFMSDLVLCKTVDRQLLAGEINSSTGDSVSFKRPHQF-KSERTMDGDITGKSKNSLI 75 (423) T ss_pred Ccccccc----ccHHHHHHHHHHHHHhhcccchhhccCCCccccccccCCEEEEeeCCce-eeecccCcccCcccccccc Confidence 8876654 9999999999999999999999999887533 457999999987643 34332211 223456777 Q ss_pred cceeEEeehhh-hcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccc-cccc---cCcccCHHHHHHHHH Q lcl|Aclame:pro 76 TSKREAKVRKI-GKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGA-TLTV---EADITKLDGLQTAID 150 (274) Q Consensus 76 ~~~~~~~~~~~-~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a-~~~~---~~~~~~~d~iv~a~~ 150 (274) ..++++++.+. ...|+++++++.++..++ +.+.++.+++++.++|+.+...+... .... ......|++++++.. T Consensus 76 e~~v~l~id~~k~~a~~v~d~E~~l~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~~~~~vgt~~t~~~a~~~~a~a~~ 154 (423) T protein:vir:10 76 SAKATGEVGNYITVAVEYRQIEEALKLNQL-DQILVPINERMVTDLETELALFMMKHGALSLGSPNTPIKKWSDVAQTAS 154 (423) T ss_pred cceEEEEecceeeeeeeeChHHHhcChhHH-HHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccHHHHHHHHH Confidence 88899999876 457999999988887888 56788889999999999997555332 2211 112235899999999 Q ss_pred HHhhcCC--CccEEEEcHHHHHHHHhhhcccccccccccccccccccc-chhcceeeEEcCCCCc---ce---------- Q lcl|Aclame:pro 151 KFNDEDL--EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAF-GEALGAVIVRSNKLNK---GE---------- 214 (274) Q Consensus 151 ~l~~~~~--~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~-~~i~G~~Vv~s~~~p~---~~---------- 214 (274) .|++.+. ..|++|++|..++.|+++.. .+......+...+++|++ |+++|+.|++|+++|. 
++ T Consensus 155 ~L~~~~vP~~~R~~Vv~p~~~a~Ll~~~~-~~~~~~~~~~~alr~~~i~G~~~GFdi~~Sn~vp~~T~g~~~ga~~~~~~ 233 (423) T protein:vir:10 155 FLKDLGINSGENYAVMDPWAAQRLADAQS-GLHVSEQLVRTAWENAQISGNFGGIRALMSNGLASRTQGAFGGKLTVKGT 233 (423) T ss_pred HHhhccCCcCCCEEEeCHHHHHHHhhhhh-hhccccccchHHHHhcccceeecceEEEEecCCcccccccccceeeeeee Confidence 9988764 67999999999999986432 223334455567888876 8999999999999982 11 Q ss_pred E------------------------------------------------------------------------------- Q lcl|Aclame:pro 215 A------------------------------------------------------------------------------- 215 (274) Q Consensus 215 ~------------------------------------------------------------------------------- 215 (274) . T Consensus 234 ~~vt~a~~~~~~~~~~~~~~~T~s~~g~l~~GD~~t~aGv~~v~~~tk~~l~~~~~~~~~~~~V~~~~~~~a~~~~tv~i 313 (423) T protein:vir:10 234 PEVNYDSVKDSYAFTATLTGATASKKGFLKVGDQLQFDDTHWLNQQSKQTLYNGASALSFTATVMEDANAHSSGDVTVKI 313 (423) T ss_pred eEEEecccccccccccceeeccceeceeEEecceEeecceeeecccccceeecccCCcceEEEEEecccccccCceEEEe Confidence 0 Q ss_pred ---------------------------------------EEEcCCeEEEEec-----------------cCceeeecccc Q lcl|Aclame:pro 216 ---------------------------------------LLAKKGAVKLITK-----------------RDFFLEKDRDA 239 (274) Q Consensus 216 ---------------------------------------~l~~~~a~~~~~~-----------------~~~~ve~~r~~ 239 (274) +.||+++|....+ -.+++.+++|. 
T Consensus 314 ~p~~~~~~~~~~~~~V~a~~a~~~~vT~~~~~~~t~~~nl~~~~~a~~l~~~pl~~~~~~~~~~~~~~g~s~r~~~~~d~ 393 (423) T protein:vir:10 314 SGVPIFDAGYPQYNAVDRLLAEGDTVSVIGTSKQAMKPNLFYNKLFCGLGTIPLPKLHSIDSAVATYEGFSIRVHKYADG 393 (423) T ss_pred ccccccccCcccccceeccccCCceeEEeeccCCceeEEEEecCcceEEEEEcccCCCccceeecccccceEEEEEeeec Confidence 0111122211110 01233345555 Q ss_pred ccCccEEEEEEEEEEEEEcCcceEEEEeCC Q lcl|Aclame:pro 240 SRKSTALYSDKHYVAYLYDESKVVKITKGA 269 (274) Q Consensus 240 ~~~~~~i~~~~~~~~~v~~~~avv~l~~~a 269 (274) ....+.++...-||++.++|+=.|++-..- T Consensus 394 ~~~~~~~r~d~l~g~~~~~p~~~~~~~g~~ 423 (423) T protein:vir:10 394 DANKQMMRFDLLPAYVCYNPHMGGQFFGNP 423 (423) T ss_pred cccceEEEEEeecceeeeccceEEEEEecC Confidence 556667777778999999999888876555 No 151 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=99.87 E-value=5.2e-24 Score=148.21 Aligned_cols=264 Identities=12% Similarity=0.070 Sum_probs=175.9 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhccccccccccc---ccCCCEEEEEeecCCCCccccc--CCCcccccccc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLV---GQPGDTLTFPAFTYSGDAQVIA--EGEKIPVDQIG 75 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~---~~~G~~v~ip~~~~~~~a~~~~--eg~~~~~~~~~ 75 (274) |||+..+ ++|+.|+..+++.+++++++.++++++++-+ ++.||+|+||+... ..+..+. 
.+..+..++++ T Consensus 1 MaN~llT----~ip~iia~~al~~l~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~~-~~~~~~~~~~~~~~~~~~l~ 75 (423) T protein:vir:17 1 MPNNLDS----NVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQ-FSSLRTPTGDISGQNKNNLI 75 (423) T ss_pred Cccchhh----hhHHHHHHHHHHHHHhhcccchhhcccCCcchhhcccCCEEEEeeCCc-ceeecccCcccCCcccCccc Confidence 9876543 6899999999999999999999998877432 45799999998654 3455543 44457788999 Q ss_pred cceeEEeehhh-hcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc----ccCcccCHHHHHHHHH Q lcl|Aclame:pro 76 TSKREAKVRKI-GKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLT----VEADITKLDGLQTAID 150 (274) Q Consensus 76 ~~~~~~~~~~~-~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~----~~~~~~~~d~iv~a~~ 150 (274) ..++++++.+. ...|+++++++.++..++ +.+.++.+++++..+|+.+++.+.+.... .......|+.++++.. T Consensus 76 e~~v~l~id~~k~va~~v~d~E~~~~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~a~~~~gt~~t~~~a~~~i~~a~~ 154 (423) T protein:vir:17 76 SGKATGRVGNYITVAVEYQQLEEAIKLNQL-EEILAPVRQRIVTDLETELAHFMMNNGALSLGSPNTPITKWSDVAQTAS 154 (423) T ss_pred cceeEEEeeceeeeeeeecHHHHhcChhHH-HHHHHHHHHHHHHHHHHHHHHHHhhccccccccCCcccccHHHHHHHHH Confidence 99999999875 457999999988877777 56777788999999999998776443221 1122246999999999 Q ss_pred HHhhcCC--CccEEEEcHHHHHHHHhhhcccccccccccccccccccc-chhcceeeEEcCCCCcceEEEEcCCeEEEE- Q lcl|Aclame:pro 151 KFNDEDL--EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAF-GEALGAVIVRSNKLNKGEALLAKKGAVKLI- 226 (274) Q Consensus 151 ~l~~~~~--~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~-~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~- 226 (274) .|++.+. .+|++|++|..+..|+++.. .+......+...+++|++ |+++|+.|++|+++|..+...++..+.... 
T Consensus 155 ~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~-~~~~~~~~~~~alr~g~i~G~i~GFdvy~Snnip~~T~gt~~~t~~~~~~ 233 (423) T protein:vir:17 155 FLKDLGVNEGENYAVMDPWSAQRLADAQT-GLHASDQLVRTAWENAQIPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQ 233 (423) T ss_pred HHHhccCCcCCCEEEeChHHHHHHhcccc-ceecccccchHHHhhccceeeecceEEEEeCCCccccccceeceeeeccc Confidence 9998765 67999999999999987542 233333445567899987 899999999999999877766654322110 Q ss_pred ------eccC-----ceee----eccccccCccEEEEEEEEEEEEEcCc--------------ceEEEE-----eCCCcc Q lcl|Aclame:pro 227 ------TKRD-----FFLE----KDRDASRKSTALYSDKHYVAYLYDES--------------KVVKIT-----KGAGDE 272 (274) Q Consensus 227 ------~~~~-----~~ve----~~r~~~~~~~~i~~~~~~~~~v~~~~--------------avv~l~-----~~aa~~ 272 (274) ...+ ..+. .+.+.....|.+ ..-|.+.++|. .+++.. ..++++ T Consensus 234 ~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~---t~aGv~~v~~~tk~v~~~~~t~~~~~~~v~~~~~~~a~~~~t 310 (423) T protein:vir:17 234 PTVTYNAVKDSYQFTVTLTGATTSVTGFLKAGDQV---KFTNTYWLQQQTKQALYNGATPISFTATVTADANSDSSGDVT 310 (423) T ss_pred ccccccccccccceeeeeeeeeeeccCceeecceE---EecceeeecccccccccccccccceEEEEEecccccccCceE Confidence 0000 0000 111111222222 22233333222 222211 111122 Q ss_pred cC Q lcl|Aclame:pro 273 VM 274 (274) Q Consensus 273 ~~ 274 (274) |- T Consensus 311 v~ 312 (423) T protein:vir:17 311 VT 312 (423) T ss_pred EE Confidence 21 No 152 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=99.87 E-value=3e-23 Score=144.10 Aligned_cols=268 Identities=12% Similarity=0.026 Sum_probs=197.8 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcc-cccccccccccCCCEEEEEeecCCCCcccccCCCccccccccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQ-FADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~-l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~ 79 (274) .|+..---+++..-+.|+..+.+.+...+.-.. +++ +.+...+|++|+||+++. 
.+..+|..+..+..++++.+.. T Consensus 30 ~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N--~~~e~~~g~tVkIp~i~~-~gl~DY~R~~g~~~g~vt~~~~ 106 (329) T protein:vir:10 30 FANKSVEPGDTLLKNKHVGILEKVTAANSYSAPAVIS--NDAIFMQGRSFTVIKGDV-TELKDYKRNATNEFDHPQIQET 106 (329) T ss_pred hcCCccCCchhHHHHHHHHHHHHHHHhhceeeeeecc--cceeeccCcEEEEeeecc-cccccccCCCCcccccccccee Confidence 555555555566668889999888877654433 333 345567899999999976 5699999988999999999999 Q ss_pred EEeehh-hhcchhccHHHHhccCccH--HHHHHHHHHHHHHHHHHHHHHHHhccccc----cccCcccCHHHHHHHHHHH Q lcl|Aclame:pro 80 EAKVRK-IGKGTELTDEAVLSGFGDP--QGEAVRQHGLAIANKVDNDVLEALKGATL----TVEADITKLDGLQTAIDKF 152 (274) Q Consensus 80 ~~~~~~-~~~~~~is~e~~~~s~~d~--~~~~~~~~a~~~a~~~d~~~i~~~~~a~~----~~~~~~~~~d~iv~a~~~l 152 (274) ++++.+ .+..|.+.+.+..++...+ ...+.+.+...++..+|+..++.+.+... ...+....|+.|.++...| T Consensus 107 t~tidqdR~~~F~VD~~D~dEtn~~l~a~~i~~~~~~~~v~pEiDay~~skla~~a~~~~~~~~t~~nay~~i~~a~~~L 186 (329) T protein:vir:10 107 TYFLDQEKYWGRFVDALDRRDTEGNIDINYVVAKQASEVVAPYLDNLRFATLARNKAKHLTVGSGADAQYDAVLDVSVEL 186 (329) T ss_pred EEEeecccceeeecchhhHhhhhhhhhHHHHHHHHHHHHhhhHHHHHHHHHHHhhcccccccccCHHHHHHHHHHHHHHH Confidence 999964 6788999999888876544 45566778888899999988877744322 2223344588999999999 Q ss_pred hhcCC-CccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCC--CCcceEEEEcCCeEEEEecc Q lcl|Aclame:pro 153 NDEDL-EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNK--LNKGEALLAKKGAVKLITKR 229 (274) Q Consensus 153 ~~~~~-~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~--~p~~~~~l~~~~a~~~~~~~ 229 (274) .++.. .+++++|+|..+..|+++. .|.......+..+.+|.+++++|++|+.+++ ++....++++++++.+..+. 
T Consensus 187 de~~vp~~Rvl~VtP~~~~~Lk~~~--~f~~~~~~~~~~~~~g~Vg~idG~~Ii~vps~~~k~in~ii~~~~A~~~~~K~ 264 (329) T protein:vir:10 187 DEIGAGASRILFVTPKFYKGIKKFV--IELPQGDNRQQVLGKGVQGELDGFTIVKVPSKMLQGVEAMAVIGEVMASPIQA 264 (329) T ss_pred HhcCCCCCcEEEeCHHHHHHHHhhh--hhhccccccccceeeeeeeeecCeEEEEecCCcccceeEEEEcCCceeeeeee Confidence 88754 6789999999999998754 4555555555678899999999999998643 33445677888998887765 Q ss_pred Cceeeecc-ccccCccEEEEEEEEEEEEEcCc--ceEEEEeCCCcccC Q lcl|Aclame:pro 230 DFFLEKDR-DASRKSTALYSDKHYVAYLYDES--KVVKITKGAGDEVM 274 (274) Q Consensus 230 ~~~ve~~r-~~~~~~~~i~~~~~~~~~v~~~~--avv~l~~~aa~~~~ 274 (274) . .++..+ .+.++.+.++++.+||++|++|+ +|....++++.+-= T Consensus 265 ~-~~~~~~p~~~~~a~~v~gr~yyd~~V~~~k~~~I~~~~~~a~~~~~ 311 (329) T protein:vir:10 265 N-EAKLNSNVPGMFGTLAEQMLYTGAFVPEHLQKYIFTIGGKEVETNR 311 (329) T ss_pred e-eeeeeCCCCccchheeeeeeeeeeEEEccccCEEEEecccCcccCC Confidence 5 445444 46677899999999999999988 44444444333333 No 153 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=99.87 E-value=3.4e-23 Score=143.75 Aligned_cols=269 Identities=13% Similarity=0.042 Sum_probs=196.2 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274) .|+.---.+++..-+.|+..+.+.+...++-+.+. .++.+.+.+|++|+||+++. 
.+..+|..+..+..++++.+..+ T Consensus 19 ~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~-~N~~~e~~gg~tVkIp~i~~-~gl~DY~R~~g~~~g~vt~~~~t 96 (319) T protein:vir:97 19 FANKSVEPGQTLLKNKHVGILERVTAVNAYSTPAL-ISNDAIFMEGRSFTVMKGDT-TELKDYKRNATNEFDHPKIEETT 96 (319) T ss_pred hhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhcc-cCcceEeccCcEEEEeeecc-cccccccCCCCcccCCcccceeE Confidence 56655555666777889999877766655443332 23446667899999999986 56999999889999999999999 Q ss_pred Eeehh-hhcchhccHHHHhccCccH--HHHHHHHHHHHHHHHHHHHHHHHhcccccc----ccCcccCHHHHHHHHHHHh Q lcl|Aclame:pro 81 AKVRK-IGKGTELTDEAVLSGFGDP--QGEAVRQHGLAIANKVDNDVLEALKGATLT----VEADITKLDGLQTAIDKFN 153 (274) Q Consensus 81 ~~~~~-~~~~~~is~e~~~~s~~d~--~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~----~~~~~~~~d~iv~a~~~l~ 153 (274) +++.+ .+..|.+.+.+..++...+ ...+.+.+...++..+|+..++.+.+.... ..+....|+.|.++...|. T Consensus 97 ~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~~~~~~~t~~n~y~~i~~a~~~Ld 176 (319) T protein:vir:97 97 YFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTVGTGSDAQYDAVLDVSVELD 176 (319) T ss_pred EEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcccccccccCHHHHHHHHHHHHHHHH Confidence 99964 7788999999888876554 456677788888889999888776543322 2233445889999999998 Q ss_pred hcCC-CccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCC--CCcceEEEEcCCeEEEEeccC Q lcl|Aclame:pro 154 DEDL-EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNK--LNKGEALLAKKGAVKLITKRD 230 (274) Q Consensus 154 ~~~~-~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~--~p~~~~~l~~~~a~~~~~~~~ 230 (274) ++++ .+++++|+|..+..|+++. .|......++..+.+|.++++.|++|+..++ +..-..++++++++.+..+.. 
T Consensus 177 e~~VP~~Rvl~Vtp~~~~~L~~~~--~f~~~~~~~~~~~~~g~Vg~idG~~Vi~vps~~~k~in~i~~h~~A~~~~~k~~ 254 (319) T protein:vir:97 177 EIKAPENRVLFVSPTFYKGIKKFV--IALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASPIQAD 254 (319) T ss_pred hcCCCCCcEEEeCHHHHHHHHhhh--hhhccccccccceeeeeceeecCeEEEEecccccccceEEEEcCCeeeeeeeee Confidence 8764 6799999999999998764 4555555556678999999999999998643 334456677888887766544 Q ss_pred ceeeecc-ccccCccEEEEEEEEEEEEEcCc--ceEEEEeCCCcccC Q lcl|Aclame:pro 231 FFLEKDR-DASRKSTALYSDKHYVAYLYDES--KVVKITKGAGDEVM 274 (274) Q Consensus 231 ~~ve~~r-~~~~~~~~i~~~~~~~~~v~~~~--avv~l~~~aa~~~~ 274 (274) .++..+ .+.++.+.++++.+||+.|++|+ +|.....+.+..=- T Consensus 255 -~~~~~~p~~~~~a~~v~gr~y~d~~V~~~k~~~Iy~~~~~~~~~~~ 300 (319) T protein:vir:97 255 -LAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKR 300 (319) T ss_pred -eeeccCCCccccceeeeeeeeeeeEEeccccceEEEeecCCcccCC Confidence 344433 46677899999999999999988 44332222211111 No 154 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=99.87 E-value=3.4e-23 Score=143.75 Aligned_cols=269 Identities=13% Similarity=0.042 Sum_probs=196.2 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274) .|+.---.+++..-+.|+..+.+.+...++-+.+. .++.+.+.+|++|+||+++. 
.+..+|..+..+..++++.+..+ T Consensus 19 ~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~-~N~~~e~~gg~tVkIp~i~~-~gl~DY~R~~g~~~g~vt~~~~t 96 (319) T protein:vir:94 19 FANKSVEPGQTLLKNKHVGILERVTAVNAYSTPAL-ISNDAIFMEGRSFTVMKGDT-TELKDYKRNATNEFDHPKIEETT 96 (319) T ss_pred hhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhcc-cCcceEeccCcEEEEeeecc-cccccccCCCCcccCCcccceeE Confidence 56655555666777889999877766655443332 23446667899999999986 56999999889999999999999 Q ss_pred Eeehh-hhcchhccHHHHhccCccH--HHHHHHHHHHHHHHHHHHHHHHHhcccccc----ccCcccCHHHHHHHHHHHh Q lcl|Aclame:pro 81 AKVRK-IGKGTELTDEAVLSGFGDP--QGEAVRQHGLAIANKVDNDVLEALKGATLT----VEADITKLDGLQTAIDKFN 153 (274) Q Consensus 81 ~~~~~-~~~~~~is~e~~~~s~~d~--~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~----~~~~~~~~d~iv~a~~~l~ 153 (274) +++.+ .+..|.+.+.+..++...+ ...+.+.+...++..+|+..++.+.+.... ..+....|+.|.++...|. T Consensus 97 ~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~~~~~~~t~~n~y~~i~~a~~~Ld 176 (319) T protein:vir:94 97 YFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTVGTGSDAQYDAVLDVSVELD 176 (319) T ss_pred EEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcccccccccCHHHHHHHHHHHHHHHH Confidence 99964 7788999999888876554 456677788888889999888776543322 2233445889999999998 Q ss_pred hcCC-CccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCC--CCcceEEEEcCCeEEEEeccC Q lcl|Aclame:pro 154 DEDL-EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNK--LNKGEALLAKKGAVKLITKRD 230 (274) Q Consensus 154 ~~~~-~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~--~p~~~~~l~~~~a~~~~~~~~ 230 (274) ++++ .+++++|+|..+..|+++. .|......++..+.+|.++++.|++|+..++ +..-..++++++++.+..+.. 
T Consensus 177 e~~VP~~Rvl~Vtp~~~~~L~~~~--~f~~~~~~~~~~~~~g~Vg~idG~~Vi~vps~~~k~in~i~~h~~A~~~~~k~~ 254 (319) T protein:vir:94 177 EIKAPENRVLFVSPTFYKGIKKFV--IALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASPIQAD 254 (319) T ss_pred hcCCCCCcEEEeCHHHHHHHHhhh--hhhccccccccceeeeeceeecCeEEEEecccccccceEEEEcCCeeeeeeeee Confidence 8764 6799999999999998764 4555555556678999999999999998643 334456677888887766544 Q ss_pred ceeeecc-ccccCccEEEEEEEEEEEEEcCc--ceEEEEeCCCcccC Q lcl|Aclame:pro 231 FFLEKDR-DASRKSTALYSDKHYVAYLYDES--KVVKITKGAGDEVM 274 (274) Q Consensus 231 ~~ve~~r-~~~~~~~~i~~~~~~~~~v~~~~--avv~l~~~aa~~~~ 274 (274) .++..+ .+.++.+.++++.+||+.|++|+ +|.....+.+..=- T Consensus 255 -~~~~~~p~~~~~a~~v~gr~y~d~~V~~~k~~~Iy~~~~~~~~~~~ 300 (319) T protein:vir:94 255 -LAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKR 300 (319) T ss_pred -eeeccCCCccccceeeeeeeeeeeEEeccccceEEEeecCCcccCC Confidence 344433 46677899999999999999988 44332222211111 No 155 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=99.87 E-value=1.2e-23 Score=146.18 Aligned_cols=264 Identities=12% Similarity=0.063 Sum_probs=173.5 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhccccccccccc---ccCCCEEEEEeecCCCCccccc--CCCcccccccc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLV---GQPGDTLTFPAFTYSGDAQVIA--EGEKIPVDQIG 75 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~---~~~G~~v~ip~~~~~~~a~~~~--eg~~~~~~~~~ 75 (274) |||+.-+ ++|+.|+..+++.|++++++.++++++.+-+ ++.||+|+||+... ..+..+. 
++..+..++++ T Consensus 1 MaN~llT----~~p~iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~~-~~~~d~~~~~~~~~~~~dl~ 75 (423) T protein:vir:10 1 MPNNLDS----NVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQ-FSSLRTPTGDISGQNKNNLI 75 (423) T ss_pred Cccchhh----hhHHHHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCCc-eeeeccCCccccccccCccc Confidence 8876533 5899999999999999999999998876432 45799999998764 3565555 34567889999 Q ss_pred cceeEEeehhh-hcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccc----cCcccCHHHHHHHHH Q lcl|Aclame:pro 76 TSKREAKVRKI-GKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTV----EADITKLDGLQTAID 150 (274) Q Consensus 76 ~~~~~~~~~~~-~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~----~~~~~~~d~iv~a~~ 150 (274) ..++++++.+. ...|+++++++.++..++ +.+.++.+++++.++|+.+++...+..... ......|+.++++.. T Consensus 76 e~~v~l~id~~k~va~~v~d~E~~~~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~~~~~~gt~~t~~~a~~~i~~a~~ 154 (423) T protein:vir:10 76 SGKATGRVGNYITVAVEYQQLEEAIKLNQL-EEILAPVRQRIVTDLETELAHFMMNNGALSLGSPNTPITKWSDVAQTAS 154 (423) T ss_pred cceeEEEeeceeeeeeeechHHHhcChhhH-HHHHHHHHHHHHHHHHHHHHHHHhhccccccccCCcccchHHHHHHHHH Confidence 99999999875 457999999988777777 567778889999999999998765543221 122246899999999 Q ss_pred HHhhcCC--CccEEEEcHHHHHHHHhhhcccccccccccccccccccc-chhcceeeEEcCCCCcceEEEEcCCeEEE-- Q lcl|Aclame:pro 151 KFNDEDL--EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAF-GEALGAVIVRSNKLNKGEALLAKKGAVKL-- 225 (274) Q Consensus 151 ~l~~~~~--~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~-~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~-- 225 (274) .|++.+. .+|++|++|..+..|+++.. .+......+...+++|++ |+++|+.|++|+++|..+...++..+... 
T Consensus 155 ~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~-~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnip~~T~gt~~~t~~~~~~ 233 (423) T protein:vir:10 155 FLKDLGVNEGENYAVMDPWSAQRLADAQT-GLHASDQLVRTAWENAQIPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQ 233 (423) T ss_pred HHHhccCCcCCCEEEeChHHHHHHhcccc-ceecccccchhhhhhccceeeecceEEEEeCCCccccccccccceeeeec Confidence 9998765 67999999999999987542 233344555677899987 89999999999999987766555432110 Q ss_pred -----EeccC---ceee------eccccccCccEEEEEEEEEEEEE--------------cCcceEEEEeC-----CCcc Q lcl|Aclame:pro 226 -----ITKRD---FFLE------KDRDASRKSTALYSDKHYVAYLY--------------DESKVVKITKG-----AGDE 272 (274) Q Consensus 226 -----~~~~~---~~ve------~~r~~~~~~~~i~~~~~~~~~v~--------------~~~avv~l~~~-----aa~~ 272 (274) ....+ ..+. +......-.|.+ ..-|...+ .+..++++.-. +++. T Consensus 234 ~~v~~~a~~~a~~~~~~~~~~~~~~~~~l~~GD~~---t~aGv~~v~~~tk~~~~~~~t~~~~~~~v~a~~~~~~~g~~t 310 (423) T protein:vir:10 234 PTVTYNAVKDSYQFTVTLTGATASVTGFLKAGDQV---KFTNTYWLQQQTKQALYNGATPISFTATVTADANSDSGGDVT 310 (423) T ss_pred ceeccccccccceeeeeeeeccccccCceeecceE---EecceeeecccccccccccccCcceEEEEEeeeeeccCCcee Confidence 00000 0110 000111111111 11111111 11111111100 0111 Q ss_pred cC Q lcl|Aclame:pro 273 VM 274 (274) Q Consensus 273 ~~ 274 (274) |- T Consensus 311 v~ 312 (423) T protein:vir:10 311 VT 312 (423) T ss_pred ee Confidence 11 No 156 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=99.84 E-value=2.2e-22 Score=139.37 Aligned_cols=267 Identities=14% Similarity=0.082 Sum_probs=189.2 Q ss_pred CC-----------------ccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccc Q lcl|Aclame:pro 1 MA-----------------QGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVI 63 (274) Q Consensus 1 ma-----------------~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~ 63 (274) || 
+..+..+...+|..+...+++.+.+.+.+.+.+++.. ......+||.++..+...|. T Consensus 1 ~~~k~~~~~l~~~~~~~~~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~----v~~~~~~i~~~~~~~~~~~~ 76 (321) T protein:vir:31 1 MASRTINNDLSRITEKNALTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTET----VGAKKTRIPTLNIGERHRRP 76 (321) T ss_pred CchHHHHHHHHHHHHhccccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeee----ccCcceeeeeeccCCccccc Confidence 21 1112223356777788889899998888888776532 23344678888765666676 Q ss_pred c-CC-CcccccccccceeEEeehhhhcchhccHHHHhcc--CccHHHHHHHHHHHHHHHHHHHHHHHHhccccc------ Q lcl|Aclame:pro 64 A-EG-EKIPVDQIGTSKREAKVRKIGKGTELTDEAVLSG--FGDPQGEAVRQHGLAIANKVDNDVLEALKGATL------ 133 (274) Q Consensus 64 ~-eg-~~~~~~~~~~~~~~~~~~~~~~~~~is~e~~~~s--~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~------ 133 (274) + |+ ...+.++++++++++.++++...+.+|++.+.++ .+|+++++.+.+++++++.++..++.+-..+.. T Consensus 77 ~~e~~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~~~~~~n 156 (321) T protein:vir:31 77 QDEGEWNENESDVSTGTIDISTEKATVAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAANGDEDAEDSFENQN 156 (321) T ss_pred ccccccccccccceeeeeeeeeEEEEeehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheeeccccCCCcccccc Confidence 5 33 3456678999999999999999999999999886 369999999999999999999988854322111 Q ss_pred --------------cccCcccCHHHHHHHHHHHhhcCC--CccEEEEcHHHHHHHHhhhccccccccccccccccccccc Q lcl|Aclame:pro 134 --------------TVEADITKLDGLQTAIDKFNDEDL--EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFG 197 (274) Q Consensus 134 --------------~~~~~~~~~d~iv~a~~~l~~~~~--~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~ 197 (274) ...++.+++|.+.++...|...+. ...+|+||++++..+++..... .+..++..+..|... 
T Consensus 157 ~G~l~~a~~~~~~~~~~~~~~~~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~l~~~---~~~~~~~~l~~~~~~ 233 (321) T protein:vir:31 157 DGFITVAEGDVETIDAADDILDNDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYHYTLTDR---DTPLGDNVIMGEADV 233 (321) T ss_pred hhhhhhhccccccccccccccCHHHHHHHHHhccHhHhcCCCeEEEechHHHHHHHHHHhcC---CCccccchhhccccc Confidence 112344678999999999987664 3347999999987765422111 223455567777778 Q ss_pred hhcceeeEEcCCCCcceEEEEcCCeEEEEeccCceeeecccccc---CccEEE--EEEEEEEEEEcCcceEEEEe-CCCc Q lcl|Aclame:pro 198 EALGAVIVRSNKLNKGEALLAKKGAVKLITKRDFFLEKDRDASR---KSTALY--SDKHYVAYLYDESKVVKITK-GAGD 271 (274) Q Consensus 198 ~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~~~~~~~ve~~r~~~~---~~~~i~--~~~~~~~~v~~~~avv~l~~-~aa~ 271 (274) +|+|+||+.++++|++.+++.+...+.+...+++.++..++... ....++ .+..+|+.|-++++++.++. .-|= T Consensus 234 tl~G~pvv~~~~mP~~~il~t~~~nl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ve~~~a~a~~~~i~~~~ 313 (321) T protein:vir:31 234 NPFSFPIIGSGLWPDDKAMFTDPQNLIYALYRDLEIDVLTESDKVSERDLHARYFMRGDDDFAIENTEAVVLAEGLGDPL 313 (321) T ss_pred cccceeEEEcCCCCCCcEEEeccccEEEEEeeccEEEEeecCccccccceeeEeeeeeecceeEeccccEEEEecCCcch Confidence 99999999999999999999888887776777777766555432 234444 34467888899999998883 2221 Q ss_pred ccC Q lcl|Aclame:pro 272 EVM 274 (274) Q Consensus 272 ~~~ 274 (274) +-. 
T Consensus 314 ~~~ 316 (321) T protein:vir:31 314 EHL 316 (321) T ss_pred hcc Confidence 111 No 157 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=99.84 E-value=4.4e-23 Score=143.16 Aligned_cols=270 Identities=12% Similarity=0.057 Sum_probs=197.9 Q ss_pred CCccccch---------hhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccc Q lcl|Aclame:pro 1 MAQGTTKV---------SNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPV 71 (274) Q Consensus 1 ma~~~T~~---------~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~ 71 (274) |++...-+ ...+.=|+|+.+|...+...+++.++..+.. + .+|++++||+.+.. +++.+.+|+++.. T Consensus 1 Ms~~n~~t~p~~~gsg~~~aL~Le~f~GeV~taF~~~si~~~~~~vRt-I--~~gkS~qf~~lG~s-~a~y~~pG~~ldg 76 (400) T protein:vir:10 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQT-V--TGTNTVSNKYLGET-ELQVLAPGQSPAA 76 (400) T ss_pred CCCCccccccccccccchhhhHHhHhcchHHHHHHHHhhhcccceeee-e--cccceEEEEEeeee-EEeeecCCCCcCC Confidence 77532221 1124447899999999999999988886642 3 56899999999875 7999999999988 Q ss_pred cccccceeEEeehhh-hcchhccHHHHhccCcc-HHHHHHHHHHHHHHHHHHHHHHHHhcccc----------------- Q lcl|Aclame:pro 72 DQIGTSKREAKVRKI-GKGTELTDEAVLSGFGD-PQGEAVRQHGLAIANKVDNDVLEALKGAT----------------- 132 (274) Q Consensus 72 ~~~~~~~~~~~~~~~-~~~~~is~e~~~~s~~d-~~~~~~~~~a~~~a~~~d~~~i~~~~~a~----------------- 132 (274) +.+..++..+++..+ .....|.+.+..++.+| +.+.+.+++++++++..|+.++..+.-+. 
T Consensus 77 ~~~~~dk~~ItIDtLL~a~~~V~dlDd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~~~g~~~g 156 (400) T protein:vir:10 77 TSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTNPRVKGHG 156 (400) T ss_pred CCcccCcEEEEeCceeeecchhhhHHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCccccc Confidence 889999999999875 44578999999999999 89999999999999999998875442111 Q ss_pred --cccc----CcccCHH----HHHHHHHHHhhcCC-CccE-EEEcHHHHHHHHhhhccccccccccccccccccccchhc Q lcl|Aclame:pro 133 --LTVE----ADITKLD----GLQTAIDKFNDEDL-EPMV-LFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEAL 200 (274) Q Consensus 133 --~~~~----~~~~~~d----~iv~a~~~l~~~~~-~~~~-~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~ 200 (274) ..+. ...++.+ .+.+|...|.+.++ ..++ +++.|..|..|+..+..-.......+++....|.+..+. T Consensus 157 ~s~~v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP~~d~vvl~pp~~Ys~Ll~~dkLvnrdf~~s~~g~~~~g~v~~v~ 236 (400) T protein:vir:10 157 FSVNVEVNEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKSYTISQSGATIQGFVLSSY 236 (400) T ss_pred cceeecccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhCCcccchhccccCCCccccceEEEEe Confidence 0010 1112333 34566667766554 2244 555555665665432111111111223446778888999 Q ss_pred ceeeEEcCCCCcc---------------------------eEEEEcCCeEEEEeccCceeeeccccccCccEEEEEEEEE Q lcl|Aclame:pro 201 GAVIVRSNKLNKG---------------------------EALLAKKGAVKLITKRDFFLEKDRDASRKSTALYSDKHYV 253 (274) Q Consensus 201 G~~Vv~s~~~p~~---------------------------~~~l~~~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~ 253 (274) |+||+.|+++|.+ ..++|+++|++...-.+++.+.+|+..++.+.|..+..|| T Consensus 237 Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~tvk~~~lt~~~~~d~r~~~~~id~~~a~G 316 (400) T protein:vir:10 237 NCPVIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALLVGRSIDVIGDIFYEKKEKTYYIDTFMSEG 316 (400) T ss_pred ceEEEeeCcCCcccCcccccccccCCCCccCCccccccceeEEEEehhheEEEEeeccccccccchhhHHHHHHHHHHhC Confidence 9999999999831 1367899999999999999999999999999999999999 Q ss_pred EEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 254 AYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 
254 ~~v~~~~avv~l~~~aa~~~~ 274 (274) .++.+|+++..++.+--..-. T Consensus 317 ~g~~RPeaa~vv~~~~~~~~~ 337 (400) T protein:vir:10 317 AIPDRWEAVSVVTTKRQSTGA 337 (400) T ss_pred CcccchhheEEEEecCCcccc Confidence 999999999998876554433 No 158 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=99.84 E-value=3.2e-23 Score=143.93 Aligned_cols=270 Identities=12% Similarity=0.074 Sum_probs=196.7 Q ss_pred CCccccchh---------hccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccc Q lcl|Aclame:pro 1 MAQGTTKVS---------NLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPV 71 (274) Q Consensus 1 ma~~~T~~~---------~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~ 71 (274) |++..+-+- ..+.=|+|+.+|...+....++.++..+.. + .+|++++||+.+.. +++.+.+|+++.. T Consensus 1 Ms~~n~~t~~~~~~sg~~~al~Le~f~GeV~taF~~~si~~~~~~vRt-i--~~gkS~qf~~~G~s-~~~~~~pG~~ld~ 76 (401) T protein:vir:70 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQT-V--TGTNTVSNKYLGET-ELQVLAPGQSPAA 76 (401) T ss_pred CCCCccccccccccccchhHhHHhHhcchHHHHHHHHhhhcccceeee-e--cccceEEEEEeeee-EeeeecCCCCcCC Confidence 775332221 124447899999999999999988886642 3 56899999999875 7899999999988 Q ss_pred cccccceeEEeehhhh-cchhccHHHHhccCcc-HHHHHHHHHHHHHHHHHHHHHHHHhccccc---------------- Q lcl|Aclame:pro 72 DQIGTSKREAKVRKIG-KGTELTDEAVLSGFGD-PQGEAVRQHGLAIANKVDNDVLEALKGATL---------------- 133 (274) Q Consensus 72 ~~~~~~~~~~~~~~~~-~~~~is~e~~~~s~~d-~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~---------------- 133 (274) +.+..++..+++..+- ..+.|.+.+..++.+| +.+.+.+++++++++..|+.++..+.-+.. 
T Consensus 77 ~~~~~dK~~ItID~lL~a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~~p~~~~~G 156 (401) T protein:vir:70 77 TSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRTNPRVKGHG 156 (401) T ss_pred CCcccccEEEEeCceeehhhhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCcCCCc Confidence 8999999999998854 4588999999999999 899999999999999999988765532110 Q ss_pred ---cccC----cccC----HHHHHHHHHHHhhcCC-CccEEEEcHH-HHHHHHhhhccccccccccccccccccccchhc Q lcl|Aclame:pro 134 ---TVEA----DITK----LDGLQTAIDKFNDEDL-EPMVLFVNPL-DAGGLRTSASDNFTRPTQLGDNIIVKGAFGEAL 200 (274) Q Consensus 134 ---~~~~----~~~~----~d~iv~a~~~l~~~~~-~~~~~v~~p~-~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~ 200 (274) .... ..++ .+.|.+|...|.+.+. ..+++++.|. .|..|+..+..-.......+.+...+|.+..+. T Consensus 157 ~~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~~r~vvl~pp~~Ys~Ll~~d~L~nrd~~~s~~g~~~~G~v~~va 236 (401) T protein:vir:70 157 FSINVEVAEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKTYTISQSGATIQGFTLSSY 236 (401) T ss_pred eEEeccccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhcCcccchhhccccCCccccceEEEEe Confidence 0000 1122 3456677777877665 3355555555 454555432111111112234557888899999 Q ss_pred ceeeEEcCCCCcce---------------------------EEEEcCCeEEEEeccCceeeeccccccCccEEEEEEEEE Q lcl|Aclame:pro 201 GAVIVRSNKLNKGE---------------------------ALLAKKGAVKLITKRDFFLEKDRDASRKSTALYSDKHYV 253 (274) Q Consensus 201 G~~Vv~s~~~p~~~---------------------------~~l~~~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~ 253 (274) |+||+.|+++|.+. 
.++|+++|++...-.+++.+.+++..++.+.|..+..|| T Consensus 237 Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~~lt~~~~~d~r~~~~~id~~~a~g 316 (401) T protein:vir:70 237 NCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSIDVTGDIFYEKKEKTYYIDTFMAEG 316 (401) T ss_pred ceEEEeeccccccccccccccccccCCCccCCCCccccceeEEEEehhheEEEEeeccccchhhhhhhhHHHHHHHHHhC Confidence 99999999998411 367899999998999999999999999999999999999 Q ss_pred EEEEcCcceEEEEeCCC----c--ccC Q lcl|Aclame:pro 254 AYLYDESKVVKITKGAG----D--EVM 274 (274) Q Consensus 254 ~~v~~~~avv~l~~~aa----~--~~~ 274 (274) .++.+|+++..++.+-- . ..- T Consensus 317 ~g~~RPeaa~vv~~k~~~~~~~~~~~~ 343 (401) T protein:vir:70 317 AIPDRWEAVSVVTTKRNTTTGAVEGTD 343 (401) T ss_pred CcccchhheEEEeecCcccccccccCC Confidence 99999999987743322 1 111 No 159 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=99.82 E-value=5.4e-22 Score=137.19 Aligned_cols=263 Identities=10% Similarity=0.001 Sum_probs=174.9 Q ss_pred CCc--cccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccce Q lcl|Aclame:pro 1 MAQ--GTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSK 78 (274) Q Consensus 1 ma~--~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~ 78 (274) +.. ..........|..+...+...+.....+.+.++.. 
+.....+|.......+.|+.||...|+++++|++ T Consensus 237 ~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~------~i~~~~~~~~~~~~~a~~~~eG~~kp~s~~tf~~ 310 (517) T protein:vir:97 237 WTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHE------NLPTLVVGGDNALTQGTGHTTGTDKTESNITLQT 310 (517) T ss_pred eeeecccccccccccchHHHHHHHHhhhhhccceeeeeec------cccceeeecccccceeeeeecCCcccccccceee Confidence 110 11112345678777777776666654444444331 1123566665555567889999999999999999 Q ss_pred eEEeehhhhcchhccHHHHhccCcc----HHHHHHHHHHHHHHHHHHHHHHHHhccccc-------c---ccCcccCHHH Q lcl|Aclame:pro 79 REAKVRKIGKGTELTDEAVLSGFGD----PQGEAVRQHGLAIANKVDNDVLEALKGATL-------T---VEADITKLDG 144 (274) Q Consensus 79 ~~~~~~~~~~~~~is~e~~~~s~~d----~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~-------~---~~~~~~~~d~ 144 (274) +++.++++++.+++|++++.++..| +++++.+++++.++++++++++..-.++.. + ........+. T Consensus 311 ~~~~~~~ia~~~~~S~qll~Ds~~dd~~~l~s~i~~~l~~~l~~~ee~a~l~GdGtg~~~~gi~~~a~~~~~~~~~~~~~ 390 (517) T protein:vir:97 311 RVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTN 390 (517) T ss_pred EEeeHhhhhhhhhhhHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccccccccccccccccccccch Confidence 9999999999999999999998887 899999999999999999999965332211 0 0111122345 Q ss_pred HHHHHHHHhhcC--CCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEEcCCe Q lcl|Aclame:pro 145 LQTAIDKFNDED--LEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGA 222 (274) Q Consensus 145 iv~a~~~l~~~~--~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~a 222 (274) +.|.+..+..+. ..+..|+|||.+|..|++.++.+.. +...+.+.++...+++|+.-+. +.++.+...+..... 
T Consensus 391 ~~d~i~~l~~a~~~a~~a~~vmn~~t~~~I~klKD~~G~---Yl~~~~~~~~~~~~l~G~~~~~-~~~~~~~~~~~~~~~ 466 (517) T protein:vir:97 391 IQELLEKLSVATPKAADSTLVIHRNDLAAIRFLKDKNGN---YVFPVGVSNQTIATHFGFNRLV-QSVAVDEKTAVSLSG 466 (517) T ss_pred HHHHHHHHHHHhhhccCCEEEECHHHHHHHHHhhcCCCC---eeccCcCCcccccccCCccccc-cccccCceeEeeccc Confidence 555555555443 2467899999999999877653322 2233445666777888853332 234444433333333 Q ss_pred EEEEeccCceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeCCCccc Q lcl|Aclame:pro 223 VKLITKRDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAGDEV 273 (274) Q Consensus 223 ~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~ 273 (274) +.++....+....+.+..+.++.+....|.++.|..|++++..+..-|.+= T Consensus 467 y~i~~~~g~~~~~~fd~~~n~~~f~~~~~~~g~i~~~~r~a~~~~~p~~~~ 517 (517) T protein:vir:97 467 YVTNGSRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) T ss_pred cEEEeecceeeeeeeecccCceeEeeeeeeccccccccceEEEEEcCCCCC Confidence 334444444444444445667888888999999999999999999888888 No 160 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=99.80 E-value=1e-20 Score=130.19 Aligned_cols=260 Identities=14% Similarity=0.095 Sum_probs=161.7 Q ss_pred CCccccchhhc--cchHHHHHHHHHHHHHhhhhcccc----cccccccccCCCEEEEEeecCC-C---CcccccCCCccc Q lcl|Aclame:pro 1 MAQGTTKVSNL--IVPEVLAPMMQAELDKKLRFAQFA----DIDSTLVGQPGDTLTFPAFTYS-G---DAQVIAEGEKIP 70 (274) Q Consensus 1 ma~~~T~~~~~--~iPe~~~~~v~~~~~~~~~~~~l~----~~~~~~~~~~G~~v~ip~~~~~-~---~a~~~~eg~~~~ 70 (274) || .+|+ |-|+.+..++ |.+.+.+-..+.. .... -....|+.+++|.|.+. 
+ +.+.+.+...++ T Consensus 1 m~-----lsD~~vfN~~~~~a~~-e~~~q~~~~fn~as~gai~l~-~~~~~Gd~~~~pf~~~l~g~~~~~~~~~~~~~vt 73 (325) T protein:vir:95 1 MA-----LSDLAVYSEYAYSAFS-ETLRQQVDLFNTATGGAIMLQ-SAAHQGDFSDVAFFAKVTGGLVRRRNAYGSGTVA 73 (325) T ss_pred Cc-----hhhhhhhhhhhhhhhh-hhhhhhHhhhhhcccceeEec-cccccCceeeccccccccccccccccCCCCceec Confidence 66 3332 5566555444 4444332222211 1111 12235899999999864 3 335677778899 Q ss_pred ccccccceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHH----HHHHHHhccccc-------c----- Q lcl|Aclame:pro 71 VDQIGTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVD----NDVLEALKGATL-------T----- 134 (274) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d----~~~i~~~~~a~~-------~----- 134 (274) ..+++..+.......+++.+..+++.......+.++.+.++++..+++... +.+++.+.++.. . T Consensus 74 ~~kitt~~~~av~~~r~~g~~~~d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~~a~~~~~~~v~dis~~~ 153 (325) T protein:vir:95 74 EKVLKHLVDTSVKVAAGTPPVRLDPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLNVGLGSVYSALSQVSDVVYDATANT 153 (325) T ss_pred cceeccccceeeEEecccCcccccHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccceeeeeccc Confidence 999998887666666777777777777766667776666666555555544 445544432210 0 Q ss_pred -ccCcccCHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCC-- Q lcl|Aclame:pro 135 -VEADITKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLN-- 211 (274) Q Consensus 135 -~~~~~~~~d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p-- 211 (274) .....++++.+++|.++|+++......|+||+.+|..|++++..++......... . 
.+++++|++|+++|.+| T Consensus 154 ~~~~~~~s~~~l~~A~~klGD~~~~l~~~~MHS~v~~~L~~~~L~~~~~~~~~~g~--~--~i~t~~G~~VIVdD~~p~~ 229 (325) T protein:vir:95 154 DAADKLPTWNNLNNGQAKFGDQSSQIAAWIMHSTPMHKLYGSNLTNGERLFTYGTV--N--VVRDPFGKLLVMTDSPNLF 229 (325) T ss_pred CcccccccHHHHHHHHHHhcccccceeEEEEchHHHHHHHHhhccccccccccCCc--c--cccccCCcEEEEeCCCCCC Confidence 1112357899999999999999999999999999999999887665443322211 1 35689999999999998 Q ss_pred ------cceEEEEcCCeEEEEeccCceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 212 ------KGEALLAKKGAVKLITKRDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 212 ------~~~~~l~~~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) ++++|+++++|+++....+......+.........+.+.+| ..+++|.++.- +++. ..+- T Consensus 230 ~~g~~~~ytty~lg~GAi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-tf~lhp~G~sw-~~s~-~g~s 295 (325) T protein:vir:95 230 AAGTPNVYHILGLVPGGVLIGQNNDFDANEETKNGDENIIRTYQAEW-SYNIGVKGFAW-DKAN-GGKS 295 (325) T ss_pred CccCceeEEEEEEecCeEEecCCCCccccccccCcccceeeeeeeee-eEEeecceeee-eccc-ccCC Confidence 35579999999999887776544433221111111112232 35677887766 3221 1111 No 161 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=99.78 E-value=2.1e-20 Score=128.49 Aligned_cols=266 Identities=20% Similarity=0.183 Sum_probs=170.0 Q ss_pred CCc--cccchhhc--cchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCccccccccc Q lcl|Aclame:pro 1 MAQ--GTTKVSNL--IVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGT 76 (274) Q Consensus 1 ma~--~~T~~~~~--~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~ 76 (274) |+- +.|...++ .+.-.|.+++-+.+.+.+..-++.+...--.+...++.++|.+...+++++++||+.||.++++. 
T Consensus 1 M~~e~nl~~~~dL~~a~siDF~~~f~~~i~~L~~~LGv~r~~pla~Gt~iktyK~~~~~y~gda~dVaEGe~Iplskvt~ 80 (303) T protein:vir:10 1 MSAENNLINVEALGKAKSIDFANKLGVGLNKLFEALAIQNKIPMNVGSALKQYRFKVEDSEKPNGDVAEGDVIPLTKVTR 80 (303) T ss_pred CCCCcCCcchhhcccceeehhhhhhhhhHHHHHHHhhhhccccccCCceeeeeeeeceeeccccccccCCcccchhhhee Confidence 874 33333333 22223444444433333322233222211122223334556667778999999999999999997 Q ss_pred c---eeEEeehhhhcchhccHHHH-hccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCc---ccCHHHHHHHH Q lcl|Aclame:pro 77 S---KREAKVRKIGKGTELTDEAV-LSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEAD---ITKLDGLQTAI 149 (274) Q Consensus 77 ~---~~~~~~~~~~~~~~is~e~~-~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~---~~~~d~iv~a~ 149 (274) . ..+++++|+++.+ |+|.+ +....+.....-++|...+++++|+.+++.+++++.+.... ..+++.|-+|+ T Consensus 81 ~~~~t~~~~~kK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~lktaT~t~~~t~~t~~s~~glq~Al 158 (303) T protein:vir:10 81 EQVDITELQFAKYRKST--SAEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFETLKSAIENGKRTNKTKLSAENLQGAL 158 (303) T ss_pred eecceEEEEeecccccc--cHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhhcccccccccceeecHHHHHHHH Confidence 5 5788999998866 99998 56667899999999999999999999999999887654332 35688888887 Q ss_pred HHHh------hcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEEcCCeE Q lcl|Aclame:pro 150 DKFN------DEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGAV 223 (274) Q Consensus 150 ~~l~------~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~a~ 223 (274) .... +++....+++|||.+.+.+|+++... 
...+..|.+.+.+ ++|+-|++|+.+|+|+.|......+ T Consensus 159 ~~~~~kl~~~~ed~~~~V~FvNP~Daa~yl~~A~i~-~~~t~fG~n~L~n-----fLG~~II~S~kv~~G~~~~T~~~Ni 232 (303) T protein:vir:10 159 SKGRANLSVLLDDEITPIAFVNPNDTAEYLANGFIN-STGAQFGVNLLTP-----YVGVKIVEFADVPQGEVWMTVAENL 232 (303) T ss_pred HhhhhhccccccccccEEEEEchHHHHHHhhcCCcc-hhhhhhhhhhhhh-----hhcceEEEeccCCCceEEEeeccce Confidence 7653 22334568999999999999876543 3445666666664 9999999999999999998877665 Q ss_pred EEEeccCce-eeeccccccCccEEEEEEE-------------EEEE---EEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 224 KLITKRDFF-LEKDRDASRKSTALYSDKH-------------YVAY---LYDESKVVKITKGAGDEVM 274 (274) Q Consensus 224 ~~~~~~~~~-ve~~r~~~~~~~~i~~~~~-------------~~~~---v~~~~avv~l~~~aa~~~~ 274 (274) .+....+-. +.....-..+.+.+.+..+ +.+- .-.+++||+.+.+++.+-- T Consensus 233 ~~ay~~~~g~l~~~f~~t~D~tglIGv~h~~~~~~~t~eT~~~~~~~lfpE~~dgiv~~ti~~~e~~~ 300 (303) T protein:vir:10 233 NVAYANPRGELSRAFAFATDATGFVGVLHDIQPQRLTSDTIYASAISMFPENIDAVIKVTIKKDEAGE 300 (303) T ss_pred EEEEecCchhhhhhhhhccccccceEEEeccccceeeehhHhHhHHHhcccccceEEEEEEeccccCC Confidence 543221111 1000000112222222221 1111 2267889999987665322 No 162 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=99.74 E-value=1.9e-19 Score=123.23 Aligned_cols=256 Identities=23% Similarity=0.273 Sum_probs=166.5 Q ss_pred CC-------ccccchhhccc--hHHHHHHHHHHHHHhhhhcccccccccccccCCCEE-EEEeecCCCCcccccCCCccc Q lcl|Aclame:pro 1 MA-------QGTTKVSNLIV--PEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTL-TFPAFTYSGDAQVIAEGEKIP 70 (274) Q Consensus 1 ma-------~~~T~~~~~~i--Pe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v-~ip~~~~~~~a~~~~eg~~~~ 70 (274) |- ++.|...++-. .-.|.+.+.+.+.+.+..-++.+. 
.+-..|+++ ++|.|..++++++++||+.|| T Consensus 1 ~~~~~~~~e~nlt~~~dl~~~~siDf~~~f~~~i~~L~~~LGv~r~---~pla~GstIkt~k~~~y~gda~dVaEGe~Ip 77 (296) T protein:vir:98 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRK---ISVSEGMTLKTYAGYDVTLAEGNVPEGEVIP 77 (296) T ss_pred CCCccccCcCCCcchhhhhhhhhhhhHHHHhhhHHHHHHHhhhccc---ccccCCCEEeeccceeeeeccccccCCcccc Confidence 43 23344444312 223444444444443332333222 222348899 557799999999999999999 Q ss_pred ccccccce---eEEeehhhhcchhccHHHH-hccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcccCHHHHH Q lcl|Aclame:pro 71 VDQIGTSK---REAKVRKIGKGTELTDEAV-LSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQ 146 (274) Q Consensus 71 ~~~~~~~~---~~~~~~~~~~~~~is~e~~-~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~~~d~iv 146 (274) .++++... .+++++|+++.+ |+|.+ +....++....-++|...+++++|+.+++.+++++.+..+ +.+.+- T Consensus 78 lskvt~~~~~t~t~~ikK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~LktaT~t~~~---t~~~lQ 152 (296) T protein:vir:98 78 LSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDA---LGAGLQ 152 (296) T ss_pred hhhheeeecceEEEEeecccccc--CHHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHHhcccceeee---chhhHH Confidence 99999864 788899988885 99997 5666789999999999999999999999999888765443 233444 Q ss_pred HHH--------HHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEE Q lcl|Aclame:pro 147 TAI--------DKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLA 218 (274) Q Consensus 147 ~a~--------~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~ 218 (274) +|+ ..+++.+....+++|||.+.+.+|+++.. ...+..|...+. .++|..|++|+.+|+|+.|.. 
T Consensus 153 ~Ala~~~~~l~~~feded~~~~V~FVnP~D~a~ylg~a~i--t~qt~fG~tyl~-----nfLG~~II~S~kV~~G~~~~T 225 (296) T protein:vir:98 153 GALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGI--TTQTAFGLTYLV-----DFTGTVIISTNDVTKGEIWAT 225 (296) T ss_pred HHHHHHhhhhhhhccccCCCceEEEEehHHHHHHhcCCcc--chhheechhhhh-----hccccEEEEcCcCCCceEEEe Confidence 433 56666666788999999999999987632 222222322222 299999999999999999998 Q ss_pred cCCeEEEEeccC--ceeeeccccccCccEEEEEEE-------------EEEE---EEcCcceEEEEeCCCc Q lcl|Aclame:pro 219 KKGAVKLITKRD--FFLEKDRDASRKSTALYSDKH-------------YVAY---LYDESKVVKITKGAGD 271 (274) Q Consensus 219 ~~~a~~~~~~~~--~~ve~~r~~~~~~~~i~~~~~-------------~~~~---v~~~~avv~l~~~aa~ 271 (274) ....+.+..-.. -.+-....-..+.+.+.+..+ +.+. .-.+++||+.+.++|- T Consensus 226 ~~~Ni~~ay~~~~~~~l~~~f~~~~d~tglIGv~h~~~~~~~t~eT~~~~~~~lfpE~~dgiv~~tI~~~~ 296 (296) T protein:vir:98 226 VPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) T ss_pred eecceEEEeecccccchhhhhccccccccceEEEeccccceeeehhHhHhHHHhcccccceEEEEEecCCC Confidence 777655432211 111111111122333333222 1111 2267899999998876 No 163 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=99.70 E-value=4.9e-18 Score=115.47 Aligned_cols=259 Identities=8% Similarity=0.084 Sum_probs=172.5 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccc--cccccCCCEEEEEeecCCCCcccccCCC-cccccccccc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDS--TLVGQPGDTLTFPAFTYSGDAQVIAEGE-KIPVDQIGTS 77 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~--~~~~~~G~~v~ip~~~~~~~a~~~~eg~-~~~~~~~~~~ 77 (274) ||..- + ++.|+..+.+.+.+.++...|....+ .....+|++|+||+++. .+..+|..+. 
.....+++.+ T Consensus 1 MA~~n------~-a~~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~~~gg~tVkI~~i~~-~gl~DY~R~~~g~~~g~~~~~ 72 (299) T protein:vir:79 1 MAALN------Y-AKEYSNVLAQAYPYTLNFGDLYATPNNGRYRWTGSKTIEIPTIST-TGRVDSNRDTIAVAQRNYDNA 72 (299) T ss_pred Cccch------h-HHHHHHHHHHHHHhhceeeeeccCcccceeeecCCCEEEEecccc-ccccccccCCCcccccccCcc Confidence 88311 2 48999999999999998877764422 23335689999999976 5688998765 5666678888 Q ss_pred eeEEeehh-hhcchhccHHHHhccCc--cHHHHHHHHHHHHHHHHHHHHHHHHhccccc----cc----cCcccCHHHHH Q lcl|Aclame:pro 78 KREAKVRK-IGKGTELTDEAVLSGFG--DPQGEAVRQHGLAIANKVDNDVLEALKGATL----TV----EADITKLDGLQ 146 (274) Q Consensus 78 ~~~~~~~~-~~~~~~is~e~~~~s~~--d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~----~~----~~~~~~~d~iv 146 (274) ..++++.+ .+..|.+.+.+..++.. .....+.+.....++..+|+..++.+..... .. .+...-|+.|. T Consensus 73 ~~t~~ldqdr~~~f~vD~~Dvdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl~~~a~~~g~~~~~~~~T~~n~y~~i~ 152 (299) T protein:vir:79 73 WEPKVLTNQRKWSTLVHPADINQTNYVASIGNITKVYNEEQKFPEMDAYCISKIYADWTALGNTADTTVLTTTNVLEVFD 152 (299) T ss_pred eeEEEeeccccceeccchhhHHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHHHhhhhcCCcccccccCHHHHHHHHH Confidence 88888854 67788898665555433 3444455666667778889887766533221 11 12233478889 Q ss_pred HHHHHHhhcCC--CccEEEEcHHHHHHHHhhhccccccccccc-cccccccccchhcceeeEE--cCCCCc------c-- Q lcl|Aclame:pro 147 TAIDKFNDEDL--EPMVLFVNPLDAGGLRTSASDNFTRPTQLG-DNIIVKGAFGEALGAVIVR--SNKLNK------G-- 213 (274) Q Consensus 147 ~a~~~l~~~~~--~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~-~~~~~~g~~~~i~G~~Vv~--s~~~p~------~-- 213 (274) ++...|.++++ .+++++|+|..+..|+++.. |.+..... .....+|.++.+.|++|+. ++.++. 
| T Consensus 153 ~~~~~lde~~vP~~~rvl~vtp~~~~~L~~~~~--f~k~~~~~~~~~~~~g~Vg~idG~~Ii~Vps~r~~t~~~~~~G~~ 230 (299) T protein:vir:79 153 KLMEKMTEARVPENGRILYVTPVVNTLIKNAKE--IQRTVNIKDAGTSLNRQTTDIDTVKIIKVPSNLMKTAYDFTTGWK 230 (299) T ss_pred HHHHHHHhcCCCCCCeEEEeCHHHHHHHhhchh--hhcccccccccceeeeeeeeecceEEEEechhhcCccceeccCcc Confidence 99999998765 67999999999999887653 33333322 2357899999999999987 444542 1 Q ss_pred --------eEEEEcCCeEEEEeccC-ceeeeccccccCc---cEEEEEEEEEEEEE-cCcceEEEEeCCCcc Q lcl|Aclame:pro 214 --------EALLAKKGAVKLITKRD-FFLEKDRDASRKS---TALYSDKHYVAYLY-DESKVVKITKGAGDE 272 (274) Q Consensus 214 --------~~~l~~~~a~~~~~~~~-~~ve~~r~~~~~~---~~i~~~~~~~~~v~-~~~avv~l~~~aa~~ 272 (274) ..++++++|+.-....+ +.+. .|...+ .....|.++|+-|+ +....+.+..++|-+ T Consensus 231 ~~~~ak~in~ii~~~~a~~~~~K~~~~~~~---~P~~~~~~~~~~~~r~y~d~~v~~nk~~~i~~~~~~a~~ 299 (299) T protein:vir:79 231 VGAGAKQIFMSLVHPSAIITPVSYQFSKLD---EPTAVTEGKYFYFEESFEDVFILNKKADAIQFVVEGAGA 299 (299) T ss_pred ccCcccccceEEEcCCeeeeeEeeeeEEee---cCCCCCccceeeeeeeeeeeeeeccccCeEEEEeeecCC Confidence 13566777765333222 2221 222222 25566778888877 555666777777777 No 164 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=99.69 E-value=6.7e-19 Score=120.21 Aligned_cols=262 Identities=14% Similarity=0.129 Sum_probs=188.1 Q ss_pred CCc----cccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccc-------cCCCcc Q lcl|Aclame:pro 1 MAQ----GTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVI-------AEGEKI 69 (274) Q Consensus 1 ma~----~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~-------~eg~~~ 69 (274) |++ ..|.-...+||+.|-.-+++.+++.-.+.+++.. ++ ..|.++..|.......++.. 
.||..+ T Consensus 127 ~r~a~~~~~Tgd~~~~i~~~~v~d~i~li~q~r~i~slf~t---LP-~~g~T~eY~v~t~~~tV~~q~~~~kqa~EGd~L 202 (410) T protein:vir:83 127 YARAADHQKTGDLQGVIPDPIVGPVIDFIDSARPLVSTLGT---LP-LNNATFYRPIVSQRPAVGLQGVAGGASDEKTEL 202 (410) T ss_pred HHHhhccCcccccccccchhHhhhHHHHHhhccchhhhhhh---CC-CCCCeeEEeeecccccccccccccccccccccc Confidence 332 3333334467777988888888887666666643 33 34778888776554544332 399999 Q ss_pred cccccccceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccc-cCcccC----HHH Q lcl|Aclame:pro 70 PVDQIGTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTV-EADITK----LDG 144 (274) Q Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~-~~~~~~----~d~ 144 (274) +..+++++..++.++.+|+...+|++.+++|.....+...+.|..+.|+.-++..-+.+..+.... ..+.++ ... T Consensus 203 ~~gKl~~~t~tA~ikTyGGyt~LSRQ~IERs~v~~L~~~lraL~~AYA~atea~vra~L~~t~t~~~a~~~~Tad~~~~~ 282 (410) T protein:vir:83 203 DSQKMVIDRLTVNAKTLGGYVNVSRQAIDFSSPSALDLVVNGLGQQYAIETEALVGAALASTSTGAVGYGNATADNVASA 282 (410) T ss_pred cccceeeeeccceeehhcCcccccceeeecCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccHHHHHHH Confidence 999999999999999999999999999999999999999999999999988887776664333221 111223 346 Q ss_pred HHHHHHHHhhc--CCCccEEEEcHHHHHHHHhhhc-cc--cccccccccccccccccchhcceeeEEcCCCCcceEEEEc Q lcl|Aclame:pro 145 LQTAIDKFNDE--DLEPMVLFVNPLDAGGLRTSAS-DN--FTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAK 219 (274) Q Consensus 145 iv~a~~~l~~~--~~~~~~~v~~p~~~~~L~~~~~-~~--~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~ 219 (274) ++|+..+..++ +....++.++|+++..+.+.-. 
.+ +......+-+.+-.|.-|.++|+||++.+..++|++++++ T Consensus 283 i~da~~~v~da~~~~~~~~i~vS~DVl~~~~~~f~~~~~~~~dt~Gfg~~~lg~gi~G~~~~ipVvm~~~a~AgTA~f~~ 362 (410) T protein:vir:83 283 IWQAAGAVYTAVKGMGRLVIAIAPDVLGDFGPLFAPVNPTNAHSTGFEAGRFGQGVMGSISGIPVVMSAALGSGDAYLFS 362 (410) T ss_pred HHHHHHHHhhhhccceeeeEEechhhhhhccceeeccCCCCcccccccccccccchhhhhcccceEEecCCCcCeeeEec Confidence 67887777776 6777889999999776654310 00 0111111223344677789999999999999999999999 Q ss_pred CCeEEEEecc-CceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeC Q lcl|Aclame:pro 220 KGAVKLITKR-DFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKG 268 (274) Q Consensus 220 ~~a~~~~~~~-~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~ 268 (274) +.|+.+|... ...-.++.++-.-+..+. .||...+.+|.+++-+... T Consensus 363 ~~Ai~~~eS~~gp~qL~d~~i~nLt~~yS--gY~a~a~~~~~gliPv~g~ 410 (410) T protein:vir:83 363 TAAIECFEQRVGTLQVVEPSVFGLQVAYA--GYFSTLVVNEDAIVPLVGS 410 (410) T ss_pred cceeeeeecCCceeEeeCCchhhhhhhhe--eeeeeccccccceeeeccC Confidence 9999988776 323344555544444444 6778899999999999888 No 165 >protein:vir:96792 Length: 315 # NCBI annotation: major capsid protein # Family: family:all:47 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224246;genbank:gi:62362381;genbank:GeneID:3345731 Probab=99.66 E-value=2.8e-17 Score=111.35 Aligned_cols=265 Identities=15% Similarity=0.018 Sum_probs=157.3 Q ss_pred CCccccchhh--ccchHHHHHHHHHHHHHhhhhcc----cccccccccccCCCEEEEEeecCCCCc--ccccCCCccccc Q lcl|Aclame:pro 1 MAQGTTKVSN--LIVPEVLAPMMQAELDKKLRFAQ----FADIDSTLVGQPGDTLTFPAFTYSGDA--QVIAEGEKIPVD 72 (274) Q Consensus 1 ma~~~T~~~~--~~iPe~~~~~v~~~~~~~~~~~~----l~~~~~~~~~~~G~~v~ip~~~~~~~a--~~~~eg~~~~~~ 72 (274) || ||..+| +|-|. +....+|.+++.+...+ .+... .-....||-...|.|...+.. .++....+++.. 
T Consensus 1 ~~--~t~~sdl~vfn~~-~~~a~~e~~~~~~~~Fnaas~Gai~l-~~~~~~GDf~~~~ff~i~~~~~~rnv~~~~~~t~~ 76 (315) T protein:vir:96 1 MA--TTVNSDLVIYNDT-AQTAYLERNMDNLAVFNENSRAAIGL-NSELIEGDLKLRSFYKVGGAIADRDVNSTATVAGT 76 (315) T ss_pred Cc--eeeecceeeehhh-hhhhHHhhhHHHHHHhhhhcCCcccc-cccccccccccccccccccchhhcccCCCccccce Confidence 77 566666 34444 44444455554433322 11110 011234787777877633322 245556668888 Q ss_pred cccccee-EEeehhhhcchhccHHHHhccCccHHHH---HHHHHHHHHHHHHHHHHHHHhcccc-------ccccCcccC Q lcl|Aclame:pro 73 QIGTSKR-EAKVRKIGKGTELTDEAVLSGFGDPQGE---AVRQHGLAIANKVDNDVLEALKGAT-------LTVEADITK 141 (274) Q Consensus 73 ~~~~~~~-~~~~~~~~~~~~is~e~~~~s~~d~~~~---~~~~~a~~~a~~~d~~~i~~~~~a~-------~~~~~~~~~ 141 (274) +++..+- .+++.....-+.++........-|++.. +.++++.++.+.+-+..++.+..+. .+...+..+ T Consensus 77 kit~~~dvaVk~~~~~~~~~~~~~~~a~~g~dp~~~~~~i~~~~~~~~l~~~l~~~l~~~~aai~~~t~~~~~~~~a~~~ 156 (315) T protein:vir:96 77 KIAADEMVSVKVPWKYGPYETTEEAFKRRARSPEEFSMLIGQDMADATMAGWIGYALNALQGAIGSNAGMNVSGELATEG 156 (315) T ss_pred ecccccceeEEEeecCCchhccHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcccccccccccccccC Confidence 8777653 3444322233556665555445566654 5555555555555555554443221 112334578 Q ss_pred HHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEEcCC Q lcl|Aclame:pro 142 LDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKG 221 (274) Q Consensus 142 ~d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~ 221 (274) ...+++|.++|+++......|+||+.+|.+|.+.+..+++.. 
.++..++.+.++++ |+||+++|.+|.++.|.+..+ T Consensus 157 ~~~l~dA~~klGD~~~~l~~~vMHS~v~~~L~~q~L~~~~~~--~~~~~~~~~~~~~l-GkrViVdD~~P~~~~~gl~~G 233 (315) T protein:vir:96 157 KKVLTKGLRTMGDKASSIAIWVMDSTSYFDIVDEAIDNKLYE--EAGVVVYGGTPGTL-GKPVLVTDQCPATKIFGLVAG 233 (315) T ss_pred HHHHHHHHHHhcccccCeeEEEEchHHHHHHHHhhhhhhccc--ccceeEecCcCccc-ccEEEEECCCCcceeeeeecc Confidence 899999999999999999999999999999999766554432 23344555555544 999999999999999999999 Q ss_pred eEEEEeccCceeeeccccccCccEEEEEEEEE-EEEEcCcceEEEEeC--CCcccC Q lcl|Aclame:pro 222 AVKLITKRDFFLEKDRDASRKSTALYSDKHYV-AYLYDESKVVKITKG--AGDEVM 274 (274) Q Consensus 222 a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~-~~v~~~~avv~l~~~--aa~~~~ 274 (274) |+++....++.... .+..+.+.+..+.|.. ...++|.++..-+.+ +||--- T Consensus 234 Ai~~~~~~~~~~~~--~~~~g~e~l~~~~r~e~tf~l~p~G~sw~~~~~~sPt~ae 287 (315) T protein:vir:96 234 AVMITESQAPGMRS--YQIDDQENLAIGFRAEGTANVEVLGYKWKTKTNVNPASAT 287 (315) T ss_pred eeeecCCCcccccc--ccCCCcceeEEEEeeeeEeeeeeeeEEeecCCCcCCChHH Confidence 99998766632111 1222445666655533 356777777663211 111000 No 166 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=99.66 E-value=3.1e-17 Score=111.13 Aligned_cols=255 Identities=13% Similarity=0.094 Sum_probs=176.8 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeE Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~ 80 (274) ||.+.. +.|+..+.+.+.+.++...+...+. ...+|++|+||+++. 
.+..+|..+..+...+++.+..+ T Consensus 1 Main~a--------~~~~~~Ld~~~~~~~~t~~l~~~~~--~~~ggktVkI~~i~~-~gl~DY~R~~g~~~g~v~~~~et 69 (290) T protein:vir:78 1 MAINYV--------DKYGKELDQKLVFGTYTNELETPNL--LWLDAKTFKIQTITT-TGLKAHTRNKGYNEGSASNTNKS 69 (290) T ss_pred CchhHH--------HHHHHHHHHHHHhhheeeeccccce--eeccCCEEEEeeecc-CcccccccCCCcccCccccceee Confidence 886551 6899999999999998888875544 446799999999986 56999999999988888888888 Q ss_pred Eeeh-hhhcchhcc--HHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-------ccCcccCHHHHHHHHH Q lcl|Aclame:pro 81 AKVR-KIGKGTELT--DEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLT-------VEADITKLDGLQTAID 150 (274) Q Consensus 81 ~~~~-~~~~~~~is--~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~-------~~~~~~~~d~iv~a~~ 150 (274) +++. +++..|.+. |.+.......+.....+...+.++-.+|+..++.+.+...+ ..+...-|+.+.++.. T Consensus 70 ~tl~qdR~~~F~vD~~DvDEt~~~~~~~nv~~ef~~~~v~PEiDayr~skla~~a~~~~~~~~~t~t~~n~~~~i~~~~~ 149 (290) T protein:vir:78 70 YTIDFDRDVEFFVDVMDVDETGQALSAANVTKEFNSRHAGPEMDAYRFSKLATAAKTNSNSVAEEITKDNVFTKLKAAIR 149 (290) T ss_pred EEeeccccceeeccccchhHHhhhhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhhhccCcccccccCHHHHHHHHHHHHH Confidence 8885 467778887 66555555678888888999999999999988766443321 1122334778888888 Q ss_pred HHhhcCCCccEEEEcHHHHHHHHhhhcccccccc---ccccccccccccchhcceeeEEcCC---C-----------Ccc Q lcl|Aclame:pro 151 KFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPT---QLGDNIIVKGAFGEALGAVIVRSNK---L-----------NKG 213 (274) Q Consensus 151 ~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~---~~~~~~~~~g~~~~i~G~~Vv~s~~---~-----------p~~ 213 (274) .|.+.+..+++++|+|..+..|.++.. |.+.. +.+.+ ..+|.++.+.|++|+..+. + |.. 
T Consensus 150 ~ldevp~~~rvl~vtp~~~~lL~~~~~--f~r~~~~~~~~~~-~i~~~V~~idG~~ii~vps~~r~~t~~~f~~G~~~~~ 226 (290) T protein:vir:78 150 KVKKYGTQNLVMYVSPDVMAALELSDD--FVRAINVQNIGPS-SIETRITAIDGTRIVEVEAEDRFYDTFDFTDGYKPAA 226 (290) T ss_pred HHHhcCCCCeEEEECHHHHHHHhhChh--hhccccccccccc-cccceeeeecCcEEEEecccchhhhhhhhcccccccC Confidence 898888889999999999998876643 43322 22223 3488999999999997542 1 111 Q ss_pred -----eEEEEcCCeEEEEeccC-cee-eeccccccCccEEEEEEEEEEEEEcCc-ceEEEEeCC Q lcl|Aclame:pro 214 -----EALLAKKGAVKLITKRD-FFL-EKDRDASRKSTALYSDKHYVAYLYDES-KVVKITKGA 269 (274) Q Consensus 214 -----~~~l~~~~a~~~~~~~~-~~v-e~~r~~~~~~~~i~~~~~~~~~v~~~~-avv~l~~~a 269 (274) ..+++++++..-....+ +.+ .++.+.+.+.+.+..|.++|+-|++.. ..+....+- T Consensus 227 ~ak~in~ii~~~~a~i~~~K~~~~~~~~P~~~~~~d~~~~~~r~y~d~~v~~nk~~~i~~~~~~ 290 (290) T protein:vir:78 227 GAKKLNFLLVNKGSVVGGAKHASIYLHAPGSVGQGDGWLYQYRVYHDIFVLDQQKDGVIASTEV 290 (290) T ss_pred CccceeEEEEcCCceeeeeeeeEEEeeCCCCCcCcceeeeeeeeeeeeeeeccccCeeEEEeeC Confidence 12456666654332222 222 223333445578899999999988433 333333222 No 167 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=99.63 E-value=3.2e-18 Score=116.51 Aligned_cols=253 Identities=11% Similarity=-0.005 Sum_probs=144.7 Q ss_pred CCccccc--------------------------hhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEee Q lcl|Aclame:pro 1 MAQGTTK--------------------------VSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAF 54 (274) Q Consensus 1 ma~~~T~--------------------------~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~ 54 (274) +...... .+...+|..+.+.+............. .++. . 
T Consensus 184 ~~~e~r~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~--~- 248 (480) T protein:vir:40 184 ERKFMRELGSKMAEMPEQGFLREFANGADLNVVNSLGSITSKYARKSGIYDGAMKARFQG------------LTLA--E- 248 (480) T ss_pred hhHHHHHHHHHhccchhhhhhhhhhhhccccccccccccccchhhheeechhhhhhhhhc------------ceee--e- Confidence 0000000 011111111111111111000000000 0010 0 Q ss_pred cCCCCcccccCCCccccc--ccccceeEEe---ehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|Aclame:pro 55 TYSGDAQVIAEGEKIPVD--QIGTSKREAK---VRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALK 129 (274) Q Consensus 55 ~~~~~a~~~~eg~~~~~~--~~~~~~~~~~---~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~ 129 (274) .......|++|+..-+.. ..+..+..+. ++++....+.|.+.+.++. ++++++.+++++.++++++++++.... T Consensus 249 ~g~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~~~k~t~~lLDDa~-~l~~~i~~~l~~~~~~~ee~a~l~G~g 327 (480) T protein:vir:40 249 DGVDDTFISGTFKAGTDKNKSQTATKRSLRPQMAEAYLQMDKATVRGVNDSG-ALSEYVMSEMVNRVIQKVEYNMILGSV 327 (480) T ss_pred ccccceeeeeeeecccccccccccccchhhHHHHHHHHHhHHHHHHHhhhhH-HHHHHHHHHHHHHHHHHHHHHhhccCC Confidence 011123455544332222 1122233333 3566667778877776664 899999999999999999999997632 Q ss_pred cccc-------ccc--CcccCHHHHH-HHHHHHhhcCCCcc-EEEEcHHHHHHHHhhhccccccccccccccccccccch Q lcl|Aclame:pro 130 GATL-------TVE--ADITKLDGLQ-TAIDKFNDEDLEPM-VLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGE 198 (274) Q Consensus 130 ~a~~-------~~~--~~~~~~d~iv-~a~~~l~~~~~~~~-~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~ 198 (274) +... ... +...+.++.+ +.+..+...+..+. .|+|||.+|+.|++.++.+. 
.+..++.+..|+..+ T Consensus 328 ~g~~~~~g~~~~~~~~~~~~~~~d~id~L~~al~~~y~~~a~~~vmn~~t~~~I~klKD~~G---~Yi~q~~~~~~~~~~ 404 (480) T protein:vir:40 328 DGSNGFYGLKTATDGWTKQIEYTDLFEGITDAVAECSISDAITIVMSPQTFAELRKAKGTDG---HSRFNELATKEQIAQ 404 (480) T ss_pred CCccccccceeecccccccchhHHHHHHHHHhhhHHhhCCCCEEEECHHHHHHHHHhhcCCC---CeeccCcccccCcce Confidence 2211 111 1122344444 57777777776666 69999999999998875443 234456678888999 Q ss_pred hcceeeEEc-CCCCcceEEEEcC-CeEEEEeccCceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeCCCccc Q lcl|Aclame:pro 199 ALGAVIVRS-NKLNKGEALLAKK-GAVKLITKRDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAGDEV 273 (274) Q Consensus 199 i~G~~Vv~s-~~~p~~~~~l~~~-~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~ 273 (274) ++|+||+++ ..+|.+...+... .++.+.. +.+....+.+..+....+....|++++|..|+++..+++.+-=-| T Consensus 405 llG~pvv~~~~~~~~~~~~~~~~~~~~~~~d-~~~~~~~~~~~~~~~~~~~~e~~v~g~~~~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:40 405 SFGAVNLETRVWMPKDEVAVYNHDEYVLIGD-LNVENYNDFDLRYNVEQWLSETLVGGSIRGKNRSAYLKKKGSLGV 480 (480) T ss_pred ecccceeeeeccccCCcceeeeCCccEEEEe-cccceecccccccchhhhhhhhhhceeeEccccEEEEEeccCcCC Confidence 999998875 4567666544433 3333443 344444444555666778888899999999999999999998888 No 168 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=99.52 E-value=5.2e-15 Score=98.89 Aligned_cols=264 Identities=11% Similarity=0.060 Sum_probs=163.0 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccc---cccccccCCCEEEEEeecCCCCcccccCCCccc-cccccc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADI---DSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIP-VDQIGT 76 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~---~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~-~~~~~~ 76 (274) ||.+.. +.|+..+.+.+...++.+..... .......+|++|+||++.-..+..+|..+.... ..+++. 
T Consensus 1 Mainya--------~~~~~~Ld~~~~~~~lts~~l~~~~~~~~v~~~ggktVkIp~is~tsGl~DY~R~~g~~~~g~v~~ 72 (346) T protein:vir:10 1 MTINYA--------EKYQAAVQQAFYDGHLYSAELWNSPSNSIIKFDGAKHIKVPRLEITSGRKDRQRRTITTPVANYSN 72 (346) T ss_pred CcchhH--------HHHHHHHHHHHHhhhccchhhcccccccceEecCCCEEEEEEeeeecccccccccCCccccccccc Confidence 886552 56888888888776544333311 122333578999999996333578888766664 477888 Q ss_pred ceeEEeeh-hhhcchhcc--HHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc---------ccCcccCHHH Q lcl|Aclame:pro 77 SKREAKVR-KIGKGTELT--DEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLT---------VEADITKLDG 144 (274) Q Consensus 77 ~~~~~~~~-~~~~~~~is--~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~---------~~~~~~~~d~ 144 (274) +..++++. +++..|.+. |+........+...+.+.....++-.+|+..++.+...... ..+...-|+. T Consensus 73 ~~et~tl~qDR~~~F~vD~mDvDETn~~~~~anv~~ef~r~~vvPEiDayrfskLa~~a~~~~~~~~~~~a~T~~ni~~~ 152 (346) T protein:vir:10 73 DWDSYELKNERYWSTLVDPSDIDETNMVVSLANITKQFNLDSKMPEKDRYMFSHLYSGKEAAHDGGITTNTLDEKNILPA 152 (346) T ss_pred ceeEEEeeccccceecccccchHHHHHHhHHHHHHHHHHHHhhcchhhHHHHHHHHHhhhhhccccccccccCHHHHHHH Confidence 88888885 467778887 44333323345555555566666778898877665432211 1122334678 Q ss_pred HHHHHHHHhhcCC--CccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEE--cCCCCc------c- Q lcl|Aclame:pro 145 LQTAIDKFNDEDL--EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVR--SNKLNK------G- 213 (274) Q Consensus 145 iv~a~~~l~~~~~--~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~--s~~~p~------~- 213 (274) +.++...|.++.+ .+++++|+|..+..|.++. .|.+....++....+|.++.+.|++|+. ++.++. 
| T Consensus 153 i~~~~~~lde~~vp~~~rvl~vTp~~~~lLk~s~--~f~k~~~v~~~~~i~~~V~siDGv~Ii~VPs~r~~t~~~f~~G~ 230 (346) T protein:vir:10 153 FDNMMLDFDEARIPSTNRILYVTPKTNAILKRAE--AMNRALTLKDPNNIQRTVYSLDDVTIRVVPSDLMQTAYDFSDGS 230 (346) T ss_pred HHHHHHHHHHccCCCCCeEEEECHHHHHHHhhch--hheeccccccccccceeeeeecCeEEEEcchhhcccchhhccCc Confidence 8888888987754 7799999999999877654 3444333334344689999999999986 445541 1 Q ss_pred ---------eEEEEcCCeEEEEec-cCceeeeccccccCccEEEEEEEEEEEEEc-CcceEEEEeCCCcccC Q lcl|Aclame:pro 214 ---------EALLAKKGAVKLITK-RDFFLEKDRDASRKSTALYSDKHYVAYLYD-ESKVVKITKGAGDEVM 274 (274) Q Consensus 214 ---------~~~l~~~~a~~~~~~-~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~-~~avv~l~~~aa~~~~ 274 (274) ..++++++|..-... ..+.+...-....+.+.+..|.++|+-|++ ....+.+..+.+.+-- T Consensus 231 ~~~t~ak~INfiiv~~~A~ia~~K~~~~~if~P~~~~~g~~l~~~R~Y~D~fv~~nk~~~Iyv~~~~a~~~~ 302 (346) T protein:vir:10 231 KIIDTAKQIEMFLIYNGVQIAPEKYSFVGFDQPSAATSGNYLYYEQSYDDVLLLNTKTKGIQFVVSDKPKKD 302 (346) T ss_pred cccCCccceeEEEECCceeeeeeeeeeeEeeCCCCCcccceeeeeeeeeeeeeeccccceEEEeeecccccC Confidence 125566666543222 223333332344556788999999998883 3344444443333222 No 169 >protein:vir:102335 Length: 312 # NCBI annotation: putative capsid protein # Family: family:all:701 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529560;genbank:gi:90592716;genbank:GeneID:3974467 Probab=99.50 E-value=7.3e-15 Score=98.09 Aligned_cols=265 Identities=10% Similarity=0.091 Sum_probs=169.7 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCc--ccccccccce Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEK--IPVDQIGTSK 78 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~--~~~~~~~~~~ 78 (274) |||.. --.+.|+..+.+.+...+...-+-.......-.+|++|+||++... +..+|..+.. ....+++.+. 
T Consensus 1 Mantl------~ya~~~~~~LD~~~~~~~~s~~l~~~~~~v~~~ggktVkIp~i~~~-gl~DY~R~~g~~~~~g~v~~~~ 73 (312) T protein:vir:10 1 MANTL------AYGQVLQQGLDKQATQELLTGWMDSNAKQIKYEGGKEVKIGKLSTD-GLGDYSRGSANAYVGGDVKFEY 73 (312) T ss_pred CCcch------hHHHHHHHHHHHHHHhhhccccccCCCceEEEecCcEEEEEeeecc-cccccccccCCccccccccccc Confidence 88432 3357899999999988886554433322223357899999999864 5888987655 6666777777 Q ss_pred eEEeeh-hhhcchhcc--HHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccc-----------cCcccCHHH Q lcl|Aclame:pro 79 REAKVR-KIGKGTELT--DEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTV-----------EADITKLDG 144 (274) Q Consensus 79 ~~~~~~-~~~~~~~is--~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~-----------~~~~~~~d~ 144 (274) .+.++. +++..|.+. |.+.......+.....+.....+.-.+|+..++.+.....+. .+...-|+. T Consensus 74 et~tl~qDR~~~F~vD~mDvDETn~~~s~anv~~ef~r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~~T~~ni~~~ 153 (312) T protein:vir:10 74 ETKTMTQDRGRKFTLDAMDVDETNFLVTATTVMGEFQRLKVIPEIDAYRLSRLATIAIGIKGDTNVEYSYSVNSSTIINK 153 (312) T ss_pred eeEEeeecccceeeccccchhhHhhHHHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhccccccccccccccCHHHHHHH Confidence 777774 467777777 544433345566666777778888899999887665332211 122334677 Q ss_pred HHHHHHHHhhcCC-CccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcC--CCC------cce- Q lcl|Aclame:pro 145 LQTAIDKFNDEDL-EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSN--KLN------KGE- 214 (274) Q Consensus 145 iv~a~~~l~~~~~-~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~--~~p------~~~- 214 (274) |.++...|.++++ .+++++|+|..+..|.+.. ......... .....++.++.++|+||+.-+ .+. 
.|+ T Consensus 154 i~~~~~~lde~~vp~~rvl~vTp~~~~lLk~~~-~~~~~~~~~-~~~~i~~~V~~iDgv~Ii~VPs~r~~t~~~f~dG~t 231 (312) T protein:vir:10 154 IKTGIKIIRENGYNGPLVCHLTYDSMFAIEEKV-LEKLTAVTF-AQGGIQTQVPSIDGCALIKTPQNRMYSSILLNDGTT 231 (312) T ss_pred HHHHHHHHHHccCCCceEEEeChHHHHHHhhhh-hceeccccc-ccceeeeeeeeecccEEEEchhhhccceeeeccCcc Confidence 7888888888765 5899999999997766532 222222222 233458889999999999632 221 110 Q ss_pred ------------------EEEEcCCeEEEEec-cCcee-eeccccccCccEEEEEEEEEEEEE-cCcceEEEEeCCCccc Q lcl|Aclame:pro 215 ------------------ALLAKKGAVKLITK-RDFFL-EKDRDASRKSTALYSDKHYVAYLY-DESKVVKITKGAGDEV 273 (274) Q Consensus 215 ------------------~~l~~~~a~~~~~~-~~~~v-e~~r~~~~~~~~i~~~~~~~~~v~-~~~avv~l~~~aa~~~ 273 (274) .++++++|..-... ..+.+ ..+-....+.+.+..|.++|+-|+ +....+.+.++.|..+ T Consensus 232 ~~~~~gg~~~~~~ak~INfiiv~~~a~i~~~K~~~~~if~P~~~~~~d~~~~~~R~Y~D~fv~~nk~~~Iyv~~k~a~~~ 311 (312) T protein:vir:10 232 SNQTAGGYLKGTKALDTNFIIAPVDVPLAITKQDKMRIFDPETNQTANAWSMDYRRYHDLWVTDNKANSVYANFKDAKPV 311 (312) T ss_pred cccccCceeecCcccccceEEeCCceeeceeeeeeeeeeCCCCCCCcceeeeeeeeeeeeeeeccccCeEEEEeecccCC Confidence 23445554322111 11222 122333445678999999999888 5556677888888888 Q ss_pred C Q lcl|Aclame:pro 274 M 274 (274) Q Consensus 274 ~ 274 (274) - T Consensus 312 ~ 312 (312) T protein:vir:10 312 G 312 (312) T ss_pred C Confidence 8 No 170 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=99.47 E-value=2.5e-14 Score=95.13 Aligned_cols=267 Identities=11% Similarity=0.042 Sum_probs=177.7 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccc-ccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQ-IGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~-~~~~~~ 79 (274) |+.-|=.-+.-+.|+.....|+|.+.+++.+...+. 
+....|+..++++....+++.|...++.++++. .+|.++ T Consensus 25 m~alTLaea~~l~~d~~~~~VIE~l~~~s~iL~~lp----f~~ve~~~~~~~r~~~lp~a~~r~~n~~~~~~~~~Tf~q~ 100 (330) T protein:vir:94 25 MPTVTLAESAKLSQDHLVSGLIETIVEVNPLYEMMP----FTEIEGNALAYNRENVLGDVQFLAVGGTITAKNPATFTKV 100 (330) T ss_pred hhhhhhhHHhhcCchhhHHHHHHhhhccchHHhhcc----cccccCCcceeeeeecCCcceeeeccccccccCcceeeee Confidence 775554445568899999999999987765544442 111224457788888889999999999998865 578999 Q ss_pred EEeehhhhcchhccHHHHh--ccCccHHHHHHHHHHHHHHHHHHHHHHHHhcc------------ccc---c-ccCcccC Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVL--SGFGDPQGEAVRQHGLAIANKVDNDVLEALKG------------ATL---T-VEADITK 141 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~--~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~------------a~~---~-~~~~~~~ 141 (274) +..++.++..++|+.+... .+..|+.....+...++++++.+..+|..-.+ +.+ + ..++..+ T Consensus 101 t~~l~~l~~~~~Vd~~iadl~g~~~d~~~~q~~~~ieal~~~~e~~linGDs~~~~F~GL~~~~~~~q~i~tg~~gg~~T 180 (330) T protein:vir:94 101 TSELTTLIGDAEVNGLIQATRSDFMDQTSVQVASKAKSIGRQYQASMITGDGTGNSFQGMMGLVAASQTISAGANGGTLT 180 (330) T ss_pred eechhhhhhhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccchhhcCCcccEEecCCCCCCCC Confidence 9999999999999988753 34457788888899999999999999873211 111 1 1345667 Q ss_pred HHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcc-------- Q lcl|Aclame:pro 142 LDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKG-------- 213 (274) Q Consensus 142 ~d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~-------- 213 (274) .|++=+++......+.++.+++||++...+|+..........-.........-.+.+|.|+||+.+|.+|.+ T Consensus 181 ~d~LDeLl~~v~~~~g~~~~~l~n~a~~r~I~a~~R~~~~~~v~~~~~~~~G~~v~~~~GvPi~~~d~ip~~~~~~~~~~ 260 (330) T protein:vir:94 181 FELLDQLLDLVKDKDGQVDYLMSSFAMRRKYFSLLRALGGAAIGEVMTLPSGRQIPTYRGVPWFVNDFIPSNMTQGTATN 260 (330) T ss_pred HHHHHHHHHHhcCCCCCCcEEEechhHHHHHHHHHHhccCCCCCCcccccCCCEEeeeCCeEEEecccccCCCCcccCCC Confidence 777766666666566788999999998888876544221111100011122334567999999999999863 Q ss_pred 
-e-EEEEc--CC-----eEEEEe--ccCceeeecc-ccccCccEEEEEEEEEEEEEcCcceEEEEeCCCc Q lcl|Aclame:pro 214 -E-ALLAK--KG-----AVKLIT--KRDFFLEKDR-DASRKSTALYSDKHYVAYLYDESKVVKITKGAGD 271 (274) Q Consensus 214 -~-~~l~~--~~-----a~~~~~--~~~~~ve~~r-~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~ 271 (274) + +|+.. .+ -.|.-. ...+.|+.-- ..++.....++..+|+.++.+|.|+.+|..-..= T Consensus 261 ttsIyav~~G~~~~~qgV~Gl~~~g~~glsVr~~G~~~~k~v~~~~v~~y~~~av~~~~a~~~L~~V~~g 330 (330) T protein:vir:94 261 ATAIFAGTFDDGSNKYGIAGLTARGSAGLRVQNVGAKENADETITRVKMYCGFANFSQLGLAAIKGLIPG 330 (330) T ss_pred ceeEEEEeecccccccceEeecCCCCCcceeeeCCCccccceeeEEEEEeeeeEEechhheeeeccccCC Confidence 2 23322 21 133221 1134443322 2334455667788999999999999998865544 No 171 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=99.45 E-value=3.1e-14 Score=94.62 Aligned_cols=261 Identities=18% Similarity=0.135 Sum_probs=166.7 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccc--cccccCCCEEEEEeecCCCCcccccCCCcccccccccce Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDS--TLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSK 78 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~--~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~ 78 (274) ||..= -+.|+..+.+.+...+..+.+.+..+ .....+|++|+||++....+..+|..+...+..+++.+. 
T Consensus 1 Main~--------~~k~~~~ld~~~~~~~~~~~l~~~~n~~~~~~~gak~VkIp~ist~~gl~dY~R~~g~~~g~v~~~~ 72 (285) T protein:vir:79 1 MTVVL--------DSKDLARIDEEYKADSQVWSYLTGGNGVTQRFRGHNEVRINKLSGFVDATAYKRGQDNARKTISVGK 72 (285) T ss_pred Ccchh--------hHHHHHHHHHHHHHhhhhhhhcccCCcceeEecCCCEEEEeeecccccccccccccCccccccceee Confidence 77442 25789999999998888777765433 233457899999999655568889998889999999888 Q ss_pred eEEeeh-hhhcchhccHHHHhccCccHHHHHHHH-HHHHHHHHHHHHHHHHhcccccccc----CcccCHHHHHHHHHHH Q lcl|Aclame:pro 79 REAKVR-KIGKGTELTDEAVLSGFGDPQGEAVRQ-HGLAIANKVDNDVLEALKGATLTVE----ADITKLDGLQTAIDKF 152 (274) Q Consensus 79 ~~~~~~-~~~~~~~is~e~~~~s~~d~~~~~~~~-~a~~~a~~~d~~~i~~~~~a~~~~~----~~~~~~d~iv~a~~~l 152 (274) .+.++. .++..|.+...+...+..-....+.++ ....+.-.+|+..++.+.+...... +...-++.+.++...| T Consensus 73 et~tl~~DR~~~f~iD~mDvdEn~~~~~~ni~~ef~~~~vvPEiDayrfskla~~a~~~~~~~~T~~nv~~~i~~~~~~l 152 (285) T protein:vir:79 73 ETVKLTHEDWFGYDLDQFDMDENGAYTVENVVREHNKMITIPHRDKVAVQKLFDSAAKKATDSITKDNALDAYDTAEAYM 152 (285) T ss_pred eEEEeeccccceecccccchhhhhhhhHHHHHHHHHhhhhcchhhHHHHHHHHhhcccccccccCHHHHHHHHHHHHHHH Confidence 888885 467777777443332322223333333 4445556788887776654332222 2233467778888888 Q ss_pred hhcCC-CccEEEEcHHHHHHHHhhhcccccccccccccc---ccccccchhcc-eeeEE--cCCCCcce------EEEEc Q lcl|Aclame:pro 153 NDEDL-EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNI---IVKGAFGEALG-AVIVR--SNKLNKGE------ALLAK 219 (274) Q Consensus 153 ~~~~~-~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~---~~~g~~~~i~G-~~Vv~--s~~~p~~~------~~l~~ 219 (274) .+.++ .+++++|+|..+..|.+... +.+........ -.++.++.+.| +|++. 
++.++..+ .++++ T Consensus 153 de~~vp~~rvl~vTp~~~~~Lk~s~~--~~r~~~~~~~~~~~~i~~~V~~lDg~v~ii~Vps~r~kt~~~~k~Infiiv~ 230 (285) T protein:vir:79 153 FDNEVPGGFVMFVSSAYYTALKQSAA--VTRTFSTDGTMVINGIDRRVAQLDGGVPIVRVSSDRLKGLGITNHVNFILTP 230 (285) T ss_pred HHcCCCCceEEEEChHHHHHHHhhhh--hheecccccceeccceeeeeccccceeEEEEcchhhccCcCcchhccEEEec Confidence 88765 78899999999998876643 33332221111 13456889998 89987 45665433 25677 Q ss_pred CCeEEEEeccCc-ee-eeccccccCccEEEEEEEEEEEEE-cCcceEEEEeCCCc Q lcl|Aclame:pro 220 KGAVKLITKRDF-FL-EKDRDASRKSTALYSDKHYVAYLY-DESKVVKITKGAGD 271 (274) Q Consensus 220 ~~a~~~~~~~~~-~v-e~~r~~~~~~~~i~~~~~~~~~v~-~~~avv~l~~~aa~ 271 (274) ++|..-....+. .+ ..+-+...+.+.+..|.++|+-|+ +....+.+..+||= T Consensus 231 ~~a~i~~~K~~~~~~f~P~~~~~~d~~~~~~R~Y~d~fv~~nk~~~Iy~~~~a~~ 285 (285) T protein:vir:79 231 LSAIAPIVKYDSVSVIDPSTDRSGNRWTIKGLSYYDAIVLDNAKKGIYVAATAGV 285 (285) T ss_pred CceeccceeeeeeEeECCCCCCCcceeeeeeeeeeeeeehhhccceeeeeecccC Confidence 777543332221 11 223333445678899999999888 33444555544444 No 172 >protein:vir:99523 Length: 311 # NCBI annotation: putative protein # Family: family:all:701 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958538;genbank:gi:41179320;genbank:GeneID:2717161 Probab=99.39 E-value=1.8e-13 Score=90.42 Aligned_cols=264 Identities=12% Similarity=0.077 Sum_probs=165.0 Q ss_pred CCccccchhhccc--hHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccce Q lcl|Aclame:pro 1 MAQGTTKVSNLIV--PEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSK 78 (274) Q Consensus 1 ma~~~T~~~~~~i--Pe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~ 78 (274) |.+. .+++-+ -+.|+..+.+.+...++-+-+......+ -.+|++|+||++.. .+..+|..+......+++.+. 
T Consensus 1 ~~~~---an~mAlnya~~~~~~Ld~~~~~~~~t~~l~~~~~~~-~~Gak~VkIp~i~~-~gl~dY~R~~g~~~g~v~~~~ 75 (311) T protein:vir:99 1 MPTD---AETRGFNYVTKDGNLLDQKITAGLFTAALGTPEVDL-VNGGRSFTLKTIST-SGLKDHTRGKGFNSGTISDEK 75 (311) T ss_pred CCCc---chhhHHHHHHHHHHHHHHHHHhhhcccceecCchhe-eecCCEEEEEeeee-ccccccccccCccccceeeee Confidence 6632 222222 5778899999998887655565544443 24699999999985 468999988888888888888 Q ss_pred eEEeeh-hhhcchhccHHHHhcc--CccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc---------------ccCccc Q lcl|Aclame:pro 79 REAKVR-KIGKGTELTDEAVLSG--FGDPQGEAVRQHGLAIANKVDNDVLEALKGATLT---------------VEADIT 140 (274) Q Consensus 79 ~~~~~~-~~~~~~~is~e~~~~s--~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~---------------~~~~~~ 140 (274) .+.++. .++..|.+...+...+ ...+.....+.....+.-.+|+.-+..+...... .....+ T Consensus 76 et~tl~~DR~~~f~vD~mDvdETn~~~~~ani~~~f~r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~~~~~~~~~l 155 (311) T protein:vir:99 76 TIYTMGQDRDVEFYLDRQDVDETDNELAMANISNVFITEHVQPELDSYRFSKIATSFDNLDGTDTEGTLLAKTHKTEETL 155 (311) T ss_pred eEEEeeeccceeeecchhchhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhcccccccchhhhcccccccccc Confidence 888885 4777788874443333 2334445555666667778898887766422211 011122 Q ss_pred C----HHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccc-cccccccccccccccccchhcceeeEEc---CCCC- Q lcl|Aclame:pro 141 K----LDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDN-FTRPTQLGDNIIVKGAFGEALGAVIVRS---NKLN- 211 (274) Q Consensus 141 ~----~d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~-~~~~~~~~~~~~~~g~~~~i~G~~Vv~s---~~~p- 211 (274) + ++.|..++..+.+.+.++++++|+|..+..|....... .+...+.+.+. .++.++.|.|++|+.. +.+. 
T Consensus 156 t~~nvl~~l~~~~~~~~~v~~~~rvl~vTp~~~~lLk~~~~~~r~~~~~~~~~~~-i~~~V~~lDgv~Ii~V~ps~r~~t 234 (311) T protein:vir:99 156 DETNAYSQLKTGIGKVRKYGTQNLVGYVSSEVMDALERSKEFTRNITNQNVGTTA-LESRITSIDGVQLIEVYESNRFMT 234 (311) T ss_pred CHHHHHHHHHHHHHHHHhcCCCCeEEEEChHHHHHHhhchhhheeeecccccccc-cccccceecCeEEEEecCchhhcc Confidence 3 55666777778777778999999999999775443221 22222223332 4677899999998854 3332 Q ss_pred -----cce----------EEEEcCCeEEEEeccC-ceee-eccccccCccEEEEEEEEEEEEE-cCcceEEEEeCCC Q lcl|Aclame:pro 212 -----KGE----------ALLAKKGAVKLITKRD-FFLE-KDRDASRKSTALYSDKHYVAYLY-DESKVVKITKGAG 270 (274) Q Consensus 212 -----~~~----------~~l~~~~a~~~~~~~~-~~ve-~~r~~~~~~~~i~~~~~~~~~v~-~~~avv~l~~~aa 270 (274) .|. .+++++++..-....+ +.+. ++-+...+.+.+..|.++|+-|+ +....+.+..+-| T Consensus 235 ~~~ft~G~~~~~~ak~INfiiv~~~a~i~~~K~~~v~~f~P~~~~~gd~~l~~~R~Y~D~fv~~nk~~~Iyv~~k~A 311 (311) T protein:vir:99 235 KYDFTDGAKPTEDAKAINFLVVAKPAVISIVKENAVFLFAPGQHTDGDGYLYQNRLYHDLFIKKHKRDGIFVSVKKA 311 (311) T ss_pred hhhhcCCccccCcccccceEEeCCCeeeeeeeeeeeeeeCCCCCCCcceeeeeeeeeeeeeeeccccCeEEEeeecC Confidence 111 2556666654333222 2221 22333445678899999999888 4455566665555 No 173 >protein:vir:2106 Length: 430 # NCBI annotation: coat protein # Family: family:all:1412 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:NP_059630;genbank:gi:9635538;genbank:GeneID:1262831 Probab=99.33 E-value=3.5e-13 Score=88.84 Aligned_cols=258 Identities=14% Similarity=0.092 Sum_probs=159.4 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhccccccccccc---ccCCCEEEEEeecCCCCcccccCCCccc--ccccc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLV---GQPGDTLTFPAFTYSGDAQVIAEGEKIP--VDQIG 75 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~---~~~G~~v~ip~~~~~~~a~~~~eg~~~~--~~~~~ 75 (274) ||++-+....+ . -+++++.+++.+++.+++.+-..+. .+.|+++++|........ +|...+ .+++. 
T Consensus 1 Ma~~~~~~lti---~--~~eal~~~~n~lV~a~~~~~~r~~d~~~~r~Gdti~ip~p~~~~~~----~G~~~t~~~~~~~ 71 (430) T protein:vir:21 1 MALNEGQIVTL---A--VDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ----EGWDLTDKATGLL 71 (430) T ss_pred CccccchhhHH---H--HHHHHHHhhhhhhhhhhhhccCCchhhhhcccceEEeecccccccc----ccccccCCCccce Confidence 99775443332 2 2889999999999998865443332 467999999976443222 222211 23677 Q ss_pred cceeEEeehhh-hcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------ccccCcccCHHHHH Q lcl|Aclame:pro 76 TSKREAKVRKI-GKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT--------LTVEADITKLDGLQ 146 (274) Q Consensus 76 ~~~~~~~~~~~-~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~--------~~~~~~~~~~d~iv 146 (274) .+++.+++.+. ...|.++.+++ +..++.+.+.+...++++.++|..+++....-. -+.......+.++. T Consensus 72 e~~v~~~~~~~~~V~~~~~~kEl--~~~~~~er~l~pAm~~LA~~Vd~dl~~~~~~~~~~v~~~~~~t~~~~~~~~~~~A 149 (430) T protein:vir:21 72 ELNVAVNMGEPDNDFFQLRADDL--RDETAYRRRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVA 149 (430) T ss_pred eeeEeEEEeeeccceEEeehhHh--cChhhHHHHHHHHHHHHHHHHHHHHHHHhhhhhhccccccCCCCCCCCcchhhHH Confidence 78888888765 45578876653 467777888888889999999999997654321 22333445688888 Q ss_pred HHHHHHhhcCC---CccEEEEcHHHHHHHHhhhccccccccccccccccccccch-hcceee-EEcCCCCc--------- Q lcl|Aclame:pro 147 TAIDKFNDEDL---EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGE-ALGAVI-VRSNKLNK--------- 212 (274) Q Consensus 147 ~a~~~l~~~~~---~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~-i~G~~V-v~s~~~p~--------- 212 (274) ++...|.+... .+|..+++|..++.|...- ..+...........++|++++ +.|+.. +.++.+|. 
T Consensus 150 ~a~~~L~~~~vP~~~~R~~~~~p~~~~~l~~~l-~~~~~~~~~~~~A~r~g~i~r~~~Gfd~~~~s~~~~~~t~gt~t~~ 228 (430) T protein:vir:21 150 DAEEIMFSRELNRDMGTSYFFNPQDYKKAGYDL-TKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGI 228 (430) T ss_pred HHHHHHHHhcCCCCCCcEEEeChHHHHHHhhhh-ccccccccchhHHHhhcccccccchhhhhhhcCCcccccCccCcCc Confidence 88888887654 3589999999998874321 111111122223344555544 445432 22222221 Q ss_pred -----------------------------------------ce------------------------------------- Q lcl|Aclame:pro 213 -----------------------------------------GE------------------------------------- 214 (274) Q Consensus 213 -----------------------------------------~~------------------------------------- 214 (274) |+ T Consensus 229 tv~gA~~~~~~~~tv~~~g~~~~~d~~~~~it~s~tg~l~~GD~ftiaGV~~v~~itk~~~~~l~qf~V~a~~~~ttv~I 308 (430) T protein:vir:21 229 TVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGMKRGDKISFAGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEI 308 (430) T ss_pred eeccccccccccceeccccccccccccceeeeeecccceecccEEEecceeeeccccccccCCcceEEEEEecCCceeEE Confidence 00 Q ss_pred ------------------------------------------EEEEcCCeEEEEecc---------------------C- Q lcl|Aclame:pro 215 ------------------------------------------ALLAKKGAVKLITKR---------------------D- 230 (274) Q Consensus 215 ------------------------------------------~~l~~~~a~~~~~~~---------------------~- 230 (274) -+.|++++|....+. + T Consensus 309 ~Pai~~~~~~~~~~~~~~y~nVsaspa~~aavT~v~~a~~~~Nl~fh~~A~~La~~pl~~p~~~~~~~~~~~~~~~~~Gl 388 (430) T protein:vir:21 309 TPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGL 388 (430) T ss_pred eecccccccccccccccccceeccccccCceeEEeccCCcccceeEccceeEEEEecccCCCChhHhhheeeeeccccce Confidence 044566666544321 1 Q ss_pred -ceeeeccccccCccEEEEEEEEEEEEEcCcce-EEEEeCCC Q lcl|Aclame:pro 231 -FFLEKDRDASRKSTALYSDKHYVAYLYDESKV-VKITKGAG 270 (274) Q Consensus 231 -~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~av-v~l~~~aa 270 (274) +.+-+..|...+++..+...-||++.++|+-. 
|.|-.-+| T Consensus 389 sirv~~~yd~~~~~~~~r~DilyG~~~l~Pe~a~v~l~g~~~ 430 (430) T protein:vir:21 389 NGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred EEEEEEccccccCceEEEEEeecCccccCcceEEEEcCCCCC Confidence 11223344455677888888999999999985 77776666 No 174 >protein:vir:78090 Length: 302 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468790;genbank:gi:157325371;genbank:GeneID:5601852 Probab=99.32 E-value=1.2e-12 Score=85.97 Aligned_cols=263 Identities=11% Similarity=0.082 Sum_probs=158.6 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecC----CCCcccccCCCccccccccc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTY----SGDAQVIAEGEKIPVDQIGT 76 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~----~~~a~~~~eg~~~~~~~~~~ 76 (274) |||.. =-.+.|+..+.+.+...+...-+..........+|++|+||++.- ..+..+|..+......+++. T Consensus 1 Mantl------~ya~~~~~~Ld~~~~~~~~t~~l~~~~~~v~~~Gak~vkIp~is~~~~~TsGl~dy~R~~g~~~g~v~~ 74 (302) T protein:vir:78 1 MANSL------ALAQIYQDNIDKAIAVNSKSAFLEANPNNVQYNGGNTIKIADISFGSGTTGDLKAYNRSTGFTQGSVTL 74 (302) T ss_pred CCchh------HHHHHHHHHHHHHHHhhhceeecccCCceEEEecCcEEEEEEEEeeccccccccccccccCccccceee Confidence 88432 123779999999998887655554333333456789999999962 33577888888888888887 Q ss_pred ceeEEeeh-hhhcchhccHHHHhcc--CccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc------ccCcccCHHHHHH Q lcl|Aclame:pro 77 SKREAKVR-KIGKGTELTDEAVLSG--FGDPQGEAVRQHGLAIANKVDNDVLEALKGATLT------VEADITKLDGLQT 147 (274) Q Consensus 77 ~~~~~~~~-~~~~~~~is~e~~~~s--~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~------~~~~~~~~d~iv~ 147 (274) +..+.++. +++..|.+...+...+ ...+.....+.....+.-.+|+.-++.+.+.... 
.....++.+.+++ T Consensus 75 ~~et~tlt~DR~~~f~vD~mDvdETn~~~~~ani~~ef~r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~t~~nvl~ 154 (302) T protein:vir:78 75 AWSDYTLDYDLAQSFQIDAMDVDETKNLATVGNVLSEYQRTKIVPAIDKYRFTKLANDGTGVGGVIDLSKPDASAQALMG 154 (302) T ss_pred eeeeEEeeeccceeeeccccchhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHHhhhccCccccccccchhHHHHHH Confidence 77777774 4677777774444333 3334455555566677788999888766432211 1122345555554 Q ss_pred ----HHHHHhhcCCCccEEEEcHHHHHHHHhhhcccc-ccccccccccccccccchhcceeeEEcC--CCCc-------- Q lcl|Aclame:pro 148 ----AIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNF-TRPTQLGDNIIVKGAFGEALGAVIVRSN--KLNK-------- 212 (274) Q Consensus 148 ----a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~-~~~~~~~~~~~~~g~~~~i~G~~Vv~s~--~~p~-------- 212 (274) +...++++ ++++++|.|..+..|.+.....- +.....+.+ ..++.++.+.|+||+.-+ .+.. T Consensus 155 ~i~~~~~~~~e~--~~~vl~vtp~~~~~Lk~a~~~~~~~~~~~~~~~-~i~~~V~~lDgv~Ii~VPs~r~~t~~~f~~G~ 231 (302) T protein:vir:78 155 DIATAMELVDDS--NQLILVTSPTTLAGLLNTALIRESKNTQVLRRG-EVDTKITFIQDVEVLQVPSEYLYDKVAPKVGV 231 (302) T ss_pred HHHHHHHHhhcc--CCeEEEEChHHHHHHhcchhhccceeccccccc-cccceeeeecccEEEEchhhhcccceeccCCc Confidence 44445554 58999999999998865432221 111122222 246778999999999632 3321 Q ss_pred --c------eEEEEcCCeEEEEeccC-ceee-eccccccCccEEEEEEEEEEEEEcCc-ceEEEEeCCCcc Q lcl|Aclame:pro 213 --G------EALLAKKGAVKLITKRD-FFLE-KDRDASRKSTALYSDKHYVAYLYDES-KVVKITKGAGDE 272 (274) Q Consensus 213 --~------~~~l~~~~a~~~~~~~~-~~ve-~~r~~~~~~~~i~~~~~~~~~v~~~~-avv~l~~~aa~~ 272 (274) + ..+++++++..-....+ +.+. .+-....+.+.+..|.++|+-|++.. 
..+.....++=| T Consensus 232 ~~~~~ak~INfiiv~~~a~ia~~K~~~~~if~P~~~~~gd~~l~~~R~Y~D~fV~~nk~~gI~~~~~~~~~ 302 (302) T protein:vir:78 232 PDYTGAKKIPYMIFKRDAPTGIVKTDKVRVFEPDTNQSADAYKVDLRLYHDLIVPKNQRPGIIKASFGTIA 302 (302) T ss_pred cccCCccceeEEEECCCeeeeeeeeeeeEeeCCCCCCCcceeeeeeeeEeeeeeeccccCeEEEeeccccC Confidence 0 13455666543322222 2222 12222334468899999999998544 555555555555 No 175 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=99.28 E-value=7.2e-14 Score=92.63 Aligned_cols=160 Identities=18% Similarity=0.206 Sum_probs=103.7 Q ss_pred ehhh-hcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc--------------c-cCccc----CH Q lcl|Aclame:pro 83 VRKI-GKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLT--------------V-EADIT----KL 142 (274) Q Consensus 83 ~~~~-~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~--------------~-~~~~~----~~ 142 (274) +..+ -..+.|.|.+..++..|+++.+.+++++++|+..|+.++..+..+... . ...++ .+ T Consensus 1 iD~lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~~g~~~~~~a~~t~~~~~l~ 80 (221) T protein:vir:17 1 MDDLLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQDGGFSVNIGAGNTNNAQAIV 80 (221) T ss_pred CCcchhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccccccCcceeccccccCCHHHHH Confidence 5443 344889999999999999999999999999999999998766433211 0 11112 36 Q ss_pred HHHHHHHHHHhhcCC--CccEEEEcHHHHHHHHhhhccccccc-ccccccccccc-ccchhcceeeEEcCCCCc--ceE- Q lcl|Aclame:pro 143 DGLQTAIDKFNDEDL--EPMVLFVNPLDAGGLRTSASDNFTRP-TQLGDNIIVKG-AFGEALGAVIVRSNKLNK--GEA- 215 (274) Q Consensus 143 d~iv~a~~~l~~~~~--~~~~~v~~p~~~~~L~~~~~~~~~~~-~~~~~~~~~~g-~~~~i~G~~Vv~s~~~p~--~~~- 215 (274) +.|++|...|.+++. ..++++++|+.|+.|++..+..+... ....++.+++| .++.+.|++|+.|+++|. |+. 
T Consensus 81 dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~~~~g~~i~~v~G~~V~~SnnlP~~~gt~~ 160 (221) T protein:vir:17 81 DGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNREIGNTQGDMNTGKGLYVNAGIRIYKSNVLASLYGTNL 160 (221) T ss_pred HHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeeeecccccccccccceeeeecCcEEEEeccCCccccccc Confidence 788889999988765 78899999999999886322111111 11223346667 589999999999999994 321 Q ss_pred --------------------------EEEcCCeEEEEecc------Cce--eeeccccccC Q lcl|Aclame:pro 216 --------------------------LLAKKGAVKLITKR------DFF--LEKDRDASRK 242 (274) Q Consensus 216 --------------------------~l~~~~a~~~~~~~------~~~--ve~~r~~~~~ 242 (274) .+|++.|++.+.-. ++. +..-|.+++. T Consensus 161 ~~~ag~~~~~~~~~~~yr~~fs~~~glv~~~~Avgtvkl~~~~~~~~~~~~~~~~~~~~~~ 221 (221) T protein:vir:17 161 VTDPGDATTSGENNGSYRPAITDRAGLVFHKEAADTVEVLLPPSRPPLVISMFSIRRPDRR 221 (221) T ss_pred ccCCccccccccccccccccccceEEEEEcchheeeeeeecCCCCCceeeeeeeccCCCCC Confidence 34555555543211 111 1112222222 No 176 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=99.22 E-value=1.3e-11 Score=80.19 Aligned_cols=266 Identities=11% Similarity=0.048 Sum_probs=162.9 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCccccc-----CCCcccccccc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIA-----EGEKIPVDQIG 75 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~-----eg~~~~~~~~~ 75 (274) |+.-|=.-+..+.++.....|+|.+.+.+.+-..+. 
+-...|+..++.+....+++...+ -....++...+ T Consensus 1 mpaltLaea~k~~~d~l~~~ViE~~~~~s~lL~~Lp----F~~veg~~~~ynR~~~~~~~~~~~v~~~~~~~g~~~~~~t 76 (310) T protein:vir:97 1 MASVTLAESAKLAQDELVAGVIENIITVNRMFDVLP----FDSIEGNSLAYNRENVLGDVIMAGVGTTFSGAGAGKAAAT 76 (310) T ss_pred CcccchHHHhhcCcchHHHHHHHHHhccchHHHhCC----cccccCCcceeeEeeccCCcccccccccccCCCccccccc Confidence 886555556678889999999999987655444332 111224456666655444444333 23445677888 Q ss_pred cceeEEeehhhhcchhccHHHHhc--c-CccHHHHHHHHHHHHHHHHHHHHHHHHhc---------cc---ccc----cc Q lcl|Aclame:pro 76 TSKREAKVRKIGKGTELTDEAVLS--G-FGDPQGEAVRQHGLAIANKVDNDVLEALK---------GA---TLT----VE 136 (274) Q Consensus 76 ~~~~~~~~~~~~~~~~is~e~~~~--s-~~d~~~~~~~~~a~~~a~~~d~~~i~~~~---------~a---~~~----~~ 136 (274) |++++..++-++..++|.+..... + ..|....-.+...+++.++.+..+|..-. .. .+. .. T Consensus 77 ~~~~~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a~n~F~GL~~~~~~~q~i~~~~~ 156 (310) T protein:vir:97 77 FTKVNSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGAGNEFAGLIQLCASGQKATTGAT 156 (310) T ss_pred cceeeeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhccccCCCcccchhhcCCccceeecCCC Confidence 999999999999999998754332 3 34556666778889999999999986321 11 111 12 Q ss_pred CcccCHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcce-- Q lcl|Aclame:pro 137 ADITKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGE-- 214 (274) Q Consensus 137 ~~~~~~d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~-- 214 (274) ++..+.|++=+++........++.+++|||+++.+|+.....-....-.........-.+.+|.|+|++.++.+|.+. 
T Consensus 157 gg~~t~d~LDeLl~~v~~~~g~p~~~l~~~~~~r~i~A~~R~~~~~g~~~~~~~~~G~~v~~~~GiPi~~~d~ip~~~~~ 236 (310) T protein:vir:97 157 GSAISFAILDELMDLVVDKDGQVDYLTMHARTLRSYKALLRALGGASINEVVELPSGAEVPAYSGTPIFRNDYIPTNQTK 236 (310) T ss_pred CCCCCHHHHHHHHHHHhcCCCCCCEEEecHHHHHHHHHHHHHhcCCCCCCccccCCCCEEeeeCCeEEEEeCccCCCccc Confidence 355677766665556655667899999999876666543321111111111111223345689999999999998632 Q ss_pred --------EEE--EcCCe-----EEEEe--ccCceeeecc-ccccCccEEEEEEEEEEEEEcCcceEEEEeCCC Q lcl|Aclame:pro 215 --------ALL--AKKGA-----VKLIT--KRDFFLEKDR-DASRKSTALYSDKHYVAYLYDESKVVKITKGAG 270 (274) Q Consensus 215 --------~~l--~~~~a-----~~~~~--~~~~~ve~~r-~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa 270 (274) +|+ ++.++ +|... ...+.|+... -.++.....++..+|+.+|.+|.|+.+|..-.= T Consensus 237 ~~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~~~~glsVr~~G~~~~~~v~~~~V~~Y~~~av~~~~A~a~L~~V~~ 310 (310) T protein:vir:97 237 GGTTGCTTIFAGTLDDGSRTHGIAGLTATQAAGIQVVDVGESEDSDEHIWRVKWYCGLALFSEKGLACADGITN 310 (310) T ss_pred cccCCceeEEEEeeCccccccceeccccCCccceeEEeCCcccCCcceeEEEEEeeeEEEecccceeeeccccC Confidence 233 33322 22211 1234444433 223444556677899999999999999876544 No 177 >protein:vir:9265 Length: 430 # NCBI annotation: 5 # Family: family:all:1412 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720329;genbank:gi:24371587;genbank:GeneID:955820 Probab=99.15 E-value=1.3e-11 Score=80.18 Aligned_cols=261 Identities=12% Similarity=0.058 Sum_probs=150.7 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccc---cccCCCEEEEEeecCCCCcccccCCCcccc--cccc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTL---VGQPGDTLTFPAFTYSGDAQVIAEGEKIPV--DQIG 75 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~---~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~--~~~~ 75 (274) ||+.-++.. +++.+++++.+++.+++.+.+.+...+ ..+.|++|++|...... .-+|..++. +++. 
T Consensus 1 MAn~l~~~~-----~ii~~eal~~l~n~~v~a~~~~~~r~~d~~~~r~Gdti~~p~~~~~~----~~~G~~~t~~~~~i~ 71 (430) T protein:vir:92 1 MALNEGQIV-----TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESP----TQEGWDLTDKATGLL 71 (430) T ss_pred CccchhhHH-----HHHHHHHHHHHhhhhhhhhhhcccCCchhhhhcccceEEeccccccc----cccCcccCCCCCccc Confidence 998755533 368889999999999998876543322 24679999999865432 222333322 3566 Q ss_pred cceeEEeehhh-hcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------ccccCcccCHHHHH Q lcl|Aclame:pro 76 TSKREAKVRKI-GKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT--------LTVEADITKLDGLQ 146 (274) Q Consensus 76 ~~~~~~~~~~~-~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~--------~~~~~~~~~~d~iv 146 (274) ..++++++.+. ...|.++..++ ...+......+...++++.++|..+++....-. -+.......+.++. T Consensus 72 e~~v~~~v~~~k~V~~~~~~kel--~~~~~~~~~i~~Am~~LA~~Vd~dl~~~~~~~~~~v~~~~~~t~~~~~~~~~~~A 149 (430) T protein:vir:92 72 ELNVAVNMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVA 149 (430) T ss_pred cceEEEEEeeeccceEEechhHh--cChhHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccccccccCCCcCCcchhhHH Confidence 77888888765 45688887664 344445555577778999999999997653321 12233344678898 Q ss_pred HHHHHHhhcCC---CccEEEEcHHHHHHHHhhhccccccccccccccccccccch-hccee-eEEcCCCCcce-----EE Q lcl|Aclame:pro 147 TAIDKFNDEDL---EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGE-ALGAV-IVRSNKLNKGE-----AL 216 (274) Q Consensus 147 ~a~~~l~~~~~---~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~-i~G~~-Vv~s~~~p~~~-----~~ 216 (274) ++...|.+... .+|..+++|..++.|...- .............+++|++++ +.|+. 
++.++.+|..+ .+ T Consensus 150 ~a~~~L~~~~vP~~~~R~~vldp~~~~~l~~~l-~~l~~~~~~~~~A~r~g~i~~~~~Gfd~~~~~~~~~~~t~g~~t~~ 228 (430) T protein:vir:92 150 DAEELMFSRELNRDMGTSYFFNPQDYKKAGYDL-TKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGI 228 (430) T ss_pred HHHHHHHHhcCCCCCCcEEEeChHHHHHHHhhh-ccccccccchhHHHhhccccccchhhhhhhhcCCcccccCccCcCc Confidence 89889988765 3589999999999985321 122223334446689999997 88996 46677777411 11 Q ss_pred -EEcCCeEE---EE-eccC-----------ceeeeccccccCccEEEEEEEEEEEE------EcCcceEEEEeCCCcccC Q lcl|Aclame:pro 217 -LAKKGAVK---LI-TKRD-----------FFLEKDRDASRKSTALYSDKHYVAYL------YDESKVVKITKGAGDEVM 274 (274) Q Consensus 217 -l~~~~a~~---~~-~~~~-----------~~ve~~r~~~~~~~~i~~~~~~~~~v------~~~~avv~l~~~aa~~~~ 274 (274) +-+.+..+ +- .... +++ +...-....|.+..--.+.... -++.-+++.-..++++|- T Consensus 229 tv~gA~~~~~~~~~v~~~g~~~~~d~~~~tit~-s~tg~l~~GD~ftiaGV~~v~~~tkq~~~~l~~F~Vt~~~~atsv~ 307 (430) T protein:vir:92 229 TVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTL-SATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVE 307 (430) T ss_pred eeccccccccccceecccccccccccccceeee-ecccceecccEEEecceeeeccccccccCCccEEEEEEecCCceeE Confidence 11111110 00 0000 011 0001111222222222221111 134445554445555544 No 178 >protein:vir:100939 Length: 430 # NCBI annotation: Gp5 # Family: family:all:1412 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006408;genbank:gi:46358700;genbank:GeneID:2777089 Probab=99.15 E-value=1.3e-11 Score=80.18 Aligned_cols=261 Identities=12% Similarity=0.058 Sum_probs=150.7 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccc---cccCCCEEEEEeecCCCCcccccCCCcccc--cccc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTL---VGQPGDTLTFPAFTYSGDAQVIAEGEKIPV--DQIG 75 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~---~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~--~~~~ 75 (274) ||+.-++.. +++.+++++.+++.+++.+.+.+...+ ..+.|++|++|...... .-+|..++. +++. 
T Consensus 1 MAn~l~~~~-----~ii~~eal~~l~n~~v~a~~~~~~r~~d~~~~r~Gdti~~p~~~~~~----~~~G~~~t~~~~~i~ 71 (430) T protein:vir:10 1 MALNEGQIV-----TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESP----TQEGWDLTDKATGLL 71 (430) T ss_pred CccchhhHH-----HHHHHHHHHHHhhhhhhhhhhcccCCchhhhhcccceEEeccccccc----cccCcccCCCCCccc Confidence 998755533 368889999999999998876543322 24679999999865432 222333322 3566 Q ss_pred cceeEEeehhh-hcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------ccccCcccCHHHHH Q lcl|Aclame:pro 76 TSKREAKVRKI-GKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT--------LTVEADITKLDGLQ 146 (274) Q Consensus 76 ~~~~~~~~~~~-~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~--------~~~~~~~~~~d~iv 146 (274) ..++++++.+. ...|.++..++ ...+......+...++++.++|..+++....-. -+.......+.++. T Consensus 72 e~~v~~~v~~~k~V~~~~~~kel--~~~~~~~~~i~~Am~~LA~~Vd~dl~~~~~~~~~~v~~~~~~t~~~~~~~~~~~A 149 (430) T protein:vir:10 72 ELNVAVNMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVA 149 (430) T ss_pred cceEEEEEeeeccceEEechhHh--cChhHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccccccccCCCcCCcchhhHH Confidence 77888888765 45688887664 344445555577778999999999997653321 12233344678898 Q ss_pred HHHHHHhhcCC---CccEEEEcHHHHHHHHhhhccccccccccccccccccccch-hccee-eEEcCCCCcce-----EE Q lcl|Aclame:pro 147 TAIDKFNDEDL---EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGE-ALGAV-IVRSNKLNKGE-----AL 216 (274) Q Consensus 147 ~a~~~l~~~~~---~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~-i~G~~-Vv~s~~~p~~~-----~~ 216 (274) ++...|.+... .+|..+++|..++.|...- .............+++|++++ +.|+. 
++.++.+|..+ .+ T Consensus 150 ~a~~~L~~~~vP~~~~R~~vldp~~~~~l~~~l-~~l~~~~~~~~~A~r~g~i~~~~~Gfd~~~~~~~~~~~t~g~~t~~ 228 (430) T protein:vir:10 150 DAEELMFSRELNRDMGTSYFFNPQDYKKAGYDL-TKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGI 228 (430) T ss_pred HHHHHHHHhcCCCCCCcEEEeChHHHHHHHhhh-ccccccccchhHHHhhccccccchhhhhhhhcCCcccccCccCcCc Confidence 89889988765 3589999999999985321 122223334446689999997 88996 46677777411 11 Q ss_pred -EEcCCeEE---EE-eccC-----------ceeeeccccccCccEEEEEEEEEEEE------EcCcceEEEEeCCCcccC Q lcl|Aclame:pro 217 -LAKKGAVK---LI-TKRD-----------FFLEKDRDASRKSTALYSDKHYVAYL------YDESKVVKITKGAGDEVM 274 (274) Q Consensus 217 -l~~~~a~~---~~-~~~~-----------~~ve~~r~~~~~~~~i~~~~~~~~~v------~~~~avv~l~~~aa~~~~ 274 (274) +-+.+..+ +- .... +++ +...-....|.+..--.+.... -++.-+++.-..++++|- T Consensus 229 tv~gA~~~~~~~~~v~~~g~~~~~d~~~~tit~-s~tg~l~~GD~ftiaGV~~v~~~tkq~~~~l~~F~Vt~~~~atsv~ 307 (430) T protein:vir:10 229 TVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTL-SATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVE 307 (430) T ss_pred eeccccccccccceecccccccccccccceeee-ecccceecccEEEecceeeeccccccccCCccEEEEEEecCCceeE Confidence 11111110 00 0000 011 0001111222222222221111 134445554445555544 No 179 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=99.04 E-value=9.1e-11 Score=75.64 Aligned_cols=258 Identities=13% Similarity=0.108 Sum_probs=160.8 Q ss_pred CCccccchhh--ccchHHHHHHHHHHHHHh-hhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccc Q lcl|Aclame:pro 1 MAQGTTKVSN--LIVPEVLAPMMQAELDKK-LRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTS 77 (274) Q Consensus 1 ma~~~T~~~~--~~iPe~~~~~v~~~~~~~-~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~ 77 (274) +|-. .+++| .+.-.+....+++.++.. .-+..++... 
++ .+-+..+-.+++..++...+.||+++....++-. T Consensus 359 ~A~~-hsTsDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~-~~--~DFk~~~~~~lg~~~~L~~V~E~gEyk~~t~~e~ 434 (652) T protein:vir:79 359 AAFT-HSTSDFGNILLDVANKAILQGWEDAPETYEQWTRKG-QL--SDFKIAHRVGMGGFSALRQVREGAEYKYVTTGDK 434 (652) T ss_pred HHhh-cCcchHHHHHHHHHHHHHHHHHhhhHHHHHHHhccC-CC--ccccccceeecCCCCCccccCCCCccceeeecCc Confidence 3321 12334 233333333334444322 1223333221 11 2223334445566788889999999999888877 Q ss_pred eeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc----------------ccCcccC Q lcl|Aclame:pro 78 KREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLT----------------VEADITK 141 (274) Q Consensus 78 ~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~----------------~~~~~~~ 141 (274) ..+..+.++|+.|.+|++.+.....+....+-+.++++-++.+++.+++.+.+.+.- ..++.++ T Consensus 435 ~e~~~l~tyG~~~~iTRqaiINDDL~a~~~ip~~~g~aA~~~~~~~vy~~l~~Np~~~~DGk~LF~hA~H~Nl~~~aa~~ 514 (652) T protein:vir:79 435 QATIALATYGELFSITRQAIINDDLNMLTDVPMKLGRAAKSTIADLVYAILTSNPKISTDNVSLFDKAKHANVLESAAMD 514 (652) T ss_pred cceeeeecccCeeeeehheeeccchhHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccccCCceeecccccccccccccCC Confidence 788999999999999999988888899999999999999999999888777544311 1234456 Q ss_pred HHHHHHHHHHHhhc-------CCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcce-eeEEcCCCCc- Q lcl|Aclame:pro 142 LDGLQTAIDKFNDE-------DLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGA-VIVRSNKLNK- 212 (274) Q Consensus 142 ~d~iv~a~~~l~~~-------~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~-~Vv~s~~~p~- 212 (274) .+.+-.|+.++... +..|++|+++|+.....++.-........+ ...|.+.-+.|+ .|++++.+.. 
T Consensus 515 ~~~l~~ar~aM~~Qk~g~~~l~i~P~~llvp~~le~~a~~ll~s~~v~~a~-----~~~~~~Np~~~~~~~i~eprL~~~ 589 (652) T protein:vir:79 515 VASLDKARQLMRVQKEGERHLNIRPAFVLVPTAMESVANQVIRSSSVKGAD-----INAGIINPVKDFATVIAEPRLDDN 589 (652) T ss_pred HHHHHHHHHHHHHhccCCccccccccEEEecchhHHHHHHHhccCCCcccc-----cccccccccccccccccccccCCC Confidence 77777776665332 236789999999765543321111111111 112223334553 7788888864 Q ss_pred -ce-EEEEcCCe-----EEEEeccC-ceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEe Q lcl|Aclame:pro 213 -GE-ALLAKKGA-----VKLITKRD-FFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITK 267 (274) Q Consensus 213 -~~-~~l~~~~a-----~~~~~~~~-~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~ 267 (274) .+ -|++.+.. ++|..+.+ ..+|+......+...+++++-||++++|=-+++|.|- T Consensus 590 s~~~wylaa~~~~dtiev~yL~G~~~P~ie~~~gf~~dG~~~kvrlD~G~~~iD~RG~~k~t~ 652 (652) T protein:vir:79 590 SQTTFYLAASKGSDTIEVAYLNGVDTPYIDQMEGFSVDGVTTKVRIDAGVAPVDHRGLVKCTA 652 (652) T ss_pred CcccEEEecCCCCCeEEEEEecCCCCCeeeecCCCCcceEEEEEEEeccCceeeccceeeecC Confidence 22 34554332 44555544 3466655555666778899999999999999999776 No 180 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=99.00 E-value=1.5e-10 Score=74.45 Aligned_cols=259 Identities=14% Similarity=0.126 Sum_probs=159.9 Q ss_pred CCccccchhh--ccchHHHHHHHHHHHHHh-hhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccc Q lcl|Aclame:pro 1 MAQGTTKVSN--LIVPEVLAPMMQAELDKK-LRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTS 77 (274) Q Consensus 1 ma~~~T~~~~--~~iPe~~~~~v~~~~~~~-~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~ 77 (274) ||-. .+++| .+.-.+....+++.++.. .-+...+.. .++ .+-+..+-..++..++...+.||+++....+... 
T Consensus 394 ~a~~-htTSDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~-~~~--~DFk~~~~~~lg~~~~L~~V~E~gEyk~~t~~e~ 469 (693) T protein:vir:95 394 LAFT-HTSSDFGLILLDVANKSVLAGWEEAEETFPLWTKS-GIL--TDFKPARRVGLGEFSSLRQVREGAEYKYVTLGER 469 (693) T ss_pred HHHh-cCcchhHHHHHHHHHHHHHHHHHhhhhHHHHHhcc-CCC--CcccccceeecCCCCChhhcCCCCceeeeecCCc Confidence 3321 12222 233333333444433321 112222221 111 1122233334566677888999999988888877 Q ss_pred eeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc----------------cccCcccC Q lcl|Aclame:pro 78 KREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL----------------TVEADITK 141 (274) Q Consensus 78 ~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~----------------~~~~~~~~ 141 (274) .-++.+.++|+.|.+|++.+.....+....+-+.++++.++.+++.+++.+.+.+. ++.+..++ T Consensus 470 ~e~~~l~tyG~~~~iTRqaiINDDLga~~~ip~~~g~aA~~~~~~~vy~~L~~Np~m~DGk~LFhadH~Nl~tga~sals 549 (693) T protein:vir:95 470 GEQIILATYGELFSITRQAIINDDLQMLSDIPFKLGQAAKATIGDLVYAVLTGNPAMSDGKTLFHADHSNLLTGAASALS 549 (693) T ss_pred cceeehhhcCCeeeecHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccCCcceeeccccccccccccccC Confidence 78899999999999999999888889999999999999999999999988865431 12234567 Q ss_pred HHHHHHHHHHHhhc------------CCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcce-eeEEcC Q lcl|Aclame:pro 142 LDGLQTAIDKFNDE------------DLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGA-VIVRSN 208 (274) Q Consensus 142 ~d~iv~a~~~l~~~------------~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~-~Vv~s~ 208 (274) .+.+-.++.++... 
+..|++|+++|+.....+...........+ ...|.+.-+.|+ .||.++ T Consensus 550 ~~sl~~a~~am~~qk~~~~~~~g~~L~i~P~~llvP~~le~~a~~l~~s~~~~~a~-----~~~~~~NP~~~~~~vi~~p 624 (693) T protein:vir:95 550 IDSLSKAKTQMATQKAQVEKGKGRTLNIRPGFVLTPVALEDKANQIINSESVPGAD-----VNSGIVNPIRAFAQVIGEP 624 (693) T ss_pred hHHHHHHHHHHHHhhcchhccCCceeecccceEEecchHHHHHHHHhccccccccc-----cccccccchhccccccccc Confidence 88887776665321 236788999888766554432111111111 112222235553 678888 Q ss_pred CCCc--ceE-EEEcCCe-----EEEEeccC-ceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeC Q lcl|Aclame:pro 209 KLNK--GEA-LLAKKGA-----VKLITKRD-FFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKG 268 (274) Q Consensus 209 ~~p~--~~~-~l~~~~a-----~~~~~~~~-~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~ 268 (274) .+.. ++. |++...+ ++|..+.+ ..+|+...-..+...+++++-||++++|=-+++|=..+ T Consensus 625 rL~~~s~~~Wyl~a~~~~dtie~~yL~G~~~P~ie~~~gf~~dG~~~kvr~D~G~~~iD~Rg~~kn~GA 693 (693) T protein:vir:95 625 RLDDASATAWYMAAKKGSDTIEVAYLDGVDTPYLEQQEGFTVDGVASKVRIDAGVAPLDFRGLQKSNGA 693 (693) T ss_pred eecCCCCCceEEecCCCCCeEEEEEecCCCCCeEeecCCCCcceEEEEEEEeccCceeeccccccCCCC Confidence 8853 444 4544322 44555544 34566655566667888999999999999888885444 No 181 >protein:vir:95451 Length: 313 # NCBI annotation: hypothetical protein ORF044 # Family: family:all:11728 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294637;genbank:gi:149408203;genbank:GeneID:5237018 Probab=98.88 E-value=1.8e-10 Score=73.98 Aligned_cols=265 Identities=17% Similarity=0.154 Sum_probs=174.4 Q ss_pred ccccchh-hccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccceeEE Q lcl|Aclame:pro 3 QGTTKVS-NLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKREA 81 (274) Q Consensus 3 ~~~T~~~-~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~~~ 81 (274) .+-|+-. .++..|+|++.+...+.+.+.--++......+. +|++++||.++. 
+......|..++....+..+++++ T Consensus 1 ~~~TSNT~A~I~SE~~s~~I~~~LH~~LL~~~~~R~V~DF~--~G~~L~I~tiGs-~~~~~~~E~~~~~~~~i~TGEIt~ 77 (313) T protein:vir:95 1 MQLTSNTRAFIESEQYSKFILLNLHDGLLPETFYRNVSDFG--SGETLHIKTIGS-VTLQEAEEDTPLIYNPIETGEITF 77 (313) T ss_pred CcccccchheehhhhHHHHHHHHhhccccchhhhhhhccCC--CCCEEEecccCc-eeeeccccCCCeeecccccceEEE Confidence 3333333 367889999999999888776666665444443 589999999874 567778899999999999999999 Q ss_pred eehhh-hcchhccHHHHhccC--ccHHHHHHHHHHHHHHHHHHHHHHHHhc------cccc-----------cccCcccC Q lcl|Aclame:pro 82 KVRKI-GKGTELTDEAVLSGF--GDPQGEAVRQHGLAIANKVDNDVLEALK------GATL-----------TVEADITK 141 (274) Q Consensus 82 ~~~~~-~~~~~is~e~~~~s~--~d~~~~~~~~~a~~~a~~~d~~~i~~~~------~a~~-----------~~~~~~~~ 141 (274) .+.++ |-+|.+|+.+.+++- .+++.....+.++++-...+..+++... ..+. +.+.+... T Consensus 78 ~i~~Y~G~A~~vt~~LR~D~~~I~~~~A~~~AE~~RAI~E~~~TD~L~~G~~~FA~~~~P~~vNG~PH~~V~~~T~~~~~ 157 (313) T protein:vir:95 78 QITEYKGDAWYVTDDLREDGTDIDRLMAERAAESTRAIQETFETDFLKTGAEYFAANPGPHNVNGFPHVIVSAETNGVFA 157 (313) T ss_pred EEEeecCChhhhhhhhhhcchhHHHHhhhcchhhHHHHHHHHhhHHHhhchhhhccCCCCcccccccceEEeccCCceeh Confidence 99886 557999999988873 2455555666677777777777775432 1111 22334556 Q ss_pred HHHHHHHHHHHhhcC--CCccEEEEcHHHHHHHHhhhccccccccccccccccccc------cchhcceeeEEcCCCCc- Q lcl|Aclame:pro 142 LDGLQTAIDKFNDED--LEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGA------FGEALGAVIVRSNKLNK- 212 (274) Q Consensus 142 ~d~iv~a~~~l~~~~--~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~------~~~i~G~~Vv~s~~~p~- 212 (274) ..++..+...+..++ .++++.++.|.....|...-.... ..++.+-=++-+|. +..++|..+.+|+.+-. 
T Consensus 158 ~~~~~~~~~~~~~a~~P~~G~v~IvDP~~~~~L~~l~~It~-~vt~~~k~I~ESG~A~~~~Fi~~~YG~Di~~SN~L~~A 236 (313) T protein:vir:95 158 LKHLIAMRLAFDKANVPAEGRVFIVDPVAEATLNGLVTITH-DVTDFGKMILESGMARGQRFIMNLYGWDILTSNRLHVA 236 (313) T ss_pred hhHHHHhhhhhhhccCCccceEEEEcchhhhhhhhhheeec-ccccccceeeeccCCchhHHHHHHhhhhhhhhhhhhhc Confidence 778888888887765 488999999999888764422211 01111111122221 23478999998887642 Q ss_pred ----------ceE---EE--EcCC---eEEEEeccCceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeCCCccc Q lcl|Aclame:pro 213 ----------GEA---LL--AKKG---AVKLITKRDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAGDEV 273 (274) Q Consensus 213 ----------~~~---~l--~~~~---a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~~~ 273 (274) |.+ |. .+.+ -++-|.++| +.+.+|+.++..+--..++|||.++.+.+.++.+- +.|++- T Consensus 237 N~~D~~tT~~G~~~NlFM~i~D~~~~P~~~AWr~MP-~s~~~~~~~~~~~~~~~~~R~G~Gi~R~~~L~~~~-~~A~~~ 313 (313) T protein:vir:95 237 NYNDGTTTGNGYVGNLFMCILDDQTKPIMGAWRRMP-KSEGERNKDRARDEHVVRCRYGFGIQRLDTLGLLA-TSATAY 313 (313) T ss_pred cccccccccCceeeeeeeeeecccccceeeeecccc-ccccccccccccccceeeeeecccceeecceeEEE-eccccC Confidence 111 21 1221 223334444 56677777776677778999999999888777653 445555 No 182 >protein:vir:93696 Length: 364 # NCBI annotation: Bcep22gp55 # Family: family:all:974 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944284;genbank:gi:38640361;genbank:GeneID:2658350 Probab=98.88 E-value=3.7e-10 Score=72.28 Aligned_cols=270 Identities=16% Similarity=0.156 Sum_probs=160.0 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcc-cc--------cccccccccCCCEEEEEeecCCCCcccccCCCcc-- Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQ-FA--------DIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKI-- 69 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~-l~--------~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~-- 69 (274) ||.+.+..++-.....|+..+.....+.+.+.. +. ++..++....|++|+|+-..... 
-..+.+++.+ T Consensus 1 Ma~T~~~~~~p~a~~~ws~~l~~~~~~~s~f~~~l~G~~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~-g~gv~Gd~~leG 79 (364) T protein:vir:93 1 MSQTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDRITFDLSVHLR-GKPTYGDARVEG 79 (364) T ss_pred CceeccCcCCHHHHHHHHHHHHHHHHhhCccccccccCCCCCcEEEeeecCCCCCceEEeeeeeecc-cCCcccCceeec Confidence 998888877766667899877655544443332 22 23345556679999998877663 3444444443 Q ss_pred cccccccceeEEeehhhhcchhc-cHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc--------------- Q lcl|Aclame:pro 70 PVDQIGTSKREAKVRKIGKGTEL-TDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL--------------- 133 (274) Q Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~i-s~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~--------------- 133 (274) -++.+++.+..+.+......++. .....+.+..|+.+..++.++..|++..|+.++-.+.++.. T Consensus 80 nee~L~~~~~~i~idq~r~~V~~~g~ms~qRt~~dlr~~ar~~L~~w~~~~~d~~~f~~laGarg~~~~~~~~~~~~~~~ 159 (364) T protein:vir:93 80 KEESLRFYQDEVRIDQVRHSVSAGGRMSRKRTVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFIETPDFTGYA 159 (364) T ss_pred cccceeEEeeEEEEeeccccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccCccccc Confidence 45689999999999888777765 33456778899999999999999999999988866543210 Q ss_pred -----c------------------ccCcccCHHHHHHHHHHHhhcC----------------CCccEEEEcHHHHHHHHh Q lcl|Aclame:pro 134 -----T------------------VEADITKLDGLQTAIDKFNDED----------------LEPMVLFVNPLDAGGLRT 174 (274) Q Consensus 134 -----~------------------~~~~~~~~d~iv~a~~~l~~~~----------------~~~~~~v~~p~~~~~L~~ 174 (274) + .+++.++++.|-+|...+.... 
.+..+++|||-.+..|++ T Consensus 160 ~N~v~aPt~~r~~~~~~at~~~~l~stD~~sl~~id~a~~~a~~~~~~~~~~~~~~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) T protein:vir:93 160 GNPLDAPDVDHLLYGGVATSKASLAATDIMAPLVIEKAVEKAAMMQAENPDVANMVPVSIDGDDHYVCVMSEYQATDMRT 239 (364) T ss_pred ccccCCCCCCcEEeccccCchhhccccccccHHHHHHHHHHHHHhCCCCCCCcccceeEecCcceeEEEEcchhhhhhhh Confidence 0 0124456777777777664432 123378999999999987 Q ss_pred hhccccc------cccccccccccccccchhcceeeEEcCCCCc------------ceEEEEcCCeEEEE--eccCcee- Q lcl|Aclame:pro 175 SASDNFT------RPTQLGDNIIVKGAFGEALGAVIVRSNKLNK------------GEALLAKKGAVKLI--TKRDFFL- 233 (274) Q Consensus 175 ~~~~~~~------~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~------------~~~~l~~~~a~~~~--~~~~~~v- 233 (274) +.+.+|. .......+.+..|.+|.|.|+.|.....++. .-+++++..|++.. ....... T Consensus 240 ~t~~~w~d~qk~A~~~~g~~nPlF~G~~gm~ngvii~~~~~vi~~~~~~~~~~v~~~ralllGaQA~~~a~g~~~g~~~~ 319 (364) T protein:vir:93 240 AAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGLRFD 319 (364) T ss_pred cCCHHHHHHHHHhhhcccccCCceecCeeeEcCeEEeccCCcccccccccCccccchhhheecceeeEEEeecCCCCCce Confidence 5532221 1222334668889999999999998877752 22366666665443 3222221 Q ss_pred --eeccccccCccEEEEEEEEEEEEE----cCcceEEEEeCC-Ccc Q lcl|Aclame:pro 234 --EKDRDASRKSTALYSDKHYVAYLY----DESKVVKITKGA-GDE 272 (274) Q Consensus 234 --e~~r~~~~~~~~i~~~~~~~~~v~----~~~avv~l~~~a-a~~ 272 (274) |...|-.. 
...+.+...+|.+-+ ..-+++.|-.++ +-+ T Consensus 320 w~Ee~~D~gn-~~~i~~~~i~G~kK~rF~~~DfGvi~idtaa~~~~ 364 (364) T protein:vir:93 320 WEETVKDYGN-EPAIAAGFIAGMKKARFNNKDFGVISIDTAAKKHS 364 (364) T ss_pred eeecccCCCC-chhhhhhhHhhhhhcccCCccceEEEecccccccC Confidence 21111111 111222222211111 111122211111 111 No 183 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=98.84 E-value=2.2e-09 Score=68.06 Aligned_cols=262 Identities=12% Similarity=0.063 Sum_probs=144.8 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCccc--ccC-CCcccccccccc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQV--IAE-GEKIPVDQIGTS 77 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~--~~e-g~~~~~~~~~~~ 77 (274) +..... .+..+-|++ +.+++++..+...+-+.+... .+...+.+|++++-. .--- ..| |..-...+.+.. T Consensus 23 it~~~l-~~g~L~p~~-a~~Fl~~v~~~t~iL~~~r~~----~~~s~~~ei~kig~G-~r~~r~~~e~~~~~~~~~~~~~ 95 (360) T protein:vir:99 23 IGLAEL-DGFQLPVDV-TEEFLERMQKGVQILGMADTM----TLARLEMEVPQFGVP-RLSGHTRDEEGSRTENSEAESG 95 (360) T ss_pred cccccc-CceeecHHH-HHHHHHHHhhccchhhhccee----ecccccccccccccc-eeeccccccCCCCCcCCcCccc Confidence 221222 134566775 455556666666665655443 222334555555431 1111 111 111111223333 Q ss_pred eeEE-eehhhhcchhccHHHHhcc----CccHHHHHHHHHHHHHHHHHHHHHHHHhccccc------------------- Q lcl|Aclame:pro 78 KREA-KVRKIGKGTELTDEAVLSG----FGDPQGEAVRQHGLAIANKVDNDVLEALKGATL------------------- 133 (274) Q Consensus 78 ~~~~-~~~~~~~~~~is~e~~~~s----~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~------------------- 133 (274) ++.. ..++.-..+.+..+.+++. ...+.+.+.+.+++.+++.++...+..-..... 
T Consensus 96 ~v~~~~~~~~~~~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l~~~g~~ds~d~~~~~~~d~fl~~~dGwlK 175 (360) T protein:vir:99 96 SVKFNATDKSYYILVEPKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLMGIRAGASSGNLQSIGGAAELDNTFKGWIA 175 (360) T ss_pred cCccccccceeeEeechHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHHHhhccchhcccccCcccchhhhhhHHHHH Confidence 3333 2334444455656555544 335678889999999988877665532211100 Q ss_pred -----------cc-------------cC---------------cccCHHHHHHHHHHHhhcCCCc----cEEEEcHHHHH Q lcl|Aclame:pro 134 -----------TV-------------EA---------------DITKLDGLQTAIDKFNDEDLEP----MVLFVNPLDAG 170 (274) Q Consensus 134 -----------~~-------------~~---------------~~~~~d~iv~a~~~l~~~~~~~----~~~v~~p~~~~ 170 (274) ++ .. ....-+-|.+++..|...+.+. -+|+|+|..+. T Consensus 176 ka~~~~~~id~a~d~t~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~lf~~~~~~Lp~kyr~~~~~~~~~~~s~~~~~ 255 (360) T protein:vir:99 176 RAEGDAQSVDDAGDSTRIGLEDTATADADSMPSIANTDGSGNPQPVDTSLFNETIQTLDSRYRESDAYSPVLMTSPNQVQ 255 (360) T ss_pred HhhcccchhhccccccccccccccccccccchhhhccccccccccchHHHHHHHHHhcchhhhcCcccceEEEccCchHH Confidence 00 00 0011123567777787765321 27999999877 Q ss_pred HHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEEcCCeEEEEeccCceeee----ccccccCccEE Q lcl|Aclame:pro 171 GLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGAVKLITKRDFFLEK----DRDASRKSTAL 246 (274) Q Consensus 171 ~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~~~~~~~ve~----~r~~~~~~~~i 246 (274) ..+..-. .+.+.+|+..+.++..-.++|+||+..+.+|++.+++.++..+.+...++++++. ++..++..... 
T Consensus 256 ~yr~~L~---~R~t~LGd~~l~g~~~~~~~Gipi~~v~~~pd~~~mlT~p~NLi~g~~~~iri~~~~e~~~~~~~~~~~~ 332 (360) T protein:vir:99 256 SYTMSLT---EREDPLGSAVIFGDSDITPFSYDLVGVNGFPDEYMMFTDPNNLAFGLYEEMELDQSTDTDKVHEQRLHSR 332 (360) T ss_pred HHHHHHh---ccCcccchhheecccccccceeeeEEcCCCCCCceEEeccCceeEEeeeeeEEeecccchhhhhhceeee Confidence 6654321 1234566666666655678999999999999999999999998888777777643 33222221111 Q ss_pred E-EEEEEEEEEEcCcceEEEEe-CCCcc Q lcl|Aclame:pro 247 Y-SDKHYVAYLYDESKVVKITK-GAGDE 272 (274) Q Consensus 247 ~-~~~~~~~~v~~~~avv~l~~-~aa~~ 272 (274) + .+..+++..-+++|+|.++. .-|+| T Consensus 333 ~~~~~~~D~~iee~~Av~~vt~~~~~~~ 360 (360) T protein:vir:99 333 NWLEGQFDFQIKEQQAGVLVTDLETPTA 360 (360) T ss_pred EEEEEEeeEEEEecccEEEEecCCCCCC Confidence 1 23457777788888888774 45566 No 184 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=98.82 E-value=9.6e-10 Score=70.02 Aligned_cols=270 Identities=17% Similarity=0.147 Sum_probs=156.5 Q ss_pred CCccccch---hhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcc-cccCCCccccc---- Q lcl|Aclame:pro 1 MAQGTTKV---SNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQ-VIAEGEKIPVD---- 72 (274) Q Consensus 1 ma~~~T~~---~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~-~~~eg~~~~~~---- 72 (274) =+.+++.. ++.+----|..+.+.-.++.+++.++++.. .++...|+++++.++...+++. ...||.+-.-. 
T Consensus 9 ~~~~~s~~g~~~~~~~t~y~~~k~L~~Aa~~lv~~~fA~~~-piPkn~GkTIk~r~y~pl~~~~~pl~eGv~a~G~~~~~ 87 (401) T protein:vir:95 9 DGQKSSIDGANSDQMQTFFWLKKAIITARKEQYFMPLASVT-NMPKHYGKTIKVYEYVPLLDDRNINDQGIDASGATIVN 87 (401) T ss_pred ccccccccccccceeeehhhHHHHHhhhhhhhhhhhccccc-ccccccCCeEEEEecccccccccchhcCCCcccccccC Confidence 11222222 112222344555555566679999999864 3566779999999988776543 34555422221 Q ss_pred ------------------------------ccccceeEEeehhhhcchhccHHHHhcc-CccHHHHHHHHH-HHHH---H Q lcl|Aclame:pro 73 ------------------------------QIGTSKREAKVRKIGKGTELTDEAVLSG-FGDPQGEAVRQH-GLAI---A 117 (274) Q Consensus 73 ------------------------------~~~~~~~~~~~~~~~~~~~is~e~~~~s-~~d~~~~~~~~~-a~~~---a 117 (274) .++-.++..+++++|.+.++||+..... ...+.+-+.+.+ .-+- - T Consensus 88 g~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~~D~~l~~h~s~ell~g~~~~t~ 167 (401) T protein:vir:95 88 GNLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFDSDDGLMEHLSRELMNGATQITE 167 (401) T ss_pred ccccccccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhhcchHHHHHHHHHHhhhhhhhHH Confidence 2233446667889999999999876644 333444333333 2222 2 Q ss_pred HHHHHHHHHHhc-----cc-c-------ccccCcccCHHHHHHHHHHHhhc-------------C-----C-CccEEEEc Q lcl|Aclame:pro 118 NKVDNDVLEALK-----GA-T-------LTVEADITKLDGLQTAIDKFNDE-------------D-----L-EPMVLFVN 165 (274) Q Consensus 118 ~~~d~~~i~~~~-----~a-~-------~~~~~~~~~~d~iv~a~~~l~~~-------------~-----~-~~~~~v~~ 165 (274) ..+.+.+|+... ++ + ........+++++..+...|..+ . . 
.-++.+|| T Consensus 168 d~i~~dll~ag~~viyAg~ats~At~~~~~~~~t~vt~~~l~rl~~~L~~nRapk~t~~i~~s~~~dTk~i~~s~va~~h 247 (401) T protein:vir:95 168 AVLQKDLLAAAGTVLYAGAATSDATITGEGSTPSVVSYKNLMRLDQILTENRTPTQTTIITGSRMIDTKVIGATRVMYVG 247 (401) T ss_pred HHHHHHHHhhcCeeecCCccceeeeccccccccceechhHHHHHHHHHHhcccccchhhhhhhhccCccccccceEEEEe Confidence 233344443321 01 0 11122346789999888888642 1 1 22357889 Q ss_pred HHHHHHHHhhh----cccccccccccc-ccccccccchhcceeeEEcCCCC--------c-------------------- Q lcl|Aclame:pro 166 PLDAGGLRTSA----SDNFTRPTQLGD-NIIVKGAFGEALGAVIVRSNKLN--------K-------------------- 212 (274) Q Consensus 166 p~~~~~L~~~~----~~~~~~~~~~~~-~~~~~g~~~~i~G~~Vv~s~~~p--------~-------------------- 212 (274) |.....|+... ...|+..-.+++ +.+.+|++|.+.++++++++.+. . T Consensus 248 ~~L~~di~a~~D~~~~~~fi~v~kYa~~~~i~~gEiG~i~~vR~i~~p~~~~w~~ag~~a~~~~~~y~~~~~~~gg~~dV 327 (401) T protein:vir:95 248 SELVPELKAMKDLFGNKAFIETQHYADAGTIMNGEVGSIDKFRIIQVPEMLHWAGAGAQATGANPGYRTSMVSGQEHYDV 327 (401) T ss_pred cCchhHHHHHHHhcCCCCceehhhcCCccccccccccccCceeEEecccceeecCCcccccccccccccccccCCCccee Confidence 96655554221 134655444443 56899999999999999988753 1 Q ss_pred ceEEEEcCCeEEEEe----ccC--ceee--------eccccccCccEEEEEE-EEEEEEEcCcceEEEEeCCCc Q lcl|Aclame:pro 213 GEALLAKKGAVKLIT----KRD--FFLE--------KDRDASRKSTALYSDK-HYVAYLYDESKVVKITKGAGD 271 (274) Q Consensus 213 ~~~~l~~~~a~~~~~----~~~--~~ve--------~~r~~~~~~~~i~~~~-~~~~~v~~~~avv~l~~~aa~ 271 (274) +-.+++++.|++... +.. +.+- .+|+...++....++. 
.|++.+++|+-.+.|...|+= T Consensus 328 yp~lV~G~dAf~~~~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQ~g~vgwK~~~a~~vL~~e~m~~ies~a~~ 401 (401) T protein:vir:95 328 YPMLVVGDDSFTSIGFQTDGKSLKFTVMTKMPGKETADRNDPYGETGFSSIKWYYGILVKRPERLALIKTVAPL 401 (401) T ss_pred eeeeEEccccceecccccCCccccceeEeecCCcCCCCCCCcccceehhhhhhhhhhheeccceeEEEEeecCC Confidence 113567888876532 111 1221 1333334555555554 678899999999999988887 No 185 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=98.60 E-value=1.6e-08 Score=63.31 Aligned_cols=262 Identities=8% Similarity=0.025 Sum_probs=157.0 Q ss_pred CCccccchhhccchH---HHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCC-ccccccccc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPE---VLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGE-KIPVDQIGT 76 (274) Q Consensus 1 ma~~~T~~~~~~iPe---~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~-~~~~~~~~~ 76 (274) |...--..+-+|.-+ .+-+.|.+...+.+..+.++....... -.-.+++++.+...|.+.|++.++ ++|..+... T Consensus 1 ~~~~~a~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~-~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~ 79 (296) T protein:vir:10 1 MGVDKADAAGIWTVKQLTASLNKAYETEYDQNSVVNLFPVSNEIP-GYAKYFEYPVFDGVGIAQIVADYTDDLPLVDALA 79 (296) T ss_pred CcccchhhhHHHHHHHHHHHHHHHHhhhhcccccceecccccCCC-CceeEEEeeeeeccCceeEeCCCccccceeeccc Confidence 553211222233332 344556666666677777776544332 123467888887778899998754 489999999 Q ss_pred ceeEEeehhhhcchhccHHHHhcc---CccHHHHHHHHHHHHHHHHHHHHHHHHhcc--------ccc----cccCcc-- Q lcl|Aclame:pro 77 SKREAKVRKIGKGTELTDEAVLSG---FGDPQGEAVRQHGLAIANKVDNDVLEALKG--------ATL----TVEADI-- 139 (274) Q Consensus 77 ~~~~~~~~~~~~~~~is~e~~~~s---~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~--------a~~----~~~~~~-- 139 (274) +.....++.++..|.++.++++.+ ..++...-....++.+++..|+.++-+... .+. 
+...++ T Consensus 80 ~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~~g~~GLlN~p~v~~~~~~~~W~~ 159 (296) T protein:vir:10 80 TERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTAHGIPSVFDYPNINNVVSGGSWSQ 159 (296) T ss_pred eeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccceeEeecCCCccccccCCccC Confidence 999999999999999988777554 678888888899999999999877632211 110 111111 Q ss_pred --cCHHHHHHHHHHHhhc--C-CCccEEEEcHHHHHHHHhhhccccccccccccccc-cccccchhcceeeEEcCCCC-c Q lcl|Aclame:pro 140 --TKLDGLQTAIDKFNDE--D-LEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNII-VKGAFGEALGAVIVRSNKLN-K 212 (274) Q Consensus 140 --~~~d~iv~a~~~l~~~--~-~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~-~~g~~~~i~G~~Vv~s~~~p-~ 212 (274) .-+++|+.++..+... + ..+..++++|+.+..|...-. ++ .. ..-..+ .+....+|.+.|...+..-. + T Consensus 160 ~t~i~~Di~~~~~~l~~~s~g~~~p~~l~L~p~~~~~L~~~~~-~~--~~-t~l~~ik~~~~~l~i~~~~~l~~a~~~g~ 235 (296) T protein:vir:10 160 PTTAVSDITSLLDIIETSTNGQHRATHLLLPTTARRIMQNLVP-GT--SV-SYGEFFRQNNSGVTVEFVQYLNDYNGTGT 235 (296) T ss_pred HHHHHHHHHHHHHHHHHhhCceecceeEEeCHHHHHHHhhccC-CC--Cc-cHHHHHHHhcCCceEEEeeeeccCCCCcc Confidence 1267888888766542 2 467889999999988753210 10 00 000112 11122234444444332221 2 Q ss_pred ceEEEEc--CCeEEEEeccCceeeeccccccCccEEEEEEEEE-EEEEcCcceEEE---EeC Q lcl|Aclame:pro 213 GEALLAK--KGAVKLITKRDFFLEKDRDASRKSTALYSDKHYV-AYLYDESKVVKI---TKG 268 (274) Q Consensus 213 ~~~~l~~--~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~-~~v~~~~avv~l---~~~ 268 (274) +-.+++. 
+..+.+...++++...- +.......+....+++ +-+.+|.+++++ |++ T Consensus 236 ~~~v~~~~~~~~~~~~v~~~~~~~~~-e~~~l~~~~~~~~~~~Gv~i~~P~ai~~~dGI~~~ 296 (296) T protein:vir:10 236 SAAIAYEKDPNNMAIEIPEATNALPA-QPKDLHFKIPVTSKATGLIVYRPLTMAVMKGITFA 296 (296) T ss_pred eEEEEEEcCCceEEEEcCcceeeecc-cccCceEEEeeEeeEEEEEEECCceeEEEeeeecC Confidence 2234443 55666666667655432 2233445566677764 778899999998 666 No 186 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=98.57 E-value=4.2e-08 Score=61.04 Aligned_cols=253 Identities=13% Similarity=0.047 Sum_probs=147.9 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHh-hhhcccccccccccccCCCEEEEEeecCCCCcccccCCCccccccccccee Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKK-LRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKR 79 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~-~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~~ 79 (274) |..+.... ..+-+.+...+.+.++.. .-+..++... ....+.-+...++..+....+ ..+++...++.... T Consensus 1 m~it~~~l--~~l~~~~~~~~~~~y~~a~~~~~~~a~~~----~sdf~~~~~~~lg~~p~l~e~--~Ge~~~~~l~~~~~ 72 (302) T protein:vir:10 1 MLINKQSL--NAAFVAIKTIFNNAFAAAPTTWQKIAMEV----PSNTSSNDYKWLSTFPKMRRW--IGAKVVKNLKAYKY 72 (302) T ss_pred CcccHHHH--HHHHHHHHHHHHHHHHhhhhhhhceeeec----CCCcceeeceecCCCCCcccc--ccceeeccccccce Confidence 65332111 112223333444333321 1224444321 133444555556655655333 36678888888889 Q ss_pred EEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccc--------------------c--- Q lcl|Aclame:pro 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTV--------------------E--- 136 (274) Q Consensus 80 ~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~--------------------~--- 136 (274) +++.++++..+.|+++++.+........+.+.++++.++..|+.+++.+.++.... 
+ T Consensus 73 ~i~~~~~g~~v~i~R~~i~nDdlg~~~~~~~~~G~aaa~~~~~lv~~~L~~g~~~~~~DG~~fF~~dH~~g~~~~~N~g~ 152 (302) T protein:vir:10 73 VVENEDFEATVEVDRNDIEDDQIGIYSPQAKMAGYSAAQLPDELVYEAVNGAFTKPCFDGQYFIDTDHPVGDASVSNKGT 152 (302) T ss_pred eEEeecccceecccHHhhcccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCCcccCCcceecccccccccccccccc Confidence 99999999999999999999999999999999999999999999998887532110 0 Q ss_pred ------CcccCHHHHHHHHHHHh---hc-----CCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcce Q lcl|Aclame:pro 137 ------ADITKLDGLQTAIDKFN---DE-----DLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGA 202 (274) Q Consensus 137 ------~~~~~~d~iv~a~~~l~---~~-----~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~ 202 (274) ...++.+.+-+++..+. +. +..|+.+|+.|+....-++.-..... . ....+... .-+ T Consensus 153 ~~~~~~~~~l~~~~~~aa~~am~~~k~~~G~~L~i~P~~LiVp~~le~~A~~ll~~~~~-~-~g~~Np~~-------g~~ 223 (302) T protein:vir:10 153 APLSNASQAAAKAGYGAARTAMKKFKDEEGRSLNVSPNVLLVGPALEDVAKMLLTNPKL-A-DNTPNPYV-------GTA 223 (302) T ss_pred hhhhhcccccchHHHHHHHHHHHHHhhhcccccccCCCEEEecchhHHHHHHHhhcccc-C-CCCcceec-------cce Confidence 01233344444444432 21 24778899998876654433111111 0 01112111 125 Q ss_pred eeEEcCCCCcceEE-EEc-CCeEE--E-EeccCceeeeccccccCccEEEEEEEEE------EEEEcCcceEEEEeCCC Q lcl|Aclame:pro 203 VIVRSNKLNKGEAL-LAK-KGAVK--L-ITKRDFFLEKDRDASRKSTALYSDKHYV------AYLYDESKVVKITKGAG 270 (274) Q Consensus 203 ~Vv~s~~~p~~~~~-l~~-~~a~~--~-~~~~~~~ve~~r~~~~~~~~i~~~~~~~------~~v~~~~avv~l~~~aa 270 (274) .+++++.+..++.| |++ +..+. 
| ..+++..+++..+.+.+...++.+..|| ++...++...+-+.++| T Consensus 224 ~~vv~p~L~s~~aWyL~a~~~~i~~~~l~g~~~P~~~~~~~~~~dgv~~k~~~d~Gvd~R~~~G~~~wq~a~~s~g~~~ 302 (302) T protein:vir:10 224 ELVVDGRIESDTAWFLLDTTKPVKPFIFQPRKQPEFVSQVNLDSDDVFNLRKLKFGAEARAAAGYGFWQLAYGSTGTGA 302 (302) T ss_pred EEEEeeccCCCCceEEEecCCccceEEEcCccccEEEeccCCCCCceEEEEEEEEeeeeeeecchhhhhhhhccCccCC Confidence 88889999877764 554 44332 2 3344456666666655555555555555 46667777777666666 No 187 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=98.53 E-value=3.4e-08 Score=61.53 Aligned_cols=266 Identities=12% Similarity=-0.023 Sum_probs=148.5 Q ss_pred CCccccch---hhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCC-CCcccccCCCccccccccc Q lcl|Aclame:pro 1 MAQGTTKV---SNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYS-GDAQVIAEGEKIPVDQIGT 76 (274) Q Consensus 1 ma~~~T~~---~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~-~~a~~~~eg~~~~~~~~~~ 76 (274) ||.++.+- -..-.-+-+++.|...-.....|.+++-. .......++.+..... ++.....||++.+...... T Consensus 1 ma~~~~~~~t~~~~g~~~dl~~~I~~isp~dTPf~S~i~~----~~a~~~~~~W~~d~l~~~~~~~~~EG~da~~~~~~~ 76 (317) T protein:vir:88 1 MATPTNAVSTVEINGKREDLIDIIYNIAPYDTPFMSAIGK----GVATAITHEWQTDELRQPGKNTRVEGEDATIKAGSF 76 (317) T ss_pred CCccccceEeeeeeeeeechhhhheecCCccCcceeeecC----ceecccEEEEEeeecCCccccccccCcccccccccC Confidence 99754433 22333444555554433333333332211 0011223444432221 1222345888776655444 Q ss_pred ceeEEeehh-hhcchhccHHHHhcc---CccHHHHHHHHHHHHHHHHHHHHHHHHhcc-----cc--------------- Q lcl|Aclame:pro 77 SKREAKVRK-IGKGTELTDEAVLSG---FGDPQGEAVRQHGLAIANKVDNDVLEALKG-----AT--------------- 132 (274) Q Consensus 77 ~~~~~~~~~-~~~~~~is~e~~~~s---~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~-----a~--------------- 132 (274) ....=..-+ ..+.+.+|.-...-+ ..+.+.+-.++-...+.|.+|+.+|...+. 
++ T Consensus 77 r~~~~N~tQIf~k~v~VSgTa~av~~~G~~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~~Gl~~~i~t~ 156 (317) T protein:vir:88 77 TTMLNNYCQISDETLQVTGTADRVKKAGRKNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQMANIFAYYKTN 156 (317) T ss_pred CEEeccEEEEEEeEEEEeehhhhhhhcCccchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhhhhHHHHhccC Confidence 332222212 123455665443332 235555555666677889999998854321 10 Q ss_pred ------------------ccccCcccCHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhcccccccccccc----cc Q lcl|Aclame:pro 133 ------------------LTVEADITKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGD----NI 190 (274) Q Consensus 133 ------------------~~~~~~~~~~d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~----~~ 190 (274) +..+...++-+.|.++...+-+++..++.++|+|.....|-+.-..........++ +. T Consensus 157 ~~~~~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~~Gg~~~~i~v~a~~k~~i~~~~~~~~~~i~~~~~~~~~g~ 236 (317) T protein:vir:88 157 GSLGANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIWRNGGQANSIQTSSSIKKAISKNMKGRATEITLDASDNRIAQ 236 (317) T ss_pred ceeccCccccccCCCccccccccccccHHHHHHHHHHHHhcCCCCCEEEeChHHHHHHHHHhcCCceeEEEcccCeEEEE Confidence 00122246788999999999999999999999999887775432111111110011 11 Q ss_pred ccccccchhcceeeEEcCCCCcceEEEEcCCeEEEEeccCceeeecccccc-CccEEEEEEEEEEEEEcCcceEEEEeCC Q lcl|Aclame:pro 191 IVKGAFGEALGAVIVRSNKLNKGEALLAKKGAVKLITKRDFFLEKDRDASR-KSTALYSDKHYVAYLYDESKVVKITKGA 269 (274) Q Consensus 191 ~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~~~~~~~ve~~r~~~~-~~~~i~~~~~~~~~v~~~~avv~l~~~a 269 (274) ..+-....+-=++++.+.++|.++.+++++..+....-+++..|. .++. 
+.+.......++.++.+|.+..+|+.-+ T Consensus 237 ~v~~~~tdfG~v~ii~~r~lp~~~~~~~D~~~~~l~~Lr~~~~e~--laKtGd~~k~~i~~E~tLe~~N~~a~a~i~~l~ 314 (317) T protein:vir:88 237 TVDVYESDFGKYTIRANRWFHENTLFVFDPKMHSLCYLRPFFQHE--LAKTGDSEKRQLLVEYTFRVNNEKSGALIRDVV 314 (317) T ss_pred EEEEEEeCCeEEEEEeCCCCCCCeEEEEcccccceeecccceeec--cCCCcccceeEEEEEEEEEEcCccceeEEEEec Confidence 111111122236899999999999999999988776556654442 2222 3455666778899999999999999888 Q ss_pred Ccc Q lcl|Aclame:pro 270 GDE 272 (274) Q Consensus 270 a~~ 272 (274) ++- T Consensus 315 ~~~ 317 (317) T protein:vir:88 315 AQL 317 (317) T ss_pred ccC Confidence 887 No 188 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=98.53 E-value=2.3e-08 Score=62.46 Aligned_cols=262 Identities=12% Similarity=0.053 Sum_probs=150.3 Q ss_pred CC--ccccchhhccchH---HHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCC-ccccccc Q lcl|Aclame:pro 1 MA--QGTTKVSNLIVPE---VLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGE-KIPVDQI 74 (274) Q Consensus 1 ma--~~~T~~~~~~iPe---~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~-~~~~~~~ 74 (274) |- ...+.....|.-+ .+-+.+++...+.+..+.++....... -.-.+++++.+...|.+.|++.++ ++|..+. 
T Consensus 21 ~~~~~da~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~-~~~~~~~~~~~~~~G~a~~~~d~~~dip~v~~ 99 (319) T protein:vir:10 21 AGVKQDAAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVTTELS-PTDKTFEYMTFDKVGTAQIIADYTDDLPLVDA 99 (319) T ss_pred ccchhhhhhhhhhHHHHHHHHHHHHHHhhhhcceechhhcccccCCC-CceEEEEeeeeccccceeeecCccccccceec Confidence 11 1111112234443 333455566666666677765543322 122367788888888999998755 4899999 Q ss_pred ccceeEEeehhhhcchhccHHHHhcc---CccHHHHHHHHHHHHHHHHHHHHHHHHhc--------cccc-----cc--- Q lcl|Aclame:pro 75 GTSKREAKVRKIGKGTELTDEAVLSG---FGDPQGEAVRQHGLAIANKVDNDVLEALK--------GATL-----TV--- 135 (274) Q Consensus 75 ~~~~~~~~~~~~~~~~~is~e~~~~s---~~d~~~~~~~~~a~~~a~~~d~~~i~~~~--------~a~~-----~~--- 135 (274) ..+.....++.++..|.++..+++.+ ..++...-....++.++++.|+.++-+.. +.+. .. T Consensus 100 ~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~g~~GLlN~p~~~~~~~~~~~ 179 (319) T protein:vir:10 100 LGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQLVNRLVFKGSAPHKIVSVFNHPNITKITSGKWI 179 (319) T ss_pred cceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccceeEEeCCCceeeecCCCC Confidence 99998899999998898888776554 67888888889999999999987763221 1110 00 Q ss_pred cCcccC----HHHHHHHHHHHhhc--C-CCccEEEEcHHHHHHHHhhhccccccccccccccc-cccccchhcceeeEEc Q lcl|Aclame:pro 136 EADITK----LDGLQTAIDKFNDE--D-LEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNII-VKGAFGEALGAVIVRS 207 (274) Q Consensus 136 ~~~~~~----~d~iv~a~~~l~~~--~-~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~-~~g~~~~i~G~~Vv~s 207 (274) ..++-+ +++|..++..+... + ..+..++++|+.|..|..-. .++ . -..-..+ .++...+|.+.|.+.. 
T Consensus 180 ~~~t~t~~~i~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~-~~~--~-~t~l~~lk~~~~~l~I~~~pel~~ 255 (319) T protein:vir:10 180 DVSTMKPETAEAELTQAIETIETITRGQHRATNILIPPSMRKVLAIRM-PET--T-MSYLDYFKSQNSGIEIDSIAELED 255 (319) T ss_pred CccccCHHHHHHHHHHHHHHHHHhcCceeeceEEEecHHHHHhhhccc-CCC--C-eeHHHHHHHhcCCceEEEeeeecc Confidence 111112 35666677666432 2 37889999999999884211 010 0 0001112 2222234455554443 Q ss_pred CCCC-cceEEEE--cCCeEEEEeccCceeeeccccccCccEEEEEEEE-EEEEEcCcceEEEEeC Q lcl|Aclame:pro 208 NKLN-KGEALLA--KKGAVKLITKRDFFLEKDRDASRKSTALYSDKHY-VAYLYDESKVVKITKG 268 (274) Q Consensus 208 ~~~p-~~~~~l~--~~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~-~~~v~~~~avv~l~~~ 268 (274) ..-. ++-.+++ ++..+.+....++++..- +.......+....++ |+-+.+|.+++++..= T Consensus 256 ag~~g~~~~v~y~~~~~~~~~~v~~~~~~~~~-e~~~l~~~~~~~~r~~Gv~i~~P~ai~~~dGI 319 (319) T protein:vir:10 256 IDGAGTKGVLVYEKNPMNMSIEIPEAFNMLPA-QPKDLHFKVPCTSKCTGLTIYRPMTIVLITGV 319 (319) T ss_pred cCCCcceEEEEEecCCceEEEecCcceeeeee-eecCceEEEeeeeeeEEEEEEccceeEeeecC Confidence 2221 1222333 345566666666655432 222223344445555 4667799999998877 No 189 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=98.49 E-value=6.2e-08 Score=60.09 Aligned_cols=260 Identities=14% Similarity=0.113 Sum_probs=150.2 Q ss_pred CCccccchhhccch---HHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCC-ccccccccc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVP---EVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGE-KIPVDQIGT 76 (274) Q Consensus 1 ma~~~T~~~~~~iP---e~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~-~~~~~~~~~ 76 (274) |=+.-+. .|.- +.+-+.+.+.+.+.++.+.++....... -...+++++.....+.+.+++.++ ++|..+... 
T Consensus 1 ~~~~~~g---~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~-~~~~~~~~~~~~~~G~~~~~~~~~~dip~~~~~~ 76 (301) T protein:vir:80 1 MQGKITA---TIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVN-EGAESYSFDVMTRSGAAKIIANGADDLPLVDVDM 76 (301) T ss_pred CCccccc---hhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCC-CceEEEEEeeeccceeEEEecCcccccccccccc Confidence 4433222 2333 2444566677777777777765543322 223457788887778889998755 489999999 Q ss_pred ceeEEeehhhhcchhccHHHHhcc---CccHHHHHHHHHHHHHHHHHHHHHHHHhcc--------ccc--------cc-- Q lcl|Aclame:pro 77 SKREAKVRKIGKGTELTDEAVLSG---FGDPQGEAVRQHGLAIANKVDNDVLEALKG--------ATL--------TV-- 135 (274) Q Consensus 77 ~~~~~~~~~~~~~~~is~e~~~~s---~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~--------a~~--------~~-- 135 (274) +.....+..++..|.++..+++.+ ..++...-....+++++++.|+.++-.... .+. +. T Consensus 77 ~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~~g~~GLlN~p~~~~~~~~~~~~~ 156 (301) T protein:vir:80 77 VRKSVPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKKYAIKGAFEATGIQIDVSPTTGVG 156 (301) T ss_pred eeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeecccccceeeecCCCcccccccCcccc Confidence 998999999998888888766554 678888889999999999999877733211 110 00 Q ss_pred -cCccc--C----HHHHHHHHHHHhhc--C-CCccEEEEcHHHHHHHHhhhcccccccccccccccc-ccccchhcceee Q lcl|Aclame:pro 136 -EADIT--K----LDGLQTAIDKFNDE--D-LEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIV-KGAFGEALGAVI 204 (274) Q Consensus 136 -~~~~~--~----~d~iv~a~~~l~~~--~-~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~-~g~~~~i~G~~V 204 (274) ..++. + +++|.+++..+... + ..+..++++|+.|..|..-...+.. . ...-..+. +....+|.+.|. 
T Consensus 157 ~~~~w~~~t~~ei~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~-~-~tvl~~l~~~~~~~~I~~~p~ 234 (301) T protein:vir:80 157 NVSKWEKKTAEQIIDEIGEAHTKITVLPGYGTASLKLCLPPKQFELINKKRYSNED-S-RSVLKVLQDNAWFSAIVRVPD 234 (301) T ss_pred cccccccCCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHhhhhccccCCC-C-eeHHHHHHHHcCcceEEEcce Confidence 01111 2 56778888777442 2 3678999999999998431100000 0 00011121 222233444444 Q ss_pred EEcCCCC-cceEEEEc--CCeEEEEeccCceeeeccccccC-ccEEEEEEEE-EEEEEcCcceEEEEeC Q lcl|Aclame:pro 205 VRSNKLN-KGEALLAK--KGAVKLITKRDFFLEKDRDASRK-STALYSDKHY-VAYLYDESKVVKITKG 268 (274) Q Consensus 205 v~s~~~p-~~~~~l~~--~~a~~~~~~~~~~ve~~r~~~~~-~~~i~~~~~~-~~~v~~~~avv~l~~~ 268 (274) ..+.... ++-.+++. +..+.+...++++...- ..++ ...+-...|+ |+-+.+|.+++++..= T Consensus 235 L~~~g~~g~~~~v~~~~~~d~~~~~v~~~~~~~~~--e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 235 LAGMGTAGSDSFAVIHDSNETAELIIPMDITRHPE--EYSFPRTKVPFEERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred eccCCCCcccEEEEEecCCcEEEEEecCceeeecc--eecCceeEeeeeeeeEEEEEEccceEEEEecC Confidence 4333221 12234443 33455555555543221 1111 2223334444 5677899999998877 No 190 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=98.32 E-value=1.2e-07 Score=58.53 Aligned_cols=263 Identities=11% Similarity=0.033 Sum_probs=148.0 Q ss_pred CCccc-----------------cchh-hccchH---HHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCC Q lcl|Aclame:pro 1 MAQGT-----------------TKVS-NLIVPE---VLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGD 59 (274) Q Consensus 1 ma~~~-----------------T~~~-~~~iPe---~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~ 59 (274) ||++. +..+ -+|.-+ .+-+.|.+...+.+..+.++....... ..-.+++++.+...|. 
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~d~~~~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~~~~~-~~~et~~~~~~e~~G~ 79 (314) T protein:vir:10 1 MAIKFDAEQAKITTHLEQMGVEKADAAGIWAVSQLTAALNRAYEKEYAENSVVNIFPVTNEIP-GHAKYFEYPEFDGVGI 79 (314) T ss_pred CccchHHHHHHHHHHHHhhcccchhhhHHHHHHHHHHHHHHHhhhhccccccceeeccccCCC-CceeEEEeeeeccccc Confidence 32211 1111 122222 222334444444555566665443322 1123678888888889 Q ss_pred cccccCCC-cccccccccceeEEeehhhhcchhccHHHHhcc---CccHHHHHHHHHHHHHHHHHHHHHHHHhc------ Q lcl|Aclame:pro 60 AQVIAEGE-KIPVDQIGTSKREAKVRKIGKGTELTDEAVLSG---FGDPQGEAVRQHGLAIANKVDNDVLEALK------ 129 (274) Q Consensus 60 a~~~~eg~-~~~~~~~~~~~~~~~~~~~~~~~~is~e~~~~s---~~d~~~~~~~~~a~~~a~~~d~~~i~~~~------ 129 (274) +.|++.++ ++|..+...+.....++.++..+.++..+++.+ ..++...-....++.+++.+|+.++-+.. T Consensus 80 a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~g~~G 159 (314) T protein:vir:10 80 AQIIADYSDDLPLVDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDKLVWSGSAPHGIVS 159 (314) T ss_pred eeeeCCcccccceeecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeeccccccee Confidence 99998765 489999999999999999999999988776654 67888888888889999888887763221 Q ss_pred --ccc----ccccCcccC----HHHHHHHHHHHhhc--C-CCccEEEEcHHHHHHHHhhhcccccccccccccccccccc Q lcl|Aclame:pro 130 --GAT----LTVEADITK----LDGLQTAIDKFNDE--D-LEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAF 196 (274) Q Consensus 130 --~a~----~~~~~~~~~----~d~iv~a~~~l~~~--~-~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~ 196 (274) +.+ ...+.++.+ +++|+.++..+... + ..+..++++|..+..|..-. +.. .....+-...++.. 
T Consensus 160 LlN~p~v~~~~~~~~WaT~~ei~~Di~~~~~~l~~~s~g~~~p~~l~Lpp~~~~~L~~~~--~~~-~~tvl~~l~~n~~~ 236 (314) T protein:vir:10 160 VFDQPNINNVVATPNWSVPQNAIDDVTAMIDAVESSTQGLHHVTDILLPASARRVMQGLV--PQT-NLSYGELFTRNNPG 236 (314) T ss_pred EeecCCCccccCCCCcccHHHHHHHHHHHHHHHHHhcCccccceeEEecHHHHHhhcccc--cCC-CccHHHHHHHhCCC Confidence 111 112233434 45667777777542 2 36788999999887663211 100 00001111122233 Q ss_pred chhcceeeEEcCCCCcceEE-EE--cCCeEEEEeccCceeeeccccccCccEEEEEEEE-EEEEEcCcceE---EEEeC Q lcl|Aclame:pro 197 GEALGAVIVRSNKLNKGEAL-LA--KKGAVKLITKRDFFLEKDRDASRKSTALYSDKHY-VAYLYDESKVV---KITKG 268 (274) Q Consensus 197 ~~i~G~~Vv~s~~~p~~~~~-l~--~~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~-~~~v~~~~avv---~l~~~ 268 (274) -+|.+.|...+......+.+ ++ ++..+.+....+++...- +.......+....++ |+-+.+|.+++ -||.+ T Consensus 237 l~I~~~~el~~ag~~g~~~~v~y~~~~~~~~~~vp~~~~~l~~-e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~dGI~~~ 314 (314) T protein:vir:10 237 LTIRFLQFLDNYDGAGGKAALAFEKSPLNMSIEIPEVTNVLPA-QPKDLHFRYPVTSKATGLIVYRPLTMAVIKGITFA 314 (314) T ss_pred cEEEEcccccccCCCcceEEEEEecCCcEEEEecCccceeecc-eecCceEEEcceeeeEEEEEECcceeEeeeeeecC Confidence 34555665554443333323 33 334455555555554321 222223344445555 56778999999 55666 No 191 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=98.07 E-value=7.2e-07 Score=54.25 Aligned_cols=264 Identities=13% Similarity=0.062 Sum_probs=148.9 Q ss_pred CCccccchhh---ccch---HHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCC-Ccccccc Q lcl|Aclame:pro 1 MAQGTTKVSN---LIVP---EVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEG-EKIPVDQ 73 (274) Q Consensus 1 ma~~~T~~~~---~~iP---e~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg-~~~~~~~ 73 (274) |.- +|..++ .|.- +.+.+.|.+...+.+..+.++....... 
-.-.+++++.+...|.+.|++.+ .++|..+ T Consensus 26 ~~~-~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~~~~-~~~~~~t~~~~~~~G~a~~~~d~~~dip~vd 103 (329) T protein:vir:79 26 LRG-AKNDASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPVTSELS-DTDKTFEYQTFDKVGHAKIIADYTDDLSTVD 103 (329) T ss_pred ccc-ceeccchhhHHHHHHHHHHHHHHHhhhhcccchhhhcccccCCC-CceeEEEeeeeecceeeeeecCcccccceee Confidence 221 222222 3443 2344556666666666777765543322 12236788888888899999875 5789889 Q ss_pred cccceeEEeehhhhcchhccHHHHhcc---CccHHHHHHHHHHHHHHHHHHHHHHHHhcc--------cc--cc---cc- Q lcl|Aclame:pro 74 IGTSKREAKVRKIGKGTELTDEAVLSG---FGDPQGEAVRQHGLAIANKVDNDVLEALKG--------AT--LT---VE- 136 (274) Q Consensus 74 ~~~~~~~~~~~~~~~~~~is~e~~~~s---~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~--------a~--~~---~~- 136 (274) ...+.....++.++..+.++..+++.+ ..++...-....++.++++.|+.++-.... .+ .+ +. T Consensus 104 ~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~g~~GLlN~p~v~~~~~~~~ 183 (329) T protein:vir:79 104 ALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQLVNHLVFKGSKPHKIISVFEHPNLTTINSAGW 183 (329) T ss_pred cccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEeecccccceeeecCCCccccccCCC Confidence 988888888888888888887766554 678888888888999999998876632111 00 01 11 Q ss_pred --Cccc--C----HHHHHHHHHHHhhc--C-CCccEEEEcHHHHHHHHhhhccccccccccccccc-cccccchhcceee Q lcl|Aclame:pro 137 --ADIT--K----LDGLQTAIDKFNDE--D-LEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNII-VKGAFGEALGAVI 204 (274) Q Consensus 137 --~~~~--~----~d~iv~a~~~l~~~--~-~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~-~~g~~~~i~G~~V 204 (274) ..+. + +++|.+++..+... + ..+..++++|+.+..|..-. .++ ..... ..+ .++...+|.+.|. 
T Consensus 184 ~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~~~p~~L~Lpp~~~~~L~~~~-~~~--~~tvl-~~lk~~~~~l~I~~~~e 259 (329) T protein:vir:79 184 NNAAGTGKKPETAQDELEQAIEKIETLTNGQHRANMILIPPSMRKVLMVRM-PET--TMSYL-DYFKQQNGGITIESISE 259 (329) T ss_pred CCccccccCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHHhhccc-CCC--CccHH-HHHHHhCCCcEEEEccc Confidence 1111 1 46777777777543 2 36789999999988874211 010 00000 111 1222234555555 Q ss_pred EEcCCCC-cceEEEEc--CCeEEEEeccCceeeeccccccCccEEEEEEEE-EEEEEcCcceEEEEeCCCc Q lcl|Aclame:pro 205 VRSNKLN-KGEALLAK--KGAVKLITKRDFFLEKDRDASRKSTALYSDKHY-VAYLYDESKVVKITKGAGD 271 (274) Q Consensus 205 v~s~~~p-~~~~~l~~--~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~-~~~v~~~~avv~l~~~aa~ 271 (274) ..+.... ++-.+++. +.-+.+....+.+...- +...-...+....++ |+-+.+|.+++.+..=--- T Consensus 260 l~~ag~~g~~~~v~y~~~~~~~~~~vp~~~~~l~~-q~~~~~~~v~~~~r~~Gv~i~~P~ai~~~dGI~~~ 329 (329) T protein:vir:79 260 LEDIDGAGTKAALVYEKDPMNMSIEIPEAFNMLTA-QPKDLHFKVPCTSKCTGLTIYRPLTLVLIKGLVVG 329 (329) T ss_pred ccccCCCCceEEEEEecCCceEEEecCcceeeeec-eecCceEEEceeeeEEEEEEECcceeeeeeeeeeC Confidence 4433221 22234443 44455555666554431 222222334444554 4667799999987642222 No 192 >protein:vir:105610 Length: 430 # NCBI annotation: virion structural protein # Family: family:all:974 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164307;genbank:gi:56692923;genbank:GeneID:3197221 Probab=97.97 E-value=2.1e-06 Score=51.68 Aligned_cols=270 Identities=16% Similarity=0.117 Sum_probs=145.5 Q ss_pred CCccccch--hhccchHHHHHHHHHHH-HHhhhhccc------------------------ccccccccccCCCEEEEEe Q lcl|Aclame:pro 1 MAQGTTKV--SNLIVPEVLAPMMQAEL-DKKLRFAQF------------------------ADIDSTLVGQPGDTLTFPA 53 (274) Q Consensus 1 ma~~~T~~--~~~~iPe~~~~~v~~~~-~~~~~~~~l------------------------~~~~~~~~~~~G~~v~ip~ 53 (274) |....|.. ++-.-...|+..+-... 
+++..+.-+ .++..++....|++|+|+- T Consensus 1 ~~~a~T~~~~~~p~a~~~ws~~l~~~~~k~~~~~~kl~G~~~~~~~~~~~~~~~~ts~~~pI~r~~dL~K~~GD~Vtf~L 80 (430) T protein:vir:10 1 MTASKTTMRYGDPNAMIQQAAGLFALCQGRNSTLNRLTGKMPSGTSDAEKKTKGQSSLELPIVQAQDLGRNKGDEVRFHF 80 (430) T ss_pred CcceeeecccCChhHHHHHHHHHHHHHhhhhhhHHHhhccccccccchhhhccCCCCCCccEEEeccCCCCCccEEEEeE Confidence 76544433 44555678887664333 322221111 3333456566799999998 Q ss_pred ecCCCCcccccCCCcc--cccccccceeEEeehhhhcchhccHH-HHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 54 FTYSGDAQVIAEGEKI--PVDQIGTSKREAKVRKIGKGTELTDE-AVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKG 130 (274) Q Consensus 54 ~~~~~~a~~~~eg~~~--~~~~~~~~~~~~~~~~~~~~~~is~e-~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~ 130 (274) ..... -..+..+..+ -++.++|.+..+.+....+.+..-.. ..+++..|+.+..++.|+..|++..|+.++-.+.+ T Consensus 81 ~~~L~-g~gv~Gd~~lEGnee~L~~~~d~l~IDq~R~~V~~gg~msqQRt~~dlR~~ar~~L~~w~~~~~Dq~~~v~laG 159 (430) T protein:vir:10 81 VQPAN-AFPIMGSEYAEGKGTGLKIGSDQLRVNQARFPVDLGDVMSQIRNPYDLRRLGRPKAKWFMDAYLDQSMLVHLAG 159 (430) T ss_pred eeccc-cCceecCceeeccccceEEEeeEEEEeeeccccccCCchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 87664 3333333333 45688999999999888777766543 45677899999999999999999999988765543 Q ss_pred cc-----------------------c---c--------------------------ccCcccCHHHHHHHHHHHhhcC-- Q lcl|Aclame:pro 131 AT-----------------------L---T--------------------------VEADITKLDGLQTAIDKFNDED-- 156 (274) Q Consensus 131 a~-----------------------~---~--------------------------~~~~~~~~d~iv~a~~~l~~~~-- 156 (274) +. + + ..++.++++-|-+|...++... 
T Consensus 160 arg~~~~~~~~~~~~~~~~~~~~~~N~v~aPt~nrh~~~~G~at~~~~~~~~~~sl~stD~~s~~~id~a~~~a~~~~~~ 239 (430) T protein:vir:10 160 ARGNHYNKEWCLPLETHPKLADMLVNRVKAPTKNRHFVASADAITGVAPNAGEYNITTADVLDVDVVDSIATYMDQIELP 239 (430) T ss_pred hhcccccccccccccCCcchhhhhccccCCCCCceeEeecccccccccccccccchhhhcccCHHHHHHHHHHHHhhCCC Confidence 20 0 0 0112244555556666654421 Q ss_pred --------CC------ccEEEEcHHHHHHHHhhhccc-c---c--cccccccccccccccchhcceeeEEcCCC------ Q lcl|Aclame:pro 157 --------LE------PMVLFVNPLDAGGLRTSASDN-F---T--RPTQLGDNIIVKGAFGEALGAVIVRSNKL------ 210 (274) Q Consensus 157 --------~~------~~~~v~~p~~~~~L~~~~~~~-~---~--~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~------ 210 (274) .+ ..+++|||..+..|+++.... + . .......+.+..|..|.|.|+.|.....+ T Consensus 240 i~Pv~v~gd~~~g~~~~yV~~~~p~q~~~Lr~dt~~~~wq~~~~a~a~~g~~nPlF~G~~gm~ngvii~~~~~virf~~g 319 (430) T protein:vir:10 240 PPPVKFEGDEAAEDSPIRVLLCSPAQYNSFAKQEKFRSWQAAALARASNAKQHPIFRVDAGLWSNTLIIKMPKPIRFYAG 319 (430) T ss_pred CcceEeecccccCCccEEEEEechHHHHHHhhCcchHHHHHHHHHhhcccccCCceecceeeecCeEEecCCceeeecCC Confidence 11 267899999999999876431 1 0 11222246788999999999998865322 Q ss_pred -------------------C--------cceEEEEcCCeEEEEecc----Cce---eeeccccccCccEEEEEEEEEEEE Q lcl|Aclame:pro 211 -------------------N--------KGEALLAKKGAVKLITKR----DFF---LEKDRDASRKSTALYSDKHYVAYL 256 (274) Q Consensus 211 -------------------p--------~~~~~l~~~~a~~~~~~~----~~~---ve~~r~~~~~~~~i~~~~~~~~~v 256 (274) | ..-+++++..|+...... +.. .|...|-.. 
...+.+...+|.+- T Consensus 320 ~~~~~~a~~~~~~~~~~~~~a~~~~~~~v~RalllGaQA~~~A~g~~~~~g~~f~w~Ee~~D~g~-~~~i~~~~i~G~kK 398 (430) T protein:vir:10 320 DTIKYCAAYNSEAESSAVVSDSFGNQYAVDRALLLGGQALAQAWAASEHSGMPFFWSEKDMDHGD-KLELLIGAILGCSK 398 (430) T ss_pred CccccccCCcccccccccccccccccccchhhhhccchhheeeeeccCCCCcceeeeeeccccCc-hhhhhhhHHhccce Confidence 0 011234555544433222 111 121111111 11111111111111 Q ss_pred E------------cCcceEEEEeCCCcccC Q lcl|Aclame:pro 257 Y------------DESKVVKITKGAGDEVM 274 (274) Q Consensus 257 ~------------~~~avv~l~~~aa~~~~ 274 (274) + +.-+++.|- .|=.++ T Consensus 399 ~rF~~~~~~~~~~~DfGvi~id--taa~~~ 426 (430) T protein:vir:10 399 IRFAVEATNGLEYTDHGVMAID--TAVKII 426 (430) T ss_pred eeecCCCCCCceeeeeEEEEhh--hhhhhh Confidence 1 112222211 111122 No 193 >protein:vir:104439 Length: 404 # NCBI annotation: putative virion structural protein # Family: family:all:974 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794063;genbank:gi:116222008;genbank:GeneID:4397504 Probab=97.88 E-value=1.9e-06 Score=51.99 Aligned_cols=262 Identities=15% Similarity=0.069 Sum_probs=138.9 Q ss_pred CCccccchhhccch--HHHHHHHHHHHHHhhhh--------cccccccccccccCCCEEEEEeecCCCCcccccCCCcc- Q lcl|Aclame:pro 1 MAQGTTKVSNLIVP--EVLAPMMQAELDKKLRF--------AQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKI- 69 (274) Q Consensus 1 ma~~~T~~~~~~iP--e~~~~~v~~~~~~~~~~--------~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~- 69 (274) +|..+|.... --| ..|+..+.........+ ....+...++....|++|+|+-..... 
-..+.+++.+ T Consensus 14 ~~~lft~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~-g~gv~Gd~~lE 91 (404) T protein:vir:10 14 QVALFTAANR-NRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS-KRPTMGDERVE 91 (404) T ss_pred HHHHHHHHhc-CChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecc-cCCcccCceee Confidence 2222221111 011 12222211111111000 122333345666779999998877663 4445444443 Q ss_pred -cccccccceeEEeehhhhcchhccH-HHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc-------------- Q lcl|Aclame:pro 70 -PVDQIGTSKREAKVRKIGKGTELTD-EAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL-------------- 133 (274) Q Consensus 70 -~~~~~~~~~~~~~~~~~~~~~~is~-e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~-------------- 133 (274) -++.+++.+..+.+......+.... ...+++..|+.+..++.|+..|++..|+.++-.+.++.. T Consensus 92 Gnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~n~~~~vp~~~~ 171 (404) T protein:vir:10 92 GRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEH 171 (404) T ss_pred ccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeecccc Confidence 4568999999999988877765533 445678899999999999999999999999865543221 Q ss_pred ------------cc------------------cCcccCHHHHHHHHHHHhh--c--------CCC------ccEEEEcHH Q lcl|Aclame:pro 134 ------------TV------------------EADITKLDGLQTAIDKFND--E--------DLE------PMVLFVNPL 167 (274) Q Consensus 134 ------------~~------------------~~~~~~~d~iv~a~~~l~~--~--------~~~------~~~~v~~p~ 167 (274) +. +++.++++-|-++...+.. . +.+ ..+++|||. 
T Consensus 172 ~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~~~~yV~~~~p~ 251 (404) T protein:vir:10 172 PEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPR 251 (404) T ss_pred ccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccCccceEEEEechH Confidence 00 1122334444445444432 1 111 367899999 Q ss_pred HHHHHHhhhcc-cccc-------ccccccccccccccchhcceeeEEcCCCCc---------------------c----- Q lcl|Aclame:pro 168 DAGGLRTSASD-NFTR-------PTQLGDNIIVKGAFGEALGAVIVRSNKLNK---------------------G----- 213 (274) Q Consensus 168 ~~~~L~~~~~~-~~~~-------~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~---------------------~----- 213 (274) .+..|+++... +|.. ......+.+..|..|.|.|+.|...+.+|- + T Consensus 252 q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~ 331 (404) T protein:vir:10 252 QWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATN 331 (404) T ss_pred HHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCcccccccccccccc Confidence 99999987531 1111 111234678889999999999987666541 0 Q ss_pred --eEEEEcCCeEEEEeccC----ce-eeeccccccCccEEEEEEEE---------------EEEEEcCcceEEE Q lcl|Aclame:pro 214 --EALLAKKGAVKLITKRD----FF-LEKDRDASRKSTALYSDKHY---------------VAYLYDESKVVKI 265 (274) Q Consensus 214 --~~~l~~~~a~~~~~~~~----~~-ve~~r~~~~~~~~i~~~~~~---------------~~~v~~~~avv~l 265 (274) -+++++..|++.+.+.. .. .|...|-.. 
...+.+...+ |++|+-=...++| T Consensus 332 v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~-~~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idta~~~ 404 (404) T protein:vir:10 332 IDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDN-RTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) T ss_pred chhheeecceeEEEEeeccCCCCceeEeeccccCc-hhhhhhHHHhhhhhccccCCCCceeeEEEEEecccccC Confidence 12666766654432221 11 121111111 1111111111 3333322334444 No 194 >protein:vir:10123 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859253;genbank:gi:32171009;genbank:GeneID:2653345 Probab=97.88 E-value=1.9e-06 Score=51.99 Aligned_cols=262 Identities=15% Similarity=0.069 Sum_probs=138.9 Q ss_pred CCccccchhhccch--HHHHHHHHHHHHHhhhh--------cccccccccccccCCCEEEEEeecCCCCcccccCCCcc- Q lcl|Aclame:pro 1 MAQGTTKVSNLIVP--EVLAPMMQAELDKKLRF--------AQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKI- 69 (274) Q Consensus 1 ma~~~T~~~~~~iP--e~~~~~v~~~~~~~~~~--------~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~- 69 (274) +|..+|.... --| ..|+..+.........+ ....+...++....|++|+|+-..... -..+.+++.+ T Consensus 14 ~~~lft~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~-g~gv~Gd~~lE 91 (404) T protein:vir:10 14 QVALFTAANR-NRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS-KRPTMGDERVE 91 (404) T ss_pred HHHHHHHHhc-CChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecc-cCCcccCceee Confidence 2222221111 011 12222211111111000 122333345666779999998877663 4445444443 Q ss_pred -cccccccceeEEeehhhhcchhccH-HHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc-------------- Q lcl|Aclame:pro 70 -PVDQIGTSKREAKVRKIGKGTELTD-EAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL-------------- 133 (274) Q Consensus 70 -~~~~~~~~~~~~~~~~~~~~~~is~-e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~-------------- 133 (274) -++.+++.+..+.+......+.... ...+++..|+.+..++.|+..|++..|+.++-.+.++.. 
T Consensus 92 Gnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~n~~~~vp~~~~ 171 (404) T protein:vir:10 92 GRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEH 171 (404) T ss_pred ccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeecccc Confidence 4568999999999988877765533 445678899999999999999999999999865543221 Q ss_pred ------------cc------------------cCcccCHHHHHHHHHHHhh--c--------CCC------ccEEEEcHH Q lcl|Aclame:pro 134 ------------TV------------------EADITKLDGLQTAIDKFND--E--------DLE------PMVLFVNPL 167 (274) Q Consensus 134 ------------~~------------------~~~~~~~d~iv~a~~~l~~--~--------~~~------~~~~v~~p~ 167 (274) +. +++.++++-|-++...+.. . +.+ ..+++|||. T Consensus 172 ~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~~~~yV~~~~p~ 251 (404) T protein:vir:10 172 PEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPR 251 (404) T ss_pred ccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccCccceEEEEechH Confidence 00 1122334444445444432 1 111 367899999 Q ss_pred HHHHHHhhhcc-cccc-------ccccccccccccccchhcceeeEEcCCCCc---------------------c----- Q lcl|Aclame:pro 168 DAGGLRTSASD-NFTR-------PTQLGDNIIVKGAFGEALGAVIVRSNKLNK---------------------G----- 213 (274) Q Consensus 168 ~~~~L~~~~~~-~~~~-------~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~---------------------~----- 213 (274) .+..|+++... +|.. 
......+.+..|..|.|.|+.|...+.+|- + T Consensus 252 q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~ 331 (404) T protein:vir:10 252 QWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATN 331 (404) T ss_pred HHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCcccccccccccccc Confidence 99999987531 1111 111234678889999999999987666541 0 Q ss_pred --eEEEEcCCeEEEEeccC----ce-eeeccccccCccEEEEEEEE---------------EEEEEcCcceEEE Q lcl|Aclame:pro 214 --EALLAKKGAVKLITKRD----FF-LEKDRDASRKSTALYSDKHY---------------VAYLYDESKVVKI 265 (274) Q Consensus 214 --~~~l~~~~a~~~~~~~~----~~-ve~~r~~~~~~~~i~~~~~~---------------~~~v~~~~avv~l 265 (274) -+++++..|++.+.+.. .. .|...|-.. ...+.+...+ |++|+-=...++| T Consensus 332 v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~-~~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idta~~~ 404 (404) T protein:vir:10 332 IDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDN-RTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) T ss_pred chhheeecceeEEEEeeccCCCCceeEeeccccCc-hhhhhhHHHhhhhhccccCCCCceeeEEEEEecccccC Confidence 12666766654432221 11 121111111 1111111111 3333322334444 No 195 >protein:vir:819 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050552;genbank:gi:9633449;genbank:GeneID:1262254 Probab=97.88 E-value=1.9e-06 Score=51.99 Aligned_cols=262 Identities=15% Similarity=0.069 Sum_probs=138.9 Q ss_pred CCccccchhhccch--HHHHHHHHHHHHHhhhh--------cccccccccccccCCCEEEEEeecCCCCcccccCCCcc- Q lcl|Aclame:pro 1 MAQGTTKVSNLIVP--EVLAPMMQAELDKKLRF--------AQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKI- 69 (274) Q Consensus 1 ma~~~T~~~~~~iP--e~~~~~v~~~~~~~~~~--------~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~- 69 (274) +|..+|.... --| ..|+..+.........+ ....+...++....|++|+|+-..... 
-..+.+++.+ T Consensus 14 ~~~lft~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~-g~gv~Gd~~lE 91 (404) T protein:vir:81 14 QVALFTAANR-NRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS-KRPTMGDERVE 91 (404) T ss_pred HHHHHHHHhc-CChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecc-cCCcccCceee Confidence 2222221111 011 12222211111111000 122333345666779999998877663 4445444443 Q ss_pred -cccccccceeEEeehhhhcchhccH-HHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc-------------- Q lcl|Aclame:pro 70 -PVDQIGTSKREAKVRKIGKGTELTD-EAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL-------------- 133 (274) Q Consensus 70 -~~~~~~~~~~~~~~~~~~~~~~is~-e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~-------------- 133 (274) -++.+++.+..+.+......+.... ...+++..|+.+..++.|+..|++..|+.++-.+.++.. T Consensus 92 Gnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~n~~~~vp~~~~ 171 (404) T protein:vir:81 92 GRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEH 171 (404) T ss_pred ccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeecccc Confidence 4568999999999988877765533 445678899999999999999999999999865543221 Q ss_pred ------------cc------------------cCcccCHHHHHHHHHHHhh--c--------CCC------ccEEEEcHH Q lcl|Aclame:pro 134 ------------TV------------------EADITKLDGLQTAIDKFND--E--------DLE------PMVLFVNPL 167 (274) Q Consensus 134 ------------~~------------------~~~~~~~d~iv~a~~~l~~--~--------~~~------~~~~v~~p~ 167 (274) +. +++.++++-|-++...+.. . +.+ ..+++|||. 
T Consensus 172 ~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~~~~yV~~~~p~ 251 (404) T protein:vir:81 172 PEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPR 251 (404) T ss_pred ccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccCccceEEEEechH Confidence 00 1122334444445444432 1 111 367899999 Q ss_pred HHHHHHhhhcc-cccc-------ccccccccccccccchhcceeeEEcCCCCc---------------------c----- Q lcl|Aclame:pro 168 DAGGLRTSASD-NFTR-------PTQLGDNIIVKGAFGEALGAVIVRSNKLNK---------------------G----- 213 (274) Q Consensus 168 ~~~~L~~~~~~-~~~~-------~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~---------------------~----- 213 (274) .+..|+++... +|.. ......+.+..|..|.|.|+.|...+.+|- + T Consensus 252 q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~ 331 (404) T protein:vir:81 252 QWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATN 331 (404) T ss_pred HHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCcccccccccccccc Confidence 99999987531 1111 111234678889999999999987666541 0 Q ss_pred --eEEEEcCCeEEEEeccC----ce-eeeccccccCccEEEEEEEE---------------EEEEEcCcceEEE Q lcl|Aclame:pro 214 --EALLAKKGAVKLITKRD----FF-LEKDRDASRKSTALYSDKHY---------------VAYLYDESKVVKI 265 (274) Q Consensus 214 --~~~l~~~~a~~~~~~~~----~~-ve~~r~~~~~~~~i~~~~~~---------------~~~v~~~~avv~l 265 (274) -+++++..|++.+.+.. .. .|...|-.. 
...+.+...+ |++|+-=...++| T Consensus 332 v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~-~~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idta~~~ 404 (404) T protein:vir:81 332 IDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDN-RTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) T ss_pred chhheeecceeEEEEeeccCCCCceeEeeccccCc-hhhhhhHHHhhhhhccccCCCCceeeEEEEEecccccC Confidence 12666766654432221 11 121111111 1111111111 3333322334444 No 196 >protein:vir:3298 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049514;genbank:gi:9632520;genbank:GeneID:1262006 Probab=97.88 E-value=1.9e-06 Score=51.99 Aligned_cols=262 Identities=15% Similarity=0.069 Sum_probs=138.9 Q ss_pred CCccccchhhccch--HHHHHHHHHHHHHhhhh--------cccccccccccccCCCEEEEEeecCCCCcccccCCCcc- Q lcl|Aclame:pro 1 MAQGTTKVSNLIVP--EVLAPMMQAELDKKLRF--------AQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKI- 69 (274) Q Consensus 1 ma~~~T~~~~~~iP--e~~~~~v~~~~~~~~~~--------~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~- 69 (274) +|..+|.... --| ..|+..+.........+ ....+...++....|++|+|+-..... -..+.+++.+ T Consensus 14 ~~~lft~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~-g~gv~Gd~~lE 91 (404) T protein:vir:32 14 QVALFTAANR-NRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS-KRPTMGDERVE 91 (404) T ss_pred HHHHHHHHhc-CChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecc-cCCcccCceee Confidence 2222221111 011 12222211111111000 122333345666779999998877663 4445444443 Q ss_pred -cccccccceeEEeehhhhcchhccH-HHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc-------------- Q lcl|Aclame:pro 70 -PVDQIGTSKREAKVRKIGKGTELTD-EAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL-------------- 133 (274) Q Consensus 70 -~~~~~~~~~~~~~~~~~~~~~~is~-e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~-------------- 133 (274) -++.+++.+..+.+......+.... ...+++..|+.+..++.|+..|++..|+.++-.+.++.. 
T Consensus 92 Gnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~n~~~~vp~~~~ 171 (404) T protein:vir:32 92 GRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEH 171 (404) T ss_pred ccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeecccc Confidence 4568999999999988877765533 445678899999999999999999999999865543221 Q ss_pred ------------cc------------------cCcccCHHHHHHHHHHHhh--c--------CCC------ccEEEEcHH Q lcl|Aclame:pro 134 ------------TV------------------EADITKLDGLQTAIDKFND--E--------DLE------PMVLFVNPL 167 (274) Q Consensus 134 ------------~~------------------~~~~~~~d~iv~a~~~l~~--~--------~~~------~~~~v~~p~ 167 (274) +. +++.++++-|-++...+.. . +.+ ..+++|||. T Consensus 172 ~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~~~~yV~~~~p~ 251 (404) T protein:vir:32 172 PEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPR 251 (404) T ss_pred ccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccCccceEEEEechH Confidence 00 1122334444445444432 1 111 367899999 Q ss_pred HHHHHHhhhcc-cccc-------ccccccccccccccchhcceeeEEcCCCCc---------------------c----- Q lcl|Aclame:pro 168 DAGGLRTSASD-NFTR-------PTQLGDNIIVKGAFGEALGAVIVRSNKLNK---------------------G----- 213 (274) Q Consensus 168 ~~~~L~~~~~~-~~~~-------~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~---------------------~----- 213 (274) .+..|+++... +|.. 
......+.+..|..|.|.|+.|...+.+|- + T Consensus 252 q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~ 331 (404) T protein:vir:32 252 QWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATN 331 (404) T ss_pred HHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCcccccccccccccc Confidence 99999987531 1111 111234678889999999999987666541 0 Q ss_pred --eEEEEcCCeEEEEeccC----ce-eeeccccccCccEEEEEEEE---------------EEEEEcCcceEEE Q lcl|Aclame:pro 214 --EALLAKKGAVKLITKRD----FF-LEKDRDASRKSTALYSDKHY---------------VAYLYDESKVVKI 265 (274) Q Consensus 214 --~~~l~~~~a~~~~~~~~----~~-ve~~r~~~~~~~~i~~~~~~---------------~~~v~~~~avv~l 265 (274) -+++++..|++.+.+.. .. .|...|-.. ...+.+...+ |++|+-=...++| T Consensus 332 v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~-~~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idta~~~ 404 (404) T protein:vir:32 332 IDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDN-RTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) T ss_pred chhheeecceeEEEEeeccCCCCceeEeeccccCc-hhhhhhHHHhhhhhccccCCCCceeeEEEEEecccccC Confidence 12666766654432221 11 121111111 1111111111 3333322334444 No 197 >protein:vir:93858 Length: 400 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764266;genbank:gi:115315579;genbank:GeneID:5141552 Probab=97.87 E-value=3e-06 Score=50.90 Aligned_cols=257 Identities=12% Similarity=0.035 Sum_probs=151.6 Q ss_pred CC-cccc-chhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccccccce Q lcl|Aclame:pro 1 MA-QGTT-KVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSK 78 (274) Q Consensus 1 ma-~~~T-~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~~~~~ 78 (274) .+ ...| +-.+-+.|+.+...|...+...-.+.+...+.. .++--+..+.-. ...+.-+.-|.+..++.+++.. 
T Consensus 117 l~E~gvt~td~n~iLP~~il~aIq~al~~~~~~~~f~~v~n----~p~l~V~~~~dt-~~qa~gHk~G~~K~eq~~tl~~ 191 (400) T protein:vir:93 117 LAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTN----VGALLVSRSFDS-ANEAQVHKDGQTKTEQAATLTI 191 (400) T ss_pred hhhcccccCCchhhcchHHHHHHHHhhhccCCcccceeeec----CCceeeecchhh-hcccceeccCCcccceeeeeee Confidence 11 2333 334447899888888888877666666443321 111112222222 2234447788999999999999 Q ss_pred eEEeehhhhcchhccHHHHhc--cCccHHHHHHHHHHHHHHHH-HHHHHHHH-hccccc-----------------cccC Q lcl|Aclame:pro 79 REAKVRKIGKGTELTDEAVLS--GFGDPQGEAVRQHGLAIANK-VDNDVLEA-LKGATL-----------------TVEA 137 (274) Q Consensus 79 ~~~~~~~~~~~~~is~e~~~~--s~~d~~~~~~~~~a~~~a~~-~d~~~i~~-~~~a~~-----------------~~~~ 137 (274) .++.+.-.++..++.+...+. +...+..++.++|..++-.+ ++.+++-+ ..++.. +-.+ T Consensus 192 rtL~P~~VYk~~~la~~~~~~~~tygaL~nYVm~EL~q~vI~k~Ve~Aii~GdG~Ngf~~~dk~t~Ik~I~~dt~kt~~a 271 (400) T protein:vir:93 192 DTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSA 271 (400) T ss_pred eccCHHHHHHHhhhhhhhhhccccHHHHHHHHHHHHHHHHHHHHhhhheeecccccccCCCcchhhhhhhhhhhhhhhhc Confidence 999888777766664333222 22446889999999999865 58777643 222210 1123 Q ss_pred cccCHHHHHHH-HHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcce-eeEEcCCCCcce- Q lcl|Aclame:pro 138 DITKLDGLQTA-IDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGA-VIVRSNKLNKGE- 214 (274) Q Consensus 138 ~~~~~d~iv~a-~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~-~Vv~s~~~p~~~- 214 (274) +.+.+.++.+- .............++++|+.++.|+...+..... .......+-++.+-+|+ ++++...+|... 
T Consensus 272 ~~~~~qdl~E~~~d~~~~~aad~~~Iv~s~d~~A~L~~lk~a~~~a---~f~~~n~d~~IA~~fGv~~Lv~~Tr~~~~kp 348 (400) T protein:vir:93 272 GKTPFADAIEEAVDFVRPTAGRRYLIVKAEDRKALLDELRQATANA---NVRIKNDDTEIASEVGVDEIIVYTGSKALKP 348 (400) T ss_pred CCccHHHHHHHHHhhhhhccCCceeEEeccchHHHHHHhcCCccee---eeeeccccchhhhhcccceeeeeccCCCCCc Confidence 33445444432 2322333344556888999999888765322111 11111233445667776 555566665433 Q ss_pred -EEEEcCCeEEEEeccCceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeC Q lcl|Aclame:pro 215 -ALLAKKGAVKLITKRDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKG 268 (274) Q Consensus 215 -~~l~~~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~ 268 (274) ..+-....+ ...++.-..+++-.+.+..+.+....++.+--|++...++.+ T Consensus 349 ~V~VDek~~i---~~~~~~t~~sf~~~tNs~~ilvetlv~Gsi~~~N~~ay~~v~ 400 (400) T protein:vir:93 349 TVLVDQKYHI---DMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred eeeeehhhhc---cccCceeccceeeeeccceEEeeeeeccceecccceeeEeeC Confidence 223222222 334444445677778888999999999999999999999998 No 198 >protein:vir:95318 Length: 328 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512264;genbank:gi:89152431;genbank:GeneID:3952987 Probab=97.76 E-value=2.1e-06 Score=51.71 Aligned_cols=212 Identities=12% Similarity=0.032 Sum_probs=130.3 Q ss_pred CCccccch------hhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCccccccc Q lcl|Aclame:pro 1 MAQGTTKV------SNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQI 74 (274) Q Consensus 1 ma~~~T~~------~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~~ 74 (274) |+.--+.. +..+-|+.....|+|.+.++..+-..+. -.+...|..-...+....++++|..-|+.+++++. 
T Consensus 1 m~~~~~~~~TL~e~Akr~~~d~~~~~VIE~l~~~n~IL~~lp---f~e~n~gt~~~~~v~~~LP~~~fR~lN~g~~~s~~ 77 (328) T protein:vir:95 1 MAVKGLTALTLADWGKRVDPNGKVDKIIELLGQTNPILQDMP---FVEGNLPTGHRTTIRSGLPSATWRLLNYGVQPSKS 77 (328) T ss_pred CCccccccccHHHHHhhhCcchhHHHHHHHHhccchhHhhcc---eeecccCCcceeeEeeccCCceeeecCCccCcccc Confidence 77542222 3336687788888888877544322221 12222233344556677899999999999999999 Q ss_pred ccceeEEeehhhhcchhccHHHHhccCccHHHH---HHHHHHHHHHHHHHHHHHHH-----------h----c------- Q lcl|Aclame:pro 75 GTSKREAKVRKIGKGTELTDEAVLSGFGDPQGE---AVRQHGLAIANKVDNDVLEA-----------L----K------- 129 (274) Q Consensus 75 ~~~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~---~~~~~a~~~a~~~d~~~i~~-----------~----~------- 129 (274) ++.+++..+.-++..+.+++...... .+..++ -.+...+++.+++...+|.. | . T Consensus 78 tt~q~t~~l~ilgg~~eVDr~la~~~-Gn~~~~ra~q~~~~~ka~~~~~~~~~iyGdsa~~p~~F~GL~~R~~~~s~~~a 156 (328) T protein:vir:95 78 TTVQVTDSVGMLETYAEVDKSLADLN-GNTAEFRLSEDRAFIEAMNQQMAQTLFYGDSSVNPQQFMGLSSRYSSLSAGNA 156 (328) T ss_pred eeEEEEEEEEEEecceeechHHHhhc-CCHHHHHHHHHHHHHHHHHHHHHHHHhcCCccCChhhhcchhhhcCccccccc Confidence 99999999999999999998766554 344433 44557778888887777621 0 0 Q ss_pred ---------ccccc----------------------c------------------------------------------- Q lcl|Aclame:pro 130 ---------GATLT----------------------V------------------------------------------- 135 (274) Q Consensus 130 ---------~a~~~----------------------~------------------------------------------- 135 (274) ++..+ + T Consensus 157 ~qiidaGgtg~~~TSi~~v~~g~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~g~~y~~y~~~~~w~~Gl~i~d~r~vvrI 236 (328) T protein:vir:95 157 QNIIDAGGTGTDNTSIWLVVWGENTVHGIFPKGKKAGIQMEDKGQVTLEDANGGKYEGYRTHYKWDNGLALRDWRYVVRI 236 (328) T ss_pred cceeecccCCCCceEEEEEEEcCCeEEEecccccccCceeeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEE Confidence 00000 0 Q ss_pred ---cC----ccc----CHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceee Q lcl|Aclame:pro 136 ---EA----DIT----KLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVI 204 (274) Q Consensus 136 
---~~----~~~----~~d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~V 204 (274) +. ... ..+.+++|...+........+|.||......|++....... .+..-.......+-.|.|+|| T Consensus 237 ~NId~~~l~~~~~~~~l~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n--~~~~~~~~~g~~~t~~~gipi 314 (328) T protein:vir:95 237 ANIDVSNLSEPSSAANIAKLMVKALHRIPNRGMGRPVFYMNRTVGQALDLQSLEKTS--LAISVKETEGEWWTSFRGVPI 314 (328) T ss_pred ecCcccccccccChhhHHHHHHHHHHHhccCCCCcceeehhHHHHHHHHHHHhcCcc--eeeeeeccCCcceeEECCeEE Confidence 00 000 12344666666665556678899999999999876432111 111111123334446899999 Q ss_pred EEcCCCCcceEEEE Q lcl|Aclame:pro 205 VRSNKLNKGEALLA 218 (274) Q Consensus 205 v~s~~~p~~~~~l~ 218 (274) -..+.+-.++.-+. T Consensus 315 r~~dai~~tE~~vv 328 (328) T protein:vir:95 315 RETDALLETEARVV 328 (328) T ss_pred EEEeeeecCccccC Confidence 98888765554443 No 199 >protein:vir:2770 Length: 318 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612887;genbank:gi:20065804;genbank:GeneID:935710 Probab=97.72 E-value=1.3e-05 Score=47.36 Aligned_cols=224 Identities=15% Similarity=0.083 Sum_probs=127.6 Q ss_pred CCccccchhhccc------------h--HHHHHHHHHHHHHhhhhcc--------cccccccccccCCCEEEEEeecCCC Q lcl|Aclame:pro 1 MAQGTTKVSNLIV------------P--EVLAPMMQAELDKKLRFAQ--------FADIDSTLVGQPGDTLTFPAFTYSG 58 (274) Q Consensus 1 ma~~~T~~~~~~i------------P--e~~~~~v~~~~~~~~~~~~--------l~~~~~~~~~~~G~~v~ip~~~~~~ 58 (274) |++-+....+..+ | ..|+..+.....+...+.. ..++..++....|++|+|+-..... 
T Consensus 1 mt~~~~~~~~~~~~~~~ft~~~~~~~~vk~ws~~l~~~~~~~~~~~~~~g~~~~~~I~r~~dL~K~~GD~Vtf~L~~~L~ 80 (318) T protein:vir:27 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (318) T ss_pred CCccCCCChHHHHHHHHHHHHhcCChHHHHHHHhhhhHHHhhhhhhcccCCCCCceEEEeccCCCCCccEEEEeEeeccc Confidence 6544433332111 1 2455554332222222211 2233345666679999998887664 Q ss_pred CcccccCCCcc--cccccccceeEEeehhhhcchhccH-HHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc-- Q lcl|Aclame:pro 59 DAQVIAEGEKI--PVDQIGTSKREAKVRKIGKGTELTD-EAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL-- 133 (274) Q Consensus 59 ~a~~~~eg~~~--~~~~~~~~~~~~~~~~~~~~~~is~-e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~-- 133 (274) -..+-.++.+ -++.+++.+..+.+......+..-. ...+++..|+.+..++.++..|++..|+.++-.+.++.. T Consensus 81 -g~gv~Gd~~lEGnee~L~~~~d~l~IDq~r~~V~~gg~msqqRt~~dlR~~ar~~L~~w~~~~~Dq~~~v~laGarg~~ 159 (318) T protein:vir:27 81 -KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDF 159 (318) T ss_pred -cCccccCceeeccccceEEEeeEEEEeeeccccccccchhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Confidence 3333333333 4567889999999988777765543 445677889999999999999999999999866643221 Q ss_pred ------------------------cc------------------cCcccCHHHHHHHHHHHhh--c--------CCC--- Q lcl|Aclame:pro 134 ------------------------TV------------------EADITKLDGLQTAIDKFND--E--------DLE--- 158 (274) Q Consensus 134 ------------------------~~------------------~~~~~~~d~iv~a~~~l~~--~--------~~~--- 158 (274) +. +++.++++-|-++...+.. . 
+.+ T Consensus 160 ~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~g~at~~~~l~stD~~s~~lid~~~~~~~~~a~pi~PV~v~g~~~~~ 239 (318) T protein:vir:27 160 VADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHG 239 (318) T ss_pred ccccceEecccCccchhhhhcccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceeeccccccC Confidence 00 1122334444444444432 1 111 Q ss_pred ---ccEEEEcHHHHHHHHhhhcc-cccc-------ccccccccccccccchhcceeeEEcCCCCcceEEEEcCCe-EEEE Q lcl|Aclame:pro 159 ---PMVLFVNPLDAGGLRTSASD-NFTR-------PTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGA-VKLI 226 (274) Q Consensus 159 ---~~~~v~~p~~~~~L~~~~~~-~~~~-------~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~a-~~~~ 226 (274) ..+++|||..+..|+++... +|.. ......+.+..|..|.|.|+=+.....+|- =|..+. +.+ T Consensus 240 ~~~~yV~~~~p~q~~~Lrtdt~~~~w~d~q~~A~~r~~g~knPLF~G~~gm~ngvil~~~~~vpI----rf~~G~~v~~- 314 (318) T protein:vir:27 240 EDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPI----RFYQGQRFWY- 314 (318) T ss_pred CcceEEEEechHHHHHHhhcCCCHHHHHHHHHHHhcccccCCCceecceeeecCEEEeecCCccE----EEcCCCeeee- Confidence 25789999999999987421 1211 111234668889999999998887777762 111111 111 Q ss_pred eccC Q lcl|Aclame:pro 227 TKRD 230 (274) Q Consensus 227 ~~~~ 230 (274) .+.. T Consensus 315 ~~~~ 318 (318) T protein:vir:27 315 QRIT 318 (318) T ss_pred eecC Confidence 1100 No 200 >protein:vir:5942 Length: 523 # NCBI annotation: similar to major head protein # Family: family:all:364 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835728;genbank:gi:30044131 Probab=97.45 E-value=5.5e-06 Score=49.41 Aligned_cols=265 Identities=11% Similarity=-0.015 Sum_probs=132.2 Q ss_pred CCccccchhhccchHH---HHHHH---HHHHHHhhh--hcccc-cccccccccCCCE--EEEE------eecCCCCccc- Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEV---LAPMM---QAELDKKLR--FAQFA-DIDSTLVGQPGDT--LTFP------AFTYSGDAQV- 62 (274) Q Consensus 1 ma~~~T~~~~~~iPe~---~~~~v---~~~~~~~~~--~~~l~-~~~~~~~~~~G~~--v~ip------~~~~~~~a~~- 62 (274) |+..+ ..+.+.. +.... .+.+..+.. .++-. .......+..+.. ..+- .....++... 
T Consensus 193 itg~t----ga~fa~s~~~an~astAss~Al~gEA~t~~sTd~at~~~Gtt~t~~~~~lyt~~~g~~t~~~~~~~~~~~~ 268 (523) T protein:vir:59 193 ASGDP----ENTVAYPLPRYNRIVGAVGSALYARLFFVTGSDFATVAGGTPSTQDLDLVYYIDARNDFEDQSTDPDYPDP 268 (523) T ss_pred ccccc----cccccchhhccccccccccccccccccccccccccccCCCcccccccccccccccccchhhcccccccccc Confidence 11100 0010000 00000 000000000 00000 0000000000000 0000 0000011110 Q ss_pred ccCCCcccccccccceeEEeehhhhcchhccHHHHhc-----cCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccc-- Q lcl|Aclame:pro 63 IAEGEKIPVDQIGTSKREAKVRKIGKGTELTDEAVLS-----GFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTV-- 135 (274) Q Consensus 63 ~~eg~~~~~~~~~~~~~~~~~~~~~~~~~is~e~~~~-----s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~-- 135 (274) -.++..+++-..+.+.++++.+.++-.-++|-|+.+| +..|.+..+.+-|+..|...|++++|..+....... T Consensus 269 ~~~~~~~~eM~FsIeK~tVtAkSRaLKAeYT~ELAQDLKAiH~GLDAE~ELanILStEImlEINR~ii~~~~~~a~~~~~ 348 (523) T protein:vir:59 269 GFQSLDIPEINLELRSRPVATKTRKLRAAWTPEAMQDLAAYHKGVDLENEIVTLMSQYIAREIDLEILSTIMAHARRTDN 348 (523) T ss_pred ccccccccceeeEEEeEEEeeecccccccccHHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhhhheeeee Confidence 1345567888888899999999888888888887654 368999999999999999999999998775443211 Q ss_pred ----cCccc--------------CHHHHHHHHHHH----h--------h-cCCCccEEEEcHHHHHHHHhhhcccccccc Q lcl|Aclame:pro 136 ----EADIT--------------KLDGLQTAIDKF----N--------D-EDLEPMVLFVNPLDAGGLRTSASDNFTRPT 184 (274) Q Consensus 136 ----~~~~~--------------~~d~iv~a~~~l----~--------~-~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~ 184 (274) +.+.+ .|...++..+.| . + .....+++|++|++.+.|....-.+..... 
T Consensus 349 ~~~~~~g~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~n~i~~~t~~~~~~~~~~s~~v~~~l~~~~~~~~~~~~ 428 (523) T protein:vir:59 349 YGFWSEVVGEYYDETSGNFVAGNFYGSKQEWLATLMIELNKVSNRIQQKTAVAGANFLVTSPQVAALLESMPGFTPGNDN 428 (523) T ss_pred ccccccceeeecccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEchhHHHHHHhccccccCCcc Confidence 11111 122223332222 1 1 123678999999999998654322111000 Q ss_pred c-cccccccccccchh-cceeeEEcCCCCcceEEEEcCCeE-------EEEeccCceeee-ccccccCccEEEEEEEEEE Q lcl|Aclame:pro 185 Q-LGDNIIVKGAFGEA-LGAVIVRSNKLNKGEALLAKKGAV-------KLITKRDFFLEK-DRDASRKSTALYSDKHYVA 254 (274) Q Consensus 185 ~-~~~~~~~~g~~~~i-~G~~Vv~s~~~p~~~~~l~~~~a~-------~~~~~~~~~ve~-~r~~~~~~~~i~~~~~~~~ 254 (274) . ...+... .|.+ .|++|+++++.|.+-..+.-++.. -|+--.++.... -.||+.++-.+-...||+. T Consensus 429 ~~~~~~~~~---~g~l~~~~~vy~d~~~~~dy~~~g~k~~~~~~~~~~~y~Py~~l~~~~~~~dp~s~qp~~~~~tRY~l 505 (523) T protein:vir:59 429 RDGGTGIFY---VGMVQGRYRLYKNIYQNQPVIIMGNQDLNTPWQTGAVYAPYVPLLFTPTIVDPVNFSYRRGLMTRYAL 505 (523) T ss_pred cccccccee---EEEecCceEEEecCCCCcceEEEEecccCCcccccceecccchhhcccccccCCcccceeeeeeehhh Confidence 0 0001111 2344 357999999999766655444432 233333332222 2378899999999999999 Q ss_pred EEEcCcceEEEEeCCCcc Q lcl|Aclame:pro 255 YLYDESKVVKITKGAGDE 272 (274) Q Consensus 255 ~v~~~~avv~l~~~aa~~ 272 (274) .|.+|...-.|-.+--.- T Consensus 506 ~v~nP~~~~~~~~~~~~~ 523 (523) T protein:vir:59 506 EVVRPEFYGLLYVKLLQP 523 (523) T ss_pred eecchhHhhhhhhhhcCC Confidence 998888654433222111 No 201 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=97.45 E-value=1.7e-05 Score=46.67 Aligned_cols=256 Identities=10% Similarity=0.035 Sum_probs=135.8 Q ss_pred CC---c----cccchhhccchHHHHHHHH----HHHHHhhhhcccccccccccccC-CCEEEEEeecCCCCcccccCCCc Q lcl|Aclame:pro 1 MA---Q----GTTKVSNLIVPEVLAPMMQ----AELDKKLRFAQFADIDSTLVGQP-GDTLTFPAFTYSGDAQVIAEGEK 68 (274) Q Consensus 1 
ma---~----~~T~~~~~~iPe~~~~~v~----~~~~~~~~~~~l~~~~~~~~~~~-G~~v~ip~~~~~~~a~~~~eg~~ 68 (274) || + ..++..+..||-....+|. +...+.+....++..... +.- -.+++++.+...|.+.+++.+++ T Consensus 35 ~a~d~~~~~~~~~~~~~~~i~a~~~~~i~~~vy~~~~~~~~~~~l~pv~t~--g~w~~~t~~y~~~e~~G~a~~ygd~ad 112 (339) T protein:vir:94 35 YAMDAVNLTPTLQTTANAGIPAWMTTFVDRRVIDIQLAPMAAAKIFPEVKK--GDWTTTYGVFIIAEPVGQVATYSDWSA 112 (339) T ss_pred hhccccccccccccccccchhhhhhhhhchhheeecccccchhhhcccccC--CCCcccEEEEeeeecccceEEcccccC Confidence 33 1 2445566677765555554 444555666666655332 112 24789999999999999999999 Q ss_pred ccccccccceeEEeehhhhcchhccHHHHhc---cCccHHHHHHHHHHHHHHHHHHHHHHHHhc--------cc-----c Q lcl|Aclame:pro 69 IPVDQIGTSKREAKVRKIGKGTELTDEAVLS---GFGDPQGEAVRQHGLAIANKVDNDVLEALK--------GA-----T 132 (274) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~is~e~~~~---s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~--------~a-----~ 132 (274) .|..+...+...-++......+.++.+++.. ...++...-.+...+++.+.+|+..+-+-. +- . T Consensus 113 ~Pl~~~~v~~~~~~v~~~~~g~~y~~~E~~~A~~~g~~l~~~Ka~aA~~al~~~~N~i~~~Gd~~~~~~GLlN~P~l~~~ 192 (339) T protein:vir:94 113 NGMSKANVNFESRQNYRYQTWTEYGDLEMATYGEAGIDYVARQEISASLVMAKFANSSYLLGVAGIANYGLMNDPSLPAP 192 (339) T ss_pred CCcccccceeeEEeEEEEEEEEeecHHHHHHHHhhCCChHHHHHHHHHHHHHHhhceEEeeeecccceEEEEeCCCcccc Confidence 9888765554444444444444555554433 357788888888888888888875552111 10 1 Q ss_pred ccccCcc--cC----HHHHHHHHHHHhhcC------CCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhc Q lcl|Aclame:pro 133 LTVEADI--TK----LDGLQTAIDKFNDED------LEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEAL 200 (274) Q Consensus 133 ~~~~~~~--~~----~d~iv~a~~~l~~~~------~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~ 200 (274) .+++..+ -+ +++|..++..+.... ..+..+++.|..+..|-+-+. +. ...-..+.. 
.+- T Consensus 193 v~~s~~Wa~kT~~eI~~Di~~~~~~l~~~s~g~~~~~~~~~L~LP~~~~~~L~~~n~--~~---~Tvl~~lk~----n~p 263 (339) T protein:vir:94 193 VAATVNWATAAPEDIANDVVAMVGRLISQSGGLITGQERMVMALAPSALNNVNRTNN--FG---LSAGAKIAQ----TYP 263 (339) T ss_pred ccCCCCcccCCHHHHHHHHHHHHHHHHHhcCCeeeeccCcEEEecHHHHHhcccCCc--CC---ccHHHHHHH----hcC Confidence 1111211 12 456667776664321 245689999999998743211 10 000011211 122 Q ss_pred ceeeEEcCCCC---cceEEEEc-----CCeEEEEeccCceeeeccccccCccEEEEEEE-EEEEEEcCcceEEEEeC Q lcl|Aclame:pro 201 GAVIVRSNKLN---KGEALLAK-----KGAVKLITKRDFFLEKDRDASRKSTALYSDKH-YVAYLYDESKVVKITKG 268 (274) Q Consensus 201 G~~Vv~s~~~p---~~~~~l~~-----~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~-~~~~v~~~~avv~l~~~ 268 (274) +++++..+.+. .+...++. +.........+.+...- +.......+-...| .|+-+.+|.++++++.= T Consensus 264 nl~i~~~~el~~a~g~~~~~~~~~~~~~~~~~~~~p~~~~~lpv-q~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 339 (339) T protein:vir:94 264 NIQFVAVPEFDTASGRLVQLWVPEVNGQPTGEVAFAEKLRSHSI-ERYSTTTRQKHSGATFGAVIYQPWAVTQELGV 339 (339) T ss_pred CcEEEEccccccCCCceEEEEEEeccCCcceEEEcchhhhcccc-EEcCceEEecceeeeeeEEEEccceeeeeecC Confidence 34455433332 12222221 12222333333222111 11122233444445 46667799999998876 No 202 >protein:vir:78148 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:4955 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294802;genbank:gi:149882823;genbank:GeneID:5309176 Probab=97.40 E-value=1.3e-06 Score=52.90 Aligned_cols=108 Identities=12% Similarity=0.100 Sum_probs=76.7 Q ss_pred EEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEEcCCeEE-----------EEe--cc Q lcl|Aclame:pro 163 FVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGAVK-----------LIT--KR 229 (274) Q Consensus 163 v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~-----------~~~--~~ 229 (274) +++-..|+++..+............+..+..+-.-+++|+.++.++++|.+++++++...++ |+. .. 
T Consensus 1 vvsdlqfA~~~g~~v~~~aLpRE~aNp~ltG~lpV~~~GltWl~tpnlpg~~a~vlDst~lGgmaDE~l~~Pgya~~~~~ 80 (123) T protein:vir:78 1 MLSGAQFAKLIGILVDDKALPREQANIVLTGSLPVSAYGLTWVTSRHITGTDPWLFDVEQLGGMADEKLLSPEFAPAGNT 80 (123) T ss_pred CcchhhHHHHhcchhcccccccccCCceEecCcceeeeceeeeecCCCCCCccceeehhhhccccccccCCCcccCCCCc Confidence 66666788777654433333333333334433334699999999999999999988865443 322 22 Q ss_pred Cceeeeccccc--cCccEEEEEEEEEEEEEcCcceEEEEeCCC Q lcl|Aclame:pro 230 DFFLEKDRDAS--RKSTALYSDKHYVAYLYDESKVVKITKGAG 270 (274) Q Consensus 230 ~~~ve~~r~~~--~~~~~i~~~~~~~~~v~~~~avv~l~~~aa 270 (274) ++.+.+.|..+ .++..+++|+..-.-|+.|.|.++|+...- T Consensus 81 Gvevkt~Red~~~nD~yriRaRRvTvpiv~EP~Agv~ltg~g~ 123 (123) T protein:vir:78 81 GVEASTERAHQGVKDGYLVRGRRNTVAVVTEPMAGVRLTGTGL 123 (123) T ss_pred ceeEEeeccccCCCCceEEeeeecceeEEecCccceEEeeecC Confidence 35566777776 788899999999999999999999998877 No 203 >protein:vir:2736 Length: 348 # NCBI annotation: putative structural protein # Family: family:all:1083 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695109;genbank:gi:23455878;genbank:GeneID:955608 Probab=97.18 E-value=0.00014 Score=41.68 Aligned_cols=263 Identities=12% Similarity=0.087 Sum_probs=121.2 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHh--hhhcccccccccccccCCCEEEEEee-cCCC-CcccccCCCccccc-ccc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKK--LRFAQFADIDSTLVGQPGDTLTFPAF-TYSG-DAQVIAEGEKIPVD-QIG 75 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~--~~~~~l~~~~~~~~~~~G~~v~ip~~-~~~~-~a~~~~eg~~~~~~-~~~ 75 (274) ||+ ..++|.|..+..++.+..... .....++.... ..+..+.+... ...+ .+..++++++.+.. .-. 
T Consensus 1 M~~----i~d~f~~~~l~~~v~~~~~~~~~~l~~~~Fp~~~----~~~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~r~~ 72 (348) T protein:vir:27 1 MGL----IYDKVTASNIAGYFNALQENVSSTLGESIFPARK----QLGTKLSYIKGASGQSVALKAAAFDTNVTIRDRVS 72 (348) T ss_pred Ccc----hhhhcCHHHHHHHHHhccchhhhhhHhhcCCCcc----ccceeEEEEeeccCceeEeeeecCCCCcceecccc Confidence 885 468899999999887654332 11122222110 11111111111 1111 12334444443332 122 Q ss_pred cceeEEeehhhhcchhccHHHHhc-----c--CccHHHHH-------HHHHHHHHHHHHHHHHHHHhcccccc------- Q lcl|Aclame:pro 76 TSKREAKVRKIGKGTELTDEAVLS-----G--FGDPQGEA-------VRQHGLAIANKVDNDVLEALKGATLT------- 134 (274) Q Consensus 76 ~~~~~~~~~~~~~~~~is~e~~~~-----s--~~d~~~~~-------~~~~a~~~a~~~d~~~i~~~~~a~~~------- 134 (274) +......+-.++....++...++. . .....+.+ ...+.+.+.+.+|..+...+.++... T Consensus 73 ~~~~~~~~p~i~~~~~i~~~d~~~~~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~~al~~Gki~i~~~~~~ 152 (348) T protein:vir:27 73 AEMHDEQMPFFKEAMLVKENDRQQLNLVKDSGNAVLVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFTSDGVN 152 (348) T ss_pred eeeeeeecCccccccccCHHHHHHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEEecCCee Confidence 333334443444333443332111 1 11122222 23344455555666655555432110 Q ss_pred --------------c-----cCcccCHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhcccc-ccccccccccc--- Q lcl|Aclame:pro 135 --------------V-----EADITKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNF-TRPTQLGDNII--- 191 (274) Q Consensus 135 --------------~-----~~~~~~~d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~-~~~~~~~~~~~--- 191 (274) . 
.++...+++|.++...+.+.+..+..++|+++.+..|++++...- ..........+ T Consensus 153 ~~vdfg~~~~~~~t~~~~W~~~~adp~~di~~~~~~~~~~G~~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~~~i~~~ 232 (348) T protein:vir:27 153 KDIDYGVKPDHKKQVSKSWAEPGATPLADLEDAIETARELGLNPERAVMNAKTFGLIRKAASTVKVIKPLAGDGSAVTKA 232 (348) T ss_pred EEEeecCCcccceeeeeccCCCCCCHHHHHHHHHHHHHhcCCcccEEEECHHHHHHHhcCHHHHHHhcccCccccccCHH Confidence 0 111223567777777787778889999999999999987654321 11111111111 Q ss_pred -cccccchhcceeeEEc------------CCCCcceEEEEcCCeEEEEeccCc-----------------------eeee Q lcl|Aclame:pro 192 -VKGAFGEALGAVIVRS------------NKLNKGEALLAKKGAVKLITKRDF-----------------------FLEK 235 (274) Q Consensus 192 -~~g~~~~i~G~~Vv~s------------~~~p~~~~~l~~~~a~~~~~~~~~-----------------------~ve~ 235 (274) .....+++.|++|++- +.+|.+..+++..+..|...--.+ -+.+ T Consensus 233 ~~~~~~~~~~g~~i~~yd~~y~d~~G~~~~~~p~~~vvl~~~~~~G~~~yG~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 312 (348) T protein:vir:27 233 ELENYIADNFGVSIVLENGTYRNDKGEVSKFYPDGHLTLIPNGPLGNTVFGTTPEESDLFADNTVNAEVEIVDNGIAVTT 312 (348) T ss_pred HHHHHHHhhcCceEEEEeeEEEcCCCcCcccccCCeEEEEcCCcceeEEeccCcchhhhhhccccccceeeeCCeeEEEe Confidence 1122345667777651 336777777777665542211000 0000 Q ss_pred ccccccCccEEEEEEEEEEEEEcCcceEEEEeCCCc Q lcl|Aclame:pro 236 DRDASRKSTALYSDKHYVAYLYDESKVVKITKGAGD 271 (274) Q Consensus 236 ~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~ 271 (274) +.+.+--...+.+-.+.=..+.+|++++++++-+|- T Consensus 313 ~~~~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl~~~ 348 (348) T protein:vir:27 313 TKTTDPVNVQTKVSMVALPSFERLDDVYMLTVIPAV 348 (348) T ss_pred eecCCCceEEEEEeeeeeccccCCCcEEEEEEecCC Confidence 011010112223333333445588889988888877 No 204 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=97.16 E-value=2.1e-05 Score=46.21 Aligned_cols=256 Identities=8% Similarity=-0.004 Sum_probs=138.0 Q ss_pred 
CCc----cccchhhccchHHHHHHHHHHH----HHhhhhcccccccccccccCC-CEEEEEeecCCCCcccccCCCcccc Q lcl|Aclame:pro 1 MAQ----GTTKVSNLIVPEVLAPMMQAEL----DKKLRFAQFADIDSTLVGQPG-DTLTFPAFTYSGDAQVIAEGEKIPV 71 (274) Q Consensus 1 ma~----~~T~~~~~~iPe~~~~~v~~~~----~~~~~~~~l~~~~~~~~~~~G-~~v~ip~~~~~~~a~~~~eg~~~~~ 71 (274) +|+ ..++.++.-+|..+..+|...+ ..-.....|+-... .+.-. .++.++.+...|.+..++.++++|. T Consensus 34 da~d~~~~~~~~~~~~i~~~l~~~i~p~~~~~~~~p~~a~~l~pv~t--~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~ 111 (336) T protein:vir:10 34 DAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESK--KGDWTTLVAAFITAEPTTKVATYGDYSSDGD 111 (336) T ss_pred hhhhccCccccCCCchhHHHHHhhcccceeeehhhhhhhhhhccccc--cCCccceeEEEeeeeceeeEEEeeccCCCce Confidence 132 3455567788888888774322 22233333433222 11111 3678888888899999999999999 Q ss_pred cccccceeEEeehhhhcchhccHHHHhc---cCccHHHHHHHHHHHHHHHHHHHHHHHHhc--------cc-----cccc Q lcl|Aclame:pro 72 DQIGTSKREAKVRKIGKGTELTDEAVLS---GFGDPQGEAVRQHGLAIANKVDNDVLEALK--------GA-----TLTV 135 (274) Q Consensus 72 ~~~~~~~~~~~~~~~~~~~~is~e~~~~---s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~--------~a-----~~~~ 135 (274) .+...+..+-+++.++..+.++.+++.. ...++...-++...+++.+++++..+-... +. ..+. T Consensus 112 ~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~~~~yGllN~P~l~a~~t~ 191 (336) T protein:vir:10 112 SGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITA 191 (336) T ss_pred eecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCcEEEEeccccceEEEEeCCCCcccccc Confidence 9988888888888888888888765544 356777877788888888888765441111 11 1111 Q ss_pred cC---cccC----HHHHHHHHHHHhhcC------CCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcce Q lcl|Aclame:pro 136 EA---DITK----LDGLQTAIDKFNDED------LEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGA 202 (274) Q Consensus 136 ~~---~~~~----~d~iv~a~~~l~~~~------~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~ 202 (274) .+ +..+ +++|..++..|.... ..+..++|.|..+..|-+-+ ++. ......+.. 
.+-++ T Consensus 192 ~t~~~~~~t~eei~~Di~~~~~~l~~qs~G~i~~~~~~tL~LP~~~~~~Ls~~n--~~g---~Tvl~~lk~----n~Pnl 262 (336) T protein:vir:10 192 TTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKTN--QYG---LAAAAKLKD----IFPKL 262 (336) T ss_pred CCCcccccCHHHHHHHHHHHHHHHHHhcCCeecccCcceEEecHHHHHhccCCC--ccC---ccHHHHHHH----hcCcc Confidence 11 1122 566777777775422 24788999999888774221 110 000011111 12344 Q ss_pred eeEEcCCCC---cceEEEEcCC-----eEEEEeccCceeeeccccccCccEEEEEEEE-EEEEEcCcceEEEEeC Q lcl|Aclame:pro 203 VIVRSNKLN---KGEALLAKKG-----AVKLITKRDFFLEKDRDASRKSTALYSDKHY-VAYLYDESKVVKITKG 268 (274) Q Consensus 203 ~Vv~s~~~p---~~~~~l~~~~-----a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~-~~~v~~~~avv~l~~~ 268 (274) .++..+.+. .+..+++-+. ........+.+...- +.......+-...|+ |+-+.+|.++++++.= T Consensus 263 ~i~t~pEl~~a~G~~~~l~~~~~~~~~t~~~~~p~~~~~l~v-q~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:10 263 EFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSI-ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred EEEEccccccCCCceEEEEEEecCCCcceeeecchhhhccce-eecCceeEeccccceeeeeeeccchheeeecC Confidence 555444442 1222332111 111111111111000 011112233334444 5566799999998776 No 205 >protein:vir:98525 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996579;genbank:gi:45569510;genbank:GeneID:2767853 Probab=97.09 E-value=6.8e-05 Score=43.42 Aligned_cols=213 Identities=9% Similarity=0.041 Sum_probs=120.1 Q ss_pred CCcc---ccchhhc---cchH-HHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccc Q lcl|Aclame:pro 1 MAQG---TTKVSNL---IVPE-VLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQ 73 (274) Q Consensus 1 ma~~---~T~~~~~---~iPe-~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~ 73 (274) |++. .=+..++ +-|+ .+...|+|.+.++..+-..+. 
-.++..+..-...+....+++.|..=|+.+++++ T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lp---f~e~N~~t~~~~~vrt~LP~~~fR~lN~g~~~s~ 77 (331) T protein:vir:98 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMT---VIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEK 77 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhce---eeeccCCccceeeEEeccCCchhhccCCccCccc Confidence 8863 2222332 3344 345567777776544322111 1122111111123345678899999999999999 Q ss_pred cccceeEEeehhhhcchhccHHHHhccCccHH---HHHHHHHHHHHHHHHHHHHHHHh-----------c---------- Q lcl|Aclame:pro 74 IGTSKREAKVRKIGKGTELTDEAVLSGFGDPQ---GEAVRQHGLAIANKVDNDVLEAL-----------K---------- 129 (274) Q Consensus 74 ~~~~~~~~~~~~~~~~~~is~e~~~~s~~d~~---~~~~~~~a~~~a~~~d~~~i~~~-----------~---------- 129 (274) .++.+++..+.-++..+.+++...... .+.. ....+...+.+..++.+.+|..- . T Consensus 78 ~tt~q~t~~l~ilgg~~eVDk~la~~~-Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~ 156 (331) T protein:vir:98 78 SRTVQVKDSMGMLETYAEVDKALADLN-GNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAEN 156 (331) T ss_pred ceeEEEEEEEEEeccceeechHHHhhc-CCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhcccccccc Confidence 999999999999999999998766554 3443 33445577778877777776210 0 Q ss_pred ----------ccccc----------------------c------------------------------------------ Q lcl|Aclame:pro 130 ----------GATLT----------------------V------------------------------------------ 135 (274) Q Consensus 130 ----------~a~~~----------------------~------------------------------------------ 135 (274) ++..+ + T Consensus 157 ~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~r 236 (331) T protein:vir:98 157 GQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVR 236 (331) T ss_pred ccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEE Confidence 00000 0 Q ss_pred ----cC-----cccC----HHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcce Q lcl|Aclame:pro 136 
----EA-----DITK----LDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGA 202 (274) Q Consensus 136 ----~~-----~~~~----~d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~ 202 (274) +. ++.+ .+.+++|...+........+|.||.+....|++............. .......+-.+.|+ T Consensus 237 i~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~-~~~~g~~~t~~~gi 315 (331) T protein:vir:98 237 IANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTM-EEIAGKKVVAFDGI 315 (331) T ss_pred EeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeee-eecCCcceeEECCe Confidence 00 0001 1234455555554455667899999999999876432211111111 11122233468899 Q ss_pred eeEEcCCCCcceEEEE Q lcl|Aclame:pro 203 VIVRSNKLNKGEALLA 218 (274) Q Consensus 203 ~Vv~s~~~p~~~~~l~ 218 (274) ||-..+.+-.++.-+. T Consensus 316 pir~~dai~~tE~~Vv 331 (331) T protein:vir:98 316 PCRRTDALLLTEARVV 331 (331) T ss_pred eEEEeeeeecCccccC Confidence 9988887755554433 No 206 >protein:vir:107826 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996627;genbank:gi:45580761;genbank:GeneID:2767902 Probab=97.09 E-value=6.8e-05 Score=43.42 Aligned_cols=213 Identities=9% Similarity=0.041 Sum_probs=120.1 Q ss_pred CCcc---ccchhhc---cchH-HHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccc Q lcl|Aclame:pro 1 MAQG---TTKVSNL---IVPE-VLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQ 73 (274) Q Consensus 1 ma~~---~T~~~~~---~iPe-~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~ 73 (274) |++. .=+..++ +-|+ .+...|+|.+.++..+-..+. 
-.++..+..-...+....+++.|..=|+.+++++ T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lp---f~e~N~~t~~~~~vrt~LP~~~fR~lN~g~~~s~ 77 (331) T protein:vir:10 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMT---VIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEK 77 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhce---eeeccCCccceeeEEeccCCchhhccCCccCccc Confidence 8863 2222332 3344 345567777776544322111 1122111111123345678899999999999999 Q ss_pred cccceeEEeehhhhcchhccHHHHhccCccHH---HHHHHHHHHHHHHHHHHHHHHHh-----------c---------- Q lcl|Aclame:pro 74 IGTSKREAKVRKIGKGTELTDEAVLSGFGDPQ---GEAVRQHGLAIANKVDNDVLEAL-----------K---------- 129 (274) Q Consensus 74 ~~~~~~~~~~~~~~~~~~is~e~~~~s~~d~~---~~~~~~~a~~~a~~~d~~~i~~~-----------~---------- 129 (274) .++.+++..+.-++..+.+++...... .+.. ....+...+.+..++.+.+|..- . T Consensus 78 ~tt~q~t~~l~ilgg~~eVDk~la~~~-Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~ 156 (331) T protein:vir:10 78 SRTVQVKDSMGMLETYAEVDKALADLN-GNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAEN 156 (331) T ss_pred ceeEEEEEEEEEeccceeechHHHhhc-CCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhcccccccc Confidence 999999999999999999998766554 3443 33445577778877777776210 0 Q ss_pred ----------ccccc----------------------c------------------------------------------ Q lcl|Aclame:pro 130 ----------GATLT----------------------V------------------------------------------ 135 (274) Q Consensus 130 ----------~a~~~----------------------~------------------------------------------ 135 (274) ++..+ + T Consensus 157 ~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~r 236 (331) T protein:vir:10 157 GQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVR 236 (331) T ss_pred ccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEE Confidence 00000 0 Q ss_pred ----cC-----cccC----HHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcce Q lcl|Aclame:pro 136 
----EA-----DITK----LDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGA 202 (274) Q Consensus 136 ----~~-----~~~~----~d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~ 202 (274) +. ++.+ .+.+++|...+........+|.||.+....|++............. .......+-.+.|+ T Consensus 237 i~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~-~~~~g~~~t~~~gi 315 (331) T protein:vir:10 237 IANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTM-EEIAGKKVVAFDGI 315 (331) T ss_pred EeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeee-eecCCcceeEECCe Confidence 00 0001 1234455555554455667899999999999876432211111111 11122233468899 Q ss_pred eeEEcCCCCcceEEEE Q lcl|Aclame:pro 203 VIVRSNKLNKGEALLA 218 (274) Q Consensus 203 ~Vv~s~~~p~~~~~l~ 218 (274) ||-..+.+-.++.-+. T Consensus 316 pir~~dai~~tE~~Vv 331 (331) T protein:vir:10 316 PCRRTDALLLTEARVV 331 (331) T ss_pred eEEEeeeeecCccccC Confidence 9988887755554433 No 207 >protein:vir:107388 Length: 331 # NCBI annotation: Bbp17 # Family: family:all:1903 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958686;genbank:gi:41179378;genbank:GeneID:2717182 Probab=97.09 E-value=6.8e-05 Score=43.42 Aligned_cols=213 Identities=9% Similarity=0.041 Sum_probs=120.1 Q ss_pred CCcc---ccchhhc---cchH-HHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccccc Q lcl|Aclame:pro 1 MAQG---TTKVSNL---IVPE-VLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQ 73 (274) Q Consensus 1 ma~~---~T~~~~~---~iPe-~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~ 73 (274) |++. .=+..++ +-|+ .+...|+|.+.++..+-..+. 
-.++..+..-...+....+++.|..=|+.+++++ T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lp---f~e~N~~t~~~~~vrt~LP~~~fR~lN~g~~~s~ 77 (331) T protein:vir:10 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMT---VIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEK 77 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhce---eeeccCCccceeeEEeccCCchhhccCCccCccc Confidence 8863 2222332 3344 345567777776544322111 1122111111123345678899999999999999 Q ss_pred cccceeEEeehhhhcchhccHHHHhccCccHH---HHHHHHHHHHHHHHHHHHHHHHh-----------c---------- Q lcl|Aclame:pro 74 IGTSKREAKVRKIGKGTELTDEAVLSGFGDPQ---GEAVRQHGLAIANKVDNDVLEAL-----------K---------- 129 (274) Q Consensus 74 ~~~~~~~~~~~~~~~~~~is~e~~~~s~~d~~---~~~~~~~a~~~a~~~d~~~i~~~-----------~---------- 129 (274) .++.+++..+.-++..+.+++...... .+.. ....+...+.+..++.+.+|..- . T Consensus 78 ~tt~q~t~~l~ilgg~~eVDk~la~~~-Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~ 156 (331) T protein:vir:10 78 SRTVQVKDSMGMLETYAEVDKALADLN-GNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAEN 156 (331) T ss_pred ceeEEEEEEEEEeccceeechHHHhhc-CCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhcccccccc Confidence 999999999999999999998766554 3443 33445577778877777776210 0 Q ss_pred ----------ccccc----------------------c------------------------------------------ Q lcl|Aclame:pro 130 ----------GATLT----------------------V------------------------------------------ 135 (274) Q Consensus 130 ----------~a~~~----------------------~------------------------------------------ 135 (274) ++..+ + T Consensus 157 ~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~r 236 (331) T protein:vir:10 157 GQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVR 236 (331) T ss_pred ccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEE Confidence 00000 0 Q ss_pred ----cC-----cccC----HHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcce Q lcl|Aclame:pro 136 
----EA-----DITK----LDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGA 202 (274) Q Consensus 136 ----~~-----~~~~----~d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~ 202 (274) +. ++.+ .+.+++|...+........+|.||.+....|++............. .......+-.+.|+ T Consensus 237 i~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~-~~~~g~~~t~~~gi 315 (331) T protein:vir:10 237 IANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTM-EEIAGKKVVAFDGI 315 (331) T ss_pred EeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeee-eecCCcceeEECCe Confidence 00 0001 1234455555554455667899999999999876432211111111 11122233468899 Q ss_pred eeEEcCCCCcceEEEE Q lcl|Aclame:pro 203 VIVRSNKLNKGEALLA 218 (274) Q Consensus 203 ~Vv~s~~~p~~~~~l~ 218 (274) ||-..+.+-.++.-+. T Consensus 316 pir~~dai~~tE~~Vv 331 (331) T protein:vir:10 316 PCRRTDALLLTEARVV 331 (331) T ss_pred eEEEeeeeecCccccC Confidence 9988887755554433 No 208 >protein:vir:103759 Length: 330 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024928;genbank:gi:48697198;genbank:GeneID:2846083 Probab=96.99 E-value=7.1e-05 Score=43.31 Aligned_cols=211 Identities=12% Similarity=0.092 Sum_probs=121.8 Q ss_pred CCc---cccchhh---ccchHHHHHHHHHHHHHhhhh-cccccccccccccCCCEEEEEeecCCCCcccccCCCcccccc Q lcl|Aclame:pro 1 MAQ---GTTKVSN---LIVPEVLAPMMQAELDKKLRF-AQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQ 73 (274) Q Consensus 1 ma~---~~T~~~~---~~iPe~~~~~v~~~~~~~~~~-~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~ 73 (274) |++ +.=+..+ .+-|+.....|+|.+.++..+ .-+.-.... 
...|..-.+ ....|++.|..=++.+++++ T Consensus 1 m~~~~~~a~TL~e~AKr~~~d~~~~~IIE~l~~tn~IL~~lpf~e~N--~~tg~~t~v--rt~LP~~~fR~lN~g~~~s~ 76 (330) T protein:vir:10 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGN--LPTGHRTSV--RTGLPTPTWRKLYGGVLPNK 76 (330) T ss_pred CCcCCCCcccHHHHHhhcCcchhHHHHHHHHhcCchHHhhcchhhcc--CCcccceeE--EeecCCchhhhcCCcccccc Confidence 663 3333333 466777777888888765443 222221111 112222222 24568899999999999999 Q ss_pred cccceeEEeehhhhcchhccHHHHhccCccHH---HHHHHHHHHHHHHHHHHHHHHH-----------hc---------- Q lcl|Aclame:pro 74 IGTSKREAKVRKIGKGTELTDEAVLSGFGDPQ---GEAVRQHGLAIANKVDNDVLEA-----------LK---------- 129 (274) Q Consensus 74 ~~~~~~~~~~~~~~~~~~is~e~~~~s~~d~~---~~~~~~~a~~~a~~~d~~~i~~-----------~~---------- 129 (274) .++.+++..+.-++..+.+.+...... .+.. ....+...+++.+++.+.+|.. |. T Consensus 77 ~tt~qvt~~l~ilgg~~eVDr~la~~~-Gn~a~~ra~e~~~~ikam~q~~~~~~iyGD~a~~p~~F~GL~kR~~~~ta~~ 155 (330) T protein:vir:10 77 SSTAQVTDNCGMLEAYAEVDKALADLN-GNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAEN 155 (330) T ss_pred ceEEEEEEEeEEecchhhhhhHHHhhc-CCHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCChhhccchhhhcCCCCCCc Confidence 999999999999999999998765543 3443 4445567788888887777621 00 Q ss_pred ----------ccccc--------------------------------------cc------------------------- Q lcl|Aclame:pro 130 ----------GATLT--------------------------------------VE------------------------- 136 (274) Q Consensus 130 ----------~a~~~--------------------------------------~~------------------------- 136 (274) ++..+ +. 
T Consensus 156 ~~qvIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~~~dg~gg~y~~~~~~~~w~~Gl~i~d~r~v 235 (330) T protein:vir:10 156 KDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYV 235 (330) T ss_pred hhheeeccccccCceEEEEEEEcCCeEEEEcccCccccceeeeccceeeecccCCCCceeEEeeeeeeeeeeEEeCcccE Confidence 00000 00 Q ss_pred -------C----cccCHHHHHH----HHHHHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcc Q lcl|Aclame:pro 137 -------A----DITKLDGLQT----AIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALG 201 (274) Q Consensus 137 -------~----~~~~~d~iv~----a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G 201 (274) . .....+++++ |...+........+|.||......|++...... .....-.....-.+-.+.| T Consensus 236 vRI~NIdvs~l~~~~~~~~li~lm~~A~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~k~--n~~l~~~~~~g~~~t~~~g 313 (330) T protein:vir:10 236 ARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKI--ANNLTWETVSGERVMTFDG 313 (330) T ss_pred EEEeecccccCCCCccHHHHHHHHHHHHHhccCCCCCcceeeechHHHHHHHHHHhhcc--cceeeeeecCCeeeEEECC Confidence 0 0001224444 444444444466789999999999987643221 1111111111122346889 Q ss_pred eeeEEcCCCCcceEEEE Q lcl|Aclame:pro 202 AVIVRSNKLNKGEALLA 218 (274) Q Consensus 202 ~~Vv~s~~~p~~~~~l~ 218 (274) +||-.++.+-.++.-+. 
T Consensus 314 ipir~~Dail~tE~~vv 330 (330) T protein:vir:10 314 IPVQRTDALLNTESRVV 330 (330) T ss_pred eEEEEEeeeecCccccC Confidence 99988888765554443 No 209 >protein:vir:106286 Length: 534 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944113;genbank:gi:38640157;genbank:GeneID:2658034 Probab=96.87 E-value=0.00027 Score=40.11 Aligned_cols=273 Identities=10% Similarity=0.001 Sum_probs=129.2 Q ss_pred CCccccch------hhccchHHHHH--HHHHHHHHhhhhcccccccc---ccccc--CCCEEEEEeecCCCCcccc---- Q lcl|Aclame:pro 1 MAQGTTKV------SNLIVPEVLAP--MMQAELDKKLRFAQFADIDS---TLVGQ--PGDTLTFPAFTYSGDAQVI---- 63 (274) Q Consensus 1 ma~~~T~~------~~~~iPe~~~~--~v~~~~~~~~~~~~l~~~~~---~~~~~--~G~~v~ip~~~~~~~a~~~---- 63 (274) .+..++.. .....++.... ..........+......... ..... .+....+-.--....++.. T Consensus 174 ~~~~~a~~~g~~~~~~~~~~t~~~~Gt~~~~~~~~~~v~~~~~~~~~ag~~~~~~~~~~~~y~~~~gm~Ta~AE~lg~~g 253 (534) T protein:vir:10 174 FVRGTAVASGAFAKLHIEAATGVQAGTKTVQFIKDYAVDALPADQTEAGLAYKWLLANGYAVETSSAMATAFAELQQGFN 253 (534) T ss_pred ccccccccccccccccccccccccccccccccccccccccccCCccccccccccccccccceecccccchhhHhhhccCC Confidence 11111110 00001100000 00000000000000000000 00000 0001111000001112211 Q ss_pred -cCCCcccccccccceeEEeehhhhcchhccHHHHhc----cCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccc--- Q lcl|Aclame:pro 64 -AEGEKIPVDQIGTSKREAKVRKIGKGTELTDEAVLS----GFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTV--- 135 (274) Q Consensus 64 -~eg~~~~~~~~~~~~~~~~~~~~~~~~~is~e~~~~----s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~--- 135 (274) ..+.++++-.++.+.++++.+.++-.-++|-|+.+| -..|.+..+.+-|+..|+..|++++|..+....... 
T Consensus 254 gs~~~~f~EMsFsIdKvtVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELsNILSTEImlEINReii~~l~~~a~~~k~~ 333 (534) T protein:vir:10 254 GSADNEWNEMSFRIDKQVVEAKSRQLKAQYSIEMAQDLRAVHGLDADSELSSILANEIMHEINREMVLWINATAKVGKTG 333 (534) T ss_pred CCcccchhhcceEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhhhheeecc Confidence 112357778888899999999888888888887654 358999999999999999999999998775433211 Q ss_pred -------cCcccCH-------------HHHHHHHHHHhh---------cCCCccEEEEcHHHHHHHHhhhcccccccccc Q lcl|Aclame:pro 136 -------EADITKL-------------DGLQTAIDKFND---------EDLEPMVLFVNPLDAGGLRTSASDNFTRPTQL 186 (274) Q Consensus 136 -------~~~~~~~-------------d~iv~a~~~l~~---------~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~ 186 (274) ..+.+++ +.+-.....+.. .....+++|++|++.+.|......++-..... T Consensus 334 ~~~~~~~~~G~~d~~~~~~~~~~~~~~e~~~~L~~~i~~~an~i~~~T~rg~~n~~v~S~~Va~~L~~~g~l~~~~~~~~ 413 (534) T protein:vir:10 334 WTNMHGGKAGVFDFQDTKDIRGARWAGESYKALVVQIDKEANEIARQTGRGQGNFIICSRNVAAALGHTDMLMTPAVMGA 413 (534) T ss_pred cccccccccceeeeeccccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchhHHHHHhhccchhccccccc Confidence 1122211 111112222211 12367899999999999875443222111111 Q ss_pred ccccc--ccc--ccchhc-ceeeEEcCCCCcceEEEEcCCe------EEEEeccCceeeeccccccCccEEEEEEEEEEE Q lcl|Aclame:pro 187 GDNII--VKG--AFGEAL-GAVIVRSNKLNKGEALLAKKGA------VKLITKRDFFLEKDRDASRKSTALYSDKHYVAY 255 (274) Q Consensus 187 ~~~~~--~~g--~~~~i~-G~~Vv~s~~~p~~~~~l~~~~a------~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~ 255 (274) ..+.- ..+ ..|.+. |++|+++++.|.+-..+.-++. +-|+--.+......-|++.++-.+-...||+.. 
T Consensus 414 ~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~ 493 (534) T protein:vir:10 414 NTTMNTDTTSSLFAGVLAGKYRVYIDQYAVEDYFTVGYKGASEMDAGLYYCPYVALTPLRGTDPKNFQPVLGFKTRYGVK 493 (534) T ss_pred cccccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCccccceeeeeeeecee Confidence 11100 011 134443 6799999999976555443432 224433444444456888999999999999876 Q ss_pred EEcCcce-------EEEEeCCCc-------ccC Q lcl|Aclame:pro 256 LYDESKV-------VKITKGAGD-------EVM 274 (274) Q Consensus 256 v~~~~av-------v~l~~~aa~-------~~~ 274 (274) + +|-+. -++..+.++ .-| T Consensus 494 ~-NP~~~~~~~~~~~~i~~g~~~~~~~ag~n~~ 525 (534) T protein:vir:10 494 L-HPMADATQNKGFAKISNGMPQHTNMFGKNAF 525 (534) T ss_pred e-cCcccccCCccccccccCCcchhhhcccccc Confidence 4 34321 122221111 111 No 210 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=96.83 E-value=5e-05 Score=44.16 Aligned_cols=256 Identities=8% Similarity=-0.005 Sum_probs=135.9 Q ss_pred CCc----cccchhhccchHHHHHHHHH----HHHHhhhhcccccccccccccCC-CEEEEEeecCCCCcccccCCCcccc Q lcl|Aclame:pro 1 MAQ----GTTKVSNLIVPEVLAPMMQA----ELDKKLRFAQFADIDSTLVGQPG-DTLTFPAFTYSGDAQVIAEGEKIPV 71 (274) Q Consensus 1 ma~----~~T~~~~~~iPe~~~~~v~~----~~~~~~~~~~l~~~~~~~~~~~G-~~v~ip~~~~~~~a~~~~eg~~~~~ 71 (274) .|+ ..++.++.=||..+..+|.. .+..-.....|+-... .+.-. .++.++.+...|.+..++.++++|. 
T Consensus 34 da~d~~~~~~~~~~~~~~~~l~~~i~p~~~~~~~~~~~~~~l~pv~t--~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~ 111 (336) T protein:vir:36 34 DAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESK--KGDWTTLVAAFITAEPTTKVATYGDYSSDGD 111 (336) T ss_pred hhhhccCccccCCCcchHHHHHHhhccceEeeecchhhhhhhccccc--cCCccceeEEEeeeeceeeEEEeeccCCCce Confidence 132 23334566688888887742 2223333334433322 11111 3678888888899999999999999 Q ss_pred cccccceeEEeehhhhcchhccHHHHhc---cCccHHHHHHHHHHHHHHHHHHHHHHHHhc--------cc-----cccc Q lcl|Aclame:pro 72 DQIGTSKREAKVRKIGKGTELTDEAVLS---GFGDPQGEAVRQHGLAIANKVDNDVLEALK--------GA-----TLTV 135 (274) Q Consensus 72 ~~~~~~~~~~~~~~~~~~~~is~e~~~~---s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~--------~a-----~~~~ 135 (274) .+...+..+-+++.++..+.++.+++.. ...++...-++...+++.+++++..+-... +. ..+. T Consensus 112 ~d~~~~~~~~~v~~~~~g~~yg~~E~~~Aa~~~~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~~~~yGllNdP~l~a~~t~ 191 (336) T protein:vir:36 112 SGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITA 191 (336) T ss_pred eecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCcEEEEeccccceEEEEecCCCcccccc Confidence 9988888888888888888888655543 356777777777778888777764441111 11 1111 Q ss_pred cC---cccC----HHHHHHHHHHHhhcC------CCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcce Q lcl|Aclame:pro 136 EA---DITK----LDGLQTAIDKFNDED------LEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGA 202 (274) Q Consensus 136 ~~---~~~~----~d~iv~a~~~l~~~~------~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~ 202 (274) .+ +..+ +++|..++..|.... ..+..++|.|..+..|-+-+ ++. ......+.. 
.+-++ T Consensus 192 ~t~~~~~~t~~ei~~Di~~~~~~l~~qt~G~i~~~~~~tL~LP~~~~~~Ls~~n--~~g---~Tvl~~lk~----n~Pnl 262 (336) T protein:vir:36 192 TTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKTN--QYG---LAAAAKLKD----IFPKL 262 (336) T ss_pred CCCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccEEEechHHHHhccCCC--ccC---ccHHHHHHH----hcCcc Confidence 11 1122 566777777765422 24778999999888774221 110 000011111 12244 Q ss_pred eeEEcCCCC---cceEEEEcCC-----eEEEEeccCceeeeccccccCccEEEEEEEE-EEEEEcCcceEEEEeC Q lcl|Aclame:pro 203 VIVRSNKLN---KGEALLAKKG-----AVKLITKRDFFLEKDRDASRKSTALYSDKHY-VAYLYDESKVVKITKG 268 (274) Q Consensus 203 ~Vv~s~~~p---~~~~~l~~~~-----a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~-~~~v~~~~avv~l~~~ 268 (274) .++..+.+. .+..+++-+. ........+.+...- +.......+-...|+ |+-+.+|.++++++.= T Consensus 263 ~i~t~pEl~~a~g~~~~l~~~~~~~~~t~~~~~p~~~~~l~v-q~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:36 263 EFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSI-ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred EEEEccccccCCCceEEEEEEecCCCcceeeecchhhhccce-eecCceeEeccccceeeeeeeccchheeeecC Confidence 555444432 1222332111 111111111111000 011112233334444 5566799999998776 No 211 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=96.81 E-value=4e-05 Score=44.68 Aligned_cols=256 Identities=9% Similarity=-0.000 Sum_probs=138.7 Q ss_pred CC---c----cccchhhccchHHHHHHHH----HHHHHhhhhcccccccccccccCC-CEEEEEeecCCCCcccccCCCc Q lcl|Aclame:pro 1 MA---Q----GTTKVSNLIVPEVLAPMMQ----AELDKKLRFAQFADIDSTLVGQPG-DTLTFPAFTYSGDAQVIAEGEK 68 (274) Q Consensus 1 ma---~----~~T~~~~~~iPe~~~~~v~----~~~~~~~~~~~l~~~~~~~~~~~G-~~v~ip~~~~~~~a~~~~eg~~ 68 (274) || + ..++.++.-+|..+..+|. +.+........|+-.... +.-. 
.+++++.+...|.+..++.+++ T Consensus 31 ~a~da~d~~~~~~t~~~~g~~~~l~~~i~p~~~~~~~~~~~~~~l~~v~t~--g~W~~~~~~~~~~e~~G~a~~ygd~~D 108 (336) T protein:vir:78 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKK--GDWTTLVAAFITAEPTTTVATYGDYSS 108 (336) T ss_pred HHHhhhhhccccccCCCcchHHHHHHhcccceeeehhhhhhhhhhcccccC--CCccccEEEEeeeecceeeEEeecccC Confidence 33 2 2344555567887877774 222233333444433221 1111 4688888888899999999999 Q ss_pred ccccccccceeEEeehhhhcchhccHHHHhc---cCccHHHHHHHHHHHHHHHHHHHHHHHHhc--------c-----cc Q lcl|Aclame:pro 69 IPVDQIGTSKREAKVRKIGKGTELTDEAVLS---GFGDPQGEAVRQHGLAIANKVDNDVLEALK--------G-----AT 132 (274) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~is~e~~~~---s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~--------~-----a~ 132 (274) +|..+...+...-+++.++..+.++.+++.. ...++...-++..++++.+++++..+-... + +. T Consensus 109 ~P~vd~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~~~~~~GllN~P~l~a~ 188 (336) T protein:vir:78 109 DGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAP 188 (336) T ss_pred CCeeecceeeEEEEEEEEEeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCeEEEEeccccceEEEEeCCCCCcc Confidence 9999999999999999999999998776654 356777777777777787777764431111 1 11 Q ss_pred ccccCc---ccC----HHHHHHHHHHHhhcC------CCccEEEEcHHHHHHHHhhhccccccccccccccccccccchh Q lcl|Aclame:pro 133 LTVEAD---ITK----LDGLQTAIDKFNDED------LEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEA 199 (274) Q Consensus 133 ~~~~~~---~~~----~d~iv~a~~~l~~~~------~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i 199 (274) .++.+. ..+ +++|..++..+.... ..+..+++.|..+..|.+-+. +. ......+.. 
.+ T Consensus 189 ~t~~~~~w~~~T~~~I~~Di~~~~~~l~~qt~g~~~~~~~~tL~Lp~~~~~~L~~~n~--~g---~tv~~~lk~----n~ 259 (336) T protein:vir:78 189 ITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKTNQ--YG---LSAAAKLKE----IF 259 (336) T ss_pred cccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHhccCCCc--cC---ccHHHHHHH----hc Confidence 111111 123 445666666653321 245689999999988843211 10 000011111 12 Q ss_pred cceeeEEcCCCC---cceEEEEcCCe-----EEEEeccCceeeeccccccCccEEEEEEEE-EEEEEcCcceEEEEeC Q lcl|Aclame:pro 200 LGAVIVRSNKLN---KGEALLAKKGA-----VKLITKRDFFLEKDRDASRKSTALYSDKHY-VAYLYDESKVVKITKG 268 (274) Q Consensus 200 ~G~~Vv~s~~~p---~~~~~l~~~~a-----~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~-~~~v~~~~avv~l~~~ 268 (274) -++.++..+.+. .+..+++.+.. .......+.+...- +........-...|+ |+-+.+|.++++++.= T Consensus 260 Pnl~i~t~pel~~Agg~~~~~~~~~~~~~~t~~~~~p~~f~~lpv-q~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:78 260 PKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSI-ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred CccEEEEcccccccCcceEEEEEeeccCCcceeeecchhhhccce-eecCceeEeccccceeeeeeeccchheeeccC Confidence 234555444442 22233432221 11222222211100 111122233344454 4556699999998776 No 212 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=96.70 E-value=0.00016 Score=41.45 Aligned_cols=257 Identities=11% Similarity=0.042 Sum_probs=132.7 Q ss_pred CCccc-----------cchhhccchHHHHHH---HHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCC Q lcl|Aclame:pro 1 MAQGT-----------TKVSNLIVPEVLAPM---MQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEG 66 (274) Q Consensus 1 ma~~~-----------T~~~~~~iPe~~~~~---v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg 66 (274) |...- +..++--+|+.+..+ +++-+..-.+...|+-.... 
..-.-.++.++.+...|.+..|+.+ T Consensus 56 md~~~~~~~~~~~~~l~~~~~~g~~~~l~~~~p~~i~~~tap~~a~~l~pv~t~-g~W~~~~~~~~v~e~~G~A~~ygd~ 134 (379) T protein:vir:10 56 MDSNDIGPIPTPLSPLSPVSIPGLIQFLQNWLPGHVRILTAVREADEFLGLSTV-GQWDDEQIVQRVLEGLGTAQPYTDG 134 (379) T ss_pred hccccccccccccCccccccccchHHHHHhhcchHHHHHhhhhhhhhhcccccC-CCceeeeEEEeeeeeeeeeEEeccc Confidence 43221 111222345544433 34433333333444433221 1111146788888888999999999 Q ss_pred CcccccccccceeEEeehhhhcchhccHHHHhc---cCccHHHHHHHHHHHHHHHHHHHHHHHHhc----------cc-- Q lcl|Aclame:pro 67 EKIPVDQIGTSKREAKVRKIGKGTELTDEAVLS---GFGDPQGEAVRQHGLAIANKVDNDVLEALK----------GA-- 131 (274) Q Consensus 67 ~~~~~~~~~~~~~~~~~~~~~~~~~is~e~~~~---s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~----------~a-- 131 (274) +++|..+...+...-+++.+...+.++++++.. ...++...-.+...+++.+.+|+..+-... +. T Consensus 135 ~d~pl~d~~~~~~~r~v~~~~~g~~yg~~El~~Aa~~g~~l~~~Ka~aA~~ale~~~N~i~f~G~~d~~~~~yGllNdP~ 214 (379) T protein:vir:10 135 GNMALMSWTPTFETRTVVRFEAGLQVAPLEEARSSRVQVSSADEKRAMVGEALEVQRNRVAFYGYNDGSGRTFGFLNDPN 214 (379) T ss_pred cCCCeeeeeeeeeeeeeEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCCCcceEEEEeCCC Confidence 999998888777777777777778887766554 357888888888888888888876653311 00 Q ss_pred ---ccccc------Cccc--C----HHHHHHHHHHHhhc--C-----CCccEEEEcHHHHHHHHhhhccccccccccccc Q lcl|Aclame:pro 132 ---TLTVE------ADIT--K----LDGLQTAIDKFNDE--D-----LEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDN 189 (274) Q Consensus 132 ---~~~~~------~~~~--~----~d~iv~a~~~l~~~--~-----~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~ 189 (274) ..++. ..+. + +++|..++..+... + ..+..+++.|..+..|-+-+ ++. ..... 
T Consensus 215 l~a~~t~atg~~~~t~Wa~kT~~eI~~Di~~~~~~l~~qs~g~~~~~~~~~tL~LP~~~~~~L~~~n--~~g---~Tvl~ 289 (379) T protein:vir:10 215 LPAYVAVPNGAGGSPLWAQKTTLEIIADLRNGLTALQVQSMGRIKSNKTPITIGIPNAYENYITTPT--ELG---YSVAQ 289 (379) T ss_pred CcccccccCCcccccccccCCHHHHHHHHHHHHHHHHHhhCCeecccccceeEEecHHHHHhhcccc--ccC---ccHHH Confidence 01111 1111 2 35566666655322 1 23447999999998885321 110 00001 Q ss_pred cccccccchhcceeeEEcCCCCc-----ceEEEEcCCeEE----------EEeccCceeeeccccccCccEEEEEEE-EE Q lcl|Aclame:pro 190 IIVKGAFGEALGAVIVRSNKLNK-----GEALLAKKGAVK----------LITKRDFFLEKDRDASRKSTALYSDKH-YV 253 (274) Q Consensus 190 ~~~~g~~~~i~G~~Vv~s~~~p~-----~~~~l~~~~a~~----------~~~~~~~~ve~~r~~~~~~~~i~~~~~-~~ 253 (274) .+.. .+-++.++..+.+.. ...+++.+..-+ .....+.+...- +........-...| .| T Consensus 290 ~lk~----n~Pnl~i~t~pEL~~aggg~~~~~~~~~~~~~~~t~~~~~~~~~~p~k~~~l~v-e~~~~~~~~~~~~rt~G 364 (379) T protein:vir:10 290 YMRE----SYPNVTFVSAPELNDANGGSSAIYYYADAVENNGTDDGRTWLQVVPTKMFTLGV-EKKIKGYAEGYTNATAG 364 (379) T ss_pred HHHH----hcCCcEEEEcccccccCCCccEEEEEeeccCCCccCCcceEEEecchhhhhccc-eecCceeEeccccceee Confidence 1111 122455555444421 234444332111 111222111100 11111223333344 45 Q ss_pred EEEEcCcceEEEEeC Q lcl|Aclame:pro 254 AYLYDESKVVKITKG 268 (274) Q Consensus 254 ~~v~~~~avv~l~~~ 268 (274) +-+.+|.+++++..+ T Consensus 365 v~ir~P~Ai~~~~G~ 379 (379) T protein:vir:10 365 AMLKRPFATYRQTGA 379 (379) T ss_pred eeeecchhhheecCC Confidence 667799999999988 No 213 >protein:vir:99888 Length: 309 # NCBI annotation: capsid protein # Family: family:all:908 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164075;genbank:gi:56692607;genbank:GeneID:3192616 Probab=96.66 E-value=0.00027 Score=40.15 Aligned_cols=258 Identities=10% Similarity=0.018 Sum_probs=128.6 Q ss_pred CCccccchhhccchHH-HHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCc----ccccCCCcccccccc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEV-LAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDA----QVIAEGEKIPVDQIG 75 (274) Q Consensus 1 
ma~~~T~~~~~~iPe~-~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a----~~~~eg~~~~~~~~~ 75 (274) |++. .++++. +.++-+.--.+.++-..++.. ...+....++|.++....+ ..++.++..-.-+++ T Consensus 1 ~~~~------~~~~dp~LT~~A~gy~n~~~Ia~~l~P~----vpV~~~~~~~~~f~~~e~F~~~~t~r~~~~~~~~v~~~ 70 (309) T protein:vir:99 1 MSNA------PFPIDPELTAIAIAYRNGRMISDEVLPR----VPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFS 70 (309) T ss_pred CCCC------CcCcCHhHHHHHhhccChhhhhhhcCCc----cccCccccceeeechhhcccccchhhccCCCcceEeec Confidence 5433 455553 444444333333333333322 1112223555555432111 224555554444445 Q ss_pred cceeEEeehhhhcchhccHHHHh--ccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc---------cccC-----cc Q lcl|Aclame:pro 76 TSKREAKVRKIGKGTELTDEAVL--SGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL---------TVEA-----DI 139 (274) Q Consensus 76 ~~~~~~~~~~~~~~~~is~e~~~--~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~---------~~~~-----~~ 139 (274) ..+.+..++..+-...|..+... .+.+|+++...+.+.+.+.+..|...-..+.++.. +++. +. T Consensus 71 ~~~~~~~~~~~~L~~~i~~~~~~~a~~~~d~~~~Av~~l~~~i~l~rE~~~A~lv~~~a~y~~~~k~~Lsgt~~wsd~~S 150 (309) T protein:vir:99 71 ATDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTS 150 (309) T ss_pred ccCceeeecccceeecCCchhhhhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcChhhcCCCceEEecCccccCCCCC Confidence 55556666666655566665544 44688999999999998887777665554443321 1221 11 Q ss_pred cCHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhcc-ccccccccccccccccccchhcce-eeEEcCCC-----C- Q lcl|Aclame:pro 140 TKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASD-NFTRPTQLGDNIIVKGAFGEALGA-VIVRSNKL-----N- 211 (274) Q Consensus 140 ~~~d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~-~~~~~~~~~~~~~~~g~~~~i~G~-~Vv~s~~~-----p- 211 (274) .-...|-+++..+ ...|+.++|..+.|..|++.+.. +-+..+....+.+..-++..++|+ .|++.... 
+ T Consensus 151 DPi~~i~~~~~~~---g~~PN~~vlg~~~~~~l~~hp~i~~~ik~~~~~~g~it~~~la~l~~ve~V~vg~a~~n~a~~g 227 (309) T protein:vir:99 151 NPLPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNIARPG 227 (309) T ss_pred CcHHHHHHHHHhh---CCCcceEEechHHHHHHhhCHHHHHHhcCCCccccccCHHHHHHHhCcceEEeecceeeccccc Confidence 2233444554444 56899999999999999876532 222222222234545556678888 56653322 1 Q ss_pred --cceEEEEcCCe----------------EEEEe----ccCceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeCC Q lcl|Aclame:pro 212 --KGEALLAKKGA----------------VKLIT----KRDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGA 269 (274) Q Consensus 212 --~~~~~l~~~~a----------------~~~~~----~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~a 269 (274) ..-.++++... +||-- +..=.++..+..+.+...+++..++.-.++-+++-..|.... T Consensus 228 ~~~~~~~iwg~~~~L~y~~~~~~~~~~ps~G~t~~~~~r~~g~~~d~~~~~~g~~~vr~~~~~k~~i~~~d~G~li~~~v 307 (309) T protein:vir:99 228 QNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAV 307 (309) T ss_pred cccccccccCCcEEEEEcCCCCCCcccccccceeecccccCCceeeeeeccCCceEEEEeccccchhcchhcchhhhhcc Confidence 01113333322 22211 111011222223344455666555555566666666665554 Q ss_pred Cc Q lcl|Aclame:pro 270 GD 271 (274) Q Consensus 270 a~ 271 (274) |- T Consensus 308 a~ 309 (309) T protein:vir:99 308 AA 309 (309) T ss_pred cC Confidence 44 No 214 >protein:vir:103181 Length: 457 # NCBI annotation: gp135 # Family: family:all:364 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717802;genbank:gi:113200639;genbank:GeneID:4239190 Probab=96.64 E-value=0.00011 Score=42.18 Aligned_cols=268 Identities=10% Similarity=0.030 Sum_probs=129.0 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhh-----hccc-cccccccc---c-cCCCEEEEEeec---CCCCcccccCC- Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLR-----FAQF-ADIDSTLV---G-QPGDTLTFPAFT---YSGDAQVIAEG- 66 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~-----~~~l-~~~~~~~~---~-~~G~~v~ip~~~---~~~~a~~~~eg- 66 (274) .+...... -+|.+-++....+..... ..+. .......+ + ......+...+. 
....++..+++ T Consensus 114 q~~~~~a~----~~EAl~nEadt~fSg~~~~~~~~~~~~~~~~~gt~~~~~~~~~~~~~~~~~~~~gmsTA~aE~lgd~~ 189 (457) T protein:vir:10 114 ERNPAAAG----YDEAFFNEPNAGFSGGPGAYDPGATGVTNDAEGTNPALLNDSPAGTYEQADDATGMSTATVEALDDST 189 (457) T ss_pred cccccccc----ccceeeeccCcccCcccccccccccccccccccccccccCccccccccccccccchhhhhhhccCCCC Confidence 11111000 122221111111000000 0000 00000000 0 000000000000 01123333322 Q ss_pred --CcccccccccceeEEeehhhhcchhccHHHHhc----cCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccc----- Q lcl|Aclame:pro 67 --EKIPVDQIGTSKREAKVRKIGKGTELTDEAVLS----GFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTV----- 135 (274) Q Consensus 67 --~~~~~~~~~~~~~~~~~~~~~~~~~is~e~~~~----s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~----- 135 (274) ..+++-..+.+.++++.+.++-.-++|-|+.+| -..|.+..+.+-|+..|...|++++|..+....... T Consensus 190 ~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~~a~~~~~~~~ 269 (457) T protein:vir:10 190 ANTAFREMGFSIEKVTVTARARALKAEYSIEMAQDLKAIHGLDAEQELANILSTEILAEINREVVRTIYTNAVAGAQNNT 269 (457) T ss_pred CccchhhheeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhhhheeeecccc Confidence 346667777788999999888888888887655 358999999999999999999999998775433221 Q ss_pred -cCcccCH----------HHHHHHHHHH---------hhcCCCccEEEEcHHHHHHHHhhhcccccccccccc-----cc Q lcl|Aclame:pro 136 -EADITKL----------DGLQTAIDKF---------NDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGD-----NI 190 (274) Q Consensus 136 -~~~~~~~----------d~iv~a~~~l---------~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~-----~~ 190 (274) +.+.+++ +.+-.....+ ....+..++++++|.+.+.|......++....+... +. 
T Consensus 270 ~~~gv~dl~~~~~g~~~~e~~k~L~~~i~~ean~i~~~T~rg~gn~~i~S~~Va~~L~~sg~l~~~p~~~~~~~~~~~d~ 349 (457) T protein:vir:10 270 ATAGVFDLDVDSNGRWSVEKFKGLLFQIERDANAIGHQTRRGKGNILICSADVVSALGMAGVLDYTPALNGNNGLAGVDD 349 (457) T ss_pred ccceeeeeeccccchhhHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchhHHHHHhhcccccccchhhcccccccccc Confidence 1111111 1111111111 122457789999999999886543333332211110 11 Q ss_pred ccccccchh-cceeeEEcCCCC----cceEEEEcCCe------EEEEeccCceeeeccccccCccEEEEEEEEEEEEEcC Q lcl|Aclame:pro 191 IVKGAFGEA-LGAVIVRSNKLN----KGEALLAKKGA------VKLITKRDFFLEKDRDASRKSTALYSDKHYVAYLYDE 259 (274) Q Consensus 191 ~~~g~~~~i-~G~~Vv~s~~~p----~~~~~l~~~~a------~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~ 259 (274) ......|.+ .|++|+++++.. .+-..+.-++. +-|+--.+.....--|++.++-.+....||+. ..|| T Consensus 350 ~~~~~~G~l~~r~~vy~D~Ya~~ns~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l-~~NP 428 (457) T protein:vir:10 350 TSSTLVGTLNGRIKVYVDPYSANVADKHFYVAGYKGTSPYDAGLFYCPYVPLQQVRAINPDTFQPKIGFKTRYGM-VSNP 428 (457) T ss_pred ccceeEEEecCCeEEEEecccccCCccceEEEEEeCCcceecceeecccccccccCccCCccccceeeeeeeeee-eecc Confidence 122223454 468999996553 32222222332 22333333322223388899999999999998 6778 Q ss_pred cceEEEEeCCCcccC Q lcl|Aclame:pro 260 SKVVKITKGAGDEVM 274 (274) Q Consensus 260 ~avv~l~~~aa~~~~ 274 (274) ... .++.+.+.-+. 
T Consensus 429 ~~~-~~~~~~~~~~~ 442 (457) T protein:vir:10 429 FAG-GLTQGSGALTV 442 (457) T ss_pred ccc-ccccccccccc Confidence 744 33333332222 No 215 >protein:vir:107882 Length: 307 # NCBI annotation: gp34 # Family: family:all:908 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024707;genbank:gi:48696944;genbank:GeneID:2845970 Probab=96.58 E-value=0.00049 Score=38.73 Aligned_cols=259 Identities=10% Similarity=0.057 Sum_probs=132.1 Q ss_pred CCccccchhhccchH-HHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcc----cccCCCcccccccc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPE-VLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQ----VIAEGEKIPVDQIG 75 (274) Q Consensus 1 ma~~~T~~~~~~iPe-~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~----~~~eg~~~~~~~~~ 75 (274) |. +.+..++.+ ++.+..+.-..+.++-..++... ..+....++|.|+.. .+. .++.++..-.-++. T Consensus 1 m~----~~~~~~~~dp~LT~~A~gy~n~~~ia~~l~P~v----pv~~~~~k~~~f~~e-aF~~~~t~r~~~~~~~~v~~~ 71 (307) T protein:vir:10 1 MG----RLSKLRIVDPVLTNLAIGYTNAEFIGQSLMPVV----EVEKEGGKIPKFGKE-SFRLYKTERALRARSNRMNPE 71 (307) T ss_pred CC----CCCCCcccChhHHHHHHhhcchhhhhhhcCCcc----cccccccceeeECcc-cccchhhhcccCCCcceeecc Confidence 43 122223333 45565555444544434443321 111223455555432 111 12222222111111 Q ss_pred -cceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc---------cccc-----Cccc Q lcl|Aclame:pro 76 -TSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT---------LTVE-----ADIT 140 (274) Q Consensus 76 -~~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~---------~~~~-----~~~~ 140 (274) .+..+..+...+-...+.+.....+..|+.+...+.+.+.|.+..|..+-..+.++. .+++ ++.. 
T Consensus 72 ~~~~~~~~~~~~~L~~~id~r~~~~~~~~~~~~av~~l~d~I~l~~E~~~A~l~~~~~~y~~~~k~tLsGt~~Wsd~~sD 151 (307) T protein:vir:10 72 DLGSIDIVLDEHDLEYPIDYREDQESAFPLEQAAVQTATEAIQLRREKMVADLAQNPNSYAGGNKKQLSATEKFTAAGSD 151 (307) T ss_pred cccccccccccccccccCChhhcCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCccccCCCceEEeccccccCCCCCC Confidence 122233344444445566666667788999999999999888887776655443322 1122 1222 Q ss_pred CHHHHHHHHHHHhh-cCCCccEEEEcHHHHHHHHhhhcc-ccccccccccccccccccchhcceeeEE-cCCC-----C- Q lcl|Aclame:pro 141 KLDGLQTAIDKFND-EDLEPMVLFVNPLDAGGLRTSASD-NFTRPTQLGDNIIVKGAFGEALGAVIVR-SNKL-----N- 211 (274) Q Consensus 141 ~~d~iv~a~~~l~~-~~~~~~~~v~~p~~~~~L~~~~~~-~~~~~~~~~~~~~~~g~~~~i~G~~Vv~-s~~~-----p- 211 (274) .+.+|.+++.++.. .+..|+.++|.++.|..|++.+.. +.+..+. .+.+..-.+..++|+.-+. .... + T Consensus 152 Pi~di~~~~~ai~~~~g~~Pn~~vlg~~a~~al~~hp~i~e~lk~~~--~g~it~~~la~ll~v~~i~vg~a~~~~~~~~ 229 (307) T protein:vir:10 152 PVGVIEDGKEAIRTKIGRRPNTMVIGASAYKTLKAHPQLIEKIKYSM--KGIVTVDLLKEIFEVENIAVGEAIYADDKDR 229 (307) T ss_pred cHHHHHHHHHHHHhhhCCccceEEeCHHHHHHHhcCHHHHHHhCCcc--ccccCHHHHHHHhCceeEEEeeeeeeccCCc Confidence 34566667776654 567999999999999999876542 2222222 2344444556677764443 1110 0 Q ss_pred -----cceEEE-Ec-------C-----CeEEEEec-cCceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeCCC Q lcl|Aclame:pro 212 -----KGEALL-AK-------K-----GAVKLITK-RDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAG 270 (274) Q Consensus 212 -----~~~~~l-~~-------~-----~a~~~~~~-~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa 270 (274) .+..++ +. . 
.++||-.+ .+-.+...++...+...+++.....-.++-|++-..|+.+-+ T Consensus 230 ~~~iw~~~~vl~yv~~~~~~~~~~~~epsfGyT~~~~g~~~~d~~~~~~~~~~~r~~~~~~~~i~~~~~G~li~~~~~ 307 (307) T protein:vir:10 230 FTDIWGANIVLAYVPLQRGGQQRTPYEPSYGYTLRKKGNPVVDTRIEDGKLELVRSTDIFRPYLLGADAGYLISGING 307 (307) T ss_pred cceeCCCceEEEecccccCCCCCcccccccceeEEEcCCeEeeceecCCceeEEeccccccceeecccccceeccCCC Confidence 112222 10 0 13454332 222332234445566667777777777777888777777777 No 216 >protein:vir:96490 Length: 348 # NCBI annotation: head protein # Family: family:all:1083 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238492;genbank:gi:66391768;genbank:GeneID:5176912 Probab=96.51 E-value=0.00055 Score=38.44 Aligned_cols=263 Identities=12% Similarity=0.087 Sum_probs=119.9 Q ss_pred CCccccchhhccchHHHHHHHHHHHHH--hhhhcccccccccccccCCCEEEEEee-cCCC-CcccccCCCccccc-ccc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDK--KLRFAQFADIDSTLVGQPGDTLTFPAF-TYSG-DAQVIAEGEKIPVD-QIG 75 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~--~~~~~~l~~~~~~~~~~~G~~v~ip~~-~~~~-~a~~~~eg~~~~~~-~~~ 75 (274) ||+ ..+.|.|..+..++.+.... ......++.... ..+..+.+... .... .+..+.++.+-+.. .-. T Consensus 1 M~~----i~d~f~~~~l~~~i~~~~~~~~~~l~~~~Fp~~~----~~~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~r~~ 72 (348) T protein:vir:96 1 MGL----IYDKVTASNIAGYFNTLQENVDSTLGESIFPARK----QLGTKLSYIKGASGQSVALKAAAFDTNVTIRDRVS 72 (348) T ss_pred Ccc----hhhccCHHHHHHHHHhcccchhhhhhhhcCCCcc----ccceeEEEEeecCCceeEeeeecCCCCcceecccc Confidence 874 35678888888888654322 122223332111 11111221111 1111 13345555444332 223 Q ss_pred cceeEEeehhhhcchhccHHHHh------cc-CccHHHHHHHH-------HHHHHHHHHHHHHHHHhccccc-------- Q lcl|Aclame:pro 76 TSKREAKVRKIGKGTELTDEAVL------SG-FGDPQGEAVRQ-------HGLAIANKVDNDVLEALKGATL-------- 133 (274) Q Consensus 76 ~~~~~~~~~~~~~~~~is~e~~~------~s-~~d~~~~~~~~-------~a~~~a~~~d~~~i~~~~~a~~-------- 133 (274) +......+-.++....++..+.+ .+ .....+.+.+. +.+.+.+.+|..+...+.++.. 
T Consensus 73 ~~~~~~~~p~i~~~~~i~~~d~~~l~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~~~~~~~ 152 (348) T protein:vir:96 73 AEIHDEQMPFFKEALLVKENDRQQLNLVKDTGNEALINTIVAGIFNDDVTLINGARARLEAMRMQVLATGKIAFTSDGVN 152 (348) T ss_pred eeeeeeecCccccccccCHHHHHHHHhhhccCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEeecCCee Confidence 34444444444433333322211 11 11222333333 3344555556555555543211 Q ss_pred -------------ccc-----CcccCHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccc-ccccccccc---- Q lcl|Aclame:pro 134 -------------TVE-----ADITKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFT-RPTQLGDNI---- 190 (274) Q Consensus 134 -------------~~~-----~~~~~~d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~-~~~~~~~~~---- 190 (274) +.. ++..-+++|.++...+.+.+..++.++|+++.+..|++++..... ......... T Consensus 153 ~~vdfg~~~~~~~t~~~~W~~~~adp~~di~~~~~~~~~~G~~~~~~i~~~~~~~~l~~~~~v~~~~~~~~~~~~~~~~~ 232 (348) T protein:vir:96 153 KDIDYGVKADHKKQVSKSWAEPGATPLADLEDAIETARELGLNPERAIMNAKTFGLIRKAASTVKAIKPLAGDGSSVTKA 232 (348) T ss_pred EEEeccCCcccceeeccccCCCCCCHHHHHHHHHHHHHhcCCcccEEEeCHHHHHHHhcCHHHHHHHhccCCccccccHH Confidence 111 112234567777777777778899999999999999876543211 111111111 Q ss_pred ccccccchhcceeeEEc------------CCCCcceEEEEcCCeEEEEeccCc-----------------------eeee Q lcl|Aclame:pro 191 IVKGAFGEALGAVIVRS------------NKLNKGEALLAKKGAVKLITKRDF-----------------------FLEK 235 (274) Q Consensus 191 ~~~g~~~~i~G~~Vv~s------------~~~p~~~~~l~~~~a~~~~~~~~~-----------------------~ve~ 235 (274) .....++++.|+++++= +.+|.+..+++..+..|...--++ .+.+ T Consensus 233 ~~~~~~~~~~g~~i~~y~~~y~d~~G~~~~~~p~~~v~l~~~~~~G~~~yg~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 312 (348) T protein:vir:96 233 ELQNYVADNYGVEIVLENGTYRNEKGEVSKFFPDGHLTLIPNGPLGNTVFGTTPEESDLFADNTVNADVEIVDSGIAVTT 312 (348) T ss_pred HHHHHHhhhcCceEEEEccEEEecCCcEeccccCCeEEEEcCCCceeEEeccChhhhhhhhcccccccceecCCeeEEEe Confidence 11222345667777751 336677777766655442211000 0000 Q ss_pred ccccccCccEEEEEEEEEEEEEcCcceEEEEeCCCc Q lcl|Aclame:pro 236 DRDASRKSTALYSDKHYVAYLYDESKVVKITKGAGD 271 (274) Q Consensus 236 
~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~ 271 (274) +.+.+-....+.+-.+.=..+.+|++++++++-+|- T Consensus 313 ~~~~dP~~~~~~~~s~plPv~~~~~~~~~a~Vl~~~ 348 (348) T protein:vir:96 313 TKTTDPVNVQTKVSMVALPSFERLGDVYMLTVIPGV 348 (348) T ss_pred eecCCCceEEEEEeeeeeccccCCCcEEEEEEecCC Confidence 001000112223333333445589999998888877 No 217 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=96.47 E-value=0.00037 Score=39.41 Aligned_cols=257 Identities=9% Similarity=0.002 Sum_probs=128.9 Q ss_pred CCccccchhhccchH--HHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcc--cccC-CCcccccccc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPE--VLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQ--VIAE-GEKIPVDQIG 75 (274) Q Consensus 1 ma~~~T~~~~~~iPe--~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~--~~~e-g~~~~~~~~~ 75 (274) |+- ...++.| .+-..|.+.-.+.+..+.++.+..... -.-.++.+..++..|.+. |++. ..++|..+.. T Consensus 1 ~~~-----lafl~~qL~~id~~vye~~~~~~~~~~lipv~t~~~-~~~~~~~~~~~d~~G~a~~~~i~~~a~dip~vd~~ 74 (304) T protein:vir:52 1 MSL-----LAYVKNGLTAVSKDIAETKYPEIVFPQFVYVDQQTA-VGITEKLHYGADEHGSLDDGLITVGTSTLDQVEVG 74 (304) T ss_pred Cch-----HHHHHHHHHHHhhhhhccccccchhhhhccccCCCC-cccceEEEeeeeccCcccccccCCcCCccceeecc Confidence 321 1112222 122233332233444555555443322 122367778887778888 7654 4679999999 Q ss_pred cceeEEeehhhhcchhccHHHHhcc---CccHHHHHHHHHHHHHHHHHHHHHHHHhc---c------ccc------ccc- Q lcl|Aclame:pro 76 TSKREAKVRKIGKGTELTDEAVLSG---FGDPQGEAVRQHGLAIANKVDNDVLEALK---G------ATL------TVE- 136 (274) Q Consensus 76 ~~~~~~~~~~~~~~~~is~e~~~~s---~~d~~~~~~~~~a~~~a~~~d~~~i~~~~---~------a~~------~~~- 136 (274) .++....++.++..+.++-++++.+ ..++...-.+.+.+++...+|+..+-+-. + .+. ++. 
T Consensus 75 ~~~~~~~i~~~~~~~~y~~~El~~a~~~g~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~~~g~~GllN~p~v~~~~~~~~~ 154 (304) T protein:vir:52 75 FTPTRSYIVPWAKSVTWTKPELEQGKLLGLALNTAKIMALNKNAQQTLQKVAFLGHAKDSRLTGLLNNKSVEVYAIKGAA 154 (304) T ss_pred cceeEEEEEEEeeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHhhhceEEEEeeccccceEEEEeCCCcceeeecCCc Confidence 9999999999988888877665543 45666666666666777777765542211 0 000 000 Q ss_pred --Ccc--cCHH----HHHHHHHHHhhc--C-CCccEEEEcHHHHHHHHhhhccccccccccc-----cccccccccchhc Q lcl|Aclame:pro 137 --ADI--TKLD----GLQTAIDKFNDE--D-LEPMVLFVNPLDAGGLRTSASDNFTRPTQLG-----DNIIVKGAFGEAL 200 (274) Q Consensus 137 --~~~--~~~d----~iv~a~~~l~~~--~-~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~-----~~~~~~g~~~~i~ 200 (274) ..+ -|.+ +|.+++..+... + ..+..++|.|..+..|......+. . .... ++...+|..-+|. T Consensus 155 a~~~w~~~T~~eI~~di~~~~~~i~~~s~~~~~p~tl~Lpp~~~~~l~~~~~~~~-~-~Tvl~~l~~n~~~~~g~~l~I~ 232 (304) T protein:vir:52 155 QNTKVQAMDFDKAVAFFKEIFLKGMEKTKRIEAPNTFAIDSLDLAHLALVQRANT-D-TTALEFLTKHLSAAAGRQVAIK 232 (304) T ss_pred cCCccccCCHHHHHHHHHHHHHHHHhccCceecCceEEeCHHHHHHHhhccCCCC-C-chHHHHHHHhcccccCCcceEE Confidence 011 1333 455566555332 2 367889999999998843211110 0 0000 0111111111233 Q ss_pred ceeeEEcCCCCc--ceEEEEc--CCeEEEEeccCceeeeccccccCc--cEEEEEEEE-EEEEEcCcceEEEEe Q lcl|Aclame:pro 201 GAVIVRSNKLNK--GEALLAK--KGAVKLITKRDFFLEKDRDASRKS--TALYSDKHY-VAYLYDESKVVKITK 267 (274) Q Consensus 201 G~~Vv~s~~~p~--~~~~l~~--~~a~~~~~~~~~~ve~~r~~~~~~--~~i~~~~~~-~~~v~~~~avv~l~~ 267 (274) ++|.-....-.. +-.+++. +..+.+...++++.... ..++. ..+-...|+ |+-+..|.+++.+-. 
T Consensus 233 ~v~~~~~~~g~~g~~r~vvY~~d~~~~~~~vP~p~~~l~~--q~~~~~~~~vp~~~r~gGv~v~~P~a~~y~D~ 304 (304) T protein:vir:52 233 ALPSNYGTRVTDGKTRAMVYVNSKEHVIFDVPMSPTVLDA--QPKGLLAFESGLRMAFGGVTFMEPDSALYVDY 304 (304) T ss_pred EecccccccCCCCceEEEEEecChhheEEecCccccccch--hhcCCceEEecceeeeeeEEEEccceeeeecC Confidence 332211111111 2234444 33455555455443332 11222 223234444 566779999999998 No 218 >protein:vir:106734 Length: 336 # NCBI annotation: gp13 # Family: family:all:1653 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944321;genbank:gi:38638620;genbank:GeneID:2657363 Probab=96.47 E-value=6.3e-05 Score=43.63 Aligned_cols=257 Identities=8% Similarity=-0.012 Sum_probs=136.8 Q ss_pred CC---c----cccchhhccchHHHHHHHHHHH----HHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcc Q lcl|Aclame:pro 1 MA---Q----GTTKVSNLIVPEVLAPMMQAEL----DKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKI 69 (274) Q Consensus 1 ma---~----~~T~~~~~~iPe~~~~~v~~~~----~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~ 69 (274) || + ..++.++.-+|..+..+|.-.+ ........|+-.... ..---..+.++.....|.+..|+.+.++ T Consensus 31 ~a~da~d~~~~~~t~~~~g~~~~l~~~i~p~~~~~~~~~~~~~~l~~v~t~-g~w~~~~~~~~~~e~~G~a~~ygd~~d~ 109 (336) T protein:vir:10 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKK-GDWTTLVAAFITAEPTTKVATYGDYSSD 109 (336) T ss_pred HHHhhhhhccccccCCCcchHHHHHhhcCcceeeeeechhchhhhcccccC-CCcceeeEEEEeeeeeeeEEEccccCCC Confidence 33 2 2344555567887877774222 222223333333221 1111245778888888899999999999 Q ss_pred cccccccceeEEeehhhhcchhccHHHHhc---cCccHHHHHHHHHHHHHHHHHHHHHHHHh--------cc-----ccc Q lcl|Aclame:pro 70 PVDQIGTSKREAKVRKIGKGTELTDEAVLS---GFGDPQGEAVRQHGLAIANKVDNDVLEAL--------KG-----ATL 133 (274) Q Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~is~e~~~~---s~~d~~~~~~~~~a~~~a~~~d~~~i~~~--------~~-----a~~ 133 (274) |..+...+...-+++.++..+.++.+++.. ...++...-++...+++.+++++..+-.- -+ +.. 
T Consensus 110 P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~~~~~~GllN~P~l~a~~ 189 (336) T protein:vir:10 110 GDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPI 189 (336) T ss_pred cceeeeeeeeeeeEEEEEEEEeeCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCeEEEEeecccceEEEeecCCCCccc Confidence 999999988888899998899998776654 35677777777777777777776443111 01 111 Q ss_pred cccCc---ccC----HHHHHHHHHHHhhcC------CCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhc Q lcl|Aclame:pro 134 TVEAD---ITK----LDGLQTAIDKFNDED------LEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEAL 200 (274) Q Consensus 134 ~~~~~---~~~----~d~iv~a~~~l~~~~------~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~ 200 (274) +..+. ..+ +++|..++..+.... ..+..+++.|..+..|.+-+. +. ......+.. .+- T Consensus 190 t~~~~~w~~~T~~eI~~Di~~~~~~l~~qt~g~i~~~~~~tL~Lp~~~~~~L~~~n~--~g---~tv~~~lk~----n~P 260 (336) T protein:vir:10 190 TATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKTNQ--YG---LSAAAKLKE----IFP 260 (336) T ss_pred ccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHhccCCCc--cC---ccHHHHHHH----hCC Confidence 11111 123 456666666663322 235689999999988843211 10 000011111 122 Q ss_pred ceeeEEcCCCC---cceEEEEcCCe-----EEEEeccCceeeeccccccCccEEEEEEEE-EEEEEcCcceEEEEeC Q lcl|Aclame:pro 201 GAVIVRSNKLN---KGEALLAKKGA-----VKLITKRDFFLEKDRDASRKSTALYSDKHY-VAYLYDESKVVKITKG 268 (274) Q Consensus 201 G~~Vv~s~~~p---~~~~~l~~~~a-----~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~-~~~v~~~~avv~l~~~ 268 (274) +++++..+.+. .+..+++.+.. 
.......+.+...- +........-...|+ |+-+.+|.+++++..= T Consensus 261 nl~i~t~pel~~Agg~~~~~~~~~~~~~~t~~~~~P~~f~~lpv-q~~~~~~~v~~~~rt~Gv~i~rP~ai~~~~GI 336 (336) T protein:vir:10 261 KLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSI-ERYSSYFRQKKSAGTWGAVIFRPFAVAQMLGV 336 (336) T ss_pred ccEEEEcccccccCCceEEEEEecccCCcceeeecChhhhccce-eecCceeEeccccceeeeeeeccchheeeccC Confidence 35555544442 22334443321 11222222211100 111122233334454 4556699999998776 No 219 >protein:vir:79078 Length: 307 # NCBI annotation: gp8 # Family: family:all:908 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111208;genbank:gi:134288798;genbank:GeneID:4960752 Probab=96.45 E-value=0.00061 Score=38.20 Aligned_cols=259 Identities=10% Similarity=0.046 Sum_probs=131.9 Q ss_pred CCccccchhhccchH-HHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcc----cccCCCccccccc- Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPE-VLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQ----VIAEGEKIPVDQI- 74 (274) Q Consensus 1 ma~~~T~~~~~~iPe-~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~----~~~eg~~~~~~~~- 74 (274) |.. .+..++.+ ++.+..+.-.++..+-..++... ..+....+++.++... +. ..+.++....-.. T Consensus 1 m~~----~~~~~~~dp~LT~~A~gy~n~~~Iad~lfP~v----pV~~~~~k~~~f~~e~-f~~~~t~ra~~~~~~~v~~~ 71 (307) T protein:vir:79 1 MGR----LSKLRIVDPVLTNLAIGYTNAEFIGQTLMPVV----EVEKEGGKIPKFGKES-FRLYQTERALRAKSNRMNPE 71 (307) T ss_pred CCC----CCCCcccCHHHHHHHhhccchhhhhhhcCCcc----cccccccceeeecccc-ccccccccccCCCcceeeee Confidence 542 12222222 35555554333333323332211 1122235555554321 11 1233332222221 Q ss_pred ccceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc---------ccc-----Cccc Q lcl|Aclame:pro 75 GTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL---------TVE-----ADIT 140 (274) Q Consensus 75 ~~~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~---------~~~-----~~~~ 140 (274) .++..+..+.+.+....+.+.....+..++++...+.+.+.+.+..|..+-..+.++.. +++ ++.. 
T Consensus 72 ~~~~~~~~~~~~~l~~~id~r~~~~~~~~~~~~Av~~l~d~I~l~~E~~~A~l~~~~~~y~~~~k~tLsgt~~Wsd~~sD 151 (307) T protein:vir:79 72 DIDSVDVNLDEHDLEYPIDYREDQESAFPLEQAAVQTATDAIQLRREKMIADLSQNPSSYAAGNKKQLSATEKFTAANSD 151 (307) T ss_pred ccccccccccccchhhcccchhcCCCCCCHHHHHHHHHHHHHHhHHHHHHHHHhccccccCCCceEEEccCcccCCCCCC Confidence 22333444444444455666666667788999989999888888888776665543321 122 1222 Q ss_pred CHHHHHHHHHHHhh-cCCCccEEEEcHHHHHHHHhhhcc-ccccccccccccccccccchhccee-eEEcCCC------- Q lcl|Aclame:pro 141 KLDGLQTAIDKFND-EDLEPMVLFVNPLDAGGLRTSASD-NFTRPTQLGDNIIVKGAFGEALGAV-IVRSNKL------- 210 (274) Q Consensus 141 ~~d~iv~a~~~l~~-~~~~~~~~v~~p~~~~~L~~~~~~-~~~~~~~~~~~~~~~g~~~~i~G~~-Vv~s~~~------- 210 (274) .+.+|.+++.++.+ .+..|+.++|.++.|..|++.+.. +.+..+. .+.+..-.+..++|+. |++-... T Consensus 152 Pi~di~~~~~ai~~~~g~~Pn~~vlg~~a~~~l~~h~~i~~~lk~~~--~g~it~~~la~l~~v~~V~vg~a~y~~~~~~ 229 (307) T protein:vir:79 152 PVGVIEDGKEAIRTKIGRRPNTMVIGASAYKTLKAHPQLIEKIKYSM--KGIVTVDLLKEIFEVENIAVGEAIYADDKDR 229 (307) T ss_pred cHHHHHHHHHHHHHhhCCccceEEeCHHHHHHHhcCHHHHHHhcCcc--ccccCHHHHHHHhCceeEEEeeeeeeccccc Confidence 34566667776654 567999999999999999876542 2222222 2444444556788876 4432111 Q ss_pred -----CcceEEEEcC------------CeEEEEecc-CceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeCCC Q lcl|Aclame:pro 211 -----NKGEALLAKK------------GAVKLITKR-DFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAG 270 (274) Q Consensus 211 -----p~~~~~l~~~------------~a~~~~~~~-~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa 270 (274) +.+..+++-+ .++||-.+. 
+-.+...++...+...+++.....-.++-|++-..|+.+-+ T Consensus 230 ~~~iw~~~~~l~y~~~~~~~~~~~~~~ps~Gyt~~~~g~~~~d~~~~~~~~~~vrv~~~~~~~i~~~~~G~li~~~v~ 307 (307) T protein:vir:79 230 FTDIWGANIVLAYVPLQRGGQQRTPYEPSYGYTLRKKGNPVVDTRIEDGKLELVRATDIFRPYLLGADAGYLISGING 307 (307) T ss_pred chhcCCCceEEEecccccCCCCCcccccccceeEEecCceEEecccCCCceeEEeecccccceeeccccchhhccCCC Confidence 1111122110 124443322 22222223334566677777777777888887777777766 No 220 >protein:vir:107947 Length: 519 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595301;genbank:gi:161622607;genbank:GeneID:5783666 Probab=96.16 E-value=0.00074 Score=37.76 Aligned_cols=271 Identities=9% Similarity=0.014 Sum_probs=127.6 Q ss_pred CCc------cccchhhcc-chHHHHHHH-------HHHHHHhhhhcccccccc-cc----cccCCCEEEEEeecCCCCcc Q lcl|Aclame:pro 1 MAQ------GTTKVSNLI-VPEVLAPMM-------QAELDKKLRFAQFADIDS-TL----VGQPGDTLTFPAFTYSGDAQ 61 (274) Q Consensus 1 ma~------~~T~~~~~~-iPe~~~~~v-------~~~~~~~~~~~~l~~~~~-~~----~~~~G~~v~ip~~~~~~~a~ 61 (274) -.. .+-...... ....+.... ..-...-....+...... .. ....|+...+..--....++ T Consensus 153 SG~~~~~~~~~~~~~~~~~~g~~~~~~~~~s~~~~~~~~~~~t~~ag~t~~~~~~~a~~~~~~~~~~~~~~~gmsTa~aE 232 (519) T protein:vir:10 153 SGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAE 232 (519) T ss_pred CccccccccccccccccccccccccccccccccceeccccccccCCCCcCccccccccccccccccccccccccccchhh Confidence 100 000000000 000000000 000000011111110000 00 00011111111100011111 Q ss_pred c---c--cCCCcccccccccceeEEeehhhhcchhccHHHHhc----cCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc Q lcl|Aclame:pro 62 V---I--AEGEKIPVDQIGTSKREAKVRKIGKGTELTDEAVLS----GFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT 132 (274) Q Consensus 62 ~---~--~eg~~~~~~~~~~~~~~~~~~~~~~~~~is~e~~~~----s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~ 132 (274) . . ..+.++++-.++.+.++++.+.++-.-++|-|+.+| -..|.+..+.+-|+..|...|++++|..+.-.. 
T Consensus 233 al~~lggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa 312 (519) T protein:vir:10 233 LQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSA 312 (519) T ss_pred ccccCCCccccchhhhceeEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhh Confidence 1 1 123457888888899999999888888888887655 358999999999999999999999997663221 Q ss_pred c--------c--ccCcccCHHH-------------HHHHHHHHh--------h-cCCCccEEEEcHHHHHHHHhhhcccc Q lcl|Aclame:pro 133 L--------T--VEADITKLDG-------------LQTAIDKFN--------D-EDLEPMVLFVNPLDAGGLRTSASDNF 180 (274) Q Consensus 133 ~--------~--~~~~~~~~d~-------------iv~a~~~l~--------~-~~~~~~~~v~~p~~~~~L~~~~~~~~ 180 (274) . + ..++.++++. +-.....+. . .....++++++|++.+.|.......+ T Consensus 313 ~~~~~g~t~~~~~~aGv~d~~~~~d~~~~rw~~e~~k~L~~~i~~~an~I~~~T~r~~gn~ii~S~~Va~~L~~~g~~~~ 392 (519) T protein:vir:10 313 QVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVS 392 (519) T ss_pred hcceeecccCcccccceeecccccccccchHHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhccchhc Confidence 1 0 1112222211 111222221 1 12456899999999999876542222 Q ss_pred cccccccccccccc----ccchh-cceeeEEcCCCCcceEEEEcCCe------EEEEeccCceeeeccccccCccEEEEE Q lcl|Aclame:pro 181 TRPTQLGDNIIVKG----AFGEA-LGAVIVRSNKLNKGEALLAKKGA------VKLITKRDFFLEKDRDASRKSTALYSD 249 (274) Q Consensus 181 ~~~~~~~~~~~~~g----~~~~i-~G~~Vv~s~~~p~~~~~l~~~~a------~~~~~~~~~~ve~~r~~~~~~~~i~~~ 249 (274) ..........-.+. ..|.+ .|++|+++++.|.+-..+.-++. +-|+--.+.....--|++.++-.+-.. 
T Consensus 393 ~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~ 472 (519) T protein:vir:10 393 YAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFK 472 (519) T ss_pred cccccccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEecCcccccceeeccccccccccccCCccccceeeee Confidence 11110000000111 12344 36799999999975555433332 223333343333445888999999999 Q ss_pred EEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 250 KHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 250 ~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) .||+..+ ||-+- ....+..+.. T Consensus 473 tRY~l~~-NP~~~--~~~~~~~~~i 494 (519) T protein:vir:10 473 TRYGIGI-NPFAD--PAAQAPTKRI 494 (519) T ss_pred eeeceee-cCccc--ccccCcccee Confidence 9998764 45321 1122222222 No 221 >protein:vir:7324 Length: 335 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848215;genbank:gi:30387386;genbank:GeneID:2641870 Probab=96.15 E-value=0.00067 Score=38.00 Aligned_cols=212 Identities=13% Similarity=-0.023 Sum_probs=119.3 Q ss_pred CCccccch------hhccchHHHHHHHHHHHHHhhhh-cccccccccccccCCCEEEEEeecCCCCcccccCCCcccccc Q lcl|Aclame:pro 1 MAQGTTKV------SNLIVPEVLAPMMQAELDKKLRF-AQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQ 73 (274) Q Consensus 1 ma~~~T~~------~~~~iPe~~~~~v~~~~~~~~~~-~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~~~ 73 (274) |+.--+.. +..+-|......|+|.+.++..+ .-+.-.... 
...|..-.+ ....|++.|..=++.+++++ T Consensus 1 m~~~~~~a~TL~E~Akr~~~d~~~~~IIE~l~~tneIL~~lpf~e~N--~~tg~~~~v--rt~LP~~~fR~lN~g~~~s~ 76 (335) T protein:vir:73 1 MALIGQTLPSLLDIYNRTDKNGRIARIVEQLAKTNDILTDAIYVPCN--DGSKHKTTI--RAGIPEPVWRRYNQGVQPTK 76 (335) T ss_pred CCcCCCCchhHHHHHhhcCcchhHHHHHHHHhcCchHHhhcchhccc--CCcccceeE--EEecCCchhhhcCCcccccc Confidence 77543322 22355666666788888765443 222221111 112222222 24568899999999999999 Q ss_pred cccceeEEeehhhhcchhccHHHHhccCccH---HHHHHHHHHHHHHHHHHHHHHHH-----------h-------c--- Q lcl|Aclame:pro 74 IGTSKREAKVRKIGKGTELTDEAVLSGFGDP---QGEAVRQHGLAIANKVDNDVLEA-----------L-------K--- 129 (274) Q Consensus 74 ~~~~~~~~~~~~~~~~~~is~e~~~~s~~d~---~~~~~~~~a~~~a~~~d~~~i~~-----------~-------~--- 129 (274) .++.+++..+.-++..+.|.+...... .+. .....+...+.+.+++.+.+|.. | . T Consensus 77 ~tt~qvt~~l~ilgg~~eVDr~La~~~-Gn~a~~ra~e~~~~ikam~q~~~~~~iyGDsa~~p~~FdGL~kR~~~~st~~ 155 (335) T protein:vir:73 77 TQTVPVTDTTGMLYDLGFVDKALADRS-NNAAAFRVSENMGKLQGFNNKVARYSIYGNTDAEPEAFMGLAPRFNTLSTSK 155 (335) T ss_pred ceEEEEEEEEEEecchhhhhHHHHhhc-CCHHHHHHHHHHHHHHHHHHHHHHHhccCCcCCChhhccchhhhhcCccccc Confidence 999999999999999999998655443 444 34444557778888887777621 0 0 Q ss_pred -------------ccccc----------------------c--------------------------------------- Q lcl|Aclame:pro 130 -------------GATLT----------------------V--------------------------------------- 135 (274) Q Consensus 130 -------------~a~~~----------------------~--------------------------------------- 135 (274) ++..+ + T Consensus 156 a~~a~~iIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~d~~G~~y~~~~~~~~w~~Gl~i~d~r~ 235 (335) T protein:vir:73 156 AASAENVFSAGGSGSTNTSIWFMSWGENTAHMIYPEGMVAGFQHEDLGDDLVSDGNGGQFRAYRDEFKWDIGLSVRDWRS 235 (335) T ss_pred cCcccceeeccccccCceEEEEEEEcCCeeEEEcccCccccceeeeccceeeecCCCCEEeEEEeeeeeeeeeEEeCccc Confidence 00000 0 Q ss_pred -------c-----CcccCHHHHH----HHHH--HHhhcCCCccEEEEcHHHHHHHHhhhccccccccccccccccccccc Q lcl|Aclame:pro 136 
-------E-----ADITKLDGLQ----TAID--KFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFG 197 (274) Q Consensus 136 -------~-----~~~~~~d~iv----~a~~--~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~ 197 (274) + .+..+.++++ +|.. .+........+|.||......|++...... .....-+....-.+- T Consensus 236 vvRI~NIdvs~l~~d~~~~~~l~~lmi~a~~~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~--n~~l~~~~~~g~~~t 313 (335) T protein:vir:73 236 ISRICNIDVTTLTKDASTGADLISMMVDAYYARDVAMLGDGKEVIYANKTIHAWLHKQAMNAK--NVNLTIEEYGGKKIV 313 (335) T ss_pred EEEEeecccccccccccchhhHHhhHHHHHHHHhccCCCCCceEEEechHHHHHHHHHHhccC--ceeeeeeccCCceeE Confidence 0 0111122333 3332 223223344689999999999987643221 111111112222234 Q ss_pred hhcceeeEEcCCCCcceEEEEc Q lcl|Aclame:pro 198 EALGAVIVRSNKLNKGEALLAK 219 (274) Q Consensus 198 ~i~G~~Vv~s~~~p~~~~~l~~ 219 (274) .+.|+||-..+.+-.++.-+.. T Consensus 314 ~~~gipir~~Dail~tE~~v~~ 335 (335) T protein:vir:73 314 SFLGIPIRRVDAILNTESAVTA 335 (335) T ss_pred EECCeEEEEEeeeecCcccccC Confidence 6889999988887655554433 No 222 >protein:vir:4902 Length: 348 # NCBI annotation: gp348 # Family: family:all:1083 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056680;genbank:gi:9635015;genbank:GeneID:1262657 Probab=96.13 E-value=0.00097 Score=37.11 Aligned_cols=264 Identities=12% Similarity=0.084 Sum_probs=117.8 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHh--hhhcccccccccccccCCCEEEEEeecCCC-CcccccCCCccccc-cccc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKK--LRFAQFADIDSTLVGQPGDTLTFPAFTYSG-DAQVIAEGEKIPVD-QIGT 76 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~--~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~-~a~~~~eg~~~~~~-~~~~ 76 (274) ||+ ..++|.|..+..++.+..... .....++...... +-+.+.+....... .+..+.++++-+.. 
.-.+ T Consensus 1 M~~----l~d~f~~~~l~~~v~~~~~~~~~~l~~~~Fp~~~~~---~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~r~~~ 73 (348) T protein:vir:49 1 MGL----IYDKVTASNIAGYFNALQENVDSTLGESIFPARKQL---GTKLSYITGASGQSVALKAAAFDTNVTVRDRVSA 73 (348) T ss_pred Ccc----hhhhcCHHHHHHHHHhccccchhhhHhhcCCCcccc---CceeEEEEeecCceeeeeeecCCCCcceecccce Confidence 885 357888898888887544221 1112222211100 00111111111111 12334444333322 2223 Q ss_pred ceeEEeehhhhcchhccHHHHh------cc-CccHHHHHHHH-------HHHHHHHHHHHHHHHHhcccccc-------- Q lcl|Aclame:pro 77 SKREAKVRKIGKGTELTDEAVL------SG-FGDPQGEAVRQ-------HGLAIANKVDNDVLEALKGATLT-------- 134 (274) Q Consensus 77 ~~~~~~~~~~~~~~~is~e~~~------~s-~~d~~~~~~~~-------~a~~~a~~~d~~~i~~~~~a~~~-------- 134 (274) ......+-.++....++....+ .+ .....+.+.+. +.+.+.+.+|..+...+.++... T Consensus 74 ~~~~~~~p~i~~~~~i~~~d~~~l~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~i~~~g~~~ 153 (348) T protein:vir:49 74 EMHDEQMPFFKEAMLVKENDRQQLNLVKDSGNAALVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFTSDGVNK 153 (348) T ss_pred eeeeeecCccccccccCHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEecCCceE Confidence 3334444444433334332211 11 11122223233 33445556666666655432110 Q ss_pred -------------c-----cCcccCHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccc-ccccccccccccc--- Q lcl|Aclame:pro 135 -------------V-----EADITKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDN-FTRPTQLGDNIIV--- 192 (274) Q Consensus 135 -------------~-----~~~~~~~d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~-~~~~~~~~~~~~~--- 192 (274) . .++..-+++|.+....+.+.+..+..++|+++.+..|++++... ...........+. 
T Consensus 154 ~vdyg~~~~~~~t~~~~W~~~~adp~~di~~~~~~~~~~G~~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~~~i~~~~ 233 (348) T protein:vir:49 154 DIDYGVKPDHKKQVSKSWAEPGATPLADLEDAIETARELGLNPERAVMNAKTFGLIRKAASTVKVIKPLAGDGSSVTKAE 233 (348) T ss_pred EEeecCCcccceeeeeccCCCCCCHHHHHHHHHHHHHhcCCcccEEEeCHHHHHHHhcCHHHHHHhhccCcccccccHHH Confidence 0 11122345677777777777888999999999999998765332 1111111111111 Q ss_pred -ccccchhcceeeEE------------cCCCCcceEEEEcCCeEEEEeccCc--------------ee---------eec Q lcl|Aclame:pro 193 -KGAFGEALGAVIVR------------SNKLNKGEALLAKKGAVKLITKRDF--------------FL---------EKD 236 (274) Q Consensus 193 -~g~~~~i~G~~Vv~------------s~~~p~~~~~l~~~~a~~~~~~~~~--------------~v---------e~~ 236 (274) ....+++.|++|++ .+.+|.++.+++..+..|...--++ .+ .++ T Consensus 234 ~~~~~~~~~g~~i~~y~~~y~d~dG~~~~~~p~~~v~l~~~~~~G~~~yg~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (348) T protein:vir:49 234 LDNYIADNFGVTVVLENGTYRNEKGEVSKFFPDGHLTLIPNGPLGNTVFGTTPEESDLFADNTVNADVEIVDNGIAVTTT 313 (348) T ss_pred HHHHHHhhcCceEEEEeeEEEecCCcEeeeecCCeEEEecCCCcceeEEecChhhhhhccccccccceeecCCeEEEeee Confidence 11223566777765 1335667777766554432110000 00 000 Q ss_pred cccccCccEEEEEEEEEEEEEcCcceEEEEeCCCc Q lcl|Aclame:pro 237 RDASRKSTALYSDKHYVAYLYDESKVVKITKGAGD 271 (274) Q Consensus 237 r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~ 271 (274) ...+--...+.+....=..+.+|+++++.++-+|- T Consensus 314 ~~~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl~~~ 348 (348) T protein:vir:49 314 KTTDPVNVQTKVSMVALPSFERLDDVYMLTVIPAV 348 (348) T ss_pred ecCCCceEEEEEeeeccccccCCCcEEEEEEecCC Confidence 00000011222223333345588899998888877 No 223 >protein:vir:6901 Length: 522 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861877;genbank:gi:32453668;genbank:GeneID:1494303 Probab=95.74 E-value=0.0015 Score=36.02 Aligned_cols=272 Identities=10% Similarity=0.024 Sum_probs=133.0 Q ss_pred CCc-cccchhhccchHH--HHHHHHHHHHHhhhhccccccccccc-----c-cCCCEEEEEeecCCCCccc---c--cCC Q lcl|Aclame:pro 1 
MAQ-GTTKVSNLIVPEV--LAPMMQAELDKKLRFAQFADIDSTLV-----G-QPGDTLTFPAFTYSGDAQV---I--AEG 66 (274) Q Consensus 1 ma~-~~T~~~~~~iPe~--~~~~v~~~~~~~~~~~~l~~~~~~~~-----~-~~G~~v~ip~~~~~~~a~~---~--~eg 66 (274) .+. ..|...+.+.... ....+........ ..+.......+. . ..|....+..=-....++- . ..+ T Consensus 167 ~~~~~~t~~G~~~~~~~~~~gt~~~~~~a~~t-~~~t~~~~~~~~~ai~s~~~~~~~y~~g~GmsTa~aEal~~lggss~ 245 (522) T protein:vir:69 167 LAASTQTKVGDIYTHFFQETGTVYLQASAQVT-ISSSADDAAKLDAEIIKQMEAGALVEIAEGMATSIAELQEGFNGSTD 245 (522) T ss_pred cccccccccccccccccccccceeeecccCCc-CCCCCcccccccchhccccccccceeeccccchhhhhhcccCCCCcc Confidence 211 1111122211100 0000000000000 000000000000 0 0111122111000111111 1 113 Q ss_pred CcccccccccceeEEeehhhhcchhccHHHHhc----cCccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-------- Q lcl|Aclame:pro 67 EKIPVDQIGTSKREAKVRKIGKGTELTDEAVLS----GFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLT-------- 134 (274) Q Consensus 67 ~~~~~~~~~~~~~~~~~~~~~~~~~is~e~~~~----s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~-------- 134 (274) .++++-.++.+.++++.+.++-.-++|-|+.+| -..|.+..+.+-|+..|+..|++++|..+.....- T Consensus 246 ~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~~g~t~~ 325 (522) T protein:vir:69 246 NPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKSGMTNI 325 (522) T ss_pred cchhhhcceEeeEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhheeeccccccc Confidence 457888888899999999888888888887655 35899999999999999999999999776422111 Q ss_pred --ccCcccCH-------------HHHHHHHHHH--------hhc-CCCccEEEEcHHHHHHHHhhhcccccccccccccc Q lcl|Aclame:pro 135 --VEADITKL-------------DGLQTAIDKF--------NDE-DLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNI 190 (274) Q Consensus 135 --~~~~~~~~-------------d~iv~a~~~l--------~~~-~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~ 190 (274) ..++.++. +.+-.....+ ..- ....+++|++|++...|.......+........+. 
T Consensus 326 ~~~~~Gv~Dl~~~~~~~~~rw~~e~~k~L~~~i~~~an~i~~~T~rg~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~g~ 405 (522) T protein:vir:69 326 VGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQGLASGF 405 (522) T ss_pred cccccceeecccccccccchhHHHHHHHHHHHHHHHHHHHHHhcccccccEEEEchhHHHHHhhcccccccccccccccc Confidence 11112111 1111111121 111 23678999999999988654322222221111111 Q ss_pred ccccc----cchh-cceeeEEcCCCCcceEEEEcCCe------EEEEeccCceeeeccccccCccEEEEEEEEEEEEEcC Q lcl|Aclame:pro 191 IVKGA----FGEA-LGAVIVRSNKLNKGEALLAKKGA------VKLITKRDFFLEKDRDASRKSTALYSDKHYVAYLYDE 259 (274) Q Consensus 191 ~~~g~----~~~i-~G~~Vv~s~~~p~~~~~l~~~~a------~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~ 259 (274) ..+.. .|.+ .|++|+++++.|.+-..+.-++. +-|+--.+.....--|++.++-.+-...||+..+ || T Consensus 406 ~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~v-NP 484 (522) T protein:vir:69 406 NTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGANEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGV-NP 484 (522) T ss_pred cccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCccccceeeeeeeeceee-cC Confidence 11111 1344 36799999999876655543432 2344344444444568889999999999998764 34 Q ss_pred cce-------EEEEeCCCcccC Q lcl|Aclame:pro 260 SKV-------VKITKGAGDEVM 274 (274) Q Consensus 260 ~av-------v~l~~~aa~~~~ 274 (274) -+. 
.+|..+.|+.-- T Consensus 485 ~~~~~~~~~~~ri~~g~p~~~~ 506 (522) T protein:vir:69 485 FAESSLQAPGARIQSGMPSILN 506 (522) T ss_pred cccccCCcccceeecccchhhc Confidence 321 245555554322 No 224 >protein:vir:5670 Length: 514 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899609;genbank:gi:34419596;genbank:GeneID:2546039 Probab=95.61 E-value=0.0016 Score=35.92 Aligned_cols=269 Identities=10% Similarity=0.014 Sum_probs=124.4 Q ss_pred CCccccchh--------------------hccchH--HHHHHHHHHH--HHhhhhccccccc--cccc-cc-CCCEEEEE Q lcl|Aclame:pro 1 MAQGTTKVS--------------------NLIVPE--VLAPMMQAEL--DKKLRFAQFADID--STLV-GQ-PGDTLTFP 52 (274) Q Consensus 1 ma~~~T~~~--------------------~~~iPe--~~~~~v~~~~--~~~~~~~~l~~~~--~~~~-~~-~G~~v~ip 52 (274) |-..-|+-+ +.+..+ .....+.... ....+........ ..+. .. .|....+. T Consensus 142 ~nEadt~fSG~~~~~~~~~~~~~~~~~~G~~~~~~~t~~~gd~~~~~~~~~~~~~~~~~~~~~~t~~~~~~a~~~~y~~~ 221 (514) T protein:vir:56 142 TRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEID 221 (514) T ss_pred ccccCcCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccchhhhhh Confidence 211111100 000000 0000000000 0000000000000 0000 00 00001111 Q ss_pred eecCCCCccc---c--cCCCcccccccccceeEEeehhhhcchhccHHHHhc----cCccHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 53 AFTYSGDAQV---I--AEGEKIPVDQIGTSKREAKVRKIGKGTELTDEAVLS----GFGDPQGEAVRQHGLAIANKVDND 123 (274) Q Consensus 53 ~~~~~~~a~~---~--~eg~~~~~~~~~~~~~~~~~~~~~~~~~is~e~~~~----s~~d~~~~~~~~~a~~~a~~~d~~ 123 (274) .--....++- . 
..+.++++-.++.+.++++.+.++-.-++|-|+.+| -..|.+..+.+-|+..|+..|+++ T Consensus 222 ~Gm~Ta~aEal~~lggs~~~~f~EMaFsIdK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILSTEImlEINRe 301 (514) T protein:vir:56 222 AGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNRE 301 (514) T ss_pred hhhhhhhhhhcccCCCCcccccceeeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHH Confidence 0000111111 1 124457778888899999999888888888887654 358999999999999999999999 Q ss_pred HHHHhcccccc---------ccCcccCHH---------HHHHHHH----HHh-h--------cCCCccEEEEcHHHHHHH Q lcl|Aclame:pro 124 VLEALKGATLT---------VEADITKLD---------GLQTAID----KFN-D--------EDLEPMVLFVNPLDAGGL 172 (274) Q Consensus 124 ~i~~~~~a~~~---------~~~~~~~~d---------~iv~a~~----~l~-~--------~~~~~~~~v~~p~~~~~L 172 (274) +|..+....+. ...+.++.+ ...+... .+. + .....++++++|.+.+.| T Consensus 302 ii~~l~~~atv~~~~~~~~~~~~G~~d~~~~~d~~~~~~~~e~~~~l~~~i~~~an~i~~~T~rg~gn~~i~S~~Va~~L 381 (514) T protein:vir:56 302 IVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSAL 381 (514) T ss_pred HHHHHHhheeehhcccccccccccccccccccccccchHHHHHHHHHHHHHHHHHHHHHhhcccccccEEEEchhHHHHH Confidence 98777543321 111111111 1122111 121 1 124678999999999998 Q ss_pred Hhhhccccccccc------ccc--ccccccccchhcceeeEEcCCCCcceEEEEcCCe------EEEEeccCceeeeccc Q lcl|Aclame:pro 173 RTSASDNFTRPTQ------LGD--NIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGA------VKLITKRDFFLEKDRD 238 (274) Q Consensus 173 ~~~~~~~~~~~~~------~~~--~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~a------~~~~~~~~~~ve~~r~ 238 (274) .......+..... ..+ ..+..|.+ -.|++|+++++.|.+-..+.-++. 
+-|+--.+.......| T Consensus 382 ~~sg~l~~~~~~g~~~~~~~~d~~~~~~aG~l--~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~d 459 (514) T protein:vir:56 382 SMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVL--GGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVPLTPLRGSD 459 (514) T ss_pred HhhhhhccccccCccccccccccCcceEEEEe--cCceEEEecCCCCcceEEEEEecCcceecceeeccccccccccccC Confidence 6432211111110 001 11112222 257899999999976555443332 2233333433333458 Q ss_pred cccCccEEEEEEEEEEEEEcCcc-----eEEEEeCCCcccC Q lcl|Aclame:pro 239 ASRKSTALYSDKHYVAYLYDESK-----VVKITKGAGDEVM 274 (274) Q Consensus 239 ~~~~~~~i~~~~~~~~~v~~~~a-----vv~l~~~aa~~~~ 274 (274) ++.++-.+-...||+..+ ||-. ...+ ..-.-.| T Consensus 460 p~sfqP~~g~~tRY~l~~-NPy~~~~~~~~~~--~~~~~~~ 497 (514) T protein:vir:56 460 SKNFQPVIGFKTRYGVQV-NPFADPTASATKV--GNGAPVA 497 (514) T ss_pred Cccccceeeeeeeeceee-CCCCCcccccccc--CCcchhh Confidence 889999999999998764 4431 1110 0000011 No 225 >protein:vir:98480 Length: 348 # NCBI annotation: ORFp38 # Family: family:all:1083 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958280;genbank:gi:41057254;uniprot:Q38595;genbank:GeneID:2732864 Probab=95.11 E-value=0.0027 Score=34.62 Aligned_cols=261 Identities=14% Similarity=0.102 Sum_probs=117.5 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHh----hhhcccccccccccccCCCEEEEEeecC---CC-CcccccCCCccccc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKK----LRFAQFADIDSTLVGQPGDTLTFPAFTY---SG-DAQVIAEGEKIPVD 72 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~----~~~~~l~~~~~~~~~~~G~~v~ip~~~~---~~-~a~~~~eg~~~~~~ 72 (274) |+... -.+++-|..+..++.+...+. .....++.. ..-+.+.|-.+.. .+ .+..++.+.+.+.. 
T Consensus 1 M~~~~--~~d~~~~~~l~~~i~~~~~~~~~~~~l~~~~fp~------~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 72 (348) T protein:vir:98 1 MSWTL--DTEFIEPTQLTGLIREALRDLQVNRFRLARWLPN------VDVDDITFEFLRGGGGLAETASYRSWDTESKIG 72 (348) T ss_pred Ccchh--hhhccCHHHHHHHHHHHhhccCcchhhHHhcCCC------ccccceEEEEEeccCCceeeeeeecCCCcccee Confidence 77543 567899998998887654321 122223221 1111233322211 11 23445555555544 Q ss_pred cc-ccceeEEeehhhhcchhccHHHHhccCccHHHHHH-------HHHHHHHHHHHHHHHHHHhccccc----------- Q lcl|Aclame:pro 73 QI-GTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAV-------RQHGLAIANKVDNDVLEALKGATL----------- 133 (274) Q Consensus 73 ~~-~~~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~-------~~~a~~~a~~~d~~~i~~~~~a~~----------- 133 (274) +- .+......+-.++....++.+++........+.+. .++.+.+.+.+|-.+...+.++.. T Consensus 73 ~r~g~~~~~~~~~~i~~~~~i~~~d~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~~g~~~~vDy 152 (348) T protein:vir:98 73 RREGLAKVMGELPPISEKIPLNEYDRLRLRKLSRDEALPFIARDAQRLARNIGARFEVARGSALVNATVPVTELQQTVDF 152 (348) T ss_pred ecccceeeeeeccccccccccCHHHHHHhcCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEecCceEEcc Confidence 32 34555555555555555555444332222222232 334444555555555554443211 Q ss_pred --------ccc------CcccCHHHHHHHHHHHhh-cCCCccEEEEcHHHHHHHHhhhcccc-cccccc--ccccccccc Q lcl|Aclame:pro 134 --------TVE------ADITKLDGLQTAIDKFND-EDLEPMVLFVNPLDAGGLRTSASDNF-TRPTQL--GDNIIVKGA 195 (274) Q Consensus 134 --------~~~------~~~~~~d~iv~a~~~l~~-~~~~~~~~v~~p~~~~~L~~~~~~~~-~~~~~~--~~~~~~~g~ 195 (274) ++. ++...+++|.+++..+.+ .+..++.++|+++.+..|++++.... ...... ....+..+. 
T Consensus 153 g~~~~~~~t~~~~Ws~~~~adp~~di~~~~~~~~~~~G~~p~~~vm~~~~~~~l~~~~~i~~~~~~~~~~~~~~~~~~~~ 232 (348) T protein:vir:98 153 GRIGSHSVVAAVLWSVHATATPISDLESWVATYEDTNGQSPGVILMPKAAVSHMRQCEEVIRQVFPLAPSGTAPMVSVEQ 232 (348) T ss_pred ccCcccccccccccCCCCCCCHHHHHHHHHHHHHHccCCcceEEEeCHHHHHHHhcCHHHHHHHhccCccccccccCHHH Confidence 111 112235677777777765 46789999999999999986643221 111110 111122222 Q ss_pred cch---hccee-eEEc-----------CCCCcceEEEEcCCe---------EEEEe-c-----c------------Ccee Q lcl|Aclame:pro 196 FGE---ALGAV-IVRS-----------NKLNKGEALLAKKGA---------VKLIT-K-----R------------DFFL 233 (274) Q Consensus 196 ~~~---i~G~~-Vv~s-----------~~~p~~~~~l~~~~a---------~~~~~-~-----~------------~~~v 233 (274) +.. .+|+| |.+- +.+|.+..+++..+. +|+-. + . .+-+ T Consensus 233 ~~~~~~~~g~~~i~~~d~~~~~~g~~~~~~p~~~i~l~p~~~~~~~~~~~~~G~t~~G~~~e~~~~~~~~~~~~~~~i~~ 312 (348) T protein:vir:98 233 LNTVLSSMGLPPIEVYDAKVAVDGVSTRITPANAIALLPEPGATDAAQPTELGATLLGTTAESLEDDYALAPGEQPGIVA 312 (348) T ss_pred HHHHHHhhCCeEEEEeeeEEEcCCceeceecCCeEEEEecCCcccccccccccceecccchhhhccccccceeccCceee Confidence 222 23444 3331 224555554432211 11000 0 0 0011 Q ss_pred eeccccccCccEEEEEEEEEEEEEcCcceEEEEeCC Q lcl|Aclame:pro 234 EKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGA 269 (274) Q Consensus 234 e~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~a 269 (274) .++.+.+--...+.+..+.=..+.+|++++++++-| T Consensus 313 ~~~~~~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl~ 348 (348) T protein:vir:98 313 ATWKTKDPVRLWTHAAAVGIPVLREPNLTFKAQVLA 348 (348) T ss_pred eeeeecCCcEEEEEEeeeeeccccCCCcEEEEEEeC Confidence 111111111122333334334456899999988888 No 226 >protein:vir:1991 Length: 305 # NCBI annotation: major head subunit # Family: family:all:776 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050638;genbank:gi:9633525;genbank:GeneID:2636267 Probab=94.89 E-value=0.0033 Score=34.22 Aligned_cols=194 Identities=12% Similarity=0.084 Sum_probs=112.4 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHh-hhhcccccccccccccCCCEEEEEeecCCCCc-ccccCCCcccccccccce Q lcl|Aclame:pro 1 
MAQGTTKVSNLIVPEVLAPMMQAELDKK-LRFAQFADIDSTLVGQPGDTLTFPAFTYSGDA-QVIAEGEKIPVDQIGTSK 78 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~-~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a-~~~~eg~~~~~~~~~~~~ 78 (274) |..+......++ ..+...+.+.+... .-+..++.. ..+.+..=++..++..|.. +|+ .+....+++... T Consensus 1 M~i~~~~l~~l~--~~~~~~f~~~~~~a~~~~~~iA~~----vpSt~~~~tY~wLg~fP~lrewi---Ger~i~~l~~~~ 71 (305) T protein:vir:19 1 MIVTPASIKALM--TSWRKDFQGGLEDAPSQYNKIAMV----VNSSTRSNTYGWLGKFPTLKEWV---GKRTIQQMEAHG 71 (305) T ss_pred CccCHHHHHHHH--HHHHHHHHHHHhhcCcccceEEeE----ecCCCCcccccccccCCccchhh---cceeeeeccccc Confidence 653222211111 11222222222111 111222221 1233344455555666655 456 467788888899 Q ss_pred eEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc------------------------- Q lcl|Aclame:pro 79 REAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL------------------------- 133 (274) Q Consensus 79 ~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~------------------------- 133 (274) .+++-+++...+.|.+.++++....+...+.++++++.+..-|..+++.+..+.. 
T Consensus 72 y~i~Nk~fe~tV~V~R~dIeDD~lG~y~p~~~~~G~~aa~~pd~lv~~lL~~Gf~~~cyDGq~FFdtDHpv~~~~~~tg~ 151 (305) T protein:vir:19 72 YSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYPNVDGTGS 151 (305) T ss_pred eeEeeccccceeccchhhccccccCchHHHHHHHHHHHhhchhhHHHHHHHhcCCccCCCCCcccCCCCCcccCCccccc Confidence 9999999999999999999999999999999999999999888888765532100 Q ss_pred -------------c--------------------------------------------------------------ccCc Q lcl|Aclame:pro 134 -------------T--------------------------------------------------------------VEAD 138 (274) Q Consensus 134 -------------~--------------------------------------------------------------~~~~ 138 (274) + ++.+ T Consensus 152 ~~~vsn~~~~~~~~g~~w~Lld~~~~ikP~I~Q~Rk~~~~~~~~~~~d~~vf~~~e~~ygvd~R~n~Gygfwq~a~gS~~ 231 (305) T protein:vir:19 152 AVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASTRRAAGYGFWQMAVAVKG 231 (305) T ss_pred ccchhhhhcCCCCCCceeeeeecCCcceeEEEecccccceeeccCCCchhhhhhceeeeeeeeeeeccccchhheecCCC Confidence 0 0124 Q ss_pred ccCHHHHHHHHHHHhhc--------CCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcc-eeeEEcCC Q lcl|Aclame:pro 139 ITKLDGLQTAIDKFNDE--------DLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALG-AVIVRSNK 209 (274) Q Consensus 139 ~~~~d~iv~a~~~l~~~--------~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G-~~Vv~s~~ 209 (274) +++.+.+-.|+.++... +..|+.+||.|+....-++.-..+....... +...-+.| +.+++++. 
T Consensus 232 ~Ls~~nl~aar~aM~~qk~d~G~pL~I~P~~LvVPp~LE~~A~qll~s~~i~~g~~-------~~~Np~~g~~eliV~P~ 304 (305) T protein:vir:19 232 DLTLDNLWKGWQLMRSFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNRELFADGNT-------TVSNEMKGKLQLVVADY 304 (305) T ss_pred CCCHHHHHHHHHHHHhhcCCCCceeeeecCeEEeCchhHHHHHHHHhhcccCCccc-------cccceecceEEEEeccc Confidence 45667777777776432 2367789999987655433211122211111 11123455 68889999 Q ss_pred C Q lcl|Aclame:pro 210 L 210 (274) Q Consensus 210 ~ 210 (274) + T Consensus 305 L 305 (305) T protein:vir:19 305 L 305 (305) T ss_pred C Confidence 9 No 227 >protein:vir:7214 Length: 521 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049787;genbank:gi:9632597;genbank:GeneID:1258751 Probab=94.74 E-value=0.0036 Score=33.97 Aligned_cols=270 Identities=9% Similarity=-0.006 Sum_probs=132.5 Q ss_pred CC-ccccchhhccchHHHHH-H-HHHHHHHhhhhccccccccc---c-cc-cCCCEEEEEeecCCCCccc---cc--CCC Q lcl|Aclame:pro 1 MA-QGTTKVSNLIVPEVLAP-M-MQAELDKKLRFAQFADIDST---L-VG-QPGDTLTFPAFTYSGDAQV---IA--EGE 67 (274) Q Consensus 1 ma-~~~T~~~~~~iPe~~~~-~-v~~~~~~~~~~~~l~~~~~~---~-~~-~~G~~v~ip~~~~~~~a~~---~~--eg~ 67 (274) .+ ...+..++.+....... . +........+..+....... + .. ..+....+..--....++- .+ .+. 
T Consensus 166 ~~~~~~~a~Gd~~~~~~~~~gt~~~~~~~~~~~~~g~t~~~~t~~~v~~~~~a~~~y~~g~gm~Ta~aEal~~~g~ss~~ 245 (521) T protein:vir:72 166 LAASTQTTVGDIYTHFFQETGTVYLQASVQVTIDAGATDAAKLDAEIKKQMEAGALVEIAEGMATSIAELQEGFNGSTDN 245 (521) T ss_pred cccccccccccccccccccccccccccccccccCCCCCCccccccccccccccCceeeeecccchhhhhhhcccCCcccc Confidence 11 12222233333221110 0 00000011111111110000 0 00 0111122211101111221 11 133 Q ss_pred cccccccccceeEEeehhhhcchhccHHHHhc----cCccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc--------- Q lcl|Aclame:pro 68 KIPVDQIGTSKREAKVRKIGKGTELTDEAVLS----GFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLT--------- 134 (274) Q Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~is~e~~~~----s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~--------- 134 (274) .+++-.++.+.++++.+.++-.-++|-|+.+| -..|.+..+.+-|+..|+..|++++|..+.-.... T Consensus 246 ~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~g~~g~t~~~ 325 (521) T protein:vir:72 246 PWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKSGMTLTP 325 (521) T ss_pred cccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhheeeeeeeeeeecc Confidence 46777777789999999888888888887655 35899999999999999999999999766422110 Q ss_pred -ccCcccCHH---------HHH----HHHHHH--------hhc-CCCccEEEEcHHHHHHHHhhhccccccccc--cc-- Q lcl|Aclame:pro 135 -VEADITKLD---------GLQ----TAIDKF--------NDE-DLEPMVLFVNPLDAGGLRTSASDNFTRPTQ--LG-- 187 (274) Q Consensus 135 -~~~~~~~~d---------~iv----~a~~~l--------~~~-~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~--~~-- 187 (274) ..++.++++ ... .....+ ..- -...+++|++|++.+.|.......+..... 
.+ T Consensus 326 ~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~g~~ 405 (521) T protein:vir:72 326 GSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQGLATGFS 405 (521) T ss_pred CccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEchHHHHHHhhccccccccccccccccc Confidence 112222211 111 111111 111 256789999999999886533222211111 01 Q ss_pred ---cccccccccchh-cceeeEEcCCCCcceEEEEcCCe------EEEEeccCceeeeccccccCccEEEEEEEEEEEEE Q lcl|Aclame:pro 188 ---DNIIVKGAFGEA-LGAVIVRSNKLNKGEALLAKKGA------VKLITKRDFFLEKDRDASRKSTALYSDKHYVAYLY 257 (274) Q Consensus 188 ---~~~~~~g~~~~i-~G~~Vv~s~~~p~~~~~l~~~~a------~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~ 257 (274) .+.+.. |.+ .|++|+++++.|.+-..+.-++. +-|+--.+.....--|++.++-.+-...||+..+ T Consensus 406 ~d~~~~~~~---G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~- 481 (521) T protein:vir:72 406 TDTTKSVFA---GVLGGKYRVYIDQYAKQDYFTVGYKGPNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI- 481 (521) T ss_pred ccCCCceEE---EEccCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCccccceeeeeeeeceee- Confidence 011222 233 46899999999876655543432 2233333444344568889999999999998764 Q ss_pred cCcc-------eEEEEeCCCcccC Q lcl|Aclame:pro 258 DESK-------VVKITKGAGDEVM 274 (274) Q Consensus 258 ~~~a-------vv~l~~~aa~~~~ 274 (274) ||-. ..+|...-++--. 
T Consensus 482 NP~~~~~~~~~a~~i~~~~~~~~a 505 (521) T protein:vir:72 482 NPFAESAAQAPASRIQSGMPSILN 505 (521) T ss_pred cCcccccCcccceeecCcChhhhc Confidence 4532 2334433333221 No 228 >protein:vir:3424 Length: 341 # NCBI annotation: capsid component # Family: family:all:1021 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040587;genbank:gi:9626251;genbank:GeneID:2703482 Probab=94.61 E-value=0.0039 Score=33.76 Aligned_cols=258 Identities=14% Similarity=0.028 Sum_probs=118.9 Q ss_pred hhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCC-CCcccccCCCcccc-cccccceeEEeehhh Q lcl|Aclame:pro 9 SNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYS-GDAQVIAEGEKIPV-DQIGTSKREAKVRKI 86 (274) Q Consensus 9 ~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~-~~a~~~~eg~~~~~-~~~~~~~~~~~~~~~ 86 (274) -|+|.+..+..++.........+....=... ...+-++|.+-..... .-+..+.++.+-+. ..-.+.....++-++ T Consensus 1 ~d~f~~~~L~~~i~~~~~~~~~l~d~~fp~~--~~~~t~~v~~~~~~~~~~lap~v~~~~~~~~~~~~~~~~~~~~~p~i 78 (341) T protein:vir:34 1 MSMYTTAQLLAANEQKFKFDPLFLRLFFRES--YPFTTEKVYLSQIPGLVNMALYVSPIVSGEVIRSRGGSTSEFTPGYV 78 (341) T ss_pred CCCcCHHHHHHHHHhccCccchhHHhcCCcc--cccccceEEEEEeeCCeeEEEeecCCCCcceeccCceeeeEEecCcc Confidence 7888888888877655433322222210000 0011123433222211 11222333332211 112233334444444 Q ss_pred hcchhccHHHHh--cc------CccHHHHHHHHH-------HHHHHHHHHHHHHHHhccccc------------------ Q lcl|Aclame:pro 87 GKGTELTDEAVL--SG------FGDPQGEAVRQH-------GLAIANKVDNDVLEALKGATL------------------ 133 (274) Q Consensus 87 ~~~~~is~e~~~--~s------~~d~~~~~~~~~-------a~~~a~~~d~~~i~~~~~a~~------------------ 133 (274) +....++-+++. .. ..+..+.+.+.+ .+.+.+.+|..+...+.++.. 
T Consensus 79 ~~~~~i~~~d~~~r~~g~~~~~~~~~~~~~~~~i~~~l~~l~~~i~~~~E~m~~qaL~~Gki~~~~~g~~~~~vDfg~~~ 158 (341) T protein:vir:34 79 KPKHEVNPQMTLRRLPDEDPQNLADPAYRRRRIIMQNMRDEELAIAQVEEMQAVSAVLKGKYTMTGEAFDPVEVDMGRSE 158 (341) T ss_pred CccceeCHHHHHHHhhccccccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEEecCCccEEEEEeCCCC Confidence 444444433321 11 112333333333 334555566666666642211 Q ss_pred ------cccC-----cccCHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhcccc-cccccccccc-------cccc Q lcl|Aclame:pro 134 ------TVEA-----DITKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNF-TRPTQLGDNI-------IVKG 194 (274) Q Consensus 134 ------~~~~-----~~~~~d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~-~~~~~~~~~~-------~~~g 194 (274) ++.. +...++.+-+....+...+..+..++|+++++..|+++....- ........+. +..+ T Consensus 159 ~~~~~~t~~~~W~~~~~~~~d~l~di~~~~~~~g~~~~~~i~~~~~~~~l~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~ 238 (341) T protein:vir:34 159 ENNITQSGGTEWSKRDKSTYDPTDDIEAYALNASGVVNIIVFDPKGWALFRSFKAVKEKLDTRRGSNSELETAVKDLGKA 238 (341) T ss_pred ccceEecCCccCCcCCCchHHHHHHHHHHHHhcCCceEEEEeCHHHHHHHhcCHHHHHHHhhcccccccccccccccccc Confidence 0111 1123466666666777778889999999999999987654321 1111111110 1111 Q ss_pred --ccchhcceeeEEc-----------CCCCcceEEEEcCCeEE---EEeccCc------eeeeccc------c-ccCccE Q lcl|Aclame:pro 195 --AFGEALGAVIVRS-----------NKLNKGEALLAKKGAVK---LITKRDF------FLEKDRD------A-SRKSTA 245 (274) Q Consensus 195 --~~~~i~G~~Vv~s-----------~~~p~~~~~l~~~~a~~---~~~~~~~------~ve~~r~------~-~~~~~~ 245 (274) ..+++.|+++++- +.+|.+.++++..+..+ |..-.+. .....+. . +-.... 
T Consensus 239 ~~~~~~~~g~~i~~y~~~y~ddG~~~~~ip~~~v~l~p~g~~g~~~yg~~~d~~~~~~~~~~~~~~~~~~~~~~dp~~~~ 318 (341) T protein:vir:34 239 VSYKGMYGDVAIVVYSGQYVENGVKKNFLPDNTMVLGNTQARGLRTYGCIQDADAQREGINASARYPKNWVTTGDPAREF 318 (341) T ss_pred eeeeeecCCceEEEEcCEEEECCcEEeeecCCeEEEeeCCCcceEEEeecccccccccceeeeeEeeeeeeecCCCcEEE Confidence 1235667777642 23788888887665433 2211110 0011111 1 111233 Q ss_pred EEEEEEEEEEEEcCcceEEEEeC Q lcl|Aclame:pro 246 LYSDKHYVAYLYDESKVVKITKG 268 (274) Q Consensus 246 i~~~~~~~~~v~~~~avv~l~~~ 268 (274) +.+..+.=..+.+|+++++.+++ T Consensus 319 ~~~~s~pLPv~~~pd~~~~a~V~ 341 (341) T protein:vir:34 319 TMIQSAPLMLLADPDEFVSVQLA 341 (341) T ss_pred EEEcccceeeeeCCCcEEEEEeC Confidence 44455555666799999999999 No 229 >protein:vir:348 Length: 321 # NCBI annotation: major virion structural protein # Family: family:all:3198 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203462;genbank:gi:15320618;genbank:GeneID:921734 Probab=94.27 E-value=0.0049 Score=33.25 Aligned_cols=255 Identities=13% Similarity=0.070 Sum_probs=133.6 Q ss_pred CCccccchhhccchHHHHHHHH-------HHHHHhhh-hcccccccccccccCCCEEEEEeecC-CCCcccccCCCcccc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQ-------AELDKKLR-FAQFADIDSTLVGQPGDTLTFPAFTY-SGDAQVIAEGEKIPV 71 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~-------~~~~~~~~-~~~l~~~~~~~~~~~G~~v~ip~~~~-~~~a~~~~eg~~~~~ 71 (274) |.-.. ..|.....+. +.+.++.. +.-|..........+|.+|..|-.-. ..++.|+..-+.+.. 
T Consensus 1 mp~~~-------lsel~t~tl~~rs~~~~D~v~~~n~LL~~L~~kG~~~~~~gg~~I~~~l~y~~~s~~~wy~Gyd~l~~ 73 (321) T protein:vir:34 1 MPFPN-------ISDIITTTIESRSGVIADNVTKNNAILARLAKRGKPRLVSGGYTILEELSFSGNSNGGWYSGYDVLPT 73 (321) T ss_pred CCCch-------HHHHHHHHHHhhcchhhhhhhcccHHHHHHHhcCcccccCCCeeEEEEEeeccCcceeEEEeeeeecc Confidence 55311 1121111111 11111110 11111110111123567788776433 567888875555544 Q ss_pred c-ccccceeEEeehhhhcchhccHHHHhcc-----CccHHHHHHHHHHHHHHHHHHHHHHHHhcc-----------cc-- Q lcl|Aclame:pro 72 D-QIGTSKREAKVRKIGKGTELTDEAVLSG-----FGDPQGEAVRQHGLAIANKVDNDVLEALKG-----------AT-- 132 (274) Q Consensus 72 ~-~~~~~~~~~~~~~~~~~~~is~e~~~~s-----~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~-----------a~-- 132 (274) . .-.+++.+...++....+.||-+.+..+ .+|+++.-.+..-+.++..++..+.....+ .. T Consensus 74 ~p~d~~~~Aef~wk~aa~~~~isg~e~l~n~g~~~~idll~~~~~~ae~t~~n~l~~~l~sdGTa~g~~~i~GL~~lv~~ 153 (321) T protein:vir:34 74 APQDVISSAEYALKQYAVPVVISGLEMLQNSGKEAQLDLLEARMNVAEATMANDISAALYGDGTAFGGRAINGLDGAVPV 153 (321) T ss_pred chhhhccccccchhheeEeeEEehhHHhhccchHHHHHHHHHHHHHHHHHHHhhhhHhhhccccccccchhhhhhhhccc Confidence 4 4556777888899888899988776654 356777777777777888888877642221 11 Q ss_pred --cccc-------------------CcccCHHHHHHHHHHH----hhcCCCccEEEEcHHHHHHHHhhhc--cccccccc Q lcl|Aclame:pro 133 --LTVE-------------------ADITKLDGLQTAIDKF----NDEDLEPMVLFVNPLDAGGLRTSAS--DNFTRPTQ 185 (274) Q Consensus 133 --~~~~-------------------~~~~~~d~iv~a~~~l----~~~~~~~~~~v~~p~~~~~L~~~~~--~~~~~~~~ 185 (274) .+++ +++.+..++..+...+ ......|..|++..+.|...++.-. ..+... 
T Consensus 154 ~p~tGtvGGIdra~~~~WRn~~~d~~~~~t~~tl~~~m~~~w~~~~Rg~~~PDlii~~~~~y~~y~~s~q~~qR~~~~-- 231 (321) T protein:vir:34 154 DPTVGTYGGINRALWPFWRSQVEDMAAVATINTIQPAMTKLWSRCVRGADMPDLIMSGNDAWTTYSNSLQVLQRFTSA-- 231 (321) T ss_pred CCCCceeccccccchhhhhhhhhhhhhcccHHHHHHHHHHHHHhhccCCCCccEEEechHHHHHHHHhhheeeeeccc-- Confidence 1110 1112344555554443 2345688999999998887765321 111111 Q ss_pred ccccccccccc-chhcceeeEEcC----CCCcceEEEEcCCeEEEEeccCce---eeeccccccCccEEEEEEEEEEEEE Q lcl|Aclame:pro 186 LGDNIIVKGAF-GEALGAVIVRSN----KLNKGEALLAKKGAVKLITKRDFF---LEKDRDASRKSTALYSDKHYVAYLY 257 (274) Q Consensus 186 ~~~~~~~~g~~-~~i~G~~Vv~s~----~~p~~~~~l~~~~a~~~~~~~~~~---ve~~r~~~~~~~~i~~~~~~~~~v~ 257 (274) +....|.. -.+.|..|+.++ .+|.+++|.+..+.+.+..-++-. +.+.|-.-..++.++-...+-+.++ T Consensus 232 ---~~a~~Gf~~Lky~~~div~D~~~g~~~pan~~yfiNT~yl~~r~h~~~~~~pi~p~r~~~~NqdA~~q~I~~~GnL~ 308 (321) T protein:vir:34 232 ---EEANLGFRSLKFLSTDVVLDGGIGGFAGANTMYFLNTKYLHFRPHKDRNMVPLSPSRRAAFNQDAEAQILAWAGNLT 308 (321) T ss_pred ---ccccccceeeeeeeEEEEEeCCCCCCccccceeeeecceEEEEEcCCCceeecCcccccccchhHHhhhhhhhheee Confidence 11111211 258899999988 589999999999998876433322 2233311234454444333333333 Q ss_pred cCcceEEEEeCCCcccC Q lcl|Aclame:pro 258 DESKVVKITKGAGDEVM 274 (274) Q Consensus 258 ~~~avv~l~~~aa~~~~ 274 (274) +...-+++|| T Consensus 309 -------~sn~~~~~vL 318 (321) T protein:vir:34 309 -------CSGAQFQGRL 318 (321) T ss_pred -------eecccceeEE Confidence 2333344444 No 230 >protein:vir:106590 Length: 349 # NCBI annotation: putative major head protein # Family: family:all:1083 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958585;genbank:gi:41179245;genbank:GeneID:2717126 Probab=94.12 E-value=0.0053 Score=33.04 Aligned_cols=260 Identities=10% Similarity=0.023 Sum_probs=109.5 Q ss_pred CCccccch---------hhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecC-CC-CcccccCCCcc Q lcl|Aclame:pro 1 MAQGTTKV---------SNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTY-SG-DAQVIAEGEKI 69 (274) Q Consensus 1 
ma~~~T~~---------~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~-~~-~a~~~~eg~~~ 69 (274) |-|+..+. .++|.+..+..++.+.-.+..+...++.... ..+..+.+..... .+ .+..++.+++. T Consensus 1 ~~~~~~~~~~~~~~~~~~d~~~~~~l~~~~~~~~~~~~l~~~~Fp~~~----~~~~~~~~~~~~~~~~~~a~~v~~~~~~ 76 (349) T protein:vir:10 1 MKNQKLQLDLQRFATPILDMFSQNTVLDYTRNRQYPEMLGDTLFPAVK----VPTLEVDILKAGSRVPTIASVSAFDAEA 76 (349) T ss_pred CCcchhhHHHHHHHHHhhcccCHHHHHHHHHhcCcchhhHhhcCCccc----cccceeEEEeeccCcceeeeeecCCCCc Confidence 77654432 3456666666666543222222222222111 1111222222111 11 12334444444 Q ss_pred cccccccceeEEeehhhhcchhccHHHHh--cc--CccHHHHHHH-------HHHHHHHHHHHHHHHHHhcccc------ Q lcl|Aclame:pro 70 PVDQIGTSKREAKVRKIGKGTELTDEAVL--SG--FGDPQGEAVR-------QHGLAIANKVDNDVLEALKGAT------ 132 (274) Q Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~is~e~~~--~s--~~d~~~~~~~-------~~a~~~a~~~d~~~i~~~~~a~------ 132 (274) +..+-........+-.++....++.+.+. .+ .......+.+ .+.+.+.+.+|..+...+.++. T Consensus 77 ~~~~r~~~~~~~~~p~ik~~~~i~e~dl~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~q~l~~Gki~~~~~ 156 (349) T protein:vir:10 77 EIGTREASKMTAELAYVKRKMQITEEMLIKLQSPRNTAEENYLKQYVFDDIDAMVQAVKARGEKMTMEMFATGKITDKKN 156 (349) T ss_pred ceecccceeEEeeccccccccccCHHHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeeEEcCC Confidence 43332222333333333434444433322 11 1222233333 3344455555655666554321 Q ss_pred ---------------ccccCccc--C---HHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhcccc-ccccccccccc Q lcl|Aclame:pro 133 ---------------LTVEADIT--K---LDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNF-TRPTQLGDNII 191 (274) Q Consensus 133 ---------------~~~~~~~~--~---~d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~-~~~~~~~~~~~ 191 (274) .++...+. 
+ +++|.+.+ ...+..++.++|+++++..|++++...- ...+..+ ..+ T Consensus 157 g~~vD~g~~~~~~~~lt~~~~Ws~~~adpi~Di~~~~---~~~g~~p~~~vm~~~~~~~l~~~~~i~~~~~~~~~~-~~~ 232 (349) T protein:vir:10 157 GIAIDYGVPKKHQETLSGTKTWDKSDASIIDNLQDWS---DSLDVTPTRALTSKKVLRILMRSTEIKEAIFGKDTG-RVV 232 (349) T ss_pred cEEEecccCccceeEecCcccCCCCCCCHHHHHHHHH---HHhCCCccEEEeCHHHHHHHhcCHHHHHHhcccccc-ccc Confidence 11111111 1 33444443 3346678999999999999987644321 1111111 111 Q ss_pred ----cccccchhcceeeEEc----------------CCCCcceEEEEcCCeEE---EEec---cCc-----e-------- Q lcl|Aclame:pro 192 ----VKGAFGEALGAVIVRS----------------NKLNKGEALLAKKGAVK---LITK---RDF-----F-------- 232 (274) Q Consensus 192 ----~~g~~~~i~G~~Vv~s----------------~~~p~~~~~l~~~~a~~---~~~~---~~~-----~-------- 232 (274) .....+.+.|++|++- +.+|.+.+++...+..| |..- .+. . T Consensus 233 ~~~~~~~~l~~~~~~~i~~yd~~y~d~~~~~~~t~~~~~p~~~v~l~~~~~~G~~~yG~~~e~~~~~~g~~~~~~~~~~~ 312 (349) T protein:vir:10 233 GQADLDQWMTAQGLPIIRAYDGKYRDEDSRGNLTTNSYFPEDRIVLFNDEVPGQKIYGPTPEENRLISSNAQVSNVGNIM 312 (349) T ss_pred CHHHHHHHHHhcCCceEEEEeeEEEeecCCCceeecccccCCeEEEecCCCceeEEeeccchhhhhcccccceeeccceE Confidence 1222334556556542 13567777666554443 2111 000 0 Q ss_pred -eeeccccccCccEEEEEEEEEEEEEcCcceEEEEeC Q lcl|Aclame:pro 233 -LEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKG 268 (274) Q Consensus 233 -ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~ 268 (274) ..+..+.+-....+.+..+.=..+.+|++++++++= T Consensus 313 ~~~~~~~~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl 349 (349) T protein:vir:10 313 AKIYETSEDPIGTWILASATMLPSFASADDVFQAKVL 349 (349) T ss_pred EEeeeecCCCceEEEEEeeeeeeeecCCCcEEEEEeC Confidence 000001111122334444444556688888887776 No 231 >protein:vir:103463 Length: 521 # NCBI annotation: major head subunit precursor # Family: family:all:364 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803115;genbank:gi:116326395;genbank:GeneID:4405492 Probab=94.02 E-value=0.0056 Score=32.91 Aligned_cols=272 Identities=8% Similarity=-0.009 Sum_probs=130.4 Q ss_pred 
CCccccch-hhccchH--HHHHHHHHHHHHhhhhccccccccc----cc--ccCCCEEEEEeecCCCCcccc-----cCC Q lcl|Aclame:pro 1 MAQGTTKV-SNLIVPE--VLAPMMQAELDKKLRFAQFADIDST----LV--GQPGDTLTFPAFTYSGDAQVI-----AEG 66 (274) Q Consensus 1 ma~~~T~~-~~~~iPe--~~~~~v~~~~~~~~~~~~l~~~~~~----~~--~~~G~~v~ip~~~~~~~a~~~-----~eg 66 (274) .+..++.. .+.+-.. .......+...... .....+.... .. ...+....+..--....+|-. .-+ T Consensus 166 ~~~~~~~~~Gd~~~~~~~~~g~~~~~~~~~~t-~~~t~~d~~~~~~~~~~~~~~~~~y~~~~GmsTa~aEal~~~g~ss~ 244 (521) T protein:vir:10 166 LAASTQTTVGDIYTHFFQDTGTVYLQASAQVT-ISSTADDAAKLDAEIKKQMEAGALVEIAEGMATSIAELQESFNGSTD 244 (521) T ss_pred ccccccccccccccccccccccceeccccccc-CCCcccccccccccccccccccceeecccccchhhHhhhccCCCCcc Confidence 22222221 1111110 00000000000000 0000000000 00 001111222111011111111 113 Q ss_pred CcccccccccceeEEeehhhhcchhccHHHHhc----cCccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-------- Q lcl|Aclame:pro 67 EKIPVDQIGTSKREAKVRKIGKGTELTDEAVLS----GFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLT-------- 134 (274) Q Consensus 67 ~~~~~~~~~~~~~~~~~~~~~~~~~is~e~~~~----s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~-------- 134 (274) ..+++-.++.+.++++.+.++-.-++|-|+.+| -..|.+..+.+-|+..|+..|++++|..+.-.... T Consensus 245 ~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~~g~t~~ 324 (521) T protein:vir:10 245 NPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKSGMTLT 324 (521) T ss_pred ccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhheeeeeeeeeeec Confidence 457888888899999999888888888887655 35899999999999999999999999766422111 Q ss_pred --ccCcccCHH---------HHH----HHHHHH--------hhc-CCCccEEEEcHHHHHHHHhhhcccccccccccccc Q lcl|Aclame:pro 135 --VEADITKLD---------GLQ----TAIDKF--------NDE-DLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNI 190 (274) Q Consensus 135 --~~~~~~~~d---------~iv----~a~~~l--------~~~-~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~ 190 (274) ..++.++++ ... .....+ ..- -...+++|++|++.+.|.......+........+. 
T Consensus 325 ~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~g~ 404 (521) T protein:vir:10 325 PGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQGLATGF 404 (521) T ss_pred cCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEchHHHHHHhhcccccccccccccccc Confidence 112222211 111 111111 111 25678999999999988754322222211111111 Q ss_pred ccccc----cchh-cceeeEEcCCCCcceEEEEcCCe------EEEEeccCceeeeccccccCccEEEEEEEEEEEEEcC Q lcl|Aclame:pro 191 IVKGA----FGEA-LGAVIVRSNKLNKGEALLAKKGA------VKLITKRDFFLEKDRDASRKSTALYSDKHYVAYLYDE 259 (274) Q Consensus 191 ~~~g~----~~~i-~G~~Vv~s~~~p~~~~~l~~~~a------~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~ 259 (274) ..+.+ .|.+ .|++|+++++.|.+-..+.-++. +-|+--.+.....--|++.++-.+-...||+..+ || T Consensus 405 ~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP 483 (521) T protein:vir:10 405 NTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGPNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NP 483 (521) T ss_pred cccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCccccceeeeeeeeceee-cC Confidence 11111 1344 36799999999876655543432 2233333444344568889999999999998764 45 Q ss_pred cceE-------EEEeCCCccc-----C Q lcl|Aclame:pro 260 SKVV-------KITKGAGDEV-----M 274 (274) Q Consensus 260 ~avv-------~l~~~aa~~~-----~ 274 (274) -+.- .|...-++.- | T Consensus 484 ~~~~~~~~~~~~i~~~~~~~~a~~~~~ 510 (521) T protein:vir:10 484 FAESAAQAPASRIQSGMPSILNSLGKN 510 (521) T ss_pred cccccCCccceeecccchhhhcccccc Confidence 3321 1222211110 1 No 232 >protein:vir:393 Length: 341 # NCBI annotation: gp8 # Family: family:all:1021 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046903;genbank:gi:9630472;genbank:GeneID:1261647 Probab=93.87 E-value=0.0061 Score=32.72 Aligned_cols=257 Identities=13% Similarity=0.019 Sum_probs=116.6 Q ss_pred hhccchHHHHHHHHHHHHHhhhhccc-ccccccccccCCCEEEEEeecCC-CCcccccCCCcccc-cccccceeEEeehh Q lcl|Aclame:pro 9 SNLIVPEVLAPMMQAELDKKLRFAQF-ADIDSTLVGQPGDTLTFPAFTYS-GDAQVIAEGEKIPV-DQIGTSKREAKVRK 
85 (274) Q Consensus 9 ~~~~iPe~~~~~v~~~~~~~~~~~~l-~~~~~~~~~~~G~~v~ip~~~~~-~~a~~~~eg~~~~~-~~~~~~~~~~~~~~ 85 (274) -|+|.+..+..++.+.......+... +... ...+.+.|.+-..... .-+..+.++.+-+. ..-.+.....++-+ T Consensus 1 ~d~f~~~~L~~~i~~~~~~~~~l~~~~Fp~~---~~~~t~~v~~~~~~~~~~lap~v~~~~~~~~~~~~~~~~~~~~~p~ 77 (341) T protein:vir:39 1 MSVYTTAQLLAVNEKKFKFDPLFLRIFFRET---YPFSTEKVYLSQIPGLVNMALYVSPIVSGKVIRSRGGSTSEFTPGY 77 (341) T ss_pred CCccCHHHHHHHHHhhcCccchhHhhcCCcc---cccCcceEEEEEecCCceeeEEecCCCCcceecccceeeeeEeccc Confidence 77888888888887654433322222 2110 0111223433322211 11222333332221 12223334444545 Q ss_pred hhcchhccHHHHh--cc------CccHHHHHHH-------HHHHHHHHHHHHHHHHHhccccc----------------- Q lcl|Aclame:pro 86 IGKGTELTDEAVL--SG------FGDPQGEAVR-------QHGLAIANKVDNDVLEALKGATL----------------- 133 (274) Q Consensus 86 ~~~~~~is~e~~~--~s------~~d~~~~~~~-------~~a~~~a~~~d~~~i~~~~~a~~----------------- 133 (274) ++....++-++.. .. .-+..+...+ .+.+.+.+.+|..+...+.++.. T Consensus 78 i~~~~~i~~~d~~~r~~g~~~~~~~~~~~~~~~~i~~~~~~l~~~i~~r~E~m~~qaL~~Gki~i~~~g~~~~~vDfg~~ 157 (341) T protein:vir:39 78 VKPKHEVNPLMTLRRLPDEDPQNLADPVYRRRRIILQNMKDEELAIAQVEEKQAVAAVLSGKYTMTGEAFEPVEVDMGRS 157 (341) T ss_pred cCcccccCHHHHHHHhhcccccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCceEEEcCCCcEEEEeccCC Confidence 5444444433322 11 1122222222 23344445555555555532110 Q ss_pred -------cccCcc-----cCHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHhhhccccc-cccccccccc-------cc Q lcl|Aclame:pro 134 -------TVEADI-----TKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFT-RPTQLGDNII-------VK 193 (274) Q Consensus 134 -------~~~~~~-----~~~d~iv~a~~~l~~~~~~~~~~v~~p~~~~~L~~~~~~~~~-~~~~~~~~~~-------~~ 193 (274) ++...+ ...+.+-+....+...+..+..++|+++.+..|++++...-. .......+.+ .. 
T Consensus 158 ~~~~~~lt~~~~W~~~~~~~~d~l~di~~~~~~~g~~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~~~~~~~~~~~~~ 237 (341) T protein:vir:39 158 AGNNIVQAGAAAWSSRDKETYDPTDDIEAYALNASGVVNIIVFDPKGWALFRSFKAVKEKLDTRRGSNSELETALKDLGK 237 (341) T ss_pred ccceeEecCCccCCCCCCchHHHHHHHHHHHHhcCCceEEEEeChHHHHHHhcCHHHHHHHhhcccccccccchhhhhhh Confidence 011111 124555555556666777889999999999999876442211 1111111111 11 Q ss_pred c--ccchhcceeeEEc-----------CCCCcceEEEEcCCeEE---EEeccCc------eeeecccc-------ccCcc Q lcl|Aclame:pro 194 G--AFGEALGAVIVRS-----------NKLNKGEALLAKKGAVK---LITKRDF------FLEKDRDA-------SRKST 244 (274) Q Consensus 194 g--~~~~i~G~~Vv~s-----------~~~p~~~~~l~~~~a~~---~~~~~~~------~ve~~r~~-------~~~~~ 244 (274) | ..+++.|+++++= +.+|++.++++..+..+ |..-.+. .....+.+ +-... T Consensus 238 ~~~~~~~~~g~~i~~y~~~y~d~g~~~~~ip~~~~~l~p~~~~g~~~yg~~~d~~~~~~~~~~~~~~~~~~~~~~dp~~~ 317 (341) T protein:vir:39 238 AVSYKGMYGDVAIVVYSGQYIENDVKKNYLPDLTMVLGNTQARGLRTYGCILDADAQREGINASTRYPKNWVQTGDPARE 317 (341) T ss_pred HhhhhhhhcCceEEEEccEEEecCcEEeeecCCeEEEeeCCCcceEEEecccchhhcccceeeeeeeeeeeeecCCCcEE Confidence 1 1235678777652 23788888777655433 2211110 01111111 11123 Q ss_pred EEEEEEEEEEEEEcCcceEEEEeC Q lcl|Aclame:pro 245 ALYSDKHYVAYLYDESKVVKITKG 268 (274) Q Consensus 245 ~i~~~~~~~~~v~~~~avv~l~~~ 268 (274) .+.+-.+.=..+.+|+++++++++ T Consensus 318 ~~~~~s~plPv~~~p~~~~~a~V~ 341 (341) T protein:vir:39 318 FTMIQSAPLMLLADPDEFVSVKLA 341 (341) T ss_pred EEEEeccccceeeCCCcEEEEEeC Confidence 344444545566799999999999 No 233 >protein:vir:98143 Length: 524 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239203;genbank:gi:66391678;genbank:GeneID:3416245 Probab=93.71 E-value=0.0066 Score=32.53 Aligned_cols=268 Identities=10% Similarity=0.040 Sum_probs=128.2 Q ss_pred CCccccchhhccchHHHHHHHH---HHHHHhhhhcccccccccc---------cc-cCCCEEEEEeecCCCCcccc---- Q lcl|Aclame:pro 1 
MAQGTTKVSNLIVPEVLAPMMQ---AELDKKLRFAQFADIDSTL---------VG-QPGDTLTFPAFTYSGDAQVI---- 63 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~---~~~~~~~~~~~l~~~~~~~---------~~-~~G~~v~ip~~~~~~~a~~~---- 63 (274) .++.++. ............+. ...-... ..+.......- .. ..|....+..--....++.. T Consensus 167 ~s~~~~g-~~~~~g~~~~~~~~~~g~~~~~~~-~~g~~~~tgt~p~~~~~a~~~~~~~g~~~~~~~GmsTA~aEaL~~~g 244 (524) T protein:vir:98 167 FAKITTG-TAIATGAIVYHIFQETGIAYFQNV-TSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFN 244 (524) T ss_pred ccccccc-cccccccccccccccccceecccc-ccCcccccccccccccccccccccccceeecccccchhhhhhhccCC Confidence 1111100 00000000000000 0000000 00000000000 00 01111122111111112211 Q ss_pred -cCCCcccccccccceeEEeehhhhcchhccHHHHhc----cCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccc--- Q lcl|Aclame:pro 64 -AEGEKIPVDQIGTSKREAKVRKIGKGTELTDEAVLS----GFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTV--- 135 (274) Q Consensus 64 -~eg~~~~~~~~~~~~~~~~~~~~~~~~~is~e~~~~----s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~--- 135 (274) ..+..+++-.++.+.++++.+.++-.-++|-|+.+| -..|.+..+.+-|+..|+..|++++|..+....... T Consensus 245 ~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILSTEImlEINReii~~i~~~a~~~~~g 324 (524) T protein:vir:98 245 GSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSG 324 (524) T ss_pred CCccccccceeeEEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhhheeceee Confidence 124557888888899999999888888888887654 358999999999999999999999997765322110 Q ss_pred -------cCcccCH-------------HHHHHHHHHHh--------hcC-CCccEEEEcHHHHHHHHhhhcccccccccc Q lcl|Aclame:pro 136 -------EADITKL-------------DGLQTAIDKFN--------DED-LEPMVLFVNPLDAGGLRTSASDNFTRPTQL 186 (274) Q Consensus 136 -------~~~~~~~-------------d~iv~a~~~l~--------~~~-~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~ 186 (274) .++.++. +.+-.....+. .-. ...+++|++|++.+.|..... -+...+.. 
T Consensus 325 ~t~~~~~~~G~~dl~~~~d~~~~r~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~~i~S~~Va~~L~~~~~-g~~~~s~~ 403 (524) T protein:vir:98 325 FTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDS-GITPASQG 403 (524) T ss_pred cccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhhhc-ccccccch Confidence 1121111 11111222221 112 357899999999998864211 11111110 Q ss_pred cc--------ccccccccchhcceeeEEcCCCCcceEEEEcCCe------EEEEeccCceeeeccccccCccEEEEEEEE Q lcl|Aclame:pro 187 GD--------NIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGA------VKLITKRDFFLEKDRDASRKSTALYSDKHY 252 (274) Q Consensus 187 ~~--------~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~a------~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~ 252 (274) .+ ..+..|.++ .|++|+++++.|.+-..+.-++. +-|+--.+.....--|++.++-.+-...|| T Consensus 404 ~~~~~~~d~~~~~~~G~l~--~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY 481 (524) T protein:vir:98 404 LQKTLNVDTTKAVFAGVLG--GTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRY 481 (524) T ss_pred hhcccccCCccceEEEEec--CceEEEecCCCCcceEEEEeeCCcccccceeeccccccccccccCCccccceeeeeeee Confidence 00 112223332 47899999999876655543432 223333343333446888999999999999 Q ss_pred EEEEEcCcceE-------EEEeCCCc------ccC Q lcl|Aclame:pro 253 VAYLYDESKVV-------KITKGAGD------EVM 274 (274) Q Consensus 253 ~~~v~~~~avv-------~l~~~aa~------~~~ 274 (274) +..+ ||-..- ++...... 
..| T Consensus 482 ~l~~-NP~~~~~~~~~~~ri~~g~~~~~~ag~n~~ 515 (524) T protein:vir:98 482 GIGI-NPFANSRSQAPADRITSGMISKEMCGKNAY 515 (524) T ss_pred ceee-cCcccccCCccccccccCcchHhhcCccce Confidence 8764 453321 22222221 222 No 234 >protein:vir:80986 Length: 528 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469506;genbank:gi:157311463;genbank:GeneID:5602119 Probab=93.60 E-value=0.007 Score=32.40 Aligned_cols=269 Identities=9% Similarity=0.019 Sum_probs=128.5 Q ss_pred CCccc-cchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCC-----------EEEEEeecCCCCccc---c-- Q lcl|Aclame:pro 1 MAQGT-TKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGD-----------TLTFPAFTYSGDAQV---I-- 63 (274) Q Consensus 1 ma~~~-T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~-----------~v~ip~~~~~~~a~~---~-- 63 (274) ++..+ ++.++.+..+.-..-. -.. ...+...+.. ....+..++ ..++..--....++. . T Consensus 174 ~~~~~~~~~G~~~~~t~~~tg~-~~~-~~~~~~~~~~--~~~gt~~~~~~~~~~~~~~~~~~~~~Gm~Ta~AE~le~lg~ 249 (528) T protein:vir:80 174 LAIGTQIEAGDIVHHTFAETGI-AYL-QNVTAEQVTP--TKAGSESEDEVVMKLMEEGKLAEIAFGMATSIAEIQEGFNG 249 (528) T ss_pred ccccccccccceeccccccccc-ccc-ccccccccCc--cccCCcccccccccccccccccccccccchhhhhhhcccCC Confidence 11000 0011111100000000 000 0000000000 000000111 111111000111111 1 Q ss_pred cCCCcccccccccceeEEeehhhhcchhccHHHHhc----cCccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc----- Q lcl|Aclame:pro 64 AEGEKIPVDQIGTSKREAKVRKIGKGTELTDEAVLS----GFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLT----- 134 (274) Q Consensus 64 ~eg~~~~~~~~~~~~~~~~~~~~~~~~~is~e~~~~----s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~----- 134 (274) ..+..+++-..+.+.++++.+.++-.-++|-|+.+| -..|.+..+.+-|+..|...|++++|..++..... 
T Consensus 250 ss~~~f~EMaFsIEKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILStEImlEINReii~~i~~~a~~~~~~~ 329 (528) T protein:vir:80 250 SSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGM 329 (528) T ss_pred CccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhheeeeeeeee Confidence 113446777888888999999888888888887655 25899999999999999999999999777432211 Q ss_pred -----ccCcccC-------------HHHHHHHHHHHhh---------cCCCccEEEEcHHHHHHHHhhhccccccccc-- Q lcl|Aclame:pro 135 -----VEADITK-------------LDGLQTAIDKFND---------EDLEPMVLFVNPLDAGGLRTSASDNFTRPTQ-- 185 (274) Q Consensus 135 -----~~~~~~~-------------~d~iv~a~~~l~~---------~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~-- 185 (274) ..++.++ .+.+-.....+.. .....++++++|++...|.......+..... T Consensus 330 t~~~~~~~G~~dl~~~~d~~g~r~~~e~~k~L~~~i~~~an~I~~~T~~~~gn~vi~S~~Va~~L~~~g~~~~~~~~~~~ 409 (528) T protein:vir:80 330 TQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAA 409 (528) T ss_pred eeccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhccccccccccccc Confidence 0111111 1222222222211 1235689999999999886543211111100 Q ss_pred ccc--ccccccccchhc-ceeeEEcCCCCcceEEEEcCCe------EEEEeccCceeeeccccccCccEEEEEEEEEEEE Q lcl|Aclame:pro 186 LGD--NIIVKGAFGEAL-GAVIVRSNKLNKGEALLAKKGA------VKLITKRDFFLEKDRDASRKSTALYSDKHYVAYL 256 (274) Q Consensus 186 ~~~--~~~~~g~~~~i~-G~~Vv~s~~~p~~~~~l~~~~a------~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v 256 (274) .+. +....-..|.+. |++|+++++.|.+-..+.-++. 
+-|+--.+.....-.|++.++-.+-...||+..+ T Consensus 410 ~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~~ 489 (528) T protein:vir:80 410 KGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVLGFKTRYGIGI 489 (528) T ss_pred cccccCCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeecccccceeeEeeCCccccceeeeeeeeceee Confidence 000 100000134443 6799999999876655543432 2344444555556678999999999999998764 Q ss_pred EcCcce-------EEEEeCCCc------ccC Q lcl|Aclame:pro 257 YDESKV-------VKITKGAGD------EVM 274 (274) Q Consensus 257 ~~~~av-------v~l~~~aa~------~~~ 274 (274) ||-+. .++....+. ..| T Consensus 490 -NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~ 519 (528) T protein:vir:80 490 -NPFADSKSQAPSARITSGMLSKDSVGKNAY 519 (528) T ss_pred -cCcccccCCcccccccccchhhhhcCccce Confidence 45332 122222222 122 No 235 >protein:vir:106998 Length: 468 # NCBI annotation: major capsid protein gp23 # Family: family:all:364 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195142;genbank:gi:58532919;uniprot:Q5GQN0;genbank:GeneID:3260495 Probab=93.60 E-value=0.007 Score=32.40 Aligned_cols=271 Identities=10% Similarity=0.004 Sum_probs=126.8 Q ss_pred CCccccchhhccchH---HHHHHHHH------HHHHhhhhcccccccc-cccccCCCEEEEEeecCCCCcccccC-CCcc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPE---VLAPMMQA------ELDKKLRFAQFADIDS-TLVGQPGDTLTFPAFTYSGDAQVIAE-GEKI 69 (274) Q Consensus 1 ma~~~T~~~~~~iPe---~~~~~v~~------~~~~~~~~~~l~~~~~-~~~~~~G~~v~ip~~~~~~~a~~~~e-g~~~ 69 (274) -.++..+ ..|.-| .|+..--. ................ 
......+...++..--....++..++ +.++ T Consensus 121 Y~n~~g~--EAf~nEadt~fSg~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~~~~~~g~gMsTa~aE~lG~~~~~f 198 (468) T protein:vir:10 121 YENQAGE--EALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLF 198 (468) T ss_pred ecCCCCc--cceeccccccccccccccccccccccccccccCCCCCcccccccccccccccccccchHHHhhcCCCCccc Confidence 1111100 000000 00000000 0000000000000000 00000011111111001112233333 3457 Q ss_pred cccccccceeEEeehhhhcchhccHHHHhc----cCccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc------ccCcc Q lcl|Aclame:pro 70 PVDQIGTSKREAKVRKIGKGTELTDEAVLS----GFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLT------VEADI 139 (274) Q Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~is~e~~~~----s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~------~~~~~ 139 (274) ++-..+.+.++++.+.++-.-++|-|+.+| -..|.+..+.+-|+..|+..|++++|..+...... ...+. T Consensus 199 ~EMaFsIeK~tVtAKSRaLKAeYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~va~~~k~~g~~~~Gv 278 (468) T protein:vir:10 199 REMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGI 278 (468) T ss_pred ceeeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhcHHHHHhHhhhhhheeccccccccc Confidence 777888888999998888888888887655 35899999999999999999999999877543322 11122 Q ss_pred cCH------HHHHH----HHHHH---------hhcCCCccEEEEcHHHHHHHHhhhccccccccccccc-----cccccc Q lcl|Aclame:pro 140 TKL------DGLQT----AIDKF---------NDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDN-----IIVKGA 195 (274) Q Consensus 140 ~~~------d~iv~----a~~~l---------~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~-----~~~~g~ 195 (274) +++ ..-++ ....+ ....+..++++++|.+.+.|......++....+.... .-..|. 
T Consensus 279 ~d~~~~~~~rw~~e~~k~L~~~i~~ean~i~~~T~rg~gn~ii~S~~Va~~L~~sG~l~~~~~~~~~~~~~~~~~D~tg~ 358 (468) T protein:vir:10 279 FDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGN 358 (468) T ss_pred ccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccccccEEEechhHHHHHhhcCcceecccccccccccccccccCcc Confidence 221 11111 11111 1123577899999999999875333333221111100 001111 Q ss_pred --cchh-cceeeEEcCCCC----cceEEEEcCCe------EEEEeccCceeeeccccccCccEEEEEEEEEEEEEcCcce Q lcl|Aclame:pro 196 --FGEA-LGAVIVRSNKLN----KGEALLAKKGA------VKLITKRDFFLEKDRDASRKSTALYSDKHYVAYLYDESKV 262 (274) Q Consensus 196 --~~~i-~G~~Vv~s~~~p----~~~~~l~~~~a------~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~av 262 (274) .|.+ .|++|+++++.. .+-..+.-++. +-|+--.+.....--|++.++-.+....||+..+ +|-.. T Consensus 359 ~~~G~l~~r~~vy~D~Ya~~~s~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP~~~ 437 (468) T protein:vir:10 359 LAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFKTRYGMVS-NPFVT 437 (468) T ss_pred eEEEEecCceEEEEccccccCCccceEEEEEecCcceeceeeeccccccccccccCCCcccceeeeeeeeceee-cccce Confidence 2333 368999997653 23222222222 2233333444334448889999999999998764 56432 Q ss_pred -EEEEeCCCcc-cC Q lcl|Aclame:pro 263 -VKITKGAGDE-VM 274 (274) Q Consensus 263 -v~l~~~aa~~-~~ 274 (274) -.++-..++. 
-| T Consensus 438 ~~~~~~g~~~~~~~ 451 (468) T protein:vir:10 438 TNGLYNGTPDGEAL 451 (468) T ss_pred eccccCCCcccccc Confidence 1223333332 11 No 236 >protein:vir:100603 Length: 529 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656387;genbank:gi:109290138;genbank:GeneID:4156581 Probab=93.60 E-value=0.007 Score=32.40 Aligned_cols=273 Identities=7% Similarity=0.019 Sum_probs=126.5 Q ss_pred CCccccchhhcc----chHHHH--HHHHHHHHH-hhhhcccc-----cccccccc-----------cCCCEEEEEeecCC Q lcl|Aclame:pro 1 MAQGTTKVSNLI----VPEVLA--PMMQAELDK-KLRFAQFA-----DIDSTLVG-----------QPGDTLTFPAFTYS 57 (274) Q Consensus 1 ma~~~T~~~~~~----iPe~~~--~~v~~~~~~-~~~~~~l~-----~~~~~~~~-----------~~G~~v~ip~~~~~ 57 (274) +..........+ .+...+ +-....+.+ ...+.+-. +......+ ..+....+..--.. T Consensus 160 ~~~~~~~~~~~~~~~t~~~a~~~~~~~~~~~nea~t~~s~~~tg~~~~~g~~~tg~~~~~~~~~~~a~~~~~~~~~gmsT 239 (529) T protein:vir:10 160 KGATTSSDGTPFAALTAGQAVATGDIVYHFFYESGSAYLQNVTGGNVTVGTNETGAALDALVSAKIAAGELAEIAEGMAT 239 (529) T ss_pred ccccccccccccccccccceeeccccceeeecccccccccccccccccccccccCCccccccccccccccccccccccch Confidence 110000000000 000000 000000000 00000000 00000000 00111111100000 Q ss_pred CCcccc-----cCCCcccccccccceeEEeehhhhcchhccHHHHhc----cCccHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 58 GDAQVI-----AEGEKIPVDQIGTSKREAKVRKIGKGTELTDEAVLS----GFGDPQGEAVRQHGLAIANKVDNDVLEAL 128 (274) Q Consensus 58 ~~a~~~-----~eg~~~~~~~~~~~~~~~~~~~~~~~~~is~e~~~~----s~~d~~~~~~~~~a~~~a~~~d~~~i~~~ 128 (274) ..+|.. 
..+..+++-.++.+.++++.+.++-.-++|-|+.+| -..|.+..+.+-|+..|...|++++|..+ T Consensus 240 a~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAvHGLDAEtELsNILStEImlEINReii~~i 319 (529) T protein:vir:10 240 SIAELRQGFNGTTDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWI 319 (529) T ss_pred hhhhccccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHh Confidence 111111 124457788888899999999888888888887655 25899999999999999999999999865 Q ss_pred cccccc----------ccCcccCHH-------------HHHHHHHHHhh---------cCCCccEEEEcHHHHHHHHhhh Q lcl|Aclame:pro 129 KGATLT----------VEADITKLD-------------GLQTAIDKFND---------EDLEPMVLFVNPLDAGGLRTSA 176 (274) Q Consensus 129 ~~a~~~----------~~~~~~~~d-------------~iv~a~~~l~~---------~~~~~~~~v~~p~~~~~L~~~~ 176 (274) .....- ..++.++++ .+-.....+.. .....++++++|++...|.... T Consensus 320 ~~~a~~~~~g~~~~~~~~~gv~d~~~~~d~~~~~~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~ 399 (529) T protein:vir:10 320 NYTAQVGKSGWTQTVGSAAGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALALVD 399 (529) T ss_pred hhhceeeeeeeeccccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHhhhc Confidence 432211 011122211 11111122211 1235788999999999886321 Q ss_pred cccccccccccccccccc----ccchh-cceeeEEcCCCCcceEEEEcCCe------EEEEeccCceeeeccccccCccE Q lcl|Aclame:pro 177 SDNFTRPTQLGDNIIVKG----AFGEA-LGAVIVRSNKLNKGEALLAKKGA------VKLITKRDFFLEKDRDASRKSTA 245 (274) Q Consensus 177 ~~~~~~~~~~~~~~~~~g----~~~~i-~G~~Vv~s~~~p~~~~~l~~~~a------~~~~~~~~~~ve~~r~~~~~~~~ 245 (274) ...+-..-....+...+. ..|.+ .|++|+++++.|.+-..+.-++. +-|+--.+.....--|++.++-. 
T Consensus 400 ~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~ 479 (529) T protein:vir:10 400 AGITPAAQGMASGLNADTTKGVFAGVLGGRYKVYIDQYARQDYFTMGYRGANNLDAGIYYCPYVALTPLRGSDPKNFQPV 479 (529) T ss_pred cccccccccccccceeecCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCCcccce Confidence 111111000001111111 12333 36799999999876655543432 22443444444445688899999 Q ss_pred EEEEEEEEEEEEcCcceE-------EEEeCCCcc------cC Q lcl|Aclame:pro 246 LYSDKHYVAYLYDESKVV-------KITKGAGDE------VM 274 (274) Q Consensus 246 i~~~~~~~~~v~~~~avv-------~l~~~aa~~------~~ 274 (274) +-...||+..+ ||-..- ++....+.+ .| T Consensus 480 ~g~~tRY~l~~-NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~ 520 (529) T protein:vir:10 480 MGFKTRYAIGV-NPFAESRTQAPTSRISNGMPGAHSVGKNAY 520 (529) T ss_pred eeeeeeeceee-cCccccccccccccccCCcchhhhcCccce Confidence 99999998764 453321 222222222 22 No 237 >protein:vir:104915 Length: 470 # NCBI annotation: T4-like major capsid protein # Family: family:all:364 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214367;genbank:gi:61806007;genbank:GeneID:3294435 Probab=93.28 E-value=0.0081 Score=32.04 Aligned_cols=272 Identities=11% Similarity=0.033 Sum_probs=123.5 Q ss_pred CCcccc-----------chh-hccchH---HHHHHHHHHHHHhhh------------hccccc----ccccccccCCCEE Q lcl|Aclame:pro 1 MAQGTT-----------KVS-NLIVPE---VLAPMMQAELDKKLR------------FAQFAD----IDSTLVGQPGDTL 49 (274) Q Consensus 1 ma~~~T-----------~~~-~~~iPe---~~~~~v~~~~~~~~~------------~~~l~~----~~~~~~~~~G~~v 49 (274) |.-+|- ..+ ..|.-| .|+......-..... ..+... .........+... 
T Consensus 107 MTgPTGLIFAmRsrY~n~sG~EaffnEA~T~fSG~~~~~~~~~~~~~~~a~~~g~~~~~~~gt~~~~~~~~~~~a~~~~y 186 (470) T protein:vir:10 107 MNGPTGLIFAMRSRYKTQSGTEALFNEADTAFSGQPDGLDDTSGFTATGANNVGLGTTAQQGSNPGLLNSTAAQTNATDY 186 (470) T ss_pred CCccceeeeEEEEEecCCCccceeeecCCcccCccccccccccccccccccccccccccccccccccccccccccccccc Confidence 111100 000 000000 011100000000000 000000 0000000000011 Q ss_pred EEEeecCCCCccccc--CCCcccccccccceeEEeehhhhcchhccHHHHhc----cCccHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 50 TFPAFTYSGDAQVIA--EGEKIPVDQIGTSKREAKVRKIGKGTELTDEAVLS----GFGDPQGEAVRQHGLAIANKVDND 123 (274) Q Consensus 50 ~ip~~~~~~~a~~~~--eg~~~~~~~~~~~~~~~~~~~~~~~~~is~e~~~~----s~~d~~~~~~~~~a~~~a~~~d~~ 123 (274) .+-.--....++..+ .+.++++-.++.+.++++.+.++-.-++|-|+.+| -..|.+..+.+-|+..|+..|+++ T Consensus 187 ~~~~GMsTa~aE~lg~s~~~~f~EMaFsIeK~tVtAKSRaLKAeYTiELAQDLKAiHGLDAEtELaNILStEImlEINRe 266 (470) T protein:vir:10 187 NVGQGMRTDSAEDLGDGTGDQFNQMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSTEILAEINRE 266 (470) T ss_pred ccccccchHHhhhcCCCCCcccceeeeEEEEEEEEeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhcHH Confidence 110000011122232 24457777788888999998888778888887654 368999999999999999999999 Q ss_pred HHHHhcccccc------ccCcccCH----------HHHHHHHHHH---------hhcCCCccEEEEcHHHHHHHHhhhcc Q lcl|Aclame:pro 124 VLEALKGATLT------VEADITKL----------DGLQTAIDKF---------NDEDLEPMVLFVNPLDAGGLRTSASD 178 (274) Q Consensus 124 ~i~~~~~a~~~------~~~~~~~~----------d~iv~a~~~l---------~~~~~~~~~~v~~p~~~~~L~~~~~~ 178 (274) +|..+.+.... ...+.+++ +.+-.....+ ...-...++++++|.+.+.|...... 
T Consensus 267 ii~~l~~~a~~~k~~~~~~~Gv~Dl~~~~~gr~~~e~~~~l~~~i~~ean~i~~~t~r~~~n~~i~S~~Va~~La~sG~l 346 (470) T protein:vir:10 267 VIRTIYNVAEPGAQANVAAAGTFDLDTDSNGRWSVEKFKGLIFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVL 346 (470) T ss_pred HHHHHhhhhhhceeccccccceEEeecccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchhHHhHhhhcccc Confidence 99887654432 11222111 1111111111 11235778999999999988544322 Q ss_pred ccccccccccccccccc--cchh-cceeeEEcCCCC------cceEEEEcCCe------EEEEeccCceeeeccccccCc Q lcl|Aclame:pro 179 NFTRPTQLGDNIIVKGA--FGEA-LGAVIVRSNKLN------KGEALLAKKGA------VKLITKRDFFLEKDRDASRKS 243 (274) Q Consensus 179 ~~~~~~~~~~~~~~~g~--~~~i-~G~~Vv~s~~~p------~~~~~l~~~~a------~~~~~~~~~~ve~~r~~~~~~ 243 (274) ++........+.=..|. .|.+ .|++|++++++. .+-..+.-++. +-|+--.++....--|++.++ T Consensus 347 ~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~d~y~~~~~~a~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfq 426 (470) T protein:vir:10 347 DYTPALNANLNVDDTGNTFAGILQGKYRVYIDPFSASGGAAATQYYVVGYKGSSPYDAGLFYCPYVPLQMVRAVGQDTFQ 426 (470) T ss_pred ccccccccccccCCCCceEEEEecCceEEEeeccccccCcccccEEEEEEecCcceecceeeccccccccCCCCCCcccc Confidence 33221111000001111 2444 357999997533 22222222222 223322232222233788888 Q ss_pred cEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 244 TALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 244 ~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) -.+....||+..+ +|-..- ++-..|.-+. 
T Consensus 427 P~~g~~tRY~l~~-NP~~~~-~~~~~~~i~~ 455 (470) T protein:vir:10 427 PKIGFKTRYGLVE-NPFSQG-TTQGLGTLTR 455 (470) T ss_pred ceeeeeeeeceee-cCcccC-CCcccccccC Confidence 8999999998764 455321 2222222222 No 238 >protein:vir:96079 Length: 382 # NCBI annotation: hypothetical protein ORF023 # Family: family:all:1653 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294440;genbank:gi:149408337;genbank:GeneID:5237198 Probab=92.57 E-value=0.011 Score=31.36 Aligned_cols=255 Identities=11% Similarity=0.018 Sum_probs=126.7 Q ss_pred CC-c---cccchhhccchHHHHHHHHHHHHH----hhhhcccccccccccccCC-CEEEEEeecCCCCcccccCCCcccc Q lcl|Aclame:pro 1 MA-Q---GTTKVSNLIVPEVLAPMMQAELDK----KLRFAQFADIDSTLVGQPG-DTLTFPAFTYSGDAQVIAEGEKIPV 71 (274) Q Consensus 1 ma-~---~~T~~~~~~iPe~~~~~v~~~~~~----~~~~~~l~~~~~~~~~~~G-~~v~ip~~~~~~~a~~~~eg~~~~~ 71 (274) |= + .+| +.+.=+|-++..++...+-+ -.....|+-.... +.-. +++.++.+...|.+..++.++++|. T Consensus 63 mDa~~~~~~t-~~~~g~p~~~l~~~~p~~~~~~~~p~~~~~l~pv~t~--g~W~~~t~ty~~~e~~G~A~~ygd~~D~Pl 139 (382) T protein:vir:96 63 MDSNFTAPVT-TPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTV--GSWEDQEIVQGIVEPAGTAVEYGDHTNIPL 139 (382) T ss_pred cccccCCccc-cCCccHHHHHHhhhhhhhhhhhhhhhhhhhhcccccc--CCccceEEEEeeeecccceEEeecccCCCc Confidence 32 1 233 33444588888777654433 3333444433221 1111 4688998888899999999999988 Q ss_pred cccccceeEEeehhhhcchhccHHHHhc---cCccHHHHHHHHHHHHHHHHHHHHHHHHh----cc-------cc----- Q lcl|Aclame:pro 72 DQIGTSKREAKVRKIGKGTELTDEAVLS---GFGDPQGEAVRQHGLAIANKVDNDVLEAL----KG-------AT----- 132 (274) Q Consensus 72 ~~~~~~~~~~~~~~~~~~~~is~e~~~~---s~~d~~~~~~~~~a~~~a~~~d~~~i~~~----~~-------a~----- 132 (274) .+...+...-+++.....+.+.++++.+ ...++.+.-++...+++.+.+|+..+-.. 
.+ -+ T Consensus 140 ~d~~~~~~~r~v~~~~~g~~yg~lE~~rAa~~~~~l~~~Ka~aA~~ale~~~N~i~f~G~~~g~~~~~yGllNdP~l~a~ 219 (382) T protein:vir:96 140 TSWNANFERRTIVRGELGLLVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPPF 219 (382) T ss_pred cccccceeEEEEEEEEEeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHHhhceEEEEeeecCcCcceEEEEeCCCcccc Confidence 8777666666666665667776544433 36778887777888888888887665221 11 00 Q ss_pred cc-ccCccc--C----HHHHHHHHHHHhhcC-------CCccEEEEcHHHHHHHHhhhccccccccccccccccccccch Q lcl|Aclame:pro 133 LT-VEADIT--K----LDGLQTAIDKFNDED-------LEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGE 198 (274) Q Consensus 133 ~~-~~~~~~--~----~d~iv~a~~~l~~~~-------~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~ 198 (274) .+ .+.++. + +++|..++..+.... ..+..+++.|..+..|.+-+ .+. ...-..+.. . T Consensus 220 ~t~a~~~Wa~kT~~eI~~Di~~l~~~i~~qt~G~~~~~~~~~~L~LP~~~~~~Ls~~n--~~g---~Tvl~~lk~----n 290 (382) T protein:vir:96 220 QTPPSQGWATADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITMALATSKVDYLSVTT--PYG---ISVSDWIEQ----T 290 (382) T ss_pred cccCCCCcccccHHHHHHHHHHHHHHHHhccCCeeeecccceEEeechHHHhhccccC--ccC---ccHHHHHHH----h Confidence 11 111121 2 456666776663322 13457889998877763211 010 000011111 1 Q ss_pred hcceeeEEcCCCC---------cceEEEEcCCeE-----------EEEeccCceeeecccccc--CccEEEEEE-EEEEE Q lcl|Aclame:pro 199 ALGAVIVRSNKLN---------KGEALLAKKGAV-----------KLITKRDFFLEKDRDASR--KSTALYSDK-HYVAY 255 (274) Q Consensus 199 i~G~~Vv~s~~~p---------~~~~~l~~~~a~-----------~~~~~~~~~ve~~r~~~~--~~~~i~~~~-~~~~~ 255 (274) +-++.++.-+.+. ..-.+++.+..- .|....+..... ...+. ......... 
..|+- T Consensus 291 ~Pnl~i~t~peL~~a~~~g~g~~~~~~~~~~e~~~~~~~s~~~p~~f~q~~p~~~~~-l~ve~~~~~~~~~~s~~t~Gv~ 369 (382) T protein:vir:96 291 YPKMRIVSAPELSGVQMQGKTPEDALVLFVEEVDASVDGSTDGGSVFSQLVQSKFIT-LGVEKRAKSYVEDFSNGTAGAL 369 (382) T ss_pred cCCcEEEEccccccccCCCccceeEEEEecchhhhhcccccccCcceeccccceeee-ccceeecceeEeccccceeeeE Confidence 2244444443332 111122222210 000000000000 00001 111111122 35677 Q ss_pred EEcCcceEEEEeC Q lcl|Aclame:pro 256 LYDESKVVKITKG 268 (274) Q Consensus 256 v~~~~avv~l~~~ 268 (274) +.+|.++++++.= T Consensus 370 i~~P~ai~~~~GI 382 (382) T protein:vir:96 370 CKRPWAVVRYLGI 382 (382) T ss_pred EEcchhhhhccCC Confidence 7899999997766 No 239 >protein:vir:104549 Length: 462 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214669;genbank:gi:61806310;genbank:GeneID:3294604 Probab=92.31 E-value=0.012 Score=31.13 Aligned_cols=253 Identities=11% Similarity=0.024 Sum_probs=126.3 Q ss_pred CCcc-ccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeec---CCCCcccccC---CCcccccc Q lcl|Aclame:pro 1 MAQG-TTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFT---YSGDAQVIAE---GEKIPVDQ 73 (274) Q Consensus 1 ma~~-~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~---~~~~a~~~~e---g~~~~~~~ 73 (274) .+.. .+.....+.+..-.. ..... ... .| +.++.-.. ....++..+. +..+++-. 
T Consensus 142 ~~~~~~~~~~~~~~~~~g~~------------~~~~~--~~~---~g-~~~~~~~~~GM~Ta~aE~lg~~s~n~~f~EMa 203 (462) T protein:vir:10 142 LSNYDPTASSSAVNDAEGAN------------PGLLN--DSP---AG-TYEVTGDATGMATATAEALDDSSASTAFREMG 203 (462) T ss_pred cccccccccccccccccccc------------ceeec--CCC---cc-ceecccccccccchhccccCCccCCcchhhce Confidence 2211 111111111110000 00000 000 00 01110000 0112333332 34678888 Q ss_pred cccceeEEeehhhhcchhccHHHHhc----cCccHHHHHHHHHHHHHHHHHHHHHHHHhccccccc------cCcccCH- Q lcl|Aclame:pro 74 IGTSKREAKVRKIGKGTELTDEAVLS----GFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTV------EADITKL- 142 (274) Q Consensus 74 ~~~~~~~~~~~~~~~~~~is~e~~~~----s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~~~------~~~~~~~- 142 (274) ++.+.++++.+.++-.-++|-|+.+| -..|.+..+.+-|+..|...|++++|..+....... ..+.+++ T Consensus 204 FsIeK~tVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILSTEImlEINReii~~l~~~a~~~k~~~~~~~Gv~dl~ 283 (462) T protein:vir:10 204 FSIEKVTVTAKSRALKAEYSIEMAQDLKAIHGLDAESELANILSTEILAEINREVVRTIYVNAVKGAIANTATDGIFDLD 283 (462) T ss_pred eEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHhhhhhhheeeecccccccceeeec Confidence 88899999999888888888887655 358999999999999999999999998876544321 2222221 Q ss_pred -----HHHHH----HHHHH---------hhcCCCccEEEEcHHHHHHHHhhhccccccccc---ccc--ccccccccchh Q lcl|Aclame:pro 143 -----DGLQT----AIDKF---------NDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQ---LGD--NIIVKGAFGEA 199 (274) Q Consensus 143 -----d~iv~----a~~~l---------~~~~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~---~~~--~~~~~g~~~~i 199 (274) ...++ ....+ ...-...+++|++|++.+.|......++...-+ .+. 
+..-....|.+ T Consensus 284 ~~~~gr~~~e~~k~l~~qi~~ean~i~~~t~r~~~n~~i~S~~Va~~La~sG~l~~~p~~~~~~~~~~~d~~~~~~~G~l 363 (462) T protein:vir:10 284 VDSNGRWSVEKFKGLLFQIERDSNAIGQETRRGKGNILICSADVASALGMAGVLDYAPGLQGNSALTGVDDTSSTLVGTL 363 (462) T ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHHHhccccceEEEEchhHHHHhhhccchhccccccccccccccccccceeEEEe Confidence 11222 22222 112357789999999999885443223322111 110 00111123444 Q ss_pred -cceeeEEcCCC----CcceEEEEcCCe------EEEEeccCceeeeccccccCccEEEEEEEEEEEEEcCcceEEEEeC Q lcl|Aclame:pro 200 -LGAVIVRSNKL----NKGEALLAKKGA------VKLITKRDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKG 268 (274) Q Consensus 200 -~G~~Vv~s~~~----p~~~~~l~~~~a------~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~ 268 (274) .|++|+++++. |.+-..+.-++. +-|+--.+.....--|++.++-.+....||+..+ ||-..- ++.+ T Consensus 364 ~~r~~vy~D~Y~~~ns~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP~t~~-~~~~ 441 (462) T protein:vir:10 364 NGRIKVYVDPYSSNVADKHFYVAGYKGTSPYDAGLFYCPYVPLQQVRAINPNTFQPKIGFKTRYGMVS-NPFSGG-LTQG 441 (462) T ss_pred cCceEEEEecccCCCcccceEEEEEeCCcccccceeeccccccccccccCCccccceeeeeeeeeeee-cCCCCC-cCCc Confidence 46899999753 332222222222 2233333333333448888998998899998764 344211 1111 Q ss_pred CCcccC Q lcl|Aclame:pro 269 AGDEVM 274 (274) Q Consensus 269 aa~~~~ 274 (274) -+ .++ T Consensus 442 ~~-~~~ 446 (462) T protein:vir:10 442 SG-ALT 446 (462) T ss_pred cc-ccc Confidence 11 122 No 240 >protein:vir:99576 Length: 388 # NCBI annotation: hypothetical protein # Family: family:all:1653 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039801;genbank:gi:126011051;genbank:GeneID:4818271 Probab=92.21 E-value=0.012 Score=31.04 Aligned_cols=257 Identities=7% Similarity=-0.028 Sum_probs=129.9 Q ss_pred CCcc-----ccchhhccchHHHHHHHHHHHH----HhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccc Q lcl|Aclame:pro 1 MAQG-----TTKVSNLIVPEVLAPMMQAELD----KKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPV 71 (274) Q Consensus 1 ma~~-----~T~~~~~~iPe~~~~~v~~~~~----~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~ 71 
(274) ||+. -.+.+++=+|-.+..++...+- .-.....|+-.... ..=.-+++.++.+...|.+..|+.++++|. T Consensus 65 ~a~da~~~~~~t~~~~gip~~~~~~~~p~~~~~~~~p~~~~~l~pv~t~-g~W~~~~~~f~v~e~~G~A~~ygd~~D~Pl 143 (388) T protein:vir:99 65 QAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTV-GSWEDQEIVQGIVEPAGTAMEYGDLTNIPL 143 (388) T ss_pred cccCcccccccccCcccHHHHHhhhhccceeeeeechhhhhhhcccccc-CCccceeEEEeeeecceeEEEeecccCCCc Confidence 3432 2333444468778777654332 22222333322221 110113688888888899999999999999 Q ss_pred cccccceeEEeehhhhcchhccHHHHhc---cCccHHHHHHHHHHHHHHHHHHHHHHHHhc-----------ccc----- Q lcl|Aclame:pro 72 DQIGTSKREAKVRKIGKGTELTDEAVLS---GFGDPQGEAVRQHGLAIANKVDNDVLEALK-----------GAT----- 132 (274) Q Consensus 72 ~~~~~~~~~~~~~~~~~~~~is~e~~~~---s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~-----------~a~----- 132 (274) .+...+...-+++.....+.++++++.. ...++...-++...+++.+.+++..+-... +-+ T Consensus 144 ~d~~~~~~~r~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~AA~~ale~~~N~i~f~G~~g~~~~~~yGllNdP~l~a~ 223 (388) T protein:vir:99 144 SSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPA 223 (388) T ss_pred eeccceeeeeeEEEEEeeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHhhhceEEEEeecCCCccceEEEeeCCCcccc Confidence 8877777777777766667787766554 357788888888888888887776652211 000 Q ss_pred cccc--C-c--c--cC----HHHHHHHHHHHhhcC-------CCccEEEEcHHHHHHHHhhhcccccccccccccccccc Q lcl|Aclame:pro 133 LTVE--A-D--I--TK----LDGLQTAIDKFNDED-------LEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKG 194 (274) Q Consensus 133 ~~~~--~-~--~--~~----~d~iv~a~~~l~~~~-------~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g 194 (274) .+.+ . . + -+ +++|..++..+.... ..+..+++.|..+..|-+-+ .+ . ......+.. 
T Consensus 224 v~at~~~~~~~Wa~kT~~eI~~Di~~~~~~i~~qs~g~~~~~~~~~tL~LP~~~~~~Ls~~n--~~--g-~Tvl~~lk~- 297 (388) T protein:vir:99 224 IASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVVT--DL--G-ISVRDWLKQ- 297 (388) T ss_pred cccccCCcCcccccCCHHHHHHHHHHHHHHHHHhcCCeeeecccceEEEechHHHHhccccC--cC--C-ccHHHHHHH- Confidence 0111 1 1 1 12 456666766663321 13447889998888874221 11 0 000011111 Q ss_pred ccchhcceeeEEcCCCC------cceE-EEEcCCeE-------------EEEeccCceeeeccccccCccEEEEEEE-EE Q lcl|Aclame:pro 195 AFGEALGAVIVRSNKLN------KGEA-LLAKKGAV-------------KLITKRDFFLEKDRDASRKSTALYSDKH-YV 253 (274) Q Consensus 195 ~~~~i~G~~Vv~s~~~p------~~~~-~l~~~~a~-------------~~~~~~~~~ve~~r~~~~~~~~i~~~~~-~~ 253 (274) .+-++.++.-+.+. .+.. +++.+.-- ......+.+... -+........-...| +| T Consensus 298 ---n~Pnl~i~t~pEl~~a~~tgg~~~~~~~~~~~~~~~~~~~~~~~t~~~~~p~~~~~l~-vq~~~~~~~~~~~~rt~G 373 (388) T protein:vir:99 298 ---TYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLG-VEKRVKNYVEAYSNATAG 373 (388) T ss_pred ---hcCCcEEEEecccccccccCCceeEEEEecccccccccCccCcceeEEeccccccccc-ceecCceeEeccccceee Confidence 13345555443332 1122 22222110 001111111100 000111122323333 46 Q ss_pred EEEEcCcceEEEEeC Q lcl|Aclame:pro 254 AYLYDESKVVKITKG 268 (274) Q Consensus 254 ~~v~~~~avv~l~~~ 268 (274) +-+.+|.++++++.= T Consensus 374 v~ir~P~Ai~~~~GI 388 (388) T protein:vir:99 374 VMLKRPWAVVRLIGL 388 (388) T ss_pred eEEeccchhheeccC Confidence 667799999998776 No 241 >protein:vir:6378 Length: 346 # NCBI annotation: capsid protein E # Family: family:all:1021 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918991;genbank:gi:34610166;genbank:GeneID:2559600 Probab=91.41 E-value=0.016 Score=30.43 Aligned_cols=257 Identities=11% Similarity=0.010 Sum_probs=113.3 Q ss_pred hhccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCC-CCcccccCCCcccc-cccccceeEEeehhh Q lcl|Aclame:pro 9 SNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYS-GDAQVIAEGEKIPV-DQIGTSKREAKVRKI 86 (274) Q Consensus 9 
~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~-~~a~~~~eg~~~~~-~~~~~~~~~~~~~~~ 86 (274) -|+|-+..+...+.+.-........++... .....+++.|-..... .-+..+.++.+-+. ..-........+-.+ T Consensus 1 ~d~f~~~~l~~~i~~~p~~~~l~~~~fp~~---~~~~t~~i~i~~~~g~~~la~~v~~~~~~~~~~~~g~~~~~~~~p~i 77 (346) T protein:vir:63 1 MEIFDTLTLAGVIQSGPALSMYWQGFYPNE---ITFDTDEILFDLVFKDKKLAPFVAPNVQGRVIAARGYTTKTFRPAYV 77 (346) T ss_pred CCccCHHHHHHHHHhcCCccchhhhcCccc---cccccceEEEEEecCceeeeeeecCCCCcceecccceeeeEeecCcc Confidence 778888888877765433222222222110 1112234444332211 11233333332211 111222333444444 Q ss_pred hcchhccHHHH--h-------ccCccHHHHHH-------HHHHHHHHHHHHHHHHHHhcccccc---------------- Q lcl|Aclame:pro 87 GKGTELTDEAV--L-------SGFGDPQGEAV-------RQHGLAIANKVDNDVLEALKGATLT---------------- 134 (274) Q Consensus 87 ~~~~~is~e~~--~-------~s~~d~~~~~~-------~~~a~~~a~~~d~~~i~~~~~a~~~---------------- 134 (274) +..-.++.+++ + .+..+..+.+. ..+.+.+.+.+|......+.++... T Consensus 78 ~~~~~i~~~d~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~i~~~~E~m~~~al~~gki~~~g~~~~~~~vdfg~~ 157 (346) T protein:vir:63 78 KPKDVINPNRTLKRRAGEQPIIGGMSLQERFQAVVADSQLEQRQRIENRIEWMCAMATIYGYVDVVGEAFPMQRVDFGRD 157 (346) T ss_pred CccceeCHHHHHHHhhhhhhccCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCEEEeeCCceeEEEEeeCCC Confidence 44333433222 1 12222333333 3344555566666666666543211 Q ss_pred --------cc-----CcccCHHHHHHHHHHHhhc-CCCccEEEEcHHHHHHHHhhhcccc-c---ccccccc---ccccc Q lcl|Aclame:pro 135 --------VE-----ADITKLDGLQTAIDKFNDE-DLEPMVLFVNPLDAGGLRTSASDNF-T---RPTQLGD---NIIVK 193 (274) Q Consensus 135 --------~~-----~~~~~~d~iv~a~~~l~~~-~~~~~~~v~~p~~~~~L~~~~~~~~-~---~~~~~~~---~~~~~ 193 (274) +. ++..-+.++.++...+.++ +..+..++|+++.+..|.+++...- . .....+. ..+.. 
T Consensus 158 ~~~~~~lt~~~~W~~~~adp~~di~~~~~~~~~~~g~~~~~~i~~~~~~~~l~~~~~v~~~~~~~~~~~~~~~~~~~l~~ 237 (346) T protein:vir:63 158 PALTVQLTGGAAWDQATSDPLGNIQTMRTTAWKKSNSTITRLTMGLDAWSLFSQKPAVVELLNLFYKGSTSDFNRSRLDD 237 (346) T ss_pred ccceeeecccccCCCCCCCHHHHHHHHHHHHHHccCCceEEEEECHHHHHHHhcCHHHHHHHhhhccccccccchhhccc Confidence 00 1111245666666666554 5688899999999999986643221 1 1100000 01111 Q ss_pred c-------ccc---hhcceeeEE------------cCCCCcceEEEEcCCeEE---EEeccCc---ee------eecccc Q lcl|Aclame:pro 194 G-------AFG---EALGAVIVR------------SNKLNKGEALLAKKGAVK---LITKRDF---FL------EKDRDA 239 (274) Q Consensus 194 g-------~~~---~i~G~~Vv~------------s~~~p~~~~~l~~~~a~~---~~~~~~~---~v------e~~r~~ 239 (274) + .+. .+.|+.|+. .+.+|.+.++++..+..+ |..-.+. .. ..+... T Consensus 238 ~~~~~~~~~~~~~~~~~gi~i~~y~~~y~d~~G~~~~~ip~~~v~~~p~~~~g~~~yg~~~d~~~~~~~~~~~~~~~~~~ 317 (346) T protein:vir:63 238 GSPVQYQGTIGGYNGMGTLELYTYHDTYTGDDNTEQEILGSYDVVGTGPGLQGTQCFGAIMDFKNGLVPTRMFPKMWEEE 317 (346) T ss_pred chhhhhhhhHhhhhccCCeEEEEeccEEEcCCCceeccccCCeEEEEecCCcceEEEeeccccccCcccceeeeEEEEec Confidence 1 111 234666654 233677777776544332 3211110 00 011111 Q ss_pred ccCccEEEEEEEEEEEEEcCcceEEEEeC Q lcl|Aclame:pro 240 SRKSTALYSDKHYVAYLYDESKVVKITKG 268 (274) Q Consensus 240 ~~~~~~i~~~~~~~~~v~~~~avv~l~~~ 268 (274) +-....+.+-.+.=..+.+|++++++++. 
T Consensus 318 dp~~~~~~~~s~plPv~~~p~~~~~~~V~ 346 (346) T protein:vir:63 318 DPSVAMLMTQSAPLMVPAQPNASFRMTVK 346 (346) T ss_pred CCCEEEEEEeeeccceecCCCcEEEEEeC Confidence 11122334444444456789999999999 No 242 >protein:vir:94528 Length: 286 # NCBI annotation: major head protein # Family: family:all:3269 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223889;genbank:gi:62327101;genbank:GeneID:5075544 Probab=89.86 E-value=0.024 Score=29.47 Aligned_cols=261 Identities=11% Similarity=0.071 Sum_probs=128.3 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccccccccccC-CCEEEEEeecCCCC-cccc--cCCCcc---cccc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQP-GDTLTFPAFTYSGD-AQVI--AEGEKI---PVDQ 73 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~-G~~v~ip~~~~~~~-a~~~--~eg~~~---~~~~ 73 (274) |++...-.+--.-..+|...+...++.+..|+...--.-.+.+.+ .++.-.-+-.+.+- .+.| +|+..+ +-.. T Consensus 1 m~t~N~n~avr~Y~Kqf~glL~~vf~~qa~F~~~fgglQalDGV~~N~tafsvKt~D~pVVig~Y~TdeNv~FGtgTg~S 80 (286) T protein:vir:94 1 MATTNNDLPVRVYSKEFLQLLSTVYQAQSVFTPTFGALQALDGVPNNATAFSVKTNDMAVVVGEYSTDANTAFGTGTSNS 80 (286) T ss_pred CCCCccccceeehhHHHHHHHHHHHhhHHHhhhhhcchhhhhCCCccceEEEEeecCcceEEecccCCCccccccCCccc Confidence 884322222223334577777777777777644321111111111 11111111111110 0111 222221 1112 Q ss_pred cccceeEEee---hhhhcchhccHHH---HhccCccHHHHHH---HHHHHHHHHHHHHHHHHHhccccccccCcccCHHH Q lcl|Aclame:pro 74 IGTSKREAKV---RKIGKGTELTDEA---VLSGFGDPQGEAV---RQHGLAIANKVDNDVLEALKGATLTVEADITKLDG 144 (274) Q Consensus 74 ~~~~~~~~~~---~~~~~~~~is~e~---~~~s~~d~~~~~~---~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~~~d~ 144 (274) --|++..--. ...-..+.++--+ ...-.-|+...+. +..+.++++.+|..+-..+..+... ..+.|. 
T Consensus 81 sRFG~rkEi~y~dtdV~Y~~~~~iHEGiD~~TVNnd~~aaVAdRL~lQA~Akt~~~n~~~Gk~ls~~A~~----t~~~D~ 156 (286) T protein:vir:94 81 SRFGEMKEVIYADTDVPYTAGWAIHEGLDQMTVNNDLDAAVADRLNLQAQAKTRLFNVAMGEALATAGTD----LGAVDD 156 (286) T ss_pred cccCceeeEEeecccccccccchhhhccccccccCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh----hhhhhh Confidence 2344332111 1122222222111 1112234444444 4457888888888776655332221 223478 Q ss_pred HHHHHHHHhhcCC-----CccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcCC-CCcceEEEE Q lcl|Aclame:pro 145 LQTAIDKFNDEDL-----EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNK-LNKGEALLA 218 (274) Q Consensus 145 iv~a~~~l~~~~~-----~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~-~p~~~~~l~ 218 (274) +...+..+...+. .+-..-+||+.|..|.-.+...-. ..+..+ +-...+..+.|+-+...+. +-.|...+| T Consensus 157 V~~LF~~as~~yvn~ev~~~~~ayV~~evYnaiiD~~l~Tsa--K~SsaN-iDengi~~FkGf~i~e~P~~~~~g~~aif 233 (286) T protein:vir:94 157 VNALFESAVEKYTDLEVIAPVRAYVTASVYNAIIDLANVTTA--KNSAVN-IDTNGMLSFRGIAITKVPTQYMGGKAVIF 233 (286) T ss_pred HHHHHHHHHHHhhhhheeeeeEEEEchhHHHHHhcccccccc--ccceee-eccCCcceecceEEeecchhhccCceEEE Confidence 8777777765443 344578999999998755432211 111112 3333466788988877652 234888888 Q ss_pred cCCeEEEEeccCcee-eeccccccCccEEEEEEEEEEEEE--cCcceEEEEeCC Q lcl|Aclame:pro 219 KKGAVKLITKRDFFL-EKDRDASRKSTALYSDKHYVAYLY--DESKVVKITKGA 269 (274) Q Consensus 219 ~~~a~~~~~~~~~~v-e~~r~~~~~~~~i~~~~~~~~~v~--~~~avv~l~~~a 269 (274) .+..++..-- ++.+ .+-..+++....+.+-.-||-.++ +..|+++.+.++ T Consensus 234 s~dnig~aft-GIn~aR~IesEdF~GValQgAGK~G~~I~edNk~Ai~~~~~k~ 286 (286) T protein:vir:94 234 APDNVARVFT-GINIARTIQAIDFAGVELQGAGKYGTFILDDNKKAIFTATPKA 286 (286) T ss_pred ccccceeeec-cceeeeeeeccccCceeeeccccccccccccCceeEEEeecCC Confidence 8887765321 2222 122334566778888888887777 556777777777 No 243 >protein:vir:6601 Length: 528 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891732;genbank:gi:33620668;genbank:GeneID:1725275 Probab=89.66 E-value=0.025 
Score=29.36 Aligned_cols=263 Identities=8% Similarity=0.010 Sum_probs=129.4 Q ss_pred CCc-------------ccc--chhhccchHHHHHHHHHHHHHhhhhcccccccc--ccc----c-----------cCCCE Q lcl|Aclame:pro 1 MAQ-------------GTT--KVSNLIVPEVLAPMMQAELDKKLRFAQFADIDS--TLV----G-----------QPGDT 48 (274) Q Consensus 1 ma~-------------~~T--~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~~~--~~~----~-----------~~G~~ 48 (274) ++. .-+ +..+.+... . ....++....... ... . ..+.. T Consensus 160 ~a~~gGpTGliFAm~s~y~s~~~g~ea~~n----e------a~t~fs~~~~~~~~~~~~~~~g~~~g~~~~~~~~a~~~~ 229 (528) T protein:vir:66 160 EATVGSPTGTAFAKLTLSQAITAGDIVYHT----F------AETGIAYLQNVTGDSVTPQKVGSESEDEVVMKLIEEGKL 229 (528) T ss_pred cccccCCccceeecccccccccccceeeec----c------cccceeeeccccccccccCcccccccccccccccccccc Confidence 221 110 111111000 0 0001111110000 000 0 00111 Q ss_pred EEEEeecCCCCccc---cc--CCCcccccccccceeEEeehhhhcchhccHHHHhc----cCccHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 49 LTFPAFTYSGDAQV---IA--EGEKIPVDQIGTSKREAKVRKIGKGTELTDEAVLS----GFGDPQGEAVRQHGLAIANK 119 (274) Q Consensus 49 v~ip~~~~~~~a~~---~~--eg~~~~~~~~~~~~~~~~~~~~~~~~~is~e~~~~----s~~d~~~~~~~~~a~~~a~~ 119 (274) .++..--....++. .+ .+..+++-..+.+.++++.+.++-.-++|-|+.+| -..|.+..+.+-|+..|+.. T Consensus 230 ~~~~~Gm~Ta~aEale~lg~~s~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELsNILStEImlE 309 (528) T protein:vir:66 230 AEIAFGMATSIAEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLE 309 (528) T ss_pred eecccccchhhhhhhcccCCCcccchhhcceEEEeEEEEeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHH Confidence 11110000011111 11 13347777888889999999888888888887665 25899999999999999999 Q ss_pred HHHHHHHHhcccccc----------ccCcccC-------------HHHHHHHHHHHhh---------cCCCccEEEEcHH Q lcl|Aclame:pro 120 VDNDVLEALKGATLT----------VEADITK-------------LDGLQTAIDKFND---------EDLEPMVLFVNPL 167 (274) Q Consensus 120 ~d~~~i~~~~~a~~~----------~~~~~~~-------------~d~iv~a~~~l~~---------~~~~~~~~v~~p~ 167 (274) |++++|..++..... ..++.++ .+.+-.....+.. 
.....++++++|+ T Consensus 310 INREii~~i~~~a~~~~~~~t~~~~~~aG~~dl~~~~d~~g~rw~~e~~k~L~~~i~~~an~I~~~T~r~~gn~vi~S~~ 389 (528) T protein:vir:66 310 INREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRN 389 (528) T ss_pred hhHHHHhhhhheeeeeeeeeeeccccccceeecccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchH Confidence 999999777432211 0111111 1222222222211 1235689999999 Q ss_pred HHHHHHhhhccccccccc--cc--cccccccccchhc-ceeeEEcCCCCcceEEEEcCCe------EEEEeccCceeeec Q lcl|Aclame:pro 168 DAGGLRTSASDNFTRPTQ--LG--DNIIVKGAFGEAL-GAVIVRSNKLNKGEALLAKKGA------VKLITKRDFFLEKD 236 (274) Q Consensus 168 ~~~~L~~~~~~~~~~~~~--~~--~~~~~~g~~~~i~-G~~Vv~s~~~p~~~~~l~~~~a------~~~~~~~~~~ve~~ 236 (274) +...|.......+..... .+ .+....-..|.+. |++|+++++.|.+-..+.-++. +-|+--.+.....- T Consensus 390 Va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~ 469 (528) T protein:vir:66 390 VVNILASADQGISLAMQGAAKGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRA 469 (528) T ss_pred HHHHHhhccccccccccccccccccCCCCceeEEEecCceEEEecCCCCcceEEEEEeCCcccccceeecccccceeeEe Confidence 999886543211111110 00 0100000124444 6799999999876655543432 23444445555566 Q ss_pred cccccCccEEEEEEEEEEEEEcCcceE-------EEEeCCCc------ccC Q lcl|Aclame:pro 237 RDASRKSTALYSDKHYVAYLYDESKVV-------KITKGAGD------EVM 274 (274) Q Consensus 237 r~~~~~~~~i~~~~~~~~~v~~~~avv-------~l~~~aa~------~~~ 274 (274) .|++.++-.+-...||+..+ ||-..- ++....+. 
..| T Consensus 470 ~dp~sfqP~~g~~tRY~l~v-NP~~~~~~~~~~~ri~~g~~~~~~ag~n~~ 519 (528) T protein:vir:66 470 TDPQSFHPVLGFKTRYGIGI-NPFADSKSQEPSARITSGMLSKDSVGKNAY 519 (528) T ss_pred eCCccccceeeeeeeeceee-cCcccccCccccccccccchhhhhcCccce Confidence 78999999999999998764 454321 12222221 222 No 244 >protein:vir:101039 Length: 529 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932516;genbank:gi:37651642;genbank:GeneID:2610532 Probab=87.88 E-value=0.036 Score=28.51 Aligned_cols=272 Identities=8% Similarity=0.019 Sum_probs=127.7 Q ss_pred CCccccchhhccchHHHHH-HHHHH-H-HHhhhhccccccc-----ccccc-----------cCCCEEEEEeecCCCCcc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAP-MMQAE-L-DKKLRFAQFADID-----STLVG-----------QPGDTLTFPAFTYSGDAQ 61 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~-~v~~~-~-~~~~~~~~l~~~~-----~~~~~-----------~~G~~v~ip~~~~~~~a~ 61 (274) ..+.+|-... -.+..++. -..+. + +....+++...-. ....+ ..+....+..--....++ T Consensus 165 ~~~~~~~~~~-t~~~~~a~~~g~ea~f~ea~t~fs~~~~g~~~~~g~~~~~~~~~~~~~~~~a~~~~~~~~~Gm~Ta~aE 243 (529) T protein:vir:10 165 TTDGTPFAKL-TAGQAIAEGDIVGHFFYESGTAFLQNVSGASVTVGTNETGEALDKLINAAIGEGKLAEIAEGMATSIAE 243 (529) T ss_pred ccCccccccc-cccccccccCcceeeeecccceecccccccccccCccccCcccccccccccccccccccccccchhhhh Confidence 1111100000 00000000 00000 0 0111111111000 00000 001111111100011111 Q ss_pred cc-----cCCCcccccccccceeEEeehhhhcchhccHHHHhc----cCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc Q lcl|Aclame:pro 62 VI-----AEGEKIPVDQIGTSKREAKVRKIGKGTELTDEAVLS----GFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT 132 (274) Q Consensus 62 ~~-----~eg~~~~~~~~~~~~~~~~~~~~~~~~~is~e~~~~----s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~ 132 (274) -. ..+..+++-.++.+.++++.+.++-.-++|-|+.+| -..|.+..+.+-|+..|...|++++|..+.... 
T Consensus 244 aL~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILStEImlEINReii~~l~~~a 323 (529) T protein:vir:10 244 LRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWINYTA 323 (529) T ss_pred ccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHhHhhhh Confidence 11 123357777888889999999888888888887655 258999999999999999999999998775433 Q ss_pred cc----------ccCcccCHH-------------HHHHHHHHHhh---------cCCCccEEEEcHHHHHHHHhhhcccc Q lcl|Aclame:pro 133 LT----------VEADITKLD-------------GLQTAIDKFND---------EDLEPMVLFVNPLDAGGLRTSASDNF 180 (274) Q Consensus 133 ~~----------~~~~~~~~d-------------~iv~a~~~l~~---------~~~~~~~~v~~p~~~~~L~~~~~~~~ 180 (274) .- ...+.++++ .+-.....+.. .....++++++|++...|.......+ T Consensus 324 ~~~k~~g~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~k~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~~~~~ 403 (529) T protein:vir:10 324 QVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALALIDTNIS 403 (529) T ss_pred hhhhcccccccccccceeecccCccccccchHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHHhhhhhcc Confidence 21 111222221 11111122211 12357889999999998864221111 Q ss_pred cccc--cccc--ccccccccchh-cceeeEEcCCCCcceEEEEcCCe------EEEEeccCceeeeccccccCccEEEEE Q lcl|Aclame:pro 181 TRPT--QLGD--NIIVKGAFGEA-LGAVIVRSNKLNKGEALLAKKGA------VKLITKRDFFLEKDRDASRKSTALYSD 249 (274) Q Consensus 181 ~~~~--~~~~--~~~~~g~~~~i-~G~~Vv~s~~~p~~~~~l~~~~a------~~~~~~~~~~ve~~r~~~~~~~~i~~~ 249 (274) -... ..+. +.......|.+ .|++|+++++.|.+-..+.-++. +-|+--.+.....--|++.++-.+-.. 
T Consensus 404 ~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~ 483 (529) T protein:vir:10 404 PAAQGMASGLNADTTKGVFAGILGGRYKVYIDQYARQDYFTMGYRGANNLDAGIYYCPYVALTPLRGSDPKNFQPVMGFK 483 (529) T ss_pred ccccccccccccccCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCCcccceeeee Confidence 1000 0000 11111122343 35799999999876655543432 223333344433446888999999999 Q ss_pred EEEEEEEEcCcceE-------EEEe------CCCcccC Q lcl|Aclame:pro 250 KHYVAYLYDESKVV-------KITK------GAGDEVM 274 (274) Q Consensus 250 ~~~~~~v~~~~avv-------~l~~------~aa~~~~ 274 (274) .||+..+ ||-..- ++.. .+....| T Consensus 484 tRY~l~~-NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~ 520 (529) T protein:vir:10 484 TRYAIGV-NPFAESRTQAPQGRITSGMPGVNSVGKNAY 520 (529) T ss_pred eeeceee-cCccccccccccccccCCcchhhhcCccce Confidence 9998764 453221 1222 2222222 No 245 >protein:vir:101811 Length: 529 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238888;genbank:gi:66391963;genbank:GeneID:3416638 Probab=85.76 E-value=0.05 Score=27.70 Aligned_cols=265 Identities=8% Similarity=-0.002 Sum_probs=127.4 Q ss_pred CCccc----cchhhccchHHHHHHHHHHHHHhhhhccccc----cc-cc-----c------cccCCCEEEEEeecCCCCc Q lcl|Aclame:pro 1 MAQGT----TKVSNLIVPEVLAPMMQAELDKKLRFAQFAD----ID-ST-----L------VGQPGDTLTFPAFTYSGDA 60 (274) Q Consensus 1 ma~~~----T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~----~~-~~-----~------~~~~G~~v~ip~~~~~~~a 60 (274) ++.-+ ....+. -+.+-.. ....+++... .. .. . 
....|....+..--....+ T Consensus 171 ~~~~ta~~~~a~g~g--~ea~f~e------a~t~fs~~~~g~~~~~g~~~t~~~~~~~~~~~~a~~~~~~~~~GmsTa~a 242 (529) T protein:vir:10 171 FAKLTAGQAIAEGDI--VGHFFYE------SGTAFLQNVSGASVTVGTNETGEALDKLINAAIGEGKLAEIAEGMATSIA 242 (529) T ss_pred ccccccccccccccc--ceeeecc------cCceeeccccccccccCccccCcccccccccccccccccccccchhhhhh Confidence 22111 111110 0000000 0000111000 00 00 0 0001111111110011111 Q ss_pred ccc-----cCCCcccccccccceeEEeehhhhcchhccHHHHhc----cCccHHHHHHHHHHHHHHHHHHHHHHHHhccc Q lcl|Aclame:pro 61 QVI-----AEGEKIPVDQIGTSKREAKVRKIGKGTELTDEAVLS----GFGDPQGEAVRQHGLAIANKVDNDVLEALKGA 131 (274) Q Consensus 61 ~~~-----~eg~~~~~~~~~~~~~~~~~~~~~~~~~is~e~~~~----s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a 131 (274) +-. ..+.++++-.++.+.++++.+.++-.-++|-|+.+| -..|.+..+.+-|+..|...|++++|..+.+. T Consensus 243 EaL~~~ggss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILStEImlEINReii~~l~~~ 322 (529) T protein:vir:10 243 ELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWINYT 322 (529) T ss_pred hccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhh Confidence 211 123457777888889999999888888888887655 25899999999999999999999999877543 Q ss_pred ccc----------ccCcccCHH-------------HHHHHHHHHhh---------cCCCccEEEEcHHHHHHHHhhhccc Q lcl|Aclame:pro 132 TLT----------VEADITKLD-------------GLQTAIDKFND---------EDLEPMVLFVNPLDAGGLRTSASDN 179 (274) Q Consensus 132 ~~~----------~~~~~~~~d-------------~iv~a~~~l~~---------~~~~~~~~v~~p~~~~~L~~~~~~~ 179 (274) ..- ...+.++++ .+-.....+.. .....++++++|++...|....... 
T Consensus 323 a~~~~~~~~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~~~~ 402 (529) T protein:vir:10 323 AQVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALALIDTNI 402 (529) T ss_pred hhhhccccccccccccceeecccCccccccchHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHHhhcccc Confidence 321 011222221 11111122211 1235788999999999886321111 Q ss_pred ccc----ccccccccccccccchh-cceeeEEcCCCCcceEEEEcCCe------EEEEeccCceeeeccccccCccEEEE Q lcl|Aclame:pro 180 FTR----PTQLGDNIIVKGAFGEA-LGAVIVRSNKLNKGEALLAKKGA------VKLITKRDFFLEKDRDASRKSTALYS 248 (274) Q Consensus 180 ~~~----~~~~~~~~~~~g~~~~i-~G~~Vv~s~~~p~~~~~l~~~~a------~~~~~~~~~~ve~~r~~~~~~~~i~~ 248 (274) +-. .+....+.......|.+ .|++|+++++.|.+-..+.-++. +-|+--.+.....-.|++.++-.+-. T Consensus 403 ~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~ 482 (529) T protein:vir:10 403 SPAAQGMASGLNADTTKGVFAGILGGRYKVYIDQYARQDYFTMGYRGANNLDAGIYYCPYVALTPLRGFDPKNFQPVMGF 482 (529) T ss_pred cccccccccccccccCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCCcccceeee Confidence 000 00000011111122343 35799999999876655543432 23443334433344588899999999 Q ss_pred EEEEEEEEEcCcceE-------EEEe------CCCcccC Q lcl|Aclame:pro 249 DKHYVAYLYDESKVV-------KITK------GAGDEVM 274 (274) Q Consensus 249 ~~~~~~~v~~~~avv-------~l~~------~aa~~~~ 274 (274) ..||+..+ ||-..- ++.. 
.+....| T Consensus 483 ~tRY~l~~-NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~ 520 (529) T protein:vir:10 483 KTRYAIGV-NPFAESRTQAPQGRITSGMPGVNSVGKNAY 520 (529) T ss_pred eeeeceee-cCccccccccccccccCCcchhhhcCccce Confidence 99998764 453221 1122 2222222 No 246 >protein:vir:3969 Length: 287 # NCBI annotation: major capsid protein # Family: family:all:3269 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663677;genbank:gi:21716114;genbank:GeneID:951200 Probab=84.95 E-value=0.056 Score=27.43 Aligned_cols=257 Identities=16% Similarity=0.094 Sum_probs=130.6 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHhhhhcccccc-cccccccC-CCEEEEEeecCCCC-cccc--cCCCcc---ccc Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADI-DSTLVGQP-GDTLTFPAFTYSGD-AQVI--AEGEKI---PVD 72 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~~~~~~l~~~-~~~~~~~~-G~~v~ip~~~~~~~-a~~~--~eg~~~---~~~ 72 (274) ||.-. +. .+|...+...++.++.|....-- .-.+.+.. .++.---+-.+++- .+.| +|+..+ +-. T Consensus 1 ~avr~------y~-Kq~~glL~~vf~~qa~F~~~FGg~lQ~~DGV~~N~taf~vKtsD~pVVi~~Y~Td~Nv~FGtGTg~ 73 (287) T protein:vir:39 1 MAIKY------FT-KQYAGMLPDLFAKKSAFLRAFGGVLQVKDGVTENDTFMELKVSDTDVVIQAYSTDANVGFGSGTGN 73 (287) T ss_pred CCccc------cc-HHHHHHHHHHHHHHHhhhhhcccceeeecCCcccceEEEEEecCcceEEecccCCCCcccccCCCc Confidence 66433 22 45888888888877776443211 00011111 12211111111110 0111 222211 111 Q ss_pred ccccceeEE-ee--hhhhcchh------ccHHHHhccCccHHHHHH---HHHHHHHHHHHHHHHHHHhccccccccCccc Q lcl|Aclame:pro 73 QIGTSKREA-KV--RKIGKGTE------LTDEAVLSGFGDPQGEAV---RQHGLAIANKVDNDVLEALKGATLTVEADIT 140 (274) Q Consensus 73 ~~~~~~~~~-~~--~~~~~~~~------is~e~~~~s~~d~~~~~~---~~~a~~~a~~~d~~~i~~~~~a~~~~~~~~~ 140 (274) .--|++..- .- ...-.-+. +.+.... -|+...+. 
+..+.++++.+|+.+-..+........+-.+ T Consensus 74 ssRFG~rkEi~y~dt~V~Y~~~~~ihEGiD~~TVN---nd~~aaVAdRL~Lqa~A~t~~~n~~~Gk~ls~~A~~t~~~~~ 150 (287) T protein:vir:39 74 TSRFGQRKEVKSVNKQVSYDAPLAINEGIDDFTVN---DIKDQVVAERLALHGVAWAQHVDKLLGKLLSDSASETLTVKL 150 (287) T ss_pred cccccceeEEEEecccccceecccccccccccccc---CChhHHHHHHHHhHHHHHHHHHHHHHHHHHHhhcchheeeee Confidence 222333221 11 11111122 2222222 23444444 4558899999998877666555444333347 Q ss_pred CHHHHHHHHHHHhhcCCC-------ccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcC--CCC Q lcl|Aclame:pro 141 KLDGLQTAIDKFNDEDLE-------PMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSN--KLN 211 (274) Q Consensus 141 ~~d~iv~a~~~l~~~~~~-------~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~--~~p 211 (274) +-|.+...+..+...+.. +-+.-+||+.|..|.-.+...-. ..+..+ +-...+..+.|+-+...+ ... T Consensus 151 t~d~V~~LF~~a~~~yvNn~v~~~~~~~AyV~aevYnaiiD~~l~Tsa--K~SsaN-iDen~i~kFkGf~l~e~P~~~~q 227 (287) T protein:vir:39 151 DEDSVTKLFSDAHKKFVNNNVSIAVPWVAYVNADIYDLLIDSKLATTA--KNSSAN-VDEQTLYKFKGFILSELPDEKFQ 227 (287) T ss_pred cccchHHHHHHHHHHhhccceeeEEEEEEEEChhHHhHHhcccccccc--ccceee-eccCCcceecceEEEecchHhhc Confidence 778888887777665442 33467899999998755432211 111112 333346678999887765 455 Q ss_pred cceEEEEcCCeEEEEeccCcee-eeccccccCccEEEEEEEEEEEEEcCcceEEEEeCCCc Q lcl|Aclame:pro 212 KGEALLAKKGAVKLITKRDFFL-EKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAGD 271 (274) Q Consensus 212 ~~~~~l~~~~a~~~~~~~~~~v-e~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~aa~ 271 (274) .|+..+|.+..++..-- ++.+ ..-..+++....+.+-.-||-.+.+.++...++.+..- T Consensus 228 ~g~~a~fs~dnig~af~-GI~vaR~i~sEdF~GvalQgAgK~G~~i~e~Nk~Ai~k~t~~k 287 (287) T protein:vir:39 228 LNEGAYFAADNVGVAGV-GIQVTRAMDSEDFAGTALQAAAKYGKYLPEKNKKAILKATVTK 287 (287) T ss_pred cCcEEEEccccceeecc-cceeEEeeecccccceeeecccccccccccccceEEEEEecCC Confidence 78888888888775321 2222 12234456778888888888888855544443333333 No 247 >protein:vir:79246 Length: 304 # NCBI annotation: conserved hypothetical protein # Family: family:all:776 # MgeID: mge:1867 # MgeName: Phage MP22 # 
Cross-refs: genbank:acc:YP_001469162;genbank:gi:157835004;genbank:GeneID:5648827 Probab=77.07 E-value=0.13 Score=25.47 Aligned_cols=193 Identities=13% Similarity=0.128 Sum_probs=101.5 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHh-hhhcccccccccccccCCCEEEEEeecCCCCc-ccccCCCcccccccccce Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKK-LRFAQFADIDSTLVGQPGDTLTFPAFTYSGDA-QVIAEGEKIPVDQIGTSK 78 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~-~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a-~~~~eg~~~~~~~~~~~~ 78 (274) ||.-+....+.+. .-+...+.+.+... .-+..++.. ..+.+.+=+...++..|.. +|++ +....++.... T Consensus 1 M~ii~~~~L~~l~-~~~~~~f~~~~~~a~~~~~~iA~~----VpSt~~~~tY~WLg~~P~mreWiG---~r~i~~l~~~~ 72 (304) T protein:vir:79 1 MAIITPALISALK-TSFQKHFQDALATAPSTYLQVATV----IPSTTASNTYGWLGQFPKLREWIG---QRVIKDMAAQG 72 (304) T ss_pred CCccCHHHHHHHH-HHHHHHHHHHHhhcCcccceeEeE----eecCccccccchhcccccchhhhh---hhhhhhhhhcc Confidence 8743322111111 01222222222111 001111111 1111222222333333333 3443 23455666666 Q ss_pred eEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc------------------------- Q lcl|Aclame:pro 79 REAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL------------------------- 133 (274) Q Consensus 79 ~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~------------------------- 133 (274) .+++=+++-.-+.|.+.++++....+...+.+++++..+..=|..+++.+..+.. 
T Consensus 73 y~I~Nk~fE~Tv~V~R~dIEDD~~Giy~p~~~~~G~~aa~~Pd~lvf~lL~~Gf~t~CyDGq~FFdtDHpv~~~~d~~g~ 152 (304) T protein:vir:79 73 YQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDGTGT 152 (304) T ss_pred ceeeccccccceeeccccccccccCchHHHHHHHHHHHhcCchhhHHHHHHhhhcccCCCcccccccCCccccccccccc Confidence 6777677777789999999999999999999999999998888877754431000 Q ss_pred ---------------------------------------------------------------------------cccCc Q lcl|Aclame:pro 134 ---------------------------------------------------------------------------TVEAD 138 (274) Q Consensus 134 ---------------------------------------------------------------------------~~~~~ 138 (274) .++.+ T Consensus 153 ~~~vsn~~~~~~~~g~~w~LlD~sr~iKP~I~Q~Rk~~~~~~~~~~~d~~Vf~~~e~~yGvd~R~n~GygfWQlA~gS~a 232 (304) T protein:vir:79 153 ATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSLTKEDNEQVFMADEYVYGVRSRCNVGFGFWQLAAMSTE 232 (304) T ss_pred cccceeeccCCCCCCCeEEEEeCCCcccceeeeccccceeeecCCCCchhhhhhcceEEeeeeeeccchhhhhhhhhcCC Confidence 00234 Q ss_pred ccCHHHHHHHHHHHhhc--------CCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcc-eeeEEcCC Q lcl|Aclame:pro 139 ITKLDGLQTAIDKFNDE--------DLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALG-AVIVRSNK 209 (274) Q Consensus 139 ~~~~d~iv~a~~~l~~~--------~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G-~~Vv~s~~ 209 (274) +++.+.+-.|+.++... +..|+.+||.|+....=++.-..+.. .+|...-+.| +.+++++. T Consensus 233 ~Ls~~nl~aAr~aMr~qk~d~G~pL~I~P~~LvVPp~LE~~A~~ll~a~~~----------~~G~tNp~~g~~eliV~P~ 302 (304) T protein:vir:79 233 ELNQVNFEKVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKEVVGVQRL----------ANGADNPNFELVQVLDTAW 302 (304) T ss_pred ccchHHHHHHHHHHHhhcCCCCceeccccCEEEecchhHHHHHHHHhhhhc----------CCCCcceecceEEEEeecc Confidence 56667777777776432 23677899999865543322111111 1122223555 68888888 Q ss_pred CC Q lcl|Aclame:pro 210 LN 211 (274) Q Consensus 210 ~p 211 (274) +. 
T Consensus 303 Ld 304 (304) T protein:vir:79 303 LN 304 (304) T ss_pred cC Confidence 87 No 248 >protein:vir:99228 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:776 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950457;genbank:gi:119953658;genbank:GeneID:4643088 Probab=76.46 E-value=0.14 Score=25.35 Aligned_cols=193 Identities=13% Similarity=0.132 Sum_probs=101.6 Q ss_pred CCccccchhhccchHHHHHHHHHHHHHh-hhhcccccccccccccCCCEEEEEeecCCCCc-ccccCCCcccccccccce Q lcl|Aclame:pro 1 MAQGTTKVSNLIVPEVLAPMMQAELDKK-LRFAQFADIDSTLVGQPGDTLTFPAFTYSGDA-QVIAEGEKIPVDQIGTSK 78 (274) Q Consensus 1 ma~~~T~~~~~~iPe~~~~~v~~~~~~~-~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a-~~~~eg~~~~~~~~~~~~ 78 (274) ||.-+....+.+. .-+...+.+.+... .-+..++.. ..+.+.+=+...++..+.. +|++ +....++.... T Consensus 1 M~ii~~~~L~~l~-~~~~~~f~~~~~~a~~~~~~iA~~----VpSt~~~~~Y~WLg~~P~mreWiG---~r~i~~l~~~~ 72 (304) T protein:vir:99 1 MAIITPALISALK-TSFQKHFQDALATAPSTYLQVATV----IPSTTASNTYGWLGQFPKLREWIG---QRVIKDMAAQG 72 (304) T ss_pred CCccCHHHHHHHH-HHHHHHHHHHHhhcCcccceeEeE----eecCccccccchhcccccchhhhh---hhhhhhhhhcc Confidence 7743322111111 01222222222111 001111111 1111222222333333333 3443 23455666666 Q ss_pred eEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc------------------------- Q lcl|Aclame:pro 79 REAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL------------------------- 133 (274) Q Consensus 79 ~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~------------------------- 133 (274) .+++=+++-.-+.|.+.++++....+...+.+++++..+..=|..+++.+..+.. 
T Consensus 73 y~I~Nk~fE~Tv~V~R~dIEDD~~Giy~p~~~~~G~~aa~~Pd~lvf~lL~~Gf~t~CyDGq~FFdtDHpv~~~~dg~g~ 152 (304) T protein:vir:99 73 YQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDGTGT 152 (304) T ss_pred ceeeccccccccccccccccccccCchHHHHHHHHHHHhcCchhhHHHHHHhhhcccCCCcccccccCCcccccccccCc Confidence 6777677777789999999999999999999999999998888877754421000 Q ss_pred -------------c--------------------------------------------------------------ccCc Q lcl|Aclame:pro 134 -------------T--------------------------------------------------------------VEAD 138 (274) Q Consensus 134 -------------~--------------------------------------------------------------~~~~ 138 (274) . ++.+ T Consensus 153 ~~~vsn~~~~~~~~g~~w~Lld~~r~iKP~I~Q~Rk~~~~~~~~~~~d~~Vf~~~e~~yGvd~R~n~GygfWQlA~gS~a 232 (304) T protein:vir:99 153 ATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMSTE 232 (304) T ss_pred ccccceeccCCCCCCCcEEEEeCCCCccceeeeccccceeeeccCCCchhhhhhcceeEeeeeeeccchhhhhhhhhcCC Confidence 0 0234 Q ss_pred ccCHHHHHHHHHHHhhc--------CCCccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcc-eeeEEcCC Q lcl|Aclame:pro 139 ITKLDGLQTAIDKFNDE--------DLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALG-AVIVRSNK 209 (274) Q Consensus 139 ~~~~d~iv~a~~~l~~~--------~~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G-~~Vv~s~~ 209 (274) +++.+.+-.|+.++... +..|+.+||.|+....=++.-..+.. .+|...-+.| +.+++++. T Consensus 233 ~Lt~~nl~aAr~aMr~qk~d~G~pL~I~P~~LvVPp~LE~aA~~ll~a~~~----------~~G~tNp~~g~~eliV~P~ 302 (304) T protein:vir:99 233 ELNTANFEKVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKEVVGVQRL----------ANGADNPNFELVQVLDTAW 302 (304) T ss_pred CcChHHHHHHHHHHHhhcCCCCceeccccCeEEecchHHHHHHHHHhhhcc----------CCCCcceecceEEEEeecc Confidence 56677777777776432 23677899999865543322111111 1122223555 68888888 Q ss_pred CC Q lcl|Aclame:pro 210 LN 211 (274) Q Consensus 210 ~p 211 (274) +. 
T Consensus 303 Ld 304 (304) T protein:vir:99 303 LN 304 (304) T ss_pred cC Confidence 88 No 249 >protein:vir:1153 Length: 338 # NCBI annotation: predicted major capsid protein # Family: family:all:201 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490602;genbank:gi:17313222;genbank:GeneID:927319 Probab=61.61 E-value=0.35 Score=23.08 Aligned_cols=256 Identities=12% Similarity=0.083 Sum_probs=114.0 Q ss_pred CC--ccccchhh--ccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccC-CCccc--ccc Q lcl|Aclame:pro 1 MA--QGTTKVSN--LIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAE-GEKIP--VDQ 73 (274) Q Consensus 1 ma--~~~T~~~~--~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~e-g~~~~--~~~ 73 (274) +| +.+...+. .+.|. ....+.+++.+++.|-+..++.. .....|..|-+-..+...+-.+... ++-.| ... T Consensus 16 ~A~~ngv~~~~~~FsV~P~-v~q~L~~~i~ess~FL~~Invv~-V~e~~Ge~v~lg~~g~iagrtdT~~~~~R~~~~~~~ 93 (338) T protein:vir:11 16 LAKLNGVNSAVQTFAVEPS-VQQKLEQRIQESSEFLKQINVYG-VDELQGEKIGIGVSGTIASRTDTTGDGVRKPRDVSA 93 (338) T ss_pred HHHHhCCCcccceeeeCHH-HHHHHHHHHHHHHHhhccCceec-ccceeeeEeeeccCccccccccCCCCCccccccccc Confidence 33 33332222 24444 66677777777776666554432 1223344443322222111111111 11122 223 Q ss_pred cccceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHH--Hhc---------------------- Q lcl|Aclame:pro 74 IGTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLE--ALK---------------------- 129 (274) Q Consensus 74 ~~~~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~--~~~---------------------- 129 (274) +.-....+.-.....++++...+.=-..+||...+++.+.+.++. |...|+ +.. 
T Consensus 94 l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~AL--D~i~IGfnG~s~A~~Td~~~nPllqDVNkGWlQ 171 (338) T protein:vir:11 94 LDNQRYECKHTDFDTAITYAMLDAWAKFPEFQALLRDAILKRQAL--DRLMIGFNGTSAAATTNRAANPLLQDVNIGWFQ 171 (338) T ss_pred cCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhh--chhhhcccceeeccCCChhhCcCccccchhHHH Confidence 444444555555556666666665556678998888888888874 443331 110 Q ss_pred ------------ccccc-----cc---CcccCHHHHH-HHHHHH-hhcCC-C-ccEEEEcHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 130 ------------GATLT-----VE---ADITKLDGLQ-TAIDKF-NDEDL-E-PMVLFVNPLDAGGLRTSASDNFTRPTQ 185 (274) Q Consensus 130 ------------~a~~~-----~~---~~~~~~d~iv-~a~~~l-~~~~~-~-~~~~v~~p~~~~~L~~~~~~~~~~~~~ 185 (274) ....+ .. ++--+.|.++ |+...| .+... . .-+++|+.+..+.=. .......+ T Consensus 172 ~~Re~ap~rv~~~~~~~~~i~i~~g~~gdy~nLDalV~d~~~~lI~~~~~~d~dLVvivG~dLladk~----~~l~n~~~ 247 (338) T protein:vir:11 172 QYRNNAPARVLKEGKTTGKVVVGNGADADYKNLDALVFDVVSSLIDPWHRRDPGLVVILGRELVHDKY----FPMVNKDQ 247 (338) T ss_pred HHHhhhhhhhhhcccccceeeecCCCCCccccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHH----hHHHhcCC Confidence 00000 00 1123455543 566533 44332 3 347788877544210 00111111 Q ss_pred cccccccccc---cchhcceeeEEcCCCCcceEEEEcCCeEEEEeccCcee-eeccccccCcc-------EEEEEEEEE- Q lcl|Aclame:pro 186 LGDNIIVKGA---FGEALGAVIVRSNKLNKGEALLAKKGAVKLITKRDFFL-EKDRDASRKST-------ALYSDKHYV- 253 (274) Q Consensus 186 ~~~~~~~~g~---~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~~~~~~~v-e~~r~~~~~~~-------~i~~~~~~~- 253 (274) ...+.+.... ..++.|+|.+.-+.+|.+..++..-..+-+..+.+..- .....+++... 
.=++...|+ T Consensus 248 ~ptE~~Aa~~~~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~s~Ne~YvVEd~~~ 327 (338) T protein:vir:11 248 PATEKIATDLILSQKRMGGLPPVEVPYVPEKGLMVTTLKNLSLYWQIGGRRRYLKEVPEKNRIENYESSNDAYVVEDYGL 327 (338) T ss_pred ChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccccchhhhccceeeecccc Confidence 1111122221 24799999999999999999887655544333333211 11111222111 112222222 Q ss_pred EEEEcCcceEE Q lcl|Aclame:pro 254 AYLYDESKVVK 264 (274) Q Consensus 254 ~~v~~~~avv~ 264 (274) +.+++.-.++. T Consensus 328 ~a~ieni~~~~ 338 (338) T protein:vir:11 328 GCLVENIEVAE 338 (338) T ss_pred EEEeecceecC Confidence 22333222222 No 250 >protein:vir:79157 Length: 339 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165257;genbank:gi:145708082;genbank:GeneID:5247168 Probab=57.54 E-value=0.43 Score=22.58 Aligned_cols=258 Identities=11% Similarity=0.084 Sum_probs=114.6 Q ss_pred CC--ccccchhh--ccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCccc--ccCCCccccccc Q lcl|Aclame:pro 1 MA--QGTTKVSN--LIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQV--IAEGEKIPVDQI 74 (274) Q Consensus 1 ma--~~~T~~~~--~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~--~~eg~~~~~~~~ 74 (274) +| +.+...+. .+.|. ....+.+++.+++.|-+..++.. 
.....|..|-+-..+...+-.+ -.+..+.....+ T Consensus 16 ~A~~ngv~~~~~~FsV~P~-v~q~L~~~i~ess~FL~~INvv~-V~e~~Ge~v~lg~~g~iagrtdt~~~~R~~~~~~~l 93 (339) T protein:vir:79 16 IAKLNGVERVDEKFSVAPS-VQQKLETKVQESSDFLKSINFYG-VPEQEGEKIGLGVSGPVASTTDTTQQDRETSDISTM 93 (339) T ss_pred HHHHhCcccccceeeecHH-HHHHHHHHHHHHHHHhccCcccc-cccceeeEEeeccCcceeecccCCCCCccccccccc Confidence 22 22222222 13444 56666777777776666554432 1223344444322221111111 112222222344 Q ss_pred ccceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHH--H------------------------- Q lcl|Aclame:pro 75 GTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLE--A------------------------- 127 (274) Q Consensus 75 ~~~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~--~------------------------- 127 (274) .-....+.-.....++++...+.=-..+||...+++.+.+.+|. |...|+ + T Consensus 94 ~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~AL--D~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~ 171 (339) T protein:vir:79 94 DGRRYRCEQTNSDTHITYQKLDAWAKFADFQTRIRDAIIKRQAL--DRIMIGFNGVSRAATSDRVANPMLQDVNKGWLQN 171 (339) T ss_pred CCCccEEEEeeeeceecHHHHHHHhcChhHHHHHHHHHHHHHhh--ccceecccceeeecCCChhhCcCccccchhHHHH Confidence 44444555555556666666655555678888888888888764 333221 1 Q ss_pred h---------ccccc-c------cc-CcccCHHHH-HHHHHHH-hhcCC-C-ccEEEEcHHHHHHHHhhhcccccccccc Q lcl|Aclame:pro 128 L---------KGATL-T------VE-ADITKLDGL-QTAIDKF-NDEDL-E-PMVLFVNPLDAGGLRTSASDNFTRPTQL 186 (274) Q Consensus 128 ~---------~~a~~-~------~~-~~~~~~d~i-v~a~~~l-~~~~~-~-~~~~v~~p~~~~~L~~~~~~~~~~~~~~ 186 (274) + ..... . +. ++--+.|.+ .|+...| .+... . .-+++|..+..+.=. .+. ....+. 
T Consensus 172 ~Re~ap~rV~~~g~~~s~~i~~~G~ggdy~NLDalV~d~~~~lId~~~~~d~dLVvivG~dLla~k~-~~l---~n~~~~ 247 (339) T protein:vir:79 172 LREQAPQRVMKEGKAAAGKITVGGAGADYGNLDALVYDITNHLVEPWYAEDPDLVVVCGRNLLSDKY-FPL---VNRDRD 247 (339) T ss_pred HHhhhhhhhhccceeccceeEeccCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhhHh-hhH---hhcCCC Confidence 0 10000 0 11 112245554 4566533 44332 3 356778877654311 010 101111 Q ss_pred ccccccccc---cchhcceeeEEcCCCCcceEEEEcCCeEEEEeccCce-----eeeccc--cccCc-cEEEEEEEEEE- Q lcl|Aclame:pro 187 GDNIIVKGA---FGEALGAVIVRSNKLNKGEALLAKKGAVKLITKRDFF-----LEKDRD--ASRKS-TALYSDKHYVA- 254 (274) Q Consensus 187 ~~~~~~~g~---~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~~~~~~~-----ve~~r~--~~~~~-~~i~~~~~~~~- 254 (274) -.+.+.... ..++-|+|.+.-+.+|.+..++..-+.+-+..+.+.. -+.+|+ +.+.+ ..=++...|++ T Consensus 248 ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~s~Ne~YvVEd~~~~ 327 (339) T protein:vir:79 248 PVQQIAADLIISQKRIGNLPAIRVPYFPANGLLVTRLDNLSIYYQEGGRRRTILDNAKRDRIENYESSNDAYVIEDLACA 327 (339) T ss_pred hHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEeccccccccchhhccceeeeeccccE Confidence 111122222 2479999999999999999988765554433332221 111222 11111 11122233332 Q ss_pred EEEcCcceEEEEeCC Q lcl|Aclame:pro 255 YLYDESKVVKITKGA 269 (274) Q Consensus 255 ~v~~~~avv~l~~~a 269 (274) ..++ =+++..+| T Consensus 328 a~iE---ni~~~~aa 339 (339) T protein:vir:79 328 AMAE---NIALAAAA 339 (339) T ss_pred EEee---eeecccCC Confidence 2222 13444444 No 251 >protein:vir:78186 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111152;genbank:gi:134288735;genbank:GeneID:4960646 Probab=56.18 E-value=0.46 Score=22.42 Aligned_cols=257 Identities=12% Similarity=0.081 Sum_probs=114.6 Q ss_pred CC--ccccchhhc--cchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCc--cccccc Q lcl|Aclame:pro 1 MA--QGTTKVSNL--IVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEK--IPVDQI 74 (274) Q Consensus 1 
ma--~~~T~~~~~--~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~--~~~~~~ 74 (274) +| +.+...+.- +.|. ....+.+++.+++.|-+..++.. .....|..|-+-..+...+-.+-+.+.- ....++ T Consensus 16 ~A~~ngv~~~~~~FsV~P~-v~q~L~~~i~ess~FL~~INvv~-V~e~~Ge~v~lg~~g~iagrtdt~~~~R~~~~~~~l 93 (337) T protein:vir:78 16 IAKLNDTGDVSKKFAVEPT-VQQRLETKMQESSEFLKRINVLP-VTELEGEKLGLSVSGPIASRTDTTKAARQPIDPTAL 93 (337) T ss_pred HHHhcChhhhcceeecChH-HHHHHHHHHHHHHHHhccCCccc-cccceeeEEecccCcceeeeecCCCccccccccccc Confidence 22 333333332 4444 56667777777776666555432 1223344443322222111111122222 222334 Q ss_pred ccceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHH--Hhcc---------------------- Q lcl|Aclame:pro 75 GTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLE--ALKG---------------------- 130 (274) Q Consensus 75 ~~~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~--~~~~---------------------- 130 (274) +-....+.-.....++++...+.=-..+||...+++.+.+.+|. |...|+ +..- T Consensus 94 ~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~AL--D~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~ 171 (337) T protein:vir:78 94 DSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGAL--DRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQ 171 (337) T ss_pred CCCccEEEEeceecccCHHHHHHHhcChhHHHHHHHHHHHHHhh--ccceecccceeeccCCChhhCcCccccchHHHHH Confidence 44444555555556666666665556678888888888888764 333321 1100 Q ss_pred ----cc---------cc-----cc-CcccCHHHH-HHHHHH-HhhcC--CCccEEEEcHHHHHHHHhhhccccccccccc Q lcl|Aclame:pro 131 ----AT---------LT-----VE-ADITKLDGL-QTAIDK-FNDED--LEPMVLFVNPLDAGGLRTSASDNFTRPTQLG 187 (274) Q Consensus 131 ----a~---------~~-----~~-~~~~~~d~i-v~a~~~-l~~~~--~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~ 187 (274) ++ .. +. ++--+.|.+ .|+... +.+.. ...-+++|..+..+.=. 
.+ .....+.- T Consensus 172 ~Re~ap~rVl~~~~~~~~~i~iG~~gdy~NLDalV~d~~~~lI~~~~~~d~dLVvivG~dLladk~-~~---l~n~~~~p 247 (337) T protein:vir:78 172 YRERAAQRVLHEGAKQAGKVLIGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKY-FP---IVNATQAP 247 (337) T ss_pred HHhcchhhhhccccccCCceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHH-HH---HHhcCCCc Confidence 00 00 01 112245554 456654 34433 23457788887655311 01 01111111 Q ss_pred cccccccc---cchhcceeeEEcCCCCcceEEEEcCCeEEEEeccCce-eeeccccccCcc-------EEEEEEEEEEE- Q lcl|Aclame:pro 188 DNIIVKGA---FGEALGAVIVRSNKLNKGEALLAKKGAVKLITKRDFF-LEKDRDASRKST-------ALYSDKHYVAY- 255 (274) Q Consensus 188 ~~~~~~g~---~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~~~~~~~-ve~~r~~~~~~~-------~i~~~~~~~~~- 255 (274) .+.+.... ..++-|+|.+.-+.+|.+..++..-+.+-+..+.+.. -.....+++... .=++...|++. T Consensus 248 tE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~s~Ne~YvVEd~~~~a 327 (337) T protein:vir:78 248 TERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIENYESSNDAYVVEDFGCGC 327 (337) T ss_pred HHHHHHHHHHHhhhhcCcceEEccccCCCceEEeechhcEEEEecCcEEEEEEeccccccccchhhccceeeeeccccEE Confidence 11122111 2479999999999999999988765554433332221 111111222111 11222222222 Q ss_pred EEcCcceEEEEeCCC Q lcl|Aclame:pro 256 LYDESKVVKITKGAG 270 (274) Q Consensus 256 v~~~~avv~l~~~aa 270 (274) .++ -|+.+.| T Consensus 328 ~iE-----nI~~~~a 337 (337) T protein:vir:78 328 VAE-----NIELAAA 337 (337) T ss_pred EEe-----ceeecCC Confidence 221 1333333 No 252 >protein:vir:96442 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:11266 # MgeID: mge:1616 # MgeName: 119X # Cross-refs: genbank:acc:YP_001218814;genbank:gi:147917331;genbank:GeneID:5142645 Probab=54.18 E-value=0.51 Score=22.19 Aligned_cols=262 Identities=16% Similarity=0.082 Sum_probs=115.3 Q ss_pred CC-ccccch----hh---ccchHHHHHHHHHHHHHhhhhccccccc--c---cccccCCCEEEEEeecCCCCcccccCCC Q lcl|Aclame:pro 1 MA-QGTTKV----SN---LIVPEVLAPMMQAELDKKLRFAQFADID--S---TLVGQPGDTLTFPAFTYSGDAQVIAEGE 67 (274) Q Consensus 1 
ma-~~~T~~----~~---~~iPe~~~~~v~~~~~~~~~~~~l~~~~--~---~~~~~~G~~v~ip~~~~~~~a~~~~eg~ 67 (274) |- +..+.+ ++ +-+++--. +.+. .++..+ . .+....|+++++-+=...-.++.++.|. T Consensus 61 l~~~~~~~ta~~~a~~T~i~V~~~~~------f~~~----~l~~~~~~~EvirVtsVng~~lTV~RG~~~t~aa~iaag~ 130 (418) T protein:vir:96 61 MVFASAVVTAEALADATVLTVENSDG------LTKG----MIFYNEATGENMRLELVNGLNLTVKRQTGRIAAAIIAANT 130 (418) T ss_pred eeeeeEEEEEEEecCceEEEecCCcc------cccc----cEEEEecCCeEEEEEEEeCCEEEEEEccCCeeeeeeecCc Confidence 22 111111 11 22222100 2222 222111 1 1122358888887743333344555544 Q ss_pred -------cccccccccceeEEeehh-------hhcchhccHHHHhc----cCccHHHHHHHHHHHHHHHHHHHHHHHHh- Q lcl|Aclame:pro 68 -------KIPVDQIGTSKREAKVRK-------IGKGTELTDEAVLS----GFGDPQGEAVRQHGLAIANKVDNDVLEAL- 128 (274) Q Consensus 68 -------~~~~~~~~~~~~~~~~~~-------~~~~~~is~e~~~~----s~~d~~~~~~~~~a~~~a~~~d~~~i~~~- 128 (274) .+++..-..+....+..+ +...+.+|+-.... ...+......++|-.. ..++|++++..- T Consensus 131 ~~~~ig~~~eEGsd~~ta~~~k~~~vsN~tQIf~e~vsVSgTAqA~v~qaGvsn~~~~e~d~l~~~-kv~iE~ali~g~~ 209 (418) T protein:vir:96 131 KLIVIGTAFEEGSQRPTARSIQPVYVPNFTQIFRNAWALTDTARASYAEAGYSNITESRRDCMDFH-ATEQETAIFFGQA 209 (418) T ss_pred eEEEeecCcccccccCCcceecceeccchhheehhhhhhhhhhhhhhhhcCcchhHHHHHHHHHHH-HHHHHHhhhcccc Confidence 234432222221111111 12334455543221 2223333333333333 345666665322 Q ss_pred -----cccc-------------------c-cccCcccCHHHHHHHHHHHhh--cCC--Cc----cEEEEcHHHHHHHHhh Q lcl|Aclame:pro 129 -----KGAT-------------------L-TVEADITKLDGLQTAIDKFND--EDL--EP----MVLFVNPLDAGGLRTS 175 (274) Q Consensus 129 -----~~a~-------------------~-~~~~~~~~~d~iv~a~~~l~~--~~~--~~----~~~v~~p~~~~~L~~~ 175 (274) ++.+ . +.....+++|.++++....-. .+. .. ..+.+++++...|-+. 
T Consensus 210 ~~~~~ng~p~~~t~R~m~gI~~f~~~Nvi~ag~~~~~t~d~L~~~~~~a~~~g~n~G~~~~~~~y~~~V~a~~k~~I~k~ 289 (418) T protein:vir:96 210 FMGTYNGQPLHTTQGIVDAIRQYAPDNVNAMPNPTAVTYDDVVDATIDAFKWSVNVGDNTQRVMFCDTVGMRTMQDIGRF 289 (418) T ss_pred ccCCCCCcccccccchhHHHHhhccccccccCCCCcCCHHHHHHHHHHHHhhcCCCCCcccceEEEEEeChHHHHHHhhh Confidence 1100 0 112234689999998766432 111 11 4478899998887654 Q ss_pred hccccccccccccccccccccchhcc-eeeEEcCCC-----CcceEEEEcCCeEEE--EeccCceeeeccccc------- Q lcl|Aclame:pro 176 ASDNFTRPTQLGDNIIVKGAFGEALG-AVIVRSNKL-----NKGEALLAKKGAVKL--ITKRDFFLEKDRDAS------- 240 (274) Q Consensus 176 ~~~~~~~~~~~~~~~~~~g~~~~i~G-~~Vv~s~~~-----p~~~~~l~~~~a~~~--~~~~~~~ve~~r~~~------- 240 (274) ...=....++..-+.+.+... +-+| ++|+.++++ |+|+.++++++.+.. ...++...|..--.. T Consensus 290 ~~~I~~~~~en~~G~vv~~~~-Td~G~v~ii~n~~~pad~I~~g~mlVvD~~~vkL~yL~~R~~~~E~l~k~G~~~~~~~ 368 (418) T protein:vir:96 290 FGEVTVTQRETSYGMVFTEWK-FFKGRLIIKEHPLFSAIGISPGFAVVVDVPAVKLAYMDGRNAKVENYGQGGGENKSGA 368 (418) T ss_pred hceeEeccccceeceEEEEEE-eeccEEEEEecCCCCccccCcceEEEEecCceEEEEecCCCccchhcccCCCcccccc Confidence 321011112222222333222 2335 588888865 556679999998764 444665554331111 Q ss_pred ------cCccEE--EEEEEEEEEEEcCcceEEEEe--CCCcccC Q lcl|Aclame:pro 241 ------RKSTAL--YSDKHYVAYLYDESKVVKITK--GAGDEVM 274 (274) Q Consensus 241 ------~~~~~i--~~~~~~~~~v~~~~avv~l~~--~aa~~~~ 274 (274) .+.|.. ....-+..++++|.+.++|+. 
+|--.|- T Consensus 369 ~~~~~~~~~D~~~G~l~~Eltle~~N~~a~a~itgl~~~~~~~~ 412 (418) T protein:vir:96 369 TDYSYGHGVDAQGGSLTSEWALELLNPQGCAVITGLQKAKERVY 412 (418) T ss_pred cccccccccccccCEEEEEEEEEeecccccEEeecccccccccc Confidence 111222 234456788999999888763 1111121 No 253 >protein:vir:100331 Length: 342 # NCBI annotation: major capsid protein N # Family: family:all:201 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655472;genbank:gi:109289940;genbank:GeneID:4157374 Probab=52.21 E-value=0.56 Score=21.96 Aligned_cols=261 Identities=11% Similarity=0.089 Sum_probs=113.6 Q ss_pred CC--cccc----chh-h-ccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCccccc---CCCcc Q lcl|Aclame:pro 1 MA--QGTT----KVS-N-LIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIA---EGEKI 69 (274) Q Consensus 1 ma--~~~T----~~~-~-~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~---eg~~~ 69 (274) +| +++. ..+ . .+.|. ....+.+++.+++.|-+..++..- ....|..|-+-..+...+-.+.. |.... T Consensus 16 ~A~~ngv~~~~~~~~~~FsV~P~-v~q~L~~~i~ess~FL~~INvv~V-~e~~Ge~i~lg~~g~iagrtdT~~~~~R~~~ 93 (342) T protein:vir:10 16 QAELNNLPFNALATGIKFTVQPS-VQQKLYEKVRESSDFLKSISFVFV-DEQTGETLGLDSAHTVASTTDTSGDGERKTT 93 (342) T ss_pred HHHHhCCChhHccccceeecChH-HHHHHHHHHHHHHHHhccCccccc-ccceeeEEecccCcccccccccCCCCCcccc Confidence 32 2221 111 1 13444 666777777777777665554321 22334444432222221111111 22222 Q ss_pred cccccccceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHH--Hhc------------------ Q lcl|Aclame:pro 70 PVDQIGTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLE--ALK------------------ 129 (274) Q Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~--~~~------------------ 129 (274) ....++-....+.-.....++++...+.=...+||...+++.+.+.+|. |...|+ +.. 
T Consensus 94 ~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~AL--D~i~IGfNGts~A~~Td~~~nPllqDVN~ 171 (342) T protein:vir:10 94 SIAKLVKQTYHCQQINFDTHINYKQLDMWAKFPDFQQKVANVAAKQRKR--DLIMIGFNGTSRAATSDRNSNPLLQDVAK 171 (342) T ss_pred cccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhh--ccceecccceeeccCCChhhCcCccccch Confidence 3334444445555555566666666665556678888888888888764 333321 110 Q ss_pred ----------------ccccc-----cc-CcccCHHHHH-HHHHHH-hhcC--CCccEEEEcHHHHHHH--Hhhhccccc Q lcl|Aclame:pro 130 ----------------GATLT-----VE-ADITKLDGLQ-TAIDKF-NDED--LEPMVLFVNPLDAGGL--RTSASDNFT 181 (274) Q Consensus 130 ----------------~a~~~-----~~-~~~~~~d~iv-~a~~~l-~~~~--~~~~~~v~~p~~~~~L--~~~~~~~~~ 181 (274) +...+ +. ++--+.|.++ ||...| .+.. ...-+++|..+..+.= ...+. .. T Consensus 172 GWlQ~~Re~ap~rv~~~~~~~~~i~iG~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLladk~~~l~n~--~~ 249 (342) T protein:vir:10 172 GWLQKMREDAKERVMNGESTDNQVLVGKGQEYANLDALVMDATEELIDEWHRDDTDLVVITGRKLLADKYFPIVNQ--QN 249 (342) T ss_pred HHHHHHHhhhhhhhcccceeccceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHHHHHHhc--CC Confidence 00000 01 1223455554 566543 4433 2345778888765531 11110 00 Q ss_pred cccccccccccccc---cchhcceeeEEcCCCCcceEEEEcCCeEEEEeccCce-eeeccccccCccEEEEEEEEEEEEE Q lcl|Aclame:pro 182 RPTQLGDNIIVKGA---FGEALGAVIVRSNKLNKGEALLAKKGAVKLITKRDFF-LEKDRDASRKSTALYSDKHYVAYLY 257 (274) Q Consensus 182 ~~~~~~~~~~~~g~---~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~~~~~~~-ve~~r~~~~~~~~i~~~~~~~~~v~ 257 (274) .++ +.+.... ..++-|+|.+.-+.+|.+..++..-..+-+..+.+.. 
-.....+++....-+-..--|..|- T Consensus 250 ~pt----E~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~s~Ne~YvVE 325 (342) T protein:vir:10 250 APT----EELAADIVISQKRIGGLKAVRVPFFPANAILITKLENLAIYVQEGTTRKHIENVPKKDRIETYESENIDYVVE 325 (342) T ss_pred ChH----HHHHHHHHHhhhhhcCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccccchhhhccceeee Confidence 111 1111111 2479999999999999999988765554433322221 1111111211110000000011111 Q ss_pred cCcceEE---EEeCCCc Q lcl|Aclame:pro 258 DESKVVK---ITKGAGD 271 (274) Q Consensus 258 ~~~avv~---l~~~aa~ 271 (274) ++.+.+. ++.+-|. T Consensus 326 d~~~~a~iE~i~i~~~~ 342 (342) T protein:vir:10 326 DYGCAALIENITLKDKE 342 (342) T ss_pred ccccEEEeecceecCCC Confidence 2221111 2222222 No 254 >protein:vir:1829 Length: 355 # NCBI annotation: major capsid protein # Family: family:all:201 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052253;genbank:gi:9634060;genbank:GeneID:1262428 Probab=49.25 E-value=0.64 Score=21.63 Aligned_cols=259 Identities=13% Similarity=0.096 Sum_probs=114.5 Q ss_pred CC--ccccc--hhh--ccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccc---cCCCcccc Q lcl|Aclame:pro 1 MA--QGTTK--VSN--LIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVI---AEGEKIPV 71 (274) Q Consensus 1 ma--~~~T~--~~~--~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~---~eg~~~~~ 71 (274) +| +.++. .+. .+.|. ....+.+++++++.|-+..++.. .....|..|-+-..+...+-.+. .|+..... 
T Consensus 16 ~A~~ngv~~~~~~~~Fsv~P~-v~q~L~~~i~ess~FL~~INvv~-V~e~~Ge~i~lgv~g~iagrtdT~~~~~R~~~~~ 93 (355) T protein:vir:18 16 LAKLNGISVDDVSKKFTVEPS-VTQTLMNTVQASSAFLQMINILP-VAEMKGEKIGVGVTGTIASTTDTSGDKERQTADF 93 (355) T ss_pred HHHHhCCChhHccceeccCHH-HHHHHHHHHHHHHHHhhcCceec-cccceeeEEeeccCcceeeccccCCCCCcccccc Confidence 22 22221 111 23444 56666677777766655554422 12233444433222211111111 13333333 Q ss_pred cccccceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHH--Hhccccc---------------- Q lcl|Aclame:pro 72 DQIGTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLE--ALKGATL---------------- 133 (274) Q Consensus 72 ~~~~~~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~--~~~~a~~---------------- 133 (274) ..+.-....+.-.....++++...+.=-..+||...+++.+.+.++. |...|+ +..-+.+ T Consensus 94 ~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~AL--D~i~IGfNG~s~A~~Td~~~nPllqDVNkGW 171 (355) T protein:vir:18 94 TALESNKYECNQINFDFHLTYKRLDLWARFQDFQRRIRDAIVQRQAL--DFIMAGFNGTTRADTSDRVKNPMLQDVAVGW 171 (355) T ss_pred cccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhh--chhhhcccceeeeccCChhhCcCccccchhH Confidence 44444445555555555666665555445578888888888888864 443331 1100000 Q ss_pred ------------------------c-----cc-CcccCHHHHH-HHHHH-HhhcC--CCccEEEEcHHHHHH----HHhh Q lcl|Aclame:pro 134 ------------------------T-----VE-ADITKLDGLQ-TAIDK-FNDED--LEPMVLFVNPLDAGG----LRTS 175 (274) Q Consensus 134 ------------------------~-----~~-~~~~~~d~iv-~a~~~-l~~~~--~~~~~~v~~p~~~~~----L~~~ 175 (274) + +. ++--+.|.++ |+... +.+.. ...-+++|+.+..+. |.+. 
T Consensus 172 lQ~~Re~ap~rV~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~d~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~ 251 (355) T protein:vir:18 172 LQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENLDALVMDGTNTLIDEIYQDDPKLVAIVGRKLLADKYFPLVNK 251 (355) T ss_pred HHHHHhcchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHHhHHhhc Confidence 0 00 1112345543 56643 44432 234577888775442 2211 Q ss_pred hccccccccccccccccccc---cchhcceeeEEcCCCCcceEEEEcCCeEEEEeccCcee-----eeccc--cccCc-c Q lcl|Aclame:pro 176 ASDNFTRPTQLGDNIIVKGA---FGEALGAVIVRSNKLNKGEALLAKKGAVKLITKRDFFL-----EKDRD--ASRKS-T 244 (274) Q Consensus 176 ~~~~~~~~~~~~~~~~~~g~---~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~~~~~~~v-----e~~r~--~~~~~-~ 244 (274) . ..++ +.+.... ..++.|+|.+.-+.+|.+..++..-..+-+..+.+..- +.+|+ +.+.+ . T Consensus 252 ~----~~pt----E~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~s~N 323 (355) T protein:vir:18 252 Q----QENT----ESLAADIIISQKRIGNLPAVRVPYFPANAVFVTTLENLSIYFMDESHRRSIDENPKKDRVENYESMN 323 (355) T ss_pred c----CChH----HHHHHHHHHHHHhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccccchhhhc Confidence 0 0111 1122222 24799999999999999999887655544333333211 11221 11111 1 Q ss_pred EEEEEEEEEE-EEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 245 ALYSDKHYVA-YLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 245 ~i~~~~~~~~-~v~~~~avv~l~~~aa~~~~ 274 (274) .=++...|++ .+++ -+++....+.+.- T Consensus 324 e~YvVEd~~~~a~ie---ni~~~~~~~~~~~ 351 (355) T protein:vir:18 324 IDYVVEAYAAGCLLE---NITLGDFTAPAAP 351 (355) T ss_pred ceeeeeccccEEEEe---eeeecCCCCcccc Confidence 1122233332 2232 1333322221111 No 255 >protein:vir:79171 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111033;genbank:gi:134288740;genbank:GeneID:4960690 Probab=47.68 E-value=0.69 Score=21.45 Aligned_cols=257 Identities=12% Similarity=0.078 Sum_probs=115.2 Q ss_pred CC--ccccchhh--ccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccc--ccc Q lcl|Aclame:pro 1 
MA--QGTTKVSN--LIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPV--DQI 74 (274) Q Consensus 1 ma--~~~T~~~~--~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~--~~~ 74 (274) +| +.+...+. .+.|. ....+.+++.+++.|-+..++.. .....|..|-+-..+...+-..-+.+.--|. .++ T Consensus 16 ~A~~ngv~~~~~~FsV~P~-v~q~L~~~i~ess~FL~~Invv~-V~e~~Ge~v~lg~~g~iagrt~t~~~~R~~~~~~~l 93 (337) T protein:vir:79 16 IAKLNDTGDVSKKFAVEPT-VQQRLETKMQESSEFLKRINVLP-VTELEGEKLGLSVSGPIASRTDTTKAARQPIDPTAL 93 (337) T ss_pred HHHhcChhhhcceeeecHH-HHHHHHHHHHHHHHhhccCceec-cccceeeEEeeccCcceeeeecCCCCcccccccccc Confidence 22 33332222 25553 66667777777776655554422 1123344443322221111111122222222 333 Q ss_pred ccceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHH--Hhcc---------------------- Q lcl|Aclame:pro 75 GTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLE--ALKG---------------------- 130 (274) Q Consensus 75 ~~~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~--~~~~---------------------- 130 (274) +-....+.-.....++++...+.=-..+||...+++.+.+.++. |...|+ +..- T Consensus 94 ~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~AL--D~i~IGfnG~s~A~~Td~~~nPllqDVNkGWlQ~ 171 (337) T protein:vir:79 94 DSNRYRCEKTDYDTAIPYRKLDAWAKFADFQQRIRDVILNQGAL--DRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQ 171 (337) T ss_pred CCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhh--chhhhcccceeeccCCChhhCcCccccchhHHHH Confidence 44444454445556666666665556678988888888888874 443331 1110 Q ss_pred ----cc---------cc-----c-cCcccCHHH-HHHHHHH-HhhcC--CCccEEEEcHHHHHHHHhhhccccccccccc Q lcl|Aclame:pro 131 ----AT---------LT-----V-EADITKLDG-LQTAIDK-FNDED--LEPMVLFVNPLDAGGLRTSASDNFTRPTQLG 187 (274) Q Consensus 131 ----a~---------~~-----~-~~~~~~~d~-iv~a~~~-l~~~~--~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~ 187 (274) ++ .. + .++--+.|. +.|+... +.+.. 
...-+++|..+..+.= ........+.- T Consensus 172 ~Re~ap~rV~~~~~~~~~~i~iG~~gdy~nLDalV~D~~~~lI~~~~~~d~~LVvivG~dLladk----~~~l~n~~~~p 247 (337) T protein:vir:79 172 YRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVAICGRELLHDK----YFPIVNATQAP 247 (337) T ss_pred HHhcchhhhhccccccCcceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHH----hhHHhccCCCc Confidence 00 00 0 011224555 3466654 34433 2345678887765421 11111111111 Q ss_pred cccccccc---cchhcceeeEEcCCCCcceEEEEcCCeEEEEeccCcee-eeccccccCcc-------EEEEEEEEEEE- Q lcl|Aclame:pro 188 DNIIVKGA---FGEALGAVIVRSNKLNKGEALLAKKGAVKLITKRDFFL-EKDRDASRKST-------ALYSDKHYVAY- 255 (274) Q Consensus 188 ~~~~~~g~---~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~~~~~~~v-e~~r~~~~~~~-------~i~~~~~~~~~- 255 (274) .+.+.... ..++.|+|.+.-+.+|.+..++..-..+-+..+.+..- .....+++... .=++...|++. T Consensus 248 tE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~s~Ne~YvVEd~~~~a 327 (337) T protein:vir:79 248 TERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIENYESSNDAYVVEDFGCGC 327 (337) T ss_pred HHHHHHHHHHHhhhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEEccccccccchhhccceeeeeccccEE Confidence 11122111 24799999999999999999887655544333333211 11111222111 11222222222 Q ss_pred EEcCcceEEEEeCCC Q lcl|Aclame:pro 256 LYDESKVVKITKGAG 270 (274) Q Consensus 256 v~~~~avv~l~~~aa 270 (274) .++ -|+.+.| T Consensus 328 ~ie-----nI~~~~a 337 (337) T protein:vir:79 328 VAE-----NIELAAA 337 (337) T ss_pred EEe-----ceeecCC Confidence 221 1333333 No 256 >protein:vir:104011 Length: 337 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293748;genbank:gi:72537718;genbank:GeneID:3608142 Probab=46.02 E-value=0.75 Score=21.27 Aligned_cols=257 Identities=12% Similarity=0.085 Sum_probs=114.9 Q ss_pred CC--ccccchhh--ccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccccCCCcccc--ccc Q lcl|Aclame:pro 1 MA--QGTTKVSN--LIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPV--DQI 74 (274) Q 
Consensus 1 ma--~~~T~~~~--~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~~eg~~~~~--~~~ 74 (274) +| +.+...+. .+.|. ....+.+++.+++.|-+..++.. .....|..|-+-..+...+-..-+.+.--|. .++ T Consensus 16 ~A~~ngv~~~~~~FsV~P~-v~q~L~~~i~ess~FL~~Invv~-V~e~~Ge~v~lg~~g~iagrt~t~~~~R~~~~~~~l 93 (337) T protein:vir:10 16 IAKLNDTGDVSKKFAVEPT-VQQRLETKMQESSEFLKRINVLP-VTELEGEKLGLSVSGPIASRTDTTKAARQPIDPTAL 93 (337) T ss_pred HHHhcChhhhcceeeecHH-HHHHHHHHHHHHHHhhccCceec-cccceeeEEeeccCcceeeeecCCCCcccccccccc Confidence 22 23322222 25553 66667777777776655554422 1123344443322221111111122222222 333 Q ss_pred ccceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHH--Hhcc---------------------- Q lcl|Aclame:pro 75 GTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLE--ALKG---------------------- 130 (274) Q Consensus 75 ~~~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~--~~~~---------------------- 130 (274) +-....+.-.....++++...+.=-..+||...+++.+.+.++. |...|+ +..- T Consensus 94 ~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~AL--D~i~IGfnG~s~A~~Td~~~nPllqDVNkGWlQ~ 171 (337) T protein:vir:10 94 DSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGAL--DRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQ 171 (337) T ss_pred CCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhh--chhhhcccceeeccCCChhhCcCccccchhHHHH Confidence 44444454445556666666665556678988888888888874 443331 1110 Q ss_pred ----cc---------cc-----cc-CcccCHHH-HHHHHHH-HhhcC--CCccEEEEcHHHHHHHHhhhccccccccccc Q lcl|Aclame:pro 131 ----AT---------LT-----VE-ADITKLDG-LQTAIDK-FNDED--LEPMVLFVNPLDAGGLRTSASDNFTRPTQLG 187 (274) Q Consensus 131 ----a~---------~~-----~~-~~~~~~d~-iv~a~~~-l~~~~--~~~~~~v~~p~~~~~L~~~~~~~~~~~~~~~ 187 (274) ++ .. +. ++--+.|. +.|+... +.+.. 
...-+++|..+..+.= ........+.- T Consensus 172 ~Re~ap~rV~~~~~~~~~~i~iG~~gdy~nLDalV~D~~~~lI~~~~~~d~~LVvivG~dLladk----~~~l~n~~~~p 247 (337) T protein:vir:10 172 YRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDK----YFPIVNATQAP 247 (337) T ss_pred HHhcchhhhhccccccCcceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHH----hhHHhccCCCc Confidence 00 00 00 11224555 3466654 34433 2345678887765421 11111111111 Q ss_pred cccccccc---cchhcceeeEEcCCCCcceEEEEcCCeEEEEeccCcee-eeccccccCcc-------EEEEEEEEEEE- Q lcl|Aclame:pro 188 DNIIVKGA---FGEALGAVIVRSNKLNKGEALLAKKGAVKLITKRDFFL-EKDRDASRKST-------ALYSDKHYVAY- 255 (274) Q Consensus 188 ~~~~~~g~---~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~~~~~~~v-e~~r~~~~~~~-------~i~~~~~~~~~- 255 (274) .+.+.... ..++.|+|.+.-+.+|.+..++..-..+-+..+.+..- .....+++... .=++...|++. T Consensus 248 tE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~s~Ne~YvVEd~~~~a 327 (337) T protein:vir:10 248 TERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIENYESSNDAYVVEDFGCGC 327 (337) T ss_pred HHHHHHHHHHHhhhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEEccccccccchhhccceeeeeccccEE Confidence 11122111 24799999999999999999887655544333333211 11111222111 11222222222 Q ss_pred EEcCcceEEEEeCCC Q lcl|Aclame:pro 256 LYDESKVVKITKGAG 270 (274) Q Consensus 256 v~~~~avv~l~~~aa 270 (274) .++ -|+.+.| T Consensus 328 ~ie-----nI~~~~a 337 (337) T protein:vir:10 328 VAE-----NIELAAA 337 (337) T ss_pred EEe-----ceeecCC Confidence 221 1333333 No 257 >protein:vir:98566 Length: 355 # NCBI annotation: gp5 # Family: family:all:201 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958060;genbank:gi:41057357;genbank:GeneID:2744237 Probab=45.51 E-value=0.77 Score=21.21 Aligned_cols=261 Identities=13% Similarity=0.095 Sum_probs=113.5 Q ss_pred CC--ccccc--hh-h-ccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccc---cCCCcccc Q lcl|Aclame:pro 1 MA--QGTTK--VS-N-LIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVI---AEGEKIPV 71 (274) Q Consensus 1 
ma--~~~T~--~~-~-~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~---~eg~~~~~ 71 (274) +| +.++. .+ . .+.|. ....+.+++++++.|-+..++.. .....|..|-+-..+...+-.+. .|...... T Consensus 16 ~A~~ngv~~~~~~~~FsV~P~-v~q~L~~~i~ess~FL~~INvv~-V~e~~Ge~i~lgv~g~iagrtdT~~~~~R~~~~~ 93 (355) T protein:vir:98 16 VAELNNISTDDVSKKFTVEPS-VTQTLMNTVQASSAFLKTINILP-VAEMKGEKIGVGVTGTIASTTDTSGDKERQTADF 93 (355) T ss_pred HHHHhCCChhHccceeecCHH-HHHHHHHHHHHHHHHhhcCceec-cccceeeEeeeccCccccccccCCCCCCcccccc Confidence 22 22221 11 1 24444 55566677777766655554422 12233444443222221111111 12333333 Q ss_pred cccccceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHH--Hhc-------------------- Q lcl|Aclame:pro 72 DQIGTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLE--ALK-------------------- 129 (274) Q Consensus 72 ~~~~~~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~--~~~-------------------- 129 (274) ..+.-....+.-.....++++...+.=-..+||...+++.+.+.++. |...|+ +.. T Consensus 94 ~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~AL--D~i~IGfNG~s~A~~Td~~~nPllqDVNkGW 171 (355) T protein:vir:98 94 TALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQAL--DLIMAGFNGTTRADTSDRTKNTLLQDVAVGW 171 (355) T ss_pred cccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhh--chhhhcccceeeeccCChhhCcCccccchhH Confidence 34444444555545555666655555445578888888888888864 443331 110 Q ss_pred -----c-cc------------c--c-----c-cCcccCHHHHH-HHHHH-HhhcC--CCccEEEEcHHHHHH----HHhh Q lcl|Aclame:pro 130 -----G-AT------------L--T-----V-EADITKLDGLQ-TAIDK-FNDED--LEPMVLFVNPLDAGG----LRTS 175 (274) Q Consensus 130 -----~-a~------------~--~-----~-~~~~~~~d~iv-~a~~~-l~~~~--~~~~~~v~~p~~~~~----L~~~ 175 (274) . ++ . + + .++--+.|.++ |+... +.+.. ...-+++|+.+..+. |.+. 
T Consensus 172 lQ~~Re~ap~~v~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~ 251 (355) T protein:vir:98 172 LQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKLLADKYFPLVNK 251 (355) T ss_pred HHHHHhcchhhhhhhhcccCccccccceeeCCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHhhhHhhc Confidence 0 00 0 0 0 01122345543 56654 34432 234577888775442 2211 Q ss_pred hccccccccccc-cccccccccchhcceeeEEcCCCCcceEEEEcCCeEEEEeccCce-----eeeccc--cccCc-cEE Q lcl|Aclame:pro 176 ASDNFTRPTQLG-DNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGAVKLITKRDFF-----LEKDRD--ASRKS-TAL 246 (274) Q Consensus 176 ~~~~~~~~~~~~-~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~~~~~~~-----ve~~r~--~~~~~-~~i 246 (274) . ..+++.. ...+. ...++.|+|.+.-+.+|.+..++..-..+-+..+.+.. -+.+|+ +.+.+ ..= T Consensus 252 ~----~~ptE~~Aa~~i~--s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~s~Ne~ 325 (355) T protein:vir:98 252 Q----QENSESLAADIII--SQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDENPKKDRVENYESMNID 325 (355) T ss_pred c----CCcHHHHHHHHHH--HhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccccchhhhcce Confidence 1 1111110 01111 12589999999999999999988765554433333321 111221 11111 111 Q ss_pred EEEEEEE-EEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 247 YSDKHYV-AYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 247 ~~~~~~~-~~v~~~~avv~l~~~aa~~~~ 274 (274) ++...|+ +.+++ -+++....+.+-- T Consensus 326 YvVEd~~~~a~ie---nI~~~~~~~~~~~ 351 (355) T protein:vir:98 326 YVVEVYAAGCLLE---NITLGDFTAPAAP 351 (355) T ss_pred eeeeccccEEEee---ceeeeCCCCCccc Confidence 2233333 22332 1333222221111 No 258 >protein:vir:98871 Length: 314 # NCBI annotation: major capsid protein # Family: family:all:3269 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164418;genbank:gi:56694908;genbank:GeneID:3197261 Probab=41.87 E-value=0.91 Score=20.81 Aligned_cols=268 Identities=12% Similarity=0.034 Sum_probs=130.3 Q ss_pred CCccccchhhc--cchHHHHHHHHHHHHHhhhhcccccc-cccccccC-CCEEEEEeecCCC----CcccccCCCcc--- Q lcl|Aclame:pro 1 
MAQGTTKVSNL--IVPEVLAPMMQAELDKKLRFAQFADI-DSTLVGQP-GDTLTFPAFTYSG----DAQVIAEGEKI--- 69 (274) Q Consensus 1 ma~~~T~~~~~--~iPe~~~~~v~~~~~~~~~~~~l~~~-~~~~~~~~-G~~v~ip~~~~~~----~a~~~~eg~~~--- 69 (274) .|..|+--... .-..+|...+...++.+.+++...-- .-.+.+.+ .++.-.-+-.+++ .....+|+..+ T Consensus 17 ~~~~t~N~n~avr~Y~Kqf~glL~~vf~~qa~F~~~FGg~lQalDGV~~N~tafsvKtsD~pVVig~~Y~TdeNvaFGtG 96 (314) T protein:vir:98 17 FASGTANQNKAARSYQKEFRQLLQAVFRSQAYFRDFFGGGIEALDGVQHNDTAFYVKTSDIPVVVGNEYNKDENVGFGEG 96 (314) T ss_pred eeeccccCccceeeecHHHHHHHHHHHhhHhhhhhhcccceeeccCCCccceEEEEeecccceeecCcccCCCCcccccC Confidence 44333322221 22345777777777777776543211 00111111 1221111111111 11112232222 Q ss_pred cccccccceeEEee---hhhhcchhccHHH---HhccCccHHHHHH---HHHHHHHHHHHHHHHHHHhcccc-ccccCcc Q lcl|Aclame:pro 70 PVDQIGTSKREAKV---RKIGKGTELTDEA---VLSGFGDPQGEAV---RQHGLAIANKVDNDVLEALKGAT-LTVEADI 139 (274) Q Consensus 70 ~~~~~~~~~~~~~~---~~~~~~~~is~e~---~~~s~~d~~~~~~---~~~a~~~a~~~d~~~i~~~~~a~-~~~~~~~ 139 (274) +-..--|++..--. ...-..+.++--+ ...-.-|+...+. +..+.++++.+|..+-..+.... .+..... T Consensus 97 Tg~SsRFGprkEi~y~dtdVpY~~~~~iHEGiD~~TVNnd~~aaVAdRL~LQA~Akt~~~n~~~Gk~lS~~As~te~ltd 176 (314) T protein:vir:98 97 TSRSTRFGPRREIIYQDTPVPYTWEWVYHEGIDKHTVNNDFQAAVADRLDLQANAKIKQFNAQHSKFISSIAEKTETLTD 176 (314) T ss_pred CccccccCceeEEEeecccccccccchhhhccccccccCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhh Confidence 11122344332111 1122222222111 1112234555444 44588899989887766664433 3334445 Q ss_pred cCHHHHHHHHHHHhhcCCC-----ccEEEEcHHHHHHHHhhhccccccccccccccccccccchhcceeeEEcC--CCCc Q lcl|Aclame:pro 140 TKLDGLQTAIDKFNDEDLE-----PMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSN--KLNK 212 (274) Q Consensus 140 ~~~d~iv~a~~~l~~~~~~-----~~~~v~~p~~~~~L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~--~~p~ 212 (274) ++.|.+......+...+.. +-+.-+||+.|..|.-.+...-. ..+..+ +-...+..+.|+-+...+ .+.. 
T Consensus 177 ~~~d~V~~LF~~as~~yvn~ev~~~~~AyV~~evYnaiiD~~l~Tsa--K~SsaN-IDengi~~FkGf~i~e~P~~~~q~ 253 (314) T protein:vir:98 177 YSADNVLRLFNELSKYYVNIEAIGTKAAKVSPELYNAIVDHPLTTSA--KSSSAN-IDQNGIVNFKGFAIQEIPESMLQS 253 (314) T ss_pred cchhhHHHHHHHHHhhhhcceeeEEEEEEEchhHHhHhhcccccccc--ccceee-eccCCcceecceEEEecchhhcCC Confidence 6778888888777765543 34567899999998755432211 111112 333346678999887654 4666 Q ss_pred ceEEEEcCCeEEEEeccCcee-eeccccccCccEEEEEEEEEEEEEcCcceEEEEeCC-Ccc Q lcl|Aclame:pro 213 GEALLAKKGAVKLITKRDFFL-EKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGA-GDE 272 (274) Q Consensus 213 ~~~~l~~~~a~~~~~~~~~~v-e~~r~~~~~~~~i~~~~~~~~~v~~~~avv~l~~~a-a~~ 272 (274) +...++....++..-- ++.+ .+-..+++....+.+-.-||-.+++.++...++.++ |.+ T Consensus 254 g~ia~~s~dnig~aft-GIn~aR~IesEdF~GValQgAGK~G~~I~edNk~Ai~k~t~tp~~ 314 (314) T protein:vir:98 254 GDVAYTYITNIGKAFT-GINTSRIIESEDFDGVALQGAGKAGEFILDDNKKAVAKVTSTPEG 314 (314) T ss_pred CcEEEEccccceeecc-cceeeeeeecccccceeeecccccccccccccceeeEEEecCCCC Confidence 7777777666654211 2221 122334567788888888888877554444433332 333 No 259 >protein:vir:270 Length: 341 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536650;genbank:gi:17975128;genbank:GeneID:929084 Probab=40.84 E-value=0.95 Score=20.70 Aligned_cols=261 Identities=13% Similarity=0.105 Sum_probs=110.8 Q ss_pred CC---------------------ccccchhh--ccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCC Q lcl|Aclame:pro 1 MA---------------------QGTTKVSN--LIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYS 57 (274) Q Consensus 1 ma---------------------~~~T~~~~--~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~ 57 (274) |+ +.+...+. .+-|. ....+.+.+.+++.|-+..++... ....|..|-+-.-+.. 
T Consensus 1 m~~~m~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~-v~q~L~~~i~ess~FL~~Invv~V-~e~~Ge~v~lg~~g~i 78 (341) T protein:vir:27 1 MSQILTQSAREYMDNFAQQLAKSYGVSNVAELFNVSPQ-LETKLRAAITESAEFLKMITVTTV-DQIEGQVVDVGVSGLY 78 (341) T ss_pred CcccccHHHHHHHHHHHHHHHHHcCcccccceEeecHH-HHHHHHHHHHhhHHhhhcCccccc-cceeeeEeecccccce Confidence 22 11111111 24554 556777777777766555544321 1223433332211111 Q ss_pred CCcccccCCCcccccccccceeEEeehhhhcchhccHHHHhc---cCccHHHHHHHHHHHHHHHHHHHHHHHHhccccc- Q lcl|Aclame:pro 58 GDAQVIAEGEKIPVDQIGTSKREAKVRKIGKGTELTDEAVLS---GFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL- 133 (274) Q Consensus 58 ~~a~~~~eg~~~~~~~~~~~~~~~~~~~~~~~~~is~e~~~~---s~~d~~~~~~~~~a~~~a~~~d~~~i~~~~~a~~- 133 (274) ++-..-+......+++-....+.......++++...+.-. +.++|...+++.+.+.++...-..-+.+..-+.+ T Consensus 79 --agrtdt~R~~r~~~l~~~~Y~c~qtn~dt~i~y~~lDaWA~~g~~~dF~~r~~~~i~~~~ALD~i~IGfnGts~A~~T 156 (341) T protein:vir:27 79 --TGRKAGGRFTKQVGVGGHKYKLAETDSCAAITWAMLCQWANQGGRDQFMKHLTEFSNQMFALDIMRIGWNGVSAEADT 156 (341) T ss_pred --eeccCCCceecccccCCcceEEEEeeeeeeecHHHHHHHHhcCCChHHHHHHHHHHHHHHhhhhhhhcccceeeccCC Confidence 1111111111111333333444444444555554443322 2478888888888888874333222211110000 Q ss_pred ---------------------------------c-c-cCcccCHHHH-HHHHHHH-hhcCC-C-ccEEEEcHHHHHH--- Q lcl|Aclame:pro 134 ---------------------------------T-V-EADITKLDGL-QTAIDKF-NDEDL-E-PMVLFVNPLDAGG--- 171 (274) Q Consensus 134 ---------------------------------~-~-~~~~~~~d~i-v~a~~~l-~~~~~-~-~~~~v~~p~~~~~--- 171 (274) . + .++--+.|.+ .||...| .+... . .-+++|..+..+. 
T Consensus 157 d~~anPllqDVNkGWlQ~~Re~a~~rVl~~~~~~~g~~gdy~nLDAlV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~ 236 (341) T protein:vir:27 157 DPSANPLGQDVNEGWIAFVKNRKASQVVDVDVYFDETNGDYRTLDAMASDIINNQIHPMFRNDPRLTVFVGSGLIGAAQA 236 (341) T ss_pred ChhhcccccccchhHHHHHHhhcccceeccceeeccCCCccccHHHHHHHHHhcccChHHhcCCCEEEEEchhhhhhhhh Confidence 0 0 0112235553 4566543 44332 3 3477888775442 Q ss_pred -HHhhhccccccccccccccccccccchhcceeeEEcCCCCcceEEEEcCCeEEEEeccCcee-----eecc--ccccCc Q lcl|Aclame:pro 172 -LRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGAVKLITKRDFFL-----EKDR--DASRKS 243 (274) Q Consensus 172 -L~~~~~~~~~~~~~~~~~~~~~g~~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~~~~~~~v-----e~~r--~~~~~~ 243 (274) |.+.. ..+++.....+. ..++.|+|.+.-+.+|.+..++..-..+-+..+.+..- +.+| .+.+.+ T Consensus 237 ~l~n~~----~~ptE~~Aa~~i---~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~yes 309 (341) T protein:vir:27 237 KLYDKA----DKPSEQIAAQKL---DKTIAGRPAYVPPFLPDNAMVVTIPENLQVLTQHGTAQRKAKHESDRKRSKTHTG 309 (341) T ss_pred hhhccC----CCCHHHHHHHHH---HHhhCCCeEEEccccCCCceEEeeccceEEEEecCcEEEEEEeccccccccchhh Confidence 22110 112221111111 35899999999999999998887655554333333211 1122 122211 Q ss_pred cEEEEEEEEEEEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 244 TALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 244 ~~i~~~~~~~~~v~~~~avv~l~~~aa~~~~ 274 (274) -++..-|||.---+-.-||+..+|-.--- T Consensus 310 --~YvVEdyg~~~~~~~~~vkl~~~~~~~~~ 338 (341) T protein:vir:27 310 --AWKVTQWVCWKRSPLTTQKKSTSALNHRS 338 (341) T ss_pred --hheeehhhhhhhccccccccCcccccccc Confidence 13334445433222223333222211111 No 260 >protein:vir:5694 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839853;genbank:gi:30065708;genbank:GeneID:1260602 Probab=38.14 E-value=1.1 Score=20.40 Aligned_cols=262 Identities=13% Similarity=0.094 Sum_probs=115.0 Q ss_pred CC--ccccch--hh--ccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccc---cCCCcccc Q lcl|Aclame:pro 1 
MA--QGTTKV--SN--LIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVI---AEGEKIPV 71 (274) Q Consensus 1 ma--~~~T~~--~~--~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~---~eg~~~~~ 71 (274) +| +.++.. +. .+-|. ....+.+++.+++.|-+..++.. .....|..|-+-..+...+-.+. .|..+... T Consensus 16 ~A~~ngv~~~d~~~~FsV~P~-v~q~L~~~i~ess~FL~~INvv~-V~e~~Ge~i~lg~~g~iagrtdT~~~~~R~~~~~ 93 (357) T protein:vir:56 16 VAELNGIDAGDVSKKFTVEPS-VTQTLMNTMQESSDFLTRINIVP-VSEMKGEKIGIGVTGSIASTTDTAGGTERQPKDF 93 (357) T ss_pred HHHHhCCChHHhcceeecCHH-HHHHHHHHHHHHHHHhccCCccc-cccceeeEEecccCccccccccCCCCCCcccccc Confidence 33 222211 11 13444 56666677777776666555432 12233444443222222111111 12222233 Q ss_pred cccccceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHH--Hh--------------------- Q lcl|Aclame:pro 72 DQIGTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLE--AL--------------------- 128 (274) Q Consensus 72 ~~~~~~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~--~~--------------------- 128 (274) ..+.-....+.-.....++++...+.=-..+||...+++.+.+.++. |...|+ +. T Consensus 94 ~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~AL--D~i~IGfNGts~A~~Td~~~nPllqDVN~GW 171 (357) T protein:vir:56 94 SKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSL--DFIMAGFNGVKRAETSDRSSNPMLQDVAVGW 171 (357) T ss_pred cccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhh--ccceecccceeeeccCChhhCcCccccchhH Confidence 34444444555555555666665555445578888888888887764 333221 11 Q ss_pred ----cc-cc------------cc-------cc-CcccCHHHHH-HHHHHH-hhcC--CCccEEEEcHHHHHH----HHhh Q lcl|Aclame:pro 129 ----KG-AT------------LT-------VE-ADITKLDGLQ-TAIDKF-NDED--LEPMVLFVNPLDAGG----LRTS 175 (274) Q Consensus 129 ----~~-a~------------~~-------~~-~~~~~~d~iv-~a~~~l-~~~~--~~~~~~v~~p~~~~~----L~~~ 175 (274) .. ++ .. +. ++--+.|.++ |+...| .+.. ...-+++|..+..+. |.+. 
T Consensus 172 lQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~ 251 (357) T protein:vir:56 172 LQKYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVGRQLLADKYFPIVNK 251 (357) T ss_pred HHHHHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhhhhhhHhhc Confidence 00 00 00 00 1123455554 566543 4433 234567888876543 2111 Q ss_pred hccccccccccccccccccc---cchhcceeeEEcCCCCcceEEEEcCCeEEEEeccCce-----eeeccc--cccCc-c Q lcl|Aclame:pro 176 ASDNFTRPTQLGDNIIVKGA---FGEALGAVIVRSNKLNKGEALLAKKGAVKLITKRDFF-----LEKDRD--ASRKS-T 244 (274) Q Consensus 176 ~~~~~~~~~~~~~~~~~~g~---~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~~~~~~~-----ve~~r~--~~~~~-~ 244 (274) . ...++ .+.... ..++-|+|.+.-+.+|.+..++..-..+-+..+.+.. -+.+|+ +.+.+ . T Consensus 252 ~----~~pTE----~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p~r~riE~y~s~N 323 (357) T protein:vir:56 252 E----QDNSE----MLAADVIISQKRIGNLPAVRVPYFPADAMLITKLENLSIYYMDDSHRRVIEENPKLDRVENYESMN 323 (357) T ss_pred c----CChHH----HHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccccchhhhc Confidence 0 01111 122211 2479999999999999999988765554433322211 111221 11111 1 Q ss_pred EEEEEEEEE-EEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 245 ALYSDKHYV-AYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 245 ~i~~~~~~~-~~v~~~~avv~l~~~aa~~~~ 274 (274) .=++...|+ +.+++.-.+.....++..+-. 
T Consensus 324 e~YvVEd~~~~a~iE~i~i~~~~~~~~~~~~ 354 (357) T protein:vir:56 324 IDYVVEDYAAGCLVEKIKVGDFSTPAKATEE 354 (357) T ss_pred ceeeeeccccEEEeeeeeeccCCCCcccCCC Confidence 112223333 233332222222222222222 No 261 >protein:vir:2016 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046760;genbank:gi:9630331;genbank:GeneID:1261541 Probab=30.90 E-value=1.5 Score=19.56 Aligned_cols=262 Identities=14% Similarity=0.096 Sum_probs=114.4 Q ss_pred CC--ccccch--hh--ccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccc---cCCCcccc Q lcl|Aclame:pro 1 MA--QGTTKV--SN--LIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVI---AEGEKIPV 71 (274) Q Consensus 1 ma--~~~T~~--~~--~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~---~eg~~~~~ 71 (274) +| +.++.. +. .+-|. ....+.+++.+++.|-+..++.. .....|..|-+-..+...+-.+. .|..+... T Consensus 16 ~A~~ngv~~~d~~~~FsV~P~-v~q~L~~~i~ess~FL~~INvv~-V~e~~Ge~i~lg~~g~iagrtdT~~~~~R~~~~~ 93 (357) T protein:vir:20 16 VAELNGIDAGDVSKKFTVEPS-VTQTLMNTMQESSDFLTRINIVP-VSEMKGEKIGIGVTGSIASTTDTAGGTERQPKDF 93 (357) T ss_pred HHHHhCCChHHhcceeecCHH-HHHHHHHHHHHHHHHhccCCccc-cccceeeEEecccCccccccccCCCCCCcccccc Confidence 33 222211 11 13444 56666677777776666555432 12233444443222221111111 12222233 Q ss_pred cccccceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHH--Hhc-------------------- Q lcl|Aclame:pro 72 DQIGTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLE--ALK-------------------- 129 (274) Q Consensus 72 ~~~~~~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~--~~~-------------------- 129 (274) ..+.-....+.-.....++++...+.=-..+||...+++.+.+.++. |...|+ +.. 
T Consensus 94 ~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~AL--D~i~IGfNGts~A~~Td~~~nPllqDVN~GW 171 (357) T protein:vir:20 94 SKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRIRNAIIKRQSL--DFIMAGFNGVKRAETSDRSSNPMLQDVAVGW 171 (357) T ss_pred cccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhh--ccceecccceeeeccCChhhCcCccccchhH Confidence 34444444555555555666665555445578888888888887764 333221 110 Q ss_pred -----c-cc------------cc-------cc-CcccCHHHHH-HHHHHH-hhcC--CCccEEEEcHHHHHH----HHhh Q lcl|Aclame:pro 130 -----G-AT------------LT-------VE-ADITKLDGLQ-TAIDKF-NDED--LEPMVLFVNPLDAGG----LRTS 175 (274) Q Consensus 130 -----~-a~------------~~-------~~-~~~~~~d~iv-~a~~~l-~~~~--~~~~~~v~~p~~~~~----L~~~ 175 (274) . ++ .. +. ++--+.|.++ |+...| .+.. ...-+++|..+..+. |.+. T Consensus 172 lQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~ 251 (357) T protein:vir:20 172 LQKYRNEAPARVMSKVTDEEGRTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVGRQLLADKYFPIVNK 251 (357) T ss_pred HHHHHhhchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhhhhhhHhhc Confidence 0 00 00 00 1122455554 566543 4433 234567888876543 2111 Q ss_pred hccccccccccccccccccc---cchhcceeeEEcCCCCcceEEEEcCCeEEEEeccCce-----eeeccc--cccCc-c Q lcl|Aclame:pro 176 ASDNFTRPTQLGDNIIVKGA---FGEALGAVIVRSNKLNKGEALLAKKGAVKLITKRDFF-----LEKDRD--ASRKS-T 244 (274) Q Consensus 176 ~~~~~~~~~~~~~~~~~~g~---~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~~~~~~~-----ve~~r~--~~~~~-~ 244 (274) . ..+++ .+.... ..++-|+|.+.-+.+|.+..++..-..+-+..+.+.. -+.+|+ +.+.+ . 
T Consensus 252 ~----~~ptE----~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p~r~riE~y~s~N 323 (357) T protein:vir:20 252 E----QDNSE----MLAADVIISQKRIGNLPAVRVPYFPADAMLITKLENLSIYYMDDSHRRVIEENPKLDRVENYESMN 323 (357) T ss_pred c----CChHH----HHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccccchhhhc Confidence 0 01111 122211 2479999999999999999988765554433322211 111221 11111 1 Q ss_pred EEEEEEEEE-EEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 245 ALYSDKHYV-AYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 245 ~i~~~~~~~-~~v~~~~avv~l~~~aa~~~~ 274 (274) .=++...|+ +.+++.-.+......++.+.. T Consensus 324 e~YvVEd~~~~a~iE~i~~~~~~~p~~~~~~ 354 (357) T protein:vir:20 324 IDYVVEDYAAGCLVEKIKVGDFSTPAKATAE 354 (357) T ss_pred ceeeeeccccEEEeeeeeeccccCCccCCCC Confidence 112223333 223332222222222222222 No 262 >protein:vir:6061 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878202;genbank:gi:33438901;genbank:GeneID:1457736 Probab=29.57 E-value=1.6 Score=19.39 Aligned_cols=262 Identities=13% Similarity=0.095 Sum_probs=116.0 Q ss_pred CC--ccccch--hh--ccchHHHHHHHHHHHHHhhhhcccccccccccccCCCEEEEEeecCCCCcccc---cCCCcccc Q lcl|Aclame:pro 1 MA--QGTTKV--SN--LIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVI---AEGEKIPV 71 (274) Q Consensus 1 ma--~~~T~~--~~--~~iPe~~~~~v~~~~~~~~~~~~l~~~~~~~~~~~G~~v~ip~~~~~~~a~~~---~eg~~~~~ 71 (274) +| +.++.. +. .+-|. ....+.+++.+++.|-+..++.. .....|..|-+-..+...+-.+. .|...... 
T Consensus 16 ~A~~ngv~~~d~~~~FsV~P~-v~q~L~~~i~ess~FL~~INvv~-V~e~~Ge~i~lg~~g~iagrtdT~~~~~R~~~~~ 93 (357) T protein:vir:60 16 VAELNGIDAGDVSKKFTVEPS-VTQTLMNTMQESSDFLTRINIVP-VSEMKGEKIGIGVTGSIASTTDTAGGTERQPKDF 93 (357) T ss_pred HHHHhCCChHHhcceeecCHH-HHHHHHHHHHHHHHHhccCCccc-cccceeeEEecccCcccccccccCCCCCcccccc Confidence 33 222211 11 13444 56666677777776666555432 12233444443222221111111 12222233 Q ss_pred cccccceeEEeehhhhcchhccHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHH--Hh--------------------- Q lcl|Aclame:pro 72 DQIGTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLE--AL--------------------- 128 (274) Q Consensus 72 ~~~~~~~~~~~~~~~~~~~~is~e~~~~s~~d~~~~~~~~~a~~~a~~~d~~~i~--~~--------------------- 128 (274) ..+.-....+.-.....++++...+.=-..+||...+++.+.+.++. |...|+ +. T Consensus 94 ~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~AL--D~i~IGfNGts~A~~Td~~~nPllqDVN~GW 171 (357) T protein:vir:60 94 SKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSL--DLIMAGFNGVRRAETSDRSSNQMLQDVAVGW 171 (357) T ss_pred cccCCCccEEEEeeeeccccHHHHHHHhcChhHHHHHHHHHHHHHhh--ccceecccceeeeccCChhhCcCccccchhH Confidence 34444444555545555666655555444578888888888887764 332221 11 Q ss_pred ----cc-cc------------cc-------cc-CcccCHHHHH-HHHHHH-hhcC--CCccEEEEcHHHHHH----HHhh Q lcl|Aclame:pro 129 ----KG-AT------------LT-------VE-ADITKLDGLQ-TAIDKF-NDED--LEPMVLFVNPLDAGG----LRTS 175 (274) Q Consensus 129 ----~~-a~------------~~-------~~-~~~~~~d~iv-~a~~~l-~~~~--~~~~~~v~~p~~~~~----L~~~ 175 (274) .. ++ .. +. ++--+.|.++ |+...| .+.. ...-+++|..+..+. |.+. 
T Consensus 172 lQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~ 251 (357) T protein:vir:60 172 LQKYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVGRQLLADKYFPIVNR 251 (357) T ss_pred HHHHHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhhHhhc Confidence 00 00 00 00 1123455554 566543 4433 234567888876543 2211 Q ss_pred hccccccccccccccccccc---cchhcceeeEEcCCCCcceEEEEcCCeEEEEeccCce-----eeeccc--cccCc-c Q lcl|Aclame:pro 176 ASDNFTRPTQLGDNIIVKGA---FGEALGAVIVRSNKLNKGEALLAKKGAVKLITKRDFF-----LEKDRD--ASRKS-T 244 (274) Q Consensus 176 ~~~~~~~~~~~~~~~~~~g~---~~~i~G~~Vv~s~~~p~~~~~l~~~~a~~~~~~~~~~-----ve~~r~--~~~~~-~ 244 (274) . ..++ +.+.... ..++-|+|.+.-+.+|.+..++..-..+-+..+.+.. -+.+|+ +.+.+ . T Consensus 252 ~----~~pT----E~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p~r~riE~y~s~N 323 (357) T protein:vir:60 252 E----QDNS----EMLAADVIISQKRIGNLPAVRVPYFPADAMLITKLENLSIYYMDDSHRRVIEENPKLDRVENYESMN 323 (357) T ss_pred C----CChH----HHHHHHHHHHhhhhcCcceEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccccchhhhc Confidence 0 0111 1122111 2479999999999999999988765554433322211 111221 11111 1 Q ss_pred EEEEEEEEE-EEEEcCcceEEEEeCCCcccC Q lcl|Aclame:pro 245 ALYSDKHYV-AYLYDESKVVKITKGAGDEVM 274 (274) Q Consensus 245 ~i~~~~~~~-~~v~~~~avv~l~~~aa~~~~ 274 (274) .=++...|+ +.+++.-.+......++.+.. T Consensus 324 e~YvVEd~~~~a~iE~i~~~~~~~pa~~~~~ 354 (357) T protein:vir:60 324 IDYVVEDYAAGCLVEKIKVGDFSTPAKATAE 354 (357) T ss_pred ceeeeeccccEEEeeeeeeccCcccccCCCC Confidence 112223333 333333223322333333333 Done!