Query lcl|Aclame:protein:vir:7990|NCBI_annot:gp6|genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Match_columns 273 No_of_seqs 123 out of 380 Neff 9.3 Searched_HMMs 1612 Date Sat Nov 30 13:03:32 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_6 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_6_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:105822 Length: 273 100.0 2.7E-71 1.7E-74 407.5 29.3 273 1-273 1-273 (273) 2 protein:vir:102605 Length: 273 100.0 2.7E-71 1.7E-74 407.5 29.3 273 1-273 1-273 (273) 3 protein:vir:7990 Length: 273 # 100.0 5.3E-71 3.3E-74 405.9 29.1 273 1-273 1-273 (273) 4 protein:vir:94622 Length: 341 100.0 3.8E-55 2.4E-58 318.9 21.7 269 1-273 1-339 (341) 5 protein:vir:78739 Length: 332 100.0 3.5E-51 2.2E-54 297.2 18.8 266 1-271 7-332 (332) 6 protein:vir:3364 Length: 347 # 100.0 2.7E-50 1.7E-53 292.3 19.4 266 1-273 1-345 (347) 7 protein:vir:80930 Length: 278 100.0 9.1E-49 5.6E-52 284.0 24.5 267 1-273 1-277 (278) 8 protein:vir:1541 Length: 347 # 100.0 6.4E-49 3.9E-52 284.8 21.6 268 1-273 1-345 (347) 9 protein:vir:96262 Length: 274 100.0 3.6E-48 2.2E-51 280.7 24.6 261 1-273 1-271 (274) 10 protein:vir:95898 Length: 274 100.0 3.6E-48 2.2E-51 280.7 24.6 261 1-273 1-271 (274) 11 protein:vir:1239 Length: 274 # 100.0 5.2E-48 3.2E-51 279.8 24.7 261 1-273 1-271 (274) 12 protein:vir:94711 Length: 347 100.0 1.4E-49 8.6E-53 288.4 16.1 267 1-273 1-346 (347) 13 protein:vir:10450 Length: 344 100.0 5E-49 3.1E-52 285.4 18.0 265 1-271 1-344 (344) 14 protein:vir:96123 Length: 274 100.0 1.6E-47 1E-50 277.1 25.6 261 1-273 1-270 (274) 15 protein:vir:97433 Length: 274 100.0 1.8E-47 1.1E-50 276.8 25.5 261 1-273 1-271 (274) 16 protein:vir:94494 Length: 274 100.0 1.8E-47 1.1E-50 276.8 25.5 261 1-273 1-271 (274) 17 protein:vir:3613 Length: 272 # 100.0 1.4E-47 8.5E-51 277.5 23.7 263 1-273 1-272 (272) 18 protein:vir:3136 Length: 322 # 100.0 5.9E-49 3.7E-52 285.0 15.9 264 1-273 1-318 (322) 19 protein:vir:96833 Length: 275 100.0 3.1E-47 1.9E-50 275.6 23.7 261 1-273 3-271 (275) 20 protein:vir:93742 Length: 274 100.0 9.7E-47 6E-50 272.8 25.6 261 1-273 1-271 (274) 21 protein:vir:99075 Length: 392 100.0 3.9E-47 2.4E-50 275.0 22.5 268 1-273 1-307 (392) 22 protein:vir:8885 Length: 347 # 100.0 1.1E-47 6.5E-51 278.1 18.8 267 1-273 1-346 (347) 23 protein:vir:2201 Length: 345 # 100.0 2.1E-47 1.3E-50 276.5 19.0 267 1-273 1-345 (345) 24 protein:vir:108303 Length: 418 100.0 3.9E-46 2.4E-49 269.5 25.8 265 1-273 1-417 (418) 25 protein:vir:100057 Length: 375 100.0 1.9E-46 1.1E-49 271.3 22.4 268 1-273 1-370 (375) 26 protein:vir:94576 Length: 347 100.0 9.3E-47 5.8E-50 272.9 18.3 267 1-273 1-347 (347) 27 protein:vir:105374 Length: 423 100.0 3.2E-45 2E-48 264.5 25.4 270 1-272 1-423 (423) 28 protein:vir:97331 Length: 319 100.0 8.9E-45 5.5E-48 262.1 26.1 265 1-273 25-294 (319) 29 protein:vir:94800 Length: 319 100.0 8.9E-45 5.5E-48 262.1 26.1 265 1-273 25-294 (319) 30 protein:vir:80213 Length: 334 100.0 1E-45 6.2E-49 267.3 19.8 268 1-273 1-332 (334) 31 protein:vir:105334 Length: 276 100.0 6.5E-45 4E-48 262.8 23.4 261 1-273 1-271 (276) 32 protein:vir:103323 Length: 364 100.0 7.5E-45 4.7E-48 262.5 23.1 268 1-273 1-339 (364) 33 protein:vir:107120 Length: 329 100.0 3E-44 1.9E-47 259.2 26.3 265 1-273 36-305 (329) 34 protein:vir:80180 Length: 381 100.0 7.9E-44 4.9E-47 256.9 23.5 267 1-273 15-305 (381) 35 protein:vir:3525 Length: 423 # 100.0 4.4E-44 2.7E-47 258.3 21.1 268 1-273 1-303 (423) 36 protein:vir:174 Length: 423 # 100.0 1.6E-43 9.7E-47 255.3 21.8 268 1-273 1-302 (423) 37 protein:vir:78935 Length: 335 100.0 1.1E-43 6.7E-47 256.2 20.5 268 1-273 1-328 (335) 38 protein:vir:6324 Length: 335 # 100.0 1.7E-43 1E-46 255.1 20.6 268 1-273 1-328 (335) 39 protein:vir:105522 Length: 423 100.0 1E-42 6.2E-46 250.9 23.2 270 1-273 1-302 (423) 40 protein:vir:99675 Length: 324 100.0 1.2E-43 7.4E-47 255.9 17.8 238 28-273 1-296 (324) 41 protein:vir:97031 Length: 402 100.0 4.8E-42 3E-45 247.1 18.1 268 1-273 1-335 (402) 42 protein:vir:9820 Length: 272 # 100.0 1.3E-40 7.9E-44 239.3 25.3 260 1-273 1-269 (272) 43 protein:vir:3033 Length: 272 # 100.0 1.3E-40 7.9E-44 239.3 25.3 260 1-273 1-269 (272) 44 protein:vir:79008 Length: 299 100.0 1E-39 6.4E-43 234.3 27.7 271 1-273 1-299 (299) 45 protein:vir:95107 Length: 270 100.0 6.2E-40 3.9E-43 235.5 21.8 259 1-273 1-267 (270) 46 protein:vir:78920 Length: 290 100.0 3.8E-38 2.4E-41 225.7 25.7 266 1-273 1-290 (290) 47 protein:vir:7019 Length: 401 # 100.0 2.2E-38 1.4E-41 227.0 18.0 267 1-273 1-333 (401) 48 protein:vir:105645 Length: 400 100.0 6.9E-38 4.3E-41 224.3 18.6 268 1-273 1-333 (400) 49 protein:vir:739 Length: 231 # 100.0 3.6E-37 2.2E-40 220.4 20.3 228 34-273 1-231 (231) 50 protein:vir:102655 Length: 322 100.0 8.2E-37 5.1E-40 218.4 20.7 269 1-273 13-321 (322) 51 protein:vir:102335 Length: 312 100.0 1.7E-35 1.1E-38 211.1 25.2 270 1-273 1-310 (312) 52 protein:vir:105464 Length: 346 100.0 6.1E-35 3.8E-38 208.2 25.3 269 1-273 1-297 (346) 53 protein:vir:79712 Length: 285 100.0 1.5E-32 9.1E-36 195.1 23.6 268 1-273 1-283 (285) 54 protein:vir:99523 Length: 311 100.0 3.1E-32 1.9E-35 193.3 23.0 267 1-273 8-310 (311) 55 protein:vir:78090 Length: 302 100.0 3.3E-31 2.1E-34 187.7 23.7 268 1-273 1-300 (302) 56 protein:vir:100939 Length: 430 99.9 1.9E-28 1.2E-31 172.5 20.4 269 1-273 1-429 (430) 57 protein:vir:9265 Length: 430 # 99.9 1.9E-28 1.2E-31 172.5 20.4 269 1-273 1-429 (430) 58 protein:vir:2106 Length: 430 # 99.9 1.8E-27 1.1E-30 167.2 20.2 268 1-273 1-429 (430) 59 protein:vir:1781 Length: 221 # 99.9 3E-27 1.9E-30 166.0 13.9 178 77-265 1-221 (221) 60 protein:vir:5974 Length: 324 # 99.9 1.6E-23 9.8E-27 145.6 21.2 261 1-273 1-296 (324) 61 protein:vir:102944 Length: 330 99.8 1.1E-22 6.7E-26 141.0 21.2 260 1-273 1-298 (330) 62 protein:vir:95451 Length: 313 99.8 3.4E-24 2.1E-27 149.3 12.7 269 1-273 1-311 (313) 63 protein:vir:1583 Length: 351 # 99.8 5.5E-22 3.4E-25 137.1 20.8 262 1-273 1-296 (351) 64 protein:vir:94142 Length: 304 99.6 8.6E-16 5.3E-19 103.2 22.8 257 1-272 1-304 (304) 65 protein:vir:105905 Length: 304 99.6 8.6E-16 5.3E-19 103.2 22.8 257 1-272 1-304 (304) 66 protein:vir:41 Length: 299 # N 99.6 1.2E-15 7.4E-19 102.4 21.6 259 1-273 6-298 (299) 67 protein:vir:96223 Length: 324 99.6 1.3E-15 7.9E-19 102.3 21.8 258 1-273 30-315 (324) 68 protein:vir:9309 Length: 324 # 99.6 1.6E-15 9.8E-19 101.7 21.9 258 1-273 30-315 (324) 69 protein:vir:97148 Length: 324 99.6 2E-15 1.2E-18 101.2 22.0 258 1-273 31-315 (324) 70 protein:vir:99749 Length: 324 99.6 2.9E-15 1.8E-18 100.3 21.9 258 1-273 30-315 (324) 71 protein:vir:96392 Length: 324 99.5 3.3E-15 2E-18 100.0 22.2 258 1-273 30-315 (324) 72 protein:vir:78830 Length: 324 99.5 3.3E-15 2E-18 100.0 22.2 258 1-273 30-315 (324) 73 protein:vir:9759 Length: 303 # 99.5 2.4E-15 1.5E-18 100.8 21.3 264 1-273 1-303 (303) 74 protein:vir:94771 Length: 298 99.5 5.9E-15 3.7E-18 98.6 22.4 262 1-272 1-298 (298) 75 protein:vir:1638 Length: 298 # 99.5 7.5E-15 4.6E-18 98.0 22.4 262 1-272 1-298 (298) 76 protein:vir:103955 Length: 324 99.5 6.7E-15 4.2E-18 98.3 21.8 258 1-273 30-315 (324) 77 protein:vir:78523 Length: 338 99.5 1.3E-14 8.1E-18 96.7 23.1 266 1-273 1-335 (338) 78 protein:vir:80684 Length: 315 99.5 6.2E-15 3.8E-18 98.5 21.1 265 1-273 1-306 (315) 79 protein:vir:104085 Length: 320 99.5 9.6E-15 5.9E-18 97.4 22.1 262 1-273 14-317 (320) 80 protein:vir:9410 Length: 415 # 99.5 1.1E-14 6.9E-18 97.1 22.2 264 1-273 127-404 (415) 81 protein:vir:7771 Length: 330 # 99.5 1.5E-14 9.5E-18 96.3 22.8 266 1-273 1-325 (330) 82 protein:vir:8187 Length: 311 # 99.5 1.3E-14 8E-18 96.7 22.3 263 1-273 1-310 (311) 83 protein:vir:4339 Length: 395 # 99.5 1.9E-14 1.2E-17 95.8 23.2 258 1-273 117-395 (395) 84 protein:vir:79987 Length: 415 99.5 1.5E-14 9.1E-18 96.4 22.5 264 1-273 120-404 (415) 85 protein:vir:81100 Length: 415 99.5 1.5E-14 9.1E-18 96.4 22.5 264 1-273 120-404 (415) 86 protein:vir:98339 Length: 415 99.5 1.5E-14 9.1E-18 96.4 22.5 264 1-273 120-404 (415) 87 protein:vir:9574 Length: 300 # 99.5 1.2E-14 7.2E-18 97.0 21.5 263 1-273 1-300 (300) 88 protein:vir:191 Length: 385 # 99.5 1.9E-14 1.2E-17 95.8 22.5 258 1-273 105-384 (385) 89 protein:vir:1886 Length: 385 # 99.5 1.9E-14 1.2E-17 95.8 22.5 258 1-273 105-384 (385) 90 protein:vir:2430 Length: 318 # 99.5 1.7E-14 1.1E-17 96.1 21.9 265 1-273 14-313 (318) 91 protein:vir:95763 Length: 297 99.5 2.1E-14 1.3E-17 95.6 22.2 258 1-273 9-296 (297) 92 protein:vir:78223 Length: 333 99.5 2.5E-14 1.6E-17 95.2 22.5 266 1-273 20-332 (333) 93 protein:vir:94673 Length: 419 99.5 3.3E-14 2.1E-17 94.5 22.9 260 1-273 130-417 (419) 94 protein:vir:4600 Length: 415 # 99.5 3.5E-14 2.2E-17 94.3 22.4 262 1-273 120-404 (415) 95 protein:vir:4700 Length: 415 # 99.5 3.5E-14 2.2E-17 94.3 22.4 262 1-273 120-404 (415) 96 protein:vir:4226 Length: 326 # 99.5 3.3E-14 2.1E-17 94.5 21.8 265 1-273 22-323 (326) 97 protein:vir:2344 Length: 397 # 99.5 3.6E-14 2.2E-17 94.3 21.8 264 1-273 10-306 (397) 98 protein:vir:6242 Length: 390 # 99.5 1.8E-14 1.1E-17 96.0 20.0 261 1-273 116-389 (390) 99 protein:vir:97053 Length: 390 99.5 5.2E-14 3.2E-17 93.4 22.1 256 1-271 113-390 (390) 100 protein:vir:100135 Length: 418 99.5 5.5E-14 3.4E-17 93.3 22.2 258 1-273 136-415 (418) 101 protein:vir:108211 Length: 318 99.4 1.6E-14 9.8E-18 96.2 18.0 261 1-273 22-317 (318) 102 protein:vir:104256 Length: 458 99.4 8.4E-14 5.2E-17 92.3 21.9 265 1-273 165-458 (458) 103 protein:vir:1328 Length: 392 # 99.4 7.4E-14 4.6E-17 92.6 20.9 261 1-273 114-391 (392) 104 protein:vir:485 Length: 407 # 99.4 9.6E-14 6E-17 91.9 21.1 262 1-273 106-400 (407) 105 protein:vir:93616 Length: 645 99.4 8.8E-14 5.4E-17 92.2 20.8 266 1-273 344-641 (645) 106 protein:vir:81070 Length: 390 99.4 1.9E-13 1.1E-16 90.4 22.1 256 1-271 113-390 (390) 107 protein:vir:4511 Length: 409 # 99.4 7.5E-14 4.6E-17 92.5 19.8 266 1-273 117-406 (409) 108 protein:vir:9927 Length: 295 # 99.4 1.1E-14 6.9E-18 97.1 14.8 256 1-273 1-290 (295) 109 protein:vir:4456 Length: 401 # 99.4 1.1E-13 6.6E-17 91.7 20.1 262 1-273 107-401 (401) 110 protein:vir:99920 Length: 311 99.4 1.4E-13 9E-17 91.0 20.8 264 1-272 1-311 (311) 111 protein:vir:10364 Length: 390 99.4 3.7E-13 2.3E-16 88.7 22.9 255 1-271 114-390 (390) 112 protein:vir:100247 Length: 425 99.4 1.3E-13 7.9E-17 91.3 20.4 263 1-273 130-424 (425) 113 protein:vir:101607 Length: 379 99.4 6E-13 3.7E-16 87.6 22.7 258 1-273 109-379 (379) 114 protein:vir:80376 Length: 435 99.4 7.2E-13 4.5E-16 87.1 23.0 262 1-273 130-433 (435) 115 protein:vir:80446 Length: 367 99.4 1.8E-13 1.1E-16 90.4 18.9 261 1-273 1-320 (367) 116 protein:vir:1433 Length: 435 # 99.4 6.8E-13 4.2E-16 87.3 21.9 262 1-273 130-433 (435) 117 protein:vir:5739 Length: 366 # 99.4 1.1E-12 7.1E-16 86.1 23.1 262 1-273 64-366 (366) 118 protein:vir:8102 Length: 543 # 99.4 5.4E-13 3.3E-16 87.8 21.1 260 1-273 249-542 (543) 119 protein:vir:4856 Length: 293 # 99.4 1.1E-12 6.8E-16 86.1 22.7 258 1-273 5-281 (293) 120 protein:vir:100172 Length: 394 99.3 9.5E-13 5.9E-16 86.5 22.1 259 1-273 111-384 (394) 121 protein:vir:95376 Length: 425 99.3 5.3E-13 3.3E-16 87.9 20.7 260 1-273 138-421 (425) 122 protein:vir:4830 Length: 397 # 99.3 1.4E-12 8.4E-16 85.6 22.3 258 1-273 109-385 (397) 123 protein:vir:2504 Length: 305 # 99.3 9.9E-13 6.2E-16 86.4 21.6 256 1-273 1-298 (305) 124 protein:vir:3870 Length: 400 # 99.3 6.4E-13 4E-16 87.4 20.3 253 1-273 140-399 (400) 125 protein:vir:4953 Length: 397 # 99.3 3.2E-12 2E-15 83.6 22.6 258 1-273 109-385 (397) 126 protein:vir:106647 Length: 303 99.3 7.8E-14 4.8E-17 92.4 13.1 252 1-273 1-296 (303) 127 protein:vir:9875 Length: 296 # 99.3 1.8E-13 1.1E-16 90.5 15.0 246 1-273 22-295 (296) 128 protein:vir:4997 Length: 397 # 99.3 4.5E-12 2.8E-15 82.8 22.5 258 1-273 109-385 (397) 129 protein:vir:3991 Length: 404 # 99.3 4.7E-12 2.9E-15 82.7 22.4 258 1-273 116-393 (404) 130 protein:vir:81160 Length: 371 99.3 5.8E-12 3.6E-15 82.2 22.7 257 1-273 91-371 (371) 131 protein:vir:1025 Length: 408 # 99.3 5.5E-12 3.4E-15 82.3 22.0 258 1-273 121-393 (408) 132 protein:vir:1383 Length: 421 # 99.3 3.7E-12 2.3E-15 83.3 20.9 255 1-273 116-385 (421) 133 protein:vir:9704 Length: 394 # 99.3 4.1E-12 2.5E-15 83.0 21.0 252 1-273 133-390 (394) 134 protein:vir:96762 Length: 632 99.3 1.4E-12 8.7E-16 85.6 18.4 257 1-272 357-632 (632) 135 protein:vir:105038 Length: 428 99.3 1.1E-11 6.7E-15 80.7 23.1 262 1-273 125-428 (428) 136 protein:vir:6212 Length: 434 # 99.3 2E-12 1.3E-15 84.7 18.8 264 1-273 141-429 (434) 137 protein:vir:81227 Length: 413 99.2 2.3E-11 1.5E-14 78.9 23.5 262 1-273 118-410 (413) 138 protein:vir:78640 Length: 352 99.2 2.3E-12 1.4E-15 84.4 17.6 251 1-273 83-346 (352) 139 protein:vir:102119 Length: 404 99.2 1.2E-11 7.7E-15 80.4 21.3 265 1-273 110-400 (404) 140 protein:vir:7409 Length: 408 # 99.2 2.1E-11 1.3E-14 79.2 22.4 258 1-273 116-393 (408) 141 protein:vir:4197 Length: 314 # 99.2 3.2E-11 2E-14 78.1 23.0 265 1-273 19-313 (314) 142 protein:vir:100884 Length: 389 99.2 2.1E-11 1.3E-14 79.1 21.9 258 1-273 109-382 (389) 143 protein:vir:4092 Length: 390 # 99.2 3.6E-11 2.2E-14 77.8 22.3 258 1-273 84-368 (390) 144 protein:vir:3845 Length: 395 # 99.2 4.8E-11 3E-14 77.2 22.2 258 1-273 105-383 (395) 145 protein:vir:94424 Length: 387 99.1 9.8E-12 6.1E-15 80.9 15.7 250 1-273 118-381 (387) 146 protein:vir:96978 Length: 387 99.1 9.8E-12 6.1E-15 80.9 15.7 250 1-273 118-381 (387) 147 protein:vir:2685 Length: 387 # 99.1 9.8E-12 6.1E-15 80.9 15.7 250 1-273 118-381 (387) 148 protein:vir:1268 Length: 397 # 99.1 8.5E-11 5.2E-14 75.8 20.8 257 1-273 123-397 (397) 149 protein:vir:78387 Length: 349 99.1 1.1E-10 6.5E-14 75.3 21.1 263 1-273 1-317 (349) 150 protein:vir:93881 Length: 387 99.1 3.5E-11 2.2E-14 77.9 17.9 251 1-273 118-381 (387) 151 protein:vir:9361 Length: 402 # 99.1 1.6E-11 1E-14 79.7 16.0 250 1-273 133-396 (402) 152 protein:vir:962 Length: 397 # 99.1 7.3E-11 4.5E-14 76.2 19.2 251 1-273 138-397 (397) 153 protein:vir:8420 Length: 477 # 99.0 6.3E-11 3.9E-14 76.5 18.2 267 1-273 157-471 (477) 154 protein:vir:1084 Length: 437 # 99.0 1.4E-10 8.7E-14 74.6 20.1 256 1-273 156-427 (437) 155 protein:vir:7855 Length: 497 # 99.0 5.4E-10 3.4E-13 71.4 22.8 264 1-273 151-493 (497) 156 protein:vir:101650 Length: 497 99.0 5.4E-10 3.4E-13 71.4 22.8 264 1-273 151-493 (497) 157 protein:vir:94989 Length: 349 99.0 2.7E-10 1.7E-13 73.1 21.0 263 1-273 1-307 (349) 158 protein:vir:102082 Length: 392 99.0 4.6E-10 2.9E-13 71.8 21.0 257 1-273 106-384 (392) 159 protein:vir:107593 Length: 392 99.0 4.6E-10 2.9E-13 71.8 21.0 257 1-273 106-384 (392) 160 protein:vir:105004 Length: 392 99.0 4.6E-10 2.9E-13 71.8 21.0 257 1-273 106-384 (392) 161 protein:vir:102873 Length: 392 99.0 4.6E-10 2.9E-13 71.8 21.0 257 1-273 106-384 (392) 162 protein:vir:80128 Length: 466 98.8 1.7E-09 1E-12 68.7 16.7 259 1-273 154-448 (466) 163 protein:vir:95875 Length: 401 98.8 1.8E-09 1.1E-12 68.5 16.2 269 1-273 19-400 (401) 164 protein:vir:79928 Length: 393 98.7 1.5E-09 9.1E-13 69.0 15.6 267 1-273 74-377 (393) 165 protein:vir:9643 Length: 377 # 98.7 6.5E-09 4E-12 65.5 19.0 253 1-273 84-377 (377) 166 protein:vir:3158 Length: 321 # 98.7 3.5E-08 2.2E-11 61.5 20.9 262 1-273 19-311 (321) 167 protein:vir:101291 Length: 381 98.7 1.1E-08 7E-12 64.2 18.2 253 1-273 76-368 (381) 168 protein:vir:9509 Length: 381 # 98.7 1.1E-08 7E-12 64.2 18.2 253 1-273 76-368 (381) 169 protein:vir:100632 Length: 381 98.6 1.7E-08 1E-11 63.2 18.5 255 1-273 80-370 (381) 170 protein:vir:95963 Length: 395 98.6 2.1E-08 1.3E-11 62.7 18.7 253 1-273 86-376 (395) 171 protein:vir:4159 Length: 315 # 98.6 4.2E-08 2.6E-11 61.0 20.1 264 1-272 19-315 (315) 172 protein:vir:93696 Length: 364 98.6 5.1E-08 3.2E-11 60.5 19.4 269 1-273 1-361 (364) 173 protein:vir:78350 Length: 383 98.5 7.2E-08 4.5E-11 59.7 18.4 252 1-273 83-375 (383) 174 protein:vir:8324 Length: 410 # 98.5 3.1E-08 1.9E-11 61.8 15.4 254 1-271 127-410 (410) 175 protein:vir:95131 Length: 325 98.4 3.6E-07 2.2E-10 55.9 19.8 260 1-273 1-295 (325) 176 protein:vir:98635 Length: 377 98.4 1.8E-07 1.1E-10 57.6 18.0 252 1-273 79-377 (377) 177 protein:vir:2770 Length: 318 # 98.3 4.3E-07 2.6E-10 55.5 19.0 222 1-234 22-318 (318) 178 protein:vir:79548 Length: 652 98.2 1E-06 6.4E-10 53.4 18.1 259 1-270 359-652 (652) 179 protein:vir:104439 Length: 404 98.1 3.3E-06 2.1E-09 50.6 18.8 268 1-273 22-403 (404) 180 protein:vir:819 Length: 404 # 98.1 3.3E-06 2.1E-09 50.6 18.8 268 1-273 22-403 (404) 181 protein:vir:3298 Length: 404 # 98.1 3.3E-06 2.1E-09 50.6 18.8 268 1-273 22-403 (404) 182 protein:vir:10123 Length: 404 98.1 3.3E-06 2.1E-09 50.6 18.8 268 1-273 22-403 (404) 183 protein:vir:96792 Length: 315 98.1 5.1E-06 3.2E-09 49.6 20.2 260 1-273 1-281 (315) 184 protein:vir:105610 Length: 430 97.9 6.2E-06 3.9E-09 49.1 17.5 270 1-273 1-424 (430) 185 protein:vir:95512 Length: 693 97.9 5.9E-06 3.6E-09 49.3 17.1 260 1-273 394-693 (693) 186 protein:vir:3969 Length: 287 # 97.9 4.8E-06 3E-09 49.8 16.5 267 1-273 1-286 (287) 187 protein:vir:98871 Length: 314 97.5 3.8E-05 2.4E-08 44.8 16.0 267 1-273 21-311 (314) 188 protein:vir:97397 Length: 517 97.2 0.00011 6.8E-08 42.3 15.3 254 1-273 237-516 (517) 189 protein:vir:97255 Length: 310 96.9 0.00029 1.8E-07 40.0 20.4 262 1-273 1-310 (310) 190 protein:vir:94528 Length: 286 96.8 0.00032 2E-07 39.8 15.9 260 1-273 1-285 (286) 191 protein:vir:94933 Length: 330 96.8 0.00036 2.2E-07 39.5 18.3 264 1-273 25-329 (330) 192 protein:vir:4074 Length: 480 # 96.0 0.00023 1.4E-07 40.5 9.9 257 1-273 184-477 (480) 193 protein:vir:107687 Length: 319 95.3 0.0023 1.4E-06 35.0 17.3 258 1-271 24-319 (319) 194 protein:vir:99424 Length: 360 95.2 0.0024 1.5E-06 34.9 19.1 261 1-273 1-357 (360) 195 protein:vir:94070 Length: 339 94.3 0.0048 3E-06 33.3 16.4 256 1-271 49-339 (339) 196 protein:vir:80068 Length: 301 94.1 0.0054 3.4E-06 33.0 18.5 259 1-271 1-301 (301) 197 protein:vir:103886 Length: 302 93.5 0.0074 4.6E-06 32.3 19.9 258 1-273 1-302 (302) 198 protein:vir:79078 Length: 307 91.6 0.015 9.2E-06 30.6 13.3 265 1-273 1-307 (307) 199 protein:vir:103285 Length: 296 91.1 0.018 1.1E-05 30.2 18.3 258 1-271 1-296 (296) 200 protein:vir:79642 Length: 329 91.0 0.018 1.1E-05 30.1 19.0 259 1-273 26-328 (329) 201 protein:vir:107882 Length: 307 90.6 0.02 1.3E-05 29.9 14.8 266 1-273 1-307 (307) 202 protein:vir:78148 Length: 123 82.5 0.032 2E-05 28.8 6.8 106 164-273 1-123 (123) 203 protein:vir:104342 Length: 314 80.1 0.098 6.1E-05 26.1 16.5 257 1-271 1-314 (314) 204 protein:vir:15 Length: 472 # N 78.2 0.12 7.2E-05 25.7 10.8 257 1-273 52-362 (472) 205 protein:vir:3643 Length: 336 # 73.3 0.17 0.00011 24.8 13.3 255 1-271 45-336 (336) 206 protein:vir:4786 Length: 295 # 72.1 0.19 0.00012 24.6 15.1 245 1-252 1-295 (295) 207 protein:vir:5942 Length: 523 # 56.1 0.46 0.00029 22.4 14.0 261 1-273 193-521 (523) 208 protein:vir:101557 Length: 336 49.1 0.65 0.0004 21.6 14.7 255 1-271 45-336 (336) 209 protein:vir:99888 Length: 309 48.7 0.66 0.00041 21.6 14.1 262 1-273 1-308 (309) 210 protein:vir:8843 Length: 317 # 47.4 0.7 0.00044 21.4 17.8 256 1-273 1-315 (317) 211 protein:vir:78558 Length: 336 40.4 0.97 0.0006 20.6 14.1 255 1-271 45-336 (336) 212 protein:vir:79399 Length: 455 36.3 1.2 0.00073 20.2 11.7 260 1-273 45-354 (455) 213 protein:vir:107732 Length: 379 35.1 1.3 0.00078 20.1 16.7 256 1-271 74-379 (379) 214 protein:vir:10324 Length: 320 28.8 1.7 0.0011 19.3 17.3 253 3-273 1-317 (320) 215 protein:vir:2736 Length: 348 # 27.3 1.9 0.0012 19.1 18.4 260 1-272 1-348 (348) 216 protein:vir:96079 Length: 382 20.1 2.8 0.0018 18.1 15.1 256 1-271 73-382 (382) No 1 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=100.00 E-value=2.7e-71 Score=407.47 Aligned_cols=273 Identities=99% Similarity=1.354 Sum_probs=262.8 Q ss_pred CcccchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEEEEEEee Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQE 80 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid~~ 80 (273) ||+++++||+|+++++++|++.+++.+++++++++++.+||||+||+++.+++++|+++++++.++++++++++++||++ T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid~~ 80 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQE 80 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccCccccccceEEEEEeee Confidence 99999999999999999999999999999999999999999999999999999999999988999999999999999999 Q ss_pred eeceeEechHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHHHHhhcCCCcCC Q lcl|Aclame:pro 81 KSIDFLVDDIDRVQVAGSLEAYTRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTKANVPNVG 160 (273) Q Consensus 81 ~~~~~~i~d~d~~~~~~~~~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~~~~vp~~~ 160 (273) +++++.|+|+|+.+.+++++++++|++++|++++|++++++++.++..+..+++.++.++++.|.+++..|++++||.++ T Consensus 81 ~~~~~~i~d~d~~~~~~~~~~~~~~~~~alA~~vD~~i~~~~~~a~~~~~~~~~~~~~~~~~~i~~a~~~ld~~~vP~~~ 160 (273) T protein:vir:10 81 KSIDFLVDDIDRVQVAGSLEAYTRAGATALATDTDKFIADMLVDNGTALTGSAPTDADDAFDLIAKALKELTKANVPNVG 160 (273) T ss_pred eecceEeecHHHhhhhccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhHHHHHHHHHHHHhhhcCCCcCC Confidence 99999999999999999988899999999999999999999988888777788888899999999999999999999999 Q ss_pred cEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCceEEEEEecceeeecc Q lcl|Aclame:pro 161 RVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQIDTVEALR 240 (273) Q Consensus 161 r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a~~~~~~~~~ve~~~ 240 (273) |+++|+|++++.|+++++++.+.+..++...+++|.||+++||+|++|+++|.+++..++++|++|+++++|++++|.+| T Consensus 161 R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~~~~~~~A~~~a~q~~~~e~~r 240 (273) T protein:vir:10 161 RVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQIDTVEALR 240 (273) T ss_pred CEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEecccccCCccEEEEEeccceeeeeeeehhhccc Confidence 99999999999999998888888888888899999999999999999999999988889999999999999999999999 Q ss_pred CCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 241 DQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 241 ~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ++++|+|.|+++++||++++||+++++|+++|| T Consensus 241 ~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 241 DQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred CCCcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 999999999999999999999999999999999 No 2 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=100.00 E-value=2.7e-71 Score=407.47 Aligned_cols=273 Identities=99% Similarity=1.354 Sum_probs=262.8 Q ss_pred CcccchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEEEEEEee Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQE 80 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid~~ 80 (273) ||+++++||+|+++++++|++.+++.+++++++++++.+||||+||+++.+++++|+++++++.++++++++++++||++ T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid~~ 80 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQE 80 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccCccccccceEEEEEeee Confidence 99999999999999999999999999999999999999999999999999999999999988999999999999999999 Q ss_pred eeceeEechHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHHHHhhcCCCcCC Q lcl|Aclame:pro 81 KSIDFLVDDIDRVQVAGSLEAYTRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTKANVPNVG 160 (273) Q Consensus 81 ~~~~~~i~d~d~~~~~~~~~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~~~~vp~~~ 160 (273) +++++.|+|+|+.+.+++++++++|++++|++++|++++++++.++..+..+++.++.++++.|.+++..|++++||.++ T Consensus 81 ~~~~~~i~d~d~~~~~~~~~~~~~~~~~alA~~vD~~i~~~~~~a~~~~~~~~~~~~~~~~~~i~~a~~~ld~~~vP~~~ 160 (273) T protein:vir:10 81 KSIDFLVDDIDRVQVAGSLEAYTRAGATALATDTDKFIADMLVDNGTALTGSAPTDADDAFDLIAKALKELTKANVPNVG 160 (273) T ss_pred eecceEeecHHHhhhhccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhHHHHHHHHHHHHhhhcCCCcCC Confidence 99999999999999999988899999999999999999999988888777788888899999999999999999999999 Q ss_pred cEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCceEEEEEecceeeecc Q lcl|Aclame:pro 161 RVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQIDTVEALR 240 (273) Q Consensus 161 r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a~~~~~~~~~ve~~~ 240 (273) |+++|+|++++.|+++++++.+.+..++...+++|.||+++||+|++|+++|.+++..++++|++|+++++|++++|.+| T Consensus 161 R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~~~~~~~A~~~a~q~~~~e~~r 240 (273) T protein:vir:10 161 RVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQIDTVEALR 240 (273) T ss_pred CEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEecccccCCccEEEEEeccceeeeeeeehhhccc Confidence 99999999999999998888888888888899999999999999999999999988889999999999999999999999 Q ss_pred CCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 241 DQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 241 ~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ++++|+|.|+++++||++++||+++++|+++|| T Consensus 241 ~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 241 DQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred CCCcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 999999999999999999999999999999999 No 3 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=100.00 E-value=5.3e-71 Score=405.88 Aligned_cols=273 Identities=100% Similarity=1.363 Sum_probs=262.2 Q ss_pred CcccchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEEEEEEee Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQE 80 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid~~ 80 (273) ||+++|+||+|+++++++|++++++.++++++++.++.+||||+||+++.+++++|+++++++.++++++++++++||++ T Consensus 1 MA~~~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid~~ 80 (273) T protein:vir:79 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQE 80 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecCcccccccccCCCccCccccccceEEEEEeee Confidence 99999999999999999999999999999999999999999999999999999999999998999999999999999999 Q ss_pred eeceeEechHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHHHHhhcCCCcCC Q lcl|Aclame:pro 81 KSIDFLVDDIDRVQVAGSLEAYTRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTKANVPNVG 160 (273) Q Consensus 81 ~~~~~~i~d~d~~~~~~~~~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~~~~vp~~~ 160 (273) +++++.|+|+|+.+.+++++++++|++++|++++|+++++++..++.....+++.++.++++.|.+++..|++++||.+| T Consensus 81 ~~~~~~i~d~d~~~~~~~~~~~~~~~~~ala~~vD~~i~~~~~~a~~~~~~~~~~~~~~~~~~i~~a~~~ld~~~vP~~~ 160 (273) T protein:vir:79 81 KSIDFLVDDIDRVQVAGSLEAYTRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTKANVPNVG 160 (273) T ss_pred cccceeeccHHHHhhcccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccchhhHHHHHHHHHHHhhhccCCccC Confidence 99999999999999999998899999999999999999999988877777778888889999999999999999999999 Q ss_pred cEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCceEEEEEecceeeecc Q lcl|Aclame:pro 161 RVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQIDTVEALR 240 (273) Q Consensus 161 r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a~~~~~~~~~ve~~~ 240 (273) |+++++|+++..|+++++++.+.+..++.+.+++|.||+++||+|++|+++|.+++..++++|++|+++++|.+++|.+| T Consensus 161 R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~i~~s~~lp~~~~~~~~a~~~~A~~~a~~~~~~e~~r 240 (273) T protein:vir:79 161 RVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQIDTVEALR 240 (273) T ss_pred cEEEECHHHHHHHhhchhhhhhhhhcccccceeeeEeeEEeceEEEecccccccCceEEEEEeccceeeeeehhhhhccc Confidence 99999999999999988888888888888899999999999999999999999888889999999999999999999999 Q ss_pred CCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 241 DQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 241 ~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ++++|+|.|+++++||++++||+++++|+++|| T Consensus 241 ~~~~~~~~v~~~~~yg~~v~~p~~vv~~~~~g~ 273 (273) T protein:vir:79 241 DQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred CcccceeeeeeeeeeeeEEecCceEEEEeccCC Confidence 999999999999999999999999999999999 No 4 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=100.00 E-value=3.8e-55 Score=318.91 Aligned_cols=269 Identities=20% Similarity=0.276 Sum_probs=227.0 Q ss_pred Cc--c------------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCc Q lcl|Aclame:pro 1 MA--F------------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSAD 66 (273) Q Consensus 1 MA--~------------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~ 66 (273) || | +.|+||+|++++++.|+++++|.+++ ++++.++.+||||+||+++.+++.+|.+ +.+++++ T Consensus 1 ~~~~~~~~~~~~~t~~v~~fipei~s~~i~~~l~~~~v~~~~~-~d~~~~~~~Gdtv~ip~~g~~~~~d~~~-~~~i~~~ 78 (341) T protein:vir:94 1 MALGNTITGPSINTQRGQQFIPEQWLSEVQMFRKAKMLDTSVV-KTWGAQVKKGDTFHVPRISELGVEDKAT-DVPVGVQ 78 (341) T ss_pred CcchhhhccccccchhHHHHHHHHHHHHHHHHHHhhcchhhcc-ccccccccCCceEEEeccCcceeeeecC-CCccccc Confidence 44 3 24789999999999999999999987 6888888889999999999999888865 5578899 Q ss_pred ccccceEEEEEEeeeeceeEechHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcccccc---------cccCC Q lcl|Aclame:pro 67 AISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADMLVDNGTALTG---------SAPSD 136 (273) Q Consensus 67 ~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~~---------~~~~~ 136 (273) ++++++++++||+++++++.|+|+|+.+.++++ .+++++++++|++++|++++++++..+..... .++.. T Consensus 79 ~~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~~~~~~~~~t~~~ 158 (341) T protein:vir:94 79 PVNDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNVFSSSNGAITGNG 158 (341) T ss_pred cccCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCccccCccccccCch Confidence 999999999999999999999999999999986 67899999999999999999988664432211 11112 Q ss_pred HhHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCC Q lcl|Aclame:pro 137 ADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD 216 (273) Q Consensus 137 ~~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~ 216 (273) ....++.|.++++.|++++||.++|+++|+|++|+.|+++++ |.+.+..++ ..+++|.||+++||+|++||++|.+++ T Consensus 159 ~~~~~~~i~~a~~~Lde~~VP~~gR~lvv~P~~~~~Ll~~~~-~~~~~~~g~-~~l~~G~ig~i~G~~V~~Sn~lp~~~~ 236 (341) T protein:vir:94 159 QAFSFAVFLAARRLLLEADVPEEKIVLLISPGQESALFTIPQ-FISKDFINN-APIAQGQIGSLMGVRVIRTSLIGNNSA 236 (341) T ss_pred hhhhHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhhchh-hhhhhcccc-chhheeeeeeEeceEEEEecccccccc Confidence 234688999999999999999999999999999999999876 555666654 578999999999999999999986543 Q ss_pred c----------------------------------EEEEEeCceEEEEE------------ecceeeeccCCCcceeeEE Q lcl|Aclame:pro 217 E----------------------------------QFVAFHPSAAAYVS------------QIDTVEALRDQDSFSDRIR 250 (273) Q Consensus 217 ~----------------------------------~~~~~~~~a~~~~~------------~~~~ve~~~~~~~~~~~v~ 250 (273) . ..+++|+++.+.++ +...+|..+++.+|+|+|. T Consensus 237 ~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 316 (341) T protein:vir:94 237 TGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVVSKAPRVTQSFENREQVWLMV 316 (341) T ss_pred ccccccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhhccccccccccccchhhhhhhhhh Confidence 2 23778888887765 3345677889999999999 Q ss_pred eeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 251 ALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 251 ~~~~~g~~vl~p~~~v~~~~~~s 273 (273) +++.|||++|||+|+|.|+.++- T Consensus 317 ~~~~~G~~~lrp~~~v~~~~~~~ 339 (341) T protein:vir:94 317 GRQAYGARLYRPLHAVNIHTTGD 339 (341) T ss_pred hhhhhcccccCcceeEEEecCcC Confidence 99999999999999999988877 No 5 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=100.00 E-value=3.5e-51 Score=297.19 Aligned_cols=266 Identities=20% Similarity=0.234 Sum_probs=223.6 Q ss_pred Ccc-----------------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCcc Q lcl|Aclame:pro 1 MAF-----------------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQT 63 (273) Q Consensus 1 MA~-----------------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~ 63 (273) |++ .++. |+|++++++.|++.++|.+++... ....|+|++||+++..++.+|.+ |.++ T Consensus 7 ~~~~~~~~~~~~~~~~d~~~al~l-e~~~geV~~~f~~~s~~~~~~~~r---~i~~G~tv~i~~ig~~~~~~~~~-g~~l 81 (332) T protein:vir:78 7 FSLPNQANGGARNADYDVRYATAL-KLFSGEVFTAFNNASIFKGLVRSY---DLRGGKSKQFMFTGKLSAGYHTP-GTPI 81 (332) T ss_pred ccCCccccCCccccccccchhhhh-hhhhhhHHHHHHHHhhhhhccccc---cccccceEEEEeccceeEeeecC-CCCC Confidence 322 2344 999999999999999999988642 23469999999999999988776 4456 Q ss_pred CCc-ccccceEEEEEEeeeeceeEechHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhccc------------- Q lcl|Aclame:pro 64 SAD-AISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADMLVDNGTA------------- 128 (273) Q Consensus 64 ~~~-~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~iD~~~~~~~~~~~~~------------- 128 (273) .++ ++++++++++||+.+++++.|+|.|+.+.++++ .+++++++++|++++|+.++..+..++.. T Consensus 82 ~~~~~~~~~~~~l~ID~~ky~~~~VddiD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~~~~~~~g~~~~ 161 (332) T protein:vir:78 82 VGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGGFHV 161 (332) T ss_pred CCCCCCCCceEEEEEehhhhhHHHHHhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCccccccccccc Confidence 665 589999999999999999999999999999987 56899999999999999999988765432 Q ss_pred -ccccccCCHhHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcc-hHHhhhhhcccccceeeeee-eeeecceEE Q lcl|Aclame:pro 129 -LTGSAPSDADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSS-GSKLTSADTSGDAAGLRAGT-IGNLLGARI 205 (273) Q Consensus 129 -~~~~~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~-~~~~~~~~~~~~~~~~~~G~-ig~i~G~~i 205 (273) ...+...++.++++.|.++++.|++++||.+|||++|+|++|..|++. ++.+.+.+..+....+++|. |++++||+| T Consensus 162 ~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~d~~~~n~~~~~~~~~~~~g~~i~~i~G~~V 241 (332) T protein:vir:78 162 NIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIAGIRI 241 (332) T ss_pred ccCCccccCHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHHhhcCceeeeeeccccccceecceeeeEEeeeEE Confidence 122334567789999999999999999999999999999999999973 23466666666667888886 899999999 Q ss_pred EEecccccCCC---------------------cEEEEEeCceEEEEEecc----eeeeccCCCcceeeEEeeeeeeeEEE Q lcl|Aclame:pro 206 VESNNLRDTDD---------------------EQFVAFHPSAAAYVSQID----TVEALRDQDSFSDRIRALHVYGGKVV 260 (273) Q Consensus 206 ~~s~~l~~~~~---------------------~~~~~~~~~a~~~~~~~~----~ve~~~~~~~~~~~v~~~~~~g~~vl 260 (273) |+||++|..++ ..++++|++|++.+++.+ .+|.+|++++|+|.|.+++.||++++ T Consensus 242 ~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~~~~~~~~~d~i~~~~~~G~~v~ 321 (332) T protein:vir:78 242 LKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAMGCGSL 321 (332) T ss_pred EecCccccCcccccccccccccccccccccccceEEeecccceeeeeeeccchhhhhcccchhhhHhhhhhhhhhcCcee Confidence 99999995432 246889999999987543 56789999999999999999999999 Q ss_pred cCceEEEEecC Q lcl|Aclame:pro 261 RPTGVVVFNKT 271 (273) Q Consensus 261 ~p~~~v~~~~~ 271 (273) |||++++|+++ T Consensus 322 rPe~~v~l~~a 332 (332) T protein:vir:78 322 RTSVAGSFQAA 332 (332) T ss_pred cccceEEEeeC Confidence 99999999999 No 6 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=100.00 E-value=2.7e-50 Score=292.30 Aligned_cols=266 Identities=19% Similarity=0.158 Sum_probs=219.7 Q ss_pred Ccc---------------------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCC Q lcl|Aclame:pro 1 MAF---------------------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAA 59 (273) Q Consensus 1 MA~---------------------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~ 59 (273) ||| .++. |+|++++++.|+++++|.+++... ....|++++||+++..++.+|.+ T Consensus 1 ~~~~~~~~~~~t~~g~~~~~~~~~al~i-e~~~g~V~~~f~~~s~~~~~v~~r---~~~~G~sv~i~~iG~~t~~~~~~- 75 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQSAADKLALFL-KVFGGEVLTAFARTSVTMPRHMLR---SIASGKSAQFPVIGRTKAAYLKP- 75 (347) T ss_pred CCCCccCcccccccccCCcccchHHHHH-HHHHHHHHHHHHHHHhhhhhhccc---cccccceeEeeeccceeeeeecC- Confidence 664 1344 999999999999999999998742 23459999999999999987775 Q ss_pred CCcc--CCcccccceEEEEEEeeeeceeEechHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhccc-------- Q lcl|Aclame:pro 60 GRQT--SADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADMLVDNGTA-------- 128 (273) Q Consensus 60 ~~~~--~~~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~iD~~~~~~~~~~~~~-------- 128 (273) |..+ +.+++..++.+++||+.+++++.|+|.|+.++++++ .++.++++++|++++|+.++..+...... T Consensus 76 g~~l~~~~~~~~~~e~~ltiD~~~y~~~~VddiD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~ 155 (347) T protein:vir:33 76 GENLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGSNENI 155 (347) T ss_pred CCCCCCCCCCCccceEEEEechhhhhhHHHhhHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccc Confidence 4444 345688899999999999999999999999999986 56899999999999999998765421100 Q ss_pred -----------ccccc------cCCHhHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccce Q lcl|Aclame:pro 129 -----------LTGSA------PSDADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAG 191 (273) Q Consensus 129 -----------~~~~~------~~~~~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~ 191 (273) ...++ ..++.++++.|.++++.|++++||.++||+||+|++|+.|+++++ +.+.+..+ ... T Consensus 156 ~~~~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~-~~~~d~~~-~~~ 233 (347) T protein:vir:33 156 EGLGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALM-PNAANYQA-LLD 233 (347) T ss_pred ccccccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCcEEEeCHHHHHHHhcccc-cccccccc-ccc Confidence 00011 112456799999999999999999999999999999999999875 55666654 567 Q ss_pred eeeeeeeeecceEEEEecccccCCC-----------------------------cEEEEEeCceEEEEEecc-eeeeccC Q lcl|Aclame:pro 192 LRAGTIGNLLGARIVESNNLRDTDD-----------------------------EQFVAFHPSAAAYVSQID-TVEALRD 241 (273) Q Consensus 192 ~~~G~ig~i~G~~i~~s~~l~~~~~-----------------------------~~~~~~~~~a~~~~~~~~-~ve~~~~ 241 (273) +.+|.|++++||+||+||++|.... ..++++|++|+|.+++.+ ++|..|+ T Consensus 234 ~~~G~V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e~~r~ 313 (347) T protein:vir:33 234 PERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARR 313 (347) T ss_pred cccceeEEEeceeEEEecccccCccccccccccccccccccCCcccceeccccceeeeeecchhheeeeeeceeeeeccc Confidence 8899999999999999999986432 124788999999998776 8999999 Q ss_pred CCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 242 QDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 242 ~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) +++|+|+|.+++.||++++||+++|.|+-.+= T Consensus 314 ~~~~~d~i~~~~~~G~~vlrP~~av~i~~~~~ 345 (347) T protein:vir:33 314 ANYQADQIIAKYAMGHGGLRPEAAGAIVLPKV 345 (347) T ss_pred hhhhhHhhhhhhhcCCceecccceEEEecCCC Confidence 99999999999999999999999999976665 No 7 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=100.00 E-value=9.1e-49 Score=283.97 Aligned_cols=267 Identities=19% Similarity=0.169 Sum_probs=229.0 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEE Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 74 (273) ||+ +.|+||+|+++++++|.+.+++.+++..++++++.+|++|+||+|..++..+...++..+++++++.++.+ T Consensus 1 Ma~~~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~~~~~g~~i~~~~lt~~~~~ 80 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQDVAEGAAIDYSALETESVK 80 (278) T ss_pred CCCcceehhheecHHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeeccCCcceeecCCCcCcccccccceee Confidence 998 46899999999999999999999999989888888999999999998887666666778999999999999 Q ss_pred EEEEeeeeceeEechHHHHHhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc-cCCHhHHHHHHHHHHHHHh Q lcl|Aclame:pro 75 LLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADMLVDNGTALTGSA-PSDADDAFDLIASALKELT 152 (273) Q Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~-~~~~~~~~~~i~~a~~~l~ 152 (273) ++|++. +.+|.++|++..+...+ ++...+++++++++++|+++++.+..+.....++. ..+....++.|.++...|+ T Consensus 81 ~~i~~~-~~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~~~~~~~t~~~~~~~~~~~~da~~~l~ 159 (278) T protein:vir:80 81 HGIKKA-GKGVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTLEVKGAINIGLIDKIENTFTDAPDAIE 159 (278) T ss_pred Eeeehh-hccccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhhhHHHHHHHHHHhhc Confidence 999774 66899999998888776 57889999999999999999999977655544333 2345567889999999999 Q ss_pred hcCCCcCCcEEEECHHHHHHHhcch-HHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCceEEEE- Q lcl|Aclame:pro 153 KANVPNVGRVVVVNAEMAFWLRSSG-SKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYV- 230 (273) Q Consensus 153 ~~~vp~~~r~lvv~p~~~~~L~~~~-~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a~~~~- 230 (273) ++++|. .++++++|++|+.|+++. ..+.+.+..+ ++.+++|.||+++||+|++|+++|.+ ++++++++|+++. T Consensus 160 ~~~~~~-~~~ivv~p~~~~~L~k~~~~~~~~~~~~g-~~~~~~G~ig~~~G~~Vi~s~~~p~~---t~~l~~~gAi~~~~ 234 (278) T protein:vir:80 160 DESITT-TGVLFLNYKDTAKLREEAAGSWTKASQLG-DDLLVKGAFGELLGWEIVRTKKLADG---NALAVKAGALKTFL 234 (278) T ss_pred ccCCCc-ccEEEECHHHHHHHHhhhhhhcccccccc-ccceeeccceeecceeEEEcCCCCcc---eEEEEeccceeeee Confidence 999985 568999999999998764 2344444444 45889999999999999999999864 4788899999976 Q ss_pred EecceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 231 SQIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 231 ~~~~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ++...+|.+|++++++|.++++++||++++||+++|++++.+- T Consensus 235 ~~~~~vE~~Rd~~~~~d~i~~~~~yg~~v~~~~~~v~it~~a~ 277 (278) T protein:vir:80 235 KRNLLAESGRDMDHKLTKFNADQHYAVALVDETKAVKVVPVAG 277 (278) T ss_pred cCCcccccccchhhccceeeeeeEEEEEEEcCcceEEEeeccC Confidence 5666899999999999999999999999999999999987766 No 8 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=100.00 E-value=6.4e-49 Score=284.82 Aligned_cols=268 Identities=18% Similarity=0.133 Sum_probs=218.5 Q ss_pred Cccc--------------------chhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCC Q lcl|Aclame:pro 1 MAFN--------------------NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAG 60 (273) Q Consensus 1 MA~~--------------------~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~ 60 (273) |||. .+.=|+|++++++.|++.+++.+++.+. ....|++++||+++..++.+|.+.. T Consensus 1 ma~~~~~~~~~t~~~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~---~~~~G~sv~i~~ig~~t~~~~~~g~ 77 (347) T protein:vir:15 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLR---SIASGKSAQFPVIGRTKAAYLKPGE 77 (347) T ss_pred CCccccCCccccccccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhccccc---cccccceeEeeeccceeeeeeccCC Confidence 5551 1234889999999999999999998642 2446999999999999998877643 Q ss_pred C-ccCCcccccceEEEEEEeeeeceeEechHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcccc--------- Q lcl|Aclame:pro 61 R-QTSADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADMLVDNGTAL--------- 129 (273) Q Consensus 61 ~-~~~~~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~iD~~~~~~~~~~~~~~--------- 129 (273) . +.+.++++.++++++||+.+++++.|+|.|+.+.++++ .+++++++++|++++|+.++..+....... T Consensus 78 ~l~~~~~~~~~~e~~ltID~~~~~~~~VddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~ 157 (347) T protein:vir:15 78 NLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDASNENIEG 157 (347) T ss_pred CCCCCCCCCccceEEEEechhhhhhHHhhhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Confidence 3 22445688999999999999999999999999999986 568999999999999999998875431100 Q ss_pred ----------cc--cccC----CHhHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceee Q lcl|Aclame:pro 130 ----------TG--SAPS----DADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLR 193 (273) Q Consensus 130 ----------~~--~~~~----~~~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~ 193 (273) .. +... ....+++.|.+|++.|++++||.++||++|+|++|..|+++++ +.+.+..+ ...++ T Consensus 158 ~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL~~~~-~~~~d~~~-~~~~~ 235 (347) T protein:vir:15 158 LGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALM-PNAANYQA-LIDHE 235 (347) T ss_pred cCccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhcccc-cccccccc-ccccc Confidence 00 0001 1345688999999999999999999999999999999999875 55666654 45689 Q ss_pred eeeeeeecceEEEEecccccCCC-----------------------------cEEEEEeCceEEEEEecc-eeeeccCCC Q lcl|Aclame:pro 194 AGTIGNLLGARIVESNNLRDTDD-----------------------------EQFVAFHPSAAAYVSQID-TVEALRDQD 243 (273) Q Consensus 194 ~G~ig~i~G~~i~~s~~l~~~~~-----------------------------~~~~~~~~~a~~~~~~~~-~ve~~~~~~ 243 (273) +|.|++++||+||+||++|.... ...+++|++|++.++..+ ++|..|+++ T Consensus 236 ~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e~~~~~~ 315 (347) T protein:vir:15 236 RGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRAN 315 (347) T ss_pred ceEEEEEeceEEEecccccccccccccccccccccccccccccceeeeccccceeeeeccceeeeeEeeceeeeecccch Confidence 99999999999999999985322 124788999999998665 899999999 Q ss_pred cceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 244 SFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 244 ~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) +|+|+|.+++.||++++||+++|.|+..+= T Consensus 316 ~~~d~i~~~~~~G~~vlrP~~av~~~~~~~ 345 (347) T protein:vir:15 316 YQADQIIAKYAMGHGGLRPEAAGAIVLPKV 345 (347) T ss_pred hhhhhhehhhhcCCceeccccEEEEecCCC Confidence 999999999999999999999999876665 No 9 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=100.00 E-value=3.6e-48 Score=280.70 Aligned_cols=261 Identities=18% Similarity=0.207 Sum_probs=222.4 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEE Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 74 (273) ||+ ++++||+|+.++++++.+.++|.+++..+.++++.+|+||+||+|..++.++...+|..+++++++.++.+ T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccchhhcccceeE Confidence 999 47889999999999999999999998888888888999999999998876666667788999999999999 Q ss_pred EEEEeeeeceeEechHHHHHhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHHHHhh Q lcl|Aclame:pro 75 LLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTK 153 (273) Q Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~~ 153 (273) ++|++ ++.+|.++|++..+..++ ++++++|++.++++++|+++++.+..+..... +....++.|.+|...|++ T Consensus 81 ~~i~~-~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~~-----~~~~~~d~i~~A~~~lgd 154 (274) T protein:vir:96 81 AKIRK-IAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLTVE-----ADITKLTGLQTAIDKFND 154 (274) T ss_pred EEeee-eecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc-----ccccCHHHHHHHHHHhcc Confidence 99977 689999999999888776 57899999999999999999999865443321 122347889999999998 Q ss_pred cCCCcCCcEEEECHHHHHHHhcch-HHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCceEEEE-E Q lcl|Aclame:pro 154 ANVPNVGRVVVVNAEMAFWLRSSG-SKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYV-S 231 (273) Q Consensus 154 ~~vp~~~r~lvv~p~~~~~L~~~~-~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a~~~~-~ 231 (273) ++. .+|+++|+|++++.|+++. ..|.+.+..+ .+++++|.||+++||+|++|+++|.. ++++++++|+++. + T Consensus 155 ~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g-~~~~~~G~ig~~~G~~Vi~s~~~~~~---t~~l~~~gA~~~~~~ 228 (274) T protein:vir:96 155 EDL--EPMVLFISPLDAGKLRGDATTNFTRATELG-DDVIVKGAFGEALGAVIVRSNKLEAG---TAILAKKGAVKLITK 228 (274) T ss_pred ccc--cccEEEeCHHHHHHHHhhcccccccccccc-ccceeccccceecCeEEEEeCCCCCc---eEEEEeccceeeeec Confidence 774 6899999999999999874 2344444444 47899999999999999999999854 4688899999985 5 Q ss_pred ecceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEe-cCCC Q lcl|Aclame:pro 232 QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFN-KTGS 273 (273) Q Consensus 232 ~~~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~-~~~s 273 (273) |...+|.+|++++++|.+++++|||+++++|+++|+++ .++| T Consensus 229 ~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~~ 271 (274) T protein:vir:96 229 RDFFLETDRDPSTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) T ss_pred CCcccccccccccccCEEEEeEEEEEEEEcCCcEEEEEcCCcc Confidence 66689999999999999999999999999999999996 3444 No 10 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=100.00 E-value=3.6e-48 Score=280.70 Aligned_cols=261 Identities=18% Similarity=0.207 Sum_probs=222.4 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEE Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 74 (273) ||+ ++++||+|+.++++++.+.++|.+++..+.++++.+|+||+||+|..++.++...+|..+++++++.++.+ T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccchhhcccceeE Confidence 999 47889999999999999999999998888888888999999999998876666667788999999999999 Q ss_pred EEEEeeeeceeEechHHHHHhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHHHHhh Q lcl|Aclame:pro 75 LLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTK 153 (273) Q Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~~ 153 (273) ++|++ ++.+|.++|++..+..++ ++++++|++.++++++|+++++.+..+..... +....++.|.+|...|++ T Consensus 81 ~~i~~-~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~~-----~~~~~~d~i~~A~~~lgd 154 (274) T protein:vir:95 81 AKIRK-IAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLTVE-----ADITKLTGLQTAIDKFND 154 (274) T ss_pred EEeee-eecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc-----ccccCHHHHHHHHHHhcc Confidence 99977 689999999999888776 57899999999999999999999865443321 122347889999999998 Q ss_pred cCCCcCCcEEEECHHHHHHHhcch-HHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCceEEEE-E Q lcl|Aclame:pro 154 ANVPNVGRVVVVNAEMAFWLRSSG-SKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYV-S 231 (273) Q Consensus 154 ~~vp~~~r~lvv~p~~~~~L~~~~-~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a~~~~-~ 231 (273) ++. .+|+++|+|++++.|+++. ..|.+.+..+ .+++++|.||+++||+|++|+++|.. ++++++++|+++. + T Consensus 155 ~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g-~~~~~~G~ig~~~G~~Vi~s~~~~~~---t~~l~~~gA~~~~~~ 228 (274) T protein:vir:95 155 EDL--EPMVLFISPLDAGKLRGDATTNFTRATELG-DDVIVKGAFGEALGAVIVRSNKLEAG---TAILAKKGAVKLITK 228 (274) T ss_pred ccc--cccEEEeCHHHHHHHHhhcccccccccccc-ccceeccccceecCeEEEEeCCCCCc---eEEEEeccceeeeec Confidence 774 6899999999999999874 2344444444 47899999999999999999999854 4688899999985 5 Q ss_pred ecceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEe-cCCC Q lcl|Aclame:pro 232 QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFN-KTGS 273 (273) Q Consensus 232 ~~~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~-~~~s 273 (273) |...+|.+|++++++|.+++++|||+++++|+++|+++ .++| T Consensus 229 ~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~~ 271 (274) T protein:vir:95 229 RDFFLETDRDPSTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) T ss_pred CCcccccccccccccCEEEEeEEEEEEEEcCCcEEEEEcCCcc Confidence 66689999999999999999999999999999999996 3444 No 11 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=100.00 E-value=5.2e-48 Score=279.80 Aligned_cols=261 Identities=19% Similarity=0.209 Sum_probs=223.9 Q ss_pred Cccc------chhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEE Q lcl|Aclame:pro 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~------~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 74 (273) |||. .++||+|+++++++|.+.+++.+++.+++++++.+|+||+||+|..++..+...+|..+++++++.++.+ T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccchhhcccceee Confidence 9994 5899999999999999999999999999999998999999999998876665667778999999999999 Q ss_pred EEEEeeeeceeEechHHHHHhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHHHHhh Q lcl|Aclame:pro 75 LLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTK 153 (273) Q Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~~ 153 (273) ++|++ ++.+|.++|++..+..++ +++..+|++.++++++|++++..+..+..... .....++.|.+|+..|++ T Consensus 81 ~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~~~~-----~~a~~~d~i~dA~~~lgd 154 (274) T protein:vir:12 81 AKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVN-----ADITKLNGLQSAIDKFND 154 (274) T ss_pred EEeee-ecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccc-----ccccCHHHHHHHHHHhcc Confidence 99977 689999999998888777 57889999999999999999999866543322 123457889999999998 Q ss_pred cCCCcCCcEEEECHHHHHHHhcch-HHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCceEEEE-E Q lcl|Aclame:pro 154 ANVPNVGRVVVVNAEMAFWLRSSG-SKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYV-S 231 (273) Q Consensus 154 ~~vp~~~r~lvv~p~~~~~L~~~~-~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a~~~~-~ 231 (273) ++. .+|+++|+|+++..|+++. ..|.+.+..+ .+.+++|.||+++||+|++|+.+|.. ++++++++|+++. + T Consensus 155 ~~~--~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g-~~~~~~G~ig~~~G~~Vi~s~~~p~~---t~~l~~~gA~~~~~~ 228 (274) T protein:vir:12 155 EDL--EPMVLFINPLDAGKLRGDASTNFTRATELG-DDIIVKGAFGEALGAIIVRSNKLEAG---TAILAKKGAVKLILK 228 (274) T ss_pred ccc--cccEEEeCHHHHHHHHhhhhhhcccccccc-ccceecccceeecCeeEEEeCCCCcc---eEEEEeccceeeeec Confidence 774 6799999999999999874 2344544444 47899999999999999999999864 4688899999986 5 Q ss_pred ecceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEe-cCCC Q lcl|Aclame:pro 232 QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFN-KTGS 273 (273) Q Consensus 232 ~~~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~-~~~s 273 (273) +...+|.+|+++++.|.+++++|||+++++|+++|+++ +.+| T Consensus 229 ~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~ 271 (274) T protein:vir:12 229 RDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) T ss_pred CCceeccccchhhcccEEEeeeEEEEEEEcCCceEEEEcCCcc Confidence 56689999999999999999999999999999999885 4555 No 12 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=100.00 E-value=1.4e-49 Score=288.45 Aligned_cols=267 Identities=18% Similarity=0.144 Sum_probs=216.9 Q ss_pred Cccc-------------------chhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCC Q lcl|Aclame:pro 1 MAFN-------------------NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGR 61 (273) Q Consensus 1 MA~~-------------------~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~ 61 (273) ||+. -+.=|+|..+++..|++.++|.+++.+. ....|++++||++|...+.+|++ |+ T Consensus 1 m~~~~~~~~~t~~g~~~~~~d~~al~ik~f~~eV~~~f~~~s~~~~~~~~r---~i~~G~sv~i~~iG~~tv~~~t~-G~ 76 (347) T protein:vir:94 1 MANVPGQKIGTDQGKGKSSSDALALFLKVFAGEVLTAFTRRSVTADKHIVR---TIQNGKSAQFPVMGRTSGVYLAP-GE 76 (347) T ss_pred CCCCCccccccccccCCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccc---cccccceEEEecccceeeeeecC-CC Confidence 5441 1112678888888899999999887543 34569999999999999988776 44 Q ss_pred cc--CCcccccceEEEEEEeeeeceeEechHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcccc-------c- Q lcl|Aclame:pro 62 QT--SADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADMLVDNGTAL-------T- 130 (273) Q Consensus 62 ~~--~~~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~iD~~~~~~~~~~~~~~-------~- 130 (273) .+ +.+++.+++++++||+++++.+.|+|.|+.+.++++ .++.++++++|++.+|+.++..+...+... . T Consensus 77 ~l~~~~~~~~~~e~~itID~~~~~~~~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~~~~~g 156 (347) T protein:vir:94 77 RLSDKRKGIKHTEKVITIDGLLTADVMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAASNENIAG 156 (347) T ss_pred CcCCCCCCCCcceEEEEecchhhhhHHhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCC Confidence 44 456789999999999999999999999999999986 568999999999999999988764311100 0 Q ss_pred ---------------ccccCCHhHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeee Q lcl|Aclame:pro 131 ---------------GSAPSDADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAG 195 (273) Q Consensus 131 ---------------~~~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G 195 (273) .+...++..+++.|.++++.|++++||.++||++|+|++|+.|+.+. .+.+.+..+ ...+++| T Consensus 157 ~~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~-~~~~~~~~~-~~~~~~G 234 (347) T protein:vir:94 157 LGTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILAAL-MPNAANYAA-LIDPETG 234 (347) T ss_pred CcccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhccc-hhhhhhccc-ccccccc Confidence 01112345678899999999999999999999999999999998765 466655544 4568899 Q ss_pred eeeeecceEEEEecccccCC---------------------------------CcEEEEEeCceEEEEEecc-eeeeccC Q lcl|Aclame:pro 196 TIGNLLGARIVESNNLRDTD---------------------------------DEQFVAFHPSAAAYVSQID-TVEALRD 241 (273) Q Consensus 196 ~ig~i~G~~i~~s~~l~~~~---------------------------------~~~~~~~~~~a~~~~~~~~-~ve~~~~ 241 (273) .|++++||+||+||++|... ....+++||+|++.+++++ ++|.+|+ T Consensus 235 ~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~ 314 (347) T protein:vir:94 235 NIRNVMGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDLALERDRD 314 (347) T ss_pred ceEEEeceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhcccccccchhc Confidence 99999999999999998421 1235788999999999887 8999999 Q ss_pred CCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 242 QDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 242 ~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) +++|+|+|.++++||++++|||++++|+.+.. T Consensus 315 ~~~~~d~i~~~~~~G~~~~rP~~a~~~~~~~A 346 (347) T protein:vir:94 315 VDAQGDLIVGKYAMGHGGLRPEAAGALVFSPA 346 (347) T ss_pred hhhHHHHhhhhhhhcCcccccceeEEEEecCC Confidence 99999999999999999999999999998866 No 13 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=100.00 E-value=5e-49 Score=285.37 Aligned_cols=265 Identities=20% Similarity=0.195 Sum_probs=215.3 Q ss_pred Cccc---------------------chhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCC Q lcl|Aclame:pro 1 MAFN---------------------NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAA 59 (273) Q Consensus 1 MA~~---------------------~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~ 59 (273) |||. .+.=|+|.+++++.|.+.++|.+++.+. ....|++++||++|...++.+.+ T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~~~al~ie~~~geV~~~f~~~s~~~~~~~~r---~i~~g~s~~~~~iG~~~~~~~~~- 76 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVR---SISSGKSAQFPVLGRTQAAYLAP- 76 (344) T ss_pred CccccccccCCcccCCccCCccchhHHHHHHHHHHHHHHHHHHhhhcccceee---eecccceEEEEeeceeEEEeeec- Confidence 7752 1123999999999999999999998642 34459999999999999875554 Q ss_pred CCccCC--cccccceEEEEEEeeeeceeEechHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhccc-------- Q lcl|Aclame:pro 60 GRQTSA--DAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADMLVDNGTA-------- 128 (273) Q Consensus 60 ~~~~~~--~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~iD~~~~~~~~~~~~~-------- 128 (273) |++++. +++.+++++++||+.+++.+.|+|.|+.++++++ .++.++++++|++.+|+.++..++..+.. T Consensus 77 G~~l~~t~~~~~~~e~~l~ID~~~y~~~~VdDiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~~~~~~~ 156 (344) T protein:vir:10 77 GENLDDIRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESQYNENI 156 (344) T ss_pred CCCCCCCCCCcccceEEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Confidence 555544 5688999999999999999999999999999986 56899999999999999998877532210 Q ss_pred ------------ccc----cccCCHhHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhccccccee Q lcl|Aclame:pro 129 ------------LTG----SAPSDADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGL 192 (273) Q Consensus 129 ------------~~~----~~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~ 192 (273) ... ....++..+++.|.+++..|++++||.++||++|+|++|+.|++++. +.+.+. ++...+ T Consensus 157 ~g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~-~~~~~~-~~~~~~ 234 (344) T protein:vir:10 157 TGLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPSSDRVFYCDPDSYSAILAALM-PNAANY-AALIDP 234 (344) T ss_pred ccccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCCEEEeChHHHHHHhhccc-cccccc-ccccce Confidence 000 01123356789999999999999999999999999999999998765 445554 455678 Q ss_pred eeeeeeeecceEEEEecccccCC----------------------------CcEEEEEeCceEEEEEecc-eeeeccCCC Q lcl|Aclame:pro 193 RAGTIGNLLGARIVESNNLRDTD----------------------------DEQFVAFHPSAAAYVSQID-TVEALRDQD 243 (273) Q Consensus 193 ~~G~ig~i~G~~i~~s~~l~~~~----------------------------~~~~~~~~~~a~~~~~~~~-~ve~~~~~~ 243 (273) ++|.|++++||+||+||++|.+. ....+++||+|++.+++.+ ++|.+|+++ T Consensus 235 ~~G~V~~v~G~~V~~Sn~lp~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~~~e~~r~~~ 314 (344) T protein:vir:10 235 EKGSIRNVMGFEVVEVPHLTAGGAGTSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLALERARRAN 314 (344) T ss_pred eeeEEEEEeceEEEeccccccccCCcccccccCccccccCCcccceeeecceeEEEeechhhhhhhhhccceeecccchh Confidence 89999999999999999998421 1234688999999998876 899999999 Q ss_pred cceeeEEeeeeeeeEEEcCceE--EEEecC Q lcl|Aclame:pro 244 SFSDRIRALHVYGGKVVRPTGV--VVFNKT 271 (273) Q Consensus 244 ~~~~~v~~~~~~g~~vl~p~~~--v~~~~~ 271 (273) +|+|+|.+++.||++++|||++ |+++.. T Consensus 315 ~~~d~i~g~~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 315 FQADQIIAKYAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred HHHHHHHHHhhcccceecccceEEEEeecC Confidence 9999999999999999999987 455444 No 14 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=100.00 E-value=1.6e-47 Score=277.08 Aligned_cols=261 Identities=20% Similarity=0.221 Sum_probs=223.7 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEE Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 74 (273) ||+ ++++||+|+..++++|.+.+++.+++.+++++++.+|++|+||+|...+..+...++..+++++++.++.+ T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~~g~~i~~~~it~~~~~ 80 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) T ss_pred CCccccchhhhhhhHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCccccCCCCcCchhhcccceeE Confidence 997 47889999999999999999999999999988888999999999997665555566778999999999999 Q ss_pred EEEEeeeeceeEechHHHHHhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHHHHhh Q lcl|Aclame:pro 75 LLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTK 153 (273) Q Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~~ 153 (273) ++|++ ++++|.++|++..+...+ +....+++++++++++|+++++.+..++.... .....++.|.+|...|++ T Consensus 81 ~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~~~~~-----~~~~~~d~i~dA~~~l~d 154 (274) T protein:vir:96 81 AKVRK-IGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVE-----ADITKLDGLQTAIDKFND 154 (274) T ss_pred EEEEe-eeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcC-----cccccHHHHHHHHHHhcc Confidence 99977 588999999998887776 57789999999999999999999865433221 123357899999999999 Q ss_pred cCCCcCCcEEEECHHHHHHHhcch-HHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCceEEEEEe Q lcl|Aclame:pro 154 ANVPNVGRVVVVNAEMAFWLRSSG-SKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQ 232 (273) Q Consensus 154 ~~vp~~~r~lvv~p~~~~~L~~~~-~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a~~~~~~ 232 (273) +++ .+|+++|+|+++..|+++. ..|...+..+ ++.+++|.||+++||+|++|+++|.. ++++++++|+++..+ T Consensus 155 ~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g-~~~~~~g~ig~~~G~~Vi~s~~~p~~---t~~l~~~gA~~~~~~ 228 (274) T protein:vir:96 155 EDL--EPMVLFVNPLDAGGLRTSASDNFTRPTQLG-DNIIVKGAFGEALGAVIVRSNKLNKG---EALLAKKGAVKLITK 228 (274) T ss_pred cCC--CceEEEeCHHHHHHHHhccccccccccccc-ccceeecccceecCeeEEEcCCCCcc---eEEEEeCcceeeeec Confidence 875 6799999999999998874 2344444444 46899999999999999999999865 478899999999765 Q ss_pred cc-eeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 233 ID-TVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 233 ~~-~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) .. .+|.+|++.+++|.++++++||+++++|+++|+++.+.. T Consensus 229 ~~~~vE~~Rd~~~~~d~i~~~~~yg~~~~~~~~vv~~t~~~~ 270 (274) T protein:vir:96 229 RDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAG 270 (274) T ss_pred CCcccccccchhhcccEEEEeeEEEEEEEcCccEEEEEcCcc Confidence 54 899999999999999999999999999999999998887 No 15 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=100.00 E-value=1.8e-47 Score=276.82 Aligned_cols=261 Identities=18% Similarity=0.201 Sum_probs=223.2 Q ss_pred Cccc------chhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEE Q lcl|Aclame:pro 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~------~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 74 (273) |||. .++||+|++++++++++.++|.+++.+++++++.+|++|+||+|..++..+...+|..+++++++.++.+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCcccccccccceeE Confidence 9994 5899999999999999999999999999998888999999999998776665566778999999999999 Q ss_pred EEEEeeeeceeEechHHHHHhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHHHHhh Q lcl|Aclame:pro 75 LLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTK 153 (273) Q Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~~ 153 (273) ++|++ .+++|.++|++..+..++ +++..+++++++++++|+++++.+..++....+ ....++.|.+|+..|++ T Consensus 81 ~~i~~-~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~~~-----~~~~~d~i~dA~~~l~d 154 (274) T protein:vir:97 81 AKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNA-----DITKLNGLQSAIDKFND 154 (274) T ss_pred EEeee-ecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccc-----cccCHHHHHHHHHHhhc Confidence 99977 578999999999888776 578899999999999999999998765544322 22347889999999998 Q ss_pred cCCCcCCcEEEECHHHHHHHhcch-HHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCceEEEE-E Q lcl|Aclame:pro 154 ANVPNVGRVVVVNAEMAFWLRSSG-SKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYV-S 231 (273) Q Consensus 154 ~~vp~~~r~lvv~p~~~~~L~~~~-~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a~~~~-~ 231 (273) ++. .+|+++|+|+++..|+++. ..|.+.+..+ ++.+++|.||+++||+|++|+++|.. ++++++++|+++. + T Consensus 155 ~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g-~~~~~~G~ig~~~G~~Vi~s~~~p~~---t~~l~~~gA~~~~~~ 228 (274) T protein:vir:97 155 EDL--EPMVLFVNPLDAGKLRGDASTNFTRATELG-DDIIVKGAFGEALGAIIVRTNKLEAG---TAILAKKGAVKLILK 228 (274) T ss_pred cCC--CceEEEeCHHHHHHHHhhhhhhccccCccc-ccceeccccceecCeeEEEcCCCCcc---eEEEEeCcceEeeec Confidence 875 6789999999999999874 2344444444 46889999999999999999999854 4788999999986 4 Q ss_pred ecceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCC-C Q lcl|Aclame:pro 232 QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTG-S 273 (273) Q Consensus 232 ~~~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~-s 273 (273) +...+|.+|++++++|.+++++|||+++++|+++|+++.++ | T Consensus 229 ~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~ 271 (274) T protein:vir:97 229 RDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) T ss_pred CCceeccccchhhcccEEEEEEEEEEEEEcCCceEEEecCccc Confidence 55689999999999999999999999999999999986554 4 No 16 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=100.00 E-value=1.8e-47 Score=276.82 Aligned_cols=261 Identities=18% Similarity=0.201 Sum_probs=223.2 Q ss_pred Cccc------chhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEE Q lcl|Aclame:pro 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~------~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 74 (273) |||. .++||+|++++++++++.++|.+++.+++++++.+|++|+||+|..++..+...+|..+++++++.++.+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCcccccccccceeE Confidence 9994 5899999999999999999999999999998888999999999998776665566778999999999999 Q ss_pred EEEEeeeeceeEechHHHHHhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHHHHhh Q lcl|Aclame:pro 75 LLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTK 153 (273) Q Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~~ 153 (273) ++|++ .+++|.++|++..+..++ +++..+++++++++++|+++++.+..++....+ ....++.|.+|+..|++ T Consensus 81 ~~i~~-~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~~~-----~~~~~d~i~dA~~~l~d 154 (274) T protein:vir:94 81 AKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNA-----DITKLNGLQSAIDKFND 154 (274) T ss_pred EEeee-ecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccc-----cccCHHHHHHHHHHhhc Confidence 99977 578999999999888776 578899999999999999999998765544322 22347889999999998 Q ss_pred cCCCcCCcEEEECHHHHHHHhcch-HHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCceEEEE-E Q lcl|Aclame:pro 154 ANVPNVGRVVVVNAEMAFWLRSSG-SKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYV-S 231 (273) Q Consensus 154 ~~vp~~~r~lvv~p~~~~~L~~~~-~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a~~~~-~ 231 (273) ++. .+|+++|+|+++..|+++. ..|.+.+..+ ++.+++|.||+++||+|++|+++|.. ++++++++|+++. + T Consensus 155 ~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g-~~~~~~G~ig~~~G~~Vi~s~~~p~~---t~~l~~~gA~~~~~~ 228 (274) T protein:vir:94 155 EDL--EPMVLFVNPLDAGKLRGDASTNFTRATELG-DDIIVKGAFGEALGAIIVRTNKLEAG---TAILAKKGAVKLILK 228 (274) T ss_pred cCC--CceEEEeCHHHHHHHHhhhhhhccccCccc-ccceeccccceecCeeEEEcCCCCcc---eEEEEeCcceEeeec Confidence 875 6789999999999999874 2344444444 46889999999999999999999854 4788999999986 4 Q ss_pred ecceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCC-C Q lcl|Aclame:pro 232 QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTG-S 273 (273) Q Consensus 232 ~~~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~-s 273 (273) +...+|.+|++++++|.+++++|||+++++|+++|+++.++ | T Consensus 229 ~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~ 271 (274) T protein:vir:94 229 RDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) T ss_pred CCceeccccchhhcccEEEEEEEEEEEEEcCCceEEEecCccc Confidence 55689999999999999999999999999999999986554 4 No 17 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=100.00 E-value=1.4e-47 Score=277.52 Aligned_cols=263 Identities=21% Similarity=0.183 Sum_probs=228.4 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEE Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 74 (273) ||| +.++||+|++.++++|.+.+++.+++.++.++++.+|+||+||+|+.++.++...+|..+++++++.++.+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~~~~eg~~i~~~~lt~~~~~ 80 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAEGGEISLDKIGTTTKS 80 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeeccCccccccCCCCccChhhcCCccee Confidence 997 47789999999999999999999999999988888999999999999988887788889999999999999 Q ss_pred EEEEeeeeceeEechHHHHHhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHHHHhh Q lcl|Aclame:pro 75 LLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTK 153 (273) Q Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~~ 153 (273) ++|++ ++.+|.++|++..+..++ +..+.++++.++++++|+++++.+..+... .++...++.|.+|+..|++ T Consensus 81 ~~i~~-~~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~~~~------~~~~~~~d~i~~A~~~lgd 153 (272) T protein:vir:36 81 VTIKK-AAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTTSQT------VSTKANVDGVQAALDIFND 153 (272) T ss_pred Eeeeh-hhccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccc------ccccccHHHHHHHHHHhhh Confidence 99976 578999999998888777 477899999999999999999988543322 2334457889999999999 Q ss_pred cCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCc-EEEEEeCceEEEE-E Q lcl|Aclame:pro 154 ANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDE-QFVAFHPSAAAYV-S 231 (273) Q Consensus 154 ~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~-~~~~~~~~a~~~~-~ 231 (273) ++.+ .++++++|..+..|+++..+.. .....+.+++++|.||+++|++|++|+++|.+++. ..+++.++|+++. + T Consensus 154 ~~~~--~~~ivv~p~~~~~L~k~~~~~~-~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~p~~~~~~~~~~~~~gA~~~~~~ 230 (272) T protein:vir:36 154 EDAQ--AYVLIVNPKDAAKIRKDANAKN-IGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLK 230 (272) T ss_pred cCCC--ceEEEEcHHHHHHHhccccccc-ccccccccceeeeccceecCeeEEEeCCCCCCceeEEEEEecccceeeeec Confidence 9875 5799999999999998876443 33344567899999999999999999999987764 4578889999976 5 Q ss_pred ecceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 232 QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 232 ~~~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) |..++|.+|++++++|.+++++|||+++++|+++|+++-+|= T Consensus 231 ~~~~vE~~R~~~~~~d~i~~~~~y~~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 231 RGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 272 (272) T ss_pred CCcccccccchhhcCcEEEEEEEEEEEEEcCccEEEEeecCC Confidence 556899999999999999999999999999999999988888 No 18 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=100.00 E-value=5.9e-49 Score=284.97 Aligned_cols=264 Identities=15% Similarity=0.181 Sum_probs=212.7 Q ss_pred Ccc--------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccce Q lcl|Aclame:pro 1 MAF--------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTG 72 (273) Q Consensus 1 MA~--------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~ 72 (273) ||. .+|.||+|+++++..|++.+++..++++.. ...|||||||.++.+++.||...+ +++++++++++ T Consensus 1 ~~~~n~ts~~qafi~~EiWsa~il~~l~~~Lv~~~~~~~~d---~g~GDtV~InsIg~~tV~dY~~~~-~i~~d~ltt~~ 76 (322) T protein:vir:31 1 MSTGNNTSNTQALIVSEIWADEIEDILHEKLLDVNIARVVD---FPDGDKLTIPSVGTPVVRSRPEQG-DFTFDNLDTGE 76 (322) T ss_pred CCCCCCcccceEEeehhhhHHHHHHHhhhhhhhhhhhcccc---cCCCCeEEeccccccccccccCCC-CcccccCCCce Confidence 884 257799999999999999999988876533 246999999999999999998755 57899999999 Q ss_pred EEEEEEeeeeceeEechHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcc---ccc------------ccccCC Q lcl|Aclame:pro 73 VDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADMLVDNGT---ALT------------GSAPSD 136 (273) Q Consensus 73 ~~~tid~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~iD~~~~~~~~~~~~---~~~------------~~~~~~ 136 (273) .+++|||.||++|.|+| |+.+..+++ ....++++++|+..+|+.+..+++..+. +.. .+++.+ T Consensus 77 ~~l~IDq~KYfaf~VdD-D~~Qa~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~~p~vin~~~~~iv~~gt~ 155 (322) T protein:vir:31 77 ISIILRDEVYAGNAISK-KLRQDSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQNDPNVINGVPHRFVGTGTD 155 (322) T ss_pred EEEEEehhhhhccccch-hHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCcceecCCccceeccCCC Confidence 99999999999999999 999999997 5678999999999999999887765331 111 123445 Q ss_pred HhHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHH---------hcchHHhhhhhcccccceeeeeeeeeecceEEEE Q lcl|Aclame:pro 137 ADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWL---------RSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVE 207 (273) Q Consensus 137 ~~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L---------~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~ 207 (273) +...|+.|++++.+|++++||.+|||+||+|.++..| +++++ |.....+|..+.++ .||+++||+|++ T Consensus 156 ~~~ay~~lv~l~~kLdkanVP~~gR~vVV~P~~~~~L~~i~~~~~l~~D~r-f~~i~~sG~a~g~~--~Vg~~~GF~V~~ 232 (322) T protein:vir:31 156 QTMDVTDFSRVNYVMTQSKMPMGGMIGIIDPSVAHHLETITNISNISNNPR-WEGIVESGIAPDMQ--FVRSVYGIDLFV 232 (322) T ss_pred chhhHHHHHHHHHHhccccCCCCCeEEEeCchhhhhhhhhhhhhhhhcccc-ccccccccchhhHH--HHHHHhceeeee Confidence 6678999999999999999999999999999998765 44443 43344444333222 389999999999 Q ss_pred ecccccCCCcEEEEEeC---------------------ceEEEEEecceeeeccCCCcceeeEEeeeeeeeEEEcCceEE Q lcl|Aclame:pro 208 SNNLRDTDDEQFVAFHP---------------------SAAAYVSQIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVV 266 (273) Q Consensus 208 s~~l~~~~~~~~~~~~~---------------------~a~~~~~~~~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v 266 (273) ||+++.++ +++++++. ..++..+|+.+.|.+|++++++|.++++++||++++|||.++ T Consensus 233 SN~l~~~~-~~i~aG~d~~~t~ag~~n~f~~~~~~~~~~~~~~~~~l~~~e~~r~~~~~~d~~~~~~~~g~g~~r~e~l~ 311 (322) T protein:vir:31 233 SNLLADAN-ETINAGGDARSTTAGKCNMFMNVSDMGLLPFVVAWKEMPTTKSFIDDYNDDLNTATTARWGNGLVRDENLV 311 (322) T ss_pred eccccccc-cccccCcccccccceeecccccccchhhhhhhhHhhhhhhhhcccCccccccceeeeeeecceeecccceE Confidence 99997533 33333333 333445578889999999999999999999999999999999 Q ss_pred EEecCCC Q lcl|Aclame:pro 267 VFNKTGS 273 (273) Q Consensus 267 ~~~~~~s 273 (273) .+.++.- T Consensus 312 ~~~a~~~ 318 (322) T protein:vir:31 312 CVLANAD 318 (322) T ss_pred EEEeccc Confidence 9988877 No 19 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=100.00 E-value=3.1e-47 Score=275.59 Aligned_cols=261 Identities=19% Similarity=0.203 Sum_probs=224.1 Q ss_pred Ccc-----cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEEE Q lcl|Aclame:pro 1 MAF-----NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDL 75 (273) Q Consensus 1 MA~-----~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 75 (273) ||+ ++++||+|+.++++++.+.++|.+++..+.++++.+|++|+||+|..++..+...+|..+++++++.++.++ T Consensus 3 ~~~~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~~~ 82 (275) T protein:vir:96 3 LENMTKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAKVVPEGEEIPIDLIETKKRQA 82 (275) T ss_pred CcccchhhhhhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeeccCCccccccCCCCcchhhcccceeeE Confidence 555 477899999999999999999999998888888889999999999998777766778889999999999999 Q ss_pred EEEeeeeceeEechHHHHHhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 76 LIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTKA 154 (273) Q Consensus 76 tid~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~~~ 154 (273) +|.+ ++++|.++|++..+..++ +...++|++.++++++|+++++.+..+..... +....++.|.+|...|.++ T Consensus 83 ~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~~~~~-----~~~~~~d~i~dA~~~lgd~ 156 (275) T protein:vir:96 83 TIRK-IGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGATLKVE-----ADITKLAGLQTAIDKFNDE 156 (275) T ss_pred Eeeh-hcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccc-----ccccCHHHHHHHHHHhccc Confidence 9955 699999999998888776 57899999999999999999999865433221 1223478899999999887 Q ss_pred CCCcCCcEEEECHHHHHHHhcch-HHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCceEEEEEe- Q lcl|Aclame:pro 155 NVPNVGRVVVVNAEMAFWLRSSG-SKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQ- 232 (273) Q Consensus 155 ~vp~~~r~lvv~p~~~~~L~~~~-~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a~~~~~~- 232 (273) +. .+|+++|+|+++..|+++. ..|...+..+ ++.+++|.||+++|++|++|+++|.+ ++++++++|+++..+ T Consensus 157 ~~--~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g-~~~~~~G~ig~~~G~~Vi~s~~~p~~---t~~i~~~gA~~~~~~~ 230 (275) T protein:vir:96 157 DL--EPMVLFVNPLDAGKLRASATDNFTRATLLG-DNVIVKGAFGEALGAIIVRSNKIKEG---EAILAKRGAVKLITKR 230 (275) T ss_pred cC--CccEEEeCHHHHHHHHhccccccccccccc-ccceeccccceecCeeEEEeCCCCcc---eEEEEeccceeeeecC Confidence 64 6789999999999998874 2455555554 46889999999999999999999865 467889999998754 Q ss_pred cceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 233 IDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 233 ~~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ...+|.+|++++++|.+++++|||+++++|+++|+++++.| T Consensus 231 ~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~ 271 (275) T protein:vir:96 231 DFFLETERHASHKSTALFSDKHYVAYLYDESKVVKITKSAS 271 (275) T ss_pred CcccccccchhhcCcEEEEeEEEEEEEEcCccEEEEEeccc Confidence 45899999999999999999999999999999999999999 No 20 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=100.00 E-value=9.7e-47 Score=272.84 Aligned_cols=261 Identities=18% Similarity=0.202 Sum_probs=221.4 Q ss_pred Cccc------chhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEE Q lcl|Aclame:pro 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~------~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 74 (273) |||+ .++||+|++++++++++.+++.+++.+++++++.+|++|+||+|..++..+...+|..+++++++.++.+ T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~eg~~i~~~~it~~~~~ 80 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCccceehhheechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCcccccCCCcccccccccceeE Confidence 9995 5889999999999999999999999999998888999999999998765555566778999999999999 Q ss_pred EEEEeeeeceeEechHHHHHhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHHHHhh Q lcl|Aclame:pro 75 LLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTK 153 (273) Q Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~~ 153 (273) +++++ .++.|.++|++..+...+ ++...+++++++++++|+++++.+..+..... +....++.|.+|+..|++ T Consensus 81 ~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~~~~-----~~~~~~d~i~dA~~~l~d 154 (274) T protein:vir:93 81 AKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVN-----ADITKLNGLQSAIDKFND 154 (274) T ss_pred EEeee-ecccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccc-----ccccCHHHHHHHHHHhhh Confidence 99977 578999999998888776 47789999999999999999999865543322 122347889999999998 Q ss_pred cCCCcCCcEEEECHHHHHHHhcch-HHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCceEEEEEe Q lcl|Aclame:pro 154 ANVPNVGRVVVVNAEMAFWLRSSG-SKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQ 232 (273) Q Consensus 154 ~~vp~~~r~lvv~p~~~~~L~~~~-~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a~~~~~~ 232 (273) ++. .+|+++|+|++++.|+++. ..|.+....+ ++.+++|.||+++||+|++|+++|.. ++++++++|+++..+ T Consensus 155 ~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g-~~~~~~G~ig~~~G~~Vi~s~~~p~~---t~~l~~~gai~~~~~ 228 (274) T protein:vir:93 155 EDL--EPMVLFINPLDAGKLRGDASTNFTRATELG-DDIIVKGAFGEALGAIIVRTNKLEAG---TAILAKKGAVKLILK 228 (274) T ss_pred ccC--CccEEEeCHHHHHHHHhhhhhccccccccc-ccceeecccceecCeeEEEcCCCCcc---eEEEEeCCeEEEEec Confidence 875 6789999999999999874 2233444443 46789999999999999999999864 478999999999754 Q ss_pred -cceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEe-cCCC Q lcl|Aclame:pro 233 -IDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFN-KTGS 273 (273) Q Consensus 233 -~~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~-~~~s 273 (273) ...+|.+|++++++|.+++++|||+++++|+++|+++ +.+| T Consensus 229 ~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~t~~~~s 271 (274) T protein:vir:93 229 RDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) T ss_pred CCcccccccchhhcccEEEEEEEEEEEEEcCCceEEEeeCccc Confidence 4589999999999999999999999999999999885 5556 No 21 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=100.00 E-value=3.9e-47 Score=275.03 Aligned_cols=268 Identities=18% Similarity=0.178 Sum_probs=205.1 Q ss_pred CcccchhHHHHHHHHHHHHHHhhccchhhhccccccc--cCCcEEEEEecccccccccc----CCCCccCCcccccceEE Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIA--SKGNVVHIAGVVAPTVKDYK----AAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~--~~Gdtv~ip~~~~~~~~d~~----~~~~~~~~~~~~~~~~~ 74 (273) |||++|+||+|++++++.|+++++|+++++|+|+.++ .+||||+||+++.++..+|. +++.+++++++.++.++ T Consensus 1 Ma~~~~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (392) T protein:vir:99 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTEDSFP 80 (392) T ss_pred CccccccHHHHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeecccccceeeeccccccCCcccccccccceEE Confidence 9999999999999999999999999999999997764 57999999999999998885 34567889999999999 Q ss_pred EEEEeeeeceeEechHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhccccc-ccccCCHhHHHHHHHHHHHHHh Q lcl|Aclame:pro 75 LLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADMLVDNGTALT-GSAPSDADDAFDLIASALKELT 152 (273) Q Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~-~~~~~~~~~~~~~i~~a~~~l~ 152 (273) ++||++++++|.|+|+|+.+..+++ ++++++++++|++++|.++++++..++.... .....++.+.++.|.++++.|+ T Consensus 81 ~~id~~k~~~~~i~d~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~~~a~~~~~~~~~~~~~~~~~~~i~~a~~~L~ 160 (392) T protein:vir:99 81 VTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVNGARRALN 160 (392) T ss_pred EEEeeeeecceeechHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccChhhhHHHHHHHHHHHh Confidence 9999999999999999999998886 6789999999999999999999876654433 2344667788999999999999 Q ss_pred hcCCCcCCcEEEECHHHHHHHhcchHHhhhhhccccc--ceeeeeeeeeecceEEEEecccccCCCcEEEEEeCceEEEE Q lcl|Aclame:pro 153 KANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDA--AGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYV 230 (273) Q Consensus 153 ~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~--~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a~~~~ 230 (273) +++||. ||+++++|+++..|++++. |.+.+..++. ..+++|.||+++||+|++|+++|... .+++|++++.++ T Consensus 161 ~~~vP~-~R~~vv~p~~~~~l~~~~~-~~~~~~~g~~~~~~l~~G~vg~i~G~~v~~s~~~~~~t---~~a~~~~a~~~a 235 (392) T protein:vir:99 161 ELYIPQ-GRVLVVGTAVTEQILNDDR-FIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGD---AYLYHPTAFIMA 235 (392) T ss_pred hcCCCC-CCEEEEcHHHHHHHhcccc-eeecccccchhhhhhhcceeeeeeeeEEEeeccccccc---ceeeeccccccc Confidence 999996 8999999999999999876 4555555544 46899999999999999999998765 467788877766 Q ss_pred Eecceeeec--------------------cCCCcceeeEEeeeeeeeEEEcCceEE-EEe--------cCCC Q lcl|Aclame:pro 231 SQIDTVEAL--------------------RDQDSFSDRIRALHVYGGKVVRPTGVV-VFN--------KTGS 273 (273) Q Consensus 231 ~~~~~ve~~--------------------~~~~~~~~~v~~~~~~g~~vl~p~~~v-~~~--------~~~s 273 (273) ++...+... .+....++.......+|.+.+....-. ... .+.+ T Consensus 236 t~a~v~~~~~~~~~s~s~~~~v~~~~~~~~~~t~~s~~~~v~~~~g~~~v~~~~~~~~~~~~~~~~~~~~v~ 307 (392) T protein:vir:99 236 TRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIE 307 (392) T ss_pred cccccccccccceeEEecccceecceeecccceeeccccccceeEEEEEEeeccccceeeeeeeeeecceee Confidence 543211000 111111222222333444333211100 000 0000 No 22 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=100.00 E-value=1.1e-47 Score=278.13 Aligned_cols=267 Identities=18% Similarity=0.131 Sum_probs=217.5 Q ss_pred Cccc--------------------chhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCC Q lcl|Aclame:pro 1 MAFN--------------------NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAG 60 (273) Q Consensus 1 MA~~--------------------~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~ 60 (273) |||. .+.=|+|++++++.|++.++|.+++... ....|++++||++|..++..+.+ | T Consensus 1 ~a~~~~~~~~~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~r---~i~~G~sv~~~~iG~~~~~~~~~-g 76 (347) T protein:vir:88 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVR---TIQNGKSASFPVMGRTKGYYLAP-G 76 (347) T ss_pred CCCcccchhhhccCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhhccccc---cccCcceEEEeeecceeeeeecc-c Confidence 6651 2234999999999999999999988642 24569999999999998865444 4 Q ss_pred CccCC--cccccceEEEEEEeeeeceeEechHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcccc-------- Q lcl|Aclame:pro 61 RQTSA--DAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADMLVDNGTAL-------- 129 (273) Q Consensus 61 ~~~~~--~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~iD~~~~~~~~~~~~~~-------- 129 (273) .+++. .++.+++++++||+.+++.+.|+|.|+.+.++|+ .++.++++++|++++|+.++..+...+... T Consensus 77 ~~l~~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~~~~~~~ 156 (347) T protein:vir:88 77 ENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIA 156 (347) T ss_pred cCCCCCCCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccC Confidence 44433 5788999999999999999999999999999996 568999999999999999998775432110 Q ss_pred --------cccc-------cCCHhHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeee Q lcl|Aclame:pro 130 --------TGSA-------PSDADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRA 194 (273) Q Consensus 130 --------~~~~-------~~~~~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~ 194 (273) ..++ ...+..+++.|.++++.|++++||.++|+++|+|++|+.|+++.. +.+.+.. +...+++ T Consensus 157 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~-~~~~~~~-~~~~~~~ 234 (347) T protein:vir:88 157 GLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALM-PNAANYA-ALIDPET 234 (347) T ss_pred CccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCHHHHHHHhcchh-hhhhhhc-cccchhc Confidence 0000 112334688999999999999999999999999999999998764 5555554 4457889 Q ss_pred eeeeeecceEEEEecccccCCC--------------------------------cEEEEEeCceEEEEEecc-eeeeccC Q lcl|Aclame:pro 195 GTIGNLLGARIVESNNLRDTDD--------------------------------EQFVAFHPSAAAYVSQID-TVEALRD 241 (273) Q Consensus 195 G~ig~i~G~~i~~s~~l~~~~~--------------------------------~~~~~~~~~a~~~~~~~~-~ve~~~~ 241 (273) |.|++++||+|++||++|.+.. ...+.+|++|++.+++++ ++|.+|+ T Consensus 235 G~vg~i~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~d~~~e~~r~ 314 (347) T protein:vir:88 235 GNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARR 314 (347) T ss_pred ceeeeeccceEEEeecccccccccccccccccccccccccccccccccccccCcEEEEEechhhhhheecccceeeeeec Confidence 9999999999999999984211 123678999999998776 8999999 Q ss_pred CCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 242 QDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 242 ~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) +++|+|+|.+++.||++++|||++++|+-+.+ T Consensus 315 ~~~~~d~i~~~~~~G~~~~rPe~a~~~~~~~a 346 (347) T protein:vir:88 315 PEFQADQIIGKYAMGHGGLRPEAAGALVFTPA 346 (347) T ss_pred hhhHHHHhhhhhhhcCceeccceEEEEEeCCC Confidence 99999999999999999999999999988777 No 23 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=100.00 E-value=2.1e-47 Score=276.54 Aligned_cols=267 Identities=18% Similarity=0.165 Sum_probs=217.8 Q ss_pred Ccc-------------------c--chhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCC Q lcl|Aclame:pro 1 MAF-------------------N--NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAA 59 (273) Q Consensus 1 MA~-------------------~--~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~ 59 (273) ||+ + .+.=|+|.+++++.|.+.++|.+++.. . ....|++++||++|...+..+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~-r--~i~~gks~~~~~iG~~~~~~~~~- 76 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVVAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMV-R--SISSGKSAQFPVLGRTQAAYLAP- 76 (345) T ss_pred CcccccchhcccccccccccCCchhHHHHHHHhHHHHHHHHHHhhhccccee-e--eccccceEEEeeecceEEEeeec- Confidence 332 1 233599999999999999999999864 2 34459999999999999876554 Q ss_pred CCccCCc--ccccceEEEEEEeeeeceeEechHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhccccc------ Q lcl|Aclame:pro 60 GRQTSAD--AISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADMLVDNGTALT------ 130 (273) Q Consensus 60 ~~~~~~~--~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~------ 130 (273) |++++.+ ++..++.+++||+.+++.+.|+|.|+.+.++++ .+++++++++||+.+|+.++..+...+.... T Consensus 77 G~~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~~~~ 156 (345) T protein:vir:22 77 GENLDDKRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESKYNENI 156 (345) T ss_pred CCCCCCCCCCcccceEEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Confidence 5555443 577888999999999999999999999999996 5689999999999999999987754321100 Q ss_pred --------------c----cccCCHhHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhccccccee Q lcl|Aclame:pro 131 --------------G----SAPSDADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGL 192 (273) Q Consensus 131 --------------~----~~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~ 192 (273) + ....++..+++.|.++++.|++++||.++||++|+|++|+.|++++. +.+.++. +.+.. T Consensus 157 ~~~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~~-~~~~~~~-~~~~~ 234 (345) T protein:vir:22 157 EGLGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAADRVFYCDPDSYSAILAALM-PNAANYA-ALIDP 234 (345) T ss_pred cccccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCccCCEEEeChHHHHHHhcccc-ccccccc-ccccc Confidence 0 11123567899999999999999999999999999999999998775 5555554 45667 Q ss_pred eeeeeeeecceEEEEecccccCC-----------------------------CcEEEEEeCceEEEEEecc-eeeeccCC Q lcl|Aclame:pro 193 RAGTIGNLLGARIVESNNLRDTD-----------------------------DEQFVAFHPSAAAYVSQID-TVEALRDQ 242 (273) Q Consensus 193 ~~G~ig~i~G~~i~~s~~l~~~~-----------------------------~~~~~~~~~~a~~~~~~~~-~ve~~~~~ 242 (273) ++|.|++++||+||+||++|.+. ....+.+|++|++.+++++ ++|.+|++ T Consensus 235 ~~G~V~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~~ 314 (345) T protein:vir:22 235 EKGSIRNVMGFEVVEVPHLTAGGAGTAREGTTGQKHVFPANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERARRA 314 (345) T ss_pred ccceEEEEeceEEEecccccccccCccccCcccccccccccccceeeeeccCceEEEEEehhheeeeeeecceeeeeech Confidence 89999999999999999997421 1244788999999999886 89999999 Q ss_pred CcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 243 DSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 243 ~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ++|+|+|.+++.||++++|||++++|+-.-- T Consensus 315 ~~~~d~I~~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 315 NFQADQIIAKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred hHHHHHHHHHHhcCCcccccceeEEEEEeeC Confidence 9999999999999999999999998865555 No 24 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=100.00 E-value=3.9e-46 Score=269.55 Aligned_cols=265 Identities=20% Similarity=0.178 Sum_probs=213.9 Q ss_pred Ccc---cchhHHHHHHHHHHHHHHhhccchhhhcccccc-ccCCcEEEEEeccccccccccCCCCccCCcccccceEEEE Q lcl|Aclame:pro 1 MAF---NNFIPELWSDMLLEEWTAQTVFANLVNREYEGI-ASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLL 76 (273) Q Consensus 1 MA~---~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~-~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~t 76 (273) ||. ++++||+|++++++.|++++++.++++|+|+.+ .+.||||+||+++.+...++. .+.+++++++.++++ T Consensus 1 m~~~~N~~ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp~~~~v~dg~----~~~~~~~te~~v~l~ 76 (418) T protein:vir:10 1 MAVQDNNLLTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKLPYRVKSASGR----TLVKQPMVDQTIPFK 76 (418) T ss_pred CCccccccccHHHHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEeeCCceeecccC----CccccccccceEEEE Confidence 994 677899999999999999999999999999887 556999999999998887643 467889999999999 Q ss_pred EEeeeeceeEechHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHHHHhhcC Q lcl|Aclame:pro 77 IDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTKAN 155 (273) Q Consensus 77 id~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~~~~ 155 (273) ||++++++|.|+|+|+.+...++ ++++++++++||+++|+++++++..++.... +..+..+.+++|.++++.|++++ T Consensus 77 id~~k~~~~~itD~e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l~~~a~~~~g--t~gt~~~~~~~i~~a~~~Ld~~~ 154 (418) T protein:vir:10 77 IAYQEHVGLEYTVKDKTLDIMQFSERYLKSGMVQIANQIDRSLALTLKKAFHSSG--TPGVRPGAFIDFANAGAKQTTYA 154 (418) T ss_pred EecccccceeechHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--cCCcCcchHHHHHHHHHHHHhcC Confidence 99999999999999999988886 6789999999999999999999877655432 23334456999999999999999 Q ss_pred CCcCC-cEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCC------------------ Q lcl|Aclame:pro 156 VPNVG-RVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD------------------ 216 (273) Q Consensus 156 vp~~~-r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~------------------ 216 (273) ||.+| |++|++|++|..|+++..++ .+..+..+.+++|.||+++||+||+||++|..+. T Consensus 155 VP~~G~R~lVv~P~~~~~L~~~~~~~--~~~~~~~~~lr~G~IG~i~GF~V~~S~nip~~tag~~~~t~~v~ga~~~~~~ 232 (418) T protein:vir:10 155 VPQDGMRHAVLDPFTCASLSDEVTKL--FKESMVEQAYKMGYRGNVAAYEVYESQNLPKHTVGDHGGTPLVNGTVVNGDT 232 (418) T ss_pred CCCCCceEEEeCHHHHHHHhhhcccc--ccccccchhhheeeeeeeeceEEEEecCCCcccccccccceeeeccccccee Confidence 99985 99999999999998876544 3455666789999999999999999999873110 Q ss_pred -----------------c--------------------------------------E----------------------- Q lcl|Aclame:pro 217 -----------------E--------------------------------------Q----------------------- 218 (273) Q Consensus 217 -----------------~--------------------------------------~----------------------- 218 (273) . + T Consensus 233 ~~~~~~t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~~~~~~~f~V~~~~~~~~~~~~tv~i~p~~~~~~~~~~~~~~~~~~ 312 (418) T protein:vir:10 233 VGFDGGTASTTGFLKAGDVITFGGVFGVNPQNYETTGLLQEFVVLEDVDTDAGGAGSIKISPSLNDGTATINNENGDPVS 312 (418) T ss_pred EEEeecceeeccceeeccEEEECceeecccccccccccceEEEEEeeccccccCcceeEecccccccccccccccccccc Confidence 0 0 Q ss_pred -----------------------------EEEEeCceEEEEEeccee---------------------eeccCCCcceee Q lcl|Aclame:pro 219 -----------------------------FVAFHPSAAAYVSQIDTV---------------------EALRDQDSFSDR 248 (273) Q Consensus 219 -----------------------------~~~~~~~a~~~~~~~~~v---------------------e~~~~~~~~~~~ 248 (273) -++||++|++++.+-..+ -.++|.+...+. T Consensus 313 ~~~~~~v~a~~a~~~~it~~~~a~~~~~~nl~f~~~a~~l~~~~l~~p~g~~~~~~~~~~~~G~s~r~~~~~d~~~~~~~ 392 (418) T protein:vir:10 313 LTAYQNVTALPADNAPITVLGAANTTYEQNYLFHRDAIALAMIDLELPQSAVIKSRAADPETGLSLTLTGAYDINEQSEI 392 (418) T ss_pred ccCCCcccccccCcceeeeecccccceeeeeeeecceEEEEEeeccCCCCCCcceEEEeccCCeEEEEEEcccccccceE Confidence 034566666666543321 112444455577 Q ss_pred EEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 249 IRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 249 v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ++-+..||++.+|||..+.|.-.+| T Consensus 393 ~r~d~l~g~~~~~p~~~~~~~g~~~ 417 (418) T protein:vir:10 393 HRIDAVWGADMIYGELALRLWGAAS 417 (418) T ss_pred EEEEeecCceeecccceEEEEeecC Confidence 8889999999999997776655555 No 25 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=100.00 E-value=1.9e-46 Score=271.31 Aligned_cols=268 Identities=18% Similarity=0.175 Sum_probs=219.7 Q ss_pred Ccc---------------------c--chhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEecccccccccc Q lcl|Aclame:pro 1 MAF---------------------N--NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYK 57 (273) Q Consensus 1 MA~---------------------~--~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~ 57 (273) ||+ + .+.=|+|.+++++.|++.+++.+++... ....|++++||++|...+.+|. T Consensus 1 ~~~~~~~~~~~~n~~t~~~~~~~~~~~al~le~f~geV~~~f~~~si~~~~~~~r---ti~~Gksv~f~~iG~~t~~~~t 77 (375) T protein:vir:10 1 MANANQVALGRSNLSTGTGYGGATDKYALYLKLFSGEMFKGFQHETIARDLVTKR---TLKNGKSLQFIYTGRMTSSFHT 77 (375) T ss_pred CccccccccCccccCCccccccccchHHHHHHHHhHHHHHHHHHHHhhhcccccc---ccccCceEEEEeeeeeEEeeec Confidence 222 1 2345999999999999999999988642 3445999999999999998877 Q ss_pred CCCCccCCc---ccccceEEEEEEeeeeceeEechHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcccc---- Q lcl|Aclame:pro 58 AAGRQTSAD---AISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADMLVDNGTAL---- 129 (273) Q Consensus 58 ~~~~~~~~~---~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~iD~~~~~~~~~~~~~~---- 129 (273) + |.++..+ +...++++++||+.+++.+.|+|.|+.+.++++ .++.++++++|++.+|+.++..+..++... T Consensus 78 ~-G~~i~~~~~~d~~~te~~l~ID~~~y~~~~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~~p~~ 156 (375) T protein:vir:10 78 P-GTPILGNADKAPPVAEKTIVMDDLLISSAFVYDLDETLAHYELRGEISKKIGYALAEKYDRLIFRSITRGARSASPVS 156 (375) T ss_pred C-CcCcCCccccCCCCCceEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccc Confidence 6 4444444 566788899999999999999999999999986 568999999999999999999886543111 Q ss_pred ------------------cccccCCHhHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcc--hHHhhhhhccccc Q lcl|Aclame:pro 130 ------------------TGSAPSDADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSS--GSKLTSADTSGDA 189 (273) Q Consensus 130 ------------------~~~~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~--~~~~~~~~~~~~~ 189 (273) ......++.++++.|.++++.|++++||.++||++|+|++|+.|+++ .+++.+.+.. +. T Consensus 157 ~~~~~~~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~d~~~~~n~d~~-~~ 235 (375) T protein:vir:10 157 ATNFVEPGGTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEKGVSSQGRCAVLNPRQYYALIQDIGSNGLVNRDVQ-GS 235 (375) T ss_pred cccccccCcceeeeccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeChHHHHHHHhcCCccceeeeccc-cc Confidence 11223467889999999999999999999999999999999999865 3567777764 45 Q ss_pred ceeeeeeeeeecceEEEEecccccCCC-----------------------------------------------cEEEEE Q lcl|Aclame:pro 190 AGLRAGTIGNLLGARIVESNNLRDTDD-----------------------------------------------EQFVAF 222 (273) Q Consensus 190 ~~~~~G~ig~i~G~~i~~s~~l~~~~~-----------------------------------------------~~~~~~ 222 (273) +...+|.+++++||+||+||++|..+. ...+++ T Consensus 236 ~~~~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~~~~~~~~ 315 (375) T protein:vir:10 236 ALQSGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAKSCGLIF 315 (375) T ss_pred ceeccceEEEEeceEEEEeccccccccccccccccccccchhhhhccccccCCcceeeccccccccccccccCceEEEEE Confidence 677789999999999999999995321 235788 Q ss_pred eCceEEEEEecc-eee---eccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 223 HPSAAAYVSQID-TVE---ALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 223 ~~~a~~~~~~~~-~ve---~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) |++|.+.++.+. .+| ..++..+|+|+|.+++.|||++||||++|+|+..++ T Consensus 316 ~~~A~g~v~~~~~~~~~~~~~~~~~~q~~~i~~~~a~G~~~lrp~~av~l~~~~~ 370 (375) T protein:vir:10 316 QKEAAGVVEAIGPQVQVTNGDVSVIYQGDVILGRMAMGADYLNPAAAVELYIGAT 370 (375) T ss_pred chhheeeeeeeccccccccchhhheeeeeeeeeeeeeccCccCceeEEEEecCcC Confidence 999999887654 455 447999999999999999999999999999999988 No 26 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=100.00 E-value=9.3e-47 Score=272.95 Aligned_cols=267 Identities=18% Similarity=0.155 Sum_probs=215.8 Q ss_pred Cccc--------------------chhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCC Q lcl|Aclame:pro 1 MAFN--------------------NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAG 60 (273) Q Consensus 1 MA~~--------------------~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~ 60 (273) |||. .+.=|+|++++++.|.+.++|.+++.+. ....|++++||++|...+..+. +| T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~r---ti~~G~sv~~~~iG~~~~~~~~-~G 76 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGDKLALFLKVFGGEVLTAFTRTSVTMNKHLVR---SIQSGKSAQFPVLGRTKAAYLQ-PG 76 (347) T ss_pred CCccccccccccccccCCcccchHHHHHHHHhHHHHHHHHHHHhhhhhhhhe---eccccceEEeeeccceeEeeee-cC Confidence 6641 1334999999999999999999998642 2456999999999999986555 45 Q ss_pred CccC--CcccccceEEEEEEeeeeceeEechHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcccc-------- Q lcl|Aclame:pro 61 RQTS--ADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADMLVDNGTAL-------- 129 (273) Q Consensus 61 ~~~~--~~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~iD~~~~~~~~~~~~~~-------- 129 (273) .++. .+++.+++++++||+.+++.+.|+|.|+.++++++ .++.++++++|++++|+.++..+...+... T Consensus 77 ~~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~~~~~~ 156 (347) T protein:vir:94 77 ENLDDKRKDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTANNENIA 156 (347) T ss_pred cCCCCCcCCccccceEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Confidence 5443 35789999999999999999999999999999986 568999999999999999987765422110 Q ss_pred ----------------cccccCCHhHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceee Q lcl|Aclame:pro 130 ----------------TGSAPSDADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLR 193 (273) Q Consensus 130 ----------------~~~~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~ 193 (273) ..+...++..+++.|.+++..|++++||.++||+|++|++|..|++... ....+. +..+.++ T Consensus 157 g~~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~R~~vv~P~~y~~LLk~~~-~~~~~~-~~~~~~~ 234 (347) T protein:vir:94 157 GLGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSDRVFYTTPDNYSAILAALM-PNAANY-QALIDPS 234 (347) T ss_pred cCCcceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCCCCCEEEeChHHHHHHHHhhc-cccccc-ccccccc Confidence 0011134567799999999999999999999999999999999997532 333333 3446788 Q ss_pred eeeeeeecceEEEEecccccCC--------------------------------CcEEEEEeCceEEEEEecc-eeeecc Q lcl|Aclame:pro 194 AGTIGNLLGARIVESNNLRDTD--------------------------------DEQFVAFHPSAAAYVSQID-TVEALR 240 (273) Q Consensus 194 ~G~ig~i~G~~i~~s~~l~~~~--------------------------------~~~~~~~~~~a~~~~~~~~-~ve~~~ 240 (273) +|.|++++||+||+||++|... ....+++|++|.+.++.++ .+|.+| T Consensus 235 ~G~V~~v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~~~~~e~~~ 314 (347) T protein:vir:94 235 TGSIRNVMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLKDMALERAR 314 (347) T ss_pred cceeEEeeceEEEEcCccccccCcccccccccccccccccccccccccccccccceEEEEechhhhhhhhhcccceeeee Confidence 9999999999999999998532 0134889999999887766 799999 Q ss_pred CCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 241 DQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 241 ~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ++++|+|+|.+.+.||++++|||++++|..+.. T Consensus 315 ~~~~~~~~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 315 RANFQADQIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred chhhhhhhhhhhhhhcCcccccceeEEEEecCC Confidence 999999999999999999999999997766666 No 27 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=100.00 E-value=3.2e-45 Score=264.51 Aligned_cols=270 Identities=16% Similarity=0.106 Sum_probs=214.6 Q ss_pred Ccccch--hHHHHHHHHHHHHHHhhccchhhhccccccc---cCCcEEEEEeccccccccccCC-CCccCCcccccceEE Q lcl|Aclame:pro 1 MAFNNF--IPELWSDMLLEEWTAQTVFANLVNREYEGIA---SKGNVVHIAGVVAPTVKDYKAA-GRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~~~--~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~---~~Gdtv~ip~~~~~~~~d~~~~-~~~~~~~~~~~~~~~ 74 (273) |||+++ +||+|++++++.|+++++++++++|+|+.++ +.||||+||+++.+...+|... +..++++++.+..++ T Consensus 1 MaN~llT~~p~iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~~~~~~d~~~~~~~~~~~~dl~e~~v~ 80 (423) T protein:vir:10 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLISGKAT 80 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCCceeeeccCCccccccccCccccceeE Confidence 999976 4999999999999999999999999998774 4799999999999999999853 345678999999999 Q ss_pred EEEEeeeeceeEechHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 75 LLIDQEKSIDFLVDDIDRVQVAGSLEAYTRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTKA 154 (273) Q Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~~~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~~~ 154 (273) ++||++|+++|+++|+|+.+...++++++++++++|++++|+++++++...+....++ +.+..+.++.+.+++..|+++ T Consensus 81 l~id~~k~va~~v~d~E~~~~i~~~~~~l~~A~~aLA~~vd~~ia~~~~~~~~~~~gt-~~t~~~a~~~i~~a~~~Ld~~ 159 (423) T protein:vir:10 81 GRVGNYITVAVEYQQLEEAIKLNQLEEILAPVRQRIVTDLETELAHFMMNNGALSLGS-PNTPITKWSDVAQTASFLKDL 159 (423) T ss_pred EEeeceeeeeeeechHHHhcChhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc-CCcccchHHHHHHHHHHHHhc Confidence 9999999999999999998777778889999999999999999999887765544333 333445689999999999999 Q ss_pred CCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeee-eeecceEEEEecccccCCCcE--------------- Q lcl|Aclame:pro 155 NVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTI-GNLLGARIVESNNLRDTDDEQ--------------- 218 (273) Q Consensus 155 ~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~i-g~i~G~~i~~s~~l~~~~~~~--------------- 218 (273) ++|..+|++|++|+++..|++++.++...+. +..+.+++|.| |+++||+||+||++|..++.. T Consensus 160 ~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~-~~~~alr~g~i~G~i~GFdv~~Snnip~~T~gt~~~t~~~~~~~~v~~ 238 (423) T protein:vir:10 160 GVNEGENYAVMDPWSAQRLADAQTGLHASDQ-LVRTAWENAQIPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQPTVTY 238 (423) T ss_pred cCCcCCCEEEeChHHHHHHhccccceecccc-cchhhhhhccceeeecceEEEEeCCCccccccccccceeeeecceecc Confidence 9999999999999999999988876665443 44578999987 999999999999988421000 Q ss_pred -------------------------------------------------------------------------------- Q lcl|Aclame:pro 219 -------------------------------------------------------------------------------- 218 (273) Q Consensus 219 -------------------------------------------------------------------------------- 218 (273) T Consensus 239 ~a~~~a~~~~~~~~~~~~~~~~~l~~GD~~t~aGv~~v~~~tk~~~~~~~t~~~~~~~v~a~~~~~~~g~~tv~i~p~~i 318 (423) T protein:vir:10 239 NAVKDSYQFTVTLTGATASVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTADANSDSGGDVTVTLSGVPI 318 (423) T ss_pred ccccccceeeeeeeeccccccCceeecceEEecceeeecccccccccccccCcceEEEEEeeeeeccCCceeeeccCccc Confidence Q ss_pred ---------------------------------EEEEeCceEEEEEecce------------------eeeccCCCccee Q lcl|Aclame:pro 219 ---------------------------------FVAFHPSAAAYVSQIDT------------------VEALRDQDSFSD 247 (273) Q Consensus 219 ---------------------------------~~~~~~~a~~~~~~~~~------------------ve~~~~~~~~~~ 247 (273) -+++|++|++++.+-.. +-.+++.+...+ T Consensus 319 ~~~~~~~~~~v~a~~a~~~~vT~~~~a~~t~~~nl~~~~~a~~l~~~pl~~~~~~~~~~~~~~g~s~r~~~~~d~~~~~~ 398 (423) T protein:vir:10 319 YDTTNPQYNSVSRQVEAGDAVSVVGTASQTMKPNLFYNKFFCGLGSIPLPKLHSIDSAVATYEGFSIRVHKYADGDANVQ 398 (423) T ss_pred cccCCcccccccccccCCceeeccccccCCeeEEEEecCcceEEEEEcccCCCccceeeccccCceEEEEEeeeccccce Confidence 01334444444433211 011233333346 Q ss_pred eEEeeeeeeeEEEcCceEEEEecCC Q lcl|Aclame:pro 248 RIRALHVYGGKVVRPTGVVVFNKTG 272 (273) Q Consensus 248 ~v~~~~~~g~~vl~p~~~v~~~~~~ 272 (273) .++-+..||++.+|||-.+.+...- T Consensus 399 ~~r~d~l~g~~~~~p~~~~~~~g~~ 423 (423) T protein:vir:10 399 KMRFDLLPAYVCFNPHMGGQFFGNP 423 (423) T ss_pred EEEEEeecceeeeccceEEEEEecC Confidence 6788888999999999888776555 No 28 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=100.00 E-value=8.9e-45 Score=262.09 Aligned_cols=265 Identities=20% Similarity=0.186 Sum_probs=222.5 Q ss_pred CcccchhHHHHHHHHHHHHHHhhccch-hhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEEEEEEe Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWTAQTVFAN-LVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQ 79 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~~~~v~~~-~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid~ 79 (273) =+|++...|.|++.|.+.+...+...+ ++++++++ ..|++|+||+++..++.||++.++ .++++++.++.+++|++ T Consensus 25 ~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~--~gg~tVkIp~i~~~gl~DY~R~~g-~~~g~vt~~~~t~tidq 101 (319) T protein:vir:97 25 EPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIF--MEGRSFTVMKGDTTELKDYKRNAT-NEFDHPKIEETTYFLDQ 101 (319) T ss_pred CcchHHHHHHHHHHHHHHHHHhhhhhhcccCcceEe--ccCcEEEEeeecccccccccCCCC-cccCCcccceeEEEeec Confidence 345677789999998887777766654 45666654 569999999999999999998765 67889999999999999 Q ss_pred eeeceeEechHHHHHhHHHHH--H-HHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHHHHhhcCC Q lcl|Aclame:pro 80 EKSIDFLVDDIDRVQVAGSLE--A-YTRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTKANV 156 (273) Q Consensus 80 ~~~~~~~i~d~d~~~~~~~~~--~-~~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~~~~v 156 (273) +++++|.|++.|..++.+.+. . ..+++...+++++|.+++++++..+... .+...+++++|+.|.++++.|++++| T Consensus 102 dR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~~-~~~~~t~~n~y~~i~~a~~~Lde~~V 180 (319) T protein:vir:97 102 EKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKH-LTVGTGSDAQYDAVLDVSVELDEIKA 180 (319) T ss_pred ccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcccc-cccccCHHHHHHHHHHHHHHHHhcCC Confidence 999999999999999887763 3 3566788999999999999998765543 34557889999999999999999999 Q ss_pred CcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCceEEEEEeccee Q lcl|Aclame:pro 157 PNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQIDTV 236 (273) Q Consensus 157 p~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a~~~~~~~~~v 236 (273) | ++|+++|+|+++..|++++.+..+.+. ..+.+++|.||+++||+|+++++... .+..++++|++|+.++.|.+.+ T Consensus 181 P-~~Rvl~Vtp~~~~~L~~~~~f~~~~~~--~~~~~~~g~Vg~idG~~Vi~vps~~~-k~in~i~~h~~A~~~~~k~~~~ 256 (319) T protein:vir:97 181 P-ENRVLFVSPTFYKGIKKFVIALPQGDT--RQQVLGKGVQGELDGFVIVKVPTKLL-QGLQAIAVVGEVLASPIQADLA 256 (319) T ss_pred C-CCcEEEeCHHHHHHHHhhhhhhccccc--cccceeeeeceeecCeEEEEeccccc-ccceEEEEcCCeeeeeeeeeee Confidence 9 699999999999999999876654433 34678899999999999999865443 3456899999999999999999 Q ss_pred eeccC-CCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 237 EALRD-QDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 237 e~~~~-~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) |.+++ +++++|.|+++++||++|++|++..++.+..+ T Consensus 257 ~~~~p~~~~~a~~v~gr~y~d~~V~~~k~~~Iy~~~~~ 294 (319) T protein:vir:97 257 KTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGT 294 (319) T ss_pred eccCCCccccceeeeeeeeeeeEEeccccceEEEeecC Confidence 98875 77889999999999999999997777765555 No 29 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=100.00 E-value=8.9e-45 Score=262.09 Aligned_cols=265 Identities=20% Similarity=0.186 Sum_probs=222.5 Q ss_pred CcccchhHHHHHHHHHHHHHHhhccch-hhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEEEEEEe Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWTAQTVFAN-LVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQ 79 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~~~~v~~~-~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid~ 79 (273) =+|++...|.|++.|.+.+...+...+ ++++++++ ..|++|+||+++..++.||++.++ .++++++.++.+++|++ T Consensus 25 ~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~--~gg~tVkIp~i~~~gl~DY~R~~g-~~~g~vt~~~~t~tidq 101 (319) T protein:vir:94 25 EPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIF--MEGRSFTVMKGDTTELKDYKRNAT-NEFDHPKIEETTYFLDQ 101 (319) T ss_pred CcchHHHHHHHHHHHHHHHHHhhhhhhcccCcceEe--ccCcEEEEeeecccccccccCCCC-cccCCcccceeEEEeec Confidence 345677789999998887777766654 45666654 569999999999999999998765 67889999999999999 Q ss_pred eeeceeEechHHHHHhHHHHH--H-HHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHHHHhhcCC Q lcl|Aclame:pro 80 EKSIDFLVDDIDRVQVAGSLE--A-YTRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTKANV 156 (273) Q Consensus 80 ~~~~~~~i~d~d~~~~~~~~~--~-~~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~~~~v 156 (273) +++++|.|++.|..++.+.+. . ..+++...+++++|.+++++++..+... .+...+++++|+.|.++++.|++++| T Consensus 102 dR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~~-~~~~~t~~n~y~~i~~a~~~Lde~~V 180 (319) T protein:vir:94 102 EKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKH-LTVGTGSDAQYDAVLDVSVELDEIKA 180 (319) T ss_pred ccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcccc-cccccCHHHHHHHHHHHHHHHHhcCC Confidence 999999999999999887763 3 3566788999999999999998765543 34557889999999999999999999 Q ss_pred CcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCceEEEEEeccee Q lcl|Aclame:pro 157 PNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQIDTV 236 (273) Q Consensus 157 p~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a~~~~~~~~~v 236 (273) | ++|+++|+|+++..|++++.+..+.+. ..+.+++|.||+++||+|+++++... .+..++++|++|+.++.|.+.+ T Consensus 181 P-~~Rvl~Vtp~~~~~L~~~~~f~~~~~~--~~~~~~~g~Vg~idG~~Vi~vps~~~-k~in~i~~h~~A~~~~~k~~~~ 256 (319) T protein:vir:94 181 P-ENRVLFVSPTFYKGIKKFVIALPQGDT--RQQVLGKGVQGELDGFVIVKVPTKLL-QGLQAIAVVGEVLASPIQADLA 256 (319) T ss_pred C-CCcEEEeCHHHHHHHHhhhhhhccccc--cccceeeeeceeecCeEEEEeccccc-ccceEEEEcCCeeeeeeeeeee Confidence 9 699999999999999999876654433 34678899999999999999865443 3456899999999999999999 Q ss_pred eeccC-CCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 237 EALRD-QDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 237 e~~~~-~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) |.+++ +++++|.|+++++||++|++|++..++.+..+ T Consensus 257 ~~~~p~~~~~a~~v~gr~y~d~~V~~~k~~~Iy~~~~~ 294 (319) T protein:vir:94 257 KTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGT 294 (319) T ss_pred eccCCCccccceeeeeeeeeeeEEeccccceEEEeecC Confidence 98875 77889999999999999999997777765555 No 30 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=100.00 E-value=1e-45 Score=267.30 Aligned_cols=268 Identities=17% Similarity=0.198 Sum_probs=222.6 Q ss_pred Cccc-----------------chhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCcc Q lcl|Aclame:pro 1 MAFN-----------------NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQT 63 (273) Q Consensus 1 MA~~-----------------~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~ 63 (273) |++- .+.=|+|.+++++.|..+++|.+++.+. ....|+|++||+++..++. |...|+++ T Consensus 1 m~~~~~~~~t~~~~~~~~~~~~l~le~~~geV~~af~~~s~~~~~~~~r---~i~~G~s~~~~~iG~~~~~-~~~~g~~l 76 (334) T protein:vir:80 1 MTYPAANTHTRPGWGGANSDVSLHIEEHLGLVDASFMYSSKFASWMNVR---SLRGTNQLRVDRVGASTIA-GRKAGEEL 76 (334) T ss_pred CCCCcCCCccccccccccchheehhhhhhhHHHHHHHHhhhhhccceee---eccccceEEEeeecceeee-eecCCCCC Confidence 7651 1122999999999999999999988642 3456999999999999986 55567888 Q ss_pred CCcccccceEEEEEEeeeeceeEechHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhccccc------------ Q lcl|Aclame:pro 64 SADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADMLVDNGTALT------------ 130 (273) Q Consensus 64 ~~~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~------------ 130 (273) +.+.+.+++++++||+.+++.+.|+|.|+.+.++|+ .++.++++++||+++|+.++..+..++.... T Consensus 77 ~~~~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~~~~~~G~ 156 (334) T protein:vir:80 77 VVQKNVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLKPAFHDGI 156 (334) T ss_pred CCCCcccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccCCc Confidence 999999999999999999999999999999999997 5689999999999999999887765432110 Q ss_pred ----------ccccCCHhHHHHHHHHHHHHHhhcCCCc---CCcEEEECHHHHHHHhcchHHhhhhhccc--ccceeeee Q lcl|Aclame:pro 131 ----------GSAPSDADDAFDLIASALKELTKANVPN---VGRVVVVNAEMAFWLRSSGSKLTSADTSG--DAAGLRAG 195 (273) Q Consensus 131 ----------~~~~~~~~~~~~~i~~a~~~l~~~~vp~---~~r~lvv~p~~~~~L~~~~~~~~~~~~~~--~~~~~~~G 195 (273) .....++..+++++..|++.|++++||. .+|+++|+|++|+.|+.+++ +.+.++.+ +...+.+| T Consensus 157 ~~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r-~~n~d~~~s~~~~~~~~g 235 (334) T protein:vir:80 157 LLPSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDR-LMNVEFGAKEGGNSFVGG 235 (334) T ss_pred ceeecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccc-cccceeccccccccccce Confidence 0111334456788999999999999994 67999999999999999876 55555433 34567899 Q ss_pred eeeeecceEEEEecccccCCC------------------cEEEEEeCceEEEEEecc-eeeeccCCCcceeeEEeeeeee Q lcl|Aclame:pro 196 TIGNLLGARIVESNNLRDTDD------------------EQFVAFHPSAAAYVSQID-TVEALRDQDSFSDRIRALHVYG 256 (273) Q Consensus 196 ~ig~i~G~~i~~s~~l~~~~~------------------~~~~~~~~~a~~~~~~~~-~ve~~~~~~~~~~~v~~~~~~g 256 (273) .|++++||+|++||++|..+. ..+..+|++|++.++.+. ..|.+|++++|+|+|.+.+.|| T Consensus 236 ~i~~v~G~~V~~Sn~~P~~~~t~~~~g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~~d~i~~~~a~G 315 (334) T protein:vir:80 236 RIAMLNGVRVVETPRFPQSAITANALGADFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFWEEKKDFGHYLDTFQSYN 315 (334) T ss_pred eEEEEeceEEEeecCCCCccccccccccccccccccccceEEEEEeCceEEEEEEeecceeeeechhhHHHHHHHHHHcC Confidence 999999999999999995531 124678999999998875 7899999999999999999999 Q ss_pred eEEEcCceEEEEecCCC Q lcl|Aclame:pro 257 GKVVRPTGVVVFNKTGS 273 (273) Q Consensus 257 ~~vl~p~~~v~~~~~~s 273 (273) ++++|||++++++-+.+ T Consensus 316 ~g~lRPeaa~vv~~~~~ 332 (334) T protein:vir:80 316 IGQRRPDAVAVHDITVT 332 (334) T ss_pred CceeccceEEEEEEeee Confidence 99999999999999999 No 31 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=100.00 E-value=6.5e-45 Score=262.83 Aligned_cols=261 Identities=18% Similarity=0.218 Sum_probs=223.0 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEE Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 74 (273) ||+ ++++||+|+.++.+++.+.++|.+++.++.++++.+|++|+||.|..++.++...+|..+++++++.++.+ T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg~~i~~~~lt~~~~~ 80 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEGQKIPVDKIETNRRE 80 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCCCccccccCCCccCccccccceee Confidence 997 35889999999999999999999999999988888999999999999988887888889999999999999 Q ss_pred EEEEeeeeceeEechHHHHHhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHHHHhh Q lcl|Aclame:pro 75 LLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTK 153 (273) Q Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~~ 153 (273) .+|.+ ++++|.++|++.....++ +...+++++.++++++|+++++.+..+..... .....++.|.+|...|++ T Consensus 81 a~i~~-~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~~~~~-----~~~~t~d~i~~A~~~lgd 154 (276) T protein:vir:10 81 AKIHK-IGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTKLTVS-----ADIGTLAGLEAAIDTFDD 154 (276) T ss_pred EEeeh-ccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccc-----ccccCHHHHHHHHHHhcc Confidence 99955 799999999999888777 57899999999999999999999866443322 112347889999999998 Q ss_pred cCCCcCCcEEEECHHHHHHHhcch-HHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCceEEEEE- Q lcl|Aclame:pro 154 ANVPNVGRVVVVNAEMAFWLRSSG-SKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVS- 231 (273) Q Consensus 154 ~~vp~~~r~lvv~p~~~~~L~~~~-~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a~~~~~- 231 (273) ++. ..++++|+|+.+..|+++. ..|.+.+..+ .+.+++|.||.++|++|++|+++|.. ++++++++|+++.. T Consensus 155 ~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g-~~~~~~G~ig~~~G~~Vi~s~~~p~~---t~~l~~~gAi~~~~~ 228 (276) T protein:vir:10 155 EDL--EPMVLFINPKDAGKLRSSASDNFTRATELG-DNIIVKGAFGEALGAVIVRSKKLDEG---EAILAKRGAVKLITK 228 (276) T ss_pred ccC--cccEEEEcHHHHHHHHHhcccccccccccc-ccceeccccceecceeEEEcCCCCcc---eEEEEeccceeeeec Confidence 875 6789999999999997642 2354555544 45789999999999999999999864 46788999999764 Q ss_pred ecceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEe-cCCC Q lcl|Aclame:pro 232 QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFN-KTGS 273 (273) Q Consensus 232 ~~~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~-~~~s 273 (273) +...+|.+|++++++|.+++++|||+++++|+++|+++ +++| T Consensus 229 ~~~~vE~dRd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~ 271 (276) T protein:vir:10 229 RDFFLETDRDPSTKTTALYSDKHYVAYLYDESKAVKVTKGAGT 271 (276) T ss_pred CCceeecccchhhcccEEEEeeEEEEEEEcCcceEEEecCCcC Confidence 55589999999999999999999999999999999885 4555 No 32 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=100.00 E-value=7.5e-45 Score=262.48 Aligned_cols=268 Identities=16% Similarity=0.110 Sum_probs=220.8 Q ss_pred Cccc---------------chhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCC Q lcl|Aclame:pro 1 MAFN---------------NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSA 65 (273) Q Consensus 1 MA~~---------------~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~ 65 (273) |++- .+.=|+|.+++++.|...+++.+++.. . ....|++++||.+|...+ .|...|+.+++ T Consensus 1 ms~~n~~t~~~~~~~~~~~al~le~f~geV~taf~~~s~~~~~~~~-r--ti~~gkS~q~~~iG~~~~-~~~~~G~~ld~ 76 (364) T protein:vir:10 1 MSNPNVLTQPAVSASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDV-Q--EVVGTNSVSNKYIGETEL-QVLSPGKSPDA 76 (364) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCccee-e--eecccceEEeeeeeeeEE-eeeccCcccCC Confidence 7651 223499999999999999999988754 2 355799999999999888 55666777899 Q ss_pred cccccceEEEEEEeeeeceeEechHHHHHhHHH-HH-HHHHHHHHHHHHHHHHHHHHHHHhhc-cc----------cc-- Q lcl|Aclame:pro 66 DAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGS-LE-AYTRAGATALATDTDKFIADMLVDNG-TA----------LT-- 130 (273) Q Consensus 66 ~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~-~~-~~~~~~~~ala~~iD~~~~~~~~~~~-~~----------~~-- 130 (273) +.+..++.+++||+.+++.+.|+|+|+.+.+++ ++ ++.++++++|++.+|+.++..+..++ .. .. T Consensus 77 ~~~~~~k~~itID~ll~a~~~V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~~~~~~g 156 (364) T protein:vir:10 77 SPTEFDKNRLVVDTTVIARNTVAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNPRVAGHG 156 (364) T ss_pred CCcccCcEEEEecceeeechhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCCcccCCc Confidence 999999999999999999999999999999998 65 57799999999999999987765432 00 00 Q ss_pred ---------ccccCCHhHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcc-cccceeeeeeeeee Q lcl|Aclame:pro 131 ---------GSAPSDADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTS-GDAAGLRAGTIGNL 200 (273) Q Consensus 131 ---------~~~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~-~~~~~~~~G~ig~i 200 (273) .+....+..++++|.++...|+|++||.++|+++++|++|+.|+++++++ +.++. .+...+.+|.|+++ T Consensus 157 ~~i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~~R~~vv~P~~y~~Ll~~~~lv-n~d~~~~~~~~~~~G~v~~v 235 (364) T protein:vir:10 157 FSIHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDTSELCGLMPWTAFNCLRDADRIV-DKSYTIAASDNTVDGFVLKS 235 (364) T ss_pred ceeeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCccccEEEeChHHHHHHhcCCccc-cccccccCCCccccceeEEE Confidence 01112334578889999999999999999999999999999999987644 44432 12356789999999 Q ss_pred cceEEEEecccccCC------------------------------CcEEEEEeCceEEEEEecc-eeeeccCCCcceeeE Q lcl|Aclame:pro 201 LGARIVESNNLRDTD------------------------------DEQFVAFHPSAAAYVSQID-TVEALRDQDSFSDRI 249 (273) Q Consensus 201 ~G~~i~~s~~l~~~~------------------------------~~~~~~~~~~a~~~~~~~~-~ve~~~~~~~~~~~v 249 (273) +||+|++||++|... ...++.|||+|++.++.++ .+|.+|++++|+|++ T Consensus 236 ~Gv~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e~~~~~~~~~~~i 315 (364) T protein:vir:10 236 WNTPIVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISITGDIFYEKKEKTWYI 315 (364) T ss_pred eceEEEeccccccccccccccccccccccccccCCcccccccccceeEEEEEecceEEEEEEecceeeeeeccceeeeee Confidence 999999999998410 1235789999999998774 889999999999999 Q ss_pred EeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 250 RALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 250 ~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) .+.+.||++++|||+++++++..+ T Consensus 316 da~~a~G~g~lRPeaa~~i~~~~~ 339 (364) T protein:vir:10 316 DTFLAEGAIPDRWEAVAVVTAADT 339 (364) T ss_pred eeehcccCcccCccceEEEEecCC Confidence 999999999999999999998888 No 33 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=100.00 E-value=3e-44 Score=259.17 Aligned_cols=265 Identities=17% Similarity=0.165 Sum_probs=222.5 Q ss_pred CcccchhHHHHHHHHHHHHHHhhc-cchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEEEEEEe Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWTAQTV-FANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQ 79 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~~~~v-~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid~ 79 (273) =.|++...|.|++.|.+.|...+. ...+++++++ +..|++|+||+++..+++||++.++ ..+++++.++.+++|++ T Consensus 36 ~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e--~~~g~tVkIp~i~~~gl~DY~R~~g-~~~g~vt~~~~t~tidq 112 (329) T protein:vir:10 36 EPGDTLLKNKHVGILEKVTAANSYSAPAVISNDAI--FMQGRSFTVIKGDVTELKDYKRNAT-NEFDHPQIQETTYFLDQ 112 (329) T ss_pred CCchhHHHHHHHHHHHHHHHhhceeeeeeccccee--eccCcEEEEeeecccccccccCCCC-ccccccccceeEEEeec Confidence 445677899999999999987754 4456777776 4569999999999999999998775 57889999999999999 Q ss_pred eeeceeEechHHHHHhHHHHH--H-HHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHHHHhhcCC Q lcl|Aclame:pro 80 EKSIDFLVDDIDRVQVAGSLE--A-YTRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTKANV 156 (273) Q Consensus 80 ~~~~~~~i~d~d~~~~~~~~~--~-~~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~~~~v 156 (273) +++++|.|+++|..++.+.+. . ..+++...+++++|++++++++..+... ..+..+++++|+.|.+++..|+++++ T Consensus 113 dR~~~F~VD~~D~dEtn~~l~a~~i~~~~~~~~v~pEiDay~~skla~~a~~~-~~~~~t~~nay~~i~~a~~~Lde~~v 191 (329) T protein:vir:10 113 EKYWGRFVDALDRRDTEGNIDINYVVAKQASEVVAPYLDNLRFATLARNKAKH-LTVGSGADAQYDAVLDVSVELDEIGA 191 (329) T ss_pred ccceeeecchhhHhhhhhhhhHHHHHHHHHHHHhhhHHHHHHHHHHHhhcccc-cccccCHHHHHHHHHHHHHHHHhcCC Confidence 999999999999999887653 3 3456888999999999999998765543 34557889999999999999999999 Q ss_pred CcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCceEEEEEeccee Q lcl|Aclame:pro 157 PNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQIDTV 236 (273) Q Consensus 157 p~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a~~~~~~~~~v 236 (273) | ++|+++++|+++..|++++.+.... ...++.+++|.||+++||+|+++++.+. .+..++++|++|++++.|...+ T Consensus 192 p-~~Rvl~VtP~~~~~Lk~~~~f~~~~--~~~~~~~~~g~Vg~idG~~Ii~vps~~~-k~in~ii~~~~A~~~~~K~~~~ 267 (329) T protein:vir:10 192 G-ASRILFVTPKFYKGIKKFVIELPQG--DNRQQVLGKGVQGELDGFTIVKVPSKML-QGVEAMAVIGEVMASPIQANEA 267 (329) T ss_pred C-CCcEEEeCHHHHHHHHhhhhhhccc--cccccceeeeeeeeecCeEEEEecCCcc-cceeEEEEcCCceeeeeeeeee Confidence 8 5999999999999999887655432 2344678899999999999999876554 3456899999999999999999 Q ss_pred eeccC-CCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 237 EALRD-QDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 237 e~~~~-~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) |.+++ +++++|.|+++++||++|++|++..++....+ T Consensus 268 ~~~~p~~~~~a~~v~gr~yyd~~V~~~k~~~I~~~~~~ 305 (329) T protein:vir:10 268 KLNSNVPGMFGTLAEQMLYTGAFVPEHLQKYIFTIGGK 305 (329) T ss_pred eeeCCCCccchheeeeeeeeeeEEEccccCEEEEeccc Confidence 99885 77889999999999999999997776655444 No 34 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=100.00 E-value=7.9e-44 Score=256.89 Aligned_cols=267 Identities=21% Similarity=0.269 Sum_probs=205.8 Q ss_pred Ccc---cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEEEEE Q lcl|Aclame:pro 1 MAF---NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLI 77 (273) Q Consensus 1 MA~---~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ti 77 (273) |+. +.|+||+|++++++.|++.+++.+++++ .+.++.+||||+||+++.+++.++.+ +.+++++++++++++++| T Consensus 15 ~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~-~~~~~~~GdTV~ip~~g~~~a~d~~~-g~~i~~~~~~~~~~~itI 92 (381) T protein:vir:80 15 VDLSNVQVFIPEVWSSEVRMFRDQKFAALEATKK-IPFEGKKGDLIHIPNISRAAVYDKQP-QTPVNLQARTDSEFTFTV 92 (381) T ss_pred cchhhHHhhhhHHHHHHHHHHHHHhhhhhhcccc-ccceeecCceEEeeccCcceeeeecC-CCcccccccCCceEEEEE Confidence 332 4778999999999999999999998764 44456779999999999998887765 567899999999999999 Q ss_pred EeeeeceeEechHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhccccc-----------------ccccCCHhH Q lcl|Aclame:pro 78 DQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADMLVDNGTALT-----------------GSAPSDADD 139 (273) Q Consensus 78 d~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~-----------------~~~~~~~~~ 139 (273) |+++++++.|+|.|+.+.++++ +.+.++++.+|++++|++++..+........ ..+..+... T Consensus 93 D~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~~t~~~~i~~~~~~~~~t~~~~~~ 172 (381) T protein:vir:80 93 TKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRIYSYDTTLGDGTVNAHLTGTPAPL 172 (381) T ss_pred eeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccccccccccccccchhhH Confidence 9999999999999999999986 5689999999999999999988754322110 012233456 Q ss_pred HHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcE- Q lcl|Aclame:pro 140 AFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQ- 218 (273) Q Consensus 140 ~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~- 218 (273) +++.|.++++.|++++||.++|+++++|++|+.|+++++ +.+.+. +..+.+++|.||+++||+|++||++|...... T Consensus 173 t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~~~-~~~ad~-~~~~~l~~G~Ig~i~G~~Vv~Sn~lp~~~~t~~ 250 (381) T protein:vir:80 173 TYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLSINQ-FISVDF-SQVKPVTSGVVGTILGMEVIVTTQIGINSLTGY 250 (381) T ss_pred HHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhhchh-hhhhhh-ccchhhhceeeeEEcceEEEeecccccccccce Confidence 789999999999999999999999999999999999875 555665 44568999999999999999999999755432 Q ss_pred -EEEEeCceEEEEEecceeeeccCCCcceeeEEeeeeeeeEEE-cCceEEEEecCCC Q lcl|Aclame:pro 219 -FVAFHPSAAAYVSQIDTVEALRDQDSFSDRIRALHVYGGKVV-RPTGVVVFNKTGS 273 (273) Q Consensus 219 -~~~~~~~a~~~~~~~~~ve~~~~~~~~~~~v~~~~~~g~~vl-~p~~~v~~~~~~s 273 (273) ..++++.+.. .+........+..+.+..+.....|+.++. +-..+-++.-++| T Consensus 251 ~~~agap~~~~--~~~~~~~~~g~~s~~a~av~~~k~yd~~~~~~~~~~~~~~g~~~ 305 (381) T protein:vir:80 251 VNGQGAPTQPT--PGVLGSPYLPDQAGTANVVNTGSASDLAVSLSYFGLPVFSGAGA 305 (381) T ss_pred eeecccccccc--ccccccccccccccceeeeeeeeeeceeeeeeeccceeeeccee Confidence 1222222221 111111111233345789999999998885 5556656666666 No 35 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=100.00 E-value=4.4e-44 Score=258.28 Aligned_cols=268 Identities=14% Similarity=0.127 Sum_probs=199.1 Q ss_pred Ccccchh--HHHHHHHHHHHHHHhhccchhhhccccccc---cCCcEEEEEeccccccccccCC-CCccCCcccccceEE Q lcl|Aclame:pro 1 MAFNNFI--PELWSDMLLEEWTAQTVFANLVNREYEGIA---SKGNVVHIAGVVAPTVKDYKAA-GRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~~~~--pev~~~~v~~~l~~~~v~~~~~~~d~~~~~---~~Gdtv~ip~~~~~~~~d~~~~-~~~~~~~~~~~~~~~ 74 (273) |||+++. ||+|++++++.|+++++++++++|+|+.++ +.||||+||+++.+.+.+|... +..++++++.+..++ T Consensus 1 MAN~llT~iP~iia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p~~~~v~d~~~~~~~~~~~~~~~e~~v~ 80 (423) T protein:vir:35 1 MANNLESNISQIVLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRPHQFKSERTETGDITGKDKNGLFSAKAT 80 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeCCcceeecccCcCCCCccccccccceee Confidence 9999765 999999999999999999999999998874 5699999999999999998643 456788999999999 Q ss_pred EEEEeeeeceeEechHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 75 LLIDQEKSIDFLVDDIDRVQVAGSLEAYTRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTKA 154 (273) Q Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~~~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~~~ 154 (273) ++||++++++|+++|+|+.+...+++.++++++++|+.++|.++++.+........ +++.+..+.++.|.++++.|+++ T Consensus 81 l~id~~k~~a~~v~d~e~~l~i~~~~~~l~~a~~ala~~vd~~l~~~l~~~a~~~v-gt~~t~~~~~~~i~~a~~~Ld~~ 159 (423) T protein:vir:35 81 GKVGKYITVAVEWTQIEEALKLNQLDQILSPIHERMVTDLETELAHFMMNNGALSL-GSPNTAIKKWADVAQTASFIKDI 159 (423) T ss_pred EEeccceeccceeCHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-ccccCCcchHHHHHHHHHHHHHh Confidence 99999999999999999998888888889999999999999999998766444332 33334446689999999999999 Q ss_pred CCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeee-eeecceEEEEecccccCCCcEE--EEEeCceEE--- Q lcl|Aclame:pro 155 NVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTI-GNLLGARIVESNNLRDTDDEQF--VAFHPSAAA--- 228 (273) Q Consensus 155 ~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~i-g~i~G~~i~~s~~l~~~~~~~~--~~~~~~a~~--- 228 (273) +||..+|++|++|+++..|++++.++...+.. ..+.+++|.| |+++||+||+||++|..++..+ ...+..+.. T Consensus 160 ~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~-~~~alr~g~i~G~i~GFdv~~Snnvp~~T~gt~~~~~~v~~a~~v~~ 238 (423) T protein:vir:35 160 GIKTGENYAIMDPWSAQRLADAQSGLHAADQL-VRTAWENAQISGNFGGIRALMSNGLASRKQGDFDGAITVKTAPNVDY 238 (423) T ss_pred cCCcCCCEEEeCHHHHHHHhccccceeccccc-hhHHHhhccceeeecceEEEEcCCCccccccccccceeecccccccc Confidence 99999999999999999999887767655443 4567888876 9999999999999996544321 111111110 Q ss_pred EEE----ec-ceee----eccCCCcceeeEEeeeeeeeEEEcCceEEEE-------------ecCC-C Q lcl|Aclame:pro 229 YVS----QI-DTVE----ALRDQDSFSDRIRALHVYGGKVVRPTGVVVF-------------NKTG-S 273 (273) Q Consensus 229 ~~~----~~-~~ve----~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~-------------~~~~-s 273 (273) .+. +. ..+. ...+.-..+|.+ ..-|.+.++|-.-.++ .++. + T Consensus 239 ~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~---t~aGv~~v~~~t~~~~~~~~t~~~~~~~V~~~~~~ 303 (423) T protein:vir:35 239 LSVKDSYQFTVALTGATPSKTGFLKAGDQL---KFTSTHWLNQQSKQTLYNGSTAMSFTATVLEETNS 303 (423) T ss_pred ccccccccceeeeeeeeeccCCcEEecceE---EeeeeeeccccccceeecccCCceeEEEEeccccc Confidence 000 00 0000 112223345543 3345555544433222 2111 1 No 36 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=100.00 E-value=1.6e-43 Score=255.26 Aligned_cols=268 Identities=13% Similarity=0.109 Sum_probs=199.2 Q ss_pred Ccccch--hHHHHHHHHHHHHHHhhccchhhhccccccc---cCCcEEEEEeccccccccccCCC-CccCCcccccceEE Q lcl|Aclame:pro 1 MAFNNF--IPELWSDMLLEEWTAQTVFANLVNREYEGIA---SKGNVVHIAGVVAPTVKDYKAAG-RQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~~~--~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~---~~Gdtv~ip~~~~~~~~d~~~~~-~~~~~~~~~~~~~~ 74 (273) |||+++ +||+|++++++.|++++++.++++|+|+.++ +.||||+||+++.+...+|.... ..++++++.+..++ T Consensus 1 MaN~llT~ip~iia~~al~~l~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~~~~~~~~~~~~~~~~~~~~l~e~~v~ 80 (423) T protein:vir:17 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLISGKAT 80 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhhcccCCcchhhcccCCEEEEeeCCcceeecccCcccCCcccCccccceeE Confidence 999976 5999999999999999999999999998764 47999999999999999987543 34678999999999 Q ss_pred EEEEeeeeceeEechHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 75 LLIDQEKSIDFLVDDIDRVQVAGSLEAYTRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTKA 154 (273) Q Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~~~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~~~ 154 (273) ++||++++++|+++|+|+.+...++++++++++++|++++|+++++++...+.... +++.+..+.++++.+++..|+++ T Consensus 81 l~id~~k~va~~v~d~E~~~~i~~~~~~l~~A~~aLA~~vd~~ia~~~~~~a~~~~-gt~~t~~~a~~~i~~a~~~Ld~~ 159 (423) T protein:vir:17 81 GRVGNYITVAVEYQQLEEAIKLNQLEEILAPVRQRIVTDLETELAHFMMNNGALSL-GSPNTPITKWSDVAQTASFLKDL 159 (423) T ss_pred EEeeceeeeeeeecHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-ccCCcccccHHHHHHHHHHHHhc Confidence 99999999999999999987777788899999999999999999999866544333 23334445689999999999999 Q ss_pred CCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeee-eeecceEEEEecccccCCCcEE--EEEeCce--E-E Q lcl|Aclame:pro 155 NVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTI-GNLLGARIVESNNLRDTDDEQF--VAFHPSA--A-A 228 (273) Q Consensus 155 ~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~i-g~i~G~~i~~s~~l~~~~~~~~--~~~~~~a--~-~ 228 (273) ++|..+|++|++|+++..|++++.++...+ .+..+.+++|.| |+++||+||+||++|..++..+ .+.+..+ + + T Consensus 160 ~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~-~~~~~alr~g~i~G~i~GFdvy~Snnip~~T~gt~~~t~~~~~~~~v~~ 238 (423) T protein:vir:17 160 GVNEGENYAVMDPWSAQRLADAQTGLHASD-QLVRTAWENAQIPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQPTVTY 238 (423) T ss_pred cCCcCCCEEEeChHHHHHHhccccceeccc-ccchHHHhhccceeeecceEEEEeCCCccccccceeceeeecccccccc Confidence 999999999999999999998887665443 445678999987 8999999999999996654321 1111000 0 0 Q ss_pred EE-----Eecceee----eccCCCcceeeEEeeeeeeeEEEcCceEE-------------EEecCCC Q lcl|Aclame:pro 229 YV-----SQIDTVE----ALRDQDSFSDRIRALHVYGGKVVRPTGVV-------------VFNKTGS 273 (273) Q Consensus 229 ~~-----~~~~~ve----~~~~~~~~~~~v~~~~~~g~~vl~p~~~v-------------~~~~~~s 273 (273) .+ .+...+. ...+.-..+|.+ ..-|.+.+.|-.-- +.+++++ T Consensus 239 ~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~---t~aGv~~v~~~tk~v~~~~~t~~~~~~~v~~~~~ 302 (423) T protein:vir:17 239 NAVKDSYQFTVTLTGATTSVTGFLKAGDQV---KFTNTYWLQQQTKQALYNGATPISFTATVTADAN 302 (423) T ss_pred cccccccceeeeeeeeeeeccCceeecceE---EecceeeecccccccccccccccceEEEEEeccc Confidence 00 0001111 112333345644 33455555443321 2233232 No 37 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=100.00 E-value=1.1e-43 Score=256.16 Aligned_cols=268 Identities=13% Similarity=0.147 Sum_probs=223.9 Q ss_pred Cccc---------------chhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCC Q lcl|Aclame:pro 1 MAFN---------------NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSA 65 (273) Q Consensus 1 MA~~---------------~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~ 65 (273) |+|- .+.=|+|.+++++.|...++|.+++.+. ....|++++||.+|...+ .|...|.+++. T Consensus 1 ms~~~~~t~~~~~~s~~d~al~le~f~geV~~af~~~s~~~~~~~~r---ti~~g~s~~~~~iG~~~~-~~~~pG~~l~~ 76 (335) T protein:vir:78 1 MSFLNDLTRPNYAGKNADVDIHLEEHLGIVDKHFAYTSKFAPLMNIR---DLRGSNVVRLDRLGNVEA-KGRRAGEELER 76 (335) T ss_pred CCccccccccccccccchhhhhhhhhhhHHHHHHHHhhhhcccccee---eeccceeEEEeeeeeeee-cccccCcccCC Confidence 7761 1223999999999999999999988643 346699999999999887 67888899999 Q ss_pred cccccceEEEEEEeeeeceeEechHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhccccc-------------- Q lcl|Aclame:pro 66 DAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADMLVDNGTALT-------------- 130 (273) Q Consensus 66 ~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~-------------- 130 (273) +.+..++..++||+.+++.+.|+|.|+.++++|+ .++.++++++||+..|+.++..+..++.... T Consensus 77 ~~~~~~k~~itID~ll~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~~~~G~~~ 156 (335) T protein:vir:78 77 SRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGVLE 156 (335) T ss_pred CCcccCCeEEEecceeechhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCcCCCcce Confidence 9999999999999999999999999999999997 5689999999999999999877765442111 Q ss_pred ------ccccCCHhHHHHHHHHHHHHHhhcCCCcC---CcEEEECHHHHHHHhcchHHhhhhhcc--cccceeeeeeeee Q lcl|Aclame:pro 131 ------GSAPSDADDAFDLIASALKELTKANVPNV---GRVVVVNAEMAFWLRSSGSKLTSADTS--GDAAGLRAGTIGN 199 (273) Q Consensus 131 ------~~~~~~~~~~~~~i~~a~~~l~~~~vp~~---~r~lvv~p~~~~~L~~~~~~~~~~~~~--~~~~~~~~G~ig~ 199 (273) .....++..+.+++.++...|++++||.. +|+++|+|++|+.|+.+++++ +.++. ++...+.+|.+++ T Consensus 157 ~~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~-n~~~~~s~~~~~~~~g~v~~ 235 (335) T protein:vir:78 157 KLDLTGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLM-SVEYQATGATNDYVKSRVAI 235 (335) T ss_pred eeeeccccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhcccccc-cccccccccccccccceeEE Confidence 11122455568889999999999999965 699999999999999987654 44432 3335678999999 Q ss_pred ecceEEEEecccccCCC------------------cEEEEEeCceEEEEEecc-eeeeccCCCcceeeEEeeeeeeeEEE Q lcl|Aclame:pro 200 LLGARIVESNNLRDTDD------------------EQFVAFHPSAAAYVSQID-TVEALRDQDSFSDRIRALHVYGGKVV 260 (273) Q Consensus 200 i~G~~i~~s~~l~~~~~------------------~~~~~~~~~a~~~~~~~~-~ve~~~~~~~~~~~v~~~~~~g~~vl 260 (273) ++||+|++||++|..+. ..+..+|+.|++.++.++ ..|.+++.++|+|+|.+.+.||++++ T Consensus 236 v~Gv~V~~Sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~~~~i~~~~a~G~g~l 315 (335) T protein:vir:78 236 LNGVKVLETPRFATKAISAHPLGRHFNVSAEEAERQIALFLPSKTLITAQVAPVQAKLWEDHDQFSWVLDTFQMYNIGAR 315 (335) T ss_pred eeceEEEeeccCCCCCCccccccccCCcccccccceEEEEEecceEEEEEEEecccceeeccchhhHhhhHHHHcCCccc Confidence 99999999999995431 246789999999998886 67999999999999999999999999 Q ss_pred cCceEEEEecCCC Q lcl|Aclame:pro 261 RPTGVVVFNKTGS 273 (273) Q Consensus 261 ~p~~~v~~~~~~s 273 (273) |||++++|+=+|. T Consensus 316 RPe~a~~i~~tg~ 328 (335) T protein:vir:78 316 RPDTAGAIELKGI 328 (335) T ss_pred CcceEEEEEecCC Confidence 9999999998887 No 38 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=100.00 E-value=1.7e-43 Score=255.10 Aligned_cols=268 Identities=13% Similarity=0.144 Sum_probs=221.5 Q ss_pred Cccc-c--------------hhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCC Q lcl|Aclame:pro 1 MAFN-N--------------FIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSA 65 (273) Q Consensus 1 MA~~-~--------------~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~ 65 (273) |+|- . +.=|+|.+++++.|...++|.+++.+. ....|++++||.+|...+. |...|++++. T Consensus 1 ms~~~~~tr~~~~~s~~d~al~le~f~geV~~af~~~s~~~~~~~~r---ti~~g~s~~~~~iG~~~~~-~~~pG~~l~~ 76 (335) T protein:vir:63 1 MSFLNDLTRPNYAGKNADVDIHLEEHLGIVDKHFAYTSKFAPLMNIR---DLRGSNVVRLDRLGNVEAK-GRRAGEELER 76 (335) T ss_pred CCCcccchhhhcccccchhheehhhhhhhHHHHHHhhhhhcccccee---eeccceeEEEeeeeeeeee-cccCCcCcCC Confidence 7761 1 112999999999999999999987543 3466999999999998875 5666888899 Q ss_pred cccccceEEEEEEeeeeceeEechHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhccccc-------------- Q lcl|Aclame:pro 66 DAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADMLVDNGTALT-------------- 130 (273) Q Consensus 66 ~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~-------------- 130 (273) +.+..++.+++||+..+..+.|+|.|+.+.++|+ .++.++++++||+..|+.++..+..++.... T Consensus 77 ~~~~~~k~~itVD~ll~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~~~~G~~~ 156 (335) T protein:vir:63 77 SRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGVLE 156 (335) T ss_pred CCccccceEEEecceeechhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCCCcCCCcce Confidence 9999999999999999999999999999999997 5689999999999999999887765443211 Q ss_pred ------ccccCCHhHHHHHHHHHHHHHhhcCCCcCC---cEEEECHHHHHHHhcchHHhhhhhcc--cccceeeeeeeee Q lcl|Aclame:pro 131 ------GSAPSDADDAFDLIASALKELTKANVPNVG---RVVVVNAEMAFWLRSSGSKLTSADTS--GDAAGLRAGTIGN 199 (273) Q Consensus 131 ------~~~~~~~~~~~~~i~~a~~~l~~~~vp~~~---r~lvv~p~~~~~L~~~~~~~~~~~~~--~~~~~~~~G~ig~ 199 (273) .+...++..+.+++.++...|++++||.++ |+++|+|++|+.|+.+++++ |.++. ++.....+|.|++ T Consensus 157 ~~~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~-n~~~~~s~~~~~~~~g~v~~ 235 (335) T protein:vir:63 157 KLDLTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLM-NVEYQATGATNDYVKSRVAI 235 (335) T ss_pred eeeeccCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhcccccc-ccccccccccccccCceeEE Confidence 011123455667888999999999999754 99999999999999987654 44432 2335678899999 Q ss_pred ecceEEEEecccccCCC------------------cEEEEEeCceEEEEEecc-eeeeccCCCcceeeEEeeeeeeeEEE Q lcl|Aclame:pro 200 LLGARIVESNNLRDTDD------------------EQFVAFHPSAAAYVSQID-TVEALRDQDSFSDRIRALHVYGGKVV 260 (273) Q Consensus 200 i~G~~i~~s~~l~~~~~------------------~~~~~~~~~a~~~~~~~~-~ve~~~~~~~~~~~v~~~~~~g~~vl 260 (273) ++||+|++||++|..+. ..++++|++|++.++..+ ..|.+++.++|+|+|.+.+.||++++ T Consensus 236 v~Gv~V~~sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~~~~~~i~~~~a~G~g~l 315 (335) T protein:vir:63 236 LNGVKVLETPRFATKAIAAHPLGRHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNEKFSWVLDTFQMYNIGAR 315 (335) T ss_pred eeceEEEeeccCCCCCcccccccccCCccccccceeEEEEEecceEEEEEEeecccceeeccchhhHHhHHHHHcCCccc Confidence 99999999999995432 256889999999998876 78999999999999999999999999 Q ss_pred cCceEEEEecCCC Q lcl|Aclame:pro 261 RPTGVVVFNKTGS 273 (273) Q Consensus 261 ~p~~~v~~~~~~s 273 (273) |||++++++=+|- T Consensus 316 RPe~a~~i~~tg~ 328 (335) T protein:vir:63 316 RPDTAGAIELKGI 328 (335) T ss_pred ccceEEEEEEcCC Confidence 9999999988777 No 39 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=100.00 E-value=1e-42 Score=250.85 Aligned_cols=270 Identities=14% Similarity=0.081 Sum_probs=194.3 Q ss_pred Ccccc--hhHHHHHHHHHHHHHHhhccchhhhccccccc---cCCcEEEEEeccccccccccCC-CCccCCcccccceEE Q lcl|Aclame:pro 1 MAFNN--FIPELWSDMLLEEWTAQTVFANLVNREYEGIA---SKGNVVHIAGVVAPTVKDYKAA-GRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~~--~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~---~~Gdtv~ip~~~~~~~~d~~~~-~~~~~~~~~~~~~~~ 74 (273) |||++ |+||+|++++++.|+++++++++++|+|+.++ +.||||+||+++.+...+.... .....++++.+.+++ T Consensus 1 MANsl~~l~p~iia~~al~~l~~~lV~~~lV~r~y~~ef~~ak~GDTV~I~~P~~~~~~d~~~~~~t~~~~~~l~e~~v~ 80 (423) T protein:vir:10 1 MANNLDANVSQIVLKKFLPGFMSDLVLCKTVDRQLLAGEINSSTGDSVSFKRPHQFKSERTMDGDITGKSKNSLISAKAT 80 (423) T ss_pred CccccccccHHHHHHHHHHHHHhhcccchhhccCCCccccccccCCEEEEeeCCceeeecccCcccCcccccccccceEE Confidence 99987 99999999999999999999999999998773 3699999999999888764322 123345678888999 Q ss_pred EEEEeeeeceeEechHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 75 LLIDQEKSIDFLVDDIDRVQVAGSLEAYTRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTKA 154 (273) Q Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~~~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~~~ 154 (273) ++||++++++|+++|+|+.+...++++++++++++|++++|++++..+...+....++ +.+..+.++++.++++.|+++ T Consensus 81 l~id~~k~~a~~v~d~E~~l~i~~~~~~l~~A~~aLA~~vd~~ia~~~~~~~~~~vgt-~~t~~~a~~~~a~a~~~L~~~ 159 (423) T protein:vir:10 81 GEVGNYITVAVEYRQIEEALKLNQLDQILVPINERMVTDLETELALFMMKHGALSLGS-PNTPIKKWSDVAQTASFLKDL 159 (423) T ss_pred EEecceeeeeeeeChHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc-cccccccHHHHHHHHHHHhhc Confidence 9999999999999999998777778889999999999999999987776544443333 333345689999999999999 Q ss_pred CCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeee-eeecceEEEEecccccCC-CcEEEEEeCceEEEEEe Q lcl|Aclame:pro 155 NVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTI-GNLLGARIVESNNLRDTD-DEQFVAFHPSAAAYVSQ 232 (273) Q Consensus 155 ~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~i-g~i~G~~i~~s~~l~~~~-~~~~~~~~~~a~~~~~~ 232 (273) ++|..+|++|++|+++..|++++.++...+.. ..+.+++|.| |+++||+||+||++|..+ +....++|..+...+.+ T Consensus 160 ~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~-~~~alr~~~i~G~~~GFdi~~Sn~vp~~T~g~~~ga~~~~~~~~vt~ 238 (423) T protein:vir:10 160 GINSGENYAVMDPWAAQRLADAQSGLHVSEQL-VRTAWENAQISGNFGGIRALMSNGLASRTQGAFGGKLTVKGTPEVNY 238 (423) T ss_pred cCCcCCCEEEeCHHHHHHHhhhhhhhcccccc-chHHHHhcccceeecceEEEEecCCcccccccccceeeeeeeeEEEe Confidence 99999999999999999999887767665544 4567889976 999999999999999543 33445566655554433 Q ss_pred cceeeeccC--------CCcceeeEEeeee--eeeEEEcCc--------------eEEEEecCCC Q lcl|Aclame:pro 233 IDTVEALRD--------QDSFSDRIRALHV--YGGKVVRPT--------------GVVVFNKTGS 273 (273) Q Consensus 233 ~~~ve~~~~--------~~~~~~~v~~~~~--~g~~vl~p~--------------~~v~~~~~~s 273 (273) ....+.... ...-+++-.++.. -|.+.++|= ..+| .++.+ T Consensus 239 a~~~~~~~~~~~~~~~T~s~~g~l~~GD~~t~aGv~~v~~~tk~~l~~~~~~~~~~~~V-~~~~~ 302 (423) T protein:vir:10 239 DSVKDSYAFTATLTGATASKKGFLKVGDQLQFDDTHWLNQQSKQTLYNGASALSFTATV-MEDAN 302 (423) T ss_pred cccccccccccceeeccceeceeEEecceEeecceeeecccccceeecccCCcceEEEE-Eeccc Confidence 211000000 0001122222211 222222221 1112 22111 No 40 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=100.00 E-value=1.2e-43 Score=255.89 Aligned_cols=238 Identities=18% Similarity=0.174 Sum_probs=192.3 Q ss_pred hhhccccccccCCcEEEEEeccccccccccCCCCccC--CcccccceEEEEEEeeeeceeEechHHHHHhHHHH-HHHHH Q lcl|Aclame:pro 28 LVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTS--ADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTR 104 (273) Q Consensus 28 ~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~--~~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~~-~~~~~ 104 (273) ++ | ....|++++||++|...+..+.+ |.++. ++++...+.+++||+.+++.+.|+|.|+.++++++ .++.+ T Consensus 1 ~v-r----~i~~g~s~~~~~iG~~~~~~~~~-G~~l~~~~~~~~~~e~~itID~~l~~~~~VdDiD~~qa~~Dlr~e~s~ 74 (324) T protein:vir:99 1 MT-R----TITSGKSAQFPVMGRTKARYLKQ-GQSLDDGREDIKHTEKVITIDGLLTTDVLIYDIEDAMNHYDVRSEYST 74 (324) T ss_pred Ce-e----eeecCceEEEeeeeeeEeccccC-CCCcCCCcCCcCcccEEEEecchhhhhhhhhhHHHHhcCccchhHHHH Confidence 22 2 35569999999999999876554 55553 46789999999999999999999999999999996 56899 Q ss_pred HHHHHHHHHHHHHHHHHHHhhccc----------------------ccccccCCHhHHHHHHHHHHHHHhhcCCCcCCcE Q lcl|Aclame:pro 105 AGATALATDTDKFIADMLVDNGTA----------------------LTGSAPSDADDAFDLIASALKELTKANVPNVGRV 162 (273) Q Consensus 105 ~~~~ala~~iD~~~~~~~~~~~~~----------------------~~~~~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~ 162 (273) +++++|++.+|+.++..+...... .......++..+++.|.+++..|++++||.++|| T Consensus 75 ~~G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~Lde~~VP~~gR~ 154 (324) T protein:vir:99 75 QMGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAFAKKYIPAGDRT 154 (324) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCE Confidence 999999999999998876532110 0001123455779999999999999999999999 Q ss_pred EEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCC--------------------------- Q lcl|Aclame:pro 163 VVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTD--------------------------- 215 (273) Q Consensus 163 lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~--------------------------- 215 (273) ++|+|++|+.|+.+. ++.+.+. ++.+.+++|.|++++||+||+||++|... T Consensus 155 ~vv~P~~y~~Ll~~~-~~~~~~~-~~~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ky 232 (324) T protein:vir:99 155 FYTDPDTYSAILAAL-MPNAANY-AALIDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTGKM 232 (324) T ss_pred EEeChHHHHHHhhcc-ccccccc-ccccceecceEEEEeceEEEecCCcccccccccccccccccccccccccccccccc Confidence 999999999887554 4555444 44567899999999999999999998531 Q ss_pred -----CcEEEEEeCceEEEEEecc-eeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 216 -----DEQFVAFHPSAAAYVSQID-TVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 216 -----~~~~~~~~~~a~~~~~~~~-~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ....+.+|+++.+.++.+. ++|.+|++++|+|+|.+++.||++++|||++++++-.++ T Consensus 233 ~~d~~~~~gl~~~~~a~~tv~~~~~~~e~~~~~~~~~d~i~~~~a~G~~~lRPe~a~~v~l~~~ 296 (324) T protein:vir:99 233 TVGADNVVGLFVHRSAVATLKLKDMALERARRPEYQADQIIAKYAMGHGGLRPEAVGAIIFEDG 296 (324) T ss_pred ccccCceeEEEEehhheEEEeeecceecceechhhHHHhhhhhhhhcCcccccceEEEEEEccC Confidence 1234788999999888776 799999999999999999999999999998876654444 No 41 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=100.00 E-value=4.8e-42 Score=247.09 Aligned_cols=268 Identities=15% Similarity=0.114 Sum_probs=219.2 Q ss_pred Cccc---------------chhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCC Q lcl|Aclame:pro 1 MAFN---------------NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSA 65 (273) Q Consensus 1 MA~~---------------~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~ 65 (273) |++- .+.=|+|.+++++.|...++|.+++.. . ....|++++||.+|...+ .|...|+.+++ T Consensus 1 Ms~~n~~t~~~~~~s~~~~al~le~f~geV~taF~~~si~~~~~~v-r--ti~~GkS~qf~~iG~~~a-~y~~~G~~ldg 76 (402) T protein:vir:97 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDV-Q--TVTGTNTVSNKYLGETEL-QVLAPGQSPNA 76 (402) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCccee-e--eecccceEEEEEEeeeEE-eeeccccccCC Confidence 7651 233499999999999999999988754 2 355799999999999888 56666777899 Q ss_pred cccccceEEEEEEeeeeceeEechHHHHHhHHH-HH-HHHHHHHHHHHHHHHHHHHHHHHhhcccc-------------- Q lcl|Aclame:pro 66 DAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGS-LE-AYTRAGATALATDTDKFIADMLVDNGTAL-------------- 129 (273) Q Consensus 66 ~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~-~~-~~~~~~~~ala~~iD~~~~~~~~~~~~~~-------------- 129 (273) +.+..++..++||+..++.+.|+|+|+.+.+++ ++ ++.++++++|++.+|+.++..+..++-.. T Consensus 77 ~~~~~~k~~ItID~lL~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~~~~~~g 156 (402) T protein:vir:97 77 TPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHG 156 (402) T ss_pred CCcccccEEEEeCceeechhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCcccccc Confidence 999999999999999999999999999999998 65 57899999999999999988775432100 Q ss_pred -----cc---cccCCHhHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcc-cccceeeeeeeeee Q lcl|Aclame:pro 130 -----TG---SAPSDADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTS-GDAAGLRAGTIGNL 200 (273) Q Consensus 130 -----~~---~~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~-~~~~~~~~G~ig~i 200 (273) .+ ....++..+.++|.++...|++++||.++|+++++|++|+.|+++++++ +.++. .+.+.+.+|.|+++ T Consensus 157 ~s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~dRv~vv~P~~y~~Ll~~~rl~-n~d~~~~~~g~~~~G~v~~v 235 (402) T protein:vir:97 157 FSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIV-DKTYTISQSGATINGFVLSS 235 (402) T ss_pred cccccccccchhhcCHHHHHHHHHHHHHHHHhcCCCccccEEEeChHHHHHHhhccccc-chhhccccCCccccceeEEE Confidence 00 0124556678899999999999999999999999999999999987644 44442 33456789999999 Q ss_pred cceEEEEecccccCC------------------------CcEEEEEeCceEEEEEecc-eeeeccCCCcceeeEEeeeee Q lcl|Aclame:pro 201 LGARIVESNNLRDTD------------------------DEQFVAFHPSAAAYVSQID-TVEALRDQDSFSDRIRALHVY 255 (273) Q Consensus 201 ~G~~i~~s~~l~~~~------------------------~~~~~~~~~~a~~~~~~~~-~ve~~~~~~~~~~~v~~~~~~ 255 (273) +||+||+||++|... ...++.|||+|++.++.++ ..|.||+.++|+|+|.+.+.| T Consensus 236 ~Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~vT~~~~~d~r~~~~~id~~~a~ 315 (402) T protein:vir:97 236 YNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMAE 315 (402) T ss_pred eceEEEecCccccccccccccccccCCCCccCCcCcccceeEEEEEecceEEEEEeeccccchhhchhHHHHHHHHHHHh Confidence 999999999998521 0245789999999988665 679999999999999999999 Q ss_pred eeEEEcCceEEEEecCC--C Q lcl|Aclame:pro 256 GGKVVRPTGVVVFNKTG--S 273 (273) Q Consensus 256 g~~vl~p~~~v~~~~~~--s 273 (273) |++++|||+++++.... + T Consensus 316 G~g~~RPeaa~vv~~~~~~t 335 (402) T protein:vir:97 316 GAIPDRWEAVSVVTTKRDAT 335 (402) T ss_pred CCcccCccceEEEEEecccc Confidence 99999999888884333 3 No 42 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=100.00 E-value=1.3e-40 Score=239.30 Aligned_cols=260 Identities=19% Similarity=0.184 Sum_probs=218.6 Q ss_pred Cccc------chhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEE Q lcl|Aclame:pro 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~------~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 74 (273) ||+. .++||+|++.+.+.+.+.+++.+++.++.++++.+|++|+||++...+.+....+|..++.++++.++++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~~~~~~~~ 80 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQLGFKKTT 80 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCcccccccccceEE Confidence 9974 6899999999999999999999999988888888999999999988777777778888999999999999 Q ss_pred EEEEeeeeceeEechHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHHHHhh Q lcl|Aclame:pro 75 LLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTK 153 (273) Q Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~~ 153 (273) +++++ .+..+.++|.+..+...++ ..+.+++++++++++|+++++.+..+.... +....++.|.+|...|++ T Consensus 81 ~~~~~-~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~~------~~~~t~d~i~da~~~l~~ 153 (272) T protein:vir:98 81 MTIKK-AGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQTV------EATATVDGVSKALDIFND 153 (272) T ss_pred EEeee-eeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccc------ccccCHHHHHHHHHHHhc Confidence 99977 4678999999988877775 668899999999999999999875543322 222347789999999988 Q ss_pred cCCCcCCcEEEECHHHHHHHhcchH-HhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCceEEEEEe Q lcl|Aclame:pro 154 ANVPNVGRVVVVNAEMAFWLRSSGS-KLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQ 232 (273) Q Consensus 154 ~~vp~~~r~lvv~p~~~~~L~~~~~-~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a~~~~~~ 232 (273) ++. ..++++++|+++..|+++.. .+......+ .+.+++|.+|+++|++|++|+.+|.+ ++++++++++++..+ T Consensus 154 ~~~--~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~-~~~~~~g~ig~i~G~~Vi~s~~~p~~---t~~~~~~~a~~~~~~ 227 (272) T protein:vir:98 154 EDD--AETVIVMNPADASTLRLDAAKEWLGATEVG-ANRVVSGVYGEVLGVQIVRSRKCPKG---TAYMVRKGALRIMLK 227 (272) T ss_pred cCC--CccEEEEcHHHHHHHHHhcccccccccccc-ccccccccchhhcCeeEEEcCCCCcc---eEEEEcCCeEEEEec Confidence 764 56899999999999987631 122223333 35788999999999999999999864 478899999999865 Q ss_pred c-ceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 233 I-DTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 233 ~-~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) . ..+|.+|++.++.+.++++++||+++++|+++|+++-+.| T Consensus 228 ~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a 269 (272) T protein:vir:98 228 RNTMVETDRDITKAINQIVANKHYGVYLYKAEKAVKITLKDA 269 (272) T ss_pred CCceeeeccccccceeEEEEEEEEEEEEEcCCceEEEEeccc Confidence 4 4899999999999999999999999999999999988888 No 43 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=100.00 E-value=1.3e-40 Score=239.30 Aligned_cols=260 Identities=19% Similarity=0.184 Sum_probs=218.6 Q ss_pred Cccc------chhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEE Q lcl|Aclame:pro 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~------~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 74 (273) ||+. .++||+|++.+.+.+.+.+++.+++.++.++++.+|++|+||++...+.+....+|..++.++++.++++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~~~~~~~~ 80 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQLGFKKTT 80 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCcccccccccceEE Confidence 9974 6899999999999999999999999988888888999999999988777777778888999999999999 Q ss_pred EEEEeeeeceeEechHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHHHHhh Q lcl|Aclame:pro 75 LLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTK 153 (273) Q Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~~ 153 (273) +++++ .+..+.++|.+..+...++ ..+.+++++++++++|+++++.+..+.... +....++.|.+|...|++ T Consensus 81 ~~~~~-~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~~------~~~~t~d~i~da~~~l~~ 153 (272) T protein:vir:30 81 MTIKK-AGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQTV------EATATVDGVSKALDIFND 153 (272) T ss_pred EEeee-eeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccc------ccccCHHHHHHHHHHHhc Confidence 99977 4678999999988877775 668899999999999999999875543322 222347789999999988 Q ss_pred cCCCcCCcEEEECHHHHHHHhcchH-HhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCceEEEEEe Q lcl|Aclame:pro 154 ANVPNVGRVVVVNAEMAFWLRSSGS-KLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQ 232 (273) Q Consensus 154 ~~vp~~~r~lvv~p~~~~~L~~~~~-~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a~~~~~~ 232 (273) ++. ..++++++|+++..|+++.. .+......+ .+.+++|.+|+++|++|++|+.+|.+ ++++++++++++..+ T Consensus 154 ~~~--~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~-~~~~~~g~ig~i~G~~Vi~s~~~p~~---t~~~~~~~a~~~~~~ 227 (272) T protein:vir:30 154 EDD--AETVIVMNPADASTLRLDAAKEWLGATEVG-ANRVVSGVYGEVLGVQIVRSRKCPKG---TAYMVRKGALRIMLK 227 (272) T ss_pred cCC--CccEEEEcHHHHHHHHHhcccccccccccc-ccccccccchhhcCeeEEEcCCCCcc---eEEEEcCCeEEEEec Confidence 764 56899999999999987631 122223333 35788999999999999999999864 478899999999865 Q ss_pred c-ceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 233 I-DTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 233 ~-~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) . ..+|.+|++.++.+.++++++||+++++|+++|+++-+.| T Consensus 228 ~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a 269 (272) T protein:vir:30 228 RNTMVETDRDITKAINQIVANKHYGVYLYKAEKAVKITLKDA 269 (272) T ss_pred CCceeeeccccccceeEEEEEEEEEEEEEcCCceEEEEeccc Confidence 4 4899999999999999999999999999999999988888 No 44 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=100.00 E-value=1e-39 Score=234.33 Aligned_cols=271 Identities=14% Similarity=0.129 Sum_probs=213.7 Q ss_pred CcccchhHHHHHHHHHHHHHHhhccchhhhccccc--cccCCcEEEEEeccccccccccCCCCccCCcccccceEEEEEE Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEG--IASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLID 78 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~--~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid 78 (273) ||.-+ ++|+|+++|+++|.+.+++..|.++.+.. .+..|++|+||+++..++.||++.+......+++.++.+++|+ T Consensus 1 MA~~n-~a~~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~~~gg~tVkI~~i~~~gl~DY~R~~~g~~~g~~~~~~~t~~ld 79 (299) T protein:vir:79 1 MAALN-YAKEYSNVLAQAYPYTLNFGDLYATPNNGRYRWTGSKTIEIPTISTTGRVDSNRDTIAVAQRNYDNAWEPKVLT 79 (299) T ss_pred Cccch-hHHHHHHHHHHHHHhhceeeeeccCcccceeeecCCCEEEEeccccccccccccCCCcccccccCcceeEEEee Confidence 99533 47999999999999999999887765543 3455899999999999999999876434455789999999999 Q ss_pred eeeeceeEechHHHHHhHHHH--HHHH-HHHHHHHHHHHHHHHHHHHHhhcccc---cccccCCHhHHHHHHHHHHHHHh Q lcl|Aclame:pro 79 QEKSIDFLVDDIDRVQVAGSL--EAYT-RAGATALATDTDKFIADMLVDNGTAL---TGSAPSDADDAFDLIASALKELT 152 (273) Q Consensus 79 ~~~~~~~~i~d~d~~~~~~~~--~~~~-~~~~~ala~~iD~~~~~~~~~~~~~~---~~~~~~~~~~~~~~i~~a~~~l~ 152 (273) |++++.|.|++.|..++...+ ...+ +.+...+++++|++++++++..+... ..++..+++++|+.|.++++.|+ T Consensus 80 qdr~~~f~vD~~Dvdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl~~~a~~~g~~~~~~~~T~~n~y~~i~~~~~~ld 159 (299) T protein:vir:79 80 NQRKWSTLVHPADINQTNYVASIGNITKVYNEEQKFPEMDAYCISKIYADWTALGNTADTTVLTTTNVLEVFDKLMEKMT 159 (299) T ss_pred ccccceeccchhhHHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHHHhhhhcCCcccccccCHHHHHHHHHHHHHHHH Confidence 999999999977666665543 3443 45677899999999999987665433 23345688999999999999999 Q ss_pred hcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEE--ecccccC----C---------Cc Q lcl|Aclame:pro 153 KANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVE--SNNLRDT----D---------DE 217 (273) Q Consensus 153 ~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~--s~~l~~~----~---------~~ 217 (273) +++||.+||+++++|+++..|++++.+.... ...+.+..++|.||+++||+|++ |++++.. + .- T Consensus 160 e~~vP~~~rvl~vtp~~~~~L~~~~~f~k~~-~~~~~~~~~~g~Vg~idG~~Ii~Vps~r~~t~~~~~~G~~~~~~ak~i 238 (299) T protein:vir:79 160 EARVPENGRILYVTPVVNTLIKNAKEIQRTV-NIKDAGTSLNRQTTDIDTVKIIKVPSNLMKTAYDFTTGWKVGAGAKQI 238 (299) T ss_pred hcCCCCCCeEEEeCHHHHHHHhhchhhhccc-ccccccceeeeeeeeecceEEEEechhhcCccceeccCccccCccccc Confidence 9999999999999999999999887654443 33445577899999999999987 4445421 1 13 Q ss_pred EEEEEeCceEEEEEecceeeeccCCCcce--eeEEeeeeeeeEEEcCceE---EEEecCCC Q lcl|Aclame:pro 218 QFVAFHPSAAAYVSQIDTVEALRDQDSFS--DRIRALHVYGGKVVRPTGV---VVFNKTGS 273 (273) Q Consensus 218 ~~~~~~~~a~~~~~~~~~ve~~~~~~~~~--~~v~~~~~~g~~vl~p~~~---v~~~~~~s 273 (273) .++++|++|.....+.+.+..+.|...+. +++..+.++++.|++...- +.++.+++ T Consensus 239 n~ii~~~~a~~~~~K~~~~~~~~P~~~~~~~~~~~~r~y~d~~v~~nk~~~i~~~~~~a~~ 299 (299) T protein:vir:79 239 FMSLVHPSAIITPVSYQFSKLDEPTAVTEGKYFYFEESFEDVFILNKKADAIQFVVEGAGA 299 (299) T ss_pred ceEEEcCCeeeeeEeeeeEEeecCCCCCccceeeeeeeeeeeeeeccccCeEEEEeeecCC Confidence 57899999999988888888777755543 4577899999999987743 44566777 No 45 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=100.00 E-value=6.2e-40 Score=235.52 Aligned_cols=259 Identities=14% Similarity=0.081 Sum_probs=215.7 Q ss_pred Cccc----chhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEEEE Q lcl|Aclame:pro 1 MAFN----NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLL 76 (273) Q Consensus 1 MA~~----~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~t 76 (273) ||.+ .++||+|+..+.+++.+.++|.+++..+.++.+.+|++|+||.|..++..+...+|..+++++++.++...+ T Consensus 1 Ma~T~~~d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~igdae~~~eg~~i~~~~lt~~~~~a~ 80 (270) T protein:vir:95 1 MTQTKKANLINPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYAYIGAAEDLQEGVAMDTTQMSMTTTKVT 80 (270) T ss_pred CCceehhhhcchHHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeecCCCccccccCCCccchhhcccchheee Confidence 9985 458999999999999999999999999999999999999999999999888888899999999999999999 Q ss_pred EEeeeeceeEechHHHHHhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHHHHhhcC Q lcl|Aclame:pro 77 IDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTKAN 155 (273) Q Consensus 77 id~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~~~~ 155 (273) |.+ ++.+|.++|++.....++ +.+..+|++..+++++|+++++.+..+.... +....++.|.+|...|.+.. T Consensus 81 i~~-~gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~~a~~~~------~~~~t~~~~~dA~~~lgd~~ 153 (270) T protein:vir:95 81 VKE-TGKAVEVTQTAIITNVNGTLQEASRQLAMSLADKVEIDYIAELNKSKQTA------TVSADATGILDAIEVFNSEN 153 (270) T ss_pred eeh-hhCcceecHHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHhccccccc------ccccCHHHHHHHHHHhcccc Confidence 955 589999999988777666 6889999999999999999999886543222 12233567888989997665 Q ss_pred CCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCceEEEEEec-c Q lcl|Aclame:pro 156 VPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQI-D 234 (273) Q Consensus 156 vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a~~~~~~~-~ 234 (273) ....+++|+|..+..|+++. ++ .... ...+.+++|.||.++|++|+++++.+. ...+++++++|+++..+. . T Consensus 154 --~~~~~i~vhs~~~~~Lrk~~-~~-~~~~-~~~~~~~~G~ig~~~G~~Viv~s~~~~--~~~~~l~~~gAi~~~~~~~~ 226 (270) T protein:vir:95 154 --DEDYVLYVNPKDYNKLVKSL-FK-VGGN-VQDRAISKGDLVEIVGVSDIVKSKRVS--ENTAFLQRYGAMEIVNKKKP 226 (270) T ss_pred --CCCcEEEEcHHHHHHHHhhh-cc-cccc-cccchhcccccceecceeEEEeCCCCC--ceeEEEEeccceeeeecCCc Confidence 33468999999999998764 33 2223 235678899999999999877665442 345789999999988654 4 Q ss_pred eeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEe--cCCC Q lcl|Aclame:pro 235 TVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFN--KTGS 273 (273) Q Consensus 235 ~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~--~~~s 273 (273) .+|.+|++.++.|.+++++|||+++++|+++|+++ +++| T Consensus 227 ~vEtdRd~~~~~d~i~~~~~y~v~~~~~skvv~~t~~~a~~ 267 (270) T protein:vir:95 227 EAYTDFDILKRTHLLSTNYHYSVNLKDETGVVKVTFKPSGS 267 (270) T ss_pred eeeeccchhhcccEEEeeeEEEEEEEccceEEEEEecCCCC Confidence 89999999999999999999999999999999865 5555 No 46 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=100.00 E-value=3.8e-38 Score=225.71 Aligned_cols=266 Identities=15% Similarity=0.110 Sum_probs=214.0 Q ss_pred CcccchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEEEEEEee Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQE 80 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid~~ 80 (273) ||++. .++|++.|+++|...+++..+.+.+++. ..|++|+||+++..+++||++.++. ...+++.++.+++|+++ T Consensus 1 Main~--a~~~~~~Ld~~~~~~~~t~~l~~~~~~~--~ggktVkI~~i~~~gl~DY~R~~g~-~~g~v~~~~et~tl~qd 75 (290) T protein:vir:78 1 MAINY--VDKYGKELDQKLVFGTYTNELETPNLLW--LDAKTFKIQTITTTGLKAHTRNKGY-NEGSASNTNKSYTIDFD 75 (290) T ss_pred CchhH--HHHHHHHHHHHHHhhheeeeccccceee--ccCCEEEEeeeccCcccccccCCCc-ccCccccceeeEEeecc Confidence 99986 4899999999999999999998877655 4599999999999999999998764 55678999999999999 Q ss_pred eeceeEec--hHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccc--cccccCCHhHHHHHHHHHHHHHhhcC Q lcl|Aclame:pro 81 KSIDFLVD--DIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVDNGTAL--TGSAPSDADDAFDLIASALKELTKAN 155 (273) Q Consensus 81 ~~~~~~i~--d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~~~~~~--~~~~~~~~~~~~~~i~~a~~~l~~~~ 155 (273) +++.|.|| |+|+++....+... .+++.+.+++++|++++++++..+... ..+...+++++++.|.++...|++ T Consensus 76 R~~~F~vD~~DvDEt~~~~~~~nv~~ef~~~~v~PEiDayr~skla~~a~~~~~~~~~t~t~~n~~~~i~~~~~~lde-- 153 (290) T protein:vir:78 76 RDVEFFVDVMDVDETGQALSAANVTKEFNSRHAGPEMDAYRFSKLATAAKTNSNSVAEEITKDNVFTKLKAAIRKVKK-- 153 (290) T ss_pred ccceeeccccchhHHhhhhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhhhccCcccccccCHHHHHHHHHHHHHHHHh-- Confidence 99999999 77777666666554 556788899999999999987655432 223456889999999999999986 Q ss_pred CCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecc---c-----------ccCCC--cEE Q lcl|Aclame:pro 156 VPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNN---L-----------RDTDD--EQF 219 (273) Q Consensus 156 vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~---l-----------~~~~~--~~~ 219 (273) ||.+||+|+++|+++..|++++.+....+.....+...+|.|++++||+|++.+. + +.+++ -.+ T Consensus 154 vp~~~rvl~vtp~~~~lL~~~~~f~r~~~~~~~~~~~i~~~V~~idG~~ii~vps~~r~~t~~~f~~G~~~~~~ak~in~ 233 (290) T protein:vir:78 154 YGTQNLVMYVSPDVMAALELSDDFVRAINVQNIGPSSIETRITAIDGTRIVEVEAEDRFYDTFDFTDGYKPAAGAKKLNF 233 (290) T ss_pred cCCCCeEEEECHHHHHHHhhChhhhccccccccccccccceeeeecCcEEEEecccchhhhhhhhcccccccCCccceeE Confidence 8999999999999999999888765544443334445589999999999998652 1 11222 357 Q ss_pred EEEeCceEEEEEecceeeeccCCCc---ceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 220 VAFHPSAAAYVSQIDTVEALRDQDS---FSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 220 ~~~~~~a~~~~~~~~~ve~~~~~~~---~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) +++|++|.....+...+..+.|.-. -+|++..+.+|++.|++...-.++.+..= T Consensus 234 ii~~~~a~i~~~K~~~~~~~~P~~~~~~d~~~~~~r~y~d~~v~~nk~~~i~~~~~~ 290 (290) T protein:vir:78 234 LLVNKGSVVGGAKHASIYLHAPGSVGQGDGWLYQYRVYHDIFVLDQQKDGVIASTEV 290 (290) T ss_pred EEEcCCceeeeeeeeEEEeeCCCCCcCcceeeeeeeeeeeeeeeccccCeeEEEeeC Confidence 8999999988888877777665544 46899999999999999887666654444 No 47 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=100.00 E-value=2.2e-38 Score=227.03 Aligned_cols=267 Identities=16% Similarity=0.138 Sum_probs=213.4 Q ss_pred Ccc-c--------------chhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCC Q lcl|Aclame:pro 1 MAF-N--------------NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSA 65 (273) Q Consensus 1 MA~-~--------------~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~ 65 (273) |++ | .|.=|+|.+++++.|...+++.++.... ....|++++||++|...+. |...|+.++. T Consensus 1 Ms~~n~~t~~~~~~sg~~~al~Le~f~GeV~taF~~~si~~~~~~vR---ti~~gkS~qf~~~G~s~~~-~~~pG~~ld~ 76 (401) T protein:vir:70 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQ---TVTGTNTVSNKYLGETELQ-VLAPGQSPAA 76 (401) T ss_pred CCCCccccccccccccchhHhHHhHhcchHHHHHHHHhhhcccceee---eecccceEEEEEeeeeEee-eecCCCCcCC Confidence 765 1 2335899999999999999998887542 3567999999999998876 4556777889 Q ss_pred cccccceEEEEEEeeeeceeEechHHHHHhHHH-HH-HHHHHHHHHHHHHHHHHHHHHHHhhcccc-------------- Q lcl|Aclame:pro 66 DAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGS-LE-AYTRAGATALATDTDKFIADMLVDNGTAL-------------- 129 (273) Q Consensus 66 ~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~-~~-~~~~~~~~ala~~iD~~~~~~~~~~~~~~-------------- 129 (273) +.+..++..|+||...+..+.|+|.|+.+.+++ ++ ++.++++++||+.+|+.++.++..++-.. T Consensus 77 ~~~~~dK~~ItID~lL~a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~~p~~~~~G 156 (401) T protein:vir:70 77 TSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRTNPRVKGHG 156 (401) T ss_pred CCcccccEEEEeCceeehhhhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCcCCCc Confidence 999999999999999999999999999999998 65 68899999999999999988885432110 Q ss_pred -----c---ccccCCHhHHHHHHHHHHHHHhhcCCCcCCcEEE-ECHHHHHHHhcchHHhhhhhccc-ccceeeeeeeee Q lcl|Aclame:pro 130 -----T---GSAPSDADDAFDLIASALKELTKANVPNVGRVVV-VNAEMAFWLRSSGSKLTSADTSG-DAAGLRAGTIGN 199 (273) Q Consensus 130 -----~---~~~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~lv-v~p~~~~~L~~~~~~~~~~~~~~-~~~~~~~G~ig~ 199 (273) . ..+..++..+.++|.+++..|++++||.. |+++ ++|..|..|+..++ +.+.++.. +.+...+|.|.+ T Consensus 157 ~~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~~-r~vvl~pp~~Ys~Ll~~d~-L~nrd~~~s~~g~~~~G~v~~ 234 (401) T protein:vir:70 157 FSINVEVAEGEALVNPQYVMAAVEFALEQQLEQEVDIS-DVAILMPWRYFNVLRDADR-IVDKTYTISQSGATIQGFTLS 234 (401) T ss_pred eEEeccccccccccCHHHHHHHHHHHHHHHHhcCCCcc-ceEEEcCHHHHHHHHhcCc-ccchhhccccCCccccceEEE Confidence 0 01223455688899999999999999965 6655 56667777776654 44555432 345688999999 Q ss_pred ecceEEEEecccccCC---------------C---------cEEEEEeCceEEEEEecc-eeeeccCCCcceeeEEeeee Q lcl|Aclame:pro 200 LLGARIVESNNLRDTD---------------D---------EQFVAFHPSAAAYVSQID-TVEALRDQDSFSDRIRALHV 254 (273) Q Consensus 200 i~G~~i~~s~~l~~~~---------------~---------~~~~~~~~~a~~~~~~~~-~ve~~~~~~~~~~~v~~~~~ 254 (273) ++||+||+||++|..+ . ...+.|||+|.+.++.++ ..|.||+.+.|+|++.+.+. T Consensus 235 vaGv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~~lt~~~~~d~r~~~~~id~~~a 314 (401) T protein:vir:70 235 SYNCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSIDVTGDIFYEKKEKTYYIDTFMA 314 (401) T ss_pred EeceEEEeeccccccccccccccccccCCCccCCCCccccceeEEEEehhheEEEEeeccccchhhhhhhhHHHHHHHHH Confidence 9999999999998532 0 235788999999987765 67899999999999999999 Q ss_pred eeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 255 YGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 255 ~g~~vl~p~~~v~~~~~~s 273 (273) ||++++|||++++++...+ T Consensus 315 ~g~g~~RPeaa~vv~~k~~ 333 (401) T protein:vir:70 315 EGAIPDRWEAVSVVTTKRN 333 (401) T ss_pred hCCcccchhheEEEeecCc Confidence 9999999999999865555 No 48 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=100.00 E-value=6.9e-38 Score=224.33 Aligned_cols=268 Identities=15% Similarity=0.132 Sum_probs=214.1 Q ss_pred Ccc---------------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCC Q lcl|Aclame:pro 1 MAF---------------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSA 65 (273) Q Consensus 1 MA~---------------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~ 65 (273) |++ ..|.=|+|.+++++.|...+++.+++... ....|+|++||++|...+. |...|++++. T Consensus 1 Ms~~n~~t~p~~~gsg~~~aL~Le~f~GeV~taF~~~si~~~~~~vR---tI~~gkS~qf~~lG~s~a~-y~~pG~~ldg 76 (400) T protein:vir:10 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQ---TVTGTNTVSNKYLGETELQ-VLAPGQSPAA 76 (400) T ss_pred CCCCccccccccccccchhhhHHhHhcchHHHHHHHHhhhcccceee---eecccceEEEEEeeeeEEe-eecCCCCcCC Confidence 765 13346999999999999999998887542 3567999999999998875 6667888899 Q ss_pred cccccceEEEEEEeeeeceeEechHHHHHhHHH-HH-HHHHHHHHHHHHHHHHHHHHHHHhhcccc-------------- Q lcl|Aclame:pro 66 DAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGS-LE-AYTRAGATALATDTDKFIADMLVDNGTAL-------------- 129 (273) Q Consensus 66 ~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~-~~-~~~~~~~~ala~~iD~~~~~~~~~~~~~~-------------- 129 (273) +++..++..|+||...+....|+|.|+.+.++| ++ ++.++++++||+.+|+.++..+..++... T Consensus 77 ~~~~~dk~~ItIDtLL~a~~~V~dlDd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~~~g~~~g 156 (400) T protein:vir:10 77 TSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTNPRVKGHG 156 (400) T ss_pred CCcccCcEEEEeCceeeecchhhhHHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCccccc Confidence 999999999999999999999999999999998 65 57899999999999999987775442100 Q ss_pred --------cccccCCHhHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccc-cceeeeeeeeee Q lcl|Aclame:pro 130 --------TGSAPSDADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGD-AAGLRAGTIGNL 200 (273) Q Consensus 130 --------~~~~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~-~~~~~~G~ig~i 200 (273) ......++..+..+|.++...|+|++||.+.++++++|+.|..|+..++ +.+.++..+ .....+|.|.++ T Consensus 157 ~s~~v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP~~d~vvl~pp~~Ys~Ll~~dk-Lvnrdf~~s~~g~~~~g~v~~v 235 (400) T protein:vir:10 157 FSVNVEVNEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADR-IVDKSYTISQSGATIQGFVLSS 235 (400) T ss_pred cceeecccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhCCc-ccchhccccCCCccccceEEEE Confidence 0011124455777899999999999999765555677777777877664 445554322 345778999999 Q ss_pred cceEEEEecccccCC------------------------CcEEEEEeCceEEEEEecc-eeeeccCCCcceeeEEeeeee Q lcl|Aclame:pro 201 LGARIVESNNLRDTD------------------------DEQFVAFHPSAAAYVSQID-TVEALRDQDSFSDRIRALHVY 255 (273) Q Consensus 201 ~G~~i~~s~~l~~~~------------------------~~~~~~~~~~a~~~~~~~~-~ve~~~~~~~~~~~v~~~~~~ 255 (273) +||+|++||++|... ....+.|||+|.+.++.++ ..|.||++++|+|++.+.+.| T Consensus 236 ~Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~tvk~~~lt~~~~~d~r~~~~~id~~~a~ 315 (400) T protein:vir:10 236 YNCPVIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALLVGRSIDVIGDIFYEKKEKTYYIDTFMSE 315 (400) T ss_pred eceEEEeeCcCCcccCcccccccccCCCCccCCccccccceeEEEEehhheEEEEeeccccccccchhhHHHHHHHHHHh Confidence 999999999998532 0235789999999987765 679999999999999999999 Q ss_pred eeEEEcCceEEEEecCCC Q lcl|Aclame:pro 256 GGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 256 g~~vl~p~~~v~~~~~~s 273 (273) |++++|||++++++-... T Consensus 316 G~g~~RPeaa~vv~~~~~ 333 (400) T protein:vir:10 316 GAIPDRWEAVSVVTTKRQ 333 (400) T ss_pred CCcccchhheEEEEecCC Confidence 999999999999977666 No 49 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=100.00 E-value=3.6e-37 Score=220.39 Aligned_cols=228 Identities=21% Similarity=0.185 Sum_probs=192.1 Q ss_pred cccccCCcEEEEEeccccccccccCCCCccCCcccccceEEEEEEeeeeceeEechHHHHHhHHH-HHHHHHHHHHHHHH Q lcl|Aclame:pro 34 EGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALAT 112 (273) Q Consensus 34 ~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~ 112 (273) +.-.+.||||+||+| ++.++...+|..++++.++.++.+++|.+ .+.+|.|+|++.....++ +.+..+|++.+||+ T Consensus 1 ~~~~~~Gdtit~P~~--iGda~~v~eG~~i~~~~l~~t~~~atIk~-~gk~~~itD~a~l~~~gDp~~ea~~Q~~~~iA~ 77 (231) T protein:vir:73 1 ENGINLANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKK-AAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) T ss_pred CccccCCceEEeccc--ccchhhhcCCCcCChhhccccceeeeEee-eccceeeeHHHHhhccCchHHHHHHHHHHHHHH Confidence 222455999999987 88888899999999999999999999965 599999999999888877 47889999999999 Q ss_pred HHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhccccccee Q lcl|Aclame:pro 113 DTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGL 192 (273) Q Consensus 113 ~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~ 192 (273) ++|+++++.+..+..... ....++.|.+|...|++.. ..+++++|+|+.+..|+++.+++...+. +.++++ T Consensus 78 kvD~di~~~~~~a~l~~~------~~~t~d~i~~A~~~fgde~--~~~~vivv~p~~~~~Lrk~~~~~~~~~~-~g~~i~ 148 (231) T protein:vir:73 78 KVDDDLLKAAKTTSQTVS------TKANVDGVQAALDIFNDED--AQAYVLIVNPKDAAKIRKDANAKNIGSE-VGANAL 148 (231) T ss_pred hhhHHHHHhhcccccccc------ccccHHHHHHHHHHhcccc--ccceEEEEcchHHHhhhhccchhhhhhh-hcccee Confidence 999999998865443221 2235788999999998876 4568999999999999998765544433 456789 Q ss_pred eeeeeeeecceEEEEecccccCCCcEE-EEEeCceEEEEEe-cceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEec Q lcl|Aclame:pro 193 RAGTIGNLLGARIVESNNLRDTDDEQF-VAFHPSAAAYVSQ-IDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNK 270 (273) Q Consensus 193 ~~G~ig~i~G~~i~~s~~l~~~~~~~~-~~~~~~a~~~~~~-~~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~ 270 (273) ++|.||+++|++|++|+++|.+++... +++.++|+++..+ ...+|.+|++++++|.+++++||++++++|+++|+++= T Consensus 149 ~~G~iG~i~G~~Vi~S~~~~~~~~~~~~~i~~~gAl~~~~k~~~~vEtdRd~~~k~~~i~~~~~y~v~l~~~~~vv~~t~ 228 (231) T protein:vir:73 149 INGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITF 228 (231) T ss_pred eecccceEcceEEEEcCCCCCCceeeeeEEeeccceeeeecccceeeccccccccccEEEEeEEEEEEEEcCccEEEEEe Confidence 999999999999999999998766432 5678999998754 45899999999999999999999999999999999988 Q ss_pred CCC Q lcl|Aclame:pro 271 TGS 273 (273) Q Consensus 271 ~~s 273 (273) +|- T Consensus 229 ~g~ 231 (231) T protein:vir:73 229 TGV 231 (231) T ss_pred ecC Confidence 888 No 50 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=100.00 E-value=8.2e-37 Score=218.43 Aligned_cols=269 Identities=17% Similarity=0.116 Sum_probs=198.7 Q ss_pred Cccc--chhHHHHHHHHHHHHH-HhhccchhhhccccccccCCcEEEEEeccccccc------cccCCCC-ccCCccccc Q lcl|Aclame:pro 1 MAFN--NFIPELWSDMLLEEWT-AQTVFANLVNREYEGIASKGNVVHIAGVVAPTVK------DYKAAGR-QTSADAISD 70 (273) Q Consensus 1 MA~~--~~~pev~~~~v~~~l~-~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~------d~~~~~~-~~~~~~~~~ 70 (273) ||.+ ...-+.|++++...++ +...|.+.+.. .....++++++.|.....+.- ...+.+. +....+... T Consensus 13 Ms~~i~~~fv~qy~~~v~~~~qq~~s~L~~tV~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~dtp~~~~~~ 90 (322) T protein:vir:10 13 IAGDIDQAFVQTYETTLRILSQQKSAKLKQYCQH--KNESSESHNWETLASMDPDAVKRKRSRQQSADGTYPTPVNNKPF 90 (322) T ss_pred eechhhhHHHHHHHHHHHHHHHHhhhhhhccccc--ccccccccceeecccccccccccccccccccCcccCCCcccccc Confidence 6653 1124778888888775 44566665532 223445677777765443321 1112222 122234456 Q ss_pred ceEEEEEEeeeeceeEechHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-----------ccCCHh Q lcl|Aclame:pro 71 TGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADMLVDNGTALTGS-----------APSDAD 138 (273) Q Consensus 71 ~~~~~tid~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~~~-----------~~~~~~ 138 (273) +...+.+.+ +++++.|++.|+.+.+++. +.++++++.+|+++.|..+++.+...+.....+ ...+.. T Consensus 91 ~~r~~~~~d-~~~~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~~~~~gt~v~~~ss~~i~~g~~g 169 (322) T protein:vir:10 91 AKRRTNVDT-YDTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPASIKGTGQPVEFLATQEIGDGTKP 169 (322) T ss_pred ceEEEeecc-cccceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhccccccccccccccCCCcccccCccc Confidence 777766644 5889999999999999986 568999999999999999987665433221111 111234 Q ss_pred HHHHHHHHHHHHHhhcCCCcCC-cEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCC- Q lcl|Aclame:pro 139 DAFDLIASALKELTKANVPNVG-RVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD- 216 (273) Q Consensus 139 ~~~~~i~~a~~~l~~~~vp~~~-r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~- 216 (273) ..++.|+++++.|++++||.++ ||++++|++|.+||+++. +.+.++.+.+...++|.|++++||+|++|++||..+. T Consensus 170 ~t~~kl~~a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~d~~-~ts~D~~~~~~l~~~G~ig~~lGf~~i~s~~lp~~~~t 248 (322) T protein:vir:10 170 ISFDYVTEITERFLENEIEPEVSKVIVIGPTQARKLLQITE-ATSADYTSAMDLQSKGIITNWMGYTWIVSTRLDKFDPT 248 (322) T ss_pred hhHHHHHHHHHHHHhcCCCCCCCeEEEeCHHHHHHHhcchh-hhhhhcccchhhhhcCeeeeeeeEEEEEeccCCccccc Confidence 4688999999999999999864 999999999999998775 6788888776666889999999999999999984322 Q ss_pred --------------cEEEEEeCceEEEEEecc-eee-eccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 217 --------------EQFVAFHPSAAAYVSQID-TVE-ALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 217 --------------~~~~~~~~~a~~~~~~~~-~ve-~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ..|+++|++|++++++.+ +++ .+++.+.+++.|++.+.||+++++|++||+|.-.-| T Consensus 249 ~~~~~~~~~~~~~~~~~~a~~k~Av~~a~~~dv~~~i~~~~~~~~a~~I~~~~~~Ga~ri~~~gVv~i~~~e~ 321 (322) T protein:vir:10 249 QWGMAAEDGPQGDEIWCIAMTDMALGYHSCKDIWTKVAEDPSASFAWRIYSAFTADCVRVEDEHIFKLRLKNS 321 (322) T ss_pred cccccccCCCCccceeEEEEecCceeEEEeeeeeEEeeccCCcchhhhhhhhhhhCceEeccCcEEEEEEecc Confidence 358999999999998653 566 556677779999999999999999999999999999 No 51 >protein:vir:102335 Length: 312 # NCBI annotation: putative capsid protein # Family: family:all:701 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529560;genbank:gi:90592716;genbank:GeneID:3974467 Probab=100.00 E-value=1.7e-35 Score=211.14 Aligned_cols=270 Identities=12% Similarity=0.102 Sum_probs=207.9 Q ss_pred CcccchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCcc-CCcccccceEEEEEEe Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQT-SADAISDTGVDLLIDQ 79 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~-~~~~~~~~~~~~tid~ 79 (273) |||++-+.+.|++.|++++...+++..+...........|++|+||++...+++||+|.++.. +..+++.++.+++|++ T Consensus 1 Mantl~ya~~~~~~LD~~~~~~~~s~~l~~~~~~v~~~ggktVkIp~i~~~gl~DY~R~~g~~~~~g~v~~~~et~tl~q 80 (312) T protein:vir:10 1 MANTLAYGQVLQQGLDKQATQELLTGWMDSNAKQIKYEGGKEVKIGKLSTDGLGDYSRGSANAYVGGDVKFEYETKTMTQ 80 (312) T ss_pred CCcchhHHHHHHHHHHHHHHhhhccccccCCCceEEEecCcEEEEEeeecccccccccccCCccccccccccceeEEeee Confidence 999888999999999999999999888864433334566899999999999999999976622 3447899999999999 Q ss_pred eeeceeEec--hHHHHHhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhcccccc------cccCCHhHHHHHHHHHHHH Q lcl|Aclame:pro 80 EKSIDFLVD--DIDRVQVAGSLEAYTR-AGATALATDTDKFIADMLVDNGTALTG------SAPSDADDAFDLIASALKE 150 (273) Q Consensus 80 ~~~~~~~i~--d~d~~~~~~~~~~~~~-~~~~ala~~iD~~~~~~~~~~~~~~~~------~~~~~~~~~~~~i~~a~~~ 150 (273) ++++.|.|| |+|+++....+..++. ++...+++++|+++|++++..+....+ +...+++++++.|.++... T Consensus 81 DR~~~F~vD~mDvDETn~~~s~anv~~ef~r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~~T~~ni~~~i~~~~~~ 160 (312) T protein:vir:10 81 DRGRKFTLDAMDVDETNFLVTATTVMGEFQRLKVIPEIDAYRLSRLATIAIGIKGDTNVEYSYSVNSSTIINKIKTGIKI 160 (312) T ss_pred cccceeeccccchhhHhhHHHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhccccccccccccccCHHHHHHHHHHHHHH Confidence 999999999 7777665566666554 577889999999999999865543321 3346889999999999999 Q ss_pred HhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEec--ccc------cC-------- Q lcl|Aclame:pro 151 LTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESN--NLR------DT-------- 214 (273) Q Consensus 151 l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~--~l~------~~-------- 214 (273) |++++|| .+|+|+++|.++..|.++. ..+.......+...+|.|++++|++|++.+ ++. ++ T Consensus 161 lde~~vp-~~rvl~vTp~~~~lLk~~~--~~~~~~~~~~~~~i~~~V~~iDgv~Ii~VPs~r~~t~~~f~dG~t~~~~~g 237 (312) T protein:vir:10 161 IRENGYN-GPLVCHLTYDSMFAIEEKV--LEKLTAVTFAQGGIQTQVPSIDGCALIKTPQNRMYSSILLNDGTTSNQTAG 237 (312) T ss_pred HHHccCC-CceEEEeChHHHHHHhhhh--hceecccccccceeeeeeeeecccEEEEchhhhccceeeeccCcccccccC Confidence 9999999 6999999999998887643 222222233445568999999999999854 332 01 Q ss_pred --------CCcEEEEEeCceEEEEEecceeeeccCCC---cceeeEEeeeeeeeEEEcCceEEEE---ecCCC Q lcl|Aclame:pro 215 --------DDEQFVAFHPSAAAYVSQIDTVEALRDQD---SFSDRIRALHVYGGKVVRPTGVVVF---NKTGS 273 (273) Q Consensus 215 --------~~~~~~~~~~~a~~~~~~~~~ve~~~~~~---~~~~~v~~~~~~g~~vl~p~~~v~~---~~~~s 273 (273) ..-.+++.|++|.....+.+.+..+.|.. ..+|++..+.++++.|++...-.+. +.+.. T Consensus 238 g~~~~~~ak~INfiiv~~~a~i~~~K~~~~~if~P~~~~~~d~~~~~~R~Y~D~fv~~nk~~~Iyv~~k~a~~ 310 (312) T protein:vir:10 238 GYLKGTKALDTNFIIAPVDVPLAITKQDKMRIFDPETNQTANAWSMDYRRYHDLWVTDNKANSVYANFKDAKP 310 (312) T ss_pred ceeecCcccccceEEeCCceeeceeeeeeeeeeCCCCCCCcceeeeeeeeeeeeeeeccccCeEEEEeecccC Confidence 11347899999998887777776665533 3479999999999999988855543 22222 No 52 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=100.00 E-value=6.1e-35 Score=208.17 Aligned_cols=269 Identities=13% Similarity=0.103 Sum_probs=206.1 Q ss_pred CcccchhHHHHHHHHHHHHHHhhccch-hhhccc--cccccCCcEEEEEecc-ccccccccCCCCccCCcccccceEEEE Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWTAQTVFAN-LVNREY--EGIASKGNVVHIAGVV-APTVKDYKAAGRQTSADAISDTGVDLL 76 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~~~~v~~~-~~~~d~--~~~~~~Gdtv~ip~~~-~~~~~d~~~~~~~~~~~~~~~~~~~~t 76 (273) ||.+. .+.|+..|+++|...++... +..... ..++..|++|+||++. ..+.+||++.++.....+++.++.+++ T Consensus 1 Mainy--a~~~~~~Ld~~~~~~~lts~~l~~~~~~~~v~~~ggktVkIp~is~tsGl~DY~R~~g~~~~g~v~~~~et~t 78 (346) T protein:vir:10 1 MTINY--AEKYQAAVQQAFYDGHLYSAELWNSPSNSIIKFDGAKHIKVPRLEITSGRKDRQRRTITTPVANYSNDWDSYE 78 (346) T ss_pred Ccchh--HHHHHHHHHHHHHhhhccchhhcccccccceEecCCCEEEEEEeeeecccccccccCCcccccccccceeEEE Confidence 99985 57899999999988765533 322211 2234568999999997 468999999888765578999999999 Q ss_pred EEeeeeceeEec--hHHHHHhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhcccc----cccccCCHhHHHHHHHHHHH Q lcl|Aclame:pro 77 IDQEKSIDFLVD--DIDRVQVAGSLEAYT-RAGATALATDTDKFIADMLVDNGTAL----TGSAPSDADDAFDLIASALK 149 (273) Q Consensus 77 id~~~~~~~~i~--d~d~~~~~~~~~~~~-~~~~~ala~~iD~~~~~~~~~~~~~~----~~~~~~~~~~~~~~i~~a~~ 149 (273) |++++++.|.|| |+|++.....+..++ +.+....++++|+++|++++..+... ..+...+++++++.|.++.. T Consensus 79 l~qDR~~~F~vD~mDvDETn~~~~~anv~~ef~r~~vvPEiDayrfskLa~~a~~~~~~~~~~~a~T~~ni~~~i~~~~~ 158 (346) T protein:vir:10 79 LKNERYWSTLVDPSDIDETNMVVSLANITKQFNLDSKMPEKDRYMFSHLYSGKEAAHDGGITTNTLDEKNILPAFDNMML 158 (346) T ss_pred eeccccceecccccchHHHHHHhHHHHHHHHHHHHhhcchhhHHHHHHHHHhhhhhccccccccccCHHHHHHHHHHHHH Confidence 999999999999 666554444455554 45677789999999999987654332 22344688999999999999 Q ss_pred HHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEE--eccccc-----------CC- Q lcl|Aclame:pro 150 ELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVE--SNNLRD-----------TD- 215 (273) Q Consensus 150 ~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~--s~~l~~-----------~~- 215 (273) .|++++||.+||+|+++|+++..|++++.+..+.+. ++.+. .+|.|++++||+|++ |++++. .+ T Consensus 159 ~lde~~vp~~~rvl~vTp~~~~lLk~s~~f~k~~~v-~~~~~-i~~~V~siDGv~Ii~VPs~r~~t~~~f~~G~~~~t~a 236 (346) T protein:vir:10 159 DFDEARIPSTNRILYVTPKTNAILKRAEAMNRALTL-KDPNN-IQRTVYSLDDVTIRVVPSDLMQTAYDFSDGSKIIDTA 236 (346) T ss_pred HHHHccCCCCCeEEEECHHHHHHHhhchhheecccc-ccccc-cceeeeeecCeEEEEcchhhcccchhhccCccccCCc Confidence 999999999999999999999999888765544443 33334 489999999999987 444431 11 Q ss_pred -CcEEEEEeCceEEEEEecceeeeccCCCcc--eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 216 -DEQFVAFHPSAAAYVSQIDTVEALRDQDSF--SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 216 -~~~~~~~~~~a~~~~~~~~~ve~~~~~~~~--~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) .-.++++|++|.....+.+.+..+.|.... +|++..+.+|++.|++...-.++....+ T Consensus 237 k~INfiiv~~~A~ia~~K~~~~~if~P~~~~~g~~l~~~R~Y~D~fv~~nk~~~Iyv~~~~ 297 (346) T protein:vir:10 237 KQIEMFLIYNGVQIAPEKYSFVGFDQPSAATSGNYLYYEQSYDDVLLLNTKTKGIQFVVSD 297 (346) T ss_pred cceeEEEECCceeeeeeeeeeeEeeCCCCCcccceeeeeeeeeeeeeeccccceEEEeeec Confidence 235789999999988888888777765443 3799999999999999886666544433 No 53 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=100.00 E-value=1.5e-32 Score=195.10 Aligned_cols=268 Identities=18% Similarity=0.166 Sum_probs=207.0 Q ss_pred CcccchhHHHHHHHHHHHHHHhhccchhhhccc--cccccCCcEEEEEecc-ccccccccCCCCccCCcccccceEEEEE Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREY--EGIASKGNVVHIAGVV-APTVKDYKAAGRQTSADAISDTGVDLLI 77 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~~~~v~~~~~~~d~--~~~~~~Gdtv~ip~~~-~~~~~d~~~~~~~~~~~~~~~~~~~~ti 77 (273) ||++. .+.|.+.|.++|...+.+..+...+. ...+..|++|+||++. ..++.+|.+..+ ....+++.++.+++| T Consensus 1 Main~--~~k~~~~ld~~~~~~~~~~~l~~~~n~~~~~~~gak~VkIp~ist~~gl~dY~R~~g-~~~g~v~~~~et~tl 77 (285) T protein:vir:79 1 MTVVL--DSKDLARIDEEYKADSQVWSYLTGGNGVTQRFRGHNEVRINKLSGFVDATAYKRGQD-NARKTISVGKETVKL 77 (285) T ss_pred Ccchh--hHHHHHHHHHHHHHhhhhhhhcccCCcceeEecCCCEEEEeeecccccccccccccC-ccccccceeeeEEEe Confidence 99984 68999999999999888877764432 3345568999999996 578999998776 567789999999999 Q ss_pred EeeeeceeEechHHHHHh-HHHHHHHHHH-HHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHHHHhhcC Q lcl|Aclame:pro 78 DQEKSIDFLVDDIDRVQV-AGSLEAYTRA-GATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTKAN 155 (273) Q Consensus 78 d~~~~~~~~i~d~d~~~~-~~~~~~~~~~-~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~~~~ 155 (273) ++++++.|.||..|..++ ......++++ +...+++++|+++|++++..+.... ++..+++++++.|.++...|++++ T Consensus 78 ~~DR~~~f~iD~mDvdEn~~~~~~ni~~ef~~~~vvPEiDayrfskla~~a~~~~-~~~~T~~nv~~~i~~~~~~lde~~ 156 (285) T protein:vir:79 78 THEDWFGYDLDQFDMDENGAYTVENVVREHNKMITIPHRDKVAVQKLFDSAAKKA-TDSITKDNALDAYDTAEAYMFDNE 156 (285) T ss_pred eccccceecccccchhhhhhhhHHHHHHHHHhhhhcchhhHHHHHHHHhhccccc-ccccCHHHHHHHHHHHHHHHHHcC Confidence 999999999995554442 2234555555 5667899999999999987665443 455788999999999999999999 Q ss_pred CCcCCcEEEECHHHHHHHhcchHHhhhhhccccc-ceeeeeeeeeecc-eEEEEec--ccccCC---CcEEEEEeCceEE Q lcl|Aclame:pro 156 VPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDA-AGLRAGTIGNLLG-ARIVESN--NLRDTD---DEQFVAFHPSAAA 228 (273) Q Consensus 156 vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~-~~~~~G~ig~i~G-~~i~~s~--~l~~~~---~~~~~~~~~~a~~ 228 (273) || .+|+|+++|+++..|++++.+....+..... ..-.++.|++++| ++|++.+ +++..+ .-.+++.|++|.. T Consensus 157 vp-~~rvl~vTp~~~~~Lk~s~~~~r~~~~~~~~~~~~i~~~V~~lDg~v~ii~Vps~r~kt~~~~k~Infiiv~~~a~i 235 (285) T protein:vir:79 157 VP-GGFVMFVSSAYYTALKQSAAVTRTFSTDGTMVINGIDRRVAQLDGGVPIVRVSSDRLKGLGITNHVNFILTPLSAIA 235 (285) T ss_pred CC-CceEEEEChHHHHHHHhhhhhheecccccceeccceeeeeccccceeEEEEcchhhccCcCcchhccEEEecCceec Confidence 99 6999999999999999888755444332211 1123678999999 8998854 443222 2357999999987 Q ss_pred EEEecceeeeccCC---CcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 229 YVSQIDTVEALRDQ---DSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 229 ~~~~~~~ve~~~~~---~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ...+.+.+..+.|+ +.-+|++..+.+|++.|++...-.++.+..+ T Consensus 236 ~~~K~~~~~~f~P~~~~~~d~~~~~~R~Y~d~fv~~nk~~~Iy~~~~a 283 (285) T protein:vir:79 236 PIVKYDSVSVIDPSTDRSGNRWTIKGLSYYDAIVLDNAKKGIYVAATA 283 (285) T ss_pred cceeeeeeEeECCCCCCCcceeeeeeeeeeeeeehhhccceeeeeecc Confidence 77767776666555 4457999999999999999987777766666 No 54 >protein:vir:99523 Length: 311 # NCBI annotation: putative protein # Family: family:all:701 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958538;genbank:gi:41179320;genbank:GeneID:2717161 Probab=100.00 E-value=3.1e-32 Score=193.33 Aligned_cols=267 Identities=14% Similarity=0.134 Sum_probs=202.0 Q ss_pred CcccchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEEEEEEee Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQE 80 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid~~ 80 (273) ||.| +.+.|++.|+++|...++...+.+.+.+. ...|++|+||++...+++||+|.++. ...+++.++.+++|+++ T Consensus 8 mAln--ya~~~~~~Ld~~~~~~~~t~~l~~~~~~~-~~Gak~VkIp~i~~~gl~dY~R~~g~-~~g~v~~~~et~tl~~D 83 (311) T protein:vir:99 8 RGFN--YVTKDGNLLDQKITAGLFTAALGTPEVDL-VNGGRSFTLKTISTSGLKDHTRGKGF-NSGTISDEKTIYTMGQD 83 (311) T ss_pred hHHH--HHHHHHHHHHHHHHhhhcccceecCchhe-eecCCEEEEEeeeeccccccccccCc-cccceeeeeeEEEeeec Confidence 5543 58999999999999999988888777654 34589999999999999999998874 56789999999999999 Q ss_pred eeceeEec--hHHHHHhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhccccc--------------ccccCCHhHHHHH Q lcl|Aclame:pro 81 KSIDFLVD--DIDRVQVAGSLEAYTR-AGATALATDTDKFIADMLVDNGTALT--------------GSAPSDADDAFDL 143 (273) Q Consensus 81 ~~~~~~i~--d~d~~~~~~~~~~~~~-~~~~ala~~iD~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~ 143 (273) +++.|.|| |+|+......+..++. .+....++++|++++++++..+.... .....+.+++++. T Consensus 84 R~~~f~vD~mDvdETn~~~~~ani~~~f~r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~~~~~~~~~lt~~nvl~~ 163 (311) T protein:vir:99 84 RDVEFYLDRQDVDETDNELAMANISNVFITEHVQPELDSYRFSKIATSFDNLDGTDTEGTLLAKTHKTEETLDETNAYSQ 163 (311) T ss_pred cceeeecchhchhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhcccccccchhhhccccccccccCHHHHHHH Confidence 99999999 5555444444555544 46677899999999999976543221 2234678899999 Q ss_pred HHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEe-c--cccc------- Q lcl|Aclame:pro 144 IASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVES-N--NLRD------- 213 (273) Q Consensus 144 i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s-~--~l~~------- 213 (273) |..+...|++ +|.+||+|+++|+.+..|++++.+....+.........++.|++++|++|++. + +++. T Consensus 164 l~~~~~~~~~--v~~~~rvl~vTp~~~~lLk~~~~~~r~~~~~~~~~~~i~~~V~~lDgv~Ii~V~ps~r~~t~~~ft~G 241 (311) T protein:vir:99 164 LKTGIGKVRK--YGTQNLVGYVSSEVMDALERSKEFTRNITNQNVGTTALESRITSIDGVQLIEVYESNRFMTKYDFTDG 241 (311) T ss_pred HHHHHHHHHh--cCCCCeEEEEChHHHHHHhhchhhheeeecccccccccccccceecCeEEEEecCchhhcchhhhcCC Confidence 9999999987 68899999999999998887765543343332222334788999999999865 3 3321 Q ss_pred ----CC--CcEEEEEeCceEEEEEecceeeeccC---CCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 214 ----TD--DEQFVAFHPSAAAYVSQIDTVEALRD---QDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 214 ----~~--~~~~~~~~~~a~~~~~~~~~ve~~~~---~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) .+ .-.+++.|++|.....+.+.+..+.| .+..+|++..+.++++.|++...-.+..+-.. T Consensus 242 ~~~~~~ak~INfiiv~~~a~i~~~K~~~v~~f~P~~~~~gd~~l~~~R~Y~D~fv~~nk~~~Iyv~~k~ 310 (311) T protein:vir:99 242 AKPTEDAKAINFLVVAKPAVISIVKENAVFLFAPGQHTDGDGYLYQNRLYHDLFIKKHKRDGIFVSVKK 310 (311) T ss_pred ccccCcccccceEEeCCCeeeeeeeeeeeeeeCCCCCCCcceeeeeeeeeeeeeeeccccCeEEEeeec Confidence 11 13578999999988777777666554 33458999999999999998875554422222 No 55 >protein:vir:78090 Length: 302 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468790;genbank:gi:157325371;genbank:GeneID:5601852 Probab=99.96 E-value=3.3e-31 Score=187.68 Aligned_cols=268 Identities=15% Similarity=0.147 Sum_probs=202.6 Q ss_pred CcccchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEecc-----ccccccccCCCCccCCcccccceEEE Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVV-----APTVKDYKAAGRQTSADAISDTGVDL 75 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~-----~~~~~d~~~~~~~~~~~~~~~~~~~~ 75 (273) |||++-+.+.|+..|+++|...+++..|...........|++|+||++. ..+.+||.|.++.. ..+++.++.++ T Consensus 1 Mantl~ya~~~~~~Ld~~~~~~~~t~~l~~~~~~v~~~Gak~vkIp~is~~~~~TsGl~dy~R~~g~~-~g~v~~~~et~ 79 (302) T protein:vir:78 1 MANSLALAQIYQDNIDKAIAVNSKSAFLEANPNNVQYNGGNTIKIADISFGSGTTGDLKAYNRSTGFT-QGSVTLAWSDY 79 (302) T ss_pred CCchhHHHHHHHHHHHHHHHhhhceeecccCCceEEEecCcEEEEEEEEeeccccccccccccccCcc-ccceeeeeeeE Confidence 9998888999999999999999988888544333346668999999997 45899999988754 56789999999 Q ss_pred EEEeeeeceeEechH--HHHHhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhccccc-----ccccCCHhHHHHHHHHH Q lcl|Aclame:pro 76 LIDQEKSIDFLVDDI--DRVQVAGSLEAYTR-AGATALATDTDKFIADMLVDNGTALT-----GSAPSDADDAFDLIASA 147 (273) Q Consensus 76 tid~~~~~~~~i~d~--d~~~~~~~~~~~~~-~~~~ala~~iD~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~i~~a 147 (273) +|++++++.|.||-. |+......+..++. .+....++++|+++|++++..+.... .++..++.+++++|..+ T Consensus 80 tlt~DR~~~f~vD~mDvdETn~~~~~ani~~ef~r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~t~~nvl~~i~~~ 159 (302) T protein:vir:78 80 TLDYDLAQSFQIDAMDVDETKNLATVGNVLSEYQRTKIVPAIDKYRFTKLANDGTGVGGVIDLSKPDASAQALMGDIATA 159 (302) T ss_pred EeeeccceeeeccccchhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHHhhhccCccccccccchhHHHHHHHHHHH Confidence 999999999999944 44443444556554 46778899999999999976544322 22346788999999999 Q ss_pred HHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccC------------- Q lcl|Aclame:pro 148 LKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDT------------- 214 (273) Q Consensus 148 ~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~------------- 214 (273) ...|+++ ++|+|+++|..+..|+++..+-...+.......-.++.|++++|++|++.+.-+.. T Consensus 160 ~~~~~e~----~~~vl~vtp~~~~~Lk~a~~~~~~~~~~~~~~~~i~~~V~~lDgv~Ii~VPs~r~~t~~~f~~G~~~~~ 235 (302) T protein:vir:78 160 MELVDDS----NQLILVTSPTTLAGLLNTALIRESKNTQVLRRGEVDTKITFIQDVEVLQVPSEYLYDKVAPKVGVPDYT 235 (302) T ss_pred HHHhhcc----CCeEEEEChHHHHHHhcchhhccceeccccccccccceeeeecccEEEEchhhhcccceeccCCccccC Confidence 9999996 48999999999999987764332222222222234788999999999986532221 Q ss_pred --CCcEEEEEeCceEEEEEecceeeeccCCCcc---eeeEEeeeeeeeEEEcCceEEEEecC-CC Q lcl|Aclame:pro 215 --DDEQFVAFHPSAAAYVSQIDTVEALRDQDSF---SDRIRALHVYGGKVVRPTGVVVFNKT-GS 273 (273) Q Consensus 215 --~~~~~~~~~~~a~~~~~~~~~ve~~~~~~~~---~~~v~~~~~~g~~vl~p~~~v~~~~~-~s 273 (273) ..-.+++.|++|.....+.+.+..+.|...+ +|++..+.++++.|++.....++.+. +. T Consensus 236 ~ak~INfiiv~~~a~ia~~K~~~~~if~P~~~~~gd~~l~~~R~Y~D~fV~~nk~~gI~~~~~~~ 300 (302) T protein:vir:78 236 GAKKIPYMIFKRDAPTGIVKTDKVRVFEPDTNQSADAYKVDLRLYHDLIVPKNQRPGIIKASFGT 300 (302) T ss_pred CccceeEEEECCCeeeeeeeeeeeEeeCCCCCCCcceeeeeeeeEeeeeeeccccCeEEEeeccc Confidence 1235799999999988888888777664444 57999999999999998865555322 22 No 56 >protein:vir:100939 Length: 430 # NCBI annotation: Gp5 # Family: family:all:1412 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006408;genbank:gi:46358700;genbank:GeneID:2777089 Probab=99.94 E-value=1.9e-28 Score=172.55 Aligned_cols=269 Identities=17% Similarity=0.125 Sum_probs=192.9 Q ss_pred Ccccchh-HHHHHHHHHHHHHHhhccchh--hhcccccc-ccCCcEEEEEeccccccccccCCCCccCCcccccceEEEE Q lcl|Aclame:pro 1 MAFNNFI-PELWSDMLLEEWTAQTVFANL--VNREYEGI-ASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLL 76 (273) Q Consensus 1 MA~~~~~-pev~~~~v~~~l~~~~v~~~~--~~~d~~~~-~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~t 76 (273) |||+... -+++++++++.|++.+++... ++|+++.+ .+.||||++|.+......+ +......++++.+.+++++ T Consensus 1 MAn~l~~~~~ii~~eal~~l~n~~v~a~~~~~~r~~d~~~~r~Gdti~~p~~~~~~~~~--G~~~t~~~~~i~e~~v~~~ 78 (430) T protein:vir:10 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQE--GWDLTDKATGLLELNVAVN 78 (430) T ss_pred CccchhhHHHHHHHHHHHHHhhhhhhhhhhcccCCchhhhhcccceEEecccccccccc--CcccCCCCCccccceEEEE Confidence 9998776 489999999999999999997 45777655 4679999999997766543 2111223456888999999 Q ss_pred EEeeeeceeEechHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc---ccccCCHhHHHHHHHHHHHHHhh Q lcl|Aclame:pro 77 IDQEKSIDFLVDDIDRVQVAGSLEAYTRAGATALATDTDKFIADMLVDNGTALT---GSAPSDADDAFDLIASALKELTK 153 (273) Q Consensus 77 id~~~~~~~~i~d~d~~~~~~~~~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~---~~~~~~~~~~~~~i~~a~~~l~~ 153 (273) +++++.+.|.+++.|+. ...+.++++++++++||.+||.++++.+...+..+. .+++....+.+.++..+++.|++ T Consensus 79 v~~~k~V~~~~~~kel~-~~~~~~~~i~~Am~~LA~~Vd~dl~~~~~~~~~~v~~~~~~t~~~~~~~~~~~A~a~~~L~~ 157 (430) T protein:vir:10 79 MGEPDNDFFQLRADDLR-DETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAEELMFS 157 (430) T ss_pred EeeeccceEEechhHhc-ChhHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccccccccCCCcCCcchhhHHHHHHHHHH Confidence 99999999999998853 333456788999999999999999999866554332 23444445667889999999999 Q ss_pred cCCCcC-CcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeee-ecceEE-EEecccccCC--------------- Q lcl|Aclame:pro 154 ANVPNV-GRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGN-LLGARI-VESNNLRDTD--------------- 215 (273) Q Consensus 154 ~~vp~~-~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~-i~G~~i-~~s~~l~~~~--------------- 215 (273) .++|.+ +|.++++|+.+..|......+...+.. ..+++++|.|++ +.||++ ++++.+|.+. T Consensus 158 ~~vP~~~~R~~vldp~~~~~l~~~l~~l~~~~~~-~~~A~r~g~i~~~~~Gfd~~~~~~~~~~~t~g~~t~~tv~gA~~~ 236 (430) T protein:vir:10 158 RELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRI-PEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSF 236 (430) T ss_pred hcCCCCCCcEEEeChHHHHHHHhhhccccccccc-hhHHHhhccccccchhhhhhhhcCCcccccCccCcCceecccccc Confidence 999995 799999999999997543223222222 346789999997 889975 5565554300 Q ss_pred ----------------------------------------C--------------------------------------- Q lcl|Aclame:pro 216 ----------------------------------------D--------------------------------------- 216 (273) Q Consensus 216 ----------------------------------------~--------------------------------------- 216 (273) + T Consensus 237 ~~~~~~v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftiaGV~~v~~~tkq~~~~l~~F~Vt~~~~atsv~I~paii~~~ 316 (430) T protein:vir:10 237 KPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALD 316 (430) T ss_pred ccccceecccccccccccccceeeeecccceecccEEEecceeeeccccccccCCccEEEEEEecCCceeEEeccccccc Confidence 0 Q ss_pred --------------------------------cEEEEEeCceEEEEEeccee--------ee----------------cc Q lcl|Aclame:pro 217 --------------------------------EQFVAFHPSAAAYVSQIDTV--------EA----------------LR 240 (273) Q Consensus 217 --------------------------------~~~~~~~~~a~~~~~~~~~v--------e~----------------~~ 240 (273) ..-++||++|++++.+...+ +. .+ T Consensus 317 ~~~~~~~~~~y~nVsaspa~~aavTvv~~a~~~~Nl~fhr~A~aLa~~pL~~~~~~~~~~~~~~~~~~~~Glsirv~~~y 396 (430) T protein:vir:10 317 DVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQG 396 (430) T ss_pred cccccccccccceeccccccCceeEEeccCCcccceeEcccceEEEEecccCCCCHHHhhhhheeccccceEEEEEEEec Confidence 00145677777777654321 11 13 Q ss_pred CCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 241 DQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 241 ~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) |.+...+.++-+..||++.+|||..+++-+.-+ T Consensus 397 d~~~~~~~~r~DvLyG~~~v~Pe~a~v~l~g~~ 429 (430) T protein:vir:10 397 DISTLSGLCRIALWYGVNATRPEAIGVGLPGQT 429 (430) T ss_pred ccccCceEEEEeeeccceecCcceEEEEcCCCC Confidence 333345677889999999999998655533322 No 57 >protein:vir:9265 Length: 430 # NCBI annotation: 5 # Family: family:all:1412 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720329;genbank:gi:24371587;genbank:GeneID:955820 Probab=99.94 E-value=1.9e-28 Score=172.55 Aligned_cols=269 Identities=17% Similarity=0.125 Sum_probs=192.9 Q ss_pred Ccccchh-HHHHHHHHHHHHHHhhccchh--hhcccccc-ccCCcEEEEEeccccccccccCCCCccCCcccccceEEEE Q lcl|Aclame:pro 1 MAFNNFI-PELWSDMLLEEWTAQTVFANL--VNREYEGI-ASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLL 76 (273) Q Consensus 1 MA~~~~~-pev~~~~v~~~l~~~~v~~~~--~~~d~~~~-~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~t 76 (273) |||+... -+++++++++.|++.+++... ++|+++.+ .+.||||++|.+......+ +......++++.+.+++++ T Consensus 1 MAn~l~~~~~ii~~eal~~l~n~~v~a~~~~~~r~~d~~~~r~Gdti~~p~~~~~~~~~--G~~~t~~~~~i~e~~v~~~ 78 (430) T protein:vir:92 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQE--GWDLTDKATGLLELNVAVN 78 (430) T ss_pred CccchhhHHHHHHHHHHHHHhhhhhhhhhhcccCCchhhhhcccceEEecccccccccc--CcccCCCCCccccceEEEE Confidence 9998776 489999999999999999997 45777655 4679999999997766543 2111223456888999999 Q ss_pred EEeeeeceeEechHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc---ccccCCHhHHHHHHHHHHHHHhh Q lcl|Aclame:pro 77 IDQEKSIDFLVDDIDRVQVAGSLEAYTRAGATALATDTDKFIADMLVDNGTALT---GSAPSDADDAFDLIASALKELTK 153 (273) Q Consensus 77 id~~~~~~~~i~d~d~~~~~~~~~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~---~~~~~~~~~~~~~i~~a~~~l~~ 153 (273) +++++.+.|.+++.|+. ...+.++++++++++||.+||.++++.+...+..+. .+++....+.+.++..+++.|++ T Consensus 79 v~~~k~V~~~~~~kel~-~~~~~~~~i~~Am~~LA~~Vd~dl~~~~~~~~~~v~~~~~~t~~~~~~~~~~~A~a~~~L~~ 157 (430) T protein:vir:92 79 MGEPDNDFFQLRADDLR-DETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAEELMFS 157 (430) T ss_pred EeeeccceEEechhHhc-ChhHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccccccccCCCcCCcchhhHHHHHHHHHH Confidence 99999999999998853 333456788999999999999999999866554332 23444445667889999999999 Q ss_pred cCCCcC-CcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeee-ecceEE-EEecccccCC--------------- Q lcl|Aclame:pro 154 ANVPNV-GRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGN-LLGARI-VESNNLRDTD--------------- 215 (273) Q Consensus 154 ~~vp~~-~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~-i~G~~i-~~s~~l~~~~--------------- 215 (273) .++|.+ +|.++++|+.+..|......+...+.. ..+++++|.|++ +.||++ ++++.+|.+. T Consensus 158 ~~vP~~~~R~~vldp~~~~~l~~~l~~l~~~~~~-~~~A~r~g~i~~~~~Gfd~~~~~~~~~~~t~g~~t~~tv~gA~~~ 236 (430) T protein:vir:92 158 RELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRI-PEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSF 236 (430) T ss_pred hcCCCCCCcEEEeChHHHHHHHhhhccccccccc-hhHHHhhccccccchhhhhhhhcCCcccccCccCcCceecccccc Confidence 999995 799999999999997543223222222 346789999997 889975 5565554300 Q ss_pred ----------------------------------------C--------------------------------------- Q lcl|Aclame:pro 216 ----------------------------------------D--------------------------------------- 216 (273) Q Consensus 216 ----------------------------------------~--------------------------------------- 216 (273) + T Consensus 237 ~~~~~~v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftiaGV~~v~~~tkq~~~~l~~F~Vt~~~~atsv~I~paii~~~ 316 (430) T protein:vir:92 237 KPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALD 316 (430) T ss_pred ccccceecccccccccccccceeeeecccceecccEEEecceeeeccccccccCCccEEEEEEecCCceeEEeccccccc Confidence 0 Q ss_pred --------------------------------cEEEEEeCceEEEEEeccee--------ee----------------cc Q lcl|Aclame:pro 217 --------------------------------EQFVAFHPSAAAYVSQIDTV--------EA----------------LR 240 (273) Q Consensus 217 --------------------------------~~~~~~~~~a~~~~~~~~~v--------e~----------------~~ 240 (273) ..-++||++|++++.+...+ +. .+ T Consensus 317 ~~~~~~~~~~y~nVsaspa~~aavTvv~~a~~~~Nl~fhr~A~aLa~~pL~~~~~~~~~~~~~~~~~~~~Glsirv~~~y 396 (430) T protein:vir:92 317 DVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQG 396 (430) T ss_pred cccccccccccceeccccccCceeEEeccCCcccceeEcccceEEEEecccCCCCHHHhhhhheeccccceEEEEEEEec Confidence 00145677777777654321 11 13 Q ss_pred CCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 241 DQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 241 ~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) |.+...+.++-+..||++.+|||..+++-+.-+ T Consensus 397 d~~~~~~~~r~DvLyG~~~v~Pe~a~v~l~g~~ 429 (430) T protein:vir:92 397 DISTLSGLCRIALWYGVNATRPEAIGVGLPGQT 429 (430) T ss_pred ccccCceEEEEeeeccceecCcceEEEEcCCCC Confidence 333345677889999999999998655533322 No 58 >protein:vir:2106 Length: 430 # NCBI annotation: coat protein # Family: family:all:1412 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:NP_059630;genbank:gi:9635538;genbank:GeneID:1262831 Probab=99.92 E-value=1.8e-27 Score=167.22 Aligned_cols=268 Identities=18% Similarity=0.142 Sum_probs=189.7 Q ss_pred Ccccc--hhHHHHHHHHHHHHHHhhccchh--hhcccccc-ccCCcEEEEEeccccccccccCCCCccCCcccccceEEE Q lcl|Aclame:pro 1 MAFNN--FIPELWSDMLLEEWTAQTVFANL--VNREYEGI-ASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDL 75 (273) Q Consensus 1 MA~~~--~~pev~~~~v~~~l~~~~v~~~~--~~~d~~~~-~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 75 (273) |||+. +. ++.-+++++.|+..+++.++ ++++++.+ .+.||||++|.+......+ +......++++.+.++++ T Consensus 1 Ma~~~~~~l-ti~~~eal~~~~n~lV~a~~~~~~r~~d~~~~r~Gdti~ip~p~~~~~~~--G~~~t~~~~~~~e~~v~~ 77 (430) T protein:vir:21 1 MALNEGQIV-TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQE--GWDLTDKATGLLELNVAV 77 (430) T ss_pred Cccccchhh-HHHHHHHHHHhhhhhhhhhhhhccCCchhhhhcccceEEeeccccccccc--cccccCCCccceeeeEeE Confidence 99963 32 33339999999999999997 45777654 4679999999987755433 221123346789999999 Q ss_pred EEEeeeeceeEechHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc---ccccCCHhHHHHHHHHHHHHHh Q lcl|Aclame:pro 76 LIDQEKSIDFLVDDIDRVQVAGSLEAYTRAGATALATDTDKFIADMLVDNGTALT---GSAPSDADDAFDLIASALKELT 152 (273) Q Consensus 76 tid~~~~~~~~i~d~d~~~~~~~~~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~---~~~~~~~~~~~~~i~~a~~~l~ 152 (273) ++++++.+.|.+++.|.. .....++++++++++||.+||.++++.++..+..+. .+++....+.++++..++..|+ T Consensus 78 ~~~~~~~V~~~~~~kEl~-~~~~~er~l~pAm~~LA~~Vd~dl~~~~~~~~~~v~~~~~~t~~~~~~~~~~~A~a~~~L~ 156 (430) T protein:vir:21 78 NMGEPDNDFFQLRADDLR-DETAYRRRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAEEIMF 156 (430) T ss_pred EEeeeccceEEeehhHhc-ChhhHHHHHHHHHHHHHHHHHHHHHHHhhhhhhccccccCCCCCCCCcchhhHHHHHHHHH Confidence 999999999999988743 333357899999999999999999999876554332 2344445566889999999999 Q ss_pred hcCCCcC-CcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeee-ecceEE-EEecccccCC-------------- Q lcl|Aclame:pro 153 KANVPNV-GRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGN-LLGARI-VESNNLRDTD-------------- 215 (273) Q Consensus 153 ~~~vp~~-~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~-i~G~~i-~~s~~l~~~~-------------- 215 (273) +.++|.+ +|.++++|..+..|......+...+.. ..+++++|.|++ +.||++ +.++.+|.++ T Consensus 157 ~~~vP~~~~R~~~~~p~~~~~l~~~l~~~~~~~~~-~~~A~r~g~i~r~~~Gfd~~~~s~~~~~~t~gt~t~~tv~gA~~ 235 (430) T protein:vir:21 157 SRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRI-PEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQS 235 (430) T ss_pred HhcCCCCCCcEEEeChHHHHHHhhhhccccccccc-hhHHHhhcccccccchhhhhhhcCCcccccCccCcCceeccccc Confidence 9999995 799999999999986654333333332 346889999997 899985 5666655300 Q ss_pred -----------------------------------------C-------------------------------------- Q lcl|Aclame:pro 216 -----------------------------------------D-------------------------------------- 216 (273) Q Consensus 216 -----------------------------------------~-------------------------------------- 216 (273) + T Consensus 236 ~~~~~~tv~~~g~~~~~d~~~~~it~s~tg~l~~GD~ftiaGV~~v~~itk~~~~~l~qf~V~a~~~~ttv~I~Pai~~~ 315 (430) T protein:vir:21 236 FKPVAWQLDNDGNKVNVDNRFATVTLSATTGMKRGDKISFAGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVAL 315 (430) T ss_pred cccccceeccccccccccccceeeeeecccceecccEEEecceeeeccccccccCCcceEEEEEecCCceeEEeeccccc Confidence 0 Q ss_pred ---------------------------------cEEEEEeCceEEEEEeccee--------ee----------------c Q lcl|Aclame:pro 217 ---------------------------------EQFVAFHPSAAAYVSQIDTV--------EA----------------L 239 (273) Q Consensus 217 ---------------------------------~~~~~~~~~a~~~~~~~~~v--------e~----------------~ 239 (273) ..-++||++|++++.+...+ +. . T Consensus 316 ~~~~~~~~~~~y~nVsaspa~~aavT~v~~a~~~~Nl~fh~~A~~La~~pl~~p~~~~~~~~~~~~~~~~~Glsirv~~~ 395 (430) T protein:vir:21 316 DDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQ 395 (430) T ss_pred ccccccccccccceeccccccCceeEEeccCCcccceeEccceeEEEEecccCCCChhHhhheeeeeccccceEEEEEEc Confidence 00145677777776653311 00 1 Q ss_pred cCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 240 RDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 240 ~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ++.+...+.++-+..||++.+|||..+++-+.-+ T Consensus 396 yd~~~~~~~~r~DilyG~~~l~Pe~a~v~l~g~~ 429 (430) T protein:vir:21 396 GDISTLSGLCRIALWYGVNATRPEAIGVGLPGQT 429 (430) T ss_pred cccccCceEEEEEeecCccccCcceEEEEcCCCC Confidence 2333345678899999999999998655533322 No 59 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=99.91 E-value=3e-27 Score=166.00 Aligned_cols=178 Identities=21% Similarity=0.229 Sum_probs=128.2 Q ss_pred EEeeeeceeEechHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhccc--------------ccccccCCHhHHH Q lcl|Aclame:pro 77 IDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADMLVDNGTA--------------LTGSAPSDADDAF 141 (273) Q Consensus 77 id~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~iD~~~~~~~~~~~~~--------------~~~~~~~~~~~~~ 141 (273) ||......+.|+|+|+.++++++ .++++|++++||+.+|+.++..++.++.. ...+...++..++ T Consensus 1 iD~lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~~g~~~~~~a~~t~~~~~l~ 80 (221) T protein:vir:17 1 MDDLLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQDGGFSVNIGAGNTNNAQAIV 80 (221) T ss_pred CCcchhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccccccCcceeccccccCCHHHHH Confidence 99999999999999999999997 56899999999999999999988765321 1112334567789 Q ss_pred HHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhc-chHHhhhhhcccccceeeee-eeeeecceEEEEecccccCCCc-- Q lcl|Aclame:pro 142 DLIASALKELTKANVPNVGRVVVVNAEMAFWLRS-SGSKLTSADTSGDAAGLRAG-TIGNLLGARIVESNNLRDTDDE-- 217 (273) Q Consensus 142 ~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~-~~~~~~~~~~~~~~~~~~~G-~ig~i~G~~i~~s~~l~~~~~~-- 217 (273) +.|.+++..|+|++||.++||++++|++|+.|++ +++++.+.+..++...+++| .|++++||+||+||++|...+. T Consensus 81 dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~~~~g~~i~~v~G~~V~~SnnlP~~~gt~~ 160 (221) T protein:vir:17 81 DGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNREIGNTQGDMNTGKGLYVNAGIRIYKSNVLASLYGTNL 160 (221) T ss_pred HHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeeeecccccccccccceeeeecCcEEEEeccCCccccccc Confidence 9999999999999999999999999998888876 45667777777777778888 4999999999999999964332 Q ss_pred ------------------------EEEEEeCceEEEEEecceeeeccCCCcceeeEEeeeeeeeEEEcCceE Q lcl|Aclame:pro 218 ------------------------QFVAFHPSAAAYVSQIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGV 265 (273) Q Consensus 218 ------------------------~~~~~~~~a~~~~~~~~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~ 265 (273) ..+++||+|.|.++.+-. .-|++ ++-.. ..+-||+.- T Consensus 161 ~~~ag~~~~~~~~~~~yr~~fs~~~glv~~~~Avgtvkl~~~--~~~~~-----~~~~~----~~~~~~~~~ 221 (221) T protein:vir:17 161 VTDPGDATTSGENNGSYRPAITDRAGLVFHKEAADTVEVLLP--PSRPP-----LVISM----FSIRRPDRR 221 (221) T ss_pred ccCCccccccccccccccccccceEEEEEcchheeeeeeecC--CCCCc-----eeeee----eeccCCCCC Confidence 234555555555543321 00110 00000 011112111 No 60 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=99.87 E-value=1.6e-23 Score=145.58 Aligned_cols=261 Identities=12% Similarity=0.080 Sum_probs=185.6 Q ss_pred Ccc----cchhHHHHHHHHHHHHHHhhccc--hhhhc----cccc-cccCCcEEEEEecccc-ccccccCCCCccCCccc Q lcl|Aclame:pro 1 MAF----NNFIPELWSDMLLEEWTAQTVFA--NLVNR----EYEG-IASKGNVVHIAGVVAP-TVKDYKAAGRQTSADAI 68 (273) Q Consensus 1 MA~----~~~~pev~~~~v~~~l~~~~v~~--~~~~~----d~~~-~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~~~~ 68 (273) ||. .+|.||||+..+.+++.+.+.|. .++.+ +-.+ ...+|++|++|.|+.+ +..+...++..++++.+ T Consensus 1 MA~T~lsd~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~~v~~~~~i~~~~l 80 (324) T protein:vir:59 1 MAYTKISDVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQVLNDTDDLVPQKI 80 (324) T ss_pred CCceeeeceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCCCcccccCCCcccchhhc Confidence 996 46889999999999998887662 22222 2222 2457999999999998 66666667888999999 Q ss_pred ccceEEEEEEeeeeceeEechHHHHHhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhcc-------cccccccCCHhHH Q lcl|Aclame:pro 69 SDTGVDLLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADMLVDNGT-------ALTGSAPSDADDA 140 (273) Q Consensus 69 ~~~~~~~tid~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~iD~~~~~~~~~~~~-------~~~~~~~~~~~~~ 140 (273) +.++...++. .+..+|.++|+....+.++ +.++.+|.+..++++.++++++.+...-. ....++..+.... T Consensus 81 ~t~~~~a~i~-~~~k~~~~tD~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l~g~~~~~~~~~~~~dvsa~~~~~~s 159 (324) T protein:vir:59 81 NAGQDKAVLI-LRGNAWSSHDLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAELAGVFSNDDMKDNKLDISGTADGIYS 159 (324) T ss_pred ccceeeEEEE-eecCceeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccceeeeeccccceec Confidence 9999888885 5889999999876655555 57789999999999999999998854211 1111222222234 Q ss_pred HHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccC------ Q lcl|Aclame:pro 141 FDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDT------ 214 (273) Q Consensus 141 ~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~------ 214 (273) .+.|.+|...|.++. ..-..++++|..+..|.+.. +.+.-...+ ..+.|+.++|..|+++..+|.. T Consensus 160 ~~~l~~A~~~~GD~~--~~~~~ivmhS~v~~~L~~~~--li~~~~~s~----~~~~i~~~~G~~VivdD~~p~~~~~~~~ 231 (324) T protein:vir:59 160 AETFVDASYKLGDHE--SLLTAIGMHSATMASAVKQD--LIEFVKDSQ----SGIRFPTYMNKRVIVDDSMPVETLEDGT 231 (324) T ss_pred HHHHHHHHHHhCCcc--cCcEEEEEchHHHHHHHHhh--hhhhccccc----cCceeeeecccEEEEeCCCCccccCCCC Confidence 577999999997764 23347899999999998753 222211111 1467899999999999988842 Q ss_pred CCcEEEEEeCceEEEEE--ecceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEE-------ecCCC Q lcl|Aclame:pro 215 DDEQFVAFHPSAAAYVS--QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVF-------NKTGS 273 (273) Q Consensus 215 ~~~~~~~~~~~a~~~~~--~~~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~-------~~~~s 273 (273) ..+.++.+.++|+++.. +...+|..|++.+..|.++.+.+|...+ -|+--. ..+.+ T Consensus 232 ~~y~s~l~~~GAi~~~~~~~~v~vE~dRd~~~g~~~l~~r~~~~~~p---~G~s~~~~~~~~~sPt~~ 296 (324) T protein:vir:59 232 KVFTSYLFGAGALGYAEGQPEVPTETARNALGSQDILINRKHFVLHP---RGVKFTENAMAGTTPTDE 296 (324) T ss_pred ceEEEEEEecCeEEEeecCCCcceecccCccccceEEEEeeEEEeEe---eeEEecccccCCCCCChh Confidence 23467889999999875 3456899999998888888888876544 333322 22222 No 61 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=99.85 E-value=1.1e-22 Score=140.99 Aligned_cols=260 Identities=12% Similarity=0.098 Sum_probs=182.9 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccc--hhhhccc----cccccCCcEEEEEecccc-ccccccCCC-CccCCc Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFA--NLVNREY----EGIASKGNVVHIAGVVAP-TVKDYKAAG-RQTSAD 66 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~--~~~~~d~----~~~~~~Gdtv~ip~~~~~-~~~d~~~~~-~~~~~~ 66 (273) ||+ .+|.||+|+..+.+++.+.+.|. +.+..+- .+++ +|+++++|.|+.+ +..+...++ +.++++ T Consensus 1 Ma~~~T~l~d~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~-~G~~i~~P~~~~l~G~~~~~~dg~~~i~~~ 79 (330) T protein:vir:10 1 MANELTKILDTITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITS-GGLLVNMPFWNDLTGDSEVLGNGDKALETG 79 (330) T ss_pred CCCCceEeeeeechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhc-CCCEEEecccccCCCcccccCCCccccchh Confidence 997 37789999999999998876552 2232222 2233 7999999999988 666655555 468889 Q ss_pred ccccceEEEEEEeeeeceeEechHHHHHhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhcc------c-----c--ccc Q lcl|Aclame:pro 67 AISDTGVDLLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADMLVDNGT------A-----L--TGS 132 (273) Q Consensus 67 ~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~iD~~~~~~~~~~~~------~-----~--~~~ 132 (273) .++.++...++ +.+..+|.++|+.....-.+ +.++.+|.+...+++.+..+++.+...-. . . ... T Consensus 80 ki~t~~~~a~i-~~~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~l~gvf~~~~~~~~~~~~~~~~~~~ 158 (330) T protein:vir:10 80 KITAGADIACV-LYRGRGWAANELTGVVAGSDPVRAILNRIGAYWLREDQKALIATLNGIFATGTAGEKGALEETHVSDQ 158 (330) T ss_pred hcccceeEEEE-EeecceeeehhhhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHHHHhhhhhhhcccchhhhhhheecc Confidence 99999888888 44788899999876555455 67888999999999999998887753110 0 0 001 Q ss_pred ccCCHhHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccc Q lcl|Aclame:pro 133 APSDADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLR 212 (273) Q Consensus 133 ~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~ 212 (273) +...+....+.|.+|...|.++. ..-..++++|..+..|.+.. +.+.-.... .++.|+.++|..|++++.+| T Consensus 159 ~~~~a~~s~~~l~~A~~~~GD~~--~~~~~ivmhS~v~~~L~~~~--li~~~~~s~----~~~~i~~~~G~~VivdD~~p 230 (330) T protein:vir:10 159 SKASTGIDAGMVLDAKQLLGDSA--DQVTAIAMHSAVYTKLQKDN--LIQYIQPTT----ATINIPTYLGYRVIIDDGIA 230 (330) T ss_pred cccccccCHHHHHHHHHHhcccc--ccceEEEEcHHHHHHHHHhh--hhhhhcccc----cCcccccccceEEEEeCCCC Confidence 11122223567899999997765 23357899999999998642 222222211 14679999999999999998 Q ss_pred cCC-CcEEEEEeCceEEEEE----ecceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEec----CC-C Q lcl|Aclame:pro 213 DTD-DEQFVAFHPSAAAYVS----QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNK----TG-S 273 (273) Q Consensus 213 ~~~-~~~~~~~~~~a~~~~~----~~~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~----~~-s 273 (273) ... .++++++.++|+++.. +...+|..|++.+..|.+..+.+|... |-|.---.+ .+ | T Consensus 231 ~~~~~yt~yl~~~GAi~~~~~~~~~~v~~EtdRd~~~g~~~l~~r~~~~~h---p~G~s~~~~~~~~~~~s 298 (330) T protein:vir:10 231 PTGDIYTSYLFRTGSIGLNTGNPSGLTTFETSREAAKGNDMIYTRRALVMH---PYGVKWTGAEVDAGNIT 298 (330) T ss_pred CCCCceeEEEEecCceeeecccCCccccccccCCccccceEEEEeeEEEee---eeeeeecccccccCcCC Confidence 554 3467888999999864 445789999999888999999987654 455443322 11 2 No 62 >protein:vir:95451 Length: 313 # NCBI annotation: hypothetical protein ORF044 # Family: family:all:11728 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294637;genbank:gi:149408203;genbank:GeneID:5237018 Probab=99.85 E-value=3.4e-24 Score=149.27 Aligned_cols=269 Identities=18% Similarity=0.158 Sum_probs=194.0 Q ss_pred Cc---c--cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEEE Q lcl|Aclame:pro 1 MA---F--NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDL 75 (273) Q Consensus 1 MA---~--~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 75 (273) |- | .++.+|+|+++++..|.+.+.--.+ .|+.. .+..|++.+||.+|.+.+++ .++.++..++++.++++++ T Consensus 1 ~~~TSNT~A~I~SE~~s~~I~~~LH~~LL~~~~-~R~V~-DF~~G~~L~I~tiGs~~~~~-~~E~~~~~~~~i~TGEIt~ 77 (313) T protein:vir:95 1 MQLTSNTRAFIESEQYSKFILLNLHDGLLPETF-YRNVS-DFGSGETLHIKTIGSVTLQE-AEEDTPLIYNPIETGEITF 77 (313) T ss_pred CcccccchheehhhhHHHHHHHHhhccccchhh-hhhhc-cCCCCCEEEecccCceeeec-cccCCCeeecccccceEEE Confidence 53 4 3778999999999999988753333 34332 46679999999999999876 5567788999999999999 Q ss_pred EEEeeeeceeEechHHHHHhHH--HH-HHHHHHHHHHHHHHHHHHHHHHHHhh------ccccc------ccccCCHhHH Q lcl|Aclame:pro 76 LIDQEKSIDFLVDDIDRVQVAG--SL-EAYTRAGATALATDTDKFIADMLVDN------GTALT------GSAPSDADDA 140 (273) Q Consensus 76 tid~~~~~~~~i~d~d~~~~~~--~~-~~~~~~~~~ala~~iD~~~~~~~~~~------~~~~~------~~~~~~~~~~ 140 (273) .|..+++-++.|++.-+..... .+ .....++++++.+....|++++..+. +..++ .++.++.... T Consensus 78 ~i~~Y~G~A~~vt~~LR~D~~~I~~~~A~~~AE~~RAI~E~~~TD~L~~G~~~FA~~~~P~~vNG~PH~~V~~~T~~~~~ 157 (313) T protein:vir:95 78 QITEYKGDAWYVTDDLREDGTDIDRLMAERAAESTRAIQETFETDFLKTGAEYFAANPGPHNVNGFPHVIVSAETNGVFA 157 (313) T ss_pred EEEeecCChhhhhhhhhhcchhHHHHhhhcchhhHHHHHHHHhhHHHhhchhhhccCCCCcccccccceEEeccCCceeh Confidence 9999999999999875554322 22 34556788999999999999876431 11121 2456666777 Q ss_pred HHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeee------eeeecceEEEEecccccC Q lcl|Aclame:pro 141 FDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGT------IGNLLGARIVESNNLRDT 214 (273) Q Consensus 141 ~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~------ig~i~G~~i~~s~~l~~~ 214 (273) +..+..++..|++.++|.+||+.+++|.....|..-..........+ .-++..|. |.++||++++.||.|.+. T Consensus 158 ~~~~~~~~~~~~~a~~P~~G~v~IvDP~~~~~L~~l~~It~~vt~~~-k~I~ESG~A~~~~Fi~~~YG~Di~~SN~L~~A 236 (313) T protein:vir:95 158 LKHLIAMRLAFDKANVPAEGRVFIVDPVAEATLNGLVTITHDVTDFG-KMILESGMARGQRFIMNLYGWDILTSNRLHVA 236 (313) T ss_pred hhHHHHhhhhhhhccCCccceEEEEcchhhhhhhhhheeeccccccc-ceeeeccCCchhHHHHHHhhhhhhhhhhhhhc Confidence 88999999999999999999999999999999865443221122111 11333443 568999999999998754 Q ss_pred CCc--------EE---EE-EeCc----eEEEEEecceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 215 DDE--------QF---VA-FHPS----AAAYVSQIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 215 ~~~--------~~---~~-~~~~----a~~~~~~~~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) +.+ .| |. .-.. -.+.=+++.+.|.+++..+-.+-....++||.++.|.|.++.+-+.++ T Consensus 237 N~~D~~tT~~G~~~NlFM~i~D~~~~P~~~AWr~MP~s~~~~~~~~~~~~~~~~~R~G~Gi~R~~~L~~~~~~A~ 311 (313) T protein:vir:95 237 NYNDGTTTGNGYVGNLFMCILDDQTKPIMGAWRRMPKSEGERNKDRARDEHVVRCRYGFGIQRLDTLGLLATSAT 311 (313) T ss_pred cccccccccCceeeeeeeeeecccccceeeeeccccccccccccccccccceeeeeecccceeecceeEEEeccc Confidence 321 11 11 1111 122224567788888877766777788999999999999998877777 No 63 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=99.83 E-value=5.5e-22 Score=137.14 Aligned_cols=262 Identities=13% Similarity=0.107 Sum_probs=181.9 Q ss_pred Cccc----chhHHHHHHHHHHHHHHhhccc--hhhhcccccc---ccCCcEEEEEecccc-ccccccCCCCccCCccccc Q lcl|Aclame:pro 1 MAFN----NFIPELWSDMLLEEWTAQTVFA--NLVNREYEGI---ASKGNVVHIAGVVAP-TVKDYKAAGRQTSADAISD 70 (273) Q Consensus 1 MA~~----~~~pev~~~~v~~~l~~~~v~~--~~~~~d~~~~---~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~~~~~~ 70 (273) ||.+ +|+||+|+..+.+++.+.+.|- +++..+-+.. ..+|+++++|.|+.+ +..+...++..++++.++. T Consensus 1 MA~T~lsd~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~Gd~~~~~~~~~i~~~kitt 80 (351) T protein:vir:15 1 MAETHLSDLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLTGDPDNWTDSDDIDVNNLTS 80 (351) T ss_pred CCceeeeeeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCcccchheecc Confidence 9984 5889999999999998877652 2332222221 137999999999997 6777777788899999999 Q ss_pred ceEEEEEEeeeeceeEechHHHHHhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhcc---c-------ccccccCCHhH Q lcl|Aclame:pro 71 TGVDLLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADMLVDNGT---A-------LTGSAPSDADD 139 (273) Q Consensus 71 ~~~~~tid~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~iD~~~~~~~~~~~~---~-------~~~~~~~~~~~ 139 (273) ++...++ +.+..+|.++|+......++ +.++.+|.+...++++++.+++.+...-. . ....++.+... T Consensus 81 ~~~~a~i-~~~~kg~~~tD~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~l~gv~~~~~~~~~~~~d~t~~~~~~~~i 159 (351) T protein:vir:15 81 GKQQGIK-FYQTKAYGYTDLGTMISGAPVQETIGNRFAAFWQRADQKTLLSVLKGVMGVTKIANSKVYDQTKVSPSEPMF 159 (351) T ss_pred cceeEEE-EeeccceehhhhhHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhchhhcccceecccccccccccc Confidence 9988888 55788899999876665555 57788999999999999999998754210 0 01112222333 Q ss_pred HHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccC----- Q lcl|Aclame:pro 140 AFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDT----- 214 (273) Q Consensus 140 ~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~----- 214 (273) ..+.|.+|...|.+..- ..-..++++|.++..|.+.. +.+.-...+ .++.|+.+.|..|++++.+|.. T Consensus 160 s~~~l~~A~~~~GD~~~-~~~~~ivmhS~v~~~L~~~~--li~~~~~s~----~~~~i~t~~G~~VivdD~~p~~~~~~~ 232 (351) T protein:vir:15 160 GAKGFTGAIGLMGDLQD-TAFGAIAVNSATYSLMKVQG--LIETIQPQN----GATPFEAYNGLRIVLDDDIEIDLTDKT 232 (351) T ss_pred CHHHHHHHHHHhccccc-cceEEEEEChHHHHHHHhhh--hhhhccccc----cCcccceecceEEEEcCCCccccCCCC Confidence 45789999999866431 11246789999999998653 222222211 1456899999999999999852 Q ss_pred -CCcEEEEEeCceEEEEEecceeeeccCCCcc--eeeEEeeeeeeeEEEcCceEEEEec-----CCC Q lcl|Aclame:pro 215 -DDEQFVAFHPSAAAYVSQIDTVEALRDQDSF--SDRIRALHVYGGKVVRPTGVVVFNK-----TGS 273 (273) Q Consensus 215 -~~~~~~~~~~~a~~~~~~~~~ve~~~~~~~~--~~~v~~~~~~g~~vl~p~~~v~~~~-----~~s 273 (273) ..+.++.+.++|+++..+...+|..|++... .+.+..+.+| ++.|-|..--.+ ..| T Consensus 233 ~~~ytsyl~~~GAi~~~~~~~~ve~~rd~~~~~g~d~l~~r~~~---~~hp~G~s~~~~~~~~~~~s 296 (351) T protein:vir:15 233 KPVSTSYIFAPGAVRYSTNMRSTETKYDPLINGGQDVIVQKRVG---TIHVAGTSIKASFSPSKASF 296 (351) T ss_pred CceeEEEEEecceeeeecCCcCcceeecccCCCCceEEEEeeee---eeeeeeeeecccccccCcCC Confidence 1245788899999999888888988887653 3555555554 455555553321 112 No 64 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=99.59 E-value=8.6e-16 Score=103.18 Aligned_cols=257 Identities=11% Similarity=0.036 Sum_probs=168.0 Q ss_pred Ccc--------------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCc Q lcl|Aclame:pro 1 MAF--------------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSAD 66 (273) Q Consensus 1 MA~--------------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~ 66 (273) ||. ..++|+.+...+.+.+++..++.+++.+- ...+.+++||+......+....+++..... T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~ 76 (304) T protein:vir:94 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNE----PMTAQKKKFTYLAKGVGAYWVSETERIQTS 76 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhccee----eccCCceEEEEEeCCcceEEeecCcccccc Confidence 654 24579999999999999999888877542 223567899998765556667777777767 Q ss_pred ccccceEEEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHh-hc-----------ccccccc Q lcl|Aclame:pro 67 AISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVD-NG-----------TALTGSA 133 (273) Q Consensus 67 ~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~-~~-----------~~~~~~~ 133 (273) +++.+.+++++.+. +.-+.|++.-..++..++.++ .++.++++++++|..++.--.. .+ ......+ T Consensus 77 ~~~~~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~ 155 (304) T protein:vir:94 77 KPEYAQAEMEAKKI-GVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKGNV 155 (304) T ss_pred cceeeEEEEEEEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccccccccccccccc Confidence 77777777777553 344567765555566677664 5678899999999987742100 00 0011112 Q ss_pred cCCHhHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEeccccc Q lcl|Aclame:pro 134 PSDADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRD 213 (273) Q Consensus 134 ~~~~~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~ 213 (273) ..+....+++|.++...+..+..... .++++|..+..|.+..+ . ....+..+..++++|.+|+.++.+|. T Consensus 156 ~~~~~~~~~~i~~~~~~l~~~~~~~~--~~v~~~~~~~~L~~lkd------~--~G~~l~~~~~~~l~G~PV~~~~~~~~ 225 (304) T protein:vir:94 156 VTDTNNLYVDLSALMATIEDEELDPN--GVLTTRSFRSKMRNALD------A--NDRPLFDANGNEIMGLPLSYTGADVY 225 (304) T ss_pred cccccchHHHHHHHHHHhhhccCCcC--EEEEcHHHHHHHHHhhc------c--CCcEeecCCCccccceeeEEeccccc Confidence 22334568889999888887765433 58899999999975321 1 12344455568899999999999986 Q ss_pred CCC-cEEEEEeCceEEEEEec-ceeeecc----------CCC-----cc---eeeEEeeeeeeeEEEcCceEEEEecCC Q lcl|Aclame:pro 214 TDD-EQFVAFHPSAAAYVSQI-DTVEALR----------DQD-----SF---SDRIRALHVYGGKVVRPTGVVVFNKTG 272 (273) Q Consensus 214 ~~~-~~~~~~~~~a~~~~~~~-~~ve~~~----------~~~-----~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~ 272 (273) ... ..++.+..+-+.+..+. ..++..+ +.+ .| ...+++.+++|..+++|+++++|+.+- T Consensus 226 ~~~~~~~~~gd~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 226 DKKKSLALMGDWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred CCCCcEEEEEehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 543 23444444433332221 1222211 111 12 257788999999999999999999999 No 65 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=99.59 E-value=8.6e-16 Score=103.18 Aligned_cols=257 Identities=11% Similarity=0.036 Sum_probs=168.0 Q ss_pred Ccc--------------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCc Q lcl|Aclame:pro 1 MAF--------------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSAD 66 (273) Q Consensus 1 MA~--------------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~ 66 (273) ||. ..++|+.+...+.+.+++..++.+++.+- ...+.+++||+......+....+++..... T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~ 76 (304) T protein:vir:10 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNE----PMTAQKKKFTYLAKGVGAYWVSETERIQTS 76 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhccee----eccCCceEEEEEeCCcceEEeecCcccccc Confidence 654 24579999999999999999888877542 223567899998765556667777777767 Q ss_pred ccccceEEEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHh-hc-----------ccccccc Q lcl|Aclame:pro 67 AISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVD-NG-----------TALTGSA 133 (273) Q Consensus 67 ~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~-~~-----------~~~~~~~ 133 (273) +++.+.+++++.+. +.-+.|++.-..++..++.++ .++.++++++++|..++.--.. .+ ......+ T Consensus 77 ~~~~~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~ 155 (304) T protein:vir:10 77 KPEYAQAEMEAKKI-GVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKGNV 155 (304) T ss_pred cceeeEEEEEEEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccccccccccccccc Confidence 77777777777553 344567765555566677664 5678899999999987742100 00 0011112 Q ss_pred cCCHhHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEeccccc Q lcl|Aclame:pro 134 PSDADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRD 213 (273) Q Consensus 134 ~~~~~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~ 213 (273) ..+....+++|.++...+..+..... .++++|..+..|.+..+ . ....+..+..++++|.+|+.++.+|. T Consensus 156 ~~~~~~~~~~i~~~~~~l~~~~~~~~--~~v~~~~~~~~L~~lkd------~--~G~~l~~~~~~~l~G~PV~~~~~~~~ 225 (304) T protein:vir:10 156 VTDTNNLYVDLSALMATIEDEELDPN--GVLTTRSFRSKMRNALD------A--NDRPLFDANGNEIMGLPLSYTGADVY 225 (304) T ss_pred cccccchHHHHHHHHHHhhhccCCcC--EEEEcHHHHHHHHHhhc------c--CCcEeecCCCccccceeeEEeccccc Confidence 22334568889999888887765433 58899999999975321 1 12344455568899999999999986 Q ss_pred CCC-cEEEEEeCceEEEEEec-ceeeecc----------CCC-----cc---eeeEEeeeeeeeEEEcCceEEEEecCC Q lcl|Aclame:pro 214 TDD-EQFVAFHPSAAAYVSQI-DTVEALR----------DQD-----SF---SDRIRALHVYGGKVVRPTGVVVFNKTG 272 (273) Q Consensus 214 ~~~-~~~~~~~~~a~~~~~~~-~~ve~~~----------~~~-----~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~ 272 (273) ... ..++.+..+-+.+..+. ..++..+ +.+ .| ...+++.+++|..+++|+++++|+.+- T Consensus 226 ~~~~~~~~~gd~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 226 DKKKSLALMGDWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred CCCCcEEEEEehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 543 23444444433332221 1222211 111 12 257788999999999999999999999 No 66 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=99.57 E-value=1.2e-15 Score=102.40 Aligned_cols=259 Identities=10% Similarity=0.007 Sum_probs=167.8 Q ss_pred Cccc------chhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEE Q lcl|Aclame:pro 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~------~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 74 (273) |+.. .++|+.++.++.+.+++.+++..++.. ....|.+.++|...... +....+++.+...+++.+.++ T Consensus 6 ~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~----~~~~~~~~~~~~~~~~~-a~~v~E~~~~~~~~~~f~~v~ 80 (299) T protein:vir:41 6 DTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKA----VPMTKPEEEFTFMSGVG-AFWVDEAERIQTSKPTFTKAK 80 (299) T ss_pred CcccccCCCceecchhHHHHHHHHHHhcchhhhhcee----eecCCCcEEEEEEcCCc-eeeeecCccccccccceeEEE Confidence 3332 457999999999999999988888743 22346778899887654 456777887777777778777 Q ss_pred EEEEeeeeceeEechHHHHHhHHHHHH-HHHHHHHHHHHHHHHHHHHH---------HHhhcccccccccCCHhHHHHHH Q lcl|Aclame:pro 75 LLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADM---------LVDNGTALTGSAPSDADDAFDLI 144 (273) Q Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~iD~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~i 144 (273) +...+. +.-+.|++.-..++..++.+ ..++.++++++++|+.++.= +..... ...........+++| T Consensus 81 l~~~k~-~~~~~is~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~~~gil~~~~~--~~~~~~~~~~~~~~l 157 (299) T protein:vir:41 81 MRSKKM-GVIIPTTKENLNYSVTNFFSLMQAEIVEAFYKKFDQAVFTGVESPYNWNILKSATD--ASNLVEETANKYDDL 157 (299) T ss_pred EeeEEE-EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccccccccc--cceeeccccccHHHH Confidence 777553 45567777655556667766 55678999999999987731 111000 011112223457889 Q ss_pred HHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcE-EEEEe Q lcl|Aclame:pro 145 ASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQ-FVAFH 223 (273) Q Consensus 145 ~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~-~~~~~ 223 (273) .++...+..++.+.. .++++|+.+..|.+..+.-. ..... ... .+..++++|.+|+.++.+|.+++.. ++.+. T Consensus 158 ~~~~~~l~~~~~~~~--~~v~n~~~~~~L~~lkd~~G--~~l~~-~~~-~~~~~~l~G~PV~~~~~~~~~~~~~~~~~gd 231 (299) T protein:vir:41 158 NEAIGLIEAEDLEPN--GIATIRKQRVKYRSTKDGNG--MPIFN-TAT-SNGVDDVLGLPIAYTPKYTFGDKDISELVGD 231 (299) T ss_pred HHHHHhhhcccCCcC--EEEEcHHHHHHHHHhhccCC--ceeec-CCc-CCCCceecceeeEEecccCCCCCceEEEEEe Confidence 988888887776433 58999999999975432111 11111 111 2334689999999999999765432 34444 Q ss_pred CceEEEEEe-cceeeeccCCC--------------cc--eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 224 PSAAAYVSQ-IDTVEALRDQD--------------SF--SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 224 ~~a~~~~~~-~~~ve~~~~~~--------------~~--~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) -+-+.+..+ ...++..++.. .+ ...+++.+++|.++.+|+++++|+..+| T Consensus 232 fs~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~~l~~~aa 298 (299) T protein:vir:41 232 WNQAYYGILRGVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFEVGFMVVKDEAFSAVQPKAG 298 (299) T ss_pred cccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccC Confidence 333333322 22444333221 11 2567888999999999999999998888 No 67 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=99.57 E-value=1.3e-15 Score=102.25 Aligned_cols=258 Identities=10% Similarity=-0.022 Sum_probs=168.4 Q ss_pred Ccc---cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEEEEE Q lcl|Aclame:pro 1 MAF---NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLI 77 (273) Q Consensus 1 MA~---~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ti 77 (273) |+. ..+.|+.|..++.+.+++.+++..++.+- ..+|.+++||+....+.+....+++.....+++.+.+++.. T Consensus 30 ~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~~----~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~v~~~~ 105 (324) T protein:vir:96 30 MMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYE----PMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRA 105 (324) T ss_pred cccCCCcceechhHHHHHHHHHHhhchhhhhccee----eccCCceEEEEEecCcceeeecCCccccccccceeEEEEEe Confidence 432 34679999999999999999888887542 23366789999866556667788888877778888877777 Q ss_pred EeeeeceeEechHHHHHhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhcc-------cccccccCCHhHHHHHHHHHHH Q lcl|Aclame:pro 78 DQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGT-------ALTGSAPSDADDAFDLIASALK 149 (273) Q Consensus 78 d~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~iD~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~i~~a~~ 149 (273) .+. ..-+.|++.-..++..++.. +.++.+++++.++|..++.--..... .............+++|.++.. T Consensus 106 ~k~-~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~ 184 (324) T protein:vir:96 106 FKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIKKTNKVIKGDFTQDNIIDLEA 184 (324) T ss_pred EEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCCcCccccccccccceecccccchHHHHHHHH Confidence 553 44467777655555667755 56678999999999987742111100 0001111122345788888888 Q ss_pred HHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCceEEE Q lcl|Aclame:pro 150 ELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAY 229 (273) Q Consensus 150 ~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a~~~ 229 (273) .+..+..... .++++|..+..|.+..+ ..| ...+..+.-+.++|++|+.++..+.+.+ .++.++.+.+.+ T Consensus 185 ~i~~~~~~~~--~~i~n~~~~~~L~~lkd------~~G-~~~~~~~~~~~l~G~PV~~~~~~~~~~~-~~~~gd~s~~~~ 254 (324) T protein:vir:96 185 LLEDDELEAN--AFISKTQNRSLLRKIVD------PET-KERIYDRNSDSLDGLPVVNLKSSNLKRG-ELITGDFDKLIY 254 (324) T ss_pred hhhhccCCCC--EEEEcHHHHHHHHHhhC------CCC-CeeecCCCCCcccceeeEeecCCCCCcc-eEEEEecceEEE Confidence 8877665333 58999999999865322 112 2233446667899999999877665543 355555554433 Q ss_pred EEe-cceeeeccCCC-------------cc---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 230 VSQ-IDTVEALRDQD-------------SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 230 ~~~-~~~ve~~~~~~-------------~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ..+ ...++..++.. .| ...+++.+++|+++++|++++.|+.+.- T Consensus 255 ~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~n~v~~r~~~r~d~~v~~~~a~~~l~~a~~ 315 (324) T protein:vir:96 255 GIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred EEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEecccc Confidence 322 22343333211 12 2678999999999999999999975433 No 68 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=99.57 E-value=1.6e-15 Score=101.74 Aligned_cols=258 Identities=10% Similarity=-0.017 Sum_probs=168.7 Q ss_pred Ccc---cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEEEEE Q lcl|Aclame:pro 1 MAF---NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLI 77 (273) Q Consensus 1 MA~---~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ti 77 (273) |+- ..++|+.|.+++.+.+++.+++..++.+- ..+|.++++|+....+.+....+++.+...+++.+.++++. T Consensus 30 ~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~----~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~ 105 (324) T protein:vir:93 30 MMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYE----PMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRA 105 (324) T ss_pred cccCCCcceechhHHHHHHHHHHhhchhhhhccee----eccCCceEEEEEecCcceeeecCCccccccccceeEEEEEe Confidence 222 24679999999999999999888887442 23366789999866555667788888877778888777777 Q ss_pred EeeeeceeEechHHHHHhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhcc--c-----ccccccCCHhHHHHHHHHHHH Q lcl|Aclame:pro 78 DQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGT--A-----LTGSAPSDADDAFDLIASALK 149 (273) Q Consensus 78 d~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~iD~~~~~~~~~~~~--~-----~~~~~~~~~~~~~~~i~~a~~ 149 (273) .+. +.-+.|++.-..++..++.. ..++.++++++++|+.++.--..... . ............+++|.++.. T Consensus 106 ~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~ 184 (324) T protein:vir:93 106 FKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEA 184 (324) T ss_pred EEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCccccccccccceeccccccHHHHHHHHH Confidence 553 45567777655556667755 45678899999999987642111100 0 000111122345788999888 Q ss_pred HHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCceEEE Q lcl|Aclame:pro 150 ELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAY 229 (273) Q Consensus 150 ~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a~~~ 229 (273) .+..+.... ..++++|..+..|.+..+ ..| ...+..+.-++++|.+|+.++..+.+.. .++++..+-+.+ T Consensus 185 ~l~~~~~~~--~~~v~n~~~~~~L~~l~d------~~G-~~~~~~~~~~~l~G~PVv~~~~~~~~~~-~i~~gdfs~~~~ 254 (324) T protein:vir:93 185 LLEDDELEA--NAFISKTQNRSLLRKIVD------PET-KERIYDRNSDSLDGLPVVNLKSSNLKRG-ELITGDFDKLIY 254 (324) T ss_pred hhhhccCCC--CEEEEcHHHHHHHHHhhC------CCC-CeeecCCCCCcccceeeEeecCCCCCcc-eEEEEecceEEE Confidence 888776533 368999999999965321 122 2234456667899999999877655443 355666554444 Q ss_pred EEe-cceeeeccCCC-------------cc---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 230 VSQ-IDTVEALRDQD-------------SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 230 ~~~-~~~ve~~~~~~-------------~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ..+ ...++..++.. .| ...+++.+++|+.+++|+++++|+.+.- T Consensus 255 ~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~v~~~~a~~~l~~a~~ 315 (324) T protein:vir:93 255 GIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred EEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEecccc Confidence 332 22444333321 12 3688999999999999999999974332 No 69 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=99.56 E-value=2e-15 Score=101.20 Aligned_cols=258 Identities=10% Similarity=-0.028 Sum_probs=168.2 Q ss_pred Ccc--cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEEEEEE Q lcl|Aclame:pro 1 MAF--NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLID 78 (273) Q Consensus 1 MA~--~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid 78 (273) ++. ..++|+.|..++.+.+++.+++..++.+- ..+|.++++|+....+.+....+++.+...+++.+.++++.. T Consensus 31 ~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~----~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~v~~~~~ 106 (324) T protein:vir:97 31 MHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYE----PMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAF 106 (324) T ss_pred ccCCCcceechhHHHHHHHHHHhhcchhhhccee----eccCCceEEEEEecCcceeEeccCccccccccceeEEEEeeE Confidence 222 35679999999999999999888887432 234668999998766666677888888777777777777775 Q ss_pred eeeeceeEechHHHHHhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhcc------c-ccccccCCHhHHHHHHHHHHHH Q lcl|Aclame:pro 79 QEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGT------A-LTGSAPSDADDAFDLIASALKE 150 (273) Q Consensus 79 ~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~iD~~~~~~~~~~~~------~-~~~~~~~~~~~~~~~i~~a~~~ 150 (273) +. ..-+.|++.-..++..++.. +.++.+++++.++|+.++.--..... . ............+++|.++... T Consensus 107 k~-~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~~~~~~~~~~i~~~~~~ 185 (324) T protein:vir:97 107 KL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEAL 185 (324) T ss_pred EE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccCccccccccccceeccccCCHHHHHHHHHh Confidence 53 44557777555555667765 45678999999999988743211100 0 0011111233457889998888 Q ss_pred HhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCceEEEE Q lcl|Aclame:pro 151 LTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYV 230 (273) Q Consensus 151 l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a~~~~ 230 (273) +..++.... .++++|..+..|.+..+ ..| ...+..+.-+.++|.+|+.++..+.+.+. ++.+..+.+.+. T Consensus 186 l~~~~~~~~--~~v~n~~~~~~L~~lkd------~~g-~~~~~~~~~~tl~G~PV~~~~~~~~~~~~-~~~gd~~~~~i~ 255 (324) T protein:vir:97 186 LEDDELEAN--AFISKTQNRSLLRKIVD------PET-KERIYDRNSDTLDGLPVVNLKSSNLKRGE-LITGDFDKLIYG 255 (324) T ss_pred hhhccCCCC--EEEEcHHHHHHHHHhhc------CCC-ceeecCCCCccccceeeEeecCCCCCcce-EEEEecccEEEE Confidence 887765333 67899999998865322 112 12333355568999999998877665543 455555444443 Q ss_pred Eec-ceeeeccCCC-------------cc---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 231 SQI-DTVEALRDQD-------------SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 231 ~~~-~~ve~~~~~~-------------~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) .+. ..++..++.. .| ...+++.+++|+++.+|+++++|+.+.- T Consensus 256 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 315 (324) T protein:vir:97 256 IPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred EecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccC Confidence 332 2444433321 11 3678889999999999999998865433 No 70 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=99.55 E-value=2.9e-15 Score=100.30 Aligned_cols=258 Identities=10% Similarity=-0.017 Sum_probs=168.2 Q ss_pred Ccc---cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEEEEE Q lcl|Aclame:pro 1 MAF---NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLI 77 (273) Q Consensus 1 MA~---~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ti 77 (273) |+- ..+.|+.|...+.+.+++.+++..++..- ...|.+++||+....+.+....+++.+...+++.+.++++. T Consensus 30 ~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~----~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~ 105 (324) T protein:vir:99 30 MMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYE----PMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRA 105 (324) T ss_pred eccCCCcceechhHHHHHHHHHHhhchhhhhccee----eccCCceEEEEEecCcceeEeccCccccccccceeEEEEee Confidence 332 24679999999999999999888877432 23366799999876666677788888777777777777776 Q ss_pred EeeeeceeEechHHHHHhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhccc-------ccccccCCHhHHHHHHHHHHH Q lcl|Aclame:pro 78 DQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTA-------LTGSAPSDADDAFDLIASALK 149 (273) Q Consensus 78 d~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~iD~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~i~~a~~ 149 (273) .+. +.-+.|++.-..++..++.. +.++++++++.++|+.++.--...... .........+..+++|.++.. T Consensus 106 ~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~ 184 (324) T protein:vir:99 106 FKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEA 184 (324) T ss_pred EEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCccCccccccccccceeccccCCHHHHHHHHH Confidence 553 44567777655555566655 566789999999999887432111100 001111223345788999988 Q ss_pred HHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCceEEE Q lcl|Aclame:pro 150 ELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAY 229 (273) Q Consensus 150 ~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a~~~ 229 (273) .|..+..... .++++|..+..|.+..+ ..| ...+..+.-+.++|.+|+.++..+.+.+ .++.++.+-+.+ T Consensus 185 ~l~~~~~~~~--~~v~n~~~~~~L~~l~d------~~g-~~~~~~~~~~~l~G~PVv~~~~~~~~~~-~~i~gd~~~~~~ 254 (324) T protein:vir:99 185 LLEDDELEAN--AFISKTQNRSLLRKIVD------PET-KERIYDRNSDTLDGLPVVNLKSSNLKRG-ELITGDFDKLIY 254 (324) T ss_pred hhhhccCCCC--EEEEcHHHHHHHHHhhc------CCC-ceeecCCCCccccceeEEeecCCCCCcc-eEEEEecccEEE Confidence 8887765333 57899999998864321 112 1233334456799999999987766554 345555554444 Q ss_pred EEe-cceeeeccCC--------C-----cc---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 230 VSQ-IDTVEALRDQ--------D-----SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 230 ~~~-~~~ve~~~~~--------~-----~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ..+ ...++..++. . .| ...+++.+++|+.+++|+++++|+.+.- T Consensus 255 ~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a~~ 315 (324) T protein:vir:99 255 GIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred EEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccC Confidence 332 2244433321 1 11 3678889999999999999999865433 No 71 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=99.55 E-value=3.3e-15 Score=99.98 Aligned_cols=258 Identities=9% Similarity=-0.027 Sum_probs=169.2 Q ss_pred Ccc---cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEEEEE Q lcl|Aclame:pro 1 MAF---NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLI 77 (273) Q Consensus 1 MA~---~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ti 77 (273) |.. ..++|+.|...+++.+++.+++.+++.+ ...+|.++++|+....+.+....+++.++..+++.+.++++. T Consensus 30 ~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~----~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~ 105 (324) T protein:vir:96 30 MMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKY----EPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRA 105 (324) T ss_pred cccCcCccccchhHHHHHHHHHHhhchhhhhcce----eeccCCceEEEEEecCcceeEecCCccccccccceeEEEEee Confidence 322 2578999999999999999988888754 233467799999876666677788888877777777777777 Q ss_pred EeeeeceeEechHHHHHhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhcc--c-----ccccccCCHhHHHHHHHHHHH Q lcl|Aclame:pro 78 DQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGT--A-----LTGSAPSDADDAFDLIASALK 149 (273) Q Consensus 78 d~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~iD~~~~~~~~~~~~--~-----~~~~~~~~~~~~~~~i~~a~~ 149 (273) .+. ..-+.|++.-..++..++.. +.++.++++++++|..++.--..... . .........+..++.|.++.. T Consensus 106 ~k~-~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~~~~~~t~~~i~~~~~ 184 (324) T protein:vir:96 106 FKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEA 184 (324) T ss_pred EEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcCccccccccccceeccccccHHHHHHHHH Confidence 543 44556777555555667765 45678999999999987732111100 0 001111223345888999888 Q ss_pred HHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCceEEE Q lcl|Aclame:pro 150 ELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAY 229 (273) Q Consensus 150 ~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a~~~ 229 (273) .+..+..... .++++|..+..|.+..+ ..| ...+..|.-+.++|.+|+.++..+.+.+ .++.++.+-+.+ T Consensus 185 ~l~~~~~~~~--~~vmn~~~~~~L~~l~d------~~G-~~~~~~~~~~~l~G~PV~~~~~~~~~~~-~~~~gd~~~~~~ 254 (324) T protein:vir:96 185 LLEDDELEAN--AFISKTQNRSLLRKIVD------PET-KERIYDRNSDSLDGLPVVNLKSSNLKRG-ELITGDFDKLIY 254 (324) T ss_pred hhhhccCCCC--EEEEcHHHHHHHHHhhc------cCC-CeeecCCCCCcccceeeEeeCCCCCCcc-eEEEEecceEEE Confidence 8887765333 68999999999865322 112 2234456667899999999877655443 345555544433 Q ss_pred EEe-cceeeeccCCC-------------cc---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 230 VSQ-IDTVEALRDQD-------------SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 230 ~~~-~~~ve~~~~~~-------------~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ..+ ...++..++.. .| ...+++.+++|+.+++|+++++|+.+.- T Consensus 255 g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~ 315 (324) T protein:vir:96 255 GIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred EEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEecccc Confidence 332 22444333211 12 3678899999999999999999975332 No 72 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=99.55 E-value=3.3e-15 Score=99.98 Aligned_cols=258 Identities=9% Similarity=-0.027 Sum_probs=169.2 Q ss_pred Ccc---cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEEEEE Q lcl|Aclame:pro 1 MAF---NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLI 77 (273) Q Consensus 1 MA~---~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ti 77 (273) |.. ..++|+.|...+++.+++.+++.+++.+ ...+|.++++|+....+.+....+++.++..+++.+.++++. T Consensus 30 ~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~----~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~ 105 (324) T protein:vir:78 30 MMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKY----EPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRA 105 (324) T ss_pred cccCcCccccchhHHHHHHHHHHhhchhhhhcce----eeccCCceEEEEEecCcceeEecCCccccccccceeEEEEee Confidence 322 2578999999999999999988888754 233467799999876666677788888877777777777777 Q ss_pred EeeeeceeEechHHHHHhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhcc--c-----ccccccCCHhHHHHHHHHHHH Q lcl|Aclame:pro 78 DQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGT--A-----LTGSAPSDADDAFDLIASALK 149 (273) Q Consensus 78 d~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~iD~~~~~~~~~~~~--~-----~~~~~~~~~~~~~~~i~~a~~ 149 (273) .+. ..-+.|++.-..++..++.. +.++.++++++++|..++.--..... . .........+..++.|.++.. T Consensus 106 ~k~-~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~~~~~~t~~~i~~~~~ 184 (324) T protein:vir:78 106 FKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEA 184 (324) T ss_pred EEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcCccccccccccceeccccccHHHHHHHHH Confidence 543 44556777555555667765 45678999999999987732111100 0 001111223345888999888 Q ss_pred HHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCceEEE Q lcl|Aclame:pro 150 ELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAY 229 (273) Q Consensus 150 ~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a~~~ 229 (273) .+..+..... .++++|..+..|.+..+ ..| ...+..|.-+.++|.+|+.++..+.+.+ .++.++.+-+.+ T Consensus 185 ~l~~~~~~~~--~~vmn~~~~~~L~~l~d------~~G-~~~~~~~~~~~l~G~PV~~~~~~~~~~~-~~~~gd~~~~~~ 254 (324) T protein:vir:78 185 LLEDDELEAN--AFISKTQNRSLLRKIVD------PET-KERIYDRNSDSLDGLPVVNLKSSNLKRG-ELITGDFDKLIY 254 (324) T ss_pred hhhhccCCCC--EEEEcHHHHHHHHHhhc------cCC-CeeecCCCCCcccceeeEeeCCCCCCcc-eEEEEecceEEE Confidence 8887765333 68999999999865322 112 2234456667899999999877655443 345555544433 Q ss_pred EEe-cceeeeccCCC-------------cc---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 230 VSQ-IDTVEALRDQD-------------SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 230 ~~~-~~~ve~~~~~~-------------~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ..+ ...++..++.. .| ...+++.+++|+.+++|+++++|+.+.- T Consensus 255 g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~ 315 (324) T protein:vir:78 255 GIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred EEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEecccc Confidence 332 22444333211 12 3678899999999999999999975332 No 73 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=99.55 E-value=2.4e-15 Score=100.77 Aligned_cols=264 Identities=12% Similarity=0.027 Sum_probs=165.0 Q ss_pred Ccc----cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEEEE Q lcl|Aclame:pro 1 MAF----NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLL 76 (273) Q Consensus 1 MA~----~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~t 76 (273) ||. ..+.|+.++.++++.+++.+.+..++.+- ..++.++++|+....+.+....+++.+...+++.+.+++. T Consensus 1 m~t~t~gg~liP~~~~~~ii~~l~~~s~i~~l~~~~----~~~~~~~~ip~~~~~~~a~wv~E~~~~~~s~~~f~~v~l~ 76 (303) T protein:vir:97 1 MGTETSKASLFDKHLVSDLINKVKGHSSLAKLSSQK----PIPFNGSKEFTFTLDSDIDVVAENGKKTHGGLSLEPVTIV 76 (303) T ss_pred CcccCCCCeEcchhHHHHHHHHHHhhchhhhhccee----ecCCCceEEEEEecCcceEEeecCccccccccceeeEEee Confidence 887 36789999999999999999888887542 2335678999987666778888888887777777776666 Q ss_pred EEeeeeceeEechHHHHHh---HHHHHH-HHHHHHHHHHHHHHHHHHHHHHh---hcc------c-----ccccccCCHh Q lcl|Aclame:pro 77 IDQEKSIDFLVDDIDRVQV---AGSLEA-YTRAGATALATDTDKFIADMLVD---NGT------A-----LTGSAPSDAD 138 (273) Q Consensus 77 id~~~~~~~~i~d~d~~~~---~~~~~~-~~~~~~~ala~~iD~~~~~~~~~---~~~------~-----~~~~~~~~~~ 138 (273) ..+. +.-+.++++-..+. ..++.+ ..++++++++.++|+.++.-... ... . ....+..+.. T Consensus 77 ~~kl-~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (303) T protein:vir:97 77 PIKV-EYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNHFDSKVTQVVKFTESE 155 (303) T ss_pred eEEE-EEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCcccccccccccccccccccccccccc Confidence 5332 34456665533222 234544 56678999999999988743110 000 0 0011112334 Q ss_pred HHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCC-- Q lcl|Aclame:pro 139 DAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD-- 216 (273) Q Consensus 139 ~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~-- 216 (273) ..+++|.++...+...+.... .++++|..+..|.+..+.-.+.-. ....-..+..++++|.+++.|+.+|.... T Consensus 156 ~~~~~i~~~~~~~~~~~~~~~--~~vmn~~~~~~L~~lkd~~g~~~~--~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~ 231 (303) T protein:vir:97 156 DADANIEAAVNLIQGAEGVVT--GLAMDTEFSTALAKVTNGEMGPKM--YPELAWGANPDSINGLKSSVNTTVGAGADEA 231 (303) T ss_pred chHHHHHHHHHHHhhcCCCcc--EEEEcHHHHHHHHHhhccCCCeEE--ecCccCCCCCceecceeeEEecccCCccccC Confidence 567889988888876665333 588999999999754321111000 11111123456899999999999985431 Q ss_pred ---cEEEEEe-CceEEEEEe-cceeeecc--CCC-----cc---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 217 ---EQFVAFH-PSAAAYVSQ-IDTVEALR--DQD-----SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 217 ---~~~~~~~-~~a~~~~~~-~~~ve~~~--~~~-----~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ..++++. ..++.+..+ ...+|... +.+ .| -..+++.+++|+++++|++++.|+.+.= T Consensus 232 ~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~~~~ 303 (303) T protein:vir:97 232 ESKDLVIIGDFESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTKGEV 303 (303) T ss_pred CCccEEEEeeccccEEEEEecCcEEEEeeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEeeCCCC Confidence 1233332 233333332 22333221 111 12 1478889999999999999999987766 No 74 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=99.53 E-value=5.9e-15 Score=98.58 Aligned_cols=262 Identities=13% Similarity=0.050 Sum_probs=166.4 Q ss_pred Cccc--chhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEEEEEE Q lcl|Aclame:pro 1 MAFN--NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLID 78 (273) Q Consensus 1 MA~~--~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid 78 (273) ||.+ .++|+.+..++.+.+++.+++..++..- ..++..+++|+....+.+....+++.....+++.+.+++... T Consensus 1 ma~~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~----~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~ 76 (298) T protein:vir:94 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQK----PIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPI 76 (298) T ss_pred CeeccccccChhHHHHHHHHHHhhchhhhhccee----eccCCceEEEEEecCcceEEeeCCccccccccceeEEEEeee Confidence 9985 6788999999999999998888876432 223456889998665556677888877777777777777765 Q ss_pred eeeeceeEechHHHHHh---HHHHHH-HHHHHHHHHHHHHHHHHHHHHHh-hcc--------------cccccccCCHhH Q lcl|Aclame:pro 79 QEKSIDFLVDDIDRVQV---AGSLEA-YTRAGATALATDTDKFIADMLVD-NGT--------------ALTGSAPSDADD 139 (273) Q Consensus 79 ~~~~~~~~i~d~d~~~~---~~~~~~-~~~~~~~ala~~iD~~~~~~~~~-~~~--------------~~~~~~~~~~~~ 139 (273) +. ..-+.|++.-..++ ..++.+ ..++.++++++++|..++.-... .+. ............ T Consensus 77 k~-~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (298) T protein:vir:94 77 KV-EYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIAD 155 (298) T ss_pred EE-EEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccccccccccccccccccc Confidence 43 34456666544322 234544 56678999999999988743110 000 000011222334 Q ss_pred HHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCC--- Q lcl|Aclame:pro 140 AFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD--- 216 (273) Q Consensus 140 ~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~--- 216 (273) .+++|.++...+..++.... .++++|..+..|.+..+.-.+. ...+....|.-+.++|++|+.++.+|...+ T Consensus 156 ~~~~i~~~~~~~~~~~~~~~--~~vmn~~~~~~l~~lkd~~G~~---l~~~~~~~~~~~tl~G~PV~~~~~v~~~~~~~~ 230 (298) T protein:vir:94 156 PNGAIENAVELLTGVDADVT--GIAINPSFRSALAKQKDLQGNA---LFPELKWGATPDTINGLPVDVNKTVSDMSLTQR 230 (298) T ss_pred HHHHHHHHHHhhhhcCCCcc--EEEEcHHHHHHHHHhhccCCCe---eecCcccCCCCceecceeeEEecccccccCCCc Confidence 57789999888887776433 6999999999996543211111 011233346667899999999999985432 Q ss_pred cEEEEEeCc-eEEEEE-ecceeeecc--CCC-----cc---eeeEEeeeeeeeEEEcCceEEEEecCC Q lcl|Aclame:pro 217 EQFVAFHPS-AAAYVS-QIDTVEALR--DQD-----SF---SDRIRALHVYGGKVVRPTGVVVFNKTG 272 (273) Q Consensus 217 ~~~~~~~~~-a~~~~~-~~~~ve~~~--~~~-----~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~ 272 (273) ..++.+.-+ ++.+.. +...++..+ +++ .| ...+++.+++|..+.+|++++.|+.+- T Consensus 231 ~~~~~Gdfs~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 231 DRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred cEEEEeeccceEEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 234555433 233322 222333222 111 12 246888999999999999999997766 No 75 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=99.53 E-value=7.5e-15 Score=98.03 Aligned_cols=262 Identities=13% Similarity=0.046 Sum_probs=164.4 Q ss_pred Cccc--chhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEEEEEE Q lcl|Aclame:pro 1 MAFN--NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLID 78 (273) Q Consensus 1 MA~~--~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid 78 (273) ||.+ .+.|+.+..++.+.+++...+..++.+- ..++..+++|+....+.+....++..+...+++.+.+++... T Consensus 1 ma~~gG~lvp~~~~~~ii~~~~~~s~i~~l~~~~----~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~f~~v~l~~~ 76 (298) T protein:vir:16 1 MVLNKGTLFDPTLVTDLISKVAGKSSIARLSAQK----PIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPI 76 (298) T ss_pred CcccCcceechhHHHHHHHHHHhhhhhhhhccee----eccCCceEEEEEecCcceEEecCCccccccccceeEEEEeee Confidence 9974 6677777888999999988888877532 223456789998776667778888887777777776666664 Q ss_pred eeeeceeEechHHHHHhH---HHHHH-HHHHHHHHHHHHHHHHHHHHHH---hhccc------------ccccccCCHhH Q lcl|Aclame:pro 79 QEKSIDFLVDDIDRVQVA---GSLEA-YTRAGATALATDTDKFIADMLV---DNGTA------------LTGSAPSDADD 139 (273) Q Consensus 79 ~~~~~~~~i~d~d~~~~~---~~~~~-~~~~~~~ala~~iD~~~~~~~~---~~~~~------------~~~~~~~~~~~ 139 (273) +. ..-+.|+++-..++. .++.+ ..++.++++++++|..++.-.. ..+.. ........... T Consensus 77 k~-a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (298) T protein:vir:16 77 KV-EYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIAD 155 (298) T ss_pred eE-EEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCccccccccccccccccccccccccccc Confidence 43 333566665443332 34544 5567899999999998875311 00000 00011112234 Q ss_pred HHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCC--- Q lcl|Aclame:pro 140 AFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD--- 216 (273) Q Consensus 140 ~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~--- 216 (273) .+++|.++...+..++.+.. .++++|..+..|.+..+.-.+. -.......|.-+++.|.+|+.++.+|.... T Consensus 156 ~~~~i~~~~~~~~~~~~~~~--~~vmn~~~~~~l~~lkd~~G~~---i~~~~~~~~~~~~l~G~PV~~~~~v~~~~~~~~ 230 (298) T protein:vir:16 156 PNGAIENAVELLTGVDADVT--GIAINPSFRSALAKQKDLQDNA---LFPELKWGATPDTINGLPVDVNKTVSDMSLTQR 230 (298) T ss_pred HHHHHHHHHHHhhhcCCCcc--EEEEcHHHHHHHHHhhccCCCe---eecCcccCCCCceecceeeEEecccccccCCCc Confidence 46678888888887776443 4888999999997643211111 012233456667999999999999985432 Q ss_pred cEEEEEeC-ceEEEE-EecceeeeccC--CC-----cc---eeeEEeeeeeeeEEEcCceEEEEecCC Q lcl|Aclame:pro 217 EQFVAFHP-SAAAYV-SQIDTVEALRD--QD-----SF---SDRIRALHVYGGKVVRPTGVVVFNKTG 272 (273) Q Consensus 217 ~~~~~~~~-~a~~~~-~~~~~ve~~~~--~~-----~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~ 272 (273) ..++.+.- .++.+. .+...++..+. +. .| -..+++.+++|+++++|++++.|+.+- T Consensus 231 ~~~~~GDfs~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 231 DRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred cEEEEeeccceEEEEEecCceEEEeeccCCcCcchhhhhcCcEEEEEEEEEccEeecccceEEEeecC Confidence 23444432 333332 22223333222 21 12 157889999999999999999997666 No 76 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=99.52 E-value=6.7e-15 Score=98.29 Aligned_cols=258 Identities=10% Similarity=-0.017 Sum_probs=166.6 Q ss_pred Ccc---cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEEEEE Q lcl|Aclame:pro 1 MAF---NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLI 77 (273) Q Consensus 1 MA~---~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ti 77 (273) |+- ..+.|+.|...+.+.+++.+.+..++.+- ...+.++++|+....+.+....+++.+...+++.+.+++.. T Consensus 30 ~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~~~----~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~ 105 (324) T protein:vir:10 30 MMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYE----PMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRA 105 (324) T ss_pred eccCCCcceechhHHHHHHHHHHhhchhhhhccee----eccCCceEEEEEeCCcceeEeccCccccccccceeEEEEee Confidence 332 24679999999999999999888877442 23356799999876666777888888777777777777766 Q ss_pred EeeeeceeEechHHHHHhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhccc-------ccccccCCHhHHHHHHHHHHH Q lcl|Aclame:pro 78 DQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTA-------LTGSAPSDADDAFDLIASALK 149 (273) Q Consensus 78 d~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~iD~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~i~~a~~ 149 (273) .+. +.-+.|+..-..++..++.. +.+++++++++++|..++.--...... .........+..+++|.++.. T Consensus 106 ~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~a~l~G~g~~~~~~~i~~~~~~~~~~~~~~~t~~~i~~~~~ 184 (324) T protein:vir:10 106 FKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEA 184 (324) T ss_pred EEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCccCccccccccccceeccccCCHHHHHHHHH Confidence 543 44456776555555566655 556789999999999877432111100 001111223345788999888 Q ss_pred HHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCceEEE Q lcl|Aclame:pro 150 ELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAY 229 (273) Q Consensus 150 ~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a~~~ 229 (273) .+..+..... .++++|..+..|.+..+ ..| ...+..+.-+.++|.+|+.++..+.+.+ .++.++.+.+.+ T Consensus 185 ~l~~~~~~~~--~~v~n~~~~~~L~~l~d------~~g-~~~~~~~~~~~l~G~PV~~~~~~~~~~~-~~~~gd~~~~~~ 254 (324) T protein:vir:10 185 LLEDDELEAN--AFISKTQNRSLLRKIVD------PET-KERIYDRNSDTLDGLPVVNLKSSNLKRG-ELITGDFDKLIY 254 (324) T ss_pred hhhhccCCCC--EEEEcHHHHHHHHHhhc------cCC-ceeecCCCCccccceeEEeecCCCCCcc-eEEEEecccEEE Confidence 8877665333 57899999999865322 112 1233334456799999999887665544 345555554444 Q ss_pred EEe-cceeeeccCC--------C-----cc---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 230 VSQ-IDTVEALRDQ--------D-----SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 230 ~~~-~~~ve~~~~~--------~-----~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ..+ ...++..++. . .| ...+++.+++|+.+++|+++++|+.+.- T Consensus 255 ~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a~~ 315 (324) T protein:vir:10 255 GIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred EEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccC Confidence 332 2234433221 1 11 3678889999999999999999865443 No 77 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=99.52 E-value=1.3e-14 Score=96.69 Aligned_cols=266 Identities=12% Similarity=0.005 Sum_probs=159.1 Q ss_pred Ccc---------------------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccc------- Q lcl|Aclame:pro 1 MAF---------------------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPT------- 52 (273) Q Consensus 1 MA~---------------------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~------- 52 (273) ||+ ..++|+-|..++.+.+++..++..++.+ ...+|..+++|+..... T Consensus 1 ~~~~~e~~~~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~----~~~~~~~~~ip~~~~~~~a~~v~~ 76 (338) T protein:vir:78 1 MATLNELAPNTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGEN----IPISYGETIIPTTVKRPEVGQVGV 76 (338) T ss_pred CcchHHhhhhhcccccccceecccccccchHHHHHHHHHHHhhchhhhhcce----eeccCCceEEEEEecCccceeecc Confidence 221 1268999999999999999988888754 23346788999864321 Q ss_pred -cccccCCCCccCCcccccceEEEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHh------ Q lcl|Aclame:pro 53 -VKDYKAAGRQTSADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVD------ 124 (273) Q Consensus 53 -~~d~~~~~~~~~~~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~------ 124 (273) ...+..+++.....+++.+.+++...+. +.-+.|++.-..++..++.++ .++.++++++++|..++.--.. T Consensus 77 ~~~~~~~Eg~~~~~~~~~f~~v~l~~~k~-~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~~ 155 (338) T protein:vir:78 77 GTSNEQREGGTKPLSGTAWDTRSVAPIKL-ATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTGSAL 155 (338) T ss_pred cccccccccccccccccceeEEEEEEEEE-EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccc Confidence 1223456666666666777766666443 445567775555566677664 5678999999999988742110 Q ss_pred -----hccc----ccccccCCHhHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhh-cccccceeee Q lcl|Aclame:pro 125 -----NGTA----LTGSAPSDADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSAD-TSGDAAGLRA 194 (273) Q Consensus 125 -----~~~~----~~~~~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~-~~~~~~~~~~ 194 (273) .... ............++.|.++...+.... ......++++|..+..|.+.. .+.+.+ .......... T Consensus 156 ~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~m~~~~~~~L~~~~-~l~d~~g~~l~~~~~~~ 233 (338) T protein:vir:78 156 QGIDTNNVIVNTTNVDYLQTGTTPLLDRFLDGYDLVSANT-DVDFNGWAADPRYRARLLRSQ-AYRDANGNVDPTRINLA 233 (338) T ss_pred cccccccccccccccccccccchhhHHHHHHHHHHhhhhc-cccceEEEEchHHHHHHHHHh-hhccCCCceeecccccC Confidence 0000 011112223445777888777665432 223346899999999886532 122211 0001122334 Q ss_pred eeeeeecceEEEEecccccCC----C--cEEEEEeCceEEEEEec-ceeeeccCC-------------Ccc---eeeEEe Q lcl|Aclame:pro 195 GTIGNLLGARIVESNNLRDTD----D--EQFVAFHPSAAAYVSQI-DTVEALRDQ-------------DSF---SDRIRA 251 (273) Q Consensus 195 G~ig~i~G~~i~~s~~l~~~~----~--~~~~~~~~~a~~~~~~~-~~ve~~~~~-------------~~~---~~~v~~ 251 (273) |.-+.++|.+|+.++.+|... + ..++.+.-+.+-+..+. ..++..+.. ..| -..+++ T Consensus 234 ~~~~~l~G~PV~~~~~ip~~~~~~~~~~~~~~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~ 313 (338) T protein:vir:78 234 ASAGDLLGLPVQFGKAVGGDLGAATDSKVRVVGGDFSQLKYGFADEIRVKMSDTATLTDNTSPTPQTVSMWQTNQIAILI 313 (338) T ss_pred CCCceeeeeeEEEccccCccccccCCcccEEEEEecceEEEEeecccEEEEeecccccccccccccchhhhhcCcEEEEE Confidence 566789999999999988521 1 23444444433333221 233322221 111 156889 Q ss_pred eeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 252 LHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 252 ~~~~g~~vl~p~~~v~~~~~~s 273 (273) .+++|+.+++|+++++|+.+.- T Consensus 314 ~~r~d~~v~~~~a~~~l~~~~~ 335 (338) T protein:vir:78 314 EVTFGWLLGDKQAFVKFVDDED 335 (338) T ss_pred EEEeccEeecccceEEEecccC Confidence 9999999999999998866544 No 78 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=99.52 E-value=6.2e-15 Score=98.48 Aligned_cols=265 Identities=13% Similarity=0.048 Sum_probs=162.6 Q ss_pred Cccc------chhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEE Q lcl|Aclame:pro 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~------~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 74 (273) ||.. .+.|+.++.++++.+++.+++..++.+- ...+..++||+....+.+....+++.+...+.+.++++ T Consensus 1 Ma~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i----~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~f~~v~ 76 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQ----PTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFT 76 (315) T ss_pred CCCCcCCcCceEcchHHHHHHHHHHHhhchhhhhccee----ecCCCceEEEEEeCCcceEEeeCCccccccccceeeeE Confidence 9973 6679999999999999999888887542 23356789999877666677888888877777777777 Q ss_pred EEEEeeeeceeEechHHHHHhHH----HHHH-HHHHHHHHHHHHHHHHHHHHHHh-hcccc-------c-cc-ccCCHhH Q lcl|Aclame:pro 75 LLIDQEKSIDFLVDDIDRVQVAG----SLEA-YTRAGATALATDTDKFIADMLVD-NGTAL-------T-GS-APSDADD 139 (273) Q Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~----~~~~-~~~~~~~ala~~iD~~~~~~~~~-~~~~~-------~-~~-~~~~~~~ 139 (273) +...+. ..-+.|+++-..++.. .++. +.++.++++++++|..++.=-.. .+... . .+ ....... T Consensus 77 l~~~kl-~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (315) T protein:vir:80 77 AQPIKV-VTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSLNKTKNIVDATDS 155 (315) T ss_pred eeeeeE-EeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCccccccccccccccceeecccc Confidence 776443 3345666654433332 2544 45678899999999877732100 00000 0 00 1111223 Q ss_pred HHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccc--cceeeeeeeeeecceEEEEecccccCCC- Q lcl|Aclame:pro 140 AFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGD--AAGLRAGTIGNLLGARIVESNNLRDTDD- 216 (273) Q Consensus 140 ~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~--~~~~~~G~ig~i~G~~i~~s~~l~~~~~- 216 (273) .+++|.++...+..+...... ..+++|..+..|.+....-. .+..+. -..+..|.-++++|.+|+.++.+|.... T Consensus 156 ~~~d~~~~~~~~~~~~~~~~~-~~imn~~~~~~L~~l~~~~g-~~~~g~~~~~~~~~g~~~tl~G~PV~~~~~~~~~~~~ 233 (315) T protein:vir:80 156 ATADLVKAVGLIAGAGLQVPN-GVALDPAFSFALSTEVYPKG-SPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEM 233 (315) T ss_pred chHHHHHHHHHHhhccCccce-EEEEcHHHHHHHHHHhhccC-CcccccccccccccCCCceecceeeEecCcCCccccc Confidence 466788877777655543333 47899999999976532111 111111 0123345557899999999999885432 Q ss_pred -----cEEEEEeCc--eEEEEEecceeeeccCC--C-----cc---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 217 -----EQFVAFHPS--AAAYVSQIDTVEALRDQ--D-----SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 217 -----~~~~~~~~~--a~~~~~~~~~ve~~~~~--~-----~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ..++.+.-+ .++....+ .++..+.. . .| ...+++.+++|.++.+|+++++|+..++ T Consensus 234 ~~~~~~~~~~GDfs~~~~g~~~~~-~i~i~~~~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~a 306 (315) T protein:vir:80 234 SPASGVKAIVGDFSRVHWGFQRNF-PIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAA 306 (315) T ss_pred ccccccEEEEeecccEEEEEecCe-eEEEeccccccCcccchhhcCcEEEEEEEEecceeecccceEEEeeccC Confidence 122333222 23332222 33322221 1 11 2578888999999999999999987776 No 79 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=99.52 E-value=9.6e-15 Score=97.44 Aligned_cols=262 Identities=9% Similarity=0.022 Sum_probs=157.7 Q ss_pred Cccc------chhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEE Q lcl|Aclame:pro 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~------~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 74 (273) |+.. -+.|+.|..++++.+++.+++..++.+ ....|.+++||+....+.+....+++.+...+++.++++ T Consensus 14 ~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~----~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~~v~ 89 (320) T protein:vir:10 14 IAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQK----VPMGTTGQKIPHWIGDVSAQWIGEGDMKPITKGNMTSQN 89 (320) T ss_pred hhccccccccccccHHHHHHHHHHHHhccchhhhcce----eeccCCceEEEEEeCCcceEEecCCccccccccceeEEE Confidence 4332 256888889999999998888887644 223467789999876666677788888877777777777 Q ss_pred EEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHH-hhc-------cccc--ccccCCHhH---H Q lcl|Aclame:pro 75 LLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLV-DNG-------TALT--GSAPSDADD---A 140 (273) Q Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~-~~~-------~~~~--~~~~~~~~~---~ 140 (273) +...+. +.-+.|++.-..++..++.++ .++.++++++++|+.++.--. ..+ .... .....+... . T Consensus 90 ~~~~k~-~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 168 (320) T protein:vir:10 90 IAPHKI-ATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALNGTDSPFPTYLAQTTKSVSLADPGGATASDLTAY 168 (320) T ss_pred EeeEEE-EEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCcccccccccccceecccccccccccH Confidence 766543 445677776555666677664 567889999999998873110 000 0000 011111111 1 Q ss_pred HHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhh-----hcccccceeeeeeeeeecceEEEEecccccCC Q lcl|Aclame:pro 141 FDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSA-----DTSGDAAGLRAGTIGNLLGARIVESNNLRDTD 215 (273) Q Consensus 141 ~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~-----~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~ 215 (273) .+.+.++...+..... ..-.++++|..+..|.+..+.-.+. ...+... ...-++++|++++.++.+|.+. T Consensus 169 ~~~~~~~~~~~~~~~~--~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~---~~~~~~i~g~pv~~~~~~~~~~ 243 (320) T protein:vir:10 169 DAVAVNGLSLLVNAKK--KWTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDENS---PFRAGRIVSRPTILSDHVADGT 243 (320) T ss_pred HHHHHHHHhhhhcccC--CCcEEEEcHHHHHHHHHhhccCCceeeccccccCccc---cccCceeeeeeeEecCCCCCCc Confidence 2235555555555443 3347899999999996543211110 0011111 1112468999999999987654 Q ss_pred CcEEEEEeCceEEEEEec-ceeeeccC--------CCc-----c---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 216 DEQFVAFHPSAAAYVSQI-DTVEALRD--------QDS-----F---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 216 ~~~~~~~~~~a~~~~~~~-~~ve~~~~--------~~~-----~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) . .++.++.+-+.+..+. ..++..++ ... | ...+++.+++|+++++|+++++|+..++ T Consensus 244 ~-~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~~~~a~~~l~~~~a 317 (320) T protein:vir:10 244 T-VGYMGDFRNVIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFHNNDKDAFVKLTNVVT 317 (320) T ss_pred e-EEEEeecceEEEEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEEEeeccEEecccceEEEEeccC Confidence 2 2333443333333221 23332222 111 1 2567888999999999999999987777 No 80 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=99.51 E-value=1.1e-14 Score=97.07 Aligned_cols=264 Identities=13% Similarity=0.023 Sum_probs=162.8 Q ss_pred CcccchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCC-cccccceEEEEEEe Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSA-DAISDTGVDLLIDQ 79 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~-~~~~~~~~~~tid~ 79 (273) -....++|+.|...+++.++....+..++..-.. .....++.+|+....+......++..... +..+.+.+++.+.+ T Consensus 127 ~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~i~~~~~k 204 (415) T protein:vir:94 127 DSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV--TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINT 204 (415) T ss_pred ccccccCcHHHHHHHHHHHHhhhhhhhhcceeec--cCCceeEEEEeecCCccceeccccccccccccccceeeEeehee Confidence 1123568999999999999999888887754221 11123455565544444555666665542 33456666666644 Q ss_pred eeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhccc--------ccccccCCHhHHHHHHHHHHHH Q lcl|Aclame:pro 80 EKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVDNGTA--------LTGSAPSDADDAFDLIASALKE 150 (273) Q Consensus 80 ~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~i~~a~~~ 150 (273) . +.-+.|++.-..++..++.++ .++.+++++..+|..++.-....... .......+....+++|.++... T Consensus 205 ~-~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 283 (415) T protein:vir:94 205 H-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINL 283 (415) T ss_pred e-eeechhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchHHHHHHHHh Confidence 3 334566765555556677665 45678899999999887654321110 0111222333457888888888 Q ss_pred HhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCC--cEEEEEe-CceE Q lcl|Aclame:pro 151 LTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD--EQFVAFH-PSAA 227 (273) Q Consensus 151 l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~--~~~~~~~-~~a~ 227 (273) +...... +-.++++|..+..|.+..+.-.+ ... ...+.+|..+.+.|++|+.++.+|.++. ..++.+. ..++ T Consensus 284 ~~~~~~~--~~~~vmn~~~~~~l~~lkd~~G~--~l~-~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~i~~gd~~~~~ 358 (415) T protein:vir:94 284 NVKPNYE--HNVAIVSQTMFAKLDKMKDKLGN--YLI-QPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAI 358 (415) T ss_pred hhhhccC--CCEEEEcHHHHHHHHHhhccCCC--eee-ccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccE Confidence 7766653 23688999999999654321111 111 1123456677899999999998886553 2345554 3344 Q ss_pred EEEEec-ceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 228 AYVSQI-DTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 228 ~~~~~~-~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ....+. ..++..+ ...+.+.+++-+++|+.+++|++++.++-+.+ T Consensus 359 ~~~~~~~~~v~~~~-~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 404 (415) T protein:vir:94 359 VLFDRSQYQASWTD-YMHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred EEEeecceEEEEec-cccCceEEEEEEEeccEEeccccEEEEEEecc Confidence 444333 3444333 33456788999999999999999999865555 No 81 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=99.51 E-value=1.5e-14 Score=96.32 Aligned_cols=266 Identities=12% Similarity=0.007 Sum_probs=160.2 Q ss_pred Ccc--------------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCc Q lcl|Aclame:pro 1 MAF--------------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSAD 66 (273) Q Consensus 1 MA~--------------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~ 66 (273) ||- ..+.|+.+..++++.+++.+++.+++.. ....+..+++|+....+.+....+++.+... T Consensus 1 m~~~~~~a~~~~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~----~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~ 76 (330) T protein:vir:77 1 MAGSTVPSTQVALTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARK----VPMGPTGISIPHWTGAVSASWTGEAERKPIT 76 (330) T ss_pred CcccccchhhccccCCCcceechhHHHHHHHHHHhccchhhhcce----eeccCCceEEEEEcCCcceeEecCCCccccc Confidence 432 1223444567899999999988888754 2233566899998766666677888888777 Q ss_pred ccccceEEEEEEeeeeceeEechHHHHHhHHHHHH-HHHHHHHHHHHHHHHHHHHH----------HHhhcc------cc Q lcl|Aclame:pro 67 AISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADM----------LVDNGT------AL 129 (273) Q Consensus 67 ~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~iD~~~~~~----------~~~~~~------~~ 129 (273) +++..++++...+. +.-+.|++.-..++..++.+ +.++.+++++.++|+.++.= +..... .. T Consensus 77 ~~~f~~i~~~~~k~-~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~~~~~~~~~~ 155 (330) T protein:vir:77 77 KGSFGKQELEPVKI-TTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETTKVVSLADTN 155 (330) T ss_pred cceeeEEEEeEEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccccccccccccceeeccc Confidence 77777777777443 44457777655555667766 55678999999999988721 110000 01 Q ss_pred cccccCCHhHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcc--cccceeeeeeeeeecceEEEE Q lcl|Aclame:pro 130 TGSAPSDADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTS--GDAAGLRAGTIGNLLGARIVE 207 (273) Q Consensus 130 ~~~~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~--~~~~~~~~G~ig~i~G~~i~~ 207 (273) ...........+++|.++...+..++.+. ..++++|..+..|.+..+.-.+.-.. ........+.-++++|++|+. T Consensus 156 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~--~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~PV~~ 233 (330) T protein:vir:77 156 LTTASGPQGNAYLAVNNALSLLVNSGKKW--TGTLLDNVTEPILNTAVDGNGRPLFVESTYTEQVGAIREGRILGRPTYV 233 (330) T ss_pred ccccccccchhHHHHHHHHHhhhhcCCCc--cEEEEcHHHHHHHHHHhccCCceeecCccccccccccCCceecceeeEE Confidence 11122233456778888888887776533 35799999999987533211110000 000011112335799999999 Q ss_pred ecccccCCCc---EEEEEeCceEEEEEec-ceeeeccCC-----------------Ccc---eeeEEeeeeeeeEEEcCc Q lcl|Aclame:pro 208 SNNLRDTDDE---QFVAFHPSAAAYVSQI-DTVEALRDQ-----------------DSF---SDRIRALHVYGGKVVRPT 263 (273) Q Consensus 208 s~~l~~~~~~---~~~~~~~~a~~~~~~~-~~ve~~~~~-----------------~~~---~~~v~~~~~~g~~vl~p~ 263 (273) ++.+|..++. .++.+..+.+.+..+. ..++..++. ..| ...+++.+++|+.+.+|+ T Consensus 234 ~~~~p~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~ 313 (330) T protein:vir:77 234 ADNVVNGTVGNRVVGVMGDFSQVIWGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLWQHNMVAVRCEAEFAFMVNDKD 313 (330) T ss_pred eccccCCCCCCccEEEEEecceEEEEEecCcEEEEeecceeeecccccccccccccchhhcCcEEEEEEEEeccEEeccc Confidence 9999865432 2344454444333221 123222111 111 367899999999999999 Q ss_pred eEEEEec--CCC Q lcl|Aclame:pro 264 GVVVFNK--TGS 273 (273) Q Consensus 264 ~~v~~~~--~~s 273 (273) ++++|+. +++ T Consensus 314 a~~~i~~~~~~~ 325 (330) T protein:vir:77 314 AFVKLTDQVAGT 325 (330) T ss_pred ceEEEEeccCCc Confidence 9888743 333 No 82 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=99.51 E-value=1.3e-14 Score=96.73 Aligned_cols=263 Identities=14% Similarity=0.072 Sum_probs=162.1 Q ss_pred Ccc----cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEEEE Q lcl|Aclame:pro 1 MAF----NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLL 76 (273) Q Consensus 1 MA~----~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~t 76 (273) ||- ..++|+.+...+++.+++..++..++.+- ..++..+++|+....+.+....+++.....+++.+++++. T Consensus 1 mat~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i----~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f~~v~l~ 76 (311) T protein:vir:81 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAE----PQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAI 76 (311) T ss_pred CceecCCceEcchhHHHHHHHHHHhcchhhhhccee----ecCCCceEEEEEeCCceeEEeecCcccccccceeeEEEEe Confidence 876 37889999999999999999888887542 2234568999987666667778888877777777777776 Q ss_pred EEeeeeceeEechHHHHHh---HHHHHH-HHHHHHHHHHHHHHHHHHHHHHhh-cc-------------cccccccCCHh Q lcl|Aclame:pro 77 IDQEKSIDFLVDDIDRVQV---AGSLEA-YTRAGATALATDTDKFIADMLVDN-GT-------------ALTGSAPSDAD 138 (273) Q Consensus 77 id~~~~~~~~i~d~d~~~~---~~~~~~-~~~~~~~ala~~iD~~~~~~~~~~-~~-------------~~~~~~~~~~~ 138 (273) ..+. +.-+.|+++-..++ ..++.+ ..++.+++++.++|..++.--... +. .....+..+.. T Consensus 77 ~~kl-~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~~~~~~~~ 155 (311) T protein:vir:81 77 PRKV-QVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSA 155 (311) T ss_pred eEEE-EEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccccceeeeecccccc Confidence 6544 33456666533222 233544 567789999999999877432100 00 00111222223 Q ss_pred HHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCC-- Q lcl|Aclame:pro 139 DAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD-- 216 (273) Q Consensus 139 ~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~-- 216 (273) ..+..|.++...+...+... ..++++|..+..|.+..+.-.+. . .......|..+.++|.+++.++.+|.... T Consensus 156 ~~~~~i~~~~~~~~~~~~~~--~~~vmn~~~~~~l~~lkd~~G~~--l-~~~~~~~~~~~tl~G~Pv~~~~~i~~~~~~~ 230 (311) T protein:vir:81 156 TPDLAVEAAVGLVLGDNLSP--DGVALDNTFSFMLATQRDSQGRK--L-YPELGFGTDVASFAGLNAAVSDTVRGGPEAV 230 (311) T ss_pred hHHHHHHHHHHHhhhcCCCc--eEEEEcHHHHHHHHhhhccCCCe--e-ecCccccCCCceecceeEEeccccccccccc Confidence 34455667766766655433 34899999999996543211110 0 11122235568899999999998874321 Q ss_pred -------------cEEEEEeCceEEEEEe-cceeeeccCCC------cc---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 217 -------------EQFVAFHPSAAAYVSQ-IDTVEALRDQD------SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 217 -------------~~~~~~~~~a~~~~~~-~~~ve~~~~~~------~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ..++++.-+-+.+..+ ...++..++.. .| ...+++.+++|+++++|++++.|+.+.+ T Consensus 231 ~~~~~~~~~~~~~~~~~~gDfs~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~a~~ 310 (311) T protein:vir:81 231 TASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADE 310 (311) T ss_pred ccccchhcccCCccEEEEEecccEEEEEeccceEEEeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEEEeecc Confidence 1123333332222222 22333333221 11 2578888999999999999999988888 No 83 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=99.51 E-value=1.9e-14 Score=95.78 Aligned_cols=258 Identities=16% Similarity=0.056 Sum_probs=162.1 Q ss_pred Ccc--cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccc-cccccccCCCCccCCcccccceEEEEE Q lcl|Aclame:pro 1 MAF--NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVA-PTVKDYKAAGRQTSADAISDTGVDLLI 77 (273) Q Consensus 1 MA~--~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~~~~~~~~~~~ti 77 (273) .+. ..+.|+.|...+++.+++...+.+++.+-. ..|.++++|+... ...+....+++.....+++.+.+++++ T Consensus 117 ~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~----~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~i~~~~ 192 (395) T protein:vir:43 117 IDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGT----TESNSVEYVRETGFVNNAAPVSEGTQKPYSDLTFELENAPV 192 (395) T ss_pred cCCCCccccchhhHHHHHHHHHhhhhHHhhcccee----cCCCceEEEEEecCCCceeeecCCccccccccceeEEEEee Confidence 111 135677788999999999988888875432 2356788988644 234555677777666677777777777 Q ss_pred EeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHH----------Hhhcc-cccccccCCHhHHHHHHH Q lcl|Aclame:pro 78 DQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADML----------VDNGT-ALTGSAPSDADDAFDLIA 145 (273) Q Consensus 78 d~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~----------~~~~~-~~~~~~~~~~~~~~~~i~ 145 (273) .+. +.-+.|++.-.. ...++.++ .++.+.+++.++|..++.-- ..... ....+...+....++.|. T Consensus 193 ~k~-~~~~~is~ell~-d~~~l~~~v~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~i~ 270 (395) T protein:vir:43 193 RTI-AHLFKASRQILD-DASALQSYIDARARYGLMLVEECQLLYGNGTGANLHGIIPQAQAYAPPSGVVVTAEQRIDRIR 270 (395) T ss_pred eeE-EEeehhhHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccccchhHHHHH Confidence 554 334567765333 33457665 45678899999999887421 00000 011122233345688888 Q ss_pred HHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCc Q lcl|Aclame:pro 146 SALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPS 225 (273) Q Consensus 146 ~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~ 225 (273) ++...+.....+. -.++++|..+..|.+..+.-.+ ... . ...+|.-+.++|++|+.++.+|.+. ++.+.-+ T Consensus 271 ~~~~~~~~~~~~~--~~~vmn~~~~~~l~~lkd~~G~--~i~-~-~~~~~~~~~l~G~pVv~~~~~~~~~---~~~gd~~ 341 (395) T protein:vir:43 271 LAILQAQLAEFPA--SGIVLNPIDWALIELNKDAENR--YII-G-SPQNGTTPTLWRLPVVETQAITQDE---FLTGAFS 341 (395) T ss_pred HHHHhhccccCCC--cEEEEcHHHHHHHHHhhccCCc--eec-c-ccccCCCceecceeeEEcCCCCCCc---EEEEecc Confidence 8888887766533 3689999999988654321111 111 1 1234666789999999999998654 3444332 Q ss_pred -eEEEEE-ecceeeeccCCC-cc---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 226 -AAAYVS-QIDTVEALRDQD-SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 226 -a~~~~~-~~~~ve~~~~~~-~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ++-... ....++..+... .| ...+++.+++|+++++|++++.++-++| T Consensus 342 ~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~taa 395 (395) T protein:vir:43 342 LGAQIFDRMDIEVLVSTENDKDFENNMVTIRAEERLAFAVYRPEAFVTGSLTAS 395 (395) T ss_pred ceEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEeccC Confidence 222222 233444444322 12 3578889999999999999999988888 No 84 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=99.51 E-value=1.5e-14 Score=96.42 Aligned_cols=264 Identities=13% Similarity=0.029 Sum_probs=162.3 Q ss_pred Cc-------ccchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCC-cccccce Q lcl|Aclame:pro 1 MA-------FNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSA-DAISDTG 72 (273) Q Consensus 1 MA-------~~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~-~~~~~~~ 72 (273) ++ ...++|+.|...+++.++...++..++..-.. ....-++.+|+...........++..... +..+.+. T Consensus 120 ~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~ 197 (415) T protein:vir:79 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV--TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQ 197 (415) T ss_pred hhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeec--cCCceeEEEEeecCCccceeeccccccCcccccceee Confidence 11 13578999999999999998888887654221 11112455555544444455566655543 3345666 Q ss_pred EEEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhccc--------ccccccCCHhHHHHH Q lcl|Aclame:pro 73 VDLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVDNGTA--------LTGSAPSDADDAFDL 143 (273) Q Consensus 73 ~~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~ 143 (273) +++.+.+. +.-+.|++.-..++..++.++ .+..+++++.++|..++.-....... .......+....+++ T Consensus 198 v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 276 (415) T protein:vir:79 198 LAYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDD 276 (415) T ss_pred EEeeeeee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchhH Confidence 66666543 334566666555556677765 45678899999999887654221110 111222233456888 Q ss_pred HHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCC--cEEEE Q lcl|Aclame:pro 144 IASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD--EQFVA 221 (273) Q Consensus 144 i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~--~~~~~ 221 (273) |.++...+....... -.++++|..+..|.+..+.-. .... ...+.+|..+.+.|++|+.++.+|.++. ..++. T Consensus 277 i~~~~~~~~~~~~~~--~~~v~n~~~~~~l~~lkd~~G--~~l~-~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~ 351 (415) T protein:vir:79 277 IKDAINLNVKPNYEH--NVAIVSQTMFAKLDKMKDKLG--NYLI-QPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLII 351 (415) T ss_pred HHHHHHhhhhhccCC--CEEEEcHHHHHHHHHhhccCC--ceee-ccCcCCCCCceecceeeEEecccccCCCCccEEEE Confidence 988888887766543 257899999999965322111 1111 1123456667999999999988886543 33455 Q ss_pred Ee-CceEEEEEec-ceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 222 FH-PSAAAYVSQI-DTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 222 ~~-~~a~~~~~~~-~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) +. ..++....+. ..++..+. ..+.+.+++-+++|+.+++|++++.++-+.+ T Consensus 352 Gd~~~~~~~~~~~~~~v~~~~~-~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:79 352 GNLKDAIVLFDRSQYQASWTDY-MHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred EehhccEEEEeecceEEEEecc-ccCceEEEEEEEeccEEeccccEEEEEEecc Confidence 54 3334344333 34444333 3456788899999999999999999976666 No 85 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=99.51 E-value=1.5e-14 Score=96.42 Aligned_cols=264 Identities=13% Similarity=0.029 Sum_probs=162.3 Q ss_pred Cc-------ccchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCC-cccccce Q lcl|Aclame:pro 1 MA-------FNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSA-DAISDTG 72 (273) Q Consensus 1 MA-------~~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~-~~~~~~~ 72 (273) ++ ...++|+.|...+++.++...++..++..-.. ....-++.+|+...........++..... +..+.+. T Consensus 120 ~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~ 197 (415) T protein:vir:81 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV--TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQ 197 (415) T ss_pred hhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeec--cCCceeEEEEeecCCccceeeccccccCcccccceee Confidence 11 13578999999999999998888887654221 11112455555544444455566655543 3345666 Q ss_pred EEEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhccc--------ccccccCCHhHHHHH Q lcl|Aclame:pro 73 VDLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVDNGTA--------LTGSAPSDADDAFDL 143 (273) Q Consensus 73 ~~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~ 143 (273) +++.+.+. +.-+.|++.-..++..++.++ .+..+++++.++|..++.-....... .......+....+++ T Consensus 198 v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 276 (415) T protein:vir:81 198 LAYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDD 276 (415) T ss_pred EEeeeeee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchhH Confidence 66666543 334566666555556677765 45678899999999887654221110 111222233456888 Q ss_pred HHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCC--cEEEE Q lcl|Aclame:pro 144 IASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD--EQFVA 221 (273) Q Consensus 144 i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~--~~~~~ 221 (273) |.++...+....... -.++++|..+..|.+..+.-. .... ...+.+|..+.+.|++|+.++.+|.++. ..++. T Consensus 277 i~~~~~~~~~~~~~~--~~~v~n~~~~~~l~~lkd~~G--~~l~-~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~ 351 (415) T protein:vir:81 277 IKDAINLNVKPNYEH--NVAIVSQTMFAKLDKMKDKLG--NYLI-QPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLII 351 (415) T ss_pred HHHHHHhhhhhccCC--CEEEEcHHHHHHHHHhhccCC--ceee-ccCcCCCCCceecceeeEEecccccCCCCccEEEE Confidence 988888887766543 257899999999965322111 1111 1123456667999999999988886543 33455 Q ss_pred Ee-CceEEEEEec-ceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 222 FH-PSAAAYVSQI-DTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 222 ~~-~~a~~~~~~~-~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) +. ..++....+. ..++..+. ..+.+.+++-+++|+.+++|++++.++-+.+ T Consensus 352 Gd~~~~~~~~~~~~~~v~~~~~-~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:81 352 GNLKDAIVLFDRSQYQASWTDY-MHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred EehhccEEEEeecceEEEEecc-ccCceEEEEEEEeccEEeccccEEEEEEecc Confidence 54 3334344333 34444333 3456788899999999999999999976666 No 86 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=99.51 E-value=1.5e-14 Score=96.42 Aligned_cols=264 Identities=13% Similarity=0.029 Sum_probs=162.3 Q ss_pred Cc-------ccchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCC-cccccce Q lcl|Aclame:pro 1 MA-------FNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSA-DAISDTG 72 (273) Q Consensus 1 MA-------~~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~-~~~~~~~ 72 (273) ++ ...++|+.|...+++.++...++..++..-.. ....-++.+|+...........++..... +..+.+. T Consensus 120 ~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~ 197 (415) T protein:vir:98 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV--TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQ 197 (415) T ss_pred hhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeec--cCCceeEEEEeecCCccceeeccccccCcccccceee Confidence 11 13578999999999999998888887654221 11112455555544444455566655543 3345666 Q ss_pred EEEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhccc--------ccccccCCHhHHHHH Q lcl|Aclame:pro 73 VDLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVDNGTA--------LTGSAPSDADDAFDL 143 (273) Q Consensus 73 ~~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~ 143 (273) +++.+.+. +.-+.|++.-..++..++.++ .+..+++++.++|..++.-....... .......+....+++ T Consensus 198 v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 276 (415) T protein:vir:98 198 LAYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDD 276 (415) T ss_pred EEeeeeee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchhH Confidence 66666543 334566666555556677765 45678899999999887654221110 111222233456888 Q ss_pred HHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCC--cEEEE Q lcl|Aclame:pro 144 IASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD--EQFVA 221 (273) Q Consensus 144 i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~--~~~~~ 221 (273) |.++...+....... -.++++|..+..|.+..+.-. .... ...+.+|..+.+.|++|+.++.+|.++. ..++. T Consensus 277 i~~~~~~~~~~~~~~--~~~v~n~~~~~~l~~lkd~~G--~~l~-~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~ 351 (415) T protein:vir:98 277 IKDAINLNVKPNYEH--NVAIVSQTMFAKLDKMKDKLG--NYLI-QPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLII 351 (415) T ss_pred HHHHHHhhhhhccCC--CEEEEcHHHHHHHHHhhccCC--ceee-ccCcCCCCCceecceeeEEecccccCCCCccEEEE Confidence 988888887766543 257899999999965322111 1111 1123456667999999999988886543 33455 Q ss_pred Ee-CceEEEEEec-ceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 222 FH-PSAAAYVSQI-DTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 222 ~~-~~a~~~~~~~-~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) +. ..++....+. ..++..+. ..+.+.+++-+++|+.+++|++++.++-+.+ T Consensus 352 Gd~~~~~~~~~~~~~~v~~~~~-~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:98 352 GNLKDAIVLFDRSQYQASWTDY-MHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred EehhccEEEEeecceEEEEecc-ccCceEEEEEEEeccEEeccccEEEEEEecc Confidence 54 3334344333 34444333 3456788899999999999999999976666 No 87 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=99.50 E-value=1.2e-14 Score=96.97 Aligned_cols=263 Identities=11% Similarity=0.019 Sum_probs=165.8 Q ss_pred Cccc-----chhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEEE Q lcl|Aclame:pro 1 MAFN-----NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDL 75 (273) Q Consensus 1 MA~~-----~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 75 (273) ||.. .++|+.+..++++.++..+++..++.. ...++..+++|+....+.+...++++.....+++.+.+++ T Consensus 1 ma~~t~~~G~lip~~~~~~ii~~l~~~s~i~~l~~~----~~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~s~~~f~~v~l 76 (300) T protein:vir:95 1 MSEAQLSKGNLFNPELVTKVINKVKGHSSIAKLSPQ----KPIPFNGQREFVFDFDSDIDIVAENGKKTHGGVSLDPVTI 76 (300) T ss_pred CcccccCCcceechhhHHHHHHHHHhhhhhhhhcce----eeccCCceEEEEEecCcceEEeeCCcccccccccceeeEe Confidence 9973 456888899999999998888777643 2234556889997666667788888888777777777777 Q ss_pred EEEeeeeceeEechHHHHHh---HHHHHH-HHHHHHHHHHHHHHHHHHHHHH---hhcc----------cccccccCCHh Q lcl|Aclame:pro 76 LIDQEKSIDFLVDDIDRVQV---AGSLEA-YTRAGATALATDTDKFIADMLV---DNGT----------ALTGSAPSDAD 138 (273) Q Consensus 76 tid~~~~~~~~i~d~d~~~~---~~~~~~-~~~~~~~ala~~iD~~~~~~~~---~~~~----------~~~~~~~~~~~ 138 (273) +..+. +.-+.|+++-..++ ..++.+ +.++.+++++.++|..++.-.. ..+. ........+.. T Consensus 77 ~~~k~-~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 155 (300) T protein:vir:95 77 VPLKV-EYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKKVTQTVPFKDT 155 (300) T ss_pred eeEEE-EEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccccceeeccccc Confidence 76443 44456666533222 234544 5667899999999998884310 0000 01111223345 Q ss_pred HHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCC-- Q lcl|Aclame:pro 139 DAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD-- 216 (273) Q Consensus 139 ~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~-- 216 (273) ..++.|.++...+...+.... ..+++|..+..|.+..+.-.+. . .......|.-++++|.+|+.++.+|.... T Consensus 156 ~~~~~i~~~~~~~~~~~~~~~--~~vmn~~~~~~L~~lkd~~G~~-i--~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~ 230 (300) T protein:vir:95 156 NPDESMEDAVGMIDGSERDIT--GAILDPIFTTALSKMKNAEGGK-L--YPELAWGGVPDAINGLAVDKNRTVSYSQTDP 230 (300) T ss_pred chHHHHHHHHHHhhhcCCCcc--EEEECHHHHHHHHHhhccCCCe-e--ccCccccCCCceecceeeEEecCCCCCCCCC Confidence 567889898888877665333 5889999999996543211110 0 11222345678999999999999986542 Q ss_pred -cEEEEEeC-ceEEEE-Eecceeee--ccCCC-----cc---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 217 -EQFVAFHP-SAAAYV-SQIDTVEA--LRDQD-----SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 217 -~~~~~~~~-~a~~~~-~~~~~ve~--~~~~~-----~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ..++++.- .++-+. .+...++. +.+.+ .| ...+++.+++|+.+++|++++.|+..+= T Consensus 231 ~~~~~~GDf~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~g 300 (300) T protein:vir:95 231 KNTAIVGDFETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYIGWGIMDAASFARIVKTGG 300 (300) T ss_pred ccEEEEeeccceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEeecceeecccceEEEecCCC Confidence 22344432 222222 22222222 22221 12 2678999999999999999999976666 No 88 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=99.50 E-value=1.9e-14 Score=95.76 Aligned_cols=258 Identities=15% Similarity=0.063 Sum_probs=160.3 Q ss_pred Ccc-----cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccc-cccccccCCCCccCCcccccceEE Q lcl|Aclame:pro 1 MAF-----NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVA-PTVKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~-----~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~~~~~~~~~~ 74 (273) |.. ..+.|+.+...+++.+.+.+.+..++.+- ...|.++++|+... ...+....+++.+...+++...++ T Consensus 105 ~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 180 (385) T protein:vir:19 105 LGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQG----RTSSNALEYVREEVFTNNADVVAEKALKPESDITFSKQT 180 (385) T ss_pred hccccccCCceecchhhhHHHHHhhhccchhhhccee----cccCcceEEEEEecCCcceeeeccCccccccccceeEEE Confidence 222 23456667788999999888887776442 22356789998754 344556677777776777777777 Q ss_pred EEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHh---------hcccccccccCCHhHHHHHH Q lcl|Aclame:pro 75 LLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVD---------NGTALTGSAPSDADDAFDLI 144 (273) Q Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~i 144 (273) +++.+. +.-+.|+..-.. ....+.++ .++.+.+++.++|..++.--.. .+.....+...+....++.| T Consensus 181 ~~~~k~-~~~~~is~ell~-d~~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~d~i 258 (385) T protein:vir:19 181 ANVKTI-AHWVQASRQVMD-DAPMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLNATGDTRADII 258 (385) T ss_pred EeeeeE-EEeehhhHHHHh-hHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccccccchHHHH Confidence 777654 344567764332 33456665 4568899999999887742100 01111111222344567889 Q ss_pred HHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeC Q lcl|Aclame:pro 145 ASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHP 224 (273) Q Consensus 145 ~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~ 224 (273) .++...+....... -.++++|..+..|.+..+.-.+ ... . ....|..+.++|.+|+.++.+|.+. ++++.. T Consensus 259 ~~~~~~l~~~~~~~--~~~~~~~~~~~~l~~lkd~~G~--~l~-~-~~~~~~~~~l~G~pV~~~~~~p~~~---~~~gd~ 329 (385) T protein:vir:19 259 AHAIYQVTESEFSA--SGIVLNPRDWHNIALLKDNEGR--YIF-G-GPQAFTSNIMWGLPVVPTKAQAAGT---FTVGGF 329 (385) T ss_pred HHHHHhhccccCCC--CEEEEcHHHHHHHHHhhcCCCc--eec-c-CcccCCCceecceeeEEcCcCCCCc---EEEeec Confidence 99888887666433 3689999999998654321111 111 1 1234666889999999999998654 444443 Q ss_pred -ceEEEEEec-ceeeeccCC-Ccc---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 225 -SAAAYVSQI-DTVEALRDQ-DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 225 -~a~~~~~~~-~~ve~~~~~-~~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) .++....+. ..++..+.. ..| ...+++.+++|+.+.+|+++++++-+.+ T Consensus 330 ~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~aa 384 (385) T protein:vir:19 330 DMASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRPTAIIKGTFSSG 384 (385) T ss_pred ccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEeccC Confidence 344444332 234433322 112 2578899999999999999999965555 No 89 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=99.50 E-value=1.9e-14 Score=95.76 Aligned_cols=258 Identities=15% Similarity=0.063 Sum_probs=160.3 Q ss_pred Ccc-----cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccc-cccccccCCCCccCCcccccceEE Q lcl|Aclame:pro 1 MAF-----NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVA-PTVKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~-----~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~~~~~~~~~~ 74 (273) |.. ..+.|+.+...+++.+.+.+.+..++.+- ...|.++++|+... ...+....+++.+...+++...++ T Consensus 105 ~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 180 (385) T protein:vir:18 105 LGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQG----RTSSNALEYVREEVFTNNADVVAEKALKPESDITFSKQT 180 (385) T ss_pred hccccccCCceecchhhhHHHHHhhhccchhhhccee----cccCcceEEEEEecCCcceeeeccCccccccccceeEEE Confidence 222 23456667788999999888887776442 22356789998754 344556677777776777777777 Q ss_pred EEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHh---------hcccccccccCCHhHHHHHH Q lcl|Aclame:pro 75 LLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVD---------NGTALTGSAPSDADDAFDLI 144 (273) Q Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~i 144 (273) +++.+. +.-+.|+..-.. ....+.++ .++.+.+++.++|..++.--.. .+.....+...+....++.| T Consensus 181 ~~~~k~-~~~~~is~ell~-d~~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~d~i 258 (385) T protein:vir:18 181 ANVKTI-AHWVQASRQVMD-DAPMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLNATGDTRADII 258 (385) T ss_pred EeeeeE-EEeehhhHHHHh-hHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccccccchHHHH Confidence 777654 344567764332 33456665 4568899999999887742100 01111111222344567889 Q ss_pred HHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeC Q lcl|Aclame:pro 145 ASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHP 224 (273) Q Consensus 145 ~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~ 224 (273) .++...+....... -.++++|..+..|.+..+.-.+ ... . ....|..+.++|.+|+.++.+|.+. ++++.. T Consensus 259 ~~~~~~l~~~~~~~--~~~~~~~~~~~~l~~lkd~~G~--~l~-~-~~~~~~~~~l~G~pV~~~~~~p~~~---~~~gd~ 329 (385) T protein:vir:18 259 AHAIYQVTESEFSA--SGIVLNPRDWHNIALLKDNEGR--YIF-G-GPQAFTSNIMWGLPVVPTKAQAAGT---FTVGGF 329 (385) T ss_pred HHHHHhhccccCCC--CEEEEcHHHHHHHHHhhcCCCc--eec-c-CcccCCCceecceeeEEcCcCCCCc---EEEeec Confidence 99888887666433 3689999999998654321111 111 1 1234666889999999999998654 444443 Q ss_pred -ceEEEEEec-ceeeeccCC-Ccc---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 225 -SAAAYVSQI-DTVEALRDQ-DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 225 -~a~~~~~~~-~~ve~~~~~-~~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) .++....+. ..++..+.. ..| ...+++.+++|+.+.+|+++++++-+.+ T Consensus 330 ~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~aa 384 (385) T protein:vir:18 330 DMASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRPTAIIKGTFSSG 384 (385) T ss_pred ccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEeccC Confidence 344444332 234433322 112 2578899999999999999999965555 No 90 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=99.50 E-value=1.7e-14 Score=96.08 Aligned_cols=265 Identities=9% Similarity=-0.029 Sum_probs=158.1 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEE Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 74 (273) |++ ..+.|+.+..++++.+++..++.+++.+- ..++.++++|+....+.+....+++.+...+++.+.++ T Consensus 14 ~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~i~ 89 (318) T protein:vir:24 14 IAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKV----PMGTTGQKIPHWVGDVSAQWIGEGDMKPITKGNMTSQT 89 (318) T ss_pred hhcccCcccceeechhHHHHHHHHHHhhchhhhhccee----eccCCceEEEEEeCCcceEEecCCccccccccceeEEE Confidence 443 14568889999999999998888887542 22466789999877666677788888877777777777 Q ss_pred EEEEeeeeceeEechHHHHHhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHh-hc-------ccccc-cccCCHhHHHHHH Q lcl|Aclame:pro 75 LLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVD-NG-------TALTG-SAPSDADDAFDLI 144 (273) Q Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~iD~~~~~~~~~-~~-------~~~~~-~~~~~~~~~~~~i 144 (273) ++..+. ..-+.+++.-..++..++.+ +.++.+++++.++|..++.--.. .+ ..... ..........+.+ T Consensus 90 ~~~~k~-~~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 168 (318) T protein:vir:24 90 IAPHKI-ATIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAMHGTDSPFPTYIGQTTKAISIADTTGATTVYDQVA 168 (318) T ss_pred EeeEEE-EEeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhhcccCCCCCcccccccccccccccccccchHHHHH Confidence 776543 44456777555556667755 45678999999999988742110 00 00000 1111112223344 Q ss_pred HHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhccc--ccceeeeeeeeeecceEEEEecccccCCCcEEEEE Q lcl|Aclame:pro 145 ASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSG--DAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAF 222 (273) Q Consensus 145 ~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~--~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~ 222 (273) .++...+..... ..-.++++|..+..|.+..+.-.+.-... .........-+.+.|++++.++.++.+.. .++.+ T Consensus 169 ~~~~~~~~~~~~--~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~~~~~~~-~~~~g 245 (318) T protein:vir:24 169 VNGLSLLVNDGK--KWTHTLLDDITEPILNGAKDQNGRPLFIESTYGEAASPFRSGRIVARPTILSDHVVEGTT-VGFMG 245 (318) T ss_pred HHHHHhhccccC--CCCEEEEcHHHHHHHHHhhccCCceeecCccccCccccccCceEEEEeeEEeCCCCCCcc-EEEEe Confidence 555555544432 33468999999999965432111100000 00011111225789999999999876543 23444 Q ss_pred eCceEEEEEe-cceeeeccCCC-------------cc---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 223 HPSAAAYVSQ-IDTVEALRDQD-------------SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 223 ~~~a~~~~~~-~~~ve~~~~~~-------------~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ..+.+.+..+ ...++..++.. .| ...+++.+++|+++++|++++.|+...+ T Consensus 246 dfs~~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~a 313 (318) T protein:vir:24 246 DFSQLIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAEAFVALTNVVS 313 (318) T ss_pred ecceEEEEEecCeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeecc Confidence 4443333322 22333322211 01 2678999999999999999999977555 No 91 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=99.49 E-value=2.1e-14 Score=95.58 Aligned_cols=258 Identities=10% Similarity=0.016 Sum_probs=163.8 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEE Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 74 (273) |.. ..++|+.|..++.+.+++..++.+++.+-. .. .+..+.+|+......+....+++.+...+.+.+.++ T Consensus 9 ~~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~--~~-~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~ 85 (297) T protein:vir:95 9 ENVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQE--ME-GEQEKTVYVQTDGISAYWVNETEKIKTDKPEVVPVT 85 (297) T ss_pred ccccccCCCcceechhHHHHHHHHHHhhchhhhhcceee--cC-CCccEEEEEEcCCceeEEeecCccccccccceeEEE Confidence 211 246799999999999999998888875522 11 123456777665555566778877766667777776 Q ss_pred EEEEeeeeceeEechHHHHHhHHHHHH-HHHHHHHHHHHHHHHHHHHHHH-hhcccc-----cccccCCHhHHHHHHHHH Q lcl|Aclame:pro 75 LLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLV-DNGTAL-----TGSAPSDADDAFDLIASA 147 (273) Q Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~iD~~~~~~~~-~~~~~~-----~~~~~~~~~~~~~~i~~a 147 (273) +...+. +.-+.|++.-..++..++.+ +.+++++++++++|..++.--. ..+... ...........+++|.++ T Consensus 86 l~~~k~-~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~gi~~~~~~~~~~~~~~~t~~~i~~~ 164 (297) T protein:vir:95 86 LKAHKL-GIILVTSREALNYTWKKFFEDMKPQIVEAFYKKIDEAGLLGHDTPFANSVAKAAKDANKVIGGPINYDNILKL 164 (297) T ss_pred EeeEEE-EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCcccccccccccccceecccccCHHHHHHH Confidence 666443 44567777655556667765 4567899999999998873110 000000 000111122347788888 Q ss_pred HHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCceE Q lcl|Aclame:pro 148 LKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAA 227 (273) Q Consensus 148 ~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a~ 227 (273) ...+..++.+.. .++++|+.+..|.+..+ . ....+.++..+.++|.+++.++..+...+ .++++..+.+ T Consensus 165 ~~~l~~~~~~~~--~~v~~~~~~~~L~~l~d------~--~G~~i~~~~~~~l~G~Pv~~~~~~~~~~~-~~~~gd~s~~ 233 (297) T protein:vir:95 165 QDALYDADVEPN--AFVSKIQNRSALREARD------G--NKVSIYDKAANTIDGITTVDLKSARFEKG-DLLAGDFDNL 233 (297) T ss_pred HHHhhhccCCcC--EEEEcHHHHHHHHHhhc------c--CCceeecCCCCcccceeeEeecCCCCCCc-eEEEEecccE Confidence 888887776443 57899999999864321 1 12345566678899999998876554443 3555554444 Q ss_pred EEEEec-ceeeeccCCC-------------cc---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 228 AYVSQI-DTVEALRDQD-------------SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 228 ~~~~~~-~~ve~~~~~~-------------~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) .+..+. ..++..++.. .| ...+++.+++|.++++|+++++|+.+-- T Consensus 234 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~at~ 296 (297) T protein:vir:95 234 IYGVPYNITYKISEEGQISTITNADGTPINLFEQEMIAIRATMDIAVMITKTDAFAKLTPAER 296 (297) T ss_pred EEEEecCeEEEEeeccccccccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEeecCC Confidence 333222 2333322211 11 3578889999999999999999987766 No 92 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=99.49 E-value=2.5e-14 Score=95.15 Aligned_cols=266 Identities=9% Similarity=-0.010 Sum_probs=156.7 Q ss_pred Cc-c-cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCC--------ccCCccccc Q lcl|Aclame:pro 1 MA-F-NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGR--------QTSADAISD 70 (273) Q Consensus 1 MA-~-~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~--------~~~~~~~~~ 70 (273) |. . ..+.|+.+..++.+.+++.+++..++.+- ...+.++++|+......+.+..++. .+....++. T Consensus 20 ~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~----~~~~~~~~~p~~~~~~~a~~v~eg~~~~~~e~~~~~~~~~~f 95 (333) T protein:vir:78 20 LAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQI----PISYGETIIPTTVKRPEVGQVGVGTSNEQREGGLKPLSGTAW 95 (333) T ss_pred eecCCccccchhHHHHHHHHHHhhchhhhhccee----eccCCceEEEEEeCCceeEeecCcccccccccccccccccce Confidence 11 1 12679999999999999999888877542 2335678899987765555544432 233333444 Q ss_pred ceEEEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHh-hcc--------------ccccccc Q lcl|Aclame:pro 71 TGVDLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVD-NGT--------------ALTGSAP 134 (273) Q Consensus 71 ~~~~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~-~~~--------------~~~~~~~ 134 (273) ..+++...+. +.-+.|++.-..++..++.++ .+++++++++++|..++.--.. .+. ....... T Consensus 96 ~~i~l~~~kl-~~~~~is~ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~~~~~g~~~~~~~~~~~~~~~~~ 174 (333) T protein:vir:78 96 DTRSVSPIKL-ATIVTVSEEFARMNPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTGSALQGIDTDNVIANTTNVDYLQ 174 (333) T ss_pred eEEEEeeEEE-EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCCcccccccccccccccccccccc Confidence 4444444322 334566665444556677664 5678999999999988741110 000 0001112 Q ss_pred CCHhHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhh-cccccceeeeeeeeeecceEEEEeccccc Q lcl|Aclame:pro 135 SDADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSAD-TSGDAAGLRAGTIGNLLGARIVESNNLRD 213 (273) Q Consensus 135 ~~~~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~-~~~~~~~~~~G~ig~i~G~~i~~s~~l~~ 213 (273) ......+++|.++...+..+.. .....++++|..+..|++.. ...+.+ ..-.......|.-++++|++|+.++.+|. T Consensus 175 ~~~~~~~~~i~~~~~~~~~~~~-~~~~~~vmn~~~~~~L~~~~-~~~d~~G~~i~~~~~~~~~~~~l~G~Pv~~~~~i~~ 252 (333) T protein:vir:78 175 ETGDPLLDRLLDGYDLVSANTD-VEFNGWAVDPRFRAHLLRAQ-AYRDANGNVDPSRINLAAQTGDVLGLPAQFGRAVGG 252 (333) T ss_pred cccchhHHHHHHHHHhhccccc-cCceEEEEcchHHHHHHHHh-hhcCCCCceeecCccccCCCceeeceeeEEccccCC Confidence 2334457788888777654431 22336888999999887543 122211 01112233446678999999999999885 Q ss_pred CC------CcEEEEEeCceEEEEEe-cceeeeccCC-----C-----cc---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 214 TD------DEQFVAFHPSAAAYVSQ-IDTVEALRDQ-----D-----SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 214 ~~------~~~~~~~~~~a~~~~~~-~~~ve~~~~~-----~-----~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) .. ...++++..+-+.+..+ ...++..+.. + .| -..+++.+++|+++++|+++++|+.+-. T Consensus 253 ~~~~~~~~~~~~~~gD~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~a 332 (333) T protein:vir:78 253 DLGAAVDSKTRIIGGDFSQLKFGFADEIRIKMSDTATLTDSGSATVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVDDEQ 332 (333) T ss_pred CccccCCCccEEEEEecccEEEEEeeccEEEEeccccccccccceeehhhcCcEEEEEEEEEccEEecccceEEEeccCC Confidence 42 12345554444333322 2233322221 1 11 1467889999999999999999987766 No 93 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=99.49 E-value=3.3e-14 Score=94.48 Aligned_cols=260 Identities=13% Similarity=0.036 Sum_probs=157.4 Q ss_pred CcccchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEecccc--------ccccccCCCCccCCcccccce Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAP--------TVKDYKAAGRQTSADAISDTG 72 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~--------~~~d~~~~~~~~~~~~~~~~~ 72 (273) -....+.|+.+...+.......+.+..++..- ...+.++++|+.... ..+....++...+..+++.+. T Consensus 130 ~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~ 205 (419) T protein:vir:94 130 NPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQ----NADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDT 205 (419) T ss_pred CCcccccchhhhHHHHHHHhhhhhhhhcceee----eccCCceeeeeeccccccccccCcccceecCCccccccccceee Confidence 11224568888888887777666666665431 223556777764322 223345667666666677777 Q ss_pred EEEEEEeeeeceeEechHHHHHhHHHHHHHH-HHHHHHHHHHHHHHHHHH-HHhhcc------------cccccccCCHh Q lcl|Aclame:pro 73 VDLLIDQEKSIDFLVDDIDRVQVAGSLEAYT-RAGATALATDTDKFIADM-LVDNGT------------ALTGSAPSDAD 138 (273) Q Consensus 73 ~~~tid~~~~~~~~i~d~d~~~~~~~~~~~~-~~~~~ala~~iD~~~~~~-~~~~~~------------~~~~~~~~~~~ 138 (273) +++++.+. +.-+.|+..-... ..++.+++ ++.+.+++.++|..++.= ....+. ........+.. T Consensus 206 i~~~~~k~-~~~~~is~ell~d-~~~l~~~i~~~la~a~~~~~d~aii~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~t~~ 283 (419) T protein:vir:94 206 ITTTLKTV-AHWLPITRQAADD-NSQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDE 283 (419) T ss_pred EEeeeeeE-EEeehhhHHHHHh-HHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceecccccccccccccccccccc Confidence 77777554 3345667654333 34576755 458999999999988731 000000 01112233445 Q ss_pred HHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcE Q lcl|Aclame:pro 139 DAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQ 218 (273) Q Consensus 139 ~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~ 218 (273) ..++.|.++...+.....+.. .++++|..+..|.+..+.-.+. .. .......|..+.++|++|+.++.+|.+. T Consensus 284 ~~~~~l~~~~~~~~~~~~~~~--~~v~n~~~~~~l~~~k~~~~~~-~~-~~~~~~~~~~~~l~G~pV~~~~~~~~~~--- 356 (419) T protein:vir:94 284 PPLVDIRRAKTVAEIAGFPPD--GVVVHPQDWESIELDQAPGSGV-FR-VIANVQGEATPRIWGLNVVSTVAIAQGT--- 356 (419) T ss_pred hhHHHHHHHHHhhhhccCCCC--EEEEcHHHHHHHHHHhhcCCCc-ee-ecCCcccCCCccccceeeEEcCCCCCcc--- Confidence 568889999988887766433 6899999999986543211110 11 1112335666789999999999998654 Q ss_pred EEEEe-CceEEEE-EecceeeeccCCC-cc---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 219 FVAFH-PSAAAYV-SQIDTVEALRDQD-SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 219 ~~~~~-~~a~~~~-~~~~~ve~~~~~~-~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ++.+. ..++... .+...++..+... .| ...+++.+++|+.+++|+++++++-+++ T Consensus 357 ~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~~aa 417 (419) T protein:vir:94 357 ALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAA 417 (419) T ss_pred EEEeeccceEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEeccccEEEEEeccC Confidence 33332 3333333 2333444433322 12 3678999999999999999998866666 No 94 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=99.48 E-value=3.5e-14 Score=94.34 Aligned_cols=262 Identities=13% Similarity=0.025 Sum_probs=158.5 Q ss_pred Cc-------ccchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEec--cccccccccCCCCccCC-ccccc Q lcl|Aclame:pro 1 MA-------FNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGV--VAPTVKDYKAAGRQTSA-DAISD 70 (273) Q Consensus 1 MA-------~~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~--~~~~~~d~~~~~~~~~~-~~~~~ 70 (273) ++ ...++|+.|...+++.+++..++..++..-.. .+.+.++|.+ ..........++..... +..+. T Consensus 120 ~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~ 195 (415) T protein:vir:46 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV----TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPF 195 (415) T ss_pred hhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeec----cCCceeEEEEEecCCcceeecccccccccccccce Confidence 11 12578999999999999999888887753211 1223344433 33333344556655442 33455 Q ss_pred ceEEEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhccc--------ccccccCCHhHHH Q lcl|Aclame:pro 71 TGVDLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVDNGTA--------LTGSAPSDADDAF 141 (273) Q Consensus 71 ~~~~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~~~~~--------~~~~~~~~~~~~~ 141 (273) +.+++...+. +.-+.|++.-..++..++.++ .+..+++++.++|..++.-....... .......+....+ T Consensus 196 ~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~ 274 (415) T protein:vir:46 196 FQLAYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSL 274 (415) T ss_pred eeEEeeeeee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccccccceeccccccch Confidence 6666666443 344567766555556677665 55689999999999888654221110 1111222333457 Q ss_pred HHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCC--cEE Q lcl|Aclame:pro 142 DLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD--EQF 219 (273) Q Consensus 142 ~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~--~~~ 219 (273) ++|.++...+....... -.++++|+.+..|.+..+.-.+ ... ...+.+|.-+.+.|++|+.++.+|.+++ ..+ T Consensus 275 ~~i~~~~~~~~~~~~~~--~~~v~n~~~~~~L~~lkd~~G~--~i~-~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~ 349 (415) T protein:vir:46 275 DDIKDAINLNVKPNYEH--NVAIVSQTMFAKLDKMKDKLGN--YLI-QPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTL 349 (415) T ss_pred HHHHHHHHhhhhhccCC--CEEEEcHHHHHHHHHhhccCCC--eee-ccCcCCCCCccccceeeEEeccccccCCCccEE Confidence 78888888777666532 3688999999998643221111 111 1123456678899999999998886543 234 Q ss_pred EEEeCc-eEEEEEe-cceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 220 VAFHPS-AAAYVSQ-IDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 220 ~~~~~~-a~~~~~~-~~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) +.+.-+ ++....+ ...++... .....+.+++-+++|+++++|++++.++-+.+ T Consensus 350 ~~gd~~~~~~~~~~~~~~v~~~~-~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:46 350 IIGNLKDAIVLFDRSQYQASWTD-YMHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred EEEehhccEEEEeecceEEEeec-cccCceEEEEEEEeccEEeccccEEEEEeecc Confidence 555433 3434433 23444333 23345678899999999999999998865544 No 95 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=99.48 E-value=3.5e-14 Score=94.34 Aligned_cols=262 Identities=13% Similarity=0.025 Sum_probs=158.5 Q ss_pred Cc-------ccchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEec--cccccccccCCCCccCC-ccccc Q lcl|Aclame:pro 1 MA-------FNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGV--VAPTVKDYKAAGRQTSA-DAISD 70 (273) Q Consensus 1 MA-------~~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~--~~~~~~d~~~~~~~~~~-~~~~~ 70 (273) ++ ...++|+.|...+++.+++..++..++..-.. .+.+.++|.+ ..........++..... +..+. T Consensus 120 ~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~ 195 (415) T protein:vir:47 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV----TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPF 195 (415) T ss_pred hhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeec----cCCceeEEEEEecCCcceeecccccccccccccce Confidence 11 12578999999999999999888887753211 1223344433 33333344556655442 33455 Q ss_pred ceEEEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhccc--------ccccccCCHhHHH Q lcl|Aclame:pro 71 TGVDLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVDNGTA--------LTGSAPSDADDAF 141 (273) Q Consensus 71 ~~~~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~~~~~--------~~~~~~~~~~~~~ 141 (273) +.+++...+. +.-+.|++.-..++..++.++ .+..+++++.++|..++.-....... .......+....+ T Consensus 196 ~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~ 274 (415) T protein:vir:47 196 FQLAYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSL 274 (415) T ss_pred eeEEeeeeee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccccccceeccccccch Confidence 6666666443 344567766555556677665 55689999999999888654221110 1111222333457 Q ss_pred HHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCC--cEE Q lcl|Aclame:pro 142 DLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD--EQF 219 (273) Q Consensus 142 ~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~--~~~ 219 (273) ++|.++...+....... -.++++|+.+..|.+..+.-.+ ... ...+.+|.-+.+.|++|+.++.+|.+++ ..+ T Consensus 275 ~~i~~~~~~~~~~~~~~--~~~v~n~~~~~~L~~lkd~~G~--~i~-~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~ 349 (415) T protein:vir:47 275 DDIKDAINLNVKPNYEH--NVAIVSQTMFAKLDKMKDKLGN--YLI-QPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTL 349 (415) T ss_pred HHHHHHHHhhhhhccCC--CEEEEcHHHHHHHHHhhccCCC--eee-ccCcCCCCCccccceeeEEeccccccCCCccEE Confidence 78888888777666532 3688999999998643221111 111 1123456678899999999998886543 234 Q ss_pred EEEeCc-eEEEEEe-cceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 220 VAFHPS-AAAYVSQ-IDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 220 ~~~~~~-a~~~~~~-~~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) +.+.-+ ++....+ ...++... .....+.+++-+++|+++++|++++.++-+.+ T Consensus 350 ~~gd~~~~~~~~~~~~~~v~~~~-~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:47 350 IIGNLKDAIVLFDRSQYQASWTD-YMHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred EEEehhccEEEEeecceEEEeec-cccCceEEEEEEEeccEEeccccEEEEEeecc Confidence 555433 3434433 23444333 23345678899999999999999998865544 No 96 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=99.47 E-value=3.3e-14 Score=94.46 Aligned_cols=265 Identities=9% Similarity=-0.029 Sum_probs=150.3 Q ss_pred Ccc---cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEEEEE Q lcl|Aclame:pro 1 MAF---NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLI 77 (273) Q Consensus 1 MA~---~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ti 77 (273) .+. .-+.|+-+..++.+.+++...+..++.+- ...+.+.++|+....+.+....+++.+...+++.+.+++.. T Consensus 22 ~~~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~----~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~ 97 (326) T protein:vir:42 22 TGDSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKI----PMGTTGQKIPHWTGDVSASWIGEGDMKPITKGNMTSQTIAP 97 (326) T ss_pred ccccCCcceechhhHHHHHHHHHhcchhhhhccee----eccCCceEEEEEeCCcceEEecCCccccccccceeEEEEee Confidence 111 12468888899999999988887776542 22356789999877666667788887777777777777777 Q ss_pred EeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHH---------hh---cccccccccCCHhHHHHH- Q lcl|Aclame:pro 78 DQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLV---------DN---GTALTGSAPSDADDAFDL- 143 (273) Q Consensus 78 d~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~---------~~---~~~~~~~~~~~~~~~~~~- 143 (273) .+. +.-+.|++.-..++..++.++ .++.+++++.++|+.++.=-. .. .......+..+......+ T Consensus 98 ~k~-~~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~~p~gi~~~~~~~~~~~~~~~~~~~~~~~~~~ 176 (326) T protein:vir:42 98 HKI-ATIFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAINGTDSPFPTFLAQTTKEVSLVDPDGTGSNADLTVYDA 176 (326) T ss_pred EEE-EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccccccccceeecccccccccchhHHH Confidence 543 455677776555666777664 566789999999998873100 00 000011111111112222 Q ss_pred -HHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcc--cccceeeeeeeeeecceEEEEecccccCCCcEEE Q lcl|Aclame:pro 144 -IASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTS--GDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFV 220 (273) Q Consensus 144 -i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~--~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~ 220 (273) +..+...+.... ..+-..+++|..+..|.+..+.-.+.-.. ...........+.+.|++++.++.+|.+... ++ T Consensus 177 ~~~~~~~~~~~~~--~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~-~~ 253 (326) T protein:vir:42 177 VAVNALSLLVNAG--KKWTHTLLDDITEPILNGAKDKSGRPLFIESTYTEENSPFRLGRIVARPTILSDHVASGTVV-GY 253 (326) T ss_pred HHHHHHhhhhhhc--cCccEEEEeHHHHHHHHHhhccCCceeeccccccCccccccCceeeeeeEEEcCCCCCCceE-EE Confidence 222222222222 23346789999999997543211110000 0001111223457999999999999875432 22 Q ss_pred EEeCceEEEEEe-cceeeeccC--------CC-----cc---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 221 AFHPSAAAYVSQ-IDTVEALRD--------QD-----SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 221 ~~~~~a~~~~~~-~~~ve~~~~--------~~-----~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) .+.-+.+-+..+ ...++..++ +. .| ...+++.+++|+++.+|++++.|+.... T Consensus 254 ~Gd~s~~~~~~~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~~d~~v~~~~a~~~l~~~~~ 323 (326) T protein:vir:42 254 QGDFRQLVWGQVGGLSFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYAFHCNDKDAFVKLTNVDA 323 (326) T ss_pred EeecceEEEEEecceEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEecccceEEEeeccc Confidence 222221112111 112221111 11 12 2678999999999999999998866555 No 97 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=99.47 E-value=3.6e-14 Score=94.28 Aligned_cols=264 Identities=12% Similarity=-0.007 Sum_probs=158.0 Q ss_pred Ccc-------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceE Q lcl|Aclame:pro 1 MAF-------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~-------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~ 73 (273) |+. ..+.|++ ...+++.+++...+.+++.+ ...++.+++||+....+.+....+++.+...+++.+.+ T Consensus 10 ~~~~~t~~~~g~l~~~~-~~~ii~~l~~~s~i~~l~~~----~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~f~~v 84 (397) T protein:vir:23 10 IAQTKDTMFTGYLDPVQ-AKDYFAEAEKTSIVQRVAQK----IPMGATGIVIPHWTGDVSAQWIGEGDMKPITKGNMTKR 84 (397) T ss_pred HhhccCCCCccccchhH-HHHHHHHHHhccchhhhcce----eeccCCceEEEEEcCCcceEEecCCccccccccceeEE Confidence 333 2455665 56788888888888887643 22235678999987766677778888887777787877 Q ss_pred EEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhh------cccccccccCCHhHHHHHHHH Q lcl|Aclame:pro 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVDN------GTALTGSAPSDADDAFDLIAS 146 (273) Q Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~~------~~~~~~~~~~~~~~~~~~i~~ 146 (273) ++++.+. ..-+.|+++-..++..++.++ .++.+++++.++|+.++.--... ...............++.+.+ T Consensus 85 ~l~~~k~-~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 163 (397) T protein:vir:23 85 DVHPAKI-ATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAALHGTNAPSAFQGYLDQSNKTQSISPNAYQGLGVS 163 (397) T ss_pred EEeeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccCCcccccccccccceeeecccchhHHHHH Confidence 7777543 445677776566666777665 56689999999999887421100 000011111222334556666 Q ss_pred HHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcc--cccceeeeeeeeeecceEEEEecccccCCCcEEEEEeC Q lcl|Aclame:pro 147 ALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTS--GDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHP 224 (273) Q Consensus 147 a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~--~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~ 224 (273) +...|..+..+ +-.++++|..+..|.+..+.-.+.-.. ........+..+++.|++++.++.+|.+.- .++.+.. T Consensus 164 ~~~~l~~~~~~--~a~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~tl~G~Pv~~s~~~~~g~~-~~~~gDf 240 (397) T protein:vir:23 164 GLTKLVTDGKK--WTHTLLDDTVEPVLNGSVDANGRPLFVESTYESLTTPFREGRILGRPTILSDHVAEGDV-VGYAGDF 240 (397) T ss_pred HHHhhhhcccC--CCEEEEcHHHHHHHHHhhccCCceeecccccccccccccCceeeeeeEEEeCCCCCCce-EEEEeec Confidence 66667666543 236899999999997543211110000 001111123346899999999999986542 2233322 Q ss_pred ceEEEEE-ecceeeeccCCC-------------cc---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 225 SAAAYVS-QIDTVEALRDQD-------------SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 225 ~a~~~~~-~~~~ve~~~~~~-------------~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) +-+-+.. +...++..++.. .| ...+++.+++|+++++|++++.++.+.. T Consensus 241 s~~~i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~~ 306 (397) T protein:vir:23 241 SQIIWGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDPV 306 (397) T ss_pred ceEEEEEEeceEEEEeeeeeeeeccccccceeeeeeccceeEEEEeeeccceecccceEEEeeccc Confidence 2221221 212233222211 12 2577888999999999999999876554 No 98 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=99.47 E-value=1.8e-14 Score=96.00 Aligned_cols=261 Identities=13% Similarity=0.050 Sum_probs=159.7 Q ss_pred Ccc-cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEEEEEEe Q lcl|Aclame:pro 1 MAF-NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQ 79 (273) Q Consensus 1 MA~-~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid~ 79 (273) -++ ..+.|+++...+.+.++...++..++++-. ...|..+.+|+....+.+.+..+++.+..++++...++++..+ T Consensus 116 ~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~---~~~~~~~~~p~~~~~~~a~wv~E~~~~~~~~~~f~~i~~~~~k 192 (390) T protein:vir:62 116 AGNPNVLSRTLYGQLIAQAVERSAIMRGGATTFT---TSDANPLDFTVITGRSSASIVGETAEIPESYPATAQRSMGGFK 192 (390) T ss_pred cCCCccccccchHHHHHHHHhhhhhhhhcceeee---cCCCceeEEEEEcCCcceeeecccccccccccceeeeEeeeee Confidence 112 355678887777777777766666664421 2235678999887766667788888877777777777777755 Q ss_pred eeeceeEechHHHHHhHHHHHHHH-HHHHHHHHHHHHHHHHH-------HHHhhccccc-ccccCCHhHHHHHHHHHHHH Q lcl|Aclame:pro 80 EKSIDFLVDDIDRVQVAGSLEAYT-RAGATALATDTDKFIAD-------MLVDNGTALT-GSAPSDADDAFDLIASALKE 150 (273) Q Consensus 80 ~~~~~~~i~d~d~~~~~~~~~~~~-~~~~~ala~~iD~~~~~-------~~~~~~~~~~-~~~~~~~~~~~~~i~~a~~~ 150 (273) . +.-+.|++.-..++..++.+++ ++.+++++.++|..++. .+...+.... ..........+++|.++... T Consensus 193 ~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~p~Gi~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 271 (390) T protein:vir:62 193 Y-GFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFITGTGQPRGILTDASPATATFLATDTDSKVSDALIDLFHE 271 (390) T ss_pred E-EeehHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhhhccCCccccccccccccccceecccccccchHHHHHHHHh Confidence 4 3445677665656666777654 56789999999998774 1111100000 01111122346778887777 Q ss_pred HhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCceEEEE Q lcl|Aclame:pro 151 LTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYV 230 (273) Q Consensus 151 l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a~~~~ 230 (273) |+.... .+-..+++|..+..|.+..+. +..+. ....+..|.-+.+.|++|+.++.+|... ++.+.-+.+... T Consensus 272 l~~~~~--~~a~~vmn~~~~~~L~~lkd~--~g~~l-~~~~~~~g~~~~l~G~Pv~~~~~~p~~~---i~~gd~s~~~i~ 343 (390) T protein:vir:62 272 VPSAYR--ANAKYVVNDLRAAQMRKLKDA--NGQYL-WQSGLTVGAPSLFNGKVVETDDGMPADK---ILFADLSKYRVR 343 (390) T ss_pred hhhhhh--cCCEEEEchHHHHHHHHhhcc--CCCee-ecCCcCCCccceecccceEEecCCCCcc---EEEeeccceeEE Confidence 765432 233678999999988543211 11111 1122334666789999999999998643 333443333222 Q ss_pred Eec-ceeeeccCCCcc--eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 231 SQI-DTVEALRDQDSF--SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 231 ~~~-~~ve~~~~~~~~--~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) .+. ..++...+.... .+.+++.+++|+++++|+++++|+-+.+ T Consensus 344 ~~~~~~v~~~~~~~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~~~~ 389 (390) T protein:vir:62 344 FAGSLRVDRSVDAKFSTDQIVYRFLQRADGLLVDARGAKVLTVTPG 389 (390) T ss_pred eecceEEEeeccccccCCcEEEEEEEEeCcEeechhheEEEEeecC Confidence 222 233333332221 3578999999999999999999877766 No 99 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=99.46 E-value=5.2e-14 Score=93.43 Aligned_cols=256 Identities=13% Similarity=0.027 Sum_probs=161.8 Q ss_pred Cc-c-----cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEecccc-ccccccCCCCccCCcccccceE Q lcl|Aclame:pro 1 MA-F-----NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAP-TVKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA-~-----~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~~~~~~~~~ 73 (273) |. . -.+.|+.+...+.+.+.....+.+++..- ...+.++++|..... ..+....+++.....+++.+.+ T Consensus 113 ~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~----~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i 188 (390) T protein:vir:97 113 ASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSG----RTDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKK 188 (390) T ss_pred hhcccccccccccchhhhHHHHHHHhhhhhhHhhccee----eccCCceEEEEEecCCcceeeecCCccccccccceeEE Confidence 11 1 13567777788999898888877776432 223567888887543 3456677888777777788888 Q ss_pred EEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhh---------cccccccccCCHhHHHHH Q lcl|Aclame:pro 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVDN---------GTALTGSAPSDADDAFDL 143 (273) Q Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~ 143 (273) ++++.+. +.-+.|++.-..+. .++.++ .++.+.+++.++|..++.--... +.........+....++. T Consensus 189 ~~~~~k~-~~~~~is~ell~ds-~~l~~~i~~~la~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~~~~~~d~ 266 (390) T protein:vir:97 189 TDTTHVI-AHTMKATRQILSDA-PQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQ 266 (390) T ss_pred EEeeeeE-EEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHhhcCCCCccccceeeccccccccccccccchHHH Confidence 8887654 34456776533333 456664 55689999999999877421000 001111122234456778 Q ss_pred HHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEe Q lcl|Aclame:pro 144 IASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFH 223 (273) Q Consensus 144 i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~ 223 (273) +.++...+.....+.. .++++|..+..|.+..+.-.+ ..- .. ...|..++++|.+|++++.+|.+. ++.+. T Consensus 267 ~~~~~~~~~~~~~~~~--~~v~n~~~~~~L~~lkd~~G~--~l~-~~-~~~~~~~~l~G~pV~~~~~~~~~~---~~~gd 337 (390) T protein:vir:97 267 LRLAMLQASLAEYPAS--GIVINPIDWAAIELAKDANNQ--YLI-GN-ARGTLTPTLWGLPVVATQAMAPGE---FLVGA 337 (390) T ss_pred HHHHHHhhccccCCCC--EEEEcHHHHHHHHHhhcCCCc--eee-cC-ccCCCCceecceeeEEcCCCCCCc---EEEEe Confidence 8888888887776544 578999999999653221111 110 11 123445789999999999998653 44443 Q ss_pred Cc-eEEEEE-ecceeeeccCCCcc-e--eeEEeeeeeeeEEEcCceEEEEecC Q lcl|Aclame:pro 224 PS-AAAYVS-QIDTVEALRDQDSF-S--DRIRALHVYGGKVVRPTGVVVFNKT 271 (273) Q Consensus 224 ~~-a~~~~~-~~~~ve~~~~~~~~-~--~~v~~~~~~g~~vl~p~~~v~~~~~ 271 (273) .+ ++-... ....++..+....| . ..+++.++||..+++|++++++.=+ T Consensus 338 ~~~~~~~~~~~~~~i~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 338 FDLAAQIFDQWDARVEIGYVNDDFQRNMVTVLAEERLALVVYRPEALITGSFA 390 (390) T ss_pred ccceEEEEEecceEEEEeecccccccCcEEEEEEEeeccEEeccccEEEEEeC Confidence 33 343333 33356655543322 2 4688889999999999999999888 No 100 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=99.46 E-value=5.5e-14 Score=93.29 Aligned_cols=258 Identities=15% Similarity=0.042 Sum_probs=158.2 Q ss_pred Cc-----ccchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccc-cccccccCCCCccCCcccccceEE Q lcl|Aclame:pro 1 MA-----FNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVA-PTVKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA-----~~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~~~~~~~~~~ 74 (273) +. -..++|+.|...+.+.++....+.+++..- ...|.++++|+... ...+....+++.....+++.+.++ T Consensus 136 ~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~f~~v~ 211 (418) T protein:vir:10 136 VGSGVSGSNSLVVADRQAGIIAPPQRKMTIRDLLMPG----QTSSSSIEYTVETGFTNNAAAVAEGAQKPTSDLKFNLKN 211 (418) T ss_pred ccCCCCCCccccchhHHHHHHHHHhhhhhHHhhccee----eccCCceeEEEEecCCCceeeeccCccccccccceeeEE Confidence 11 124789999999999999998888877532 22356788888655 344456677777766677777777 Q ss_pred EEEEeeeeceeEechHHHHHhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhh--c-------ccccccccCCHhHHHHHH Q lcl|Aclame:pro 75 LLIDQEKSIDFLVDDIDRVQVAGSLEAYT-RAGATALATDTDKFIADMLVDN--G-------TALTGSAPSDADDAFDLI 144 (273) Q Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~~~~~~-~~~~~ala~~iD~~~~~~~~~~--~-------~~~~~~~~~~~~~~~~~i 144 (273) +...+. +.-+.|++.-.. ...++.+++ ++.+.+++.++|..++.--... + .....+...+....+++| T Consensus 212 ~~~~k~-~~~~~is~ell~-ds~~l~~~i~~~l~~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~i 289 (418) T protein:vir:10 212 QPVRTI-AHLFKASRQILD-DAPALQSYIDGRARYGLQLTEEGQILKGDGTGANILGILPQASAFMPSITLANATPIDKI 289 (418) T ss_pred EeeeeE-EEeehhhHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccccccHHHH Confidence 776553 334566665333 334676654 5578999999999887421000 0 011111222233456778 Q ss_pred HHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeC Q lcl|Aclame:pro 145 ASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHP 224 (273) Q Consensus 145 ~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~ 224 (273) .++...+...+.+.. .++++|..+..|.+..+.-.+ .. ..+ ..+|..+.++|++|+.++.+|.+. ++.+.. T Consensus 290 ~~~~~~~~~~~~~~~--~~v~n~~~~~~L~~lkd~~G~--~i-~~~-~~~~~~~~l~G~pV~~~~~~p~~~---~~~gd~ 360 (418) T protein:vir:10 290 RLALLQAVLAEFPAT--GIVLNPIDWASIELTKDSQGR--YI-VGN-PVNGTTPRLWNLPVVETQAMTANE---FLVGAF 360 (418) T ss_pred HHHHHhhccccCCCC--EEEEcHHHHHHHHHhhcCCCc--ee-ccc-cccCCCceecceeeEEcCCCCCCc---EEEeec Confidence 887777766554333 588999999988653221111 11 111 224666789999999999998654 444443 Q ss_pred c-eEEEEE-ecceeeeccCCC-cc---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 225 S-AAAYVS-QIDTVEALRDQD-SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 225 ~-a~~~~~-~~~~ve~~~~~~-~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) + ++-... ....++..+... .| ...+++.+++|+.+++|++++.++-+.. T Consensus 361 s~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~~~d~~~~~~~a~~~~~~~~~ 415 (418) T protein:vir:10 361 SMAAQIFDRMEIEVLLSTENVDDFEKNMVSIRAEERLALAVYRPESFVTGALVEQ 415 (418) T ss_pred cceEEEEEecceEEEEecccchhhhcCceEEEEEEeeccEEecccceEEEEeccC Confidence 3 332332 222344333222 12 3578888999999999999987755544 No 101 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=99.44 E-value=1.6e-14 Score=96.25 Aligned_cols=261 Identities=12% Similarity=0.041 Sum_probs=164.2 Q ss_pred CcccchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEecccc---ccccccCCCCccCCcccccceEEEEE Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAP---TVKDYKAAGRQTSADAISDTGVDLLI 77 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~---~~~d~~~~~~~~~~~~~~~~~~~~ti 77 (273) |++ |+++..++.+.+.+..+...++. ... -..+-.+.+.+.... +..+...+++++.......+...+-. T Consensus 22 l~~----P~~I~~~i~e~~~~~~iad~lf~-~~~--a~~~~~v~f~~~~p~~~~~d~e~VaEggEiP~~~~~~G~~~ia~ 94 (318) T protein:vir:10 22 VGN----PLWIPTALKKMMVNQFISESLFR-NGG--ANPNGVVAYNEGNPSFLEDDVADVAEFGEIPVSAGARGLPRTAF 94 (318) T ss_pred hCC----chhHHHHHHHHHhccchhhhhhh-ccc--ccccceeEEEecccccccCcHhhccCcccccccCCCCCchhhhh Confidence 333 77777777776655555444443 322 123457777553221 33344577777766666776666644 Q ss_pred EeeeeceeEechHHHHHhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-ccc-CCHh----HH---HHHHHHH Q lcl|Aclame:pro 78 DQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADMLVDNGTALTG-SAP-SDAD----DA---FDLIASA 147 (273) Q Consensus 78 d~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~~-~~~-~~~~----~~---~~~i~~a 147 (273) -+..+..+.|+++-......+ ++..+++++.+++++.|+.++..+..+.+.... +++ .... .+ .+.+..+ T Consensus 95 ~~K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~~~~s~~w~~~~~~~~d~~~A~e~v~~a 174 (318) T protein:vir:10 95 AVKKALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPTLAVPTAWDNGGKVRTDIAIAIEQISTA 174 (318) T ss_pred hehhccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCCcCCCCcccccccchhhhhhhhhh Confidence 344578899998776655555 688899999999999999999988654432211 111 1111 11 1112211 Q ss_pred HHHHhh-------cCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccc-----eeeeeee-eeecceEEEEecccccC Q lcl|Aclame:pro 148 LKELTK-------ANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAA-----GLRAGTI-GNLLGARIVESNNLRDT 214 (273) Q Consensus 148 ~~~l~~-------~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~-----~~~~G~i-g~i~G~~i~~s~~l~~~ 214 (273) ..-+.. .+..-..-.+|++|..+..|++++++ ..... ++.+ .--.|-+ |+++|++|+.|+++|.+ T Consensus 175 ~~~~~~a~~~~~~~~~GY~pdtIVlhP~~~~~l~~n~~~-~~~y~-~~a~~~~~~~~~tg~~~g~~lGl~vi~s~~~p~~ 252 (318) T protein:vir:10 175 APTAYPAGVGSSDEYFGFIPDTIVMHYALLPILMDNENF-MKVYE-RNANYVSTAPDWTGNFPGSVMGLNVIRSRTFPID 252 (318) T ss_pred hhhhhhhhhhhhhhccCccceeeEECHHHHHHHhcchhh-hhhhh-ccchhhhhcccccccccceeeceEEeecCccCCC Confidence 111111 11111123799999999999988653 22221 1111 1113555 67899999999999975 Q ss_pred CCcEEEEEeCceEEEEEe--cceeeeccCC-------CcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 215 DDEQFVAFHPSAAAYVSQ--IDTVEALRDQ-------DSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 215 ~~~~~~~~~~~a~~~~~~--~~~ve~~~~~-------~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) . ++++.++.+|+..- -.+++.+|++ ...+|.+++......+|.+|.+++.|+-=+| T Consensus 253 ~---alvlq~g~vG~~~d~~pl~~t~~~~egg~~~g~~~~s~~~~~~~~~~~~V~~PkA~~~itgi~~ 317 (318) T protein:vir:10 253 R---VLIMERGTVGFYSDTRPLQFTALYPEGNGPNGGPTESYRADASHKRALAVDQPKAALWLTGIVT 317 (318) T ss_pred e---eEEEecCCcceeeccccceeeecccCCCCCCCCcchhhheehheeeeeeeeCcceeEEEeeccC Confidence 5 67888998887642 2356778866 4457899999999999999999999999999 No 102 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=99.44 E-value=8.4e-14 Score=92.26 Aligned_cols=265 Identities=12% Similarity=0.049 Sum_probs=155.7 Q ss_pred Ccc---cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCc------ccccc Q lcl|Aclame:pro 1 MAF---NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSAD------AISDT 71 (273) Q Consensus 1 MA~---~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~------~~~~~ 71 (273) +.+ ..+.|+.+...+++.++...++..++.+- ...|....+|+....+.+....++.....+ ..+.+ T Consensus 165 ~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~a~~v~e~~~~~~~~~~~~~~~~~~ 240 (458) T protein:vir:10 165 SSVEVSSESYETIFSQRIIRDLQKELVVGALFEEL----PMSSKILTMLVEPDAGKATWVAASTYGTDTTTGEEVKGALK 240 (458) T ss_pred ccCccccceehhhHhHHHHHHHHhhhhHHhhccee----ecCCcceEEEEecCCcceeecccccccccccccccccccce Confidence 121 24679999999999999988887776542 234566777776655555555555433221 23344 Q ss_pred eEEEEEEeeeece-eEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHH-HHhhccc------------c-cccccC Q lcl|Aclame:pro 72 GVDLLIDQEKSID-FLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADM-LVDNGTA------------L-TGSAPS 135 (273) Q Consensus 72 ~~~~tid~~~~~~-~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~-~~~~~~~------------~-~~~~~~ 135 (273) .+ ++.-++... +.|++.-..++..++.++ .++++++++.++|..++.= ....+.+ . ..+... T Consensus 241 ~i--~~~~~k~~~~v~is~ell~ds~~~~~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~ 318 (458) T protein:vir:10 241 EI--HFSTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMTGDGSGKPKGLLTLASEDSAKVVTEAKADG 318 (458) T ss_pred ee--EeeeeeEEeeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceeeecccccccceeecccccc Confidence 44 444444333 566666555555677664 5568999999999988741 0000000 0 001111 Q ss_pred CHhHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhc-ccccceeeeeeeeeecceEEEEecccccC Q lcl|Aclame:pro 136 DADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADT-SGDAAGLRAGTIGNLLGARIVESNNLRDT 214 (273) Q Consensus 136 ~~~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~-~~~~~~~~~G~ig~i~G~~i~~s~~l~~~ 214 (273) .....+++|.++...+...... +-.++++|..+..|.+..+.-.+.-. .........|..+.+.|.+|+.++.+|.. T Consensus 319 ~~~~~~~~i~~~~~~l~~~~~~--~~~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pv~~~~~~p~~ 396 (458) T protein:vir:10 319 SVLVTAKTISKLRRKLGRHGLK--LSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDSVKLQGQVGRIYGLPVVVSEYFPAK 396 (458) T ss_pred cccccHHHHHHHHHhhhhhhcC--CCEEEEcHHHHHHHHhhcccCCceeeccccccccccCcCceecceeeEEccccccc Confidence 1223477888888888766543 33579999999988643221111000 00112333566678999999999999976 Q ss_pred CCcEE--EEEeCceEEEEEec-ceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 215 DDEQF--VAFHPSAAAYVSQI-DTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 215 ~~~~~--~~~~~~a~~~~~~~-~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ++... +..-.+++....+. ..++...-...-...+++..++|..+++|+++|..+-++| T Consensus 397 ~~~~~~~~~~f~~~~~~~~~~~~~v~~d~~~~~~~~~~~~~~r~~~~v~~~~a~v~~~~aa~ 458 (458) T protein:vir:10 397 ANSAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFANGVVSGTYAAS 458 (458) T ss_pred cCCcceEEEEecccEEEEEeeceEEEeecccCCCceEEEEEEEecceEecccceEEEeeccC Confidence 54322 22222333333332 2333222222223578889999999999999999999999 No 103 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=99.43 E-value=7.4e-14 Score=92.56 Aligned_cols=261 Identities=11% Similarity=0.015 Sum_probs=157.5 Q ss_pred Ccc---cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEEEEE Q lcl|Aclame:pro 1 MAF---NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLI 77 (273) Q Consensus 1 MA~---~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ti 77 (273) ++. ..+.|+++...+.+.+....++..++..-. ...|..+.+|.......+....+++.+...+++.+.+++.. T Consensus 114 t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~---~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~ 190 (392) T protein:vir:13 114 TKAGNPNVLSRTLYGQLIAQAVERSAIMRGGASTFT---TSDANPMDFTVITGRATAGIVGETAEIPESYPATTQRSMGG 190 (392) T ss_pred cccCCCccccccchHHHHHHHHhhhhhhhhcceeee---cCCCceeEEEEEcCCcceeeecccccccccccceeeEEeee Confidence 221 245677787777776666666666553311 22356788988877555566778877776777777777777 Q ss_pred EeeeeceeEechHHHHHhHHHHHHHH-HHHHHHHHHHHHHHHHH-HHHhhcc--------cc-cccccCCHhHHHHHHHH Q lcl|Aclame:pro 78 DQEKSIDFLVDDIDRVQVAGSLEAYT-RAGATALATDTDKFIAD-MLVDNGT--------AL-TGSAPSDADDAFDLIAS 146 (273) Q Consensus 78 d~~~~~~~~i~d~d~~~~~~~~~~~~-~~~~~ala~~iD~~~~~-~~~~~~~--------~~-~~~~~~~~~~~~~~i~~ 146 (273) .+. +.-+.|++.-..++..++.+++ ++.+.+++.++|..++. .....+. .. ..+........+++|.+ T Consensus 191 ~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~Gt~~p~Gil~~~~~~~~~~~~~~~~~~~~d~l~~ 269 (392) T protein:vir:13 191 FKY-GFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFLTGTGTGQPRGILTDATGANAAFGEADADSKVSDALID 269 (392) T ss_pred eeE-EeeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcccCCccccccccccccccccccccccccccHHHHHH Confidence 543 3445677665556666777754 56889999999998874 1111110 00 00111122234777877 Q ss_pred HHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCce Q lcl|Aclame:pro 147 ALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSA 226 (273) Q Consensus 147 a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a 226 (273) +...|..... .+-..+++|..+..|.+..+.-.+ .. ....+..|.-+.++|.+|+.++.+|... ++.++-+. T Consensus 270 ~~~~l~~~~~--~~a~~v~n~~~~~~l~~lkd~~G~--~l-~~~~~~~g~~~~l~G~Pv~~~~~~~~~~---i~~Gdf~~ 341 (392) T protein:vir:13 270 LFHEVPSAYR--KNAKFVVNDLRAAQMRKLKDANGQ--YL-WQSALTVGAPDTFNGKVVETDDGMPADK---VLFADLSK 341 (392) T ss_pred HHHhhhhhhh--cCCEEEEcHHHHHHHHHhhccCCc--ee-ecCCcCCCCCceecceeeEEcCCCCCCc---EEEeeccc Confidence 7766654432 223568899999988643221111 00 1112334555689999999999998643 45555554 Q ss_pred EEEEEec-ceeeeccCCCcc--eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 227 AAYVSQI-DTVEALRDQDSF--SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 227 ~~~~~~~-~~ve~~~~~~~~--~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) +....+. ..++...+.... .+.+++..++|+++.+|+++++++-+.+ T Consensus 342 ~~i~~~~~~~i~~~~~~~~~~~~~~~r~~~r~d~~~~~~~A~~~~~~~~a 391 (392) T protein:vir:13 342 YRVRFAGSLRVDRSVDAKFSTDQIVYRFLQRADGLLVDARGAKVLTVTPA 391 (392) T ss_pred eeEEeecceEEEeeccccccCCcEEEEEEEEeccEEecccceEEEEeecc Confidence 4333322 233333333222 2688999999999999999998866666 No 104 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=99.42 E-value=9.6e-14 Score=91.95 Aligned_cols=262 Identities=15% Similarity=0.088 Sum_probs=155.5 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCc-ccccceE Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSAD-AISDTGV 73 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~-~~~~~~~ 73 (273) |.. -.++|+.|..++++.+++.+++.+++..- ...+.++.+|.....+.+....++...... ..+...+ T Consensus 106 ~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~i 181 (407) T protein:vir:48 106 LQVGNDEDGGYAIPEELDRTILTLLKDEVVMRQEATVI----TLGGSDYKKLVNLGGTTSGWVGETDARPETATSKLGLI 181 (407) T ss_pred hhcccCCCCcccccHhHHHHHHHHHHhhhhhhhhceee----ecCCCceEEEEecCCcceeeecccccccccccccceeE Confidence 322 24789999999999999988887776531 122446788776554445556666655432 3455666 Q ss_pred EEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHH---------HHhhcccc-----------ccc Q lcl|Aclame:pro 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADM---------LVDNGTAL-----------TGS 132 (273) Q Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~---------~~~~~~~~-----------~~~ 132 (273) ++.+.+. ..-+.|++.-..++..++.++ .++.+++++.++|..++.= +....... ... T Consensus 182 ~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~~~a~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~ 260 (407) T protein:vir:48 182 EPFMGEI-YGNPQATQKMLDDAFFNVEDWINSELALEFAEQEEIAFTSGDGSKKPKGFLAYESTDEDDKTRAFGKLQHIA 260 (407) T ss_pred Eeeeeee-EeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeeecccccccccccccccccccc Confidence 6666443 223466666555666677665 5668899999999876631 00000000 001 Q ss_pred ccCCHhHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccc Q lcl|Aclame:pro 133 APSDADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLR 212 (273) Q Consensus 133 ~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~ 212 (273) +.......+++|.++...|.....+ +-.++++|..+..|.+..+.-.+ .. ....+..|..+.++|.+|+.++.+| T Consensus 261 ~~~~~~~~~d~i~~l~~~l~~~~~~--~a~~v~n~~~~~~L~~lkD~~Gr--~l-~~~~~~~g~~~~l~G~PV~~~~~~p 335 (407) T protein:vir:48 261 SGAASGVTADAIIKLIYTLRKAHRS--GAKFMMNNSSLFAIRLLKDNDGN--YL-WRPGIELGQPSSLAGYGIVENEQMP 335 (407) T ss_pred cccccccChHHHHHHHHhhchhhhc--CCEEEEcHHHHHHHHHhhccCCc--ee-eccCcCCCCCceecceeeEEecCcC Confidence 1112223477888888777665432 22578999999988653321111 10 1112335667789999999999998 Q ss_pred cCCC-cEEEE-EeC-ceEEEEEecceeeeccCC--CcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 213 DTDD-EQFVA-FHP-SAAAYVSQIDTVEALRDQ--DSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 213 ~~~~-~~~~~-~~~-~a~~~~~~~~~ve~~~~~--~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ..+. ..+++ +.- .++-...+. .++..++. ..--..+++.+++|+++++|+++++++-+++ T Consensus 336 ~~~~~~~~i~~Gd~~~~~~i~~~~-~~~i~~d~~~~~~~~~~~~~~r~d~~v~~~~a~~~l~~~aa 400 (407) T protein:vir:48 336 DIAADAKAIAFGNFKRGYTIVDRI-GTRILRDPYTNKPFVGFYTTKRTGGMLVDSQAIKLMKIGAA 400 (407) T ss_pred CccCCccEEEEEeccccEEEEEee-ceEEEeeccccCCcEEEEEEEEeccEEecccceEEEEeecc Confidence 6442 22332 332 233333222 12222221 1223578899999999999999999976666 No 105 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=99.42 E-value=8.8e-14 Score=92.17 Aligned_cols=266 Identities=14% Similarity=0.064 Sum_probs=157.9 Q ss_pred CcccchhHHHHHHHHHHHHHHhhccchhhhccccc-cccCCcEEEEEeccccccccccCCCCccCCcccccceEEEEEEe Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEG-IASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQ 79 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~-~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid~ 79 (273) -+-.++.|+.+.+++++.++..+++..+..+-... ...+| .++||+....+.+.+..+++.....+.+.+.+++...+ T Consensus 344 ~~Gg~~vp~~~~~~ii~~l~~~svv~~l~~~~~~~~~~~~~-~~~ip~~t~~~~a~wv~Eg~~~~~s~~~f~~v~l~~~k 422 (645) T protein:vir:93 344 WAGSLSEYQEYAQDFIDYLRPQTIIGRFGQGGIPALRQVPF-NIRVHAQVSGGAAGWVGEGKTKPLTKFDFESITFSHAK 422 (645) T ss_pred ccCCccCchhhHHHHHHhhhhhhhHHhhccccccccccccC-ceeeeeeecCcceEEeccCccccccccceeEEEEeeEE Confidence 11245789999999999999988887775443222 12233 56888876555667788888887777777777776644 Q ss_pred eeeceeEechHHHHHhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhh-----cccc--cccccCCHhHHHHHHHHHHHHH Q lcl|Aclame:pro 80 EKSIDFLVDDIDRVQVAGSLEAYT-RAGATALATDTDKFIADMLVDN-----GTAL--TGSAPSDADDAFDLIASALKEL 151 (273) Q Consensus 80 ~~~~~~~i~d~d~~~~~~~~~~~~-~~~~~ala~~iD~~~~~~~~~~-----~~~~--~~~~~~~~~~~~~~i~~a~~~l 151 (273) . +.-+.|++.-..++..++++++ +.++++++.++|..++.--... +... .............++..+...+ T Consensus 423 l-a~~~~iS~ell~ds~~~~~~~i~~~l~~aia~~~d~a~l~g~g~~~~~~~p~gi~~~~~~~~~~~~~~~d~~~~~~~~ 501 (645) T protein:vir:93 423 V-SAIAVLTEELIRFSSPAADALVRNALAEAVVARLDTDFVDPKKAAVADVSPASITHDVKGTASSGNPDADAEAAFGQF 501 (645) T ss_pred E-EEeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccCCccccceeccccccccccchHHHHHHHHHHH Confidence 2 3344566654455566777765 5689999999999887421111 1111 0111111223445677777778 Q ss_pred hhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCC----CcEEEEEeCceE Q lcl|Aclame:pro 152 TKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTD----DEQFVAFHPSAA 227 (273) Q Consensus 152 ~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~----~~~~~~~~~~a~ 227 (273) ..+++...+-..+++|..+..|.+..+...+. . ..+.-..| +.++|.+|+.|+.+|..- ...++++....+ T Consensus 502 ~~a~~~~~~a~~vmn~~~~~~L~~lkd~~G~~--~-~~~~~~~~--~tL~G~PV~~s~~vp~~~~~gd~s~~~ig~~~~v 576 (645) T protein:vir:93 502 VAANLQPTGAVWLMSSTNALALSMRKNALGQK--E-YPDMTLLG--GSFQGLPVIVSQYVGDQLVLVNAPDIYLADDGGV 576 (645) T ss_pred HhcCCCccccEEEEcHHHHHHHHhccccCCce--e-ecCCCCCC--ceeeceeeEEeccCCcceeEeccccEEEEEecce Confidence 77777655557889999999997653211111 0 01111112 579999999999987521 111222222222 Q ss_pred EEEE-ecceeeecc-CC-C------------cc--eeeEEeeeeeeeEEEcCceEEEEecCC--C Q lcl|Aclame:pro 228 AYVS-QIDTVEALR-DQ-D------------SF--SDRIRALHVYGGKVVRPTGVVVFNKTG--S 273 (273) Q Consensus 228 ~~~~-~~~~ve~~~-~~-~------------~~--~~~v~~~~~~g~~vl~p~~~v~~~~~~--s 273 (273) .+.. +...++... +. . .+ -..|++-++++..+.+|+++++|+... | T Consensus 577 ~i~~s~~a~~~~~~~~~~~~~~~~~~~~v~lf~~d~vaira~~r~d~~~~~p~a~~~lt~~~~g~ 641 (645) T protein:vir:93 577 AVDMSREASLEMQSEPTGDSTTPSPVELVSMFQTGSVAIRAERWINWRRRRTAAVAVITGVNYGS 641 (645) T ss_pred EEEeecceeEEEeecccccccccccccchhHhhcCceEEEEEEEEcceeeCccceEEEecccCCc Confidence 2211 111122110 00 0 11 257899999999999999999997542 2 No 106 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=99.41 E-value=1.9e-13 Score=90.39 Aligned_cols=256 Identities=12% Similarity=0.019 Sum_probs=159.3 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEecccc-ccccccCCCCccCCcccccceE Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAP-TVKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~~~~~~~~~ 73 (273) +.. ..+.|+-+...+++.+.+.+.+.+++.+- ...+.++++|+.... ..+....+++.+...+++.+.+ T Consensus 113 ~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i 188 (390) T protein:vir:81 113 ASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSG----RTDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKK 188 (390) T ss_pred hccccccCCcceechhhhHHHHHHHhhhhhhhhhccee----eccCCceEEEEEecCCcceeeecCCcccccccceeeEE Confidence 111 12445556677888898888888877532 233567888887553 3445567777777777777777 Q ss_pred EEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhh---------cccccccccCCHhHHHHH Q lcl|Aclame:pro 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVDN---------GTALTGSAPSDADDAFDL 143 (273) Q Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~ 143 (273) ++.+.+. +.-+.|++.-.... .++.++ .++.+.+++.++|..++.--... +.........+....++. T Consensus 189 ~~~~~k~-~~~~~is~ell~d~-~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~ 266 (390) T protein:vir:81 189 TDTTHVI-AHTMKATRQILSDA-PQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQ 266 (390) T ss_pred EEeeeEE-EEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccceeecccccccccccccchhHHH Confidence 7777654 34456776533333 457665 45688999999999877421000 001111122233445778 Q ss_pred HHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEe Q lcl|Aclame:pro 144 IASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFH 223 (273) Q Consensus 144 i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~ 223 (273) |.++...+.....+.. .++++|..+..|.+..+.-.+ ..- ... ..|..+.++|.+|+.++.+|.+. ++.+. T Consensus 267 ~~~~~~~~~~~~~~~~--~~v~~~~~~~~l~~lkd~~G~--~l~-~~~-~~~~~~~l~G~pv~~~~~~p~~~---~~~gd 337 (390) T protein:vir:81 267 LRLAMLQASLAEYNPS--GIVINPIDWAAIELAKDANNQ--YLI-GNA-RGTLTPTLWGLPVVATQAMAPGE---FLVGA 337 (390) T ss_pred HHHHHHhhccccCCCC--EEEEcHHHHHHHHHhhcCCCc--eee-cCc-ccccCceecceeeEEcCCCCCCc---EEEEe Confidence 8888888887776443 578999999998653321111 110 111 13445689999999999998654 44444 Q ss_pred Cc-eEEEEEe-cceeeeccCCCcc---eeeEEeeeeeeeEEEcCceEEEEecC Q lcl|Aclame:pro 224 PS-AAAYVSQ-IDTVEALRDQDSF---SDRIRALHVYGGKVVRPTGVVVFNKT 271 (273) Q Consensus 224 ~~-a~~~~~~-~~~ve~~~~~~~~---~~~v~~~~~~g~~vl~p~~~v~~~~~ 271 (273) -+ ++....+ ...++..+....| ...+++.+++|.++++|+++|+++-+ T Consensus 338 ~~~~~~~~~~~~~~v~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 338 FDLAAQIFDQWDARVEIGYVGEDFQRNMITVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred hhceEEEEEecceEEEEecccchhhcCcEEEEEEEeeccEEecccceEEEEeC Confidence 33 4443433 3345554433332 24688999999999999999999988 No 107 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=99.41 E-value=7.5e-14 Score=92.54 Aligned_cols=266 Identities=11% Similarity=0.041 Sum_probs=154.8 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEecccc-ccccccCCCCccCCcccccceE Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAP-TVKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~~~~~~~~~ 73 (273) |.. -.++|+.|...+++.++....+.+++++-. ...|..+.+|..... ..+....++......++....+ T Consensus 117 ~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~f~~~ 193 (409) T protein:vir:45 117 QGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQILT---TSDGRTMEWATADGTSEVGVLLGENEEAGEEDTDFGMG 193 (409) T ss_pred ccCccCcCCceeccHhHHHHHHHHHHhhhhhhhhceeee---cCCCceEEEEeeccCcccccccccccccccccccccee Confidence 322 146899999999999998888877765421 122445666666542 3445667777666556665555 Q ss_pred EEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHH-HHhh---cc----cc--cccccCCHhHHHH Q lcl|Aclame:pro 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADM-LVDN---GT----AL--TGSAPSDADDAFD 142 (273) Q Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~-~~~~---~~----~~--~~~~~~~~~~~~~ 142 (273) ++...+....-+.|++.-..++..++.++ .++.+++++.++|..++.- -... +. .. ......+....++ T Consensus 194 ~l~~~k~~~~~i~is~ell~ds~~~l~~~i~~~la~a~~~~~~~a~l~G~G~~~~~~p~Gil~~~~~~~~~~~~~~~~~d 273 (409) T protein:vir:45 194 SLGALKMTSKIIRVSNELLQDSAIDMEAYLARRIAERIGRGEARYLIQGTGAGTPKQPKGLAASVTGTTQTAAANAVKWQ 273 (409) T ss_pred eeeeeeeeeeehhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCccccceeeeccccccccccccccchH Confidence 54432221222457776555566677665 5568899999999987741 1100 00 00 0111122233467 Q ss_pred HHHHHHHHHhhcCCCcCCcE-EEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCC-cE-E Q lcl|Aclame:pro 143 LIASALKELTKANVPNVGRV-VVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD-EQ-F 219 (273) Q Consensus 143 ~i~~a~~~l~~~~vp~~~r~-lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~-~~-~ 219 (273) +|.++...|..... ....+ ++++|..+..|.+-.+.-.+ .. ....+..|.-+.++|.+|+.++.+|..+. .. + T Consensus 274 ~i~~l~~~l~~~~~-~~a~~~~~~n~~~~~~l~~lkd~~G~--~i-~~~~~~~~~~~~l~G~PV~~~~~~p~~~~~~~~i 349 (409) T protein:vir:45 274 EILALKHSIDPAYR-RGPKFRLAFNDNTLKLISEMEDGQGR--PL-WLPDIVGVAPASVLNVPYVIDQEIDDIGAGKKFM 349 (409) T ss_pred HHHHHHHhhhhhhc-cCCeEEEEECHHHHHHHHHhhcCCCc--ee-eccCcCCCCCceecceeeEEecCcCCccCCccEE Confidence 78888777765542 23344 57899999888543211111 10 11123345557899999999999986332 22 3 Q ss_pred EEEeCceEEEEEe-cceeeeccCCCcc--eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 220 VAFHPSAAAYVSQ-IDTVEALRDQDSF--SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 220 ~~~~~~a~~~~~~-~~~ve~~~~~~~~--~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) +.+.-+.+.+..+ ...++..++.-.. ...|++..++|+++.+|+++++++-.+| T Consensus 350 ~~Gd~~~~~i~~~~~~~~~~~~d~~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~k~s 406 (409) T protein:vir:45 350 FCGDFDRFIIRRVRYMILKRLVERYAEYDQTGFLAFHRFDCILEDTSAIKALVGKGS 406 (409) T ss_pred EEeehhhhheeeccceEEEEeecccccCCcEEEEEEEEeccEeechhheEEEEeccC Confidence 3333332222222 2233333332211 2568999999999999999999988877 No 108 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=99.40 E-value=1.1e-14 Score=97.09 Aligned_cols=256 Identities=11% Similarity=0.071 Sum_probs=137.7 Q ss_pred Ccccchh-------HHHHHHHHHHHHHHhh-ccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccc- Q lcl|Aclame:pro 1 MAFNNFI-------PELWSDMLLEEWTAQT-VFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDT- 71 (273) Q Consensus 1 MA~~~~~-------pev~~~~v~~~l~~~~-v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~- 71 (273) ||-++++ |+.+. ...+|.+++ .+..+.....-.....|++|++|+|.+.+.+....+|+.++.+.++.+ T Consensus 1 mAe~nlt~~~dL~~~~sid--fv~~f~~~i~~L~~~Lgi~r~~p~a~G~tIt~pK~~~tgda~dVaEGe~Iplskvt~~~ 78 (295) T protein:vir:99 1 MAEKNLNTMADLGDIKSID--FVNKFSKNINDLLKLLGVTRRETLTNDLKIQTYKWEVTLDQTDPGEGETIPLSKVTRTK 78 (295) T ss_pred CCCcccccHhhccCceeeh--hhHHhhhhHHHHHHHhccccccccccCCeEEeeeeeeecccccccCCcccchhhheeee Confidence 9875443 22211 111222211 111111111111245599999999999999999999999999998865 Q ss_pred --eEEEEEEeeeeceeEechHHHH-HhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHH Q lcl|Aclame:pro 72 --GVDLLIDQEKSIDFLVDDIDRV-QVAGS-LEAYTRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASA 147 (273) Q Consensus 72 --~~~~tid~~~~~~~~i~d~d~~-~~~~~-~~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a 147 (273) ..++++++++. .++|+..+ ...++ +.+..+|+.++|++++|++++..+..+..... ..+....++.+..+ T Consensus 79 ~~t~t~kikK~rK---~tTdEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lktat~t~t---g~~lq~a~a~~~~a 152 (295) T protein:vir:99 79 DKDYTVKWFKKRR---ATTAEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKTKPTKVK---GVGLQKALSASWAK 152 (295) T ss_pred eeeeEEEeeeecc---cccHHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhccCceeee---hhhHHHHHHHhhhh Confidence 57777766543 34766643 33334 46678899999999999999999965433321 11112223333333 Q ss_pred HHHHhhcCCCcCCcEEEECHHHHHHHhcchHH-hhhhhcccccceeeeeeeeeecceE-EEEecccccCCCc------EE Q lcl|Aclame:pro 148 LKELTKANVPNVGRVVVVNAEMAFWLRSSGSK-LTSADTSGDAAGLRAGTIGNLLGAR-IVESNNLRDTDDE------QF 219 (273) Q Consensus 148 ~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~-~~~~~~~~~~~~~~~G~ig~i~G~~-i~~s~~l~~~~~~------~~ 219 (273) ...+.+.. ....+++++|...+.|+++... +..+...| .+.+. +++|++ |++|+.+|.+... -. T Consensus 153 l~~f~Ee~--~~~~V~FVnP~D~a~yl~~A~~~~~~a~~fG-~~~L~-----nfLG~q~II~S~kv~~G~~~aT~~~Ni~ 224 (295) T protein:vir:99 153 LATFNEFE--GSPLVSFVSPLDVANYLGDTKVGADASNVFG-MTLLK-----NFLGMQNVIVMPSVPEGKIYSTAVENLV 224 (295) T ss_pred hhhccccc--CCceEEEEehHHHHHHHhccccccchhhhhh-hhhhh-----hhhccceEEEcccCCCceEEEeeccceE Confidence 33333332 1245899999999999887532 23333333 23333 599997 9999999876531 12 Q ss_pred EEEeCceEE-EEEecce-------eeeccCCCcceeeEEeeeeeeeEEE---cCceEEEEe--cCCC Q lcl|Aclame:pro 220 VAFHPSAAA-YVSQIDT-------VEALRDQDSFSDRIRALHVYGGKVV---RPTGVVVFN--KTGS 273 (273) Q Consensus 220 ~~~~~~a~~-~~~~~~~-------ve~~~~~~~~~~~v~~~~~~g~~vl---~p~~~v~~~--~~~s 273 (273) +++-+--.+ +...... +-...+.....=.+.. ..+.+-++ +++++|+.+ +.+| T Consensus 225 ~ay~~~~~g~l~~~f~~~~D~tglIg~~h~~~~~~~t~et-~~~~~~~lfpE~~dgiv~~tI~~~~~ 290 (295) T protein:vir:99 225 FASLNVKGGDLGGLFADFTDETGLIAAARNRQLSNLTYES-VFFGANVLFAEIPEGVVEATIEAAAV 290 (295) T ss_pred EEEecCCchhhhhhhhhccCcccceEEEeccccceeeehh-hhHhHHHhcccccceEEEEEEecCcC Confidence 222111100 1111110 1111222221111222 22233334 555887654 4444 No 109 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=99.40 E-value=1.1e-13 Score=91.69 Aligned_cols=262 Identities=14% Similarity=0.069 Sum_probs=155.7 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCC-cccccceE Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSA-DAISDTGV 73 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~-~~~~~~~~ 73 (273) |+. -.++|+.|..++++.++...++..++..- ...|.+..+|.......+....++..... +..+.+.+ T Consensus 107 ~~~~~~~~GG~~iP~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~a~wv~E~~~~~~~~~~~~~~v 182 (401) T protein:vir:44 107 LQVGTDEDGGYAVPEELDRSILSLLKDEVVMRQEATVI----TVGGSDYKKLVNLGGTASGWVGETDTRSQTATSRLGLI 182 (401) T ss_pred hhcCCCCCCceeccHhHHHHHHHHHHhhhhhhhhceee----ecCCCceEEEEecCCccceeeccccccCccccccceee Confidence 433 25789999999999999988887776532 22355667776644444445566655432 23455555 Q ss_pred EEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHH-HHhhccc-------------------cccc Q lcl|Aclame:pro 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADM-LVDNGTA-------------------LTGS 132 (273) Q Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~-~~~~~~~-------------------~~~~ 132 (273) ++...+. +.-+.|+..-..++..++.++ .+..+++++.++|..++.= -...+.. .... T Consensus 183 ~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~ai~~~~~~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~ 261 (401) T protein:vir:44 183 EPFMGEI-YGNPQATQKMLDDAFFNVEAWINSELATEFAEQEEIAFTTGDGTKKPKGFLAYESTEESDKARAFGKLQHIV 261 (401) T ss_pred eeehhhe-eeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeeccccccccccccccccccccc Confidence 5555433 333466665555556677665 4567899999999887731 0000000 0001 Q ss_pred ccCCHhHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccc Q lcl|Aclame:pro 133 APSDADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLR 212 (273) Q Consensus 133 ~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~ 212 (273) +.......+++|.++...|..... .+-.++++|..+..|.+..+.-.+ .. ....+..|.-+.++|.+|+.++.+| T Consensus 262 t~~~~~~~~d~i~~~~~~l~~~~~--~~a~~v~n~~~~~~L~~lkd~~G~--~l-~~~~~~~g~~~~l~G~PVv~~~~~p 336 (401) T protein:vir:44 262 SGEATAVTADAIIKLIYTLRKAHR--TGAKFMMNNNSLFAIRLLKDTEGN--YL-WRPGLELGQPSSLAGYGIAENEQMP 336 (401) T ss_pred cccccccCHHHHHHHHHhcchhhh--cCCEEEEcHHHHHHHHHhhccCCc--ee-ecCCcCCCCCceecceeeEEecCcC Confidence 111222347788888777765432 233678999999999654321111 10 1112335677789999999999998 Q ss_pred cCCC-cEE-EEEeC-ceEEEEEecceeeeccCCC--cceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 213 DTDD-EQF-VAFHP-SAAAYVSQIDTVEALRDQD--SFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 213 ~~~~-~~~-~~~~~-~a~~~~~~~~~ve~~~~~~--~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ..+. ..+ +.+.- .++....+.. ++..++.- .=-..+++.+++|+.+++|++++.|+-++| T Consensus 337 ~~~~~~~~i~~Gd~~~~~~i~~~~~-~~~~~~~~~~~~~v~~~a~~r~d~~~~~~~a~~~l~~~aa 401 (401) T protein:vir:44 337 DIAADAKAIAFGNFKRGYTIVDRIG-TRILRDPYTNKPFVGFYTTKRTGGMLVDSQAIKLLKIAAA 401 (401) T ss_pred CccCCccEEEEeehhccEEEEEecc-eEEeeeccccCCcEEEEEEEEeccEEecccceEEEEeecC Confidence 5433 223 33433 3444443332 22222221 112568899999999999999999988888 No 110 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=99.40 E-value=1.4e-13 Score=90.98 Aligned_cols=264 Identities=16% Similarity=0.067 Sum_probs=154.2 Q ss_pred Cccc-----chhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEEE Q lcl|Aclame:pro 1 MAFN-----NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDL 75 (273) Q Consensus 1 MA~~-----~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 75 (273) ||.. .++|+.++.++++.+++..++..++.+- ..++..++||+....+.+....+++.+...+++.+++++ T Consensus 1 Mat~tt~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i----~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f~~v~l 76 (311) T protein:vir:99 1 MATFGTGNLKNLPRNIADGMVKDVVQGSTVAVLSARK----PQRFGNEDIITFNGRPKAEFVGEGQQKSSTTGEFDFVTS 76 (311) T ss_pred CceecCCCceeccHHHHHHHHHHHHhhchhhhhccee----eccCCceEEEEEeCCceeEEeecCcccccccceeeEEEE Confidence 9972 5579999999999999999888877542 222346789998666666777888887777777777777 Q ss_pred EEEeeeeceeEechHHHHH---hHHHHHH-HHHHHHHHHHHHHHHHHHHHHHh-hcc-------------cccccccCCH Q lcl|Aclame:pro 76 LIDQEKSIDFLVDDIDRVQ---VAGSLEA-YTRAGATALATDTDKFIADMLVD-NGT-------------ALTGSAPSDA 137 (273) Q Consensus 76 tid~~~~~~~~i~d~d~~~---~~~~~~~-~~~~~~~ala~~iD~~~~~~~~~-~~~-------------~~~~~~~~~~ 137 (273) ...+. +.-+.|+++-..+ ...++.+ ..++++++++.++|+.++.--.. .+. .....+..+. T Consensus 77 ~~~k~-~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~~~~~~~~~ 155 (311) T protein:vir:99 77 TPKKA-QVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASKRVELTADTI 155 (311) T ss_pred eeEEE-EEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccceeecccccc Confidence 66443 3445666654332 2345655 45678999999999988743210 000 0000111222 Q ss_pred hHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCC- Q lcl|Aclame:pro 138 DDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD- 216 (273) Q Consensus 138 ~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~- 216 (273) .....++.++...+..++.....-.++++|..+..|.+..+...+ .. .......+..+++.|++++.++.+|.... T Consensus 156 ~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~--~l-~~~~~~~~~~~~l~G~Pv~~s~~i~~~~~~ 232 (311) T protein:vir:99 156 ANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTARYTDGR--KK-FPELGLGIGVSSFEGIDASVSDTVNGGDEA 232 (311) T ss_pred chhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhhccCCC--ee-ecCcccCCCCceecceeeEeeccccccccc Confidence 233445666655555444322111379999999999654321111 00 11122234457899999999998874321 Q ss_pred ------------cEEEEEeCc-eEEEE-EecceeeeccC--CC----cc---eeeEEeeeeeeeEEEcCceEEEEecCC Q lcl|Aclame:pro 217 ------------EQFVAFHPS-AAAYV-SQIDTVEALRD--QD----SF---SDRIRALHVYGGKVVRPTGVVVFNKTG 272 (273) Q Consensus 217 ------------~~~~~~~~~-a~~~~-~~~~~ve~~~~--~~----~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~ 272 (273) ..++++.-+ ++.+. .+...++..+. .+ .| ...+++.+++|+.+++|+.+++..+++ T Consensus 233 ~~~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~r~~~r~d~~v~~~~~v~~~~~~A 311 (311) T protein:vir:99 233 DPDDEDLDAARAVRGIVGDFANGIHWGVQRDIPVELIKYGDPDGQGDLKRHNQIALRLEIVYGWYVFTDRFVVIENAVA 311 (311) T ss_pred ccccchhhccCcceEEEeeccccEEEEEecCceEEEeecCCCCcchhhhhcCcEEEEEEEeecceecChhHeeeecccC Confidence 112222211 22221 11122222211 11 11 246788999999999998887777777 No 111 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=99.40 E-value=3.7e-13 Score=88.73 Aligned_cols=255 Identities=13% Similarity=0.026 Sum_probs=156.0 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEecccc-ccccccCCCCccCCcccccceE Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAP-TVKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~~~~~~~~~ 73 (273) +.. .++.|+ +...+++.+...+.+.+++.. ....+.++++|++... +.+....+++.....+++.+.+ T Consensus 114 ~~~~~~~~g~~~~~~-~~~~ii~~~~~~~~l~~~~~~----~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i 188 (390) T protein:vir:10 114 STDAAGSAGALTTPN-RLPGFITQPDARLTVRDLIGS----GRTDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKK 188 (390) T ss_pred hcccccccccccchh-HHHHHHHHHHhhchhhhhcce----eeccCCceEEEEEecCCcceeeecCCccccccccceeEE Confidence 111 134444 456788888888777777643 2223567899987553 3456677777777677777777 Q ss_pred EEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHh---------hcccccccccCCHhHHHHH Q lcl|Aclame:pro 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVD---------NGTALTGSAPSDADDAFDL 143 (273) Q Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~ 143 (273) ++.+.+. +.-+.|++.-... ..++.++ .++.+++++.++|..++.--.. .+.........+....++. T Consensus 189 ~~~~~k~-~~~~~is~ell~d-~~~l~~~i~~~l~~~~~~~~~~~il~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~ 266 (390) T protein:vir:10 189 TDTTHVI-AHTMKATRQILSD-APQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQ 266 (390) T ss_pred EEeeEEE-EEeehhhHHHHHh-HHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCccccccccccccccccccccccchHHH Confidence 7777654 3455666643333 3456665 4568889999999987742100 0011111122233445778 Q ss_pred HHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEe Q lcl|Aclame:pro 144 IASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFH 223 (273) Q Consensus 144 i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~ 223 (273) +.++...+.....+.. .++++|..+..|.+..+.-.+ ... .... .+.-+.+.|.+|+.++.+|.+. ++.+. T Consensus 267 ~~~~~~~l~~~~~~~~--~~v~n~~~~~~L~~lkd~~g~--~l~-~~~~-~~~~~~l~G~pv~~~~~~p~~~---~~~gd 337 (390) T protein:vir:10 267 LRLAMLQASLAEYPAS--GIVINPIDWAAIELAKDANNQ--YLI-GNAR-GTLTPTLWGLPVVATQAMAPGE---FLVGA 337 (390) T ss_pred HHHHHHhhccccCCCC--EEEEcHHHHHHHHHhhcCCCc--eee-cCCc-CcCCceecceeeEEcCCCCCCc---EEEEe Confidence 8888888887776544 578999999998653321111 111 1111 2334679999999999998654 44444 Q ss_pred Cc-eEEEEEe-cceeeeccCCCcc---eeeEEeeeeeeeEEEcCceEEEEecC Q lcl|Aclame:pro 224 PS-AAAYVSQ-IDTVEALRDQDSF---SDRIRALHVYGGKVVRPTGVVVFNKT 271 (273) Q Consensus 224 ~~-a~~~~~~-~~~ve~~~~~~~~---~~~v~~~~~~g~~vl~p~~~v~~~~~ 271 (273) -+ ++....+ ...++..+....| ...+++.+++|+++++|+++++++-+ T Consensus 338 f~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 338 FDLAAQIFDQWDARVEIGYVNDDFQRNMVTVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred ccceEEEEEecceEEEEeecccccccCcEEEEEEEeeccEEeccccEEEEEeC Confidence 33 3333332 2345544432222 35788889999999999999999888 No 112 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=99.40 E-value=1.3e-13 Score=91.26 Aligned_cols=263 Identities=12% Similarity=0.030 Sum_probs=156.5 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcc-cccceE Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADA-ISDTGV 73 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~-~~~~~~ 73 (273) |.. -.++|+-|...+++.++...++.+++..- ...+...++|.....+.+....+++.....+ .+.+.+ T Consensus 130 l~~~t~~~gG~lvP~~~~~~ii~~~~~~s~l~~l~~~~----~~~~~~~~~~~~~~~~~a~wv~E~~~~~~~~~~~f~~v 205 (425) T protein:vir:10 130 LNKGEDSEGGYLTPIEWDRTITNKLVLISPMRQLCRVQ----PVSKAGFSKLFNMGGTTSGWVGEASQRPQTNAATFQPL 205 (425) T ss_pred hhcCcCCCCceeccHhHHHHHHHHHHhhhhhhhhceee----eccCCceEEEEEcCCcceeeecccccccccccccccee Confidence 322 13789999999999999998888887532 1223456777765544555566666544333 345555 Q ss_pred EEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHH---------HHhhccccc-----------cc Q lcl|Aclame:pro 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADM---------LVDNGTALT-----------GS 132 (273) Q Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~---------~~~~~~~~~-----------~~ 132 (273) ++...+. +.-+.|+..-..++..++.++ .++.+++++.++|..++.= +........ .. T Consensus 206 ~~~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~ai~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~ 284 (425) T protein:vir:10 206 SFASGEI-YANPAATQQILDDAEIDLESWLATEVQTEFAKQEGKAFLAGDGTNKPNGLLTYIAGGANAAKHPFGAIEVVN 284 (425) T ss_pred eeeheee-EeehHhHHHHHhcchhHHHHHHHHHHHHHHHHHHHhhhhcccCCCCcceeeecccccccccccccccccccc Confidence 6555332 333466665555556677665 5678999999999987741 100000000 01 Q ss_pred ccCCHhHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccc Q lcl|Aclame:pro 133 APSDADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLR 212 (273) Q Consensus 133 ~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~ 212 (273) +.......+++|.++...|..... .+-.++++|..+..|.+..+.-.+ .. ....+..|.-+.++|.+|+.++.+| T Consensus 285 ~~~~~~~~~d~l~~l~~~l~~~~~--~~a~~vmn~~~~~~L~~lkD~~G~--~l-~~~~~~~g~~~~l~G~PV~~~~~~p 359 (425) T protein:vir:10 285 SGAAADITSDGIIDLVYDLPSAFT--GNARFAMNRNTQRQVRKLKDGQGN--YL-WQPSYVAGQPATLAGYPVTEVPDMP 359 (425) T ss_pred ccccccccHHHHHHHHhhhhhhhc--cCCEEEEchHHHHHHHHhhcCCCc--ee-eccCccCCCCceecceeeEEecCcC Confidence 112233456778777766654432 233678999999998654321111 00 1122345666789999999999998 Q ss_pred cCCC-cE-EEEEe-CceEEEEEec-ceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 213 DTDD-EQ-FVAFH-PSAAAYVSQI-DTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 213 ~~~~-~~-~~~~~-~~a~~~~~~~-~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) .... .. ++.+. ..++-...+. .++.......+--..+++.+++|+++++|+++++|+-++| T Consensus 360 ~~~~~~~~i~~Gd~~~~~~i~~~~~~~v~~d~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~as 424 (425) T protein:vir:10 360 DVAANSTPILFGDFQQTYLIIDRIGVRVLRDPYTAKPYVLFYTTKRVGGGLLNPEPMRAMKVAAS 424 (425) T ss_pred CccCCccEEEEEehhccEEEEEecceEEEecccccCCcEEEEEEEEeccEeecccceEEEEeecc Confidence 5433 22 33332 2333333332 1222221122223678899999999999999999999999 No 113 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=99.38 E-value=6e-13 Score=87.57 Aligned_cols=258 Identities=9% Similarity=0.054 Sum_probs=158.2 Q ss_pred Ccc----cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccc--cccccCCCCccCCcccccceEE Q lcl|Aclame:pro 1 MAF----NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPT--VKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~----~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~--~~d~~~~~~~~~~~~~~~~~~~ 74 (273) |.. ..++|+-|...+++.+...+.+.+++..- ...+.++.+|+....+ ......+++.....+++.+.++ T Consensus 109 ~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~f~~i~ 184 (379) T protein:vir:10 109 MTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGAV----SISGGTYTFVRENGAGEGAIGAQVEGATKGQKDYDISMID 184 (379) T ss_pred cccCCCCccccchhhhhHHHHhHHhhhhHHhhceee----eccCCceEEEEeecCCCcccccccCCccccccccceeeeE Confidence 221 23568999999999988888777776431 2235678999864332 2223566766666677778877 Q ss_pred EEEEeeeeceeEechHHHHHhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHHHHhh Q lcl|Aclame:pro 75 LLIDQEKSIDFLVDDIDRVQVAGSLEAYT-RAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTK 153 (273) Q Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~~~~~~-~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~~ 153 (273) +.+.+... -+.|++.-.... ..+.+++ ++.+++++.++|..++.-+...+ ..+....+....++.|.++...+.. T Consensus 185 ~~~~k~~~-~~~iS~ell~D~-~~l~~~i~~~la~~~~~~~~~~~~~g~~~~~--~~~~~~~~~~~~~d~i~~~~~~~~~ 260 (379) T protein:vir:10 185 VNTDFIAG-FTRYSKKMANNL-PFLTSFIPNALRRDYAKAENAAFNAVLAANA--TASTEIITNKNKVEMLINEIAKQEN 260 (379) T ss_pred eeeeeEEe-eehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHhccccccc--ccccccccCcccHHHHHHHHHhhhh Confidence 77766533 346666533333 3466655 45788899999988776543222 1222223334456778888777777 Q ss_pred cCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccc-eeeeeeeeeecceEEEEecccccCCCcEEEEEeCceEEEE-E Q lcl|Aclame:pro 154 ANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAA-GLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYV-S 231 (273) Q Consensus 154 ~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~-~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a~~~~-~ 231 (273) ...+.. .++++|..|..|.+..+... ....... ....|.-..++|++|+.++.+|.+. ++.+.-+..... . T Consensus 261 ~~~~~~--~~vmn~~~~~~l~~lkd~~G--~~l~~~~~~~~~~~~~~l~G~pvv~s~~~~ag~---~~~gdf~~~~~~~~ 333 (379) T protein:vir:10 261 LDFPVT--AIVLRPTDYYDILVTQKSVG--AGYGLPGVVTQDNGVLRINGIPLFRATWLAANK---YYVGDWTRVTKVTT 333 (379) T ss_pred ccCCCC--EEEEcHHHHHHHHHhhccCC--ceeccCCccCCCCCcceecceeeEecCCCCCCc---eEEeecccEEEEEE Confidence 765443 57889999999865432111 1111111 1123444589999999999987643 344433332222 2 Q ss_pred ecceeeeccCCC-cc---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 232 QIDTVEALRDQD-SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 232 ~~~~ve~~~~~~-~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) +...++..+... .| -..+++.+++|+.+++|+++|.++=++= T Consensus 334 ~~~~i~~~~~~~~~f~~~~~~~r~~~R~~~~v~~p~a~v~~~~~~~ 379 (379) T protein:vir:10 334 EGLSLEFSEVEGTNFVKNNITARIEAQVALAVEQPAALIFGDFTAV 379 (379) T ss_pred eceEEEEeecccccccCCcEEEEEEEEeccEEecCccEEEEEecCC Confidence 223444444322 23 3588899999999999999998765555 No 114 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=99.38 E-value=7.2e-13 Score=87.15 Aligned_cols=262 Identities=12% Similarity=0.068 Sum_probs=155.8 Q ss_pred Ccc--------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccce Q lcl|Aclame:pro 1 MAF--------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTG 72 (273) Q Consensus 1 MA~--------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~ 72 (273) ++. -.++|+.|...+++.++...++..+..+-. ....| .+.+|+....+.+....+++..+..+++.+. T Consensus 130 ~~~~~~~~~~gg~lvP~~~~~~ii~~l~~~~~i~~~~~~~v--~~~~~-~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~~ 206 (435) T protein:vir:80 130 MSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTL--PLSNG-NITIPRLKGGAIVGYIGADTDIPTTQQQFDD 206 (435) T ss_pred hhhcccCCCCCccccchhHHHHHHHHHhhhchhhhccceee--ecCCC-ceEEEEEeCCcceeeeccCccccccccceee Confidence 211 246799999999999988877766522211 12223 5889988665556667777777666677777 Q ss_pred EEEEEEeeeeceeEechHHHHHh--HHHHHHH-HHHHHHHHHHHHHHHHHHHHHh--hcc----------cccccccCCH Q lcl|Aclame:pro 73 VDLLIDQEKSIDFLVDDIDRVQV--AGSLEAY-TRAGATALATDTDKFIADMLVD--NGT----------ALTGSAPSDA 137 (273) Q Consensus 73 ~~~tid~~~~~~~~i~d~d~~~~--~~~~~~~-~~~~~~ala~~iD~~~~~~~~~--~~~----------~~~~~~~~~~ 137 (273) +++...+. +.-+.|++.-..++ ..+++++ .++.+++++.++|..++.--.. .+. ....+...+. T Consensus 207 i~~~~~k~-~~~~~is~ell~ds~~~~~l~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~ 285 (435) T protein:vir:80 207 LKLTAKKM-AALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPGNVITASDGSTL 285 (435) T ss_pred EEEeeEEE-EEeehhhHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCcccceeecccccceeecccccch Confidence 77666543 34456666544443 2356665 4568899999999987742100 000 0111122233 Q ss_pred hHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCC- Q lcl|Aclame:pro 138 DDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD- 216 (273) Q Consensus 138 ~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~- 216 (273) .....++.++...|........+-.++++|..+..|.+..+. .|. ..+....=+.++|.+|+.++.+|.... T Consensus 286 ~~~~~d~~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~------~G~-~l~~~~~~~~l~G~pv~~~~~~p~~~~~ 358 (435) T protein:vir:80 286 QKIETDLGKAILALENADANLTQPGWIMAPRTFRFLEGLRDG------NGN-KVYPELANGMLKGYPVGKTTQVPINLGE 358 (435) T ss_pred hhHHHHHHHHHHHhhccccccccCEEEEcHHHHHHHHhhhcc------CCc-eeccCCCCCeEeeeeeEEeccccccccC Confidence 344556777766676655433344678999999988553221 111 111111124789999999999985321 Q ss_pred ----cEEEEEeCceEEEEEe-cceeeeccCCC----------cc---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 217 ----EQFVAFHPSAAAYVSQ-IDTVEALRDQD----------SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 217 ----~~~~~~~~~a~~~~~~-~~~ve~~~~~~----------~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ..++.++.+-+-+..+ ...++..+... .| ...+++.+++|+++.+|++++.|+..+= T Consensus 359 ~~~~~~i~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~~~~ 433 (435) T protein:vir:80 359 AGKESEIYFTDFGDVFIGEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSGVAW 433 (435) T ss_pred CCCcceEEEEEcccEEEEeecceEEEEeccccccccccchhhhhhcCcceeeeeeeeCcEeecccceEEEeccCC Confidence 2244454443333322 22344333321 11 3688999999999999999999987765 No 115 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=99.36 E-value=1.8e-13 Score=90.42 Aligned_cols=261 Identities=12% Similarity=0.057 Sum_probs=160.1 Q ss_pred Ccc--------cchhHHHHHHHHHHHHHHhhcc--chhhhcccccc---ccCCcEEEEEecccccccccc-CCC---Ccc Q lcl|Aclame:pro 1 MAF--------NNFIPELWSDMLLEEWTAQTVF--ANLVNREYEGI---ASKGNVVHIAGVVAPTVKDYK-AAG---RQT 63 (273) Q Consensus 1 MA~--------~~~~pev~~~~v~~~l~~~~v~--~~~~~~d~~~~---~~~Gdtv~ip~~~~~~~~d~~-~~~---~~~ 63 (273) ||- .+|+||+|...+.+.-.+...| .+++..+-++. ...|+++++|.|+.+.-.+.+ .++ ..+ T Consensus 1 M~~~~~~T~l~Dii~pEvF~~Yv~~~~~e~~~l~qSGiv~~d~~l~~~~~~gG~~v~iPf~~~L~g~~~n~~~d~~~~~~ 80 (367) T protein:vir:80 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) T ss_pred CcchhhhhhhhhccchhhhhHHHhhhhhhhhhhhhcceeecCHHHHHHhhcCCCEEEeeeeccCCCCccccCCCCCcccc Confidence 993 1489999999998877655433 34454554442 256999999999997432221 111 234 Q ss_pred CCcccccceEEEEEEeeeeceeEechHHHHHhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhcc--------------- Q lcl|Aclame:pro 64 SADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADMLVDNGT--------------- 127 (273) Q Consensus 64 ~~~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~iD~~~~~~~~~~~~--------------- 127 (273) ++..++.++....+ .++..+|..+|.....+-.+ ++++..|.+.--.+...+.+++.+...=. T Consensus 81 t~~kittg~~~a~v-~~r~kaw~~~Dla~~lsG~dpm~~Ia~qva~yW~r~~q~~Lla~L~Gvf~~~~a~~~~~~~~~~~ 159 (367) T protein:vir:80 81 PIDGLGSGEMKTTK-TWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGR 159 (367) T ss_pred cccccccchheeee-ehhcccchhhhHHHHhhCchHHHHHHHHHHHHhhhhhHHHHHHHHHHhhccccccchhhhhhhhc Confidence 55666666555444 55677787777765444333 45556665555555555556665542100 Q ss_pred -----------c-cc---ccccCCHhHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhccccccee Q lcl|Aclame:pro 128 -----------A-LT---GSAPSDADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGL 192 (273) Q Consensus 128 -----------~-~~---~~~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~ 192 (273) . .. .+...+.....+.+.+|+..|.+++- .=..++++|.++..|.+.. +...-...+ . T Consensus 160 ~~a~~~~~~~~~~~Dis~~t~~~~~~~s~~~~~~A~~~lGD~~~--~l~~i~mHS~V~~~L~~~~--li~~i~~sd-~-- 232 (367) T protein:vir:80 160 VPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVG--SIAAIAVHSMVYKRMTNND--EIEFIPDSK-G-- 232 (367) T ss_pred cccccccccCceeeeeeccCCCccceecHHHHHHHHHHhccccc--cccEEEEchHHHHHHHhcc--ccccccCCC-C-- Confidence 0 00 00111122346678999999977652 2246899999999997653 222211111 1 Q ss_pred eeeeeeeecceEEEEecccccC-----CCcEEEEEeCceEEEEEecc--eeeeccCCCcc----eeeEEeeeeeeeEEEc Q lcl|Aclame:pro 193 RAGTIGNLLGARIVESNNLRDT-----DDEQFVAFHPSAAAYVSQID--TVEALRDQDSF----SDRIRALHVYGGKVVR 261 (273) Q Consensus 193 ~~G~ig~i~G~~i~~s~~l~~~-----~~~~~~~~~~~a~~~~~~~~--~ve~~~~~~~~----~~~v~~~~~~g~~vl~ 261 (273) +..|+.+.|..|++++.+|.. ..++++++-++|+++..... .+|..|++... -|.+..+-+ +++. T Consensus 233 -~~~i~ty~G~~VIvDD~~Pv~~~~a~~~yttYlfg~GAi~~~~~~~~~~~E~~Rd~~~~~~gG~d~L~~Rr~---~~~h 308 (367) T protein:vir:80 233 -QLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKE---WIVH 308 (367) T ss_pred -ccccceecceeEEEeCCCcccccCCCceEEEEEEecceeeecccCCccceecccchhhhcCCceEEEEeeee---EEee Confidence 356899999999999999952 23567889999999886554 45888988753 144444444 5778 Q ss_pred CceEEEEecCCC Q lcl|Aclame:pro 262 PTGVVVFNKTGS 273 (273) Q Consensus 262 p~~~v~~~~~~s 273 (273) |-|+--..++.+ T Consensus 309 P~G~s~~~~~v~ 320 (367) T protein:vir:80 309 PGGFNWLDADVT 320 (367) T ss_pred cceeeecccccc Confidence 887776554321 No 116 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=99.36 E-value=6.8e-13 Score=87.29 Aligned_cols=262 Identities=12% Similarity=0.067 Sum_probs=156.0 Q ss_pred Ccc--------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccce Q lcl|Aclame:pro 1 MAF--------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTG 72 (273) Q Consensus 1 MA~--------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~ 72 (273) ++. -.++|+.|...+++.++..+++..+..+... ...| .+++|+....+.+....+++..+..+++.+. T Consensus 130 ~~~~~~t~~~gg~~vP~~~~~~ii~~l~~~~~i~~~~~~~~~--~~~~-~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~~ 206 (435) T protein:vir:14 130 MSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLP--LSNG-NITIPRLKGGAIVGYIGADTDIPTTQQQFDD 206 (435) T ss_pred hhcccCCcCCCccccchhHHHHHHHHHhhhchhhhhcceeee--cCCC-ceEEEEEeCCcceeeeccCccccccccceeE Confidence 111 1368999999999999888777666322221 2223 5889998665556667777777666777677 Q ss_pred EEEEEEeeeeceeEechHHHHHhH--HHHHHH-HHHHHHHHHHHHHHHHHHHHHh--hccc----------ccccccCCH Q lcl|Aclame:pro 73 VDLLIDQEKSIDFLVDDIDRVQVA--GSLEAY-TRAGATALATDTDKFIADMLVD--NGTA----------LTGSAPSDA 137 (273) Q Consensus 73 ~~~tid~~~~~~~~i~d~d~~~~~--~~~~~~-~~~~~~ala~~iD~~~~~~~~~--~~~~----------~~~~~~~~~ 137 (273) +++...+. +.-+.|++.-..++. ..++++ .++.+++++.++|+.++.--.. .+.. .......+. T Consensus 207 i~~~~~k~-~~~~~iS~ell~ds~~~~~l~~~i~~~l~~ai~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~ 285 (435) T protein:vir:14 207 LKLTAKKM-AALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPSNVITASDASTL 285 (435) T ss_pred EEeeeEEE-EEeehhhHHHHHhhccCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccceeccccccch Confidence 76666443 334566665444442 246665 4568899999999988731100 0100 111122334 Q ss_pred hHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCC- Q lcl|Aclame:pro 138 DDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD- 216 (273) Q Consensus 138 ~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~- 216 (273) ..+..++.++...+.....-..+..++++|..+..|.+..+. .|. ..+....=|.++|.+|+.++.+|...+ T Consensus 286 ~~~~~~~~~l~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~------~G~-~l~~~~~~g~l~G~Pv~~~~~~p~~~~~ 358 (435) T protein:vir:14 286 QKIETDLGKVILALENADANLTQPGWIMAPRTFRFLEGLRDG------NGN-KVYPELANGMLKGYPVGKTTQVPINLGE 358 (435) T ss_pred hhHHHHHHHHHHHhhhccccccCCEEEEcHHHHHHHHHhhcc------CCc-eeccCCCCCeeecceeEeeccccccccC Confidence 445566777776666554322344679999999988653321 111 111111124689999999999886321 Q ss_pred ----cEEEEEeCceEEEEEec-ceeeeccCCC----------cc---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 217 ----EQFVAFHPSAAAYVSQI-DTVEALRDQD----------SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 217 ----~~~~~~~~~a~~~~~~~-~~ve~~~~~~----------~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ..++.+.-+.+.+..+. ..++..+... .| ...+++.+++|+++.+|+++++++..+= T Consensus 359 ~~~~~~i~~gd~s~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~ 433 (435) T protein:vir:14 359 TGKESEIYFTDFGDVFIGEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLAGVAW 433 (435) T ss_pred CCccceEEEeecccEEEEEecccEEEEeccccccccccchhhhhhcChhheeeeeeeCceeecccceEEEecCCC Confidence 23445554443333222 2333222211 11 2688999999999999999999987665 No 117 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=99.36 E-value=1.1e-12 Score=86.05 Aligned_cols=262 Identities=15% Similarity=0.073 Sum_probs=151.8 Q ss_pred Ccc-------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceE Q lcl|Aclame:pro 1 MAF-------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~-------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~ 73 (273) |+. -.++|+.+..++.+.+++..++..+-.+-. ....| .+++|+....+.+....++..++..+++.+.+ T Consensus 64 ~a~~~~~~~Gg~lvP~~~~~~ii~~l~~~s~l~~lg~~~v--~~~~g-~~~~p~~t~~~~a~wv~E~~~~~~s~~~f~~i 140 (366) T protein:vir:57 64 MAISTAAGSGGALIPQNMQNEVIELLRDRTVVRILGARSI--PLPNG-NLSMPRLSGGATAGYVGEGKDVVATGATFDDV 140 (366) T ss_pred hhccccccCCccccchhHHHHHHHHHhhhcchhhhceeee--ecCCC-ceEEEEEeCCcceeeeccCccccccccceeEE Confidence 332 145799999999999998887776622221 12234 58899986655566678888777777777777 Q ss_pred EEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHh--hcccc-------c---c--cccCCHh Q lcl|Aclame:pro 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVD--NGTAL-------T---G--SAPSDAD 138 (273) Q Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~--~~~~~-------~---~--~~~~~~~ 138 (273) ++...+. +.-+.|++.-..++..+++++ .++.++++++++|+.++.--.. .+... . . .+..+.. T Consensus 141 ~~~~~k~-~~~~~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~t~~~~~ 219 (366) T protein:vir:57 141 KLSAKTM-IALVPVSNQLIGRAGFNVEQLLLGDILSAIATREDKAFLRDDGTGDTPKGMKAVATAANRLVAWTGTAINLT 219 (366) T ss_pred EEeeEEE-EEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeeccccccceeeccccccchh Confidence 7766443 344567766555566677775 4678999999999977732100 01000 0 0 1111111 Q ss_pred HHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCC-- Q lcl|Aclame:pro 139 DAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD-- 216 (273) Q Consensus 139 ~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~-- 216 (273) .+...+..+.......+....+-..+++|..+..|.+..+ ..|. ..+....-|.+.|++|+.++.+|...+ T Consensus 220 ~~~~~~~~~~~~~~~~~~~~~~a~~vmn~~~~~~L~~lkd------~~G~-~l~~~~~~g~l~G~Pvv~s~~ip~~~~~~ 292 (366) T protein:vir:57 220 TIDEYLDSLILKHMDSNSNMIRCGWGLSNRTYMTLFGLRD------GNGN-KVYPEMSQGILKGYPIQRTSAIPANLGDD 292 (366) T ss_pred hHHHHHHHHHHhhhccccccccCEEEecHHHHHHHHhhhc------cCCc-eeccCCCCCeecceeeEEccccccccccC Confidence 2111122222222222211223356899999999865322 1111 111122235799999999999986321 Q ss_pred ---cEEEEEeCceEEEEEec-ceeeeccCC-----C-----cc---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 217 ---EQFVAFHPSAAAYVSQI-DTVEALRDQ-----D-----SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 217 ---~~~~~~~~~a~~~~~~~-~~ve~~~~~-----~-----~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ..++.+..+-+-+..+. ..++..++. . .| ...+++.++++..+.||+++++++..+= T Consensus 293 ~~~~~i~~gdfs~~~i~~~~~i~i~~~~ea~~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 293 GNESEIYFCDFNDVVIGEDGMMKVDFSTEATYKDADGQLVSAFARNQSLIRVVTEHDIGFRHPEGLVLGTGVIW 366 (366) T ss_pred CCccEEEEEecceEEEEEecceEEEEeeccccccccccchhhhhcCceeEEeeeeeCcEeeccccEEEEecccC Confidence 22444555444333222 233332221 1 11 2589999999999999999999998888 No 118 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=99.36 E-value=5.4e-13 Score=87.84 Aligned_cols=260 Identities=12% Similarity=0.015 Sum_probs=154.3 Q ss_pred Cc--c-----cchhHHHHHHHHH-HHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccce Q lcl|Aclame:pro 1 MA--F-----NNFIPELWSDMLL-EEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTG 72 (273) Q Consensus 1 MA--~-----~~~~pev~~~~v~-~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~ 72 (273) ++ . -.++|+-+...++ ..+.....+..++.+- ...| .+.+|+....+.+....++..+....++.+. T Consensus 249 ~~~~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~~~----~~~g-~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~ 323 (543) T protein:vir:81 249 RAMGLTKADGGYLVPFQLDPTVIITSNGSLNDIRRFARQV----VATG-DVWHGVSSAAVQWSWDAEFEEVSDDSPEFGQ 323 (543) T ss_pred hhcccccccCcccCchhhhhHHHHHHHhhhchhhhhcccc----cCCc-ceEEEEecCCcceeecccCccccccccccce Confidence 11 1 1456776666554 5556666666665431 2234 4567776666666778888888777777777 Q ss_pred EEEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHH----------HHhhc-ccccccccCCHhHH Q lcl|Aclame:pro 73 VDLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADM----------LVDNG-TALTGSAPSDADDA 140 (273) Q Consensus 73 ~~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~----------~~~~~-~~~~~~~~~~~~~~ 140 (273) ++++..+. +.-+.|+..-.... .++.++ .+.++.+++.++|..++.= +.... .....++....... T Consensus 324 i~~~~~k~-~~~~~is~ell~d~-~~~~~~i~~~l~~~~~~~~d~ail~G~Gt~~~p~Gi~~~~~~~~~~~~~~~~~~~~ 401 (543) T protein:vir:81 324 PEIPVKKA-QGFVPISIEALQDE-ANVTETVALLFAEGKDELEAVTLTTGTGQGNQPTGIVTALAGTAAEIAPVTAETFA 401 (543) T ss_pred eeeeeeee-EeeehhhHHHHhcc-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccchhhccccccccccccccccc Confidence 77777554 34456776433333 466554 5568899999999987631 10000 00111222333445 Q ss_pred HHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCC---- Q lcl|Aclame:pro 141 FDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD---- 216 (273) Q Consensus 141 ~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~---- 216 (273) ++++.++...+.....+ +-.++++|.++..|.+..+.-.+ ... ..+..|.-+.++|.+|+.++.+|.... T Consensus 402 ~~~~~~~~~~l~~~~~~--~~~~v~n~~~~~~l~~lkd~~G~--~l~--~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~ 475 (543) T protein:vir:81 402 LADVYAVYEQLAARHRR--QGAWLANNLIYNKIRQFDTQGGA--GLW--TTIGNGEPSQLLGRPVGEAEAMDANWNTSAS 475 (543) T ss_pred HHHHHHHHHhhhccccC--CcEEEEcHHHHHHHHHhhcCCCc--eec--cCcCCCCCccccceeeEEecccccccccccc Confidence 77888887777655432 23689999999999754321111 111 122345567899999999999886431 Q ss_pred ---cEEEEEeCceEEEEEec-ceeeecc------CCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 217 ---EQFVAFHPSAAAYVSQI-DTVEALR------DQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 217 ---~~~~~~~~~a~~~~~~~-~~ve~~~------~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ..++.++.+.+.+..+. ..++... +..+-...+++.+++|+.+++|++++.++-+.| T Consensus 476 ~~~~~i~~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~~~ 542 (543) T protein:vir:81 476 ADNFVLLYGNFQNYVIADRIGMTVEFIPHLFGTNRRPNGSRGWFAYYRMGADVVNPNAFRLLNVETA 542 (543) T ss_pred CCcceEEEeeccceeEEeecccEEEEeccccccchhhcCceEEEEEEeeccEeecccceEEEEeccc Confidence 22444554444433322 2232211 111123578889999999999999999977777 No 119 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=99.35 E-value=1.1e-12 Score=86.14 Aligned_cols=258 Identities=10% Similarity=-0.005 Sum_probs=159.5 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccc-cccccccCCCCccCC-cccccce Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVA-PTVKDYKAAGRQTSA-DAISDTG 72 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~-~~~~~~~ 72 (273) |+- ..++|+.|..++.+.+++...+.++++.-. ......+..+|+... .+.++...+++.+.. +.++.+. T Consensus 5 ~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~--~~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~~ 82 (293) T protein:vir:48 5 KTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVEN--VTTLTGSRVYEKWTDITGLANIDDEAGKIADIDDPKLSL 82 (293) T ss_pred ecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeee--ccCCcceEEEEeecCCCcceeeecCCcccccccccceeE Confidence 554 257799999999999999998888764321 111223566777654 344567777776643 3466777 Q ss_pred EEEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHHHH Q lcl|Aclame:pro 73 VDLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKEL 151 (273) Q Consensus 73 ~~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l 151 (273) +++...+. +.-+.|+++-..++..+++++ .++.+++++.+.|+.++.-..... .......+++|.++...+ T Consensus 83 i~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~-------~~~~~~~~d~i~~~~~~l 154 (293) T protein:vir:48 83 IKYTIKRY-AGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGVVDKLP-------TKPTLTKWDDIIDLEAKV 154 (293) T ss_pred EEEeeeEE-EEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhcccccc-------ccccccCHHHHHHHHHhh Confidence 77777554 344677776666666777664 566889999999998886543221 112223467788888777 Q ss_pred hhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecc--cccCCC-c-EEEEEe-Cce Q lcl|Aclame:pro 152 TKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNN--LRDTDD-E-QFVAFH-PSA 226 (273) Q Consensus 152 ~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~--l~~~~~-~-~~~~~~-~~a 226 (273) ..+..+ +-.++++|..+..|.+..+.-.+ .. ....+.+|.-++++|.+|+.+.. +|..+. . .++.+. +.+ T Consensus 155 ~~~~~~--~a~~vmn~~~~~~L~~lkd~~g~--~l-~~~~~~~~~~~~l~G~Pv~~~~~~~~~~~~~~~~~~~~gd~~~~ 229 (293) T protein:vir:48 155 DPAIKQ--TSFFLTNTSGFTALKKVKNALGD--YL-MERDVKSPTGYSIAGFAVKEISDRWLPNASSGVMPLYFGDLKQA 229 (293) T ss_pred hhhhcC--CCEEEEcHHHHHHHHHhhccCCc--eE-eecCcCCCCCceecceeeEEecccccCCccCCceEEEEEeccce Confidence 655432 33678999999998654321111 11 11123456667999999987544 333222 2 234443 334 Q ss_pred EEEEEec-ceeeeccCC-C---cceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 227 AAYVSQI-DTVEALRDQ-D---SFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 227 ~~~~~~~-~~ve~~~~~-~---~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) +....+. ..++..+.. + +-...+++.+++|+++.+|++++.++-+.. T Consensus 230 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~ 281 (293) T protein:vir:48 230 VTLFDRQQMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKAI 281 (293) T ss_pred EEEEEecceEEEEecccchhhhcCeEEEEEEEeeCcEEecccceEEEEeecc Confidence 4444332 344443322 1 224689999999999999999998864443 No 120 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=99.35 E-value=9.5e-13 Score=86.49 Aligned_cols=259 Identities=13% Similarity=0.039 Sum_probs=152.2 Q ss_pred Cc------ccchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEecccc-ccccccCCCCccC-Ccccccce Q lcl|Aclame:pro 1 MA------FNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAP-TVKDYKAAGRQTS-ADAISDTG 72 (273) Q Consensus 1 MA------~~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~-~~~~~~~~ 72 (273) |. -..++|+-|...+++.++...++.++++.- ..++.+.++|.+... +......+++... .+.++... T Consensus 111 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~ 186 (394) T protein:vir:10 111 AGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKT----PVTTPKGTYPILKRATDRFSSVAELAENPALAEPEFEQ 186 (394) T ss_pred hcccccccCceeccHHHHHHHHHHHHhhhhhhhhceee----eccCCceEEEEEecCCCcccccccccccccccccccee Confidence 11 125789999999999999998888877532 223556777766542 3334556655543 34567777 Q ss_pred EEEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHHHH Q lcl|Aclame:pro 73 VDLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKEL 151 (273) Q Consensus 73 ~~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l 151 (273) +++.+.+. +.-+.|++.-..++..++.++ .+.++++++.++|..++...... . .........+++|.++.... T Consensus 187 v~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~---~--~~~~~~~~~~d~l~~~~~~~ 260 (394) T protein:vir:10 187 VDWSVSTY-RGAIPLSEEAIADSAVDLTSLVGQSINEKSVNTYNAMIAPVLQSF---T--AKATTTDTLVDSLKHILNVD 260 (394) T ss_pred EEeeeeee-EeeehhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhccccc---c--cccccccccHHHHHHHHHhh Confidence 77777554 333567776566666677665 55688899999999887654221 1 11122223345565554322 Q ss_pred hhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccc-cceeeeeeeeeecceEEEEecc--cccCCCcE-EEEEe-Cce Q lcl|Aclame:pro 152 TKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGD-AAGLRAGTIGNLLGARIVESNN--LRDTDDEQ-FVAFH-PSA 226 (273) Q Consensus 152 ~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~-~~~~~~G~ig~i~G~~i~~s~~--l~~~~~~~-~~~~~-~~a 226 (273) -.... +-.++++|..+..|.+..+.-.+.-...+ ......|.-++++|.+|+.++. ++...+.. ++.+. ..+ T Consensus 261 ~~~~~---~a~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~L~G~PV~~~~~~~~~~~~~~~~i~~gd~s~~ 337 (394) T protein:vir:10 261 LDPAY---SRALVVTQSLFNTLDTLKDKNGRYLLHDASDSITDGTAKGTVLGVPVYVVGDALLGSAAGDQKAFVGDLKRG 337 (394) T ss_pred hhhhc---cCEEEecHHHHHHHHHhhccCCCeeeeccccccccCCcccccccceeEEecccccCCCCCceEEEEeecccc Confidence 22221 23689999999999754321111000000 0111123346899999988654 33333333 33333 233 Q ss_pred EEEEEe-cceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 227 AAYVSQ-IDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 227 ~~~~~~-~~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) +-...+ ...++.. +...+.+.+++-+++|+++++|++++.++-+.+ T Consensus 338 ~~~~~~~~~~v~~~-~~~~~~~~~~~~~r~d~~~~~~~ai~~~~~~~~ 384 (394) T protein:vir:10 338 VLFADRQQVTLAWE-DSKIYGRYLGAAFRFGVKQADSNAGYFVTNTDA 384 (394) T ss_pred EEEEeecceEEEEe-cccccceeEEEEEEeccEEeccccEEEEEeecc Confidence 333333 3344433 334566788999999999999999998865555 No 121 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=99.35 E-value=5.3e-13 Score=87.91 Aligned_cols=260 Identities=17% Similarity=0.125 Sum_probs=156.1 Q ss_pred Cc------ccchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcc-cccceE Q lcl|Aclame:pro 1 MA------FNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADA-ISDTGV 73 (273) Q Consensus 1 MA------~~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~-~~~~~~ 73 (273) ++ ...++|+.+..++++.++....+.+++..- ...|+ .++|+....+.+.+..+++.+...+ .+.+.+ T Consensus 138 ~~~~~~~~gg~~vP~~~~~~Ii~~l~~~~~i~~~~~~~----~~~g~-~~ip~~~~~~~a~~v~E~~~~~~~~~~~f~~i 212 (425) T protein:vir:95 138 RNLRAVAGGELTIPEVVVNRIMDIMGDYTTLYPLVDKI----RVKGT-TRILVDTDTSPATWIEQSGALPTGDVGTIASI 212 (425) T ss_pred HhhcccccCceeccHHHHHHHHHHHHhhhhHHHhhcee----ecCce-eEEEEecCCcccccccccccccccccccccee Confidence 11 124789999999999999988888876431 22354 5899988877777788887765444 345666 Q ss_pred EEEEEeeeeceeEechHHHHHhHHHHHHHH-HHHHHHHHHHHHHHHHHHHH---hhccc-------ccccccCCHhHHHH Q lcl|Aclame:pro 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAYT-RAGATALATDTDKFIADMLV---DNGTA-------LTGSAPSDADDAFD 142 (273) Q Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~~-~~~~~ala~~iD~~~~~~~~---~~~~~-------~~~~~~~~~~~~~~ 142 (273) ++...+. +.-+.|++.-..++..++.+++ ++.+++++.++|..++.--. ..+.+ ............++ T Consensus 213 ~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~G~G~~~~~p~Gil~~~~~~~~~~~~~~~~~~~ 291 (425) T protein:vir:95 213 DFDGFKV-GKVTFVDNYLLQDSIINLDDYVTKKIARAIAKALDLAIVKGTGAANKQPLGIIPSLPPENQVTVEADNNLLK 291 (425) T ss_pred eeeheee-eeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccccccccccccchHH Confidence 6655432 3445777765556666777755 56789999999998875210 00000 00111112234567 Q ss_pred HHHHHHHHHhhcCCCcCCcEEEECHHHHH-HHhcchHHhhhhhccccc-ceeeeeeeeeecceEEEEecccccCCCcEEE Q lcl|Aclame:pro 143 LIASALKELTKANVPNVGRVVVVNAEMAF-WLRSSGSKLTSADTSGDA-AGLRAGTIGNLLGARIVESNNLRDTDDEQFV 220 (273) Q Consensus 143 ~i~~a~~~l~~~~vp~~~r~lvv~p~~~~-~L~~~~~~~~~~~~~~~~-~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~ 220 (273) .+.++...+.....+..+-++++++..+. .|... ....+ ..|.. ...-.+..+.++|.+|+.++.+|... ++ T Consensus 292 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~l~~l-~~~kd--~~g~~i~~~~~~~~~~l~G~pvv~~~~~~~~~---i~ 365 (425) T protein:vir:95 292 NLVKQIGLIDTGDDSVGEIVAVMKRSTYYNRLVEF-SIQVD--SNGNVVGKLPNLRTPDLLGLRVVFNNFLDDDT---VL 365 (425) T ss_pred HHHHHHHhhhhhccccCceEEEEeChHHHHHHHHH-HhhcC--CCCceeeccCCCCCccccceeeEEcCcCCCcc---EE Confidence 78877766665544334445667776543 34221 11111 11110 00113556789999999999998653 33 Q ss_pred EEeCceEEEEEec-ceeeeccCCCcc---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 221 AFHPSAAAYVSQI-DTVEALRDQDSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 221 ~~~~~a~~~~~~~-~~ve~~~~~~~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) .+.-+-..+..+. ..++...+. +| ...+++.+++++++++|+++++++-+-+ T Consensus 366 ~Gd~~~~~~~~~~~~~i~~~~~~-~f~~~~~~~~~~~r~d~~~~~~~a~~~~~i~~~ 421 (425) T protein:vir:95 366 FGEFEQYTLVERENITIDSSTHV-KFTEDQTAFRGKGRFDGKPVKPEAFVLVTITDP 421 (425) T ss_pred EEecccEEEEeecceEEEeeccc-ccccCceEEEEEEeeCcEeecccceEEEEecCc Confidence 3333322222222 233333332 22 4689999999999999999999977776 No 122 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=99.34 E-value=1.4e-12 Score=85.64 Aligned_cols=258 Identities=11% Similarity=-0.003 Sum_probs=154.8 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEec-cccccccccCCCCccCC-cccccce Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGV-VAPTVKDYKAAGRQTSA-DAISDTG 72 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~-~~~~~~d~~~~~~~~~~-~~~~~~~ 72 (273) |+. -.++|+-|...+++.+++..++..++.+-.. ....|+. .++.. ...+.+....+++.+.. +.++.+. T Consensus 109 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~-~~~~~~~-~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~ 186 (397) T protein:vir:48 109 KTDASGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVENV-TTLTGSR-VYEKWADITGLAKLDDEAGSIGTNDDPKLYP 186 (397) T ss_pred hhccCCccccccccHHHHHHHHHHHHHHHHHHhhhceeec-cCCcceE-EEEeecCCCcceeeeccccccccccccceee Confidence 322 2568999999999999999888887754221 1222322 23332 22233455666665532 3456677 Q ss_pred EEEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHHHH Q lcl|Aclame:pro 73 VDLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKEL 151 (273) Q Consensus 73 ~~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l 151 (273) ++++..+. +.-+.|++.-..++..++.++ .++.+++++.++|..++.-.. + ..+.+....+++|.++...| T Consensus 187 v~~~~~k~-~~~~~iS~ell~ds~~~l~~~v~~~l~~~~~~~~d~~il~G~g---~----~~~~~~~~~~d~i~~~~~~l 258 (397) T protein:vir:48 187 IRYAIKRY-AGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIA---T----LPTKPTLTKWDDIIDLQAKV 258 (397) T ss_pred EEeeheee-eeehhhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccc---c----cccccccccHHHHHHHHHHh Confidence 77776543 344567776555666677664 566889999999998875321 1 12222334567788888888 Q ss_pred hhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecc--cccCC--CcEEEEEeCc-e Q lcl|Aclame:pro 152 TKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNN--LRDTD--DEQFVAFHPS-A 226 (273) Q Consensus 152 ~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~--l~~~~--~~~~~~~~~~-a 226 (273) .....+. -.++++|..+..|.+..+.-.+ .. ....+..|.-+.++|++|+.+.. ++..+ ...++.+.-+ + T Consensus 259 ~~~~~~~--a~~v~n~~~~~~L~~lkd~~G~--~i-~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~~gd~~~~ 333 (397) T protein:vir:48 259 DPAIKQT--SFFLTNTSGFTALKKVKNAFGD--YL-MERDVKSPTGYSIDGFAVKEVADRWLANASSGAMPLYFGDLKQA 333 (397) T ss_pred hhhhcCC--CEEEECHHHHHHHHHhhcCCCc--ee-eccCcCCCCCceeccceeEEecccccCCcCCCceEEEEEeccce Confidence 7766433 3678999999999754321111 11 11123456677999999987543 33322 2234444433 4 Q ss_pred EEEEEe-cceeeeccCCC----cceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 227 AAYVSQ-IDTVEALRDQD----SFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 227 ~~~~~~-~~~ve~~~~~~----~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) +....+ ...++..+... .-...+++.+++|..+++|++++.++-+++ T Consensus 334 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 385 (397) T protein:vir:48 334 VTLFDRQQMSLLSTNIGGGAFETDTTKIRVIDRFDVVATDTESFVPASFKAI 385 (397) T ss_pred EEEEeecceEEEEeccchhhhhcCceeEEEEeeeccEEecccceEEEEeccc Confidence 444433 23444433221 223688999999999999999998864444 No 123 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=99.34 E-value=9.9e-13 Score=86.39 Aligned_cols=256 Identities=13% Similarity=0.070 Sum_probs=151.2 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCC-----cccc Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSA-----DAIS 69 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~-----~~~~ 69 (273) ||. ..++|+.+...+++.+++.+++.+++.+ ....+.++++|+....+.+....++..... .+++ T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~----~~~~~~~~~~p~~~~~~~a~wv~E~~~~~~~~~~~s~~~ 76 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQN----VNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVT 76 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhcce----eeccCCcEEEEEEeCCcceEEeecccccccccccccccc Confidence 987 2567999999999999999988888753 222356789999877666666666654322 2344 Q ss_pred cceEEEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHh-----------hccc-----cccc Q lcl|Aclame:pro 70 DTGVDLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVD-----------NGTA-----LTGS 132 (273) Q Consensus 70 ~~~~~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~-----------~~~~-----~~~~ 132 (273) .+.+++...+. +.-+.|+++-..++..++.++ .++.++++++++|..++.=-.. .... .... T Consensus 77 f~~i~~~~~k~-~~~~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (305) T protein:vir:25 77 WANRTLVAEEI-AVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVG 155 (305) T ss_pred eeeEEeeeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhheeccCCCCCccccccccccccccccccccc Confidence 45555554332 344567776555566777665 4568899999999988731000 0000 0001 Q ss_pred ccCCHhHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccc Q lcl|Aclame:pro 133 APSDADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLR 212 (273) Q Consensus 133 ~~~~~~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~ 212 (273) ......+.++.+..+...+........ -++++|..+..|.+..+ ..| ...+.. +.++|.+++.++.+| T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~v~~~~~~~~l~~lkd------~~G-~~i~~~---~~l~G~Pv~~~~~~~ 223 (305) T protein:vir:25 156 GVANESDIVGATNRAAKAVASAGWAPD--TLLSSLALRYEVANIRD------ANG-NPVFRD---DSFAGFRTFFNRNGA 223 (305) T ss_pred cchhhhHHHHHHHHHHHhhhhcccccc--eeEecHHHHHHHHHhhc------cCC-ceeecC---CcccccceEEcCccC Confidence 111112334455555555544433222 27889999999864321 111 112222 468999999999887 Q ss_pred cCCCc-EEEEEeCceEEEEEec-ceeeeccCC-----C----cc---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 213 DTDDE-QFVAFHPSAAAYVSQI-DTVEALRDQ-----D----SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 213 ~~~~~-~~~~~~~~a~~~~~~~-~~ve~~~~~-----~----~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ..... .++.+..+.+.+..+. ..++..++. . .| ...+++..++|..+++|++++.+..+.. T Consensus 224 ~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~ 298 (305) T protein:vir:25 224 WDADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPV 298 (305) T ss_pred CCCCccEEEEEecceEEEEEecCeEEEEeeeeeeecCCceeeeeecCcEEEEEEEeecceeeCcccEEEEccccc Confidence 54332 3444554444333221 223322211 1 11 2467888999999999999999977644 No 124 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=99.33 E-value=6.4e-13 Score=87.43 Aligned_cols=253 Identities=13% Similarity=0.023 Sum_probs=149.0 Q ss_pred CcccchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEecccc-ccccccCCCCccC-CcccccceEEEEEE Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAP-TVKDYKAAGRQTS-ADAISDTGVDLLID 78 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~-~~~~~~~~~~~tid 78 (273) -....++|+-|...+++.++....+.+++..- ...+.++++|.+... +......+++... .++++.+.++++.. T Consensus 140 ~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~i~~~~~ 215 (400) T protein:vir:38 140 ADAASTIPETISNTPQRELQTVVDLKPFTNVF----QASTQKGTYPTVANATTKMVTVAELEKNPAMAKPEFKPVNWSVE 215 (400) T ss_pred cCCcccccHHHHHHHHHHHHhhhhhhhcceeE----eccCcceEEEEEecCCCccccccccccccccccccceeeEeehh Confidence 11135789999999999999888777766432 223446677776432 3334455555443 23456666666664 Q ss_pred eeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHHHHhhcCCC Q lcl|Aclame:pro 79 QEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTKANVP 157 (273) Q Consensus 79 ~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~~~~vp 157 (273) +. +.-+.|++.-..++..++.++ .+..+++++.++|..++...... .... ...+++|.++....-+ + T Consensus 216 k~-~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~~----~~~~----~~~~~~~~~~~~~~~~---~ 283 (400) T protein:vir:38 216 TY-RQALPVSQESIDDSAIDLVGLIAQNGQQIKVNTTNGAVATLLKGF----TAKT----ISSVDDLKHINNVDLD---P 283 (400) T ss_pred he-eeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhhhhccccc----cccc----cccHHHHHHHHHhhhh---h Confidence 43 344566665444555667664 45678888888888776543211 1111 1224445544332211 1 Q ss_pred cCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCC--cEEEEEeCc-eEEEEEe-c Q lcl|Aclame:pro 158 NVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD--EQFVAFHPS-AAAYVSQ-I 233 (273) Q Consensus 158 ~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~--~~~~~~~~~-a~~~~~~-~ 233 (273) ..+-.++++|..+..|.+..+.-.+ .. ....+..|.-+.++|++|+.++..|..+. ..++.+.-+ ++....+ . T Consensus 284 ~~~a~~v~~~~~~~~l~~lkd~~G~--~i-~~~~~~~~~~~~l~G~pv~~~~~~~~~~~g~~~~~~gd~s~~~~~~~~~~ 360 (400) T protein:vir:38 284 AYSRVIIASQSFYNFLDTVKDGNGR--YL-LQDSILTPSGKSVLGMPIAVVSDDTLGAAGEAHAFLGDIKRAILFANRAD 360 (400) T ss_pred hhCcEEEEcHHHHHHHHHhhccCCC--ee-eecCcCCCCccccccceeEEecccccCCCCceEEEEEeccccEEEEeecc Confidence 2234688999999999754321111 11 11123346667899999999998875443 334444433 3333333 3 Q ss_pred ceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 234 DTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 234 ~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ..++..+ ...+.+.+++.+++|+++++|++++.|+-+.. T Consensus 361 ~~~~~~~-~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~~ 399 (400) T protein:vir:38 361 FMVRWVD-DQIYGQFLQAGMRFGVSVADEKAGYFLTYTPK 399 (400) T ss_pred eEEEEec-ccccceeEEEEEEeccEEecccceEEEEeecC Confidence 3444333 34567889999999999999999888865555 No 125 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=99.30 E-value=3.2e-12 Score=83.61 Aligned_cols=258 Identities=10% Similarity=-0.009 Sum_probs=157.2 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEecccc-ccccccCCCCccCC-cccccce Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAP-TVKDYKAAGRQTSA-DAISDTG 72 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~-~~~~~~~ 72 (273) |+. ..++|+.|...+++.++...++.+++..-.. ....| ++.+|+.... +.+....++..+.. +.++.+. T Consensus 109 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~-~~~~~-~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~ 186 (397) T protein:vir:49 109 KTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVENV-TTLTG-SRVYEKWTDITGLANIDDEAGKIADVDDPKLSL 186 (397) T ss_pred hhccccccCcccccHhHHHHHHHHHHhhhhHHhhhceeec-ccCcc-ceEEEeeccCCcceeeecCccccccccccceee Confidence 332 2567999999999999999988888754221 11223 3456655443 34556777776543 4567777 Q ss_pred EEEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHHHH Q lcl|Aclame:pro 73 VDLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKEL 151 (273) Q Consensus 73 ~~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l 151 (273) +++++.+. +.-+.|++.-..++..++.++ .++.+++++..+|..++.-... .. +......+++|.++...+ T Consensus 187 i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ai~~G~g~---~~----~~~~~~~~d~i~~~~~~l 258 (397) T protein:vir:49 187 IKYTIKRY-AGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIAA---LP----TKPTLTKWDDIIDLEAKV 258 (397) T ss_pred EEeeeeeE-EeeehhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhccc---cc----cccccccHHHHHHHHHhh Confidence 77777543 445567776555556677664 5568899999999988764321 11 112223467788888888 Q ss_pred hhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEec--ccccCCC-c-EEEEEe-Cce Q lcl|Aclame:pro 152 TKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESN--NLRDTDD-E-QFVAFH-PSA 226 (273) Q Consensus 152 ~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~--~l~~~~~-~-~~~~~~-~~a 226 (273) ..+..+. -.++++|..+..|.+..+.-.+ .. ....+..|.-+.++|++|+.+. .+|.++. . .++.+. ..+ T Consensus 259 ~~~~~~~--a~~vmn~~~~~~l~~lkd~~G~--~l-~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~ 333 (397) T protein:vir:49 259 DPAIKQT--SFFLTNTSGFTALKKVKNALGD--YL-MERDVKSPTGYSIDGFAVKEVADRWLANGTGGAMPLYFGDLKQA 333 (397) T ss_pred hhhhcCC--CEEEEcHHHHHHHHHhhcCCCc--ee-eccCcCCCCCceecceeeEEecccccccccCCceeEEEeeccce Confidence 7766433 3688999999999754321111 11 1112334666789999998754 3444332 2 233343 334 Q ss_pred EEEEEe-cceeeeccCC-C---cceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 227 AAYVSQ-IDTVEALRDQ-D---SFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 227 ~~~~~~-~~~ve~~~~~-~---~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) +....+ ...++..+.. + +-...+++.+++|+++++|++++.++-++. T Consensus 334 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 385 (397) T protein:vir:49 334 VTLFDRQHMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKAI 385 (397) T ss_pred EEEEeecceEEEEeccccchhhcCceeEEEEeeeCcEEecccceEEEEeecc Confidence 444433 3344443322 1 224678999999999999999998864443 No 126 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=99.29 E-value=7.8e-14 Score=92.44 Aligned_cols=252 Identities=10% Similarity=-0.015 Sum_probs=134.1 Q ss_pred Ccc--cchhH------------HHHHHHHHHHHHHhh---ccchhhhccccccccCCcEEEEEeccccccccccCCCCcc Q lcl|Aclame:pro 1 MAF--NNFIP------------ELWSDMLLEEWTAQT---VFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQT 63 (273) Q Consensus 1 MA~--~~~~p------------ev~~~~v~~~l~~~~---v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~ 63 (273) |+. |...+ +.|+.-+.+ |.+.| .+.++. ++...++.++|.+.+.+.+...++|+.+ T Consensus 1 M~~e~nl~~~~dL~~a~siDF~~~f~~~i~~-L~~~LGv~r~~pla------~Gt~iktyK~~~~~y~gda~dVaEGe~I 73 (303) T protein:vir:10 1 MSAENNLINVEALGKAKSIDFANKLGVGLNK-LFEALAIQNKIPMN------VGSALKQYRFKVEDSEKPNGDVAEGDVI 73 (303) T ss_pred CCCCcCCcchhhcccceeehhhhhhhhhHHH-HHHHhhhhcccccc------CCceeeeeeeeceeeccccccccCCccc Confidence 664 22233 334433332 33333 222222 2333344456666677888889999999 Q ss_pred CCcccccc---eEEEEEEeeeeceeEechHHHH-HhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc--ccCC Q lcl|Aclame:pro 64 SADAISDT---GVDLLIDQEKSIDFLVDDIDRV-QVAGS-LEAYTRAGATALATDTDKFIADMLVDNGTALTGS--APSD 136 (273) Q Consensus 64 ~~~~~~~~---~~~~tid~~~~~~~~i~d~d~~-~~~~~-~~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~~~--~~~~ 136 (273) +.+.++.+ ..++++++++. .++|+..+ ...++ +.+.-+|+.++|++++|++++..+..+......+ +..+ T Consensus 74 plskvt~~~~~t~~~~~kK~rK---~tTdEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~lktaT~t~~~t~~t~~s 150 (303) T protein:vir:10 74 PLTKVTREQVDITELQFAKYRK---STSAEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFETLKSAIENGKRTNKTKLS 150 (303) T ss_pred chhhheeeecceEEEEeecccc---cccHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhhcccccccccceeec Confidence 99988864 67888877644 33666543 22333 4567789999999999999999987654332222 2233 Q ss_pred HhHHHHHHHHHHHHH---hhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEeccccc Q lcl|Aclame:pro 137 ADDAFDLIASALKEL---TKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRD 213 (273) Q Consensus 137 ~~~~~~~i~~a~~~l---~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~ 213 (273) ..++-.++......| ++.. ..-++|++|...+.++++.....+....| .+.+. +++|+.|++|+.+|. T Consensus 151 ~~glq~Al~~~~~kl~~~~ed~---~~~V~FvNP~Daa~yl~~A~i~~~~t~fG-~n~L~-----nfLG~~II~S~kv~~ 221 (303) T protein:vir:10 151 AENLQGALSKGRANLSVLLDDE---ITPIAFVNPNDTAEYLANGFINSTGAQFG-VNLLT-----PYVGVKIVEFADVPQ 221 (303) T ss_pred HHHHHHHHHhhhhhcccccccc---ccEEEEEchHHHHHHhhcCCcchhhhhhh-hhhhh-----hhhcceEEEeccCCC Confidence 333333333333332 3322 22489999999999998765443323333 34444 599999999999987 Q ss_pred CCCc------EEEEEeCceEEE-EEecc-e------eeeccCCCcceeeEEeeeeeeeEEE---cCceEEEEecCCC Q lcl|Aclame:pro 214 TDDE------QFVAFHPSAAAY-VSQID-T------VEALRDQDSFSDRIRALHVYGGKVV---RPTGVVVFNKTGS 273 (273) Q Consensus 214 ~~~~------~~~~~~~~a~~~-~~~~~-~------ve~~~~~~~~~~~v~~~~~~g~~vl---~p~~~v~~~~~~s 273 (273) +... -.+++.+-- |- ..-+. . +-...+.....=.+.. ....+-++ +++++|+.+-++. T Consensus 222 G~~~~T~~~Ni~~ay~~~~-g~l~~~f~~t~D~tglIGv~h~~~~~~~t~eT-~~~~~~~lfpE~~dgiv~~ti~~~ 296 (303) T protein:vir:10 222 GEVWMTVAENLNVAYANPR-GELSRAFAFATDATGFVGVLHDIQPQRLTSDT-IYASAISMFPENIDAVIKVTIKKD 296 (303) T ss_pred ceEEEeeccceEEEEecCc-hhhhhhhhhccccccceEEEeccccceeeehh-HhHhHHHhcccccceEEEEEEecc Confidence 6531 122222110 10 00000 0 1111222211111222 22223333 5558887655444 No 127 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=99.29 E-value=1.8e-13 Score=90.45 Aligned_cols=246 Identities=10% Similarity=0.068 Sum_probs=137.2 Q ss_pred CcccchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEE-EEEeccccccccccCCCCccCCcccccc---eEEEE Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVV-HIAGVVAPTVKDYKAAGRQTSADAISDT---GVDLL 76 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv-~ip~~~~~~~~d~~~~~~~~~~~~~~~~---~~~~t 76 (273) |+..+ -+.|+.-+.+ |.+.| +.+ ...+ ...|.+| ++|+|.+.+.+....+|+.++.+.++.+ ..+++ T Consensus 22 ~siDf--~~~f~~~i~~-L~~~L---Gv~--r~~p-la~GstIkt~k~~~y~gda~dVaEGe~Iplskvt~~~~~t~t~~ 92 (296) T protein:vir:98 22 ITIDV--TNKFQENISK-LLEML---GVT--RKIS-VSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSKVERKIHSEKKIE 92 (296) T ss_pred hhhhh--HHHHhhhHHH-HHHHh---hhc--cccc-ccCCCEEeeccceeeeeccccccCCcccchhhheeeecceEEEE Confidence 22221 2444444443 33332 111 1111 3349999 5577999999999999999999998865 57888 Q ss_pred EEeeeeceeEechHHHH-HhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHH----HHHHHHHHHH Q lcl|Aclame:pro 77 IDQEKSIDFLVDDIDRV-QVAGS-LEAYTRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDA----FDLIASALKE 150 (273) Q Consensus 77 id~~~~~~~~i~d~d~~-~~~~~-~~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~----~~~i~~a~~~ 150 (273) +++++.. ++|+..+ ...++ +.+..+|+.+++++++|++++..+..+... ...+..+. ...+.++... T Consensus 93 ikK~rK~---tTdEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~LktaT~t----~~~t~~~lQ~Ala~~~~~l~~~ 165 (296) T protein:vir:98 93 LKKYRKA---TTGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGT----QDALGAGLQGALASAWGKLQVL 165 (296) T ss_pred eeccccc---cCHHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHHhcccce----eeechhhHHHHHHHHhhhhhhh Confidence 8776433 4665532 33334 466788999999999999999998544221 11222222 2234455566 Q ss_pred HhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeee-eecceEEEEecccccCCCcE------EEEEe Q lcl|Aclame:pro 151 LTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIG-NLLGARIVESNNLRDTDDEQ------FVAFH 223 (273) Q Consensus 151 l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig-~i~G~~i~~s~~l~~~~~~~------~~~~~ 223 (273) |++.+ ....+++++|...+.++++.+ +...... -+... +++|..|++|+.+|.+.... .+++. T Consensus 166 feded--~~~~V~FVnP~D~a~ylg~a~-it~qt~f-------G~tyl~nfLG~~II~S~kV~~G~~~~T~~~Ni~~ay~ 235 (296) T protein:vir:98 166 FEDYG--SERAIVFANSLDVAEYIAKAG-ITTQTAF-------GLTYLVDFTGTVIISTNDVTKGEIWATVPENIIFAYI 235 (296) T ss_pred ccccC--CCceEEEEehHHHHHHhcCCc-cchhhee-------chhhhhhccccEEEEcCcCCCceEEEeeecceEEEee Confidence 66654 245689999999999998764 3221111 23333 48999999999999765321 22332 Q ss_pred CceEE-EEEecc----e---eeeccCCCcceeeEEeeeeeeeEEE---cCceEEEEecCCC Q lcl|Aclame:pro 224 PSAAA-YVSQID----T---VEALRDQDSFSDRIRALHVYGGKVV---RPTGVVVFNKTGS 273 (273) Q Consensus 224 ~~a~~-~~~~~~----~---ve~~~~~~~~~~~v~~~~~~g~~vl---~p~~~v~~~~~~s 273 (273) +--.+ ++.... + +-...+.....=.+.. ..+.+-++ +++++|+.+-+++ T Consensus 236 ~~~~~~l~~~f~~~~d~tglIGv~h~~~~~~~t~eT-~~~~~~~lfpE~~dgiv~~tI~~~ 295 (296) T protein:vir:98 236 NPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQT-LLVSGMLMYPERIDGIVKVTLTPG 295 (296) T ss_pred cccccchhhhhccccccccceEEEeccccceeeehh-HhHhHHHhcccccceEEEEEecCC Confidence 21101 011000 0 1111222211111222 22223333 5568888777666 No 128 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=99.29 E-value=4.5e-12 Score=82.78 Aligned_cols=258 Identities=11% Similarity=-0.002 Sum_probs=154.4 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccc-cccccccCCCCccCCc-ccccce Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVA-PTVKDYKAAGRQTSAD-AISDTG 72 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~~-~~~~~~ 72 (273) |+. ..++|+.|...+++.+++..++..++.+-... ...| ++.+|+... .+.+....+++.+... ..+.+. T Consensus 109 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~-~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~ 186 (397) T protein:vir:49 109 KTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVENVT-TLTG-SRVYEKWADITGLAKLDDEGGQIGQNDDPKLSL 186 (397) T ss_pred hhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcceeecc-CCcc-eEEEEeeccCCcceeeeccccccccccccceee Confidence 332 25679999999999999998888876542211 1112 455666543 3445667777665433 345666 Q ss_pred EEEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHHHH Q lcl|Aclame:pro 73 VDLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKEL 151 (273) Q Consensus 73 ~~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l 151 (273) +++.+.+. +.-+.|+..-..++..++.++ .+..+++++.++|..++.-. ++.. +......+++|.++...+ T Consensus 187 v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ail~G~---g~~~----~~~~~~~~d~i~~~~~~l 258 (397) T protein:vir:49 187 IRYAIKRY-AGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAI---GTLP----NKPTLAKWDDIIDLQAKV 258 (397) T ss_pred eEeeeeee-EeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcc---cccc----ccccccCHHHHHHHHHhh Confidence 66666543 334566665555556677664 56688999999999876432 1111 122223467788888888 Q ss_pred hhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEec--ccccCCC--cEEEEEe-Cce Q lcl|Aclame:pro 152 TKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESN--NLRDTDD--EQFVAFH-PSA 226 (273) Q Consensus 152 ~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~--~l~~~~~--~~~~~~~-~~a 226 (273) ..+..+. -.++++|..+..|.+..+.-.+ .. ....+..|.-+.++|++|+.+. .+|...+ ..++.+. +.+ T Consensus 259 ~~~~~~~--a~~v~n~~~~~~l~~lkd~~g~--~l-~~~~~~~g~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~ 333 (397) T protein:vir:49 259 DPAIKQT--SLFLTNTSGFTALKKVKNAMGD--YL-MERDVKSPTGYSIDGFVVKEISDRFLPNGTGGAMPLYFGDLKQA 333 (397) T ss_pred hhhhcCC--CEEEEcHHHHHHHHHhhccCCc--ee-ecccccCCCCceecceeeEEecccccccccCCceeEEEeeccce Confidence 7776543 3689999999999654321111 10 0112334556789999998754 3444332 2233333 334 Q ss_pred EEEEEe-cceeeeccCCC----cceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 227 AAYVSQ-IDTVEALRDQD----SFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 227 ~~~~~~-~~~ve~~~~~~----~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) +-...+ ...++..+... +-...+++.+++|+.+++|++++.++-++. T Consensus 334 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~~ 385 (397) T protein:vir:49 334 VTLFDRQHLSLLSTNIGGGAFETDTTKVRVIDRFDVVSTDTEAFVPASFKAI 385 (397) T ss_pred EEEEeecccEEEEeccccchhhcCeeeEEEEEeeccEEecccceEEEEeccc Confidence 444433 33444433222 224578999999999999999998853333 No 129 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=99.28 E-value=4.7e-12 Score=82.68 Aligned_cols=258 Identities=12% Similarity=0.023 Sum_probs=149.3 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEecccc-ccccccCCCCccC-Ccccccce Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAP-TVKDYKAAGRQTS-ADAISDTG 72 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~-~~~~~~~~ 72 (273) |.. -.+.|+.+...+++.+++..++..++..-. .....-++.+++.... +.+....+++... .+.++.+. T Consensus 116 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~--~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~ 193 (404) T protein:vir:39 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVES--VSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPRLTI 193 (404) T ss_pred hhcccccCCceeccHHHHHHHHHHHHhhhhHHhhcceee--ccCCcceEEEEeecCCccceeeecCccccccccccceee Confidence 221 246799999999999999988888774321 1111123344444332 3344566666554 34567777 Q ss_pred EEEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHH-H Q lcl|Aclame:pro 73 VDLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALK-E 150 (273) Q Consensus 73 ~~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~-~ 150 (273) +++++.+. +.-+.|++.-..++..++.++ .++.+++++.++|..++.-. +.....+.. ..++++.++.. . T Consensus 194 i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~g~---g~~~~~~~~----~~~~~i~~~~~~~ 265 (404) T protein:vir:39 194 IKYLIKRY-AGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAM---GTVPKKPTI----AKFDDVITMINTS 265 (404) T ss_pred EEeeeeeE-EeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhcc---ccccccccc----ccHHHHHHHHHHh Confidence 77777554 344577776555566667664 56789999999999877532 121111122 22455555543 3 Q ss_pred HhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecc--cccCCC--cEEEEEeCc- Q lcl|Aclame:pro 151 LTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNN--LRDTDD--EQFVAFHPS- 225 (273) Q Consensus 151 l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~--l~~~~~--~~~~~~~~~- 225 (273) +..... .+-.++++|..+..|.+..+.-.+ ..- ...+..|.-++++|++|+.+.. +|..+. ..++.+..+ T Consensus 266 ~~~~~~--~~a~~v~n~~~~~~L~~lkd~~G~--~l~-~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~ 340 (404) T protein:vir:39 266 VDPAII--ATSSLLTNQSGLNKLALVKTAEGK--YLL-EPDPTKPNSYLIKGKKVIVVADRWLPNSGSTVYPLYYGDMSQ 340 (404) T ss_pred hhhhhc--cCCEEEEcHHHHHHHHHhhccCCc--eee-ccCcCCCCcceecceeEEEecccccCccCCCccEEEEEeccc Confidence 333222 223689999999999754221111 110 1122345557899999998654 443322 234555433 Q ss_pred eEEEEE-ecceeeeccCCC----cceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 226 AAAYVS-QIDTVEALRDQD----SFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 226 a~~~~~-~~~~ve~~~~~~----~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ++-... +...++..+... +-...+++.+++|+.+++|++++.++-+.+ T Consensus 341 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 393 (404) T protein:vir:39 341 AITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKTTDSEALVAGSFTAI 393 (404) T ss_pred cEEEEeecceEEEEeccchhhhhhceeeEEEEeeeccEEecccceEEEEeecc Confidence 344343 233444433321 224678999999999999999999974443 No 130 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=99.28 E-value=5.8e-12 Score=82.18 Aligned_cols=257 Identities=14% Similarity=0.041 Sum_probs=152.5 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccC-CcccccceE Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTS-ADAISDTGV 73 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~-~~~~~~~~~ 73 (273) |+. ..++|+-+...+++.+++.+++..++..... ....-++.+|+....+.+....++.... ...++.+.+ T Consensus 91 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~--~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i 168 (371) T protein:vir:81 91 MSEGSNQDGGYTVPQDIQTRINELRESKDALQNLITVEPV--TTLSGSRVFKKRSQQTGFVEVAEGAAIGEKATPQFTLL 168 (371) T ss_pred hccCCCccCceeecHhHHHHHHHHHHhhhhhhhhceeeec--cCCceeEEEEeecCCcceeeeccccccccccccceeeE Confidence 332 2568999999999999999988887753221 1112345566655545555677776653 345666777 Q ss_pred EEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHH-HHH Q lcl|Aclame:pro 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASAL-KEL 151 (273) Q Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~-~~l 151 (273) +++..+. +.-+.|++.-..++..++.++ .++.+++++.++|..++.-.... .... ...++++..+. ..| T Consensus 169 ~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~a~~~~~~~~i~~g~g~~----~~~~----~~~~~~i~~~~~~~l 239 (371) T protein:vir:81 169 QYQVKKY-AGFFRVTNELLNDSTEAIVNTLVRWIGDESRVTRNGLIINVLNTK----AKTA----IADLDGLKQIINVQL 239 (371) T ss_pred EeeeeEE-EEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc----cccc----cccHHHHHHHHHhhc Confidence 7776554 334567776555555677665 45678899999998776643211 1111 11234444433 233 Q ss_pred hhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCC---------cEEEEE Q lcl|Aclame:pro 152 TKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD---------EQFVAF 222 (273) Q Consensus 152 ~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~---------~~~~~~ 222 (273) ..... .+-..+++|..+..|.+..+.-.+ .. ....+..|.-+.++|.+|+.++.+|.+.. ..++.+ T Consensus 240 ~~~~~--~~a~~vmn~~~~~~L~~lkd~~g~--~l-~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~~~i~~G 314 (371) T protein:vir:81 240 DPVFR--STSSVIVNQDAFNWLDTLKDQNGQ--YL-LQPSISSPTGRQLLGLPVVIVSNKVLANRVDGGTGAQFAPIIVG 314 (371) T ss_pred chhhh--cCCEEEEcHHHHHHHHHhhccCCC--ee-eecccCCCCCceecceeEEEecccccCccccccccCCcceEEEE Confidence 33221 233689999999999754321111 11 11123346668999999999998875431 123333 Q ss_pred e-CceEEEEEe-cceeeeccCCC-cc---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 223 H-PSAAAYVSQ-IDTVEALRDQD-SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 223 ~-~~a~~~~~~-~~~ve~~~~~~-~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) . +.++....+ ...++..+... .| ...+++.+++|.++++|+++++++-+.| T Consensus 315 d~~~~~~~~~~~~~~i~~~~~~~~~f~~~~v~~~~~~r~d~~~~~~~a~~~~~~~~A 371 (371) T protein:vir:81 315 DLKEAVVMFDRQRTEIMSSNVAMDAFETDATLWRAIERMDVKMRDDEAFVFGEVQLA 371 (371) T ss_pred ehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEEecC Confidence 3 222322222 22333333221 22 3688999999999999999999998888 No 131 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=99.27 E-value=5.5e-12 Score=82.29 Aligned_cols=258 Identities=11% Similarity=0.008 Sum_probs=150.0 Q ss_pred Ccc-cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEecccc-ccccccCCCCccCC-cccccceEEEEE Q lcl|Aclame:pro 1 MAF-NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAP-TVKDYKAAGRQTSA-DAISDTGVDLLI 77 (273) Q Consensus 1 MA~-~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~-~~~~~~~~~~ti 77 (273) .+. ..++|+.|..++++.+++...+..++..-.. .....++.+|+.... +.+....+++...- +.++.+.+++.. T Consensus 121 ~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~--~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~ 198 (408) T protein:vir:10 121 DSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESV--STSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLI 198 (408) T ss_pred ccCCceeccHhHHHHHHHHHHhhchhhhhcceeec--cCCcceEEEeeccccccceeeecCccccccccCcceeeEEeee Confidence 221 2567999999999999999888887753221 111123445544332 33455666665542 345667777766 Q ss_pred EeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHH-HHHhhcC Q lcl|Aclame:pro 78 DQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASAL-KELTKAN 155 (273) Q Consensus 78 d~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~-~~l~~~~ 155 (273) .+. +.-+.|+..-..++..++..+ .+..+++++.++|..++.-.... ....+ ...+++|.++. ..++... T Consensus 199 ~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~~---~~~~~----~~~~~~l~~~~~~~~~~~~ 270 (408) T protein:vir:10 199 KRY-AGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAA---PKKPT----IAKFDDVITMINTAVDPAI 270 (408) T ss_pred eeE-EeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccc---ccccc----cccHHHHHHHHHHhhhhhh Confidence 544 344567766555566677665 45678999999999877543221 11111 22345565544 3343322 Q ss_pred CCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEec--ccccCCCc--EEEEEeCc-eEEEE Q lcl|Aclame:pro 156 VPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESN--NLRDTDDE--QFVAFHPS-AAAYV 230 (273) Q Consensus 156 vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~--~l~~~~~~--~~~~~~~~-a~~~~ 230 (273) . .+-.++++|..+..|.+..+.-.+ .. ....+.+|.-+.++|++|+.++ .+|..++. .++.+.-+ ++... T Consensus 271 ~--~~a~~v~n~~~~~~l~~lkd~~G~--~i-~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~ 345 (408) T protein:vir:10 271 I--ATSSLLTNQSGLNKLALVKTAEGK--YL-LEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLF 345 (408) T ss_pred c--cCCEEEEcHHHHHHHHHhhccCCc--eE-eccCcCCCCCceecceeeEEecccccCccCCCceEEEEEehhccEEEE Confidence 1 233688999999999764321111 11 1112335666789999999865 45543332 23444433 34444 Q ss_pred Ee-cceeeeccCC-C---cceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 231 SQ-IDTVEALRDQ-D---SFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 231 ~~-~~~ve~~~~~-~---~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) .+ ...++..+.. . +-...+++.+++|+++++|++++.++-+.. T Consensus 346 ~~~~~~v~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~~~~ 393 (408) T protein:vir:10 346 DRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAI 393 (408) T ss_pred EecceEEEEcccccchhhcCceEEEEEEeeccEEeccccEEEEEeecc Confidence 33 3344433322 1 124689999999999999999999874443 No 132 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=99.26 E-value=3.7e-12 Score=83.26 Aligned_cols=255 Identities=8% Similarity=0.007 Sum_probs=156.7 Q ss_pred Ccc----cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEecccccccc--ccCCCCccCCcccccceEE Q lcl|Aclame:pro 1 MAF----NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKD--YKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~----~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d--~~~~~~~~~~~~~~~~~~~ 74 (273) +.. ..++|+.|...+++.++....+.+++..- ...+.++++|.+....... ...++..+...+++.+.++ T Consensus 116 ~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~l~~~~----~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~s~~~f~~i~ 191 (421) T protein:vir:13 116 IMSSTNNGAVIPQEFVNEFEKLKEGYPSLKEHCHVI----PVNRNAGKMPVRAGASVDKLANLAKDTELVKAMLKTQPMA 191 (421) T ss_pred ccccCCcceecchhhHHHHHHHHHhhhhhhhhceee----eccCCceEEEEeecCCccceeeccccccccccccceeEEE Confidence 111 25679999999999999888887776432 2234567777765543322 2456666666667777777 Q ss_pred EEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHHHHhh Q lcl|Aclame:pro 75 LLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTK 153 (273) Q Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~~ 153 (273) +.+.+. +.-+.|++.-..++..++.++ .++.+++++..+|..+++.+..... .+ ....+++|.++...+.. T Consensus 192 ~~~~k~-~~~v~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~~~~g~~~---~~----~~~~~d~i~~~~~~l~~ 263 (421) T protein:vir:13 192 YDIDDY-GLLAPIDNSLLEDSEINFLEFVNEEFAEFAVNTENAEIVKQAKAVLA---EE----TINDYAGLVKTINSLVP 263 (421) T ss_pred eeeeee-EeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHhhhhHhhhhhhccc---cc----cccchHHHHHHHHHhhh Confidence 777554 334566766555556667665 4567888999999888876532211 11 12236678888777776 Q ss_pred cCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCC--cEEEEEeCc-eEEEE Q lcl|Aclame:pro 154 ANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD--EQFVAFHPS-AAAYV 230 (273) Q Consensus 154 ~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~--~~~~~~~~~-a~~~~ 230 (273) +..+. -.++++|..+..|.+..+.-.+ ..- . ....|.-+.+.|.+|+.++.+|.++. ..++.+.-+ ++-+. T Consensus 264 ~~~~~--a~~v~n~~~~~~l~~lkd~~G~--~i~-~-~~~~~~~~tl~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~ 337 (421) T protein:vir:13 264 NARKR--AIIVTNSDGRAYLDGLMDKQGR--PLL-K-ELSDGGDLVFKGRPVIELEESIFDVGDETKFIVSDFKTLIKFM 337 (421) T ss_pred hhcCC--CEEEEcHHHHHHHHHhhcCCCc--eee-c-CcCCCCCceecceeeEEeccccccCCCceEEEEEeccccEEEE Confidence 66543 2578999999999653321111 111 1 12245567899999999998875543 234455433 33333 Q ss_pred E-ecceeeeccCCCcc--eeeEEeeeeeeeEEEcCceEEEEecCC--C Q lcl|Aclame:pro 231 S-QIDTVEALRDQDSF--SDRIRALHVYGGKVVRPTGVVVFNKTG--S 273 (273) Q Consensus 231 ~-~~~~ve~~~~~~~~--~~~v~~~~~~g~~vl~p~~~v~~~~~~--s 273 (273) . +...++..+..... -..+++.+++|..+.+|+++..+.... . T Consensus 338 ~~~~~~v~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~a 385 (421) T protein:vir:13 338 DRKQYLIDQSKEAGYTKNETIARIIERFDVNSPLDKSSDAEKIRKFGV 385 (421) T ss_pred EecceEEEeecccccccCeeEEEEEeeecceeecchhhheeeecccce Confidence 3 33455554444322 258899999999999999865443221 1 No 133 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=99.26 E-value=4.1e-12 Score=83.02 Aligned_cols=252 Identities=11% Similarity=-0.051 Sum_probs=147.2 Q ss_pred Ccc-cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEecccc-ccccccCCCCccC-CcccccceEEEEE Q lcl|Aclame:pro 1 MAF-NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAP-TVKDYKAAGRQTS-ADAISDTGVDLLI 77 (273) Q Consensus 1 MA~-~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~-~~~~~~~~~~~ti 77 (273) -++ ..++|+-|...+.+.++...++.+++..- ...+.+.++|.+... +......++.... .+.++.+.+++.. T Consensus 133 ~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~l~~ 208 (394) T protein:vir:97 133 KENAKPVSSEEILYTPAREVKTVVDLKPFTTVY----QAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNI 208 (394) T ss_pred cccccccChHHHHHHHHHHhhhhhhhhhhceee----eccCcceEEEEEecCCCccceecccccccccccccceeEEeeh Confidence 111 25689999999999998888887776532 122345677776432 2233455665543 2345666666666 Q ss_pred EeeeeceeEechHHHHHhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHHHHhhcCC Q lcl|Aclame:pro 78 DQEKSIDFLVDDIDRVQVAGSLEAYT-RAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTKANV 156 (273) Q Consensus 78 d~~~~~~~~i~d~d~~~~~~~~~~~~-~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~~~~v 156 (273) .+. +.-+.|+..-..++..++.+++ ++.+++++..+|..++.-.... ...+. ..+++|.++....-+. T Consensus 209 ~k~-~~~i~is~ell~ds~~~~~~~i~~~la~~~~~~~~~~i~~g~~~~----~~~~~----~~~~~~~~~~~~~~~~-- 277 (394) T protein:vir:97 209 DTY-RGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSF----TTKTV----KNLDEIKALLNGGFDP-- 277 (394) T ss_pred hhe-eeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhccccc----ccccc----ccHHHHHHHHHhhhhh-- Confidence 443 3445666655555566677654 5678889999988777543211 11111 1244455444322111 Q ss_pred CcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEe-CceEEEEEe-cc Q lcl|Aclame:pro 157 PNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFH-PSAAAYVSQ-ID 234 (273) Q Consensus 157 p~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~-~~a~~~~~~-~~ 234 (273) ..+-.++++|..+..|.+..+.-.+ .. ....+..|.-+.++|++|+.++....+... ++.+. ..++....+ .. T Consensus 278 -~~~a~~v~n~~~~~~l~~lkd~~G~--~i-~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~-~~~gd~~~~~~~~~~~~~ 352 (394) T protein:vir:97 278 -AYNVSLIVSQSFYQTLDTLKDGNGR--YL-LQDDITAVSGKVLLGKPVFVLSDEVLGANK-AFIGDFKRGVLFADRKDL 352 (394) T ss_pred -hhCCEEEEcHHHHHHHHHhhccCCC--ee-eecCcCCCCCceeccceeEEecccccCCcc-EEEeeccccEEEEEecce Confidence 1233578999999998654321111 11 111233455578999999987765444432 44444 223333333 33 Q ss_pred eeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 235 TVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 235 ~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) .++... ...+.+.+++-+++|+++.+|++++.++-+.. T Consensus 353 ~~~~~~-~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 390 (394) T protein:vir:97 353 GLRWAD-NEIYGQYLQAVLRFGVSKVDDKAGYYVTFTPE 390 (394) T ss_pred EEEEec-ccccceeEEEEEEEccEEecccceEEEEeccc Confidence 444333 34456788999999999999999998877766 No 134 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=99.26 E-value=1.4e-12 Score=85.57 Aligned_cols=257 Identities=11% Similarity=0.041 Sum_probs=155.6 Q ss_pred Ccc------cchhH-HHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceE Q lcl|Aclame:pro 1 MAF------NNFIP-ELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~------~~~~p-ev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~ 73 (273) |.. -.++| +++.+.+++.++..+++..+-.+-. .+..| .++||+....+.+.+..+++.+...+++.+.+ T Consensus 357 ~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l~~~~~--~~~~g-~~~ip~~~~~~~a~wv~E~~~~~~s~~~f~~i 433 (632) T protein:vir:96 357 LEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARML--PGLVG-DVDIPKKTSGANFYWIGEDEDVQDSDFDFTTL 433 (632) T ss_pred hhcccccccccccccccchHHHHHHHhhcchhhhhcceEe--ecCCc-ceEEEEEeCCceeEeecCCccccccccceeeE Confidence 111 12444 6668889999988877766622211 12223 58899886655556678888777777777777 Q ss_pred EEEEEeeeeceeEechHHHHHhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHh--hccccc------ccccCCHhHHHHHH Q lcl|Aclame:pro 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAYT-RAGATALATDTDKFIADMLVD--NGTALT------GSAPSDADDAFDLI 144 (273) Q Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~~-~~~~~ala~~iD~~~~~~~~~--~~~~~~------~~~~~~~~~~~~~i 144 (273) ++...+. +.-+.|+..-..++..++++++ +.++.+++.++|..++.--.. .+.... ..+.......++.+ T Consensus 434 ~l~~~k~-~~~v~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~i 512 (632) T protein:vir:96 434 SFSPKTI-AGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASV 512 (632) T ss_pred EeeeeEE-EEehhhHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccceeeecccccceecccccCCHHHH Confidence 7776543 3334556544445556676655 568999999999988742110 011110 00111222346778 Q ss_pred HHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeC Q lcl|Aclame:pro 145 ASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHP 224 (273) Q Consensus 145 ~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~ 224 (273) .++...+...++...+-..+++|..+..|.... +. +..| ..+.++ +.+.|.+++.++.+|... ++.+.- T Consensus 513 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~l~~~~--l~--d~~G--~~i~~~--~~l~G~pv~~s~~ip~~~---~~~gd~ 581 (632) T protein:vir:96 513 VDMETKISTFNADAGRLAYLTSVTQRGAAKKAQ--VF--DNTG--ERIWQN--NEVNGYRAEASNQIPADT---WIFGDW 581 (632) T ss_pred HHHHHHHhhcccccCccEEEEchhHHHHHHHHh--cc--CCCC--ceeecC--CeecccceEeccccccCc---EEEeec Confidence 888888888776555556788999888776432 11 1222 222221 468999999999998654 333333 Q ss_pred ceEEEEEe-ccee--eeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCC Q lcl|Aclame:pro 225 SAAAYVSQ-IDTV--EALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTG 272 (273) Q Consensus 225 ~a~~~~~~-~~~v--e~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~ 272 (273) +-+-+... ...+ ..+.....=...+++.++++.++.+|+++++++..+ T Consensus 582 s~~~i~~~~~~~i~~~~~~~~~~~~v~~~~~~~~d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 582 SQIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) T ss_pred ceEEEEEecceEEEEccccccccCceEEEEEeecCceeechhhhhheeecC Confidence 32222211 1122 222222223468999999999999999999999888 No 135 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=99.26 E-value=1.1e-11 Score=80.70 Aligned_cols=262 Identities=13% Similarity=0.076 Sum_probs=150.8 Q ss_pred Ccc-------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceE Q lcl|Aclame:pro 1 MAF-------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~-------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~ 73 (273) +++ -.++|+-|...+.+.+++.+++..+..+-. ....| .+.+|+....+.+....+++..+..+++.+.+ T Consensus 125 ~~~~~~~~~gg~liP~~~~~~ii~~l~~~~~l~~~~~~~~--~~~~g-~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~i 201 (428) T protein:vir:10 125 MAISTAAGSGGVLIPQNIHSEVIELLRDRTIVRKLGARSI--PLPNG-NMSLPRLAGGATASYTGENQDAKVSEARFDDV 201 (428) T ss_pred hhhcccccCCccccchhHHHHHHHHHhhhchhhhhcceee--ecCCc-ceEEEEEeCCcceeeeccCccccccccceeeE Confidence 221 246799999999999998888777633221 12223 47899886655566678888877777777777 Q ss_pred EEEEEeeeeceeEechHHHHHhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHh--hcccc-------c--ccccCCHhHHH Q lcl|Aclame:pro 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAYT-RAGATALATDTDKFIADMLVD--NGTAL-------T--GSAPSDADDAF 141 (273) Q Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~~-~~~~~ala~~iD~~~~~~~~~--~~~~~-------~--~~~~~~~~~~~ 141 (273) ++...+. +.-+.|++.-..++..++.+++ +..+++++.++|..++.--.. .+... . ..........+ T Consensus 202 ~~~~~k~-~~~v~is~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~ 280 (428) T protein:vir:10 202 KLTAKTM-IAMVPISNALIGRAGFNVEQLVLQDILTAISVREDKAFMRDDGTGDTPIGMKARATQWNRLLPWAADAAVNL 280 (428) T ss_pred EeeeEEE-EEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccccccccH Confidence 7777544 3446777765555666777754 568999999999987631000 00000 0 00001111112 Q ss_pred HHH---HHHHHHHhh-cCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCC- Q lcl|Aclame:pro 142 DLI---ASALKELTK-ANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD- 216 (273) Q Consensus 142 ~~i---~~a~~~l~~-~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~- 216 (273) +.+ .++...+.. ......+-..+++|..+..|.+..+. .|. ..+....-|.+.|.+|+.++.+|...+ T Consensus 281 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~------~G~-~i~~~~~~g~l~G~pv~~~~~~p~~~~~ 353 (428) T protein:vir:10 281 DTIDTYLDSIILMSMDGNSNMISSGWGMSNRTYMKLFGLRDG------NGN-KVYPEMAQGMLKGYPIQRTSAIPANLGE 353 (428) T ss_pred HHHHHHHHHHHHhhhccccccccCEEEEcHHHHHHHHHhhcc------CCc-eeccCCCCCeeeceeeEEeccccccccC Confidence 222 222211111 11111223568899999988654321 111 111111224799999999999885422 Q ss_pred ----cEEEEEeCceEEEEEec-ceeeeccCCC----------cc---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 217 ----EQFVAFHPSAAAYVSQI-DTVEALRDQD----------SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 217 ----~~~~~~~~~a~~~~~~~-~~ve~~~~~~----------~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ..++.+..+.+-+..+. ..++..+... .| ...+++.+++|+.+.+|+++++++...= T Consensus 354 ~~~~~~i~~gd~s~~~i~~~~~i~i~~~~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 354 GGKESEIYFADFNDVVIGEDGNMKVDFSKEASYIDTDGKLVSAFSRNQSLIRVVTEHDIGFRHPEGLVLGTGVLF 428 (428) T ss_pred CCccceEEEEecceEEEEEecceEEEeecccccccccccccchhhcchhheeeeeeeCceeeccceEEEEeccCC Confidence 23445555444333222 2333322211 11 2578999999999999999999988877 No 136 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=99.25 E-value=2e-12 Score=84.68 Aligned_cols=264 Identities=12% Similarity=0.054 Sum_probs=148.4 Q ss_pred Ccc-------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccc---cCCCCccCCccccc Q lcl|Aclame:pro 1 MAF-------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDY---KAAGRQTSADAISD 70 (273) Q Consensus 1 MA~-------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~---~~~~~~~~~~~~~~ 70 (273) +|. -+++|+.|...+++.++...++..++++-. ..| .+.+|.+...+.+.. ..++...+..+++. T Consensus 141 ~a~~~~t~~GG~lvP~~~~~~Ii~~l~~~~~i~~~~~~~~----~~~-~~~~p~~~~~~~a~~~~~~~e~~~~~~~~~~f 215 (434) T protein:vir:62 141 RALGLVTGNGSVTIPDFLSKEIITYAQEENFLRRLGTGVK----TKE-NIKYPVLVKKAEAQGHKNERTNNEMPETDIEF 215 (434) T ss_pred hhhcccccccceecchhhHHHHHHhhhhhhhhhhhcceec----cCC-ceEEEEEecCCcccceecccccccccccccce Confidence 221 246899999999999999988888875421 223 477887643332221 23333444445556 Q ss_pred ceEEEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHH-Hhhc-----ccccccccCCHhHHHHH Q lcl|Aclame:pro 71 TGVDLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADML-VDNG-----TALTGSAPSDADDAFDL 143 (273) Q Consensus 71 ~~~~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~-~~~~-----~~~~~~~~~~~~~~~~~ 143 (273) ..+++...+. +.-+.|++.-..++..++.++ .+..+++++.++|..++.=- ...+ .....+...+....+++ T Consensus 216 ~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~g~~~~~~~~~~~~~~~~~d~ 294 (434) T protein:vir:62 216 DEIELSPTEF-DALATVTKKLLARTGLPIEQIVMDELKKAYVRKETQYMVNGDEANNINDGALAKKAVEFKTDEKNLYDA 294 (434) T ss_pred eeEEeeheee-EeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccceeecccccccccccchhhH Confidence 6666666443 233466665555566677765 45689999999999887311 0000 00111222333456888 Q ss_pred HHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCc--EEE- Q lcl|Aclame:pro 144 IASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDE--QFV- 220 (273) Q Consensus 144 i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~--~~~- 220 (273) |.++...+.....+ +-..+++|..+..|.+..+.-.+. ..........|.-..+.|.+|+.++.+|.+++. ..+ T Consensus 295 l~~l~~~l~~~~~~--~a~~v~n~~~~~~L~~lkd~~G~~-l~~~~~~~~~g~~~tl~G~pV~~~~~~~~~~~~~~~~i~ 371 (434) T protein:vir:62 295 LVKMKNTPVKEVRK--KARWVLNTAALTKIETMKTDDGFP-LLRPFNQAEGGIGYTLLGFPVEEEDAIDIPDSPDTPVFY 371 (434) T ss_pred HHHHHhhcchhhhc--CCEEEEcHHHHHHHHHhhccCCCE-eeccCCCccCCCCceecceeeEEecCccCccCCCceEEE Confidence 98888877665432 225688999999986543211110 000001122344457999999999988865432 223 Q ss_pred EEeCceEEEEEecceeeeccCCCcc----eeeEEeeeeeeeEEEc-CceEEEEecCCC Q lcl|Aclame:pro 221 AFHPSAAAYVSQIDTVEALRDQDSF----SDRIRALHVYGGKVVR-PTGVVVFNKTGS 273 (273) Q Consensus 221 ~~~~~a~~~~~~~~~ve~~~~~~~~----~~~v~~~~~~g~~vl~-p~~~v~~~~~~s 273 (273) .+.-+..-...+...++..+....| ...+.+..+++++++. |+.+.+++-.+. T Consensus 372 ~Gdfs~~~i~~~~g~~~i~~~~~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~ 429 (434) T protein:vir:62 372 FGDFSKFYIQDVIGSLEVQKLVELFSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYVLK 429 (434) T ss_pred EeeccceEEEEeeceeEEEeehhhhcccCceEEEEEeeecceeecCcccceEEEEEec Confidence 2333433233232222222222212 2458999999999774 887777633322 No 137 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=99.23 E-value=2.3e-11 Score=78.86 Aligned_cols=262 Identities=16% Similarity=0.041 Sum_probs=149.7 Q ss_pred Cc------ccchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccc----cccccCCCCccCCcc-cc Q lcl|Aclame:pro 1 MA------FNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPT----VKDYKAAGRQTSADA-IS 69 (273) Q Consensus 1 MA------~~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~----~~d~~~~~~~~~~~~-~~ 69 (273) ++ ...++|+.|...+++.++....+..++..- ...|.++.+|+..... .+....+++...-.+ .. T Consensus 118 ~~~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~ 193 (413) T protein:vir:81 118 STATLTDEFQGGYGTTWNRNIIYRRREKLVVADLMDNL----TMTNTTIKYLMEKANRVVEGGFKTVAEGGKKPYMRFAD 193 (413) T ss_pred hhcccccccccccchhhHHHHHHHHhhhhhHHhhccee----eccCCceeEEEeccccccccccceecCcccccccCccc Confidence 11 134569999999999999888887776432 2235567777764422 233456665543333 34 Q ss_pred cceEEEEEEeeeeceeEechHHHHHhHHHHHHHHH-HHHHHHHHHHHHHHHHHH---------HhhcccccccccCCHhH Q lcl|Aclame:pro 70 DTGVDLLIDQEKSIDFLVDDIDRVQVAGSLEAYTR-AGATALATDTDKFIADML---------VDNGTALTGSAPSDADD 139 (273) Q Consensus 70 ~~~~~~tid~~~~~~~~i~d~d~~~~~~~~~~~~~-~~~~ala~~iD~~~~~~~---------~~~~~~~~~~~~~~~~~ 139 (273) .+.+++.+.+. +.-+.|++.-..++ ..+.++++ ..+++++.++|..++.-- ...+. ....+..+... T Consensus 194 f~~i~~~~~k~-~~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~Gi~~~~~-~~~~~~~~~~~ 270 (413) T protein:vir:81 194 FDIVTESLSKI-AGLTKITDEMIEDY-DFLVSYINARLLEELAIEEERQLLLGDGTGNNLTGLLKRDG-IQTLAVSNKDE 270 (413) T ss_pred ceeeEeeeeeE-EEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcccccccccc-cccccccccch Confidence 56666666544 33457776544333 34766554 578899999999877421 00000 01112223345 Q ss_pred HHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhh----hcccccceeeeeeeeeecceEEEEecccccCC Q lcl|Aclame:pro 140 AFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSA----DTSGDAAGLRAGTIGNLLGARIVESNNLRDTD 215 (273) Q Consensus 140 ~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~----~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~ 215 (273) .++.+.++...+.....-..+ .++++|..+..|.+..+.-.+. ...+.......+..+.++|.+|+.|+.+|.+. T Consensus 271 ~~~~i~~~~~~~~~~~~~~~~-~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~l~G~pv~~s~~~~~~~ 349 (413) T protein:vir:81 271 LADSIYKAMTNISLATPFQAD-ALVINPLDYQELRLAKDANGQYYGGGVFQGQYGSGGIMLDPAPWGLRTVQSQVVPVGK 349 (413) T ss_pred hHHHHHHHHHHhhhhccCCCc-EEEEcHHHHHHHHHhhccCCceeccccccccccccccccCceecceeeEEcCCCCccc Confidence 567777777665444322222 3789999999986432211110 00000000011223579999999999988643 Q ss_pred CcEEEEEe-CceEEEEEe-cceeeeccCCC-cc---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 216 DEQFVAFH-PSAAAYVSQ-IDTVEALRDQD-SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 216 ~~~~~~~~-~~a~~~~~~-~~~ve~~~~~~-~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ++.+. ..++-...+ ...++..+... .| ...+++.++||+.+.+|+++++++-+.. T Consensus 350 ---~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~ 410 (413) T protein:vir:81 350 ---PVVGAFRSAASVLRKGGVRIDSTNTNVDDFENNLITVRAEERVGLMVTFPEAIVQLDVAEV 410 (413) T ss_pred ---EEEEecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEecCC Confidence 34443 334433333 33455444322 22 3588899999999999999998876665 No 138 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=99.22 E-value=2.3e-12 Score=84.40 Aligned_cols=251 Identities=12% Similarity=0.057 Sum_probs=148.1 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccc-cccccccCCCCccCCcccccceE Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVA-PTVKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~~~~~~~~~ 73 (273) |.. -.++|+-+..++++.++....+..+++. .. ..| .++|.... .+.+....++......+++.+.+ T Consensus 83 l~~~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~v--~~--~~~--~~~p~~~~~~~~a~~v~E~~~~~~~~~~f~~v 156 (352) T protein:vir:78 83 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARL--TN--IKG--LEIPRVSYTLDDDDFITDVETAKELKLKGDTV 156 (352) T ss_pred hccCCCCCCceeccHhHHHHHHHHHHhhcchhhheee--Ee--cCC--ceEEEEecCCCcccccccccccccccccceee Confidence 321 2578999999999999888877777643 11 112 24565433 23455667777766667777777 Q ss_pred EEEEEeeeeceeEechHHHHHhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhcc-----cccccccCCHhHHHHHHHHH Q lcl|Aclame:pro 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAYT-RAGATALATDTDKFIADMLVDNGT-----ALTGSAPSDADDAFDLIASA 147 (273) Q Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~~-~~~~~ala~~iD~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~i~~a 147 (273) ++...+. ..-+.|+..-..++..++.+++ +..+++++.+.+..++......+. ...+....++.+.++.|.++ T Consensus 157 ~~~~~k~-~~~i~is~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~~~~~g~l~~~~~~~~t~~~~~d~i~~~ 235 (352) T protein:vir:78 157 KFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGANMYDAIINA 235 (352) T ss_pred eecceeE-EeechhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCcccccceeccccccccccchHHHHHHH Confidence 7777554 2335777766666667777654 557888877655555543221111 01112223444557888888 Q ss_pred HHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCceE Q lcl|Aclame:pro 148 LKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAA 227 (273) Q Consensus 148 ~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a~ 227 (273) ...|..... .+-..++++..+..|++-.+ + + ...+..|.-..++|.+|+.++..+. .+.+.-+-. T Consensus 236 ~~~l~~~~~--~~a~~~mn~~t~~~l~~~~~---~----~-~~~~~~~~~~~llG~PV~~~~~~~~-----~~~Gdf~~~ 300 (352) T protein:vir:78 236 LADLHEDYR--DNATIYMRYADYVKIISVLS---N----G-TTNFFDTPAEKVFGKPVVFTDAAVK-----PIVGDFNYF 300 (352) T ss_pred HhccChhhh--cCCEEEEehHHHHHHHHHHh---c----c-CCcccccCCccccccceEEecCCCc-----eeEeehhhh Confidence 777755543 23356788888877754321 1 1 1223334445799999999875432 222322111 Q ss_pred EEEEecceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 228 AYVSQIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 228 ~~~~~~~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) -.......++.+++...--..+.+.+++|+++++|++++.++.+.| T Consensus 301 ~~~~~~~~~~~~~~~~~g~~~f~~~~r~Dg~~~~~eA~~~l~~~a~ 346 (352) T protein:vir:78 301 GINYDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKES 346 (352) T ss_pred hhhhhhheeeeeccccCCeeEEEEEeeeCceeechhheEEEEeecc Confidence 0011112334444444334678889999999999999998866666 No 139 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=99.21 E-value=1.2e-11 Score=80.37 Aligned_cols=265 Identities=12% Similarity=0.013 Sum_probs=149.2 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcc--cccce Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADA--ISDTG 72 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~--~~~~~ 72 (273) |.. ..++|+-|..++++.+++..++..++.... ......++.+|+....+......+++....+. ++.+. T Consensus 110 ~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~--~~~~~g~~~~~~~~~~~~~~~v~e~~~~~~~~~~~~f~~ 187 (404) T protein:vir:10 110 ISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEP--VFTRSGSRTYEKRSKQKPMKPLSENQQIPTNGDNGKLER 187 (404) T ss_pred hccccCCCCceeechhHHHHHHHHHhhhhhHhhhhceee--ccCCccceEEEEecCCcceeeccccccccccccccceee Confidence 321 246799999999999999888888764422 12222356677665544455566666544432 44455 Q ss_pred EEEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcc-------cccccccCCHhHHHHHH Q lcl|Aclame:pro 73 VDLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVDNGT-------ALTGSAPSDADDAFDLI 144 (273) Q Consensus 73 ~~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~i 144 (273) ++++..+. +.-+.|++.-..++..++.++ .+..+++++.++|..++.--..... ....+...+....++++ T Consensus 188 i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~G~g~~~~~~gi~~~~~~~~~~~~~~~~~~~~ 266 (404) T protein:vir:10 188 FNFKLKDL-ADFMSIPNDLLKFADKSLEDWIINWFVDKVRITRNAEILYGAGGDEHATGIMTANKFKKITLPKSPALKDF 266 (404) T ss_pred eEeeheee-EeeehhhHHHHhhcHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcccceeeccccceeeccccccHHHH Confidence 55555433 334566665444555667664 5568999999999987743211000 00011122233345666 Q ss_pred HHHHH-HHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEec-ccccCC-C-cEEE Q lcl|Aclame:pro 145 ASALK-ELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESN-NLRDTD-D-EQFV 220 (273) Q Consensus 145 ~~a~~-~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~-~l~~~~-~-~~~~ 220 (273) ..+.. .+. ...... -.++++|..+..|.+..+...+ ... ...+..|..+++.|.+|+..+ .++..+ + ..++ T Consensus 267 ~~~~~~~l~-~~~~~~-~~~v~n~~~~~~L~~lkd~~G~--~l~-~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~ 341 (404) T protein:vir:10 267 KKCKNVELL-NVFKAT-SSWIVNQDGFNYLDSLEDKTGR--PYL-QPDPKDPTQYRFLGLPVIELPNDLLLSTESAIPVL 341 (404) T ss_pred HHHHHhhhh-ccccCC-CEEEEcHHHHHHHHHhhccCCc--eee-ccCcCCCCCccccceeeEEecccccCCCCCccEEE Confidence 66554 233 233222 3578999999998754321111 111 112334666789999998643 344322 2 2344 Q ss_pred EEeCc-eEEEEEe-cceeeeccCC-C---cceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 221 AFHPS-AAAYVSQ-IDTVEALRDQ-D---SFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 221 ~~~~~-a~~~~~~-~~~ve~~~~~-~---~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ++.-+ ++....+ ...++..+.. . +-...+++.+++|+.+++|+++++++-+.+ T Consensus 342 ~gd~s~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~a 400 (404) T protein:vir:10 342 LGDTKEAYKYVSDGAYELATTNIGAGAFETNTTKARIIMRIDGNVKDSEALLIAEIPVE 400 (404) T ss_pred EEeccccEEEEEecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeecc Confidence 55433 4433332 2234433222 1 224679999999999999999998866666 No 140 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=99.21 E-value=2.1e-11 Score=79.16 Aligned_cols=258 Identities=11% Similarity=0.026 Sum_probs=150.9 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEecccc-ccccccCCCCccC-Ccccccce Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAP-TVKDYKAAGRQTS-ADAISDTG 72 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~-~~~~~~~~ 72 (273) |.. ..++|+.|...+++.+++...+..++..-.. .....++.+|+.... ..+....+++... .+.++.+. T Consensus 116 ~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~ 193 (408) T protein:vir:74 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESV--STSSGSRVYEKWTDVTPLKAMDEEDGKIPDLDNPRLTI 193 (408) T ss_pred hcccccCCCceeechhHhhHHHHHHhhhcchhhhcceeec--cCCcceEEEEeecCCcccccccccccccccccccceee Confidence 221 2467999999999999999888887753221 111234566665443 3344566666554 24566777 Q ss_pred EEEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHH-HH Q lcl|Aclame:pro 73 VDLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASAL-KE 150 (273) Q Consensus 73 ~~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~-~~ 150 (273) +++++.+. +.-+.|++.-..++..++.++ .++.+++++.++|..++.-. ++....+. ...++++.++. .. T Consensus 194 i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~---G~~~~~~~----~~~~~~i~~~~~~~ 265 (408) T protein:vir:74 194 IKYLIKRY-AGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAM---GTVPKKPT----IANFDDVITMINTS 265 (408) T ss_pred EEeeeeeE-EeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcc---cccccccc----cccHHHHHHHHHHh Confidence 77777553 444577776665666677664 55688999999999776431 12111112 22345565544 34 Q ss_pred HhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecc--cccCCCc--EEEEEe-Cc Q lcl|Aclame:pro 151 LTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNN--LRDTDDE--QFVAFH-PS 225 (273) Q Consensus 151 l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~--l~~~~~~--~~~~~~-~~ 225 (273) +.....+ +-.++++|..+..|.+..+.-.+ .. ....+..|.-+.++|++|+.+++ +|..++. .++.+. +. T Consensus 266 l~~~~~~--~a~~v~n~~~~~~l~~lkd~~G~--~l-~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~i~~gd~~~ 340 (408) T protein:vir:74 266 VDPAIIA--TSSLLTNQSGLNKLALVKTAEGK--YL-LEPDPTKPNSYLIKGKQVIVVADRWLPNSGSTVYPLYYGDMSQ 340 (408) T ss_pred hhhhhcC--CCEEEEcHHHHHHHHHhhcCCCc--eE-eccCcCCCCCceecceeeEEecCcccccccCCcceEEEEehhc Confidence 4444332 33678999999999754221111 11 11112345557899999998654 4543322 234443 33 Q ss_pred eEEEEEe-cceeeeccCC----CcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 226 AAAYVSQ-IDTVEALRDQ----DSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 226 a~~~~~~-~~~ve~~~~~----~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ++....+ ...++..+.. .+....+++.+++|+++++|++++.++-++. T Consensus 341 ~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 393 (408) T protein:vir:74 341 AITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFTAI 393 (408) T ss_pred cEEEEEecceEEEEeccccchhhcceeeEEEEEeeCcEEecccceEEEEeecc Confidence 4444432 3344433321 1234678999999999999999999975444 No 141 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=99.20 E-value=3.2e-11 Score=78.12 Aligned_cols=265 Identities=14% Similarity=0.064 Sum_probs=159.6 Q ss_pred CcccchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccc----cccccCCCCccCCcccccceEEEE Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPT----VKDYKAAGRQTSADAISDTGVDLL 76 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~----~~d~~~~~~~~~~~~~~~~~~~~t 76 (273) .+--.+.|+.+. ++++.+++...+.+++++-.. .+..+.+||+++... ..+-..+.+..+.++++.+++++. T Consensus 19 ~~gG~L~P~~~~-~~i~~l~e~s~i~~~a~vi~t---~~s~~~~i~~i~~g~~~~~~~~~~~~~~~~~~~~~tf~~~~l~ 94 (314) T protein:vir:41 19 LGKGILAVQRFG-EFVREVRENSAIIKDARVLNA---LKSYEVDISRISLGVELEPGRNTSGTKVAPTADEVTVSTNTLE 94 (314) T ss_pred CCCceeChHHHH-HHHHHHHhccchhhheeeecc---cCccceeecccccCcccccccccccCCccCCcccccccceeee Confidence 222357899985 688899999999988865322 123467888876421 111122233344566778888888 Q ss_pred EEeeeeceeEechHHHHHhHH--HHHHH-HHHHHHHHHHHHHHHHHH-----------------HHHhhcccccccccCC Q lcl|Aclame:pro 77 IDQEKSIDFLVDDIDRVQVAG--SLEAY-TRAGATALATDTDKFIAD-----------------MLVDNGTALTGSAPSD 136 (273) Q Consensus 77 id~~~~~~~~i~d~d~~~~~~--~~~~~-~~~~~~ala~~iD~~~~~-----------------~~~~~~~~~~~~~~~~ 136 (273) ..+.. ..+.|++.-...+.. +++++ ....+++++.......+. -+..+...+...+..+ T Consensus 95 ~~kl~-~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~~~~~~~p~G~l~~a~~~~~~~~~~~ 173 (314) T protein:vir:41 95 MKELV-TKVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSLTTGRELYRINDGWMKLAGNQYTDAEPED 173 (314) T ss_pred eEEEE-EeecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCCcCcccchhcchhhhhhcccceeecCccc Confidence 86654 457788766665543 57664 455777788777554432 1111112222222233 Q ss_pred HhHHHHHHHHHHHHHhhcCCCc-CCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCC Q lcl|Aclame:pro 137 ADDAFDLIASALKELTKANVPN-VGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTD 215 (273) Q Consensus 137 ~~~~~~~i~~a~~~l~~~~vp~-~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~ 215 (273) ..+..+.|.++...|...---. ..-..+++++.+..+++.. .+.....++..+..|.-..++|++|+.++.+|... T Consensus 174 ~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~l---~~~~~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~ 250 (314) T protein:vir:41 174 ENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQL---LVRETGLGDSALIGATGLQYDGIPIQYVPALDALG 250 (314) T ss_pred cccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHHH---hccCCcccchhhhCCCCceecceeeEecccccccC Confidence 3345556677776664432111 1224678999888775422 11111223345555666679999999999887543 Q ss_pred --CcEEEEEeCceEEEEEec-ceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEE--ecCCC Q lcl|Aclame:pro 216 --DEQFVAFHPSAAAYVSQI-DTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVF--NKTGS 273 (273) Q Consensus 216 --~~~~~~~~~~a~~~~~~~-~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~--~~~~s 273 (273) ...++.++++-+.++.+. ..++.+|+.......+...++.|+.+.+++++|+. +.+.+ T Consensus 251 ~~~~~i~fgd~~nlv~~~~~~ir~~~~~~a~~~~~~~~~~~r~d~~~~~~~aa~~~~~~~~~~ 313 (314) T protein:vir:41 251 DDKARALLTVPTNLVYGFWRNIRIEPKRDAAMRRTEYIASLRADCNYEDENAAVAAVIDMSSG 313 (314) T ss_pred CCCceEEEechhheEEEeeceeEEeecccCcCCeEEEEEEEEeceEEEEcCcEEEEEeeccCC Confidence 345666777776665433 36788888877778999999999999888766644 33333 No 142 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=99.20 E-value=2.1e-11 Score=79.12 Aligned_cols=258 Identities=14% Similarity=0.049 Sum_probs=149.3 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccc-cccccccCCCCccC-Ccccccce Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVA-PTVKDYKAAGRQTS-ADAISDTG 72 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~-~~~~~~~~ 72 (273) |+. ..++|+-|...+++.+++...+..+++.- ...+.+.++|.... .+......+++... .+.++.+. T Consensus 109 ~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~ 184 (389) T protein:vir:10 109 TSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKT----PVTTPKGTYPILKRATDRFSSVAELAENPKLAEPEFNK 184 (389) T ss_pred hcccccCCcceeehHHHHHHHHHHHHhhhhHHhhccee----eccCCeeEEEEEecCCCcccccccccccccccccccee Confidence 332 25679999999999999888887776432 22244567776643 22223455555443 34566677 Q ss_pred EEEEEEeeeeceeEechHHHHHhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHH-H Q lcl|Aclame:pro 73 VDLLIDQEKSIDFLVDDIDRVQVAGSLEAYT-RAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALK-E 150 (273) Q Consensus 73 ~~~tid~~~~~~~~i~d~d~~~~~~~~~~~~-~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~-~ 150 (273) +++.+.+. +.-+.|++.-..++..++.+++ +..+++++...|..++.-..... ....+....++++.++.. . T Consensus 185 i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~-----~~~~~~~~~~d~l~~~~~~~ 258 (389) T protein:vir:10 185 VDWSVATY-RGAIPLSEEAIADSAVDLTALVGQSIKEKSVNTYNAMIAPVLQSFT-----AKKTTTDTLVDSLKHILNVD 258 (389) T ss_pred eeeeheee-EeeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc-----cccccccccHHHHHHHHHhh Confidence 77776543 3445667665555566777654 45788899999988876553221 122223334556665543 2 Q ss_pred HhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccc-cceeeeeeeeeecceEEEEecc--cccCCCcE-EEEEe-Cc Q lcl|Aclame:pro 151 LTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGD-AAGLRAGTIGNLLGARIVESNN--LRDTDDEQ-FVAFH-PS 225 (273) Q Consensus 151 l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~-~~~~~~G~ig~i~G~~i~~s~~--l~~~~~~~-~~~~~-~~ 225 (273) ++.. .+-.++++|..+..|.+..+.-.+.-...+ ......|..++++|.+|+.++. ++...+.. ++.+. .. T Consensus 259 ~~~~----~~a~~~~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~gd~~~ 334 (389) T protein:vir:10 259 LDPA----YSRALVVTQSLFNTLDTLKDKNGRYLLHDASDSITDGTAKGTILGVPVYVVGDTLLGSLAGDQKAFVGDLKR 334 (389) T ss_pred hhhh----hCcEEEecHHHHHHHHHhhccCCCeeeecCcccccccccccccccceeEEecccccCCCCCceEEEEeeccc Confidence 3322 234689999999999764321111000000 0111124446899999987543 33333332 33343 23 Q ss_pred eEEEEE-ecceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 226 AAAYVS-QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 226 a~~~~~-~~~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ++-... +...++..+ ...+.+.+++-+++|+.+++|++++.++-+.. T Consensus 335 ~~~~~~~~~~~i~~~~-~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~~ 382 (389) T protein:vir:10 335 GVLFTDRQQVTLAWED-SKIYGKYLGAAFRFGVQKADSKAGYFVTNTDV 382 (389) T ss_pred cEEEEeecceEEEeec-cccccceEEEEEEeccEEecccceEEEEeecc Confidence 343333 333454443 34456788999999999999999998863333 No 143 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=99.18 E-value=3.6e-11 Score=77.85 Aligned_cols=258 Identities=13% Similarity=0.026 Sum_probs=148.9 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCC-cccccceE Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSA-DAISDTGV 73 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~-~~~~~~~~ 73 (273) ++. ..++|+-+...+.+.+++...+..+++.- ...+....+|+....+.+....+++.+.. .+.+.+.+ T Consensus 84 ~~~~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~----~~~~~~~~i~~~~~~~~a~~~~E~~~~~~~~~~~f~~i 159 (390) T protein:vir:40 84 IAGNGFAGVTALLPPTVFERVFEDLTVEHPLLSKINFV----NTTATTEWIISVGDVATAWWGPLCAEIKEVLDNGFDKI 159 (390) T ss_pred HhccCcccCcccccHHHHHHHHHHHHhhhhhhhhceee----ecCCceeEEEEEcCCcceeeeccccccCccccccceee Confidence 211 35689999999999999888877776442 23356678888766666666676665543 35677777 Q ss_pred EEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHH-HHhhc--------ccc------cccccCCH Q lcl|Aclame:pro 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADM-LVDNG--------TAL------TGSAPSDA 137 (273) Q Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~-~~~~~--------~~~------~~~~~~~~ 137 (273) ++...+. +.-+.|+..-..++..++.++ .+..+++++.++|..++.= ....+ ... ......+. T Consensus 160 ~l~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~~i~~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~t~ 238 (390) T protein:vir:40 160 QTGMYKL-SAYIPVCNAMLDLGPSWLDQYVRTILGEAMALGLEAGIVNGSGKDQPIGMMRDLNNVTAGEHPVKTATPLTD 238 (390) T ss_pred EeeeeeE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhcccCCCccceeeeccccccccccccccccccch Confidence 7777554 344677776666677778775 5668999999999987741 00000 000 00111222 Q ss_pred hHHHHHHHHHHHHHhhcCCCc-CCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCC Q lcl|Aclame:pro 138 DDAFDLIASALKELTKANVPN-VGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD 216 (273) Q Consensus 138 ~~~~~~i~~a~~~l~~~~vp~-~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~ 216 (273) .+..+.+......+....-.. .+-.++++|..+..++.....+.+ ..| ..+.. ...+|.+|+.++.+|.+. T Consensus 239 ~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~~~~~~~d--~~G--~~v~~---~~~~g~pvv~~~~~p~~~- 310 (390) T protein:vir:40 239 LTPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIYAATSYMT--PQG--VWVTG---ILPVPLEIVQSVAVPVGK- 310 (390) T ss_pred hhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHHHHhhccC--CCC--ccccc---cCCCceeEEEcCCCCCCc- Confidence 222333333333333322111 234678898876555443221211 111 11211 124799999999998654 Q ss_pred cEEEEEeCceEEEEEec-ceeeeccCCC--cceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 217 EQFVAFHPSAAAYVSQI-DTVEALRDQD--SFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 217 ~~~~~~~~~a~~~~~~~-~~ve~~~~~~--~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ++.+..+-+-+..+. ..++...... +-.+.+++.+++|+++.+|+++++++-++- T Consensus 311 --i~~Gd~s~~~i~~~~~~~v~~~~~~~f~~~~~~~r~~~r~dg~v~~~~A~~~l~~~~~ 368 (390) T protein:vir:40 311 --AVAGRAKDYFMGIGSEQVIRTSTEYRLLDDETLYYAKQYANGRPKDNSSFLVFDITGL 368 (390) T ss_pred --EEEEeeceEEEEeecceEEEecchhhhhcCcEEEEEEEEeCCEEecccceEEEEeecc Confidence 444544443333332 2343332221 123789999999999999999998853222 No 144 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=99.16 E-value=4.8e-11 Score=77.16 Aligned_cols=258 Identities=12% Similarity=0.019 Sum_probs=146.5 Q ss_pred Cc--------ccchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccc-cccccccCCCCccCC-ccccc Q lcl|Aclame:pro 1 MA--------FNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVA-PTVKDYKAAGRQTSA-DAISD 70 (273) Q Consensus 1 MA--------~~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~-~~~~~ 70 (273) |+ ...++|+-|...+++.++...++..++..-. .....| ++.+|.... .+.+....+++.+.- +.++. T Consensus 105 ~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~-~~~~~~-~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f 182 (395) T protein:vir:38 105 VTSGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVEN-VTTSHG-SRVYEKLADITPLKDLDDESALIGDNDDPEL 182 (395) T ss_pred HhhccCccCCCceecchhHhhHHHHHHHhhcchhhhcceee-ccCCcc-eEEEEeeccCCccccccccccccccccccce Confidence 11 1256799999999999999988888774321 111122 344444433 233445666665532 23455 Q ss_pred ceEEEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHH Q lcl|Aclame:pro 71 TGVDLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALK 149 (273) Q Consensus 71 ~~~~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~ 149 (273) ..++++..+. +.-+.|++.-...+..++.++ .++.+++++.++|..++.-... ... ......+++|.++.. T Consensus 183 ~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~---~~~----~~~~~~~~~i~~~~~ 254 (395) T protein:vir:38 183 TVVKYLIHRY-AGITTVTNTLLKDTVDNIIQWLVNWAAKKDVVTRNAKILEVMGK---APK----KPTISQFDNIKDLEN 254 (395) T ss_pred eeEEeeeeee-EeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc---ccc----ccccccHHHHHHHHH Confidence 6666665443 333466665555555667664 5678899999999887753211 111 111222445555433 Q ss_pred -HHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccC--CC-cEEEEEeCc Q lcl|Aclame:pro 150 -ELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDT--DD-EQFVAFHPS 225 (273) Q Consensus 150 -~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~--~~-~~~~~~~~~ 225 (273) .+..... .+-.++++|..+..|.+..+.-.+ .. ....+..|.-+.++|++|+.++..+.+ .+ ..++.+.-+ T Consensus 255 ~~l~~~~~--~~a~~v~n~~~~~~L~~lkd~~G~--~l-~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~i~~gd~~ 329 (395) T protein:vir:38 255 NTLDPAIE--STSSFITNQSGYNILSKVKDADGR--YL-MQPDVTSPDKYLIDGKPVIRIADKWLPDVSGSHPLYFGDLK 329 (395) T ss_pred Hhhhhhhc--CCCEEEEcHHHHHHHHHhhccCCc--ee-eccCcCCCCcceeccceeEEecccccCcCCCcceEEEEecc Confidence 3332221 234689999999999654321111 11 111233566678999999998764433 22 234444433 Q ss_pred -eEEEEE-ecceeeeccCCC-cc---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 226 -AAAYVS-QIDTVEALRDQD-SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 226 -a~~~~~-~~~~ve~~~~~~-~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ++-... +...++..+... .| ...+++..++|+++++|++++.++-+.+ T Consensus 330 ~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 383 (395) T protein:vir:38 330 QGITLFDRQQMQIDTTNVGAGSFEHDTTKLRFIDRFDVQLIDDGAFAAASFKTV 383 (395) T ss_pred ccEEEEEecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeecc Confidence 333332 333455444322 12 4688999999999999999998865544 No 145 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=99.10 E-value=9.8e-12 Score=80.94 Aligned_cols=250 Identities=12% Similarity=0.069 Sum_probs=147.1 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEecccc-ccccccCCCCccCCcccccceE Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAP-TVKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~~~~~~~~~ 73 (273) |.. -.++|+-+..++++.++....+..+++.- .. .| .++|++... ..+....++...+..+++.+.+ T Consensus 118 ~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~--~~--~~--~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v 191 (387) T protein:vir:94 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLT--NI--KG--LEIPRVSYTLDDDDFITDVETAKELKAKGDTV 191 (387) T ss_pred hccCCCCCCceeechhHHHHHHHHHHhhchhhhhceee--ec--CC--ceeeeeeccCCcccccccccccccccccccee Confidence 221 25689999999999998887777766431 11 12 356665432 3345567777666667777777 Q ss_pred EEEEEeeeeceeEechHHHHHhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhcc-----cccccccCCHhHHHHHHHHH Q lcl|Aclame:pro 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAYT-RAGATALATDTDKFIADMLVDNGT-----ALTGSAPSDADDAFDLIASA 147 (273) Q Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~~-~~~~~ala~~iD~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~i~~a 147 (273) ++...+. +.-+.|+..-..++..++.+++ ++.+++++.+.+..++......+. ...+....++...+++|.++ T Consensus 192 ~l~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~~~~d~i~~~ 270 (387) T protein:vir:94 192 KFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINA 270 (387) T ss_pred eechhee-eeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccccchHHHHHHH Confidence 7766544 3335677665556667777754 567888888767666644322111 01122233445568888888 Q ss_pred HHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCc-e Q lcl|Aclame:pro 148 LKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPS-A 226 (273) Q Consensus 148 ~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~-a 226 (273) ...|.....+ .. ..++++..+..|++-. .+ + ...+..|.-..++|.+|+.++..+. .+.+.-+ + T Consensus 271 ~~~l~~~y~~-na-~~imn~~t~~~~~~~~---~~----~-~~~~~~~~~~~llG~PV~~~~~~~~-----~~~GDf~~~ 335 (387) T protein:vir:94 271 LADLHEDYRD-NA-TIYMRYADYVKIISVL---SN----G-TTNFFDTPAEKVFGKPVVFTDAAVK-----PIVGDFNYF 335 (387) T ss_pred HhccChhhhc-CC-EEEEechHHHHHHHHH---hc----C-CCcccccCCccccccceEEecCCCc-----eeeechhhh Confidence 7777655432 23 4567777766664321 11 1 1233345556799999999875432 2222211 1 Q ss_pred EEEEEecceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 227 AAYVSQIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 227 ~~~~~~~~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ... .+....+.+++...--..+++.+++|+++++|+++++++-+++ T Consensus 336 ~~~-~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~ 381 (387) T protein:vir:94 336 GIN-YDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKEN 381 (387) T ss_pred hhh-hhhhhheecccccCCceEEEEEEEeCcEeechhheEEEEeecC Confidence 110 1112233444443334678889999999999999998876555 No 146 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=99.10 E-value=9.8e-12 Score=80.94 Aligned_cols=250 Identities=12% Similarity=0.069 Sum_probs=147.1 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEecccc-ccccccCCCCccCCcccccceE Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAP-TVKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~~~~~~~~~ 73 (273) |.. -.++|+-+..++++.++....+..+++.- .. .| .++|++... ..+....++...+..+++.+.+ T Consensus 118 ~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~--~~--~~--~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v 191 (387) T protein:vir:96 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLT--NI--KG--LEIPRVSYTLDDDDFITDVETAKELKAKGDTV 191 (387) T ss_pred hccCCCCCCceeechhHHHHHHHHHHhhchhhhhceee--ec--CC--ceeeeeeccCCcccccccccccccccccccee Confidence 221 25689999999999998887777766431 11 12 356665432 3345567777666667777777 Q ss_pred EEEEEeeeeceeEechHHHHHhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhcc-----cccccccCCHhHHHHHHHHH Q lcl|Aclame:pro 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAYT-RAGATALATDTDKFIADMLVDNGT-----ALTGSAPSDADDAFDLIASA 147 (273) Q Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~~-~~~~~ala~~iD~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~i~~a 147 (273) ++...+. +.-+.|+..-..++..++.+++ ++.+++++.+.+..++......+. ...+....++...+++|.++ T Consensus 192 ~l~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~~~~d~i~~~ 270 (387) T protein:vir:96 192 KFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINA 270 (387) T ss_pred eechhee-eeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccccchHHHHHHH Confidence 7766544 3335677665556667777754 567888888767666644322111 01122233445568888888 Q ss_pred HHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCc-e Q lcl|Aclame:pro 148 LKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPS-A 226 (273) Q Consensus 148 ~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~-a 226 (273) ...|.....+ .. ..++++..+..|++-. .+ + ...+..|.-..++|.+|+.++..+. .+.+.-+ + T Consensus 271 ~~~l~~~y~~-na-~~imn~~t~~~~~~~~---~~----~-~~~~~~~~~~~llG~PV~~~~~~~~-----~~~GDf~~~ 335 (387) T protein:vir:96 271 LADLHEDYRD-NA-TIYMRYADYVKIISVL---SN----G-TTNFFDTPAEKVFGKPVVFTDAAVK-----PIVGDFNYF 335 (387) T ss_pred HhccChhhhc-CC-EEEEechHHHHHHHHH---hc----C-CCcccccCCccccccceEEecCCCc-----eeeechhhh Confidence 7777655432 23 4567777766664321 11 1 1233345556799999999875432 2222211 1 Q ss_pred EEEEEecceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 227 AAYVSQIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 227 ~~~~~~~~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ... .+....+.+++...--..+++.+++|+++++|+++++++-+++ T Consensus 336 ~~~-~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~ 381 (387) T protein:vir:96 336 GIN-YDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKEN 381 (387) T ss_pred hhh-hhhhhheecccccCCceEEEEEEEeCcEeechhheEEEEeecC Confidence 110 1112233444443334678889999999999999998876555 No 147 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=99.10 E-value=9.8e-12 Score=80.94 Aligned_cols=250 Identities=12% Similarity=0.069 Sum_probs=147.1 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEecccc-ccccccCCCCccCCcccccceE Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAP-TVKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~~~~~~~~~ 73 (273) |.. -.++|+-+..++++.++....+..+++.- .. .| .++|++... ..+....++...+..+++.+.+ T Consensus 118 ~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~--~~--~~--~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v 191 (387) T protein:vir:26 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLT--NI--KG--LEIPRVSYTLDDDDFITDVETAKELKAKGDTV 191 (387) T ss_pred hccCCCCCCceeechhHHHHHHHHHHhhchhhhhceee--ec--CC--ceeeeeeccCCcccccccccccccccccccee Confidence 221 25689999999999998887777766431 11 12 356665432 3345567777666667777777 Q ss_pred EEEEEeeeeceeEechHHHHHhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhcc-----cccccccCCHhHHHHHHHHH Q lcl|Aclame:pro 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAYT-RAGATALATDTDKFIADMLVDNGT-----ALTGSAPSDADDAFDLIASA 147 (273) Q Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~~-~~~~~ala~~iD~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~i~~a 147 (273) ++...+. +.-+.|+..-..++..++.+++ ++.+++++.+.+..++......+. ...+....++...+++|.++ T Consensus 192 ~l~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~~~~d~i~~~ 270 (387) T protein:vir:26 192 KFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINA 270 (387) T ss_pred eechhee-eeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccccchHHHHHHH Confidence 7766544 3335677665556667777754 567888888767666644322111 01122233445568888888 Q ss_pred HHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCc-e Q lcl|Aclame:pro 148 LKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPS-A 226 (273) Q Consensus 148 ~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~-a 226 (273) ...|.....+ .. ..++++..+..|++-. .+ + ...+..|.-..++|.+|+.++..+. .+.+.-+ + T Consensus 271 ~~~l~~~y~~-na-~~imn~~t~~~~~~~~---~~----~-~~~~~~~~~~~llG~PV~~~~~~~~-----~~~GDf~~~ 335 (387) T protein:vir:26 271 LADLHEDYRD-NA-TIYMRYADYVKIISVL---SN----G-TTNFFDTPAEKVFGKPVVFTDAAVK-----PIVGDFNYF 335 (387) T ss_pred HhccChhhhc-CC-EEEEechHHHHHHHHH---hc----C-CCcccccCCccccccceEEecCCCc-----eeeechhhh Confidence 7777655432 23 4567777766664321 11 1 1233345556799999999875432 2222211 1 Q ss_pred EEEEEecceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 227 AAYVSQIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 227 ~~~~~~~~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ... .+....+.+++...--..+++.+++|+++++|+++++++-+++ T Consensus 336 ~~~-~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~ 381 (387) T protein:vir:26 336 GIN-YDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKEN 381 (387) T ss_pred hhh-hhhhhheecccccCCceEEEEEEEeCcEeechhheEEEEeecC Confidence 110 1112233444443334678889999999999999998876555 No 148 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=99.10 E-value=8.5e-11 Score=75.81 Aligned_cols=257 Identities=15% Similarity=0.047 Sum_probs=151.6 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCC-cccccceE Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSA-DAISDTGV 73 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~-~~~~~~~~ 73 (273) |+. ..++|+.|...+.+.++...++..+++.-... ...| ++.+|+....+.+....+++.... +..+.+.+ T Consensus 123 ~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~-~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~~v 200 (397) T protein:vir:12 123 MSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQYVTVEPVT-TRSG-TRLLEKNADMVPFSPVEELGNLPEIDQPRFTKV 200 (397) T ss_pred ccccccccCcccCchhHHHHHHHhhhhhhhHHhhcceeecc-CCce-eEEEEEecCCcceeeecccccccccccccceeE Confidence 332 25679999999999999988887776532211 1122 456666555455566777766543 34566666 Q ss_pred EEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHH-HH Q lcl|Aclame:pro 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALK-EL 151 (273) Q Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~-~l 151 (273) ++...+. +.-+.|++.-..++..++.++ .++.++++++++|..++.-... . .+.+ ...+++|.++.. .+ T Consensus 201 ~~~~~k~-~~~~~is~e~l~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~g~-----~--~~~g-~~~~~~i~~~~~~~l 271 (397) T protein:vir:12 201 SYSIIDY-GGIMTLSNSMLNDSDQAIMTYVAKWFAKKSVVTRNNLILAAIAS-----L--KKVD-IDGLDGIKKALNVTL 271 (397) T ss_pred Eeeheee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhcccc-----c--cccc-cccHHHHHHHHhhcc Confidence 6666443 334566666555556677665 5568999999999887753211 1 1111 122455665442 34 Q ss_pred hhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecc-cccCC--CcEEEEEeC-ceE Q lcl|Aclame:pro 152 TKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNN-LRDTD--DEQFVAFHP-SAA 227 (273) Q Consensus 152 ~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~-l~~~~--~~~~~~~~~-~a~ 227 (273) .... ..+-.++++|..+..|.+..+.-.+ .. ....+.+|.-+.++|.+|+.+++ .|..+ ...++.+.- .++ T Consensus 272 ~~~~--~~~a~~~~n~~~~~~L~~lkd~~G~--~l-~~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~ 346 (397) T protein:vir:12 272 DPMV--APGSIVLTNQDGYDWLDTLKDGTGR--YL-LQPDPTNPTKKLLDGRPVVPFTNRVLKTQKGKAPLIIGNLKEAI 346 (397) T ss_pred chhh--hCCCEEEEcHHHHHHHHHhhccCCc--ee-ecccccCCCCccccceeeEEecccccccCCCccEEEEEehhceE Confidence 3322 2234688999999999653221111 11 11223456667899999988765 33222 223444443 344 Q ss_pred EEE-EecceeeeccCCC----cceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 228 AYV-SQIDTVEALRDQD----SFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 228 ~~~-~~~~~ve~~~~~~----~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ... .+...++..+... .-...+++.+++|.++++|+++++++-++= T Consensus 347 ~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 347 VLFDREQQSIASTDTGAGAFETNSTKVRGIEREDVRKWDEDAVVFGQITVE 397 (397) T ss_pred EEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEEeeC Confidence 333 3333454433222 124689999999999999999998876666 No 149 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=99.09 E-value=1.1e-10 Score=75.28 Aligned_cols=263 Identities=11% Similarity=0.071 Sum_probs=154.4 Q ss_pred Cccc----chhHH--HHHHHHHHHHHHhhcc--chhhhcccccc---ccCCcEEEEEeccccc-ccc--ccC--CCCccC Q lcl|Aclame:pro 1 MAFN----NFIPE--LWSDMLLEEWTAQTVF--ANLVNREYEGI---ASKGNVVHIAGVVAPT-VKD--YKA--AGRQTS 64 (273) Q Consensus 1 MA~~----~~~pe--v~~~~v~~~l~~~~v~--~~~~~~d~~~~---~~~Gdtv~ip~~~~~~-~~d--~~~--~~~~~~ 64 (273) ||.+ .++|| +|...+.+.-.+...| .+++..+-++. ...|+.+++|.|+.+. ..+ +.. ....++ T Consensus 1 Ma~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~L~g~~e~nv~~D~~~~~~t 80 (349) T protein:vir:78 1 MAITTIGDIVTGNIPVLASYMTEDPVEKTAFFDSGILTSTPYAAEIANGPSNIANLPFWKAIDTSIEPNYSNDVYQDIAT 80 (349) T ss_pred CCceEEeeeeccCHHHHHHHHHHhhHHhhhhhhccceeccHHHHHHhhcCCCEEEeeeeecCCCCcccccCCCCcccccc Confidence 9964 56787 8999888877555433 34454454432 2459999999999864 333 211 122455 Q ss_pred CcccccceEEEEEEeeeeceeEechHHHHHhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhc------c-cc---cc-- Q lcl|Aclame:pro 65 ADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADMLVDNG------T-AL---TG-- 131 (273) Q Consensus 65 ~~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~iD~~~~~~~~~~~------~-~~---~~-- 131 (273) +..++..+....+ .++..+|..+|.-...+-.+ ++.+.++.+.--.+...+.+++.+...= . .. .. T Consensus 81 ~~kitt~~~~a~~-~~r~kaw~~~Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~a~~~~~~~~~~t 159 (349) T protein:vir:78 81 PRAIQTGEMMARV-AYLNEGFGQADLTVELTSQNPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSATDAYHEQNDMV 159 (349) T ss_pred cccccccceeeee-eeeccccchhHHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcccccccchhhhcccce Confidence 6666666555444 56677777777654433333 4556666666556666666666654310 0 00 00 Q ss_pred -cccCCHhHHHHHHHHHHHHHhhcCC--CcCC-cEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEE Q lcl|Aclame:pro 132 -SAPSDADDAFDLIASALKELTKANV--PNVG-RVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVE 207 (273) Q Consensus 132 -~~~~~~~~~~~~i~~a~~~l~~~~v--p~~~-r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~ 207 (273) ..+.+.....+.+.++...|.++-. ..+. ..++++|.++..|.+.. .+. +... .-+...|+.+.|..|++ T Consensus 160 ~d~s~~a~~~~~~~~dA~~~lgda~~Gd~~~~lt~i~mHS~v~~~L~~~~-li~---~i~~--s~~~~~i~ty~G~~Viv 233 (349) T protein:vir:78 160 VDVSATLGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQ-LID---FIRD--AENNTMFATYQGYRVIV 233 (349) T ss_pred eeeccccCCChhhhhhhHHHHHHHhccccccceeEEEEchHHHHHHHhhh-hhh---hccC--cccCcccceecCeEEEE Confidence 0011111223456666666665421 0111 36799999999997643 221 1111 11245689999999999 Q ss_pred ecccccCC-----CcEEEEEeCceEEEEEec--ceeeeccCCCcc----eeeEEeeeeeeeEEEcCceEEEEecCC---- Q lcl|Aclame:pro 208 SNNLRDTD-----DEQFVAFHPSAAAYVSQI--DTVEALRDQDSF----SDRIRALHVYGGKVVRPTGVVVFNKTG---- 272 (273) Q Consensus 208 s~~l~~~~-----~~~~~~~~~~a~~~~~~~--~~ve~~~~~~~~----~~~v~~~~~~g~~vl~p~~~v~~~~~~---- 272 (273) ++.+|..+ .++++++-++|+++.... ..+|..|++... .|.+..+.+| ++.|-|+.-..+.. T Consensus 234 DD~~Pv~~~g~~~~yttylfg~GAi~~~~~~~~~~~et~rd~~~g~~~G~d~l~~R~~~---~~hp~G~s~~~a~v~~~~ 310 (349) T protein:vir:78 234 DDSMTVVGQGAQRKFISIIFGQGAIGYGEGNPVMPLEYEREASRANGGGVETLWTRKTW---LLHPFGYRFTSAVITGNG 310 (349) T ss_pred eCCCccccCCCCceEEEEEeecceEEEccCCCccceeeecccccCCcceeEEEEEeeEE---EeeeeeeeeccccccCCc Confidence 99999643 246678889999987543 357888887643 3667666655 45566655544321 Q ss_pred ------C Q lcl|Aclame:pro 273 ------S 273 (273) Q Consensus 273 ------s 273 (273) | T Consensus 311 ~~~~~~s 317 (349) T protein:vir:78 311 TETIARS 317 (349) T ss_pred cccccCC Confidence 1 No 150 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=99.08 E-value=3.5e-11 Score=77.90 Aligned_cols=251 Identities=12% Similarity=0.059 Sum_probs=143.5 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccc-cccccccCCCCccCCcccccceE Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVA-PTVKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~~~~~~~~~ 73 (273) |.. -.++|+-+..++++.++....+..++..- . ..| ..+|.... .+.+....++......+++.+.+ T Consensus 118 l~~~t~s~gG~~IP~~~~~~Ii~~~~~~~~l~~~~~v~--~--~~~--~~~p~~~~~~~~a~~v~E~~~~~~~~~~f~~v 191 (387) T protein:vir:93 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLT--N--IKG--LEIPRVSYTLDDDDFITDVETAKELKLKGDTV 191 (387) T ss_pred hccCcCCCCceeechhHHHHHHHHHHhhchhhhheeee--e--cCC--ceEEEEeecCCccccccCccccccccccccee Confidence 222 24789999999999998887776666431 1 112 34665432 23344566776666666777776 Q ss_pred EEEEEeeeeceeEechHHHHHhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhcc-----cccccccCCHhHHHHHHHHH Q lcl|Aclame:pro 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAYT-RAGATALATDTDKFIADMLVDNGT-----ALTGSAPSDADDAFDLIASA 147 (273) Q Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~~-~~~~~ala~~iD~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~i~~a 147 (273) ++...+. +.-+.|+..-..++..++.+++ ++.+++++.+.+..++......+. .....+..++...+++|.++ T Consensus 192 ~~~~~k~-~~~~~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~l~~~~~~~v~~~~~~d~i~~~ 270 (387) T protein:vir:93 192 KFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLDHMSFYNGSVKEVEGADMYDAIINA 270 (387) T ss_pred eeeheee-eeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccccchHHHHHHH Confidence 6666444 2335677665656667787754 557888888777766644322111 01122333445568888888 Q ss_pred HHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCceE Q lcl|Aclame:pro 148 LKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAA 227 (273) Q Consensus 148 ~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a~ 227 (273) ...|...... .. ..++++..+..|++-. .+ + +..+..|.=..++|.+|+.++..+. .+++.-+-. T Consensus 271 ~~~l~~~~~~-~a-~~~mn~~t~~~~~~~~---~d----~-~~~~~~~~~~~llG~PV~~~~~~~~-----~~~GDf~~~ 335 (387) T protein:vir:93 271 LADLHEDYRD-NA-TIYMRYADYVKIISVL---SN----G-TTNFFDTPAEKVFGKPVVFTDAAVK-----PIVGDFNYF 335 (387) T ss_pred HhccChhhhc-CC-EEEEechHHHHHHHHH---hc----C-CCcccccCCccccccceEEecCCCc-----eeeeehhhh Confidence 7777665432 23 4567777666554321 11 1 1122234445799999999875432 223322211 Q ss_pred EEEEecceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 228 AYVSQIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 228 ~~~~~~~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ........++.++....-...+.+..++|+++++|++++.++-+.+ T Consensus 336 ~~~~~~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~eA~~~l~~k~~ 381 (387) T protein:vir:93 336 GINYDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKEN 381 (387) T ss_pred heehhhheeeecccccCCceeEEEEeeeCceeechhheEEEEeecC Confidence 1111112233333333334567788999999999999998755333 No 151 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=99.07 E-value=1.6e-11 Score=79.70 Aligned_cols=250 Identities=12% Similarity=0.070 Sum_probs=145.8 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccc-cccccccCCCCccCCcccccceE Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVA-PTVKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~~~~~~~~~ 73 (273) |.. -.++|+-+..++++.++....+..+++.- .. .| .++|++.. .+.+....++...+..+++.+.+ T Consensus 133 ~~~~t~~~GG~lIP~~~~~~Ii~~~~~~~~l~~~~~v~--~~--~~--~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~i 206 (402) T protein:vir:93 133 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLT--NI--KG--LEIPRVSYTLDDDDFITDVETAKELKAKGDTV 206 (402) T ss_pred hccCCCcCCccccchhHHHHHHHhHHhhhhhhhhceee--ec--CC--ceeeeeeccCCcccccccccccccccccccee Confidence 221 25789999999999998888777776531 11 12 34666543 23345567777666666677776 Q ss_pred EEEEEeeeeceeEechHHHHHhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhcc-----cccccccCCHhHHHHHHHHH Q lcl|Aclame:pro 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAYT-RAGATALATDTDKFIADMLVDNGT-----ALTGSAPSDADDAFDLIASA 147 (273) Q Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~~-~~~~~ala~~iD~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~i~~a 147 (273) ++...+. +.-+.|+..-..++..++.+++ ++.+++++.+.+..++......+. ...+....++...+++|.++ T Consensus 207 ~~~~~k~-~~~i~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~~~~~~~~~~~~~~~~d~l~~~ 285 (402) T protein:vir:93 207 KFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINA 285 (402) T ss_pred eecceee-eeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccccchHHHHHHH Confidence 6666443 3335677665556667777654 567888888776666644322111 01112233445567888888 Q ss_pred HHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCc-e Q lcl|Aclame:pro 148 LKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPS-A 226 (273) Q Consensus 148 ~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~-a 226 (273) ...|+.... ... ..++++..+..|++-. .+ + ...+..|.=..++|.+|+.++..+. .+.+.-+ + T Consensus 286 ~~~l~~~y~-~na-~~imn~~t~~~~~~~~---~d----~-~~~~~~~~~~~llG~PV~~t~~~~~-----i~~GDf~~~ 350 (402) T protein:vir:93 286 LADLHEDYR-DNA-TIYMRYADYVKIISVL---SN----G-TTNFFDTPAEKVFGKPVVFTDAAVK-----PIVGDFNYF 350 (402) T ss_pred HhccChhhh-cCC-EEEEechHHHHHHHHH---hc----C-CCcccccCCccccccceEEecCCCc-----eeeechhhh Confidence 877765543 233 4567777666554321 11 1 1223334445799999999875432 2222111 1 Q ss_pred EEEEEecceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 227 AAYVSQIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 227 ~~~~~~~~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ... .....++.+++...--..+++..++|+++++|++++.++-++. T Consensus 351 ~~~-~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~ik~~ 396 (402) T protein:vir:93 351 GIN-YDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKEN 396 (402) T ss_pred hhh-hhhhhhhhhhcccCCceEEEEEEEeCcEEechhheEEEEeecC Confidence 111 1112234445544334678899999999999999998765444 No 152 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=99.07 E-value=7.3e-11 Score=76.16 Aligned_cols=251 Identities=11% Similarity=0.008 Sum_probs=135.1 Q ss_pred CcccchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEecc--ccccccccCCCCccC-CcccccceEEEEE Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVV--APTVKDYKAAGRQTS-ADAISDTGVDLLI 77 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~--~~~~~d~~~~~~~~~-~~~~~~~~~~~ti 77 (273) +......|+-+...+.+. .....+..++.. ....+....+|.+. ....+ ...+++... ..+.+.+.+++++ T Consensus 138 ~~~~~~vp~~~~~~i~~~-~~~~~l~~~~~~----~~~~~~~~~~~~~~~~~~~~~-~~~E~~~~~~~~~~~~~~i~~~~ 211 (397) T protein:vir:96 138 VEGGALIPQELLQPQLEP-KDIVDLSKYVRS----VPVNSASGKFPVISKSGSKMA-TVQQLEKNPQLANPKMVEIDYSV 211 (397) T ss_pred cccccchhHHHHHHHHHh-hhhhhHHHhhhh----ccccccceeEEEEeccCCccc-cccccccccccccccccceeecH Confidence 333456788888777763 333333333322 11122344555443 32322 344444332 3456666767666 Q ss_pred EeeeeceeEechHHHHHhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHHHHhhcCC Q lcl|Aclame:pro 78 DQEKSIDFLVDDIDRVQVAGSLEAYT-RAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTKANV 156 (273) Q Consensus 78 d~~~~~~~~i~d~d~~~~~~~~~~~~-~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~~~~v 156 (273) .+. +.-+.++..-..++..++.+++ +..+.+++...|..++.-.... ... +...+++|.++........ T Consensus 212 ~~~-~~~~~~s~ell~ds~~~l~~~i~~~l~~~~~~~~~~~i~~g~g~~----~~~----~~~~~d~~~~~~~~~~~~~- 281 (397) T protein:vir:96 212 ATR-RGYIPISQEMIDDASYDVTGLIADEIQDQSLNTKNADIAAVLKTA----TAK----SVVGVDGLKDLINKEIKKV- 281 (397) T ss_pred hHh-hcchhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc----ccc----cccchHHHHHHHHHhhhhh- Confidence 443 3344556554555556676654 4578888888888777543211 111 1223455555543322221 Q ss_pred CcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCC--C-cEEEEEeCc-eEEEEEe Q lcl|Aclame:pro 157 PNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTD--D-EQFVAFHPS-AAAYVSQ 232 (273) Q Consensus 157 p~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~--~-~~~~~~~~~-a~~~~~~ 232 (273) .+-..+++|..+..|.+..+.-.+ +. ....+..|.-+.+.|.+|+.++....+. + ..++.+.-+ ++....+ T Consensus 282 --~~a~~v~n~~~~~~l~~lkd~~G~--~~-~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~ 356 (397) T protein:vir:96 282 --YDVKLFISASMYSELDKLKDKNGR--YL-LQDSITAASGKQLLGKEVVVLDDDVIGKSVGNVVGFIGDAKAFASFFDR 356 (397) T ss_pred --cCcEEEEcHHHHHHHHHhhccCCC--eE-eccCccCCCcccccccceEEecccccCCCCCceEEEEeehhcceEeEee Confidence 233689999999999754321111 11 1112334556789999999876543222 2 234444433 3333333 Q ss_pred c-ceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 233 I-DTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 233 ~-~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) . ..++.. +...+.+.+++-+++|+++.+|++++.++-+.. T Consensus 357 ~~~~~~~~-~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~a 397 (397) T protein:vir:96 357 KQVSVSWV-DNNIYGQLLAGIIRYDVKATDKKAGFYVTFTIG 397 (397) T ss_pred cceEEEEe-cccccceeEEEEEEEccEEecccceEEEEeecC Confidence 2 233333 334456788999999999999999999964333 No 153 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=99.05 E-value=6.3e-11 Score=76.51 Aligned_cols=267 Identities=16% Similarity=0.089 Sum_probs=138.1 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEecccccc-ccccCCCCccC-----Cccc Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTV-KDYKAAGRQTS-----ADAI 68 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~-~d~~~~~~~~~-----~~~~ 68 (273) +.. ..+.|+.+..++++.++...++..++.+-. ....+.++.||+....+. +....++...+ ..++ T Consensus 157 ~~~~~~~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~--~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~~ 234 (477) T protein:vir:84 157 LDRNGGTGGYAVPPLWMMNRFIELARAGRTYANLCPTEP--LPGGTSSINIPKILTGTSTAIQAADNAALTAPSAHEVDL 234 (477) T ss_pred ccccCCCcceeeccchhHHHHHHHhhhcchHHHhhceee--ecCCcceeEEEEEecCcceeeeeccCccccccccccccc Confidence 111 134577788899999988877777664321 122345789998644322 23345544332 2233 Q ss_pred ccceEEEEEEeeeeceeEechHHHHHhHHHHHHHH-HHHHHHHHHHHHHHHHHH-HHh-hcccc---------c-ccccC Q lcl|Aclame:pro 69 SDTGVDLLIDQEKSIDFLVDDIDRVQVAGSLEAYT-RAGATALATDTDKFIADM-LVD-NGTAL---------T-GSAPS 135 (273) Q Consensus 69 ~~~~~~~tid~~~~~~~~i~d~d~~~~~~~~~~~~-~~~~~ala~~iD~~~~~~-~~~-~~~~~---------~-~~~~~ 135 (273) +...+++...+. +.-+.|++.-..++..++.+++ ++.+++++.++|..++.= ... .+... . +.+.. T Consensus 235 ~f~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~~p~Gi~~~~~~~~~~~~~~~~ 313 (477) T protein:vir:84 235 TDGFVQANVKTI-AGQQGIAIQLLDQAAVSVDEFVFRDLAADYANKLNVQVISGTGSNNQVVGVRATAGITQVTATSAGS 313 (477) T ss_pred ceeeEEEeeeeE-EeeeHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHhccCCCCCccceeeecccccccccccccc Confidence 445555555332 2334566655555566777755 568999999999987731 100 01100 0 00111 Q ss_pred C---HhHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhh----hccc------ccceeeeeeeeeecc Q lcl|Aclame:pro 136 D---ADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSA----DTSG------DAAGLRAGTIGNLLG 202 (273) Q Consensus 136 ~---~~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~----~~~~------~~~~~~~G~ig~i~G 202 (273) + ....++.|.++...++.... ......+++|..+..|.+..+.-.+. +..+ ....+.+|..|.+.| T Consensus 314 t~~~~~~~~~~i~~~~~~~~~~~~-~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G 392 (477) T protein:vir:84 314 ALEKHQIIYQKIADAIQRVHTSRF-LEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHG 392 (477) T ss_pred chhhHHHHHHHHHHHHhhcccccc-CCccEEEEcHHHHHHHHHhhccCCCeeeecCcccccccccccccccccccchhcc Confidence 1 12345556665555443322 12346788999999886543211110 0000 011234556678999 Q ss_pred eEEEEecccccCCC-----cEEEEEeCceEEEEEecceeeeccCCCcce----eeEEeeeeeeeEEEc-CceEEEEecCC Q lcl|Aclame:pro 203 ARIVESNNLRDTDD-----EQFVAFHPSAAAYVSQIDTVEALRDQDSFS----DRIRALHVYGGKVVR-PTGVVVFNKTG 272 (273) Q Consensus 203 ~~i~~s~~l~~~~~-----~~~~~~~~~a~~~~~~~~~ve~~~~~~~~~----~~v~~~~~~g~~vl~-p~~~v~~~~~~ 272 (273) .+|+.++.+|...+ ..++.+.-+.+-.... .++...+...++ ..++..-++++..+| |+++|+++-++ T Consensus 393 ~pVv~s~~~p~~~~~~~d~~~i~~gd~~~~~i~~~--~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~r~~~afv~~t~~~ 470 (477) T protein:vir:84 393 LPVVTDPTLPTTLGTGTDQDVIHVLRASDLALFES--SVRMRALQETRAENLSVLLQVYGYLAFTAARFPQSVVEIGGTA 470 (477) T ss_pred cceEecCcccccccccCCcceEEEEEeceEEEEee--ceeEEeccccccccceeeeeehhhhhhhhhccccceEEeeccc Confidence 99999999996422 1233333333222211 122222222222 222222234445666 99999998888 Q ss_pred C Q lcl|Aclame:pro 273 S 273 (273) Q Consensus 273 s 273 (273) . T Consensus 471 ~ 471 (477) T protein:vir:84 471 L 471 (477) T ss_pred c Confidence 7 No 154 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=99.05 E-value=1.4e-10 Score=74.61 Aligned_cols=256 Identities=11% Similarity=0.011 Sum_probs=137.9 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEecccc-ccccccCCCCccC-Ccccccce Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAP-TVKDYKAAGRQTS-ADAISDTG 72 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~-~~~~~~~~ 72 (273) ++. -.++|+-+...+.. +.....+..++++- .....+.++|..... +......+++... .++.+.+. T Consensus 156 ~~~~~~~~~g~lvp~~~~~~i~~-~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~e~~~~~e~~~~~~~~ 230 (437) T protein:vir:10 156 VTGIALKDGKVIIPETILTPEKE-VHQFPRLGSLVRTE----SVTTTTGKLPIFNNSTDLLTAHTEYGQTTKNATPVITP 230 (437) T ss_pred hhhcccccccccchHHHHHHHHH-hhhhhhhhhcceeE----eeccCceeeEEeecccccccccccccccccccccccee Confidence 211 14678888776654 44444444444321 112334567665332 2334445554432 23345555 Q ss_pred EEEEEEeeeeceeEechHHHHHhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHHH-H Q lcl|Aclame:pro 73 VDLLIDQEKSIDFLVDDIDRVQVAGSLEAYT-RAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALK-E 150 (273) Q Consensus 73 ~~~tid~~~~~~~~i~d~d~~~~~~~~~~~~-~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~-~ 150 (273) +++...+. +.-+.|+..-..++..++.+++ +..+++++.++|..++.-.... ... .+....+++|.++.. . T Consensus 231 v~~~~~k~-~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~g~g~~---~~~---~~~~~~~~~~~~~~~~~ 303 (437) T protein:vir:10 231 ILWDLKTY-TGGYVFSQELISDSSYDWQAELQSRLIELRDNTDDSLIITALTDG---IKK---TTSTYLLGDLKKVLNVT 303 (437) T ss_pred eeeehhhe-eeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhccc---ccc---cccccchhhHHHHHHhh Confidence 55555333 3335666655555566777655 4578899999998887654221 111 112222334444332 3 Q ss_pred HhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecc--cccCCCc--EEEEEe-Cc Q lcl|Aclame:pro 151 LTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNN--LRDTDDE--QFVAFH-PS 225 (273) Q Consensus 151 l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~--l~~~~~~--~~~~~~-~~ 225 (273) +..... .+-..+++|..+..|.+..+.-.+ .. ....+..|.-+.++|.+|+.++. +|..+.. .++.+. .. T Consensus 304 l~~~~~--~~~~~~~~~~~~~~l~~lkd~~g~--~~-~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~ 378 (437) T protein:vir:10 304 LKPQDS--AAASIVMSQSAYNLFDMATDAMGR--PL-LQPNVTAATGYTLLGKTVVIVDDKLFPSASAGDVNIVVAPLKK 378 (437) T ss_pred hhhhhh--cCCEEEEcHHHHHHHHHhhccCCC--ee-eccCccCCCCcccccceeEEecccccCCcCCCceEEEEeeccc Confidence 333322 233679999999998654321111 11 11123346667899999998764 3543322 234443 23 Q ss_pred eEEEEEe-cceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 226 AAAYVSQ-IDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 226 a~~~~~~-~~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ++....+ ...++...+-..+.+.+.+-+++|+++++|++++.|+.... T Consensus 379 ~~~~~~r~~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~~ 427 (437) T protein:vir:10 379 AVINFKLTEITGQFQDTYDIWYKQLGIFLRQNVVQASKDLIVNLTGKLK 427 (437) T ss_pred cEEEEeeeceEEEEecccccccceeeEEEEEccEEecccceEEEEeecc Confidence 4444432 33455444444556788888999999999999999874322 No 155 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=99.03 E-value=5.4e-10 Score=71.38 Aligned_cols=264 Identities=14% Similarity=0.058 Sum_probs=145.0 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccc-cccccccCCCCccCCcccccceE Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVA-PTVKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~~~~~~~~~ 73 (273) |.. -.++|+.|...+++.+++.+.+..++.+- ...+.++.||+... ...+....+++.....+++.+.+ T Consensus 151 ~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~----~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~s~~~f~~i 226 (497) T protein:vir:78 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSR----PVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARV 226 (497) T ss_pred hhcccCcccccccchhhhHHHHHHHHhhhhHHhhcccc----ccCCCceEEEEEcCCCCcceeeccCcccccccccceee Confidence 222 25678899999999999888888887542 22345789998644 23456778888777777777777 Q ss_pred EEEEEeeeeceeEechHHHHHhHHHHHHHH-HHHHHHHHHHHHHHHHHH---------HHhhccccccc----------- Q lcl|Aclame:pro 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAYT-RAGATALATDTDKFIADM---------LVDNGTALTGS----------- 132 (273) Q Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~~-~~~~~ala~~iD~~~~~~---------~~~~~~~~~~~----------- 132 (273) ++...+... -+.|++.-.... .++.+++ ++.+++++.++|..++.= +.......... T Consensus 227 ~~~~~k~a~-~~~iS~ell~d~-~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~ 304 (497) T protein:vir:78 227 YEQVGKVAN-ALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) T ss_pred EeeeeeeEe-ecHhHHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhh Confidence 777655422 345665433333 3476654 568899999999887631 10000000000 Q ss_pred ------------------------------------------ccCCHhHHHHHHHHHHHHHhhcCC-CcCCcEEEECHHH Q lcl|Aclame:pro 133 ------------------------------------------APSDADDAFDLIASALKELTKANV-PNVGRVVVVNAEM 169 (273) Q Consensus 133 ------------------------------------------~~~~~~~~~~~i~~a~~~l~~~~v-p~~~r~lvv~p~~ 169 (273) ...+.......+..+...+..... +.. ..+++|.. T Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~vmn~~~ 382 (497) T protein:vir:78 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPN--AVVMNPRD 382 (497) T ss_pred hhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCC--eEEEchHH Confidence 000001111222222222222211 111 47899999 Q ss_pred HHHHhcchHHhhhhhcc---cccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCceEEEEEecc-eeeeccC-CCc Q lcl|Aclame:pro 170 AFWLRSSGSKLTSADTS---GDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQID-TVEALRD-QDS 244 (273) Q Consensus 170 ~~~L~~~~~~~~~~~~~---~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a~~~~~~~~-~ve~~~~-~~~ 244 (273) +..|.+..+.-.+.-.. +.......+.-..+.|.+|+.++.+|.+.. .+-.+...++....+.. .++.... ... T Consensus 383 ~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~-~~Gd~~~~~~~i~~r~~~~v~~~~~~~~~ 461 (497) T protein:vir:78 383 WELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTI-LVGHFAPSVIQTARREGVTMQMTNSNGTD 461 (497) T ss_pred HHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCCce-EEeecccceEEEEEecccEEEeecccchh Confidence 99886543211111000 000000112234789999999999986542 11123334455444332 3332221 121 Q ss_pred c---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 245 F---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 245 ~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) | -..|++.+++|+.+.+|+++++++-+.+ T Consensus 462 f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~ 493 (497) T protein:vir:78 462 FVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) T ss_pred hhcCcEEEEEEEeecceeeccccEEEEEecCC Confidence 2 3578899999999999999999854444 No 156 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=99.03 E-value=5.4e-10 Score=71.38 Aligned_cols=264 Identities=14% Similarity=0.058 Sum_probs=145.0 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccc-cccccccCCCCccCCcccccceE Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVA-PTVKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~~~~~~~~~ 73 (273) |.. -.++|+.|...+++.+++.+.+..++.+- ...+.++.||+... ...+....+++.....+++.+.+ T Consensus 151 ~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~----~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~s~~~f~~i 226 (497) T protein:vir:10 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSR----PVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARV 226 (497) T ss_pred hhcccCcccccccchhhhHHHHHHHHhhhhHHhhcccc----ccCCCceEEEEEcCCCCcceeeccCcccccccccceee Confidence 222 25678899999999999888888887542 22345789998644 23456778888777777777777 Q ss_pred EEEEEeeeeceeEechHHHHHhHHHHHHHH-HHHHHHHHHHHHHHHHHH---------HHhhccccccc----------- Q lcl|Aclame:pro 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAYT-RAGATALATDTDKFIADM---------LVDNGTALTGS----------- 132 (273) Q Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~~-~~~~~ala~~iD~~~~~~---------~~~~~~~~~~~----------- 132 (273) ++...+... -+.|++.-.... .++.+++ ++.+++++.++|..++.= +.......... T Consensus 227 ~~~~~k~a~-~~~iS~ell~d~-~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~ 304 (497) T protein:vir:10 227 YEQVGKVAN-ALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) T ss_pred EeeeeeeEe-ecHhHHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhh Confidence 777655422 345665433333 3476654 568899999999887631 10000000000 Q ss_pred ------------------------------------------ccCCHhHHHHHHHHHHHHHhhcCC-CcCCcEEEECHHH Q lcl|Aclame:pro 133 ------------------------------------------APSDADDAFDLIASALKELTKANV-PNVGRVVVVNAEM 169 (273) Q Consensus 133 ------------------------------------------~~~~~~~~~~~i~~a~~~l~~~~v-p~~~r~lvv~p~~ 169 (273) ...+.......+..+...+..... +.. ..+++|.. T Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~vmn~~~ 382 (497) T protein:vir:10 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPN--AVVMNPRD 382 (497) T ss_pred hhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCC--eEEEchHH Confidence 000001111222222222222211 111 47899999 Q ss_pred HHHHhcchHHhhhhhcc---cccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCceEEEEEecc-eeeeccC-CCc Q lcl|Aclame:pro 170 AFWLRSSGSKLTSADTS---GDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQID-TVEALRD-QDS 244 (273) Q Consensus 170 ~~~L~~~~~~~~~~~~~---~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a~~~~~~~~-~ve~~~~-~~~ 244 (273) +..|.+..+.-.+.-.. +.......+.-..+.|.+|+.++.+|.+.. .+-.+...++....+.. .++.... ... T Consensus 383 ~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~-~~Gd~~~~~~~i~~r~~~~v~~~~~~~~~ 461 (497) T protein:vir:10 383 WELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTI-LVGHFAPSVIQTARREGVTMQMTNSNGTD 461 (497) T ss_pred HHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCCce-EEeecccceEEEEEecccEEEeecccchh Confidence 99886543211111000 000000112234789999999999986542 11123334455444332 3332221 121 Q ss_pred c---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 245 F---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 245 ~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) | -..|++.+++|+.+.+|+++++++-+.+ T Consensus 462 f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~ 493 (497) T protein:vir:10 462 FVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) T ss_pred hhcCcEEEEEEEeecceeeccccEEEEEecCC Confidence 2 3578899999999999999999854444 No 157 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=99.03 E-value=2.7e-10 Score=73.07 Aligned_cols=263 Identities=11% Similarity=0.069 Sum_probs=154.1 Q ss_pred Cccc----chhHH--HHHHHHHHHHHHhhcc--chhhhcccccc---ccCCcEEEEEecccc-cccc--ccCCC--CccC Q lcl|Aclame:pro 1 MAFN----NFIPE--LWSDMLLEEWTAQTVF--ANLVNREYEGI---ASKGNVVHIAGVVAP-TVKD--YKAAG--RQTS 64 (273) Q Consensus 1 MA~~----~~~pe--v~~~~v~~~l~~~~v~--~~~~~~d~~~~---~~~Gdtv~ip~~~~~-~~~d--~~~~~--~~~~ 64 (273) ||.+ .++|| +|..-+.+.-.+...| .+++..+-++. ...|+.+++|.|+.+ +..+ |.... ..++ T Consensus 1 Ma~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~l~g~~e~n~~~dt~~~~~t 80 (349) T protein:vir:94 1 MAITTIGNIVTGNIPVLASYMTEDPVEKTAFFNSGILTPTPYAAEIARGPSNIANLPFWKAIDTSIEPNYSNDVYQDIAT 80 (349) T ss_pred CCceEEeeeeccChHHHHHHHHHhHHHhhhhhhccceeccHHHHHHHhcCCCEEEeeeeecCCCCcccccCCCCcccccc Confidence 9964 56787 7999888877554433 34555554443 245999999999886 4434 22211 1345 Q ss_pred CcccccceEEEEEEeeeeceeEechHHHHHhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhc------ccc----c--- Q lcl|Aclame:pro 65 ADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADMLVDNG------TAL----T--- 130 (273) Q Consensus 65 ~~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~iD~~~~~~~~~~~------~~~----~--- 130 (273) +..++..+....+ .++..+|..+|.-...+-.+ ++.+.++.+.--.+...+.+++.+...= ... . T Consensus 81 ~~kit~~~~~a~~-~~r~kaw~~~Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~~~~~~~~~~~~~ 159 (349) T protein:vir:94 81 PRAIQTGEMMARV-AYLNEGFGQADLTVELTSQNPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSATDAYHEQNDMV 159 (349) T ss_pred cccccccceeeee-eeeccccchhHHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHHHhhhcccccccccccccCcee Confidence 5565555544443 55677777777654433333 5566666666666666666777664310 000 0 Q ss_pred ccccCCHhHHHHHHHHHHHHHhhcCC--CcCC-cEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEE Q lcl|Aclame:pro 131 GSAPSDADDAFDLIASALKELTKANV--PNVG-RVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVE 207 (273) Q Consensus 131 ~~~~~~~~~~~~~i~~a~~~l~~~~v--p~~~-r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~ 207 (273) .....+.....+.+.+|...|.++.. ..+. ..++++|.++..|.+.. .+. +... .-+...|+.+.|..|++ T Consensus 160 ~d~~~~a~~~~~~~~~A~~~~Gdaa~Gd~~~~lt~i~mHS~v~~~L~~~~-li~---~i~~--s~~~~~i~ty~G~~Viv 233 (349) T protein:vir:94 160 VDVSATSGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQ-LID---FIRD--AENNTMFATYQGYRVIV 233 (349) T ss_pred EEecccCCCChhhHHHHHHHHHHHhccccccceeEEEEchHHHHHHHhcc-hhh---hccC--cccCcccceecCcEEEE Confidence 00011111223456666666655421 1111 35789999999997653 221 1111 11244588999999999 Q ss_pred ecccccCC-----CcEEEEEeCceEEEEEecc--eeeeccCCCcc----eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 208 SNNLRDTD-----DEQFVAFHPSAAAYVSQID--TVEALRDQDSF----SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 208 s~~l~~~~-----~~~~~~~~~~a~~~~~~~~--~ve~~~~~~~~----~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ++.+|... .++++++-++|+++..... .+|..|++.+. .|.+..+.+| ++.|-|+.-..+..+ T Consensus 234 DD~~Pv~~~g~~~~yttylfg~GAi~~~~~~~~~~~E~~rd~~~g~~~G~d~L~~R~~~---~~hp~G~s~~~a~v~ 307 (349) T protein:vir:94 234 DDSMTVVGQDTSRKFISIIFGQGAIGYGEGNPEMPLEYEREASRANGGGVETLWTRKTW---LLHPFGYSFTSAVIT 307 (349) T ss_pred eCCCccccCCCCceEEEEEeecceEEeecCCCCcceeeecccccCCcceeEEEEEeeEE---EeeeeeeeecccccC Confidence 99998532 2456788899999987653 47888887643 3666666654 566666665544321 No 158 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=99.00 E-value=4.6e-10 Score=71.77 Aligned_cols=257 Identities=12% Similarity=0.007 Sum_probs=144.7 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCC-cccccceE Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSA-DAISDTGV 73 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~-~~~~~~~~ 73 (273) |+. ..++|+.+...+++.++...++..++..-. ......+..+|+....+.+....++..... +.++.+.+ T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~--~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v 183 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEP--VRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNV 183 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeee--ccCCceeEEEEeecCCccceeecccccccccccccceeE Confidence 332 247899999999999999888877764311 111112345555554444455666665543 23566666 Q ss_pred EEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHH-HHH Q lcl|Aclame:pro 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASAL-KEL 151 (273) Q Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~-~~l 151 (273) ++...+. +.-+.|++.-..++..++..+ .+..+++++.++|..++.-.... ...+ ...+++|.++. ..| T Consensus 184 ~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~----~~~~----~~~~d~i~~~~~~~l 254 (392) T protein:vir:10 184 QYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKL----TKQA----IKSLDDIKDVLNVKL 254 (392) T ss_pred EeeeeeE-EEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc----cccC----ccCHHHHHHHHHHhh Confidence 6666443 445577776555555667664 45678899999998887543211 1111 12245566554 344 Q ss_pred hhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEe--ccccc----CCCcE-EEEEeC Q lcl|Aclame:pro 152 TKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVES--NNLRD----TDDEQ-FVAFHP 224 (273) Q Consensus 152 ~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s--~~l~~----~~~~~-~~~~~~ 224 (273) ..... .+-.++++|..+..|.+..+.-.+ .. ....+..|.-+.++|.+++.. +..+. ..+.. ++.+.- T Consensus 255 ~~~~~--~~a~~vm~~~~~~~L~~lkd~~G~--~l-~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdf 329 (392) T protein:vir:10 255 DPAIS--PNAILLTNQDGFNYLDKLKDKDGK--YI-LQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDL 329 (392) T ss_pred hhhhc--cCCEEEEcHHHHHHHHHhhccCCC--eE-eecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEeh Confidence 44332 234689999999999653221111 00 011223455678999876652 22221 11222 333332 Q ss_pred c-eEEEEEe-cceeeeccCC-Ccc---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 225 S-AAAYVSQ-IDTVEALRDQ-DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 225 ~-a~~~~~~-~~~ve~~~~~-~~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) + ++....+ ...++..+.. ..| ...+++.+++|..+++|++++.++-+.+ T Consensus 330 s~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 384 (392) T protein:vir:10 330 KEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) T ss_pred hceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeccc Confidence 2 3333322 2233333221 112 3568999999999999999999877766 No 159 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=99.00 E-value=4.6e-10 Score=71.77 Aligned_cols=257 Identities=12% Similarity=0.007 Sum_probs=144.7 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCC-cccccceE Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSA-DAISDTGV 73 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~-~~~~~~~~ 73 (273) |+. ..++|+.+...+++.++...++..++..-. ......+..+|+....+.+....++..... +.++.+.+ T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~--~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v 183 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEP--VRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNV 183 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeee--ccCCceeEEEEeecCCccceeecccccccccccccceeE Confidence 332 247899999999999999888877764311 111112345555554444455666665543 23566666 Q ss_pred EEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHH-HHH Q lcl|Aclame:pro 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASAL-KEL 151 (273) Q Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~-~~l 151 (273) ++...+. +.-+.|++.-..++..++..+ .+..+++++.++|..++.-.... ...+ ...+++|.++. ..| T Consensus 184 ~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~----~~~~----~~~~d~i~~~~~~~l 254 (392) T protein:vir:10 184 QYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKL----TKQA----IKSLDDIKDVLNVKL 254 (392) T ss_pred EeeeeeE-EEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc----cccC----ccCHHHHHHHHHHhh Confidence 6666443 445577776555555667664 45678899999998887543211 1111 12245566554 344 Q ss_pred hhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEe--ccccc----CCCcE-EEEEeC Q lcl|Aclame:pro 152 TKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVES--NNLRD----TDDEQ-FVAFHP 224 (273) Q Consensus 152 ~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s--~~l~~----~~~~~-~~~~~~ 224 (273) ..... .+-.++++|..+..|.+..+.-.+ .. ....+..|.-+.++|.+++.. +..+. ..+.. ++.+.- T Consensus 255 ~~~~~--~~a~~vm~~~~~~~L~~lkd~~G~--~l-~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdf 329 (392) T protein:vir:10 255 DPAIS--PNAILLTNQDGFNYLDKLKDKDGK--YI-LQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDL 329 (392) T ss_pred hhhhc--cCCEEEEcHHHHHHHHHhhccCCC--eE-eecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEeh Confidence 44332 234689999999999653221111 00 011223455678999876652 22221 11222 333332 Q ss_pred c-eEEEEEe-cceeeeccCC-Ccc---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 225 S-AAAYVSQ-IDTVEALRDQ-DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 225 ~-a~~~~~~-~~~ve~~~~~-~~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) + ++....+ ...++..+.. ..| ...+++.+++|..+++|++++.++-+.+ T Consensus 330 s~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 384 (392) T protein:vir:10 330 KEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) T ss_pred hceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeccc Confidence 2 3333322 2233333221 112 3568999999999999999999877766 No 160 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=99.00 E-value=4.6e-10 Score=71.77 Aligned_cols=257 Identities=12% Similarity=0.007 Sum_probs=144.7 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCC-cccccceE Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSA-DAISDTGV 73 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~-~~~~~~~~ 73 (273) |+. ..++|+.+...+++.++...++..++..-. ......+..+|+....+.+....++..... +.++.+.+ T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~--~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v 183 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEP--VRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNV 183 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeee--ccCCceeEEEEeecCCccceeecccccccccccccceeE Confidence 332 247899999999999999888877764311 111112345555554444455666665543 23566666 Q ss_pred EEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHH-HHH Q lcl|Aclame:pro 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASAL-KEL 151 (273) Q Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~-~~l 151 (273) ++...+. +.-+.|++.-..++..++..+ .+..+++++.++|..++.-.... ...+ ...+++|.++. ..| T Consensus 184 ~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~----~~~~----~~~~d~i~~~~~~~l 254 (392) T protein:vir:10 184 QYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKL----TKQA----IKSLDDIKDVLNVKL 254 (392) T ss_pred EeeeeeE-EEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc----cccC----ccCHHHHHHHHHHhh Confidence 6666443 445577776555555667664 45678899999998887543211 1111 12245566554 344 Q ss_pred hhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEe--ccccc----CCCcE-EEEEeC Q lcl|Aclame:pro 152 TKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVES--NNLRD----TDDEQ-FVAFHP 224 (273) Q Consensus 152 ~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s--~~l~~----~~~~~-~~~~~~ 224 (273) ..... .+-.++++|..+..|.+..+.-.+ .. ....+..|.-+.++|.+++.. +..+. ..+.. ++.+.- T Consensus 255 ~~~~~--~~a~~vm~~~~~~~L~~lkd~~G~--~l-~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdf 329 (392) T protein:vir:10 255 DPAIS--PNAILLTNQDGFNYLDKLKDKDGK--YI-LQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDL 329 (392) T ss_pred hhhhc--cCCEEEEcHHHHHHHHHhhccCCC--eE-eecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEeh Confidence 44332 234689999999999653221111 00 011223455678999876652 22221 11222 333332 Q ss_pred c-eEEEEEe-cceeeeccCC-Ccc---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 225 S-AAAYVSQ-IDTVEALRDQ-DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 225 ~-a~~~~~~-~~~ve~~~~~-~~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) + ++....+ ...++..+.. ..| ...+++.+++|..+++|++++.++-+.+ T Consensus 330 s~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 384 (392) T protein:vir:10 330 KEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) T ss_pred hceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeccc Confidence 2 3333322 2233333221 112 3568999999999999999999877766 No 161 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=99.00 E-value=4.6e-10 Score=71.77 Aligned_cols=257 Identities=12% Similarity=0.007 Sum_probs=144.7 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCC-cccccceE Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSA-DAISDTGV 73 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~-~~~~~~~~ 73 (273) |+. ..++|+.+...+++.++...++..++..-. ......+..+|+....+.+....++..... +.++.+.+ T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~--~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v 183 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEP--VRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNV 183 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeee--ccCCceeEEEEeecCCccceeecccccccccccccceeE Confidence 332 247899999999999999888877764311 111112345555554444455666665543 23566666 Q ss_pred EEEEEeeeeceeEechHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHHHHHHH-HHH Q lcl|Aclame:pro 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASAL-KEL 151 (273) Q Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~-~~l 151 (273) ++...+. +.-+.|++.-..++..++..+ .+..+++++.++|..++.-.... ...+ ...+++|.++. ..| T Consensus 184 ~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~----~~~~----~~~~d~i~~~~~~~l 254 (392) T protein:vir:10 184 QYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKL----TKQA----IKSLDDIKDVLNVKL 254 (392) T ss_pred EeeeeeE-EEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc----cccC----ccCHHHHHHHHHHhh Confidence 6666443 445577776555555667664 45678899999998887543211 1111 12245566554 344 Q ss_pred hhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEe--ccccc----CCCcE-EEEEeC Q lcl|Aclame:pro 152 TKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVES--NNLRD----TDDEQ-FVAFHP 224 (273) Q Consensus 152 ~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s--~~l~~----~~~~~-~~~~~~ 224 (273) ..... .+-.++++|..+..|.+..+.-.+ .. ....+..|.-+.++|.+++.. +..+. ..+.. ++.+.- T Consensus 255 ~~~~~--~~a~~vm~~~~~~~L~~lkd~~G~--~l-~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdf 329 (392) T protein:vir:10 255 DPAIS--PNAILLTNQDGFNYLDKLKDKDGK--YI-LQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDL 329 (392) T ss_pred hhhhc--cCCEEEEcHHHHHHHHHhhccCCC--eE-eecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEeh Confidence 44332 234689999999999653221111 00 011223455678999876652 22221 11222 333332 Q ss_pred c-eEEEEEe-cceeeeccCC-Ccc---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 225 S-AAAYVSQ-IDTVEALRDQ-DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 225 ~-a~~~~~~-~~~ve~~~~~-~~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) + ++....+ ...++..+.. ..| ...+++.+++|..+++|++++.++-+.+ T Consensus 330 s~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 384 (392) T protein:vir:10 330 KEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) T ss_pred hceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeccc Confidence 2 3333322 2233333221 112 3568999999999999999999877766 No 162 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=98.77 E-value=1.7e-09 Score=68.69 Aligned_cols=259 Identities=14% Similarity=0.050 Sum_probs=139.8 Q ss_pred Ccc-cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccceEEEEEEe Q lcl|Aclame:pro 1 MAF-NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQ 79 (273) Q Consensus 1 MA~-~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid~ 79 (273) -+. ..++|+.+...+.+.+.....+.+++... ...| ++++|.-+..+.+....++..+...+++.+.+++.+.+ T Consensus 154 ~~g~~~~vP~~~~~~i~~~l~~~~~l~~~~~v~----~~~g-~~~~~~~~~~~~a~wv~E~~~~~~~~~~f~~i~~~~~k 228 (466) T protein:vir:80 154 VSGAELTIPDVMLELLRDNMHRYSKLISKVRLR----PLKG-TARQNIAGAIPEGVWTEAVANLNELSLSFSQIEVDGYK 228 (466) T ss_pred hccccccccHHHHHHHHHhhhhhhhhhhheeee----ecCc-eeEeeeecCCcceeecccccccccccccccceeeccee Confidence 111 25689999999999888877776666422 1123 45677666555555566777666556667777766654 Q ss_pred eeeceeEechHHHHHhHHHHHHHH-HHHHHHHHHHHHHHHHHH-HHhhccc--------ccccc---------cCCHhH- Q lcl|Aclame:pro 80 EKSIDFLVDDIDRVQVAGSLEAYT-RAGATALATDTDKFIADM-LVDNGTA--------LTGSA---------PSDADD- 139 (273) Q Consensus 80 ~~~~~~~i~d~d~~~~~~~~~~~~-~~~~~ala~~iD~~~~~~-~~~~~~~--------~~~~~---------~~~~~~- 139 (273) . +.-+.|++.-..++..++.+++ +..+++++..+|..++.= ....+.+ ..... ..+... T Consensus 229 ~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~ail~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~ 307 (466) T protein:vir:80 229 V-GGFIPIPNSTLEDSDLNLADEILDAIGQAIGFALDKAILYGTGTKMPVGIVTRLAQTTQPPNWGTKAPAWTNLSTTNL 307 (466) T ss_pred e-eeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhheeeccCCCCcceeeecccccccccccccccccccccchhhh Confidence 4 3345677765656667787755 468899999999877641 0000000 00000 000000 Q ss_pred ---------HHHHHHHHHHHH--hhcCCCcCCc-EEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEE Q lcl|Aclame:pro 140 ---------AFDLIASALKEL--TKANVPNVGR-VVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVE 207 (273) Q Consensus 140 ---------~~~~i~~a~~~l--~~~~vp~~~r-~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~ 207 (273) ....+.+....+ .+.+. ..++ +.++++..+..|.+..... + ..|. -+..-+.-..+.|.+|+. T Consensus 308 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~w~~~~~~~~~l~~~~~~~-~--~~g~-~~~~~~~~~~i~G~pvv~ 382 (466) T protein:vir:80 308 LKIDPTGKSAEEFFSELVLKLSKARANY-SNGMKFWAMSSNTHAVLMSKAITF-N--SAGA-LVASLNNTMPIVGGDIVI 382 (466) T ss_pred hhhhhhccchhhHHHHHHHHHHhhhccc-cCCceeEEecchhHHHhhcccccc-c--CCcc-ccccCCCcccccccceee Confidence 000111111111 11111 2333 4578888888886543111 1 0010 000001112588999999 Q ss_pred ecccccCCCcEEEEEeCceEEEEEec-ceeeeccCCCc--ceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 208 SNNLRDTDDEQFVAFHPSAAAYVSQI-DTVEALRDQDS--FSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 208 s~~l~~~~~~~~~~~~~~a~~~~~~~-~~ve~~~~~~~--~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ++.+|.+. .+.+..+.+.+..+. ..++....... =.+.+++.+++|+++.+|++++.++-+.- T Consensus 383 s~~~~~~~---~~~g~~~~y~i~~r~~~~i~~~~~~~f~~d~~~~r~~~r~dg~~~~~~afv~~~~~~~ 448 (466) T protein:vir:80 383 LDFIPDND---IIGGYGSLYLLAERADIKLAQSEHVRFIEDQTVFKGTARYDGKPVFGEGFVAVNIANA 448 (466) T ss_pred cCccCccc---eeeeccccEEEEeecceEEEechhhhhhcCcEEEEEEEEEccEEeccCceEEEEecCC Confidence 99988754 444444544444332 23333222211 13579999999999999999998842222 No 163 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=98.75 E-value=1.8e-09 Score=68.54 Aligned_cols=269 Identities=13% Similarity=0.095 Sum_probs=143.0 Q ss_pred CcccchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccc-cccCCCCccCCc------------- Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVK-DYKAAGRQTSAD------------- 66 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~-d~~~~~~~~~~~------------- 66 (273) |+.++ ..--|-..++.-..+.+++..+++. +..-.+.|+||.+.+...+..+ ....+|.+...+ T Consensus 19 ~~~~~-~t~y~~~k~L~~Aa~~lv~~~fA~~-~piPkn~GkTIk~r~y~pl~~~~~pl~eGv~a~G~~~~~g~~y~~~rd 96 (401) T protein:vir:95 19 NSDQM-QTFFWLKKAIITARKEQYFMPLASV-TNMPKHYGKTIKVYEYVPLLDDRNINDQGIDASGATIVNGNLYGSSKD 96 (401) T ss_pred cccee-eehhhHHHHHhhhhhhhhhhhcccc-cccccccCCeEEEEecccccccccchhcCCCcccccccCccccccccc Confidence 22221 0112444455455556888888853 2223456999999998775542 222333222221 Q ss_pred ---------------------ccccceEEEEEEeeeeceeEechHHHHHhHHH-HHHHH-HHHHH-H---HHHHHHHHHH Q lcl|Aclame:pro 67 ---------------------AISDTGVDLLIDQEKSIDFLVDDIDRVQVAGS-LEAYT-RAGAT-A---LATDTDKFIA 119 (273) Q Consensus 67 ---------------------~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~-~~~~~-~~~~~-a---la~~iD~~~~ 119 (273) ..+-.++..+|.|+ ++=..++|+-..-..+. +..++ +++.. + -...+-++++ T Consensus 97 v~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qy-G~~~e~Td~~~dt~~D~~l~~h~s~ell~g~~~~t~d~i~~dll 175 (401) T protein:vir:95 97 IGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKF-GFFYEFTQESIDFDSDDGLMEHLSRELMNGATQITEAVLQKDLL 175 (401) T ss_pred cceeecccccccccccccccccceeeeeeeeeeec-cCccchhhhhhhhhcchHHHHHHHHHHhhhhhhhHHHHHHHHHH Confidence 11112334445443 22235555433222222 33322 22211 1 1122233333 Q ss_pred HHH----Hh-hccccc---ccccCCHhHHHHHHHHHHHHHhhcCCCc-----------------CCcEEEECH------H Q lcl|Aclame:pro 120 DML----VD-NGTALT---GSAPSDADDAFDLIASALKELTKANVPN-----------------VGRVVVVNA------E 168 (273) Q Consensus 120 ~~~----~~-~~~~~~---~~~~~~~~~~~~~i~~a~~~l~~~~vp~-----------------~~r~lvv~p------~ 168 (273) +.. ++ +++..+ .....+..-.++.+..+...|+++..|. ..|+++++| . T Consensus 176 ~ag~~viyAg~ats~At~~~~~~~~t~vt~~~l~rl~~~L~~nRapk~t~~i~~s~~~dTk~i~~s~va~~h~~L~~di~ 255 (401) T protein:vir:95 176 AAAGTVLYAGAATSDATITGEGSTPSVVSYKNLMRLDQILTENRTPTQTTIITGSRMIDTKVIGATRVMYVGSELVPELK 255 (401) T ss_pred hhcCeeecCCccceeeeccccccccceechhHHHHHHHHHHhcccccchhhhhhhhccCccccccceEEEEecCchhHHH Confidence 211 00 001000 1112223335788999999999877665 235789999 4 Q ss_pred HHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccc--------cCCC-----------------cEEEEEe Q lcl|Aclame:pro 169 MAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLR--------DTDD-----------------EQFVAFH 223 (273) Q Consensus 169 ~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~--------~~~~-----------------~~~~~~~ 223 (273) ...+|++++. |.....+++...+.+|.||++-+|.++.++.+. .... +..+++- T Consensus 256 a~~D~~~~~~-fi~v~kYa~~~~i~~gEiG~i~~vR~i~~p~~~~w~~ag~~a~~~~~~y~~~~~~~gg~~dVyp~lV~G 334 (401) T protein:vir:95 256 AMKDLFGNKA-FIETQHYADAGTIMNGEVGSIDKFRIIQVPEMLHWAGAGAQATGANPGYRTSMVSGQEHYDVYPMLVVG 334 (401) T ss_pred HHHHhcCCCC-ceehhhcCCccccccccccccCceeEEecccceeecCCcccccccccccccccccCCCcceeeeeeEEc Confidence 4456666665 566777888888999999999999999987643 1100 1123344 Q ss_pred CceEEEEE----------ec-ceee----ec-cCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 224 PSAAAYVS----------QI-DTVE----AL-RDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 224 ~~a~~~~~----------~~-~~ve----~~-~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ..|++..- ++ .+-. +. -|+-.+.=.+..++.|++.+|+|+.++.|+...- T Consensus 335 ~dAf~~~~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQ~g~vgwK~~~a~~vL~~e~m~~ies~a~ 400 (401) T protein:vir:95 335 DDSFTSIGFQTDGKSLKFTVMTKMPGKETADRNDPYGETGFSSIKWYYGILVKRPERLALIKTVAP 400 (401) T ss_pred cccceecccccCCccccceeEeecCCcCCCCCCCcccceehhhhhhhhhhheeccceeEEEEeecC Confidence 55555321 10 0000 11 2333455578889999999999999999987777 No 164 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=98.75 E-value=1.5e-09 Score=69.02 Aligned_cols=267 Identities=12% Similarity=0.099 Sum_probs=160.4 Q ss_pred Ccc---cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccc-cceEEEE Q lcl|Aclame:pro 1 MAF---NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAIS-DTGVDLL 76 (273) Q Consensus 1 MA~---~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~-~~~~~~t 76 (273) |+- ++++|.++...+.+.-+.-..-.+++..- ..+.|.+..||.++.+-.-+ .++|+++.-+.++ .+.-.++ T Consensus 74 mtt~~a~IliP~vis~v~~Eaaepl~~~~kl~qk~---~L~~Grsm~F~~~g~~Ra~~-IgEGgE~~~~sld~~T~dsv~ 149 (393) T protein:vir:79 74 MATPSAQILIPRVIVGTMREAAEPLYIGTKMLQKI---RLKSGQSMIFPSIGIMRAYD-VAEGQEIPEDSIDWQTHESPE 149 (393) T ss_pred hcCCCcceechhhhhhhhhhcccchhHHHHHHHHH---hhhcCcceeccchheeeecc-ccccccccccchhhhcCCcee Confidence 765 68899999998888555443334444321 23458888999998644332 4667777766666 3334556 Q ss_pred EEeee-eceeEechHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcccc---------ccccc------CCHhH Q lcl|Aclame:pro 77 IDQEK-SIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADMLVDNGTAL---------TGSAP------SDADD 139 (273) Q Consensus 77 id~~~-~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~iD~~~~~~~~~~~~~~---------~~~~~------~~~~~ 139 (273) +.+.| +..+.+++.-...+--++ .-.++++.++|+++.+.-++......++.+ ...++ -.++. T Consensus 150 ~~~gK~G~~Ia~SqEmIsDSg~Dvin~~l~aA~RaMaRkKee~a~n~fk~~ghtvfDa~st~t~ahptGr~~~~~qNGTl 229 (393) T protein:vir:79 150 IRVGKSGIRLRFTDEMISDSQWDLMSMMIKQAGRAMGRHKEQKAYHQFRSHGHTVFDNYSTNKLAHTTGLDKNGVQNDTF 229 (393) T ss_pred EEechhhhhhhhHHHHhhcchHHHHHHHHHHHHHHHHhhhHHHHHhhhhcccceeeeccccCccceeecCCccccccccc Confidence 65544 345666665444444454 567899999999999999998886654411 00111 12233 Q ss_pred HHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchH-------Hhhhhhcccccceeeee---eeeee-cceEEEEe Q lcl|Aclame:pro 140 AFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGS-------KLTSADTSGDAAGLRAG---TIGNL-LGARIVES 208 (273) Q Consensus 140 ~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~-------~~~~~~~~~~~~~~~~G---~ig~i-~G~~i~~s 208 (273) .+++|.++....-.+.. .+-+++++|-.|+.+.+... .+.|.+..+-......| .-+++ +.++|+.| T Consensus 230 SleDllDm~~av~~~hy--t~svi~MHPLAWnv~AKna~me~~~~na~gN~~~~~~~ts~algp~~i~~~~~~nlnv~~s 307 (393) T protein:vir:79 230 SAEDFLDLIIAVMANEY--TPSDLMMHPLAWTVFAKNELMGSLQANPYGNYPAKGAPSSMALGPDSIQGRLPFNFNVNLS 307 (393) T ss_pred cHHHHHHHHHHHhcccC--CcceEEEcCchhhhhhhhhhhcceeeccccccCccccchhhhhchhhhccccccceeEEEe Confidence 56677776655444443 34479999999998865421 11122111111111111 01121 34899999 Q ss_pred cccccCCCc---EEEEEeCceEEE--EEecceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 209 NNLRDTDDE---QFVAFHPSAAAY--VSQIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 209 ~~l~~~~~~---~~~~~~~~a~~~--~~~~~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) +-+|--... .++++.++..+. ++-..+++.+.+.-+--+-++-..+||.+||+....+..-+.=| T Consensus 308 Pfvp~d~k~~rFd~~~Vd~NnvgvlLV~D~i~tdq~ddk~rdiq~iKl~ERYG~gvLn~gkaiavakNI~ 377 (393) T protein:vir:79 308 PFIPLDKKSRRFDVYAVDRNNVGVLLVRDDLKTDQWDEKARGLQNIKMIERYGIGILNEGKAIAVAKNIS 377 (393) T ss_pred cccccccccceeeEEEeecCCceEEEEecCcceeccccccccceeeeeeeeeceeeeeCCceEEEEecce Confidence 988843332 345666666654 34344667666666555789999999999999987765544444 No 165 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=98.74 E-value=6.5e-09 Score=65.49 Aligned_cols=253 Identities=12% Similarity=0.042 Sum_probs=144.4 Q ss_pred Ccc-cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCC-cccccceEEEEEE Q lcl|Aclame:pro 1 MAF-NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSA-DAISDTGVDLLID 78 (273) Q Consensus 1 MA~-~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~-~~~~~~~~~~tid 78 (273) =+. -.++|+-+..++.+.+.+...+.+++++-. ..| ...+|+....+.+....++..+.. .+.+.+.+++... T Consensus 84 ~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~----~~~-~~~i~~~~~~~~a~wv~e~~~~~~~~~~~f~~i~l~~~ 158 (377) T protein:vir:96 84 GKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKN----TSL-RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQF 158 (377) T ss_pred CCCCceecCHHHHHHHHHHHHhhhhhhhhceeEe----cCC-ceEEEEecCCcceeEeecccccccccCccceeEeeeee Confidence 111 357899999999999999888888875421 123 467887666555555555554433 2445555555553 Q ss_pred eeeeceeEechHHHHHhHHHHHHHH-HHHHHHHHHHHHHHHHH-HHHhhccc--------c------------------- Q lcl|Aclame:pro 79 QEKSIDFLVDDIDRVQVAGSLEAYT-RAGATALATDTDKFIAD-MLVDNGTA--------L------------------- 129 (273) Q Consensus 79 ~~~~~~~~i~d~d~~~~~~~~~~~~-~~~~~ala~~iD~~~~~-~~~~~~~~--------~------------------- 129 (273) +. +.-+.|+..-..++..++++++ ++.+++++..+|..++. .....+.+ . T Consensus 159 kl-~~~~~is~~ll~ds~~~le~~i~~~l~~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~ 237 (377) T protein:vir:96 159 KL-TAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAI 237 (377) T ss_pred eE-EeechhhHHHhhcchhhHHHHHHHHHHHHHHHHHhhceEeccCCCcceeeeeccccccccccccccccceeeccccc Confidence 33 3335677665556677787755 56788999999987763 11000000 0 Q ss_pred cccccCCHhHHHHHHHHHHHHHhhcC--CCc---CCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecc-- Q lcl|Aclame:pro 130 TGSAPSDADDAFDLIASALKELTKAN--VPN---VGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLG-- 202 (273) Q Consensus 130 ~~~~~~~~~~~~~~i~~a~~~l~~~~--vp~---~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G-- 202 (273) ...+..+.+...+.+..+...+...+ -|. .+-+++++|..+..+.... .+.+ . +|....++| T Consensus 238 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~-~~~~--~--------~G~~~~~l~~p 306 (377) T protein:vir:96 238 ADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKF-TSRN--Q--------FGEYVTVLPHG 306 (377) T ss_pred cccccCChhHHHHHHHHHHHhhccccccccccccCceEEEEchhhHHhccccc-cccC--C--------CCCceeccCCC Confidence 00011233344444444444443322 111 2235789999887764321 1211 1 234344544 Q ss_pred eEEEEecccccCCCcEEEEEeCceEEEEEec-ceeeeccCCC--cceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 203 ARIVESNNLRDTDDEQFVAFHPSAAAYVSQI-DTVEALRDQD--SFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 203 ~~i~~s~~l~~~~~~~~~~~~~~a~~~~~~~-~~ve~~~~~~--~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) +.++.|+.+|.+. ++.+..+-..+..+. ..++..++.. +-.+.+++.+++|.++++|++++++.-++- T Consensus 307 ~~v~~s~~~p~~~---i~fgdf~~Y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dG~~~d~~a~~vl~l~~~ 377 (377) T protein:vir:96 307 ITILESLAVETGK---AIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred ceEEecCCCCccc---EEEEEcCcEEEEEecccEEEeehhhhhhcCCeEEEEEEEEcCEEecCCcEEEEEEecC Confidence 5678888888643 333433434444433 2343332211 123689999999999999999999987777 No 166 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=98.67 E-value=3.5e-08 Score=61.48 Aligned_cols=262 Identities=12% Similarity=0.086 Sum_probs=134.5 Q ss_pred Cc-----ccchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccC--CCCccCCcccccceE Q lcl|Aclame:pro 1 MA-----FNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKA--AGRQTSADAISDTGV 73 (273) Q Consensus 1 MA-----~~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~--~~~~~~~~~~~~~~~ 73 (273) |. .-+..|.-++..+.+.+.+...+.+.++.-. ....+..||.++..+...... .+......+++.+.+ T Consensus 19 ~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~----v~~~~~~i~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~ 94 (321) T protein:vir:31 19 LTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTET----VGAKKTRIPTLNIGERHRRPQDEGEWNENESDVSTGTI 94 (321) T ss_pred ccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeee----ccCcceeeeeeccCCcccccccccccccccccceeeee Confidence 21 1223344455667777887777777665321 223345667665432221121 122233445667777 Q ss_pred EEEEEeeeeceeEechHHHHHhH--HHHHHH-HHHHHHHHHHHHHHHHHHH-HHhh--------------cccccccccC Q lcl|Aclame:pro 74 DLLIDQEKSIDFLVDDIDRVQVA--GSLEAY-TRAGATALATDTDKFIADM-LVDN--------------GTALTGSAPS 135 (273) Q Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~--~~~~~~-~~~~~~ala~~iD~~~~~~-~~~~--------------~~~~~~~~~~ 135 (273) ++.+.+. ..-+.|+..-..... .++++. .+..+++++..++...+.= -... .......... T Consensus 95 ~~~~~k~-~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~~~~~~n~G~l~~a~~~~~~~~~~ 173 (321) T protein:vir:31 95 DISTEKA-TVAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAANGDEDAEDSFENQNDGFITVAEGDVETIDAA 173 (321) T ss_pred eeeeEEE-EeehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheeeccccCCCcccccchhhhhhhcccccccccc Confidence 7777554 345567665444332 356554 4456777777776644411 0000 0001001111 Q ss_pred CHhHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCC Q lcl|Aclame:pro 136 DADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTD 215 (273) Q Consensus 136 ~~~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~ 215 (273) +....++.|.++...|+...--..+-+.+++++.+..++.- +.+.........+..|...+++|++++.++.+|... T Consensus 174 ~~~~~~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~---l~~~~~~~~~~~l~~~~~~tl~G~pvv~~~~mP~~~ 250 (321) T protein:vir:31 174 DDILDNDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYHYT---LTDRDTPLGDNVIMGEADVNPFSFPIIGSGLWPDDK 250 (321) T ss_pred ccccCHHHHHHHHHhccHhHhcCCCeEEEechHHHHHHHHH---HhcCCCccccchhhccccccccceeEEEcCCCCCCc Confidence 22233566777777766543211233678999987665431 111111112334555666689999999999998754 Q ss_pred CcEEEEEeCceEEEEE-ecceeeeccCCCc---cee--eEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 216 DEQFVAFHPSAAAYVS-QIDTVEALRDQDS---FSD--RIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 216 ~~~~~~~~~~a~~~~~-~~~~ve~~~~~~~---~~~--~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) +++.+.+-+.+.. +...++..++... ..+ .......+|+.+-++++++.+.-=.- T Consensus 251 ---il~t~~~nl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ve~~~a~a~~~~i~~ 311 (321) T protein:vir:31 251 ---AMFTDPQNLIYALYRDLEIDVLTESDKVSERDLHARYFMRGDDDFAIENTEAVVLAEGLGD 311 (321) T ss_pred ---EEEeccccEEEEEeeccEEEEeecCccccccceeeEeeeeeecceeEeccccEEEEecCCc Confidence 5666666655433 3334444433221 112 22334457777788888888763222 No 167 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=98.66 E-value=1.1e-08 Score=64.15 Aligned_cols=253 Identities=10% Similarity=0.026 Sum_probs=142.8 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCc-ccccceE Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSAD-AISDTGV 73 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~-~~~~~~~ 73 (273) |.- -.+.|+-+...+.+.+.+...+.+++++- ...|+ ..||+....+.+....++..+..+ +.+.+.+ T Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~----~~~~~-~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i 150 (381) T protein:vir:10 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIK----NAGLR-LKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) T ss_pred HhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeE----ecCcc-eEEEEecCCcceeeecccccccccccccceee Confidence 111 26789999999999999998888887542 22343 578887665655555555444322 3455555 Q ss_pred EEEEEeeeeceeEechHHHHHhHHHHHHHH-HHHHHHHHHHHHHHHHH-HHHhhccc-----------ccc--------- Q lcl|Aclame:pro 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAYT-RAGATALATDTDKFIAD-MLVDNGTA-----------LTG--------- 131 (273) Q Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~~-~~~~~ala~~iD~~~~~-~~~~~~~~-----------~~~--------- 131 (273) ++...+. +.-+.|+..-..++..++++++ +..+++++..+|..++. .....+.+ ..+ T Consensus 151 ~l~~~kl-~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~~ 229 (381) T protein:vir:10 151 TAIQNKL-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQG 229 (381) T ss_pred eecceeE-EeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEeccCCCCceeeeeccCccccccccccccccccc Confidence 5555433 3445777665666677887765 56788999999886652 11111100 000 Q ss_pred -cccCCHhHHHHHHHHHHHHHhhc----C-CCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeee--cce Q lcl|Aclame:pro 132 -SAPSDADDAFDLIASALKELTKA----N-VPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNL--LGA 203 (273) Q Consensus 132 -~~~~~~~~~~~~i~~a~~~l~~~----~-vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i--~G~ 203 (273) .+..+....++.|.+....+... . .+..+-+++++|..+..|..... ..+ . +|..-.. +|. T Consensus 230 t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~-~~~-----~-----~G~~v~~l~~g~ 298 (381) T protein:vir:10 230 TLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT-HLN-----A-----NGVYVTALPFNL 298 (381) T ss_pred ccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccc-cCC-----C-----CCceeecCCCCc Confidence 01111222344454444444221 1 12234467899999888754321 111 1 2322222 467 Q ss_pred EEEEecccccCCCcEEEEEeCceEEEEEec-ceeeeccCCC--cceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 204 RIVESNNLRDTDDEQFVAFHPSAAAYVSQI-DTVEALRDQD--SFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 204 ~i~~s~~l~~~~~~~~~~~~~~a~~~~~~~-~~ve~~~~~~--~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) +|+.++.+|.+. ++.+.-+......|. ..++...... .-.+.|++.+++|.++++|++++++.-+.+ T Consensus 299 ~vv~s~~~p~~~---iifgDfs~Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~ 368 (381) T protein:vir:10 299 NVIESTVQEAGK---VLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) T ss_pred eEEecCCCCcCc---EEEEecccEEEEEecccEEEeechhHhhcCCeEEEEEEEEcCEEecCceEEEEEEEec Confidence 799999888654 344444444444333 2333332211 113689999999999999999998654444 No 168 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=98.66 E-value=1.1e-08 Score=64.15 Aligned_cols=253 Identities=10% Similarity=0.026 Sum_probs=142.8 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCc-ccccceE Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSAD-AISDTGV 73 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~-~~~~~~~ 73 (273) |.- -.+.|+-+...+.+.+.+...+.+++++- ...|+ ..||+....+.+....++..+..+ +.+.+.+ T Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~----~~~~~-~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i 150 (381) T protein:vir:95 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIK----NAGLR-LKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) T ss_pred HhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeE----ecCcc-eEEEEecCCcceeeecccccccccccccceee Confidence 111 26789999999999999998888887542 22343 578887665655555555444322 3455555 Q ss_pred EEEEEeeeeceeEechHHHHHhHHHHHHHH-HHHHHHHHHHHHHHHHH-HHHhhccc-----------ccc--------- Q lcl|Aclame:pro 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAYT-RAGATALATDTDKFIAD-MLVDNGTA-----------LTG--------- 131 (273) Q Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~~-~~~~~ala~~iD~~~~~-~~~~~~~~-----------~~~--------- 131 (273) ++...+. +.-+.|+..-..++..++++++ +..+++++..+|..++. .....+.+ ..+ T Consensus 151 ~l~~~kl-~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~~ 229 (381) T protein:vir:95 151 TAIQNKL-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQG 229 (381) T ss_pred eecceeE-EeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEeccCCCCceeeeeccCccccccccccccccccc Confidence 5555433 3445777665666677887765 56788999999886652 11111100 000 Q ss_pred -cccCCHhHHHHHHHHHHHHHhhc----C-CCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeee--cce Q lcl|Aclame:pro 132 -SAPSDADDAFDLIASALKELTKA----N-VPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNL--LGA 203 (273) Q Consensus 132 -~~~~~~~~~~~~i~~a~~~l~~~----~-vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i--~G~ 203 (273) .+..+....++.|.+....+... . .+..+-+++++|..+..|..... ..+ . +|..-.. +|. T Consensus 230 t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~-~~~-----~-----~G~~v~~l~~g~ 298 (381) T protein:vir:95 230 TLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT-HLN-----A-----NGVYVTALPFNL 298 (381) T ss_pred ccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccc-cCC-----C-----CCceeecCCCCc Confidence 01111222344454444444221 1 12234467899999888754321 111 1 2322222 467 Q ss_pred EEEEecccccCCCcEEEEEeCceEEEEEec-ceeeeccCCC--cceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 204 RIVESNNLRDTDDEQFVAFHPSAAAYVSQI-DTVEALRDQD--SFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 204 ~i~~s~~l~~~~~~~~~~~~~~a~~~~~~~-~~ve~~~~~~--~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) +|+.++.+|.+. ++.+.-+......|. ..++...... .-.+.|++.+++|.++++|++++++.-+.+ T Consensus 299 ~vv~s~~~p~~~---iifgDfs~Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~ 368 (381) T protein:vir:95 299 NVIESTVQEAGK---VLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) T ss_pred eEEecCCCCcCc---EEEEecccEEEEEecccEEEeechhHhhcCCeEEEEEEEEcCEEecCceEEEEEEEec Confidence 799999888654 344444444444333 2333332211 113689999999999999999998654444 No 169 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=98.64 E-value=1.7e-08 Score=63.20 Aligned_cols=255 Identities=11% Similarity=0.067 Sum_probs=138.7 Q ss_pred Ccc--cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCC-cccccceEEEEE Q lcl|Aclame:pro 1 MAF--NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSA-DAISDTGVDLLI 77 (273) Q Consensus 1 MA~--~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~-~~~~~~~~~~ti 77 (273) ..- ..+.|+-+...+.+.+.....+.+++++- . ..| ...+|+....+.+....+...... .+.+.+.+++.. T Consensus 80 t~~~Gg~lvP~~~~~~I~~~l~~~spir~~a~v~--~--~~~-~~~i~~~~~~~~a~W~~e~~~~~~~~~~~f~~i~l~~ 154 (381) T protein:vir:10 80 VGYKEEKLLPEETIDRIFEDLTTNHPLLADLGIK--N--AGL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQ 154 (381) T ss_pred CCCCCceecCHHHHHHHHHHHHhhcceeeeeeeE--e--cCc-ceEEEeecCCcceEEeecccccccccCccceeEeecc Confidence 211 26789999999999999998888887542 1 223 457777766555544444333322 234555555555 Q ss_pred EeeeeceeEechHHHHHhHHHHHHHH-HHHHHHHHHHHHHHHHH-HHHhhcccc-----------cccc----------c Q lcl|Aclame:pro 78 DQEKSIDFLVDDIDRVQVAGSLEAYT-RAGATALATDTDKFIAD-MLVDNGTAL-----------TGSA----------P 134 (273) Q Consensus 78 d~~~~~~~~i~d~d~~~~~~~~~~~~-~~~~~ala~~iD~~~~~-~~~~~~~~~-----------~~~~----------~ 134 (273) .+. +.-+.|+..-...+..++++++ ++.+++++..+|..++. .....+.+. .+.. . T Consensus 155 ~kl-~a~i~is~elL~Ds~~~le~~i~~~la~~~a~~~~~afi~GdG~~qP~Gil~~~~~~~~~~~g~~~~~~~~~~~t~ 233 (381) T protein:vir:10 155 NKL-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTDGAYPEKEEQGTLTF 233 (381) T ss_pred eeE-EeeccccHHHHhccHHHHHHHHHHHHHHHHHHHhhceeEecccCCCceeeeecCCccccccccccccccccccccc Confidence 433 4445777665666677787765 46788999999886541 111111100 0000 0 Q ss_pred CCHhHHHHHHHHHHHHHhh----cCC-CcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEec Q lcl|Aclame:pro 135 SDADDAFDLIASALKELTK----ANV-PNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESN 209 (273) Q Consensus 135 ~~~~~~~~~i~~a~~~l~~----~~v-p~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~ 209 (273) .+....++.+......+.. ... ...+.+++++|..+..|..... +.+ ..| ..+. .+ -+|.+|+.++ T Consensus 234 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~~~~-~~~--~~G--~~v~--~l--p~g~~vv~~~ 304 (381) T protein:vir:10 234 ANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT-HLN--ANG--VYVT--AL--PFNLNVIEST 304 (381) T ss_pred cchhhHHHHHHHHHHhhhhhhccccccccCceEEEEchhhHHhhccccc-cCC--CCC--ceee--cC--CCCceeEEcC Confidence 0111122222222222211 111 2334578899999888865432 111 111 1110 11 1588899999 Q ss_pred ccccCCCcEEEEEeCceEEEEEecc-eeeeccCCC--cceeeEEeeeeeeeEEEcCceEEEEecC--CC Q lcl|Aclame:pro 210 NLRDTDDEQFVAFHPSAAAYVSQID-TVEALRDQD--SFSDRIRALHVYGGKVVRPTGVVVFNKT--GS 273 (273) Q Consensus 210 ~l~~~~~~~~~~~~~~a~~~~~~~~-~ve~~~~~~--~~~~~v~~~~~~g~~vl~p~~~v~~~~~--~s 273 (273) .+|.+. ++.+.-+......+.. .++...... .-.+.|++.+++|.++++|++++++.-+ ++ T Consensus 305 ~~p~~~---i~fGDfs~Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~dG~~~~~~A~~v~~l~~~~~ 370 (381) T protein:vir:10 305 VQEAGK---VLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGH 370 (381) T ss_pred CCCcCc---EEEEEcccEEEEEecccEEEeechhhhhcCceEEEEEEEEcCEEecCCcEEEEEEeecCC Confidence 988654 3344433343333332 333322211 1136899999999999999999985444 44 No 170 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=98.63 E-value=2.1e-08 Score=62.71 Aligned_cols=253 Identities=12% Similarity=0.050 Sum_probs=139.6 Q ss_pred Cc-----c-cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCC-cccccceE Q lcl|Aclame:pro 1 MA-----F-NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSA-DAISDTGV 73 (273) Q Consensus 1 MA-----~-~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~-~~~~~~~~ 73 (273) |. . -.++|+-+..++.+.+++..++.+++++- ...| ++.+|+....+.+....+...... .+++.+.+ T Consensus 86 ~~~~t~~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v~----~~~~-~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i 160 (395) T protein:vir:95 86 INYDVGYTDEKILPETVVERVFDDLQKDHPLLSKINFQ----NAGI-KTRVIKADPAGQAVWGKVFGEIKGQLDAAFREE 160 (395) T ss_pred HhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeE----ecCC-ceEEEEecCCcceEEeecccccCccccccceee Confidence 11 1 24679999999999999998888887542 2224 467888766555544444333332 34555666 Q ss_pred EEEEEeeeeceeEechHHHHHhHHHHHHHH-HHHHHHHHHHHHHHHHHH-HHh--hcccc-----------ccc--cc-C Q lcl|Aclame:pro 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAYT-RAGATALATDTDKFIADM-LVD--NGTAL-----------TGS--AP-S 135 (273) Q Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~~-~~~~~ala~~iD~~~~~~-~~~--~~~~~-----------~~~--~~-~ 135 (273) ++...+. +.-+.|+..-..++..++++++ +..+++++.++|+.++.= ... .+.+. ... +. . T Consensus 161 ~l~~~kl-~~~~~iS~ell~ds~~~ie~~i~~~la~~ia~~~~~a~i~G~G~~~~qP~Gil~~~~~~~~~~~~~~~~~~~ 239 (395) T protein:vir:95 161 NFTQYKL-TCFVVLPDDLSTFGPAWIERFVRTQIQEAISVALESAIINGGGAAKTQPVGLMKDVNTNSGAVTDKASSGTL 239 (395) T ss_pred eeceeeE-EEeecccHHHHhcchhHHHHHHHHHHHHHHHHHHhhheeeccCCCCcCceeeeecccccccccccccccchh Confidence 6655433 4445777766666677787755 568899999999877631 100 01100 000 00 0 Q ss_pred C---HhHHHHHHHHHHHHHhh----cC-CCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeec--ceEE Q lcl|Aclame:pro 136 D---ADDAFDLIASALKELTK----AN-VPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLL--GARI 205 (273) Q Consensus 136 ~---~~~~~~~i~~a~~~l~~----~~-vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~--G~~i 205 (273) + ....++.+..+...+.- .. ....+..++++|..+..+.+.. .+.. . .|...+++ |.+| T Consensus 240 t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~~~~~g~~-~~~~--~--------~G~~~~~lg~g~~v 308 (395) T protein:vir:95 240 TFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDSWDVQARY-TYLT--A--------NGGFVTVLPYNVTI 308 (395) T ss_pred hhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhhhhcCCcc-eecc--C--------CCcceeccCCcceE Confidence 1 11122223332222210 01 1112235688888877664332 1211 1 24444554 6678 Q ss_pred EEecccccCCCcEEEEEeCceEEEEEec-ceeeeccCCC--cceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 206 VESNNLRDTDDEQFVAFHPSAAAYVSQI-DTVEALRDQD--SFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 206 ~~s~~l~~~~~~~~~~~~~~a~~~~~~~-~~ve~~~~~~--~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) +.++.+|.+. ++.+.-+-+-...+. ..++..++.. +-.+.+++.+++|+++++|+++++++-+.+ T Consensus 309 ~~~~~~p~~~---i~fgdfs~y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dg~~~~~~A~~~l~i~~~ 376 (395) T protein:vir:95 309 ITSEFVPEGK---LVAFVTDRYNAVRGGGLTVKKFDQTLALEDAVLFTAKTFAYGQPDDNKASAVYDLKVA 376 (395) T ss_pred EEcCCCCCCc---EEEEecccEEEEEecceEEEeccchhhhCCcEEEEEEEEECCEEeccccEEEEEeecc Confidence 9999998654 333433333233222 2333332211 113689999999999999999999877766 No 171 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=98.62 E-value=4.2e-08 Score=61.02 Aligned_cols=264 Identities=11% Similarity=0.013 Sum_probs=144.8 Q ss_pred Ccc-----cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccc----cccccCCCCccCCcccccc Q lcl|Aclame:pro 1 MAF-----NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPT----VKDYKAAGRQTSADAISDT 71 (273) Q Consensus 1 MA~-----~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~----~~d~~~~~~~~~~~~~~~~ 71 (273) |-. -.+.|+.+.. +++.+.+...+.++++.-. ...+.+..++.++... ..+...++...+.+.++.+ T Consensus 19 ~t~~d~~Gg~l~P~~~~~-~i~~~~e~s~~l~~~~vi~---~~~~~~~~i~~~g~~~~~~~g~~~~~~~~~~~~~~~~f~ 94 (315) T protein:vir:41 19 IDVPDLGRGVLSVDRFGE-FVKAVRDSAVIIPEARIDN---ALKSYEKDISRLSLVLDVGPGRDETGQKLAPPESTAEVK 94 (315) T ss_pred cCCcCCCCceechHHHHH-HHHHHHhhhhhhhhceeee---ccccccccccccccCcccccccccccCcCCCCCCccccc Confidence 222 2457999764 7778888888888876421 1123344455543211 1122222333333456667 Q ss_pred eEEEEEEeeeeceeEechHHHHHhHH--HHHHH-HHHHHHHHHHHHHHHHHHHHHh---------------hcccccc-- Q lcl|Aclame:pro 72 GVDLLIDQEKSIDFLVDDIDRVQVAG--SLEAY-TRAGATALATDTDKFIADMLVD---------------NGTALTG-- 131 (273) Q Consensus 72 ~~~~tid~~~~~~~~i~d~d~~~~~~--~~~~~-~~~~~~ala~~iD~~~~~~~~~---------------~~~~~~~-- 131 (273) ..++.+.+. +..+.|++.-...+.. ++++. ..+.+++++.+.+...+.==.. +...... T Consensus 95 ~~~l~~~~l-~~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nGdg~s~~p~~~~~~G~l~~a~~~~~~~~ 173 (315) T protein:vir:41 95 TNTLYMREM-VTKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYLHGDTSSSDPLLRMSDGWLKLASEKLTESD 173 (315) T ss_pred eeeeceeee-eeeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhhccCCcCcCccccccccceecccccccccc Confidence 777766554 3445777665555543 56654 4567788888776655422000 0000000 Q ss_pred cccCCHhHHHHHHHHHHHHHhhcCCC-cCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecc Q lcl|Aclame:pro 132 SAPSDADDAFDLIASALKELTKANVP-NVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNN 210 (273) Q Consensus 132 ~~~~~~~~~~~~i~~a~~~l~~~~vp-~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~ 210 (273) ..........+.|.++...|...--. ..+-..+++++.+..+++-.+ .+. ....+..+..|.-..++|.+|+.++. T Consensus 174 ~~~~a~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~rklk~--~~g-~~lw~~~~~~g~~~tl~G~PV~~~~~ 250 (315) T protein:vir:41 174 VDPEAEDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYRDALK--GRE-TGLGDQALTGANSILYDGRPVQYVPA 250 (315) T ss_pred cccccccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHHHHhc--cCC-CccccchhhcCCCceecccceEeccc Confidence 11111122344556665555432211 123367899999888754321 111 12234455567767899999999998 Q ss_pred cccCCC--cEEEEEeCceEEEEEe-cceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCC Q lcl|Aclame:pro 211 LRDTDD--EQFVAFHPSAAAYVSQ-IDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTG 272 (273) Q Consensus 211 l~~~~~--~~~~~~~~~a~~~~~~-~~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~ 272 (273) +|.... ..++.++.+-+.+..+ ...++.+|+.......++..++.|+....++++++..-+. T Consensus 251 m~~~~~~~~~ilf~d~~nl~~~~~~~i~i~~~~~a~~~~~~~~~~~r~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 251 LEALNDGKSRALFVVPTQLVYGFWRNIKVVPDYDAEMRLTKYVASLRTDNHYEDEEGAVSATITV 315 (315) T ss_pred ccccCCCCccEEEecccceEEEeccccEEEeeecCCCCceEEEEEEEeceeEEeccceeEeeeeC Confidence 876442 3345555555555433 3467778887766678888899999887777654433333 No 172 >protein:vir:93696 Length: 364 # NCBI annotation: Bcep22gp55 # Family: family:all:974 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944284;genbank:gi:38640361;genbank:GeneID:2658350 Probab=98.57 E-value=5.1e-08 Score=60.55 Aligned_cols=269 Identities=17% Similarity=0.152 Sum_probs=148.7 Q ss_pred Ccccch------hHHHHHHHHHHHHHHhhccch-hhhc--------cccccccCCcEEEEEeccccccccccCCCC--cc Q lcl|Aclame:pro 1 MAFNNF------IPELWSDMLLEEWTAQTVFAN-LVNR--------EYEGIASKGNVVHIAGVVAPTVKDYKAAGR--QT 63 (273) Q Consensus 1 MA~~~~------~pev~~~~v~~~l~~~~v~~~-~~~~--------d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~--~~ 63 (273) ||.+.+ -..+|+..+...-.+...|.+ ++.. -.+++..+||+|+|+....++- +.+.++. +. T Consensus 1 Ma~T~~~~~~p~a~~~ws~~l~~~~~~~s~f~~~l~G~~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~g-~gv~Gd~~leG 79 (364) T protein:vir:93 1 MSQTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDRITFDLSVHLRG-KPTYGDARVEG 79 (364) T ss_pred CceeccCcCCHHHHHHHHHHHHHHHHhhCccccccccCCCCCcEEEeeecCCCCCceEEeeeeeeccc-CCcccCceeec Confidence 997533 246788877777766655544 4332 2345556799999998876653 2222222 23 Q ss_pred CCcccccceEEEEEEeeeeceeEec-hHHHHHhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhhccc------------- Q lcl|Aclame:pro 64 SADAISDTGVDLLIDQEKSIDFLVD-DIDRVQVAGSLEAYTRA-GATALATDTDKFIADMLVDNGTA------------- 128 (273) Q Consensus 64 ~~~~~~~~~~~~tid~~~~~~~~i~-d~d~~~~~~~~~~~~~~-~~~ala~~iD~~~~~~~~~~~~~------------- 128 (273) ..+.++..+.+++||+.+. ++... .++..-+..++++..+. +..=+++..|+.+|-.++.+... T Consensus 80 nee~L~~~~~~i~idq~r~-~V~~~g~ms~qRt~~dlr~~ar~~L~~w~~~~~d~~~f~~laGarg~~~~~~~~~~~~~~ 158 (364) T protein:vir:93 80 KEESLRFYQDEVRIDQVRH-SVSAGGRMSRKRTVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFIETPDFTGY 158 (364) T ss_pred cccceeEEeeEEEEeeccc-cccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccCcccc Confidence 3457888899999988754 33332 34455566777666554 55667888888888776542100 Q ss_pred -cc-----------------ccccCCHh--HHHHHHHHHHHHHhhcCCCc-------------CC-cEEEECHHHHHHHh Q lcl|Aclame:pro 129 -LT-----------------GSAPSDAD--DAFDLIASALKELTKANVPN-------------VG-RVVVVNAEMAFWLR 174 (273) Q Consensus 129 -~~-----------------~~~~~~~~--~~~~~i~~a~~~l~~~~vp~-------------~~-r~lvv~p~~~~~L~ 174 (273) ++ .....+.+ ..++.|.++...++..+.+. ++ -+++++|..+..|+ T Consensus 159 ~~N~v~aPt~~r~~~~~~at~~~~l~stD~~sl~~id~a~~~a~~~~~~~~~~~~~~Pv~~~g~~~yV~~l~p~q~~~Lr 238 (364) T protein:vir:93 159 AGNPLDAPDVDHLLYGGVATSKASLAATDIMAPLVIEKAVEKAAMMQAENPDVANMVPVSIDGDDHYVCVMSEYQATDMR 238 (364) T ss_pred cccccCCCCCCcEEeccccCchhhccccccccHHHHHHHHHHHHHhCCCCCCCcccceeEecCcceeEEEEcchhhhhhh Confidence 00 00011111 24677888888776654321 11 26799999999998 Q ss_pred cch-HHhhh----h-hcccccceeeeeeeeeecceEEEEecccccCCCc----------EEEEEeCceE--EEEE----e Q lcl|Aclame:pro 175 SSG-SKLTS----A-DTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDE----------QFVAFHPSAA--AYVS----Q 232 (273) Q Consensus 175 ~~~-~~~~~----~-~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~----------~~~~~~~~a~--~~~~----~ 232 (273) .+. ..|.. + ...+..+.+-+|.+|++.|+-|++.++++..... ..+++ ..|. ++.+ + T Consensus 239 ~~t~~~w~d~qk~A~~~~g~~nPlF~G~~gm~ngvii~~~~~vi~~~~~~~~~~v~~~ralllG-aQA~~~a~g~~~g~~ 317 (364) T protein:vir:93 239 TAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMG-RQAGVIAYGTANGLR 317 (364) T ss_pred hcCCHHHHHHHHHhhhcccccCCceecCeeeEcCeEEeccCCcccccccccCccccchhhheec-ceeeEEEeecCCCCC Confidence 532 12222 1 1334556788999999999999988776532211 01111 2232 3322 1 Q ss_pred cceeeeccCCCcceeeEEeeeeeeeEEEcCc----eEEEEecCCC Q lcl|Aclame:pro 233 IDTVEALRDQDSFSDRIRALHVYGGKVVRPT----GVVVFNKTGS 273 (273) Q Consensus 233 ~~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~----~~v~~~~~~s 273 (273) ..-.|...|-+. .-.|......|.+-.|-+ |+++|-.++- T Consensus 318 ~~w~Ee~~D~gn-~~~i~~~~i~G~kK~rF~~~DfGvi~idtaa~ 361 (364) T protein:vir:93 318 FDWEETVKDYGN-EPAIAAGFIAGMKKARFNNKDFGVISIDTAAK 361 (364) T ss_pred ceeeecccCCCC-chhhhhhhHhhhhhcccCCccceEEEeccccc Confidence 111222222222 123444445554333322 6666644443 No 173 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=98.50 E-value=7.2e-08 Score=59.73 Aligned_cols=252 Identities=11% Similarity=0.045 Sum_probs=136.3 Q ss_pred Ccc------cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccC-CcccccceE Q lcl|Aclame:pro 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTS-ADAISDTGV 73 (273) Q Consensus 1 MA~------~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~-~~~~~~~~~ 73 (273) |.. -.++|+-|...+.+.+.....+.+++++ . ...|+ ..||+....+.+....++..+. ..+.+.+.+ T Consensus 83 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~l~~~~~v--~--~~~~~-~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i 157 (383) T protein:vir:78 83 INKEVGYKEETLLPQTVVDEIFEDLTTEHPFLASIGM--R--TTGLR-TKFLKSETSGVAVWGKIFGEIKGQLDATFSDE 157 (383) T ss_pred HhccCCCCCccccCHHHHHHHHHHHHhhccceeeeee--E--ecCCc-eEEEEEcCCcceEEeecccccccccCcceeeE Confidence 222 2678999999999999998888887753 1 22354 5898887766665555544443 234555666 Q ss_pred EEEEEeeeeceeEechHHHHHhHHHHHHHH-HHHHHHHHHHHHHHHHH-HHHhhccc-----------ccccc---cCCH Q lcl|Aclame:pro 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAYT-RAGATALATDTDKFIAD-MLVDNGTA-----------LTGSA---PSDA 137 (273) Q Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~~-~~~~~ala~~iD~~~~~-~~~~~~~~-----------~~~~~---~~~~ 137 (273) ++...+. +.-+.|+..-...+..++++++ +..+++++.++|+.++. .....+.. ..+.. .... T Consensus 158 ~l~~~kl-~~~i~is~ell~Ds~~~ie~~i~~~l~~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~ 236 (383) T protein:vir:78 158 ESIQNKL-TAFVVVPKDLEKFGPAWVKRFVVTQIEEAFAVALESAYIVGDGNDKPIGLNRKVGKGSTVVDGVYAEKAATG 236 (383) T ss_pred eecceee-EeeccchHHHhhccHHHHHHHHHHHHHHHHHHHHhhheEeccCCCCceeeeeccCCcccccccccccccccc Confidence 6666443 4446777766666677787755 56789999999987662 11111100 00000 0011 Q ss_pred hHHHHHHHHHHHH---HhhcC--------CCcCC-cEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeec--ce Q lcl|Aclame:pro 138 DDAFDLIASALKE---LTKAN--------VPNVG-RVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLL--GA 203 (273) Q Consensus 138 ~~~~~~i~~a~~~---l~~~~--------vp~~~-r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~--G~ 203 (273) ...++++...... +.+.. ....+ ...+++|..+..+..... . +.. +|....++ |. T Consensus 237 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~-~------~~~----~G~~~t~l~~~~ 305 (383) T protein:vir:78 237 TLTFANPKTTVNELTDVYKYHSVKENGHPLNVAGKVTLLVNPTDAWDVKKQYT-S------LNA----NGVYVTALPFNL 305 (383) T ss_pred hhhhhhhHHHHHHHHHHHhccchhcccchhhhcCceEEEEcCcchhhhccchh-c------cCC----CCceeeecCCCc Confidence 1111112111111 11111 00111 245677765554432211 0 000 23333444 45 Q ss_pred EEEEecccccCCCcEEEEEeCceEEEEEec-ceeeeccCCCcc---eeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 204 RIVESNNLRDTDDEQFVAFHPSAAAYVSQI-DTVEALRDQDSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 204 ~i~~s~~l~~~~~~~~~~~~~~a~~~~~~~-~~ve~~~~~~~~---~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) .|++++.+|.+. ++.+..+.+....+. ..++...+ .+| .+.+++.+++|.++++|++++++.-+-. T Consensus 306 ~iv~s~~~p~~~---iifgdfs~Y~i~~r~~~~i~~~~~-~~f~~d~~~f~~~~r~dG~~~~~~A~~vl~~~~~ 375 (383) T protein:vir:78 306 NIIESLFVPEKK---AISYVAERYDALIGGPLDIGTYDQ-TLAIEDLNLYAAKQFAYGKAKDDKAAAVWTLNIN 375 (383) T ss_pred eEEecCCCCccc---EEEeeccceEEEecccceEEecch-hhhhcCceEEEEEEEEcCEEecCCeEEEEEEEec Confidence 678888887654 334444444444332 23433222 222 3789999999999999999998754333 No 174 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=98.46 E-value=3.1e-08 Score=61.76 Aligned_cols=254 Identities=13% Similarity=0.134 Sum_probs=146.9 Q ss_pred Cc-----------ccchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccc-cccccc------cCCCCc Q lcl|Aclame:pro 1 MA-----------FNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVA-PTVKDY------KAAGRQ 62 (273) Q Consensus 1 MA-----------~~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~-~~~~d~------~~~~~~ 62 (273) |+ +..+.|+ |-+-+++.+.+.-...+++.+ -+.+|.|+..|.... +++..+ ..+|.. T Consensus 127 ~r~a~~~~~Tgd~~~~i~~~-~v~d~i~li~q~r~i~slf~t----LP~~g~T~eY~v~t~~~tV~~q~~~~kqa~EGd~ 201 (410) T protein:vir:83 127 YARAADHQKTGDLQGVIPDP-IVGPVIDFIDSARPLVSTLGT----LPLNNATFYRPIVSQRPAVGLQGVAGGASDEKTE 201 (410) T ss_pred HHHhhccCcccccccccchh-HhhhHHHHHhhccchhhhhhh----CCCCCCeeEEeeeccccccccccccccccccccc Confidence 11 1234555 777788777777666666643 356689999976633 333333 246777 Q ss_pred cCCcccccceEEEEEEeeeeceeEec--hHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHH Q lcl|Aclame:pro 63 TSADAISDTGVDLLIDQEKSIDFLVD--DIDRVQVAGSLEAYTRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDA 140 (273) Q Consensus 63 ~~~~~~~~~~~~~tid~~~~~~~~i~--d~d~~~~~~~~~~~~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~ 140 (273) +.+..+..++.+..|+.+-+-. .++ .+|+ +....++-.++.+..+-|....+..-+.+........+....|+.++ T Consensus 202 L~~gKl~~~t~tA~ikTyGGyt-~LSRQ~IER-s~v~~L~~~lraL~~AYA~atea~vra~L~~t~t~~~a~~~~Tad~~ 279 (410) T protein:vir:83 202 LDSQKMVIDRLTVNAKTLGGYV-NVSRQAIDF-SSPSALDLVVNGLGQQYAIETEALVGAALASTSTGAVGYGNATADNV 279 (410) T ss_pred ccccceeeeeccceeehhcCcc-cccceeeec-CChhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccHHHH Confidence 8888999999999998764322 222 2222 22334455555554444555555444444443333445566688888 Q ss_pred HHHHHHHHHHHhhc--CCCcCCcEEEECHHHHHHHhcchHHhhhhh-----ccc-ccceeeeeeeeeecceEEEEecccc Q lcl|Aclame:pro 141 FDLIASALKELTKA--NVPNVGRVVVVNAEMAFWLRSSGSKLTSAD-----TSG-DAAGLRAGTIGNLLGARIVESNNLR 212 (273) Q Consensus 141 ~~~i~~a~~~l~~~--~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~-----~~~-~~~~~~~G~ig~i~G~~i~~s~~l~ 212 (273) ...|.++....+++ ++ .-+++.|+|++...+... |...+ ..| +-+.+.+|.-|+++|++|++.+.++ T Consensus 280 ~~~i~da~~~v~da~~~~--~~~~i~vS~DVl~~~~~~---f~~~~~~~~dt~Gfg~~~lg~gi~G~~~~ipVvm~~~a~ 354 (410) T protein:vir:83 280 ASAIWQAAGAVYTAVKGM--GRLVIAIAPDVLGDFGPL---FAPVNPTNAHSTGFEAGRFGQGVMGSISGIPVVMSAALG 354 (410) T ss_pred HHHHHHHHHHHhhhhccc--eeeeEEechhhhhhccce---eeccCCCCcccccccccccccchhhhhcccceEEecCCC Confidence 88888998888887 43 235789999997666432 32222 222 1123346777999999999998887 Q ss_pred cCCCcEEEEEeCceEEEE-Eecceee-eccCCCcceeeEEeeeeeeeEEEcCceEEEEecC Q lcl|Aclame:pro 213 DTDDEQFVAFHPSAAAYV-SQIDTVE-ALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKT 271 (273) Q Consensus 213 ~~~~~~~~~~~~~a~~~~-~~~~~ve-~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~ 271 (273) .+++ ..+.+.|+-+= ...-.+- ...+.-...-.+. -+|+..+..|.+++=+.-. T Consensus 355 AgTA---~f~~~~Ai~~~eS~~gp~qL~d~~i~nLt~~yS--gY~a~a~~~~~gliPv~g~ 410 (410) T protein:vir:83 355 SGDA---YLFSTAAIECFEQRVGTLQVVEPSVFGLQVAYA--GYFSTLVVNEDAIVPLVGS 410 (410) T ss_pred cCee---eEeccceeeeeecCCceeEeeCCchhhhhhhhe--eeeeeccccccceeeeccC Confidence 6654 33456665432 2110000 0011111111222 4556677788888766543 No 175 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=98.40 E-value=3.6e-07 Score=55.90 Aligned_cols=260 Identities=12% Similarity=0.058 Sum_probs=132.2 Q ss_pred Cccc---chhHHHHHHHHHHHHHHhh-ccchhhhcc---ccccccCCcEEEEEeccccccc----cccCCCCccCCcccc Q lcl|Aclame:pro 1 MAFN---NFIPELWSDMLLEEWTAQT-VFANLVNRE---YEGIASKGNVVHIAGVVAPTVK----DYKAAGRQTSADAIS 69 (273) Q Consensus 1 MA~~---~~~pev~~~~v~~~l~~~~-v~~~~~~~d---~~~~~~~Gdtv~ip~~~~~~~~----d~~~~~~~~~~~~~~ 69 (273) ||.. +|-|+++...+. .+++++ +|.. .... .......||.++.|-|..+... +.......+++..++ T Consensus 1 m~lsD~~vfN~~~~~a~~e-~~~q~~~~fn~-as~gai~l~~~~~~Gd~~~~pf~~~l~g~~~~~~~~~~~~~vt~~kit 78 (325) T protein:vir:95 1 MALSDLAVYSEYAYSAFSE-TLRQQVDLFNT-ATGGAIMLQSAAHQGDFSDVAFFAKVTGGLVRRRNAYGSGTVAEKVLK 78 (325) T ss_pred Cchhhhhhhhhhhhhhhhh-hhhhhHhhhhh-cccceeEeccccccCceeeccccccccccccccccCCCCceeccceec Confidence 8874 345677655444 344433 3322 1111 1122345999999999875321 112223345555555 Q ss_pred cceEEEEEEeeeeceeEechHHH-HHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----hcccc-----cccccCC--- Q lcl|Aclame:pro 70 DTGVDLLIDQEKSIDFLVDDIDR-VQVAGSLEAYTRAGATALATDTDKFIADMLVD----NGTAL-----TGSAPSD--- 136 (273) Q Consensus 70 ~~~~~~tid~~~~~~~~i~d~d~-~~~~~~~~~~~~~~~~ala~~iD~~~~~~~~~----~~~~~-----~~~~~~~--- 136 (273) ..+. +.+.-.+..++.-+|... .....++..+.++.+..+++...+++++.+.+ +-... ..++..+ T Consensus 79 t~~~-~av~~~r~~g~~~~d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~~a~~~~~~~v~dis~~~~~~~ 157 (325) T protein:vir:95 79 HLVD-TSVKVAAGTPPVRLDPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLNVGLGSVYSALSQVSDVVYDATANTDAAD 157 (325) T ss_pred cccc-eeeEEecccCcccccHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccceeeeecccCccc Confidence 4332 233234445554444432 22233456666666666666666665544422 11111 1111111 Q ss_pred HhHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCC- Q lcl|Aclame:pro 137 ADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTD- 215 (273) Q Consensus 137 ~~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~- 215 (273) .....+.|.+|+..|.+++- .=..+++++.+|..|.+. .+.+.........+ -.|+.++|-.|+++..+|... T Consensus 158 ~~~s~~~l~~A~~klGD~~~--~l~~~~MHS~v~~~L~~~--~L~~~~~~~~~~g~--~~i~t~~G~~VIVdD~~p~~~~ 231 (325) T protein:vir:95 158 KLPTWNNLNNGQAKFGDQSS--QIAAWIMHSTPMHKLYGS--NLTNGERLFTYGTV--NVVRDPFGKLLVMTDSPNLFAA 231 (325) T ss_pred ccccHHHHHHHHHHhccccc--ceeEEEEchHHHHHHHHh--hccccccccccCCc--ccccccCCcEEEEeCCCCCCCc Confidence 11235678999999876541 112578999999999764 23222111111111 135678999999999888543 Q ss_pred ----CcEEEEEeCceEEEEEecc----eeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecC--CC Q lcl|Aclame:pro 216 ----DEQFVAFHPSAAAYVSQID----TVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKT--GS 273 (273) Q Consensus 216 ----~~~~~~~~~~a~~~~~~~~----~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~--~s 273 (273) .++++.+.++|+++....+ ..|..+++ ..+..++. +| .++|.|-|+---++. .| T Consensus 232 g~~~~ytty~lg~GAi~~~~~~~~~~~~~~~~~~~-~~~~~~~~--~~-tf~lhp~G~sw~~s~~g~s 295 (325) T protein:vir:95 232 GTPNVYHILGLVPGGVLIGQNNDFDANEETKNGDE-NIIRTYQA--EW-SYNIGVKGFAWDKANGGKS 295 (325) T ss_pred cCceeEEEEEEecCeEEecCCCCccccccccCccc-ceeeeeee--ee-eEEeecceeeeecccccCC Confidence 2456778899988765322 12222322 22233322 22 456666666653332 12 No 176 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=98.39 E-value=1.8e-07 Score=57.56 Aligned_cols=252 Identities=11% Similarity=0.039 Sum_probs=134.6 Q ss_pred Cc-----c-cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCC-cccccceE Q lcl|Aclame:pro 1 MA-----F-NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSA-DAISDTGV 73 (273) Q Consensus 1 MA-----~-~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~-~~~~~~~~ 73 (273) |. . ..++|+-+..++.+.+.....+.+++++- ...|+ +++|+....+.+....++..+.. .+.+.+.+ T Consensus 79 ~~~~~~~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~v~----~~~~~-~~~~~~~~~~~a~w~~e~~~~~~~~~~~f~~i 153 (377) T protein:vir:98 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFK----NTSLR-LKALTAETSGTAVWGDIFGEIKGQLKQAFKEQ 153 (377) T ss_pred HhccCCCCCccccCHHHHHHHHHHHHHhhhhhhheeeE----ecCcc-eEEEEecCCcceeEeecccccCcccCccceeE Confidence 22 2 35679999999999999888877777532 22354 57887655554444455444432 23444544 Q ss_pred EEEEEeeeeceeEechHHHHHhHHHHHHHH-HHHHHHHHHHHHHHHHH-HHHhhccc--------cc----ccccCC--- Q lcl|Aclame:pro 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAYT-RAGATALATDTDKFIAD-MLVDNGTA--------LT----GSAPSD--- 136 (273) Q Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~~-~~~~~ala~~iD~~~~~-~~~~~~~~--------~~----~~~~~~--- 136 (273) ++...+. +.-+.|+..-..++..++++++ ++.+++++..+|..++. .....+.+ .. .....+ T Consensus 154 ~l~~~kl-~a~~~is~elL~ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~ 232 (377) T protein:vir:98 154 DFSQFKL-TAFVVIPKDALKFGPKWIKQFITEQLKEAIAVALELAIVKGDGLLQPVGLLKDLSQPTVDQSTGRDITTYKT 232 (377) T ss_pred eecceeE-EeeecccHHhhhccHhHHHHHHHHHHHHHHHHHHhhceEeccCCCcceeeeecccccccccccccccccccc Confidence 4444332 3335677665556667787754 56889999999887653 11111100 00 000000 Q ss_pred -HhHHH--------------HHHHHHHH--HHhhcCCCcCCcE-EEECHHHHHHHhcchHHhhhhhcccccceeeeeeee Q lcl|Aclame:pro 137 -ADDAF--------------DLIASALK--ELTKANVPNVGRV-VVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIG 198 (273) Q Consensus 137 -~~~~~--------------~~i~~a~~--~l~~~~vp~~~r~-lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig 198 (273) .+... .++..... .+++.+ ...|++ ++++|..+..+..... . .. .+|... T Consensus 233 ~~~~~~~l~~~~~~~~~~~a~~~m~~~t~~~~~klk-d~~G~~i~~~n~~~~~~~~p~~~------~-~~----~~G~~~ 300 (377) T protein:vir:98 233 DKEAIADLSDLTPDNAPKKLVPVMKHLSVNDKKRPL-KIAGQVKLILNPEDRWALEAQFT------S-RN----QFGEYV 300 (377) T ss_pred hhhhHhhhhhhchhHHHHHHHHHHHHHHHHHHhhhh-ccCCceEEEecccchhhcccccc------c-cC----CCCccc Confidence 00110 01111111 112111 134554 4578876655532211 0 00 134444 Q ss_pred eecce--EEEEecccccCCCcEEEEEeCceEEEEEec-ceeeeccCCC--cceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 199 NLLGA--RIVESNNLRDTDDEQFVAFHPSAAAYVSQI-DTVEALRDQD--SFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 199 ~i~G~--~i~~s~~l~~~~~~~~~~~~~~a~~~~~~~-~~ve~~~~~~--~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) .++|+ .|+.|+.+|... ++.+..+......+. ..++...+.. .-.+.+++.+++|.++++|+++++|.-++- T Consensus 301 t~lg~p~~vv~s~~~p~~~---i~fgdf~~Y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dg~~~~~~a~~vl~i~~~ 377 (377) T protein:vir:98 301 TVLPHGITILESLAVETGK---AIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred cccCCCceEEecCCCCccc---EEEEEecceeEEeecceEEEeechhhhhcCceEEEEEEEEcCEEeccCcEEEEEEecC Confidence 56655 477888887654 334444444444332 2333332221 123789999999999999999999977766 No 177 >protein:vir:2770 Length: 318 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612887;genbank:gi:20065804;genbank:GeneID:935710 Probab=98.34 E-value=4.3e-07 Score=55.51 Aligned_cols=222 Identities=13% Similarity=0.042 Sum_probs=124.9 Q ss_pred CcccchhHHHHHHHHHHHHHHhhccchh--------hhccccccccCCcEEEEEeccccccccccCCCC--ccCCccccc Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWTAQTVFANL--------VNREYEGIASKGNVVHIAGVVAPTVKDYKAAGR--QTSADAISD 70 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~~~~v~~~~--------~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~--~~~~~~~~~ 70 (273) ++|+. .-.+|+..+...-.+...+..+ +.+-.+++...||+|+|+....++-. -+.++. +...+.++. T Consensus 22 ~~~~~-~vk~ws~~l~~~~~~~~~~~~~~g~~~~~~I~r~~dL~K~~GD~Vtf~L~~~L~g~-gv~Gd~~lEGnee~L~~ 99 (318) T protein:vir:27 22 NRNRS-MVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKR-PTMGDERVEGRGEDLSH 99 (318) T ss_pred hcCCh-HHHHHHHhhhhHHHhhhhhhcccCCCCCceEEEeccCCCCCccEEEEeEeeccccC-ccccCceeeccccceEE Confidence 44443 2457877655444444333322 33334555678999999887665432 121122 233456788 Q ss_pred ceEEEEEEeeeeceeEec-hHHHHHhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhhccc-------------------- Q lcl|Aclame:pro 71 TGVDLLIDQEKSIDFLVD-DIDRVQVAGSLEAYTRA-GATALATDTDKFIADMLVDNGTA-------------------- 128 (273) Q Consensus 71 ~~~~~tid~~~~~~~~i~-d~d~~~~~~~~~~~~~~-~~~ala~~iD~~~~~~~~~~~~~-------------------- 128 (273) .+..++||+.+. ++... .++..-+..++++..+. +..-+++..|+-+|-.++.+... T Consensus 100 ~~d~l~IDq~r~-~V~~gg~msqqRt~~dlR~~ar~~L~~w~~~~~Dq~~~v~laGarg~~~n~~~~~p~~~~~~~~~~~ 178 (318) T protein:vir:27 100 ADFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIM 178 (318) T ss_pred EeeEEEEeeecc-ccccccchhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccceEecccCccchhhh Confidence 888999988643 33222 33444456677665554 56667888888777666432210 Q ss_pred -cccccc----------------CCHhH--HHHHHHHHHHHHhhcCCCc-------CC-------cEEEECHHHHHHHhc Q lcl|Aclame:pro 129 -LTGSAP----------------SDADD--AFDLIASALKELTKANVPN-------VG-------RVVVVNAEMAFWLRS 175 (273) Q Consensus 129 -~~~~~~----------------~~~~~--~~~~i~~a~~~l~~~~vp~-------~~-------r~lvv~p~~~~~L~~ 175 (273) +...+| ++.++ .++.|.+++..+++..-|. +. ++++++|.++..|+. T Consensus 179 ~N~v~aPt~~r~~~~g~at~~~~l~stD~~s~~lid~~~~~~~~~a~pi~PV~v~g~~~~~~~~~yV~~~~p~q~~~Lrt 258 (318) T protein:vir:27 179 INDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYT 258 (318) T ss_pred hcccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceeeccccccCCcceEEEEechHHHHHHhh Confidence 000110 11111 2555667777776633221 12 567999999999987 Q ss_pred chH--Hh----hhhhcc--cccceeeeeeeeeecceEEEEecccc--cCCCcEEEEEeCceEEEEEecc Q lcl|Aclame:pro 176 SGS--KL----TSADTS--GDAAGLRAGTIGNLLGARIVESNNLR--DTDDEQFVAFHPSAAAYVSQID 234 (273) Q Consensus 176 ~~~--~~----~~~~~~--~~~~~~~~G~ig~i~G~~i~~s~~l~--~~~~~~~~~~~~~a~~~~~~~~ 234 (273) +.. .| .++... +..+.|..|.+|++.|+=+++.+.+| ...+..+ -+.++. T Consensus 259 dt~~~~w~d~q~~A~~r~~g~knPLF~G~~gm~ngvil~~~~~vpIrf~~G~~v---------~~~~~~ 318 (318) T protein:vir:27 259 STSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGQRF---------WYQRIT 318 (318) T ss_pred cCCCHHHHHHHHHHHhcccccCCCceecceeeecCEEEeecCCccEEEcCCCee---------eeeecC Confidence 741 12 233333 45677999999999999999988654 2222111 111111 No 178 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=98.19 E-value=1e-06 Score=53.42 Aligned_cols=259 Identities=12% Similarity=0.069 Sum_probs=142.9 Q ss_pred Cc--ccc-hhHHHHHHHHHHHHHHhh-----ccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccce Q lcl|Aclame:pro 1 MA--FNN-FIPELWSDMLLEEWTAQT-----VFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTG 72 (273) Q Consensus 1 MA--~~~-~~pev~~~~v~~~l~~~~-----v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~ 72 (273) +| ++. =.|-++.+.+-+.|.+.. .|..++.+..-..+++-+.+.+ +..+.=+..+++++.....+.++. T Consensus 359 ~A~~hsTsDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~l---g~~~~L~~V~E~gEyk~~t~~e~~ 435 (652) T protein:vir:79 359 AAFTHSTSDFGNILLDVANKAILQGWEDAPETYEQWTRKGQLSDFKIAHRVGM---GGFSALRQVREGAEYKYVTTGDKQ 435 (652) T ss_pred HHhhcCcchHHHHHHHHHHHHHHHHHhhhHHHHHHHhccCCCccccccceeec---CCCCCccccCCCCccceeeecCcc Confidence 22 221 136666655555554433 2233333322224555555544 443333446778878887888888 Q ss_pred EEEEEEeeeeceeEechHHHHHhHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhhcccc-ccccc----------CCHh Q lcl|Aclame:pro 73 VDLLIDQEKSIDFLVDDIDRVQVAGSLEAY---TRAGATALATDTDKFIADMLVDNGTAL-TGSAP----------SDAD 138 (273) Q Consensus 73 ~~~tid~~~~~~~~i~d~d~~~~~~~~~~~---~~~~~~ala~~iD~~~~~~~~~~~~~~-~~~~~----------~~~~ 138 (273) .++.+..+ +.-|.|+-. ...++|+..+ .+..+++-++.++..+++.+...+.-. .+.+- ..+. T Consensus 436 e~~~l~ty-G~~~~iTRq--aiINDDL~a~~~ip~~~g~aA~~~~~~~vy~~l~~Np~~~~DGk~LF~hA~H~Nl~~~aa 512 (652) T protein:vir:79 436 ATIALATY-GELFSITRQ--AIINDDLNMLTDVPMKLGRAAKSTIADLVYAILTSNPKISTDNVSLFDKAKHANVLESAA 512 (652) T ss_pred ceeeeecc-cCeeeeehh--eeeccchhHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccccCCceeeccccccccccccc Confidence 89999765 555666542 2345566554 446777888888888888876544221 11000 0112 Q ss_pred HHHHHHHHHHHHHhhcCCC-----cCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecce-EEEEecccc Q lcl|Aclame:pro 139 DAFDLIASALKELTKANVP-----NVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGA-RIVESNNLR 212 (273) Q Consensus 139 ~~~~~i~~a~~~l~~~~vp-----~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~-~i~~s~~l~ 212 (273) ...+.|..++..|.+++-. -.+++|+++|+......+ .+......+ ...-.|.+--+.|+ +++.+++|. T Consensus 513 ~~~~~l~~ar~aM~~Qk~g~~~l~i~P~~llvp~~le~~a~~---ll~s~~v~~--a~~~~~~~Np~~~~~~~i~eprL~ 587 (652) T protein:vir:79 513 MDVASLDKARQLMRVQKEGERHLNIRPAFVLVPTAMESVANQ---VIRSSSVKG--ADINAGIINPVKDFATVIAEPRLD 587 (652) T ss_pred CCHHHHHHHHHHHHHhccCCccccccccEEEecchhHHHHHH---HhccCCCcc--cccccccccccccccccccccccC Confidence 2345677777777655521 124688999987654432 121111111 11123455556664 788888887 Q ss_pred cCCCcEEEEE-eCc--eE--EEEE--ecceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEec Q lcl|Aclame:pro 213 DTDDEQFVAF-HPS--AA--AYVS--QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNK 270 (273) Q Consensus 213 ~~~~~~~~~~-~~~--a~--~~~~--~~~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~ 270 (273) .......++. .+. .+ +|.. +...+|....-...|-.++..+-||+++++-.+++..++ T Consensus 588 ~~s~~~wylaa~~~~dtiev~yL~G~~~P~ie~~~gf~~dG~~~kvrlD~G~~~iD~RG~~k~t~ 652 (652) T protein:vir:79 588 DNSQTTFYLAASKGSDTIEVAYLNGVDTPYIDQMEGFSVDGVTTKVRIDAGVAPVDHRGLVKCTA 652 (652) T ss_pred CCCcccEEEecCCCCCeEEEEEecCCCCCeeeecCCCCcceEEEEEEEeccCceeeccceeeecC Confidence 6554444333 332 22 2221 223344433333346688889999999999999998888 No 179 >protein:vir:104439 Length: 404 # NCBI annotation: putative virion structural protein # Family: family:all:974 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794063;genbank:gi:116222008;genbank:GeneID:4397504 Probab=98.08 E-value=3.3e-06 Score=50.62 Aligned_cols=268 Identities=14% Similarity=0.064 Sum_probs=136.4 Q ss_pred CcccchhHHHHHHHHHHHHHHhhcc--------chhhhccccccccCCcEEEEEeccccccccccCCCC--ccCCccccc Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWTAQTVF--------ANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGR--QTSADAISD 70 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~~~~v~--------~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~--~~~~~~~~~ 70 (273) +.|+.. -.+|...+...-..+..+ ...+.+-.+++...||+|+|+....++- +.+.++. +...+.++. T Consensus 22 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~g-~gv~Gd~~lEGnee~L~~ 99 (404) T protein:vir:10 22 NRNRSM-VNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSK-RPTMGDERVEGRGEDLSH 99 (404) T ss_pred hcCChh-HhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeeccc-CCcccCceeeccccceeE Confidence 555542 344444322222222111 1223333455667799999998876653 2222222 233457888 Q ss_pred ceEEEEEEeeeeceeEec-hHHHHHhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhhccc-------------------- Q lcl|Aclame:pro 71 TGVDLLIDQEKSIDFLVD-DIDRVQVAGSLEAYTRA-GATALATDTDKFIADMLVDNGTA-------------------- 128 (273) Q Consensus 71 ~~~~~tid~~~~~~~~i~-d~d~~~~~~~~~~~~~~-~~~ala~~iD~~~~~~~~~~~~~-------------------- 128 (273) .+.+++||+.+. ++... .++..-+..++++..+. +..-+++..|+.+|-.++..... T Consensus 100 ~s~~i~Idq~r~-~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~n~~~~vp~~~~~~~~~~~ 178 (404) T protein:vir:10 100 ADFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIM 178 (404) T ss_pred EeeEEEEeeecc-cccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeecccccccccee Confidence 899999988754 33322 34444556777766654 56668888999888766532210 Q ss_pred -cccccc----------------CCHh--HHHHHHHHHHHHHhhcCCCcC-------C-------cEEEECHHHHHHHhc Q lcl|Aclame:pro 129 -LTGSAP----------------SDAD--DAFDLIASALKELTKANVPNV-------G-------RVVVVNAEMAFWLRS 175 (273) Q Consensus 129 -~~~~~~----------------~~~~--~~~~~i~~a~~~l~~~~vp~~-------~-------r~lvv~p~~~~~L~~ 175 (273) +...+| .+.+ ..++.|.++.+.+++..-|.. . ++++++|.++..|+. T Consensus 179 ~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~~~~yV~~~~p~q~~~Lr~ 258 (404) T protein:vir:10 179 INDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYT 258 (404) T ss_pred ecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccCccceEEEEechHHHHHHhh Confidence 000010 0111 135567788888877444432 1 578999999999998 Q ss_pred chH--Hhhhhh------cccccceeeeeeeeeecceEEEEecccc--cCCC------------------------cEEEE Q lcl|Aclame:pro 176 SGS--KLTSAD------TSGDAAGLRAGTIGNLLGARIVESNNLR--DTDD------------------------EQFVA 221 (273) Q Consensus 176 ~~~--~~~~~~------~~~~~~~~~~G~ig~i~G~~i~~s~~l~--~~~~------------------------~~~~~ 221 (273) +.. .|.+.. ..+..+.|.+|.+|++.|+-|++.++.| ...+ ..+++ T Consensus 259 dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~v~RallL 338 (404) T protein:vir:10 259 STSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLL 338 (404) T ss_pred CCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCccccccccccccccchhheee Confidence 842 132221 1256678999999999999998766543 1000 00112 Q ss_pred EeCceE--EEEEe----cceeeeccCCCcceeeEEeeeeeeeEEEc-C--------ceEEEEecCCC Q lcl|Aclame:pro 222 FHPSAA--AYVSQ----IDTVEALRDQDSFSDRIRALHVYGGKVVR-P--------TGVVVFNKTGS 273 (273) Q Consensus 222 ~~~~a~--~~~~~----~~~ve~~~~~~~~~~~v~~~~~~g~~vl~-p--------~~~v~~~~~~s 273 (273) + ..|+ ++.+- ..-.|...|-+. .-.|......|.+-+| | =++++|-.++= T Consensus 339 G-aQAl~~A~g~~~g~~~~w~Ee~~D~g~-~~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idta~~ 403 (404) T protein:vir:10 339 G-AQALANAYGQKAGGHFNMVEKKTDMDN-RTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVK 403 (404) T ss_pred c-ceeEEEEeeccCCCCceeEeeccccCc-hhhhhhHHHhhhhhccccCCCCceeeEEEEEeccccc Confidence 2 2222 22221 111222222221 1233333444433333 1 13333322222 No 180 >protein:vir:819 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050552;genbank:gi:9633449;genbank:GeneID:1262254 Probab=98.08 E-value=3.3e-06 Score=50.62 Aligned_cols=268 Identities=14% Similarity=0.064 Sum_probs=136.4 Q ss_pred CcccchhHHHHHHHHHHHHHHhhcc--------chhhhccccccccCCcEEEEEeccccccccccCCCC--ccCCccccc Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWTAQTVF--------ANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGR--QTSADAISD 70 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~~~~v~--------~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~--~~~~~~~~~ 70 (273) +.|+.. -.+|...+...-..+..+ ...+.+-.+++...||+|+|+....++- +.+.++. +...+.++. T Consensus 22 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~g-~gv~Gd~~lEGnee~L~~ 99 (404) T protein:vir:81 22 NRNRSM-VNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSK-RPTMGDERVEGRGEDLSH 99 (404) T ss_pred hcCChh-HhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeeccc-CCcccCceeeccccceeE Confidence 555542 344444322222222111 1223333455667799999998876653 2222222 233457888 Q ss_pred ceEEEEEEeeeeceeEec-hHHHHHhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhhccc-------------------- Q lcl|Aclame:pro 71 TGVDLLIDQEKSIDFLVD-DIDRVQVAGSLEAYTRA-GATALATDTDKFIADMLVDNGTA-------------------- 128 (273) Q Consensus 71 ~~~~~tid~~~~~~~~i~-d~d~~~~~~~~~~~~~~-~~~ala~~iD~~~~~~~~~~~~~-------------------- 128 (273) .+.+++||+.+. ++... .++..-+..++++..+. +..-+++..|+.+|-.++..... T Consensus 100 ~s~~i~Idq~r~-~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~n~~~~vp~~~~~~~~~~~ 178 (404) T protein:vir:81 100 ADFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIM 178 (404) T ss_pred EeeEEEEeeecc-cccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeecccccccccee Confidence 899999988754 33322 34444556777766654 56668888999888766532210 Q ss_pred -cccccc----------------CCHh--HHHHHHHHHHHHHhhcCCCcC-------C-------cEEEECHHHHHHHhc Q lcl|Aclame:pro 129 -LTGSAP----------------SDAD--DAFDLIASALKELTKANVPNV-------G-------RVVVVNAEMAFWLRS 175 (273) Q Consensus 129 -~~~~~~----------------~~~~--~~~~~i~~a~~~l~~~~vp~~-------~-------r~lvv~p~~~~~L~~ 175 (273) +...+| .+.+ ..++.|.++.+.+++..-|.. . ++++++|.++..|+. T Consensus 179 ~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~~~~yV~~~~p~q~~~Lr~ 258 (404) T protein:vir:81 179 INDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYT 258 (404) T ss_pred ecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccCccceEEEEechHHHHHHhh Confidence 000010 0111 135567788888877444432 1 578999999999998 Q ss_pred chH--Hhhhhh------cccccceeeeeeeeeecceEEEEecccc--cCCC------------------------cEEEE Q lcl|Aclame:pro 176 SGS--KLTSAD------TSGDAAGLRAGTIGNLLGARIVESNNLR--DTDD------------------------EQFVA 221 (273) Q Consensus 176 ~~~--~~~~~~------~~~~~~~~~~G~ig~i~G~~i~~s~~l~--~~~~------------------------~~~~~ 221 (273) +.. .|.+.. ..+..+.|.+|.+|++.|+-|++.++.| ...+ ..+++ T Consensus 259 dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~v~RallL 338 (404) T protein:vir:81 259 STSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLL 338 (404) T ss_pred CCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCccccccccccccccchhheee Confidence 842 132221 1256678999999999999998766543 1000 00112 Q ss_pred EeCceE--EEEEe----cceeeeccCCCcceeeEEeeeeeeeEEEc-C--------ceEEEEecCCC Q lcl|Aclame:pro 222 FHPSAA--AYVSQ----IDTVEALRDQDSFSDRIRALHVYGGKVVR-P--------TGVVVFNKTGS 273 (273) Q Consensus 222 ~~~~a~--~~~~~----~~~ve~~~~~~~~~~~v~~~~~~g~~vl~-p--------~~~v~~~~~~s 273 (273) + ..|+ ++.+- ..-.|...|-+. .-.|......|.+-+| | =++++|-.++= T Consensus 339 G-aQAl~~A~g~~~g~~~~w~Ee~~D~g~-~~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idta~~ 403 (404) T protein:vir:81 339 G-AQALANAYGQKAGGHFNMVEKKTDMDN-RTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVK 403 (404) T ss_pred c-ceeEEEEeeccCCCCceeEeeccccCc-hhhhhhHHHhhhhhccccCCCCceeeEEEEEeccccc Confidence 2 2222 22221 111222222221 1233333444433333 1 13333322222 No 181 >protein:vir:3298 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049514;genbank:gi:9632520;genbank:GeneID:1262006 Probab=98.08 E-value=3.3e-06 Score=50.62 Aligned_cols=268 Identities=14% Similarity=0.064 Sum_probs=136.4 Q ss_pred CcccchhHHHHHHHHHHHHHHhhcc--------chhhhccccccccCCcEEEEEeccccccccccCCCC--ccCCccccc Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWTAQTVF--------ANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGR--QTSADAISD 70 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~~~~v~--------~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~--~~~~~~~~~ 70 (273) +.|+.. -.+|...+...-..+..+ ...+.+-.+++...||+|+|+....++- +.+.++. +...+.++. T Consensus 22 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~g-~gv~Gd~~lEGnee~L~~ 99 (404) T protein:vir:32 22 NRNRSM-VNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSK-RPTMGDERVEGRGEDLSH 99 (404) T ss_pred hcCChh-HhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeeccc-CCcccCceeeccccceeE Confidence 555542 344444322222222111 1223333455667799999998876653 2222222 233457888 Q ss_pred ceEEEEEEeeeeceeEec-hHHHHHhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhhccc-------------------- Q lcl|Aclame:pro 71 TGVDLLIDQEKSIDFLVD-DIDRVQVAGSLEAYTRA-GATALATDTDKFIADMLVDNGTA-------------------- 128 (273) Q Consensus 71 ~~~~~tid~~~~~~~~i~-d~d~~~~~~~~~~~~~~-~~~ala~~iD~~~~~~~~~~~~~-------------------- 128 (273) .+.+++||+.+. ++... .++..-+..++++..+. +..-+++..|+.+|-.++..... T Consensus 100 ~s~~i~Idq~r~-~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~n~~~~vp~~~~~~~~~~~ 178 (404) T protein:vir:32 100 ADFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIM 178 (404) T ss_pred EeeEEEEeeecc-cccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeecccccccccee Confidence 899999988754 33322 34444556777766654 56668888999888766532210 Q ss_pred -cccccc----------------CCHh--HHHHHHHHHHHHHhhcCCCcC-------C-------cEEEECHHHHHHHhc Q lcl|Aclame:pro 129 -LTGSAP----------------SDAD--DAFDLIASALKELTKANVPNV-------G-------RVVVVNAEMAFWLRS 175 (273) Q Consensus 129 -~~~~~~----------------~~~~--~~~~~i~~a~~~l~~~~vp~~-------~-------r~lvv~p~~~~~L~~ 175 (273) +...+| .+.+ ..++.|.++.+.+++..-|.. . ++++++|.++..|+. T Consensus 179 ~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~~~~yV~~~~p~q~~~Lr~ 258 (404) T protein:vir:32 179 INDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYT 258 (404) T ss_pred ecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccCccceEEEEechHHHHHHhh Confidence 000010 0111 135567788888877444432 1 578999999999998 Q ss_pred chH--Hhhhhh------cccccceeeeeeeeeecceEEEEecccc--cCCC------------------------cEEEE Q lcl|Aclame:pro 176 SGS--KLTSAD------TSGDAAGLRAGTIGNLLGARIVESNNLR--DTDD------------------------EQFVA 221 (273) Q Consensus 176 ~~~--~~~~~~------~~~~~~~~~~G~ig~i~G~~i~~s~~l~--~~~~------------------------~~~~~ 221 (273) +.. .|.+.. ..+..+.|.+|.+|++.|+-|++.++.| ...+ ..+++ T Consensus 259 dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~v~RallL 338 (404) T protein:vir:32 259 STSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLL 338 (404) T ss_pred CCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCccccccccccccccchhheee Confidence 842 132221 1256678999999999999998766543 1000 00112 Q ss_pred EeCceE--EEEEe----cceeeeccCCCcceeeEEeeeeeeeEEEc-C--------ceEEEEecCCC Q lcl|Aclame:pro 222 FHPSAA--AYVSQ----IDTVEALRDQDSFSDRIRALHVYGGKVVR-P--------TGVVVFNKTGS 273 (273) Q Consensus 222 ~~~~a~--~~~~~----~~~ve~~~~~~~~~~~v~~~~~~g~~vl~-p--------~~~v~~~~~~s 273 (273) + ..|+ ++.+- ..-.|...|-+. .-.|......|.+-+| | =++++|-.++= T Consensus 339 G-aQAl~~A~g~~~g~~~~w~Ee~~D~g~-~~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idta~~ 403 (404) T protein:vir:32 339 G-AQALANAYGQKAGGHFNMVEKKTDMDN-RTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVK 403 (404) T ss_pred c-ceeEEEEeeccCCCCceeEeeccccCc-hhhhhhHHHhhhhhccccCCCCceeeEEEEEeccccc Confidence 2 2222 22221 111222222221 1233333444433333 1 13333322222 No 182 >protein:vir:10123 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859253;genbank:gi:32171009;genbank:GeneID:2653345 Probab=98.08 E-value=3.3e-06 Score=50.62 Aligned_cols=268 Identities=14% Similarity=0.064 Sum_probs=136.4 Q ss_pred CcccchhHHHHHHHHHHHHHHhhcc--------chhhhccccccccCCcEEEEEeccccccccccCCCC--ccCCccccc Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWTAQTVF--------ANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGR--QTSADAISD 70 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~~~~v~--------~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~--~~~~~~~~~ 70 (273) +.|+.. -.+|...+...-..+..+ ...+.+-.+++...||+|+|+....++- +.+.++. +...+.++. T Consensus 22 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~g-~gv~Gd~~lEGnee~L~~ 99 (404) T protein:vir:10 22 NRNRSM-VNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSK-RPTMGDERVEGRGEDLSH 99 (404) T ss_pred hcCChh-HhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeeccc-CCcccCceeeccccceeE Confidence 555542 344444322222222111 1223333455667799999998876653 2222222 233457888 Q ss_pred ceEEEEEEeeeeceeEec-hHHHHHhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhhccc-------------------- Q lcl|Aclame:pro 71 TGVDLLIDQEKSIDFLVD-DIDRVQVAGSLEAYTRA-GATALATDTDKFIADMLVDNGTA-------------------- 128 (273) Q Consensus 71 ~~~~~tid~~~~~~~~i~-d~d~~~~~~~~~~~~~~-~~~ala~~iD~~~~~~~~~~~~~-------------------- 128 (273) .+.+++||+.+. ++... .++..-+..++++..+. +..-+++..|+.+|-.++..... T Consensus 100 ~s~~i~Idq~r~-~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~n~~~~vp~~~~~~~~~~~ 178 (404) T protein:vir:10 100 ADFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIM 178 (404) T ss_pred EeeEEEEeeecc-cccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeecccccccccee Confidence 899999988754 33322 34444556777766654 56668888999888766532210 Q ss_pred -cccccc----------------CCHh--HHHHHHHHHHHHHhhcCCCcC-------C-------cEEEECHHHHHHHhc Q lcl|Aclame:pro 129 -LTGSAP----------------SDAD--DAFDLIASALKELTKANVPNV-------G-------RVVVVNAEMAFWLRS 175 (273) Q Consensus 129 -~~~~~~----------------~~~~--~~~~~i~~a~~~l~~~~vp~~-------~-------r~lvv~p~~~~~L~~ 175 (273) +...+| .+.+ ..++.|.++.+.+++..-|.. . ++++++|.++..|+. T Consensus 179 ~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~~~~yV~~~~p~q~~~Lr~ 258 (404) T protein:vir:10 179 INDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYT 258 (404) T ss_pred ecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccCccceEEEEechHHHHHHhh Confidence 000010 0111 135567788888877444432 1 578999999999998 Q ss_pred chH--Hhhhhh------cccccceeeeeeeeeecceEEEEecccc--cCCC------------------------cEEEE Q lcl|Aclame:pro 176 SGS--KLTSAD------TSGDAAGLRAGTIGNLLGARIVESNNLR--DTDD------------------------EQFVA 221 (273) Q Consensus 176 ~~~--~~~~~~------~~~~~~~~~~G~ig~i~G~~i~~s~~l~--~~~~------------------------~~~~~ 221 (273) +.. .|.+.. ..+..+.|.+|.+|++.|+-|++.++.| ...+ ..+++ T Consensus 259 dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~v~RallL 338 (404) T protein:vir:10 259 STSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLL 338 (404) T ss_pred CCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCccccccccccccccchhheee Confidence 842 132221 1256678999999999999998766543 1000 00112 Q ss_pred EeCceE--EEEEe----cceeeeccCCCcceeeEEeeeeeeeEEEc-C--------ceEEEEecCCC Q lcl|Aclame:pro 222 FHPSAA--AYVSQ----IDTVEALRDQDSFSDRIRALHVYGGKVVR-P--------TGVVVFNKTGS 273 (273) Q Consensus 222 ~~~~a~--~~~~~----~~~ve~~~~~~~~~~~v~~~~~~g~~vl~-p--------~~~v~~~~~~s 273 (273) + ..|+ ++.+- ..-.|...|-+. .-.|......|.+-+| | =++++|-.++= T Consensus 339 G-aQAl~~A~g~~~g~~~~w~Ee~~D~g~-~~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idta~~ 403 (404) T protein:vir:10 339 G-AQALANAYGQKAGGHFNMVEKKTDMDN-RTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVK 403 (404) T ss_pred c-ceeEEEEeeccCCCCceeEeeccccCc-hhhhhhHHHhhhhhccccCCCCceeeEEEEEeccccc Confidence 2 2222 22221 111222222221 1233333444433333 1 13333322222 No 183 >protein:vir:96792 Length: 315 # NCBI annotation: major capsid protein # Family: family:all:47 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224246;genbank:gi:62362381;genbank:GeneID:3345731 Probab=98.08 E-value=5.1e-06 Score=49.58 Aligned_cols=260 Identities=13% Similarity=0.071 Sum_probs=121.2 Q ss_pred Cccc------chhHHHHHHHHHHHHHHhh-ccchhhhccc--cccccCCcEEEEEeccc---cccccccCCCCccCCccc Q lcl|Aclame:pro 1 MAFN------NFIPELWSDMLLEEWTAQT-VFANLVNREY--EGIASKGNVVHIAGVVA---PTVKDYKAAGRQTSADAI 68 (273) Q Consensus 1 MA~~------~~~pev~~~~v~~~l~~~~-v~~~~~~~d~--~~~~~~Gdtv~ip~~~~---~~~~d~~~~~~~~~~~~~ 68 (273) ||.+ +|.+.+... .++.+++.+ +|.....--. ...+-.||-...|.+.. +..+++.. ++.+++..+ T Consensus 1 ~~~t~~sdl~vfn~~~~~a-~~e~~~~~~~~Fnaas~Gai~l~~~~~~GDf~~~~ff~i~~~~~~rnv~~-~~~~t~~ki 78 (315) T protein:vir:96 1 MATTVNSDLVIYNDTAQTA-YLERNMDNLAVFNENSRAAIGLNSELIEGDLKLRSFYKVGGAIADRDVNS-TATVAGTKI 78 (315) T ss_pred Cceeeecceeeehhhhhhh-HHhhhHHHHHHhhhhcCCcccccccccccccccccccccccchhhcccCC-Cccccceec Confidence 8863 333444444 444455443 4433221101 11233477766665542 22334433 334555555 Q ss_pred ccc-eEEEEEEeeeeceeEechHHHHHhHHHHHHHHHHHHHHHHHHHHHHHH----HHHHhhcc--cccccccCCHhHHH Q lcl|Aclame:pro 69 SDT-GVDLLIDQEKSIDFLVDDIDRVQVAGSLEAYTRAGATALATDTDKFIA----DMLVDNGT--ALTGSAPSDADDAF 141 (273) Q Consensus 69 ~~~-~~~~tid~~~~~~~~i~d~d~~~~~~~~~~~~~~~~~ala~~iD~~~~----~~~~~~~~--~~~~~~~~~~~~~~ 141 (273) +.. .+.+.+ ...+.++..+....+..-.+...+.....+.++..+-+.++ +.+.+.-. .....+..+..... T Consensus 79 t~~~dvaVk~-~~~~~~~~~~~~~~a~~g~dp~~~~~~i~~~~~~~~l~~~l~~~l~~~~aai~~~t~~~~~~~~a~~~~ 157 (315) T protein:vir:96 79 AADEMVSVKV-PWKYGPYETTEEAFKRRARSPEEFSMLIGQDMADATMAGWIGYALNALQGAIGSNAGMNVSGELATEGK 157 (315) T ss_pred ccccceeEEE-eecCCchhccHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcccccccccccccccCH Confidence 433 333333 23334455554444333233433333333333333333333 33322111 11111222233345 Q ss_pred HHHHHHHHHHhhcCCCcCC-cEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEE Q lcl|Aclame:pro 142 DLIASALKELTKANVPNVG-RVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFV 220 (273) Q Consensus 142 ~~i~~a~~~l~~~~vp~~~-r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~ 220 (273) +.|.+|..+|.++. +. -..++++.+|..|.+ . .+.+.-+.-....++.+..+. +|-.|+++..+|.. +.+ T Consensus 158 ~~l~dA~~klGD~~---~~l~~~vMHS~v~~~L~~-q-~L~~~~~~~~~~~~~~~~~~~-lGkrViVdD~~P~~---~~~ 228 (315) T protein:vir:96 158 KVLTKGLRTMGDKA---SSIAIWVMDSTSYFDIVD-E-AIDNKLYEEAGVVVYGGTPGT-LGKPVLVTDQCPAT---KIF 228 (315) T ss_pred HHHHHHHHHhcccc---cCeeEEEEchHHHHHHHH-h-hhhhhcccccceeEecCcCcc-cccEEEEECCCCcc---eee Confidence 67888888886654 22 246899999999987 3 344332222222333333444 49999999999863 456 Q ss_pred EEeCceEEEEEecc-eeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 221 AFHPSAAAYVSQID-TVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 221 ~~~~~a~~~~~~~~-~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) .+.++|+++...-+ ...... .++.-+++....+-+++.+.|.|+---++++. T Consensus 229 gl~~GAi~~~~~~~~~~~~~~-~~g~e~l~~~~r~e~tf~l~p~G~sw~~~~~~ 281 (315) T protein:vir:96 229 GLVAGAVMITESQAPGMRSYQ-IDDQENLAIGFRAEGTANVEVLGYKWKTKTNV 281 (315) T ss_pred eeecceeeecCCCcccccccc-CCCcceeEEEEeeeeEeeeeeeeEEeecCCCc Confidence 66788888754221 011111 11223344443333445566666554333222 No 184 >protein:vir:105610 Length: 430 # NCBI annotation: virion structural protein # Family: family:all:974 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164307;genbank:gi:56692923;genbank:GeneID:3197221 Probab=97.92 E-value=6.2e-06 Score=49.12 Aligned_cols=270 Identities=11% Similarity=0.054 Sum_probs=134.1 Q ss_pred Cc--------ccchhHHHHHHHHHHHHHHh-hccchh------------------------hhccccccccCCcEEEEEe Q lcl|Aclame:pro 1 MA--------FNNFIPELWSDMLLEEWTAQ-TVFANL------------------------VNREYEGIASKGNVVHIAG 47 (273) Q Consensus 1 MA--------~~~~~pev~~~~v~~~l~~~-~v~~~~------------------------~~~d~~~~~~~Gdtv~ip~ 47 (273) |- ++.....+|+..+...-.+. ..+..+ +.+-.+++...||+|+|+. T Consensus 1 ~~~a~T~~~~~~p~a~~~ws~~l~~~~~k~~~~~~kl~G~~~~~~~~~~~~~~~~ts~~~pI~r~~dL~K~~GD~Vtf~L 80 (430) T protein:vir:10 1 MTASKTTMRYGDPNAMIQQAAGLFALCQGRNSTLNRLTGKMPSGTSDAEKKTKGQSSLELPIVQAQDLGRNKGDEVRFHF 80 (430) T ss_pred CcceeeecccCChhHHHHHHHHHHHHHhhhhhhHHHhhccccccccchhhhccCCCCCCccEEEeccCCCCCccEEEEeE Confidence 32 22334678887765544332 221111 2334455567799999998 Q ss_pred ccccccccccCCCC--ccCCcccccceEEEEEEeeeeceeEech-HHHHHhHHHHHHHHHH-HHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 48 VVAPTVKDYKAAGR--QTSADAISDTGVDLLIDQEKSIDFLVDD-IDRVQVAGSLEAYTRA-GATALATDTDKFIADMLV 123 (273) Q Consensus 48 ~~~~~~~d~~~~~~--~~~~~~~~~~~~~~tid~~~~~~~~i~d-~d~~~~~~~~~~~~~~-~~~ala~~iD~~~~~~~~ 123 (273) ...++-.- +.++. +...+.++..+..++||+.+. ++.+.. ++..-+..++++..+. +..=+++..|+-+|-.++ T Consensus 81 ~~~L~g~g-v~Gd~~lEGnee~L~~~~d~l~IDq~R~-~V~~gg~msqQRt~~dlR~~ar~~L~~w~~~~~Dq~~~v~la 158 (430) T protein:vir:10 81 VQPANAFP-IMGSEYAEGKGTGLKIGSDQLRVNQARF-PVDLGDVMSQIRNPYDLRRLGRPKAKWFMDAYLDQSMLVHLA 158 (430) T ss_pred eeccccCc-eecCceeeccccceEEEeeEEEEeeecc-ccccCCchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 86654321 21121 223456888899999998754 444442 2444455677665554 444567777776666554 Q ss_pred hh-----------------------------cccc-----cccc-----------cCCHh--HHHHHHHHHHHHHhhcCC Q lcl|Aclame:pro 124 DN-----------------------------GTAL-----TGSA-----------PSDAD--DAFDLIASALKELTKANV 156 (273) Q Consensus 124 ~~-----------------------------~~~~-----~~~~-----------~~~~~--~~~~~i~~a~~~l~~~~v 156 (273) .+ ++.. ++.+ .++.+ ..++.|.+++..++.... T Consensus 159 Garg~~~~~~~~~~~~~~~~~~~~~~N~v~aPt~nrh~~~~G~at~~~~~~~~~~sl~stD~~s~~~id~a~~~a~~~~~ 238 (430) T protein:vir:10 159 GARGNHYNKEWCLPLETHPKLADMLVNRVKAPTKNRHFVASADAITGVAPNAGEYNITTADVLDVDVVDSIATYMDQIEL 238 (430) T ss_pred hhhcccccccccccccCCcchhhhhccccCCCCCceeEeecccccccccccccccchhhhcccCHHHHHHHHHHHHhhCC Confidence 32 1100 0000 01111 136678888888877653 Q ss_pred Cc-------CC-------cEEEECHHHHHHHhcchHH--h----hhhhcccccceeeeeeeeeecceEEEEeccc-ccC- Q lcl|Aclame:pro 157 PN-------VG-------RVVVVNAEMAFWLRSSGSK--L----TSADTSGDAAGLRAGTIGNLLGARIVESNNL-RDT- 214 (273) Q Consensus 157 p~-------~~-------r~lvv~p~~~~~L~~~~~~--~----~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l-~~~- 214 (273) |. +. ++++++|.++..|+.+..+ | .+....+..+.|.+|.+|++.|+-|++.... +.. T Consensus 239 ~i~Pv~v~gd~~~g~~~~yV~~~~p~q~~~Lr~dt~~~~wq~~~~a~a~~g~~nPlF~G~~gm~ngvii~~~~~virf~~ 318 (430) T protein:vir:10 239 PPPPVKFEGDEAAEDSPIRVLLCSPAQYNSFAKQEKFRSWQAAALARASNAKQHPIFRVDAGLWSNTLIIKMPKPIRFYA 318 (430) T ss_pred CCcceEeecccccCCccEEEEEechHHHHHHhhCcchHHHHHHHHHhhcccccCCceecceeeecCeEEecCCceeeecC Confidence 21 22 5789999999999988653 1 1123344567889999999999999876422 100 Q ss_pred -------C----Cc------------------EEEEEeC-ceEEEEEe-cc-----eeeeccCCCcceeeEEeeeeeeeE Q lcl|Aclame:pro 215 -------D----DE------------------QFVAFHP-SAAAYVSQ-ID-----TVEALRDQDSFSDRIRALHVYGGK 258 (273) Q Consensus 215 -------~----~~------------------~~~~~~~-~a~~~~~~-~~-----~ve~~~~~~~~~~~v~~~~~~g~~ 258 (273) + .. ..+.+-. -++++.+- .. =.|...|-+. .-.|......|.+ T Consensus 319 g~~~~~~a~~~~~~~~~~~~~a~~~~~~~v~RalllGaQA~~~A~g~~~~~g~~f~w~Ee~~D~g~-~~~i~~~~i~G~k 397 (430) T protein:vir:10 319 GDTIKYCAAYNSEAESSAVVSDSFGNQYAVDRALLLGGQALAQAWAASEHSGMPFFWSEKDMDHGD-KLELLIGAILGCS 397 (430) T ss_pred CCccccccCCcccccccccccccccccccchhhhhccchhheeeeeccCCCCcceeeeeeccccCc-hhhhhhhHHhccc Confidence 0 00 0011100 11222220 00 0222222111 1122222333322 Q ss_pred EE------------cCceEEEEecCCC Q lcl|Aclame:pro 259 VV------------RPTGVVVFNKTGS 273 (273) Q Consensus 259 vl------------~p~~~v~~~~~~s 273 (273) -. +.=++++|-.++- T Consensus 398 K~rF~~~~~~~~~~~DfGvi~idtaa~ 424 (430) T protein:vir:10 398 KIRFAVEATNGLEYTDHGVMAIDTAVK 424 (430) T ss_pred eeeecCCCCCCceeeeeEEEEhhhhhh Confidence 22 1224444422222 No 185 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=97.90 E-value=5.9e-06 Score=49.27 Aligned_cols=260 Identities=12% Similarity=0.095 Sum_probs=140.5 Q ss_pred Ccc--c-chhHHHHHHHHHHHHHHhh-----ccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccce Q lcl|Aclame:pro 1 MAF--N-NFIPELWSDMLLEEWTAQT-----VFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTG 72 (273) Q Consensus 1 MA~--~-~~~pev~~~~v~~~l~~~~-----v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~ 72 (273) ||. + .=.|-++.+.+...|.+.. .|..++.+..-..+++-..+.+ +.++.=+...++++.....+.+.. T Consensus 394 ~a~~htTSDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~l---g~~~~L~~V~E~gEyk~~t~~e~~ 470 (693) T protein:vir:95 394 LAFTHTSSDFGLILLDVANKSVLAGWEEAEETFPLWTKSGILTDFKPARRVGL---GEFSSLRQVREGAEYKYVTLGERG 470 (693) T ss_pred HHHhcCcchhHHHHHHHHHHHHHHHHHhhhhHHHHHhccCCCCcccccceeec---CCCCChhhcCCCCceeeeecCCcc Confidence 322 1 1136666554444443322 2333433322224555555544 443333346777777777888888 Q ss_pred EEEEEEeeeeceeEechHHHHHhHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhhcccccc-----------cccCCHh Q lcl|Aclame:pro 73 VDLLIDQEKSIDFLVDDIDRVQVAGSLEAY---TRAGATALATDTDKFIADMLVDNGTALTG-----------SAPSDAD 138 (273) Q Consensus 73 ~~~tid~~~~~~~~i~d~d~~~~~~~~~~~---~~~~~~ala~~iD~~~~~~~~~~~~~~~~-----------~~~~~~~ 138 (273) .++.+..+ +.-|.|+-. ...++|+..+ .+..+++-++.++..++..+...+.-..+ .++.... T Consensus 471 e~~~l~ty-G~~~~iTRq--aiINDDLga~~~ip~~~g~aA~~~~~~~vy~~L~~Np~m~DGk~LFhadH~Nl~tga~sa 547 (693) T protein:vir:95 471 EQIILATY-GELFSITRQ--AIINDDLQMLSDIPFKLGQAAKATIGDLVYAVLTGNPAMSDGKTLFHADHSNLLTGAASA 547 (693) T ss_pred ceeehhhc-CCeeeecHH--hhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccCCcceeeccccccccccccc Confidence 88888665 555666643 2345666654 44678888899999999888654321111 1111112 Q ss_pred HHHHHHHHHHHHHhhcCCC----------cCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecce-EEEE Q lcl|Aclame:pro 139 DAFDLIASALKELTKANVP----------NVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGA-RIVE 207 (273) Q Consensus 139 ~~~~~i~~a~~~l~~~~vp----------~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~-~i~~ 207 (273) ...+.+..++..|..++.+ -.+++++++|+......+ .+......+ ...-.|.+--+.|+ +++. T Consensus 548 ls~~sl~~a~~am~~qk~~~~~~~g~~L~i~P~~llvP~~le~~a~~---l~~s~~~~~--a~~~~~~~NP~~~~~~vi~ 622 (693) T protein:vir:95 548 LSIDSLSKAKTQMATQKAQVEKGKGRTLNIRPGFVLTPVALEDKANQ---IINSESVPG--ADVNSGIVNPIRAFAQVIG 622 (693) T ss_pred cChHHHHHHHHHHHHhhcchhccCCceeecccceEEecchHHHHHHH---Hhccccccc--cccccccccchhccccccc Confidence 2355677777776555421 134688888886665532 121111111 11223555556674 7888 Q ss_pred ecccccCCCc-EEEEEeCc--eE--EEEE--ecceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 208 SNNLRDTDDE-QFVAFHPS--AA--AYVS--QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 208 s~~l~~~~~~-~~~~~~~~--a~--~~~~--~~~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) +++|...+++ .+++..+. .+ +|.. +...+|....-..-|-.++..+-||+++++-.+++ |.+|. T Consensus 623 ~prL~~~s~~~Wyl~a~~~~dtie~~yL~G~~~P~ie~~~gf~~dG~~~kvr~D~G~~~iD~Rg~~--kn~GA 693 (693) T protein:vir:95 623 EPRLDDASATAWYMAAKKGSDTIEVAYLDGVDTPYLEQQEGFTVDGVASKVRIDAGVAPLDFRGLQ--KSNGA 693 (693) T ss_pred cceecCCCCCceEEecCCCCCeEEEEEecCCCCCeEeecCCCCcceEEEEEEEeccCceeeccccc--cCCCC Confidence 8888765443 33443333 22 2221 22234444333334668888899999999988865 45555 No 186 >protein:vir:3969 Length: 287 # NCBI annotation: major capsid protein # Family: family:all:3269 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663677;genbank:gi:21716114;genbank:GeneID:951200 Probab=97.89 E-value=4.8e-06 Score=49.76 Aligned_cols=267 Identities=16% Similarity=0.166 Sum_probs=150.6 Q ss_pred CcccchhHHHHHHHHHHHHHHhhccchhhhcccc-cccc-CCcEEEEEeccc--cccccccCC--CCc--cCCcccccce Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYE-GIAS-KGNVVHIAGVVA--PTVKDYKAA--GRQ--TSADAISDTG 72 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~-~~~~-~Gdtv~ip~~~~--~~~~d~~~~--~~~--~~~~~~~~~~ 72 (273) ||.-. +.+.|+..+...|.++..|...+.-..+ ..+. ..+|.---+... +.+.+|... .+. .+......+. T Consensus 1 ~avr~-y~Kq~~glL~~vf~~qa~F~~~FGg~lQ~~DGV~~N~taf~vKtsD~pVVi~~Y~Td~Nv~FGtGTg~ssRFG~ 79 (287) T protein:vir:39 1 MAIKY-FTKQYAGMLPDLFAKKSAFLRAFGGVLQVKDGVTENDTFMELKVSDTDVVIQAYSTDANVGFGSGTGNTSRFGQ 79 (287) T ss_pred CCccc-ccHHHHHHHHHHHHHHHhhhhhcccceeeecCCcccceEEEEEecCcceEEecccCCCCcccccCCCccccccc Confidence 99976 4677889999999999988876532221 1222 233322112211 122344311 111 1111111111 Q ss_pred E--EEEEEee-e-eceeEec-hHHHHHhHHHHHH----HHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCHhHHHHH Q lcl|Aclame:pro 73 V--DLLIDQE-K-SIDFLVD-DIDRVQVAGSLEA----YTRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDL 143 (273) Q Consensus 73 ~--~~tid~~-~-~~~~~i~-d~d~~~~~~~~~~----~~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 143 (273) . .+-.|.+ . .+++.|- -+|..-.+.++++ .++.++.+-++.+|..+-..+...+...... ..+.+.+... T Consensus 80 rkEi~y~dt~V~Y~~~~~ihEGiD~~TVNnd~~aaVAdRL~Lqa~A~t~~~n~~~Gk~ls~~A~~t~~~-~~t~d~V~~L 158 (287) T protein:vir:39 80 RKEVKSVNKQVSYDAPLAINEGIDDFTVNDIKDQVVAERLALHGVAWAQHVDKLLGKLLSDSASETLTV-KLDEDSVTKL 158 (287) T ss_pred eeEEEEecccccceeccccccccccccccCChhHHHHHHHHhHHHHHHHHHHHHHHHHHHhhcchheee-eecccchHHH Confidence 1 1111111 1 2333332 2344333444432 3455777788889887777776665544333 3666677777 Q ss_pred HHHHHHHHhhcCCCcCC-cEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEE Q lcl|Aclame:pro 144 IASALKELTKANVPNVG-RVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAF 222 (273) Q Consensus 144 i~~a~~~l~~~~vp~~~-r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~ 222 (273) |.++...+-.+++.... ...+|+|+.|..|...+- .......+. .+-+-.|-++-||.+.+.+.-..-.+. ...+ T Consensus 159 F~~a~~~yvNn~v~~~~~~~AyV~aevYnaiiD~~l--~TsaK~Ssa-NiDen~i~kFkGf~l~e~P~~~~q~g~-~a~f 234 (287) T protein:vir:39 159 FSDAHKKFVNNNVSIAVPWVAYVNADIYDLLIDSKL--ATTAKNSSA-NVDEQTLYKFKGFILSELPDEKFQLNE-GAYF 234 (287) T ss_pred HHHHHHHhhccceeeEEEEEEEEChhHHhHHhcccc--cccccccee-eeccCCcceecceEEEecchHhhccCc-EEEE Confidence 88888888877775543 567999999999986542 222222222 233444568999999887632222222 3344 Q ss_pred eCceEEEEE-ecceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 223 HPSAAAYVS-QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 223 ~~~a~~~~~-~~~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) .++.+|.+- -+........++.-|...++---||-.+++..+..+++++.. T Consensus 235 s~dnig~af~GI~vaR~i~sEdF~GvalQgAgK~G~~i~e~Nk~Ai~k~t~~ 286 (287) T protein:vir:39 235 AADNVGVAGVGIQVTRAMDSEDFAGTALQAAAKYGKYLPEKNKKAILKATVT 286 (287) T ss_pred ccccceeecccceeEEeeecccccceeeecccccccccccccceEEEEEecC Confidence 455544332 122233444555567899999999999999999999998888 No 187 >protein:vir:98871 Length: 314 # NCBI annotation: major capsid protein # Family: family:all:3269 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164418;genbank:gi:56694908;genbank:GeneID:3197261 Probab=97.48 E-value=3.8e-05 Score=44.82 Aligned_cols=267 Identities=13% Similarity=0.096 Sum_probs=146.7 Q ss_pred Ccccc----hhHHHHHHHHHHHHHHhhccchhhhcccc-ccccC-CcEEEEEeccccc--c-ccccC-----CCCccCCc Q lcl|Aclame:pro 1 MAFNN----FIPELWSDMLLEEWTAQTVFANLVNREYE-GIASK-GNVVHIAGVVAPT--V-KDYKA-----AGRQTSAD 66 (273) Q Consensus 1 MA~~~----~~pev~~~~v~~~l~~~~v~~~~~~~d~~-~~~~~-Gdtv~ip~~~~~~--~-~d~~~-----~~~~~~~~ 66 (273) -+|+. .+.+.|+.-+.+.|..+.+|...+.-..+ ..+.+ .++.---+....+ + .+|.. .|. .+.. T Consensus 21 t~N~n~avr~Y~Kqf~glL~~vf~~qa~F~~~FGg~lQalDGV~~N~tafsvKtsD~pVVig~~Y~TdeNvaFGt-GTg~ 99 (314) T protein:vir:98 21 TANQNKAARSYQKEFRQLLQAVFRSQAYFRDFFGGGIEALDGVQHNDTAFYVKTSDIPVVVGNEYNKDENVGFGE-GTSR 99 (314) T ss_pred cccCccceeeecHHHHHHHHHHHhhHhhhhhhcccceeeccCCCccceEEEEeecccceeecCcccCCCCccccc-CCcc Confidence 33332 25788999999999999998876543222 12222 2332111211111 1 23431 111 1111 Q ss_pred ccccceEE--EEEEee-e-eceeEec-hHHHHHhHHHHHH----HHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCH Q lcl|Aclame:pro 67 AISDTGVD--LLIDQE-K-SIDFLVD-DIDRVQVAGSLEA----YTRAGATALATDTDKFIADMLVDNGTALTGSAPSDA 137 (273) Q Consensus 67 ~~~~~~~~--~tid~~-~-~~~~~i~-d~d~~~~~~~~~~----~~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~~ 137 (273) ....+.+. +-.|.. + .+++.|- -+|..-.+.++++ .++.++.+-.+.+|..+-..+...+......+..+. T Consensus 100 SsRFGprkEi~y~dtdVpY~~~~~iHEGiD~~TVNnd~~aaVAdRL~LQA~Akt~~~n~~~Gk~lS~~As~te~ltd~~~ 179 (314) T protein:vir:98 100 STRFGPRREIIYQDTPVPYTWEWVYHEGIDKHTVNNDFQAAVADRLDLQANAKIKQFNAQHSKFISSIAEKTETLTDYSA 179 (314) T ss_pred ccccCceeEEEeecccccccccchhhhccccccccCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhcch Confidence 11111111 111111 1 1233322 2344434444433 345567777888888776667666555544555566 Q ss_pred hHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCc Q lcl|Aclame:pro 138 DDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDE 217 (273) Q Consensus 138 ~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~ 217 (273) +.+...|..+...+-...+- .....+|+|+.|..|...+- .......+. .+-+--|-++-||.+.+.+.-....+. T Consensus 180 d~V~~LF~~as~~yvn~ev~-~~~~AyV~~evYnaiiD~~l--~TsaK~Ssa-NIDengi~~FkGf~i~e~P~~~~q~g~ 255 (314) T protein:vir:98 180 DNVLRLFNELSKYYVNIEAI-GTKAAKVSPELYNAIVDHPL--TTSAKSSSA-NIDQNGIVNFKGFAIQEIPESMLQSGD 255 (314) T ss_pred hhHHHHHHHHHhhhhcceee-EEEEEEEchhHHhHhhcccc--cccccccee-eeccCCcceecceEEEecchhhcCCCc Confidence 66666677777777666653 23678999999999986542 222222222 233344568999999988765555554 Q ss_pred EEEEEeCceEEEEE-ecceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 218 QFVAFHPSAAAYVS-QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 218 ~~~~~~~~a~~~~~-~~~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) .++. ..+.++.+- -+........++.-|..+++-=-||-.+++..+..+++-+.+ T Consensus 256 ia~~-s~dnig~aftGIn~aR~IesEdF~GValQgAGK~G~~I~edNk~Ai~k~t~t 311 (314) T protein:vir:98 256 VAYT-YITNIGKAFTGINTSRIIESEDFDGVALQGAGKAGEFILDDNKKAVAKVTST 311 (314) T ss_pred EEEE-ccccceeecccceeeeeeecccccceeeecccccccccccccceeeEEEecC Confidence 3322 224444321 122233344455557889999999999999999999888888 No 188 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=97.16 E-value=0.00011 Score=42.29 Aligned_cols=254 Identities=10% Similarity=-0.008 Sum_probs=111.8 Q ss_pred Cc-------c-cchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcccccce Q lcl|Aclame:pro 1 MA-------F-NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTG 72 (273) Q Consensus 1 MA-------~-~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~ 72 (273) +. . ....|.-+...+...+.....+...+... +.....+|.......+....+|......+++... T Consensus 237 ~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~------~i~~~~~~~~~~~~~a~~~~eG~~kp~s~~tf~~ 310 (517) T protein:vir:97 237 WTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHE------NLPTLVVGGDNALTQGTGHTTGTDKTESNITLQT 310 (517) T ss_pred eeeecccccccccccchHHHHHHHHhhhhhccceeeeeec------cccceeeecccccceeeeeecCCcccccccceee Confidence 00 0 12235555555555444433333322211 1122334433322233344555555455666666 Q ss_pred EEEEEEeeeeceeEechHHHHHhHHH----HHHHH-HHHHHHHHHHHHHHHHHHHHhhcc---c--cc---cccc-CCHh Q lcl|Aclame:pro 73 VDLLIDQEKSIDFLVDDIDRVQVAGS----LEAYT-RAGATALATDTDKFIADMLVDNGT---A--LT---GSAP-SDAD 138 (273) Q Consensus 73 ~~~tid~~~~~~~~i~d~d~~~~~~~----~~~~~-~~~~~ala~~iD~~~~~~~~~~~~---~--~~---~~~~-~~~~ 138 (273) +++.+.+. +.-+.++.........+ +++++ +++++.|+.+.++.++.=-..... . .. .+.. .... T Consensus 311 ~~~~~~~i-a~~~~~S~qll~Ds~~dd~~~l~s~i~~~l~~~l~~~ee~a~l~GdGtg~~~~gi~~~a~~~~~~~~~~~~ 389 (517) T protein:vir:97 311 RVLTPQYV-YKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTT 389 (517) T ss_pred EEeeHhhh-hhhhhhhHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccccccccccccccccccccc Confidence 66665332 33345555444333333 55644 568889999999877631100000 0 00 0011 1111 Q ss_pred HHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcE Q lcl|Aclame:pro 139 DAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQ 218 (273) Q Consensus 139 ~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~ 218 (273) ...+.+......+.+ ..+-.++++|..+..|.+..+.-.+ + -.+..+..+....++|+.-+.. .++.. .. T Consensus 390 ~~~d~i~~l~~a~~~----a~~a~~vmn~~t~~~I~klKD~~G~--Y-l~~~~~~~~~~~~l~G~~~~~~-~~~~~--~~ 459 (517) T protein:vir:97 390 NIQELLEKLSVATPK----AADSTLVIHRNDLAAIRFLKDKNGN--Y-VFPVGVSNQTIATHFGFNRLVQ-SVAVD--EK 459 (517) T ss_pred hHHHHHHHHHHHhhh----ccCCEEEECHHHHHHHHHhhcCCCC--e-eccCcCCcccccccCCcccccc-ccccC--ce Confidence 222222222222222 2234678999999999765432222 1 1222334455566777533332 12221 11 Q ss_pred EEEEeCceEEEEEecceeeeccCC--CcceeeEEeeeeeeeEEEcCceEE--EEecCCC Q lcl|Aclame:pro 219 FVAFHPSAAAYVSQIDTVEALRDQ--DSFSDRIRALHVYGGKVVRPTGVV--VFNKTGS 273 (273) Q Consensus 219 ~~~~~~~a~~~~~~~~~ve~~~~~--~~~~~~v~~~~~~g~~vl~p~~~v--~~~~~~s 273 (273) .+.+. ..+..+.+.. ++..++- ..-.+.+...++.|..|+.|+.++ +++.+.+ T Consensus 460 ~~~~~-~~y~i~~~~g-~~~~~~fd~~~n~~~f~~~~~~~g~i~~~~r~a~~~~~p~~~ 516 (517) T protein:vir:97 460 TAVSL-SGYVTNGSRG-MEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPVA 516 (517) T ss_pred eEeec-cccEEEeecc-eeeeeeeecccCceeEeeeeeeccccccccceEEEEEcCCCC Confidence 22222 2222222211 1212211 111355666778888888898554 6677766 No 189 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=96.87 E-value=0.00029 Score=40.02 Aligned_cols=262 Identities=11% Similarity=0.053 Sum_probs=118.0 Q ss_pred Cc-cc-----chhHHHHHHHHHHHHHHhh-ccchhhhccccccccCCcEEEEEecccccc---ccccCC--CCccCCccc Q lcl|Aclame:pro 1 MA-FN-----NFIPELWSDMLLEEWTAQT-VFANLVNREYEGIASKGNVVHIAGVVAPTV---KDYKAA--GRQTSADAI 68 (273) Q Consensus 1 MA-~~-----~~~pev~~~~v~~~l~~~~-v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~---~d~~~~--~~~~~~~~~ 68 (273) |+ .+ .+.+.-....+++.|.+.. ++..+-.. ...|.+.++.+....+. ++..-+ ......... T Consensus 1 mpaltLaea~k~~~d~l~~~ViE~~~~~s~lL~~LpF~-----~veg~~~~ynR~~~~~~~~~~~v~~~~~~~g~~~~~~ 75 (310) T protein:vir:97 1 MASVTLAESAKLAQDELVAGVIENIITVNRMFDVLPFD-----SIEGNSLAYNRENVLGDVIMAGVGTTFSGAGAGKAAA 75 (310) T ss_pred CcccchHHHhhcCcchHHHHHHHHHhccchHHHhCCcc-----cccCCcceeeEeeccCCcccccccccccCCCcccccc Confidence 77 42 2335555667777775443 22222111 12355666655543322 221100 011112233 Q ss_pred ccceEEEEEEeeeeceeEechH--HHHHh-HHH-HHHHHHHHHHHHHHHHHHHHHH----------HHHhh-cccccccc Q lcl|Aclame:pro 69 SDTGVDLLIDQEKSIDFLVDDI--DRVQV-AGS-LEAYTRAGATALATDTDKFIAD----------MLVDN-GTALTGSA 133 (273) Q Consensus 69 ~~~~~~~tid~~~~~~~~i~d~--d~~~~-~~~-~~~~~~~~~~ala~~iD~~~~~----------~~~~~-~~~~~~~~ 133 (273) +.+.++..|.- ....+.|+.. |.... ..+ +..++++..+++.++....++. +.... +.....+. T Consensus 76 t~~~~~~~L~i-~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a~n~F~GL~~~~~~~q~i~~~ 154 (310) T protein:vir:97 76 TFTKVNSNLTT-IMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGAGNEFAGLIQLCASGQKATTG 154 (310) T ss_pred ccceeeeeeee-eeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhccccCCCcccchhhcCCccceeecC Confidence 34444444421 1223333321 11111 222 3456777788888887766544 11111 11110000 Q ss_pred cCCHhHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeee-eeeeecceEEEEecccc Q lcl|Aclame:pro 134 PSDADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAG-TIGNLLGARIVESNNLR 212 (273) Q Consensus 134 ~~~~~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G-~ig~i~G~~i~~s~~l~ 212 (273) ...+...++++.++....-+. ..+..+++.+|+++..+..-.+.............. -| .+-.+.|++|+.++.+| T Consensus 155 ~~gg~~t~d~LDeLl~~v~~~--~g~p~~~l~~~~~~r~i~A~~R~~~~~g~~~~~~~~-~G~~v~~~~GiPi~~~d~ip 231 (310) T protein:vir:97 155 ATGSAISFAILDELMDLVVDK--DGQVDYLTMHARTLRSYKALLRALGGASINEVVELP-SGAEVPAYSGTPIFRNDYIP 231 (310) T ss_pred CCCCCCCHHHHHHHHHHHhcC--CCCCCEEEecHHHHHHHHHHHHHhcCCCCCCccccC-CCCEEeeeCCeEEEEeCccC Confidence 001111234454443332211 124458999998866664332222221221111111 23 35689999999999998 Q ss_pred cCC-------CcEEEEEeCc-------eEEEEE---ecceeeecc---CCCcceeeEEeeeeeeeEEEcCceEEEEecCC Q lcl|Aclame:pro 213 DTD-------DEQFVAFHPS-------AAAYVS---QIDTVEALR---DQDSFSDRIRALHVYGGKVVRPTGVVVFNKTG 272 (273) Q Consensus 213 ~~~-------~~~~~~~~~~-------a~~~~~---~~~~ve~~~---~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~ 272 (273) .+. ...+++..-+ -.|+-. -...|+.-- +..-+.+.| .++||..++.|.++++|+.-- T Consensus 232 ~~~~~~~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~~~~glsVr~~G~~~~~~v~~~~V--~~Y~~~av~~~~A~a~L~~V~ 309 (310) T protein:vir:97 232 TNQTKGGTTGCTTIFAGTLDDGSRTHGIAGLTATQAAGIQVVDVGESEDSDEHIWRV--KWYCGLALFSEKGLACADGIT 309 (310) T ss_pred CCccccccCCceeEEEEeeCccccccceeccccCCccceeEEeCCcccCCcceeEEE--EEeeeEEEecccceeeecccc Confidence 642 2234444322 122210 112333321 222233444 679999999999999985433 Q ss_pred C Q lcl|Aclame:pro 273 S 273 (273) Q Consensus 273 s 273 (273) - T Consensus 310 ~ 310 (310) T protein:vir:97 310 N 310 (310) T ss_pred C Confidence 3 No 190 >protein:vir:94528 Length: 286 # NCBI annotation: major head protein # Family: family:all:3269 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223889;genbank:gi:62327101;genbank:GeneID:5075544 Probab=96.82 E-value=0.00032 Score=39.77 Aligned_cols=260 Identities=17% Similarity=0.196 Sum_probs=135.9 Q ss_pred Ccc--c----chhHHHHHHHHHHHHHHhhccchhhhcccc-ccccC-CcEEEEEeccc--cccccccC-----CCCccCC Q lcl|Aclame:pro 1 MAF--N----NFIPELWSDMLLEEWTAQTVFANLVNREYE-GIASK-GNVVHIAGVVA--PTVKDYKA-----AGRQTSA 65 (273) Q Consensus 1 MA~--~----~~~pev~~~~v~~~l~~~~v~~~~~~~d~~-~~~~~-Gdtv~ip~~~~--~~~~d~~~-----~~~~~~~ 65 (273) |+- + -.+.+.|+.-+.+.|+.+.+|.+.+.- .+ ..+.+ .++.---+... +.+.+|.. .|. .+. T Consensus 1 m~t~N~n~avr~Y~Kqf~glL~~vf~~qa~F~~~fgg-lQalDGV~~N~tafsvKt~D~pVVig~Y~TdeNv~FGt-gTg 78 (286) T protein:vir:94 1 MATTNNDLPVRVYSKEFLQLLSTVYQAQSVFTPTFGA-LQALDGVPNNATAFSVKTNDMAVVVGEYSTDANTAFGT-GTS 78 (286) T ss_pred CCCCccccceeehhHHHHHHHHHHHhhHHHhhhhhcc-hhhhhCCCccceEEEEeecCcceEEecccCCCcccccc-CCc Confidence 552 2 135788889999999999988776532 11 12222 23221112111 12233431 111 111 Q ss_pred cccccceEE--EEEEee-e-eceeEec-hHHHHHhHHHHHH----HHHHHHHHHHHHHHHHHHHHHHhhcccccccccCC Q lcl|Aclame:pro 66 DAISDTGVD--LLIDQE-K-SIDFLVD-DIDRVQVAGSLEA----YTRAGATALATDTDKFIADMLVDNGTALTGSAPSD 136 (273) Q Consensus 66 ~~~~~~~~~--~tid~~-~-~~~~~i~-d~d~~~~~~~~~~----~~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~ 136 (273) .....+... +-.|.. + .+++.|- -+|..-.+.++++ .++.++.+-.+.+|..+-..+...+... ... T Consensus 79 ~SsRFG~rkEi~y~dtdV~Y~~~~~iHEGiD~~TVNnd~~aaVAdRL~lQA~Akt~~~n~~~Gk~ls~~A~~t---~~~- 154 (286) T protein:vir:94 79 NSSRFGEMKEVIYADTDVPYTAGWAIHEGLDQMTVNNDLDAAVADRLNLQAQAKTRLFNVAMGEALATAGTDL---GAV- 154 (286) T ss_pred cccccCceeeEEeecccccccccchhhhccccccccCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhh---hhh- Confidence 111111111 111111 1 1333322 2344334444433 3455677777888876655554443321 111 Q ss_pred HhHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCC Q lcl|Aclame:pro 137 ADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD 216 (273) Q Consensus 137 ~~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~ 216 (273) +.+...|..+...+-...|.. ..-.+|+|+.|..|...+- .......+. .+-+--|-++-||.+.+.+.-...+ T Consensus 155 -D~V~~LF~~as~~yvn~ev~~-~~~ayV~~evYnaiiD~~l--~TsaK~Ssa-NiDengi~~FkGf~i~e~P~~~~~g- 228 (286) T protein:vir:94 155 -DDVNALFESAVEKYTDLEVIA-PVRAYVTASVYNAIIDLAN--VTTAKNSAV-NIDTNGMLSFRGIAITKVPTQYMGG- 228 (286) T ss_pred -hhHHHHHHHHHHHhhhhheee-eeEEEEchhHHHHHhcccc--cccccccee-eeccCCcceecceEEeecchhhccC- Confidence 344445666666666666633 2348999999999986542 222222222 2333445689999999887432332 Q ss_pred cEEEEEeCceEEEEE-ecceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 217 EQFVAFHPSAAAYVS-QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 217 ~~~~~~~~~a~~~~~-~~~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ...++.++.++-+- -+........++.-|...++-=-||-.+++..+..+++++-- T Consensus 229 -~~aifs~dnig~aftGIn~aR~IesEdF~GValQgAGK~G~~I~edNk~Ai~~~~~k 285 (286) T protein:vir:94 229 -KAVIFAPDNVARVFTGINIARTIQAIDFAGVELQGAGKYGTFILDDNKKAIFTATPK 285 (286) T ss_pred -ceEEEccccceeeeccceeeeeeeccccCceeeeccccccccccccCceeEEEeecC Confidence 24556666665442 222233444555557888888999999999998888866555 No 191 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=96.76 E-value=0.00036 Score=39.49 Aligned_cols=264 Identities=12% Similarity=0.010 Sum_probs=114.7 Q ss_pred Cccc------chhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccCCcc-cccceE Q lcl|Aclame:pro 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADA-ISDTGV 73 (273) Q Consensus 1 MA~~------~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~-~~~~~~ 73 (273) |+.- .+.|.-....+++.|.+..-+..+ ..+....|.+.++++...++.+....-+....... .+...+ T Consensus 25 m~alTLaea~~l~~d~~~~~VIE~l~~~s~iL~~----lpf~~ve~~~~~~~r~~~lp~a~~r~~n~~~~~~~~~Tf~q~ 100 (330) T protein:vir:94 25 MPTVTLAESAKLSQDHLVSGLIETIVEVNPLYEM----MPFTEIEGNALAYNRENVLGDVQFLAVGGTITAKNPATFTKV 100 (330) T ss_pred hhhhhhhHHhhcCchhhHHHHHHhhhccchHHhh----cccccccCCcceeeeeecCCcceeeeccccccccCcceeeee Confidence 4421 122444556666666544322211 11111234556666655544433322222222221 122233 Q ss_pred EEEEEeeeeceeEechHHHHHhH---HHH-HHHHHHHHHHHHHHHHHHHHH----------HHHh-hcccccccccCCHh Q lcl|Aclame:pro 74 DLLIDQEKSIDFLVDDIDRVQVA---GSL-EAYTRAGATALATDTDKFIAD----------MLVD-NGTALTGSAPSDAD 138 (273) Q Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~---~~~-~~~~~~~~~ala~~iD~~~~~----------~~~~-~~~~~~~~~~~~~~ 138 (273) +..+.-. .-.+.|+. ..+... .+. ..+.++..++|.++....++. ++.. .+.....+.+.++. T Consensus 101 t~~l~~l-~~~~~Vd~-~iadl~g~~~d~~~~q~~~~ieal~~~~e~~linGDs~~~~F~GL~~~~~~~q~i~tg~~gg~ 178 (330) T protein:vir:94 101 TSELTTL-IGDAEVNG-LIQATRSDFMDQTSVQVASKAKSIGRQYQASMITGDGTGNSFQGMMGLVAASQTISAGANGGT 178 (330) T ss_pred eechhhh-hhhHHHHH-HHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccchhhcCCcccEEecCCCCCC Confidence 3332111 11122221 122222 232 345666777888777665554 1110 01111001011111 Q ss_pred HHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeee-eeeecceEEEEecccccCCC- Q lcl|Aclame:pro 139 DAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGT-IGNLLGARIVESNNLRDTDD- 216 (273) Q Consensus 139 ~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~-ig~i~G~~i~~s~~l~~~~~- 216 (273) -.++++.++.....+. +...-+++++..+...+..-.+...+...... ....-|. |-.+.|++|+.++.+|.+.. T Consensus 179 ~T~d~LDeLl~~v~~~--~g~~~~~l~n~a~~r~I~a~~R~~~~~~v~~~-~~~~~G~~v~~~~GvPi~~~d~ip~~~~~ 255 (330) T protein:vir:94 179 LTFELLDQLLDLVKDK--DGQVDYLMSSFAMRRKYFSLLRALGGAAIGEV-MTLPSGRQIPTYRGVPWFVNDFIPSNMTQ 255 (330) T ss_pred CCHHHHHHHHHHhcCC--CCCCcEEEechhHHHHHHHHHHhccCCCCCCc-ccccCCCEEeeeCCeEEEecccccCCCCc Confidence 2234444433332111 22334888888877777543333222222111 1111243 56789999999999886421 Q ss_pred ------cEEEEEeC-------ceEEEEE---ecceeeecc-CCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 217 ------EQFVAFHP-------SAAAYVS---QIDTVEALR-DQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 217 ------~~~~~~~~-------~a~~~~~---~~~~ve~~~-~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ..+++..- +..|+-. ....|+.-- ...+-..-+...++||..++.|+++.+|+.-.- T Consensus 256 ~~~~~ttsIyav~~G~~~~~qgV~Gl~~~g~~glsVr~~G~~~~k~v~~~~v~~y~~~av~~~~a~~~L~~V~~ 329 (330) T protein:vir:94 256 GTATNATAIFAGTFDDGSNKYGIAGLTARGSAGLRVQNVGAKENADETITRVKMYCGFANFSQLGLAAIKGLIP 329 (330) T ss_pred ccCCCceeEEEEeecccccccceEeecCCCCCcceeeeCCCccccceeeEEEEEeeeeEEechhheeeeccccC Confidence 23344431 1123211 122332211 112222334557899999999999999976544 No 192 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=96.02 E-value=0.00023 Score=40.54 Aligned_cols=257 Identities=15% Similarity=0.111 Sum_probs=98.6 Q ss_pred Ccccch-hHHHHHHH----HHHHHHHhhcc----------chhhhccccc--ccc---CCcEEEEEec-cccccccccCC Q lcl|Aclame:pro 1 MAFNNF-IPELWSDM----LLEEWTAQTVF----------ANLVNREYEG--IAS---KGNVVHIAGV-VAPTVKDYKAA 59 (273) Q Consensus 1 MA~~~~-~pev~~~~----v~~~l~~~~v~----------~~~~~~d~~~--~~~---~Gdtv~ip~~-~~~~~~d~~~~ 59 (273) +....- ....+... .+...+..... +.+....... ... .+-++..... ......+...+ T Consensus 184 ~~~e~r~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~e~~~~ 263 (480) T protein:vir:40 184 ERKFMRELGSKMAEMPEQGFLREFANGADLNVVNSLGSITSKYARKSGIYDGAMKARFQGLTLAEDGVDDTFISGTFKAG 263 (480) T ss_pred hhHHHHHHHHHhccchhhhhhhhhhhhccccccccccccccchhhheeechhhhhhhhhcceeeeccccceeeeeeeecc Confidence 111100 00111110 01111111000 0000000000 000 0000000000 00001111111 Q ss_pred CCccCCcccccceEEEEEEee-eeceeEechHHHH--HhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhccc------- Q lcl|Aclame:pro 60 GRQTSADAISDTGVDLLIDQE-KSIDFLVDDIDRV--QVAGSLEAY-TRAGATALATDTDKFIADMLVDNGTA------- 128 (273) Q Consensus 60 ~~~~~~~~~~~~~~~~tid~~-~~~~~~i~d~d~~--~~~~~~~~~-~~~~~~ala~~iD~~~~~~~~~~~~~------- 128 (273) +...... ..+ ..++... .+.-..+...... .-..+++++ .+++++.++.+.++.++.--...... T Consensus 264 ~~~~~~~--~~~--~~~~~~~~v~~l~~~~k~t~~lLDDa~~l~~~i~~~l~~~~~~~ee~a~l~G~g~g~~~~~g~~~~ 339 (480) T protein:vir:40 264 TDKNKSQ--TAT--KRSLRPQMAEAYLQMDKATVRGVNDSGALSEYVMSEMVNRVIQKVEYNMILGSVDGSNGFYGLKTA 339 (480) T ss_pred ccccccc--ccc--cchhhHHHHHHHHHhHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceee Confidence 1110000 000 0111100 0000001111111 111235554 45677788888877665431010000 Q ss_pred -ccccccCCHhHHHHHHHHHHHHHhhcCCCcCCc-EEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEE Q lcl|Aclame:pro 129 -LTGSAPSDADDAFDLIASALKELTKANVPNVGR-VVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIV 206 (273) Q Consensus 129 -~~~~~~~~~~~~~~~i~~a~~~l~~~~vp~~~r-~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~ 206 (273) ...+...+.++.++.|..+ |.+..- .+. .+|++|..+..|.+-.+.-.+ +-.+..+..|....++|++++ T Consensus 340 ~~~~~~~~~~~d~id~L~~a---l~~~y~--~~a~~~vmn~~t~~~I~klKD~~G~---Yi~q~~~~~~~~~~llG~pvv 411 (480) T protein:vir:40 340 TDGWTKQIEYTDLFEGITDA---VAECSI--SDAITIVMSPQTFAELRKAKGTDGH---SRFNELATKEQIAQSFGAVNL 411 (480) T ss_pred cccccccchhHHHHHHHHHh---hhHHhh--CCCCEEEECHHHHHHHHHhhcCCCC---eeccCcccccCcceeccccee Confidence 0011223344444444443 332221 223 578999999999665432121 223345667888899999988 Q ss_pred Eecc-cccCCCcEEEEEeCceEEEEEecceeeeccC--CCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 207 ESNN-LRDTDDEQFVAFHPSAAAYVSQIDTVEALRD--QDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 207 ~s~~-l~~~~~~~~~~~~~~a~~~~~~~~~ve~~~~--~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) +++. +|. + ...+.....++.+..+. ++.+++ ...-...+....+.|..+.+|+++..++..|| T Consensus 412 ~~~~~~~~-~-~~~~~~~~~~~~~~d~~--~~~~~~~~~~~~~~~~~~e~~v~g~~~~~~~~~~~~~~~~ 477 (480) T protein:vir:40 412 ETRVWMPK-D-EVAVYNHDEYVLIGDLN--VENYNDFDLRYNVEQWLSETLVGGSIRGKNRSAYLKKKGS 477 (480) T ss_pred eeeccccC-C-cceeeeCCccEEEEecc--cceecccccccchhhhhhhhhhceeeEccccEEEEEeccC Confidence 7643 332 1 11122222333344432 333322 22224566777889999999999999999999 No 193 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=95.32 E-value=0.0023 Score=35.05 Aligned_cols=258 Identities=9% Similarity=0.011 Sum_probs=119.6 Q ss_pred Cccc-----chhHHHHH---HHHHHHHHHhhccchhhhccccccccCC-cEEEEEecccccccccc-CCCCccCCccccc Q lcl|Aclame:pro 1 MAFN-----NFIPELWS---DMLLEEWTAQTVFANLVNREYEGIASKG-NVVHIAGVVAPTVKDYK-AAGRQTSADAISD 70 (273) Q Consensus 1 MA~~-----~~~pev~~---~~v~~~l~~~~v~~~~~~~d~~~~~~~G-dtv~ip~~~~~~~~d~~-~~~~~~~~~~~~~ 70 (273) |..+ .|..+.|. ..+.+.....++...++... ..+..| .++.++.....+.+... .....+..-+... T Consensus 24 ~~da~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~--~~~~~~~~~~~~~~~~~~G~a~~~~d~~~dip~v~~~~ 101 (319) T protein:vir:10 24 KQDAAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVT--TELSPTDKTFEYMTFDKVGTAQIIADYTDDLPLVDALG 101 (319) T ss_pred hhhhhhhhhhHHHHHHHHHHHHHHhhhhcceechhhcccc--cCCCCceEEEEeeeeccccceeeecCccccccceeccc Confidence 1111 33343333 23333333334444444322 122333 46677776555544322 2222333334555 Q ss_pred ceEEEEEEeeeeceeEechHHHHHh--HH-HHH-HHHHHHHHHHHHHHHHHHHH---------HHHhhccc---cc---c Q lcl|Aclame:pro 71 TGVDLLIDQEKSIDFLVDDIDRVQV--AG-SLE-AYTRAGATALATDTDKFIAD---------MLVDNGTA---LT---G 131 (273) Q Consensus 71 ~~~~~tid~~~~~~~~i~d~d~~~~--~~-~~~-~~~~~~~~ala~~iD~~~~~---------~~~~~~~~---~~---~ 131 (273) +.....+-. .+.++.++..|.... .+ +++ +....++++++.+.|+-+|- ++...+.. .. . T Consensus 102 ~~~~~~i~~-~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~g~~GLlN~p~~~~~~~~~~~~ 180 (319) T protein:vir:10 102 TSEFGKVFR-LGNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQLVNRLVFKGSAPHKIVSVFNHPNITKITSGKWID 180 (319) T ss_pred eeeEEEEEE-EEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccceeEEeCCCceeeecCCCCC Confidence 566666633 355666665554433 22 343 34556677888888775441 11110000 00 0 Q ss_pred cccCCHhHHHHHHHHHHHHHhhc--CCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEec Q lcl|Aclame:pro 132 SAPSDADDAFDLIASALKELTKA--NVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESN 209 (273) Q Consensus 132 ~~~~~~~~~~~~i~~a~~~l~~~--~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~ 209 (273) ....+..+++++|..+...+..+ ++- ..-.|+++|+.|..|..- ..+.+...- +-++ -+.-+.+|+..+ T Consensus 181 ~~t~t~~~i~~di~~~~~~l~~~s~g~~-~p~~L~L~p~~~~~L~~~---~~~~~~t~l-~~lk----~~~~~l~I~~~p 251 (319) T protein:vir:10 181 VSTMKPETAEAELTQAIETIETITRGQH-RATNILIPPSMRKVLAIR---MPETTMSYL-DYFK----SQNSGIEIDSIA 251 (319) T ss_pred ccccCHHHHHHHHHHHHHHHHHhcCcee-eceEEEecHHHHHhhhcc---cCCCCeeHH-HHHH----HhcCCceEEEee Confidence 11234568899999988888643 331 223789999999988421 111110000 0111 112345666666 Q ss_pred ccccCCC---cEEEEEe--CceEEEEEecc-eeeeccCCCcceeeEEeeeee-eeEEEcCceEEEEecC Q lcl|Aclame:pro 210 NLRDTDD---EQFVAFH--PSAAAYVSQID-TVEALRDQDSFSDRIRALHVY-GGKVVRPTGVVVFNKT 271 (273) Q Consensus 210 ~l~~~~~---~~~~~~~--~~a~~~~~~~~-~ve~~~~~~~~~~~v~~~~~~-g~~vl~p~~~v~~~~~ 271 (273) .+...++ ...+++. +.-+.++.... .+.. .........+....++ |+-+.+|.+++.+.-= T Consensus 252 el~~ag~~g~~~~v~y~~~~~~~~~~v~~~~~~~~-~e~~~l~~~~~~~~r~~Gv~i~~P~ai~~~dGI 319 (319) T protein:vir:10 252 ELEDIDGAGTKGVLVYEKNPMNMSIEIPEAFNMLP-AQPKDLHFKVPCTSKCTGLTIYRPMTIVLITGV 319 (319) T ss_pred eecccCCCcceEEEEEecCCceEEEecCcceeeee-eeecCceEEEeeeeeeEEEEEEccceeEeeecC Confidence 6554322 1123332 23232221111 1111 1122234566555554 4788899999977655 No 194 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=95.25 E-value=0.0024 Score=34.90 Aligned_cols=261 Identities=11% Similarity=0.069 Sum_probs=103.7 Q ss_pred Cccc---------------------------chhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEecccccc Q lcl|Aclame:pro 1 MAFN---------------------------NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTV 53 (273) Q Consensus 1 MA~~---------------------------~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~ 53 (273) |+++ .+.|+++.. +++..++...+.+.+..- ...-.+.+|++++.... T Consensus 1 ~~~~~~~~~~~n~~~~~i~k~~it~~~l~~g~L~p~~a~~-Fl~~v~~~t~iL~~~r~~----~~~s~~~ei~kig~G~r 75 (360) T protein:vir:99 1 MSSNSTIDSVRNQNMNSLSQKDIGLAELDGFQLPVDVTEE-FLERMQKGVQILGMADTM----TLARLEMEVPQFGVPRL 75 (360) T ss_pred CcchhHHHHHhhhHHHHHHhhhccccccCceeecHHHHHH-HHHHHhhccchhhhccee----eccccccccccccccee Confidence 3331 334776644 455556665565555321 12234556666655211 Q ss_pred --ccccCCCCccCCcccccceEEE-EEEeeeeceeEechHHHHH-hHHH---HHH-HHHHHHHHHHHHH-------HHH- Q lcl|Aclame:pro 54 --KDYKAAGRQTSADAISDTGVDL-LIDQEKSIDFLVDDIDRVQ-VAGS---LEA-YTRAGATALATDT-------DKF- 117 (273) Q Consensus 54 --~d~~~~~~~~~~~~~~~~~~~~-tid~~~~~~~~i~d~d~~~-~~~~---~~~-~~~~~~~ala~~i-------D~~- 117 (273) .....++........+..+++. ..++. ...+.++..+... .... .+. +....+..+++.+ |.+ T Consensus 76 ~~r~~~e~~~~~~~~~~~~~~v~~~~~~~~-~~~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l~~~g~~ds 154 (360) T protein:vir:99 76 SGHTRDEEGSRTENSEAESGSVKFNATDKS-YYILVEPKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLMGIRAGASS 154 (360) T ss_pred eccccccCCCCCcCCcCccccCccccccce-eeEeechHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHHHhhccchh Confidence 1111111111112233333333 22222 2333444333222 1211 111 1112222222221 111 Q ss_pred ------------------HHHHHHhhcccc------c------------ccccCC---------HhHHHHHHHHHHHHHh Q lcl|Aclame:pro 118 ------------------IADMLVDNGTAL------T------------GSAPSD---------ADDAFDLIASALKELT 152 (273) Q Consensus 118 ------------------~~~~~~~~~~~~------~------------~~~~~~---------~~~~~~~i~~a~~~l~ 152 (273) .++++....... . .+.+.. .......|.++...|. T Consensus 155 ~d~~~~~~~d~fl~~~dGwlKka~~~~~~id~a~d~t~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~lf~~~~~~Lp 234 (360) T protein:vir:99 155 GNLQSIGGAAELDNTFKGWIARAEGDAQSVDDAGDSTRIGLEDTATADADSMPSIANTDGSGNPQPVDTSLFNETIQTLD 234 (360) T ss_pred cccccCcccchhhhhhHHHHHHhhcccchhhccccccccccccccccccccchhhhccccccccccchHHHHHHHHHhcc Confidence 122211000000 0 000000 0001223455665554 Q ss_pred hcCC--CcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCCcEEEEEeCceEEEE Q lcl|Aclame:pro 153 KANV--PNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYV 230 (273) Q Consensus 153 ~~~v--p~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a~~~~ 230 (273) ..-- |....+++++|..+...... +.+....-++.++..+..-.+.|++++..+.+|... ++..++.-+.+. T Consensus 235 ~kyr~~~~~~~~~~~s~~~~~~yr~~---L~~R~t~LGd~~l~g~~~~~~~Gipi~~v~~~pd~~---~mlT~p~NLi~g 308 (360) T protein:vir:99 235 SRYRESDAYSPVLMTSPNQVQSYTMS---LTEREDPLGSAVIFGDSDITPFSYDLVGVNGFPDEY---MMFTDPNNLAFG 308 (360) T ss_pred hhhhcCcccceEEEccCchHHHHHHH---HhccCcccchhheecccccccceeeeEEcCCCCCCc---eEEeccCceeEE Confidence 4321 11122677888766655432 222222223344443333357899999999998643 677777777654 Q ss_pred -Eeccee----eeccCCCcceeeEE-eeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 231 -SQIDTV----EALRDQDSFSDRIR-ALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 231 -~~~~~v----e~~~~~~~~~~~v~-~~~~~g~~vl~p~~~v~~~~~~s 273 (273) .+...+ |..+..++.-..++ .+..+.+.+-+++++|++.--.. T Consensus 309 ~~~~iri~~~~e~~~~~~~~~~~~~~~~~~~D~~iee~~Av~~vt~~~~ 357 (360) T protein:vir:99 309 LYEEMELDQSTDTDKVHEQRLHSRNWLEGQFDFQIKEQQAGVLVTDLET 357 (360) T ss_pred eeeeeEEeecccchhhhhhceeeeEEEEEEeeEEEEecccEEEEecCCC Confidence 222222 32222222222222 23344455555667776643222 No 195 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=94.31 E-value=0.0048 Score=33.31 Aligned_cols=256 Identities=8% Similarity=0.011 Sum_probs=122.4 Q ss_pred Ccccch---hHHHHHHHHHHHHHHhhccchhhhccccccccCC-cEEEEEeccccccccccCCCCccCCcccccceEEEE Q lcl|Aclame:pro 1 MAFNNF---IPELWSDMLLEEWTAQTVFANLVNREYEGIASKG-NVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLL 76 (273) Q Consensus 1 MA~~~~---~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~G-dtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~t 76 (273) +++..| ..+.+...+.+.....+....++.... .+..+ ++++++.....+.+...+.+.+....+...+....+ T Consensus 49 ~~~~~i~a~~~~~i~~~vy~~~~~~~~~~~l~pv~t--~g~w~~~t~~y~~~e~~G~a~~ygd~ad~Pl~~~~v~~~~~~ 126 (339) T protein:vir:94 49 TANAGIPAWMTTFVDRRVIDIQLAPMAAAKIFPEVK--KGDWTTTYGVFIIAEPVGQVATYSDWSANGMSKANVNFESRQ 126 (339) T ss_pred ccccchhhhhhhhhchhheeecccccchhhhccccc--CCCCcccEEEEeeeecccceEEcccccCCCcccccceeeEEe Confidence 333322 122222333344444444555554322 23333 589999987776655444444332333333333333 Q ss_pred EEeeeeceeEechHHHHHhHH---HHH-HHHHHHHHHHHHHHHHHHH---------HHHHhhccc---cccc---ccCCH Q lcl|Aclame:pro 77 IDQEKSIDFLVDDIDRVQVAG---SLE-AYTRAGATALATDTDKFIA---------DMLVDNGTA---LTGS---APSDA 137 (273) Q Consensus 77 id~~~~~~~~i~d~d~~~~~~---~~~-~~~~~~~~ala~~iD~~~~---------~~~~~~~~~---~~~~---~~~~~ 137 (273) + .....++.+...|+..... ++. ...+.+++++.+++|+-.+ +++. .++. ++.+ +..|+ T Consensus 127 v-~~~~~g~~y~~~E~~~A~~~g~~l~~~Ka~aA~~al~~~~N~i~~~Gd~~~~~~GLlN-~P~l~~~v~~s~~Wa~kT~ 204 (339) T protein:vir:94 127 N-YRYQTWTEYGDLEMATYGEAGIDYVARQEISASLVMAKFANSSYLLGVAGIANYGLMN-DPSLPAPVAATVNWATAAP 204 (339) T ss_pred E-EEEEEEEeecHHHHHHHHhhCCChHHHHHHHHHHHHHHhhceEEeeeecccceEEEEe-CCCccccccCCCCcccCCH Confidence 3 2235667777666554322 332 3344556667777765322 1110 0111 1111 22567 Q ss_pred hHHHHHHHHHHHHHhhcC----CCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEeccccc Q lcl|Aclame:pro 138 DDAFDLIASALKELTKAN----VPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRD 213 (273) Q Consensus 138 ~~~~~~i~~a~~~l~~~~----vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~ 213 (273) ..++++|..+...+..+- -+.....|+++|..+..|..- +.. + .... +-++ .++-+++|+..+.+.. T Consensus 205 ~eI~~Di~~~~~~l~~~s~g~~~~~~~~~L~LP~~~~~~L~~~-n~~-~--~Tvl-~~lk----~n~pnl~i~~~~el~~ 275 (339) T protein:vir:94 205 EDIANDVVAMVGRLISQSGGLITGQERMVMALAPSALNNVNRT-NNF-G--LSAG-AKIA----QTYPNIQFVAVPEFDT 275 (339) T ss_pred HHHHHHHHHHHHHHHHhcCCeeeeccCcEEEecHHHHHhcccC-CcC-C--ccHH-HHHH----HhcCCcEEEEcccccc Confidence 788999998888875553 122334799999999988432 110 0 0000 0111 1123466776666654 Q ss_pred CCCcEEEEEeCc-------eEEEEEecceeeeccCCCcceeeEEeeee-eeeEEEcCceEEEEecC Q lcl|Aclame:pro 214 TDDEQFVAFHPS-------AAAYVSQIDTVEALRDQDSFSDRIRALHV-YGGKVVRPTGVVVFNKT 271 (273) Q Consensus 214 ~~~~~~~~~~~~-------a~~~~~~~~~ve~~~~~~~~~~~v~~~~~-~g~~vl~p~~~v~~~~~ 271 (273) .++.....+... .+.+..++......+ ....+.+....+ .|+-+.+|.+++.+.-= T Consensus 276 a~g~~~~~~~~~~~~~~~~~~~~p~~~~~lpvq~--~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 339 (339) T protein:vir:94 276 ASGRLVQLWVPEVNGQPTGEVAFAEKLRSHSIER--YSTTTRQKHSGATFGAVIYQPWAVTQELGV 339 (339) T ss_pred CCCceEEEEEEeccCCcceEEEcchhhhccccEE--cCceEEecceeeeeeEEEEccceeeeeecC Confidence 443333222211 122222221111112 223456666666 56788899988866544 No 196 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=94.09 E-value=0.0054 Score=33.01 Aligned_cols=259 Identities=12% Similarity=0.076 Sum_probs=123.6 Q ss_pred Cccc---chhHHHHH---HHHHHHHHHhhccchhhhccccccccCC-cEEEEEeccccccccccCC-CCccCCcccccce Q lcl|Aclame:pro 1 MAFN---NFIPELWS---DMLLEEWTAQTVFANLVNREYEGIASKG-NVVHIAGVVAPTVKDYKAA-GRQTSADAISDTG 72 (273) Q Consensus 1 MA~~---~~~pev~~---~~v~~~l~~~~v~~~~~~~d~~~~~~~G-dtv~ip~~~~~~~~d~~~~-~~~~~~~~~~~~~ 72 (273) |=+. .|..+.|. .++.+.+...++...++... ..+..| .++.++.....+.+..... ..++...+...+. T Consensus 1 ~~~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~--~~~~~~~~~~~~~~~~~~G~~~~~~~~~~dip~~~~~~~~ 78 (301) T protein:vir:80 1 MQGKITATIEARDLQAIDNVIYEPKQEELTARSVFPQK--FDVNEGAESYSFDVMTRSGAAKIIANGADDLPLVDVDMVR 78 (301) T ss_pred CCccccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccc--cCCCCceEEEEEeeeccceeEEEecCccccccccccccee Confidence 5552 34444443 44445555555544444222 123333 5677777766665553322 2334444555566 Q ss_pred EEEEEEeeeeceeEechHHHHHh--HH-HHH-HHHHHHHHHHHHHHHHHHHHH---------HHhhcc--ccc---c--- Q lcl|Aclame:pro 73 VDLLIDQEKSIDFLVDDIDRVQV--AG-SLE-AYTRAGATALATDTDKFIADM---------LVDNGT--ALT---G--- 131 (273) Q Consensus 73 ~~~tid~~~~~~~~i~d~d~~~~--~~-~~~-~~~~~~~~ala~~iD~~~~~~---------~~~~~~--~~~---~--- 131 (273) ....|-. .+.+|.+...|.... .+ +++ +....++++++++.|+.+|-= +..... ... + T Consensus 79 ~~~~i~~-~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~~g~~GLlN~p~~~~~~~~~~~~~~ 157 (301) T protein:vir:80 79 KSVPIYS-IGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKKYAIKGAFEATGIQIDVSPTTGVGN 157 (301) T ss_pred EEEEEEE-EEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeecccccceeeecCCCcccccccCccccc Confidence 6666644 355666665544432 22 343 355667788888888755411 111000 000 0 Q ss_pred ---cccCCHhHHHHHHHHHHHHHhhc--CCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEE Q lcl|Aclame:pro 132 ---SAPSDADDAFDLIASALKELTKA--NVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIV 206 (273) Q Consensus 132 ---~~~~~~~~~~~~i~~a~~~l~~~--~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~ 206 (273) =...++..++++|.++...+..+ ++- ..-.|+++|+.|..|..- ...+ ..+...+ +-...+.-+..|+ T Consensus 158 ~~~w~~~t~~ei~~di~~~~~~l~~~s~g~~-~p~~L~L~p~~~~~L~~~--~~~~---~~~~tvl-~~l~~~~~~~~I~ 230 (301) T protein:vir:80 158 VSKWEKKTAEQIIDEIGEAHTKITVLPGYGT-ASLKLCLPPKQFELINKK--RYSN---EDSRSVL-KVLQDNAWFSAIV 230 (301) T ss_pred ccccccCCHHHHHHHHHHHHHHHHHhcCcee-cccEEEecHHHHHhhhhc--cccC---CCCeeHH-HHHHHHcCcceEE Confidence 02235677899999999888654 321 223699999999988421 0100 0111010 0000112234555 Q ss_pred EecccccCC--C-cEEEEEeCc--eEEEE--EecceeeeccCCCcceeeEEeeeee-eeEEEcCceEEEEecC Q lcl|Aclame:pro 207 ESNNLRDTD--D-EQFVAFHPS--AAAYV--SQIDTVEALRDQDSFSDRIRALHVY-GGKVVRPTGVVVFNKT 271 (273) Q Consensus 207 ~s~~l~~~~--~-~~~~~~~~~--a~~~~--~~~~~ve~~~~~~~~~~~v~~~~~~-g~~vl~p~~~v~~~~~ 271 (273) ..+.+...+ + ..++++..+ -+.+. ..+.....++.. ....+....++ |+-+.+|++++.+.-= T Consensus 231 ~~p~L~~~g~~g~~~~v~~~~~~d~~~~~v~~~~~~~~~e~~~--~~~~~~~~~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 231 RVPDLAGMGTAGSDSFAVIHDSNETAELIIPMDITRHPEEYSF--PRTKVPFEERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred EcceeccCCCCcccEEEEEecCCcEEEEEecCceeeecceecC--ceeEeeeeeeeEEEEEEccceEEEEecC Confidence 555554322 1 122333222 11111 111111111111 13344444444 6788899999987655 No 197 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=93.47 E-value=0.0074 Score=32.25 Aligned_cols=258 Identities=11% Similarity=-0.017 Sum_probs=121.0 Q ss_pred CcccchhHHHHHHHHHHHHHHhhccchhhhccccc-cccCCcEEEEEecccc-ccccccCCCCccCCcccccceEEEEEE Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEG-IASKGNVVHIAGVVAP-TVKDYKAAGRQTSADAISDTGVDLLID 78 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~-~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~~~~~~~~~~~tid 78 (273) |..+--.+..+...+...|.+..-..+-.++.+-. ...-..+-+....+.+ .+.+. .++.....+++...++++. T Consensus 1 m~it~~~l~~l~~~~~~~~~~~y~~a~~~~~~~a~~~~sdf~~~~~~~lg~~p~l~e~---~Ge~~~~~l~~~~~~i~~~ 77 (302) T protein:vir:10 1 MLINKQSLNAAFVAIKTIFNNAFAAAPTTWQKIAMEVPSNTSSNDYKWLSTFPKMRRW---IGAKVVKNLKAYKYVVENE 77 (302) T ss_pred CcccHHHHHHHHHHHHHHHHHHHHhhhhhhhceeeecCCCcceeeceecCCCCCcccc---ccceeeccccccceeEEee Confidence 88864334444444445554443222211111111 0111223333334432 22222 2445667788888888886 Q ss_pred eeeeceeEechHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc--------------------ccccc--- Q lcl|Aclame:pro 79 QEKSIDFLVDDIDRVQ-VAGSLEAYTRAGATALATDTDKFIADMLVDNGTAL--------------------TGSAP--- 134 (273) Q Consensus 79 ~~~~~~~~i~d~d~~~-~~~~~~~~~~~~~~ala~~iD~~~~~~~~~~~~~~--------------------~~~~~--- 134 (273) ++ ...+.|+-.+... ..+-+....+.++++-++..|..++.++....+.. +.+.. T Consensus 78 ~~-g~~v~i~R~~i~nDdlg~~~~~~~~~G~aaa~~~~~lv~~~L~~g~~~~~~DG~~fF~~dH~~g~~~~~N~g~~~~~ 156 (302) T protein:vir:10 78 DF-EATVEVDRNDIEDDQIGIYSPQAKMAGYSAAQLPDELVYEAVNGAFTKPCFDGQYFIDTDHPVGDASVSNKGTAPLS 156 (302) T ss_pred cc-cceecccHHhhcccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCCcccCCcceecccccccccccccccchhhh Confidence 54 5566666544321 12334566778889999999999999986532110 00000 Q ss_pred -CCHhHHHHHHHHHHHHHhh----cCCCc--CCcEEEECHHHHHHHhcchHHhhhhh-cccccceeeeeeeeeecceEEE Q lcl|Aclame:pro 135 -SDADDAFDLIASALKELTK----ANVPN--VGRVVVVNAEMAFWLRSSGSKLTSAD-TSGDAAGLRAGTIGNLLGARIV 206 (273) Q Consensus 135 -~~~~~~~~~i~~a~~~l~~----~~vp~--~~r~lvv~p~~~~~L~~~~~~~~~~~-~~~~~~~~~~G~ig~i~G~~i~ 206 (273) .......+.+.+++..|.+ .+-|- ..++|+|+|.....-.+- +.... ..+..+.++ |. ++++ T Consensus 157 ~~~~~l~~~~~~aa~~am~~~k~~~G~~L~i~P~~LiVp~~le~~A~~l---l~~~~~~~g~~Np~~-g~------~~~v 226 (302) T protein:vir:10 157 NASQAAAKAGYGAARTAMKKFKDEEGRSLNVSPNVLLVGPALEDVAKML---LTNPKLADNTPNPYV-GT------AELV 226 (302) T ss_pred hcccccchHHHHHHHHHHHHHhhhcccccccCCCEEEecchhHHHHHHH---hhccccCCCCcceec-cc------eEEE Confidence 0000111224444444433 23222 246899998866544321 11100 112233332 22 5888 Q ss_pred EecccccCCCcEEEEEeCceEEEE----EecceeeeccCCCcceeeEEeeeeeee------EEEcCceEEEEecCCC Q lcl|Aclame:pro 207 ESNNLRDTDDEQFVAFHPSAAAYV----SQIDTVEALRDQDSFSDRIRALHVYGG------KVVRPTGVVVFNKTGS 273 (273) Q Consensus 207 ~s~~l~~~~~~~~~~~~~~a~~~~----~~~~~ve~~~~~~~~~~~v~~~~~~g~------~vl~p~~~v~~~~~~s 273 (273) .++.+..+++ .+++..+..+-.. .+...++...+...-+-.++..+.||+ +-..|..+.-=+.++| T Consensus 227 v~p~L~s~~a-WyL~a~~~~i~~~~l~g~~~P~~~~~~~~~~dgv~~k~~~d~Gvd~R~~~G~~~wq~a~~s~g~~~ 302 (302) T protein:vir:10 227 VDGRIESDTA-WFLLDTTKPVKPFIFQPRKQPEFVSQVNLDSDDVFNLRKLKFGAEARAAAGYGFWQLAYGSTGTGA 302 (302) T ss_pred EeeccCCCCc-eEEEecCCccceEEEcCccccEEEeccCCCCCceEEEEEEEEeeeeeeecchhhhhhhhccCccCC Confidence 8888865443 4444455543211 122344444444444556666677774 3333333333344444 No 198 >protein:vir:79078 Length: 307 # NCBI annotation: gp8 # Family: family:all:908 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111208;genbank:gi:134288798;genbank:GeneID:4960752 Probab=91.65 E-value=0.015 Score=30.61 Aligned_cols=265 Identities=12% Similarity=0.160 Sum_probs=108.9 Q ss_pred Ccc-c-chhH-HHHHHHHHHHHHHh-hccchhhhccccccccCCcEEEEEecccccccccc-CCCCccCCccc---ccce Q lcl|Aclame:pro 1 MAF-N-NFIP-ELWSDMLLEEWTAQ-TVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYK-AAGRQTSADAI---SDTG 72 (273) Q Consensus 1 MA~-~-~~~p-ev~~~~v~~~l~~~-~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~-~~~~~~~~~~~---~~~~ 72 (273) |+. + .+.. -+++...+ .+++. .+-..++-+ . ..+..+.+++.++.-.+..++ ..+.-.....+ ..+. T Consensus 1 m~~~~~~~~~dp~LT~~A~-gy~n~~~Iad~lfP~-v---pV~~~~~k~~~f~~e~f~~~~t~ra~~~~~~~v~~~~~~~ 75 (307) T protein:vir:79 1 MGRLSKLRIVDPVLTNLAI-GYTNAEFIGQTLMPV-V---EVEKEGGKIPKFGKESFRLYQTERALRAKSNRMNPEDIDS 75 (307) T ss_pred CCCCCCCcccCHHHHHHHh-hccchhhhhhhcCCc-c---cccccccceeeeccccccccccccccCCCcceeeeecccc Confidence 553 2 2222 23444333 23322 221222111 1 111223344444332211111 11111111112 2234 Q ss_pred EEEEEEeeeeceeEechHHHHHhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhcc-----cc--cccc--cCCHhHHHH Q lcl|Aclame:pro 73 VDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGT-----AL--TGSA--PSDADDAFD 142 (273) Q Consensus 73 ~~~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~iD~~~~~~~~~~~~-----~~--~~~~--~~~~~~~~~ 142 (273) .++.++++ .....|+..+.....++++. .++.....|....+.-+..++..... .. +++. .....+.+. T Consensus 76 ~~~~~~~~-~l~~~id~r~~~~~~~~~~~~Av~~l~d~I~l~~E~~~A~l~~~~~~y~~~~k~tLsgt~~Wsd~~sDPi~ 154 (307) T protein:vir:79 76 VDVNLDEH-DLEYPIDYREDQESAFPLEQAAVQTATDAIQLRREKMIADLSQNPSSYAAGNKKQLSATEKFTAANSDPVG 154 (307) T ss_pred cccccccc-chhhcccchhcCCCCCCHHHHHHHHHHHHHHhHHHHHHHHHhccccccCCCceEEEccCcccCCCCCCcHH Confidence 45555543 33345555554444555433 34444444433333333333322211 11 1110 012334566 Q ss_pred HHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceE-EEEeccccc-------- Q lcl|Aclame:pro 143 LIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGAR-IVESNNLRD-------- 213 (273) Q Consensus 143 ~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~-i~~s~~l~~-------- 213 (273) +|.+++..+.+... .....++++++.+..|+..+.......... ...+..-.+.+++|++ |+.-..... T Consensus 155 di~~~~~ai~~~~g-~~Pn~~vlg~~a~~~l~~h~~i~~~lk~~~-~g~it~~~la~l~~v~~V~vg~a~y~~~~~~~~~ 232 (307) T protein:vir:79 155 VIEDGKEAIRTKIG-RRPNTMVIGASAYKTLKAHPQLIEKIKYSM-KGIVTVDLLKEIFEVENIAVGEAIYADDKDRFTD 232 (307) T ss_pred HHHHHHHHHHHhhC-CccceEEeCHHHHHHHhcCHHHHHHhcCcc-ccccCHHHHHHHhCceeEEEeeeeeecccccchh Confidence 78887777765432 233489999999999999987666554332 2222222345677876 333222211 Q ss_pred -CCCcEEEEEeCc------------eEEEEEec--ceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 214 -TDDEQFVAFHPS------------AAAYVSQI--DTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 214 -~~~~~~~~~~~~------------a~~~~~~~--~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ..+...+++.+. .+|+--|. ......+.+...++.|+.....--.++=|++=..|+.+.- T Consensus 233 iw~~~~~l~y~~~~~~~~~~~~~~ps~Gyt~~~~g~~~~d~~~~~~~~~~vrv~~~~~~~i~~~~~G~li~~~v~ 307 (307) T protein:vir:79 233 IWGANIVLAYVPLQRGGQQRTPYEPSYGYTLRKKGNPVVDTRIEDGKLELVRATDIFRPYLLGADAGYLISGING 307 (307) T ss_pred cCCCceEEEecccccCCCCCcccccccceeEEecCceEEecccCCCceeEEeecccccceeeccccchhhccCCC Confidence 111222333211 23333222 1122223344556677666666555555553333322222 No 199 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=91.08 E-value=0.018 Score=30.20 Aligned_cols=258 Identities=14% Similarity=0.042 Sum_probs=123.0 Q ss_pred Cccc------chhHHHH---HHHHHHHHHHhhccchhhhccccccccCC-cEEEEEecccccccc-ccCCCCccCCcccc Q lcl|Aclame:pro 1 MAFN------NFIPELW---SDMLLEEWTAQTVFANLVNREYEGIASKG-NVVHIAGVVAPTVKD-YKAAGRQTSADAIS 69 (273) Q Consensus 1 MA~~------~~~pev~---~~~v~~~l~~~~v~~~~~~~d~~~~~~~G-dtv~ip~~~~~~~~d-~~~~~~~~~~~~~~ 69 (273) |-.. .|..+.| -..+.+.....++...++.... .+..| +++.++.....+.+. |...+..+..-+.. T Consensus 1 ~~~~~a~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~v~~--~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~ 78 (296) T protein:vir:10 1 MGVDKADAAGIWTVKQLTASLNKAYETEYDQNSVVNLFPVSN--EIPGYAKYFEYPVFDGVGIAQIVADYTDDLPLVDAL 78 (296) T ss_pred CcccchhhhHHHHHHHHHHHHHHHHhhhhcccccceeccccc--CCCCceeEEEeeeeeccCceeEeCCCccccceeecc Confidence 5442 2223222 2333333344444444443222 22223 567777765554433 23323334344455 Q ss_pred cceEEEEEEeeeeceeEechHHHH--HhHH-HHH-HHHHHHHHHHHHHHHHHHHH---------HHHhhcc-ccc-cccc Q lcl|Aclame:pro 70 DTGVDLLIDQEKSIDFLVDDIDRV--QVAG-SLE-AYTRAGATALATDTDKFIAD---------MLVDNGT-ALT-GSAP 134 (273) Q Consensus 70 ~~~~~~tid~~~~~~~~i~d~d~~--~~~~-~~~-~~~~~~~~ala~~iD~~~~~---------~~~~~~~-~~~-~~~~ 134 (273) -+.....+.. .+.++.++..|.. ...+ +++ +....++++++...|+-+|- ++..... ... .+.= T Consensus 79 ~~~~~~~i~~-~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~~g~~GLlN~p~v~~~~~~~~W 157 (296) T protein:vir:10 79 ATERQGKVFR-FGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTAHGIPSVFDYPNINNVVSGGSW 157 (296) T ss_pred ceeEEEEEEE-EEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccceeEeecCCCccccccCCc Confidence 5566666633 3556666655543 3322 343 34556677888888775441 1111000 000 1111 Q ss_pred CCHhHHHHHHHHHHHHHhhc--CCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccc Q lcl|Aclame:pro 135 SDADDAFDLIASALKELTKA--NVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLR 212 (273) Q Consensus 135 ~~~~~~~~~i~~a~~~l~~~--~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~ 212 (273) .++++++++|.++...+..+ ++- ..-.++++|+.|..|...- .+.+..-- +-++ -+.-+.+|+..+.+. T Consensus 158 ~~~t~i~~Di~~~~~~l~~~s~g~~-~p~~l~L~p~~~~~L~~~~---~~~~~t~l-~~ik----~~~~~l~i~~~~~l~ 228 (296) T protein:vir:10 158 SQPTTAVSDITSLLDIIETSTNGQH-RATHLLLPTTARRIMQNLV---PGTSVSYG-EFFR----QNNSGVTVEFVQYLN 228 (296) T ss_pred cCHHHHHHHHHHHHHHHHHhhCcee-cceeEEeCHHHHHHHhhcc---CCCCccHH-HHHH----HhcCCceEEEeeeec Confidence 34567899999998877543 432 2236889999999885321 11110000 0111 122355666666554 Q ss_pred cCCC---cEEEEEe--CceEEEEEecc-eeeeccCCCcceeeEEeeeee-eeEEEcCceEEEE---ecC Q lcl|Aclame:pro 213 DTDD---EQFVAFH--PSAAAYVSQID-TVEALRDQDSFSDRIRALHVY-GGKVVRPTGVVVF---NKT 271 (273) Q Consensus 213 ~~~~---~~~~~~~--~~a~~~~~~~~-~ve~~~~~~~~~~~v~~~~~~-g~~vl~p~~~v~~---~~~ 271 (273) ..++ ...+++. +.-+.++.... .+-. .......+.+...... |+-+.+|.+++.+ +=+ T Consensus 229 ~a~~~g~~~~v~~~~~~~~~~~~v~~~~~~~~-~e~~~l~~~~~~~~~~~Gv~i~~P~ai~~~dGI~~~ 296 (296) T protein:vir:10 229 DYNGTGTSAAIAYEKDPNNMAIEIPEATNALP-AQPKDLHFKIPVTSKATGLIVYRPLTMAVMKGITFA 296 (296) T ss_pred cCCCCcceEEEEEEcCCceEEEEcCcceeeec-ccccCceEEEeeEeeEEEEEEECCceeEEEeeeecC Confidence 3322 1223333 33333322111 1111 1222345677777766 5899999999987 444 No 200 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=90.99 E-value=0.018 Score=30.14 Aligned_cols=259 Identities=10% Similarity=0.021 Sum_probs=113.8 Q ss_pred Cccc--------chhH---HHHHHHHHHHHHHhhccchhhhccccccccCC-cEEEEEecccccccccc-CCCCccCCcc Q lcl|Aclame:pro 1 MAFN--------NFIP---ELWSDMLLEEWTAQTVFANLVNREYEGIASKG-NVVHIAGVVAPTVKDYK-AAGRQTSADA 67 (273) Q Consensus 1 MA~~--------~~~p---ev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~G-dtv~ip~~~~~~~~d~~-~~~~~~~~~~ 67 (273) |+.. .|.. +.+-..+.+.....++...++.... .+..| ++++++.....+.+... .....+...+ T Consensus 26 ~~~~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~--~~~~~~~~~t~~~~~~~G~a~~~~d~~~dip~vd 103 (329) T protein:vir:79 26 LRGAKNDASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPVTS--ELSDTDKTFEYQTFDKVGHAKIIADYTDDLSTVD 103 (329) T ss_pred cccceeccchhhHHHHHHHHHHHHHHHhhhhcccchhhhccccc--CCCCceeEEEeeeeecceeeeeecCcccccceee Confidence 2221 1222 1122333333333444444443221 22233 57777777665554432 2222333344 Q ss_pred cccceEEEEEEeeeeceeEechHHHHHh--HH-HHH-HHHHHHHHHHHHHHHHHHH---------HHHHhhccc------ Q lcl|Aclame:pro 68 ISDTGVDLLIDQEKSIDFLVDDIDRVQV--AG-SLE-AYTRAGATALATDTDKFIA---------DMLVDNGTA------ 128 (273) Q Consensus 68 ~~~~~~~~tid~~~~~~~~i~d~d~~~~--~~-~~~-~~~~~~~~ala~~iD~~~~---------~~~~~~~~~------ 128 (273) +..+.....+.. ...++.++..|.... .+ +++ +..+.++++++.+.|+-+| .++..-... T Consensus 104 ~~~~~~~~~i~~-~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~g~~GLlN~p~v~~~~~~~ 182 (329) T protein:vir:79 104 ALMTSEFGKVFR-LGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQLVNHLVFKGSKPHKIISVFEHPNLTTINSAG 182 (329) T ss_pred cccceeEEEEEE-EEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEeecccccceeeecCCCccccccCC Confidence 455555555533 355566655554433 22 343 3455567778888776443 111110000 Q ss_pred --ccccccCCHhHHHHHHHHHHHHHhhc--CCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceE Q lcl|Aclame:pro 129 --LTGSAPSDADDAFDLIASALKELTKA--NVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGAR 204 (273) Q Consensus 129 --~~~~~~~~~~~~~~~i~~a~~~l~~~--~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~ 204 (273) ...-+..++..++++|.++...+..+ ++ ...-.|+++|+.+..|..-- .+.+...- +-+++ +.-.++ T Consensus 183 ~~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~-~~p~~L~Lpp~~~~~L~~~~---~~~~~tvl-~~lk~----~~~~l~ 253 (329) T protein:vir:79 183 WNNAAGTGKKPETAQDELEQAIEKIETLTNGQ-HRANMILIPPSMRKVLMVRM---PETTMSYL-DYFKQ----QNGGIT 253 (329) T ss_pred CCCccccccCHHHHHHHHHHHHHHHHHhcCce-ecccEEEecHHHHHHhhccc---CCCCccHH-HHHHH----hCCCcE Confidence 00112235677899999988888664 22 12236999999998884210 01110000 00110 112344 Q ss_pred EEEecccccCCC---cEEEEEeCc--eEEEE--EecceeeeccCCCcceeeEEeeeee-eeEEEcCceEEEEecCCC Q lcl|Aclame:pro 205 IVESNNLRDTDD---EQFVAFHPS--AAAYV--SQIDTVEALRDQDSFSDRIRALHVY-GGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 205 i~~s~~l~~~~~---~~~~~~~~~--a~~~~--~~~~~ve~~~~~~~~~~~v~~~~~~-g~~vl~p~~~v~~~~~~s 273 (273) |...+.+...+. ..++++..+ -+.+. ..+......+ ....+.+....++ |+-+.+|.+++.+.-=.- T Consensus 254 I~~~~el~~ag~~g~~~~v~y~~~~~~~~~~vp~~~~~l~~q~--~~~~~~v~~~~r~~Gv~i~~P~ai~~~dGI~~ 328 (329) T protein:vir:79 254 IESISELEDIDGAGTKAALVYEKDPMNMSIEIPEAFNMLTAQP--KDLHFKVPCTSKCTGLTIYRPLTLVLIKGLVV 328 (329) T ss_pred EEEcccccccCCCCceEEEEEecCCceEEEecCcceeeeecee--cCceEEEceeeeEEEEEEECcceeeeeeeeee Confidence 555554432221 122333222 22221 1111111111 2223455555554 478889998875522111 No 201 >protein:vir:107882 Length: 307 # NCBI annotation: gp34 # Family: family:all:908 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024707;genbank:gi:48696944;genbank:GeneID:2845970 Probab=90.56 E-value=0.02 Score=29.88 Aligned_cols=266 Identities=12% Similarity=0.171 Sum_probs=109.5 Q ss_pred Cc-cc-chhH-HHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccc-cCCCCccCCcccc---cceE Q lcl|Aclame:pro 1 MA-FN-NFIP-ELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDY-KAAGRQTSADAIS---DTGV 73 (273) Q Consensus 1 MA-~~-~~~p-ev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~-~~~~~~~~~~~~~---~~~~ 73 (273) |+ .+ .+.. -++....+.-.....+-..++-+ .. .. ..+.++|.++.-....+ +..+.-..++.++ .+.. T Consensus 1 m~~~~~~~~~dp~LT~~A~gy~n~~~ia~~l~P~-vp-v~--~~~~k~~~f~~eaF~~~~t~r~~~~~~~~v~~~~~~~~ 76 (307) T protein:vir:10 1 MGRLSKLRIVDPVLTNLAIGYTNAEFIGQSLMPV-VE-VE--KEGGKIPKFGKESFRLYKTERALRARSNRMNPEDLGSI 76 (307) T ss_pred CCCCCCCcccChhHHHHHHhhcchhhhhhhcCCc-cc-cc--ccccceeeECcccccchhhhcccCCCcceeeccccccc Confidence 44 32 2222 24445444333333322222211 11 11 12233344433221111 1111111112222 2233 Q ss_pred EEEEEeeeeceeEechHHHHHhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhcc-----c--cccccc--CCHhHHHHH Q lcl|Aclame:pro 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGT-----A--LTGSAP--SDADDAFDL 143 (273) Q Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~iD~~~~~~~~~~~~-----~--~~~~~~--~~~~~~~~~ 143 (273) +..+..+ .....++..+.....++.++ .++.....|....+.-+..++..... . .+++.. ....+.+.+ T Consensus 77 ~~~~~~~-~L~~~id~r~~~~~~~~~~~~av~~l~d~I~l~~E~~~A~l~~~~~~y~~~~k~tLsGt~~Wsd~~sDPi~d 155 (307) T protein:vir:10 77 DIVLDEH-DLEYPIDYREDQESAFPLEQAAVQTATEAIQLRREKMVADLAQNPNSYAGGNKKQLSATEKFTAAGSDPVGV 155 (307) T ss_pred ccccccc-cccccCChhhcCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCccccCCCceEEeccccccCCCCCCcHHH Confidence 4444332 34455665555555566533 44444444433333333333322211 1 111110 123345667 Q ss_pred HHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecc-ccc--------- Q lcl|Aclame:pro 144 IASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNN-LRD--------- 213 (273) Q Consensus 144 i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~-l~~--------- 213 (273) |.+++..+.+... .....++++++.+..|+..+.........+ ...+..-.+.+++|++.+.... ... T Consensus 156 i~~~~~ai~~~~g-~~Pn~~vlg~~a~~al~~hp~i~e~lk~~~-~g~it~~~la~ll~v~~i~vg~a~~~~~~~~~~~i 233 (307) T protein:vir:10 156 IEDGKEAIRTKIG-RRPNTMVIGASAYKTLKAHPQLIEKIKYSM-KGIVTVDLLKEIFEVENIAVGEAIYADDKDRFTDI 233 (307) T ss_pred HHHHHHHHHhhhC-CccceEEeCHHHHHHHhcCHHHHHHhCCcc-ccccCHHHHHHHhCceeEEEeeeeeeccCCcccee Confidence 8888777765432 233489999999999999987666554332 2222222345678876665321 111 Q ss_pred CCCcEEEEEeCc------------eEEEEEec--ceeeeccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 214 TDDEQFVAFHPS------------AAAYVSQI--DTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 214 ~~~~~~~~~~~~------------a~~~~~~~--~~ve~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ..+...+++.+. .+|+--|. ..+...+.+...++.|+....+--.++-|++=..|+-+.- T Consensus 234 w~~~~vl~yv~~~~~~~~~~~~epsfGyT~~~~g~~~~d~~~~~~~~~~~r~~~~~~~~i~~~~~G~li~~~~~ 307 (307) T protein:vir:10 234 WGANIVLAYVPLQRGGQQRTPYEPSYGYTLRKKGNPVVDTRIEDGKLELVRSTDIFRPYLLGADAGYLISGING 307 (307) T ss_pred CCCceEEEecccccCCCCCcccccccceeEEEcCCeEeeceecCCceeEEeccccccceeecccccceeccCCC Confidence 111222222111 23333221 1222223344556666665555555555553333322222 No 202 >protein:vir:78148 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:4955 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294802;genbank:gi:149882823;genbank:GeneID:5309176 Probab=82.48 E-value=0.032 Score=28.80 Aligned_cols=106 Identities=16% Similarity=0.098 Sum_probs=65.4 Q ss_pred EECHHHHHHHhcchHHhhhhhcccccceeeeeeee-eecceEEEEecccccCCCcEEEEEeCc-----------eEEEE- Q lcl|Aclame:pro 164 VVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIG-NLLGARIVESNNLRDTDDEQFVAFHPS-----------AAAYV- 230 (273) Q Consensus 164 vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig-~i~G~~i~~s~~l~~~~~~~~~~~~~~-----------a~~~~- 230 (273) +++--+++.++++... ..+--.-+.+.+.+|.+. +.+|..|+.++++|.+.. .++... +-+|+ T Consensus 1 vvsdlqfA~~~g~~v~-~~aLpRE~aNp~ltG~lpV~~~GltWl~tpnlpg~~a---~vlDst~lGgmaDE~l~~Pgya~ 76 (123) T protein:vir:78 1 MLSGAQFAKLIGILVD-DKALPREQANIVLTGSLPVSAYGLTWVTSRHITGTDP---WLFDVEQLGGMADEKLLSPEFAP 76 (123) T ss_pred CcchhhHHHHhcchhc-ccccccccCCceEecCcceeeeceeeeecCCCCCCcc---ceeehhhhccccccccCCCcccC Confidence 5555557777665321 111111124567778775 699999999999995442 111111 11222 Q ss_pred Eec--ceeeeccCCC--cceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 231 SQI--DTVEALRDQD--SFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 231 ~~~--~~ve~~~~~~--~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) .+. .+++..|..+ .-++.++++-..-.-+++|.+.+.|+-.|- T Consensus 77 ~~~~Gvevkt~Red~~~nD~yriRaRRvTvpiv~EP~Agv~ltg~g~ 123 (123) T protein:vir:78 77 AGNTGVEASTERAHQGVKDGYLVRGRRNTVAVVTEPMAGVRLTGTGL 123 (123) T ss_pred CCCcceeEEeeccccCCCCceEEeeeecceeEEecCccceEEeeecC Confidence 111 2445555444 446888998888889999999999988888 No 203 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=80.08 E-value=0.098 Score=26.11 Aligned_cols=257 Identities=13% Similarity=0.037 Sum_probs=115.4 Q ss_pred Cccc------------------------chhHHHH---HHHHHHHHHHhhccchhhhccccccccCC-cEEEEEeccccc Q lcl|Aclame:pro 1 MAFN------------------------NFIPELW---SDMLLEEWTAQTVFANLVNREYEGIASKG-NVVHIAGVVAPT 52 (273) Q Consensus 1 MA~~------------------------~~~pev~---~~~v~~~l~~~~v~~~~~~~d~~~~~~~G-dtv~ip~~~~~~ 52 (273) ||-+ .|..+.| -..+.+.....+....++.... .+..+ +++.++.....+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~d~~~~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~~--~~~~~~et~~~~~~e~~G 78 (314) T protein:vir:10 1 MAIKFDAEQAKITTHLEQMGVEKADAAGIWAVSQLTAALNRAYEKEYAENSVVNIFPVTN--EIPGHAKYFEYPEFDGVG 78 (314) T ss_pred CccchHHHHHHHHHHHHhhcccchhhhHHHHHHHHHHHHHHHhhhhccccccceeecccc--CCCCceeEEEeeeecccc Confidence 2211 1111111 1122222222233333332211 22223 477777776665 Q ss_pred cccc-cCCCCccCCcccccceEEEEEEeeeeceeEechHHHHHh--HH-HHH-HHHHHHHHHHHHHHHHHHH-------- Q lcl|Aclame:pro 53 VKDY-KAAGRQTSADAISDTGVDLLIDQEKSIDFLVDDIDRVQV--AG-SLE-AYTRAGATALATDTDKFIA-------- 119 (273) Q Consensus 53 ~~d~-~~~~~~~~~~~~~~~~~~~tid~~~~~~~~i~d~d~~~~--~~-~~~-~~~~~~~~ala~~iD~~~~-------- 119 (273) .+.. ...+..+..-+..-+.....+.. .+.++.++..|.... .+ +++ +....++.+++...|+-+| T Consensus 79 ~a~~~~d~~~dip~vd~~~~~~~~~i~~-~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~g~ 157 (314) T protein:vir:10 79 IAQIIADYSDDLPLVDAFMTEKQGKVFR-FGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDKLVWSGSAPHGI 157 (314) T ss_pred ceeeeCCcccccceeecccceeEEEEEE-EEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccc Confidence 5443 23233344445555666666643 356667665554433 22 343 3445566677777766443 Q ss_pred -HHHHhhc-c-cccccccCCHhHHHHHHHHHHHHHhhc--CCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeee Q lcl|Aclame:pro 120 -DMLVDNG-T-ALTGSAPSDADDAFDLIASALKELTKA--NVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRA 194 (273) Q Consensus 120 -~~~~~~~-~-~~~~~~~~~~~~~~~~i~~a~~~l~~~--~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~ 194 (273) +++.... + ....+.=.++..++++|.++...+.++ ++- ..-.|+++|+.+..|..- ..+.+...-+-..++ T Consensus 158 ~GLlN~p~v~~~~~~~~WaT~~ei~~Di~~~~~~l~~~s~g~~-~p~~l~Lpp~~~~~L~~~---~~~~~~tvl~~l~~n 233 (314) T protein:vir:10 158 VSVFDQPNINNVVATPNWSVPQNAIDDVTAMIDAVESSTQGLH-HVTDILLPASARRVMQGL---VPQTNLSYGELFTRN 233 (314) T ss_pred eeEeecCCCccccCCCCcccHHHHHHHHHHHHHHHHHhcCccc-cceeEEecHHHHHhhccc---ccCCCccHHHHHHHh Confidence 1111100 0 011112246778899999999998764 321 123689999999866321 001010000000111 Q ss_pred eeeeeecceEEEEecccccCCCc---EEEEEeCc--eEEEEE--ecceeeeccCCCcceeeEEeeeee-eeEEEcCceEE Q lcl|Aclame:pro 195 GTIGNLLGARIVESNNLRDTDDE---QFVAFHPS--AAAYVS--QIDTVEALRDQDSFSDRIRALHVY-GGKVVRPTGVV 266 (273) Q Consensus 195 G~ig~i~G~~i~~s~~l~~~~~~---~~~~~~~~--a~~~~~--~~~~ve~~~~~~~~~~~v~~~~~~-g~~vl~p~~~v 266 (273) .-+.+|...+.+...+.. ..+++..+ -+.+.. .+..... ........+.....+ |+-+.+|.+++ T Consensus 234 -----~~~l~I~~~~el~~ag~~g~~~~v~y~~~~~~~~~~vp~~~~~l~~--e~~~~~~~~~~~~r~~Gv~i~~P~ai~ 306 (314) T protein:vir:10 234 -----NPGLTIRFLQFLDNYDGAGGKAALAFEKSPLNMSIEIPEVTNVLPA--QPKDLHFRYPVTSKATGLIVYRPLTMA 306 (314) T ss_pred -----CCCcEEEEcccccccCCCcceEEEEEecCCcEEEEecCccceeecc--eecCceEEEcceeeeEEEEEECcceeE Confidence 124556665555432221 12333222 122111 1111111 112234566555665 58888999998 Q ss_pred EE---ecC Q lcl|Aclame:pro 267 VF---NKT 271 (273) Q Consensus 267 ~~---~~~ 271 (273) .+ +=+ T Consensus 307 ~~dGI~~~ 314 (314) T protein:vir:10 307 VIKGITFA 314 (314) T ss_pred eeeeeecC Confidence 54 333 No 204 >protein:vir:15 Length: 472 # NCBI annotation: major head protein # Family: family:all:4054 # MgeID: mge:323 # MgeName: GA-1 # Cross-refs: genbank:acc:NP_073691;swissprot:sw:q9fzw7;genbank:gi:12248115;uniprot:Q9FZW7;genbank:GeneID:919909 Probab=78.22 E-value=0.12 Score=25.70 Aligned_cols=257 Identities=12% Similarity=0.062 Sum_probs=116.5 Q ss_pred CcccchhHHHHHHHHHHHHHHhhccchhhhccccccc-----cCCcEEEEEeccccccccccCCC---CccCCcccccce Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIA-----SKGNVVHIAGVVAPTVKDYKAAG---RQTSADAISDTG 72 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~-----~~Gdtv~ip~~~~~~~~d~~~~~---~~~~~~~~~~~~ 72 (273) ||++.+..|.+.+-+.+ ....++ ..+.+++.-..+ .-|++|.=--.....-.-|.++. .+.....++-.. T Consensus 52 ~~d~~~QnEf~~sLv~R-Igst~V-~~~s~~NPLa~Fk~~~~~fG~~Ieei~~D~a~~~~yd~~k~Ev~pFk~~~P~IkA 129 (472) T protein:vir:15 52 LADKTLQNDFIHTLVDR-IGLVVV-HHKLMQNPLKIFKKGTLEYGRKIEEIFTDLTREHVYDPEKAETEVFKREIPNVKT 129 (472) T ss_pred hhhhhhHHHHHHHHHhh-hcchhh-hhhhccChHHHHhhcCccchhhhhhhhcccccccccchhhhhccccccCCCccee Confidence 77776665555443322 221111 122222221111 12555432222221111222211 122233344444 Q ss_pred EEEEEEeeeeceeEechHHHHH---hHHHHHHHHHHHHHHHH--HHHHHHHH-HHHHhhcc-----c---ccccccCCHh Q lcl|Aclame:pro 73 VDLLIDQEKSIDFLVDDIDRVQ---VAGSLEAYTRAGATALA--TDTDKFIA-DMLVDNGT-----A---LTGSAPSDAD 138 (273) Q Consensus 73 ~~~tid~~~~~~~~i~d~d~~~---~~~~~~~~~~~~~~ala--~~iD~~~~-~~~~~~~~-----~---~~~~~~~~~~ 138 (273) .-.+.+........|++..... +...+++++++...++. ..+|.+.. ..+-.... . .......+.. T Consensus 130 ~~H~~nR~~~y~~Ti~~d~i~~AF~S~~gld~fi~~i~~si~sSde~dEY~~~k~li~~~~~k~lf~v~~i~~d~~~~~v 209 (472) T protein:vir:15 130 LFHERDRQVFYKQTISDQQLKTAFTNAQKFDEFLSTIVTSIYNSAEVDEFRYTKLLIDNYFSKNLFKIVPVSVDPATGIV 209 (472) T ss_pred EEeeccccceeeeeeeHHHHHHhhcChhhHHHHHHHHHHHHhccccHHHHHHHHHHHHHhhhccceEEEecCCCcccccc Confidence 4455555555555666544332 23446778888777665 35665432 11111110 1 1111222222 Q ss_pred HHHHHHHHHHHHHhhcCCCc----------------CCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecc Q lcl|Aclame:pro 139 DAFDLIASALKELTKANVPN----------------VGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLG 202 (273) Q Consensus 139 ~~~~~i~~a~~~l~~~~vp~----------------~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G 202 (273) ...+-+...+..-.+..+|. ++.+++++|++-..| +.+++.++-. -+ .+-.-+.+-.+.| T Consensus 210 ~~kd~~K~lr~aa~kM~lP~gT~~yN~~gv~~~td~~DL~lI~~~dtq~ev--dv~~LA~AFN-~d-~vd~~~~~i~Vd~ 285 (472) T protein:vir:15 210 NTKEFLAKTRATATKMTLPMGTRDFNSMAVHTRTDMDDLYIIMDADTQAEV--DVNELASAFN-LN-KADFIGRRILIDG 285 (472) T ss_pred cHHHHHHHHHHHHHHhcCCCCCCCCCccccceeccceeeeEEeCCCceEee--cHHHHHHHhC-cc-hhhcCceeEEecc Confidence 22233333444445556662 345888898877666 2323332211 11 1111233445666 Q ss_pred eEEEEecccccCCCcEEEEEeCceEEEEEecceeeeccCCCcce-------------eeEEeeeeeeeEEEcCc---eEE Q lcl|Aclame:pro 203 ARIVESNNLRDTDDEQFVAFHPSAAAYVSQIDTVEALRDQDSFS-------------DRIRALHVYGGKVVRPT---GVV 266 (273) Q Consensus 203 ~~i~~s~~l~~~~~~~~~~~~~~a~~~~~~~~~ve~~~~~~~~~-------------~~v~~~~~~g~~vl~p~---~~v 266 (273) | + .++..+++..+.++..-.+..+.|..|++..-- ..+.....||++..=|. .++ T Consensus 286 F--------a-~~d~~a~l~sk~~f~i~D~l~~m~s~rnprgL~~Ny~lHv~q~~s~s~F~naiaF~~g~~v~~~~~~~i 356 (472) T protein:vir:15 286 F--------A-STGLKAVMVDKDFFMLYDQVFRMESQRNAQGMYWNYYLHVWQVLSTSRFANAVAFVDSALIDGDVSQVI 356 (472) T ss_pred c--------C-CCCceeeeehhhHHHHHHHHHhcccccCcccchhHHHHHHHHHHHhccccceEEEeccccCCCccceEE Confidence 5 2 233446777777777766777888888877422 33444566777664443 333 Q ss_pred EEecCCC Q lcl|Aclame:pro 267 VFNKTGS 273 (273) Q Consensus 267 ~~~~~~s 273 (273) + ..+.+ T Consensus 357 v-~p~~~ 362 (472) T protein:vir:15 357 V-TPTVG 362 (472) T ss_pred E-eeccc Confidence 3 44433 No 205 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=73.28 E-value=0.17 Score=24.77 Aligned_cols=255 Identities=10% Similarity=0.012 Sum_probs=114.4 Q ss_pred CcccchhHHHHHHHHHHHHHH----hhccchhhhccccccccCC-cEEEEEeccccccccccCCCCccCCcccccceEEE Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWTA----QTVFANLVNREYEGIASKG-NVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDL 75 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~~----~~v~~~~~~~d~~~~~~~G-dtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 75 (273) ++|.- +|..++.-+...+.+ ......++-.+. .+.=. +++.++.....+.+...+.+.+....+...+..+. T Consensus 45 ~~~~~-~~~~l~~~i~p~~~~~~~~~~~~~~l~pv~t--~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~~d~~~~~~~~ 121 (336) T protein:vir:36 45 TGSSG-IPNYLTTYVDPSVIDILVAPMKAAELVGESK--KGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQR 121 (336) T ss_pred CCCcc-hHHHHHHhhccceEeeecchhhhhhhccccc--cCCccceeEEEeeeeceeeEEEeeccCCCceeecccceeee Confidence 11111 244444322222211 111222222111 11112 46777777665554433433344334445555555 Q ss_pred EEEeeeeceeEechHHHHHhH---HHH-HHHHHHHHHHHHHHHHHHHH---------HHHHhhccc---ccc----cccC Q lcl|Aclame:pro 76 LIDQEKSIDFLVDDIDRVQVA---GSL-EAYTRAGATALATDTDKFIA---------DMLVDNGTA---LTG----SAPS 135 (273) Q Consensus 76 tid~~~~~~~~i~d~d~~~~~---~~~-~~~~~~~~~ala~~iD~~~~---------~~~~~~~~~---~~~----~~~~ 135 (273) ++-. ...++.+...|..... .++ ....+.+++++.+++++..+ ..+. .++. .+. .... T Consensus 122 ~v~~-~~~g~~yg~~E~~~Aa~~~~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~~~~yGllN-dP~l~a~~t~~t~~~~~~ 199 (336) T protein:vir:36 122 QSYF-FQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLIN-DPSLSAPITATTPWSGSP 199 (336) T ss_pred eEEE-EEeeeeeCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCcEEEEeccccceEEEEe-cCCCccccccCCCccccc Confidence 6533 3556777766554332 233 33444556677777765322 1111 1111 011 1223 Q ss_pred CHhHHHHHHHHHHHHHhhcC--C-C-cCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEeccc Q lcl|Aclame:pro 136 DADDAFDLIASALKELTKAN--V-P-NVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNL 211 (273) Q Consensus 136 ~~~~~~~~i~~a~~~l~~~~--v-p-~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l 211 (273) ++..++++|..+...+..+. . . ...-.|+++|..+..|..- +. .+ -.+ .+-.-.++=+++|+..+.+ T Consensus 200 t~~ei~~Di~~~~~~l~~qt~G~i~~~~~~tL~LP~~~~~~Ls~~-n~------~g-~Tv-l~~lk~n~Pnl~i~t~pEl 270 (336) T protein:vir:36 200 AVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKT-NQ------YG-LAA-AAKLKDIFPKLEFVTIPEY 270 (336) T ss_pred CHHHHHHHHHHHHHHHHHhcCCeeeeccccEEEechHHHHhccCC-Cc------cC-ccH-HHHHHHhcCccEEEEcccc Confidence 55778999998888886633 1 1 1234789999998887432 11 11 000 0000011334567776666 Q ss_pred ccCCCcEEEEEeCc-------eEEEEEecceeeeccCCCcceeeEEeeee-eeeEEEcCceEEEEecC Q lcl|Aclame:pro 212 RDTDDEQFVAFHPS-------AAAYVSQIDTVEALRDQDSFSDRIRALHV-YGGKVVRPTGVVVFNKT 271 (273) Q Consensus 212 ~~~~~~~~~~~~~~-------a~~~~~~~~~ve~~~~~~~~~~~v~~~~~-~g~~vl~p~~~v~~~~~ 271 (273) ...++..+..+-+. -+++..++......+... .+.+....+ .|+-+.+|-+++.+.-= T Consensus 271 ~~a~g~~~~l~~~~~~~~~t~~~~~p~~~~~l~vq~~~~--~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:36 271 DTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSS--YFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred ccCCCceEEEEEEecCCCcceeeecchhhhccceeecCc--eeEeccccceeeeeeeccchheeeecC Confidence 54444433333221 112222222111122222 344555444 45677788887765444 No 206 >protein:vir:4786 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:3269 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150166;swissprot:trembl:q94m45;genbank:gi:15088777;uniprot:Q94M45;genbank:GeneID:955980 Probab=72.12 E-value=0.19 Score=24.58 Aligned_cols=245 Identities=15% Similarity=0.152 Sum_probs=120.9 Q ss_pred Cc-ccc----hhHHHHHHHHHHHHHHhhccchhhhcccc-ccccC-CcEEEEEeccc--cccccccC------CCCccCC Q lcl|Aclame:pro 1 MA-FNN----FIPELWSDMLLEEWTAQTVFANLVNREYE-GIASK-GNVVHIAGVVA--PTVKDYKA------AGRQTSA 65 (273) Q Consensus 1 MA-~~~----~~pev~~~~v~~~l~~~~v~~~~~~~d~~-~~~~~-Gdtv~ip~~~~--~~~~d~~~------~~~~~~~ 65 (273) |+ |+. .+.+.|++-+.+.|.++.+|.+.+.- .+ ..+.+ .++.---+... +.+.+|.. .|. .+. T Consensus 1 mp~N~n~avr~Y~Kqf~glL~~vf~~qa~F~~~FGg-lQalDGV~~N~tafsvKt~D~pVVig~Y~TdeNvagFGt-GTg 78 (295) T protein:vir:47 1 MPSNQNNAVRRYEKQYAGILETVFGVRAAFSNALAP-IQILDGVQENSKAFSVKTNNTPVVIGEYKTGENDGGFGD-NSG 78 (295) T ss_pred CCCCCCccchhhhHHHHHHHHHHHhHHHHHhhhhcc-hhhhhCCCccceEEEEeecCcceEeecccCCCccccccc-CCc Confidence 87 332 35788889899999999988776532 11 12222 22221111111 11223431 111 111 Q ss_pred cccccceEE--EEEEee-e-eceeEec-hHHHHHhHHHHHH----HHHHHHHHHHHHHHHHHHHHHHhhcccccccccCC Q lcl|Aclame:pro 66 DAISDTGVD--LLIDQE-K-SIDFLVD-DIDRVQVAGSLEA----YTRAGATALATDTDKFIADMLVDNGTALTGSAPSD 136 (273) Q Consensus 66 ~~~~~~~~~--~tid~~-~-~~~~~i~-d~d~~~~~~~~~~----~~~~~~~ala~~iD~~~~~~~~~~~~~~~~~~~~~ 136 (273) .....+..+ +-.|.. + .+++.|- -+|..-.+.++++ .++.++.+-++.+|..+-+.+.+.+......+..+ T Consensus 79 ~SsRFG~rkEi~y~dtdV~Y~~~~~iHEGiD~~TVNnd~~aaVAdRL~LQA~Akt~~~n~~~Gk~ls~~A~~te~~td~t 158 (295) T protein:vir:47 79 AQSRFGGVTEVKYENTDVNYDYTLTIHEGLDRYTVNNDLNAAVADRLKLQSEAQTRTVNKRIGKYLSDTATKTEALADFT 158 (295) T ss_pred cccccCceeeEEeecccccccccchhhhccccccccCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccc Confidence 111111111 111111 1 1333332 2344434444433 34557777788898877777777666655556667 Q ss_pred HhHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEecccccCCC Q lcl|Aclame:pro 137 ADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD 216 (273) Q Consensus 137 ~~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l~~~~~ 216 (273) .+.+...|.++...+-...|... .-.+|+|+.|..|...+- .......+. .+-+--|-++-||.+.+.+.-....+ T Consensus 159 ~d~V~~LF~~as~~yvn~ev~~~-~~AyV~~evYnaiiD~~l--~TsaK~Ssa-NiDengi~~FkGf~i~e~P~~~~q~G 234 (295) T protein:vir:47 159 DDKVKALFNKLSAFYTNNEVTAP-ITVYLRSEFYNAIVDMAS--VTSAKGATI-SLDENGLPKYKGFTLEETPAQYFETG 234 (295) T ss_pred chhHHHHHHHHHHHhhhhheeee-eEEEEchhHHHHHhcccc--cccccccee-eeccCCcceecceEEEeccHhhccCC Confidence 77777778888888877777433 348999999999986542 222222222 23334456899999998876555444 Q ss_pred cEEEEEeCceEEEE------EecceeeeccCCC----------------c-c---eeeEEee Q lcl|Aclame:pro 217 EQFVAFHPSAAAYV------SQIDTVEALRDQD----------------S-F---SDRIRAL 252 (273) Q Consensus 217 ~~~~~~~~~a~~~~------~~~~~ve~~~~~~----------------~-~---~~~v~~~ 252 (273) .. ..+.++.+|-+ .|..+.|.+.... + | ..+.+-+ T Consensus 235 ~~-aifs~dnig~aftGIn~aR~IesEdF~GValQ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 295 (295) T protein:vir:47 235 VI-AIFSPNGIIIPFVGISTARVIEAENFDGVNCKLLLRVVLTLLMTIRKQFTKLQELLYRR 295 (295) T ss_pred cE-EEEccccceeecccceeeeeeecccccchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 32 23334433321 1111111110000 0 0 0000000 No 207 >protein:vir:5942 Length: 523 # NCBI annotation: similar to major head protein # Family: family:all:364 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835728;genbank:gi:30044131 Probab=56.07 E-value=0.46 Score=22.41 Aligned_cols=261 Identities=14% Similarity=0.102 Sum_probs=104.2 Q ss_pred CcccchhHHHHHHH----------HHHHHHHhhccchhhhccccccccCCc--------EEEEEeccccccccccCCCCc Q lcl|Aclame:pro 1 MAFNNFIPELWSDM----------LLEEWTAQTVFANLVNREYEGIASKGN--------VVHIAGVVAPTVKDYKAAGRQ 62 (273) Q Consensus 1 MA~~~~~pev~~~~----------v~~~l~~~~v~~~~~~~d~~~~~~~Gd--------tv~ip~~~~~~~~d~~~~~~~ 62 (273) |+.. .+..+.-. ..+.+........ ..+... ...|. ...+..-......+-...... T Consensus 193 itg~--tga~fa~s~~~an~astAss~Al~gEA~t~~--sTd~at-~~~Gtt~t~~~~~lyt~~~g~~t~~~~~~~~~~~ 267 (523) T protein:vir:59 193 ASGD--PENTVAYPLPRYNRIVGAVGSALYARLFFVT--GSDFAT-VAGGTPSTQDLDLVYYIDARNDFEDQSTDPDYPD 267 (523) T ss_pred cccc--ccccccchhhccccccccccccccccccccc--cccccc-cCCCcccccccccccccccccchhhccccccccc Confidence 1110 00000000 0001110000000 000000 00000 000100000111110000000 Q ss_pred cCCcccccceEEEEEEeeee----cee--Eech---HHHHHhH-H-HHHHHHH-HHHHHHHHHHHHHHHHHHHhhcccc- Q lcl|Aclame:pro 63 TSADAISDTGVDLLIDQEKS----IDF--LVDD---IDRVQVA-G-SLEAYTR-AGATALATDTDKFIADMLVDNGTAL- 129 (273) Q Consensus 63 ~~~~~~~~~~~~~tid~~~~----~~~--~i~d---~d~~~~~-~-~~~~~~~-~~~~ala~~iD~~~~~~~~~~~~~~- 129 (273) .........+..++|+|... -.. ..+- -|+.-.| + |.+..+. =+...|+.+|+++++..+...+... T Consensus 268 ~~~~~~~~~eM~FsIeK~tVtAkSRaLKAeYT~ELAQDLKAiH~GLDAE~ELanILStEImlEINR~ii~~~~~~a~~~~ 347 (523) T protein:vir:59 268 PGFQSLDIPEINLELRSRPVATKTRKLRAAWTPEAMQDLAAYHKGVDLENEIVTLMSQYIAREIDLEILSTIMAHARRTD 347 (523) T ss_pred cccccccccceeeEEEeEEEeeecccccccccHHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhhhheeee Confidence 00112334456666665421 111 1111 1222344 3 3444333 3667789999999999887654321 Q ss_pred -c-----c----cccCCHh--------HHHHHHHHHHHHHhhcC--C-----CcCCcEEEECHHHHHHHhcchHHhhhhh Q lcl|Aclame:pro 130 -T-----G----SAPSDAD--------DAFDLIASALKELTKAN--V-----PNVGRVVVVNAEMAFWLRSSGSKLTSAD 184 (273) Q Consensus 130 -~-----~----~~~~~~~--------~~~~~i~~a~~~l~~~~--v-----p~~~r~lvv~p~~~~~L~~~~~~~~~~~ 184 (273) . + ....++. ...+++.....++++.. + -..+-+++++|++...|-.++- +... T Consensus 348 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~n~i~~~t~~~~~~~~~~s~~v~~~l~~~~~-~~~~- 425 (523) T protein:vir:59 348 NYGFWSEVVGEYYDETSGNFVAGNFYGSKQEWLATLMIELNKVSNRIQQKTAVAGANFLVTSPQVAALLESMPG-FTPG- 425 (523) T ss_pred eccccccceeeecccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEchhHHHHHHhccc-cccC- Confidence 1 0 0111110 11233333222222110 1 1134589999999999876653 3211 Q ss_pred cccccceeeeee--eeeec-ceEEEEecccccCCCcEEEEEeCceEE-------EE--EecceeeeccCCCcceeeEEee Q lcl|Aclame:pro 185 TSGDAAGLRAGT--IGNLL-GARIVESNNLRDTDDEQFVAFHPSAAA-------YV--SQIDTVEALRDQDSFSDRIRAL 252 (273) Q Consensus 185 ~~~~~~~~~~G~--ig~i~-G~~i~~s~~l~~~~~~~~~~~~~~a~~-------~~--~~~~~ve~~~~~~~~~~~v~~~ 252 (273) ++.....+|. .|.+. |+.||..+..+. ..++++.++..+ |+ .-+..+....|+..|.-.+-.+ T Consensus 426 --~~~~~~~~~~~~~g~l~~~~~vy~d~~~~~---dy~~~g~k~~~~~~~~~~~y~Py~~l~~~~~~~dp~s~qp~~~~~ 500 (523) T protein:vir:59 426 --NDNRDGGTGIFYVGMVQGRYRLYKNIYQNQ---PVIIMGNQDLNTPWQTGAVYAPYVPLLFTPTIVDPVNFSYRRGLM 500 (523) T ss_pred --CccccccccceeEEEecCceEEEecCCCCc---ceEEEEecccCCcccccceecccchhhcccccccCCcccceeeee Confidence 1111111222 34443 468888875432 345555554332 11 0011123345888899999999 Q ss_pred eeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 253 HVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 253 ~~~g~~vl~p~~~v~~~~~~s 273 (273) .+||..|.+|.....|.-.-= T Consensus 501 tRY~l~v~nP~~~~~~~~~~~ 521 (523) T protein:vir:59 501 TRYALEVVRPEFYGLLYVKLL 521 (523) T ss_pred eehhheecchhHhhhhhhhhc Confidence 999999989986654421111 No 208 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=49.15 E-value=0.65 Score=21.62 Aligned_cols=255 Identities=10% Similarity=0.002 Sum_probs=114.7 Q ss_pred CcccchhHHHHHHHHHHHHHHh----hccchhhhccccccccCC-cEEEEEeccccccccccCCCCccCCcccccceEEE Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWTAQ----TVFANLVNREYEGIASKG-NVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDL 75 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~~~----~v~~~~~~~d~~~~~~~G-dtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 75 (273) ++| .-+|..++.-+...+.+- .....++-.+. .+.=. +++.++.....+.+...+.+.+....+...+..+. T Consensus 45 ~~~-~~i~~~l~~~i~p~~~~~~~~p~~a~~l~pv~t--~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~~d~~~~~~~~ 121 (336) T protein:vir:10 45 TGS-SGIPNYLTTYVDPAVIDILVAPMKAAELVGESK--KGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQR 121 (336) T ss_pred CCC-chhHHHHHhhcccceeeehhhhhhhhhhccccc--cCCccceeEEEeeeeceeeEEEeeccCCCceeecccceeee Confidence 212 123443333222222221 11122222111 11112 46777777665554433434444334545555555 Q ss_pred EEEeeeeceeEechHHHHHhH---HHH-HHHHHHHHHHHHHHHHHHHH---------HHHHhhccc---ccc----cccC Q lcl|Aclame:pro 76 LIDQEKSIDFLVDDIDRVQVA---GSL-EAYTRAGATALATDTDKFIA---------DMLVDNGTA---LTG----SAPS 135 (273) Q Consensus 76 tid~~~~~~~~i~d~d~~~~~---~~~-~~~~~~~~~ala~~iD~~~~---------~~~~~~~~~---~~~----~~~~ 135 (273) ++-. ...++.+...|+.... .++ ....+.+++++.+++++..+ ..+. .++. .+. .... T Consensus 122 ~v~~-~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~~~~yGllN-~P~l~a~~t~~t~~~~~~ 199 (336) T protein:vir:10 122 QSYF-FQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLIN-DPSLSAPITATTPWSGSP 199 (336) T ss_pred eEEE-EEeeeeeCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCcEEEEeccccceEEEEe-CCCCccccccCCCccccc Confidence 6533 3556777766654332 233 23444566677777765322 1111 1111 111 1223 Q ss_pred CHhHHHHHHHHHHHHHhhcC--C-C-cCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEeccc Q lcl|Aclame:pro 136 DADDAFDLIASALKELTKAN--V-P-NVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNL 211 (273) Q Consensus 136 ~~~~~~~~i~~a~~~l~~~~--v-p-~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l 211 (273) ++..++++|.++...|..+. + . ...-.|+++|..+..|..- +. .+ -.+ .+-.-.++=+++|+..+.+ T Consensus 200 t~eei~~Di~~~~~~l~~qs~G~i~~~~~~tL~LP~~~~~~Ls~~-n~------~g-~Tv-l~~lk~n~Pnl~i~t~pEl 270 (336) T protein:vir:10 200 AVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKT-NQ------YG-LAA-AAKLKDIFPKLEFVTIPEY 270 (336) T ss_pred CHHHHHHHHHHHHHHHHHhcCCeecccCcceEEecHHHHHhccCC-Cc------cC-ccH-HHHHHHhcCccEEEEcccc Confidence 55778999998888886633 1 2 1234789999998887421 11 11 000 0000012334567777666 Q ss_pred ccCCCcEEEEEeCc-------eEEEEEecceeeeccCCCcceeeEEeeee-eeeEEEcCceEEEEecC Q lcl|Aclame:pro 212 RDTDDEQFVAFHPS-------AAAYVSQIDTVEALRDQDSFSDRIRALHV-YGGKVVRPTGVVVFNKT 271 (273) Q Consensus 212 ~~~~~~~~~~~~~~-------a~~~~~~~~~ve~~~~~~~~~~~v~~~~~-~g~~vl~p~~~v~~~~~ 271 (273) ...++..+..+-+. -+++..++......+... .+.+....+ .|+-+.+|-+++.+.-= T Consensus 271 ~~a~G~~~~l~~~~~~~~~t~~~~~p~~~~~l~vq~~~~--~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:10 271 DTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSS--YFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred ccCCCceEEEEEEecCCCcceeeecchhhhccceeecCc--eeEeccccceeeeeeeccchheeeecC Confidence 54444433333221 112222222111122222 344555444 45677788887765444 No 209 >protein:vir:99888 Length: 309 # NCBI annotation: capsid protein # Family: family:all:908 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164075;genbank:gi:56692607;genbank:GeneID:3192616 Probab=48.68 E-value=0.66 Score=21.57 Aligned_cols=262 Identities=12% Similarity=0.070 Sum_probs=102.8 Q ss_pred CcccchhHHHHHHHHHHHHHHh-hccchhhhccccccccCCcEEEEEeccc---cccccc-cCCCCccCCcccccceEEE Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWTAQ-TVFANLVNREYEGIASKGNVVHIAGVVA---PTVKDY-KAAGRQTSADAISDTGVDL 75 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~~~-~v~~~~~~~d~~~~~~~Gdtv~ip~~~~---~~~~d~-~~~~~~~~~~~~~~~~~~~ 75 (273) |||..+.+.-....+...+++. .+-..++-+ ...+..+.++|.++. +...+. .+.++....=+.+.+..++ T Consensus 1 ~~~~~~~~dp~LT~~A~gy~n~~~Ia~~l~P~----vpV~~~~~~~~~f~~~e~F~~~~t~r~~~~~~~~v~~~~~~~~~ 76 (309) T protein:vir:99 1 MSNAPFPIDPELTAIAIAYRNGRMISDEVLPR----VPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDETG 76 (309) T ss_pred CCCCCcCcCHhHHHHHhhccChhhhhhhcCCc----cccCccccceeeechhhcccccchhhccCCCcceEeecccCcee Confidence 9998766553333333334332 222222211 111122334444433 222221 1222221111334455666 Q ss_pred EEEeeeeceeEechHHHHH--hHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhccccc----ccccC-----CHhHHHHH Q lcl|Aclame:pro 76 LIDQEKSIDFLVDDIDRVQ--VAGSLEA-YTRAGATALATDTDKFIADMLVDNGTALT----GSAPS-----DADDAFDL 143 (273) Q Consensus 76 tid~~~~~~~~i~d~d~~~--~~~~~~~-~~~~~~~ala~~iD~~~~~~~~~~~~~~~----~~~~~-----~~~~~~~~ 143 (273) .+..+ .....|+..+..+ ..++.++ ..+.....|....+.....++...++..+ .-+++ ...+.+.. T Consensus 77 ~~~~~-~L~~~i~~~~~~~a~~~~d~~~~Av~~l~~~i~l~rE~~~A~lv~~~a~y~~~~k~~Lsgt~~wsd~~SDPi~~ 155 (309) T protein:vir:99 77 STEDH-GLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSNPLPV 155 (309) T ss_pred eeccc-ceeecCCchhhhhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcChhhcCCCceEEecCccccCCCCCCcHHH Confidence 66443 4445555555333 3355433 34444444333333322333222221110 00111 12344556 Q ss_pred HHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhcccccc-eeeeeeeeeecceE-EEEeccccc-----CCC Q lcl|Aclame:pro 144 IASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAA-GLRAGTIGNLLGAR-IVESNNLRD-----TDD 216 (273) Q Consensus 144 i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~-~~~~G~ig~i~G~~-i~~s~~l~~-----~~~ 216 (273) |.+++..+ +. ..-.++++.+.|..|+..+.........+.+. .+..-.+.+++|++ |+....... .++ T Consensus 156 i~~~~~~~---g~--~PN~~vlg~~~~~~l~~hp~i~~~ik~~~~~~g~it~~~la~l~~ve~V~vg~a~~n~a~~g~~~ 230 (309) T protein:vir:99 156 ITDALDSV---IL--RPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNIARPGQNP 230 (309) T ss_pred HHHHHHhh---CC--CcceEEechHHHHHHhhCHHHHHHhcCCCccccccCHHHHHHHhCcceEEeecceeecccccccc Confidence 66665544 22 22479999999999999887665554433222 22223346788884 554322221 111 Q ss_pred cEEEEEeCc-eEEEEEecc-eee--------------------eccCCCcceeeEEeeeeeeeEEEcCceEEEEecCCC Q lcl|Aclame:pro 217 EQFVAFHPS-AAAYVSQID-TVE--------------------ALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 217 ~~~~~~~~~-a~~~~~~~~-~ve--------------------~~~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~~~~s 273 (273) ...-+|-++ ++.+..... ..+ ....... +..|+...++--.+.-+++=..|.-+.| T Consensus 231 ~~~~iwg~~~~L~y~~~~~~~~~~ps~G~t~~~~~r~~g~~~d~~~~~~g-~~~vr~~~~~k~~i~~~d~G~li~~~va 308 (309) T protein:vir:99 231 NLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRG-GQRVRVGESVKELVTAPDLGFFFENAVA 308 (309) T ss_pred ccccccCCcEEEEEcCCCCCCcccccccceeecccccCCceeeeeeccCC-ceEEEEeccccchhcchhcchhhhhccc Confidence 111111111 122211110 110 0000000 1123333333333333443334444444 No 210 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=47.39 E-value=0.7 Score=21.42 Aligned_cols=256 Identities=11% Similarity=0.039 Sum_probs=117.1 Q ss_pred Cccc--ch-------hHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccc----cCCCCccCCcc Q lcl|Aclame:pro 1 MAFN--NF-------IPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDY----KAAGRQTSADA 67 (273) Q Consensus 1 MA~~--~~-------~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~----~~~~~~~~~~~ 67 (273) ||+- .+ .-|=+.++|...=+....|.+++.. ....++ +..|....+++. ..+|.+..... T Consensus 1 ma~~~~~~~t~~~~g~~~dl~~~I~~isp~dTPf~S~i~~-----~~a~~~--~~~W~~d~l~~~~~~~~~EG~da~~~~ 73 (317) T protein:vir:88 1 MATPTNAVSTVEINGKREDLIDIIYNIAPYDTPFMSAIGK-----GVATAI--THEWQTDELRQPGKNTRVEGEDATIKA 73 (317) T ss_pred CCccccceEeeeeeeeeechhhhheecCCccCcceeeecC-----ceeccc--EEEEEeeecCCccccccccCccccccc Confidence 8872 11 1122333333322333444443321 011222 233433222221 12333322222 Q ss_pred cccceEEEEEEeeeeceeEechHHHHHhHH---H-HHHHHHHHHHHHHHHHHHHHHHHHHhh----cc---cccc----- Q lcl|Aclame:pro 68 ISDTGVDLLIDQEKSIDFLVDDIDRVQVAG---S-LEAYTRAGATALATDTDKFIADMLVDN----GT---ALTG----- 131 (273) Q Consensus 68 ~~~~~~~~tid~~~~~~~~i~d~d~~~~~~---~-~~~~~~~~~~ala~~iD~~~~~~~~~~----~~---~~~~----- 131 (273) .......-...|--...+.|+...++.... + +....+.....|.+.++..++.--++. ++ ...+ T Consensus 74 ~~~r~~~~N~tQIf~k~v~VSgTa~av~~~G~~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~~Gl~~~i 153 (317) T protein:vir:88 74 GSFTTMLNNYCQISDETLQVTGTADRVKKAGRKNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQMANIFAYY 153 (317) T ss_pred ccCCEEeccEEEEEEeEEEEeehhhhhhhcCccchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhhhhHHHHh Confidence 233333333444445667777665544322 2 334455566777888877665432110 00 0000 Q ss_pred --------------------cc-cCCHhHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhh-hhc---c Q lcl|Aclame:pro 132 --------------------SA-PSDADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTS-ADT---S 186 (273) Q Consensus 132 --------------------~~-~~~~~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~-~~~---~ 186 (273) .+ ........+.|.++.+.+=+++... ..++++|.....|-+ +..+ ... . T Consensus 154 ~t~~~~~~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~~Gg~~--~~i~v~a~~k~~i~~---~~~~~~~~i~~~ 228 (317) T protein:vir:88 154 KTNGSLGANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIWRNGGQA--NSIQTSSSIKKAISK---NMKGRATEITLD 228 (317) T ss_pred ccCceeccCccccccCCCccccccccccccHHHHHHHHHHHHhcCCCC--CEEEeChHHHHHHHH---HhcCCceeEEEc Confidence 00 0001134566888888888888633 467899998777732 2211 100 1 Q ss_pred cccceee---eeeeeeecceEEEEecccccCCCcEEEEEeCceEEEEEec-ceeeecc-CCCcceeeEEeeeeeeeEEEc Q lcl|Aclame:pro 187 GDAAGLR---AGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQI-DTVEALR-DQDSFSDRIRALHVYGGKVVR 261 (273) Q Consensus 187 ~~~~~~~---~G~ig~i~G~~i~~s~~l~~~~~~~~~~~~~~a~~~~~~~-~~ve~~~-~~~~~~~~v~~~~~~g~~vl~ 261 (273) ++.+.+. +-.+..+-=+.|+.++.+|.. .++++.++.+.++... ...|..- ..++ +-...-.-||..+.. T Consensus 229 ~~~~~~g~~v~~~~tdfG~v~ii~~r~lp~~---~~~~~D~~~~~l~~Lr~~~~e~laKtGd~--~k~~i~~E~tLe~~N 303 (317) T protein:vir:88 229 ASDNRIAQTVDVYESDFGKYTIRANRWFHEN---TLFVFDPKMHSLCYLRPFFQHELAKTGDS--EKRQLLVEYTFRVNN 303 (317) T ss_pred ccCeEEEEEEEEEEeCCeEEEEEeCCCCCCC---eEEEEcccccceeecccceeeccCCCccc--ceeEEEEEEEEEEcC Confidence 1111111 011112222577778777743 4778888877655422 2222221 1222 223333348889999 Q ss_pred CceEEEEecCCC Q lcl|Aclame:pro 262 PTGVVVFNKTGS 273 (273) Q Consensus 262 p~~~v~~~~~~s 273 (273) |.+.++|.--.+ T Consensus 304 ~~a~a~i~~l~~ 315 (317) T protein:vir:88 304 EKSGALIRDVVA 315 (317) T ss_pred ccceeEEEEecc Confidence 998888875555 No 211 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=40.38 E-value=0.97 Score=20.65 Aligned_cols=255 Identities=10% Similarity=0.027 Sum_probs=116.4 Q ss_pred CcccchhHHHHHHHHHHHHH----HhhccchhhhccccccccCC-cEEEEEeccccccccccCCCCccCCcccccceEEE Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWT----AQTVFANLVNREYEGIASKG-NVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDL 75 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~----~~~v~~~~~~~d~~~~~~~G-dtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 75 (273) ++|.- .|..++.-+...+. .......++-.+.. +.-- +++.++.....+.+...+.+.+....+..-+.... T Consensus 45 ~~~~g-~~~~l~~~i~p~~~~~~~~~~~~~~l~~v~t~--g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~vd~~~~~~~~ 121 (336) T protein:vir:78 45 TGSSG-IPNYLTTYVDPSVIDILVAPMKAAELVGESKK--GDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQR 121 (336) T ss_pred CCCcc-hHHHHHHhcccceeeehhhhhhhhhhcccccC--CCccccEEEEeeeecceeeEEeecccCCCeeecceeeEEE Confidence 22221 23444333322222 11222233322221 1112 47888887766665544444444344555566666 Q ss_pred EEEeeeeceeEechHHHHHhH---HHH-HHHHHHHHHHHHHHHHHHHH---------HHHHhhccc---cccc----ccC Q lcl|Aclame:pro 76 LIDQEKSIDFLVDDIDRVQVA---GSL-EAYTRAGATALATDTDKFIA---------DMLVDNGTA---LTGS----APS 135 (273) Q Consensus 76 tid~~~~~~~~i~d~d~~~~~---~~~-~~~~~~~~~ala~~iD~~~~---------~~~~~~~~~---~~~~----~~~ 135 (273) ++.. ...++.+...|..... .++ ....+.+++++.+++++..+ ..+. .++. .+.+ ... T Consensus 122 ~v~~-~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~~~~~~GllN-~P~l~a~~t~~~~~w~~~ 199 (336) T protein:vir:78 122 QSYF-FQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLIN-DPSLSAPITATTPWSGSP 199 (336) T ss_pred EEEE-EEeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCeEEEEeccccceEEEEe-CCCCCcccccCcCccccc Confidence 6633 3566777766654332 233 23344456667777664221 1111 0111 1111 124 Q ss_pred CHhHHHHHHHHHHHHHhhcC---C-CcCCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEeccc Q lcl|Aclame:pro 136 DADDAFDLIASALKELTKAN---V-PNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNL 211 (273) Q Consensus 136 ~~~~~~~~i~~a~~~l~~~~---v-p~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s~~l 211 (273) +...++++|..+...+..+- + +.....|+++|..+..|..- +.. +.... +-+++ ++=+++|+..+.+ T Consensus 200 T~~~I~~Di~~~~~~l~~qt~g~~~~~~~~tL~Lp~~~~~~L~~~-n~~---g~tv~-~~lk~----n~Pnl~i~t~pel 270 (336) T protein:vir:78 200 AVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKT-NQY---GLSAA-AKLKE----IFPKLEFVTIPEY 270 (336) T ss_pred CHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHhccCC-Ccc---CccHH-HHHHH----hcCccEEEEcccc Confidence 56778999998888775443 1 12234799999999988432 110 00000 01111 1224566666666 Q ss_pred ccCCCcEEEEEeCce-------EEEEEecceeeeccCCCcceeeEEeeee-eeeEEEcCceEEEEecC Q lcl|Aclame:pro 212 RDTDDEQFVAFHPSA-------AAYVSQIDTVEALRDQDSFSDRIRALHV-YGGKVVRPTGVVVFNKT 271 (273) Q Consensus 212 ~~~~~~~~~~~~~~a-------~~~~~~~~~ve~~~~~~~~~~~v~~~~~-~g~~vl~p~~~v~~~~~ 271 (273) -..++.....+.+.. +.+..++......+.. ..+.+....+ .|+-+.+|-+++.+.-= T Consensus 271 ~~Agg~~~~~~~~~~~~~~t~~~~~p~~f~~lpvq~~~--~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:78 271 DTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYS--SYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred cccCcceEEEEEeeccCCcceeeecchhhhccceeecC--ceeEeccccceeeeeeeccchheeeccC Confidence 544444333332221 1222222211122222 2345555444 45677788877765444 No 212 >protein:vir:79399 Length: 455 # NCBI annotation: head protein # Family: family:all:4054 # MgeID: mge:1869 # MgeName: Av-1 # Cross-refs: genbank:acc:YP_001333662;genbank:gi:151266299;genbank:GeneID:5329881 Probab=36.29 E-value=1.2 Score=20.19 Aligned_cols=260 Identities=12% Similarity=0.061 Sum_probs=108.2 Q ss_pred CcccchhHHHHHHHHHHHHHHhhccchhhhcccccccc-----CCcEEEEEeccccccccccC-----CCCccCCccccc Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIAS-----KGNVVHIAGVVAPTVKDYKA-----AGRQTSADAISD 70 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~-----~Gdtv~ip~~~~~~~~d~~~-----~~~~~~~~~~~~ 70 (273) |+++.+..|.+.+-+.. ....+ ...+.+++.-..+. -|+||.=--.....-.-|.. +-.+.....++- T Consensus 45 i~d~~~qnEf~~sLI~R-Igs~L-~~d~S~~NPLa~FK~g~~~fGdtIeei~~d~ak~~~yd~~~~~aev~pFk~e~P~I 122 (455) T protein:vir:79 45 MSDNITRNEFMSALINR-IGSTL-IRDLSWKNPLAVFKQGMMNFGDTIEEVHMDYIKPTIYEEQRDYLERDVFGQAPPPV 122 (455) T ss_pred hhhhhHHHHHHHHHHhc-cccEE-EecccccCchHHhccccchhhhhhhhhhhccccccccCcchhhhhccccccCCCce Confidence 67766555555443322 11111 11111111111222 25554321111111111222 222333344444 Q ss_pred ceEEEEEEeeeeceeEechHHHHH---hHHHHHHHHHHHHHHHH--HHHHHHH-----HHHHHhhcccc--cccccCC-- Q lcl|Aclame:pro 71 TGVDLLIDQEKSIDFLVDDIDRVQ---VAGSLEAYTRAGATALA--TDTDKFI-----ADMLVDNGTAL--TGSAPSD-- 136 (273) Q Consensus 71 ~~~~~tid~~~~~~~~i~d~d~~~---~~~~~~~~~~~~~~ala--~~iD~~~-----~~~~~~~~~~~--~~~~~~~-- 136 (273) ...-.+.+........|++..... +...+++++++...++. ..+|.+. +..+.....-. -...+.+ T Consensus 123 kA~~H~~nR~~~y~~TI~dd~i~~AF~S~~gldefi~~i~~si~sSde~dEY~ylk~Li~~~~~~~~f~~~~I~D~~t~~ 202 (455) T protein:vir:79 123 KSAFHTINRKEKFKITVNRDVLRRAFLSDNGLSEMLSQTMAVAASSDQWSEFLYMTRLFKTYEDSFGFYRMQISDMNTFE 202 (455) T ss_pred eEEEeeccccceeeeeeeHHHHHHhhcChhhHHHHHHHHHHHHhcccchHHHHHHHHHHHHhhhhccceEEEeccccccc Confidence 555555555555556666554332 23446778888777765 3556542 22222221100 0111111 Q ss_pred -HhHHH----HHHHHHHHHH-------hhcCCCc----CCcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeee Q lcl|Aclame:pro 137 -ADDAF----DLIASALKEL-------TKANVPN----VGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNL 200 (273) Q Consensus 137 -~~~~~----~~i~~a~~~l-------~~~~vp~----~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i 200 (273) ....+ ..++.+..+| +-.++|. ++.+++++|++-..| +.+++.++-. -+ .+-.-+.+-.+ T Consensus 203 ~d~~~~~~~iK~lr~aA~kM~lPTR~yN~~gv~~~tdi~DL~lI~~~dtq~ev--dv~~LA~AFN-~d-~vd~~~~~i~V 278 (455) T protein:vir:79 203 PDKNKVDAALKALRVAANKMQYPTPAFNSAGVHSFARPEDLVLITTPEFKANV--DVTSLSAAFN-RS-DAEAPSHIITV 278 (455) T ss_pred cchhHHHHHHHHHHHHHHHhcCCCcccccccCcccccceeeEEEeCCCceeee--cHHHHHHHhC-cc-chhcCceeEEe Confidence 12222 3333333332 2223332 245889999887766 2223322211 11 11112334455 Q ss_pred cceEEEEecccccCCCcEEEEEeCceEEEEEecceeeeccCCCcceeeEEee-------eeee---eEEEcCceEEEEec Q lcl|Aclame:pro 201 LGARIVESNNLRDTDDEQFVAFHPSAAAYVSQIDTVEALRDQDSFSDRIRAL-------HVYG---GKVVRPTGVVVFNK 270 (273) Q Consensus 201 ~G~~i~~s~~l~~~~~~~~~~~~~~a~~~~~~~~~ve~~~~~~~~~~~v~~~-------~~~g---~~vl~p~~~v~~~~ 270 (273) .||.. ..++..+++..+..+..-.+..+.|..|++..-.|-|..- ..|- +++-.|..++|--. T Consensus 279 d~f~f-------a~~~~~a~~~sk~~~~i~D~l~~~~si~np~~l~~Ny~~H~w~ils~S~F~~a~af~~~~~~~~vtp~ 351 (455) T protein:vir:79 279 PGETL-------GMDDTSAILTSKQFFVIKDILLENRTISNPEGLYDNYWLHHWSILSASPFTPAIAFGTKPNTIVVTPK 351 (455) T ss_pred ccccc-------ccCCceEEEeehhhhhhhhhhhhcccccCcccceeehhhhhhhhhhhccccceeeeecCCceEEEccc Confidence 56521 2233335666666666555666778888777544333221 2221 23334444444444 Q ss_pred CCC Q lcl|Aclame:pro 271 TGS 273 (273) Q Consensus 271 ~~s 273 (273) .+| T Consensus 352 ~~~ 354 (455) T protein:vir:79 352 AET 354 (455) T ss_pred ccc Confidence 444 No 213 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=35.11 E-value=1.3 Score=20.05 Aligned_cols=256 Identities=9% Similarity=0.012 Sum_probs=112.1 Q ss_pred CcccchhHHHHH---HHHHHHHHHhhccchhhhccccccccCC-cEEEEEeccccccccccCCCCccCCcccccceEEEE Q lcl|Aclame:pro 1 MAFNNFIPELWS---DMLLEEWTAQTVFANLVNREYEGIASKG-NVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLL 76 (273) Q Consensus 1 MA~~~~~pev~~---~~v~~~l~~~~v~~~~~~~d~~~~~~~G-dtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~t 76 (273) ++|.-+ |..+. ..+.+.+..-.+...++-.+.. +.-. +++.++.....+-+...+...+....+...+....+ T Consensus 74 ~~~~g~-~~~l~~~~p~~i~~~tap~~a~~l~pv~t~--g~W~~~~~~~~v~e~~G~A~~ygd~~d~pl~d~~~~~~~r~ 150 (379) T protein:vir:10 74 VSIPGL-IQFLQNWLPGHVRILTAVREADEFLGLSTV--GQWDDEQIVQRVLEGLGTAQPYTDGGNMALMSWTPTFETRT 150 (379) T ss_pred ccccch-HHHHHhhcchHHHHHhhhhhhhhhcccccC--CCceeeeEEEeeeeeeeeeEEeccccCCCeeeeeeeeeeee Confidence 333221 33332 2333333222222333322221 1111 577788876665544333333332233333443444 Q ss_pred EEeeeeceeEechHHHHHhH---HHHH-HHHHHHHHHHHHHHHHHHHHHHHh----------hccc------ccc----- Q lcl|Aclame:pro 77 IDQEKSIDFLVDDIDRVQVA---GSLE-AYTRAGATALATDTDKFIADMLVD----------NGTA------LTG----- 131 (273) Q Consensus 77 id~~~~~~~~i~d~d~~~~~---~~~~-~~~~~~~~ala~~iD~~~~~~~~~----------~~~~------~~~----- 131 (273) + .....++.+.+.|+.... .++. ...+.+++++.+++|+-.|-=..+ .++. .++ T Consensus 151 v-~~~~~g~~yg~~El~~Aa~~g~~l~~~Ka~aA~~ale~~~N~i~f~G~~d~~~~~yGllNdP~l~a~~t~atg~~~~t 229 (379) T protein:vir:10 151 V-VRFEAGLQVAPLEEARSSRVQVSSADEKRAMVGEALEVQRNRVAFYGYNDGSGRTFGFLNDPNLPAYVAVPNGAGGSP 229 (379) T ss_pred e-EEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCCCcceEEEEeCCCCcccccccCCccccc Confidence 4 223466777766654332 2332 334445666666666532211000 0000 000 Q ss_pred -cccCCHhHHHHHHHHHHHHHhhc--CC--CcCC-cEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEE Q lcl|Aclame:pro 132 -SAPSDADDAFDLIASALKELTKA--NV--PNVG-RVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARI 205 (273) Q Consensus 132 -~~~~~~~~~~~~i~~a~~~l~~~--~v--p~~~-r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i 205 (273) =+..|...++++|..+...+..+ ++ |.+- ..|+++|..+..|..- +. .+ -. +.+-.-.++=+++| T Consensus 230 ~Wa~kT~~eI~~Di~~~~~~l~~qs~g~~~~~~~~~tL~LP~~~~~~L~~~-n~------~g-~T-vl~~lk~n~Pnl~i 300 (379) T protein:vir:10 230 LWAQKTTLEIIADLRNGLTALQVQSMGRIKSNKTPITIGIPNAYENYITTP-TE------LG-YS-VAQYMRESYPNVTF 300 (379) T ss_pred ccccCCHHHHHHHHHHHHHHHHHhhCCeecccccceeEEecHHHHHhhccc-cc------cC-cc-HHHHHHHhcCCcEE Confidence 11236677888888887775433 22 3322 3689999999988532 11 00 00 00000012335677 Q ss_pred EEecccccCCC--cEEEEEeCceEE------------EEEecceeeeccCCCcceeeEEeeee-eeeEEEcCceEEEEec Q lcl|Aclame:pro 206 VESNNLRDTDD--EQFVAFHPSAAA------------YVSQIDTVEALRDQDSFSDRIRALHV-YGGKVVRPTGVVVFNK 270 (273) Q Consensus 206 ~~s~~l~~~~~--~~~~~~~~~a~~------------~~~~~~~ve~~~~~~~~~~~v~~~~~-~g~~vl~p~~~v~~~~ 270 (273) +..+.+...++ ..++++-++..+ +..++.....++.. ..+.+....+ .|+-+.+|-+++-+.- T Consensus 301 ~t~pEL~~aggg~~~~~~~~~~~~~~~t~~~~~~~~~~p~k~~~l~ve~~~--~~~~~~~~~rt~Gv~ir~P~Ai~~~~G 378 (379) T protein:vir:10 301 VSAPELNDANGGSSAIYYYADAVENNGTDDGRTWLQVVPTKMFTLGVEKKI--KGYAEGYTNATAGAMLKRPFATYRQTG 378 (379) T ss_pred EEcccccccCCCccEEEEEeeccCCCccCCcceEEEecchhhhhccceecC--ceeEeccccceeeeeeecchhhheecC Confidence 77776644332 233444333221 11111111111111 2334444444 4577789988887665 Q ss_pred C Q lcl|Aclame:pro 271 T 271 (273) Q Consensus 271 ~ 271 (273) + T Consensus 379 ~ 379 (379) T protein:vir:10 379 A 379 (379) T ss_pred C Confidence 5 No 214 >protein:vir:10324 Length: 320 # NCBI annotation: ORF26 # Family: family:all:570 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758919;genbank:gi:27311193;genbank:GeneID:956155 Probab=28.84 E-value=1.7 Score=19.30 Aligned_cols=253 Identities=11% Similarity=0.033 Sum_probs=94.3 Q ss_pred ccchhHHHHHHHHHHHHHHhhccchhhhccccccccCCcEEEEEeccccccccccCCCCccC-CcccccceEEEEEEeee Q lcl|Aclame:pro 3 FNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTS-ADAISDTGVDLLIDQEK 81 (273) Q Consensus 3 ~~~~~pev~~~~v~~~l~~~~v~~~~~~~d~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~-~~~~~~~~~~~tid~~~ 81 (273) .| +.|..|.. ++..|.... +...+....+...|.---+|...+ |.+.. ...-.-...++++-+.+ T Consensus 1 i~-~~P~~~g~-~~glff~~~---~v~T~~V~ie~~~~~l~lip~v~r---------g~~g~~~~~~~~~~~~f~~p~~~ 66 (320) T protein:vir:10 1 MN-LLPVNYGD-SRALFAREK---KVRTRTILVEEKNGVLTLIQSREP---------GSTENVAKRGKRKVRSFVIPHLP 66 (320) T ss_pred CC-cCCchhhh-hhhhccCCC---CcccceEEEEEecCceeeeeccCC---------CCCceeecCCcceEEEEecceec Confidence 43 35777764 333332111 111222222233343333333322 22211 11112233333432222 Q ss_pred eceeEechHHHHHhH--H-----HHH----HHHHHHHHHHHHHHHHHHHHHHHhh-----cc---------cccc----- Q lcl|Aclame:pro 82 SIDFLVDDIDRVQVA--G-----SLE----AYTRAGATALATDTDKFIADMLVDN-----GT---------ALTG----- 131 (273) Q Consensus 82 ~~~~~i~d~d~~~~~--~-----~~~----~~~~~~~~ala~~iD~~~~~~~~~~-----~~---------~~~~----- 131 (273) . ...|+-.|..... + .++ +.+..+.+.+....+-.++..+... ++ .... T Consensus 67 ~-~d~i~a~eiq~~Ra~G~~~~~~~~~~v~~~l~~lr~~~~~T~E~m~~~AL~G~ildadGtv~~d~y~~fGi~~~~i~~ 145 (320) T protein:vir:10 67 L-EDVILPDEYEGLRGFGTTALAAKSELVKERXETMKSSHDITHEHLRMGAKKGQILDADGTVLYDLYAEFGITKKTIYF 145 (320) T ss_pred c-CCccCHHHHcCcccCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCeEEcCCCcEEEechhhhCCccceeEE Confidence 1 2233333322111 1 011 1111222222222222233333210 00 0000 Q ss_pred cccCCHhHHHHHH----HHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhccccc--ceeeeeee--eeecce Q lcl|Aclame:pro 132 SAPSDADDAFDLI----ASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDA--AGLRAGTI--GNLLGA 203 (273) Q Consensus 132 ~~~~~~~~~~~~i----~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~--~~~~~G~i--g~i~G~ 203 (273) .-.....++...+ ..+...|. +.+..+-+++++|+++..|.+.+. +......... +.++.... -.+.|+ T Consensus 146 ~l~~a~~dv~~~~~~~~~~i~~~l~--g~~~t~v~al~g~~f~~al~~h~~-Vke~y~~~~~~~~~l~~~~~~~f~~gGi 222 (320) T protein:vir:10 146 GLDNKDANVAESCRQVLRHVEDNLR--GDVMKDVSVDVSEEFFDKFIKHAS-VKEVFLNHEAAVNRLGGDTRKGFKFGGL 222 (320) T ss_pred ecCCCCccHHHHHHHHHHHHHHHhc--cCCCCceEEEEChHHHHHHhcCHH-HHHHHHhhhhhhhhccccccceEEecCE Confidence 0000111222233 33333343 344556578999999999998764 3332221111 12221111 157888 Q ss_pred EEEEecc------------cccCCCcEEEEEeCceE----EEEEeccee---------eeccCCCcceeeEEeeeeeeeE Q lcl|Aclame:pro 204 RIVESNN------------LRDTDDEQFVAFHPSAA----AYVSQIDTV---------EALRDQDSFSDRIRALHVYGGK 258 (273) Q Consensus 204 ~i~~s~~------------l~~~~~~~~~~~~~~a~----~~~~~~~~v---------e~~~~~~~~~~~v~~~~~~g~~ 258 (273) .|.+.+. +|...+..+-.+.++.+ +-+.....+ ..+..+...+..+...+.-=.- T Consensus 223 ~~~~Y~g~~~d~~g~~~~~I~~~~~~~~p~g~~~~f~~~~apad~~e~vnt~g~p~y~k~~~~~~~~g~~l~~qS~PLpi 302 (320) T protein:vir:10 223 IFNENRARHVDEEGKETRFIKAGKGHAFPTGTTNTFFTALAPADFNETAGTLGKRYYAKMEPRRMGRGFDLHSQSNVLPM 302 (320) T ss_pred EEEEcccEEEcCCCCeeEeecCCeeEEEEecCchhheeeecccCcHhhcCCcccccccccccccCCCeEEEEeeeccccc Confidence 8877432 22222111112222221 111111100 1122233334444444444355 Q ss_pred EEcCceEEEEecCCC Q lcl|Aclame:pro 259 VVRPTGVVVFNKTGS 273 (273) Q Consensus 259 vl~p~~~v~~~~~~s 273 (273) ..||+.++.+++++. T Consensus 303 ~~rP~~lv~~~~~a~ 317 (320) T protein:vir:10 303 CCRPGVLVELDAAAQ 317 (320) T ss_pred ccCcceEEEEEecCC Confidence 679999999999988 No 215 >protein:vir:2736 Length: 348 # NCBI annotation: putative structural protein # Family: family:all:1083 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695109;genbank:gi:23455878;genbank:GeneID:955608 Probab=27.27 E-value=1.9 Score=19.11 Aligned_cols=260 Identities=12% Similarity=0.069 Sum_probs=93.3 Q ss_pred Ccc--cchhHHHHHHHHHHHHHHh------hccchhhhccccc---cccCCcEEEEEeccccccccccCCCCccCCcc-c Q lcl|Aclame:pro 1 MAF--NNFIPELWSDMLLEEWTAQ------TVFANLVNREYEG---IASKGNVVHIAGVVAPTVKDYKAAGRQTSADA-I 68 (273) Q Consensus 1 MA~--~~~~pev~~~~v~~~l~~~------~v~~~~~~~d~~~---~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~-~ 68 (273) ||+ ++|.+..+...+.+.-+.. ..|+..-..+.+. .+..|..+ ++.+...+.+..... - T Consensus 1 M~~i~d~f~~~~l~~~v~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~~~---------~a~~v~~~~~~~~~~r~ 71 (348) T protein:vir:27 1 MGLIYDKVTASNIAGYFNALQENVSSTLGESIFPARKQLGTKLSYIKGASGQSV---------ALKAAAFDTNVTIRDRV 71 (348) T ss_pred CcchhhhcCHHHHHHHHHhccchhhhhhHhhcCCCccccceeEEEEeeccCcee---------EeeeecCCCCcceeccc Confidence 995 5777777776554321111 1122111111111 11111111 123333222221111 1 Q ss_pred ccceEEEEEEeeeeceeEechHHHHHhH--HH-H-HHHHHHHHHHHH-----------HHHHHHHHHHHHhhccc----- Q lcl|Aclame:pro 69 SDTGVDLLIDQEKSIDFLVDDIDRVQVA--GS-L-EAYTRAGATALA-----------TDTDKFIADMLVDNGTA----- 128 (273) Q Consensus 69 ~~~~~~~tid~~~~~~~~i~d~d~~~~~--~~-~-~~~~~~~~~ala-----------~~iD~~~~~~~~~~~~~----- 128 (273) ..+.....+-.. .....|+..|..... .. . .+..++....++ ..++--+...+...... T Consensus 72 ~~~~~~~~~p~i-~~~~~i~~~d~~~~~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~~al~~Gki~i~~~~ 150 (348) T protein:vir:27 72 SAEMHDEQMPFF-KEAMLVKENDRQQLNLVKDSGNAVLVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFTSDG 150 (348) T ss_pred ceeeeeeecCcc-ccccccCHHHHHHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEEecCC Confidence 112222333111 122345544433211 10 0 111112222222 22222222232211100 Q ss_pred --------------ccccc--cCCHhHHHHHHHHHHHHHhhcCCCcCCcEEEECHHHHHHHhcchHHhhhhhccccc-ce Q lcl|Aclame:pro 129 --------------LTGSA--PSDADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDA-AG 191 (273) Q Consensus 129 --------------~~~~~--~~~~~~~~~~i~~a~~~l~~~~vp~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~-~~ 191 (273) .+.+. .....+.+++|.++...+++.+. ...+++++++++..|++++......+..... .. T Consensus 151 ~~~~vdfg~~~~~~~t~~~~W~~~~adp~~di~~~~~~~~~~G~--~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~~~ 228 (348) T protein:vir:27 151 VNKDIDYGVKPDHKKQVSKSWAEPGATPLADLEDAIETARELGL--NPERAVMNAKTFGLIRKAASTVKVIKPLAGDGSA 228 (348) T ss_pred eeEEEeecCCcccceeeeeccCCCCCCHHHHHHHHHHHHHhcCC--cccEEEECHHHHHHHhcCHHHHHHhcccCccccc Confidence 00000 01123467788888888877665 3347899999999999887544333222111 11 Q ss_pred eeee----eeeeecceEEEEecccc-cCCC--------cEEEEEeCceEEEEEecceeeec------------------- Q lcl|Aclame:pro 192 LRAG----TIGNLLGARIVESNNLR-DTDD--------EQFVAFHPSAAAYVSQIDTVEAL------------------- 239 (273) Q Consensus 192 ~~~G----~ig~i~G~~i~~s~~l~-~~~~--------~~~~~~~~~a~~~~~~~~~ve~~------------------- 239 (273) +... .++.+.|++|+..+.-. ..++ ..++++..+..|...-....|.. T Consensus 229 i~~~~~~~~~~~~~g~~i~~yd~~y~d~~G~~~~~~p~~~vvl~~~~~~G~~~yG~~~e~~~~~~~~~~~~~~~~~~~~~ 308 (348) T protein:vir:27 229 VTKAELENYIADNFGVSIVLENGTYRNDKGEVSKFYPDGHLTLIPNGPLGNTVFGTTPEESDLFADNTVNAEVEIVDNGI 308 (348) T ss_pred cCHHHHHHHHHhhcCceEEEEeeEEEcCCCcCcccccCCeEEEEcCCcceeEEeccCcchhhhhhccccccceeeeCCee Confidence 1111 13456788887644322 1111 22334444433422111111100 Q ss_pred -----cCCCcceeeEEeeeeeeeEEEcCceEEEEe--cCC Q lcl|Aclame:pro 240 -----RDQDSFSDRIRALHVYGGKVVRPTGVVVFN--KTG 272 (273) Q Consensus 240 -----~~~~~~~~~v~~~~~~g~~vl~p~~~v~~~--~~~ 272 (273) ...+-.+..+.+...-=--+.+|+++.+++ ++. T Consensus 309 ~~~~~~~~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl~~~ 348 (348) T protein:vir:27 309 AVTTTKTTDPVNVQTKVSMVALPSFERLDDVYMLTVIPAV 348 (348) T ss_pred EEEeeecCCCceEEEEEeeeeeccccCCCcEEEEEEecCC Confidence 000000111111111111223455444331 222 No 216 >protein:vir:96079 Length: 382 # NCBI annotation: hypothetical protein ORF023 # Family: family:all:1653 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294440;genbank:gi:149408337;genbank:GeneID:5237198 Probab=20.07 E-value=2.8 Score=18.10 Aligned_cols=256 Identities=13% Similarity=0.068 Sum_probs=109.7 Q ss_pred CcccchhHHHHHHHHHHHHHH----hhccchhhhccccccccCC-cEEEEEeccccccccccCCCCccCCcccccceEEE Q lcl|Aclame:pro 1 MAFNNFIPELWSDMLLEEWTA----QTVFANLVNREYEGIASKG-NVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDL 75 (273) Q Consensus 1 MA~~~~~pev~~~~v~~~l~~----~~v~~~~~~~d~~~~~~~G-dtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 75 (273) ++|.- +|-.+..-++..+.+ ......++-.+.. +.-. +++.++.....+.+...+.+.+....+..-+.... T Consensus 73 ~~~~g-~p~~~l~~~~p~~~~~~~~p~~~~~l~pv~t~--g~W~~~t~ty~~~e~~G~A~~ygd~~D~Pl~d~~~~~~~r 149 (382) T protein:vir:96 73 TPSIP-TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTV--GSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTSWNANFERR 149 (382) T ss_pred cCCcc-HHHHHHhhhhhhhhhhhhhhhhhhhhcccccc--CCccceEEEEeeeecccceEEeecccCCCccccccceeEE Confidence 44432 255554444444433 3333344433322 2112 57888887666554443433333233334344444 Q ss_pred EEEeeeeceeEechHHHHHhH---HHH-HHHHHHHHHHHHHHHHHHHH-H--HHHh--------hccc---c----cccc Q lcl|Aclame:pro 76 LIDQEKSIDFLVDDIDRVQVA---GSL-EAYTRAGATALATDTDKFIA-D--MLVD--------NGTA---L----TGSA 133 (273) Q Consensus 76 tid~~~~~~~~i~d~d~~~~~---~~~-~~~~~~~~~ala~~iD~~~~-~--~~~~--------~~~~---~----~~~~ 133 (273) ++ .....++.+.+.|+.... .++ ......+++++.+.+|+-.| . .... .++. . ..-+ T Consensus 150 ~v-~~~~~g~~yg~lE~~rAa~~~~~l~~~Ka~aA~~ale~~~N~i~f~G~~~g~~~~~yGllNdP~l~a~~t~a~~~Wa 228 (382) T protein:vir:96 150 TI-VRGELGLLVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPPFQTPPSQGWA 228 (382) T ss_pred EE-EEEEEeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHHhhceEEEEeeecCcCcceEEEEeCCCcccccccCCCCcc Confidence 44 234567787776665432 233 33344456667777765333 1 0000 0110 0 0113 Q ss_pred cCCHhHHHHHHHHHHHHHhhcCC----CcC-CcEEEECHHHHHHHhcchHHhhhhhcccccceeeeeeeeeecceEEEEe Q lcl|Aclame:pro 134 PSDADDAFDLIASALKELTKANV----PNV-GRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVES 208 (273) Q Consensus 134 ~~~~~~~~~~i~~a~~~l~~~~v----p~~-~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~~~~G~ig~i~G~~i~~s 208 (273) ..|...++++|..+...+..+-- |.. ...|+++|..+..|-.. + ..+ -.+ .+-.--++=+++|+.. T Consensus 229 ~kT~~eI~~Di~~l~~~i~~qt~G~~~~~~~~~~L~LP~~~~~~Ls~~-n------~~g-~Tv-l~~lk~n~Pnl~i~t~ 299 (382) T protein:vir:96 229 TADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITMALATSKVDYLSVT-T------PYG-ISV-SDWIEQTYPKMRIVSA 299 (382) T ss_pred cccHHHHHHHHHHHHHHHHhccCCeeeecccceEEeechHHHhhcccc-C------ccC-ccH-HHHHHHhcCCcEEEEc Confidence 34667789999988888855442 222 23688999998887422 1 000 000 0000001223455555 Q ss_pred cccccCC------CcEEEEEeCceE---------EEE-Eeccee-----eeccCCCcceeeEEeee-eeeeEEEcCceEE Q lcl|Aclame:pro 209 NNLRDTD------DEQFVAFHPSAA---------AYV-SQIDTV-----EALRDQDSFSDRIRALH-VYGGKVVRPTGVV 266 (273) Q Consensus 209 ~~l~~~~------~~~~~~~~~~a~---------~~~-~~~~~v-----e~~~~~~~~~~~v~~~~-~~g~~vl~p~~~v 266 (273) +.+.... ....+.+.+..- ..+ .|.... ..++... .+.+.... ..|+-+.+|.+++ T Consensus 300 peL~~a~~~g~g~~~~~~~~~~e~~~~~~~s~~~p~~f~q~~p~~~~~l~ve~~~~--~~~~~~s~~t~Gv~i~~P~ai~ 377 (382) T protein:vir:96 300 PELSGVQMQGKTPEDALVLFVEEVDASVDGSTDGGSVFSQLVQSKFITLGVEKRAK--SYVEDFSNGTAGALCKRPWAVV 377 (382) T ss_pred cccccccCCCccceeEEEEecchhhhhcccccccCcceeccccceeeeccceeecc--eeEeccccceeeeEEEcchhhh Confidence 4442110 011111111100 000 000000 0001111 12222222 2567788888777 Q ss_pred EEecC Q lcl|Aclame:pro 267 VFNKT 271 (273) Q Consensus 267 ~~~~~ 271 (273) .+.-= T Consensus 378 ~~~GI 382 (382) T protein:vir:96 378 RYLGI 382 (382) T ss_pred hccCC Confidence 55433 Done!